# Copy Number Variant Calling

The DRAGEN Copy Number Variant (CNV) Pipeline detects copy number aberrations and regions with loss of heterozygosity (LOH) from next-generation sequencing (NGS) data. It supports germline and somatic workflows for both whole-genome sequencing (WGS) and whole-exome sequencing (WES) in a single interface via the DRAGEN Host Software.

## Choose Your Workflow

Use the table below to identify the right pipeline for your use case and jump directly to its documentation.

| Sample type | Data type | Input samples  | Documentation                                                                                                                     |
| ----------- | --------- | -------------- | --------------------------------------------------------------------------------------------------------------------------------- |
| Germline    | WGS / WES | Single / Multi | [Germline CNV Calling](https://help.dragen.illumina.com/product-guides/dragen-v4.5/dragen-dna-pipeline/cnv-overview/cnv-germline) |
| Somatic     | WGS / WES | Single         | [Somatic CNV Calling](https://help.dragen.illumina.com/product-guides/dragen-v4.5/dragen-dna-pipeline/cnv-overview/cnv-somatic)   |

Example commandlines are provided under [DRAGEN Recipes](https://github.com/illumina-swi/dragen-docs/blob/release/4.5-prod/product-guides/dragen-v4.5/user-guide/dragen-recipes/README.md).

Visit [Reference](https://help.dragen.illumina.com/product-guides/dragen-v4.5/dragen-dna-pipeline/cnv-overview/cnv-reference) for more details on the CNV component.

## Before You Begin

Before running the CNV pipeline, ensure the following prerequisites are in place:

1. **CNV-enabled reference hashtable** — The hashtable must be built with `--ht-build-cnv-hashtable true`. This generates an additional k-mer uniqueness map used to correct mappability biases.
2. **Aligned BAM or CRAM input** — The pipeline accepts pre-aligned reads. If you are starting from FASTQ, first run map/align or use streaming alignments.
3. **Panel of normals (required for WES)** — WES normalization requires a panel of normals. WGS can use self-normalization instead.

## Pipeline Overview

The DRAGEN CNV Pipeline processes the input signal through the following stages:

**1. Target Counts** — Read counts and other signals are extracted from alignments and binned into target intervals.

![](https://25033470-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FG9szlFZupV6Q2DasL98y%2Fuploads%2Fgit-blob-b9f011edb411777e8603fb4dbccfdedc4ca191b9%2Fcnv-calling.SignalTargetCounts.png?alt=media)

**2. Normalization** — The case sample is normalized against a panel of normals or against the estimated normal ploidy. Systematic biases (e.g., GC bias) are corrected to amplify event-level signals.

![](https://25033470-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FG9szlFZupV6Q2DasL98y%2Fuploads%2Fgit-blob-1ed9998ea6b8062f6cdafa88458df5c92e6f3f5c%2Fcnv-calling.SignalNormalized.png?alt=media)

**3. Segmentation** — The normalized signal is segmented using one of the available segmentation algorithms.

![](https://25033470-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FG9szlFZupV6Q2DasL98y%2Fuploads%2Fgit-blob-f303c4ee46d0b8ca21974200f6093f32f00416b7%2Fcnv-calling.SignalSegments.png?alt=media)

**4. CNV Calling** — Events are called from the segments, scored, and emitted in the output VCF.

![](https://25033470-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FG9szlFZupV6Q2DasL98y%2Fuploads%2Fgit-blob-c7fe7ecefe956cb573102a5f97104328284c4ec9%2Fcnv-calling.SignalCalledEvents.png?alt=media)


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://help.dragen.illumina.com/product-guides/dragen-v4.5/dragen-dna-pipeline/cnv-overview.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
