DNA Amplicon

A DRAGEN recipe, like this one, is a predefined set of analysis parameters and workflow settings tailored to a specific type of genomic analysis. For clarity, some default parameters are explicitly included and annotated with comments. This recipe includes the recommended commands for DNA amplicon samples.

DRAGEN Standard DNA Amplicon settings

/opt/dragen/$VERSION/bin/dragen         #DRAGEN install path 
--ref-dir $REF_DIR                      #path to DRAGEN linear hashtable 
--output-directory $OUTPUT 
--intermediate-results-dir $PATH        #e.g. SSD /staging 
--output-file-prefix $PREFIX 
# Inputs 
--tumor-fastq-list $PATH                #see 'Input Options' for FQ, BAM or CRAM 
--tumor-fastq-list-sample-id $STRING 
--fastq-list $PATH                      #see 'Input Options' for FQ, BAM or CRAM 
--fastq-list-sample-id $STRING 
# DNA amplicon 
--enable-dna-amplicon true 
--amplicon-target-bed $PATH 
# Mapper 
--enable-map-align true                 #optional with BAM/CRAM input 
--enable-map-align-output true          #optionally save the output BAM 
# Small variant caller 
--enable-variant-caller true 
--vc-target-bed $VC_TARGET_BED 
--vc-systematic-noise $PATH             #optional for SNV systematic noise
--vc-target-vaf 0.03                    #Default = 0.03 (>= 3% VAF) except for ctDNA (which is 0.1%)
# SV 
--enable-sv true 
--sv-systematic-noise $PATH             #optional for SV systematic noise
# CNV 
--enable-cnv true 
--cnv-combined-counts $PATH             #CNV PON 
# Annotation 
--enable-variant-annotation true 
--variant-annotation-data PATH 
# Microsatellite Instability (MSI)      #optional for panels that support MSI
--amplicon-enable-msi=true
--msi-microsatellites-file $PATH        #MSI site file
--msi-ref-normal-input $PATH            #MSI PON file

DRAGEN Amplicon Pillar Panel Specific Settings

To support the varied designs of amplicon panels and the specific requirements of different analysis types (e.g., SNV, CNV, SV, MSI, RNA fusion, RNA splice variants, and RNA 3'/5' imbalance ratio), panel-specific parameter settings have been integrated into the command-line options. Each supported Pillar panel has a dedicated option, and the details for these DNA panels are listed in the table below:

Panel Name

Short Name

Panel Code

Sample Type

Default variant caller enabled

Command Line Options

oncoReveal BRCA1 & BRCA2 plus CNV

BRCA CNV

BR283

DNA

SNV, CNV

--amplicon-enable-dna-brca

oncoReveal Lymphoid

Lymphoid

P-LYM-01

DNA

SNV, SV

--amplicon-enable-dna-lymphoid

oncoReveal Core LBx

Core LBx

P-LBX-01

cfDNA

SNV, CNV, MSI

--amplicon-enable-cfdna-core

oncoReveal Essential LBx

Essential LBx

P-LBX-04

cfDNA

SNV, CNV, MSI

--amplicon-enable-cfdna-essential

oncoReveal Essential MPN

MPN

MY7

DNA

SNV

--amplicon-enable-dna-mpn

oncoReveal Multi-Cancer v4 with CNV

Multi-Cancer with CNV

HS341

DNA

SNV, CNV

--amplicon-enable-dna-multicancer

oncoReveal Myeloid

Myeloid

MY766

DNA

SNV, SV

--amplicon-enable-dna-myeloid

oncoReveal Nexus 21 Gene

Nexus

P-CMC-01

DNA

SNV, SV

--amplicon-enable-dna-nexus

oncoReveal Solid Tumor v2

Solid Tumor v2

P-ST-02

DNA

SNV

--amplicon-enable-dna-solidtumor

For more detail on the amplicon pipeline, please refer to DRAGEN Amplicon Pipeline

Notes and additional options

Hashtable

For DRAGEN amplicon runs it is recommended to use the linear hashtable.

See: Product Files

Input options

DRAGEN input sources include: fastq list, fastq, bam, or cram. For BCL input, first create FASTQs using BCL conversion.

FQ list Input

--tumor-fastq-list $PATH 
--tumor-fastq-list-sample-id $STRING 
--fastq-list $PATH 
--fastq-list-sample-id $STRING 

FQ Input

--tumor-fastq1 $PATH 
--tumor-fastq2 $PATH 
--RGSM-tumor $STRING 
--RGID-tumor $STRING 
--fastq-file1 $PATH 
--fastq-file2 $PATH 
--RGSM $STRING 
--RGID $STRING 

BAM Input

--tumor-bam-input $PATH 
--bam-input $PATH 

CRAM Input

--tumor-cram-input $PATH 
--cram-input $PATH 

Mapping and Aligning

Option
Description

--enable-map-align true

Enables map-align.

--enable-map-align-output true

Optionally save the output BAM (Default=false).

Amplicon post-alignment processing

Option
Description

--amplicon-primer-length INT

If an alignment starts inside the primer region of the amplicon target, the alignment is assigned to the amplicon.

--amplicon-allow-partial-target true

In order to detect deletion events that are close to the target boundaries, we now require only one of the reads to start in the primer region (Default=true)

For more detail on the amplicon post-alignment processing, please refer to DRAGEN Amplicon Pipeline

Duplicate Marking

Option
Description

--enable-duplicate-marking false

The Amplicon Pipeline disables duplicate marking. In amplicon assays, fragments originate from a limited number of unique start and end positions, making conventional duplicate detection inappropriate. (Default=false)

SNV

Option
Description

--vc-target-bed

Limit variant calling to region of interest. Default is amplicon target bed.

--vc-combine-phased-variants-distance INT

Maximum distance in base pairs (BP) over which phased variants will be combined. Set to 0 to disable. Valid range is [0; 15] BP (Default=2).

--vc-systematic-noise $PATH

Systematic noise file. This filter is recommended for removing systematic noise observed in normal samples (i.e. systematic alignment errors, sequencing errors, etc.).

--vc-target-vaf FLOAT

The default is 0.03 (3%). For ctDNA, the default is 0.001 (0.1%).

--vc-af-call-threshold FLOAT

If the AF filter is enabled using --vc-enable-af-filter=true, the option sets the allele frequency call threshold for nuclear chromosomes to emit a call in the VCF. The default value is 0.01. For ctDNA, the default is 0.001.

--vc-af-filter-threshold FLOAT

If the AF filter is enabled using --vc-enable-af-filter=true, the option sets the allele frequency filter threshold for nuclear chromosomes to mark emitted VCF calls as filtered. The default value is 0.05. For ctDNA, the default is 0.003.

For more detail on the small variant caller in somatic mode please refer to Somatic Mode

CNV

Option
Description

--cnv-segmentation-mode $SEG_MODE

Option to override the default segmentation algorithm. By default, bed is used for standard panels and hslm for Pillar panels with a pre-built PON.

--amplicon-cnv-use-default-pon false

We recommend including in-run normal samples—matched in sample type and library preparation—in the same sequencing run to serve as the PON. If generating a custom PON is not feasible, for Pillar panels, the pre-packaged panel-specific PON can be used as a fallback. To enable this, set the option to true

--cnv-segmentation-bed $PATH

You can bypass segmentation by specifying a cnv-segmentation-bed and using cnv-segmentation-mode=bed. If bed segmentation mode is used, the segmentation bed is auto-generated from amplicon target bed by default

For more information, see CNV Calling.

Annotation

For instructions on how to download the Nirvana annotation database, please refer to Nirvana

MSI

Option
Description and recommended setting

--msi-coverage-threshold INT

Minimum coverage for a microsatellite: 500 (default)

--msi-distance-threshold FLOAT

Minimum Jensen-Shannon distance between tumor and PON for a microsatellite: 0.3 (default)

SV

Option
Description

--sv-call-regions-bed

Specifies a BED file containing the set of regions to call. Default as amplicon target bed.

--enable-variant-deduplication true

Relevant when both SV and SNV callers are enabled in amplicon workflows. Can increase sensitivity and prevent the occurrence of replicated variants within genes such as FLT3 and KMT2A. Filter all small indels in the structural variant VCF that appear and are passing in the small variant VCF. DRAGEN will create a new VCF that contains variants in SV VCF that are not matching a variant from SNV VCF file. The new deduplicated SV VCF file will have the same prefix passed by --output-file-prefix followed by sv.small_indel_dedup. DRAGEN normalizes variants by trimming and left shifting by up to 500 bases.

--sv-systematic-noise $BEDPE

Optional systematic noise BEDPE file containing the set of noisy paired regions (optionally gzip or bzip compressed).

For more information, see Structural Variant Calling.

SNV Systematic Noise

Systematic noise files are considered essential in Tumor-Only workflows.

DRAGEN has pre-built systematic noise files for Pillar panels. To achieve high sensitivity, we recommend generating a custom systematic noise file as described in the Custom section

Custom

This section describes how to generate systematic noise files from phenotypically normal (non-tumor) samples to optimize the performance of a specific assay. For best accuracy, the normal samples should ideally closely match the sequencer, sample type, library prep, and coverage of the tumor samples of interest. It is typically recommended to use 30-50 normals when building a noise file, but fewer can be used.

Step 1. Run DRAGEN somatic tumor-only on each of approximately 30-50 normal samples.

  
/opt/dragen/$VERSION/bin/dragen         #DRAGEN install path 
--ref-dir $REF_DIR                      #path to DRAGEN linear hashtable 
--output-directory $OUTPUT 
--intermediate-results-dir $PATH        #e.g. SSD /staging 
--output-file-prefix $PREFIX 
--tumor-fastq-list $PATH                #see 'Input Options' for FQ, BAM or CRAM 
--tumor-fastq-list-sample-id $STRING 
--vc-detect-systematic-noise=true 
--enable-dna-amplicon true 
--amplicon-target-bed $PATH 
--vc-enable-germline-tagging=true 
--variant-annotation-data $PATH 
--intermediate-results-dir $PATH 
--output-directory $PATH 
--output-file-prefix $STRING 

Gather the full paths to the small variant hard filtered VCFs (not GVCFs) from step 1 and create a lines file ${VCF_LIST} by specifying 1 file per line.

Step 2. Generate the final noise file.

This step generates a bed file containing mean and max noise estimates per position. This can be used directly during variant calling (argument --vc-systematic-noise).

  
/opt/dragen/$VERSION/bin/dragen         #DRAGEN install path 
--ref-dir $REF_DIR                      #path to DRAGEN linear hashtable 
--output-directory $OUTPUT 
--intermediate-results-dir $PATH        #e.g. SSD /staging 
--output-file-prefix $PREFIX 
--build-sys-noise-vcfs-list ${VCF_LIST} 

The SNV systematic noise files can also be built in the cloud using the DRAGEN Baseline Builder App on BaseSpace or the DRAGEN Systematic Noise File Builder Pipeline on ICA.

SV Systematic Noise

SV systematic noise files are considered experimental in amplicon.

Custom

Custom systematic noise files can be generated for amplicon Panels. For best accuracy the normal samples should ideally closely match the sequencer, sample type, library prep and coverage of the tumor samples of interest. It is typically recommended to use 30 - 50 normals when building a noise file, but fewer can be used.

Step 1. Run DRAGEN somatic tumor-only on normal samples with --sv-detect-systematic-noise set to true to generate VCF output per normal sample.

  
/opt/dragen/$VERSION/bin/dragen         #DRAGEN install path 
--ref-dir $REF_DIR                      #path to DRAGEN linear hashtable 
--output-directory $OUTPUT 
--intermediate-results-dir $PATH        #e.g. SSD /staging 
--output-file-prefix $PREFIX 
--enable-dna-amplicon true 
--amplicon-target-bed $PATH 
--tumor-fastq-list $PATH                #see 'Input Options' for FQ, BAM or CRAM 
--tumor-fastq-list-sample-id $STRING 
--sv-detect-systematic-noise true 

Step 2. Build the BEDPE file using input VCFs from previous step.

  
/opt/dragen/$VERSION/bin/dragen         #DRAGEN install path 
--ref-dir $REF_DIR                      #path to DRAGEN linear hashtable 
--output-directory $OUTPUT 
--intermediate-results-dir $PATH        #e.g. SSD /staging 
--output-file-prefix $PREFIX 
--sv-build-systematic-noise-vcfs-list $VCF_LIST#one VCF per line. 

Systematic noise BEDPE files can also be built in the cloud using the DRAGEN Baseline Builder App on BaseSpace or the DRAGEN Systematic Noise File Builder Pipeline on ICA.

CNV Panel of Normals (PON)

For CNV PON requirements and generation options see CNV Preprocessing | Panel of Normals.

If a matched normal is available it is recommended to include it in the PON.

Step 1. Generate CNV target counts of individual normal samples.

Any samples that should not be included in the final PON file can be excluded from this step. Any options used for CNV target counts generation should be matched when processing the case samples.

  
/opt/dragen/$VERSION/bin/dragen         #DRAGEN install path 
--ref-dir $REF_DIR                      #path to DRAGEN linear hashtable 
--output-directory $OUTPUT 
--intermediate-results-dir $PATH        #e.g. SSD /staging 
--output-file-prefix $PREFIX 
--enable-dna-amplicon true 
--amplicon-target-bed $PATH 
--tumor-fastq-list $PATH                #see 'Input Options' for FQ, BAM or CRAM 
--tumor-fastq-list-sample-id $STRING 
# CNV 
--enable-cnv true 

Step 2. CNV combined counts file generation.

  
/opt/dragen/$VERSION/bin/dragen         #DRAGEN install path 
--ref-dir $REF_DIR                      #path to DRAGEN linear hashtable 
--output-directory $OUTPUT 
--intermediate-results-dir $PATH        #e.g. SSD /staging 
--output-file-prefix $PREFIX 
--enable-cnv true 
--cnv-generate-combined-counts true 
--cnv-normals-list $CNV_NORMALS_LIST 

$CNV_NORMALS_LIST is a text file with one line for each path to a CNV target counts file generated in step 1 (<output-file-prefix>.target.counts.gz as cnv-enable-gcbias-correction is by default false in amplicon). Individual target counts files are merged into a single <output-file-prefix>.combined.counts.txt.gz PON file in the output directory. The PON file is used for each case sample run of DRAGEN CNV using the --cnv-combined-counts option.

Last updated

Was this helpful?