DRAGEN Server App
Analysis on DRAGEN Server
Prerequisites
DRAGEN Phase 3 or 4 server
DRAGEN License
Network Storage Sever
DRAGEN server
DRAGEN phase 4 server is recommended especially for datasets from NovaSeq X instruments. The server has 12 TB of intermediate data storage space for full processing of a NovaSeq X 25B flow cell.
DRAGEN phase 3 server has 6 TB of intermediate data storage space, and is recommended for flow cells from the NovaSeq 6000 or 6000 Dx instruments.
DRAGEN license
The pipeline uses the standard DRAGEN license without requiring any special licenses.
NFS and CIFS file servers
The pipeline is designed to stream data from a network file server onto the DRAGEN server, complete the analysis using the /staging
area of the high performance SSD. and then stream the analysis output back to the network file server.
The network file server may be mounted to the DRAGEN server using the NFS or CIFS protocol.
Starting from BCL Files
If starting from BCL (*.bcl) files, the pipeline requires the run folder to contain certain files and folders. These inputs are required for Docker.
The run folder contains data from the sequencing run, make sure that the folder contains the following files:
Config folder
Configuration files
Data folder
*.bcl files
Images folder
[Optional] Raw sequencing image files.
Interop folder
Interop metric files.
Logs folder
[Optional] Sequencing system log files.
RTALogs folder
Real-Time Analysis (RTA) log files.
RunInfo.xml file
Run information.
RunParameters.xml file
Run parameters.
SampleSheet.csv file
Sample information. If you want to use a sample sheet that is not in the run folder or a sample sheet named something other than SampleSheet.csv
, provide the full path.
Starting from FASTQ Files
The following inputs are required for running the using FASTQ (*.fastq) files. The requirements apply to Docker.
Full path to an existing FASTQ folder.
The sample sheet is in the FASTQ folder path, or you can set the path to the sample sheet with the
--sampleSheet
override command line option.
Make sure there is sufficient disk space for the analysis to complete. Refer to the --help
command line argument details for disk space requirements.
Use BCL Convert to produce FASTQ files for the pipeline. Using bcl2fastq does not produce the same results and is discouraged.Make sure that BCL Convert is set to write UMI sequences to the read headers in the FASTQ files.
FASTQ File Organization
Store FASTQ files in individual subfolders that correspond to a specific Sample_ID. Keep file pairs together in the same folder. Alternatively, store the FASTQ files in one flat folder structure where the FASTQ files are stored in one folder.
The pipeline requires separate FASTQ files per sample. Do not merge FASTQ files.
The instrument generates two FASTQ files per flow cell lane, so that there are eight FASTQ files per sample.
Sample1_S1_L001_R1_001.fastq.gz
Sample1 represents the Sample ID.
The S in S1 means sample, and the 1 in S1 is based on the order of samples in the sample sheet, so S1 is the first sample.
L001 represents the flow cell lane number.
The R in R1 means Read, so R1 refers to Read 1.
Last updated
Was this helpful?