16S Sample JSON

16S Sample JSON

A JSON file is generated for each sample in the analysis and is named <outputFilePrefix>.<sampleId>.ap.json. It contains general metadata, version information, sample QC, statistics, and analysis options specified by the user.

Top-Level Node

The top-level section of the report JSON contains general metadata and version information.

Field
Description

.accession

Identifier used for the sample

.analysisId

Identifier used for the analysis

.analysisPipelineVersion

Analysis Pipeline release version

.deploymentEnvironment

Environment in which the results were produced

.dragenVersion

DRAGEN release version

.errors

List of errors encountered during the analysis

.reportTime

Date and time the report was generated

.runId

Identifier used for the sequencing run

.testName

Full name of test. "DRAGEN 16S Plus Analysis Pipeline"

.testType

Type of test ("Refseq-RDP-v1", "Custom")

.testUse

Test use. "For Research Use Only. Not for use in diagnostic procedures."

.testVersion

Test release version

.warnings

List of warnings encountered during the analysis

.qcReport.sampleQc Node

This section contains information about sample quality control (QC). The fields are relative to .qcReport.sampleQc.

Field
Description

.entropy

Shannon entropy of the counts of 5-mers in the reads after read QC processing, which is a measure of randomness

.gContent

Proportion of guanine (G) base calls in reads after read QC processing

.libraryQScore

Quality score of the library after read QC processing

.postQualityMeanReadLength

Average read length after read QC processing

.postQualityReads

Number of reads (or read pairs for paired-end data) in sample after read QC processing, inclusive of any duplicate reads

.postQualityReadsProportion

Proportion of post-quality reads (or read pairs for paired-end data) in sample relative to total raw reads

.preQualityMeanReadLength

Average read length before read QC processing

.totalRawBases

Number of base pairs in sample before read QC processing

.totalRawReads

Number of reads (or read pairs for paired-end data) in sample before read QC processing

.uniqueReads

Number of distinct reads (or read pairs for paired-end data) in sample before read QC processing

.uniqueReadsProportion

Proportion of distinct reads (or read pairs for paired-end data) in sample before read QC processing

.statistics Node

This section contains information about sample statistics. The fields are relative to .statistics.

Field
Description

.classificationCounts

Array of objects containing "count" and "rank" information for each of the seven canonical taxonomic levels

.numberOfSpeciesIdentified

Count of species identified in the sample after any read count threshold is applied

.shannonDiversity

Measure of species diversity within a community that takes into account both species richness and evenness

.userOptions Node

This section gives information about analysis options specified by the user. The fields are relative to .userOptions.

Field
Description

.readCountThreshold

Read count threshold for reporting taxa. Value is an integer >= 0. Default is 0 or unfiltered

.readQcEnabled

Boolean indicating if read QC (trimming and filtering based on quality and read length) is enabled

.customDatabaseFastaFile

Basename of the user-provided FASTA file when a custom reference database is specified

Last updated

Was this helpful?