Analysis Manifest JSON

To set up the analysisManifest.json to use in the CLI, use the following structure.

{

"libraryType": "<wgs>" or "<wes>",

"analysisFiles": <Analysis files. See Analysis Files>,

"analysisType": "rugd",

"dragenVersion": "<DRAGEN software version>",

"genomeReference": "<Reference genome>",

"notes": {"additional":"notes"},

"pedigree": [<Pedigree files. See Pedigree>],

"pedigreeName": "<Pedigree ID>"

}

Analysis Files

To specify the location of analysis and metrics files, use the analysisFiles configuration in the analysisManifest.json. The following example specifies files in the specified storage container.

If libraryType is set to wes then "targetedRegionsBedFile": "<targeted regions>.bed.gz" is required.

"analysisFiles": {

"targetedRegionsBedFile": "<targeted regions>.bed.gz" (Not used if libraryType is set to wgs).

"jointFiles": <Pedigree data files. See Joint Files>,

"sampleFiles": <Sample files. See Sample Files>,

"referenceTar": "<Genome reference file>.tar",

"graphExpansionDependencies": <Reference files for STR plots. See Graph-Expansion Files"

}

Joint Files

Joint files include multi-sample VCFs and metrics files generated from the DRAGEN joint genotyping pipeline. At least one joint VCF file is required.

To specify the location of joint files, use the jointFiles configuration in the analysisManifest.json. The following example specifies the location of joint genotyping files.

"jointFiles": {

"jointCnvVcf": "<pedigreeName>.cnv.vcf.gz",

"jointRepeatsVcf": "<pedigreeName>.repeats.merged.vcf.gz",

"jointSnvVcf": "<pedigreeName>.snv.vcf.gz",

"jointSvVcf": "<pedigreeName>.sv.vcf.gz",

"metricsFiles": [

"<pedigreeName>.<metricInfo>_metrics.csv",

]

},

Sample Files

To enable coverage and callability metrics, include both the coverageReportBed and the callabilityReportBed output files. For more information, see Coverage/Callability Reports Use Cases and Expected Output .

DRAGEN produces pipeline-specific metrics reports during each run. Illumina recommends using the version of the files generated from map-align.

If using DRAGEN 3.9.5 analysis with SMN calling enabled then "smnTsv": "<Sample ID>.smn.tsv" is required.

"sampleFiles": {

"<SampleID>": {

"smnTsv": "<Sample ID>.smn.tsv" (Required if SMN is enabled with DRAGEN 3.9.6.

"alignments": "<SampleID>.bam",

"rohBed": "<SampleID>.roh.bed",

"bafBw": "<SampleID>.hard-filtered.baf.bw",

"targetCountsCorrected": "<SampleID>.target.counts.gc-corrected.gz",

"coverageReportBed": "<SampleID>.qc-coverage-region-1_cov_report.bed",

"callabilityReportBed": "<SampleID>.qc-coverage-region-1_callability.bed",

"repeatsData": {

"bam": "<SampleID>.repeats.bam",

"vcf": "<SampleID>.repeats.vcf.gz"

},

"metricsFiles": [

"<SampleID>.cnv_metrics.csv",

"<SampleID>.fastqc_metrics.csv",

"<SampleID>.gc_metrics.csv",

"<SampleID>.mapping_metrics.csv",

"<SampleID>.ploidy_estimation_metrics.csv",

"<SampleID>.qc-coverage-region-1_coverage_metrics.csv",

"<SampleID>.roh_metrics.csv",

"<SampleID>.trimmer_metrics.csv",

"<SampleID>.vc_metrics.csv",

"<SampleID>.wgs_coverage_metrics.csv"

]

},

Graph-Expansion Files

To specify the location of input files used for generating STR plots, use the graphExpansionDependencies configuration in the analysisManifest.json. The following example specifies DRAGEN files.

"graphExpansionDependencies": {

"genomeFastaUrl": "<URL to the FASTA reference file>",

"smnRepeatSpecLink": "<URL to the repeat call specification file>"

},

Pedigree

To specify pedigree metadata, use the pedigree configuration in the analysisManifest.json.

"pedigree": [{

"id": "<Sample ID>",

"isAffected": "<Subject affected status>",

"maternalId": "<Maternal subject sample ID>",

"paternalId": "<Paternal subject sample ID>",

"sex": "<The sex of the sample subject>"

]}