Somatic Mode Options

Somatic mode has the following command line options:

Option

Description

--tumor-fastq1

--tumor-fastq2

Inputs a pair of FASTQ files into the mapper aligner and somatic variant caller. You can use these options with OTHER FASTQ options to run in tumor-normal mode. For example:

dragen -f -r /staging/human/reference/hg19/hg19.fa.k_21.f_16.m_149 \
--tumor-fastq1 <TUMOR_FASTQ1> \
--tumor-fastq2 <TUMOR_FASTQ2> \
--RGID-tumor <RG0-tumor> -–RGSM-tumor <SM0-tumor> \
-1 <NORMAL_FASTQ1> \
-2 <NORMAL_FASTQ2> \
--RGID <RG0> –RGSM <SM0> \
--enable-variant-caller true \
--output-directory /staging/examples/ \
--output-file-prefix SRA056922_30x_e10_50M

--tumor-fastq-list

Inputs a list of FASTQ files into the mapper aligner and somatic variant caller. You can use these options with other FASTQ options to run in tumor-normal mode. For example:

dragen -f \
-r /staging/human/reference/hg19/hg19.fa.k_21.f_16.m_149 \
‑‑tumor-fastq-list <TUMOR_FASTQ_LIST> \
--fastq-list <NORMAL_FASTQ_LIST> \
--enable-variant-caller true \
--output-directory /staging/examples/ \
--output-file-prefix SRA056922_30x_e10_50M

--tumor-bam-input

--tumor-cram-input

Inputs a mapped BAM or CRAM file into the somatic variant caller. You can use these options with other BAM/CRAM options to run in tumor-normal mode.

--vc-min-tumor-read-qual

Specifies the minimum read quality (MAPQ) to be considered for variant calling. The default value is 3 for tumor-normal analysis or 20 for tumor-only analysis.

--vc-callability-tumor-thresh

Specifies the callability threshold for tumor samples. The somatic callable regions report includes all regions with tumor coverage above the tumor threshold. The default value is 15. For more information on the somatic callable regions report, see Somatic Callable Regions Report

--vc-callability-normal-thresh

Specifies the callability threshold for normal samples, if present. If applicable, the somatic callable regions report includes all regions with normal coverage above the normal threshold. The default value is 5. For more information on the somatic callable regions report, see Somatic Callable Regions Report.

--vc-somatic-hotspots

--vc-use-somatic-hotspots

--vc-use-somatic-hotspots

Use --vc-somatic-hotspots to specify a hotspots input VCF. The variants in the hotspots VCF indicate positions where more somatic mutations are expected. For example, a hotspots input VCF can be derived from COSMIC. DRAGEN genotyping priors are boosted for all positions specified in the VCF, so you can call a variant at one of these sites with fewer supporting reads. If you do not provide a value for vc-somatic-hotspots, DRAGEN automatically selects a reference-specific hotspots VCF file from /opt/edico/config/somatic_hotspots*. If you provide a vc-somatic-hotspots VCF, the VCF always takes precedence over the default hotspots VCF.

To disable the hotspots feature, use vc-use-somatic-hotspots=false. Neither the default VCF nor a provided VCF is considered.

Input VCF records must be sorted in the same order as contigs in the selected reference.

--vc-hotspot-log10-prior-boost

If using --vc-use-somatic-hotspots, use vc-hotspotlog10-prior-boost to control the size of the adjustment. The default value is 4 (log10 scale) corresponding to an increase of 40 Phred.

--vc-enable-liquid-tumor-mode

--vc-tin-contam-tolerance

In a tumor-normal analysis, DRAGEN accounts for tumor-in-normal (TiN) contamination by running liquid tumor mode. Liquid tumor mode is disabled by default. When liquid tumor mode is enabled, DRAGEN is able to call variants in the presence of TiN contamination up to a specified maximum tolerance level. --vc-enable-liquid-tumor-mode enables liquid tumor mode with a default maximum contamination TiN tolerance of 0.15. If using the default maximum contamination TiN tolerance, somatic variants are expected to be observed in the normal sample with allele frequencies up to 15% of the corresponding allele in the tumor sample. vc-tin-contam-tolerance enables liquid tumor mode and allows you to set the maximum contamination TiN tolerance.

Liquid tumor mode is not equivalent to liquid biopsy. Liquid tumors in liquid tumor mode refer to hematological cancers, such as leukemia. For liquid tumors, it is not feasible to use blood as a normal control because the tumor is present in the blood. Skin or saliva is typically used as the normal sample. However, skin and saliva samples can still contain blood cells, so that the matched normal control sample contains some traces of the tumor sample and somatic variants are observed at low frequencies in the normal sample. If the contamination is not accounted for, it can severely impact sensitivity by suppressing true somatic variants.

Liquid tumor mode typically uses a library that is WGS or WES with medium depth for example (100x T/ 40xN), and the lowest VAF detected for these types of depths is ~5%. Liquid biopsy typically uses a targeted gene panel (eg, 500 genes), with very high raw depth (> 2000–5000x), and uses UMI indexing to enable sensitivity at VAF down to 0.5 % LoD.

--vc-override-tumor-pcr-params-with-normal

If using different sequencing systems or different library preparation methods for tumor and normal samples, DRAGEN recommends setting this option to false. In tumor-normal mode, DRAGEN estimates a set of PCR error parameters separately for each of the tumor and normal samples. By default, DRAGEN ignores the tumor-sample parameters and uses normal-sample parameters for analysis of both samples. This default prevents overestimation of tumor-sample error rates that can occur if the somatic variant rate is high. For example, the error rates can occur in germline admixture data sets where the somatic variant rate is equal to the germline variant rate.

--vc-enable-unequal-ntd-errors

--vc-enable-trimer-context

--vc-snp-error-cal-bed

These options control the Sample-specific NTD Error Bias Estimation feature.

--vc-af-lod

The probability calculation in the somatic caller assesses variant and noise hypotheses at fixed allele frequencies defined by a discrete grid (by default at coverages <200: 0, 0.05, 0.1, ... 1.0). This means that the calculation will assess variants with allele frequencies below 0.05 as if the true frequency is equal to 0.05; this strategy does not preclude such variants from being called but may result in lower scores compared to if the true frequency had been considered. At positions with higher coverage, DRAGEN adds extra grid points as in the table below to consider hypotheses involving lower allele frequencies and effectively achieve a lower limit of detection (LOD), with the lowest VAF halving every time the coverage doubles:

Coverage

Lowest AF

0-199

0.05

200-399

0.025

400-799

0.0125

To make sure that variant hypotheses at a specific LOD are always considered even at positions with low coverage. You can use the vc-af-lod option to always add extra grid points until there is a grid point at or below the specified level. Positions with high coverage may still have extra grid points added beyond this level. Using this option may result in extra low-AF calls being made, but note that it also causes low-AF noise hypotheses to be considered, which can have the opposite effect (suppression of calls).