Systematic Noise Filtering

The DRAGEN systematic noise filter is available in somatic mode, and can be used to reduce false positive calls by accounting for site-specific noise. This filter replaces the panel of normals option. Unlike the panel of normals filter that blocked certain positions, the systematic noise filter uses a statistical model. The systematic noise at each position is estimated by extracting the variant call allele frequencies at the same position from the normal samples and computing the mean or max allele frequency. During the somatic run variant calls are not filtered if the variant call's allele frequency is statistically much higher than the estimated noise at the same position. The filter is considered essential for tumor-only runs where a matched normal is not available, and is also recommended in tumor-normal mode.

During the construction of the noise file DRAGEN aims to detect germline calls and not include them as noise. The recommended strategy to identify germline variants in the normal samples is enabled by --vc-enable-germline-tagging=true along with the required Nirvana settings. To explicitly skip this step when generating the normal VCFs please enable vc-skip-germline-tagging=true along with an optional AF cutoff for build-sys-noise-germline-vaf-threshold. DRAGEN can estimate the noise as either the mean or max AF of all the normal variants at a location.

To enable the systematic noise filter during somatic variant calling use the option --vc-systematic-noise {NOISE_FILE_PATH}. For each site a P-value test will be conducted to assess whether a somatic variant may be explained by the noise model. The systematic noise uses a binomial model where the null hypothesis assumes that the variant's alt supporting reads can be explained by the observed systematic noise. If the variant call has an allele frequency that is significantly higher than the systematic noise, then the P value will be low, indicating that the null hypothesis can be discarded and the variant treated as a real variant rather than noise.