DNA Expanded Metrics
DNA expanded metrics are provided for information only. They can be informative for troubleshooting but are provided without explicit specification limits and are not directly used for sample quality control. For additional guidance, contact Illumina Technical Support.
Metric |
Description |
Troubleshooting |
|||||||||
---|---|---|---|---|---|---|---|---|---|---|---|
TOTAL_PF_READS (count) |
Total number of non-supplementary, non-secondary, and passing QC reads after alignment to the whole genome sequence. |
Primarily driven by data output of sequencer, quality of library and balancing of library in library pool. If TOTAL_PF_READS is in line with other samples, but coverage metrics are more may suggest non-specific enrichment.
|
|||||||||
MEAN_FAMILY_SIZE (count) |
A UMI Family is a group of reads that all have the same UMI barcode. The family size is the number of reads in family. MEAN_FAMILY_SIZE is the mean of the entire population of reads assembled into UMI families. |
The mean UMI family size decreases with increased unique read numbers, and more input DNA leads to more unique reads. Conversely over sequencing of a fixed population of unique DNA molecules leads to increased family size. As a guide, for a good run with optimal cluster density, passing specs, even sample pooling, and good quality DNA we usually observe values <10.
|
|||||||||
MEDIAN_TARGET_COVERAGE (count) |
Median depth across all the unique loci occurring in all regions of the manifest file. |
Lower median target coverage may be due to poor sample input/quality, library preparation issues or low sequencing output. |
|||||||||
PCT_CHIMERIC_READS (%) |
Chimeric reads occur when one sequencing read aligns to two distinct portions of the genome with little or no overlap. Metric is proportion of total number of non-supplementary, non-secondary, and passing QC reads after alignment to the whole genome sequence. |
While this can be indicative of large-scale structural rearrangement of the genome, values that are elevated above the usual baseline may indicate enrichment probe contamination during library preparation. |
|||||||||
PCT_EXON_500X (%) |
Percentage of exon bases with 500X fragment coverage. Calculated against all regions in manifest containing _exon in name. |
Can be used in combination with 1000x and 1500x to understand under or over coverage of exons. |
|||||||||
PCT_EXON_1500X (%) |
Percentage of exon bases with 1500X fragment coverage. Calculated against all regions in manifest containing _exon in name. |
Can be used in combination with 500x and 1000x to understand under or over coverage of exons. |
|||||||||
PCT_READ_ENRICHMENT (%) |
Percentage of reads that have overlapping sequence with the target regions defined in the sample manifest. |
Indicative of general enrichment performance. Reduced proportions of enriched reads may indicate issues with the enrichment proportion of the library preparation. |
|||||||||
PCT_USABLE_UMI_READS (%) |
Percentage of reads that have valid UMI sequences associated with them. |
As UMI reads are sequenced at the start of each read, loss of valid UMI sequence may be cause by sequencing issues impacting the quality of base calling in this portion of the sequencing read. |
|||||||||
MEAN_TARGET_COVERAGE (count) |
Mean depth across all the unique loci defined in the manifest file. |
Lower mean target coverage may be due to poor sample input/quality, library preparation issues or low sequencing output. Large differences between the median and mean target coverage values may indicated a skewed distribution of target coverage. |
|||||||||
PCT_ALIGNED_READS (%) |
Proportion of aligned reads that are non-supplementary, non-secondary and pass QC versus aligned reads that are non-supplementary, non-secondary, mapped and pass QC. |
|
|||||||||
PCT_CONTAMINATION_EST (%) |
Percentage contamination estimation. The contamination level is computed by taking 2.0 * the average of the adjusted allele frequencies of all variants that were selected. The adjusted alllele frequency is either the actual allele frequency of the variant if it is less than 0.5, or 1 -allele frequency if it is greater than or equal to 0.5. |
High contamination estimates may be due to any of the following:
|
|||||||||
PCT_TARGET_0.4X_MEAN (%) |
Parentage of target (all locations in manifest) reads that have a coverage depth of greater the 0.4x the mean target coverage depth (see definition above). |
Provides an indication of uniformity of coverage of the target regions in the manifest file. When trended over time reductions in this metric may indicate an issue with the enrichment process resulting in coverage bias. |
|||||||||
PCT_TARGET_500X (%) |
Percentage of target bases with 500X fragment coverage. Calculated against all regions in manifest file. |
Can be used in combination with 1000x and 1500x to understand under or over coverage of targets. |
|||||||||
PCT_TARGET_1000X (%) |
Percentage of target bases with 1000X fragment coverage. Calculated against all regions in manifest file. |
Can be used in combination with 500x and 1500x to understand under or over coverage of targets. |
|||||||||
PCT_TARGET_1500X (%) |
Percentage of target bases with 1500X fragment coverage. Calculated against all regions in manifest file. |
Can be used in combination with 500x and 1000x to understand under or over coverage of targets. |