Unique Molecular Identifiers (UMI) Support

DRAGEN support FASTQ files that contain UMI barcodes during the alignment phase. The principle and requirement are identical to DNA UMI. During library prep, (methyl-treated) DNA fragments could be barcoded by unique molecular identifiers, so that true signals from the original fragments can be separated from PCR error and sequencing error, which enables more accurate methylation calling. The *.fastq files need to have UMI barcode in 7th field of the QNAME. See the following example.

@NS500561:434:H5LC2BGXJ:1:11101:10798:1359:CACATGA+ACATTC 1:N:0:TGGTACCTAA+AGTACTCATG

To enable UMI, either set --umi-enable true if you are using random UMI (common), or set --tso500-solid-umi true if you are using the same non-random UMIs as the TSO500 solid panel. If so, read collapsing will be performed among reads with the same UMI that are mapped to the same genomic location at the same strand, either from top (OT/CTOT) or bottom (OB/CTOB) strand.

UMI is not supported in the deprecated multi-pass mode, so make sure --methylation-mapping-implementation single-pass is set.

For more information, refer to Unique Molecular Identifiers.