CYP2D6 Caller

The CYP2D6 Caller genotypes the CYP2D6 gene from whole-genome sequencing (WGS) data and is derived from the method implemented in Cyrius¹. Because CYP2D6 shares high sequence similarity with its pseudogene paralog CYP2D7 and carries a wide variety of common structural variants (SVs), a specialized caller is necessary to resolve variants and identify likely star allele haplotypes.

The CYP2D6 Caller performs the following steps:

1. Determines total CYP2D6 and CYP2D7 copy number from read depth.

2. Determines CYP2D6-derived copy number at CYP2D6/CYP2D7 differentiating sites.

3. Detects SV breakpoints by calculating the changes in CYP2D6-derived copy number along the CYP2D6 gene.

4. Calls small variants in CYP2D6 copies.

5. Identifies star alleles from the detected SV breakpoints and small variants.

6. Identifies the most likely genotype for the called star alleles.

The CYP2D6 Caller requires WGS data aligned to a human reference genome with at least 30x coverage.

Structural Variant Calling

The CYP2D6-derived copy number along the CYP2D6 gene is used to identify known population structural variants (SVs), including whole gene deletions and duplications as well as certain gene conversions and gene fusions. The following fusion variants are detected:

Fusion Breakpoint	Hybrid Gene Structure	Star-Allele Designation
exon 9	2D6-2D7	*4.013, *36, *57, *83
exon 9	2D7-2D6	*13
intron 4	2D7-2D6	*13
intron 1	2D7-2D6	*13
intron 1	2D6-2D7	*68

In addition to the exon 9 fusion breakpoints, exon 9 can participate in CYP2D7 gene conversion, which results in an embedded CYP2D7 sequence instead of a true hybrid. The structural variant caller also detects exon 9 gene conversions. Because only changes in CYP2D6-derived copy number yield structural variant calls, there might be rare cases where two hybrid copies result in no structural variant calls, for example, when both *36 and *13 with a fusion breakpoint in exon 9 are present. However, the structural variant caller can detect multiple copies of the same fusion type (eg, *36x2) and cases where both an exon 9 gene conversion copy and an exon 9 2D6-2D7 hybrid are present.
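The cancellation case described above can be illustrated with a toy model. The following is a minimal sketch, not the caller's implementation: it assumes six hypothetical differentiating sites ordered from the 5' end to exon 9, marks each site of each gene copy as CYP2D6-derived (1) or CYP2D7-derived (0), and reports a candidate breakpoint wherever the summed CYP2D6-derived copy number changes between adjacent sites. The function names and the site count are illustrative assumptions.

```python
# Toy model: each gene copy is a list of 0/1 flags over differentiating sites,
# ordered 5' -> 3'. The last site (index 5) stands in for exon 9.
N_SITES = 6  # hypothetical number of sites, chosen for illustration only


def copy_profile(switch_at=None, starts_as_2d6=True, n_sites=N_SITES):
    """Return per-site flags for one copy; 1 = CYP2D6 base, 0 = CYP2D7 base."""
    first, second = (1, 0) if starts_as_2d6 else (0, 1)
    if switch_at is None:
        return [first] * n_sites
    return [first] * switch_at + [second] * (n_sites - switch_at)


def cyp2d6_derived_cn(copies):
    """Sum the flags across copies to get CYP2D6-derived copy number per site."""
    return [sum(site_flags) for site_flags in zip(*copies)]


def cn_changes(cn):
    """List (site index, change) wherever copy number differs from the previous site."""
    return [(i, cn[i] - cn[i - 1]) for i in range(1, len(cn)) if cn[i] != cn[i - 1]]


FULL_2D6 = copy_profile()                                    # intact CYP2D6 copy
STAR36_LIKE = copy_profile(switch_at=5)                      # 2D6-2D7, exon 9 breakpoint
STAR13_LIKE = copy_profile(switch_at=5, starts_as_2d6=False) # 2D7-2D6, exon 9 breakpoint

one_hybrid = cyp2d6_derived_cn([FULL_2D6, FULL_2D6, STAR36_LIKE])
two_same = cyp2d6_derived_cn([FULL_2D6, FULL_2D6, STAR36_LIKE, STAR36_LIKE])
opposite = cyp2d6_derived_cn([FULL_2D6, FULL_2D6, STAR36_LIKE, STAR13_LIKE])

print(cn_changes(one_hybrid))  # one copy-number drop at the exon 9 site
print(cn_changes(two_same))    # a drop of two at the exon 9 site
print(cn_changes(opposite))    # no change: the two hybrids cancel out
```

In this sketch the *36-like and *13-like copies contribute complementary profiles, so the summed copy number is flat and no breakpoint is reported, whereas two copies of the same fusion type produce a larger change at the same site.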

CYP2D6 Output File

The CYP2D6 Caller writes its calls to the targeted callers output file, <prefix>.targeted.json, which also aggregates calls from other targeted callers. The following example shows this file when only the CYP2D6 Caller is enabled:

{
    "dragenVersion": "4.2.0-724-gb600fcef",
    "sample": "NA19374",
    "pharmcatMetabolismStatusResourceUrl": "https://github.com/PharmGKB/PharmCAT/blob/aeecfe5f787e95dfb31ede62884e287affef45b3/src/main/resources/org/pharmgkb/pharmcat/definition/gene_phenotypes.json",
    "cyp2d6": {
        "genotype": "*17/*2",
        "genotypeFilter": "PASS",
        "pharmcatDescription": "An individual carrying two normal function alleles",
        "pharmcatMetabolismStatus": "Normal Metabolizer"
    }
}

For the CYP2D6 Caller, the fields are defined as follows:

Field in JSON	Explanation	Type and Possible Values
dragenVersion	Version of DRAGEN	string
sample	Sample ID	string
pharmcatMetabolismStatusResourceUrl	A URL containing the genotype to PharmCAT mapping information	string (web link)
cyp2d6	A JSON object containing the CYP2D6 call for this sample	json-object
genotype	Star allele genotype identified for the sample	string
genotypeFilter	The filter status for the genotype call	string (The value can include: PASS, No_call, or More_than_one_possible_genotype)
pharmcatDescription	The description corresponding to the genotype, mapped from PharmCAT	string
pharmcatMetabolismStatus	The metabolism status corresponding to the genotype, mapped from PharmCAT	string
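As an illustration of how these fields might be consumed downstream, the following minimal sketch reads the CYP2D6 call with the Python standard library. The function name read_cyp2d6_call and the trimmed example text are assumptions made for this illustration; only the field names come from the table above.

```python
import json


def read_cyp2d6_call(targeted_json_text):
    """Extract the CYP2D6 call from the text of a <prefix>.targeted.json file.

    Returns None when the file has no "cyp2d6" entry, for example when the
    CYP2D6 Caller was not enabled for the run.
    """
    data = json.loads(targeted_json_text)
    call = data.get("cyp2d6")
    if call is None:
        return None
    return {
        "sample": data.get("sample"),
        "genotype": call.get("genotype"),
        "filter": call.get("genotypeFilter"),
        "metabolism_status": call.get("pharmcatMetabolismStatus"),
    }


# Trimmed copy of the example file shown earlier (resource URL omitted).
example_text = """
{
    "dragenVersion": "4.2.0-724-gb600fcef",
    "sample": "NA19374",
    "cyp2d6": {
        "genotype": "*17/*2",
        "genotypeFilter": "PASS",
        "pharmcatDescription": "An individual carrying two normal function alleles",
        "pharmcatMetabolismStatus": "Normal Metabolizer"
    }
}
"""

result = read_cyp2d6_call(example_text)
print(result)
```

Checking the filter value before using the genotype is a reasonable habit, because a No_call or More_than_one_possible_genotype result should not be treated as a confident call.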

When the option --targeted-enable-legacy-output=true is set, the CYP2D6 Caller also generates a <output-file-prefix>.cyp2d6.tsv file in the output directory. The output file contains the following tab-delimited fields with no header line:

• Sample name.

• One or more semicolon-delimited CYP2D6 genotypes or None for no call.

• The filter status. The value can include: PASS, No_call, or More_than_one_possible_genotype.

Each CYP2D6 genotype contains two haplotypes separated by a slash (eg, *1/*2). Each haplotype consists of one or more star alleles separated by a plus sign (eg, *10+*36). When a haplotype contains more than one copy of the same star allele, that star allele appears only once, followed by an x (multiplication sign) and the number of copies (eg, *1x2 for two copies of *1).
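The notation described above can be unpacked mechanically. The following minimal sketch parses one line of the legacy TSV output into haplotypes and (star allele, copy count) pairs. The function names and sample lines are hypothetical, and the regular expression assumes that star allele names never contain the letter x.

```python
import re

# A star allele such as *36 or *4.013, optionally followed by xN for N copies.
ALLELE_RE = re.compile(r"^(\*[^x]+)(?:x(\d+))?$")


def parse_haplotype(haplotype):
    """Split a haplotype such as '*10+*36' into (allele, copies) pairs."""
    alleles = []
    for token in haplotype.split("+"):
        match = ALLELE_RE.match(token)
        if match is None:
            raise ValueError(f"unrecognized star allele: {token!r}")
        alleles.append((match.group(1), int(match.group(2) or 1)))
    return alleles


def parse_genotype(genotype):
    """Split a genotype such as '*1x2/*10+*36' into its two haplotypes."""
    haplotypes = genotype.split("/")
    if len(haplotypes) != 2:
        raise ValueError(f"expected two haplotypes: {genotype!r}")
    return [parse_haplotype(h) for h in haplotypes]


def parse_tsv_line(line):
    """Parse one headerless line: sample, genotype list (or None), filter."""
    sample, genotypes, filter_status = line.rstrip("\n").split("\t")
    if genotypes == "None":
        parsed = []  # no call
    else:
        parsed = [parse_genotype(g) for g in genotypes.split(";")]
    return sample, parsed, filter_status


print(parse_genotype("*1x2/*10+*36"))
print(parse_tsv_line("NA19374\t*17/*2\tPASS"))
```

A line with more than one semicolon-delimited genotype yields one parsed entry per candidate genotype, which matches the More_than_one_possible_genotype filter status.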

Command-line Examples

To enable the CYP2D6 Caller, use --enable-cyp2d6=true. The CYP2D6 Caller is disabled by default. The CYP2D6 Caller can run directly from FASTQ input with the mapper or from prealigned BAM/CRAM input. You can also enable the CYP2D6 Caller in parallel with any other germline variant callers as part of a WGS germline analysis workflow. For more information on other variant callers, see DRAGEN DNA Pipeline.
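As a rough illustration, the following sketch assembles a DRAGEN command line for a run from prealigned BAM input with the CYP2D6 Caller enabled. Only --enable-cyp2d6=true and --targeted-enable-legacy-output=true come from this section. The remaining options (--ref-dir, --bam-input, --enable-map-align, --output-directory, --output-file-prefix), the helper name, and all paths are assumptions for this example and should be checked against the DRAGEN command-line reference for the installed version.

```python
import shlex


def build_cyp2d6_command(ref_dir, bam_path, output_dir, prefix, legacy_tsv=False):
    """Assemble an argument list for a hypothetical BAM-input run."""
    command = [
        "dragen",
        "--ref-dir", ref_dir,              # assumed: reference hash table directory
        "--bam-input", bam_path,           # assumed: prealigned BAM input
        "--enable-map-align=false",        # assumed: skip remapping of aligned reads
        "--output-directory", output_dir,  # assumed option name
        "--output-file-prefix", prefix,    # assumed option name
        "--enable-cyp2d6=true",            # from this section
    ]
    if legacy_tsv:
        # From this section: also write <output-file-prefix>.cyp2d6.tsv
        command.append("--targeted-enable-legacy-output=true")
    return command


# Placeholder paths, for illustration only.
command = build_cyp2d6_command("/path/to/ref", "/path/to/sample.bam", "/path/to/out", "sample")
print(shlex.join(command))
```

Building the command as a list keeps each argument separate, so it can be passed to subprocess.run without shell quoting concerns.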

¹Chen X, Shen F, Gonzaludo N, et al. Cyrius: accurate CYP2D6 genotyping using whole-genome sequencing data. The Pharmacogenomics Journal. 2021;21(2):251-261. doi:10.1038/s41397-020-00205-5