Fusion Filtering
In assays, there are hundreds to thousands of fusion candidates in a single sample. Most fusion candidates (~99%) are false positives. The fusion filtering tool, DNA Fusion Filter (DNAFF), distinguishes true fusion calls from the false positives. DNAFF performs the following functions:
|
•
|
Removes spurious fusions including fusions with only one supporting read pair and fusions that overlap with repeat regions, which are more likely to have sequencing errors. |
|
•
|
Filters nonconfident supporting reads for all fusion candidates based on the following criteria: |
|
–
|
Filter reads with low-sequence identity with the fusion contig. |
|
–
|
Filter erroneous reads, which are reads that do not support the fusion. For example, reads that have suspicious supplementary alignment. |
|
–
|
Deduplicate reads based on UMI information. |
|
•
|
Applies the following rules for the final fusion output |
|
–
|
If the fusion gene pair has been reported in COSMIC, it must have ≥ 2 supporting reads. |
|
–
|
If the fusion gene pair has not been reported in COSMIC, it must have ≥ 3 supporting reads. |
|
–
|
At least one fusion breakpoint must fall within the 23 target genes. |