VCF Imputation

DRAGEN v.4.0 extends the capabilities to infer bi-allelic SNP variants from low-coverage sequencing samples by packaging the GLIMPSE software (2020, Olivier Delaneau & Simone Rubinacci). The DRAGEN implementation of the GLIMPSE software allows for scalability of variant imputation by adding the following features:

End-to-end pipeline, where the 3 phases of the GLIMPSE software (Chunk, Phase, and Ligate) get executed by a single command, on one chromosome or multiple chromosomes
Software supported acceleration

The tool generates imputed variants for all samples provided in the specified regions, based on a pre-existing reference panel available for download on the Illumina DRAGEN Bio-IT Platform Product Files page.

The VCF imputation tool only supports:

Input sample data generated with the DRAGEN Bio-IT platform
Human sequencing data

The following is an example of commands to impute vcf on a single chromosome:

dragen

--enable-imputation true

--imputation-ref-panel-dir <REF_PANEL_DIR>

--imputation-ref-panel-prefix <IRPv1>

--imputation-chunk-input-region <chr22>

--imputation-phase-input-list <VCF_to_be_imputed.txt>

--imputation-genome-map-dir <MAP_DIR>

--output-directory <OUT_DIR>

--output-file-prefix <OUT_PREFIX>