QUAL, QD, and GQ
In single sample VCF and gVCF, the QUAL follows the definition of the VCF specification. For more information on the VCF specification, see the most current VCF documentation available on samtools/hts-specs GitHub repository.
• | QUAL is the Phred-scaled probability that the site has no variant and is computed as: |
QUAL = -10*log10 (posterior genotype probability of a homozygous-reference genotype (GT=0/0))
That is, QUAL = GP (GT=0/0), where GP is the posterior genotype probability in Phred scale.
QUAL = 20 means there is 99% probability that there is a variant at the site. The GP values are also given in Phred-scaled in the VCF file.
• | GQ is the Phred-scaled probability that the call is incorrect. |
GQ=-10*log10(p), where p is the probability that the call is incorrect.
GQ=-10*log10(sum(10.^(-GP(i)/10))) where the sum is taken over the GT that did not win.
GQ of 3 indicates a 50 percent chance that the call is incorrect, and GQ of 20 indicates a 1 percent chance that the call is incorrect.
• | QD is the QUAL normalized by the read depth, DP. |
The formulation is summarized in the following table.
Metric |
QUAL |
GQ |
QD |
Description |
Probability that the site has no variant |
Probability that the call is incorrect |
Qual normalized by Depth |
Formulation |
QUAL = GP(GT=0/0) |
GQ=-10*log10(p) |
QUAL/DP |
Scale |
Unsigned Phred |
Unsigned Phred |
Unsigned Phred |
Numerical example |
QUAL=20: 1 % chance that there is no variant at the site QUAL=50: 1 in 1e5 chance that there is no variant at the site |
GQ=3, 50% chance that the call is incorrect GQ=20, 1% chance that the call is incorrect |
|