Build a Methylation Hash Table
To run DRAGEN methylation pipeline, you need to build a methylation-specific hash table. Both multi-pass and single-pass options require methylation hash tables. The type of hash table must match the method used during alignment (methylation-mapping-implementation: single-pass| multi-pass). For more information on multi-pass and single-pass methods, see Mapping Method Options.
If using the single-pass method, make sure that your hash table meets the following requirements:
• | Set methylation-mapping-implementation: single-pass. |
• | Set ht-methylated-combined=true to generate a combined hash table in the methyl_converted subdirectory. |
• | Single-pass mode supports alt-contig references. Each contig in the original FASTA appears twice in the combined reference genome files, one time with each conversion type. |
If using the multi-pass method, make sure that your hash table meets the following requirements:
• | Set methylation-mapping-implementation: multi-pass. |
• | Set ht-methylated=true to generate the CT_converted and GA_converted subdirectories. |
• | Multi-pass mode does not support alt-contig references. The subdirectories contain the genome index for C to T and G to A conversion. |
You can set ht-methylated-combined=true and ht-methylated=true together to create a hash table directory that can be used for both methylation-mapping methods.
The following is an example command line for a single-pass hash table.
dragen --build-hash-table true \
--output-directory=sample.output.directory \
--ht-reference=sample.input.fa \
--ht-num-threads 40 \
--ht-methylated=true \
--ht-methylated-combined=true \
--ht-seed-len 27
The --ht-seed-len 27 option is important for optimal results. The --ht-num-threads option enables multiple numbers of threads for faster building.