BCL Convert v4.0
The Illumina BCL Convert v4.0 is a standalone local software app that converts the Binary Base Call (BCL) files produced by Illumina sequencing systems to FASTQ files. BCL Convert v4.0 also provides adapter handling (through masking and trimming), unique molecular identifier (UMI) trimming, and produces metric outputs.
The minimum operating requirements for BCL Convert v4.0 are as follows.
Hardware Requirements
• | Single multiprocessor or multicore computer |
• | Minimum 64 GB RAM |
Exact storage requirements vary based on throughput and sample count. For more information, refer to the BCL Convert v4.0 pages of the Illumina support center.
Software Requirements
• | Linux CentOS 6 or later |
• | Root access to your computer |
BCL Convert v4.0 is installed from an RPM package downloaded from the Illumina support center. To start installation navigate to /usr/local/bin/bcl-convert.
Install the RPM package as follows.
• | To install the software in the default location, use the following command: |
rpm --install <rpm package-name>
• | To specify a custom install location, use the following commandr: |
rpm --install --prefix <user-specified directory>
<rpm package-name>
Run BCL Convert v4.0 as follows.
1. | Open a command-line window. |
2. | Enter the following command and add options as needed: |
nohup /usr/local/bin/bcl-convert
For example, the following command populates the output directory with FASTQ files:
nohup /usr/local/bin/bcl-convert -bcl-input-directory
<RunFolder> --output-directory <OutputDirectory>
The Tools and Utilities section of the Illumina DRAGEN Bio-IT Platform documentation provides additional information for using BCL Convert v4.0.
BCL Convert v4.0 produces the following FASTQ and metrics output files. All output files can be found in the Reports folder of the output directory.
For additional information on FASTQ output and directory files and BCL metric output files, refer to the Tools and Utilities section of the Illumina DRAGEN Bio-IT Platform documentation.
FASTQ Output Files
On the Files tab, BCL Convert v4.0 generates one FASTQ data set per sample.
These files are arranged in the following folder structure:
• | <Sample ID> data set—Contains complete FASTQ (*.fastq.gz) files for each sample. |
Index Metrics Output File
The index metrics output file (IndexMetricsOut.bin) is a binary file in BIN format. The file contains index statistics for each sample and index combination per lane provided to BCL Convert v4.0 in the sample sheet. The content of the file is documented as follows.
• | Byte 0: file version number (2) |
• | The remaining bytes represent records, which are composed of the following information: |
– | 2 bytes: lane number (uint16) |
– | 4 bytes: tile number (uint32) |
– | 2 bytes: read number (uint16) |
– | 2 bytes: indexLength, the length in bytes of index name (uint16) |
– | indexLength bytes: string representing index name |
– | 8 bytes: number of occurrences of index (uint64) |
– | 2 bytes: sampleLength, the length in bytes of the sample name (uint16) |
– | sampleLength bytes: string representing sample name |
– | 2 bytes: projectLength, the length in bytes of the project name (uint16) |
– | projectLength bytes: string representing project name |
Use the following to troubleshoot or resolve error conditions experienced while using IlluminaBCL Convert v4.0.
Barcode Collision Error and Solution
Previous versions of BCL Convert allowed the overall concentrated sequence to pass the single index hamming distance rules if the concentration was sufficiently diverse. BCL Convert v3.10.5 and later uses strict barcode collision logic to support increased high-throughput and complex sample pooling. Each index in a dual setup must individually meet the hamming distance requirements set by the BarcodeMismatchesIndex# value. If either i7 or i5 does not meet the hamming distance requirements the program will error.
For default BarcodeMismatchesIndex1 and BarcodeMismatchesIndex2, the following apply:
• | Barcodes must differ by at least 3 bases. |
• | If any two samples in i7 differ by fewer than 3 bases, an error is produced and the run will not proceed, regardless of the samples i5 values. |
• | If any two samples in i5 differ by fewer than 3 bases, an error is produced and the run will not proceed, regardless of the samples i7 values. |
If you receive errors with current versions of DRAGEN or BCL Convert, lower the mismatch tolerance for the index producing the error. To lower mismatch tolerance, use the BarcodeMismatchesIndex1 or BarcodeMismatchesIndex2 sample sheet settings. For additional information on barcode mismatches, refer to the Tools and Utilities section of the Illumina DRAGEN Bio-IT Platform v4.0 documentation.
The BCL Convert v4.0 support pages on the Illumina support center provide additional resources. These resources include training, compatible products, and other considerations. Always check the support center for the latest versions.
Document # |
Date |
Description of Change |
---|---|---|
200023259 v00 |
July 2022 |
Initial release. |