Bcl2fastq2 Sample Sheet Generation (NGS v5.16 and earlier)

Compatibility and Package Version:

Support for bcl2fastq v2.15 sample sheet generation is available in multiple Illumina sequencer integration packages. Refer to the release notes for your installed package for information about the features available.
Support for single and dual indexes, and indexes of different lengths, on a single flow cell has been validated and is supported with HCS v3.3.76

Overview

This article discusses the Illumina sequencer integration packages that generate a sample sheet for use with bcl2fastq v2.15 analysis software.

Sample sheet generation is configured on the step prior to the sequencing run, which is the step where samples are placed on the flow cells or reagent cartridges that will be placed in the instrument.

The sample sheet is generated by means of a script, which the user initiates by clicking a button on the Record Details screen of the step. This generates a bcl2fastq2 format sample sheet file for the container loaded during the step, where the name of the sample sheet will be <container name>.csv.

Configured step UDFs

Field Name

Field Type

Notes

Experiment Name

Text

Workflow

Text

Read 1 Cycles

Numeric

Read 2 Cycles

Numeric

Entering a value other than 0 indicates a paired-end reads run

Adapter

Text

Required if using 'Adapter Read 2'

Adapter Read 2

Text

Mask Adapter

Text

Required if using 'Mask Adapter Read 2'

Mask Adapter Read 2

Text

Submitted sample UDFs

Field Name

Field Type

Required?

Notes

Description

Text

No

Script parameters and usage

The following table lists the parameters used by the script.

Parameter

Description

u, username

LIMS username (Required)

p, password

LIMS password (Required)

i, processURI

LIMS process URI (Required) (lowercase I)

c, csvFileLimsIds

Sample sheet CSV file LIMS ID (Required - may be provided multiple times)

e, errorLogFileName

Log file name (Required)

l, useProjectLimsID

Project LIMS ID will be used instead of project name in the Project column of the sample sheet (Optional)

Accepted values: true or false .
Provide with quotes e.g. -l 'true' (lower case L)

s, useSampleLimsID

Sample LIMS ID will be used instead of sample name in the SampleName column of the sample sheet (Optional) (S ee Enabling unique FASTQ file names )

Accepted values: true or false .
Provide with quotes e.g. -s 'true'

a, appendLimsID

LIMS ID of the protocol step will be appended to sample names in the SampleName column of the sample sheet. (Optional)

Use this option to guarantee unique FASTQ file names per run. (See Enabling unique FASTQ file names)

Accepted values: true or false.
Provide with quotes e.g. -a 'true'

-r, useReverseComplement

If paired-end reads are used, the reverse complement of index2 will be displayed instead of the standard sequence.

Accepted values: true or fals e.
Provide with quotes e.g. -r 'true'

Usage

Below is an example automation command showing the script in use. The sample sheet generation portion of the parameter string is shown in bold.

bash -c "/opt/gls/clarity/bin/java -jar /opt/gls/clarity/extensions/<package_name>/<package_version>/EPP/<extensions.jar> \
-u {username} \
-p {password} \
-i {processURI:v2} \>
script:generate_bcl2fastq_sample_sheet \>
-c {compoundOutputFileLuid3} \>
-e {compoundOutputFileLuid4}"

Support for container types

All one-dimensional container types with both numeric rows and numeric columns are supported.

Sample sheet data

The following table lists the fields that are always populated in the sample sheet, along with all other fields that may also be present.

Note the following:

In the table, only items in bold are used by the bcl2fastq2 tool when processing. Other fields will be ignored by the tool and are included for your convenience.
If no upstream pooling is detected, BaseSpace Clarity LIMS will populate the sample sheet with the SampleID and SampleName of the submitted sample; other fields are populated with data from the samples that were input to the step (i.e. derived samples).
If upstream pooling is detected, BaseSpace Clarity LIMS will populate the sample sheet with the first upstream pooled inputs found – not with the submitted sample or step input fields.

For information about ordering of sample sheet data, illegal characters, and other rules and constraints, see Rules and constraints .

Field Name

Always Present?

Populated by

Comments

[Header]

Static text

Investigator Name

Yes

LIMS technician name

Experiment Name

No

Step UDF

Always present in NextSeq v1.0.1. If the UDF is not entered then the line will display as "Experiment Name".

Date

Yes

Script run date

Workflow

Yes

Step UDF

[Reads]

Static Text

Read 1 Cycles

No

Step UDF

Read 2 Cycles

No

Step UDF

[Settings]

Static text

Adapter

No

Step UDF

Required if AdapterRead 2 is used.

AdapterRead 2

No

Step UDF

MaskAdapter

No

Step UDF

Required if MaskAdapterRead 2 is used.

MaskAdapterRead 2

No

Step UDF

[Data]

Static text

Lane

Yes

Sample Output Placement

Not present for NextSeq v1.0.1.

SampleID

Yes

Sample LIMS ID

SampleName

Yes

Sample Name

Depending on the command-line value, this will either be the sample name or the LIMS ID of the submitted sample of the input.

The additional '-a' command line option appends the LIMS ID to the end of this value, e.g."Sample1-1234" (see Script parameters and usage )

SamplePlate

Yes

Sample input plate

This will be the input plate to the current step. If the input is a pool,then this will be the input plate of the given sample prior to entering the pool.

SampleWell

Yes

Sample input well location

This will be the well location of the sample on the input plate. If the input is a pool, then this will be the well location of the given sample prior to entering the pool.

index

No

Sample reagent label

Dual index reagents will contain a hyphen-separated DNA sequence. This field will use the first half of that value.

index2

No

Sample secondary reagent label

Dual index reagents will contain a hyphen-separated DNA sequence. This field will use a second half of that value.

Project

Yes

Sample project

Will be the name of LIMS ID of the project for the sample (see Script parameters and usage )

Description

Yes

Submitted sample UDF

File format and contents

This section outlines the format and contents of the generated sample sheet and associated log file.

When validating the installation of your integration, refer to this information to ensure that the sample sheet and log files are correctly generated.

Note:

The sample sheet file supports single and dual indexes, and indexes of different lengths, on a single flow cell (currently tested and supported with HCS v. 2.2.70 only). See Information and warning messages .

Bcl2fastq2 sample sheet

When single and dual indexes are present in the same flow cell:

No information or warning messages display if indexes of different lengths are present on the same flow cell.

To enable sample sheet generation for multiple containers, you must modify the process type on which the sample sheet generation automation is configured, creating a placeholder for each file:

To enable unique FASTQ file names per sequencing run, the EPP command on the process type must be configured to use the following parameter options:

.