Illumina HiSeq/GAIIx Integration v5.5 / v5.6 User and Configuration Guide

Compatibility: See Illumina HiSeq/GAIIx Release Notes (for Clarity LIMS 4.0 or later).

Contents

The Illumina HiSeq/GAIIx Integration Package v5.5 supports Illumina HiSeq 1000, 1500, 2000, 2500, GAII(x), and HiScanSQ instruments.

The integration provides automated generation of a sample sheet file for use with the HiSeq or GAII(x) sequencing control software (HCS/SCS) and with bcl2fastq v2.15 or CASAVA v1.8.2 analysis software, and allows for automated tracking of an Illumina sequencing run in BaseSpace Clarity LIMS. Tracking includes instrument run status, automated reporting, and capturing and parsing of run statistics.

This page describes the integration between BaseSpace Clarity LIMS and the IlluminaHiSeq/GAII(x) system. It includes information about protocols and automations, script parameters and usage, configuration options, installed components, and rules and constraints.
For instructions on validating and troubleshooting the Illumina HiSeq/GAIIx Integration Package v5.5, see the Illumina HiSeq/GAIIx Integration v5.5 / v5.6 Validation and Troubleshooting Guide.
For overview information about how the Illumina – BaseSpace Clarity LIMS interface works, and which events are monitored, see How the Illumina Instrument and LIMS Software Interact.

Overview

This Illumina HiSeq/GAIIx Integration includes the following:

Preconfigured TruSeq Exome Enrichment 5.0, Illumina SBS (HiSeq GAIIx) 5.0, and HiSeq/GAIIx Validation protocols
Automatic generation of sample sheet files for use with bcl2fastq v2.15 or CASAVA v1.8.2 analysis software
Automated tracking in BaseSpace Clarity LIMS of the progress and metrics of an Illumina sequencing run
Per-instrument tracking of sequencing runs
Ability to directly monitor run status (cycle number) across multiple instruments from within BaseSpace Clarity LIMS
Automated tracking of Real Time Analysis (RTA) run directory location and other run specific information
Automated tracking of sequencing run parameters
Automatic generation of a report, which includes the current run’s statistics and a comparison  to the last five runs for any statistics that are one standard deviation off
Integration with Illumina's Bcl to FASTQ conversion and demultiplexing tool for CASAVA

Information on these features is provided in the following sections.

Protocols and Automations

The following out-of-the-box protocols are included in the Illumina HiSeq/GAIIx Integration Package v5.5:

TruSeq Exome Enrichment 5.0
Illumina SBS (HiSeq GAIIx) 5.0
HiSeq/GAIIx Validation

Figures 1 - 3 show these protocols as they display to the lab scientist working in Platform SW BaseSpace Clarity LIMS Lab View.

TruSeq Exome Enrichment 5.0 Protocol

Figure 1: Platform SW BaseSpace Clarity LIMS Lab View showing out-of-the-box TruSeq Exome Enrichment 5.0 protocol and steps

Illumina SBS (HiSeq GAIIx) 5.0 Protocol

Figure 2: Platform SW BaseSpace Clarity LIMS Lab View showing out-of-the-box Illumina SBS (HiSeq GAIIx) 5.0 protocol and steps

HiSeq/GAIIx Validation Protocol

Figure 3: BaseSpace Clarity LIMS Lab View showing HiSeq/GAIIx Validation protocol and step

For instructions on using the HiSeq/GAIIx Validation workflow to validate automated sample sheet generation, see the Illumina HiSeq/GAIIx Integration v5.5 / v5.6 Validation and Troubleshooting Guide.

Automations

Table 1 lists the automations included in this integration, and the protocol steps on which they are configured. Additional details are provided in the sections following the table.

Note that the TruSeq Exome Enrichment 5.0 protocol does not include any automation.

Table 1: Illumina HiSeq/GAIIx Integration out-of-the-box protocols and automations

Protocol

Automation and step

Illumina SBS (HiSeq GAIIx) 5.0

Create Normalization CSVConfigured in Library Normalization (Illumina SBS) 5.0 step
Assign QC flagsConfigured in qPCR QC 5.0step

Illumina SBS (HiSeq GAIIx) 5.0

HiSeq/GAIIx Validation

1. Generate HiSeq Sample Sheet
2. Generate CASAVA Sample Sheet
3. Generate bcl2fastqv2 Sample Sheet - Configured in Cluster Generation (Illumina SBS) 5.0 step

One of these automations is disabled upon installation of the integration. See Configuring Sample Sheet Generation.

Bcl conversion and demultiplexingConfigured on Bcl Conversion and Demultiplexing (Illumina SBS) 5.0 step

Depending on the requirements of the system, the BCL Conversion and Demultiplexing (Illumina SBS) 5.0 step may be removed from the protocol. This is typically the case when the system is configured to use the bcl2fastq2 sample sheet for downstream analysis. See Configuring Sample Sheet Generation.

The following sections describe how these automations work, and how the lab scientist interacts with them in the lab.

For more information on automations, including how to configure them, see the following topics in the LIMS Documentation.

LIMS v4.1 and v4.2 Documentation:

Configuring Protocol Steps (Protocols section)
Passing Parameters via EPP (Integrations via EPP section)

LIMS v5 Documentation:

Configuring Protocol Steps (Protocols section)
Passing Files and Tokens to Third-Party Programs (Automations section)

As of Platform SW BaseSpace Clarity LIMS v5.0, the term user-defined field (UDF) has been replaced with custom field in the user interface (the API resource is still called udf).

There are two types of custom fields:

Master step fields - configured on master steps. These fields only apply to the master step on which they are configured, and the steps derived from those master steps.
Global fields - configured on entities such as submitted sample, derived sample, measurement, etc. These fields apply to the whole Clarity LIMS system.

Library Normalization File Generation

How it Works

See Normalization Buffer Volumes.

User Interaction

On the Record Details screen of the Library Normalization (Illumina SBS) 5.0 step:

1. Enter values for each of the following:
Library volume (uL) transferred to destination plate
Normalized conc (nM)
Maximum destination container volume (uL)
Any custom fields/UDFs
2. Click Create Normalization CSV.

Figure 4: Record Details screen of Library Normalization (Illumina SBS) 5.0 step

Once the script has completed, the Normalization buffer volumes CSV file is generated and attached to the step.

QC Flag Assignment

How it Works

The assignQC script compares the values entered into the Sample Details table by the lab scientist with the criteria values set on the Record Details screen, and uses this information to set appropriate pass or fail QC flags on the output samples.

User Interaction

On the Record Details screen of the qPCR QC 5.0 step:

1. Upload the qPCR Result File to the placeholder and manually complete the Sample Details table.
2. Set the desired Criteria Source Data Fields, Criteria Operators, and Criteria Threshold Values.
3. Click Assign QC flags.

Figure 5: Record Details screen of qPCR QC 5.0 step

4. Once the script has completed:
QC flags are set on the output samples
The QC Assignment Log File and QC Assignment Report are generated and attached to the step

Sample Sheet Generation

The integration allows for automatic generation of two sample sheets:

One sample sheet designed to be used with the instrument control software
One sample sheet designed to be used with CASAVA v1.8.2 or bcl2fastq v2.15 sequencing analysis software. One of these analysis options is disabled when the system is initially configured – see Configuring Sample Sheet Generation.

By default, one instrument sample sheet and one analysis sample sheet are created for the flow cell loaded during the Cluster Generation (Illumina SBS) 5.0 step.

If required, you may configure the step to generate multiple analysis sample sheets, and to generate sample sheets for multiple flow cells.

Performing analysis with CASAVA or bcl2fastq2 software is optional. If you prefer to use a different analysis pipeline, you will need to create your own sample sheet. You can use Illumina Experiment Manager to do this.

Quick Links:

Guided sample sheet creation and setup
Illumina Experiment Manager User Guide
Illumina Experiment Manager Support

Table 2: Sample sheet files and placeholders generated by Cluster Generation (Illumina SBS) 5.0 step

Item

Description

Lab Tracking Form placeholder

This placeholder in the LIMS allows the user to manually attach a lab-specific tracking form to the step.

HiSeq Sample Sheet (CASAVA format)

This CSV file is automatically generated by the LIMS for use with the HiSeq/GAII(x) instrument.

May be opened as a text file, or as an MS Excel spreadsheet
A copy of the generated sample sheet is automatically placed in the Basecalls folder, the default location required by CASAVA

Sample Sheet Generation Log

Automatically generated by the LIMS, this log file captures any errors that the LIMS may encounter when generating the HiSeq/GAIIx sample sheet.

Analysis Sample Sheet (for use with CASAVA or bcl2fastq2)

This CSV file is automatically generated by the LIMS for use in CASAVA or bcl2fastq2 post-sequencing analysis.

May be opened as a text file, or as an MS Excel spreadsheet
If the CASAVA sample sheet is in use, a copy of the generated sample sheet is automatically placed in the Basecalls folder, the default location required by CASAVA

Analysis Sample Sheet Generation Log

Automatically generated by the LIMS, this log file captures any errors that the LIMS may encounter when generating the analysis sample sheet.

How it Works

1. Sample sheet generation is configured on the Record Details screen of the Cluster Generation (Illumina SBS) 5.0 step – this is when the samples are loaded onto the flow cell that will be placed in the instrument.

Figure 6 and Figure 7 show the three automations configured on the step, in LIMS v4.x and LIMS v5.x.

Figure 6: Cluster Generation (Illumina SBS) 5.0 automation configuration - LIMS v4.x

Figure 7: Cluster Generation I (Illumina SBS) 5.0 automation configuration - LIMS v5.x

2. Each of these automations invokes a script that generates the sample sheet – a CASAVA or bcl2fastq2 format comma-separated text file containing the required columns.
3. If the CASAVA format analysis sample sheet is used, once the sequencing run is in progress on the instrument and run data capture begins, the sample sheet for the current run is copied to the following directory for use with Illumina’s Bcl to FASTQ conversion and demultiplexing tool for CASAVA:

<run data directory>/Data/Intensities/BaseCalls/

See the CASAVA 1.8.2 Sample Sheet Generation and bcl2fastq2 Sample Sheet Generation articles for detailed information on:

Sample sheet generation script parameters and usage
Sample sheet data and configuration options
Sample sheet rules and constraints

User interaction

1. On the Placement screen of the Cluster Generation (Illumina SBS) 5.0 step:
Scan the flow cell ID (FCID) to populate the Container Name field.
Place the samples and proceed to Record Details screen.
2. On the Record Details screen of the Cluster Generation (Illumina SBS) 5.0 step:
Note that the value of the Workflow field is already set to GenerateFASTQ.

Figure 8: Workflow field of Cluster Generation step

In the Cluster Generation Kit field, select the desired kit from the drop-down list.
In the Cluster Generation Workflow field, select Paired-end or Single End Readfrom the drop-down list.

Figure 9: Cluster Generation Kit and Workflowfields in Record Details screen of Cluster Generation (Illumina SBS) 5.0 step

In the Reagent Lot Tracking section, select the desired TruSeq Cluster Kit from the drop-down list.
In the Step Details area:
3. For Paired End Reads: In the Read 2 Cycles field, enter a value other than 0.
4. Enter values for the remaining required custom fields / UDFs.
5. Click the first button – Generate HiSeq Sample Sheet – to generate the sample sheet to be uploaded to the HiSeq/GAIIx instrument.
6. Once the Generate HiSeq Sample Sheet automation has completed, the HiSeq sample sheet and log file are generated and attached to the step.
Download the sample sheet file and upload to the HiSeq/GAIIx sequencing control software (HCS/SCS) to start a run.
7. Click the second button to generate the analysis sample sheet. In the screenshot below, the step has been configured to generate a bcl2fastq2 format analysis sample sheet (see Configuring Sample Sheet Generation).

Figure 10: Configured sample sheet generation automations in Record Details screen of Cluster Generation (Illumina SBS) 5.0 step

8. Once the automation has completed:
bcl2fastq2: Download the sample sheet file; this will be used for analysis later in the workflow.
CASAVA: You do not need to download this file; it will be accessed during the Bcl Conversion and Demultiplexing (Illumina SBS) 5.0 step.
The analysis sample sheet and log file are generated and attached to the step.
9. (Optional) Upload the Lab Tracking Form to the appropriate placeholder.

Figure 11: Record Details screen of Cluster Generation (Illumina SBS) 5.0 step (Files section)

Sequencing Results Parsing

The Illumina Sequencing (Illumina SBS) 5.0 step records information for each lane of the flow cell and generates a report summarizing the results. In addition, run parameters, run info, the first base report (if generated), and a link to the run folder are automatically captured.

Generated and Captured Files

Table 3 describes the run information files, reports, placeholders, and links that the LIMS automatically generates or captures during a sequencing run.

Table 3: Run information generated or captured by Illumina Sequencing (Illumina SBS) 5.0 step

Item

Description

Run Info

Run Parameters (HCS-controlled instruments only)

RTA Configuration XML (SCS-controlled instruments only)

These XML files are automatically captured by the LIMS from the instrument's run folder.

They include the key run parameters, many of which are parsed out into key custom fields/UDFs.

Link to Run Folder

Automatically generated by the LIMS, this is a link to the network run folder where the data that was captured from the instrument during the run is stored.

Illumina Run Report

Automatically generated by the LIMS, this report provides key information about the run and the samples on the flow cell.

Information includes the flow cell ID, run directory location, instrument name, as well as primary analysis metrics for the instrument – summarized per flow cell lane for the entire run, and individual reads in the case of paired-end runs.

These metrics are compared against the instrument's per lane averages, calculated using metrics from the last 5 sequencing runs. Any values outside of 1 standard deviation are highlighted.

First Base Report (HCS-controlled instruments only)

Optionally generated during the instrument run by some Illumina instruments and captured by the LIMS, this report provides metrics to help determine the quality of the run.

Note that manual file import is required for Sequencing Control Software (SCS)-controlled instruments.

Lab Tracking Form placeholder

This placeholder in the LIMS allows the user to manually attach a lab-specific tracking form to the step.

Metadata

Listed below is the metadata that the LIMS automatically captures from the Illumina sequencing software as part of a sequencing run (see Figure 12). This information is gathered from various run result files and events.

Date Run* – run start date
Instrument Run ID – sequencer instrument name
Control Lane – PhiX control lane (blank or 1-8)
Experiment Name – entered in software
Finish Date* – run completion date
Flow Cell ID
Flow Cell Position – A or B
Flow Cell Version
Output Folder – run folder root
Index 1 Read Cycles – intended Index cycles
Index 2 Read Cycles – intended Index cycles
Run ID – the unique run ID
Run Type – sequencing run type description
Status – current status of the sequencing run on the instrument
SBS Kit Lot #
SBS Kit Type

Date Run is set to the date that the Begin Run event file is first picked up and associated with a step in BaseSpace Clarity LIMS, not the date of the Begin Run event itself (the date the run was performed on the instrument).

Finish Date is populated as follows:

If the End Run event contains a date in the format YYYY-MM-DD, Finish Date will be set to the date in the event file
If the End Run event does not contain a date or the date is in the wrong format, Finish Date will be set to the date when the event file is processed

Primary analysis metrics

Table 4 lists the Real Time Analysis (RTA) primary analysis metrics the LIMS automatically captures and records, per read, for samples in each flow cell lane. Some of these metrics are captured as soon as they become available during the run; others are captured upon run completion.

Table 4: RTA primary analysis metrics captured by Illumina Sequencing (Illumina SBS) 5.0 step

RTA primary analysis metrics

Corresponding LIMS custom fields/UDFs

Raw Yield (Gb)

Yield PF (Gb) R1

Yield PF (Gb) R2

% Bases >Q30

% Bases >=Q30 R1

% Bases >=Q30 R2

Cluster Density (K/mm^2)

Cluster Density (K/mm^2) R1

Cluster Density (K/mm^2) R2

Clusters Raw

Clusters Raw R1

Clusters Raw R2

Clusters PF

Clusters PF R1

Clusters PF R2

%PF

%PF R1

%PF R2

Intensity Cycle 1

Intensity Cycle 1 R1

Intensity Cycle 1 R2

% Intensity Cycle 20

% Intensity Cycle 20 R1

% Intensity Cycle 20 R2

% Phasing

% Phasing R1

% Phasing R2

% Prephasing

% Prephasing R1

% Prephasing R2

% Aligned

% Aligned R1

% Aligned R2

% Error Rate

% Error Rate R1

% Error Rate R2

How it Works

1. The sequencing service runs in the background on the LIMS server. The service detects event files that the instrument software (RTA) is producing as the run progresses; this in turn lets the service know where to find the run data.
2. As the run data is written out and the events come in (Begin Run, Cycle Complete, End Run):
The data is matched to the step based on the flow cell ID, which was entered by the user as the Container Name on the Cluster Generation step.
Read-only custom field / UDF values on the Record Details screen are populated accordingly (see Figure 12 ).

Figure 12: Record Details screen of Illumina Sequencing (Illumina SBS) 5.0 step showing custom fields/ UDFs used for result parsing

User Interaction

On the Record Details screen of the Illumina Sequencing (Illumina SBS) 5.0 step:

1. The read-only custom field/UDF values are automatically populated as the sequencer runs. Figure 12 shows the Record Details screen fields that are used for result parsing.
2. Once the run has completed:
The read-only custom fields/UDFs shown in Figure 12 are populated.
The Illumina Run Report is generated and attached.
The Run Parameters and Run Info files are attached.
The Link to Run Folder is generated and attached.
QC flags are set on each lane.
3. [Optional] In theFiles section, upload the First Base Report and Lab Tracking Form.

Figure 13: Record Details screen of Illumina Sequencing (Illumina SBS) 5.0 step (Files section)

Bcl Conversion and Demultiplexing

How it Works

Invoking the Bcl conversion and demultiplexing automation triggers the following events:

1. Initiate Bcl conversion with the correct parameters.
2. Parse Flowcell_demux_summary.xml .
3. Attach corresponding folder link file to each LIMS sample.
4. Parse sample stats.
5. Attach Demultiplex_Stats.htm and log file.

User Interaction

On the Record Details screen of the Bcl Conversion and Demultiplexing (Illumina SBS) 5.0 step:

1. Review the Bcl parameters provided, and edit as required.
2. Click Bcl conversion and demultiplexing.

Figure 14: Record Details screen of Bcl Conversion and Demultiplexing (Illumina SBS) 5.0 step

3. Once the script has completed:
Demultiplex Stats and Script Log Details files are generated and attached to the step
HTML link files are attached to each demultiplexed sample
Parsed metrics are updated on the step
QC flags are set on the samples

Setting the InstrumentType Parameter Value

If you are upgrading to the Illumina HiSeq/GAIIx Integration Package v5.5, you will need to set the value of the instrumentType parameter (-t) in the BCL Conversion and Demultiplexing step's EPP string.

This parameter value should be set to ‘hiseqga’, which is the instrument type prefix of the database properties used by the string.

The correctly configured parameter will look like this:

-t 'hiseqga’

In new installations of the Illumina HiSeq/GAIIx Integration Package v5.5, this parameter value is already set in the out-of-the-box configuration.

Script Parameters and Usage

For details, see the following articles in the LIMS Integrations documentation:

Normalization Buffer Volumes
Assign QC flags
CASAVA 1.8.2 Sample Sheet Generation
Bcl2fastq2 Sample Sheet Generation
BCL Conversion

Components Installed with this Integration

The following sections describe the various components - files, database properties, reagent kits, and containers - that are installed by default as part of this integration. Information on installed protocols, steps, and automations is provided in the Protocols and automations section, above.

The Illumina HiSeq/GAII Integration is distributed as an RPM package that installs the components listed in Table 7. Note that this RPM package requires installation of the associated NGS Commons Package (see Release Notes).

Table 7: Illumina HiSeq/GAII Integration RPM package components

Component Installed

Location and Description

configure_extensions_illumina_hiseqgaii_workflow- v5.sh

/opt/gls/clarity/config/

Script that imports the integration configuration.

configure_extensions_hiseqgaii_sequencingservice- v5 .sh

/opt/gls/clarity/config/

Script that imports the database properties for the integration.

hcs-instrument-package- <version> ployment-bundle.zip

/opt/gls/clarity/extensions/Illumina_HiSeq/v5/InstrumentIntegrations

Instrument deployment package for HiSeq Control Software (HCS). Contains installation instructions and files to be deployed on the instrument.

scs-instrument-package- <version> .sh -deployment-bundle.zip

/opt/gls/clarity/extensions/Illumina_HiSeq/v5/InstrumentIntegrations

Instrument deployment package for Sequencing Control Software (SCS). Contains installation instructions and files to be deployed on the instrument.

log4j.properties, log4j.xml

/opt/gls/clarity/extensions/Illumina_HiSeq/v5/SequencingService/conf

File containing the settings for the sequencing jar’s logging. Note that as of HiSeq v. 5.4, the .properties file has been replaced by the XML file.

libs directory, with multiple library jars

/opt/gls/clarity/extensions/Illumina_HiSeq/v5/SequencingService

Directory containing libraries used by the sequencing jar. Obsolete as of HiSeq v. 5.4, where these libraries are packaged with the jar file.

hiseqgaii-extensions.jar

/opt/gls/clarity/extensions/Illumina_HiSeq/<version>/EPP

Jar file containing API-based LIMS extensions used throughout the workflow.

hiseqgaii-sequencing.jar

/opt/gls/clarity/extensions/Illumina_HiSeq/v5/SequencingService

Sequencing service jar file that captures the run results.

hiseqgaii_seqservice- v5.sh

/opt/gls/clarity/extensions/Illumina_HiSeq/v5/SequencingService/bin

Provides commands for interacting with the sequencing service. Accessible at /etc/init.d

run_hiseqgaii_seqservice

/opt/gls/clarity/extensions/Illumina_HiSeq/v5/SequencingService/bin

Called by hiseqgaii_seqservice-v5, this script performs some basic system validation, then starts the sequencing service.

Database Properties Installed

Table 8 lists the database properties installed with the Illumina HiSeq/GAIIx Integration Package.

Note that several additional properties, each with the ‘99’ suffix appended to their name, are also installed. These are intended for use by the BaseSpace Clarity LIMS Support team in automated validation tests and are not listed in the table.

Constraints

Sequencing runs are matched using the flow cell ID and the sequencing step’s base name – Illumina Sequencing (Illumina SBS).

Do not change this base name – it is expected by the sequencing service that captures instrument run results. The base name is stored in the sequenceProcessBaseName database property (see Table 8). If this name is changed without the database property being updated, the 'flow cell ID <-> sequencing step base name' matching system will fail.

If required, you may modify the step name by editing or adding to the text after the base name portion as this is not used in the matching system. For example, you could change Illumina Sequencing (Illumina SBS) 5.0 to Illumina Sequencing (Illumina SBS) v5.

Table 8: Database properties installed with the Illumina HiSeq/GAIIx Integration Package

Property

Description

Default Value

hiseqga.seqservice.ignoreUnmatchedContainerIds

A flag indicating if event files that cannot be matched to flow cells in the LIMS should be archived after a certain period of time (true), or continually reprocessed (false).

false

To prevent the gls_events file directory from becoming cluttered, it is recommended that the value of this property is set to ‘true’.

hiseqga.seqservice.

ignoreUnmatchedContainerIdsWaitDays

The number of days between when the event is created and the event file is archived.

14

hiseqga.seqservice.synchronizationPeriod

Invocation period in seconds.

60

hiseqga.seqservice.sequenceProcessBaseName

Sequencing process type base display name. Partial matching is used to look up the process type.

Illumina Sequencing (Illumina SBS)

hiseqga.seqservice.eventFileDirectory.1

A network location monitored for event files.

/mnt/gls_events

hiseqga.seqservice.netPathPrefixSearch.1

The network directory prefix contained in the event file.

\\nas\network\run_data

hiseqga.seqservice.netPathPrefixReplace.1

The mapped network directory mount name on the server used to access the run data directory.

/mnt/run_data

hiseqga.seqservice.eventFileDirectorySuffixes

A list of eventFileDirectory path entries to monitor for event files. The value is one or more comma separated integers.

99

hiseqga.seqservice.runReportViewsVersion

The current version of the Run Report views in the database. The value 0 represents the state before the views are created. This property is automatically updated by the run report.

0

hiseqga.seqservice.netPathPrefixSearchReplaceSuffixes

A list of netPathPrefix search and replace entries for transforming Windows to Linux network paths. The value is one or more comma-separated integers.

99

hiseqga.bcl.netPathPrefixSearch.1

(BCL conversion) The network directory prefix contained in the event file.

\\nas\network\run_data

hiseqga.bcl.netPathPrefixReplace.1

(BCL conversion) The mapped network directory mount name on the server used to access the run data directory.

/mnt/run_data

hiseqga.bcl.replacementSuffixes.bclserver

(BCL conversion) A list of network path replacements associated with the BCL EPP Channel. The value is one or more comma-separated integers. The property format is hiseqga.bcl.replacementSuffixes.<EPP_CHANNEL_NAME>

99

Support for multiple seqservice.netPathPrefixSearch property values

It is possible to configure support for multiple, identical seqservice.netPathPrefixSearch property values. For details, see Configuring Multiple Identical netPathPrefixSearch values.

Reagent Kits Installed

The following reagent kits are included in the default configuration for the Illumina HiSeq/GAIIx Integration:

AMPure XP Beads
TruSeq Cluster Kit
TruSeq Exome Enrichment - Box 1
TruSeq Exome Enrichment - Box 2

Control Types Installed

The following control types are included in the default configuration for the Illumina HiSeq/GAIIx Integration:

Endogenous Positive Control
Exogenous Positive Control
No Amplification Control
No Reverse Transcriptase Control
No Template Control
PhiX v3

Container Types Installed

The following container types are included in the default configuration for the Illumina HiSeq/GAIIx Integration:

Illumina Flow Cell
Illumina Rapid Flow Cell

Instrument Integration

To ensure that your Illumina instrument warranty remains valid, the instrument integration must be performed and maintained by the BaseSpace Clarity LIMS Support team. To perform this integration, the Support team will require direct access to the instrument via WebEx or Remote Desktop while the instrument is idle.

Below is an overview of the steps performed by the LIMS support team when configuring the instrument for use with the Illumina HiSeq/GAIIx Integration v5.5 to BaseSpace Clarity LIMS.

Configuring the Illumina Instrument

1. Create directory on the local computer to hold the batch files. These batch files write event files to the network attached storage (NAS) share(s).
2. Create directory on the NAS to hold the event files.
3. Modify Illumina software configuration files to call the batch files that create the event files.
4. Update sequencing service default database properties to match the specifics of the installation.

Configuring Instruments

We recommend configuring sequencer instrument names in the LIMS to track the specific lab instrument on which a run was completed.

This information should be c onfigured on the Illumina Sequencing(Illumina SBS)5.0 step, and should be set to the same name as is configured in the Illumina sequencing software – i.e., this will be value of the Instrument Run ID field described in the Metadata section above.

When the integration reads the Instrument Run ID field in the run results, it will update the instrument on the step in the LIMS, eliminating the need for the user to manually enter this information.

For instructions on configuring instrument names, refer to the following articles:

LIMS v4.1 and v4.2 Documentation:

Adding Instruments and Instrument Types (System Configuration section)

LIMS v5 Documentation:

Adding and Configuring Instruments (Consumables section)

Configuring Sample Sheet Generation

Sample sheet generation occurs on the step prior to the sequencing run – Cluster Generation (Illumina SBS) 5.0 – which is the step where samples are placed on the container that will be loaded in the instrument.

The out-of-the-box configuration provides three automations (listed below in Table 3). These generate the following:

One sample sheet for use with the HiSeq/GAII(x) instrument sequencing control software (CASAVA v1.8.2 format).
Two options for an analysis sample sheet for use with CASAVA v1.8.2 or bcl2fastq v2.15 downstream analysis – at least one of which should be disabled when the system is initially configured (instructions provided below ). If you do not require a sample sheet for use in downstream analysis, you may prefer to disable both options.

By default, one instrument and one analysis sample sheet are created for the flow cell loaded during the Cluster Generation (Illumina SBS) 5.0 step.

If required, you may configure the step to generate analysis sample sheets for multiple flow cells. For details, refer to the sample sheet generation article for your analysis software:

CASAVA 1.8.2 Sample Sheet Generation
Bcl2fastq2 Sample Sheet Generation (NGS v5.16 and earlier)

The sample sheet generation automations are configured on the Record Details screen of the Cluster Generation (Illumina SBS) 5.0step.

Figure 15: Record Details screen of Cluster Generation step showing out-of-the-box sample sheet generation automations prior to configuration

Table 5 shows the format of the sample sheets generated by each of these automations.

Table 5: Sample sheet generation automations and file formats

Automation

Generated Sample Sheet Format

1. Generate HiSeq Sample Sheet

CASAVA v1.8.2

2. Generate CASAVA Sample Sheet

CASAVA v1.8.2

2. Generate bcl2fastqv2 Sample Sheet

bcl2fastq v2.15

Upgrading Cluster Generation (Illumina SBS) 5.0 Master Step Custom Fields/Step UDFs

If you are upgrading to the Illumina HiSeq/GAIIx Integration Package v5.5, note the following:

To enable generation of the bcl2fastq2 sample sheet, additional fields / UDFs have been added to the Cluster Generation (Illumina SBS) 5.0 step. These are listed in Table 6.

Table 6: Cluster Generation (Illumina SBS) 5.0 - additional master step fields/step UDFs

Name

Type

Field Constraints

Preset Values

Adapter

Single-line Text

Required

Adapter Read 2

Single-line Text

Read 1 Cycles

Numeric

Required, minimum value 0

151, 76, and 51

Read 2 Cycles

Numeric

Required, minimum value 0

151 and 76

Experiment Name

Single-line Text

Required

Workflow

Single-line Text

Required

Use first Preset Value as the default value option selected

GenerateFASTQ

The TruSeq Cluster Generation Kit master step field/step UDF has been renamed Cluster Generation Kit and its preset values have been updated as follows:
TruSeq Cluster Kit v2 (cBot-HS)
TruSeq Cluster Kit v3 (cBot-HS)
HiSeq PE Cluster Kit v4
HiSeq SR Rapid Cluster Kit v2
HiSeq PE Rapid Cluster Kit v2
The master step fields / step UDFs Cluster Generation Kit and Cluster Generation Workflow are now required.

Disable bcl2fastqv2 and Use the CASAVA Sample Sheet for Analysis

LIMS 4.x

1. In the LIMS Web Interface, on the Configuration screen, click Protocols.
2. Edit the Illumina SBS (HiSeq GAIIx) 5.0 protocol.
3. Edit the Cluster Generation (Illumina SBS) 5.0 step.
4. On the Automation tab, set the value for 2. Generate bcl2fastqv2 Sample Sheet to "Not used."

Figure 16: Disabling generation of the bcl2fastqv2 sample sheet

5. Save the step configuration.

LIMS 5.x

1. In the LIMS, click the Configuration menu item and then click the Lab Work tab.
2. On the Lab Work configuration screen, in the Workflows, Protocols, Steps, and Master Steps navigation panel, select (click) the Illumina SBS (HiSeq GAIIx) 5.0 protocol. The Steps and Master Steps lists update, highlighting the steps included in the protocol, and the master steps on which those steps are based.
3. Select the Cluster Generation (Illumina SBS) 5.0step.

Figure 17: Selecting the Cluster Generation (Illumina SBS) 5.0 step

4. Below the navigation panel, the step and master step configuration forms display. Scroll down to the Automation configuration section. For the 2. Generate bcl2fastqv2 Sample Sheet automation, set the Trigger Location to Not Used.

Figure 18: Disabling generation of the bcl2fastqv2 sample sheet

5. Save the step configuration.

After completing the configuration steps above, when the step is run, only two buttons will be visible in Record Details view.

Note that both generate the same CASAVA format sample sheet.

Figure 19: Automations configured for use with HiSeq/GAII and CASAVA downstream analysis

Disable CASAVA and Use the bcl2fastq2 Sample Sheet for Analysis

LIMS 4.x

1. In the LIMS Web Interface, on the Configuration screen, click Protocols .
2. Edit the Illumina SBS (HiSeq GAIIx) 5.0 protocol.
3. Remove the BCL Conversion and Demultiplexing (Illumina SBS) 5.0 step.

Figure 20: Removing BCL Conversion and Demultiplexing from Illumina SBS (HiSeq GAIIx) protocol

4. Save the protocol configuration.
5. Edit the Cluster Generation (Illumina SBS) 5.0 step.
6. On the Automation tab, set the value for 2. Generate CASAVA Sample Sheet to "Not used."

Figure 21: Disabling generation of the CASAVA sample sheet

7. Save the step configuration.

LIMS 5.x

1. In the LIMS, click the Configuration menu item and then click the Lab Work tab.
2. On the Lab Work configuration screen, in the Workflows, Protocols, Steps, and Master Steps navigation panel, select (click) the Illumina SBS (HiSeq GAIIx) 5.0 protocol. The Steps and Master Steps lists update, highlighting the steps included in the protocol, and the master steps on which those steps are based.
3. Select the BCL Conversion and Demultiplexing (Illumina SBS) 5.0 step and click Delete.

Figure 22: Removing BCL Conversion and Demultiplexing from Illumina SBS (HiSeq GAIIx) 5.0 protocol

4. Save the protocol configuration.
5. In the Steps list, select the Cluster Generation (Illumina SBS) 5.0step.

Figure 23: Selecting the Cluster Generation (Illumina SBS) 5.0 Step

6. Below the navigation panel, the step and master step configuration forms display. Scroll down to the Automation configuration section. For the 2. Generate CASAVA Sample Sheet automation, set the Trigger Location to Not Used.

Figure 24: Disabling Generation of the CASAVA Sample Sheet

7. Save the step configuration.

After completing the configuration steps above, when the step is run, only two buttons will be visible in Record Details view.

Figure 25: Automations configured for use with HiSeq/GAIIx and bcl2fastq2 downstream analysis

See the CASAVA 1.8.2 Sample Sheet Generation and Bcl2fastq2 Sample Sheet Generation (NGS v5.16 and earlier) articles for detailed information on:

Sample sheet generation script parameters and usage
Sample sheet data and configuration options
Enabling sample sheet generation for multiple flow cells
Enabling unique FASTQ file names per sequencing run
Sample sheet rules and constraints

Support for container types

All one-dimensional container types with both numeric rows and numeric columns are supported.

Rules and Constraints

This integration operates with the following constraints:

The flow cell ID must be unique. There should not be multiple flow cell containers in the system with the same name.
The flow cell ID must be scanned as the flow cell Container Name on the Cluster Generation (Illumina SBS) 5.0 step.
The sample sheet provided in the BaseCalls directory must be named SampleSheet.csv.
For sample sheet generation constraints, refer to the appropriate article for your analysis software:
CASAVA 1.8.2 Sample Sheet Generation
Bcl2fastq2 Sample Sheet Generation (NGS v5.16 and earlier)

Vendor Resources

Illumina HiSeq 2500 System information
Guided sample sheet creation and setup
Illumina Experiment Manager User Guide
Illumina Experiment Manager Support