Cell-Hashing

To enable cell-hashing sample demultiplexing, specify the following command line options.

•

--single-cell-cell-hashing-reference—Specify a CSV or FASTA cell-hashing reference file that contains sample-specific oligo-tags.

•

--single-cell-demux-detect-doublets—Enable doublet detection in cell-hashing sample demultiplexing. The default value is false.

•

--single-cell-demux-sample-fastq—Output sample-specific FASTQ files. See Sample-Specific FASTQ Output Files for more information.

Outputs

The <prefix>.scRNA.barcodeSummary.tsv file contains per-cell metrics, including cell barcodes. The following column in the <prefix>.scRNA.barcodeSummary.tsv contains cell-hashing per-cell information. For more information on the <prefix>.scRNA.barcodeSummary.tsv file, see Outputs.

Column

Description

SampleIdentity

The SampleIdentity column can contain the following values:

•

sampleX—The particular cell (barcode) is uniquely assigned to a sample.

•

AMB(sampleX,sampleY)—The algorithm cannot determine the sample to assign the barcode to.

•

MIX(mixing_coef*sampleX+(100-mixing_coef)*sampleY)—The cell barcode is classified as doublet. For example, MIX(50*sampleX+50*sampleY).

The <prefix>.scRNA.demux.tsv file contains sample demultiplexing statistics that were used to infer sample identity of each cell.

Column	Description
Barcode	The cell barcode associated with the cell.
Pure samples	Cell-hashing read count for each sample.

Sample-Specific FASTQ Output Files

If you have enabled either of the sample demultiplexing algorithms, you can output sample-specific FASTQ files after the sample identities for each cell is available. Use the following command line.

--single-cell-demux-sample-fastq

If gzip is specified, then the sample-specific output FASTQ files are compressed in gzip format. If fastq is specified, then the sample-specific FASTQ files are not compressed. The default option is none, which indicates that no sample-specific FASTQ files are output.