Networked Streaming

AWS S3, Azure Bob Storage, and HTTP Input Streaming

DRAGEN can stream input files directly from an AWS S3 bucket, an Azure blob, or using HTTP presigned URLs. You do not need to download the input files to a local disk prior to being processed. The files are streamed over the network directly into the DRAGEN processor.

Input streaming is most beneficial for large input files. DRAGEN supports input streaming for BAMs and compressed FASTQ files. For FASTQ files, input streaming can be used in all the configurations that use single-end FASTQs, paired-end FASTQs, and FASTQ lists.

Input streaming is supported for the following use cases.

Mapping/aligning of FASTQ and BAM.
Germline and somatic small variant calling from BAM (without remapping).

For other file types that are significantly smaller in size, download them locally before running the analysis.

Examples

The following examples display possible methods to stream input files directly with DRAGEN.

AWS S3, Azure Blob Storage, Output Streaming

DRAGEN can stream its output to an AWS S3 Bucket or an Azure Storage Account Container. Output streaming is beneficial for large output files and sharing results.

Security and permissions

To stream input files or write to a cloud providers storage, you must have permission to access the remote files.