Technology Spotlight: Illumina
®
Sequencing
Introduction
Illumina sequencing technology leverages clonal array formation and
proprietary reversible terminator technology for rapid and accurate
large-scale sequencing. The innovative and exible sequencing system
enables a broad array of applications in genomics, transcriptomics,
and epigenomics.
Cluster Generation
Sequencing templates are immobilized on a proprietary ow cell
surface (Figure 1) designed to present the DNA in a manner that
facilitates access to enzymes while ensuring high stability of surface-
bound template and low non-specic binding of uorescently labeled
nucleotides. Solid-phase amplication (Figures 2–7) creates up to
1,000 identical copies of each single template molecule in close prox-
imity (diameter of one micron or less). Because this process does not
involve photolithography, mechanical spotting, or positioning of beads
into wells, densities on the order of ten million single-molecule clusters
per square centimeter are achieved.
Sequencing by Synthesis
Sequencing by synthesis (SBS) technology uses four uorescently-
labeled nucleotides to sequence the tens of millions of clusters on
the ow cell surface in parallel (Figure 8–12). During each sequencing
cycle, a single labeled deoxynucleoside triphosphate (dNTP) is added
to the nucleic acid chain. The nucleotide label serves as a terminator
for polymerization, so after each dNTP incorporation, the uorescent
dye is imaged to identify the base and then enzymatically cleaved to
allow incorporation of the next nucleotide. Since all four reversible
terminator-bound dNTPs (A, C, T, G) are present as single, separate
molecules, natural competition minimizes incorporation bias. Base
calls are made directly from signal intensity measurements during each
cycle, which greatly reduces raw error rates compared to other tech-
nologies. The end result is highly accurate base-by-base sequencing
that eliminates sequence-context specic errors, enabling robust base
calling across the genome, including repetitive sequence regions and
within homopolymers.
Analysis Pipeline
The Illumina sequencing approach is built around a massive quantity
of sequence reads in parallel. Deep sampling and uniform cover-
age is used to generate a consensus and ensure high condence in
determination of genetic differences. Deep sampling allows the use of
weighted majority voting and statistical analysis, similar to conventional
methods, to identify homozygotes and heterozygotes and to distin-
guish sequencing errors. Each raw read base has an assigned quality
score so that the software can apply a weighting factor in calling differ-
ences and generating condence scores.
Data Collection, Processing, and Analysis
Illumina data collection software enables users to align sequences
to a reference in resequencing applications (Figure 13). Developed
in collaboration with leading researchers, this software suite includes
the full range of data collection, processing, and analysis modules to
streamline collection and analysis of data with minimal user interven-
tion. The open format of the software allows easy access to data at
various stages of processing and analysis using simple application
program interfaces.
Data Accuracy and Workow Simplicity
The TruSeq family of reagents represents the latest advancement of
Illumina’s sequencing by synthesis (SBS) technology. From sample
prep through DNA sequencing, TruSeq reagent chemistry enables
Illumina sequencing to provide the most accurate data across a broad
range of applications. With highest yield of error-free reads and most
base calls above Q30, researchers can have the highest condence in
their data integrity to draw sound biological conclusions.
A highly automated, streamlined workow requires minimal instrument
hands-on time. With the ability to generate several gigabases of DNA
sequence per run, even large mammalian genomes can be sequenced
in weeks rather than years. The capacity to accommodate many
samples per ow cell means that runs can be tailored to the demands
of diverse applications.
Illumina Sequencing Technology
Highest data accuracy, simple workow, and a broad range of applications.
Figure 1: Illumina Flow Cell
Several samples can be loaded onto the eight-lane ow cell for simultane-
ous analysis on an Illumina Sequencing System.