Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2014 Jun 9;15(1):449.
doi: 10.1186/1471-2164-15-449.

Performance comparison of four exome capture systems for deep sequencing

Affiliations

Performance comparison of four exome capture systems for deep sequencing

Chandra Sekhar Reddy Chilamakuri et al. BMC Genomics. .

Abstract

Background: Recent developments in deep (next-generation) sequencing technologies are significantly impacting medical research. The global analysis of protein coding regions in genomes of interest by whole exome sequencing is a widely used application. Many technologies for exome capture are commercially available; here we compare the performance of four of them: NimbleGen's SeqCap EZ v3.0, Agilent's SureSelect v4.0, Illumina's TruSeq Exome, and Illumina's Nextera Exome, all applied to the same human tumor DNA sample.

Results: Each capture technology was evaluated for its coverage of different exome databases, target coverage efficiency, GC bias, sensitivity in single nucleotide variant detection, sensitivity in small indel detection, and technical reproducibility. In general, all technologies performed well; however, our data demonstrated small, but consistent differences between the four capture technologies. Illumina technologies cover more bases in coding and untranslated regions. Furthermore, whereas most of the technologies provide reduced coverage in regions with low or high GC content, the Nextera technology tends to bias towards target regions with high GC content.

Conclusions: We show key differences in performance between the four technologies. Our data should help researchers who are planning exome sequencing to select appropriate exome capture technology for their particular application.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Venn diagram showing the overlap between different features. A) Overlap among Agilent, NimbleGen and Illumina capture targets. B) Overlap among RefSeq, CCDS, and ENSEMBL protein coding exon databases. Coverage of exome capture technology for C) CCDS coding exons, D) RefSeq coding exons, E) ENSEMBL coding exons, and F) RefSeq UTRs.
Figure 2
Figure 2
Read statistics. A) Bar plot showing percent of initial reads, mapped reads and reads left after filtering for four different technologies; each bar shows the number of reads in millions. B) Stacked bar plot showing subgroups of filtered reads.
Figure 3
Figure 3
Coverage efficiency comparison by technology. Coverage efficiency defined as the percent of the total targeted bases covered at particular depths. A) Coverage efficiency for intended targeted bases for each technology. B) Coverage efficiency for bases, which are shared, by all four technologies (26.2 MB). Smooth line indicates replicate 1, and dotted line indicates replicate 2.
Figure 4
Figure 4
Coverage efficiency as a function of number of reads. The percent of targeted bases covered at A) ≥10x, B) ≥20x, C) ≥30x, D) ≥40x, E) ≥50x, and F) ≥100x depths.
Figure 5
Figure 5
Density plots showing GC content against normalized mean read depth for A) Agilent, B) NimbleGen, C) TruSeq, and D) Nextera technologies.
Figure 6
Figure 6
SNV detection by technology as a function of increasing read counts on A) intended target region, B) regions common among technologies, C) CCDS exons, D) RefSeq exons, E) Ensembl exons, and F) UTRs. Solid-lines indicate technology specific SNVs, dashed-lines indicate total number of SNVs, and solid pink lines indicate the SNVs common between the four technologies.
Figure 7
Figure 7
Indels detection by technology as a function of increasing read counts on A) intended target region, B) regions common among the technologies, C) CCDS exons, D) RefSeq exons, E) Ensembl exons, and F) UTRs. Solid-lines indicate technology specific SNVs, dashed-lines indicate total number of SNVs, and solid pink lines indicate the SNVs common between four technologies.
Figure 8
Figure 8
Overview of the computational pipeline.

References

    1. Luikart G, England PR, Tallmon D, Jordan S, Taberlet P. The power and promise of population genomics: from genotyping to genome typing. Nat Rev Genet. 2003;4:981–994. doi: 10.1038/nrg1226. - DOI - PubMed
    1. Yu TW, Chahrour MH, Coulter ME, Jiralerspong S, Okamura-Ikeda K, Ataman B, Schmitz-Abe K, Harmin DA, Adli M, Malik AN, D'Gama AM, Lim ET, Sanders SJ, Mochida GH, Partlow JN, Sunu CM, Felie JM, Rodriguez J, Nasir RH, Ware J, Joseph RM, Hill RS, Kwan BY, Al-Saffar M, Mukaddes NM, Hashmi A, Balkhy S, Gascon GG, Hisama FM, LeClair E, et al. Using whole-exome sequencing to identify inherited causes of autism. Neuron. 2013;77:259–273. doi: 10.1016/j.neuron.2012.11.002. - DOI - PMC - PubMed
    1. Schuster B, Knies K, Stoepker C, Velleuer E, Friedl R, Gottwald-Muhlhauser B, de Winter JP, Schindler D. Whole exome sequencing reveals uncommon mutations in the recently identified Fanconi anemia gene SLX4/FANCP. Hum Mutat. 2013;34:93–96. doi: 10.1002/humu.22221. - DOI - PubMed
    1. Kalsoom UE, Klopocki E, Wasif N, Tariq M, Khan S, Hecht J, Krawitz P, Mundlos S, Ahmad W. Whole exome sequencing identified a novel zinc-finger gene ZNF141 associated with autosomal recessive postaxial polydactyly type A. J Med Genet. 2013;50:47–53. doi: 10.1136/jmedgenet-2012-101219. - DOI - PubMed
    1. Izumi R, Niihori T, Aoki Y, Suzuki N, Kato M, Warita H, Takahashi T, Tateyama M, Nagashima T, Funayama R, Abe K, Nakayama K, Aoki M, Matsubara Y. Exome sequencing identifies a novel TTN mutation in a family with hereditary myopathy with early respiratory failure. J Hum Genet. 2013;58:259–266. doi: 10.1038/jhg.2013.9. - DOI - PubMed

Publication types