Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2015 Aug;36(8):815-22.
doi: 10.1002/humu.22813. Epub 2015 Jun 11.

Comparison of Exome and Genome Sequencing Technologies for the Complete Capture of Protein-Coding Regions

Affiliations
Comparative Study

Comparison of Exome and Genome Sequencing Technologies for the Complete Capture of Protein-Coding Regions

Stefan H Lelieveld et al. Hum Mutat. 2015 Aug.

Abstract

For next-generation sequencing technologies, sufficient base-pair coverage is the foremost requirement for the reliable detection of genomic variants. We investigated whether whole-genome sequencing (WGS) platforms offer improved coverage of coding regions compared with whole-exome sequencing (WES) platforms, and compared single-base coverage for a large set of exome and genome samples. We find that WES platforms have improved considerably in the last years, but at comparable sequencing depth, WGS outperforms WES in terms of covered coding regions. At higher sequencing depth (95x-160x), WES successfully captures 95% of the coding regions with a minimal coverage of 20x, compared with 98% for WGS at 87-fold coverage. Three different assessments of sequence coverage bias showed consistent biases for WES but not for WGS. We found no clear differences for the technologies concerning their ability to achieve complete coverage of 2,759 clinically relevant genes. We show that WES performs comparable to WGS in terms of covered bases if sequenced at two to three times higher coverage. This does, however, go at the cost of substantially more sequencing biases in WES approaches. Our findings will guide laboratories to make an informed decision on which sequencing platform and coverage to choose.

Keywords: coverage; exome sequencing; genome sequencing.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Coverage of the Ensembl and RefSeq annotated protein‐coding regions and (full) coverage of 2,759 clinically relevant OMIM+ genes. A: The percentage of base pairs of Ensembl (in yellow) and RefSeq (in blue) annotated protein‐coding regions covered by at least 20 reads for the tested platforms. B: Percentage of base pairs covered by at least 20 reads for the longest OMIM+ transcripts (in green). The red bars depict the percentage of the (longest) OMIM+ transcript base pairs that are fully covered by at least 20 reads.
Figure 2
Figure 2
Overview of 56 genes and the percentage of coding bases not covered at 20x. The boxplots depicting the percentage of bases not covered by at least 20x reads. For each of the 56 ACMG‐recommended genes, the coverage of the longest RefSeq transcript was analyzed. A: Shows the performance of all tested exome capture libraries. B: Shows the performance of the tested WGS platforms.
Figure 3
Figure 3
Assessments of three different sequence coverage biases. A: Evenness scores (a measure for uniform read mapping) for the different platforms based on the RefSeq annotated protein‐coding regions. B: The difference in average coverage at 20x of WGS libraries for RefSeq transcripts grouped by strand. The symbols + and – indicate average coverage level at the plus and minus strand. C: Idem for WES libraries. D: Density plot of allele ratio distribution for heterozygous SNPs. The green line depicts the ideal heterozygous allele ratio of 0.5.

References

    1. Ajay SS, Parker SCJ, Abaan HO, Fajardo KVF, Margulies EH. 2011. Accurate and comprehensive sequencing of personal genomes. Genome Res 21:1498–1505. - PMC - PubMed
    1. Bainbridge MN, Wang M, Wu Y, Newsham I, Muzny DM, Jefferies JL, Albert TJ, Burgess DL, Gibbs RA. 2011. Targeted enrichment beyond the consensus coding DNA sequence exome reveals exons with higher variant densities. Genome Biol 12:R68. - PMC - PubMed
    1. Benjamini Y, Speed TP. 2012. Summarizing and correcting the GC content bias in high‐throughput sequencing. Nucleic Acids Res 40:e72. - PMC - PubMed
    1. Clark MJ, Chen R, Lam HYK, Karczewski KJ, Chen R, Euskirchen G, Butte AJ, Snyder M. 2011. Performance comparison of exome DNA sequencing technologies. Nat Biotechnol 29:908–914. - PMC - PubMed
    1. Dewey FE, Grove ME, Pan C, Goldstein BA, Bernstein JA, Chaib H, Merker JD, Goldfeder RL, Enns GM, David SP, Pakdaman N, Ormond KE, et al. 2014. Clinical interpretation and implications of whole‐genome sequencing. JAMA 311:1035–1045. - PMC - PubMed

Publication types

LinkOut - more resources