Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2011 Dec 18;30(1):78-82.
doi: 10.1038/nbt.2065.

Performance comparison of whole-genome sequencing platforms

Affiliations

Performance comparison of whole-genome sequencing platforms

Hugo Y K Lam et al. Nat Biotechnol. .

Erratum in

  • Nat Biotechnol. 2012 Jun;30(6):562

Abstract

Whole-genome sequencing is becoming commonplace, but the accuracy and completeness of variant calling by the most widely used platforms from Illumina and Complete Genomics have not been reported. Here we sequenced the genome of an individual with both technologies to a high average coverage of ∼76×, and compared their performance with respect to sequence coverage and calling of single-nucleotide variants (SNVs), insertions and deletions (indels). Although 88.1% of the ∼3.7 million unique SNVs were concordant between platforms, there were tens of thousands of platform-specific calls located in genes and other genomic regions. In contrast, 26.5% of indels were concordant between platforms. Target enrichment validated 92.7% of the concordant SNVs, whereas validation by genotyping array revealed a sensitivity of 99.3%. The validation experiments also suggested that >60% of the platform-specific variants were indeed present in the genome. Our results have important implications for understanding the accuracy and completeness of the genome sequencing platforms.

PubMed Disclaimer

Figures

Figure 1
Figure 1. Genome coverage at different read depths
(a) Percentage of genome covered by different read depths in different platforms. (b) Histogram of genome coverage at different read depths.
Figure 2
Figure 2. SNV detection and intersection
(a) SNVs detected from the PBMC and saliva samples in each platform were combined. The unions of SNVs in each platform were then intersected. Sensitivity was measured against the Illumina Omni array. Ti/Tv is the transition-to-transversion ratio. The known and novel counts were based on dbSNP. ‘Sanger’ and ‘validated’ represent validation by Sanger sequencing and Illumina sequencing (with Agilent target enrichment capture), respectively. (b) Comparing platform-specific SNVs to non-SNV calls in another platform. IL, Illumina; CG, Complete Genomics.
Figure 3
Figure 3. SNV association with different genomic elements
(a) Gene elements: UTR, exonic, intronic and intergenic regions. Inset: number of SNVs associated with UTR5, UTR3 and exonic regions. (b) Gene elements: splicing sites, noncoding RNA and upstream/downstream (<1 kb) regions of genes. (c) Repetitive elements: centromere, telomere, tRNA and rRNA. (d) Repetitive elements: L1, Alu, simple repeat and low-complexity repeat. (e) SNV frequency at different chromosomal locations. Tracks from outer to inner: SNV frequency for Illumina (IL), Complete Genomics (CG), concordant, IL-specific and CG-specific calls. Outermost: chromosome ideogram.
Figure 4
Figure 4. Indel detection and intersection
(a) Indels detected from the PBMC and saliva samples in each platform were combined. The unions of indels in each platform were then intersected. Note: 5,668 IL and 8,415 CG indels were removed after 5b-window merging. (b) Indel size distribution. Negative size represents deletion and positive size represents insertion.

References

    1. Ajay SS, Parker SC, Ozel Abaan H, Fuentes Fajardo KV, Margulies EH. Accurate and comprehensive sequencing of personal genomes. Genome Research. 2011;21:1498–1505. - PMC - PubMed
    1. Ashley EA, et al. Clinical assessment incorporating a personal genome. Lancet. 2010;375:1525–1535. - PMC - PubMed
    1. Wheeler DA, et al. The complete genome of an individual by massively parallel DNA sequencing. Nature. 2008;452:872–876. - PubMed
    1. McKernan KJ, et al. Sequence and structural variation in a human genome uncovered by short-read, massively parallel ligation sequencing using two-base encoding. Genome Res. 2009;19:1527–1541. - PMC - PubMed
    1. Roach JC, et al. Analysis of genetic inheritance in a family quartet by wholegenome sequencing. Science. 2010;328:636–639. - PMC - PubMed

Publication types