Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2002 Mar 19;99(6):3712-6.
doi: 10.1073/pnas.042692499. Epub 2002 Mar 5.

On the sequencing of the human genome

Affiliations

On the sequencing of the human genome

Robert H Waterston et al. Proc Natl Acad Sci U S A. .

Abstract

Two recent papers using different approaches reported draft sequences of the human genome. The international Human Genome Project (HGP) used the hierarchical shotgun approach, whereas Celera Genomics adopted the whole-genome shotgun (WGS) approach. Here, we analyze whether the latter paper provides a meaningful test of the WGS approach on a mammalian genome. In the Celera paper, the authors did not analyze their own WGS data. Instead, they decomposed the HGP's assembled sequence into a "perfect tiling path", combined it with their WGS data, and assembled the merged data set. To study the implications of this approach, we perform computational analysis and find that a perfect tiling path with 2-fold coverage is sufficient to recover virtually the entirety of a genome assembly. We also examine the manner in which the assembly was anchored to the human genome and conclude that the process primarily depended on the HGP's sequence-tagged site maps, BAC maps, and clone-based sequences. Our analysis indicates that the Celera paper provides neither a meaningful test of the WGS approach nor an independent sequence of the human genome. Our analysis does not imply that a WGS approach could not be successfully applied to assemble a draft sequence of a large mammalian genome, but merely that the Celera paper does not provide such evidence.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Sequencing strategies. (Left) The hierarchical shotgun (HS) strategy involves decomposing the genome into a tiling path of overlapping BAC clones, performing shotgun sequencing on and reassembling each BAC, and then merging the sequences of adjacent clones. The method has the advantage that all sequence contigs and scaffolds derived from a BAC belong to a single compartment with respect to anchoring to the genome. (Right) Whole-genome shotgun (WGS) strategy involves performing shotgun sequencing on the entire genome and attempting to reassemble the entire collection. With the WGS method, each contig and scaffold is an independent component that must be anchored to the genome. In general, many scaffolds may not be anchored without directed efforts. (Contigs are contiguous blocks of sequence; scaffolds are sets of contigs joined by paired reads from both ends of a plasmid insert.)
Figure 2
Figure 2
Random vs. perfect spacing in 100 kb. (Upper) Two-fold coverage in perfectly spaced reads. (Lower) Two-fold coverage in randomly selected reads. There are frequent regions in which the adjacent reads either fail to overlap or the overlap is too small to allow reliable detection (< 40 bp). These breaks in continuity are indicated by vertical lines.

Comment in

  • Whole-genome disassembly.
    Green P. Green P. Proc Natl Acad Sci U S A. 2002 Apr 2;99(7):4143-4. doi: 10.1073/pnas.082095999. Epub 2002 Mar 19. Proc Natl Acad Sci U S A. 2002. PMID: 11904394 Free PMC article. No abstract available.
  • On the sequencing and assembly of the human genome.
    Myers EW, Sutton GG, Smith HO, Adams MD, Venter JC. Myers EW, et al. Proc Natl Acad Sci U S A. 2002 Apr 2;99(7):4145-6. doi: 10.1073/pnas.092136699. Epub 2002 Mar 19. Proc Natl Acad Sci U S A. 2002. PMID: 11904395 Free PMC article. No abstract available.

References

    1. International Human Genome Sequencing Consortium. Nature (London) 2001;409:860–921. - PubMed
    1. Venter J C, Adams M D, Myers E W, Li P W, Mural R J, Sutton G G, Smith H O, Yandell M, Evans C A, Holt R A, et al. Science. 2001;291:1304–1351. - PubMed
    1. Sanger F, Coulson A R, Hong G F, Hill D F, Petersen G B. J Mol Biol. 1982;162:729–773. - PubMed
    1. Edwards A, Voss H, Rice P, Civitello A, Stegemann J, Schwager C, Zimmermann J, Erfle H, Caskey C T, Ansorge W. Genomics. 1990;6:593–608. - PubMed
    1. Adams M D, Celniker S E, Holt R A, Evans C A, Gocayne J D, Amanatides P G, Scherer S E, Li P W, Hoskins R A, Galle R F, et al. Science. 2000;287:2185–2195. - PubMed