Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation

Beyond 100 genomes

Paul Janssen et al. Genome Biol. 2003.

Abstract

By the end of 2002, we witnessed the landmark submission of the 100th complete genome sequence in the databases. An overview of these genomes reveals certain interesting trends and provides valuable insights into possible future developments.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Cumulative number of protein sequence entries (y-axis) in completed genomes (CoGenT, in blue) and Swiss-Prot (in red) as a function of time (x-axis).
Figure 2
Figure 2
Phylogenetic distribution of genome sequencing projects. Archaea and Bacteria are shown to the phylum level and Eukarya to their first taxonomic branching, with the exception of Metazoa and Fungi. The numbers in parentheses represent the number of completed, published (red) and ongoing (blue) genome projects. The tree is based on the taxonomy database from the National Center for Biotechnology Information (NCBI). Information about ongoing genome projects has been obtained from the Genomes OnLine Database (GOLD) [14], as of 22 January 2003.
Figure 3
Figure 3
Representation of completed genome sequences over time (x-axis) and size (y-axis, in Mb, logarithmic scale) labeled according to their social impact. Genomes from Archaea (squares), Bacteria (circles) and Eukarya (triangles) are colored according to their academic (blue), medical (pink), agricultural (light green), ecological (dark green) and industrial (black) relevance.

References

    1. Fleischmann RD, Adams MD, White O, Clayton RA, Kirkness EF, Kerlavage AR, Bult CJ, Tomb JF, Dougherty BA, Merrick JM, et al. Whole-genome random sequencing and assembly of Haemophilus influenzae Rd. Science. 1995;269:496–512. - PubMed
    1. Nelson KE, Paulsen IT, Heidelberg JF, Fraser CM. Status of genome projects for nonpathogenic bacteria and archaea. Nat Biotechnol. 2000;18:1049–1054. doi: 10.1038/80235. - DOI - PubMed
    1. Akman L, Yamashita A, Wataname HOK, Shiba T, Hattori M, Aksoy S. Genome sequence of the endocellular obligate symbiont of tsetse flies, Wigglesworthia glossinidia. Nat Genet. 2002;32:402–407. doi: 10.1038/ng986. - DOI - PubMed
    1. Complete Genome Tracking Database. http://maine.ebi.ac.uk:8000/services/cogent
    1. Bairoch A, Apweiler R. The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000. Nucleic Acids Res. 2000;28:45–8. - PMC - PubMed

LinkOut - more resources