Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010 Aug;28(8):398-406.
doi: 10.1016/j.tibtech.2010.05.006.

From complete genome sequence to 'complete' understanding?

Affiliations

From complete genome sequence to 'complete' understanding?

Michael Y Galperin et al. Trends Biotechnol. 2010 Aug.

Abstract

The rapidly accumulating genome sequence data allow researchers to address fundamental biological questions that were not even asked just a few years ago. A major problem in genomics is the widening gap between the rapid progress in genome sequencing and the comparatively slow progress in the functional characterization of sequenced genomes. Here we discuss two key questions of genome biology: whether we need more genomes, and how deep is our understanding of biology based on genomic analysis. We argue that overly specific annotations of gene functions are often less useful than the more generic, but also more robust, functional assignments based on protein family classification. We also discuss problems in understanding the functions of the remaining 'conserved hypothetical' genes.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Accumulation of protein sequences of unknown function in the genome databases. Open symbols indicate the total number of protein sequences encoded in prokaryotic (blue) and eukaryotic (red) genomes; filled symbols indicate the number of “hypothetical” or “uncharacterized” proteins. The data are taken from the NCBI’s RefSeq database [68]; the numbers for 2010 are extrapolated from the first 4 months.

Similar articles

Cited by

References

    1. Fleischmann RD, et al. Whole-genome random sequencing and assembly of Haemophilus influenzae Rd. Science. 1995;269:496–512. - PubMed
    1. Liolios K, et al. The Genomes On Line Database (GOLD) in 2009: status of genomic and metagenomic projects and their associated metadata. Nucleic Acids Res. 2010;38:D346–D354. - PMC - PubMed
    1. Ley RE, et al. Worlds within worlds: evolution of the vertebrate gut microbiota. Nat. Rev. Microbiol. 2008;6:776–788. - PMC - PubMed
    1. Wu D, et al. A phylogeny-driven genomic encyclopaedia of Bacteria and Archaea. Nature. 2009;462:1056–1060. - PMC - PubMed
    1. Whitworth DE. Genomes and knowledge - a questionable relationship? Trends Microbiol. 2008;16:512–519. - PubMed

Publication types