Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2012 Dec 20;434(2):175-80.
doi: 10.1016/j.virol.2012.09.027. Epub 2012 Oct 18.

Microbial virus genome annotation-mustering the troops to fight the sequence onslaught

Affiliations
Review

Microbial virus genome annotation-mustering the troops to fight the sequence onslaught

J Rodney Brister et al. Virology. .

Abstract

The revolution in virus genome sequencing promises to effectively map the extant biological universe and reveal fundamental relationships between viral biology, genome structure, and evolution. Indeed, microbial virus genomes include large numbers of conserved coding sequences of unknown function as well as unique gene combinations, implying that that these viruses will be a significant source of novel protein biochemistry and genome architecture. Yet, making sense of the approaching phalanx of A's, G's, T's, and C's stretching across the genome sequencing horizon will require innovation and an unprecedented coordination of annotation efforts among stakeholders.

PubMed Disclaimer

Conflict of interest statement

Competing interests

The authors declare that they have no competing interests.

Figures

Figure 1
Figure 1. Growth of virus RefSeq records
A. Cumulative number of virus nucleotide sequence records deposited in the RefSeq database from 1999 to 20121,2. B. Cumulative number of protein records deposited in the RefSeq database from 1999 to 20122. 1Individual viral segments are included in tabulations, not complete constellations. 2Number of records calculated on September 11, 2012.
Figure 2
Figure 2. Distribution of microbial virus RefSeq records
A. Current number1 of microbial virus nucleotide sequence records deposited in the RefSeq database broken down by host – algae, archaea, bacteria, diatom, fungi, protozoa. B. Current number1 of microbial virus protein sequence records deposited in the RefSeq database broken down by host – algae, archaea, bacteria, diatom, fungi, protozoa. 1Number of records calculated on September 11, 2012.
Figure 3
Figure 3. “Complete genome” sequences of virus RefSeq records
A. Cumulative number of bacteriophage genome sequence records indexed by GenBank as “complete genomes” by year, 1999 to 20121. Note some of these sequences have yet to be validated by RefSeq as full-length genomes. B. Cumulative number of bacteriophage protein sequence records derived from genome sequences in (A). 1Number of records calculated on September 11, 2012.

References

    1. Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, Harris MA, Hill DP, Issel-Tarver L, Kasarskis A, Lewis S, Matese JC, Richardson JE, Ringwald M, Rubin GM, Sherlock G. Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet. 2000;25(1):25–9. - PMC - PubMed
    1. Aziz RK, Bartels D, Best AA, DeJongh M, Disz T, Edwards RA, Formsma K, Gerdes S, Glass EM, Kubal M, Meyer F, Olsen GJ, Olson R, Osterman AL, Overbeek RA, McNeil LK, Paarmann D, Paczian T, Parrello B, Pusch GD, Reich C, Stevens R, Vassieva O, Vonstein V, Wilke A, Zagnitko O. The RAST Server: rapid annotations using subsystems technology. BMC Genomics. 2008;9:75. - PMC - PubMed
    1. Berardini TZ, Li D, Muller R, Chetty R, Ploetz L, Singh S, Wensel A, Huala E. Assessment of community-submitted ontology annotations from a novel database-journal partnership. Database (Oxford) 2012;(0):bas030. - PMC - PubMed
    1. Besemer J, Borodovsky M. GeneMark: web software for gene finding in prokaryotes, eukaryotes and viruses. Nucleic Acids Res. 2005;33(Web Server issue):W451–4. - PMC - PubMed
    1. Boutet E, Lieberherr D, Tognolli M, Schneider M, Bairoch A. UniProtKB/Swiss-Prot. Methods Mol Biol. 2007;406:89–112. - PubMed

Publication types