Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015 Jan;43(Database issue):D571-7.
doi: 10.1093/nar/gku1207. Epub 2014 Nov 26.

NCBI viral genomes resource

Affiliations

NCBI viral genomes resource

J Rodney Brister et al. Nucleic Acids Res. 2015 Jan.

Abstract

Recent technological innovations have ignited an explosion in virus genome sequencing that promises to fundamentally alter our understanding of viral biology and profoundly impact public health policy. Yet, any potential benefits from the billowing cloud of next generation sequence data hinge upon well implemented reference resources that facilitate the identification of sequences, aid in the assembly of sequence reads and provide reference annotation sources. The NCBI Viral Genomes Resource is a reference resource designed to bring order to this sequence shockwave and improve usability of viral sequence data. The resource can be accessed at http://www.ncbi.nlm.nih.gov/genome/viruses/ and catalogs all publicly available virus genome sequences and curates reference genome sequences. As the number of genome sequences has grown, so too have the difficulties in annotating and maintaining reference sequences. The rapid expansion of the viral sequence universe has forced a recalibration of the data model to better provide extant sequence representation and enhanced reference sequence products to serve the needs of the various viral communities. This, in turn, has placed increased emphasis on leveraging the knowledge of individual scientific communities to identify important viral sequences and develop well annotated reference virus genome sets.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Number of validated virus and viroid segments. The numbers of validated virus and viroid segments available in INSDC databases are depicted by the black line, and the numbers of RefSeq virus and viroid segments by the gray columns. Data was calculated at the end of each year from 2000 to 2014, except for 2014 when data was calculated on September 15. INSDC influenza virus segments are not included.
Figure 2.
Figure 2.
Number of ICTV species and viral RefSeq genomes. The numbers of viral species in the NCBI Taxonomy database that are represented by a RefSeq genome record are depicted by the gray bars, and the number of viral species recognized by the ICTV by the black area. Data was calculated at the end of each year from 2000 to 2014, except for 2014 when data was calculated on September 15. The number of ICTV recognized viral species for each year shown was derived from http://www.ictvonline.org/taxonomyReleases.asp.
Figure 3.
Figure 3.
Distribution of host types among viral RefSeq genomes. The distribution of viral host types assigned to RefSeq genomes is depicted for algae, archaea, bacteria, environment, fungi, human, invertebrates, plants, protozoa and vertebrates host groups. Diatom is not shown.

References

    1. Baize S., Pannetier D., Oestereich L., Rieger T., Koivogui L., Magassouba N., Soropogui B., Sow M.S., Keita S., De Clerck H., et al. Emergence of Zaire Ebola virus disease in Guinea—preliminary report. N. Engl. J. Med. 2014;371:1418–1425. - PubMed
    1. Gire S.K., Goba A., Andersen K.G., Sealfon R.S., Park D.J., Kanneh L., Jalloh S., Momoh M., Fullah M., Dudas G., et al. Genomic surveillance elucidates Ebola virus origin and transmission during the 2014 outbreak. Science. 2014;345:1369–1372. - PMC - PubMed
    1. Haagmans B.L., Al Dhahiry S.H., Reusken C.B., Raj V.S., Galiano M., Myers R., Godeke G.J., Jonges M., Farag E., Diab A., et al. Middle East respiratory syndrome coronavirus in dromedary camels: an outbreak investigation. Lancet Infect. Dis. 2014;14:140–145. - PMC - PubMed
    1. Cotten M., Watson S.J., Kellam P., Al-Rabeeah A.A., Makhdoom H.Q., Assiri A., Al-Tawfiq J.A., Alhakeem R.F., Madani H., AlRabiah F.A., et al. Transmission and evolution of the Middle East respiratory syndrome coronavirus in Saudi Arabia: a descriptive genomic study. Lancet. 2013;382:1993–2002. - PMC - PubMed
    1. Karsch-Mizrachi I., Nakamura Y., Cochrane G., International Nucleotide Sequence Database Collaboration. The International Nucleotide Sequence Database Collaboration. Nucleic Acids Res. 2012;40:D33–D37. - PMC - PubMed

Publication types