Database (Oxford). 2009:2009:bap014.
doi: 10.1093/database/bap014. Epub 2009 Oct 12.

ORION-VIRCAT: a tool for mapping ICTV and NCBI taxonomies

Willy Valdivia-Granda et al. Database (Oxford). 2009.

Abstract

Viruses, viroids and prions are the smallest infectious biological entities that depend on their host for replication. The number of pathogenic viruses is large and their impact on global human health is well documented. Currently, the International Committee on the Taxonomy of Viruses (ICTV) has classified approximately 4379 virus species, while the National Center for Biotechnology Information Viral Genomes Resource (NCBI-VGR) database has mapped 617 705 proteins to eight large taxonomic groups. Despite these efforts, an automated approach for mapping the ICTV master list and its officially accepted virus naming to the NCBI-VGR's taxonomical classification is not available. Due to metagenomic sequencing, it is likely that the discovery and naming of new viral species will increase at least tenfold. Unfortunately, existing viral databases are not adequately prepared to scale, maintain and automatically annotate ultra-high-throughput sequences and place this information into specific taxonomic categories. ORION-VIRCAT is a scalable and interoperable object-relational database designed to serve as a resource for the integration and verification of taxonomical classifications generated by the ICTV and NCBI-VGR. The current release (v1.0) of ORION-VIRCAT is implemented in PostgreSQL and has been extended to ORACLE, MySQL and SyBase. ORION-VIRCAT automatically mapped and joined 617 705 entries from the NCBI-VGR to the viral naming of the ICTV. This detailed analysis revealed that 399 095 entries from the NCBI-VGR can be mapped to the ICTV classification and that one order, 10 families, 35 genera and 503 species listed in the ICTV disagree with the NCBI-VGR classification schema. Nevertheless, we were able to correct several discrepancies, mapping 234 000 additional entries.

Database URL: http://www.orionbiosciences.com/research/orion-vircat.html
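The kind of name-based mapping described in the abstract can be illustrated with a small relational sketch. This is not the ORION-VIRCAT schema or its matching procedure, neither of which is given here: the table names, column names, protein identifiers and the `normalize` rule are all hypothetical, and SQLite stands in for PostgreSQL so that the example is self-contained. It shows only the general idea of joining NCBI-style entries to an ICTV-style species list and separating mapped from unmapped entries.

```python
import sqlite3


def normalize(name: str) -> str:
    """Lowercase and collapse whitespace (an assumed, deliberately simple rule)."""
    return " ".join(name.lower().split())


def build_db() -> sqlite3.Connection:
    """Create an in-memory database with toy ICTV and NCBI tables (hypothetical schema)."""
    conn = sqlite3.connect(":memory:")
    # Expose the Python normalizer to SQL so it can be used in the join condition.
    conn.create_function("normalize", 1, normalize)
    conn.executescript(
        """
        CREATE TABLE ictv_species (species TEXT PRIMARY KEY, genus TEXT, family TEXT);
        CREATE TABLE ncbi_entries (protein_id TEXT PRIMARY KEY, organism TEXT);
        """
    )
    conn.executemany(
        "INSERT INTO ictv_species VALUES (?, ?, ?)",
        [
            ("Tobacco mosaic virus", "Tobamovirus", "Virgaviridae"),
            ("Hepatitis B virus", "Orthohepadnavirus", "Hepadnaviridae"),
        ],
    )
    # Toy rows: the protein identifiers are made up for illustration.
    conn.executemany(
        "INSERT INTO ncbi_entries VALUES (?, ?)",
        [
            ("P0001", "Tobacco mosaic virus"),
            ("P0002", "hepatitis  B  virus"),  # spelling variant of an ICTV name
            ("P0003", "Unnamed isolate X"),    # no counterpart in the ICTV table
        ],
    )
    return conn


def map_entries(conn: sqlite3.Connection):
    """Left-join NCBI entries to ICTV species and split them into mapped and unmapped."""
    rows = conn.execute(
        """
        SELECT n.protein_id, i.species
        FROM ncbi_entries AS n
        LEFT JOIN ictv_species AS i
          ON normalize(n.organism) = normalize(i.species)
        ORDER BY n.protein_id
        """
    ).fetchall()
    mapped = [(pid, species) for pid, species in rows if species is not None]
    unmapped = [pid for pid, species in rows if species is None]
    return mapped, unmapped


if __name__ == "__main__":
    mapped, unmapped = map_entries(build_db())
    print("mapped:", mapped)
    print("unmapped:", unmapped)
```

The left join keeps every NCBI-style entry in the result, so entries without an ICTV counterpart remain visible for later correction instead of being silently dropped.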

Figures

Figure 1. Integration process of ORION-VIRCAT. We mapped the different ICTV (blue) and NCBI-VGR (green) taxonomic classifications.
Figure 2. Summary of the implementation of the genomic catalog.
