Nucleic Acids Res. 2019 Jan 8;47(D1):D309-D314.
doi: 10.1093/nar/gky1085.

eggNOG 5.0: a hierarchical, functionally and phylogenetically annotated orthology resource based on 5090 organisms and 2502 viruses

Jaime Huerta-Cepas et al. Nucleic Acids Res. 2019.

Abstract

eggNOG is a public database of orthology relationships, gene evolutionary histories and functional annotations. Here, we present version 5.0, featuring a major update of the underlying genome sets, which have been expanded to 4445 representative bacteria and 168 archaea derived from 25 038 genomes, as well as 477 eukaryotic organisms and 2502 viral proteomes that were selected for diversity and filtered by genome quality. In total, 4.4M orthologous groups (OGs) distributed across 379 taxonomic levels were computed together with their associated sequence alignments, phylogenies, HMM models and functional descriptors. Precomputed evolutionary analysis provides fine-grained resolution of duplication/speciation events within each OG. Our benchmarks show that, despite doubling the number of genomes, the quality of orthology assignments and functional annotations (80% coverage) has remained essentially unchanged across this update. Finally, we improved eggNOG online services for fast functional annotation and orthology prediction of custom genomics or metagenomics datasets. All precomputed data are publicly available for download or via API queries at http://eggnog.embl.de.
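
As a quick consistency check, the organism total in the title can be recovered from the per-domain counts given in the abstract. The sketch below is illustrative only: the names `GENOME_COUNTS`, `VIRAL_PROTEOMES` and `total_organisms` are hypothetical and are not part of eggNOG or its API; only the numbers come from the abstract.

```python
# Counts quoted in the abstract for eggNOG 5.0 (cellular organisms only).
GENOME_COUNTS = {
    "bacteria": 4445,    # representative bacterial genomes
    "archaea": 168,      # representative archaeal genomes
    "eukaryotes": 477,   # eukaryotic organisms
}

# Viral proteomes are reported separately in the title and abstract.
VIRAL_PROTEOMES = 2502


def total_organisms(counts):
    """Sum the per-domain counts into a single organism total."""
    return sum(counts.values())


if __name__ == "__main__":
    print("organisms:", total_organisms(GENOME_COUNTS))
    print("viruses:", VIRAL_PROTEOMES)
```

The sum of the three cellular domains matches the 5090 organisms named in the title, with the 2502 viral proteomes counted separately.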

Figures

Figure 1.
Taxonomic levels for which OGs have been independently computed based on (A) prokaryotic, (B) eukaryotic and (C) viral genomes. Names in blue indicate taxonomic levels that are new with respect to previous eggNOG versions. Numbers indicate the number of OGs per level (red), the number of species covered (black) and the functional annotation coverage (green).
Figure 2.
Visualization of the phylogeny associated with the OG ENOG5048VVQ at the vertebrate level (A), extracted from the eggNOG website. Target orthologs were restricted to primates in the phylogenetic tree to facilitate exploration (B). Duplication nodes (in-paralogies) are labeled in red, and speciation events in blue (C). The functional profile of each orthologous sequence is shown in the presence/absence matrix (D). Functional differences are visible on either side of the duplication event separating EPX from MPO sequences (E), in both GO Slim terms (red squares in matrix D) and KEGG Modules (blue squares in matrix D), although the sequences have similar domain architectures (F).
