Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Aug 17:6:e289.
doi: 10.7717/peerj-cs.289. eCollection 2020.

Database limitations for studying the human gut microbiome

Affiliations

Database limitations for studying the human gut microbiome

Camila K Dias et al. PeerJ Comput Sci. .

Abstract

Background: In the last twenty years, new methodologies have made possible the gathering of large amounts of data concerning the genetic information and metabolic functions associated to the human gut microbiome. In spite of that, processing all this data available might not be the simplest of tasks, which could result in an excess of information awaiting proper annotation. This assessment intended on evaluating how well respected databases could describe a mock human gut microbiome.

Methods: In this work, we critically evaluate the output of the cross-reference between the Uniprot Knowledge Base (Uniprot KB) and the Kyoto Encyclopedia of Genes and Genomes Orthologs (KEGG Orthologs) or the evolutionary genealogy of genes: Non-supervised Orthologous groups (EggNOG) databases regarding a list of species that were previously found in the human gut microbiome.

Results: From a list which contemplates 131 species and 52 genera, 53 species and 40 genera had corresponding entries for KEGG Database and 82 species and 47 genera had corresponding entries for EggNOG Database. Moreover, we present the KEGG Orthologs (KOs) and EggNOG Orthologs (NOGs) entries associated to the search as their distribution over species and genera and lists of functions that appeared in many species or genera, the "core" functions of the human gut microbiome. We also present the relative abundance of KOs and NOGs throughout phyla and genera. Lastly, we expose a variance found between searches with different arguments on the database entries. Inferring functionality based on cross-referencing UniProt and KEGG or EggNOG can be lackluster due to the low number of annotated species in Uniprot and due to the lower number of functions affiliated to the majority of these species. Additionally, the EggNOG database showed greater performance for a cross-search with Uniprot about a mock human gut microbiome. Notwithstanding, efforts targeting cultivation, single-cell sequencing or the reconstruction of high-quality metagenome-assembled genomes (MAG) and their annotation are needed to allow the use of these databases for inferring functionality in human gut microbiome studies.

Keywords: Database; EggNOG; Functional diversity; Gut microbiome; Human Microbiome; KEGG; Uniprot.

PubMed Disclaimer

Conflict of interest statement

The authors declare there are no competing interests.

Figures

Figure 1
Figure 1. The relationship between the number of taxa sharing NOGs and KOs.
(A) Number of genera sharing EggNOGs, (B) number of species sharing EggNOGs, (C) number of genera sharing KOs and (D) number of species sharing KOs.
Figure 2
Figure 2. Abundance of KOs and NOGs throughout different genera detected in UniprotKB.
(A) EggNOG couts and (B) KEGG counts. Genera are ordered by the sum of identifiers in both databases. Some genera only had identifiers in one database.
Figure 3
Figure 3. Abundance of KOs and NOGs throughout different species detected in UniprotKB.
(A) EggNOG counts and (B) KEGG counts. Species are ordered by the sum of identifiers in both databases. Some species only had identifiers in one database.
Figure 4
Figure 4. Relative abundance of NOGs and KOs throughout phyla (A) and genera (B) for EggNOG and KEGG cross-searches.

References

    1. McLaren MR, Willis AD, Callahan BJ. Consistent and correctable bias in metagenomic sequencing measurements. bioRxiv. 2019 doi: 10.1101/559831. - DOI - PMC - PubMed
    1. Chauhan NS, Pandey R, Mondal AK, Gupta S, Verma MK, Jain S, Ahmed V, Patil R, Agarwal D, Girase B, Shrivastava A. Western Indian rural gut microbial diversity in extreme Prakriti endo-phenotypes reveals signature microbes. Frontiers in Microbiology. 2018;9(118):1–12. doi: 10.3389/fmicb.2018.00118. - DOI - PMC - PubMed
    1. Cryan JF, Dinan TG. Mind-altering microorganisms: the impact of the gut microbiota on brain and behaviour. Nature Reviews Neuroscience. 2012;13(10):701–712. doi: 10.1038/nrn3346. - DOI - PubMed
    1. David LA, Maurice CF, Carmody RN, Gootenberg DB, Button JE, Wolfe BE, Ling AV, Devlin AS, Varma Y, Fischbach MA, Biddinger SB. Diet rapidly and reproducibly alters the human gut microbiome. Nature. 2014;505(7484):559–563. doi: 10.1038/nature12820. - DOI - PMC - PubMed
    1. Di Bella JM, Bao Y, Gloor GB, Burton JP, Reid G. High throughput sequencing methods and analysis for microbiome research. Journal of Microbiological Methods. 2013;95(3):401–414. doi: 10.1016/j.mimet.2013.08.011. - DOI - PubMed

LinkOut - more resources