Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Jun 9;22(1):312.
doi: 10.1186/s12859-021-04212-6.

Expanding the taxonomic range in the fecal metagenome

Affiliations

Expanding the taxonomic range in the fecal metagenome

Theo R Allnutt et al. BMC Bioinformatics. .

Abstract

Background: Except for bacteria, the taxonomic diversity of the human fecal metagenome has not been widely studied, despite the potential importance of viruses and eukaryotes. Widely used bioinformatic tools contain limited numbers of non-bacterial species in their databases compared to available genomic sequences and their methodologies do not favour classification of rare sequences which may represent only a small fraction of their parent genome. In seeking to optimise identification of non-bacterial species, we evaluated five widely-used metagenome classifier programs (BURST, Kraken2, Centrifuge, MetaPhlAn2 and CCMetagen) for their ability to correctly assign and count simulations of bacterial, viral and eukaryotic DNA sequence reads, including the effect of taxonomic order of analysis of bacteria, viruses and eukaryotes and the effect of sequencing depth.

Results: We found that the precision of metagenome classifiers varied significantly between programs and between taxonomic groups. When classifying viruses and eukaryotes, ordering the analysis such that bacteria were classified first significantly improved classification precision. Increasing sequencing depth decreased classification precision and did not improve recall of rare species.

Conclusions: Choice of metagenome classifier program can have a marked effect on results with respect to precision of species assignment in different taxonomic groups. The order of taxonomic classification can markedly improve precision. Increasing sequencing depth can decrease classification precision and yields diminishing returns in probability of species detection.

Keywords: Benchmarking; Classifier; Eukaryotes; Metagenomics; Viruses.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no competing interests.

Figures

Fig. 1
Fig. 1
Recall determined by classifier programs using an unordered analysis approach
Fig. 2
Fig. 2
Precision determined by classifier programs using an unordered analysis approach
Fig. 3
Fig. 3
Precision determined with BURST classifier for each taxonomic group, using seven different orders of analysis: a = viruses > bacteria > eukaryotes; b = viruses > eukaryotes > bacteria; c = bacteria > eukaryotes > viruses; d = bacteria > viruses > eukaryotes; e = eukaryotes > bacteria > viruses; f = eukaryotes > viruses > bacteria; g = unordered
Fig. 4
Fig. 4
Precision determined by each classifier program after ordered analysis ‘d’ (bacteria > viruses > eukaryotes) and unordered analysis
Fig. 5
Fig. 5
Effect of sequencing depth on precision determined by BURST classifier for each taxonomic group, ordered using method ‘d’ (bacteria > viruses > eukaryotes)
Fig. 6
Fig. 6
Effect of increasing sequencing depth (grouped on x-axis) on the probability of detecting and classifying species within six expected relative abundance bins (key denotes lower limit of bin)

References

    1. Breitwieser FP, Lu J, Salzberg SL. A review of methods and databases for metagenomic classification and assembly. Brief Bioinform. 2019;20(4):1125–1136. doi: 10.1093/bib/bbx120. - DOI - PMC - PubMed
    1. Lynch SV, Pedersen O. The human intestinal microbiome in health and disease. N Engl J Med. 2016;375(24):2369–2379. doi: 10.1056/NEJMra1600266. - DOI - PubMed
    1. Clemente JC, Ursell LK, Parfrey LW, Knight R. The impact of the gut microbiota on human health: an integrative view. Cell. 2012;148(6):1258–1270. doi: 10.1016/j.cell.2012.01.035. - DOI - PMC - PubMed
    1. Sender R, Fuchs S, Milo R. Revised estimates for the number of human and bacteria cells in the body. PLoS Biol. 2016;14(8):e1002533. doi: 10.1371/journal.pbio.1002533. - DOI - PMC - PubMed
    1. Garmaeva S, Sinha T, Kurilshikov A, Fu J, Wijmenga C, Zhernakova A. Studying the gut virome in the metagenomic era: challenges and perspectives. BMC Biol. 2019;17(1):84. doi: 10.1186/s12915-019-0704-y. - DOI - PMC - PubMed

LinkOut - more resources