Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 Jan 18:6:19233.
doi: 10.1038/srep19233.

An evaluation of the accuracy and speed of metagenome analysis tools

Affiliations

An evaluation of the accuracy and speed of metagenome analysis tools

Stinus Lindgreen et al. Sci Rep. .

Erratum in

Abstract

Metagenome studies are becoming increasingly widespread, yielding important insights into microbial communities covering diverse environments from terrestrial and aquatic ecosystems to human skin and gut. With the advent of high-throughput sequencing platforms, the use of large scale shotgun sequencing approaches is now commonplace. However, a thorough independent benchmark comparing state-of-the-art metagenome analysis tools is lacking. Here, we present a benchmark where the most widely used tools are tested on complex, realistic data sets. Our results clearly show that the most widely used tools are not necessarily the most accurate, that the most accurate tool is not necessarily the most time consuming, and that there is a high degree of variability between available tools. These findings are important as the conclusions of any metagenomics study are affected by errors in the predicted community composition and functional capacity. Data sets and results are freely available from http://www.ucbioinformatics.org/metabenchmark.html.

PubMed Disclaimer

Figures

Figure 1
Figure 1. Plot of performance metrics for all methods included in this benchmark.
The closer each metric is to 1, the more accurate the method. The values are based on classifying reads mapped at the level of phylum (A) or genus (B).
Figure 2
Figure 2. Analysis of performance at the level of phylum (left) and genus (right).
(A,B) Sum of absolute log-odds scores at the phylum (A) or genus (B) level for each tool (bars) and log2 of run time in minutes (asterisks, *). Sum of log-odds scores indicate the overall performance in terms of deviation from the known proportions. A low sum indicates a high accuracy. (C,D) NMDS plot of relative abundances at the level of phylum (C) and genus (D) for the known and predicted communities in replicates. Eukaryotes are not included. Metagenomes in set A are gray, and metagenomes in set B are black. The known communities are shown with a star.
Figure 3
Figure 3. Shifts in relative abundance of the three functional categories (or set of categories) that vary between set A and set B for the tools that analyze the functional capacity of metagenomes.
A positive log-odds score means an increase in set A relative to set B, and a negative log-odds score means a decrease in set A relative to set B.

Similar articles

Cited by

References

    1. Margulies M. et al. Genome sequencing in microfabricated high-density picolitre reactors. Nature 437, 376–380 (2005). - PMC - PubMed
    1. Shendure J. et al. Accurate multiplex polony sequencing of an evolved bacterial genome. Science 309, 1728–1732 (2005). - PubMed
    1. Pedersen M. W. et al. Ancient and modern environmental DNA. Philos. Trans. R. Soc. Lond. B Biol. Sci. 370 (2015). - PMC - PubMed
    1. Devaraj S., Hemarajata P. & Versalovic J. The human gut microbiome and body metabolism: implications for obesity and diabetes. Clin. Chem. 59, 617–628 (2013). - PMC - PubMed
    1. Foster J. A. & McVey Neufeld K.-A. Gut–brain axis: how the microbiome influences anxiety and depression. Trends Neurosci. 36, 305–312 (2013). - PubMed

Publication types