Species classifier choice is a key consideration when analysing low-complexity food microbiome data
- PMID: 29554948
- PMCID: PMC5859664
- DOI: 10.1186/s40168-018-0437-0
Species classifier choice is a key consideration when analysing low-complexity food microbiome data
Abstract
Background: The use of shotgun metagenomics to analyse low-complexity microbial communities in foods has the potential to be of considerable fundamental and applied value. However, there is currently no consensus with respect to choice of species classification tool, platform, or sequencing depth. Here, we benchmarked the performances of three high-throughput short-read sequencing platforms, the Illumina MiSeq, NextSeq 500, and Ion Proton, for shotgun metagenomics of food microbiota. Briefly, we sequenced six kefir DNA samples and a mock community DNA sample, the latter constructed by evenly mixing genomic DNA from 13 food-related bacterial species. A variety of bioinformatic tools were used to analyse the data generated, and the effects of sequencing depth on these analyses were tested by randomly subsampling reads.
Results: Compositional analysis results were consistent between the platforms at divergent sequencing depths. However, we observed pronounced differences in the predictions from species classification tools. Indeed, PERMANOVA indicated that there was no significant differences between the compositional results generated by the different sequencers (p = 0.693, R2 = 0.011), but there was a significant difference between the results predicted by the species classifiers (p = 0.01, R2 = 0.127). The relative abundances predicted by the classifiers, apart from MetaPhlAn2, were apparently biased by reference genome sizes. Additionally, we observed varying false-positive rates among the classifiers. MetaPhlAn2 had the lowest false-positive rate, whereas SLIMM had the greatest false-positive rate. Strain-level analysis results were also similar across platforms. Each platform correctly identified the strains present in the mock community, but accuracy was improved slightly with greater sequencing depth. Notably, PanPhlAn detected the dominant strains in each kefir sample above 500,000 reads per sample. Again, the outputs from functional profiling analysis using SUPER-FOCUS were generally accordant between the platforms at different sequencing depths. Finally, and expectedly, metagenome assembly completeness was significantly lower on the MiSeq than either on the NextSeq (p = 0.03) or the Proton (p = 0.011), and it improved with increased sequencing depth.
Conclusions: Our results demonstrate a remarkable similarity in the results generated by the three sequencing platforms at different sequencing depths, and, in fact, the choice of bioinformatics methodology had a more evident impact on results than the choice of sequencer did.
Keywords: Low-complexity microbiome; Sequencing platform comparison; Shotgun metagenomics.
Conflict of interest statement
Ethics approval and consent to participate
Not applicable
Consent for publication
Not applicable
Competing interests
The authors declare that they have no competing interests.
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Figures








Similar articles
-
Optimisation of methods for bacterial skin microbiome investigation: primer selection and comparison of the 454 versus MiSeq platform.BMC Microbiol. 2017 Jan 21;17(1):23. doi: 10.1186/s12866-017-0927-4. BMC Microbiol. 2017. PMID: 28109256 Free PMC article.
-
MinION™ nanopore sequencing of environmental metagenomes: a synthetic approach.Gigascience. 2017 Mar 1;6(3):1-10. doi: 10.1093/gigascience/gix007. Gigascience. 2017. PMID: 28327976 Free PMC article.
-
A comparison of sequencing platforms and bioinformatics pipelines for compositional analysis of the gut microbiome.BMC Microbiol. 2017 Sep 13;17(1):194. doi: 10.1186/s12866-017-1101-8. BMC Microbiol. 2017. PMID: 28903732 Free PMC article.
-
High throughput sequencing methods for microbiome profiling: application to food animal systems.Anim Health Res Rev. 2012 Jun;13(1):40-53. doi: 10.1017/S1466252312000126. Anim Health Res Rev. 2012. PMID: 22853944 Review.
-
High throughput sequencing methods and analysis for microbiome research.J Microbiol Methods. 2013 Dec;95(3):401-14. doi: 10.1016/j.mimet.2013.08.011. Epub 2013 Sep 9. J Microbiol Methods. 2013. PMID: 24029734 Review.
Cited by
-
Crowdsourced benchmarking of taxonomic metagenome profilers: lessons learned from the sbv IMPROVER Microbiomics challenge.BMC Genomics. 2022 Aug 30;23(1):624. doi: 10.1186/s12864-022-08803-2. BMC Genomics. 2022. PMID: 36042406 Free PMC article.
-
Sensitivity of shotgun metagenomics to host DNA: abundance estimates depend on bioinformatic tools and contamination is the main issue.Access Microbiol. 2020 Feb 17;2(4):acmi000104. doi: 10.1099/acmi.0.000104. eCollection 2020. Access Microbiol. 2020. PMID: 33005868 Free PMC article.
-
Sequencing of the Cheese Microbiome and Its Relevance to Industry.Front Microbiol. 2018 May 23;9:1020. doi: 10.3389/fmicb.2018.01020. eCollection 2018. Front Microbiol. 2018. PMID: 29875744 Free PMC article. Review.
-
Metagenomic and Physicochemical Analyses Reveal Microbial Community and Functional Differences Between Three Different Grades of Hongxin Low-Temperature Daqu.Foods. 2025 Mar 22;14(7):1104. doi: 10.3390/foods14071104. Foods. 2025. PMID: 40238191 Free PMC article.
-
Differences in Faecal Microbiome Taxonomy, Diversity and Functional Potential in a Bovine Cohort Experimentally Challenged with Mycobacterium avium subsp. paratuberculosis (MAP).Animals (Basel). 2023 May 16;13(10):1652. doi: 10.3390/ani13101652. Animals (Basel). 2023. PMID: 37238082 Free PMC article.
References
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Molecular Biology Databases