Meta-Storms: efficient search for similar microbial communities based on a novel indexing scheme and similarity score for metagenomic data
- PMID: 22843983
- DOI: 10.1093/bioinformatics/bts470
Meta-Storms: efficient search for similar microbial communities based on a novel indexing scheme and similarity score for metagenomic data
Abstract
Background: It has long been intriguing scientists to effectively compare different microbial communities (also referred as 'metagenomic samples' here) in a large scale: given a set of unknown samples, find similar metagenomic samples from a large repository and examine how similar these samples are. With the current metagenomic samples accumulated, it is possible to build a database of metagenomic samples of interests. Any metagenomic samples could then be searched against this database to find the most similar metagenomic sample(s). However, on one hand, current databases with a large number of metagenomic samples mostly serve as data repositories that offer few functionalities for analysis; and on the other hand, methods to measure the similarity of metagenomic data work well only for small set of samples by pairwise comparison. It is not yet clear, how to efficiently search for metagenomic samples against a large metagenomic database.
Results: In this study, we have proposed a novel method, Meta-Storms, that could systematically and efficiently organize and search metagenomic data. It includes the following components: (i) creating a database of metagenomic samples based on their taxonomical annotations, (ii) efficient indexing of samples in the database based on a hierarchical taxonomy indexing strategy, (iii) searching for a metagenomic sample against the database by a fast scoring function based on quantitative phylogeny and (iv) managing database by index export, index import, data insertion, data deletion and database merging. We have collected more than 1300 metagenomic data from the public domain and in-house facilities, and tested the Meta-Storms method on these datasets. Our experimental results show that Meta-Storms is capable of database creation and effective searching for a large number of metagenomic samples, and it could achieve similar accuracies compared with the current popular significance testing-based methods.
Conclusion: Meta-Storms method would serve as a suitable database management and search system to quickly identify similar metagenomic samples from a large pool of samples.
Contact: ningkang@qibebt.ac.cn
Supplementary information: Supplementary data are available at Bioinformatics online.
Similar articles
-
[Meta-Mesh: metagenomic data analysis system].Sheng Wu Gong Cheng Xue Bao. 2014 Jan;30(1):6-17. Sheng Wu Gong Cheng Xue Bao. 2014. PMID: 24818475 Chinese.
-
Parallel-META: efficient metagenomic data analysis based on high-performance computation.BMC Syst Biol. 2012;6 Suppl 1(Suppl 1):S16. doi: 10.1186/1752-0509-6-S1-S16. Epub 2012 Jul 16. BMC Syst Biol. 2012. PMID: 23046922 Free PMC article.
-
COGNIZER: A Framework for Functional Annotation of Metagenomic Datasets.PLoS One. 2015 Nov 11;10(11):e0142102. doi: 10.1371/journal.pone.0142102. eCollection 2015. PLoS One. 2015. PMID: 26561344 Free PMC article.
-
[A review on the bioinformatics pipelines for metagenomic research].Dongwuxue Yanjiu. 2012 Dec;33(6):574-85. doi: 10.3724/SP.J.1141.2012.06574. Dongwuxue Yanjiu. 2012. PMID: 23266976 Review. Chinese.
-
Metagenomics and Bioinformatics in Microbial Ecology: Current Status and Beyond.Microbes Environ. 2016 Sep 29;31(3):204-12. doi: 10.1264/jsme2.ME16024. Epub 2016 Jul 5. Microbes Environ. 2016. PMID: 27383682 Free PMC article. Review.
Cited by
-
Method development for cross-study microbiome data mining: Challenges and opportunities.Comput Struct Biotechnol J. 2020 Aug 1;18:2075-2080. doi: 10.1016/j.csbj.2020.07.020. eCollection 2020. Comput Struct Biotechnol J. 2020. PMID: 32802279 Free PMC article. Review.
-
The Enhanced Pharmacological Effects of Modified Traditional Chinese Medicine in Attenuation of Atherosclerosis Is Driven by Modulation of Gut Microbiota.Front Pharmacol. 2020 Oct 15;11:546589. doi: 10.3389/fphar.2020.546589. eCollection 2020. Front Pharmacol. 2020. PMID: 33178012 Free PMC article.
-
Visibiome: an efficient microbiome search engine based on a scalable, distributed architecture.BMC Bioinformatics. 2017 Jul 24;18(1):353. doi: 10.1186/s12859-017-1763-0. BMC Bioinformatics. 2017. PMID: 28738824 Free PMC article.
-
Mechanosensory Piezo2 regulated by gut microbiota participates in the development of visceral hypersensitivity and intestinal dysmotility.Gut Microbes. 2025 Dec;17(1):2497399. doi: 10.1080/19490976.2025.2497399. Epub 2025 Apr 28. Gut Microbes. 2025. PMID: 40296251 Free PMC article.
-
Hierarchical Meta-Storms enables comprehensive and rapid comparison of microbiome functional profiles on a large scale using hierarchical dissimilarity metrics and parallel computing.Bioinform Adv. 2021 May 12;1(1):vbab003. doi: 10.1093/bioadv/vbab003. eCollection 2021. Bioinform Adv. 2021. PMID: 36700101 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources