Association rule mining of the human gut microbiome
- PMID: 40550997
- DOI: 10.1007/s11427-024-2865-1
Association rule mining of the human gut microbiome
Abstract
The human gut carries a vast and diverse microbial community that is essential for human health. Understanding the structure of this complex community is a crucial step toward comprehending human-microbiome interactions. Traditional co-occurrence and correlation analyses typically focus on pairwise relationships and ignore higher-order relationships. Association rule mining (ARM) is a well-developed technique in data mining and has been applied to human microbiome data to identify higher-order relationships. Yet, existing attempts suffer from small sample sizes and low taxonomic resolution. We developed an advanced ARM framework and systematically investigated the interactions between microbial species using a public large-scale uniformly processed human microbiome data from the curatedMetagenomicData (CMD) together with ARM. First, we inferred association rules in the gut microbiome samples of healthy individuals (n=2,815) in CMD. Then we compared those rules with those inferred from the individuals with different diseases: inflammatory bowel disease (IBD, n=768), colorectal cancer (CRC, n=368), impaired glucose tolerance (IGT, n=199), and type 2 diabetes (T2D, n=164). Finally, we demonstrated that ARM is an efficient feature selection tool that can improve the performance of microbiome-based disease classification. Together, this study illustrates the higher-order microbial relationships in the human gut microbiome and highlights the critical importance of incorporating association rules in microbiome-based disease classification.
Keywords: gut microbiome; high-order associations; machine learning.
© 2025. Science China Press.
Conflict of interest statement
Compliance and ethics. The authors declare that they have no conflict of interest.
Similar articles
-
Integrating Gut Microbiome and Metabolomics with Magnetic Resonance Enterography to Advance Bowel Damage Prediction in Crohn's Disease.J Inflamm Res. 2025 Jun 11;18:7631-7649. doi: 10.2147/JIR.S524671. eCollection 2025. J Inflamm Res. 2025. PMID: 40535353 Free PMC article.
-
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3. Cochrane Database Syst Rev. 2022. PMID: 35593186 Free PMC article.
-
Synbiotics, prebiotics and probiotics for solid organ transplant recipients.Cochrane Database Syst Rev. 2022 Sep 20;9(9):CD014804. doi: 10.1002/14651858.CD014804.pub2. Cochrane Database Syst Rev. 2022. PMID: 36126902 Free PMC article.
-
HIV infection and exposure is associated with increased cariogenic taxa, reduced taxonomic turnover, and homogenized spatial differentiation for the supragingival microbiome.Microbiome. 2025 Jun 16;13(1):144. doi: 10.1186/s40168-025-02123-9. Microbiome. 2025. PMID: 40524270 Free PMC article.
-
Effects of a gluten-reduced or gluten-free diet for the primary prevention of cardiovascular disease.Cochrane Database Syst Rev. 2022 Feb 24;2(2):CD013556. doi: 10.1002/14651858.CD013556.pub2. Cochrane Database Syst Rev. 2022. PMID: 35199850 Free PMC article.
References
-
- Agrawal, R., Imieliński, T., and Swami, A. (1993). Mining association rules between sets of items in large databases. SIGMOD Rec 22, 207–216. - DOI
-
- Almeida-Neto, M., Guimarães, P., Guimarães Jr, P.R., Loyola, R.D., and Ulrich, W. (2008). A consistent metric for nestedness analysis in ecological systems: reconciling concept and measurement. Oikos 117, 1227–1239. - DOI
LinkOut - more resources
Full Text Sources