Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Jun 11:11:645951.
doi: 10.3389/fcimb.2021.645951. eCollection 2021.

Leveraging 16S rRNA Microbiome Sequencing Data to Identify Bacterial Signatures for Irritable Bowel Syndrome

Affiliations

Leveraging 16S rRNA Microbiome Sequencing Data to Identify Bacterial Signatures for Irritable Bowel Syndrome

Yuxia Liu et al. Front Cell Infect Microbiol. .

Abstract

Irritable bowel syndrome (IBS) is a chronic gastrointestinal disorder characterized by abdominal pain or discomfort. Previous studies have illustrated that the gut microbiota might play a critical role in IBS, but the conclusions of these studies, based on various methods, were almost impossible to compare, and reproducible microorganism signatures were still in question. To cope with this problem, previously published 16S rRNA gene sequencing data from 439 fecal samples, including 253 IBS samples and 186 control samples, were collected and processed with a uniform bioinformatic pipeline. Although we found no significant differences in community structures between IBS and healthy controls at the amplicon sequence variants (ASV) level, machine learning (ML) approaches enabled us to discriminate IBS from healthy controls at genus level. Linear discriminant analysis effect size (LEfSe) analysis was subsequently used to seek out 97 biomarkers across all studies. Then, we quantified the standardized mean difference (SMDs) for all significant genera identified by LEfSe and ML approaches. Pooled results showed that the SMDs of nine genera had statistical significance, in which the abundance of Lachnoclostridium, Dorea, Erysipelatoclostridium, Prevotella 9, and Clostridium sensu stricto 1 in IBS were higher, while the dominant abundance genera of healthy controls were Ruminococcaceae UCG-005, Holdemanella, Coprococcus 2, and Eubacterium coprostanoligenes group. In summary, based on six published studies, this study identified nine new microbiome biomarkers of IBS, which might be a basis for understanding the key gut microbes associated with IBS, and could be used as potential targets for microbiome-based diagnostics and therapeutics.

Keywords: 16S rRNA; biomarkers; gut microbiome; irritable bowel syndrome; machine learning algorithm.

PubMed Disclaimer

Conflict of interest statement

Authors LW, YW, JS and YS were employed by company Tianjin Zhongxin Pharmaceutical Group Co., Ltd. The reamining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

Figure 1
Figure 1
The top frames represent the three main steps, and the rectangular frame shows the data process. DADA2, divisive amplicon denoising algorithm 2; ASV, amplicon sequence variants; LEfSe, linear discriminant analysis effect size; RF, random forest; IBS, irritable bowel syndrome.
Figure 2
Figure 2
Most of the studies showed microbiome changes, and potential signatures were found at genus-level in each dataset. (A) Left: Area under the ROC curve (AUC) is calculated by the AdaBoost algorithm. Right: Area under the ROC curve (AUC) is calculated by the random forest algorithm. X-axis starts at 0.5. (B) The number of genera biomarkers with p < 0.05 were identified by linear discriminant analysis effect size (LEfSe) analysis.
Figure 3
Figure 3
The AdaBoost and random forest classifiers identified important feature genera in distinguishing IBS from healthy subjects. (A) The important features selected by the AdaBoost classifier. a1. The top 23 genus biomarkers were ranked in descending order of the most relevant features to the model. a2. 10-fold cross-validation score on the Y-axis and the number of features on the X-axis. (B) The important features contributed to random forest corresponding to AdaBoost. b1. 25 important genera sorted in descending order based on mean decrease accuracy (MDA). b2. 10-fold cross-validation error on the Y-axis and the number of features on the X-axis.
Figure 4
Figure 4
Forest plot reporting effect sizes calculated using a meta-analysis of standardized mean differences and a fixed-effects model on nine genera abundances between carcinomas and controls. The SMD had a positive value confirming the higher number of genera in patients with IBS, and a negative value showing more abundance of genera in healthy controls. The length of the error bar depicts the 95% CIs. The fixed model indicates the overall effect sizes SMD value of all studies.

References

    1. Abbas A., Kong X. B., Liu Z., Jing B. Y., Gao X. (2013). Automatic Peak Selection by a Benjamini-Hochberg-Based Algorithm. PloS One 8, e53112. 10.1371/journal.pone.0053112 - DOI - PMC - PubMed
    1. Armour C. R., Nayfach S., Pollard K. S., Sharpton T. J. (2019). A Metagenomic Meta-Analysis Reveals Functional Signatures of Health and Disease in the Human Gut Microbiome. mSystems 4 (4), e00332–18. 10.1128/mSystems.00332-18 - DOI - PMC - PubMed
    1. Asghar Z., Thoufeeq M., Kurien M., Ball A. J., Rej A., David Tai F. W., et al. . (2020). Diagnostic Yield of Colonoscopy in Patients With Symptoms Compatible With Rome IV Functional Bowel Disorders. Clin. Gastroenterol. Hepatol. S1542-3565 (20), 31221–0. 10.1016/j.cgh.2020.08.062 - DOI - PubMed
    1. Bolyen E., Rideout J. R., Dillon M. R., Bokulich N. A., Abnet C. C., Al-Ghalith G. A., et al. . (2019). Reproducible, Interactive, Scalable and Extensible Microbiome Data Science Using QIIME 2. Nat. Biotechnol. 37, 852–857. 10.1038/s41587-019-0209-9 - DOI - PMC - PubMed
    1. Brynjolfsson E., Mitchell T. (2017). What can Machine Learning do? Workforce Implications. Science 358, 1530–1534. 10.1126/science.aap8062 - DOI - PubMed

Publication types

Substances

Supplementary concepts

LinkOut - more resources