Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2018 Jan 15;6(1):13.
doi: 10.1186/s40168-018-0398-3.

Multi-omics differentially classify disease state and treatment outcome in pediatric Crohn's disease

Affiliations
Comparative Study

Multi-omics differentially classify disease state and treatment outcome in pediatric Crohn's disease

Gavin M Douglas et al. Microbiome. .

Abstract

Background: Crohn's disease (CD) has an unclear etiology, but there is growing evidence of a direct link with a dysbiotic microbiome. Many gut microbes have previously been associated with CD, but these have mainly been confounded with patients' ongoing treatments. Additionally, most analyses of CD patients' microbiomes have focused on microbes in stool samples, which yield different insights than profiling biopsy samples.

Results: We sequenced the 16S rRNA gene (16S) and carried out shotgun metagenomics (MGS) from the intestinal biopsies of 20 treatment-naïve CD and 20 control pediatric patients. We identified the abundances of microbial taxa and inferred functional categories within each dataset. We also identified known human genetic variants from the MGS data. We then used a machine learning approach to determine the classification accuracy when these datasets, collapsed to different hierarchical groupings, were used independently to classify patients by disease state and by CD patients' response to treatment. We found that 16S-identified microbes could classify patients with higher accuracy in both cases. Based on follow-ups with these patients, we identified which microbes and functions were best for predicting disease state and response to treatment, including several previously identified markers. By combining the top features from all significant models into a single model, we could compare the relative importance of these predictive features. We found that 16S-identified microbes are the best predictors of CD state whereas MGS-identified markers perform best for classifying treatment response.

Conclusions: We demonstrate for the first time that useful predictors of CD treatment response can be produced from shotgun MGS sequencing of biopsy samples despite the complications related to large proportions of host DNA. The top predictive features that we identified in this study could be useful for building an improved classifier for CD and treatment response based on sufferers' microbiome in the future. The BISCUIT project is funded by a Clinical Academic Fellowship from the Chief Scientist Office (Scotland)-CAF/08/01.

Keywords: Crohn’s disease; Machine learning; Microbiome; Pediatric; Treatment response; Treatment-naïve.

PubMed Disclaimer

Conflict of interest statement

Ethics approval and consent to participate

Ethical approval was granted by North of Scotland Research Ethics Service (09/S0802/24), and written informed consent was obtained from the parents of all subjects. Informed assent was also obtained from older children who were deemed capable of understanding the nature of the study. This study is publicly registered on the United Kingdom Clinical Research Network Portfolio (9633). An ethics amendment allowed further review of the cohort participants’ therapies and clinical response for the first year following diagnosis.

Consent for publication

Not applicable

Competing interests

The authors declare that they have no competing interests.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Figures

Fig. 1
Fig. 1
Diagram of the different datasets used for classification in this study. Datasets in orange were derived from the shotgun metagenomic sequencing (MGS) data (n = 40) and the datasets in blue were derived from the 16S rRNA gene (16S) sequencing data (n = 38*). These datasets were used to classify both disease state and treatment response as input to random forest machine learning models. *Note two Crohn’s disease samples were removed from both the 16S sequencing and MGS datasets due to low sequencing coverage, but their genetic profile was inferred from the MGS
Fig. 2
Fig. 2
Classification accuracies for all datasets classifying a disease state and b treatment response. Each bar corresponds to a different model. Accuracies are based on random forest (RF) leave-one-out cross-validation (LOOCV) in all cases, except for number of observed OTUs (# OTUs) and genetic risk scores (GRS) which are based on LOOCV of simple linear cut-off models. The symbols *, **, and *** indicate significance at P < 0.05, P < 0.01, and P < 0.001, respectively. RF model significances were based on a permutation test. P values for # OTUs and GRS are based on one-tailed Mann-Whitney-Wilcoxon Tests
Fig. 3
Fig. 3
Variable importance of features in combined random forest models for a disease state classification and b treatment response classification. Red and blue are used to indicate which class has a higher mean standardized relative abundance. Features that did not significantly differ (P ≥ 0.05) between classes based on a two-tailed Mann-Whitney-Wilcoxon test are indicated in gray. Features in black and green font indicate 16S rRNA gene and shotgun metagenomics sequencing origins, respectively. “Un” stands for “Unclassified” when used in taxa names

References

    1. Cleynen I, Boucher G, Jostins L, Schumm LP, Zeissig S, Ahmad T, et al. Inherited determinants of Crohn’s disease and ulcerative colitis phenotypes: a genetic association study. Lancet. 2016;387:156–167. doi: 10.1016/S0140-6736(15)00465-1. - DOI - PMC - PubMed
    1. Neovius M, Arkema EV, Blomqvist P, Ekbom A, Smedby KE. Patients with ulcerative colitis miss more days of work than the general population, even following colectomy. Gastroenterology. 2013;144:536–543. doi: 10.1053/j.gastro.2012.12.004. - DOI - PubMed
    1. Cho JH. The genetics and immunopathogenesis of inflammatory bowel disease. Nat Rev Immunol. 2008;8:458–466. doi: 10.1038/nri2340. - DOI - PubMed
    1. Molodecky NA, Soon IS, Rabi DM, Ghali WA, Ferris M, Chernoff G, et al. Increasing incidence and prevalence of the inflammatory bowel diseases with time, based on systematic review. Gastroenterology. 2012;142:46–54. doi: 10.1053/j.gastro.2011.10.001. - DOI - PubMed
    1. Henderson P, Hansen R, Cameron FL, Gerasimidis K, Rogers P, Bisset WM, et al. Rising incidence of pediatric inflammatory bowel disease in Scotland. Inflamm Bowel Dis. 2012;18:999–1005. doi: 10.1002/ibd.21797. - DOI - PubMed

Publication types

Supplementary concepts