Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Sep 1;13(9):1570.
doi: 10.3390/genes13091570.

Unexpected Actors in Inflammatory Bowel Disease Revealed by Machine Learning from Whole-Blood Transcriptomic Data

Affiliations

Unexpected Actors in Inflammatory Bowel Disease Revealed by Machine Learning from Whole-Blood Transcriptomic Data

Jan K Nowak et al. Genes (Basel). .

Abstract

Although big data from transcriptomic analyses have helped transform our understanding of inflammatory bowel disease (IBD), they remain underexploited. We hypothesized that the application of machine learning using lasso regression to transcriptomic data from IBD patients and controls can help identify previously overlooked genes. Transcriptomic data provided by Ostrowski et al. (ENA PRJEB28822) were subjected to a two-stage process of feature selection to discriminate between IBD and controls. First, a principal component analysis was used for dimensionality reduction. Second, the least absolute shrinkage and selection operator (lasso) regression was employed to identify genes potentially involved in the pathobiology of IBD. The study included data from 294 participants: 100 with ulcerative colitis (48 adults and 52 children), 99 with Crohn's disease (45 adults and 54 children), and 95 controls (46 adults and 49 children). IBD patients presented a wide range of disease severity. Lasso regression preceded by principal component analysis successfully selected interesting features in the IBD transcriptomic data and yielded 12 models. The models achieved high discriminatory value (range of the area under the receiver operating characteristic curve 0.61-0.95) and identified over 100 genes as potentially associated with IBD. PURA, GALNT14, and FCGR1A were the most consistently selected, highlighting the role of the cell cycle, glycosylation, and immunoglobulin binding. Several known IBD-related genes were among the results. The results included genes involved in the TGF-beta pathway, expressed in NK cells, and they were enriched in ontology terms related to immunity. Future IBD research should emphasize the TGF-beta pathway, immunoglobulins, NK cells, and the role of glycosylation.

Keywords: Crohn’s disease; TGF-beta; expression; inflammatory bowel disease; ulcerative colitis.

PubMed Disclaimer

Conflict of interest statement

J.K.N. reports personal fees from Norsa Pharma, grant support from the Biocodex Microbiota Foundation, and nonfinancial support from Nutricia outside of the submitted work. J.W. reports personal fees and nonfinancial support from Biocodex, BGP Products, Chiesi, Hipp, Humana, Mead Johnson Nutrition, Merck Sharp and Dohme, Nestle, Norsa Pharma, Nutricia, Roche, Sequoia Pharmaceuticals, and Vitis Pharma, outside the submitted work, as well as grants, personal fees, and nonfinancial support from Nutricia Research Foundation Poland, also outside the submitted work. The remaining authors declare no conflicts of interest.

Figures

Figure 1
Figure 1
Gene ontology analysis of the genes included in the lasso models for the prediction of IBD, UC, and CD status. The ratio of the number of genes in overlap (k) to all genes in the given gene set (K) is indicated.

References

    1. Charbit-Henrion F., Parlato M., Hanein S., Duclaux-Loras R., Nowak J., Begue B., Rakotobe S., Bruneau J., Fourrage C., Alibeu O., et al. Diagnostic Yield of Next-Generation Sequencing in Very Early-Onset Inflammatory Bowel Diseases: A Multicentre Study. J. Crohn’s Colitis. 2018;12:1104–1112. doi: 10.1093/ecco-jcc/jjy068. - DOI - PMC - PubMed
    1. Parlato M., Charbit-Henrion F., Pan J., Romano C., Duclaux-Loras R., Le Du M.-H., Warner N., Francalanci P., Bruneau J., Bras M., et al. Human ALPI Deficiency Causes Inflammatory Bowel Disease and Highlights a Key Mechanism of Gut Homeostasis. EMBO Mol. Med. 2018;10 doi: 10.15252/emmm.201708483. - DOI - PMC - PubMed
    1. Jostins L., Ripke S., Weersma R.K., Duerr R.H., McGovern D.P., Hui K.Y., Lee J.C., Schumm L.P., Sharma Y., Anderson C.A., et al. Host-Microbe Interactions Have Shaped the Genetic Architecture of Inflammatory Bowel Disease. Nature. 2012;491:119–124. doi: 10.1038/nature11582. - DOI - PMC - PubMed
    1. Ostrowski J., Paziewska A., Lazowska I., Ambrozkiewicz F., Goryca K., Kulecka M., Rawa T., Karczmarski J., Dabrowska M., Zeber-Lubecka N., et al. Genetic Architecture Differences between Pediatric and Adult-Onset Inflammatory Bowel Diseases in the Polish Population. Sci. Rep. 2016;6:39831. doi: 10.1038/srep39831. - DOI - PMC - PubMed
    1. Travis M.A., Reizis B., Melton A.C., Masteller E., Tang Q., Proctor J.M., Wang Y., Bernstein X., Huang X., Reichardt L.F., et al. Loss of Integrin α(v)β8 on Dendritic Cells Causes Autoimmunity and Colitis in Mice. Nature. 2007;449:361–365. doi: 10.1038/nature06110. - DOI - PMC - PubMed

Publication types

Substances