Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Dec 23;26(1):105868.
doi: 10.1016/j.isci.2022.105868. eCollection 2023 Jan 20.

Quantitative analysis of disease-related metabolic dysregulation of human microbiota

Affiliations

Quantitative analysis of disease-related metabolic dysregulation of human microbiota

Maria Rita Fumagalli et al. iScience. .

Abstract

The metabolic activity of all the micro-organism composing the human microbiome interacts with the host metabolism contributing to human health and disease in a way that is not fully understood. Here, we introduce STELLA, a computational method to derive the spectrum of metabolites associated with the microbiome of an individual. STELLA integrates known information on metabolic pathways associated with each bacterial species and extracts from these the list of metabolic products of each singular reaction by means of automatic text analysis. By comparing the result obtained on a single subject with the metabolic profile data of a control set of healthy subjects, we are able to identify individual metabolic alterations. To illustrate the method, we present applications to autism spectrum disorder and multiple sclerosis.

Keywords: Computational bioinformatics; Microbiome.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

None
Graphical abstract
Figure 1
Figure 1
Schematic of the algorithm The OTU abundances matrix obtained from microbiota sequencing experiments are combined with information on the metabolic pathways present in each OTU, retrieved from the Macadam database, and with the stoichiometry of metabolites produced and consumed in each pathway, obtained from the Metacyc database. All the information is combined to obtain a metabolite-patient matrix for each experimental dataset.
Figure 2
Figure 2
Algorithm validation (A) Heatmap of the z-scores of the metabolites recorded in and (B) the corresponding predictions made by the STELLA algorithm. (C) Confusion matrix for the algorithm predictions.
Figure 3
Figure 3
Batch effect removal (A and B) Correlation matrix of OTUs abundances obtained merging the datasets in the study by Kang et al. and Kong et al. before (A) and after (B) batch effect removal. Hierarchical clustering on correlation matrix shows that differences between the two datasets (gray, red) is predominant in the merged dataset, and the two major blocks disappear after batch removal. Hierachical clustering does not allow us to distinguish between patients and healthy controls (yellow, blue) suggesting a more elaborated procedure is needed. (C and D) Panels show the data from the intersection of data from the study by Kang et al. and Kong et al. projected onto the first two principal components and divided into healthy controls (red) and autistic patients (blue) before (C) and after (D) one-step batch removal. In panel (C), the variance between the data from different datasets is higher than the distance between the autistic patients and the healthy controls. Different symbols represent the two studies (circles, triangles as in legend).
Figure 4
Figure 4
LDA algorithm (A) Schematic of the LDA algorithm. After removal of highly correlated OTUs and metabolites, LDA feature selection is applied in order to obtain an optimal set of OTUs and metabolites to discriminate between patients (yellow circles) and healthy controls (blue circles). (B) Heatmaps show the ensemble of selected OTUs and metabolites for each replica of LDA feature selection for the combined datasets in the study by Kang et al. and Kong et al. after batch removal. The presence of a specific OTU or metabolite in a given replica is represented in blue and its absence in white and ordered using hierarchical clustering. (C and D) Performance of the models obtained from LDA feature selection. Correctness rate is evaluated after the inclusion of each new (C) OTU or (D) metabolite. The plots show, for each replica and in chronological order of selection, the increasing correctness rate of the model after the addition of every taxonomy.
Figure 5
Figure 5
LDA feature selection results for ASD Panel (A) show heatmaps representing the selected taxonomies for each replica of LDA feature selection reordered using hierarchical clustering and co-occurrence matrix representing the number of replica two compounds are selected as significant by LDA. The presence of a specific taxonomy in a given iteration is represented in blue and its absence in white. Panels (B and C) Illustrative reconstruction of reverse metabolic network. Plot shows the five most frequently selected metabolites during LDA analysis of metabolite-host matrix and the pathways in which they are mapped (B). For a specific metabolite (2-methyladenine), we report the obtained pathway-compound network (C). The network includes 2-methyladenine and its correlated metabolites (r1-r47) and the related pathways (p1-p35). The complete list of labels is reported Data S1. Lines represent the links between the metabolites (green, light green) and the pathways (gray). For all the panels, data are reported for the combined datasets from the study by Kang et al. and Kong et al. after batch removal.
Figure 6
Figure 6
LDA feature selection result for MS (A) Heatmaps representing the selected taxonomies for each replica of LDA feature selection reordered using hierarchical clustering and co-occurrence matrix representing the number of replica two compounds are selected as significant by LDA. The presence of a specific taxonomy in a given iteration is represented in blue and its absence in white. (B) Illustrative reconstruction of reverse metabolic network. Plot shows the five most frequently selected metabolites during LDA analysis of metabolite-host matrix and the pathways in which they are mapped. Lines represent the links between the metabolites (green, light green) and the pathways (gray) For all the panels, data are reported for the dataset in the study by Cekanaviciute et al. Metabolites names are reported in the figure, while full pathways names are only reported in Data S1.

References

    1. Thursby E., Juge N. Introduction to the human gut microbiota. Biochem. J. 2017;474:1823–1836. doi: 10.1042/BCJ20160510. - DOI - PMC - PubMed
    1. Dixit K., Chaudhari D., Dhotre D., Shouche Y., Saroj S. Restoration of dysbiotic human gut microbiome for homeostasis. Life Sci. 2021;278:119622. doi: 10.1016/j.lfs.2021.119622. - DOI - PubMed
    1. Quigley E.M. Gut bacteria in health and disease. Gastroenterol. Hepatol. 2013;9:560. - PMC - PubMed
    1. Sartor R.B., Wu G.D. Roles for intestinal bacteria, viruses, and fungi in pathogenesis of inflammatory bowel diseases and therapeutic approaches. Gastroenterology. 2017;152:327–339.e4. doi: 10.1053/j.gastro.2016.10.012. - DOI - PMC - PubMed
    1. Hughes H.K., Rose D., Ashwood P., Hughes H. The gut microbiota and dysbiosis in autism spectrum disorders. Curr. Neurol. Neurosci. Rep. 2018;18:81. doi: 10.1007/s11910-018-0887-6. - DOI - PMC - PubMed