Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Jun 17;10(1):24.
doi: 10.1186/s40635-022-00445-8.

Machine learning approaches to the human metabolome in sepsis identify metabolic links with survival

Affiliations

Machine learning approaches to the human metabolome in sepsis identify metabolic links with survival

Leah B Kosyakovsky et al. Intensive Care Med Exp. .

Abstract

Background: Metabolic predictors and potential mediators of survival in sepsis have been incompletely characterized. We examined whether machine learning (ML) tools applied to the human plasma metabolome could consistently identify and prioritize metabolites implicated in sepsis survivorship, and whether these methods improved upon conventional statistical approaches.

Methods: Plasma gas chromatography-liquid chromatography mass spectrometry quantified 411 metabolites measured ≤ 72 h of ICU admission in 60 patients with sepsis at a single center (Brigham and Women's Hospital, Boston, USA). Seven ML approaches were trained to differentiate survivors from non-survivors. Model performance predicting 28 day mortality was assessed through internal cross-validation, and innate top-feature (metabolite) selection and rankings were compared across the 7 ML approaches and with conventional statistical methods (logistic regression). Metabolites were consensus ranked by a summary, ensemble ML ranking procedure weighing their contribution to mortality risk prediction across multiple ML models.

Results: Median (IQR) patient age was 58 (47, 62) years, 45% were women, and median (IQR) SOFA score was 9 (6, 12). Mortality at 28 days was 42%. The models' specificity ranged from 0.619 to 0.821. Partial least squares regression-discriminant analysis and nearest shrunken centroids prioritized the greatest number of metabolites identified by at least one other method. Penalized logistic regression demonstrated top-feature results that were consistent with many ML methods. Across the plasma metabolome, the 13 metabolites with the strongest linkage to mortality defined through an ensemble ML importance score included lactate, bilirubin, kynurenine, glycochenodeoxycholate, phenylalanine, and others. Four of these top 13 metabolites (3-hydroxyisobutyrate, indoleacetate, fucose, and glycolithocholate sulfate) have not been previously associated with sepsis survival. Many of the prioritized metabolites are constituents of the tryptophan, pyruvate, phenylalanine, pentose phosphate, and bile acid pathways.

Conclusions: We identified metabolites linked with sepsis survival, some confirming prior observations, and others representing new associations. The application of ensemble ML feature-ranking tools to metabolomic data may represent a promising statistical platform to support biologic target discovery.

Keywords: Artificial intelligence; Machine learning; Metabolism; Metabolomics; Sepsis.

PubMed Disclaimer

Conflict of interest statement

RMB takes part in advisory boards for Merck and Genentech; all remaining authors have no relevant financial disclosures.

Figures

Fig. 1
Fig. 1
Heatmap showing normalized metabolite levels grouped by super-pathway among patients with sepsis. Heatmap showing normalized metabolite levels (rows), grouped by super-pathway, among patients with sepsis following hierarchical clustering (dendrogram, top); survival status is annotated (grey = yes, black = no). Differences in the metabolome among survivors and non-survivors were noted across multiple metabolic super-pathways
Fig. 2
Fig. 2
Comparison of top metabolites selected by each analysis method’s innate feature selection algorithm. Comparison of top metabolites selected by each analysis method’s innate feature selection algorithm, identifying metabolites that more meaningfully contribute to successful sepsis mortality prediction models. Such approaches may identify measures of association and individual metabolic links with mortality. Agreement was noted between the lists of top metabolites identified by several machine learning methods, which also overlapped with those identified by conventional panelized logistic regression. FDA flexible discriminant analysis, GBM generalized boosted regression models, LR logistic regression, NSC nearest shrunken centroids, PLS-DA partial least squares-discriminant analysis, sparse LDA sparse discriminant analysis

References

    1. Rudd KE, Johnson SC, Agesa KM, et al. Global, regional, and national sepsis incidence and mortality, 1990–2017: analysis for the Global Burden of Disease Study. Lancet. 2020;395:200–211. doi: 10.1016/S0140-6736(19)32989-7. - DOI - PMC - PubMed
    1. Eckerle M, Ambroggio L, Puskarich MA, Winston B, Jones AE, Standiford TJ, Stringer KA. Metabolomics as a driver in advancing precision medicine in sepsis. Pharmacotherapy. 2017;37:1023–1032. doi: 10.1002/phar.1974. - DOI - PMC - PubMed
    1. Banoei MM, Donnelly SJ, Mickiewicz B, Weljie A, Vogel HJ, Winston BW. Metabolomics in critical care medicine: a new approach to biomarker discovery. Clin Investig Med. 2014;37:E363–E376. doi: 10.25011/cim.v37i6.22241. - DOI - PubMed
    1. Cheng S, Shah SH, Corwin EJ, et al. Potential impact and study considerations of metabolomics in cardiovascular health and disease: a Scientific Statement from the American Heart Association. Circ Cardiovasc Genet. 2017;10(2):e000032. doi: 10.1161/HCG.0000000000000032. - DOI - PMC - PubMed
    1. Nicholson JK, Holmes E, Kinross JM, Darzi AW, Takats Z, Lindon JC. Metabolic phenotyping in clinical and surgical environments. Nature. 2012;491:384–392. doi: 10.1038/nature11708. - DOI - PubMed

LinkOut - more resources