Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Multicenter Study
. 2025 Dec 15;39(23):e71266.
doi: 10.1096/fj.202503352R.

An Interpretable Machine Learning Model Based on Metabolomics for Predicting Plaque Burden in Cryptogenic Stroke

Affiliations
Multicenter Study

An Interpretable Machine Learning Model Based on Metabolomics for Predicting Plaque Burden in Cryptogenic Stroke

Zi-Miao Liu et al. FASEB J. .

Abstract

Cryptogenic stroke represents 25%-40% of ischemic strokes, with many cases harboring unrecognized large artery atherosclerosis (LAA) requiring specific secondary prevention. In this multicenter pilot study, we developed a metabolomics-based machine learning model to identify LAA and predict plaque burden in cryptogenic stroke patients. Plasma metabolites from 572 acute ischemic stroke patients across three hospitals were analyzed using untargeted metabolomics. A two-stage machine learning approach was developed: Model 1 distinguished cardioembolic (CE) from non-CE stroke, and Model 2 separated LAA from small vessel occlusion (SVO). Models were integrated to directly predict LAA among all subtypes. Model 1 achieved exceptional performance distinguishing CE from non-CE (AUC = 0.998, accuracy = 97.9%), with pyruvate and glutamine as key discriminators. Model 2 differentiated LAA from SVO with AUC = 0.949, identifying pyroglutamate and 2-hydroxybutyrate as primary markers. The combined model predicted LAA with AUC = 0.821. Critically, cryptogenic stroke patients predicted as LAA showed significantly higher plaque burden than those predicted as CE/SVO (23.9% vs. 10.9%, p = 0.014), with increased neovascularization and ulcerative features. This metabolomics-based approach accurately identifies LAA in cryptogenic stroke patients and predicts atherosclerotic plaque burden, offering a novel diagnostic tool to guide personalized antithrombotic therapy selection. Trial Registration: Chinese Clinical Trial Registry registration number: ChiCTR1800015956.

Keywords: acute ischemic stroke; machine learning; plasma metabolites.

PubMed Disclaimer

References

    1. GBD 2019 Stroke Collaborators, “Global, Regional, and National Burden of Stroke and Its Risk Factors, 1990–2019: A Systematic Analysis for the Global Burden of Disease Study 2019,” Lancet Neurology 20, no. 10 (2021): 795–820, https://doi.org/10.1016/S1474‐4422(21)00252‐0.
    1. M. O. Owolabi, A. G. Thrift, A. Mahal, et al., “Primary Stroke Prevention Worldwide: Translating Evidence Into Action,” Lancet Public Health 7, no. 1 (2022): e74–e85, https://doi.org/10.1016/S2468‐2667(21)00230‐9.
    1. H. J. Adams, B. H. Bendixen, L. J. Kappelle, et al., “Classification of Subtype of Acute Ischemic Stroke. Definitions for Use in a Multicenter Clinical Trial. TOAST. Trial of Org 10172 in Acute Stroke Treatment,” Stroke 24, no. 1 (1993): 35–41, https://doi.org/10.1161/01.str.24.1.35.
    1. O. Y. Bang, “Intracranial Atherosclerosis: Current Understanding and Perspectives,” Journal of Stroke 16, no. 1 (2014): 27–35, https://doi.org/10.5853/jos.2014.16.1.27.
    1. J. S. Kim and D. Bonovich, “Research on Intracranial Atherosclerosis From the East and West: Why Are the Results Different?,” Journal of Stroke 16, no. 3 (2014): 105–113, https://doi.org/10.5853/jos.2014.16.3.105.

Publication types

LinkOut - more resources