An Interpretable Machine Learning Model Based on Metabolomics for Predicting Plaque Burden in Cryptogenic Stroke
- PMID: 41313589
- DOI: 10.1096/fj.202503352R
An Interpretable Machine Learning Model Based on Metabolomics for Predicting Plaque Burden in Cryptogenic Stroke
Abstract
Cryptogenic stroke represents 25%-40% of ischemic strokes, with many cases harboring unrecognized large artery atherosclerosis (LAA) requiring specific secondary prevention. In this multicenter pilot study, we developed a metabolomics-based machine learning model to identify LAA and predict plaque burden in cryptogenic stroke patients. Plasma metabolites from 572 acute ischemic stroke patients across three hospitals were analyzed using untargeted metabolomics. A two-stage machine learning approach was developed: Model 1 distinguished cardioembolic (CE) from non-CE stroke, and Model 2 separated LAA from small vessel occlusion (SVO). Models were integrated to directly predict LAA among all subtypes. Model 1 achieved exceptional performance distinguishing CE from non-CE (AUC = 0.998, accuracy = 97.9%), with pyruvate and glutamine as key discriminators. Model 2 differentiated LAA from SVO with AUC = 0.949, identifying pyroglutamate and 2-hydroxybutyrate as primary markers. The combined model predicted LAA with AUC = 0.821. Critically, cryptogenic stroke patients predicted as LAA showed significantly higher plaque burden than those predicted as CE/SVO (23.9% vs. 10.9%, p = 0.014), with increased neovascularization and ulcerative features. This metabolomics-based approach accurately identifies LAA in cryptogenic stroke patients and predicts atherosclerotic plaque burden, offering a novel diagnostic tool to guide personalized antithrombotic therapy selection. Trial Registration: Chinese Clinical Trial Registry registration number: ChiCTR1800015956.
Keywords: acute ischemic stroke; machine learning; plasma metabolites.
© 2025 Federation of American Societies for Experimental Biology.
References
-
- GBD 2019 Stroke Collaborators, “Global, Regional, and National Burden of Stroke and Its Risk Factors, 1990–2019: A Systematic Analysis for the Global Burden of Disease Study 2019,” Lancet Neurology 20, no. 10 (2021): 795–820, https://doi.org/10.1016/S1474‐4422(21)00252‐0.
-
- M. O. Owolabi, A. G. Thrift, A. Mahal, et al., “Primary Stroke Prevention Worldwide: Translating Evidence Into Action,” Lancet Public Health 7, no. 1 (2022): e74–e85, https://doi.org/10.1016/S2468‐2667(21)00230‐9.
-
- H. J. Adams, B. H. Bendixen, L. J. Kappelle, et al., “Classification of Subtype of Acute Ischemic Stroke. Definitions for Use in a Multicenter Clinical Trial. TOAST. Trial of Org 10172 in Acute Stroke Treatment,” Stroke 24, no. 1 (1993): 35–41, https://doi.org/10.1161/01.str.24.1.35.
-
- O. Y. Bang, “Intracranial Atherosclerosis: Current Understanding and Perspectives,” Journal of Stroke 16, no. 1 (2014): 27–35, https://doi.org/10.5853/jos.2014.16.1.27.
-
- J. S. Kim and D. Bonovich, “Research on Intracranial Atherosclerosis From the East and West: Why Are the Results Different?,” Journal of Stroke 16, no. 3 (2014): 105–113, https://doi.org/10.5853/jos.2014.16.3.105.
Publication types
MeSH terms
Substances
Grants and funding
- 2017YFC1307500/National Key Research and Development Program of China
- 82271305/National Natural Science Foundation of China
- 2022A1515012525/Basic and Applied Basic Research Foundation of Guangdong Province ()
- 2021A1515011027/Basic and Applied Basic Research Foundation of Guangdong Province ()
- 2024A1515140109/Basic and Applied Basic Research Foundation of Guangdong Province
LinkOut - more resources
Full Text Sources
Medical
