Using machine learning and electronic health record (EHR) data for the early prediction of Alzheimer's Disease and Related Dementias
- PMID: 40246680
- PMCID: PMC12321625
- DOI: 10.1016/j.tjpad.2025.100169
Using machine learning and electronic health record (EHR) data for the early prediction of Alzheimer's Disease and Related Dementias
Abstract
Background: Over 6 million patients in the United States are affected by Alzheimer's Disease and Related Dementias (ADRD). Early detection of ADRD can significantly improve patient outcomes through timely treatment.
Objective: To develop and validate machine learning (ML) models for early ADRD diagnosis and prediction using de-identified EHR data from the University of Missouri (MU) Healthcare.
Design: Retrospective case-control study.
Setting: The study used de-identified EHR data provided by the MU NextGen Biomedical Informatics, modeled with the PCORnet Common Data Model (CDM).
Participants: An initial cohort of 380,269 patients aged 40 or older with at least two healthcare encounters was narrowed to a final dataset of 4,012 ADRD cases and 119,723 controls.
Methods: Six ML classifier models: Gradient-Boosted Trees (GBT), Light Gradient-Boosting Machine (LightGBM), Random Forest (RF), eXtreme Gradient-Boosting (XGBoost), Logistic Regression (LR), and Adaptive Boosting (AdaBoost) were evaluated using Area Under the Receiver Operating Characteristic Curve (AUC-ROC), accuracy, sensitivity, specificity, and F1 score. SHAP (SHapley Additive exPlanations) analysis was applied to interpret predictions.
Results: The GBT model achieved the best AUC-ROC scores of 0.809-0.833 across 1- to 5-year prediction windows. SHAP analysis identified depressive disorder, age groups 80-90 yrs and 70-80 yrs, heart disease, anxiety, and the novel risk factors of sleep apnea, and headache.
Conclusion: This study underscores the potential of ML models for leveraging EHR data to enable early ADRD prediction, supporting timely interventions, and improving patient outcomes. By identifying both established and novel risk factors, these findings offer new opportunities for personalized screening and management strategies, advancing both clinical and informatics science.
Keywords: Alzheimer's disease; Dementias; Early prediction; Electronic health record data; Machine learning (ML).
Copyright © 2025. Published by Elsevier Masson SAS.
Conflict of interest statement
Declaration of competing interest The authors have no conflicts of interest to disclose.
Figures




Similar articles
-
Supervised Machine Learning Models for Predicting Sepsis-Associated Liver Injury in Patients With Sepsis: Development and Validation Study Based on a Multicenter Cohort Study.J Med Internet Res. 2025 May 26;27:e66733. doi: 10.2196/66733. J Med Internet Res. 2025. PMID: 40418571 Free PMC article.
-
Predicting Early-Onset Colorectal Cancer in Individuals Below Screening Age Using Machine Learning and Real-World Data: Case Control Study.JMIR Cancer. 2025 Jun 19;11:e64506. doi: 10.2196/64506. JMIR Cancer. 2025. PMID: 40537065 Free PMC article.
-
Machine Learning Model for Predicting Coronary Heart Disease Risk: Development and Validation Using Insights From a Japanese Population-Based Study.JMIR Cardio. 2025 May 12;9:e68066. doi: 10.2196/68066. JMIR Cardio. 2025. PMID: 40354648 Free PMC article.
-
Plasma and cerebrospinal fluid amyloid beta for the diagnosis of Alzheimer's disease dementia and other dementias in people with mild cognitive impairment (MCI).Cochrane Database Syst Rev. 2014 Jun 10;2014(6):CD008782. doi: 10.1002/14651858.CD008782.pub4. Cochrane Database Syst Rev. 2014. PMID: 24913723 Free PMC article.
-
Passive digital markers for Alzheimer's disease and other related dementias: A systematic evidence review.J Am Geriatr Soc. 2023 Sep;71(9):2966-2974. doi: 10.1111/jgs.18426. Epub 2023 May 30. J Am Geriatr Soc. 2023. PMID: 37249252
Cited by
-
Integrative machine learning approach to risk prediction for dementia and Alzheimer's disease.Geroscience. 2025 Aug 27. doi: 10.1007/s11357-025-01828-x. Online ahead of print. Geroscience. 2025. PMID: 40864401
References
-
- Šerý O., Povová J., Míšek I., Pešák L., Janout V. Molecular mechanisms of neuropathological changes in Alzheimer's disease: a review. Folia Neuropathol. 2013;51(1):1–9. - PubMed
MeSH terms
LinkOut - more resources
Full Text Sources
Medical
Research Materials
Miscellaneous