Development and validation of a novel predictive model for dementia risk in middle-aged and elderly depression individuals: a large and longitudinal machine learning cohort study
- PMID: 40361211
- PMCID: PMC12070709
- DOI: 10.1186/s13195-025-01750-6
Development and validation of a novel predictive model for dementia risk in middle-aged and elderly depression individuals: a large and longitudinal machine learning cohort study
Abstract
Background: Depression serves as a prodromal symptom of dementia, and individuals with depression exhibit a significantly higher risk of developing dementia. The aim of this study is to develop and validate a novel dementia risk prediction tool among middle-aged and elderly individuals with depression based on machine learning algorithms.
Methods: This study included 31,587 middle-aged and elderly individuals with depression who did not have a diagnosis of dementia at baseline from a large UK population-based prospective cohort. A rigorous variable selection strategy was employed to identify risk and protective factors of dementia from an initial pool of 190 candidate variables, ultimately retaining 27 variables. Eight distinct data analysis strategies were utilized to develop and validate the dementia risk prediction model. The DeLong's test was applied to compare the statistical differences between different models.
Results: During a median follow-up of 7.98 years, 896 incident dementia cases were identified among study participants. In model development employing an 8:2 data split (fivefold cross-validation for training), the Adaboost classifier achieved the optimal performance (AUC 0.861 ± 0.003), followed by XGBoost (AUC 0.839 ± 0.005) and CatBoost (AUC 0.828 ± 0.007) classifiers. To facilitate community generalization and clinical applicability, we develop a simplified model through a forward feature subset selection algorithm, retaining 12 variables. The simplified model maintained robust performance, with AdaBoost achieving the highest discriminative ability (AUC 0.859 ± 0.002), followed by XGBoost (AUC 0.835 ± 0.001) and CatBoost (AUC 0.821 ± 0.005). The DeLong's test revealed no statistically significant difference in AUC values between models using 12 and 27 variables (p = 0.278). For practical implementation, we deployed the optimal model to a web application for visualization and dementia risk assessment, named DRP-Depression.
Conclusions: We developed a practical and easy-to-promote risk prediction model based on machine learning algorithms, and deployed it to a web application to provide a new and convenient tool for dementia risk prediction in the middle-aged and elderly individuals with depression.
Keywords: DRP-Depression; Dementia; Depression; Machine learning; Risk prediction; UK Biobank; Web application.
© 2025. The Author(s).
Conflict of interest statement
Declarations. Ethics approval and consent to participate: Our study was approved by the North West Multi-Center Research Ethics Committee (REC reference: 16/NW/0274). All participants were asked to sign an informed consent form before participating in this study. Our study followed the Transparent Reporting of a Multivariable Prediction Model for Individual Prognosis or Diagnosis (TRIPOD) reporting guideline Competing interests: The authors declare no competing interests.
Figures



Similar articles
-
Development and validation of a convenient dementia risk prediction tool for diabetic population: A large and longitudinal machine learning cohort study.J Affect Disord. 2025 Jul 1;380:298-307. doi: 10.1016/j.jad.2025.03.135. Epub 2025 Mar 25. J Affect Disord. 2025. PMID: 40147608
-
A Multivariable Prediction Model for Mild Cognitive Impairment and Dementia: Algorithm Development and Validation.JMIR Med Inform. 2024 Nov 22;12:e59396. doi: 10.2196/59396. JMIR Med Inform. 2024. PMID: 39576972 Free PMC article.
-
Development of a novel dementia risk prediction model in the general population: A large, longitudinal, population-based machine-learning study.EClinicalMedicine. 2022 Sep 23;53:101665. doi: 10.1016/j.eclinm.2022.101665. eCollection 2022 Nov. EClinicalMedicine. 2022. PMID: 36187723 Free PMC article.
-
Early prediction of sepsis associated encephalopathy in elderly ICU patients using machine learning models: a retrospective study based on the MIMIC-IV database.Front Cell Infect Microbiol. 2025 Apr 17;15:1545979. doi: 10.3389/fcimb.2025.1545979. eCollection 2025. Front Cell Infect Microbiol. 2025. PMID: 40313459 Free PMC article.
-
Statistical primer: sample size considerations for developing and validating clinical prediction models.Eur J Cardiothorac Surg. 2025 May 6;67(5):ezaf142. doi: 10.1093/ejcts/ezaf142. Eur J Cardiothorac Surg. 2025. PMID: 40279277 Free PMC article. Review.
Cited by
-
Suppression of COVID-19 death incidence on open west coasts in the USA.Sci Rep. 2025 Aug 5;15(1):28542. doi: 10.1038/s41598-025-12972-x. Sci Rep. 2025. PMID: 40764350 Free PMC article.
-
Machine Learning-Powered Smart Healthcare Systems in the Era of Big Data: Applications, Diagnostic Insights, Challenges, and Ethical Implications.Diagnostics (Basel). 2025 Jul 30;15(15):1914. doi: 10.3390/diagnostics15151914. Diagnostics (Basel). 2025. PMID: 40804880 Free PMC article. Review.
References
-
- World Health Organization. Life expectancy and leading causes of death and disability. https://www.who.int/data/gho/data/themes/mortality-and-global-health-est.... Accessed 16 Oct 2022.
-
- Alzheimer's Disease International. Dementia statistics. https://www.alzint.org/about/dementia-facts-figures/dementia-statistics/. Accessed 16 Oct 2022.
-
- The DT, Initiative EDP. Lancet Neurol. 2013;12(3):227–8. https://doi.org/10.1016/S14744422(13)70030-9. - PubMed
-
- Yang L, Deng YT, Leng Y, Ou YN, Li YZ, Chen SD, He XY, Wu BS, Huang SY, Zhang YR, Kuo K, Feng W, Dong Q, Feng JF, Suckling J, Smith AD, Li F, Cheng W, Yu JT. Depression, depression treatments, and risk of incident dementia: a prospective cohort study of 354,313 participants. Biol Psychiatry. 2023;93(9):802–9. 10.1016/j.biopsych.2022.08.026. Epub 2022 Sep 5 PMID: 36526487. - PubMed
-
- WHO. Depression and other common mental disorders: global health estimates. Geneva: World Health Organization; 2017.
Publication types
MeSH terms
Grants and funding
- No. 82460341/National Natural Science Foundation of China
- 20224BAB216077/Natural Science Foundation of Jiangxi Province
- 2022JJ40417 and 2021SK52203/Natural Science Foundation of Hunan Province
- YXZN2021003/Key Laboratory of Medical Imaging and Artificial Intelligence of Hunan Province
- No. 20242BAB23086/Science Foundation for Distinguished Young Scholars of Jiangxi Province
LinkOut - more resources
Full Text Sources
Medical