Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Aug 27.
doi: 10.1007/s11357-025-01828-x. Online ahead of print.

Integrative machine learning approach to risk prediction for dementia and Alzheimer's disease

Affiliations

Integrative machine learning approach to risk prediction for dementia and Alzheimer's disease

Amos Stern et al. Geroscience. .

Abstract

Dementia, particularly Alzheimer's disease (AD), presents a growing global health challenge characterized by cognitive decline, behavioral changes, and loss of independence. With increasing life expectancy, early diagnosis and improved clinical strategies are urgently needed. This study developed and evaluated machine learning (ML) models to predict AD risk using UK Biobank data, integrating health, genetic, and lifestyle factors. The cohort included 2878 AD cases and 72,366 controls. Among several algorithms, CatBoost performed best (ROC-AUC = 0.773), especially in females. Inputs included ICD-10 codes from 5 years pre-diagnosis, ApoE-ε4 genotype, and large collection of modifiable risk factors. Despite fewer cases, the risk predictive models for vascular dementia (VaD) outperformed the unique AD models. ApoE-ε4 was the most predictive genetic marker, while other common variants had limited utility. Key non-genetic predictors included comorbidities (e.g., diabetes, hypertension), education, physical activity, and diet. These findings highlight the value of integrating diverse data sources for dementia risk prediction and emphasize the role of sex-specific modeling and modifiable factors in early, personalized intervention strategies.

Keywords: APOE; AUC; Feature selection; GWAS; PWAS; SHAP values; UK Biobank.

PubMed Disclaimer

Conflict of interest statement

Declarations. Ethics: The study was approved by the University Committee for the Use of Human Subjects in Research Approval number 12072022 (July 2025). This study uses the UK-Biobank (UKB) application ID 26664 (Linial lab). Competing interests: The authors declare no competing interests.

Similar articles

References

    1. Scheltens P, De Strooper B, Kivipelto M, Holstege H, Chételat G, Teunissen CE, Cummings J, van der Flier WM. Alzheimer’s disease. Lancet. 2021;397:1577–90. - PubMed - PMC
    1. Garre-Olmo J. Epidemiology of Alzheimer’s disease and other dementias. Rev Neurol. 2018;66:377–86. - PubMed
    1. Zhang X-X, Tian Y, Wang Z-T, Ma Y-H, Tan L, Yu J-T. The epidemiology of Alzheimer’s disease modifiable risk factors and prevention. J Prevent Alzheim Dis. 2021;8:313–21.
    1. Li X, Feng X, Sun X, Hou N, Han F, Liu Y. Global, regional, and national burden of Alzheimer’s disease and other dementias, 1990–2019. Front Ag Neurosci. 2022;14:937486.
    1. Kumar A, Singh A. A review on Alzheimer’s disease pathophysiology and its management: an update. Pharmacol Rep. 2015;67:195–203. - PubMed

LinkOut - more resources