AD-BERT: Using pre-trained language model to predict the progression from mild cognitive impairment to Alzheimer's disease
- PMID: 37429512
- PMCID: PMC11131134
- DOI: 10.1016/j.jbi.2023.104442
Abstract
Objective: We developed a deep learning framework based on the pre-trained Bidirectional Encoder Representations from Transformers (BERT) model that uses unstructured clinical notes from electronic health records (EHRs) to predict the risk of disease progression from Mild Cognitive Impairment (MCI) to Alzheimer's Disease (AD).
Methods: We identified 3657 patients diagnosed with MCI, together with their progress notes, from the Northwestern Medicine Enterprise Data Warehouse (NMEDW) between 2000 and 2020. Only progress notes dated no later than the first MCI diagnosis were used for prediction. We first preprocessed the notes by de-identifying them, cleaning them, and splitting them into sections, and then pre-trained a BERT model for AD (named AD-BERT) on the preprocessed notes, starting from the publicly available Bio+Clinical BERT. Each section of a patient's notes was embedded into a vector representation by AD-BERT, and the section embeddings were then combined by global max pooling and passed through a fully connected network to compute the probability of MCI-to-AD progression. For validation, we conducted a similar set of experiments on 2563 MCI patients identified at Weill Cornell Medicine (WCM) during the same timeframe.
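The pooling and classification step described in the Methods can be illustrated with a minimal sketch. It is not the authors' implementation: the section embeddings are assumed to be already computed (by AD-BERT or any other encoder), and the single hidden layer, the ReLU and sigmoid activations, the toy dimensions, and all function and parameter names are assumptions made for illustration only.

```python
import math
from typing import List, Sequence


def global_max_pool(section_embeddings: Sequence[Sequence[float]]) -> List[float]:
    """Element-wise maximum over all section vectors of one patient."""
    if not section_embeddings:
        raise ValueError("a patient needs at least one section embedding")
    return [max(dim_values) for dim_values in zip(*section_embeddings)]


def dense(x: Sequence[float],
          weights: Sequence[Sequence[float]],
          bias: Sequence[float]) -> List[float]:
    """Fully connected layer: one output per weight row."""
    return [sum(w * v for w, v in zip(row, x)) + b
            for row, b in zip(weights, bias)]


def relu(x: Sequence[float]) -> List[float]:
    return [max(0.0, v) for v in x]


def sigmoid(z: float) -> float:
    return 1.0 / (1.0 + math.exp(-z))


def progression_probability(section_embeddings, w_hidden, b_hidden, w_out, b_out) -> float:
    """Pool the section vectors, then map the pooled vector to a probability.

    Assumed architecture (not stated in the abstract): one hidden layer
    with ReLU, followed by a single sigmoid output unit.
    """
    pooled = global_max_pool(section_embeddings)
    hidden = relu(dense(pooled, w_hidden, b_hidden))
    logit = dense(hidden, [w_out], [b_out])[0]
    return sigmoid(logit)


# Toy example: two note sections, each embedded as a 3-dimensional vector.
sections = [[0.1, -0.4, 0.7],
            [0.5, 0.2, -0.3]]
w_hidden = [[1.0, 0.0, 0.0],
            [0.0, 1.0, 1.0]]   # hypothetical weights, 2 hidden units
b_hidden = [0.0, 0.0]
w_out = [1.0, -1.0]            # hypothetical output weights
b_out = 0.0

print(global_max_pool(sections))
print(progression_probability(sections, w_hidden, b_hidden, w_out, b_out))
```

Max pooling keeps, for each embedding dimension, the strongest signal found in any section, so the pooled vector has a fixed size regardless of how many sections a patient has.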
Results: Compared with the 7 baseline models, the AD-BERT model achieved the best performance on both datasets, with an area under the receiver operating characteristic curve (AUC) of 0.849 and an F1 score of 0.440 on the NMEDW dataset, and an AUC of 0.883 and an F1 score of 0.680 on the WCM dataset.
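For readers less familiar with the two reported metrics, the sketch below shows how AUC and F1 can be computed from labels and predicted scores. It uses the standard definitions rather than the authors' evaluation code; the toy labels, the scores, and the 0.5 decision threshold are assumptions. AUC is computed over all positive-negative score pairs and needs no threshold, whereas F1 depends on the chosen threshold and on class balance, which is one reason the two numbers can differ widely for the same model.

```python
from typing import Sequence


def roc_auc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the fraction of (positive, negative) pairs ranked correctly.

    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def f1_score(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Harmonic mean of precision and recall for the positive class."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Toy data: 1 = progressed to AD, 0 = did not progress.
labels = [1, 0, 1, 0]
scores = [0.9, 0.8, 0.4, 0.1]
threshold = 0.5  # assumed decision threshold
predictions = [1 if s >= threshold else 0 for s in scores]

print(roc_auc(labels, scores))
print(f1_score(labels, predictions))
```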
Conclusion: The use of EHRs for AD-related research is promising, and AD-BERT outperformed the baseline models in predicting MCI-to-AD progression. Our study demonstrates the utility of pre-trained language models and clinical notes for this prediction task, which could have important implications for improving early detection and intervention for AD.
Keywords: Alzheimer's disease; Electronic health records; Mild cognitive impairment; Pre-trained language model.
Copyright © 2023 Elsevier Inc. All rights reserved.
Conflict of interest statement
Declaration of Competing Interest: The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Similar articles
- Predicting cognitive decline: Deep-learning reveals subtle brain changes in pre-MCI stage. J Prev Alzheimers Dis. 2025 May;12(5):100079. doi: 10.1016/j.tjpad.2025.100079. Epub 2025 Feb 6. PMID: 39920001 Free PMC article.
- Trajectory-Ordered Objectives for Self-Supervised Representation Learning of Temporal Healthcare Data Using Transformers: Model Development and Evaluation Study. JMIR Med Inform. 2025 Jun 4;13:e68138. doi: 10.2196/68138. PMID: 40465350 Free PMC article.
- Mini-Mental State Examination (MMSE) for the detection of Alzheimer's disease and other dementias in people with mild cognitive impairment (MCI). Cochrane Database Syst Rev. 2015 Mar 5;2015(3):CD010783. doi: 10.1002/14651858.CD010783.pub2. Update in: Cochrane Database Syst Rev. 2021 Jul 27;7:CD010783. doi: 10.1002/14651858.CD010783.pub3. PMID: 25740785 Free PMC article. Updated.
- 18F PET with florbetapir for the early diagnosis of Alzheimer's disease dementia and other dementias in people with mild cognitive impairment (MCI). Cochrane Database Syst Rev. 2017 Nov 22;11(11):CD012216. doi: 10.1002/14651858.CD012216.pub2. PMID: 29164603 Free PMC article.
- 18F PET with flutemetamol for the early diagnosis of Alzheimer's disease dementia and other dementias in people with mild cognitive impairment (MCI). Cochrane Database Syst Rev. 2017 Nov 22;11(11):CD012884. doi: 10.1002/14651858.CD012884. PMID: 29164602 Free PMC article.
Cited by
- Introduction to Large Language Models (LLMs) for dementia care and research. Front Dement. 2024 May 14;3:1385303. doi: 10.3389/frdem.2024.1385303. eCollection 2024. PMID: 39081594 Free PMC article.
- Building a Human Digital Twin (HDTwin) Using Large Language Models for Cognitive Diagnosis: Algorithm Development and Validation. JMIR Form Res. 2024 Dec 23;8:e63866. doi: 10.2196/63866. PMID: 39715540 Free PMC article.
- Evaluating the validity of the nursing statements algorithmically generated based on the International Classifications of Nursing Practice for respiratory nursing care using large language models. J Am Med Inform Assoc. 2024 May 20;31(6):1397-1403. doi: 10.1093/jamia/ocae070. PMID: 38630586 Free PMC article.
- Engineering of Generative Artificial Intelligence and Natural Language Processing Models to Accurately Identify Arrhythmia Recurrence. Circ Arrhythm Electrophysiol. 2025 Jan;18(1):e013023. doi: 10.1161/CIRCEP.124.013023. Epub 2024 Dec 16. PMID: 39676642
- Determining the Importance of Clinical Modalities for NeuroDegenerative Disorders and Risk of Patient Injury Using Machine Learning and Survival Analysis. AMIA Jt Summits Transl Sci Proc. 2025 Jun 10;2025:385-394. eCollection 2025. PMID: 40502273 Free PMC article.