. 2023 Aug:144:104442.

doi: 10.1016/j.jbi.2023.104442. Epub 2023 Jul 8.

AD-BERT: Using pre-trained language model to predict the progression from mild cognitive impairment to Alzheimer's disease

Affiliations

¹ Department of Preventive Medicine, Feinberg School of Medicine, Northwestern University, Chicago, IL, United States.
² Department of Health Outcomes and Biomedical Informatics, University of Florida, Gainesville, FL, United States; Weill Cornell Medicine, New York, NY, United States.
³ Weill Cornell Medicine, New York, NY, United States.
⁴ Department of Neurology, Feinberg School of Medicine, Northwestern University, Chicago, IL, United States.
⁵ Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania, Philadelphia, PA, United States.
⁶ Mayo Clinic, Rochester, MN, United States.
⁷ Department of Preventive Medicine, Feinberg School of Medicine, Northwestern University, Chicago, IL, United States. Electronic address: yuan.luo@northwestern.edu.

PMID: 37429512
PMCID: PMC11131134
DOI: 10.1016/j.jbi.2023.104442

AD-BERT: Using pre-trained language model to predict the progression from mild cognitive impairment to Alzheimer's disease

Chengsheng Mao et al. J Biomed Inform. 2023 Aug.

. 2023 Aug:144:104442.

doi: 10.1016/j.jbi.2023.104442. Epub 2023 Jul 8.

Authors

Affiliations

¹ Department of Preventive Medicine, Feinberg School of Medicine, Northwestern University, Chicago, IL, United States.
² Department of Health Outcomes and Biomedical Informatics, University of Florida, Gainesville, FL, United States; Weill Cornell Medicine, New York, NY, United States.
³ Weill Cornell Medicine, New York, NY, United States.
⁴ Department of Neurology, Feinberg School of Medicine, Northwestern University, Chicago, IL, United States.
⁵ Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania, Philadelphia, PA, United States.
⁶ Mayo Clinic, Rochester, MN, United States.
⁷ Department of Preventive Medicine, Feinberg School of Medicine, Northwestern University, Chicago, IL, United States. Electronic address: yuan.luo@northwestern.edu.

PMID: 37429512
PMCID: PMC11131134
DOI: 10.1016/j.jbi.2023.104442

Abstract

Objective: We develop a deep learning framework based on the pre-trained Bidirectional Encoder Representations from Transformers (BERT) model using unstructured clinical notes from electronic health records (EHRs) to predict the risk of disease progression from Mild Cognitive Impairment (MCI) to Alzheimer's Disease (AD).

Methods: We identified 3657 patients diagnosed with MCI together with their progress notes from Northwestern Medicine Enterprise Data Warehouse (NMEDW) between 2000 and 2020. The progress notes no later than the first MCI diagnosis were used for the prediction. We first preprocessed the notes by deidentification, cleaning and splitting into sections, and then pre-trained a BERT model for AD (named AD-BERT) based on the publicly available Bio+Clinical BERT on the preprocessed notes. All sections of a patient were embedded into a vector representation by AD-BERT and then combined by global MaxPooling and a fully connected network to compute the probability of MCI-to-AD progression. For validation, we conducted a similar set of experiments on 2563 MCI patients identified at Weill Cornell Medicine (WCM) during the same timeframe.

Results: Compared with the 7 baseline models, the AD-BERT model achieved the best performance on both datasets, with Area Under receiver operating characteristic Curve (AUC) of 0.849 and F1 score of 0.440 on NMEDW dataset, and AUC of 0.883 and F1 score of 0.680 on WCM dataset.

Conclusion: The use of EHRs for AD-related research is promising, and AD-BERT shows superior predictive performance in modeling MCI-to-AD progression prediction. Our study demonstrates the utility of pre-trained language models and clinical notes in predicting MCI-to-AD progression, which could have important implications for improving early detection and intervention for AD.

Keywords: Alzheimer's disease; Electronic health records; Mild cognitive impairment; Pre-trained language model.

PubMed Disclaimer

Conflict of interest statement

Declaration of Competing Interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Figures

**Figure 1**
Inclusion and exclusion criteria for the study cohorts for (a) NMEDW and (b) WCM.

**Figure 2**
The overview of our framework. The notes of a patient are split into sections, which are then fed to the pretrained AD-BERT model to generate a representation for each section. The patient representation is generated by global MaxPooling that aggregates all the section representations. Finally, a linear classifier combined with a sigmoid activation layer is used to predict probability of MCI-to-AD progression.

**Figure 3**
The illustration of samples in case and control groups. (a) For no-restrict prediction, the case and control groups are differentiated by the AD diagnosis condition after MCI diagnosis, as reflected in all diagnostic records. (b) For x-month prediction, in addition to the AD diagnosis condition within x months after MCI diagnosis, we also enforce a time constraint on the control group by requiring the last encounter to occur after x months to ensure the patients have a conversion time of at least x months.

**Figure 4**
Attention visualization of AD-BERT. The model pays more attention to the terms like “memory”, “MCI” and “difficulty recalling dates” than others.

See this image and copyright information in PMC

Cited by

Introduction to Large Language Models (LLMs) for dementia care and research.
Treder MS, Lee S, Tsvetanov KA. Treder MS, et al. Front Dement. 2024 May 14;3:1385303. doi: 10.3389/frdem.2024.1385303. eCollection 2024. Front Dement. 2024. PMID: 39081594 Free PMC article.
Building a Human Digital Twin (HDTwin) Using Large Language Models for Cognitive Diagnosis: Algorithm Development and Validation.
Sprint G, Schmitter-Edgecombe M, Cook D. Sprint G, et al. JMIR Form Res. 2024 Dec 23;8:e63866. doi: 10.2196/63866. JMIR Form Res. 2024. PMID: 39715540 Free PMC article.
Evaluating the validity of the nursing statements algorithmically generated based on the International Classifications of Nursing Practice for respiratory nursing care using large language models.
Kim H, Park H, Kang S, Kim J, Kim J, Jung J, Taira R. Kim H, et al. J Am Med Inform Assoc. 2024 May 20;31(6):1397-1403. doi: 10.1093/jamia/ocae070. J Am Med Inform Assoc. 2024. PMID: 38630586 Free PMC article.
Engineering of Generative Artificial Intelligence and Natural Language Processing Models to Accurately Identify Arrhythmia Recurrence.
Feng R, Brennan KA, Azizi Z, Goyal J, Deb B, Chang HJ, Ganesan P, Clopton P, Pedron M, Ruipérez-Campillo S, Desai YB, De Larochellière H, Baykaner T, Perez MV, Rodrigo M, Rogers AJ, Narayan SM. Feng R, et al. Circ Arrhythm Electrophysiol. 2025 Jan;18(1):e013023. doi: 10.1161/CIRCEP.124.013023. Epub 2024 Dec 16. Circ Arrhythm Electrophysiol. 2025. PMID: 39676642
Determining the Importance of Clinical Modalities for NeuroDegenerative Disorders and Risk of Patient Injury Using Machine Learning and Survival Analysis.
Noshin K, Boland MR, Hou B, He W, Lu V, Manning C, Shen L, Zhang A. Noshin K, et al. AMIA Jt Summits Transl Sci Proc. 2025 Jun 10;2025:385-394. eCollection 2025. AMIA Jt Summits Transl Sci Proc. 2025. PMID: 40502273 Free PMC article.

See all "Cited by" articles

References

1. 2021 Alzheimer’s disease facts and figures. Alzheimers Dement. Mar 2021;17(3):327–406. doi:10.1002/alz.12328 - DOI - PubMed
1. Wong W Economic burden of Alzheimer disease and managed care considerations. Am J Manag Care. Aug 2020;26(8 Suppl):S177–S183. doi:10.37765/ajmc.2020.88482 - DOI - PubMed
1. Gauthier S, Rosa-Neto P, Morais J, Webster C. World Alzheimer Report 2021: Journey through the diagnosis of dementia. Alzheimer’s Disease International. 2021;
1. Jack CR Jr., Albert MS, Knopman DS, et al. Introduction to the recommendations from the National Institute on Aging-Alzheimer’s Association workgroups on diagnostic guidelines for Alzheimer’s disease. Alzheimers Dement. May 2011;7(3):257–62. doi:10.1016/j.jalz.2011.03.004 - DOI - PMC - PubMed
1. Sperling RA, Aisen PS, Beckett LA, et al. Toward defining the preclinical stages of Alzheimer’s disease: Recommendations from the National Institute on Aging-Alzheimer’s Association workgroups on diagnostic guidelines for Alzheimer’s disease. Alzheimer’s & dementia. 2011;7(3):280–292. - PMC - PubMed

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions

Grants and funding

LinkOut - more resources

Full Text Sources
- Elsevier Science
- PubMed Central
Medical
- MedlinePlus Health Information

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

AD-BERT: Using pre-trained language model to predict the progression from mild cognitive impairment to Alzheimer's disease

Affiliations

AD-BERT: Using pre-trained language model to predict the progression from mild cognitive impairment to Alzheimer's disease

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources

Medical

Abstract

Conflict of interest statement

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Related information

Grants and funding

LinkOut - more resources

Full Text Sources

Medical