Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Aug:144:104442.
doi: 10.1016/j.jbi.2023.104442. Epub 2023 Jul 8.

AD-BERT: Using pre-trained language model to predict the progression from mild cognitive impairment to Alzheimer's disease

Affiliations

AD-BERT: Using pre-trained language model to predict the progression from mild cognitive impairment to Alzheimer's disease

Chengsheng Mao et al. J Biomed Inform. 2023 Aug.

Abstract

Objective: We develop a deep learning framework based on the pre-trained Bidirectional Encoder Representations from Transformers (BERT) model using unstructured clinical notes from electronic health records (EHRs) to predict the risk of disease progression from Mild Cognitive Impairment (MCI) to Alzheimer's Disease (AD).

Methods: We identified 3657 patients diagnosed with MCI together with their progress notes from Northwestern Medicine Enterprise Data Warehouse (NMEDW) between 2000 and 2020. The progress notes no later than the first MCI diagnosis were used for the prediction. We first preprocessed the notes by deidentification, cleaning and splitting into sections, and then pre-trained a BERT model for AD (named AD-BERT) based on the publicly available Bio+Clinical BERT on the preprocessed notes. All sections of a patient were embedded into a vector representation by AD-BERT and then combined by global MaxPooling and a fully connected network to compute the probability of MCI-to-AD progression. For validation, we conducted a similar set of experiments on 2563 MCI patients identified at Weill Cornell Medicine (WCM) during the same timeframe.

Results: Compared with the 7 baseline models, the AD-BERT model achieved the best performance on both datasets, with Area Under receiver operating characteristic Curve (AUC) of 0.849 and F1 score of 0.440 on NMEDW dataset, and AUC of 0.883 and F1 score of 0.680 on WCM dataset.

Conclusion: The use of EHRs for AD-related research is promising, and AD-BERT shows superior predictive performance in modeling MCI-to-AD progression prediction. Our study demonstrates the utility of pre-trained language models and clinical notes in predicting MCI-to-AD progression, which could have important implications for improving early detection and intervention for AD.

Keywords: Alzheimer's disease; Electronic health records; Mild cognitive impairment; Pre-trained language model.

PubMed Disclaimer

Conflict of interest statement

Declaration of Competing Interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Figures

Figure 1
Figure 1
Inclusion and exclusion criteria for the study cohorts for (a) NMEDW and (b) WCM.
Figure 2
Figure 2
The overview of our framework. The notes of a patient are split into sections, which are then fed to the pretrained AD-BERT model to generate a representation for each section. The patient representation is generated by global MaxPooling that aggregates all the section representations. Finally, a linear classifier combined with a sigmoid activation layer is used to predict probability of MCI-to-AD progression.
Figure 3
Figure 3
The illustration of samples in case and control groups. (a) For no-restrict prediction, the case and control groups are differentiated by the AD diagnosis condition after MCI diagnosis, as reflected in all diagnostic records. (b) For x-month prediction, in addition to the AD diagnosis condition within x months after MCI diagnosis, we also enforce a time constraint on the control group by requiring the last encounter to occur after x months to ensure the patients have a conversion time of at least x months.
Figure 4
Figure 4
Attention visualization of AD-BERT. The model pays more attention to the terms like “memory”, “MCI” and “difficulty recalling dates” than others.

Similar articles

Cited by

References

    1. 2021 Alzheimer’s disease facts and figures. Alzheimers Dement. Mar 2021;17(3):327–406. doi:10.1002/alz.12328 - DOI - PubMed
    1. Wong W Economic burden of Alzheimer disease and managed care considerations. Am J Manag Care. Aug 2020;26(8 Suppl):S177–S183. doi:10.37765/ajmc.2020.88482 - DOI - PubMed
    1. Gauthier S, Rosa-Neto P, Morais J, Webster C. World Alzheimer Report 2021: Journey through the diagnosis of dementia. Alzheimer’s Disease International. 2021;
    1. Jack CR Jr., Albert MS, Knopman DS, et al. Introduction to the recommendations from the National Institute on Aging-Alzheimer’s Association workgroups on diagnostic guidelines for Alzheimer’s disease. Alzheimers Dement. May 2011;7(3):257–62. doi:10.1016/j.jalz.2011.03.004 - DOI - PMC - PubMed
    1. Sperling RA, Aisen PS, Beckett LA, et al. Toward defining the preclinical stages of Alzheimer’s disease: Recommendations from the National Institute on Aging-Alzheimer’s Association workgroups on diagnostic guidelines for Alzheimer’s disease. Alzheimer’s & dementia. 2011;7(3):280–292. - PMC - PubMed

Publication types

LinkOut - more resources