Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Apr 1;52(4):afad046.
doi: 10.1093/ageing/afad046.

Predicting future falls in older people using natural language processing of general practitioners' clinical notes

Affiliations

Predicting future falls in older people using natural language processing of general practitioners' clinical notes

Noman Dormosh et al. Age Ageing. .

Abstract

Background: Falls in older people are common and morbid. Prediction models can help identifying individuals at higher fall risk. Electronic health records (EHR) offer an opportunity to develop automated prediction tools that may help to identify fall-prone individuals and lower clinical workload. However, existing models primarily utilise structured EHR data and neglect information in unstructured data. Using machine learning and natural language processing (NLP), we aimed to examine the predictive performance provided by unstructured clinical notes, and their incremental performance over structured data to predict falls.

Methods: We used primary care EHR data of people aged 65 or over. We developed three logistic regression models using the least absolute shrinkage and selection operator: one using structured clinical variables (Baseline), one with topics extracted from unstructured clinical notes (Topic-based) and one by adding clinical variables to the extracted topics (Combi). Model performance was assessed in terms of discrimination using the area under the receiver operating characteristic curve (AUC), and calibration by calibration plots. We used 10-fold cross-validation to validate the approach.

Results: Data of 35,357 individuals were analysed, of which 4,734 experienced falls. Our NLP topic modelling technique discovered 151 topics from the unstructured clinical notes. AUCs and 95% confidence intervals of the Baseline, Topic-based and Combi models were 0.709 (0.700-0.719), 0.685 (0.676-0.694) and 0.718 (0.708-0.727), respectively. All the models showed good calibration.

Conclusions: Unstructured clinical notes are an additional viable data source to develop and improve prediction models for falls compared to traditional prediction models, but the clinical relevance remains limited.

Keywords: accidental falls; electronic health records; fall prediction; free text; natural language processing; older people; topic modelling.

PubMed Disclaimer

Conflict of interest statement

None.

Figures

Figure 1
Figure 1
An illustration of embeddings of words, documents and topics generated by top2vec, adapted from [23]. The large dark ellipses reflect the clusters of the documents. Words, documents and topics are represented in a two-dimensional space for illustration purposes, but they usually have between 50 and 300 dimensions.
Figure 2
Figure 2
The calibration plots of three prediction models: Baseline, Topic-based and Combi models. The diagonal line reflects the calibration of an ideal model. The dashed line indicates the actual model calibration. Points below the diagonal line reflect overestimation, whereas points above the diagonal line reflect underestimation. The graph in the lower compartment of each plot represents a histogram of the distribution of the predicted probabilities.

References

    1. EuroSafe . EuroSafe: injuries in the European Union, summary on injury statistics 2012–2014. 6th ed, EuroSafe, Amsterdam 2016; 505–18.
    1. Moreland B, Kakara R, Henry A. Trends in nonfatal falls and fall-related injuries among adults aged ≥65 years—United States, 2012–2018. MMWR Morb Mortal Wkly Rep 2020; 69: 875–81. - PMC - PubMed
    1. Stel VS, Smit JH, Pluijm SMFet al. . Consequences of falling in older men and women and risk factors for health service use and functional decline. Age Ageing 2004; 33: 58–65. - PubMed
    1. Hartholt KA, Van Beeck EF, Polinder Set al. . Societal consequences of falls in the older population: injuries, healthcare costs, and long-term reduced quality of life. J Trauma Inj Infect Crit Care 2011; 71: 748–53. - PubMed
    1. Montero-Odasso M, Van Der Velde N, Alexander NBet al. . New horizons in falls prevention and management for older adults: a global initiative. Age Ageing 2021; 50: 1499–507. - PubMed

Publication types