The Value of Unstructured Electronic Health Record Data in Geriatric Syndrome Case Identification
- PMID: 29972595
- DOI: 10.1111/jgs.15411
The Value of Unstructured Electronic Health Record Data in Geriatric Syndrome Case Identification
Abstract
Objectives: To examine the value of unstructured electronic health record (EHR) data (free-text notes) in identifying a set of geriatric syndromes.
Design: Retrospective analysis of unstructured EHR notes using a natural language processing (NLP) algorithm.
Setting: Large multispecialty group.
Participants: Older adults (N=18,341; average age 75.9, 58.9% female).
Measurements: We compared the number of geriatric syndrome cases identified using structured claims and structured and unstructured EHR data. We also calculated these rates using a population-level claims database as a reference and identified comparable epidemiological rates in peer-reviewed literature as a benchmark.
Results: Using insurance claims data resulted in a geriatric syndrome prevalence ranging from 0.03% for lack of social support to 8.3% for walking difficulty. Using structured EHR data resulted in similar prevalence rates, ranging from 0.03% for malnutrition to 7.85% for walking difficulty. Incorporating unstructured EHR notes, enabled by applying the NLP algorithm, identified considerably higher rates of geriatric syndromes: absence of fecal control (2.1%, 2.3 times as much as structured claims and EHR data combined), decubitus ulcer (1.4%, 1.7 times as much), dementia (6.7%, 1.5 times as much), falls (23.6%, 3.2 times as much), malnutrition (2.5%, 18.0 times as much), lack of social support (29.8%, 455.9 times as much), urinary retention (4.2%, 3.9 times as much), vision impairment (6.2%, 7.4 times as much), weight loss (19.2%, 2.9 as much), and walking difficulty (36.34%, 3.4 as much). The geriatric syndrome rates extracted from structured data were substantially lower than published epidemiological rates, although adding the NLP results considerably closed this gap.
Conclusion: Claims and structured EHR data give an incomplete picture of burden related to geriatric syndromes. Geriatric syndromes are likely to be missed if unstructured data are not analyzed. Pragmatic NLP algorithms can assist with identifying individuals at high risk of experiencing geriatric syndromes and improving coordination of care for older adults.
Keywords: case identification; electronic health records; geriatric syndromes; natural language processing and text-mining; unstructured free-text data.
© 2018, Copyright the Authors Journal compilation © 2018, The American Geriatrics Society.
Similar articles
-
Comparing clinician descriptions of frailty and geriatric syndromes using electronic health records: a retrospective cohort study.BMC Geriatr. 2017 Oct 25;17(1):248. doi: 10.1186/s12877-017-0645-7. BMC Geriatr. 2017. PMID: 29070036 Free PMC article.
-
Identifying vulnerable older adult populations by contextualizing geriatric syndrome information in clinical notes of electronic health records.J Am Med Inform Assoc. 2019 Aug 1;26(8-9):787-795. doi: 10.1093/jamia/ocz093. J Am Med Inform Assoc. 2019. PMID: 31265063 Free PMC article.
-
Defining and Assessing Geriatric Risk Factors and Associated Health Care Utilization Among Older Adults Using Claims and Electronic Health Records.Med Care. 2018 Mar;56(3):233-239. doi: 10.1097/MLR.0000000000000865. Med Care. 2018. PMID: 29438193
-
Malnutrition and its contributing factors for older people living in residential aged care facilities: Insights from natural language processing of aged care records.Technol Health Care. 2023;31(6):2267-2278. doi: 10.3233/THC-230229. Technol Health Care. 2023. PMID: 37302059 Review.
-
Leveraging Natural Language Processing and Machine Learning Methods for Adverse Drug Event Detection in Electronic Health/Medical Records: A Scoping Review.Drug Saf. 2025 Apr;48(4):321-337. doi: 10.1007/s40264-024-01505-6. Epub 2025 Jan 9. Drug Saf. 2025. PMID: 39786481 Free PMC article.
Cited by
-
Natural language processing systems for extracting information from electronic health records about activities of daily living. A systematic review.JAMIA Open. 2024 May 24;7(2):ooae044. doi: 10.1093/jamiaopen/ooae044. eCollection 2024 Jul. JAMIA Open. 2024. PMID: 38798774 Free PMC article. Review.
-
Characterizing Fall Circumstances in Community-Dwelling Older Adults: A Mixed Methods Approach.J Gerontol A Biol Sci Med Sci. 2023 Aug 27;78(9):1683-1691. doi: 10.1093/gerona/glad130. J Gerontol A Biol Sci Med Sci. 2023. PMID: 37210687 Free PMC article.
-
Social and Behavioral Determinants of Health in the Era of Artificial Intelligence with Electronic Health Records: A Scoping Review.Health Data Sci. 2021 Aug 24;2021:9759016. doi: 10.34133/2021/9759016. eCollection 2021. Health Data Sci. 2021. PMID: 38487504 Free PMC article.
-
Natural language processing for detecting adverse drug events: A systematic review protocol.NIHR Open Res. 2024 Dec 10;3:67. doi: 10.3310/nihropenres.13504.3. eCollection 2023. NIHR Open Res. 2024. PMID: 39931191 Free PMC article.
-
Improving the Prediction of Persistent High Health Care Utilizers: Retrospective Analysis Using Ensemble Methodology.JMIR Med Inform. 2022 Mar 24;10(3):e33212. doi: 10.2196/33212. JMIR Med Inform. 2022. PMID: 35275063 Free PMC article.
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Miscellaneous