Natural language processing of symptoms documented in free-text narratives of electronic health records: a systematic review
- PMID: 30726935
- PMCID: PMC6657282
- DOI: 10.1093/jamia/ocy173
Natural language processing of symptoms documented in free-text narratives of electronic health records: a systematic review
Abstract
Objective: Natural language processing (NLP) of symptoms from electronic health records (EHRs) could contribute to the advancement of symptom science. We aim to synthesize the literature on the use of NLP to process or analyze symptom information documented in EHR free-text narratives.
Materials and methods: Our search of 1964 records from PubMed and EMBASE was narrowed to 27 eligible articles. Data related to the purpose, free-text corpus, patients, symptoms, NLP methodology, evaluation metrics, and quality indicators were extracted for each study.
Results: Symptom-related information was presented as a primary outcome in 14 studies. EHR narratives represented various inpatient and outpatient clinical specialties, with general, cardiology, and mental health occurring most frequently. Studies encompassed a wide variety of symptoms, including shortness of breath, pain, nausea, dizziness, disturbed sleep, constipation, and depressed mood. NLP approaches included previously developed NLP tools, classification methods, and manually curated rule-based processing. Only one-third (n = 9) of studies reported patient demographic characteristics.
Discussion: NLP is used to extract information from EHR free-text narratives written by a variety of healthcare providers on an expansive range of symptoms across diverse clinical specialties. The current focus of this field is on the development of methods to extract symptom information and the use of symptom information for disease classification tasks rather than the examination of symptoms themselves.
Conclusion: Future NLP studies should concentrate on the investigation of symptoms and symptom documentation in EHR free-text narratives. Efforts should be undertaken to examine patient characteristics and make symptom-related NLP algorithms or pipelines and vocabularies openly available.
Keywords: electronic health records; natural language processing; review; signs and symptoms.
© The Author(s) 2019. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For permissions, please email: journals.permissions@oup.com.
Figures


Similar articles
-
A systematic review of natural language processing and text mining of symptoms from electronic patient-authored text data.Int J Med Inform. 2019 May;125:37-46. doi: 10.1016/j.ijmedinf.2019.02.008. Epub 2019 Feb 20. Int J Med Inform. 2019. PMID: 30914179 Free PMC article.
-
Natural Language Processing Accurately Differentiates Cancer Symptom Information in Electronic Health Record Narratives.JCO Clin Cancer Inform. 2024 Aug;8:e2300235. doi: 10.1200/CCI.23.00235. JCO Clin Cancer Inform. 2024. PMID: 39116379
-
Natural language processing with machine learning methods to analyze unstructured patient-reported outcomes derived from electronic health records: A systematic review.Artif Intell Med. 2023 Dec;146:102701. doi: 10.1016/j.artmed.2023.102701. Epub 2023 Nov 1. Artif Intell Med. 2023. PMID: 38042599 Free PMC article.
-
Natural Language Processing in Surgery: A Systematic Review and Meta-analysis.Ann Surg. 2021 May 1;273(5):900-908. doi: 10.1097/SLA.0000000000004419. Ann Surg. 2021. PMID: 33074901
-
Identifying Symptom Information in Clinical Notes Using Natural Language Processing.Nurs Res. 2021 May-Jun 01;70(3):173-183. doi: 10.1097/NNR.0000000000000488. Nurs Res. 2021. PMID: 33196504 Free PMC article.
Cited by
-
Human-centered approaches that integrate sensor technology across the lifespan: Opportunities and challenges.Nurs Outlook. 2020 Nov-Dec;68(6):734-744. doi: 10.1016/j.outlook.2020.05.004. Epub 2020 Jul 4. Nurs Outlook. 2020. PMID: 32631796 Free PMC article.
-
Collecting specialty-related medical terms: Development and evaluation of a resource for Spanish.BMC Med Inform Decis Mak. 2021 May 4;21(1):145. doi: 10.1186/s12911-021-01495-w. BMC Med Inform Decis Mak. 2021. PMID: 33947365 Free PMC article.
-
Challenges and opportunities for public health made possible by advances in natural language processing.Can Commun Dis Rep. 2020 Jun 4;46(6):161-168. doi: 10.14745/ccdr.v46i06a02. eCollection 2020 Jun 4. Can Commun Dis Rep. 2020. PMID: 32673380 Free PMC article.
-
Using Natural Language Processing Techniques to Detect Adverse Events From Progress Notes Due to Chemotherapy.Cancer Inform. 2022 Mar 22;21:11769351221085064. doi: 10.1177/11769351221085064. eCollection 2022. Cancer Inform. 2022. PMID: 35342285 Free PMC article.
-
Smart Work Injury Management (SWIM) System: A Machine Learning Approach for the Prediction of Sick Leave and Rehabilitation Plan.Bioengineering (Basel). 2023 Jan 28;10(2):172. doi: 10.3390/bioengineering10020172. Bioengineering (Basel). 2023. PMID: 36829666 Free PMC article.
References
-
- Mehta N, Pandit A.. Concurrence of big data analytics and healthcare: a systematic review. Int J Med Inform 2018; 114: 57–65. - PubMed
-
- Yim W-W, Yetisgen M, Harris WP, et al. Natural language processing in oncology. JAMA Oncol 2016; 2 (6): 797–804. - PubMed
-
- Fleuren WWM, Alkema W.. Application of text mining in the biomedical domain. Methods 2015; 74: 97–106. - PubMed
-
- Institute of Medicine (US) Committee on Data Standards for Patient Safety. Key Capabilities of an Electronic Health Record System: Letter Report Washington, DC: National Academies Press. 2003. - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources