A systematic review of natural language processing and text mining of symptoms from electronic patient-authored text data
- PMID: 30914179
- PMCID: PMC6438188
- DOI: 10.1016/j.ijmedinf.2019.02.008
A systematic review of natural language processing and text mining of symptoms from electronic patient-authored text data
Abstract
Objective: In this systematic review, we aim to synthesize the literature on the use of natural language processing (NLP) and text mining as they apply to symptom extraction and processing in electronic patient-authored text (ePAT).
Materials and methods: A comprehensive literature search of 1964 articles from PubMed and EMBASE was narrowed to 21 eligible articles. Data related to purpose, text source, number of users and/or posts, evaluation metrics, and quality indicators were recorded.
Results: Pain (n = 18) and fatigue and sleep disturbance (n = 18) were the most frequently evaluated symptom clinical content categories. Studies accessed ePAT from sources such as Twitter and online community forums or patient portals focused on diseases, including diabetes, cancer, and depression. Fifteen studies used NLP as a primary methodology. Studies reported evaluation metrics including the precision, recall, and F-measure for symptom-specific research questions.
Discussion: NLP and text mining have been used to extract and analyze patient-authored symptom data in a wide variety of online communities. Though there are computational challenges with accessing ePAT, the depth of information provided directly from patients offers new horizons for precision medicine, characterization of sub-clinical symptoms, and the creation of personal health libraries as outlined by the National Library of Medicine.
Conclusion: Future research should consider the needs of patients expressed through ePAT and its relevance to symptom science. Understanding the role that ePAT plays in health communication and real-time assessment of symptoms, through the use of NLP and text mining, is critical to a patient-centered health system.
Keywords: Electronic patient-authored text; Natural language processing; Review; Signs and symptoms.
Copyright © 2019 Elsevier B.V. All rights reserved.
Conflict of interest statement
CONFLICT OF INTEREST
We have no conflicts of interest to disclose.
Figures
Similar articles
-
Natural language processing of symptoms documented in free-text narratives of electronic health records: a systematic review.J Am Med Inform Assoc. 2019 Apr 1;26(4):364-379. doi: 10.1093/jamia/ocy173. J Am Med Inform Assoc. 2019. PMID: 30726935 Free PMC article.
-
Systematic Evaluation of Research Progress on Natural Language Processing in Medicine Over the Past 20 Years: Bibliometric Study on PubMed.J Med Internet Res. 2020 Jan 23;22(1):e16816. doi: 10.2196/16816. J Med Internet Res. 2020. PMID: 32012074 Free PMC article. Review.
-
General Symptom Extraction from VA Electronic Medical Notes.Stud Health Technol Inform. 2017;245:356-360. Stud Health Technol Inform. 2017. PMID: 29295115
-
Automatically Detecting Failures in Natural Language Processing Tools for Online Community Text.J Med Internet Res. 2015 Aug 31;17(8):e212. doi: 10.2196/jmir.4612. J Med Internet Res. 2015. PMID: 26323337 Free PMC article.
-
Text mining occupations from the mental health electronic health record: a natural language processing approach using records from the Clinical Record Interactive Search (CRIS) platform in south London, UK.BMJ Open. 2021 Mar 25;11(3):e042274. doi: 10.1136/bmjopen-2020-042274. BMJ Open. 2021. PMID: 33766838 Free PMC article.
Cited by
-
Applying natural language processing and machine learning techniques to patient experience feedback: a systematic review.BMJ Health Care Inform. 2021 Mar;28(1):e100262. doi: 10.1136/bmjhci-2020-100262. BMJ Health Care Inform. 2021. PMID: 33653690 Free PMC article.
-
The Use of BP Neural Network Algorithm and Natural Language Processing in the Impact of Social Audit on Enterprise Innovation Ability.Comput Intell Neurosci. 2022 May 18;2022:7297769. doi: 10.1155/2022/7297769. eCollection 2022. Comput Intell Neurosci. 2022. PMID: 35634059 Free PMC article.
-
A natural language processing pipeline to synthesize patient-generated notes toward improving remote care and chronic disease management: a cystic fibrosis case study.JAMIA Open. 2021 Sep 29;4(3):ooab084. doi: 10.1093/jamiaopen/ooab084. eCollection 2021 Jul. JAMIA Open. 2021. PMID: 34604710 Free PMC article.
-
Adverse Event Signal Detection Using Patients' Concerns in Pharmaceutical Care Records: Evaluation of Deep Learning Models.J Med Internet Res. 2024 Apr 16;26:e55794. doi: 10.2196/55794. J Med Internet Res. 2024. PMID: 38625718 Free PMC article.
-
Enhanced effective convolutional attention network with squeeze-and-excitation inception module for multi-label clinical document classification.Sci Rep. 2025 May 16;15(1):16988. doi: 10.1038/s41598-025-98719-0. Sci Rep. 2025. PMID: 40379823 Free PMC article.
References
-
- Fox S, Duggan M. Health Online 2013 Pew Research Center Internet & American Life Project; 2013:1–55.
-
- Calvo RA, Milne DN, Hussain MS, Christensen H. Natural language processing in mental health applications using non-clinical texts. Nat. Lang. Eng 2017:1–37. doi:10.1017/S1351324916000383. - DOI
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources