Automatically Detecting Failures in Natural Language Processing Tools for Online Community Text
- PMID: 26323337
- PMCID: PMC4642409
- DOI: 10.2196/jmir.4612
Automatically Detecting Failures in Natural Language Processing Tools for Online Community Text
Abstract
Background: The prevalence and value of patient-generated health text are increasing, but processing such text remains problematic. Although existing biomedical natural language processing (NLP) tools are appealing, most were developed to process clinician- or researcher-generated text, such as clinical notes or journal articles. In addition to being constructed for different types of text, other challenges of using existing NLP include constantly changing technologies, source vocabularies, and characteristics of text. These continuously evolving challenges warrant the need for applying low-cost systematic assessment. However, the primarily accepted evaluation method in NLP, manual annotation, requires tremendous effort and time.
Objective: The primary objective of this study is to explore an alternative approach-using low-cost, automated methods to detect failures (eg, incorrect boundaries, missed terms, mismapped concepts) when processing patient-generated text with existing biomedical NLP tools. We first characterize common failures that NLP tools can make in processing online community text. We then demonstrate the feasibility of our automated approach in detecting these common failures using one of the most popular biomedical NLP tools, MetaMap.
Methods: Using 9657 posts from an online cancer community, we explored our automated failure detection approach in two steps: (1) to characterize the failure types, we first manually reviewed MetaMap's commonly occurring failures, grouped the inaccurate mappings into failure types, and then identified causes of the failures through iterative rounds of manual review using open coding, and (2) to automatically detect these failure types, we then explored combinations of existing NLP techniques and dictionary-based matching for each failure cause. Finally, we manually evaluated the automatically detected failures.
Results: From our manual review, we characterized three types of failure: (1) boundary failures, (2) missed term failures, and (3) word ambiguity failures. Within these three failure types, we discovered 12 causes of inaccurate mappings of concepts. We used automated methods to detect almost half of 383,572 MetaMap's mappings as problematic. Word sense ambiguity failure was the most widely occurring, comprising 82.22% of failures. Boundary failure was the second most frequent, amounting to 15.90% of failures, while missed term failures were the least common, making up 1.88% of failures. The automated failure detection achieved precision, recall, accuracy, and F1 score of 83.00%, 92.57%, 88.17%, and 87.52%, respectively.
Conclusions: We illustrate the challenges of processing patient-generated online health community text and characterize failures of NLP tools on this patient-generated health text, demonstrating the feasibility of our low-cost approach to automatically detect those failures. Our approach shows the potential for scalable and effective solutions to automatically assess the constantly evolving NLP tools and source vocabularies to process patient-generated text.
Keywords: UMLS; automatic data processing; information extraction; natural language processing; quantitative evaluation.
Conflict of interest statement
Conflicts of Interest: None declared.
Figures
Similar articles
-
Use of "off-the-shelf" information extraction algorithms in clinical informatics: A feasibility study of MetaMap annotation of Italian medical notes.J Biomed Inform. 2016 Oct;63:22-32. doi: 10.1016/j.jbi.2016.07.017. Epub 2016 Jul 18. J Biomed Inform. 2016. PMID: 27444186
-
Evaluation of Natural Language Processing (NLP) systems to annotate drug product labeling with MedDRA terminology.J Biomed Inform. 2018 Jul;83:73-86. doi: 10.1016/j.jbi.2018.05.019. Epub 2018 Jun 1. J Biomed Inform. 2018. PMID: 29860093
-
Semantic biomedical resource discovery: a Natural Language Processing framework.BMC Med Inform Decis Mak. 2015 Sep 30;15:77. doi: 10.1186/s12911-015-0200-4. BMC Med Inform Decis Mak. 2015. PMID: 26423616 Free PMC article.
-
A systematic review of natural language processing and text mining of symptoms from electronic patient-authored text data.Int J Med Inform. 2019 May;125:37-46. doi: 10.1016/j.ijmedinf.2019.02.008. Epub 2019 Feb 20. Int J Med Inform. 2019. PMID: 30914179 Free PMC article.
-
Natural language processing of symptoms documented in free-text narratives of electronic health records: a systematic review.J Am Med Inform Assoc. 2019 Apr 1;26(4):364-379. doi: 10.1093/jamia/ocy173. J Am Med Inform Assoc. 2019. PMID: 30726935 Free PMC article.
Cited by
-
Harnessing Reddit to Understand the Written-Communication Challenges Experienced by Individuals With Mental Health Disorders: Analysis of Texts From Mental Health Communities.J Med Internet Res. 2018 Apr 10;20(4):e121. doi: 10.2196/jmir.8219. J Med Internet Res. 2018. PMID: 29636316 Free PMC article.
-
Factors Contributing to Dropping-out in an Online Health Community: Static and Longitudinal Analyses.AMIA Annu Symp Proc. 2017 Feb 10;2016:2090-2099. eCollection 2016. AMIA Annu Symp Proc. 2017. PMID: 28269969 Free PMC article.
-
Knowledge Discovery from Posts in Online Health Communities Using Unified Medical Language System.Int J Environ Res Public Health. 2018 Jun 19;15(6):1291. doi: 10.3390/ijerph15061291. Int J Environ Res Public Health. 2018. PMID: 29921824 Free PMC article.
-
Development of a keyword library for capturing PRO-CTCAE-focused "symptom talk" in oncology conversations.JAMIA Open. 2023 Feb 9;6(1):ooad009. doi: 10.1093/jamiaopen/ooad009. eCollection 2023 Apr. JAMIA Open. 2023. PMID: 36789287 Free PMC article.
-
Longitudinal Changes in Psychological States in Online Health Community Members: Understanding the Long-Term Effects of Participating in an Online Depression Community.J Med Internet Res. 2017 Mar 20;19(3):e71. doi: 10.2196/jmir.6826. J Med Internet Res. 2017. PMID: 28320692 Free PMC article.
References
-
- Fox S, Rainie L. Pew Research Center Internet, Science & Tech. 2014. Feb 27, [2015-04-23]. The Web at 25 in the US The overall verdict: The internet has been a plus for society and an especially good thing for individual users http://www.pewinternet.org/2014/02/27/the-web-at-25-in-the-u-s/
-
- Fox S. Pew Research Center Internet, Science & Tech. 2005. May 17, [2015-04-23]. Health Information online: Eight in ten internet users have looked for health information online, with increased interest in diet, fitness, drugs, health insurance, experimental treatments, and particular doctors and hospitals http://www.pewinternet.org/2005/05/17/health-information-online/
-
- Fox S. Pew Research Center Internet, Science & Tech. 2011. Feb 28, [2015-04-25]. Peer-to-peer healthcare: The internet gives patients and caregivers access not only to information, but also to each other http://www.pewinternet.org/2011/02/28/peer-to-peer-health-care-2/
-
- Eysenbach G. Medicine 2.0: social networking, collaboration, participation, apomediation, and openness. J Med Internet Res. 2008;10(3):e22. doi: 10.2196/jmir.1030. http://www.jmir.org/2008/3/e22/ v10i3e22 - DOI - PMC - PubMed
-
- Starbird K, Palen L. ‘Voluntweeters’: Self-Organizing by Digital Volunteers in Times of Crisis. ACM CHI Conference on Human Factors in Computing Systems; May 07-12, 2011; Vancouver, BC. ACM; 2011. pp. 1071–1080. http://dl.acm.org/citation.cfm?id=1979102 - DOI
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources