Using clinical Natural Language Processing for health outcomes research: Overview and actionable suggestions for future advances
- PMID: 30368002
- PMCID: PMC6986921
- DOI: 10.1016/j.jbi.2018.10.005
Using clinical Natural Language Processing for health outcomes research: Overview and actionable suggestions for future advances
Abstract
The importance of incorporating Natural Language Processing (NLP) methods in clinical informatics research has been increasingly recognized over the past years, and has led to transformative advances. Typically, clinical NLP systems are developed and evaluated on word, sentence, or document level annotations that model specific attributes and features, such as document content (e.g., patient status, or report type), document section types (e.g., current medications, past medical history, or discharge summary), named entities and concepts (e.g., diagnoses, symptoms, or treatments) or semantic attributes (e.g., negation, severity, or temporality). From a clinical perspective, on the other hand, research studies are typically modelled and evaluated on a patient- or population-level, such as predicting how a patient group might respond to specific treatments or patient monitoring over time. While some NLP tasks consider predictions at the individual or group user level, these tasks still constitute a minority. Owing to the discrepancy between scientific objectives of each field, and because of differences in methodological evaluation priorities, there is no clear alignment between these evaluation approaches. Here we provide a broad summary and outline of the challenging issues involved in defining appropriate intrinsic and extrinsic evaluation methods for NLP research that is to be used for clinical outcomes research, and vice versa. A particular focus is placed on mental health research, an area still relatively understudied by the clinical NLP research community, but where NLP methods are of notable relevance. Recent advances in clinical NLP method development have been significant, but we propose more emphasis needs to be placed on rigorous evaluation for the field to advance further. To enable this, we provide actionable suggestions, including a minimal protocol that could be used when reporting clinical NLP method development and its evaluation.
Keywords: Clinical informatics; Epidemiology; Evaluation; Information extraction; Mental Health Informatics; Natural Language Processing; Public Health; Text analytics.
Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.
Conflict of interest statement
The authors declare that there are no conflicts of interest.
Figures


Similar articles
-
A comparison of word embeddings for the biomedical natural language processing.J Biomed Inform. 2018 Nov;87:12-20. doi: 10.1016/j.jbi.2018.09.008. Epub 2018 Sep 12. J Biomed Inform. 2018. PMID: 30217670 Free PMC article.
-
Recent Advances in Clinical Natural Language Processing in Support of Semantic Analysis.Yearb Med Inform. 2015 Aug 13;10(1):183-93. doi: 10.15265/IY-2015-009. Yearb Med Inform. 2015. PMID: 26293867 Free PMC article. Review.
-
SemClinBr - a multi-institutional and multi-specialty semantically annotated corpus for Portuguese clinical NLP tasks.J Biomed Semantics. 2022 May 8;13(1):13. doi: 10.1186/s13326-022-00269-1. J Biomed Semantics. 2022. PMID: 35527259 Free PMC article.
-
A scoping review of publicly available language tasks in clinical natural language processing.J Am Med Inform Assoc. 2022 Sep 12;29(10):1797-1806. doi: 10.1093/jamia/ocac127. J Am Med Inform Assoc. 2022. PMID: 35923088 Free PMC article.
-
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217. Cochrane Database Syst Rev. 2022. PMID: 36321557 Free PMC article.
Cited by
-
Coding Free-Text Chief Complaints from a Health Information Exchange: A Preliminary Study.AMIA Annu Symp Proc. 2021 Jan 25;2020:638-647. eCollection 2020. AMIA Annu Symp Proc. 2021. PMID: 33936438 Free PMC article.
-
Medical Information Extraction in the Age of Deep Learning.Yearb Med Inform. 2020 Aug;29(1):208-220. doi: 10.1055/s-0040-1702001. Epub 2020 Aug 21. Yearb Med Inform. 2020. PMID: 32823318 Free PMC article. Review.
-
Evaluation of the clinical application effect of eSource record tools for clinical research.BMC Med Inform Decis Mak. 2022 Apr 11;22(1):98. doi: 10.1186/s12911-022-01824-7. BMC Med Inform Decis Mak. 2022. PMID: 35410214 Free PMC article.
-
Classifying the lifestyle status for Alzheimer's disease from clinical notes using deep learning with weak supervision.BMC Med Inform Decis Mak. 2022 Jul 7;22(Suppl 1):88. doi: 10.1186/s12911-022-01819-4. BMC Med Inform Decis Mak. 2022. PMID: 35799294 Free PMC article.
-
Assessing the Performance of Clinical Natural Language Processing Systems: Development of an Evaluation Methodology.JMIR Med Inform. 2021 Jul 23;9(7):e20492. doi: 10.2196/20492. JMIR Med Inform. 2021. PMID: 34297002 Free PMC article.
References
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Miscellaneous