A comparative study of current Clinical Natural Language Processing systems on handling abbreviations in discharge summaries
- PMID: 23304375
- PMCID: PMC3540461
Abstract
Clinical Natural Language Processing (NLP) systems extract clinical information from narrative clinical texts in many settings. Previous research has noted the challenges of handling abbreviations in clinical texts but provides little insight into how well current NLP systems recognize and interpret them. In this paper, we compared the performance of three existing clinical NLP systems in handling abbreviations: MetaMap, MedLEE, and cTAKES. The evaluation used an expert-annotated gold standard set of clinical documents (derived from 32 de-identified patient discharge summaries) containing 1,112 abbreviations. The existing NLP systems achieved suboptimal performance in abbreviation identification, with F-scores ranging from 0.165 to 0.601. MedLEE achieved the best F-score: 0.601 for all abbreviations and 0.705 for clinically relevant abbreviations. This study suggested that accurate identification of clinical abbreviations is a challenging task and that more advanced abbreviation recognition modules might improve existing clinical NLP systems.
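The F-scores reported above are the usual harmonic mean of precision and recall over abbreviation identifications matched against the gold standard. The sketch below is not the paper's evaluation code; it is a minimal illustration of how such a score could be computed, assuming each identification is a (document ID, character span, interpreted sense) tuple and exact matching. The example documents, spans, and senses are hypothetical.

```python
# Minimal sketch (not from the paper): scoring a system's abbreviation output
# against a gold standard with standard precision/recall/F-score definitions.

def f_score(system_abbrevs, gold_abbrevs):
    """Precision, recall, and F1 over sets of (doc_id, span, sense) tuples."""
    true_positives = len(system_abbrevs & gold_abbrevs)
    precision = true_positives / len(system_abbrevs) if system_abbrevs else 0.0
    recall = true_positives / len(gold_abbrevs) if gold_abbrevs else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    return precision, recall, 2 * precision * recall / (precision + recall)

# Hypothetical example: the gold standard marks three abbreviations; the system
# finds two of them, but assigns the wrong sense to one, so it does not match.
gold = {("doc1", (10, 13), "congestive heart failure"),
        ("doc1", (42, 45), "shortness of breath"),
        ("doc1", (88, 90), "blood pressure")}
system = {("doc1", (10, 13), "congestive heart failure"),
          ("doc1", (42, 45), "superior orbital fissure")}  # wrong sense

p, r, f = f_score(system, gold)
print(f"precision={p:.3f} recall={r:.3f} F={f:.3f}")  # precision=0.500 recall=0.333 F=0.400
```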