An evaluation of the UMLS in representing corpus derived clinical concepts
- PMID: 22195097
- PMCID: PMC3243131
An evaluation of the UMLS in representing corpus derived clinical concepts
Abstract
We performed an evaluation of the Unified Medical Language System (UMLS) in representing concepts derived from medical narrative documents from three domains: chest x-ray reports, discharge summaries and admission notes. We detected concepts in these documents by identifying noun phrases (NPs) and N-grams, including unigrams (single words), bigrams (word pairs) and trigrams (word triples). After removing NPs and N-grams that did not represent discrete clinical concepts, we processed the remaining with the UMLS MetaMap program. We manually reviewed the results of MetaMap processing to determine whether MetaMap found full, partial or no representation of the concept. For full representations, we determined whether post-coordination was required. Our results showed that a large portion of concepts found in clinical narrative documents are either unrepresented or poorly represented in the current version of the UMLS Metathesaurus and that post-coordination was often required in order to fully represent a concept.
Figures
Similar articles
-
Use of "off-the-shelf" information extraction algorithms in clinical informatics: A feasibility study of MetaMap annotation of Italian medical notes.J Biomed Inform. 2016 Oct;63:22-32. doi: 10.1016/j.jbi.2016.07.017. Epub 2016 Jul 18. J Biomed Inform. 2016. PMID: 27444186
-
A normalized lexical lookup approach to identifying UMLS concepts in free text.Stud Health Technol Inform. 2007;129(Pt 1):545-9. Stud Health Technol Inform. 2007. PMID: 17911776
-
Development and evaluation of RapTAT: a machine learning system for concept mapping of phrases from medical narratives.J Biomed Inform. 2014 Apr;48:54-65. doi: 10.1016/j.jbi.2013.11.008. Epub 2013 Dec 4. J Biomed Inform. 2014. PMID: 24316051
-
Improved identification of noun phrases in clinical radiology reports using a high-performance statistical natural language parser augmented with the UMLS specialist lexicon.J Am Med Inform Assoc. 2005 May-Jun;12(3):275-85. doi: 10.1197/jamia.M1695. Epub 2005 Jan 31. J Am Med Inform Assoc. 2005. PMID: 15684131 Free PMC article.
-
Use of general-purpose negation detection to augment concept indexing of medical documents: a quantitative study using the UMLS.J Am Med Inform Assoc. 2001 Nov-Dec;8(6):598-609. doi: 10.1136/jamia.2001.0080598. J Am Med Inform Assoc. 2001. PMID: 11687566 Free PMC article.
Cited by
-
A comparative study of current Clinical Natural Language Processing systems on handling abbreviations in discharge summaries.AMIA Annu Symp Proc. 2012;2012:997-1003. Epub 2012 Nov 3. AMIA Annu Symp Proc. 2012. PMID: 23304375 Free PMC article.
-
Development and empirical user-centered evaluation of semantically-based query recommendation for an electronic health record search engine.J Biomed Inform. 2017 Mar;67:1-10. doi: 10.1016/j.jbi.2017.01.013. Epub 2017 Jan 25. J Biomed Inform. 2017. PMID: 28131722 Free PMC article.
-
OTO: Ontology Term Organizer.BMC Bioinformatics. 2015 Feb 15;16(1):47. doi: 10.1186/s12859-015-0488-1. BMC Bioinformatics. 2015. PMID: 25887779 Free PMC article.
-
Fine-grained information extraction from German transthoracic echocardiography reports.BMC Med Inform Decis Mak. 2015 Nov 12;15:91. doi: 10.1186/s12911-015-0215-x. BMC Med Inform Decis Mak. 2015. PMID: 26563260 Free PMC article.
-
Applying MetaMap to Medline for identifying novel associations in a large clinical dataset: a feasibility analysis.J Am Med Inform Assoc. 2014 Sep-Oct;21(5):925-37. doi: 10.1136/amiajnl-2014-002767. Epub 2014 Jun 13. J Am Med Inform Assoc. 2014. PMID: 24928177 Free PMC article.
References
-
- de Lusignan S, van Weel C. The use of routinely collected computer data for research in primary care: opportunities and challenges. Family Practice. 2006 Apr;23(2):253–63. 2006. - PubMed
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources