Recognizing noun phrases in medical discharge summaries: an evaluation of two natural language parsers
- PMID: 8947647
- PMCID: PMC2233192
Abstract
We evaluated the ability of two natural language parsers, CLARIT and the Xerox Tagger, to identify simple noun phrases in medical discharge summaries. In twenty randomly selected discharge summaries, there were 1909 unique simple noun phrases. CLARIT and the Xerox Tagger exactly identified 77.0% and 68.7% of the phrases, respectively, and partially identified 85.7% and 80.8% of the phrases. Neither system had been specially modified or tuned to the medical domain. These results suggest that it is possible to apply existing natural language processing (NLP) techniques to large bodies of medical text in order to empirically identify the terminology used in medicine. Virtually all the noun phrases could be regarded as having a special medical connotation and would be candidates for entry into a controlled medical vocabulary.
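The exact and partial identification rates reported above can be illustrated with a minimal sketch. The abstract does not define "partial" identification, so the code below assumes, purely for illustration, that a gold-standard phrase is partially identified when it shares at least one token with any phrase the parser produced (an exact match therefore also counts as partial). The function names `normalize` and `match_rates` and the sample phrases are hypothetical and do not come from the study.

```python
from typing import Iterable, Set, Tuple


def normalize(phrase: str) -> Tuple[str, ...]:
    """Lowercase a phrase and split it on whitespace into a token tuple."""
    return tuple(phrase.lower().split())


def match_rates(gold: Iterable[str], predicted: Iterable[str]) -> Tuple[float, float]:
    """Return (exact_rate, partial_rate) over the unique gold phrases.

    ASSUMPTION: "partial" means sharing at least one token with any
    predicted phrase. The study may have used a different criterion.
    """
    # Deduplicate phrases and drop any that are empty after normalization.
    gold_set = {normalize(p) for p in gold} - {()}
    pred_set = {normalize(p) for p in predicted} - {()}
    if not gold_set:
        raise ValueError("gold standard contains no phrases")

    # Pool every token the parser produced, for the overlap check.
    pred_tokens: Set[str] = set()
    for phrase in pred_set:
        pred_tokens.update(phrase)

    exact = sum(1 for g in gold_set if g in pred_set)
    partial = sum(1 for g in gold_set if pred_tokens.intersection(g))
    total = len(gold_set)
    return exact / total, partial / total


# Hypothetical toy data, not taken from the discharge summaries in the study.
gold = ["chest pain", "acute myocardial infarction", "left lower lobe", "aspirin"]
predicted = ["chest pain", "myocardial infarction", "aspirin", "the patient"]

exact_rate, partial_rate = match_rates(gold, predicted)
print(f"exact: {exact_rate:.1%}, partial: {partial_rate:.1%}")
```

With the toy data, two of the four gold phrases are matched exactly and a third ("acute myocardial infarction") overlaps a predicted phrase, so the partial rate is higher than the exact rate, mirroring the pattern in the reported figures.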
Similar articles
- Development and evaluation of RapTAT: a machine learning system for concept mapping of phrases from medical narratives. J Biomed Inform. 2014 Apr;48:54-65. doi: 10.1016/j.jbi.2013.11.008. Epub 2013 Dec 4. PMID: 24316051
- Extracting noun phrases for all of MEDLINE. Proc AMIA Symp. 1999:671-5. PMID: 10566444 Free PMC article.
- Empirical, automated vocabulary discovery using large text corpora and advanced natural language processing tools. Proc AMIA Annu Fall Symp. 1996:159-63. PMID: 8947648 Free PMC article.
- Effects on Text Simplification: Evaluation of Splitting Up Noun Phrases. J Health Commun. 2016;21 Suppl 1(Suppl):18-26. doi: 10.1080/10810730.2015.1131775. PMID: 27043754 Free PMC article. Review.
- Noun phrases for nursing diagnoses. Nurs Diagn. 1997 Apr-Jun;8(2):49-54. doi: 10.1111/j.1744-618x.1997.tb00137.x. PMID: 9305106 Review.
Cited by
- Improved identification of noun phrases in clinical radiology reports using a high-performance statistical natural language parser augmented with the UMLS specialist lexicon. J Am Med Inform Assoc. 2005 May-Jun;12(3):275-85. doi: 10.1197/jamia.M1695. Epub 2005 Jan 31. PMID: 15684131 Free PMC article.
- Puya: a method of attracting attention to relevant physical findings. Proc AMIA Annu Fall Symp. 1997:509-13. PMID: 9357678 Free PMC article.
- Text structures in medical text processing: empirical evidence and a text understanding prototype. Proc AMIA Annu Fall Symp. 1997:819-23. PMID: 9357739 Free PMC article.
- The benefits and challenges of an electronic medical record: much more than a "word-processed" patient chart. West J Med. 1998 Sep;169(3):176-83. PMID: 9771161 Free PMC article. Review.
- UMLS concept indexing for production databases: a feasibility study. J Am Med Inform Assoc. 2001 Jan-Feb;8(1):80-91. doi: 10.1136/jamia.2001.0080080. PMID: 11141514 Free PMC article.