The interaction of domain knowledge and linguistic structure in natural language processing: interpreting hypernymic propositions in biomedical text
- PMID: 14759819
- DOI: 10.1016/j.jbi.2003.11.003
The interaction of domain knowledge and linguistic structure in natural language processing: interpreting hypernymic propositions in biomedical text
Abstract
Interpretation of semantic propositions in free-text documents such as MEDLINE citations would provide valuable support for biomedical applications, and several approaches to semantic interpretation are being pursued in the biomedical informatics community. In this paper, we describe a methodology for interpreting linguistic structures that encode hypernymic propositions, in which a more specific concept is in a taxonomic relationship with a more general concept. In order to effectively process these constructions, we exploit underspecified syntactic analysis and structured domain knowledge from the Unified Medical Language System (UMLS). After introducing the syntactic processing on which our system depends, we focus on the UMLS knowledge that supports interpretation of hypernymic propositions. We first use semantic groups from the Semantic Network to ensure that the two concepts involved are compatible; hierarchical information in the Metathesaurus then determines which concept is more general and which more specific. A preliminary evaluation of a sample based on the semantic group Chemicals and Drugs provides 83% precision. An error analysis was conducted and potential solutions to the problems encountered are presented. The research discussed here serves as a paradigm for investigating the interaction between domain knowledge and linguistic structure in natural language processing, and could also make a contribution to research on automatic processing of discourse structure. Additional implications of the system we present include its integration in advanced semantic interpretation processors for biomedical text and its use for information extraction in specific domains. The approach has the potential to support a range of applications, including information retrieval and ontology engineering.
Similar articles
-
Consistency across the hierarchies of the UMLS Semantic Network and Metathesaurus.J Biomed Inform. 2003 Dec;36(6):450-61. doi: 10.1016/j.jbi.2003.11.001. J Biomed Inform. 2003. PMID: 14759818
-
OQAFMA Querying agent for the Foundational Model of Anatomy: a prototype for providing flexible and efficient access to large semantic networks.J Biomed Inform. 2003 Dec;36(6):501-17. doi: 10.1016/j.jbi.2003.11.004. J Biomed Inform. 2003. PMID: 14759821
-
Designing metaschemas for the UMLS enriched semantic network.J Biomed Inform. 2003 Dec;36(6):433-49. doi: 10.1016/j.jbi.2003.11.006. J Biomed Inform. 2003. PMID: 14759817
-
Status of text-mining techniques applied to biomedical text.Drug Discov Today. 2006 Apr;11(7-8):315-25. doi: 10.1016/j.drudis.2006.02.011. Drug Discov Today. 2006. PMID: 16580973 Review.
-
Information retrieval and knowledge discovery utilising a biomedical Semantic Web.Brief Bioinform. 2005 Sep;6(3):252-62. doi: 10.1093/bib/6.3.252. Brief Bioinform. 2005. PMID: 16212773 Review.
Cited by
-
User-centered design of a web-based crowdsourcing-integrated semantic text annotation tool for building a mental health knowledge base.J Biomed Inform. 2020 Oct;110:103571. doi: 10.1016/j.jbi.2020.103571. Epub 2020 Sep 19. J Biomed Inform. 2020. PMID: 32961307 Free PMC article.
-
Detecting Lifestyle Risk Factors for Chronic Kidney Disease With Comorbidities: Association Rule Mining Analysis of Web-Based Survey Data.J Med Internet Res. 2019 Dec 10;21(12):e14204. doi: 10.2196/14204. J Med Internet Res. 2019. PMID: 31821152 Free PMC article.
-
Machine learning approach to literature mining for the genetics of complex diseases.Database (Oxford). 2019 Jan 1;2019:baz124. doi: 10.1093/database/baz124. Database (Oxford). 2019. PMID: 31768545 Free PMC article.
-
Rare disease knowledge enrichment through a data-driven approach.BMC Med Inform Decis Mak. 2019 Feb 14;19(1):32. doi: 10.1186/s12911-019-0752-9. BMC Med Inform Decis Mak. 2019. PMID: 30764825 Free PMC article.
-
Expediting knowledge acquisition by a web framework for Knowledge Graph Exploration and Visualization (KGEV): case studies on COVID-19 and Human Phenotype Ontology.BMC Med Inform Decis Mak. 2022 Jun 2;22(Suppl 2):147. doi: 10.1186/s12911-022-01848-z. BMC Med Inform Decis Mak. 2022. PMID: 35655307 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources