Semantic similarity in biomedical ontologies
- PMID: 19649320
- PMCID: PMC2712090
- DOI: 10.1371/journal.pcbi.1000443
Semantic similarity in biomedical ontologies
Abstract
In recent years, ontologies have become a mainstream topic in biomedical research. When biological entities are described using a common schema, such as an ontology, they can be compared by means of their annotations. This type of comparison is called semantic similarity, since it assesses the degree of relatedness between two entities by the similarity in meaning of their annotations. The application of semantic similarity to biomedical ontologies is recent; nevertheless, several studies have been published in the last few years describing and evaluating diverse approaches. Semantic similarity has become a valuable tool for validating the results drawn from biomedical studies such as gene clustering, gene expression data analysis, prediction and validation of molecular interactions, and disease gene prioritization. We review semantic similarity measures applied to biomedical ontologies and propose their classification according to the strategies they employ: node-based versus edge-based and pairwise versus groupwise. We also present comparative assessment studies and discuss the implications of their results. We survey the existing implementations of semantic similarity measures, and we describe examples of applications to biomedical research. This will clarify how biomedical researchers can benefit from semantic similarity measures and help them choose the approach most suitable for their studies.Biomedical ontologies are evolving toward increased coverage, formality, and integration, and their use for annotation is increasingly becoming a focus of both effort by biomedical experts and application of automated annotation procedures to create corpora of higher quality and completeness than are currently available. Given that semantic similarity measures are directly dependent on these evolutions, we can expect to see them gaining more relevance and even becoming as essential as sequence similarity is today in biomedical research.
Conflict of interest statement
The authors have declared that no competing interests exist.
Figures
 
              
              
              
              
                
                
                 
              
              
              
              
                
                
                 
              
              
              
              
                
                
                References
- 
    - Joslyn C, Mniszewski S, Fulmer A, Heaton G. The gene ontology categorizer. Bioinformatics. 2004;20:i169–177. - PubMed
 
- 
    - Rada R, Mili H, Bicknell E, Blettner M. Development and application of a metric on semantic nets. 1989. pp. 17–30. In: IEEE Transaction on Systems, Man, and Cybernetics. 19.
 
- 
    - Wu Z, Palmer MS. Verb semantics and lexical selection. Proceedings of the 32nd. Annual Meeting of the Association for Computational Linguistics (ACL 1994) 1994. pp. 133–138. URL http://dblp.uni-trier.de/db/conf/acl/acl94.html#WuP94.
 
- 
    - Budanitsky A. Lexical semantic relatedness and its application in natural language processing. 1999. URL http://citeseer.ist.psu.edu/budanitsky99lexical.html.
 
Publication types
MeSH terms
LinkOut - more resources
- Full Text Sources
- Other Literature Sources
 
        