Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2009 Jul;5(7):e1000443.
doi: 10.1371/journal.pcbi.1000443. Epub 2009 Jul 31.

Semantic similarity in biomedical ontologies

Affiliations
Review

Semantic similarity in biomedical ontologies

Catia Pesquita et al. PLoS Comput Biol. 2009 Jul.

Abstract

In recent years, ontologies have become a mainstream topic in biomedical research. When biological entities are described using a common schema, such as an ontology, they can be compared by means of their annotations. This type of comparison is called semantic similarity, since it assesses the degree of relatedness between two entities by the similarity in meaning of their annotations. The application of semantic similarity to biomedical ontologies is recent; nevertheless, several studies have been published in the last few years describing and evaluating diverse approaches. Semantic similarity has become a valuable tool for validating the results drawn from biomedical studies such as gene clustering, gene expression data analysis, prediction and validation of molecular interactions, and disease gene prioritization. We review semantic similarity measures applied to biomedical ontologies and propose their classification according to the strategies they employ: node-based versus edge-based and pairwise versus groupwise. We also present comparative assessment studies and discuss the implications of their results. We survey the existing implementations of semantic similarity measures, and we describe examples of applications to biomedical research. This will clarify how biomedical researchers can benefit from semantic similarity measures and help them choose the approach most suitable for their studies.Biomedical ontologies are evolving toward increased coverage, formality, and integration, and their use for annotation is increasingly becoming a focus of both effort by biomedical experts and application of automated annotation procedures to create corpora of higher quality and completeness than are currently available. Given that semantic similarity measures are directly dependent on these evolutions, we can expect to see them gaining more relevance and even becoming as essential as sequence similarity is today in biomedical research.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Figure 1
Figure 1. Section of the GO graph showing the three aspects (molecular function, biological process, and cellular component) and some of their descendant terms.
The fact that GO is a DAG rather than a tree is illustrated by the term “transcription factor activity” which has two parents. An example of a part of relationship is also shown between the terms cell part and cell.
Figure 2
Figure 2. Main approaches for comparing terms: node-based and edge-based and the techniques used by each approach.
DCA, disjoint common ancestors; IC, information content; MICA, most informative common ancestor.
Figure 3
Figure 3. Main approaches for comparing gene products: pairwise and groupwise and the techniques used by each approach.

References

    1. GO-Consortium. The Gene Ontology (GO) database and informatics resource. Nucleic Acids Research. 2004;32:D258–D261. - PMC - PubMed
    1. Joslyn C, Mniszewski S, Fulmer A, Heaton G. The gene ontology categorizer. Bioinformatics. 2004;20:i169–177. - PubMed
    1. Rada R, Mili H, Bicknell E, Blettner M. Development and application of a metric on semantic nets. 1989. pp. 17–30. In: IEEE Transaction on Systems, Man, and Cybernetics. 19.
    1. Wu Z, Palmer MS. Verb semantics and lexical selection. Proceedings of the 32nd. Annual Meeting of the Association for Computational Linguistics (ACL 1994) 1994. pp. 133–138. URL http://dblp.uni-trier.de/db/conf/acl/acl94.html#WuP94.
    1. Budanitsky A. Lexical semantic relatedness and its application in natural language processing. 1999. URL http://citeseer.ist.psu.edu/budanitsky99lexical.html.

Publication types