Improving the measurement of semantic similarity by combining gene ontology and co-functional network: a random walk based approach
- PMID: 29560823
- PMCID: PMC5861498
- DOI: 10.1186/s12918-018-0539-0
Improving the measurement of semantic similarity by combining gene ontology and co-functional network: a random walk based approach
Abstract
Background: Gene Ontology (GO) is one of the most popular bioinformatics resources. In the past decade, Gene Ontology-based gene semantic similarity has been effectively used to model gene-to-gene interactions in multiple research areas. However, most existing semantic similarity approaches rely only on GO annotations and structure, or incorporate only local interactions in the co-functional network. This may lead to inaccurate GO-based similarity resulting from the incomplete GO topology structure and gene annotations.
Results: We present NETSIM2, a new network-based method that allows researchers to measure GO-based gene functional similarities by considering the global structure of the co-functional network with a random walk with restart (RWR)-based method, and by selecting the significant term pairs to decrease the noise information. Based on the EC number (Enzyme Commission)-based groups of yeast and Arabidopsis, evaluation test shows that NETSIM2 can enhance the accuracy of Gene Ontology-based gene functional similarity.
Conclusions: Using NETSIM2 as an example, we found that the accuracy of semantic similarities can be significantly improved after effectively incorporating the global gene-to-gene interactions in the co-functional network, especially on the species that gene annotations in GO are far from complete.
Keywords: Gene Ontology; Random walk with restart; Semantic similarity.
Conflict of interest statement
Ethics approval and consent to participate
Not applicable.
Consent for publication
Not applicable.
Competing interests
The authors declare that they have no competing interests.
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Figures





Similar articles
-
Measuring semantic similarities by combining gene ontology annotations and gene co-function networks.BMC Bioinformatics. 2015 Feb 14;16:44. doi: 10.1186/s12859-015-0474-7. BMC Bioinformatics. 2015. PMID: 25886899 Free PMC article.
-
GO functional similarity clustering depends on similarity measure, clustering method, and annotation completeness.BMC Bioinformatics. 2019 Mar 27;20(1):155. doi: 10.1186/s12859-019-2752-2. BMC Bioinformatics. 2019. PMID: 30917779 Free PMC article.
-
Measure the Semantic Similarity of GO Terms Using Aggregate Information Content.IEEE/ACM Trans Comput Biol Bioinform. 2014 May-Jun;11(3):468-76. doi: 10.1109/TCBB.2013.176. IEEE/ACM Trans Comput Biol Bioinform. 2014. PMID: 26356015
-
How Does the Scientific Community Contribute to Gene Ontology?Methods Mol Biol. 2017;1446:85-93. doi: 10.1007/978-1-4939-3743-1_7. Methods Mol Biol. 2017. PMID: 27812937 Review.
-
A Literature Review of Gene Function Prediction by Modeling Gene Ontology.Front Genet. 2020 Apr 24;11:400. doi: 10.3389/fgene.2020.00400. eCollection 2020. Front Genet. 2020. PMID: 32391061 Free PMC article. Review.
Cited by
-
Semantic similarity and machine learning with ontologies.Brief Bioinform. 2021 Jul 20;22(4):bbaa199. doi: 10.1093/bib/bbaa199. Brief Bioinform. 2021. PMID: 33049044 Free PMC article. Review.
-
TS-GOEA: a web tool for tissue-specific gene set enrichment analysis based on gene ontology.BMC Bioinformatics. 2019 Nov 25;20(Suppl 18):572. doi: 10.1186/s12859-019-3125-6. BMC Bioinformatics. 2019. PMID: 31760951 Free PMC article.
-
Empowering the discovery of novel target-disease associations via machine learning approaches in the open targets platform.BMC Bioinformatics. 2022 Jun 16;23(1):232. doi: 10.1186/s12859-022-04753-4. BMC Bioinformatics. 2022. PMID: 35710324 Free PMC article.
-
RWRNET: A Gene Regulatory Network Inference Algorithm Using Random Walk With Restart.Front Genet. 2020 Sep 25;11:591461. doi: 10.3389/fgene.2020.591461. eCollection 2020. Front Genet. 2020. PMID: 33101398 Free PMC article.
-
Effective norm emergence in cell systems under limited communication.BMC Bioinformatics. 2018 Apr 11;19(Suppl 5):119. doi: 10.1186/s12859-018-2097-2. BMC Bioinformatics. 2018. PMID: 29671391 Free PMC article.
References
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
Molecular Biology Databases
Miscellaneous