Neuro-symbolic representation learning on biological knowledge graphs
- PMID: 28449114
- PMCID: PMC5860058
- DOI: 10.1093/bioinformatics/btx275
Neuro-symbolic representation learning on biological knowledge graphs
Abstract
Motivation: Biological data and knowledge bases increasingly rely on Semantic Web technologies and the use of knowledge graphs for data integration, retrieval and federated queries. In the past years, feature learning methods that are applicable to graph-structured data are becoming available, but have not yet widely been applied and evaluated on structured biological knowledge. Results: We develop a novel method for feature learning on biological knowledge graphs. Our method combines symbolic methods, in particular knowledge representation using symbolic logic and automated reasoning, with neural networks to generate embeddings of nodes that encode for related information within knowledge graphs. Through the use of symbolic logic, these embeddings contain both explicit and implicit information. We apply these embeddings to the prediction of edges in the knowledge graph representing problems of function prediction, finding candidate genes of diseases, protein-protein interactions, or drug target relations, and demonstrate performance that matches and sometimes outperforms traditional approaches based on manually crafted features. Our method can be applied to any biological knowledge graph, and will thereby open up the increasing amount of Semantic Web based knowledge bases in biology to use in machine learning and data analytics.
Availability and implementation: https://github.com/bio-ontology-research-group/walking-rdf-and-owl.
Contact: robert.hoehndorf@kaust.edu.sa.
Supplementary information: Supplementary data are available at Bioinformatics online.
© The Author(s) 2017. Published by Oxford University Press.
Figures


Similar articles
-
mOWL: Python library for machine learning with biomedical ontologies.Bioinformatics. 2023 Jan 1;39(1):btac811. doi: 10.1093/bioinformatics/btac811. Bioinformatics. 2023. PMID: 36534832 Free PMC article.
-
Learning and reasoning with graph data.Front Artif Intell. 2023 Aug 22;6:1124718. doi: 10.3389/frai.2023.1124718. eCollection 2023. Front Artif Intell. 2023. PMID: 37675398 Free PMC article. Review.
-
Biological applications of knowledge graph embedding models.Brief Bioinform. 2021 Mar 22;22(2):1679-1693. doi: 10.1093/bib/bbaa012. Brief Bioinform. 2021. PMID: 32065227 Review.
-
Unsupervised construction of computational graphs for gene expression data with explicit structural inductive biases.Bioinformatics. 2022 Feb 7;38(5):1320-1327. doi: 10.1093/bioinformatics/btab830. Bioinformatics. 2022. PMID: 34888618 Free PMC article.
-
A large collection of bioinformatics question-query pairs over federated knowledge graphs: methodology and applications.Gigascience. 2025 Jan 6;14:giaf045. doi: 10.1093/gigascience/giaf045. Gigascience. 2025. PMID: 40378136 Free PMC article.
Cited by
-
Improving protein function prediction using protein sequence and GO-term similarities.Bioinformatics. 2019 Apr 1;35(7):1116-1124. doi: 10.1093/bioinformatics/bty751. Bioinformatics. 2019. PMID: 30169569 Free PMC article.
-
PFP-WGAN: Protein function prediction by discovering Gene Ontology term correlations with generative adversarial networks.PLoS One. 2021 Feb 25;16(2):e0244430. doi: 10.1371/journal.pone.0244430. eCollection 2021. PLoS One. 2021. PMID: 33630862 Free PMC article.
-
Knowledge-Based Biomedical Data Science.Annu Rev Biomed Data Sci. 2020 Jul;3:23-41. doi: 10.1146/annurev-biodatasci-010820-091627. Epub 2020 Apr 7. Annu Rev Biomed Data Sci. 2020. PMID: 33954284 Free PMC article.
-
Predicting candidate genes from phenotypes, functions and anatomical site of expression.Bioinformatics. 2021 May 5;37(6):853-860. doi: 10.1093/bioinformatics/btaa879. Bioinformatics. 2021. PMID: 33051643 Free PMC article.
-
Representation Learning: Recommendation With Knowledge Graph via Triple-Autoencoder.Front Genet. 2022 Jun 3;13:891265. doi: 10.3389/fgene.2022.891265. eCollection 2022. Front Genet. 2022. PMID: 35719384 Free PMC article.
References
-
- Baader F. et al. (2003) The Description Logic Handbook: Theory, Implementation and Applications. Cambridge University Press, Cambridge, UK.
-
- Belhajjame K. et al. (2012). PROV-O: The PROV ontology. Technical report, W3C.
-
- Belleau F. et al. (2008) Bio2RDF: Towards a mashup to build bioinformatics knowledge systems. J. Biomed. Inf., 41, 706–716. - PubMed
-
- Berners-Lee T. et al. (2001) The Semantic Web. Sci. Am., 284, 28–37.
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
Miscellaneous