Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Sep 1;33(17):2723-2730.
doi: 10.1093/bioinformatics/btx275.

Neuro-symbolic representation learning on biological knowledge graphs

Affiliations

Neuro-symbolic representation learning on biological knowledge graphs

Mona Alshahrani et al. Bioinformatics. .

Abstract

Motivation: Biological data and knowledge bases increasingly rely on Semantic Web technologies and the use of knowledge graphs for data integration, retrieval and federated queries. In the past years, feature learning methods that are applicable to graph-structured data are becoming available, but have not yet widely been applied and evaluated on structured biological knowledge. Results: We develop a novel method for feature learning on biological knowledge graphs. Our method combines symbolic methods, in particular knowledge representation using symbolic logic and automated reasoning, with neural networks to generate embeddings of nodes that encode for related information within knowledge graphs. Through the use of symbolic logic, these embeddings contain both explicit and implicit information. We apply these embeddings to the prediction of edges in the knowledge graph representing problems of function prediction, finding candidate genes of diseases, protein-protein interactions, or drug target relations, and demonstrate performance that matches and sometimes outperforms traditional approaches based on manually crafted features. Our method can be applied to any biological knowledge graph, and will thereby open up the increasing amount of Semantic Web based knowledge bases in biology to use in machine learning and data analytics.

Availability and implementation: https://github.com/bio-ontology-research-group/walking-rdf-and-owl.

Contact: robert.hoehndorf@kaust.edu.sa.

Supplementary information: Supplementary data are available at Bioinformatics online.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
Overview over the main steps in our workflow. We first build biological knowledge graphs by integrating Linked Data, biomedical ontologies and ontology-based annotations in a single, two-layered graph, then deductively close the graph using automated reasoning and apply feature learning on the inferred graph to take into account both explicitly represented data and inferred information. The two layers of the knowledge graph arise from the different semantics of linked biological data (represented in the graph-based language RDF) and the ontologies (represented in the model-theoretic language OWL); we formally connect the entities in the data layer through the rdf:type relation to ontology classes
Fig. 2
Fig. 2
ROCAUC test scores of SIDER drug pairs the for predicting novel indications or targets or both

Similar articles

Cited by

References

    1. Ashburner M. et al. (2000) Gene ontology: tool for the unification of biology. Nat. Genet., 25, 25–29. - PMC - PubMed
    1. Baader F. et al. (2003) The Description Logic Handbook: Theory, Implementation and Applications. Cambridge University Press, Cambridge, UK.
    1. Belhajjame K. et al. (2012). PROV-O: The PROV ontology. Technical report, W3C.
    1. Belleau F. et al. (2008) Bio2RDF: Towards a mashup to build bioinformatics knowledge systems. J. Biomed. Inf., 41, 706–716. - PubMed
    1. Berners-Lee T. et al. (2001) The Semantic Web. Sci. Am., 284, 28–37.