Metapaths: similarity search in heterogeneous knowledge graphs via meta-paths
- PMID: 37140542
- PMCID: PMC10209523
- DOI: 10.1093/bioinformatics/btad297
Metapaths: similarity search in heterogeneous knowledge graphs via meta-paths
Abstract
Summary: Heterogeneous knowledge graphs (KGs) have enabled the modeling of complex systems, from genetic interaction graphs and protein-protein interaction networks to networks representing drugs, diseases, proteins, and side effects. Analytical methods for KGs rely on quantifying similarities between entities, such as nodes, in the graph. However, such methods must consider the diversity of node and edge types contained within the KG via, for example, defined sequences of entity types known as meta-paths. We present metapaths, the first R software package to implement meta-paths and perform meta-path-based similarity search in heterogeneous KGs. The metapaths package offers various built-in similarity metrics for node pair comparison by querying KGs represented as either edge or adjacency lists, as well as auxiliary aggregation methods to measure set-level relationships. Indeed, evaluation of these methods on an open-source biomedical KG recovered meaningful drug and disease-associated relationships, including those in Alzheimer's disease. The metapaths framework facilitates the scalable and flexible modeling of network similarities in KGs with applications across KG learning.
Availability and implementation: The metapaths R package is available via GitHub at https://github.com/ayushnoori/metapaths and is released under MPL 2.0 (Zenodo DOI: 10.5281/zenodo.7047209). Package documentation and usage examples are available at https://www.ayushnoori.com/metapaths.
© The Author(s) 2023. Published by Oxford University Press.
Conflict of interest statement
None declared.
Figures
References
-
- Alsentzer E, Finlayson S, Li M. et al. Subgraph neural networks. In: Larochelle H, Ranzato M, Hadsell R. et al. (eds.) Advances in Neural Information Processing Systems. New York: Curran Associates, Inc., 2020, 8017–29.
-
- Hogan A, Blomqvist E, Cochez M. et al. Knowledge graphs. ACM Comput Surv 2022;54:1–37.
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Medical
Molecular Biology Databases
