Hetnet connectivity search provides rapid insights into how biomedical entities are related
- PMID: 37503959
- PMCID: PMC10375517
- DOI: 10.1093/gigascience/giad047
Hetnet connectivity search provides rapid insights into how biomedical entities are related
Abstract
Background: Hetnets, short for "heterogeneous networks," contain multiple node and relationship types and offer a way to encode biomedical knowledge. One such example, Hetionet, connects 11 types of nodes-including genes, diseases, drugs, pathways, and anatomical structures-with over 2 million edges of 24 types. Previous work has demonstrated that supervised machine learning methods applied to such networks can identify drug repurposing opportunities. However, a training set of known relationships does not exist for many types of node pairs, even when it would be useful to examine how nodes of those types are meaningfully connected. For example, users may be curious about not only how metformin is related to breast cancer but also how a given gene might be involved in insomnia.
Findings: We developed a new procedure, termed hetnet connectivity search, that proposes important paths between any 2 nodes without requiring a supervised gold standard. The algorithm behind connectivity search identifies types of paths that occur more frequently than would be expected by chance (based on node degree alone). Several optimizations were required to precompute significant instances of node connectivity at the scale of large knowledge graphs.
Conclusion: We implemented the method on Hetionet and provide an online interface at https://het.io/search. We provide an open-source implementation of these methods in our new Python package named hetmatpy.
Keywords: Hetionet; algorithms; bioinformatics; connectivity; hetnets; knowledge graphs; matrix multiplication; networks; path counts; search.
© The Author(s) 2023. Published by Oxford University Press GigaScience.
Conflict of interest statement
This work was supported, in part, by Pfizer Worldwide Research, Development, and Medical.
Figures







Update of
-
Hetnet connectivity search provides rapid insights into how two biomedical entities are related.bioRxiv [Preprint]. 2023 Jan 7:2023.01.05.522941. doi: 10.1101/2023.01.05.522941. bioRxiv. 2023. Update in: Gigascience. 2022 Dec 28;12:giad047. doi: 10.1093/gigascience/giad047. PMID: 36711546 Free PMC article. Updated. Preprint.
References
-
- Himmelstein D, Greene C, Baranzini S. Renaming ‘Heterogeneous Networks’ to a More Concise and Catchy Term. ThinkLab; 2015. 10.15363/thinklab.d104. - DOI
-
- Himmelstein D. Announcing PharmacotherapyDB: The Open Catalog of Drug Therapies for Disease. ThinkLab; 2016. 10.15363/thinklab.d182. - DOI
-
- Himmelstein D. Our Hetnet Edge Prediction Methodology: The Modeling Framework for Project Rephetio. ThinkLab; 2016. 10.15363/thinklab.d210. - DOI
-
- Liben-Nowell D, Kleinberg J. The link-prediction problem for social networks. J Am Soc Inf Sci. 2007;58:1019–31.. 10.1002/asi.20591. - DOI
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources