This is a preprint.
Hetnet connectivity search provides rapid insights into how two biomedical entities are related
- PMID: 36711546
- PMCID: PMC9882000
- DOI: 10.1101/2023.01.05.522941
Hetnet connectivity search provides rapid insights into how two biomedical entities are related
Update in
-
Hetnet connectivity search provides rapid insights into how biomedical entities are related.Gigascience. 2022 Dec 28;12:giad047. doi: 10.1093/gigascience/giad047. Epub 2023 Jul 28. Gigascience. 2022. PMID: 37503959 Free PMC article.
Abstract
Hetnets, short for "heterogeneous networks", contain multiple node and relationship types and offer a way to encode biomedical knowledge. One such example, Hetionet connects 11 types of nodes - including genes, diseases, drugs, pathways, and anatomical structures - with over 2 million edges of 24 types. Previous work has demonstrated that supervised machine learning methods applied to such networks can identify drug repurposing opportunities. However, a training set of known relationships does not exist for many types of node pairs, even when it would be useful to examine how nodes of those types are meaningfully connected. For example, users may be curious not only how metformin is related to breast cancer, but also how the GJA1 gene might be involved in insomnia. We developed a new procedure, termed hetnet connectivity search, that proposes important paths between any two nodes without requiring a supervised gold standard. The algorithm behind connectivity search identifies types of paths that occur more frequently than would be expected by chance (based on node degree alone). We find that predictions are broadly similar to those from previously described supervised approaches for certain node type pairs. Scoring of individual paths is based on the most specific paths of a given type. Several optimizations were required to precompute significant instances of node connectivity at the scale of large knowledge graphs. We implemented the method on Hetionet and provide an online interface at https://het.io/search . We provide an open source implementation of these methods in our new Python package named hetmatpy .
Figures







References
-
- Himmelstein Daniel, Greene Casey, Baranzini Sergio, Renaming ‘heterogeneous networks’ to a more concise and catchy term, ThinkLab (2015-08-16) https://doi.org/f3mn4v, DOI: 10.15363/thinklab.d104 - DOI
-
- Himmelstein Daniel Scott, Lizee Antoine, Hessler Christine, Brueggeman Leo, Chen Sabrina L, Hadley Dexter, Green Ari, Khankhanian Pouya, Baranzini Sergio E, Systematic integration of biomedical knowledge prioritizes drugs for repurposing, eLife (2017-09-22) https://doi.org/cdfk, DOI: 10.7554/elife.26726 - DOI - PMC - PubMed
-
- Himmelstein Daniel, Announcing PharmacotherapyDB: the Open Catalog of Drug Therapies for Disease, ThinkLab (2016-03-15) https://doi.org/f3mqtv, DOI: 10.15363/thinklab.d182 - DOI
-
- Himmelstein Daniel, Our hetnet edge prediction methodology: the modeling framework for Project Rephetio, ThinkLab (2016-05-04) https://doi.org/f3qbmj, DOI: 10.15363/thinklab.d210 - DOI
-
- Liben-Nowell David, Kleinberg Jon, The link-prediction problem for social networks, Journal of the American Society for Information Science and Technology (2007) https://doi.org/c56765, DOI: 10.1002/asi.20591 - DOI
Publication types
Grants and funding
LinkOut - more resources
Full Text Sources