HeteroKGRep: Heterogeneous Knowledge Graph based Drug Repositioning
- PMID: 39610660
- PMCID: PMC11600970
- DOI: 10.1016/j.knosys.2024.112638
HeteroKGRep: Heterogeneous Knowledge Graph based Drug Repositioning
Abstract
The process of developing new drugs is both time-consuming and costly, often taking over a decade and billions of dollars to obtain regulatory approval. Additionally, the complexity of patent protection for novel compounds presents challenges for pharmaceutical innovation. Drug repositioning offers an alternative strategy to uncover new therapeutic uses for existing medicines. Previous repositioning models have been limited by their reliance on homogeneous data sources, failing to leverage the rich information available in heterogeneous biomedical knowledge graphs. We propose HeteroKGRep, a novel drug repositioning model that utilizes heterogeneous graphs to address these limitations. HeteroKGRep is a multi-step framework that first generates a similarity graph from hierarchical concept relations. It then applies SMOTE over-sampling to address class imbalance before generating node sequences using a heterogeneous graph neural network. Drug and disease embeddings are extracted from the network and used for prediction. We evaluated HeteroKGRep on a graph containing biomedical concepts and relations from ontologies, pathways and literature. It achieved state-of-the-art performance with 99% accuracy, 95% AUC ROC and 94% average precision on predicting repurposing opportunities. Compared to existing homogeneous approaches, HeteroKGRep leverages diverse knowledge sources to enrich representation learning. Based on heterogeneous graphs, HeteroKGRep can discover new drug-desease associations, leveraging de novo drug development. This work establishes a promising new paradigm for knowledge-guided drug repositioning using multimodal biomedical data.
Keywords: biomedical heterogeneous graph; deep learning; drug repurposing.
Conflict of interest statement
Declaration of Interest statement The authors declare that they have no conflict of interest.
Similar articles
-
HPO2Vec+: Leveraging heterogeneous knowledge resources to enrich node embeddings for the Human Phenotype Ontology.J Biomed Inform. 2019 Aug;96:103246. doi: 10.1016/j.jbi.2019.103246. Epub 2019 Jun 27. J Biomed Inform. 2019. PMID: 31255713 Free PMC article.
-
Task-driven knowledge graph filtering improves prioritizing drugs for repurposing.BMC Bioinformatics. 2022 Mar 4;23(1):84. doi: 10.1186/s12859-022-04608-y. BMC Bioinformatics. 2022. PMID: 35246025 Free PMC article.
-
Deep multiple instance learning on heterogeneous graph for drug-disease association prediction.Comput Biol Med. 2025 Jan;184:109403. doi: 10.1016/j.compbiomed.2024.109403. Epub 2024 Nov 21. Comput Biol Med. 2025. PMID: 39577348
-
Knowledge graphs and their applications in drug discovery.Expert Opin Drug Discov. 2021 Sep;16(9):1057-1069. doi: 10.1080/17460441.2021.1910673. Epub 2021 Apr 12. Expert Opin Drug Discov. 2021. PMID: 33843398 Review.
-
Toward better drug discovery with knowledge graph.Curr Opin Struct Biol. 2022 Feb;72:114-126. doi: 10.1016/j.sbi.2021.09.003. Epub 2021 Oct 11. Curr Opin Struct Biol. 2022. PMID: 34649044 Review.
References
-
- Cantürk S, Singh A, St-Amant P, Behrmann J, Machine-learning driven drug repurposing for covid-19, arXiv preprint arXiv:2006.14707 (2020).
-
- Yingngam B, Machine learning applications for drug repurposing, Artificial Intelligence and Machine Learning in Drug Design and Development (2024) 251–294.
-
- Papikinos T, Krokidis MG, Vrahatis AG, Vlachakis D, Vlamos P, Exarchos TP, Deep learning methods for drug repurposing through heterogeneous data, in: Advances in Artificial Intelligence, Elsevier, 2024, pp. 295–313.
-
- Zhang C, Song D, Huang C, Swami A, Chawla NV, Heterogeneous graph neural network, in: Proceedings of the 25th ACM SIGKDD international conference on knowledge discovery & data mining, 2019, pp. 793–803.
-
- Gao Z, Ding P, Xu R, Iuphar review–data-driven computational drug repurposing approaches for opioid use disorder, Pharmacological Research 199 (2024) 106960. - PubMed