. 2021 Apr 27;13(9):2111.

doi: 10.3390/cancers13092111.

A Novel Method to Predict Drug-Target Interactions Based on Large-Scale Graph Representation Learning

Bo-Wei Zhao^{1

2

3}, Zhu-Hong You^{1

2

3}, Lun Hu^{1

2

3}, Zhen-Hao Guo^{1

2

3}, Lei Wang^{1

2

3}, Zhan-Heng Chen⁴, Leon Wong^{1

2

3}

Affiliations

¹ The Xinjiang Technical Institute of Physics & Chemistry, Chinese Academy of Sciences, Urumqi 830011, China.
² University of Chinese Academy of Sciences, Beijing 100049, China.
³ Xinjiang Laboratory of Minority Speech and Language Information Processing, Urumqi 830011, China.
⁴ College of Computer Science and Software Engineering, Shenzhen University, Shenzhen 518060, China.

PMID: 33925568
PMCID: PMC8123765
DOI: 10.3390/cancers13092111

A Novel Method to Predict Drug-Target Interactions Based on Large-Scale Graph Representation Learning

Bo-Wei Zhao et al. Cancers (Basel). 2021.

. 2021 Apr 27;13(9):2111.

doi: 10.3390/cancers13092111.

Authors

Bo-Wei Zhao^{1

2

3}, Zhu-Hong You^{1

2

3}, Lun Hu^{1

2

3}, Zhen-Hao Guo^{1

2

3}, Lei Wang^{1

2

3}, Zhan-Heng Chen⁴, Leon Wong^{1

2

3}

Affiliations

¹ The Xinjiang Technical Institute of Physics & Chemistry, Chinese Academy of Sciences, Urumqi 830011, China.
² University of Chinese Academy of Sciences, Beijing 100049, China.
³ Xinjiang Laboratory of Minority Speech and Language Information Processing, Urumqi 830011, China.
⁴ College of Computer Science and Software Engineering, Shenzhen University, Shenzhen 518060, China.

PMID: 33925568
PMCID: PMC8123765
DOI: 10.3390/cancers13092111

Abstract

Identification of drug-target interactions (DTIs) is a significant step in the drug discovery or repositioning process. Compared with the time-consuming and labor-intensive in vivo experimental methods, the computational models can provide high-quality DTI candidates in an instant. In this study, we propose a novel method called LGDTI to predict DTIs based on large-scale graph representation learning. LGDTI can capture the local and global structural information of the graph. Specifically, the first-order neighbor information of nodes can be aggregated by the graph convolutional network (GCN); on the other hand, the high-order neighbor information of nodes can be learned by the graph embedding method called DeepWalk. Finally, the two kinds of feature are fed into the random forest classifier to train and predict potential DTIs. The results show that our method obtained area under the receiver operating characteristic curve (AUROC) of 0.9455 and area under the precision-recall curve (AUPR) of 0.9491 under 5-fold cross-validation. Moreover, we compare the presented method with some existing state-of-the-art methods. These results imply that LGDTI can efficiently and robustly capture undiscovered DTIs. Moreover, the proposed model is expected to bring new inspiration and provide novel perspectives to relevant researchers.

Keywords: computational method; drug discovery; drug-target interactions; large-scale graph representation learning.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

**Figure 1**
A schematic diagram of the drug molecular structure is constructed as bit vectors. A is the structure of a drug molecule, and B, C, and D are all substructures of the drug molecule, corresponding to the converted bit (represented by the small black box), respectively.

**Figure 2**
An example of large-scale graph representation learning. (A) The schematic diagram of the relationship between drugs and targets. (B) An example of the graph embedding in drug-target interactions (DTIs). (C) An example of the graph convolutional network.

**Figure 3**
The flowchart of the proposed large-scale graph representation learning DTI (LGDTI). (a) A bipartite graph of DTIs. The solid black line is described as known DTIs, and the dashed red line is described as latent DTIs. (b) Part A constructed an adjacency graph containing a self-loop, in which green nodes are drugs and purple nodes are targets, and the information of *first-order* neighbors of each node is aggregated through graph convolutional network. Part B represented *high-order* information of each node in a bipartite graph by DeepWalk. (c) The two kinds of representation features are integrated. (d) Random forest classifier is trained and used for predicting new DTIs.

**Figure 4**
The receiver operating characteristic (ROC) and precision-recall (PR) curves under 5-fold cross-validation.

**Figure 5**
Comparison of the ROC and PR curves performed based on different machine learning classifier.

**Figure 6**
Comparison of the ROC and PR curves performed by random forest classifier based on different features.

See this image and copyright information in PMC

Cited by

Advancing drug-target interaction prediction: a comprehensive graph-based approach integrating knowledge graph embedding and ProtBert pretraining.
Djeddi WE, Hermi K, Ben Yahia S, Diallo G. Djeddi WE, et al. BMC Bioinformatics. 2023 Dec 19;24(1):488. doi: 10.1186/s12859-023-05593-6. BMC Bioinformatics. 2023. PMID: 38114937 Free PMC article.
Drug repurposing and prediction of multiple interaction types via graph embedding.
Amiri Souri E, Chenoweth A, Karagiannis SN, Tsoka S. Amiri Souri E, et al. BMC Bioinformatics. 2023 May 17;24(1):202. doi: 10.1186/s12859-023-05317-w. BMC Bioinformatics. 2023. PMID: 37193964 Free PMC article.
iPiDA-GCN: Identification of piRNA-disease associations based on Graph Convolutional Network.
Hou J, Wei H, Liu B. Hou J, et al. PLoS Comput Biol. 2022 Oct 27;18(10):e1010671. doi: 10.1371/journal.pcbi.1010671. eCollection 2022 Oct. PLoS Comput Biol. 2022. PMID: 36301998 Free PMC article.
Leveraging pre-trained language models for mining microbiome-disease relationships.
Karkera N, Acharya S, Palaniappan SK. Karkera N, et al. BMC Bioinformatics. 2023 Jul 19;24(1):290. doi: 10.1186/s12859-023-05411-z. BMC Bioinformatics. 2023. PMID: 37468830 Free PMC article.
Robust and accurate prediction of self-interacting proteins from protein sequence information by exploiting weighted sparse representation based classifier.
Li Y, Hu XG, You ZH, Li LP, Li PP, Wang YB, Huang YA. Li Y, et al. BMC Bioinformatics. 2022 Dec 1;23(Suppl 7):518. doi: 10.1186/s12859-022-04880-y. BMC Bioinformatics. 2022. PMID: 36457083 Free PMC article.

See all "Cited by" articles

References

1. Dickson M., Gagnon J.P. The cost of new drug discovery and development. Discov. Med. 2009;4:172–179. - PubMed
1. DiMasi J.A., Hansen R.W., Grabowski H.G. The price of innovation: New estimates of drug development costs. J. Health Econ. 2003;22:151–185. doi: 10.1016/S0167-6296(02)00126-1. - DOI - PubMed
1. Li J., Zheng S., Chen B., Butte A.J., Swamidass S.J., Lu Z. A survey of current trends in computational drug repositioning. Brief. Bioinform. 2016;17:2–12. doi: 10.1093/bib/bbv020. - DOI - PMC - PubMed
1. Napolitano F., Zhao Y., Moreira V.M., Tagliaferri R., Kere J., D’Amato M., Greco D. Drug repositioning: A machine-learning approach through data integration. J. Cheminform. 2013;5:30. doi: 10.1186/1758-2946-5-30. - DOI - PMC - PubMed
1. Wu C., Gudivada R.C., Aronow B.J., Jegga A.G. Computational drug repositioning through heterogeneous network clustering. BMC Syst. Biol. 2013;7:S6. doi: 10.1186/1752-0509-7-S5-S6. - DOI - PMC - PubMed

Grants and funding

LinkOut - more resources

Full Text Sources
Other Literature Sources
- scite Smart Citations

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

A Novel Method to Predict Drug-Target Interactions Based on Large-Scale Graph Representation Learning

Affiliations

A Novel Method to Predict Drug-Target Interactions Based on Large-Scale Graph Representation Learning

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

Similar articles

Cited by

References

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources