SSGraphCPI: A Novel Model for Predicting Compound-Protein Interactions Based on Deep Learning
- PMID: 35409140
- PMCID: PMC8998983
- DOI: 10.3390/ijms23073780
SSGraphCPI: A Novel Model for Predicting Compound-Protein Interactions Based on Deep Learning
Abstract
Identifying compound-protein (drug-target, DTI) interactions (CPI) accurately is a key step in drug discovery. Including virtual screening and drug reuse, it can significantly reduce the time it takes to identify drug candidates and provide patients with timely and effective treatment. Recently, more and more researchers have developed CPI's deep learning model, including feature representation of a 2D molecular graph of a compound using a graph convolutional neural network, but this method loses much important information about the compound. In this paper, we propose a novel three-channel deep learning framework, named SSGraphCPI, for CPI prediction, which is composed of recurrent neural networks with an attentional mechanism and graph convolutional neural network. In our model, the characteristics of compounds are extracted from 1D SMILES string and 2D molecular graph. Using both the 1D SMILES string sequence and the 2D molecular graph can provide both sequential and structural features for CPI predictions. Additionally, we select the 1D CNN module to learn the hidden data patterns in the sequence to mine deeper information. Our model is much more suitable for collecting more effective information of compounds. Experimental results show that our method achieves significant performances with RMSE (Root Mean Square Error) = 2.24 and R2 (degree of linear fitting of the model) = 0.039 on the GPCR (G Protein-Coupled Receptors) dataset, and with RMSE = 2.64 and R2 = 0.018 on the GPCR dataset RMSE, which preforms better than some classical deep learning models, including RNN/GCNN-CNN, GCNNet and GATNet.
Keywords: IC50 value; compound properties; compound-protein interactions; deep learning; protein preperties.
Conflict of interest statement
The authors declare no conflict of interest.
Figures






Similar articles
-
DL-SMILES#: A Novel Encoding Scheme for Predicting Compound Protein Affinity Using Deep Learning.Comb Chem High Throughput Screen. 2022;25(4):642-650. doi: 10.2174/1386207324666210219102728. Comb Chem High Throughput Screen. 2022. PMID: 33605851
-
MDL-CPI: Multi-view deep learning model for compound-protein interaction prediction.Methods. 2022 Aug;204:418-427. doi: 10.1016/j.ymeth.2022.01.008. Epub 2022 Jan 31. Methods. 2022. PMID: 35114401
-
Effectively Identifying Compound-Protein Interaction Using Graph Neural Representation.IEEE/ACM Trans Comput Biol Bioinform. 2023 Mar-Apr;20(2):932-943. doi: 10.1109/TCBB.2022.3198003. Epub 2023 Apr 3. IEEE/ACM Trans Comput Biol Bioinform. 2023. PMID: 35951570
-
Data Integration Using Advances in Machine Learning in Drug Discovery and Molecular Biology.Methods Mol Biol. 2021;2190:167-184. doi: 10.1007/978-1-0716-0826-5_7. Methods Mol Biol. 2021. PMID: 32804365 Review.
-
An inductive graph neural network model for compound-protein interaction prediction based on a homogeneous graph.Brief Bioinform. 2022 May 13;23(3):bbac073. doi: 10.1093/bib/bbac073. Brief Bioinform. 2022. PMID: 35275993 Free PMC article. Review.
Cited by
-
Induced Pluripotent Stem Cell-Based Drug Screening by Use of Artificial Intelligence.Pharmaceuticals (Basel). 2022 Apr 30;15(5):562. doi: 10.3390/ph15050562. Pharmaceuticals (Basel). 2022. PMID: 35631387 Free PMC article. Review.
-
SaeGraphDTI: drug-target interaction prediction based on sequence attribute extraction and graph neural network.BMC Bioinformatics. 2025 Jul 15;26(1):177. doi: 10.1186/s12859-025-06195-0. BMC Bioinformatics. 2025. PMID: 40670964 Free PMC article.
-
MCL-DTI: using drug multimodal information and bi-directional cross-attention learning method for predicting drug-target interaction.BMC Bioinformatics. 2023 Aug 26;24(1):323. doi: 10.1186/s12859-023-05447-1. BMC Bioinformatics. 2023. PMID: 37633938 Free PMC article.
-
In silico protein function prediction: the rise of machine learning-based approaches.Med Rev (2021). 2023 Nov 29;3(6):487-510. doi: 10.1515/mr-2023-0038. eCollection 2023 Dec. Med Rev (2021). 2023. PMID: 38282798 Free PMC article. Review.
-
Integrating Multiple Single-Cell RNA Sequencing Datasets Using Adversarial Autoencoders.Int J Mol Sci. 2023 Mar 13;24(6):5502. doi: 10.3390/ijms24065502. Int J Mol Sci. 2023. PMID: 36982574 Free PMC article.
References
-
- Scannell J.W., Blanckley A., Boldon H., Warrington B. Diagnosing the decline in pharmaceutical R&D efficiency. Nat. Rev. Drug Discov. 2012;11:191–200. - PubMed
-
- Weininger D. SMILES, a chemical language and information system. 1. Introduction to methodology and encoding rules. J. Chem. Inf. Comput. Sci. 1988;28:31–36. doi: 10.1021/ci00057a005. - DOI
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources