Multimodal deep representation learning for protein interaction identification and protein family classification
- PMID: 31787089
- PMCID: PMC6886253
- DOI: 10.1186/s12859-019-3084-y
Multimodal deep representation learning for protein interaction identification and protein family classification
Abstract
Background: Protein-protein interactions(PPIs) engage in dynamic pathological and biological procedures constantly in our life. Thus, it is crucial to comprehend the PPIs thoroughly such that we are able to illuminate the disease occurrence, achieve the optimal drug-target therapeutic effect and describe the protein complex structures. However, compared to the protein sequences obtainable from various species and organisms, the number of revealed protein-protein interactions is relatively limited. To address this dilemma, lots of research endeavor have investigated in it to facilitate the discovery of novel PPIs. Among these methods, PPI prediction techniques that merely rely on protein sequence data are more widespread than other methods which require extensive biological domain knowledge.
Results: In this paper, we propose a multi-modal deep representation learning structure by incorporating protein physicochemical features with the graph topological features from the PPI networks. Specifically, our method not only bears in mind the protein sequence information but also discerns the topological representations for each protein node in the PPI networks. In our paper, we construct a stacked auto-encoder architecture together with a continuous bag-of-words (CBOW) model based on generated metapaths to study the PPI predictions. Following by that, we utilize the supervised deep neural networks to identify the PPIs and classify the protein families. The PPI prediction accuracy for eight species ranged from 96.76% to 99.77%, which signifies that our multi-modal deep representation learning framework achieves superior performance compared to other computational methods.
Conclusion: To the best of our knowledge, this is the first multi-modal deep representation learning framework for examining the PPI networks.
Keywords: Knowledge graph representation learning; Multimodal deep neural network; Protein-protein interaction network.
Conflict of interest statement
The authors declare that they have no competing interests.
Figures











Similar articles
-
Predicting protein-protein interaction with interpretable bilinear attention network.Comput Methods Programs Biomed. 2025 Jun;265:108756. doi: 10.1016/j.cmpb.2025.108756. Epub 2025 Mar 30. Comput Methods Programs Biomed. 2025. PMID: 40174317
-
Graph-based prediction of Protein-protein interactions with attributed signed graph embedding.BMC Bioinformatics. 2020 Jul 21;21(1):323. doi: 10.1186/s12859-020-03646-8. BMC Bioinformatics. 2020. PMID: 32693790 Free PMC article.
-
Completing sparse and disconnected protein-protein network by deep learning.BMC Bioinformatics. 2018 Mar 22;19(1):103. doi: 10.1186/s12859-018-2112-7. BMC Bioinformatics. 2018. PMID: 29566671 Free PMC article.
-
Targeting Virus-host Protein Interactions: Feature Extraction and Machine Learning Approaches.Curr Drug Metab. 2019;20(3):177-184. doi: 10.2174/1389200219666180829121038. Curr Drug Metab. 2019. PMID: 30156155 Review.
-
Protein-protein interaction detection using deep learning: A survey, comparative analysis, and experimental evaluation.Comput Biol Med. 2025 Feb;185:109449. doi: 10.1016/j.compbiomed.2024.109449. Epub 2024 Dec 6. Comput Biol Med. 2025. PMID: 39644584 Review.
Cited by
-
Biological network analysis with deep learning.Brief Bioinform. 2021 Mar 22;22(2):1515-1530. doi: 10.1093/bib/bbaa257. Brief Bioinform. 2021. PMID: 33169146 Free PMC article. Review.
-
Product Manifold Representations for Learning on Biological Pathways.ArXiv [Preprint]. 2025 Feb 4:arXiv:2401.15478v2. ArXiv. 2025. PMID: 39975438 Free PMC article. Preprint.
-
Recent advances in deep learning for protein-protein interaction: a review.BioData Min. 2025 Jun 16;18(1):43. doi: 10.1186/s13040-025-00457-6. BioData Min. 2025. PMID: 40524189 Free PMC article. Review.
-
Synthetic whole-slide image tile generation with gene expression profile-infused deep generative models.Cell Rep Methods. 2023 Jul 19;3(8):100534. doi: 10.1016/j.crmeth.2023.100534. eCollection 2023 Aug 28. Cell Rep Methods. 2023. PMID: 37671024 Free PMC article.
-
Detecting Protein Communities in Native Cell Extracts by Machine Learning: A Structural Biologist's Perspective.Front Mol Biosci. 2021 Apr 15;8:660542. doi: 10.3389/fmolb.2021.660542. eCollection 2021. Front Mol Biosci. 2021. PMID: 33937337 Free PMC article.
References
-
- Zhou Yu Zhen, Gao Yun, Zheng Ying Ying. Communications in Computer and Information Science. Berlin, Heidelberg: Springer Berlin Heidelberg; 2011. Prediction of Protein-Protein Interactions Using Local Description of Amino Acid Sequence; pp. 254–262.
MeSH terms
Substances
LinkOut - more resources
Full Text Sources