Method for Essential Protein Prediction Based on a Novel Weighted Protein-Domain Interaction Network
- PMID: 33815480
- PMCID: PMC8010314
- DOI: 10.3389/fgene.2021.645932
Method for Essential Protein Prediction Based on a Novel Weighted Protein-Domain Interaction Network
Abstract
In recent years a number of calculative models based on protein-protein interaction (PPI) networks have been proposed successively. However, due to false positives, false negatives, and the incompleteness of PPI networks, there are still many challenges affecting the design of computational models with satisfactory predictive accuracy when inferring key proteins. This study proposes a prediction model called WPDINM for detecting key proteins based on a novel weighted protein-domain interaction (PDI) network. In WPDINM, a weighted PPI network is constructed first by combining the gene expression data of proteins with topological information extracted from the original PPI network. Simultaneously, a weighted domain-domain interaction (DDI) network is constructed based on the original PDI network. Next, through integrating the newly obtained weighted PPI network and weighted DDI network with the original PDI network, a weighted PDI network is further constructed. Then, based on topological features and biological information, including the subcellular localization and orthologous information of proteins, a novel PageRank-based iterative algorithm is designed and implemented on the newly constructed weighted PDI network to estimate the criticality of proteins. Finally, to assess the prediction performance of WPDINM, we compared it with 12 kinds of competitive measures. Experimental results show that WPDINM can achieve a predictive accuracy rate of 90.19, 81.96, 70.72, 62.04, 55.83, and 51.13% in the top 1%, top 5%, top 10%, top 15%, top 20%, and top 25% separately, which exceeds the prediction accuracy achieved by traditional state-of-the-art competing measures. Owing to the satisfactory identification effect, the WPDINM measure may contribute to the further development of key protein identification.
Keywords: computational model; domain-domain interaction network; essential proteins; protein-domain interaction network; protein-protein interaction network.
Copyright © 2021 Meng, Kuang, Chen, Zhang, Tan, Li and Wang.
Conflict of interest statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest. The handling editor QZ declared a past co-authorship/collaboration with one of the authors LW.
Figures







Similar articles
-
A Novel Model for Identifying Essential Proteins Based on Key Target Convergence Sets.Front Genet. 2021 Jul 29;12:721486. doi: 10.3389/fgene.2021.721486. eCollection 2021. Front Genet. 2021. PMID: 34394201 Free PMC article.
-
Method for Identifying Essential Proteins by Key Features of Proteins in a Novel Protein-Domain Network.Front Genet. 2021 Jun 29;12:708162. doi: 10.3389/fgene.2021.708162. eCollection 2021. Front Genet. 2021. PMID: 34267785 Free PMC article.
-
An iteration method for identifying yeast essential proteins from heterogeneous network.BMC Bioinformatics. 2019 Jun 24;20(1):355. doi: 10.1186/s12859-019-2930-2. BMC Bioinformatics. 2019. PMID: 31234779 Free PMC article.
-
A Novel Collaborative Filtering Model-Based Method for Identifying Essential Proteins.Front Genet. 2021 Oct 21;12:763153. doi: 10.3389/fgene.2021.763153. eCollection 2021. Front Genet. 2021. PMID: 34745230 Free PMC article. Review.
-
An iteration model for identifying essential proteins by combining comprehensive PPI network with biological information.BMC Bioinformatics. 2021 Sep 8;22(1):430. doi: 10.1186/s12859-021-04300-7. BMC Bioinformatics. 2021. PMID: 34496745 Free PMC article.
Cited by
-
A Novel Model for Identifying Essential Proteins Based on Key Target Convergence Sets.Front Genet. 2021 Jul 29;12:721486. doi: 10.3389/fgene.2021.721486. eCollection 2021. Front Genet. 2021. PMID: 34394201 Free PMC article.
-
Identification of essential proteins based on edge features and the fusion of multiple-source biological information.BMC Bioinformatics. 2023 May 17;24(1):203. doi: 10.1186/s12859-023-05315-y. BMC Bioinformatics. 2023. PMID: 37198530 Free PMC article.
-
ECDEP: identifying essential proteins based on evolutionary community discovery and subcellular localization.BMC Genomics. 2024 Jan 26;25(1):117. doi: 10.1186/s12864-024-10019-5. BMC Genomics. 2024. PMID: 38279081 Free PMC article.
-
A deep learning framework for identifying essential proteins based on multiple biological information.BMC Bioinformatics. 2022 Aug 4;23(1):318. doi: 10.1186/s12859-022-04868-8. BMC Bioinformatics. 2022. PMID: 35927611 Free PMC article.
-
Method for Identifying Essential Proteins by Key Features of Proteins in a Novel Protein-Domain Network.Front Genet. 2021 Jun 29;12:708162. doi: 10.3389/fgene.2021.708162. eCollection 2021. Front Genet. 2021. PMID: 34267785 Free PMC article.
References
LinkOut - more resources
Full Text Sources
Other Literature Sources