Large-scale prediction of protein ubiquitination sites using a multimodal deep architecture
- PMID: 30463553
- PMCID: PMC6249717
- DOI: 10.1186/s12918-018-0628-0
Abstract
Background: Ubiquitination, also called "lysine ubiquitination", occurs when a ubiquitin is attached to lysine (K) residues in target proteins. As one of the most important post-translational modifications (PTMs), it plays a significant role not only in protein degradation but also in other cellular functions. Thus, systematic dissection of the ubiquitination proteome is an appealing and challenging research topic. Existing methods for identifying protein ubiquitination sites fall into two kinds: mass spectrometry and computational methods. Mass spectrometry-based experimental methods can discover ubiquitination sites in eukaryotes, but they are time-consuming and expensive. It is therefore a priority to develop computational approaches that can effectively and accurately identify protein ubiquitination sites.
Results: Existing computational methods usually require feature engineering, which may lead to redundant and biased representations. Deep learning, by contrast, can extract underlying characteristics from large-scale training data via multi-layer networks and non-linear mapping operations. In this paper, we propose a deep architecture with multiple modalities to identify ubiquitination sites. First, drawing on prior and biological knowledge, we encoded protein sequence fragments around candidate ubiquitination sites into three modalities, namely raw protein sequence fragments, physico-chemical properties and sequence profiles, and designed different deep network layers to extract hidden representations from them. Then, the deep representations generated for the three modalities were merged to build the final model. We evaluated our algorithm on PLMD, the largest available protein ubiquitination site database, and achieved 66.4% specificity, 66.7% sensitivity, 66.43% accuracy, and an MCC of 0.221. A number of comparative experiments also indicated that our multimodal deep architecture outperformed several popular protein ubiquitination site prediction tools.
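As an illustration of the first modality only (raw sequence fragments around candidate lysines), the following is a minimal sketch and not the authors' implementation. The flank size, the padding symbol `X`, the function names and the toy sequence are all assumptions made for this example; the abstract does not specify them.

```python
# Sketch: extract lysine-centred fragments and one-hot encode them.
# All names and parameters here are hypothetical, chosen for illustration.

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues
PAD = "X"  # assumed padding symbol for positions beyond the termini


def lysine_windows(sequence, flank=3):
    """Yield (position, fragment) for every lysine (K) in the sequence.

    Each fragment has length 2 * flank + 1 with the lysine at its centre;
    positions past either terminus are filled with PAD.
    """
    padded = PAD * flank + sequence + PAD * flank
    for i, residue in enumerate(sequence):
        if residue == "K":
            yield i, padded[i : i + 2 * flank + 1]


def one_hot(fragment):
    """Encode a fragment as a list of 20-dimensional rows.

    Padding or non-standard residues are encoded as all-zero rows.
    """
    index = {aa: j for j, aa in enumerate(AMINO_ACIDS)}
    matrix = []
    for aa in fragment:
        row = [0] * len(AMINO_ACIDS)
        col = index.get(aa)
        if col is not None:
            row[col] = 1
        matrix.append(row)
    return matrix


if __name__ == "__main__":
    toy_sequence = "MKAAK"  # made-up sequence, not real data
    for position, fragment in lysine_windows(toy_sequence, flank=2):
        encoded = one_hot(fragment)
        print(position, fragment, len(encoded), "x", len(encoded[0]))
```

Each encoded fragment could then be fed to a sequence model; the physico-chemical and sequence-profile modalities would need their own encodings, which are not sketched here.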
Conclusion: The comparative experiments validated the effectiveness of our deep network and showed that our method outperformed several popular protein ubiquitination site prediction tools. The source code of our proposed method is available at https://github.com/jiagenlee/deepUbiquitylation .
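The four evaluation measures quoted in the Results (specificity, sensitivity, accuracy and MCC) have standard definitions in terms of a binary confusion matrix. The sketch below uses those textbook formulas; the function name and the counts in the usage lines are hypothetical and are not taken from the paper.

```python
import math


def binary_metrics(tp, fn, tn, fp):
    """Return sensitivity, specificity, accuracy and MCC from confusion counts.

    tp/fn: ubiquitinated sites predicted positive/negative.
    tn/fp: non-ubiquitinated sites predicted negative/positive.
    """
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    accuracy = (tp + tn) / (tp + fn + tn + fp)
    # Matthews correlation coefficient; defined as 0 when the denominator is 0.
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "accuracy": accuracy,
        "mcc": mcc,
    }


if __name__ == "__main__":
    # Hypothetical counts for illustration only.
    print(binary_metrics(tp=10, fn=0, tn=10, fp=0))
    print(binary_metrics(tp=50, fn=50, tn=50, fp=50))
```

Because MCC uses all four cells of the confusion matrix, it can stay modest even when sensitivity and specificity are similar, for example when negatives greatly outnumber positives.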
Keywords: Convolution neural network; Deep learning; Deep neural network; Multiple modalities; Protein ubiquitination site.
Conflict of interest statement
Ethics approval and consent to participate
Not applicable.
Consent for publication
Not applicable.
Competing interests
The authors declare that they have no competing interests.
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Similar articles
- DeepUbi: a deep learning framework for prediction of ubiquitination sites in proteins. BMC Bioinformatics. 2019 Feb 18;20(1):86. doi: 10.1186/s12859-019-2677-9. PMID: 30777029. Free PMC article.
- Prediction of lysine ubiquitination with mRMR feature selection and analysis. Amino Acids. 2012 Apr;42(4):1387-95. doi: 10.1007/s00726-011-0835-0. Epub 2011 Jan 26. PMID: 21267749.
- DeepTL-Ubi: A novel deep transfer learning method for effectively predicting ubiquitination sites of multiple species. Methods. 2021 Aug;192:103-111. doi: 10.1016/j.ymeth.2020.08.003. Epub 2020 Aug 10. PMID: 32791338.
- Large-scale comparative assessment of computational predictors for lysine post-translational modification sites. Brief Bioinform. 2019 Nov 27;20(6):2267-2290. doi: 10.1093/bib/bby089. PMID: 30285084. Free PMC article. Review.
- Towards Computational Models of Identifying Protein Ubiquitination Sites. Curr Drug Targets. 2019;20(5):565-578. doi: 10.2174/1389450119666180924150202. PMID: 30246637. Review.
Cited by
- Machine learning-based approaches for ubiquitination site prediction in human proteins. BMC Bioinformatics. 2023 Nov 28;24(1):449. doi: 10.1186/s12859-023-05581-w. PMID: 38017391. Free PMC article.
- An Ensemble Deep Learning based Predictor for Simultaneously Identifying Protein Ubiquitylation and SUMOylation Sites. BMC Bioinformatics. 2021 Oct 24;22(1):519. doi: 10.1186/s12859-021-04445-5. PMID: 34689734. Free PMC article.
- CL-ACP: a parallel combination of CNN and LSTM anticancer peptide recognition model. BMC Bioinformatics. 2021 Oct 20;22(1):512. doi: 10.1186/s12859-021-04433-9. PMID: 34670488. Free PMC article.
- A Caps-Ubi Model for Protein Ubiquitination Site Prediction. Front Plant Sci. 2022 May 25;13:884903. doi: 10.3389/fpls.2022.884903. eCollection 2022. PMID: 35693166. Free PMC article.
- Ubigo-X: Protein ubiquitination site prediction using ensemble learning with image-based feature representation and weighted voting. Comput Struct Biotechnol J. 2025 Jul 14;27:3137-3146. doi: 10.1016/j.csbj.2025.07.025. eCollection 2025. PMID: 40727425. Free PMC article.