A New Scheme to Characterize and Identify Protein Ubiquitination Sites
- PMID: 26887002
- DOI: 10.1109/TCBB.2016.2520939
A New Scheme to Characterize and Identify Protein Ubiquitination Sites
Abstract
Protein ubiquitination, involving the conjugation of ubiquitin on lysine residue, serves as an important modulator of many cellular functions in eukaryotes. Recent advancements in proteomic technology have stimulated increasing interest in identifying ubiquitination sites. However, most computational tools for predicting ubiquitination sites are focused on small-scale data. With an increasing number of experimentally verified ubiquitination sites, we were motivated to design a predictive model for identifying lysine ubiquitination sites for large-scale proteome dataset. This work assessed not only single features, such as amino acid composition (AAC), amino acid pair composition (AAPC) and evolutionary information, but also the effectiveness of incorporating two or more features into a hybrid approach to model construction. The support vector machine (SVM) was applied to generate the prediction models for ubiquitination site identification. Evaluation by five-fold cross-validation showed that the SVM models learned from the combination of hybrid features delivered a better prediction performance. Additionally, a motif discovery tool, MDDLogo, was adopted to characterize the potential substrate motifs of ubiquitination sites. The SVM models integrating the MDDLogo-identified substrate motifs could yield an average accuracy of 68.70 percent. Furthermore, the independent testing result showed that the MDDLogo-clustered SVM models could provide a promising accuracy (78.50 percent) and perform better than other prediction tools. Two cases have demonstrated the effective prediction of ubiquitination sites with corresponding substrate motifs.
Similar articles
-
UbiSite: incorporating two-layered machine learning method with substrate motifs to predict ubiquitin-conjugation site on lysines.BMC Syst Biol. 2016 Jan 11;10 Suppl 1(Suppl 1):6. doi: 10.1186/s12918-015-0246-z. BMC Syst Biol. 2016. PMID: 26818456 Free PMC article.
-
DeepUbi: a deep learning framework for prediction of ubiquitination sites in proteins.BMC Bioinformatics. 2019 Feb 18;20(1):86. doi: 10.1186/s12859-019-2677-9. BMC Bioinformatics. 2019. PMID: 30777029 Free PMC article.
-
Characterization and identification of lysine glutarylation based on intrinsic interdependence between positions in the substrate sites.BMC Bioinformatics. 2019 Feb 4;19(Suppl 13):384. doi: 10.1186/s12859-018-2394-9. BMC Bioinformatics. 2019. PMID: 30717647 Free PMC article.
-
Proteomic techniques to probe the ubiquitin landscape.Proteomics. 2016 Jan;16(2):273-87. doi: 10.1002/pmic.201500290. Epub 2015 Dec 15. Proteomics. 2016. PMID: 26460060 Review.
-
Proteomic identification of protein ubiquitination events.Biotechnol Genet Eng Rev. 2013;29(1):73-109. doi: 10.1080/02648725.2013.801232. Biotechnol Genet Eng Rev. 2013. PMID: 24568254 Free PMC article. Review.
Cited by
-
Incorporating Deep Learning With Word Embedding to Identify Plant Ubiquitylation Sites.Front Cell Dev Biol. 2020 Sep 30;8:572195. doi: 10.3389/fcell.2020.572195. eCollection 2020. Front Cell Dev Biol. 2020. PMID: 33102477 Free PMC article.
-
Lysine 222 in PPAR γ1 functions as the key site of MuRF2-mediated ubiquitination modification.Sci Rep. 2023 Feb 3;13(1):1999. doi: 10.1038/s41598-023-28905-5. Sci Rep. 2023. PMID: 36737649 Free PMC article.
-
A Caps-Ubi Model for Protein Ubiquitination Site Prediction.Front Plant Sci. 2022 May 25;13:884903. doi: 10.3389/fpls.2022.884903. eCollection 2022. Front Plant Sci. 2022. PMID: 35693166 Free PMC article.
-
Predictive modeling for ubiquitin proteins through advanced machine learning technique.Heliyon. 2024 Jun 6;10(12):e32517. doi: 10.1016/j.heliyon.2024.e32517. eCollection 2024 Jun 30. Heliyon. 2024. PMID: 38975176 Free PMC article.
-
Large-scale prediction of protein ubiquitination sites using a multimodal deep architecture.BMC Syst Biol. 2018 Nov 22;12(Suppl 6):109. doi: 10.1186/s12918-018-0628-0. BMC Syst Biol. 2018. PMID: 30463553 Free PMC article.
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources