Prediction of lysine ubiquitylation with ensemble classifier and feature selection
- PMID: 22272076
- PMCID: PMC3257073
- DOI: 10.3390/ijms12128347
Prediction of lysine ubiquitylation with ensemble classifier and feature selection
Abstract
Ubiquitylation is an important process of post-translational modification. Correct identification of protein lysine ubiquitylation sites is of fundamental importance to understand the molecular mechanism of lysine ubiquitylation in biological systems. This paper develops a novel computational method to effectively identify the lysine ubiquitylation sites based on the ensemble approach. In the proposed method, 468 ubiquitylation sites from 323 proteins retrieved from the Swiss-Prot database were encoded into feature vectors by using four kinds of protein sequences information. An effective feature selection method was then applied to extract informative feature subsets. After different feature subsets were obtained by setting different starting points in the search procedure, they were used to train multiple random forests classifiers and then aggregated into a consensus classifier by majority voting. Evaluated by jackknife tests and independent tests respectively, the accuracy of the proposed predictor reached 76.82% for the training dataset and 79.16% for the test dataset, indicating that this predictor is a useful tool to predict lysine ubiquitylation sites. Furthermore, site-specific feature analysis was performed and it was shown that ubiquitylation is intimately correlated with the features of its surrounding sites in addition to features derived from the lysine site itself. The feature selection method is available upon request.
Keywords: ensemble classifier; lysine ubiquitylation sites; support vector machine; ubiquitylation.
Figures






Similar articles
-
UbiSite: incorporating two-layered machine learning method with substrate motifs to predict ubiquitin-conjugation site on lysines.BMC Syst Biol. 2016 Jan 11;10 Suppl 1(Suppl 1):6. doi: 10.1186/s12918-015-0246-z. BMC Syst Biol. 2016. PMID: 26818456 Free PMC article.
-
Computational identification of ubiquitylation sites from protein sequences.BMC Bioinformatics. 2008 Jul 15;9:310. doi: 10.1186/1471-2105-9-310. BMC Bioinformatics. 2008. PMID: 18625080 Free PMC article.
-
Improved Species-Specific Lysine Acetylation Site Prediction Based on a Large Variety of Features Set.PLoS One. 2016 May 16;11(5):e0155370. doi: 10.1371/journal.pone.0155370. eCollection 2016. PLoS One. 2016. PMID: 27183223 Free PMC article.
-
Towards Computational Models of Identifying Protein Ubiquitination Sites.Curr Drug Targets. 2019;20(5):565-578. doi: 10.2174/1389450119666180924150202. Curr Drug Targets. 2019. PMID: 30246637 Review.
-
Non-lysine ubiquitylation: Doing things differently.Front Mol Biosci. 2022 Sep 19;9:1008175. doi: 10.3389/fmolb.2022.1008175. eCollection 2022. Front Mol Biosci. 2022. PMID: 36200073 Free PMC article. Review.
Cited by
-
Prediction of protein phosphorylation sites by using the composition of k-spaced amino acid pairs.PLoS One. 2012;7(10):e46302. doi: 10.1371/journal.pone.0046302. Epub 2012 Oct 22. PLoS One. 2012. PMID: 23110047 Free PMC article.
-
Characterization and identification of ubiquitin conjugation sites with E3 ligase recognition specificities.BMC Bioinformatics. 2015;16 Suppl 1(Suppl 1):S1. doi: 10.1186/1471-2105-16-S1-S1. Epub 2015 Jan 21. BMC Bioinformatics. 2015. PMID: 25707307 Free PMC article.
-
Prediction of bioluminescent proteins using auto covariance transformation of evolutional profiles.Int J Mol Sci. 2012;13(3):3650-3660. doi: 10.3390/ijms13033650. Epub 2012 Mar 19. Int J Mol Sci. 2012. PMID: 22489173 Free PMC article.
-
Identifying DNA-binding proteins by combining support vector machine and PSSM distance transformation.BMC Syst Biol. 2015;9 Suppl 1(Suppl 1):S10. doi: 10.1186/1752-0509-9-S1-S10. Epub 2015 Feb 6. BMC Syst Biol. 2015. PMID: 25708928 Free PMC article.
-
PDNAsite: Identification of DNA-binding Site from Protein Sequence by Incorporating Spatial and Sequence Context.Sci Rep. 2016 Jun 10;6:27653. doi: 10.1038/srep27653. Sci Rep. 2016. PMID: 27282833 Free PMC article.
References
-
- Pickart C.M. Ubiquitin enters the new millennium. Mol. Cell. 2001;8:499–504. - PubMed
-
- Aguilar R.C., Wendland B. Ubiquitin: Not just for proteasomes anymore. Curr. Opin. Cell Biol. 2003;15:184–190. - PubMed
-
- Saghatelian A., Cravatt B.F. Assignment of protein function in the postgenomic era. Nat. Chem. Biol. 2005;1:130–142. - PubMed
-
- Herrmann J., Lerman L.O., Lerman A. Ubiquitin and ubiquitin-like proteins in protein regulation. Circ. Res. 2007;100:1276–1291. - PubMed
-
- Hicke L., Dunn R. Regulation of membrane protein transport by ubiquitin and ubiquiti-binding proteins. Annu. Rev. Cell Dev. Biol. 2003;19:141–172. - PubMed
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources