Predicting lysine-malonylation sites of proteins using sequence and predicted structural features
- PMID: 29761520
- DOI: 10.1002/jcc.25353
Predicting lysine-malonylation sites of proteins using sequence and predicted structural features
Abstract
Malonylation is a recently discovered post-translational modification (PTM) in which a malonyl group attaches to a lysine (K) amino acid residue of a protein. In this work, a novel machine learning model, SPRINT-Mal, is developed to predict malonylation sites by employing sequence and predicted structural features. Evolutionary information and physicochemical properties are found to be the two most discriminative features whereas a structural feature called half-sphere exposure provides additional improvement to the prediction performance. SPRINT-Mal trained on mouse data yields robust performance for 10-fold cross validation and independent test set with Area Under the Curve (AUC) values of 0.74 and 0.76 and Matthews' Correlation Coefficient (MCC) of 0.213 and 0.20, respectively. Moreover, SPRINT-Mal achieved comparable performance when testing on H. sapiens proteins without species-specific training but not in bacterium S. erythraea. This suggests similar underlying physicochemical mechanisms between mouse and human but not between mouse and bacterium. SPRINT-Mal is freely available as an online server at: http://sparks-lab.org/server/SPRINT-Mal/. © 2018 Wiley Periodicals, Inc.
Keywords: lysine-malonylation sites prediction; post translational modification; support vector machines.
© 2018 Wiley Periodicals, Inc.
Similar articles
-
Mal-Lys: prediction of lysine malonylation sites in proteins integrated sequence-based features with mRMR feature selection.Sci Rep. 2016 Dec 2;6:38318. doi: 10.1038/srep38318. Sci Rep. 2016. PMID: 27910954 Free PMC article.
-
Computational analysis and prediction of lysine malonylation sites by exploiting informative features in an integrative machine-learning framework.Brief Bioinform. 2019 Nov 27;20(6):2185-2199. doi: 10.1093/bib/bby079. Brief Bioinform. 2019. PMID: 30351377 Free PMC article.
-
Mal-Prec: computational prediction of protein Malonylation sites via machine learning based feature integration : Malonylation site prediction.BMC Genomics. 2020 Nov 23;21(1):812. doi: 10.1186/s12864-020-07166-w. BMC Genomics. 2020. PMID: 33225896 Free PMC article.
-
Analysis and review of techniques and tools based on machine learning and deep learning for prediction of lysine malonylation sites in protein sequences.Database (Oxford). 2024 Jan 19;2024:baad094. doi: 10.1093/database/baad094. Database (Oxford). 2024. PMID: 38245002 Free PMC article. Review.
-
Lysine Malonylation and Its Links to Metabolism and Diseases.Aging Dis. 2023 Feb 1;14(1):84-98. doi: 10.14336/AD.2022.0711. eCollection 2023 Feb 1. Aging Dis. 2023. PMID: 36818560 Free PMC article. Review.
Cited by
-
A hybrid feature extraction scheme for efficient malonylation site prediction.Sci Rep. 2022 Apr 6;12(1):5756. doi: 10.1038/s41598-022-08555-9. Sci Rep. 2022. PMID: 35388017 Free PMC article.
-
RF-MaloSite and DL-Malosite: Methods based on random forest and deep learning to identify malonylation sites.Comput Struct Biotechnol J. 2020 Mar 4;18:852-860. doi: 10.1016/j.csbj.2020.02.012. eCollection 2020. Comput Struct Biotechnol J. 2020. PMID: 32322367 Free PMC article.
-
PRISMOID: a comprehensive 3D structure database for post-translational modifications and mutations with functional impact.Brief Bioinform. 2020 May 21;21(3):1069-1079. doi: 10.1093/bib/bbz050. Brief Bioinform. 2020. PMID: 31161204 Free PMC article.
-
Protein Succinylation and Malonylation as Potential Biomarkers in Schizophrenia.J Pers Med. 2022 Aug 30;12(9):1408. doi: 10.3390/jpm12091408. J Pers Med. 2022. PMID: 36143193 Free PMC article.
-
SumSec: Accurate Prediction of Sumoylation Sites Using Predicted Secondary Structure.Molecules. 2018 Dec 10;23(12):3260. doi: 10.3390/molecules23123260. Molecules. 2018. PMID: 30544729 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Research Materials
Miscellaneous