Current computational tools for protein lysine acylation site prediction
- PMID: 39316944
- PMCID: PMC11421846
- DOI: 10.1093/bib/bbae469
Current computational tools for protein lysine acylation site prediction
Abstract
As a main subtype of post-translational modification (PTM), protein lysine acylations (PLAs) play crucial roles in regulating diverse functions of proteins. With recent advancements in proteomics technology, the identification of PTM is becoming a data-rich field. A large amount of experimentally verified data is urgently required to be translated into valuable biological insights. With computational approaches, PLA can be accurately detected across the whole proteome, even for organisms with small-scale datasets. Herein, a comprehensive summary of 166 in silico PLA prediction methods is presented, including a single type of PLA site and multiple types of PLA sites. This recapitulation covers important aspects that are critical for the development of a robust predictor, including data collection and preparation, sample selection, feature representation, classification algorithm design, model evaluation, and method availability. Notably, we discuss the application of protein language models and transfer learning to solve the small-sample learning issue. We also highlight the prediction methods developed for functionally relevant PLA sites and species/substrate/cell-type-specific PLA sites. In conclusion, this systematic review could potentially facilitate the development of novel PLA predictors and offer useful insights to researchers from various disciplines.
Keywords: deep learning; lysine acylation; post-translation modification; protein language model; transfer learning.
Published by Oxford University Press 2024.
Similar articles
-
Large-scale comparative assessment of computational predictors for lysine post-translational modification sites.Brief Bioinform. 2019 Nov 27;20(6):2267-2290. doi: 10.1093/bib/bby089. Brief Bioinform. 2019. PMID: 30285084 Free PMC article. Review.
-
Prediction of Protein Lysine Acylation by Integrating Primary Sequence Information with Multiple Functional Features.J Proteome Res. 2016 Dec 2;15(12):4234-4244. doi: 10.1021/acs.jproteome.6b00240. Epub 2016 Nov 2. J Proteome Res. 2016. PMID: 27774790
-
LMPTMSite: A Platform for PTM Site Prediction in Proteins Leveraging Transformer-Based Protein Language Models.Methods Mol Biol. 2025;2867:261-297. doi: 10.1007/978-1-0716-4196-5_16. Methods Mol Biol. 2025. PMID: 39576587
-
A systematic identification of species-specific protein succinylation sites using joint element features information.Int J Nanomedicine. 2017 Aug 28;12:6303-6315. doi: 10.2147/IJN.S140875. eCollection 2017. Int J Nanomedicine. 2017. PMID: 28894368 Free PMC article.
-
A Review of Machine Learning and Algorithmic Methods for Protein Phosphorylation Site Prediction.Genomics Proteomics Bioinformatics. 2023 Dec;21(6):1266-1285. doi: 10.1016/j.gpb.2023.03.007. Epub 2023 Oct 19. Genomics Proteomics Bioinformatics. 2023. PMID: 37863385 Free PMC article. Review.
Cited by
-
An efficient machine-learning framework for predicting protein post-translational modification sites.Sci Rep. 2025 Aug 25;15(1):31179. doi: 10.1038/s41598-025-13178-x. Sci Rep. 2025. PMID: 40854916 Free PMC article.
References
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Research Materials