Assessing the performance of computational predictors for estimating protein stability changes upon missense mutations
- PMID: 34058752
- DOI: 10.1093/bib/bbab184
Assessing the performance of computational predictors for estimating protein stability changes upon missense mutations
Abstract
Understanding how a mutation might affect protein stability is of significant importance to protein engineering and for understanding protein evolution genetic diseases. While a number of computational tools have been developed to predict the effect of missense mutations on protein stability protein stability upon mutations, they are known to exhibit large biases imparted in part by the data used to train and evaluate them. Here, we provide a comprehensive overview of predictive tools, which has provided an evolving insight into the importance and relevance of features that can discern the effects of mutations on protein stability. A diverse selection of these freely available tools was benchmarked using a large mutation-level blind dataset of 1342 experimentally characterised mutations across 130 proteins from ThermoMutDB, a second test dataset encompassing 630 experimentally characterised mutations across 39 proteins from iStable2.0 and a third blind test dataset consisting of 268 mutations in 27 proteins from the newly published ProThermDB. The performance of the methods was further evaluated with respect to the site of mutation, type of mutant residue and by ranging the pH and temperature. Additionally, the classification performance was also evaluated by classifying the mutations as stabilizing (∆∆G ≥ 0) or destabilizing (∆∆G < 0). The results reveal that the performance of the predictors is affected by the site of mutation and the type of mutant residue. Further, the results show very low performance for pH values 6-8 and temperature higher than 65 for all predictors except iStable2.0 on the S630 dataset. To illustrate how stability and structure change upon single point mutation, we considered four stabilizing, two destabilizing and two stabilizing mutations from two proteins, namely the toxin protein and bovine liver cytochrome. Overall, the results on S268, S630 and S1342 datasets show that the performance of the integrated predictors is better than the mechanistic or individual machine learning predictors. We expect that this paper will provide useful guidance for the design and development of next-generation bioinformatic tools for predicting protein stability changes upon mutations.
Keywords: bioinformatics; deep learning; feature engineering; machine learning; predictors; protein stability change.
© The Author(s) 2021. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Similar articles
-
Assessing computational tools for predicting protein stability changes upon missense mutations using a new dataset.Protein Sci. 2024 Jan;33(1):e4861. doi: 10.1002/pro.4861. Protein Sci. 2024. PMID: 38084013 Free PMC article.
-
PremPS: Predicting the impact of missense mutations on protein stability.PLoS Comput Biol. 2020 Dec 30;16(12):e1008543. doi: 10.1371/journal.pcbi.1008543. eCollection 2020 Dec. PLoS Comput Biol. 2020. PMID: 33378330 Free PMC article.
-
Computational tools help improve protein stability but with a solubility tradeoff.J Biol Chem. 2017 Sep 1;292(35):14349-14361. doi: 10.1074/jbc.M117.784165. Epub 2017 Jul 14. J Biol Chem. 2017. PMID: 28710274 Free PMC article.
-
Reviewing Challenges of Predicting Protein Melting Temperature Change Upon Mutation Through the Full Analysis of a Highly Detailed Dataset with High-Resolution Structures.Mol Biotechnol. 2021 Oct;63(10):863-884. doi: 10.1007/s12033-021-00349-0. Epub 2021 Jun 8. Mol Biotechnol. 2021. PMID: 34101125 Free PMC article. Review.
-
Machine learning algorithms for predicting protein folding rates and stability of mutant proteins: comparison with statistical methods.Curr Protein Pept Sci. 2011 Sep;12(6):490-502. doi: 10.2174/138920311796957630. Curr Protein Pept Sci. 2011. PMID: 21787301 Review.
Cited by
-
Predicting the mutation effects of protein-ligand interactions via end-point binding free energy calculations: strategies and analyses.J Cheminform. 2022 Aug 20;14(1):56. doi: 10.1186/s13321-022-00639-y. J Cheminform. 2022. PMID: 35987841 Free PMC article.
-
Exploring the effects of missense mutations on protein thermodynamics through structure-based approaches: findings from the CAGI6 challenges.Hum Genet. 2025 Mar;144(2-3):327-335. doi: 10.1007/s00439-023-02623-4. Epub 2024 Jan 16. Hum Genet. 2025. PMID: 38227011 Free PMC article.
-
Guidelines for releasing a variant effect predictor.Genome Biol. 2025 Apr 15;26(1):97. doi: 10.1186/s13059-025-03572-z. Genome Biol. 2025. PMID: 40234898 Free PMC article.
-
Structural heterogeneity and precision of implications drawn from cryo-electron microscopy structures: SARS-CoV-2 spike-protein mutations as a test case.Eur Biophys J. 2022 Dec;51(7-8):555-568. doi: 10.1007/s00249-022-01619-8. Epub 2022 Sep 27. Eur Biophys J. 2022. PMID: 36167828 Free PMC article.
-
Data-driven strategies for the computational design of enzyme thermal stability: trends, perspectives, and prospects.Acta Biochim Biophys Sin (Shanghai). 2023 Mar 25;55(3):343-355. doi: 10.3724/abbs.2023033. Acta Biochim Biophys Sin (Shanghai). 2023. PMID: 37143326 Free PMC article. Review.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources