SKEMPI: a Structural Kinetic and Energetic database of Mutant Protein Interactions and its use in empirical models
- PMID: 22859501
- DOI: 10.1093/bioinformatics/bts489
SKEMPI: a Structural Kinetic and Energetic database of Mutant Protein Interactions and its use in empirical models
Abstract
Motivation: Empirical models for the prediction of how changes in sequence alter protein-protein binding kinetics and thermodynamics can garner insights into many aspects of molecular biology. However, such models require empirical training data and proper validation before they can be widely applied. Previous databases contained few stabilizing mutations and no discussion of their inherent biases or how this impacts model construction or validation.
Results: We present SKEMPI, a database of 3047 binding free energy changes upon mutation assembled from the scientific literature, for protein-protein heterodimeric complexes with experimentally determined structures. This represents over four times more data than previously collected. Changes in 713 association and dissociation rates and 127 enthalpies and entropies were also recorded. The existence of biases towards specific mutations, residues, interfaces, proteins and protein families is discussed in the context of how the data can be used to construct predictive models. Finally, a cross-validation scheme is presented which is capable of estimating the efficacy of derived models on future data in which these biases are not present.
Availability: The database is available online at http://life.bsc.es/pid/mutation_database/.
Similar articles
-
SKEMPI 2.0: an updated benchmark of changes in protein-protein binding energy, kinetics and thermodynamics upon mutation.Bioinformatics. 2019 Feb 1;35(3):462-469. doi: 10.1093/bioinformatics/bty635. Bioinformatics. 2019. PMID: 30020414 Free PMC article.
-
dbMPIKT: a database of kinetic and thermodynamic mutant protein interactions.BMC Bioinformatics. 2018 Nov 27;19(1):455. doi: 10.1186/s12859-018-2493-7. BMC Bioinformatics. 2018. PMID: 30482172 Free PMC article.
-
PROXiMATE: a database of mutant protein-protein complex thermodynamics and kinetics.Bioinformatics. 2017 Sep 1;33(17):2787-2788. doi: 10.1093/bioinformatics/btx312. Bioinformatics. 2017. PMID: 28498885
-
Volume-based solvation models out-perform area-based models in combined studies of wild-type and mutated protein-protein interfaces.BMC Bioinformatics. 2008 Oct 21;9:448. doi: 10.1186/1471-2105-9-448. BMC Bioinformatics. 2008. PMID: 18939984 Free PMC article.
-
Predicting the Effect of Amino Acid Single-Point Mutations on Protein Stability-Large-Scale Validation of MD-Based Relative Free Energy Calculations.J Mol Biol. 2017 Apr 7;429(7):948-963. doi: 10.1016/j.jmb.2016.12.007. Epub 2016 Dec 10. J Mol Biol. 2017. PMID: 27964946
Cited by
-
Nabe: an energetic database of amino acid mutations in protein-nucleic acid binding interfaces.Database (Oxford). 2021 Aug 14;2021:baab050. doi: 10.1093/database/baab050. Database (Oxford). 2021. PMID: 34389843 Free PMC article.
-
Climbing Up and Down Binding Landscapes through Deep Mutational Scanning of Three Homologous Protein-Protein Complexes.J Am Chem Soc. 2021 Oct 20;143(41):17261-17275. doi: 10.1021/jacs.1c08707. Epub 2021 Oct 5. J Am Chem Soc. 2021. PMID: 34609866 Free PMC article.
-
MutaBind estimates and interprets the effects of sequence variants on protein-protein interactions.Nucleic Acids Res. 2016 Jul 8;44(W1):W494-501. doi: 10.1093/nar/gkw374. Epub 2016 May 5. Nucleic Acids Res. 2016. PMID: 27150810 Free PMC article.
-
Computational Approaches to Prioritize Cancer Driver Missense Mutations.Int J Mol Sci. 2018 Jul 20;19(7):2113. doi: 10.3390/ijms19072113. Int J Mol Sci. 2018. PMID: 30037003 Free PMC article. Review.
-
MVSF-AB: accurate antibody-antigen binding affinity prediction via multi-view sequence feature learning.Bioinformatics. 2025 May 6;41(5):btae579. doi: 10.1093/bioinformatics/btae579. Bioinformatics. 2025. PMID: 39363630 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources