Progress and challenges in predicting protein-protein interaction sites
- PMID: 19346321
- DOI: 10.1093/bib/bbp021
Abstract
The identification of protein-protein interaction sites is an essential intermediate step for mutant design and the prediction of protein networks. In recent years, a significant number of methods have been developed to predict these interface residues, and here we review the current status of the field. Progress in this area requires a clear view of the methodology applied, the data sets used for training and testing the systems, and the evaluation procedures. We have analysed the impact of a representative set of features and algorithms and highlighted the problems inherent in generating reliable protein data sets and in the posterior analysis of the results. Although methods for predicting interacting sites have clearly improved, several major bottlenecks remain. Proteins in complexes are still under-represented in the structural databases, and in particular many proteins involved in transient complexes have yet to be crystallized. We provide suggestions for effective feature selection, and make it clear that community standards for testing, training and performance measures are necessary for progress in the field.