Improved prediction of protein-protein binding sites using a support vector machines approach
- PMID: 15613384
- DOI: 10.1093/bioinformatics/bti242
Improved prediction of protein-protein binding sites using a support vector machines approach
Abstract
Motivation: Structural genomics projects are beginning to produce protein structures with unknown function, therefore, accurate, automated predictors of protein function are required if all these structures are to be properly annotated in reasonable time. Identifying the interface between two interacting proteins provides important clues to the function of a protein and can reduce the search space required by docking algorithms to predict the structures of complexes.
Results: We have combined a support vector machine (SVM) approach with surface patch analysis to predict protein-protein binding sites. Using a leave-one-out cross-validation procedure, we were able to successfully predict the location of the binding site on 76% of our dataset made up of proteins with both transient and obligate interfaces. With heterogeneous cross-validation, where we trained the SVM on transient complexes to predict on obligate complexes (and vice versa), we still achieved comparable success rates to the leave-one-out cross-validation suggesting that sufficient properties are shared between transient and obligate interfaces.
Availability: A web application based on the method can be found at http://www.bioinformatics.leeds.ac.uk/ppi_pred. The dataset of 180 proteins used in this study is also available via the same web site.
Contact: westhead@bmb.leeds.ac.uk
Supplementary information: http://www.bioinformatics.leeds.ac.uk/ppi-pred/supp-material.
Similar articles
-
NOXclass: prediction of protein-protein interaction types.BMC Bioinformatics. 2006 Jan 19;7:27. doi: 10.1186/1471-2105-7-27. BMC Bioinformatics. 2006. PMID: 16423290 Free PMC article.
-
Predicting disulfide connectivity from protein sequence using multiple sequence feature vectors and secondary structure.Bioinformatics. 2007 Dec 1;23(23):3147-54. doi: 10.1093/bioinformatics/btm505. Epub 2007 Oct 17. Bioinformatics. 2007. PMID: 17942444
-
Q-SiteFinder: an energy-based method for the prediction of protein-ligand binding sites.Bioinformatics. 2005 May 1;21(9):1908-16. doi: 10.1093/bioinformatics/bti315. Epub 2005 Feb 8. Bioinformatics. 2005. PMID: 15701681
-
Predicting 3D structures of protein-protein complexes.Curr Pharm Biotechnol. 2008 Apr;9(2):57-66. doi: 10.2174/138920108783955209. Curr Pharm Biotechnol. 2008. PMID: 18393862 Review.
-
Protein complexes: structure prediction challenges for the 21st century.Curr Opin Struct Biol. 2005 Feb;15(1):15-22. doi: 10.1016/j.sbi.2005.01.012. Curr Opin Struct Biol. 2005. PMID: 15718128 Review.
Cited by
-
RF_phage virion: Classification of phage virion proteins with a random forest model.Front Genet. 2023 Feb 8;13:1103783. doi: 10.3389/fgene.2022.1103783. eCollection 2022. Front Genet. 2023. PMID: 36846294 Free PMC article.
-
Deep Convolutional Neural Networks for the Prediction of Molecular Properties: Challenges and Opportunities Connected to the Data.J Integr Bioinform. 2018 Dec 5;16(1):20180065. doi: 10.1515/jib-2018-0065. J Integr Bioinform. 2018. PMID: 30517077 Free PMC article.
-
Prediction of protein-protein interaction types using association rule based classification.BMC Bioinformatics. 2009 Jan 28;10:36. doi: 10.1186/1471-2105-10-36. BMC Bioinformatics. 2009. PMID: 19173748 Free PMC article.
-
2D Zernike polynomial expansion: Finding the protein-protein binding regions.Comput Struct Biotechnol J. 2020 Dec 4;19:29-36. doi: 10.1016/j.csbj.2020.11.051. eCollection 2021. Comput Struct Biotechnol J. 2020. PMID: 33363707 Free PMC article.
-
Statistical analysis of interface similarity in crystals of homologous proteins.J Mol Biol. 2008 Aug 29;381(2):487-507. doi: 10.1016/j.jmb.2008.06.002. Epub 2008 Jun 7. J Mol Biol. 2008. PMID: 18599072 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources