Prediction-based fingerprints of protein-protein interactions
- PMID: 17152079
- DOI: 10.1002/prot.21248
Prediction-based fingerprints of protein-protein interactions
Abstract
The recognition of protein interaction sites is an important intermediate step toward identification of functionally relevant residues and understanding protein function, facilitating experimental efforts in that regard. Toward that goal, the authors propose a novel representation for the recognition of protein-protein interaction sites that integrates enhanced relative solvent accessibility (RSA) predictions with high resolution structural data. An observation that RSA predictions are biased toward the level of surface exposure consistent with protein complexes led the authors to investigate the difference between the predicted and actual (i.e., observed in an unbound structure) RSA of an amino acid residue as a fingerprint of interaction sites. The authors demonstrate that RSA prediction-based fingerprints of protein interactions significantly improve the discrimination between interacting and noninteracting sites, compared with evolutionary conservation, physicochemical characteristics, structure-derived and other features considered before. On the basis of these observations, the authors developed a new method for the prediction of protein-protein interaction sites, using machine learning approaches to combine the most informative features into the final predictor. For training and validation, the authors used several large sets of protein complexes and derived from them nonredundant representative chains, with interaction sites mapped from multiple complexes. Alternative machine learning techniques are used, including Support Vector Machines and Neural Networks, so as to evaluate the relative effects of the choice of a representation and a specific learning algorithm. The effects of induced fit and uncertainty of the negative (noninteracting) class assignment are also evaluated. Several representative methods from the literature are reimplemented to enable direct comparison of the results. Using rigorous validation protocols, the authors estimated that the new method yields the overall classification accuracy of about 74% and Matthews correlation coefficients of 0.42, as opposed to up to 70% classification accuracy and up to 0.3 Matthews correlation coefficient for methods that do not utilize RSA prediction-based fingerprints. The new method is available at http://sppider.cchmc.org.
2006 Wiley-Liss, Inc.
Similar articles
-
Combining prediction of secondary structure and solvent accessibility in proteins.Proteins. 2005 May 15;59(3):467-75. doi: 10.1002/prot.20441. Proteins. 2005. PMID: 15768403
-
Accurate prediction of solvent accessibility using neural networks-based regression.Proteins. 2004 Sep 1;56(4):753-67. doi: 10.1002/prot.20176. Proteins. 2004. PMID: 15281128
-
Predicting small ligand binding sites in proteins using backbone structure.Bioinformatics. 2008 Dec 15;24(24):2865-71. doi: 10.1093/bioinformatics/btn543. Epub 2008 Oct 21. Bioinformatics. 2008. PMID: 18940825 Free PMC article.
-
Computational approaches to the identification of heparin-binding sites on the surfaces of proteins.Biochem Soc Trans. 2006 Jun;34(Pt 3):431-4. doi: 10.1042/BST0340431. Biochem Soc Trans. 2006. PMID: 16709179 Review.
-
Fingerprinting Interactions between Proteins and Ligands for Facilitating Machine Learning in Drug Discovery.Biomolecules. 2024 Jan 5;14(1):72. doi: 10.3390/biom14010072. Biomolecules. 2024. PMID: 38254672 Free PMC article. Review.
Cited by
-
Protein-protein binding site identification by enumerating the configurations.BMC Bioinformatics. 2012 Jul 6;13:158. doi: 10.1186/1471-2105-13-158. BMC Bioinformatics. 2012. PMID: 22768846 Free PMC article.
-
Defining the Escherichia coli SecA dimer interface residues through in vivo site-specific photo-cross-linking.J Bacteriol. 2013 Jun;195(12):2817-25. doi: 10.1128/JB.02269-12. Epub 2013 Apr 12. J Bacteriol. 2013. PMID: 23585536 Free PMC article.
-
Algorithmic approaches to protein-protein interaction site prediction.Algorithms Mol Biol. 2015 Feb 15;10:7. doi: 10.1186/s13015-015-0033-9. eCollection 2015. Algorithms Mol Biol. 2015. PMID: 25713596 Free PMC article.
-
Specific inter-domain interactions stabilize a compact HIV-1 Gag conformation.PLoS One. 2019 Aug 22;14(8):e0221256. doi: 10.1371/journal.pone.0221256. eCollection 2019. PLoS One. 2019. PMID: 31437199 Free PMC article.
-
AKT mutant allele-specific activation dictates pharmacologic sensitivities.Nat Commun. 2022 Apr 19;13(1):2111. doi: 10.1038/s41467-022-29638-1. Nat Commun. 2022. PMID: 35440569 Free PMC article. Clinical Trial.
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources