Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2015 Nov 30;589(23):3516-26.
doi: 10.1016/j.febslet.2015.10.003. Epub 2015 Oct 13.

Computational prediction of protein interfaces: A review of data driven methods

Affiliations
Review

Computational prediction of protein interfaces: A review of data driven methods

Li C Xue et al. FEBS Lett. .

Abstract

Reliably pinpointing which specific amino acid residues form the interface(s) between a protein and its binding partner(s) is critical for understanding the structural and physicochemical determinants of protein recognition and binding affinity, and has wide applications in modeling and validating protein interactions predicted by high-throughput methods, in engineering proteins, and in prioritizing drug targets. Here, we review the basic concepts, principles and recent advances in computational approaches to the analysis and prediction of protein-protein interfaces. We point out caveats for objectively evaluating interface predictors, and discuss various applications of data-driven interface predictors for improving energy model-driven protein-protein docking. Finally, we stress the importance of exploiting binding partner information in reliably predicting interfaces and highlight recent advances in this emerging direction.

Keywords: Cross validation on instance level; Cross validation on protein level; Docking; Evaluation caveats; Machine learning; Partner-specific interface prediction; Protein–protein interaction.

PubMed Disclaimer

Figures

Figure 1
Figure 1. A surface patch and a sequence window
A) A surface patch defined by a target residue (blue) and its spatial neighboring residues (magenta) that fall within a virtual sphere of diameter, d, centered on the target residue. B) A sequence window centered on a target residue (purple).
Figure 2
Figure 2. Partner-specific interface predictors outperform non-partner-specific predictors
(A) A comparison of the top 20 predicted interfacial residues for a complex of Acetylcholinesterase (blue ribbons) and Toxin F-VII Fasciculin-2 (red ribbons) (PDB ID: 1MAH) by the partner-specific method, PPIPP (Ahmad & Mizuguchi, 2011), and the corresponding non-partner-specific version. By including partner information, PPIPP is able to predict interfacial residues (green) clustering around the interaction location specific to the binding partner whereas those predicted by the non-partner-specific method (red) are scattered over the surface of query protein. Figure credit: (Ahmad & Mizuguchi, 2011)(B) Prediction performance comparisons over a set of 123 non-redundant protein-protein complexes in Docking Benchmark 3.0 (Hwang, Pierce, Mintseris, Janin, & Weng, 2008). We compared two partner-specific predictors, PAIRpred (Afsar Minhas et al., 2013) and PPiPP (a sequence-based predictor) (Ahmad & Mizuguchi, 2011), with two non-partner-specific machine learning predictors: PSIVER, a sequence-based predictor (Murakami & Mizuguchi, 2010) and SPPIDER, a structure-based predictor (Porollo & Meller, 2007). With partner information, PAIRpred and PPiPP outperform the two predictors that do not consider partner information when making predictions, improving Area Under Curves (AUCs) from 0.63 (PSIVER) and 0.58 (SPPIDER) to 0.73 (PPiPP) and 0.89 (PAIRpred). AUC values are extracted from (Afsar Minhas et al., 2013; Ahmad & Mizuguchi, 2011).
Figure 3
Figure 3. Protein-protein docking and its two major challenges
Figure 4
Figure 4. Machine learning toward improved 3D protein interaction prediction

References

    1. Afsar Minhas FUA, Geiss BJ, Ben-Hur A. PAIRpred: Partner-specific prediction of interacting residues from sequence and structure. Proteins: Structure, Function, and Bioinformatics. 2013;82(7):1142–1155. http://doi.org/10.1002/prot.24479. - DOI - PMC - PubMed
    1. Ahmad S, Mizuguchi K. Partner-Aware Prediction of Interacting Residues in Protein-Protein Complexes from Sequence Data. PLoS One. 2011;6(12):e29104. http://doi.org/10.1371/journal.pone.0029104. - DOI - PMC - PubMed
    1. Baldi P, Brunak S, Chauvin Y, Andersen CA, Nielsen H. Assessing the accuracy of prediction algorithms for classification: an overview. Bioinformatics. 2000;16(5):412–424. - PubMed
    1. Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissig H, et al. The Protein Data Bank. Nucleic Acids Research. 2000;28(1):235–242. http://doi.org/10.1093/nar/28.1.235. - DOI - PMC - PubMed
    1. Bonvin Alexandre. Flexible protein-protein docking. Current Opinion in Structural Biology. 2006;16(2):194–200. http://doi.org/10.1016/j.sbi.2006.02.002. - DOI - PubMed

Publication types