Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2015 Sep;83(9):1563-70.
doi: 10.1002/prot.24736. Epub 2015 Jun 13.

Structural templates for comparative protein docking

Affiliations
Comparative Study

Structural templates for comparative protein docking

Ivan Anishchenko et al. Proteins. 2015 Sep.

Abstract

Structural characterization of protein-protein interactions is important for understanding life processes. Because of the inherent limitations of experimental techniques, such characterization requires computational approaches. Along with the traditional protein-protein docking (free search for a match between two proteins), comparative (template-based) modeling of protein-protein complexes has been gaining popularity. Its development puts an emphasis on full and partial structural similarity between the target protein monomers and the protein-protein complexes previously determined by experimental techniques (templates). The template-based docking relies on the quality and diversity of the template set. We present a carefully curated, nonredundant library of templates containing 4950 full structures of binary complexes and 5936 protein-protein interfaces extracted from the full structures at 12 Å distance cut-off. Redundancy in the libraries was removed by clustering the PDB structures based on structural similarity. The value of the clustering threshold was determined from the analysis of the clusters and the docking performance on a benchmark set. High structural quality of the interfaces in the template and validation sets was achieved by automated procedures and manual curation. The library is included in the Dockground resource for molecular recognition studies at http://dockground.bioinformatics.ku.edu.

Keywords: benchmark sets; protein interactions; protein modeling; protein recognition; structure prediction.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Flowchart of algorithm for generation of full-structure and interface template libraries.
Figure 2
Figure 2
Example of a “bad” complex. Chains D and E from 1fma are shown in magenta and yellow, respectively, with penetrating chain removed by the automated procedure described in the text. Buried Val and Thr residues at the C-terminal of chain F identified by the procedure are shown as sticks (the last two residues at the terminus are Gly).
Figure 3
Figure 3
Properties of similarity graphs. (A) Protein-protein complexes and (B) protein-protein interfaces. The main panels show distributions of connected component size at different thresholds of the clustering TM-score (TMT). The inserts display dependence of clustering coefficient CC on TMT.
Figure 4
Figure 4
Correlation of protein-protein and interface-interface TM-scores.
Figure 5
Figure 5
Number of connected components and clusters as a function of clustering threshold. (A) Protein-protein complexes, and (B) protein-protein interfaces. NCC is the number of connected components, and NC is the number of clusters. Green lines (scaled to the right-hand axes) show the relative increase in the number of connected graph parts after splitting the connected components into tightly connected clusters.
Figure 6
Figure 6
Quality of clusters at different clustering thresholds. (A) Protein-protein complexes, and (B) protein-protein interfaces. Distributions of missing edges per cluster are shown as box-and-whickers plots with horizontal lines for minimal, maximal and median values in the distributions and boxes containing second and third quartiles of data. The circles (scaled to the right hand axis) show how the fraction of clusters, which are not complete sub-graphs of the initial similarity graphs, depends on TMT.
Figure 7
Figure 7
Performance of structure alignment at different clustering thresholds. Full-structure (circles) and interface (triangles) libraries were generated at different threshold values. Success rates were calculated for the entire pool of structures excluding templates similar to the target (see text).

References

    1. Vakser IA. Protein-protein docking: From interaction to interactome. Biophys J. 2014;107:1785–1793. - PMC - PubMed
    1. Vakser IA. Low-resolution structural modeling of protein interactome. Curr Opin Struct Biol. 2013;23:198–205. - PMC - PubMed
    1. Aloy P, Pichaud M, Russell RB. Protein complexes: Structure prediction challenges for the 21st century. Curr Opin Struct Biol. 2005;15:15–22. - PubMed
    1. Szilagyi A, Zhang Y. Template-based structure modeling of protein–protein interactions. Curr Opin Struct Biol. 2014;24:10–23. - PMC - PubMed
    1. Dey F, Zhang QC, Petrey D, Honig B. Toward a “structural BLAST”: Using structural relationships to infer function. Protein Sci. 2013;22:359–366. - PMC - PubMed

Publication types

LinkOut - more resources