. 2017 Mar;85(3):513-527.

doi: 10.1002/prot.25165. Epub 2016 Oct 14.

Human and server docking prediction for CAPRI round 30-35 using LZerD with combined scoring functions

Lenna X Peterson¹, Hyungrae Kim¹, Juan Esquivel-Rodriguez², Amitava Roy^{1

3

4}, Xusi Han¹, Woong-Hee Shin¹, Jian Zhang¹, Genki Terashi^{1

5}, Matt Lee⁶, Daisuke Kihara^{1

2}

Affiliations

¹ Department of Biological Sciences, Purdue University, West Lafayette, Indiana.
² Department of Computer Science, Purdue University, West Lafayette, Indiana.
³ Department of Medicinal Chemistry and Molecular Pharmacology, Purdue University, West Lafayette, Indiana.
⁴ Bioinformatics and Computational Biosciences Branch, Rocky Mountain Laboratories, NIAID, National Institutes of Health, Hamilton, Montana, 59840.
⁵ School of Pharmacy, Kitasato University, Minato-Ku, Tokyo, 108-8641, Japan.
⁶ Lilly Biotechnology Center San Diego, 10300 Campus Point Drive, San Diego, California.

PMID: 27654025
PMCID: PMC5313330
DOI: 10.1002/prot.25165

Human and server docking prediction for CAPRI round 30-35 using LZerD with combined scoring functions

Lenna X Peterson et al. Proteins. 2017 Mar.

. 2017 Mar;85(3):513-527.

doi: 10.1002/prot.25165. Epub 2016 Oct 14.

Authors

Lenna X Peterson¹, Hyungrae Kim¹, Juan Esquivel-Rodriguez², Amitava Roy^{1

3

4}, Xusi Han¹, Woong-Hee Shin¹, Jian Zhang¹, Genki Terashi^{1

5}, Matt Lee⁶, Daisuke Kihara^{1

2}

Affiliations

¹ Department of Biological Sciences, Purdue University, West Lafayette, Indiana.
² Department of Computer Science, Purdue University, West Lafayette, Indiana.
³ Department of Medicinal Chemistry and Molecular Pharmacology, Purdue University, West Lafayette, Indiana.
⁴ Bioinformatics and Computational Biosciences Branch, Rocky Mountain Laboratories, NIAID, National Institutes of Health, Hamilton, Montana, 59840.
⁵ School of Pharmacy, Kitasato University, Minato-Ku, Tokyo, 108-8641, Japan.
⁶ Lilly Biotechnology Center San Diego, 10300 Campus Point Drive, San Diego, California.

PMID: 27654025
PMCID: PMC5313330
DOI: 10.1002/prot.25165

Abstract

We report the performance of protein-protein docking predictions by our group for recent rounds of the Critical Assessment of Prediction of Interactions (CAPRI), a community-wide assessment of state-of-the-art docking methods. Our prediction procedure uses a protein-protein docking program named LZerD developed in our group. LZerD represents a protein surface with 3D Zernike descriptors (3DZD), which are based on a mathematical series expansion of a 3D function. The appropriate soft representation of protein surface with 3DZD makes the method more tolerant to conformational change of proteins upon docking, which adds an advantage for unbound docking. Docking was guided by interface residue prediction performed with BindML and cons-PPISP as well as literature information when available. The generated docking models were ranked by a combination of scoring functions, including PRESCO, which evaluates the native-likeness of residues' spatial environments in structure models. First, we discuss the overall performance of our group in the CAPRI prediction rounds and investigate the reasons for unsuccessful cases. Then, we examine the performance of several knowledge-based scoring functions and their combinations for ranking docking models. It was found that the quality of a pool of docking models generated by LZerD, that is whether or not the pool includes near-native models, can be predicted by the correlation of multiple scores. Although the current analysis used docking models generated by LZerD, findings on scoring functions are expected to be universally applicable to other docking methods. Proteins 2017; 85:513-527. © 2016 Wiley Periodicals, Inc.

Keywords: CAPRI; computational methods; prediction accuracy; protein docking prediction; protein structure prediction; protein-protein docking; structure modeling.

PubMed Disclaimer

Figures

**Figure 1**
Protein docking prediction pipeline used in our group. The tertiary structure of single proteins of a CAPRI target are modeled following the protocol described in Methods. For the human prediction of CAPRI Round 30, we also used structure models selected from CASP server models. Three parallel runs of LZerD protein docking are performed: two runs with(+)/without(−) binding residue constraints taken from prediction by BindML (the gray and white arrows in the diagram) and cons- PPISP using single chain models generated by our lab protocol, and the third LZerD run (only for human prediction, hashed arrows labeled as CASP) using single chain models selected from CASP server predictions. For each of the three tracks, decoys are ranked by ITScorePro, and top 1000 decoys are selected. For LZerD server prediction, top 5 models each from decoys with(+)/without(− ) binding residue constraints using our single chain models were submitted. 1000 models from each track are further reduced to top 10 models by GOAP, which are ranked by PRESCO and DFIRE, independently. Finally, out of the 30 models in total, models that are consistently ranked among the top by two or more scoring functions are chosen in principle for final submission. Usually such models do not fill the ten slots for submission, and rests are filled with models ranked high by either of the scores and visual inspection. Biological information from literature is also applied for final selection if available.

**Figure 2**
Single and pairwise score distributions for decoys of target T93. This decoy set is a successful example of docking, which contains ten acceptable decoys out of 9999 total. The scatter plots show pairwise score distributions. Acceptable models are shown in squares. Along the diagonal, histograms of the Z-scores of individual scores are shown.

**Figure 3**
Single and pairwise score distributions for decoys of target T72. This is an unsuccessful decoy set example, which contains no acceptable decoys out of 9999 total.

**Figure 4**
Score distribution of docking decoys of T91 computed using two single chain models of different quality. T91 is a homo-dimer, but the two subunits have slightly different conformations in the native structure, which resulted in different RMSD values for each model compared to the two subunits. The first model has RMSDs of 5.4/5.5 Å to the native structures of the two chains. Another model, a CASP server model (Zhang-Server_TS1), has RMSDs of 4.1/5.1 Å (Tab. S1). A, Distributions of Z-score of GOAP and DFIRE. Left, docking decoys from the Zhang-Server_TS1 single chain model. There are 37 acceptable decoys and one medium decoy out of 4793 total. Right, decoys from the former single chain model computed in our group. No interface prediction was applied. There are nine acceptable decoys out of 6168 total. Acceptable and medium quality models are shown in gold squares and green triangles, respectively. The left bottom corner (labeled A) are subsets of decoys that have Z-score below n = 2 for the two scores (Equation 1). The Spearman correlation coefficient for the decoys in A (Equation 2) is 0.56 (p = 0.0002) for the left distribution, and 0.17 (p = 0.5) for the right. B, Two single chain models superimposed to its native structure, T91, chain C. Green, native; blue, Zhang-Server_TS1; orange, our model. C, the best model from our submission (orange) superimposed to the native complex strucure (green). *f_nat*: 0.33, L-RMSD: 9.0 Å; I-RMSD 4.2 Å.

**Figure 5**
Prediction of decoy pool quality based on score pair distribution shape. “Funnel score” is the sum of n over all score pairs where the SCC for the nσ-outliers is significant (p < 0.05) and greater than 0.4 (Equation 3). The dotted line indicates a minimum Funnel score of 3, which classifies 9 true positives, 4 true negatives, 2 false positives (T77 and T88), and 4 false negatives (T75, T86, T89, and T92).

See this image and copyright information in PMC

Cited by

Modeling protein-nucleic acid complexes with extremely large conformational changes using Flex-LZerD.
Christoffer C, Kihara D. Christoffer C, et al. Proteomics. 2023 Sep;23(17):e2200322. doi: 10.1002/pmic.202200322. Epub 2022 Dec 25. Proteomics. 2023. PMID: 36529945 Free PMC article.
Assembly of Protein Complexes In and On the Membrane with Predicted Spatial Arrangement Constraints.
Christoffer C, Harini K, Archit G, Kihara D. Christoffer C, et al. bioRxiv [Preprint]. 2023 Nov 9:2023.10.20.563303. doi: 10.1101/2023.10.20.563303. bioRxiv. 2023. Update in: J Mol Biol. 2024 Mar 15;436(6):168486. doi: 10.1016/j.jmb.2024.168486. PMID: 37961264 Free PMC article. Updated. Preprint.
Integrative Protein Assembly With LZerD and Deep Learning in CAPRI 47-55.
Christoffer C, Kagaya Y, Verburgt J, Terashi G, Shin WH, Jain A, Sarkar D, Aderinwale T, Maddhuri Venkata Subramaniya SR, Wang X, Zhang Z, Zhang Y, Kihara D. Christoffer C, et al. Proteins. 2025 Mar 17:10.1002/prot.26818. doi: 10.1002/prot.26818. Online ahead of print. Proteins. 2025. PMID: 40095385
Performance and Its Limits in Rigid Body Protein-Protein Docking.
Desta IT, Porter KA, Xia B, Kozakov D, Vajda S. Desta IT, et al. Structure. 2020 Sep 1;28(9):1071-1081.e3. doi: 10.1016/j.str.2020.06.006. Epub 2020 Jul 9. Structure. 2020. PMID: 32649857 Free PMC article.
What method to use for protein-protein docking?
Porter KA, Desta I, Kozakov D, Vajda S. Porter KA, et al. Curr Opin Struct Biol. 2019 Apr;55:1-7. doi: 10.1016/j.sbi.2018.12.010. Epub 2019 Feb 1. Curr Opin Struct Biol. 2019. PMID: 30711743 Free PMC article. Review.

See all "Cited by" articles

References

1. Kihara D, Skolnick J. Microbial genomes have over 72threading algorithm PROSPECTOR_Q. Proteins: Struct, Funct, Bioinf. 2004;55:464–473. - PubMed
1. Chen H, Kihara D. Effect of using suboptimal alignments in template-based protein structure prediction. Proteins: Struct, Funct, Bioinf. 2011;79:315–34. - PMC - PubMed
1. Pieper U, Webb BM, Dong GQ, Schneidman-Duhovny D, Fan H, Kim SJ, Khuri N, Spill YG, Weinkam P, Hammel M, Tainer JA, Nilges M, Sali A. ModBase, a database of annotated comparative protein structure models and associated resources. Nucleic Acids Res. 2014;42:D336–D346. - PMC - PubMed
1. King NP, Sheffler W, Sawaya MR, Vollmar BS, Sumida JP, André I, Gonen T, Yeates TO, Baker D. Computational design of self-assembling protein nanomaterials with atomic level accuracy. Science. 2012;336:1171–1174. - PMC - PubMed
1. Gonen S, DiMaio F, Gonen T, Baker D. Design of ordered two-dimensional arrays mediated by noncovalent protein-protein interfaces. Science. 2015;348:1365–8. - PubMed

Publication types

Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Substances

Actions
Actions

Grants and funding

R01 GM097528/GM/NIGMS NIH HHS/United States

LinkOut - more resources

Full Text Sources
Other Literature Sources
- scite Smart Citations

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Human and server docking prediction for CAPRI round 30-35 using LZerD with combined scoring functions

Affiliations

Human and server docking prediction for CAPRI round 30-35 using LZerD with combined scoring functions

Authors

Affiliations

Abstract

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Substances

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources

Abstract

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Substances

Related information

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources