D3R grand challenge 2015: Evaluation of protein-ligand pose and affinity predictions

Symon Gathiaka¹, Shuai Liu¹, Michael Chiu¹, Huanwang Yang², Jeanne A Stuckey³, You Na Kang³, Jim Delproposto³, Ginger Kubish³, James B Dunbar Jr⁴, Heather A Carlson⁴, Stephen K Burley^{2

5

6}, W Patrick Walters⁷, Rommie E Amaro^{8

9}, Victoria A Feher^{10

11

12}, Michael K Gilson^{13

14

15}

Affiliations

¹ Drug Design Data Resource, Center for Research in Biological Systems, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA, 92093, USA.
² RCSB Protein Data Bank, Center for Integrative Proteomics Research, Institute for Quantitative Biomedicine, Department Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, 174 Frelinghuysen Road, Piscataway, NJ, 08854, USA.
³ Center for Structural Biology, Life Sciences Institute, University of Michigan, 210 Washtenaw Avenue, Ann Arbor, MI, 48109-2216, USA.
⁴ Department of Medicinal Chemistry, University of Michigan, 428 Church St., Ann Arbor, MI, 48109-1065, USA.
⁵ Department of Pharmacy, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA, 92093, USA.
⁶ San Diego Supercomputer Center, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA, 92093, USA.
⁷ Relay Therapeutics, 215 First St, Cambridge, MA, 20142, USA.
⁸ Drug Design Data Resource, Center for Research in Biological Systems, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA, 92093, USA. drugdesigndata@gmail.com.
⁹ Department of Chemistry and Biochemistry, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA, 92093, USA. drugdesigndata@gmail.com.
¹⁰ Drug Design Data Resource, Center for Research in Biological Systems, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA, 92093, USA. victoria.feher@schrodinger.com.
¹¹ Department of Chemistry and Biochemistry, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA, 92093, USA. victoria.feher@schrodinger.com.
¹² Schrodinger, Inc., New York, NY, USA. victoria.feher@schrodinger.com.
¹³ Drug Design Data Resource, Center for Research in Biological Systems, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA, 92093, USA. mgilson@ucsd.edu.
¹⁴ Department of Pharmacy, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA, 92093, USA. mgilson@ucsd.edu.
¹⁵ Department of Chemistry and Biochemistry, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA, 92093, USA. mgilson@ucsd.edu.

PMID: 27696240
PMCID: PMC5562487
DOI: 10.1007/s10822-016-9946-8

D3R grand challenge 2015: Evaluation of protein-ligand pose and affinity predictions

Symon Gathiaka et al. J Comput Aided Mol Des. 2016 Sep.

. 2016 Sep;30(9):651-668.

doi: 10.1007/s10822-016-9946-8. Epub 2016 Sep 30.

Authors

Affiliations

¹ Drug Design Data Resource, Center for Research in Biological Systems, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA, 92093, USA.
² RCSB Protein Data Bank, Center for Integrative Proteomics Research, Institute for Quantitative Biomedicine, Department Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, 174 Frelinghuysen Road, Piscataway, NJ, 08854, USA.
³ Center for Structural Biology, Life Sciences Institute, University of Michigan, 210 Washtenaw Avenue, Ann Arbor, MI, 48109-2216, USA.
⁴ Department of Medicinal Chemistry, University of Michigan, 428 Church St., Ann Arbor, MI, 48109-1065, USA.
⁵ Department of Pharmacy, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA, 92093, USA.
⁶ San Diego Supercomputer Center, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA, 92093, USA.
⁷ Relay Therapeutics, 215 First St, Cambridge, MA, 20142, USA.
⁸ Drug Design Data Resource, Center for Research in Biological Systems, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA, 92093, USA. drugdesigndata@gmail.com.
⁹ Department of Chemistry and Biochemistry, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA, 92093, USA. drugdesigndata@gmail.com.
¹⁰ Drug Design Data Resource, Center for Research in Biological Systems, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA, 92093, USA. victoria.feher@schrodinger.com.
¹¹ Department of Chemistry and Biochemistry, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA, 92093, USA. victoria.feher@schrodinger.com.
¹² Schrodinger, Inc., New York, NY, USA. victoria.feher@schrodinger.com.
¹³ Drug Design Data Resource, Center for Research in Biological Systems, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA, 92093, USA. mgilson@ucsd.edu.
¹⁴ Department of Pharmacy, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA, 92093, USA. mgilson@ucsd.edu.
¹⁵ Department of Chemistry and Biochemistry, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA, 92093, USA. mgilson@ucsd.edu.

PMID: 27696240
PMCID: PMC5562487
DOI: 10.1007/s10822-016-9946-8

Abstract

The Drug Design Data Resource (D3R) ran Grand Challenge 2015 between September 2015 and February 2016. Two targets served as the framework to test community docking and scoring methods: (1) HSP90, donated by AbbVie and the Community Structure Activity Resource (CSAR), and (2) MAP4K4, donated by Genentech. The challenges for both target datasets were conducted in two stages, with the first stage testing pose predictions and the capacity to rank compounds by affinity with minimal structural data; and the second stage testing methods for ranking compounds with knowledge of at least a subset of the ligand-protein poses. An additional sub-challenge provided small groups of chemically similar HSP90 compounds amenable to alchemical calculations of relative binding free energy. Unlike previous blinded Challenges, we did not provide cognate receptors or receptors prepared with hydrogens and likewise did not require a specified crystal structure to be used for pose or affinity prediction in Stage 1. Given the freedom to select from over 200 crystal structures of HSP90 in the PDB, participants employed workflows that tested not only core docking and scoring technologies, but also methods for addressing water-mediated ligand-protein interactions, binding pocket flexibility, and the optimal selection of protein structures for use in docking calculations. Nearly 40 participating groups submitted over 350 prediction sets for Grand Challenge 2015. This overview describes the datasets and the organization of the challenge components, summarizes the results across all submitted predictions, and considers broad conclusions that may be drawn from this collaborative community endeavor.

Keywords: D3R; Docking; Free energy; Ligand; Protein; Scoring.

PubMed Disclaimer

Figures

**Figure 1**
HSP90 RMSD box plots of rank 1 and best of top 5 poses for all submitted-methods. The methods are ordered by the median RMSD. The mean is shown by the circles, the connect line is for the median, the box is for the interquartile range (IQR) with the whiskers indicating 1.5X IQR. Table S3 (SI-Methods) has the names of the Submitted Methods and a summary of the protocols. Each Submitted Method’s box plot contains 5 data points.

**Figure 2**
HSP90 box plot distributions for the rank 1 pose, color coded by chemical series and organized by receptor-ligand PDB selection type, “similarity” or “cross-docking”. The mean is shown by the circles with a plus sign, the median by the filled circle, the box is for the interquartile range (IQR) with the whiskers indicating 1.5X IQR. The outliers are indicated by asterisks above the whiskers. The box plots contain 12, 27, 24, 15, 11, and 28 data points, respectively.

**Figure 3**
Binding site conformations of of HSP90 (a), with ligands 73 (grey) and 179 (cyan) in the experimental co-crystal structures. The open conformation with ligand 73 is shown in yellow while 179’s closed conformation is shown in purple, with the positioning of Thr109 depicted. (b) Chemical structures of the two ligands: 4YKW (HSP90_73) and 4YKU (HSP90_179), respectively.

**Figure 4**
**(a)** Ligands 164 (grey) and 175 (cyan) in the experimental co-crystal structures. **(b)** Their 2D structures; 4YKX (HSP90_164) and 4YKZ (HSP90_175), respectively.

**Figure 5**
HSP90 box plot of RMSD distributions for rank 1 poses of ligands HSP90_175 and HSP90_164, separated according to the conformation of the protein structure used and whether the crucial water was present or absent in the docked structure. (An open conformation and the water-present structure was not tested for ligand HSP90_175.) The means are shown by circles with a plus sign, the medians by the filled circles, the boxes are for the interquartile ranges (IQR), and the whiskers indicat 1.5 × IQR. The respective box plots contain 6, 21 and 13 predictions for ligand 175; and 8, 17, 2, 12 predictions for ligand 164.

**Figure 6**
MAP4K4 RMSD box plots of rank 1 and best of top 5 poses for all submitted-methods. Means are shown by circles, the connecting line is for the medians, the box is for the interquartile range (IQR), and the whiskers indicate 1.5 × IQR. Outliers are indicated by asterisks above the bars. Table S4 (SI-Methods) has the names of the Submitted Methods and a summary of the protocols. Each Submitted Method’s box plot contains 30 data points.

**Figure 7**
HSP90 Kendall Tau correlation coefficient scores between the predicted scores and experimental binding affinities. Green bars are for ligand-based scoring methods, and unfilled bars are for null models. The method names corresponding to the Method IDs are in Tables S5 and S6 (SI-Methods). The error bars are 1σ confidence intervals based on 10,000 bootstrap samples.

**Figure 8**
MAP4K4 Kendall Tau correlation coefficient scores between the predicted scores and experimental binding affinities. The green bars are for ligand-based scoring methods, and unfilled bars are for null models. The names corresponding to the Submitted Method’s number are in Tables S7 and S8 (SI - Methods). The error bars are 1σ confidence intervals based on 10,000 bootstrap samples. They are fairly large for the MAP4K4 dataset due to a relatively big experimental uncertainty.

**Figure 9**
HSP90 Kendall Tau correlation coefficient scores between the predicted scores and experimental binding affinities, separated by the three chemotypes. The names corresponding to the Submitted Methods number are in Table S5. The error bars are 1σ confidence intervals based on 10,000 bootstrap samples.

**Figure 10**
RMSE_c (top row) and Kendall’s tau (bottom row) for the three free energy prediction sets. Methods using explicit solvent alchemical free energy simulations (5, 6 and 11) are shown in red. The X-axis labels are the Method IDs from Table S9, and are in order of increasing average RMSE_c across all three sets. Error bars indicate 1σ ranges based on 10,000 bootstrap samples.

See this image and copyright information in PMC

References

1. Smith RD, Dunbar JB, Jr, Ung PM, et al. CSAR benchmark exercise of 2010: combined evaluation across all submitted scoring functions. J Chem Inf Model. 2011;51:2115–2131. doi: 10.1021/ci200269q. - DOI - PMC - PubMed
1. Damm-Ganamet KL, Smith RD, Dunbar JB, Jr, et al. CSAR benchmark exercise 2011–2012: evaluation of results from docking and relative ranking of blinded congeneric series. J Chem Inf Model. 2013;53:1853–1870. doi: 10.1021/ci400025f. - DOI - PMC - PubMed
1. Smith RD, Damm-Ganamet KL, Dunbar JB, Jr, et al. CSAR Benchmark Exercise 2013: Evaluation of Results from a Combined Computational Protein Design, Docking, and Scoring/Ranking Challenge. J Chem Inf Model. 2016;56:1022–1031. doi: 10.1021/acs.jcim.5b00387. - DOI - PMC - PubMed
1. Carlson HA, Smith RD, Damm-Ganamet KL, et al. CSAR 2014: A Benchmark Exercise Using Unpublished Data from Pharma. J Chem Inf Model. 2016;56:1063–1077. doi: 10.1021/acs.jcim.5b00523. - DOI - PMC - PubMed
1. Solit DB, Rosen N. Hsp90: a novel target for cancer therapy. Curr Top Med Chem. 2006;6:1205–1214. - PubMed

Publication types

Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

D3R grand challenge 2015: Evaluation of protein-ligand pose and affinity predictions

Affiliations

D3R grand challenge 2015: Evaluation of protein-ligand pose and affinity predictions

Authors

Affiliations

Abstract

Figures

References

Publication types

MeSH terms

Substances

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources

Miscellaneous