. 2009 Jun 15;25(12):i296-304.

doi: 10.1093/bioinformatics/btp204.

Prediction of sub-cavity binding preferences using an adaptive physicochemical structure representation

Izhar Wallach¹, Ryan H Lilien

Affiliations

Affiliation

¹ Department of Computer Science, Donnelly Centre for Cellular and Biomolecular Research and Banting and Best, University of Toronto, Toronto, Ontario, Canada. izharw@cs.toronto.edu

PMID: 19478002
PMCID: PMC2687958
DOI: 10.1093/bioinformatics/btp204

Prediction of sub-cavity binding preferences using an adaptive physicochemical structure representation

Izhar Wallach et al. Bioinformatics. 2009.

. 2009 Jun 15;25(12):i296-304.

doi: 10.1093/bioinformatics/btp204.

Authors

Izhar Wallach¹, Ryan H Lilien

Affiliation

¹ Department of Computer Science, Donnelly Centre for Cellular and Biomolecular Research and Banting and Best, University of Toronto, Toronto, Ontario, Canada. izharw@cs.toronto.edu

PMID: 19478002
PMCID: PMC2687958
DOI: 10.1093/bioinformatics/btp204

Abstract

Motivation: The ability to predict binding profiles for an arbitrary protein can significantly improve the areas of drug discovery, lead optimization and protein function prediction. At present, there are no successful algorithms capable of predicting binding profiles for novel proteins. Existing methods typically rely on manually curated templates or entire active site comparison. Consequently, they perform best when analyzing proteins sharing significant structural similarity with known proteins (i.e. proteins resulting from divergent evolution). These methods fall short when used to characterize the binding profile of a novel active site or one for which a template is not available. In contrast to previous approaches, our method characterizes the binding preferences of sub-cavities within the active site by exploiting a large set of known protein-ligand complexes. The uniqueness of our approach lies not only in the consideration of sub-cavities, but also in the more complete structural representation of these sub-cavities, their parametrization and the method by which they are compared. By only requiring local structural similarity, we are able to leverage previously unused structural information and perform binding inference for proteins that do not share significant structural similarity with known systems.

Results: Our algorithm demonstrates the ability to accurately cluster similar sub-cavities and to predict binding patterns across a diverse set of protein-ligand complexes. When applied to two high-profile drug targets, our algorithm successfully generates a binding profile that is consistent with known inhibitors. The results suggest that our algorithm should be useful in structure-based drug discovery and lead optimization.

PubMed Disclaimer

Figures

**Fig. 1.**
The algorithmic framework is divided of two stages: the sub-cavity generation stage where the sub-cavities are initially identified and the sub-cavity comparison stage where sub-cavities are clustered and reshaped.

**Fig. 2.**
A simplified illustration of the physicochemical analysis of a sub-cavity. (A) illustrates the binding site identification described in Section 3.3. Open square, circle and triangle correspond to inner volume, binding surface and inner surface grid points, respectively. Star corresponds to a chemical group of a flanking amino acid residue. The dashed axes represent the principal axes of the sub-cavity. (B) Illustrates the initialization of the sub-cavity similarity maximization stage described in Section 3.5. The sub-cavity is rigidly transformed such that its principle axes are aligned with the x and y axes and its center of mass is at the origin. The cumulative effect of each flanking functional group (illustrated by the shaded regions) is computed for every grid point in the new coordinate frame that overlaps the sub-cavity.

**Fig. 3.**
Results on 20 simulated template sub-cavity structures. While the homogeneity scores are fairly consistent during refinement (A) the outlier scores dramatically improve after the first iteration (B). The percentage improvement in net cluster similarity for each noise level is shown in (C). Significant improvement is not observed for the 10% noise experiments as the clusters are already extremely similar. A larger improvement in similarity is observed for experiments run with higher noise levels.

**Fig. 4.**
The distribution of homogeneity scores by ligand function type for the six enzyme class experiments. The majority of clusters are highly homogeneous.

**Fig. 5.**
The structural scaffold common to many HIV protease inhibitors contains a central seven-member ring with seven peripheral R-groups (A). A sub-cavity is defined for each R-group. The binding site of HIV-1P 1HWR with its bound ligand is shown in (B). The extent of the binding site is illustrated as a blue mesh. A *sub-cavity* of one of the 1-butene groups is shown in red.

See this image and copyright information in PMC

Cited by

Binding of protein kinase inhibitors to synapsin I inferred from pair-wise binding site similarity measurements.
Defranchi E, Schalon C, Messa M, Onofri F, Benfenati F, Rognan D. Defranchi E, et al. PLoS One. 2010 Aug 16;5(8):e12214. doi: 10.1371/journal.pone.0012214. PLoS One. 2010. PMID: 20808948 Free PMC article.
Considerations of Protein Subpockets in Fragment-Based Drug Design.
Bartolowits M, Davisson VJ. Bartolowits M, et al. Chem Biol Drug Des. 2016 Jan;87(1):5-20. doi: 10.1111/cbdd.12631. Epub 2015 Aug 31. Chem Biol Drug Des. 2016. PMID: 26307335 Free PMC article.
DrugOn: a fully integrated pharmacophore modeling and structure optimization toolkit.
Vlachakis D, Fakourelis P, Megalooikonomou V, Makris C, Kossida S. Vlachakis D, et al. PeerJ. 2015 Jan 13;3:e725. doi: 10.7717/peerj.725. eCollection 2015. PeerJ. 2015. PMID: 25648563 Free PMC article.
Local functional descriptors for surface comparison based binding prediction.
Cipriano GM, Phillips GN Jr, Gleicher M. Cipriano GM, et al. BMC Bioinformatics. 2012 Nov 24;13:314. doi: 10.1186/1471-2105-13-314. BMC Bioinformatics. 2012. PMID: 23176080 Free PMC article.

References

1. Ala PJ, et al. Molecular recognition of cyclic urea HIV-1 protease inhibitors. J. Biol. Chem. 1998;273:12325–12331. - PubMed
1. Bairoch A. The ENZYME database in 2000. Nucleic Acids Res. 2000;28:304–305. - PMC - PubMed
1. Berman HM, et al. The Protein Data Bank. Nucleic Acids Res. 2000;28:235–242. - PMC - PubMed
1. Binkowski AT, et al. Inferring functional relationships of proteins from local sequence and spatial surface patterns. J. Mol. Biol. 2003;332:505–526. - PubMed
1. Chen X, et al. Automated pharmacophore identification for large chemical data sets1. J. Chem. Infor. Comput. Sci. 1999;39:887–896. - PubMed

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Substances

Actions
Actions

LinkOut - more resources

Full Text Sources

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Prediction of sub-cavity binding preferences using an adaptive physicochemical structure representation

Affiliation

Prediction of sub-cavity binding preferences using an adaptive physicochemical structure representation

Authors

Affiliation

Abstract

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Substances

LinkOut - more resources

Full Text Sources

Abstract

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Substances

Related information

LinkOut - more resources

Full Text Sources