CryptoSite: Expanding the Druggable Proteome by Characterization and Prediction of Cryptic Binding Sites
- PMID: 26854760
- PMCID: PMC4794384
- DOI: 10.1016/j.jmb.2016.01.029
Abstract
Many proteins have small-molecule binding pockets that are not easily detectable in the ligand-free structures. These cryptic sites require a conformational change to become apparent; a cryptic site can therefore be defined as a site that forms a pocket in a holo structure, but not in the apo structure. Because many proteins appear to lack druggable pockets, understanding and accurately identifying cryptic sites could expand the set of drug targets. Previously, cryptic sites were identified experimentally by fragment-based ligand discovery and computationally by long molecular dynamics simulations and fragment docking. Here, we begin by constructing a set of structurally defined apo-holo pairs with cryptic sites. Next, we comprehensively characterize the cryptic sites in terms of their sequence, structure, and dynamics attributes. We find that cryptic sites tend to be as conserved in evolution as traditional binding pockets but are less hydrophobic and more flexible. Relying on this characterization, we use machine learning to predict cryptic sites with relatively high accuracy (for our benchmark, the true positive and false positive rates are 73% and 29%, respectively). We then predict cryptic sites in the entire structurally characterized human proteome (11,201 structures, covering 23% of all residues in the proteome). CryptoSite increases the size of the potentially "druggable" human proteome from ~40% to ~78% of disease-associated proteins. Finally, to demonstrate the utility of our approach in practice, we experimentally validate a cryptic site in protein tyrosine phosphatase 1B using a covalent ligand and NMR spectroscopy. The CryptoSite Web server is available at http://salilab.org/cryptosite.
Keywords: cryptic binding sites; machine learning; protein dynamics; undruggable proteins.
Published by Elsevier Ltd.
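As an illustration of how the benchmark figures quoted in the abstract (true positive rate 73%, false positive rate 29%) are defined, the following minimal sketch computes both rates from per-residue labels and predicted scores. The function name `tpr_fpr`, the toy labels and scores, and the threshold of 0.25 are all assumptions made for this example; they are not taken from the paper and do not reproduce the CryptoSite classifier or its benchmark.

```python
from typing import Sequence, Tuple


def tpr_fpr(labels: Sequence[int], scores: Sequence[float],
            threshold: float) -> Tuple[float, float]:
    """Return (true positive rate, false positive rate).

    labels: 1 if the residue belongs to a cryptic site, else 0.
    scores: predicted score per residue (hypothetical values here).
    A residue is predicted positive when its score >= threshold.
    """
    tp = fp = tn = fn = 0
    for label, score in zip(labels, scores):
        predicted_positive = score >= threshold
        if label == 1 and predicted_positive:
            tp += 1
        elif label == 1:
            fn += 1
        elif predicted_positive:
            fp += 1
        else:
            tn += 1
    # Guard against empty classes to avoid division by zero.
    tpr = tp / (tp + fn) if (tp + fn) else 0.0
    fpr = fp / (fp + tn) if (fp + tn) else 0.0
    return tpr, fpr


# Toy data (invented for illustration): 4 cryptic-site residues, 6 others.
labels = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
scores = [0.9, 0.7, 0.4, 0.05, 0.6, 0.3, 0.2, 0.1, 0.05, 0.02]

tpr, fpr = tpr_fpr(labels, scores, threshold=0.25)
print(f"TPR = {tpr:.2f}, FPR = {fpr:.2f}")
```

Lowering the threshold raises both rates together, so a single (TPR, FPR) pair such as the one reported in the abstract corresponds to one operating point of the predictor.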
Grants and funding
- R21 GM110580/GM/NIGMS NIH HHS/United States
- P30 DK063720/DK/NIDDK NIH HHS/United States
- R01 GM083960/GM/NIGMS NIH HHS/United States
- T32 GM008692/GM/NIGMS NIH HHS/United States
- U54 RR022220/RR/NCRR NIH HHS/United States
- Howard Hughes Medical Institute/United States
- DP5 OD009180/OD/NIH HHS/United States
- U54 GM094662/GM/NIGMS NIH HHS/United States
- T32 GM064337/GM/NIGMS NIH HHS/United States
- F31 CA180378/CA/NCI NIH HHS/United States
- P41 GM109824/GM/NIGMS NIH HHS/United States
- P01 AI091575/AI/NIAID NIH HHS/United States
- U01 GM098256/GM/NIGMS NIH HHS/United States