Kinase-kernel models: accurate in silico screening of 4 million compounds across the entire human kinome
- PMID: 22133092
- DOI: 10.1021/ci200314j
Kinase-kernel models: accurate in silico screening of 4 million compounds across the entire human kinome
Abstract
Reliable in silico prediction methods promise many advantages over experimental high-throughput screening (HTS): vastly lower time and cost, affinity magnitude estimates, no requirement for a physical sample, and a knowledge-driven exploration of chemical space. For the specific case of kinases, given several hundred experimental IC(50) training measurements, the empirically parametrized profile-quantitative structure-activity relationship (profile-QSAR) and surrogate AutoShim methods developed at Novartis can predict IC(50) with a reliability approaching experimental HTS. However, in the absence of training data, prediction is much harder. The most common a priori prediction method is docking, which suffers from many limitations: It requires a protein structure, is slow, and cannot predict affinity. (1) Highly accurate profile-QSAR (2) models have now been built for roughly 100 kinases covering most of the kinome. Analyzing correlations among neighboring kinases shows that near neighbors share a high degree of SAR similarity. The novel chemogenomic kinase-kernel method reported here predicts activity for new kinases as a weighted average of predicted activities from profile-QSAR models for nearby neighbor kinases. Three different factors for weighting the neighbors were evaluated: binding site sequence identity to the kinase neighbors, similarity of the training set for each neighbor model to the compound being predicted, and accuracy of each neighbor model. Binding site sequence identity was by far most important, followed by chemical similarity. Model quality had almost no relevance. The median R(2) = 0.55 for kinase-kernel interpolations on 25% of the data of each set held out from method optimization for 51 kinase assays, approached the accuracy of median R(2) = 0.61 for the trained profile-QSAR predictions on the same held out 25% data of each set, far faster and far more accurate than docking. Validation on the full data sets from 18 additional kinase assays not part of method optimization studies also showed strong performance with median R(2) = 0.48. Genetic algorithm optimization of the binding site residues used to compute binding site sequence identity identified 16 privileged residues from a larger set of 46. These 16 are consistent with the kinase selectivity literature and structural biology, further supporting the scientific validity of the approach. A priori kinase-kernel predictions for 4 million compounds were interpolated from 51 existing profile-QSAR models for the remaining >400 novel kinases, totaling 2 billion activity predictions covering the entire kinome. The method has been successfully applied in two therapeutic projects to generate predictions and select compounds for activity testing.
Similar articles
-
Profile-QSAR: a novel meta-QSAR method that combines activities across the kinase family to accurately predict affinity, selectivity, and cellular activity.J Chem Inf Model. 2011 Aug 22;51(8):1942-56. doi: 10.1021/ci1005004. Epub 2011 Jul 19. J Chem Inf Model. 2011. PMID: 21667971
-
QSAR models for predicting the similarity in binding profiles for pairs of protein kinases and the variation of models between experimental data sets.J Chem Inf Model. 2009 Aug;49(8):1974-85. doi: 10.1021/ci900176y. J Chem Inf Model. 2009. PMID: 19639957
-
Trends in kinase selectivity: insights for target class-focused library screening.J Med Chem. 2011 Jan 13;54(1):54-66. doi: 10.1021/jm101195a. Epub 2010 Dec 3. J Med Chem. 2011. PMID: 21128601
-
Kinase-Kernel Models: Accurate Chemogenomic Method for the Entire Human Kinome.Mol Inform. 2013 Dec;32(11-12):922-8. doi: 10.1002/minf.201300091. Epub 2013 Dec 13. Mol Inform. 2013. PMID: 27481138 Review.
-
Receptor-based 3D-QSAR in Drug Design: Methods and Applications in Kinase Studies.Curr Top Med Chem. 2016;16(13):1463-77. doi: 10.2174/1568026615666150915120943. Curr Top Med Chem. 2016. PMID: 26369822 Review.
Cited by
-
KinMap: a web-based tool for interactive navigation through human kinome data.BMC Bioinformatics. 2017 Jan 5;18(1):16. doi: 10.1186/s12859-016-1433-7. BMC Bioinformatics. 2017. PMID: 28056780 Free PMC article.
-
Assessing the information content of structural and protein-ligand interaction representations for the classification of kinase inhibitor binding modes via machine learning and active learning.J Cheminform. 2020 May 24;12(1):36. doi: 10.1186/s13321-020-00434-7. J Cheminform. 2020. PMID: 33431025 Free PMC article.
-
Cheminformatics aspects of high throughput screening: from robots to models: symposium summary.J Comput Aided Mol Des. 2013 May;27(5):443-53. doi: 10.1007/s10822-013-9646-6. Epub 2013 May 1. J Comput Aided Mol Des. 2013. PMID: 23636795 Free PMC article.
-
De novo design of protein kinase inhibitors by in silico identification of hinge region-binding fragments.ACS Chem Biol. 2013 May 17;8(5):1044-52. doi: 10.1021/cb300729y. Epub 2013 Mar 27. ACS Chem Biol. 2013. PMID: 23534475 Free PMC article.
-
Perspective on computational and structural aspects of kinase discovery from IPK2014.Biochim Biophys Acta. 2015 Oct;1854(10 Pt B):1595-604. doi: 10.1016/j.bbapap.2015.03.014. Epub 2015 Apr 7. Biochim Biophys Acta. 2015. PMID: 25861861 Free PMC article. Review.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Miscellaneous