Nonlinear scoring functions for similarity-based ligand docking and binding affinity prediction
- PMID: 24171431
- DOI: 10.1021/ci400510e
Abstract
A common strategy for virtual screening is the systematic docking of a large library of organic compounds into target sites in protein receptors, with promising leads selected on the basis of favorable intermolecular interactions. Despite continuous progress in the modeling of protein-ligand interactions for pharmaceutical design, important challenges remain, and the development of novel techniques is required. In this communication, we describe eSimDock, a new approach to ligand docking and binding affinity prediction. eSimDock employs nonlinear machine learning-based scoring functions to improve the accuracy of ligand ranking and similarity-based binding pose prediction, and to increase the tolerance to structural imperfections in the target structures. In large-scale benchmarking using the Astex/CCDC data set, we show that 53.9% (67.9%) of the predicted ligand poses have an RMSD of <2 Å (<3 Å). Moreover, using binding sites predicted by the recently developed eFindSite, eSimDock models ligand binding poses with an RMSD of 4 Å for 50.0-39.7% of the complexes when the protein homology level is limited to 80-40%. Simulations against non-native receptor structures, whose mean backbone rearrangements vary from 0.5 to 5.0 Å Cα-RMSD, show that the ratio of docking accuracy to the estimated upper bound remains at a constant level of ∼0.65. The Pearson correlation coefficient between experimental Ki values and those predicted by eSimDock for a large data set of crystal structures of protein-ligand complexes from BindingDB is 0.58, which decreases only to 0.46 when target structures distorted to 3.0 Å Cα-RMSD are used. Finally, two case studies demonstrate that eSimDock can also be customized to specific applications. These encouraging results show that the performance of eSimDock is largely unaffected by deformations of ligand binding regions; it therefore represents a practical strategy for across-proteome virtual screening using protein models.
eSimDock is freely available to the academic community as a Web server at http://www.brylinski.org/esimdock.
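The abstract reports three kinds of figures: pose RMSD, the fraction of complexes docked below an RMSD cutoff, and a Pearson correlation between experimental and predicted affinities. The following is a minimal sketch of how such metrics are commonly computed; it is not eSimDock's code. The function names (`pose_rmsd`, `success_rate`, `pearson`) and all numbers in the usage lines are hypothetical, and the RMSD here assumes both poses list the same atoms in the same order and in the same receptor frame (no superposition, no symmetry correction).

```python
import math


def pose_rmsd(predicted, reference):
    """RMSD in Å between two ligand poses.

    Each pose is a list of (x, y, z) tuples for the same atoms in the same
    order, expressed in the receptor coordinate frame. No superposition is
    applied, as is usual when judging docked poses.
    """
    if len(predicted) != len(reference) or not predicted:
        raise ValueError("poses must be non-empty and have equal atom counts")
    squared = sum(
        (p - r) ** 2
        for atom_p, atom_r in zip(predicted, reference)
        for p, r in zip(atom_p, atom_r)
    )
    return math.sqrt(squared / len(predicted))


def success_rate(rmsds, cutoff):
    """Fraction of complexes whose pose RMSD is strictly below the cutoff."""
    if not rmsds:
        raise ValueError("need at least one RMSD value")
    return sum(1 for r in rmsds if r < cutoff) / len(rmsds)


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


# Toy usage with made-up numbers (not data from the paper).
toy_rmsds = [1.2, 2.5, 3.4, 0.8]
print(success_rate(toy_rmsds, 2.0), success_rate(toy_rmsds, 3.0))
print(pearson([5.1, 6.3, 7.8, 8.0], [5.6, 6.0, 7.1, 8.4]))
```

Reporting the success rate at two cutoffs, as the abstract does for <2 Å and <3 Å, only requires calling `success_rate` twice on the same list of per-complex RMSD values.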