. 2018 Jan 22;58(1):119-133.

doi: 10.1021/acs.jcim.7b00309. Epub 2017 Dec 20.

Task-Specific Scoring Functions for Predicting Ligand Binding Poses and Affinity and for Screening Enrichment

Hossam M Ashtawy¹, Nihar R Mahapatra¹

Affiliations

PMID: 29190087
DOI: 10.1021/acs.jcim.7b00309

Task-Specific Scoring Functions for Predicting Ligand Binding Poses and Affinity and for Screening Enrichment

Hossam M Ashtawy et al. J Chem Inf Model. 2018.

. 2018 Jan 22;58(1):119-133.

doi: 10.1021/acs.jcim.7b00309. Epub 2017 Dec 20.

Authors

Hossam M Ashtawy¹, Nihar R Mahapatra¹

Affiliation

¹ Department of Electrical and Computer Engineering, Michigan State University , East Lansing, Michigan 48824-1226, United States.

PMID: 29190087
DOI: 10.1021/acs.jcim.7b00309

Abstract

Molecular docking, scoring, and virtual screening play an increasingly important role in computer-aided drug discovery. Scoring functions (SFs) are typically employed to predict the binding conformation (docking task), binding affinity (scoring task), and binary activity level (screening task) of ligands against a critical protein target in a disease's pathway. In most molecular docking software packages available today, a generic binding affinity-based (BA-based) SF is invoked for all three tasks to solve three different, but related, prediction problems. The limited predictive accuracies of such SFs in these three tasks has been a major roadblock toward cost-effective drug discovery. Therefore, in this work, we develop BT-Score, an ensemble machine-learning (ML) SF of boosted decision trees and thousands of predictive descriptors to estimate BA. BT-Score reproduced BA of out-of-sample test complexes with correlation of 0.825. Even with this high accuracy in the scoring task, we demonstrate that the docking and screening performance of BT-Score and other BA-based SFs is far from ideal. This has motivated us to build two task-specific ML SFs for the docking and screening problems. We propose BT-Dock, a boosted-tree ensemble model trained on a large number of native and computer-generated ligand conformations and optimized to predict binding poses explicitly. This model has shown an average improvement of 25% over its BA-based counterparts in different ligand pose prediction scenarios. Similar improvement has also been obtained by our screening-based SF, BT-Screen, which directly models the ligand activity labeling task as a classification problem. BT-Screen is trained on thousands of active and inactive protein-ligand complexes to optimize it for finding real actives from databases of ligands not seen in its training set. In addition to the three task-specific SFs, we propose a novel multi-task deep neural network (MT-Net) that is trained on data from the three tasks to simultaneously predict binding poses, affinities, and activity levels. We show that the performance of MT-Net is superior to conventional SFs and on a par with or better than models based on single-task neural networks.

PubMed Disclaimer

Cited by

GNINA 1.0: molecular docking with deep learning.
McNutt AT, Francoeur P, Aggarwal R, Masuda T, Meli R, Ragoza M, Sunseri J, Koes DR. McNutt AT, et al. J Cheminform. 2021 Jun 9;13(1):43. doi: 10.1186/s13321-021-00522-2. J Cheminform. 2021. PMID: 34108002 Free PMC article.
Nonparametric chemical descriptors for the calculation of ligand-biopolymer affinities with machine-learning scoring functions.
Moman E, Grishina MA, Potemkin VA. Moman E, et al. J Comput Aided Mol Des. 2019 Nov;33(11):943-953. doi: 10.1007/s10822-019-00248-2. Epub 2019 Nov 14. J Comput Aided Mol Des. 2019. PMID: 31728812
Integrated Molecular Modeling and Machine Learning for Drug Design.
Xia S, Chen E, Zhang Y. Xia S, et al. J Chem Theory Comput. 2023 Nov 14;19(21):7478-7495. doi: 10.1021/acs.jctc.3c00814. Epub 2023 Oct 26. J Chem Theory Comput. 2023. PMID: 37883810 Free PMC article. Review.
PLAS-20k: Extended Dataset of Protein-Ligand Affinities from MD Simulations for Machine Learning Applications.
Korlepara DB, C S V, Srivastava R, Pal PK, Raza SH, Kumar V, Pandit S, Nair AG, Pandey S, Sharma S, Jeurkar S, Thakran K, Jaglan R, Verma S, Ramachandran I, Chatterjee P, Nayar D, Priyakumar UD. Korlepara DB, et al. Sci Data. 2024 Feb 9;11(1):180. doi: 10.1038/s41597-023-02872-y. Sci Data. 2024. PMID: 38336857 Free PMC article.
Deep learning and virtual drug screening.
Carpenter KA, Cohen DS, Jarrell JT, Huang X. Carpenter KA, et al. Future Med Chem. 2018 Nov;10(21):2557-2567. doi: 10.4155/fmc-2018-0314. Epub 2018 Oct 5. Future Med Chem. 2018. PMID: 30288997 Free PMC article.

See all "Cited by" articles

Publication types

Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Substances

Actions
Actions

LinkOut - more resources

Full Text Sources
- American Chemical Society
Other Literature Sources
- The Lens - Patent Citations Database
- scite Smart Citations

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Task-Specific Scoring Functions for Predicting Ligand Binding Poses and Affinity and for Screening Enrichment

Affiliation

Task-Specific Scoring Functions for Predicting Ligand Binding Poses and Affinity and for Screening Enrichment

Authors

Affiliation

Abstract

Similar articles

Cited by

Publication types

MeSH terms

Substances

LinkOut - more resources

Full Text Sources

Other Literature Sources

Abstract

Similar articles

Cited by

Publication types

MeSH terms

Substances

Related information

LinkOut - more resources

Full Text Sources

Other Literature Sources