Advancing Ligand Docking through Deep Learning: Challenges and Prospects in Virtual Screening
- PMID: 38577892
- DOI: 10.1021/acs.accounts.4c00093
Abstract
Molecular docking, also termed ligand docking (LD), is a pivotal element of structure-based virtual screening (SBVS) used to predict the binding conformations and affinities of protein-ligand complexes. Traditional LD methodologies rely on a search and scoring framework, utilizing heuristic algorithms to explore binding conformations and scoring functions to evaluate binding strengths. However, to meet the efficiency demands of SBVS, these algorithms and functions are often simplified, prioritizing speed over accuracy.

The emergence of deep learning (DL) has exerted a profound impact on diverse fields, ranging from natural language processing to computer vision and drug discovery. DeepMind's AlphaFold2 has impressively exhibited its ability to accurately predict protein structures solely from amino acid sequences, highlighting the remarkable potential of DL in conformation prediction. This groundbreaking advancement circumvents the traditional search-scoring frameworks in LD, enhancing both accuracy and processing speed and thereby catalyzing a broader adoption of DL algorithms in binding pose prediction. Nevertheless, a consensus on certain aspects remains elusive.

In this Account, we delineate the current status of employing DL to augment LD within the VS paradigm, highlighting our contributions to this domain. Furthermore, we discuss the challenges and future prospects, drawing insights from our scholarly investigations. Initially, we present an overview of VS and LD, followed by an introduction to DL paradigms, which deviate significantly from traditional search-scoring frameworks. Subsequently, we delve into the challenges associated with the development of DL-based LD (DLLD), encompassing evaluation metrics, application scenarios, and physical plausibility of the predicted conformations. In the evaluation of LD algorithms, it is essential to recognize the multifaceted nature of the metrics. While the accuracy of binding pose prediction, often measured by the success rate, is a pivotal aspect, the scoring/screening power and computational speed of these algorithms are equally important given the pivotal role of LD tools in VS. Regarding application scenarios, early methods focused on blind docking, where the binding site is unknown. However, recent studies suggest a shift toward identifying binding sites rather than solely predicting binding poses within these models. In contrast, LD with a known pocket in VS has been shown to be more practical. Physical plausibility poses another significant challenge. Although DLLD models often achieve higher success rates compared to traditional methods, they may generate poses with implausible local structures, such as incorrect bond angles or lengths, which are disadvantageous for postprocessing tasks like visualization. Finally, we discuss the future perspectives for DLLD, emphasizing the need to improve generalization ability, strike a balance between speed and accuracy, account for protein conformation flexibility, and enhance physical plausibility. Additionally, we delve into the comparison between generative and regression algorithms in this context, exploring their respective strengths and potential.
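The two pose-quality notions mentioned in the abstract, success rate and physical plausibility, can be illustrated with a minimal sketch. It is not the authors' evaluation protocol. It assumes that a pose counts as a success when its RMSD to the reference pose is at most 2 Å (a commonly used cutoff that the abstract does not state), that predicted and reference atoms share the same ordering, and that plausibility is reduced to a bond-length check with a hypothetical 25% tolerance. The function names `rmsd`, `success_rate`, and `bond_length_outliers` are illustrative only.

```python
import math


def rmsd(pred, ref):
    """RMSD between two poses given as lists of (x, y, z) tuples.

    Assumes identical atom ordering; performs no superposition and no
    symmetry correction, which a real benchmark would need to handle.
    """
    if not pred or len(pred) != len(ref):
        raise ValueError("poses must be non-empty and have the same atom count")
    squared = sum(
        (p - r) ** 2
        for atom_p, atom_r in zip(pred, ref)
        for p, r in zip(atom_p, atom_r)
    )
    return math.sqrt(squared / len(pred))


def success_rate(rmsds, threshold=2.0):
    """Fraction of complexes whose pose RMSD is within the threshold.

    The 2.0 A default is an assumed convention, not taken from the abstract.
    """
    if not rmsds:
        raise ValueError("need at least one RMSD value")
    return sum(r <= threshold for r in rmsds) / len(rmsds)


def bond_length_outliers(coords, bonds, tolerance=0.25):
    """Return bonds whose length deviates from a reference length.

    bonds is a list of (i, j, reference_length) entries; a bond is flagged
    when the relative deviation exceeds the (hypothetical) tolerance.
    """
    flagged = []
    for i, j, ref_len in bonds:
        dist = math.dist(coords[i], coords[j])
        if abs(dist - ref_len) > tolerance * ref_len:
            flagged.append((i, j, dist))
    return flagged


if __name__ == "__main__":
    # Toy data only: four made-up RMSD values and a three-atom "ligand".
    print(success_rate([0.5, 1.9, 2.0, 3.5]))
    pose = [(0.0, 0.0, 0.0), (1.5, 0.0, 0.0), (1.5, 3.0, 0.0)]
    print(bond_length_outliers(pose, [(0, 1, 1.5), (1, 2, 1.5)]))
```

Keeping the two checks separate reflects the point made in the abstract: a pose can fall within the RMSD cutoff and still contain distorted local geometry, so a high success rate alone does not establish physical plausibility.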