Integrated data-driven and experimental approaches to accelerate lead optimization targeting SARS-CoV-2 main protease
- PMID: 37314632
- DOI: 10.1007/s10822-023-00509-1
Integrated data-driven and experimental approaches to accelerate lead optimization targeting SARS-CoV-2 main protease
Abstract
Identification of potential therapeutic candidates can be expedited by integrating computational modeling with domain aware machine learning (ML) models followed by experimental validation in an iterative manner. Generative deep learning models can generate thousands of new candidates, however, their physiochemical and biochemical properties are typically not fully optimized. Using our recently developed deep learning models and a scaffold as a starting point, we generated tens of thousands of compounds for SARS-CoV-2 Mpro that preserve the core scaffold. We utilized and implemented several computational tools such as structural alert and toxicity analysis, high throughput virtual screening, ML-based 3D quantitative structure-activity relationships, multi-parameter optimization, and graph neural networks on generated candidates to predict biological activity and binding affinity in advance. As a result of these combined computational endeavors, eight promising candidates were singled out and put through experimental testing using Native Mass Spectrometry and FRET-based functional assays. Two of the tested compounds with quinazoline-2-thiol and acetylpiperidine core moieties showed IC[Formula: see text] values in the low micromolar range: [Formula: see text] [Formula: see text]M and 3.41±0.0015 [Formula: see text]M, respectively. Molecular dynamics simulations further highlight that binding of these compounds results in allosteric modulations within the chain B and the interface domains of the Mpro. Our integrated approach provides a platform for data driven lead optimization with rapid characterization and experimental validation in a closed loop that could be applied to other potential protein targets.
Keywords: Deep learning; Lead optimization; Molecular docking; Quantitative structure activity relationship; Scaffold.
© 2023. This is a U.S. Government work and not under copyright protection in the US; foreign copyright protection may apply.
Similar articles
-
Optimization Rules for SARS-CoV-2 Mpro Antivirals: Ensemble Docking and Exploration of the Coronavirus Protease Active Site.Viruses. 2020 Aug 26;12(9):942. doi: 10.3390/v12090942. Viruses. 2020. PMID: 32859008 Free PMC article.
-
Anthocyanin derivatives as potent inhibitors of SARS-CoV-2 main protease: An in-silico perspective of therapeutic targets against COVID-19 pandemic.J Biomol Struct Dyn. 2021 Oct;39(16):6171-6183. doi: 10.1080/07391102.2020.1801510. Epub 2020 Aug 3. J Biomol Struct Dyn. 2021. PMID: 32741312
-
Protein-Ligand Docking Simulations with AutoDock4 Focused on the Main Protease of SARS-CoV-2.Curr Med Chem. 2021;28(37):7614-7633. doi: 10.2174/0929867328666210329094111. Curr Med Chem. 2021. PMID: 33781188
-
Deep Lead Optimization: Leveraging Generative AI for Structural Modification.J Am Chem Soc. 2024 Nov 20;146(46):31357-31370. doi: 10.1021/jacs.4c11686. Epub 2024 Nov 5. J Am Chem Soc. 2024. PMID: 39499822 Review.
-
Enhancing computational fluid dynamics with machine learning.Nat Comput Sci. 2022 Jun;2(6):358-366. doi: 10.1038/s43588-022-00264-7. Epub 2022 Jun 27. Nat Comput Sci. 2022. PMID: 38177587 Review.
Cited by
-
Native Mass Spectrometry Dissects the Structural Dynamics of an Allosteric Heterodimer of SARS-CoV-2 Nonstructural Proteins.J Am Soc Mass Spectrom. 2024 May 1;35(5):912-921. doi: 10.1021/jasms.3c00453. Epub 2024 Mar 27. J Am Soc Mass Spectrom. 2024. PMID: 38535992 Free PMC article.
-
Applications of Artificial Intelligence in Biotech Drug Discovery and Product Development.MedComm (2020). 2025 Jul 30;6(8):e70317. doi: 10.1002/mco2.70317. eCollection 2025 Aug. MedComm (2020). 2025. PMID: 40740484 Free PMC article. Review.
References
-
- Sun D, Gao W, Hu H, Zhou S (2022) Why 90% of clinical drug development fails and how to improve it? Acta Pharm Sin B 12:3059 - DOI
-
- Shivanyuk A, Ryabukhin S, Tolmachev A, Bogolyubsky A, Mykytenko D, Chupryna A, Heilman W, Kostyuk A (2007) Enamine real database: making chemical diversity real. Chem Today 25(6):58–59
-
- Kiss R, Sandor M, Szalai F.A (2012) a public web service for drug discovery. J Cheminform 4(1):1–1 ( http://mcule.com )
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Medical
Research Materials
Miscellaneous