Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Sep 23;64(18):7097-7107.
doi: 10.1021/acs.jcim.4c01451. Epub 2024 Sep 9.

DrugSynthMC: An Atom-Based Generation of Drug-like Molecules with Monte Carlo Search

Affiliations

DrugSynthMC: An Atom-Based Generation of Drug-like Molecules with Monte Carlo Search

Milo Roucairol et al. J Chem Inf Model. .

Abstract

A growing number of deep learning (DL) methodologies have recently been developed to design novel compounds and expand the chemical space within virtual libraries. Most of these neural network approaches design molecules to specifically bind a target based on its structural information and/or knowledge of previously identified binders. Fewer attempts have been made to develop approaches for de novo design of virtual libraries, as synthesizability of generated molecules remains a challenge. In this work, we developed a new Monte Carlo Search (MCS) algorithm, DrugSynthMC (Drug Synthesis using Monte Carlo), in conjunction with DL and statistical-based priors to generate thousands of interpretable chemical structures and novel drug-like molecules per second. DrugSynthMC produces drug-like compounds using an atom-based search model that builds molecules as SMILES, character by character. Designed molecules follow Lipinski's "rule of 5″, show a high proportion of highly water-soluble nontoxic predicted-to-be synthesizable compounds, and efficiently expand the chemical space within the libraries, without reliance on training data sets, synthesizability metrics, or enforcing during SMILES generation. Our approach can function with or without an underlying neural network and is thus easily explainable and versatile. This ease in drug-like molecule generation allows for future integration of score functions aimed at different target- or job-oriented goals. Thus, DrugSynthMC is expected to enable the functional assessment of large compound libraries covering an extensive novel chemical space, overcoming the limitations of existing drug collections. The software is available at https://github.com/RoucairolMilo/DrugSynthMC.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing financial interest.

Figures

Figure 1
Figure 1
AiZynthFinder synthetic routes search: (A) histogram plot showing the number of drugs from a data set comprising 909 drugs retrieved from the FDA subset of the ZINC20 database versus AiZynthFinder synthetic routes search time in seconds. (B) Histogram plot showing the number drugs for which AiZynthFinder successfully finds synthetic routes in less than 2 min versus drugs Molecular Weight. “Solved 1-step” drugs that can be produced directly from commercially available compounds; “Already in stock” drugs that are identified as commercially available by AiZynthFinder.
Figure 2
Figure 2
Physicochemical properties of generated drug-like Mmolecules. Comparison of physicochemical properties among drug-like compounds generated with Random and Ngram, and the “Rule of 5” drugs within the FDA subset of the ZINC20 database. Violin plots showing (A) number of hydrogen bond donors (HBD), (B) hydrogen bond acceptors (HBA), (C) molecular weight (MW), (D) total number of Carbons, (E) heavy atoms, (F) hydrogens, (G) nitrogen, (H) oxygen, (I) fluorine, (K) sulfur, (L) chlorine, (M) pentamers, (N) hexamers, (O) heptamers, (P) aromatic cycles, and (Q) clogP. Physicochemical properties were calculated on libraries containing 1,000 generated compounds.

Similar articles

References

    1. Zhang H.; Chen S. Cyclic peptide drugs approved in the last two decades. RSC Chem. Biol 2022, 3, 18–31. 10.1039/D1CB00154J. - DOI - PMC - PubMed
    1. Zhavoronkov A. Artificial Intelligence for Drug Discovery, Biomarker Development, and Generation of Novel Chemistry. Mol. Pharmaceutics 2018, 15, 4311–4313. 10.1021/acs.molpharmaceut.8b00930. - DOI - PubMed
    1. Salmaso V.; Moro S. Bridging Molecular Docking to Molecular Dynamics in Exploring Ligand-Protein Recognition Process: An Overview. Front. Pharmacol. 2018, 9, 923.10.3389/fphar.2018.00923. - DOI - PMC - PubMed
    1. Grinter S. Z.; Zou X. Challenges, applications, and recent advances of protein-ligand docking in structure-based drug design. Molecules 2014, 19 (7), 10150–10176. 10.3390/molecules190710150. - DOI - PMC - PubMed
    1. Yuriev E.; Holien J.; Ramsland P. A. Improvements, trends, and new ideas in molecular docking: 2012–2013 in review. J. Mol. Recognit. 2015, 28, 581–604. 10.1002/jmr.2471. - DOI - PubMed

Substances

LinkOut - more resources