Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2022 Dec 20;3(12):100794.
doi: 10.1016/j.xcrm.2022.100794. Epub 2022 Oct 27.

Deep generative molecular design reshapes drug discovery

Affiliations
Review

Deep generative molecular design reshapes drug discovery

Xiangxiang Zeng et al. Cell Rep Med. .

Abstract

Recent advances and accomplishments of artificial intelligence (AI) and deep generative models have established their usefulness in medicinal applications, especially in drug discovery and development. To correctly apply AI, the developer and user face questions such as which protocols to consider, which factors to scrutinize, and how the deep generative models can integrate the relevant disciplines. This review summarizes classical and newly developed AI approaches, providing an updated and accessible guide to the broad computational drug discovery and development community. We introduce deep generative models from different standpoints and describe the theoretical frameworks for representing chemical and biological structures and their applications. We discuss the data and technical challenges and highlight future directions of multimodal deep generative models for accelerating drug discovery.

PubMed Disclaimer

Conflict of interest statement

Declaration of interests E.F.F. has a CRADA arrangement with ChromaDex (USA) and is consultant to Aladdin Healthcare Technologies (UK and Germany), the Vancouver Dementia Prevention Centre (Canada), Intellectual Labs (Norway), and MindRank AI (China). The content of this publication does not necessarily reflect the views or policies of the Department of Health and Human Services, nor does mention of trade names, commercial products, or organizations imply endorsement by the US Government. S.K. and W.C. are employees of IBM TJ Watson Research Center. The other authors declare no competing interests.

Figures

Figure 1
Figure 1
AI and deep generative model applications in the drug discovery pipeline Several successful applications of AI and deep generative models in various stage of the drug development pipeline: (A) AI-assistant target selection and validation, (B) molecular design, lead optimization, and chemical synthesis, (C) biological evaluation (in vitro and in vivo), clinical development, and post marketing surveillance, and (D) several successful preclinical and clinical molecules identified by AI and deep generative models. DDR1, discoidin domain receptor 1; DDR2, discoidin domain receptor tyrosine kinase 2; GSK3B, glycogen synthase kinase 3 beta; JNK3, c-Jun N-terminal kinase 3.
Figure 2
Figure 2
A diagram illustrating three molecular representation approaches Three molecular representation approaches include: (A) one-dimensional (1D) sequence-based representation; (B) graph-based representation; and (C) 3D representation for both small molecules and macromolecules (i.e., proteins). The value of contact map matrix is 1 if the distance is greater than a predetermined threshold, otherwise it is 0.
Figure 3
Figure 3
A diagram illustrating the theory framework of five deep generative models (A–E) in the drug discovery applications RNN, recurrent neural networks; VAE, variational autoencoder; GAN, generative adversarial networks; RL, reinforcement learning.
Figure 4
Figure 4
A proposed multimodal generative model in the drug discovery applications (A) A hybrid data model can fully capture diverse information during drug design, including chemical, drug-target interactions, drug-disease knowledge, and disease-relevant expression of target (protein/gene). (B) A multimodal generative model can consider various drug discovery pipeline components to increase likelihood of success of clinical trials. ADME-Tox, absorption, distribution, metabolism, and excretion-toxicity; IC50, half-maximal inhibitory concentration.

References

    1. Avorn J. The $2.6 billion pill–methodologic and policy considerations. N. Engl. J. Med. 2015;372:1877–1879. - PubMed
    1. Fleming N. How artificial intelligence is changing drug discovery. Nature. 2018;557:S55–S57. - PubMed
    1. Schütt K.T., Gastegger M., Tkatchenko A., Müller K.R., Maurer R.J. Unifying machine learning and quantum chemistry with a deep neural network for molecular wavefunctions. Nat. Commun. 2019;10:5024. - PMC - PubMed
    1. Zeng X., Zhu S., Lu W., Liu Z., Huang J., Zhou Y., Fang J., Huang Y., Guo H., Li L., et al. Target identification among known drugs by deep learning from heterogeneous networks. Chem. Sci. 2020;11:1775–1797. - PMC - PubMed
    1. Hie B., Zhong E.D., Berger B., Bryson B. Learning the language of viral evolution and escape. Science. 2021;371:284–288. - PubMed

Publication types

MeSH terms