Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2023 Jul;9(7):e17575.
doi: 10.1016/j.heliyon.2023.e17575. Epub 2023 Jun 26.

AI in drug discovery and its clinical relevance

Affiliations
Review

AI in drug discovery and its clinical relevance

Rizwan Qureshi et al. Heliyon. 2023 Jul.

Abstract

The COVID-19 pandemic has emphasized the need for novel drug discovery process. However, the journey from conceptualizing a drug to its eventual implementation in clinical settings is a long, complex, and expensive process, with many potential points of failure. Over the past decade, a vast growth in medical information has coincided with advances in computational hardware (cloud computing, GPUs, and TPUs) and the rise of deep learning. Medical data generated from large molecular screening profiles, personal health or pathology records, and public health organizations could benefit from analysis by Artificial Intelligence (AI) approaches to speed up and prevent failures in the drug discovery pipeline. We present applications of AI at various stages of drug discovery pipelines, including the inherently computational approaches of de novo design and prediction of a drug's likely properties. Open-source databases and AI-based software tools that facilitate drug design are discussed along with their associated problems of molecule representation, data collection, complexity, labeling, and disparities among labels. How contemporary AI methods, such as graph neural networks, reinforcement learning, and generated models, along with structure-based methods, (i.e., molecular dynamics simulations and molecular docking) can contribute to drug discovery applications and analysis of drug responses is also explored. Finally, recent developments and investments in AI-based start-up companies for biotechnology, drug design and their current progress, hopes and promotions are discussed in this article.

Keywords: Artificial intelligence; Biotechnology; Drug discovery; Graph neural networks; Molecular dynamics simulation; Molecule representation; Reinforcement learning.

PubMed Disclaimer

Conflict of interest statement

None declared.

Figures

Figure 1
Figure 1
Applications of AI-based methods at different stages of a drug discovery pipeline. There are about 2700 known potential drug target proteins in the human body and about 9600 FDA-approved small molecule drugs , , . Machine learning can be used to identify the targeted protein, GNNs can be used for predicting drug-target interactions and binding affinity, and reinforcement learning can be used to optimize the properties of a molecule. Computer vision can determine the spatial state of the tumor microenvironment. Generative models can be employed to design new molecules, simulation-based studies can suggest properties of protein-drug complexes, such as stability and dynamics, and NLP can be used to mine the existing scientific literature for drug re-purposing, FDA review, and post-market analysis.
Figure 2
Figure 2
Illustration of different formats of Small Molecule Representations. Molecules can be represented as Kekule diagrams with bonds and atoms, SMILES strings (which can be converted into a one-hot encoding), and as molecular graphs, where adjacency, node, and feature matrices can be constructed.
Figure 3
Figure 3
Graph Neural Networks in Prediction Mode. Molecules can be represented as linear data structures, such as adjacency, node, or feature matrices. These matrices can be fed to graph neural networks to learn an embedding, which can be used to predict molecular properties.
Figure 4
Figure 4
Graph Neural Networks in Generation Mode. Initialization is performed to add the first atom to the empty graph G0. A graph transition (append, connect, or terminate) is sampled and performed on the intermediate molecule structure at each step .
Figure 5
Figure 5
The variational auto-encoder (VAE) for de novo design of molecules with desired properties . The neural network converts the discrete input molecule into a Gaussian distribution. The latent variables are reparametrized against the mean and variance. The decoder generates a new molecule from the sampled latent space.
Figure 6
Figure 6
A pipeline for a molecular dynamics simulation. The MD simulation pipeline can be divided into three steps, (i) System preparation, including solvation and topology and coordinate file generation (ii) System simulation for the desired time scale (iii) System or trajectory analysis using analytical methods.
Figure 7
Figure 7
Molecular docking and molecular dynamics simulations can investigate the efficacy of a protein-ligand system, using binding free energy and geometrical properties.
Figure 8
Figure 8
Statistics of AI start-ups for drug discovery.
Figure 9
Figure 9
A typical learning pyramid with critical questions that must be kept in mind while developing AI applications for drug discovery.
Figure 10
Figure 10
Learning from various data sources can aid drug design, clinical decision support, and public health policy. The collaborative intelligence resulting from the merger of “mind and machine” is expected to improve decision-making in healthcare.

References

    1. Organization W.H. World Health Organization; 2017. New perspectives on global health spending for universal health coverage. Tech. Rep.
    1. Wouters O.J., McKee M., Luyten J. Estimated research and development investment needed to bring a new medicine to market, 2009-2018. JAMA. 2020;323(9):844–853. - PMC - PubMed
    1. Khanna I. Drug discovery in pharmaceutical industry: productivity challenges and trends. Drug Discov. Today. 2012;17(19–20):1088–1102. - PubMed
    1. Lin A., Giuliano C.J., Palladino A., John K.M., Abramowicz C., Yuan M.L., Sausville E.L., Lukow D.A., Liu L., Chait A.R., et al. Off-target toxicity is a common mechanism of action of cancer drugs undergoing clinical trials. Sci. Transl. Med. 2019;11(509) - PMC - PubMed
    1. Freedman D.H., et al. Hunting for new drugs with ai. Nature. 2019;576(7787):S49–S53. - PubMed

LinkOut - more resources