Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2022 Nov 5;23(21):13568.
doi: 10.3390/ijms232113568.

Application of Computational Biology and Artificial Intelligence in Drug Design

Affiliations
Review

Application of Computational Biology and Artificial Intelligence in Drug Design

Yue Zhang et al. Int J Mol Sci. .

Abstract

Traditional drug design requires a great amount of research time and developmental expense. Booming computational approaches, including computational biology, computer-aided drug design, and artificial intelligence, have the potential to expedite the efficiency of drug discovery by minimizing the time and financial cost. In recent years, computational approaches are being widely used to improve the efficacy and effectiveness of drug discovery and pipeline, leading to the approval of plenty of new drugs for marketing. The present review emphasizes on the applications of these indispensable computational approaches in aiding target identification, lead discovery, and lead optimization. Some challenges of using these approaches for drug design are also discussed. Moreover, we propose a methodology for integrating various computational techniques into new drug discovery and design.

Keywords: artificial intelligence-aided drug design (AIDD); computational biology; computer-aided drug design (CADD); deep learning.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Figure 1
The process of drug research and development. The details in drug development have been improved in the past forty years. Nowadays, the complete process in drug research includes drug discovery, clinical testing, and approval for production. The process of drug discovery includes target identification, lead discovery, lead optimization, and preclinical testing. This process usually takes 7–10 years and $600 M–$800 M. Then approximately 200 compounds enter preclinical testing step while about 5 compounds enter clinical testing process. This process includes three steps, phase I, II and III clinical trials, respectively. It is a long and expensive process that costs 6–12 years and billions of dollars. The compounds that have passed clinical testing enter the process of approval for production. Approved compounds by FDA/EMA can commercialize on the market. This process takes 1–2 years and about $50 M.
Figure 2
Figure 2
The workflow of structure-based drug design (SBDD) and ligand-based drug design (LBDD). For SBDD, it starts with target identification. Then, binding site of the target requires identifying and compound library needs to be prepared. Next, dock each compound from the library into the identified binding site evaluate the score. In molecular docking, MD simulations can be utilized to obtain more flexible target and rescore for the docking process. Additionally, MD simulations can be applied to lead optimization through ligand-target interactions. Through these steps, leads are obtained primarily. For LBDD, it starts with known ligands with bioactivity. Then, extracting the chemical features of these ligands and build pharmacophore or QSAR model. Next, according to the information of known ligands (e.g., ligand similarity), ligand-based virtual screening is performed in the compound library and leads are screened. These leads are further optimized in wet and dry lab.
Figure 3
Figure 3
Overview of the machine learning-based de novo drug design procedure, from left-top to right-bottom: appropriate data selection; data filtering and classification; molecular data and de-sired properties storage; feature representation for molecules and properties; molecule generation by machine learning methods; generation model optimization by reinforcement learning strategy and property prediction models; de novo molecules generation.
Figure 4
Figure 4
Example of a message passing neural network. The left side is an example of a graphical molecule. Ten atoms are regarded as nodes, with each node connecting with one or more nodes. The right side shows the message passing procedure for the target node I through a multilayer neural network (or one layer). The information from the directly connected nodes J, C, and D is passed to the target node I. As nodes J, C, and D have their own directly connected nodes (G, I, A, E), message is also passed by each of their neural networks. Message passing between nodes in a graph is a circular iteration process.
Figure 5
Figure 5
Variational autoencoder architecture. It consists of an encoder and a decoder, and they are deep neural networks. In general, the encoder maps input molecular data x into latent codes z by parameterizing a posterior distribution qØ(z|x), and the decoder reconstructs molecular data from the learned distribution pθ(x|z).
Figure 6
Figure 6
Flowchart of drug design using GANs. It contains a generator and a discriminator, and they are all deep neural networks. The generator transforms latent vectors that are sampled from a prior distribution such as Gaussian into novel molecular data samples, and the discriminator distinguishes fake molecular data generated by the generator from the actual points sampled from the distribution of training data and gives feed-back.
Figure 7
Figure 7
Architecture of normalizing flows. It contains a series of invertible functions for transforming molecular data into simple distribution and converting the distribution into high-dimensional molecular data for de novo drug design. Each of the functions optimizes the data distribution.

References

    1. DiMasi J.A., Grabowski H.G., Hansen R.W. Innovation in the pharmaceutical industry: New estimates of R&D costs. J. Health Econ. 2016;47:20–33. doi: 10.1016/j.jhealeco.2016.01.012. - DOI - PubMed
    1. Song C.M., Lim S.J., Tong J.C. Recent advances in computer-aided drug design. Brief. Bioinform. 2009;10:579–591. doi: 10.1093/bib/bbp023. - DOI - PubMed
    1. Kalyan K. Pharmaceutical Medicine and Translational Clinical Research. Curr. Sci. 2018;115:1403.
    1. Sabe V.T., Ntombela T., Jhamba L.A., Maguire G.E., Govender T., Naicker T., Kruger H.G. Current trends in computer aided drug design and a highlight of drugs discovered via computational techniques: A review. Eur. J. Med. Chem. 2021;224:113705. doi: 10.1016/j.ejmech.2021.113705. - DOI - PubMed
    1. Gurung A.B., Ali M.A., Lee J., Farah M.A., Al-Anazi K.M. An Updated Review of Computer-Aided Drug Design and Its Application to COVID-19. BioMed Res. Int. 2021;2021:8853056. doi: 10.1155/2021/8853056. - DOI - PMC - PubMed

LinkOut - more resources