Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2019 Nov 7;4(4):206-213.
doi: 10.1136/svn-2019-000290. eCollection 2019 Dec.

Artificial intelligence and big data facilitated targeted drug discovery

Affiliations
Review

Artificial intelligence and big data facilitated targeted drug discovery

Benquan Liu et al. Stroke Vasc Neurol. .

Abstract

Different kinds of biological databases publicly available nowadays provide us a goldmine of multidiscipline big data. The Cancer Genome Atlas is a cancer database including detailed information of many patients with cancer. DrugBank is a database including detailed information of approved, investigational and withdrawn drugs, as well as other nutraceutical and metabolite structures. PubChem is a chemical compound database including all commercially available compounds as well as other synthesisable compounds. Protein Data Bank is a crystal structure database including X-ray, cryo-EM and nuclear magnetic resonance protein three-dimensional structures as well as their ligands. On the other hand, artificial intelligence (AI) is playing an important role in the drug discovery progress. The integration of such big data and AI is making a great difference in the discovery of novel targeted drug. In this review, we focus on the currently available advanced methods for the discovery of highly effective lead compounds with great absorption, distribution, metabolism, excretion and toxicity properties.

Keywords: artificial intelligence; big data; targeted drug.

PubMed Disclaimer

Conflict of interest statement

Competing interests: None declared.

Figures

Figure 1
Figure 1
Schematic procedure of artificial intelligence (AI)-assisted virtual screening. Millions of structurally diverse chemical compounds are docked to a specific therapeutic target. AI scoring function is used to select the best hits from millions of docked results.
Figure 2
Figure 2
Schematic procedure of artificial intelligence (AI)-assisted reverse docking. More than 100 000 structurally diverse protein structures are reversely docked to a specific chemical compound/natural product. AI scoring function is used to select the best hits from millions of docked results.
Figure 3
Figure 3
AI-assisted ADMET properties prediction. (A) Deep learning algorithm to calculate logBB for a specific chemical compound. (B) Deep learning algorithm to calculate logPapp for a specific chemical compound. (C) PCA(Principal Component Analysis) analysis on 48 186 reverse-docked proteins for 55 FDA-approved drugs (yellow dots) and 224 FDA-withdrawn drugs (blue dots). (D) PLS-DA(Partial Least Squares Discriminant Analysis) analysis on 48 186 reverse-docked proteins for 55 FDA-approved drugs (blue dots) and 224 FDA-withdrawn drugs (yellow dots). ADMET, absorption, distribution, metabolism, excretion and toxicity; AI, artificial intelligence; FDA, Food and Drug Administration;TPSA,total polar surface area.

References

    1. Moore TJ, Zhang H, Anderson G, et al. . Estimated costs of pivotal trials for novel therapeutic agents Approved by the US food and drug administration, 2015-2016. JAMA Intern Med 2018;178:1451–7. 10.1001/jamainternmed.2018.3931 - DOI - PMC - PubMed
    1. Ferreira LG, Dos Santos RN, Oliva G, et al. . Molecular docking and structure-based drug design strategies. Molecules 2015;20:13384–421. 10.3390/molecules200713384 - DOI - PMC - PubMed
    1. da Silva Rocha SFL, Olanda CG, Fokoue HH, et al. . Virtual screening techniques in drug discovery: review and recent applications. Curr Top Med Chem 2019;19:1751–67. 10.2174/1568026619666190816101948 - DOI - PubMed
    1. Zhavoronkov A, Ivanenkov YA, Aliper A, et al. . Deep learning enables rapid identification of potent DDR1 kinase inhibitors. Nat Biotechnol 2019;37:1038–40. 10.1038/s41587-019-0224-x - DOI - PubMed
    1. Tomczak K, Czerwińska P, Wiznerowicz M. The cancer genome atlas (TCGA): an immeasurable source of knowledge. Contemp Oncol 2015;19:A68–77. 10.5114/wo.2014.47136 - DOI - PMC - PubMed

Publication types

Substances