Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Aug;25(3):1439-1460.
doi: 10.1007/s11030-021-10256-w. Epub 2021 Jun 23.

Evolving scenario of big data and Artificial Intelligence (AI) in drug discovery

Affiliations

Evolving scenario of big data and Artificial Intelligence (AI) in drug discovery

Manish Kumar Tripathi et al. Mol Divers. 2021 Aug.

Abstract

The accumulation of massive data in the plethora of Cheminformatics databases has made the role of big data and artificial intelligence (AI) indispensable in drug design. This has necessitated the development of newer algorithms and architectures to mine these databases and fulfil the specific needs of various drug discovery processes such as virtual drug screening, de novo molecule design and discovery in this big data era. The development of deep learning neural networks and their variants with the corresponding increase in chemical data has resulted in a paradigm shift in information mining pertaining to the chemical space. The present review summarizes the role of big data and AI techniques currently being implemented to satisfy the ever-increasing research demands in drug discovery pipelines.

Keywords: Artificial intelligence; Autoencoders; Big data; Deep learning; Drug discovery; Machine learning.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
Growth of machine learning with the subsequent increase in big data and computation power; KB—Kilobyte, MB—Megabyte, CPU—Central processing unit, GPU—Graphics processing unit, HTS—High throughput sequencing
Fig. 2
Fig. 2
Workflow of machine learning (ML) process in drug discovery
Fig. 3
Fig. 3
a Deep learning neural network (DLNN) without dropout b Deep learning neural network (DLNN) with dropout
Fig. 4
Fig. 4
De novo chemical design using generative adversarial networks (GANs)
Fig. 5
Fig. 5
Representation of an autoencoder. The green circles represent the hidden layer
Fig. 6
Fig. 6
A deep autoencoder with hidden layers. The hierarchical representations from the hidden layers can be used as features in the training of learning algorithms
Fig. 7
Fig. 7
Schematic representation of stacking ensemble approach
Fig. 8
Fig. 8
Role of AI technology in different phases of drug discovery

References

    1. Dash S, Shakyawar SK, Sharma M, Kaushik S. Big data in healthcare: management, analysis and future prospects. J Big Data. 2019;6:54. doi: 10.1186/s40537-019-0217-0. - DOI
    1. De Mauro A, Greco M, Grimaldi M. A formal definition of Big Data based on its essential features. Libr Rev. 2016;65:122–135. doi: 10.1108/LR-06-2015-0061. - DOI
    1. Sivarajah U, Kamal MM, Irani Z, Weerakkody V. Critical analysis of Big Data challenges and analytical methods. J Bus Res. 2017;70:263–286. doi: 10.1016/j.jbusres.2016.08.001. - DOI
    1. Alemayehu D, Berger ML. Big Data: transforming drug development and health policy decision making. Heal Serv Outcomes Res Methodol. 2016;16:92–102. doi: 10.1007/s10742-016-0144-x. - DOI - PMC - PubMed
    1. Kim RS, Goossens N, Hoshida Y. Use of big data in drug development for precision medicine. Expert Rev Precis Med drug Dev. 2016;1:245–253. doi: 10.1080/23808993.2016.1174062. - DOI - PMC - PubMed

LinkOut - more resources