Machine learning in chemoinformatics and drug discovery
- PMID: 29750902
- PMCID: PMC6078794
- DOI: 10.1016/j.drudis.2018.05.010
Machine learning in chemoinformatics and drug discovery
Abstract
Chemoinformatics is an established discipline focusing on extracting, processing and extrapolating meaningful data from chemical structures. With the rapid explosion of chemical 'big' data from HTS and combinatorial synthesis, machine learning has become an indispensable tool for drug designers to mine chemical information from large compound databases to design drugs with important biological properties. To process the chemical data, we first reviewed multiple processing layers in the chemoinformatics pipeline followed by the introduction of commonly used machine learning models in drug discovery and QSAR analysis. Here, we present basic principles and recent case studies to demonstrate the utility of machine learning techniques in chemoinformatics analyses; and we discuss limitations and future directions to guide further development in this evolving field.
Copyright © 2018 The Authors. Published by Elsevier Ltd.. All rights reserved.
Figures
References
-
- Varnek A, Baskin I. Machine learning methods for property prediction in chemoinformatics: Quo Vadis? J Chem Inf Model. 2012;52:1413–1437. - PubMed
-
- Ali SM, et al. Butitaxel analogues: synthesis and structure-activity relationships. J Med Chem. 1997;40:236–241. - PubMed
-
- Kubinyi H. Free Wilson analysis. Theory, applications and its relationship to Hansch analysis. Quantitative Structure–Activity Relationships. 1988;7:121–133.
-
- Gasteiger J, editor. Handbook of Chemoinformatics: from Data to Knowledge. Wiley-VCH; 2003.
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Medical
