Exploring uncharted territories: predicting activity cliffs in structure-activity landscapes
- PMID: 22873578
- PMCID: PMC3448951
- DOI: 10.1021/ci300047k
Abstract
The notion of activity cliffs is an intuitive approach to characterizing structural features that play a key role in modulating the biological activity of a molecule. A variety of methods have been described to quantitatively characterize activity cliffs, such as SALI and SARI. However, these methods are primarily retrospective in nature, highlighting cliffs that are already present in the data set. The current study focuses on employing a pairwise characterization of a data set to train a model to predict whether a new molecule will exhibit an activity cliff with one or more members of the data set. The approach is based on predicting a value for pairs of objects rather than for the individual objects themselves (and thus allows for robust models even for small structure-activity relationship data sets). We extracted structure-activity data for several ChEMBL assays and developed random forest models to predict SALI values from pairwise combinations of molecular descriptors. The models exhibited reasonable RMSEs though, surprisingly, performance on the more significant cliffs tended to be better than on the lesser ones. While the models do not exhibit very high levels of accuracy, our results indicate that they are able to prioritize molecules in terms of their ability to form activity cliffs, thus serving as a tool to prospectively identify activity cliffs.
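The pairwise setup described in the abstract can be illustrated with a minimal sketch. It assumes the commonly cited definition of SALI for a pair of molecules, |A_i - A_j| / (1 - sim(i, j)), with Tanimoto similarity computed on fingerprints stored as sets of on-bit indices. The abstract does not say how descriptors are combined for a pair, so the absolute difference used in `pair_features` is purely an assumption, as are the function names, the `eps` guard against identical fingerprints, and the toy data. The resulting rows are the kind of table a regression model such as a random forest could be trained on; this is not the authors' implementation.

```python
from itertools import combinations


def tanimoto(fp_a, fp_b):
    """Tanimoto similarity of two fingerprints given as sets of on-bit indices."""
    union = len(fp_a | fp_b)
    return len(fp_a & fp_b) / union if union else 1.0


def sali(act_i, act_j, sim, eps=1e-6):
    """SALI for one pair: |A_i - A_j| / (1 - sim).

    eps is an assumed guard so that identical fingerprints (sim == 1)
    do not cause a division by zero.
    """
    return abs(act_i - act_j) / max(1.0 - sim, eps)


def pair_features(desc_i, desc_j):
    """Combine two descriptor vectors into one pair vector.

    Absolute difference is an assumption; the abstract only says
    "pairwise combinations of molecular descriptors".
    """
    return [abs(a - b) for a, b in zip(desc_i, desc_j)]


def build_pairwise_table(molecules):
    """Return one row per unordered pair: pair ids, features x, SALI target y."""
    rows = []
    for m_i, m_j in combinations(molecules, 2):
        sim = tanimoto(m_i["fp"], m_j["fp"])
        rows.append({
            "pair": (m_i["id"], m_j["id"]),
            "x": pair_features(m_i["desc"], m_j["desc"]),
            "y": sali(m_i["activity"], m_j["activity"], sim),
        })
    return rows


# Hypothetical toy data: fingerprints, two descriptors, and an activity value.
molecules = [
    {"id": "m1", "fp": {1, 2, 3, 4}, "desc": [1.0, 0.5], "activity": 5.0},
    {"id": "m2", "fp": {1, 2, 3, 5}, "desc": [1.2, 0.5], "activity": 8.0},
    {"id": "m3", "fp": {7, 8, 9}, "desc": [3.0, 2.0], "activity": 5.5},
]
table = build_pairwise_table(molecules)
```

With N molecules the table has N(N-1)/2 rows (three rows for the three toy molecules here), which is why a pairwise formulation gives a model more training examples than the raw data set has compounds. In this toy example the similar pair with a large activity gap (m1, m2) receives the highest SALI value.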
