. 2006 Sep-Oct;46(5):1984-95.

doi: 10.1021/ci060132x.

A novel automated lazy learning QSAR (ALL-QSAR) approach: method development, applications, and virtual screening of chemical databases using validated ALL-QSAR models

Shuxing Zhang¹, Alexander Golbraikh, Scott Oloff, Harold Kohn, Alexander Tropsha

Affiliations

Affiliation

¹ Division of Medicinal Chemistry and Natural Products, School of Pharmacy, CB # 7360 Beard Hall, University of North Carolina, Chapel Hill, North Carolina 27599, USA.

PMID: 16995729
PMCID: PMC2536695
DOI: 10.1021/ci060132x

A novel automated lazy learning QSAR (ALL-QSAR) approach: method development, applications, and virtual screening of chemical databases using validated ALL-QSAR models

Shuxing Zhang et al. J Chem Inf Model. 2006 Sep-Oct.

. 2006 Sep-Oct;46(5):1984-95.

doi: 10.1021/ci060132x.

Authors

Shuxing Zhang¹, Alexander Golbraikh, Scott Oloff, Harold Kohn, Alexander Tropsha

Affiliation

¹ Division of Medicinal Chemistry and Natural Products, School of Pharmacy, CB # 7360 Beard Hall, University of North Carolina, Chapel Hill, North Carolina 27599, USA.

PMID: 16995729
PMCID: PMC2536695
DOI: 10.1021/ci060132x

Abstract

A novel automated lazy learning quantitative structure-activity relationship (ALL-QSAR) modeling approach has been developed on the basis of the lazy learning theory. The activity of a test compound is predicted from a locally weighted linear regression model using chemical descriptors and the biological activity of the training set compounds most chemically similar to this test compound. The weights with which training set compounds are included in the regression depend on the similarity of those compounds to a test compound. We have applied the ALL-QSAR method to several experimental chemical data sets including 48 anticonvulsant agents with known ED50 values, 48 dopamine D1-receptor antagonists with known competitive binding affinities (Ki), and a Tetrahymena pyriformis data set containing 250 phenolic compounds with toxicity IGC50 values. When applied to database screening, models developed for anticonvulsant agents identified several known anticonvulsant compounds that were not only absent in the training set but highly chemically dissimilar to the training set compounds. This initial success indicates that ALL-QSAR can be further exploited as a general tool for accurate bioactivity prediction and database screening in drug design and discovery. Because of its local nature, the ALL-QSAR approach appears to be especially well-suited for the development of highly predictive models for the sparse or unevenly distributed data sets.

PubMed Disclaimer

Figures

**Figure 1**
Locally weighted regression. The Figure highlights the difference between the global linear regression and the locally weighted linear regression. The green line is the global linear regression and the red straight line is the weighted linear regression, where the thickness of gray lines indicates the strength of the weight. The red curve line is the final function obtained after combining local linear regressions for all the points.

**Figure 2**
Flowchart of the ALL-QSAR method.

**Figure 3**
The ALL-QSAR statistical modeling workflow.

**Figure 4**
Flowchart of database mining that employs predictive ALL-QSAR models.

**Figure 5**
The correlation between the ridge regression parameter (λ) and the R² for one of the Phenol test sets.

**Figure 6**
R² trajectory with respect to the kernel width during the model development for 39 anticonvulsant agents in the training set and 9 compounds in the test set. Iterations are shown for the real dataset (black) and the dataset with activity randomized (gray).

**Figure 7**
Activity prediction with ALL-QSAR models for 9 anticonvulsants in the test set. R² = 0.90 (Model 1 in Table 1).

**Figure 8**
Activity prediction with ALL-QSAR models for 14 anticonvulsants in the test set. R² = 0.76 (Model 8 in Table 1).

**Figure 9**
Correlation between experimental and predicted pK_i for 11 D₁ antagonists in the test set. Training set included 37 compounds. R² = 0.97 (Model 1 in Table 2)

**Figure 10**
Correlation between experimental and predicted pKi for 14 D1 antagonists in the test set. Training set included 32 compounds. R2 = 0.87 (Model 4 in Table 2). Two compounds, Ant08 and NNC01-0127, are outside of the applicability domain and not shown in the plot.

**Figure 11**
The best ALL-QSAR model with 150 phenols in the training set: R² = 0.90 for the prediction of 50 compounds in the test set (Model 1 in Table 3).

**Figure 12**
The consensus prediction of 50 external toxic phenol compounds with the 10 best ALL-QSAR models affords high accuracy of prediction with R² = 0.86 (Table 3 and 4).

**Figure 13**
Workflow for the identification of novel anticonvulsant agents using consensus database mining.

**Figure 14**
One of the structures identified in virtual screening (top) and Dimmock’s semicarbazone scaffold (bottom).

See this image and copyright information in PMC

Cited by

Predicting Cytotoxicity of Metal Oxide Nanoparticles using Isalos Analytics Platform.
Papadiamantis AG, Jänes J, Voyiatzis E, Sikk L, Burk J, Burk P, Tsoumanis A, Ha MK, Yoon TH, Valsami-Jones E, Lynch I, Melagraki G, Tämm K, Afantitis A. Papadiamantis AG, et al. Nanomaterials (Basel). 2020 Oct 13;10(10):2017. doi: 10.3390/nano10102017. Nanomaterials (Basel). 2020. PMID: 33066094 Free PMC article.
QSAR study of anti-Human African Trypanosomiasis activity for 2-phenylimidazopyridines derivatives using DFT and Lipinski's descriptors.
Chtita S, Ghamali M, Ousaa A, Aouidate A, Belhassan A, Taourati AI, Masand VH, Bouachrine M, Lakhlifi T. Chtita S, et al. Heliyon. 2019 Mar 7;5(3):e01304. doi: 10.1016/j.heliyon.2019.e01304. eCollection 2019 Mar. Heliyon. 2019. PMID: 30899832 Free PMC article.
Assessing the Effects of Alloxydim Phototransformation Products by QSAR Models and a Phytotoxicity Study.
Villaverde JJ, Santín-Montanyá I, Sevilla-Morán B, Alonso-Prados JL, Sandín-España P. Villaverde JJ, et al. Molecules. 2018 Apr 24;23(5):993. doi: 10.3390/molecules23050993. Molecules. 2018. PMID: 29695081 Free PMC article.
Computational modeling of novel inhibitors targeting the Akt pleckstrin homology domain.
Du-Cuny L, Song Z, Moses S, Powis G, Mash EA, Meuillet EJ, Zhang S. Du-Cuny L, et al. Bioorg Med Chem. 2009 Oct 1;17(19):6983-92. doi: 10.1016/j.bmc.2009.08.022. Epub 2009 Aug 19. Bioorg Med Chem. 2009. PMID: 19734051 Free PMC article.
Machine learning prediction of intestinal α-glucosidase inhibitors using a diverse set of ligands: a drug repurposing effort with drugBank database screening.
Odugbemi AI, Nyirenda C, Christoffels A, Egieyeh SA. Odugbemi AI, et al. In Silico Pharmacol. 2025 Jun 25;13(2):95. doi: 10.1007/s40203-025-00384-8. eCollection 2025. In Silico Pharmacol. 2025. PMID: 40575395 Free PMC article.

See all "Cited by" articles

References

1. Dietrich SW, Dreyer ND, Hansch C, Bentley DL. Confidence-Interval Estimators for Parameters Associated with Quantitative Structure-Activity-Relationships. J Med Chem. 1980;23:1201–1205. - PubMed
1. Hadjipavloulitina D, Hansch C. Quantitative Structure-Activity-Relationships of the Benzodiazepines - A Review and Reevaluation. Chem Rev. 1994;94:1483–1505.
1. Hansch C, Muir RM, Fujita T, Maloney PP, Geiger E, Streich M. The Correlation of Biological Activity of Plant Growth Regulators and Chloromycetin Derivatives with Hammett Constants and Partition Coefficients. J Am Chem Soc. 1963;85:2817–2824.
1. Hansch C, Kurup A, Garg R, Gao H. Chem-bioinformatics and QSAR: A review of QSAR lacking positive hydrophobic terms. Chem Rev. 2001;101:619–672. - PubMed
1. Hansch C, Leo A, Mekapati SB, Kurup A. Qsar and Adme. Bioorg Med Chem. 2004;12:3391–3400. - PubMed

Publication types

Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions

Grants and funding

LinkOut - more resources

Full Text Sources

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

A novel automated lazy learning QSAR (ALL-QSAR) approach: method development, applications, and virtual screening of chemical databases using validated ALL-QSAR models

Affiliation

A novel automated lazy learning QSAR (ALL-QSAR) approach: method development, applications, and virtual screening of chemical databases using validated ALL-QSAR models

Authors

Affiliation

Abstract

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources

Abstract

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Related information

Grants and funding

LinkOut - more resources

Full Text Sources