Similarity-potency trees: a method to search for SAR information in compound data sets and derive SAR rules
- PMID: 20726598
- DOI: 10.1021/ci100197b
Abstract
An intuitive and generally applicable analysis method, termed the similarity-potency tree (SPT), is introduced to mine structure-activity relationship (SAR) information in compound data sets from any source. Only compound potency values and nearest-neighbor similarity relationships are considered. Rather than analyzing a data set as a whole, the method systematically generates partly overlapping compound neighborhoods and represents them as SPTs. This local analysis scheme simplifies the evaluation of SAR information, and SPTs with high SAR information content are easily identified. By inspecting only a limited number of compound neighborhoods, it is also straightforward to determine whether a data set contains little or no interpretable SAR information. Interactive analysis of SPTs is facilitated by reading the trees in two directions, which makes it possible to extract SAR rules, if available, in a consistent manner. The simplicity and interpretability of the data structure and the ease of calculation are characteristic features of this approach. We apply the methodology to high-throughput screening and lead optimization data sets, compare the approach to standard clustering techniques, illustrate how SAR rules are derived, and provide practical guidance on how best to use the methodology. The SPT program is made freely available to the scientific community.
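The abstract does not spell out how a tree is constructed, so the following is only a minimal sketch of the general idea of organizing one compound neighborhood by nearest-neighbor similarity and annotating it with potency. Everything specific here is an assumption made for illustration and not the authors' published procedure or the SPT program: the Tanimoto coefficient on feature sets, the `threshold` parameter, the rule that each compound is attached to its most similar already-placed compound, the names `tanimoto`, `Node`, and `build_tree`, and the toy fingerprints and potency values.

```python
from dataclasses import dataclass, field


def tanimoto(a, b):
    """Tanimoto coefficient of two feature sets (assumed similarity measure)."""
    union = len(a | b)
    return len(a & b) / union if union else 0.0


@dataclass
class Node:
    name: str
    potency: float
    children: list = field(default_factory=list)


def build_tree(root, fingerprints, potencies, threshold=0.3):
    """Build a hypothetical neighborhood tree around one reference compound.

    Assumption: the neighborhood is every compound whose similarity to the
    root is at least `threshold`; each one is attached to the most similar
    compound that is already in the tree.
    """
    root_fp = fingerprints[root]
    neighbors = [
        c for c in fingerprints
        if c != root and tanimoto(root_fp, fingerprints[c]) >= threshold
    ]
    # Place the compounds closest to the root first (name breaks ties).
    neighbors.sort(key=lambda c: (-tanimoto(root_fp, fingerprints[c]), c))

    nodes = {root: Node(root, potencies[root])}
    for c in neighbors:
        # Nearest neighbor among the compounds placed so far.
        parent = max(
            nodes, key=lambda p: tanimoto(fingerprints[p], fingerprints[c])
        )
        nodes[c] = Node(c, potencies[c])
        nodes[parent].children.append(nodes[c])
    return nodes[root]


def render(node, depth=0):
    """Return indented text lines showing each compound with its potency."""
    lines = ["  " * depth + f"{node.name} (potency {node.potency})"]
    for child in node.children:
        lines.extend(render(child, depth + 1))
    return lines


# Made-up toy data: feature sets and potency values are illustrative only.
toy_fingerprints = {
    "A": frozenset({1, 2, 3, 4}),
    "B": frozenset({1, 2, 3, 5}),
    "C": frozenset({1, 2, 5, 6}),
    "D": frozenset({1, 2, 3, 4, 7}),
    "E": frozenset({8, 9}),
}
toy_potencies = {"A": 7.2, "B": 6.9, "C": 5.1, "D": 8.0, "E": 4.0}

toy_tree = build_tree("A", toy_fingerprints, toy_potencies)
print("\n".join(render(toy_tree)))
```

Repeating `build_tree` with different reference compounds would yield the partly overlapping neighborhoods the abstract mentions; how the original method selects reference compounds and ranks trees by SAR information content is not stated here and is left out of the sketch.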