Quantifying structure and performance diversity for sets of small molecules comprising small-molecule screening collections
- PMID: 21482810
- PMCID: PMC3084049
- DOI: 10.1073/pnas.1015024108
Quantifying structure and performance diversity for sets of small molecules comprising small-molecule screening collections
Abstract
Using a diverse collection of small molecules we recently found that compound sets from different sources (commercial; academic; natural) have different protein-binding behaviors, and these behaviors correlate with trends in stereochemical complexity for these compound sets. These results lend insight into structural features that synthetic chemists might target when synthesizing screening collections for biological discovery. We report extensive characterization of structural properties and diversity of biological performance for these compounds and expand comparative analyses to include physicochemical properties and three-dimensional shapes of predicted conformers. The results highlight additional similarities and differences between the sets, but also the dependence of such comparisons on the choice of molecular descriptors. Using a protein-binding dataset, we introduce an information-theoretic measure to assess diversity of performance with a constraint on specificity. Rather than relying on finding individual active compounds, this measure allows rational judgment of compound subsets as groups. We also apply this measure to publicly available data from ChemBank for the same compound sets across a diverse group of functional assays. We find that performance diversity of compound sets is relatively stable across a range of property values as judged by this measure, both in protein-binding studies and functional assays. Because building screening collections with improved performance depends on efficient use of synthetic organic chemistry resources, these studies illustrate an important quantitative framework to help prioritize choices made in building such collections.
Conflict of interest statement
The authors declare no conflict of interest.
Figures





Similar articles
-
Small molecules of different origins have distinct distributions of structural complexity that correlate with protein-binding profiles.Proc Natl Acad Sci U S A. 2010 Nov 2;107(44):18787-92. doi: 10.1073/pnas.1012741107. Epub 2010 Oct 18. Proc Natl Acad Sci U S A. 2010. PMID: 20956335 Free PMC article.
-
Selecting optimally diverse compounds from structure databases: a validation study of two-dimensional and three-dimensional molecular descriptors.J Med Chem. 1997 Apr 11;40(8):1219-29. doi: 10.1021/jm960352+. J Med Chem. 1997. PMID: 9111296
-
Molecular diversity management strategies for building and enhancement of diverse and focused lead discovery compound screening collections.Comb Chem High Throughput Screen. 2004 Dec;7(8):771-81. doi: 10.2174/1386207043328238. Comb Chem High Throughput Screen. 2004. PMID: 15578939 Review.
-
The Use of Informer Sets in Screening: Perspectives on an Efficient Strategy to Identify New Probes.SLAS Discov. 2021 Aug;26(7):855-861. doi: 10.1177/24725552211019410. Epub 2021 Jun 16. SLAS Discov. 2021. PMID: 34130532 Free PMC article.
-
Charting, navigating, and populating natural product chemical space for drug discovery.J Med Chem. 2012 Jul 12;55(13):5989-6001. doi: 10.1021/jm300288g. Epub 2012 May 11. J Med Chem. 2012. PMID: 22537178 Review.
Cited by
-
Plane of best fit: a novel method to characterize the three-dimensionality of molecules.J Chem Inf Model. 2012 Oct 22;52(10):2516-25. doi: 10.1021/ci300293f. Epub 2012 Sep 26. J Chem Inf Model. 2012. PMID: 23009689 Free PMC article.
-
Chemical probes and drug leads from advances in synthetic planning and methodology.Nat Rev Drug Discov. 2018 May;17(5):333-352. doi: 10.1038/nrd.2018.53. Epub 2018 Apr 13. Nat Rev Drug Discov. 2018. PMID: 29651105 Free PMC article. Review.
-
Radical [3 + 2]-annulation of divinylcyclopropanes: rapid synthesis of complex meloscine analogs.Org Lett. 2014 Jan 3;16(1):94-7. doi: 10.1021/ol403078e. Epub 2013 Dec 6. Org Lett. 2014. PMID: 24313360 Free PMC article.
-
An informatic pipeline for managing high-throughput screening experiments and analyzing data from stereochemically diverse libraries.J Comput Aided Mol Des. 2013 May;27(5):455-68. doi: 10.1007/s10822-013-9641-y. Epub 2013 Apr 13. J Comput Aided Mol Des. 2013. PMID: 23585218 Free PMC article.
-
Identification of Novel Potential Inhibitors of Pteridine Reductase 1 in Trypanosoma brucei via Computational Structure-Based Approaches and in Vitro Inhibition Assays.Molecules. 2019 Jan 1;24(1):142. doi: 10.3390/molecules24010142. Molecules. 2019. PMID: 30609681 Free PMC article.
References
-
- Iwasa J, Fujita T, Hansch C. Substituent constants for aliphatic functions obtained from partition coefficients. J Med Chem. 1965;8:150–153. - PubMed
-
- Fujita T, Hansch C. Analysis of the structure-activity relationship of the sulfonamide drugs using substituent constants. J Med Chem. 1967;10:991–1000. - PubMed
-
- Hansch C. A quantitative approach to biochemical structure-activity relationships. Acc Chem Res. 1969;2:232–239.
-
- Clemons PA. Chemical Informatics. In: Schreiber SL, Kapoor TM, Wess G, editors. Chemical Biology: From Small Molecules to Systems Biology and Drug Design. Vol 2. Weinheim Germany: Wiley-VCH; 2007. pp. 723–759.
-
- Drewry DH, Macarron R. Enhancements of screening collections to address areas of unmet medical need: An industry perspective. Curr Opin Chem Biol. 2010;14:289–298. - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources