Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2012 Sep 19;3(9):649-57.
doi: 10.1021/cn3000422. Epub 2012 Apr 25.

Exploring chemical space for drug discovery using the chemical universe database

Affiliations
Review

Exploring chemical space for drug discovery using the chemical universe database

Jean-Louis Reymond et al. ACS Chem Neurosci. .

Abstract

Herein we review our recent efforts in searching for bioactive ligands by enumeration and virtual screening of the unknown chemical space of small molecules. Enumeration from first principles shows that almost all small molecules (>99.9%) have never been synthesized and are still available to be prepared and tested. We discuss open access sources of molecules, the classification and representation of chemical space using molecular quantum numbers (MQN), its exhaustive enumeration in form of the chemical universe generated databases (GDB), and examples of using these databases for prospective drug discovery. MQN-searchable GDB, PubChem, and DrugBank are freely accessible at www.gdb.unibe.ch.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Color-coded MQN-map of the PubChem chemical space (19.2 million structures) as the (PC2,PC3)-plane, marked as the horizontal and vertical axes starting from the (0,0) coordinate where hydrogen is located. The values corresponding to each color are indicated on the maps. PC2, PC3 refer to the 2nd and 3rd principal component, respectively, in the PCA of the MQN data for PubChem. (A) Average number of non-hydrogen atoms per molecule. (B) Average fraction of cyclic atoms per molecule. (C) Average fraction of H-bond acceptor atoms per molecule. (D) Compound categories including computationally enumerated molecules (up to hac = 500) for each category. Ro3 are Congreve’s “rule of 3” molecules, and Ro5 are Lipinski’s “rule of 5” molecules.
Figure 2
Figure 2
Overview of the chemical universe database GDB-13 containing 977 million structures up to 13 atoms of C, N, O, Cl, S. (A–C) Color-coded MQN-map of the (PC1,PC2) plane. PC1 (horizontal dimension) and PC2 (vertical dimension) refer to the 1st respectively 2nd principal component in the PCA of the MQN data for GDB-13. (D) Size of the GDB database, its 43.7 M subset, and PubChem as a function of molecular size.
Figure 3
Figure 3
NMDA glycine site and GLT-1 inhibitors identified from GDB-11 by virtual screening, synthesis and testing. Activities were determined by electrophysiology (NMDA glycine site) or by radioactive ligand uptake inhibition (GLT-1) for the human receptors expressed in Xenopus oocytes.
Figure 4
Figure 4
Discovery of α7 nAChR inhibitors by fragment-based diversification of known ligands using GDB-11.
Figure 5
Figure 5
Discovery of α7 nAChR inhibitors by nearest neighbor searching in the MQN-space of GDB-13.

References

    1. van der Horst E.; Peironcely J. E.; van Westen G. J.; van den Hoven O. O.; Galloway W. R.; Spring D. R.; Wegner J. K.; van Vlijmen H. W.; Ijzerman A. P.; Overington J. P.; Bender A. (2011) Chemogenomics approaches for receptor deorphanization and extensions of the chemogenomics concept to phenotypic space. Curr. Top. Med. Chem. 11, 1964–1977. - PubMed
    2. Bon R. S.; Waldmann H. (2010) Bioactivity-guided navigation of chemical space. Acc. Chem. Res. 43, 1103–1114. - PubMed
    1. Harrison P. M.; Kumar A.; Lang N.; Snyder M.; Gerstein M. (2002) A question of size: the eukaryotic proteome and the problems in defining it. Nucleic Acids Res. 30, 1083–1090. - PMC - PubMed
    2. Montelione G. T.; Szyperski T. (2010) Advances in protein NMR provided by the NIGMS Protein Structure Initiative: impact on drug discovery. Curr. Opin. Drug Discovery Dev. 13, 335–349. - PMC - PubMed
    3. Bruckner A.; Polge C.; Lentze N.; Auerbach D.; Schlattner U. (2009) Yeast two-hybrid, a powerful tool for systems biology. Int. J. Mol. Sci. 10, 2763–2788. - PMC - PubMed
    1. Bleicher K. H.; Bohm H. J.; Muller K.; Alanine A. I. (2003) Hit and lead generation: Beyond high-throughput screening. Nat. Rev. Drug Discovery 2, 369–378. - PubMed
    1. Garcia-Serna R.; Mestres J. (2010) Anticipating drug side effects by comparative pharmacology. Expert Opin. Drug Metab. Toxicol. 6, 1253–1263. - PubMed
    2. Brown J. B.; Okuno Y. (2012) Systems biology and systems chemistry: new directions for drug discovery. Chem. Biol. 19, 23–28. - PubMed
    1. Schuffenhauer A.; Brown N.; Selzer P.; Ertl P.; Jacoby E. (2005) Relationships between Molecular Complexity, Biological Activity, and Structural Diversity. J. Chem. Inf. Model. 46, 525–535. - PubMed
    2. Klebe G. (2006) Virtual ligand screening: strategies, perspectives and limitations. Drug Discovery Today 11, 580–594. - PMC - PubMed

Publication types