Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2009 Jul;5(7):479-83.
doi: 10.1038/nchembio.180. Epub 2009 May 31.

Quantifying biogenic bias in screening libraries

Affiliations

Quantifying biogenic bias in screening libraries

Jérôme Hert et al. Nat Chem Biol. 2009 Jul.

Abstract

In lead discovery, libraries of 10(6) molecules are screened for biological activity. Given the over 10(60) drug-like molecules thought possible, such screens might never succeed. The fact that they do, even occasionally, implies a biased selection of library molecules. We have developed a method to quantify the bias in screening libraries toward biogenic molecules. With this approach, we consider what is missing from screening libraries and how they can be optimized.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Overlap between commercially available molecules and the GDB gives the purchasable GDB.
Figure 2
Figure 2. Compounds in screening libraries are biased toward biogenic molecules
Percentage of the GDB and purchasable-GDB databases as a function of the Tanimoto similarity to their nearest neighbor in [a] the KEGG and [b] the Dictionary of Natural Compound databases. Percentage of the GDB, the purchasable-GDB, Asinex (360 042 compounds − 815 GDB compliant compounds), Chembridge (473 745 compounds − 389 GDB compliant compounds); IBS (424 806 compounds − 884 GDB compliant compounds), Life Chemicals (285 581 compounds − 172 GDB compliant compounds), Otava (121 657 compounds − 287 GDB compliant compounds) databases as a function of the Tanimoto similarity to their nearest neighbor to the [c] KEGG and the [d] Dictionary of Natural Products databases.
Figure 3
Figure 3
Ratio of the percentage of compounds in the purchasable-GDB and GDB databases that had a similarity ≥ 0.75 to their nearest neighbor in [a] the KEGG and [b] the Dictionary of Natural Products databases versus the number of heavy atoms up to which the database compound (in purchasable-GDB and GDB) are considered.
None

References

    1. Wilhelm S, et al. Discovery and development of sorafenib: a multikinase inhibitor for treating cancer. Nat. Rev. Drug Discovery. 2006;5:835–844. - PubMed
    1. Spencer RW. High-throughput screening of historic collections: observations on file size, biological targets, and file diversity. Biotechnol. Bioeng. 1998;61:61–67. - PubMed
    1. Fox S, Farr-Jones S, Sopchak L, Boggs A, Comley J. High-Throughput Screening: Searching for Higher Productivity. J. Biomol. Screen. 2004;9:354–358. - PubMed
    1. Macarron R. Critical review of the role of HTS in drug discovery. Drug Discov. Today. 2006;11:277–279. - PubMed
    1. Pereira DA, Williams JA. Origin and evolution of high throughput screening. Br. J. Pharmacol. 2007;152:53–61. - PMC - PubMed

Publication types

MeSH terms