Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Sep 11;3(1):127.
doi: 10.1038/s42004-020-00374-1.

Navigating the DNA encoded libraries chemical space

Affiliations

Navigating the DNA encoded libraries chemical space

Alfredo Martín et al. Commun Chem. .

Abstract

DNA-encoded library (DEL) technology is a novel ligand identification strategy that allows the synthesis and screening of unprecedented chemical diversity more efficiently than conventional methods. However, no reports have been published to systematically study how to increase the diversity and improve the molecular property space that can be covered with DEL. This report describes the development and application of eDESIGNER, an algorithm that comprehensively generates all possible library designs, enumerates and profiles samples from each library and evaluates them to select the libraries to be synthesized. This tool utilizes suitable on-DNA chemistries and available building blocks to design and identify libraries with a pre-defined molecular weight distribution and maximal diversity compared with compound collections from other sources.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Fig. 1
Fig. 1. eDESIGNER concepts.
a Description of the overall eDESIGNER process: reactions and building block types (BBTs) are combined to generate all possible eDESIGNs, then eDESIGNs are merged into libDESIGNS when they can be implemented together in the same experimental conditions. Only libDESIGNs meeting specific criteria survive and their diversity is analyzed through a sample of 10,000 compounds per libDESIGN. b eDESIGN and libDESIGN objects: BBTs and reactions are combined in graph objects in which edges represent reactions and nodes are BBTs. libDESIGNS combine several eDESIGNS with the same topology and compatible reactions. c Functional group (FG) and BBT objects: a functional FG is a handle that can be used to link BBTs through reactions. Some FGs are exposed for reactions, and some are protected and need deprotection reactions to become exposed. The null FG is added to simplify the code. A combination of FGs defines a BBT. d Reaction object: connecting reactions connect two FGs to link two BBTs eliminating the FGs used (entry 1) or creating a new one (entry 2). Deprotection reactions transform an FG into a different one without addition of mass (entry 3) or incorporating a scaffold (entry 4). e Iterative process to generate an eDESIGN: a growing eDESIGN, characterized by its list of FGs, can then further incorporate new BBTs using additional reactions, convert protected FGs in exposed FGs, or add a non-coded scaffold.
Fig. 2
Fig. 2. eDESIGNER design examples.
a 2-cycle libDESIGN. b 3-cycle libDESIGN.
Fig. 3
Fig. 3. Building block analysis.
a Effective heavy atom distribution of BBs by source (magenta: external BBs, cyan: internal BBs). b Number of building blocks with respect to the multiplicity in their functionality (cyan: monofunctional, magenta: bifunctional, orange: trifunctional). c Analysis of BBs used by eDESIGNER in the two-cycle library designs by BB multiplicity. d Analysis of BBs used by eDESIGNER in the three-cycle library designs by BB multiplicity.
Fig. 4
Fig. 4. Molecular property profiles.
Property distributions for calculated properties. Sample of 10,000 compounds enumerated from each two-cycle libDESIGN (magenta), three-cycle libDESIGN (cyan), previously synthesized libraries (ADEL collection, orange), and 140,000 drug-like compounds belonging to the Lilly diversity cassette (LDC collection, yellow).
Fig. 5
Fig. 5. Diversity analysis.
a Average distance of two-cycle libDESIGN library samples to ADEL. b Average distance of two-cycle libDESIGN library samples to LDC collection. Scores are represented versus library size and lines connect enumeration sets (10,000 compounds each) drawn from samples prepared using internal BBs only to samples using all types of BBs. c Spread design using the ADEL and LDC collections as reference for two-cycle libDESIGNs using internal BBs only. Scores are represented versus library size.

Similar articles

Cited by

References

    1. Brenner, S. & Lerner, R. A. Encoded combinatorial chemistry. Proc. Natl Acad. Sci. USA, 5381–5383 (1992). - PMC - PubMed
    1. Goodnow RA, Jr., Dumelin CE, Keefe AD. DNA-encoded chemistry: enabling the deeper sampling of chemical space. Nat. Rev. Drug Discov. 2017;16:131–147. doi: 10.1038/nrd.2016.213. - DOI - PubMed
    1. Neri D, Lerner RA. DNA-encoded chemical libraries: a selection system based on endowing organic compounds with amplifiable information. Annu. Rev. Biochem. 2018;87:479–502. doi: 10.1146/annurev-biochem-062917-012550. - DOI - PMC - PubMed
    1. Favalli N, Bassi G, Scheuermann J, Neri D. DNA-encoded chemical libraries: achievements and remaining challenges. FEBS Lett. 2018;592:2168–2180. doi: 10.1002/1873-3468.13068. - DOI - PMC - PubMed
    1. Kunig V, Potowski M, Gohla A, Brunschweiger A. DNA-encoded libraries—an efficient small molecule discovery technology for the biomedical sciences. Biol. Chem. 2018;399:691–710. doi: 10.1515/hsz-2018-0119. - DOI - PubMed