Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Jan 10;14(6):1443-1452.
doi: 10.1039/d2sc05709c. eCollection 2023 Feb 8.

AlphaFold accelerates artificial intelligence powered drug discovery: efficient discovery of a novel CDK20 small molecule inhibitor

Affiliations

AlphaFold accelerates artificial intelligence powered drug discovery: efficient discovery of a novel CDK20 small molecule inhibitor

Feng Ren et al. Chem Sci. .

Abstract

The application of artificial intelligence (AI) has been considered a revolutionary change in drug discovery and development. In 2020, the AlphaFold computer program predicted protein structures for the whole human genome, which has been considered a remarkable breakthrough in both AI applications and structural biology. Despite the varying confidence levels, these predicted structures could still significantly contribute to structure-based drug design of novel targets, especially the ones with no or limited structural information. In this work, we successfully applied AlphaFold to our end-to-end AI-powered drug discovery engines, including a biocomputational platform PandaOmics and a generative chemistry platform Chemistry42. A novel hit molecule against a novel target without an experimental structure was identified, starting from target selection towards hit identification, in a cost- and time-efficient manner. PandaOmics provided the protein of interest for the treatment of hepatocellular carcinoma (HCC) and Chemistry42 generated the molecules based on the structure predicted by AlphaFold, and the selected molecules were synthesized and tested in biological assays. Through this approach, we identified a small molecule hit compound for cyclin-dependent kinase 20 (CDK20) with a binding constant Kd value of 9.2 ± 0.5 μM (n = 3) within 30 days from target selection and after only synthesizing 7 compounds. Based on the available data, a second round of AI-powered compound generation was conducted and through this, a more potent hit molecule, ISM042-2-048, was discovered with an average Kd value of 566.7 ± 256.2 nM (n = 3). Compound ISM042-2-048 also showed good CDK20 inhibitory activity with an IC50 value of 33.4 ± 22.6 nM (n = 3). In addition, ISM042-2-048 demonstrated selective anti-proliferation activity in an HCC cell line with CDK20 overexpression, Huh7, with an IC50 of 208.7 ± 3.3 nM, compared to a counter screen cell line HEK293 (IC50 = 1706.7 ± 670.0 nM). This work is the first demonstration of applying AlphaFold to the hit identification process in drug discovery.

PubMed Disclaimer

Conflict of interest statement

Insilico Medicine is a company developing an AI-based end-to-end integrated pipeline for drug discovery and development and engaged in aging and cancer research. Alán Aspuru-Guzik is co-founder and Chief Vision officer of Kebotix, an AI-powered materials and molecular discovery company and co-founder and Chief Scientific Officer of Zapata Computing, a quantum software computing company. Alán Aspuru-Guzik is a scientific advisor to Insilico Medicine. Michael Levitt is an advisor to Insilico Medicine.

Figures

Fig. 1
Fig. 1. The pipeline to combine AlphaFold with Insilico Medicine end-to-end, and AI-powered drug discovery platforms PandaOmics and Chemistry42 in the drug discovery for hepatocellular carcinoma from target selection and hit generation to hit identification. A novel therapeutic target was identified from a pool of dark targets that have AlphaFold-predicted but lack experimentally determined structures. Such a target represents a first-in-class novel target which is revealed for the first time to treat HCC.
Fig. 2
Fig. 2. Reported CDK20 inhibitors from the literature and the novel inhibitor ISM042-2-048 discovered in this paper. The Tanimoto similarities of reported molecules to ISM042-2-048 are calculated from Morgan fingerprints using RDKit.
Fig. 3
Fig. 3. Insilico Medicine generative procedures for CDK20 hits.
Fig. 4
Fig. 4. (A) The AlphaFold predicted structure of CDK20 (AF-Q8IZL9-F1-model_v1); (B) ATP pocket of CDK20 with a DFG-in (residue Phe146) conformation. Met84 is the hinge residue. P-loop is colored in green. Two acid centers Asp87 and Glu90 are located in the solvent-exposed region of the protein.
Fig. 5
Fig. 5. Chemical structures for the selected 7 molecules from the first-round Chemistry42 generation for synthesis and testing in CDK20 binding assay.
Fig. 6
Fig. 6. (A) Representative binding affinity curve of ISM042-2-001 in CDK20 kinase binding assay. Data points are presented as the mean of duplicate wells in one experiment. Similar results were obtained in three independent experiments and the KD is the mean ± SD of three independent experiments. (B) Predicted binding pose for ISM042-2-001 with CDK20.
Fig. 7
Fig. 7. Chemical structures for the second-round Chemistry42 generation for synthesis and testing in CDK20 binding and kinase activity assays. Biological data with a standard deviation are presented from three independent experiments.
Fig. 8
Fig. 8. (A) Representative binding affinity curve for ISM042-2-048 in CDK20 kinase binding assay. Data points are presented as the mean of duplicate wells in one experiment. Similar results were obtained in three independent experiments and the KD is the mean ± SD of the three independent experiments. (B) Predicted binding pose for ISM042-2-048 in CDK20.
Fig. 9
Fig. 9. (A) Representative dose–response curve for ISM042-2-048 in CDK20 kinase activity assay. Similar results were obtained in three independent experiments and the IC50 is the mean ± SD of the three independent experiments. (B) Cell viability curves for ISM042-2-048 in cell line Huh7 and counter screen cell line HEK293. Data points are presented as the mean of duplicate wells in one experiment. Similar results were obtained in three independent experiments.

References

    1. RDKit, RDKit: Open-Source Cheminformatics Software, 2022, https://rdkit.org/
    1. Batool M. Ahmad B. Choi S. Int. J. Mol. Sci. 2019;20(11):2783. doi: 10.3390/ijms20112783. - DOI - PMC - PubMed
    1. Nyiri K. Koppany G. Vertessy B. G. Cancer Metastasis Rev. 2020;39:1091–1105. doi: 10.1007/s10555-020-09914-6. - DOI - PMC - PubMed
    1. Marineau J. J. Hamman K. B. Hu S. Alnemy S. Mihalich J. Kabro A. Whitmore K. M. Winter D. K. Roy S. Ciblat S. Ke N. Savinainen A. Wilsily A. Malojcic G. Zahler R. Schmidt D. Bradley M. J. Waters N. J. Chuaqui C. J. Med. Chem. 2022;65:1458–1480. doi: 10.1021/acs.jmedchem.1c01171. - DOI - PubMed
    1. Jumper J. Evans R. Pritzel A. Green T. Figurnov M. Ronneberger O. Tunyasuvunakool K. Bates R. Zidek A. Potapenko A. Bridgland A. Meyer C. Kohl S. A. A. Ballard A. J. Cowie A. Romera-Paredes B. Nikolov S. Jain R. Adler J. Back T. Petersen S. Reiman D. Clancy E. Zielinski M. Steinegger M. Pacholska M. Berghammer T. Silver D. Vinyals O. Senior A. W. Kavukcuoglu K. Kohli P. Hassabis D. Proteins. 2021;89:1711–1721. doi: 10.1002/prot.26257. - DOI - PMC - PubMed