Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Sep 30:2:958378.
doi: 10.3389/fbinf.2022.958378. eCollection 2022.

The druggable genome: Twenty years later

Affiliations

The druggable genome: Twenty years later

Chris J Radoux et al. Front Bioinform. .

Abstract

The concept of the druggable genome has been with us for 20 years. During this time, researchers have developed several methods and resources to help assess a target's druggability. In parallel, evidence for target-disease associations has been collated at scale by Open Targets. More recently, the Protein Data Bank in Europe (PDBe) have built a knowledge base matching per-residue annotations with available protein structure. While each resource is useful in isolation, we believe there is enormous potential in bringing all relevant data into a single knowledge graph, from gene-level to protein residue. Automation is vital for the processing and assessment of all available structures. We have developed scalable, automated workflows that provide hotspot-based druggability assessments for all available structures across large numbers of targets. Ultimately, we will run our method at a proteome scale, an ambition made more realistic by the arrival of AlphaFold 2. Bringing together annotations from the residue up to the gene level and building connections within the graph to represent pathways or protein-protein interactions will create complexity that mirrors the biological systems they represent. Such complexity is difficult for the human mind to utilise effectively, particularly at scale. We believe that graph-based AI methods will be able to expertly navigate such a knowledge graph, selecting the targets of the future.

Keywords: artificial intelligence; drug target identification; druggability; druggable genome; knowledge graph; tractability.

PubMed Disclaimer

Conflict of interest statement

Authors CR, FV, JM, ND, and AB were employed by the company Exscientia.

Figures

FIGURE 1
FIGURE 1
High-level schematic showing the overall structure-based assessment workflow.
FIGURE 2
FIGURE 2
(A) Clustered pockets across all AlphaFold 2 structures of the human kinome, with the AlphaFold 2 model of cyclin-dependent kinase 7 (UniProt ID P50613). The cyan points show the ATP binding site of each kinase, allowing us to rank the kinome purely based on ATP-pocket druggability. (B) The frequency distribution of ATP site scores is shown in yellow, with known druggable (blue) and non-druggable (green) targets for comparison.
FIGURE 3
FIGURE 3
(A) Tractability for PLpro (yellow) vs. known druggable (blue) and non-druggable (green) targets. (BE) Plots to show conservation metrics across coronavirus species HKU1, 229E, OC43, NL63, MERS, SARS-CoV-2 and SARS-CoV. In each plot, the hotspot tractability score for a given residue is shown as a black cross. (B) Number of different amino acids. (C) Number of amino acid properties. (D) Frequency of the most common amino acid observed. (E) Frequency with which an amino acid is differentially conserved between species for each predicted hotspot residue. A position is considered differentially conserved when it is conserved within a given species (JS-divergence score >0.8) and different from a corresponding conserved position in a different species. As we consider multiple species, this plot shows the average number of times a given position is labelled as differentially conserved.

Similar articles

Cited by

References

    1. Aggarwal R., Gupta A., Chelur V., Jawahar C. V., Priyakumar U. D. (2021). DeepPocket: Ligand binding site detection and segmentation using 3D convolutional neural networks. J. Chem. Inf. Model.. 10.1021/acs.jcim.1c00799. 10.1021/acs.jcim.1c00799 - DOI - PubMed
    1. Akdel M., Pires D. E. V., Pardo E. P., Jänes J., Zalevsky A. O., Mészáros B., et al. (2021). A structural biology community assessment of AlphaFold 2 applications. Biorxiv 2021, 461876. 10.1101/2021.09.26.461876 - DOI - PMC - PubMed
    1. Altschul S. F., Gish W., Miller W., Myers E. W., Lipman D. J. (1990). Basic local alignment search tool. J. Mol. Biol. 215, 403–410. 10.1016/s0022-2836(05)80360-2 - DOI - PubMed
    1. Alvarez-Garcia D., Barril X. (2014). Molecular simulations with solvent competition quantify water displaceability and provide accurate interaction maps of protein binding sites. J. Med. Chem. 57, 8530–8539. 10.1021/jm5010418 - DOI - PubMed
    1. Amaro R. E., Baudry J., Chodera J., Demir Ö., McCammon J. A., Miao Y., et al. (2018). Ensemble docking in drug discovery. Biophys. J. 114, 2271–2278. 10.1016/j.bpj.2018.02.038 - DOI - PMC - PubMed

LinkOut - more resources