Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2018 May;17(5):317-332.
doi: 10.1038/nrd.2018.14. Epub 2018 Mar 23.

Unexplored therapeutic opportunities in the human genome

Affiliations
Review

Unexplored therapeutic opportunities in the human genome

Tudor I Oprea et al. Nat Rev Drug Discov. 2018 May.

Erratum in

  • Unexplored therapeutic opportunities in the human genome.
    Oprea TI, Bologa CG, Brunak S, Campbell A, Gan GN, Gaulton A, Gomez SM, Guha R, Hersey A, Holmes J, Jadhav A, Jensen LJ, Johnson GL, Karlson A, Leach AR, Ma'ayan A, Malovannaya A, Mani S, Mathias SL, McManus MT, Meehan TF, von Mering C, Muthas D, Nguyen DT, Overington JP, Papadatos G, Qin J, Reich C, Roth BL, Schürer SC, Simeonov A, Sklar LA, Southall N, Tomita S, Tudose I, Ursu O, Vidovic D, Waller A, Westergaard D, Yang JJ, Zahoránszky-Köhalmi G. Oprea TI, et al. Nat Rev Drug Discov. 2018 May;17(5):377. doi: 10.1038/nrd.2018.52. Epub 2018 Mar 23. Nat Rev Drug Discov. 2018. PMID: 29567993

Abstract

A large proportion of biomedical research and the development of therapeutics is focused on a small fraction of the human genome. In a strategic effort to map the knowledge gaps around proteins encoded by the human genome and to promote the exploration of currently understudied, but potentially druggable, proteins, the US National Institutes of Health launched the Illuminating the Druggable Genome (IDG) initiative in 2014. In this article, we discuss how the systematic collection and processing of a wide array of genomic, proteomic, chemical and disease-related resource data by the IDG Knowledge Management Center have enabled the development of evidence-based criteria for tracking the target development level (TDL) of human proteins, which indicates a substantial knowledge deficit for approximately one out of three proteins in the human proteome. We then present spotlights on the TDL categories as well as key drug target classes, including G protein-coupled receptors, protein kinases and ion channels, which illustrate the nature of the unexplored opportunities for biomedical research and therapeutic development.

PubMed Disclaimer

Conflict of interest statement

Competing interests

The authors declare competing interests: see Web version for details.

Figures

Figure 1 |
Figure 1 |. Target development level categories applied to the human proteome.
a | Percentages of the whole proteome are shown in the inner ring. Percentages of each target development level (TDL) category for selected major protein families are shown in the outer ring, with the Tclin category expanded. Inner ring colours are as follows: Tdark, black; Tbio, red; Tchem, green; and Tclin, blue. b | TDL distribution across protein families, coloured by TDL category. Data show 3,644 proteins that have a confirmed disease association according to the Online Mendelian Inheritance in Man (OMIM) database. The enzyme category excludes kinases, which are considered separately. GPCR, G protein-coupled receptor.
Figure 2 |
Figure 2 |. Patterns of target development level distribution across different data: visualizing the knowledge deficit.
a | The three criteria used in establishing the target development level are to the left, and their independent validation by four other data types are to the right. For PubMed abstracts, Gene Reference Into Function (RIF) annotations, antibodies, Gene Ontology, R01 grants and patents, the score for each target is the count of those entities associated with the target, normalized between 0 and 1. The values for the Harmonizome data availability score were computed differently, as described in the main text. See FIG. 1 for colour codes and Supplementary Table S4 for further details. b | Patterns of scientific curiosity: STRING database access counts by target development level (January–December 2016).

References

    1. Knowles J & Gromo G Target selection in drug discovery. Nat. Rev. Drug Discov. 2, 63–69 (2003). - PubMed
    1. Edwards AM et al. Too many roads not taken.Nature 470, 163–165 (2011). - PubMed
    1. Alberts B, Kirschner MW, Tilghman S &Varmus H Rescuing US biomedical research from its systemic flaws. Proc. Natl Acad. Sci. USA 111, 5773–5777 (2014). - PMC - PubMed
    1. Kim S et al. PubChem Substance and Compound databases. Nucleic Acids Res. 44, D1202–D1213 (2016). - PMC - PubMed
    1. Gaulton A et al. The ChEMBL database in 2017.Nucleic Acids Res. 45, D945–D954 (2017). - PMC - PubMed

Publication types