Unexplored therapeutic opportunities in the human genome

Tudor I Oprea^{1

2

3

4}, Cristian G Bologa¹, Søren Brunak⁴, Allen Campbell⁵, Gregory N Gan², Anna Gaulton⁶, Shawn M Gomez^{7

8}, Rajarshi Guha⁹, Anne Hersey⁶, Jayme Holmes¹, Ajit Jadhav⁹, Lars Juhl Jensen⁴, Gary L Johnson⁸, Anneli Karlson^{6

10}, Andrew R Leach⁶, Avi Ma'ayan¹¹, Anna Malovannaya¹², Subramani Mani¹, Stephen L Mathias¹, Michael T McManus¹³, Terrence F Meehan⁶, Christian von Mering¹⁴, Daniel Muthas¹⁵, Dac-Trung Nguyen⁹, John P Overington^{6

16}, George Papadatos^{6

17}, Jun Qin¹², Christian Reich¹⁸, Bryan L Roth⁸, Stephan C Schürer¹⁹, Anton Simeonov⁹, Larry A Sklar^{2

20

21}, Noel Southall⁹, Susumu Tomita²², Ilinca Tudose^{6

23}, Oleg Ursu¹, Dušica Vidovic¹⁹, Anna Waller²⁰, David Westergaard⁴, Jeremy J Yang¹, Gergely Zahoránszky-Köhalmi^{1

24}

Affiliations

¹ Department of Internal Medicine, University of New Mexico School of Medicine, Albuquerque, NM, USA.
² UNM Comprehensive Cancer Center, Albuquerque, NM, USA.
³ Department of Rheumatology and Inflammation Research, Institute of Medicine, Sahlgrenska Academy at University of Gothenburg, Gothenburg, Sweden.
⁴ Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark.
⁵ IQVIA, Plymouth Meeting, PA, USA.
⁶ European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, UK.
⁷ Joint Department of Biomedical Engineering, University of North Carolina at Chapel Hill and North Carolina State University, Chapel Hill, NC, USA.
⁸ Department of Pharmacology, University of North Carolina School of Medicine, Chapel Hill, NC, USA.
⁹ National Center for Advancing Translational Sciences (NCATS), National Institutes of Health (NIH), Rockville, MD, USA.
¹⁰ SciBite Limited, BioData Innovation Centre, Wellcome Genome Campus, Hinxton, Cambridge, UK.
¹¹ Icahn School of Medicine at Mount Sinai, New York, NY, USA.
¹² Baylor College of Medicine, Houston, TX, USA.
¹³ University of California, San Francisco, CA, USA.
¹⁴ Institute of Molecular Life Sciences, University of Zurich, Zurich, Switzerland.
¹⁵ Respiratory, Inflammation and Autoimmunity Diseases, Innovative Medicines and Early Development Biotech Unit, AstraZeneca R&D Gothenburg, Mölndal, Sweden.
¹⁶ Medicines Discovery Catapult, Alderley Edge, UK.
¹⁷ GlaxoSmithKline, Stevenage, UK.
¹⁸ IQVIA, Cambridge, MA, USA.
¹⁹ Department of Molecular and Cellular Pharmacology, Miller School of Medicine, University of Miami, Miami, FL, USA.
²⁰ Center for Molecular Discovery, University of New Mexico Cancer Center, University of New Mexico, Albuquerque, NM, USA.
²¹ Department of Pathology, University of New Mexico, Albuquerque, NM, USA.
²² Yale School of Medicine, Yale University, New Haven, CT, USA.
²³ Google Germany GmbH, München, Germany.
²⁴ NIH-NCATS, Rockville, MD, USA.

PMID: 29472638
PMCID: PMC6339563
DOI: 10.1038/nrd.2018.14

Review

Unexplored therapeutic opportunities in the human genome

Tudor I Oprea et al. Nat Rev Drug Discov. 2018 May.

. 2018 May;17(5):317-332.

doi: 10.1038/nrd.2018.14. Epub 2018 Mar 23.

Authors

Affiliations

¹ Department of Internal Medicine, University of New Mexico School of Medicine, Albuquerque, NM, USA.
² UNM Comprehensive Cancer Center, Albuquerque, NM, USA.
³ Department of Rheumatology and Inflammation Research, Institute of Medicine, Sahlgrenska Academy at University of Gothenburg, Gothenburg, Sweden.
⁴ Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark.
⁵ IQVIA, Plymouth Meeting, PA, USA.
⁶ European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, UK.
⁷ Joint Department of Biomedical Engineering, University of North Carolina at Chapel Hill and North Carolina State University, Chapel Hill, NC, USA.
⁸ Department of Pharmacology, University of North Carolina School of Medicine, Chapel Hill, NC, USA.
⁹ National Center for Advancing Translational Sciences (NCATS), National Institutes of Health (NIH), Rockville, MD, USA.
¹⁰ SciBite Limited, BioData Innovation Centre, Wellcome Genome Campus, Hinxton, Cambridge, UK.
¹¹ Icahn School of Medicine at Mount Sinai, New York, NY, USA.
¹² Baylor College of Medicine, Houston, TX, USA.
¹³ University of California, San Francisco, CA, USA.
¹⁴ Institute of Molecular Life Sciences, University of Zurich, Zurich, Switzerland.
¹⁵ Respiratory, Inflammation and Autoimmunity Diseases, Innovative Medicines and Early Development Biotech Unit, AstraZeneca R&D Gothenburg, Mölndal, Sweden.
¹⁶ Medicines Discovery Catapult, Alderley Edge, UK.
¹⁷ GlaxoSmithKline, Stevenage, UK.
¹⁸ IQVIA, Cambridge, MA, USA.
¹⁹ Department of Molecular and Cellular Pharmacology, Miller School of Medicine, University of Miami, Miami, FL, USA.
²⁰ Center for Molecular Discovery, University of New Mexico Cancer Center, University of New Mexico, Albuquerque, NM, USA.
²¹ Department of Pathology, University of New Mexico, Albuquerque, NM, USA.
²² Yale School of Medicine, Yale University, New Haven, CT, USA.
²³ Google Germany GmbH, München, Germany.
²⁴ NIH-NCATS, Rockville, MD, USA.

PMID: 29472638
PMCID: PMC6339563
DOI: 10.1038/nrd.2018.14

Erratum in

Unexplored therapeutic opportunities in the human genome.
Oprea TI, Bologa CG, Brunak S, Campbell A, Gan GN, Gaulton A, Gomez SM, Guha R, Hersey A, Holmes J, Jadhav A, Jensen LJ, Johnson GL, Karlson A, Leach AR, Ma'ayan A, Malovannaya A, Mani S, Mathias SL, McManus MT, Meehan TF, von Mering C, Muthas D, Nguyen DT, Overington JP, Papadatos G, Qin J, Reich C, Roth BL, Schürer SC, Simeonov A, Sklar LA, Southall N, Tomita S, Tudose I, Ursu O, Vidovic D, Waller A, Westergaard D, Yang JJ, Zahoránszky-Köhalmi G. Oprea TI, et al. Nat Rev Drug Discov. 2018 May;17(5):377. doi: 10.1038/nrd.2018.52. Epub 2018 Mar 23. Nat Rev Drug Discov. 2018. PMID: 29567993

Abstract

A large proportion of biomedical research and the development of therapeutics is focused on a small fraction of the human genome. In a strategic effort to map the knowledge gaps around proteins encoded by the human genome and to promote the exploration of currently understudied, but potentially druggable, proteins, the US National Institutes of Health launched the Illuminating the Druggable Genome (IDG) initiative in 2014. In this article, we discuss how the systematic collection and processing of a wide array of genomic, proteomic, chemical and disease-related resource data by the IDG Knowledge Management Center have enabled the development of evidence-based criteria for tracking the target development level (TDL) of human proteins, which indicates a substantial knowledge deficit for approximately one out of three proteins in the human proteome. We then present spotlights on the TDL categories as well as key drug target classes, including G protein-coupled receptors, protein kinases and ion channels, which illustrate the nature of the unexplored opportunities for biomedical research and therapeutic development.

PubMed Disclaimer

Conflict of interest statement

Competing interests

The authors declare competing interests: see Web version for details.

Figures

**Figure 1 |. Target development level categories applied to the human proteome.**
a | Percentages of the whole proteome are shown in the inner ring. Percentages of each target development level (TDL) category for selected major protein families are shown in the outer ring, with the T_clin category expanded. Inner ring colours are as follows: T_dark, black; T_bio, red; T_chem, green; and T_clin, blue. b | TDL distribution across protein families, coloured by TDL category. Data show 3,644 proteins that have a confirmed disease association according to the Online Mendelian Inheritance in Man (OMIM) database. The enzyme category excludes kinases, which are considered separately. GPCR, G protein-coupled receptor.

**Figure 2 |. Patterns of target development level distribution across different data: visualizing the knowledge deficit.**
a | The three criteria used in establishing the target development level are to the left, and their independent validation by four other data types are to the right. For PubMed abstracts, Gene Reference Into Function (RIF) annotations, antibodies, Gene Ontology, R01 grants and patents, the score for each target is the count of those entities associated with the target, normalized between 0 and 1. The values for the Harmonizome data availability score were computed differently, as described in the main text. See FIG. 1 for colour codes and Supplementary Table S4 for further details. b | Patterns of scientific curiosity: STRING database access counts by target development level (January–December 2016).

See this image and copyright information in PMC

References

1. Knowles J & Gromo G Target selection in drug discovery. Nat. Rev. Drug Discov. 2, 63–69 (2003). - PubMed
1. Edwards AM et al. Too many roads not taken.Nature 470, 163–165 (2011). - PubMed
1. Alberts B, Kirschner MW, Tilghman S &Varmus H Rescuing US biomedical research from its systemic flaws. Proc. Natl Acad. Sci. USA 111, 5773–5777 (2014). - PMC - PubMed
1. Kim S et al. PubChem Substance and Compound databases. Nucleic Acids Res. 44, D1202–D1213 (2016). - PMC - PubMed
1. Gaulton A et al. The ChEMBL database in 2017.Nucleic Acids Res. 45, D945–D954 (2017). - PMC - PubMed

Publication types

Actions
Actions
Actions
Actions

Grants and funding

LinkOut - more resources

Full Text Sources
Other Literature Sources
- The Lens - Patent Citations Database
- scite Smart Citations

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Unexplored therapeutic opportunities in the human genome

Affiliations

Unexplored therapeutic opportunities in the human genome

Authors

Affiliations

Erratum in

Abstract

Conflict of interest statement

Figures

References

Publication types

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources