Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Jan 6;51(D1):D1405-D1416.
doi: 10.1093/nar/gkac1033.

Pharos 2023: an integrated resource for the understudied human proteome

Affiliations

Pharos 2023: an integrated resource for the understudied human proteome

Keith J Kelleher et al. Nucleic Acids Res. .

Erratum in

Abstract

The Illuminating the Druggable Genome (IDG) project aims to improve our understanding of understudied proteins and our ability to study them in the context of disease biology by perturbing them with small molecules, biologics, or other therapeutic modalities. Two main products from the IDG effort are the Target Central Resource Database (TCRD) (http://juniper.health.unm.edu/tcrd/), which curates and aggregates information, and Pharos (https://pharos.nih.gov/), a web interface for fusers to extract and visualize data from TCRD. Since the 2021 release, TCRD/Pharos has focused on developing visualization and analysis tools that help reveal higher-level patterns in the underlying data. The current iterations of TCRD and Pharos enable users to perform enrichment calculations based on subsets of targets, diseases, or ligands and to create interactive heat maps and UpSet charts of many types of annotations. Using several examples, we show how to address disease biology and drug discovery questions through enrichment calculations and UpSet charts.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
TDL changes over time • Chart of the TDL changes between an early version (TCRD v3.0), a version at the time of the last update publication (TCRD v6.7), and the current version (TCRD 6.13). The decrease of Tdark and subsequent increase of other development levels shows an overall increase in target illumination.
Figure 2
Figure 2
Target details updates. (A) Expression data from five sources is color coded according to expression type (red: aggregate score, blue: RNA expression, yellow: protein expression) and displayed as a heat map on the left panel. Cells in the heat map display details about the expression data for each tissue when clicked. The right panel contains tabs to show either a circular treemap, where tissues are grouped according to the UBERON hierarchy, and an anatomogram, where tissues on a human form are shaded according to their expression level. The circular treemap is interactive and can be used to filter the heat map. (B) Text-mined target-disease associations from TIN-X are shown in an interactive scatter plot of importance vs novelty adjacent to a circular treemap. The circular treemap groups the associations based on the hierarchy defined by the Mondo Disease Ontology. Selecting a circle in the right panel highlights corresponding points on the scatter plot for those diseases. This dynamic visualization helps users find classes of diseases which tend to be both high in the importance and novelty metrics. (C) GWAS traits associated with a single target, scored and ranked according to the TIGA data processing pipeline. More reliable associations tend to have a higher Mean Rank Score and a higher Beta Count.
Figure 3.
Figure 3.
Structure search. The component contains a Marvin JS widget showing an editable query structure. Structures can also be edited via the Query SMILES input or resolved using an external tool (not pictured). This serves as a starting point for performing a similarity, or substructure, search of compounds in TCRD, or a search for predicted targets, using the QSAR models in NCATS Predictor.
Figure 4.
Figure 4.
Enrichment analysis. (A) Left shows the number of targets associated with different diseases in a full target list. Right shows the number of targets associated with different diseases in a target list consisting of targets with a documented protein-protein interaction with DRD2. Note how there is a lot of overlap between the entries in this list, when the list is sorted by the naive counts. (B) Enrichment score results after performing Fisher's Exact Test on the filtered list. Note how the top entries in the list comprises a different set of diseases, including a lot of neurological disorders, and substance dependence disorders, diseases which are more commonly known to be related to DRD2.
Figure 5.
Figure 5.
UpSet chart examples. (A) For a list of ligands, this plot shows different combinations of targets against which each ligand has been shown to be active. (B) For a target list, this plot shows how many targets in the list were on each combination of NIH lists. Gold highlighting shows filters that are currently applied to the list, so this list would consist of 12 targets that were on the IDG list from 2017 to 2020, and removed in 2022. (C) In this case, the corresponding target list would show 385 targets that have data from ProKinO, but not from Dark Kinase Knowledgebase. (D) In this case, the corresponding target list would show 21 targets annotated by PANTHER to be a RNA binding protein, but not a DNA binding protein.

References

    1. Edwards A.M., Isserlin R., Bader G.D., Frye S.V., Willson T.M., Yu F.H.. Too many roads not taken. Nature. 2011; 470:163–165. - PubMed
    1. Stoeger T., Gerlach M., Morimoto R.I., Nunes Amaral L.A.. Large-scale investigation of the reasons why potentially important genes are ignored. PLoS Biol. 2018; 16:e2006643. - PMC - PubMed
    1. Nguyen D.-T., Mathias S., Bologa C., Brunak S., Fernandez N., Gaulton A., Hersey A., Holmes J., Jensen L.J., Karlsson A.et al. .. Pharos: collating protein information to shed light on the druggable genome. Nucleic Acids Res. 2017; 45:D995–D1002. - PMC - PubMed
    1. Sheils T.K., Mathias S.L., Kelleher K.J., Siramshetty V.B., Nguyen D.-T., Bologa C.G., Jensen L.J., Vidović D., Koleti A., Schürer S.C.et al. .. TCRD and pharos 2021: mining the human proteome for disease biology. Nucleic Acids Res. 2021; 49:D1334–D1346. - PMC - PubMed
    1. Santos R., Ursu O., Gaulton A., Bento A.P., Donadi R.S., Bologa C.G., Karlsson A., Al-Lazikani B., Hersey A., Oprea T.I.et al. .. A comprehensive map of molecular drug targets. Nat. Rev. Drug Discov. 2017; 16:19–34. - PMC - PubMed

Publication types