A substrate-based ontology for human solute carriers
- PMID: 32697042
- PMCID: PMC7374931
- DOI: 10.15252/msb.20209652
A substrate-based ontology for human solute carriers
Abstract
Solute carriers (SLCs) are the largest family of transmembrane transporters in the human genome with more than 400 members. Despite the fact that SLCs mediate critical biological functions and several are important pharmacological targets, a large proportion of them is poorly characterized and present no assigned substrate. A major limitation to systems-level de-orphanization campaigns is the absence of a structured, language-controlled chemical annotation. Here we describe a thorough manual annotation of SLCs based on literature. The annotation of substrates, transport mechanism, coupled ions, and subcellular localization for 446 human SLCs confirmed that ~30% of these were still functionally orphan and lacked known substrates. Application of a substrate-based ontology to transcriptomic datasets identified SLC-specific responses to external perturbations, while a machine-learning approach based on the annotation allowed us to identify potential substrates for several orphan SLCs. The annotation is available at https://opendata.cemm.at/gsflab/slcontology/. Given the increasing availability of large biological datasets and the growing interest in transporters, we expect that the effort presented here will be critical to provide novel insights into the functions of SLCs.
Keywords: SLCs; annotation; de-orphanization; ontology; solute carriers.
© 2020 The Authors. Published under the terms of the CC BY 4.0 license.
Conflict of interest statement
The authors declare that they have no conflict of interest.
Figures
- A
Frequencies of unknown annotations for 446 SLCs in four annotation categories.
- B–E
Distribution of annotated terms for substrate class, coupled ions, transport mechanism, and subcellular localization. Terms annotated to less than ten SLCs were summarized as “Other” in (C) and (E).
Individual steps in ontology creation workflow.
Distribution of the number of SLCs per ontology term. Red line indicates the median value.
Distribution of the number of ontology terms associated with one SLC. Red line indicates the median value.
Exemplified visualization of term “L‐alpha amino acid (aa)” and its sub‐terms. This is a sub‐graph and SLC substrates (gray) are connected to more terms in the full ontology. Please refer to Fig EV1 for an extended example of the term “amino acid” and its sub‐terms.
Visualization of the resulting SLC‐specific ontology: role sub‐ontology (left) and chemical entity sub‐ontology (right).
- A, B
Number of (A) differentially expressed genes and of (B) up‐ and downregulated SLC genes for different amino acid depletion conditions in HEK293T cells.
- C, D
Number of (C) differentially expressed genes and of (D) up‐ and downregulated SLC genes for different amino acid depletion conditions in MCF7 cells.
- A, B
Ontology term enrichment analysis in set of SLCs upregulated in (A) HEK293T and (B) in MCF7 cells after amino acid (aa) deprivation conditions using SLC ontology terms and GO terms (TMT: transmembrane transporter). Enrichments were calculated using Fisher's exact test. For simplification, only enrichment of the most specific terms is shown in (A,B), for complete version see Fig EV3.
- C
Area under the receiver operating characteristic (AUROC) and under the precision recall curve (AUPRC) derived from out‐of‐bag (OOB) error estimates for random forest classifiers for the 18 selected SLC substrate terms.
- D
Statistical performance measures for the binary classifiers from OOB estimates.
- E
Predicted substrate term probabilities for orphan SLCs, normalized to a decision threshold of 0.5.
- A, B
SLC ontology term enrichment analysis in upregulated SLCs (all terms) in (A) HEK293T cells and (B) in MCF7 cells. Enrichments were calculated using Fisher's exact test.
References
-
- Cavuoto P, Fenech MF (2012) A review of methionine dependency and the role of methionine restriction in cancer growth control and life‐span extension. Cancer Treat Rev 38: 726–736 - PubMed
-
- César‐Razquin A, Snijder B, Frappier‐Brinton T, Isserlin R, Gyimesi G, Bai X, Reithmeier RA, Hepworth D, Hediger MA, Edwards AM et al (2015) A call for systematic research on solute carriers. Cell 162: 478–487 - PubMed
Publication types
MeSH terms
Substances
Associated data
- Actions
LinkOut - more resources
Full Text Sources
Molecular Biology Databases
