Front Cell Dev Biol. 2021 Jul 19:9:708754.
doi: 10.3389/fcell.2021.708754. eCollection 2021.

Characterization of Five Transmembrane Proteins: With Focus on the Tweety, Sideroflexin, and YIP1 Domain Families

Misty M Attwood et al. Front Cell Dev Biol.

Abstract

Transmembrane proteins are involved in many essential cell processes such as signal transduction, transport, and protein trafficking, and hence many are implicated in different disease pathways. Further, as the structure and function of proteins are correlated, investigating a group of proteins with the same tertiary structure, i.e., the same number of transmembrane regions, may provide insight into their functional roles and potential as therapeutic targets. This analysis investigates the previously unstudied group of proteins with five transmembrane-spanning regions (5TM). More than half of the 58 proteins identified with the 5TM architecture belong to 12 families with two or more members. Interestingly, more than half the proteins in the dataset function in localization activities through movement or tethering of cell components, and more than one-third are involved in transport activities, particularly in the mitochondria. Surprisingly, no receptor activity was identified within this dataset, in sharp contrast to other TM groups. The three major 5TM families, which comprise nearly 30% of the dataset, are the tweety family, the sideroflexin family, and the Yip1 domain (YIPF) family. We also analyzed the evolutionary origin of these three families. The YIPF family appears to be the most ancient, being present in bacteria and archaea, while the tweety and sideroflexin families are first found in eukaryotes. We found no evidence of common descent for these three families. About 30% of the 5TM proteins have prominent expression in the brain, liver, or testis. Importantly, 60% of these proteins are identified as cancer prognostic markers, where they are associated with clinical outcomes of various tumor types. Nearly 10% of the 5TMs are still not fully characterized, and further investigation of their functional activities and expression is warranted. This study provides the first comprehensive analysis of proteins with the 5TM architecture, providing details of their unique characteristics.

Keywords: YIPF family; cancer prognostic marker; protein trafficking; sideroflexin family; transmembrane protein; tweety family.

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

FIGURE 1
The five transmembrane architecture. (A) The basic topology of the 5TM dataset. More than half the proteins in the dataset have the amino (N)-terminal region in the cytoplasmic environment and the carboxyl (C)-terminal region in the luminal environment. Many of the proteins are expected to contain targeting signals embedded in the first transmembrane region, possibly along with amino acid residues in the N-terminus. (B) The domain structures and important residue modifications affecting localization of the three major 5TM families. The description of the tweety family includes estimates of four possible glycosylation sites in purple; the important pore-forming amino acid (R165) in TTYH1, indicated in yellow (Han et al., 2019); and the Pfam tweety domain (PF04906) in light orange. The sideroflexin family is annotated with a possible acetylation site at residue one or two, colored orange; the conserved HPDT residues, marked by the red symbol; and the sideroflexin Pfam domain (PF03820) in light orange. Many of the YIPF proteins have an acetylation site at residue one or two that is colored orange; three conserved motifs are indicated in red; and the YIPF Pfam domains (PF03878 and PF04893) are shown in light orange.
FIGURE 2
The major cellular localizations of the 5TM proteins. The number of proteins identified for each locale is given in parentheses, and compartments that are overrepresented in comparison to the human transmembrane proteome are indicated in italics. Proteins that localize to the nuclear outer membrane-endoplasmic reticulum network, the inner and outer membrane of the mitochondria, the Golgi trans cisterna, vacuoles, the plasma membrane, and COPII-coated ER to Golgi vesicles are overrepresented. Data for this figure are solely from the PANTHER Classification System, and the overrepresentation analysis is from the PANTHER Overrepresentation Test (v14.1) (Mi et al., 2019) with the Gene Ontology (GO) Annotation database released on 2019-07-03. Fisher’s Exact test was performed and the False Discovery Rate was calculated with p < 0.05. The human transmembrane protein identities are from Attwood et al. (2017). Using GO annotation, 5,725 of 5,779 proteins in the human transmembrane proteome and 55 of 58 proteins in the 5TM dataset were successfully mapped.
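The overrepresentation test named in the Figure 2 caption can be illustrated with a minimal sketch. It assumes a one-sided Fisher's exact test in which the 55 mapped 5TM proteins are treated as a sample from the 5,725 mapped reference proteins, followed by a Benjamini-Hochberg correction. The compartment names and annotation counts below are hypothetical placeholders, not values from the study, and PANTHER's own implementation may differ in detail.

```python
from math import comb


def fisher_overrep_p(k, n, K, N):
    """One-sided Fisher's exact p-value, P(X >= k), under the hypergeometric model.

    N: mapped proteins in the reference set
    K: reference proteins annotated with the term
    n: mapped proteins in the test set
    k: test-set proteins annotated with the term
    """
    total = comb(N, n)
    upper = min(n, K)
    tail = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, upper + 1))
    return tail / total


def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted p-values in the input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value to the smallest, enforcing monotonicity.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


# Mapped counts taken from the caption.
N_REF, N_5TM = 5725, 55

# Hypothetical counts for illustration only: term -> (hits in 5TM set, hits in reference).
counts = {
    "compartment_A": (12, 300),
    "compartment_B": (3, 310),
    "compartment_C": (9, 150),
}

pvals = [fisher_overrep_p(k, N_5TM, K, N_REF) for k, K in counts.values()]
qvals = benjamini_hochberg(pvals)
for name, p, q in zip(counts, pvals, qvals):
    flag = "overrepresented" if q < 0.05 else "not significant"
    print(f"{name}: p={p:.3g}, FDR={q:.3g} ({flag})")
```

The p-value is computed with exact integer arithmetic through `math.comb`, so no statistics library is needed. A compartment would be reported as overrepresented only when its adjusted value falls below the 0.05 threshold.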
FIGURE 3
Phylogenetic analysis of the sideroflexin family. The phylogenetic reconstruction is the result of Bayesian inference posterior probabilities and bootstrapping analysis with the best-scoring maximum likelihood tree using RAxML (v8.2.10) (Minjarez et al., 2016) on 30 taxa with 67 sequences. Support values are given in percent at the nodes of the major clades differentiating the sideroflexin gene families. The protein sequences were aligned using MAFFT (v6) (Fang et al., 2018) with the E-INS-i algorithm and JTT substitution model. MrBayes was used with the amino acid mixed model, run for 1,000,000 generations. The PROTGAMMAAUTO model in RAxML was used with 500 bootstrap replicates.
FIGURE 4
(A) Enhanced tissue expression of the 5TM dataset. Enhanced or enriched expression of proteins in the 5TM dataset is shown, with the tissue types on the bottom part of the figure and the associated proteins on the top part. Data are from The Tissue Atlas (Amorim et al., 2017). More than 35% of the proteins have enhanced or enriched expression in the cerebral cortex, liver, testis, and blood tissues. The category Varied tissues includes intestine, breast, thyroid, parathyroid, gall bladder, prostate, and pancreas tissues. The category All Other 5TM includes thirty proteins in the dataset that have low tissue specificity. (B) 5TM proteins as prognostic markers for cancer. The nine different tumor types are on the bottom part of the figure, while the 35 prognostic proteins associated with them are on the top part. Approximately 60% of the genes in the dataset are identified in the Pathology Atlas as candidate prognostic genes that are associated with the clinical outcome of different tumor types. The genes are identified from correlation analyses of gene expression and clinical outcome, where Kaplan-Meier plots with high significance (p < 0.001) were considered prognostic (Pujar et al., 2018). Of the 35 proteins identified, 21 are associated with several different types of cancers. Gynecologic cancer includes cervical, endometrial, ovarian, and urothelial cancers. Proteins not identified as prognostic are not included in the figure.

References

    1. Almén M. S., Nordström K. J., Fredriksson R., Schiöth H. B. (2009). Mapping the human membrane proteome: a majority of the human membrane proteins can be classified according to function and evolutionary origin. BMC Biol. 7:50. doi: 10.1186/1741-7007-7-50
    2. Attwood M. M., Krishnan A., Almén M. S., Schiöth H. B. (2017). Highly diversified expansions shaped the evolution of membrane bound proteins in metazoans. Sci. Rep. 7:12387.
    3. Fagerberg L., Jonasson K., von Heijne G., Uhlén M., Berglund L. (2010). Prediction of the human membrane proteome. Proteomics 10 1141–1149. doi: 10.1002/pmic.200900258
    4. Gabaldón T., Pittis A. A. (2015). Origin and evolution of metabolic sub-cellular compartmentalization in eukaryotes. Biochimie 119 262–268. doi: 10.1016/j.biochi.2015.03.021
    5. Lagerström M. C., Schiöth H. B. (2008). Structural diversity of G protein-coupled receptors and significance for drug discovery. Nat. Rev. Drug. Discov. 7 339–357. doi: 10.1038/nrd2518