Genomic analysis of membrane protein families: abundance and conserved motifs
- PMID: 12372142
- PMCID: PMC134483
- DOI: 10.1186/gb-2002-3-10-research0054
Genomic analysis of membrane protein families: abundance and conserved motifs
Abstract
Background: Polytopic membrane proteins can be related to each other on the basis of the number of transmembrane helices and sequence similarities. Building on the Pfam classification of protein domain families, and using transmembrane-helix prediction and sequence-similarity searching, we identified a total of 526 well-characterized membrane protein families in 26 recently sequenced genomes. To this we added a clustering of a number of predicted but unclassified membrane proteins, resulting in a total of 637 membrane protein families.
Results: Analysis of the occurrence and composition of these families revealed several interesting trends. The number of assigned membrane protein domains has an approximately linear relationship to the total number of open reading frames (ORFs) in 26 genomes studied. Caenorhabditis elegans is an apparent outlier, because of its high representation of seven-span transmembrane (7-TM) chemoreceptor families. In all genomes, including that of C. elegans, the number of distinct membrane protein families has a logarithmic relation to the number of ORFs. Glycine, proline, and tyrosine locations tend to be conserved in transmembrane regions within families, whereas isoleucine, valine, and methionine locations are relatively mutable. Analysis of motifs in putative transmembrane helices reveals that GxxxG and GxxxxxxG (which can be written GG4 and GG7, respectively; see Materials and methods) are among the most prevalent. This was noted in earlier studies; we now find these motifs are particularly well conserved in families, however, especially those corresponding to transporters, symporters, and channels.
Conclusions: We carried out a genome-wide analysis on patterns of the classified polytopic membrane protein families and analyzed the distribution of conserved amino acids and motifs in the transmembrane helix regions in these families.
Figures





Similar articles
-
Analysis of protein domain families in Caenorhabditis elegans.Genomics. 1997 Dec 1;46(2):200-16. doi: 10.1006/geno.1997.4989. Genomics. 1997. PMID: 9417907
-
Proportion of membrane proteins in proteomes of 15 single-cell organisms analyzed by the SOSUI prediction system.Biophys Chem. 1999 Dec 13;82(2-3):165-71. doi: 10.1016/s0301-4622(99)00116-7. Biophys Chem. 1999. PMID: 10631799
-
Statistical analysis of amino acid patterns in transmembrane helices: the GxxxG motif occurs frequently and in association with beta-branched residues at neighboring positions.J Mol Biol. 2000 Feb 25;296(3):921-36. doi: 10.1006/jmbi.1999.3488. J Mol Biol. 2000. PMID: 10677292
-
Classification of all putative permeases and other membrane plurispanners of the major facilitator superfamily encoded by the complete genome of Saccharomyces cerevisiae.FEMS Microbiol Rev. 1997 Sep;21(2):113-34. doi: 10.1111/j.1574-6976.1997.tb00347.x. FEMS Microbiol Rev. 1997. PMID: 9348664 Review.
-
Comparing genomes in terms of protein structure: surveys of a finite parts list.FEMS Microbiol Rev. 1998 Oct;22(4):277-304. doi: 10.1111/j.1574-6976.1998.tb00371.x. FEMS Microbiol Rev. 1998. PMID: 10357579 Review.
Cited by
-
Non-detergent isolation of a cyanobacterial photosystem I using styrene maleic acid alternating copolymers.RSC Adv. 2019 Oct 7;9(54):31781-31796. doi: 10.1039/c9ra04619d. eCollection 2019 Oct 1. RSC Adv. 2019. PMID: 35527920 Free PMC article.
-
Prediction of the burial status of transmembrane residues of helical membrane proteins.BMC Bioinformatics. 2007 Aug 20;8:302. doi: 10.1186/1471-2105-8-302. BMC Bioinformatics. 2007. PMID: 17708758 Free PMC article.
-
Physical Mapping of Peroxidase Genes and Development of Functional Markers for TaPod-D1 on Bread Wheat Chromosome 7D.Front Plant Sci. 2019 Apr 24;10:523. doi: 10.3389/fpls.2019.00523. eCollection 2019. Front Plant Sci. 2019. PMID: 31068962 Free PMC article.
-
Graph representation of high-dimensional alpha-helical membrane protein data.BioData Min. 2013 Dec 2;6(1):21. doi: 10.1186/1756-0381-6-21. BioData Min. 2013. PMID: 24294896 Free PMC article.
-
Evolutionary Influenced Interaction Pattern as Indicator for the Investigation of Natural Variants Causing Nephrogenic Diabetes Insipidus.Comput Math Methods Med. 2015;2015:641393. doi: 10.1155/2015/641393. Epub 2015 May 28. Comput Math Methods Med. 2015. PMID: 26180540 Free PMC article.
References
-
- Paulsen IT, Sliwinski MK, Saier MHJ. Microbial genome analyses: global comparisons of transport capabilities based on phylogenies, bioenergetics and substrate specificities. J Mol Biol. 1998;277:573–592. - PubMed
-
- Paulsen IT, Nguyen L, Sliwinski MK, Rabus R, Saier MHJ. Microbial genome analyses: comparative transport capabilities in eighteen prokaryotes. J Mol Biol. 2000;301:75–100. - PubMed
-
- Gerstein M. A structural census of genomes: comparing bacterial, eukaryotic, and archaeal genomes in terms of protein structure. J Mol Biol. 1997;274:562–576. - PubMed
-
- Gerstein M. Patterns of protein-fold usage in eight microbial genomes: a comprehensive structural census. Proteins. 1998;33:518–534. - PubMed
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Molecular Biology Databases