An improved hidden Markov model for transmembrane protein detection and topology prediction and its applications to complete genomes
- PMID: 15691854
- DOI: 10.1093/bioinformatics/bti303
An improved hidden Markov model for transmembrane protein detection and topology prediction and its applications to complete genomes
Abstract
Motivation: Knowledge of the transmembrane helical topology can help identify binding sites and infer functions for membrane proteins. However, because membrane proteins are hard to solubilize and purify, only a very small amount of membrane proteins have structure and topology experimentally determined. This has motivated various computational methods for predicting the topology of membrane proteins.
Results: We present an improved hidden Markov model, TMMOD, for the identification and topology prediction of transmembrane proteins. Our model uses TMHMM as a prototype, but differs from TMHMM by the architecture of the submodels for loops on both sides of the membrane and also by the model training procedure. In cross-validation experiments using a set of 83 transmembrane proteins with known topology, TMMOD outperformed TMHMM and other existing methods, with an accuracy of 89% for both topology and locations. In another experiment using a separate set of 160 transmembrane proteins, TMMOD had 84% for topology and 89% for locations. When utilized for identifying transmembrane proteins from non-transmembrane proteins, particularly signal peptides, TMMOD has consistently fewer false positives than TMHMM does. Application of TMMOD to a collection of complete genomes shows that the number of predicted membrane proteins accounts for approximately 20-30% of all genes in those genomes, and that the topology where both the N- and C-termini are in the cytoplasm is dominant in these organisms except for Caenorhabditis elegans.
Availability: http://liao.cis.udel.edu/website/servers/TMMOD/
Similar articles
-
ZPRED: predicting the distance to the membrane center for residues in alpha-helical membrane proteins.Bioinformatics. 2006 Jul 15;22(14):e191-6. doi: 10.1093/bioinformatics/btl206. Bioinformatics. 2006. PMID: 16873471
-
Principles governing amino acid composition of integral membrane proteins: application to topology prediction.J Mol Biol. 1998 Oct 23;283(2):489-506. doi: 10.1006/jmbi.1998.2107. J Mol Biol. 1998. PMID: 9769220
-
OCTOPUS: improving topology prediction by two-track ANN-based preference scores and an extended topological grammar.Bioinformatics. 2008 Aug 1;24(15):1662-8. doi: 10.1093/bioinformatics/btn221. Epub 2008 May 12. Bioinformatics. 2008. PMID: 18474507
-
State-of-the-art in membrane protein prediction.Appl Bioinformatics. 2002;1(1):21-35. Appl Bioinformatics. 2002. PMID: 15130854 Review.
-
Prediction in 1D: secondary structure, membrane helices, and accessibility.Methods Biochem Anal. 2003;44:559-87. Methods Biochem Anal. 2003. PMID: 12647405 Review. No abstract available.
Cited by
-
A role for the CXCR4-CXCL12 axis in the little skate, Leucoraja erinacea.Am J Physiol Regul Integr Comp Physiol. 2018 Aug 1;315(2):R218-R229. doi: 10.1152/ajpregu.00322.2017. Epub 2018 Apr 11. Am J Physiol Regul Integr Comp Physiol. 2018. PMID: 29641231 Free PMC article.
-
Unveiling the Role of β-Glucosidase Genes in Bletilla striata's Secondary Metabolism: A Genome-Wide Analysis.Int J Mol Sci. 2024 Dec 8;25(23):13191. doi: 10.3390/ijms252313191. Int J Mol Sci. 2024. PMID: 39684901 Free PMC article.
-
Solution NMR studies reveal the location of the second transmembrane domain of the human sigma-1 receptor.FEBS Lett. 2015 Feb 27;589(5):659-65. doi: 10.1016/j.febslet.2015.01.033. Epub 2015 Jan 31. FEBS Lett. 2015. PMID: 25647032 Free PMC article.
-
Predicting the Assembly of the Transmembrane Domains of Viral Channel Forming Proteins and Peptide Drug Screening Using a Docking Approach.Biomolecules. 2022 Dec 10;12(12):1844. doi: 10.3390/biom12121844. Biomolecules. 2022. PMID: 36551274 Free PMC article.
-
Molecular characterization and differential expression suggested diverse functions of P-type II Ca2+ATPases in Triticum aestivum L.BMC Genomics. 2018 May 23;19(1):389. doi: 10.1186/s12864-018-4792-9. BMC Genomics. 2018. PMID: 29792165 Free PMC article.
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources