Enhanced recognition of protein transmembrane domains with prediction-based structural profiles
- PMID: 16293670
- DOI: 10.1093/bioinformatics/bti784
Enhanced recognition of protein transmembrane domains with prediction-based structural profiles
Abstract
Motivation: Membrane domain prediction has recently been re-evaluated by several groups, suggesting that the accuracy of existing methods is still rather limited. In this work, we revisit this problem and propose novel methods for prediction of alpha-helical as well as beta-sheet transmembrane (TM) domains. The new approach is based on a compact representation of an amino acid residue and its environment, which consists of predicted solvent accessibility and secondary structure of each amino acid. A recently introduced method for solvent accessibility prediction trained on a set of soluble proteins is used here to indicate segments of residues that are predicted not to be accessible to water and, therefore, may be 'buried' in the membrane. While evolutionary profiles in the form of a multiple alignment are used to derive these simple 'structural profiles', they are not used explicitly for the membrane domain prediction and the overall number of parameters in the model is significantly reduced. This offers the possibility of a more reliable estimation of the free parameters in the model with a limited number of experimentally resolved membrane protein structures.
Results: Using cross-validated training on available sets of structurally resolved and non-redundant alpha and beta membrane proteins, we demonstrate that membrane domain prediction methods based on such a compact representation outperform approaches that utilize explicitly evolutionary profiles and multiple alignments. Moreover, using an external evaluation by the TMH Benchmark server we show that our final prediction protocol for the TM helix prediction is competitive with the state-of-the-art methods, achieving per-residue accuracy of approximately 89% and per-segment accuracy of approximately 80% on the set of high resolution structures used by the TMH Benchmark server. At the same time the observed rates of confusion with signal peptides and globular proteins are the lowest among the tested methods. The new method is available online at http://minnou.cchmc.org.
Similar articles
-
A knowledge-based scale for the analysis and prediction of buried and exposed faces of transmembrane domain proteins.Bioinformatics. 2004 Aug 12;20(12):1822-35. doi: 10.1093/bioinformatics/bth143. Epub 2004 Feb 26. Bioinformatics. 2004. PMID: 14988128
-
A combinatorial pattern discovery approach for the prediction of membrane dipping (re-entrant) loops.Bioinformatics. 2006 Jul 15;22(14):e290-7. doi: 10.1093/bioinformatics/btl209. Bioinformatics. 2006. PMID: 16873484
-
Modeling protein loops with knowledge-based prediction of sequence-structure alignment.Bioinformatics. 2007 Nov 1;23(21):2836-42. doi: 10.1093/bioinformatics/btm456. Epub 2007 Sep 7. Bioinformatics. 2007. PMID: 17827204
-
Membrane protein structure: prediction versus reality.Annu Rev Biochem. 2007;76:125-40. doi: 10.1146/annurev.biochem.76.052705.163539. Annu Rev Biochem. 2007. PMID: 17579561 Review.
-
The protein structure code: what is its present status?Comput Appl Biosci. 1991 Apr;7(2):133-42. doi: 10.1093/bioinformatics/7.2.133. Comput Appl Biosci. 1991. PMID: 2059837 Review.
Cited by
-
Predicting Protein Interaction Sites Using PITHIA.Methods Mol Biol. 2023;2690:375-383. doi: 10.1007/978-1-0716-3327-4_29. Methods Mol Biol. 2023. PMID: 37450160
-
An unusual ERAD-like complex is targeted to the apicoplast of Plasmodium falciparum.Eukaryot Cell. 2009 Aug;8(8):1134-45. doi: 10.1128/EC.00083-09. Epub 2009 Jun 5. Eukaryot Cell. 2009. PMID: 19502583 Free PMC article.
-
In silico characterization and homology modeling of thylakoid-bound ascorbate peroxidase from a drought tolerant wheat cultivar.Genomics Proteomics Bioinformatics. 2009 Dec;7(4):185-93. doi: 10.1016/S1672-0229(08)60048-0. Genomics Proteomics Bioinformatics. 2009. PMID: 20172491 Free PMC article.
-
Transmembrane helix prediction using amino acid property features and latent semantic analysis.BMC Bioinformatics. 2008;9 Suppl 1(Suppl 1):S4. doi: 10.1186/1471-2105-9-S1-S4. BMC Bioinformatics. 2008. PMID: 18315857 Free PMC article.
-
In silico analysis and modeling of ACP-MIP-PilQ chimeric antigen from Neisseria meningitidis serogroup B.Rep Biochem Mol Biol. 2015 Oct;4(1):50-9. Rep Biochem Mol Biol. 2015. PMID: 26989750 Free PMC article.
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources