Improving the accuracy of transmembrane protein topology prediction using evolutionary information
- PMID: 17237066
- DOI: 10.1093/bioinformatics/btl677
Improving the accuracy of transmembrane protein topology prediction using evolutionary information
Abstract
Motivation: Many important biological processes such as cell signaling, transport of membrane-impermeable molecules, cell-cell communication, cell recognition and cell adhesion are mediated by membrane proteins. Unfortunately, as these proteins are not water soluble, it is extremely hard to experimentally determine their structure. Therefore, improved methods for predicting the structure of these proteins are vital in biological research. In order to improve transmembrane topology prediction, we evaluate the combined use of both integrated signal peptide prediction and evolutionary information in a single algorithm.
Results: A new method (MEMSAT3) for predicting transmembrane protein topology from sequence profiles is described and benchmarked with full cross-validation on a standard data set of 184 transmembrane proteins. The method is found to predict both the correct topology and the locations of transmembrane segments for 80% of the test set. This compares with accuracies of 62-72% for other popular methods on the same benchmark. By using a second neural network specifically to discriminate transmembrane from globular proteins, a very low overall false positive rate (0.5%) can also be achieved in detecting transmembrane proteins.
Availability: An implementation of the described method is available both as a web server (http://www.psipred.net) and as downloadable source code from http://bioinf.cs.ucl.ac.uk/memsat. Both the server and source code files are free to non-commercial users. Benchmark and training data are also available from http://bioinf.cs.ucl.ac.uk/memsat.
Similar articles
-
OCTOPUS: improving topology prediction by two-track ANN-based preference scores and an extended topological grammar.Bioinformatics. 2008 Aug 1;24(15):1662-8. doi: 10.1093/bioinformatics/btn221. Epub 2008 May 12. Bioinformatics. 2008. PMID: 18474507
-
An improved hidden Markov model for transmembrane protein detection and topology prediction and its applications to complete genomes.Bioinformatics. 2005 May 1;21(9):1853-8. doi: 10.1093/bioinformatics/bti303. Epub 2005 Feb 2. Bioinformatics. 2005. PMID: 15691854
-
Prediction of transmembrane regions of beta-barrel proteins using ANN- and SVM-based methods.Proteins. 2004 Jul 1;56(1):11-8. doi: 10.1002/prot.20092. Proteins. 2004. PMID: 15162482
-
Topology of membrane proteins-predictions, limitations and variations.Curr Opin Struct Biol. 2018 Jun;50:9-17. doi: 10.1016/j.sbi.2017.10.003. Epub 2017 Nov 5. Curr Opin Struct Biol. 2018. PMID: 29100082 Review.
-
Topology prediction of helical transmembrane proteins: how far have we reached?Curr Protein Pept Sci. 2010 Nov;11(7):550-61. doi: 10.2174/138920310794109184. Curr Protein Pept Sci. 2010. PMID: 20887261 Review.
Cited by
-
Ab Initio structure prediction for Escherichia coli: towards genome-wide protein structure modeling and fold assignment.Sci Rep. 2013;3:1895. doi: 10.1038/srep01895. Sci Rep. 2013. PMID: 23719418 Free PMC article.
-
Protein transport across and into cell membranes in bacteria and archaea.Cell Mol Life Sci. 2010 Jan;67(2):179-99. doi: 10.1007/s00018-009-0160-x. Epub 2009 Oct 10. Cell Mol Life Sci. 2010. PMID: 19823765 Free PMC article. Review.
-
Identification of a dehydrogenase required for lactose metabolism in Caulobacter crescentus.Appl Environ Microbiol. 2010 May;76(9):3004-14. doi: 10.1128/AEM.02085-09. Epub 2010 Feb 26. Appl Environ Microbiol. 2010. PMID: 20190087 Free PMC article.
-
CoBaltDB: Complete bacterial and archaeal orfeomes subcellular localization database and associated resources.BMC Microbiol. 2010 Mar 23;10:88. doi: 10.1186/1471-2180-10-88. BMC Microbiol. 2010. PMID: 20331850 Free PMC article.
-
Origin, evolution, and divergence of plant class C GH9 endoglucanases.BMC Evol Biol. 2018 May 30;18(1):79. doi: 10.1186/s12862-018-1185-2. BMC Evol Biol. 2018. PMID: 29848310 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources