MemType-2L: a web server for predicting membrane proteins and their types by incorporating evolution information through Pse-PSSM
- PMID: 17586467
- DOI: 10.1016/j.bbrc.2007.06.027
MemType-2L: a web server for predicting membrane proteins and their types by incorporating evolution information through Pse-PSSM
Abstract
Given an uncharacterized protein sequence, how can we identify whether it is a membrane protein or not? If it is, which membrane protein type it belongs to? These questions are important because they are closely relevant to the biological function of the query protein and to its interaction process with other molecules in a biological system. Particularly, with the avalanche of protein sequences generated in the Post-Genomic Age and the relatively much slower progress in using biochemical experiments to determine their functions, it is highly desired to develop an automated method that can be used to help address these questions. In this study, a 2-layer predictor, called MemType-2L, has been developed: the 1st layer prediction engine is to identify a query protein as membrane or non-membrane; if it is a membrane protein, the process will be automatically continued with the 2nd-layer prediction engine to further identify its type among the following eight categories: (1) type I, (2) type II, (3) type III, (4) type IV, (5) multipass, (6) lipid-chain-anchored, (7) GPI-anchored, and (8) peripheral. MemType-2L is featured by incorporating the evolution information through representing the protein samples with the Pse-PSSM (Pseudo Position-Specific Score Matrix) vectors, and by containing an ensemble classifier formed by fusing many powerful individual OET-KNN (Optimized Evidence-Theoretic K-Nearest Neighbor) classifiers. The success rates obtained by MemType-2L on a new-constructed stringent dataset by both the jackknife test and the independent dataset test are quite high, indicating that MemType-2L may become a very useful high throughput tool. As a Web server, MemType-2L is freely accessible to the public at http://chou.med.harvard.edu/bioinf/MemType.
Similar articles
-
Signal-3L: A 3-layer approach for predicting signal peptides.Biochem Biophys Res Commun. 2007 Nov 16;363(2):297-303. doi: 10.1016/j.bbrc.2007.08.140. Epub 2007 Aug 31. Biochem Biophys Res Commun. 2007. PMID: 17880924
-
Using optimized evidence-theoretic K-nearest neighbor classifier and pseudo-amino acid composition to predict membrane protein types.Biochem Biophys Res Commun. 2005 Aug 19;334(1):288-92. doi: 10.1016/j.bbrc.2005.06.087. Biochem Biophys Res Commun. 2005. PMID: 16002049
-
Hum-PLoc: a novel ensemble classifier for predicting human protein subcellular localization.Biochem Biophys Res Commun. 2006 Aug 18;347(1):150-7. doi: 10.1016/j.bbrc.2006.06.059. Epub 2006 Jun 21. Biochem Biophys Res Commun. 2006. PMID: 16808903
-
High-throughput modeling and analysis of protein structural dynamics.Brief Bioinform. 2007 Nov;8(6):432-45. doi: 10.1093/bib/bbm014. Epub 2007 May 7. Brief Bioinform. 2007. PMID: 17485424 Review.
-
SheddomeDB: the ectodomain shedding database for membrane-bound shed markers.BMC Bioinformatics. 2017 Mar 14;18(Suppl 3):42. doi: 10.1186/s12859-017-1465-7. BMC Bioinformatics. 2017. PMID: 28361715 Free PMC article. Review.
Cited by
-
iMem-Seq: A Multi-label Learning Classifier for Predicting Membrane Proteins Types.J Membr Biol. 2015 Aug;248(4):745-52. doi: 10.1007/s00232-015-9787-8. Epub 2015 Mar 22. J Membr Biol. 2015. PMID: 25796484
-
Classifying Multifunctional Enzymes by Incorporating Three Different Models into Chou's General Pseudo Amino Acid Composition.J Membr Biol. 2016 Aug;249(4):551-7. doi: 10.1007/s00232-016-9904-3. Epub 2016 Apr 25. J Membr Biol. 2016. PMID: 27113936
-
QSAR for RNases and theoretic-experimental study of molecular diversity on peptide mass fingerprints of a new Leishmania infantum protein.Mol Divers. 2010 May;14(2):349-69. doi: 10.1007/s11030-009-9178-0. Epub 2009 Jul 4. Mol Divers. 2010. PMID: 19578942 Free PMC article.
-
Predicting antifreeze proteins with weighted generalized dipeptide composition and multi-regression feature selection ensemble.BMC Bioinformatics. 2021 Jun 23;22(Suppl 3):340. doi: 10.1186/s12859-021-04251-z. BMC Bioinformatics. 2021. PMID: 34162327 Free PMC article.
-
Shear force-based genetic screen reveals negative regulators of cell adhesion and protrusive activity.Proc Natl Acad Sci U S A. 2017 Sep 12;114(37):E7727-E7736. doi: 10.1073/pnas.1616600114. Epub 2017 Aug 28. Proc Natl Acad Sci U S A. 2017. PMID: 28847951 Free PMC article.
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Research Materials