Prediction of protein signal sequences
- PMID: 12470215
- DOI: 10.2174/1389203023380468
Prediction of protein signal sequences
Abstract
Newly synthesized proteins have an intrinsic signal sequence, functioning as "address tags" or "zip codes", that is essential for guiding them wherever they are needed. Owing to such a unique function, protein signals have become a crucial tool in finding new drugs or reprogramming cells for gene therapy. However, to effectively use protein signals as a desirable vehicle in the field of proteomics, the first important thing is to find a fast and powerful method to identify the "address tag" or "zip code" entity. Although all signal sequences contain a hydrophobic core region, they show great variation in both overall length and amino acid sequence. It is this variation that makes it possible to deliver thousands of proteins to many different cellular locations by varieties of modes. It is also this variation that makes it very difficult to formulate a general algorithm to predict signal sequences. Nevertheless, various prediction models and algorithms have been developed during the past 17 years. This Review summarizes the development in this area, from the pioneering methods to neural network approaches, and to the sub-site coupling approaches. Meanwhile, the future challenges in this area, as well as some promising avenues for further improving the prediction quality, have been briefly addressed as well.
Similar articles
-
Predicting secretory protein signal sequence cleavage sites by fusing the marks of global alignments.Amino Acids. 2007;32(4):493-6. doi: 10.1007/s00726-006-0466-z. Epub 2006 Nov 15. Amino Acids. 2007. PMID: 17103116
-
Prediction of protein signal sequences and their cleavage sites by statistical rulers.Biochem Biophys Res Commun. 2005 Dec 16;338(2):1005-11. doi: 10.1016/j.bbrc.2005.10.046. Epub 2005 Oct 21. Biochem Biophys Res Commun. 2005. PMID: 16256954
-
Prediction of signal peptides using scaled window.Peptides. 2001 Dec;22(12):1973-9. doi: 10.1016/s0196-9781(01)00540-x. Peptides. 2001. PMID: 11786179
-
Machine learning approaches for the prediction of signal peptides and other protein sorting signals.Protein Eng. 1999 Jan;12(1):3-9. doi: 10.1093/protein/12.1.3. Protein Eng. 1999. PMID: 10065704 Review.
-
Prediction of subcellular locations of proteins: where to proceed?Proteomics. 2010 Nov;10(22):3970-83. doi: 10.1002/pmic.201000274. Epub 2010 Nov 2. Proteomics. 2010. PMID: 21080490 Review.
Cited by
-
Prediction of protein domain with mRMR feature selection and analysis.PLoS One. 2012;7(6):e39308. doi: 10.1371/journal.pone.0039308. Epub 2012 Jun 15. PLoS One. 2012. PMID: 22720092 Free PMC article.
-
DHSpred: support-vector-machine-based human DNase I hypersensitive sites prediction using the optimal features selected by random forest.Oncotarget. 2017 Dec 8;9(2):1944-1956. doi: 10.18632/oncotarget.23099. eCollection 2018 Jan 5. Oncotarget. 2017. PMID: 29416743 Free PMC article.
-
MLACP: machine-learning-based prediction of anticancer peptides.Oncotarget. 2017 Aug 19;8(44):77121-77136. doi: 10.18632/oncotarget.20365. eCollection 2017 Sep 29. Oncotarget. 2017. PMID: 29100375 Free PMC article.
-
PROlocalizer: integrated web service for protein subcellular localization prediction.Amino Acids. 2011 Mar;40(3):975-80. doi: 10.1007/s00726-010-0724-y. Epub 2010 Sep 2. Amino Acids. 2011. PMID: 20811800 Free PMC article.
-
Application of density similarities to predict membrane protein types based on pseudo-amino acid composition.J Theor Biol. 2011 May 7;276(1):132-7. doi: 10.1016/j.jtbi.2011.01.048. Epub 2011 Feb 4. J Theor Biol. 2011. PMID: 21296088 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Miscellaneous