Machine learning approaches for the prediction of signal peptides and other protein sorting signals
- PMID: 10065704
- DOI: 10.1093/protein/12.1.3
Machine learning approaches for the prediction of signal peptides and other protein sorting signals
Abstract
Prediction of protein sorting signals from the sequence of amino acids has great importance in the field of proteomics today. Recently, the growth of protein databases, combined with machine learning approaches, such as neural networks and hidden Markov models, have made it possible to achieve a level of reliability where practical use in, for example automatic database annotation is feasible. In this review, we concentrate on the present status and future perspectives of SignalP, our neural network-based method for prediction of the most well-known sorting signal: the secretory signal peptide. We discuss the problems associated with the use of SignalP on genomic sequences, showing that signal peptide prediction will improve further if integrated with predictions of start codons and transmembrane helices. As a step towards this goal, a hidden Markov model version of SignalP has been developed, making it possible to discriminate between cleaved signal peptides and uncleaved signal anchors. Furthermore, we show how SignalP can be used to characterize putative signal peptides from an archaeon, Methanococcus jannaschii. Finally, we briefly review a few methods for predicting other protein sorting signals and discuss the future of protein sorting prediction in general.
Similar articles
-
Improved prediction of signal peptides: SignalP 3.0.J Mol Biol. 2004 Jul 16;340(4):783-95. doi: 10.1016/j.jmb.2004.05.028. J Mol Biol. 2004. PMID: 15223320
-
Prediction of signal peptides and signal anchors by a hidden Markov model.Proc Int Conf Intell Syst Mol Biol. 1998;6:122-30. Proc Int Conf Intell Syst Mol Biol. 1998. PMID: 9783217
-
Evaluation of signal peptide prediction algorithms for identification of mycobacterial signal peptides using sequence data from proteomic methods.Microbiology (Reading). 2009 Jul;155(Pt 7):2375-2383. doi: 10.1099/mic.0.025270-0. Epub 2009 Apr 23. Microbiology (Reading). 2009. PMID: 19389770 Free PMC article.
-
Large-scale predictions of secretory proteins from mammalian genomic and EST sequences.Curr Opin Biotechnol. 2000 Feb;11(1):13-8. doi: 10.1016/s0958-1669(99)00048-8. Curr Opin Biotechnol. 2000. PMID: 10679337 Review.
-
Advances in the prediction of protein targeting signals.Proteomics. 2004 Jun;4(6):1571-80. doi: 10.1002/pmic.200300786. Proteomics. 2004. PMID: 15174127 Review.
Cited by
-
Genome sequencing and mapping reveal loss of heterozygosity as a mechanism for rapid adaptation in the vegetable pathogen Phytophthora capsici.Mol Plant Microbe Interact. 2012 Oct;25(10):1350-60. doi: 10.1094/MPMI-02-12-0028-R. Mol Plant Microbe Interact. 2012. PMID: 22712506 Free PMC article.
-
Fast subcellular localization by cascaded fusion of signal-based and homology-based methods.Proteome Sci. 2011 Oct 14;9 Suppl 1(Suppl 1):S8. doi: 10.1186/1477-5956-9-S1-S8. Proteome Sci. 2011. PMID: 22166017 Free PMC article.
-
Evolution of the Translocation and Assembly Module (TAM).Genome Biol Evol. 2015 May 20;7(6):1628-43. doi: 10.1093/gbe/evv097. Genome Biol Evol. 2015. PMID: 25994932 Free PMC article.
-
The antiretroviral lectin cyanovirin-N targets well-known and novel targets on the surface of Entamoeba histolytica trophozoites.Eukaryot Cell. 2010 Nov;9(11):1661-8. doi: 10.1128/EC.00166-10. Epub 2010 Sep 17. Eukaryot Cell. 2010. PMID: 20852023 Free PMC article.
-
A highly acid-stable and thermostable endo-beta-glucanase from the thermoacidophilic archaeon Sulfolobus solfataricus.Biochem J. 2005 Jan 15;385(Pt 2):581-8. doi: 10.1042/BJ20041388. Biochem J. 2005. PMID: 15456402 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Other Literature Sources
Research Materials