Signal-3L: A 3-layer approach for predicting signal peptides
- PMID: 17880924
- DOI: 10.1016/j.bbrc.2007.08.140
Signal-3L: A 3-layer approach for predicting signal peptides
Abstract
Functioning as an "address tag" that directs nascent proteins to their proper cellular and extracellular locations, signal peptides have become a crucial tool in finding new drugs or reprogramming cells for gene therapy. To effectively and timely use such a tool, however, the first important thing is to develop an automated method for rapidly and accurately identifying the signal peptide for a given nascent protein. With the avalanche of new protein sequences generated in the post-genomic era, the challenge has become even more urgent and critical. In this paper, we have developed a novel method for predicting signal peptide sequences and their cleavage sites in human, plant, animal, eukaryotic, Gram-positive, and Gram-negative protein sequences, respectively. The new predictor is called Signal-3L that consists of three prediction engines working, respectively, for the following three progressively deepening layers: (1) identifying a query protein as secretory or non-secretory by an ensemble classifier formed by fusing many individual OET-KNN (optimized evidence-theoretic K nearest neighbor) classifiers operated in various dimensions of PseAA (pseudo amino acid) composition spaces; (2) selecting a set of candidates for the possible signal peptide cleavage sites of a query secretory protein by a subsite-coupled discrimination algorithm; (3) determining the final cleavage site by fusing the global sequence alignment outcome for each of the aforementioned candidates through a voting system. Signal-3L is featured by high success prediction rates with short computational time, and hence is particularly useful for the analysis of large-scale datasets. Signal-3L is freely available as a web-server at http://chou.med.harvard.edu/bioinf/Signal-3L/ or http://202.120.37.186/bioinf/Signal-3L, where, to further support the demand of the related areas, the signal peptides identified by Signal-3L for all the protein entries in Swiss-Prot databank that do not have signal peptide annotations or are annotated with uncertain terms but are classified by Signal-3L as secretory proteins are provided in a downloadable file. The large-scale file is prepared with Microsoft Excel and named "Tab-Signal-3L.xls", and will be updated once a year to include new protein entries and reflect the continuous development of Signal-3L.
Similar articles
-
Euk-PLoc: an ensemble classifier for large-scale eukaryotic protein subcellular location prediction.Amino Acids. 2007 Jul;33(1):57-67. doi: 10.1007/s00726-006-0478-8. Epub 2007 Jan 19. Amino Acids. 2007. PMID: 17235453
-
Signal-CF: a subsite-coupled and window-fusing approach for predicting signal peptides.Biochem Biophys Res Commun. 2007 Jun 8;357(3):633-40. doi: 10.1016/j.bbrc.2007.03.162. Epub 2007 Apr 5. Biochem Biophys Res Commun. 2007. PMID: 17434148
-
MemType-2L: a web server for predicting membrane proteins and their types by incorporating evolution information through Pse-PSSM.Biochem Biophys Res Commun. 2007 Aug 24;360(2):339-45. doi: 10.1016/j.bbrc.2007.06.027. Epub 2007 Jun 15. Biochem Biophys Res Commun. 2007. PMID: 17586467
-
Architecture, function and prediction of long signal peptides.Brief Bioinform. 2009 Sep;10(5):569-78. doi: 10.1093/bib/bbp030. Epub 2009 Jun 17. Brief Bioinform. 2009. PMID: 19535397 Review.
-
Rapid retrieval of protein structures from databases.Drug Discov Today. 2007 Sep;12(17-18):732-9. doi: 10.1016/j.drudis.2007.07.014. Epub 2007 Aug 28. Drug Discov Today. 2007. PMID: 17826686 Review.
Cited by
-
Polyphenol oxidase as a biochemical seed defense mechanism.Front Plant Sci. 2014 Dec 10;5:689. doi: 10.3389/fpls.2014.00689. eCollection 2014. Front Plant Sci. 2014. PMID: 25540647 Free PMC article.
-
The roles of gene duplication, gene conversion and positive selection in rodent Esp and Mup pheromone gene families with comparison to the Abp family.PLoS One. 2012;7(10):e47697. doi: 10.1371/journal.pone.0047697. Epub 2012 Oct 19. PLoS One. 2012. PMID: 23094077 Free PMC article.
-
A predicted physicochemically distinct sub-proteome associated with the intracellular organelle of the anammox bacterium Kuenenia stuttgartiensis.BMC Genomics. 2010 May 12;11:299. doi: 10.1186/1471-2164-11-299. BMC Genomics. 2010. PMID: 20459862 Free PMC article.
-
Chitinase Chit62J4 Essential for Chitin Processing by Human Microbiome Bacterium Clostridium paraputrificum J4.Molecules. 2021 Oct 2;26(19):5978. doi: 10.3390/molecules26195978. Molecules. 2021. PMID: 34641521 Free PMC article.
-
Donut-shaped fingerprint in homologous polypeptide relationships--a topological feature related to pathogenic structural changes in conformational disease.J Theor Biol. 2009 May 21;258(2):294-301. doi: 10.1016/j.jtbi.2009.02.009. Epub 2009 Feb 25. J Theor Biol. 2009. PMID: 19248793 Free PMC article.
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources