Evaluation of signal peptide prediction algorithms for identification of mycobacterial signal peptides using sequence data from proteomic methods
- PMID: 19389770
- PMCID: PMC2885676
- DOI: 10.1099/mic.0.025270-0
Evaluation of signal peptide prediction algorithms for identification of mycobacterial signal peptides using sequence data from proteomic methods
Abstract
Secreted proteins play an important part in the pathogenicity of Mycobacterium tuberculosis, and are the primary source of vaccine and diagnostic candidates. A majority of these proteins are exported via the signal peptidase I-dependent pathway, and have a signal peptide that is cleaved off during the secretion process. Sequence similarities within signal peptides have spurred the development of several algorithms for predicting their presence as well as the respective cleavage sites. For proteins exported via this pathway, algorithms exist for eukaryotes, and for Gram-negative and Gram-positive bacteria. However, the unique structure of the mycobacterial membrane raises the question of whether the existing algorithms are suitable for predicting signal peptides within mycobacterial proteins. In this work, we have evaluated the performance of nine signal peptide prediction algorithms on a positive validation set, consisting of 57 proteins with a verified signal peptide and cleavage site, and a negative set, consisting of 61 proteins that have an N-terminal sequence that confirms the annotated translational start site. We found the hidden Markov model of SignalP v3.0 to be the best-performing algorithm for predicting the presence of a signal peptide in mycobacterial proteins. It predicted no false positives or false negatives, and predicted a correct cleavage site for 45 of the 57 proteins in the positive set. Based on these results, we used the hidden Markov model of SignalP v3.0 to analyse the 10 available annotated proteomes of mycobacterial species, including annotations of M. tuberculosis H37Rv from the Wellcome Trust Sanger Institute and the J. Craig Venter Institute (JCVI). When excluding proteins with transmembrane regions among the proteins predicted to harbour a signal peptide, we found between 7.8 and 10.5% of the proteins in the proteomes to be putative secreted proteins. Interestingly, we observed a consistent difference in the percentage of predicted proteins between the Sanger Institute and JCVI. We have determined the most valuable algorithm for predicting signal peptidase I-processed proteins of M. tuberculosis, and used this algorithm to estimate the number of mycobacterial proteins with the potential to be exported via this pathway.
Figures

Similar articles
-
Signal peptide prediction based on analysis of experimentally verified cleavage sites.Protein Sci. 2004 Oct;13(10):2819-24. doi: 10.1110/ps.04682504. Epub 2004 Aug 31. Protein Sci. 2004. PMID: 15340161 Free PMC article.
-
Identification of putative exported/secreted proteins in prokaryotic proteomes.Gene. 2001 May 16;269(1-2):195-204. doi: 10.1016/s0378-1119(01)00436-x. Gene. 2001. PMID: 11376951
-
Prediction of lipoprotein signal peptides in Gram-positive bacteria with a Hidden Markov Model.J Proteome Res. 2008 Dec;7(12):5082-93. doi: 10.1021/pr800162c. J Proteome Res. 2008. PMID: 19367716
-
Machine learning approaches for the prediction of signal peptides and other protein sorting signals.Protein Eng. 1999 Jan;12(1):3-9. doi: 10.1093/protein/12.1.3. Protein Eng. 1999. PMID: 10065704 Review.
-
Peptide signal molecules and bacteriocins in Gram-negative bacteria: a genome-wide in silico screening for peptides containing a double-glycine leader sequence and their cognate transporters.Peptides. 2004 Sep;25(9):1425-40. doi: 10.1016/j.peptides.2003.10.028. Peptides. 2004. PMID: 15374646 Review.
Cited by
-
Arming the troops: Post-translational modification of extracellular bacterial proteins.Sci Prog. 2020 Oct-Dec;103(4):36850420964317. doi: 10.1177/0036850420964317. Sci Prog. 2020. PMID: 33148128 Free PMC article. Review.
-
Comparative Genomics of Field Isolates of Mycobacterium bovis and M. caprae Provides Evidence for Possible Correlates with Bacterial Viability and Virulence.PLoS Negl Trop Dis. 2015 Nov 19;9(11):e0004232. doi: 10.1371/journal.pntd.0004232. eCollection 2015 Nov. PLoS Negl Trop Dis. 2015. PMID: 26583774 Free PMC article.
-
Comprehensive characterization of methicillin-resistant Staphylococcus aureus subsp. aureus COL secretome by two-dimensional liquid chromatography and mass spectrometry.Mol Cell Proteomics. 2010 Sep;9(9):1898-919. doi: 10.1074/mcp.M900494-MCP200. Epub 2010 Apr 24. Mol Cell Proteomics. 2010. PMID: 20418541 Free PMC article.
-
Bioinformatic identification of Mycobacterium tuberculosis proteins likely to target host cell mitochondria: virulence factors?Microb Inform Exp. 2012 Dec 22;2(1):9. doi: 10.1186/2042-5783-2-9. Microb Inform Exp. 2012. PMID: 23259719 Free PMC article.
-
Microscopy and genomic analysis of Mycoplasma parvum strain Indiana.Vet Res. 2014 Aug 13;45(1):86. doi: 10.1186/s13567-014-0086-7. Vet Res. 2014. PMID: 25113534 Free PMC article.
References
-
- Abdallah, A. M., Gey van Pittius, N. C., Champion, P. A., Cox, J., Luirink, J., Vandenbroucke-Grauls, C. M., Appelmelk, B. J. & Bitter, W. (2007). Type VII secretion – mycobacteria show the way. Nat Rev Microbiol 5, 883–891. - PubMed
-
- Andersen, P. (2007). Vaccine strategies against latent tuberculosis infection. Trends Microbiol 15, 7–13. - PubMed
-
- Bendtsen, J. D., Nielsen, H., von Heijne, G. & Brunak, S. (2004). Improved prediction of signal peptides: SignalP 3.0. J Mol Biol 340, 783–795. - PubMed
-
- Camus, J. C., Pryor, M. J., Medigue, C. & Cole, S. T. (2002). Re-annotation of the genome sequence of Mycobacterium tuberculosis H37Rv. Microbiology 148, 2967–2973. - PubMed
-
- Chou, K. C. (2002). Prediction of protein signal sequences. Curr Protein Pept Sci 3, 615–622. - PubMed
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources