Robust accurate identification of peptides (RAId): deciphering MS2 data using a structured library search with de novo based statistics
- PMID: 16105903
- DOI: 10.1093/bioinformatics/bti620
Abstract
Motivation: The key to MS-based proteomics is peptide sequencing. The major challenge in peptide sequencing, whether library search or de novo, is to better infer statistical significance and better attain noise reduction. Since the noise in a spectrum depends on experimental conditions, the instrument used and many other factors, it cannot be predicted even if the peptide sequence is known. The characteristics of the noise can only be uncovered once a spectrum is given. We wish to overcome such issues.
Results: We designed RAId to identify peptides from their associated tandem mass spectrometry data. RAId performs a novel de novo sequencing followed by a search in a peptide library that we created. Through de novo sequencing, we establish the spectrum-specific background score statistics for the library search. When the database search fails to return significant hits, the top-ranking de novo sequences become potential candidates for new peptides that are not yet in the database. The use of spectrum-specific background statistics seems to enable RAId to perform well even when the spectral quality is marginal. Other important features of RAId include its potential in de novo sequencing alone and the ease of incorporating post-translational modifications.
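The workflow described in the Results can be illustrated with a minimal sketch. It is not RAId's actual scoring or statistical model, which the abstract does not specify. It assumes, purely for illustration, that the scores of de novo candidates for one spectrum serve as an empirical background, that a library hit's significance is an add-one-smoothed empirical p-value scaled by the number of library candidates, and that the search falls back to the top de novo sequences when no library hit passes the cutoff. The names `empirical_p_value`, `identify`, `e_cutoff` and `n_fallback` are hypothetical.

```python
from bisect import bisect_left


def empirical_p_value(background_scores, hit_score):
    """Smoothed fraction of background scores at or above hit_score.

    Assumption: a plain empirical tail estimate stands in for the
    spectrum-specific statistics, which the abstract does not detail.
    """
    ordered = sorted(background_scores)
    at_or_above = len(ordered) - bisect_left(ordered, hit_score)
    return (at_or_above + 1) / (len(ordered) + 1)


def identify(de_novo, library_hits, e_cutoff=1.0, n_fallback=3):
    """Pick significant library hits, else fall back to de novo candidates.

    de_novo:      list of (peptide, score) from de novo sequencing of one spectrum
    library_hits: list of (peptide, score) for library peptides scored
                  against the same spectrum
    """
    # The background is built from this spectrum only (hypothetical model).
    background = [score for _, score in de_novo]
    n_candidates = len(library_hits)

    significant = []
    for peptide, score in library_hits:
        # E-value: expected number of library candidates scoring this well by chance.
        e_value = empirical_p_value(background, score) * n_candidates
        if e_value <= e_cutoff:
            significant.append((peptide, score, e_value))

    if significant:
        return "library", sorted(significant, key=lambda hit: hit[2])

    # No significant library hit: report the top-ranking de novo sequences
    # as potential peptides that are not yet in the database.
    top = sorted(de_novo, key=lambda cand: cand[1], reverse=True)[:n_fallback]
    return "de_novo", top


if __name__ == "__main__":
    # Toy data: nine de novo candidates with scores 1.0 .. 9.0.
    de_novo = [("CAND%d" % i, float(i)) for i in range(1, 10)]
    print(identify(de_novo, [("PEPTIDE", 12.0), ("SEQ", 3.0)]))
    print(identify(de_novo, [("SEQ", 3.0)], e_cutoff=0.5))
```

With an empirical tail estimate of this kind, the smallest attainable p-value is 1/(N+1) for N background scores, so the resolution of the significance estimate depends on how many de novo candidates are generated per spectrum.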