PepSOM: an algorithm for peptide identification by tandem mass spectrometry based on SOM
- PMID: 17503392
Abstract
Peptide identification by tandem mass spectrometry is both an important and a challenging problem in proteomics. At present, huge amounts of spectral data are generated by high-throughput mass spectrometers at a very fast pace, but the algorithms that analyze these spectra are either too slow, not accurate enough, or give only partial sequences or sequence tags. In this paper, we emphasize the balance between identification completeness and efficiency, with reasonable accuracy, for peptide identification from tandem mass spectra. Our method converts spectra to vectors in a high-dimensional space, then uses a self-organizing map (SOM) and a multi-point range query (MPRQ) algorithm as a coarse filter to reduce the number of candidates, achieving efficient and accurate database search. Experiments show that our algorithm is both fast and accurate in peptide identification.
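The abstract gives no implementation details, so the following is only a minimal sketch of the general idea: bin each spectrum into a fixed-length vector, train a small SOM on the candidate vectors, and shortlist candidates whose SOM node lies near the query's node. Every name here (`spectrum_to_vector`, `TinySOM`, `coarse_filter`), the binning scheme, the grid size, and the toy peak lists are assumptions made for illustration. The plain grid range query stands in for the MPRQ algorithm, which the abstract names but does not describe.

```python
import math
import random


def spectrum_to_vector(peaks, max_mz=2000.0, bin_width=100.0):
    """Bin (m/z, intensity) peaks into a fixed-length, unit-length vector."""
    n_bins = int(math.ceil(max_mz / bin_width))
    vec = [0.0] * n_bins
    for mz, intensity in peaks:
        if 0.0 <= mz < max_mz:
            vec[int(mz // bin_width)] += intensity
    norm = math.sqrt(sum(v * v for v in vec))
    return [v / norm for v in vec] if norm > 0 else vec


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


class TinySOM:
    """A deliberately small rectangular SOM (hypothetical helper, not PepSOM)."""

    def __init__(self, rows, cols, dim, seed=0):
        rng = random.Random(seed)
        self.weights = {
            (r, c): [rng.random() for _ in range(dim)]
            for r in range(rows)
            for c in range(cols)
        }

    def best_matching_unit(self, vec):
        """Grid coordinates of the node whose weights are closest to vec."""
        return min(self.weights, key=lambda node: sq_dist(self.weights[node], vec))

    def train(self, vectors, epochs=50, lr0=0.5, radius0=1.5):
        """Online training with linearly decaying learning rate and radius."""
        for epoch in range(epochs):
            frac = 1.0 - epoch / epochs
            lr = lr0 * frac
            radius = max(radius0 * frac, 0.5)
            for vec in vectors:
                br, bc = self.best_matching_unit(vec)
                for (r, c), w in self.weights.items():
                    grid_d2 = (r - br) ** 2 + (c - bc) ** 2
                    h = math.exp(-grid_d2 / (2 * radius ** 2))
                    for i in range(len(w)):
                        w[i] += lr * h * (vec[i] - w[i])


def coarse_filter(som, query_vec, index, radius=1):
    """Keep candidates whose SOM node is within `radius` cells of the query's node.

    This simple range query on the grid is a stand-in for MPRQ.
    """
    qr, qc = som.best_matching_unit(query_vec)
    return [
        pid
        for pid, (r, c) in index.items()
        if abs(r - qr) <= radius and abs(c - qc) <= radius
    ]


# Toy, made-up peak lists standing in for theoretical spectra of database peptides.
candidates = {
    "cand_A": [(150.0, 3.0), (250.0, 4.0)],
    "cand_B": [(950.0, 1.0), (1250.0, 2.0)],
    "cand_C": [(1750.0, 5.0), (1850.0, 1.0)],
}
vectors = {pid: spectrum_to_vector(p) for pid, p in candidates.items()}

som = TinySOM(rows=3, cols=3, dim=20)
som.train(list(vectors.values()))

# Index every candidate by the grid node it maps to, then filter for a query.
index = {pid: som.best_matching_unit(v) for pid, v in vectors.items()}
query = spectrum_to_vector([(150.0, 3.0), (250.0, 4.0)])
shortlist = coarse_filter(som, query, index, radius=0)
```

In a pipeline of this shape, only the shortlisted candidates would go on to a slower, more precise scoring step, which is where the speed-up of a coarse filter comes from. The scoring step itself is outside what the abstract describes and is not sketched here.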