Algorithms and databases
- PMID: 19544027
- DOI: 10.1007/978-1-60761-157-8_14
Algorithms and databases
Abstract
The capacity of proteomics methods and mass spectrometry instrumentation to generate data has grown substantially over the past years. This data volume growth has in turn led to an increased reliance on software to identify peptide or protein sequences from the recorded mass spectra. Diverse algorithms can be applied for the processing of these data, each performing a specific task such as spectrum quality filtering, spectral clustering and merging, assigning a sequence to a spectrum, and assessing the validity of these assignments. The key algorithms to mass spectral processing pipelines are the ones that assign a sequence to a spectrum. The most commonly used variants of these are crucially dependent on the information contained in the sequences database, which they use as a basis for identification. Since these sequence databases are constructed in different ways and can therefore vary substantially in the amount and type of data they contain, they are also discussed here.
Similar articles
-
Algorithms for the de novo sequencing of peptides from tandem mass spectra.Expert Rev Proteomics. 2011 Oct;8(5):645-57. doi: 10.1586/epr.11.54. Expert Rev Proteomics. 2011. PMID: 21999834 Review.
-
Artificial decoy spectral libraries for false discovery rate estimation in spectral library searching in proteomics.J Proteome Res. 2010 Jan;9(1):605-10. doi: 10.1021/pr900947u. J Proteome Res. 2010. PMID: 19916561
-
Algorithms and tools for analysis and management of mass spectrometry data.Brief Bioinform. 2008 Mar;9(2):144-55. doi: 10.1093/bib/bbn007. Epub 2008 Mar 20. Brief Bioinform. 2008. PMID: 18356204
-
Speeding up tandem mass spectrometry database search: metric embeddings and fast near neighbor search.Bioinformatics. 2007 Mar 1;23(5):612-8. doi: 10.1093/bioinformatics/btl645. Epub 2007 Jan 19. Bioinformatics. 2007. PMID: 17237061
-
Protein and peptide identification algorithms using MS for use in high-throughput, automated pipelines.Proteomics. 2005 Nov;5(16):4082-95. doi: 10.1002/pmic.200402091. Proteomics. 2005. PMID: 16196103 Review.
Cited by
-
DeltAMT: a statistical algorithm for fast detection of protein modifications from LC-MS/MS data.Mol Cell Proteomics. 2011 May;10(5):M110.000455. doi: 10.1074/mcp.M110.000455. Epub 2011 Feb 14. Mol Cell Proteomics. 2011. PMID: 21321130 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
Miscellaneous