Protein and peptide identification algorithms using MS for use in high-throughput, automated pipelines
- PMID: 16196103
- DOI: 10.1002/pmic.200402091
Protein and peptide identification algorithms using MS for use in high-throughput, automated pipelines
Abstract
Current proteomics experiments can generate vast quantities of data very quickly, but this has not been matched by data analysis capabilities. Although there have been a number of recent reviews covering various aspects of peptide and protein identification methods using MS, comparisons of which methods are either the most appropriate for, or the most effective at, their proposed tasks are not readily available. As the need for high-throughput, automated peptide and protein identification systems increases, the creators of such pipelines need to be able to choose algorithms that are going to perform well both in terms of accuracy and computational efficiency. This article therefore provides a review of the currently available core algorithms for PMF, database searching using MS/MS, sequence tag searches and de novo sequencing. We also assess the relative performances of a number of these algorithms. As there is limited reporting of such information in the literature, we conclude that there is a need for the adoption of a system of standardised reporting on the performance of new peptide and protein identification algorithms, based upon freely available datasets. We go on to present our initial suggestions for the format and content of these datasets.
Similar articles
-
Development and assessment of scoring functions for protein identification using PMF data.Electrophoresis. 2007 Mar;28(5):864-70. doi: 10.1002/elps.200600305. Electrophoresis. 2007. PMID: 17265538
-
De novo sequencing methods in proteomics.Methods Mol Biol. 2010;604:105-21. doi: 10.1007/978-1-60761-444-9_8. Methods Mol Biol. 2010. PMID: 20013367
-
CHOMPER: a bioinformatic tool for rapid validation of tandem mass spectrometry search results associated with high-throughput proteomic strategies.Proteomics. 2002 Sep;2(9):1097-103. doi: 10.1002/1615-9861(200209)2:9<1097::AID-PROT1097>3.0.CO;2-X. Proteomics. 2002. PMID: 12362328
-
Software for computational peptide identification from MS-MS data.Drug Discov Today. 2006 Jul;11(13-14):595-600. doi: 10.1016/j.drudis.2006.05.011. Drug Discov Today. 2006. PMID: 16793527 Review.
-
Filtering strategies for improving protein identification in high-throughput MS/MS studies.Proteomics. 2009 Feb;9(4):848-60. doi: 10.1002/pmic.200800517. Proteomics. 2009. PMID: 19160393 Review.
Cited by
-
Evaluating peptide mass fingerprinting-based protein identification.Genomics Proteomics Bioinformatics. 2007 Dec;5(3-4):152-7. doi: 10.1016/S1672-0229(08)60002-9. Genomics Proteomics Bioinformatics. 2007. PMID: 18267296 Free PMC article.
-
A Perl procedure for protein identification by Peptide Mass Fingerprinting.BMC Bioinformatics. 2009 Oct 15;10 Suppl 12(Suppl 12):S11. doi: 10.1186/1471-2105-10-S12-S11. BMC Bioinformatics. 2009. PMID: 19828071 Free PMC article.
-
Feature-matching pattern-based support vector machines for robust peptide mass fingerprinting.Mol Cell Proteomics. 2011 Dec;10(12):M110.005785. doi: 10.1074/mcp.M110.005785. Epub 2011 Jul 20. Mol Cell Proteomics. 2011. PMID: 21775775 Free PMC article.
-
Precursor-ion mass re-estimation improves peptide identification on hybrid instruments.J Proteome Res. 2008 Sep;7(9):4031-9. doi: 10.1021/pr800307m. Epub 2008 Aug 16. J Proteome Res. 2008. PMID: 18707148 Free PMC article.
-
Tools for exploring the proteomosphere.J Proteomics. 2009 Mar 6;72(2):137-44. doi: 10.1016/j.jprot.2009.01.012. Epub 2009 Jan 22. J Proteomics. 2009. PMID: 19167528 Free PMC article. Review.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Research Materials