Improving tandem mass spectrum identification using peptide retention time prediction across diverse chromatography conditions
- PMID: 17622186
- DOI: 10.1021/ac070262k
Abstract
Most algorithms for identifying peptides from tandem mass spectra use information only from the final spectrum, ignoring non-mass-based information acquired routinely in liquid chromatography tandem mass spectrometry analyses. One physicochemical property that is always obtained but rarely exploited is peptide chromatographic retention time. Efforts to use chromatographic retention time to improve peptide identification are complicated by the variability of retention time across experimental conditions, which makes retention time calculations nongeneralizable. We show that peptide retention time can be reliably predicted by training and testing a support vector regressor on a small collection of data from a single liquid chromatography run. This model can be used to filter out peptide identifications whose observed retention time deviates from the predicted retention time. After filtering, positive peptide identifications increase by as much as 50% at a false discovery rate of 3%. We demonstrate that our dynamically trained model generalizes well across diverse chromatography conditions and methods for generating peptides, in particular improving peptide identification using nonspecific proteases.
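The train-on-one-run, then filter-by-deviation idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: to stay dependency-free it swaps the support vector regressor for ordinary least squares on a single crude feature (a count of hydrophobic residues). The residue set, the names `PSM`, `train_predictor`, and `filter_by_rt`, the toy peptides, and the 1.0-minute tolerance are all assumptions made for illustration.

```python
from statistics import mean
from typing import Callable, List, NamedTuple

# Assumption: a crude, illustrative set of hydrophobic residues.
HYDROPHOBIC = set("AILMFWVY")


class PSM(NamedTuple):
    peptide: str        # candidate peptide sequence
    observed_rt: float  # observed retention time, in minutes


def hydrophobic_count(peptide: str) -> float:
    """Single toy feature: number of hydrophobic residues."""
    return float(sum(1 for aa in peptide if aa in HYDROPHOBIC))


def train_predictor(training: List[PSM]) -> Callable[[str], float]:
    """Fit RT = slope * feature + intercept by ordinary least squares.

    Stand-in for the support vector regressor named in the abstract;
    the training set plays the role of confident hits from one LC run.
    """
    xs = [hydrophobic_count(p.peptide) for p in training]
    ys = [p.observed_rt for p in training]
    x_bar, y_bar = mean(xs), mean(ys)
    sxx = sum((x - x_bar) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("training peptides must differ in the feature")
    slope = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys)) / sxx
    intercept = y_bar - slope * x_bar
    return lambda peptide: slope * hydrophobic_count(peptide) + intercept


def filter_by_rt(psms: List[PSM],
                 predict: Callable[[str], float],
                 max_deviation: float) -> List[PSM]:
    """Keep identifications whose observed RT is close to the prediction."""
    return [p for p in psms
            if abs(p.observed_rt - predict(p.peptide)) <= max_deviation]


# Toy data (hypothetical): retention time grows with hydrophobic content.
training_run = [PSM("GGK", 5.0), PSM("AGK", 7.0),
                PSM("ALGK", 9.0), PSM("ALVGK", 11.0)]
predict_rt = train_predictor(training_run)

candidates = [PSM("FWGK", 9.3), PSM("GGSK", 20.0)]
kept = filter_by_rt(candidates, predict_rt, max_deviation=1.0)
print(kept)
```

Because the model is refit on each run, the same filtering step can in principle be reused when chromatography conditions change; a real pipeline would use a richer peptide representation and choose the tolerance against a target false discovery rate rather than a fixed cutoff.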