A dynamic wavelet-based algorithm for pre-processing tandem mass spectrometry data
- PMID: 20628072
- DOI: 10.1093/bioinformatics/btq403
A dynamic wavelet-based algorithm for pre-processing tandem mass spectrometry data
Abstract
Motivation: Mass spectrometry (MS)-based proteomics is one of the most commonly used research techniques for identifying and characterizing proteins in biological and medical research. The identification of a protein is the critical first step in elucidating its biological function. Successful protein identification depends on various interrelated factors, including effective analysis of MS data generated in a proteomic experiment. This analysis comprises several stages, often combined in a pipeline or workflow. The first component of the analysis is known as spectra pre-processing. In this component, the raw data generated by the mass spectrometer is processed to eliminate noise and identify the mass-to-charge ratio (m/z) and intensity for the peaks in the spectrum corresponding to the presence of certain peptides or peptide fragments. Since all downstream analyses depend on the pre-processed data, effective pre-processing is critical to protein identification and characterization. There is a critical need for more robust pre-processing algorithms that perform well on tandem mass spectra under a variety of different conditions and can be easily integrated into sophisticated data analysis pipelines for practical wet-lab applications.
Result: We have developed a new pre-processing algorithm. Based on wavelet theory, our method uses a dynamic peak model to identify peaks. It is designed to be easily integrated into a complete proteomic analysis workflow. We compared the method with other available algorithms using a reference library of raw MS and tandem MS spectra with known protein composition information. Our pre-processing algorithm results in the identification of significantly more peptides and proteins in the downstream analysis for a given false discovery rate.
Availability: Software available at: http://www.maths.usyd.edu.au/u/penghao/index.html.
Similar articles
-
VEMS 3.0: algorithms and computational tools for tandem mass spectrometry based identification of post-translational modifications in proteins.J Proteome Res. 2005 Nov-Dec;4(6):2338-47. doi: 10.1021/pr050264q. J Proteome Res. 2005. PMID: 16335983
-
Proteomic data analysis workflow for discovery of candidate biomarker peaks predictive of clinical outcome for patients with acute myeloid leukemia.J Proteome Res. 2008 Jun;7(6):2332-41. doi: 10.1021/pr070482e. Epub 2008 May 2. J Proteome Res. 2008. PMID: 18452325
-
Isotopic peak intensity ratio based algorithm for determination of isotopic clusters and monoisotopic masses of polypeptides from high-resolution mass spectrometric data.Anal Chem. 2008 Oct 1;80(19):7294-303. doi: 10.1021/ac800913b. Epub 2008 Aug 28. Anal Chem. 2008. PMID: 18754627
-
Filtering strategies for improving protein identification in high-throughput MS/MS studies.Proteomics. 2009 Feb;9(4):848-60. doi: 10.1002/pmic.200800517. Proteomics. 2009. PMID: 19160393 Review.
-
Processing and classification of protein mass spectra.Mass Spectrom Rev. 2006 May-Jun;25(3):409-49. doi: 10.1002/mas.20072. Mass Spectrom Rev. 2006. PMID: 16463283 Review.
Cited by
-
DEIMoS: An Open-Source Tool for Processing High-Dimensional Mass Spectrometry Data.Anal Chem. 2022 Apr 26;94(16):6130-6138. doi: 10.1021/acs.analchem.1c05017. Epub 2022 Apr 17. Anal Chem. 2022. PMID: 35430813 Free PMC article.
-
Wavelet-based method for time-domain noise analysis and reduction in a frequency-scan ion trap mass spectrometer.J Am Soc Mass Spectrom. 2012 Nov;23(11):1855-64. doi: 10.1007/s13361-012-0455-2. Epub 2012 Aug 21. J Am Soc Mass Spectrom. 2012. PMID: 22907169
-
A simple method for predicting transmembrane proteins based on wavelet transform.Int J Biol Sci. 2013;9(1):22-33. doi: 10.7150/ijbs.5371. Epub 2012 Dec 19. Int J Biol Sci. 2013. PMID: 23289014 Free PMC article.
-
Wavelet-based peak detection and a new charge inference procedure for MS/MS implemented in ProteoWizard's msConvert.J Proteome Res. 2015 Feb 6;14(2):1299-307. doi: 10.1021/pr500886y. Epub 2014 Dec 2. J Proteome Res. 2015. PMID: 25411686 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Miscellaneous