Charger: combination of signal processing and statistical learning algorithms for precursor charge-state determination from electron-transfer dissociation spectra
- PMID: 18081262
- DOI: 10.1021/ac071332q
Abstract
Tandem mass spectrometry combined with liquid chromatography has emerged as a powerful tool for high-throughput characterization of complex protein mixtures. One bioinformatics challenge in analyzing the resulting mass spectral data is determining the precursor charge when fragment ions are detected at unit mass resolution. Charge-state information is used to filter database sequences before they are correlated with experimental data. When the charge state is not known accurately, several charge states are assumed, which dramatically increases database search times. To address this problem, we have developed an approach for determining the charge state of peptides from tandem mass spectra obtained by fragmentation via electron-transfer dissociation (ETD) reactions. Protein analysis by ETD is thought to extend the range of amino acid sequences that can be analyzed by mass spectrometry-based proteomics. One example is the improved capability to characterize phosphorylated peptides. Our approach to charge-state determination combines signal processing with statistical machine learning. The signal processing employs correlation and convolution analyses to determine precursor masses and charge states of peptides. We discuss the applicability of these methods to spectra of different charge states. We note that in our applications correlation analysis outperforms convolution analysis in determining peptide charge states. Correlation analysis is best suited for spectra with a prevalence of complementary ions. It is highly specific but depends on the quality of the spectra. The linear discriminant analysis (LDA) approach uses a number of other spectral features to predict charge states. We train an LDA classifier on a set of manually curated spectra from a mixture of proteins of known identity. The training set contains over 5000 spectra.
A number of features pertinent to ETD spectra of peptides were used in training. The LDA loading coefficients indicate the relative importance of the different features for charge-state determination. We applied our model to a test data set generated from a mixture of 49 proteins, searching the spectra both with and without charge-state determination. Charge-state determination significantly reduces database search times. We also discuss the cost associated with possible misclassification of charge states.
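The idea behind using complementary ions to infer precursor charge can be illustrated with a minimal sketch. It is not the published Charger implementation, whose details the abstract does not give. It assumes that fragment ions are singly charged, that a complementary c/z-radical pair sums to the neutral peptide mass plus about 3.02 Da, and that a simple intensity-weighted pair count can stand in for the correlation analysis. The names `complementary_score`, `predict_charge`, and `COMPLEMENT_OFFSET` are hypothetical.

```python
import numpy as np

PROTON = 1.00728  # proton mass in Da

# Assumption: a singly charged c ion and its complementary z-radical ion sum
# to the neutral peptide mass plus roughly two protons and one hydrogen atom.
COMPLEMENT_OFFSET = 3.0224


def complementary_score(mz, intensity, pair_sum, tol=1.0):
    """Intensity-weighted count of peak pairs whose m/z values sum to pair_sum."""
    mz = np.asarray(mz, dtype=float)
    intensity = np.asarray(intensity, dtype=float)
    # pair_error[i, j] is how far peaks i and j are from being complementary.
    pair_error = np.abs(mz[:, None] + mz[None, :] - pair_sum)
    matched = pair_error <= tol
    # The matrix is symmetric, so each distinct pair appears twice; halve the sum.
    return 0.5 * float(np.sum(np.outer(intensity, intensity) * matched))


def predict_charge(mz, intensity, precursor_mz, charges=(2, 3, 4, 5, 6), tol=1.0):
    """Score each candidate charge and return (best_charge, scores_by_charge)."""
    scores = {}
    for z in charges:
        # Neutral mass implied by the precursor m/z if its charge were z.
        neutral_mass = z * (precursor_mz - PROTON)
        scores[z] = complementary_score(
            mz, intensity, neutral_mass + COMPLEMENT_OFFSET, tol
        )
    best = max(scores, key=scores.get)
    return best, scores
```

The default tolerance of 1.0 Da is a guess suited to unit mass resolution. When no complementary pairs are found, every score is zero and the function simply returns the first candidate charge, which mirrors the abstract's point that correlation analysis depends on spectrum quality and motivates a feature-based classifier for the remaining spectra.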