Identifying differences in protein expression levels by spectral counting and feature selection
- PMID: 18551400
- PMCID: PMC2703009
- DOI: 10.4238/vol7-2gmr426
Identifying differences in protein expression levels by spectral counting and feature selection
Abstract
Spectral counting is a strategy to quantify relative protein concentrations in pre-digested protein mixtures analyzed by liquid chromatography online with tandem mass spectrometry. In the present study, we used combinations of normalization and statistical (feature selection) methods on spectral counting data to verify whether we could pinpoint which and how many proteins were differentially expressed when comparing complex protein mixtures. These combinations were evaluated on real, but controlled, experiments (yeast lysates were spiked with protein markers at different concentrations to simulate differences), which were therefore verifiable. The following normalization methods were applied: total signal, Z-normalization, hybrid normalization, and log preprocessing. The feature selection methods were: the Golub index, the Student t-test, a strategy based on the weighting used in a forward-support vector machine (SVM-F) model, and SVM recursive feature elimination. The results showed that Z-normalization combined with SVM-F correctly identified which and how many protein markers were added to the yeast lysates for all different concentrations. The software we used is available at http://pcarvalho.com/patternlab.
Figures


Similar articles
-
PatternLab for proteomics: a tool for differential shotgun proteomics.BMC Bioinformatics. 2008 Jul 21;9:316. doi: 10.1186/1471-2105-9-316. BMC Bioinformatics. 2008. PMID: 18644148 Free PMC article.
-
Comparative shotgun proteomics using spectral count data and quasi-likelihood modeling.J Proteome Res. 2010 Aug 6;9(8):4295-305. doi: 10.1021/pr100527g. J Proteome Res. 2010. PMID: 20586475 Free PMC article.
-
Targeted Feature Detection for Data-Dependent Shotgun Proteomics.J Proteome Res. 2017 Aug 4;16(8):2964-2974. doi: 10.1021/acs.jproteome.7b00248. Epub 2017 Jul 19. J Proteome Res. 2017. PMID: 28673088 Free PMC article.
-
Simple, efficient and thorough shotgun proteomic analysis with PatternLab V.Nat Protoc. 2022 Jul;17(7):1553-1578. doi: 10.1038/s41596-022-00690-x. Epub 2022 Apr 11. Nat Protoc. 2022. PMID: 35411045 Review.
-
Statistical Approaches to Candidate Biomarker Panel Selection.Adv Exp Med Biol. 2016;919:463-492. doi: 10.1007/978-3-319-41448-5_22. Adv Exp Med Biol. 2016. PMID: 27975231 Free PMC article. Review.
Cited by
-
Upregulation of the phthiocerol dimycocerosate biosynthetic pathway by rifampin-resistant, rpoB mutant Mycobacterium tuberculosis.J Bacteriol. 2012 Dec;194(23):6441-52. doi: 10.1128/JB.01013-12. Epub 2012 Sep 21. J Bacteriol. 2012. PMID: 23002228 Free PMC article.
-
Comparison between Proteome and Transcriptome Response in Potato (Solanum tuberosum L.) Leaves Following Potato Virus Y (PVY) Infection.Proteomes. 2017 Jul 6;5(3):14. doi: 10.3390/proteomes5030014. Proteomes. 2017. PMID: 28684682 Free PMC article.
-
Genetic control of the mouse HDL proteome defines HDL traits, function, and heterogeneity.J Lipid Res. 2019 Mar;60(3):594-608. doi: 10.1194/jlr.M090555. Epub 2019 Jan 8. J Lipid Res. 2019. PMID: 30622162 Free PMC article.
-
Urine proteomics for profiling of human disease using high accuracy mass spectrometry.Proteomics Clin Appl. 2009 Sep 1;3(9):1052-1061. doi: 10.1002/prca.200900008. Proteomics Clin Appl. 2009. PMID: 21127740 Free PMC article.
-
Quantitative proteomics analysis of signalosome dynamics in primary T cells identifies the surface receptor CD6 as a Lat adaptor-independent TCR signaling hub.Nat Immunol. 2014 Apr;15(4):384-392. doi: 10.1038/ni.2843. Epub 2014 Mar 2. Nat Immunol. 2014. PMID: 24584089 Free PMC article.
References
-
- Badr G, Oommen BJ. On optimizing syntactic pattern recognition using tries and AI-based heuristic-search strategies. IEEE Trans Syst.Man Cybern.B Cybern. 2006;36:611–622. - PubMed
-
- Bylund D, Danielsson R, Malmquist G, Markides KE. Chromatographic alignment by warping and dynamic programming as a pre-processing tool for PARAFAC modelling of liquid chromatography-mass spectrometry data. J Chromatogr.A. 2002;961:237–244. - PubMed
-
- Carlson JM, Chakravarty A, Gross RH. BEAM: a beam search algorithm for the identification of cis-regulatory elements in groups of genes. J Comput.Biol. 2006;13:686–701. - PubMed
-
- Carvalho PC, Carvalho MGC, Degrave W, Lilla S, De Nucci G, Fonseca R, Spector N, Musacchio J, Domont GB. Differential protein expression patterns obtained by mass spectrometry can aid in the diagnosis of Hodgkin's disease. J.Exp.Ther.Oncol. 2007;6:137–145. - PubMed
-
- Carvalho PC, Freitas SS, Lima AB, Barros M, Bittencourt I, Degrave W, Cordovil I, Fonseca R, Carvalho MGC, Moura Neto RS, Cabello PH. Personalized diagnosis by cached solutions with hypertension as a study model. Genet.Mol.Res. 2006;5:856–867. - PubMed
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources