Robust Summarization and Inference in Proteome-wide Label-free Quantification
- PMID: 32321741
- PMCID: PMC7338080
- DOI: 10.1074/mcp.RA119.001624
Robust Summarization and Inference in Proteome-wide Label-free Quantification
Abstract
Label-Free Quantitative mass spectrometry based workflows for differential expression (DE) analysis of proteins impose important challenges on the data analysis because of peptide-specific effects and context dependent missingness of peptide intensities. Peptide-based workflows, like MSqRob, test for DE directly from peptide intensities and outperform summarization methods which first aggregate MS1 peptide intensities to protein intensities before DE analysis. However, these methods are computationally expensive, often hard to understand for the non-specialized end-user, and do not provide protein summaries, which are important for visualization or downstream processing. In this work, we therefore evaluate state-of-the-art summarization strategies using a benchmark spike-in dataset and discuss why and when these fail compared with the state-of-the-art peptide based model, MSqRob. Based on this evaluation, we propose a novel summarization strategy, MSqRobSum, which estimates MSqRob's model parameters in a two-stage procedure circumventing the drawbacks of peptide-based workflows. MSqRobSum maintains MSqRob's superior performance, while providing useful protein expression summaries for plotting and downstream analysis. Summarizing peptide to protein intensities considerably reduces the computational complexity, the memory footprint and the model complexity, and makes it easier to disseminate DE inferred on protein summaries. Moreover, MSqRobSum provides a highly modular analysis framework, which provides researchers with full flexibility to develop data analysis workflows tailored toward their specific applications.
Keywords: Biostatistics; bioinformatics; bioinformatics software; differential expression; label-free quantification; mass spectrometry; ridge regression; shotgun proteomics; summarization.
© 2020 Sticker et al.
Conflict of interest statement
Conflict of interest—Authors declare no competing interests.
Figures






Similar articles
-
Experimental design and data-analysis in label-free quantitative LC/MS proteomics: A tutorial with MSqRob.J Proteomics. 2018 Jan 16;171:23-36. doi: 10.1016/j.jprot.2017.04.004. Epub 2017 Apr 5. J Proteomics. 2018. PMID: 28391044
-
Peptide-level Robust Ridge Regression Improves Estimation, Sensitivity, and Specificity in Data-dependent Quantitative Label-free Shotgun Proteomics.Mol Cell Proteomics. 2016 Feb;15(2):657-68. doi: 10.1074/mcp.M115.055897. Epub 2015 Nov 13. Mol Cell Proteomics. 2016. PMID: 26566788 Free PMC article.
-
A comparative analysis of computational approaches to relative protein quantification using peptide peak intensities in label-free LC-MS proteomics experiments.Proteomics. 2013 Feb;13(3-4):493-503. doi: 10.1002/pmic.201200269. Epub 2012 Nov 8. Proteomics. 2013. PMID: 23019139 Free PMC article. Review.
-
MS-EmpiRe Utilizes Peptide-level Noise Distributions for Ultra-sensitive Detection of Differentially Expressed Proteins.Mol Cell Proteomics. 2019 Sep;18(9):1880-1892. doi: 10.1074/mcp.RA119.001509. Epub 2019 Jun 24. Mol Cell Proteomics. 2019. PMID: 31235637 Free PMC article.
-
Methods and Algorithms for Quantitative Proteomics by Mass Spectrometry.Methods Mol Biol. 2020;2051:161-197. doi: 10.1007/978-1-4939-9744-2_7. Methods Mol Biol. 2020. PMID: 31552629 Review.
Cited by
-
Secretory Proteomic Responses of Endometrial Epithelial Cells to Trophoblast-Derived Extracellular Vesicles.Int J Mol Sci. 2023 Jul 25;24(15):11924. doi: 10.3390/ijms241511924. Int J Mol Sci. 2023. PMID: 37569298 Free PMC article.
-
LFQ-Based Peptide and Protein Intensity Differential Expression Analysis.J Proteome Res. 2023 Jun 2;22(6):2114-2123. doi: 10.1021/acs.jproteome.2c00812. Epub 2023 May 23. J Proteome Res. 2023. PMID: 37220883 Free PMC article.
-
Tidyproteomics: an open-source R package and data object for quantitative proteomics post analysis and visualization.BMC Bioinformatics. 2023 Jun 6;24(1):239. doi: 10.1186/s12859-023-05360-7. BMC Bioinformatics. 2023. PMID: 37280522 Free PMC article.
-
Hypoxia promotes the generation of a versican-rich extracellular matrix by human coronary artery endothelial cells.J Biol Chem. 2025 Jul 5;301(8):110459. doi: 10.1016/j.jbc.2025.110459. Online ahead of print. J Biol Chem. 2025. PMID: 40623650 Free PMC article.
-
The mouse metallomic landscape of aging and metabolism.Nat Commun. 2022 Feb 1;13(1):607. doi: 10.1038/s41467-022-28060-x. Nat Commun. 2022. PMID: 35105883 Free PMC article.
References
-
- Goeminne L. J. E., Gevaert K., and Clement L. (2018) Experimental design and data-analysis in label-free quantitative LC/MS proteomics: A tutorial with MSqRob. J. Proteomics 171, 23–36 - PubMed
-
- Tebbe A., Klammer M., Sighart S., Schaab C., and Daub H. (2015) Systematic evaluation of label-free and super-SILAC quantification for proteome expression analysis. Rapid Commun. Mass Spectrom. 29, 795–801 - PubMed
-
- Lazar C., Gatto L., Ferro M., Bruley C., and Burger T. (2016) Accounting for the multiple natures of missing values in label-free quantitative proteomics data sets to compare imputation strategies. J. Proteome Res. 15, 1116–1125 - PubMed
-
- Goeminne L.J.E., Argentini A., Martens L., and Clement L. (2015) Summarization vs peptide-based models in label-free quantitative proteomics: performance, pitfalls, and data analysis guidelines. J. Proteome Res. 14, 2457–2465 - PubMed
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources