This is a preprint.
omicsGMF: a multi-tool for dimensionality reduction, batch correction and imputation applied to bulk- and single cell proteomics data
- PMID: 40196514
- PMCID: PMC11974731
- DOI: 10.1101/2025.03.24.644996
omicsGMF: a multi-tool for dimensionality reduction, batch correction and imputation applied to bulk- and single cell proteomics data
Abstract
The unprecedented speed and sensitivity of mass spectrometry (MS) unlocked large-scale applications of proteomics and even enabled proteome profiling of single cells. However, this fast-evolving field is hindered by a lack of scalable dimensionality reduction tools that can compensate for substantial batch effects and missingness across MS runs. Therefore, we present omicsGMF, a fast, scalable, and interpretable matrix factorization method, tailored for bulk and single-cell proteomics data. Unlike current workflows that sequentially apply imputation, batch correction, and principal component analysis, omicsGMF integrates these steps into a unified framework, dramatically enhancing data processing and dimensionality reduction. Additionally, omicsGMF provides robust imputation of missing values, outperforming bespoke state-of-the-art imputation tools. We further demonstrate how this integrated approach increases statistical power to detect differentially abundant proteins in the downstream data analysis. Hence, omicsGMF is a highly scalable approach to dimensionality reduction in proteomics, that dramatically improves many important steps in proteomics data analysis.
Conflict of interest statement
Ethics declarations Competing interests. The authors declare no competing interests
Figures
References
-
- Bader J.M., Geyer P.E., Müller J.B., Strauss M.T., Koch M., Leypoldt F., Koertvelyessy P., Bittner D., Schipke C.G., Incesoy E.I., Peters O., Deigendesch N., Simons M., Jensen M.K., Zetterberg H., Mann M.: Proteome profiling in cerebrospinal fluid reveals novel biomarkers of alzheimer’s disease. Molecular Systems Biology 16(6), 9356 (2020) 10.15252/msb.20199356 - DOI - PMC - PubMed
-
- Niu L., Thiele M., Geyer P.E., Rasmussen D.N., Webel H.E., Santos A., Gupta R., Meier F., Strauss M., Kjaergaard M., Lindvig K., Jacobsen S., Rasmussen S., Hansen T., Krag A., Mann M.: Noninvasive proteomic biomarkers for alcohol-related liver disease. Nature Medicine 28(6), 1277–1287 (2022) 10.1038/s41591-022-01850-y - DOI - PMC - PubMed
-
- Petrosius V., Aragon-Fernandez P., Üresin N., Kovacs G., Phlairaharn T., Furtwängler B., Op De Beeck J., Skovbakke S.L., Goletz S., Thomsen S.F., Keller U.a.d., Natarajan K.N., Porse B.T., Schoof E.M.: Exploration of cell state heterogeneity using single-cell proteomics through sensitivity-tailored data-independent acquisition. Nature Communications 14(1), 5910 (2023) 10.1038/s41467-023-41602-1 - DOI - PMC - PubMed
Publication types
Grants and funding
LinkOut - more resources
Full Text Sources