[Preprint]. 2025 May 30:2025.03.24.644996.
doi: 10.1101/2025.03.24.644996.

omicsGMF: a multi-tool for dimensionality reduction, batch correction and imputation applied to bulk- and single cell proteomics data

Free PMC article

Alexandre Segers et al. bioRxiv.

Abstract

The unprecedented speed and sensitivity of mass spectrometry (MS) have unlocked large-scale applications of proteomics and even enabled proteome profiling of single cells. However, this fast-evolving field is hindered by a lack of scalable dimensionality reduction tools that can compensate for substantial batch effects and missingness across MS runs. Therefore, we present omicsGMF, a fast, scalable, and interpretable matrix factorization method tailored for bulk and single-cell proteomics data. Unlike current workflows that sequentially apply imputation, batch correction, and principal component analysis, omicsGMF integrates these steps into a unified framework, dramatically enhancing data processing and dimensionality reduction. Additionally, omicsGMF provides robust imputation of missing values, outperforming bespoke state-of-the-art imputation tools. We further demonstrate how this integrated approach increases statistical power to detect differentially abundant proteins in the downstream data analysis. Hence, omicsGMF is a highly scalable approach to dimensionality reduction in proteomics that dramatically improves many important steps in proteomics data analysis.
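
To make the idea of a unified framework concrete, the following is a minimal sketch of a matrix factorization that handles missing values, known batch covariates, and low-rank structure in a single fit. It is not the omicsGMF implementation or its API: it assumes a plain Gaussian loss fitted by alternating least squares on the observed entries only, and the function name `masked_factorization`, its parameters, and the simulated data are all hypothetical.

```python
import numpy as np

def masked_factorization(Y, X, k=2, n_iter=100, ridge=1e-3, seed=0):
    """Fit Y ~ X @ B.T + U @ V.T using only the observed (non-NaN) entries.

    Illustrative assumption, not the omicsGMF algorithm:
    Y is samples x proteins, X holds known covariates (e.g. batch),
    U are sample scores and V are protein loadings of rank k.
    """
    rng = np.random.default_rng(seed)
    n, p = Y.shape
    q = X.shape[1]
    obs = ~np.isnan(Y)                # mask of observed intensities
    Y0 = np.where(obs, Y, 0.0)        # placeholder zeros, never used in fits
    U = rng.normal(scale=0.1, size=(n, k))
    B = np.zeros((p, q))
    V = np.zeros((p, k))
    for _ in range(n_iter):
        # Step 1: per protein, regress observed values on [covariates, scores].
        Z = np.hstack([X, U])
        for j in range(p):
            m = obs[:, j]
            Zj = Z[m]
            A = Zj.T @ Zj + ridge * np.eye(q + k)
            coef = np.linalg.solve(A, Zj.T @ Y0[m, j])
            B[j], V[j] = coef[:q], coef[q:]
        # Step 2: per sample, regress covariate-adjusted residuals on loadings.
        R = Y0 - X @ B.T
        for i in range(n):
            m = obs[i]
            Vi = V[m]
            A = Vi.T @ Vi + ridge * np.eye(k)
            U[i] = np.linalg.solve(A, Vi.T @ R[i, m])
    fitted = X @ B.T + U @ V.T
    imputed = np.where(obs, Y, fitted)  # fill only the missing entries
    return U, V, B, imputed

# Simulated example: 40 samples in two batches, 30 proteins, 20% missing.
rng = np.random.default_rng(1)
n, p, k = 40, 30, 2
batch = np.repeat([0.0, 1.0], n // 2)
X = np.column_stack([np.ones(n), batch])   # intercept + batch indicator
Y_true = (X @ rng.normal(size=(p, 2)).T
          + rng.normal(size=(n, k)) @ rng.normal(size=(p, k)).T
          + 0.05 * rng.normal(size=(n, p)))
Y = Y_true.copy()
missing = rng.random((n, p)) < 0.2
Y[missing] = np.nan

U, V, B, imputed = masked_factorization(Y, X, k=k)
rmse = np.sqrt(np.mean((imputed[missing] - Y_true[missing]) ** 2))
```

In this sketch the scores `U` play the role of batch-adjusted reduced dimensions, `B` captures the batch effect per protein, and `imputed` fills missing entries from the same fitted model, so no separate imputation or batch-correction pass is needed before the factorization.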

Conflict of interest statement

Ethics declarations. Competing interests: The authors declare no competing interests.
