Imputing single-cell protein abundance in multiplex tissue imaging
- PMID: 40404617
- PMCID: PMC12098973
- DOI: 10.1038/s41467-025-59788-x
Imputing single-cell protein abundance in multiplex tissue imaging
Abstract
Multiplex tissue imaging enables single-cell spatial proteomics and transcriptomics but remains limited by incomplete molecular profiling, tissue loss, and probe failure. Here, we apply machine learning to impute single-cell protein abundance using multiplex tissue imaging data from a breast cancer cohort. We evaluate regularized linear regression, gradient-boosted trees, and deep learning autoencoders, incorporating spatial context to enhance imputation accuracy. Our models achieve mean absolute errors between 0.05-0.3 on a [0,1] scale, closely approximating ground truth values. Using imputed data, we classify single cells as pre- or post-treatment, demonstrating their biological relevance. These findings establish the feasibility of imputing missing protein abundance, highlight the advantages of spatial information, and support machine learning as a powerful tool for improving single-cell tissue imaging.
© 2025. The Author(s).
Conflict of interest statement
Competing interests: The authors declare no competing interests.
Figures








Update of
-
Imputing Single-Cell Protein Abundance in Multiplex Tissue Imaging.bioRxiv [Preprint]. 2024 Jul 27:2023.12.05.570058. doi: 10.1101/2023.12.05.570058. bioRxiv. 2024. Update in: Nat Commun. 2025 May 22;16(1):4747. doi: 10.1038/s41467-025-59788-x. PMID: 38106203 Free PMC article. Updated. Preprint.
References
MeSH terms
Grants and funding
- U24 CA284167/CA/NCI NIH HHS/United States
- U2C CA233280/CA/NCI NIH HHS/United States
- U24CA231877/U.S. Department of Health & Human Services | NIH | National Cancer Institute (NCI)
- U2CCA233280/U.S. Department of Health & Human Services | NIH | National Cancer Institute (NCI)
- U24 CA231877/CA/NCI NIH HHS/United States
LinkOut - more resources
Full Text Sources
Medical