This is a preprint.
DeMixSC: a deconvolution framework that uses single-cell sequencing plus a small benchmark dataset for improved analysis of cell-type ratios in complex tissue samples
- PMID: 37873318
- PMCID: PMC10592762
- DOI: 10.1101/2023.10.10.561733
DeMixSC: a deconvolution framework that uses single-cell sequencing plus a small benchmark dataset for improved analysis of cell-type ratios in complex tissue samples
Update in
-
A deconvolution framework that uses single-cell sequencing plus a small benchmark data set for accurate analysis of cell type ratios in complex tissue samples.Genome Res. 2025 Jan 22;35(1):147-161. doi: 10.1101/gr.278822.123. Genome Res. 2025. PMID: 39586714 Free PMC article.
Abstract
Bulk deconvolution with single-cell/nucleus RNA-seq data is critical for understanding heterogeneity in complex biological samples, yet the technological discrepancy across sequencing platforms limits deconvolution accuracy. To address this, we introduce an experimental design to match inter-platform biological signals, hence revealing the technological discrepancy, and then develop a deconvolution framework called DeMixSC using the better-matched, i.e., benchmark, data. Built upon a novel weighted nonnegative least-squares framework, DeMixSC identifies and adjusts genes with high technological discrepancy and aligns the benchmark data with large patient cohorts of matched-tissue-type for large-scale deconvolution. Our results using a benchmark dataset of healthy retinas suggest much-improved deconvolution accuracy. Further analysis of a cohort of 453 patients with age-related macular degeneration supports the broad applicability of DeMixSC. Our findings reveal the impact of technological discrepancy on deconvolution performance and underscore the importance of a well-matched dataset to resolve this challenge. The developed DeMixSC framework is generally applicable for deconvolving large cohorts of disease tissues, and potentially cancer.
Keywords: Transcriptomic deconvolution; age-related macular degeneration; bulk RNA sequencing; retina; single-cell RNA sequencing; single-nucleus RNA sequencing; technological discrepancy.
Conflict of interest statement
Competing interests The authors declare that they have no competing interests.
Figures
References
Publication types
Grants and funding
- S10 OD018033/OD/NIH HHS/United States
- P30 CA016672/CA/NCI NIH HHS/United States
- R01 CA268380/CA/NCI NIH HHS/United States
- S10 OD023469/OD/NIH HHS/United States
- P30 EY002520/EY/NEI NIH HHS/United States
- T32 CA096520/CA/NCI NIH HHS/United States
- S10 OD025240/OD/NIH HHS/United States
- R01 CA239342/CA/NCI NIH HHS/United States
- P30 CA125123/CA/NCI NIH HHS/United States
- R01 EY022356/EY/NEI NIH HHS/United States
- K22 CA234406/CA/NCI NIH HHS/United States
- S10 RR024574/RR/NCRR NIH HHS/United States
- R01 EY018571/EY/NEI NIH HHS/United States
LinkOut - more resources
Full Text Sources
Molecular Biology Databases