Accurate estimation of cell-type composition from gene expression data
- PMID: 31278265
- PMCID: PMC6611906
- DOI: 10.1038/s41467-019-10802-z
Abstract
The rapid development of single-cell transcriptomic technologies has helped uncover the cellular heterogeneity within cell populations. However, bulk RNA-seq continues to be the main workhorse for quantifying gene expression levels due to technical simplicity and low cost. To most effectively extract information from bulk data given the new knowledge gained from single-cell methods, we have developed a novel algorithm to estimate the cell-type composition of bulk data from a single-cell RNA-seq-derived cell-type signature. Comparison with existing methods using various real RNA-seq data sets indicates that our new approach is more accurate and comprehensive than previous methods, especially for the estimation of rare cell types. More importantly, our method can detect cell-type composition changes in response to external perturbations, thereby providing a valuable, cost-effective method for dissecting the cell-type-specific effects of drug treatments or condition changes. As such, our method is applicable to a wide range of biological and clinical investigations.
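The abstract does not spell out the algorithm itself, so the following is only a generic illustration of the underlying task, not the authors' method. It treats a bulk expression profile as a non-negative mixture of cell-type signature columns and recovers the mixing proportions with multiplicative non-negative least-squares updates. The function name `estimate_proportions`, the toy signature matrix, and the proportions are all hypothetical values chosen for illustration.

```python
# Generic deconvolution sketch (NOT the algorithm from the paper):
# solve min ||S x - b||^2 subject to x >= 0 with multiplicative updates,
# then rescale x so the estimated proportions sum to 1.

def estimate_proportions(signature, bulk, n_iter=2000):
    """signature: genes x cell-types matrix (list of rows), non-negative.
    bulk: per-gene bulk expression (list), non-negative."""
    n_genes = len(signature)
    n_types = len(signature[0])
    x = [1.0 / n_types] * n_types  # start from uniform proportions

    # Precompute S^T b, which stays fixed across iterations.
    stb = [sum(signature[g][k] * bulk[g] for g in range(n_genes))
           for k in range(n_types)]

    for _ in range(n_iter):
        # fitted = S x, then stsx = S^T (S x)
        fitted = [sum(signature[g][k] * x[k] for k in range(n_types))
                  for g in range(n_genes)]
        stsx = [sum(signature[g][k] * fitted[g] for g in range(n_genes))
                for k in range(n_types)]
        # Multiplicative update keeps every entry non-negative.
        x = [x[k] * stb[k] / stsx[k] if stsx[k] > 0 else 0.0
             for k in range(n_types)]

    total = sum(x)
    return [v / total for v in x]


# Toy example: 4 genes, 3 hypothetical cell types.
signature = [
    [10.0, 1.0, 0.0],
    [1.0, 8.0, 1.0],
    [0.0, 2.0, 9.0],
    [5.0, 5.0, 5.0],
]
true_props = [0.6, 0.3, 0.1]
# Simulated noise-free bulk profile: each gene is a weighted sum of signatures.
bulk = [sum(row[k] * true_props[k] for k in range(3)) for row in signature]

estimate = estimate_proportions(signature, bulk)
print([round(v, 3) for v in estimate])
```

On noise-free simulated input like this, the estimate should closely match the proportions used to build the mixture. Real bulk data are noisy, and rare cell types contribute little to an unweighted squared error, which is one reason plain least squares can struggle in the rare-cell-type setting the abstract highlights.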
Conflict of interest statement
The authors declare no competing interests.
Similar articles
- Data Analysis in Single-Cell Transcriptome Sequencing. Methods Mol Biol. 2018;1754:311-326. doi: 10.1007/978-1-4939-7717-8_18. PMID: 29536451
- Rare Cell Type Detection. Methods Mol Biol. 2019;1935:79-89. doi: 10.1007/978-1-4939-9057-3_5. PMID: 30758820
- Deconvolution from bulk gene expression by leveraging sample-wise and gene-wise similarities and single-cell RNA-Seq data. BMC Genomics. 2024 Sep 18;25(1):875. doi: 10.1186/s12864-024-10728-x. PMID: 39294558. Free PMC article.
- Current and Future Methods for mRNA Analysis: A Drive Toward Single Molecule Sequencing. Methods Mol Biol. 2018;1783:209-241. doi: 10.1007/978-1-4939-7834-2_11. PMID: 29767365. Review.
- Single-cell RNA-seq: advances and future challenges. Nucleic Acids Res. 2014 Aug;42(14):8845-60. doi: 10.1093/nar/gku555. Epub 2014 Jul 22. PMID: 25053837. Free PMC article. Review.
Cited by
- Adaptive digital tissue deconvolution. Bioinformatics. 2024 Jun 28;40(Suppl 1):i100-i109. doi: 10.1093/bioinformatics/btae263. PMID: 38940181. Free PMC article.
- Advances in mixed cell deconvolution enable quantification of cell types in spatial transcriptomic data. Nat Commun. 2022 Jan 19;13(1):385. doi: 10.1038/s41467-022-28020-5. PMID: 35046414. Free PMC article.
- SCADIE: simultaneous estimation of cell type proportions and cell type-specific gene expressions using SCAD-based iterative estimating procedure. Genome Biol. 2022 Jun 15;23(1):129. doi: 10.1186/s13059-022-02688-w. PMID: 35706040. Free PMC article.
- Differential Survival and Therapy Benefit of Patients with Breast Cancer Are Characterized by Distinct Epithelial and Immune Cell Microenvironments. Clin Cancer Res. 2022 Mar 1;28(5):960-971. doi: 10.1158/1078-0432.CCR-21-1442. PMID: 34965952. Free PMC article.
- SCDC: bulk gene expression deconvolution by multiple single-cell RNA sequencing references. Brief Bioinform. 2021 Jan 18;22(1):416-427. doi: 10.1093/bib/bbz166. PMID: 31925417. Free PMC article.