Signature-scoring methods developed for bulk samples are not adequate for cancer single-cell RNA sequencing data
- PMID: 35212622
- PMCID: PMC8916770
- DOI: 10.7554/eLife.71994
Signature-scoring methods developed for bulk samples are not adequate for cancer single-cell RNA sequencing data
Abstract
Quantifying the activity of gene expression signatures is common in analyses of single-cell RNA sequencing data. Methods originally developed for bulk samples are often used for this purpose without accounting for contextual differences between bulk and single-cell data. More broadly, few attempts have been made to benchmark these methods. Here, we benchmark five such methods, including single sample gene set enrichment analysis (ssGSEA), Gene Set Variation Analysis (GSVA), AUCell, Single Cell Signature Explorer (SCSE), and a new method we developed, Jointly Assessing Signature Mean and Inferring Enrichment (JASMINE). Using cancer as an example, we show cancer cells consistently express more genes than normal cells. This imbalance leads to bias in performance by bulk-sample-based ssGSEA in gold standard tests and down sampling experiments. In contrast, single-cell-based methods are less susceptible. Our results suggest caution should be exercised when using bulk-sample-based methods in single-cell data analyses, and cellular contexts should be taken into consideration when designing benchmarking strategies.
Keywords: benchmarking; cancer biology; cancer stemness; computational biology; gene counts; human; signature scoring; single cell RNA sequencing; systems biology.
© 2022, Noureen et al.
Conflict of interest statement
NN, ZY, YC, XW, SZ No competing interests declared
Figures













Similar articles
-
Protocol to benchmark gene expression signature scoring techniques for single-cell RNA sequencing data in cancer.STAR Protoc. 2022 Dec 16;3(4):101877. doi: 10.1016/j.xpro.2022.101877. Epub 2022 Nov 24. STAR Protoc. 2022. PMID: 36595948 Free PMC article.
-
Single sample scoring of molecular phenotypes.BMC Bioinformatics. 2018 Nov 6;19(1):404. doi: 10.1186/s12859-018-2435-4. BMC Bioinformatics. 2018. PMID: 30400809 Free PMC article.
-
Improving bulk RNA-seq classification by transferring gene signature from single cells in acute myeloid leukemia.Brief Bioinform. 2022 Mar 10;23(2):bbac002. doi: 10.1093/bib/bbac002. Brief Bioinform. 2022. PMID: 35136933
-
What are the applications of single-cell RNA sequencing in cancer research: a systematic review.J Exp Clin Cancer Res. 2021 May 11;40(1):163. doi: 10.1186/s13046-021-01955-1. J Exp Clin Cancer Res. 2021. PMID: 33975628 Free PMC article.
-
Single-Cell Genomics.Clin Chem. 2019 Aug;65(8):972-985. doi: 10.1373/clinchem.2017.283895. Epub 2019 Mar 14. Clin Chem. 2019. PMID: 30872376 Review.
Cited by
-
ATP-binding cassette family C member 1 constrains metabolic responses to high-fat diet in male mice.J Endocrinol. 2024 Jul 3;262(2):e240024. doi: 10.1530/JOE-24-0024. Print 2024 Aug 1. J Endocrinol. 2024. PMID: 38829241 Free PMC article.
-
Protocol to benchmark gene expression signature scoring techniques for single-cell RNA sequencing data in cancer.STAR Protoc. 2022 Dec 16;3(4):101877. doi: 10.1016/j.xpro.2022.101877. Epub 2022 Nov 24. STAR Protoc. 2022. PMID: 36595948 Free PMC article.
-
S-Propargyl-Cysteine Attenuates Stroke Heterogeneity via Promoting Protective Autophagy Across Multiple Neural Cell Types: Insights From Single-Cell Sequencing.CNS Neurosci Ther. 2025 Jul;31(7):e70399. doi: 10.1111/cns.70399. CNS Neurosci Ther. 2025. PMID: 40702904 Free PMC article.
-
Data analysis guidelines for single-cell RNA-seq in biomedical studies and clinical applications.Mil Med Res. 2022 Dec 2;9(1):68. doi: 10.1186/s40779-022-00434-8. Mil Med Res. 2022. PMID: 36461064 Free PMC article. Review.
-
Cancer cell - Fibroblast crosstalk via HB-EGF, EGFR, and MAPK signaling promotes the expression of macrophage chemo-attractants in squamous cell carcinoma.iScience. 2024 Aug 3;27(9):110635. doi: 10.1016/j.isci.2024.110635. eCollection 2024 Sep 20. iScience. 2024. PMID: 39262776 Free PMC article.
References
-
- Aibar S, González-Blas CB, Moerman T, Huynh-Thu VA, Imrichova H, Hulselmans G, Rambow F, Marine J-C, Geurts P, Aerts J, van den Oord J, Atak ZK, Wouters J, Aerts S. SCENIC: single-cell regulatory network inference and clustering. Nature Methods. 2017;14:1083–1086. doi: 10.1038/nmeth.4463. - DOI - PMC - PubMed
-
- Ben-Shachar M, Lüdecke D, Makowski D. effectsize: Estimation of Effect Size Indices and Standardized Parameters. Journal of Open Source Software. 2020;5:2815. doi: 10.21105/joss.02815. - DOI
-
- Bi K, He MX, Bakouny Z, Kanodia A, Napolitano S, Wu J, Grimaldi G, Braun DA, Cuoco MS, Mayorga A, DelloStritto L, Bouchard G, Steinharter J, Tewari AK, Vokes NI, Shannon E, Sun M, Park J, Chang SL, McGregor BA, Haq R, Denize T, Signoretti S, Guerriero JL, Vigneau S, Rozenblatt-Rosen O, Rotem A, Regev A, Choueiri TK, Van Allen EM. Tumor and immune reprogramming during immunotherapy in advanced renal cell carcinoma. Cancer Cell. 2021;39:649–661. doi: 10.1016/j.ccell.2021.02.015. - DOI - PMC - PubMed
Publication types
MeSH terms
Associated data
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
LinkOut - more resources
Full Text Sources
Other Literature Sources
Medical