Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2024 Aug 6;6(3):lqae098.
doi: 10.1093/nargab/lqae098. eCollection 2024 Sep.

Evaluating cell type deconvolution in FFPE breast tissue: application to benign breast disease

Affiliations
Review

Evaluating cell type deconvolution in FFPE breast tissue: application to benign breast disease

Yuanhang Liu et al. NAR Genom Bioinform. .

Abstract

Transcriptome profiling using RNA sequencing (RNA-seq) of bulk formalin-fixed paraffin-embedded (FFPE) tissue blocks is a standard method in biomedical research. However, when used on tissues with diverse cell type compositions, it yields averaged gene expression profiles, complicating biomarker identification due to variations in cell proportions. To address the need for optimized strategies for defining individual cell type compositions from bulk FFPE samples, we constructed single-cell RNA-seq reference data for breast tissue and tested cell type deconvolution methods. Initial simulation experiments showed similar performances across multiple commonly used deconvolution methods. However, the introduction of FFPE artifacts significantly impacted their performances, with a root mean squared error (RMSE) ranging between 0.04 and 0.17. Scaden, a deep learning-based method, consistently outperformed the others, demonstrating robustness against FFPE artifacts. Testing these methods on our 62-sample RNA-seq benign breast disease cohort in which cell type composition was estimated using digital pathology approaches, we found that pre-filtering of the reference data enhanced the accuracy of most methods, realizing up to a 32% reduction in RMSE. To support further research efforts in this domain, we introduce SCdeconR, an R package designed for streamlined cell type deconvolution assessments and downstream analyses.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Overview of the study framework. The diagram illustrates three primary phases of our research: construction of scRNA-seq reference data for breast tissue; comparative analysis of deconvolution methods through simulation experiments; and evaluation of these methods on real data, encompassing both FFPE and FFzn samples (created with BioRender.com).
Figure 2.
Figure 2.
Comparative analysis of deconvolution methods using simulated experiments. The benchmarking process entailed three distinct simulation scenarios: (i) baseline simulations; (ii) incomplete references, wherein specific cell types were omitted from the reference data; and (iii) FFPE artifact simulation, grounded on our proprietary FFPE–FFzn sample pair data. (A) RMSE evaluations for three representative methods from each category: MG-based, SC-based and DL-based. (B) Heatmap showing cell type correlations based on scRNA-seq reference data. (C) Aggregated RMSE metrics spanning all three experimental situations for all deconvolution methods.
Figure 3.
Figure 3.
Performance assessment of deconvolution methods on real data from BBD FFPE samples. Predictions from each deconvolution methods were compared with CaseViewer annotations. (A) The analysis of gene-filtering stringency on the overall performance of deconvolution methods for all three cell types. Dot color indicates SCC with CaseViewer annotation. Dot size indicates the inverse of RMSE. The Y-axis represents the minimum requisite percentage of expressing cells per cell type. For example, a value of ‘0’ means no gene filtering was applied, while ‘40′ indicates that at least one cell type must have over 40% of its cells expressing the gene. (B) A scatterplot juxtaposing CaseViewer annotation and predictions from Scaden at 20% cutoff. Colors indicate the three major cell types evaluated. SCC and associated P-value were annotated at the top right. (C) Insight into each cell type’s correlation performance with CaseViewer estimates across deconvolution methods. Color and shape of the dot indicate SCC values for individual cell types. (D) Precision of cell type in terms of the inverse of RMSE. Color and shape of the dot indicate 1/RMSE values for individual cell types.
Figure 4.
Figure 4.
Overview of SCdeconR. SCdeconR provide functionalities to (i) assemble integrated scRNA-seq reference data, (ii) facilitate the creation of simulated experiments for evaluation of deconvolution methods and (iii) perform cell type deconvolution and downstream analyses, including DEA and GSEA. For those interested in utilizing or exploring SCdeconR, it is currently accessible from CRAN at https://CRAN.R-project.org/package=SCdeconR and GitHub at https://github.com/Liuy12/SCdeconR (created with BioRender.com).

References

    1. Hartmann L.C., Sellers T.A., Frost M.H., Lingle W.L., Degnim A.C., Ghosh K., Vierkant R.A., Maloney S.D., Pankratz V.S., Hillman D.W. et al. . Benign breast disease and the risk of breast cancer. N. Engl. J. Med. 2005; 353:229–237. - PubMed
    1. Rohan T.E., Miller C.A., Li T., Wang Y., Loudig O., Ginsberg M., Glass A., Mardis E. Somatic mutations in benign breast disease tissue and risk of subsequent invasive breast cancer. Br. J. Cancer. 2018; 118:1662–1664. - PMC - PubMed
    1. Winham S.J., Wang C., Heinzen E.P., Bhagwate A., Liu Y., McDonough S.J., Stallings-Mann M.L., Frost M.H., Vierkant R.A., Denison L.A. et al. . Somatic mutations in benign breast disease tissues and association with breast cancer risk. BMC Med. Genomics. 2021; 14:185. - PMC - PubMed
    1. Zeng Z., Vo A., Li X., Shidfar A., Saldana P., Blanco L., Xuei X., Luo Y., Khan S.A., Clare S.E. Somatic genetic aberrations in benign breast disease and the risk of subsequent breast cancer. NPJ Breast Cancer. 2020; 6:24. - PMC - PubMed
    1. Ludyga N., Grunwald B., Azimzadeh O., Englert S., Hofler H., Tapio S., Aubele M. Nucleic acids from long-term preserved FFPE tissues are suitable for downstream analyses. Virchows Arch. 2012; 460:131–140. - PubMed