Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Sep 4:4:PO.20.00016.
doi: 10.1200/PO.20.00016. eCollection 2020.

Systematic Assessment of Tumor Purity and Its Clinical Implications

Affiliations

Systematic Assessment of Tumor Purity and Its Clinical Implications

Syed Haider et al. JCO Precis Oncol. .

Abstract

Purpose: The tumor microenvironment is complex, comprising heterogeneous cellular populations. As molecular profiles are frequently generated using bulk tissue sections, they represent an admixture of multiple cell types (including immune, stromal, and cancer cells) interacting with each other. Therefore, these molecular profiles are confounded by signals emanating from many cell types. Accurate assessment of residual cancer cell fraction is crucial for parameterization and interpretation of genomic analyses, as well as for accurately interpreting the clinical properties of the tumor.

Materials and methods: To benchmark cancer cell fraction estimation methods, 10 estimators were applied to a clinical cohort of 333 patients with prostate cancer. These methods include gold-standard multiobserver pathology estimates, as well as estimates inferred from genome, epigenome, and transcriptome data. In addition, two methods based on genomic and transcriptomic profiles were used to quantify tumor purity in 4,497 tumors across 12 cancer types. Bulk mRNA and microRNA profiles were subject to in silico deconvolution to estimate cancer cell-specific mRNA and microRNA profiles.

Results: We present a systematic comparison of 10 tumor purity estimation methods on a cohort of 333 prostate tumors. We quantify variation among purity estimation methods and demonstrate how this influences interpretation of clinico-genomic analyses. Our data show poor concordance between pathologic and molecular purity estimates, necessitating caution when interpreting molecular results. Limited concordance between DNA- and mRNA-derived purity estimates remained a general pan-cancer phenomenon when tested in an additional 4,497 tumors spanning 12 cancer types.

Conclusion: The choice of tumor purity estimation method may have a profound impact on the interpretation of genomic assays. Taken together, these data highlight the need for improved assessment of tumor purity and quantitation of its influences on the molecular hallmarks of cancers.

PubMed Disclaimer

Conflict of interest statement

The following represents disclosure information provided by authors of this manuscript. All relationships are considered compensated unless otherwise noted. Relationships are self-held unless noted. I = Immediate Family Member, Inst = My Institution. Relationships may not relate to the subject matter of this manuscript. For more information about ASCO’s conflict of interest policy, please refer to www.asco.org/rwc or ascopubs.org/po/author-center. Open Payments is a public database containing information reported by companies about payments made to US-licensed physicians (Open Payments). Peter J. ParkHonoraria: Pfizer Consulting or Advisory Role: Neuroinflammation Newco Patents, Royalties, Other Intellectual Property: Patent on mutational signature-based detection of homologous recombination deficiencyPeter W. LairdConsulting or Advisory Role: Progenity, AnchorDx Patents, Royalties, Other Intellectual Property: Received royalties annually through 2018 for inventions licensed to Epigenomics AG by USC Travel, Accommodations, Expenses: AnchorDxWenyi WangStock and Other Ownership Interests: Genomic HealthFrancesca DemichelisPatents, Royalties, Other Intellectual Property: Co-inventor on a patent filed by the University of Michigan and the Brigham and Women’s Hospital covering the diagnostic and therapeutic fields for ETS fusions in prostate cancer. The diagnostic field has been licensed to Gen-Probe.Paul C. BoutrosConsulting or Advisory Role: BioSymetrics Patents, Royalties, Other Intellectual Property: Holds patents on multiple biomarkers No other potential conflicts of interest were reported.

Figures

FIG 1.
FIG 1.
Purity landscape in The Cancer Genome Atlas (TCGA) prostate cancer cohort (PRAD). (A) Distribution of TCGA prostate tumor purity estimates (n = 333) using in silico methods and consolidated multiobserver pathology reviews; (B) Patient-wise purity estimates grouped by Gleason score. Gray represents missing data, including both failed estimates and missing molecular profiles (see Methods for details). Columns were clustered using Ward hierarchical clustering method. Data from INTEGER were available for 107 samples using the low-pass DNA sequencing data; (C) Pearson correlation between purity estimates inferred using in silico methods and pathology reviews. Rows and columns were clustered using Ward hierarchical clustering method.
FIG 2.
FIG 2.
Deviation of pathologist-inferred tumor purity from in silico estimates. Difference between pathology estimates of tumor purity and in silico estimates from DNA and mRNA abundance profiles. P, pathology estimates; R, Pearson’s correlation coefficient; PR, statistical significance of observed correlation.
FIG 3.
FIG 3.
Molecular correlates of tumor purity. Genomic correlates of tumor purity as summarized using androgen receptor (AR) signature score (A), percent genome altered ([PGA], B), and mutation burden (C). Correlation statistic was estimated using Pearson correlation. (D) Purity estimates stratified by prostate cancer–specific driver mutations and ERG fusions. log2FC represents difference in mean purity (log2 scale) between mutant and wild-type samples (ERG represents ERG fusions). Statistical significance was estimated using Wilcoxon rank sum test, and P values were adjusted for multiple comparisons using the Benjamini–Hochberg method. Statistical tests were performed for genes with more than three mutant samples. Therefore, IDH1, RB1, AKT1, and CHD1 (displayed with “x”) were deemed inappropriate for statistical testing. (E) Correlation between purity estimates and variant allele frequency of mutant samples. Correlation statistic was estimated using Pearson correlation, and P values were adjusted for multiple comparisons using the Benjamini–Hochberg method. For reliable correlation estimates, genes (in panel 3D) with more than 10 mutant samples were considered for estimating correlation with tumor purity. FDR, false discovery rate; miRNA, microRNA.
FIG 4.
FIG 4.
Deconvolved prostate cancer profiles, and DNA- and mRNA-derived purity estimates across The Cancer Genome Atlas (TCGA) cancer types. (A) Correlation between purity estimates derived using pathology, DNA, mRNA, and microRNA (miRNA) profiles and molecular profiles (mRNA.naive = bulk mRNA abundance, mRNA.ISOpure = deconvolved mRNA abundance, miRNA.naive = bulk miRNA abundance, miRNA.ISOpure = deconvolved miRNA abundance, and CNA = bulk copy number data; deconvolved RNA profiles were generated using ISOpure). Each feature (genes for mRNA and copy number aberration [CNA] profiles, miRNAs for miRNA profiles) was correlated with tumor purity estimators (pathology, DNA, RNA, miRNA) separately. The x-axis represents number of purity estimators where a feature was found to be significantly correlated (Spearman’s |ρ| > 0.3, false discovery rate–adjusted P < .01). (B) Distribution of tumor purity estimates across 13 TCGA tumor types (4,830 tumors) using an in silico DNA-based (ASCAT) and mRNA-based (ISOpure) method. “Mean” estimate indicates combined mean of purity estimates from ASCAT and ISOpure. “Pearson’s R” indicates correlation between ASCAT and ISOpure estimates. “n” shows total number of samples with valid estimates available for both ASCAT and ISOpure.

References

    1. Hanahan D, Weinberg RA. Hallmarks of cancer: The next generation. Cell. 2011;144:646–674. - PubMed
    1. Aran D, Sirota M, Butte AJ. Systematic pan-cancer analysis of tumour purity. Nat Commun. 2015;6:8971. - PMC - PubMed
    1. Junttila MR, de Sauvage FJ. Influence of tumour micro-environment heterogeneity on therapeutic response. Nature. 2013;501:346–354. - PubMed
    1. Fox NS, Haider S, Harris AL, et al. Landscape of transcriptomic interactions between breast cancer and its microenvironment. Nat Commun. 2019;10:3116. - PMC - PubMed
    1. Smits AJ, Kummer JA, de Bruin PC, et al. The estimation of tumor cell percentage for molecular testing by pathologists is not accurate. Mod Pathol. 2014;27:168–174. - PubMed