Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Mar 9;18(3):e1009495.
doi: 10.1371/journal.pcbi.1009495. eCollection 2022 Mar.

Tumor cell intrinsic and extrinsic features predict prognosis in estrogen receptor positive breast cancer

Affiliations

Tumor cell intrinsic and extrinsic features predict prognosis in estrogen receptor positive breast cancer

Kevin Yao et al. PLoS Comput Biol. .

Abstract

Although estrogen-receptor-positive (ER+) breast cancer is generally associated with favorable prognosis, clinical outcome varies substantially among patients. Genomic assays have been developed and applied to predict patient prognosis for personalized treatment. We hypothesize that the recurrence risk of ER+ breast cancer patients is determined by both genomic mutations intrinsic to tumor cells and extrinsic immunological features in the tumor microenvironment. Based on the Cancer Genome Atlas (TCGA) breast cancer data, we identified the 72 most common genomic aberrations (including gene mutations and indels) in ER+ breast cancer and defined sample-specific scores that systematically characterized the deregulated pathways intrinsic to tumor cells. To further consider tumor cell extrinsic features, we calculated immune infiltration scores for six major immune cell types. Many individual intrinsic features are predictive of patient prognosis in ER+ breast cancer, and some of them achieved comparable accuracy with the Oncotype DX assay. In addition, statistical learning models that integrated these features predicts the recurrence risk of patients with significantly better performance than the Oncotype DX assay (our optimized random forest model AUC = 0.841, Oncotype DX model AUC = 0.792, p = 0.04). As a proof-of-concept, our study indicates the great potential of genomic and immunological features in prognostic prediction for improving breast cancer precision medicine. The framework introduced in this work can be readily applied to other cancers.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1. Schematic diagram of this study.
a Definition of gene signatures to recapitulate the pathways underlying driver genomic aberrations. Here we use three genomic aberrations (TP53 mutation, ERBB2 amplification and ATM deletion) as examples. In ER+ breast cancer, we defined a total of 72 gene signatures, each for a specific genomic aberration. b To predict patient prognosis in ER+ breast cancer, we constructed prediction models to integrate the 72 gene signatures (intrinsic features), 6 types of infiltrating immune cells (extrinsic features), and clinical factors (e.g., age, tumor stage). Gene signature scores and immune cell scores were calculated based on gene expression of tumor samples. Random Forest models were used to classify good versus poor prognosis, and Cox regression models were used to predict prognostic risk scores.
Fig 2
Fig 2. Gene signatures recapitulate the downstream pathways of mutated driver genes.
a Spearman correlation coefficients (Correlation) between different gene signatures defined based on the TCGA ER+ breast cancer data. Signature scores can distinguish ER+ breast cancer samples with TP53_mut (b), ERBB2_amp (c), PIK3CA_mut (d), and GATA3_mut (e) from samples without the aberrations. b, c were based on the Curtis data; (d) was based on GSE41994; and (e) was based on GSE101780. ROC curves showing that TP53_mut (f) and ERBB2_amp (g) signature scores can predict the mutation status of their respective driver genomic aberration.
Fig 3
Fig 3. Gene signatures and immune infiltration scores predict patient prognosis.
a-c Signature scores for TP53_mut (a), TNFRSF17_amp (b), and MemB (c) distinguish patients with good and poor prognosis. d ROC curves showing that TP53_mut, TNFRSF17_amp and MemB infiltration score predicts prognosis at a comparable level to Onco-score. e AUC scores of random forest models with different combinations of predictive features. Our Sig+Imm model performs with higher AUC scores than the Onco-score models with and without clinical features. f Relative importance of the top 20 most important genomic aberration and immune infiltration features.
Fig 4
Fig 4. Optimized model outperforms Oncotype DX risk scores for prognostic prediction.
a Results from backward selection to find an optimized set of features—AUC score of the model plotted as a function of the number of features removed. The optimized model is chosen as the highest AUC score at the smallest number of features. b ROC curves of the performance of our optimized model as compared to the Onco-score + Cli model and just the Cli model, showing that our optimized model overperforms both Oncotype DX and clinical features. c ROC curves of our optimized model when trained in Curtis discovery and validated in Curtis validation, and vise versa. d ROC curves of our optimized model when trained in Curtis discovery and validated in the test dataset, the Ur-Rehman dataset, and vise versa. e ROC curves of the Onco+Cli model trained and validated in the same way as (d), showing decreased performance compared to our optimized model.
Fig 5
Fig 5. Individual signature and immune infiltration scores can identify prognostic patient groups.
Signature and immune infiltration scores fitted in a univariate Cox proportional hazards model without clinical adjustment (a) and with clinical adjustment (b) are associated with survival time. Patients are significantly dichotomized by their TP53_mut (c), TNFRSF17_amp (d), and MemB (e) score. f Risk groups dichotomized by the median Onco-score have lower or comparable significance to some of our signature and immune infiltration scores.
Fig 6
Fig 6. Optimized Cox regression model for prognostic risk prediction.
a Patients in the Curtis validation dataset are significantly grouped by their risk as predicted by the optimized model trained in the Curtis discovery dataset. b Patients are still significantly dichotomized by their risk when clinical variables are removed from the optimized model. c Onco class achieves slightly lower performance than our optimized model without clinical information in grouping patient risk categories. d-f Our optimized model is able to further stratify the Onco high (d), intermediate (e), and low (f) risk classes.

Similar articles

References

    1. Siegel RL, Miller KD, Fuchs HE, Jemal A. Cancer Statistics, 2021. CA: A Cancer Journal for Clinicians. 2021;71(1):7–33. 10.3322/caac.21654. - DOI - PubMed
    1. Lumachi F, Santeufemia DA, Basso SM. Current medical treatment of estrogen receptor-positive breast cancer. World journal of biological chemistry. 2015;6(3):231. doi: 10.4331/wjbc.v6.i3.231 - DOI - PMC - PubMed
    1. Henry N, Shah P, Haider I, Freer P, Jagsi R, Sabel M. Chapter 88: cancer of the breast. Abeloff’s clinical oncology, 6th edn Elsevier, Philadelphia, PA. 2020.
    1. Jerusalem G, Bachelot T, Barrios C, Neven P, Di Leo A, Janni W, et al.. A new era of improving progression-free survival with dual blockade in postmenopausal HR+, HER2− advanced breast cancer. Cancer treatment reviews. 2015;41(2):94–104. doi: 10.1016/j.ctrv.2014.12.011 - DOI - PubMed
    1. Nielsen TO, Parker JS, Leung S, Voduc D, Ebbert M, Vickery T, et al.. A comparison of PAM50 intrinsic subtyping with immunohistochemistry and clinical prognostic factors in tamoxifen-treated estrogen receptor–positive breast cancer. Clinical cancer research. 2010;16(21):5222–32. doi: 10.1158/1078-0432.CCR-10-1282 - DOI - PMC - PubMed

Publication types