Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Jan 18;10(2):e24876.
doi: 10.1016/j.heliyon.2024.e24876. eCollection 2024 Jan 30.

Clinical decision support tool for breast cancer recurrence prediction using SHAP value in cooperative game theory

Affiliations

Clinical decision support tool for breast cancer recurrence prediction using SHAP value in cooperative game theory

Ying Liu et al. Heliyon. .

Abstract

Background: Recurrence remains the primary cause of death in patients with breast cancer. Although machine learning can efficiently predict the prognosis of breast cancer patients, the black-box nature of the model may result in a lack of evidence for clinicians when making critical decisions.

Methods: In this study, our main objective was twofold: (1) to develop a clinical decision support tool for predicting the prognosis of breast cancer and (2) to identify and explore the key factors that influence breast cancer recurrence. To achieve this, we employed an explainable ensemble learning method called Shapley additive explanation (SHAP), which leverages cooperative game theory. Using real-world data from 1629 breast cancer patients, we analyzed and uncovered the key factors associated with breast cancer recurrence. Subsequently, we used these identified factors to create a recurrence prediction model and establish a decision mechanism for the tool. The proposed method not only provides accurate recurrence predictions but also offers transparent explanations for these predictions.

Results: By utilizing four key factors, namely, tumor size, clinical stage III, number of lymph node metastases, and age, our decision support tool for predicting breast cancer recurrence achieved significant improvements. The extra-tree model exhibited an increased area under the receiver operating characteristic curve (AUC) of 0.97, while the Random Forest model demonstrated an improved AUC of 0.96. We also offer a decision mechanism for a recurrence prediction model based on the identified key factors. This transparent and interpretable decision-making process facilitated by our explainable ensemble learning model enhances trust and promotes its applicability in clinical settings.

Conclusions: The proposed explainable ensemble learning method shows promising results in predicting breast cancer recurrence, outperforming existing methods with high accuracy and transparency. This advancement has the potential to significantly improve clinical decision-making and patient outcomes in breast cancer treatment.

Keywords: Breast cancer; Cancer recurrence; Explainable ensemble learning; SHAP.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Figures

Fig. 1
Fig. 1
Overview of study design.
Fig. 2
Fig. 2
ROC curves of five ensemble learning models to predict breast cancer recurrence. Red, orange, green, blue, and purple represent the five models of AdaBoost, Bagging, Extra Trees, Gradient Boosting, and Random Forest, respectively. (For interpretation of the references to color in this figure legend, the reader is referred to the Web version of this article.)
Fig. 3
Fig. 3
Comparison of five ensemble learning models in predicting breast cancer recurrence. ns: p≤1.00e+00, *: 1.00e-02 < p≤5.00e-02, **: 1.00e-03 < p≤1.00e-02, ***: 1.00e-04 < p≤1.00e-03, ****: p≤1.00e-04.
Fig. 4
Fig. 4
Key features identified using the SHAP value analysis are represented by the colors red and blue, indicating labels 0 and 1, respectively. The horizontal axis represents the average SHAP value, while the vertical axis represents different clinical features. In this context, the term “Menstruation" refers to the age of menarche. “Basel-like" represents the Basel Type in molecular subtyping. “Total lymph nodes" refers to the total number of metastatic lymph nodes. (For interpretation of the references to color in this figure legend, the reader is referred to the Web version of this article.)
Fig. 5
Fig. 5
Visualization of decision tree for predicting breast cancer recurrence using key clinical features. Yellow represents the non-recurrence patients, and green represents the recurrence population. Clinical state III, tumor size, age, and total lymph nodes are the reference indicators for decision-making. An arrow with ‘≤’ denotes that the classification is below the threshold of random classification decision, while an arrow with ‘>’ indicates it is above the threshold of classification decision. (For interpretation of the references to color in this figure legend, the reader is referred to the Web version of this article.)
figs1
figs1
figs2
figs2

Similar articles

Cited by

References

    1. Coleman M.P., Quaresma M., Berrino F., Lutz J.-M., De Angelis R., Capocaccia R., et al. Cancer survival in five continents: a worldwide population-based study (CONCORD) Lancet Oncol. 2008;9:730–756. - PubMed
    1. Siegel R.L., Miller K.D., Fuchs H.E., Jemal A. Cancer statistics, 2021. CA A Cancer J. Clin. 2021;71:7–33. - PubMed
    1. Vicini F.A., Sharpe M., Kestin L., Martinez A., Mitchell C.K., Wallace M.F., et al. Optimizing breast cancer treatment efficacy with intensity-modulated radiotherapy. Int. J. Radiat. Oncol. Biol. Phys. 2002;54:1336–1344. - PubMed
    1. Shulman L.N., Willett W., Sievers A., Knaul F.M. Breast cancer in developing countries: opportunities for improved survival. JAMA Oncol. 2010;2010 - PMC - PubMed
    1. Kim J.-Y., Lee Y.S., Yu J., Park Y., Lee S.K., Lee M., et al. Deep learning-based prediction model for breast cancer recurrence using adjuvant breast cancer Cohort in tertiary cancer center Registry. Front. Oncol. 2021;11 - PMC - PubMed

LinkOut - more resources