Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Jan 19;15(3):634.
doi: 10.3390/cancers15030634.

Applying Explainable Machine Learning Models for Detection of Breast Cancer Lymph Node Metastasis in Patients Eligible for Neoadjuvant Treatment

Affiliations

Applying Explainable Machine Learning Models for Detection of Breast Cancer Lymph Node Metastasis in Patients Eligible for Neoadjuvant Treatment

Josip Vrdoljak et al. Cancers (Basel). .

Abstract

Background: Due to recent changes in breast cancer treatment strategy, significantly more patients are treated with neoadjuvant systemic therapy (NST). Radiological methods do not precisely determine axillary lymph node status, with up to 30% of patients being misdiagnosed. Hence, supplementary methods for lymph node status assessment are needed. This study aimed to apply and evaluate machine learning models on clinicopathological data, with a focus on patients meeting NST criteria, for lymph node metastasis prediction.

Methods: From the total breast cancer patient data (n = 8381), 719 patients were identified as eligible for NST. Machine learning models were applied for the NST-criteria group and the total study population. Model explainability was obtained by calculating Shapley values.

Results: In the NST-criteria group, random forest achieved the highest performance (AUC: 0.793 [0.713, 0.865]), while in the total study population, XGBoost performed the best (AUC: 0.762 [0.726, 0.795]). Shapley values identified tumor size, Ki-67, and patient age as the most important predictors.

Conclusion: Tree-based models achieve a good performance in assessing lymph node status. Such models can lead to more accurate disease stage prediction and consecutively better treatment selection, especially for NST patients where radiological and clinical findings are often the only way of lymph node assessment.

Keywords: breast cancer; lymph node metastasis; machine learning; neoadjuvant systemic treatment.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Figure 1
Study workflow.
Figure 2
Figure 2
ROC-curve for Random Forest trained on NST-criteria group. (ROC—receiver operating characteristic curve, AUC—area under the curve, SENS—sensitivity, SPEC—specificity, PPV—positive predictive value, NPV—negative predictive value).
Figure 3
Figure 3
ROC-curve for XGBoost trained on total study population. (ROC—receiver operating characteristic curve, AUC—area under the curve, SENS—sensitivity, SPEC—specificity, PPV—positive predictive value, NPV—negative predictive value).
Figure 4
Figure 4
Shapley values (feature importance) for NST group.
Figure 5
Figure 5
Dependency plot for Shapley values (NST criteria group).
Figure 6
Figure 6
Shapley values (feature importance) for total study population.
Figure 7
Figure 7
Dependency plot for Shapley values (total study population).

Similar articles

Cited by

References

    1. Siegel R.L., Miller K.D., Fuchs H.E., Jemal A. Cancer statistics, 2022. CA Cancer J. Clin. 2022;72:7–33. doi: 10.3322/caac.21708. - DOI - PubMed
    1. Cancer Today. [(accessed on 10 October 2022.)]. Available online: https://gco.iarc.fr/today/online-analysis-table?v=2020&mode=cancer&mode_....
    1. Centers for Disease Control and Prevention . Incidence and Relative Survival by Stage at Diagnosis for Common Cancers. USCS Data Brief, n.A., GA: Centers for Disease Control and Prevention, US Department of Health and Human Services; Washington, DC, USA: 2021.
    1. Nordenskjöld A.E., Fohlin H., Arnesson L.G., Einbeigi Z., Holmberg E., Albertsson P., Karlsson P. Breast cancer survival trends in different stages and age groups—A population-based study 1989–2013. Acta Oncol. 2019;58:45–51. doi: 10.1080/0284186X.2018.1532601. - DOI - PubMed
    1. Walsh M.F., Nathanson K.L., Couch F.J., Offit K. Genomic Biomarkers for Breast Cancer Risk. Adv. Exp. Med. Biol. 2016;882:1–32. - PMC - PubMed

LinkOut - more resources