Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Multicenter Study
. 2025 Sep;8(9):e70302.
doi: 10.1002/cnr2.70302.

Development of a Machine Learning Model Integrating Pathomics and Clinical Data to Predict Axillary Lymph Node Metastasis in Breast Cancer: A Two-Center Study

Affiliations
Multicenter Study

Development of a Machine Learning Model Integrating Pathomics and Clinical Data to Predict Axillary Lymph Node Metastasis in Breast Cancer: A Two-Center Study

Long Wang et al. Cancer Rep (Hoboken). 2025 Sep.

Abstract

Background: Accurately assessing the status of axillary lymph nodes (ALNs) is essential for devising optimal surgical plans and making informed treatment decisions in breast cancer (BC) patients.

Aims: This study aims to develop an innovative nomogram based on pathomics to preoperatively predict ALN metastasis (ALNM) in BC.

Methods and results: Our study performed a retrospective analysis on digital hematoxylin and eosin (H&E)-stained images obtained from 407 patients across two institutions who were allocated into a training cohort (TC; n = 203), an internal validation cohort (IVC; n = 136), and an external validation cohort (EVC; n = 68). Initially, the Mann-Whitney U-test and Spearman's rank correlation coefficient were utilized for feature selection, employing the least absolute shrinkage and selection operator (LASSO) regression for further refinement. For the evaluation of the predictive value of ALNM and other clinicopathological factors, we deployed both univariate (ULR) and multivariate (MLR) logistic regression analyses. Among the six machine learning (ML) algorithms, logistic regression, which demonstrated the highest area under the curve (AUC) value, was employed to establish the final nomogram model. The nomogram reliability and stability were assessed by analyzing the AUC of the receiver operating characteristic (ROC) curve, decision curve analysis (DCA), and calibration plots. MLR analysis demonstrated estrogen receptor (ER), human epidermal growth factor receptor 2 (HER2), tumor size, and pathomics features as independent ALNM predictors. The nomogram demonstrated that the AUC in the IVC (0.783) surpassed that of the Path-score model (0.698) (DeLong test, p = 0.008558). Similarly, in the EVC, the nomogram surpassed the clinical model regarding AUC (0.738 vs. 0.574; DeLong test, p = 0.00494). Additionally, DCA analysis indicated a net clinical benefit associated with the nomogram.

Conclusion: Our study demonstrates the effectiveness of pathomics features in predicting ALNM in BC patients. Furthermore, the pathomics-based nomogram offers a valuable tool for personalized treatment planning in this patient population.

Keywords: axillary lymph node metastasis; breast cancer; machine learning; nomogram model; pathomics.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflicts of interest.

Figures

FIGURE 1
FIGURE 1
Flowchart of the overall construction process of the machine learning models predicting ALNM status. ALNM, axillary lymph node metastasis; IVC, internal validation cohort; EVC, external validation cohort; TC, training cohort.
FIGURE 2
FIGURE 2
Schematic illustration of the constructed pathomics signature. (A) Representative H&E tile selection. Scale bar: 500, 500, and 200 μm, respectively. (B) The usage of the chosen tile to extract the pathomics feature. Selection of the pathomics feature through (C) LASSO binary regression with (D) 10‐fold cross‐validation. (E) The pathomics feature coefficients are used to construct the Path‐score. (F) Calculation of the pathomics signature relying upon the chosen features. H&E, hematoxylin and eosin; LASSO, least absolute shrinkage and selection operator.
FIGURE 3
FIGURE 3
The distribution of pathological scores among all patients and across cohorts. (A) Pathomics score for each patient suffering from BC in the total cohort; distribution of pathomics score values of the ALN+/ALN− groups in the TC (B) and IVC (C). ALN, axillary lymph node; BC, breast cancer; IVC, internal validation cohort; TC, training cohort.
FIGURE 4
FIGURE 4
ROC curves of the six machine learning classifiers that predict the ALNM in the TC (A) and IVC (B). ALNM, axillary lymph node metastasis; IVC, internal validation cohort; ROC, receiver operating characteristic; TC, training cohort.
FIGURE 5
FIGURE 5
The developed nomogram based on the combined ER, HER2, tumor size, and Path‐score through logistic regression analysis. ER, estrogen receptor; HER2, human epidermal growth factor receptor 2.
FIGURE 6
FIGURE 6
Validation of the Path‐score‐based nomogram predictive value. The nomogram model's ROC curves in the (A) TC, (B) IVC, and (C) EVC. The nomogram model calibration plots in the (D) TC, (E) IVC, and (F) EVC. The net benefit of nomogram usage is shown by the decision curves in the (G) TC, (H) IVC, and (I) EVC. EVC, external validation cohort; IVC, internal validation cohort; ROC, receiver operating characteristic; TC, training cohort.
FIGURE 7
FIGURE 7
The ALNET: A simple program interface that predicts ALNM based on tumor size, ER status, HER2 status, and Path‐score. ALNET, Axillary Lymph Node Estimation Tool; ALNM, axillary lymph node metastasis; ER, estrogen receptor; HER2, human epidermal growth factor receptor 2.

Similar articles

References

    1. Siegel R. L., Miller K. D., Wagle N. S., and Jemal A., “Cancer Statistics, 2023,” CA: A Cancer Journal for Clinicians 73, no. 1 (2023): 17–48, 10.3322/caac.21763. - DOI - PubMed
    1. Peng Y., Liu M., Li X., et al., “Application of the ACOSOG Z0011 Criteria to Chinese Patients With Breast Cancer: A Prospective Study,” World Journal of Surgical Oncology 19, no. 1 (2021): 128, 10.1186/s12957-021-02242-1. - DOI - PMC - PubMed
    1. Barrio A. V., Montagna G., Mamtani A., et al., “Nodal Recurrence in Patients With Node‐Positive Breast Cancer Treated With Sentinel Node Biopsy Alone After Neoadjuvant Chemotherapy—A Rare Event,” JAMA Oncology 7, no. 12 (2021): 1851–1855, 10.1001/jamaoncol.2021.4394. - DOI - PMC - PubMed
    1. Tran H. T. and Jackson R. S., “ASO Author Reflections: Preoperative Nomogram Identifies Patients for Selective Omission of SLNB in Breast Cancer—To Promote Implementation of the Choosing Wisely Campaign Recommendations,” Annals of Surgical Oncology 27, no. 8 (2020): 2687–2688, 10.1245/s10434-020-08288-x. - DOI - PubMed
    1. Pilger T. L., Francisco D. F., and Candido Dos Reis F. J., “Effect of Sentinel Lymph Node Biopsy on Upper Limb Function in Women With Early Breast Cancer: A Systematic Review of Clinical Trials,” European Journal of Surgical Oncology 47, no. 7 (2021): 1497–1506, 10.1016/j.ejso.2021.01.024. - DOI - PubMed

Publication types