Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Mar 31;14(3):391-404.
doi: 10.21037/gs-24-440. Epub 2025 Mar 26.

Development and validation of a semi-automatic radiomics ensemble model for preoperative evaluation of breast masses in mammotome-assisted minimally invasive resection

Affiliations

Development and validation of a semi-automatic radiomics ensemble model for preoperative evaluation of breast masses in mammotome-assisted minimally invasive resection

Zhenfeng Huang et al. Gland Surg. .

Abstract

Background: Accurate preoperative differentiation of breast masses is critical for guiding individualized treatment strategies in Mammotome-assisted minimally invasive resection. While radiomics shows promise, existing methods rely on manual delineation, which is time-consuming and subjective. This study developed an ultrasound-based semi-automatic segmentation ensemble model to improve preoperative assessment.

Methods: We retrospectively analyzed preoperative ultrasound images from 773 patients (543 tumors, 230 non-tumors). Semi-automatic segmentation was performed using DeepLabv3_ResNet50 and fully convolutional network (FCN)_ResNet50. Radiomic and deep transfer learning (DTL) features were extracted to construct radiomic, deep learning, and combined models. An ensemble strategy integrated these with clinical models. Performance was evaluated via receiver operating characteristic (ROC) curves and decision curve analysis (DCA).

Results: The cohort included 543 tumor patients and 230 non-tumor patients (95 adenosis, 135 other benign lesions). The semi-automatic segmentation model, DeepLabv3_ResNet50, achieved a peak global accuracy of 99.4% and an average Dice coefficient of 92.0% at its best epoch. On the other hand, the FCN_ResNet50 model exhibited a peak global accuracy of 99.5% and an average Dice coefficient of 93.7% at its best epoch. In the task of predicting tumor and non-tumor patients, age, maximum diameter, and BI-RADS (Breast Imaging Reporting and Data System) classification were ultimately identified as key indicators, and the stacking model ultimately demonstrated an area under the curve (AUC) of 0.890 in the training cohort (with a sensitivity of 0.844 and a specificity of 0.815) and an AUC of 0.780 in the testing cohort (with a sensitivity of 0.713 and a specificity of 0.739). In the task of predicting adenosis and other lesion types, focus emerged as a crucial factor, and the stacking model achieved an AUC of 0.813 in the training cohort (with a sensitivity of 0.613 and a specificity of 0.859) and an AUC of 0.771 in the testing cohort (with a sensitivity of 0.759 and a specificity of 0.765).

Conclusions: Our study has established an ensemble learning model grounded in semi-automatic segmentation techniques. This model accurately distinguishes between tumor and non-tumor patients preoperatively, as well as discriminating adenosis from other lesion types among the non-tumor cohort, thus providing valuable insights for individualized treatment planning. The proposed stacking model demonstrates significant clinical utility by reducing unnecessary biopsies and saving diagnostic time compared to manual review. These improvements directly address the challenges of overtreatment and diagnostic delays in breast lesion management. By enhancing preoperative accuracy, our model supports tailored surgical planning and alleviates patient anxiety associated with indeterminate diagnoses.

Keywords: Breast lesion; deep learning; mammotome; radiomics; ultrasound (US).

PubMed Disclaimer

Conflict of interest statement

Conflicts of Interest: All authors have completed the ICMJE uniform disclosure form (available at https://gs.amegroups.com/article/view/10.21037/gs-24-440/coif). The authors have no conflicts of interest to declare.

Figures

Figure 1
Figure 1
Binary classification tasks of the study.
Figure 2
Figure 2
The study design and workflow of model development. ASPP, atrous spatial pyramid pooling; Conv, convolutional layer; DCNN, deep convolutional neural network; DTL, deep transfer learning; FCN, fully convolutional network.
Figure 3
Figure 3
Mammotome-assisted minimally invasive resection guided by ultrasound. (A) Under ultrasound guidance, the needle was placed underneath the deep surface of the breast lump, and then the rotating cutter was opened to position the breast lump precisely in the needle’s channel. (B) Closing the rotating cutter to remove the aspirated pathological tissue. The green arrow indicates a breast lesion, and the orange arrow indicates the rotating cutter.
Figure 4
Figure 4
Training process of the segmentation models. (A) Epoch-average loss of DeepLabv3. (B) Epoch-mean Dice of DeepLabv3. (C) Epoch-average loss of FCN. (D) Epoch-mean Dice of FCN. FCN, fully convolutional network.
Figure 5
Figure 5
Radiomics models and deep learning models of DeepLabv3 and FCN. (A) ROC curve of DeepLabv3-radiomics model; (B) ROC curve of DeepLabv3-deep learning model; (C) ROC curve of FCN-radiomics model; (D) ROC curve of FCN-deep learning model. AUC, area under the ROC curve; CI, confidence interval; FCN, fully convolutional network; GBM, gradient boosting machine; KNN, K-nearest neighbors; LR, logistic regression; MLP, multi-layer perceptron; ROC, receiver operating characteristic; SVM, support vector machine.
Figure 6
Figure 6
ROC curve of different models. (A) ROC curve of DeepLabv3 deep learning radiomics model; (B) ROC curve of FCN deep learning radiomics model; (C) ROC curve of stacking model in training cohort; (D) ROC curve of stacking model in testing cohort. Combined 1: DeepLabv3 deep learning radiomics model; combined 2: FCN deep learning radiomics model. AUC, area under the ROC curve; CI, confidence interval; FCN, fully convolutional network; GBM, gradient boosting machine; KNN, K-nearest neighbors; LR, logistic regression; MLP, multi-layer perceptron; ROC, receiver operating characteristic; SVM, support vector machine.
Figure 7
Figure 7
Decision curve analysis curves comparing tumor versus non-tumor prediction models. Plots show the decision curves of clinical model, DeepLabv3 deep learning radiomics model (combined 1), FCN deep learning radiomics model (combined 2), and the stacking model in the training (A) and testing cohorts (B). FCN, fully convolutional network.
Figure 8
Figure 8
ROC curves of different models. (A) ROC curve of DeepLabv3 deep learning radiomics model; (B) ROC curve of FCN deep learning radiomics model; (C) ROC curve of stacking model in training cohort; (D) ROC curve of stacking model in testing cohort. Combined 3: DeepLabv3 deep learning radiomics model; combined 4: FCN deep learning radiomics model. AUC, area under the ROC curve; CI, confidence interval; FCN, fully convolutional network; GBM, gradient boosting machine; KNN, K-nearest neighbors; LR, logistic regression; MLP, multi-layer perceptron; ROC, receiver operating characteristic; SVM, support vector machine.
Figure 9
Figure 9
Decision curve analysis curves comparing adenosis versus other type of lesions prediction models. Plots show the decision curves of clinical model, DeepLabv3 deep learning radiomics model (combined 3), FCN deep learning radiomics model (combined 4), and the stacking model (nomogram) in the training (A) and testing cohorts (B). FCN, fully convolutional network.

Similar articles

References

    1. Sung H, Ferlay J, Siegel RL, et al. Global Cancer Statistics 2020: GLOBOCAN Estimates of Incidence and Mortality Worldwide for 36 Cancers in 185 Countries. CA Cancer J Clin 2021;71:209-49. 10.3322/caac.21660 - DOI - PubMed
    1. Harbeck N, Gnant M. Breast cancer. Lancet 2017;389:1134-50. 10.1016/S0140-6736(16)31891-8 - DOI - PubMed
    1. Xiong X, Zheng LW, Ding Y, et al. Breast cancer: pathogenesis and treatments. Signal Transduct Target Ther 2025;10:49. 10.1038/s41392-024-02108-4 - DOI - PMC - PubMed
    1. Ohuchi N, Suzuki A, Sobue T, et al. Sensitivity and specificity of mammography and adjunctive ultrasonography to screen for breast cancer in the Japan Strategic Anti-cancer Randomized Trial (J-START): a randomised controlled trial. Lancet 2016;387:341-8. 10.1016/S0140-6736(15)00774-6 - DOI - PubMed
    1. Tadesse GF, Tegaw EM, Abdisa EK. Diagnostic performance of mammography and ultrasound in breast cancer: a systematic review and meta-analysis. J Ultrasound 2023;26:355-67. 10.1007/s40477-022-00755-3 - DOI - PMC - PubMed

LinkOut - more resources