Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 May 21:12:1585823.
doi: 10.3389/fmed.2025.1585823. eCollection 2025.

Machine learning-based fusion model for predicting HER2 expression in breast cancer by Sonazoid-enhanced ultrasound: a multicenter study

Affiliations

Machine learning-based fusion model for predicting HER2 expression in breast cancer by Sonazoid-enhanced ultrasound: a multicenter study

Huiting Zhang et al. Front Med (Lausanne). .

Abstract

Purpose: To predict human epidermal growth factor receptor 2 (HER2) expression in breast cancer (BC) using Sonazoid-enhanced ultrasound in a machine learning-based model.

Materials and methods: Between August 2020 and February 2021, patients with breast cancer who underwent surgical treatment without neoadjuvant chemotherapy were prospectively enrolled from 17 hospitals in China. HER2 expression status was assessed by immunohistochemistry or fluorescence in situ hybridization (FISH). The training set contained data from 11 hospitals and the validation set contained 6 hospitals. Clinical features, B-mode ultrasound, contrast-enhanced ultrasound (CEUS), and time-intensity curve were selected by the Least Absolute Shrinkage and Selection Operator. Based on the selected features, six prediction models were established to predict HER2 3 + and 2 +/1 + expression: logistic regression (LR), support vector machine (SVM), random forest (RF), eXtreme Gradient Boosting (XGB), XGB combined with LR, and fusion model.

Results: A total of 140 patients with breast cancer were enrolled in this study. Seven features related to HER2 3 + and six features related to HER2 2+/1 + were selected to establish prediction models. Among the six models, LR, SVM, and XGB showed the best prediction performance for both HER2 3 + and HER2 2+/1 + cases. These three models were then combined into a fusion model. In the validation, the fusion model achieved the highest value of area under the receiver operating characteristic curve as 0.869 (95%CI: 0.715-0.958) for predicting HER2 3 + and 0.747 (95%CI: 0.548-0.891) for predicting HER2 2+/1 + cases. The model could correctly upgrade HER2 2 + cases to HER2 3 + cases, consistent with the FISH test results.

Conclusion: Sonazoid-enhanced ultrasound can provide effective guidance for targeted therapy of breast cancer by predicting HER2 expression using machine learning approaches.

Keywords: Sonazoid; breast cancer; human epidermal growth factor receptor 2; machine learning; ultrasound.

PubMed Disclaimer

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

Figure 1
Figure 1
Flowchart of study design. HER2: human epidermal growth factor receptor-2; IHC: immunohistochemistry; TIC: time intensity curve; CEUS: contrast-enhanced ultrasound. LR: logistic regression; SVM: support vector machine; RF: random forest; XGB: XGBoost.
Figure 2
Figure 2
CEUS and B-mode ultrasound images of a patient with HER2-positive breast cancer.
Figure 3
Figure 3
Feature selection in B-mode, CEUS, and TIC of the CEUS group by LASSO regression in 140 patients with breast cancer. (a,b) Selection of B-mode ultrasound and CEUS features. (c,d) Selection of TIC parameters.
Figure 4
Figure 4
Hard voting progression of the decision-level fusion model.
Figure 5
Figure 5
ROCs of the classifiers in predicting HER2-positive breast cancer based on B-mode ultrasound, CEUS, and TIC in the (a) training and (b) validation sets.
Figure 6
Figure 6
The ROCs of the six prediction models in predicting HER2 low expression patients in the (a) training and (b) validation sets.

Similar articles

References

    1. Qi X, Zhang L, Chen Y, Pi Y, Chen Y, Lv Q, et al. . Automated diagnosis of breast ultrasonography images using deep neural networks. Med Image Anal. (2019) 52:185–98. doi: 10.1016/j.media.2018.12.006, PMID: - DOI - PubMed
    1. Diéras V, Miles D, Verma S, Pegram M, Welslau M, Baselga J, et al. . Trastuzumab emtansine versus capecitabine plus lapatinib in patients with previously treated HER2-positive advanced breast cancer (EMILIA): a descriptive analysis of final overall survival results from a randomised, open-label, phase 3 trial. Lancet Oncol. (2017) 18:732–42. doi: 10.1016/S1470-2045(17)30312-1, PMID: - DOI - PMC - PubMed
    1. Krop IE, Kim SB, Martin AG, LoRusso PM, Ferrero JM, Badovinac-Crnjevic T, et al. . Trastuzumab emtansine versus treatment of physician's choice in patients with previously treated HER2-positive metastatic breast cancer (TH3RESA): final overall survival results from a randomised open-label phase 3 trial. Lancet Oncol. (2017) 18:743–54. doi: 10.1016/S1470-2045(17)30313-3, PMID: - DOI - PubMed
    1. von Minckwitz G, Procter M, de Azambuja E, Zardavas D, Benyunes M, Viale G, et al. . Adjuvant Pertuzumab and Trastuzumab in early HER2-positive breast Cancer. N Engl J Med. (2017) 377:122–31. doi: 10.1056/NEJMoa1703643, PMID: - DOI - PMC - PubMed
    1. Bitencourt AGV, Gibbs P, Rossi Saccarelli C, Daimiel I, Lo Gullo R, Fox MJ, et al. . MRI-based machine learning radiomics can predict HER2 expression level and pathologic response after neoadjuvant therapy in HER2 overexpressing breast cancer. EBioMedicine. (2020) 61:103042. doi: 10.1016/j.ebiom.2020.103042, PMID: - DOI - PMC - PubMed

LinkOut - more resources