Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Aug 18:16:1641037.
doi: 10.3389/fendo.2025.1641037. eCollection 2025.

A combined model integrating deep learning, radiomics, and clinical ultrasound features for predicting BRAF V600E mutation in papillary thyroid carcinoma with Hashimoto's thyroiditis

Affiliations

A combined model integrating deep learning, radiomics, and clinical ultrasound features for predicting BRAF V600E mutation in papillary thyroid carcinoma with Hashimoto's thyroiditis

Peng-Fei Zhu et al. Front Endocrinol (Lausanne). .

Abstract

Objective: This study aims to develop an integrated model that combines radiomics, deep learning features, and clinical and ultrasound characteristics for predicting BRAF V600E mutations in patients with papillary thyroid carcinoma (PTC) combined with Hashimoto's thyroiditis (HT).

Methods: This retrospective study included 717 thyroid nodules from 672 patients with PTC combined with HT from four hospitals in China. Deep learning and radiomics were employed to extract deep learning and radiomics features from ultrasound images. Feature selection was performed using Pearson's correlation coefficient, the Minimum Redundancy Maximum Relevance (mRMR) algorithm, and LASSO regression. The optimal algorithm was selected from nine machine learning algorithms for model construction, including the traditional radiomics model (RAD), the deep learning model (DL), and their fusion model (DL_RAD). Additionally, a final combined model was developed by integrating the DL_RAD model with clinical and ultrasound features. Model performance was assessed using AUC, calibration curves, and decision curve analysis (DCA), while SHAP analysis was used to interpret the contribution of each feature to the combined model's output.

Results: The combined model achieved superior diagnostic performance, with AUC values of 0.895, 0.864, and 0.815 in the training, validation, and external test sets, respectively, outperforming the RAD model, DL model, and RAD_DL model. DeLong test results indicated significant differences in the external test set (p<0.05). Further validation through calibration curves and DCA confirmed the model's robust performance. SHAP analysis revealed that RAD_DL signature, aspect ratio, extrathyroidal extension, and gender were key contributors to the model's predictions.

Conclusion: The combined model integrating radiomics, deep learning features, and clinical as well as ultrasound characteristics exhibits excellent diagnostic performance in predicting BRAF V600E mutations in patients with PTC coexisting with HT, highlighting its strong potential for clinical application.

Keywords: BRAF V600E mutation; Hashimoto’s thyroiditis; deep learning; papillary thyroid carcinoma; radiomics; ultrasound.

PubMed Disclaimer

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest. The author(s) declared that they were an editorial board member of Frontiers, at the time of submission.

Figures

Figure 1
Figure 1
The inclusion process of the study population.
Figure 2
Figure 2
Workflow diagram for the development and evaluation of predictive models. (A) Image preprocessing, radiomics and deep learning feature extraction, feature selection, feature fusion, and construction of the DL_RAD model. (B) Construction of the Combined model by integrating the DL_RAD signature with clinical and ultrasound features. Additionally, among the nine machine learning algorithms, random forest achieved the best performance. (C) Model evaluation and interpretation. LR, Logistic Regression; SVM, Support Vector Machine; RF, Random Forest; XGB, XGBoost; KNN, K-Nearest Neighbors; LGBM, LightGBM; ET, ExtraTrees; G, Gradient Boosting; MLP, Multilayer Perceptron.
Figure 3
Figure 3
Pearson correlation coefficient network diagram and heatmap. (A) The Pearson feature correlation network illustrated the relationships between each pair of selected features. (B) The Pearson correlation coefficient heatmap indicated that each feature acted as an independent predictor, as no correlation coefficient exceeded 0.5.
Figure 4
Figure 4
Performance evaluation of different models. (A-C) AUC curves for four models (RAD, DL, DL_RAD, Combined) across three sets. (D-F) Calibration curves for the four models across the three sets. (G-I) Decision curves for the four models across the three sets. (J-L) DeLong Test for the four models across the three sets.
Figure 5
Figure 5
(A) The SHAP summary plot illustrates the impact of each feature on the model’s prediction. The features include DL_RAD, aspect ratio, ETE, and gender. Higher SHAP values indicate a greater contribution of the corresponding feature to the prediction outcome. (B) Individual prediction explanation for a specific case, where the ultrasound image is shown on the left and the corresponding SHAP force plot on the right. The DL_RAD prediction value, aspect ratio, ETE, and gender are 0.33, 1.291, −1.231, and −0.403, respectively, contributing +0.13, +0.05, −0.02, and −0.01 to the malignant label decision. Among these, the DL_RAD prediction value and aspect ratio positively support the malignant prediction, whereas extrathyroidal extension and gender exert a negative influence. Summing these contributions with the expected value (E[f(X)] = 0.735) yields a final decision probability of 0.89. SHAP values represent absolute contributions to the predicted probability (i.e., additive changes from the base value), measured in probability units.

References

    1. Pizzato M, Li M, Vignat J, Laversanne M, Singh D, La Vecchia C, et al. The epidemiological landscape of thyroid cancer worldwide: GLOBOCAN estimates for incidence and mortality rates in 2020. Lancet Diabetes Endocrinol. (2022) 10:264–72. doi: 10.1016/S2213-8587(22)00035-3, PMID: - DOI - PubMed
    1. Caulley L, Eskander A, Yang W, Auh E, Cairncross L, Cho NL, et al. Trends in diagnosis of noninvasive follicular thyroid neoplasm with papillarylike nuclear features and total thyroidectomies for patients with papillary thyroid neoplasms. JAMA Otolaryngol Head Neck Surg. (2022) 148:1–8. doi: 10.1001/jamaoto.2021.3277, PMID: - DOI - PMC - PubMed
    1. Acuña-Ruiz A, Carrasco-López C, Santisteban P. Genomic and epigenomic profile of thyroid cancer. Best Pract Res Clin Endocrinol Metab. (2023) 37:101656. doi: 10.1016/j.beem.2022.101656, PMID: - DOI - PubMed
    1. Ho AS, Luu M, Barrios L, Chen I, Melany M, Ali N, et al. Incidence and mortality risk spectrum across aggressive variants of papillary thyroid carcinoma. JAMA Oncol. (2020) 6:706. doi: 10.1001/jamaoncol.2019.6851, PMID: - DOI - PMC - PubMed
    1. Yu J, Deng Y, Liu T, Zhou J, Jia X, Xiao T, et al. Lymph node metastasis prediction of papillary thyroid carcinoma based on transfer learning radiomics. Nat Commun. (2020) 11:4807. doi: 10.1038/s41467-020-18497-3, PMID: - DOI - PMC - PubMed

MeSH terms

Substances