Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Jun 28;17(6):106682.
doi: 10.4329/wjr.v17.i6.106682.

Non-contrast computed tomography radiomics model to predict benign and malignant thyroid nodules with lobe segmentation: A dual-center study

Affiliations

Non-contrast computed tomography radiomics model to predict benign and malignant thyroid nodules with lobe segmentation: A dual-center study

Hao Wang et al. World J Radiol. .

Abstract

Background: Accurate preoperative differentiation of benign and malignant thyroid nodules is critical for optimal patient management. However, conventional imaging modalities present inherent diagnostic limitations.

Aim: To develop a non-contrast computed tomography-based machine learning model integrating radiomics and clinical features for preoperative thyroid nodule classification.

Methods: This multicenter retrospective study enrolled 272 patients with thyroid nodules (376 thyroid lobes) from center A (May 2021-April 2024), using histopathological findings as the reference standard. The dataset was stratified into a training cohort (264 lobes) and an internal validation cohort (112 lobes). Additional prospective temporal (97 lobes, May-August 2024, center A) and external multicenter (81 lobes, center B) test cohorts were incorporated to enhance generalizability. Thyroid lobes were segmented along the isthmus midline, with segmentation reliability confirmed by an intraclass correlation coefficient (≥ 0.80). Radiomics feature extraction was performed using Pearson correlation analysis followed by least absolute shrinkage and selection operator regression with 10-fold cross-validation. Seven machine learning algorithms were systematically evaluated, with model performance quantified through the area under the receiver operating characteristic curve (AUC), Brier score, decision curve analysis, and DeLong test for comparison with radiologists interpretations. Model interpretability was elucidated using SHapley Additive exPlanations (SHAP).

Results: The extreme gradient boosting model demonstrated robust diagnostic performance across all datasets, achieving AUCs of 0.899 [95% confidence interval (CI): 0.845-0.932] in the training cohort, 0.803 (95%CI: 0.715-0.890) in internal validation, 0.855 (95%CI: 0.775-0.935) in temporal testing, and 0.802 (95%CI: 0.664-0.939) in external testing. These results were significantly superior to radiologists assessments (AUCs: 0.596, 0.529, 0.558, and 0.538, respectively; P < 0.001 by DeLong test). SHAP analysis identified radiomic score, age, tumor size stratification, calcification status, and cystic components as key predictive features. The model exhibited excellent calibration (Brier scores: 0.125-0.144) and provided significant clinical net benefit at decision thresholds exceeding 20%, as evidenced by decision curve analysis.

Conclusion: The non-contrast computed tomography-based radiomics-clinical fusion model enables robust preoperative thyroid nodule classification, with SHAP-driven interpretability enhancing its clinical applicability for personalized decision-making.

Keywords: Machine learning; Non-contrast computed tomography; Papillary thyroid carcinoma; Radiomics; Thyroid nodules.

PubMed Disclaimer

Conflict of interest statement

Conflict-of-interest statement: All the authors report no relevant conflicts of interest for this article.

Figures

Figure 1
Figure 1
Study inclusion flowchart. CT: Computed tomography; PTC: Papillary thyroid carcinoma; TA: Thyroid adenoma; NG: Nodular goiter.
Figure 2
Figure 2
Workflow of model development. CT: Computed tomography; PTC: Papillary thyroid carcinoma; TA: Thyroid adenoma; NG: Nodular goiter.
Figure 3
Figure 3
Least absolute shrinkage and selection operator feature selection and comparison. A: Least absolute shrinkage and selection operator feature selection plot, where the horizontal axis represents the logarithm of the regularization parameter (λ), and the vertical axis denotes the corresponding feature weights. As λ increases, feature weights progressively decrease until exclusion at zero; B: 10-fold cross-validation curve, where the vertical axis represents the cross-validated mean squared error. The λ-value at the lowest point (λ min) is identified as the optimal regularization parameter, balancing model complexity and prediction error; C: 27 radiomic features and their corresponding weight coefficients retained after least absolute shrinkage and selection operator regression; D: Correlation heatmap of the 27 selected radiomic features, with darker colors indicating stronger correlations.
Figure 4
Figure 4
Radiomic score distribution across cohorts. A: Comparison of radiomic score (Radscore) between benign and malignant lesions in the training; B: Comparison of Radscore between benign and malignant lesions in the validation; C: Comparison of Radscore between benign and malignant lesions in the temporal test; D: Comparison of Radscore between benign and malignant lesions in the external test cohorts. aP < 0.05; bP < 0.001.
Figure 5
Figure 5
Performance evaluation of seven machine learning models. A: Receiver operating characteristic curves of different models across the training, validation, temporal test, and external test cohorts (from left to right). The area under the receiver operating characteristic curve quantifies classification performance, with higher area under the receiver operating characteristic curve values indicating superior discrimination; B: Calibration curves for different models in the training, validation, temporal test, and external test cohorts (from left to right). These curves assess the agreement between predicted probabilities and actual outcomes, with lower Brier scores indicating better calibration, as reflected by proximity to the 45° diagonal line; C: Decision curve analysis in the training, validation, temporal test, and external test cohorts (from left to right), evaluating the clinical net benefit of different models across probability thresholds. Overall, the extreme gradient boosting model exhibits consistent and superior performance across multiple cohorts. DT: Decision trees; KNM: K-nearest neighbors; LR: Logistic regression; RF: Random forests; LGBM: Light gradient boosting machines; SVM: Support vector machines; XGB: Extreme gradient boosting; AUC: Area under the receiver operating characteristic curve.
Figure 6
Figure 6
SHapley Additive exPlanations analysis and clinical application. A: Honeycomb and bar plot ranking SHapley Additive exPlanations values by feature importance; B: Case illustration of a 67-year-old female with an 8 mm right thyroid lobe nodule, pathologically confirmed as nodular goiter; C: Case illustration of a 43-year-old female with a 4 mm right thyroid lobe nodule, pathologically confirmed as papillary thyroid carcinoma. SHAP: SHapley Additive exPlanations.

References

    1. Han B, Zheng R, Zeng H, Wang S, Sun K, Chen R, Li L, Wei W, He J. Cancer incidence and mortality in China, 2022. J Natl Cancer Cent. 2024;4:47–53. - PMC - PubMed
    1. Li X, Jian J, Zhang A, Xiang JM, Huang J, Chen Y. The role of immune cells and immune related genes in the tumor microenvironment of papillary thyroid cancer and their significance for immunotherapy. Sci Rep. 2024;14:18125. - PMC - PubMed
    1. Kim H, Kwon H, Moon BI. Association of Multifocality With Prognosis of Papillary Thyroid Carcinoma: A Systematic Review and Meta-analysis. JAMA Otolaryngol Head Neck Surg. 2021;147:847–854. - PMC - PubMed
    1. Shao L, Wang Z, Dong W, Sun W, Zhang H. Risk factors associated with preferential lateral lymph node metastasis in papillary thyroid carcinoma. Cancer Med. 2023;12:20670–20676. - PMC - PubMed
    1. Alexander EK, Cibas ES. Diagnosis of thyroid nodules. Lancet Diabetes Endocrinol. 2022;10:533–539. - PubMed

LinkOut - more resources