Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 May 12:16:1537386.
doi: 10.3389/fendo.2025.1537386. eCollection 2025.

Predicting central lymph node metastasis in papillary thyroid microcarcinoma: a breakthrough with interpretable machine learning

Affiliations

Predicting central lymph node metastasis in papillary thyroid microcarcinoma: a breakthrough with interpretable machine learning

Weijun Zhou et al. Front Endocrinol (Lausanne). .

Abstract

Objective: To develop and validate an interpretable machine learning (ML) model for the preoperative prediction of central lymph node metastasis (CLNM) in papillary thyroid microcarcinoma (PTMC).

Methods: From December 2016 to December 2023, we retrospectively analyzed 710 PTMC patients who underwent thyroidectomies. Feature selection was conducted using the least absolute shrinkage and selection operator (LASSO) regression method, alongside the Support Vector Machine-Recursive Feature Elimination (SVM-RFE) algorithm in conjunction with multivariate logistic regression. Eight ML algorithms, namely Decision Tree, Random Forest (RF), K-nearest neighbors, Support vector machine, Extreme Gradient Boosting, Naive Bayes, Logistic regression, and Light Gradient Boosting machine, were developed for the prediction of CLNM. The performance of these models was evaluated using area under the receiver operating characteristic curve (AUC), decision curve analysis (DCA), sensitivity, specificity, accuracy, positive predictive value (PPV), negative predictive value (NPV), and F1 scores. Additionally, the Shapley Additive Explanation (SHAP) algorithm was utilized to clarify the results of the optimal ML model.

Results: The results indicated that 32.95% of the patients (234/710) presented with CLNM. Tumor diameter, multifocality, lymph nodes identified via ultrasound (US-LN), and extrathyroidal extension (ETE) were identified as independent predictors of CLNM. The RF model achieved the highest performance in the validation set with an AUC of 0.893(95%CI: 0.846-0.940), accuracy of 0.832, sensitivity of 0.764, specificity of 0.866, PPV of 0.743, NPV of 0.879, and F1-score of 0.753. Furthermore, the DCA demonstrated that the RF model exhibited a superior clinical net benefit.

Conclusion: Our model predicted the risk of CLNM in PTMC patients with high accuracy preoperatively.

Keywords: SHapley Additive exPlanation; central lymph node metastasis; diagnostic imaging; machine learning; papillary thyroid microcarcinoma.

PubMed Disclaimer

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

Figure 1
Figure 1
Patient flowchart for this study. CLND, central lymph node dissection; PTC, papillary thyroid carcinoma; PTMC, papillary thyroid microcarcinoma; CLNM, central lymph node metastasis.
Figure 2
Figure 2
Artificial intelligence workflow and study flowchart. DT, decision tree; RF, random forest; SVM, support vector machine; XGBoost, Extreme Gradient Boosting; KNN, k-nearest neighbors; LR, logistic regression; LightGBM, light gradient boosting machine; NBM, naive bayes model; SHAP, Shapley Additive Explanation.
Figure 3
Figure 3
Feature selection was performed by the Least Absolute Shrinkage and Selection Operator (LASSO) regression and Support Vector Machine Recursive Feature Elimination (SVM-RFE). (A) Coefficients derived from LASSO regression. (B) The range of optimal values was identified by the LASSO model. (C) Four optimal features were chosen by the LASSO. (D) Five optimal features were chosen by the SVW-RFE. US-LN, cervical lymph nodes status based on ultrasound; ETE, extrathyroidal extension; TSH, thyroid-stimulating hormone.
Figure 4
Figure 4
Presents a comparative analysis of various machine learning models employed for the prediction of cervical lymph node metastasis (CLNM) in papillary thyroid microcarcinoma (PTMC) patients. (A) ROC curves of the eight model in the validation set. (B) ROC curves evaluate the RF model and US-LN through AUC scores. (C) Decision curve analysis in the validation set. ROC, receiver operating characteristic; AUC, area under the ROC curve; DT, decision tree; KNN, k-nearest neighbors; LightGBM, light gradient boosting machine; LR, logistic regression; NBM, naive bayes model; RF, random forest; SVW, support vector machine; XGBoost, Extreme Gradient Boosting.
Figure 5
Figure 5
Effectiveness evaluation of RF prediction models (A) Confusion matrix of RF in the training set. (B) Confusion matrix of the RF model in the validation set. X-axis represents the model prediction, y-axis represents the real situation, and the values in the box are the number of samples. (C) Standard deviation of feature importance in the RF model.
Figure 6
Figure 6
Shapley Additive Explanation (SHAP) of the model. (A) Summary plots for the validation sets with associated SHAP values. Each point represents a SHAP value for a patient’s characteristic. (B) SHAP force plot for a PTMC patient without CLMN. (C) SHAP force plot for a PTMC patient with CLMN. US-LN, cervical lymph nodes status based on ultrasound; ETE, extrathyroidal extension.

Similar articles

References

    1. Sung H, Ferlay J, Siegel RL, Laversanne M, Soerjomataram I, Jemal A, et al. . Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA: A Cancer J Clin. (2021) 71:209–49. doi: 10.3322/caac.21660 - DOI - PubMed
    1. Cabanillas ME, McFadden DG, Durante C. Thyroid cancer. Lancet (London England). (2016) 388:2783–95. doi: 10.1016/s0140-6736(16)30172-6 - DOI - PubMed
    1. Lim H, Devesa SS, Sosa JA, Check D, Kitahara CM. Trends in thyroid cancer incidence and mortality in the United States, 1974-2013. Jama. (2017) 317:1338–48. doi: 10.1001/jama.2017.2719 - DOI - PMC - PubMed
    1. Wang K, Xu J, Li S, Liu S, Zhang L. Population-based study evaluating and predicting the probability of death resulting from thyroid cancer among patients with papillary thyroid microcarcinoma. Cancer Med. (2019) 8:6977–85. doi: 10.1002/cam4.2597 - DOI - PMC - PubMed
    1. Wang J, Yu F, Shang Y, Ping Z, Liu L. Thyroid cancer: incidence and mortality trends in China, 2005-2015. Endocrine. (2020) 68:163–73. doi: 10.1007/s12020-020-02207-6 - DOI - PubMed

Supplementary concepts

LinkOut - more resources