Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Jul 9;20(7):e0327564.
doi: 10.1371/journal.pone.0327564. eCollection 2025.

Clinical prediction of intravenous immunoglobulin-resistant Kawasaki disease based on interpretable Transformer model

Affiliations

Clinical prediction of intravenous immunoglobulin-resistant Kawasaki disease based on interpretable Transformer model

Gahao Chen et al. PLoS One. .

Abstract

Intravenous immunoglobulin (IVIG) has been established as the first-line therapy for Kawasaki disease (KD). However, approximately 10%-20% of pediatric patients exhibit IVIG resistance. Current machine learning (ML) models demonstrate suboptimal predictive performance in KD treatment response prediction, primarily due to their limited ability to effectively process categorical variables and interpret tabular clinical data. This study aims to develop and interpretable transformer-based clinical prediction model for IVIG resistant KD and validate its clinical utility. This retrospective study analyzed clinical records of KD patients from the Affiliated Hospital of North Sichuan Medical College (Nanchong, China) between January 1, 2014 and December 31, 2024. A cohort of 1,578 pediatric KD cases was systematically divided into training and validation sets. Six machine learning algorithms - Random Forest (RF), AdaBoost, Light Gradient Boosting Machine (LightGBM), eXtreme Gradient Boosting (XGBoost), Categorical Boosting (CatBoost), and Tabular Prior-data Fitted Network version 2.0 (TabPFN-V2) - were implemented with five-fold cross-validation to optimize model hyperparameters. Model performance was rigorously evaluated using seven metrics: accuracy, precision, recall, F1-score, Matthews correlation coefficient (MCC), area under the receiver operating characteristic (ROC-AUC), and area under the precision-recall curve (PR-AUC). The top-performing model was subsequently subjected to interpretability analysis through Shapley Additive Explanations (SHAP) to elucidate feature contributions. The transformer-based TabPFN-V2 model demonstrated superior predictive performance in KD analysis, achieving an impressive validation set accuracy of 0.97. Comprehensive evaluation metrics confirmed its robust performance: precision 0.98, recall 0.97, F1-score 0.98, MCC 0.95, ROC-AUC 0.99, and PR-AUC 0.99. Global interpretability analysis through kernel SHAP methodology identified the ten most influential predictive features ranked by significance: Coronary artery lesions (CAL), Aspartate aminotransferase (AST), C-reactive protein (CRP), whether it was incomplete KD (KDtype), Neutrophil count (N), Platelet count (PLT), Albumin (ALB), age, White blood cell count (WBC) and Hemoglobin (Hb). Local interpretability analysis revealed distinct correlation patterns with IVIG resistance:AST, CRP, and N demonstrated significant positive correlations, where elevated values corresponded to increased IVIG resistance risk; PLT and ALB showed negative correlations, with higher levels associated with reduced resistance probability. Notably, age and WBC parameters demonstrated threshold effects, where optimal cutoff values enabled re-calibration of single-variable predictive scores. This threshold-dependent relationship suggests potential clinical utility in risk stratification protocols.The TabPFN-V2 model, leveraging an interpretable transformer architecture, demonstrates dual clinical utilities in KD management: (1) accurate prediction of IVIG resistance risk, and (2) data-driven support for personalized therapeutic decision-making. This framework enables probabilistic estimation of treatment resistance likelihood while providing transparent feature contribution analyses essential for developing patient-specific management protocols.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1. Methodology block diagram.
Fig 2
Fig 2. TabPFN model.
(A) Overview of TabPFN synthetic pre-training. (B) TabPFN architecture.
Fig 3
Fig 3. Comparison of AUC for six ML models.
(A) Random Forest, (B) AdaBoost, (C) LightGBM, (D) XGBoost, (E) CatBoost, (F) TabPFN-V2.
Fig 4
Fig 4. Comparison of PRC for six ML models.
(A) Random Forest, (B) AdaBoost, (C) LightGBM, (D) XGBoost, (E) CatBoost, (F) TabPFN-V2.
Fig 5
Fig 5. The global interpretability based on kernel SHAP under the TabPFN-V2 model.
CA, coronary artery lesions; AST, aspartate transaminase; CRP, C-reactive protein; KDtype, incomplete Kawasaki disease; N, Neutrophil count; PLT, platelet count; ALB, albumin; ALT, alanine aminotransferase; WBC, white blood cell; Hb, hemoglobin; TP, total protein. ESR, erythrocyte sedimentation rate; SHAP, SHapley Additive exPlanations.
Fig 6
Fig 6. Feature dependence plots.
(A) The X-axis represents the range of characteristic values for AST features, and the Y-axis represents the shap value of AST features, which will affect the output of the model. (B) Feature dependence plots of CRP. (C) Feature dependence plots of N. AST, aspartate transaminase; CRP, C-reactive protein; N, Neutrophil count.
Fig 7
Fig 7. Feature dependence plots.
(A) The X-axis represents the range of characteristic values for PLT features, and the Y-axis represents the shap value of PLT features, which will affect the output of the model. (B) Feature dependence plots of ALB. PLT, platelet count; ALB, albumin.
Fig 8
Fig 8. Feature dependence plots.
(A) The X-axis represents the range of characteristic values for age features, and the Y-axis represents the shap value of age features, which will affect the output of the model. (B) Feature dependence plots of WBC. WBC, white blood cell.
Fig 9
Fig 9. Feature dependence plots.
(A) The X-axis represents the range of characteristic values for ALT features, and the Y-axis represents the shap value of ALT features, which will affect the output of the model. (B) Feature dependence plots of Hb. (C) Feature dependence plots of TP. (D) Feature dependence plots of ESR. ALT, alanine aminotransferase; Hb, hemoglobin; TP, total protein. ESR, erythrocyte sedimentation rate.

Similar articles

References

    1. Kuo H-C, Liu S-F, Lin P-X, Yang KD, Lin B-S. Near infrared spectroscopy detects change of tissue hemoglobin and water Levelsin Kawasaki disease and coronary artery lesions. Children (Basel). 2022;9(3):299. doi: 10.3390/children9030299 - DOI - PMC - PubMed
    1. Matsuguma C, Wakiguchi H, Suzuki Y, Okada S, Furuta T, Ohnishi Y, et al. Dynamics of immunocyte activation during intravenous immunoglobulin treatment in Kawasaki disease. Scand J Rheumatol. 2019;48(6):491–6. doi: 10.1080/03009742.2019.1604992 - DOI - PubMed
    1. Shaanxi Provincial Diagnosis and Treatment Center of Kawasaki Disease, Clinical Research Center for Childhood Diseases of Shaanxi Province, Children’s Hospital of Shaanxi Provincial People’s Hospital, Editorial Board of Chinese Journal of Contemporary Pediatrics, Expert Committee of Advanced Training for Pediatrician, China Maternal andChildren’s Health Association, General Pediatrician Group, Society of Pediatrician, Chinese Doctor Association. Shaanxi provincial diagnosis and treatment center of Kawasaki Disease. Zhongguo Dang Dai Er Ke Za Zhi. 2021;23(9):867–76. doi: 10.7499/j.issn.1008-8830.2107110 - DOI - PMC - PubMed
    1. Kuo H-C, Lin M-C, Kao C-C, Weng K-P, Ding Y, Han Z, et al. Intravenous immunoglobulin alone for coronary artery lesion treatment of kawasaki disease: a randomized clinical trial. JAMA Netw Open. 2025;8(4):e253063. doi: 10.1001/jamanetworkopen.2025.3063 - DOI - PMC - PubMed
    1. Lim YT, Kwon JE, Kim YH. Evaluating the performance of egami, Kobayashi and sano scores in predicting IVIG resistance in infant Kawasaki disease. BMC Pediatr. 2024;24(1):606. doi: 10.1186/s12887-024-05035-z - DOI - PMC - PubMed

Substances