Explainable machine learning model and nomogram for predicting the efficacy of Traditional Chinese Medicine in treating Long COVID: a retrospective study
- PMID: 40182854
- PMCID: PMC11966431
- DOI: 10.3389/fmed.2025.1529993
Explainable machine learning model and nomogram for predicting the efficacy of Traditional Chinese Medicine in treating Long COVID: a retrospective study
Abstract
Introduction: Long COVID significantly affects patients' quality of life, yet no standardized treatment has been established. Traditional Chinese Medicine (TCM) presents a promising potential approach with targeted therapeutic strategies. This study aims to develop an explainable machine learning (ML) model and nomogram to identify Long COVID patients who may benefit from TCM, enhancing clinical decision-making.
Methods: We analyzed data from 1,331 Long COVID patients treated with TCM between December 2022 and February 2024 at three hospitals in Zhejiang, China. Effectiveness was defined as improvement in two or more symptoms or a minimum 2-point increase in the Traditional Chinese Medicine Syndrome Score (TCMSS). Data included 11 patient and disease characteristics, 18 clinical symptoms and syndrome scores, and 12 auxiliary examination indicators. The least absolute shrinkage and selection operator (LASSO) method identified features linked to TCM efficacy. Data from 1,204 patients served as the training set, while 127 patients formed the testing set.
Results: We employed five ML algorithms: Support Vector Machine (SVM), Random Forest (RF), K-Nearest Neighbors (KNN), Extreme Gradient Boosting (XGBoost), and Neural Network (NN). The XGBoost model achieved an Area Under the Curve (AUC) of 0.9957 and an F1 score of 0.9852 in the training set, demonstrating superior performance in the testing set with an AUC of 0.9059 and F1 score of 0.9027. Key features identified through SHapley Additive exPlanations (SHAP) included chest tightness, aversion to cold, age, TCMSS, Short Form (36) Health Survey (SF-36), C-reactive protein (CRP), and lymphocyte ratio. The logistic regression-based nomogram demonstrated an AUC of 0.9479 and F1 score of 0.9384 in the testing set.
Conclusion: This study utilized multicenter data and multiple ML algorithms to create a ML model for predicting TCM efficacy in Long COVID treatment. Furthermore, a logistic regression-based nomogram was developed to assist the model and improve decision-making efficiency in TCM applications for Long COVID management.
Keywords: Long COVID; SHapley Additive exPlanations; Traditional Chinese Medicine; efficacy; machine learning; nomogram.
Copyright © 2025 Zhang, Chen, Zhang, Yang, Ma, Meng, Wu, Zhu, Zhang, Lin and Lin.
Conflict of interest statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Figures








Similar articles
-
Machine learning algorithms for predicting COVID-19 mortality in Ethiopia.BMC Public Health. 2024 Jun 28;24(1):1728. doi: 10.1186/s12889-024-19196-0. BMC Public Health. 2024. PMID: 38943093 Free PMC article.
-
A risk prediction model based on machine learning algorithm for parastomal hernia after permanent colostomy.BMC Med Inform Decis Mak. 2024 Aug 8;24(1):224. doi: 10.1186/s12911-024-02627-8. BMC Med Inform Decis Mak. 2024. PMID: 39118122 Free PMC article.
-
Development of a Prediction Model and Corresponding Scoring Table for Postherpetic Neuralgia Using Six Machine Learning Algorithms: A Retrospective Study.Pain Ther. 2024 Aug;13(4):883-907. doi: 10.1007/s40122-024-00612-7. Epub 2024 Jun 4. Pain Ther. 2024. PMID: 38834881 Free PMC article.
-
An interpreting machine learning models to predict amputation risk in patients with diabetic foot ulcers: a multi-center study.Front Endocrinol (Lausanne). 2025 Mar 25;16:1526098. doi: 10.3389/fendo.2025.1526098. eCollection 2025. Front Endocrinol (Lausanne). 2025. PMID: 40201760 Free PMC article.
-
Constructing a predictive model for early-onset sepsis in neonatal intensive care unit newborns based on SHapley Additive exPlanations explainable machine learning.Transl Pediatr. 2024 Nov 30;13(11):1933-1946. doi: 10.21037/tp-24-278. Epub 2024 Nov 26. Transl Pediatr. 2024. PMID: 39649648 Free PMC article.
References
LinkOut - more resources
Full Text Sources
Research Materials
Miscellaneous