Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Feb 13;9(2):e13289.
doi: 10.1016/j.heliyon.2023.e13289. eCollection 2023 Feb.

Application of machine learning in Chinese medicine differentiation of dampness-heat pattern in patients with type 2 diabetes mellitus

Affiliations

Application of machine learning in Chinese medicine differentiation of dampness-heat pattern in patients with type 2 diabetes mellitus

Xinyu Liu et al. Heliyon. .

Abstract

Background: China has become the country with the largest number of people with type 2 diabetes mellitus (T2DM), and Chinese medicine (CM) has unique advantages in preventing and treating T2DM, while accurate pattern differentiation is the guarantee for proper treatment.

Objective: The establishment of the CM pattern differentiation model of T2DM is helpful to the pattern diagnosis of the disease. At present, there are few studies on dampness-heat pattern differentiation models of T2DM. Therefore, we establish a machine learning model, hoping to provide an efficient tool for the pattern diagnosis of CM for T2DM in the future.

Methods: A total of 1021 effective samples of T2DM patients from ten CM hospitals or clinics were collected by a questionnaire including patients' demographic and dampness-heat-related symptoms and signs. All information and the diagnosis of the dampness-heat pattern of patients were completed by experienced CM physicians at each visit. We applied six machine learning algorithms (Artificial Neural Network [ANN], K-Nearest Neighbor [KNN], Naïve Bayes [NB], Support Vector Machine [SVM], Extreme Gradient Boosting [XGBoost] and Random Forest [RF]) and compared their performance. And then we also utilized Shapley additive explanation (SHAP) method to explain the best performance model.

Results: The XGBoost model had the highest AUC (0.951, 95% CI 0.925-0.978) among the six models, with the best sensitivity, accuracy, F1 score, negative predictive value, and excellent specificity, precision, and positive predictive value. The SHAP method based on XGBoost showed that slimy yellow tongue fur was the most important sign in dampness-heat pattern diagnosis. The slippery pulse or rapid-slippery pulse, sticky stool with ungratifying defecation also performed an important role in this diagnostic model. Furthermore, the red tongue acted as an important tongue sign for the dampness-heat pattern.

Conclusion: This study constructed a dampness-heat pattern differentiation model of T2DM based on machine learning. The XGBoost model is a tool with the potential to help CM practitioners make quick diagnosis decisions and contribute to the standardization and international application of CM patterns.

Keywords: Dampness-heat pattern; Diagnostic model; Machine learning; Pattern differentiation.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Figures

Fig. 1
Fig. 1
Flow diagram of patient selection and model building.
Fig. 2a
Fig. 2a
Evaluation of the six machine learning algorithms based on the AUC of the ROC curve. AUC, area under the curve; ROC, receiver operating characteristic; 2b P-R curve and AP of the six models. P-R curve: precision-recall curve; AP: average precision.
Fig. 3
Fig. 3
Shapley additive explanation (SHAP) values to show the interpretability of the effects of variables as the input factors for the diagnosis of the dampness-heat pattern of T2DM. (a) SHAP values for all 1021 patients in the train set. (b, c) SHAP values of two typical patients from the positive group (b) and the negative group (c) are illustrated with their most important variables.
Fig. 4a
Fig. 4a
Importance matrix plot of the XGBoost model, depicting the importance of each symptom and signs for diagnosing dampness-heat pattern in T2DM.4b SHAP summary plot of the 14 symptoms and signs of the XGBoost model. There is one dot per patient per symptom or sign colored according to an attribution value, where red represents a higher value and blue represents a lower value.

References

    1. International Diabetes Federation . tenth ed. Brussels, Belgium; 2021. IDF Diabetes Atlas.https://www.diabetesatlas.org Available at:
    1. Pranata R., Henrina J., Raffaello W.M., Lawrensia S., Huang I. Diabetes and COVID-19: the past, the present, and the future. Metabolism. 2021 Aug;121 doi: 10.1016/j.metabol.2021.154814. PMID: 34119537. - DOI - PMC - PubMed
    1. Sardu C., Gargiulo G., Esposito G., Paolisso G., Marfella R. Impact of diabetes mellitus on clinical outcomes in patients affected by Covid-19. Cardiovasc. Diabetol. 2020 Jun 11;19(1):76. doi: 10.1186/s12933-020-01047-y. PMID: 32527257. - DOI - PMC - PubMed
    1. Li Y., Teng D., Shi X., Qin G., Qin Y., Quan H., et al. Prevalence of diabetes recorded in mainland China using 2018 diagnostic criteria from the American Diabetes Association. national cross sectional study. BMJ. 2020:369. doi: 10.1136/bmj.m997. Apr 28. m997. PMID: 32345662. - DOI - PMC - PubMed
    1. World Health Organization . World Health Organization; Geneva: 2022. WHO Expert Meeting on Evaluation of Traditional Chinese Medicine in the Treatment of COVID-19.https://www.who.int/publications/m/item/who-expert-meeting-on-evaluation... Available from:

LinkOut - more resources