Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Nov;14(11):1289-1302.
doi: 10.1111/jdi.14069. Epub 2023 Aug 22.

Machine learning for the prediction of atherosclerotic cardiovascular disease during 3-year follow up in Chinese type 2 diabetes mellitus patients

Affiliations

Machine learning for the prediction of atherosclerotic cardiovascular disease during 3-year follow up in Chinese type 2 diabetes mellitus patients

Jinru Ding et al. J Diabetes Investig. 2023 Nov.

Abstract

Aims/introduction: Clinical guidelines for the management of individuals with type 2 diabetes mellitus endorse the systematic assessment of atherosclerotic cardiovascular disease risk for early interventions. In this study, we aimed to develop machine learning models to predict 3-year atherosclerotic cardiovascular disease risk in Chinese type 2 diabetes mellitus patients.

Materials and methods: Clinical records of 4,722 individuals with type 2 diabetes mellitus admitted to 94 hospitals were used. The features included demographic information, disease histories, laboratory tests and physical examinations. Logistic regression, support vector machine, gradient boosting decision tree, random forest and adaptive boosting were applied for model construction. The performance of these models was evaluated using the area under the receiver operating characteristic curve. Additionally, we applied SHapley Additive exPlanation values to explain the prediction model.

Results: All five models achieved good performance in both internal and external test sets (area under the receiver operating characteristic curve >0.8). Random forest showed the highest discrimination ability, with sensitivity and specificity being 0.838 and 0.814, respectively. The SHapley Additive exPlanation analyses showed that previous history of diabetic peripheral vascular disease, older populations and longer diabetes duration were the three most influential predictors.

Conclusions: The prediction models offer opportunities to personalize treatment and maximize the benefits of these medical interventions.

Keywords: Atherosclerotic cardiovascular disease; Machine learning; Type 2 diabetes mellitus.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest. Linong Ji is an Editorial Board member of Journal of Diabetes Investigation and a co‐author of this article. To minimize bias, he was excluded from all editorial decision‐making related to the acceptance of this article for publication.

Approval of the research protocol: Protocol for the research project has been approved. Ethical approval was obtained by the ethics committees of Peking University People's Hospital and all participating hospitals, and it conforms to the provisions of the Declaration of Helsinki.

Informed consent: All patients signed the informed consent before data collection.

Registry and the registration no. of the study/trial: N/A.

Animal studies: N/A.

Figures

Figure 1
Figure 1
Framework for predicting atherosclerotic cardiovascular disease (ASCVD) risk in Chinese type 2 diabetes mellitus patients. AdaBoost, adaptive boosting; GBDT, gradient boosting decision tree; LR, logistic regression; RF, random forest; SHAP, SHapley Additive exPlanations; SVM, support vector machine; T2DM, type 2 diabetes mellitus.
Figure 2
Figure 2
Analysis overview for identifying the best‐performing atherosclerotic cardiovascular disease risk prediction model. 3BExt, 3BExt database.
Figure 3
Figure 3
Kernel density plot for (a) fasting plasma glucose (FPG), (b) glycosylated hemoglobin (HbA1c), (c) low‐density lipoprotein (LDL), (d) total cholesterol (TCHO), (e) triglycerides (TG) and (f) high‐density lipoprotein (HDL). ASCVD, atherosclerotic cardiovascular disease.
Figure 4
Figure 4
Receiver operating curves for the prediction of atherosclerotic cardiovascular disease using different machine learning models in the testing set. AdaBoost, adaptive boosting; GBDT, gradient boosting decision tree; LR, logistic regression; RF, random forest; SVM, support vector machine.
Figure 5
Figure 5
Receiver operating curves for the prediction of atherosclerotic cardiovascular disease using different machine learning models in the external validation set. AdaBoost, adaptive boosting; GBDT, gradient boosting decision tree; LR, logistic regression; RF, random forest; SVM, support vector machine.
Figure 6
Figure 6
Importance matrix plot of the random forest model. BMI, body mass index; FPG, fasting plasma glucose; HbA1c, glycosylated hemoglobin; HDL, high‐density lipoprotein cholesterol; LDL, low‐density lipoprotein cholesterol; TCHO, total cholesterol; TG, triglycerides.
Figure 7
Figure 7
SHapley Additive explanation (SHAP) summary plot of the 10 most predictive features of the RF model. HbA1c, glycosylated hemoglobin; TCHO, total cholesterol; TG, triglycerides.
Figure 8
Figure 8
SHapley Additive exPlanation (SHAP) dependence plot of the random forest model for (a) age, (b) diabetes duration, (c) triglycerides (TG) and (d) diagnostic age.

Similar articles

Cited by

References

    1. Magliano D, Boyko EJ. IDF Diabetes Atlas. Brussels, Belgium: International Diabetes Federation, 2021. - PubMed
    1. Ke C, Narayan KV, Chan JC, et al. Pathophysiology, phenotypes and management of type 2 diabetes mellitus in Indian and Chinese populations. Nat Rev Endocrinol 2022; 18: 413–432. - PMC - PubMed
    1. Draznin B, Aroda VR, Bakris G, et al. 10. Cardiovascular disease and risk management: Standards of medical Care in Diabetes‐2022. Diabetes Care 2022; 45(Supplement_1): S144–S174. - PubMed
    1. Arnett DK, Blumenthal RS, Albert MA, et al. 2019 ACC/AHA guideline on the primary prevention of cardiovascular disease: Executive summary: A report of the American College of Cardiology/American Heart Association task force on clinical practice guidelines. Circulation 2019; 140: e563–e595. - PMC - PubMed
    1. Petruzzo M, Reia A, Maniscalco GT, et al. The Framingham cardiovascular risk score and 5‐year progression of multiple sclerosis. Eur J Neurol 2021; 28: 893–900. - PubMed

MeSH terms