Identifying diagnostic indicators for type 2 diabetes mellitus from physical examination using interpretable machine learning approach
- PMID: 38562414
- PMCID: PMC10982324
- DOI: 10.3389/fendo.2024.1376220
Abstract
Background: Identification of patients at risk for type 2 diabetes mellitus (T2DM) can not only prevent complications and reduce suffering but also ease the health care burden. While routine physical examination can provide useful information for diagnosis, manual exploration of routine physical examination records is not feasible due to the high prevalence of T2DM.
Objectives: We aim to build interpretable machine learning models for T2DM diagnosis and uncover important diagnostic indicators from physical examination, including age- and sex-related indicators.
Methods: In this study, we present three weighted diversity density (WDD)-based algorithms for T2DM screening that use physical examination indicators. The algorithms are highly transparent and interpretable, and two of them tolerate missing values.
Patients: We collected data on 43 physical examination indicators from 11,071 patients with T2DM and 126,622 healthy controls at the Affiliated Hospital of Southwest Medical University. After data processing, we used a data matrix containing 16,004 electronic health records (EHRs) and 43 clinical indicators for modeling.
Results: The indicators were ranked according to their model weights, and the top 25% of indicators were found to be directly or indirectly related to T2DM. We further investigated the clinical characteristics of different age and sex groups, and found that the algorithms can detect relevant indicators specific to these groups. The algorithms performed well in T2DM screening, with the highest area under the receiver operating characteristic curve (AUC) reaching 0.9185.
Conclusion: This work utilized the interpretable WDD-based algorithms to construct T2DM diagnostic models based on physical examination indicators. By modeling data grouped by age and sex, we identified several predictive markers related to age and sex, uncovering characteristic differences among various groups of T2DM patients.
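As an illustration of the two quantities reported in the Results (indicators ranked by model weight with the top 25% retained, and screening performance summarized as AUC), the following is a minimal sketch in plain Python. It is not the authors' WDD implementation: the function names, the use of absolute weight for ranking, and all indicator names and numbers are hypothetical and chosen only for demonstration.

```python
from typing import Dict, List, Sequence


def top_quartile_indicators(weights: Dict[str, float]) -> List[str]:
    """Rank indicators by absolute weight (an assumption) and keep the top 25%.

    At least one indicator is always returned.
    """
    ranked = sorted(weights, key=lambda name: abs(weights[name]), reverse=True)
    keep = max(1, len(ranked) // 4)
    return ranked[:keep]


def roc_auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the probability that a random case outscores a random control.

    labels: 1 for a T2DM case, 0 for a control. Tied scores count as half.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one case and one control")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


if __name__ == "__main__":
    # Hypothetical weights; not taken from the paper.
    demo_weights = {
        "indicator_a": 0.90,
        "indicator_b": -0.40,
        "indicator_c": 0.30,
        "indicator_d": 0.05,
    }
    print(top_quartile_indicators(demo_weights))

    # Hypothetical labels and model scores.
    demo_labels = [1, 1, 0, 0]
    demo_scores = [0.9, 0.4, 0.5, 0.1]
    print(roc_auc(demo_labels, demo_scores))
```

The pairwise AUC computation is quadratic in the number of records, which is fine for a small demonstration; for a dataset of the size described here, a rank-based formulation or a library routine would be the more practical choice.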
Keywords: diabetes; diabetes diagnosis; diabetic prediction; diagnostic indicator; health informatics; interpretable machine learning.
Copyright © 2024 Lv, Luo, Huang, Guo, Bai, Yan, Jiang, Zhang, Jing, Chen and Li.
Conflict of interest statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.