Predicting isolated impaired glucose tolerance without oral glucose tolerance test using machine learning in Chinese Han men
- PMID: 40343071
- PMCID: PMC12058868
- DOI: 10.3389/fendo.2025.1514397
Predicting isolated impaired glucose tolerance without oral glucose tolerance test using machine learning in Chinese Han men
Abstract
Background: Isolated Impaired Glucose Tolerance (I-IGT) represents a specific prediabetic state that typically requires a standardized oral glucose tolerance test (OGTT) for diagnosis. This study aims to predict glucose tolerance status in Chinese Han men at fasting state using machine learning (ML) models with demographic, anthropometric, and laboratory data.
Methods: The study population consisted of 1,117 Chinese Han men aged 50-87 years. Baseline variables including age, fasting plasma glucose (FPG), high blood pressure (HBP), body mass index (BMI), waist to hip ratio (WHR), total cholesterol (TC), triglyceride (TG), high-density lipoprotein cholesterol (HDL-C), and low-density lipoprotein cholesterol (LDL-C) were collected from electronic medical records (EMRs) for machine learning model training and validation. Support Vector Machine (SVM), Decision Tree (DT), Random Forest (RF), Logistic Regression (LR), K-Nearest Neighbors (KNN), Naive Bayes (NB), Adaptive Boosting (AdaBoost) and Gradient Boosting Machines (GBM) were tested for machine learning model performance comparison. Model performance was evaluated using metrics including accuracy, recall, F1 score, positive predictive value (PPV), negative predictive value (NPV), and the area under the receiver operating characteristic curve (AUC). Shapley Additive Explanations (SHAP) and confusion matrix plots were used for model interpretation.
Results: The RF model demonstrated the best overall performance with a 96.7% accuracy, recall of 91.4%, F1 score of 95.7%, PPV of 99.1%, and NPV of 95.6%. The AUC values for the SVM, DT, RF, LR, KNN, NB, AdaBoost, and GBM models were 0.97, 0.92, 0.96, 0.97, 0.88, 0.88, 0.97, and 0.97, respectively. While the RF model showed strong overall performance, the LR model had the highest AUC, indicating superior discriminatory power. FPG was identified as the most important predictor for I-IGT, followed by HDL, TC, HBP, BMI, and WHR. Individuals with FPG levels higher than 5.1 mmol/L were more likely to have I-IGT; the performance metrics for this cut-off value were: 89.35% accuracy, 89.79% recall, 85.22% F1 score, 81.09% PPV, 94.38% NPV, and 0.95 AUC.
Conclusion: Machine learning models based on demographic and clinical characteristics offer a cost-effective method for predicting I-IGT in Chinese Han men aged over 50, without the need for an OGTT. These models could complement existing early diagnostic strategies, thereby enhancing the early detection and prevention of diabetes. Additionally, FPG alone could serve as an efficient screening tool for the early identification of I-IGT in clinical settings.
Keywords: fasting plasma glucose; isolated impaired glucose tolerance; machine learning models; oral glucose tolerance test; pre-diabetes.
Copyright © 2025 Wang, Xie, Gu, Miao, Ma, Yan, Gong, Li, Sun and Ruan.
Conflict of interest statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Figures






Similar articles
-
A Risk Prediction Model for Physical Restraints Among Older Chinese Adults in Long-term Care Facilities: Machine Learning Study.J Med Internet Res. 2023 Apr 6;25:e43815. doi: 10.2196/43815. J Med Internet Res. 2023. PMID: 37023416 Free PMC article.
-
Study on risk factors of impaired fasting glucose and development of a prediction model based on Extreme Gradient Boosting algorithm.Front Endocrinol (Lausanne). 2024 Sep 24;15:1368225. doi: 10.3389/fendo.2024.1368225. eCollection 2024. Front Endocrinol (Lausanne). 2024. PMID: 39381443 Free PMC article.
-
Prediction and feature selection of low birth weight using machine learning algorithms.J Health Popul Nutr. 2024 Oct 12;43(1):157. doi: 10.1186/s41043-024-00647-8. J Health Popul Nutr. 2024. PMID: 39396025 Free PMC article.
-
Comparative assessment of the capability of machine learning-based radiomic models for predicting omental metastasis in locally advanced gastric cancer.Sci Rep. 2024 Jul 13;14(1):16208. doi: 10.1038/s41598-024-66979-x. Sci Rep. 2024. PMID: 39003337 Free PMC article.
-
Artificial intelligence in clinical care amidst COVID-19 pandemic: A systematic review.Comput Struct Biotechnol J. 2021;19:2833-2850. doi: 10.1016/j.csbj.2021.05.010. Epub 2021 May 7. Comput Struct Biotechnol J. 2021. PMID: 34025952 Free PMC article. Review.
References
-
- Magliano DJ, Boyko EJ IDF. Diabetes Atlas 10th edition scientific committee, in: IDF DIABETES ATLAS (2021). Brussels: International Diabetes Federation. Available online at: http://www.ncbi.nlm.nih.gov/books/NBK581934/ (Accessed October 1, 2024).
-
- Ong KL, Stafford LK, McLaughlin SA, Boyko EJ, Vollset SE, Smith AE, et al. . Global, regional, and national burden of diabetes from 1990 to 2021, with projections of prevalence to 2050: a systematic analysis for the Global Burden of Disease Study 2021. Lancet. (2023) 402:203–34. doi: 10.1016/S0140-6736(23)01301-6 - DOI - PMC - PubMed
MeSH terms
Substances
Supplementary concepts
LinkOut - more resources
Full Text Sources
Miscellaneous