Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Apr 24:16:1514397.
doi: 10.3389/fendo.2025.1514397. eCollection 2025.

Predicting isolated impaired glucose tolerance without oral glucose tolerance test using machine learning in Chinese Han men

Affiliations

Predicting isolated impaired glucose tolerance without oral glucose tolerance test using machine learning in Chinese Han men

Lin Wang et al. Front Endocrinol (Lausanne). .

Abstract

Background: Isolated Impaired Glucose Tolerance (I-IGT) represents a specific prediabetic state that typically requires a standardized oral glucose tolerance test (OGTT) for diagnosis. This study aims to predict glucose tolerance status in Chinese Han men at fasting state using machine learning (ML) models with demographic, anthropometric, and laboratory data.

Methods: The study population consisted of 1,117 Chinese Han men aged 50-87 years. Baseline variables including age, fasting plasma glucose (FPG), high blood pressure (HBP), body mass index (BMI), waist to hip ratio (WHR), total cholesterol (TC), triglyceride (TG), high-density lipoprotein cholesterol (HDL-C), and low-density lipoprotein cholesterol (LDL-C) were collected from electronic medical records (EMRs) for machine learning model training and validation. Support Vector Machine (SVM), Decision Tree (DT), Random Forest (RF), Logistic Regression (LR), K-Nearest Neighbors (KNN), Naive Bayes (NB), Adaptive Boosting (AdaBoost) and Gradient Boosting Machines (GBM) were tested for machine learning model performance comparison. Model performance was evaluated using metrics including accuracy, recall, F1 score, positive predictive value (PPV), negative predictive value (NPV), and the area under the receiver operating characteristic curve (AUC). Shapley Additive Explanations (SHAP) and confusion matrix plots were used for model interpretation.

Results: The RF model demonstrated the best overall performance with a 96.7% accuracy, recall of 91.4%, F1 score of 95.7%, PPV of 99.1%, and NPV of 95.6%. The AUC values for the SVM, DT, RF, LR, KNN, NB, AdaBoost, and GBM models were 0.97, 0.92, 0.96, 0.97, 0.88, 0.88, 0.97, and 0.97, respectively. While the RF model showed strong overall performance, the LR model had the highest AUC, indicating superior discriminatory power. FPG was identified as the most important predictor for I-IGT, followed by HDL, TC, HBP, BMI, and WHR. Individuals with FPG levels higher than 5.1 mmol/L were more likely to have I-IGT; the performance metrics for this cut-off value were: 89.35% accuracy, 89.79% recall, 85.22% F1 score, 81.09% PPV, 94.38% NPV, and 0.95 AUC.

Conclusion: Machine learning models based on demographic and clinical characteristics offer a cost-effective method for predicting I-IGT in Chinese Han men aged over 50, without the need for an OGTT. These models could complement existing early diagnostic strategies, thereby enhancing the early detection and prevention of diabetes. Additionally, FPG alone could serve as an efficient screening tool for the early identification of I-IGT in clinical settings.

Keywords: fasting plasma glucose; isolated impaired glucose tolerance; machine learning models; oral glucose tolerance test; pre-diabetes.

PubMed Disclaimer

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

Figure 1
Figure 1
Proposed model workflow.
Figure 2
Figure 2
Correlation between the characteristics.
Figure 3
Figure 3
AUC comparison of eight machine learning algorithms.
Figure 4
Figure 4
Show the eight confusion matrix to present the performances of applied eight machine learning algorithms where x-axis states the predicted level and y-axis states the true level.
Figure 5
Figure 5
(a) Importance matrix plot of the Random Forest model, depicting the importance of each variable for predicting IGT in individuals with normal fasting plasma glucose levels. (b) SHAP summary plot of the 9 clinical characteristics of the Random Forest model. FPG, fasting plasma glucose; IGT, impaired glucose tolerance; T2DM, type 2 diabetes mellitus; HBP, high blood pressure; BMI, body mass index; WHR, waist to hip ratio; TC, total cholesterol; TG, triglyceride; HDL, high-density lipoprotein; LDL, low-density lipoprotein.
Figure 6
Figure 6
SHAP force plot for individuals in the dataset at high (a) or low (b) risk of IGT.

Similar articles

References

    1. Magliano DJ, Boyko EJ IDF. Diabetes Atlas 10th edition scientific committee, in: IDF DIABETES ATLAS (2021). Brussels: International Diabetes Federation. Available online at: http://www.ncbi.nlm.nih.gov/books/NBK581934/ (Accessed October 1, 2024).
    1. Ong KL, Stafford LK, McLaughlin SA, Boyko EJ, Vollset SE, Smith AE, et al. . Global, regional, and national burden of diabetes from 1990 to 2021, with projections of prevalence to 2050: a systematic analysis for the Global Burden of Disease Study 2021. Lancet. (2023) 402:203–34. doi: 10.1016/S0140-6736(23)01301-6 - DOI - PMC - PubMed
    1. Wu J, Feng Z, Duan J, Li Y, Deng P, Wang J, et al. . Global burden of type 2 diabetes attributable to non-high body mass index from 1990 to 2019. BMC Public Health. (2023) 23:1338. doi: 10.1186/s12889-023-15585-z - DOI - PMC - PubMed
    1. American Diabetes Association . 2. Classification and diagnosis of diabetes: standards of medical care in diabetes—2021 . Diabetes Care. (2021) 44:S15–33. doi: 10.2337/dc21-S002 - DOI - PubMed
    1. Yip W, Sequeira I, Plank L, Poppitt S. Prevalence of pre-diabetes across ethnicities: A review of impaired fasting glucose (IFG) and impaired glucose tolerance (IGT) for classification of dysglycaemia. Nutrients. (2017) 9:1273. doi: 10.3390/nu9111273 - DOI - PMC - PubMed

Supplementary concepts

LinkOut - more resources