Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Jun 5:16:1614657.
doi: 10.3389/fendo.2025.1614657. eCollection 2025.

Machine learning-based prediction of diabetic peripheral neuropathy: model development and clinical validation

Affiliations

Machine learning-based prediction of diabetic peripheral neuropathy: model development and clinical validation

Meng Sun et al. Front Endocrinol (Lausanne). .

Abstract

Background: Diabetic peripheral neuropathy (DPN) is a common and debilitating complication of type 2 diabetes mellitus (T2DM), significantly impacting patients' quality of life and increasing healthcare burdens. Early prediction and intervention are critical to mitigating its impact.

Methods: This study analyzed 1,544 diabetic patients from the First Affiliated Hospital of Shandong First Medical University, who were randomly divided into a training cohort (n = 1,082) and a testing cohort (n = 462) using a 7:3 split ratio. Feature selection was performed using both Boruta and LASSO algorithms, and the intersection of the selected variables was used as the final predictor set. Eight key predictors were identified from 23 variables, including diabetes duration, uric acid, HbA1c, NLR, smoking status, SCR, LDH, and hypertension. Nine machine learning models were developed and compared for DPN risk prediction.

Results: Stochastic Gradient Boosting (SGBT) demonstrated the best performance (training AUC: 0.933, 95% CI: 0.921-0.946; testing AUC: 0.811, 95% CI: 0.776-0.843). Shapley Additive Explanations (SHAP) analysis provided interpretability, highlighting the clinical importance of diabetes duration and HbA1c among other predictors.

Conclusion: This study establishes a robust predictive tool for early DPN detection, laying the foundation for improved prevention and management strategies.

Keywords: clinical data; diabetic peripheral neuropathy; interpretable; machine learning; risk prediction model.

PubMed Disclaimer

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

Figure 1
Figure 1
Overview of data processing and machine learning workflow. (A) Database: Participants were selected from The First Affiliated Hospital of Shandong First Medical University (2023–2024). Inclusion criteria were age ≥18 years and T2DM diagnosis. Exclusion criteria included other neuropathies, malignancies, severe infections, organ dysfunction, and metabolic disorders. The final study cohort comprised 1,544 individuals. (B) Feature selection: From an initial set of 23 variables, key variables were identified using the Boruta and LASSO methods. Selected features included diabetes duration, serum creatinine (SCR), hypertension, neutrophil-to-lymphocyte ratio (NLR), smoking status, uric acid, lactate dehydrogenase (LDH), and HbA1c. (C) Model training and testing: Nine machine learning algorithms, including Logistic Regression (LR), Decision Tree (DT), Random Forest (RF), K-Nearest Neighbors (KNN), Support Vector Machine (SVM), Naive Bayes (NB), XGBoost (XGB), Stochastic Gradient Boosting Trees (SGBT), and Neural Network (NNET), were applied, with hyperparameter optimization performed using 10×10-fold cross-validation. Model evaluation metrics included Receiver Operating Characteristic (ROC), Area Under the Curve (AUC), F1-score, calibration curves, and decision curves. Model interpretation was conducted using SHAP analysis for feature importance.
Figure 2
Figure 2
Predictor screening results. (A) Boruta feature selection. (B) LASSO screening with lambda values indicated by dashed lines. (C) Variable trajectories in the LASSO model. (D) Common predictors identified by Boruta and LASSO, including diabetes duration, HbA1c, NLR, SCR, hypertension, smoking status, LDH, and uric acid.
Figure 3
Figure 3
Performance and comparison of nine predictive models. ROC curves for the training set (A) and the test set (B). Evaluation metrics for the training set (C) and the test set (D), including accuracy, sensitivity, specificity, PPV, NPV, F1 score, and kappa value. Decision Curve Analysis (DCA) for the training set (E) and the test set (F).
Figure 4
Figure 4
SHAP analysis for feature interpretability. (A) SHAP dendrogram of features for the SGBT model. (B) Feature importance ranking for the logistic regression model. (C) Decision plot of feature contributions to the model outputs.
Figure 5
Figure 5
Web-based calculator for predicting the risk of diabetic peripheral neuropathy (DPN) in patients with diabetes using the developed model. By entering values for diabetes duration, uric acid, HbA1c, NLR, smoking status, serum creatinine (SCR), lactate dehydrogenase (LDH), and hypertension, an individualized DPN risk prediction can be obtained.

Similar articles

References

    1. Sun H, Saeedi P, Karuranga S, Pinkepank M, Ogurtsova K, Duncan BB, et al. IDF Diabetes Atlas: Global, regional and country-level diabetes prevalence estimates for 2021 and projections for 2045. Diabetes Res Clin Pract. (2022) 183:109119. doi: 10.1016/j.diabres.2021.109119 - DOI - PMC - PubMed
    1. Lin YK, Gao B, Liu L, Ang L, Mizokami-Stout K, Pop-Busui R, et al. The prevalence of diabetic microvascular complications in China and the USA. Curr Diabetes Rep. (2021) 21:16. doi: 10.1007/s11892-021-01387-3 - DOI - PubMed
    1. Dal Canto E, Ceriello A, Ryden L, Ferrini M, Hansen TB, Schnell O, et al. Diabetes as a cardiovascular risk factor: An overview of global trends of macro and micro vascular complications. Eur J Prev Cardiol. (2019) 26:25–32. doi: 10.1177/2047487319878371 - DOI - PubMed
    1. Liu L, Bi B, Gui M, Zhang L, Ju F, Wang X, et al. Development and internal validation of an interpretable risk prediction model for diabetic peripheral neuropathy in type 2 diabetes: a single-centre retrospective cohort study in China. BMJ Open. (2025) 15:e092463. doi: 10.1136/bmjopen-2024-092463 - DOI - PMC - PubMed
    1. Ziegler D, Papanas N, Schnell O, Nguyen BDT, Nguyen KT, Kulkantrakorn K, et al. Current concepts in the management of diabetic polyneuropathy. J Diabetes Investig. (2021) 12:464–75. doi: 10.1111/jdi.13401 - DOI - PMC - PubMed

LinkOut - more resources