Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Dec;44(1):562-570.
doi: 10.1080/0886022X.2022.2056053.

Development and internal validation of machine learning algorithms for end-stage renal disease risk prediction model of people with type 2 diabetes mellitus and diabetic kidney disease

Affiliations

Development and internal validation of machine learning algorithms for end-stage renal disease risk prediction model of people with type 2 diabetes mellitus and diabetic kidney disease

Yutong Zou et al. Ren Fail. 2022 Dec.

Abstract

Aims: Diabetic kidney disease (DKD) is the most common cause of end-stage renal disease (ESRD) and is associated with increased morbidity and mortality in patients with diabetes. Identification of risk factors involved in the progression of DKD to ESRD is expected to result in early detection and appropriate intervention and improve prognosis. Therefore, this study aimed to establish a risk prediction model for ESRD resulting from DKD in patients with type 2 diabetes mellitus (T2DM).

Methods: Between January 2008 and July 2019, a total of 390 Chinese patients with T2DM and DKD confirmed by percutaneous renal biopsy were enrolled and followed up for at least 1 year. Four machine learning algorithms (gradient boosting machine, support vector machine, logistic regression, and random forest (RF)) were used to identify the critical clinical and pathological features and to build a risk prediction model for ESRD.

Results: There were 158 renal outcome events (ESRD) (40.51%) during the 3-year median follow up. The RF algorithm showed the best performance at predicting progression to ESRD, showing the highest AUC (0.90) and ACC (82.65%). The RF algorithm identified five major factors: Cystatin-C, serum albumin (sAlb), hemoglobin (Hb), 24-hour urine urinary total protein, and estimated glomerular filtration rate. A nomogram according to the aforementioned five predictive factors was constructed to predict the incidence of ESRD.

Conclusion: Machine learning algorithms can efficiently predict the incident ESRD in DKD participants. Compared with the previous models, the importance of sAlb and Hb were highlighted in the current model.HighlightsWhat is already known? Identification of risk factors for the progression of DKD to ESRD is expected to improve the prognosis by early detection and appropriate intervention.What this study has found? Machine learning algorithms were used to construct a risk prediction model of ESRD in patients with T2DM and DKD. The major predictive factors were found to be CysC, sAlb, Hb, eGFR, and UTP.What are the implications of the study? In contrast with the treatment of participants with early-phase T2DM with or without mild kidney damage, major emphasis should be placed on indicators of kidney function, nutrition, anemia, and proteinuria for participants with T2DM and advanced DKD to delay ESRD, rather than age, sex, and control of hypertension and glycemia.

Keywords: Type 2 diabetes mellitus; diabetic kidney disease; end-stage renal disease; machine learning; risk prediction model.

PubMed Disclaimer

Conflict of interest statement

The authors have no conflict of interests to disclose, and the results in this paper have not been published previously in whole or part.

Figures

Figure 1.
Figure 1.
Process of establishing prediction models.
Figure 2.
Figure 2.
Correlation between variables. The magnitude and direction of the correlation are reflected by the size (larger is stronger) and color (red is negative and blue is posive) of the circles, respectively.
Figure 3.
Figure 3.
ROC for different machine learning algorithms predicts the results of ESRD in validate data set.
Figure 4.
Figure 4.
Prognostic nomogram to predict individual renal survival probability in T2DM patients with DKD. The nomogram allows the user to obtain 1-, 3-, and 5-year renal survival corresponding to a patient's combination of variables. Points are assigned for each variable by drawing a straight line upward from the corresponding value to the “Points” line. Then, sum the points received for each variable, and locate the number on the “Total Points” axis. To speculate the patient's renal survival after 1-, 3-, or 5-years, a straight line must be drawn down to the corresponding “1-Year Survival, 3-Year Survival, or 5-Year Survival” probability axis.

References

    1. Reutens AT. Epidemiology of diabetic kidney disease. Med Clin North Am. 2013; 97(1):1–18. - PubMed
    1. Wan EYF, Fong DYT, Fung CSC, et al. . Prediction of new onset of end stage renal disease in chinese patients with type 2 diabetes mellitus – a population-based retrospective cohort study. BMC Nephrol. 2017; 18(1): 257. - PMC - PubMed
    1. Wysham CH, Gauthier-Loiselle M, Bailey RA, et al. . Development of risk models for major adverse chronic renal outcomes among patients with type 2 diabetes mellitus using insurance claims: a retrospective observational study. Curr Med Res Opin. 2020;36(2):219–227. - PubMed
    1. Yang L, Chu TK, Lian J, et al. . Individualised risk prediction model for new-onset, progression and regression of chronic kidney disease in a retrospective cohort of patients with type 2 diabetes under primary care in Hong Kong. BMJ Open. 2020;10(7):e035308. - PMC - PubMed
    1. Zhang J, Zhang R, Wang Y, et al. . The level of serum albumin is associated with renal prognosis in patients with diabetic nephropathy. J Diabetes Res. 2019;2019:1–9. - PMC - PubMed

MeSH terms