Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Oct 11;12(10):2454.
doi: 10.3390/diagnostics12102454.

Machine Learning Models for the Prediction of Renal Failure in Chronic Kidney Disease: A Retrospective Cohort Study

Affiliations

Machine Learning Models for the Prediction of Renal Failure in Chronic Kidney Disease: A Retrospective Cohort Study

Chuan-Tsung Su et al. Diagnostics (Basel). .

Abstract

This study assessed the feasibility of five separate machine learning (ML) classifiers for predicting disease progression in patients with pre-dialysis chronic kidney disease (CKD). The study enrolled 858 patients with CKD treated at a veteran's hospital in Taiwan. After classification into early and advanced stages, patient demographics and laboratory data were processed and used to predict progression to renal failure and important features for optimal prediction were identified. The random forest (RF) classifier with synthetic minority over-sampling technique (SMOTE) had the best predictive performances among patients with early-stage CKD who progressed within 3 and 5 years and among patients with advanced-stage CKD who progressed within 1 and 3 years. Important features identified for predicting progression from early- and advanced-stage CKD were urine creatinine and serum creatinine levels, respectively. The RF classifier demonstrated the optimal performance, with an area under the receiver operating characteristic curve values of 0.96 for predicting progression within 5 years in patients with early-stage CKD and 0.97 for predicting progression within 1 year in patients with advanced-stage CKD. The proposed method resulted in the optimal prediction of CKD progression, especially within 1 year of advanced-stage CKD. These results will be useful for predicting prognosis among patients with CKD.

Keywords: kidney disease; random forest; renal dialysis.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no conflict of interest.

Figures

Figure 1
Figure 1
Flow chart for the selection of study subjects.
Figure 2
Figure 2
Flow chart of model training and performance evaluation.
Figure 3
Figure 3
Plots showing the area under the receiver operating characteristic curve (AUROC) for the ability of the random forest (RF) classifier trained using synthetic minority over-sampling technique (SMOTE) to predict the progression of early-stage chronic kidney disease (CKD) to end-stage renal disease (ESRD) within (a) 3 and (b) 5 years. Plots showing the AUROC for the ability of the RF classifier trained using to predict the progression of advanced-stage CKD to ESRD within (c) 1 and (d) 3 years.
Figure 4
Figure 4
Positive and negative impacts of 13 features on the prediction of progression from early-stage chronic kidney disease (CKD) to end-stage renal disease (ESRD) within (a) 3 and (b) 5 years. Positive and negative impacts of 24 features on the prediction of progression from advanced-stage CKD to ESRD within (c) 1 and (d) 3 years. Features are ranked in descending according to Shapley additive explanations (SHAP values), where the top feature represents the most informative feature. Each dot in the plot represents the value for an individual patient. The color represents the scale of the feature’s value, ranging from high (red) to low (blue), for the observation.
Figure 5
Figure 5
Plots showing the under the receiver operatic characteristic curve (AUROC) of five models, including six features from (a) original and (b) synthetic minority over-sampling technique (SMOTE) data for predicting end-stage renal disease progression within 5 years in patients with early-stage chronic kidney disease. Plots showing the AUROC of five models, including 10 features from (c) original and (d) SMOTE data for predicting end-stage renal disease progression within 1 year in patients with advanced-stage chronic kidney disease. GNB, Gaussian naïve Bayes; LR, linear regression; RF, random forest; SVM, support vector machine; XGBoost, extreme gradient boosting.

Similar articles

Cited by

References

    1. Prasad B., Osman M., Jafari M., Gordon L., Tangri N., Ferguson T.W., Jin S., Kappel J., Kozakewycz D. Kidney failure risk equation and cost of care in patients with chronic kidney disease. CJASN. 2022;17:17–26. doi: 10.2215/CJN.06770521. - DOI - PMC - PubMed
    1. Go A.S., Chertow G.M., Fan D., McCulloch C.E., Hsu C.Y. Chronic kidney disease and the risks of death, cardiovascular events, and hospitalization. N. Engl. J. Med. 2004;351:1296–1305. doi: 10.1056/NEJMoa041031. - DOI - PubMed
    1. Zhang L., Long J., Jiang W., Shi Y., He X., Zhou Z., Li Y., Yeung R.O., Wang J., Matsushita K., et al. Trends in chronic kidney disease in China. N. Engl. J. Med. 2016;375:905–906. doi: 10.1056/NEJMc1602469. - DOI - PubMed
    1. Saran R., Robinson B., Abbott K.C., Bragg-Gresham J., Chen X., Gipson D., Gu H., Hirth R.A., Hutton D., Jin Y., et al. US renal data system 2019 annual data report: Epidemiology of kidney disease in the United States. Am. J. Kidney Dis. 2020;75:A6–A7. doi: 10.1053/j.ajkd.2019.09.003. - DOI - PubMed
    1. McCullough K.P., Morgenstern H., Saran R., Herman W.H., Robinson B.M. Projecting ESRD incidence and prevalence in the United States through 2030. J. Am. Soc. Nephrol. 2019;30:127–135. doi: 10.1681/ASN.2018050531. - DOI - PMC - PubMed

LinkOut - more resources