Risk factor mining and prediction of urine protein progression in chronic kidney disease: a machine learning- based study
- PMID: 37653403
- PMCID: PMC10472702
- DOI: 10.1186/s12911-023-02269-2
Risk factor mining and prediction of urine protein progression in chronic kidney disease: a machine learning- based study
Abstract
Background: Chronic kidney disease (CKD) is a global public health concern. Therefore, to provide timely intervention for non-hospitalized high-risk patients and rationally allocate limited clinical resources is important to mine the key factors when designing a CKD prediction model.
Methods: This study included data from 1,358 patients with CKD pathologically confirmed during the period from December 2017 to September 2020 at Zhongshan Hospital. A CKD prediction interpretation framework based on machine learning was proposed. From among 100 variables, 17 were selected for the model construction through a recursive feature elimination with logistic regression feature screening. Several machine learning classifiers, including extreme gradient boosting, gaussian-based naive bayes, a neural network, ridge regression, and linear model logistic regression (LR), were trained, and an ensemble model was developed to predict 24-hour urine protein. The detailed relationship between the risk of CKD progression and these predictors was determined using a global interpretation. A patient-specific analysis was conducted using a local interpretation.
Results: The results showed that LR achieved the best performance, with an area under the curve (AUC) of 0.850 in a single machine learning model. The ensemble model constructed using the voting integration method further improved the AUC to 0.856. The major predictors of moderate-to-severe severity included lower levels of 25-OH-vitamin, albumin, transferrin in males, and higher levels of cystatin C.
Conclusions: Compared with the clinical single kidney function evaluation indicators (eGFR, Scr), the machine learning model proposed in this study improved the prediction accuracy of CKD progression by 17.6% and 24.6%, respectively, and the AUC was improved by 0.250 and 0.236, respectively. Our framework can achieve a good predictive interpretation and provide effective clinical decision support.
Keywords: Chronic kidney disease; Clinical decision support; Machine learning; Model interpretation.
© 2023. BioMed Central Ltd., part of Springer Nature.
Conflict of interest statement
The authors declare no competing interests.
Figures





Similar articles
-
Use of Multiprognostic Index Domain Scores, Clinical Data, and Machine Learning to Improve 12-Month Mortality Risk Prediction in Older Hospitalized Patients: Prospective Cohort Study.J Med Internet Res. 2021 Jun 21;23(6):e26139. doi: 10.2196/26139. J Med Internet Res. 2021. PMID: 34152274 Free PMC article.
-
The prediction of in-hospital mortality in chronic kidney disease patients with coronary artery disease using machine learning models.Eur J Med Res. 2023 Jan 18;28(1):33. doi: 10.1186/s40001-023-00995-x. Eur J Med Res. 2023. PMID: 36653875 Free PMC article.
-
CKD Progression Prediction in a Diverse US Population: A Machine-Learning Model.Kidney Med. 2023 Jun 24;5(9):100692. doi: 10.1016/j.xkme.2023.100692. eCollection 2023 Sep. Kidney Med. 2023. PMID: 37637863 Free PMC article.
-
Interpretable machine learning for predicting chronic kidney disease progression risk.Digit Health. 2024 Jan 15;10:20552076231224225. doi: 10.1177/20552076231224225. eCollection 2024 Jan-Dec. Digit Health. 2024. PMID: 38235416 Free PMC article.
-
Machine Learning Hybrid Model for the Prediction of Chronic Kidney Disease.Comput Intell Neurosci. 2023 Mar 14;2023:9266889. doi: 10.1155/2023/9266889. eCollection 2023. Comput Intell Neurosci. 2023. PMID: 36959840 Free PMC article.
Cited by
-
Applying stacking ensemble method to predict chronic kidney disease progression in Chinese population based on laboratory information system: a retrospective study.PeerJ. 2024 Nov 1;12:e18436. doi: 10.7717/peerj.18436. eCollection 2024. PeerJ. 2024. PMID: 39498292 Free PMC article.
-
Predicting Major Preoperative Risk Factors for Retears After Arthroscopic Rotator Cuff Repair Using Machine Learning Algorithms.J Clin Med. 2025 Mar 9;14(6):1843. doi: 10.3390/jcm14061843. J Clin Med. 2025. PMID: 40142650 Free PMC article.
References
-
- Robinson BM, Akizawa T, Jager KJ, Kerr PG, Saran R, Pisoni RL. Factors affecting outcomes in patients reaching end-stage kidney disease worldwide: differences in access to renal replacement therapy, modality use, and haemodialysis practices. Lancet. 2016;388(10041):294–306. doi: 10.1016/S0140-6736(16)30448-2. - DOI - PMC - PubMed
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Medical
Research Materials
Miscellaneous