Detection of the chronic kidney disease using XGBoost classifier and explaining the influence of the attributes on the model using SHAP
- PMID: 37069256
- PMCID: PMC10110580
- DOI: 10.1038/s41598-023-33525-0
Detection of the chronic kidney disease using XGBoost classifier and explaining the influence of the attributes on the model using SHAP
Abstract
Chronic kidney disease (CKD) is a condition distinguished by structural and functional changes to the kidney over time. Studies show that 10% of adults worldwide are affected by some kind of CKD, resulting in 1.2 million deaths. Recently, CKD has emerged as a leading cause of mortality worldwide, making it necessary to develop a Computer-Aided Diagnostic (CAD) system to diagnose CKD automatically. Machine Learning (ML) based CAD system can be used by a clinician to automatically diagnoses mass people. Since ML models are considered a black box, it is also necessary to expose influential causes behind a model's prediction of a particular output. So that, a doctor can make a more rational decision based on the model's output and analysis of the features influence on the model. In this paper, we have used the XGBoost as the ML classifier to predict whether a patient has CKD or not. Using the XGBoost classifier, we have obtained an accuracy, precision, recall, and F1 score of [Formula: see text] and [Formula: see text] respectively using all [Formula: see text] features. Furthermore, we have used Biogeography Based Optimization (BBO) algorithm to find an effective subset of the features. The BBO algorithm selected almost half of the initial features. We have obtained an accuracy, precision, recall, and F1 score of [Formula: see text] and [Formula: see text] respectively using only 13 features selected by the BBO algorithm. Finally, we have explained the impact of the feature on the ML models using the SHapley Additive exPlanations (SHAP) analysis. Using SHAP analysis and BBO algorithm, we have found that hemoglobin and albumin mostly contribute to the detection of CKD.
© 2023. The Author(s).
Conflict of interest statement
The authors declare no competing interests.
Figures








Similar articles
-
Investigation on explainable machine learning models to predict chronic kidney diseases.Sci Rep. 2024 Feb 14;14(1):3687. doi: 10.1038/s41598-024-54375-4. Sci Rep. 2024. PMID: 38355876 Free PMC article.
-
The prediction of in-hospital mortality in chronic kidney disease patients with coronary artery disease using machine learning models.Eur J Med Res. 2023 Jan 18;28(1):33. doi: 10.1186/s40001-023-00995-x. Eur J Med Res. 2023. PMID: 36653875 Free PMC article.
-
Development and Validation of an Insulin Resistance Model for a Population with Chronic Kidney Disease Using a Machine Learning Approach.Nutrients. 2022 Jul 9;14(14):2832. doi: 10.3390/nu14142832. Nutrients. 2022. PMID: 35889789 Free PMC article.
-
Machine learning algorithm to predict the in-hospital mortality in critically ill patients with chronic kidney disease.Ren Fail. 2023 Dec;45(1):2212790. doi: 10.1080/0886022X.2023.2212790. Ren Fail. 2023. PMID: 37203863 Free PMC article.
-
A Systematic Review of Case-Identification Algorithms Based on Italian Healthcare Administrative Databases for Three Relevant Diseases of the Digestive and Genitourinary System: Inflammatory Bowel Diseases, Celiac Disease, and Chronic Kidney Disease.Epidemiol Prev. 2019 Jul-Aug;43(4 Suppl 2):88-98. doi: 10.19191/EP19.4.S2.P088.095. Epidemiol Prev. 2019. PMID: 31650809
Cited by
-
Prediction of acute and chronic kidney diseases during the post-covid-19 pandemic with machine learning models: utilizing national electronic health records in the US.EBioMedicine. 2025 May;115:105726. doi: 10.1016/j.ebiom.2025.105726. Epub 2025 Apr 26. EBioMedicine. 2025. PMID: 40288236 Free PMC article.
-
Deep learning for detecting and early predicting chronic obstructive pulmonary disease from spirogram time series.NPJ Syst Biol Appl. 2025 Feb 15;11(1):18. doi: 10.1038/s41540-025-00489-y. NPJ Syst Biol Appl. 2025. PMID: 39955293 Free PMC article.
-
An Ensemble Machine Learning Model for Early Prediction of Vancomycin-Induced Acute Kidney Injury in ICU Patients.Arch Acad Emerg Med. 2025 Apr 15;13(1):e45. doi: 10.22037/aaemj.v13i1.2560. eCollection 2025. Arch Acad Emerg Med. 2025. PMID: 40487901 Free PMC article.
-
Machine learning-based identification of key biotic and abiotic drivers of mineral weathering rate in a complex enhanced weathering experiment.Open Res Eur. 2025 Jul 3;5:71. doi: 10.12688/openreseurope.19252.2. eCollection 2025. Open Res Eur. 2025. PMID: 40792254 Free PMC article.
-
Predicting the risk of threatened abortion using machine learning methods: a comparative study.BMC Pregnancy Childbirth. 2025 Aug 30;25(1):901. doi: 10.1186/s12884-025-08030-z. BMC Pregnancy Childbirth. 2025. PMID: 40885888 Free PMC article.
References
-
- Baumgarten M, Gehr T. Chronic kidney disease: Detection and evaluation. AFP. 2011;84:1138–1148. - PubMed
-
- Chronic Kidney Disease: Diagnosis and Treatment. (Springer, 2019).
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Medical
Miscellaneous