Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Apr 17;13(1):6263.
doi: 10.1038/s41598-023-33525-0.

Detection of the chronic kidney disease using XGBoost classifier and explaining the influence of the attributes on the model using SHAP

Affiliations

Detection of the chronic kidney disease using XGBoost classifier and explaining the influence of the attributes on the model using SHAP

Md Johir Raihan et al. Sci Rep. .

Abstract

Chronic kidney disease (CKD) is a condition distinguished by structural and functional changes to the kidney over time. Studies show that 10% of adults worldwide are affected by some kind of CKD, resulting in 1.2 million deaths. Recently, CKD has emerged as a leading cause of mortality worldwide, making it necessary to develop a Computer-Aided Diagnostic (CAD) system to diagnose CKD automatically. Machine Learning (ML) based CAD system can be used by a clinician to automatically diagnoses mass people. Since ML models are considered a black box, it is also necessary to expose influential causes behind a model's prediction of a particular output. So that, a doctor can make a more rational decision based on the model's output and analysis of the features influence on the model. In this paper, we have used the XGBoost as the ML classifier to predict whether a patient has CKD or not. Using the XGBoost classifier, we have obtained an accuracy, precision, recall, and F1 score of [Formula: see text] and [Formula: see text] respectively using all [Formula: see text] features. Furthermore, we have used Biogeography Based Optimization (BBO) algorithm to find an effective subset of the features. The BBO algorithm selected almost half of the initial features. We have obtained an accuracy, precision, recall, and F1 score of [Formula: see text] and [Formula: see text] respectively using only 13 features selected by the BBO algorithm. Finally, we have explained the impact of the feature on the ML models using the SHapley Additive exPlanations (SHAP) analysis. Using SHAP analysis and BBO algorithm, we have found that hemoglobin and albumin mostly contribute to the detection of CKD.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Figure 1
Figure 1
Working process.
Figure 2
Figure 2
Visualization of missing values.
Figure 3
Figure 3
Confusion matrix and performance metrics formula.
Figure 4
Figure 4
Confusion matrix of the testing set. (a) Set 1. (b) Set 2 (c) Set 3 (d) Set 4.
Figure 5
Figure 5
ROC analysis (a) Set 2. (b) Set 4.
Figure 6
Figure 6
SHAP analysis on all samples. (a) Set 1. (b) Set 2. (c) Set 3. (d) Set 4.
Figure 7
Figure 7
SHAP analysis of a single sample from Set 2. (a) Correctly classified (ID: 140, True: CKD, Predicted: CKD). (b) Misclassified (ID: 0, True: CKD, Predicted: NCKD).
Figure 8
Figure 8
SHAP analysis of a single sample from Set 4. (a) Correctly classified (ID: 0, True: CKD, Predicted: CKD). (b) Misclassified (ID: 140, True: CKD, Predicted: NCKD).

Similar articles

Cited by

References

    1. Wouters OJ, O’Donoghue DJ, Ritchie J, Kanavos PG, Narva AS. Early chronic kidney disease: Diagnosis, management and models of care. Nat. Rev. Nephrol. 2015;11:491–502. doi: 10.1038/nrneph.2015.85. - DOI - PMC - PubMed
    1. Kalantar-Zadeh K, Jafar TH, Nitsch D, Neuen BL, Perkovic V. Chronic kidney disease. Lancet. 2021;398:786–802. doi: 10.1016/S0140-6736(21)00519-5. - DOI - PubMed
    1. ERA-EDTA Council & ERACODA Working Group Chronic kidney disease is a key risk factor for severe COVID-19: A call to action by the ERA-EDTA. Nephrol. Dial. Transplant. 2021;36:87–94. doi: 10.1093/ndt/gfaa314. - DOI - PMC - PubMed
    1. Baumgarten M, Gehr T. Chronic kidney disease: Detection and evaluation. AFP. 2011;84:1138–1148. - PubMed
    1. Chronic Kidney Disease: Diagnosis and Treatment. (Springer, 2019).

Publication types