Machine learning-based warning model for chronic kidney disease in individuals over 40 years old in underprivileged areas, Shanxi Province
- PMID: 36698845
- PMCID: PMC9868668
- DOI: 10.3389/fmed.2022.930541
Abstract
Introduction: Chronic kidney disease (CKD) is a progressive disease with high incidence but early symptoms that are easily overlooked. Because rural areas in China have inadequate medical check-ups and screening programmes that target only a single disease, CKD can easily progress undetected to end-stage renal failure. This study aimed to construct an early warning model for CKD tailored to impoverished areas by applying machine learning (ML) algorithms to easily accessible parameters collected from ten rural areas in Shanxi Province, thereby enabling earlier treatment and improving patients' quality of life.
Methods: From April to November 2019, opportunistic CKD screening was carried out in 10 rural areas in Shanxi Province. First, general information, physical examination data, and blood and urine specimens were collected from 13,550 subjects. Feature selection of explanatory variables was then performed using LASSO regression, and the two target datasets, i.e., albuminuria-to-creatinine ratio (ACR) and α1-microglobulin-to-creatinine ratio (MCR), were balanced using the synthetic minority over-sampling technique (SMOTE). Next, Bagging, Random Forest (RF), and eXtreme Gradient Boosting (XGBoost) were employed to classify ACR outcomes and MCR outcomes.
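To illustrate the class-balancing step, here is a minimal standard-library sketch of the core SMOTE idea: each synthetic minority sample is an interpolation between a real minority sample and one of its nearest minority neighbours. It is not the authors' implementation. The function name `smote`, the default `k=5`, and the toy two-feature values are assumptions made for illustration only.

```python
import random
from math import dist


def smote(minority, n_new, k=5, seed=0):
    """Create n_new synthetic points by interpolating between minority samples.

    minority: list of equal-length numeric tuples (at least two samples).
    k: number of nearest minority neighbours considered (assumed default).
    """
    if len(minority) < 2:
        raise ValueError("need at least two minority samples")
    rng = random.Random(seed)
    k = min(k, len(minority) - 1)
    synthetic = []
    for _ in range(n_new):
        base_idx = rng.randrange(len(minority))
        base = minority[base_idx]
        # k nearest minority neighbours of the base sample, excluding itself
        others = [p for i, p in enumerate(minority) if i != base_idx]
        neighbours = sorted(others, key=lambda p: dist(base, p))[:k]
        neighbour = rng.choice(neighbours)
        gap = rng.random()  # position along the segment base -> neighbour
        synthetic.append(tuple(b + gap * (n - b) for b, n in zip(base, neighbour)))
    return synthetic


# Hypothetical toy minority class with two features (values are illustrative only)
toy_minority = [(150.0, 2.1), (162.0, 2.8), (145.0, 1.9), (170.0, 3.0)]
toy_synthetic = smote(toy_minority, n_new=6, k=3, seed=42)
```

Because every synthetic point lies on a segment between two real minority samples, the oversampled class stays inside the region the minority already occupies. Oversampling is normally applied to the training split only, so that evaluation data contain no synthetic cases.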
Results: A total of 12,330 rural residents were included in this study, with 20 explanatory variables. Increased ACR and increased MCR were found in 1,587 (12.8%) and 1,456 (11.8%) cases, respectively. After LASSO, 14 and 15 explanatory variables remained in the ACR and MCR datasets, respectively. Bagging, RF, and XGBoost performed well in classification, with AUC values reaching 0.74, 0.87, 0.87, and 0.89 for ACR outcomes and 0.75, 0.88, 0.89, and 0.90 for MCR outcomes. The five variables contributing most to classification were systolic blood pressure (SBP), triglycerides (TG), total cholesterol (TC), homocysteine (Hcy), and diastolic blood pressure (DBP) for ACR outcomes, and age, TG, SBP, Hcy, and fasting plasma glucose (FPG) for MCR outcomes. Overall, the machine learning algorithms could serve as a warning model for CKD.
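For readers less familiar with the AUC metric reported above, the following sketch computes it from first principles as the probability that a randomly chosen positive case receives a higher score than a randomly chosen negative case, with ties counted as one half. The function name `roc_auc` and the four toy labels and scores are assumptions for illustration and are not data from the study.

```python
def roc_auc(labels, scores):
    """Area under the ROC curve via pairwise comparison of positives and negatives."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("both classes must be present")
    # Count positive/negative pairs ranked correctly; a tie counts as half
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Hypothetical toy example: 1 = increased ratio, 0 = normal
toy_labels = [0, 0, 1, 1]
toy_scores = [0.1, 0.4, 0.35, 0.8]
toy_auc = roc_auc(toy_labels, toy_scores)  # 3 of 4 pairs ranked correctly -> 0.75
```

An AUC of 0.5 corresponds to random ranking and 1.0 to perfect separation, so values near 0.9 indicate that the model ranks most affected individuals above unaffected ones.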
Conclusion: ML algorithms combined with indicators that are readily available in rural settings achieve good classification performance, which makes an early warning model for CKD feasible. This model could support large-scale population screening for CKD in poverty-stricken areas and should be promoted to improve quality of life and reduce mortality.
Keywords: albuminuria-to-creatinine ratio; auxiliary diagnosis; chronic kidney disease; machine learning; warning model; α1-microglobulin-to-creatinine ratio.
Copyright © 2023 Song, Liu, Qiu, Qing, Li, Zhao, Li, Li and Zhou.
Conflict of interest statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Similar articles
- Using random forest algorithm for glomerular and tubular injury diagnosis. Front Med (Lausanne). 2022 Jul 28;9:911737. doi: 10.3389/fmed.2022.911737. eCollection 2022. PMID: 35966858 Free PMC article.
- Machine learning-enabled risk prediction of chronic obstructive pulmonary disease with unbalanced data. Comput Methods Programs Biomed. 2023 Mar;230:107340. doi: 10.1016/j.cmpb.2023.107340. Epub 2023 Jan 6. PMID: 36640604
- The prediction of in-hospital mortality in chronic kidney disease patients with coronary artery disease using machine learning models. Eur J Med Res. 2023 Jan 18;28(1):33. doi: 10.1186/s40001-023-00995-x. PMID: 36653875 Free PMC article.
- A Hybrid Risk Factor Evaluation Scheme for Metabolic Syndrome and Stage 3 Chronic Kidney Disease Based on Multiple Machine Learning Techniques. Healthcare (Basel). 2022 Dec 9;10(12):2496. doi: 10.3390/healthcare10122496. PMID: 36554020 Free PMC article.
- Deep Learning Identifies Intelligible Predictors of Poor Prognosis in Chronic Kidney Disease. IEEE J Biomed Health Inform. 2023 Jul;27(7):3677-3685. doi: 10.1109/JBHI.2023.3266587. Epub 2023 Jun 30. PMID: 37043318
Cited by
- Optimized machine learning based comparative analysis of predictive models for classification of kidney tumors. Sci Rep. 2025 Aug 19;15(1):30358. doi: 10.1038/s41598-025-15414-w. PMID: 40830637 Free PMC article.
- Prevalence and factors associated with hyperphosphatemia in continuous ambulatory peritoneal dialysis patients: A cross-sectional study. Front Med (Lausanne). 2023 Apr 14;10:1142013. doi: 10.3389/fmed.2023.1142013. eCollection 2023. PMID: 37122336 Free PMC article.
- Epidemiological trends and risk factors of CKD-T1DM in children and adolescents across 204 countries and territories (1990-2021). Front Endocrinol (Lausanne). 2025 Mar 26;16:1551467. doi: 10.3389/fendo.2025.1551467. eCollection 2025. PMID: 40206600 Free PMC article.