Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Feb 7:13:e55825.
doi: 10.2196/55825.

Diagnosis of Chronic Kidney Disease Using Retinal Imaging and Urine Dipstick Data: Multimodal Deep Learning Approach

Affiliations

Diagnosis of Chronic Kidney Disease Using Retinal Imaging and Urine Dipstick Data: Multimodal Deep Learning Approach

Youngmin Bhak et al. JMIR Med Inform. .

Abstract

Background: Chronic kidney disease (CKD) is a prevalent condition with significant global health implications. Early detection and management are critical to prevent disease progression and complications. Deep learning (DL) models using retinal images have emerged as potential noninvasive screening tools for CKD, though their performance may be limited, especially in identifying individuals with proteinuria and in specific subgroups.

Objective: We aim to evaluate the efficacy of integrating retinal images and urine dipstick data into DL models for enhanced CKD diagnosis.

Methods: The 3 models were developed and validated: eGFR-RIDL (estimated glomerular filtration rate-retinal image deep learning), eGFR-UDLR (logistic regression using urine dipstick data), and eGFR-MMDL (multimodal deep learning combining retinal images and urine dipstick data). All models were trained to predict an eGFR<60 mL/min/1.73 m², a key indicator of CKD, calculated using the 2009 CKD-EPI (Chronic Kidney Disease Epidemiology Collaboration) equation. This study used a multicenter dataset of participants aged 20-79 years, including a development set (65,082 people) and an external validation set (58,284 people). Wide Residual Networks were used for DL, and saliency maps were used to visualize model attention. Sensitivity analyses assessed the impact of numerical variables.

Results: eGFR-MMDL outperformed eGFR-RIDL in both the test and external validation sets, with area under the curves of 0.94 versus 0.90 and 0.88 versus 0.77 (P<.001 for both, DeLong test). eGFR-UDLR outperformed eGFR-RIDL and was comparable to eGFR-MMDL, particularly in the external validation. However, in the subgroup analysis, eGFR-MMDL showed improvement across all subgroups, while eGFR-UDLR demonstrated no such gains. This suggested that the enhanced performance of eGFR-MMDL was not due to urine data alone, but rather from the synergistic integration of both retinal images and urine data. The eGFR-MMDL model demonstrated the best performance in individuals younger than 65 years or those with proteinuria. Age and proteinuria were identified as critical factors influencing model performance. Saliency maps indicated that urine data and retinal images provide complementary information, with urine offering insights into retinal abnormalities and retinal images, particularly the arcade vessels, being key for predicting kidney function.

Conclusions: The MMDL model integrating retinal images and urine dipstick data show significant promise for noninvasive CKD screening, outperforming the retinal image-only model. However, routine blood tests are still recommended for individuals aged 65 years and older due to the model's limited performance in this age group.

Keywords: chronic kidney disease; fundus image; multimodal deep learning; saliency map; urine dipstick.

PubMed Disclaimer

Conflict of interest statement

Conflicts of Interest: None declared.

Figures

Figure 1.
Figure 1.. Study flowchart from data preparation to deep learning modeling and validation. Images with abnormal brightness or haziness were excluded. CHA: CHA Bundang Medical Center; eGFR: estimated glomerular filtration rate; eGFR-MMDL: multimodal deep learning model for estimated glomerular filtration rate<60 mL/min/1.73 m²; eGFR-RIDL: retinal image deep learning model for estimated glomerular filtration rate<60 mL/min/1.73 m²; SCHPC: Severance Checkup Health Promotion Center.
Figure 2.
Figure 2.. Wide Residual Network architecture for detecting kidney function decline using retinal images. For the multimodal model, the urine dipstick measurements, age, and sex were concatenated with the feature vector from the image. Conv: convolutional layer.
Figure 3.
Figure 3.. ROC curves and AUC of low-eGFR detection models. The AUCs of the eGFR-MMDL and eGFR-UDLR models were compared with the AUC of eGFR-RIDL in (A) the test set and (B) the external validation set (DeLong test with Bonferroni correction). *P=.002; ***P<.001. AUC: area under the curve; eGFR: estimated glomerular filtration rate; MMDL: multimodal deep learning model; RIDL: retinal image deep learning model; ROC: receiver operating characteristic; UDLR: urine dipstick logistic regression.
Figure 4.
Figure 4.. ROC curves and AUC of low-eGFR detection models in subgroups of the test set. (A) Age <65 years (solid line) and ≥65 years (dotted line); (B) nondiabetes (solid line) and diabetes (dotted line); (C) no hypertension (solid line) and hypertension (dotted line); and (D) no proteinuria (solid line) and proteinuria (dotted line). The AUCs of the eGFR-MMDL and eGFR-UDLR models are compared with the AUC of the eGFR-RIDL (DeLong test with Bonferroni correction; P value indicators next to the AUC values); *P=.013, **P=.001, ***P<.001; NS: not significant. P value indicators are added to the upper left corner of the eGFR-MMDL model curve to denote significant differences in AUCs between the 2 subgroups (DeLong test); *P=.011, **P=.001, ***P<.001; NS: not significant. AUC: area under the curve; eGFR: estimated glomerular filtration rate; MMDL: multimodal deep learning model; RIDL: retinal image deep learning model; ROC: receiver operating characteristic; UDLR: urine dipstick logistic regression.
Figure 5.
Figure 5.. Saliency map results from the eGFR-RIDL and eGFR-MMDL models for eGFR decline detection. (A) A nondiabetes male aged 60 years with proteinuria level 3 (2+). (B) A male with diabetes and proteinuria level 4 (3+) aged 54 years. (C) A male with diabetes and proteinuria level 4 (3+) aged 60 years. (D) A male without hypertension or diabetes aged 60 years, with a proteinuria level of 0 (negative). From left to right: original retinal image, retinal image postpreprocessing via CLAHE and color normalization, eGFR-RIDL–generated saliency map, and eGFR-MMDL–generated saliency map. CLAHE: Contrast Limited Adaptive Histogram Equalization; eGFR: estimated glomerular filtration rate; MMDL: multimodal deep learning; RIDL: retinal image deep learning.
Figure 6.
Figure 6.. Sensitivity analysis: feature importance of age, sex, and 10 urine measurements in predicting the probability of eGFR decline by eGFR-MMDL using a test set. The sensitivity analysis scores indicate the relative impact of each variable on the predicted probability. Higher sensitivity analysis scores correspond to a greater influence. The error bars on the right end represent 95% CIs. 1e-08: 0.00000001; eGFR: estimated glomerular filtration rate; MMDL: multimodal deep learning.

References

    1. Wen J, Liu D, Wu Q, Zhao L, Iao WC, Lin H. Retinal image‐based artificial intelligence in detecting and predicting kidney diseases: current advances and future perspectives. [18-01-2025];VIEW. 2023 Jun;4(3):20220070. doi: 10.1002/VIW.20220070. https://onlinelibrary.wiley.com/toc/2688268x/4/3 URL. Accessed. doi. - DOI
    1. Kovesdy CP. Epidemiology of chronic kidney disease: an update 2022. Kidney Int Suppl (2011) 2022 Apr;12(1):7–11. doi: 10.1016/j.kisu.2021.11.003. doi. Medline. - DOI - PMC - PubMed
    1. Chapter 1: definition and classification of CKD. Kidney Int Suppl (2011) 2013 Jan;3(1):19–62. doi: 10.1038/kisup.2012.64. doi. Medline. - DOI - PMC - PubMed
    1. Go AS, Chertow GM, Fan D, McCulloch CE, Hsu C. Chronic kidney disease and the risks of death, cardiovascular events, and hospitalization. N Engl J Med. 2004 Sep 23;351(13):1296–1305. doi: 10.1056/NEJMoa041031. doi. Medline. - DOI - PubMed
    1. van der Velde M, Matsushita K, Coresh J, et al. Lower estimated glomerular filtration rate and higher albuminuria are associated with all-cause and cardiovascular mortality. A collaborative meta-analysis of high-risk population cohorts. Kidney Int. 2011 Jun;79(12):1341–1352. doi: 10.1038/ki.2010.536. doi. - DOI - PubMed