Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Oct;30(10):2886-2896.
doi: 10.1038/s41591-024-03139-8. Epub 2024 Jul 19.

Integrated image-based deep learning and language models for primary diabetes care

Jiajia Li #  1   2 Zhouyu Guan #  1 Jing Wang #  3 Carol Y Cheung  4 Yingfeng Zheng  5 Lee-Ling Lim  6 Cynthia Ciwei Lim  7 Paisan Ruamviboonsuk  8 Rajiv Raman  9 Leonor Corsino  10 Justin B Echouffo-Tcheugui  11 Andrea O Y Luk  12   13   14   15 Li Jia Chen  4 Xiaodong Sun  16 Haslina Hamzah  17 Qiang Wu  18 Xiangning Wang  18 Ruhan Liu  1   2 Ya Xing Wang  19 Tingli Chen  3 Xiao Zhang  20 Xiaolong Yang  3 Jun Yin  1 Jing Wan  21 Wei Du  21 Ten Cheer Quek  17 Jocelyn Hui Lin Goh  17 Dawei Yang  4 Xiaoyan Hu  4 Truong X Nguyen  4 Simon K H Szeto  4 Peranut Chotcomwongse  8 Rachid Malek  22 Nargiza Normatova  23 Nilufar Ibragimova  24 Ramyaa Srinivasan  9 Pingting Zhong  5 Wenyong Huang  5 Chenxin Deng  25 Lei Ruan  25 Cuntai Zhang  25 Chenxi Zhang  26 Yan Zhou  26 Chan Wu  26 Rongping Dai  26 Sky Wei Chee Koh  27 Adina Abdullah  28 Nicholas Ken Yoong Hee  29 Hong Chang Tan  30 Zhong Hong Liew  7 Carolyn Shan-Yeu Tien  7 Shih Ling Kao  31   32 Amanda Yuan Ling Lim  31   32 Shao Feng Mok  31   32 Lina Sun  33 Jing Gu  33 Liang Wu  1 Tingyao Li  1   2 Di Cheng  1 Zheyuan Wang  1   2 Yiming Qin  1   2 Ling Dai  1   2 Ziyao Meng  1   2 Jia Shu  1   2 Yuwei Lu  1 Nan Jiang  1   2 Tingting Hu  1 Shan Huang  1   2 Gengyou Huang  1   2 Shujie Yu  1 Dan Liu  1 Weizhi Ma  34 Minyi Guo  1 Xinping Guan  35 Xiaokang Yang  2 Covadonga Bascaran  36 Charles R Cleland  36 Yuqian Bao  1 Elif I Ekinci  37   38   39 Alicia Jenkins  39   40   41 Juliana C N Chan  12   13   14   15 Yong Mong Bee  30 Sobha Sivaprasad  42 Jonathan E Shaw  38 Rafael Simó  43   44 Pearse A Keane  42   45 Ching-Yu Cheng  17   46 Gavin Siew Wei Tan  17 Weiping Jia  47 Yih-Chung Tham  48   49   50 Huating Li  51 Bin Sheng  52   53 Tien Yin Wong  54   55   56   57
Affiliations

Integrated image-based deep learning and language models for primary diabetes care

Jiajia Li et al. Nat Med. 2024 Oct.

Abstract

Primary diabetes care and diabetic retinopathy (DR) screening persist as major public health challenges due to a shortage of trained primary care physicians (PCPs), particularly in low-resource settings. Here, to bridge the gaps, we developed an integrated image-language system (DeepDR-LLM), combining a large language model (LLM module) and image-based deep learning (DeepDR-Transformer), to provide individualized diabetes management recommendations to PCPs. In a retrospective evaluation, the LLM module demonstrated comparable performance to PCPs and endocrinology residents when tested in English and outperformed PCPs and had comparable performance to endocrinology residents in Chinese. For identifying referable DR, the average PCP's accuracy was 81.0% unassisted and 92.3% assisted by DeepDR-Transformer. Furthermore, we performed a single-center real-world prospective study, deploying DeepDR-LLM. We compared diabetes management adherence of patients under the unassisted PCP arm (n = 397) with those under the PCP+DeepDR-LLM arm (n = 372). Patients with newly diagnosed diabetes in the PCP+DeepDR-LLM arm showed better self-management behaviors throughout follow-up (P < 0.05). For patients with referral DR, those in the PCP+DeepDR-LLM arm were more likely to adhere to DR referrals (P < 0.01). Additionally, DeepDR-LLM deployment improved the quality and empathy level of management recommendations. Given its multifaceted performance, DeepDR-LLM holds promise as a digital solution for enhancing primary diabetes care and DR screening.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Fig. 1
Fig. 1. Architecture of the DeepDR-LLM system.
The DeepDR-LLM system consists of two modules: (1) module I (LLM module), which provides individualized management recommendations for patients with diabetes; (2) module II (DeepDR-Transformer module), which performs image quality assessment, DR lesion segmentation and DR/DME grading from standard or portable fundus images. There are two modes of integrating module I and module II in the DeepDR-LLM system. In the physician-involved integration mode, the outputs of module II (that is, fundus image gradability; the lesion segmentation of microaneurysm, cotton-wool spot, hard exudate and hemorrhage; DR grade; and DME grade) could assist physicians in generating DR/DME diagnosis results (that is, fundus image gradability, DR grade, DME grade and the presence of lesions). In the automated integration mode, the DR/DME diagnosis results include fundus image gradability, DR grade, DME grade classified by module II, and the presence of lesions segmented out by module II. These DR/DME diagnosis results and other clinical metadata will be fed into module I to generate individualized management recommendations for people with diabetes.
Fig. 2
Fig. 2. Study design overview for the DeepDR-LLM system evaluation.
a, Head-to-head comparative assessment of diabetes management recommendations generated by DeepDR-LLM, nontuned LLaMA, PCPs and endocrinology residents, using 100 cases randomly selected from CNDCS. b, Efficacy analysis of the DeepDR-Transformer module on multiethnic datasets of standard and portable fundus images. c, Utility evaluation of the DeepDR-Transformer module as an assistive tool for PCPs and professional graders in the detection of referable DR. d, Study design of a two-arm, real-world, prospective study to evaluate the impact of DeepDR-LLM on patients’ self-management behavior. In the outcome analysis, for substudy I, 253 participants in the unassisted PCP arm and 234 participants in the PCP+DeepDR-LLM arm were included; for substudy II, 154 participants in the unassisted PCP arm and 144 participants in the PCP+DeepDR-LLM arm were included.
Fig. 3
Fig. 3. Head-to-head comparison between DeepDR-LLM, nontuned LLaMA, PCP and endocrinology resident in both English and Chinese.
a, Evaluators were invited to rate management recommendations for patients with diabetes, based on three domains, namely the extent of inappropriate content, the extent of missing content and the likelihood of possible harm, using 100 cases randomly selected from CNDCS. b, The total scores of management recommendations generated by LLaMA, DeepDR-LLM, PCPs and endocrinology residents, using 100 cases randomly selected from CNDCS. Box plot (n = 100), median and quartiles; whiskers, data range. The comparison was performed using two-sided Friedman tests. Post-hoc pairwise comparisons were performed using two-sided Wilcoxon signed-rank tests. P values for multiple comparisons were adjusted using the Bonferroni method. **P = 0.010, ***P < 0.001. Source data
Fig. 4
Fig. 4. Receiver operating characteristic curves showing performance of DeepDR-Transformer alone versus PCPs (when unassisted and assisted by DeepDR-Transformer) in identifying referable DR.
a, Standard fundus images (500 eyes: 250 nonreferable eyes and 250 referable eyes) graded by PCPs in the urban area. b, Portable fundus images (500 eyes: 250 nonreferable eyes and 250 referable eyes) graded by PCPs in the urban area. c, Standard fundus images (500 eyes: 250 nonreferable eyes and 250 referable eyes) graded by PCPs in the rural area. d, Portable fundus images (500 eyes: 250 nonreferable eyes and 250 referable eyes) graded by PCPs in the rural area. Source data
Fig. 5
Fig. 5. Quality and empathy ratings of the diabetes management recommendations by three consultant-level endocrinologists and 372 surveyed patients in the PCP+DeepDR-LLM arm.
a, Proportions of PCP, DeepDR-LLM and PCP+DeepDR-LLM’s recommendations being selected as the first-choice preference by consultant-level endocrinologists and patients (number of cases 372). Each of the three consultant-level endocrinologists was invited to evaluate all the 372 cases. The error bars show the Clopper–Pearson 95% CIs. b, Kernel density plots showing the quality and empathy ratings of PCP, DeepDR-LLM and PCP+DeepDR-LLM’s recommendations, as evaluated by three consultant-level endocrinologists (number of cases 372). Each of the three consultant-level endocrinologists was invited to evaluate all the 372 cases. c, Bar plots showing the quality and empathy ratings of PCP, DeepDR-LLM and PCP+DeepDR-LLM’s recommendations, as evaluated by the 372 surveyed patients (number of cases 372). Source data
Fig. 6
Fig. 6. Envisioning the future of primary diabetes care with the clinical integration of the DeepDR-LLM system.
First, patients with diabetes undergo comprehensive evaluations that include medical history taking that can be augmented by automated voice-to-text technology, physical examinations, laboratory assessments and fundus imaging. Following this, the DeepDR-LLM system processes the accumulated clinical data to concurrently deliver DR screening results and tailored management recommendations for PCPs. Subsequently, augmented with these AI-derived insights, PCPs then offer treatment guidance and health education to patients, either in person or through teleconsultation services.
Extended Data Fig. 1
Extended Data Fig. 1. Schematic overview of the DeepDR-LLM system.
a, Model architecture of the DeepDR-Transformer module. b, Model architecture of the LLM module. DME, diabetic macular edema; DR, diabetic retinopathy; NPDR, non-proliferative diabetic retinopathy; PDR, proliferative diabetic retinopathy; LLM, large language model; LoRA, Low-Rank Adaptation; MLP, Multi-Layer Perceptron.
Extended Data Fig. 2
Extended Data Fig. 2. Study design of head-to-head comparison on diabetic management recommendations between large language models (DeepDR-LLM and LLaMA) and clinicians in both English and Chinese languages.
CNDCS, China National Diabetic Complications Study; PCP, primary care physician; ADA, American Diabetes Association; ICO, International Council of Ophthalmology.
Extended Data Fig. 3
Extended Data Fig. 3. Study design of the real-world, two-arm, prospective study.
For patients in the unassisted PCP arm, ten patients were diagnosed with both newly diagnosed diabetes and referable DR. For patients in the PCP+DeepDR-LLM arm, six patients were diagnosed with both newly diagnosed diabetes and referable DR. PCP, primary care physician; DR, diabetic retinopathy.
Extended Data Fig. 4
Extended Data Fig. 4. Study design of the post-deployment evaluation of management recommendations’ quality and level of empathy.
a, In the PCP + DeepDR-LLM arm, the DeepDR-LLM system was integrated into the clinical workflow. Initially, PCPs and DeepDR-LLM gave management recommendations independently. The recommendations given by DeepDR-LLM was automatically generated from electronic health systems, by extracting and analyzing the fundus images, medical history, physical examinations, and laboratory tests. Subsequently, PCPs edited their recommendations in text form by taking DeepDR-LLM’s recommendations into account. b, For participants in the PCP + DeepDR-LLM arm, they filled out a questionnaire investigating their opinions on three recommendations at the 4-week follow-up. Evaluators, including endocrinologists and surveyed participants, ranked these three recommendations and judged both ‘the quality of information provided’ (very poor, poor, acceptable, good, or very good) and ‘the empathy or bedside manner provided’ (not empathetic, slightly empathetic, moderately empathetic, empathetic, and very empathetic).

References

    1. Sun, H. et al. IDF Diabetes Atlas: global, regional and country-level diabetes prevalence estimates for 2021 and projections for 2045. Diabetes Res. Clin. Pract.183, 109119 (2022). - PMC - PubMed
    1. Walker, A. F. et al. Interventions to address global inequity in diabetes: international progress. Lancet402, 250–264 (2023). - PMC - PubMed
    1. Jia, W. Diabetes care in China: innovations and implications. J. Diabetes Investig.13, 1795–1797 (2022). - PMC - PubMed
    1. Chan, J. C. N. et al. The Lancet Commission on diabetes: using data to transform diabetes care and patient lives. Lancet396, 2019–2082 (2021). - PubMed
    1. Bee, Y. M., Tai, E. S. & Wong, T. Y. Singapore’s ‘War on Diabetes’. Lancet Diabetes Endocrinol.10, 391–392 (2022). - PubMed

LinkOut - more resources