Usefulness of Machine Learning for Identification of Referable Diabetic Retinopathy in a Large-Scale Population-Based Study
- PMID: 34977075
- PMCID: PMC8717406
- DOI: 10.3389/fmed.2021.773881
Usefulness of Machine Learning for Identification of Referable Diabetic Retinopathy in a Large-Scale Population-Based Study
Abstract
Purpose: To development and validation of machine learning-based classifiers based on simple non-ocular metrics for detecting referable diabetic retinopathy (RDR) in a large-scale Chinese population-based survey. Methods: The 1,418 patients with diabetes mellitus from 8,952 rural residents screened in the population-based Dongguan Eye Study were used for model development and validation. Eight algorithms [extreme gradient boosting (XGBoost), random forest, naïve Bayes, k-nearest neighbor (KNN), AdaBoost, Light GBM, artificial neural network (ANN), and logistic regression] were used for modeling to detect RDR in individuals with diabetes. The area under the receiver operating characteristic curve (AUC) and their 95% confidential interval (95% CI) were estimated using five-fold cross-validation as well as an 80:20 ratio of training and validation. Results: The 10 most important features in machine learning models were duration of diabetes, HbA1c, systolic blood pressure, triglyceride, body mass index, serum creatine, age, educational level, duration of hypertension, and income level. Based on these top 10 variables, the XGBoost model achieved the best discriminative performance, with an AUC of 0.816 (95%CI: 0.812, 0.820). The AUCs for logistic regression, AdaBoost, naïve Bayes, and Random forest were 0.766 (95%CI: 0.756, 0.776), 0.754 (95%CI: 0.744, 0.764), 0.753 (95%CI: 0.743, 0.763), and 0.705 (95%CI: 0.697, 0.713), respectively. Conclusions: A machine learning-based classifier that used 10 easily obtained non-ocular variables was able to effectively detect RDR patients. The importance scores of the variables provide insight to prevent the occurrence of RDR. Screening RDR with machine learning provides a useful complementary tool for clinical practice in resource-poor areas with limited ophthalmic infrastructure.
Keywords: XGBoost; classifier; diabetic retinopathy; machine learning; population-based study.
Copyright © 2021 Yang, Liu, Guo, Zhang, Zhang, Zhang, Zeng, Huang, Meng and Cui.
Conflict of interest statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Figures




Similar articles
-
Machine learning models for predicting critical illness risk in hospitalized patients with COVID-19 pneumonia.J Thorac Dis. 2021 Feb;13(2):1215-1229. doi: 10.21037/jtd-20-2580. J Thorac Dis. 2021. PMID: 33717594 Free PMC article.
-
Prediction of Acute Respiratory Distress Syndrome in Traumatic Brain Injury Patients Based on Machine Learning Algorithms.Medicina (Kaunas). 2023 Jan 15;59(1):171. doi: 10.3390/medicina59010171. Medicina (Kaunas). 2023. PMID: 36676795 Free PMC article.
-
Development and Validation of a Deep Learning Algorithm for Detection of Diabetic Retinopathy in Retinal Fundus Photographs.JAMA. 2016 Dec 13;316(22):2402-2410. doi: 10.1001/jama.2016.17216. JAMA. 2016. PMID: 27898976
-
Identification of Potential Type II Diabetes in a Large-Scale Chinese Population Using a Systematic Machine Learning Framework.J Diabetes Res. 2020 Sep 24;2020:6873891. doi: 10.1155/2020/6873891. eCollection 2020. J Diabetes Res. 2020. PMID: 33029536 Free PMC article.
-
Machine Learning Approaches to Predict Chronic Lower Back Pain in People Aged over 50 Years.Medicina (Kaunas). 2021 Nov 11;57(11):1230. doi: 10.3390/medicina57111230. Medicina (Kaunas). 2021. PMID: 34833448 Free PMC article.
Cited by
-
Performance of artificial intelligence in diabetic retinopathy screening: a systematic review and meta-analysis of prospective studies.Front Endocrinol (Lausanne). 2023 Jun 13;14:1197783. doi: 10.3389/fendo.2023.1197783. eCollection 2023. Front Endocrinol (Lausanne). 2023. PMID: 37383397 Free PMC article.
-
Predicting diabetic retinopathy based on routine laboratory tests by machine learning algorithms.Eur J Med Res. 2025 Mar 18;30(1):183. doi: 10.1186/s40001-025-02442-5. Eur J Med Res. 2025. PMID: 40102923 Free PMC article.
-
Identification of diabetic retinopathy classification using machine learning algorithms on clinical data and optical coherence tomography angiography.Eye (Lond). 2024 Oct;38(14):2813-2821. doi: 10.1038/s41433-024-03173-3. Epub 2024 Jun 13. Eye (Lond). 2024. PMID: 38871934
-
Risk prediction of integrated traditional Chinese and western medicine for diabetes retinopathy based on optimized gradient boosting classifier model.Medicine (Baltimore). 2024 Dec 20;103(51):e40896. doi: 10.1097/MD.0000000000040896. Medicine (Baltimore). 2024. PMID: 39705459 Free PMC article.
-
Predicting Implantable Collamer Lens Vault Using Machine Learning Based on Various Preoperative Biometric Factors.Transl Vis Sci Technol. 2024 Jan 2;13(1):8. doi: 10.1167/tvst.13.1.8. Transl Vis Sci Technol. 2024. PMID: 38224328 Free PMC article.
References
LinkOut - more resources
Full Text Sources