An observational study to assess if automated diabetic retinopathy image assessment software can replace one or more steps of manual imaging grading and to determine their cost-effectiveness
- PMID: 27981917
- PMCID: PMC5204130
- DOI: 10.3310/hta20920
An observational study to assess if automated diabetic retinopathy image assessment software can replace one or more steps of manual imaging grading and to determine their cost-effectiveness
Abstract
Background: Diabetic retinopathy screening in England involves labour-intensive manual grading of retinal images. Automated retinal image analysis systems (ARIASs) may offer an alternative to manual grading.
Objectives: To determine the screening performance and cost-effectiveness of ARIASs to replace level 1 human graders or pre-screen with ARIASs in the NHS diabetic eye screening programme (DESP). To examine technical issues associated with implementation.
Design: Observational retrospective measurement comparison study with a real-time evaluation of technical issues and a decision-analytic model to evaluate cost-effectiveness.
Setting: A NHS DESP.
Participants: Consecutive diabetic patients who attended a routine annual NHS DESP visit.
Interventions: Retinal images were manually graded and processed by three ARIASs: iGradingM (version 1.1; originally Medalytix Group Ltd, Manchester, UK, but purchased by Digital Healthcare, Cambridge, UK, at the initiation of the study, purchased in turn by EMIS Health, Leeds, UK, after conclusion of the study), Retmarker (version 0.8.2, Retmarker Ltd, Coimbra, Portugal) and EyeArt (Eyenuk Inc., Woodland Hills, CA, USA). The final manual grade was used as the reference standard. Arbitration on a subset of discrepancies between manual grading and the use of an ARIAS by a reading centre masked to all grading was used to create a reference standard manual grade modified by arbitration.
Main outcome measures: Screening performance (sensitivity, specificity, false-positive rate and likelihood ratios) and diagnostic accuracy [95% confidence intervals (CIs)] of ARIASs. A secondary analysis explored the influence of camera type and patients' ethnicity, age and sex on screening performance. Economic analysis estimated the cost per appropriate screening outcome identified.
Results: A total of 20,258 patients with 102,856 images were entered into the study. The sensitivity point estimates of the ARIASs were as follows: EyeArt 94.7% (95% CI 94.2% to 95.2%) for any retinopathy, 93.8% (95% CI 92.9% to 94.6%) for referable retinopathy and 99.6% (95% CI 97.0% to 99.9%) for proliferative retinopathy; and Retmarker 73.0% (95% CI 72.0% to 74.0%) for any retinopathy, 85.0% (95% CI 83.6% to 86.2%) for referable retinopathy and 97.9% (95% CI 94.9 to 99.1%) for proliferative retinopathy. iGradingM classified all images as either 'disease' or 'ungradable', limiting further iGradingM analysis. The sensitivity and false-positive rates for EyeArt were not affected by ethnicity, sex or camera type but sensitivity declined marginally with increasing patient age. The screening performance of Retmarker appeared to vary with patient's age, ethnicity and camera type. Both EyeArt and Retmarker were cost saving relative to manual grading either as a replacement for level 1 human grading or used prior to level 1 human grading, although the latter was less cost-effective. A threshold analysis testing the highest ARIAS cost per patient before which ARIASs became more expensive per appropriate outcome than human grading, when used to replace level 1 grader, was Retmarker £3.82 and EyeArt £2.71 per patient.
Limitations: The non-randomised study design limited the health economic analysis but the same retinal images were processed by all ARIASs in this measurement comparison study.
Conclusions: Retmarker and EyeArt achieved acceptable sensitivity for referable retinopathy and false-positive rates (compared with human graders as reference standard) and appear to be cost-effective alternatives to a purely manual grading approach. Future work is required to develop technical specifications to optimise deployment and address potential governance issues.
Funding: The National Institute for Health Research (NIHR) Health Technology Assessment programme, a Fight for Sight Grant (Hirsch grant award) and the Department of Health's NIHR Biomedical Research Centre for Ophthalmology at Moorfields Eye Hospital and the University College London Institute of Ophthalmology.
Similar articles
-
Automated Diabetic Retinopathy Image Assessment Software: Diagnostic Accuracy and Cost-Effectiveness Compared with Human Graders.Ophthalmology. 2017 Mar;124(3):343-351. doi: 10.1016/j.ophtha.2016.11.014. Epub 2016 Dec 23. Ophthalmology. 2017. PMID: 28024825
-
Diagnostic accuracy of diabetic retinopathy grading by an artificial intelligence-enabled algorithm compared with a human standard for wide-field true-colour confocal scanning and standard digital retinal images.Br J Ophthalmol. 2021 Feb;105(2):265-270. doi: 10.1136/bjophthalmol-2019-315394. Epub 2020 May 6. Br J Ophthalmol. 2021. PMID: 32376611
-
A study of whether automated Diabetic Retinopathy Image Assessment could replace manual grading steps in the English National Screening Programme.J Med Screen. 2015 Sep;22(3):112-8. doi: 10.1177/0969141315571953. Epub 2015 Mar 5. J Med Screen. 2015. PMID: 25742804
-
Individualised variable-interval risk-based screening in diabetic retinopathy: the ISDR research programme including RCT.Southampton (UK): National Institute for Health and Care Research; 2023 Oct. Southampton (UK): National Institute for Health and Care Research; 2023 Oct. PMID: 37943975 Free Books & Documents. Review.
-
The value of digital imaging in diabetic retinopathy.Health Technol Assess. 2003;7(30):1-119. doi: 10.3310/hta7300. Health Technol Assess. 2003. PMID: 14604499 Review.
Cited by
-
Teleophthalmology and Artificial Intelligence As Game Changers in Ophthalmic Care After the COVID-19 Pandemic.Cureus. 2021 Jul 14;13(7):e16392. doi: 10.7759/cureus.16392. eCollection 2021 Jul. Cureus. 2021. PMID: 34408945 Free PMC article. Review.
-
Multicenter, Head-to-Head, Real-World Validation Study of Seven Automated Artificial Intelligence Diabetic Retinopathy Screening Systems.Diabetes Care. 2021 May;44(5):1168-1175. doi: 10.2337/dc20-1877. Epub 2021 Jan 5. Diabetes Care. 2021. PMID: 33402366 Free PMC article.
-
Comparison of Validity and Reliability of Manual Consensus Grading vs. Automated AI Grading for Diabetic Retinopathy Screening in Oslo, Norway: A Cross-Sectional Pilot Study.J Clin Med. 2025 Jul 7;14(13):4810. doi: 10.3390/jcm14134810. J Clin Med. 2025. PMID: 40649184 Free PMC article.
-
Head to head comparison of diagnostic performance of three non-mydriatic cameras for diabetic retinopathy screening with artificial intelligence.Eye (Lond). 2024 Jun;38(9):1694-1701. doi: 10.1038/s41433-024-03000-9. Epub 2024 Mar 11. Eye (Lond). 2024. PMID: 38467864 Free PMC article.
-
Prospective evaluation of an artificial intelligence-enabled algorithm for automated diabetic retinopathy screening of 30 000 patients.Br J Ophthalmol. 2021 May;105(5):723-728. doi: 10.1136/bjophthalmol-2020-316594. Epub 2020 Jun 30. Br J Ophthalmol. 2021. PMID: 32606081 Free PMC article.
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Medical