Using a Deep Learning Algorithm and Integrated Gradients Explanation to Assist Grading for Diabetic Retinopathy

Rory Sayres¹, Ankur Taly¹, Ehsan Rahimy², Katy Blumer¹, David Coz¹, Naama Hammel¹, Jonathan Krause¹, Arunachalam Narayanaswamy¹, Zahra Rastegar¹, Derek Wu¹, Shawn Xu³, Scott Barb⁴, Anthony Joseph⁵, Michael Shumski⁶, Jesse Smith⁷, Arjun B Sood⁸, Greg S Corrado¹, Lily Peng⁹, Dale R Webster¹

Affiliations

¹ Google Research, Google, LLC, Mountain View, California.
² Department of Ophthalmology, Palo Alto Medical Foundation, Palo Alto, California.
³ Verily Life Sciences, South San Francisco, California.
⁴ Department of Ophthalmology, Emory University, Atlanta, Georgia.
⁵ Ophthalmic Consultants of Boston, Boston, Massachusetts.
⁶ Magruder Laser Vision, Orlando, Florida.
⁷ Denver Health Medical Center, Denver, Colorado; Department of Ophthalmology, University of Colorado School of Medicine, Aurora, Colorado.
⁸ Department of Ophthalmology, Massachusetts Eye and Ear Infirmary, Harvard Medical School, Boston, Massachusetts.
⁹ Google Research, Google, LLC, Mountain View, California. Electronic address: lhpeng@google.com.

PMID: 30553900
DOI: 10.1016/j.ophtha.2018.11.016

Free article

Using a Deep Learning Algorithm and Integrated Gradients Explanation to Assist Grading for Diabetic Retinopathy

Rory Sayres et al. Ophthalmology. 2019 Apr.

Free article

. 2019 Apr;126(4):552-564.

doi: 10.1016/j.ophtha.2018.11.016. Epub 2018 Dec 13.

Authors

Affiliations

¹ Google Research, Google, LLC, Mountain View, California.
² Department of Ophthalmology, Palo Alto Medical Foundation, Palo Alto, California.
³ Verily Life Sciences, South San Francisco, California.
⁴ Department of Ophthalmology, Emory University, Atlanta, Georgia.
⁵ Ophthalmic Consultants of Boston, Boston, Massachusetts.
⁶ Magruder Laser Vision, Orlando, Florida.
⁷ Denver Health Medical Center, Denver, Colorado; Department of Ophthalmology, University of Colorado School of Medicine, Aurora, Colorado.
⁸ Department of Ophthalmology, Massachusetts Eye and Ear Infirmary, Harvard Medical School, Boston, Massachusetts.
⁹ Google Research, Google, LLC, Mountain View, California. Electronic address: lhpeng@google.com.

PMID: 30553900
DOI: 10.1016/j.ophtha.2018.11.016

Abstract

Purpose: To understand the impact of deep learning diabetic retinopathy (DR) algorithms on physician readers in computer-assisted settings.

Design: Evaluation of diagnostic technology.

Participants: One thousand seven hundred ninety-six retinal fundus images from 1612 diabetic patients.

Methods: Ten ophthalmologists (5 general ophthalmologists, 4 retina specialists, 1 retina fellow) read images for DR severity based on the International Clinical Diabetic Retinopathy disease severity scale in each of 3 conditions: unassisted, grades only, or grades plus heatmap. Grades-only assistance comprised a histogram of DR predictions (grades) from a trained deep-learning model. For grades plus heatmap, we additionally showed explanatory heatmaps.

Main outcome measures: For each experiment arm, we computed sensitivity and specificity of each reader and the algorithm for different levels of DR severity against an adjudicated reference standard. We also measured accuracy (exact 5-class level agreement and Cohen's quadratically weighted κ), reader-reported confidence (5-point Likert scale), and grading time.

Results: Readers graded more accurately with model assistance than without for the grades-only condition (P < 0.001). Grades plus heatmaps improved accuracy for patients with DR (P < 0.001), but reduced accuracy for patients without DR (P = 0.006). Both forms of assistance increased readers' sensitivity moderate-or-worse DR: unassisted: mean, 79.4% [95% confidence interval (CI), 72.3%-86.5%]; grades only: mean, 87.5% [95% CI, 85.1%-89.9%]; grades plus heatmap: mean, 88.7% [95% CI, 84.9%-92.5%] without a corresponding drop in specificity (unassisted: mean, 96.6% [95% CI, 95.9%-97.4%]; grades only: mean, 96.1% [95% CI, 95.5%-96.7%]; grades plus heatmap: mean, 95.5% [95% CI, 94.8%-96.1%]). Algorithmic assistance increased the accuracy of retina specialists above that of the unassisted reader or model alone; and increased grading confidence and grading time across all readers. For most cases, grades plus heatmap was only as effective as grades only. Over the course of the experiment, grading time decreased across all conditions, although most sharply for grades plus heatmap.

Conclusions: Deep learning algorithms can improve the accuracy of, and confidence in, DR diagnosis in an assisted read setting. They also may increase grading time, although these effects may be ameliorated with experience.

PubMed Disclaimer

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources
- ClinicalKey
- Elsevier Science
Medical
- ClinicalTrials.gov
- MedlinePlus Health Information

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Using a Deep Learning Algorithm and Integrated Gradients Explanation to Assist Grading for Diabetic Retinopathy

Affiliations

Using a Deep Learning Algorithm and Integrated Gradients Explanation to Assist Grading for Diabetic Retinopathy

Authors

Affiliations

Abstract

Publication types

MeSH terms

LinkOut - more resources

Full Text Sources

Medical