Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Oct 24;14(1):6757.
doi: 10.1038/s41467-023-42444-7.

Uncertainty-inspired open set learning for retinal anomaly identification

Affiliations

Uncertainty-inspired open set learning for retinal anomaly identification

Meng Wang et al. Nat Commun. .

Abstract

Failure to recognize samples from the classes unseen during training is a major limitation of artificial intelligence in the real-world implementation for recognition and classification of retinal anomalies. We establish an uncertainty-inspired open set (UIOS) model, which is trained with fundus images of 9 retinal conditions. Besides assessing the probability of each category, UIOS also calculates an uncertainty score to express its confidence. Our UIOS model with thresholding strategy achieves an F1 score of 99.55%, 97.01% and 91.91% for the internal testing set, external target categories (TC)-JSIEC dataset and TC-unseen testing set, respectively, compared to the F1 score of 92.20%, 80.69% and 64.74% by the standard AI model. Furthermore, UIOS correctly predicts high uncertainty scores, which would prompt the need for a manual check in the datasets of non-target categories retinal diseases, low-quality fundus images, and non-fundus images. UIOS provides a robust method for real-world screening of retinal anomalies.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Fig. 1
Fig. 1. The overview of the uncertainty-inspired open set (UIOS) learning for retinal anomaly classification.
Standard artificial intelligence (AI) and our proposed UIOS AI models were trained with the same dataset with 9 categories of retinal photos. In testing, standard AI model assigns a probability value (pi) to each of the 9 categories, and the one with the highest probability is output as the diagnosis. Even when the model is tested with a retinal image with disease outside the training set, the model still outputs one from the 9 categories, which may lead to misdiagnosis. In contrast, UIOS outputs an uncertainty score (μ) besides the probability (pi) for the 9 categories. When the model is fed with an image with obvious features of retinal disease in the 9 categories, the uncertainty-based classifier will output a prediction result with a low uncertainty score below the threshold θ to indicate that the diagnosis result is reliable. Conversely, when the input data contains ambiguous features or is an anomaly outside of training categories, the model will assign a high uncertainty score above threshold θ to explicitly indicate that the prediction result is unreliable and requires a double-check from their ophthalmologist to avoid misdiagnosis.
Fig. 2
Fig. 2. The receiver operating characteristic (ROC) curves of the standard AI model, our UIOS, and UIOS+thresholding in internal and two external testing datasets.
Source data are provided as a Source data file.
Fig. 3
Fig. 3. Uncertainty density distribution for different datasets.
Different colored solid lines indicate different test datasets for target categories of retinal diseases, while different colored dashed lines indicate different out of distribution datasets. θ threshold theta. Source data are provided as a Source data file.
Fig. 4
Fig. 4. Four samples of fundus images detected with the standard AI model and our UIOS model.
a, b Two samples with correct diagnostic results from both the standard AI model and our UIOS model. c, d Two samples with incorrect diagnostic results from the standard AI model and our UIOS model. Unlike the standard AI model, which directly takes the fundus disease category with the highest probability score as the final diagnosis result, our UIOS will not only give the probability scores but also provide the corresponding uncertainty score to reflect the reliability of the prediction result. If the uncertainty score is less than the threshold theta, indicating the model prediction is reliable; Conversely, if the uncertainty score is greater than the threshold theta, which represents that the result is unreliable, and manual double-checking is required to avoid possible misdiagnosis problems. US uncertainty score, θ threshold theta. Source data are provided as a Source data file.
Fig. 5
Fig. 5. Testing results of OOD samples that were not included in the training category.
Besides assigning a probability to OOD samples as the standard AI model, our UIOS model also assigns a high uncertainty score to indicate that the final decision is unreliable and needs a double-check. US uncertainty score, θ threshold theta. Source data are provided as a Source data file. a from NTC dataset, b also from NTC dataset, c from low-quality dataset, d an OCT image from RETOUCH dataset, and e an OCTA image from OCTA dataset.

References

    1. Scanlon PH. The contribution of the english nhs diabetic eye screening programme to reductions in diabetes-related blindness, comparisons within europe, and future challenges. Acta Diabetol. 2021;58:521–530. doi: 10.1007/s00592-021-01687-w. - DOI - PMC - PubMed
    1. Gulshan V, et al. Development and validation of a deep learning algorithm for detection of diabetic retinopathy in retinal fundus photographs. JAMA. 2016;316:2402. doi: 10.1001/jama.2016.17216. - DOI - PubMed
    1. Ting DSW, et al. Development and validation of a deep learning system for diabetic retinopathy and related eye diseases using retinal images from multiethnic populations with diabetes. JAMA. 2017;318:2211. doi: 10.1001/jama.2017.18152. - DOI - PMC - PubMed
    1. Litjens G, et al. A survey on deep learning in medical image analysis. Med. Image Anal. 2017;42:60–88. doi: 10.1016/j.media.2017.07.005. - DOI - PubMed
    1. Bellemo V, et al. Artificial intelligence using deep learning to screen for referable and vision-threatening diabetic retinopathy in Africa: a clinical validation study. Lancet Digital Health. 2019;1:e35–e44. doi: 10.1016/S2589-7500(19)30004-4. - DOI - PubMed

Publication types