Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2024 Aug 1;14(15):1668.
doi: 10.3390/diagnostics14151668.

Exploring Publicly Accessible Optical Coherence Tomography Datasets: A Comprehensive Overview

Affiliations
Review

Exploring Publicly Accessible Optical Coherence Tomography Datasets: A Comprehensive Overview

Anastasiia Rozhyna et al. Diagnostics (Basel). .

Abstract

Artificial intelligence has transformed medical diagnostic capabilities, particularly through medical image analysis. AI algorithms perform well in detecting abnormalities with a strong performance, enabling computer-aided diagnosis by analyzing the extensive amounts of patient data. The data serve as a foundation upon which algorithms learn and make predictions. Thus, the importance of data cannot be underestimated, and clinically corresponding datasets are required. Many researchers face a lack of medical data due to limited access, privacy concerns, or the absence of available annotations. One of the most widely used diagnostic tools in ophthalmology is Optical Coherence Tomography (OCT). Addressing the data availability issue is crucial for enhancing AI applications in the field of OCT diagnostics. This review aims to provide a comprehensive analysis of all publicly accessible retinal OCT datasets. Our main objective is to compile a list of OCT datasets and their properties, which can serve as an accessible reference, facilitating data curation for medical image analysis tasks. For this review, we searched through the Zenodo repository, Mendeley Data repository, MEDLINE database, and Google Dataset search engine. We systematically evaluated all the identified datasets and found 23 open-access datasets containing OCT images, which significantly vary in terms of size, scope, and ground-truth labels. Our findings indicate the need for improvement in data-sharing practices and standardized documentation. Enhancing the availability and quality of OCT datasets will support the development of AI algorithms and ultimately improve diagnostic capabilities in ophthalmology. By providing a comprehensive list of accessible OCT datasets, this review aims to facilitate better utilization and development of AI in medical image analysis.

Keywords: OCT; data; data analysis; data sharing; datasets; open data; optical coherence tomography.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflicts of interest.

Figures

Figure 1
Figure 1
The dataset search with detailed search engines.
Figure 2
Figure 2
Exemplary sample images from datasets. (A)—Kermany et al.; (B)—Data of rare retinal disease; (C)—OCTID; (D)—Labeled Retinal Optical Coherence Tomography Dataset for Classification; (E)—OCT MS and HC data; (F)—Duke OCT.
Figure 3
Figure 3
Disease representation across datasets.
Figure 4
Figure 4
Origin of datasets.
Figure 5
Figure 5
Histogram for Kermany dataset.
Figure 6
Figure 6
Histogram for OCTID dataset.
Figure 7
Figure 7
Histogram for Labeled Retinal OCT Dataset for Classification dataset.

Similar articles

Cited by

References

    1. Dhar T., Dey N., Borra S., Sherratt R.S. Challenges of deep learning in medical image analysis—Improving explainability and trust. IEEE Trans. Technol. Soc. 2023;4:68–75. doi: 10.1109/TTS.2023.3234203. - DOI
    1. Shen D., Wu G., Suk H.-I. Deep learning in medical image analysis. Annu. Rev. Biomed. Eng. 2017;19:221–248. doi: 10.1146/annurev-bioeng-071516-044442. - DOI - PMC - PubMed
    1. Suzuki K. Overview of deep learning in medical imaging. Radiol. Phys. Technol. 2017;10:257–273. doi: 10.1007/s12194-017-0406-5. - DOI - PubMed
    1. Ker J., Wang L., Rao J., Lim T. Deep learning applications in medical image analysis. IEEE Access. 2017;6:9375–9389. doi: 10.1109/ACCESS.2017.2788044. - DOI
    1. Bansal M.A., Sharma R., Kathuria M. A systematic review on data scarcity problem in deep learning: Solution and applications. ACM Comput. Surv. (CSUR) 2022;54:1–29. doi: 10.1145/3502287. - DOI

LinkOut - more resources