Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Jul 14;37(3):173-178.
doi: 10.4103/sjopt.sjopt_91_23. eCollection 2023 Jul-Sep.

Vision transformers: The next frontier for deep learning-based ophthalmic image analysis

Affiliations

Vision transformers: The next frontier for deep learning-based ophthalmic image analysis

Jo-Hsuan Wu et al. Saudi J Ophthalmol. .

Abstract

Deep learning is the state-of-the-art machine learning technique for ophthalmic image analysis, and convolutional neural networks (CNNs) are the most commonly utilized approach. Recently, vision transformers (ViTs) have emerged as a promising approach, one that is even more powerful than CNNs. In this focused review, we summarized studies that applied ViT-based models to analyze color fundus photographs and optical coherence tomography images. Overall, ViT-based models showed robust performances in the grading of diabetic retinopathy and glaucoma detection. While some studies demonstrated that ViTs were superior to CNNs in certain contexts of use, it is unclear how widespread ViTs will be adopted for ophthalmic image analysis, since ViTs typically require even more training data as compared to CNNs. The studies included were identified from the PubMed and Google Scholar databases using keywords relevant to this review. Only original investigations through March 2023 were included.

Keywords: Color fundus photographs; deep learning; ophthalmic image analysis; optical coherence tomography; vision transformers.

PubMed Disclaimer

Conflict of interest statement

There are no conflicts of interest.

Similar articles

Cited by

References

    1. Yamashita R, Nishio M, Do RK, Togashi K. Convolutional neural networks: An overview and application in radiology. Insights Imaging. 2018;9:611–29. - PMC - PubMed
    1. Albawi S, Mohammed TA. and Al-Zawi S. "Understanding of a convolutional neural network," 2017 International Conference on Engineering and Technology (ICET), Antalya, Turkey. 2017:1–6. doi:10.1109/ICEngTechnol.2017.8308186.
    1. Wu JH, Liu TY, Hsu WT, Ho JH, Lee CC. Performance and limitation of machine learning algorithms for diabetic retinopathy screening: Meta-analysis. J Med Internet Res. 2021;23:e23863. - PMC - PubMed
    1. Wu JH, Nishida T, Weinreb RN, Lin JW. Performances of machine learning in detecting glaucoma using fundus and retinal optical coherence tomography images: A Meta-Analysis. Am J Ophthalmol. 2022;237:1–12. - PubMed
    1. Naseer MM, Ranasinghe K, Khan SH, Hayat M, Shahbaz Khan F, Yang MH. Intriguing properties of vision transformers. Adv Neural Inf Process Syst. 2021;34:23296–308.