Review
Curr Opin Ophthalmol. 2025 Jan 1;36(1):90-98. doi: 10.1097/ICU.0000000000001091. Epub 2024 Nov 4.

Foundation models in ophthalmology: opportunities and challenges


Mertcan Sevgi et al. Curr Opin Ophthalmol. 2025.

Abstract

Purpose of review: Last year marked the development of the first foundation model in ophthalmology, RETFound, setting the stage for generalizable medical artificial intelligence (GMAI) that can adapt to novel tasks. Additionally, rapid advancements in large language model (LLM) technology, including models such as GPT-4 and Gemini, have been tailored for medical specialization and evaluated on clinical scenarios with promising results. This review explores the opportunities and challenges for further advancements in these technologies.

Recent findings: RETFound outperforms traditional deep learning models in specific tasks, even when fine-tuned only on small datasets. Additionally, specialized LLMs such as Med-Gemini and Medprompt GPT-4 perform better than out-of-the-box models for ophthalmology tasks. However, there is still a significant deficiency in ophthalmology-specific multimodal models. This gap is primarily due to the substantial computational resources required to train these models and the limited availability of high-quality ophthalmology datasets.

Summary: Overall, foundation models in ophthalmology present promising opportunities but face challenges, particularly the need for high-quality, standardized datasets for training and specialization. Although development has primarily focused on large language and vision models, the greatest opportunities lie in advancing large multimodal models, which can more closely mimic the capabilities of clinicians.


Conflict of interest statement

There are no conflicts of interest.

Figures

Box 1: no caption available.
FIGURE 1
Schematic representation of training traditional deep learning models and foundation models. The differences between training traditional deep learning (DL) models and foundation models (FM) are highlighted. Traditional DL models typically require labelled datasets and are trained for specific tasks. In contrast, foundation models are usually trained once on unlabelled data and subsequently fine-tuned for a variety of tasks and modalities, such as segmentation, classification, and object detection. CFP, colour fundus photo; DN, diabetic retinopathy; MA, microaneurysm; OCT, optical coherence tomography; UWF, ultra-wide field. Adapted from [4], licensed under a Creative Commons Attribution 4.0 International (CC BY 4.0) Licence.
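The pretrain-then-adapt workflow in Figure 1 can be made concrete with a short code sketch. The snippet below is a minimal PyTorch illustration of the fine-tuning step only, assuming a frozen pretrained image encoder and a small labelled dataset; the RetinalEncoder class, the commented-out checkpoint path, and the random batch are hypothetical placeholders, not the actual RETFound release or its data.

```python
# Minimal sketch of foundation-model adaptation: reuse a pretrained encoder,
# freeze its weights, and train only a small task-specific head on labelled data.
import torch
import torch.nn as nn

class RetinalEncoder(nn.Module):
    """Stand-in for a pretrained retinal image backbone (hypothetical)."""
    def __init__(self, embed_dim: int = 768):
        super().__init__()
        self.backbone = nn.Sequential(nn.Flatten(), nn.LazyLinear(embed_dim))

    def forward(self, x):
        return self.backbone(x)

encoder = RetinalEncoder()
# encoder.load_state_dict(torch.load("pretrained_encoder.pth"))  # hypothetical checkpoint

# Freeze the pretrained weights; only the new classification head learns.
for p in encoder.parameters():
    p.requires_grad = False

head = nn.Linear(768, 5)  # e.g. 5 illustrative disease grades
optimizer = torch.optim.AdamW(head.parameters(), lr=1e-4)
loss_fn = nn.CrossEntropyLoss()

# One illustrative training step on a small labelled batch (placeholder data).
images = torch.randn(8, 3, 224, 224)   # placeholder colour fundus photos
labels = torch.randint(0, 5, (8,))     # placeholder labels
optimizer.zero_grad()
logits = head(encoder(images))
loss = loss_fn(logits, labels)
loss.backward()
optimizer.step()
```

The key contrast with the traditional pipeline in Figure 1 is that the expensive representation learning happens once on unlabelled data, while each downstream task only trains a lightweight head.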
FIGURE 2
Methods of specializing large language models. The various techniques used to tailor LLMs for specific applications, including fine-tuning, prompt engineering, and retrieval-augmented generation (RAG), are illustrated. Fine-tuning involves adjusting the internal model parameters to improve performance on a specific task, while prompt engineering and RAG do not alter the model parameters but instead enhance the model's output through different approaches.
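As a rough illustration of how retrieval-augmented generation differs from fine-tuning, the sketch below assembles a prompt from retrieved reference text without touching any model weights. The toy corpus, the lexical-overlap scoring, and the commented-out call_llm function are hypothetical stand-ins for a real vector store and model API.

```python
# Minimal RAG sketch: the LLM's parameters are unchanged; relevant reference
# text is retrieved and prepended to the prompt instead.
corpus = [
    "Anti-VEGF therapy is first-line treatment for neovascular (wet) AMD.",
    "Diabetic macular oedema may be monitored with OCT central subfield thickness.",
    "Treat-and-extend regimens adjust injection intervals based on disease activity.",
]

def retrieve(question: str, documents: list[str], k: int = 2) -> list[str]:
    # Toy lexical-overlap retrieval; a real system would use embedding similarity.
    q_terms = set(question.lower().split())
    scored = sorted(documents, key=lambda d: -len(q_terms & set(d.lower().split())))
    return scored[:k]

question = "How should the injection interval be adjusted for this wet AMD patient?"
context = "\n".join(retrieve(question, corpus))
prompt = f"Use the following references:\n{context}\n\nQuestion: {question}"

# answer = call_llm(prompt)  # hypothetical call to an unmodified, off-the-shelf LLM
print(prompt)
```

Prompt engineering works the same way at the interface level: the model is steered entirely through the input text, which is why neither approach requires the computational resources of fine-tuning.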
FIGURE 3
Visual question-and-answer example scenario involving an ophthalmologist using a large multimodal model for treating wet age-related macular degeneration. The LMM interprets OCT (optical coherence tomography) images of a patient with wet age-related macular degeneration, offering guidance on treatment adjustment. The model also responds to follow-up questions. Images are from [43], licensed under a Creative Commons Attribution 4.0 International (CC BY 4.0) Licence. LMM, large multimodal model.

References

    1. De Fauw J, Ledsam JR, Romera-Paredes B, et al. Clinically applicable deep learning for diagnosis and referral in retinal disease. Nat Med 2018; 24:1342–1350. - PubMed
    2. Chia MA, Antaki F, Zhou Y, et al. Foundation models in ophthalmology. Br J Ophthalmol 2024; 108:1341–1348. doi: 10.1136/bjo-2024-325459. [Epub ahead of print]. - PMC - PubMed
    3. Bommasani R, Hudson DA, Adeli E, et al. On the opportunities and risks of foundation models. arXiv [cs.LG]. 2021. Available at: http://arxiv.org/abs/2108.07258. [Accessed 3 June 2024]
    4. Ross A, McGrow K, Zhi D, et al. Foundation models, generative AI, and large language models: essentials for nursing. Comput Inform Nurs 2024; 42:377–387. - PMC - PubMed
    5. Brown TB, Mann B, Ryder N, et al. Language models are few-shot learners. arXiv [cs.CL]. 2020. Available at: https://arxiv.org/abs/2005.14165. [Accessed 3 June 2024]