Utility of artificial intelligence-based large language models in ophthalmic care
- PMID: 38404172
- DOI: 10.1111/opo.13284
Abstract
Purpose: With the introduction of ChatGPT, artificial intelligence (AI)-based large language models (LLMs) are rapidly becoming popular within the scientific community. They use natural language processing to generate human-like responses to queries. However, the application of LLMs in ophthalmic care, and comparisons of their abilities with one another and with their human counterparts, remain under-reported.
Recent findings: To date, studies in eye care have demonstrated the utility of ChatGPT in generating patient information, making clinical diagnoses and passing ophthalmology question-based examinations, among others. LLM performance (median accuracy, %) is influenced by factors such as the model iteration, the prompts used and the domain. Human experts (86%) demonstrated the highest proficiency in disease diagnosis, while ChatGPT-4 outperformed others in ophthalmology examinations (75.9%), symptom triaging (98%) and providing information and answering questions (84.6%). LLMs exhibited superior performance in general ophthalmology but reduced accuracy in ophthalmic subspecialties. Although AI-based LLMs like ChatGPT are deemed more efficient than their human counterparts, they are constrained by nonspecific and outdated training, lack of access to current knowledge, generation of plausible-sounding 'fake' responses or hallucinations, inability to process images, lack of critical literature analysis and ethical and copyright issues. A comprehensive evaluation of recently published studies is crucial to deepen understanding of these LLMs and their potential.
Summary: Ophthalmic care professionals should take a conservative approach when using AI, as human judgement remains essential for clinical decision-making and for monitoring the accuracy of information. This review identified ophthalmic applications and potential uses that need further exploration. With the advancement of LLMs, setting standards for benchmarking and promoting best practices is crucial. Potential clinical deployment requires that the evaluation of these LLMs move away from artificial settings, extend into clinical trials and determine their usefulness in the real world.
Keywords: artificial intelligence; chatbot; large language model; ophthalmic care; ophthalmology; optometry.
© 2024 The Authors. Ophthalmic and Physiological Optics published by John Wiley & Sons Ltd on behalf of College of Optometrists.