This is a preprint.

It has not yet been peer reviewed by a journal.

The National Library of Medicine is running a pilot to include preprints that result from research funded by NIH in PMC and PubMed.

[Preprint]. 2025 Jul 7:2025.07.03.25330805.

doi: 10.1101/2025.07.03.25330805.

GlaucoRAG: A Retrieval-Augmented Large Language Model for Expert-Level Glaucoma Assessment

Mohammad Aminan¹, S Solomon Darnell², Mohammad Delsoz¹, Amin Nabavi¹, Claire Wright¹, Brian Jerkins¹, Siamak Yousefi¹

Affiliations

¹ Department of Ophthalmology, University of Tennessee Health Sciences Center Memphis, Tennessee, United States.
² Department of Genetics, Genomics and Informatics, University of Tennessee Health Sciences Center Memphis, Tennessee, United States.

PMID: 40672509
PMCID: PMC12265780
DOI: 10.1101/2025.07.03.25330805

GlaucoRAG: A Retrieval-Augmented Large Language Model for Expert-Level Glaucoma Assessment

Mohammad Aminan et al. medRxiv. 2025.

[Preprint]. 2025 Jul 7:2025.07.03.25330805.

doi: 10.1101/2025.07.03.25330805.

Authors

Mohammad Aminan¹, S Solomon Darnell², Mohammad Delsoz¹, Amin Nabavi¹, Claire Wright¹, Brian Jerkins¹, Siamak Yousefi¹

Affiliations

¹ Department of Ophthalmology, University of Tennessee Health Sciences Center Memphis, Tennessee, United States.
² Department of Genetics, Genomics and Informatics, University of Tennessee Health Sciences Center Memphis, Tennessee, United States.

PMID: 40672509
PMCID: PMC12265780
DOI: 10.1101/2025.07.03.25330805

Abstract

Purpose: Purpose: Accurate glaucoma assessment is challenging because of the complexity and chronic nature of the disease; therefore, there is a critical need for models that provide evidence-based, accurate assessment. The purpose of this study was to evaluate the capabilities of a glaucoma specialized Retrieval-Augmented Generation (RAG) framework (GlaucoRAG) that leverages a large language model (LLM) for diagnosing glaucoma and answering to glaucoma specific questions.

Design: Evaluation of diagnostic capabilities and knowledge of emerging technologies in glaucoma assessment.

Participants: Detailed case reports from 11 patients and 250 multiple choice questions from the Basic and Clinical Science Course (BCSC) Self-Assessment were used to test the LLM based GlaucoRAG. No human participants were involved.

Methods: We developed GlaucoRAG, a RAG framework leveraging GPT-4.5-PREVIEW integrated with the R2R platform for automated question answering in glaucoma. We created a glaucoma knowledge base comprising more than 1,800 peer-reviewed glaucoma articles, 15 guidelines and three glaucoma textbooks. The diagnostic performance was tested on case reports and multiple-choice questions. Model outputs were compared with the independent answers of three glaucoma specialists, DeepSeek-R1, and GPT-4.5-PREVIEW (without RAG). Quantitative performance was further assessed with the RAG Assessment (RAGAS) framework, reporting faithfulness, context precision, context recall, and answer relevancy.

Main outcome measures: The primary outcome measure was GlaucoRAG's diagnostic accuracy on patient case reports and percentage of correct responses to the BCSC Self-Assessment glaucoma items, compared with the performance of glaucoma specialists and two benchmark LLMs. Secondary outcomes included RAGAS sub scores.

Results: GlaucoRAG achieved an accuracy of 81.8% on glaucoma case reports, compared with 72.7% for GPT-4.5-PREVIEW and 63.7% for DeepSeek-R1. On glaucoma BCSC Self-Assessment questions, GlaucoRAG achieved 91.2% accuracy (228 / 250), whereas GPT-4.5-PREVIEW and DeepSeek-R1 attained 84.4% (211 / 250) and 76.0% (190 / 250), respectively. The RAGAS evaluation returned an answer relevancy of 91%, with 80% context recall, 70% faithfulness, and 59% context precision.

Conclusions: The glaucoma-specialized LLM, GlaucoRAG, showed encouraging performance in glaucoma assessment and may complement glaucoma research and clinical practice as well as question answering with glaucoma patients.

Keywords: Glaucoma; Glaucoma Specialized RAG (GlaucoRAG); Large Language Mdoel (LLM); Question Answering (QA); Retrieval-Augmented Generation (RAG).

PubMed Disclaimer

Figures

**Figure 1.**
Architecture of the glaucoma specialized retrieval augmented generation (GlaucoRAG) framework.

See this image and copyright information in PMC

References

1. Madadi Yeganeh, Delsoz Mohammad, Khouri Albert S., Boland Michael, Grzybowski Andrzej, and Yousefi Siamak. Applications of artificial intelligence-enabled robots and chatbots in ophthalmology: recent advances and future trends. Current Opinion in Ophthalmology, 35(3), 2024. - PMC - PubMed
2. N2 - Purpose of review Recent advances in artificial intelligence (AI), robotics, and chatbots have brought these technologies to the forefront of medicine, particularly ophthalmology. These technologies have been applied in diagnosis, prognosis, surgical operations, and patient-specific care in ophthalmology. It is thus both timely and pertinent to assess the existing landscape, recent advances, and trajectory of trends of AI, AI-enabled robots, and chatbots in ophthalmology. Recent findings Some recent developments have integrated AI enabled robotics with diagnosis, and surgical procedures in ophthalmology. More recently, large language models (LLMs) like ChatGPT have shown promise in augmenting research capabilities and diagnosing ophthalmic diseases. These developments may portend a new era of doctor-patient-machine collaboration. Summary Ophthalmology is undergoing a revolutionary change in research, clinical practice, and surgical interventions. Ophthalmic AI-enabled robotics and chatbot technologies based on LLMs are converging to create a new era of digital ophthalmology. Collectively, these developments portend a future in which conventional ophthalmic knowledge will be seamlessly integrated with AI to improve the patient experience and enhance therapeutic outcomes.
1. Madadi Yeganeh, Abu-Serhan Hashem, and Yousefi Siamak. Domain Adaptation-Based deep learning model for forecasting and diagnosis of glaucoma disease. Biomedical Signal Processing and Control, 92:106061, June 2024. - PMC - PubMed
1. Yousefi Siamak, Pasquale Louis R., Boland Michael V., and Johnson Chris A.. Machine-Identified Patterns of Visual Field Loss and an Association with Rapid Progression in the Ocular Hypertension Treatment Study. Ophthalmology, 129(12):1402–1411, December 2022. - PMC - PubMed
1. Yousefi Siamak, Elze Tobias, Pasquale Louis R., Saeedi Osamah, Wang Mengyu, Shen Lucy Q., Wellik Sarah R., De Moraes Carlos G., Myers Jonathan S., and Boland Michael V.. Monitoring Glaucomatous Functional Loss Using an Artificial Intelligence–Enabled Dashboard. Ophthalmology, 127(9):1170–1178, September 2020. - PMC - PubMed
1. Thakur Anshul, Goldbaum Michael, and Yousefi Siamak. Predicting Glaucoma before Onset Using Deep Learning. Ophthalmology Glaucoma, 3(4):262–268, July 2020. - PMC - PubMed

Publication types

Actions

Grants and funding

R01 EY033005/EY/NEI NIH HHS/United States

LinkOut - more resources

Full Text Sources
- Cold Spring Harbor Laboratory
- PubMed Central

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

This is a preprint.

GlaucoRAG: A Retrieval-Augmented Large Language Model for Expert-Level Glaucoma Assessment

Affiliations

GlaucoRAG: A Retrieval-Augmented Large Language Model for Expert-Level Glaucoma Assessment

Authors

Affiliations

Abstract

Figures

Similar articles

References

Publication types

Grants and funding

LinkOut - more resources

Full Text Sources

This is a preprint.

Abstract

Figures

Similar articles

References

Publication types

Related information

Grants and funding

LinkOut - more resources

Full Text Sources