Am J Ophthalmol. 2024 Oct;266:289-299.
doi: 10.1016/j.ajo.2024.05.022. Epub 2024 May 31.

Predicting Glaucoma Before Onset Using a Large Language Model Chatbot

Xiaoqin Huang et al. Am J Ophthalmol. 2024 Oct.

Abstract

Purpose: To investigate the capability of ChatGPT for forecasting the conversion from ocular hypertension (OHT) to glaucoma based on the Ocular Hypertension Treatment Study (OHTS).

Design: Retrospective case-control study.

Participants: A total of 3008 eyes of 1504 subjects from the OHTS were included in the study.

Methods: We selected demographic, clinical, ocular, optic nerve head, and visual field (VF) parameters recorded 1 year before glaucoma development from the OHTS participants. We then built queries by converting the tabular parameters for both eyes of each participant into text. We used the ChatGPT application programming interface (API) to submit the prompts automatically for all subjects. Finally, we assessed how accurately ChatGPT forecast conversion from OHT to glaucoma using several objective metrics.
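The table-to-text step described in the Methods can be illustrated with a minimal Python sketch. The abstract does not give the actual parameter list or prompt wording, so the field names (`iop`, `cct`, `vcdr`, `psd`), their labels, and the question text below are all hypothetical. The call to the ChatGPT API is deliberately left out.

```python
from typing import Mapping

# Hypothetical per-eye fields and labels; the abstract does not list
# the exact parameters or wording used in the study.
FIELD_LABELS = {
    "iop": "intraocular pressure (mmHg)",
    "cct": "central corneal thickness (micrometers)",
    "vcdr": "vertical cup-to-disc ratio",
    "psd": "visual field pattern standard deviation (dB)",
}


def describe_eye(name: str, eye: Mapping[str, float]) -> str:
    """Turn one eye's tabular values into a single sentence."""
    parts = [f"{FIELD_LABELS[key]} = {eye[key]}" for key in FIELD_LABELS if key in eye]
    return f"{name} eye: " + ", ".join(parts) + "."


def build_query(right: Mapping[str, float], left: Mapping[str, float]) -> str:
    """Assemble a text query covering both eyes of one participant."""
    return "\n".join([
        "A patient with ocular hypertension has the following measurements.",
        describe_eye("Right", right),
        describe_eye("Left", left),
        "Will this patient develop glaucoma within one year? Answer yes or no.",
    ])
```

Calling `build_query({"iop": 25.0, "cct": 540.0}, {"iop": 24.0})` yields a four-line prompt. Missing fields are simply skipped, which is one possible way to handle incomplete records.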

Main outcome measure: Accuracy, area under the receiver operating characteristic curve (AUC), sensitivity, specificity, and weighted F1 score.

Results: ChatGPT4.0 demonstrated an accuracy of 75%, AUC of 0.67, sensitivity of 56%, specificity of 78%, and weighted F1 score of 0.77 in predicting conversion to glaucoma 1 year before onset. ChatGPT3.5 provided an accuracy of 61%, AUC of 0.62, sensitivity of 64%, specificity of 59%, and weighted F1 score of 0.63 in predicting conversion to glaucoma 1 year before onset.
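For readers unfamiliar with these metrics, the following sketch shows how they could be computed from binary labels (1 = converted to glaucoma, 0 = did not). It is an illustration in plain Python, not the authors' evaluation code, and the function name `binary_metrics` is hypothetical. The abstract does not say how AUC was obtained. When only hard yes/no predictions are available, the ROC curve has a single operating point and the trapezoidal AUC reduces to the mean of sensitivity and specificity, which is what `auc_single_point` computes.

```python
from typing import Dict, Sequence


def binary_metrics(y_true: Sequence[int], y_pred: Sequence[int]) -> Dict[str, float]:
    """Compute accuracy, sensitivity, specificity, weighted F1, and single-point AUC."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)

    def f1(hit: int, false_alarm: int, miss: int) -> float:
        denom = 2 * hit + false_alarm + miss
        return 2 * hit / denom if denom else 0.0

    n_pos, n_neg = tp + fn, tn + fp
    n = n_pos + n_neg
    sensitivity = tp / n_pos if n_pos else 0.0
    specificity = tn / n_neg if n_neg else 0.0
    # Weighted F1: per-class F1 scores averaged with class support as weights.
    weighted_f1 = (n_pos * f1(tp, fp, fn) + n_neg * f1(tn, fn, fp)) / n
    return {
        "accuracy": (tp + tn) / n,
        "sensitivity": sensitivity,
        "specificity": specificity,
        "weighted_f1": weighted_f1,
        # AUC of a ROC curve with one operating point (hard predictions only).
        "auc_single_point": (sensitivity + specificity) / 2,
    }
```

Applied to the reported ChatGPT4.0 figures, (0.56 + 0.78) / 2 = 0.67, which agrees with the stated AUC, although this does not establish how the authors calculated it.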

Conclusions: The performance of ChatGPT4.0 in forecasting development of glaucoma 1 year before onset was reasonable. The overall performance of ChatGPT4.0 was consistently higher than that of ChatGPT3.5. Large language models (LLMs) hold great promise for augmenting glaucoma research capabilities and enhancing clinical care. Future efforts to create ophthalmology-specific LLMs that leverage multimodal data in combination with active learning may lead to more useful integration with clinical practice and deserve further investigation.

Conflict of interest statement

Declaration of interests

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
