Accuracy of ChatGPT in Neurolocalization
- PMID: 38803743
- PMCID: PMC11129669
- DOI: 10.7759/cureus.59143
Abstract
Introduction: ChatGPT (OpenAI Incorporated, Mission District, San Francisco, United States) is an artificial intelligence (AI) chatbot with advanced communication skills and a massive knowledge database. However, its application in medicine, specifically in neurolocalization, necessitates clinical reasoning in addition to deep neuroanatomical knowledge. This article examines ChatGPT's capabilities in neurolocalization.

Methods: Forty-six text-based neurolocalization case scenarios were presented to ChatGPT-3.5 between November 6 and November 16, 2023. Seven neurosurgeons rated the accuracy of ChatGPT's responses to these cases using a 5-point scoring system recommended by ChatGPT.

Results: ChatGPT-3.5 achieved an accuracy score of 84.8%, counting responses rated "completely correct" or "mostly correct." Analysis of variance (ANOVA) suggested a consistent scoring approach across evaluators. The mean length of the case text was 69.8 tokens (SD 20.8).

Conclusion: While this accuracy score is promising, ChatGPT is not yet reliable enough for routine patient care. We recommend keeping interactions with ChatGPT concise, precise, and simple to improve response accuracy. As AI continues to evolve, it is expected to deliver significant and innovative breakthroughs in medicine.
Keywords: anatomical localization; anatomy; artificial intelligence; brain anatomy; ChatGPT; diagnosis; generative pre-trained transformers; neuroanatomy; neurolocalization; neurosurgery.
Copyright © 2024, Dabbas et al.
Conflict of interest statement
The authors have declared that no competing interests exist.