ChatGPT as a Clinical Decision Maker for Urolithiasis: Compliance with the Current European Association of Urology Guidelines
- PMID: 39318971
- PMCID: PMC11421362
- DOI: 10.1016/j.euros.2024.08.015
ChatGPT as a Clinical Decision Maker for Urolithiasis: Compliance with the Current European Association of Urology Guidelines
Abstract
Background and objective: Generative artificial intelligence models are among the most promising and widely used tools used in health care. This review investigates GPT-4 answers to decision-making questions regarding the diagnosis and treatment of urolithiasis across several clinical settings and their correspondence to the current European Association of Urology (EAU) guidelines.
Methods: In March 2024, the GPT-4 model was asked 11 questions, containing a brief description of a patient with urolithiasis. All questions were grouped according to urolithiasis care step: diagnosis, urgent care, scheduled intervention, and metaphylaxis. When responses were received, compliance with the current EAU guidelines was assessed by experienced urologists.
Key findings and limitations: Although all responses were provided with information that corresponded to EAU guidelines, six of the 11 answers were associated with missed guideline-provided parts, and incorrect data were given in eight of the 11 answers. GPT-4 is relatively safe in the initial diagnostic flow of patients suspected of having stones within the urinary tract and during treatment planning; however, its understanding of all the nuances of metaphylaxis leaves much to be desired and is far from the dogma given in the EAU guidelines. Moreover, GPT-4 knowledge of strategy and algorithm is not always aligned with the EAU guidelines.
Conclusions and clinical implications: Despite the fact that from the perspective of patients with urolithiasis, GPT-4 is capable of answering their questions well, the specificity of questions from urologists is labor intensive for its current version, and necessitates the ability to interpret it correctly and further attempts to improve it. While some aspects of diagnostics are more accurate, these struggle with surgical planning and algorithms in line with the EAU guidelines.
Patient summary: The generative artificial intelligence (AI) model GPT-4 is capable of answering urology-related questions, but lacks detailed responses. Although some aspects of the diagnostics are accurate, knowledge of surgical planning is not in line with the European Association of Urology guidelines. Future improvements should focus on efforts to enhance the accuracy, reliability, and clinical relevance of AI tools in urology.
Keywords: Clinical decision; Diagnosis; Generative pretrained transformer; Treatment; Urolithiasis.
© 2024 The Author(s).
Similar articles
-
What is the role of large language models in the management of urolithiasis?: a review.Urolithiasis. 2025 May 15;53(1):92. doi: 10.1007/s00240-025-01761-w. Urolithiasis. 2025. PMID: 40372452 Review.
-
Evaluating the performance of ChatGPT in answering questions related to urolithiasis.Int Urol Nephrol. 2024 Jan;56(1):17-21. doi: 10.1007/s11255-023-03773-0. Epub 2023 Sep 2. Int Urol Nephrol. 2024. PMID: 37658948
-
Assessing the Knowledge of ChatGPT in Answering Questions Regarding Female Urology.Urol J. 2024 Nov 27;21(6):410-414. doi: 10.22037/uj.v21i.8194. Urol J. 2024. PMID: 39580624
-
Annual updates of the European Association of Urology - European Society for Pediatric Urology (EAU-ESPU) paediatric urology guidelines: Are large-language models (LLM) better than the usual structured methodology?J Pediatr Urol. 2025 Jun 2:S1477-5131(25)00303-1. doi: 10.1016/j.jpurol.2025.05.030. Online ahead of print. J Pediatr Urol. 2025. PMID: 40514273
-
Best Practice in Interventional Management of Urolithiasis: An Update from the European Association of Urology Guidelines Panel for Urolithiasis 2022.Eur Urol Focus. 2023 Jan;9(1):199-208. doi: 10.1016/j.euf.2022.06.014. Epub 2022 Aug 1. Eur Urol Focus. 2023. PMID: 35927160 Review.
Cited by
-
ChatGPT as a Support Tool for Informed Consent and Preoperative Patient Education Prior to Penile Prosthesis Implantation.J Clin Med. 2024 Dec 10;13(24):7482. doi: 10.3390/jcm13247482. J Clin Med. 2024. PMID: 39768416 Free PMC article.
-
What is the role of large language models in the management of urolithiasis?: a review.Urolithiasis. 2025 May 15;53(1):92. doi: 10.1007/s00240-025-01761-w. Urolithiasis. 2025. PMID: 40372452 Review.
References
-
- Rajpurkar P., Chen E., Banerjee O., Topol E.J. AI in health and medicine. Nat Med. 2022;28:31–38. - PubMed
-
- Caglar U., Yildiz O., Meric A., et al. Evaluating the performance of ChatGPT in answering questions related to pediatric urology. J Pediatr Urol. 2024;20:26.e1–26.e5. - PubMed
-
- Caglar U., Yildiz O., Meric A., et al. Evaluating the performance of ChatGPT in answering questions related to benign prostate hyperplasia and prostate cancer. Minerva Urol Nephrol. 2023;75:729–733. - PubMed
Publication types
LinkOut - more resources
Full Text Sources