Quality of information and appropriateness of Open AI outputs for prostate cancer
- PMID: 38228809
- DOI: 10.1038/s41391-024-00789-0
Abstract
ChatGPT, a natural language processing (NLP) tool created by OpenAI, could serve as a quick source of information on prostate cancer. This study analyzes the quality and appropriateness of ChatGPT's responses to prostate cancer inquiries, measured against the European Association of Urology (EAU) 2023 prostate cancer guidelines. In total, 195 questions were prepared from the recommendations in the prostate cancer section of the EAU 2023 guideline. All questions were systematically presented to the August 3 version of ChatGPT, and two expert urologists independently scored each response from 1 to 4 (1: completely correct, 2: correct but inadequate, 3: a mix of correct and misleading information, 4: completely incorrect). Sub-analyses per chapter and per grade of recommendation were performed. Of the 195 responses evaluated, 50 (26%) were completely correct, 51 (26%) correct but inadequate, 47 (24%) a mix of correct and misleading information, and 47 (24%) completely incorrect. By chapter, ChatGPT was most accurate on questions about follow-up and quality of life (QoL). The worst performance was recorded for the diagnosis and treatment chapters, with 19% and 30% of answers completely incorrect, respectively. Accuracy did not differ between weak and strong recommendations (p > 0.05). ChatGPT has poor accuracy when answering questions on the EAU prostate cancer (PCa) guideline recommendations. Future studies should assess its performance after adequate training.
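The score distribution reported in the abstract can be checked with a few lines of arithmetic. The sketch below is illustrative only and is not the authors' analysis code: the variable and function names are hypothetical, and the only data used are the four counts stated in the abstract.

```python
# Counts per score as reported in the abstract
# (1 = completely correct ... 4 = completely incorrect).
reported_counts = {1: 50, 2: 51, 3: 47, 4: 47}

# Rubric labels as given in the abstract.
score_labels = {
    1: "completely correct",
    2: "correct but inadequate",
    3: "mix of correct and misleading",
    4: "completely incorrect",
}


def score_distribution(counts):
    """Return each score's share of all responses as a whole-number percentage."""
    total = sum(counts.values())
    return {score: round(100 * n / total) for score, n in counts.items()}


total = sum(reported_counts.values())
percentages = score_distribution(reported_counts)

for score, pct in percentages.items():
    print(f"{score} ({score_labels[score]}): {reported_counts[score]}/{total} = {pct}%")
```

The four counts add up to the 195 responses evaluated, and the rounded shares match the percentages given in the abstract.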
© 2024. The Author(s), under exclusive licence to Springer Nature Limited.
Conflict of interest statement
Competing interests: The authors declare no competing interests. Ethical approval: The study was approved by a local ethical committee and was conducted in accordance with the principles of the Declaration of Helsinki.
Similar articles
- Evaluating the performance of ChatGPT in answering questions related to urolithiasis. Int Urol Nephrol. 2024 Jan;56(1):17-21. doi: 10.1007/s11255-023-03773-0. Epub 2023 Sep 2. PMID: 37658948
- Evaluating the performance of ChatGPT in answering questions related to pediatric urology. J Pediatr Urol. 2024 Feb;20(1):26.e1-26.e5. doi: 10.1016/j.jpurol.2023.08.003. Epub 2023 Aug 7. PMID: 37596194
- Evaluating the Efficacy of ChatGPT as a Patient Education Tool in Prostate Cancer: Multimetric Assessment. J Med Internet Res. 2024 Aug 14;26:e55939. doi: 10.2196/55939. PMID: 39141904 Free PMC article.
- ChatGPT as a Clinical Decision Maker for Urolithiasis: Compliance with the Current European Association of Urology Guidelines. Eur Urol Open Sci. 2024 Sep 16;69:51-62. doi: 10.1016/j.euros.2024.08.015. eCollection 2024 Nov. PMID: 39318971 Free PMC article. Review.
- EAU-EANM-ESTRO-ESUR-SIOG Guidelines on Prostate Cancer-2020 Update. Part 1: Screening, Diagnosis, and Local Treatment with Curative Intent. Eur Urol. 2021 Feb;79(2):243-262. doi: 10.1016/j.eururo.2020.09.042. Epub 2020 Nov 7. PMID: 33172724
Cited by
- Letter to the Editor on "Physician vs. AI-generated messages in urology: evaluation of accuracy, completeness, and preference by patients and physicians". World J Urol. 2025 May 6;43(1):272. doi: 10.1007/s00345-025-05587-4. PMID: 40327130 No abstract available.
- Accountability in AI medicine: A critical appraisal of ChatGPT in patient self-management and screening. Clin Mol Hepatol. 2025 Jan;31(1):e1-e2. doi: 10.3350/cmh.2024.0769. Epub 2024 Sep 26. PMID: 39323107 Free PMC article. No abstract available.
- Patient- and clinician-based evaluation of large language models for patient education in prostate cancer radiotherapy. Strahlenther Onkol. 2025 Mar;201(3):333-342. doi: 10.1007/s00066-024-02342-3. Epub 2025 Jan 10. PMID: 39792259 Free PMC article.
- Performance of large language models (LLMs) in providing prostate cancer information. BMC Urol. 2024 Aug 23;24(1):177. doi: 10.1186/s12894-024-01570-0. PMID: 39180045 Free PMC article.
- The performance of large language model-powered chatbots compared to oncology physicians on colorectal cancer queries. Int J Surg. 2024 Oct 1;110(10):6509-6517. doi: 10.1097/JS9.0000000000001850. PMID: 38935100 Free PMC article.