Quality of information and appropriateness of Open AI outputs for prostate cancer

Affiliations

¹ Department of Urology, 'Sapienza' University of Rome, Rome, Italy. rlombardo@me.com.
² Department of Urology, 'Sapienza' University of Rome, Rome, Italy.

PMID: 38228809
DOI: 10.1038/s41391-024-00789-0

Riccardo Lombardo et al. Prostate Cancer Prostatic Dis. 2025 Mar;28(1):229-231. doi: 10.1038/s41391-024-00789-0. Epub 2024 Jan 16.

Abstract

ChatGPT, a natural language processing (NLP) tool created by OpenAI, can potentially be used as a quick source of information on prostate cancer. This study aims to analyze the quality and appropriateness of ChatGPT's responses to questions on prostate cancer, using the European Association of Urology (EAU) 2023 prostate cancer guidelines as the reference. Overall, 195 questions were prepared according to the recommendations in the prostate cancer section of the EAU 2023 guideline. All questions were systematically presented to the August 3 version of ChatGPT, and two expert urologists independently scored each response from 1 to 4 (1: completely correct, 2: correct but inadequate, 3: a mix of correct and misleading information, and 4: completely incorrect). Sub-analyses by chapter and by strength of recommendation were performed. Of the 195 responses evaluated, 50 (26%) were completely correct, 51 (26%) were correct but inadequate, 47 (24%) were a mix of correct and misleading information, and 47 (24%) were completely incorrect. By chapter, ChatGPT was particularly accurate in answering questions on follow-up and quality of life (QoL). The worst performance was recorded for the diagnosis and treatment chapters, in which 19% and 30% of the answers, respectively, were completely incorrect. By strength of recommendation, no differences in accuracy were recorded between weak and strong recommendations (p > 0.05). ChatGPT has poor accuracy when answering questions based on the EAU prostate cancer (PCa) guideline recommendations. Future studies should assess its performance after adequate training.
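
The proportions reported in the abstract can be checked from the four category counts alone. The following is a minimal sketch of that arithmetic, not the authors' analysis code; the function name `score_distribution` and the dictionary layout are assumptions made for illustration, and only the counts themselves come from the abstract.

```python
# Counts per score category as reported in the abstract (195 responses in total).
# The labels paraphrase the 1-4 scoring scale; the data layout is an assumption.
counts = {
    "completely correct": 50,
    "correct but inadequate": 51,
    "mix of correct and misleading": 47,
    "completely incorrect": 47,
}


def score_distribution(counts):
    """Return each category's share of the total, rounded to a whole percent."""
    total = sum(counts.values())
    return {label: round(100 * n / total) for label, n in counts.items()}


total = sum(counts.values())
distribution = score_distribution(counts)

# Print each category in the "n/N (p%)" style used in the abstract.
for label, pct in distribution.items():
    print(f"{label}: {counts[label]}/{total} ({pct}%)")
```

Rounding each share to a whole percent reproduces the 26%, 26%, 24% and 24% figures given above, and the four counts sum to the 195 responses evaluated.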

Conflict of interest statement

Competing interests: The authors declare no competing interests. Ethical approval: The study was approved by a local ethical committee and was conducted in accordance with the principles of the Declaration of Helsinki.
