Quality of information and appropriateness of ChatGPT outputs for urology patients

Affiliations

¹ Urology Section, University of Florence, Florence, Italy. cocci.andrea@gmail.com.
² Urology Section, University of Florence, Florence, Italy.
³ Urology Section, University of Catania, Catania, Italy.
⁴ Department of Clinical Medicine, University of Copenhagen, Copenhagen, Denmark.
⁵ Department of Urology, Copenhagen University Hospital, Herlev and Gentofte Hospital, Copenhagen, Denmark.
⁶ Institute of Urology, Keck School of Medicine, University of Southern California (USC), Los Angeles, CA, USA.

PMID: 37516804
DOI: 10.1038/s41391-023-00705-y

Quality of information and appropriateness of ChatGPT outputs for urology patients

Andrea Cocci et al. Prostate Cancer Prostatic Dis. 2024 Mar.

. 2024 Mar;27(1):103-108.

doi: 10.1038/s41391-023-00705-y. Epub 2023 Jul 29.

Authors

Affiliations

¹ Urology Section, University of Florence, Florence, Italy. cocci.andrea@gmail.com.
² Urology Section, University of Florence, Florence, Italy.
³ Urology Section, University of Catania, Catania, Italy.
⁴ Department of Clinical Medicine, University of Copenhagen, Copenhagen, Denmark.
⁵ Department of Urology, Copenhagen University Hospital, Herlev and Gentofte Hospital, Copenhagen, Denmark.
⁶ Institute of Urology, Keck School of Medicine, University of Southern California (USC), Los Angeles, CA, USA.

PMID: 37516804
DOI: 10.1038/s41391-023-00705-y

Abstract

Background: The proportion of health-related searches on the internet is continuously growing. ChatGPT, a natural language processing (NLP) tool created by OpenAI, has been gaining increasing user attention and can potentially be used as a source for obtaining information related to health concerns. This study aims to analyze the quality and appropriateness of ChatGPT's responses to Urology case studies compared to those of a urologist.

Methods: Data from 100 patient case studies, comprising patient demographics, medical history, and urologic complaints, were sequentially inputted into ChatGPT, one by one. A question was posed to determine the most likely diagnosis, suggested examinations, and treatment options. The responses generated by ChatGPT were then compared to those provided by a board-certified urologist who was blinded to ChatGPT's responses and graded on a 5-point Likert scale based on accuracy, comprehensiveness, and clarity as criterias for appropriateness. The quality of information was graded based on the section 2 of the DISCERN tool and readability assessments were performed using the Flesch Reading Ease (FRE) and Flesch-Kincaid Reading Grade Level (FKGL) formulas.

Results: 52% of all responses were deemed appropriate. ChatGPT provided more appropriate responses for non-oncology conditions (58.5%) compared to oncology (52.6%) and emergency urology cases (11.1%) (p = 0.03). The median score of the DISCERN tool was 15 (IQR = 5.3) corresponding to a quality score of poor. The ChatGPT responses demonstrated a college graduate reading level, as indicated by the median FRE score of 18 (IQR = 21) and the median FKGL score of 15.8 (IQR = 3).

Conclusions: ChatGPT serves as an interactive tool for providing medical information online, offering the possibility of enhancing health outcomes and patient satisfaction. Nevertheless, the insufficient appropriateness and poor quality of the responses on Urology cases emphasizes the importance of thorough evaluation and use of NLP-generated outputs when addressing health-related concerns.

PubMed Disclaimer

References

1. Wise J. How Many People Use the Internet Daily in 2023? - EarthWeb. https://earthweb.com/how-many-people-use-the-internet-daily/ (accessed 15 May2023).
1. NTIA. More than Half of American Households Used the Internet for Health-Related Activities in 2019, NTIA Data Show | National Telecommunications and Information Administration. https://www.ntia.gov/blog/2020/more-half-american-households-used-intern... (accessed 2 May2023).
1. Eysenbach G, Kohler C. What is the prevalence of health-related searches on the World Wide Web? Qualitative and quantitative analysis of search engine queries on the Internet. AMIA Annu Symp Proc. 2003;2003:225. - PubMed - PMC
1. Introducing ChatGPT. https://openai.com/blog/chatgpt (accessed 15 May2023).
1. Liu Y, Yang Z, Yu Z, Liu Z, Liu D, Lin H, et al. Generative artificial intelligence and its applications in materials science: Current situation and future perspectives. J Mater. 2023. https://doi.org/10.1016/J.JMAT.2023.05.001 .

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources
- Nature Publishing Group
Medical
- MedlinePlus Health Information

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Quality of information and appropriateness of ChatGPT outputs for urology patients

Affiliations

Quality of information and appropriateness of ChatGPT outputs for urology patients

Authors

Affiliations

Abstract

References

MeSH terms

LinkOut - more resources

Full Text Sources

Medical