Capabilities of ChatGPT-3.5 as a Urological Triage System

Christopher Hirtsiefer¹, Tim Nestler², Johanna Eckrich³, Henrieke Beverungen⁴, Carolin Siech⁵, Cem Aksoy⁶, Marianne Leitsmann⁷,⁸, Martin Baunacke¹, Annemarie Uhlig⁹

Affiliations

¹ Klinik und Poliklinik für Urologie, Universitätsklinikum Carl Gustav Carus Dresden, Dresden, Germany.
² Klinik für Urologie, Bundeswehrzentralkrankenhaus Koblenz, Koblenz, Germany.
³ Klinik und Poliklinik für Urologie und Kinderurologie, Universitätsklinikum Bonn, Germany.
⁴ St. Elisabeth Krankenhaus Leipzig, Leipzig, Germany.
⁵ Goethe University Frankfurt, University Hospital, Frankfurt am Main, Germany.
⁶ Klinik für Urologie, Universitätsklinikum Gießen und Marburg, Marburg, Germany.
⁷ Universitätsklinik für Urologie, Medizinische Universität Graz, Graz, Austria.
⁸ aQua-Institut für angewandte Qualitätsförderung und Forschung im Gesundheitswesen GmbH, Göttingen, Germany.
⁹ Klinik für Urologie, Universitätsmedizin Göttingen, Göttingen, Germany.

PMID: 39554303
PMCID: PMC11567918
DOI: 10.1016/j.euros.2024.10.015

Christopher Hirtsiefer et al. Eur Urol Open Sci. 2024 Nov 1;70:148-153. doi: 10.1016/j.euros.2024.10.015. eCollection 2024 Dec.

Abstract

Background and objective: Patients struggle to classify symptoms, which hinders timely medical presentation. With 35-75% of patients seeking information online before consulting a health care professional, generative language-based artificial intelligence (AI), exemplified by ChatGPT-3.5 (GPT-3.5) from OpenAI, has emerged as an important source. The aim of our study was to evaluate the role of GPT-3.5 in triaging acute urological conditions to address a gap in current research.

Methods: We assessed GPT-3.5 performance in providing urological differential diagnoses (DD) and recommending a course of action (CoA). Six acute urological pathologies were identified for evaluation. Lay descriptions, sourced from patient forums, formed the basis for 472 queries that were independently entered by nine urologists. We evaluated the output in terms of compliance with the European Association of Urology (EAU) guidelines, the quality of the patient information using the validated DISCERN questionnaire, and a linguistic analysis.

Key findings and limitations: The median GPT-3.5 ratings were 4/5 for DD and CoA, and 3/5 for overall information quality. English outputs received higher median ratings than German outputs for DD (4.27 vs 3.95; p < 0.001) and CoA (4.25 vs 4.05; p < 0.005). There was no difference in performance between urgent and non-urgent cases. Analysis of the information quality revealed notable underperformance for source indication, risk assessment, and influence on quality of life.

Conclusion and clinical implications: Our results highlight the potential of GPT-3.5 as a triage system for offering individualized, empathetic advice mostly aligned with the EAU guidelines, outscoring other online information. Relevant shortcomings in information quality, especially for risk assessment, need to be addressed to enhance reliability. Broader transparency and quality improvements are needed before integration into patient care, primarily in English-speaking settings.

Patient summary: We looked at the performance of ChatGPT-3.5 for patients seeking urology advice. We entered more than 400 German and English inputs and assessed the possible diagnoses suggested by this artificial intelligence tool. ChatGPT-3.5 scored well in providing a complete list of possible diagnoses and recommending a course of action mostly in line with current guidelines. The quality of the information was good overall, but missing and unclear sources for the information can be a problem.

Keywords: Artificial intelligence; ChatGPT; Internet use; Triage; Urological emergency.

Figures

**Fig. 1**
Percentage results for Likert-scale ratings for differential diagnoses, recommendations on a course of action, and DISCERN question 16 on the overall quality of the ChatGPT output. 1 = extensive shortcomings/no conformity with guidelines; 2 = important shortcomings/some conformity with guidelines; 3 = potentially important shortcomings/partial conformity with guidelines; 4 = minor shortcomings/predominant conformity with guidelines; 5 = minimal shortcomings/full conformity with guidelines.
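
The kind of summary shown in Fig. 1 (a median plus the percentage of ratings at each scale point) can be sketched in a few lines. This is only an illustration under stated assumptions: the ratings below are hypothetical and are not study data, and the function name `likert_summary` is made up for this example rather than taken from the authors' analysis.

```python
from collections import Counter
from statistics import median

# Hypothetical ratings on the 1-5 scale described in the caption; NOT study data.
example_ratings = [5, 4, 4, 3, 5, 4, 2, 4, 5, 3]


def likert_summary(ratings):
    """Return the median and the percentage of ratings at each scale point 1-5."""
    if not ratings:
        raise ValueError("ratings must not be empty")
    counts = Counter(ratings)
    total = len(ratings)
    # Include scale points that were never chosen so every point 1-5 is reported.
    percentages = {
        point: 100.0 * counts.get(point, 0) / total for point in range(1, 6)
    }
    return median(ratings), percentages


med, pct = likert_summary(example_ratings)
print(f"median = {med}")
for point, share in pct.items():
    print(f"rating {point}: {share:.1f}%")
```

Reporting the median rather than the mean is a common choice for ordinal Likert data, because the distances between scale points are not assumed to be equal.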

**Fig. 2**
Descriptive statistics for all the DISCERN items. × = extreme outlier (interquartile range >3); ○ = mild outlier (interquartile range <3).
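
The outlier markers in this caption are described only briefly. A minimal sketch of one common boxplot convention follows, assuming that the thresholds refer to the distance from the box edge in multiples of the interquartile range, and that mild outliers start at 1.5 interquartile ranges. The 1.5 lower bound, the quartile method, the function name `classify_outliers`, and the scores are all assumptions for illustration; none of them comes from the source.

```python
from statistics import quantiles


def classify_outliers(values, mild_factor=1.5, extreme_factor=3.0):
    """Label each value as 'none', 'mild', or 'extreme' by its distance from the box.

    Assumed convention: distance is measured from the nearer quartile (Q1 or Q3)
    and compared against multiples of the interquartile range (IQR).
    """
    # Quartile definitions differ between statistics packages; "inclusive" is one option.
    q1, _, q3 = quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    labels = []
    for v in values:
        # Distance below Q1 or above Q3; zero for values inside the box.
        distance = max(q1 - v, v - q3, 0.0)
        if distance > extreme_factor * iqr:
            labels.append("extreme")
        elif distance > mild_factor * iqr:
            labels.append("mild")
        else:
            labels.append("none")
    return labels


# Hypothetical item scores on a 1-5 scale; NOT study data.
scores = [4, 5, 1, 4, 3, 4, 5, 4, 4, 5, 4]
for score, label in zip(scores, classify_outliers(scores)):
    print(score, label)
```

With tightly clustered ratings the interquartile range can be small, so even a score one or two points away from the box may be flagged under this convention.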
