Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Mar 7;8(1):149.
doi: 10.1038/s41746-025-01542-0.

Red teaming ChatGPT in medicine to yield real-world insights on model behavior

Crystal T Chang #  1   2 Hodan Farah #  1 Haiwen Gui #  1   3 Shawheen Justin Rezaei  3 Charbel Bou-Khalil  3 Ye-Jean Park  4 Akshay Swaminathan  3 Jesutofunmi A Omiye  1   5 Akaash Kolluri  6 Akash Chaurasia  7   8 Alejandro Lozano  5 Alice Heiman  6 Allison Sihan Jia  6 Amit Kaushal  9 Angela Jia  6 Angelica Iacovelli  10 Archer Yang  5   11 Arghavan Salles  6 Arpita Singhal  7 Balasubramanian Narasimhan  6 Benjamin Belai  12 Benjamin H Jacobson  3 Binglan Li  5 Celeste H Poe  3 Chandan Sanghera  6 Chenming Zheng  3 Conor Messer  6 Damien Varid Kettud  6 Deven Pandya  6 Dhamanpreet Kaur  3 Diana Hla  13 Diba Dindoust  6 Dominik Moehrle  3 Duncan Ross  14 Ellaine Chou  5 Eric Lin  15 Fateme Nateghi Haredasht  8 Ge Cheng  5 Irena Gao  6 Jacob Chang  5 Jake Silberg  5 Jason A Fries  8 Jiapeng Xu  5 Joe Jamison  14 John S Tamaresis  5 Jonathan H Chen  2   8   16 Joshua Lazaro  5 Juan M Banda  17 Julie J Lee  10 Karen Ebert Matthys  5 Kirsten R Steffner  18 Lu Tian  6 Luca Pegolotti  10 Malathi Srinivasan  3 Maniragav Manimaran  19 Matthew Schwede  16 Minghe Zhang  14 Minh Nguyen  6 Mohsen Fathzadeh  20 Qian Zhao  5 Rika Bajra  3 Rohit Khurana  5 Ruhana Azam  6 Rush Bartlett  21 Sang T Truong  7 Scott L Fleming  5 Shriti Raj  8 Solveig Behr  22 Sonia Onyeka  1 Sri Muppidi  6 Tarek Bandali  6 Tiffany Y Eulalio  5 Wenyuan Chen  5 Xuanyu Zhou  20 Yanan Ding  5   23   24 Ying Cui  6 Yuqi Tan  25 Yutong Liu  20 Nigam Shah  3   5 Roxana Daneshjou  26   27
Affiliations

Red teaming ChatGPT in medicine to yield real-world insights on model behavior

Crystal T Chang et al. NPJ Digit Med. .

Abstract

Red teaming, the practice of adversarially exposing unexpected or undesired model behaviors, is critical towards improving equity and accuracy of large language models, but non-model creator-affiliated red teaming is scant in healthcare. We convened teams of clinicians, medical and engineering students, and technical professionals (80 participants total) to stress-test models with real-world clinical cases and categorize inappropriate responses along axes of safety, privacy, hallucinations/accuracy, and bias. Six medically-trained reviewers re-analyzed prompt-response pairs and added qualitative annotations. Of 376 unique prompts (1504 responses), 20.1% were inappropriate (GPT-3.5: 25.8%; GPT-4.0: 16%; GPT-4.0 with Internet: 17.8%). Subsequently, we show the utility of our benchmark by testing GPT-4o, a model released after our event (20.4% inappropriate). 21.5% of responses appropriate with GPT-3.5 were inappropriate in updated models. We share insights for constructing red teaming prompts, and present our benchmark for iterative model assessments.

PubMed Disclaimer

Conflict of interest statement

Competing interests: RD has served as an advisor to MDAlgorithms and Revea and received. consulting fees from Pfizer, L’Oreal, Frazier Healthcare Partners, and DWA, and research funding from UCB and declares no non-financial competing interests. All other authors declare no financial or non-financial competing interests.

Figures

Fig. 1
Fig. 1. Key steps and considerations when organizing a red teaming workshop for large language models (LLMs) in medicine.
1Hyperparameters are settings that can be changed by the user (usually a machine learning engineer) to vary model output. These can include temperature, which varies the randomness of outputs, and max output tokens (length of response). 2An application programming interface (API) is a software interface that allows information to pass between two software applications. In the context of large language models (LLMs) and prompting, submitting prompts (user queries) through an API refers to writing code to submit prompts rather than submitting through a user interface. API submission can be preferred when batch submission of prompts is desired, or when it is desired to change settings (hyperparameters) that influence LLM responses. 3A field of study that focuses on varying the format of inputs to a language model in order to produce optimal outputs.

References

    1. Clusmann, J. et al. The future landscape of large language models in medicine. Commun. Med.3, 141 (2023). - PMC - PubMed
    1. Chen, S. et al. The effect of using a large language model to respond to patient messages. Lancet Digit. Health6, e379–e381 (2024). - PMC - PubMed
    1. Omiye, J. A., Lester, J. C., Spichak, S., Rotemberg, V. & Daneshjou, R. Large language models propagate race-based medicine. NPJ Digit. Med.6, 195 (2023). - PMC - PubMed
    1. Thirunavukarasu, A. J. et al. Large language models in medicine. Nat. Med.29, 1930–1940 (2023). - PubMed
    1. Gui, H. et al. Dermatologists’ Perspectives and Usage of Large Language Models in Practice: An Exploratory Survey. J. Invest. Dermatol.144, 2298–2301 (2024). - PubMed

LinkOut - more resources