Prompt injection attacks on vision language models in oncology
- PMID: 39890777
- PMCID: PMC11785991
- DOI: 10.1038/s41467-024-55631-x
Prompt injection attacks on vision language models in oncology
Abstract
Vision-language artificial intelligence models (VLMs) possess medical knowledge and can be employed in healthcare in numerous ways, including as image interpreters, virtual scribes, and general decision support systems. However, here, we demonstrate that current VLMs applied to medical tasks exhibit a fundamental security flaw: they can be compromised by prompt injection attacks. These can be used to output harmful information just by interacting with the VLM, without any access to its parameters. We perform a quantitative study to evaluate the vulnerabilities to these attacks in four state of the art VLMs: Claude-3 Opus, Claude-3.5 Sonnet, Reka Core, and GPT-4o. Using a set of N = 594 attacks, we show that all of these models are susceptible. Specifically, we show that embedding sub-visual prompts in manifold medical imaging data can cause the model to provide harmful output, and that these prompts are non-obvious to human observers. Thus, our study demonstrates a key vulnerability in medical VLMs which should be mitigated before widespread clinical adoption.
© 2025. The Author(s).
Conflict of interest statement
Competing interests: The authors declare the following competing interests: DT received honoraria for lectures by Bayer and holds shares in StratifAI GmbH, Germany. SF has received honoraria from MSD and BMS. TJB is the owner of Smart Health Heidelberg GmbH, Heidelberg, Germany, outside of the scope of the submitted work. JNK declares consulting services for Bioptimus, France; Owkin, France; DoMore Diagnostics, Norway; Panakeia, UK; AstraZeneca, UK; Mindpeak, Germany; and MultiplexDx, Slovakia. Furthermore, he holds shares in StratifAI GmbH, Germany, Synagen GmbH, Germany, and has received a research grant by GSK, and has received honoraria by AstraZeneca, Bayer, Daiichi Sankyo, Eisai, Janssen, Merck, MSD, BMS, Roche, Pfizer, and Fresenius. ICW has received honoraria from AstraZeneca. DF holds shares in Synagen GmbH, Germany. No other competing interests are declared by any of the remaining authors.
Figures




Similar articles
-
Diagnostic accuracy of vision-language models on Japanese diagnostic radiology, nuclear medicine, and interventional radiology specialty board examinations.Jpn J Radiol. 2024 Dec;42(12):1392-1398. doi: 10.1007/s11604-024-01633-0. Epub 2024 Jul 20. Jpn J Radiol. 2024. PMID: 39031270 Free PMC article.
-
Diagnostic Performance of GPT-4o and Claude 3 Opus in Determining Causes of Death From Medical Histories and Postmortem CT Findings.Cureus. 2024 Aug 20;16(8):e67306. doi: 10.7759/cureus.67306. eCollection 2024 Aug. Cureus. 2024. PMID: 39301343 Free PMC article.
-
Diagnostic performances of Claude 3 Opus and Claude 3.5 Sonnet from patient history and key images in Radiology's "Diagnosis Please" cases.Jpn J Radiol. 2024 Dec;42(12):1399-1402. doi: 10.1007/s11604-024-01634-z. Epub 2024 Aug 3. Jpn J Radiol. 2024. PMID: 39096483 Free PMC article.
-
Visual-language foundation models in medical imaging: A systematic review and meta-analysis of diagnostic and analytical applications.Comput Methods Programs Biomed. 2025 Aug;268:108870. doi: 10.1016/j.cmpb.2025.108870. Epub 2025 May 21. Comput Methods Programs Biomed. 2025. PMID: 40424873 Review.
-
Integrating language into medical visual recognition and reasoning: A survey.Med Image Anal. 2025 May;102:103514. doi: 10.1016/j.media.2025.103514. Epub 2025 Feb 27. Med Image Anal. 2025. PMID: 40023891 Review.
Cited by
-
Computer-aided tumor cell fraction (TCF) estimation by medical students, residents, and pathologists improves inter-observer agreement while highlighting the risk of automation bias.Virchows Arch. 2025 Jul 4. doi: 10.1007/s00428-025-04163-w. Online ahead of print. Virchows Arch. 2025. PMID: 40610733
-
Prompt injection attacks on vision-language models for surgical decision support.medRxiv [Preprint]. 2025 Jul 23:2025.07.16.25331645. doi: 10.1101/2025.07.16.25331645. medRxiv. 2025. PMID: 40778151 Free PMC article. Preprint.
-
Large language models for clinical decision support in gastroenterology and hepatology.Nat Rev Gastroenterol Hepatol. 2025 Aug 22. doi: 10.1038/s41575-025-01108-1. Online ahead of print. Nat Rev Gastroenterol Hepatol. 2025. PMID: 40846793 Review.
-
Hallmarks of artificial intelligence contributions to precision oncology.Nat Cancer. 2025 Mar;6(3):417-431. doi: 10.1038/s43018-025-00917-2. Epub 2025 Mar 7. Nat Cancer. 2025. PMID: 40055572 Review.
-
[Applications, challenges and a trustworthy use of artificial intelligence in public health].Bundesgesundheitsblatt Gesundheitsforschung Gesundheitsschutz. 2025 Aug;68(8):880-888. doi: 10.1007/s00103-025-04098-2. Epub 2025 Jul 2. Bundesgesundheitsblatt Gesundheitsforschung Gesundheitsschutz. 2025. PMID: 40600999 Free PMC article. Review. German.
References
-
- Bubeck, S. et al. Sparks of artificial general intelligence: early experiments with GPT-4. arXiv [cs.CL] (2023).
-
- Ferber, D. et al. Autonomous artificial intelligence agents for clinical decision making in oncology. arXiv [cs.AI] (2024).
-
- Thirunavukarasu, A. J. et al. Large language models in medicine. Nat. Med.29, 1930–1940 (2023). - PubMed
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources