The Advanced Reasoning Capabilities of Large Language Models for Detecting Contraindicated Options in Medical Exams
- PMID: 40354629
- PMCID: PMC12088613
- DOI: 10.2196/68527
Abstract
Enhancing clinical reasoning and reducing diagnostic errors are essential in medical practice. OpenAI-o1, a model with advanced reasoning capabilities, outperformed GPT-4 on 15 Japanese National Medical Licensing Examination questions (accuracy: 100% vs 80%; contraindicated option detection: 87% vs 73%). These findings are preliminary because of the small sample size.
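The reported percentages can be related back to the 15-question sample with a short calculation. The sketch below is illustrative only and rests on assumptions that are not in the abstract: it assumes both metrics use all 15 questions as the denominator, it infers the underlying counts (15, 12, 13, and 11) from the rounded percentages, and it uses the Wilson score interval as one common way to express uncertainty in a small binomial sample. The names `wilson_interval` and `inferred_counts` are hypothetical, and none of this is presented as the authors' method.

```python
import math

N_QUESTIONS = 15  # sample size stated in the abstract
Z_95 = 1.96       # normal quantile for an approximate 95% interval


def wilson_interval(successes, n, z=Z_95):
    """Return the Wilson score interval (low, high) for a binomial proportion."""
    p = successes / n
    denom = 1 + z * z / n
    centre = (p + z * z / (2 * n)) / denom
    half = z * math.sqrt(p * (1 - p) / n + z * z / (4 * n * n)) / denom
    return centre - half, centre + half


# ASSUMPTION: counts inferred from the rounded percentages with a
# denominator of 15; the abstract does not report raw counts.
inferred_counts = {
    ("OpenAI-o1", "accuracy"): 15,
    ("GPT-4", "accuracy"): 12,
    ("OpenAI-o1", "contraindicated option detection"): 13,
    ("GPT-4", "contraindicated option detection"): 11,
}

for (model, metric), k in inferred_counts.items():
    low, high = wilson_interval(k, N_QUESTIONS)
    print(
        f"{model:10s} {metric}: {k}/{N_QUESTIONS} = {k / N_QUESTIONS:.0%} "
        f"(approx. 95% interval {low:.0%} to {high:.0%})"
    )
```

Under these assumptions the intervals for the two models overlap on both metrics, which is consistent with the abstract's caution that the findings are preliminary.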
Keywords: artificial intelligence; clinical reasoning; large language model; medical errors; natural language processing.
© Yuichiro Yano, Mizuki Ohashi, Taiju Miyagami, Hirotake Mori, Yuji Nishizaki, Hiroyuki Daida, Toshio Naito. Originally published in JMIR Medical Informatics (https://medinform.jmir.org).