Toward Clinical-Grade Evaluation of Large Language Models
- PMID: 38401979
- PMCID: PMC11221761
- DOI: 10.1016/j.ijrobp.2023.11.012
References
- OpenAI. Available at: https://platform.openai.com. Accessed November 1, 2023.
- Singhal K, Tu T, Gottweis J, et al. Towards expert-level medical question answering with large language models. Available at: https://arxiv.org/abs/2305.09617. Accessed December 14, 2023.
- Nori H, King N, McKinney SM, Carignan D, Horvitz E. Capabilities of GPT-4 on medical challenge problems. Available at: https://arxiv.org/abs/2303.13375. Accessed December 14, 2023.