A novel evaluation benchmark for medical LLMs illuminating safety and effectiveness in clinical domains.
Wang S, Tang Z, Yang H, Gong Q, Gu T, Ma H, Wang Y, Sun W, Lian Z, Mao K, Jiang Y, Huang Z, Ma L, Shen W, Ji Y, Tan Y, Wang C, Gao Y, Ye Q, Lin R, Chen M, Niu L, Wang Z, Yu P, Lang M, Liu Y, Zhang H, Shen H, Chen L, Zhao Q, Liu SX, Zhou L, Gao H, Ye D, Meng L, Yu Y, Liang N, Wu J.
NPJ Digit Med. 2025 Dec 26. doi: 10.1038/s41746-025-02277-8. Online ahead of print.
PMID: 41454006
Free article.