Intelligenza artificiale generativa in medicina del lavoro: quindici large language models a confronto su quesiti a scelta multipla in lingua italiana
- PMID: 41037373
- DOI: 10.1701/4573.45789
Intelligenza artificiale generativa in medicina del lavoro: quindici large language models a confronto su quesiti a scelta multipla in lingua italiana
Abstract
This study offers a comparative evaluation of fifteen generative artificial intelligence models using 397 Italian multiple-choice questions on occupational medicine. Model accuracy ranged from 75.06% to 95.72%. The results highlight the need to assess large language models in specialized fields to support their safe and effective integration into medical education and occupational medicine practice.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
