Modern artificial intelligence and large language models in graduate medical education: a scoping review of attitudes, applications & practice
- PMID: 40394586
- PMCID: PMC12093616
- DOI: 10.1186/s12909-025-07321-5
Modern artificial intelligence and large language models in graduate medical education: a scoping review of attitudes, applications & practice
Abstract
Background: Artificial intelligence (AI) holds transformative potential for graduate medical education (GME), yet, a comprehensive exploration of AI's applications, perceptions, and limitations in GME is lacking.
Objective: To map the current literature on AI in GME, identifying prevailing perceptions, applications, and research gaps to inform future research, policy discussions, and educational practices through a scoping review.
Methods: Following the Joanna Briggs Institute guidelines and the PRISMA-ScR checklist a comprehensive search of multiple databases up to February 2024 was performed to include studies addressing AI interventions in GME.
Results: Out of 1734 citations, 102 studies met the inclusion criteria, conducted across 16 countries, predominantly from North America (72), Asia (14), and Europe (6). Radiology had the highest number of publications (21), followed by general surgery (11) and emergency medicine (8). The majority of studies were published in 2023. Several key thematic areas emerged from the literature. Initially, perceptions of AI in graduate medical education (GME) were mixed, but have increasingly shifted toward a more favorable outlook, particularly as the benefits of AI integration in education become more apparent. In assessments, AI demonstrated the ability to differentiate between skill levels and offer meaningful feedback. It has also been effective in evaluating narrative comments to assess resident performance. In the domain of recruitment, AI tools have been applied to analyze letters of recommendation, applications, and personal statements, helping identify potential biases and improve equity in candidate selection. Furthermore, large language models consistently outperformed average candidates on board certification and in-training examinations, indicating their potential utility in standardized assessments. Finally, AI tools showed promise in enhancing clinical decision-making by supporting trainees with improved diagnostic accuracy and efficiency.
Conclusions: This scoping review provides a comprehensive overview of applications and limitations of AI in GME but is limited with potential biases, study heterogeneity, and evolving nature of AI.
Keywords: AI integration in medical residency training; AI-driven assessments in medical education; Applications of AI in GME; Artificial intelligence in graduate medical education; Large language models in medical education; Perceptions of AI in medical training.
© 2025. The Author(s).
Conflict of interest statement
Declarations. Ethics approval and consent to participate: Not applicable. Consent for publication: Not applicable. Competing interests: The authors declare no competing interests. Clinical trial number: Not applicable.
Figures
Similar articles
-
Health professionals' experience of teamwork education in acute hospital settings: a systematic review of qualitative literature.JBI Database System Rev Implement Rep. 2016 Apr;14(4):96-137. doi: 10.11124/JBISRIR-2016-1843. JBI Database System Rev Implement Rep. 2016. PMID: 27532314
-
The Current Landscape of Artificial Intelligence in Plastic Surgery Education and Training: A Systematic Review.J Surg Educ. 2025 Aug;82(8):103519. doi: 10.1016/j.jsurg.2025.103519. Epub 2025 May 15. J Surg Educ. 2025. PMID: 40378641
-
Applications of Artificial Intelligence in Psychiatry and Psychology Education: Scoping Review.JMIR Med Educ. 2025 Jul 28;11:e75238. doi: 10.2196/75238. JMIR Med Educ. 2025. PMID: 40720804 Free PMC article.
-
Applications of Large Language Models in the Field of Suicide Prevention: Scoping Review.J Med Internet Res. 2025 Jan 23;27:e63126. doi: 10.2196/63126. J Med Internet Res. 2025. PMID: 39847414 Free PMC article.
-
The educational effects of portfolios on undergraduate student learning: a Best Evidence Medical Education (BEME) systematic review. BEME Guide No. 11.Med Teach. 2009 Apr;31(4):282-98. doi: 10.1080/01421590902889897. Med Teach. 2009. PMID: 19404891
References
-
- LeCun Y, Bengio Y, Hinton G. Deep learning. Nature. 2015;521(7553):436–44. - PubMed
-
- Murphy R. Introduction to AI robotics. Cambridge, MA: The MIT Press; 2018.
-
- Van Der Niet AG, Bleakley A. Where medical education Meets artificial intelligence: ‘does technology care?’. Med Educ. 2021;55(1):30–6. - PubMed
-
- Nagi F, Salih R, Alzubaidi M, Shah H, Alam T, Shah Z, et al. Applications of artificial intelligence (AI) in medical education: A scoping review. Stud Health Technol Inf. 2023;305:648–51. - PubMed
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Miscellaneous