ChatGPT and Other Large Language Models in Medical Education - Scoping Literature Review

Alexandra Aster^#¹, Matthias Carl Laupichler^#¹, Tamina Rockwell-Kollmann¹, Gilda Masala¹, Ebru Bala¹, Tobias Raupach¹

Affiliations

PMID: 40144083
PMCID: PMC11933646
DOI: 10.1007/s40670-024-02206-6

Review

ChatGPT and Other Large Language Models in Medical Education - Scoping Literature Review

Alexandra Aster et al. Med Sci Educ. 2024.

. 2024 Nov 13;35(1):555-567.

doi: 10.1007/s40670-024-02206-6. eCollection 2025 Feb.

Authors

Alexandra Aster^#¹, Matthias Carl Laupichler^#¹, Tamina Rockwell-Kollmann¹, Gilda Masala¹, Ebru Bala¹, Tobias Raupach¹

Affiliation

¹ Institute of Medical Education, University Hospital Bonn, Venusberg-Campus 1, 53127 Bonn, Germany.

^# Contributed equally.

PMID: 40144083
PMCID: PMC11933646
DOI: 10.1007/s40670-024-02206-6

Abstract

This review aims to provide a summary of all scientific publications on the use of large language models (LLMs) in medical education over the first year of their availability. A scoping literature review was conducted in accordance with the PRISMA recommendations for scoping reviews. Five scientific literature databases were searched using predefined search terms. The search yielded 1509 initial results, of which 145 studies were ultimately included. Most studies assessed LLMs' capabilities in passing medical exams. Some studies discussed advantages, disadvantages, and potential use cases of LLMs. Very few studies conducted empirical research. Many published studies lack methodological rigor. We therefore propose a research agenda to improve the quality of studies on LLM.

Keywords: Artificial intelligence; ChatGPT; Generative AI; Large language model; Medical education.

PubMed Disclaimer

Conflict of interest statement

Competing InterestsThe authors declare no competing interests.

Figures

**Fig. 1**
Flow diagram of the literature review process. The figure, which was adapted from Page et al. [22], was slightly modified, since we relied exclusively on databases and registers for study identification

**Fig. 2**
Map of the corresponding authors’ origin. We found no publications from authors of countries highlighted in gray. A deeper shade of blue indicates more publications on LLMs in medical education; a lighter shade of blue indicates less (but not zero) publications

See this image and copyright information in PMC

References

1. Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, et al. Attention is all you need. In: Guyon I, Von Luxburg U, Bengio S, Wallach H, Fergus R, Vishwanathan S, et al., editors. Advances in Neural Information Processing Systems 30 (NIPS 2017). Long Beach, CA; 2017.
1. Devlin J, Chang MW, Lee K, Toutanova K. BERT: pre-training of deep bidirectional transformers for language understanding. preprint. 2018.
1. Shen Y, Heacock L, Elias J, Hentel KD, Reig B, Shih G, et al. ChatGPT and other large language models are double-edged swords. Radiology. 2023;307(2):e230163. - PubMed
1. OpenAI. https://openai.com/blog/chatgpt/. 2022. ChatGPT: optimizing language models for dialogue.
1. Naveed H, Khan AU, Qiu S, Saqib M, Anwar S, Usman M, et al. A comprehensive overview of large language models. preprint. 2023;

Publication types

Actions

LinkOut - more resources

Full Text Sources
- PubMed Central

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

ChatGPT and Other Large Language Models in Medical Education - Scoping Literature Review

Affiliation

ChatGPT and Other Large Language Models in Medical Education - Scoping Literature Review

Authors

Affiliation

Abstract

Conflict of interest statement

Figures

References

Publication types

LinkOut - more resources

Full Text Sources