Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2024 Nov 13;35(1):555-567.
doi: 10.1007/s40670-024-02206-6. eCollection 2025 Feb.

ChatGPT and Other Large Language Models in Medical Education - Scoping Literature Review

Affiliations
Review

ChatGPT and Other Large Language Models in Medical Education - Scoping Literature Review

Alexandra Aster et al. Med Sci Educ. .

Abstract

This review aims to provide a summary of all scientific publications on the use of large language models (LLMs) in medical education over the first year of their availability. A scoping literature review was conducted in accordance with the PRISMA recommendations for scoping reviews. Five scientific literature databases were searched using predefined search terms. The search yielded 1509 initial results, of which 145 studies were ultimately included. Most studies assessed LLMs' capabilities in passing medical exams. Some studies discussed advantages, disadvantages, and potential use cases of LLMs. Very few studies conducted empirical research. Many published studies lack methodological rigor. We therefore propose a research agenda to improve the quality of studies on LLM.

Keywords: Artificial intelligence; ChatGPT; Generative AI; Large language model; Medical education.

PubMed Disclaimer

Conflict of interest statement

Competing InterestsThe authors declare no competing interests.

Figures

Fig. 1
Fig. 1
Flow diagram of the literature review process. The figure, which was adapted from Page et al. [22], was slightly modified, since we relied exclusively on databases and registers for study identification
Fig. 2
Fig. 2
Map of the corresponding authors’ origin. We found no publications from authors of countries highlighted in gray. A deeper shade of blue indicates more publications on LLMs in medical education; a lighter shade of blue indicates less (but not zero) publications

References

    1. Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, et al. Attention is all you need. In: Guyon I, Von Luxburg U, Bengio S, Wallach H, Fergus R, Vishwanathan S, et al., editors. Advances in Neural Information Processing Systems 30 (NIPS 2017). Long Beach, CA; 2017.
    1. Devlin J, Chang MW, Lee K, Toutanova K. BERT: pre-training of deep bidirectional transformers for language understanding. preprint. 2018.
    1. Shen Y, Heacock L, Elias J, Hentel KD, Reig B, Shih G, et al. ChatGPT and other large language models are double-edged swords. Radiology. 2023;307(2):e230163. - PubMed
    1. OpenAI. https://openai.com/blog/chatgpt/. 2022. ChatGPT: optimizing language models for dialogue.
    1. Naveed H, Khan AU, Qiu S, Saqib M, Anwar S, Usman M, et al. A comprehensive overview of large language models. preprint. 2023;

LinkOut - more resources