A systematic review of large language models and their implications in medical education

Harrison C Lucas¹, Jeffrey S Upperman², Jamie R Robinson^{2

3}

Affiliations

¹ Brandeis University, Waltham, Massachusetts, USA.
² Department of Pediatric Surgery, Vanderbilt University Medical Center, Nashville, Tennessee, USA.
³ Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, Tennessee, USA.

PMID: 38639098
DOI: 10.1111/medu.15402

A systematic review of large language models and their implications in medical education

Harrison C Lucas et al. Med Educ. 2024 Nov.

. 2024 Nov;58(11):1276-1285.

doi: 10.1111/medu.15402. Epub 2024 Apr 19.

Authors

Harrison C Lucas¹, Jeffrey S Upperman², Jamie R Robinson^{2

3}

Affiliations

¹ Brandeis University, Waltham, Massachusetts, USA.
² Department of Pediatric Surgery, Vanderbilt University Medical Center, Nashville, Tennessee, USA.
³ Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, Tennessee, USA.

PMID: 38639098
DOI: 10.1111/medu.15402

Abstract

Introduction: In the past year, the use of large language models (LLMs) has generated significant interest and excitement because of their potential to revolutionise various fields, including medical education for aspiring physicians. Although medical students undergo a demanding educational process to become competent health care professionals, the emergence of LLMs presents a promising solution to challenges like information overload, time constraints and pressure on clinical educators. However, integrating LLMs into medical education raises critical concerns and challenges for educators, professionals and students. This systematic review aims to explore LLM applications in medical education, specifically their impact on medical students' learning experiences.

Methods: A systematic search was performed in PubMed, Web of Science and Embase for articles discussing the applications of LLMs in medical education using selected keywords related to LLMs and medical education, from the time of ChatGPT's debut until February 2024. Only articles available in full text or English were reviewed. The credibility of each study was critically appraised by two independent reviewers.

Results: The systematic review identified 166 studies, of which 40 were found by review to be relevant to the study. Among the 40 relevant studies, key themes included LLM capabilities, benefits such as personalised learning and challenges regarding content accuracy. Importantly, 42.5% of these studies specifically evaluated LLMs in a novel way, including ChatGPT, in contexts such as medical exams and clinical/biomedical information, highlighting their potential in replicating human-level performance in medical knowledge. The remaining studies broadly discussed the prospective role of LLMs in medical education, reflecting a keen interest in their future potential despite current constraints.

Conclusions: The responsible implementation of LLMs in medical education offers a promising opportunity to enhance learning experiences. However, ensuring information accuracy, emphasising skill-building and maintaining ethical safeguards are crucial. Continuous critical evaluation and interdisciplinary collaboration are essential for the appropriate integration of LLMs in medical education.

PubMed Disclaimer

Comment in

Addressing digital inequities in the age of large language models (LLMs).
Ng O, Han SP. Ng O, et al. Med Educ. 2024 Dec;58(12):1545-1546. doi: 10.1111/medu.15446. Epub 2024 May 27. Med Educ. 2024. PMID: 38801196 No abstract available.
Why we should stop writing commentaries about AI.
Johnsson V, Tolsgaard M. Johnsson V, et al. Med Educ. 2024 Nov;58(11):1262-1263. doi: 10.1111/medu.15474. Epub 2024 Jul 11. Med Educ. 2024. PMID: 38989816 No abstract available.
Dispelling the magic of artificial intelligence in medical education.
McQuade CN, Wijesekera TP, Chartash D. McQuade CN, et al. Med Educ. 2025 Mar;59(3):350-351. doi: 10.1111/medu.15536. Epub 2024 Sep 20. Med Educ. 2025. PMID: 39301702 No abstract available.

References

REFERENCES

1. GPT‐4 [Internet]. [cited 2024 Mar 19]. Available from: https://openai.com/research/gpt-4
1. Gemini ‐ chat to supercharge your ideas [Internet]. Gemini [cited 2024 Mar 19]. Available from: https://gemini.google.com
1. Buja LM. Medical education today: all that glitters is not gold. BMC Med Educ. 2019;19(1):110. doi:10.1186/s12909‐019‐1535‐9
1. Kung TH, Cheatham M, Medenilla A, et al. Performance of ChatGPT on USMLE: potential for AI‐assisted medical education using large language models. PLOS Digit Health. 2023;2(2):e0000198. doi:10.1371/journal.pdig.0000198
1. Antaki F, Touma S, Milad D, El‐Khoury J, Duval R. Evaluating the performance of ChatGPT in ophthalmology: an analysis of its successes and shortcomings. Ophthalmol Sci. 2023;3(4):100324. doi:10.1016/j.xops.2023.100324

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources
- Ovid Technologies, Inc.
- Wiley
Miscellaneous
- NCI CPTAC Assay Portal

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

A systematic review of large language models and their implications in medical education

Affiliations

A systematic review of large language models and their implications in medical education

Authors

Affiliations

Abstract

Comment in

References

REFERENCES

Publication types

MeSH terms

LinkOut - more resources

Full Text Sources

Miscellaneous