. 2025 Apr 16:11:e63602.

doi: 10.2196/63602.

Leveraging Datathons to Teach AI in Undergraduate Medical Education: Case Study

Michael Steven Yao^{1

2

3}, Lawrence Huang^#^{3

4}, Emily Leventhal^#^{3

5}, Clara Sun^{3

6}, Steve J Stephen^{3

7

8}, Lathan Liou^{3

5}

Affiliations

¹ Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, United States.
² Department of Bioengineering, University of Pennsylvania, Philadelphia, PA, United States.
³ MDplus, New York, NY, United States.
⁴ Warren Alpert Medical School, Brown University, Providence, RI, United States.
⁵ Icahn School of Medicine at Mount Sinai, New York, NY, United States.
⁶ School of Medicine, Case Western Reserve University, Cleveland, OH, United States.
⁷ School of Medicine and Dentistry, University of Rochester, Rochester, NY, United States.
⁸ Simon Business School, University of Rochester, Rochester, NY, United States.

^# Contributed equally.

PMID: 40239213
PMCID: PMC12017604
DOI: 10.2196/63602

Leveraging Datathons to Teach AI in Undergraduate Medical Education: Case Study

Michael Steven Yao et al. JMIR Med Educ. 2025.

. 2025 Apr 16:11:e63602.

doi: 10.2196/63602.

Authors

Michael Steven Yao^{1

2

3}, Lawrence Huang^#^{3

4}, Emily Leventhal^#^{3

5}, Clara Sun^{3

6}, Steve J Stephen^{3

7

8}, Lathan Liou^{3

5}

Affiliations

¹ Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, United States.
² Department of Bioengineering, University of Pennsylvania, Philadelphia, PA, United States.
³ MDplus, New York, NY, United States.
⁴ Warren Alpert Medical School, Brown University, Providence, RI, United States.
⁵ Icahn School of Medicine at Mount Sinai, New York, NY, United States.
⁶ School of Medicine, Case Western Reserve University, Cleveland, OH, United States.
⁷ School of Medicine and Dentistry, University of Rochester, Rochester, NY, United States.
⁸ Simon Business School, University of Rochester, Rochester, NY, United States.

^# Contributed equally.

PMID: 40239213
PMCID: PMC12017604
DOI: 10.2196/63602

Abstract

Background: As artificial intelligence and machine learning become increasingly influential in clinical practice, it is critical for future physicians to understand how such novel technologies will impact the delivery of patient care.

Objective: We describe 2 trainee-led, multi-institutional datathons as an effective means of teaching key data science and machine learning skills to medical trainees. We offer key insights on the practical implementation of such datathons and analyze experiences gained and lessons learned for future datathon initiatives.

Methods: We detail 2 recent datathons organized by MDplus, a national trainee-led nonprofit organization. To assess the efficacy of the datathon as an educational experience, an opt-in postdatathon survey was sent to all registered participants. Survey responses were deidentified and anonymized before downstream analysis to assess the quality of datathon experiences and areas for future work.

Results: Our digital datathons between 2023 and 2024 were attended by approximately 200 medical trainees across the United States. A diverse array of medical specialty interests was represented among participants, with 43% (21/49) of survey participants expressing an interest in internal medicine, 35% (17/49) in surgery, and 22% (11/49) in radiology. Participant skills in leveraging Python for analyzing medical datasets improved after the datathon, and survey respondents enjoyed participating in the datathon.

Conclusions: The datathon proved to be an effective and cost-effective means of providing medical trainees the opportunity to collaborate on data-driven projects in health care. Participants agreed that datathons improved their ability to generate clinically meaningful insights from data. Our results suggest that datathons can serve as valuable and effective educational experiences for medical trainees to become better skilled in leveraging data science and artificial intelligence for patient care.

Keywords: artificial intelligence; data science education; datathon; machine learning; undergraduate medical education.

PubMed Disclaimer

Conflict of interest statement

Conflicts of Interest: None declared.

Figures

**Figure 1.. Overview of the datathon event. The MDplus datathon ran for approximately 4 weeks and was loosely divided into two parts: (1) Team formation and project ideation and (2) project execution.**

Figure 2.. Bar plot visualizing participant self-assessment of technical skills before and after participating in the datathon for all 61 survey responses. Python was the only skill out of the 4 above that was an educational component in both the 2023 VBC and 2024 generative AI datathons. Participant scores correspond to the following: (1) no familiarity; (2) a little familiarity; (3) some familiarity; (4) a lot of familiarity. * Indicates a statistically significant difference in the distribution of scores before and after participating in the datathon (Python: P=.041; pairwise Fisher exact test). n.s. indicates no statistically significant difference in the distribution of scores. (R: P=.83; GitHub/Gitlab: P=.92; Microsoft Excel: P=1.00; pairwise Fisher exact test).

Figure 3.. Bar plot visualizing survey results assessing for subjective datathon quality. Participant scores correspond to the following: (1) Strongly disagree; (2) Disagree; (3) Neither agree nor disagree; (4) Agree; and (5) Strongly agree.

See this image and copyright information in PMC

References

1. Hege I, Kononowicz AA, Adler M. A clinical reasoning tool for virtual patients: design-based research study. JMIR Med Educ. 2017 Nov 2;3(2):e21. doi: 10.2196/mededu.8100. doi. Medline. - DOI - PMC - PubMed
1. Pongdee T, Larson NB, Divekar R, Bielinski SJ, Liu H, Moon S. Automated identification of aspirin-exacerbated respiratory disease using natural language processing and machine learning: algorithm development and evaluation study. JMIR AI. 2023 Jun 12;2:e44191. doi: 10.2196/44191. doi. Medline. - DOI - PMC - PubMed
1. Chae A, Yao MS, Sagreiya H, et al. Strategies for implementing machine learning algorithms in the clinical practice of radiology. Radiology. 2024 Jan;310(1):e223170. doi: 10.1148/radiol.223170. doi. Medline. - DOI - PMC - PubMed
1. Haug CJ, Drazen JM. Artificial intelligence and machine learning in clinical medicine, 2023. N Engl J Med. 2023 Mar 30;388(13):1201–1208. doi: 10.1056/NEJMra2302038. doi. Medline. - DOI - PubMed
1. Kendale S, Bishara A, Burns M, Solomon S, Corriere M, Mathis M. Machine learning for the prediction of procedural case durations developed using a large multicenter database: algorithm development and validation study. JMIR AI. 2023 Sep 8;2:e44909. doi: 10.2196/44909. doi. Medline. - DOI - PMC - PubMed

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions

Grants and funding

LinkOut - more resources

Full Text Sources
- JMIR Publications
- PubMed Central
Research Materials
- NCI CPTC Antibody Characterization Program

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Leveraging Datathons to Teach AI in Undergraduate Medical Education: Case Study

Affiliations

Leveraging Datathons to Teach AI in Undergraduate Medical Education: Case Study

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

References

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources

Research Materials