Performance of a Large Language Model on Practice Questions for the Neonatal Board Examination

Kristyn Beam¹, Puneet Sharma², Bhawesh Kumar³, Cindy Wang⁴, Dara Brodsky¹, Camilia R Martin⁵, Andrew Beam⁶

Affiliations

¹ Department of Neonatology, Beth Israel Deaconess Medical Center, Boston, Massachusetts.
² Division of Newborn Medicine, Boston Children's Hospital, Boston, Massachusetts.
³ Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, Massachusetts.
⁴ Department of Statistics, Harvard University, Cambridge, Massachusetts.
⁵ Division of Neonatology, Weill Cornell Medicine, New York, New York.
⁶ Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, Massachusetts.

PMID: 37459084
PMCID: PMC10352922
DOI: 10.1001/jamapediatrics.2023.2373

Performance of a Large Language Model on Practice Questions for the Neonatal Board Examination

Kristyn Beam et al. JAMA Pediatr. 2023.

. 2023 Sep 1;177(9):977-979.

doi: 10.1001/jamapediatrics.2023.2373.

Authors

Kristyn Beam¹, Puneet Sharma², Bhawesh Kumar³, Cindy Wang⁴, Dara Brodsky¹, Camilia R Martin⁵, Andrew Beam⁶

Affiliations

¹ Department of Neonatology, Beth Israel Deaconess Medical Center, Boston, Massachusetts.
² Division of Newborn Medicine, Boston Children's Hospital, Boston, Massachusetts.
³ Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, Massachusetts.
⁴ Department of Statistics, Harvard University, Cambridge, Massachusetts.
⁵ Division of Neonatology, Weill Cornell Medicine, New York, New York.
⁶ Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, Massachusetts.

PMID: 37459084
PMCID: PMC10352922
DOI: 10.1001/jamapediatrics.2023.2373

No abstract available

Plain language summary

This Diagnostic/Prognostic Study evaluates the performance of a large language model in generating answers to practice questions for the neonatal-perinatal board examination.

PubMed Disclaimer

Conflict of interest statement

Conflict of Interest Disclosures: None reported.

References

1. Schulman J, Zoph B, Kim C, et al. ChatGPT: optimizing language models for dialogue. OpenAI . Published November 30, 2022. Accessed February 28, 2023. https://chat.openai.com
1. Levine DM, Tuwani R, Kompa B, et al. The diagnostic and triage accuracy of the gpt-3 artificial intelligence model. medRxiv. Published online February 1, 2023. doi: 10.1101/2023.01.30.23285067 - DOI - PubMed
1. Kung TH, Cheatham M, Medinilla A, et al. Performance of ChatGPT on USMLE: potential for AI-assisted medical education using large language models. PLOS Digit Health. 2023;2(2):e0000198. doi: 10.1371/journal.pdig.0000198 - DOI - PMC - PubMed
1. Morton S, Ehret D, Ghanta S, Sajti E, Walsh B. In: Brodsky D, Martin CR, eds. Neonatology Review: Q & A. 3rd ed. Lulu; 2015.
1. Singhal K, Azizi S, Tu T, et al. Large language models encode clinical knowledge. arXiv. Published online December 26, 2022. https://arxiv.org/abs/2212.13138

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions

Grants and funding

T32 HD098061/HD/NICHD NIH HHS/United States

LinkOut - more resources

Full Text Sources

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Performance of a Large Language Model on Practice Questions for the Neonatal Board Examination

Affiliations

Performance of a Large Language Model on Practice Questions for the Neonatal Board Examination

Authors

Affiliations

Plain language summary

Conflict of interest statement

References

Publication types

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources