Performance of a Large Language Model on Practice Questions for the Neonatal Board Examination
- PMID: 37459084
- PMCID: PMC10352922
- DOI: 10.1001/jamapediatrics.2023.2373
Performance of a Large Language Model on Practice Questions for the Neonatal Board Examination
Plain language summary
This Diagnostic/Prognostic Study evaluates the performance of a large language model in generating answers to practice questions for the neonatal-perinatal board examination.
Conflict of interest statement
Similar articles
-
Performance of a Large Language Model on Japanese Emergency Medicine Board Certification Examinations.J Nippon Med Sch. 2024 May 21;91(2):155-161. doi: 10.1272/jnms.JNMS.2024_91-205. Epub 2024 Mar 2. J Nippon Med Sch. 2024. PMID: 38432929
-
Performance of ChatGPT on Solving Orthopedic Board-Style Questions: A Comparative Analysis of ChatGPT 3.5 and ChatGPT 4.Clin Orthop Surg. 2024 Aug;16(4):669-673. doi: 10.4055/cios23179. Epub 2024 Mar 7. Clin Orthop Surg. 2024. PMID: 39092297 Free PMC article.
-
[ChatGPT and the German board examination for ophthalmology: an evaluation].Ophthalmologie. 2024 Jul;121(7):554-564. doi: 10.1007/s00347-024-02046-0. Epub 2024 May 27. Ophthalmologie. 2024. PMID: 38801461 German.
-
Evolution of AOA specialty board certification.J Am Osteopath Assoc. 2015 Apr;115(4):265-7. doi: 10.7556/jaoa.2015.051. J Am Osteopath Assoc. 2015. PMID: 25830585 Review.
-
Development and Results of the First Certification Examination in the American Board of Medical Specialties Neurocritical Care Subspecialty.Neurocrit Care. 2022 Dec;37(3):611-615. doi: 10.1007/s12028-022-01574-4. Epub 2022 Aug 9. Neurocrit Care. 2022. PMID: 35941404 Review.
Cited by
-
Can ChatGPT pass the MRCP (UK) written examinations? Analysis of performance and errors using a clinical decision-reasoning framework.BMJ Open. 2024 Mar 15;14(3):e080558. doi: 10.1136/bmjopen-2023-080558. BMJ Open. 2024. PMID: 38490655 Free PMC article.
-
Assessing the adherence of large language models to clinical practice guidelines in Chinese medicine: a content analysis.Front Pharmacol. 2025 Jul 25;16:1649041. doi: 10.3389/fphar.2025.1649041. eCollection 2025. Front Pharmacol. 2025. PMID: 40786055 Free PMC article.
-
Performance of Publicly Available Large Language Models on Internal Medicine Board-style Questions.PLOS Digit Health. 2024 Sep 17;3(9):e0000604. doi: 10.1371/journal.pdig.0000604. eCollection 2024 Sep. PLOS Digit Health. 2024. PMID: 39288137 Free PMC article.
-
ChatGPT's performance in German OB/GYN exams - paving the way for AI-enhanced medical education and clinical practice.Front Med (Lausanne). 2023 Dec 13;10:1296615. doi: 10.3389/fmed.2023.1296615. eCollection 2023. Front Med (Lausanne). 2023. PMID: 38155661 Free PMC article.
-
Assessment of the clinical knowledge of ChatGPT-4 in neonatal-perinatal medicine: a comparative analysis with ChatGPT-3.5.J Perinatol. 2024 Sep;44(9):1365-1366. doi: 10.1038/s41372-024-01912-8. Epub 2024 Feb 24. J Perinatol. 2024. PMID: 38402349 Free PMC article. No abstract available.
References
-
- Schulman J, Zoph B, Kim C, et al. . ChatGPT: optimizing language models for dialogue. OpenAI . Published November 30, 2022. Accessed February 28, 2023. https://chat.openai.com
-
- Morton S, Ehret D, Ghanta S, Sajti E, Walsh B. In: Brodsky D, Martin CR, eds. Neonatology Review: Q & A. 3rd ed. Lulu; 2015.
-
- Singhal K, Azizi S, Tu T, et al. . Large language models encode clinical knowledge. arXiv. Published online December 26, 2022. https://arxiv.org/abs/2212.13138
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources