2023 Nov;154(11):970-974.
doi: 10.1016/j.adaj.2023.07.016. Epub 2023 Sep 5.

The performance of artificial intelligence language models in board-style dental knowledge assessment: A preliminary study on ChatGPT

Arman Danesh et al. J Am Dent Assoc. 2023 Nov.

Abstract

Background: Although Chat Generative Pre-trained Transformer (ChatGPT) (OpenAI) may be an appealing educational resource for students, the chatbot's responses can be subject to misinformation. This study was designed to evaluate the performance of ChatGPT on a board-style multiple-choice dental knowledge assessment to gauge its capacity to output accurate dental content and, in turn, the risk of misinformation associated with its use as an educational resource by dental students.

Methods: ChatGPT3.5 and ChatGPT4 were asked questions obtained from 3 different sources: INBDE Bootcamp, ITDOnline, and a list of board-style questions provided by the Joint Commission on National Dental Examinations. Image-based questions were excluded, as ChatGPT only takes text-based inputs. The mean performance across 3 trials was reported for each model.

Results: ChatGPT3.5 and ChatGPT4 answered 61.3% and 76.9% of the questions correctly on average, respectively. A 2-tailed t test was used to compare 2 independent sample means, and a 2-tailed χ2 test was used to compare 2 sample proportions. A P value less than .05 was considered to be statistically significant.

Conclusion: ChatGPT3.5 did not perform sufficiently well on the board-style knowledge assessment. ChatGPT4, however, displayed a competent ability to output accurate dental content. Future research should evaluate the proficiency of emerging ChatGPT models in dentistry to assess their evolving role in dental education.

Practical implications: Although ChatGPT showed an impressive ability to output accurate dental content, our findings suggest that dental students should use ChatGPT to supplement their existing learning program rather than as their primary learning resource.

Keywords: Artificial intelligence; ChatGPT; Integrated National Board Dental Examination; dental board examination; dental education; dentistry.
