Accuracy of ChatGPT-Generated Information on Head and Neck and Oromaxillofacial Surgery: A Multicenter Collaborative Analysis

Luigi Angelo Vaira^{1

2}, Jerome R Lechien^{3

4}, Vincenzo Abbate⁵, Fabiana Allevi⁶, Giovanni Audino⁵, Giada Anna Beltramini^{7

8}, Michela Bergonzani⁹, Alessandro Bolzoni⁷, Umberto Committeri⁵, Salvatore Crimi¹⁰, Guido Gabriele¹¹, Fabio Lonardi¹², Fabio Maglitto¹³, Marzia Petrocelli¹⁴, Resi Pucci¹⁵, Gianmarco Saponaro¹⁶, Alessandro Tel¹⁷, Valentino Vellone¹⁸, Carlos Miguel Chiesa-Estomba¹⁹, Paolo Boscolo-Rizzo²⁰, Giovanni Salzano⁵, Giacomo De Riu¹

Affiliations

¹ Maxillofacial Surgery Operative Unit, Department of Medicine, Surgery and Pharmacy, University of Sassari, Sassari, Italy.
² Biomedical Sciences Department, PhD School of Biomedical Science, University of Sassari, Sassari, Italy.
³ Department of Anatomy and Experimental Oncology, Mons School of Medicine, UMONS, Research Institute for Health Sciences and Technology, University of Mons (UMons), Mons, Belgium.
⁴ Department of Otolaryngology-Head Neck Surgery, Elsan Polyclinic of Poitiers, Poitiers, France.
⁵ Head and Neck Section, Department of Neurosciences, Reproductive and Odontostomatological Science, Federico II University of Naples, Naples, Italy.
⁶ Maxillofacial Surgery Department, ASSt Santi Paolo e Carlo, University of Milan, Milan, Italy.
⁷ Department of Biomedical, Surgical and Dental Sciences, University of Milan, Milan, Italy.
⁸ Maxillofacial and Dental Unit, Fondazione IRCCS Cà Granda Ospedale Maggiore Policlinico, Milan, Italy.
⁹ Maxillo-Facial Surgery Division, Head and Neck Department, University Hospital of Parma, Parma, Italy.
¹⁰ Operative Unit of Maxillofacial Surgery, Policlinico San Marco, University of Catania, Catania, Italy.
¹¹ Department of Maxillofacial Surgery, University of Siena, Siena, Italy.
¹² Department of Maxillofacial Surgery, University of Verona, Verona, Italy.
¹³ Maxillo-Facial Surgery Unit, University of Bari "Aldo Moro", Bari, Italy.
¹⁴ Maxillofacial Surgery Operative Unit, Bellaria and Maggiore Hospital, Bologna, Italy.
¹⁵ Maxillofacial Surgery Unit, San Camillo-Forlanini Hospital, Rome, Italy.
¹⁶ Maxillo-Facial Surgery Unit, IRCSS "A. Gemelli" Foundation-Catholic, University of the Sacred Heart, Rome, Italy.
¹⁷ Department of Head and Neck Surgery and Neuroscience, Clinic of Maxillofacial Surgery, University Hospital of Udine, Udine, Italy.
¹⁸ Maxillofacial Surgery Unit, "S. Maria" Hospital, Terni, Italy.
¹⁹ Department of Otorhinolaryngology-Head and Neck Surgery, Hospital Universitario Donostia, San Sebastian, Spain.
²⁰ Department of Medical, Surgical and Health Sciences, Section of Otolaryngology, University of Trieste, Trieste, Italy.

PMID: 37595113
DOI: 10.1002/ohn.489

Free article

Observational Study

Accuracy of ChatGPT-Generated Information on Head and Neck and Oromaxillofacial Surgery: A Multicenter Collaborative Analysis

Luigi Angelo Vaira et al. Otolaryngol Head Neck Surg. 2024 Jun.

Free article

. 2024 Jun;170(6):1492-1503.

doi: 10.1002/ohn.489. Epub 2023 Aug 18.

Authors

Affiliations

¹ Maxillofacial Surgery Operative Unit, Department of Medicine, Surgery and Pharmacy, University of Sassari, Sassari, Italy.
² Biomedical Sciences Department, PhD School of Biomedical Science, University of Sassari, Sassari, Italy.
³ Department of Anatomy and Experimental Oncology, Mons School of Medicine, UMONS, Research Institute for Health Sciences and Technology, University of Mons (UMons), Mons, Belgium.
⁴ Department of Otolaryngology-Head Neck Surgery, Elsan Polyclinic of Poitiers, Poitiers, France.
⁵ Head and Neck Section, Department of Neurosciences, Reproductive and Odontostomatological Science, Federico II University of Naples, Naples, Italy.
⁶ Maxillofacial Surgery Department, ASSt Santi Paolo e Carlo, University of Milan, Milan, Italy.
⁷ Department of Biomedical, Surgical and Dental Sciences, University of Milan, Milan, Italy.
⁸ Maxillofacial and Dental Unit, Fondazione IRCCS Cà Granda Ospedale Maggiore Policlinico, Milan, Italy.
⁹ Maxillo-Facial Surgery Division, Head and Neck Department, University Hospital of Parma, Parma, Italy.
¹⁰ Operative Unit of Maxillofacial Surgery, Policlinico San Marco, University of Catania, Catania, Italy.
¹¹ Department of Maxillofacial Surgery, University of Siena, Siena, Italy.
¹² Department of Maxillofacial Surgery, University of Verona, Verona, Italy.
¹³ Maxillo-Facial Surgery Unit, University of Bari "Aldo Moro", Bari, Italy.
¹⁴ Maxillofacial Surgery Operative Unit, Bellaria and Maggiore Hospital, Bologna, Italy.
¹⁵ Maxillofacial Surgery Unit, San Camillo-Forlanini Hospital, Rome, Italy.
¹⁶ Maxillo-Facial Surgery Unit, IRCSS "A. Gemelli" Foundation-Catholic, University of the Sacred Heart, Rome, Italy.
¹⁷ Department of Head and Neck Surgery and Neuroscience, Clinic of Maxillofacial Surgery, University Hospital of Udine, Udine, Italy.
¹⁸ Maxillofacial Surgery Unit, "S. Maria" Hospital, Terni, Italy.
¹⁹ Department of Otorhinolaryngology-Head and Neck Surgery, Hospital Universitario Donostia, San Sebastian, Spain.
²⁰ Department of Medical, Surgical and Health Sciences, Section of Otolaryngology, University of Trieste, Trieste, Italy.

PMID: 37595113
DOI: 10.1002/ohn.489

Abstract

Objective: To investigate the accuracy of Chat-Based Generative Pre-trained Transformer (ChatGPT) in answering questions and solving clinical scenarios of head and neck surgery.

Study design: Observational and valuative study.

Setting: Eighteen surgeons from 14 Italian head and neck surgery units.

Methods: A total of 144 clinical questions encompassing different subspecialities of head and neck surgery and 15 comprehensive clinical scenarios were developed. Questions and scenarios were inputted into ChatGPT4, and the resulting answers were evaluated by the researchers using accuracy (range 1-6), completeness (range 1-3), and references' quality Likert scales.

Results: The overall median score of open-ended questions was 6 (interquartile range[IQR]: 5-6) for accuracy and 3 (IQR: 2-3) for completeness. Overall, the reviewers rated the answer as entirely or nearly entirely correct in 87.2% of cases and as comprehensive and covering all aspects of the question in 73% of cases. The artificial intelligence (AI) model achieved a correct response in 84.7% of the closed-ended questions (11 wrong answers). As for the clinical scenarios, ChatGPT provided a fully or nearly fully correct diagnosis in 81.7% of cases. The proposed diagnostic or therapeutic procedure was judged to be complete in 56.7% of cases. The overall quality of the bibliographic references was poor, and sources were nonexistent in 46.4% of the cases.

Conclusion: The results generally demonstrate a good level of accuracy in the AI's answers. The AI's ability to resolve complex clinical scenarios is promising, but it still falls short of being considered a reliable support for the decision-making process of specialists in head-neck surgery.

Keywords: ChatGPT; artificial intelligence; maxillofacial surgery; otorhinolaryngology.

PubMed Disclaimer

References

1. OpenAI. ChatGPT. 2023. Accessed March 28, 2023. https://openai.com/blog/chatgpt
1. Exploding Topics. Number of ChatGPT users 2023. 2023. Accessed March 30, 2023. https://explodingtopics.com/blog/chatgpt-users
1. Barat M, Soyer P, Dohan A. Appropriateness of recommendations provided by ChatGPT to interventional radiologists. Can Assoc Radiol J. Published online April 13, 2023. doi:10.1177/08465371231170133
1. Cheng K, Sun Z, He Y, Gu S, Wu H. The potential impact of ChatGPT/GPT‐4 on surgery: will it topple the profession of surgeons? Int J Surg. 2023;109:1545‐1547. doi:10.1097/JS9.0000000000000388
1. Strong E, DiGiammarino A, Weng Y, et al. Performance of ChatGPT on free‐response, clinical reasoning exams. medRxiv. Published online March 29, 2023. doi:10.1101/2023.03.24.23287731

Publication types

Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources
Research Materials
- NCI CPTC Antibody Characterization Program

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Accuracy of ChatGPT-Generated Information on Head and Neck and Oromaxillofacial Surgery: A Multicenter Collaborative Analysis

Affiliations

Accuracy of ChatGPT-Generated Information on Head and Neck and Oromaxillofacial Surgery: A Multicenter Collaborative Analysis

Authors

Affiliations

Abstract

References

Publication types

MeSH terms

LinkOut - more resources

Full Text Sources

Research Materials