ChatGPT-4 accuracy for patient education in laryngopharyngeal reflux

Jerome R Lechien¹ ² ³ ⁴, Thomas L Carroll⁵, Molly N Huston⁶, Matthew R Naunheim⁷ ⁸ ⁹

Affiliations

¹ Research Committee, Young Otolaryngologists of the International Federation of Otorhinolaryngological Societies (IFOS), Paris, France. Jerome.Lechien@umons.ac.be.
² Division of Laryngology and Broncho-Esophagology, Department of Otolaryngology-Head Neck Surgery, EpiCURA Hospital, UMONS Research Institute for Health Sciences and Technology, University of Mons (UMons), Mons, Belgium. Jerome.Lechien@umons.ac.be.
³ Department of Otorhinolaryngology and Head and Neck Surgery, Foch Hospital, School of Medicine, Phonetics and Phonology Laboratory (UMR 7018 CNRS, Université Sorbonne Nouvelle/Paris 3), Paris, France. Jerome.Lechien@umons.ac.be.
⁴ Polyclinique Elsan de Poitiers, Poitiers, France. Jerome.Lechien@umons.ac.be.
⁵ Division of Otolaryngology-Head and Neck Surgery, Brigham and Women's Hospital, Department of Otolaryngology-Head and Neck Surgery, Harvard Medical School, Boston, MA, USA.
⁶ Department of Otolaryngology, Washington University School of Medicine in St. Louis, St. Louis, MO, USA.
⁷ Research Committee, Young Otolaryngologists of the International Federation of Otorhinolaryngological Societies (IFOS), Paris, France.
⁸ Department of Otolaryngology-Head and Neck Surgery, Harvard Medical School, Boston, MA, USA.
⁹ Division of Laryngology, Massachusetts Eye and Ear, Boston, MA, USA.

PMID: 38492008
DOI: 10.1007/s00405-024-08560-w

Jerome R Lechien et al. Eur Arch Otorhinolaryngol. 2024 May;281(5):2547-2552. doi: 10.1007/s00405-024-08560-w. Epub 2024 Mar 16.

Abstract

Introduction: Chat Generative Pre-trained Transformer (ChatGPT) is an artificial intelligence-powered language model chatbot that can assist otolaryngologists in practice and research. This study evaluated the ability of ChatGPT to generate patient-centered information on laryngopharyngeal reflux disease (LPRD).

Methods: Twenty-five questions dedicated to definition, clinical presentation, diagnosis, and treatment of LPRD were developed from the Dubai definition and management of LPRD consensus and recent reviews. Questions about the four aforementioned categories were entered into ChatGPT-4. Four board-certified laryngologists evaluated the accuracy of ChatGPT-4 with a 5-point Likert scale. Interrater reliability was evaluated.

Results: The mean scores (SD) of ChatGPT-4 answers for definition, clinical presentation, additional examination, and treatments were 4.13 (0.52), 4.50 (0.72), 3.75 (0.61), and 4.18 (0.47), respectively. Interrater reliability among the experts was high for sub-scores (ICC = 0.973). ChatGPT-4 performed worst on questions about the most prevalent LPR signs, the most reliable objective tool for the diagnosis (hypopharyngeal-esophageal multichannel intraluminal impedance-pH monitoring [HEMII-pH]), and the criteria for the diagnosis of LPR using HEMII-pH.

Conclusion: ChatGPT-4 may provide adequate information on the definition of LPR, its differences from gastroesophageal reflux disease (GERD), and its clinical presentation. Information provided on extra-laryngeal manifestations and HEMII-pH may need further optimization. Given recent trends showing increasing patient use of internet sources for self-education, the findings of the present study may help draw attention to ChatGPT-4's accuracy on the topic of LPR.

Keywords: Artificial intelligence; ChatGPT; Chatbot; Head neck surgery; Laryngopharyngeal; Otolaryngology; Reference; Reflux.

Cited by

Generative artificial intelligence in otolaryngology-head and neck surgery editorial: be an actor of the future or follower.
Lechien JR. Eur Arch Otorhinolaryngol. 2024 Apr;281(4):2051-2053. doi: 10.1007/s00405-024-08579-z. PMID: 38407611. No abstract available.
Artificial intelligence in otorhinolaryngology: current trends and application areas.
Demir E, Uğurlu BN, Uğurlu GA, Aydoğdu G. Eur Arch Otorhinolaryngol. 2025 May;282(5):2697-2707. doi: 10.1007/s00405-025-09272-5. Epub 2025 Feb 17. PMID: 40019544. Free PMC article. Review.
Accuracy of ChatGPT responses on tracheotomy for patient education.
Khaldi A, Machayekhi S, Salvagno M, Maniaci A, Vaira LA, La Via L, Taccone FS, Lechien JR. Eur Arch Otorhinolaryngol. 2024 Nov;281(11):6167-6172. doi: 10.1007/s00405-024-08859-8. Epub 2024 Oct 2. PMID: 39356355.
Large language models improve clinical decision making of medical students through patient simulation and structured feedback: a randomized controlled trial.
Brügge E, Ricchizzi S, Arenbeck M, Keller MN, Schur L, Stummer W, Holling M, Lu MH, Darici D. BMC Med Educ. 2024 Nov 28;24(1):1391. doi: 10.1186/s12909-024-06399-7. PMID: 39609823. Free PMC article. Clinical Trial.
[Laryngopharyngeal reflux disease-update 2025: Revision taking into consideration the European CEORL-HNS guideline].
Böttcher A, Schmitz L, Trache MC, Stortz U, Clausen JF, Betz CS. HNO. 2025 Aug;73(8):589-602. doi: 10.1007/s00106-025-01645-w. Epub 2025 Jul 9. PMID: 40643662. German.

References

1. Briganti G (2023) How ChatGPT works: a mini review. Eur Arch Otorhinolaryngol. https://doi.org/10.1007/s00405-023-08337-7
2. Vaira LA, Lechien JR, Abbate V, Allevi F, Audino G, Beltramini GA et al (2023) Accuracy of ChatGPT-generated information on head and neck and oromaxillofacial surgery: a multicenter collaborative analysis. Otolaryngol Head Neck Surg. https://doi.org/10.1002/ohn.489
3. Davis RJ, Ayo-Ajibola O, Lin ME, Swanson MS, Chambers TN, Kwon DI, Kokot NC (2023) Evaluation of oropharyngeal cancer information from revolutionary artificial intelligence Chatbot. Laryngoscope. https://doi.org/10.1002/lary.31191
4. Lechien JR, Vaezi MF, Chan WW, Allen JE, Karkos PD, Saussez S et al (2023) The Dubai definition and diagnostic criteria of laryngopharyngeal reflux: the IFOS consensus. Laryngoscope. https://doi.org/10.1002/lary.31134
5. Lechien JR, Akst LM, Hamdan AL, Schindler A, Karkos PD, Barillari MR, Calvo-Henriquez C, Crevier-Buchman L, Finck C, Eun YG, Saussez S, Vaezi MF (2019) Evaluation and management of laryngopharyngeal reflux disease: state of the art review. Otolaryngol Head Neck Surg 160(5):762–782. https://doi.org/10.1177/0194599819827488
