ChatGPT-4 accuracy for patient education in laryngopharyngeal reflux
- PMID: 38492008
- DOI: 10.1007/s00405-024-08560-w
ChatGPT-4 accuracy for patient education in laryngopharyngeal reflux
Abstract
Introduction: Chatbot Generative Pre-trained Transformer (ChatGPT) is an artificial intelligence-powered language model chatbot able to help otolaryngologists in practice and research. The ability of ChatGPT in generating patient-centered information related to laryngopharyngeal reflux disease (LPRD) was evaluated.
Methods: Twenty-five questions dedicated to definition, clinical presentation, diagnosis, and treatment of LPRD were developed from the Dubai definition and management of LPRD consensus and recent reviews. Questions about the four aforementioned categories were entered into ChatGPT-4. Four board-certified laryngologists evaluated the accuracy of ChatGPT-4 with a 5-point Likert scale. Interrater reliability was evaluated.
Results: The mean scores (SD) of ChatGPT-4 answers for definition, clinical presentation, additional examination, and treatments were 4.13 (0.52), 4.50 (0.72), 3.75 (0.61), and 4.18 (0.47), respectively. Experts reported high interrater reliability for sub-scores (ICC = 0.973). The lowest performances of ChatGPT-4 were on answers about the most prevalent LPR signs, the most reliable objective tool for the diagnosis (hypopharyngeal-esophageal multichannel intraluminal impedance-pH monitoring (HEMII-pH)), and the criteria for the diagnosis of LPR using HEMII-pH.
Conclusion: ChatGPT-4 may provide adequate information on the definition of LPR, differences compared to GERD (gastroesophageal reflux disease), and clinical presentation. Information provided upon extra-laryngeal manifestations and HEMII-pH may need further optimization. Regarding the recent trends identifying increasing patient use of internet sources for self-education, the findings of the present study may help draw attention to ChatGPT-4's accuracy on the topic of LPR.
Keywords: Artificial intelligence; ChatGPT; Chatbot; Head neck surgery; Laryngopharyngeal; Otolaryngology; Reference; Reflux.
© 2024. The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature.
Similar articles
-
Diagnosing Laryngopharyngeal Reflux: A Comparison between 24-hour pH-Impedance Testing and Pharyngeal Probe (Restech) Testing, with Introduction of the Sataloff Score.J Voice. 2023 Sep;37(5):737-747. doi: 10.1016/j.jvoice.2021.04.002. Epub 2021 Jun 4. J Voice. 2023. PMID: 34092465
-
How many cases of laryngopharyngeal reflux suspected by laryngoscopy are gastroesophageal reflux disease-related?World J Gastroenterol. 2012 Aug 28;18(32):4363-70. doi: 10.3748/wjg.v18.i32.4363. World J Gastroenterol. 2012. PMID: 22969200 Free PMC article.
-
Diagnostic Testing for Laryngopharyngeal Reflux Disease: The Role of 24-hour Hypopharyngeal-Esophageal Multichannel Intraluminal Impedance-pH Monitoring.Otolaryngol Clin North Am. 2025 Jun;58(3):441-449. doi: 10.1016/j.otc.2024.12.001. Epub 2025 Jan 7. Otolaryngol Clin North Am. 2025. PMID: 39779436 Review.
-
The Dubai Definition and Diagnostic Criteria of Laryngopharyngeal Reflux: The IFOS Consensus.Laryngoscope. 2024 Apr;134(4):1614-1624. doi: 10.1002/lary.31134. Epub 2023 Nov 6. Laryngoscope. 2024. PMID: 37929860
-
Review of management of laryngopharyngeal reflux disease.Eur Ann Otorhinolaryngol Head Neck Dis. 2021 Sep;138(4):257-267. doi: 10.1016/j.anorl.2020.11.002. Epub 2020 Nov 27. Eur Ann Otorhinolaryngol Head Neck Dis. 2021. PMID: 33257265 Review.
Cited by
-
Generative artificial intelligence in otolaryngology-head and neck surgery editorial: be an actor of the future or follower.Eur Arch Otorhinolaryngol. 2024 Apr;281(4):2051-2053. doi: 10.1007/s00405-024-08579-z. Eur Arch Otorhinolaryngol. 2024. PMID: 38407611 No abstract available.
-
Artificial intelligence in otorhinolaryngology: current trends and application areas.Eur Arch Otorhinolaryngol. 2025 May;282(5):2697-2707. doi: 10.1007/s00405-025-09272-5. Epub 2025 Feb 17. Eur Arch Otorhinolaryngol. 2025. PMID: 40019544 Free PMC article. Review.
-
Accuracy of ChatGPT responses on tracheotomy for patient education.Eur Arch Otorhinolaryngol. 2024 Nov;281(11):6167-6172. doi: 10.1007/s00405-024-08859-8. Epub 2024 Oct 2. Eur Arch Otorhinolaryngol. 2024. PMID: 39356355
-
Large language models improve clinical decision making of medical students through patient simulation and structured feedback: a randomized controlled trial.BMC Med Educ. 2024 Nov 28;24(1):1391. doi: 10.1186/s12909-024-06399-7. BMC Med Educ. 2024. PMID: 39609823 Free PMC article. Clinical Trial.
-
[Laryngopharyngeal reflux disease-update 2025 : Revision taking into consideration the European CEORL-HNS guideline].HNO. 2025 Aug;73(8):589-602. doi: 10.1007/s00106-025-01645-w. Epub 2025 Jul 9. HNO. 2025. PMID: 40643662 German.
References
-
- Briganti G (2023) How ChatGPT works: a mini review. Eur Arch Otorhinolaryngol. https://doi.org/10.1007/s00405-023-08337-7 - DOI - PubMed
-
- Vaira LA, Lechien JR, Abbate V, Allevi F, Audino G, Beltramini GA et al (2023) Accuracy of ChatGPT-generated information on head and neck and oromaxillofacial surgery: a multicenter collaborative analysis. Otolaryngol Head Neck Surg. https://doi.org/10.1002/ohn.489 - DOI - PubMed
-
- Davis RJ, Ayo-Ajibola O, Lin ME, Swanson MS, Chambers TN, Kwon DI, Kokot NC (2023) Evaluation of oropharyngeal cancer information from revolutionary artificial intelligence Chatbot. Laryngoscope. https://doi.org/10.1002/lary.31191 - DOI - PubMed - PMC
-
- Lechien JR, Vaezi MF, Chan WW, Allen JE, Karkos PD, Saussez S et al (2023) The Dubai definition and diagnostic criteria of laryngopharyngeal reflux: the IFOS consensus. Laryngoscope. https://doi.org/10.1002/lary.31134 - DOI - PubMed
-
- Lechien JR, Akst LM, Hamdan AL, Schindler A, Karkos PD, Barillari MR, Calvo-Henriquez C, Crevier-Buchman L, Finck C, Eun YG, Saussez S, Vaezi MF (2019) Evaluation and management of laryngopharyngeal reflux disease: state of the art review. Otolaryngol Head Neck Surg 160(5):762–782. https://doi.org/10.1177/0194599819827488 - DOI - PubMed
MeSH terms
LinkOut - more resources
Full Text Sources