GPT-4 as a Source of Patient Information for Anterior Cervical Discectomy and Fusion: A Comparative Analysis Against Google Web Search
- PMID: 38513636
- PMCID: PMC11529100
- DOI: 10.1177/21925682241241241
Abstract
Study design: Comparative study.
Objectives: This study aims to compare Google and GPT-4 in terms of (1) question types, (2) response readability, (3) source quality, and (4) numerical response accuracy for the top 10 most frequently asked questions (FAQs) about anterior cervical discectomy and fusion (ACDF).
Methods: "Anterior cervical discectomy and fusion" was searched on Google and GPT-4 on December 18, 2023. The top 10 FAQs from each platform were classified according to the Rothwell system. Source quality was evaluated using JAMA benchmark criteria, and readability was assessed using Flesch Reading Ease and Flesch-Kincaid grade level. Differences in JAMA scores, Flesch-Kincaid grade level, Flesch Reading Ease, and word count between platforms were analyzed using Student's t-tests. Statistical significance was set at the .05 level.
Results: Frequently asked questions from Google were varied, while GPT-4 focused on technical details and indications/management. GPT-4 showed a higher Flesch-Kincaid grade level (12.96 vs 9.28, P = .003), lower Flesch Reading Ease score (37.07 vs 54.85, P = .005), and higher JAMA scores for source quality (3.333 vs 1.800, P = .016). Numerically, 6 out of 10 responses varied between platforms, with GPT-4 providing broader recovery timelines for ACDF.
Conclusions: This study demonstrates GPT-4's ability to elevate patient education by providing high-quality, diverse information tailored to those with advanced literacy levels. As AI technology evolves, refining these tools for accuracy and user-friendliness remains crucial, catering to patients' varying literacy levels and information needs in spine surgery.
Keywords: GPT-4; Google; anterior cervical discectomy and fusion; artificial intelligence; health literacy; patient education; readability.
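The readability metrics and the comparison test named in the Methods can be sketched as follows. This is a minimal illustration, not the authors' pipeline: the abstract does not say which tool computed the scores, so the function names, the vowel-group syllable heuristic, and the tokenization rules below are all assumptions. Only the two Flesch formulas and the pooled-variance form of Student's t statistic are standard.

```python
import math
import re
from statistics import mean, variance


def flesch_reading_ease(words: int, sentences: int, syllables: int) -> float:
    """Standard Flesch Reading Ease formula; higher scores mean easier text."""
    return 206.835 - 1.015 * (words / sentences) - 84.6 * (syllables / words)


def flesch_kincaid_grade(words: int, sentences: int, syllables: int) -> float:
    """Standard Flesch-Kincaid formula; the result approximates a US school grade."""
    return 0.39 * (words / sentences) + 11.8 * (syllables / words) - 15.59


def count_syllables(word: str) -> int:
    """Hypothetical heuristic: one syllable per run of vowels, minimum of one."""
    return max(1, len(re.findall(r"[aeiouy]+", word.lower())))


def text_counts(text: str) -> tuple[int, int, int]:
    """Return (words, sentences, syllables) using simple regex tokenization."""
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z']+", text)
    syllables = sum(count_syllables(w) for w in words)
    return len(words), len(sentences), syllables


def pooled_t_statistic(a: list[float], b: list[float]) -> tuple[float, int]:
    """Student's two-sample t statistic (equal variances assumed) and its
    degrees of freedom. The p-value lookup is left out of this sketch."""
    n1, n2 = len(a), len(b)
    pooled_var = ((n1 - 1) * variance(a) + (n2 - 1) * variance(b)) / (n1 + n2 - 2)
    t = (mean(a) - mean(b)) / math.sqrt(pooled_var * (1 / n1 + 1 / n2))
    return t, n1 + n2 - 2
```

In use, each response would be passed through `text_counts`, scored with both formulas, and the two platforms' score lists compared with `pooled_t_statistic`. Dedicated readability tools count syllables more carefully than the heuristic here, so scores from this sketch would not be expected to match the values reported in the Results.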
Conflict of interest statement
Declaration of Conflicting Interests: The author(s) declared the following potential conflicts of interest with respect to the research, authorship, and/or publication of this article: Mitchell K. Ng is a paid consultant at Ferghana Partners. For the remaining authors, none were declared.
Similar articles
- GPT-4 as a Source of Patient Information for Carpal Tunnel Surgery: A Comparative Analysis Against Google Web Search. J Am Acad Orthop Surg. 2025 Mar 25. doi: 10.5435/JAAOS-D-24-00249. Online ahead of print. PMID: 40138304
- ChatGPT as a Source of Patient Information for Lumbar Spinal Fusion and Laminectomy: A Comparative Analysis Against Google Web Search. Clin Spine Surg. 2024 Dec 1;37(10):E394-E403. doi: 10.1097/BSD.0000000000001582. Epub 2024 Feb 20. PMID: 38409676
- Evaluation of Generative Language Models in Personalizing Medical Information: Instrument Validation Study. JMIR AI. 2024 Aug 13;3:e54371. doi: 10.2196/54371. PMID: 39137416 Free PMC article.
- Ankle conFUSION: The quality and readability of information on the internet relating to ankle arthrodesis. Surgeon. 2021 Dec;19(6):e507-e511. doi: 10.1016/j.surge.2020.12.001. Epub 2021 Jan 13. PMID: 33451875 Review.
- Readability assessment of patient educational materials for pediatric spinal conditions from top academic orthopedic institutions. J Child Orthop. 2023 May 16;17(3):284-290. doi: 10.1177/18632521231156435. eCollection 2023 Jun. PMID: 37288046 Free PMC article. Review.
Cited by
- Large language models in patient education: a scoping review of applications in medicine. Front Med (Lausanne). 2024 Oct 29;11:1477898. doi: 10.3389/fmed.2024.1477898. eCollection 2024. PMID: 39534227 Free PMC article.
- Generative AI/LLMs for Plain Language Medical Information for Patients, Caregivers and General Public: Opportunities, Risks and Ethics. Patient Prefer Adherence. 2025 Jul 31;19:2227-2249. doi: 10.2147/PPA.S527922. eCollection 2025. PMID: 40771655 Free PMC article. Review.
- Evaluating the Reliability and Quality of Sarcoidosis-Related Information Provided by AI Chatbots. Healthcare (Basel). 2025 Jun 5;13(11):1344. doi: 10.3390/healthcare13111344. PMID: 40508957 Free PMC article.
- Evaluating the Efficacy of ChatGPT vs. Google Gemini in Generating Patient Education Materials for GLP-1 Receptor Agonists (Semaglutide, Liraglutide, Tirzepatide): A Cross-Sectional Study. Cureus. 2025 Apr 10;17(4):e81993. doi: 10.7759/cureus.81993. eCollection 2025 Apr. PMID: 40351930 Free PMC article.
- Is Information About Musculoskeletal Malignancies From Large Language Models or Web Resources at a Suitable Reading Level for Patients? Clin Orthop Relat Res. 2025 Feb 1;483(2):306-315. doi: 10.1097/CORR.0000000000003263. Epub 2024 Sep 25. PMID: 39330944