ChatGPT-4 in Neurosurgery: Improving Patient Education Materials
- PMID: 40704789
- DOI: 10.1227/neu.0000000000003606
ChatGPT-4 in Neurosurgery: Improving Patient Education Materials
Abstract
Background and objectives: Adequate understanding of health information has been shown to be a stronger determinant of health than several demographic factors, including age, income, or employment status. However, existing neurosurgical patient education materials (PEMs) may be too complex for the average American and may contribute to poor health literacy. Large language model chatbots may provide a rapid and low-cost means of rewriting existing PEMs at a lower reading level to improve patient understanding and overall health literacy.
Methods: Neurosurgical PEMs pertaining to stroke, laminectomy, pituitary tumors, epilepsy, and hydrocephalus published by the top 100 US hospitals were collected. For all PEMs, common measures of reading level and difficulty were generated, including Flesch Kincaid Grade Level, Flesch Reading Ease (FRE), Gunning Fog Index, Automated Readability Index, Coleman-Liau Index, and the Simple Measure of Gobbledygook Index readability score. ChatGPT-4 was then used to rewrite 25 randomly selected PEMs at or near the reading level of the average American (eighth-grade reading level). The rewritten PEMs were assessed for readability using the same measures of reading level and difficulty.
Results: The mean FRE for PEMs on all 5 common neurosurgical conditions were significantly greater than corresponding scores for an eighth-grade reading level (P < .001). The mean Kincaid value, Automated Readability Index, Coleman-Liau score, Gunning Fog Index, and Simple Measure of Gobbledygook Index for PEMs on each condition were all significantly greater than an eighth-grade reading level (P < .01). The mean FRE score for rewritten PEMs on each topic were significantly lower than nonrewritten materials (P < .01) except spinal stenosis (P = .104) and were validated for accuracy.
Conclusion: Existing PEMs published by the top US hospitals for common neurosurgical conditions may be too complicated for the average American that reads at an eighth-grade level. Large language model chatbots can be used to efficiently rewrite these PEMs at a lower reading level while maintaining the accuracy of the material.
Keywords: Artificial intelligence; ChatGPT; Health literacy; Large language models; Patient education materials.
Copyright © Congress of Neurological Surgeons 2025. All rights reserved.
References
-
- Liu C, Wang D, Liu C, et al. What is the meaning of health literacy? A systematic review and qualitative synthesis. Fam Med Community Health. 2020;8(2):e000351.
-
- Protheroe J, Nutbeam D, Rowlands G. Health literacy: a necessity for increasing participation in health care. Br J Gen Pract. 2009;59(567):721-723.
-
- Coughlin SS, Vernon M, Hatzigeorgiou C, George V. Health literacy, social determinants of health, and disease prevention and control. J Environ Health Sci. 2020;6(1):3061.
-
- White MS, Burns C, Conlon HA. The impact of an aging population in the workplace. Workplace Health Saf. 2018;66(10):493-498.
-
- Van Schependom J, D’haeseleer M. Advances in neurodegenerative diseases. J Clin Med. 2023;12(5):1709.
LinkOut - more resources
Full Text Sources