Artificial intelligence compared with human-derived patient educational materials on cirrhosis

Faruq Pradhan¹, Alexandra Fiedler², Kaeli Samson³, Marco Olivera-Martinez¹, Wuttiporn Manatsathit¹, Thoetchai Peeraphatdit¹

Affiliations

¹ Department of Gastroenterology and Hepatology, University of Nebraska Medicine, Omaha, Nebraska.
² Department of Internal Medicine, University of Nebraska Medicine, Omaha, Nebraska.
³ Department of Biostatistics, College of Public Health, University of Nebraska Medical Center, Omaha, Nebraska.

PMID: 38358382
PMCID: PMC10871753
DOI: 10.1097/HC9.0000000000000367

Faruq Pradhan et al. Hepatol Commun. 2024 Feb 14;8(3):e0367. doi: 10.1097/HC9.0000000000000367. eCollection 2024 Mar 1.

Abstract

Background: The study compared the readability, grade level, understandability, actionability, and accuracy of standard patient educational material against artificial intelligence chatbot-derived patient educational material regarding cirrhosis.

Methods: An identical standardized phrase was used to generate patient educational materials on cirrhosis from 4 large language model-derived chatbots (ChatGPT, DocsGPT, Google Bard, and Bing Chat), and the outputs were compared against a pre-existing human-derived educational material (Epic). Objective scores for readability and grade level were determined using the Flesch-Kincaid and Simple Measure of Gobbledygook scoring systems. Fourteen patients/caregivers and 8 transplant hepatologists, all blinded, independently scored the materials on understandability and actionability and indicated whether they believed each material was human or artificial intelligence-generated. Understandability and actionability were determined using the Patient Education Materials Assessment Tool for Printable Materials. Transplant hepatologists also provided medical accuracy scores.
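For readers unfamiliar with the two readability measures named in the Methods, the following is a minimal sketch of the standard published formulas for Flesch Reading Ease, Flesch-Kincaid Grade Level, and the Simple Measure of Gobbledygook (SMOG). It is not the authors' scoring procedure: the abstract does not say which tool or which Flesch-Kincaid variant was used. The function names and the vowel-group syllable counter are assumptions made for illustration, and the syllable counter is a rough heuristic that will miscount some words.

```python
import math
import re


def flesch_reading_ease(words: int, sentences: int, syllables: int) -> float:
    """Flesch Reading Ease: higher scores mean easier text."""
    return 206.835 - 1.015 * (words / sentences) - 84.6 * (syllables / words)


def flesch_kincaid_grade(words: int, sentences: int, syllables: int) -> float:
    """Flesch-Kincaid Grade Level: approximate US school grade."""
    return 0.39 * (words / sentences) + 11.8 * (syllables / words) - 15.59


def smog_grade(polysyllables: int, sentences: int) -> float:
    """SMOG grade from the count of words with 3 or more syllables,
    normalized to a 30-sentence sample."""
    return 1.0430 * math.sqrt(polysyllables * (30 / sentences)) + 3.1291


def count_syllables(word: str) -> int:
    """Hypothetical helper: crude estimate that counts vowel groups.
    Silent vowels (e.g. 'scarred') are overcounted."""
    groups = re.findall(r"[aeiouy]+", word.lower())
    return max(1, len(groups))


def text_counts(text: str) -> dict:
    """Hypothetical helper: derive the counts the formulas need from raw text."""
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z']+", text)
    syllables = [count_syllables(w) for w in words]
    return {
        "words": len(words),
        "sentences": len(sentences),
        "syllables": sum(syllables),
        "polysyllables": sum(1 for n in syllables if n >= 3),
    }


if __name__ == "__main__":
    sample = "Cirrhosis is scarring of the liver. Talk to your doctor about treatment."
    c = text_counts(sample)
    print(c)
    print(round(flesch_kincaid_grade(c["words"], c["sentences"], c["syllables"]), 2))
    print(round(smog_grade(c["polysyllables"], c["sentences"]), 2))
```

SMOG was designed for samples of about 30 sentences, so scores computed on very short passages like the sample string should be read as illustrative only.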

Results: Most educational materials scored similarly in readability and grade level but were above the desired sixth-grade reading level. All educational materials were deemed understandable by both groups, while only the human-derived educational material (Epic) was considered actionable by both groups. No significant difference in perceived actionability or understandability among the educational materials was identified. Both groups were poor at identifying which materials were human-derived versus artificial intelligence-derived.

Conclusions: Chatbot-derived patient educational materials have comparable readability, grade level, understandability, and accuracy to human-derived materials. Readability, grade level, and actionability may be appropriate targets for improvement across educational materials on cirrhosis. Chatbot-derived patient educational materials show promise, and further studies should assess their usefulness in clinical practice.

Conflict of interest statement

The authors have no conflicts to report.

Figures

**FIGURE 1**
Transplant hepatologists’ average accuracy scores for each author based on the scoring system utilized in Dy et al and Storino et al. A score of 1 indicates < 25% of the information is accurate; a score of 2 indicates 26%–50% of the information is accurate; a score of 3 indicates 51%–75% of the information is accurate; a score of 4 indicates 76%–99% of the information is accurate; a score of 5 indicates 100% of the information is accurate.
