Assessing the performance of ChatGPT in answering questions regarding cirrhosis and hepatocellular carcinoma

Yee Hui Yeo¹, Jamil S Samaan¹, Wee Han Ng², Peng-Sheng Ting³, Hirsh Trivedi¹,⁴, Aarshi Vipani¹, Walid Ayoub¹,⁴, Ju Dong Yang¹,⁴,⁵, Omer Liran⁶,⁷, Brennan Spiegel¹,⁷, Alexander Kuo¹,⁴

Affiliations

¹ Karsh Division of Gastroenterology and Hepatology, Department of Medicine, Cedars-Sinai Medical Center, Los Angeles, CA, USA.
² Bristol Medical School, University of Bristol, Bristol, UK.
³ School of Medicine, Tulane University, New Orleans, LA, USA.
⁴ Comprehensive Transplant Center, Cedars-Sinai Medical Center, Los Angeles, CA, USA.
⁵ Samuel Oschin Comprehensive Cancer Institute, Cedars-Sinai Medical Center, Los Angeles, CA, USA.
⁶ Department of Psychiatry and Behavioral Sciences, Cedars-Sinai, Los Angeles, CA, USA.
⁷ Division of Health Services Research, Department of Medicine, Cedars-Sinai, Los Angeles, CA, USA.

PMID: 36946005
PMCID: PMC10366809
DOI: 10.3350/cmh.2023.0089

Yee Hui Yeo et al. Clin Mol Hepatol. 2023 Jul;29(3):721-732. doi: 10.3350/cmh.2023.0089. Epub 2023 Mar 22.

Abstract

Background/aims: Patients with cirrhosis and hepatocellular carcinoma (HCC) require extensive and personalized care to improve outcomes. ChatGPT (Generative Pre-trained Transformer), a large language model, holds the potential to provide professional yet patient-friendly support. We aimed to examine the accuracy and reproducibility of ChatGPT in answering questions regarding knowledge, management, and emotional support for cirrhosis and HCC.

Methods: ChatGPT's responses to 164 questions were independently graded by two transplant hepatologists and resolved by a third reviewer. The performance of ChatGPT was also assessed using two published questionnaires and 26 questions formulated from the quality measures of cirrhosis management. Finally, its emotional support capacity was tested.

Results: We showed that ChatGPT regurgitated extensive knowledge of cirrhosis (79.1% correct) and HCC (74.0% correct), but only small proportions (47.3% in cirrhosis, 41.1% in HCC) were labeled as comprehensive. The performance was better in basic knowledge, lifestyle, and treatment than in the domains of diagnosis and preventive medicine. For the quality measures, the model answered 76.9% of questions correctly but failed to specify decision-making cut-offs and treatment durations. ChatGPT lacked knowledge of variations in regional guidelines, such as HCC screening criteria. However, it provided practical and multifaceted advice to patients and caregivers regarding the next steps and adjusting to a new diagnosis.

Conclusion: We analyzed the areas of robustness and limitations of ChatGPT's responses on the management of cirrhosis and HCC and relevant emotional support. ChatGPT may have a role as an adjunct informational tool for patients and physicians to improve outcomes.

Keywords: Artificial intelligence; Chronic disease management; Health communication; Patient education as topic; Telemedicine.
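The grading procedure described in the Methods (two independent graders, with disagreements resolved by a third reviewer) can be illustrated with a minimal Python sketch. The grade labels are taken from the Figure 2 caption; the function names, the rule that the third reviewer's grade is final, and any input data are assumptions made for illustration, not the authors' actual workflow or results.

```python
from collections import Counter

# Grade labels as listed in the Figure 2 caption.
GRADES = (
    "comprehensive",
    "correct but inadequate",
    "mixed with correct and incorrect/outdated data",
    "completely incorrect",
)


def resolve_grade(reviewer_a, reviewer_b, adjudicator=None):
    """Return the agreed grade, or the third reviewer's grade on disagreement.

    Assumption: when the two graders disagree, the third reviewer's
    grade is taken as final.
    """
    for grade in (reviewer_a, reviewer_b):
        if grade not in GRADES:
            raise ValueError(f"unknown grade: {grade!r}")
    if reviewer_a == reviewer_b:
        return reviewer_a
    if adjudicator is None:
        raise ValueError("disagreement requires a third reviewer's grade")
    if adjudicator not in GRADES:
        raise ValueError(f"unknown grade: {adjudicator!r}")
    return adjudicator


def grade_percentages(final_grades):
    """Return the percentage of responses in each grade category."""
    if not final_grades:
        raise ValueError("no grades supplied")
    counts = Counter(final_grades)
    total = len(final_grades)
    return {grade: 100.0 * counts[grade] / total for grade in GRADES}
```

Calling `resolve_grade` once per question and passing the resulting list to `grade_percentages` yields a per-category breakdown of the kind summarized in Figure 2.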

Conflict of interest statement

The authors have no conflicts to disclose.

Figures

**Figure 1.**
Flow chart of question selection for cirrhosis and hepatocellular carcinoma (HCC). Frequently asked questions about the knowledge and management of cirrhosis or HCC were collected from patient support groups on Facebook and well-regarded professional societies and institutions. FAQs, frequently asked questions.

**Figure 2.**
Grades of responses by the ChatGPT language model to questions related to (A) cirrhosis and (B) hepatocellular carcinoma (HCC). The percentages of responses graded as comprehensive, correct but inadequate, mixed with correct and incorrect/outdated data, and completely incorrect are shown. GPT, Generative Pre-trained Transformer.

Comment in

Letter 1 regarding "Assessing the performance of ChatGPT in answering questions regarding cirrhosis and hepatocellular carcinoma".
Ali H. Clin Mol Hepatol. 2023 Jul;29(3):813-814. doi: 10.3350/cmh.2023.0120. Epub 2023 May 19. PMID: 37211355.
Letter 2 regarding "Assessing the performance of ChatGPT in answering questions regarding cirrhosis and hepatocellular carcinoma".
Kleebayoon A, Wiwanitkit V. Clin Mol Hepatol. 2023 Jul;29(3):815-816. doi: 10.3350/cmh.2023.0170. Epub 2023 May 24. PMID: 37221834.
Letter 1 regarding "Assessing the performance of ChatGPT in answering questions regarding cirrhosis and hepatocellular carcinoma".
Daungsupawong H, Wiwanitkit V. Clin Mol Hepatol. 2024 Jan;30(1):111-112. doi: 10.3350/cmh.2023.0394. Epub 2023 Oct 13. PMID: 37828840.
Letter 2 regarding "Assessing the performance of ChatGPT in answering questions regarding cirrhosis and hepatocellular carcinoma".
Zhang Y, Wu L, Mu Z, Ren L, Chen Y, Liu H, Xu L, Wang Y, Wang Y, Cheng S, Tham YC, Sheng B, Wong TY, Ji H. Clin Mol Hepatol. 2024 Jan;30(1):113-117. doi: 10.3350/cmh.2023.0440. Epub 2023 Nov 10. PMID: 37946373.
