Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Jul;29(3):721-732.
doi: 10.3350/cmh.2023.0089. Epub 2023 Mar 22.

Assessing the performance of ChatGPT in answering questions regarding cirrhosis and hepatocellular carcinoma

Affiliations

Assessing the performance of ChatGPT in answering questions regarding cirrhosis and hepatocellular carcinoma

Yee Hui Yeo et al. Clin Mol Hepatol. 2023 Jul.

Abstract

Background/aims: Patients with cirrhosis and hepatocellular carcinoma (HCC) require extensive and personalized care to improve outcomes. ChatGPT (Generative Pre-trained Transformer), a large language model, holds the potential to provide professional yet patient-friendly support. We aimed to examine the accuracy and reproducibility of ChatGPT in answering questions regarding knowledge, management, and emotional support for cirrhosis and HCC.

Methods: ChatGPT's responses to 164 questions were independently graded by two transplant hepatologists and resolved by a third reviewer. The performance of ChatGPT was also assessed using two published questionnaires and 26 questions formulated from the quality measures of cirrhosis management. Finally, its emotional support capacity was tested.

Results: We showed that ChatGPT regurgitated extensive knowledge of cirrhosis (79.1% correct) and HCC (74.0% correct), but only small proportions (47.3% in cirrhosis, 41.1% in HCC) were labeled as comprehensive. The performance was better in basic knowledge, lifestyle, and treatment than in the domains of diagnosis and preventive medicine. For the quality measures, the model answered 76.9% of questions correctly but failed to specify decision-making cut-offs and treatment durations. ChatGPT lacked knowledge of regional guidelines variations, such as HCC screening criteria. However, it provided practical and multifaceted advice to patients and caregivers regarding the next steps and adjusting to a new diagnosis.

Conclusion: We analyzed the areas of robustness and limitations of ChatGPT's responses on the management of cirrhosis and HCC and relevant emotional support. ChatGPT may have a role as an adjunct informational tool for patients and physicians to improve outcomes.

Keywords: Artificial intelligence; Chronic disease management; Health communication; Patient education as topic; Telemedicine.

PubMed Disclaimer

Conflict of interest statement

Conflicts of Interest

The authors have no conflictsto disclose.

Figures

Figure 1.
Figure 1.
Flow chart of question selection for cirrhosis and hepatocellular carcinoma (HCC). Frequently asked questions about the knowledge and management of cirrhosis or HCC were collected from patient support groups on Facebook and well-regarded professional societies and institutions. FAQs, frequently asked questions.
Figure 2.
Figure 2.
Grade of responses by the ChatGPT language model to questions related to (A) cirrhosis and (B) hepatocellular carcinoma (HCC). The percentage of responses being graded as comprehensive, correct but inadequate, mixed with correct and incorrect/outdated data, and completely incorrect were provided. GPT, Generative Pre-trained Transformer.
None

Comment in

References

    1. GBD 2017 Cirrhosis Collaborators The global, regional, and national burden of cirrhosis by cause in 195 countries and territories, 1990-2017: a systematic analysis for the Global Burden of Disease Study 2017. Lancet Gastroenterol Hepatol. 2020;5:245–266. - PMC - PubMed
    1. Tsochatzis EA, Bosch J, Burroughs AK. Liver cirrhosis. Lancet. 2014;383:1749–1761. - PubMed
    1. Yang JD, Hainaut P, Gores GJ, Amadou A, Plymoth A, Roberts LR. A global view of hepatocellular carcinoma: trends, risk, prevention and management. Nat Rev Gastroenterol Hepatol. 2019;16:589–604. - PMC - PubMed
    1. Rumgay H, Arnold M, Ferlay J, Lesi O, Cabasag CJ, Vignat J, et al. Global burden of primary liver cancer in 2020 and predictions to 2040. J Hepatol. 2022;77:1598–1606. - PMC - PubMed
    1. Desai AP, Mohan P, Nokes B, Sheth D, Knapp S, Boustani M, et al. Increasing economic burden in hospitalized patients with cirrhosis: Analysis of a national database. Clin Transl Gastroenterol. 2019;10:e00062. - PMC - PubMed

MeSH terms