This is a preprint.

It has not yet been peer reviewed by a journal.

The National Library of Medicine is running a pilot to include preprints that result from research funded by NIH in PMC and PubMed.

[Preprint]. 2023 Sep 4:2023.08.31.23294924.

doi: 10.1101/2023.08.31.23294924.

A comparison of large language model versus manual chart review for extraction of data elements from the electronic health record

Jin Ge¹, Michael Li¹, Molly B Delk², Jennifer C Lai¹

Affiliations

¹ Division of Gastroenterology and Hepatology, Department of Medicine, University of California - San Francisco, San Francisco, CA.
² Section of Gastroenterology and Hepatology, Department of Medicine, Tulane University School of Medicine, New Orleans, LA.

PMID: 37693398
PMCID: PMC10491368
DOI: 10.1101/2023.08.31.23294924

A comparison of large language model versus manual chart review for extraction of data elements from the electronic health record

Jin Ge et al. medRxiv. 2023.

[Preprint]. 2023 Sep 4:2023.08.31.23294924.

doi: 10.1101/2023.08.31.23294924.

Authors

Jin Ge¹, Michael Li¹, Molly B Delk², Jennifer C Lai¹

Affiliations

¹ Division of Gastroenterology and Hepatology, Department of Medicine, University of California - San Francisco, San Francisco, CA.
² Section of Gastroenterology and Hepatology, Department of Medicine, Tulane University School of Medicine, New Orleans, LA.

PMID: 37693398
PMCID: PMC10491368
DOI: 10.1101/2023.08.31.23294924

Update in

A Comparison of a Large Language Model vs Manual Chart Review for the Extraction of Data Elements From the Electronic Health Record.
Ge J, Li M, Delk MB, Lai JC. Ge J, et al. Gastroenterology. 2024 Apr;166(4):707-709.e3. doi: 10.1053/j.gastro.2023.12.019. Epub 2023 Dec 25. Gastroenterology. 2024. PMID: 38151192 Free PMC article. No abstract available.

Abstract

Importance: Large language models (LLMs) have proven useful for extracting data from publicly available sources, but their uses in clinical settings and with clinical data are unknown.

Objective: To determine the accuracy of data extraction using "Versa Chat," a chat implementation of the general-purpose OpenAI gpt-35-turbo LLM model, versus manual chart review for hepatocellular carcinoma (HCC) imaging reports.

Design: We engineered a prompt for the data extraction task of six distinct data elements and input 182 abdominal imaging reports that were also manually tagged. We evaluated performance by calculating accuracy, precision, recall, and F1 scores.

Setting/participants: Cross-sectional abdominal imaging reports of patients diagnosed with hepatocellular carcinoma enrolled in the Functional Assessment in Liver Transplantation (FrAILT) study.

PubMed Disclaimer

Conflict of interest statement

Disclosures: The authors of this manuscript have the following potential conflicts of interest to disclose: Dr. Jin Ge receives research support from Merck and Co; and consults for Astellas Pharmaceuticals/Iota Biosciences.Dr. Jennifer C. Lai receives research support from Lipocene and Vir Biotechnologies; receives an education grant from Nestle Nutrition Sciences; serves on an advisory board for Novo Nordisk; and consults for Genfit, Third Rock Ventures, and Boehringer Ingelheim.

Figures

**Figure 1 –**
Final prompt used for data extraction from “Versa Chat” (gpt-35-turbo)

**Figure 2 –**
Example of an exchange with “Versa Chat” (gpt-35-turbo) using mock data

See this image and copyright information in PMC

References

1. Ge J, Lai JC. Artificial intelligence-based text generators in hepatology: ChatGPT is just the beginning. Hepatol Commun. 2023;7(4). doi: 10.1097/HC9.0000000000000097 - DOI - PMC - PubMed
1. Singhal K, Azizi S, Tu T, et al. Large language models encode clinical knowledge. Nature. 2023;620(7972):172–180. doi: 10.1038/s41586-023-06291-2 - DOI - PMC - PubMed
1. Chernyak V, Fowler KJ, Kamaya A, et al. Liver Imaging Reporting and Data System (LI-RADS) Version 2018: Imaging of Hepatocellular Carcinoma in At-Risk Patients. Radiology. 2018;289(3):816–830. doi: 10.1148/radiol.2018181494 - DOI - PMC - PubMed
1. Azure OpenAI Service – Large Language Models for Generative AI. https://azure.microsoft.com/en-us/products/ai-services/openai-service-b. Accessed August 25, 2023.
1. Azure OpenAI Service models - Azure OpenAI | Microsoft Learn. https://learn.microsoft.com/en-us/azure/ai-services/openai/concepts/models. Accessed August 26, 2023.

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

This is a preprint.

A comparison of large language model versus manual chart review for extraction of data elements from the electronic health record

Affiliations

A comparison of large language model versus manual chart review for extraction of data elements from the electronic health record

Authors

Affiliations

Update in

Abstract

Conflict of interest statement

Figures

References

Publication types

Grants and funding

LinkOut - more resources

Full Text Sources