Using GPT-4 for LI-RADS feature extraction and categorization with multilingual free-text reports

Kyowon Gu¹, Jeong Hyun Lee¹, Jaeseung Shin¹, Jeong Ah Hwang¹, Ji Hye Min¹, Woo Kyoung Jeong¹, Min Woo Lee¹, Kyoung Doo Song¹, Sung Hwan Bae²

Affiliations

¹ Department of Radiology and Center for Imaging Science, Samsung Medical Center, Sungkyunkwan University School of Medicine, Seoul, Republic of Korea.
² Department of Radiology, Soonchunhyang University College of Medicine, Seoul Hospital, Seoul, Republic of Korea.

PMID: 38651924
DOI: 10.1111/liv.15891

Using GPT-4 for LI-RADS feature extraction and categorization with multilingual free-text reports

Kyowon Gu et al. Liver Int. 2024 Jul.

. 2024 Jul;44(7):1578-1587.

doi: 10.1111/liv.15891. Epub 2024 Apr 23.

Authors

Kyowon Gu¹, Jeong Hyun Lee¹, Jaeseung Shin¹, Jeong Ah Hwang¹, Ji Hye Min¹, Woo Kyoung Jeong¹, Min Woo Lee¹, Kyoung Doo Song¹, Sung Hwan Bae²

Affiliations

¹ Department of Radiology and Center for Imaging Science, Samsung Medical Center, Sungkyunkwan University School of Medicine, Seoul, Republic of Korea.
² Department of Radiology, Soonchunhyang University College of Medicine, Seoul Hospital, Seoul, Republic of Korea.

PMID: 38651924
DOI: 10.1111/liv.15891

Abstract

Background and aims: The Liver Imaging Reporting and Data System (LI-RADS) offers a standardized approach for imaging hepatocellular carcinoma. However, the diverse styles and structures of radiology reports complicate automatic data extraction. Large language models hold the potential for structured data extraction from free-text reports. Our objective was to evaluate the performance of Generative Pre-trained Transformer (GPT)-4 in extracting LI-RADS features and categories from free-text liver magnetic resonance imaging (MRI) reports.

Methods: Three radiologists generated 160 fictitious free-text liver MRI reports written in Korean and English, simulating real-world practice. Of these, 20 were used for prompt engineering, and 140 formed the internal test cohort. Seventy-two genuine reports, authored by 17 radiologists were collected and de-identified for the external test cohort. LI-RADS features were extracted using GPT-4, with a Python script calculating categories. Accuracies in each test cohort were compared.

Results: On the external test, the accuracy for the extraction of major LI-RADS features, which encompass size, nonrim arterial phase hyperenhancement, nonperipheral 'washout', enhancing 'capsule' and threshold growth, ranged from .92 to .99. For the rest of the LI-RADS features, the accuracy ranged from .86 to .97. For the LI-RADS category, the model showed an accuracy of .85 (95% CI: .76, .93).

Conclusions: GPT-4 shows promise in extracting LI-RADS features, yet further refinement of its prompting strategy and advancements in its neural network architecture are crucial for reliable use in processing complex real-world MRI reports.

Keywords: GPT‐4; LI‐RADS; large language model; natural language processing; structured report.

PubMed Disclaimer

References

REFERENCES

1. Chernyak V, Fowler KJ, Kamaya A, et al. Liver imaging reporting and data system (LI‐RADS) version 2018: imaging of hepatocellular carcinoma in At‐risk patients. Radiology. 2018;289(3):816‐830.
1. Elsayes KM, Kielar AZ, Chernyak V, et al. LI‐RADS: a conceptual and historical review from its beginning to its recent integration into AASLD clinical practice guidance. J Hepatocell Carcinoma. 2019;6:49‐69.
1. Wallis A, McCoubrie P. The radiology report—are we getting the message across? Clin Radiol. 2011;66(11):1015‐1022.
1. Park H, Song M, Lee EB, Seo BK, Choi CM. An attention model with transfer embeddings to classify pneumonia‐related bilingual imaging reports: algorithm development and validation. JMIR Med Inform. 2021;9(5):e24803.
1. Adams LC, Truhn D, Busch F, et al. Leveraging GPT‐4 for post hoc transformation of free‐text radiology reports into structured reporting: a multilingual feasibility study. Radiology. 2023;307:e230725.

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Grants and funding

Korea Health Industry Development Institute

LinkOut - more resources

Full Text Sources
- Ovid Technologies, Inc.
- Wiley
Medical
- MedlinePlus Health Information

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Using GPT-4 for LI-RADS feature extraction and categorization with multilingual free-text reports

Affiliations

Using GPT-4 for LI-RADS feature extraction and categorization with multilingual free-text reports

Authors

Affiliations

Abstract

References

REFERENCES

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources

Medical