Data extraction for evidence synthesis using a large language model: A proof-of-concept study

Gerald Gartlehner et al. Res Synth Methods. 2024 Jul;15(4):576-589. doi: 10.1002/jrsm.1710. Epub 2024 Mar 3.

Abstract

Data extraction is a crucial, yet labor-intensive and error-prone part of evidence synthesis. To date, efforts to harness machine learning for enhancing efficiency of the data extraction process have fallen short of achieving sufficient accuracy and usability. With the release of large language models (LLMs), new possibilities have emerged to increase efficiency and accuracy of data extraction for evidence synthesis. The objective of this proof-of-concept study was to assess the performance of an LLM (Claude 2) in extracting data elements from published studies, compared with human data extraction as employed in systematic reviews. Our analysis utilized a convenience sample of 10 English-language, open-access publications of randomized controlled trials included in a single systematic review. We selected 16 distinct types of data, posing varying degrees of difficulty (160 data elements across 10 studies). We used the browser version of Claude 2 to upload the portable document format of each publication and then prompted the model for each data element. Across 160 data elements, Claude 2 demonstrated an overall accuracy of 96.3% with a high test-retest reliability (replication 1: 96.9%; replication 2: 95.0% accuracy). Overall, Claude 2 made 6 errors on 160 data items. The most common errors (n = 4) were missed data items. Importantly, Claude 2's ease of use was high; it required no technical expertise or labeled training data for effective operation (i.e., zero-shot learning). Based on findings of our proof-of-concept study, leveraging LLMs has the potential to substantially enhance the efficiency and accuracy of data extraction for evidence syntheses.

Keywords: accuracy; artificial intelligence; data extraction; evidence synthesis; large language models; proof of concept.
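To make the workflow concrete: the study's procedure (upload a trial PDF, then prompt the model once per data element, with no labeled training data) can be scripted rather than run through the browser interface. Below is a minimal sketch under stated assumptions. The authors used the Claude 2 browser UI; the API calls, model name, prompt wording, PDF handling via pypdf, and the sample data-element list are all illustrative assumptions, not the study's protocol.

```python
# Minimal sketch of zero-shot data extraction with an LLM, loosely
# mirroring the workflow described in the abstract: one prompt per
# data element, no labeled training data. All names below (model,
# prompt text, file path, element list) are illustrative assumptions.
import anthropic
from pypdf import PdfReader  # assumption: PDF text extraction via pypdf

# The study covered 16 data-element types; a few examples shown here.
DATA_ELEMENTS = [
    "number of participants randomized",
    "mean age of participants",
    "primary outcome",
]

def pdf_to_text(path: str) -> str:
    """Concatenate the text of every page of the trial publication."""
    reader = PdfReader(path)
    return "\n".join(page.extract_text() or "" for page in reader.pages)

def extract_element(client: anthropic.Anthropic, study_text: str, element: str) -> str:
    """Ask the model for a single data element (zero-shot)."""
    prompt = (
        "You are given the full text of a randomized controlled trial.\n"
        f"Report the {element} exactly as stated in the study. "
        "If it is not reported, answer 'not reported'.\n\n"
        f"{study_text}"
    )
    message = client.messages.create(
        model="claude-2.1",  # assumption; the study used Claude 2 via the browser
        max_tokens=256,
        messages=[{"role": "user", "content": prompt}],
    )
    return message.content[0].text.strip()

if __name__ == "__main__":
    client = anthropic.Anthropic()  # reads ANTHROPIC_API_KEY from the environment
    text = pdf_to_text("trial.pdf")  # hypothetical input file
    for element in DATA_ELEMENTS:
        print(element, "->", extract_element(client, text, element))
```

Prompting for one element at a time, as the study did, keeps each answer easy to verify against human extraction and makes per-element accuracy (e.g., the 96.3% reported above) straightforward to tally.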


