Error rates of human reviewers during abstract screening in systematic reviews

Zhen Wang^{1

2}, Tarek Nayfeh², Jennifer Tetzlaff³, Peter O'Blenis³, Mohammad Hassan Murad^{1

2}

Affiliations

¹ Evidence-based Practice Center, Mayo Clinic, Rochester, Minnesota, United States of America.
² Robert D. and Patricia E. Kern Center for the Science of Health Care Delivery Mayo Clinic, Rochester, Minnesota, United States of America.
³ Evidence Partners, Ottawa, Ontario, Canada.

PMID: 31935267
PMCID: PMC6959565
DOI: 10.1371/journal.pone.0227742

Error rates of human reviewers during abstract screening in systematic reviews

Zhen Wang et al. PLoS One. 2020.

. 2020 Jan 14;15(1):e0227742.

doi: 10.1371/journal.pone.0227742. eCollection 2020.

Authors

Zhen Wang^{1

2}, Tarek Nayfeh², Jennifer Tetzlaff³, Peter O'Blenis³, Mohammad Hassan Murad^{1

2}

Affiliations

¹ Evidence-based Practice Center, Mayo Clinic, Rochester, Minnesota, United States of America.
² Robert D. and Patricia E. Kern Center for the Science of Health Care Delivery Mayo Clinic, Rochester, Minnesota, United States of America.
³ Evidence Partners, Ottawa, Ontario, Canada.

PMID: 31935267
PMCID: PMC6959565
DOI: 10.1371/journal.pone.0227742

Abstract

Background: Automated approaches to improve the efficiency of systematic reviews are greatly needed. When testing any of these approaches, the criterion standard of comparison (gold standard) is usually human reviewers. Yet, human reviewers make errors in inclusion and exclusion of references.

Objectives: To determine citation false inclusion and false exclusion rates during abstract screening by pairs of independent reviewers. These rates can help in designing, testing and implementing automated approaches.

Methods: We identified all systematic reviews conducted between 2010 and 2017 by an evidence-based practice center in the United States. Eligible reviews had to follow standard systematic review procedures with dual independent screening of abstracts and full texts, in which citation inclusion by one reviewer prompted automatic inclusion through the next level of screening. Disagreements between reviewers during full text screening were reconciled via consensus or arbitration by a third reviewer. A false inclusion or exclusion was defined as a decision made by a single reviewer that was inconsistent with the final included list of studies.

Results: We analyzed a total of 139,467 citations that underwent 329,332 inclusion and exclusion decisions from 86 unique reviewers. The final systematic reviews included 5.48% of the potential references identified through bibliographic database search (95% confidence interval (CI): 2.38% to 8.58%). After abstract screening, the total error rate (false inclusion and false exclusion) was 10.76% (95% CI: 7.43% to 14.09%).

Conclusions: This study suggests important false inclusion and exclusion rates by human reviewers. When deciding the validity of a future automated study selection algorithm, it is important to keep in mind that the gold standard is not perfect and that achieving error rates similar to humans may be adequate and can save resources and time.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

**Fig 1. Errors occurred during systematic review abstract screening.**

See this image and copyright information in PMC

References

1. Cochrane AL. 1931–1971: a critical review, with particular reference to the medical profession. Medicines for the year. 2000;1979:1.
1. Ioannidis JP. The Mass Production of Redundant, Misleading, and Conflicted Systematic Reviews and Meta-analyses. Milbank Q. 2016;94(3):485–514. Epub 2016/09/14. 10.1111/1468-0009.12210 - DOI - PMC - PubMed
1. Page MJ, Altman DG, McKenzie JE, Shamseer L, Ahmadzai N, Wolfe D, et al. Flaws in the application and interpretation of statistical analyses in systematic reviews of therapeutic interventions were common: a cross-sectional analysis. J Clin Epidemiol. 2018;95:7–18. Epub 2017/12/06. 10.1016/j.jclinepi.2017.11.022 . - DOI - PubMed
1. Baudard M, Yavchitz A, Ravaud P, Perrodeau E, Boutron I. Impact of searching clinical trial registries in systematic reviews of pharmaceutical treatments: methodological systematic review and reanalysis of meta-analyses. BMJ. 2017;356:j448 Epub 2017/02/19. 10.1136/bmj.j448 - DOI - PMC - PubMed
1. Higgins JP, Green S. Cochrane handbook for systematic reviews of interventions: John Wiley & Sons; 2011.

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Error rates of human reviewers during abstract screening in systematic reviews

Affiliations

Error rates of human reviewers during abstract screening in systematic reviews

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

References

MeSH terms

LinkOut - more resources

Full Text Sources