Identifying reports of randomized controlled trials (RCTs) via a hybrid machine learning and crowdsourcing approach
- PMID: 28541493
- PMCID: PMC5975623
- DOI: 10.1093/jamia/ocx053
Identifying reports of randomized controlled trials (RCTs) via a hybrid machine learning and crowdsourcing approach
Abstract
Objectives: Identifying all published reports of randomized controlled trials (RCTs) is an important aim, but it requires extensive manual effort to separate RCTs from non-RCTs, even using current machine learning (ML) approaches. We aimed to make this process more efficient via a hybrid approach using both crowdsourcing and ML.
Methods: We trained a classifier to discriminate between citations that describe RCTs and those that do not. We then adopted a simple strategy of automatically excluding citations deemed very unlikely to be RCTs by the classifier and deferring to crowdworkers otherwise.
Results: Combining ML and crowdsourcing provides a highly sensitive RCT identification strategy (our estimates suggest 95%-99% recall) with substantially less effort (we observed a reduction of around 60%-80%) than relying on manual screening alone.
Conclusions: Hybrid crowd-ML strategies warrant further exploration for biomedical curation/annotation tasks.
Keywords: crowdsourcing; evidence-based medicine; human computation; machine learning; natural language processing.
© The Author 2017. Published by Oxford University Press on behalf of the American Medical Informatics Association.
Figures


Similar articles
-
Machine learning reduced workload with minimal risk of missing studies: development and evaluation of a randomized controlled trial classifier for Cochrane Reviews.J Clin Epidemiol. 2021 May;133:140-151. doi: 10.1016/j.jclinepi.2020.11.003. Epub 2020 Nov 7. J Clin Epidemiol. 2021. PMID: 33171275 Free PMC article.
-
Citation screening using crowdsourcing and machine learning produced accurate results: Evaluation of Cochrane's modified Screen4Me service.J Clin Epidemiol. 2021 Feb;130:23-31. doi: 10.1016/j.jclinepi.2020.09.024. Epub 2020 Sep 30. J Clin Epidemiol. 2021. PMID: 33007457
-
Cochrane Centralised Search Service showed high sensitivity identifying randomized controlled trials: A retrospective analysis.J Clin Epidemiol. 2020 Nov;127:142-150. doi: 10.1016/j.jclinepi.2020.08.008. Epub 2020 Aug 13. J Clin Epidemiol. 2020. PMID: 32798713
-
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217. Cochrane Database Syst Rev. 2022. PMID: 36321557 Free PMC article.
-
Crowdsourcing the Citation Screening Process for Systematic Reviews: Validation Study.J Med Internet Res. 2019 Apr 29;21(4):e12953. doi: 10.2196/12953. J Med Internet Res. 2019. PMID: 31033444 Free PMC article.
Cited by
-
Biologics for chronic rhinosinusitis.Cochrane Database Syst Rev. 2021 Mar 12;3(3):CD013513. doi: 10.1002/14651858.CD013513.pub3. Cochrane Database Syst Rev. 2021. PMID: 33710614 Free PMC article.
-
Successful incorporation of single reviewer assessments during systematic review screening: development and validation of sensitivity and work-saved of an algorithm that considers exclusion criteria and count.Syst Rev. 2021 Apr 5;10(1):98. doi: 10.1186/s13643-021-01632-6. Syst Rev. 2021. PMID: 33820560 Free PMC article.
-
Positive pressure therapy for Ménière's disease.Cochrane Database Syst Rev. 2023 Feb 23;2(2):CD015248. doi: 10.1002/14651858.CD015248.pub2. Cochrane Database Syst Rev. 2023. PMID: 36815713 Free PMC article. Review.
-
Antithrombotic therapy for ambulatory patients with multiple myeloma receiving immunomodulatory agents.Cochrane Database Syst Rev. 2021 Sep 28;9(9):CD014739. doi: 10.1002/14651858.CD014739. Cochrane Database Syst Rev. 2021. PMID: 34582035 Free PMC article.
-
Parenteral anticoagulation in ambulatory patients with cancer.Cochrane Database Syst Rev. 2017 Sep 11;9(9):CD006652. doi: 10.1002/14651858.CD006652.pub5. Cochrane Database Syst Rev. 2017. PMID: 28892556 Free PMC article.
References
-
- Chalmers I. The Cochrane collaboration: preparing, maintaining, and disseminating systematic reviews of the effects of health care. Ann NY Acad Sci. 1993;7031:156–65. - PubMed
-
- McKibbon KA, Wilczynskil NL, Haynes RB. Retrieving randomized controlled trials from Medline: a comparison of 38 published search filters. Health Info Libr J. 2009;263:187–202. - PubMed
-
- Wieland LS, Robinson KA, Dickersin K. Understanding why evidence from randomised clinical trials may not be retrieved from Medline: comparison of indexed and non-indexed records. Brit Med J. 2012;344:2008–12. - PubMed
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources