Automatic data source identification for clinical trial eligibility criteria resolution

Chaitanya Shivade¹, Courtney Hebert², Kelly Regan², Eric Fosler-Lussier¹, Albert M Lai³

Affiliations

¹ Department of Computer Science and Engineering.
² Department of Biomedical Informatics, The Ohio State University, Columbus, OH.
³ Department of Biomedical Informatics, The Ohio State University, Columbus, OH.; National Institute of Health, Rehabilitation Medicine Department, Mark O. Hatfield Clinical Research Center, Bethesda, MD.

PMID: 28269912
PMCID: PMC5333255

Automatic data source identification for clinical trial eligibility criteria resolution

Chaitanya Shivade et al. AMIA Annu Symp Proc. 2017.

. 2017 Feb 10:2016:1149-1158.

eCollection 2016.

Authors

Chaitanya Shivade¹, Courtney Hebert², Kelly Regan², Eric Fosler-Lussier¹, Albert M Lai³

Affiliations

¹ Department of Computer Science and Engineering.
² Department of Biomedical Informatics, The Ohio State University, Columbus, OH.
³ Department of Biomedical Informatics, The Ohio State University, Columbus, OH.; National Institute of Health, Rehabilitation Medicine Department, Mark O. Hatfield Clinical Research Center, Bethesda, MD.

PMID: 28269912
PMCID: PMC5333255

Abstract

Clinical trial coordinators refer to both structured and unstructured sources of data when evaluating a subject for eligibility. While some eligibility criteria can be resolved using structured data, some require manual review of clinical notes. An important step in automating the trial screening process is to be able to identify the right data source for resolving each criterion. In this work, we discuss the creation of an eligibility criteria dataset for clinical trials for patients with two disparate diseases, annotated with the preferred data source for each criterion (i.e., structured or unstructured) by annotators with medical training. The dataset includes 50 heart-failure trials with a total of 766 eligibility criteria and 50 trials for chronic lymphocytic leukemia (CLL) with 677 criteria. Further, we developed machine learning models to predict the preferred data source: kernel methods outperform simpler learning models when used with a combination of lexical, syntactic, semantic, and surface features. Evaluation of these models indicates that the performance is consistent across data from both diagnoses, indicating generalizability of our method. Our findings are an important step towards ongoing efforts for automation of clinical trial screening.

PubMed Disclaimer

Figures

**Figure 1.**
Sample eligibility criteria across the three classification categories.

**Figure 2.**
Role of this work in an automated clinical trial screening workflow

See this image and copyright information in PMC

Cited by

Artificial Intelligence Applied to clinical trials: opportunities and challenges.
Askin S, Burkhalter D, Calado G, El Dakrouni S. Askin S, et al. Health Technol (Berl). 2023;13(2):203-213. doi: 10.1007/s12553-023-00738-2. Epub 2023 Feb 28. Health Technol (Berl). 2023. PMID: 36923325 Free PMC article. Review.
Sociotechnical feasibility of natural language processing-driven tools in clinical trial eligibility prescreening for Alzheimer's disease and related dementias.
Idnay B, Liu J, Fang Y, Hernandez A, Kaw S, Etwaru A, Juarez Padilla J, Ramírez SO, Marder K, Weng C, Schnall R. Idnay B, et al. J Am Med Inform Assoc. 2024 Apr 19;31(5):1062-1073. doi: 10.1093/jamia/ocae032. J Am Med Inform Assoc. 2024. PMID: 38447587 Free PMC article.
Digital tools for the recruitment and retention of participants in randomised controlled trials: a systematic map.
Frampton GK, Shepherd J, Pickett K, Griffiths G, Wyatt JC. Frampton GK, et al. Trials. 2020 Jun 5;21(1):478. doi: 10.1186/s13063-020-04358-3. Trials. 2020. PMID: 32498690 Free PMC article. Review.
Use of Natural Language Processing to Extract Clinical Cancer Phenotypes from Electronic Medical Records.
Savova GK, Danciu I, Alamudun F, Miller T, Lin C, Bitterman DS, Tourassi G, Warner JL. Savova GK, et al. Cancer Res. 2019 Nov 1;79(21):5463-5470. doi: 10.1158/0008-5472.CAN-19-0579. Epub 2019 Aug 8. Cancer Res. 2019. PMID: 31395609 Free PMC article. Review.
Automated NLP Extraction of Clinical Rationale for Treatment Discontinuation in Breast Cancer.
Alkaitis MS, Agrawal MN, Riely GJ, Razavi P, Sontag D. Alkaitis MS, et al. JCO Clin Cancer Inform. 2021 May;5:550-560. doi: 10.1200/CCI.20.00139. JCO Clin Cancer Inform. 2021. PMID: 33989016 Free PMC article.

See all "Cited by" articles

References

1. Prokosch HU, Ganslandt T. Perspectives for medical informatics. Reusing the electronic medical record for clinical research. Methods Inf Med. 2009 Jan.48(1):38–44. - PubMed
1. Kopcke F, Prokosch H-U. Employing Computers for the Recruitment into Clinical Trials: A Comprehensive Systematic Review. J Med Internet Res. 2014 Jan.16(7):e161. - PMC - PubMed
1. Penberthy LT, Dahman BA, Petkov VI, DeShazo JP. Effort required in eligibility screening for clinical trials. J Oncol Pract. 2012 Nov.8(6):365–70. - PMC - PubMed
1. Shivade C, Raghavan P, Fosler-Lussier E, Embi PJ, Elhadad N, Johnson SB, et al. A review of approaches to identifying patient phenotype cohorts using electronic health records. J Am Med Inform Assoc. 2014 Mar.21(2):221–30. - PMC - PubMed
1. Rosenbloom ST, Denny JC, Xu H, Lorenzi N, Stead WW, Johnson KB. Data from clinical notes: a perspective on the tension between structure and flexible documentation. J Am Med Inform Assoc. 2011;18(2):181–6. - PMC - PubMed

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Grants and funding

R01 LM011116/LM/NLM NIH HHS/United States

LinkOut - more resources

Full Text Sources
- Europe PubMed Central
- PubMed Central
Other Literature Sources
- The Lens - Patent Citations Database
Medical
- MedlinePlus Health Information

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Automatic data source identification for clinical trial eligibility criteria resolution

Affiliations

Automatic data source identification for clinical trial eligibility criteria resolution

Authors

Affiliations

Abstract

Figures

Similar articles

Cited by

References

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources

Medical