Text-mining in electronic healthcare records can be used as efficient tool for screening and data collection in cardiovascular trials: a multicenter validation study
- PMID: 33248277
- DOI: 10.1016/j.jclinepi.2020.11.014
Text-mining in electronic healthcare records can be used as efficient tool for screening and data collection in cardiovascular trials: a multicenter validation study
Abstract
Objective: This study aimed to validate trial patient eligibility screening and baseline data collection using text-mining in electronic healthcare records (EHRs), comparing the results to those of an international trial.
Study design and setting: In three medical centers with different EHR vendors, EHR-based text-mining was used to automatically screen patients for trial eligibility and extract baseline data on nineteen characteristics. First, the yield of screening with automated EHR text-mining search was compared with manual screening by research personnel. Second, the accuracy of extracted baseline data by EHR text mining was compared to manual data entry by research personnel.
Results: Of the 92,466 patients visiting the out-patient cardiology departments, 568 (0.6%) were enrolled in the trial during its recruitment period using manual screening methods. Automated EHR data screening of all patients showed that the number of patients needed to screen could be reduced by 73,863 (79.9%). The remaining 18,603 (20.1%) contained 458 of the actual participants (82.4% of participants). In trial participants, automated EHR text-mining missed a median of 2.8% (Interquartile range [IQR] across all variables 0.4-8.5%) of all data points compared to manually collected data. The overall accuracy of automatically extracted data was 88.0% (IQR 84.7-92.8%).
Conclusion: Automatically extracting data from EHRs using text-mining can be used to identify trial participants and to collect baseline information.
Keywords: Cardiovascular; Data-collections; Data-mining; Electronic healthcare records (EHRs); Electronic medical records (EMRs); LoDoCo2; Multicenter; Recruitment; Screening; Text-mining; Trials.
Copyright © 2020 The Authors. Published by Elsevier Inc. All rights reserved.
Similar articles
-
Data mining information from electronic health records produced high yield and accuracy for current smoking status.J Clin Epidemiol. 2020 Feb;118:100-106. doi: 10.1016/j.jclinepi.2019.11.006. Epub 2019 Nov 12. J Clin Epidemiol. 2020. PMID: 31730918
-
DEVELOPMENT AND PERFORMANCE OF TEXT-MINING ALGORITHMS TO EXTRACT SOCIOECONOMIC STATUS FROM DE-IDENTIFIED ELECTRONIC HEALTH RECORDS.Pac Symp Biocomput. 2017;22:230-241. doi: 10.1142/9789813207813_0023. Pac Symp Biocomput. 2017. PMID: 27896978 Free PMC article.
-
Utilization of EHRs for clinical trials: a systematic review.BMC Med Res Methodol. 2024 Mar 18;24(1):70. doi: 10.1186/s12874-024-02177-7. BMC Med Res Methodol. 2024. PMID: 38494497 Free PMC article.
-
An Electronic Health Record Text Mining Tool to Collect Real-World Drug Treatment Outcomes: A Validation Study in Patients With Metastatic Renal Cell Carcinoma.Clin Pharmacol Ther. 2020 Sep;108(3):644-652. doi: 10.1002/cpt.1966. Epub 2020 Jul 18. Clin Pharmacol Ther. 2020. PMID: 32575147 Free PMC article.
-
Natural language processing of symptoms documented in free-text narratives of electronic health records: a systematic review.J Am Med Inform Assoc. 2019 Apr 1;26(4):364-379. doi: 10.1093/jamia/ocy173. J Am Med Inform Assoc. 2019. PMID: 30726935 Free PMC article.
Cited by
-
WeChat assisted electronic symptom measurement for patients with adenomyosis.BMC Med Inform Decis Mak. 2024 Jun 17;24(1):168. doi: 10.1186/s12911-024-02570-8. BMC Med Inform Decis Mak. 2024. PMID: 38886791 Free PMC article.
-
Artificial intelligence for optimizing recruitment and retention in clinical trials: a scoping review.J Am Med Inform Assoc. 2024 Nov 1;31(11):2749-2759. doi: 10.1093/jamia/ocae243. J Am Med Inform Assoc. 2024. PMID: 39259922 Free PMC article.
-
Treatment Patterns of Cancer-associated Thrombosis in the Netherlands: The Four Cities Study.TH Open. 2024 Jan 30;8(1):e61-e71. doi: 10.1055/a-2214-8101. eCollection 2024 Jan. TH Open. 2024. PMID: 38298199 Free PMC article.
-
Data Integration Challenges for Machine Learning in Precision Medicine.Front Med (Lausanne). 2022 Jan 25;8:784455. doi: 10.3389/fmed.2021.784455. eCollection 2021. Front Med (Lausanne). 2022. PMID: 35145977 Free PMC article. Review.
-
Using electronic health records to streamline provider recruitment for implementation science studies.PLoS One. 2022 May 13;17(5):e0267915. doi: 10.1371/journal.pone.0267915. eCollection 2022. PLoS One. 2022. PMID: 35560153 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
Medical