Approaches for combining primary care electronic health record data from multiple sources: a systematic review of observational studies
- PMID: 33055114
- PMCID: PMC7559041
- DOI: 10.1136/bmjopen-2020-037405
Approaches for combining primary care electronic health record data from multiple sources: a systematic review of observational studies
Abstract
Objective: To identify observational studies which used data from more than one primary care electronic health record (EHR) database, and summarise key characteristics including: objective and rationale for using multiple data sources; methods used to manage, analyse and (where applicable) combine data; and approaches used to assess and report heterogeneity between data sources.
Design: A systematic review of published studies.
Data sources: Pubmed and Embase databases were searched using list of named primary care EHR databases; supplementary hand searches of reference list of studies were retained after initial screening.
Study selection: Observational studies published between January 2000 and May 2018 were selected, which included at least two different primary care EHR databases.
Results: 6054 studies were identified from database and hand searches, and 109 were included in the final review, the majority published between 2014 and 2018. Included studies used 38 different primary care EHR data sources. Forty-seven studies (44%) were descriptive or methodological. Of 62 analytical studies, 22 (36%) presented separate results from each database, with no attempt to combine them; 29 (48%) combined individual patient data in a one-stage meta-analysis and 21 (34%) combined estimates from each database using two-stage meta-analysis. Discussion and exploration of heterogeneity was inconsistent across studies.
Conclusions: Comparing patterns and trends in different populations, or in different primary care EHR databases from the same populations, is important and a common objective for multi-database studies. When combining results from several databases using meta-analysis, provision of separate results from each database is helpful for interpretation. We found that these were often missing, particularly for studies using one-stage approaches, which also often lacked details of any statistical adjustment for heterogeneity and/or clustering. For two-stage meta-analysis, a clear rationale should be provided for choice of fixed effect and/or random effects or other models.
Keywords: epidemiology; public health; statistics & research methods.
© Author(s) (or their employer(s)) 2020. Re-use permitted under CC BY. Published by BMJ.
Conflict of interest statement
Competing interests: None declared.
Figures


Similar articles
-
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217. Cochrane Database Syst Rev. 2022. PMID: 36321557 Free PMC article.
-
Beyond the black stump: rapid reviews of health research issues affecting regional, rural and remote Australia.Med J Aust. 2020 Dec;213 Suppl 11:S3-S32.e1. doi: 10.5694/mja2.50881. Med J Aust. 2020. PMID: 33314144
-
A method for cohort selection of cardiovascular disease records from an electronic health record system.Int J Med Inform. 2017 Jun;102:138-149. doi: 10.1016/j.ijmedinf.2017.03.015. Epub 2017 Mar 30. Int J Med Inform. 2017. PMID: 28495342
-
Characterization and selection of Japanese electronic health record databases used as data sources for non-interventional observational studies.BMC Med Inform Decis Mak. 2021 May 22;21(1):167. doi: 10.1186/s12911-021-01526-6. BMC Med Inform Decis Mak. 2021. PMID: 34022876 Free PMC article.
-
Behavioural modification interventions for medically unexplained symptoms in primary care: systematic reviews and economic evaluation.Health Technol Assess. 2020 Sep;24(46):1-490. doi: 10.3310/hta24460. Health Technol Assess. 2020. PMID: 32975190 Free PMC article.
Cited by
-
Pooling of primary care electronic health record (EHR) data on Huntington's disease (HD) and cancer: establishing comparability of two large UK databases.BMJ Open. 2024 Feb 14;14(2):e070258. doi: 10.1136/bmjopen-2022-070258. BMJ Open. 2024. PMID: 38355188 Free PMC article.
References
-
- Strom BL. Overview of automated databases in pharmacoepidemiology : Textbook of pharmacoepidemiology. Chichester, UK: John Wiley & Sons Ltd, 2013: 118–22.
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources