Opportunities, Pitfalls, and Alternatives in Adapting Electronic Health Records for Health Services Research
- PMID: 32969760
- PMCID: PMC7878193
- DOI: 10.1177/0272989X20954403
Opportunities, Pitfalls, and Alternatives in Adapting Electronic Health Records for Health Services Research
Erratum in
-
CORRIGENDUM to "Opportunities, Pitfalls, and Alternatives in Adapting Electronic Health Records for Health Services Research".Med Decis Making. 2022 Jan;42(1):135. doi: 10.1177/0272989X20978126. Epub 2020 Dec 22. Med Decis Making. 2022. PMID: 33349131 No abstract available.
Abstract
Electronic health records (EHRs) offer the potential to study large numbers of patients but are designed for clinical practice, not research. Despite the increasing availability of EHR data, their use in research comes with its own set of challenges. In this article, we describe some important considerations and potential solutions for commonly encountered problems when working with large-scale, EHR-derived data for health services and community-relevant health research. Specifically, using EHR data requires the researcher to define the relevant patient subpopulation, reliably identify the primary care provider, recognize the EHR as containing episodic (i.e., unstructured longitudinal) data, account for changes in health system composition and treatment options over time, understand that the EHR is not always well-organized and accurate, design methods to identify the same patient across multiple health systems, account for the enormous size of the EHR, and consider barriers to data access. Associations found in the EHR may be nonrepresentative of associations in the general population, but a clear understanding of the EHR-based associations can be enormously valuable to the process of improving outcomes for patients in learning health care systems. In the context of building 2 large-scale EHR-derived data sets for health services research, we describe the potential pitfalls of EHR data and propose some solutions for those planning to use EHR data in their research. As ever greater amounts of clinical data are amassed in the EHR, use of these data for research will become increasingly common and important. Attention to the intricacies of EHR data will allow for more informed analysis and interpretation of results from EHR-based data sets.
Keywords: data science; electronic health records; health services research; registries.
Conflict of interest statement
Figures

Comment in
-
Electronic Health Records: The Signal and the Noise.Med Decis Making. 2021 Feb;41(2):103-106. doi: 10.1177/0272989X20985764. Med Decis Making. 2021. PMID: 33563112 No abstract available.
Similar articles
-
Adult patient access to electronic health records.Cochrane Database Syst Rev. 2021 Feb 26;2(2):CD012707. doi: 10.1002/14651858.CD012707.pub2. Cochrane Database Syst Rev. 2021. PMID: 33634854 Free PMC article.
-
Challenges in patient safety improvement research in the era of electronic health records.Healthc (Amst). 2016 Dec;4(4):285-290. doi: 10.1016/j.hjdsi.2016.06.005. Epub 2016 Jul 26. Healthc (Amst). 2016. PMID: 27473472
-
Electronic Health Record Challenges, Workarounds, and Solutions Observed in Practices Integrating Behavioral Health and Primary Care.J Am Board Fam Med. 2015 Sep-Oct;28 Suppl 1(Suppl 1):S63-72. doi: 10.3122/jabfm.2015.S1.150133. J Am Board Fam Med. 2015. PMID: 26359473 Free PMC article.
-
The future of Cochrane Neonatal.Early Hum Dev. 2020 Nov;150:105191. doi: 10.1016/j.earlhumdev.2020.105191. Epub 2020 Sep 12. Early Hum Dev. 2020. PMID: 33036834
-
Use of Epic Electronic Health Record System for Health Care Research: Scoping Review.J Med Internet Res. 2023 Dec 15;25:e51003. doi: 10.2196/51003. J Med Internet Res. 2023. PMID: 38100185 Free PMC article.
Cited by
-
Data gaps and opportunities for modeling cancer health equity.J Natl Cancer Inst Monogr. 2023 Nov 8;2023(62):246-254. doi: 10.1093/jncimonographs/lgad025. J Natl Cancer Inst Monogr. 2023. PMID: 37947335 Free PMC article.
-
Receipt of Targeted Therapy and Survival Outcomes in Patients With Metastatic Colorectal Cancer.JAMA Netw Open. 2023 Jan 3;6(1):e2250030. doi: 10.1001/jamanetworkopen.2022.50030. JAMA Netw Open. 2023. PMID: 36656585 Free PMC article.
-
Challenges and Opportunities for Data Science in Women's Health.Annu Rev Biomed Data Sci. 2023 Aug 10;6:23-45. doi: 10.1146/annurev-biodatasci-020722-105958. Epub 2023 Apr 11. Annu Rev Biomed Data Sci. 2023. PMID: 37040736 Free PMC article. Review.
-
Patient and clinician acceptability of automated extraction of social drivers of health from clinical notes in primary care.J Am Med Inform Assoc. 2025 May 1;32(5):855-865. doi: 10.1093/jamia/ocaf046. J Am Med Inform Assoc. 2025. PMID: 40085013
-
Automated Extraction of Mortality Information From Publicly Available Sources Using Large Language Models: Development and Evaluation Study.J Med Internet Res. 2025 Aug 18;27:e71113. doi: 10.2196/71113. J Med Internet Res. 2025. PMID: 40824124 Free PMC article.
References
-
- What are the advantages of electronic health records? https://www.healthit.gov/providers-professionals/faqs/what-are-advantage.... Accessed July 19, 2018.
-
- Surveys and Data Collection Systems. https://www.cdc.gov/nchs/surveys.htm. Accessed July 19, 2018.
-
- CDC WONDER. https://www.cdc.gov/nchs/surveys.htm. Accessed July 19, 2018.
-
- Health and Retirement Study. https://hrs.isr.umich.edu/data-products. Accessed February 25, 2019.
-
- Behavioral Risk Factor Surveillance System. https://www.cdc.gov/brfss/index.html. Accessed February 25, 2019.
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Research Materials