Electronic Health Records Data and Metadata: Challenges for Big Data in the United States
- PMID: 27447257
- DOI: 10.1089/big.2013.0023
Abstract
This article, written by researchers studying metadata and standards, represents a fresh perspective on the challenges of electronic health records (EHRs) and serves as a primer for big data researchers new to health-related issues. Primarily, we argue for the importance of the systematic adoption of standards in EHR data and metadata as a way of promoting big data research and benefiting patients. EHRs have the potential to include a vast amount of longitudinal health data, and metadata provides the formal structures to govern that data. In the United States, electronic medical records (EMRs) are part of the larger EHR. EHR data is submitted by a variety of clinical data providers and potentially by the patients themselves. Because data input practices are not necessarily standardized, and because of the multiplicity of current standards, basic interoperability in EHRs is hindered. Some of the issues with EHR interoperability stem from the complexities of the data they include, which can be both structured and unstructured. A number of controlled vocabularies are available to data providers. The continuity of care document standard will provide interoperability in the United States between the EMR and the larger EHR, potentially making data input by providers directly available to other providers. The data involved is nonetheless messy. In particular, the use of competing vocabularies such as the Systematized Nomenclature of Medicine-Clinical Terms, MEDCIN, and locally created vocabularies inhibits large-scale interoperability for structured portions of the records, and unstructured portions, although potentially not machine readable, remain essential. Once EMRs for patients are brought together as EHRs, the EHRs must be managed and stored. Adequate documentation should be created and maintained to assure the secure and accurate use of EHR data. There are currently a few notable international standards initiatives for EHRs. 
Organizations such as Health Level Seven International and the Clinical Data Interchange Standards Consortium are developing and overseeing implementation of interoperability standards. Denmark and Singapore are two countries that have successfully implemented national EHR systems. Future work in electronic health information initiatives should underscore the importance of standards and reinforce interoperability of EHRs for big data research and for the sake of patients.