Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 May 10;5(1):3.
doi: 10.13063/2327-9214.1277.

DQe-v: A Database-Agnostic Framework for Exploring Variability in Electronic Health Record Data Across Time and Site Location

Affiliations

DQe-v: A Database-Agnostic Framework for Exploring Variability in Electronic Health Record Data Across Time and Site Location

Hossein Estiri et al. EGEMS (Wash DC). .

Abstract

Data variability is a commonly observed phenomenon in Electronic Health Records (EHR) data networks. A common question asked in scientific investigations of EHR data is whether the cross-site and -time variability reflects an underlying data quality error at one or more contributing sites versus actual differences driven by various idiosyncrasies in the healthcare settings. Although research analysts and data scientists have commonly used various statistical methods to detect and account for variability in analytic datasets, self service tools to facilitate exploring cross-organizational variability in EHR data warehouses are lacking and could benefit from meaningful data visualizations. DQe-v, an interactive, database-agnostic tool for visually exploring variability in EHR data provides such a solution. DQe-v is built on an open source platform, R statistical software, with annotated scripts and a readme document that makes it fully reproducible. To illustrate and describe functionality of DQe-v, we describe the DQe-v's readme document which includes a complete guide to installation, running the program, and interpretation of the outputs. We also provide annotated R scripts and an example dataset as supplemental materials. DQe-v offers a self service tool to visually explore data variability within EHR datasets irrespective of the data model. GitHub and CIELO offer hosting and distribution of the tool and can facilitate collaboration across any interested community of users as we target improving usability, efficiency, and interoperability.

Keywords: Data Quality; Data Variability; Data Warehouse; Electronic Health Records.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Workflow for DQe-v
Figure 2
Figure 2
The Variability Preview Tab Previewing Diagnoses Per Visit Per Year
Figure 3
Figure 3
The Exploratory Analysis Tab’s Plots Help Identify Years and Site Locations with High Variability in the Number of Diagnoses Per Visit Note: The yellow box highlights a period with relative high variability.
Figure 4
Figure 4
Visualization of Probability Density Functions in Density Plots Tab for Number of Creatinine Labs Per Patients with a CKD Diagnosis and 1+ Visits Note: Colors signify time units.
Figure 5
Figure 5
Outputs of the Regression-Based Analysis on Hemoglobin A1c Labs Per Patient with a Diabetes Diagnosis and 1+ Visits. (Site-location Names Are Obstructed.)

Similar articles

Cited by

References

    1. Institute of Medicine. Capturing Social and Behavioral Domains and Measures in Electronic Health Records: Phase 2. Washington (DC): The National Academies Press; 2014. - PubMed
    1. Arts DGT, De Keizer NF, Scheffer G-J. Defining and improving data quality in medical registries: a literature review, case study, and generic framework. J Am Med Inform Assoc. 2002;9:600–611. doi:10.1197/jamia.M1087. - DOI - PMC - PubMed
    1. Hsiao C-J, Hing E, Socey TC, Cai B. Electronic health record systems and intent to apply for meaningful use incentives among office-based physician practices: United States, 20012011. NCHS Data Brief. 2011:1–8. http://www.ncbi.nlm.nih.gov6/pubmed/22617322. - PubMed
    1. Patel V, Jamoom E, Hsiao CJ, Furukawa MF, Buntin M. Variation in electronic health record adoption and readiness for meaningful use: 2008-2011. J Gen Intern Med. 2013;28:957–964. doi:10.1007/s11606-012-2324-x. - DOI - PMC - PubMed
    1. Grinspan ZM, Banerjee S, Kaushal R, Kern LM. Physician Specialty and Variations in Adoption of Electronic Health Records. Appl Clin Inform. 2013;4:225–240. doi:10.4338/ACI-2013-02-RA-0015. - DOI - PMC - PubMed

LinkOut - more resources