Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Jan;16(1):137-144.
doi: 10.1055/a-2441-3677. Epub 2025 Feb 12.

Discrepancies in Aggregate Patient Data between Two Sources with Data Originating from the Same Electronic Health Record: A Case Study

Affiliations

Discrepancies in Aggregate Patient Data between Two Sources with Data Originating from the Same Electronic Health Record: A Case Study

Allen J Yiu et al. Appl Clin Inform. 2025 Jan.

Abstract

Background: Data exploration in modern electronic health records (EHRs) is often aided by user-friendly graphical interfaces providing "self-service" tools for end users to extract data for quality improvement, patient safety, and research without prerequisite training in database querying. Other resources within the same institution, such as Honest Brokers, may extract data sourced from the same EHR but obtain different results leading to questions of data completeness and correctness.

Objectives: Our objectives were to (1) examine the differences in aggregate output generated by a "self-service" graphical interface data extraction tool and our institution's clinical data warehouse (CDW), sourced from the same database, and (2) examine the causative factors that may have contributed to these differences.

Methods: Aggregate demographic data of patients who received influenza vaccines at three static clinics and three drive-through clinics in similar locations between August 2020 and December 2020 was extracted separately from our institution's EHR data exploration tool and our CDW by our organization's Honest Brokers System. We reviewed the aggregate outputs, sliced by demographics and vaccination sites, to determine potential differences between the two outputs. We examined the underlying data model, identifying the source of each database.

Results: We observed discrepancies in patient volumes between the two sources, with variations in demographic information, such as age, race, ethnicity, and primary language. These variations could potentially influence research outcomes and interpretations.

Conclusion: This case study underscores the need for a thorough examination of data quality and the implementation of comprehensive user education to ensure accurate data extraction and interpretation. Enhancing data standardization and validation processes is crucial for supporting reliable research and informed decision-making, particularly if demographic data may be used to support targeted efforts for a specific population in research or quality improvement initiatives.

PubMed Disclaimer

Conflict of interest statement

None declared.

Figures

Fig. 1
Fig. 1
Data model of OMOP and SlicerDicer showing a shared source for patient data with differing ETL rules. EHR, electronic health record; ETL, extract transform load; OMOP, Observational Medical Outcomes Partnership.
Fig. 2
Fig. 2
In SlicerDicer, “Department” sites are grouped into a higher level “Location.” The drive-through vaccination sites were listed under “Department” and the static immunization sites were listed under “Location.”

References

    1. Chishtie J, Sapiro N, Wiebe N et al.Use of Epic electronic health record system for health care research: scoping review. J Med Internet Res. 2023;25:e51003. - PMC - PubMed
    1. Saini V, Jaber T, Como J D et al.623. Exploring ‘Slicer Dicer’, an extraction tool in EPIC, for clinical and epidemiological analysis. Open Forum Infect Dis. 2021;8:S414–S415.
    1. Baughman D J, Jabbarpour Y, Westfall J M et al.Comparison of quality performance measures for patients receiving in-person vs telemedicine primary care in a large integrated health system. JAMA Netw Open. 2022;5(09):e2233267. - PMC - PubMed
    1. Bui R, Kasabali A, Dewan K. A retrospective analysis of COVID-19 tracheostomies: early versus late tracheostomy. Laryngoscope Investig Otolaryngol. 2023;8(05):1154–1158. - PMC - PubMed
    1. Shermon S, Fazio K M, Shim R, Abd-Elsayed A, Kim C H. Prescription trends in complex regional pain syndrome: a retrospective case-control study. Brain Sci. 2023;13(07):1012. - PMC - PubMed