Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Jul 30:13:e68171.
doi: 10.2196/68171.

Leveraging Interoperable Electronic Health Record (EHR) Data for Distributed Analyses in Clinical Research: Technical Implementation Report of the HELP Study

Affiliations

Leveraging Interoperable Electronic Health Record (EHR) Data for Distributed Analyses in Clinical Research: Technical Implementation Report of the HELP Study

Julia Palm et al. JMIR Med Inform. .

Abstract

Background: The Medical Informatics Initiative (MII) Germany established 38 data integration centers (DIC) in university hospitals to improve health care and biomedical research through the use of electronic health record (EHR) data. To showcase the value of these DIC, the HELP (Hospital-wide Electronic Medical Record Evaluated Computerized Decision Support System to Improve Outcomes of Patients with Staphylococcal Bloodstream Infection) study was initiated as a use case. This study is a clinical trial designed to assess the impact of a computerized decision support system for managing staphylococcal bacteremia.

Objective: In this paper, we present the lessons learned during the use case from a technical perspective. This paper outlines the challenges encountered and solutions developed during our initial implementation of this infrastructure, providing insights applicable to other research platforms using EHR data. These insights are organized into 3 key areas: study-specific data definition and modeling, interoperable data integration and transformation, and distributed data extraction and analysis.

Methods: An interdisciplinary team of clinicians, computer scientists, and statisticians created a catalog of items to identify data elements necessary for the study's evaluation and developed a domain-specific information model. DIC developed extract-transform-load pipelines to collect the disparate, site-specific EHR data and to transform it into a common data format. Health Level Seven International (HL7) Fast Healthcare Interoperability Resources (FHIR) and the MII's core dataset profiles were adopted for consistent data representation across sites. Additionally, data not present in EHRs was gathered using structured electronic case report forms. Analysis scripts were then distributed to the sites to preprocess the data locally, followed by a central analysis of the preprocessed data to generate the final overall results.

Our analysis revealed significant heterogeneity in data quality and implementation of interoperability standards, requiring substantial harmonization efforts. The development of analysis scripts and data extraction processes demanded multiple iterative cycles and close collaboration with local data experts. Despite these challenges, the successful implementation demonstrated the feasibility of distributed EHR analyses while highlighting the importance of thorough data quality assessment, realistic timeline planning, and multidisciplinary expertise.

Conclusions: The HELP study highlights challenges and opportunities in leveraging EHR data for clinical research, particularly in the absence of mandatory data standards and resource-intensive data harmonization efforts. Despite limitations in data availability and quality, progress in digitization and interoperability frameworks offers hope for future improvements. Lessons learned from this study can inform the development of standardized methodologies and infrastructures for sustainable EHR data integration in research.

Keywords: clinical decision support system; data collection methods; electronic health records; health information interoperability; software design.

PubMed Disclaimer

Conflict of interest statement

Conflicts of Interest: None declared.

Figures

Figure 1.
Figure 1.. Overall data flow in the HELP (Hospital-wide Electronic Medical Record Evaluated Computerized Decision Support System to Improve Outcomes of Patients with Staphylococcal Bloodstream Infection) study, based on interoperable formats created in data integration centers (DIC). EHR: electronic health record; eCRF: electronic case report form; FHIR: Fast Healthcare Interoperability Resources; MII CDS: core dataset of the Medical Informatics Initiative.
Figure 2.
Figure 2.. Illustration of modules of the Medical Informatics Initiative (MII) core dataset. The HELP (Hospital-wide Electronic Medical Record Evaluated Computerized Decision Support System to Improve Outcomes of Patients with Staphylococcal Bloodstream Infection) study used the basic modules “person,” “treatment case,” “diagnosis,” “procedures,” “lab results,” “medication” [15], and the extension module “microbiology” [16].
Figure 3.
Figure 3.. One table was created per resource type; afterwards, tables were joined based on references.
Figure 4.
Figure 4.. Examples for units of measurement for alanine aminotransferase (ALT) and aspartate aminotransferase (AST) in German laboratory results, captured from a PDF.
Figure 5.
Figure 5.. Two exemplary FHIR (Fast Healthcare Interoperability Resources) resources, illustrating potential differences despite adhering to the same FHIR profile. Elements that exist in both resources but contain different codes are highlighted in yellow, while elements present in only 1 resource are marked in red. The resources have been shortened and anonymized.
Figure 6.
Figure 6.. Iterative data extraction and analysis process in the HELP (Hospital-wide Electronic Medical Record Evaluated Computerized Decision Support System to Improve Outcomes of Patients with Staphylococcal Bloodstream Infection) study. FHIR: Fast Healthcare Interoperability Resources; DIC: data integration center; ADT: admission, discharge, and transfer data; LIS: laboratory information system; eCRF: electronic case report form; ETL: extract-transform-load.

Similar articles

References

    1. Albashiti F, Thasler R, Wendt T, Bathelt F, Reinecke I, Schreiweis B. Data integration centers-from a concept in the Medical Informatics Initiative to its local implementation in the Network of University Medicine. Bundesgesundheitsblatt Gesundheitsforschung Gesundheitsschutz. 2024 Jun;67(6):629–636. doi: 10.1007/s00103-024-03879-5. doi. Medline. - DOI - PMC - PubMed
    1. Gehring S, Eulenfeld R. German Medical Informatics Initiative: unlocking data for research and health care. Methods Inf Med. 2018 Jul;57(S 01):e46–e49. doi: 10.3414/ME18-13-0001. doi. Medline. - DOI - PMC - PubMed
    1. Hagel S, Gantner J, Spreckelsen C, et al. Hospital-wide ELectronic medical record evaluated computerised decision support system to improve outcomes of Patients with staphylococcal bloodstream infection (HELP): study protocol for a multicentre stepped-wedge cluster randomised trial. BMJ Open. 2020 Feb 10;10(2):e033391. doi: 10.1136/bmjopen-2019-033391. doi. Medline. - DOI - PMC - PubMed
    1. Pletz M, Hagel S, Kimmig A, et al. Zenodo; 2024. HELP CDSS v. 1.0.SMITH - smart medical information technology for healthcare.https://zenodo.org/records/10704513 URL. doi. - DOI
    1. van Berkel N, Clarkson MJ, Xiao G, et al. Dimensions of ecological validity for usability evaluations in clinical settings. J Biomed Inform. 2020 Oct;110:103553. doi: 10.1016/j.jbi.2020.103553. doi. Medline. - DOI - PubMed

LinkOut - more resources