Large-Scale Integration of DICOM Metadata into HL7-FHIR for Medical Research
- PMID: 40233823
- PMCID: PMC12133321
- DOI: 10.1055/a-2521-4250
Large-Scale Integration of DICOM Metadata into HL7-FHIR for Medical Research
Abstract
Background: The current gap between the availability of routine imaging data and its provisioning for medical research hinders the utilization of radiological information for secondary purposes. To address this, the German Medical Informatics Initiative (MII) has established frameworks for harmonizing and integrating clinical data across institutions, including the integration of imaging data into research repositories, which can be expanded to routine imaging data.
Objectives: This project aims to address this gap by developing a large-scale data processing pipeline to extract, convert, and pseudonymize DICOM (Digital Imaging and Communications in Medicine) metadata into "ImagingStudy" Fast Healthcare Interoperability Resources (FHIR) and integrate them into research repositories for secondary use.
Methods: The data processing pipeline was developed, implemented, and tested at the Data Integration Center of the University Hospital Erlangen. It leverages existing open-source solutions and integrates seamlessly into the hospital's research IT infrastructure. The pipeline automates the extraction, conversion, and pseudonymization processes, ensuring compliance with both local and MII data protection standards. A large-scale evaluation was conducted using the imaging studies acquired by two departments at University Hospital Erlangen within 1 year. Attributes such as modality, examined body region, laterality, and the number of series and instances were analyzed to assess the quality and availability of the metadata.
Results: Once established, the pipeline processed a substantial dataset comprising over 150,000 DICOM studies within an operational period of 26 days. Data analysis revealed significant heterogeneity and incompleteness in certain attributes, particularly the DICOM tag "Body Part Examined." Despite these challenges, the pipeline successfully generated valid and standardized FHIR, providing a robust basis for future research.
Conclusion: We demonstrated the setup and test of a large-scale end-to-end data processing pipeline that transforms DICOM imaging metadata directly from clinical routine into the Health Level 7-FHIR format, pseudonymizes the resources, and stores them in an FHIR server. We showcased that the derived FHIRs offer numerous research opportunities, for example, feasibility assessments within Bavarian and Germany-wide research infrastructures. Insights from this study highlight the need to extend the "ImagingStudy" FHIR with additional attributes and refine their use within the German MII.
The Author(s). This is an open access article published by Thieme under the terms of the Creative Commons Attribution License, permitting unrestricted use, distribution, and reproduction so long as the original work is properly cited. (https://creativecommons.org/licenses/by/4.0/).
Conflict of interest statement
None declared.
Figures

Similar articles
-
HL7 FHIR in Health Research: A FHIR Specification for Metadata in Clinical, Epidemiological, and Public Health Studies.Stud Health Technol Inform. 2024 Aug 22;316:1960-1961. doi: 10.3233/SHTI240817. Stud Health Technol Inform. 2024. PMID: 39176876
-
The Architecture of a Feasibility Query Portal for Distributed COVID-19 Fast Healthcare Interoperability Resources (FHIR) Patient Data Repositories: Design and Implementation Study.JMIR Med Inform. 2022 May 25;10(5):e36709. doi: 10.2196/36709. JMIR Med Inform. 2022. PMID: 35486893 Free PMC article.
-
Exchange of Quantitative Computed Tomography Assessed Body Composition Data Using Fast Healthcare Interoperability Resources as a Necessary Step Toward Interoperable Integration of Opportunistic Screening Into Clinical Practice: Methodological Development Study.J Med Internet Res. 2025 May 21;27:e68750. doi: 10.2196/68750. J Med Internet Res. 2025. PMID: 40397929 Free PMC article.
-
FHIR - Overdue Standard for Radiology Data Warehouses.Rofo. 2025 May;197(5):518-525. doi: 10.1055/a-2462-2351. Epub 2024 Dec 6. Rofo. 2025. PMID: 39642924 Review. English, German.
-
Learning HL7 FHIR Using the HAPI FHIR Server and Its Use in Medical Imaging with the SIIM Dataset.J Digit Imaging. 2018 Jun;31(3):334-340. doi: 10.1007/s10278-018-0090-y. J Digit Imaging. 2018. PMID: 29725959 Free PMC article. Review.
Cited by
-
From data-driven cities to data-driven tumors: dynamic digital twins for adaptive oncology.Front Artif Intell. 2025 Jul 25;8:1624877. doi: 10.3389/frai.2025.1624877. eCollection 2025. Front Artif Intell. 2025. PMID: 40785836 Free PMC article. No abstract available.
References
-
- Arvanitis T N. Informatics opportunities and challenges in medical imaging: a journey. Stud Health Technol Inform. 2022;300:19–29. - PubMed
-
- The National Electrical Manufacturers Association (NEMA); 2024. Digital Imaging and Communications in Medicine (DICOM) - PS 3.1–PS 3.22.
-
- FHIR Release 4 (R4)Health Level Seven International;2024
MeSH terms
LinkOut - more resources
Full Text Sources
Miscellaneous