Stud Health Technol Inform. 2025 May 15:327:1438-1442.
doi: 10.3233/SHTI250640.

Metadata-Driven Approach to Generalisation of Transformations in ETL Processes

Sara Bachir et al. Stud Health Technol Inform.

Abstract

Introduction: The secondary use of clinical data is becoming increasingly important, and a large number of ETL (extract, transform, load) routes for data integration are implemented for specific purposes. A metadata-driven approach allows ETL processes to be generalized so that existing implementations can be reused.

Methods: The metadata stored in the MDR, which describe different attributes of the data, are used for this purpose and are based on ISO 21526. Governance and provenance data, based on the W3C PROV model, were also taken into account to record the history of the data throughout its lifecycle.
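
To illustrate how provenance could be recorded for one ETL step, the following is a minimal sketch, not the authors' implementation. It uses plain dictionaries whose keys borrow the W3C PROV concepts of entity, activity, and agent and the relations used, wasGeneratedBy, wasDerivedFrom, and wasAssociatedWith. The function name `provenance_record` and all identifiers such as `src:lab_result/1` are hypothetical, and the layout is a simplification rather than an official PROV serialization.

```python
from datetime import datetime, timezone
from typing import Any, Dict


def provenance_record(source_id: str, target_id: str,
                      activity_id: str, agent_id: str) -> Dict[str, Any]:
    """Describe one ETL step using W3C PROV concepts (simplified layout)."""
    ended = datetime.now(timezone.utc).isoformat()
    return {
        # The things involved: source and target data items.
        "entity": {source_id: {}, target_id: {}},
        # The ETL run that turned one into the other.
        "activity": {activity_id: {"endTime": ended}},
        # Who or what was responsible for the run.
        "agent": {agent_id: {}},
        # Relations between them, named after the PROV relations.
        "used": [{"activity": activity_id, "entity": source_id}],
        "wasGeneratedBy": [{"entity": target_id, "activity": activity_id}],
        "wasDerivedFrom": [{"generatedEntity": target_id,
                            "usedEntity": source_id}],
        "wasAssociatedWith": [{"activity": activity_id, "agent": agent_id}],
    }


# Hypothetical identifiers, for illustration only.
record = provenance_record(
    "src:lab_result/1", "tgt:measurement/1", "etl:run/1", "agent:etl_service"
)
```

Storing one such record per transformed element would allow the origin of a target value to be traced back through its `wasDerivedFrom` relation.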

Results: To achieve a metadata-driven approach, the data structures of the source and target systems are represented in an MDR. Afterwards, relations between elements have to be defined together with their respective transformation rules. This information is used by a generic ETL implementation, so that use-case-specific content is outsourced to the MDR.
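
The idea of a generic ETL implementation that reads its use-case-specific content from metadata can be sketched as follows. This is a hedged illustration, not the authors' implementation: the rule names (`copy`, `map_code`, `scale`), the element names, and the in-memory `MAPPINGS` list standing in for content retrieved from an MDR are all assumptions.

```python
from typing import Any, Callable, Dict, List

# Hypothetical registry of reusable transformation rules, looked up by name.
RULES: Dict[str, Callable[[Any, Dict[str, Any]], Any]] = {
    "copy": lambda value, params: value,
    "map_code": lambda value, params: params["codes"].get(
        value, params.get("default")
    ),
    "scale": lambda value, params: value * params["factor"],
}

# Hypothetical stand-in for MDR content: each entry relates a source
# element to a target element and names the transformation rule to apply.
MAPPINGS: List[Dict[str, Any]] = [
    {"source": "pat_id", "target": "person_id", "rule": "copy", "params": {}},
    {"source": "sex", "target": "gender", "rule": "map_code",
     "params": {"codes": {"m": "MALE", "f": "FEMALE"}, "default": "UNKNOWN"}},
    {"source": "height_m", "target": "height_cm", "rule": "scale",
     "params": {"factor": 100}},
]


def transform(record: Dict[str, Any],
              mappings: List[Dict[str, Any]]) -> Dict[str, Any]:
    """Generic transform step: contains no use-case-specific logic."""
    target: Dict[str, Any] = {}
    for mapping in mappings:
        rule = RULES[mapping["rule"]]
        target[mapping["target"]] = rule(
            record[mapping["source"]], mapping["params"]
        )
    return target


source_record = {"pat_id": 17, "sex": "f", "height_m": 1.5}
result = transform(source_record, MAPPINGS)
```

In this sketch, supporting a new source or target system means changing only the mapping entries; the `transform` function stays the same.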

Discussion: A rule-based approach to the MDD ETL implementation allows the extract and load phases to be generalized; however, the transformation process has to be standardized further. Moreover, a user-friendly interface is essential so that domain experts without technical skills can contribute their expertise.

Keywords: Data Integration; ETL process; MDR; Metadata-driven approach; OMOP CDM; metadata; rule-based approach; secondary use.
