Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Jan 26:10:e1835.
doi: 10.7717/peerj-cs.1835. eCollection 2024.

A domain-specific language for managing ETL processes

Affiliations

A domain-specific language for managing ETL processes

Aleksandar Popović et al. PeerJ Comput Sci. .

Abstract

Maintenance of Data Warehouse (DW) systems is a critical task because any downtime or data loss can have significant consequences on business applications. Existing DW maintenance solutions mostly rely on concrete technologies and tools that are dependent on: the platform on which the DW system was created; the specific data extraction, transformation, and loading (ETL) tool; and the database language the DW uses. Different languages for different versions of DW systems make organizing DW processes difficult, as minimal changes in the structure require major changes in the application code for managing ETL processes. This article proposes a domain-specific language (DSL) for ETL process management that mitigates these problems by centralizing all program logic, making it independent from a particular platform. This approach would simplify DW system maintenance. The platform-independent language proposed in this article also provides an easier way to create a unified environment to control DW processes, regardless of the language, environment, or ETL tool the DW uses.

Keywords: Data warehouse; Domain-specific language; Extraction transformation and loading; Model-driven development; Platform-independent models.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no competing interests. Nikola Trajković is employed by Softwecs LTD, and he declares no competing interests.

Figures

Figure 1
Figure 1. ECore meta-model of the ETLCL language.
Figure 2
Figure 2. Scheduling service execution.
Figure 3
Figure 3. Diagram of the manager service execution.

References

    1. Albrecht A, Naumann F. Systematic ETL management—experiences with high-level operators. Proceedings of the 18th International Conference on Information Quality; 2013. p. 9.
    1. Atigui F, Ravat F, Teste O, Zurfluh G. Using OCL for automatically producing multidimensional models and ETL processes. In: Cuzzocrea A, Dayal U, editors. Data Warehousing and Knowledge Discovery. DaWaK 2012. Lecture Notes in Computer Science. Berlin, Heidelberg: Springer; 2012.
    1. Awiti J, Vaisman A, Zimanyi E. Design and implementation of ETL processes using BPMN and relational algebra. Data & Knowledge Engineering. 2020;129(13):101837. doi: 10.1016/j.datak.2020.101837. - DOI
    1. Biswas N, Chattopadhyay S, Mahapatra G, Chatterjee S, Mondal DK. A new approach for conceptual extraction-transformation-loading process modeling. International Journal of Ambient Computing and Intelligence. 2019;30(1):45. doi: 10.4018/IJACI.2019010102. - DOI
    1. Deme A, Buchmann R. A technology-specific modeling method for data ETL processes. AMCIS 2021 Proceedings.2021.

LinkOut - more resources