A domain-specific language for managing ETL processes
- PMID: 38435592
- PMCID: PMC10909202
- DOI: 10.7717/peerj-cs.1835
A domain-specific language for managing ETL processes
Abstract
Maintenance of Data Warehouse (DW) systems is a critical task because any downtime or data loss can have significant consequences on business applications. Existing DW maintenance solutions mostly rely on concrete technologies and tools that are dependent on: the platform on which the DW system was created; the specific data extraction, transformation, and loading (ETL) tool; and the database language the DW uses. Different languages for different versions of DW systems make organizing DW processes difficult, as minimal changes in the structure require major changes in the application code for managing ETL processes. This article proposes a domain-specific language (DSL) for ETL process management that mitigates these problems by centralizing all program logic, making it independent from a particular platform. This approach would simplify DW system maintenance. The platform-independent language proposed in this article also provides an easier way to create a unified environment to control DW processes, regardless of the language, environment, or ETL tool the DW uses.
Keywords: Data warehouse; Domain-specific language; Extraction transformation and loading; Model-driven development; Platform-independent models.
© 2024 Popović et al.
Conflict of interest statement
The authors declare that they have no competing interests. Nikola Trajković is employed by Softwecs LTD, and he declares no competing interests.
Figures
References
-
- Albrecht A, Naumann F. Systematic ETL management—experiences with high-level operators. Proceedings of the 18th International Conference on Information Quality; 2013. p. 9.
-
- Atigui F, Ravat F, Teste O, Zurfluh G. Using OCL for automatically producing multidimensional models and ETL processes. In: Cuzzocrea A, Dayal U, editors. Data Warehousing and Knowledge Discovery. DaWaK 2012. Lecture Notes in Computer Science. Berlin, Heidelberg: Springer; 2012.
-
- Awiti J, Vaisman A, Zimanyi E. Design and implementation of ETL processes using BPMN and relational algebra. Data & Knowledge Engineering. 2020;129(13):101837. doi: 10.1016/j.datak.2020.101837. - DOI
-
- Biswas N, Chattopadhyay S, Mahapatra G, Chatterjee S, Mondal DK. A new approach for conceptual extraction-transformation-loading process modeling. International Journal of Ambient Computing and Intelligence. 2019;30(1):45. doi: 10.4018/IJACI.2019010102. - DOI
-
- Deme A, Buchmann R. A technology-specific modeling method for data ETL processes. AMCIS 2021 Proceedings.2021.
LinkOut - more resources
Full Text Sources