Building a Lung and Ovarian Cancer Data Warehouse
- PMID: 33190464
- PMCID: PMC7674817
- DOI: 10.4258/hir.2020.26.4.303
Building a Lung and Ovarian Cancer Data Warehouse
Abstract
Objectives: Despite the collection of vast amounts of data by the healthcare sector, effective decision-making in medical practice is still challenging. Data warehousing technology can be applied for the collection and management of clinical data from various sources to provide meaningful insights for physicians and administrators. Cancer data are extremely complicated and massive; hence, a clinical data warehouse system can provide insights into prevention, diagnosis and treatment processes through the use of online analytical processing tools for the analysis of multi-dimensional data at different granularity levels.
Methods: In this study, a clinical data warehouse was developed for lung cancer data, which were kindly provided by the United States National Cancer Institute. Lung and ovarian cancer data were imported in specific formats and cleaned to remove errors and redundancies. SQL server integration services (SSIS) were used for the extract-transform-load (ETL) process.
Results: The design of the clinical data warehouse responds efficiently to all types of queries by adopting the fact constellation schema model. Various online analytical processing queries can be expressed using the proposed approach.
Conclusions: This model succeeded in responding to complex queries, and the analysis of data is facilitated by using online analytical processing cubes and viewing multilevel data details.
Keywords: Data Analytics; Data Warehousing; Lung Cancer; Ovarian Cancer.
Conflict of interest statement
No potential conflict of interest relevant to this article was reported.
Figures








References
-
- Garani G, Atay CE. Encountering incomplete temporal information in clinical data warehouses. Int J Appl Res Public Health Manag. 2020;5(1):32–48.
-
- Kallmeyer V, Venkat K. Beyond e-health: health and information technology converge. Siliconindia. 2002;6(4):42.
-
- The Global Cancer Observatory [Internet] Lyon, France: International Agency for Research on Cancer; c2020. [cited at 2020 Sep 10]. Available from: https://gco.iarc.fr/
-
- Ferlay J, Parkin DM, Steliarova-Foucher E. Estimates of cancer incidence and mortality in Europe in 2008. Eur J Cancer. 2010;46(4):765–81. - PubMed
-
- Miele S, Shockley R. Analytics: the real-world use of big data. Somers (NY): IBM Global Business Services; 2013.
LinkOut - more resources
Full Text Sources