What You Need to Know Before Implementing a Clinical Research Data Warehouse: Comparative Review of Integrated Data Repositories in Health Care Institutions
- PMID: 32852280
- PMCID: PMC7484778
- DOI: 10.2196/17687
What You Need to Know Before Implementing a Clinical Research Data Warehouse: Comparative Review of Integrated Data Repositories in Health Care Institutions
Abstract
Background: Integrated data repositories (IDRs), also referred to as clinical data warehouses, are platforms used for the integration of several data sources through specialized analytical tools that facilitate data processing and analysis. IDRs offer several opportunities for clinical data reuse, and the number of institutions implementing an IDR has grown steadily in the past decade.
Objective: The architectural choices of major IDRs are highly diverse and determining their differences can be overwhelming. This review aims to explore the underlying models and common features of IDRs, provide a high-level overview for those entering the field, and propose a set of guiding principles for small- to medium-sized health institutions embarking on IDR implementation.
Methods: We reviewed manuscripts published in peer-reviewed scientific literature between 2008 and 2020, and selected those that specifically describe IDR architectures. Of 255 shortlisted articles, we found 34 articles describing 29 different architectures. The different IDRs were analyzed for common features and classified according to their data processing and integration solution choices.
Results: Despite common trends in the selection of standard terminologies and data models, the IDRs examined showed heterogeneity in the underlying architecture design. We identified 4 common architecture models that use different approaches for data processing and integration. These different approaches were driven by a variety of features such as data sources, whether the IDR was for a single institution or a collaborative project, the intended primary data user, and purpose (research-only or including clinical or operational decision making).
Conclusions: IDR implementations are diverse and complex undertakings, which benefit from being preceded by an evaluation of requirements and definition of scope in the early planning stage. Factors such as data source diversity and intended users of the IDR influence data flow and synchronization, both of which are crucial factors in IDR architecture planning.
Keywords: data aggregation; data analytics; data warehousing; database; health informatics; information storage and retrieval.
©Kristina K Gagalova, M Angelica Leon Elizalde, Elodie Portales-Casamar, Matthias Görges. Originally published in JMIR Formative Research (http://formative.jmir.org), 27.08.2020.
Conflict of interest statement
Conflicts of Interest: None declared.
Figures



Similar articles
-
Desiderata for healthcare integrated data repositories based on architectural comparison of three public repositories.AMIA Annu Symp Proc. 2013 Nov 16;2013:648-56. eCollection 2013. AMIA Annu Symp Proc. 2013. PMID: 24551366 Free PMC article.
-
Using patient lists to add value to integrated data repositories.J Biomed Inform. 2014 Dec;52:72-7. doi: 10.1016/j.jbi.2014.02.010. Epub 2014 Feb 15. J Biomed Inform. 2014. PMID: 24534444 Free PMC article.
-
SPIRIT: Systematic Planning of Intelligent Reuse of Integrated Clinical Routine Data. A Conceptual Best-practice Framework and Procedure Model.Methods Inf Med. 2016;55(2):114-24. doi: 10.3414/ME15-01-0045. Epub 2016 Jan 15. Methods Inf Med. 2016. PMID: 26769124
-
The Emerging Role of the Innate Immune Response in Idiosyncratic Drug Reactions.Pharmacol Rev. 2021 Jul;73(3):861-896. doi: 10.1124/pharmrev.120.000090. Pharmacol Rev. 2021. PMID: 34016669 Review.
-
Risk management frameworks for human health and environmental risks.J Toxicol Environ Health B Crit Rev. 2003 Nov-Dec;6(6):569-720. doi: 10.1080/10937400390208608. J Toxicol Environ Health B Crit Rev. 2003. PMID: 14698953 Review.
Cited by
-
Contemporary Databases in Real-world Studies Regarding the Diverse Health Care Systems of India, Thailand, and Taiwan: Protocol for a Scoping Review.JMIR Res Protoc. 2022 Dec 13;11(12):e43741. doi: 10.2196/43741. JMIR Res Protoc. 2022. PMID: 36512386 Free PMC article.
-
Correlation Aware Relevance-Based Semantic Index for Clinical Big Data Repository.J Imaging Inform Med. 2024 Oct;37(5):2597-2611. doi: 10.1007/s10278-024-01095-w. Epub 2024 Apr 23. J Imaging Inform Med. 2024. PMID: 38653911 Free PMC article.
-
Trends in Population-Based Studies: Molecular and Digital Epidemiology (Review).Sovrem Tekhnologii Med. 2022;14(4):60-70. doi: 10.17691/stm2022.14.4.07. Epub 2022 Jul 29. Sovrem Tekhnologii Med. 2022. PMID: 37179982 Free PMC article. Review.
-
Transfer Learning for Mortality Prediction in Non-Small Cell Lung Cancer with Low-Resolution Histopathology Slide Snapshots.Stud Health Technol Inform. 2024 Jan 25;310:735-739. doi: 10.3233/SHTI231062. Stud Health Technol Inform. 2024. PMID: 38269906 Free PMC article.
-
Electronic Health Record Data in Cancer Learning Health Systems: Challenges and Opportunities.JCO Clin Cancer Inform. 2022 Mar;6:e2100158. doi: 10.1200/CCI.21.00158. JCO Clin Cancer Inform. 2022. PMID: 35353547 Free PMC article. Review. No abstract available.
References
-
- Lau F, Price M, Boyd J, Partridge C, Bell H, Raworth R. Impact of electronic medical record on physician practice in office settings: a systematic review. BMC Med Inform Decis Mak. 2012 Feb 24;12:10. doi: 10.1186/1472-6947-12-10. https://bmcmedinformdecismak.biomedcentral.com/articles/10.1186/1472-694... - DOI - DOI - PMC - PubMed
-
- MacKenzie SL, Wyatt MC, Schuff R, Tenenbaum JD, Anderson N. Practices and perspectives on building integrated data repositories: results from a 2010 CTSA survey. J Am Med Inform Assoc. 2012 Jun;19(e1):e119–24. doi: 10.1136/amiajnl-2011-000508. http://europepmc.org/abstract/MED/22437072 - DOI - PMC - PubMed
-
- Anderson N, Abend A, Mandel A, Geraghty E, Gabriel D, Wynden R, Kamerick M, Anderson K, Rainwater J, Tarczy-Hornoch P. Implementation of a deidentified federated data network for population-based cohort discovery. J Am Med Inform Assoc. 2012 Jun;19(e1):e60–7. doi: 10.1136/amiajnl-2011-000133. http://europepmc.org/abstract/MED/21873473 - DOI - PMC - PubMed
Publication types
LinkOut - more resources
Full Text Sources
Miscellaneous