Case Reports
JAMIA Open. 2022 May 18;5(2):ooac032.
doi: 10.1093/jamiaopen/ooac032. eCollection 2022 Jul.

Establishing a centralized data mart from the Rakai community cohort study to improve HIV research in Rakai, Uganda

Anthony Ndyanabo et al. JAMIA Open. 2022.

Abstract

To improve timely access to quality HIV research data, the Rakai Health Sciences Program (RHSP) Data Mart was developed to move cohort study data from a legacy database platform into a modernized system that uses standard data management processes. The RHSP Data Mart was developed on a Microsoft SQL Server platform using Microsoft SQL Server Integration Services with custom data mappings and queries. The data mart stores 20+ years of longitudinal HIV research data and includes standard processes for managing data, a data dictionary, training materials, and a library of queries to fulfill data requests and load new data from completed survey rounds. The RHSP Data Mart enables efficient querying and analysis of multidimensional research data by simplifying data integration and processing. A sustainable database platform with well-defined data management processes promotes data accessibility and reproducibility, enabling researchers to advance their understanding and management of infectious diseases.

Keywords: data management; data mart; data reproducibility; infectious diseases.

Figures

Figure 1.
Technical architecture for the RHSP Data Mart solution. Source data from the RCCS are integrated using standard ETL processes into a centralized data repository accompanied by the comprehensive data dictionary, library of sample queries, and training materials. ETL: extract, transform, and load; RHSP: Rakai Health Sciences Program; RCCS: Rakai Community Cohort Study.
Figure 2.
Data flow diagram for the RHSP Data Mart. Using FoxPro Upsize Wizard, source data from the production FoxPro databases are loaded into a staging database in each of the development, quality assurance, and production environments. ETL processes were developed to integrate data from the staging database into the data mart using a custom data model. ETL: extract, transform, and load; RHSP: Rakai Health Sciences Program.
Figure 3.
Entity relationship diagram for the RHSP Data Mart. The custom relational data model is a participant-centric model that integrates data from demographic, survey, and laboratory domains by a randomized participant study ID and survey round. RHSP: Rakai Health Sciences Program.
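
The staging-to-mart flow and the participant-centric model described in the figure captions can be illustrated with a small sketch. It is not the RHSP implementation: it uses Python's built-in sqlite3 module as a stand-in for Microsoft SQL Server and SQL Server Integration Services, and every table name, column name, and data value (staging_visit, participant, survey, lab_result, study_id, survey_round) is a hypothetical chosen for illustration. The sketch only shows the general idea of cleaning staged rows, loading them into demographic, survey, and laboratory tables, and joining those tables on participant study ID and survey round.

```python
import sqlite3

# Hypothetical schema: one raw staging table plus three mart tables,
# all keyed by a participant study ID (and survey round where relevant).
SCHEMA = """
CREATE TABLE staging_visit (
    study_id TEXT, survey_round TEXT, sex TEXT, age TEXT, hiv_result TEXT
);
CREATE TABLE participant (study_id TEXT PRIMARY KEY, sex TEXT);
CREATE TABLE survey (
    study_id TEXT REFERENCES participant(study_id),
    survey_round INTEGER,
    age INTEGER,
    PRIMARY KEY (study_id, survey_round)
);
CREATE TABLE lab_result (
    study_id TEXT REFERENCES participant(study_id),
    survey_round INTEGER,
    hiv_result TEXT,
    PRIMARY KEY (study_id, survey_round)
);
"""

# Transform (trim, normalize case, cast types) and load into the mart tables.
TRANSFORM_AND_LOAD = """
INSERT OR IGNORE INTO participant (study_id, sex)
    SELECT DISTINCT TRIM(study_id), UPPER(TRIM(sex)) FROM staging_visit;
INSERT INTO survey (study_id, survey_round, age)
    SELECT TRIM(study_id), CAST(survey_round AS INTEGER), CAST(age AS INTEGER)
    FROM staging_visit;
INSERT INTO lab_result (study_id, survey_round, hiv_result)
    SELECT TRIM(study_id), CAST(survey_round AS INTEGER), UPPER(TRIM(hiv_result))
    FROM staging_visit
    WHERE hiv_result IS NOT NULL;
"""

# A sample "data request" query: one row per participant per survey round,
# with laboratory data attached when it exists for that round.
REQUEST_QUERY = """
SELECT p.study_id, s.survey_round, p.sex, s.age, l.hiv_result
FROM participant AS p
JOIN survey AS s ON s.study_id = p.study_id
LEFT JOIN lab_result AS l
    ON l.study_id = s.study_id AND l.survey_round = s.survey_round
ORDER BY p.study_id, s.survey_round;
"""


def run_demo():
    """Stage made-up rows, run the load, and return the joined result."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    # Made-up staged rows with the kind of whitespace and case noise
    # that a legacy export might contain.
    conn.executemany(
        "INSERT INTO staging_visit VALUES (?, ?, ?, ?, ?)",
        [
            (" P001", "1", "f", "24", "neg"),
            ("P001 ", "2", "F", "25", "pos"),
            ("P002", "1", "m", "31", None),
        ],
    )
    conn.executescript(TRANSFORM_AND_LOAD)
    rows = conn.execute(REQUEST_QUERY).fetchall()
    conn.close()
    return rows


if __name__ == "__main__":
    for row in run_demo():
        print(row)
```

Keying the survey and laboratory tables on the same (study ID, survey round) pair is what lets a single join assemble a longitudinal record for each participant. A left join is used here so that survey rounds without a laboratory result still appear in the output.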

