Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Dec 26;41(1):btae726.
doi: 10.1093/bioinformatics/btae726.

MOLGENIS Armadillo: a lightweight server for federated analysis using DataSHIELD

Affiliations

MOLGENIS Armadillo: a lightweight server for federated analysis using DataSHIELD

Tim Cadman et al. Bioinformatics. .

Abstract

Summary: Extensive human health data from cohort studies, national registries, and biobanks can reveal lifecourse risk factors impacting health. Combining these sources offers increased statistical power, rare outcome detection, replication of findings, and extended study periods. Traditionally, this required data transfer to a central location or separate partner analyses with pooled summary statistics, posing ethical, legal, and time constraints. Federated analysis-which involves remote data analysis without sharing individual-level data-is a promising alternative. One promising solution is DataSHIELD (https://datashield.org/), an open-source R based implementation. To enable federated analysis, data owners need a user-friendly way to install the federated infrastructure and manage users and data. Here, we present MOLGENIS Armadillo: a lightweight server for federated analysis solutions such as DataSHIELD.

Availability and implementation: Armadillo is implemented as a collection of three packages freely available under the open source licence LGPLv3: two R packages downloadable from the Comprehensive R Archive Network (CRAN) ("MolgenisArmadillo" and "DSMolgenisArmdillo") and one Java application ("ArmadilloService") as jar and docker images via Github (https://github.com/molgenis/molgenis-service-armadillo).

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Armadillo information flow.

References

    1. Bycroft C, Freeman C, Petkova D. et al. The UK biobank resource with deep phenotyping and genomic data. Nature 2018;562:203–9. - PMC - PubMed
    1. Cadman T, Elhakeem A, Vinther JL. et al. Associations of maternal educational level, proximity to green space during pregnancy, and gestational diabetes with body mass index from infancy to early adulthood: a proof-of-concept federated analysis in 18 birth cohorts. Am J Epidemiol 2024a;193:753–63. - PMC - PubMed
    1. Cadman T, Strandberg-Larsen K, Calas L. et al. Urban environment in pregnancy and postpartum depression: an individual participant data meta-analysis of 12 european birth cohorts. Environ Int 2024b;185:108453. - PubMed
    1. Doiron D, Burton P, Marcon Y. et al. Data harmonization and federated analysis of population-based studies: the bioshare project. Emerg Themes Epidemiol 2013;10:12. - PMC - PubMed
    1. Doiron D, Marcon Y, Fortier I. et al. Software application profile: opal and mica: open-source software solutions for epidemiological data management, harmonization and dissemination. Int J Epidemiol 2017;46:1372–8. - PMC - PubMed