MOLGENIS Armadillo: a lightweight server for federated analysis using DataSHIELD
- PMID: 39673440
- PMCID: PMC11734753
- DOI: 10.1093/bioinformatics/btae726
MOLGENIS Armadillo: a lightweight server for federated analysis using DataSHIELD
Abstract
Summary: Extensive human health data from cohort studies, national registries, and biobanks can reveal lifecourse risk factors impacting health. Combining these sources offers increased statistical power, rare outcome detection, replication of findings, and extended study periods. Traditionally, this required data transfer to a central location or separate partner analyses with pooled summary statistics, posing ethical, legal, and time constraints. Federated analysis-which involves remote data analysis without sharing individual-level data-is a promising alternative. One promising solution is DataSHIELD (https://datashield.org/), an open-source R based implementation. To enable federated analysis, data owners need a user-friendly way to install the federated infrastructure and manage users and data. Here, we present MOLGENIS Armadillo: a lightweight server for federated analysis solutions such as DataSHIELD.
Availability and implementation: Armadillo is implemented as a collection of three packages freely available under the open source licence LGPLv3: two R packages downloadable from the Comprehensive R Archive Network (CRAN) ("MolgenisArmadillo" and "DSMolgenisArmdillo") and one Java application ("ArmadilloService") as jar and docker images via Github (https://github.com/molgenis/molgenis-service-armadillo).
© The Author(s) 2024. Published by Oxford University Press.
References
-
- Cadman T, Elhakeem A, Vinther JL. et al. Associations of maternal educational level, proximity to green space during pregnancy, and gestational diabetes with body mass index from infancy to early adulthood: a proof-of-concept federated analysis in 18 birth cohorts. Am J Epidemiol 2024a;193:753–63. - PMC - PubMed
-
- Cadman T, Strandberg-Larsen K, Calas L. et al. Urban environment in pregnancy and postpartum depression: an individual participant data meta-analysis of 12 european birth cohorts. Environ Int 2024b;185:108453. - PubMed