An interconnected data infrastructure to support large-scale rare disease research
- PMID: 39302238
- PMCID: PMC11413801
- DOI: 10.1093/gigascience/giae058
Abstract
The Solve-RD project brings together clinicians, scientists, and patient representatives from 51 institutes spanning 15 countries to collaborate on genetically diagnosing ("solving") rare diseases (RDs). The project aims to significantly increase the diagnostic success rate by co-analyzing data from thousands of RD cases, including phenotypes, pedigrees, exome/genome sequencing, and multiomics data. Here we report on the data infrastructure devised and created to support this co-analysis. This infrastructure enables users to store, find, connect, and analyze data and metadata in a collaborative manner. Pseudonymized phenotypic and raw experimental data are submitted to the RD-Connect Genome-Phenome Analysis Platform and processed through standardized pipelines. The resulting files and newly produced omics data are sent to the European Genome-Phenome Archive, which adds unique file identifiers and provides long-term storage and controlled access services. MOLGENIS "RD3" and Café Variome "Discovery Nexus" connect data and metadata and offer discovery services, and secure cloud-based "Sandboxes" support multiparty data analysis. The infrastructure has been successfully deployed and has proved useful, and its design provides a blueprint for other projects that need to analyze large amounts of heterogeneous data.
Keywords: bioinformatics; computational biology; FAIR data; genetics; infrastructure; rare disease.
© The Author(s) 2024. Published by Oxford University Press GigaScience.
Conflict of interest statement
The authors declare that they have no competing interests.
Similar articles
- The RD-Connect Registry & Biobank Finder: a tool for sharing aggregated data and metadata among rare disease researchers. Eur J Hum Genet. 2018 May;26(5):631-643. doi: 10.1038/s41431-017-0085-z. Epub 2018 Feb 2. PMID: 29396563 Free PMC article.
- The RD-Connect Genome-Phenome Analysis Platform: Accelerating diagnosis, research, and gene discovery for rare diseases. Hum Mutat. 2022 Jun;43(6):717-733. doi: 10.1002/humu.24353. PMID: 35178824 Free PMC article.
- Remote visualization of large-scale genomic alignments for collaborative clinical research and diagnosis of rare diseases. Cell Genom. 2023 Jan 11;3(2):100246. doi: 10.1016/j.xgen.2022.100246. eCollection 2023 Feb 8. PMID: 36819661 Free PMC article.
- Harmonising phenomics information for a better interoperability in the rare disease field. Eur J Med Genet. 2018 Nov;61(11):706-714. doi: 10.1016/j.ejmg.2018.01.013. Epub 2018 Feb 7. PMID: 29425702 Review.
- Development of Bioinformatics Infrastructure for Genomics Research. Glob Heart. 2017 Jun;12(2):91-98. doi: 10.1016/j.gheart.2017.01.005. Epub 2017 Mar 13. PMID: 28302555 Free PMC article. Review.
Cited by
- Genomic reanalysis of a pan-European rare-disease resource yields new diagnoses. Nat Med. 2025 Feb;31(2):478-489. doi: 10.1038/s41591-024-03420-w. Epub 2025 Jan 17. PMID: 39825153 Free PMC article.