. 2024 Jan 2:13:giae058.

doi: 10.1093/gigascience/giae058.

An interconnected data infrastructure to support large-scale rare disease research

Lennart F Johansson¹, Steve Laurie^{2

3}, Dylan Spalding⁴, Spencer Gibson⁵, David Ruvolo¹, Coline Thomas⁴, Davide Piscia^{2

3}, Fernanda de Andrade¹, Gerieke Been¹, Marieke Bijlsma¹, Han Brunner^{6

7

8}, Sandi Cimerman¹, Farid Yavari Dizjikan⁵, Kornelia Ellwanger⁹, Marcos Fernandez^{2

3}, Mallory Freeberg⁴, Gert-Jan van de Geijn¹, Roan Kanninga¹, Vatsalya Maddi⁵, Mehdi Mehtarizadeh⁵, Pieter Neerincx¹, Stephan Ossowski^{9

10}, Ana Rath¹¹, Dieuwke Roelofs-Prins¹, Marloes Stok-Benjamins¹, K Joeri van der Velde¹, Colin Veal⁵, Gerben van der Vries¹, Marc Wadsley⁵, Gregory Warren⁵, Birte Zurek⁹, Thomas Keane⁴, Holm Graessner^{9

12}, Sergi Beltran^{2

13}, Morris A Swertz¹, Anthony J Brookes⁵; Solve-RD consortium

Collaborators, Affiliations

Collaborators

Solve-RD consortium:
Olaf Riess, Tobias B Haack, Holm Graessner, Birte Zurek, Kornelia Ellwanger, Stephan Ossowski, German Demidov, Marc Sturm, Julia M Schulze-Hentrich, Rebecca Schüle, Jishu Xu, Christoph Kessler, Melanie Kellner, Matthis Synofzik, Carlo Wilke, Andreas Traschütz, Ludger Schöls, Holger Hengel, Holger Lerche, Josua Kegele, Peter Heutink, Han Brunner, Hans Scheffer, Nicoline Hoogerbrugge, Alexander Hoischen, Peter A C 't Hoen, Lisenka E L M Vissers, Christian Gilissen, Wouter Steyaert, Karolis Sablauskas, Richarda M de Voer, Erik-Jan Kamsteeg, Bart van de Warrenburg, Nienke van Os, Iris Te Paske, Erik Janssen, Elke de Boer, Marloes Steehouwer, Burcu Yaldiz, Tjitske Kleefstra, Anthony J Brookes, Colin Veal, Spencer Gibson, Vatsalya Maddi, Mehdi Mehtarizadeh, Umar Riaz, Greg Warren, Farid Yavari Dizjikan, Thomas Shorter, Ana Töpf, Volker Straub, Chiara Marini Bettolo, Jordi Diaz Manera, Sophie Hambleton, Karin Engelhardt, Jill Clayton-Smith, Siddharth Banka, Elizabeth Alexander, Adam Jackson, Laurence Faivre, Christel Thauvin, Antonio Vitobello, Anne-Sophie Denommé-Pichon, Yannis Duffourd, Ange-Line Bruel, Christine Peyron, Aurore Pélissier, Sergi Beltran, Ivo Glynne Gut, Steven Laurie, Davide Piscia, Leslie Matalonga, Anastasios Papakonstantinou, Gemma Bullich, Alberto Corvo, Marcos Fernandez-Callejo, Carles Hernández, Daniel Picó, Ida Paramonov, Hanns Lochmüller, Gulcin Gumus, Virginie Bros-Facer, Ana Rath, Marc Hanauer, David Lagorce, Oscar Hongnat, Maroua Chahdil, Emeline Lebreton, Giovanni Stevanin, Alexandra Durr, Claire-Sophie Davoine, Léna Guillot-Noel, Anna Heinzmann, Giulia Coarelli, Gisèle Bonne, Teresinha Evangelista, Valérie Allamand, Isabelle Nelson, Rabah Ben Yaou, Corinne Metay, Bruno Eymard, Enzo Cohen, Antonio Atalaia, Tanya Stojkovic, Milan Macek, Marek Turnovec, Dana Thomasová, Radka Pourová Kremliková, Vera Franková, Markéta Havlovicová, Petra Lišková, Pavla Doležalová, Helen Parkinson, Thomas Keane, Mallory Freeberg, Coline Thomas, Dylan Spalding, Peter Robinson, Daniel Danis, Glenn Robert, Alessia Costa, Christine Patch, Mike Hanna, Henry Houlden, Mary Reilly, Jana Vandrovcova, Stephanie Efthymiou, Heba Morsy, Elisa Cali, Francesca Magrinelli, Sanjay M Sisodiya, Jonathan Rohrer, Francesco Muntoni, Irina Zaharieva, Anna Sarkozy, Vincent Timmerman, Jonathan Baets, Geert de Vries, Jonathan De Winter, Danique Beijer, Peter de Jonghe, Liedewei Van de Vondel, Willem De Ridder, Sarah Weckhuysen, Vincenzo Nigro, Margherita Mutarelli, Manuela Morleo, Michele Pinelli, Alessandra Varavallo, Sandro Banfi, Annalaura Torella, Francesco Musacchia, Giulio Piluso, Alessandra Ferlini, Rita Selvatici, Francesca Gualandi, Stefania Bigoni, Rachele Rossi, Marcella Neri, Stefan Aretz, Isabel Spier, Anna Katharina Sommer, Sophia Peters, Carla Oliveira, Jose Garcia-Pelaez, Rita Barbosa-Matos, Celina São José, Marta Ferreira, Irene Gullo, Susana Fernandes, Luzia Garrido, Pedro Ferreira, Fátima Carneiro, Morris A Swertz, Lennart Johansson, Joeri K van der Velde, Gerben van der Vries, Pieter B Neerincx, David Ruvolo, Kristin M Abbott, Wilhemina S Kerstjens Frederikse, Eveline Zonneveld-Huijssoon, Dieuwke Roelofs-Prins, Marielle van Gijn, Sebastian Köhler, Alison Metcalfe, Alain Verloes, Séverine Drunat, Delphine Heron, Cyril Mignot, Boris Keren, Jean-Madeleine de Sainte Agathe, Caroline Rooryck, Didier Lacombe, Aurelien Trimouille, Manuel Posada De la Paz, Eva Bermejo Sánchez, Estrella López Martín, Beatriz Martínez Delgado, F Javier Alonso García de la Rosa, Andrea Ciolfi, Bruno Dallapiccola, Simone Pizzi, Francesca Clementina Radio, Marco Tartaglia, Alessandra Renieri, Simone Furini, Chiara Fallerini, Elisa Benetti, Peter Balicza, Maria Judit Molnar, Ales Maver, Borut Peterlin, Alexander Münchau, Katja Lohmann, Rebecca Herzog, Martje Pauly, Alfons Macaya, Ana Cazurro-Gutiérrez, Belén Pérez-Dueñas, Francina Munell, Clara Franco Jarava, Laura Batlle Masó, Anna Marcé-Grau, Roger Colobran, Andrés Nascimento Osorio, Daniel Natera de Benito, Hanns Lochmüller, Rachel Thompson, Kiran Polavarapu, Bodo Grimbacher, David Beeson, Judith Cossins, Peter Hackman, Mridul Johari, Marco Savarese, Bjarne Udd, Rita Horvath, Patrick F Chinnery, Thiloka Ratnaike, Fei Gao, Katherine Schon, Gabriel Capella, Laura Valle, Elke Holinski-Feder, Andreas Laner, Verena Steinke-Lange, Evelin Schröck, Andreas Rump, Ayşe Nazlı Başak, Dimitri Hemelsoet, Bart Dermaut, Nika Schuermans, Bruce Poppe, Hannah Verdin, Davide Mei, Annalisa Vetro, Simona Balestrini, Renzo Guerrini, Kristl Claeys, Gijs W E Santen, Emilia K Bijlsma, Mariette J V Hoffer, Claudia A L Ruivenkamp, Kaan Boztug, Matthias Haimel, Isabelle Maystadt, Isabell Cordts, Marcus Deschauer, Ioannis Zaganas, Evgenia Kokosali, Mathioudakis Lambros, Athanasios Evangeliou, Martha Spilioti, Elisabeth Kapaki, Mara Bourbouli, Pasquale Striano, Federico Zara, Antonella Riva, Michele Iacomino, Paolo Uva, Marcello Scala, Paolo Scudieri, Maria-Roberta Cilio, Evelina Carpancea, Chantal Depondt, Damien Lederer, Yves Sznajer, Sarah Duerinckx, Sandrine Mary, Christel Depienne, Andreas Roos, Patrick May

Affiliations

¹ Department of Genetics, University of Groningen, University Medical Center Groningen, HPC CB50, P.O. Box 30001, Groningen, 9700 RB, The Netherlands.
² Centro Nacional de Análisis Genómico, C/Baldiri Reixac 4, 08028, Barcelona, Spain.
³ Universitat de Barcelona (UB), Gran Via de les Corts Catalanes, 585, L'Eixample, 08007, Barcelona, Spain.
⁴ European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, CV10 1SD, UK.
⁵ Department of Genetics, Genomics and Cancer Sciences, University of Leicester, University Road, Leicester, Leicester, LE1 7RH, UK.
⁶ Department of Human Genetics, Radboud University Medical Center, Geert Grooteplein Zuid 10, Nijmegen, 6525 GA, The Netherlands.
⁷ Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, P.O.Box 9103, Nijmegen, 6500 HD, The Netherlands.
⁸ Department of Clinical Genetics, Maastricht University Medical Centre, P. Debyelaan 25, Maastricht, 6229 HX, The Netherlands.
⁹ Institute of Medical Genetics and Applied Genomics, University of Tübingen, Calwerstraße 7, Tübingen 72076, Germany.
¹⁰ Institute for Bioinformatics and Medical Informatics (IBMI), University of Tübingen, Geschwister-Scholl-Platz, Tübingen 72074, Germany.
¹¹ INSERM, US-14 Orphanet, 96 rue Didot, Paris 75014, France.
¹² Centre for Rare Diseases, University of Tübingen, Geschäftsstelle Eisenbahnstraße 63, Tübingen 72072, Germany.
¹³ Departament de Genètica, Microbiologia i Estadística, Facultat de Biologia, Universitat de Barcelona (UB), Diagonal, 643, 08028, Barcelona, Spain.

PMID: 39302238
PMCID: PMC11413801
DOI: 10.1093/gigascience/giae058

An interconnected data infrastructure to support large-scale rare disease research

Lennart F Johansson et al. Gigascience. 2024.

. 2024 Jan 2:13:giae058.

doi: 10.1093/gigascience/giae058.

Authors

Collaborators

Solve-RD consortium:
Olaf Riess, Tobias B Haack, Holm Graessner, Birte Zurek, Kornelia Ellwanger, Stephan Ossowski, German Demidov, Marc Sturm, Julia M Schulze-Hentrich, Rebecca Schüle, Jishu Xu, Christoph Kessler, Melanie Kellner, Matthis Synofzik, Carlo Wilke, Andreas Traschütz, Ludger Schöls, Holger Hengel, Holger Lerche, Josua Kegele, Peter Heutink, Han Brunner, Hans Scheffer, Nicoline Hoogerbrugge, Alexander Hoischen, Peter A C 't Hoen, Lisenka E L M Vissers, Christian Gilissen, Wouter Steyaert, Karolis Sablauskas, Richarda M de Voer, Erik-Jan Kamsteeg, Bart van de Warrenburg, Nienke van Os, Iris Te Paske, Erik Janssen, Elke de Boer, Marloes Steehouwer, Burcu Yaldiz, Tjitske Kleefstra, Anthony J Brookes, Colin Veal, Spencer Gibson, Vatsalya Maddi, Mehdi Mehtarizadeh, Umar Riaz, Greg Warren, Farid Yavari Dizjikan, Thomas Shorter, Ana Töpf, Volker Straub, Chiara Marini Bettolo, Jordi Diaz Manera, Sophie Hambleton, Karin Engelhardt, Jill Clayton-Smith, Siddharth Banka, Elizabeth Alexander, Adam Jackson, Laurence Faivre, Christel Thauvin, Antonio Vitobello, Anne-Sophie Denommé-Pichon, Yannis Duffourd, Ange-Line Bruel, Christine Peyron, Aurore Pélissier, Sergi Beltran, Ivo Glynne Gut, Steven Laurie, Davide Piscia, Leslie Matalonga, Anastasios Papakonstantinou, Gemma Bullich, Alberto Corvo, Marcos Fernandez-Callejo, Carles Hernández, Daniel Picó, Ida Paramonov, Hanns Lochmüller, Gulcin Gumus, Virginie Bros-Facer, Ana Rath, Marc Hanauer, David Lagorce, Oscar Hongnat, Maroua Chahdil, Emeline Lebreton, Giovanni Stevanin, Alexandra Durr, Claire-Sophie Davoine, Léna Guillot-Noel, Anna Heinzmann, Giulia Coarelli, Gisèle Bonne, Teresinha Evangelista, Valérie Allamand, Isabelle Nelson, Rabah Ben Yaou, Corinne Metay, Bruno Eymard, Enzo Cohen, Antonio Atalaia, Tanya Stojkovic, Milan Macek, Marek Turnovec, Dana Thomasová, Radka Pourová Kremliková, Vera Franková, Markéta Havlovicová, Petra Lišková, Pavla Doležalová, Helen Parkinson, Thomas Keane, Mallory Freeberg, Coline Thomas, Dylan Spalding, Peter Robinson, Daniel Danis, Glenn Robert, Alessia Costa, Christine Patch, Mike Hanna, Henry Houlden, Mary Reilly, Jana Vandrovcova, Stephanie Efthymiou, Heba Morsy, Elisa Cali, Francesca Magrinelli, Sanjay M Sisodiya, Jonathan Rohrer, Francesco Muntoni, Irina Zaharieva, Anna Sarkozy, Vincent Timmerman, Jonathan Baets, Geert de Vries, Jonathan De Winter, Danique Beijer, Peter de Jonghe, Liedewei Van de Vondel, Willem De Ridder, Sarah Weckhuysen, Vincenzo Nigro, Margherita Mutarelli, Manuela Morleo, Michele Pinelli, Alessandra Varavallo, Sandro Banfi, Annalaura Torella, Francesco Musacchia, Giulio Piluso, Alessandra Ferlini, Rita Selvatici, Francesca Gualandi, Stefania Bigoni, Rachele Rossi, Marcella Neri, Stefan Aretz, Isabel Spier, Anna Katharina Sommer, Sophia Peters, Carla Oliveira, Jose Garcia-Pelaez, Rita Barbosa-Matos, Celina São José, Marta Ferreira, Irene Gullo, Susana Fernandes, Luzia Garrido, Pedro Ferreira, Fátima Carneiro, Morris A Swertz, Lennart Johansson, Joeri K van der Velde, Gerben van der Vries, Pieter B Neerincx, David Ruvolo, Kristin M Abbott, Wilhemina S Kerstjens Frederikse, Eveline Zonneveld-Huijssoon, Dieuwke Roelofs-Prins, Marielle van Gijn, Sebastian Köhler, Alison Metcalfe, Alain Verloes, Séverine Drunat, Delphine Heron, Cyril Mignot, Boris Keren, Jean-Madeleine de Sainte Agathe, Caroline Rooryck, Didier Lacombe, Aurelien Trimouille, Manuel Posada De la Paz, Eva Bermejo Sánchez, Estrella López Martín, Beatriz Martínez Delgado, F Javier Alonso García de la Rosa, Andrea Ciolfi, Bruno Dallapiccola, Simone Pizzi, Francesca Clementina Radio, Marco Tartaglia, Alessandra Renieri, Simone Furini, Chiara Fallerini, Elisa Benetti, Peter Balicza, Maria Judit Molnar, Ales Maver, Borut Peterlin, Alexander Münchau, Katja Lohmann, Rebecca Herzog, Martje Pauly, Alfons Macaya, Ana Cazurro-Gutiérrez, Belén Pérez-Dueñas, Francina Munell, Clara Franco Jarava, Laura Batlle Masó, Anna Marcé-Grau, Roger Colobran, Andrés Nascimento Osorio, Daniel Natera de Benito, Hanns Lochmüller, Rachel Thompson, Kiran Polavarapu, Bodo Grimbacher, David Beeson, Judith Cossins, Peter Hackman, Mridul Johari, Marco Savarese, Bjarne Udd, Rita Horvath, Patrick F Chinnery, Thiloka Ratnaike, Fei Gao, Katherine Schon, Gabriel Capella, Laura Valle, Elke Holinski-Feder, Andreas Laner, Verena Steinke-Lange, Evelin Schröck, Andreas Rump, Ayşe Nazlı Başak, Dimitri Hemelsoet, Bart Dermaut, Nika Schuermans, Bruce Poppe, Hannah Verdin, Davide Mei, Annalisa Vetro, Simona Balestrini, Renzo Guerrini, Kristl Claeys, Gijs W E Santen, Emilia K Bijlsma, Mariette J V Hoffer, Claudia A L Ruivenkamp, Kaan Boztug, Matthias Haimel, Isabelle Maystadt, Isabell Cordts, Marcus Deschauer, Ioannis Zaganas, Evgenia Kokosali, Mathioudakis Lambros, Athanasios Evangeliou, Martha Spilioti, Elisabeth Kapaki, Mara Bourbouli, Pasquale Striano, Federico Zara, Antonella Riva, Michele Iacomino, Paolo Uva, Marcello Scala, Paolo Scudieri, Maria-Roberta Cilio, Evelina Carpancea, Chantal Depondt, Damien Lederer, Yves Sznajer, Sarah Duerinckx, Sandrine Mary, Christel Depienne, Andreas Roos, Patrick May

Affiliations

¹ Department of Genetics, University of Groningen, University Medical Center Groningen, HPC CB50, P.O. Box 30001, Groningen, 9700 RB, The Netherlands.
² Centro Nacional de Análisis Genómico, C/Baldiri Reixac 4, 08028, Barcelona, Spain.
³ Universitat de Barcelona (UB), Gran Via de les Corts Catalanes, 585, L'Eixample, 08007, Barcelona, Spain.
⁴ European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, CV10 1SD, UK.
⁵ Department of Genetics, Genomics and Cancer Sciences, University of Leicester, University Road, Leicester, Leicester, LE1 7RH, UK.
⁶ Department of Human Genetics, Radboud University Medical Center, Geert Grooteplein Zuid 10, Nijmegen, 6525 GA, The Netherlands.
⁷ Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, P.O.Box 9103, Nijmegen, 6500 HD, The Netherlands.
⁸ Department of Clinical Genetics, Maastricht University Medical Centre, P. Debyelaan 25, Maastricht, 6229 HX, The Netherlands.
⁹ Institute of Medical Genetics and Applied Genomics, University of Tübingen, Calwerstraße 7, Tübingen 72076, Germany.
¹⁰ Institute for Bioinformatics and Medical Informatics (IBMI), University of Tübingen, Geschwister-Scholl-Platz, Tübingen 72074, Germany.
¹¹ INSERM, US-14 Orphanet, 96 rue Didot, Paris 75014, France.
¹² Centre for Rare Diseases, University of Tübingen, Geschäftsstelle Eisenbahnstraße 63, Tübingen 72072, Germany.
¹³ Departament de Genètica, Microbiologia i Estadística, Facultat de Biologia, Universitat de Barcelona (UB), Diagonal, 643, 08028, Barcelona, Spain.

PMID: 39302238
PMCID: PMC11413801
DOI: 10.1093/gigascience/giae058

Abstract

The Solve-RD project brings together clinicians, scientists, and patient representatives from 51 institutes spanning 15 countries to collaborate on genetically diagnosing ("solving") rare diseases (RDs). The project aims to significantly increase the diagnostic success rate by co-analyzing data from thousands of RD cases, including phenotypes, pedigrees, exome/genome sequencing, and multiomics data. Here we report on the data infrastructure devised and created to support this co-analysis. This infrastructure enables users to store, find, connect, and analyze data and metadata in a collaborative manner. Pseudonymized phenotypic and raw experimental data are submitted to the RD-Connect Genome-Phenome Analysis Platform and processed through standardized pipelines. Resulting files and novel produced omics data are sent to the European Genome-Phenome Archive, which adds unique file identifiers and provides long-term storage and controlled access services. MOLGENIS "RD3" and Café Variome "Discovery Nexus" connect data and metadata and offer discovery services, and secure cloud-based "Sandboxes" support multiparty data analysis. This successfully deployed and useful infrastructure design provides a blueprint for other projects that need to analyze large amounts of heterogeneous data.

Keywords: bioinformatics; computational biology; fair data; genetics; infrastructure; rare disease.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no competing interests.

Figures

**Figure 1:**
Rare disease analysis infrastructure overview. deep-ES: deep sequencing ES; EGA: European Genome-Phenome Archive; ERN: European Reference Network; ES: exome sequencing; GPAP: Genome-Phenome Analysis Platform; GS: genome sequencing; LR-GS: long-read genome sequencing; LR-RNAseq: long-read RNA-sequencing; SR-GS: short-read genome sequencing; SR-RNAseq: short-read RNA sequencing; UI: user interface. The Solve-RD dataset is also discoverable through the participation of the RD-Connect GPAP in Matchmaker exchange and the Beacon Network.

**Figure 2:**
Sandbox folder structure. Data are organized by the data analysis working groups (DATF working group [WG]) in either folders per European Reference Network (ERN) or a common folder (for data intended for all ERNs). Additionally, large files that should be kept but not shared are stored in a “Sandbox only” folder. All data to be shared with the ERNs are linked to an sftp folder with a subfolder per ERN accessible via SFTP access protocol. Thin arrows indicate links between specific subfolders. These folders are further synchronized to 2 folders: DATF and DITF, each with the same information (indicated by the thick arrow). The DATF folder has the same structure as the initial sftp folder (a folder for each DATF WG with subfolders per ERN). The DITF folder has the converse structure (a folder for each DITF ERN with subfolders per WG). This structure makes it easy for both DATF and DITF to browse the data (e.g., all CNV data or all data from ERN-ITHACA).

**Figure 3:**
Data and metadata relations within Solve-RD. Arrows indicate the “derived from” direction (e.g., Sample DNA00001 is derived from Subject P00001). We distinguish 4 main data/metadata types: subject, sample, experiments, and files, with each derived from the former. This figure is actually a simplification as data are further organized in data releases we call “freezes” and can be used in different combinations as “analyses.”

**Figure 4:**
Solve-RD RD3 LabInfo screen showing a subset of the Freeze1 experiment data. On the left, entries are filtered on patch “Original data” and columns are filtered on interest. In the current view, the experimentID is connected to the sample on which the experiment was performed. In addition, information on the experiment is shown. For these samples, genomic data were the input for exome sequencing experiments on which various different enrichment kits were used. For most of the samples, statistics on the average target coverage (MeanCov) and number of bases covered by at least 20 sequencing reads (C20) was available. If a subject was retracted from the project, all metadata except identifiers were removed from the database and the experiment was labeled as retracted.

**Figure 5:**
(A) Discovery Nexus query interface. This interface supports querying by any combination of various demographic and inheritance (Subject Filters), phenotypes (HPO Query Builder), diseases (ORDO Query Builders), or suspected variant filters (Variant Filter). In the HPO Query Builder, typing any part of an HPO phenotype term or code creates a visible list of relevant items to select from, whereupon they are transferred into the adjacent panel to form part of the query. Phenotype matching can specify matching on identical terms only (exact) or recover similar terms (based on a precomputed matrix of relationship scores and the position of the slider). The minimum number of matching terms can also be specified, creating an “OR” query, and settings above the minimum create a query that returns results that match at least the specified number of terms in any combination. HPO queries can also be instructed to interrogate phenotype data stored as ORDO terms. Matching of HPO to ORDO terms (in the ORDO Query Builder) is controlled by the HPO pairwise similarity slider, to define the number of HPO terms that should match an ORDO term as well as the ORDO match scale, defining the specificity of the HPO term(s) to the selected ORDO term (based on a precomputed matrix of their occurrence across all ORDO terms). Hence, when mapping ORDO to HPO terms, exact matching will traverse the mapping of these 2 term sets to find fewer but more specific HPO terms, while minimum matching will include more HPO terms, but these may match other ORDO terms as well. Variant data cannot be filtered at the specific base-change level (as this would raise privacy concerns) but are instead queryable by host gene, allele frequency, and mutation type using the Variant Query Builder. It is also possible to filter for variants based on affected biochemical pathways, given known relationships between genes and pathways (using the Reactome Knowledge base [60]). Finally, the ERN dataset to be queried must be explicitly stated and requires that the user has permission to query the specified ERNs. (B) Discovery Nexus Query Results. After submitting the query using the “Build query button,” the system will return a count for matching results in the resources selected. Clicking on the number in the blue box will bring up the summary pop-up window as shown above, giving basic details of the matches (again subject to the user having been assigned permissions). The blue “Get Full Data for Selected Subjects” will open a link to request access from the resources holding the required data (where this is available). Alternatively, clicking the green button in the source details will open a summary page with contact details for the resource, where a direct link to request the data is not available.

See this image and copyright information in PMC

References

1. Zurek B, Ellwanger K, Vissers LELM, et al. Solve-RD: systematic pan-European data sharing and collaborative analysis to solve rare diseases. Eur J Hum Genet. 2021;29:1325–31. 10.1038/s41431-021-00859-0. - DOI - PMC - PubMed
1. Laurie S, Piscia D, Matalonga L, et al. The RD-Connect Genome-Phenome Analysis Platform: accelerating diagnosis, research, and gene discovery for rare diseases. Hum Mutat. 2022;43(6):717–33. 10.1002/humu.24353. - DOI - PMC - PubMed
1. Swertz MA, Dijkstra M, Adamusiak T, et al. The MOLGENIS toolkit: rapid prototyping of biosoftware at the push of a button. BMC Bioinf. 2010;11(Suppl. 12):S12. 10.1186/1471-2105-11-S12-S12. - DOI - PMC - PubMed
1. van der Velde KJ, Imhann F, Charbon B, et al. MOLGENIS research: advanced bioinformatics data software for non-bioinformaticians. Bioinformatics. 2019;35(6):1076–78. 10.1093/bioinformatics/bty742. - DOI - PMC - PubMed
1. Lancaster O, Beck T, Atlan D, et al. Cafe Variome: general-purpose software for making genotype–phenotype data discoverable in restricted or open access contexts. Hum Mutat. 2015;36(10):957–64. 10.1002/humu.22841. - DOI - PubMed

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions

Grants and funding

LinkOut - more resources

Full Text Sources
- PubMed Central
- Silverchair Information Systems
Medical
- MedlinePlus Health Information
Miscellaneous
- NCI CPTAC Assay Portal

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

An interconnected data infrastructure to support large-scale rare disease research

Collaborators

Affiliations

An interconnected data infrastructure to support large-scale rare disease research

Authors

Collaborators

Affiliations

Abstract

Conflict of interest statement

Figures

References

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources

Medical

Miscellaneous