Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation

Solving patients with rare diseases through programmatic reanalysis of genome-phenome data

Leslie Matalonga et al. Eur J Hum Genet. 2021 Sep.

Erratum in

  • Correction to: Solving patients with rare diseases through programmatic reanalysis of genome-phenome data.
    Matalonga L, Hernández-Ferrer C, Piscia D; Solve-RD SNV-indel working group; Schüle R, Synofzik M, Töpf A, Vissers LELM, de Voer R; Solve-RD DITF-GENTURIS; Solve-RD DITF-ITHACA; Solve-RD DITF-euroNMD; Solve-RD DITF-RND; Tonda R, Laurie S, Fernandez-Callejo M, Picó D, Garcia-Linares C, Papakonstantinou A, Corvó A, Joshi R, Diez H, Gut I, Hoischen A, Graessner H, Beltran S; Solve-RD Consortium. Matalonga L, et al. Eur J Hum Genet. 2021 Sep;29(9):1466-1469. doi: 10.1038/s41431-021-00934-6. Eur J Hum Genet. 2021. PMID: 34393220 Free PMC article. No abstract available.

Abstract

Reanalysis of inconclusive exome/genome sequencing data increases the diagnosis yield of patients with rare diseases. However, the cost and efforts required for reanalysis prevent its routine implementation in research and clinical environments. The Solve-RD project aims to reveal the molecular causes underlying undiagnosed rare diseases. One of the goals is to implement innovative approaches to reanalyse the exomes and genomes from thousands of well-studied undiagnosed cases. The raw genomic data is submitted to Solve-RD through the RD-Connect Genome-Phenome Analysis Platform (GPAP) together with standardised phenotypic and pedigree data. We have developed a programmatic workflow to reanalyse genome-phenome data. It uses the RD-Connect GPAP's Application Programming Interface (API) and relies on the big-data technologies upon which the system is built. We have applied the workflow to prioritise rare known pathogenic variants from 4411 undiagnosed cases. The queries returned an average of 1.45 variants per case, which first were evaluated in bulk by a panel of disease experts and afterwards specifically by the submitter of each case. A total of 120 index cases (21.2% of prioritised cases, 2.7% of all exome/genome-negative samples) have already been solved, with others being under investigation. The implementation of solutions as the one described here provide the technical framework to enable periodic case-level data re-evaluation in clinical settings, as recommended by the American College of Medical Genetics.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Fig. 1
Fig. 1. Programmatic reanalysis data workflow.
Unsolved cases (RD-REAL datasets = phenotypic and genomic data) are submitted by Solve-RD members from the 4 core ERNs and the 2 UDPs participating in the project. Genomic data is processed through a standard analysis pipeline [15] and integrated with the phenotypic information in the RD-Connect GPAP. Analysis of the data using the programmatic approach described in this study is performed by the SNV-indel working group. The SNV-indel working group is one of the seven working groups established by the Solve-RD Data Analysis Task Force (DATF) to massively reanalyse data with different analytical approaches (e.g. CNV, somatic, meta-analysis, etc.) (http://solve-rd.eu/the-group/data-analysis-task-force/). The DATF involves data scientists and genomics experts from the project. Resulting candidate variants are submitted to the Data Interpretation Task Force (DITF), involving expert clinicians and geneticists for prioritisation and final interpretation. One DITF has been established for each of the core ERNs participating in the project (http://solve-rd.eu/the-group/data-interpretation-task-force-ditf/). DITF include or are in contact with case submitters to enable a final decision for a new patient diagnosis. Diagnosed cases are automatically updated in the system and the remaining unsolved cases are susceptible to re-enter a new round of analysis.
Fig. 2
Fig. 2. Results of reanalysis of undiagnosed RD cases to identify known disease-causing variants.
A Filtration, prioritisation and interpretation workflow (numbers refer to index cases). B Number of variants per case submitted to DITFs for prioritisation and resulting number of variants submitted for interpretation. C Variants interpretation results from prioritised cases per type of disorder (numbers refer to variants). D Number of causative variants identified according to the year the corresponding gene (grey) or variant (yellow) was first described in the literature as disease-causing (according to OMIM) or pathogenic (according to ClinVar).

References

    1. Nguengang Wakap S, Lambert DM, Olry A, Rodwell C, Gueydan C, Lanneau V, et al. Estimating cumulative point prevalence of rare diseases: analysis of the Orphanet database. Eur J Hum Genet. 2020;28:165–73. doi: 10.1038/s41431-019-0508-0. - DOI - PMC - PubMed
    1. Amberger JS, Bocchini CA, Schiettecatte F, Scott AF, Hamosh A. OMIM.org: online Mendelian Inheritance in Man (OMIM®), an online catalog of human genes and genetic disorders. Nucleic Acids Res. 2015;43:D789–98. doi: 10.1093/nar/gku1205. - DOI - PMC - PubMed
    1. Amberger JS, Hamosh A. Searching Online Mendelian Inheritance in Man (OMIM): a Knowledgebase of Human Genes and Genetic Phenotypes. Curr Protoc Bioinforma. 2017;58:1.2.1–1.2.12. doi: 10.1002/cpbi.27. - DOI - PMC - PubMed
    1. Boycott KM, Hartley T, Biesecker LG, Gibbs RA, Innes AM, Riess O, et al. A Diagnosis for All Rare Genetic Diseases: the Horizon and the Next Frontiers. Cell. 2019;177:32–7. doi: 10.1016/j.cell.2019.02.040. - DOI - PubMed
    1. Wright CF, Fitzgerald TW, Jones WD, Clayton S, McRae JF, van Kogelenberg M, et al. Genetic diagnosis of developmental disorders in the DDD study: a scalable analysis of genome-wide research data. Lancet. 2015;385:1305–14. doi: 10.1016/S0140-6736(14)61705-0. - DOI - PMC - PubMed

Publication types