Rare disease gene association discovery in the 100,000 Genomes Project
- PMID: 40011789
- DOI: 10.1038/s41586-025-08623-w
Rare disease gene association discovery in the 100,000 Genomes Project
Abstract
Up to 80% of rare disease patients remain undiagnosed after genomic sequencing1, with many probably involving pathogenic variants in yet to be discovered disease-gene associations. To search for such associations, we developed a rare variant gene burden analytical framework for Mendelian diseases, and applied it to protein-coding variants from whole-genome sequencing of 34,851 cases and their family members recruited to the 100,000 Genomes Project2. A total of 141 new associations were identified, including five for which independent disease-gene evidence was recently published. Following in silico triaging and clinical expert review, 69 associations were prioritized, of which 30 could be linked to existing experimental evidence. The five associations with strongest overall genetic and experimental evidence were monogenic diabetes with the known β cell regulator3,4 UNC13A, schizophrenia with GPR17, epilepsy with RBFOX3, Charcot-Marie-Tooth disease with ARPC3 and anterior segment ocular abnormalities with POMK. Further confirmation of these and other associations could lead to numerous diagnoses, highlighting the clinical impact of large-scale statistical approaches to rare disease-gene association discovery.
© 2025. The Author(s).
Conflict of interest statement
Competing interests: The authors declare the following competing interests: D.S. and M.C. were seconded to, and received salary from, Genomics England, a wholly owned Department of Health and Social Care company, from 2016 to 2018 and 2013 to 2021, respectively. E.A.O. has research funding from Kamari Pharma, Pavella Therapeutics, Unilever and the Leo Foundation unrelated to this work. She is CI for a trial for Kamari Pharma and performs consultancy for Kamari Pharma, Azitra and Palvella Therapeutics (all money goes to the university). S.L.Z. has provided consultancy services to Health Lumen. All other authors declare no competing interests.
Update of
-
Rare disease gene association discovery from burden analysis of the 100,000 Genomes Project data.medRxiv [Preprint]. 2023 Dec 21:2023.12.20.23300294. doi: 10.1101/2023.12.20.23300294. medRxiv. 2023. Update in: Nature. 2025 Feb 26. doi: 10.1038/s41586-025-08623-w. PMID: 38196618 Free PMC article. Updated. Preprint.
References
-
- 100,000 Genomes Project Pilot Investigators et al. 100,000 Genomes pilot on rare-disease diagnosis in health care—preliminary report. N. Engl. J. Med. 385, 1868–1880 (2021).
-
- Turnbull, C. et al. The 100 000 Genomes Project: bringing whole genome sequencing to the NHS. BMJ 361, k1687 (2018). - PubMed
-
- Cataldo, L. R. et al. MAFA and MAFB regulate exocytosis-related genes in human β-cells. Acta Physiol. 234, e13761 (2022).
-
- Kang, L. et al. Munc13-1 is required for the sustained release of insulin from pancreatic beta cells. Cell Metab. 3, 463–468 (2006). - PubMed
-
- Nguengang Wakap, S. et al. Estimating cumulative point prevalence of rare diseases: analysis of the Orphanet database. Eur. J. Hum. Genet. 28, 165–173 (2020). - PubMed
LinkOut - more resources
Full Text Sources