Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation

Rare disease gene association discovery from burden analysis of the 100,000 Genomes Project data

Valentina Cipriani et al. medRxiv. .

Update in

  • Rare disease gene association discovery in the 100,000 Genomes Project.
    Cipriani V, Vestito L, Magavern EF, Jacobsen JOB, Arno G, Behr ER, Benson KA, Bertoli M, Bockenhauer D, Bowl MR, Burley K, Chan LF, Chinnery P, Conlon PJ, Costa MA, Davidson AE, Dawson SJ, Elhassan EAE, Flanagan SE, Futema M, Gale DP, García-Ruiz S, Corcia CG, Griffin HR, Hambleton S, Hicks AR, Houlden H, Houlston RS, Howles SA, Kleta R, Lekkerkerker I, Lin S, Liskova P, Mitchison HH, Morsy H, Mumford AD, Newman WG, Neatu R, O'Toole EA, Ong ACM, Pagnamenta AT, Rahman S, Rajan N, Robinson PN, Ryten M, Sadeghi-Alavijeh O, Sayer JA, Shovlin CL, Taylor JC, Teltsh O, Tomlinson I, Tucci A, Turnbull C, van Eerde AM, Ware JS, Watts LM, Webster AR, Westbury SK, Zheng SL, Caulfield M, Smedley D. Cipriani V, et al. Nature. 2025 Feb 26. doi: 10.1038/s41586-025-08623-w. Online ahead of print. Nature. 2025. PMID: 40011789

Abstract

To discover rare disease-gene associations, we developed a gene burden analytical framework and applied it to rare, protein-coding variants from whole genome sequencing of 35,008 cases with rare diseases and their family members recruited to the 100,000 Genomes Project (100KGP). Following in silico triaging of the results, 88 novel associations were identified including 38 with existing experimental evidence. We have published the confirmation of one of these associations, hereditary ataxia with UCHL1 , and independent confirmatory evidence has recently been published for four more. We highlight a further seven compelling associations: hypertrophic cardiomyopathy with DYSF and SLC4A3 where both genes show high/specific heart expression and existing associations to skeletal dystrophies or short QT syndrome respectively; monogenic diabetes with UNC13A with a known role in the regulation of β cells and a mouse model with impaired glucose tolerance; epilepsy with KCNQ1 where a mouse model shows seizures and the existing long QT syndrome association may be linked; early onset Parkinson's disease with RYR1 with existing links to tremor pathophysiology and a mouse model with neurological phenotypes; anterior segment ocular abnormalities associated with POMK showing expression in corneal cells and with a zebrafish model with developmental ocular abnormalities; and cystic kidney disease with COL4A3 showing high renal expression and prior evidence for a digenic or modifying role in renal disease. Confirmation of all 88 associations would lead to potential diagnoses in 456 molecularly undiagnosed cases within the 100KGP, as well as other rare disease patients worldwide, highlighting the clinical impact of a large-scale statistical approach to rare disease gene discovery.

PubMed Disclaimer

Publication types

LinkOut - more resources