Systematic identification of disease-causing promoter and untranslated region variants in 8040 undiagnosed individuals with rare disease
- PMID: 40229884
- PMCID: PMC11998461
- DOI: 10.1186/s13073-025-01464-2
Systematic identification of disease-causing promoter and untranslated region variants in 8040 undiagnosed individuals with rare disease
Abstract
Background: Both promoters and untranslated regions (UTRs) have critical regulatory roles, yet variants in these regions are largely excluded from clinical genetic testing due to difficulty in interpreting pathogenicity. The extent to which these regions may harbour diagnoses for individuals with rare disease is currently unknown.
Methods: We present a framework for the identification and annotation of potentially deleterious proximal promoter and UTR variants in known dominant disease genes. We use this framework to annotate de novo variants (DNVs) in 8040 undiagnosed individuals in the Genomics England 100,000 genomes project, which were subject to strict region-based filtering, clinical review, and validation studies where possible. In addition, we performed region and variant annotation-based burden testing in 7862 unrelated probands against matched unaffected controls.
Results: We prioritised eleven DNVs and identified an additional variant overlapping one of the eleven. Ten of these twelve variants (82%) are in genes that are a strong match to the individual's phenotype and six had not previously been identified. Through burden testing, we did not observe a significant enrichment of potentially deleterious promoter and/or UTR variants in individuals with rare disease collectively across any of our region or variant annotations.
Conclusions: Whilst screening promoters and UTRs can uncover additional diagnoses for individuals with rare disease, including these regions in diagnostic pipelines is not likely to dramatically increase diagnostic yield. Nevertheless, we provide a framework to aid identification of these variants.
Keywords: Non-coding; Promoters; Rare disease; Regulatory regions; Splicing; Untranslated regions.
© 2025. Crown.
Conflict of interest statement
Declarations. Ethics approval and consent to participate: Ethics approval was granted by the HRA Committee East of England – Cambridge South (REC Ref 14/EE/1112). This research conforms to the principles of the Helsinki Declaration.All participants provided informed consent for their data to be part of the National Genomics Research Library and for use in research and publication. Consent for publication: All participants in this study have provided consent for their data to be part of the National Genomics Research Library and to the publication of research findings. For all cases, written informed consent for research use of clinical and genetic data was obtained from patients, their parents, or legal guardians in the case of those with intellectual disability. Competing interests: The authors declare no competing interests.
Figures
Update of
-
Systematic identification of disease-causing promoter and untranslated region variants in 8,040 undiagnosed individuals with rare disease.medRxiv [Preprint]. 2023 Sep 12:2023.09.12.23295416. doi: 10.1101/2023.09.12.23295416. medRxiv. 2023. Update in: Genome Med. 2025 Apr 14;17(1):40. doi: 10.1186/s13073-025-01464-2. PMID: 37745552 Free PMC article. Updated. Preprint.
