Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 May 9;12(1):7576.
doi: 10.1038/s41598-022-11589-8.

Massively parallel identification of functionally consequential noncoding genetic variants in undiagnosed rare disease patients

Affiliations

Massively parallel identification of functionally consequential noncoding genetic variants in undiagnosed rare disease patients

Jasmine A McQuerry et al. Sci Rep. .

Abstract

Clinical whole genome sequencing has enabled the discovery of potentially pathogenic noncoding variants in the genomes of rare disease patients with a prior history of negative genetic testing. However, interpreting the functional consequences of noncoding variants and distinguishing those that contribute to disease etiology remains a challenge. Here we address this challenge by experimentally profiling the functional consequences of rare noncoding variants detected in a cohort of undiagnosed rare disease patients at scale using a massively parallel reporter assay. We demonstrate that this approach successfully identifies rare noncoding variants that alter the regulatory capacity of genomic sequences. In addition, we describe an integrative analysis that utilizes genomic features alongside patient clinical data to further prioritize candidate variants with an increased likelihood of pathogenicity. This work represents an important step towards establishing a framework for the functional interpretation of clinically detected noncoding variants.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Figure 1
Figure 1
Identification of rare genetic variants from an undiagnosed rare disease cohort. (a) Gender composition of proband cohort. (b) Age distribution of proband cohort. (c) Prevalence of clinical features associated with selected patients. (d) Frequency of rare variants detected within 100 kb of a TSS. (e) Distance to nearest TSS for detected rare variants.
Figure 2
Figure 2
Profiling the regulatory capacity of genomic sequences using a massively parallel reporter assay. (a) Schematic of MPRA oligonucleotide library design. (b) Schematic of MPRA vector design and cloning. (c) Abundance of MPRA library elements corresponding to genomic sequences represented in the MPRA plasmid pool (pDNA, technical replicates) and expressed in HEK293T (cDNA, biological replicates). (d) Reproducibility of MPRA library element detection across biological replicates. (e) Regulatory activity of reference genome sequences profiled in MPRA (red = significant regulatory activity). (f) Fraction of reference genome sequences that display significant regulatory activity in MPRA. (g) Distance to nearest TSS for reference genome sequences that display significant regulatory activity.
Figure 3
Figure 3
Rare genetic variants alter the regulatory capacity of genomic sequences. (a) Distribution of expression differences between reference and variant alleles profiled in MPRA (red points denote variants with an expression difference |z-score| > 2). (b) Fraction of profiled rare variants that alter regulatory capacity of genomic sequences. (c) Distance to nearest TSS for variants that alter the regulatory capacity of genomic sequences (|z-score| > 2). (d) Heatmap showing regulatory capacity of all possible alleles corresponding to de novo variants that display altered regulatory capacity (|z-score| > 2). (e) Aggregated regulatory capacity of genomic sequences shown in (d) by allele type.
Figure 4
Figure 4
Integrative prediction of rare variant pathogenicity. (a) Significance analysis of HPO term overlap between probands and variant-associated transcription factors. (b) Schematic of transcription factor binding at candidate variant position. (c) MPRA analysis of candidate variant. (d) Volcano plot showing transcriptome-wide expression changes following CRISPRi-mediated repression of KDM5B. (e) Volcano plot showing transcriptome-wide expression changes following CRISPRi-mediated repression of variant site. (f) Significance analysis of overlapping expression changes following KDM5B or variant site repression. (g) MPRA analysis, transcription factor binding, and overlapping HPO terms for candidate variants.

References

    1. Short PJ, et al. De novo mutations in regulatory elements in neurodevelopmental disorders. Nature. 2018;555:611–616. doi: 10.1038/nature25983. - DOI - PMC - PubMed
    1. Brandler WM, et al. Paternally inherited cis-regulatory structural variants are associated with autism. Science. 2018;360:327–331. doi: 10.1126/science.aan2261. - DOI - PMC - PubMed
    1. Soden SE, et al. Effectiveness of exome and genome sequencing guided by acuity of illness for diagnosis of neurodevelopmental disorders. Sci. Transl. Med. 2014;6:265ra168. doi: 10.1126/scitranslmed.3010076. - DOI - PMC - PubMed
    1. Yang Y, et al. Molecular findings among patients referred for clinical whole-exome sequencing. JAMA. 2014;312:1870–1879. doi: 10.1001/jama.2014.14601. - DOI - PMC - PubMed
    1. Lee H, et al. Clinical exome sequencing for genetic identification of rare Mendelian disorders. JAMA. 2014;312:1880–1887. doi: 10.1001/jama.2014.14604. - DOI - PMC - PubMed

Publication types