Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Dec 5;4(1):100166.
doi: 10.1016/j.xhgg.2022.100166. eCollection 2023 Jan 12.

Prioritization of non-coding elements involved in non-syndromic cleft lip with/without cleft palate through genome-wide analysis of de novo mutations

Affiliations

Prioritization of non-coding elements involved in non-syndromic cleft lip with/without cleft palate through genome-wide analysis of de novo mutations

Hanna K Zieger et al. HGG Adv. .

Abstract

Non-syndromic cleft lip with/without cleft palate (nsCL/P) is a highly heritable facial disorder. To date, systematic investigations of the contribution of rare variants in non-coding regions to nsCL/P etiology are sparse. Here, we re-analyzed available whole-genome sequence (WGS) data from 211 European case-parent trios with nsCL/P and identified 13,522 de novo mutations (DNMs) in nsCL/P cases, 13,055 of which mapped to non-coding regions. We integrated these data with DNMs from a reference cohort, with results of previous genome-wide association studies (GWASs), and functional and epigenetic datasets of relevance to embryonic facial development. A significant enrichment of nsCL/P DNMs was observed at two GWAS risk loci (4q28.1 (p = 8 × 10-4) and 2p21 (p = 0.02)), suggesting a convergence of both common and rare variants at these loci. We also mapped the DNMs to 810 position weight matrices indicative of transcription factor (TF) binding, and quantified the effect of the allelic changes in silico. This revealed a nominally significant overrepresentation of DNMs (p = 0.037), and a stronger effect on binding strength, for DNMs located in the sequence of the core binding region of the TF Musculin (MSC). Notably, MSC is involved in facial muscle development, together with a set of nsCL/P genes located at GWAS loci. Supported by additional results from single-cell transcriptomic data and molecular binding assays, this suggests that variation in MSC binding sites contributes to nsCL/P etiology. Our study describes a set of approaches that can be applied to increase the added value of WGS data.

Trial registration: ClinicalTrials.gov NCT01876303.

Keywords: allelic spectrum; birth defect; muscle development; non-coding genome; orofacial clefting; rare variants; transcription factor binding.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Figure 1
Figure 1
Comparative analyses of de novo mutations (A) De novo mutations (DNMs) observed in non-syndromic cleft lip with/without cleft palate (nsCL/P) case-parent trios (red) and NCR trios (blue) were annotated according to genomic location (i.e., exonic/intronic/intergenic). Exonic DNMs were defined based on exons of protein-coding genes in the GENCODE Basic gene annotation version33.hg19, including non-coding parts of gene sequences (e.g., 3′/5′ UTRs). DNMs were equally distributed between the two cohorts. (B) DNMs were annotated with each of six distinct in silico prediction scores, and their distribution was compared between the two cohorts. No significant differences were found.
Figure 2
Figure 2
Enrichment of non-syndromic cleft lip with/without cleft palate de novo mutations in genomic candidate regions (A) DNMs were mapped in eight chromatin states derived from human neural crest cells (hNCCs), cranial neural crest cells (cNCCs), and human embryonic facial tissue. FunciVar enrichment results are indicated by dot color. Dot sizes illustrate enrichment probabilities (increasing values represent increased statistical significance), and significant findings are encircled. (B) Non-coding elements with previous evidence for functional relevance were retrieved from conserved non-coding elements (CNEs) and enhancer activity assays from VISTA (n=16 tissues). DNMs mapping to these regions were tested for n enrichment in nsCL/P using FunciVar, similar to (A), and enrichment was depicted with their respective 95% credible interval (dots indicate median). The gray dashed line indicates a difference of zero. (C) DNMs were mapped within boundaries of topologically associating domains (TADs), and a subset of 45 TADs was defined based on the presence of associated common nsCL/P risk variants (TADsGWAS). Two loci (4q28.1, 2p21PKDCC, see panel D) carried significantly more DNMs in nsCL/P. TAD boundaries are highlighted in green, with surrounding regions in gray. Gene locations are shown in yellow, together with GWAS-SNPs (dot) and GWAS credible SNP regions (bar) in blue. The positions of DNMs are indicated in red for nsCL/P and dark blue for NCR cohort. Two superimposed DNMs at 4q28.1 are indicated by an asterisk (∗). (D) Same graphical depiction as in (B), except for the TADs located at the 45 nsCL/P GWAS risk loci. Nominal significant p values are indicated with an asterisk (∗), and p values significant after correction for 45 tests are indicated by a double asterisk (∗∗).
Figure 3
Figure 3
Identification of Musculin as a player in non-syndromic cleft lip with/without cleft palate etiology (A) Qualitative analysis of DNMs in transcription factor (TF) binding sites (TFBS). Using 810 position weight matrices from JASPAR2020, the relative enrichment of non-syndromic cleft lip with/without cleft palate (nsCL/P) DNMs was assessed using log2FC (on y axis) versus Fisher’s exact tests (−log10(p value) on x axis). Insert represents motif TFAP2a (var.3) that had log2FC ≥ 1 but lacked observations in the control cohort. (B) Quantitative assessment of allelic effects on TF binding. For each DNM, the binding change (BC) of alternative versus reference allele was assessed via the Mann-Whitney U (MWU) test (on x axis) and log2FC (on y axis, calculated using the ratio of mean change of binding between cohorts). All motifs with ≥3 hits per cohort and sufficient variability in BCs were used for MWU testing. Inserts represent motifs that lacked sufficient observations for MWU testing, but had log2FC ≥ 1 and ≥5 hits. (C–E) Single-cell transcriptomic data confirm a role for Msc during murine embryonic development. (C) Re-analysis of MOCA data (Cao et al., 2019) identified 24 cell clusters at day E11.5. (D) Expression levels for Musculin (Msc) in single-cell data from MOCA at E11.5 in cell clusters showed specific expression in myocytes (cell cluster 12 in C). Note: cluster numbers (x axis) correspond to cell cluster numbers in the UMAP plot in (C). (E) Single-cell expression data of different cell clusters of the lambdoidal junction at E11.5 are shown as dot plot. For each cell cluster, the percentage of cells expressing Msc is indicated by dot size, while the average expression level is indicated by color. This illustrates expression of Msc in palatal epithelium and maxillary prominences. (F) Nine DNMs mapped to the MSC motif (MA0665.1; seven in nsCL/P and two in NCR cohort). The sequences of the nine regions are illustrated per genomic region, as sorted according to BC, and with colored dots highlighting the cohort in which they were observed. At each position of a DNM, the allelic change is indicated in the order ref/alt.

Similar articles

Cited by

References

    1. Mangold E., Ludwig K.U., Nöthen M.M. Breakthroughs in the genetics of orofacial clefting. Trends Mol. Med. 2011;17:725–733. - PubMed
    1. Christensen K., Juel K., Herskind A.M., Murray J.C. Long term follow up study of survival associated with cleft lip and palate at birth. BMJ. 2004;328:1405. - PMC - PubMed
    1. Grosen D., Bille C., Petersen I., Skytthe A., Hjelmborg J.v.B., Pedersen J.K., Murray J.C., Christensen K. Risk of oral clefts in twins. Epidemiology. 2011;22:313–319. - PMC - PubMed
    1. Welzenbach J., Hammond N.L., Nikolić M., Thieme F., Ishorst N., Leslie E.J., Weinberg S.M., Beaty T.H., Marazita M.L., Mangold E., et al. Integrative approaches generate insights into the architecture of non-syndromic cleft lip ± cleft palate. HGG Adv. 2021;2:100038. - PMC - PubMed
    1. Basha M., Demeer B., Revencu N., Helaers R., Theys S., Bou Saba S., Boute O., Devauchelle B., Francois G., Bayet B., et al. Whole exome sequencing identifies mutations in 10% of patients with familial non-syndromic cleft lip and/or palate in genes mutated in well-known syndromes. J. Med. Genet. 2018;55:449–458. - PubMed

Publication types

Associated data