Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Sep 2;26(10):107823.
doi: 10.1016/j.isci.2023.107823. eCollection 2023 Oct 20.

Antagonism among DUX family members evolved from an ancestral toxic single homeodomain protein

Affiliations

Antagonism among DUX family members evolved from an ancestral toxic single homeodomain protein

Darko Bosnakovski et al. iScience. .

Abstract

Double homeobox (DUX) genes are unique to eutherian mammals, expressed transiently during zygotic genome activation (ZGA) and involved in facioscapulohumeral muscular dystrophy (FSHD) and cancer when misexpressed. We evaluate the 3 human DUX genes and the ancestral single homeobox gene sDUX from the non-eutherian mammal, platypus, and find that DUX4 cytotoxicity is not shared with DUXA or DUXB, but surprisingly is shared with platypus sDUX, which binds DNA as a homodimer and activates numerous ZGA genes and long terminal repeat (LTR) elements. DUXA, although transcriptionally inactive, has DNA binding overlap with DUX4, and DUXA-VP64 activates DUX4 targets and is cytotoxic. DUXA competition antagonizes the activity of DUX4 on its target genes, including in FSHD patient cells. Since DUXA is a DUX4 target gene, this competition potentiates feedback inhibition, constraining the window of DUX4 activity. The DUX gene family therefore comprises antagonistic members of opposing function, with implications for their roles in ZGA, FSHD, and cancer.

Keywords: Developmental genetics; Evolutionary developmental biology; Molecular interaction; Molecular toxicology.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

None
Graphical abstract
Figure 1
Figure 1
Platypus sDUX displays functional overlap with DUX4 (A) Evolution of the DUX gene family from an ancestral single homeobox gene in the common ancestor to eutherian mammals. (B) Sequence dendrogram of DUX homeodomains. DUX family members are shaded yellow (DUXA), green (DUXB), and blue (DUXC); and sDUX is shaded red. (C) Morphology of LHCN-M2isDUX human myoblasts in the absence (left) or presence of 100 ng/mL dox for 4 days to induce sDUX (right). Scale bar, 100 μm. (D) EdU incorporation in LHCN-M2isDUX human myoblasts after 6 h in the absence (blue) or presence (red) of 500 ng/mL dox to induce sDUX. X axis, counts; Y axis, fluorescence. (E) Annexin V staining in LHCN-M2isDUX human myoblasts after 24 h in the absence (blue) or presence (red) of 500 ng/mL dox to induce sDUX. X axis, counts; Y axis, fluorescence. (F) Sequence comparison of the sDUX homeodomain with HD1 and HD2 of the 3 human DUX family members, DUXC from elephant and mouse, and mouse DUXBL. Known or predicted DNA binding regions are highlighted in blue, and residues known or predicted to have specific nucleotide base contacts are underlined. Yellow shading indicates the E and R residues responsible for TAAT or TGAT specificities of DUX4 HD1 and HD2, resp. Asterisks indicate positions with a conserved residue, colons indicate conservation of strongly similar biochemical properties, periods indicate conservation between amino acids of weakly similar properties. (G) Real time RTqPCR for ZSCAN4, LEUTX, and PRAMEF1 in LHCN-M2iDUX4 parent and LHCN-M2-is DUX myoblasts induced with 200 ng/mL doxycycline for 14 h. Data are presented as mean ± SEM; ∗p < 0.05, ∗∗∗p < 0.01∗∗, ∗∗∗∗p < 0.0001 by one-way ANOVA, n = 3. (H) Luciferase assays in comparing activity of DUX4 and sDUX on different versions of the double homeodomain motif in 293T cells exposed to 500 ng/mL dox for 24 h. The core HD binding sites are underlined. DUX4 and DUX cores are separated by 3 nt while the PAX7 recognition motif cores are separated by 2 nt. Copy number of motifs indicated by 1x or 2x. Data are presented as mean ± SEM; n = 4. (I) Viability assay for 293T cells after 48 h of expression (100 ng/mL dox) of full length sDUX vs. sDUXΔC, lacking the C-terminal 74 amino acids. Data are presented as mean ± SEM; n = 3. (J) Real time RTqPCR for DUX4 target genes LEUTX and TRIM43 after expression of sDUX or sDUXΔC in the same cells in panel I (500 ng/mL, 24 h). Data are presented as mean ± SEM; n = 3. (K) Representative western blots with acetylation specific-antibodies in human myoblasts expressing sDUX and treated with the p300/CBP-specific HAT inhibitor, iP300w. LHCN-isDUX cells were induced with 200 ng/mL doxycycline and concurrently treated with 0.3 μM iP300w for 12 h. (L) Real time RTqPCR for sDUX target genes RFPL4B and ZSCAN4 in the presence or absence of iP300w on the cell presented in panel (K) Data are presented as mean ± SEM; n = 3.
Figure 2
Figure 2
Structural basis of sDUX interaction with DNA (A) Binding of DUX4 HD1-HD2 and sDUX HD to the non-palindromic (TAAT---ATCA) and palindromic (TGAT---ATCA or TAAT---ATTA) motifs tested in EMSA. All DNA substrates were double-stranded DNA. Sequence of the fluorescently labeled strand is shown above each gel. Gels are representative of duplicates. (B) Binding of DUX4 HD1-HD2 and sDUX HD to the TAAT and TGAT half-motifs or a non-palindromic motif with 2 nt spacing (N2) tested in EMSA. Gels are representative of duplicates. (C) Binding of sDUX HD to the three different types of DNA substrates, highlighting distinct mobilities of dimeric vs. monomeric complex in EMSA. (D) Crystal structure of sDUX homodimer bound to the non-palindromic target motif with 3 nt spacing, the structure of DUX4 double homeodomain with the same DNA substrate as reported previously, and their superposition. (E) Differential interaction of sDUX Arg75 with the TAAT and TGAT motifs in the crystal structure. 2Fo-Fc electron density for the structure with Br-dU-containing DNA is shown as blue mesh, contoured at 0.9 σ.
Figure 3
Figure 3
Toxicity and myogenic differentiation effects of DUX family members and VP64-fusions (A) Viability (ATP) assay in LHCN-M2 human myoblasts after 96 h of expression of different DUX family proteins induced with 200 ng/mL doxycycline. Data are presented as fold difference to the control (uninduced cells); n = 4. (B) Luciferase assays in 293T cells induced with 1 μg/mL dox for 24 h to express various DUX family proteins and VP64 derivatives on the 3 versions of DUX motif (N3 and P3) as well as a motif with spacing of 2 nt (P2-type). Data are presented as mean ± SEM; n = 4. (C) Heatmap of the top DUX4 target genes showing expression in cells expressing various DUX family members and VP64 derivatives after 12 h of induction (200 ng/mL dox) in LHCN-M2 human myoblasts. (D) Heatmap of the top myogenic genes affected by DUX4, and the effect of other DUX family members, and DUXA- and DUXB-VP64 activation domain fusions on this set. Genes downregulated by DUX4 shown above; genes upregulated by DUX4 shown below. LHCN-M2 human myoblasts carrying inducible gene of interest were induced by 200 ng/mL dox for 12 h. (E) Real time RTqPCR for MYH7, a marker of differentiation, after differentiation with 72 h of expression of different DUX family proteins at 20 or 125 ng/mL dox. Data are presented as mean ± SEM; ∗∗∗p < 0.01∗∗, ∗∗∗∗p < 0.0001 by one-way ANOVA, n = 6. (F) Representative immunostaining for sarcomeric myosin heavy chain on the same cells shown in panel e. Nuclei are counterstained with DAPI. Scale bar is 100 nm.
Figure 4
Figure 4
Transcriptional profiles of DUX family members (A) Principal-component analysis of gene expression profiles in the absence (open circles) and presence (closed circles) of 200 ng/mL dox for 12 h. DUX4 expressing cells are strongly shifted along PC1 which accounts for 62% of the variance while DUXA-VP64 and sDUX-expressing cells are more prominently shifted along PC2 which contains 8% of the variance. (B) Heatmap representation of log2 transformed FPKM values for the top 50 differentially expressed genes among all factors. Expression of DUX4, DUXA, DUXB, and sDUX is shown in the upper panel. (C) Venn diagrams showing overlap in upregulated gene sets among total genes (left) and ZGA genes (right). Upregulated genes for each set were defined as having a greater than 2-fold change, and Benjamini-Hochberg adjusted p value less than 0.05 and a mean FPKM value >2.5 in the dox-induced sample. (D) Distribution of repeat classification among the differentially expressed retroviral elements for DUX4, DUXA-VP64 and sDUX. Differentially expressed elements were defined as being upregulated or downregulated greater than 2-fold and having a Benjamini-Hochberg adjusted p value less the 0.05. (E) Pearson correlations of log2 fold changes in expression upon induction with dox between each of the DUX factor. The upper left half of the matrix corresponds to conventional genes whereas the lower right half of the matrix corresponds to log2 fold changes for repetitive elements. (F) Scatterplots of the log2 fold changes in conventional expression between DUX factors. Points representing individual genes are colored according to density to show the higher density of points near the center. A linear fit of each correlation is shown as a black line. (G) Scatterplots similar to panel (F) for repetitive elements.
Figure 5
Figure 5
Chromatin changes induced by DUX family members (A) Principal-component analysis of ATAC-seq data in the absence (open circles) and presence (closed circles) of 200 ng/mL dox for 12 h. Called peaks from each factor were combined into a composite set of genomic locations that had changes in DNA accessibility in one or more samples. A matrix of the number of reads within each region across all the samples was used to calculate the principal components. (B) ATAC-seq coverage tracks for each DUX factor at the ZSCAN4 locus in the absence (gray) or presence (colored according to panel a) of dox. Each track shown is a combination of two biological replicates normalized to the total number of reads per million. The scale for each pair of tracks is shown at the right, and the gene structure is shown below. The red box corresponds to peaks that appear for some DUX factors upon the addition of dox. (C) ATAC-seq coverage similar to panel B for the PRAMEF12 locus. (D) The DNA sequences from the top 500 peaks were used to define de novo motifs for DUX4, DUXA-VP64, and sDUX. The top motif and its p value are shown. The DUX4 and DUXA-VP64 motifs are similar to the previously defined motifs for DUX4 and DUXA repectively (https://jaspar.genereg.net/). (E) The de novo motifs identified from the top 500 DUX4, DUXA-VP64, and sDUX peaks were identified across each dataset using a 95% scoring threshold. Classification of each sequence is based on the nucleotide at the 2nd and 10th position. (F) The number of sequence motifs identified in panel e per ATAC-seq peak was counted to show that peaks frequently contain multiple copies of the DUX motif in all three datasets. (G) Heatmaps of ATAC-seq coverage for peaks within 10 kb of an upregulated gene. Peaks were classified into one of seven groups depending on whether they were unique to a particular dataset or shared between two or more datasets. Groups that contained more than 100 peaks were randomly subsampled to 100 peaks. Coverage was normalized to total counts per million reads of each dataset and then plotted on the scale defined by the dox-induced dataset shown below the heatmaps.
Figure 6
Figure 6
DUXA binds many DUX4 targets and inhibits DUX4 transcriptional activity (A) Tornado plots aligning on peak centers for called DUX4 peaks (upper panel) and DUXA peaks (lower panel) for both input and IP samples for both experiments. Overlap of DUX4 and DUXA peaks (peak calling: FDR ≤ 0.1, minimum treatment reads ≥ 1, enrichment ≥ 1, log10 (q value) ≤ −0.5; not overlapping blacklisted regions like satellite repeats or misassemblies (ENCODE). (B) ChIP-seq coverage tracks for DUXA and DUX4 surrounding the ZSCAN4 (upper) and PRAMEF12 (lower) genes. Input tracks are shown in gray and IP tracks are colored uniquely. Each track shown is a combination of two normalized biological replicates. The scale for each track is shown at the right and the gene structure is shown below. Arrows indicate shared peaks. (C) Venn diagrams showing overlap in peaks (left) and in bound genes (right). Promoter = gene TSS + upstream 5000 nt + downstream 5000 nt; overlap of ≥1 nt. (D) Real time RTqPCR for DUX4 target genes in LHCN-M2iDUX4 immortalized human WT myoblasts co-expressing either DUXAiresEGFP or EGFP control, and cells in which overexpressed DUXA was knocked down by siRNA or control non targeted siRNA. Cells were induced with 200 ng/mL doxycycline for 8 h. Data are presented as mean ± SEM; ∗∗∗p < 0.001, ∗∗∗∗p < 0.0001 by two-way ANOVA, n = 3. (E) Real time RTqPCR for DUX4 target genes in immortalized human FSHD myoblasts engineered for dox-inducible for DUXA (M008-iDUXA). Cells were induced with 200 ng/mL doxycycline for 48 h in proliferation conditions (Prolif.). For differentiation (Diff.), cells were cultured in differentiation medium for 3 days. DUXA was induced (200 ng/mL dox) over the last 24 h of differentiation. Data are presented as mean ± SEM; ∗∗∗∗p < 0.0001 by two-way ANOVA, n = 3.

Update of

References

    1. Clapp J., Mitchell L.M., Bolland D.J., Fantes J., Corcoran A.E., Scotting P.J., Armour J.A.L., Hewitt J.E. Evolutionary conservation of a coding function for D4Z4, the tandem DNA repeat mutated in facioscapulohumeral muscular dystrophy. Am. J. Hum. Genet. 2007;81:264–279. - PMC - PubMed
    1. Leidenroth A., Hewitt J.E. A family history of DUX4: phylogenetic analysis of DUXA, B, C and Duxbl reveals the ancestral DUX gene. BMC Evol. Biol. 2010;10:364–1471. doi: 10.1186/1471-2148-10-364. - DOI - PMC - PubMed
    1. Gabriëls J., Beckers M.C., Ding H., De Vriese A., Plaisance S., van der Maarel S.M., Padberg G.W., Frants R.R., Hewitt J.E., Collen D., et al. Nucleotide sequence of the partially deleted D4Z4 locus in a patient with FSHD identifies a putative gene within each 3.3 kb element. Gene. 1999;236:25–32. doi: 10.1016/s0378-1119(99)00267-x. - DOI - PubMed
    1. van Overveld P.G.M., Lemmers R.J.F.L., Sandkuijl L.A., Enthoven L., Winokur S.T., Bakels F., Padberg G.W., van Ommen G.J.B., Frants R.R., et al. Hypomethylation of D4Z4 in 4q-linked and non-4q-linked facioscapulohumeral muscular dystrophy. Nat. Genet. 2003;35:315–317. - PubMed
    1. Snider L., Geng L.N., Lemmers R.J.L.F., Kyba M., Ware C.B., Nelson A.M., Tawil R., Filippova G.N., van der Maarel S.M., Tapscott S.J., et al. Facioscapulohumeral dystrophy: incomplete suppression of a retrotransposed gene. PLoS Genet. 2010;6:e1001181. doi: 10.1371/journal.pgen.1001181. - DOI - PMC - PubMed