Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010 Nov 26:10:364.
doi: 10.1186/1471-2148-10-364.

A family history of DUX4: phylogenetic analysis of DUXA, B, C and Duxbl reveals the ancestral DUX gene

Affiliations

A family history of DUX4: phylogenetic analysis of DUXA, B, C and Duxbl reveals the ancestral DUX gene

Andreas Leidenroth et al. BMC Evol Biol. .

Abstract

Background: DUX4 is causally involved in the molecular pathogenesis of the neuromuscular disorder facioscapulohumeral muscular dystrophy (FSHD). It has previously been proposed to have arisen by retrotransposition of DUXC, one of four known intron-containing DUX genes. Here, we investigate the evolutionary history of this multi-member double-homeobox gene family in eutherian mammals.

Results: Our analysis of the DUX family shows the distribution of different homologues across the mammalian class, including events of secondary loss. Phylogenetic comparison, analysis of gene structures and information from syntenic regions confirm the paralogous relationship of Duxbl and DUXB and characterize their relationship with DUXA and DUXC. We further identify Duxbl pseudogene orthologues in primates. A survey of non-mammalian genomes identified a single-homeobox gene (sDUX) as a likely representative homologue of the mammalian DUX ancestor before the homeobox duplication. Based on the gene structure maps, we suggest a possible mechanism for the generation of the DUX gene structure.

Conclusions: Our study underlines how secondary loss of orthologues can obscure the true ancestry of individual gene family members. Their relationships should be considered when interpreting the relevance of functional data from DUX4 homologues such as Dux and Duxbl to FSHD.

PubMed Disclaimer

Figures

Figure 1
Figure 1
DUX family overview. Representative structures of the four intron-containing and two intronless DUX genes are shown. Open reading frame in coloured boxes, homeobox sequences in dark blue. Conserved C-terminal domain of unknown function in yellow. Dotted boxes indicate UTR with uncertainty about transcriptional start and end sites. Note that introns and exons are drawn to different scales.
Figure 2
Figure 2
Mammalian DUX distribution. Summary of our mammalian DUX catalogue. Presumed functional genes (with intact ORF across all four homeobox exons, first exon may be unidentified) in green. DUX sequences with stop codons or deleted/missing exons in well sequenced regions in red. Putative DUX homologues with unclear functional status (mainly due to gaps in assembly) in yellow. Unless marked with ?, synteny was confirmed with anchor genes. Phylogenetic relationship between species according to references[28,44].
Figure 3
Figure 3
Anchor genes. Main anchor genes used to identify syntenic regions across species. DUX genes coloured yellow, anchor genes in black. Cphx as a marker of Duxbl/DUXB duplication shown in green. Gene sizes and intergenic spacing not drawn to scale. ENSEMBL access information: Human Cphx 10q processed transcript (ENSG00000230091); Human Cphx-like 16q protein coding (ENSG00000232078); Mouse CJ057 (ENSG00000021867); Human CJ057 (ENSG00000133678).
Figure 4
Figure 4
Conserved C-terminal domain. Amino acid alignment of conserved C-terminal domain found in DUXC, DUX4 and rodent Dux. All residues up to the stop codon are shown. Shading intensity indicates % of sequences that agree with consensus at that residue. Dark grey = >80%, grey = >60%, light grey = >40%, white = 40% or fewer. Figure produced with JalView 2.5.
Figure 5
Figure 5
Duxbl genes and pseudogenes. a) Overview of Duxbl pseudogenes and intact rodent Duxbl. Intact genes in colour, pseudogenes in grey. Dotted lines contain small gaps/contig holes in assembly. Asterisk next to species name indicates lack of synteny information. Star above exon = stop codon. b) Mapping of partial primate Duxbl deletion. Genomic nucleic acid alignment based on LAGAN in mVISTA of orangutan, human and chimpanzee Duxbl pseudogene loci. Conserved sequences shown as peaks, with Duxbl exons marked by dark bars. Gorilla was not included due to incomplete assembly.
Figure 6
Figure 6
Maximum Likelihood trees. a) Tree based on concatenated homeodomain sequences (120 amino acids). b) Tree based on individual homeodomains (60 amino acids). Bootstrap values shown at internal nodes. Number after DUX name denotes homeodomain 1 or 2. Number in brackets indicates sDUX copy number.
Figure 7
Figure 7
Single-homeobox sDUX. a) Schematics of identified sDUX exons. Asterisk denotes missing synteny information. Number in brackets corresponds to copy number in diagram below. b) Map of sDUX genes relative to anchors. All locations, genes sizes and distances drawn to scale. Red arrows = two sDUX exons including intron. Orientation of arrow indicates strand orientation. Distance between lizard Anxa11 and CJ057 too large to show to scale. c) Amino acid alignment of homeodomain. Duxbl homeodomains 1 and 2 included for comparison. Arrow marks splice junction. Conserved residues shaded as in Figure 4.
Figure 8
Figure 8
Relationship of sDUX to known human PRD class homeodomains. Maximum Likelihood tree based on single homeodomains. Apart from sDUX, all homeodomains are from the human orthologues (data extracted from HomeoDB[23]). Number in brackets indicates sDUX copy number. sDUX homeodomains cluster with DUX.
Figure 9
Figure 9
Homeobox duplication in DUX genes. Hypothetical scenario of homeobox duplication at the sDUX locus. As an illustrative example, the two tandem wallaby sDUX copies are shown with all sizes and distances to scale. A deletion could join the two genes leaving a single intact DUX ancestral gene. New intron redrawn at 1/8th size (intron scale). Subsequent splicing changes could result in intron gain. Mouse Duxbl is shown for comparison.

References

    1. Clapp J, Mitchell LM, Bolland DJ, Fantes J, Corcoran AE, Scotting PJ, Armour JAL, Hewitt JE. Evolutionary conservation of a coding function for D4Z4, the tandem DNA repeat mutated in facioscapulohumeral muscular dystrophy. Am J Hum Genet. 2007;81:264–279. doi: 10.1086/519311. - DOI - PMC - PubMed
    1. Gehring WJ, Affolter M, Burglin T. Homeodomain proteins. Annu Rev Biochem. 1994;63:487–526. doi: 10.1146/annurev.bi.63.070194.002415. - DOI - PubMed
    1. de Greef JC, Frants RR, van der Maarel SM. Epigenetic mechanisms of facioscapulohumeral muscular dystrophy. Mutat Res-Fund Mol M Mutat. 2008;647:94–102. doi: 10.1016/j.mrfmmm.2008.07.011. - DOI - PMC - PubMed
    1. Hewitt JE, Lyle R, Clark LN, Valleley EM, Wright TJ, Wijmenga C, Vandeutekom JCT, Francis F, Sharpe PT, Hofker M. et al.Analysis of the tandem repeat locus D4Z4 associated with facioscapulohumeral muscular-dystrophy. Hum Mol Genet. 1994;3:1287–1295. doi: 10.1093/hmg/3.8.1287. - DOI - PubMed
    1. Gabriels J, Beckers MC, Ding H, De Vriese A, Plaisance S, van der Maarel SM, Padberg GW, Frants RR, Hewitt JE, Collen D, Belayew A. Nucleotide sequence of the partially deleted D4Z4 locus in a patient with FSHD identifies a putative gene within each 3.3 kb element. Gene. 1999;236:25–32. doi: 10.1016/S0378-1119(99)00267-X. - DOI - PubMed

Publication types

Substances