Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Jun 3;16(11):1093-1103.
doi: 10.1080/15384101.2017.1315494. Epub 2017 Apr 25.

Novel clades of the HU/IHF superfamily point to unexpected roles in the eukaryotic centrosome, chromosome partitioning, and biologic conflicts

Affiliations

Novel clades of the HU/IHF superfamily point to unexpected roles in the eukaryotic centrosome, chromosome partitioning, and biologic conflicts

A Maxwell Burroughs et al. Cell Cycle. .

Abstract

The HU superfamily of proteins, with a unique DNA-binding mode, has been extensively studied as the primary chromosome-packaging protein of the bacterial superkingdom. Representatives also play a role in DNA-structuring during recombination events and in eukaryotic organellar genome maintenance. However, beyond these well-studied roles, little is understood of the functional diversification of this large superfamily. Using sensitive sequence and structure analysis methods we identify multiple novel clades of the HU superfamily. We present evidence that a novel eukaryotic clade prototyped by the human CCDC81 protein acquired roles beyond DNA-binding, likely in protein-protein interaction in centrosome organization and as a potential cargo-binding protein in conjunction with Dynein-VII. We also show that these eukaryotic versions were acquired via an early lateral transfer from bacteroidetes, where we predict a role in chromosome partition. This likely happened before the last eukaryotic common ancestor, pointing to potential endosymbiont contributions beyond that of the mitochondrial progenitor. Further, we show that the dramatic lineage-specific expansion of this domain in the bacteroidetes lineage primarily is linked to a functional shift related to potential recognition and preemption of genome invasive entities such as mobile elements. Remarkably, the CCDC81 clade has undergone a similar massive lineage-specific expansion within the archosaurian lineage in birds, suggesting a possible use of the HU superfamily in a similar capacity in recognition of non-self molecules even in this case.

Keywords: CCDC81; HU; IHF; bacteroidetes; biological conflict; birds; centrosome.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Structural and sequence overview of the HU superfamily. (A-D) Cartoon renderings of HU superfamily members. (A) Integration host factor (IHF) α and β in complex with DNA (PDB: 1IHF). IHFα and IHFβ are represented as ribbons and colored green and blue, respectively. DNA is shown as a surface trace in gray. (B) IHFα (PDB: 1IHF_A). (C) HU-HIG clade homodimer from Bacteroides vulgatus with a C-terminal Ig-like domain fusion (PDB: 4FMR). Coloring as in (A) above. The region corresponding to the Ig-like domain is shown as a superimposed ribbon with surface representation colored in gray. (D) HU domain from chain A of Bacteroides vulgatus HU homolog (PDB: 4FMR_A). The domains are colored and labeled as in (B), with additional secondary structure elements colored white. (E) Multiple sequence alignment of the HU superfamily. Secondary structure provided in top line, with elements labeled to correspond with (B). Positions shown to interact with DNA are denoted by asterisks. Sequences are labeled to left with NCBI accession number and organism abbreviation separated by rightmost underscore; HU family/clade names are given to the right. Negative numbers at left indicate extension of predicted protein start sites in GenBank. The alignment is colored as follows: h, hydrophobic and yellow; l, aliphatic and yellow; s, small and green; p, polar and blue; u, tiny and green. Organism abbreviations: Esili, Ectocarpus siliculosus; Pfalc, Plasmodium falciparum; Otaur, Ostreococcus tauri; Ehuxl, Emiliania huxleyi; Gsulp, Galdieria sulphuraria; Plunu, Pyrocystis lunula; Kvene, Karlodinium veneficum; Acart, Amphidinium carterae; Ptetr, Paramecium tetraurelia; Tcruz, Trypanosoma cruzi; Ssalm, Spironucleus salmonicida; Ngrub, Naegleria gruberi; Mcomm, Micromonas commoda; Hsapi, Homo sapiens; Nvect, Nematostella vectensis; Aplat, Anas platyrhynchos; Ggall, Gallus gallus; Pging, Porphyromonas gingivalis; Ctrac, Chlamydia trachomatis; Cbact, Cytophagaceae bacterium; Bsp, Bacteroides sp, Dsp, Dysgonomonas sp; Bfrag, Bacteroides fragilis; Gbact, Gallionellales bacterium; Tdent, Treponema denticola; Fsp, Flavobacterium sp; Prumi, Prevotella ruminicola; Btimo, Bacteroides timonensis; Bfine, Bacteroides finegoldii; Pgula, Porphyromonas gulae; Psp, Prevotella sp.
Figure 2.
Figure 2.
Phylogenetic relationships and genome associations in HU-CCDC81 and HU-HIG families. (A) Phylogenetic tree depicting higher-level relationships between HU families/clades described in this study. Branches are collapsed at levels containing clearly-delineated monophyletic groups, labeled to the right. Nodes with greater than 65% bootstrap support are marked with yellow circle. Representative conserved domain architectures and gene neighborhoods in a given clade provided to the right. (For complete list see Supplemental Material). Phyletic patterns of a given architecture/neighborhood are found provided to the right in green lettering. Phylogeny abbreviations: b, bacteroidetes; C, Chlamydia; P, Porphyromonas; s, spirochaetes; β, β-proteobacterial; γ, γ-proteobacteria; δ, δ-proteobacteria; v, verrucomicrobia; a, animals; k, kinetoplastids; api, apicomplexa; diplo, diplomonads; N, Naegleria; cili, ciliates; chloro, chlorophytes; stram, stramenopiles; SAR, stramenopile-alveolate-rhizarian group; Phy, Phytopthora; o, oomycetes; G, Guillardia. (B) Phylogenetic tree depicting the multiple paralogs identified in avian expansion of HU-CCDC81 domains. Monophyletic clades, as determined by phyletic distribution conservation patterns, are collapsed and then labeled and colored according to evolutionary depth. Nodes with greater than 70% bootstrap support are marked with yellow circle. Potential lineage-specific expansions within a clade are labeled with total number of non-redundant protein copies and phyletic patterns. (C) Phylogenetic tree depicting rampant LSEs, gene loss, and incomplete lineage-sorting in HU-HIG family based on a set of all HU-HIG sequences retrieved from the 10 bacteroidetes genomes, listed in key to the right, with the highest number of identifiable HU-HIG sequences. Branch coloring in tree corresponds to genome name colors in key. Domain architectures typical of sequences in clustered branches ring the tree, see (A) for explanation of architecture depictions. Complete trees provided in Newick format in Supplemental Material.
Figure 3.
Figure 3.
Positional entropy and sequence diversity comparisons. (A) Positional entropy comparison between Gallus and Meleagris HU-CCDC81 domains (galliform birds) and primate and rodent HU-CCDC81 domains. Entropy values calculated as described in Materials and Methods. (B) Entropy values from (A) plotted along linear sequence of HU-CCDC81 domain, secondary structure provided below and labeled in concordance with Fig. 1(B). (C-F) Sequence diversity plots comparing pairwise sequence evolutionary distances (see Materials and Methods) within representatives of labeled HU families, y-axes set to log scale. Differences in boxplots (A, C-F) are significant (p < 2.2e-16) by Wilcoxon rank sum test.

Similar articles

Cited by

References

    1. Dillon SC, Dorman CJ. Bacterial nucleoid-associated proteins, nucleoid structure and gene expression. Nat Rev Microbiol 2010; 8:185-95; PMID:20140026; https://doi.org/10.1038/nrmicro2261 - DOI - PubMed
    1. Reeve JN, Bailey KA, Li WT, Marc F, Sandman K, Soares DJ. Archaeal histones: structures, stability and DNA binding. Biochem Soc Trans 2004; 32:227-30; PMID:15046577; https://doi.org/10.1042/bst0320227 - DOI - PubMed
    1. Sandman K, Reeve JN. Archaeal chromatin proteins: different structures but common function? Curr Opin Microbiol 2005; 8:656-61; PMID:16256418; https://doi.org/10.1016/j.mib.2005.10.007 - DOI - PubMed
    1. Jones DO, Cowell IG, Singh PB. Mammalian chromodomain proteins: their role in genome organisation and expression. Bioessays 2000; 22:124-37; PMID:10655032; https://doi.org/10.1002/(SICI)1521-1878(200002)22:2%3c124::AID-BIES4%3e3... - DOI - PubMed
    1. Luijsterburg MS, White MF, van Driel R, Dame RT. The major architects of chromatin: architectural proteins in bacteria, archaea and eukaryotes. Crit Rev Biochem Mol Biol 2008; 43:393-418; PMID:19037758; https://doi.org/10.1080/10409230802528488 - DOI - PubMed

Substances