Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2018 Mar 1;17(2):116-137.
doi: 10.1093/bfgp/ely004.

ChIP-ping the branches of the tree: functional genomics and the evolution of eukaryotic gene regulation

Review

ChIP-ping the branches of the tree: functional genomics and the evolution of eukaryotic gene regulation

Georgi K Marinov et al. Brief Funct Genomics. .

Abstract

Advances in the methods for detecting protein-DNA interactions have played a key role in determining the directions of research into the mechanisms of transcriptional regulation. The most recent major technological transformation happened a decade ago, with the move from using tiling arrays [chromatin immunoprecipitation (ChIP)-on-Chip] to high-throughput sequencing (ChIP-seq) as a readout for ChIP assays. In addition to the numerous other ways in which it is superior to arrays, by eliminating the need to design and manufacture them, sequencing also opened the door to carrying out comparative analyses of genome-wide transcription factor occupancy across species and studying chromatin biology in previously less accessible model and nonmodel organisms, thus allowing us to understand the evolution and diversity of regulatory mechanisms in unprecedented detail. Here, we review the biological insights obtained from such studies in recent years and discuss anticipated future developments in the field.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Patterns and mechanisms of regulatory element, transcription factor occupancy and gene expression evolution. (A) Patterns of conservation and divergence. Functional conservation of gene expression (top row) can be achieved through conservation of both regulatory elements and transcription factor occupancy (left), but it can also be maintained in the presence of significant turnover, both of occupancy and at the sequence level, either through replacement of individual transcription factors occupying an orthologous regulatory region (right) or by the evolution of nonorthologous regulatory regions (middle). Loss of regulatory elements and alterations in transcription factor occupancy can also lead to changes in gene expression (bottom row). (B and C) Mechanisms of transcription factor occupancy and regulatory element evolution. (B) Mechanisms of loss of transcription factor binding: loss of cognate motif (top), loss of cofactor binding (middle), loss of pioneer factor binding (bottom); (C) Mechanisms of gain of transcription factor binding and de novo evolution of regulatory elements: duplication and/or sub/neofunctionalization, direct de novo exaptation of existing intergenic space, insertion of TEs, exaptation of ancestral TE sequences.
Figure 2.
Figure 2.
Theoretical expectations regarding the evolutionary dynamics of transcription factor occupancy and interpretations of existing data sets. (A). Population genetic theoretical considerations lead to an expectation of faster turnover of regulatory sites in organisms with low effective population sizes (Ne) than in species with large Ne. (B) Distribution of effective population size values in some of the main model systems [68]. Shown is the product Neμ of Ne and the mutation rate μ, which can be most directly estimated empirically, unlike Ne alone [68, 69]. (C) The compilation of individual studies summarized in [47] suggested higher rates of regulatory site turnover in mammals than in flies. (D). In contrast, a reanalysis of several data sets in flies and mammals using a uniform data processing pipeline found similar rates of regulatory divergence within the two groups [70].
Figure 3.
Figure 3.
Examples of using ChIP-seq to map GRNs in development and evolution. (A). Cataloging enhancers involved in patterning the morphology of bat limbs. The forelimb bud of bats develops into elongated webbed wings, while the hindlimb bud produces much shorter legs. H3K27ac and H3K27me3 ChIP-seq and RNA-seq were applied to developing limb buds to chart the developmental enhancer landscape involved in the specification of these structures [82]. (B) Understanding the role of limb-specific enhancers in snake evolution. Snakes lack legs, but H3K27ac ChIP-seq in mouse and lizard embryos and comparative genomics reveal that a substantial portion of limb enhancers are in fact conserved in snakes, one major reason for which is their role during phallus development, where a similar developmental program is deployed [83].
Figure 4.
Figure 4.
Major eukaryotic clades, their epigenomic characterization and the origins of multicellularity. The tree shown follows previously published topologies [112], but it should be noted that the precise deep branching is still to be fully resolved by future phylogenomic studies. Red rounded rectangles are placed next to the lineages in which chromatin and transcriptional biology have been studied in considerable detail. Pink rounded rectangles are placed next to clades for some representatives of which initial epigenomic studies have been carried out in some detail. Clades in which multicellularity has evolved are indicated with three circles where multicellularity results from cell division, and by three triangles where multicellularity is aggregative [113]. The lineages containing nucleomorphs (the chlorarachniyophytes and the cryptophytes) are indicated with ‘NM’.
Figure 5.
Figure 5.
Relationships between metazoan phyla and their closest relatives and the extent of their epigenomic characterization. Taxons for which the correct phylogenetic relationships is still to be conclusively established (such as the position of Ctenophora relative to other metazoans [201–205], the monophyly of poriferans and the overall relationships between non-bilaterian clades [206] and the placement of Chaetognatha [207], Acoelomorpha [208, 209], Xenoturbellida [209, 210] and others) are incorporated as unresolved branches on the cladogram, with Orthonectida and Dicyemida (tentatively placed in Lophotrochozoa [211]) being omitted. Phyla in which at least some epigenomic studies have been carried out are indicated on the right-hand side with the corresponding genus names for the main taxa studied. Lineages without published sequenced genomes have been left blank. Note that CTCF has been lost in some nematodes such as C. elegans but is present in other members of the phylum.

Similar articles

Cited by

References

    1. Gilmour DS, Lis JT.. In vivo interactions of RNA polymerase II with genes of Drosophila melanogaster. Mol Cell Biol 1985;5(8):2009–18. - PMC - PubMed
    1. Gilmour DS, Lis JT.. Detecting protein-DNA interactions in vivo: distribution of RNA polymerase on specific bacterial genes. Proc Natl Acad Sci USA 1984;81(14):4275–9. - PMC - PubMed
    1. Solomon MJ, Larsen PL, Varshavsky A.. Mapping protein-DNA interactions in vivo with formaldehyde: evidence that histone H4 is retained on a highly transcribed gene. Cell 1988;53(6):937–47. - PubMed
    1. Hecht A, Strahl-Bolsinger S, Grunstein M.. Spreading of transcriptional repressor SIR3 from telomeric heterochromatin. Nature 1996;383(6595):92–6. - PubMed
    1. Ren B, Robert F, Wyrick JJ, et al.Genome-wide location and function of DNA binding proteins. Science 2000;290(5500):2306–9. - PubMed

Publication types

Substances

LinkOut - more resources