Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015 Dec 23;11(12):852.
doi: 10.15252/msb.20156492.

Hierarchical folding and reorganization of chromosomes are linked to transcriptional changes in cellular differentiation

Affiliations

Hierarchical folding and reorganization of chromosomes are linked to transcriptional changes in cellular differentiation

James Fraser et al. Mol Syst Biol. .

Abstract

Mammalian chromosomes fold into arrays of megabase-sized topologically associating domains (TADs), which are arranged into compartments spanning multiple megabases of genomic DNA. TADs have internal substructures that are often cell type specific, but their higher-order organization remains elusive. Here, we investigate TAD higher-order interactions with Hi-C through neuronal differentiation and show that they form a hierarchy of domains-within-domains (metaTADs) extending across genomic scales up to the range of entire chromosomes. We find that TAD interactions are well captured by tree-like, hierarchical structures irrespective of cell type. metaTAD tree structures correlate with genetic, epigenomic and expression features, and structural tree rearrangements during differentiation are linked to transcriptional state changes. Using polymer modelling, we demonstrate that hierarchical folding promotes efficient chromatin packaging without the loss of contact specificity, highlighting a role far beyond the simple need for packing efficiency.

Keywords: chromatin contacts; chromosome architecture; epigenetics; gene expression; polymer modelling.

PubMed Disclaimer

Figures

Figure 1
Figure 1. Chromatin contact maps (Hi‐C) and matched gene expression (CAGE) data in murine neuronal differentiation system
  1. Scheme of the murine differentiation system of our study, from ESC to NPC and post‐mitotic neurons.

  2. Cells express stage‐specific markers as detected by immunofluorescence: ESC express Oct4, NPC the neuronal precursor marker nestin and Neurons Tubb3 (Tuj1 antibody). Scale bar, 100 μm.

  3. ESC and NPC are actively cycling, whereas Neurons are negative for BrDNA after 24‐h BrdU incorporation. Nuclei were counterstained with DAPI.

  4. Examples of interaction patterns in Hi‐C matrices across the whole chromosomes show extensive higher‐order dynamic contacts, which change during terminal neuronal differentiation. Hi‐C interaction data are plotted in log scale.

  5. Matched CAGE data sets were produced from total RNA extracted from ESC, NPC and Neurons. The expression levels confirm specific expression of stage‐specific markers. CAGE expression reported as a percentage relative to highest expression.

Figure EV1
Figure EV1. Differentiation of mouse ESC (46C line) into midbrain‐like neuronal cells
  1. Cell populations were tested by immunofluorescence staining for their purity using stage‐specific markers (pseudocoloured red), which showed efficient progression of ESC through the differentiation steps. Oct4 expression in ESC is lost upon differentiation, nestin is specifically expressed in NPC, and Tubb3 (detected using Tuj1 antibodies) is strongly expressed in Neurons. Nuclei were counterstained with DAPI (pseudocoloured blue). Scale bar represents 100 μm.

  2. Total RNA was extracted from ESC, NPC and Neurons, and directional CAGE data sets were produced in order to measure RNA transcription and define transcription start sites in each time point. Strand‐specific CAGE reads are represented (+ and – strands). The promoter regions of the stage‐specific markers are reported. CAGE signals for Oct4 (Pou5f1), nestin (Nes) and Tubb3 genes peak in ESC, NPC and Neurons, respectively.

Figure 2
Figure 2. Chromosomes are organized in a hierarchy of higher‐order domains (metaTADs)
  1. ESC Hi‐C map of chromosome 2, 53–58 Mb. The directionality index (DI, bottom) was used to identify TADs, numbered 1–6.

  2. metaTAD identification by single‐linkage clustering.

  3. Examples of TADs (1–6) and metaTADs (I–V) in the same region shown in (A).

  4. metaTADs are domains with enriched Hi‐C contacts. The ratio of average interaction, I, between pairs of TADs or metaTADs, and background value, I C, was calculated for ESC, NPC and Neurons, as a function of the total number of TADs (n) included in the metaTAD. I/I C remains 20% above control levels in randomized Hi‐C matrices up to scales of the order of n = 80 TADs.

  5. metaTADs size, d, is represented as a function of the number of TADs that they contain, n, showing that eighty TADs correspond to an average genomic length of around 40 Mb.

  6. The metaTAD tree organization in ESC versus Neurons largely coincides with stretches of compartment A (grey) or B (black). A/B compartments are represented in the two central bars and were defined based on an individual principle component (green line) derived from Hi‐C data. The yellow line indicates a value of 0.

  7. Boundaries of metaTADs larger than 10 Mb are more enriched for transitions between lamina‐associated (blue) and lamina‐detached (red) regions than TAD boundaries. Heatmaps display the 900‐kb flanking domain boundaries (dashed lines) for metaTADs (left heatmap) and TADs (right heatmap) for all boundary regions (heatmap rows). Transitions in lamina association are visible as abrupt changes in heatmap colours at boundaries (see Appendix Supplementary Methods). Both metaTAD and TAD boundaries are significantly more frequently observed to coincide with transitions than expected (P < 1 × 10−4; see Appendix Supplementary Methods).

  8. The metaTAD tree of chromosome 19 in ESC (left: full; right: zoomed region). Interactions between metaTADs are not homogeneous, but instead occur through specific contacts involving specific TADs. Hi‐C interaction data are plotted in log scale.

Figure EV2
Figure EV2. Frequency of TADTAD interactions at each tree level
The average metaTAD inter‐domain, I (blue curves), and intra‐domain, J (magenta), interactions are shown as a function of the number of TADs, n, which a metaTAD includes in our three cell types (see also Appendix Supplementary Methods). The larger the domains considered, the lower their average interactions, consistently with Hi‐C results. As expected, intra‐domain interactions, J, stay always above inter‐domain interactions, I.
Figure 3
Figure 3. metaTAD tree organization correlates with genomic, epigenomic and expression features
  1. Comparison between the distribution of epigenomic, genomic and gene expression features with the metaTAD tree structure.

  2. Left side: the diagram represents the difference between linear and metaTAD tree distance (number of edges along the tree minus one) for a given TAD (yellow) relative to other TADs (blue) in the same tree. Right side: correlations over the tree extend up to genomic scales of tens of Mb (filled circles) and are significantly stronger than those observed in linear genomic sequence (filled triangles). The horizontal dashed line indicates a 20% correlation coefficient.

  3. Statistically significant differences are observed in the correlations measured across the metaTAD tree and across random neighbour trees constructed from the same linear array of TADs (horizontal line P‐value = 0.05). Heterochromatin marks H4K20me3 and H3K9me3 levels do not correlate with the tree structure above what is expected from linear genomic distance.

  4. CAGE data, different epigenomic features and pluripotency transcription factors binding sites (TFBS) have different average correlation lengths.

  5. Genome‐wide profiles averaged over all TAD or metaTAD boundaries of selected epigenomic features and gene densities (ribbons show 95% confidence intervals of the mean; metaTADs considered had length 10–40 Mb). Higher feature enrichments are observed at metaTAD boundaries (green) than TAD boundaries (violet).

  6. Approximately two‐thirds of TAD boundaries are conserved from ESC to Neurons.

Figure EV3
Figure EV3. Correlation of genomic, epigenomic and transcription factor features
  1. Three nested levels of TAD hierarchies of chromosome 2 are displayed above multiple genomic, epigenomic, CAGE data and transcription factor (TF) features measured in ESC. Feature intensities are evaluated in 50‐kb windows and have been clipped at 5 and 95% quantiles to facilitate visualization. The colour keys reflecting the full range of feature scores over all chromosomes are depicted on the right of the topmost panel.

  2. Correlation lengths measured across metaTAD trees in each time point (ESC, NPC and Neurons) were calculated for gene density, GC content and CAGE tags. Despite metaTAD restructuring (see Fig 2F), strong correlations are detected at each time point.

Figure 4
Figure 4. Re‐wiring of metaTAD trees during neuronal differentiation is linked to gene expression changes
  1. Comparison between the metaTAD trees of chr6 in ESC and NPC. The extent of local tree topology changes is represented in the central heatmap. Examples of regions with conserved (dark green) and changing (red) local tree topologies are highlighted on the tree diagrams.

  2. Cophenetic correlation coefficients between metaTAD trees at different time points show significant tree structure conservation (˜80%) when compared to random neighbour trees.

  3. Graphical representation of the distribution of tree changes, gene expression changes and A/B compartment membership changes across chr10 TADs for the ESC–NPC transition. Changes were computed at the level of conserved TADs.

  4. Venn diagrams with the number of TADs that undergo changes in metaTAD tree topology, gene expression and A/B compartment in the ESC–NPC and NPC–Neuron differentiation transitions.

  5. Scheme of the approach used to detect overlaps between genomic regions that undergo gene expression changes and tree changes or compartment A/B changes. The Jaccard index was used as a measure of overlap, and its significance was judged by comparing with 10,000 circularly permutated regions.

  6. Overlap values (Jaccard index) between regions of gene expression changes and tree conservation/change, in compartment A, B or A‐B transitions. The significance of overlaps is represented using a blue‐coloured matrix; white cells represent non‐significant overlaps (P‐value > 0.05).

  7. TAD pairs are more likely to display coherent changes in gene expression during differentiation (both TADs are upregulated, downregulated or unchanged) in regions where tree topology is conserved than random pairs. In metaTAD trees of TADs with conserved boundaries, this is significant up to a tree distance of 5 (corresponding to ˜2–6 Mb in genomic distance).

Figure EV4
Figure EV4. Re‐wiring of metaTAD trees across differentiation and TAD‐based gene expression changes
  1. Cophenetic correlation coefficients comparing metaTAD trees per chromosome show different levels of tree restructuring in each differentiation time point transition. Dashed horizontal lines represent the average value for all autosomal chromosomes.

  2. Percentage of the total number of (left to right): TADs with a local tree change value above a threshold of 1 (threshold values of 0.5, 2 or 3 for local tree change gave similar results; P‐values refer to Wilcoxon rank‐sum test performed with all tree change values in each time point transition); genomic extension of regions characterized as tree change (z‐score > 0); and genomic extension of regions characterized as A/B compartment change (> 25% coverage change) for all time point transitions.

  3. Scatter plots illustrating TAD classification according to the gene expression change double criteria used in Fig 4: fold change (> 1.5×) and FPKM difference (> IQR/4). Other thresholds considered are also marked. TADs in red are significantly changing, while TADs in green do not display significant changes in expression.

Figure 5
Figure 5. Molecular mechanisms of metaTAD assembly
  1. Schematic representation of the strings and binders switch (SBS) model. Binder particles form multiple bonds with binding sites on the polymer, allowing loop formation between distant regions. By Monte Carlo simulations, we produce equilibrium configurations to measure potentially relevant biological quantities, such as contact matrices, polymer compaction degree and other details of folding architecture.

  2. The SBS model can explain mechanisms of hierarchical folding of TADs into metaTADs. Representation of an SBS‐TAD model with two clusters of binding sites (red and green) and their respective binders, along with its emerging contact matrix (logarithm of contact frequency). The 3D conformation example shows the formation of loops and two separate domains.

  3. Addition of a third type of interspersed binding sites (blue) and their binders promotes the formation of a metaTAD, as seen in the contact matrix (logarithm of contact frequency).

  4. The volume of the polymer model represented in (C) (metaTAD model) is half the size of the model in (B) (TAD only model), albeit their length is the same. While the organization of red and green TADs remains unaltered, the formation of a third domain (blue) encompassing the blue binding sites results in increased proximity between the green and red TADs.

  5. Despite increased compaction observed in the metaTAD model, the probability that a single red binding site contacts a green binding site in the same polymer remains vanishingly small. The contacts mediated through the blue beads result in a significant increase in the extent of polymer packing while keeping the red and green domains separated.

Figure EV5
Figure EV5. Long‐range contacts detected by Hi‐C between distant TADs can be confirmed at the single‐cell level by cryoFISH
  1. To assess the extent long‐range interactions between TADs at the single‐cell level, we designed three oligo libraries tagged with ATTO‐550, ATTO‐488 and ATTO‐594, which cover genomic regions a (red), b (green) and c (blue), separated by 1.5 (a‐b) or 2 (b‐c) Mb, and which belong to different TADs (red line). Hi‐C data show stronger long‐range interaction frequency between a and b than b and c.

  2. Probes a, b and c were simultaneously hybridized on ultrathin (~180‐nm‐thick) cryosections from ESC (Branco & Pombo, 2006), before counterstaining with DAPI (inset on the left) and imaged on a confocal microscope. Inset on the right shows a magnified region of the same image. Dashed line indicates the nuclear outline. Region b often co‐localizes with region a, but not region c.

  3. Distances between regions a and b, or b and c, were measured using an automated macro from the centroids of fluorescent signals corresponding to a, b and c. Data were collected from two independent cryoFISH experiments (red and green dots); number of distances is indicated above the graph. The average distance between the centre of the fluorescent signals corresponding to probes a and b is 350 ± 242 nm, in contrast with probes b and c, which are on average separated by 587 ± 375 nm (Wilcoxon test, P‐value 0.00168). The ratio of the genomic separation for the two pairs of probes (bc/ab) is ~1.3, compared with their physical distance ratio of 1.7.

Figure 6
Figure 6. Hierarchical organization of chromosome folding in neuronal differentiation
  1. Changes in chromatin contacts and gene expression were followed during differentiation of ESC into NPC and Neurons using Hi‐C and CAGE. A hierarchy of higher‐order contacts between TADs (numbered 1–6) was detected within Hi‐C contact matrices, which is partially altered during differentiation. The patterns of TAD–TAD contacts reflect similarity in expression and epigenomic features.

  2. The hierarchy of TAD–TAD contacts can be represented as a metaTAD tree which facilitates an integrated, multi‐scale study of the relationship between chromatin contacts and expression and epigenomic features. Rearrangements in metaTAD tree topologies often occur at genomic regions that undergo gene expression changes, while conserved topologies often underlie TADs with coherent changes in gene expression.

Similar articles

Cited by

References

    1. Abranches E, Silva M, Pradier L, Schulz H, Hummel O, Henrique D, Bekman E (2009) Neural differentiation of embryonic stem cells in vitro: a road map to neurogenesis in the embryo. PLoS One 4: e6286 - PMC - PubMed
    1. Barbieri M, Chotalia M, Fraser J, Lavitas LM, Dostie J, Pombo A, Nicodemi M (2012) Complexity of chromatin folding is captured by the strings and binders switch model. Proc Natl Acad Sci USA 109: 16173–16178 - PMC - PubMed
    1. Bickmore W, van Steensel B (2013) Genome architecture: domain organization of interphase chromosomes. Cell 152: 1270–1284 - PubMed
    1. Branco MR, Pombo A (2006) Intermingling of chromosome territories in interphase suggests role in translocations and transcription‐dependent associations. PLoS Biol 4: e138 - PMC - PubMed
    1. Brookes E, de Santiago I, Hebenstreit D, Morris KJ, Carroll T, Xie SQ, Stock JK, Heidemann M, Eick D, Nozaki N, Kimura H, Ragoussis J, Teichmann SA, Pombo A (2012) Polycomb associates genome‐wide with a specific RNA polymerase II variant, and regulates metabolic genes in ESCs. Cell Stem Cell 10: 157–170 - PMC - PubMed