Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
[Preprint]. 2024 Mar 12:2023.07.02.547201.
doi: 10.1101/2023.07.02.547201.

Genomic context sensitizes regulatory elements to genetic disruption

Affiliations

Genomic context sensitizes regulatory elements to genetic disruption

Raquel Ordoñez et al. bioRxiv. .

Update in

Abstract

Enhancer function is frequently investigated piecemeal using truncated reporter assays or single deletion analysis. Thus it remains unclear to what extent enhancer function at native loci relies on surrounding genomic context. Using the Big-IN technology for targeted integration of large DNAs, we analyzed the regulatory architecture of the murine Igf2/H19 locus, a paradigmatic model of enhancer selectivity. We assembled payloads containing a 157-kb functional Igf2/H19 locus and engineered mutations to genetically direct CTCF occupancy at the imprinting control region (ICR) that switches the target gene of the H19 enhancer cluster. Contrasting activity of payloads delivered at the endogenous Igf2/H19 locus or ectopically at Hprt revealed that the Igf2/H19 locus includes additional, previously unknown long-range regulatory elements. Exchanging components of the Igf2/H19 locus with the well-studied Sox2 locus showed that the H19 enhancer cluster functioned poorly out of context, and required its native surroundings to activate Sox2 expression. Conversely, the Sox2 locus control region (LCR) could activate both Igf2 and H19 outside its native context, but its activity was only partially modulated by CTCF occupancy at the ICR. Analysis of regulatory DNA actuation across different cell types revealed that, while the H19 enhancers are tightly coordinated within their native locus, the Sox2 LCR acts more independently. We show that these enhancer clusters typify broader classes of loci genome-wide. Our results show that unexpected dependencies may influence even the most studied functional elements, and our synthetic regulatory genomics approach permits large-scale manipulation of complete loci to investigate the relationship between locus architecture and function.

Keywords: enhancer selectivity; gene regulation; genetic engineering; genome writing; genomic regulatory architecture; synthetic regulatory genomics.

PubMed Disclaimer

Conflict of interest statement

DECLARATION OF INTERESTS R.B., J.D.B. and M.T.M are listed as inventors on a patent application describing Big-IN. Jef Boeke is a founder and Director of CDI Labs, Inc., a founder of and consultant to Opentrons Lab-Works/Neochromosome, Inc, and serves or served on the Scientific Advisory Board of the following: CZ Biohub New York, LLC, Logomix, Inc., Modern Meadow, Inc., Rome Therapeutics, Inc., Sangamo, Inc., Tessera Therapeutics, Inc. and the Wyss Institute.

Figures

Figure 1.
Figure 1.. Engineering the Igf2/H19 locus
(A) Igf2/H19 locus in mESC and differentiated mesendodermal cells. DNase-seq tracks show activation of the enhancer cluster upon differentiation. Triangles indicate CTCF recognition sequence orientation. (B) Barrier model where CTCF occupancy at the imprinting control region (ICR) directs activity of the endogenous enhancer cluster towards H19 or Igf2 (Bell and Felsenfeld 2000; Hark et al. 2000). (C) Schematic of engineering and delivery strategy. Payloads were assembled in yeast, amplified in bacteria, and delivered to mESCs using Big-IN. Multiple independent clones were systematically verified for correct genome engineering and differentiated into mesendodermal cells for further phenotyping. (D) Schematic of Igf2/H19 locus engineering using Big-IN. CpG mutations (methylation-insensitive ICR [MI-ICR]) or CTCF deletions (ΔCTCF-ICR) permit genetic control of CTCF occupancy. B6 or engineered variants in H19 and Igf2 genes permit allele-specific expression analysis. (E) Expression analysis of the engineered Igf2/H19 locus in mesendodermal cells. Schemes on the left represent each delivered payload. At right, each point represents the expression of the engineered H19 or Igf2 gene in an independent clone, measured using allele-specific qRT-PCR. Bars indicate median. Expression was normalized to Gapdh and then scaled between 0 (ΔIgf2/H19 locus) and 1 (“Full-length” rescue payloads), normalizing H19 to the “Full-length” payload with constitutive CTCF occupancy, and Igf2 to the “Full-length” payload with abolished CTCF occupancy (dashed lines). Payloads with both constitutive CTCF occupancy (orange hexagon) or abolished CTCF occupancy (empty hexagon) at the ICR were assessed.
Figure 2.
Figure 2.. Identification of long-range regulatory elements at the Igf2/H19 locus
(A) The Igf2/H19 locus and extended downstream region showing DNase-seq and CTCF CUT&RUN in mESC and differentiated mesendodermal cells. Triangles indicate CTCF recognition sequence orientation. “Full-length” and “ΔH19Enh at Igf2/H19” are repeated from Figure 1E for reference. (B) Expression analysis in mesendodermal cells of Igf2/H19 payloads when delivered to the Igf2/H19 locus, to a genomic safe harbor locus (Hprt), or to Igf2/H19 locus in conjunction with distal candidate enhancers deletions. Each point represents the expression of the engineered H19 or Igf2 in an independent clone using allele-specific qRT-PCR with bars indicating the median. First two rows are from Figure 1E. Expression was normalized to Gapdh and then scaled between 0 (ΔIgf2/H19 locus) and 1 (“Full-length” rescue payloads), normalizing H19 to the “Full-length” payload with constitutive CTCF occupancy, and Igf2 to the “Full-length” payload with abolished CTCF occupancy, which are also represented as dashed lines. Payloads with both constitutive CTCF occupancy (orange hexagon) or abolished CTCF occupancy (empty hexagon) at the ICR were assessed. (C) Contribution of the H19 enhancers and the native context in the expression levels of H19 and Igf2 genes in the payloads with constitutive CTCF occupancy or abolished CTCF occupancy, respectively. The presence (black dots) and absence (gray dots) of each element is indicated. Bars indicate median and error bars represent the interquartile range. Dashed horizontal lines indicate the expected transcriptional output of adding both the enhancers and the surrounding context (bottom), versus the measured transcriptional output (top). Source data are from (B).
Figure 3.
Figure 3.. Enhancer interchangeability at the Igf2/H19 and Sox2 loci
(A) Sox2 and locus control region (LCR) showing DNase-seq and CTCF CUT&RUN in mESC and mesendodermal cells. Triangles indicate CTCF recognition sequence orientation. (B-C) Points represent independently engineered clones and bars indicate the median (B) Expression analysis in differentiated mesendodermal cells harboring payloads including the wild-type Sox2 locus, the Sox2 locus with the LCR replaced by the H19 enhancer cluster, and the Igf2/H19 locus with the Igf2 gene and promoter replaced by the Sox2 gene and promoter. Each payload was delivered both to the Sox2 locus (left) and Igf2/H19 locus (right). Expression of the engineered Sox2 allele was normalized to the wild-type CAST allele and then scaled between 0 (ΔSox2 locus) to 1 (“Sox2 & Sox2 LCR”), which are also represented as dashed lines. (C) Igf2 and H19 expression analysis in mESC engineered with a “Full-length” Igf2/H19 rescue payload, or payloads wherein the H19 enhancer was replaced by the Sox2 LCR in either orientation. Expression was normalized to Gapdh and then scaled between 0 (ΔIgf2/H19 locus) and 1 (“Full-length”), normalizing H19 to the “Full-length” payload with constitutive CTCF occupancy, and Igf2 to the “Full-length” payload with abolished CTCF occupancy. Payloads with both constitutive CTCF occupancy (orange hexagon) or abolished CTCF occupancy (empty hexagon) at the ICR were assessed. The dashed lines indicate basal expression of “ΔH19Enh” in mESCs and expression of “Full-length” Igf2/H19 rescue payload in differentiated mesendodermal cells.
Figure 4.
Figure 4.. Cross-cell type accessibility patterns reflect locus function
(A) Accessibility patterns at the Hprt, Sox2 and Igf2/H19 loci across 45 cell and tissue types, ordered by hierarchical clustering of genomic DHSs as in (Halow et al. 2021). Shading represents the Sox2 LCR and H19 enhancers. (B) Correlation of DHS accessibility across Hprt, Sox2 and Igf2/H19 loci. Each line represents the Pearson correlation coefficient (r) between the DHS located at the viewpoint with every other DHS in a 1-Mb window. Viewpoints are shaded in each plot. (C) Schematic representation of uncorrelated, independent and coordinated DHS clusters. Multiple regions of high Pearson r values indicate coordinated DHS activity potentially due to synchronized activation in specific tissues. Corr: Pearson correlation coefficient. (D) Heatmaps showing the Pearson correlation coefficient (r) between the DHS located at the viewpoint and the surrounding DHS within a 1-Mb window. Three different sets of viewpoints are represented: random DHSs, promoters of protein coding genes (±1 kb from TSS), and mESC super-enhancers (SE) (Whyte et al. 2013); each row portrays the region surrounding a different viewpoint. Rows are not shown to scale. DHS correlation patterns were grouped into uncorrelated, independent and coordinated based on the number of peaks exceeding r > 0.25. (E) Representation of DHSs correlation patterns for various genomic features.

References

    1. Ainscough JF, Koide T, Tada M, Barton S, Surani MA. 1997. Imprinting of Igf2 and H19 from a 130 kb YAC transgene. Development 124: 3621–3632. - PubMed
    1. Arney KL. 2003. H19 and Igf2--enhancing the confusion? Trends Genet 19: 17–23. - PubMed
    1. Avsec Ž, Weilert M, Shrikumar A, Krueger S, Alexandari A, Dalal K, Fropf R, McAnany C, Gagneur J, Kundaje A, et al. 2021. Base-resolution models of transcription-factor binding reveal soft motif syntax. Nat Genet 53: 354–366. - PMC - PubMed
    1. Banerji J, Rusconi S, Schaffner W. 1981. Expression of a beta-globin gene is enhanced by remote SV40 DNA sequences. Cell 27: 299–308. - PubMed
    1. Bell AC, Felsenfeld G. 2000. Methylation of a CTCF-dependent boundary controls imprinted expression of the Igf2 gene. Nature 405: 482–485. - PubMed

Publication types