Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 Jun 20;44(11):5123-32.
doi: 10.1093/nar/gkw124. Epub 2016 Feb 29.

DNA methylation in human epigenomes depends on local topology of CpG sites

Affiliations

DNA methylation in human epigenomes depends on local topology of CpG sites

Cecilia Lövkvist et al. Nucleic Acids Res. .

Abstract

In vertebrates, methylation of cytosine at CpG sequences is implicated in stable and heritable patterns of gene expression. The classical model for inheritance, in which individual CpG sites are independent, provides no explanation for the observed non-random patterns of methylation. We first investigate the exact topology of CpG clustering in the human genome associated to CpG islands. Then, by pooling genomic CpG clusters on the basis of short distances between CpGs within and long distances outside clusters, we show a strong dependence of methylation on the number and density of CpG organization. CpG clusters with fewer, or less densely spaced, CpGs are predominantly hyper-methylated, while larger clusters are predominantly hypo-methylated. Intermediate clusters, however, are either hyper- or hypo-methylated but are rarely found in intermediate methylation states. We develop a model for spatially-dependent collaboration between CpGs, where methylated CpGs recruit methylation enzymes that can act on CpGs over an extended local region, while unmethylated CpGs recruit demethylation enzymes that act more strongly on nearby CpGs. This model can reproduce the effects of CpG clustering on methylation and produces stable and heritable alternative methylation states of CpG clusters, thus providing a coherent model for methylation inheritance and methylation patterning.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Distances between CpG sites in the human genome. (A) Schematic of randomization of CpG positions used to produce an equal number of CpG sites but remove all spatial correlations between the CpG positions. The position of each of the 28 million CpGs in the genome was randomly assigned a new position (avoiding overlapping of CpG sites) within a ‘blank genome’ of 28 billion positions. (B) The observed CpG–CpG distance frequencies for the data (blue), and after CpG randomization (green). The standard errors of the mean for 12 separate genome randomizations lie within the thickness of the green line. The lower panel shows the ratio between the real and randomized distance frequencies. (C) Schematic of randomization of distances between CpG sites, keeping each individual distance unchanged but removing the correlation between distances, i.e. the distances are preserved. Effectively, an array of the 28 million genomic CpG–CpG distances was shuffled to produce a random sequence of these distances. (D) Frequencies of distances (d1) and subsequent distances (d2) are divided by the corresponding frequencies after distance randomization, showing enhancement of short-short and long-long distance combinations. (E) Schematic of CpG cluster criteria. (F) Distribution of cluster sizes NC in the genome for dmax = 25 bp and Dmin = 65 bp. (G) The genome contains 21 000 clusters of size NC = 15 with dmax = 25 bp and Dmin = 65 bp. In the plot, the point at site index = 1 is the distance between the central CpG (site index = 0) and the first CpG to the right (site index = 1) averaged across all clusters. The point at site index = 2 is the average of the distances between the first and second CpGs on the right, and so on. Average successive CpG–CpG distances going leftward from the central CpG are given by negative site indices. Black points show average distances between CpG sites within the cluster, with the average distances outside the cluster shown in gray. (H) As (G) but for Dmin = 45 bp. Note the correlation between CpG distances surrounding the island. Note the logarithmic vertical axes in (F, G and H) and the double logarithmic axes in (B, top) and (D).
Figure 2.
Figure 2.
Empirical CpG distance and methylation distributions. (A) Distributions of average methylation of clusters sized 1 ≤ NC ≤ 60 with dmax = 25 bp and Dmin = 65 bp. Panels show different NC ranges (with the mean NC in parentheses). The black dashed line shows the average methylation of each distribution. (B) As (A) but Dmin = 45 bp. (C) For each pool of clusters defined with specific values of Dmin and dmax, there is a critical cluster size, N*, at which the methylation distribution is maximally bimodal (e.g. formula image and formula image, mark the maximal bimodality obtained in (A and B)). For each cluster pool, N* is plotted against the average inter-CpG distance 〈d〉 in that pool. Each line corresponds to a particular value of Dmin (as indicated in the inset), with each point on the line derived from a cluster with a distinct dmax value.
Figure 3.
Figure 3.
Model design. (A) Collaborative model (7). Straight arrows (gray and black) are non-collaborative reactions, curved arrows are collaborative reactions (start at mediator CpG, end at the reaction stimulated). See text. (B) Average CpG methylation for genomic regions containing CpG-clusters consisting of seven CpG sites with the average inter-CpG distance 〈d〉 < 12.5 (black points) with a low density of surrounding CpG sites (gray points). Specifically, clusters were selected where 30 CpGs on each side of the cluster are spaced on average at least 80 bp apart. Clusters are sorted into those that are hyper-methylated (upper panel) or are hypo-methylated (lower panel). As in Figure 1G, site index is the ordinate position of the CpG relative to the central CpG of the cluster. (C) Introducing distance-dependent collaboration—short-range demethylation and long-range methylation. Plots show the reaction probability density function as a function of the distance between mediator and target in the new model. Methylation reactions have a power-law distance dependence ∼a/(d + α) with d the distance and α = 196 bp; an offset. Demethylation reactions have an exponential distance dependence ∼b · exp (d/d0), with d0 = 174 bp the range of the interaction. The parameters a = 650 bp and b = 5.525 are scaling factors.
Figure 4.
Figure 4.
Model results. (A) Space-time plot for a bistable system of cluster size NC = 23 (dense region in center of plot), d = 10 bp, D = 65 bp. m, h and u sites shown in red, green and blue. CpG sites within a cluster are spaced at a distance d, separated by D from the Nout = 200 CpG sites spaced D* = 100 bp apart (the genomic average), with periodic boundary conditions (a ring of NC + Nout CpG sites). Nine different reactions (Figure 3A) with rates {β = 0.005, μ = 0.01, σ1 = 0.2, σ2 = 0.8, σ3 = 0.8, κ1 = 0.8, κ2 = 0.8} were used in a standard Gillespie algorithm (see text). Collaborative reactions were subject to a distance test (see Figure 3 and text). The state of all sites were recorded just before replication, with a subset of 100 out of 3000 simulated generations shown. (B) Methylation distributions for simulations for varying CpG-cluster sizes using D = 65 bp and d = 10 bp. Compare with Figure 2A. (C) Modeled dependence of N* on d. Compare with Figure 2C.

Similar articles

Cited by

References

    1. Jaenisch R., Bird A. Epigenetic regulation of gene expression: how the genome integrates intrinsic and environmental signals. Nat. Genet. 2003;33:245–254. - PubMed
    1. Felsenfeld G. A brief history of epigenetics. Cold Spring Harb. Perspect. Biol. 2014;6 doi:10.1101/cshperspect.a018200. - PMC - PubMed
    1. Bird A. Perceptions of epigenetics. Nature. 2007;447:396–398. - PubMed
    1. Holliday R., Pugh J.E. DNA modification mechanisms and gene activity during development. Science. 1975;187:226–232. - PubMed
    1. Riggs A.D. X inactivation, differentiation, and DNA methylation. Cytogenet. Genome Res. 1975;14:9–25. - PubMed

Publication types