. 2020 May 29;11(1):2680.

doi: 10.1038/s41467-020-16354-x.

A genome-scale map of DNA methylation turnover identifies site-specific dependencies of DNMT and TET activity

Paul Adrian Ginno^#¹, Dimos Gaidatzis^#^{1

2}, Angelika Feldmann^#^{1

3}, Leslie Hoerner¹, Dilek Imanci^{1

4}, Lukas Burger^{1

2}, Frederic Zilbermann¹, Antoine H F M Peters^{1

5}, Frank Edenhofer⁶, Sébastien A Smallwood¹, Arnaud R Krebs^{1

7}, Dirk Schübeler^{8

9}

Affiliations

¹ Friedrich Miescher Institute for Biomedical Research, Basel, Switzerland.
² Swiss Institute of Bioinformatics, Basel, Switzerland.
³ Department of Biochemistry, University of Oxford, Oxford, UK.
⁴ Novartis Institutes for Biomedical Research, Basel, Switzerland.
⁵ Faculty of Sciences, University of Basel, Basel, Switzerland.
⁶ Leopold-Franzens-University Innsbruck & CMBI, Innsbruck, Austria.
⁷ EMBL Heidelberg, Heidelberg, Germany.
⁸ Friedrich Miescher Institute for Biomedical Research, Basel, Switzerland. dirk@fmi.ch.
⁹ Faculty of Sciences, University of Basel, Basel, Switzerland. dirk@fmi.ch.

^# Contributed equally.

PMID: 32471981
PMCID: PMC7260214
DOI: 10.1038/s41467-020-16354-x

A genome-scale map of DNA methylation turnover identifies site-specific dependencies of DNMT and TET activity

Paul Adrian Ginno et al. Nat Commun. 2020.

. 2020 May 29;11(1):2680.

doi: 10.1038/s41467-020-16354-x.

Authors

Affiliations

¹ Friedrich Miescher Institute for Biomedical Research, Basel, Switzerland.
² Swiss Institute of Bioinformatics, Basel, Switzerland.
³ Department of Biochemistry, University of Oxford, Oxford, UK.
⁴ Novartis Institutes for Biomedical Research, Basel, Switzerland.
⁵ Faculty of Sciences, University of Basel, Basel, Switzerland.
⁶ Leopold-Franzens-University Innsbruck & CMBI, Innsbruck, Austria.
⁷ EMBL Heidelberg, Heidelberg, Germany.
⁸ Friedrich Miescher Institute for Biomedical Research, Basel, Switzerland. dirk@fmi.ch.
⁹ Faculty of Sciences, University of Basel, Basel, Switzerland. dirk@fmi.ch.

^# Contributed equally.

PMID: 32471981
PMCID: PMC7260214
DOI: 10.1038/s41467-020-16354-x

Abstract

DNA methylation is considered a stable epigenetic mark, yet methylation patterns can vary during differentiation and in diseases such as cancer. Local levels of DNA methylation result from opposing enzymatic activities, the rates of which remain largely unknown. Here we developed a theoretical and experimental framework enabling us to infer methylation and demethylation rates at 860,404 CpGs in mouse embryonic stem cells. We find that enzymatic rates can vary as much as two orders of magnitude between CpGs with identical steady-state DNA methylation. Unexpectedly, de novo and maintenance methylation activity is reduced at transcription factor binding sites, while methylation turnover is elevated in transcribed gene bodies. Furthermore, we show that TET activity contributes substantially more than passive demethylation to establishing low methylation levels at distal enhancers. Taken together, our work unveils a genome-scale map of methylation kinetics, revealing highly variable and context-specific activity for the DNA methylation machinery.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

**Fig. 1. A dynamical model and cellular system to infer methylation and demethylation rates.**
a Graphic representation of methylation and demethylation rates. The orange and green arrows represent k_de and k_me, respectively. The enzymes responsible for influencing rates are noted. The ratio of rates determines overall methylation levels (Equation (1) below). b Example steady-state methylation levels resulting from different k_me (green) and k_de (orange) combinations. Higher methylation levels are established when k_me is larger than k_de, while low methylation levels represent the opposite. CpGs with the same steady state can have different rates as shown here for 50%. c Theoretical trace of methylation loss over time post Cre transduction for two CpGs with similar steady states. d Cellular system for genetic ablation of k_me. *Dnmt3a* and *Dnmt3b* with loxP sites flanking catalytic exons. Cre protein transduction allows for efficient genetic deletion of all four alleles. e Heatmap of methylation levels for 405 CpGs as measured by amplicon bisulfite sequencing. The left half represents methylation levels for triplicate experiments measured 0, 4, 8, 10, 13, 17, and 29 days post Cre transduction. The right half represents triplicates for mock-treated samples. f CpGs were binned based on starting methylation in 10% increments, and the mean decay over triplicates for Cre-transduced samples (left) and mock samples (right) are shown. g Dynamical model for DNA methylation and implementation of the exponential dampening factor k_e for affecting k_me over time (Eqs. (2)–(4)). See text and “Methods” for details.

**Fig. 2. The inference landscape for methylation and demethylation rates.**
a Inference landscape for k_de (left) and k_me (right), respectively, given all 6400 possible combinations tested (see “Methods”). Blue regions represent high-confidence regimes were rates can be accurately inferred, whereas CpGs lying in the green to yellow regimes are increasingly difficult and ultimately impossible to determine with high confidence. Confidence levels were determined by an error model explicitly detailed in “Methods”. b Example pairs of CpGs and their placement in the inference landscape. Theoretical decay curves for the point pairs (connected by dashed lines) are shown to the right. Each pair number trajectory from the far left panel is shown individually in the right four panels. Note, rates for some CpGs can be accurately distinguished (1), while others have reduced confidence (3) or are governed by rates that are not possible to determine (2 and 4). c Points representing rate combinations for all CpGs (405) measured with amplicon sequencing. Points are overlaid on the inference landscape taking both k_de and k_me into account, inference colors are as a. Blue points have low noise in rate inference, while red points represent CpGs where noise is high. Black points represent CpGs where rates cannot be determined. Because the logarithm of the rates is displayed on both axes, lines with a slope of one (cases 1–3) correspond to rate combinations that result in the same steady-state methylation level but with different turnover.

**Fig. 3. Genome-scale measurement of methylation kinetics.**
a Outline of the SureSelect strategy (above) and a browser screenshot of raw read counts, DHS signal (blue), bait design regions (gold), and CpG methylation level measured prior to induced deletion (black dots). b Percentage of reads in libraries mapping to bait regions. Bars represent bait region boundary extension by 0, 100, or 200 bp, respectively. Error bars represent two standard deviations from the mean of three replicates. c Hierarchical clustering of methylation levels for all samples measured. Annotation column and row depicts days post transduction and wild-type samples. PCC is the Pearson correlation coefficient. d Decay of methylation over time for 2.1 million CpGs. Color scale is as in Fig. 1e, dark red representing 100% methylated CpGs and dark blue representing 0% methylation. Traces to the right represent average profiles of CpGs with similar steady states (noted above panels) but different decay kinetics. CpGs with steady states of 20, 50, and 80% (±2%) were separated into decile bins based on k_de and average profiles from these bins are shown.

**Fig. 4. Rate combinations are characteristic of particular genomic contexts.**
a Scatterplot of k_de and k_me for all cytosines. Dashed lines represent different steady-state methylation levels, which are noted on the right upper borders. b Scatterplots as in a, but with CpGs colored according to overlap with previous genome annotations^,. Red points represent CpGs of interest for the particular genomic annotation. For example, in the first panel all CpGs overlapping with promoter regions are shown in red, while all CpGs outside of promoters are shown in gray. The number of CpGs overlapping with each state are noted above the respective panel. A graphical depiction of the genomic annotations for the five different contexts is shown below the scatterplots. c Scatterplot of methylation levels measured in wild-type (x-axis) and TTKO cells (y-axis). Inset: change in k_de as a function of TET activity. Values on the x-axis represent log2(k_de^WT) − log2(k_de^TTKO). The vertical blue line represents CpGs where TET activity has no effect on k_de, while the red vertical line represents a three-fold increase in the demethylation rate as a function of TET activity. Note the almost unimodal shift in steady-state methylation levels underscoring the role of TET proteins as demethylases. d TET mediated changes in k_de as a function of genomic context. TET activity is as defined above in c. Annotated regions are sorted based on mean change in k_de. The box represents the middle 50% of the data, the line inside the box is the median, and whiskers are defined by the most extreme values lying within 1.5 times the interquartile range.

**Fig. 5. Turnover at highly methylated cytosines correlates with genomic activity.**
a Scatterplot of k_de and k_me highlighting CpGs in red that have a high steady state ≥70%). b Rates and methylation levels as a function of location in genic regions. Mean values for CpGs are represented as a function of their position in genes as a percentage (i.e., each genic region represents 100 bins). Upstream and downstream of noted TSS and TTS regions represent 10 kb of flanking DNA. Each row in the heatmap represents a collection of genes binned on transcriptional output in RPKM (five bins total), with the highest expressing bin on top. Each bin represents at least 2k genes. c Heatmap representing signal for eight different chromatin marks across bins of highly methylated cytosines (red points from a). Mean histone signal was calculated by tiling the genome into 1 kb bins and determining enrichment in ChIP signal over input for the respective marks. From left to right, bins are split based on mean methylation turnover within the bin, with the highest turnover bin on the far right. Note the increase in H3K36me3 and active marks, with the concomitant decrease in H3K9me2/3. d Turnover increases with proximity to distal regulatory elements. CpGs were binned on turnover as for c, but their distance to the nearest DHS site was calculated. Boxplot elements are as defined in Fig. 4d.

**Fig. 6. Transcription factor binding shows variable effects on methylation and demethylation activity.**
a Rates and TET activity as a function of distal DHS signal. The mouse genome was split into 500 bp bins, and reads tallied for all bins that were completely mappable. Bins were then selected as having a minimum distance of 10 kb from an annotated promoter, and split based on number of DHS reads overlapping these bins. DHS signal increases with increasing bin number, where it is apparent that while k_me (left) decreases with increasing accessibility, both k_de (middle) and TET activity (right) increase. Boxplot elements are as defined in Fig. 4d. b Rates and TET activity as a function of distance to bound TF motifs. ENCODE ChIP data for 15 TFs was quantified by counting reads surrounding motifs for each TF in a 201 bp window centered on the motif. Each row of the heatmap represents mean rates as a function of distance to the center of the motif for the respective factor. Sites represented here were selected as the top 900 enriched motif occurrences for each factor (see “Methods” for enrichment determination). c Nucleosome positioning, rate of de novo methylation, passive demethylation, and TET activity around bound CTCF sites, color as in b. MNase read counts were shifted by 75 bp to reflect position of the nucleosome dyad. d Model representing the effect of chromatin processes on methylation and demethylation rates. Presence of bound transcription factors can inhibit both processes, while transcription through gene bodies results in increased de novo methylation and passive demethylation. TET proteins in contrast tend to illicit the strongest effect on demethylation rates at accessible regions proximal to bound transcription factors.

See this image and copyright information in PMC

Cited by

Temporally discordant chromatin accessibility and DNA demethylation define short and long-term enhancer regulation during cell fate specification.
Guerin LN, Scott TJ, Yap JA, Johansson A, Puddu F, Charlesworth T, Yang Y, Simmons AJ, Lau KS, Ihrie RA, Hodges E. Guerin LN, et al. bioRxiv [Preprint]. 2024 Aug 27:2024.08.27.609789. doi: 10.1101/2024.08.27.609789. bioRxiv. 2024. Update in: Cell Rep. 2025 May 27;44(5):115680. doi: 10.1016/j.celrep.2025.115680. PMID: 39253426 Free PMC article. Updated. Preprint.
MeConcord: a new metric to quantitatively characterize DNA methylation heterogeneity across reads and CpG sites.
Zhang X, Wang X. Zhang X, et al. Bioinformatics. 2022 Jun 24;38(Suppl 1):i307-i315. doi: 10.1093/bioinformatics/btac248. Bioinformatics. 2022. PMID: 35758820 Free PMC article.
The concurrence of DNA methylation and demethylation is associated with transcription regulation.
Shi J, Xu J, Chen YE, Li JS, Cui Y, Shen L, Li JJ, Li W. Shi J, et al. Nat Commun. 2021 Sep 6;12(1):5285. doi: 10.1038/s41467-021-25521-7. Nat Commun. 2021. PMID: 34489442 Free PMC article.
TET3 dioxygenase modulates gene conversion at the avian immunoglobulin variable region via demethylation of non-CpG sites in pseudogene templates.
Takamura N, Seo H, Ohta K. Takamura N, et al. Genes Cells. 2021 Mar;26(3):121-135. doi: 10.1111/gtc.12828. Epub 2021 Jan 31. Genes Cells. 2021. PMID: 33421268 Free PMC article.
DNA Methylation in the Adaptive Response to Exercise.
Bittel AJ, Chen YW. Bittel AJ, et al. Sports Med. 2024 Jun;54(6):1419-1458. doi: 10.1007/s40279-024-02011-6. Epub 2024 Apr 2. Sports Med. 2024. PMID: 38561436 Review.

See all "Cited by" articles

References

1. Baubec T, Schubeler D. Genomic patterns and context specific interpretation of DNA methylation. Curr. Opin. Genet Dev. 2014;25:85–92. - PubMed
1. Jaenisch R, Bird A. Epigenetic regulation of gene expression: how the genome integrates intrinsic and environmental signals. Nat. Genet. 2003;33:245–254. - PubMed
1. Bird A. DNA methylation patterns and epigenetic memory. Genes Dev. 2002;16:6–21. - PubMed
1. Smith ZD, Meissner A. DNA methylation: roles in mammalian development. Nat. Rev. Genet. 2013;14:204–220. - PubMed
1. Stadler MB, et al. DNA-binding factors shape the mouse methylome at distal regulatory regions. Nature. 2011;480:490–495. - PubMed

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Substances

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources
Molecular Biology Databases
- NIAID Data Ecosystem - Find datasets on Infectious and Immune-mediated Diseases

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

A genome-scale map of DNA methylation turnover identifies site-specific dependencies of DNMT and TET activity

Affiliations

A genome-scale map of DNA methylation turnover identifies site-specific dependencies of DNMT and TET activity

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Substances

LinkOut - more resources

Full Text Sources

Molecular Biology Databases

Abstract

Conflict of interest statement

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Substances

Related information

LinkOut - more resources

Full Text Sources

Molecular Biology Databases