A new class of human CpG Island promoters with primate-specific repeats
- PMID: 40731120
- PMCID: PMC12307798
- DOI: 10.1038/s41598-025-11213-5
A new class of human CpG Island promoters with primate-specific repeats
Abstract
A subset of imprinting control regions (ICRs) in the human and mouse possess CpG islands associated with imperfect tandem repeats (TRs) that were shown to be essential for genomic imprinting through genetic studies. To identify whether this feature is also present in non-imprinted CpG island promoters, we performed extensive dot plot analyses and identified 342 (326 autosomal and 16 X-chromosomal) human CpG island gene promoters associated with imperfect TRs of ≥ 400 bp, unit lengths 50-150 bp. Most occur as clusters at the human chromosome ends, distinct from the clusters of imprinted genes, and enriched in neurodevelopmental/behavioral disorders, with some showing interindividual variation in methylation levels. A subset of TR-CGIs is highly methylated and remains so during reprogramming to primed iPSCs, but become unmethylated in naïve iPSCs, as in the case of ICRs. Transcript levels correlate with methylation levels for some TR-CGI genes suggesting their gene regulatory potential. Non-TR CGI mouse orthologs of methylated human TR-CGIs are unmethylated in mouse, suggesting the association of TRs with higher methylation levels. Most human TR-CGIs accompanied primate evolution after divergence from mouse TR-CGIs with evidence of recent additions in hominid evolution. In summary, the incorporation of TRs in certain CGI promoters in mammalian evolution results in the unique ability to acquire methylation during human embryonic development and resist reprogramming to a pluripotent stem cell state with an effect on gene expression.
Keywords: CpG Island; DNA methylation; Epigenetic reprogramming; Genome organization; Genomic imprinting Repeat; Human evolution; Stem cells.
© 2025. The Author(s).
Conflict of interest statement
Declarations. Competing interests: The authors declare no competing interests. Data and materials availability: No new sequence data or materials were generated. Results reported in this manuscript were after analysis of the following publicly available GEO DataSets: GSE49828 The DNA methylation landscape of human early embryos. GSE51239 DNA methylation dynamics of the human pre-implantation embryo. GSE76641 DNA Methylation Barcodes in Human Fetal Tissues and Human Induced Pluripotent Stem Cells. GSE76970 Reversion to naïve human pluripotency creates a new methylation landscape devoid of blastocyst or germline memory. GSE80970 Cortical hypermethylation across an extended region spanning the HOXA gene cluster on chromosome 7 is robustly associated with Alzheimer’s disease neuropathology. GSE110366 Profiling the DNA methylation pattern in naïve induced Pluripotent Stem cells and somatic cells. GSE120137 A multi-tissue full lifespan epigenetic clock for mice. GSE124708 Hyperandrogenemia and western-style diet act synergistically on transcription and DNA methylation in visceral adipose tissue of a non-human primate model. GSE129548 CGGBP1 regulates chromatin barrier activity and CTCF occupancy at repeats. GSE175195 TF ChIP-seq from HEK293. GSE175320 Histone ChIP-seq from HEK293. GSE200834 TNRC18 recognizes H3K9me3 to mediate transposable elements silencing at ERV regions. GSE200839 TNRC18 recognizes H3K9me3 to mediate transposable elements silencing at ERV regions. GSE233417 A comprehensive DNA methylation atlas for noncancer human tissue types. GSE247551 SPINDOC promotes genome-wide redistribution of Spindlin1. GSE109559 Cell type and species-specific methylation patterns in neuronal and non-neuronal cells of human and chimpanzee cortex. GSE53261 The relationship between DNA methylation, genetic and expression inter-individual variation in untransformed human fibroblasts.
Figures







References
-
- Eggermann, T. et al. Imprinting disorders. Nat. Rev. Dis. Primers. 9(1), 33 (2023). - PubMed
-
- Hutter, B., Helms, V. & Paulsen, M. Tandem repeats in the CpG islands of imprinted genes. Genomics88(3), 323–332 (2006). - PubMed
-
- Saito, T. et al. A tandem repeat array in IG-DMR is essential for imprinting of paternal allele at the Dlk1-Dio3 domain during embryonic development. Hum. Mol. Genet.27 (18), 3283–3292 (2018). - PubMed
MeSH terms
LinkOut - more resources
Full Text Sources