Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Jun 2:13:RP94502.
doi: 10.7554/eLife.94502.

Post-fertilization transcription initiation in an ancestral LTR retrotransposon drives lineage-specific genomic imprinting of ZDBF2

Affiliations

Post-fertilization transcription initiation in an ancestral LTR retrotransposon drives lineage-specific genomic imprinting of ZDBF2

Hisato Kobayashi et al. Elife. .

Abstract

The imprinted gene ZDBF2 is regulated through a unique mechanism involving a transient paternal transcript in early embryos, rather than persistent gametic DNA methylation. In humans and mice, this transcript-CMKLR2-AS (also known as GPR1-AS) or the long isoform of Zdbf2 (Liz/Zdbf2linc/Platr12)-arises from the unmethylated paternal allele and initiates secondary epigenetic marks that maintain ZDBF2 expression. Here, we investigate the evolutionary origin of this mechanism, and show that the first exon of human GPR1-AS overlaps with a MER21C long terminal repeat (LTR), a retrotransposon subfamily specific to Boreoeutherian mammals. Comparative analyses revealed that this MER21C insertion occurred in the common ancestor of Euarchontoglires, including primates, rodents, and rabbits. Although not annotated, the first exon of mouse Liz displays conserved features with the MER21C-overlapping exon in humans. In rabbit and nonhuman primate placentas, GPR1-AS orthologs with LTR-embedded first exons were also identified. In contrast, in non-Euarchontoglire mammals such as cow and tammar wallaby, ZDBF2 is biallelically expressed, suggesting absence of imprinting. These findings suggest that ZDBF2 imprinting emerged in Euarchontoglires via MER21C insertion. Together with our prior work on LTR-driven imprinting in oocytes, our findings demonstrate that post-fertilization activation of retrotransposons can also drive lineage-specific acquisition of imprinting.

Keywords: LTR retrotransposon; RNA-seq; chromosomes; evolutionary biology; gene expression; genomic imprinting; human; mammalian evolution; mouse; placenta; rhesus macaque.

PubMed Disclaimer

Conflict of interest statement

HK, TI, SK, KT, TT, SN, SS, MH, MR, MK, SS, TK, HN, HM, KN, AU, HK, MK, HI, KN, ML, KK No competing interests declared

Figures

Figure 1.
Figure 1.. Identification of GPR1-AS orthologs from public placental transcriptomes.
UCSC Genome Browser screenshots of the GPR1-ZDBF2 locus in humans (A), rhesus macaques (B), and mice (C). Predicted transcripts were generated using public directional placental RNA-seq datasets (accession numbers: SRR12363247 for humans, SRR1236168 for rhesus macaques, and SRR943345 for mice) using the Hisat2-StringTie2 programs. Genes annotated from GENCODE or RefSeq databases and long terminal repeat (LTR) retrotransposon positions from UCSC Genome Browser RepeatMasker tracks are also displayed. Among the gene lists, only the human reference genome includes an annotation for GPR1-AS (highlighted in green). GPR1-AS-like transcripts and MER21C retrotransposons are highlighted in red. Animal silhouettes were obtained from PhyloPic. Animal silhouettes were obtained from PhyloPic (mouse silhouette by Katy Lawler, available under a CC BY 4.0 license).
Figure 1—figure supplement 1.
Figure 1—figure supplement 1.. Identification of GPR1-AS orthologs using public and non-directional RNA-seq data.
(A) Heat map showing the expression levels of GPR1, GPR1-AS, and ZDBF2 in different human tissues, including the placenta. Genome browser screenshots of the GPR1-ZDBF2 locus in humans (B) and baboons (C). Predicted transcripts were generated using public non-directional placental RNA-seq datasets (accession numbers: SRR1850957 for humans, GSM4696517 for baboons). Transcript/gene information and long terminal repeat (LTR) retrotransposon positions are shown. GPR1-AS-like transcripts and MER21C retrotransposons are shown in red. Animal silhouettes were obtained from PhyloPic.
Figure 2.
Figure 2.. Identification of GPR1-AS orthologs from original placental and extra-embryonic transcriptomes.
Predicted transcripts were generated from placental and extra-embryonic directional RNA-seq datasets of chimpanzee (A), rabbit (B), pig (C), cow (D), and opossum (E) with the Hisat2-StringTie2 programs. Genes annotated from RefSeq or Ensembl databases and their long terminal repeat (LTR) positions are also shown. MER21C retrotransposons, GPR1-AS-like transcripts, and their fragments per kilobase million (FPKM) and transcripts per kilobase million (TPM) values are highlighted in red. Animal silhouettes were obtained from PhyloPic (opossum silhouette by Sarah Werning, available under a CC BY 3.0 license).
Figure 2—figure supplement 1.
Figure 2—figure supplement 1.. Search for GPR1-AS orthologs from embryonic transcriptomes.
Predicted transcripts were generated using directional RNA-seq datasets of embryonic proper tissues from rabbit (A), pig (B), bovine (C), and opossum (D) embryos. Transcript/gene information and long terminal repeat (LTR) retrotransposon positions are displayed and the annotated MER21C retrotransposon (only in rabbit) is highlighted in red. Animal silhouettes were obtained from PhyloPic (opossum silhouette by Sarah Werning, available under a CC BY 3.0 license).
Figure 3.
Figure 3.. Allele-specific RT-PCR sequencing of ZDBF2 in various mammals.
Heterozygous genotypes were used to distinguish between parental alleles in adult tissues from tammar wallabies (A), fetal/embryonic tissues from cattle (B), blood samples from rhesus macaques (C), and rabbits (D), respectively. Primers were designed to amplify the 3'-UTR regions of ZDBF2 orthologs and detect SNPs. Each SNP position is highlighted in red. Reverse primers were also used for Sanger sequencing. Animal silhouettes were obtained from PhyloPic.
Figure 3—figure supplement 1.
Figure 3—figure supplement 1.. Search for germline DMRs from oocyte and sperm DNA methylomes.
The DNA methylation (DNAme) levels of individual CpG sites in oocyte and sperm from rhesus macaque (A), pig (B), and bovine (C) whole genome bisulfite sequencing datasets are shown. Oocyte-methylated and sperm-methylated differentially methylated regions (DMRs) are highlighted in red and blue, respectively. Predicted transcripts from placental and extra-embryonic directional RNA-seq datasets (shown in Figures 1 and 2), genes annotated from RefSeq databases, and long terminal repeat (LTR) positions from UCSC/RepeatMasker are included, with a MER21C retrotransposon overlapping rhesus macaque GPR1-AS highlighted in red. Animal silhouettes were obtained from PhyloPic.
Figure 4.
Figure 4.. Multi-species comparison of long terminal repeat (LTR) retrotransposon locations at GPR1 locus.
A total of 24 mammalian genomes were compared, including six primates (human, chimpanzee, rhesus macaque, marmoset, tarsier, and gray mouse lemur), one colugo (flying lemur), one treeshrew (Chinese treeshrew), two lagomorphs (rabbit and pika), eight rodents (squirrel, guinea pig, lesser jerboa, blind mole rat, giant pouched rat, mouse, rat, and golden hamster), and six other eutherians (pig, cow, horse, dog, elephant, and armadillo). Among the selected genomes, LTRs that can be considered homologous to MER21C, which corresponds to the first exon of GPR1-AS, are marked in red. In tarsier, treeshrew, lesser jerboa, and giant pouched rat, the orthologous LTRs were annotated as MER21B, which exhibits 88% similarity with MER21C in their consensus sequences through pairwise alignment. MER21B is marked in purple. According to Dfam, the MER21C and MER21B subfamilies are specific to the genomes of Boroeutherians and Euarchontoglires, respectively. The copy number of MER21C/B in selected species is shown in red and purple (LTRs likely matching the GPR1-AS exon are underlined). There are 5418 and 2529 copies of MER21C and 2894 and 1535 copies of MER21B in human and mouse genomes, respectively.
Figure 4—figure supplement 1.
Figure 4—figure supplement 1.. Reanalysis of repeat positions using RepeatMasker.
Repetitive elements were re-identified in five mammalian species: mouse, rat, and hamster—where MER21C, which overlaps the first exon of human GPR1-AS, was not found in the homologous region—and rabbit and human, where it was detected. The Percent Identity Plot (PIP, showing a conservation scale between sequences from 50 to 100% on the y-axis) illustrates the order and alignment of the 20 kb region surrounding the GPR1-AS (Liz) transcription start site in each mammalian chromosome. Detected repeat elements are displayed above each plot. RepeatMasking was performed under less stringent settings, including switching search engines from RMblast to HMMER and adjusting speed/sensitivity settings from default to slow. Despite these adjustments, MER21C insertion was not detected in the three rodent species.
Figure 4—figure supplement 2.
Figure 4—figure supplement 2.. Multiple genome alignments at the first exon of GPR1-AS locus.
Cactus generates reference-free, whole-genome multiple alignments (Armstrong et al., 2020). The Cactus track from UCSC Genome Browser displays multiple alignments across vertebrate species and evolutionary conservation metrics from the Zoonomia Project (Zoonomia Consortium, 2020). Green square brackets indicate shorter alignments where DNA from one genomic context in the aligned species is nested within a larger alignment chain from a different genomic context. The alignment within these brackets may represent a short misalignment, a lineage-specific insertion of a retrotransposon in the human genome that aligns to a paralogous copy in another species. SINE and long terminal repeat (LTR) retrotransposon positions from the UCSC Genome Browser are also displayed. Silhouette obtained from PhyloPic.
Figure 5.
Figure 5.. Comparison of MER21C-derived sequences overlapping the first exon of GPR1-AS orthologs.
(A) Phylogenetic tree of MER21C-derived sequences estimated by multiple sequence alignment (MSA) using multiple sequence comparison by log-expectation (MUSCLE) program. (B) Positions of common and unique cis-acting elements at each sequence. (C) Motif structures of the common region that contains E74-like factor 1 and 2 (ELF1 and ELF2) binding motifs. (D) Motif structures of transcription factor AP-2 gamma (TFAP2C) and Zinc finger and SCAN domain containing 4 (ZSCAN4).
Figure 5—figure supplement 1.
Figure 5—figure supplement 1.. Pairwise alignment between consensus sequences of retrotransposons and GPR1-AS-exonic MER21 sequences.
.(A) MER21C (or MER21B) sequences located in the GPR1 intron of eutherian genomes and the first exon of mouse Liz were compared with the consensus MER21C sequence. (B) Human and rabbit MER21C sequences overlapping the first exon of GPR1-AS and the first exon of mouse Liz were compared with the consensus sequences of ERV3/ERVL solo-LTRs present in human and mouse (n=182). Each graph displays the identity percentages and alignment scores for the top five long terminal repeats (LTRs) with the highest scores. In humans and rabbits, MER21C showed the highest identity with the exonic sequences. (C) The first exon of mouse Liz was compared with the consensus sequences of all retrotransposons present in mice (n=1361). The graph represents the top 10 retrotransposons with the highest scores. In mice, MER21C does not show sufficient sequence identity to the first exon of Liz to distinguish it from other retrotransposons. Pairwise alignment scores and percent identity values for each sequence pair were calculated using Genetyx software.
Figure 5—figure supplement 2.
Figure 5—figure supplement 2.. Promoter activities of first exons of mouse Liz and human GPR1-AS.
(A) Constructs (inserted sequences) used for dual luciferase reporter assays in HEK293T cells. A promoter-less vector served as the negative control. (B) Results of dual luciferase reporter assays. Relative fold changes in Firefly luciferase activity (Firefly/Renilla) were normalized to the Firefly/Renilla ratio of the negative control. Error bars indicate mean ± s.e.m. Statistical significance was determined using unpaired t-tests: *p<0.05, **p<0.01. Data represent four biological replicates.
Figure 5—figure supplement 3.
Figure 5—figure supplement 3.. Expression patterns of transcription factors and imprinted genes during human preimplantation development.
A heat map displaying the average expression of four transcription factors associated to human GPR1-AS or mouse Liz transcription, a primary KRAB-ZFP that binds to MER21C, and three imprinted genes surrounding the ZDBF2 locus.
Figure 6.
Figure 6.. Initiation of GPR1-AS transcription before implantation.
Genome browser screenshots of the GPR1-ZDBF2 locus in humans at preimplantation stages, including the MII oocyte, zygote, 2 cell, 4 cell, 8 cell, inner cell mass (ICM), and trophectoderm (TE) from the blastocyst. Predicted transcripts were generated from publicly available full-length RNA-seq datasets, with detected GPR1-AS-like transcripts and their fragments per kilobase million (FPKM) and transcripts per kilobase million (TPM) values highlighted in red. Silhouette was obtained from PhyloPic.
Figure 6—figure supplement 1.
Figure 6—figure supplement 1.. Human long terminal repeat (LTR) reactivation during preimplantation development.
Heat map displaying the average expression of select LTR retrotransposon families in human oocytes and early embryos. MLT2A1/MLT2A2 and HERVK are reactivated between the 4- to 8 cell stage and after the 8 cell stage, respectively (Grow et al., 2015; Hashimoto et al., 2021).
Figure 7.
Figure 7.. Establishment of ZDBF2 imprinted domain in evolution and genome biology.
(A) Scheme of epigenetic and transcriptional changes at the first exon of mouse Liz and human GPR1-AS. (B) Timescale of the evolution of ZDBF2 imprinting and LTR (MER21C) insertion. Animal silhouettes were obtained from PhyloPic (mouse silhouette by Katy Lawler, available under a CC BY 4.0 license; opposum silhouette by Sarah Werning, available under a CC BY 3.0 license).
Figure 7—figure supplement 1.
Figure 7—figure supplement 1.. Interspecies epigenomic comparisons between human GPR1-AS and mouse Liz.
IGV screenshots of the first exon of GPR1-AS/Liz in human (A) and mouse (B) showing DNA methylation, enrichment of post-translational histone modifications (H3K4me3, H3K9me3, and H3K27me3), and transcription factor binding sites (TFAP2C and ZSCAN4C) from ChIP-Atlas in various tissues. DNA methylomes from oocyte and sperm from mouse and human were published previously (Brind’Amour et al., 2018). Animal silhouettes were obtained from PhyloPic (mouse silhouette by Katy Lawler, available under a CC BY 4.0 license).

Update of

  • doi: 10.1101/2023.10.30.564869
  • doi: 10.7554/eLife.94502.1
  • doi: 10.7554/eLife.94502.2

References

    1. Akizawa H, Kobayashi K, Bai H, Takahashi M, Kagawa S, Nagatomo H, Kawahara M. Reciprocal regulation of TEAD4 and CCN2 for the trophectoderm development of the bovine blastocyst. Reproduction. 2018;155:563–571. doi: 10.1530/REP-18-0043. - DOI - PubMed
    1. Armstrong DL, McGowen MR, Weckle A, Pantham P, Caravas J, Agnew D, Benirschke K, Savage-Rumbaugh S, Nevo E, Kim CJ, Wagner GP, Romero R, Wildman DE. The core transcriptome of mammalian placentas and the divergence of expression with placental shape. Placenta. 2017;57:71–78. doi: 10.1016/j.placenta.2017.04.015. - DOI - PMC - PubMed
    1. Armstrong J, Hickey G, Diekhans M, Fiddes IT, Novak AM, Deran A, Fang Q, Xie D, Feng S, Stiller J, Genereux D, Johnson J, Marinescu VD, Alföldi J, Harris RS, Lindblad-Toh K, Haussler D, Karlsson E, Jarvis ED, Zhang G, Paten B. Progressive Cactus is a multiple-genome aligner for the thousand-genome era. Nature. 2020;587:246–251. doi: 10.1038/s41586-020-2871-y. - DOI - PMC - PubMed
    1. Barau J, Teissandier A, Zamudio N, Roy S, Nalesso V, Hérault Y, Guillou F, Bourc’his D. The DNA methyltransferase DNMT3C protects male germ cells from transposon activity. Science. 2016;354:909–912. doi: 10.1126/science.aah5143. - DOI - PubMed
    1. Bogutz AB, Brind’Amour J, Kobayashi H, Jensen KN, Nakabayashi K, Imai H, Lorincz MC, Lefebvre L. Evolution of imprinting via lineage-specific insertion of retroviral promoters. Nature Communications. 2019;10:5674. doi: 10.1038/s41467-019-13662-9. - DOI - PMC - PubMed