Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Mar 15:10:e66668.
doi: 10.7554/eLife.66668.

GAF is essential for zygotic genome activation and chromatin accessibility in the early Drosophila embryo

Affiliations

GAF is essential for zygotic genome activation and chromatin accessibility in the early Drosophila embryo

Marissa M Gaskill et al. Elife. .

Abstract

Following fertilization, the genomes of the germ cells are reprogrammed to form the totipotent embryo. Pioneer transcription factors are essential for remodeling the chromatin and driving the initial wave of zygotic gene expression. In Drosophila melanogaster, the pioneer factor Zelda is essential for development through this dramatic period of reprogramming, known as the maternal-to-zygotic transition (MZT). However, it was unknown whether additional pioneer factors were required for this transition. We identified an additional maternally encoded factor required for development through the MZT, GAGA Factor (GAF). GAF is necessary to activate widespread zygotic transcription and to remodel the chromatin accessibility landscape. We demonstrated that Zelda preferentially controls expression of the earliest transcribed genes, while genes expressed during widespread activation are predominantly dependent on GAF. Thus, progression through the MZT requires coordination of multiple pioneer-like factors, and we propose that as development proceeds control is gradually transferred from Zelda to GAF.

Keywords: D. melanogaster; chromatin; chromosomes; gene expression; maternal-to-zygotic transition; pioneer factor; transcription; zygotic genome activation.

Plain language summary

Most cells in an organism share the exact same genetic information, yet they still adopt distinct identities. This diversity emerges because only a selection of genes is switched on at any given time in a cell. Proteins that latch onto DNA control this specificity by activating certain genes at the right time. However, to perform this role they first need to physically access DNA: this can be difficult as the genetic information is tightly compacted so it can fit in a cell. A group of proteins can help to unpack the genome to uncover the genes that can then be accessed and activated. While these ‘pioneer factors’ can therefore shape the identity of a cell, much remains unknown about how they can work together to do so. For instance, the pioneer factor Zelda is essential in early fruit fly development, as it enables the genetic information of the egg and sperm to undergo dramatic reprogramming and generate a new organism. Yet, it was unclear whether additional helpers were required for this transition. Using this animal system, Gaskill, Gibson et al. identified GAGA Factor as a protein which works with Zelda to open up and reprogram hundreds of different sections along the genome of fruit fly embryos. This tag-team effort started with Zelda being important initially to activate genes; regulation was then handed over for GAGA Factor to continue the process. Without either protein, the embryo died. Getting a glimpse into early genetic events during fly development provides insights that are often applicable to other animals such as fish and mammals. Ultimately, this research may help scientists to understand how things can go wrong in human embryos.

PubMed Disclaimer

Conflict of interest statement

MG, TG, EL, MH No competing interests declared

Figures

Figure 1.
Figure 1.. GAF binds thousands of loci throughout the MZT.
(A) Images of His2Av-RFP; sfGFP-GAF(N) embryos at the nuclear cycles (NC) indicated above. sfGFP-GAF(N) localizes to puncta during interphase and is retained on chromosome during mitosis. His2AvRFP is shown in magenta. sfGFP-GAF(N) is shown in green. Scale bars, 5 µm. (B) Binding motif enrichment of GA-dinucleotide repeats at GAF peaks identified at stage 3 and stage 5 determined by MEME-suite (left). Distribution of the GA-repeat motif within peaks (right). Gray line indicates peak center. (C) Representative genome browser tracks of ChIP-seq peaks for GAF-sfGFP(C) from stage 3 and stage 5 embryos and GAF ChIP-seq from S2 cells (Fuda et al., 2015). (D) Venn diagram of the peak overlap for GAF as determined by ChIP-seq for GAF-sfGFP(C) from sorted stage 3 embryos and stage 5 embryos and by ChIP-seq for GAF from S2 cells (Fuda et al., 2015). Total number of peaks identified at each stage is indicated in parentheses. See also Figure 1—figure supplements 1–3.
Figure 1—figure supplement 1.
Figure 1—figure supplement 1.. N- and C- terminal sfGFP-tags label distinct GAF isoforms.
(A) Cartoon representation of the N-terminal sfGFP tag both GAF protein isoforms. (B) Cartoon representation of C-terminal sfGFP tag on the short GAF protein isoform. (C) Confocal images of living His2Av-RFP; GAF-sfGFP(C) embryos in interphase during NC10 and NC14 and mitosis during NC12. His2Av-RFP is shown in magenta, GAF-sfGFP(C) is shown in green. Scale bars, 5 µm. (D) Immunoblot on embryo extract from sfGFP-GAF(N) homozygous, GAF-sfGFP(C) homozygous, and w1118 lines 0–4 hr after egg laying (AEL) and 13–16 hr AEL with an anti-GFP antibody. αtubulin is shown as a loading control. Arrowhead indicates the long GAF isoform.
Figure 1—figure supplement 2.
Figure 1—figure supplement 2.. GAF binds thousands of regions in the embyo and S2 cells.
(A) Heatmap of stage 5 GAF-sfGFP(C) ChIP, GAF-sfGFP(C) input, w1118 ChIP, and w1118 input peaks (n = 4175). Immunoprecipitation was performed with an anti-GFP antibody. (B) Heatmap of stage 3 GAF-sfGFP(C) ChIP, GAF-sfGFP(C) input, w1118 ChIP, and w1118 input peaks (n = 3391). Immunoprecipitation was performed with an anti-GFP antibody. (C) Genomic distribution of Zld binding at NC8-10, NC13, and NC14 (Harrison et al., 2011) and GAF binding at stage 3 and stage 5. (D) Heatmaps of GAF binding in S2 cells (Fuda et al., 2015) and GAF-sfGFP(C) binding in stage 3 and stage 5 embryos for all 4175 regions with a stage 5 ChIP peak.
Figure 1—figure supplement 3.
Figure 1—figure supplement 3.. GAF has tissue-specific binding.
(A) Heatmap of GAF ChIP-seq peaks in S2 cells (Fuda et al., 2015) and GAF-sfGFP(C) ChIP-seq peaks from stage 3 and stage 5 embryos (this work), excluding peaks that are shared in all three datasets. The heatmap is divided into subclasses as labeled. (B) The top two motifs enriched in each subclass of peaks as identified by MEME suite. Note the top motif in each subclass is a GA-rich motif known to be bound by GAF.
Figure 2.
Figure 2.. Embryos lacking maternal GAF die during early embryogenesis with nuclear and mitotic defects.
(A) Images of control (maternal genotype: His2Av-RFP; sfGFP-GAF(N)) and GAFdeGradFP (maternal genotype: His2Av-RFP/nos-deGradFP; sfGFP-GAF(N)) embryos at NC14, demonstrating loss of nuclear GFP signal specifically in GAFdeGradFP embryos. His2Av-RFP marks the nuclei. (B) Hatching rates after > 24 hr. (C) Confocal images of His2Av-RFP in arrested/dying GAFdeGradFP embryos with blocky nuclei, mitotic arrest, and nuclear fallout. (D) Confocal images of His2Av-RFP in NC13 control and GAFdeGradFP embryos, showing disordered nuclei and anaphase bridges in GAFdeGradFP embryos. Scale bars, 50 µm except where indicated. See also Figure 2—figure supplement 1.
Figure 2—figure supplement 1.
Figure 2—figure supplement 1.. sfGFP-GAF(N) and GAF-sfGF(C) are expressed at similar levels in the early embryo.
Boxplots showing the mean fluorescent intensity (arbitrary units, AU) of 10 nuclei in GAF-sfGFP(C) homozygous embryos (n = 27) and sfGFP-GAF(N) homozygous embryos (n = 26). The box indicates the lower (25%) and upper (75%) quantiles, and the solid line indicates the median. p-value=0.52 as determined by Wilcoxon rank sum test.
Figure 3.
Figure 3.. GAF is required for zygotic genome activation.
(A) Volcano plot of transcripts mis-expressed in GAFdeGradFP embryos as compared to sfGFP-GAF(N) controls. Stage 5 GAF-sfGFP(C) ChIP-seq was used to identify GAF-bound target genes. (B) The percentage of up-regulated and down-regulated transcripts in GAFdeGradFP embryos classified as maternal, zygotic, or maternal-zygotic based on Lott et al., 2011. (C) Overlap of down-regulated embryonic transcripts in the absence of GAF or Zld activity (p<2.2×10−16, log2(odds ratio)=3.16, two-tailed Fisher’s exact test). Down-regulated genes for Zld are from McDaniel et al., 2019. (D) Transcripts down-regulated in both GAFdeGradFP and ZldCRY2 embryos or in either condition alone were classified based on temporal expression during ZGA (Li et al., 2014). Early = NC10-11, Mid = NC12-13, Late = early NC14, Later = late NC14. Only genes that were assigned to one of the four classes are shown. See also Figure 3—figure supplements 1–3.
Figure 3—figure supplement 1.
Figure 3—figure supplement 1.. RNA-seq replicates are reproducible.
(A) Pairwise scatterplots showing correlation between the three RNA-seq replicates for GAFdeGradFP and sfGFP-GAF(N) homozygous controls. Values are log2 (RPKM +1). Pearson correlation coefficients are indicated. (B) Heatmap of Pearson correlation coefficient between RNA-seq replicates for GAFdeGradFP and sfGFP-GAF(N) homozygous controls. (C) PCA plot of RNA-seq replicates from GAFdeGradFP and sfGFP-GAF(N) homozygous controls.
Figure 3—figure supplement 2.
Figure 3—figure supplement 2.. RNA-seq identifies genes mis-regulated in GAFdeGradFP embryos.
(A) MA plot of transcripts mis-expressed in GAFdeGradFP embryos as compared to sfGFP-GAF(N) homozygous controls. Stage 5 GAF-sfGFP(C) ChIP-seq was used to identify GAF-bound target genes. (B) log2 fold change of genes expressed at NC14 (Li et al., 2014, ‘later’ class) in GAFdeGradFP embryos compared to sfGFP-GAF(N) homozygous controls. Genes are divided by those that have a proximal GAF-binding site as identified by ChIP-seq (GAF target) and those that do not (non-GAF target). Color indicates those that are significantly changed in the GAFdeGradFP embryos as compared to sfGFP-GAF(N) homozygous controls. GAF-target genes are significantly more down-regulated than non-GAF target genes (p – 1.4 × 10−5, Wilcoxon rank sum test). (C) Gene Ontology (GO) term analysis was performed on transcripts down-regulated in GAFdeGradFP embryos compared to controls. (D) Gene Ontology (GO) term analysis was performed on transcripts up-regulated in GAFdeGradFP embryos compared to controls.
Figure 3—figure supplement 3.
Figure 3—figure supplement 3.. GAF regulates genes distinct from Zld and CLAMP.
(A) Overlap of transcripts down-regulated in GAFdeGradFP embryos, down-regulated in ZldCRY2 embryos at NC14 (McDaniel et al., 2019), and down-regulated in clamp-RNAi 2–4 hr embryos (Rieder et al., 2017). (B) Heatmap of GAF-sfGFP(C) ChIP peaks at stage 5 and CLAMP ChIP peaks in female embryos at NC14. CLAMP ChIP-seq from Rieder et al., 2019. (C) Immunoblot of extract from 2 to 2.5 hr AEL GAFdeGradFP and sfGFP-GAF(N) homozygous control embryos using anti-Zld antibody. Tubulin is shown as a loading control. (D) Immunoblot of extract from 2 to 2.5 hr AEL sfGFP-GAF(N) embryos in which zld was depleted (zld-RNAi) and in which zld levels were unperturbed (Control) probed with anti-GFP antibody. w1118 embryos are included as controls for anti-GFP immunoreactivity. Tubulin is shown as a loading control. (E) Overlap of transcripts up-regulated in GAFdeGradFP embryos and ZldCRY2 embryos (McDaniel et al., 2019) (p=0.64, log2(odds ratio)=−0.16, two-tailed Fisher’s exact test).
Figure 4.
Figure 4.. At the majority of loci, GAF and Zld bind chromatin independently.
(A) Overlap of Zld- and GAF-binding sites determined by GAF-sfGFP(C) stage 5 ChIP-seq and Zld NC14 ChIP-seq (Harrison et al., 2011). (B) Representative genome browser tracks of Zld and GAF ChIP-seq peaks at the tailless locus. (C) Immunoblot for Zld on embryo extracts from zld-RNAi; sfGFP-GAF(N) and sfGFP-GAF(N) control embryos harvested 2–3 hr AEL. Tubulin was used as a loading control. (D) Images of zld-RNAi; sfGFP-GAF(N) embryos and sfGFP-GAF(N) control embryos at NC10 and NC14 as marked. Arrowhead shows nuclear dropout, a phenotype indicative of zld loss-of-function. Scale bar, 10 µm. (E) Correlation between log2(RPKM) of ChIP peaks for GAF from GAF-sfGFP(N) stage 5 embryos (control) and zld-RNAi; sfGFP-GAF(N) embryos (zld-RNAi). Color highlights significantly changed peaks (adjusted p-value<0.05, fold change > 2) and those that are bound by both GAF and Zld as indicated below. (F) Correlation between log2(RPKM) of ChIP peaks for Zld from sfGFP-GAF(N) embryos (control) and GAFdeGradFP embryos fixed 2–2.5 hr AEL. Color highlights significantly changed peaks (adjusted p-value<0.05, fold change > 2) and those that are bound by both GAF and Zld as indicated below. See also Figure 4—figure supplements 1–2.
Figure 4—figure supplement 1.
Figure 4—figure supplement 1.. GAF and Zld bind to shared and unique regions of the genome.
(A) Heatmap of GAF-sfGFP(C) ChIP peaks at stage 5 and Zld ChIP peaks at NC14. Zld ChIP-seq from Harrison et al., 2011. (B) Heatmap of anti-GFP ChIP-seq peaks from GAF-sfGFP(C) homozygous stage 5 embryos and sfGFP-GAF(N) heterozygous 2–2.5 hr AEL embryos. ChIP peaks show reproducible identification of the highest peaks.
Figure 4—figure supplement 2.
Figure 4—figure supplement 2.. Independent chromatin binding by GAF and Zld.
(A) Heatmap of high confidence, anti-GFP ChIP-seq peaks from 2 to 2.25 hr AEL sfGFP-GAF(N) and zld-RNAi;sfGFP-GAF(N) embryos. The heatmap is divided into sites that have both GAF and Zld binding and where GAF binds independently of Zld. (B) Heatmap of high confidence, anti-Zld ChIP-seq peaks from sfGFP-GAF(N) homozygous control and GAFdeGradFP embryos at 2–2.5 hr AEL. The heatmap is divided into sites that have both GAF and Zld binding and where Zld binds independently of GAF. (C) Correlation between ranked peak height of ChIP for GFP from (A). Peaks are ranked such that the highest peaks have the highest ranking. Color indicates those regions that are bound by both GAF and Zld. (D) Correlation between ranked peak heights of ChIP for Zld from (B). Peaks are ranked such that the highest peaks have the highest ranking. Color indicates those regions that are bound by both GAF and Zld.
Figure 5.
Figure 5.. GAF is required for chromatin accessibility.
(A) Volcano plot of regions that change in accessibility in GAFdeGradFP embryos as compared to sfGFP-GAF(N) controls, stage 5 GAF-sfGFP(C) ChIP-seq was used to identify GAF-bound target regions. (B) Genomic distribution of all regions that lose accessibility (Lose accessibility (all)), regions that lose accessibility and are GAF-bound (Lose accessibility (GAF-bound), regions that did not change significantly in accessibility (Non-significant), and regions that gain accessibility (Gain accessibility)). (C) Genome browser tracks of GAF-sfGFP(C) ChIP-seq at stage 3 and stage 5, ATAC-seq on wild-type embryos at NC12, NC13, and stage 5 along with control (sfGFP-GAF(N)) and GAFdeGradFP embryos, and RNA-seq from control (sfGFP-GAF(N)) and GAFdeGradFP embryos. NC12 and NC13 ATAC-seq data is from Blythe and Wieschaus, 2016. ATAC-seq data for stage 5 embryos is from Nevil et al., 2020. Region highlighted in green indicates the GAF-dependent, GAF-bound Ubx promoter. See also Figure 5—figure supplement 1.
Figure 5—figure supplement 1.
Figure 5—figure supplement 1.. GAF is required for chromatin accessibility.
(A) Heatmap of Pearson correlation coefficients between ATAC-seq replicates for GAFdeGradFP and sfGPF-GAF(N) control embryos. (B) MA plot of regions that change in accessibility in GAFdeGradFP embryos as compared to sfGFP-GAF(N) controls. GAF binding was determined using GAF-sfGFP(C) ChIP-seq from stage 5 embryos. (C) Heatmap of the ATAC seq data for GAFdeGradFP embryos and sfGFP(N) controls subdivided by whether the region increased, decreased or remained unchanged in accessibility in the GAFdeGradFP embryos as compared to the controls. (D) Average GAF-sfGFP(C) ChIP-seq signal from stage 5 embryos for regions that increased, decreased or remained unchanged in accessibility in the GAFdeGradFP embryos as compared to the sfGFP-GAF(N) controls.
Figure 6.
Figure 6.. GAF and Zld independently shape chromatin accessibility over the MZT.
(A) Heatmaps of ChIP-seq and ATAC-seq data, as indicated above, for regions bound by both GAF and Zld and subdivided based on the change of accessibility in the absence of either factor. (B) Number of regions that depend on GAF for accessibility and are bound by GAF that are accessible at NC11, NC12, and NC13. (C) Number of regions that depend on Zld for accessibility and are bound by Zld that are accessible at NC11, NC12, and NC13. NC11, NC12, and NC13 data are from Blythe and Wieschaus, 2016. Zld ChIP-seq data are from Harrison et al., 2011. ATAC-seq data from zld germline clones (zld-) are from Hannon et al., 2017. Total number of regions accessible at NC11 = 3084, NC12 = 6487, and NC13 = 9824. See also Figure 6—figure supplements 1–2.
Figure 6—figure supplement 1.
Figure 6—figure supplement 1.. A subset of regions bound by GAF, and not Zld, depend on GAF for accessibility.
Heatmaps of regions that are bound by GAF (excluding GAF and Zld co-bound sites) as determined by GAF-sfGFP(C) stage 5 ChIP-seq subdivided by whether or not these regions change in accessibility upon GAF depletion.
Figure 6—figure supplement 2.
Figure 6—figure supplement 2.. Regions that gain accessibility late during the MZT are accessible in GAFdeGradFP embryos used for ATAC-seq.
(A) Genome browser tracks showing a region that gains accessibility at NC13 that is maintained in GAFdeGradFP embryos (dark-blue shading) and a region that gains accessibility at stage 5, is bound by GAF at stage 3 and stage 5, and at which accessibility is decreased in GAFdeGradFP embryos (light-green shading). (B) Genome browser tracks showing regions that gain accessibility at NC13 and are maintained in GAFdeGradFP embryos (dark-blue shading) and a region that gains accessibility at stage 5 that is maintained in the GAFdeGradFP embryos (light-blue shading).
Figure 7.
Figure 7.. Zld and GAF independently regulate embryonic reprogramming.
(A) During the minor wave of ZGA (NC10-13), Zld is the predominant factor required for driving expression of genes bound by Zld alone and genes bound by both Zld and GAF. (B) As the genome is more broadly activated during NC14, GAF becomes the major factor in driving zygotic transcription. (C) Early in the MZT, Zld is required for chromatin accessibility at many regions co-bound by GAF and Zld. When zld is depleted, accessibility is lost at a subset of regions, but GAF remains bound at the majority of sites. (D) Late during the MZT, GAF is required for chromatin accessibility at many Zld-bound regions. In GAFdeGradFP embryos accessibility is lost at a subset of sites, but Zld remains bound.

Similar articles

Cited by

References

    1. Bailey TL, Boden M, Buske FA, Frith M, Grant CE, Clementi L, Ren J, Li WW, Noble WS. MEME SUITE: tools for motif discovery and searching. Nucleic Acids Research. 2009;37:W202–W208. doi: 10.1093/nar/gkp335. - DOI - PMC - PubMed
    1. Bejarano F, Busturia A. Function of the Trithorax-like gene during Drosophila development. Developmental Biology. 2004;268:327–341. doi: 10.1016/j.ydbio.2004.01.006. - DOI - PubMed
    1. Benyajati C, Mueller L, Xu N, Pappano M, Gao J, Mosammaparast M, Conklin D, Granok H, Craig C, Elgin S. Multiple isoforms of GAGA factor, a critical component of chromatin structure. Nucleic Acids Research. 1997;25:3345–3353. doi: 10.1093/nar/25.16.3345. - DOI - PMC - PubMed
    1. Bernués J, Piñeyro D, Kosoy A. General, negative feedback mechanism for regulation of Trithorax-like gene expression in vivo: new roles for GAGA factor in flies. Nucleic Acids Research. 2007;35:7150–7159. doi: 10.1093/nar/gkm590. - DOI - PMC - PubMed
    1. Bhat KM, Farkas G, Karch F, Gyurkovics H, Gausz J, Schedl P. The GAGA factor is required in the early Drosophila embryo not only for transcriptional regulation but also for nuclear division. Development. 1996;122:1113–1124. - PubMed

Publication types

MeSH terms