Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Sep 16;38(18):e101220.
doi: 10.15252/embj.2018101220. Epub 2019 Aug 12.

The interactome of KRAB zinc finger proteins reveals the evolutionary history of their functional diversification

Affiliations

The interactome of KRAB zinc finger proteins reveals the evolutionary history of their functional diversification

Pierre-Yves Helleboid et al. EMBO J. .

Abstract

Krüppel-associated box (KRAB)-containing zinc finger proteins (KZFPs) are encoded in the hundreds by the genomes of higher vertebrates, and many act with the heterochromatin-inducing KAP1 as repressors of transposable elements (TEs) during early embryogenesis. Yet, their widespread expression in adult tissues and enrichment at other genetic loci indicate additional roles. Here, we characterized the protein interactome of 101 of the ~350 human KZFPs. Consistent with their targeting of TEs, most KZFPs conserved up to placental mammals essentially recruit KAP1 and associated effectors. In contrast, a subset of more ancient KZFPs rather interacts with factors related to functions such as genome architecture or RNA processing. Nevertheless, KZFPs from coelacanth, our most distant KZFP-encoding relative, bind the cognate KAP1. These results support a hypothetical model whereby KZFPs first emerged as TE-controlling repressors, were continuously renewed by turnover of their hosts' TE loads, and occasionally produced derivatives that escaped this evolutionary flushing by development and exaptation of novel functions.

Keywords: KAP1; KRAB zinc finger proteins; coelacanth; evolution; mass spectrometry.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no conflict of interest.

Figures

Figure 1
Figure 1. KZFP IFs and interactome
  1. A

    IF by confocal microscopy performed on KRAB‐HA‐overexpressing 293T cells. Staining was performed with anti‐HA (Alexa‐488, green) and anti‐HSPA5 (Alexa‐647, red) antibodies, and DNA was stained with DAPI (blue). The scale bar represents 20 μm.

  2. B

    Force‐directed network representing high‐confidence interactions. Each bait KZFP is represented by a blue circle (“node”) and linked to green nodes that represent its preys. KAP1 is represented in red and its associated proteins SIRT1, SMARCAD1, and HP1α and HP1γ in orange. All interactions represented are below the false discovery rate of 1%. The topology of the network is established by a force‐directed process that follows certain rules: All nodes repel each other and are attracted to the center by artificial “gravity”, and nodes with links attract each other. Weighted links between any nodes are based on the average fold change over controls.

  3. C

    Zoom‐in on the KAP1‐centered core of the interactome.

  4. D, E

    Zoom‐in on KZFPs enriched with unique interactors.

  5. F

    Zoom‐in on the subset of KZFPs not connected to the main interactome.

Figure EV1
Figure EV1. General AP‐MS protocol and quality controls
  1. Graphical representation of AP‐MS protocol major steps and filters used for the computational analysis.

  2. Boxplot presenting the number of shared interactors for KZFP couples sharing more than 10% of their binding sites in a reciprocal fashion compared to the number of shared interactors for the same amount of random KZFP couples. Boxplots are shown as median, and 25th (Q1) and 75th (Q3) percentiles. The upper whisker extends to the last data point less than Q3 + 1.5*IQR, where IQR = Q3–Q1. Similarly, the lower whisker extends to the first data point greater than Q1 – 1.5*IQR.

  3. Number of interactions observed in previous studies (Table EV2) associated with their corresponding KZFP. The method used for the interaction detection is indicated by a specific color.

Figure EV2
Figure EV2. KZFP common interactors
  1. SIRT1 interaction validation. HA‐tagged KZFPs transduced cell lines were used for immunoprecipitations. HA immunoprecipitation of weak KAP1 interactor ZKSCAN3 and candidates ZNF138 and ZNF793, followed by the detection of endogenous SIRT1 in the immunoprecipitates through Western blot using an anti‐SIRT1 antibody. Input = cellular lysate, IP = immunoprecipitate. Western blot using an anti‐HA antibody on the IPs at the bottom.

  2. Venn diagram representing all the common preys that were detected in 5 and more KZFP interactomes. In pink are shown the common preys that only appear in interactomes alongside KAP1, in blue the prey that only appears in interactomes devoid of KAP1, and in purple the preys that are in both type of interactomes.

  3. IPO7 interaction validation. HA‐tagged KZFPs transduced cell lines were used for immunoprecipitations. HA immunoprecipitation of predominantly cytoplasmic ZNF283 and candidates ZNF577 and ZNF765, followed by the detection of endogenous IPO7 through Western blot using an anti‐IPO7 antibody. Input = cellular lysate, IP = immunoprecipitate. Western blot using an anti‐HA antibody on the IPs at the bottom.

Source data are available online for this figure.
Figure 2
Figure 2. SCAN and DUF3669 domains are involved in oligomerization
  1. Force‐directed network representing the SCAN interactome displaying SCAN‐containing baits and their SCAN‐containing preys.

  2. HA immunoprecipitation of stably expressed ZNF282‐HA in cells previously transfected with ΔDUF3669‐ZNF398‐GFP and WT‐ZNF398‐GFP followed by the detection of ZNF398 constructs in the IPs through Western blot using an anti‐GFP antibody. Bottom: Western blot using an anti‐HA antibody on the IPs. Input = cellular lysate, IP = immunoprecipitate.

  3. Left: DUF3669‐only and KRAB domain constructs used. Right: HA immunoprecipitation in cells previously co‐transfected with ZNF398‐DUF3669‐HA and either ZNF282‐KRAB‐Flag or ZNF282‐DUF3669‐Flag followed by the detection of either of these protein constructs in the IPs through Western blot using an anti‐Flag antibody. Bottom: Western blot using an anti‐HA antibody on the IPs.

Source data are available online for this figure.
Figure 3
Figure 3. A sub‐population of KZFPs displays rare and unique interactors
  1. Venn diagram representing all preys detected in less than 5 and more than 1 KZFP interactomes. In blue, are shown the preys that only appear in interactomes alongside KAP1, in red the preys that only appear in interactomes devoid of KAP1, and in purple the preys that are in both types of interactomes.

  2. This table displays all the KZFPs associating with 3 or more unique interactors, the identity of these interactors as well as the features associated with them.

  3. Venn diagram representing the binding sites overlaps between ZNF764, TFIIIC subunit GTF3C2, and CTCF. The GTF3C2 bedfile was obtained from Encode (ENCFF002CYL), and CTCF bedfile was obtained from Encode version 3. The overlap and resulting P‐values were obtained using the Bedtools Fisher exact test.

  4. Histograms representing the percentage of KZFPs (identified in Fig 3B) binding sites falling in TEs (red) or TSSs (blue).

  5. KZFPs binding sites on L1PA5 and L1PA6. Top: Boxes containing the interactors of KZFPs found enriched on L1PA5 and L1PA6. Middle: plot showing the average ChIP‐exo signals (scaled between 0 and 1) for each selected KZFP plotted on top of L1PA5 and L1PA6 multiple sequence alignment (MSA). Bottom: schematic representation of L1PA5 and L1PA6 different domains.

Figure 4
Figure 4. Old KZFPs display an unusual and conserved KRAB domain devoid of KAP1 binding
  1. KAP1 force‐directed network. The distance KZFP‐KAP1 is inversely proportional to the measured KAP1 enrichment over controls (KAP1FC). KZFPs with KAP1FC values above three were colored in blue and the ones below in green.

  2. Boxplot representing the KAP1FC value of our baits in function of the percentage of their binding sites falling in TEs. Mann–Whitney two‐sided rank test.

  3. Human KZFPs KRAB domain A‐boxes protein phylogenetic tree associated with their corresponding amino acid sequences colored according to the Clustal Zappo color scheme (residues sharing common physicochemical properties display the same color: http://www.jal-view.org/help/html/colourSchemes/zappo.html). This figure also displays a zoom‐in on the variant KRAB domain cluster (right). On this zoom‐in, the tested vKZFPs, whose interactomes were defined in our study, were marked by an asterisk.

  4. Box plot representing KAP1FC values in function of KZFPs age. On a superimposed swarm plot, the individual vKZFPs corresponding KAP1FC values were represented by red dots and for their sKZFPs counterparts, by blue dots. Mann–Whitney two‐sided rank test.

  5. Boxplot representing the number of interactors of KZFPs in function of their evolutionary age. On a superimposed swarm plot, individual KZFPs number of preys were represented by dots. When green, the interactome of this KZFP contained 3 or more unique interactors (Fig 3B).

  6. Boxplot representing the average dn/ds ratios for all vK (red) and sK (blue) domains displaying the same evolutionary ages. Mann–Whitney two‐sided rank test.

Data information: Boxplots are shown as median, and 25th (Q1) and 75th (Q3) percentiles. The upper whisker extends to the last data point less than Q3 + 1.5*IQR, where IQR = Q3–Q1. Similarly, the lower whisker extends to the first data point greater than Q1 – 1.5*IQR.
Figure EV3
Figure EV3. Impact of the composition, age, and structure of the KRAB domain on KAP1 recruitment
  1. Swarm plot representing the observed range of KAP1FC values as well as the determination of a strong KAP1 interaction threshold of 3.

  2. Double representation of KAP1 interaction for selected strong and weak KAP1‐interacting KZFPs. Top: Experimentally determined KAP1FC values are represented by a histogram. Bottom: KAP1 interaction determined by immunoprecipitations. HA‐tagged KZFPs transduced cell lines were used for immunoprecipitations. Input = cellular lysate, IP = immunoprecipitate. HA immunoprecipitation of KZFPs followed by endogenous KAP1 detection through Western blot using an anti‐KAP1 antibody. Western blot using an anti‐HA antibody on the IPs at the bottom.

  3. Violin plot representing the KAP1FC values for KZFPs in function of the presence of a B‐box in their KRAB domain.

  4. Simple phylogenetic tree representing the time of divergences of coelacanth and different tetrapod clades from the Homo sapiens lineage.

  5. Evolutionary age of all sKZFPs and vKZFPs present in the UniProt database and our previous study (Imbeault et al, 2017). Mann–Whitney two‐sided rank test.

Data information: Boxplots are shown as median, and 25th (Q1) and 75th (Q3) percentiles. The upper whisker extends to the last data point less than Q3 + 1.5*IQR, where IQR = Q3–Q1. Similarly, the lower whisker extends to the first data point greater than Q1 – 1.5*IQR.Source data are available online for this figure.
Figure EV4
Figure EV4. vKZFPs loss‐of‐function mutations in the human population
Z‐score for the number of loss‐of‐function mutations found in the canonical transcripts (ZLoF) of vKZFP or sKZFP based on human exome and whole genome data obtained from the gnomAD database. Unpaired Wilcoxon test. Data information: Boxplots are shown as median, and 25th (Q1) and 75th (Q3) percentiles. The upper whisker extends to the last data point less than Q3 + 1.5*IQR, where IQR = Q3–Q1. Similarly, the lower whisker extends to the first data point greater than Q1 – 1.5*IQR.
Figure 5
Figure 5. Interactome‐based Pearson correlation of KZFPs
Individual KZFP interactomes were compared by Pearson correlation analysis of prey protein fold change over control (Log2FC) values. Accordingly, KZFP bait proteins are ordered based on the similarity of their prey protein recovery profile.
Figure 6
Figure 6. KZFPs present in the last common ancestor of coelacanth and tetrapods were KAP1 binders
  1. Consensus sequences for KRAB A‐boxes from top to bottom: coelacanth KRAB domains, coelacanth PRDM9 KRAB domain, human PRDM9 KRAB domain, and human KRAB domains. Residues conserved between the first three sequences and not present in human KRAB consensus were highlighted in red. In the human KRAB consensus, residues crucial for KAP1 recruitment were highlighted in blue (Margolin et al,1994).

  2. Immunoprecipitations of HA‐tagged KRAB domains in order to check interaction with cKAP1. Co‐transfection of Flag‐tagged cKAP1 (upper panel) or Flag‐tagged hKAP1 (lower panel) and HA‐tagged ZNF398 DUF3669 domain negative control, H2ZYM4 and M3XKG1 cKRAB domains, and ZNF93 hKRAB domain in KAP1 KO HAP1 cells followed by HA immunoprecipitation. cKAP1 and hKAP1 presence was revealed by Western blot using an anti‐Flag antibody. Input = cellular lysate, IP = immunoprecipitate. Western blot using an HA antibody on the IPs at the bottom.

  3. Evolutionary KRAB model: On top, a simple phylogenetic tree represents the links between echinoderms, coelacanth, and human. At the bottom, different KRAB domain versions ranging from the ancestral PRDM9‐related KRAB domain to the classical KAP1 binding human KRAB are represented. Their position on this model depends (i) under which species or clades they were detected, hence reflecting the last common ancestor in which they putatively appeared and (ii) whether they bind KAP1, in the blue rectangle, or not, in the pink rectangle. Briefly, the echinoderm‐conserved PRDM9‐related KRAB domain is the putative ancestor of KAP1‐binding KRAB domain that emerged in the coelacanth. The KRAB domain and KAP1 co‐evolved, maintaining their association in tetrapods. Meanwhile, certain KZFPs have lost KAP1 binding to become vKZFPs.

Source data are available online for this figure.
Figure EV5
Figure EV5. Human and coelacanth KRAB domains
  1. Phylogenetic tree built based on amino acid sequence alignment of human KZFP KRAB A‐boxes and UniProt‐determined coelacanth KZFP KRAB A‐boxes domains, bottom: zoom‐in on the cluster containing all of the coelacanth KRAB domains, coelacanth KRAB domains are indicated in green while hPRDM9 KRAB is indicated in red.

  2. Immunoprecipitation of HA‐tagged hKAP1 domain in order to check interaction with cKAP1. Co‐transfection of HA‐tagged hKAP1 and Flag‐tagged hKAP1, negative control ΔRBCC cKAP1 (the RBCC domain mediates KAP1 oligomerization), and cKAP1 in 293T cells followed by HA immunoprecipitation. The presence of different KAP1 constructs was revealed by Western blot using an anti‐Flag antibody. Input = cellular lysate, IP = immunoprecipitate. Western blot using an anti‐HA antibody on the IPs at the bottom.

  3. Immunoprecipitation of HA‐tagged human and coelacanth PRDM9 KRAB domains in order to check interaction with cKAP1. Co‐transfection of Flag‐tagged cKAP1 and HA‐tagged ZNF398 DUF3669 domain, H2ZYM4 cKRAB domain, and PRDM9 hKRAB and cKRAB domains in KAP1 KO HAP1 cells followed by HA immunoprecipitation. cKAP1 presence was revealed by Western blot using an anti‐Flag antibody. Input = cellular lysate, IP = immunoprecipitate. Western blot using an anti‐HA antibody on the IPs at the bottom.

Source data are available online for this figure.

Similar articles

Cited by

References

    1. Bastian M, Heymann S, Jacomy M (2009) Gephi: an open source software for exploring and manipulating networks. Third Int AAAI Conf Weblogs Soc Media 361–362
    1. Birtle Z, Ponting CP (2006) Meisetz and the birth of the KRAB motif. Bioinformatics 22: 2841–2845 - PubMed
    1. Boissinot S, Furano AV (2001) Adaptive evolution in LINE‐1 retrotransposons. Mol Biol Evol 18: 2186–2194 - PubMed
    1. Bourque G, Leong B, Vega VB, Chen X, Yen LL, Srinivasan KG, Chew JL, Ruan Y, Wei CL, Huck HN et al (2008) Evolution of the mammalian transcription factor binding repertoire via transposable elements. Genome Res 18: 1752–1762 - PMC - PubMed
    1. Castro‐Diaz N, Ecco G, Coluccio A, Kapopoulou A, Yazdanpanah B, Friedli M, Duc J, Jang SM, Turelli P, Trono D (2014) Evolutionally dynamic L1 regulation in embryonic stem cells. Genes Dev 28: 1397–1409 - PMC - PubMed

Publication types

Associated data