Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Jul 5;23(7):e53874.
doi: 10.15252/embr.202153874. Epub 2022 Jun 23.

Structural mechanism underpinning Thermus oshimai Pif1-mediated G-quadruplex unfolding

Affiliations

Structural mechanism underpinning Thermus oshimai Pif1-mediated G-quadruplex unfolding

Yang-Xue Dai et al. EMBO Rep. .

Abstract

G-quadruplexes (G4s) are unusual stable DNA structures that cause genomic instability. To overcome the potential barriers formed by G4s, cells have evolved different families of proteins that unfold G4s. Pif1 is a DNA helicase from superfamily 1 (SF1) conserved from bacteria to humans with high G4-unwinding activity. Here, we present the first X-ray crystal structure of the Thermus oshimai Pif1 (ToPif1) complexed with a G4. Our structure reveals that ToPif1 recognizes the entire native G4 via a cluster of amino acids at domains 1B/2B which constitute a G4-Recognizing Surface (GRS). The overall structure of the G4 maintains its three-layered propeller-type G4 topology, without significant reorganization of G-tetrads upon protein binding. The three G-tetrads in G4 are recognized by GRS residues mainly through electrostatic, ionic interactions, and hydrogen bonds formed between the GRS residues and the ribose-phosphate backbone. Compared with previously solved structures of SF2 helicases in complex with G4, our structure reveals how helicases from distinct superfamilies adopt different strategies for recognizing and unfolding G4s.

Keywords: G-quadruplexes; G4-Recognizing Surface; ToPif1; X-ray; structures.

PubMed Disclaimer

Figures

Figure 1
Figure 1. Binding and unwinding activities of ToPif1 for different configurations of antiparallel and parallel G4s
  1. A

    Comparison of the binding activity of ToPif1 for different configurations of antiparallel (G4Tel, black points) and parallel (G4CEB, red points) G4s. (Insert: Comparison of the binding activity of ToPif1 to single‐stranded DNA (T12, black points) with G4Tel, red points), the blue points represent the double‐strand DNA (D24)). The binding activity was measured using steady‐state fluorescence anisotropy assays; 5 nM of fluorescein‐labeled DNA substrate and 0.5 mM ADP·AlF4 were titrated with increasing protein concentrations at 37 °C. The different binding curves represent an average of three to four independent experiments for each substrate with standard deviations. The solid line represents the fit of data according to equation (1).

  2. B

    Stopped‐flow DNA unwinding kinetics of ToPif1 for different configurations of G4 DNA (G4AP‐424, 4G4AP‐333, G4P‐214) under multiple turnover conditions. All curves represent the average of at least 10 individual traces and the plots are representative of three independent experiments. 4 nM G4 DNA and 100 nM ToPif1 were used under experimental conditions as described in ‘Materials and Method’.

  3. C, D

    The typical smFRET trajectories of antiparallel (smG4AP‐424, sm4G4AP‐333) and parallel (smG4P‐111) G4 unwinding catalyzed by 50 nM ToPif1 and 150 μM ATP. Black arrows in the solid style indicate the time of adding proteins. Dashed arrows in black represent the unwinding time of ToPif1 with different G4 DNA substrates.

Figure 2
Figure 2. Structure analysis of ToPif1 complexed with G‐quadruplex (G4) DNA and ADP·AlF4
  1. Overall structure of ToPif1‐G4T6/T8‐ADP·AlF4 ternary complex in cartoon mode with the conserved domains 1A (green), 1B (orange), 2A (pink), 2B (blue), and C‐terminal (CTD, brown). G4 DNA (G4T6/T8) is colored in red and ADP·AlF4 is shown as a black stick. ToPif1 molecules binding at the 5′‐ and 3′‐ends of G4 DNA are labeled as molecule a and molecule b, respectively.

  2. Surface view after cutting the crystal structure ToPif1‐G4T6/T8‐ADP·AlF4 ternary complex and the domains are colored as in (A).

  3. Structural superposition for the structures of molecules a and b with the mass centers of domain 2B shown as a sphere (mass centers of molecules a colored in blue and that of molecules b colored in cyan).

  4. Comparison of G4T6/T8 (red) in the ToPif1 ternary complex, G4Myc (PDB: 5VHE; Chen et al, 2018) colored in cyan and G4 ‘T‐loops’ (PDB: 6LDM; Traczyk et al, 2020) colored in yellow.

  5. Residues of the molecule a that interact with G4 layers in the ToPif1‐G4T6/T8‐ADP·AlF4 ternary complex; G4 DNA (G4T6/T8) is shown in red and the interacting residues are represented as sticks.

Figure EV1
Figure EV1. ToPif1 recognition for G4 and ssDNA
  1. CD experiments of G4 DNA.

  2. Left panel: DNA binding of ToPif1 to G4CEB with 1 μM T12 (black line) or 1 μM G4CEB (blue line) or none of them (red line). The binding of fitting of the data to the equation (1) yields a K d,app value for ToPif1‐G4CEB of 30.30 ± 1.17 nM, a K d,app value for ToPif1‐G4CEB (added with 1 μM T12) of 30.91 ± 2.32 nM. Right panel: DNA binding of ToPif1 to T12 with 1 μM G4CEB (red line) or 1 μM T12 (blue line) or none of them (black line). Fitting of the data to the equation (1) yields a K d,app value for ToPif1‐T12 of 29.10 ± 0.53 nM, a K d,app value for ToPif1‐T12 (added with 1 μM G4CEB) of 26.50 ± 0.71 nM. DNA binding assays were carried out in buffer A (25 mM Tris–HCl (pH 7.5), 50 mM NaCl, 2 mM MgCl2, and 2 mM DTT) with 0.5 mM ADP·AlF4.

  3. Stopped‐flow unwinding kinetic curves of ToPif1 unwinding G4 DNA (G4AP‐424, G4P‐214, 4G4AP‐333) in buffer A with 1 mM ADP·AlF4.

  4. CD analysis of G4AP‐424 and G4P‐214.

  5. Normalized melting curves from CD melting assays for G4AP‐424 and G4P‐214 at different temperatures (20–75 °C).

  6. Cartoon representation of ToPif1 (molecule a) with G4T6/T8 in the presence of ADP·AlF4 shown as a black stick and loop3 highlighted in magenta.

  7. Cartoon representation of ToPif1‐dT15‐ADP·AlF4 (PDB: 6S3 M; Dai et al, 2021) with ADP·AlF4 shown as a black stick and loop3 highlighted in magenta.

  8. Cartoon representation of BaPif1‐dT10‐ ADP·AlF4 (PDB: 5FHE; Zhou et al, 2016) with ADP·AlF4 shown as a black stick and loop3 highlighted in magenta.

  9. Cartoon representation of ScPif1p‐poly (G3T5)‐ADP·AlF4 (PDB: 5O6B; Lu et al, 2018) with ADP·AlF4 shown as a black stick and loop3 highlighted in magenta.

Figure EV2
Figure EV2. Comparison of domain 2B of the molecule a of the ToPif1‐G4T6/T8‐ADP·AlF4 ternary complex and the previously determined ToPif1‐ss/dsDNA structure (PDB: 6S3H; Dai et al, 2021)
  1. Cartoon representation of molecule a in the ToPif1‐G4T6/T8‐ADP·AlF4 structure with domain 2B and the center mass (sphere) of domain 2B is highlighted in blue. G‐quadruplex (G4) DNA is shown in red.

  2. Cartoon representation of the ToPif1‐S7D11‐ADP·AlF4 structure (PDB: 6S3H; Dai et al, 2021) with domain 2B shown in magenta. The center mass of domain 2B is shown as a red sphere.

  3. Superposition on domain 1A of the structures in (A) and (B).

  4. Domain 2B and G4 DNA of molecule a in the ToPif1‐G4T6/T8‐ADP·AlF4 structure with the close‐up view of interactions between ToPif1 and the G4 tetrad I.

  5. Domain 2B and modeled duplex DNA in the ToPif1‐S7D11‐ADP·AlF4 structure (PDB: 6S3H; Dai et al, 2021).

  6. Superposition on domain 1A of the structure in (C) with only domain 2B and G4 DNA being shown.

Figure EV3
Figure EV3. SAXS results of ToPif1‐G4T6/T8‐ADP·AlF4
  1. Fit curve of the SAXS data of ToPif1‐ G4T6/T8‐ADP·AlF4 with two modeled ToPif1 molecules bound to the substrate G4T6/T8 calculated with Crysol.

  2. The model of ToPif1‐G4T6/T8‐ADP·AlF4 superimposed on the ab initio envelope calculated with DAMMIF.

  3. The SAXS Model of ToPif1‐G4T6/T8 ‐ADP·AlF4.

  4. Crystal structure of ToPif1‐G4T6/T8 ‐ADP·AlF4.

  5. Superposition for SAXS model and crystal structure of ToPif1‐G4T6/T8‐ADP·AlF4.

Figure EV4
Figure EV4. Functional validation of ToPif1 mutants and ligand interaction diagram of T6/G7 in the ToPif1 complex compared with the guanine‐specific pocket (GSP) in the CsRecQ structure
  1. CD results of ToPif1 and its mutants.

  2. ATP hydrolysis of ToPif1 and its mutants with or without DNA effector (S18H11).

  3. Ligand interaction between T6 and ToPif1 in the ToPif1‐G4T6/T8‐ ADP·AlF4 structure.

  4. Ligand interaction between G7 and ToPif1 in the ToPif1‐GR17‐ ADP·AlF4 structure (PDB: 7BIL; Dai et al, 2021).

  5. Ligand interaction between G21 and CsRecQ in CsRecQ‐ssDNA structure (PDB: 6CRM; Voter et al, 2018).

Figure EV5
Figure EV5. Sequence alignment of Pif1 helicase from bacteria, humans, and yeast
Thermus bacteria Pif1 (1–8) are from Thermus oshimai (ToPif1), Thermodesulfovibrio yellowstonii (TyPif1), Sulfurihydrogenibium sp (SuPif1), Anaerobaculum hydrogeniformans (AhPif1), Thermodesulfobacterium commune (TcPif1), Deferribacter desulfuricans (DePif1), Mycolicibacterium thermoresistibile (MtPif1), and Synergistales bacterium (SbPif1). Bacteria at room temperature are from Bacteroides sp (BsPif1). Homo sapiens and Saccharomyces cerevisiae are labeled as HsPif1 and ScPif1. Residues involved in G4‐Recognizing Surface (GRS) are shown with green stars.
Figure 3
Figure 3. Atomic interactions between the integral G‐quadruplex (G4) DNA and ToPif1
  1. A

    Interactions of ToPif1 and the G4 layer I.

  2. B

    The negative–negative repulsion interaction between E397 and the G7 of G4T6/T8 in the ToPif1 ternary complex structure. The electrostatic potential at ± 5 kTe −1 was colored in blue (basic/positive), white (neutral), and red (acidic/negative).

  3. C, D

    Interaction of ToPif1 and G4 layers II and III, respectively. Molecular electrostatic potential map calculated with G4 DNA omitted from the co‐crystal structure (blue to red, ± 5 kTe −1).

Figure 4
Figure 4. Structural superposition of ToPif1 with the G4T6/T8 and the GR17
  1. Interaction between ToPif1 and the T6 of G4T6/T8 in the ToPif1‐G4T6/T8‐ADP·AlF4 ternary complex.

  2. Superposition on domain 1A of structures in ToPif1‐G4T6/T8 and ToPif1‐GR17.

  3. Interaction between ToPif1 and the G7 of GR17 in the ToPif1‐GR17‐ADP·AlF4 ternary complex.

  4. Structural superposition of interacting residues for ToPif1 with the T6 of G4T6/T8 and the G7 of GR17.

Figure 5
Figure 5. G‐quadruplex (G4) unwinding activity of wild‐type ToPif1 (ToPif1) and its mutants
  1. A

    Stopped‐flow kinetics of the G4 (AP‐S16‐TelG4) unwinding activity of ToPif1 and the various modified proteins. The different unwinding curves represent an average of three to four independent experiments for each substrate. The solid line represents the fit of data according to equation (2).

  2. B

    Bar plot of k cat/K d,app values in Table 2 (n = 3 biological replicates). The k cat/K d,app values of G4 in descending numerical order (front panel), and these are determined with the dsDNA (S26D17, back panel). Data are presented as means ± SD.

  3. C–F

    The typical smFRET trajectories of antiparallel G4 (smG4AP‐424) unwinding catalyzed by different ToPif1 mutants based on single‐molecular FRET assays. The dashed magenta, black, and red lines represent the states of G4: fully folded, incompletely folded, and fully unfolded, respectively. The concentrations of the R355A (C), R135A (D), R150A (E), and R419A (F) mutants were 50 nM with 150‐μM ATP. The experimental conditions are described in Methods. Black arrows in the solid style indicate the time of adding proteins.

Figure 6
Figure 6. Stimulating effect upon mutation of E397
  1. A

    Stopped‐flow kinetics of the antiparallel G4 (AP‐S16‐TelG4) unwinding activity of ToPif1 and its mutants. The different curves represent an average of three to four independent experiments for each substrate. The solid line represents the fit of data according to equation (2).

  2. B

    E397A mutant‐stimulating effect determined with parallel G4 DNA.

  3. C–E

    The typical FRET trajectories of antiparallel G4 (smG4AP‐424) unwinding catalyzed by different point mutations on E397 using single‐molecular FRET assays. The dashed magenta and red lines represent the fully folded and unfolded states of G4, respectively. The different unwinding curves represent an average of three to four independent experiments for each substrate. Black arrows in the solid style indicate the time of adding proteins.

  4. F

    Error bar plot of unwinding time of E397A, E397L, and E397H determined from unfolding curves of (C), (D), and (E), respectively (n = 3 biological replicates). The experimental conditions are described in Materials and Methods and data are presented as means ± SD.

Similar articles

Cited by

References

    1. Adams PD, Afonine PV, Bunkoczi G, Chen VB, Davis IW, Echols N, Headd JJ, Hung LW, Kapral GJ, Grosse‐Kunstleve RW et al (2010) PHENIX: a comprehensive Python‐based system for macromolecular structure solution. Acta Crystallogr D Biol Crystallogr 66: 213–221 - PMC - PubMed
    1. Andis NM, Sausen CW, Alladin A, Bochman ML (2018) The WYL domain of the PIF1 helicase from the thermophilic bacterium Thermotoga elfii is an accessory single‐stranded DNA binding module. Biochemistry 57: 1108–1118 - PubMed
    1. Barlow JN, Conrath K, Steyaert J (2009) Substrate‐dependent modulation of enzyme activity by allosteric effector antibodies. Biochem Biophys Acta 1794: 1259–1268 - PubMed
    1. Bartas M, Cutova M, Brazda V, Kaura P, Stastny J, Kolomaznik J, Coufal J, Goswami P, Cerven J, Pecinka P (2019) The presence and localization of G‐Quadruplex forming sequences in the domain of bacteria. Molecules 24: 1711 - PMC - PubMed
    1. Bharti SK, Sommers JA, George F, Kuper J, Hamon F, Shin‐ya K, Teulade‐Fichou MP, Kisker C, Brosh RM Jr (2013) Specialization among iron‐sulfur cluster helicases to resolve G‐quadruplex DNA structures that threaten genomic stability. J Biol Chem 288: 28217–28229 - PMC - PubMed

Publication types

Supplementary concepts