Abstract
G-quadruplexes (G4s) formed by guanine-rich nucleic acids induce genome instability through impeding DNA replication fork progression. G4s are stable DNA structures, the unfolding of which require the functions of DNA helicases. Pif1 helicase binds preferentially to G4 DNA and plays multiple roles in maintaining genome stability, but the mechanism by which Pif1 unfolds G4s is poorly understood. Here we report the co-crystal structure of Saccharomyces cerevisiae Pif1 (ScPif1) bound to a G4 DNA with a 5′ single-stranded DNA (ssDNA) segment. Unlike the Thermus oshimai Pif1-G4 structure, in which the 1B and 2B domains confer G4 recognition, ScPif1 recognizes G4 mainly through the wedge region in the 1A domain that contacts the 5′ most G-tetrad directly. A conserved Arg residue in the wedge is required for Okazaki fragment processing but not for mitochondrial function or for suppression of gross chromosomal rearrangements. Multiple substitutions at this position have similar effects on resolution of DNA duplexes and G4s, suggesting that ScPif1 may use the same wedge to unwind G4 and dsDNA. Our results reveal the mechanism governing dsDNA unwinding and G4 unfolding by ScPif1 helicase that can potentially be generalized to other eukaryotic Pif1 helicases and beyond.
Similar content being viewed by others
Introduction
Guanine-rich DNAs can fold into noncanonical DNA structures called G-quadruplexes (G4s). G4s are formed by the stacking of several G-tetrads, each of which consists of four guanine bases held together by Hoogsteen hydrogen bonding1. In the human genome, G4s are enriched in the promoter regions2, telomeric DNA ends3, meiotic and mitotic double-strand break (DSB) hotspots4, mitochondrial DNA deletion breakpoints5, ribosomal DNA (rDNA)6 and untranslated regions7. The highly stable G4s can impede many cellular processes such as replication8, transcription9, and translation7, and the failure to resolve G4s results in genome instability. Cells have evolved many kinds of helicases that can unfold G4 structures, including DHX3610, Pif18, XPD11, and members of the RecQ helicase family (RecQ, WRN, and BLM)12. Among these, DHX36 and Pif1 are two helicases that have been studied extensively for understanding G4 unfolding and structures of the complexes with G4 DNA have been solved. The co-crystal structure of a truncated form of bovine DHX36 bound to the parallel G4 from the c-Myc promoter showed that recognition of G4 is conferred by the N-terminal DHX36-specific motif (DSM) together with the OB-fold like subdomain13. Structural comparison together with the single-molecule FRET analysis suggests a model wherein G4 binding alone induces conformational changes of the helicase core, thereby driving G4 unfolding one nucleotide at a time13.
Pif1 helicase plays multiple roles in maintaining genome stability in both the nucleus and mitochondria. In the nucleus, Pif1 inhibits telomerase at both telomeres and DSBs14,15,16,17, processes Okazaki fragments18,19, promotes break-induced replication20, regulates rDNA replication21, and prevents replication pausing at G-quadruplex structures8,22,23, tRNA genes24,25, and R-loops26. We and others have solved several crystal structures of Pif1 in complex with either a ssDNA or a forked dsDNA including Bacteroides sp (BaPif1)27,28, ScPif129, and human Pif127,30 but how Pif1 unfolds G4 remains elusive. Recently, the crystal structure of Thermus Oshimai Pif1 (ToPif1) bound to a G4 DNA with a short ssDNA segment on both sides of the G-tetrad has been reported31. The structure shows that two ToPif1 molecules bind to 5′ and 3′ tails to form a dumbbell-shaped structure with the molecule bound to the 5′ ends seemingly being in an active unwinding state. Surprisingly, ToPif1 recognizes G4 mainly through the poorly conserved 2B domain, and although the previously identified wedge region is critical for separating the incoming strands of duplex DNA27,32,33, it has no contact with the G4. Therefore, this structure provided very little information on the mechanism by which ToPif1 unfolds G4.
To reveal the structural basis for unfolding G4 by Pif1, we determined the structure of ScPif1 in complex with a G4 DNA containing a 5′ ssDNA segment. The structure showed that the wedge region contacts the ssDNA/G4 junction, poised for unfolding G4. Predictions about molecular mechanisms from the structural data were tested directly by determining the effects of single amino acid substitutions on reaction kinetics of Pif1 in vitro and on its biological activities in vivo. This combined, systematic approach revealed that ScPif1 used a conserved wedge to unwind dsDNA and unfold G4 DNA.
Results
Structural overview
Guanine-rich oligonucleotides can fold into G4s but tend to adopt a mixture of multiple G4 conformations in solution, thereby hindering structural studies. Previous studies showed that ScPif1 binds to a parallel G4 DNA tightly but unfolds it slowly compared to an antiparallel G434. To search for a G4 substrate suitable for structural determination, we used 1D NMR to identify AT11 (Supplementary Fig. 1a and Supplementary Table 1), an anti-proliferative DNA sequence that inhibits cell growth35 that binds to a truncated ScPif1 (residues 236–753, referred as ScPif1 thereafter, Fig. 1a) in solution35. G4 structures in the cell are likely have ssDNA tails on both the 5′- and 3′-end because G4 structures are more likely to form in ssDNA than dsDNA and G4 formation destabilizes nearby duplexes36. This, combined with ScPif1’s 5′–3′ directionality led us to study the interaction of ScPif1 with a 5′-ssDNA tailed G4. The CD spectrum of the complex of ScPif1 with T7-AT11 (Supplementary Table 1) demonstrated characteristics of parallel G-quadruplex formation similar to its free form (Supplementary Fig. 1b), with the characteristic maximum at 260 nm indicative of parallel G4 formation in the presence and absence of ScPif1. This confirms that the structure of AT11 remained intact on complex formation with ScPif1. A fluorescence anisotropy binding assay showed that ScPif1 binds to AT11 in the absence (AT11-FAM) or the presence of a 5′ tail (T7-AT11-FAM) with high affinities (Supplementary Fig. 1c and Supplementary Table 1). We then went on to crystallize ScPif1 in complex with ADP·AlF4− and T7-AT11 containing a 5′ T7 ssDNA loading site (designated as ScPif1-G4) and determined its structure at a resolution of 3.5 Å (Fig. 1b). The asymmetric unit of the crystal structure contains two essentially identical ScPif1-G4 complexes. For simplicity, we only describe one ScPif1-G4 complex (chain C for ScPif1 and chain D for G4 DNA). The structure of ScPif1-G4 showed that the 5′ssDNA tail of G4 binds to the ScPif1 similar as observed in the structure of ScPif1 complexed with a ssDNA29 (Fig. 1b). The AT11 portion of the DNA substrate is well ordered in the electron density map (Supplementary Fig. 2a) and adopts a G4 conformation very similar to that of AT11 alone, indicating that ScPif1 did not disrupt the G4 structure (Supplementary Fig. 2b). Similarly, the G4 structure was also intact in the ToPif1-G4 structure31. In support of these observations, the binding of ScPif1 to the G4 DNA does not affect the G4 structure as determined by DMS foot printing34. In the structure of ScPif1-G4, the conformation of ScPif1 is very similar to that of ScPif1 bound to a ssDNA29 except for the 2B domain, which rotates 9° upon G4 binding relative to ssDNA bound ScPif1 (Supplementary Fig. 3). Similar 2B domain rotation was also reported in ToPif1 bound to a G4 DNA31.
a Schematic of the truncated ScPif1 used in this study. b Cartoon representation of the crystal structure of ScPif1-G4 with the conserved domains and motifs color-coded as in (a) (1A in light blue, 2A in wheat, 2B in light pink, 2C in gray, wedge in limon, and Pif1 signature motif (PSM) in magenta). The AT11 G4 DNA is colored in orange and the ADP·AlF4− and K+ ions are shown as blue sticks and cyan spheres, respectively. The key residue R324 involved in the interaction is shown as sticks.
Interaction of ScPif1 with G4 DNA
As shown in Fig. 1b, ScPif1 recognizes G4 mainly through the wedge region containing α5 and α6 with additional interactions contributed by the β15-β16 loop in the 2B domain. The interaction between ScPif1 and G4 buries a total solvent-accessible surface area of 528 Å2. In the wedge region, the loop connecting α5 and α6 contributes the majority of the interactions with G4. The prominent feature of the ScPif1-G4 interactions is that R324 interacts with G12 in the 5′ most G-tetrad (G9.G12.G15.G19) through cation-π interaction and with T8 (the first nucleotide preceding the 5′ most G-tetrad) through van der Waals contacts, which in turn stacks against G9 (Fig. 2a). Additional interactions involve R326 which contacts the phosphate backbone between G9 and G10 and the base of T11, and R323, which interacts with the phosphate backbone connecting T11 and G12, and L329 and R330, which interact with the base of T11 through van der Waals contacts. Consistent with the critical role of R324 and R326 in G4 binding, mutation of R324 and R326 significantly reduced the G4 unfolding activities of ScPif129. To further examine the functional role of R324, we expressed and purified R324 single mutants R324A, R324E, R324W, and R324Y. These R324 variants showed similar CD spectra to that of wild type ScPif1, indicating that these single mutations do not disrupt protein folding (Supplementary Fig. 1d). Furthermore, our binding assays showed that the R324E mutant exhibited substantially reduced binding to AT11 G4 DNA (Supplementary Fig. 1e).
a Major interaction between the wedge and AT11 G4 DNA. The wedge contacts the ssDNA-G4 junction and the 5′ most G-tetrad with the α5-α6 loop contributing the majority of the interactions with G4 DNA. b Minor interaction between 2B domain and AT11 G4 DNA. The β15-β16 loop in the 2B domain interacts solely with the peripheral region of the G4 DNA and has no contact with the G-tetrad. The residues involved in the interaction are shown as sticks and color-coded as in Fig. 1.
Compared with the wedge region, the β15-β16 loop in the 2B domain interacts solely with the peripheral region of the G4 and has no contact with the G-tetrad (Fig. 2b). Specifically, N684 is hydrogen-bonded with the base of T14 while K686 and P687 make van der Waals contacts with the base of T18, which in turn stacks against the base of G15 in the 5′ most G-tetrad. These observations strikingly differ from those reported in the structure of ToPif1-G4 complex wherein the corresponding β hairpin in ToPif1 directly interacts with the 5′ most G-tetrad31. Because the 2B domain is not well conserved across Pif1 family members (Supplementary Fig. 4) the mode of G4 recognition by ToPif1 does not appear to be conserved in other Pif1 family enzymes.
Previously, we showed that the Pif1 signature motif in BaPif1 stabilized the conformation of regions involved in ssDNA binding27. In the structure of ScPif1-G4, the Pif1 signature motif not only contacts the regions involved in ssDNA binding but also interacts with the two α-helices in the wedge region predominantly through extensive hydrophobic contacts, thereby stabilizing the wedge region (Supplementary Fig. 5). The wedge is critical for separating the incoming strands of duplex DNA so the stabilization of the wedge by the Pif1 signature motif would maintain its rigidity for exerting its helicase activity. In support of the critical role of Pif1 signature motif in stabilizing the wedge, substitution of the first 15 residues (Asp352 to Gln366) in this motif with 15 Ala residues abolished the multiple functions of ScPif1 in vivo and its ATPase activity37. Further supporting the importance of the Pif1 signature motif in supporting the wedge, a mutation resulting in a L319P substitution in the signature motif of human Pif1 is associated with a predisposition to breast cancer, and the corresponding L430P mutation in S. pombe Pfh1 lacks function in both the nucleus and mitochondria38. The equivalent I118P mutation in BaPif1 severely impairs ssDNA binding, dsDNA unwinding, and G4 unfolding, suggesting that the Pif1 signature motif plays an important role in Pif1 activity although its role is indirect as it does not contact the DNA itself but instead interacts with the wedge region27.
Structural comparison with ToPif1-G4 and DHX36-G4
Structural superposition of ScPif-G4 with ToPif1-G4 showed that the two Pif1 molecules adopted very similar conformations upon G4 binding with α5 and α6 in the wedge region of ScPif1 corresponding to the extended region and α5 in the wedge region of ToPif1, respectively (Fig. 3a). The striking difference between ScPif-G4 and ToPif1-G4 is the orientation of the parallel G4. In ScPif-G4, the 5′ most G-tetrad interacts mainly with the wedge region whereas the 5′ most G-tetrad contacts predominantly the 2B domain of ToPif131 (Fig. 3a). As such, the G4 in ScPif1 rotates ~180° with respect to the G4 in ToPif1. To gain greater insight into how helicases recognize and act upon G4 structures, we then compared the structure of ScPif1-G4 to that of the helicase DHX36, which also binds to and unwinds G4 DNA13. The crystal structure of bovine DHX36-G4 (Fig. 3b) showed that DHX36 contacts the 5′ most G-tetrad of the bound G4 mainly through its DSM13. Since DHX36 is a 3′–5′ G4 resolving DNA helicase, the interaction of DSM with G4 suggests that DSM is crucial for specific recognition of G4 but not directly involved in G4 unfolding13. This is dramatically different from the wedge region in ScPif1 wherein it is important for both G4 interaction and unfolding (Fig. 3).
a Structural superposition of ScPif1-G4 with ToPif1-G4 highlighting the conformational difference of bound G4 DNAs. The cartoon representations show ScPif1 (yellow) and ToPif1 (gray) (PDB code: 7OAR), with their wedge regions colored in lemon and red, respectively. The 2B domain of ToPif1 is colored in light pink while AT11 and c-Myc are colored in orange and cyan, respectively. b Cartoon representation of the DHX36-G4 structure (PDB code: 5VHE) with the DSM colored in blue, RecA1 in dark green, RecA2 in green, WH in violet, RL in lemon, OB domain in pink, CTE in limegreen and c-Myc G4 DNA in dark gray.
The wedge region is a conserved structural feature for unwinding G4 and dsDNA
Several Pif1 structures have been determined which all contain a wedge region positioned to interact with the incoming DNA27,28,29,30. These structures allowed us to examine whether the wedge region is conserved structurally across Pif1 family members. As shown in Fig. 4a, the wedge region in prokaryotic Pif1 helicases comprises an extended region followed by an α-helix while the wedge in eukaryotic Pif1 helicases consists of two α-helices with a short loop in between. Close examination of the wedge regions showed that R324 in ScPif1, which is critical for G4 recognition, is conserved in Pif1 from humans (R290) and Candida albicans (R455) while R324 is substituted by K87 in BaPif1 (Fig. 4b and Supplementary Fig. 4). No residue corresponding to R324 in ScPif1 can be identified in Pif1 from Thermus oshimai or Deferribacter desulfuricans (Supplementary Fig. 4).
a Structural alignment of the wedge regions of Pif1 from Saccharomyces cerevisiae, Homo sapiens, Bacteroides sp., and Thermus oshimai. The cartoon representations show the superposed wedge regions of ScPif1 (limon), HsPif1(gray) (PDB code: 5FHH), BaPif1(purple) (PDB code: 5FHE) and ToPif1(red) (PDB code: 7OAR). The wedge region in prokaryotic Pif1 helicases has an extended region followed by an α-helix while the wedge in eukaryotic Pif1 helicases contains two α-helices connected by a short loop. The key positively charged residues in the wedge regions, R324 in ScPif1, R290 in HsPif1, and K87 in BaPif1 are shown in sticks. b Structure-based sequence alignment of the wedge regions of Pif1 orthologs.
Previous studies showed that a pin-like β hairpin in other SF1B helicases RecD2 and Dda functions as a wedge for dsDNA unwinding39,40. Structural comparison of ScPif1-G4 with RecD2-ssDNA showed that the pin in RecD2 and the wedge in ScPif1 are located in similar positions and are appropriately positioned to separate incoming strands of dsDNA (Supplementary Fig 6). Given that ScPif1 has both dsDNA unwinding and G4 unfolding activities, this structural observation suggests that Pif1 may use the same wedge to unwind dsDNA and unfold G4 DNA.
The effect of the wedge mutations on dsDNA unwinding and G4 DNA unfolding
Because R324 is located in the wedge and directly contacts the 5′ most G-tetrad, we tested the importance of interactions at this site for melting duplex and G4 structures. As reported previously, ScPif1 preferentially unwinds a forked duplex (Fig. 5a, c and Supplementary Fig. 7a, b)41,42. For both forked and non-forked duplexes, ScPif1 variants that are likely to retain the ability to interact with the G-tetrad due to the ability of aromatic residues to stack with a DNA base (R324W and R324Y) retain their helicase activity13,27,39,43 (Fig. 5b, c and Supplementary Fig. 7a, b). In fact, R324W and R324Y ScPif1, which would be predicted to stack with G12 in the 5′ most G-tetrad produced more product in a single-turnover reaction than wild type ScPif1. On the other hand, R324A and R324E interfere with interactions of ScPif1 with the 5′ most G-tetrad, and these variants have minimal duplex unwinding activity in a single-turnover reaction.
a Illustration of unwinding assay. Gels show unwinding under single-turnover conditions of a non-forked duplex (b) and forked duplex (c) by wild type ScPif1. Amplitudes of product formation for duplex DNA unwinding by ScPif1 variants is plotted. d Illustration of a trapping assay to measure G4 DNA unfolding. Gels show unfolding of a AT11 (e) and c-Myc (f) G4 DNAs under multi-turnover conditions by wild type ScPif1. G4 DNA unfolding by ScPif1 variants is plotted. Rate constants for unfolding are 1.4 ± 0.3 s−1, 0.81 s−1, 1.1 ± 0.4 s−1, 0.78 ± 0.12 s−1, and 1.1 ± 0.2 s−1 for unfolding of AT11 by wild type, R324A, R324E, R324W, and R324Y Pif1, respectively. Rate constants for unfolding are 3.0 ± 0.4 s−1, 0.16 s−1, 0.33 ± 0.46 s−1, 1.8 ± 0.1 s−1, and 1.4 ± 0.2 s−1 for unfolding of c-Myc by wild type, R324A, R324E, R324W, and R324Y Pif1, respectively. Results are average and standard deviation of triplicate independent experiments except G4 DNA unfolding of R324E, which is duplicate independent experiments. Source data are provided as a Source Data file.
R324A and R324E ScPif1 also exhibit reduced activity for unfolding the AT11 G4 (Fig. 5d, e and Supplementary Fig. 7c) and the c-Myc G4 (Fig. 5f and Supplementary Fig. 7d). The reduction in helicase activity of R324A and R324E cannot be explained by differences in ATP hydrolysis as they hydrolyze ATP at a similar rate to WT ScPif1 and at a faster rate than R324W and R324Y ScPif1 (Supplementary Fig. 7e). However, unlike with dsDNA, R324W and R324Y ScPif1 also exhibit slightly reduced G4 DNA unfolding activity. Because the trapped product of the G4 DNA unfolding assay (Fig. 5d) is a non-forked 5′-overhang duplex and R324W and R324Y Pif1 unwound a similar substrate better than WT Pif1, it is not clear whether the reduced rate of product formation of R324W and R324Y relative to WT Pif1 is due to a reduction in the G4 DNA unfolding rate or a competition between G4 DNA unfolding and unwinding the dsDNA product. To address this possibility, we measured G4 DNA unfolding activity using a G4 reporter assay. Previous studies have shown that unwinding of a short duplex 3′ to the G4 DNA (Supplementary Fig. 8a) serves as a reporter for G4 DNA unfolding by Pif134,44. Importantly, the products of this reaction are ssDNA and a 3′-overhang duplex, neither of which is a Pif1 substrate. Unfolding of a c-Myc reporter substrate by WT, R324W, and R324Y ScPif1 occurred at similar rates, suggesting that the reduced rate of product formation by R324W and R324Y ScPif1 in the G4 DNA assay in Fig. 5 is due to unwinding of the product, not differences in G4 DNA unfolding rates. Thus, for both duplex substrates and both G4 substrates, R324W and R324Y function similarly to WT ScPif1 while R324A and R324E have reduced activity, indicating that interaction of the residue at 324 with the incoming DNA is important for both dsDNA unwinding and G4 DNA unfolding.
The effect of the wedge mutations on Pif1 activities in vivo
Because the R324 variants have reduced enzymatic activity but still retain some helicase activity, we tested the ability of the R324 variants to perform Pif1 functions in vivo. In yeast, ScPif1 promotes formation of long flaps that require processing by both Dna2 and FEN1 nucleases during Okazaki fragment maturation45. Deletion of Dna2 is lethal because of the failure to complete processing of these Okazaki fragments and can be rescued by deletion of Pif137. We tested the function of the R324 variants in Okazaki fragment processing by transforming plasmids expressing Pif1 variants into pif1Δ dna2Δ haploid yeast (Fig. 6a, b). Cells with a functional Pif1, including WT, R324W, R324Y, and R324K, were not viable. Although expressed at similar levels (Supplementary Fig. 9), cells expressing no Pif1 (empty vector), ATPase deficient Pif1 (K264A), R324A, R324E, and R324N were viable, indicating that dna2Δ lethality was suppressed and these variants are not functional for Okazaki fragment processing. Thus, the ScPif1 variants in which the interaction with the incoming DNA has been reduced are not functional for Okazaki fragment processing. However, the variants with a residue at 324 that can interact with the DNA are functional for Okazaki fragment processing, indicating that they are able to extend the 5′-flaps on Okazaki fragments such that cleavage by Dna2 is required. Interestingly, substitution of R324 with either another positively charged residue (R324K) or an aromatic residue (R324W or R324Y) resulted in an enzyme that could extend the 5′-flaps on Okazaki fragments, suggesting that either electrostatic or stacking interactions with the DNA by the residue at position 324 support ScPif1 function.
a, b Assay for Okazaki fragment processing. Adapted from Gao J, Proffit DR, Marecki JC, Protacio RU, Wahls WP, Byrd AK, Raney KD63. Translated and reproduced by permission of Oxford University Press. Translation Disclaimer: OUP is not responsible or in any way liable for the accuracy of the translation. The Licensee is solely responsible for the translation in this publication/reprint. a A plasmid encoding a wild type or mutant Pif1 allele was transformed into pif1Δ dna2Δ haploid cells. b Plates from the transformation of each Pif1 allele were imaged. Cells expressing wild type Pif1, R324W, R324Y, and R324K are unable to suppress the lethality of dna2Δ, thus do not grow on selective plates. ATPase deficient Pif1 K264A, and wedge-altering R324A, R324E, and R324N grow on selective plates indicating null-Pif1 function. c To test for mitochondrial function, serial dilutions of cells were plated onto SD-Trp agar plates containing 2% glucose, and in parallel onto SG-Trp agar plates containing 3% glycerol. All tested Pif1-DNA interaction variants maintain mitochondrial respiration. d Rates of gross chromosomal rearrangements (GCR) (median and 95% confidence interval) are based on fluctuation analyses of frequencies of 16 individual cultures. Source data are provided as a Source Data file.
Yeast lacking Pif1 has severe mitochondrial deficiencies and is unable to use glycerol, a non-fermentable carbon source, as the sole carbon source37. Plasmids encoding Pif1 variants were transformed into a heterozygous diploid strain (YCG59, PIF1/pif1::NatMX6, DNA2/dna2::KanMX6). After tetrad dissection, haploid cells carrying plasmids were grown on selective media containing either glucose or glycerol. Cells lacking Pif1 (Empty vector) or expressing ATPase deficient Pif1 (K264A) were unable to grow on glycerol, but all cells expressing R324 variants were viable on glycerol (Fig. 6c). This surprising result indicates that interactions of the residue at position 324 with the incoming DNA are not required for Pif1 to support respiration in the mitochondria.
ScPif1 limits gross chromosomal rearrangements (GCR) by inhibiting de novo telomere addition at double-stranded breaks15,17. GCR in yeast expressing Pif1 variants with a hxt13:URA3 cassette 7.5 kb telomeric to CAN1 on chromosome V was measured by simultaneous loss of the CAN1 and URA3 markers (Fig. 6d). GCR rates were high in cells lacking Pif1 (empty vector) or expressing ATPase deficient Pif1 (K264A). However, GCRs were suppressed in cells expressing wild type Pif1 and all of the R324 variants, suggesting that interactions of the residue at position 324 with the DNA are not necessary to suppress GCRs. Thus, a residue at position 324 that can interact with the DNA is necessary for Pif1 function in Okazaki fragment processing but not for mitochondrial respiration or suppression of gross chromosomal rearrangements. This suggests that robust helicase activity like that of wild type Pif1 is necessary for Okazaki fragment processing, but the limited helicase activity exhibited by R324A and R324E for both dsDNA and G4 DNA is sufficient for mitochondrial function and suppression of GCRs.
Discussion
Pif1 is a potent G4 resolving helicase that preferentially unfolds antiparallel G4s over parallel G4s22,34,46. Several studies have investigated G4 DNA unfolding by ScPif134,44,46. One report showed that ScPif1 uses its patrolling activity to unfold G4 structures in three discrete steps47 while another report showed that ScPif1 unfolds G4 structures sequentially in two large steps48, each of which melts one column of a quadruplex structure. Another showed that ScPif1 unfolds a G4 structure in 4–5 steps44. All reports suggest that G4 unfolding by a ScPif1 monomer involves a G3-triplex intermediate. Consistent with this finding, our ScPif1-G4 structure showed that a ScPif1 monomer recognizes and unfolds AT11 G4. This is dramatically different from dsDNA unwinding whereby Pif1 dimers are required for efficiently unwinding dsDNA41,49,50.
The structure of ToPif1-G4 showed that ToPif1 recognizes G4 predominantly through its 2B domain31. As the 2B domain is not very well conserved across Pif1 family members (Supplementary Fig. 4) and the wedge region has no contact with the bound G4, G4 recognition by ToPif1 appears to be unique to ToPif1 and cannot be broadly extended to other Pif1 family enzymes. Our ScPif1-G4 structure showed that the structurally conserved wedge region contacts the ssDNA/G4 junction with R324 playing a key role in mediating the interaction of ScPif1 with G4. Consistent with the structural observation, replacement of R324 by Ala or Glu reduced the G4 unfolding activities dramatically while substitution of R324 with an aromatic residue (Trp or Tyr) decreased the G4 unfolding moderately. Our structural and mutational data suggest a mechanism governing AT11 G4 unfolding by ScPif1 (Fig. 7). One ScPif1 molecule binds to the AT11 G4 DNA (Fig. 7a), with the wedge region contacting the ssDNA/G4 junction (Fig. 7b). The binding of the G4 DNA causes a rotational conformational change of 2B domain to contact the ribose-phosphate backbone of the first G-tetrad to further enhance the interaction between ScPif1 and G4. R324 in the wedge region is anchored to G12 via cation-π interaction and engages the first guanine base G9 in the first G-tetrad indirectly through T8 in the ssDNA region. The nature of R324 interacting with the G4 DNA suggests that it appears to be positioned to deliver the first guanine nucleotide G9 into the ssDNA binding channel coupled with ATP hydrolysis-mediated translocation (Fig. 7c). Further translocation of ScPif1 along ssDNA would move G10 to occupy the G9 position such that the G9–G10 column in the first and second G-tetrads would be stripped first to form the G3-triplex intermediate (Fig. 7d) as observed by single molecule FRET47,48. Then the remaining G-columns could be stripped one by one to complete the whole unfolding process (Fig. 7e). The G4 unfolding mechanism proposed here could be applied to unfolding of other parallel G4s (Supplementary Fig. 10a and b) as well as anti-parallel G4s (Supplementary Fig. 10c and d) by Pif1.
a The DNA sequence of T7-AT11. b AT11 is a four-layer G4 comprising of two parallel-stranded subunits connected by a central linker. One ScPif1 molecule binds to the T7-AT11 substrate DNA with the wedge contacting the ssDNA/G4 junction. c The interaction of the wedge with the first G-tetrad would deliver the first guanine nucleotide G9 into the ssDNA binding channel coupled with ATP hydrolysis-mediated translocation. d Further translocation of ScPif1 along ssDNA would move G10 to occupy the G9 position such that the G9-G10 column in the first and second G-tetrads would be stripped first to form the G3-triplex intermediate. e The remaining G-columns could be stripped one by one to complete the whole unfolding process.
Interestingly, the R324 variants unwind both forked and non-forked DNA duplexes with effects similar to those observed for the two G4 substrates AT11 and c-Myc (Fig. 5 and Supplementary Fig. 8). These observations suggest that ScPif1 unwinds dsDNA and unfolds G4 using similar mechanisms, in which the wedge region plays a key role. The actions of R324 and the Trp and Tyr variants are similar to the Phe residue of the pin region in another SF1B helicase Dda39.
Several lines of evidence suggest that prokaryotic Pif1 helicases appear to use different mechanisms for dsDNA unwinding and G4 unfolding. First, unlike the wedge in ScPif1, the wedge in prokaryotic Pif1 helicases comprises an extended region followed by an α-helix27 and no residue can be identified in prokaryotic Pif1 corresponding to R324 except for K87 in BaPif1. Second, the structure of BaPif1 bound to a forked dsDNA showed that the wedge region is located far away from the ssDNA/dsDNA junction, therefore it is not involved in dsDNA unwinding. Third, the 5′-ssDNA tail bound to BaPif1 exhibited a sharp bent conformation while a ssDNA bound to ScPif1 has no such bent conformation27,29. The bend at the ssDNA/dsDNA junction has been proposed to contribute dsRNA unwinding for BaPif150 as proposed for PcrA, UvrD, and bacterial RecQ helicases43,51,52.
Surprisingly, we found that interactions at residue 324 in ScPif1 are not necessary for respiratory proficiency or suppression of GCRs. However, these interactions at residue 324 are necessary for Pif1 function in Okazaki fragment processing. Previous reports of Pif1 variants that are functional in the mitochondria but not the nucleus either lacked a nuclear localization sequence37 or had mutations in the nuclear localization sequence53. To our knowledge, these are the first ScPif1 variants that exhibit a defect only for Okazaki fragment processing. The reasons for this are not clear, but when ScPif1 creates a long flap at an Okazaki fragment, it likely requires unwinding of >25 base pairs. The R324A and R324E variants produced very little product from a forked duplex in a single-turnover (Fig. 5), suggesting that the interaction at residue 324 with the substrate is critical for processive unwinding of a duplex, which is similar to the activity of Pif1 at Okazaki fragments.
Our ScPif1-G4 structure and mutational data allowed us to propose a mechanism governing G4 unfolding wherein a G4 DNA would be unfolded in a sequential manner (Fig. 7), resulting in G3 triplex and G2 hairpin as intermediate states as observed by single-molecule FRET experiments47,48. This mechanism could explain how both parallel and antiparallel G4 structures are unfolded (Supplementary Fig. 10). Previous studies showed that ScPif1 preferentially unfolds antiparallel G4s over parallel G4s and its activity is modulated by thermal stability and loop lengths of G4 substrates34,46. The simple explanation for slow unfolding of a G4 DNA is that ScPif1 would pass through longer G-columns as the number of tetrads, and hence stability increases, and more ATP hydrolysis events would be required to destabilize more stable structures. In support of this notion, our G4 unfolding assays confirmed that ScPif1 indeed unfolds c-Myc G4 faster than AT11 (Fig. 5e, f).
DHX36 has been studied extensively for its G4 unfolding mechanism. A crystal structure of bovine DHX36 bound to a G4 DNA showed that upon DHX36 binding, the 5′ top G-tetrad has changed from a canonical to noncanonical G-quartet caused by the shifting of 3′ most guanine of the bottom G-quartet into the 3′ ssDNA region13. However, the resolution (3.8 Å) of the DHX36-G4 structure would not allow the unambiguous assignment of nucleotides in the G4 substrate. Furthermore, structural comparison between DHX36 with bound G4 and its DNA-free state showed that DHX36 undergoes conformational changes in the C-terminal and RecA2 domains. Structural information together with the smFRET data suggested that G4 DNA binding alone induces rearrangements of the helicase core, which results in the repetitive partial unfolding of G4 with one-base translocation. However, structural superposition of DHX36-G4 with DHX36 bound to a ssDNA showed that the conformations of DHX36 in the two structures are very similar (Supplementary Fig. 11), suggesting that the conformational changes of DHX36 upon G4 binding are induced by the 3′ ssDNA tail binding rather than the G4 binding. Moreover, Yan et al. showed that G4 is stabilized by DHX36 in its nucleotide-free state and is destabilized upon ATP hydrolysis54. Altogether, these results suggest that the mechanism by which DHX36 unfolds G4 still remains elusive, and further studies are required to clarify how ATP binding and hydrolysis are coupled with G4 unfolding.
In conclusion, our studies have identified a mechanism by which ScPif1 uses a conserved wedge region to unwind dsDNA and to unfold G4 DNA. This mechanism is likely shared by other eukaryotic Pif1 helicases but might be different from that used by prokaryotic Pif1 helicases. The mechanism is strikingly different from those proposed for DHX36 and RecQ55 that harbor a G4-specific recognition motif and guanine-specific binding pocket, respectively. As many helicases both unwind dsDNA and unfold G4 DNA, it remains to be seen whether the G4 unfolding mechanism we propose here can be generalized to other helicase families with both duplex and G4 DNA unwinding activities.
Methods
Identification of AT11 as the G4 DNA substrate for ScPif1
The DNA samples were purchased from Integrated DNA Technologies and dissolved in a buffer solution containing 70 mM KCl, 20 mM potassium phosphate (pH 7.0), 10% D2O, and 20 µM DSS. DNA concentrations of 100 µM and 5 µM were used for all the NMR and CD experiments respectively. The sequences were annealed by heating the DNA samples at 95 °C for 5 min in a water bath and letting them cool down slowly to room temperature. All the NMR experiments were performed on 600 MHz Bruker AVANCE II spectrometer at 25 °C using the modified jump-and-return pulse program. DNA concentration used was 100 μM. All spectra were processed and analyzed using TopSpin 4.0.6. The circular dichroism (CD) spectra of all the DNA samples were measured with JASCO-815 spectrophotometer. The respective DNA samples with bound protein (used for NMR) were diluted to a final concentration of ~4–5 µM in the buffer solution. Subsequently, 500 µl of the diluted DNA sample was transferred to a 1 cm path-length cuvette and the CD spectrum was recorded at 25 °C with 10 accumulations. The data was baseline corrected from the signal contributed by the buffer. Each spectrum was further corrected to be zero at 320 nm wavelength.
The G4 DNA substrate T7-AT11 preparation for crystallization
The G4 DNA substrate T7-AT11 (TTTTTTTTGGTGGTGGTTGTTGTGGTGGTGGTGGT) was purchased from Integrated DNA Technologies and dissolved in a buffer containing 70 mM KCl and 20 mM potassium phosphate (pH 7.0) to achieve a final concentration of 0.1 mM. The solution was then heated to 95 °C and gradually cooled to room temperature in a water bath to induce the annealing of the substrate. Subsequently, T7-AT11 was further purified by gel filtration on a Superdex 75 column (GE Healthcare) equilibrated with 20 mM Tris-HCl, pH 7.5, 150 mM KCl, 5% glycerol, 1 mM DTT.
Protein purification and crystallization
To produce the Saccharomyces cerevisiae Pif1 (Uniprot: P07271) protein and its mutants (R324A, R324E, R324W and R324Y) for crystallization and in vitro assays, a truncated ScPif1 (residues 236–753) containing the helicase core domain was cloned into a pET28a vector with an N-terminal SUMO tag and expressed in E. coli BL21 cells.
Point mutants of ScPif1 were created in the pET28a vector using site-directed mutagenesis method.
Cells were harvested (5000 × g, 20 min, 4 °C), resuspended in buffer A (20 mM Tris-HCl, pH 7.5, 1 M NaCl, 5% glycerol, 1 mM DTT), and lysed by sonication. The lysate was cleared by centrifugation (18000 × g, 1 h, 4 °C), and the supernatant was incubated with Ni-NTA resin (GE Healthcare). Bound protein was washed with buffer A, eluted with buffer B (20 mM Tris-HCl, pH 7.5, 1 M NaCl, 5% glycerol, 1 mM DTT, 300 mM Imidazole). The SUMO tag was cleaved from the protein by Ulp protease and the protein was buffer-exchanged to buffer C (20 mM MES, pH 6.5, 150 mM NaCl, 5% glycerol, 1 mM DTT) and applied to Mono S Column (GE Healthcare), eluted with a gradient from 150 mM to 1 M NaCl and further purified by gel filtration on a Superdex 200 column (GE Healthcare) in buffer D (20 mM Tris-HCl, pH 7.5, 500 mM KCl, 5% glycerol, 1 mM DTT). The ScPif1 mutants were purified using the same protocol as that used for wild type ScPif1.
The purified ScPif1 was incubated with the T7-AT11 at a molar ratio of 1:1.2 in the presence of ATP transition-state analog ADP·AlF4−. The resulting ScPif1-G4 complex was purified by gel filtration on a Superdex 200 column (GE Healthcare) in buffer E (20 mM Tris-HCl, pH 7.5, 150 mM KCl, 5% glycerol, 1 mM DTT) and then concentrated to ~10 mg·ml−1. The purified ScPif1-G4 complex was subjected to crystallization screening at 20 °C using the sitting drop vapor diffusion method. Crystals of ScPif1-G4 were obtained in a condition consisting of 0.1 M MES pH 6.9, 22.5%(w/v) PEG 400, and 0.1 M sodium acetate.
CD spectroscopy of the wild type ScPif1 and its variants
CD spectra were measured on a Jasco J-810 spectrophotometer at a resolution of 0.1 nm, with a bandwidth 2 nm. Wavelength scans were averages of five scans between 200 nm and 300 nm collected at 20 °C in a quartz cuvette with a 1-mm path length with 5 μM proteins. Mean residue ellipticity was calculated according to the CD measurement manual.
Structure determination of ScPif1-G4
X-ray diffraction data were collected at the Australian Synchrotron beamline (MX2). The crystal structure of ScPif1-G4 complex was determined by successive rounds of molecular replacement with PHASER 3 using ScPif1 (PDB code: 5O6D) and AT11 (PDB code: 2N3M) as search models56. The final model was built to acceptable stereo-chemical values with iterative model building and refinement cycles using CCP4 Program Suite v8.0.01957 and Phenix version 1.2158. Data collection and refinement statistics are given in Table 1.
Oligonucleotides
Oligonucleotides (Supplementary Table 1) for kinetic assays were purchased from Integrated DNA Technologies and resuspended in 10 mM Tris pH 7.5, 1 mM EDTA. Labeled strands (1 μM) were mixed with 1.2 μM unlabeled complementary strands (if applicable) in 10 mM Tris pH 7.5, 1 mM EDTA, 100 mM KCl. Samples were heated to 95 °C for 10 min and slowly cooled to room temperature to allow folding of duplex and G4 structures.
Fluorescence anisotropy binding assay
The measurements were carried out using a plate reader (Tecan) at 20 °C. FAM-labeled AT11 was added to each reaction at a fixed concentration of 10 nM with different concentrations of wild type ScPif1 or its mutants in 100 μl buffer containing 20 mM HEPES-NaOH pH 7.5, 100 mM NaCl, 10% Glycerol. Prior to measurement, the reaction mixtures were equilibrated for 30 min. Triplicate measurements were performed for each titration point, with 485 nm and 535 nm as the excitation and emission wavelength, respectively. The data were analyzed using the quadratic binding equation59 with Prism 9 (GraphPad).
Duplex DNA unwinding
All concentrations listed are final, after mixing. Substrate (20 nM) and Pif1 (200 nM) were pre-incubated at 25 °C for 5 min in assay buffer (25 mM HEPES pH 7.5, 50 mM KCl, 2 mM β-mercaptoethanol, 0.1 mM EDTA, 0.1 mg/mL BSA). In another tube, 5 mM ATP, 10 mM MgCl2, 600 nM duplex trap complementary to the displaced strand, and 20 μM T50 were mixed in assay buffer and pre-incubated at 25 °C for 5 min. Single-turnover unwinding reactions were initiated by mixing the enzyme/DNA solution with the ATP/MgCl2 solution. T50 was added with the ATP as a protein trap to prevent Pif1 from rebinding to the substrate after dissociation. Samples were removed and quenched with 200 mM EDTA, 0.6% sodium dodecyl sulfate, 0.1% Orange G, and 6% glycerol. Samples were separated by 20% native PAGE and visualized using an Amersham Typhoon RGB imager and quantified to determine the fraction ssDNA product.
Trapping assay for G4 DNA unfolding
All concentrations listed are final, after mixing. Substrate (20 nM) and Pif1 (200 nM) were mixed in assay buffer (25 mM HEPES pH 7.5, 50 mM KCl, 2 mM β-mercaptoethanol, 0.1 mM EDTA, 0.1 mg/mL BSA) immediately before addition of a solution containing 5 mM ATP, 10 mM MgCl2, and 600 nM trap complementary to the G4 forming region. Samples were removed and quenched with 200 mM EDTA, 0.6% sodium dodecyl sulfate, 0.1% Orange G, 6% glycerol, 4 μM T50, and 20 μM C trap complementary to the trapping strand to prevent the trapping strand from spontaneously unfolding the G4 structure after quenching the reaction. Samples were separated by 20% native PAGE, visualized using an Amersham Typhoon RGB imager, and the fraction ssDNA product at each time point was determined. Data were fit to a single exponential function using Prism 9 (GraphPad).
Reporter assay for G4 DNA unfolding
All concentrations listed are final, after mixing. Substrate was prepared with a 5′-ssDNA overhang c-Myc G4 followed by a 3′-tailed 12 base pair duplex to serve as a reporter for G4 DNA unfolding (Supplementary Table 1 and Supplementary Fig. 8a)34. Substrate (20 nM) and Pif1 (200 nM) were mixed in assay buffer (25 mM HEPES pH 7.5, 50 mM KCl, 2 mM β-mercaptoethanol, 0.1 mM EDTA, 0.1 mg/mL BSA) immediately before addition of a solution containing 5 mM ATP, 10 mM MgCl2, and 600 nM duplex trap complementary to the displaced strand. Samples were removed and quenched with 200 mM EDTA, 0.6% sodium dodecyl sulfate, 0.1% Orange G, 6% glycerol, and 4 μM T50. Samples were separated by 20% native PAGE, visualized using an Amersham Typhoon RGB imager, and the fraction ssDNA product at each time point was determined. Data were fit to a single exponential function using Prism 9 (GraphPad).
Yeast strains and plasmids
S. cerevisiae strains for measuring Pif1 function in vivo were kind gifts from Dr. Virginia Zakian: MBY77 (YPH500, hxt13:URA3, pif1::HIS3MX6) and YCG59 (W303, PIF1/pif1::NatMX6, DNA2/dna2::KanMX6)37. The haploid strain G0339 (W303, pif1::NatMX6, dna2::KanMX6) was generated by tetrad dissection of YCG59 asci. The yeast strains are described in Supplementary Table 4.
Plasmids for expression of full-length ScPif1 in S. cerevisiae under control of the PIF1 promoter were gifts from Dr. Virginia Zakian: pMB13 (pRS414 empty vector), pCG17 (pRS414-Pif1wt-3xFLAG), and pCG18 (pCG17-Pif1-K264A)37. Site-directed mutagenesis was used to create plasmids encoding each of the Pif1 variants (Supplementary Tables 3).
Okazaki fragment processing
Pif1 function in Okazaki fragment processing was measured as suppression of dna2Δ lethality as previously described37. Each pCG17-serial plasmid encoding Pif1 (WT or mutant) or a pMB13 empty vector control (1 µg) was transformed into haploid strain G0339 (W303, pif1::NatMX6, dna2::KanMX6) by the lithium-acetate-method60. Approximately 1.5 × 108 early-log-phase cells were transformed without carrier-DNA and plated onto SD-Trp medium. After incubation at 28 °C for 5 days, plates were imaged. Because loss of Pif1 function suppresses the lethality of dna2Δ, growth of colonies indicated that the Pif1 mutant was a null allele in this assay.
Mitochondrial function
Mitochondrial function was assessed by monitoring the ability of colonies to grow on plates containing glycerol. Each pCG17-serial plasmid encoding Pif1 (WT or mutant) or an empty vector control (pMB13) was transformed into a heterozygous diploid strain YCG59 (PIF1/pif1::NatMX6, DNA2/dna2::KanMX6)37 by the lithium-acetate-method. After sporulation and tetrad dissection, spore colonies were genotyped. Haploid DNA2+ pif1::NatMX6 spores carrying plasmids were selected by their KanMXS NatMXR Trp+ phenotype. These selected haploid cells were grown at 28 °C overnight in SD-Trp (10 mL) to mid-log phase. Approximately 1.2 × 107 cells were diluted to 200 µL using water in a 96-well microplate and diluted with 10-fold dilutions from 100 to 10−5 using sterile water. Approximately 3 µL of each dilution was spotted onto SD-Trp agar plates containing 2% glucose or 3% glycerol. Glucose plates were grown at 28 °C for 3 days, and glycerol plates were grown for 5 days.
Gross chromosomal rearrangement (GCR)
Experiments were performed as described previously37. Briefly, two independent isolates of each Pif1 allele were separately streaked onto SD-Trp agar plates and grown at 28 °C for 4 days. Eight clones of each isolate were separately inoculated in 5 ml of SD-Trp and grown at 28 °C for 3 days. Dilutions of each culture (50 μl of a 1:10,000 dilution) were plated on non-selective SD-Trp plates and incubated at 28 °C for 3 days. GCRs were identified on SD-Trp with 1 mg/mL 5-fluoroorotic acid and 50 µg/mL Canavanine Sulfate (SD-Trp + FOA + CAN) plates. Selective plates had 1 ml of each culture pelleted and resuspended in 100 μl sterile H2O before plating and incubating at 28 °C for 7 days. GCR rates were determined using the method of median and 95% confidence intervals were calculated by FluCalc version 2019.4.161. At least 16 individual cultures were used for each rate determination.
Western blotting
Total protein was extracted from S. cerevisiae using a procedure previously optimized for yeast62. Briefly, yeast strains carrying pCG17-serial plasmids that encode WT or mutant PIF1 alleles or pMB13 (empty vector control) were grown in SD-Trp + Ade (10 ml) at 28 °C overnight. After pelleting, cells were resuspended in 600 µL of alkaline lysis buffer (0.1 M NaOH, 50 mM EDTA, 2% SDS, 2% β-mercaptoethanol) and incubated for 10 min at 90 °C. Samples were neutralized by addition of 15 µL of 4 M acetic acid to each sample, mixed by vortexing for 30 s, and incubated for 10 min at 90 °C. The concentration of protein in each lysate was measured using a Coomassie Plus Protein Assay Kit (Pierce). Proteins (18 µg) were separated by 8% SDS-PAGE and transferred to a 0.45 μm nitrocellulose membrane at 4 °C. The membrane was initially stained with 0.11% Ponceau S in 5% acetic acid, and total protein was imaged on a Bio-Rad ChemiDoc MP Imaging system. The Ponceau was removed by washing twice in TBS-T before blocking with 3% BSA in TBS-T for 2 h at room temperature. The membrane was incubated with mouse anti-FLAG M2 (Sigma F1804; 1:1000) in TBS-T with 3% BSA overnight at 4 °C. After washing, the membrane was incubated with HRP-labeled goat anti-mouse IgG (PerkinElmer EF822001EA, 1:10,000) in TBS-T with 3% BSA for 2 h at room temperature. After washing, the membrane was incubated with Amersham ECL Prime Western Blotting Detection Reagents (Cytiva RPN2232). The membrane was imaged using a Bio-Rad ChemiDoc MP Imaging system.
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Data availability
The atomic coordinates and structural factors for the ScPif1-G4 complex have been deposited with the Protein Data Bank under accession code 8XAK. The atomic models used in this study are available in the Protein Data Bank under accession codes 7OAR, 5VHE, 5FHH, 5FHE, 5O6B, 3GPL, 5N8S, 5O6D, 2N3M. Source data are provided with this paper.
References
Hansel-Hertsch, R., Di Antonio, M. & Balasubramanian, S. DNA G-quadruplexes in the human genome: detection, functions and therapeutic potential. Nat. Rev. Mol. Cell Biol. 18, 279–284 (2017).
Huppert, J. L. & Balasubramanian, S. G-quadruplexes in promoters throughout the human genome. Nucleic Acids Res. 35, 406–413 (2007).
Lipps, H. J. & Rhodes, D. G-quadruplex structures: in vivo evidence and function. Trends Cell Biol. 19, 414–422 (2009).
Capra, J. A., Paeschke, K., Singh, M. & Zakian, V. A. G-quadruplex DNA sequences are evolutionarily conserved and associated with distinct genomic features in Saccharomyces cerevisiae. PLoS Comput. Biol. 6, e1000861 (2010).
Bharti, S. K. et al. DNA sequences proximal to human mitochondrial DNA deletion breakpoints prevalent in human disease form G-quadruplexes, a class of DNA structures inefficiently unwound by the mitochondrial replicative Twinkle helicase. J. Biol. Chem. 289, 29975–29993 (2014).
Hanakahi, L. A., Sun, H. & Maizels, N. High affinity interactions of nucleolin with G-G-paired rDNA. J. Biol. Chem. 274, 15908–15912 (1999).
Kumari, S., Bugaut, A., Huppert, J. L. & Balasubramanian, S. An RNA G-quadruplex in the 5′ UTR of the NRAS proto-oncogene modulates translation. Nat. Chem. Biol. 3, 218–221 (2007).
Paeschke, K., Capra, J. A. & Zakian, V. A. DNA replication through G-quadruplex motifs is promoted by the Saccharomyces cerevisiae Pif1 DNA helicase. Cell 145, 678–691 (2011).
Siddiqui-Jain, A., Grand, C. L., Bearss, D. J. & Hurley, L. H. Direct evidence for a G-quadruplex in a promoter region and its targeting with a small molecule to repress c-MYC transcription. Proc. Natl Acad. Sci. USA 99, 11593–11598 (2002).
Vaughn, J. P. et al. The DEXH protein product of the DHX36 gene is the major source of tetramolecular quadruplex G4-DNA resolving activity in HeLa cell lysates. J. Biol. Chem. 280, 38117–38120 (2005).
Gray, L. T., Vallur, A. C., Eddy, J. & Maizels, N. G quadruplexes are genomewide targets of transcriptional helicases XPB and XPD. Nat. Chem. Biol. 10, 313–318 (2014).
Mendoza, O., Bourdoncle, A., Boule, J. B., Brosh, R. M. Jr. & Mergny, J. L. G-quadruplexes and helicases. Nucleic Acids Res. 44, 1989–2006 (2016).
Chen, M. C. et al. Structural basis of G-quadruplex unfolding by the DEAH/RHA helicase DHX36. Nature 558, 465–469 (2018).
Boule, J. B., Vega, L. R. & Zakian, V. A. The yeast Pif1p helicase removes telomerase from telomeric DNA. Nature 438, 57–61 (2005).
Schulz, V. P. & Zakian, V. A. The saccharomyces PIF1 DNA helicase inhibits telomere elongation and de novo telomere formation. Cell 76, 145–155 (1994).
Zhou, J., Monson, E. K., Teng, S. C., Schulz, V. P. & Zakian, V. A. Pif1p helicase, a catalytic inhibitor of telomerase in yeast. Science 289, 771–774 (2000).
Myung, K., Chen, C. & Kolodner, R. D. Multiple pathways cooperate in the suppression of genome instability in Saccharomyces cerevisiae. Nature 411, 1073–1076 (2001).
Boule, J. B. & Zakian, V. A. Roles of Pif1-like helicases in the maintenance of genomic stability. Nucleic Acids Res. 34, 4147–4153 (2006).
Bochman, M. L., Sabouri, N. & Zakian, V. A. Unwinding the functions of the Pif1 family helicases. DNA Repair 9, 237–249 (2010).
Wilson, M. A. et al. Pif1 helicase and Poldelta promote recombination-coupled DNA synthesis via bubble migration. Nature 502, 393–396 (2013).
Ivessa, A. S., Zhou, J. Q. & Zakian, V. A. The Saccharomyces Pif1p DNA helicase and the highly related Rrm3p have opposite effects on replication fork progression in ribosomal DNA. Cell 100, 479–489 (2000).
Paeschke, K. et al. Pif1 family helicases suppress genome instability at G-quadruplex motifs. Nature 497, 458–462 (2013).
Lopes, J. et al. G-quadruplex-induced instability during leading-strand replication. EMBO J. 30, 4033–4046 (2011).
Tran, P. L. T. et al. PIF1 family DNA helicases suppress R-loop mediated genome instability at tRNA genes. Nat. Commun. 8, 15025 (2017).
Osmundson, J. S., Kumar, J., Yeung, R. & Smith, D. J. Pif1-family helicases cooperatively suppress widespread replication-fork arrest at tRNA genes. Nat. Struct. Mol. Biol. 24, 162–170 (2017).
Kumar, C., Batra, S., Griffith, J. D. & Remus, D. The interplay of RNA:DNA hybrid structure and G-quadruplexes determines the outcome of R-loop-replisome collisions. Elife 10, e72286 (2021).
Zhou, X. et al. Structural and Functional Insights into the Unwinding Mechanism of Bacteroides sp Pif1. Cell Rep. 14, 2030–2039 (2016).
Liu, N. N. et al. The Bacteroides sp. 3_1_23 Pif1 protein is a multifunctional helicase. Nucleic Acids Res. 43, 8942–8954 (2015).
Lu, K. Y. et al. Insights into the structural and mechanistic basis of multifunctional S. cerevisiae Pif1p helicase. Nucleic Acids Res. 46, 1486–1500 (2018).
Dehghani-Tafti, S., Levdikov, V., Antson, A. A., Bax, B. & Sanders, C. M. Structural and functional analysis of the nucleotide and DNA binding activities of the human PIF1 helicase. Nucleic Acids Res. 47, 3208–3222 (2019).
Dai, Y. X. et al. Structural mechanism underpinning Thermus oshimai Pif1-mediated G-quadruplex unfolding. EMBO Rep. 23, e53874 (2022).
Byrd, A. K. & Raney, K. D. Structure and function of Pif1 helicase. Biochem Soc. Trans. 45, 1159–1171 (2017).
Raney, K. D., Byrd, A. K. & Aarattuthodiyil, S. Structure and mechanisms of SF1 DNA helicases. Adv. Exp. Med. Biol. 767, 17–46 (2013).
Byrd, A. K. & Raney, K. D. A parallel quadruplex DNA is bound tightly but unfolded slowly by pif1 helicase. J. Biol. Chem. 290, 6482–6494 (2015).
Do, N. Q., Chung, W. J., Truong, T. H. A., Heddi, B. & Phan, A. T. G-quadruplex structure of an anti-proliferative DNA sequence. Nucleic Acids Res. 45, 7487–7493 (2017).
Konig, S. L., Huppert, J. L., Sigel, R. K. & Evans, A. C. Distance-dependent duplex DNA destabilization proximal to G-quadruplex/i-motif sequences. Nucleic Acids Res. 41, 7453–7461 (2013).
Geronimo, C. L., Singh, S. P., Galletto, R. & Zakian, V. A. The signature motif of the Saccharomyces cerevisiae Pif1 DNA helicase is essential in vivo for mitochondrial and nuclear functions and in vitro for ATPase activity. Nucleic Acids Res. 46, 8357–8370 (2018).
Chisholm, K. M. et al. A genomewide screen for suppressors of Alu-mediated rearrangements reveals a role for PIF1. PLoS One 7, e30748 (2012).
He, X. et al. The T4 phage SF1B helicase Dda is structurally optimized to perform DNA strand separation. Structure 20, 1189–1200 (2012).
Saikrishnan, K., Powell, B., Cook, N. J., Webb, M. R. & Wigley, D. B. Mechanistic basis of 5’-3’ translocation in SF1B helicases. Cell 137, 849–859 (2009).
Lahaye, A., Leterme, S. & Foury, F. PIF1 DNA helicase from Saccharomyces cerevisiae. Biochemical characterization of the enzyme. J. Biol. Chem. 268, 26155–26161 (1993).
Ramanagoudr-Bhojappa, R., Blair, L. P., Tackett, A. J. & Raney, K. D. Physical and functional interaction between yeast Pif1 helicase and Rim1 single-stranded DNA binding protein. Nucleic Acids Res. 41, 1029–1046 (2013).
Velankar, S. S., Soultanas, P., Dillingham, M. S., Subramanya, H. S. & Wigley, D. B. Crystal structures of complexes of PcrA DNA helicase with a DNA substrate indicate an inchworm mechanism. Cell 97, 75–84 (1999).
Byrd, A. K., Bell, M. R. & Raney, K. D. Pif1 helicase unfolding of G-quadruplex DNA is highly dependent on sequence and reaction conditions. J. Biol. Chem. 293, 17792–17802 (2018).
Rossi, M. L. et al. Pif1 helicase directs eukaryotic Okazaki fragments toward the two-nuclease cleavage pathway for primer removal. J. Biol. Chem. 283, 27483–27493 (2008).
Wang, L., Wang, Q. M., Wang, Y. R., Xi, X. G. & Hou, X. M. DNA-unwinding activity of Saccharomyces cerevisiae Pif1 is modulated by thermal stability, folding conformation, and loop lengths of G-quadruplex DNA. J. Biol. Chem. 293, 18504–18513 (2018).
Zhou, R., Zhang, J., Bochman, M. L., Zakian, V. A. & Ha, T. Periodic DNA patrolling underlies diverse functions of Pif1 on R-loops and G-rich DNA. Elife 3, e02190 (2014).
Hou, X. M. et al. Molecular mechanism of G-quadruplex unwinding helicase: sequential and repetitive unfolding of G-quadruplex by Pif1 helicase. Biochem. J. 466, 189–199 (2015).
Singh, S. P., Koc, K. N., Stodola, J. L. & Galletto, R. A monomer of Pif1 unwinds double-stranded DNA and it is regulated by the nature of the non-translocating strand at the 3’-end. J. Mol. Biol. 428, 1053–1067 (2016).
Su, N. et al. Structural basis for DNA unwinding at forked dsDNA by two coordinating Pif1 helicases. Nat. Commun. 10, 5375 (2019).
Lee, J. Y. & Yang, W. UvrD helicase unwinds DNA one base pair at a time by a two-part power stroke. Cell 127, 1349–1360 (2006).
Manthei, K. A., Hill, M. C., Burke, J. E., Butcher, S. E. & Keck, J. L. Structural mechanisms of DNA binding and unwinding in bacterial RecQ helicases. Proc. Natl Acad. Sci. USA 112, 4292–4297 (2015).
Lee, R. S. et al. Identification of the nuclear localization signal in the Saccharomyces cerevisiae Pif1 DNA helicase. PLoS Genet. 19, e1010853 (2023).
You, H., Lattmann, S., Rhodes, D. & Yan, J. RHAU helicase stabilizes G4 in its nucleotide-free state and destabilizes G4 upon ATP hydrolysis. Nucleic Acids Res. 45, 206–214 (2017).
Voter, A. F., Qiu, Y., Tippana, R., Myong, S. & Keck, J. L. A guanine-flipping and sequestration mechanism for G-quadruplex unwinding by RecQ helicases. Nat. Commun. 9, 4201 (2018).
McCoy, A. J. et al. Phaser crystallographic software. J. Appl. Crystallogr. 40, 658–674 (2007).
Winn, M. D. et al. Overview of the CCP4 suite and current developments. Acta Crystallogr. D Biol. Crystallogr. 67, 235–242 (2011).
Adams, P. D. et al. PHENIX: a comprehensive Python-based system for macromolecular structure solution. Acta Crystallogr. D Biol. Crystallogr. 66, 213–221 (2010).
Jarmoskaite, I., AlSadhan, I., Vaidyanathan, P. P. & Herschlag, D. How to measure and evaluate binding affinities. Elife 9, e57264 (2020).
Gietz, R. D. & Woods, R. A. Transformation of yeast by lithium acetate/single-stranded carrier DNA/polyethylene glycol method. Methods Enzymol. 350, 87–96 (2002).
Radchenko, E. A., McGinty, R. J., Aksenova, A. Y., Neil, A. J. & Mirkin, S. M. Quantitative analysis of the rates for repeat-mediated genome instability in a yeast experimental system. Methods Mol. Biol. 1672, 421–438 (2018).
von der Haar, T. Optimized protein extraction for quantitative proteomics of yeasts. PLoS One 2, e1078 (2007).
Gao, J. et al. Two residues in the DNA binding site of Pif1 helicase are essential for nuclear functions but dispensable for mitochondrial respiratory growth. Nucleic Acids Res. 52, 6543–6557 (2024).
Acknowledgements
We would like to thank the beamline scientists at MX2 of the Australian Synchrotron Radiation Facility for assistance of data collection. This work was supported by grants from the US National Institutes of Health (P20GM121293; R35GM122601 to K.D.R; and GM145834 to W.P.W. and the Agency for Science, Technology and Research in Singapore.
Author information
Authors and Affiliations
Contributions
H.S., A.K.B., and K.D.R. conceived and supervised the project. Z.H. and V.Q.T. performed protein purification, crystallization, structure determination, and DNA binding experiments. P.D. performed NMR and CD spectroscopic experiments. A.K.B., E.G.M., B.O., J.G., and J.C.M. performed the in vitro biochemistry experiments. J.G. and R.U.P. constructed yeast strains. J.G. performed the in vivo experiments. W.P.W. analyzed the in vitro biochemical data. H.S., A.K.B., and K.D.R. wrote the manuscript. All authors contributed to data interpretation and editing the manuscript.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Nature Communications thanks Ci Ji Lim and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Source data
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Hong, Z., Byrd, A.K., Gao, J. et al. Eukaryotic Pif1 helicase unwinds G-quadruplex and dsDNA using a conserved wedge. Nat Commun 15, 6104 (2024). https://doi.org/10.1038/s41467-024-50575-8
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41467-024-50575-8
This article is cited by
-
Visualization of liquid-liquid phase transitions using a tiny G-quadruplex binding protein
Nature Communications (2025)