Engineered transcription-associated Cas9 targeting in eukaryotic cells

Goldberg, Gregory W.; Kogenaru, Manjunatha; Keegan, Sarah; Haase, Max A. B.; Kagermazova, Larisa; Arias, Mauricio A.; Onyebeke, Kenenna; Adams, Samantha; Beyer, Daniel K.; Fenyö, David; Noyes, Marcus B.; Boeke, Jef D.

doi:10.1038/s41467-024-54629-9

Download PDF

Article
Open access
Published: 27 November 2024

Engineered transcription-associated Cas9 targeting in eukaryotic cells

Nature Communications volume 15, Article number: 10287 (2024) Cite this article

9726 Accesses
1 Citations
11 Altmetric
Metrics details

Subjects

Abstract

DNA targeting Class 2 CRISPR-Cas effector nucleases, including the well-studied Cas9 proteins, evolved protospacer-adjacent motif (PAM) and guide RNA interactions that sequentially license their binding and cleavage activities at protospacer target sites. Both interactions are nucleic acid sequence specific but function constitutively; thus, they provide intrinsic spatial control over DNA targeting activities but naturally lack temporal control. Here we show that engineered Cas9 fusion proteins which bind to nascent RNAs near a protospacer can facilitate spatiotemporal coupling between transcription and DNA targeting at that protospacer: Transcription-associated Cas9 Targeting (TraCT). Engineered TraCT is enabled in eukaryotic yeast or human cells when suboptimal PAM interactions limit basal activity and when one or more nascent RNA substrates are still tethered to the actively transcribed target DNA in cis. Using yeast, we further show that this phenomenon can be applied for selective editing at one of two identical targets in distinct gene loci, or, in diploid allelic loci that are differentially transcribed. Our work demonstrates that temporal control over Cas9’s targeting activity at specific DNA sites may be engineered without modifying Cas9’s core domains and guide RNA components or their expression levels. More broadly, it establishes co-transcriptional RNA binding as a cis-acting mechanism that can conditionally stimulate CRISPR-Cas DNA targeting in eukaryotic cells.

Visualization of a multi-turnover Cas9 after product release

Article Open access 01 July 2025

CRISPR–Cas9 bends and twists DNA to read its sequence

Article 14 April 2022

DNA targeting by compact Cas9d and its resurrected ancestor

Article Open access 07 January 2025

Introduction

Class 2 CRISPR-associated (Cas) effectors such as Cas9 are vastly simplifying (epi)genome editing applications in biomedical research, owing to their impressive DNA targeting specificity, facile guide RNA programmability, and tractable single-protein architecture^1,2,3,4,5. In complex with guide RNAs, Cas9 proteins constitutively bind and cleave DNA at target sites known as protospacers^6,7,8. Binding and cleavage occur sequentially through a coupled mechanism⁹ initiated by direct protein-DNA interactions at protospacer-adjacent motifs (PAMs)^10,11, for example, ‘NGG’ sequences¹² recognized by the model Cas9 protein from Streptococcus pyogenes SF370. Following PAM recognition, Cas9 forms an R-loop and cleaves its target once sufficient base-pairing is achieved between its guide RNA and protospacer DNA^{8,9,13,14,15,16,17}. Mechanistic coupling between PAM and guide RNA interactions enforces sequence-specific spatial control over Cas9’s DNA targeting activities; i.e., even perfectly complementary guides cannot license binding or cleavage at protospacers that lack a functional PAM. Furthermore, suboptimal PAM (subPAM) interactions may license intermediate activities, as seen with the NAG and NGA PAMs of S. pyogenes Cas9 that are recognized less strongly than NGG^12,18,19.

Beyond Cas9’s broad utility as a tool for constitutive DNA binding and cleavage, certain applications in cells demand dynamically modulated activity that can be controlled over time by an experimentalist or via an endogenous signal. Temporal control was previously engineered by inducing Cas9 expression²⁰, as well as by modifying Cas9’s guide RNA or protein components to ensure that they are held inactive until post-transcriptionally or post-translationally induced by chemical^21,22,23,24, optical^25,26,27,28, or enzymatic²⁹ inputs. Engineered Cas9 ‘pro-enzymes’ were further shown to respond to intracellularly expressed proteases via proteolytic activation²⁹, and the activity of engineered guide RNA components or their loading into Cas9 can be controlled by base-pairing interactions with intracellular RNAs^30,31,32. Spatiotemporal coupling between endogenous signals and CRISPR-Cas DNA targeting has also been described in natural systems. The type III-A CRISPR-Cas systems encode multi-protein effector complexes³ that naturally depend on co-transcriptional binding of nascent RNAs to license DNA degradation^33,34,35. Nascent RNA binding simultaneously localizes the type III-A complex near target DNA and activates its DNase activity^34,35, thereby ensuring that transcribed DNA targets are degraded in cis while identical but repressed protospacers may be conditionally tolerated in prokaryotes^36,37,38,39. It is not known whether nascent RNA binding can be exploited to control DNA targeting by other CRISPR-Cas effectors. We investigate this in the present study by engineering Cas9 fusions that can bind to nascent RNA substrates transcribed near a protospacer in cis, thus partially emulating the type III-A targeting mechanism and forming the basis for Transcription-associated Cas9 Targeting (TraCT) in eukaryotic cells. subPAM interactions are employed to ensure that Cas9’s basal DNA targeting activity in vivo does not mask the transcription-associated activity conferred by nascent RNA binding.

Results

Dynamic ligand-controlled rescue of subPAM interactions

Prior to investigating TraCT we first sought to demonstrate that Cas9 targeting can be dynamically controlled without modulating the functionality or expression of its core protein domains and single guide RNA (sgRNA) components. To this end, we reasoned we could engineer Cas9 fusions that conditionally interact with cis elements in proximity to protospacers and thereby stimulate targeting when subPAM interactions limit basal activity. Target site editing was assayed using a single-strand annealing^40,41 (SSA) based reporter system¹⁹ wherein protospacer cleavage drives scarless reconstitution of a disrupted LYS2 open reading frame (ORF) in Saccharomyces cerevisiae yeast (Fig. 1a). Throughout the study, this system enabled isogenic testing of various protospacer-proximal cis elements inserted between direct repeats without requiring exogenous repair templates. It was previously shown that fusion to strong zinc finger (ZF) DNA binding domains can constitutively increase Cas9 editing at protospacers with subPAMs when a ZF binding site is located nearby⁴². It was furthermore shown, in the same work⁴², that ZF-DNA interactions can compensate for suboptimal activity on NGG PAMs when Cas9’s PAM-interacting domain⁴³ (PID) was genetically attenuated with a single substitution known as ‘MT3’ (Cas9^MT3). Although ZFs can now be designed to recognize nearly any DNA sequence⁴⁴, they do not confer temporal control over Cas9’s activity. We therefore initially tested a single tetO2 DNA element, the target substrate for wild-type (Wt) TetR and reverse TetR (RevTetR) DNA binding proteins that can be dynamically induced with doxycycline (Dox) to dissociate from or bind tetO2, respectively⁴⁵. The (Rev)TetR proteins function as homodimers, so we fused tandem monomers to the C-terminus of Cas9 in accordance with a previously described tandem single-chain (sc) design⁴⁶, generating Cas9-scTetR and Cas9-scRevTetR fusions. These were constructed from an isogenic scaffold that included N- and C-terminal nuclear localization signal (NLS) tags and also served as our ‘native’ Cas9 control throughout the work (Supplementary Fig. 1a; Supplementary Note 1). Our heterologous protospacer insertion contained an NGG PAM that allowed us to compare editing activities for three PID variants that recognize NGG at different strengths: Wt Cas9 (Cas9^Wt), Cas9^MT3, and SpCas9-NG with multiple PID substitutions⁴⁷ (henceforth, ‘NG’ or Cas9^NG). As expected, editing with native Cas9 was unaffected by Dox and consistently lower with the MT3 and NG variants compared to Wt (Fig. 1b). In contrast, Cas9-scTetR fusions achieved similarly high levels of editing with all three PID variants in the absence of Dox, though editing with MT3 and NG was reduced to background levels when Dox was present (Fig. 1b). This phenotype was qualitatively reversed when the scTetR domain was replaced with either V10 or V16 variants⁴⁵ of scRevTetR: in this context, maximal editing by MT3 and NG was achieved in the presence of Dox (Fig. 1c). All Dox-controllable phenotypes strictly required the nearby tetO2 element (Supplementary Fig. 1b).

**Fig. 1: Cas9 activity can be controlled by conditional interactions between auxiliary fusion domains and protospacer-proximal *cis* elements.**

To confirm that cleavage by Cas9^NG fusions is stimulated on physiological timescales we set up a liquid induction assay that allowed us to monitor growth while controlling the scRevTetR(V16) domain with Dox. We used modified yeast strains lacking a downstream SSA repeat or other endogenous repair templates that would rescue cells from persistent protospacer cleavage, thus ensuring a prolonged DNA damage response that slows growth upon targeting. Dox-dependent growth reductions required both the scRevTetR domain and the tetO2 element (Supplementary Figs. 2a–e). Importantly, when compared with non-inducible controls, this Dox-dependent growth reduction was significantly detectable 260 min post-induction (Supplementary Fig. 2f). Assuming our apparent plate reader doubling time of ~90 min (Supplementary Fig. 2e), this indicates that growth was indeed reduced for a substantial portion of the population in under three generations.

mRNA binding by Cas9 fusions can facilitate TraCT

Having found that Cas9-sc(Rev)TetR fusions allow for dynamic control over DNA targeting activity when combined with subPAM interactions and a nearby tetO2 element, we next sought to test whether binding DNA-tethered nascent RNA in cis could supplant direct DNA binding at tetO2. This was first accomplished with well-described MS2 phage parts: RNA hairpins containing either Uracil (MS2^U) or Cytosine (MS2^C) at a crucial position in their loop^48,49,50, and MS2 coat protein (MCP) variants that can bind those sites co- transcriptionally^50,51,52. MCP binds to individual MS2 hairpins as a homodimer^49,53,54 so we again fused single-chain tandem monomers, generating Cas9-scMCP (Fig. 1d; Supplementary Fig. 1a). We first tested reporter strains with two Hairpin Templates (HTs) either upstream or downstream of the protospacer (Supplementary Note 2). It was unclear whether the higher-affinity MS2^C loops^48,49,50 would be preferred over Wt MS2^U loops for our purposes, so we used one of each. This yielded a small but significant increase in editing by PID mutants of Cas9-scMCP compared to native Cas9, but only with hairpins placed upstream (Fig. 1e). To probe whether alternative upstream configurations could improve activity we tested single and double MS2^U loops, or MS2^C loops. Editing activities were comparable but we obtained lower p values with MS2^U loops (Supplementary Fig. 3a) so proceeded with one or two MS2^U HTs in all subsequent experiments. Cas9^Wt and Cas9^Wt-scMCP controls displayed indistinguishable editing in all strains as expected (Supplementary Fig. 3b).

We then tested the phage lambda (λ) N-boxB system⁵⁵, which is orthogonal to MS2⁵⁶. Individual boxB RNA hairpins are bound co-transcriptionally by λ N protein monomers^57,58,59,60 or a minimal 21-residue tract (N^2–22) from its N-terminus^61,62,63. We fused two N^2–22 peptides (2xN) to the C-terminus of Cas9, generating Cas9-2xN, and assayed editing with one, two, or three boxB HTs upstream of the protospacer (Supplementary Note 2). Considerably higher editing by PID mutants was observed with Cas9-2xN than native Cas9, in particular with two or three boxB HTs (Fig. 1f). This Cas9-2xN activity was specific for boxB HTs, as no increase was observed with two upstream MS2^U HTs. In addition, 2xN fusions partially increased editing with Wt PIDs at suboptimal NAG or NGA PAMs with a single boxB HT upstream, though activity at NGG PAMs was independent of 2xN (Supplementary Fig. 3c).

To rule out the possibility that nascent RNA binding non-specifically increases the activity of interacting Cas9 fusion proteins, e.g., by generally capturing higher concentrations of Cas9 fusions in the nucleus rather than specifically increasing editing near HTs, we tested whether HTs stimulate editing in trans using SSA dual-reporter yeast strains. These strains contain a second identical protospacer and PAM insertion, accompanied by zero or two upstream MS2^U HTs, that disrupts the constitutively transcribed CAN1 ORF on a separate chromosome from LYS2 (Fig. 2a). Once reconstituted through SSA, the CAN1 locus sensitizes cells to canavanine⁶⁴ (Can) and therefore serves as a counterselection marker: only edited cells suffer reduced colony formation efficiency in the presence of Can. Editing at LYS2 by the NG variant was increased with scMCP fusions only when HTs were present at the LYS2 locus (Fig. 2b, c). Conversely, whereas scMCP fusions increased editing by NG at the CAN1 locus when HTs were present there, they did not increase editing at the CAN1 locus when HTs were present at LYS2 (Fig. 2c; Supplementary Fig. 4a). Wt control activities at each locus were not affected by fusion to scMCP or the presence of HTs (Supplementary Figs. 4b–c). Taken together, these results indicate that auxiliary RNA binding domains (RBDs) fused to Cas9 partially compensate for subPAM interactions when their RNA substrates are transcribed from one or more templates near the protospacer in cis. Given their co-transcriptional binding capabilities^{50,51,52,57,58,59,60}, we inferred that these RBDs stimulate editing through a TraCT mechanism.

Fig. 2: Nascent RNA binding is insufficient to compensate for subPAM interactions in *trans.*

TraCT can drive allele-specific editing in yeast

Encouraged by our dual-reporter results, we envisioned applying TraCT in eukaryotic cells to facilitate differential editing at identical target sequences that are transcribed at different levels. To test this potential application and more rigorously challenge the hypothesis that Cas9-RBD fusions function with cis-acting HTs via a TraCT mechanism, we constructed a diploid SSA reporter yeast strain for transcription-dependent allelic editing. Our diploid strain was engineered to contain identical target and MS2^U insertions disrupting each of its LYS2 alleles, but with different promoters controlling their transcription (Fig. 3a). One allele contains a β-Estradiol (β-E) inducible promoter (pZEVi) that can achieve high levels of transcription^65,66, while the other contains the native pLYS2 promoter that is constitutive but weak^67,68. We first tested haploid yeast strains containing either the constitutive pLYS2 promoter or the inducible pZEVi promoter. In the pLYS2 strain, editing with Cas9^MT3 or Cas9^MT3-scMCP was unaffected by β-E as expected (Fig. 3b). In the pZEVi strain, however, editing was significantly increased by β-E at 1000 nM relative to 0 and 15 nM, and Cas9^MT3-scMCP activity at 1000 nM approached that of Wt (Fig. 3b; Supplementary Fig. 5a). Similar trends were obtained when plating diploid transformants after 24 h outgrowth in liquid medium that selects for Cas9(-scMCP) and sgRNA maintenance but lacks β-E (Supplementary Fig. 5b). To definitively determine whether editing by the MT3 variant after outgrowth is preferentially stimulated at the pZEVi allele upon induction, we subcultured the diploid transformants in liquid media containing 0, 15, or 1000 nM β-E and then harvested genomic DNA for targeted deep sequencing (see “Methods”). This confirmed that the ratio of unedited pLYS2:pZEVi alleles increased significantly in MT3 populations with 1000 nM β-E relative to 15 nM, and to a much greater extent for the scMCP fusion (Fig. 3c). A similarly significant increase in this ratio, though smaller, was also detectable with Cas9^MT3-scMCP at 15 nM β-E relative to 0 nM, yet it was undetectable with native Cas9^MT3. Note that in the absence of β-E we measured a mean ratio slightly less than 1:1 ( ~ 0.95) with the scMCP fusion, but not with native Cas9 (~1.02), implying that nascent RNA binding at the weakly transcribed pLYS2 allele was sufficient to license preferential editing relative to the repressed pZEVi allele. Collectively, these results establish an engineering strategy that facilitates spatiotemporal coupling between in vivo transcription near particular protospacers and Cas9 targeting activity at those sites.

**Fig. 3: TraCT can be exploited for preferential editing at differentially transcribed alleles with identical target sequences.**

TraCT can stimulate editing with exogenous donor DNA

We investigated whether TraCT activity is detectable in non-SSA repair contexts to probe its generalizability. First we designed a haploid pZEVi-LYS2 yeast reporter strain that lacks the downstream repeat used for SSA, such that perfect reconstitution of its LYS2 ORF through homology-directed repair requires co-transformation with an exogenous DNA donor fragment (Supplementary Fig. 6a). This strain contains one boxB HT upstream of the protospacer for testing with Cas9-2xN fusions (Supplementary Fig. 6a). Constitutive Wt Cas9 cleavage suppresses yeast colony formation until target sequences are genetically disrupted^69,70, and colony formation with Cas9^Wt or Cas9^Wt−2xN was increased over 50-fold in our system by supplying a DNA donor fragment that eliminates the target upon repair (Supplementary Fig. 6b). However, unlike Wt neither Cas9^NG nor Cas9^NG−2xN suppressed colony formation under any conditions tested; they instead produced only reduced-growth phenotypes with targeting sgRNAs at 1000 nM β-E (Supplementary Fig. 6b), and their phenotypes were indistinguishable in the absence of donor DNA (Supplementary Fig. 6b; Supplementary Figs. 7a–d; Supplementary Note 3). In parallel, we quantified LYS2⁺ fractions by plating immediately (Supplementary Fig. 6c) or by replating after 44 h outgrowth on full plates containing 0, 15, or 1000 nM β-E (Supplementary Figs. 6d–e). This revealed significant increases in donor-dependent editing with Cas9^NG−2xN after outgrowth at 0 nM and 15 nM (Supplementary Fig. 6d; Supplementary Note 3).

Protospacer orientation influences TraCT outcomes

The preceding sections defined several editing conditions in which RBD-mediated interactions can detectably compensate for subPAM activity. Given that subPAM interactions are expected to limit Cas9 recruitment to DNA at the PAM-recognition step, RBDs may license TraCT through a facilitated recruitment mechanism. However, the experiments with pZEVi strains also showed that β-E induction can stimulate targeting by native Cas9^NG (Supplementary Figs. 6b–e, Supplementary Fig. 7a) or native Cas9^MT3 (Fig. 3b, c, Supplementary Fig. 5b) in certain contexts; i.e., when RBDs are absent. This prompted us to further investigate two ways in which transcription might inherently affect Cas9 editing activity and potentially mask RBD-mediated TraCT.

First, high transcriptional activity has been shown to increase histone eviction⁷¹, and occlusion of PAM sequences within nucleosomes was previously found to impair Cas9 cleavage activity⁷². Therefore, high-level transcription may increase access to naked DNA and thus subvert kinetic or energetic barriers for subPAM interactions. Using MNase-seq and a pZEVi-LYS2⁺ haploid yeast strain (see “Methods”), we confirmed that induction with 1000 nM β-E for 7 h reduced median relative nucleosome occupancy at the LYS2 locus to significantly lower levels than 15 nM ( ~ 58% and 84% of the 0 nM median, respectively; Supplementary Figs. 7e–f). Under these conditions, LYS2 RNA levels were induced more than 100-fold (Supplementary Fig. 7g).

Second, transcribing RNA polymerases (RNAPs) might dislodge native Cas9 from cleaved protospacers in the PAM-downstream orientation employed, thereby facilitating repair outcomes⁷³. Relative to the direction of transcription, this corresponds to an sgRNA spacer that anneals to the RNAP’s template strand. High transcription can thus ensure frequent Cas9 turnover and rapid processing of cleaved DNA, whereas weak transcription may allow Cas9 to remain bound for longer periods of time and thereby delay processing. When we tested a protospacer in the inverted (PAM-upstream) orientation with 1 or 3 upstream boxB HTs using our original pLYS2 haploid background (Supplementary Fig. 8a), TraCT was virtually undetectable (Supplementary Fig. 8b). Control experiments with Cas9-scTetR confirmed that the inverted protospacer orientation does not limit MT3 activity if the tetO2 element is also relocated upstream (Supplementary Figs. 8c–d). Importantly, baseline editing activity with native Cas9 was unaltered by these inversions. In other words, weak transcription from pLYS2 does not detectably stimulate native Cas9^MT3 or Cas9^NG editing outcomes with one protospacer orientation relative to the other, though it does stimulate Cas9^MT3−2xN or Cas9^NG−2xN via boxB with the PAM-downstream protospacer orientation (Supplementary Note 4).

TraCT requires a direct Cas9-RBD linkage

An assumption of our proposed model for facilitated recruitment is that direct physical interactions between Cas9 and nascent RNA, as afforded by the covalent linker used in our Cas9-RBD fusions, may be essential for engineered TraCT. To investigate this possibility and further refine our engineering guidelines, we expressed RBDs in trans to test whether they suffice to stimulate native Cas9^MT3 when a compatible HT is provided. We tested three different constitutive promoter strengths driving ectopic expression of a nuclear-localized mCherry-2xN (mC-2xN) fusion protein in a LYS2 SSA reporter strain containing one boxB HT upstream of its protospacer insertion (Fig. 4a; Supplementary Figs. 9a–b). Editing with native Cas9^MT3 was not appreciably stimulated at any mC-2xN expression strength (Fig. 4b). Importantly, we also observed a complete abrogation of Cas9^MT3−2xN’s TraCT activity when expressing moderate (++) and high (+++) levels of mC-2xN (Fig. 4b) even though expression of mC-2xN had no effect on Wt editing levels (Supplementary Fig. 9c). The inhibitory effect of mC-2xN on Cas9^MT3−2xN likely arises from competitive binding at actively transcribed boxB hairpins. Control experiments were conducted wherein the same three promoters were used to express nuclear-localized TetR proteins in trans with MT3 mutants of native Cas9 or Cas9-scTetR targeted to a tetO2-adjacent protospacer (Supplementary Figs. 9a, d). In contrast to results with mC-2xN, native Cas9^MT3 activity in the absence of Dox was partially stimulated by trans TetR expression when compared with maximal Cas9^MT3-scTetR activity measured without trans-expressed TetR (Supplementary Fig. 9e–f; Supplementary Note 5).

**Fig. 4: 2xN peptides expressed independently of Cas9 or Cas9-2xN fusions do not compensate for *boxB*-proximal subPAM interactions and can limit TraCT.**

To determine whether boxB transcription stimulates more Cas9-2xN than native Cas9 recruitment to DNA, we performed chromatin immunoprecipitation (ChIP) experiments that exploit the HA epitope tag present in both proteins (Supplementary Fig. 1). We tested our non-SSA pZEVi strain with one boxB HT in LYS2 (Supplementary Fig. 6a) and a non-targeting sgRNA plasmid to ensure that growth was not significantly perturbed at any β-E concentration. ChIP-qPCR results were highly concordant with data from editing experiments: transcriptional induction with 15 or 1000 nM β-E increased LYS2 pull-down near boxB but not pull-down of TAF10 DNA from a separate chromosome, and LYS2 pull-down was significantly higher with Cas9^NG−2xN than Cas9^NG at both induction strengths (Fig. 5a, b). We also performed ChIP-seq for the samples from our first replicate and observed broad increases in pull-down of the LYS2 locus upon induction, which increased much further with Cas9^NG-2xN in the region beyond boxB (Fig. 5c). Although enhancements in LYS2 pull-down with Cas9^NG-2xN were highest immediately downstream of boxB (Fig. 5c, insets), they remained prominent from there down to the terminator region where transcripts dissociate, suggesting that many boxB-containing nascent transcripts are bound throughout this sizeable stretch of the locus. Background pull-down of adjacent genes, or the TAF10 control region, was virtually identical with Cas9^NG-2xN and native Cas9^NG in the presence of β-E (Supplementary Fig. 10). Therefore, pull-down of LYS2 was indeed specifically enhanced by the boxB interaction. Highly consistent results were observed in experiments where we tested our targeting sgRNA under identical conditions but with dCas9^NG(−2xN) to prevent cleavage-mediated toxicity (Supplementary Fig. 11). Altogether, these results indicate that physical linkages between RBDs and Cas9 license engineered TraCT, likely by increasing Cas9’s recruitment near protospacers when cognate HT DNA is transcribed upstream.

**Fig. 5: Transcription of *boxB* stimulates Cas9−2xN accumulation at downstream DNA sequences within the transcribed locus.**

Cas9-RBDs can exploit anchored hairpins

Having confirmed that engineered TraCT functions at least partially through an RBD-facilitated recruitment mechanism, we next examined whether Cas9-RBD fusions can compensate for subPAM interactions by binding RNA hairpins anchored in cis, rather than strictly relying on HTs transcribed in cis. To test this, we constitutively localized trans-expressed boxB hairpins near a protospacer that contains a subPAM but lacks cis-acting HTs. This was achieved with a dual sgRNA system for Cas9^Wt-2xN and two closely paired protospacers, in which one contains a suboptimal NGA PAM and the other contains an NGG PAM that can be targeted with partially mismatched spacers which license constitutive binding but not cleavage (Fig. 6a). An array of three boxB hairpins was fused to the 3’ end of one sgRNA scaffold (Supplementary Note 2) while the other was left unmodified, and spacers for each target site were then tested in either scaffold. In the first configuration, hairpins are fused to the NGG-targeting sgRNA with spacer 1 (Fig. 6a); in the second, hairpins are fused to the NGA-targeting sgRNA with spacer 2 (Supplementary Fig. 12a). Either configuration could in principle stimulate Cas9^Wt-2xN recruitment to the NGA PAM via constitutive interactions at the NGG-adjacent protospacer. Indeed, the 2xN fusion robustly stimulated editing with each sgRNA configuration when 5 mismatches (Mm), but not 20 Mm that completely abrogate binding, were present in spacer 1 (Fig. 6b; Supplementary Fig. 12b; Supplementary Note 6). We also tested modified sgRNA scaffolds with only one or two boxB hairpins and this revealed smaller but still significant increases in editing with Cas9-2xN when spacer 1 contained 5 Mm (Supplementary Figs. 12c–d). These results establish that actively transcribed RNA substrates are not strictly required to facilitate Cas9-RBD targeting at subPAMs, as even trans-expressed RNA substrates can facilitate targeting if they are anchored near the protospacer via Cas9-mediated R-loops. It should be emphasized that the strong NGG PAM we exploited for R-loop formation in this assay is expected to constitutively stimulate targeting at the subPAM, rather than license transcription-associated targeting. However, anchored boxB hairpins are not strictly sufficient to stimulate targeting either, as inversion of the NGA-linked protospacer eliminated Cas9^Wt−2xN’s advantage over native Cas9^Wt (Supplementary Figs. 12e–f; Supplementary Note 6).

**Fig. 6: Constitutive R-loop formation with boxB-fused sgRNAs that bind DNA but do not license cleavage can stimulate Cas9^Wt-2xN targeting at nearby subPAMs.**

TraCT can function in human cells

Finally, we tested whether TraCT can stimulate indel formation through donor-independent non-homologous end-joining (NHEJ) in EGFP reporter lines derived from human U2OS osteosarcoma cells. To this end, two boxB HTs were inserted into either the 5’ UTR of the EGFP marker cassette or an intron within the EGFP ORF, and downstream protospacers with NGA or NAG subPAMs were each targeted with one of four sgRNAs (Fig. 7a). When the U2OS.EGFP control line lacking boxB HTs was co-transfected with transient expression plasmids for Cas9 and sgRNA, we measured lower indel formation with an alternative Wt Cas9-2xN fusion (Cas9-2xN^Wt-alt) relative to the isogenic native Cas9^Wt-alt architecture (Fig. 7b; Supplementary Fig. 1c; Supplementary Note 1). Despite this apparent impairment in the absence of boxB, indel formation by Cas9-2xN^Wt-alt was significantly higher (1.61- to 2.07-fold) relative to Cas9^Wt-alt with three out of four sgRNAs in the boxB-containing lines (Fig. 7b). These results provide a proof-of-principle that TraCT can function in human cells.

**Fig. 7: Engineered TraCT can stimulate indel formation in human cells.**

Discussion

We report here that spatiotemporal control over Cas9 targeting in eukaryotic cells can be achieved using engineered Cas9 fusion proteins that conditionally interact with specific DNA or RNA substrates in proximity to protospacers. Unlike other engineered strategies for inducible Cas9 targeting that modulate the expression or activity of Cas9’s core domains and sgRNA components^{21,22,23,24,25,26,27,28,29,30,31,32}, our approach offers conditional targeting activity only at protospacers with subPAM interactions and suitably positioned cis elements. Hence, our Wt PID fusions were fully active and targeted strong PAMs constitutively under all conditions tested. This implies that conditional and constitutive targeting could be simultaneously achieved at two unlinked protospacer sites if one has a subPAM and the other a strong PAM.

Central to this work, fusion of Cas9 to our scMCP or 2xN RBDs compensated for subPAM interactions through a TraCT mechanism that exploits cis-acting HTs. This activity is inherently conditional because the RNA hairpin substrates for these RBDs are only transiently tethered near their HT DNA during active transcription in cis. Transcription of HTs in eukaryotic cells can thus serve as an intracellular physiological signal to control Cas9 targeting spatiotemporally, analogous to the role of protospacer transcription in DNA targeting by type III systems in prokaryotes^{33,34,35,36,37,38,39}. TraCT activity scaled with the level of HT transcription and was thereby advantageous when applied in yeast for partial discrimination between identical allelic sequences that are differentially transcribed; i.e., TraCT facilitated preferential editing at the more highly transcribed of two target alleles in a diploid yeast strain (Fig. 3). Given also the suitable effect sizes we measured with yeast SSA reporters throughout this study, it is expected that our system will be generally applicable in future work as a two-hybrid platform for testing whether other putative or known protein-RNA interactions can function co-transcriptionally. We anticipate that TraCT could be further applied in future studies as a genetic surveillance or lineage tracking system in animal cell populations with heterogeneously transcribed DNA targets^{74,75,76,77,78}. For example, stable expression of RBD fusions and sgRNAs in differentiating cells could selectively enrich for editing in subpopulations that upregulate transcription at HT-linked protospacers. However, future improvements in higher eukaryotes may be needed before TraCT can be implemented for such applications, considering that our proof-of-principle indel experiments in human cells produced only small effect sizes. We moreover caution that the utility of existing Cas9-RBD fusions for TraCT may prove limited outside of genetically engineered systems, owing to their reliance on heterologous cis-acting HTs, and pre-insertion of HTs may be less efficient in many cell lines relative to our U2OS derivative’s efficiency ( ~ 23% for insertion of boxB^2x in the 5’ UTR; see “Methods”). Although our double boxB insertion within the 5’ UTR of EGFP did not reduce mean cellular fluorescence (Supplementary Fig. 13), secondary structures inserted within eukaryotic 5’ UTRs have been found to reduce expression in certain contexts^79,80, highlighting another potential engineering constraint that could limit TraCT’s implementation. Notwithstanding, natural eukaryotic systems also contain proteins capable of binding nascent RNA, such as splicing factors that are co-transcriptionally localized near the DNA templates of introns, splice junctions, or their downstream exons^81,82,83. We therefore speculate that fusion of Cas9 to splicing factors could facilitate TraCT at such native sites. It might also be possible to program TraCT virtually anywhere in the genome by targeting pre-mRNA with antisense RNAs fused to boxB hairpins^84,85 for Cas9−2xN recruitment, or, fused directly to Cas9’s sgRNA scaffold. Alternatively, fusion of Cas9 to an RNA-targeting (d)Cas13 effector protein^86,87,88 from type VI CRISPR-Cas systems³ might prove effective. Whether (d)Cas13 complexes are capable of binding target RNAs co-transcriptionally remains to be elucidated, however.

In light of prior work and findings herein, several lines of evidence offer insight into the molecular details underlying engineered TraCT in eukaryotic cells. Nucleosomal DNA is thought to principally impede Cas9 targeting at the PAM-recognition step^72,89, and DNA binding sites for proteins that displace nucleosomes in vivo have been shown to locally stimulate targeting⁹⁰. Active transcription was also confirmed to reduce nucleosome occupancy in our system (Supplementary Figs. 7e–f), though it need not evict histones from DNA entirely⁹¹. Hence, we speculate that recruitment of Cas9-RBD fusions to transcribing RNAPs, via tethered nascent RNA, provides temporal windows during which Cas9 can rapidly interrogate naked PAM DNA in cis. Concomitant RNA tethering and nucleosome displacement would thus transiently overcome the kinetic and/or energetic limitations of subPAM interactions in eukaryotic cells. In other words, TraCT might function through a dynamic mechanism that relies on concomitant Cas9 recruitment and exposure of target DNA by RNAPs. This two-part model (Supplementary Fig. 14) could explain why upstream HTs offered a pronounced advantage over downstream HTs; downstream HTs will yield RNA hairpin substrates only after an RNAP has transcribed beyond the target, at which point target DNA may once again be occluded within nucleosomes⁹¹. Supporting this model, accumulation of Cas9^NG−2xN peaked within 300 bp downstream of boxB in our yeast ChIP-seq experiments (Fig. 5c, insets), likely because short RNA tethering lengths maximize local concentrations at exposed DNA near the transcription bubble. Beyond that peak, accumulation remained considerably higher for Cas9^NG−2xN than native Cas9^NG but plateaued at a reduced level in the downstream LYS2 locus, so we infer that tethering Cas9^NG-2xN at longer lengths may still increase DNA contacts but to a lesser extent that results in fewer crosslinks. Various lengths have been tested in prior studies that examine how boxB-mediated tethering can facilitate Wt λ N’s functional interactions with elongating E. coli RNAPs. For instance, using a 184 nt length, one study confirmed that interactions with in vitro transcribing RNAPs are reduced when boxB is prematurely cleaved off the nascent RNA via treatment with a precise RNase⁵⁹. It remains to be determined whether TraCT can function with HTs 150-300 bp upstream of the protospacer or further, as all HTs used in this work were less than 150 bp away. In contrast with downstream HTs that largely failed to license TraCT, hairpins constitutively anchored downstream using our paired-protospacer strategy in yeast robustly stimulated editing by Cas9-RBDs (Fig. 6), beyond the proxy-CRISPR effect⁹² proposed to result from nucleosome displacement alone. This qualitatively mirrored our results comparing Cas9^MT3-scTetR to native Cas9^MT3 with TetR in yeast (Supplementary Fig. 9e), and ostensibly supports our two-part TraCT model that jointly considers recruitment and nucleosome displacement effects. Interactions with static tetO2 elements or NGG-adjacent mismatched protospacers licensed an advantage over native Cas9 in yeast only when positioned on the PAM side of the cleavable protospacer, however (Supplementary Figs. 8c–d; Fig. 6; Supplementary Figs. 12e–f). This might reflect Cas9’s reported requirement for a sequence-independent ‘post-PAM’ interaction at PAM-distal DNA⁹³, which, should be nucleosome-depleted in these configurations. If we furthermore surmise that transcription dynamically exposes DNA for the post-PAM interaction, the relative kinetics of this step should also be considered when interpreting our result that TraCT was influenced by protospacer orientation at a weakly transcribed yeast locus (Supplementary Figs. 8a–b). Additional experiments will be required to dissect these mechanistic points and investigate whether transcription-associated DNA targeting can be achieved with other CRISPR-Cas effectors in eukaryotic cells.

Methods

Microbial strains and growth conditions

Cultivation of Escherichia coli strains XL1-Blue or TOP10 was carried out as previously described¹⁹ for petri dish plates or 5 mL liquid cultures. Media were supplemented with carbenicillin (100 µg/mL) and/or kanamycin (50 µg/mL) for plasmid cloning and maintenance. When chemically competent or electrocompetent XL1-Blue cells were prepared for downstream cloning, media was supplemented with tetracycline (15 µg/mL) alone.

The S. cerevisiae yeast strains used in this work were all derived from BY4741 or BY4731 haploids⁹⁴; their genotypes are provided in Supplementary Data 2. Except where noted otherwise, all liquid yeast culture and competent cell preparation was performed sterilely with YEPD broth or synthetic complete (SC) dropout broths lacking one or more amino acid (SC–) in 18 × 150 (for 5 mL volumes) or 25 × 150 mm (for 20 mL volumes) disposable round-bottom glass tubes as described previously¹⁹. This included our adaption of the high-efficiency lithium acetate transformation protocol⁹⁵. Briefly, 5 mL YEPD or SC starter cultures were first inoculated from single colonies and grown for 24 h overnight. After subculturing overnight starter cultures by diluting to ~0.15 Attenuance (D_{600 nm}) in 20 mL of the same broth (1:100 for YEPD or 1:70 for SC) and allowing 4.5 h (YEPD) or 5.25 h (SC) growth on a roller drum at 30 °C ( ~ 0.6 D_{600 nm}), cells were harvested by centrifugation at 2103 × g for 3 min and washed once with resuspension in an equal volume of sterile double-distilled water (ddH₂O). After washing a second time with resuspension in an equal volume of filter-sterilized lithium acetate (0.1 M) and re-pelleting, cells were transferred to 1.5 mL tubes in their residual supernatant, centrifuged again, and resuspended to a final volume of 200 µl in 0.1 M Lithium Acetate. Each 20 mL subculture yielded four transformation reaction aliquots; when scaling up for more transformations, additional subcultures were grown in parallel and cells were pooled in 1.5 mL tubes at this step. A modified transformation mix (TRAFO mix)⁹⁵ was prepared fresh in parallel, and 50 µl of cells were combined with 300 µl of TRAFO mix in 1.5 mL tubes for each transformation reaction. DNA ( ≤ 20 µl in ddH₂O or buffer EB from Qiagen) was then added directly to this mixture, and tubes were homogenized by vortexing before transferring to a roller drum at 30 °C. After rotating for 30 min, 36 µl pure DMSO was added to each tube, and tubes were re-homogenized by vortexing and transferred to a 42 °C heat block. After incubating for 15 min, cells were pelleted by centrifugation and resuspended in filter-sterilized 5 mM CaCl₂ (400 µl, unless noted otherwise) and then incubated at room temperature (r.t.) for 10 min. After 10 min at r.t., resuspended cells were spread directly on solid media (150-200 µl for 100 × 15 mm plates) using sterile 4 mm glass beads (Fisher), or spotted as droplets (5 µl on YEPD or 10 µl on SC) from serial dilutions prepared with sterile ddH₂O. Our fresh TRAFO mixes were homogenized from 240 µl PEG 3350 (44%), 36 µl 1 M Lithium Acetate, and 25 µl Promega Herring Sperm DNA (10 mg/mL) per reaction by vortexing vigorously in a 50 mL sterile falcon tube. Immediately before preparing a TRAFO mix, pre-aliquoted Herring Sperm DNA was thawed and resuspended by vortexing vigorously, boiled on a heat block at 95 degrees for 5 min, and then chilled on ice for 5 min. To select for transformation-associated integration of DNA fragments linked to a nourseothricin-resistance (Nat^R) resistance marker, cells were plated on solid media supplemented with clonNAT (100 ug/mL) after recovering half the transformant suspension for 16-24 h in the same clonNAT-free liquid media that was used to prepare their parental competent cells. ClonNAT was generally maintained during Nat^R strain re-streaks but not included in the media during other assay procedures. For all experimental yeast assays in this work, biological replicates were initiated by launching liquid starter cultures from fresh single colonies grown on SC dropout plates at 30 °C for 48 h after streaking. Except where noted otherwise, SC media formulations (Sunrise Science) with leucine and/or uracil dropped out were used throughout the work to select for maintenance of the Cas9 and/or sgRNA episomes, respectively. Slight colony size reductions were observed when selecting for strains that express Cas9^Wt-scMCP with or without a sgRNA (Supplementary Fig. 5c), suggesting low-level toxicity from expression.

DNA preparation and cloning

All circular plasmid DNA used in this work was purified from saturated 5 mL liquid cultures of E. coli (grown for 16 h at 37 °C as described previously¹⁹) using Qiagen miniprep reagents. Centrifugation at 4000 × g for 6 min was used to pellet the cultures before miniprepping. Plasmid identifiers and descriptions are provided in Supplementary Data 3. All plasmid-derived linear DNA fragments used for cloning, yeast strain construction, or transformation-associated editing experiments were isolated from miniprepped DNA via restriction digest and gel purification. PCR amplicons used for cloning, CRISPR-assisted knock-in, or Illumina sequencing were also gel-purified. For cloning and CRISPR-assisted knock-ins, amplicons were generally obtained using high-fidelity Q5 Hot Start (2X Master Mix) or Phusion polymerases from NEB; otherwise, GoTaq Green (Promega) was used for fragments up to ~ 2 kb that could not be obtained with Q5 or Phusion. GoTaq was also used for all PCR amplification from genomic DNA (gDNA) templates, including for yeast colony PCRs performed as described previously¹⁹ by templating with 2 μl of supernatant from colonies that were resuspended in 40 μl NaOH (20 mM) and boiled 7 min. Cycling conditions recommended by the manufacturer were usually followed, except that 90 s of annealing was used during each cycle when amplifying from gDNA with GoTaq. All Wt, MT3, and NG PID fragments used for assembly with gel-purified acceptor vector backbone fragments in transformation-associated editing experiments were amplified from plasmid templates using Q5 or Phusion with oligos oGG1019/oGG1052 (Supplementary Data 4) and then gel-purified. Gel purifications were typically carried out with disposable x-tracta tools (Sigma-Aldrich) and MinElute kits (Qiagen). In addition to PCR- or plasmid-derived fragments, commercially synthesized DNA fragments (e.g., gBlocks from IDT) were also used directly in cloning procedures.

The majority of plasmid cloning in this work was carried out with Gibson⁹⁶ or NEBuilder HiFi (NEB) master mixes for isothermal assembly of linear DNA fragments at 50 °C for 1 h. In some cases these assemblies included a single-stranded DNA bridging oligo for introducing small inserts, such as sgRNA spacers. Otherwise, cloning was achieved via one-pot restriction-ligation reactions with type IIS restriction enzymes (NEB or Thermo Scientific) and T4 DNA ligase (NEB). These 20 µl reactions included circular plasmid DNA (destination vector) and annealed oligos with appropriate overhangs for ligation, 1 µl type IIS restriction enzyme, 1 µl T4 ligase, and buffers (CutSmart and T4 ligase buffers, each at 1X final concentrations); they were incubated at 37 °C for 4-24 h. In most cases XL1-Blue or TOP10 chemically competent cells were used for cloning and before plating were recovered into 300 µl SOC broth for 1 h on a roller drum at 30 or 37 °C after the 45 s heat shock at 42 °C. If these did not yield clones initially, electrocompetent XL1-Blue cells were occasionally used in accordance with dialysis and electroporation procedures described previously¹⁹. Following electroporation, transformants were recovered into 10 mL SOC broth for 1 h on a roller drum at 30 or 37 °C before plating. At a minimum, all cloned sequences derived from PCR amplicons or commercially synthesized DNA and their adjacent backbone-derived sequences at recombinant junctions were verified by Sanger sequencing. In some cases, entire plasmids were sequenced with long-read sequencing by Plasmidsaurus. The expected sequences of all plasmids used in this work are provided in Supplementary Data 3.

All Cas9 acceptor vectors were cloned in E. coli and served as the source of backbone fragments for assembly with PID inserts in yeast. These were based off the pGG211, pGG431, and pGG442 acceptor vectors¹⁹, which were themselves derived from a minimal Cas9^Wt-SV40 expression plasmid (pNA0306) described previously⁹⁷. The kanamycin-resistance (Kan^R) cassette that replaces PIDs in all acceptor vectors is flanked by restriction sites for backbone isolation and ensures that any uncut or singly cut plasmid carryover does not contribute functional Cas9 activity upon transformation. All backbone fragments were isolated by restriction digest and gel purification; NheI and NotI were used for pGG574, pGG547, pGG548, or pGG618 plasmids (Dox induction experiments) while NheI and PspOMI were used for pGG532, pGG500, or pGG573 plasmids (TraCT experiments). When functional Cas9 plasmids were needed for strain construction they were pre-assembled with PID inserts and cloned in E. coli before delivery into yeast. Expression of sgRNAs in yeast was achieved with derivatives of the single sgRNA (pNA0304) or double sgRNA (pNA0308) plasmids described previously⁹⁷. The pGG197 single sgRNA plasmid was used for the majority of targeting assays in this work and its spacer matches a heterologous EGFP protospacer (which served as Protospacer 1 in paired-protospacer experiments) that was employed previously¹⁹. The Protospacer 2 sequence used in paired-protospacer experiments is also a heterologous EGFP protospacer which was targeted by the ‘NAG 3’ spacer employed previously¹⁸, but its PAM was altered in this work (CAGCTTGC→CGAAGTAA). All derivatives of pNA0308 also included a SUP4 terminator insertion between the second sgRNA scaffold and the RPR1 terminator. Constitutive expression of Z₃EV in yeast, the β-E-inducible activator protein for our pZEVi promoter, was achieved with an ACT1 promoter and RPS5 terminator cassette stably integrated at the YKL162C-A locus. This cassette was cloned along with a Nat^R marker and homology arms in a NotI fragment of pGG613, as in pMS101, but without extraneous sequences between the Nat^R marker and downstream homology arm. The pMS101 plasmid includes previously described parts⁶⁵ but was redesigned with a pAV10.KN backbone for integration at the YKL162C-A locus as described previously⁹⁸. The pZEVi promoter used in this work is based off the P4 promoter⁶⁶, a GAL10-1 hybrid promoter with binding sites for Z₃EV. Constitutive expression cassettes for nuclear-localized TetR and mCherry-2xN proteins were similarly cloned in E. coli plasmids and then isolated on NotI fragments for stable integration at the YKL162C-A locus in yeast. These plasmids were built off the pSIB619 architecture previously used for constitutive expression of the cam-TA protein⁹⁹ with a TDH1 promoter and STR1 terminator (modified to lack BsmBI restriction sites). The TDH1 promoter was replaced with ADH1 or TDH3 promoters to obtain higher constitutive expression strengths in a subset of the pSIB619 derivatives. The Cas9^Wt-alt (pAG5) and Cas9-2xN^Wt-alt (pGG768) plasmids used for transient expression in human cells were cloned in E. coli and derived from the pGG439 plasmid¹⁹, which was itself derived from the MSP469 plasmid¹⁸. Transient expression of sgRNAs in human cells was achieved with derivatives of the BPK1520 single sgRNA plasmid¹⁸, also cloned in E. coli.

Construction of yeast strains

All haploid yeast strains constructed in this work are described within Supplementary Data 2 and were generated using one of the following three transformation schemes. (1) To introduce circular episomes carrying selection markers that confer prototrophy, plasmid DNA ( ~ 30-60 ng) was delivered directly through transformation and cells were plated with appropriate selection. (2) To modify the native LYS2 or CAN1 loci (including LYS2’s promoter region) we used a CRISPR-assisted knock-in method essentially as described previously¹⁹. Sequences of the sgRNA plasmids used for knock-in at the LYS2 ORF (pGG202), CAN1 ORF (pGG203), or LYS2 promoter (pGG578) are provided in Supplementary Data 3. Sequences of the linear double-stranded DNA donor fragments used for knock-in, including their homology arms, are listed in Supplementary Data 5. When patching colonies for initial phenotypic screening in this work, SC–Arg–Leu–Lys–Ura plates were used in place of SC–Lys, and SC–Arg–Leu–Lys–Ura+Can plates were used in place of SC–Arg+Can plates. Furthermore, during pZEVi strain isolations, 1000 nM β-E was included in the SC–Arg–Leu–Lys–Ura plates to turn on the pZEVi promoter and facilitate LYS2⁺ colony screening. CRISPR-assisted edits were validated by Sanger sequencing of gDNA-templated PCR amplicons that covered, at a minimum, all recombinant sequences and the flanking homology arms used for knock-in. In some cases, intermediate strains with genotypes that differ from BY4741’s were used as parent strains for a second round of CRISPR-assisted knock-in, as outlined in Supplementary Data 2. Before delivering different sgRNA and/or Cas9 plasmids into CRISPR-edited strains for downstream experiments or for secondary rounds of CRISPR-assisted editing, the original sgRNA and/or Cas9 episomes used for knock-in were cured via replica plate screening as described previously¹⁹. (3) To integrate DNA into the YKL162C-A locus⁹⁸, linear DNA fragments ( ~ 200-300 ng) bearing a Nat^R marker and sufficient homology arm lengths to select for integrative recombination were directly transformed without CRISPR co-selection; cells were then recovered for 24 h in liquid YEPD or SC dropout media to select for plasmids before plating on solid media of the same type but with clonNAT present. These linear DNA fragments were excised from circular E. coli plasmids listed in Supplementary Data 3 via restriction digest with NotI, followed by gel purification. Integration of the pGG613 fragment in yGWG307, yGWG311, and yGWG312 lineages or the pGG658 fragment in yGWG315 was further validated by colony PCR and Sanger sequencing of regions spanning the homology arm junctions in YKL162C-A.

The diploid SSA reporter strain yGWG422 was mated from yGWG312 and yGWG381 haploids (Supplementary Data 2) on solid media. In brief, single colonies of each strain were streaked from opposite sides of a YEPD plate into a central overlapping region where mating could occur. After 7 h growth at 30 °C, this plate was replica-plated onto an SC–Ura–His plate to select for segregants with prototrophies unique to each parent strain (URA3+ from pGG197 in yGWG312 and HIS3+ from yGWG381’s genome). A diploid clone was isolated from the segregant pool by re-streaking onto an SC–Ura–His plate; its SSA target region was amplified by colony PCR and Sanger sequencing confirmed the expected heterozygosity ( ~ 1:1 allelic ratio) at the position of the point mutation.

Quantitative transformation-associated editing assays in yeast

Editing of haploid SSA reporter strains with a single sgRNA was performed essentially as described previously for clonal selection experiments in yeast¹⁹. In brief, haploid reporter strains were co-transformed via the lithium acetate procedure described above using 150 ng linearized backbone DNA and ~ 41 ng PID insert DNA ( ~ 1:3 molar ratio). Transformant populations resuspended in CaCl₂ were then serially diluted and spotted in parallel on different SC dropout plates, including SC–Leu–Ura plates to select for all viable transformants maintaining both the Cas9 and sgRNA episomes, as well as SC–Arg–Leu–Lys–Ura plates that select for transformants with both episomes and a reconstituted LYS2 marker. Where applicable, additional spotting on SC–Arg–Leu–Lys–Ura plates supplemented with Dox, Can, or β-E was performed in parallel using the same serial dilutions. Plates were incubated for 47 h at 30 °C (or 72 h for Can-supplemented plates) before immediately photographing and then quantifying colony forming units (CFUs) by eye. After initial scoring, plates were stored at benchtop for up to 16 h (or 24 h for Can-supplemented plates) and counted once more; the higher of two CFU counts was used in determining each LYS2⁺ fraction, calculated as the ratio of LEU2⁺, LYS2⁺, URA3⁺ (and Can^R, where applicable) CFUs over LEU2⁺, URA3⁺ CFUs obtained on SC–Leu–Ura plates. Typically, each Cas9 PID variant fusion protein replicate was tested in parallel with a corresponding native Cas9 PID variant control replicate performed on the same day. The same procedures were followed for editing of paired-protospacer reporter strains with dual sgRNAs except that recipient strains harbored pre-assembled native Cas9^Wt or Cas9^Wt−2xN episomes instead of the sgRNA episome; competent cells were therefore prepared in SC–Leu rather than SC–Ura media and were transformed with circular dual sgRNA plasmid DNA (2.5 µg) instead of linear fragments.

For donor-dependent editing of haploid yeast, all recipient strains harbored pre-assembled native Cas9^Wt/NG or Cas9^Wt/NG-2xN plasmids. These strains were transformed with 75 ng of the targeting sgRNA plasmid (pGG197) alone or co-transformed with the targeting or non-targeting (pGG202) sgRNA plasmids (75 ng) and 120 bp commercially synthesized double-stranded DNA donor fragment (500 ng). Transformants were resuspended in 700 µl CaCl₂ prior to initial platings, performed by spreading aliquots (154 µl per full plate) from the CaCl₂ suspension on SC–Leu–Ura media with or without β-E at 15 nM or 1000 nM. These plates were grown for 44 h at 30 °C before scrape-homogenizing for serial dilution and replating to determine their LYS2⁺ fractions with SC–Arg–Leu–Lys–Ura plates that contained β-E (1000 nM). Replates were incubated 38.5 h at 30 °C before immediately photographing and quantifying CFUs to determine LYS2⁺ fractions as described above, except only 12 h were allowed at benchtop before recounting. In parallel with initial platings, aliquots were withdrawn from the remaining CaCl₂ suspensions for serial dilution and spotting on the same three solid media conditions. For strains with Cas9^NG or Cas9^NG-2xN, the same serial dilutions were also spotted on SC–Arg–Leu–Lys–Ura plates supplemented with β-E at 15 nM or 1000 nM to quantify LYS2⁺ fractions obtainable immediately after transformation.

Plate reader growth assays

To obtain 1 mL SC–Leu–Ura liquid starter cultures for each biological replicate, single colonies isolated by streaking on a single plate were vigorously inoculated and cultured in parallel on the same day. These were grown with continuous shaking (800 RPM) for 22 h at 30 °C in 96-well V-bottom assay blocks (Corning, 2 mL capacity) that were pre-autoclaved (dry) with a 4 mm glass bead in each well. Starter cultures were then subcultured 1:100 into 96-well SensoPlate microplates with a clear, flat bottom (Greiner Bio-One) containing 200 µl of SC–Leu–Ura media in each well, and grown for 32 h at 30 °C with their lids on and continuous shaking in a Cytation 5 microplate reader (BioTek). High-resolution growth curves were generated from Attenuance (D_{600 nm}) measurements taken every 10 min. For Dox induction experiments, microplates were briefly withdrawn after 5 h of growth and cultures were supplemented with Dox (10 µM final concentration) or solvent alone by pipetting directly into the wells, then returned to the Cytation 5 for an additional 27 h of growth. Lids were kept warm in a 30 °C incubator during the withdrawal step. For β-E induction experiments, media containing β-E was supplemented at 15 nM or 1000 nM before aliquoting into microplate wells and was thus present throughout the 32 h.

Analysis of high-resolution growth curves from plate reader assays

All growth curves were analyzed using custom python scripts. For each curve, D_{600 nm} measurements were corrected for background by subtracting their minimum measurements from each value, and then doubling times (${t}_{d}$) were calculated by fitting a logistic equation to each curve¹⁰⁰:

$$\,{N}_{t}=\frac{K}{1+\left(\frac{K-{N}_{0}}{{N}_{0}}\right){e}^{-{rt}}}$$

(1)

$${N}_{t}={{\rm{Attenuance}}}\left({D}_{600{nm}}\right){at\; time\;t}$$

$${t}_{d}=\frac{{\mathrm{ln}}\;2\,}{r}$$

(2)

To calculate time-resolved differences in growth, D_{600 nm} measurements from +Dox cultures were subtracted from –Dox cultures at each time point. Significance was calculated at each time point by comparing between pooled control samples (n = 18) and tetO2 + , Cas9^NG-scRevTetR (V16) samples (n = 6). Before calculating significance, the median differences calculated for each pair of cultures over their first 5 h (during which neither culture contains Dox) were assumed due to background variation and subtracted from the differences at each timepoint. Difference plots were generated using the python package matplotlib¹⁰¹. Curve fitting and statistical tests were performed in python with SciPy¹⁰².

Quantitative editing assays with diploid reporter yeast

Editing of our diploid SSA reporter strain was initiated by co-delivering native Cas9 or Cas9-scMCP backbones with Wt or MT3 PID inserts as described above for transformation-associated editing of haploid SSA reporter strains with a single sgRNA. However, transformants were not plated immediately after the 10 min incubation in CaCl₂; rather, half the resuspended mixture (200 µl) was inoculated into 20 mL SC–Leu–Ura broth to provide selection for cells which maintain both Cas9 (assembled in vivo) and sgRNA episomes, as well as time for steady-state segregation of the newly assembled Cas9 episomes. These 20 mL cultures were grown for 24 h at 30 °C with rotation on a roller drum and then subcultured 1:20 in a fresh 20 mL volume of the same media, either lacking β-E or supplemented with β-E at 15 nM or 1000 nM. Subcultures were grown at 30 °C with rotation on a roller drum until D_{600 nm} ~ 1.0 (12–17 h), pelleted by centrifugation at 3000 × g for 3 min, washed once with resuspension in an equal volume of sterile ddH₂O, and then re-pelleted and frozen at −20 °C for downstream gDNA isolation by bead beating and phenol-chloroform extraction (25:24:1 phenol:chloroform:isoamyl alcohol). Bead beating was carried out at 4 °C on a FastPrep-24 5 G machine from MP Biomedicals using their 2 mL tubes pre-filled with Lysing Matrix Y; eight cycles of 15-second bead beating was performed with 30 s pauses between each cycle. For the phenol-chloroform extraction, treatment with RNAse A (Thermo Scientific) at a final concentration of 30 mg/mL was performed for 15 min at 37 °C before the final 70% ethanol precipitation step. In parallel, aliquots were withdrawn from the post−24 h transformant cultures to determine their LYS2⁺ fractions immediately prior to the 12–17 h subculture step. These aliquots were first pelleted by centrifugation at 2103 × g for 4 min, washed twice in an equal volume of sterile ddH₂O with resuspension and re-pelleting with the same centrifugation conditions, and then resuspended in a final volume of 1 mL ddH₂O before serially diluting and spotting on plates. Plates were incubated 36 h at 30 °C before immediately photographing and quantifying CFUs as described for haploid editing assays, except only 12 h were allowed at benchtop before recounting.

To analyze allele frequencies at LYS2 loci within the above gDNA preparations, targeted deep sequencing using an Illumina MiSeq (v2 reagent kit) was performed on amplicons that spanned a 230 bp region with the protospacer and allele-specific point mutation. These were amplified from unedited alleles in a single round of PCR (25 cycles) with GoTaq Green and unique combinations of primers oGG1246-oGG1254 (Supplementary Data 4) for multiplexed sequencing with dual indexing in a single run. Demultiplexed paired-end (230 × 2) reads were analyzed with IDT’s rhAmpSeq CRISPR Analysis Tool (CRISPAltRations 1.1.0) using a modified protocol to quantify allele frequencies rather than Cas9 editing per se. Briefly, the 230 bp target region at Allele 1 was provided to the tool as ‘Amplicon’ sequence, while the same region but with the Allele 2 point mutation was provided as ‘Donor’ sequence. In addition, the ‘Guide RNA’ sequence provided to the tool was a mock spacer that overlapped the point mutation position, which was thus the fourth bp upstream of the PAM. Using these parameters, the ‘Percent Unedited’ output corresponds to the percentage of reads containing the Allele 1 sequence, while the ‘Percent Perfect HDR’ output corresponds to the percentage of reads containing the Allele 2 sequence. These output values summed to ≥ 98.5% for each sample analyzed, confirming that the majority of data were considered in our allelic ratio calculations. Approximately 0.204–0.473 million merged reads were analyzed per sample.

Plating assay to measure colony sizes

To obtain liquid cultures for plating, 18 × 150 glass tubes containing 5 mL SC–Leu or SC–Leu-Ura broth were inoculated with single colonies and grown for 22 h on a roller drum at 30 °C. SC–Leu–Ura was used for strains with the sgRNA plasmid while SC–Leu was used for strains without. Once grown, cultures were diluted in sterile ddH₂O and then plated on solid media of the same type to obtain ~150–400 colonies per plate. Plates were photographed directly from above after 47 h growth at 30 °C and colonies were then analyzed as 2-dimensional objects using custom python scripts to quantify colony areas. This was achieved using the python package scikit-image: colonies were segmented by the Otsu thresholding algorithm and touching colonies were separated by the watershed method. Objects detected near plate edges were excluded to avoid measuring artifacts. Swarm plots were generated using the python package seaborn and t tests were performed with SciPy⁹⁹.

MNase-seq procedures and analyzes

Experiments employed a pZEVi-LYS2⁺ haploid strain (yGWG538 in Supplementary Data 2) harboring an intact LYS2⁺ ORF, the native Cas9^Wt plasmid (pGG541), and the single EGFP sgRNA plasmid (pGG197) which is non-targeting in this background. These features were chosen (1) to ensure standard mRNA quantifications performed in parallel (see qRT-PCR section below) by avoiding nonsense-mediated decay expected in targeting strains and (2) to emulate the plasmid conditions of our other experiments without interfering with growth. The following procedures were adapted from MNase-seq methods described previously^103,104. Liquid starter cultures grown for 24 h at 30 °C were obtained by inoculating 5 mL SC–Leu–Ura media from single colonies and then subcultured (1:70) into 500 mL flasks with 100 mL of the same media, either lacking β-E or supplemented at 15 nM and 1000 nM. These were grown at 30 °C with shaking (220 RPM) until D_{600 nm} ~ 1.0 ( ~ 7 h) and then crosslinked by adding formalin (1% final concentration) and incubating at 25 °C with gentle shaking (110 RPM) for 15 min. After adding 5 mL glycine (125 mM final concentration) to quench formalin for 5 min at 25 °C with gentle shaking (110 RPM), cultures were pelleted by centrifugation at 4 °C for 5 min at 3000 × g, washed twice with resuspension in equal volumes of ice-cold sterile ddH₂O, and then pelleted again in 1.5 mL tubes for snap-freezing in liquid nitrogen and storing at −80 °C. Pellets were thawed on ice and resuspended in spheroplasting buffer (1.2 M sorbitol, 100 mM potassium phosphate, 1 mM CaCl₂, 0.5 mM β-mercaptoethanol, pH 7.5) with 1 mg/mL of 100 T Zymolyase. Cells were spheroplasted at 37 °C for 30 min and spheroplasts were collected by centrifugation and washed twice with 1 mL of spheroplasting buffer. Spheroplasts were divided into four 1.5 mL tubes, pelleted by centrifugation, and resuspended in MNase digestion buffer (1 M Sorbitol, 50 mM NaCl, 10 mM Tris-HCL (pH 7.4), 5 mM MgCl₂, 1 mM CaCl₂, 0.5 mM spermidine, 0.075% NP-40, add 1 mM β-mercaptoethanol) with 20 U of MNase (Thermo Scientific Cat. EN0181). Chromatin was digested with MNase at 37 °C for 15, 30, 45, and 60 min and reactions were stopped by addition of EDTA to a final concentration of 30 mM. Cross-links were reversed by addition of SDS (to 0.5% final) and proteinase K (0.5 mg/ml final) and incubated at 37 °C for 1 h followed by 65 °C for 2 h. Digested DNA was extracted using 25:24:1 phenol:chloroform:isoamyl alcohol and precipitated with 70% isopropanol. To remove RNA, the mixture was then resuspended in TE buffer with RNAse A (6 Kunitz units) and incubated at 37 °C for 1 h. Finally, DNA was purified with the Zymo Clean and Concentrator kit. MNase digestions were assessed by running on an agarose gel with ethidium bromide staining to confirm mono-nucleosome fragments between 100-200 bp. A multiplexed Illumina sequencing library was then prepared from samples digested for 15 and 60 min through direct adapter ligation using the automated KAPA general DNA-seq library preparation kit. Paired-end (51 × 2) sequencing of this library on an Illumina NovaSeq 6000 with a SP100 reagent kit yielded approximately 40 million reads per sample after demultiplexing. Demultiplexed reads were first analyzed with Trimmomatic (v0.39) processing to remove sequencing adapters and then with FastQC (v0.11.4) to assess read quality. Processed reads were then aligned to a custom reference of chromosome II (chr.II) containing pZEVi-LYS2⁺, using the Burrows Wheeler aligner (BWA) mem algorithm (v0.7.7). Aligned reads were then used as input for nucleosome analysis using the DANPOS (v2) pipeline¹⁰⁵. Analyzes of samples digested for 15 and 60 min revealed similar trends but suggested incomplete digestion at 15 min; hence, only 60 min data are presented. We observed a total of 4,670 nucleosomes across chr.II with ~ 1% of these nucleosomes showing statistically significant changes in occupancy levels when comparing between 0 nM β-E and 15 nM or 1000 nM, of which about half were from the LYS2⁺ locus (Supplementary Data 1; p value < 0.01; FDR-corrected Poisson tests performed by DANPOS). Tukey’s tests for significant Log2FC differences across the LYS2⁺ ORF (Supplementary Fig. 7f) were performed using GraphPad Prism software.

qRT-PCR and Fold Change calculations for pZEVi-LYS2 induction

To purify total RNA for reverse transcription, 1.3 mL aliquots were withdrawn from MNase-seq cultures immediately prior to formalin addition, pelleted at 4 °C, washed once with resuspension in 1.3 mL ice-cold autoclaved ddH₂O, then re-pelleted at 4 °C for −80 °C storage until further processing. RNA was purified using the RNeasy Mini kit (Qiagen) and an in-solution DNase I (Qiagen) treatment prior to final cleanup. cDNA was synthesized using SuperScript IV Reverse Transcriptase (Invitrogen) according to the manufacturer’s recommendations for a 20 µl reaction with 1 µg total RNA and random hexamers (IDT). qPCRs were performed with gene-specific primers (each at 500 nM) from IDT and 2.5 µL of the diluted cDNA (1:5 dilution) in a 10 µL reaction with Fast SYBR Green Master Mix (Life Technologies) and a LightCycler® 480 Real-time PCR System (Roche) according to the manufacturers’ protocols. Primers used for the TAF10 housekeeping gene or LYS2 target gene are listed in Supplementary Data 4. Raw Ct values were calculated using the second derivative method. ΔCt values were calculated relative to the TAF10 levels in each sample; ΔΔCt values were calculated relative to the LYS2 levels in replicate 3 of the 0 nM samples. Ultimately, the plotted Fold Changes in LYS2 RNA levels were calculated as Fold Change = 2^−ΔΔCt.

ChIP-qPCR and ChIP-seq procedures

Experiments employed pZEVi yeast strains derived from yGWG385 (Supplementary Data 2) via co-transformation with various (d)Cas9/sgRNA plasmid combinations (Supplementary Data 1). yGWG385 contains the pZEVi-LYS2::boxB^1x(Upstr.) non-SSA target insertion used in donor-dependent editing assays; to prevent cleavage-dependent growth perturbations when inducing pZEVi in this strain background (Supplementary Figs. 6–7), we used a non-targeting sgRNA with Cas9^NG and Cas9^NG-2xN, or alternatively, a targeting sgRNA with non-cleaving dCas9^NG and dCas9^NG-2xN variants. ChIP procedures were adapted from existing protocols^{106,107,108,109} described for S. cerevisiae that typically recommend isolating chromatin from crosslinked cells via bead beating.

To obtain crosslinked cells for ChIP, we followed all MNase-seq procedures described above for culturing from single colonies, subculturing with or without β-E induction, formalin cross-linking, glycine quenching, pellet washing, and snap-freezing for storage at −80, except that ice-cold sterile PBS (1X) was used for the two washes in place of ddH₂O. Unless otherwise noted, all subsequent steps for ChIP were performed on ice or at 4 °C using pre-chilled tubes, solutions, and equipment. After thawing on ice, pellets were resuspended in 1.4 mL Lysis Buffer (50 mM HEPES-KOH pH 7.5, 150 mM NaCl, 1 mM EDTA pH 8.0, 1% (v/v) Triton X-100, 0.1% (w/v) sodium deoxycholate, 0.1% (w/v) SDS) and transferred to 2 mL screwcap tubes pre-filled with 0.5 mm yttria-stabilized zirconium oxide beads (Lysing Matrix Y from MP Biomedicals) for bead beating. Our Lysis Buffer is identical to the ‘FA lysis buffer’ described previously¹⁰⁷ but with the following protease inhibitors freshly added before use: yeast-optimized Protease Inhibitor Cocktail from Sigma (P8215) diluted 1:200, and dissolvable cOmplete Mini EDTA-free Protease Inhibitor Cocktail tablets from Roche. Bead beating was carried out at 4 °C on a FastPrep-24 5 G machine from MP Biomedicals with seven 30-second cycles set to 8 m/sec (QuickPrep adapter) and 3 min pauses on ice between each cycle. To separate chromatin lysates from the beads, a tube-in-tube setup was used to centrifuge lysates from screwcap tubes into 2 mL LoBind (Eppendorf) tubes below, after cutting off the LoBind tube caps. First, the capless 2 mL LoBind tube was inserted into the bottom of a 14 mL round-bottom falcon tube. Second, the screwcap tube was inverted and punctured once at the bottom with a disposable 25 G 5/8” needle. Third, the screwcap tube was placed upright on the LoBind tube within the falcon and centrifuged once at 200 × g for 2 min and again at 400 × g for 2 min; in some cases, a second 2 min spin at 400 × g was needed to finish collecting the lysate below. Lysates were then pipette mixed before transferring to fresh 2 mL LoBind tubes. A total of 630 µl lysate from each sample was next aliquoted for sonication in three 1.5 mL TPX plastic tubes from Diagenode (210 µl lysate per tube). Sonication was performed with a Diagenode Bioruptor Plus device set to ‘High’ for 21 consecutive cycles of 30 s ON and 30 s OFF and a Bioruptor Water cooler/pump unit set to 2 °C. Lysates were re-pooled from sonication tubes into single 1.5 mL LoBind (Eppendorf) tubes and centrifuged at 16,000 × g for 15 min to pellet residual cellular debris, and supernatants containing sonicated chromatin were then transferred to fresh 1.5 mL LoBind tubes for downstream immunoprecipitation (IP) with magnetic Dynabeads pre-conjugated to antibody. ChIP-grade anti-HA antibody from Abcam (ab9110) was covalently conjugated to M-270 Epoxy Dynabeads beforehand at a ratio of 5 µg per 100 mg beads, stored at 4 °C, and finally blocked at r.t. immediately before use, in each case by following manufacturer protocols. After blocking, Dynabeads were resuspended in Lysis Buffer (120 µl per 100 mg beads) and volumes containing 100 mg beads were mixed into each sonicated sample to carry out IPs with rotation at 4 °C for 20 h. Prior to mixing with Dynabeads, a 20ul aliquot of the ‘input’ chromatin was withdrawn from each sonicated sample and stored at 4 °C until the de-crosslinking step carried out alongside pull-down samples the next day.

To isolate DNA from input and pull-down samples, Dynabead-bound pull-downs were first washed at r.t. several times using a magnetic stand and the following buffers: twice with Lysis Buffer, once with High-salt Buffer, once with LiCl/DOC Buffer, and once with TE Buffer (10 mM Tris pH 8.0, 1 mM EDTA pH 8.0). Our High-salt Buffer is identical to Lysis Buffer (see above) except with 500 mM NaCl in place of 150 mM NaCl, and only the liquid Protease Inhibitor Cocktail from Sigma (P8215) was added before use. Our LiCl/DOC Buffer (10 mM Tris pH 8.0, 0.25 M LiCl, 1 mM EDTA pH 8.0, 0.5% (v/v) Nonidet P-40, 0.5% (w/v) sodium deoxycholate) is identical to the ‘ChIP wash buffer’ described previously¹⁰⁷ except it was supplemented with Protease Inhibitor Cocktail from Sigma (P8215) before use. Aliquots of Lysis/High-salt Buffers were equilibrated to r.t. before use. After resuspension in each buffer (700 µl per buffer), Dynabeads were incubated with rotation for 3-5 min at r.t. and then pulse-centrifuged before handling on the magnet. To elute pull-downs, Dynabeads were resuspended in 100 µl TES (50 mM Tris pH 8.0, 10 mM EDTA pH 8.0, 1% SDS) and incubated in a thermomixer for 1 h at 65 °C with light agitation (450 RPM); 95 µl of each eluate was then transferred to fresh LoBind tubes for 18 h de-crosslinking in a thermomixer at 65 °C with agitation (800 RPM). At the same step, input samples were equilibrated to r.t. and mixed with 75 µl TES for 18 h de-crosslinking alongside pull-down samples. De-crosslinked samples were subsequently mixed with 200 µl Buffer TE (to dilute SDS) and 5 µl RNase A/T1 cocktail (2 mg/mL stock) for a 1 h incubation at 37 °C to remove RNA, followed by 10 µl Proteinase K (20 mg/mL stock) for 2 h at 56 °C to remove protein. DNA was then purified using Qiagen MinElute columns. We essentially followed Qiagen’s standard protocol for PCR purification but mixed 10 µl sodium acetate into each sample before column loading, performed an extra PE wash, and re-loaded the first eluate ( ~ 20 µl) to perform a second elution. After purification, all input samples were analyzed with an Agilent TapeStation 4150 (D5000 kit) to identify the peak of each fragment length distribution achieved by sonication; peaks ranged from 191-561 bp across samples, with most samples peaking between 200-300 bp.

To quantify pull-down efficiencies by qPCR we used KAPA SYBR FAST (2X Master Mix, Universal kit) and a LightCycler® 480 Real-time PCR System (Roche) according to the manufacturers’ protocols, with gene-specific primers (each at 200 nM) from IDT and 1.0 µL of diluted input or pull-down DNA (1:8 dilution to conserve samples) as template in a 10 µL reaction. Primers used for the LYS2 and TAF10 amplicons are listed in Supplementary Data 4. Every biological replicate sample was assayed with three technical replicate reactions for each primer set. Raw Ct values were calculated for each technical replicate using the second derivative method, and the mean values from technical triplicates were used to calculate ΔCt values for each primer set by subtracting the mean Ct value of an input sample from the mean Ct value of its corresponding pull-down sample (pull-down – input). Ultimately, the % of input captured by IP was calculated for LYS10 and TAF10 as % = 100 × (2^−ΔCt).

For ChIP-seq, a subset of the input and pull-down samples tested by qPCR were also used to prepare libraries for paired-end (51 × 2) sequencing on a single lane of the NovaSeq X Plus (Illumina). All sequencing libraries were prepared using the ThruPLEX DNA-seq kit that can be carried out with as little as 50 pg starting material. Library preparation and sequencing were performed in accordance with manufacturer protocols.

ChIP-seq analyzes

Raw demultiplexed paired-end reads from each sample were first evaluated for quality using FastQC and then trimmed of low-quality base pairs and adapter sequences from both 5’ and 3’ ends using the tools Trim Galore (Babraham Bioinformatics) and Trimmomatic. Next, the reads were aligned to their corresponding custom reference sequence using bowtie2 and the results were filtered with samtools to retain only properly paired reads with a minimum mapping quality of 30. The function bamCompare from the package deepTools was used to normalize for read depth (Counts Per Million) and produce a final output as the ratio of IP coverage over input coverage, in both bigwig and bedgraph format. Signals were then plotted across each genomic region of interest using IGV and final plots used for the figures were created with python (matplotlib). Our custom references contained the entire S. cerevisiae genome with the corresponding modified chromosomal sequences and episomal Cas9/sgRNA sequences in each strain. All plots represent coverage data that were quantified with a 10 bp binning scheme. Alternatively, to calculate mean coverage ratios across the 20 bp protospacer targeted by dCas9^NG and dCas9^NG-2xN (Supplementary Fig. 11, legend), coverage data were quantified at single-bp resolution.

Generation of the clonal U2OS.EGFP::5’ UTR-boxB ^2x and U2OS.EGFP::Intron- boxB ^2x(V1.0) cell lines

All cell lines generated in this study are derivatives of the U2OS.EGFP cell line¹⁸ obtained from Keith Joung’s Lab. U2OS.EGFP was derived from U2OS through low-MOI lentiviral transduction to introduce an expression cassette that includes EGFP-PEST; the expected sequence of its lentiviral insertion was provided by Toni Cathomen’s lab and published previously¹⁹. Using CRISPR-assisted knock-in, the U2OS.EGFP cell line was modified to contain two boxB HTs embedded within the 5’ UTR or an intron of its EGFP-PEST marker. Briefly, to generate the 5’ UTR insertion in U2OS.EGFP::5’ UTR-boxB^2x, 2 µg of Cas9 plasmid (pAG5), 666 ng of sgRNA plasmid (pGG771) and 20 pmol of a linear ssDNA repair template (oGG1238) with two phosphothiorate linkages on each end were mixed with 82 µL Nucleofector Solution V and 18 µL of Supplement Solution 1 according to manufacturer’s recommendations for the Cell Line Nucleofector Kit V (Lonza). 5 × 10⁵ U2OS.EGFP parent cells were resuspended in the above mixture and nucleofected with a Lonza Nucleofector 2b Device (Kit V, Program X-001). The cells were immediately transferred to a 6-well plate containing 2 mL of medium and incubated at standard conditions (37 °C and 5% CO₂). Five days post-nucleofection, single cells were sorted based on forward scatter area and width parameters into separate wells of 96-well plates using a BD FACSAria IIu flow cytometer. Wells that showed outgrowth within 7 days post-sorting were genotyped with primers P40 and P41 (Supplementary Data 4) for PCR-based screening of clones carrying the desired knock-in. Out of 26 clones screened, 7 contained an amplicon detectably longer than the parental genotype. These 7 clones were further verified by Sanger sequencing and 6 of 7 were found to contain a perfect insertion, implying an overall efficiency of ~ 23% perfect insertions. A single clone among those 6 was chosen for downstream assays. To generate the intron insertion in U2OS.EGFP::Intron-boxB^2x(V1.0), transfections were performed using a modified Effectene reagent (Qiagen) protocol as previously described¹¹⁰ except that the DMEM used for cell culture was also supplemented at ~ 1% each of sodium pyruvate (100 mM stock), GLUTAMAX (100X), and non-essential amino acids (100X) from Gibco. Per transfection, 0.25 µg of Cas9 plasmid (pAG5), 0.1 µg of sgRNA plasmid (pGG443) and 5.5 pmol of a linear ssDNA repair template (oGG1239) was used in 100 µL of buffer EC with a DNA:Enhancer ratio of 1:8 (3.2 µL) and DNA:Effectene ratio of 1:15 (6 µL). The resulting transfection complexes were added to each well in a 6-well plate that was freshly seeded with 2 × 10⁵ U2OS.EGFP parent cells in 2 mL of media. Seven days post-transfection, single cells were sorted based on forward scatter area and width parameters into separate wells of 96-well plates using a BD FACSAria IIu flow cytometer. Wells that grew out within 7 days post-sorting were genotyped with primers P31 and P32 (Supplementary Data 4) for PCR-based screening of clones carrying the desired knock-in. Out of 91 clones screened, 18 contained an amplicon detectably longer than the parental genotype. Sanger sequencing of the 18 amplicons revealed only one sequence that matched the repair template sequence, except for two unexpected point mutations within the intron’s polypyrimidine tract and splice acceptor elements, and the corresponding cell line was isolated for downstream assays conducted in the same medium.

Human cell culture assays to measure Cas9-mediated indels

To generate indels in the U2OS.EGFP cell line or its boxB^2x-containing derivatives, Cas9^Wt-alt (pAG5) or Cas9-2xN^Wt-alt (pGG768) plasmids were co-delivered with sgRNA plasmids (Supplementary Data 3) via nucleofection as described previously for EGFP knockdown assays¹⁹ except that ptdTomato was excluded, media lacked Geneticin (G418 Sulfate), cells were harvested 48 h post-nucleofection for genomic DNA (gDNA) purification rather than flow cytometry, and all three biological replicates were performed on the same day. gDNA was purified from cells in 6-well plates using the DNeasy blood and tissue kit (Qiagen) according to the manufacturer’s protocol. For indel detection at the targets of sgRNAs 1 through 3, deep sequencing with an Illumina NextSeq 2000 (300-cycle Low Output P1 reagent kit) was performed on PCR amplicons spanning 297 bp of the protospacer region, and demultiplexed single-read (297 × 1) sequences were analyzed with IDT’s rhAmpSeq CRISPR Analysis Tool (CRISPAltRations 1.1.0) using the default parameters for amplicon analyzes. To employ this tool for our single-read data, it was necessary to first generate mock R2 read pair ‘.fastq’ files from the R1 files—with identical but reverse-complemented sequences; then, generate sub-sampled files for downstream analyzes—each with 0.100 million random pairs of reads. Both file processing steps were performed with the Seqtk script for python. Ultimately, > 0.098 million merged reads were analyzed per sample. For indel detection at the sgRNA 4 target, deep sequencing with an Illumina MiSeq (600-cycle v3 reagent kit) was performed on PCR amplicons spanning 297 bp of the protospacer region, and demultiplexed paired-end (297 × 2) reads were analyzed with IDT’s rhAmpSeq CRISPR Analysis Tool (CRISPAltRations 1.1.0) using the default parameters for amplicon analyzes. Approximately 0.690-1.159 million merged reads were analyzed per sample without Seqtk pre-processing. Sequences of the amplified regions provided to the tool as “Amplicons” are included in Supplementary Data 6 along with raw results from the analyzes. PCR amplicons were generated through a single round of GoTaq Green PCR (30 cycles) using unique combinations of primers listed in Supplementary Data 4 for dual-indexed sequencing runs. To facilitate this PCR reaction, the first 10 cycles annealed at 55 °C and the last 20 cycles annealed at 65 °C. Amplified DNA was size-selected ( ~ 300-500 bp) by gel purification prior to sequencing, accounting for the total amplicon length expected with Illumina adapters (433 bp for unedited genotypes).

Analysis of cellular fluorescence by flow cytometry

To analyze the mCherry−2xN yeast strains for fluorescence, 1 mL starter cultures were first obtained for each biological replicate as described above in plate reader growth assays, except SC–Ura broth was used. Once grown, 33.3 µl aliquots were withdrawn from starter cultures and analyzed on a Sony SA3800 spectral analyzer after dilution into 500 µl aliquots of PBS + 0.5% FBS on ice. Mean mCherry values were measured in arbitrary fluorescence units from 30,000 events per culture according to the manufacturer’s suggested settings for mCherry fluorescence, after gating on a majority population in the expected size range. The gating strategy used for analysis in the SA3800 software is shown in Supplementary Fig. 15. To compare EGFP fluorescence levels in the human U2OS.EGFP line and its U2OS.EGFP::5’ UTR-boxB^2x derivative, cells were cultured in 6-well plates with Geneticin-free media as described for indel experiments post-nucleofection. Three wells for each line (biological replicates) were seeded in parallel on the same day with 5 × 10⁵ cells per well. Following 48 h of growth, cells were harvested and analyzed on a Sony SH800 sorter after resuspension in 500 µl aliquots of PBS + 0.5% FBS on ice. Mean EGFP values were measured in arbitrary fluorescence units according to the manufacturer’s suggested settings, either for ‘all events’ (20,000 per sample) or for ‘singlet events’ (at least 19,000 per sample) enriched by gating. The gating strategy used for analysis in the SH800 software is shown in Supplementary Fig. 16.

Statistics & reproducibility

GraphPad Prism, matplotlib (python), DANPOS v2 (python), IGV, or Microsoft Excel were used to generate all quantitative plots in this work before manually compiling and annotating them in Adobe Illustrator. Except where noted otherwise, p values were calculated with GraphPad Prism or python (SciPy) software using unpaired, two-tailed t tests with Welch’s correction, Gaussian distributions assumed, and no correction for multiple comparisons. All assays were performed with at least three biological replicates except where noted otherwise, and biological replicates were performed on separate days unless noted otherwise. No statistical method was used to predetermine experimental sample sizes. No samples were excluded from data analyzes.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

Relevant data supporting the findings of this study are available in the published article, its supplementary files, and the Sequence Read Archive of the NCBI (BioProject accession codes: PRJNA1149169 and PRJNA1149094). Source data are provided with this paper.

References

Doudna, J. A. & Charpentier, E. Genome editing. The new frontier of genome engineering with CRISPR-Cas9. Science 346, 1258096 (2014).
Article PubMed Google Scholar
Adli, M. The CRISPR tool kit for genome editing and beyond. Nat. Commun. 9, 1911 (2018).
Article PubMed PubMed Central ADS Google Scholar
Makarova, K. S., Wolf, Y. I. & Koonin, E. V. Classification and nomenclature of CRISPR-cas systems: where from here? CRISPR J. 1, 325–336 (2018).
Article PubMed PubMed Central Google Scholar
Kantor, A., McClements, M. E. & MacLaren, R. E. CRISPR-Cas9 DNA base-editing and prime-editing. Int J. Mol. Sci. 21, 6240 (2020).
Article CAS PubMed PubMed Central Google Scholar
Scholefield, J. & Harrison, P. T. Prime editing - an update on the field. Gene Ther. 28, 396–401 (2021).
Article CAS PubMed PubMed Central Google Scholar
Garneau, J. E. et al. The CRISPR/Cas bacterial immune system cleaves bacteriophage and plasmid DNA. Nature 468, 67–71 (2010).
Article CAS PubMed ADS Google Scholar
Gasiunas, G., Barrangou, R., Horvath, P. & Siksnys, V. Cas9-crRNA ribonucleoprotein complex mediates specific DNA cleavage for adaptive immunity in bacteria. Proc. Natl Acad. Sci. USA. 109, E2579–E2586 (2012).
Article CAS PubMed PubMed Central ADS Google Scholar
Jinek, M. et al. A programmable dual-RNA-guided DNA endonuclease in adaptive bacterial immunity. Science 337, 816–821 (2012).
Article CAS PubMed PubMed Central ADS Google Scholar
Sternberg, S. H., Redding, S., Jinek, M., Greene, E. C. & Doudna, J. A. DNA interrogation by the CRISPR RNA-guided endonuclease Cas9. Nature 507, 62–67 (2014).
Article CAS PubMed PubMed Central ADS Google Scholar
Deveau, H. et al. Phage response to CRISPR-encoded resistance in Streptococcus thermophilus. J. Bacteriol. 190, 1390–1400 (2008).
Article CAS PubMed Google Scholar
Mojica, F. J., Diez-Villasenor, C., Garcia-Martinez, J. & Almendros, C. Short motif sequences determine the targets of the prokaryotic CRISPR defence system. Microbiology 155, 733–740 (2009).
Article CAS PubMed Google Scholar
Jiang, W., Bikard, D., Cox, D., Zhang, F. & Marraffini, L. A. RNA-guided editing of bacterial genomes using CRISPR-Cas systems. Nat. Biotechnol. 31, 233–239 (2013).
Article CAS PubMed PubMed Central Google Scholar
Sternberg, S. H., LaFrance, B., Kaplan, M. & Doudna, J. A. Conformational control of DNA target cleavage by CRISPR-Cas9. Nature 527, 110–113 (2015).
Article CAS PubMed PubMed Central ADS Google Scholar
Dagdas, Y. S., Chen, J. S., Sternberg, S. H., Doudna, J. A. & Yildiz, A. A conformational checkpoint between DNA binding and cleavage by CRISPR-Cas9. Sci. Adv. 3, eaao0027 (2017).
Article PubMed PubMed Central Google Scholar
Boyle, E. A. et al. Quantification of Cas9 binding and cleavage across diverse guide sequences maps landscapes of target engagement. Sci. Adv. 7, eabe5496 (2021).
Article CAS PubMed PubMed Central ADS Google Scholar
Pacesa, M. et al. Structural basis for Cas9 off-target activity. Cell 185, 4067–4081.e4021 (2022).
Article CAS PubMed PubMed Central Google Scholar
Pacesa, M. et al. R-loop formation and conformational activation mechanisms of Cas9. Nature 609, 191–196 (2022).
Article CAS PubMed PubMed Central ADS Google Scholar
Kleinstiver, B. P. et al. Engineered CRISPR-Cas9 nucleases with altered PAM specificities. Nature 523, 481–485 (2015).
Article PubMed PubMed Central ADS Google Scholar
Goldberg, G. W. et al. Engineered dual selection for directed evolution of SpCas9 PAM specificity. Nat. Commun. 12, 349 (2021).
Article CAS PubMed PubMed Central Google Scholar
Dow, L. E. et al. Inducible in vivo genome editing with CRISPR-Cas9. Nat. Biotechnol. 33, 390–394 (2015).
Article CAS PubMed PubMed Central Google Scholar
Zetsche, B., Volz, S. E. & Zhang, F. A split-Cas9 architecture for inducible genome editing and transcription modulation. Nat. Biotechnol. 33, 139–142 (2015).
Article CAS PubMed Google Scholar
Liu, K. I. et al. A chemical-inducible CRISPR-Cas9 system for rapid control of genome editing. Nat. Chem. Biol. 12, 980–987 (2016).
Article CAS PubMed Google Scholar
Oakes, B. L. et al. Profiling of engineering hotspots identifies an allosteric CRISPR-Cas9 switch. Nat. Biotechnol. 34, 646–651 (2016).
Article CAS PubMed PubMed Central Google Scholar
Kundert, K. et al. Controlling CRISPR-Cas9 with ligand-activated and ligand-deactivated sgRNAs. Nat. Commun. 10, 2127 (2019).
Article PubMed PubMed Central ADS Google Scholar
Hemphill, J., Borchardt, E. K., Brown, K., Asokan, A. & Deiters, A. Optical Control of CRISPR/Cas9 Gene Editing. J. Am. Chem. Soc. 137, 5642–5645 (2015).
Article CAS PubMed PubMed Central Google Scholar
Nihongaki, Y., Kawano, F., Nakajima, T. & Sato, M. Photoactivatable CRISPR-Cas9 for optogenetic genome editing. Nat. Biotechnol. 33, 755–760 (2015).
Article CAS PubMed Google Scholar
Jain, P. K. et al. Development of light-activated CRISPR using guide RNAs with photocleavable protectors. Angew. Chem. Int Ed. Engl. 55, 12440–12444 (2016).
Article CAS PubMed PubMed Central Google Scholar
Liu, Y. et al. Very fast CRISPR on demand. Science 368, 1265–1269 (2020).
Article CAS PubMed PubMed Central ADS Google Scholar
Oakes, B. L. et al. CRISPR-Cas9 circular permutants as programmable scaffolds for genome modification. Cell 176, 254–267.e216 (2019).
Article CAS PubMed PubMed Central Google Scholar
Siu, K. H. & Chen, W. Riboregulated toehold-gated gRNA for programmable CRISPR-Cas9 function. Nat. Chem. Biol. 15, 217–220 (2019).
Article CAS PubMed Google Scholar
Hochrein, L. M., Li, H. & Pierce, N. A. High-performance allosteric conditional guide RNAs for mammalian cell-selective regulation of CRISPR/Cas. ACS Synth. Biol. 10, 964–971 (2021).
Article CAS PubMed Google Scholar
Jiao, C. et al. RNA recording in single bacterial cells using reprogrammed tracrRNAs. Nat. Biotechnol. 41, 1107–1116 (2023).
Article CAS PubMed PubMed Central Google Scholar
Samai, P. et al. Co-transcriptional DNA and RNA Cleavage during Type III CRISPR-Cas Immunity. Cell 161, 1164–1174 (2015).
Article CAS PubMed PubMed Central Google Scholar
Estrella, M. A., Kuo, F. T. & Bailey, S. RNA-activated DNA cleavage by the Type III-B CRISPR-Cas effector complex. Genes Dev. 30, 460–470 (2016).
Article CAS PubMed PubMed Central Google Scholar
Kazlauskiene, M., Tamulaitis, G., Kostiuk, G., Venclovas, C. & Siksnys, V. Spatiotemporal control of type III-A CRISPR-cas immunity: coupling DNA degradation with the target RNA recognition. Mol. Cell 62, 295–306 (2016).
Article CAS PubMed Google Scholar
Goldberg, G. W., Jiang, W., Bikard, D. & Marraffini, L. A. Conditional tolerance of temperate phages via transcription-dependent CRISPR-Cas targeting. Nature 514, 633–637 (2014).
Article CAS PubMed PubMed Central ADS Google Scholar
Jiang, W., Samai, P. & Marraffini, L. A. Degradation of phage transcripts by CRISPR-associated RNAses enables type III CRISPR-cas immunity. Cell 164, 710–721 (2016).
Article CAS PubMed PubMed Central Google Scholar
Goldberg, G. W. et al. Incomplete prophage tolerance by type III-A CRISPR-Cas systems reduces the fitness of lysogenic hosts. Nat. Commun. 9, 61 (2018).
Article PubMed PubMed Central ADS Google Scholar
Rostol, J. T. & Marraffini, L. A. Non-specific degradation of transcripts promotes plasmid clearance during type III-A CRISPR-Cas immunity. Nat. Microbiol 4, 656–662 (2019).
Article PubMed PubMed Central Google Scholar
Sugawara, N., Ira, G. & Haber, J. E. DNA length dependence of the single-strand annealing pathway and the role of Saccharomyces cerevisiae RAD59 in double-strand break repair. Mol. Cell. Biol. 20, 5300–5309 (2000).
Article CAS PubMed PubMed Central Google Scholar
Bhargava, R., Onyango, D. O. & Stark, J. M. Regulation of single-strand annealing and its role in genome maintenance. Trends Genet 32, 566–575 (2016).
Article CAS PubMed PubMed Central Google Scholar
Bolukbasi, M. F. et al. DNA-binding-domain fusions enhance the targeting range and precision of Cas9. Nat. Methods 12, 1150–1156 (2015).
Article CAS PubMed PubMed Central Google Scholar
Anders, C., Niewoehner, O., Duerst, A. & Jinek, M. Structural basis of PAM-dependent target DNA recognition by the Cas9 endonuclease. Nature 513, 569–573 (2014).
Article CAS PubMed PubMed Central ADS Google Scholar
Ichikawa, D. M. et al. A universal deep-learning model for zinc finger design enables transcription factor reprogramming. Nat. Biotechnol. 41, 1117–1129 (2023).
Article CAS PubMed PubMed Central Google Scholar
Das, A. T., Tenenbaum, L. & Berkhout, B. Tet-on systems for doxycycline-inducible gene expression. Curr. Gene Ther. 16, 156–167 (2016).
Article CAS PubMed PubMed Central Google Scholar
Krueger, C., Berens, C., Schmidt, A., Schnappinger, D. & Hillen, W. Single-chain tet transregulators. Nucleic Acids Res 31, 3050–3056 (2003).
Article CAS PubMed PubMed Central Google Scholar
Nishimasu, H. et al. Engineered CRISPR-Cas9 nuclease with expanded targeting space. Science 361, 1259–1262 (2018).
Article CAS PubMed PubMed Central ADS Google Scholar
Lowary, P. T. & Uhlenbeck, O. C. An RNA mutation that increases the affinity of an RNA-protein interaction. Nucleic Acids Res 15, 10483–10493 (1987).
Article CAS PubMed PubMed Central Google Scholar
Valegard, K. et al. The three-dimensional structures of two complexes between recombinant MS2 capsids and RNA operator fragments reveal sequence-specific protein-RNA interactions. J. Mol. Biol. 270, 724–738 (1997).
Article CAS PubMed Google Scholar
Tutucci, E. et al. An improved MS2 system for accurate reporting of the mRNA life cycle. Nat. Methods 15, 81–89 (2018).
Article CAS PubMed Google Scholar
Darzacq, X. et al. In vivo dynamics of RNA polymerase II transcription. Nat. Struct. Mol. Biol. 14, 796–806 (2007).
Article CAS PubMed PubMed Central Google Scholar
Hocine, S., Raymond, P., Zenklusen, D., Chao, J. A. & Singer, R. H. Single-molecule analysis of gene expression using two-color RNA labeling in live yeast. Nat. Methods 10, 119–121 (2013).
Article CAS PubMed Google Scholar
Valegard, K., Murray, J. B., Stockley, P. G., Stonehouse, N. J. & Liljas, L. Crystal structure of an RNA bacteriophage coat protein-operator complex. Nature 371, 623–626 (1994).
Article CAS PubMed ADS Google Scholar
Ni, C. Z. et al. Crystal structure of the MS2 coat protein dimer: implications for RNA binding and virus assembly. Structure 3, 255–263 (1995).
Article CAS PubMed Google Scholar
Weisberg, R. A. & Gottesman, M. E. Processive antitermination. J. Bacteriol. 181, 359–367 (1999).
Article CAS PubMed PubMed Central Google Scholar
Lange, S. et al. Simultaneous transport of different localized mRNA species revealed by live-cell imaging. Traffic 9, 1256–1267 (2008).
Article CAS PubMed Google Scholar
Wilson, H. R. et al. The global regulator RNase III modulates translation repression by the transcription elongation factor N. EMBO J. 21, 4154–4161 (2002).
Article CAS PubMed PubMed Central Google Scholar
Wilson, H. R., Zhou, J. G., Yu, D. & Court, D. L. Translation repression by an RNA polymerase elongation complex. Mol. Microbiol. 53, 821–828 (2004).
Article CAS PubMed Google Scholar
Conant, C. R., Goodarzi, J. P., Weitzel, S. E. & von Hippel, P. H. The antitermination activity of bacteriophage lambda N protein is controlled by the kinetics of an RNA-looping-facilitated interaction with the transcription complex. J. Mol. Biol. 384, 87–108 (2008).
Article CAS PubMed PubMed Central Google Scholar
Said, N. et al. Structural basis for lambdaN-dependent processive transcription antitermination. Nat. Microbiol 2, 17062 (2017).
Article CAS PubMed Google Scholar
Legault, P., Li, J., Mogridge, J., Kay, L. E. & Greenblatt, J. NMR structure of the bacteriophage lambda N peptide/boxB RNA complex: recognition of a GNRA fold by an arginine-rich motif. Cell 93, 289–299 (1998).
Article CAS PubMed Google Scholar
Mogridge, J. et al. Independent ligand-induced folding of the RNA-binding domain and two functionally distinct antitermination regions in the phage lambda N protein. Mol. Cell 1, 265–275 (1998).
Article CAS PubMed Google Scholar
Austin, R. J., Xia, T., Ren, J., Takahashi, T. T. & Roberts, R. W. Designed arginine-rich RNA-binding peptides with picomolar affinity. J. Am. Chem. Soc. 124, 10966–10967 (2002).
Article CAS PubMed Google Scholar
Whelan, W. L., Gocke, E. & Manney, T. R. The CAN1 locus of Saccharomyces cerevisiae: fine-structure analysis and forward mutation rates. Genetics 91, 35–51 (1979).
Article CAS PubMed PubMed Central Google Scholar
McIsaac, R. S. et al. Synthetic gene expression perturbation systems with rapid, tunable, single-gene specificity in yeast. Nucleic Acids Res 41, e57 (2013).
Article CAS PubMed Google Scholar
McIsaac, R. S., Gibney, P. A., Chandran, S. S., Benjamin, K. R. & Botstein, D. Synthetic biology tools for programming gene expression without nutritional perturbations in Saccharomyces cerevisiae. Nucleic Acids Res 42, e48 (2014).
Article CAS PubMed PubMed Central Google Scholar
Datta, A. & Jinks-Robertson, S. Association of increased spontaneous mutation rates with high levels of transcription in yeast. Science 268, 1616–1619 (1995).
Article CAS PubMed ADS Google Scholar
Pelechano, V., Wei, W. & Steinmetz, L. M. Extensive transcriptional heterogeneity revealed by isoform profiling. Nature 497, 127–131 (2013).
Article CAS PubMed PubMed Central ADS Google Scholar
DiCarlo, J. E. et al. Genome engineering in Saccharomyces cerevisiae using CRISPR-Cas systems. Nucleic Acids Res 41, 4336–4343 (2013).
Article CAS PubMed PubMed Central Google Scholar
Satomura, A. et al. Precise genome-wide base editing by the CRISPR Nickase system in yeast. Sci. Rep. 7, 2095 (2017).
Article PubMed PubMed Central ADS Google Scholar
Schwabish, M. A. & Struhl, K. Evidence for eviction and rapid deposition of histones upon transcriptional elongation by RNA polymerase II. Mol. Cell. Biol. 24, 10111–10117 (2004).
Article CAS PubMed PubMed Central Google Scholar
Hinz, J. M., Laughery, M. F. & Wyrick, J. J. Nucleosomes inhibit Cas9 endonuclease activity in vitro. Biochemistry 54, 7063–7066 (2015).
Article CAS PubMed Google Scholar
Clarke, R. et al. Enhanced bacterial immunity and mammalian genome editing via RNA-polymerase-mediated dislodging of cas9 from double-strand DNA breaks. Mol. Cell 71, 42–55.e48 (2018).
Article CAS PubMed PubMed Central Google Scholar
Johnston, R. J. Jr. et al. Interlocked feedforward loops control cell-type-specific Rhodopsin expression in the Drosophila eye. Cell 145, 956–968 (2011).
Article CAS PubMed PubMed Central Google Scholar
Xu, X., Wells, A. B., O’Brien, D. R., Nehorai, A. & Dougherty, J. D. Cell type-specific expression analysis to identify putative cellular mechanisms for neurogenetic disorders. J. Neurosci. 34, 1420–1431 (2014).
Article CAS PubMed PubMed Central Google Scholar
Monahan, K. & Lomvardas, S. Monoallelic expression of olfactory receptors. Annu. Rev. Cell. Dev. Biol. 31, 721–740 (2015).
Article CAS PubMed PubMed Central Google Scholar
Symmons, O. et al. Allele-specific RNA imaging shows that allelic imbalances can arise in tissues through transcriptional bursting. PLoS Genet 15, e1007874 (2019).
Article PubMed PubMed Central Google Scholar
Xu, P. et al. The landscape of human tissue and cell type specific expression and co-regulation of senescence genes. Mol. Neurodegener. 17, 5 (2022).
Article CAS PubMed PubMed Central Google Scholar
Babendure, J. R., Babendure, J. L., Ding, J. H. & Tsien, R. Y. Control of mammalian translation by mRNA structure near caps. RNA 12, 851–861 (2006).
Article CAS PubMed PubMed Central Google Scholar
Weenink, T., van der Hilst, J., McKiernan, R. M. & Ellis, T. Design of RNA hairpin modules that predictably tune translation in yeast. Synth. Biol. (Oxf.) 3, ysy019 (2018).
Article CAS PubMed Google Scholar
Kotovic, K. M., Lockshon, D., Boric, L. & Neugebauer, K. M. Cotranscriptional recruitment of the U1 snRNP to intron-containing genes in yeast. Mol. Cell. Biol. 23, 5768–5779 (2003).
Article CAS PubMed PubMed Central Google Scholar
Gornemann, J., Kotovic, K. M., Hujer, K. & Neugebauer, K. M. Cotranscriptional spliceosome assembly occurs in a stepwise fashion and requires the cap binding complex. Mol. Cell 19, 53–63 (2005).
Article PubMed Google Scholar
Lacadie, S. A. & Rosbash, M. Cotranscriptional spliceosome assembly dynamics and the role of U1 snRNA:5’ss base pairing in yeast. Mol. Cell 19, 65–75 (2005).
Article CAS PubMed Google Scholar
Montiel-Gonzalez, M. F., Vallecillo-Viejo, I., Yudowski, G. A. & Rosenthal, J. J. Correction of mutations within the cystic fibrosis transmembrane conductance regulator by site-directed RNA editing. Proc. Natl Acad. Sci. USA 110, 18285–18290 (2013).
Article CAS PubMed PubMed Central ADS Google Scholar
Montiel-Gonzalez, M. F., Vallecillo-Viejo, I. C. & Rosenthal, J. J. An efficient system for selectively altering genetic information within mRNAs. Nucleic Acids Res 44, e157 (2016).
PubMed PubMed Central Google Scholar
Abudayyeh, O. O. et al. RNA targeting with CRISPR-Cas13. Nature 550, 280–284 (2017).
Article PubMed PubMed Central ADS Google Scholar
Cox, D. B. T. et al. RNA editing with CRISPR-Cas13. Science 358, 1019–1027 (2017).
Article CAS PubMed PubMed Central ADS Google Scholar
Konermann, S. et al. Transcriptome engineering with RNA-targeting type VI-D CRISPR effectors. Cell 173, 665–676.e614 (2018).
Article CAS PubMed PubMed Central Google Scholar
Hinz, J. M., Laughery, M. F. & Wyrick, J. J. Nucleosomes selectively inhibit cas9 off-target activity at a site located at the nucleosome edge. J. Biol. Chem. 291, 24851–24856 (2016).
Article CAS PubMed PubMed Central Google Scholar
Yarrington, R. M., Verma, S., Schwartz, S., Trautman, J. K. & Carroll, D. Nucleosomes inhibit target cleavage by CRISPR-Cas9 in vivo. Proc. Natl Acad. Sci. USA 115, 9351–9358 (2018).
Article CAS PubMed PubMed Central ADS Google Scholar
Filipovski, M., Soffers, J. H. M., Vos, S. M. & Farnung, L. Structural basis of nucleosome retention during transcription elongation. Science 376, 1313–1316 (2022).
Article CAS PubMed PubMed Central ADS Google Scholar
Chen, F. et al. Targeted activation of diverse CRISPR-Cas systems for mammalian genome editing via proximal CRISPR targeting. Nat. Commun. 8, 14958 (2017).
Article CAS PubMed PubMed Central ADS Google Scholar
Zhang, Q. et al. The post-PAM interaction of RNA-guided spCas9 with DNA dictates its target binding and dissociation. Sci. Adv. 5, eaaw9807 (2019).
Article CAS PubMed PubMed Central ADS Google Scholar
Brachmann, C. B. et al. Designer deletion strains derived from Saccharomyces cerevisiae S288C: a useful set of strains and plasmids for PCR-mediated gene disruption and other applications. Yeast 14, 115–132 (1998).
Article CAS PubMed Google Scholar
Gietz, R. D. Yeast transformation by the LiAc/SS carrier DNA/PEG method. Methods Mol. Biol. 1205, 1–12 (2014).
Article CAS PubMed Google Scholar
Gibson, D. G. et al. Enzymatic assembly of DNA molecules up to several hundred kilobases. Nat. Methods 6, 343–345 (2009).
Article CAS PubMed Google Scholar
Xie, Z. X. et al. Rapid and efficient CRISPR/Cas9-based mating-type switching of saccharomyces cerevisiae. G3 (Bethesda) 8, 173–183 (2018).
Article PubMed Google Scholar
Agmon, N. et al. Yeast golden gate (yGG) for the efficient assembly of s. cerevisiae transcription units. ACS Synth. Biol. 4, 853–859 (2015).
Article CAS PubMed Google Scholar
Ikushima, S., Zhao, Y. & Boeke, J. D. Development of a tightly controlled off switch for saccharomyces cerevisiae regulated by camphor, a low-cost natural product. G3 (Bethesda) 5, 1983–1990 (2015).
Article PubMed PubMed Central Google Scholar
Sprouffske, K. & Wagner, A. Growthcurver: an R package for obtaining interpretable metrics from microbial growth curves. BMC Bioinforma. 17, 172 (2016).
Article Google Scholar
Hunter, J. D. Matplotlib: a 2D graphics environment. Comput. Sci. Eng. 9, 90–95 (2007).
Article Google Scholar
Virtanen, P. et al. SciPy 1.0: fundamental algorithms for scientific computing in Python. Nat. Methods 17, 261–272 (2020).
Article CAS PubMed PubMed Central Google Scholar
Kubik, S. et al. Nucleosome stability distinguishes two different promoter types at all protein-coding genes in yeast. Mol. Cell 60, 422–434 (2015).
Article CAS PubMed Google Scholar
Truong, D. M. & Boeke, J. D. Resetting the yeast epigenome with human nucleosomes. Cell 171, 1508–1519.e1513 (2017).
Article CAS PubMed PubMed Central Google Scholar
Chen, K. et al. DANPOS: dynamic analysis of nucleosome position and occupancy by sequencing. Genome Res 23, 341–351 (2013).
Article CAS PubMed PubMed Central Google Scholar
Hecht, A. & Grunstein, M. Mapping DNA interaction sites of chromosomal proteins using immunoprecipitation and polymerase chain reaction. Methods Enzymol. 304, 399–414 (1999).
Article CAS PubMed Google Scholar
Aparicio, O., Geisberg, J. V. & Struhl, K. Chromatin immunoprecipitation for determining the association of proteins with specific genomic sequences in vivo. Curr. Protoc. Cell Biol. Ch-17, Unit 17 (2004).
Ezhkova, E. & Tansey, W. P. In Yeast Protocol (ed Wei X.) 225-244 (Humana Press, 2006).
de Jonge, W. J., Brok, M., Kemmeren, P. & Holstege, F. C. P. An optimized chromatin immunoprecipitation protocol for quantification of protein-DNA interactions. STAR Protoc. 1, 100020 (2020).
Article PubMed PubMed Central Google Scholar
Kogenaru, M. & Isalan, M. Drug-inducible control of lethality genes: a low background destabilizing domain architecture applied to the Gal4-UAS system in drosophila. ACS Synth. Biol. 7, 1496–1506 (2018).
Article CAS PubMed PubMed Central Google Scholar
Gu, X. et al. SAMTOR is an S-adenosylmethionine sensor for the mTORC1 pathway. Science 358, 813–818 (2017).
Article CAS PubMed PubMed Central ADS Google Scholar
Faedo, A. et al. Differentiation of human telencephalic progenitor cells into MSNs by inducible expression of Gsx2 and Ebf1. Proc. Natl Acad. Sci. USA 114, E1234–E1242 (2017).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We thank former Noyes lab members David Ichikawa and Jeff Spencer, as well as Ran Brosh and other current or former members of the Boeke and Noyes labs, for sharing helpful discussions and resources. Paolo Mita provided extensive guidance on ChIP procedures and the pPM193 plasmid template from the Boeke lab for amplifying the Wt TetR PCR fragment of scTetR. Raquel Ordonez (Maurano lab) and Mariya London (Mucida lab alumnus) provided additional advice on ChIP procedures. Mike Shen (Boeke lab alumnus) designed the pMS101 plasmid in collaboration with the Noyes lab. Makiha Fukuda of the Boeke lab provided TAF10 primers for q(RT-)PCR. We also thank Addgene for providing plasmid templates to amplify a recoded TetR PCR fragment and an isogenic RevTetR PCR fragment: the pCW57.1-MAT2A plasmid¹¹¹ (#100521; RRID:Addgene_100521) and the pCAG-TetON-3G plasmid¹¹² (#96963; RRID:Addgene_96963), respectively, gifts from David Sabatini’s lab and Elena Cattaneo’s lab. The BPK1520 plasmid¹⁸ acquired through Addgene (#65777; RRID:Addgene_65777), as well as the U2OS.EGFP cell line¹⁸, were gifts from Keith Joung’s lab. We gratefully acknowledge NYU Medical Center’s Research Support Service team for preparation of various reagents and yeast media used in this work, the Genome Technology Center (RRID: SCR_017929) for low-complexity amplicon sequencing on MiSeq/NextSeq 2000 platforms, as well as library preparation and sequencing for MNase-seq (NovaSeq 6000) and ChIP-seq (NovaSeq X Plus), and the IDT rhAmpSeq CRISPR Analysis Tool team for their services and tech support. We are also grateful for thoughtful discussions with Timothee Lionnet and other members of his lab, as well as Rachel Leicher from Shixin Liu’s lab. G.W.G. was supported by an NRSA Postdoctoral Fellowship (F32GM137482) and an NIH K99 Pathway to Independence Award (K99GM147604), both from the NIGMS, as well as a Centers of Excellence in Genomic Science award (RM1HG009491) to J.D.B. from the NHGRI. Additional funding support was provided from NIGMS awards R01GM118851 and R01GM133936 to M.B.N.

Author information

Authors and Affiliations

Institute for Systems Genetics and Department of Biochemistry and Molecular Pharmacology, NYU Langone Health, New York, NY, USA
Gregory W. Goldberg, Manjunatha Kogenaru, Sarah Keegan, Max A. B. Haase, Larisa Kagermazova, Kenenna Onyebeke, Samantha Adams, Daniel K. Beyer, David Fenyö, Marcus B. Noyes & Jef D. Boeke
Courant Institute of Mathematical Sciences, New York University, New York, NY, USA
Mauricio A. Arias
Department of Biomedical Engineering, NYU Tandon School of Engineering, Brooklyn, NY, USA
Jef D. Boeke

Authors

Gregory W. Goldberg
View author publications
Search author on:PubMed Google Scholar
Manjunatha Kogenaru
View author publications
Search author on:PubMed Google Scholar
Sarah Keegan
View author publications
Search author on:PubMed Google Scholar
Max A. B. Haase
View author publications
Search author on:PubMed Google Scholar
Larisa Kagermazova
View author publications
Search author on:PubMed Google Scholar
Mauricio A. Arias
View author publications
Search author on:PubMed Google Scholar
Kenenna Onyebeke
View author publications
Search author on:PubMed Google Scholar
Samantha Adams
View author publications
Search author on:PubMed Google Scholar
Daniel K. Beyer
View author publications
Search author on:PubMed Google Scholar
David Fenyö
View author publications
Search author on:PubMed Google Scholar
Marcus B. Noyes
View author publications
Search author on:PubMed Google Scholar
Jef D. Boeke
View author publications
Search author on:PubMed Google Scholar

Contributions

G.W.G. conceived the study. G.W.G. designed experiments with help from J.D.B., M.A.B.H., L.K., and M.K. G.W.G., D.K. B., M.A.B.H., L.K., M.K., and K.O. generated biological reagents and/or conducted experiments. S.K. analyzed high-resolution growth curve data, colony size data, and ChIP-seq data. M.A.B.H. analyzed MNase-seq data. M.A.A. ran Seqtk processing for single-read NextSeq data files. S.A., J.D.B., D.F., M.A.A., and M.B.N. contributed other resources and/or technical advice. J.D.B. and M.B.N. provided funding. G.W.G. wrote the paper with help from J.D.B., M.A.B.H., L.K., S.K., and M.K.

Corresponding authors

Correspondence to Gregory W. Goldberg, Marcus B. Noyes or Jef D. Boeke.

Ethics declarations

Competing interests

Jef Boeke is a Founder and Director of CDI Labs, Inc., a Founder of and consultant to Opentrons LabWorks/Neochromosome, Inc, and serves or served on the Scientific Advisory Board of the following: CZ Biohub New York, LLC, Logomix, Inc., Modern Meadow, Inc., Rome Therapeutics, Inc., SeaHub, Seattle, WA, Tessera Therapeutics, Inc. and the Wyss Institute. Marcus Noyes is a founder of TBG Therapeutics, Inc. New York University filed a provisional patent application (No. 63/582,731) for findings described in this work, with Gregory Goldberg, Marcus Noyes, and Jef Boeke listed as inventors. The other authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Chase Beisel and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Description of Additional Supplementary Files

Supplementary Data 1

Supplementary Data 2

Supplementary Data 3

Supplementary Data 4

Supplementary Data 5

Supplementary Data 6

Reporting Summary

Transparent Peer Review file

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Goldberg, G.W., Kogenaru, M., Keegan, S. et al. Engineered transcription-associated Cas9 targeting in eukaryotic cells. Nat Commun 15, 10287 (2024). https://doi.org/10.1038/s41467-024-54629-9

Download citation

Received: 09 August 2024
Accepted: 18 November 2024
Published: 27 November 2024
Version of record: 27 November 2024
DOI: https://doi.org/10.1038/s41467-024-54629-9