Divergence in a eukaryotic transcription factor’s co-TF dependence involves multiple intrinsically disordered regions

Snyder, Lindsey F.; O’Brien, Emily M.; Zhao, Jia; Liang, Jinye; Bruce, Baylee J.; Zhang, Yuning; Zhu, Wei; Cassier, Thomas H.; Schnicker, Nicholas J.; Zhou, Xu; Gordân, Raluca; He, Bin Z.

doi:10.1038/s41467-025-59244-w

Download PDF

Article
Open access
Published: 18 June 2025

Divergence in a eukaryotic transcription factor’s co-TF dependence involves multiple intrinsically disordered regions

Nature Communications volume 16, Article number: 5340 (2025) Cite this article

4187 Accesses
2 Citations
31 Altmetric
Metrics details

Subjects

Abstract

Combinatorial control by transcription factors (TFs) is central to eukaryotic gene regulation, yet its mechanism, evolution, and regulatory impact are not well understood. Here we use natural variation in the yeast phosphate starvation (PHO) response to examine the genetic basis and species variation in TF interdependence. In Saccharomyces cerevisiae, the main TF Pho4 relies on the co-TF Pho2 to regulate ~28 genes, whereas in the related pathogen Candida glabrata, Pho4 has reduced Pho2 dependence and regulates ~70 genes. We found C. glabrata Pho4 (CgPho4) binds the same motif with 3–4 fold higher affinity. Machine learning and yeast one-hybrid assay identify two intrinsically disordered regions (IDRs) in CgPho4 that boost its activation domain’s activity. In ScPho4, an IDR next to the DNA binding domain both allows for enhanced activity with Pho2 and inhibits activity without Pho2. This study reveals how IDR divergence drives TF interdependence evolution by influencing activation potential and autoinhibition.

Intrinsically disordered regions as facilitators of the transcription factor target search

Article 21 February 2025

Co-condensation between transcription factor and cBAF selectively modulates chromatin remodeling and gene expression

Article Open access 11 December 2025

Phosphorylation-dependent tuning of mRNA deadenylation rates

Article Open access 03 November 2025

Introduction

Transcription factors (TFs) are the cornerstone of gene regulatory networks. In eukaryotes, TFs often work collaboratively to regulate gene expression. This combinatorial control is crucial for enhancing specificity, because most eukaryotic TFs recognize short and degenerate motifs—typically less than 10 bps—that appear hundreds to tens of thousands of times in the genome¹. TFs with the same family of DNA binding domains (DBDs) also recognize highly similar motifs². Despite these, TFs often bind only a fraction of their motifs in vivo and regulate an even smaller subset of the genes they bind to³; paralogous TFs regulate distinct sets of genes^4,5. The key to achieving this specific regulation is the requirement of two or more TFs to jointly regulate the target genes^6,7,8. Another important function of combinatorial control and TF interdependence is to allow cells to integrate multiple upstream signals. In fly development, for example, a combination of tissue-specific “selector” and morphogen gradient—both are TFs—precisely determine the expression pattern and define the cell fate⁹.

The importance of combinatorial control and the molecular mechanisms behind TF interdependence are traditionally studied using mutations that disrupt TF interactions. For instance, a missense mutation that disrupts the interaction between two cardiac TFs, GATA4 and TBX5, was shown to cause dysregulation of cardiac genes and lead to malformation of heart tissues¹⁰. Similarly, studies of combinatorial control evolution often focus on losses (and gains) of TF interactions, like the MADS-box TF, Mcm1, and its co-TFs, MATa2 and MATα2 in the yeast mating type pathway^11,12. A far less explored type of change in the combinatorial control is a change in the TF interdependence itself, that is, when TFs involved in cooperative regulation evolved to be more or less dependent on the co-TF(s). Such changes are expected to dramatically rewire the network by altering the specificity and signals required for gene regulation, which could have profound implications in disease and evolution. In relation to this possibility, not all eukaryotic TFs require other TFs to function. The yeast TF, Gal4, regulates its target genes on its own¹³. This raises several intriguing and unanswered questions: if TF interdependence and combinatorial control evolve, what genetic changes underlie the divergence, what aspects of the TF activities are impacted, and what are the consequences on the regulatory output?

Intriguingly, natural variation in co-TF dependence exists in the yeast phosphate starvation (PHO) response network. In the model yeast Saccharomyces cerevisiae, the main TF of the PHO response, Pho4 (hereinafter as ScPho4), strongly depends on the co-TF, Pho2, to induce 27/28 of its target genes³. In a related human yeast pathogen, Candida glabrata, its Pho4 (hereinafter as CgPho4) is far less dependent on Pho2 and induces twice as many genes (Fig. 1A)^14,15. The level of Pho2-dependence varies quantitatively among Pho4 orthologs in other yeasts and is correlated with the number of genes induced in a common genome background¹⁵. This latter observation is consistent with the role of combinatorial control in enhancing specificity. What remains unknown is what genetic differences between Pho4 orthologs underlie the divergence in co-TF dependence, and what TF activities are impacted by those variations.

**Fig. 1: Different co-TF (Pho2) dependence between orthologous TF (Pho4) in yeast species.**

Here, we propose two non-mutually exclusive models for the difference in Pho2-dependence between ScPho4 and CgPho4 (Fig. 1B). The first “enhanced activity” model is based on studies showing that ScPho4 requires Pho2 to (1) bind cooperatively to the target gene promoters¹⁶, (2) recruit the histone acetyltransferase (HAT) complex¹⁷ and (3) help recruit general TFs and the PolII complex¹⁸. Under this model, we hypothesize that CgPho4 binds more tightly to DNA than ScPho4, and is more capable of recruiting general transcription factors (TFs) and the PolII complex, thereby becoming less Pho2-dependent. The second “autoinhibition” model is based on a study suggesting that ScPho4 is auto-inhibited, and that interaction with Pho2 unmasks its activation domain and allows it to function¹⁹. This model predicts that CgPho4 either lacks or has far weaker effects of the auto-inhibition and hence doesn’t depend on Pho2.

In this study, we tested these two models by comparing the DNA binding and activation abilities of the two Pho4 orthologs, and systematically swapping regions between the two Pho4s in a series of 50 chimeric TFs, then quantifying their activities with and without Pho2. Our results support both models as contributing to the Pho2-dependence variation. To our surprise, while CgPho4 DBD binds to the same consensus motif with a 3–4-fold higher affinity than ScPho4 DBD, swapping DBD alone failed to yield the expected results. Instead, the differences in Pho2-dependence originated primarily from differences in the Intrinsically Disordered Regions (IDRs) in the two TFs that modulate both their Activation Domain (AD) and DNA binding domain (DBD) activities. Therefore, our results reveal that evolution in a eukaryotic TF protein, particularly through changes in the IDR, can lead to divergence in co-TF dependence, which in turn results in a more than two-fold change in the size of the target network.

Results

Domain organization and sequence divergence between ScPho4 and CgPho4

Based on genetic and biochemical studies, ScPho4 encodes the following functional domains from its N- to its C-terminus (Fig. 1C): a regulatory region (R1, aa 1–42) interacting with the negative regulator Pho80²⁰, the activation domain (AD, aa 43–99)^21,22, a region encoding the nuclear export and import signals (NLS, aa 100–176)^23,24, a protein-interaction domain interacting with both Pho80 and the co-TF, Pho2 (referred together as P2ID, aa 177–242)^20,25, and the bHLH DNA binding domain (DBD, aa 243–312)^21,26. Regarding ScPho4-ScPho2 interaction, a previous study mapped the region in ScPho4 required for the interaction to aa 200–218²⁵. The same 18 aa stretch also contains the phosphorylation site that directly controls the TF-TF interaction²³.

Both ScPho4 and CgPho4 were predicted to be mostly intrinsically disordered outside the DBD, with some low confidence helices in R1 and AD (Supplementary Fig. 1). CgPho4 is significantly longer than ScPho4 (533 vs 312 amino acids, Fig. 1C). The NLS and P2ID regions contain four functionally important phosphorylation sites targeted by the cyclin-dependent kinase complex Pho80/85^24,27. Those located in the NLS control Pho4’s nuclear translocation, while phosphorylation of the one in P2ID disables ScPho4 from interacting with Pho2²³. All five Pho80/85 targeted phosphorylation sites are clearly identifiable in the alignment (Supplementary Fig. 2); in fact, they were previously found to be conserved in orthologs outside the Saccharomycotina subphylum²⁸. Thus, while the protein length and sequence diverged significantly between the two Pho4 orthologs, the domain architecture and post-translational modification motifs appear to be highly conserved in evolution.

CgPho4 binds the same consensus motif with a higher affinity compared to ScPho4

To test the enhanced activity model, we first compared the DNA binding ability of the two Pho4 orthologs. First, we asked if the two Pho4 orthologs recognize the same DNA sequence. The N-terminal stretch of the first α-helix in the bHLH domain, known as the basic region, contains the residues determining sequence specificity. All four residues known to recognize the nucleotide bases and five additional residues that recognize the phosphate backbone in ScPho4²⁶ are all conserved in CgPho4 except for an R252K change (number based on ScPho4, Fig. 2A). Next, we used our published Chromatin-Immunoprecipitation (ChIP-seq) data for both Pho4 orthologs to identify their respective motifs¹⁵ (Materials and Methods). Consistent with the amino acid sequence conservation, the results showed that they bind the same E-box motif “CACGTG”, with no obvious differences in the flanking nucleotides (Fig. 2B). These motifs were based on a relatively small number of ChIP peaks (74 for ScPho4 and 118 for CgPho4). To comprehensively characterize and compare the binding preferences of the two proteins, we applied Protein Binding Microarray (PBM) to map the binding landscape in the entire 7-mer space (universal PBM, Fig. 2C). The result revealed a lack of divergence in their sequence preference (Pearson’s r = 0.89). To complement the short oligo length in the universal PBM and further examine differences in the flanking base preference, we designed a second genome context library, which includes 36-bp sequences centered on ChIP-identified binding sites for both Pho4’s in their native genome along with the flanking nucleotides²⁹. Similar to the uPBM, the genome context PBM revealed no evidence for binding specificity differences between ScPho4 and CgPho4 (Supplementary Fig. 3, Spearman’s ρ = 0.84 and 0.86 for sequences containing the consensus E-box or non-consensus variants, respectively, Materials and Methods). We conclude that ScPho4 and CgPho4 have the same sequence specificity.

**Fig. 2: CgPho4 DBD recognizes the same motif as ScPho4 DBD and has a higher affinity towards the consensus DNA.**

Next, we tested the hypothesis that CgPho4 binds more tightly than ScPho4 does. To do so, we purified the DBD region of both Pho4s (Materials and Methods) and measured their binding affinity to a 17-bp oligo based on the S. cerevisiae PHO5 promoter with the consensus E-box motif “CACGTG”. Biolayer Interferometry (BLI) measurements showed that CgPho4 DBD binds the consensus motif >3 times more tightly than ScPho4 DBD does (K_D = 5.2 nM vs 1.2 nM for ScPho4 and CgPho4, respectively; Student’s t-test for log K_D difference P < 0.01, Fig. 2D, E). We confirmed this K_D difference using Electrophoretic Mobility Shift Assay (K_D = 5.5 nM vs 1.9 nM for ScPho4 and CgPho4, respectively; Supplementary Fig. 4). We conclude that CgPho4 recognizes the same E-box motif and binds with a higher affinity than ScPho4. It is worth noting that this observed difference in affinity, while statistically significant, is modest (∆∆G = −0.87 kcal/mol, based on the mean BLI measurements, assuming a temperature of 25 °C). Its potential impact on gene regulation depends critically on the effective concentration of Pho4 in the nucleus.

CgPho4 encodes two additional activation enhancer domains (AEDs)

We next tested the hypothesis that CgPho4 has increased transactivation potential compared to ScPho4 under the enhanced activity model. We first predicted regions with activation potential in both Pho4 orthologs using PADDLE, a Convolutional Neural Network trained on 150 activation domains (ADs) from 164 TFs in S. cerevisiae³⁰. PADDLE recovered the experimentally identified AD in ScPho4 between aa 60–102, which contained the 9aaTAD motif previously described as the minimum residues required for activation³¹ (Fig. 3A, orange triangle). No other regions with significant activation potential were predicted in ScPho4. By contrast, three regions in CgPho4 were predicted to have activation capabilities, including one corresponding to the AD in ScPho4, which was predicted to have strong activity (Z-score > 6, Fig. 3B). Two additional regions, one overlapping R1 and the other spanning the NLS and P2ID, were predicted to have medium activation strength (Z-score > 4). We refer to these two regions as E1 and E2, respectively from here on. Interestingly, the best match to the 9aaTAD motif pattern in CgPho4 was found in E2 rather than in the AD (Fig. 3B, orange triangle).

**Fig. 3: CgPho4 encodes two Activation Enhancer Domains (AEDs) that can enhance the activation potential of the main AD.**

To determine the activity of the predicted regions with activation potential in both Pho4s, we set up a yeast one-hybrid (Y1H) assay, in which each candidate region was fused to the Gal4 DBD and its activation potential was measured by a genome-integrated GAL1pr-mCherry reporter in S. cerevisiae (Fig. 3C, Materials and Methods). To see if fusing the candidate region with Gal4 DBD created a new activation domain, we applied PADDLE to all constructs and observed no peaks either in the Gal4 DBD or in the connecting regions (Supplementary Fig. 5). The yeast one-hybrid result confirmed that both ScPho4 and CgPho4 ADs were able to activate the reporter above the background level (7- and 5.6-fold, Holm-Bonferroni corrected P < 0.01). We were able to further localize the required residues for activation in ScPho4’s AD to a region of 32 aa centered on the 9aaTAD motif (Fig. 3A, C). Neither E1 nor E2 from either Pho4 activated the reporter on its own (Fig. 3C). This was surprising for CgPho4’s E1 and E2, since both had a medium strength Z-score prediction, and the latter also contained a match to the 9aaTAD (Fig. 3B). We hypothesized that E1 and E2 in CgPho4 could enhance the activity of the main AD. To test this, we first fused ScE1 and CgE1 to their respective AD, and found that only the latter resulted in a significant enhancement in the activation potential (Fig. 3D). Next, we fused CgE1 or CgE2 in both orientations to the minimal AD of ScPho4 (ScAD_9aa). We found that, indeed, both regions were able to significantly enhance the activity of the ScAD_9aa in an orientation-independent manner (Fig. 3D). It is worth noting that this enhancement activity does not involve the endogenous ScPho4 or ScPho2. ScPho4 is inactive in the phosphate replete condition in which the Y1H experiment was conducted. To test the involvement of ScPho2, we repeated the assay in a host lacking pho2. No significant difference in the boosting effects of either CgE1 or CgE2 was observed, although CgE1’s effect is slightly lower without Pho2 (Supplementary Fig. 6).

In summary, we found that CgPho4 encodes two Activation Enhancer Domains (AEDs), both of which are in the Intrinsically Disordered Region (IDR). The AEDs have little activation capability on their own but can significantly enhance the activation potential of the AD of either of the Pho4 orthologs. Since these two AEDs are present in CgPho4 but not in ScPho4, we hypothesize that they contribute to CgPho4 being less dependent on Pho2.

A dual fluorescence reporter assay accurately measures the activity of Pho4 chimeras with and without Pho2 for dissecting the divergence in co-TF dependence

So far, we compared DNA binding and activation potential between CgPho4 and ScPho4 by isolating individual regions and testing them either in vitro or in a synthetic in vivo system. While they support the enhanced activity model, it remains unclear whether and how they contribute to divergence in Pho2-dependence in the native Pho4 protein context. They also do not test or rule out the autoinhibition model, which necessitates intramolecular interactions. To answer these questions, we divided ScPho4 and CgPho4 into five corresponding parts (Figs. 1C, 4A), with breakpoints chosen to be on the edge of well-aligned regions to avoid breaking known or predicted functional domains and secondary structures. Boundaries for all regions used in this and other experiments can be found in Table 1. We created all 2⁵ = 32 combinations of these regions as well as 28 additional ones with alternative breakpoints (Fig. 4A). All constructs were C-terminally tagged with mNeon to quantify the protein levels, and were expressed from the native ScPHO4 promoter and UTRs (Fig. 4B). We also created two S. cerevisiae host strains, in which the ScPHO5 CDS was replaced by an mCherry reporter, and pho80 was knocked out so that all Pho4 chimeras were constitutively nuclear localized^24,27. One of the two host strains had pho2 knocked out. For each chimera, we measured its PHO5pr-mCherry and Pho4-mNeon levels using flow cytometry in both hosts. While mCherry and mNeon levels for the same Pho4 construct varied between experiments, the ratios between the two were consistent and characteristic of the specific construct (Fig. 4C, D). We therefore defined the activity of a Pho4 chimera as the ratio between the median fluorescence intensity (MFI) of mCherry and mNeon, which we will refer to as A_PHO2 or A_pho2∆ from hereon.

**Fig. 4: A dual fluorescence reporter system for accurately quantifying the activity of Pho4 chimeras with and without Pho2.**

Table 1 Breakpoints for chimeric Pho4 and individual regions tested in this study

Full size table

The two activation enhancer domains (AEDs) in CgPho4 increased the activity of the chimeric TFs but were insufficient to make them Pho2-independent

The chimeric Pho4 constructs exhibited varied A_PHO2 and A_pho2∆ values (Fig. 5A). Interestingly, some showed higher A_PHO2 than either ScPho4 or CgPho4. To identify the potential genetic basis for the divergence in Pho2-dependence, we first calculated the activity difference when one or two regions of ScPho4 were replaced with their counterpart(s) from CgPho4. The results were plotted as two heat maps separated based on the presence of Pho2 (Fig. 5B). For example, a chimera with its NLS from CgPho4 (NLS:Cg) and the rest from ScPho4 led to estimates of the activity difference between NLS:Cg and NLS:Sc on ScPho4 background with or without Pho2.

**Fig. 5: R1 and NLS of CgPho4 confer stronger activity than their counterparts in ScPho4.**

Based on our previous results, we expected R1, NLS, and DBD in CgPho4 to enhance the activity of the chimera with and without Pho2. In particular, the first two encode CgE1 and a majority of CgE2_9aa, which we showed enhanced the activity of the main AD (Fig. 3); DBD:Cg is expected to increase the binding ability of the chimera (Fig. 2). We indeed found R1:Cg and NLS:Cg to enhance A_PHO2 on ScPho4’s background (Fig. 5C, “R1, AD, NLS” group); contrary to our expectation, however, DBD:Cg reduced the chimera’s activity with Pho2 (Fig. 5C, bHLH). Also unexpectedly, R1:Cg and NLS:Cg had a much smaller effect on A_pho2∆ (Fig. 5B bottom, 5C right), suggesting that despite their ability to increase A_PHO2, the chimeras were still dependent on Pho2. In fact, none of the 1- or 2-region swaps led to large increases in A_pho2∆. It is worth noting that this result also argues against a classic lock-and-key model for autoinhibition, which we expect to be broken by swapping one of the regions involved.

To quantitatively examine the contribution of R1, AD, and NLS regions from CgPho4 and whether they interact non-additively (epistasis), we fit a linear model to the data to estimate the main and interaction terms for the regions, i.e., Y = X₀ + R1 + AD + NLS + R1:AD + R1:NLS + AD:NLS + R1:AD:NLS. In this model, the first term represents the activity of ScPho4, the next three terms represent the main effect of each CgPho4 region (“:Cg” omitted for brevity), and the rest are interaction terms. We found that R1:Cg and NLS:Cg each had significant, positive effects on their own on both A_PHO2 and A_pho2∆; although the magnitude was much smaller for A_pho2∆ (Fig. 5D, Holm-Bonferroni corrected P < 0.05). Both also had a significant and positive interaction term with AD:Cg on A_PHO2 (corrected P < 0.05 for R1:AD and AD:NLS); the estimates trended in the same direction but were not significant for A_pho2∆ (Fig. 5D). One explanation for the observed epistasis may be that the two CgPho4 AEDs work more efficiently with AD:Cg than with AD:Sc. However, it could also be explained by the disruption of the native conformation and function of the interacting regions resulting in a lower activity in the species-mixed constructs. Note that to minimize such effects, we designed the breakpoints to avoid any predicted secondary structures (Supplementary Figs. 1 and 2).

Since the NLS:Cg region encodes both the NLS/NES and a key part of the second AED (CgE2_9aa), we asked which of these two mechanisms was responsible for its positive effect on A_PHO2. To test the possibility that the enhanced A_PHO2 was due to a stronger nuclear localization activity of NLS:Cg, we performed fluorescence microscopy to quantify the concentration of the nuclear-localized Pho4 proteins and the ratio of nuclear vs total Pho4 proteins in six constructs that bear either NLS:Sc or NLS:Cg. No significant difference between the two groups was found (Supplementary Fig. 7, F-test P > 0.1). We thus conclude that the positive effect of NLS from CgPho4 on A_PHO2 was mainly due to its ability to boost activation.

We also examined the effect of swapping CgPho4’s DBD. On its own, swapping DBD:Cg decreased both A_PHO2 and A_pho2∆ compared to ScPho4 (Fig. 5B), suggesting that DBD:Cg may be incompatible with one or multiple regions in ScPho4.

In summary, we found that R1:Cg and NLS:Cg increased the activity of the chimera with Pho2 but were insufficient to remove the Pho2 dependence alone or in combination. In the presence of Pho2, R1:Cg and NLS:Cg both showed positive epistasis with AD:Cg on ScPho4’s background. Lastly, no 1- or 2-region swaps from CgPho4 into ScPho4 increased A_pho2∆ to the level of CgPho4. Together, we conclude that CgPho4’s two AEDs can increase the activity of the chimera in the presence of Pho2, although they fail to restore the activity without Pho2 to CgPho4’s level, suggesting that additional mechanisms are at play. CgPho4’s DBD, alone or in combination with R1 and the NLS region, did not contribute to increased activity.

A double-edged sword: Pho2 interaction domain (P2ID) in ScPho4 allows for enhanced activity with Pho2 but restricts its activity without Pho2

In the 1- and 2- region swap heatmap (Fig. 5B), the P2ID showed a puzzling pattern: swapping CgPho4’s P2ID (P2ID:Cg) into the ScPho4 background had a dominant negative effect on A_PHO2 while offering little increase in A_pho2∆. This suggests that P2ID:Sc is essential for ScPho4’s function. Conversely, swapping ScPho4’s P2ID (P2ID:Sc) into CgPho4 increased A_PHO2 beyond that of CgPho4 (18 to 22) but reduced A_pho2∆ (17 to 4.8). This suggests that P2ID:Sc is a key factor to Pho2-dependence. To investigate the unique property of the P2ID further, we plotted all chimeras based on their A_PHO2 and A_pho2∆ values (Fig. 6A). Strikingly, the chimeras fell into three distinct groups based on the identity of their P2IDs. The first group had P2ID from CgPho4, and their activities were not dependent on Pho2, falling along the diagonal line. The second group had P2ID from ScPho4. These had strong activity with Pho2 and low to no activity without Pho2, like ScPho4 does. The third group included additional chimeras with mixed P2ID regions, and they filled the intermediate space between the first two groups. Notably, many chimeras in the second group showed higher A_PHO2 than ScPho4. Most of them contained the R1 and NLS regions from CgPho4, consistent with our results above showing that these two regions enhanced the activity of the Pho4 chimeras in the presence of Pho2 (Fig. 5A, B).

**Fig. 6: P2ID in ScPho4 allows for enhanced activation with Pho2 via physical interaction with the co-TF.**

The evidence above suggests that P2ID:Sc has two effects: allowing for collaborative regulation with Pho2 and restricting Pho4’s activity without Pho2. Can these two functions be separated? To answer this question, we compared the mixed P2ID chimeras to those with whole P2IDs from either Pho4. We found that swapping P2ID:Sc into CgPho4 resulted in a 1.2-fold increase in A_PHO2 and a 71% reduction in A_pho2∆ (Fig. 6B, row 2 vs 3). Interestingly, swapping just the second half of P2ID:Sc (aa 205–242) into CgPho4 resulted in a ~40% reduction in A_PHO2 and 75% reduction in A_pho2∆. By contrast, just swapping the first half of P2ID:Sc (aa 177–204) resulted in 1.76-fold increase in A_PHO2 and maintained the same level of A_pho2∆ (1.1-fold). From this, we deduced that the first half of P2ID:Sc mainly functions to interact with Pho2 while the second half appears to limit Pho4’s activity without Pho2. The same trend was observed in another set of chimeras with lower activities (Fig. 6B, rows 6–9).

Several chimeras had very low activities even with Pho2 (A_PHo2 < 3.6, or 20% of A_PHO2 for ScPho4). Most of these nonfunctional chimeras have P2ID:Cg and DBD:Sc (Fig. 6C, rows 3–7). We hypothesize that DBD:Sc requires P2ID:Sc and Pho2 to function in the context of the full length Pho4. This is contrary to the conventional view that DBDs can function on their own, which is supported by our own in vitro results (Fig. 2). Nonetheless, we reasoned that if the above hypothesis is correct, putting P2ID:Sc back by inserting it in between P2ID:Cg and DBD:Sc should rescue the non-functional chimeras in the presence of Pho2. That is what we observed (Fig. 6C, rows 8–12), with some of the chimeras even exceeding the A_PHO2 of ScPho4 and CgPho4. However, all of them still required Pho2. These results support the above hypothesis, showing that the dual-functional P2ID is essential for ScPho4 to function.

Given that chimeras with P2ID:Cg have equal activities with and without Pho2 (Fig. 6A), we asked if CgPho4 still physically interacts with Pho2 in S. cerevisiae. Using the yeast two-hybrid assay, we were unable to detect an interaction between CgPho4ΔDBD (aa 2–463) and a region of ScPho2 known to interact with multiple TFs, including ScPho4³² (Fig. 6D). By contrast, ScPho4ΔDBD (aa 2–250) was able to interact with the same region of ScPho2 as previously found (Fig. 6D). This is consistent with our observation above, where chimeras with P2ID:Cg and lacking the region from ScPho4 mediating Pho2-interaction cannot be enhanced by Pho2 (Fig. 6A).

In summary, our chimera dissection revealed three key regions behind the difference in co-TF dependence. Notably, all three are IDRs. Among them, a region adjacent to the DBD in ScPho4 (P2ID:Sc) functions as a double-edged sword: it both allows Pho4 to gain activity with Pho2’s help and restricts it when Pho2 is absent. We showed that these two functions are encoded by physically separate parts, offering a path for determining their respective mechanisms of actions. CgPho4 lacks the ability to interact with and use Pho2’s help. Instead, the two AEDs we identified earlier—both in IDRs—conferred higher activity independent of Pho2. This, combined with the lack of the autoinhibition by P2ID:Sc, makes CgPho4 as active as ScPho4 and not dependent on the co-TF.

Discussion

Combinatorial control plays crucial roles in eukaryotic gene regulation. Mutations disrupting TF interactions can cause dysregulation and disease¹⁰. However, how mutations can alter TF interdependence itself, whether in disease or evolution, is less understood. In this study, we investigated the molecular basis for natural variation in co-TF dependence in the yeast phosphate starvation (PHO) response. We found three key differences between two orthologous Pho4s with varying dependence on the co-TF Pho2 (Fig. 7): (1) DNA Binding Affinity: CgPho4 binds the same consensus DNA motif with 3–4-fold higher affinity than ScPho4; (2) Activation potential: CgPho4 has two unique activation enhancing domains (AEDs) that increase the activation potential of both Pho4s. (3) Autoinhibition: ScPho4 contains an IDR next to its DBD that both allows it to interact with Pho2 to gain enhanced activity, and inhibits its activity in the absence of Pho2. Therefore, our results support both the enhanced activity model and the autoinhibition model for the difference in Pho2-dependence between Pho4 orthologs.

**Fig. 7: Summary of Pho4 protein divergence and its contribution to the Pho2-dependence variation in ScPho4 and CgPho4.**

Among the three differences, the contribution of CgPho4’s two AEDs to the TF’s activity and dependence on Pho2 is well supported by the yeast one-hybrid data and chimeric Pho4 results (Figs. 3, 5). We are not aware of existing examples or proposed mechanisms for such a phenomenon. One hypothesis for how AEDs work is that they are weaker ADs with the same biochemical activities, i.e., recruiting cofactors through protein-protein interactions. As such, they are not sufficient for activation by themselves but can increase the activity of the nearby AD (the effect can be non-additive). Alternatively, AEDs may affect the conformation of the AD and have no activity on their own. Further tests by synthetic constructs and biochemical assays, such as co-IP followed by mass-spec, will provide mechanistic insight into how AEDs function.

By contrast, the significance of the binding affinity differences between the two Pho4s remains unclear (Fig. 2E). Binding kinetics predicts that if the nuclear concentrations of both Pho4 are much higher than their K_D, a 3–4-fold difference in affinity would have little impact. Conversely, if the nuclear concentration of Pho4 is close to its K_D, the same difference could significantly affect gene induction. Existence of the second scenario is supported by a study showing that nuclear ScPho4 levels are much lower at intermediate phosphate concentrations than in extreme starvation conditions, leading to differential binding and induction of ScPho4’s targets³³. In our chimeric Pho4 experiments, replacing ScPho4’s DBD with that from CgPho4 reduced Pho4’s activity rather than increasing it both with and without Pho2 (Fig. 5). It seems to suggest that the binding affinity difference has no functional impact in vivo. However, it is worth noting that we assayed the activity of chimeric Pho4s in the pho80∆ background where all Pho4 proteins are constitutively inside the nucleus at near-maximal levels, hence not allowing us to test the effect of the affinity difference at a lower level of nuclear Pho4. Future studies will need to measure Pho4 activities at varying nuclear concentrations to test the above hypothesis.

Surprisingly, we found that ScPho4’s DBD requires both ScPho4’s P2ID and Pho2 for full activity (Fig. 6C), challenging the view that TFs are modular, where DBD are expected to function independently. Although ours and others’ studies confirmed that ScPho4 DBD can bind DNA in vitro (Fig. 2) and even in vivo when overexpressed on its own³⁴, they may not fully reflect its physiological activity, which depends on the nuclear concentration, nucleosomal context, and interactions with other TF regions and cofactors. Further experiments are needed to resolve the paradox and re-examine the assumption of TF modularity.

Only one of the three functional differences identified between the two Pho4 orthologs is in a structured region. The two AEDs and the dual-functional P2ID:Sc are both within IDRs. IDRs are abundant in eukaryotic TFs, with over 80% having at least one such region³⁵. Approximately 75% of ScPho4 and CgPho4 were predicted to be IDR (Supplementary Fig. 1). While most studies of TF function and evolution focus on the structured regions like the DBD, our work joins a small number of studies highlighting the significance of IDR divergence in TF evolution^36,37. On one hand, IDRs play crucial roles in nearly all aspects of a TF’s function, including recruiting cofactors, interacting with co-TFs, contributing to binding specificity and forming molecular condensates^{38,39,40,41,42,43}. On the other hand, IDRs evolve much faster than structured regions, potentially offering more raw variation for natural selection to act on. Our current understanding of how IDR evolves and affects TF function is limited by challenges in aligning their sequence and studying their functions. Progress in experimental techniques like phase separation⁴³ and large language models trained on protein sequences offers new opportunities to investigate TF IDR’s function and evolution⁴⁴.

Our study focused on dissecting the genetic basis for Pho2-dependence variation in two Pho4 orthologs. We previously showed that this trait varies across Pho4 orthologs, with reduced Pho2-dependence potentially evolving independently in more than one lineage¹⁵. We wondered whether the IDR-associated divergence identified above also correlated with the level of Pho2-dependence more broadly. Preliminary analyses of the activation potential and P2ID length across eight Pho4 orthologs with different Pho2-dependence levels supported this speculation (Supplementary Fig. 8).

Lastly, our study illustrates a mechanism for the evolution of combinatorial control in eukaryotes. Unlike previous research that focused on the gain and loss of TF interactions in gene regulatory networks^45,46, we show that the TF’s dependence on the co-TF itself can evolve. This divergence can lead to significant rewiring of the regulatory network, as seen with Pho4, where reduced co-TF dependence correlates with an expanded target gene network¹⁵. This contrasts with the dominant pattern seen in the literature on combinatorial control evolution: the Johnson lab, for example, showed that changes in TF combinations in the yeast mating type pathway altered the mode of regulation but maintained the overall network output^11,12,47,48. While cases of TF interdependence evolution are still rare, it is interesting to compare our example with Gain-of-Function mutations in kinases: in the proto-oncogene ABL, a fusion with another gene called BCL makes the merged protein independent of the upstream and downstream activators, which drives excessive cell proliferation and leads to Chronic Myeloid Leukemia⁴⁹. Whether similar changes in TF dependence on co-TFs lead to misregulation and in turn causes diseases or novel phenotypes is an interesting question for future studies.

In summary, our study provides a detailed molecular picture of how co-TF dependence is mediated and how it evolves, particularly through IDR changes. Further exploration of both questions is essential for understanding gene regulation and regulatory evolution in eukaryotes.

Methods

Breakpoints used for the chimeric Pho4 constructs and individual regions are listed in Table 1. Plasmids and strains are listed in Supplementary Data 1 and Table 2. Computational and statistical analysis scripts performed in this study are available at https://github.com/binhe-lab/E013-Pho4-evolution, which will be archived using Zenodo and minted with a DOI at the time of publication.

Table 2 Yeast strains

Full size table

Bioinformatic analyses of Pho4 orthologs

Pho4 ortholog sequences were from the Yeast Gene Order Browser (http://ygob.ucd.ie/), and were aligned using ProbCons⁵⁰ via JalView’s Web Service^51,52. The alignment was manually edited to align the five Pho80/85 motifs²⁴. Secondary structures for ScPho4 and CgPho4 were predicted using the PSIPRED 4.0⁵³. 9aaTAD motifs were predicted using the webapp https://www.med.muni.cz/9aaTAD/³¹. For ScPho4, one match was found using the moderately stringent pattern, located at aa 75–83; for CgPho4, one match was identified using the most stringent pattern, at aa 270–278, with three more using the moderately stringent patterns (aa 20–28, 24–32, 282–290). Among the latter three, aa 24–32 had a lower % match at 67% vs 83% for the others.

Identifying Pho4 binding motifs from ChIP-seq data

Chromatin Immunoprecipitation (ChIP-seq) for ScPho4 and CgPho4 were previously performed in S. cerevisiae, with both Pho4 expressed from the same endogenous ScPho4 locus with native regulatory sequences¹⁵. ChIP identified peaks for both Pho4s were downloaded from the supplementary files of the above publication. The sequences under each peak were extracted from S. cerevisiae genome (sacCer3, NCBI refseq assembly GCF_000146045.2), which were submitted to the peak-motifs tool without control sequences on the RSAT Fungi server (https://rsat.france-bioinformatique.fr/fungi/)⁵⁴. The top motif was reported for each Pho4 ortholog.

Protein expression and purification for DNA binding assays

All recombinant proteins were expressed using pET-11a (Sigma #69436-3) based vectors in BL21(DE3) E. coli cells. Vector maps are available upon request. The DNA binding domain (DBD) constructs included ScPho4 DBD-6xHIS (aa 236–312) and CgPho4 DBD-6xHIS (aa 452–533) cloned downstream of the T7 promoter in pET-11a. The transformed bacterial strains were grown overnight in LB + 100 ug/mL ampicillin. The overnight culture was diluted to OD 0.1 and the cells were grown to OD 0.6 and induced with 1 mM IPTG for 2 h. Induced cells were collected by centrifugation at 6000 rpm at 4 °C for 35 min, and were snap frozen in liquid nitrogen, then stored at −80 °C until purification. Our preliminary experiments showed that the protein of interest was largely in the insoluble fraction. Therefore, we performed a refold protocol on the Ni-NTA column. Briefly, frozen pellets were resuspended in 1xPBS, pH7.4 with cOmplete EDTA-free protease inhibitor (Sigma #11836170001) and sonicated for 45 cycles of 1 sec on / 2 sec off repeated 3 times at 50% power to lyse the cells. Sonicated samples were centrifuged at 35k rpm for 30 min. The supernatant was discarded, and the pellet was resuspended in the solubilization buffer (20 mM TrisBase, 0.5 M NaCl, 5 mM imidazole, 5.5 M Guanidine Hydrochloride, 1 mM 2-mercaptoethanol pH 8.0) and stirred at ~25 °C for an hour. The solubilized pellet was centrifuged at 35k rpm for 30 min, and the supernatant was filtered and loaded onto a 5 mL Ni-NTA column. The column was washed with a urea buffer (20 mM TrisBase, 0.5 M NaCl, 20 mM imidazole, 5.5 M Urea, 1 mM 2-mercaptoethanol pH 8.0). Then, the protein was refolded with a reverse urea gradient from buffer A (20 mM TrisBase, 0.5 M NaCl, 20 mM imidazole, 5.5 M Urea, 1 mM 2-mercaptoethanol pH 8.0) to buffer B (20 mM TrisBase, 0.5 M NaCl, 20 mM imidazole, 1 mM 2-mercaptoethanol pH 8.0) on a BioRad FPLC. The refolded protein was eluted off of the Ni-NTA column using a gradient from buffer C (20 mM TrisBase, 0.5 M NaCl, 20 mM imidazole, 1 mM 2-mercaptoethanol pH 8.0) to buffer D (20 mM TrisBase, 0.5 M NaCl, 500 mM imidazole, 1 mM 2-mercaptoethanol pH 8.0). Fractions containing the protein of interest were identified by gel electrophoresis, pooled, and diluted with a no salt buffer (25 mM Na₂HPO₄ pH 7.0, 0.5 mM THP) and run on a Heparin column equilibrated with low salt buffer (25 mM Na₂HPO₄ pH 7.0, 0.15 M NaCl, 0.5 mM THP). Protein was eluted with a gradient from low to high salt buffer (25 mM Na₂HPO₄ pH 7.0, 1.5 M NaCl, 0.5 mM THP). Protein-containing fractions were concentrated using a 3 kDa cutoff Amicon centrifugal filter (Sigma UFC8003) and loaded onto a Superdex 75 size exclusion column equilibrated with a storage buffer (25 mM Na₂HPO₄ pH 7.0, 0.5 M NaCl, 0.5 mM THP). Fractions containing the expected size products were collected, analyzed by gel electrophoresis, and stored in the storage buffer at 4 °C.

N-GST-CgPho4 full length was constructed for the Protein Binding Microarray assay. The corresponding N-GST-ScPho4 purification has been described before⁵⁵. Both proteins were grown and lysed as described in ref. ⁵⁵. Briefly, BL21 E. coli cells containing the constructs were induced at OD600 0.8 with 1 mM IPTG and collected by centrifugation after 3 h of induction at 30 °C. Pellets were snap frozen in liquid nitrogen and stored at −80 °C until purification. Cells were lysed with rLysozyme (Millipore 71110) for 20 min at room temp in the lysis buffer (1xPBS, pH7.4) with the cOmplete protease inhibitor tablet and 1 mM PMSF. The protein was run on a GST column equilibrated with the lysis buffer and eluted with a gradient to buffer B (50 mM Tris, 10 mM glutathione, pH 8). GST-CgPho4 fractions containing the protein were pooled and loaded onto a heparin column equilibrated with low salt buffer (25 mM Na₂HPO₄ pH 7.4, 150 mM NaCl, 0.5 mM THP) and eluted with a gradient to high salt buffer (25 mM sodium phosphate dibasic pH 7.4, 1.5 M NaCl, 0.5 mM THP). Fractions containing the protein were pooled and concentrated with a 30 kDa cutoff Amicon centrifugal filter (Sigma UFC9030) and loaded onto a Superdex 200 size exclusion column equilibrated with the storage buffer (25 mM HEPES pH 7.4, 500 mM NaCl, 0.5 mM THP). Protein was run on an SDS-PAGE and pure fractions were pooled and concentrated. 10% glycerol was added before snap freezing and storage at −80 °C.

Universal protein binding microarray (uPBM)

The uPBM was performed following the PBM protocol as described in ref. ⁵⁶. Briefly, after the primer extension step is used to double-strand the DNA molecules on the array, the chambers are blocked with 2% milk. After washing, proteins are incubated with the array for 1 h. Alexa Fluor 488-conjugated anti-GST antibody (Invitrogen A-11131) was used to detect binding. The array was scanned using a GenePix 4400 A scanner (Molecular Devices). GST-ScPho4 and GST-CgPho4 (full length) were prepared as described above. Both proteins were assayed at a final concentration of 1 μM as determined by the optical absorbance. An 8 × 15k array was used to assay all possible 9-mers, from which a robust 7-mer enrichment score is derived. The non-parametric enrichment score, or E-score, is invariant to differences in the concentration of the proteins used in the assay, and thus are suitable for comparisons of relative affinities between arrays. E-score ranges from −0.5 (lowest enrichment) to +0.5 (highest enrichment). Scores greater than 0.35 correspond to specific TF-DNA binding^29,57. Data analysis was performed using custom Perl scripts as described in ref. ⁵⁶ to extract and normalize fluorescence based intensity.

Genomic context protein binding microarray (gcPBM)

All probes in the DNA library are 60 bp in length with 24 bp complementary to the primer used for double stranding and 36 bp of genomic region centered on the E-box motif or its variant. The library contains (1) 5711 S. cerevisiae genomic regions with putative Pho4 binding sites; (2) 150 negative controls from the S. cerevisiae genome not specifically bound by Pho4⁵⁵; (3) 4000 DNA sequences used in previous MITOMI experiments to calibrate the binding affinities⁵⁸; (4) 150 genomic regions from C. glabrata that contain Pho4 consensus and nonconsensus binding sites¹⁵; (5) 100 probes from the library of sequences we tested with Biolayer Interferometry; and (6) 150 negative controls from the C. glabrata genome not bound by CgPho4. We used the NNNNGTG, CACNNN, and GTGNNN libraries from Maerkl and Quake (2007) in our gcPBM design. The MITOMI and BLI probes required the addition of random flanks to maintain the 36 bp length;10 different flanking sequences were generated for each sequence. Each probe is represented by six replicates, including three replicates in each orientation and were distributed randomly across the array. The custom 8 × 60k (8 chambers, 60,000 DNA spots per chamber) was synthesized by Agilent Technologies. The gcPBM was performed and analyzed following the PBM protocol as described above and in refs. ^29,55. The log transformed median intensity of the 6 replicate probes were used for comparisons. Because the log signal intensities are not directly comparable between the two Pho4 proteins, which were assayed on separate arrays, we used Spearman’s rank correlation coefficient to quantify the level of concordance in their sequence preference. Any difference in preference should result in a change in the ranks within each Pho4’s data.

Biolayer interferometry (BLI)

A library of 17-bp dsDNA was used for the assay. The consensus probe had the sequence “CTAGTCCCACGTGTGAG”, with the E-box motif bolded, and was identical to the DNA used in the crystal structure of ScPho4’s DBD²⁶. Nine half-site variants, including “AACGTG, TACGTG, … CAAGTG, ACGGTG, CATGTG” were constructed on this background. For each probe, the complementary ssDNA oligos were synthesized by Integrated DNA Technologies (IDT). One of the two probes was biotinylated on the 5’ end. To anneal them into dsDNA, 1 pmol (5 μL of 200 μM) of the biotinylated oligo was mixed with 2 pmol (10 μL of 200 μM) of the complementary, unmodified strand in the Nuclease-Free Duplex Buffer (30 mM HEPES, pH 7.5; 100 mM potassium acetate). The mixture was heated to 95 °C and then left at room temperature for it to cool down. Annealed probes were stored at −20 °C until use.

ScPho4 DBD and CgPho4 DBD were purified and stored as described above. Protein quality was checked weekly for signs of degradation using Dynamic Light Scattering (DLS) on a DynaPro NanoStar instrument. Before each experiment, the concentration of the protein prep was measured in triplicates using a NanoDrop instrument. The mean concentration was used to prepare the protein dilutions and calculate K_D.

BLI experiments were performed on an Octet RED 96 instrument at 30 °C with 1000 rpm shaking. For each protein-probe pair, eight streptavidin (SA) biosensors were hydrated for 15–20 min at room temperature in the 1× kinetics buffer (1× PBS pH 7.4, 0.01% BSA, 0.002% Tween-20, 0.005% Sodium azide). A black 96 well flat-bottom plate was loaded with experimental components also diluted in the 1× kinetics buffer. To begin the experiment, biosensors were equilibrated in a 1× kinetics buffer for 60 s to reach the baseline. Seven biosensors were then submerged in 35 nM biotinylated annealed DNA for 30 s, while one biosensor was submerged in the 1× kinetics buffer as a no-DNA control. Biosensors were then submerged in 1 μg/ml biocytin for 60 s to block any empty streptavidin pocket on their surface, before being dipped back into a 1× kinetics buffer for 60 s for a baseline measurement. Loaded and blocked biosensors were then submerged into a gradient of protein concentrations calculated for each probe based on the K_D. To obtain reliable measurements, we use a concentration range that spans 10× to 1/10 of the K_D values. Biosensors stayed in the protein solution for 900–1000 s or until equilibrium was reached.

Data analysis was performed in the ForteBio Data analysis v11 software. After subtracting the background and aligning the y-axis, the processed data were subjected to either a steady state analysis or a kinetic curve fitting. For steady state analysis, the equilibrium-level signal from each biosensor was plotted against the protein concentration, from which K_D was calculated. Kinetic curve fitting was done using a one-site specific binding with Hill slope model as implemented in the Data analysis v11 software. The latter more effectively fit the BLI data and thus were utilized for most of the analysis. For the consensus sequence, we found the kinetic curve-based estimates for both ScPho4 and CgPho4 to have higher variance, likely due to the fast kinetics not adequately captured by the model⁵⁹. Steady-state analysis gave consistent mean K_D as the kinetic analysis did, but resulted in lower variance, and was used instead. Two to four replicates were performed for each 17 bp DNA library sequence, using at least two independent protein preps.

Electrophoretic mobility shift assay (EMSA)

IR700 labeled 17 bp consensus DNA (same as the consensus DNA for BLI above, Integrated DNA technologies) was diluted to 0.1 nM and mixed 1:1 with a 2× dilution series of the DBD of interest starting at 32 nM in binding buffer (20 mM Tris-HCL, pH 8, 150 mM KCl, 10% glycerol, 5 mM MgCl2, 1 mM EDTA, and 1 mM DTT). The mixture was incubated for 1 h at 4 °C. A 10% native PAGE gel in 1× Tris Glycine buffer (2.5 mM Tris pH 8.3, 19.2 mM glycine) was prerun at 200 V for 20 min. The DNA/protein mixture was then loaded onto the gel and ran for 25–35 min. The gel was imaged using the Odyssey FC imager in the 700 nm channel. To estimate K_D, the unbound band in each protein-containing lane was quantified using the ImageStudio software (with background subtraction) and divided by the no protein control. This value was then subtracted from 1 and plotted against the protein concentrations. A nonlinear curve fitting was performed in Prism v10.2.1 using the one site specific binding model. K_D estimates were reported.

Yeast media and growth

Yeast cells were grown in Yeast extract-Peptone Dextrose (YPD) medium or Synthetic Complete (SC) medium, using Yeast Nitrogen Base without amino acids (Sigma Y0626) supplemented with 2% glucose and amino acid mix. Phosphate starvation medium was made using Yeast Nitrogen Base with ammonium sulfate, without phosphates, without sodium chloride (MP Biomedicals, 114027812) and supplemented to a final concentration of 2% glucose, 1.5 mg/ml potassium chloride, 0.1 mg/ml sodium chloride, and amino acids, as described previously¹⁵. Phosphate concentration in the medium was measured using a Malachite Green Phosphate Assay kit (Sigma, MAK307).

Yeast strain and plasmid construction

The hosts containing the endogenous PHO5pr-mCherry reporter were constructed by replacing the endogenous coding sequence with mCherry using CRISPR/Cas9. Briefly, guide RNAs targeting PHO5 were designed in Benchling (https://www.benchling.com) and cloned into bRA89 (AddGene 100950), a plasmid that contains the Cas9 protein and gRNA scaffold⁶⁰. Homology arms to the PHO5 5’UTR and 3’ UTR were added onto an mCherry donor DNA using PCR. The donor DNA and plasmid were co-transformed into the host using the standard LiAc transformation protocol. Transformants were selected for the CRISPR plasmid using hygromycin resistance and screened with PCR. Positive clones were validated using Sanger sequencing and fluorescence microscopy. Pho2 was then knocked out using a HIS3 cassette with homology arms to the PHO2 5’UTR and 3’UTR.

The chimeric Pho4 plasmid library was constructed using Golden Gate. The fragments of CgPho4 and ScPho4 were PCR amplified using Phusion Flash polymerase (Thermo Scientific F548S) with unique overhangs for Golden Gate. They were then assembled and inserted into a pRS315-based backbone that contains the ScPHO4 promoter, a C-terminal in-frame mNeon tag followed by the ScPHO4 3’ UTR and terminator. Primers were designed using the NEBridge Golden Gate assembly tool. The inserts were verified with PCR and Sanger sequencing and then transformed into the yeast hosts using either the standard LiAc protocol⁶¹ or the Zymo yeast transformation kit (Zymo research T2001) and plated onto SD-leu media.

Flow cytometry

Cells were inoculated into a 96 deep-well plate (Fisher Scientific 07-200-700) and grown overnight in SD-Leu or SC medium supplemented to final concentrations of 0.13 mg/ml adenine and 0.1 mg/ml tryptophan to reduce autofluorescence⁶². Cultures were diluted in the morning to an OD600 of 0.15 and grown to an OD600 of 0.6. These cells were directly subjected to flow cytometry at a flow rate of 25 uL/min on an Attune NxT flow cytometer (ThermoFisher) fitted with an autosampler. Data were collected using the Attune NxT software v3.1 for FSC, SSC and the appropriate fluorescence channels. Calibration beads (Spherotech RCP305A) were run routinely to ensure that experiments from different days were comparable. Pho4-mNeon was measured in the BL1 channel with 488 nm excitation and 510 ± 10 nm emission; mCherry was measured in the YL2 channel using 561 nm excitation and 615 ± 25 nm emission. Voltages for each channel were set by the brightest sample and negative control so the sample signals were between 10²–10⁵. Events were gated based on FSC-H / SSC-H to remove non-cells, then on the FSC-W / FSC-H to isolate singlets (Supplementary Fig. 9). At least 10,000 gated events were collected per sample. Each strain was measured at least three times, and on two different dates. A nonfluorescent strain was included in every experiment for subtracting the autofluorescence. Further gating and analyses were performed in R using the FlowClust and FlowCore packages. Detailed analysis scripts are available in the project github repository.

Fluorescent microscopy for Pho4 nuclear localization

Six Pho4 constructs were chosen, with three bearing NLS:Sc (SSSSS, CCSCC, SSSCC) and three bearing NLS:Cg (SSCSS, CCCCC, SSCCS). They were transformed into either a PHO2 wild type or pho2∆ background. Yeast cells were grown to mid-log phase in SD-Leu and fixed using 4% paraformaldehyde for 10 min, washed twice with 1xPBS before DAPI staining. 4’,6-diamidino-2-phenylindole (DAPI, Sigma, D9542) was diluted in the respective medium and added to the fixed cells at a final concentration of 10 μg/mL. Cells were incubated with DAPI in the dark for 30 min before DAPI was removed and cells were washed twice with 1xPBS. Fixed cells were mounted on a 1.2% agarose pad on a depression slide for image. Fluorescent imaging was performed on a Leica epifluorescence microscope. A 405 nm laser was used for DAPI excitation and 430–550 nm for emission. For Pho4-mNeON, 488 nm was used for excitation and 530 nm for emission. A bright field image was also taken to show the cell boundaries. A total of three images were recorded for each field of view.

Microscopy analysis was performed in ImageJ (v 1.53). To quantify the nuclear fraction of Pho4 proteins, ten cells were randomly selected for each construct in each host background, and the experiment was repeated two times, resulting in a total of 15 or 20 cells quantified in PHO2 and pho2∆ backgrounds, respectively. Cell boundaries were manually traced using the bright field channel, and nucleus using the DAPI channel. Both were added to the ROI manager, and the raw integrated density (sum of pixel values) was quantified for both areas. The ratio of nuclear vs whole cell integrated density was used for plotting and statistical analysis.

Yeast one-hybrid and yeast two-hybrid

Indicated regions from CgPho4, ScPho4, and ScPho2 were PCR amplified and cloned into pGBD-C3⁶³ containing the Gal4 DBD using Gibson Assembly. Plasmids were verified with PCR and Sanger sequencing. The GAL1pr-mCherry reporter was created using a pRS306 based integrative plasmid digested with StuI and inserted into the ura3 locus of a gal4Δ gal80Δ S. cerevisiae strain. The reporter was tested using flow cytometry according to the protocol above using only the red (YL2) channel. Plasmids were then transformed into this yeast strain using the standard LiAc method⁶¹ or the Zymo yeast transformation kit (Zymo research T2001) and selected for on SD-trp. For the yeast two-hybrid, the Gal4 activation domain (AD) and Gal4 DNA binding domain (DBD) fusion plasmids were constructed by PCR amplifying the indicated regions and cloning into pGBD-C3 and pGAD-C3⁶³ using Gibson Assembly. The plasmids were co-transformed and selected for on SD -leu -trp. Positive colonies were patched onto fresh plates and grown at 30 °C for 24 h before replica plating onto SD -leu -trp -ade -his to test for interactions. Plates were imaged after 45–50 h.

Statistical analyses

All replicates are biological. For yeast strains, the same strain was grown and measured in separate vials. Independent transformants were tested for the flow cytometry host strains and randomly selected constructs and were found to generate consistent results. For recombinant proteins, the same constructs were independently transformed into the bacteria and separate batches of purified proteins were used for the measurements.

Binding affinity comparison by BLI (Fig. 2): K_D estimates from either steady state or kinetic curve fitting analyses were log10 transformed and a Student’s t-test was used to compare ScPho4 vs CgPho4 DBDs against the same 17-bp DNA. Raw two-sided P-values and Holm-Bonferroni corrected P values for the 10 tests were reported; Activation (Fig. 3): median fluorescence intensity (MFI) for the GAL1pr-mCherry reporter was recorded from flow cytometry for each sample. For Fig. 3C, each Gal4 DBD fusion construct was compared to the background level. Because the host (with GAL1pr-mCherry but no Gal4 DBD plasmid) and Gal4 DBD alone showed a similar level of low MFI, we combined them as the reference group. A linear model was fit in R with the following command “lm(MFI ~ Genotype)”, where “Genotype” is a factor representing the constructs. This model estimates a common standard deviation for all constructs and tests the significance of each construct against the reference group with a two-sided t-test. The raw P-values were corrected for multiple testing using the Holm-Bonferroni procedure. Constructs with a corrected P ≤ 0.05 were considered significant. For Fig. 3D, the first two groups were tested using a two-sample t-test, while the third involved multiple levels, and were tested as described above for Fig. 3C. A Holm-Bonferroni correction was applied to the raw P-values for all six tests together. Epistasis between regions R1, AD, and NLS of CgPho4 (Fig. 5D): to determine the main and interaction terms in these swaps from CgPho4 on ScPho4’s background, we fit a linear model, Y = X₀ + R1 + AD + NLS + R1:AD + R1:NLS + AD:NLS + R1:AD:NLS in R, using the command “lm(A ~ R1 * AD * NLS)”, where “A” is either A_PHO2 or A_pho2∆, and the independent variables are coded as ScPho4 = 0, CgPho4 = 1. The raw P-values were adjusted for multiple testing using the Holm-Bonferroni procedure. Split P2ID (Fig. 6B): The same procedure as applied to the activation region test above (Fig. 3) was applied to the split P2ID swaps. Two sets of four chimeras were chosen, each with a reference construct having P2ID:Cg, one with the entire P2ID swapped for the ScPho4 version, and two with the first or second half of P2ID swapped for the ScPho4 version. The latter three were compared to the reference using a linear model “lm(MFI ~ Genotype)”, for A_PHO2 and A_pho2∆ separately. Holm-Bonferroni correction was applied to the two sets combined (6 tests in total), again separately for A_PHO2 and A_pho2∆. A corrected P < 0.05 was considered significant. Nuclear fraction of Pho4 chimeras (Supplementary Fig. 7): The ratio of the sum of pixel values in the GFP channel inside the nucleus vs the whole cell was treated as the response variable, and the identity of the NLS region (“Cg” vs “Sc”) as the predictor. First, a two-way ANOVA was performed with the formula “lm(Nuc_frac ~ NLS + Host)”, where Host is either PHO2 or pho2∆. Second, given that ratios are often non-normally distributed, we performed the non-parametric Kruskal-Wallis test with the command “kruskal.test(Nuc_frac ~ NLS)” separately in the PHO2 and pho2∆ hosts. The ANOVA F-test P-value (0.13) was reported in the results. The two P-values for the second test were 0.42 (PHO2) and 0.36 (pho2∆).

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

Source data for figures in this paper are provided as a Source Data file and are available at https://doi.org/10.5281/zenodo.14501732. Raw microscopy images for quantifying Pho4 nuclear concentration are available at https://doi.org/10.6084/m9.figshare.28437011. Protein Binding Microarray data are available through the Gene Expression Omnibus (GEO) under GSE293214 and GSE293355. No restrictions apply to any of the data generated in this study. Source data are provided with this paper.

Code availability

Scripts for figures and statistical tests are available at https://doi.org/10.5281/zenodo.14501732.

References

Stewart, A. J., Hannenhalli, S. & Plotkin, J. B. Why transcription factor binding sites are ten nucleotides long. Genetics 192, 973–985 (2012).
Article CAS PubMed PubMed Central Google Scholar
Berger, M. F. et al. Variation in homeodomain DNA binding revealed by high-resolution analysis of sequence preferences. Cell 133, 1266–1276 (2008).
Article CAS PubMed PubMed Central Google Scholar
Zhou, X. & O’Shea, E. K. Integrated approaches reveal determinants of genome-wide binding and function of the transcription factor Pho4. Mol. Cell 42, 826–836 (2011).
Article CAS PubMed PubMed Central Google Scholar
Merhej, J. et al. A network of paralogous stress response transcription factors in the human pathogen candida glabrata. Front. Microbiol. 7, 645 (2016).
Article PubMed PubMed Central Google Scholar
Sánchez-Higueras, C. et al. In vivo Hox binding specificity revealed by systematic changes to a single cis regulatory module. Nat. Commun. 10, 3597 (2019).
Article ADS PubMed PubMed Central Google Scholar
Slattery, M. et al. Cofactor binding evokes latent differences in DNA binding specificity between Hox proteins. Cell 147, 1270–1282 (2011).
Article CAS PubMed PubMed Central Google Scholar
Todeschini, A.-L., Georges, A. & Veitia, R. A. Transcription factors: specific DNA binding and specific gene regulation. Trends Genet. 30, 211–219 (2014).
Article CAS PubMed Google Scholar
Avsec, Ž. et al. Base-resolution models of transcription-factor binding reveal soft motif syntax. Nat. Genet. https://doi.org/10.1038/s41588-021-00782-6 (2021).
Article PubMed PubMed Central Google Scholar
Guss, K. A., Nelson, C. E., Hudson, A., Kraus, M. E. & Carroll, S. B. Control of a genetic regulatory network by a selector gene. Science 292, 1164–1167 (2001).
Article ADS CAS PubMed Google Scholar
Ang, Y.-S. et al. Disease model of GATA4 mutation reveals transcription factor cooperativity in human cardiogenesis. Cell 167, 1734–1749.e22 (2016).
Article CAS PubMed PubMed Central Google Scholar
Baker, C. R., Booth, L. N., Sorrells, T. R. & Johnson, A. D. Protein modularity, cooperative binding, and hybrid regulatory states underlie transcriptional network diversification. Cell 151, 80–95 (2012).
Article CAS PubMed PubMed Central Google Scholar
Britton, C. S., Sorrells, T. R. & Johnson, A. D. Protein-coding changes preceded cis-regulatory gains in a newly evolved transcription circuit. Science 367, 96–100 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Traven, A., Jelicic, B. & Sopta, M. Yeast Gal4: a transcriptional paradigm revisited. EMBO Rep. 7, 496–499 (2006).
Article CAS PubMed PubMed Central Google Scholar
Kerwin, C. L. & Wykoff, D. D. Candida glabrata PHO4 is necessary and sufficient for Pho2-independent transcription of phosphate starvation genes. Genetics 182, 471–479 (2009).
Article CAS PubMed PubMed Central Google Scholar
He, B. Z., Zhou, X. & O’Shea, E. K. Evolution of reduced co-activator dependence led to target expansion of a starvation response pathway. eLife 6, e25157 (2017).
Article PubMed PubMed Central Google Scholar
Barbaric, S., Münsterkötter, M., Goding, C. & Hörz, W. Cooperative Pho2-Pho4 interactions at the PHO5 promoter are critical for binding of Pho4 to UASp1 and for efficient transactivation by Pho4 at UASp2. Mol. Cell. Biol. 18, 2629–2639 (1998).
Article CAS PubMed PubMed Central Google Scholar
Nourani, A., Utley, R. T., Allard, S. & Côté, J. Recruitment of the NuA4 complex poises the PHO5 promoter for chromatin remodeling and activation. EMBO J. 23, 2597–2607 (2004).
Article CAS PubMed PubMed Central Google Scholar
Magbanua, J. P., Ogawa, N., Harashima, S. & Oshima, Y. The transcriptional activators of the PHO regulon, Pho4p and Pho2p, interact directly with each other and with components of the basal transcription machinery in Saccharomyces cerevisiae. J. Biochem. 121, 1182–1189 (1997).
Article CAS PubMed Google Scholar
Shao, D., Creasy, C. L. & Bergman, L. W. Interaction of Saccharomyces cerevisiae Pho2 with Pho4 increases the accessibility of the activation domain of Pho4. Mol. Gen. Genet. 251, 358–364 (1996).
CAS PubMed Google Scholar
Jayaraman, P. S., Hirst, K. & Goding, C. R. The activation domain of a basic helix-loop-helix protein is masked by repressor interaction with domains distinct from that required for transcription regulation. EMBO J. 13, 2192–2199 (1994).
Article CAS PubMed PubMed Central Google Scholar
Ogawa, N. & Oshima, Y. Functional domains of a positive regulatory protein, PHO4, for transcriptional control of the phosphatase regulon in Saccharomyces cerevisiae. Mol. Cell. Biol. 10, 2224–2236 (1990).
CAS PubMed PubMed Central Google Scholar
McAndrew, P. C., Svaren, J., Martin, S. R., Hörz, W. & Goding, C. R. Requirements for chromatin modulation and transcription activation by the Pho4 acidic activation domain. Mol. Cell. Biol. 18, 5818–5827 (1998).
Article CAS PubMed PubMed Central Google Scholar
Komeili, A. & O’Shea, E. K. Roles of phosphorylation sites in regulating activity of the transcription factor Pho4. Science 284, 977–980 (1999).
Article ADS CAS PubMed Google Scholar
O’Neill, E. M., Kaffman, A., Jolly, E. R. & O’Shea, E. K. Regulation of PHO4 nuclear localization by the PHO80-PHO85 cyclin-CDK complex. Science 271, 209–212 (1996).
Article ADS PubMed Google Scholar
Hirst, K., Fisher, F., McAndrew, P. C. & Goding, C. R. The transcription factor, the Cdk, its cyclin and their regulator: directing the transcriptional response to a nutritional signal. EMBO J. 13, 5410–5420 (1994).
Article CAS PubMed PubMed Central Google Scholar
Shimizu, T. et al. Crystal structure of PHO4 bHLH domain–DNA complex: flanking base recognition. EMBO J. 16, 4689–4697 (1997).
Article CAS PubMed PubMed Central Google Scholar
Kaffman, A., Herskowitz, I., Tjian, R. & O’Shea, E. K. Phosphorylation of the transcription factor PHO4 by a cyclin-CDK complex, PHO80-PHO85. Science 263, 1153–1156 (1994).
Article ADS CAS PubMed Google Scholar
Gomes-Vieira, A. L. et al. Evolutionary conservation of a core fungal phosphate homeostasis pathway coupled to development in Blastocladiella emersonii. Fungal Genet. Biol. 115, 20–32 (2018).
Article CAS PubMed Google Scholar
Gordân, R. et al. Genomic regions flanking E-box binding sites influence DNA binding specificity of bHLH transcription factors through DNA shape. Cell Rep. 3, 1093–1104 (2013).
Article PubMed PubMed Central Google Scholar
Sanborn, A. L. et al. Simple biochemical features underlie transcriptional activation domain diversity and dynamic, fuzzy binding to Mediator. eLife 10, e68068 (2021).
Article CAS PubMed PubMed Central Google Scholar
Piskacek, S. et al. Nine-amino-acid transactivation domain: establishment and prediction utilities. Genomics 89, 756–768 (2007).
Article CAS PubMed Google Scholar
Bhoite, L. T. et al. Mutations in the Pho2 (Bas2) transcription factor that differentially affect activation with its partner proteins Bas1, Pho4, and Swi5. J. Biol. Chem. 277, 37612–37618 (2002).
Article CAS PubMed Google Scholar
Ming Yip, H., Cheng, S., Olson, E. J., Crone, M. & Maerkl, S. J. Perfect adaptation achieved by transport limitations governs the inorganic phosphate response in S. cerevisiae. Proc. Natl. Acad. Sci. USA 120, e2212151120 (2023).
Article PubMed PubMed Central Google Scholar
Donovan, B. T. et al. Basic helix-loop-helix pioneer factors interact with the histone octamer to invade nucleosomes and generate nucleosome-depleted regions. Mol. Cell 83, 1251–1263.e6 (2023).
Article CAS PubMed PubMed Central Google Scholar
Liu, J. et al. Intrinsic disorder in transcription factors. Biochemistry 45, 6873–6888 (2006).
Article CAS PubMed Google Scholar
Lynch, V. J., May, G. & Wagner, G. P. Regulatory evolution through divergence of a phosphoswitch in the transcription factor CEBPB. Nature 480, 383–386 (2011).
Article ADS CAS PubMed Google Scholar
Hsu, I. S. et al. A functionally divergent intrinsically disordered region underlying the conservation of stochastic signaling. PLOS Genet. 17, e1009629 (2021).
Article CAS PubMed PubMed Central Google Scholar
Krois, A. S., Dyson, H. J. & Wright, P. E. Long-range regulation of p53 DNA binding by its intrinsically disordered N-terminal transactivation domain. Proc. Natl Acad. Sci. 115, E11302–E11310 (2018).
Article ADS CAS PubMed PubMed Central Google Scholar
Brodsky, S. et al. Intrinsically disordered regions direct transcription factor in vivo binding specificity. Mol. Cell https://doi.org/10.1016/j.molcel.2020.05.032 (2020).
Article PubMed Google Scholar
Ferrie, J. J., Karr, J. P., Tjian, R. & Darzacq, X. “Structure”-function relationships in eukaryotic transcription factors: the role of intrinsically disordered regions in gene regulation. Mol. Cell 82, 3970–3984 (2022).
Ferrie, J. J. et al. p300 is an obligate integrator of combinatorial transcription factor inputs. Mol. Cell https://doi.org/10.1016/j.molcel.2023.12.004 (2023). S1097-2765(23)01023–7.
Article PubMed Google Scholar
Mindel, V. et al. Intrinsically disordered regions of the Msn2 transcription factor encode multiple functions using interwoven sequence grammars. Nucleic Acids Res. gkad1191. https://doi.org/10.1093/nar/gkad1191 (2023).
Ji, D. et al. FOXA1 forms biomolecular condensates that unpack condensed chromatin to function as a pioneer factor. Mol. Cell 84, 244–260.e7 (2024).
Article CAS PubMed Google Scholar
Rives, A. et al. Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences. Proc. Natl. Acad. Sci. USA 118, e2016239118 (2021).
Article CAS PubMed PubMed Central Google Scholar
Tuch, B. B., Galgoczy, D. J., Hernday, A. D., Li, H. & Johnson, A. D. The evolution of combinatorial gene regulation in fungi. PLoS Biol. 6, e38 (2008).
Article PubMed PubMed Central Google Scholar
Wapinski, I. et al. Gene duplication and the evolution of ribosomal protein gene regulation in yeast. Proc. Natl. Acad. Sci. USA 107, 5505–5510 (2010).
Article ADS CAS PubMed PubMed Central Google Scholar
Tsong, A. E., Tuch, B. B., Li, H. & Johnson, A. D. Evolution of alternative transcriptional circuits with identical logic. Nature 443, 415–420 (2006).
Article ADS CAS PubMed Google Scholar
Nocedal, I. & Johnson, A. D. How transcription networks evolve and produce biological novelty. Cold Spring Harb. Symp. Quant. Biol. 80, 265–274 (2015).
Article PubMed Google Scholar
Sherbenou, D. W. & Druker, B. J. Applying the discovery of the Philadelphia chromosome. J. Clin. Investig. 117, 2067–2074 (2007).
Article CAS PubMed PubMed Central Google Scholar
Do, C. B., Mahabhashyam, M. S. P., Brudno, M. & Batzoglou, S. ProbCons: probabilistic consistency-based multiple sequence alignment. Genome Res. 15, 330–340 (2005).
Article CAS PubMed PubMed Central Google Scholar
Waterhouse, A. M., Procter, J. B., Martin, D. M. A., Clamp, M. & Barton, G. J. Jalview Version 2-a multiple sequence alignment editor and analysis workbench. Bioinformatics 25, 1189–1191 (2009).
Article CAS PubMed PubMed Central Google Scholar
Troshin, P. V. et al. JABAWS 2.2 distributed web services for Bioinformatics: protein disorder, conservation and RNA secondary structure. Bioinformatics 34, 1939–1940 (2018).
Article CAS PubMed PubMed Central Google Scholar
Buchan, D. W. A. & Jones, D. T. The PSIPRED Protein Analysis Workbench: 20 years on. Nucleic Acids Res. 47, W402–W407 (2019).
Article CAS PubMed PubMed Central Google Scholar
Thomas-Chollier, M. et al. RSAT peak-motifs: motif analysis in full-size ChIP-seq datasets. Nucleic Acids Res. gkr1104. https://doi.org/10.1093/nar/gkr1104 (2011).
Zhang, Y., Ho, T. D., Buchler, N. E. & Gordân, R. Competition for DNA binding between paralogous transcription factors determines their genomic occupancy and regulatory functions. Genome Res. 31, 1216–1229 (2021).
Article CAS PubMed PubMed Central Google Scholar
Berger, M. F. & Bulyk, M. L. Universal protein-binding microarrays for the comprehensive characterization of the DNA-binding specificities of transcription factors. Nat. Protoc. 4, 393–411 (2009).
Article CAS PubMed PubMed Central Google Scholar
Berger, M. F. & Bulyk, M. L. Protein binding microarrays (PBMs) for rapid, high-throughput characterization of the sequence specificities of DNA binding proteins. in Gene Mapping, Discovery, and Expression Vol. 338, 245–260 (Humana Press, 2006).
Maerkl, S. J. & Quake, S. R. A systems approach to measuring the binding energy landscapes of transcription factors. Science 315, 233–237 (2007).
Article ADS CAS PubMed Google Scholar
Weeramange, C. J., Fairlamb, M. S., Singh, D., Fenton, A. W. & Swint‐Kruse, L. The strengths and limitations of using biolayer interferometry to monitor equilibrium titrations of biomolecules. Protein Sci. pro.3827. https://doi.org/10.1002/pro.3827 (2020).
Anand, R., Memisoglu, G. & Haber, J. Cas9-mediated gene editing in Saccharomyces cerevisiae. Protocol Exchange. https://doi.org/10.1038/protex.2017.021a (2017).
Gietz, R. D. & Schiestl, R. H. High-efficiency yeast transformation using the LiAc/SS carrier DNA/PEG method. Nat. Protoc. 2, 31–34 (2007).
Article CAS PubMed Google Scholar
Lam, F. H., Steger, D. J. & O’Shea, E. K. Chromatin decouples promoter threshold from dynamic range. Nature 453, 246–250 (2008).
Article ADS CAS PubMed PubMed Central Google Scholar
James, P., Halladay, J. & Craig, E. A. Genomic libraries and a host strain designed for highly efficient two-hybrid selection in yeast. Genetics 144, 1425–1436 (1996).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We would like to thank Dr. Miles Pufall’s lab for teaching us EMSA. We thank Dr. Jan Fassler for sharing many yeast plasmids for the yeast one- and two-hybrids. Drs. Jan Fassler, Miles Pufall, Craig Ellermeier and Todd Washington all critically read the thesis chapter by L.F.S., on which this manuscript is based. We thank Kyle Malcolm for helping with the early development of the BLI assay. Christian Weinrich discovered the P2ID’s effect on Pho2-dependence during his rotation. We thank Dr. Yann Vanrobaeys for helping with an initial analysis using PADDLE. We would like to acknowledge the use of resources at the Protein and Crystallography Facility within the Carver College of Medicine at the University of Iowa and thank Lokesh Gakhar, Zhen Xu, and Devin Reusch for assistance with protein purification and BLI assays. This work was primarily supported by NIH R35-GM137831 (to B.Z.H.). L.F.S. was supported on NIH Predoctoral Training Grant T32GM008629; B.J.B. was supported on NIH Predoctoral Training Grant T32GM1454441; R.G. was supported by NIH R01-GM135658 and B.Z.H. was also supported by a startup fund from the University of Iowa.

Author information

Jia Zhao
Present address: Laboratory of Immunophysiology, The Ragon Institute of Mass General, MIT, and Harvard; Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, USA
Yuning Zhang
Present address: Department of Genetics, Washington University School of Medicine, St. Louis, MO, USA

Authors and Affiliations

Interdisciplinary Graduate Program in Genetics, University of Iowa, Iowa City, IA, USA
Lindsey F. Snyder, Baylee J. Bruce & Bin Z. He
Department of Biology, University of Iowa, Iowa City, IA, USA
Emily M. O’Brien, Jia Zhao, Jinye Liang, Thomas H. Cassier & Bin Z. He
Department of Biostatistics & Bioinformatics, Duke University, Durham, NC, USA
Yuning Zhang & Raluca Gordân
Department of Molecular Genetics & Microbiology, Duke University, Durham, NC, USA
Wei Zhu & Raluca Gordân
Protein and Crystallography Facility, University of Iowa, Iowa City, IA, USA
Nicholas J. Schnicker
Department of Molecular Physiology and Biophysics, University of Iowa, Iowa City, IA, USA
Nicholas J. Schnicker
Department of Pediatrics, Division of Gastroenterology, Hepatology and Nutrition, Boston Children’s Hospital and Harvard Medical School, Boston, MA, USA
Xu Zhou
Department of Computer Science, Duke University, Durham, NC, USA
Raluca Gordân
Department of Cell Biology, Duke University, Durham, NC, USA
Raluca Gordân
Department of Genomics and Computational Biology, University of Massachusetts Chan Medical School, Worcester, MA, USA
Raluca Gordân

Authors

Lindsey F. Snyder
View author publications
Search author on:PubMed Google Scholar
Emily M. O’Brien
View author publications
Search author on:PubMed Google Scholar
Jia Zhao
View author publications
Search author on:PubMed Google Scholar
Jinye Liang
View author publications
Search author on:PubMed Google Scholar
Baylee J. Bruce
View author publications
Search author on:PubMed Google Scholar
Yuning Zhang
View author publications
Search author on:PubMed Google Scholar
Wei Zhu
View author publications
Search author on:PubMed Google Scholar
Thomas H. Cassier
View author publications
Search author on:PubMed Google Scholar
Nicholas J. Schnicker
View author publications
Search author on:PubMed Google Scholar
Xu Zhou
View author publications
Search author on:PubMed Google Scholar
Raluca Gordân
View author publications
Search author on:PubMed Google Scholar
Bin Z. He
View author publications
Search author on:PubMed Google Scholar

Contributions

L.F.S. and B.Z.H. designed the experiments. L.F.S. and E.M.O. constructed the chimeras and performed the flow cytometry. L.F.S. performed the EMSA and the yeast two hybrid experiments. J.L. performed the microscopy for nuclear localization analysis. E.M.O. and B.J.B. performed experiments and analyses for the revision. Y.Z. and W.Z. performed the PBM experiments with input from R.G. J.Z. and T.H.C. established critical methods. X.Z. generated preliminary results for the work. N.J.S. gave input on and performed BLI experiments. L.F.S., Y.Z., and B.Z.H. analyzed the data. L.F.S. and B.Z.H. wrote the manuscript with edits from all co-authors.

Corresponding author

Correspondence to Bin Z. He.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks the anonymous reviewers for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review file

Description of Additional Supplementary Files

Supplementary Data 1

Reporting Summary

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Snyder, L.F., O’Brien, E.M., Zhao, J. et al. Divergence in a eukaryotic transcription factor’s co-TF dependence involves multiple intrinsically disordered regions. Nat Commun 16, 5340 (2025). https://doi.org/10.1038/s41467-025-59244-w

Download citation

Received: 28 August 2024
Accepted: 13 April 2025
Published: 18 June 2025
Version of record: 18 June 2025
DOI: https://doi.org/10.1038/s41467-025-59244-w