Transcriptional stochasticity reveals multiple mechanisms of long non-coding RNA regulation at the Xist–Tsix locus

Kesler, Benjamin K.; Adams, John; Neuert, Gregor

doi:10.1038/s41467-025-59496-6

Download PDF

Article
Open access
Published: 07 May 2025

Transcriptional stochasticity reveals multiple mechanisms of long non-coding RNA regulation at the Xist–Tsix locus

Nature Communications volume 16, Article number: 4223 (2025) Cite this article

4975 Accesses
4 Citations
10 Altmetric
Metrics details

Subjects

Abstract

Long noncoding RNAs (LncRNAs) are increasingly recognized as being involved in human physiology and diseases, but there is a lack of mechanistic understanding for the majority of lncRNAs. We comparatively test proposed mechanisms of antisense lncRNA regulation at the X-chromosome Inactivation (XCI) locus. We find that due to stochasticity in transcription, different mechanisms based on the act of transcription regulate Xist and Tsix at different levels of nascent transcription. At medium levels, RNA polymerases transcribe Xist and Tsix on each strand at the same transcription site and deposit significant amounts of the histone mark H3K36me3, which inhibits Xist. At high levels of nascent transcription, many RNA polymerases transcribe Xist or Tsix resulting in transcriptional interference. Therefore, lncRNA expression variability is not just a quirk of transcription but an important aspect of regulation that allows multiple mechanisms to be employed by the same gene locus within the same cell population.

Transcription regulation by long non-coding RNAs: mechanisms and disease relevance

Article 19 January 2024

XIST dampens X chromosome activity in a SPEN-dependent manner during early human development

Article Open access 04 June 2024

Chd8 regulates X chromosome inactivation in mouse through fine-tuning control of Xist expression

Article Open access 15 April 2021

Introduction

Protein-coding genes make up only a small portion of the human genome while transcribed noncoding genes encompass a much larger fraction of the human genome^1,2. In comparison to protein-coding genes, most of these noncoding genes have unknown functions and mechanisms^2,3, though many of them are mutated or misregulated in diseases^3,4,5. Previous studies have proposed various mechanisms for noncoding RNA functions. These mechanisms fit into two broad categories: (i) transcript-based mechanisms by which the noncoding RNA molecule regulates a target gene after transcription, and (ii) transcription-based mechanisms by which the noncoding RNA regulates the target gene through the act of transcription (Fig. 1).

**Fig. 1: The proposed mechanisms for Tsix regulation of Xist.**

Some transcript-based mechanisms involve the direct binding of a noncoding RNA to a target gene transcript through complementary base pairs. This binding can then recruit proteins that recognize double-stranded RNA, such as Dicer with noncoding microRNAs⁶ (Fig. 1a), or the noncoding RNA can obstruct an RNA-binding protein from binding to its target, as proposed for conserved noncoding antisense transcript for β-secretase-1 (BACE1-AS), which is defined as an RNA transcribed from the opposite DNA strand of BACE1⁷ (Fig. 1b). Another transcript-based mechanism is for the noncoding RNA transcript to recruit and guide a regulatory protein to the specific genomic location, as proposed for the long non-coding antisense RNA transcript overlapping the INK4b/ARF/INK4a locus (ANRIL)⁸ (Fig. 1c).

Several mechanisms instead involve the act of transcription, and one example is transcriptional interference (TI). TI occurs when polymerases physically collide upon simultaneous antisense transcription, resulting in the expression of only the sense or antisense transcript, as seen in galactose-inducible genes in yeast cells⁹ (Fig. 1d). Another proposed mechanism is that the transcription of the noncoding RNA results in RNA Polymerase II-mediated recruitment of chromatin remodeling proteins such as Su(var)3-9, Enhancer-of-zeste and Trithorax (SET2 & SET3) in yeast cells¹⁰ or SETD2 in mice¹¹. These histone lysine methyltransferase proteins change the chromatin environment and regulate the target gene¹².

Each of these mechanisms requires that the noncoding RNA and its associated proteins express and colocalize to regulate their target in the same cell. However, standard cell population assays such as RNA sequencing and chromatin or RNA immunoprecipitation cannot distinguish between cells and do not quantify the exact binding frequency¹³. This can result in a misinterpretation of regulatory relationships and mechanisms because co-expression seen at the population level might reflect expression in different cells, binding events can occur at frequencies too low to be biologically relevant, or the proposed regulatory RNAs, proteins, or chromatin environment might not be associated with regulation of the target at the single-cell level. Considerable progress has contributed to achieving single-cell resolution for sequencing-based assays, but they still lack sensitivity required to reliably quantify noncoding RNA in a single cell².

Additionally, many previous studies utilize a variety of mutations to investigate lncRNAs of interest^14,15,16,17. Mutations are powerful tools for studying lncRNAs but also have potential problems. They can have off-target effects, such by inadvertently targeting an entirely different locus¹⁸ or even disrupting an unknown transcript or gene regulatory element within the same locus¹⁹. Investigating lncRNAs with their native sequences intact is beneficial to avoid these potential problems.

As we have shown previously, single-cell methods together with well-designed experiments can interrogate noncoding genes and their regulation in single cells simultaneously and without genetic intervention at high resolution, overcoming many of the limitations of cell population and single-cell genomic measurements mentioned above^10,20. However, nobody has used these quantitative methods to compare several proposed mechanisms against each other before.

As a proof of principle, we focus on X Inactive-Specific Transcript (Xist)²¹ and its antisense transcript Tsix^16,22 in female mouse (Mus musculus) embryonic stem cells for which we agnostically tested several of the proposed mechanisms of Xist and Tsix mutual regulation. Xist and Tsix belong to a category of long noncoding RNAs (lncRNAs) called antisense lncRNAs^2,23. There is also evidence that two other transcripts, RepA²⁴ and XistAR¹⁹ are present at the locus and might also be involved in antisense lncRNA regulation. Several studies estimated that antisense RNAs account for 70% of all transcripts in human cells²⁵. They are ubiquitous in eukaryotes¹ and prokaryotes²⁶, and they are implicated in diseases^27,28.

This antisense lncRNA pair is of interest because Xist is necessary for X chromosome inactivation (XCI)²⁹. Abnormalities in XCI are associated with different cancers^30,31 and major affective disorders such as bipolar disorder and depression³². The advantage of the Xist/Tsix system is that many studies have proposed different mechanisms of lncRNA gene regulation at this locus. A limitation of this system is that it is a sense/antisense pair that only covers a subclass of lncRNAs. However, it is a useful example to test many of the proposed mechanisms of noncoding RNA regulation through a rigorous quantitative framework of single-molecule and single-cell fluorescence microscopy and analysis. Although Xist and Tsix are among the most well-studied mammalian lncRNAs, and Tsix is an established regulator of Xist in mice³³, there are many proposed mechanisms for Tsix regulation of Xist, some of which have conflicting evidence or even a complete lack of evidence³³.

The proposed mechanisms for Tsix regulation of Xist fit in the same general classifications described above (Fig. 1). The mechanisms involved in this context are categorized into two types: those related to transcripts (Fig. 1a–c) and those concerning transcription (Fig. 1d, e). Transcript-based mechanisms encompass the RNA interference (RNAi) system, which can trigger the degradation of double-stranded RNA, a process that involves the binding of Xist and Tsix within the same cell (Fig. 1a)³⁴. Additionally, the binding of Tsix to Xist may block crucial proteins from associating with Xist, which is necessary for X-chromosome inactivation (XCI) (Fig. 1b)¹⁶. Tsix may also serve as a structural framework to attract proteins that suppress Xist’s function (Fig. 1c)³⁵. On the other hand, transcription-based mechanisms include transcriptional interference (TI), where collisions between polymerases can impede the transcription process (Fig. 1d)³⁶. Furthermore, certain proteins are known to attach to polymerase tails, leading to alterations in chromatin during the transcriptional activity (Fig. 1e)^{14,37,38,39,40}. In addition, Xist contains a repeat A region that is reported to give rise to a separate transcript²⁴.

In this work, we agnostically tested all the proposed mechanisms of Xist and Tsix mutual regulation by combining single-molecule RNA fluorescent in-situ hybridization (smRNA-FISH) (Supplementary Fig. 1, Supplementary Data 1), immunofluorescence, and quantitative expression and colocalization analyses in wildtype female mouse embryonic stem cells during differentiation. The proposed method can be applied broadly to test any antisense transcript for these mechanisms in the future. Furthermore, the specific analysis methods utilized in this study are suitable for other noncoding and protein-coding pairs involving colocalization and fluorescence quantification.

Results

Mechanisms based on transcripts: Tsix binding to Xist

The possibility of complementary base pairing between antisense lncRNAs such as Xist and Tsix has been proposed to result in inhibition through at least two possible mechanisms: transcript degradation through RNA interference (RNAi) machinery (Fig. 1a)³⁴ and masking of a critical region (Fig. 1b)²⁴. RNAi recognizes and processes double-stranded RNA (dsRNA) with Dicer and degrades sequences matching the dsRNA (Fig. 1a)⁴¹. Therefore, any antisense RNA could potentially act through this mechanism. A previous study suggested that Tsix and Xist form a duplex processed by Dicer to inhibit Xist³⁴. However, other studies showed conflicting evidence as to whether Tsix inhibits Xist by this mechanism^24,42,43,44. Sense-antisense binding could instead inhibit Xist through masking, because Tsix’s exonic sequences overlap with a section of Xist that is critical for XCI called the Repeat A region (Fig. 1b)²⁴. However, there is no experimental evidence for or against this mechanism. Furthermore, the Repeat A region encodes a separate transcript²⁴. Although earlier studies showed Xist and Tsix signals in the same cells^22,45, none of these previous studies demonstrated simultaneous colocalization of individual Tsix and Xist transcripts within the same cells and outside the transcription site, which would be necessary for either of these mechanisms.

To address this gap in knowledge, we utilized smRNA-FISH to investigate the single-cell expression and colocalization of Xist and Tsix. smRNA-FISH requires chemical cross linking, cell permeabilization, and hybridization of complementary fluorescently-labeled oligonucleotides to detect RNA molecules (Supplementary Fig. 1 and 2, Supplementary Data 1)^46,47. We utilized undirected differentiation of female mouse embryonic stem cells (mESCs) as an ex vivo system to recapitulate the early stages of Xist regulation and quantify thousands of cells. To achieve high expression of Xist, cells are grown in 2i media to stay in the pluripotent state (0.5 % of Xist cloud positive cells) (Fig. 2a, Supplementary Fig. 3) and then non directionally differentiated for two days by removing 2i media (52 % of Xist cloud positive cells) (Fig. 2a), which consistently resulted in high Xist expression in our experiments and previously published results using the same system and cell line (Supplementary Figs. 4 and 10)^48,49. We chose the day 2 time point because both Xist and Tsix show variable expression, which is necessary to analyze their co-expression within the same cell and allele. Time points before day 2 show less Xist expression because Xist is not upregulated yet, making it significantly harder to analyze both Xist and Tsix co-expression. Likewise, time points after day 2 show less expression variation in Tsix because Tsix gets repressed. To visualize whether spliced Tsix binds to spliced Xist, we designed three sets of RNA-FISH probes (Fig. 2b, Supplementary Fig. 2, Supplementary Data 1): (i) probes targeting the Repeat A region (A, RepA probes), (ii) probes targeting exons of Xist outside the Repeat A region (Xist Exon probes), and (iii) probes targeting exons of Tsix outside of the region that overlaps with Xist (Tsix Exon Probes). Because the Xist Exon probes and Tsix Exon probes target the regions of Xist and Tsix that are not complementary to each other, they can bind to mature Xist and Tsix even if the antisense lncRNA pair are hybridized together (Fig. 2b insert, left). As an example of a colocalized signal in RNA FISH is the hybridization of probes to Xist and RepA (Fig. 2b insert, right, Supplementary Figs. 11–14). Although we expected colocalization between Xist and RepA, previous studies showed that colocalization is not perfect⁴⁶. Therefore, if Tsix binds to Xist in mESCs, we expect Xist and Tsix spots to colocalize (Fig. 2c, left). If Tsix does not bind Xist, we expect Tsix and Xist spots to not colocalize. An example of RepA and Xist spots colocalize is shown in Fig. 2c, right.

**Fig. 2: Lack of Xist and Tsix colocalization disproves mechanisms involving their binding.**

Imaging Xist and Tsix expression showed minimal colocalization of Xist and Tsix transcripts (Fig. 2d, Supplementary Figs. 4, 6, 7) and high colocalization of signal from Xist Exon and RepA probes (Fig. 2e, Supplementary Figs. 11, 13, 14). Bright spots of Xist and Tsix that colocalize in the nucleus are likely transcription sites, which have greater intensity due to multiple polymerases transcribing RNA (Fig. 2f). We quantified our data by automatically segmenting cells and nuclei⁵⁰ (Supplementary Figs. 1, 5, 7, 12, 14, 21, 23, 25, 27, 31, 33). After cell segmentation, we applied a semi-automated image analysis pipeline to identify and quantify spots, transcription sites and Xist clouds⁵¹ (See Materials and Methods, Supplementary Fig. 1). Xist clouds form due to clustering of transcripts on the X chromosome away from the transcription site. After RNA spot detection, we applied Particle Image Cross-Correlation Spectroscopy (PICCS)⁵² to quantify average colocalization events across different radii (Fig. 2g). PICCS typically plots average colocalization versus distance squared or area for 2D data. To adapt PICCS to our 3D data, we instead plotted colocalization events as a function of the distance cubed or volume (Fig. 2h, solid line). To account for random colocalization events (Fig. 2g), PICCS fits a line to the data far from where real colocalization occurs (Fig. 2h, dashed line). The y-intercept of this line is the colocalization fraction after subtracting random colocalization events. As a control, we also utilized randomly-generated spots throughout each nucleus (Fig. 2g) and quantified colocalization (Fig. 2h, dotted line). The y-intercept of this control should be zero, indicating no colocalization above what would be expected by chance. With this quantification scheme, we determined colocalization of Xist with RepA and Xist with Tsix at both day 1 and day 2 of differentiation (Fig. 2i, j). At both timepoints, we found low colocalization (0.003 and 0.005 at days 1 and 2 respectively) between individual Xist and Tsix molecules, indicating that they do not bind each other very often. Conversely, we found high colocalization (approximately 0.7) between the Xist Exon and RepA spots at both timepoints. As expected, random spots did not show colocalization (Fig. 2i, j, dotted lines). Together, these results provide strong evidence against Tsix and Xist binding for the majority of transcripts, which most likely precludes RNAi machinery (Fig. 1a) or transcript masking (Fig. 1b) being involved in Tsix regulation of Xist. The hypothesis that RNAi-induced rapid RNA degradation is responsible for non-detection is not supported by our prior findings. We have established that RNA-FISH can effectively identify RNA molecules that have lifetimes of less than three minutes^53,54, suggesting that RNAi degradation speed should not hinder detection.

Mechanisms based on transcripts: Xist and Tsix acting in trans

lncRNAs that recruit regulatory factors (Fig. 1c) can diffuse throughout the cell and regulate other genomic loci (also known as working in trans)². Previous studies showed a lack of trans inhibition of Xist by Tsix with mutants artificially overexpressing Tsix from specific alleles^15,55. However, whether these results would change in the wildtype context is unclear. Our data showed that wildtype Tsix can diffuse throughout the cell, potentially allowing Tsix to regulate Xist in trans (Fig. 3a). If Tsix regulates Xist in trans, then high levels of Tsix would reduce whole-cell levels of Xist and cause negative correlation between the two transcripts (Fig. 3b, left). Conversely, there would be little to no whole-cell correlation if there is no trans regulation (Fig. 3b, right). Therefore, we tested whether there was negative correlation between mature and dispersed Xist and Tsix transcripts. We quantified Xist and Tsix at day 2 of differentiation in three additional biological replicas to determine the co-expression of Xist and Tsix within the same cells (Supplementary Figs. 3, 4-10, 15). To determine the number of transcripts in Xist clouds, we divided the total intensity in the cloud by the intensity of a single transcript (Supplementary Fig. 16). Although the distribution had an L-shaped pattern (Fig. 3c), which often indicates mutual exclusivity, this shape can occur even with uncorrelated data (Supplementary Fig. 17). The Spearman test, a nonparametric test of correlation, showed no statistically significant correlation for two out of three replicas (Fig. 3c, white text). As a control, we also used the same dataset but randomly permuted the single-cell relationship between the two genes. Critically, none of the replicas significantly differed statistically from randomized data when evaluated by the two-dimensional Kolmogorov-Smirnov test⁵⁶ (Fig. 3d). Therefore, our data and analyses argue against a regulation mechanism in trans.

**Fig. 3: Lack of whole-cell negative correlation between Xist and Tsix argues against mechanisms involving transcript recruitment.**

Cis-acting Mechanisms at the Site of Transcription

When lncRNAs regulate genes locally on the same allele, they are said to work in cis^2,57. Some examples of cis mechanisms such as transcriptional interference or chromatin modification regulate through the act of transcription, (Fig. 1d, e). Previous studies utilized multiple mutagenesis strategies including deleting Tsix’s promoter region, inserting a strong promoter to drive Tsix, or prematurely stopping Tsix transcription to demonstrate that Tsix regulates Xist in cis on the same allele^15,17,22,55. However, they did not investigate individual wildtype transcription sites quantitatively in single cells to show that Tsix is regulating Xist in cis. Because transcription sites are identifiable as spots brighter than individual transcripts, we were able to quantify the presence of Xist and Tsix specifically at the transcription sites and their correlation using the data generated in the previous section (Fig. 4a, Supplementary Figs. 4-7, 9, 10, 16, 18). If there is cis-acting regulation, there would be negative correlation between Tsix and Xist Exon signal at the site of transcription (Fig. 4b, left). If there is no cis-acting regulation, there would be no negative correlation and many instances of co-transcription (Fig. 4b, right). We investigated the transcription sites with the same quantitative framework as trans-acting mechanisms (Fig. 3). With this framework, we found strong negative correlation in all replicas (Fig. 4c). Every replica was also significantly different from the randomized dataset (Fig. 4d). This evidence strongly suggests that Tsix regulates Xist in cis.

**Fig. 4: Allele specific quantification of transcripts at the site of transcription suggests a transcription-based mechanism.**

Transcriptional interference of Xist and Tsix at high nascent transcription

Transcriptional Interference (TI) through antisense polymerase collision is one possible mechanism for regulation in cis (Fig. 1d)^58,59. In this model, polymerases that initiate from converging transcription units physically collide, resulting in premature polymerase termination. This model proposes a mutually exclusive expression pattern of sense and antisense transcription in overlapping regions where TI occurs, while not affecting non-overlapping regions. Without TI, overlapping regions display co-transcription^60,61. A recent study investigating differential effects between overlapping and non-overlapping regions of Xist and Tsix suggests that TI affects the Tsix/Xist locus to cause mutual inhibition, but the study utilized a strong artificial and inducible promoter to drive Xist³⁶. It is therefore unclear whether TI is prevalent with the native promoters of Xist and Tsix intact. To distinguish between TI and other mechanisms in wildtype cells, we designed intronic probes at the Xist and Tsix locus, which allowed us to visualize nascent transcription in separate experiments (Supplementary Fig. 19. We performed these experiments on day 2 of differentiation, when there is sufficient Xist expression (Fig. 2a, Supplementary Figs. 20-27).

We designed interlaced intronic probes targeting Xist and Tsix within the Xist transcriptional unit (Xist Intron and Tsix 3’ Intron probes, respectively) to quantify nascent expression from the same genomic location (Fig. 5a, Supplementary Data 1). If TI restricts overlapping co-transcription, we only expect to observe nascent transcripts of either the Xist intron or the Tsix 3’ intron but never signal from both transcripts (Fig. 5b, left). If TI is not occurring, we expect polymerases from both promotors to read through the same genomic location. This phenomenon will result in colocalization of the Xist intron and Tsix 3’ intron nascent transcripts (Fig. 5b, right). In our microscopy experiments, we observed colocalization of Xist intronic and Tsix 3’ signal (Fig. 5c, Supplementary Figs. 16, 20–23). This result indicates that overlapping Xist and Tsix regions can co-transcribe.

**Fig. 5: Co-transcription and Transcriptional Interference (TI) at the Xist/Tsix Locus.**

To better understand the relationship between TI and co-transcribing polymerases, we utilized smRNA-FISH with both Xist Intronic probes and Tsix 5’ Intron probes (Fig. 5d). If TI significantly affects the locus, we would expect more colocalization of Tsix 5’ Intron signal with Xist Intron signal compared to the previous data with Tsix 3’ Intron and Xist Intron (Fig. 5e, left). If there is no TI, we would expect similar colocalization as with Tsix 3’ (Fig. 5e, right). Initial visualization showed similar colocalization of Tsix 5’ Intron signal with Xist Intron signal (Fig. 5f, Supplementary Figs. 16, 24–27).

Initial analysis of the joint expression of both Tsix 5’ and Tsix 3’ nascent transcription showed negative correlation with Xist nascent transcription, and both were significantly different from random (Fig. 5g, h, Supplementary Figs. 28–31). There did not seem to be a significant difference in the negative correlation values overall (rho = -0.58 for Tsix 5’ versus rho = -0.64 for Tsix 3’). To better understand the source of this negative correlation, we analyzed the significant variability in nascent transcription, which is a consequence of stochastic gene expression, a fundamental property of single cells (Supplementary Fig. 18). This prompted us to utilize this natural variability in Xist and Tsix expression and analyze colocalization with different levels of nascent transcription to gain nascent transcription dependent insights into transcriptional interference in addition to negative correlations.

For different nascent transcription levels, we quantified the colocalization fraction of transcription sites using PICCS. Xist transcription sites with low nascent transcription ( < 2 nascent transcripts) had low colocalization with Tsix 3’ nascent transcription (0.11, Fig. 6a, Supplementary Fig. 32). However, low-expressing Xist sites also had low colocalization with Tsix 5’ nascent transcription (0.15, Fig. 6b, Supplementary Fig. 32). These colocalization fractions were not significantly different from each other statistically when evaluated by the Cochran-Mantel-Haenszel test, an inferential test for the association between two binary variables⁶² (p = 0.16), suggesting that TI is not a significant regulation mechanism at low levels of nascent transcription. When nascent Xist is at medium levels (2-8 nascent transcripts), there was a significantly higher colocalization fraction with both Tsix 3’ nascent transcription (0.31, Fig. 6c, Supplementary Fig. 32) and Tsix 5’ nascent transcription (0.4, Fig. 6d) compared to the fractions when nascent Xist was at low level. These colocalization fractions were still not significantly different from each other statistically (p = 0.23), suggesting that TI is not significant at medium levels of nascent transcription. At high levels of nascent transcription ( > 8 nascent transcripts), Xist transcription sites colocalized less with Tsix 3’ transcription (0.12, Fig. 6e, Supplementary Fig. 32), while colocalization with Tsix 5’ transcription remained high (0.31, Fig. 6f, Supplementary Fig. 32). These colocalization fractions were significantly different from each other statistically (p = 0.02), which is consistent with predictions from TI³⁶. Another prediction for TI is low processivity due to collision with Xist polymerases, which means that Tsix will sometimes transcribe through its 5’ region but not the 3’ region that overlaps Xist. In other words, in an experiment that visualizes both the Tsix 5’ and Tsix 3’ nascent transcription (Supplementary Figs. 28–31, 33), we expect to see transcription sites that only have Tsix 5’ signal if TI is occurring. We saw this indication of low processivity when analyzing our data, especially when Tsix was lowly expressed (Supplementary Fig. 33). Overall, these results support that TI is contributing to antisense regulation at the Xist/Tsix locus, but only at high levels of nascent transcription. This data also shows a prevalence of antisense co-transcription, especially at medium levels of nascent transcription.

**Fig. 6: Co-transcription and Transcriptional Interference (TI) at the Xist/Tsix Locus.**

Chromatin Modification during medium transcription

Another possible mechanism for gene regulation by lncRNAs is recruitment of factors involved in transcription regulation by the polymerase machinery (Fig. 1e). For instance, the C-terminal domain (CTD) of pol II recruits SET proteins that tri-methylate the 36th lysine residue of the histone H3 protein (H3K36me3)^10,12,63. This histone mark can then recruit factors such as histone deacetylase Rpd3 in yeast or DNA methyltransferases 3A and 3B in mammals to repress transcription^64,65. In the Tsix/Xist system, Tsix induction or premature termination alters chromatin modifications including H3K36me3 at the Xist promoter^37,38,39,40. Recent studies detailed H3K36me3 dynamics upon Tsix induction⁴⁰ and found that knockdowns of Setd2 caused Xist upregulation³⁹. These studies suggest that Tsix establishes H3K36me3 to inhibit Xist. However, these studies did not measure chromatin modifications and RNA expression at the locus in the same single cells, which is critical to establish a direct relationship. In additional experiments, we utilized quantitative immunofluorescence (IF) and smRNA-FISH in the same cells to spatially correlate chromatin state through H3K36me3 with nascent Xist and Tsix expression in wildtype cells (Fig. 7a, Supplementary Figs. 34–36). The antibody for H3K36me3 was diluted so that individual spots could be detected (Supplementary Figs. 34-36). Random spots were again generated in each nucleus as a control.

**Fig. 7: Xist and Tsix transcription is associated with different enrichment levels of H3K36me3.**

Because H3K36me3 is deposited by transcription and no known transcription units overlap with the Tsix promoter, we expect higher Tsix transcription to be positively correlated with enrichment of the mark (Fig. 7b, bottom). If H3K36me3 deposition inhibits Xist, transcription sites with low levels of Xist should have more enrichment of the mark nearby, and transcription sites with high levels of Xist should have less enrichment of H3K36me3 (Fig. 7b, top left). If H3K36me3 does not inhibit Xist, then we would expect the opposite relationship, because more transcription of Xist would deposit more of the mark in the gene body (Fig. 7b, top right).

To test these predictions, we quantified fold-enrichment of H3K36me3 localization to transcription sites compared to random spots in the same nuclei (Fig. 7c). In this way, we can visualize the enrichment of histone mark localization at specific distances away from the site of transcription. Similarly to our quantifications with TI, we investigated effects for low, medium, and high levels of nascent transcription at day 2 when Xist is expressed (Fig. 7d–i, Supplementary Figs. 37-39).

We tested the prediction that Tsix transcription positively correlates with enrichment of H3K36me3. All Tsix transcription sites showed enrichment of H3K36me3 (Fig. 7d–f, Supplementary Fig. 37). Interestingly, Tsix sites with medium-level transcription showed the highest levels of enrichment of the chromatin mark (Fig. 7e). This positive correlation between Tsix transcription and H3K36me3 is consistent with the hypothesis that Tsix transcription deposits H3K36me3.

We then tested the prediction that Xist levels negatively correlate with H3K36me3 at the site of transcription. Low and medium levels of nascent Xist expression were associated with enrichment of H3K36me3 (Fig. 7g, h), while sites with high nascent Xist exhibited levels of H3K36me3 significantly below random (Fig. 7i). Notably, no H3K36me3 spots were detected within 0.25 µm of highly transcribing Xist sites (Fig. 7i, first bin). For day 0 of differentiation when Xist is largely repressed, there was high enrichment of H3K36me3 (Supplementary Fig. 40). This negative correlation between Xist transcription and H3K36me3 is consistent with the hypothesis that H3K36me3 inhibits Xist, a pattern that was previously observed in allele-specific CUT & TAG population experiments and is not observed in housekeeping genes (Supplementary Figs. 41–48, Supplementary Data 2)⁶⁶. We then compared these results to experiments where Setd2 was inhibited during two days of differentiation. We quantified the number of H3K36me3 spots in the cells, the intensity of Tsix and Xist intronic signals, and the fraction of cells with nascent Xist and Tsix expression (Supplementary Fig. 49). Our findings showed that Setd2 inhibition reduces the number of H3K36me3 spots in cells (Supplementary Fig. 49a, b), which in turn decreases the intensity of nascent Tsix expression (Supplementary Fig. 49c, d). However, there was no reduction in nascent Xist expression (Supplementary Fig. 49e, f). Interestingly, we observed an increase in the fraction of cells with nascent Xist expression upon Setd2 inhibition (Supplementary Fig. 49g), while no change was detected in the fraction of cells with nascent Tsix expression under the same conditions (Supplementary Fig. 49h).

Because Tsix and Xist can co-transcribe in the same locus (Fig. 5), we then asked how much H3K36me3 was at transcription sites with different levels of both Tsix and Xist nascent transcription. To be considered the same transcription site, Xist and Tsix signal had to be within 0.75 µm (Supplementary Fig. 50), which is less than the 1 µm minimum distance found with X chromosome pairing⁶⁷. Surprisingly, we observed the strongest and most significant enrichment of H3K36me3 when Xist and Tsix co-transcribed at medium levels (Fig. 7j, k, center bin, Supplementary Fig. 37). Our findings suggest that medium levels of transcription, which allows for the maximum amount of overlapping co-transcription (Fig. 6c, d), coincide with the most deposition of histone marks when this co-transcription occurs.

Overall, our results show different mechanisms of regulation by lncRNAs at the Xist/Tsix locus depending on the level of nascent transcription. At high levels of transcription, TI causes inhibition in overlapping transcriptional regions. At medium-level transcription, antisense regions can co-transcribe and deposit maximum histone marks for chromatin regulation.

Discussion

In this study, we established a methodology to test mechanisms of gene regulation by lncRNAs and applied it to the Xist/Tsix system. With our quantitative experimental and analytical framework, we tested multiple mechanisms by investigating single cells and utilizing the natural variability of stochastic gene expression in the system to probe the effects of different expression levels. These different levels of nascent transcription had a profound impact on what mechanisms of regulation were being utilized.

Multiple proposed mechanisms of Tsix regulation of Xist involved binding of the antisense pair (Fig. 1a, b), but it was unknown to what extent they bound in cells. This gap in knowledge resulted from previous studies using either population-based measurements or long RNA-FISH probes that could not detect individual Xist and Tsix molecules^22,36. Our smRNA-FISH data analyzed with PICCS revealed that Tsix almost never bound to Xist in mESCs, providing strong evidence against any mechanisms involving their binding (Fig. 2). Our analysis, which shows a high degree of colocalization, co-expression, and correlation between Xist and RepA, suggests that RepA is not an independent transcript but instead a segment of the Xist transcript (Fig. 2, Supplementary Figs. 8, 11, and 13). It is improbable that rapid RNA degradation via RNAi would prevent detection in our assays. Previous research has demonstrated that RNA-FISH is capable of detecting even short-lived RNA species with lifespans shorter than three minutes^53,54.

Our single-cell quantification of Xist and Tsix also determined that whole-cell levels of Xist and Tsix were not correlated (Fig. 3). The lack of correlation provided additional evidence against trans regulation by these lncRNAs at the locus. It was essential to employ rigorous statistical methods because simple visualization or incorrect statistical tests can suggest correlations that are not present (Supplementary Fig. 6). In contrast to previous studies, our methodology did not necessitate the production of mutants that can introduce artifacts leading to misinterpretation of results. Similar analyses can be done on any two genes if there is single-cell variability in the expression of the genes of interest.

Identifying transcription sites as bright RNA-FISH signals in the nucleus also enabled us to test, even with exonic probes, whether the lncRNAs regulated each other in cis. Regulation in cis had been tested with mutants of specific alleles altering Tsix transcription^15,17,22,39, but it had not been tested by quantification of nascent transcription in wildtype cells. We saw a strong negative correlation between Xist and Tsix at the site of transcription, which provided additional evidence that regulation was occurring in cis (Fig. 4). Analysis of signal from intronic RNA-FISH probes further supported that regulation occurs in cis (Figs. 5 and 6) and prompted us to investigate mechanisms involving the act of transcription.

One possible mechanism of transcription-mediated regulation by lncRNAs is Transcriptional Interference (TI) (Fig. 1d). A previous study suggested that TI is an active mechanism at the Xist/Tsix locus³⁶. However, they utilized a doxycycline-inducible system to control Xist expression instead of the native promoter. We instead took advantage of the natural variability in RNA expression and found that TI only occurs at high levels of nascent transcription (Figs. 5 and 6). This result is consistent with the previouse findings, as inducible promoters tend to be strong and would mimic the cells with high nascent transcription³⁶.

Interestingly, we found that medium-level transcription was associated with the highest co-transcription of Xist and Tsix (Fig. 6c, d). Because evidence suggests eukaryotic polymerases cannot pass each other⁵⁸, antisense co-transcription in overlapping regions has been explained by polymerases completing transcription during short times when the opposite strand is unoccupied. Completed transcripts are then retained for some time nearby⁶¹. Our results are consistent with this explanation since the opposite strand would be unoccupied for longer lengths of time during medium transcription levels compared to lengths during high levels. The low colocalization at weakly-transcribing transcription sites (Fig. 6a, b) might be explained by lower local concentration of transcription machinery. Our discovery that antisense co-transcription peaks at moderate nascent transcription levels is a starting point to determine if this observation applies to other antisense transcripts.

Another possible mechanism of transcription-mediated regulation by lncRNAs is through chromatin modification (Fig. 1e)^2,3. By quantifying H3K36me3 in the vicinity of transcription sites using quantitative immunofluorescence and smRNA-FISH, we found enrichment of H3K36me3 at Tsix transcription sites and depletion of H3K36me3 at highly transcribed Xist sites (Fig. 7). These results are consistent with the hypothesis that Tsix deposits H3K36me3 to inhibit Xist in single cells^39,40.

We found that maximum enrichment of H3K36me3 occurs when Tsix and Xist are co-transcribed at medium levels. This unexpected outcome may be a function of varying local concentrations of polymerases and histones as illustrated in Fig. 8. While higher nascent transcription levels would have more polymerases to recruit chromatin modifiers, strong transcription can also remove histones and create nucleosome-depleted regions (Fig. 8, top and bottom)⁶⁸. In contrast, we speculate that medium transcription levels enable polymerases to transcribe around histone proteins without removing them⁶⁹. As a result, the highest local density of modifiable histone proteins is observed at medium transcription levels (Fig. 8, middle). When both Xist and Tsix are transcribing, there is also a higher density of polymerases to recruit chromatin modifiers (Fig. 8, middle). Although transcription sites with medium Xist and Tsix levels represent a small subset of all detected transcription sites, they might signify a crucial temporary state for establishing H3K36me3 that persists, allowing for further regulation. Through Setd2 inhibitor studies (Supplementary Fig. 49)⁷⁰, we observed that the intensity of Tsix transcription sites decreased, suggesting that the number of polymerases transcribing Tsix is reduced. However, the number of cells initiating Tsix transcription did not change. In contrast, while the intensity of Xist transcription sites remained unchanged during Setd2 inhibition, the fraction of cells exhibiting nascent Xist expression increased. This indicates that Setd2 regulates the frequency of polymerase initiation responsible for transcribing Xist. In summary, our methodology provided single-cell evidence for Xist inhibition involving Tsix-induced H3K36me3 and identified an interaction with medium-level co-transcription that would have been undetectable in population-level experiments or those with artificial promotors.

**Fig. 8: Natural variability of transcription enables different mechanisms of Xist and Tsix regulation within a population of cells.**

Our quantitative, high-resolution methodology contributes critical insights into the regulation at the Xist and Tsix locus despite it being a focus of significant research for over two decades. We were able to test a variety of mechanisms in a single study. Since this methodology does not necessitate the generation of mutants, it is a promising way to rapidly investigate the many other antisense transcripts whose function and mechanism remain unknown in their native context. Our methodology can also be used to test other mechanisms involving RNA-RNA interactions. Overall, our results support two different regulatory mechanisms within the same population of cells: chromatin modification at medium transcription and TI at high transcription (Fig. 8).

Some limitations of this study are that we only quantified snapshots of different cells instead of following the same cells over time, so we do not know when or how long different states of transcription occur. Future studies utilizing timelapse microscopy could lead to complementary insights⁶¹. Although we did not find any evidence of Xist and Tsix binding together, we cannot exclude that Xist and Tsix binding results in a short-lived complex that is only detectible in small amounts in our assay. However, previous studies suggest that the timescale of double-stranded RNA degradation by Dicer is in minutes⁷¹, which should make the complex more visible in our data. Xist and Tsix might also bind to each other at the site of transcription only but then dissociate later. Although we determined locations and distances in three dimensions, we did not determine the precise genomic locations of the nascent transcripts and histone marks.

Our findings have general implications for future studies or therapeutics targeting lncRNAs because the lncRNA of interest might have multiple mechanisms, which could confound results or lead to suboptimal outcomes. For instance, strategies such as antisense oligonucleotides (ASOs) that lower RNA expression are promising avenues for disease treatments⁷². However, lowering the expression of a lncRNA might only eliminate mechanisms that are employed at high transcription levels (such as TI at the Xist/Tsix locus), while retaining other mechanisms requiring lower transcription (such as chromatin modification at medium levels of Tsix). Similarly, inhibiting a protein such as Setd2⁷⁰, which causes H3K36me3 and is a target of recent clinical studies^11,12, might only affect some of the mechanisms of regulation and may not interfere with TI at high expression. Our study suggests that a combinatorial approach targeting multiple mechanisms might be necessary for effective future mechanistic lncRNA studies and the development and application of therapeutics.

Methods

Experimental methods

Cell culture

F1 2-1 mESCs¹⁷ were obtained from the lab of Dr. Rudolf Jaenisch and validated by karyotyping. The F1 2-1 mESCs were thawed with 1 million seeded cells on 75 cm² tissue culture flasks with vented caps (Falcon 353110) gelatinized with EmbryoMax 0.1% Gelatin Solution (Millipore ES-006-B) for 30 minutes at 37 °C. Cells were grown with 2i media composed of: DMEM with high glucose (Life Technologies 11960-044), 0.5x N2 Supplement (Gibco 17502-048), 0.5x B27 supplement (Gibco 17504044), 25 mM HEPES (Gibco 15630-030), 0.5x MEM NEAA (Life technologies 11140-050), 1x (100 U/mL) Penicillin-Streptomycin (Gibco 15140122), 100 μM 2-mercaptoethanol (Life Technologies 21985-023), 1000 U/mL LIF (EMD Millipore ESG1106), 0.25x GlutaMAX^TM (Gibco, Catalog#: 35050-061), and 1x (1 mM) sodium pyruvate (Gibco 11360-070), 20 μg/mL human insulin (Sigma I9278-5ML), 1 μM (Sigma PD0325901), 3 μM (Sigma CHIR99021), 1000 U/mL LIF (EMD Millipore ESG1107) at 37 °C in a 5% CO₂ humidity-controlled environment for two passages before experiments. Media was changed every day, and cells were grown 3 days before passaging.

To begin undirected differentiation in adherent conditions, cells were washed twice with 1x PBS and dissociated from the flasks with accutase (Merck SF006). 1 million cells were then plated on 15 cm tissue culture dishes gelatinized with 0.1% Embryomax gelatin for 30 minutes at 37 °C. They were grown in differentiation media consisting of: DMEM with high glucose (Life Technologies 11960-044), 15% Heat Inactivated FBS (Gibco 16140-071), 25 mM HEPES (Gibco 15630-030), 1x MEM NEAA (Sigma M7145), 1x (100 U/mL) Penicillin-Streptomycin (Gibco 15140122), 100 μM 2-mercaptoethanol (Life Technologies 21985-023), 1x GlutaMAX^TM (Gibco, Catalog#: 35050-061), and 1x (1 mM) sodium pyruvate (Gibco 11360-070) at 37 °C in a 5% CO₂ humidity-controlled environment. Differentiation plates were split if >70% confluent.

Setd2 inhibitor studies were conducted using 500 nM of SETD2i (inhibitor EPZ-719, MedChemExpress) in 2i media during two days of differentiation, as previously described⁷⁰. RNA-FISH probes and H3K36me3 antibodies were used as outlined below.

Cell Fixation

mESCs were dissociated after washing with 1x PBS using 0.25x accutase when cultured in 2i media and 0.05% trypsin (Gibco 25300054) when in FBS-containing media. The cell suspension was centrifuged for 5 minutes at 300 g, washed with 1x PBS, and then fixed for 8-10 minutes at room temperature with a 3.7% formaldehyde solution in 1x PBS. The cells were washed twice with 1x PBS and then permeabilized with 70% ethanol at 4 °C for at least one hour, usually overnight.

DAPI, RNA-FISH, and IF staining

After cell permeabilization, use BSA coated tips to transfer cells to a low bind tube. After centrifugation and EtOH aspiration, resuspend cells in 100ul wash buffer (PBS with 10% formamide), centrifuge at 300g, aspirate wash buffer completely, and then resuspend cells in 50 ul hybridization buffer (10% dextran (Sigma Aldrich, D8906), 10 mM vandyl-ribonucleoside complex (NEB, S1402S), 0.02% RNAse-free BSA (Ambion, AM2616), 1mg /ml E.coli tRNA (Sigma, R4251), 2x SSC (Ambion, AM9763), 10% formamid (Ambion, AM9342)) that contains RNA-FISH probes (1:1000 dilution) and antibody if necessary and incubate at 37C overnight. After hybridization resuspend cells in 1ml wash buffer, centrifuge at 300g, aspirate solution, resuspend cells in 1ml wash buffer with DAPI (Sigma, D9564, 150 ng/ml) and secondary antibody if required. Incubate for 30min at 37C. Repeat wash step one more time without DAPI and antibodies⁵³. Primary antibodies were added at the same time as the RNA-FISH probes, and secondary antibody was added during the DAPI staining (see table for antibodies and concentrations). The primary antibody concentrations were determined with an optimization experiment for when individual spots were visible and minimal background fluorescence was detected (Fig. 7, Supplementary Figs. 34-36).

Microscopy

Cells were imaged with epifluorescence at 100x with a 500ms exposure time, at 300nm per z-step, from the top to the bottom of the cells⁵⁰. Micro-Manager software version 1.4 was used to control a Nikon Ti Eclipse epifluorescent microscope equipped with perfect focus (Nikon), a 100× VC DIC lens (Nikon), an X-cite fluorescent light source (Excelitas), and an Orca Flash 4v2 CMOS camera (Hamamatsu). Mammalian cells were on the ibidi 15 μ-Slide Angiogenesis (81506) in wells coated with 0.01% poly-D-lysine (Cultrex 3439-100-01) for 10 minutes. Fluorescent signal was imaged with Semrock Brightline filters for DAPI (DAPI-5060C-NTE-ZERO), CY5 (NIK-0013 RevA-NTE-ZERO), TAMRA (SpGold-B-NTE-ZERO), and AF488 (FITC-2024B-NTE-ZERO).

Computational methods

Cell segmentation

Nuclei were segmented in 3D and cells were segmented in 2D by utilizing CellDissect, an automatic segmentation pipeline⁵⁰. The DAPI signal of the nucleus was automatically thresholded to maximize the number of non-overlapping signals presenting individual nuclei. The nuclei are used as seeds to segment the cell boundary in 2D automatically.

Probe design

Sequences for Xist and Tsix were downloaded from the UCSC genome browser, and genomic sequences specific to the two background strains present in the cell line (129S1/SvImJ and Castaneous) were downloaded from the Mouse Genomes Project from the Sanger Institute (http://www.sanger.ac.uk/resources/mouse/genomes/). For the Xist, Tsix, and RepA exonic probes, Stellaris software (Biosearch technologies) was utilized with certain considerations for sequence input and downstream probe pruning (Supplementary Data 1). For Xist, the input sequence was designed to avoid the repeat sequences known to be important for Xist function, since secondary structure could inhibit probe binding. For Tsix, the region overlapping Xist was avoided. Since the RepA region overlaps with Tsix and is part of the Repeat A region of Xist critical for X chromosome inactivation, neither of these could be avoided. These sequences were input into the Stellaris software with the maximum masking level of 5 to generate 20mer probes with 3 base spacing. Probes with tandem repeats were removed by utilizing a tandem repeat finder (Tandem Repeats Finder Advanced Submit Page (bu.edu) on the original sequence (except for with the RepA probes), and probes with SNPs that were different between 129 and Cast backgrounds were avoided by comparing their genomic sequences. Custom MATLAB software that utilized online BLASTing and the “Mouse Genomic+Transcript” option determined the number of off-targets. Any probes with 19 or more matching bases of an off-target were excluded, and probes with lower numbers of off-targets were prioritized to be in the final probe set.

The intronic probes were designed using custom MATLAB software that utilized BLAST+ and a local BLAST database generated from the mm10 genome, mouse rRNAs from RNACentral.org, and mouse transcripts from RefSeq. Genomic and transcript sequences for Xist and Tsix were again downloaded from the UCSC genome browser, and the strain-specific sequences were again used from the Sanger institute. For Xist and Tsix 3’ intronic probes, the intronic regions of Xist were used for the probe design, again avoiding the SNPs that differ between 129 and Castaneous mouse strains. Probes were BLASTed against the mouse database, and probes were chosen based on minimal off-target binding. Probes with matches against Hsp90 and any rRNAs were excluded. All probes that had matches against highly expressed genes in differentiating mESCs with the same background (the top 465 expressed genes from the RNA-Seq dataset in⁴⁹ were also excluded. The resulting probes for Xist and Tsix 3’ were interlaced, with half designed against the Tsix strand and half against the Xist strand. The same procedure was performed for Tsix 5’ intronic probes, and the region with known DXPas34 transcription was avoided.

Probe creation

Probes were ordered from Biosearch Technologies with a 3’ Amine modification. All probes in a set were then pooled, conjugated to the appropriate dye, and then purified using HPLC. Xist Exon probes were coupled with CY5, RepA probes were coupled to TAMRA, Tsix Exon Probes were coupled to TAMRA, Xist Intron probes were coupled with TAMRA (for example in Supplementary Fig. 7) and CY5 (for experimental data), and Tsix 5’ Intron and Tsix 3’ Intron were coupled to TAMRA.

Image analysis

Data analysis was performed using custom MATLAB software. Spot detection for RNA-FISH and IF experiments was done using manual thresholding of each channel in images put through Gaussian (to remove noise) and Laplacian of a Gaussian (edge detection) filters. A single slice was used for visualization during thresholding for the IF images due to high densities of spots, while a maximum intensity projection of slices within the bottom and top 10 slices was used for the RNA-FISH images with a low density of spots. For all channels with RNA-FISH signal, a custom algorithm was used to determine the presence of clouds or transcription sites. This algorithm searched for the brightest determined spot, and then determined a 3D connected volume of signal above background attached to this spot. If another spot was present in the cell, the algorithm was used again for a total of two connected volumes determined. These were used for quantification of possible transcription sites and clouds in downstream analysis. For each determined spot and cloud, signal above background was calculated by subtracting background determined using a broad Gaussian filter from the raw images.

Data analysis

The intensity of single RNA spots for each channel was determined by plotting the probability distribution of total signal above background for each determined spot and cloud. The intensity with the highest probability density was used in downstream analyses as the intensity of an individual RNA molecule (Supplementary Fig. 16).

To determine the colocalization of individual RNA molecules visualized with Exonic RNA-FISH probes (Fig. 2), transcription sites were filtered out by ignoring spots with two times the estimated intensity of a single spot. The two brightest spots in each cell were also excluded since they could be transcription sites even if they were not bright. The Euclidian distance was determined between every spot in one channel (e.g. Xist) with every spot in the other channel (e.g. Tsix) in 3D and plotted the total average colocalized spots against distance. The distance axis is on cubic scale for a linear increase in volume from 0-1.5 µm. To determine background colocalization, we fit the data from 0.75-1.5 µm to a line. The y intercept of this line corresponds to the fraction of colocalization as described previously⁵². Analysis using intronic RNA-FISH probes to visualize nascent transcription (Figs. 5–7) utilized the same method, except the two brightest spots in each cell were considered transcription sites.

To calculate the total RNA in each cell, the number of transcripts in every cloud was added to the number of individual puncta within the same cell for Xist, while the total number of puncta was used for Tsix (Fig. 3). We estimated the number of transcripts in every cloud or transcription site by taking the total fluorescence intensity and dividing by the intensity of a single transcript (Figs. 4–7, Supplementary Fig. 16).

To determine the enrichment of histone marks with Xist and Tsix transcription sites, we calculated the distances between Xist or Tsix transcription sites and spots of histone marks. For every transcription site, 500 random spots were generated inside the same nuclei of the same cells, and the distances between them and the histone marks were also calculated. The number of colocalized histone marks within each bin was then calculated for the data as well as the randomized spots, and the odds ratio of the Cochran Mantel-Haenszel (CMH) test was used to calculate enrichment. For colocalized Xist and Tsix nascent transcription, Xist and Tsix were considered colocalized if they were within 0.75 µm, which is within the 1 µm minimum distance shown previously for paired chromosomes⁶⁷ (Supplementary Fig. 50). Enrichment and p values were again calculated with CMH statistics.

Statistics and reproducibility

All experiments were conducted with at least three independent biological replicates unless otherwise noted. Sample sizes and statistical analyses were determined based on previous studies of single-molecule and single-cell RNA quantification, ensuring sufficient power for detecting significant differences. Data were excluded only in cases of evident technical artifacts, such as imaging defects or segmentation failures.

Statistical significance was assessed using non-parametric and parametric tests as appropriate. Spearman correlation tests were used to evaluate associations between Xist and Tsix expression levels, while the two-sided Kolmogorov-Smirnov (KS) test determined whether distributions significantly differed from randomized datasets. The Cochran-Mantel-Haenszel (CMH) test was used to assess colocalization enrichment relative to random distributions and to determine the significance of differences in transcriptional interference effects across different transcription levels.

For details on replication, sample sizes, and specific statistical methods applied to individual analyses, refer to the figure legends.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

The RNA-FISH and IF data generated in this study have been deposited in the BioImageArchive database under accession code S-BIAD1339. The processed RNA-FISH and IF data are available in Zenodo at https://doi.org/10.5281/zenodo.13351421. Source data are provided with this paper.

Code availability

All codes are publicly available on Zenodo through the following link. https://zenodo.org/records/13351421. The image processing software, CellDisect, used for cell and nuclear segmentation, was previously published in reference⁵⁰. The spot detection software, TrueSpot, has been submitted for publication and is available through the following Zenodo link. https://zenodo.org/records/13345321.

References

Frankish, A. et al. GENCODE 2021. Nucleic Acids Res. 49, D916–D923 (2021).
Article CAS PubMed Google Scholar
Mattick, J. S. et al. Long non-coding RNAs: definitions, functions, challenges and recommendations. Nat. Rev. Mol. Cell Biol. 24, 430–447 (2023).
Statello, L., Guo, C. J., Chen, L. L. & Huarte, M. Gene regulation by long non-coding RNAs and its biological functions. Nat. Rev. Mol. Cell Biol. 22, 96–118 (2020).
Delás, M. J. & Hannon, G. J. lncRNAs in development and disease: From functions to mechanisms. Open Biol. 7, 1–10 (2017).
Constanty, F. & Shkumatava, A. lncRNAs in development and differentiation: From sequence motifs to functional characterization. Development 148, 1–11 (2021).
O’Brien, J., Hayder, H., Zayed, Y. & Peng, C. Overview of microRNA biogenesis, mechanisms of actions, and circulation. Front. Endocrinol. 9, 1–12 (2018).
Ali Faghihi, M. et al. Expression of a Noncoding RNA Is Elevated in Alzheimer’s Disease and Drives Rapid Feed-Forward Regulation of b-Secretase. Nature Medicine 14, 723–730 (2008).
Yap, K. L. et al. Molecular Interplay of the Noncoding RNA ANRIL and Methylated Histone H3 Lysine 27 by Polycomb CBX7 in Transcriptional Silencing of INK4a. Mol. Cell 38, 662–674 (2010).
Article CAS PubMed PubMed Central Google Scholar
Prescott, E. M. & Proudfoot, N. J. Transcriptional collision between convergent genes in budding yeast. Proc. Natl Acad. Sci. 99, 8796–8801 (2002).
Article ADS CAS PubMed PubMed Central Google Scholar
Van Werven, F. J. F. J. et al. Transcription of two long noncoding RNAs mediates mating-type control of gametogenesis in budding yeast. Cell 150, 1170–1181 (2012).
Article PubMed PubMed Central Google Scholar
Molenaar, T. M. & van Leeuwen, F. SETD2: from chromatin modifier to multipronged regulator of the genome and beyond. Cell. Mol. Life Sci. 79, 1–24 (2022).
Article Google Scholar
Edmunds, J. W., Mahadevan, L. C. & Clayton, A. L. Dynamic histone H3 methylation during gene induction: HYPB/Setd2 mediates all H3K36 trimethylation. EMBO J. 27, 406–420 (2008).
Article CAS PubMed Google Scholar
Stark, R., Grzelak, M. & Hadfield, J. RNA sequencing: the teenage years. Nat. Rev. Genet. 20, 631–656 (2019).
Ohhata, T., Hoki, Y., Sasaki, H. & Sado, T. Crucial role of antisense transcription across the Xist promoter in Tsix -mediated Xist chromatin modification. Development 135, 227–235 (2008).
Article CAS PubMed Google Scholar
Stavropoulos, N., Lu, N. & Lee, J. T. A functional role for Tsix transcription in blocking Xist RNA accumulation but not in X-chromosome choice. Proc. Natl Acad. Sci. USA 98, 10232–10237 (2001).
Article ADS CAS PubMed PubMed Central Google Scholar
Lee, J. T. & Lu, N. Targeted mutagenesis of tsix leads to nonrandom x inactivation. Cell 99, 47–57 (1999).
Article CAS PubMed Google Scholar
Luikenhuis, S., Wutz, A. & Jaenisch, R. Antisense transcription through the Xist locus mediates Tsix function in embryonic stem cells. Mol. Cell Biol. 21, 8512–8520 (2001).
Article CAS PubMed PubMed Central Google Scholar
Doman, J. L., Raguram, A., Newby, G. A. & Liu, D. R. Evaluation and minimization of Cas9-independent off-target DNA editing by cytosine base editors. Nat. Biotechnol. 38, 620–628 (2020).
Article CAS PubMed PubMed Central Google Scholar
Sarkar, M. K. et al. An Xist-activating antisense RNA required for X-chromosome inactivation. Nat. Commun. 6, 3–13 (2015).
Article ADS CAS PubMed Google Scholar
Bumgarner, S. L. et al. Single-cell analysis reveals that noncoding RNAs contribute to clonal heterogeneity by modulating transcription factor recruitment. Mol. Cell 45, 470–482 (2012).
Article CAS PubMed PubMed Central Google Scholar
Brown, C. J. et al. A gene from the region of the human X inactivation centre is expressed exclusively from the inactive X chromosome. Nature 349, 38–44 (1991).
Article ADS CAS PubMed Google Scholar
Lee, J., Davidow, L. S. & Warshawsky, D. Tsix, a gene antisense to Xist at the X-inactivation centre. Nat. Genet 21, 400–404 (1999).
Article CAS PubMed Google Scholar
Pelechano, V. & Steinmetz, L. M. Gene regulation by antisense transcription. Nat. Rev. Genet. 14, 880–893 (2013).
Article CAS PubMed Google Scholar
Zhao, J. et al. Polycomb proteins targeted by a short repeat RNA to the mouse X chromosome. Science 322, 750–756 (2008).
Article ADS CAS PubMed PubMed Central Google Scholar
Balbin, O. A. et al. The landscape of antisense gene expression in human cancers. Genome Res 25, 1068–1079 (2015).
Article CAS PubMed PubMed Central Google Scholar
Harris, K. A. & Breaker, R. R. Large noncoding RNAs in bacteria. Microbiol Spectr 6, 1–12 (2018).
Cui, X. Y., Zhan, J. K. & Liu, Y. S. Roles and functions of antisense lncRNA in vascular aging. Ageing Res. Rev. 72, 1–13 (2021).
Liu, S. J., Dang, H. X., Lim, D. A., Feng, F. Y. & Maher, C. A. Long noncoding RNAs in cancer metastasis. Nat. Rev. Cancer 21, 446–460 (2021).
Penny, G. D., Kay, G. F., Sheardown, S. a., Rastan, S. & Brockdorff, N. Requirement for Xist in X chromosome inactivation. Nature 379, 131–137 (1996).
Richardson, A. L. et al. X chromosomal abnormalities in basal-like human breast cancer. Cancer Cell 9, 121–132 (2006).
Article CAS PubMed Google Scholar
Yildirim, E. et al. Xist RNA is a potent suppressor of hematologic cancer in mice. Cell 152, 727–742 (2013).
Article CAS PubMed Google Scholar
Ji, B., Higa, K. K., Kelsoe, J. R. & Zhou, X. Over-expression of XIST, the Master gene for X chromosome inactivation, in females with major affective disorders. EBioMedicine 2, 909–918 (2015).
Article PubMed PubMed Central Google Scholar
Loda, A., Collombet, S. & Heard, E. Gene regulation in time and space during X-chromosome inactivation. Nat. Rev. Mol. Cell Biol. 23, 231–249 (2022).
Article CAS PubMed Google Scholar
Ogawa, Y., Sun, B. K. & Lee, J. T. Intersection of the RNA interference and X-inactivation pathways. Science 320, 1336–1341 (2008).
Article ADS CAS PubMed PubMed Central Google Scholar
Sun, B. K., Deaton, A. M. & Lee, J. T. A Transient Heterochromatic State in Xist Preempts X Inactivation Choice without RNA Stabilization. Mol. Cell 21, 617–628 (2006).
Article CAS PubMed Google Scholar
Mutzel, V. et al. A symmetric toggle switch explains the onset of random X inactivation in different mammals. Nat. Struct. Mol. Biol. 26, 350–360 (2019).
Article CAS PubMed PubMed Central Google Scholar
Navarro, P., Pichard, S., Ciaudo, C., Avner, P. & Rougeulle, C. Tsix transcription across the Xist gene alters chromatin conformation without affecting Xist transcription: implications for X-chromosome inactivation. Genes Dev. 19, 1474–1484 (2005).
Article CAS PubMed PubMed Central Google Scholar
Sado, T., Hoki, Y. & Sasaki, H. Tsix silences xist through modification of chromatin structure. Dev. Cell 9, 159–165 (2005).
Article CAS PubMed Google Scholar
Ohhata, T. et al. Histone H3 lysine 36 trimethylation is established over the xist promoter by antisense Tsix transcription and contributes to repressing xist expression. Mol. Cell Biol. 35, 3909–3920 (2015).
Article CAS PubMed PubMed Central Google Scholar
Ohhata, T. et al. Dynamics of transcription-mediated conversion from euchromatin to facultative heterochromatin at the Xist promoter by Tsix. Cell Rep. 34, 1–16 (2021).
Article Google Scholar
Hannon, G. J. & User, N. RNA interference. Nature 418, 1–8 (2002).
Article Google Scholar
Nesterova, T. B. et al. Dicer regulates Xist promoter methylation in ES cells indirectly through transcriptional control of Dnmt3a. Epigenetics Chromatin 1, 1–21 (2008).
Kanellopoulou, C. et al. X Chromosome Inactivation in the Absence of Dicer. Proc Natl Acad Sci USA 106, 1122–1127 (2009).
Gayen, S., Maclary, E., Buttigieg, E., Hinten, M. & Kalantry, S. A primary role for the Tsix lncRNA in maintaining random X-chromosome inactivation. Cell Rep. 11, 1251–1265 (2015).
Article CAS PubMed PubMed Central Google Scholar
Shiura, H. & Abe, K. Xist/Tsix expression dynamics during mouse peri-implantation development revealed by whole-mount 3D RNA-FISH. Sci. Rep. 9, 1–11 (2019).
Article ADS CAS Google Scholar
Raj, A., van den Bogaard, P., Rifkin, S. A., van Oudenaarden, A. & Tyagi, S. Imaging individual mRNA molecules using multiple singly labeled probes. Nat. Methods 5, 877–879 (2008).
Article CAS PubMed PubMed Central Google Scholar
Femino, A. M., Fay, F. S., Fogarty, K. & Singer, R. H. Visualization of single RNA transcripts in situ. Science 280, 585–590 (1998).
Article ADS CAS PubMed Google Scholar
Makhlouf, M. et al. A prominent and conserved role for YY1 in Xist transcriptional activation. Nat. Commun. 5, 1–12 (2014).
Marks, H. et al. Dynamics of gene silencing during X inactivation using allele-specific RNA-seq. Genome Biol. 16, 1–20 (2015).
Kesler, B., Li, G., Thiemicke, A., Venkat, R. & Neuert, G. Automated cell boundary and 3D nuclear segmentation of cells in suspension. Sci. Rep. 9, 1–9 (2019).
Hospelhorn et al. TrueSpot: A robust automated tool for quantifying signal puncta in fluorescent imaging, bioRxiv, https://doi.org/10.1101/2025.01.10.632467 (2025).
Semrau, S. et al. FuseFISH: Robust detection of transcribed gene fusions in single cells. Cell Rep. 6, 18–23 (2014).
Article CAS PubMed Google Scholar
Munsky, B., Li, G., Fox, Z. R., Shepherd, D. P. & Neuert, G. Distribution shapes govern the discovery of predictive models for gene regulation. Proc. Natl Acad. Sci. 115, 7533–7538 (2018).
Article ADS CAS PubMed PubMed Central Google Scholar
Neuert, G. et al. Systematic identification of signal-activated stochastic gene regulation. Science 339, 584–587 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Ohhata, T., Senner, C. E., Hemberger, M. & Wutz, A. Lineage-specific function of the noncoding Tsix RNA for Xist repression and Xi reactivation in mice. Genes Dev. 25, 1702–1715 (2011).
Article CAS PubMed PubMed Central Google Scholar
Peacock, J. A. Two-dimensional goodness-of-fit testing in astronomy. Mon. Not. R. Astron Soc. 202, 615–627 (1983).
Article ADS Google Scholar
Gil, N. & Ulitsky, I. Regulation of gene expression by cis-acting long non-coding RNAs. Nat. Rev. Genet. 21, 102–117 (2020).
Hobson, D. J., Wei, W., Steinmetz, L. M. & Svejstrup, J. Q. RNA polymerase II collision interrupts convergent transcription. Mol. Cell 48, 365–374 (2012).
Article CAS PubMed PubMed Central Google Scholar
Brooks, A. N. et al. Transcriptional neighborhoods regulate transcript isoform lengths and expression levels. Science (1979) 375, 1000–1005 (2022).
CAS Google Scholar
Ma, N. & McAllister, W. T. In a head-on collision, two RNA polymerases approaching one another on the same DNA may pass by one another. J. Mol. Biol. 391, 808–812 (2009).
Article CAS PubMed Google Scholar
Lenstra, T. L., Coulon, A., Chow, C. C. & Larson, D. R. Single-Molecule Imaging Reveals a Switch between Spurious and Functional ncRNA Transcription. Mol. Cell 60, 597–610 (2015).
Article CAS PubMed PubMed Central Google Scholar
Holland, P. W. & Thayer, D. T. Differential Item Functioning And The Mantel-Haenszel Procedure. ETS Research Report Series 24, 1–22 (1986).
Li, B., Howe, L., Anderson, S., Yates, J. R. & Workman, J. L. The Set2 histone methyltransferase functions through the phosphorylated carboxyl-terminal domain of RNA Polymerase II. J. Biol. Chem. 278, 8897–8903 (2003).
Article CAS PubMed Google Scholar
Carrozza, M. J. et al. Histone H3 methylation by Set2 directs deacetylation of coding regions by Rpd3S to suppress spurious intragenic transcription. Cell 123, 581–592 (2005).
Article CAS PubMed Google Scholar
Talbert, P. B. & Henikoff, S. The Yin and Yang of histone marks in transcription. Annu. Rev. Genom. Hum. Genet. 22, 147–170 (2021).
Article Google Scholar
Gjaltema, R. A. F. et al. Distal and proximal cis-regulatory elements sense X chromosome dosage and developmental state at the Xist locus. Mol. Cell 82, 190–208.e17 (2022).
Article CAS PubMed Google Scholar
Xu, N., Donohoe, M. E., Silva, S. S. & Lee, J. T. Evidence that homologous X-chromosome pairing requires transcription and Ctcf protein. Nat. Genet 39, 1390–1396 (2007).
Article CAS PubMed Google Scholar
Schwabish, M. A. & Struhl, K. Evidence for eviction and rapid deposition of histones upon transcriptional elongation by RNA polymerase II. Mol. Cell Biol. 24, 10111–10117 (2004).
Article CAS PubMed PubMed Central Google Scholar
Filipovski, M., Soffers, J. H. M., Vos, S. M. & Farnung, L. Structural basis of nucleosome retention during transcription elongation. Science 376, 1313–1316 (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Mason, F. M. et al. SETD2 safeguards the genome against isochromosome formation. Proc. Natl Acad. Sci. USA 120, e2303752120 (2023).
Article CAS PubMed PubMed Central Google Scholar
Chakravarthy, S., Sternberg, S. H., Kellenberger, C. A. & Doudna, J. A. Substrate-specific kinetics of dicer-catalyzed RNA processing. J. Mol. Biol. 404, 392–402 (2010).
Article CAS PubMed PubMed Central Google Scholar
Bennett, C. F. Therapeutic antisense oligonucleotides are coming of age. Annu Rev. Med 70, 307–321 (2019).
Article CAS PubMed Google Scholar

Download references

Acknowledgements

We like to thank members of the Neuert lab for their feedback on the manuscript. We like to thank Dr. Rudolf Jaenisch and Dr. Joost Gribnau for sharing the female F1 2-1 mESCs and we like to thank Drs. Jeannie T. Lee and Rudolf Jaenisch for sharing protocols for these mESC. We also like to thank Dr. Mark Magnuson and the Vanderbilt Stem Cell core for guidance on culturing F1 2-1 mESCs, and Dr. Frank Mason for sharing the SETD2 inhibitor. G.N. is supported by NIH, United States, DP2 GM11484901, NIH R01GM115892, NIH R01GM140240, Vanderbilt Deans Faculty Fellow award, and Vanderbilt Startup Funds. B.K.K. was supported by T32GM008320. This study used resources at the Advanced Computing Center for Research and Education (ACCRE) at Vanderbilt University, Nashville, TN (NIH, United States, S10 Shared Instrumentation Grant 1S10OD023680-01).

Author information

Authors and Affiliations

Department of Molecular Physiology and Biophysics, Basic Sciences, School of Medicine, Vanderbilt University, Nashville, TN, USA
Benjamin K. Kesler, John Adams & Gregor Neuert
Department of Pharmacology, Basic Sciences, School of Medicine, Vanderbilt University, Nashville, TN, USA
Gregor Neuert
Department of Biomedical Engineering, School of Engineering, Vanderbilt University, Nashville, TN, USA
Gregor Neuert
Vanderbilt Center for Computational Systems Biology, Vanderbilt University, Nashville, TN, USA
Gregor Neuert

Authors

Benjamin K. Kesler
View author publications
Search author on:PubMed Google Scholar
John Adams
View author publications
Search author on:PubMed Google Scholar
Gregor Neuert
View author publications
Search author on:PubMed Google Scholar

Contributions

Conceptualization, G.N., B.K.K.; methodology, G.N., B.K.K.; software, G.N., B.K.K.; validation, G.N., B.K.K.; formal analysis, B.K.K.; investigation, B.K.K.; data curation, B.K.K., J.A.; writing – original draft, B.K.K.; writing – review & editing, G.N., B.K.K.; visualization, G.N., B.K.K.; supervision, G.N., project administration, G.N.; funding acquisition, G.N.

Corresponding author

Correspondence to Gregor Neuert.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks the anonymous reviewers for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review file

Description of Additional Supplementary Files

Supplementary Data 1

Supplementary Data 2

Reporting Summary

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

Kesler, B.K., Adams, J. & Neuert, G. Transcriptional stochasticity reveals multiple mechanisms of long non-coding RNA regulation at the Xist–Tsix locus. Nat Commun 16, 4223 (2025). https://doi.org/10.1038/s41467-025-59496-6

Download citation

Received: 06 October 2023
Accepted: 24 April 2025
Published: 07 May 2025
DOI: https://doi.org/10.1038/s41467-025-59496-6

This article is cited by

TrueSpot: a robust automated tool for quantifying signal puncta in fluorescent imaging
- Blythe G. Hospelhorn
- Benjamin K. Kesler
- Gregor Neuert
Genome Biology (2025)