Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • Article
  • Published:

RIP-PEN-seq identifies a class of kink-turn RNAs as splicing regulators

Abstract

A kink-turn (K-turn) is a three-dimensional RNA structure that exists in all three primary phylogenetic domains. In this study, we developed the RIP-PEN-seq method to identify the full-length sequences of RNAs bound by the K-turn binding protein 15.5K and discovered a previously uncharacterized class of RNAs with backward K-turn motifs (bktRNAs) in humans and mice. All bktRNAs share two consensus sequence motifs at their fixed terminal position and have complex folding properties, expression and evolution patterns. We found that a highly conserved bktRNA1 guides the methyltransferase fibrillarin to install RNA methylation of U12 small nuclear RNA in humans. Depletion of bktRNA1 causes global splicing dysregulation of U12-type introns by impairing the recruitment of ZCRB1 to the minor spliceosome. Most bktRNAs regulate the splicing of local introns by interacting with the 15.5K protein. Taken together, our findings characterize a class of small RNAs and uncover another layer of gene expression regulation that involves crosstalk among bktRNAs, RNA splicing and RNA methylation.

This is a preview of subscription content, access via your institution

Access options

Buy this article

USD 39.95

Prices may be subject to local taxes which are calculated during checkout

Fig. 1: A class of ncRNAs with a backward K-turn structure.
Fig. 2: Complex expression patterns of bktRNAs in tissues, cell lines and subcellular compartments and their evolution patterns.
Fig. 3: bktRNA1 guides the FBL protein to introduce 2′-O-methylation in U12 snRNA.
Fig. 4: Dysregulation of U12-type intron splicing in bktRNA1-depleted cells.
Fig. 5: Depletion of bktRNA1 affects U12 interactions with ZCRB1.
Fig. 6: Local regulation of intron splicing by bktRNAs.

Similar content being viewed by others

Data availability

All sequencing data that support the findings of this study have been deposited in the National Center for Biotechnology Information’s Gene Expression Omnibus with the following accession numbers: GSE160970 for all HEK293T 15.5K RIP-PEN-seq; GSE182757 for all Hepa1-6 RIP-PEN-seq; GSE160636 for FBL and 15.5K CLASH-seq in HEK293T cells; GSE160887 for PEN-seq in HCT116, U-87 MG, Hela, HEK293T, HepG2 and K562 cells; GSE186849 for PEN-seq in 15.5K knockdown HEK293T cells; GSE182843 for PEN-seq in HEK293T and HCT116 cell fractions; GSE160515 for RNA-seq in bktRNA1 KO HCT116 cells; GSE182830 for RNA-seq in ZCRB1 knockdown HCT116 cells; GSE182759 for RNA-seq in 15.5K knockdown HEK293T cells; and GSE220470 for RIP-PEN-SHAPE-MaP in HEK293T cells. All data are available in the manuscript and in Supplementary Information and Source data files. There are no restriction on data availability. Source data are provided with this paper.

Code availability

The program kturnSeeker was written in the C++ programming language and is available from GitHub with no restrictions or conditions on access: https://github.com/sysu-software/kturnSeeker.

References

  1. Butcher, S. E. & Pyle, A. M. The molecular interactions that stabilize RNA tertiary structure: RNA motifs, patterns, and networks. ACC Chem. Res. 44, 1302–1311 (2011).

    Article  CAS  PubMed  Google Scholar 

  2. Klein, D. J., Schmeing, T. M., Moore, P. B. & Steitz, T. A. The kink-turn: a new RNA secondary structure motif. EMBO J. 20, 4214–4221 (2001).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  3. Lilley, D. M. The K-turn motif in riboswitches and other RNA species. Biochim. Biophys. Acta 1839, 995–1004 (2014).

    Article  CAS  PubMed  Google Scholar 

  4. Schroeder, K. T., McPhee, S. A., Ouellet, J. & Lilley, D. M. A structural database for k-turn motifs in RNA. RNA 16, 1463–1468 (2010).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  5. Rozhdestvensky, T. S. et al. Binding of L7Ae protein to the K-turn of archaeal snoRNAs: a shared RNA binding motif for C/D and H/ACA box snoRNAs in Archaea. Nucleic Acids Res. 31, 869–877 (2003).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  6. Baird, N. J., Zhang, J., Hamma, T. & Ferré-D’Amaré, A. R. YbxF and YlxQ are bacterial homologs of L7Ae and bind K-turns but not K-loops. RNA 18, 759–770 (2012).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  7. Nottrott, S. et al. Functional interaction of a novel 15.5kD [U4/U6.U5] tri-snRNP protein with the 5′ stem-loop of U4 snRNA. EMBO J. 18, 6119–6133 (1999).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  8. Szewczak, L. B. W., DeGregorio, S. J., Strobel, S. A. & Steitz, J. A. Exclusive interaction of the 15.5 kD protein with the terminal box C/D motif of a methylation guide snoRNP. Chem. Biol. 9, 1095–1107 (2002).

    Article  CAS  PubMed  Google Scholar 

  9. Szewczak, L. B. W., Gabrielsen, J. S., Degregorio, S. J., Strobel, S. A. & Steitz, J. A. Molecular basis for RNA kink-turn recognition by the h15.5K small RNP protein. RNA 11, 1407–1419 (2005).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  10. Chawla, M., Oliva, R., Bujnicki, J. M. & Cavallo, L. An atlas of RNA base pairs involving modified nucleobases with optimal geometries and accurate energies. Nucleic Acids Res. 43, 6714–6729 (2015).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  11. Polikanov, Y. S., Melnikov, S. V., Söll, D. & Steitz, T. A. Structural insights into the role of rRNA modifications in protein synthesis and ribosome assembly. Nat. Struct. Mol. Biol. 22, 342–344 (2015).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  12. Liu, S., Ghalei, H., Lührmann, R. & Wahl, M. C. Structural basis for the dual U4 and U4atac snRNA-binding specificity of spliceosomal protein hPrp31. RNA 17, 1655–1663 (2011).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  13. Vidovic, I., Nottrott, S., Hartmuth, K., Lührmann, R. & Ficner, R. Crystal structure of the spliceosomal 15.5kD protein bound to a U4 snRNA fragment. Mol. Cell 6, 1331–1342 (2000).

    Article  CAS  PubMed  Google Scholar 

  14. Jafarifar, F., Dietrich, R. C., Hiznay, J. M. & Padgett, R. A. Biochemical defects in minor spliceosome function in the developmental disorder MOPD I. RNA 20, 1078–1089 (2014).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  15. Edery, P. et al. Association of TALS developmental disorder with defect in minor splicing component U4atac snRNA. Science 332, 240–243 (2011).

    Article  CAS  PubMed  Google Scholar 

  16. He, H. et al. Mutations in U4atac snRNA, a component of the minor spliceosome, in the developmental disorder MOPD I. Science 332, 238–240 (2011).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  17. Merico, D. et al. Compound heterozygous mutations in the noncoding RNU4ATAC cause Roifman Syndrome by disrupting minor intron splicing. Nat. Commun. 6, 8718 (2015).

    Article  CAS  PubMed  Google Scholar 

  18. Farach, L. S. et al. The expanding phenotype of RNU4ATAC pathogenic variants to Lowry Wood syndrome. Am. J. Med. Genet. A 176, 465–469 (2018).

    Article  CAS  PubMed  Google Scholar 

  19. Lapinaite, A. et al. The structure of the box C/D enzyme reveals regulation of RNA methylation. Nature 502, 519–523 (2013).

    Article  CAS  PubMed  Google Scholar 

  20. Siegfried, N. A., Busan, S., Rice, G. M., Nelson, J. A. E. & Weeks, K. M. RNA motif discovery by SHAPE and mutational profiling (SHAPE-MaP). Nat. Methods 11, 959–965 (2014).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  21. Luo, Q.-J. et al. RNA structure probing reveals the structural basis of Dicer binding and cleavage. Nat. Commun. 12, 3397 (2021).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  22. McPhee, S. A., Huang, L. & Lilley, D. M. A critical base pair in k-turns that confers folding characteristics and correlates with biological function. Nat. Commun. 5, 5127 (2014).

    Article  CAS  PubMed  Google Scholar 

  23. Huang, L. et al. Structure and folding of four putative kink turns identified in structured RNA species in a test of structural prediction rules. Nucleic Acids Res. 49, 5916–5924 (2021).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  24. Liu, J. & Lilley, D. M. The role of specific 2′-hydroxyl groups in the stabilization of the folded conformation of kink-turn RNA. RNA 13, 200–210 (2007).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  25. Huang, L., Wang, J. & Lilley, D. M. A critical base pair in k-turns determines the conformational class adopted, and correlates with biological function. Nucleic Acids Res. 44, 5390–5398 (2016).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  26. Ashraf, S., Huang, L. & Lilley, D. M. J. Effect of methylation of adenine N6 on kink turn structure depends on location. RNA Biol. 16, 1377–1385 (2019).

    Article  PubMed  PubMed Central  Google Scholar 

  27. Huang, L., Ashraf, S., Wang, J. & Lilley, D. M. Control of box C/D snoRNP assembly by N6-methylation of adenine. EMBO Rep. 18, 1631–1645 (2017).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  28. Xuan, J. J. et al. RMBase v2.0: deciphering the map of RNA modifications from epitranscriptome sequencing data. Nucleic Acids Res. 46, D327–D334 (2018).

    Article  CAS  PubMed  Google Scholar 

  29. Zhipeng, L. et al. RNA duplex map in living cells reveals higher-order transcriptome structure. Cell 165, 1267–1279 (2016).

    Article  Google Scholar 

  30. Tycowski, K. T., Aab, A. & Steitz, J. A. Guide RNAs with 5′ caps and novel box C/D snoRNA-like domains for modification of snRNAs in metazoa. Curr. Biol. 14, 1985–1995 (2004).

    Article  CAS  PubMed  Google Scholar 

  31. Cologne, A. et al. New insights into minor splicing-a transcriptomic analysis of cells derived from TALS patients. RNA 25, 1130–1149 (2019).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  32. Will, C. L. et al. The human 18S U11/U12 snRNP contains a set of novel proteins not found in the U2-dependent spliceosome. RNA 10, 929–941 (2004).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  33. Dominguez, D. et al. Sequence, structure, and context preferences of human RNA binding proteins. Mol. Cell 70, 854–867 e859 (2018).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  34. Jolma, A. et al. Binding specificities of human RNA-binding proteins toward structured and linear RNA sequences. Genome Res. 30, 962–973 (2020).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  35. Engreitz, J. M. et al. Local regulation of gene expression by lncRNA promoters, transcription and splicing. Nature 539, 452–455 (2016).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  36. Anzalone, A. V. et al. Search-and-replace genome editing without double-strand breaks or donor DNA. Nature 576, 149–157 (2019).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  37. Alioto, T. S. U12DB: a database of orthologous U12-type spliceosomal introns. Nucleic Acids Res. 35, D110–D115 (2007).

    Article  CAS  PubMed  Google Scholar 

  38. Olthof, A. M., Hyatt, K. C. & Kanadia, R. N. Minor intron splicing revisited: identification of new minor intron-containing genes and tissue-dependent retention and alternative splicing of minor introns. BMC Genomics 20, 686 (2019).

    Article  PubMed  PubMed Central  Google Scholar 

  39. Madan, V. et al. Aberrant splicing of U12-type introns is the hallmark of ZRSR2 mutant myelodysplastic syndrome. Nat. Commun. 6, 6042 (2015).

    Article  CAS  PubMed  Google Scholar 

  40. Reber, S. et al. Minor intron splicing is regulated by FUS and affected by ALS-associated FUS mutants. EMBO J. 35, 1504–1521 (2016).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  41. Verberne, E. A., Faries, S., Mannens, M., Postma, A. V. & van Haelst, M. M. Expanding the phenotype of biallelic RNPC3 variants associated with growth hormone deficiency. Am. J. Med. Genet. A 182, 1952–1956 (2020).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  42. Argente, J. et al. Defective minor spliceosome mRNA processing results in isolated familial growth hormone deficiency. EMBO Mol. Med. 6, 299–306 (2014).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  43. Martos-Moreno, G. et al. Response to growth hormone in patients with RNPC3 mutations. EMBO Mol. Med. 10, e9143 (2018).

    Article  PubMed  PubMed Central  Google Scholar 

  44. Elsaid, M. F. et al. Mutation in noncoding RNA RNU12 causes early onset cerebellar ataxia. Ann. Neurol. 81, 68–78 (2017).

    Article  CAS  PubMed  Google Scholar 

  45. Burns, R. et al. Homozygous splice mutation in CWF19L1 in a Turkish family with recessive ataxia syndrome. Neurology 83, 2175–2182 (2014).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  46. Evers, C. et al. Exome sequencing reveals a novel CWF19L1 mutation associated with intellectual disability and cerebellar atrophy. Am. J. Med. Genet. A 170, 1502–1509 (2016).

    Article  CAS  PubMed  Google Scholar 

  47. Nguyen, M. et al. Pathogenic CWF19L1 variants as a novel cause of autosomal recessive cerebellar ataxia and atrophy. Eur. J. Hum. Genet. 24, 619–622 (2016).

    Article  CAS  PubMed  Google Scholar 

  48. Bailey, T. L. & Elkan, C. Fitting a mixture model by expectation maximization to discover motifs in biopolymers. Proc. Int. Conf. Intell. Syst. Mol. Biol. 2, 28–36 (1994).

    CAS  PubMed  Google Scholar 

  49. Kramer, K. et al. Photo-cross-linking and high-resolution mass spectrometry for assignment of RNA-binding sites in RNA-binding proteins. Nat. Methods 11, 1064–1070 (2014).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  50. Morlan, J. D., Qu, K. & Sinicropi, D. V. Selective depletion of rRNA enables whole transcriptome profiling of archival fixed tissue. PLoS ONE 7, e42882 (2012).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  51. Adiconis, X. et al. Comparative analysis of RNA sequencing methods for degraded or low-input samples. Nat. Methods 10, 623–629 (2013).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  52. Zinshteyn, B., Wangen, J. R., Hua, B. & Green, R. Nuclease-mediated depletion biases in ribosome footprint profiling libraries. RNA 26, 1481–1488 (2020).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  53. Gillen, A. E., Yamamoto, T. M., Kline, E., Hesselberth, J. R. & Kabos, P. Improvements to the HITS-CLIP protocol eliminate widespread mispriming artifacts. BMC Genomics 17, 338 (2016).

    Article  PubMed  PubMed Central  Google Scholar 

  54. Keene, J. D., Komisarow, J. M. & Friedersdorf, M. B. RIP-Chip: the isolation and identification of mRNAs, microRNAs and protein components of ribonucleoprotein complexes from cell extracts. Nat. Protoc. 1, 302–307 (2006).

    Article  CAS  PubMed  Google Scholar 

  55. Deng, B. et al. An LTR retrotransposon-derived lncRNA interacts with RNF169 to promote homologous recombination. EMBO Rep. 20, e47650 (2019).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  56. Spitale, R. C. et al. Structural imprints in vivo decode RNA regulatory mechanisms. Nature 519, 486–490 (2015).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  57. Phelps, W. A., Carlson, A. E. & Lee, M. T. Optimized design of antisense oligomers for targeted rRNA depletion. Nucleic Acids Res. 49, e5 (2021).

    Article  CAS  PubMed  Google Scholar 

  58. Chu, C., Qu, K., Zhong, F. L., Artandi, S. E. & Chang, H. Y. Genomic maps of long noncoding RNA occupancy reveal principles of RNA-chromatin interactions. Mol. Cell 44, 667–678 (2011).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  59. Percharde, M. et al. A LINE1-nucleolin partnership regulates early development and ESC identity. Cell 174, 391–405 (2018).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  60. Chu, C. et al. Systematic discovery of Xist RNA binding proteins. Cell 161, 404–416 (2015).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  61. Haeussler, M. et al. The UCSC Genome Browser database: 2019 update. Nucleic Acids Res. 47, D853–d858 (2019).

    Article  CAS  PubMed  Google Scholar 

  62. Harrow, J. et al. GENCODE: the reference human genome annotation for The ENCODE Project. Genome Res. 22, 1760–1774 (2012).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  63. Lestrade, L. & Weber, M. J. snoRNA-LBME-db, a comprehensive database of human H/ACA and C/D box snoRNAs. Nucleic Acids Res. 34, D158–D162 (2006).

    Article  CAS  PubMed  Google Scholar 

  64. Yang, J. H., Shao, P., Zhou, H., Chen, Y. Q. & Qu, L. H. deepBase: a database for deeply annotating and mining deep sequencing data. Nucleic Acids Res. 38, D123–D130 (2010).

    Article  CAS  PubMed  Google Scholar 

  65. Jorjani, H. et al. An updated human snoRNAome. Nucleic Acids Res. 44, 5068–5082 (2016).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  66. Kishore, S. et al. Insights into snoRNA biogenesis and processing from PAR-CLIP of snoRNA core proteins and small RNA sequencing. Genome Biol. 14, R45 (2013).

    Article  PubMed  PubMed Central  Google Scholar 

  67. Pruitt, K. D. et al. RefSeq: an update on mammalian reference sequences. Nucleic Acids Res. 42, D756–D763 (2014).

    Article  CAS  PubMed  Google Scholar 

  68. Quinlan, A. R. & Hall, I. M. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 26, 841–842 (2010).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  69. Camacho, C. et al. BLAST+: architecture and applications. BMC Bioinformatics 10, 421 (2009).

    Article  PubMed  PubMed Central  Google Scholar 

  70. Martin, M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet J. 17, 10–12 (2011).

    Article  Google Scholar 

  71. Magoc, T. & Salzberg, S. L. FLASH: fast length adjustment of short reads to improve genome assemblies. Bioinformatics 27, 2957–2963 (2011).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  72. Dobin, A. et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics 29, 15–21 (2013).

    Article  CAS  PubMed  Google Scholar 

  73. Frankish, A. et al. GENCODE reference annotation for the human and mouse genomes. Nucleic Acids Res. 47, D766–D773 (2019).

    Article  CAS  PubMed  Google Scholar 

  74. Shen, S. et al. rMATS: robust and flexible detection of differential alternative splicing from replicate RNA-seq data. Proc. Natl Acad. Sci. USA 111, E5593–E5601 (2014).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

Download references

Acknowledgements

We thank L. Huang from Sun Yat-sen Memorial Hospital for valuable suggestions on this manuscript. We thank Y. Zhang for sharing Sleeping Beauty transposon system. We thank all the staff from Sun Yat-sen University for their support and hard work during the COVID-19 pandemic. This work was supported, in part, by the National Key R&D Program of China (2019YFA0802202 (to J.Y.) and 2022YFA1303300 (to J.Y.)); the National Natural Science Foundation of China (32225011 (to J.Y.), 91940304 (to J.Y.), 31971228 (to J.Y.), 31770879 (to J.Y.), 31970604 (to L.Q.), 31900903 (to B.L.) and 32100467(to S.L.)); the Youth Science and Technology Innovation Talent of Guangdong TeZhi Plan (2019TQ05Y181 (to J.Y.)); funds from Guangzhou City (202002030351 (to J.Y.)); and Fundamental Research Funds for the Central Universities, Sun Yat-sen University (20lgpy112 (to B.L.) and 2021qntd26 (to B.L.)).

Author information

Authors and Affiliations

Contributions

J.Y., B.L. and L.Q. conceived and designed the entire project. J.Y. and L.Q. designed and supervised the research. B.L., S.L., W.Z., A.L., P.Y., D.W., J.Z., P.Z., C.L., Q.L., J.Y., S.H., Q.H., H.Z. and J.Y. performed the experiments and/or data analyses. J.Y., B.L. and A.L. performed the genome-wide or transcriptome-wide data analyses. J.Y., B.L., S.L. and L.Q. contributed reagents/analytic tools and/or grant support. J.C. provided helpful discussions. J.Y., L.Q., B.L., S.L. and A.L. wrote and revised the paper. All authors discussed the results and commented on the manuscript.

Corresponding authors

Correspondence to Lianghu Qu or Jianhua Yang.

Ethics declarations

Competing interests

J.C. is a scientific advisory board member of Race Oncology. The remaining authors declare no competing interests.

Peer review

Peer review information

Nature Biotechnology thanks the anonymous reviewers for their contribution to the peer review of this work.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data

Extended Data Fig. 1 Characterization of the RIP-PEN-seq technique.

a, Diagram of RNase H-based rRNA depletion for the construction of RIP-PEN-seq library. b, Western blotting analysis of the overexpression of 15.5K-FLAG protein in HEK293T cells. GAPDH serves as the loading control. c, RIP from HEK293T cells was performed using anti-FLAG and IgG. d, e, Meta-analyses of the RIP-PEN-seq results for the start (d) and end (e) sites of the forward K-turn RNAs (box C/D ncRNAs) in HEK293T cells. f, Genome-browser plot of RIP-PEN-seq (coverage, green; 5’-start, red; 3’-end, purple) for representative forward K-turn RNAs (box C/D snoRNAs) in the introns of GAS5. g, Computational workflow for analysis of the RIP-PEN-seq sequencing data and identification of candidate transcripts. h, KturnSeeker core algorithm workflow. KturnSeeker was developed to identify and quantify forward (fktRNAs) and backward ktRNAs (bktRNAs) from RIP-PEN-seq data. KturnSeeker can screen bktRNAs as well as fktRNAs by reverse searching the K-turn structure. i, Gene type distribution of forward ktRNAs identified from RIP-PEN-seq. CD-type represents C/D box-containing snoRNAs and scaRNAs. ACA-type represents snoRNAs or scaRNAs that only contain H/ACA boxes.

Source data

Extended Data Fig. 2 Structural characterization of bktRNAs.

a, The position and significance of motifs located within bktRNAs. The enriched motifs were identified by MEME software. The K-turn structural motif of bktRNAs is composed of two conserved sequence motifs: a CUGA motif often 4 nt downstream of the 5’ end and a UGAUG motif often 2 nt upstream of the 3’ end. The position p-value is defined as the probability that a random sequence would have a motif match score greater or equal to the sequence under test. b, Another twelve novel bktRNAs discovered from RIP-PEN-seq data. All novel bktRNAs had a CUGA motif that was often 4 nt downstream of the 5’ end and a UGAUG motif that was often 2 nt upstream of the 3’ end. These two sequence motifs were located within the K-turn structural motifs of bktRNAs. The 5’ motif (CUGA) and 3’ motif (UGAUG) are marked with black rectangles. The non-canonical A•G and G•A base pairs and the mismatch in the backward K-turn structure are also marked. c, Schematic overview of 15.5 K RIP-PEN-SHAPE-MaP. d, Secondary structure of consensus fktRNA (left panel, n = 98). Violin plots displaying the SHAPE reactivity across the forward K-turn structure (including 5’ C stem, internal (internal loop), 5’ NC stem, Loop, 3’ NC stem, and 3’ C stem), averaged across all known box C/D snoRNAs (right panel). The boxplots indicate the median and the upper and lower quartiles. e, The predicted secondary structure (upper panel) and SHAPE reactivity signal (lower panel) on fktRNA SNORD102 (also known as U102). The forward K-turn structure is indicated in the structure figure. The NC-stem and C-stem are marked with black and red underlines in the bar plot, respectively. The SHAPE reactivity signal was determined by RIP-PEN-SHAPE-MaP in this study.

Extended Data Fig. 3 The backward K-turn sequence composition, predicted functions and tissue-specific expression profile of bktRNAs.

a, The secondary structure of consensus forward K-turn RNA (fktRNA) and backward K-turn RNA (bktRNA). The nucleotide positions in the K-turn structure are named according to the nomenclature rules for the forward K-turn structure. b, Matrix plot showing the number of human bktRNAs with the indicated nucleotide in the 3b:3n sequences. c, Number of human bktRNAs with the four possible Watson-Crick base pairs in the -1b:-1n position. d, Matrix plot showing the number of mouse bktRNAs with the indicated nucleotide in the 3b:3n sequences. e, Number of mouse bktRNAs with the four possible Watson-Crick base pairs in the -1b:-1n position. f, Number of bktRNAs with or without m6A modification in humans and mice. g, Number of bktRNAs with or without m6A modification at the 1n position in humans and mice. h, Enrichment analysis of the bktRNA host protein-coding genes by Metascape software. i, Tissue-specific expression profiles of bktRNAs. The expression levels of bktRNAs are displayed in the rows and the tissues are shown in the columns. The rows and columns are sorted based on k-means clustering of bktRNA genes. The colour intensity represents the tissue-specific score (JS score) as calculated for each bktRNA using the csSpecificity function. Representative bktRNAs are indicated in the right panel.

Extended Data Fig. 4 Genomic characterization, expression, conservation, and secondary structure of bktRNA1.

a, Genome-browser plot of RIP-PEN-seq (coverage, blue; 5’-start, red; 3’-end, yellow) for bktRNA1, as well as the evolutionary conservation across 100 vertebrates (green). b, Secondary structure of bktRNA1 in the human genome was predicted by R-scape software. The SHAPE reactivities for each nucleotide were mapped to secondary structures using R2R software. The box H/ACA domain is indicated with a black dashed box, and the backward K-turn structure and the potential K-turn-like structure are marked with green dashed boxes. The NC stem and C stem are indicated with black lines. The blue boxes show the representative motifs. CAB, Cajal body box. c, The SHAPE reactivity signal on bktRNA1. The representative motifs are underlined in the bar plot. The SHAPE reactivity signal was determined by RIP-PEN-SHAPE-MaP in this study.

Extended Data Fig. 5 Secondary structures and subcellular localization of bktRNA1 and its interacting partner U12 snRNA.

a, Predicted conserved RNA structure of bktRNA1 determined by measuring pairwise covariations with R-scape software. The H/ACA domain is indicated with a black dashed box. The functional region paired with U12 snRNA is indicated with a blue dashed box. b, In situ co-localization of bktRNA1 with 15.5 K proteins and U12 snRNAs in HEK293T cells by fluorescent in situ hybridization (FISH) and immunofluorescence (IF) microscopy. White arrows indicate the signal detected by probes or antibody. c, In situ co-localization of bktRNA1 with 15.5 K proteins and U12 snRNAs in HCT116 cells by fluorescent in situ hybridization (FISH) and immunofluorescence (IF) microscopy. White arrows indicate the signal detected by probes or antibody.

Extended Data Fig. 6 Splicing efficiency analysis for wild-type and knockout bktRNA1.

a, Workflow for intron retention analysis in HCT116 and KO-bktRNA1 cells. b, Proportion of aberrantly retained U12- and U2-type introns (filtered by p < 0.05) in bktRNA1-deficient cells. c, Proportion of statistically significant changes (filtered by p < 0.05) in U12- and U2-type genes in bktRNA1-deficient cells. d, The ratio of spliced to unspliced pre-mRNA for U12-type introns was determined by qPCR in bktRNA1-deficient cells. Data are presented as mean values +/− SEM (n = 3, biological replicates), two-tailed, paired t-test. e, The ratio of spliced to unspliced pre-mRNA for U12-type and U2-type (GAPDH) introns was determined by qPCR in bktRNA1-rescued HCT116 KO-4 cells. f, The ratio of spliced to unspliced pre-mRNA for U12-type and U2-type (GAPDH) introns was determined by qPCR in SNORA12-rescued HCT116 KO-4 cells. Data are presented as mean values +/− SEM (n = 3, biological replicates), two-tailed, paired t-test. ns, no significance. g, The ratio of spliced to unspliced pre-mRNA for U12-type and U2-type (GAPDH) introns was determined by qPCR in artificial scaRNA-overexpressing HCT116 KO-4 cells. Data are presented as mean values +/- SEM (n = 3, biological replicates), two-tailed, paired t-test. ns, no significance.

Extended Data Fig. 7 Depletion of bktRNA1 affects the interaction between U12 and ZCRB1.

a, Western blots showing precipitation with each indicated antibody in wild-type (WT) and bktRNA1-deficient KO-4 (KO) cells. b, Native RIP was performed in wild-type (WT) and bktRNA1-deficient KO-4 (KO) cells using each indicated antibody or normal IgG antibody, after which qPCR was performed with primers recognizing minor splice snRNAs (U11, U12, U4atac, U5, U6atac). The percentage of RIP-enriched snRNAs was calculated relative to the input RNA. Data are presented as mean values +/− SEM (n = 3, biological replicates), two-tailed, paired t-test. ns, no significance. c, ZCRB1 RIP-enriched snRNAs were detected by Northern blotting in wild-type (WT) and bktRNA1-deficient KO-4 (KO) cells. U6 snRNA served as a negative control. d, ZCRB1 RIP-enriched snRNAs were detected by Northern blotting in bktRNA1-deficient and bktRNA1-rescued cells. U6 snRNA served as a negative control.

Source data

Extended Data Fig. 8 ZCRB1 knockdown affects U12-type intron splicing.

a, qPCR (upper panel) and western blotting analysis (lower panel) of Dox-inducible ZCRB1 knockdown in HCT116 cells. GAPDH was used as an internal reference gene for qPCR, and GAPDH served as the loading control for western blotting. Data are presented as mean values +/− SEM (n = 3, biological replicates), two-tailed, paired t-test. b, c, Dot plots displaying the intron retention levels in a representative pairwise analysis of ZCRB1 knockdown and negative control cells. The red dots represent U12-type introns, and the blue dots represent U2-type introns. d, Proportion of aberrantly retained U2- and U12- type introns in ZCRB1 knockdown cells. The red boxes represent retained introns, and the blue boxes represent unretained introns. e, f, Cumulative fraction of the inclusion level difference between U12-type and U2-type introns in ZCRB1 knockdown and negative control cells. The P value on the cumulative plots of inclusion level differences were calculated using a two-sided Mann-Whitney-Wilcoxon test. g, Venn diagram showing the numbers of overlapping retained introns across four bktRNA1-deficient HCT116 cell lines and ZCRB1 knockdown cells. h, CCK-8 assay of HCT116 cells with bktRNA1 knockout. Data are presented as mean values +/− SEM (n = 3, biological replicates), two-tailed, paired t-test. i, Colony formation assay of HCT116 cells with bktRNA1 knockout. j, Quantitative analysis of colony formation assay in the indicated lines. Data are presented as mean values +/- SEM (n = 3, biological replicates), two-tailed, paired t-test. k, CCK-8 assay of HCT116 cells with ZCRB1 knockdown. Data are presented as mean values +/− SEM (n = 3, biological replicates), two-tailed, paired t-test. l, Colony formation assay of HCT116 cells with ZCRB1 knockdown. m, Quantitative analysis of colony formation assay in the indicated lines. Data are presented as mean values +/− SEM (n = 3, biological replicates), two-tailed, paired t-test.

Source data

Supplementary information

Supplementary Information

Supplementary Notes 1–3, Supplementary Figs. 1–15, Supplementary Methods and captions for Supplementary Tables.

Reporting Summary

Supplementary Table

Supplementary Tables 1–9; combined tables are separated by tabs.

Supplementary Data

Statistical Source Data for Supplementary Figs. 8, 11, 13 and 14.

Source data

Source Data Figures

Statistical Source Data for Figs. 3–6.

Source Data Extended Data Figures

Statistical Source Data for Extended Data Figs. 6–8.

Source Data Fig. 3

Unprocessed northern blots and gels.

Source Data Fig. 4

Unprocessed gels.

Source Data Fig. 5

Unprocessed western blots and gels.

Source Data Extended Data Fig. 1

Unprocessed western blots.

Source Data Extended Data Fig. 7

Unprocessed western blots and northern blots.

Source Data Extended Data Fig. 8

Unprocessed western blots.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Li, B., Liu, S., Zheng, W. et al. RIP-PEN-seq identifies a class of kink-turn RNAs as splicing regulators. Nat Biotechnol 42, 119–131 (2024). https://doi.org/10.1038/s41587-023-01749-0

Download citation

  • Received:

  • Accepted:

  • Published:

  • Version of record:

  • Issue date:

  • DOI: https://doi.org/10.1038/s41587-023-01749-0

This article is cited by

Search

Quick links

Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing