Extended Data Figure 3: Inferred splicing patterns identify recursive splice sites within mammalian >150 kb intron genes.

a–g, RNA-seq (red) read density patterns and normalized FUS iCLIP (green) cross-link density patterns for the OPCML (a), ROBO2 (b), HS6ST3 (c), ANK3 (d), CADM2 (e), NCAM1 (f) and PDE4D (g) genes within human brains. RNA-seq reads and normalized FUS iCLIP cross-links are grouped in 5-kb windows. RefSeq introns >150 kb were searched for novel junctions and linear regression performed on all Ensembl introns >50 kb in which novel junctions were located. Gene isoforms displayed are those including introns within which significant junctions were identified. Red novel junctions represent significant improvements in goodness-of-fit in both RNA-seq and FUS regression analysis (P < 0.01 in both data sets, F-test). Blue novel junctions contact RS-exons. Grey novel junctions were not deemed significant following regression analysis. Zoomed area represents sequence at deep intronic loci surrounding novel junction. Phylo-P conservation track indicates sequence conservation across 46 levels of mammalian evolution.