Fig. 4: Pan-cancer analysis of cohorts from TCGA and MGI reveals genes recurrently disrupted by variants that are associated with non-canonical splicing patterns.

Heatmaps showing how often genes are disrupted by variants associated with non-canonical splicing patterns across samples in a given cohort. A Rows correspond to the 40 most frequently recurring genes, as ranked by binomial p-value across cohorts (see Methods, “Identification of genes with recurrent splice-associated variants”). Genes are clustered by whether they were annotated by the CGC as an oncogene (red), an oncogene and tumor suppressor gene (yellow), or a tumor suppressor gene (green). Shading corresponds to −log10(p-value) and columns represent cohorts. Blue marks within cells indicate that the gene was annotated by CHASMplus as a driver within a given TCGA cohort. B Rows correspond to the 40 most frequently recurring genes, as ranked by the fraction of samples across cohorts. Shading corresponds to the fraction of samples and columns represent cohorts. Blue dots within cells indicate that the gene was annotated by CHASMplus as a driver within a given TCGA cohort. These results were obtained using the default splice variant window parameter (i2e3). Source data are provided as a Source Data file.