Fig. 2: Oncogenic AID activity is higher at transcriptionally active domains and differs according to transcription direction.

a Average profile of AID somatic mutations accumulation in 2775 cancer samples and replication timing across 500 kb of TAD boundaries delineating active to inactive domains (left); dot plots representing the distribution of the mutations divided by the domain length in different domain-types (heterochromatine = purple, inactive = gray, repressed = blue, low-active = orange, active = red; right; Wilcoxon rank-sum test). Error bar limits are the 25th and 75th percentiles; the center line is the median. b Average profiles of c-AID mutations accumulation in 2775 cancer samples across 500 kb of TSS for negative-strand genes (top) or positive-strand genes (bottom). Boxplots on the right shows the mutational load comparisons within the TSS and adjacent ∓50 bins on each strand where error bar limits are the 25th and 75th percentiles; the center line is the median and whiskers define the minimal and maximum values. c Volcano plot (n = 1130) showing the genes whose expression and mutations are correlated per tumor type (p-adj < 0.05, Spearman Rho > 0), where colors indicate genes enriched in a specific pathway by DAVID database analysis and the pie chart (inset plot) indicates the distribution across tumor types of the associated genes. All panels were produced using the ICGC cohort.