Figure 3

Distributions of somatic variants in DDR and non-DDR interactors. Boxplots showing the contrasting patterns in somatic variants as a function of protein length in DDR and non-DDR (panel a), and as categorized into metastasis and primary tumours (panel b). Outliers in panels (a,b) were removed for clarity, but retained for the T-test statistical analysis. Some genes occur in one or two of the categories in panel b, hence the sum of the N values is higher than in the corresponding plot in panel (a). The N in panel (a) indicates the number of unique DDR genes. All box plots depict the first and third quartiles as the lower and upper bounds of the box, with a thicker band inside the box showing the median value and whiskers representing 1.5 × the interquartile range. Panel (c) shows the accumulation of somatic variants in different biological pathways associated with the 229 DDR genes. For better visualization of the barplot we excluded TP53, which has a large number of recorded variants.