Fig. 1: Pan-cancer copy number features of 33 tumour types from TCGA.

a, Median number of segments in a copy number (CN) profile (x axis), median proportion of the genome that shows LOH (y axis) and the proportion of samples that have undergone one or more WGD events (size). The line of best fit from a robust linear regression is shown, whereby the colour of points indicates the weight of the tumour type in the regression model. Error bands indicate the 95% confidence interval, n = 33, t = 4.95, P = 2.5e-5. See Supplementary Table 1 for cancer type abbreviations. b, Ploidy characteristics of all samples split by tumour type. Bottom, ploidy (y axis) for each sample in a tumour type (x axis), whereby samples are coloured by their genome doubling status as follows: 0×WGD, non-genome-doubled (green); 1×WGD, genome doubled (purple); and 2×WGD, twice genome-doubled (orange). Top, proportion (Prop.) of samples in each tumour type that are 0, 1 or 2×WGD. Horizontal lines indicate median ploidies. c, Decomposition plots of 21 pan-cancer copy number signatures (CN1–CN21). Heterozygosity (Het) status and total copy number (0–9+) are indicated below each column. Segment sizes are shown on the bottom right. Increasing saturation of colour indicates increasing segment size.