Extended Data Fig. 1: Association of mutational signature prevalence and driver mutations with geographical region, biological sex and EGFR mutation status in LCINS adenocarcinoma cases.
From: The mutagenic forces shaping the genomes of lung cancer in never smokers

a, DBS, ID, CN and SV mutational signatures enrichment analysis with geographical regions. Horizontal lines marking statistically significant thresholds were included at 0.05 (dashed orange line) and 0.01 FDR value levels (dashed red line). Blue-coloured signatures were enriched in North American and European patients, whereas red-coloured signatures were enriched in East Asian patients. Statistical significance was evaluated using multivariable logistic regression models for geographical regions and adjusted by age, sex, and tumour purity. b, SBS, DBS, ID CN, and SV mutational signatures enrichment analysis with biological sexes. Blue-coloured signatures were enriched in males, whereas red-coloured signatures were enriched in females. Statistical significance was evaluated using multivariable logistic regression models for biological sex and adjusted by age, genetic ancestry, and tumour purity. c–e, Detail of the enrichment of EGFR (c), TP53 (d) and KRAS (e) driver mutations in North American and European versus East Asian LCINS adenocarcinoma cases. f, Driver mutations enrichment analysis with biological sexes. Blue-coloured genes were enriched in males, whereas red-coloured genes were enriched in females. Statistical significance was evaluated using multivariable logistic regression models for biological sex and adjusted by age, genetic ancestry, and tumour purity. g, Quantification of the tumour mutational burden for TP53 wild-type and mutant tumours across EGFR mutation status (n = 271 TP53 wild-type EGFR wild-type, n = 241 TP53 wild-type EGFR mutant, n = 81 TP53 mutant EGFR wild-type, n = 144 TP53 mutant EGFR mutant). Statistical significance was evaluated using a multivariable linear regression model for EGFR mutation status and adjusted by age, sex, ancestry, and tumour purity. The line within the box indicates the median, the upper and lower ends indicate the 25th and 75th percentiles, whiskers show 1.5 × interquartile range, and values outside are shown as individual data points. h, SBS, DBS, ID, CN and SV mutational signatures enrichment analysis with EGFR mutation status. Blue-coloured signatures were enriched in EGFR mutant tumours, whereas red-coloured signatures were enriched in EGFR wild-type tumours. Statistical significance was evaluated using multivariable logistic regression models for EGFR mutation status and adjusted by age, sex, genetic ancestry, and tumour purity.