Main

Chromosomal instability (CIN) is pervasive during cancer evolution, particularly in non-small-cell lung cancer (NSCLC), where it is associated with poor recurrence-free survival1,2,3,4. CIN results in loss of heterozygosity (LOH) events, the burden of which correlates with the frequency of whole-genome doubling (WGD) events in solid tumours5,6. WGD not only mitigates the effect of LOH resulting from CIN, but also fosters ongoing CIN7,8,9,10. By duplicating the complete set of chromosomes, WGD is a key event during cancer evolution and correlates with poor prognosis and targeted therapy resistance7,11,12,13. Despite the importance of CIN and WGD in accelerating cancer evolution by promoting intratumour heterogeneity7, genetic events responsible for the initiation and maintenance of CIN and WGD in NSCLC have not been systematically investigated.

Tracking Cancer Evolution Through Therapy/Rx (TRACERx) is a longitudinal cancer study that utilizes multiregional sampling and whole-exome sequencing (WES)3 to identify and time genetic alterations and their relationship with WGD and CIN3. Here, we identify mutations in cancer driver genes that co-occur with WGD and CIN and further characterize their respective involvement in the DNA damage response (DDR) and CIN. We demonstrate that genetic perturbation of six TRACERx driver genes identified in our screen (particularly FAT1) causes deficiencies in homologous recombination (HR) repair and elevated CIN. Using NSCLC cell line models, we further characterize the molecular mechanisms by which FAT1 alterations drive WGD and CIN—known mediators of drug resistance12,14,15,16. These findings highlight the importance of FAT1 gene alterations in lung cancer evolution.

Results

Identification of DDR and CIN drivers in lung TRACERx

To identify alterations that correlate with WGD and CIN in NSCLC, we analysed multiregional WES data from tumours obtained from the first 100 patients within the lung TRACERx study3. Here, we identified 795 driver events in 91 genes, excluded known oncogenes that could not be modelled appropriately by genetic depletion approaches and focused on 37 tumour suppressor gene mutations that co-occurred with WGD (Fig. 1a). Pathway analysis revealed cellular processes related to genome maintenance, such as the DDR, transcription and chromatin remodelling (Supplementary Fig. 1)17,18,19. To assess how these genes contribute to genome integrity, we performed a multi-parametric RNA interference (RNAi) screen using high-content imaging in four different lung cancer cell lines harbouring mutations in KRAS, EGFR and TP53 to reflect the mutational landscape of the TRACERx cohort (Fig. 1b (top) and Extended Data Fig. 1a). DNA double-strand breaks (DSBs) induced by ionizing radiation, as well as replication stress induced by hydroxyurea, were chosen to model genotoxic stress. In parallel, the impact on chromosome loss was investigated utilizing a human artificial chromosome (HAC) reporter system20 (Fig. 1b (bottom) and Extended Data Fig. 1b). These combined approaches enabled the identification of six tumour suppressor driver genes—namely BAP1, CREBBP, FAT1, NCOA6, RAD21 and UBR5—as regulators of the DDR and maintenance of chromosomal stability in NSCLC (Fig. 1c). In addition, we also confirmed previously reported roles of WRN, FANCM, DICER1, SMARCA4/BRG1, ARID1B, ARID2, KDM5C and ATRX in the DDR21,22,23,24,25,26 (Extended Data Fig. 1a).

Fig. 1: DDR and CIN loss-of-function screen of genome doubling-associated drivers from the TRACERx 100 cohort.
figure 1

a, Flow chart depicting candidate gene selection for the DDR and CIN screens. b, Schematic of the design of the DDR and CIN screens. c, Venn diagram showing the six driver genes contributing to DDR and CIN. d, Validation of the six candidate genes by DR-GFP homologous recombination reporter assay; BRCA2 serves as a positive control. HR efficiencies are normalized to those of control samples. Statistical significance was determined by two-sided, one-sample t-test. The data represent means ± s.d. (n = 3 biological replicates, except for BRCA2, for which n = 2). e, Validation of the six candidate genes by DIvA U2OS-AsiSI site-directed resection assay. Statistical significance was determined by two-sided paired t-test. The data represent means ± s.d. (n = 3 biological replicates). f, Box plots quantifying RAD51 foci formation in A549 cells following depletion of the six candidate genes, following 6 Gy ionizing irradiation and 1 h of recovery. The box edges represent interquartile ranges, the horizontal lines represent median values and the ranges of the whiskers denote 1.5× the interquartile range (n = 3 biological replicates; >150 cells quantified per biological replicate). Statistical significance was determined by Kruskal–Wallis test with Dunn’s multiple comparison test. g, Driver mutation distribution and mutational timing of the six candidate genes in the TRACERx 421 cohort. ATM, CHEK2, ATR, CHEK1 and members of the Fanconi anaemia (FA)/BRCA pathway are included for comparison. FAT1 is highlighted in red. CTRL, control; EGFR, epidermal growth factor receptor; FACS, fluorescence-activated cell sorting; HU, hydroxyurea; IR, ionizing radiation; mut, mutant; nt, nucleotides; WT, wild type.

Source data

The effect of these genes on the DDR was initially validated using two orthogonal approaches; the DR-GFP HR reporter assay27 and a site-specific DSB-generating endonuclease system (DIvA)28. Loss of BAP1, FAT1, NCOA6, RAD21 or UBR5, but not CREBBP, was associated with HR repair deficiency and impaired single-stranded DNA (ssDNA) resection—a step required for accurate HR repair (Fig. 1d,e and Supplementary Fig. 2). Next, we assessed early HR repair signalling 1 h after 6 Gy ionizing radiation in both A549 and H1944 cells. A marked decrease in the formation of RAD51 ionizing radiation-induced foci (IRIFs) was observed following depletion of BAP1, FAT1, NCOA6, RAD21 or UBR5 (Fig. 1f, Extended Data Fig. 2a,b and Supplementary Figs. 3 and 4). These alterations in HR efficiency could not be fully explained by cell cycle changes (Extended Data Fig. 2c), which dictate the selection of activating HR or non-homologous end-joining (NHEJ) repair, suggesting that these genes may be involved in HR directly. FAT1 was prioritized as a top candidate of clinical relevance, as inactivating mutations in FAT1 were highly recurrent (~10%) in the TRACERx 421 cohort (comprising 421 patients and 1,644 tumour regions), with a notable proportion of FAT1 mutations occurring early before WGD (Fig. 1g and Supplementary Fig. 5a–c). Notably, ~20% of the lung TRACERx cohort were found to harbour other inactivating mutations that impair HR efficiency (Fig. 1g and Extended Data Fig. 2d).

FAT1 alterations are positively selected in lung cancer

To quantify whether mutations in FAT1 were under positive selection, we measured the enrichment of FAT1 mutations before and after WGD using the ratio of the observed number of non-synonymous substitutions per non-synonymous site to the number of synonymous substitutions per synonymous site (dN/dS)29. Estimates of dN/dS above or below 1 suggest positive or negative selection, respectively, whereas estimates overlapping 1 imply that there is no evidence of selection. FAT1 mutations, which occurred more frequently in lung squamous cell carcinoma (LUSC) compared with lung adenocarcinoma (LUAD), were under greater positive selection before WGD occurrence in LUSC (Fig. 2a and Extended Data Fig. 3a). In the TRACERx 421 cohort, an enrichment of copy number deletion events was identified around the FAT1 genomic locus 4q35.2 only in patients with clonal WGD, indicating positive selection of 4q35.2 loss (Fig. 2b and Extended Data Fig. 3b–d). In LUSC, FAT1 promoter hypermethylation events reducing FAT1 expression levels and FAT1 copy number loss events were observed in the same tumours (Extended Data Fig. 3e–g). Furthermore, we detected a significant occurrence of mirrored subclonal allelic imbalance (MSAI) at the 4q35.2 locus, suggesting parallel evolution (Fig. 2c). Among genes encoded at the 4q35.2 locus, FAT1 has the lowest Genome Aggregation Database loss-of-function score in germline samples, implying that FAT1 loss is the least tolerated event within 4q35.2 (Fig. 2d). These results highlight the importance of FAT1 alterations.

Fig. 2: FAT1 loss of function is enriched in the TRACERx 421 cohort and leads to an elevated mitotic error rate and WGD.
figure 2

a, Top, schematic of dN/dS ratio analysis. Bottom, results of dN/dS ratio analysis in the TRACERx 421 cohort, demonstrating that FAT1 truncation mutations are selected early in LUSC tumour evolution. The data points represent estimated dN/dS ratios and the error bars represent 95% confidence intervals calculated using the genesetdnds function from the R package dNdScv. The TRACERx 421 cohort comprised 233 males and 188 females (421 patients total), corresponding to a 55:45 male:female ratio. 93% of the cohort was from a White ethnic background and the mean age of the patients was 69 years, ranging between 34 and 92 years. Written informed consent was obtained. None of the patients was compensated for their involvement in the study. b, Genomic identification of significant targets In cancer (GISTIC) analysis of LUAD (141 patients) and LUSC tumours (80 patients) in TRACERx with clonal WGD only, demonstrating that SCNA loss at the FAT1 genomic locus (4q35.2; red text and highlighted) is positively selected in tumours with clonal WGD only. SCNA loci overlapping with common or rare chromosome fragile sites72,73 are annotated (in blue for common fragile sites and in green for rare fragile sites). c, MSAI analysis illustrating that the genomic region of chromosome 4 that harbours the FAT1 gene (arrows) is frequently lost in LUSC. Statistical significance was determined by Fisher’s exact test. In the schematic at the top, paternal and maternal chromosomes are indicated in blue and red, respectively. d, Top, schematic illustrating the location of the FAT1 gene on chromosome 4, together with other 4q35.2 genes within the frequently lost 4q35.2 genomic region. Bottom, selection pressures against losing genes. The data are from the Genome Aggregation Database (gnomAD) and demonstrate high selective pressure against deletion of the FAT1 genomic locus within 4q35.2. exp, expected; LOF, loss of function; obs, observed.

FAT1 ablation reduces HR efficiency

Considering the frequency of FAT1 alterations in NSCLC (Figs. 1g and 2, Extended Data Fig. 3 and Supplementary Fig. 5) and its potential role in genome maintenance (Fig. 1d–f), we further elucidated at which stage FAT1 acts in the DSB repair pathway by systematically investigating which mediators were affected by FAT1 depletion 1 h post-ionizing radiation. FAT1 knockdown did not impact IRIFs of early DSB repair mediators, including phosphorylated ATM, γH2A.X and 53BP1 oligomerization, which are associated with NHEJ22 (Fig. 3a and Supplementary Fig. 6a,b). However, IRIF formation of CtBP interacting protein (CtIP), which is responsible for initiating ssDNA resection22, was significantly impaired by FAT1 knockdown (Fig. 3a and Supplementary Fig. 6c). FAT1 depletion also reduced IRIF formation of the key HR mediator breast cancer type 1 susceptibility protein (BRCA1) in G2/M cells, using centromere protein F-positive staining as a marker (Fig. 3a and Supplementary Fig. 6d). CRISPR knockout of FAT1 in H1944 or A549 cells impaired RAD51 IRIF formation, but not γH2A.X (Extended Data Fig. 4a,b and Supplementary Fig. 6e). A time-course post-ionizing radiation demonstrated that FAT1 depletion was associated with persistent DNA damage, as manifested by the increased frequency of 53BP1 nuclear bodies (Extended Data Fig. 4c).

Fig. 3: FAT1 loss attenuates HR repair.
figure 3

a, Box plots demonstrating the impact of FAT1 siRNA knockdown on early DNA damage signalling and 53BP1 binding in A549 cells. The boxes represent interquartile ranges, the black and red bars represent median and mean values, respectively, and the ranges of the whiskers denote 1.5× the interquartile range. Statistical significance was determined by two-sided Wilcoxon rank-sum test (n = 3 biological replicates). The total numbers of cells quantified per condition were as follows: n ≥ 370 (pATM), n ≥ 438 (γH2A.X), n ≥ 448 (53PB1), n ≥ 560 (CtIP) and n ≥ 218 (BRCA1). b, Schematic of the FAT1 functional domains. The full-length FAT1 protein is 4,588 amino acids. c, RAD51 IRIF formation following 6 Gy ionizing radiation and 1 h of recovery in FAT1 CRISPR knockout (sgFAT1) versus control A549 cells with overexpression of HA–FAT1ICD versus pcDNA3.1. The boxes represent interquartile ranges, the black and red bars represent median and mean values, respectively, and the ranges of the whiskers denote 1.5× the interquartile range. Statistical significance was determined by two-sided Kruskal–Wallis test followed by Dunn’s test with Bonferroni correction (n = 3 biological replicates). df, Top, cartoons depicting examples of HRD-related large-scale transition (LST; d), telomeric allelic imbalance (TAI; e) and LOH (f). Bottom left, Permutation analysis showing a correlation between FAT1 CNA and HRD-related genomic signatures based on TCGA LUAD data. Red lines indicates 90 and 95% confidence intervals, blue line indicates observed correlation value. Bottom right, FAT1 driver mutation scores for these respective genetic alterations, based on TRACERx LUAD data. For the TRACERx LUAD data, tumour numbers were as follows: n = 212 (WT) and n = 17 (mut). In the box and whisker plots, the boxes represent interquartile ranges, the lines represent median values and the ranges of the whiskers denote 1.5× the interquartile range. Statistical significance was determined by two-sided mixed-effects linear model with purity as a fixed covariate and tumour ID as a random variable. g, Top, cartoon showing the design of the EJ5–GFP distal end-joining reporter integrated in U2OS cells. Bottom, 53BP1 siRNA knockdown, but not FAT1 knockdown, affects the distal end-joining rate. The data represent means ± s.d. Statistical significance was determined by two-sided repeated measures one-way analysis of variance (ANOVA) with Holm–Šidák correction (n = 5 biological repeats). h, Top, cartoon showing the design of the EJ2–GFP alternative end-joining reporter integrated into U2OS cells. Bottom, FAT1 siRNA knockdown significantly reduces the alternative end-joining efficiency. The data represent means ± s.d. Statistical significance was determined by two-sided paired t-test (n = 4 biological repeats). EGF, epidermal growth factor-like domain; LAMG, laminin G-like domain; NLS, nuclear localization signal.

Source data

Although the size of full-length FAT1 (4,588 amino acids) limits its ectopic expression, the reduction in RAD51 foci formation could be partially rescued by overexpression of the FAT1 carboxy (C)-terminal intracellular domain30 (HA–FAT1ICD; amino acids 4,202–4,588), which exhibited nuclear localization, suggesting that nuclear FAT1 may promote efficient HR (Fig. 3b,c and Extended Data Fig. 4d,e). FAT1-knockout A549 cells were more sensitive to genotoxic stress induced by poly(ADP-ribose) polymerase inhibitors, cisplatin and hydroxyurea (Extended Data Fig. 5a,b). By analysing both the TRACERx and The Cancer Genome Atlas (TCGA) LUAD datasets, we observed a correlation between FAT1 loss and HR deficiency (HRD)-related genomic signatures, including elevated telomeric allelic imbalance (TAI)31, large-scale transitions (LST)32 and loss of heterozygosity (LOH)33 (Fig. 3d–f). Using established reporters for distal and alternative end-joining activities, respectively34, we confirmed that transient FAT1 siRNA depletion reduced the alternative end-joining efficiency without significantly reducing distal end joining (Fig. 3g,h). Utilizing WGS data from Genomics England, no significant difference was observed in ID6 and SBS3 mutational profiles (Extended Data Fig. 5c,d and Supplementary Fig. 7a,b), both of which are mutation signatures associated with NHEJ activity35.

FAT1 ablation leads to structural and numerical CIN

Indeed, FAT1 depletion resulted in an increased fork collapse rate and HAC loss (Fig. 4a and Extended Data Fig. 1b). Our analysis of the TCGA and Genomics England datasets revealed that FAT1 loss correlated with an increased weighted genome instability index score and total mutational burden, indicating elevated structural and numerical CIN (Fig. 4b,c). To validate this observation, we used U2OS and type 2 pneumocyte (T2P) cells to investigate the formation of micronuclei and 53BP1 nuclear bodies in G1 daughter cells, both established markers of unresolved replication stress and HRD36,37,38,39. FAT1 ablation at baseline and under replication stress induced by either low-dose aphidicolin or a short pulse of hydroxyurea exacerbated the formation of cyclin A-negative (G1-specific) 53BP1 nuclear bodies and micronuclei (Fig. 4d–h and Extended Data Fig. 6a). Notably, acentric micronucleus formation was significantly elevated following FAT1 depletion in A549 and U2OS cells following replication stress (Fig. 4f,g). Under these conditions, FAT1 ablation also resulted in an increased mitotic error rate at baseline and under replication stress, which manifested as an increased formation rate of chromatin bridges and lagging chromosomes (Fig. 5a and Extended Data Fig. 6b). Utilizing the DIvA site-directed DSB system28,40, we further demonstrated that FAT1 depletion resulted in increased illegitimate translocation of two DSBs induced on chromosome 17 (Fig. 5b). Concurrently, higher rates of structural chromosomal aberrations, including radial chromosomes and chromatid gaps, were observed upon FAT1 loss (Fig. 5c and Extended Data Fig. 6c). FAT1 silencing also reduced mitotic fidelity, as evidenced by deviations in the modal chromosome number (Fig. 5d–f and Extended Data Fig. 6d).

Fig. 4: FAT1 loss elevates replication stress and micronuclei.
figure 4

a, FAT1 knockout exacerbates replication fork stalling in A549 cells. Top, scheme of the nucleotide labeling used to measure replication fork stalling. Bottom (left) quantification; (right), representative image for the DNA fibre experiments. The data represent means ± s.d. Statistical significance was determined by two-sided paired t-test (n = 3 biological replicates; >600 forks counted in total). Scale bars, 20 µm. b, TCGA LUAD analysis showing that FAT1 copy number loss is significantly correlated with weighted genome instability index measurements. The blue lines indicate FAT1 loss and the red dotted lines indicate the 90 and 95% confidence intervals. Confidence intervals were generated using computational permutation analyses. c, Box plot comparing the numbers of indels in FAT1 WT versus mutated tumours in the Genomics England LUAD and LUSC cohorts. The boxes represent interquartile ranges, the lines represent median values and the ranges of the whiskers denote 1.5× the interquartile range. Statistical significance was determined by two-sided Wilcoxon rank-sum test. n = 818 (WT) and n = 16 (mut). d, Transient FAT1 siRNA knockdown induces the formation of 53BP1 bodies in cyclin A-negative U2OS cells following 4 mM hydroxyurea for 5 h and recovery for 24 h. Statistical significance was determined by two-sided Wilcoxon rank-sum test. Scale bars, 10 μm. The red bars in the graph to the left represent mean values (n = 5 biological replicates). e,f, Transient FAT1 siRNA knockdown in U2OS cells induces the formation of total micronuclei with or without replication stress induced by 5 h of 4 mM hydroxyurea followed by 24 h recovery (e), as well as the formation of both acentric and centromeric micronuclei following the hydroxyurea treatment (f). The data represent means ± s.d. Statistical significance was determined by one-way ANOVA with Bonferroni correction. Biological repeats: n = 8 (e) and n = 4 (f). g,h, FAT1 loss elevates the rate of micronuclei formation in response to replication stress induced by 0.2 µM aphidicolin treatment (24 h) following FAT1 CRISPR knockout in A549 cells (g) or transient siRNA knockdown in T2P cells (h). The data represent means ± s.d. Statistical significance was determined by two-sided Student’s t-test. Biological repeats: n = 4 (total micronuclei in g), n = 3 (centromeric and acentric micronuclei in g) and n = 8 (h).

Source data

Fig. 5: FAT1 loss increases structural CIN and chromosome numbers.
figure 5

a, Transient FAT1 knockdown significantly increases the mitotic error rate (lagging chromosomes plus DAPI bridges; left; data represent means ± s.d.) and the occurrence of nucleoplasmid bridges (middle; red bars represent mean values) in U2OS cells after 5 h treatment with 4 mM hydroxyurea and 24 h recovery. Statistical significance was determined by one-way ANOVA with Bonferroni correction (left) or Dunn’s test (middle). Right, selected maximum projection images following FAT1 knockdown, showing DAPI-stained mitotic U2OS cells following treatment with 4 mM hydroxyurea and 24 h recovery. Scale bars, 5 μm. Over 100 mitotic cells were scored across three biological replicates. b, Representative PCR-based semi-quantitative DIvA U2OS-AsiSI translocation assay. Transient FAT1 siRNA knockdown increases illegitimate repair products. PCR products generated from the uncut region and the legitimate repair product were used as the loading control. n = 3 biological replicates. c, Histogram (left) and representative images (right) showing that A549 cells with FAT1 loss exhibit a significantly increased number of chromosomal aberrations upon challenge with replication stress induced by 0.2 µM aphidicolin (APH) treatment. Scale bar, 5 μm. The data represent means ± s.d. Statistical significance was determined by one-way ANOVA with Holm–Šidák correction. A total of 60 metaphases were scored across three biological replicates per condition. Blue and red arrows indicate radial chromosomes and chromatid gaps, respectively. df, Transient FAT1 siRNA knockdown causes a significant numerical deviation in chromosome number in H1944 cells, as determined by multiple methodologies, including clonal fluorescence in situ hybridization (d), ImageStream high-throughput flow cytometry (e) and metaphase spreads (f). The histogram data represent means ± s.d. For the box plots, the boxes represent interquartile ranges, the black and red lines represent median and mean values, respectively, and the ranges of the whiskers denote 1.5× the interquartile range. Statistical significance was determined by two-sided Wilcoxon rank-sum test (d) or two-sided paired t-test (e). n = 3 biological replicates for all cases. bp, base pairs. NT, non-targeting.

Source data

Since FAT1 mutations are both common in lung cancer and evolutionarily selected before WGD (Fig. 2a–c), we experimentally validated the mitotic defect associated with FAT1 loss using the near-triploid (3N) LUAD cell line PC9 (which harbours an in-frame deletion at exon 19 of the epidermal growth factor receptor-encoding gene) and its isogenic WGD hexaploid (6N) clone12. Transient FAT1 depletion in PC9 cells significantly elevated the rate of stalled replication forks and mitotic errors, further confirming the involvement of FAT1 in genome maintenance (Extended Data Fig. 6e–g). Despite resulting in an elevated replication fork collapse rate, reduced formation of interphase and mitotic Fanconi anaemia complementation group D2 (FancD2) foci was also observed in FAT1-depleted WGD PC9 cells, suggesting a failure to recover from replication stress, leading to structural CIN (Extended Data Fig. 6e,g–j).

FAT1 depletion leads to mitotic errors and WGD

WGD events were highly prevalent in the lung TRACERx 421 cohort (84% of LUSC and 77% of LUAD) and FAT1 driver mutations were selected before WGD in LUSC tumours (Fig. 2a). However, FAT1 mutations and WGD did not significantly co-occur in the TRACERx 421 cohort (Fisher’s exact test; P = 0.179) (Extended Data Fig. 7a). This was probably due to the presence of other HR-related gene alterations (~20% of patients; Fig. 1g and Extended Data Fig. 2d), which also can contribute to WGD. To investigate whether FAT1 alterations drive WGD, we used the PC9 lung cancer model to quantify the proportion of actively replicating cells with >6N genome content (the basal ploidy of PC9 is 3N). Increased 5-ethynyl-2′-deoxyuridine (EdU) incorporation rates (Fig. 6a,b), as well as a significant increase in loading of the replicative helicase MCM7 beyond 6N, were observed in FAT1-knockout PC9 cells (Extended Data Fig. 7b), both indicating a second replication event post-6N and suggesting that FAT1 loss is associated with WGD. These observations were independent of p53 mutational status, as similar results were obtained in TP53 wild-type, near diploid, untransformed retinal pigment epithelial-1 (RPE-1) cells immortalized with the human telomerase reverse transcriptase subunit (hTERT) (Extended Data Fig. 7c).

To identify the cause of WGD following FAT1 depletion, we monitored cells using live-cell microscopy, tracking at single-cell resolution. Reported causes of WGD include cytokinesis defects, endoreplication, mitotic bypass and cyclin B1 dysregulation during G2 (refs. 41,42,43,44). No change in cyclin B1 level was observed upon FAT1 knockdown in G2, ruling out cyclin B1 dysregulation as the cause of WGD in the absence of FAT1 (Extended Data Fig. 7d,e). DNA synthesis, measured by EdU incorporation, was also unaltered in control versus FAT1-knockout WGD cells transiently blocked in mitosis using nocodazole (Extended Data Fig. 7f), thereby ruling out a role for FAT1 loss in driving WGD through endoreplication in a manner similar to that of cyclin E amplification reported recently44. To investigate mitotic bypass, we used the hTERT RPE-1 cell line expressing both H2B-mTurquoise and fluorescent, ubiquitination-based cell cycle indicator (FUCCI) (hereafter FUCCI–RPE-1 cells) for live-cell imaging (Supplementary Video 1). FAT1 depletion, irrespective of the induction of aphidicolin-induced replication stress, did not cause a significant increase in the rate of mitotic bypass (Fig. 6c and Supplementary Fig. 8). In contrast, FAT1-depleted cells demonstrated an elevated rate of cytokinesis failure, suggesting defects in this final step of cell division as the cause of the WGD associated with FAT1 deficiency (Fig. 6d, Extended Data Fig. 7g and Supplementary Video 2). FAT1 depletion was also associated with an increased rate of nuclear shape abnormalities in daughter cells after normal mitoses (Fig. 6e and Supplementary Video 3). Similarly, after FAT1 depletion, increases in multinucleation and nuclear morphology alterations were observed in fixed U2OS and RPE-1 cells (Extended Data Fig. 7h,i) and WGD PC9 cells, respectively, after replication stress exposure (Extended Data Fig. 8a).

Fig. 6: FAT1 loss leads to an elevated mitotic error rate and results in WGD.
figure 6

a, Representative dot plots demonstrating FAT1 ablation in PC9 cells and assessment of EdU incorporation beyond the normal G2 phase, to visualize WGD. b, Top, representative western blot validating FAT1 knockout. Bottom, quantification of EdU incorporation beyond the normal G2 population showing that FAT1 knockout significantly increases the WGD population in PC9 cells. The data represent means ± s.d. Statistical significance was determined by one-way ANOVA with Bonferroni correction. Biological repeats: n = 7 (sgNT), n = 5 (sgFAT1 clone 1) and n = 4 (sgFAT1 clone 2). c, Schematic (left) and histogram (right) showing the impact of transient FAT1 knockdown in TERT RPE-1 cells on the promotion of WGD through mitotic bypass, as determined by live-cell imaging. The data represent means ± s.e.m. Statistical significance was determined by one-way ANOVA with Bonferroni correction. Biological repeats: n = 3 (with aphidicolin treatment) and n = 6 (without aphidicolin treatment). d,e, Schematics (left) and histograms (right) showing that transient FAT1 siRNA knockdown in TERT RPE-1 cells increases the rates of cytokinesis failure (d) and nuclear shape deformation (e), as determined by 30× magnification live-cell microscopy imaging at 20 min intervals. The data represent means ± s.e.m. Statistical significance was determined by two-sided paired t-test, At least 200 mitotic events were tracked per condition over five biological replicates. YFP, yellow fluorescent protein.

Source data

To delineate whether structural CIN precedes nuclear shape abnormalities, we performed live-cell spinning-disk confocal microscopy on FUCCI–RPE-1 cells in G2, allowing us to determine the timing and outcome of the pending mitosis and the fate of respective daughter cells at high resolution with low phototoxicity. We observed that FAT1 depletion not only increases chromatin bridge formation rates (Extended Data Fig. 8b, left) but reduces the maintenance of normal nuclear morphology following mitotic chromosomal bridge formation (Extended Data Fig. 8b (right) and Supplementary Videos 4 and 5). Next, we investigated the long-term outcome of daughter cells with deformed nuclear morphology, by observing the heritability and recurrence of nuclear deformities and mitotic errors in respective daughter cells following mitosis (Extended Data Fig. 8c). In addition, the number of normal second mitoses was significantly reduced following an initial nuclear deformation event (Extended Data Fig. 8c). Furthermore, FAT1-depleted cells had an accelerated rate of metaphase entry (Extended Data Fig. 8d), suggestive of a potential role for FAT1 in regulating mitotic timing. Taken together, these results demonstrate that FAT1 loss contributes to both an increased rate of structural CIN and elevated mitotic defects in daughter cells following CIN, contributing to WGD.

Since elevated CIN is known to synergize with WGD to generate increased intratumour heterogeneity3,45 and escape from targeted therapeutic pressure12,14,46,47, we also investigated whether FAT1 loss might enable WGD PC9 cells to accelerate cancer evolution and bypass targeted therapy treatment. PC9 cells were treated with a sensitizing concentration of the epidermal growth factor receptor inhibitor osimertinib (>90% inhibitory concentration) for 5 weeks (Extended Data Fig. 8e,f). An increase in clonal survival and clonal derivation rate was observed for FAT1-knockout cells over 3 months (Extended Data Fig. 8g,h). Osimertinib-resistant FAT1-knockout clones exhibited elevated ploidies compared with control clones (Extended Data Fig. 8i). Taken together, these results suggest that FAT1 loss might fuel WGD and elevated CIN, which in turn exacerbates cancer evolution and targeted therapy resistance, as previously demonstrated12,14,46,47.

FAT1 depletion leads to dysregulated Yes-associated protein 1 signalling

To determine the molecular mechanism by which FAT1 alterations lead to CIN, we investigated components of potential signalling pathways in which FAT1 has been implicated. FAT1 loss has been connected to dysregulation of the Hippo pathway, RAS–RAF–MEK–MAPK signalling and regulation of the level of Yes tyrosine kinase, which mediates hybrid–epithelial–mesenchymal transition48,49. Under our experimental conditions, no consistent FAT1 siRNA-mediated alterations of MEK–ERK phosphorylation or Yes tyrosine kinase levels (Fig. 7a) were observed. However, FAT1 depletion resulted in elevated nuclear localization of the key Hippo pathway activator YAP/TAZ (Fig. 7b and Extended Data Fig. 9a,b). Despite increased Yes-associated protein 1 (YAP1) nuclear localization, no significant alteration in YAP1 phosphorylation could be consistently observed following LATS1 or FAT1 depletion (Extended Data Fig. 9c) in T2P or FUCCI–RPE-1 cells.

Fig. 7: FAT1 loss leads to dysregulation of the Hippo pathway.
figure 7

a, Western blot demonstrating the impact of FAT1 knockdown on the Src–Mek–Erk signalling axis in hTERT RPE-1 and T2P cells. The results are representative of three repeats. b, Scatter plot showing the impact of transient siRNA depletion of FAT1 on nuclear YAP1 localization using the stringent PTEMF fixation buffer (Methods) in TERT RPE-1 cells. The red bars represent median values. Statistical significance was determined by two-sided Kruskal–Wallis test followed by Dunn’s multiple comparisons test. Over 170 cells were scored per condition over three biological replicates. c, Top, schematic illustrating the predicted domains of the FAT1 protein and respective regions cloned into a pCMV expression plasmid with an HA epitope tag. Bottom, TEAD activity in FAT1 knockout PC9 hexaploid WGD cells. The normalized TEAD activity was measured as an enrichment of the neonGreen signal over the untransfected background signal in each experiment. The HA signal was used to identify successful FAT1 rescue construct co-transfection at the single-cell level. TEAD activity was elevated in FAT1-knockout cells but could be further increased by overexpressing the constitutively active HA–YAP15SA mutant. Overexpression of the FAT1 wild-type construct repressed TEAD activity. However, overexpression of FAT1 mutants devoid of the MIB2 binding region (mScarlet–HA–FAT1MIB∆) and HA–FAT1ICD did not repress TEAD activity. The edges of the histograms represent mean values. Statistical significance was determined by two-sided Kruskal–Wallis test followed by Dunn’s multiple comparisons test. More than 95 cells were scored over three biological replicates. d, DSB resection assay, showing that transient knockdown of LATS1 and LATS2—both negative regulators of YAP1—represses ssDNA formation at DSB break sites (chr22:37194035, ssDNA measured 131 nucleotides from the DSB) in U2OS-AsiSI cells. The data represent means ± s.d. (n = 4 biological replicates). Statistical significance was determined by two-sided paired t-test. e, Both LATS1 and LATS2 siRNA knockdown in U2OS cells elevate rates of 53BP1 nuclear body formation when challenged with replication stress (5 h of 4 mM hydroxyurea followed by 24 h recovery). The boxes represent interquartile ranges, the black and red lines represent median and mean values, respectively, and the ranges of the whiskers denote 1.5× the interquartile range. Statistical significance was determined by two-sided Wilcoxon rank-sum test. More than 340 cells were scored over three biological replicates. f, Both LATS1 and LATS2 siRNA knockdown in U2OS cells induce centromeric and acentric micronuclei formation following challenge with replication stress (5 h of 4 mM hydroxyurea followed by 24 h recovery), suggestive of a mitotic segregation deficiency. Statistical significance was determined by repeated measures one-way ANOVA. The data represent means ± s.d. (n = 3 biological replicates). g, Transient siRNA knockdown of LATS1 in U2OS cells elevates the mitotic error rate. The data represent means ± s.d. Statistical significance was determined by repeated measures one-way ANOVA (n = 3 biological replicates). MIB2, MindBomb2-interacting domain; mS, mScarlet; TM, transmembrane domain.

Source data

Orthogonally, we designed a neonGreen reporter with 14xTEAD binding sites to elucidate how FAT1 regulates the ultimate output of the Hippo pathway, TEAD transcriptional activity. This is particularly important since YAP1 and TAZ/WWTR1 are highly analogous, functionally overlapping50 and both amplified in lung cancer3,51 (Extended Data Fig. 9d). By scoring HA-tagged construct and neonGreen reporter double-positive cells, we confirmed that FAT1 loss elevated TEAD transcriptional activity, despite the presence of multiple copies of YAP/TAZ in hexaploid WGD PC9 cells (Fig. 7c, condition 1 versus condition 2). Notably, overexpression of the constitutively active YAP1 mutant construct52 further stimulated TEAD activity in FAT1-knockout cells (HA-YAP15SA; Fig. 7c, condition 3).

Next, we investigated which FAT1 domains are required to re-suppress TEAD transcriptional activity. In addition to the C-terminal HA–FAT1ICD fragment capable of rescuing RAD51 IRIF formation (Fig. 3b,c), a FAT1 construct encompassing the extracellular and transmembrane domain was generated (mScarlet–HA–FAT1WT in Fig. 7c), consisting of 3,735–4,588 amino acids of FAT1. Since FAT1 is known to cooperate with the E3 ligase MIB2 to regulate YAP/TAZ signalling53, we also generated a FAT1 mutant lacking the MIB2 interaction domain (mScarlet–HA–FAT1MIB∆ in Fig. 7c). Indeed, the mScarlet–HA–FAT1WT fragment successfully re-repressed TEAD transcription, whereas the FAT1 construct lacking the MIB2 interaction domain failed to do so (Fig. 7c (conditions 4 and 5) and Extended Data Fig. 9e), highlighting the role of FAT1–MIB2 interaction in the modulation of TEAD transcription. Notably, the HA–FAT1ICD construct, which lacks a transmembrane domain anchor compared with HA–FAT1WT (Supplementary Fig. 9), failed to re-repress TEAD transcription (Fig. 7c, condition 6), despite successfully rescuing RAD51 IRIF formation in A549 cells (Fig. 3b,c).

Next, we investigated whether depletion of LATS1/2—crucial modulators of YAP1 nuclear localization54—might lead to phenotypes resembling those of DDR and CIN observed following FAT1 depletion. Indeed, a marked reduction of the DNA end-resection rate, 53BP1 nuclear body formation and an increase in micronuclei formation were all observed following depletion of either LATS1 or LATS2 (Fig. 7d–f and Extended Data Fig. 10a). Using the site-directed endonuclease DIvA system, depletion of LATS2 caused illegitimate DSB repair, culminating in a translocation (Extended Data Fig. 10b). Notably, somatic copy number alteration (SCNA) loss of LATS2 genomic loci at 13q12.11 was positively selected for in LUSC (Extended Data Fig. 10c). Despite causing HRD, neither LATS1 nor LATS2 depletion disrupted the activation of early DDR signalling, such as phosphorylation of KAP1/TRIM28 or γH2A.X phosphorylation (Extended Data Fig. 10a). Similar to FAT1 depletion, LATS1 knockdown led to an elevated mitotic error rate (Fig. 7g).

FAT1 and YAP1 co-depletion reverses WGD but not HR defects

Given the involvement of FAT1 loss in YAP/TAZ activity (Fig. 7b,c), we systematically investigated whether WGD and numerical and structural CIN associated with FAT1 loss could be reversed by co-depleting YAP1. Despite FAT1/YAP1 co-depletion reversing the cell cycle arrest associated with YAP1 single knockdown (Extended Data Fig. 10d,e), co-depletion of FAT1 and YAP1 did not rescue HR activation defects after DSB formation (Fig. 8a–c). This result was orthogonally validated using the DR-GFP reporter assay, ssDNA formation after endonuclease-induced DSBs and RAD51 foci formation after ionizing irradiation (Fig. 8a–c). Next, since unresolved recombination intermediates due to HR repair deficiency can cause mitotic errors55, we quantified the rate of mitotic errors. YAP1/FAT1 co-depletion failed to rescue the elevated mitotic error and chromosomal bridges observed in FAT1-knockout A549 cells (Fig. 8d,e and Extended Data Fig. 10f).

Fig. 8: Co-depletion of FAT1 and YAP1 reverses cytokinesis failure but not HR deficiencies.
figure 8

a, Impact of FAT1/YAP1 siRNA co-depletion in U2OS cells, as determined by DR-GFP HR reporter assay. MRE11A siRNA served as a positive control. The HR efficiencies are normalized to those of the control samples. Statistical significance was determined by one-way ANOVA with Holm–Šidák correction. The data represent means ± s.d. Biological replicates: n = 5 (siCTRL and siFAT1), n = 3 (siYAP1) and n = 4 (siFAT1 + siYAP1 and siMRE11A). b, ssDNA resection rates for FAT1/YAP1 siRNA co-depletion, as determined by DIvA U2OS-AsiSI site-directed resection assay with the DSB site located at chr22:37194035, ssDNA measured 131 nucleotides from the DSB. Statistical significance was determined by one-way ANOVA with Holm–Šidák correction. The data represent means ± s.d. (n = 3 biological replicates). c, Box plots quantifying RAD51 foci formation in A549 cells following the loss of FAT1, or the combined loss of both FAT1 and YAP1, after 6 Gy ionizing irradiation and 1 h recovery. The boxes represent interquartile ranges, the black and red lines represents median and mean values, respectively, and the ranges of the whiskers denote 1.5× the interquartile range. Over 380 cells were scored across three biological replicates. Statistical significance was determined by uncorrected Dunn’s test. d,e, Plots (d) and representative images (e) illustrating the quantification of mitotic error rates in A549 cells after 24 h of aphidicolin treatment (0.2 µM). FAT1 wild-type or knockout cells were transiently depleted of YAP1 using RNAi. For the mitotic error analysis, statistical significance was determined by one-way ANOVA with Holm–Šidák multiple correction and the data represent means ± s.d. (biological repeats: n = 3 (FAT1 WT) and n = 4 (FAT1 knockout)). For the DAPI bridge and Fanconi anaemia complementation group D2 (FANCD2)-flanked DAPI bridge analyses, the red lines represent mean values, the boxes represent interquartile ranges and the ranges of the whiskers denote 1.5× the interquartile range, and statistical significance was determined by Dunn’s test with Bonferroni correction. Over 100 mitotic cells were scored across three biological replicates. Scale bars, 5 µM. f, Results of live-cell imaging analysis, showing that FAT1/YAP1 double siRNA knockdown in TERT–RPE-1 cells fully rescued the failed cytokinesis and WGD introduced by FAT1 knockdown (left) but only partially ameliorated the nuclear shape deformation (right). Statistical significance was determined by one-way ANOVA. At least five biological replicates were quantified per condition. Biological repeats: n = 4 (siCTRL), n = 7 (siFAT1) and n = 5 (siFAT1 + siYAP1). The data represent means ± s.e.m. g, Histogram illustrating the WGD populations in TP53 wild-type versus knockout RPE-1 cells with or without transient mScarlet–YAP5SA transfection. The data represent means ± s.e.m. Statistical significance was determined by two-sided Mann–Whitney test (n = 4 biological repeats). h, Histograms illustrating the total (left) and normalized (right) EdU+ WGD populations in FAT1 wild-type versus knockout PC9 cells, with or without transient mScarlet–YAP5SA transfection. The data represent means ± s.e.m. (n = 5 biological repeats). Statistical significance was determined by repeated measures one-way ANOVA with Benjamini–Hochberg correction (left) or Friedman test (right). KO, knockout.

Source data

Next, we investigated whether FAT1/YAP1 co-depletion might rescue the CIN phenotypes associated with FAT1 loss. Live-cell imaging experiments demonstrated that co-depletion of FAT1 and YAP1 reversed the cytokinesis failure phenotypes (Fig. 8f (left) and Extended Data Fig. 10g). However, the nuclear shape deformation observed following FAT1 depletion was only partially rescued (Fig. 8f, right).

Taken together, our observations suggest that FAT1 possesses dual roles. One outcome of FAT1 loss is HRD leading to unresolved recombination intermediates, replication stress, structural CIN and nuclear deformation (Figs. 35 and 8a–d); these events appear to be YAP1 independent. In contrast, hyperactive YAP1 leads to increased cytokinesis failure and WGD (Fig. 8f). To visualize the impact of constitutively active mScarlet–YAP15SA overexpression on WGD, we performed an EdU incorporation assay. Complementary to the literature56, mScarlet–YAP15SA overexpression promoted WGD in RPE-1 cells and this was independent of TP53 status (Fig. 8g and Supplementary Fig. 10). Concurrently, we observed an emerging WGD population in FAT1 wild-type and FAT1-knockout PC9 cells (Fig. 8h). These results suggest that constitutively active YAP1 is sufficient to promote WGD. We postulate that FAT1 loss might drive genome instability through two different routes—one through WGD, dependent on YAP1, and the second through HRD, driving structural CIN—in a TEAD/YAP1-independent manner.

Discussion

We systematically analysed 37 NSCLC drivers previously reported in the lung TRACERx study3 that are associated with CIN and WGD. Particularly, we show that depletion of either BAP1, FAT1, NCOA6, RAD21 or UBR5 is associated with downregulation of multiple key steps required for HR repair and elevated loss of a HAC, thereby confirming previous reports of the roles of BAP1, RAD21 and UBR5 in genome maintenance57,58,59. In addition, our DDR screen confirms previous reports characterizing the role of WRN, FANCM, DICER1, SMARCA4/BRG1, ARID1B, ARID2, KDM5C and ATRX in the DDR21,22,23,24,25,26. Despite FAT1 being frequently altered in normal somatic skin cells60 and in multiple cancers61,62,63, as well as being associated with NSCLC mortality2, the mechanistic links between FAT1 alterations, DDR and both numerical and structural CIN have remained elusive.

Through our comprehensive analyses, we have demonstrated that FAT1 ablation impacts HR efficiency, but not NHEJ efficiency, nor the early signalling events of the DDR. FAT1 loss results in hallmarks of HRD, such as increased sensitivity to replication stress inducers, elevated chromosome translocation, elevated fork collapse rates and increased HRD-predictive genetic scars in NSCLC. Our data suggest that FAT1 loss leads to end-resection deficiency after the chromatin remodelling step to initiate HR. The inability to accurately resolve replication stress or clastogenic breaks may lead to chromosome-level alterations36, especially structural CIN.

FAT1 alterations have been detected in normal somatic tissue60 and we hypothesize that FAT1 inactivation occurs early in tumorigenesis. Using the multiregional sequencing of the TRACERx study to time the occurrence of FAT1 alterations, we observed evidence of positive selection for tumour subclones with FAT1 alterations before WGD. Indeed, FAT1 depletion attenuates HR and causes unresolved replication stress, both of which contribute to mitotic errors and micronucleus formation39. Using live-cell microscopy, we demonstrated that FAT1 depletion not only elevates structural CIN, such as mitotic bridges, but also increases subsequent nuclear shape deformation.

FAT1 has been proposed to be tumour suppressive and involved in various signalling pathways, including the CaMK2, WNT and Hippo signalling pathways48,49,50,62. Components of the Hippo pathway have been reported to impact various aspects of genome maintenance, such as mitotic control, stabilization of replication forks, and modulation of nuclear shape, which can affect three-dimensional genome organization56,64,65,66. We confirmed that depletion of LATS1 or LATS2 can significantly affect HR efficiency and contribute to CIN64. We discovered that FAT1—at least in part through its intracellular C-terminal domain—is involved in the DDR and represses CIN through the modulation of YAP1, whereas its interaction with YAP1 is dependent on the interaction between FAT1 and the E3 ligase MIB2 (ref. 53). Notably, despite being able to reconstitute HR-related functions, overexpression of the HA–FAT1ICD construct, devoid of the transmembrane domain, failed to re-repress TEAD transcriptional activity in FAT1-knockout cells. In line with previous reports that HRD results in mitotic defects and structural CIN39,55, FAT1/YAP1 co-depletion failed to rescue both the HRD-related phenotypes and the formation of chromosomal bridges, suggesting that FAT1 loss contributes to HRD and structural CIN in a YAP1-independent manner. It has been reported that FAT1 can interact with MST1 and LATS1 (ref. 49), which can influence HR efficiency through their interaction with ATR and RASSF1A to control BRCA2 recruitment at DNA damage sites64,67.

Conversely, FAT1/YAP1 co-depletion rescued both cytokinesis failure and WGD. Strikingly, overexpression of constitutively active mScarlet–YAP15SA was sufficient to drive WGD and increased numerical CIN. Taken together, our data reveal that FAT1 alterations may contribute to HRD, CIN and WGD through two distinct mechanisms (Extended Data Fig. 10h).

Multiple studies have suggested that WGD accelerates cancer evolution and promotes intratumour heterogeneity, which promotes targeted therapy resistance6,12,46,47. Here, using PC9 cells that are sensitive to osimertinib as an experimental model, we illustrate that FAT1 loss, through elevated genomic instability, may provide the evolutionary advantages that fuel targeted therapy resistance. Taken together, genomic observations and experimental results suggest a role for early positive selection of FAT1 alterations in LUSC tumorigenesis, potentially by driving cancer evolution and WGD through elevated CIN.

Recent studies have also identified that the YAP/TEAD signalling axis promotes therapy resistance68. In addition to FAT1, three other genes identified through our DDR and CIN screens—namely RAD21, BAP1 and NCOA6—have also been demonstrated to modulate the Hippo pathway69,70,71. Although YAP1 is not amplified in the TRACERx dataset, amplification of the WWTR1/TAZ genomic locus (3q25.1) and loss of the LATS2 genomic locus (13q12) are prevalent in LUSC3,51. Consistent with other studies61,68, our data highlight the importance of FAT1 and Hippo pathway dysregulation in genome instability, targeted therapy resistance and cancer evolution.

Methods

Ethical approval

The TRACERx study (NCT01888601; Clinicaltrials.gov) is sponsored by University College London (UCL/12/0279) and has been approved by an independent Research Ethics Committee (13/LO/1546). TRACERx is funded by Cancer Research UK (CRUK; C11496/A17786) and coordinated through the CRUK and University College London Cancer Trials Centre.

Cell line, cell culture, transfection and CRISPR knockout generation

H1944, H1650, H1792, HCC4006, A549 (CCL-185) and U2OS (HTB-96) cells were obtained from Cell Services at the Francis Crick Institute. The FUCCI–H2B-mTurquoise–RPE-1 cells have been described previously44 and were a kind gift from J. Diffley (Francis Crick Institute). The HA–ER–AsiSI–U2OS cells have been described previously28 and were a kind gift from G. Legube (Paul Sabatier University). The ER–KRAS–V12–T2P cells (referred to as T2P) have been described previously38 and were a kind gift from J. Downward (Francis Crick Institute). The TP53 wild-type and TP53-knockout RPE-1 cells have been described previously74. The HCT116 iRFP cell lines have been described previously75 and were a kind gift from K. Vousden (Francis Crick Institute). All of the cell lines were maintained in either McCoy’s 5A Medium, Dulbecco Modified Eagle′s Medium (DMEM) or RPMI 1640 Medium (all from Thermo Fisher Scientific) fortified with 10% foetal bovine serum in the presence of 1% penicillin–streptomycin (Thermo Fisher Scientific). Cells were grown at 37 °C under 5% CO2. All cell lines used were negative for Mycoplasma contamination and are frequently tested in-house at the Francis Crick Institute. DNA was transfected using GeneJuice (EMD Millipore) according to the manufacturer’s instructions.

CRISPR knockout cell lines were generated using a single-cell cloning approach. To minimize the off-target effect, CRISPR plasmids and guides were transiently transfected using GeneJuice (EMD Milipore) and underwent puromycin selection for 5 days before single-cell sorting to generate knockout clones. Single-cell clones were then cultured, validated and frozen. A549 clones were generated with two individual guides (sgFAT1#1 (AAACCCGGGAAGTCGAAGTCCTTGC) and sgFAT1#2 (AAACACGCTGGATGTGTAATGTAAC)), whereas PC9 clones were generated with both guides. The sgNT sequence as follows: GCGAGGTATTCGGCTCCGCG.

RNAi

A list of the siRNAs used is provided in Supplementary Table 1. Dharmacon ON-TARGETplus siRNA pools (Horizon Discovery) were used for high-content screening, HAC assay and RAD51 foci experiments in H1994 cells (Extended Data Figs. 1 and 2b,c and Supplementary Fig. 4). Ambion Silencer Select siRNA (Thermo Fisher Scientific) was used for all of the other RNAi experiments, except siRNA against MRE11A (L-009271-00-0005; Dharmacon; Fig. 8a). siRNA transfection was carried out using Lullaby reagent (OZ Biosciences) or Lipofectamine RNAiMAX Transfection Reagent (Thermo Fisher Scientific) according to the manufacturers’ instructions.

ssDNA resection and translocation assays

ssDNA resection assays were carried out as described previously28. Briefly, DIvA HA–ER–AsiSI–U2OS cells were plated overnight and siRNAs were transfected as described previously. Samples were subjected to a 4 h incubation in 300 nM 4-hydroxytamoxifen. Genomic DNA was extracted using the DNeasy Blood & Tissue kit (69504; Qiagen). For every 500 ng genomic DNA used, five units of RNase H1 (M0297; New England Biolabs) were added at 37 °C for 15 min. In vitro, restriction digestion with the BanI restriction enzyme was performed to assay for the presence of ssDNA around break sites. Then, 200 ng of samples were digested with 16 units of BanI at 37 °C for 12 h (New England Biolabs).

The following primers were used for the assay: 131-nucleotide forward (ACCATGAACGTGTTCCGAAT), 131-nucleotide reverse (GAGCTCCGCAAAGTTTCAA), 851-nucleotide forward (ACAGATCCAGAGCCACGAAA) and 851-nucleotide reverse (CCCACTCTCAGCCTTCTCAG),

The percentage of ssDNA generated by DNA resection was determined by quantitative PCR. ΔCT was defined as the difference in average cycles between a given digested sample and its undigested counterpart. To calculate the percentage of ssDNA, the following equation was used:

$${\rm{Percentage}}\,{\rm{of}} \,{\rm{ssDNA}}=1/[{2}^{(\Delta {\rm{CT}}-1)}+0.5]\times 100.$$

For the semi-quantitative DNA translocation assay40, 150 ng genomic DNA was used to perform the PCR reactions using the following primers: TRIM37 forward (AATTCGCAAACACCAACCGT), TRIM37 reverse (TCTGAAGTCTGCGCTTTCCA), MIS12 forward (GACTGGCATAAGCGTCTTCG), control chr1_82844750 forward (AGCACATGGGATTTTGCAGG) and control chr1_82844992 reverse (TTCCCTCCTTTGTGTCACCA).

Legitimate and illegitimate re-joining frequencies between MIS12 and TRIM37 (chr17_5390209 and chr17_57184285) were tested by PCR. The uncut site at chromosome 1 was used as a negative control.

Plasmid construction

A gene fragment containing the C-terminal intracellular domain of FAT1 transcript (NM_005245.4) was ordered from Integrated DNA Technologies. The gene fragment was subsequently subcloned into the pCMV-SPORT6 expression plasmid containing a His-HA tag at the amino (N) terminus of the open reading frame. To generate the mScarlet–HA–FAT1WT construct, gene fragments containing the mScarlet sequence and FAT1 sequences were ordered and ligated into the HA–FAT1ICD construct using NEBuilder HiFi DNA Assembly (New England Biolabs). Similarly, deletion of the MIB2 binding domain and generation of the HA–FAT1WT construct were achieved using NEBuilder HiFi DNA Assembly (New England Biolabs).

The TEAD transcriptional reporter comprises a multimerized TEAD-binding sequence (14xTBS), based on the sequence described by Schlegelmilch et al.76, upstream of a minimal promoter and a nuclear localization signal in frame with four copies of mNeonGreen followed by a Myc-tag. The sequence components were assembled by GeneArt (Thermo Fisher Scientific) in a pDONOR221 backbone. The final construct was obtained by cloning the insert into pLenti X1 Zeo DEST using Gateway technology (Thermo Fisher Scientific). The YAP5SA construct was obtained from Addgene (plasmid 27371)77 and subcloned into a pCMV-SPORT6 expression plasmid containing a His-HA tag at the N terminus of the open reading frame. The His-HA-eIF4A1 plasmid was a kind gift from M. Bushell (CRUK Scotland Institute) and was documented by Meijer et al.78.

All of the plasmids used in this study were then sequenced using Plasmidsaurus or Full Circle sequencing. The relevant sequences are included in Supplementary Information.

DNA fibre assay

DNA fibre assays were performed as described in ref. 55. A549 and PC9 cells were incubated with 5-chloro-2′-deoxyuridine for 20 min, challenged with replication stress induced by 2 mM hydroxyurea incubation for 2 h and then labelled with 5-iodo-2′-deoxyuridine for 20 min.

MCM7 loading assay

PC9 or RPE-1 cells were trypsinized and treated with modified CSK buffer (10 mM HEPES (pH 7.9), 100 mM NaCl, 3 mM MgCl2, 1 mM EGTA, 300 mM sucrose, 1% bovine serum albumin (BSA), 0.2% Triton X-100 and 1 mM dithiothreitol) on ice for 5 min to remove soluble proteins. Cells were washed with 1% BSA in phosphate-buffered saline (PBS) and fixed in 4% paraformaldehyde (Thermo Fisher Scientific) for 10 min. Cells were then washed, pelleted and permeabilized in 70% ethanol for 15 min. MCM7 staining was performed using a mouse anti-MCM7 antibody (sc-56324; Santa Cruz Biotechnology) followed by an Alexa Fluor 594 goat anti-mouse antibody (A11007; Invitrogen). DNA content was stained using 1 µg ml−1 4′,6-diamidino-2-phenylindole (DAPI) supplemented with 100 µg ml−1 RNase A. Cell doublets were excluded from all of the analyses.

Metaphase spreads

To assess numerical CIN, transfected cells were incubated with KaryoMAX Colcemid at 0.1 μg ml−1 for 1 h, collected and incubated for 7 min at 37 °C in pre-warmed hypotonic solution (75 mM KCl). Cells were pelleted and repeatedly resuspended in freshly prepared fixation buffer (3:1 methanol:glacial acetic acid) and spun three times, after which the cells were dropped onto glass slides. For hybridization, the slides were ethanol dehydrated and subsequently incubated overnight at 37 °C in a humidified chamber with All Human centromeric probes (KBI-20000G; Leica) according to the manufacturer’s instructions. Metaphase chromosome images were visualized using a 100× oil immersion objective on an Image Solutions DeltaVision microscope.

To assess chromosomal breakage and aberrations, metaphase spreads were prepared as previously described79. Briefly, 250,000 wild-type or FAT1-knockout cells were seeded in 6 cm dishes for one day before treatment with 0.2 µM aphidicolin for 24 h. Cells were treated with 0.2 µg ml−1 colcemid for 1 h at 37 °C before collection by trypsinization and incubation with 5 ml pre-warmed hypotonic swelling buffer (75 mM KCl) in 15 ml tubes at 37 °C for 10 min. 1 ml freshly prepared fixation buffer (3:1 methanol:acetic acid) was added to the 15 ml tube and mixed by inversion. Cells were pelleted by centrifugation at 300g for 4 min at room temperature. About 500 µl supernatant was left in the tubes and the cells were slowly resuspended before the addition of 5 ml fixation buffer, mixing and incubation for 10 min at room temperature. The fixation step was repeated by pelleting the cells, resuspension and the addition of fixation buffer, as above. To spread the metaphases, the fixed cells were dropped on glass slides at a 45° angle from a 30 cm height. Slides were left to air dry before staining with Giemsa (7% in 10 mM PIPES (pH 6.8)) for 10 min at room temperature, then mounted with DPX. Metaphase chromosome images were visualized using a 100× oil immersion objective on an upright Zeiss Axio Imager fluorescence microscope equipped with Volocity software.

ImageStream fluorescence in situ hybridization

Transfected cells were processed for fluorescence in situ hybridization in suspension, as described elsewhere80. Briefly, following transfection, cells were harvested and fixed with freshly prepared 3:1 methanol:glacial acetic acid. Cells were subsequently hybridized with chromosome 15 satellite enumeration probe (LPE015G; CytoCell) performed in a thermocycler under the following conditions: 65 °C (2 h pre-annealing), 80 °C (5 min) and 37 °C (16 h) before analysis on an ImageStream X Mark II (Amnis).

Definition of FAT1 driver mutation in the TRACERx 421 cohort

FAT1 driver mutations in the TRACERx 421 cohort were defined as described in ref. 81. Briefly, FAT1 non-synonymous variants that were found to be deleterious (either stop–gain or predicted deleterious in two of the three computational approaches applied (that is, Sift82, Polyphen83 and MutationTaster81)) were classified as a driver mutation. Any tumour with any such mutations in FAT1 was considered to have a FAT1 loss.

Determination of WGD events in TRACERx 421

The WGD status for each tumour was estimated in two steps, as described in ref. 81. Briefly, if the genome-wide copy number of the major allele was ≥2 across at least 50% of the genome, this was assumed to reflect a WGD event13,84. A major allele copy number ≥3 across at least 50% genome was assumed to reflect two WGD events. Then, we leveraged additional information from the estimated copy number of mutations using a novel tool, ParallelGDDetect, available as an R package (https://github.com/amf71/ParallelGDDetect).

TRACERx 421 cohort

The TRACERx 421 cohort consisted of 233 males and 188 females (421 patients in total), corresponding to a 55:45 male:female ratio. 93% of the cohort was from a White ethnic background and the mean age of the patients was 69 years, ranging between 34 and 92 years. Written informed consent was obtained. None of the patients were compensated for their involvement in the study.

Mutation clonality in TRACERx 421

Reconstructed phylogenetic trees were used to classify mutations based on their inferred phylogenetic cancer cell fraction (phyloCCF)81. We classified as clonal in each tumour region every cluster whose 95% confidence interval of the phyloCCF of its mutations overlapped with the 95% confidence interval of the phyloCCF of the mutations in the mutation cluster within the trunk node of the tree (a minimum threshold of 0.9 was used for the left side of the 95% confidence interval on truncal mutations). Second, we defined as subclonal every mutation cluster in a tumour region whose mean phyloCCF across the corresponding mutations in that region was greater than 0 and not clonal (that is, the mutation cluster did not pass the previous tests). Lastly, any remaining mutation cluster was defined as absent in a tumour region otherwise. Furthermore, clonal mutations were defined as early or late depending on whether they occurred before or after WGD. A clonal mutation copy number consistent with the WGD ploidy was therefore considered early, or late, otherwise.

dN/dS analyses

The dN/dS point mutation estimate for FAT1 was calculated using the dndscv and geneci functions in the dNdScv29 R package. dndscv was run on different subsets of mutations (clonal early versus clonal late) from different subsets of samples (LUAD versus LUSC). The 95% confidence interval of the dN/dS estimate was obtained using the geneci function.

Enrichment analyses

All enrichment analyses were performed by counting the tumours in each category (for example, LUAD versus LUSC, WGD versus no WGD or early clonal versus late clonal) and performing a chi-squared test.

SCNA detection and MSAI

The identification of genome-wide allele-specific copy number states for multiregion WES is described in ref. 81. Briefly, logR data were calculated using VarScan 2, GC corrected using a wave-pattern GC correction method developed by Cheng and colleagues85 and processed using ASCAT (version 2.3)86. Sequenza (version 2.1.2)87 was utilized to offer supplementary assessments of tumour purity and ploidy for further examination. ASCAT was presented with the automatically selected models for ploidy and purity from either Sequenza or ASCAT, which were manually reviewed to generate SCNA profiles for each tumour area. Samples that had an estimated tumour purity below 10% were excluded. The data on ploidy, purity and copy number segmentation were fed into a multi-sample SCNA estimation approach51 to create a consistently minimal segmentation and genome-wide evaluation of LOH, as well as loss, gain and amplification copy number states in relation to sample ploidy.

The input SCNA profiles were used to identify allelic imbalance, which was subsequently employed to phase heterozygous single-nucleotide polymorphisms and re-estimate allele-specific copy numbers. Furthermore, MSAI, which occurs when SCNAs disrupt the same genomic region but affect different parental alleles within distinct tumour subclones, was detected as previously described57. We identified a subset of these MSAI events as parallel SCNA events that refer to the same class of event (gain/amplification or loss/LOH) in multiple samples from a given tumour but with major alleles from distinct haplotypes in the samples demonstrating the event. We then quantified the proportion of tumours where MSAI events occurred per cytoband, providing an empirical P value for cytoband 4q35.2.

EdU incorporation assay and cyclin B loading assay

The Click-iT Plus EdU Alexa Fluor 647 Flow Cytometry Assay kit (Thermo Fisher Scientific) was used for EdU incorporation assays. Cells were incubated with 10 μM EdU for 30 min before being fixed and processed per the manufacturer’s instructions. Cyclin B1 antibody was obtained from Abcam (ab32053; Abcam). DNA was stained in staining buffer (1 µg ml−1 DAPI and 100 µg ml−1 RNase A in PBS). Cell doublets were excluded from all of the analyses.

Microscopy and DNA damage foci studies

DDR foci and DNA fibre assays

The preparation of slides for DDR immunofluorescence foci studies was carried out as described in ref. 88. Briefly, cells were pre-seeded on glass coverslips, subjected to Cs-137 ionizing irradiation and allowed to recover. Cells were then washed once with PBS and pre-extracted with CSK buffer (100 mM NaCl, 300 mM sucrose, 3 mM MgCl2, 10 mM PIPES (pH 6.8), 10 mM β-glycerol phosphate, 50 mM NaF, 1 mM EDTA, 1 mM EGTA, 5 mM Na3VO4 and 0.5% Triton X-100). Cells were then washed once with CSK buffer and fixed with 4% paraformaldehyde on ice for 20 min. Samples were then washed three times with 0.1% Tris-buffered saline (TBS)-Tween, blocked with 10% goat serum for 1 h, washed twice with 0.1% TBS-Tween and incubated with primary antibody overnight at 4 °C (see Supplementary Table 1). The samples were subsequently washed and incubated with secondary antibodies (Alexa Fluor 488/594/647; Invitrogen) with DAPI in 1% goat serum for 1 h at room temperature. Samples were then washed with 0.1% TBS-Tween and mounted with Vectashield (H1200; Vector Laboratories). The samples were then double-blinded for quantification.

Images were acquired using a Zeiss Axio Imager M1 microscope with a 40×/1.3 NA Plan-Neofluar or 63×/1.4 NA Plan-Apochromat objective, equipped with an ORCA-spark CMOS camera (Hamamatsu) and pE-300white LED light source (CoolLED) and controlled by Micro-Manager 2.0 software89. Images were processed and counted in an unbiased way using the FindFoci ImageJ plugin90.

Segregation errors

For segregation errors, cells were fixed and permeablized in PTEMF buffer (4% paraformaldehyde, 0.2% Triton X-100, 20 mM PIPES and 2 mM MgCl2) for 15 min at room temperature. The samples were washed three times in PBS, blocked for 1 h with 3% BSA with 0.2% Triton X-100 and incubated overnight with the relevant primary antibodies. The samples were subsequently washed and incubated with secondary antibodies (Alexa Fluor 488/594/647; Invitrogen) with DAPI in 1% goat serum for 1 h at room temperature. Samples were then washed with 0.1% TBS-Tween and mounted with Vectashield (H1200; Vector Laboratories). Images were acquired using a Nikon Ti2 microscope with a 60×/1.45 NA Plan Apo TIRF objective and 1.5× intermediate magnification, equipped with a Prime BSI sCMOS camera (Teledyne Photometrics), motorized XY stage with piezo Z axis (Applied Scientific Instrumentation) and Spectra X LED light engine (Lumencor) and controlled with Micro-Manager 2.0 software89. Z stacks were acquired (31 μm × 0.2 μm z steps) and deconvolved using Microvolution software with ten iterations.

Live-cell widefield microscopy

Live-cell widefield microscopy was performed using a Nikon Ti2 inverted microscope with a Perfect Focus System using 1.5× intermediate magnification, equipped with a Prime BSI sCMOS camera (Teledyne Photometrics), motorized XY stage with piezo Z axis (ASI) and Spectra X LED light engine (Lumencor) and controlled with Micro-Manager 2.0 software89. Environmental conditions were maintained at 37 °C under 5% CO2 using a chamber and CO2 mixer (Okolab).

For live-cell imaging of the mitotic failure rate, the FUCCI–H2B-mTurquoise–RPE-1 cells were grown and transfected for 72 h on glass-bottomed plates (IBL) in DMEM culture media, as described above. Before imaging, the growth media was replaced with FluoroBrite DMEM (Thermo Fisher Scientific) with 10% foetal bovine serum to improve the image quality. To acclimatize to potential temperature fluctuations, the cells were allowed to settle for 1 h before imaging. Phase-contrast and fluorescence images were acquired with 20×/0.75 NA Plan Apo Ph2 objective every 20 min for over 72 h. The images were analysed using Fiji and the cells were tracked manually.

For the mitotic timing experiments, FUCCI–H2B-mTurquoise–RPE-1 cells were set up as above but imaged with a 40×/0.95 NA Plan Apo objective. Locations with high-level mVenus–geminin-expressing G2 cells were identified and recorded. Z stacks (25 μm × 0.5 μm z steps) of H2B-mTurquoise were taken at 5 min intervals over 12 h. Images were deconvolved over ten iterations using Microvolution software.

Live-cell confocal microscopy

Live-cell confocal microscopy was performed using either: (1) a Yokogawa CSU-W1 spinning-disk confocal scanhead on a Nikon Ti2 inverted microscope with a 40×/1.15 NA Apo objective and Prime 95B sCMOS camera (Teledyne Photometrics), controlled with NIS-Elements, at 37 °C under 5% CO2 with an environmental chamber and gas mixer (Okolab); or (2) an NL5 slit-scanning confocal unit on a Nikon Eclipse Ti inverted microscope with a 40×/1.25 NA Apo objective and Quest qCMOS camera (Hamamatsu), controlled with Micro-Manager 2.0 software89 at 37 °C under 5% CO2 using a stage top incubator and gas mixer (Tokai Hit). Locations with high-level mVenus–geminin-expressing G2 cells were first identified and recorded. Z stacks (25 μm × 0.5 μm z steps) of H2B-mTurquoise were taken at 5 min intervals over 12 h.

For Extended Data Fig. 7h,i and Supplementary Videos 4 and 5, a higher resolution was required to delineate whether mitotic error precedes nuclear deformation. A mixture of widefield deconvolution microscopy (n = 28 cells (siCTRL) and n = 27 cells (siFAT1)) and confocal microscopy (n = 50 cells (siCTRL) and n = 27 cells (siFAT1)) techniques were utilized to obtain sufficient images for statistical analysis.

Statistics and reproducibility

No statistical methods were used to predetermine the sample size. The statistical test types and biological n numbers used are detailed in the relevant figure captions. For immunofluorescence data, the numbers of cells sampled per biological repeat are documented accordingly. All of the data points shown are independent samples. For the experiments using statistical tests that assume normal distributions, we assumed that the data distribution was normal, but this was not formally tested.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.