Introduction

Management of locally advanced rectal cancer (LARC) has shifted from the one-size-fits-all paradigm of long-course neoadjuvant chemoradiotherapy followed by total mesorectal excision (TME) and adjuvant systemic chemoth erapy to a plethora of new strategies aimed at anticipating, intensifying, deintensifying or omitting one or more of the three treatment modalities1,2,3,4,5.

Organ preservation is supported by an increasingly solid amount of evidence accumulated from retrospective case series over the years6,7,8,9,10, and recently also from prospective clinical trials. As an example, the Organ Preservation in Rectal Adenocarcinoma (OPRA) trial11 was a randomized phase II trial in which patients underwent total neoadjuvant therapy (TNT) in the form of induction chemotherapy followed by chemoradiotherapy (INCT-CRT) or chemoradiotherapy followed by consolidation chemotherapy (CRT-CNCT) and were subsequently offered, in case of complete or near-complete clinical response (cCR or ncCR), strict observation without rectum resection. At its most recent update, the rates of organ preservation at 5 years were 39% and 54% in the INCT-CRT and the CRT-CNCT cohorts, respectively12.

Watch and wait (WW) is now a standard option offered to rectal cancer patients with a cCR or ncCR to neoadjuvant therapy (NAT) in the United States13. Nevertheless, both patient selection and follow-up strategies are active areas of investigation for this approach. Assessment of cCR can be challenging14 and a substantial fraction of patients with an apparent clinical response after NAT experience regrowth during surveillance6,12. This happens more often in patients with ncCR and limits the organ preservation rate of this strategy. Moreover, while WW has been carried out at high-volume centers with high rectal cancer expertise and excellent resources for thorough and complete multimodal follow-up, doubts have been raised regarding broad implementation of a rectum-preserving approach, which requires important safeguards to ensure patient safety15. In this context, objective and precise noninvasive tools to improve patient selection based on radiological, endoscopic and clinical evaluation are urgently needed.

In parallel with the efforts to improve patient selection using endoscopy and MRI, liquid biopsy technologies have gained credibility over the past decade and are now being used for detection of minimal residual disease (MRD) in patients with LARC at some centers. Studies have demonstrated that detection of circulating tumor DNA (ctDNA) after NAT is associated with a higher risk of disease relapse and lower likelihood of a sustained clinical complete response (scCR)16,17,18,19,20,21. Moreover, baseline tumor fraction (TF) can be used as a proxy to estimate disease burden22 and has the potential to improve prediction of NAT outcomes over approaches based on routine clinical staging features alone23, which can be relevant for planning and optimizing the choice of the most appropriate NAT strategy. Despite this, evidence for the implementation of ctDNA in the WW strategy is scarce, with lack of adequate sensitivity remaining a critical limiting factor24,25.

In this context, the interest in using liquid biopsy data for MRD monitoring and the need for lowering the threshold for ctDNA detection led to the development of new assays covering the whole genome and expanding the range of detectable tumor alterations beyond point mutations and small insertions and deletions (indels)26,27,28,29. In this study, we evaluated the use of ctDNA assessment based on a highly sensitive, primary-tumor-informed, whole genome sequencing (WGS) assay26 to predict CR and disease recurrence in LARC patients from the OPRA trial.

Results

Patient selection

From April 2014 until the accrual target was reached in March 2020, 324 patients were randomized in the OPRA trial. Plasma samples for ctDNA analyses were collected from 76 patients enrolled at MSKCC. Solid tumor specimens collected pre-treatment were available for 46 of those patients, and matched blood specimens were available for 32 patients. One of these 32 pairs of solid tumor and matched normal failed quality control protocols when sent for WGS, which led to the final set of 31 patients analyzed in our study (Supplementary Fig. 1). These 31 patients had plasma samples collected prospectively at baseline (P0, n = 21), during TNT (P1, n = 28), at restaging after TNT (P2, n = 26), and at various timepoints during follow-up (F1–F9, n = 102, Fig. 1). The baseline characteristic of the 31 patients included in this study did not differ significantly from the rest of the OPRA cohort, except for a slightly smaller median tumor size (4.0 vs 4.7 cm, p = 0.003) (Table 1); importantly, there were no significant differences between the two cohorts in terms of TME-free survival and disease-free survival (DFS) (p = 0.510 and p = 0.810, respectively).

Fig. 1: Study design and overview of computational methods for ctDNA analysis.
figure 1

We analyzed data from 31 LARC patients treated with NAT. Patients were randomized to receive either INCT-CRT or CRT-CNCT. Patients with a cCR or nCR were enrolled in WW, while the rest underwent TME. Longitudinal WGS data of solid tumors and matched plasma samples were analyzed for ultrasensitive detection and quantification of ctDNA. CRT-CNCT chemoradiation followed by consolidation chemotherapy, ctDNA circulating tumor DNA, cCR clinical complete response, nCR clinical near complete response, INCT-CRT induction chemotherapy followed by chemoradiation, WW watch and wait, TME total mesorectal excision.

Table 1 Relevant baseline clinical features in the study cohort and in the rest of the OPRA trial cohort

All 31 patients received TNT in the form of either INCT-CRT or CRT-CNCT. There were no differences in baseline patients and tumor characteristics between the two arms, except for a higher rate of extramural venous invasion in the INCT-CRT group (26.7% vs. 0%, p = 0.043) (Supplementary Table 1). All patients completed planned chemoradiotherapy cycles, while the overall completion rate of induction/consolidation chemotherapy was 94%.

Patients were restaged after a median time of 8 weeks following completion of TNT, at which 24 patients had a cCR (n = 14) or nCR (n = 10) and were offered the option of WW per protocol (Fig. 2). Of those 24 patients offered WW, 6 (25%) developed local tumor regrowth during surveillance and required TME; 4 of them developed local recurrence, distant metastasis or both, and 2 remain without evidence of disease. The remaining 18 patients did not experience primary tumor regrowth; nevertheless, 2 patients developed distant metastases during surveillance. In total, 16 (52%) of 31 patients never developed tumor regrowth, locoregional recurrence or distant metastasis at the end of follow-up and were defined as having a scCR. By contrast, of the 7 patients initially treated with TME, one developed distant metastasis, 3 developed local recurrence and 3 remained without evidence of disease until the last available follow-up. In total, after a median follow-up of 6.52 years, ten patients experienced disease recurrence.

Fig. 2: Timeline of ctDNA detection results and treatment outcomes.
figure 2

Relevant clinical data and results from ctDNA assessments are summarized using a swimmer plot. The circles represent the available liquid biopsies and are colored in blue, green, light brown or dark brown according to the detection of ctDNA and the respective tumor fraction. Y-axis reports anonymized patients ID. CRT-CNCT chemoradiation followed by consolidation chemotherapy, ctDNA circulating tumor DNA, scCR sustained clinical complete response, DFS disease-free survival, FUP follow-up, INCT-CRT induction chemotherapy followed by chemoradiation, WW watch and wait, TME total mesorectal excision, TF tumor fraction.

As in the OPRA trial, the CRT-CNCT group and the INCT-CRT group did not differ significantly in terms of DFS, overall survival, local recurrence-free survival, or distant metastasis-free survival (Supplementary Fig. 2).

ctDNA and TF at baseline

Baseline plasma samples (P0 timepoint) were available for 21 of the 31 patients (Fig. 2); ctDNA was detected in plasma samples of 20 patients (95%; 95% confidence interval, 74–100%), with a median TF of 0.081% (interquartile range [IQR], 0.014–0.158%) (Fig. 3A). TF detected at P0 was positively correlated with tumor size (ρ = 0.76, P < 0.001) (Fig. 3B) and with the presence of locoregional nodal metastasis at baseline MRI (median, 0.12% vs 0.018%; P = 0.014) (Fig. 3C). No other associations between baseline TF and clinical or genomic features were statistically significant (Supplementary Table 2).

Fig. 3: Assessment of baseline ctDNA.
figure 3

A Heatmap showing (from top to bottom): the TF of the baseline P0 liquid biopsy (before the initiation of the NAT) in log10 scale, relevant baseline clinical features of the patients and alterations of TP53, APC, KRAS and BRAF from primary tumor samples acquired before the initiation of the neoadjuvant treatment; each column represents a patient, columns are ordered according to the P0 TF. B Scatterplot showing the primary tumor size (x-axis) and the TF of the baseline P0 liquid biopsy in log10 scale; each dot is a patient and is colored according to the clinical N staging; the dashed line parameters were calculated using linear regression, the p-value was calculated using Spearman correlation; for graphic purposes, a sample with undetected ctDNA was plotted as TF = 0.001%. C Boxplot representing the baseline P0 liquid biopsy TF in log10 scale according to the radiological suspicion of lymph nodes metastasis; the p-value was calculated with Wilcoxon signed-rank test; for graphic purposes, a sample with undetected ctDNA was plotted as TF = 0.001%. D Proportion of patients with scCR depending on the detection of ctDNA and the respective TF at the P0 timepoint. AV distance distance from anal verge, cN clinical nodal classification, cT clinical tumor classification, ctDNA circulating tumor DNA, scCR sustained clinical complete response, EMVI Extramural vascular invasion, T size primary tumor size at baseline according to radiological measurement, scCR sustained clinical complete response, TMB tumor mutational burden, TF tumor fraction, WGD whole genome doubling.

Next, we investigated whether the disease burden represented by baseline TF was predictive of treatment outcomes. Using exploratory TF cutoffs of 0.1% and 0.01%, we found that all 4 patients (100%) with a TF < 0.01% had a scCR, compared to 5 of 7 patients (71%) with a TF between 0.01 and 0.1%, and in 2 of 10 (20%) patients with a TF > 0.1% (Fig. 3D). Baseline TF modeled as a continuous numerical variable was significantly associated with scCR in univariate logistic regression (P = 0.048) (Supplementary Table 3), with an AUC for scCR prediction of 0.809 (95% confidence interval, 0.609–1.000) (Supplementary Fig. 3A).

ctDNA and TF during NAT and at restaging after NAT

We next investigated ctDNA assessment at time points P1 (right after the completion of the first component of NAT) and P2 (restaging after NAT). The median time between the end of the first component of NAT (INCT in the INCT-CRT group and CRT in the CRT-CNCT group) and P1 was 3.8 weeks (IQR, 3.0–4.5 weeks). The median time from the end of NAT to P2 was 8.0 weeks (IQR, 6.1–9.3 weeks). All patients had a P1 or a P2 liquid biopsy available, or both (Fig. 2). ctDNA was detected in 12 out of 28 patients (43%) at P1 and 17 out 26 patients (65%) at P2, with no statistically significant difference between CRT-CNCT patients and INCT-CRT patients.

To identify potential patterns of delayed response or early resistance, we stratified the cohort according to the higher of the P1 and P2 TFs, and we used the exploratory cutoffs of 0.01% and 0.1% to compare their response rates. Four (80%) of the 5 patients with no TF detected at P1 and P2 had a scCR, compared to 8 (62%) of the 13 patients with TF < 0.01%, 4 (36%) of the 11 patients with TF between 0.01% and 0.1%, and neither of the 2 patients with TF ≥ 0.1% (Fig. 4A). The higher of the P1 and P2 TFs, modeled as a continuous variable, was linearly associated with a lower likelihood of scCR (OR, 0.10; 95% confidence interval, 0.01-0.48); logistic regression P = 0.018) (Supplementary Table 4 and Supplementary Fig. 3B) and a poorer DFS (HR, 4.20; 95% confidence interval, 1.87–9.43; Cox regression P < 0.001; Harrell’s C-index, 0.792) (Fig. 4B and Supplementary Table 5).

Fig. 4: ctDNA assessment during the TNT and the follow-up.
figure 4

A Proportion of patients with scCR depending on the detection of ctDNA and the respective TF at the P1 and P2 timepoints; the highest available value between P1 and P2 was used for this purpose. B Kaplan–Meier curves depicting DFS depending on the detection of ctDNA and the respective TF at the P1 and P2 timepoints; DFS was measured starting from the moment of the liquid biopsy blood withdrawal; the highest available value between P1 and P2 was used for this purpose; C Proportion of patients with scCR depending on the detection of ctDNA and the respective TF at the F1 timepoint; for this panel, only patients proposed for WW observation were considered; D Kaplan–Meier curves depicting DFS depending on the detection of ctDNA and the respective TF at the F1 timepoint; DFS was measured starting from the moment of the liquid biopsy blood withdrawal; E TF across all available timepoints; each line is a patient, colored depending on the presence of disease relapse; the dashed line represent the 0.01% cut-off; F TF of all follow-up timepoints depending on the disease relapse timing; the p-value was calculated using Wilcoxon signed-rank test; for graphic purposes, a sample with undetected ctDNA was plotted as TF = 0.001%. ctDNA circulating tumor DNA, DFS disease-free survival, scCR sustained clinical complete response.

Detectable ctDNA at P1 was associated with a lower likelihood of scCR (16.7% vs 75.0%; P = 0.006) and a higher risk of disease relapse (hazard ratio [HR], 6.47; 95% confidence interval, 1.33–31.41; P = 0.020) (Supplementary Fig. 4A, B). However, detectable ctDNA at P2 was not associated with either scCR likelihood in WW candidates (71.4% vs 60.0%; P = 1.000) or risk of disease relapse (HR, 1.63; 95% confidence interval, 0.61–8.07; P = 0.552) (Supplementary Fig 4C, D).

In exploring the counterintuitive finding of an association at P1 and no association at P2, we observed that 12 (80%) of the 15 patients with no detectable ctDNA at P1 had detectable ctDNA at P2 (Supplementary Fig. 5A) and that only 2 (17%) of the 12 experienced disease relapse. Compared with the other patients, the 12 P1-negative P2-positive patients had lower TFs at baseline (median 0.017% vs 0.13%, P = 0.018) (Supplementary Fig. 5B), possibly suggesting that a phenotype with lower ctDNA shedding might have affected the specificity of the assay at P2. Low specificity was also suggested by the association of detectable ctDNA at P2 with higher risk of disease recurrence when the TF threshold for presence of MRD was set at 0.01% (HR, 5.09; 95% confidence interval, 1.20–21.55; P = 0.027) (Supplementary Fig. 5C).

ctDNA and TF during follow-up

At least one follow-up liquid biopsy (n = 102) was available for 22 patients. The median time between P2 and the first follow-up (F1) ctDNA assessment was 14.0 weeks (IQR, 12.3–18.1 weeks). The mean number of liquid biopsies per patient was 4.9 (range 1–9). Half of the patients had samples available beyond 3 years from the end of NAT.

High TF at F1, modeled as a continuous variable, was associated with higher risk of disease relapse (Cox regression P = 0.009), which occurred in all 3 patients with TF > 0.01% but in only 1 of 5 patients with TF < 0.01% and in none of the 14 patients with undetectable ctDNA (Fig. 4C). We noted a trend for negative association between TF at F1 and scCR (logistic regression P = 0.062) (Fig. 4D). An increase in TF between P2 and F1 was associated with poorer DFS (log rank P = 0.002) (Supplementary Fig. 6A) and a trend toward a lower likelihood of scCR (40.0% vs 91.6%; P = 0.053) (Supplementary Fig. 6B).

With inclusion of follow-up liquid biopsies at timepoints after F1, the sensitivity of detection of ctDNA at any time point for predicting disease recurrence was 100%, but specificity was 27.8%, resulting in a positive predictive value of 23.5%. However, the median TF of follow-up liquid biopsies performed after tumor recurrence was significantly higher than the median TF of follow-up biopsies performed in the absence of tumor recurrence (P < 0.001) (Fig. 4E, F).

In 4 patients undergoing WW who did not experience neither disease recurrence nor local regrowth, the last ctDNA TF > 0.1% was obtained more than 4 years after the end of NAT. Intriguingly, in selected patients with follow-up data available at multiple timepoints, we observed a paired fluctuation of TF and carcinoembryonic antigen (CEA), both in patients who did not experience a disease relapse and had low TF and clinically nonrelevant CEA values (patients 2, 5 and 19 in Supplementary Fig. 7) and in patients who later developed metastatic disease and had the highest CEA and TF (patients 11 and 29 in Supplementary Fig. 7).

Additional insights from WGS of solid tissue specimens

Besides providing information for ultrasensitive detection of ctDNA, WGS from solid tissue offers additional insights into tumor etiology and tumor evolution throughout treatment that can be translationally relevant. As an example, we used these data to analyze mutational signatures from the COSMIC catalog30 (Fig. 5A, B). After removing potential FFPE-related sequencing artifacts that could introduce noise in our analysis, we identified the presence of three main signatures within this cohort. The mitotic clock, age-related SBS1/SBS40 signatures were consistently detected across all samples. SBS18, which has been associated with damage by reactive oxygen species, was detected in 17/31 (54%) cases. Finally, a colibactin (Escherichia coli exposure)-associated signature (SBS88) was identified in 7/31 (23%) of the specimens collected at baseline (Fig. 5A). This frequency of colibactin positive cases is consistent with data from a larger case series31. Even though our sample size was too small to draw robust statistical conclusions, colibactin positive tumors tended to be smaller and located further from the anal verge. Patients with colibactin positive tumors also exhibited lower rates of pathological or clinical complete responses to TNT and worse DFS (although differences were not statistically significant, Fig. 5A).

Fig. 5: Baseline and longitudinal analyses of mutational signatures.
figure 5

A Number of SNVs and mutational signatures detected from tumor tissue WGS; B Emergence of FOLFOX treatment associated mutational signature in blood based WGS in patient #29.

Longitudinal analysis of WGS data from solid tumors and matched ctDNA samples also revealed the emergence of a treatment-related signature (SBS17b) after NAT, which was detected in plasma of patient 29 (Fig. 5B). Of note, this was the only patient in the study with a sufficiently high detected TF to enable de novo mutation calling from plasma samples at F1 and F3. The analyses of those mutations demonstrated a high-concordance with the set of mutations that were initially identified in the reference solid tumor, but also the existence of an additional set of new mutations that were private to the plasma samples. The SBS17b signature detected among these novel mutations is consistent with the systemic chemotherapy treatment that the patient had undergone during NAT.

Discussion

In recent years, several studies have demonstrated the clinical utility of WW for patients with rectal cancer6,7,8,9,10, and this has been further validated by the results from the OPRA trial11. The findings of our translational study, aimed at improving both the selection of rectal cancer patients for WW and follow-up monitoring, indicate that primary-tumor-informed analysis of liquid biopsies can be used to measure disease burden and might be useful to monitor NAT efficacy, with the caveat of low specificity for detection of MRD and prediction of scCR.

Accurate prediction of the expected response to pre-operative treatment is key for the clinical management of LARC patients. For patients who undergo TNT and WW, our results confirm that high disease burden at baseline is associated with lower likelihood of scCR and that consideration of TF can help improve prognostic stratification. The 95% ctDNA positivity at baseline was higher than the 75–84% reported in other studies16,17,18,19,20,21, which were not able to stratify outcomes according to baseline ctDNA levels. Given the similarity between the cohorts, the lower rate of ctDNA detection in the other studies can probably be attributed to lower sensitivity of the assays used in those studies rather than differences in patient characteristics. Clinical trials exploring new modalities for treatment intensification or deintensification should further investigate the predictive value of baseline TF (rather than just ctDNA positivity) as a translational biomarker, using high-sensitivity techniques such as WGS.

The 43% ctDNA positivity at P1 (completion of the first component of NAT) was higher than the 8–23% ctDNA positivity after neoadjuvant CRT in the previous series16,17,18,19,20,21. Similarly to our results, those studies found that ctDNA positivity at completion of neoadjuvant CRT was associated with disease recurrence (HR, 9.11 [meta-analysis of the six studies32] vs 6.47 in our study [for ctDNA detection at completion of the first component of NAT]). Our data confirm the prognostic value of ctDNA detection at completion of either neoadjuvant CRT or neoadjuvant INCT and this could be used to refine the criterion of 20% tumor reduction that was used for omitting CRT after INCT in the clinical trial reported by Schrag et al.5 – although we are underpowered to have an effective comparisons of the two treatment arms due to our limited sample size.

The high rate of ctDNA positivity at post-NAT restaging (P2) and in follow-up, mostly in patients with low TF and no subsequent disease recurrence, poses some concerns for the clinical implementation of ultra-sensitive ctDNA assays to predict WW outcomes in rectal cancer. It is possible that a high baseline level of nontumoral circulating DNA may have lowered the specificity of the assay, with shedding of CEA and nontumoral DNA from normal colonic cells33 causing the paired fluctuation of TF and CEA. However, this is unlikely due to the tumor-informed nature of our pipeline, which incorporates the molecular profile of the primary tumor into the detection method. Alternatively, the low TF may actually reflect shedding of tumoral cells at a level insufficient to overcome immune control and cause regrowth. In the context of WW, it may not be possible for ctDNA detection to reach the high specificity for disease recurrence observed in other settings at the post-surgical34,35,36 or at the landmark timepoints37. However, Amatu et al.24 recently reported that post-TNT targeted sequencing informed by primary-tumor mutations had high specificity but low sensitivity and low overall accuracy. The WGS-based ctDNA assay used in our study does not have low sensitivity (a major concern with liquid biopsy-based assessments of MRD38,39), but our results suggest that isolated detection of ctDNA in WW patients with a highly sensitive primary-tumor-informed assay is not a sufficient predictor of regrowth and should be combined with dynamic monitoring of TF.

The WGS data from solid tumors and matched plasma samples generated as part of our analysis pipeline provides biological insights that go beyond MRD detection and can have translational relevance. As proof-of-concept, we robustly identified colibactin signatures in almost one fourth of the patients in our cohort. Colibactin is a genotoxin produced by some strains of E. coli that harbor the polyketide synthase (pks) genomic island and are often found in the human gut40. Previous studies have shown that colibactin can cause double-strand breaks and mutations in host cells41, contributing to the pathogenesis of colorectal cancer42,43. Even though our sample size is limited, our results suggest that these signatures may be associated with differences in clinical outcomes, and could therefore potentially be used as future biomarkers for patient stratification.

The main limitation of our study is the relatively small sample size, which prevented analysis of differences between the INCT-CRT and CRT-CNCT groups and which warrants caution in drawing conclusions from the data. In particular, the lack of statistical significance in some of our comparisons might be explained by our limited statistical power and some of these results should be revisited in future, larger cohorts. In addition to this, missingness of plasma samples that were not collected or failed QC at some specific timepoints, particularly for patients who underwent resection of the primary tumor, prevented meaningful comparison across follow-up measurements and make it difficult to generalize some of our findings, including the high assay sensitivity at baseline and the relevance of TF changes between P0, P1 and P2. The TF thresholds that we have used in this manuscript are purely exploratory, and they were used to illustrate the usefulness of stratifying patients based on TF levels that go beyond the simple binary classification of ctDNA presence or absence. Before any clinical utility of these thresholds can be claimed, they should be optimized and validated using larger, independent cohorts. Notwithstanding these limitations, our study is the first to investigate the use of WGS-based, ultra-sensitive longitudinal analyses of ctDNA in LARC patients undergoing WW and to describe both its strengths and technical challenges.

In conclusion, our results demonstrate that WGS approaches for tumor-informed ctDNA analysis can achieve high sensitivity and hold promise for treatment planning, prognostication and monitoring in LARC patients treated with NAT. However, additional research is needed in order to guarantee adequate specificity for MRD detection and early relapse prediction. Addressing this question will be critical in order to assess the clinical utility of liquid biopsies for monitoring LARC patients undergoing non-operative management.

Methods

Patients’ and data collection

This preplanned secondary retrospective analysis used data from a subgroup of patients from the OPRA trial11, a phase II randomized study evaluating DFS of patients with stage II or III rectal cancer who underwent TNT and selective WW in case of a cCR or a nCR. Patients with LARC were randomized to undergo either CRT-CNCT or INCT-CRT. INCT and CNCT consisted, at the oncologist’s discretion, of either eight 2-week cycles of infusional fluorouracil, leucovorin, and oxaliplatin chemotherapy (FOLFOX) or five 3-week cycles of capecitabine and oxaliplatin (CAPOX); chemoradiotherapy consisted of a long course radiotherapy with concomitant oral or infusional fluoropyrimidine treatment as chosen by the investigators. WW candidates underwent a closer follow-up monitoring with pelvic MRI, CT of the chest and abdomen and colonoscopy every 4 months. The last follow-up update was in April 2023. The data collected for this study were stored in a secured database at MSK.

We analyzed specimens from all OPRA patients treated at MSK with available pre-treatment formalin-fixed, paraffin-embedded (FFPE) primary tumor tissue samples and available plasma samples collected at baseline and throughout the course of treatment. Given the exploratory nature of the study, no formal sample size calculation was performed.

Blood samples

For each patient, plasma was collected before the start of NAT (timepoint P0), right after the completion of INCT in the INCT-CRT group or completion of CRT in the CRT-CNCT group (P1), 8 ± 4 weeks after the end of NAT (P2), and at follow-up visits (F1, F2, etc.) (Fig. 1).

DNA isolation and sequencing

Pathologist-guided microdissection was carried out to enrich tumoral DNA content from FFPE samples of primary tumors of eligible patients. DNA was extracted using either the AllPrep DNA/RNA FFPE Kit (Qiagen) or the Qiagen DNeasy Tissue Kit (Qiagen). Plasma samples were collected in BD Vacutainer K2-EDTA tubes and stored at −80 °C; circulating DNA was subsequently extracted by centrifugation. Both plasma-derived DNA and primary-tumor-derived DNA were sequenced using the NovaSeq platform (Illumina), employing paired-end sequencing with a read length of 150 bp.

Detection of ctDNA and estimation of TF

ctDNA was assessed using a primary-tumor-informed assay based on WGS, as previously described26. Specifically, we used the C2i Genomics platform, which has been validated and used in previously published studies28,29. The algorithm uses the complete set of somatic alterations (single-nucleotide variants [SNVs], small indels and copy number alterations) found in a patient’s tumor sample to generate a patient-specific tumor signature, which is then applied to detect ctDNA in cell-free DNA from blood samples and to estimate TF. To address WGS artifacts, the signatures were refined by excluding alterations detected in at least 3 of 45 cell-free DNA samples from healthy individuals. Furthermore, suppression of cell-free DNA read-level error was implemented to eliminate SNV-like artifacts by assessing concordance between overlapping read-pair sequences.

Analyses of mutational signatures in solid tumors and plasma samples

Cancer mutational signature analysis was performed both in solid tumor samples and cfDNA plasma samples (Fig. 5A, B). Plasma specimens with a TF above 10% were selected for de novo calling of SNVs and CNVs. SNV and INDEL results from the plasma samples were compared with results from the solid tumor samples to determine shared, tumor-unique, and plasma-unique alterations.

Sigprofiler v1.1.2044 was used to extract SNV and INDEL signatures in a three-step workflow: (I) De-novo extraction of signatures for potential new signatures in the cohort. (II) Fitting of selected signatures from COSMIC v3.345 for cancer signatures and artifact signatures used for the detection of spurious deviations30. (III) A final fitting using cancer-type exclusive signatures and the manual inclusion of select signatures determined from steps (I) and (II), that corresponds to the final set of signatures presented in the results.

Outcomes and statistical analysis

DFS was calculated from randomization until local or distant disease recurrence; as per the original protocol12, the occurrence of an operable cancer regrowth in patients followed-up with WW intent was not considered a treatment failure. Overall survival (OS), local recurrence-free survival and distant recurrence-free survival were calculated from the date of randomization until, respectively, death, local recurrence or distant recurrence. Presence of cCR/nCR was assessed at the time of post-NAT restaging according to the radiologic, endoscopic and clinical criteria of the original study. For this study, patients with cCR/nCR who did not develop later tumor regrowth or disease recurrence were defined as having a scCR.

Right-censored data were modeled using univariate Cox regression. For ctDNA-based predictors, left truncation was employed in order to consider the date of the blood draw and not the randomization date as the start of observation. Log rank was used despite Cox regression in cases where a P value could not be computed due to the absence of events in one of the two groups. Logistic regression was used to model binomial outcomes. When TF was used as a numerical predictor in Cox and logistic regressions, log10 transformation was applied, and TF of 0 was entered as 0.001%. The Fisher exact test and the Wilcoxon signed-rank test were used to compare categorical and numerical variables, respectively. Correlation between numerical variables was assessed with the Spearman ρ coefficient.

Ethics approval and consent to participate

This study was conducted in accordance with the principles of the declaration of Helsinki. All subjects provided informed consent as specified in the study protocol. The MSKCC Institutional Review Board/Privacy Board (IRB/PB) approved the study protocol and the respective relevant documentation prior to its implementation (IRB number: 13-213 A23).