Introduction

The invasive margin (IM) of colorectal cancer (CRC) represents the front line of a tumor as it spreads into surrounding tissue. We know that the IM plays a critical role in cancer invasion and disease progression. The IM can be fairly heterogenous with both pushing border (PB) as well as infiltrative border configurations. It has been previously shown that infiltrative growth patterns contribute to a worse disease-free survival in comparison to pushing borders1,2Due to a growing body of evidence linking IM patterns with patient survival3, further investigation is needed to elucidate the biological mechanisms behind the invasive potential of the CRC tumor front.

Within the IM, distinct morphological features of cancer cell dissociation are represented by tumor budding (TB) and poorly differentiated clusters (PDCs). TB is defined as isolated cancer cells or cell clusters of < 5 cells located at the IM4. TB has been firmly established as a powerful independent predictor of lymph node metastasis, disease recurrence, and cancer-related death in CRC5TB has been incorporated into the World Health Organization Classification of Digestive System Tumours (2019)6and international CRC reporting protocols such as the International Collaboration on Cancer Reporting7PDCs are defined as isolated tumor clusters ≥ 5 cells without glandular formation8Several studies have demonstrated that PDCs are independently associated with poor outcomes9Novel grading systems combining TB and PDCs have shown that their combination is at least equivalent to their separate assessment10.

TB and PDCs, despite the arbitrary cut-off of 5 cells, may be considered as different points of a biological continuum9Studies investigating the relation of TB and PDCs with the expression of epithelial-mesenchymal-transition (EMT) related markers, determined that TB and PDCs represent different manifestations of EMT, with TB appearing to be closer to complete EMT than PDCs11,12 Using a limited panel of characterization markers, there is yet to be a study which molecularly interrogates the concept of a biological continuum at the IM using a comprehensive spatial transcriptomic profiling of TB, PDC, and PB regions to elucidate mechanisms of CRC invasion.

Through the efforts of the International Tumor Budding consortium (ITB), an effective AI model for automated detection of TB in H&E whole slide images (WSIs) has been developed and validated in large series of CRC cases13 By using the algorithm to detect isolated single tumor cells and cell clusters of any particular size, this is the first study to quantify nearly 400,000 single tumor cells/cell clusters across the IM of more than 1100 CRC cases. We determine cellular abundancies of TB and PDCs and establish correlations with disease-free survival (DFS). We investigate whether the size of TB and PDCs plays a role in CRC prognostication and link these findings with our transcriptomic analysis to draw conclusions as to whether a biological continuum between PDCs and TB exists, and the role tumor clusters play in cancer invasion.

Materials and methods

Cohorts/Study populations

A cohort of 1134 primary tumors from patients with pTNM stage I-III CRC who did not receive neoadjuvant treatment was established by the IBC and used for the digital assessment and quantification of TB and PDCs using automated detection. The cohort is comprised of 163 cases from Switzerland, 601 cases from the Netherlands, and 370 cases from Canada. This study also included separately resection material from 20 cases at the Radboud University Medical Center (RadboudUMC) where TB, PDC and PB regions were identified and processed for spatial transcriptomic and immunohistochemical (IHC) analyses. Pathologists (SKO, IDN) were guided and selected regions based on a review of cases according to the International Tumor Budding Concensus Congress (ITBCC) 2016 guidelines4and using the formal definition of PDCs8 This study was approved by ethical review boards at each center (approval numbers 2017 − 01803, 2017–3603, and REB17-0054-E). During their cancer treatment, patients provided informed consent, acknowledging that left-over tissue material could be used for research, and at that time, they expressed no objections to such use. All methods were performed in accordance with the relevant guidelines and regulations.

Digital assessment

TB and PDCs were assessed using a deep learning network for the automated detection of TB in H&E whole-slide images13The algorithm was used to detect all isolated tumor cells/cell clusters ranging in size from 1 to 10 cells. For each of the 1134 cases, an invasive margin of 1000 μm was determined using an IM detection algorithm13 (Fig. 1A). All tumor cells/clusters ≤ 10 cells were quantified across the entire IM. All tumor cells/clusters from 1 to 4 cells were categorized as TB. All tumor clusters ranging from 5 to 10 cells were categorized as PDCs. TB was scored automatically for each case analogous to ITBCC guidelines4A TB hotspot of 0.785 mm2was determined using a hotspot detection algorithm13 The number of TB within the hotspot was determined to produce a TB score (Fig. 1B). The ITBCC cut-off of 10 buds for high-grade TB was used to distinguish between low and high-grade TB4. For the scoring of PDCs, a PDC hotspot of 0.785 mm2 was determined using the hotspot detection algorithm. The number of PDCs within the hotspot was determined to produce a PDC score (Fig. 1C). A cutoff of 10 PDCs was used to distinguish between low and high-grade PDCs according to the method of Ueno et al.8. Tiered scoring was also performed in the same manner as done for traditional TB and PDC scoring but with increasing the tumor cluster size cut-off incrementally. This means that scores using the hotspot detection algorithm13with a hotspot of 0.785 mm2 were determined separately for only single tumor cells, ≤ 2 cells, ≤ 3 cells, ≤ 4 cells (TB), ≤ 5 cells, ≤ 6 cells, ≤ 7 cells, ≤ 8 cells, ≤ 9 cells and ≤ 10 cells. A cutoff of 10 tumor cells/clusters was used to distinguish between low and high-grade for tiered scoring as was done for TB and PDC scoring. The decision was made to limit the detection to clusters ≤ 10 cells as 10-celled clusters accounted for < 0.5% of all tumor cells/clusters quantified in the study. The coordinates for all hotspots were measured in order to determine whether two hotspots overlapped or not.

Fig. 1
figure 1

Tumor Budding (TB) and Poorly Differentiated Cluster (PDC) assessment using automated detection in H&E using deep learning. (A) An H&E slide processed with the automated TB detection algorithm in H&E used to detect single tumor cells and tumor clusters ≤ 10 cells. A hotspot detection algorithm is used where densities of tumor buds and PDCs along the invasive front are determined. (B) Detection of tumor cells/clusters within the PDC hotspot (area = 0.785 mm2. Automatically detected tumor cells/clusters are denoted by the colored overlays. (C) Detection of tumor cells/clusters within the TB hotspot (area = 0.785 mm2. Automatically detected tumor cells/clusters are denoted with the colored overlays. Colored overlays are not indicative of size. The TB and PDC hotspots in this particular case do not overlap. (Scale bar = 10 mm; Inset = 250 μm).

Digital Spatial profiling (DSP)

In order to characterize intratumoral heterogeneity of the IM, 8 tissue microarrays (TMAs) consisting of 28, 32, and 29 cores representing TB, PDCs, and PB regions, respectively, were constructed from formalin-fixed paraffin-embedded (FFPE) resection material of 20 heterogenous CRC primary tumor cases collected at the RadboudUMC (Table S1). Serial Sect. (4 μm) of the TMA blocks were cut and mounted onto glass slides. For each TMA, we performed DSP using the Nanostring GeoMx DSP platform (NanoString, Seattle, WA, USA) and Cancer Transcriptome Altas (CTA, NanoString) RNA panel as previously published14,15,16Pathologists (SKO, IDN) pre-selected regions of interest (ROIs) in H&E stained serial sections and selected the same ROIs in pan-cytokeratin-positive DSP sections according to the ITBCC guidelines4and using the formal definition of PDCs8Serial TMA sections stained with H&E were scanned processed using the automated TB detection algorithm13 The algorithm quantified all tumor cells/clusters ≤ 10 cells within the H&E ROIs. PB ROIs were determined as having no TB or PDCs (Fig. 2A). PDC ROIs were determined as having more than 50% PDCs (Fig. 2B). TB ROIs were determined as having more than 50% TBs (Fig. 2C). Tumor areas of interest (AOIs) within the ROIs were selected based on PanCK + staining. TME AOIs were selected as PanCK–. Only regions contained within the ROIs were utilized in our analyses (Fig. 2). Data was processed and normalized as previously published16.

Fig. 2
figure 2

Overview of Tumor Budding (TB), Poorly Differentiated Cluster (PDC), and Pushing Border (PB) regions. (A) (Left) A PB region from a pan-cytokeratin stained TMA serial section. (Center) The same PB region from an H&E stained serial section. (Right) The same PB region from a TMA serial section processed for Nanostring GeoMx Digital Spatial profiling (DSP). The section was stained with pan-cytokeratin (PanCK, green), CD45 (red) and DNA stainSyto13 (blue). The regions of interest (ROIs) are represented by the white irregular polygons. Within the ROIs, the areas of interest (AOIs) were selected based on segmentation of pan-cytokeratin staining. Tumor AOIs (all epithelium which is segmented red and contained within an ROI) are pan-cytokeratin +. Tumor microenvironment (TME) AOIs (all TME segmented yellow) surrounding the segmented epithelium within the ROI) are pan-cytokeratin –. Each AOI is considered a sample. (B) (Left) A PDC region from a pan-cytokeratin stained TMA serial section. (Center) The same PDC region from an H&E stained TMA serial section. Automatically detected tumor cells/clusters are denoted by the colored overlays. (Right) The same PDC region from a TMA serial section processed for DSP. (C) (Left) A TB region from a pan-cytokeratin stained TMA serial section. (Center) The same TB region from an H&E stained serial section. Automatically detected tumor cells/clusters are denoted by the colored overlays. (Right) The same TB region from a TMA serial section processed for DSP. Colored overlays are not indicative of size. (Scale bar = 250 μm).

Immunohistochemistry (IHC)

Serial Sect. (4 μm) of the TMAs pertaining to the 20 heterogenous CRC primary tumor cases were stained using H&E, AE1/AE3, and FN1 as previously published16Slides were scanned using a P1000 (3DHistech, Budapest, Hungary). The same ROIs selected for the DSP experiments were selected in the IHC WSIs and only areas within the ROIs were included in our analyses. Stromal and epithelial FN1 expression in TB, PDC and PB regions were classified as previously published by two independent observers (TSH, LvdD) based on intensity as follows: 1, weak; 2, moderate; and 3, strong16 No TB or PDC scoring was performed on IHC stained slides as the deep learning network only detects TB in H&E whole-slide images.

Statistical analyses

Statistical analyses were performed using R v4.2.0. Associations between TB, PDC as well as tiered scoring were evaluated using the Cox proportional hazards model. HR and 95% confidence intervals (CI) are reported. P values are reported and considered statistically significant when P< 0.05. Demographics and disease characteristics were compared by χ2 and Kruskal-Wallis and when applicable. Transcriptomic data was preprocessed and normalized according to the GeomxTools 3.1.1 package developed by Nanostring (https://nanostring.com/resources/analyzing-geomx-ngs-rna-expression-datawith-geomxtools/). Differential gene expression across groups was analyzing using linear mixed-effect models17 Spatial deconvolution analysis was performed using the Spatial Decon package (version 1.3) (https:github.com/Nanostring-Biostats/SpatialDecon/). Differentially expressed genes (DEGs) were defined as log2 fold = change > 0.5 and false discovery rate ≤ 0.05.

Results

Quantifying isolated tumor cells/clusters at the IM using automated detection

By automatically scoring TB and PDCs in H&E across the IM of 1134 pTNM Stage I-III cases, we quantified 378,412 single tumor cells and clusters ≤ 10 cells. Of all tumor cells/clusters quantified, single cells accounted for 20.1% and 2-celled clusters accounted for the largest percentage (44.4%) (Fig. 3A). Three-celled clusters accounted for 17.4% with a steadily decreasing percentage as the cluster size increased, with 10-celled clusters having a prevalence of only 0.4%. TB accounted for 90% of all single cells and cell clusters detected. When breaking down the size percentages with regards to TB, 2-celled clusters accounted for nearly half (49.4%) of all TBs and 4-celled clusters were the least prevalent (9%) (Fig. 3B). Five-celled clusters comprised the majority of PDCs (41.7%) with a steadily decreasing prevalence as the cluster size increased to 10-celled clusters (4.3%) (Fig. 3C).

Fig. 3
figure 3

Quantification of all tumor cells/clusters along the invasive front in a large series of colorectal cancer (CRC) cases. (A) Pie chart depicting percentages of tumor cells/clusters of a particular size out of all tumor cells/clusters ≤ 10 cells automatically detected across the entire invasive margin of all 1134 pTNM stage I-III CRC cases. (B) Pie chart depicting percentages of tumor cells/clusters of a particular size out of all tumor cells/clusters ≤ 4 cells detected and considered tumor budding (TB). (C) Pie chart depicting percentages of tumor cells/clusters of a particular size out of all tumor clusters detected with a size between 5 and 10 cells and considered poorly differentiated clusters (PDCs). (D) Percentage overlap of TB and PDC hotspots in all cases as well as the hotspots of increasing cluster size cut-off through the tiered scoring. (E) Table depicting the statistics of tiered scoring with cut-offs of only single cells and inclusion of cell clusters of incrementally increasing size. The mean number of tumor cells/clusters within a hotspot of each grouping, the distribution (%) of cases between low and high-grade groups, and hazard ratios for each grouping are depicted.

Hotspot overlap analysis of TB, PDCs, and tiered scoring system

When conducting the hotspot overlap analysis for TB, PDC, and tiered scoring on the H&E slides of the 1134 pTNM Stage I-III cases, we determined that TB and PDC hotspots overlapped in only 32.8% of cases suggesting that the highest densities of TBs and PDCs are often in differing areas of the IM (Fig. 3D). When comparing the hotspot of only single cells vs. groupings with increasing cluster size cut-off (single cells vs. ≤ 2 cells, single cells vs. ≤ 3 cells, etc.), the highest percentage overlap was between single cells vs. ≤ 2 cells (52.7%) and dropped to roughly 48% with gradually increasing cut-offs (Fig. 1D). When comparing the hotspot of ≤ 2 cells vs. all other groupings (≤ 2 cells vs. ≤ 3 cells, ≤ 2 cells vs. ≤ 4 cells, etc.), the highest percentage overlap was between ≤ 2 cells vs. ≤ 3 cells (81.3%) and leveled off at 75% as the cut-off increased (Fig. 3D). These findings suggest that single tumor cells are often more distributed across the IM compared with larger tumor clusters.

Analysis of tiered scoring and associations with DFS

In association with the tiered scoring performed on the H&E slides of the 1134 pTNM Stage I-III cases as described above, we conducted a cox regression analysis of tiered scoring ranging from a cut-off of only single cells to ≤ 10 cells across the entire cohort to see how the inclusion of gradually larger tumor clusters being scored affects risk stratification and DFS. Using a cutoff of 10 to distinguish between low-risk and high-risk categories (as is done with traditional TB and PDC scoring), we saw that 68% of cases were considered low-risk when only single cells were scored (Fig. 3E). With the inclusion of 2-celled clusters, the distribution inverses with 78% of cases being considered high-risk (mean number of tumor cells/clusters within the hotspot increased from 8 to 18). After reaching a cutoff of ≤ 5 cells, the distribution of cases between the low-risk and high-risk categories stayed consistent at 15% and 85%, respectively. Scoring single cells only, patients in the high-risk category had nearly twice the risk compared to low-risk cases (Hazard Ratio (HR) 1.8, 95% CI 1.4–2.3; P value 0.0001). We observed a steady increase in the HR from ≤ 2 cells (HR 1.7, 95% CI 1.2–2.4; P value 0.003) to ≤ 5 cells (HR 2.4, 95% CI 1.5–4.0; P value < 0.0001) (Fig. 3E). At \(\:\ge\:\:\)5 cells, the risk plateaued and stayed consistently at HR 2.4. We therefore determined that for this cohort, there was no gain or loss in prognostic value with the inclusion of tumor clusters > 5 cells being scored.

Evaluation of TB, PDC scoring systems and associations with DFS

We proceeded to evaluate traditional TB and PDC scoring in H&E within the entire cohort. Clinicopathological features of the overall population and based on TB and PDC grading are shown in Table 1. Regarding TB, 15.8% cases were considered low-grade (Bd1-2) and 84.2% considered high-grade (Bd3). When scored for PDCs, the highest percentage of cases were considered low-grade (P1-2, 85.9%) compared to high-grade (P3, 14.1%). Cases with high-grade TB (Bd3) or high-grade PDCs (P3) were more frequently advanced stage disease compared to Bd1-2 or P1-2 cases respectively. This was accompanied by more lymphatic and perineural invasion. Bd1-2 tumors had relatively more MMR deficiency compared to Bd3 (P value = 0.0001). In turn, MMR-deficient tumors had fewer tumor buds than MMR-proficient tumors. Patients with Bd3 tumors had poorer DFS (HR 2.16, 95% CI 1.36–3.41; P value < 0.0001) in reference to Bd1-2 tumors compared to patients with P3 tumors (HR 1.53, 95% CI 1.30–2.12; P value = 0.01) and in reference to P1-2 tumors (Fig. 4A, B). We combined scores of TB and PDCs to see whether their synergistic performance was stronger than either independently (Fig. 4C). We observed that patients with high-risk cases (Bd3 & P3) had the worst DFS (HR 2.81, 95% CI 1.66–4.78; P value < 0.0001) in reference those with low-risk (Bd1-2 & P1-2) cases, and compared to Bd3 or P3 cases alone. Interestingly, we observed no cases across the entire cohort which were low-grade TB and high-grade PDCs (Bd1-2 & P3). The Intermediate-risk group (Bd3 & P1-2) (HR 2.04, 95% CI 1.28–3.24) had similar risk stratification to Bd3 alone group, suggesting that the low number or absence of PDCs scored in Bd3 cases has nearly the same stratification compared to when PDCs were not scored. Non-existence of Bd3 & P1-2 cases suggests that large numbers of PDCs at the IM only exist with a high amount of TB.

Table 1 Clinicopathological Features of Cases Stratified by Tumor Budding (TB) and Poorly Differentiated Clusters (PDCs).
Fig. 4
figure 4

Association of Tumor Budding (TB), Poorly Differentiated Clusters (PDCs), and their combined score with DFS for patients with Stage I-III colorectal cancer based on univariable analysis. (A) TB, (B) PDC, and (C) the combination of TB and PDC in overall cohort. HR, hazard ratio; CI, confidence interval.

Distinct transcriptomic profiles differentiate between PB, PDC, and TB regions with IHC validation

Having investigated the prognostic effects of TB, PDCs, single tumor cells and scores of increasing cluster size, we set out better understand the molecular background of tumor heterogeneity at the IM through the transcriptomic profiling of PB, PDC and TB regions within 20 heterogenous CRC cases using DSP. These regions presented distinct histology at the IM (Fig. 2A-C). Following preprocessing and normalization, we profiled the expression of over 1,100 unique cancer related gene targets across 470 total samples, of which 235 represented the tumor and 235 the tumor microenvironment (TME). Of the 470 samples, 180 represented TB regions (90 tumor and 90 TME), 170 represented PDC regions (85 tumor and 85 TME), and 120 regions representing PB (60 tumor and 60 TME). When comparing PB and TB regions, we found 348 and 510 DEGs within the tumor and TME, respectively (Figure S1A, B and Table S1). In comparing PB and PDC regions, we found 131 and 35 statistically significant DEGs within the tumor and TME, respectively (Fig. 5A, B, and Table S1). Fibronectin 1 (FN1) and SFRP2, involved in migration and metastatic processes, were upregulated in PDCs regions along with matrix components such as COL1 A1, COL3 A1, and COL1 A2. When comparing PDC and TB regions, we found only 19 and 3 significant DEGs within the tumor and TME, respectively (Fig. 5C, D and Table S1). When comparing the tumor of PDC and TB regions, we found DEGs in TB regions involved with invasion and migration such as FN1, COMP, SPP1, as well as matrix components.

Fig. 5
figure 5

Transcriptional profiles and Differentially Expressed Genes (DEG) between Tumor Budding (TB), Poorly Differentiated Clusters (PDC) and Pushing Border (PB) regions. DEGs between TB, PDC and PB regions using Nanostring GeoMx Digital Spatial Profiling (DSP). (A) Volcano plot demonstrating DEGs within the tumor area of 60 and 85 samples representing PB and PDC, respectively. X-axis represents -Log2 fold change and Y-axis represents -Log10 p value. (B) Volcano plot demonstrating DEGs within the tumor microenvironment (TME) of 60 and 85 samples representing PB and PDC, respectively. (C) Volcano plot demonstrating DEGs within the tumor area of 90 and 85 samples representing TB and PDC, respectively. (D) Volcano plot demonstrating DEGs within the TME of 90 and 85 samples representing TB and PDC, respectively. (E) Spatial deconvolution of immune cell populations within the TME of TB, PDC and PB regions, respectively. Statistical comparison done using a Wilcoxon test.

Spatial deconvolution of immune cell populations using DSP within the TME of PB, PDC and TB regions showed no significant differences in T-cell or B-cell populations (Fig. 5E). We observed significantly higher levels of macrophages in both TB and PDC regions compared to PB regions, as well as significantly higher levels of fibroblasts going from PB to PDC to TB regions (Fig. 5E).

To validate the transcriptomic upregulation of FN1, we performed FN1 IHC staining on TMA serial sections (Fig. 6A-C) of the 20 cases described above where DSP was performed. Based on the FN1 IHC stained slides, we observed significantly higher stromal FN1 expression in PDC compared to PB regions (Fig. 6D). There was no significant difference in stromal FN1 expression between PDC and TB regions. We observed significant differences in epithelial FN1 expression with significantly higher expression going from PB to PDC to TB regions (Fig. 6E). These findings correlated with transcriptomic FN1 expression (Fig. 5A-D, Figure S1A, B, and Table S1).

Fig. 6
figure 6

Validation of transcriptomic expression using Immunohistochemistry (IHC). On TMA serial sections, pan-cytokeratin (AE1/AE3) and Fibronectin 1 (FN1) staining was performed for all Tumor Budding (TB), Poorly Differentiated Clusters (PDC), and Pushing Border (PB) regions. (A) The same representative PB region stained with (upper) AE1/AE3 and (lower) FN1. (B) The same representative PDC region stained with (upper) AE1/AE3 and (lower) FN1. (C) The same representative TB region stained with (upper) AE1/AE3 and (lower) FN1. (D) Violin plot showing stromal FN1 expression scored across all TB (dark blue), PDC (light blue), and PB (pink) regions (p < 0.001, T-test). (E) Violin plot showing epithelial FN1 expression scored across all TB (dark blue), PDC (light blue), and PB (pink) regions (p < 0.001, T-test). (Scale bar = 100 μm; Inset = 25 μm).

Discussion

Through our investigation of TB and PDCs at the IM, we successfully demonstrated a biological continuum from larger to smaller tumor clusters and single tumor cells, with TB having more invasive potential than PDCs. We determined that 2-celled clusters are the most abundant feature at the IM, and a high number of PDCs did not exist without the presence of a high amount of TB. TB and PDC scores in combination yielded a prognostic performance stronger than either independently.

Two-celled clusters being the most abundant feature at the IM (44.4%) supports the idea of collective migration and that CRC invasion occurs more often in clusters rather than as individual tumor cells18,19CRC invasion may occur more often in smaller than larger clusters as mechanical confinement delimits tissue space and forces multicellular movement of epithelial cells between tissue surfaces20 This may also explain why we did not observe an increased risk stratification with a cluster size cut-off beyond 5 cells (Fig. 3E). TB in relation with collective cell migration is also supported by multiple 3D studies observing TB as continuous finger-like extensions from the main tumor21,22,23With isolated single cell/cell cluster migration considered rare, CRC cell migration is more likely a combination of attenuated protrusions from the main tumor with occasional isolated cell clusters and least commonly single cells. This may also reflect a transitional state of CRC migration from projections to isolated tumor clusters/cells. It has been shown that mesenchymal single cells resemble leader cells during collective migration, suggesting the potential duality of single-celled TB observed in 2D as both mesenchymal single cells and more commonly leader cells in collective migration24.

In the progression from PB to PDC to TB regions, we observed an upregulation of DEGs associated with the deposition of extracellular matrix (ECM), cancer invasion and migration. Increased deposition of ECM through an upregulation of COL3 A1, COL5 A2, and COL6 A3is associated with advanced staging, metastasis, and poor prognosis25,26,27 An upregulation of SFRP2promotes therapeutic resistance in a damaged tumor microenvironment, with SFRP1/2-expressing cancer associated fibroblast populations supporting epithelial tumor progression28,29 Overexpression of LAMC2in CRC is associated with poor prognosis by promoting cancer cell migration and invasion30 Both COMP and SPP1promote malignant progression and metastasis through cytoskeletal remodeling and activating the EMT pathway31,32,33,34.

The ECM undergoes continuous remodeling, a process regulated by matrix metalloproteinases (MMPs). Among them, MMP9 plays a pivotal role in cleaving ECM proteins like collagen and elastin, thereby facilitating ECM turnover. Elevated MMP9 expression was observed in TB regions compared to PDC regions (Fig. 5), and is often correlated with enhanced tumor invasion and metastasis in CRC, as MMP9upregulation contributes to degradation of the basement membrane and interstitial stroma, promoting cancer cell dissemination35 There is also suggestion of a strong relationship between MMP9expression in TB and a more aggressive tumor phenotype36 MMP1 and MMP11, which were elevated in PDC regions compared to PB regions, are known to promote colorectal cancer cell migration37,38 Our spatial deconvolution analysis revealed a significant increase in macrophages and fibroblasts in TB and PDC regions compared to PB regions, suggesting their role in ECM remodeling and tumor progression. Macrophages promote ECM degradation by secreting MMP9 and enhance fibroblast activation via TGF-β, leading to increased collagen and FN1production39,40Similarly, fibroblasts, which were more abundant in TB and PDC regions, regulate ECM composition by secreting FN1 and collagen, contributing to ECM stiffness and tumor invasion41.

Increased deposition of ECM is a hallmark of desmoplastic reaction (DR), referring to fibrotic tissue response surrounding invasive tumors and characterized by fibroblast proliferation. DR results in stromal fibrosis, which contributes to the structural integrity of the TME, influences tumor progression and is correlated to our findings of increased ECM deposition and presence of fibroblasts in TB regions. In CRC, the presence and maturity of DR at the invasive front has been linked to patient prognosis. Ueno et al. categorized DR in CRC to three patterns-mature, intermediate, and immature-based on histological features (keloid-like collagen and myxoid stroma), with immature DR having been associated with poorer outcomes42 These findings highlight stromal fibrosis as a key factor in CRC progression.

Budding regions are characterized by a phenotypic switch compared to the bulk by acquiring migratory characteristics, becoming EMT positive, and decreasing cell proliferation16,43FN1, of which we observed increased transcriptomic and protein expression of in TB regions compared to PDC and PB regions, is associated with an EMT response44 In-vitro knockdown of FN1in CRC cell lines significantly inhibited invasion and cellular migration44,45Pavlic et al. showed that TB and PDCs are related to partial EMT, with TB being closer to complete EMT than PDCs11A mean decrease in Ki-67 labeling and higher nuclear β-catenin index was also observed in the progression from tumor core to PDCs to TB12 Based on our findings, TB appears to possess greater invasive potential than PDCs which supports their role as leader cells during collective migration or mesenchymal single cells in CRC invasion. These findings also correlate with a higher prognostic value observed for TB compared to PDCs (Fig. 4).

Several studies have shown the independent prognostic strength of both TB and PDCs, with our findings supporting their clinical relevance in early-stage CRC5,8,9,46,47,48,49,50,51High-grade TB in early-stage CRC is a risk factor for LNM and may warrant radical surgery with lymph node dissection3,5Stage II CRC with high-grade TB is considered high-risk and patients may benefit from adjuvant therapy3,5PDCs are similarly predictive in stage II CRC as patients with high-grade PDC had a significantly worse 5-year DFS than patients with Stage III and low-grade PDC52These high-risk Stage II patients may benefit from adjuvant chemotherapy9PDC grade may also help guide post-operative surveillance of stage III patients9.

Currently, TB and PDCs are assessed manually in diagnostic practice. Shivji et al. showed that the combination of TB and PDC into a single manual scoring system provides a risk stratification in CRC at least equivalent to that of TB and PDCs assessed individually10Manual scoring is time-consuming and often plagued by low interobserver agreement46,53AI models can serve to score TB and PDC more efficiently and reproducibly13,54 The increased prognostic performance we observed by combining automated TB and PDC scores may be due in part to the assessment of multiple hotspot areas rather than one as only 32.8% of cases had overlapping hotspots (Fig. 3D). Considering this, it may be worth re-evaluating how TB and PDCs are scored and expanding beyond a single hotspot-based system. Automated detection allows for more complex assessment such as multiple hotspots or the entire IM which may give a more accurate assessment as to how aggressive a tumor actually is. Further studies are needed to determine whether TB and PDCs should be combined into a single scoring system as well as explore and evaluate new assessment strategies. Until then, we recommend that TB and PDCs be assessed simultaneously in routine diagnostics to improve risk stratification and help identify high-risk Stage I, II, and III cases which may benefit from radical resection, adjuvant therapy, or surveillance, respectively.

Our study utilizes a large, multicentric cohort of Stage I-III CRC cases. Limitations are that this study is retrospective and uses a deep learning algorithm which has not yet been implemented in routine diagnostic practice. This study also does not account for temporal changes as is inherent to histopathology, and investigates a single timepoint after the tumor has been excised.

In summary, our study provides evidence for the biological continuum of PDCs and TB as they represent different points of CRC invasion. TB has greater invasive potential and prognostic value than PDCs, with both showing significantly more invasive potential compared to PB regions. The prognostic performance of scoring TB and PDCs simultaneously provides strong evidence for their adoption into standard of care. The study also emphasizes the value of incorporating AI into diagnostic practice which can help optimize diagnostic assessment and benefit patients.