Integration of radiomics, deep learning, transcriptomics, and metabolomics reveals prognostic risk stratification and underlying biological mechanisms in colorectal cancer

Li, Zhiheng; Cai, Rongzhi; Qin, Yangyang; Liao, Xiaoqing; Wang, Enqi; Wu, Xuanyu; Zhao, Yan; Lu, Zengxin; Lin, Yan

doi:10.1038/s41698-026-01331-2

Download PDF

Article
Open access
Published: 06 March 2026

Integration of radiomics, deep learning, transcriptomics, and metabolomics reveals prognostic risk stratification and underlying biological mechanisms in colorectal cancer

Zhiheng Li¹^na1,
Rongzhi Cai^1,2^na1,
Yangyang Qin³^na1,
Xiaoqing Liao¹,
Enqi Wang¹,
Xuanyu Wu¹,
Yan Zhao^1,4,
Zengxin Lu⁵ &
…
Yan Lin¹

npj Precision Oncology volume 10, Article number: 155 (2026) Cite this article

3309 Accesses
Metrics details

Subjects

Abstract

Colorectal cancer (CRC) is the third most common malignancy and the second leading cause of cancer-related death worldwide, yet current prognostic stratification is hindered by tumor heterogeneity. Here, we developed a deep learning radiomics model (DLRM), optimized through systematic evaluation of ten machine learning algorithms across 117 combinations, using venous-phase computed tomography (CT) images of 1183 patients from four centers. The resulting risk stratification stratified patients into high- and low-risk groups with distinct survival outcomes, and integration with clinical factors further improved prediction. Integrative transcriptomic and metabolomic analyses revealed that high-risk tumors were enriched for extracellular matrix (ECM)-related pathways associated with tumor progression, whereas low-risk tumors exhibited immune-related signatures, including higher CD8⁺ T-cell infiltration. Both omics consistently identified butanoate metabolism and nitrogen metabolism as protective pathways, validated in an independent public cohort (n = 417). This integrative analytic framework provides robust risk stratification and uncovers biological processes with potential therapeutic relevance.

Machine learning-based model for CD4⁺ conventional T cell genes to predict survival and immune responses in colorectal cancer

Article Open access 18 October 2024

Deep representation learning of tissue metabolome and computed tomography annotates NSCLC classification and prognosis

Article Open access 03 February 2024

Integrated analysis of single-cell and bulk RNA-sequencing to predict prognosis and therapeutic response for colorectal cancer

Article Open access 07 March 2025

Introduction

Colorectal cancer (CRC) ranks as the third most common cancer and the second leading cause of cancer-related death worldwide¹. Accurate prediction of individual prognosis is of great significance for CRC patients. However, current prognostic stratification and treatment decisions mainly rely on the pathological TNM staging system, which is suboptimal due to the substantial tumor heterogeneity observed in histopathological CRC tissues^2,3. Therefore, there is an urgent need for accurate risk stratification at the time of diagnosis to guide individualized treatment and surveillance strategies.

Contrast-enhanced computed tomography (CT) is the preferred imaging modality for CRC evaluation⁴, but conventional CT features offer limited prognostic information. In recent years, radiomics and deep learning (DL) have emerged as powerful tools in oncology, enabling the extraction of high-dimensional quantitative imaging features that capture tumor heterogeneity noninvasively and improve prognostic prediction^5,6,7,8,9. Machine learning (ML) can further integrate these features to build predictive models with high accuracy^10,11,12. Nevertheless, the performance of such models is highly dependent on the choice of algorithm, and different ML algorithms may yield substantially different results. To date, there is no consensus on the most effective algorithm for prognostic modeling in CRC. Moreover, the underlying biological underpinnings of imaging-based prognostic models remain poorly understood, thereby hindering their clinical application.

In this study, we aimed to develop a prognostic risk stratification based on a deep learning radiomics model (DLRM), optimized through systematic evaluation of ten machine learning algorithms across 117 combinations, to stratify CRC patients according to their overall survival (OS). Furthermore, we integrated metabolomic and transcriptomic analyses to explore the biological mechanisms underlying distinct prognostic outcomes, aiming to provide biological insights and interpretability for the imaging-based predictive models.

Results

Patient characteristics

A total of 1183 patients (mean age: 59 ± 11 years; 438 females) were enrolled for model construction. Among the included patients, 622 from Center 1 were divided into a training cohort (n = 435; mean age: 60 ± 10 years; 159 females) and an internal validation cohort (n = 187; mean age: 58 ± 11 years; 72 females) at a ratio of 7:3. Additionally, 313 (mean age: 59 ± 10 years; 122 females) from Center 2 and Center 3 were assigned to the external validation cohort 1, and 248 (mean age: 58 ± 11 years; 85 females) from Center 4 were assigned to the external validation cohort 2. The median OS for the four cohorts was 69.03, 68.97, 67.60, and 67.85 months, respectively. No significant differences were observed among the four cohorts in terms of age, sex, carcinoembryonic antigen (CEA), carbohydrate antigen 199 (CA199), T stage, N stage, location, postoperative chemotherapy, or differentiation (all p > 0.05), as shown in Table 1. The overview of the entire research process was illustrated in Fig. 1.

**Fig. 1: Overview of the entire research process.**

Table 1 Patient’s characteristics for DLRM-RS development

Full size table

Feature selection and development of the DLRM-RS

A total of 1437 radiomics and 512 DL features were initially extracted from CT images of CRC patients. After deleting the features that showed poor reproducibility [intra- and inter-observer intraclass correlation coefficients (ICCs) ≤ 0.8], 1130 radiomics and 504 DL features were retained. Subsequent Spearman correlation analysis removed redundant features, yielding 174 radiomics and 142 DL features. Finally, univariate Cox regression identified 38 prognostic features (Supplementary Fig. S1 and Table S1).

These 38 prognostic features were analyzed using an ML-based integrative framework to construct the DLRM. As an initial screened feature set, these features were further refined through intrinsic feature selection within the ML models. Evaluation of 117 algorithmic combinations revealed that the combination of Lasso + GBM achieved the highest mean concordance index (C-index) (0.768) across all cohorts (Fig. 2a). In this approach, 15 features were initially selected by Lasso and subsequently used to train the GBM model. The final DLRM comprised nine radiomics features and six DL features, with their respective coefficients shown in Fig. 2b and Supplementary Table S2. Notably, the DL feature of D2 (feature_172) exhibited the highest coefficient weight (Fig. 2b). Spearman correlation analysis confirmed minimal multicollinearity, with all inter-feature correlations <0.6 (Fig. 2c). To assess the incremental prognostic value of the proposed DLRM, its discrimination performance was compared with simpler Cox-based baseline models. As shown in Supplementary Fig. S2 and Table S3, the DLRM consistently achieved higher C-index values than the radiomics-only Cox and DL-only Cox models across the training, internal validation, and two external validation cohorts. The improvements in C-index were statistically significant in all cohorts (all p < 0.05), indicating that the DLRM (Lasso + GBM) provided superior discriminatory ability over conventional single-modality Cox models. The GBM model generated a risk score for each patient. The optimal risk score cutoff (−0.380) (Fig. 2d) was determined in the training cohort using the ‘surv_cutpoint’ function (survminer’ package), which established the DLRM-based risk stratification (DLRM-RS) threshold for classifying patients into low- and high-risk groups.

**Fig. 2: Integration of DL and radiomics features for survival risk stratification in CRC patients using a machine learning framework.**

Kaplan-Meier (K-M) analysis revealed significantly poorer OS in high-risk versus low-risk patients across all cohorts (all p < 0.001; Fig. 2e–h). Time-dependent receiver operating characteristic (ROC) curve further demonstrated the prognostic performance of the DLRM-RS: the area under the curve (AUC) values for 3- and 5-year OS were 0.720 and 0.744 in the training cohort, 0.700 and 0.730 in the internal validation cohort, 0.663 and 0.693 in external validation cohort 1, and 0.759 and 0.720 in external validation cohort 2 (Fig. 2i–m). Collectively, these findings confirmed that the DLRM-RS provided robust prognostic stratification for CRC patients.

DLRM-RS performance across different clinical subgroups

To assess the prognostic utility of DLRM-RS across different clinical subgroups, we conducted stratified survival analyses within the entire cohort. The DLRM-RS consistently demonstrated a strong prognostic indicator across all subgroups, including age, sex, CEA/CA199 levels, T/N stage, location, postoperative chemotherapy (received vs. not received), and differentiation grade (all p < 0.001; Fig. 3, Supplementary Fig. S3 and Table S4). High-risk patients consistently showed significantly worse OS than low-risk patients in all subgroups.

**Fig. 3: The HRs of OS predicted using the DLRM-RS across clinical subgroups in 1183 CRC patients.**

Comparative analysis of clinical characteristics revealed that high-risk patients were more likely to present with adverse clinical features, including advanced T/N stages, elevated levels of CEA and CA199, and poor tumor differentiation (all p < 0.05; Fig. 4a–h). However, no significant differences were observed in age, sex, or location between risk groups (all p > 0.05). DLRM-RS scores were significantly higher in patients with advanced T/N stages, elevated levels of CEA and CA199, and poor differentiation (all p < 0.05; Fig. 4i–p). No significant variations in DLRM-RS scores were observed across age, sex, or location (all p > 0.05). These findings collectively demonstrated the reliability of DLRM-RS in reflecting tumor aggressiveness and malignant potential.

**Fig. 4: Associations between DLRM-RS-defined risk groups and clinical characteristics.**

Development and evaluation of the nomogram model

Univariate and multivariate Cox regression analyses were conducted to assess the independent prognostic value of DLRM-RS. The univariate Cox regression analysis identified DLRM-RS as a significant predictor of OS in CRC patients (HR = 8.00, 95% CI: 5.40–11.86, p < 0.001; Fig. 5a; Supplementary Table S5), with this association remaining significant after multivariate Cox regression adjustment (HR = 5.88, 95% CI: 3.81–9.06, p < 0.001; Fig. 5a; Supplementary Table S5). Multivariate Cox regression analysis identified CEA, N stage, and DLRM-RS as independent prognostic factors for OS. Using these three variables, a prognostic nomogram was constructed through multivariate Cox regression with stepwise selection to generate individualized risk predictions (Fig. 5b).

Our results demonstrated that the newly developed nomogram outperformed both the clinical model (including only CEA and N stage as predictors) and the DLRM-RS model across all cohorts. The C-index values and AUCs of 3- and 5-year OS predictions in all cohorts were significantly higher (all p < 0.05; Fig. 5c, d; Supplementary Fig. S4a–d; Table 2). Specifically, C-index values of the nomogram were 0.787 (95% CI: 0.742–0.828), 0.804 (95% CI: 0.736–0.866), 0.758 (95% CI: 0.693–0.822), and 0.803 (95% CI: 0.738–0.862), respectively. For 3-year OS prediction, the corresponding AUCs were 0.786 (95% CI: 0.726–0.847), 0.838 (95% CI: 0.767–0.908), 0.737 (95% CI: 0.651–0.824), and 0.830 (95% CI: 0.748–0.911), respectively (Supplementary Fig. S5a–d). For 5-year OS prediction, the AUCs were 0.822 (95% CI: 0.775–0.869), 0.853 (95% CI: 0.785–0.920), 0.786 (95% CI: 0.719–0.852), and 0.823 (95% CI: 0.754–0.893), respectively (Supplementary Fig. S5e–h).

Table 2 Comparison of C-index and time-dependent AUCs among three prognostic models across training, internal validation, and two external validation cohorts

Full size table

Furthermore, the calibration plots confirmed the accuracy of the nomogram in predicting 3- and 5-year OS, indicating a good consistency between the predicted and the observed survival probabilities (Fig. 5e–h). Decision curve analysis (DCA) further showed that the nomogram yielded greater clinical net benefits compared to both the clinical model and the DLRM-RS model (Supplementary Fig. S6a–d).

Metabolomic alterations between high- and low-risk groups in CRC

To investigate metabolic differences between high-risk and low-risk CRC patients, 52 patients from Center 3 were stratified into high-risk (n = 17) and low-risk (n = 35) groups based on the DLRM-RS. Metabolomics analysis using 600 MHz NMR spectroscopy was performed on these 52 pairs of CRC tumor tissues and matched adjacent normal tissues (Supplementary Fig. S7). Principal component analysis (PCA) was conducted to visualize global metabolic differences among the high-risk, low-risk, and normal tissue groups. The 3D PCA score plot demonstrated a clear separation among the three groups, indicating distinct metabolic profiles (Fig. 6a). Hierarchical clustering heatmaps further illustrated intergroup metabolic differences, highlighting relative changes in metabolite levels across the normal, high-risk, and low-risk groups (Fig. 6b). Subsequently, orthogonal partial least squares discriminant analysis (OPLS-DA) was employed to enhance group discrimination. OPLS-DA score plots demonstrated clear segregation both between CRC and normal tissues (intercepts of goodness-of-fit R² = 0.885; goodness-of-prediction Q² = 0.817) and between high-risk and low-risk CRC groups (R² = 0.879, Q² = 0.802), supporting the robustness and predictive power of the models (Fig. 6c, d). Moreover, 999-time permutation tests were conducted to assess potential overfitting. The R² and Q² suggested that the OPLS-DA models were not apparently overfitted (Fig. 6e, f). Collectively, these findings revealed apparent metabolic differences among the normal, high-risk, and low-risk CRC groups.

**Fig. 6: Differential metabolite and metabolic pathway analysis.**

Biomarkers were selected based on variable importance in projection (VIP > 1) from the OPLS-DA model and a Student’s t-test p-value < 0.05. As a result, significant metabolic alterations included 26 metabolites differentiating CRC from normal tissues and 27 metabolites distinguishing high-risk from low-risk CRC tissues (Supplementary Tables S6, S7). Pathway enrichment analysis using MetaboAnalyst revealed that the 26 metabolites differentiating CRC from normal tissues were enriched in pathways such as arginine biosynthesis, alanine, aspartate and glutamate metabolism, butanoate metabolism, and valine, leucine, and isoleucine biosynthesis (Fig. 6g, Supplementary Table S8). Similarly, the 27 metabolites differentiating high-risk from low-risk groups were enriched in alanine, aspartate, and glutamate metabolism, arginine biosynthesis, butanoate metabolism, glyoxylate and dicarboxylate metabolism, and others (Fig. 6h; Supplementary Table S9). Venn analysis of differential metabolites from CRC-normal and high-low risk comparisons identified 17 shared metabolites, representing key signatures linking tumorigenesis with prognostic risk stratification (Fig. 6i; Supplementary Fig. S8). The enrichment analysis showed that among the 17 overlapping metabolites, 11 metabolites were significantly enriched [false discovery rate (FDR) < 0.05] in six key metabolic pathways, namely arginine biosynthesis, butanoate metabolism, alanine, aspartate and glutamate metabolism, histidine metabolism, nitrogen metabolism, and valine, leucine, and isoleucine biosynthesis (Fig. 6j; Supplementary Table S10).

Transcriptomic profiling reveals distinct biological processes between high- and low-risk groups in CRC

To explore transcriptomic alterations underlying risk stratification defined by DLRM-RS in CRC, we analyzed RNA sequencing data from 19 CRC samples in The Cancer Imaging Archive (TCIA) database, comprising 15 low-risk and 4 high-risk samples. Gene set enrichment analysis (GSEA) based on Gene Ontology (GO) terms revealed that significantly upregulated pathways in the low-risk group were predominantly related to antigen presentation, while those enriched in the high-risk group involved cardiac muscle repolarization and branching morphogenesis (Fig. 7a). All enriched pathways were ranked by their normalized enrichment score (NES) to identify the most prominent biological processes in each group. Notably, immunological synapse (NES = −2.60, p < 0.001), respiratory burst (NES = −2.567, p < 0.001), and antigen processing and presentation of peptide antigen (NES = −2.430, p < 0.001) were significantly enriched in low-risk tumors (Fig. 7b). In contrast, we observed significant up-regulation of several extracellular matrix (ECM)-related pathways in high-risk group, such as collagen fibril organization (NES = 2.307, p < 0.001), ECM structural constituent (NES = 2.274, p < 0.001), and ECM constituent conferring tensile strength (NES = 2.184, p < 0.001) (Fig. 7b). These results suggested distinct biological processes between risk groups, with low-risk tumors exhibiting immune activation and high-risk tumors characterized by ECM remodeling.

**Fig. 7: Transcriptomic profiles between high- and low-risk groups defined by DLRM-RS.**

Given the overall upregulation of immune-related pathways in the low-risk group–particularly antigen processing and presentation of peptide antigen, a key pathway in anti-tumor immune responses–we further assessed differences in the tumor immune microenvironment between risk groups. Using the Cell-type Identification by Estimating Relative Subpopulations of RNA Transcripts (CIBERSORT) algorithm¹³, we estimated the abundance of 22 immune cell subsets across samples. Radar plots revealed distinct immune cell profiles between groups (Fig. 7c), with CD8⁺ T cells significantly more abundant in the low-risk group (Fig. 7d).

To validate the metabolomic findings at the transcriptomic level, we conducted pathway enrichment analysis using Kyoto Encyclopedia of Genes and Genomes (KEGG) gene sets between high-risk and low-risk groups. GSEA results revealed several metabolic pathways significantly upregulated in the high-risk group, including the cGMP-PKG signaling pathway (NES = 1.882, FDR < 0.001), proteoglycans in cancer (NES = 1.819, FDR < 0.001), calcium signaling pathway (NES = 1.571, FDR = 0.006), and PI3K-Akt signaling pathway (NES = 1.529, FDR = 0.003) (Fig. 7e, f; Supplementary Table S11). Conversely, the valine, leucine and isoleucine degradation pathway (NES = −2.249, FDR < 0.001), butanoate metabolism (NES = -1.914, FDR = 0.019), nitrogen metabolism (NES = −2.080, FDR = 0.001), Th1/Th2 cell differentiation (NES = −1.840, FDR = 0.002), and T cell receptor signaling pathway (NES = −1.558, FDR = 0.019) were significantly downregulated in the high-risk group (Fig. 7g, h; Supplementary Table S11). Importantly, butanoate metabolism and nitrogen metabolism, initially identified through metabolomics analysis, were also significantly enriched at the transcriptomic level, providing convergent evidence for their potential association with prognostic stratification in CRC.

Prognostic significance of butanoate metabolism and nitrogen metabolism in CRC

Based on the above findings, butanoate metabolism and nitrogen metabolism emerged as recurrent metabolic pathways differing between risk groups in both metabolomic and transcriptomic analyses. To further validate their prognostic relevance, we performed K-M analyses using the gene set variation analysis (GSVA) scores of these pathways in The Cancer Genome Atlas colorectal cancer (TCGA-CRC) cohort (n = 417). Patients were divided into high- and low-score groups according to the median GSVA score for each pathway.

The results showed that patients with lower butanoate metabolism scores had significantly poorer OS (p = 0.007; HR = 0.52; 95% CI: 0.33–0.84) (Fig. 8a). Similarly, a lower GSVA score for the nitrogen metabolism pathway was also associated with worse OS (p < 0.018; HR = 0.56; 95% CI: 0.35–0.90) (Fig. 8b). These findings suggested that reduced activity in these two metabolic pathways may contribute to unfavorable prognosis in CRC.

**Fig. 8: The relationship between key pathways and OS in CRC patients.**

Discussion

In this multicenter study, we developed a prognostic DLRM based on venous-phase CT images and derived a corresponding DLRM-RS to stratify CRC patients into low- and high-risk groups. The DLRM-RS demonstrated robust performance in predicting OS and maintained its discriminatory ability across diverse clinical subgroups, supporting its generalizability. To enhance clinical applicability, we integrated DLRM-RS with clinical parameters (CEA level and N stage) to construct a nomogram that improved prognostic accuracy. Moreover, integrative multi-omics analysis revealed that high-risk tumors were enriched for ECM-related pathways associated with tumor progression, whereas low-risk tumors exhibited immune-related signatures, including higher CD8⁺ T-cell infiltration. Notably, butanoate metabolism and nitrogen metabolism were consistently enriched in the low-risk group, providing exploratory insights into potential biological patterns associated with the prognostic value of the DLRM-RS.

Given the substantial heterogeneity of CRC outcomes, developing reliable prognostic tools remains a research priority. In recent years, radiomics and DL technologies have shown strong potential in tumor prognostication by extracting high-dimensional imaging features that noninvasively characterize tumor heterogeneity and biological behavior^5,14. In addition, integrating these features with machine learning further improves predictive performance¹⁵. However, many previous studies relied on limited or subjectively selected algorithms, increasing the risk of overfitting and yielding suboptimal models. In our study, we applied a comprehensive computational framework incorporating ten machine learning algorithms and 117 combinations to build a stable, reproducible prognostic model. This systematic approach allowed us to explore algorithmic interactions, reduce feature dimensionality, and enhance model generalizability. Among all combinations, a combined approach using Lasso and GBM was ultimately selected due to the highest mean C-index (0.768) across cohorts. Compared to traditional survival analysis methods, Lasso can effectively perform variable selection and regularization on high-dimensional datasets, reduce model complexity, and mitigate multicollinearity¹⁶. GBM integrates the strengths of the Boosting algorithm with the generalized regression approach, incrementally enhancing the predictive capability of the model¹⁷. Notably, the final model included nine radiomics features and six DL features, which showed low inter-correlation, indicating they capture distinct but synergistic aspects of tumor biology. The DLRM-RS derived from the optimal model effectively stratified patients, with those in the high-risk group showing significantly worse OS, which were validated in both internal and external cohorts. To improve the predictive accuracy and facilitate individualized risk assessment, we integrated clinical variables into the prognostic framework. Multivariate Cox regression identified CEA and N stage as independent predictors of OS. As a key serum biomarker for gastrointestinal malignancies, elevated preoperative CEA levels have been associated with more aggressive tumor behavior and a higher risk of postoperative recurrence⁵⁵. Likewise, previous studies have demonstrated that advanced N stage has consistently been associated with poorer prognosis^18,19,20. Combining DLRM-RS, CEA, and N stage into a nomogram yielded superior predictive performance, offering a clinical application tool for personalized risk stratification of CRC.

Despite their excellent predictive performance, data-driven DL and radiomics studies have the nature of the post generation of scientific hypotheses, making it challenging to directly infer the biological significance of their results. However, exploring the potential biological relevance of these imaging features is essential for understanding the disease mechanism, identifying potential diagnostic and therapeutic targets, and facilitating clinical translation. To enhance the interpretability of the DLRM-RS, we conducted a series of biological analyses. Metabolomic profiling revealed significant alterations in several metabolic pathways between the high- and low-risk CRC group, including butanoate metabolism, nitrogen metabolism, and alanine, aspartate, and glutamate metabolism. Each of these metabolic pathways is related to tumor progression and immune regulation^21,22,23. These findings suggest that metabolic reprogramming may be associated with the prognostic stratification defined by the DLRM-RS, providing an exploratory biological context. Furthermore, the transcriptomic analysis using GSEA demonstrated that the tumors in the high-risk group were enriched in ECM-related pathways, such as ECM structural constituent, ECM constituent conferring tensile strength, and collagen fibril organization. The ECM, as a key component of the tumor microenvironment together with various immune cells, plays a critical role in regulating cellular proliferation, migration, diffusion, and differentiation^24,25,26. These findings were consistent with previous studies showing that ECM-related pathways were closely associated with tumor progression^27,28. In contrast, the tumors in the low-risk group showed significant enrichment in immune-related pathways, including antigen processing and presentation, immunological synapse formation, and respiratory burst, suggesting a more active anti-tumor immune microenvironment. Consistently, immune cell infiltration analysis revealed that low-risk patients exhibited higher levels of CD8⁺ T cell infiltration. CD8⁺ T cells are central to anti-tumor immunity by releasing cytotoxic molecules and cytokines^29,30, and their high density has been confirmed to correlate with better survival outcomes^31,32. Collectively, these exploratory results suggest that the DLRM-RS may be associated with distinct metabolic and immune-related patterns in CRC, and point toward potential immunotherapy targets for future interventions.

Importantly, integrative metabolomic and transcriptomic analyses consistently identified two altered pathways—butanoate metabolism and nitrogen metabolism. Differential pathway analysis revealed that both pathways were significantly enriched in the low-risk group, suggesting their potential protective roles in CRC prognosis. Further corroborating their clinical relevance, GSVA analysis in an independent TCGA-CRC cohort demonstrated that lower activity of these pathways was significantly associated with worse OS, highlighting their potential relevance to prognosis and their candidacy for further investigation as biomarkers or therapeutic targets. Butanoate metabolism, closely linked to the production of short-chain fatty acids by commensal gut microbiota, was known for its anti-inflammatory, anti-proliferative, and pro-apoptotic effects in CRC³³. It functions as a histone deacetylase inhibitor, suppressing CD8⁺ T cell exhaustion within the tumor immune microenvironment, inhibiting JAK-STAT signaling, and enhancing antitumor immunity^23,34,35. The enrichment of this pathway in the low-risk group suggests a more favorable tumor microenvironment, possibly associated with a healthier microbiota composition and enhanced immunosurveillance. Interestingly, nitrogen metabolism also showed higher activity in patients with better prognosis. Although traditionally it was believed to be linked to tumor proliferation^36,37,38, recent evidence suggests a context-dependent role. In the tumor microenvironment, enhanced nitrogen metabolism may inhibit tumor progression by supporting the function of immune cells, maintaining REDOX balance and metabolic homeostasis^39,40, and promoting the formation of an immunologically active microenvironment. Therefore, the upregulation of nitrogen metabolism in the low-risk group may reflect a metabolic state conducive to immune surveillance, suggesting a potential link between metabolism-immune interactions and CRC prognosis.

The findings of this study carry significant clinical implications, particularly in informing personalized treatment strategies for CRC patients. Notably, the prognostic value of the DLRM-RS was consistently preserved across a wide range of clinically relevant subgroups, including age, sex, tumor location, CEA/CA199 levels, T stage, N stage, differentiation grade, and postoperative chemotherapy status. Importantly, within each of these subgroups, patients classified as high-risk by the DLRM-RS consistently exhibited significantly worse OS than their low-risk counterparts. This indicates that the DLRM-RS provides prognostic information beyond conventional clinicopathological factors, rather than merely recapitulating established predictors such as tumor stage. From a clinical perspective, this additional stratification capability suggests that high-risk patients identified by the DLRM-RS may benefit from intensified treatment strategies or closer postoperative surveillance, even within the same clinical stage or treatment category, whereas low-risk patients may be considered for de-escalated interventions to avoid overtreatment. Integrating radiomics and DL into routine clinical workflow may therefore enhance prognostic precision, support risk-adapted decision-making, and facilitate more individualized patient management. Furthermore, the identification of prognosis-related metabolic pathways, such as butanoate metabolism and nitrogen metabolism, provides biologically plausible contexts or signals that may help interpret risk stratification and inform future hypothesis-driven studies.

While this study provided significant insights, several limitations should be acknowledged. Firstly, as a retrospective multicenter study, potential biases and unmeasured confounders remain. In particular, the transcriptomic and metabolomic analyses were limited by relatively small sample sizes, particularly in the high-risk subgroup of the transcriptomic cohort. Such underpowered datasets may reduce the statistical robustness of differential expression and pathway enrichment analyses and increase the risk of overfitting. Therefore, the biological interpretations derived from these exploratory multi-omics analyses should be considered hypothesis-generating rather than definitive, and validation in larger, well-designed prospective multi-omics cohorts is warranted. Secondly, manual ROI segmentation is heavily influenced by subjective experience; future work needs to adopt an unsupervised segmentation method. Thirdly, although the patch-based DL strategy employed in this study captures inter-patch heterogeneity, it may not fully model fine-grained intra-patch heterogeneity. Recent advances, such as multiple-instance learning frameworks, enable instance-level feature learning within patches, thereby preserving detailed morphological and spatial information that may be critical for prognostic modeling. Moreover, graph-based transformer approaches can further model complex contextual relationships among instances or patches, facilitating multi-scale representation of tumor heterogeneity. These advanced strategies may address the limitations of the current approach and will be explored in future work⁴¹. Lastly, while this study integrated radiomics, DL, metabolomics, and transcriptomics, other critical dimensions like methylation, proteomics, and pathomics were not included. Future multi-omics integrative analyses are warranted to deepen understanding of tumor biology and improve precision medicine.

In conclusion, this study proposed and validated a DLRM model together with its derived DLRM-RS for prognosis prediction in CRC patients based on preoperative CT. Combining CEA and N stage with DLRM-RS will provide a unique opportunity for new management approaches. In addition, we provided new insights into the molecular mechanisms of CRC occurrence and development from the perspectives of metabolomics and transcriptomics. By integrating molecular insights with clinical applications, our findings paved the way for improving prognosis prediction and the development of personalized treatment strategies.

Methods

Patients

We retrospectively collected data from CRC patients who underwent surgical resection between January 2016 and January 2020 across four centers. A total of 1183 patients with preoperative CT images, clinical, and follow-up data were included for the development and validation of a risk stratification model. The inclusion criteria were as follows: (a) CRC confirmed by postoperative pathology; (b) radical surgery for CRC performed; (c) contrast-enhanced CT examination within 2 weeks before surgery with good image quality; and (d) a follow-up period of more than 5 years. The exclusion criteria included: (a) receipt of radiotherapy, chemotherapy, or other treatments before surgery; (b) presence of other malignant neoplasms; (c) incomplete clinical or pathological data; and (d) distant metastasis before surgery. All these patients were divided into four cohorts: (1) training cohort (n = 435) and (2) an internal validation cohort (n = 187), both from Center 1 using a 7:3 split between January 2016 and December 2019; (3) external validation cohort 1 (n = 313) from Centers 2 and 3 (January 2018 -January 2020); and (4) external validation cohort 2 (n = 248) from Center 4 (January 2018 - January 2020). The two external validation cohorts were intentionally analyzed separately rather than merged, as they originated from distinct geographic regions and clinical settings. This design allows a more rigorous assessment of the model’s generalizability across heterogeneous real-world environments.

In addition, 52 patients from Center 3 with paired CT images, follow-up data, and ¹H NMR-based metabolomic data (tumor and adjacent normal tissue) were included from a previous CRC diagnostic study⁴⁴ (used in metabolomic analysis). While the previous study primarily focused on diagnostic applications, our current analysis aimed to investigate metabolic alterations underlying distinct prognostic outcomes. Furthermore, CT images, follow-up, and RNA sequencing data from 19 CRC patients in the TCIA database were used for mechanistic exploration of prognostic differences (used in transcriptomic analysis), and an additional 417 patients from The Cancer Genome Atlas (TCGA) with RNA sequencing and follow-up data were included for pathway validation (used in pathway validation). To ensure the clarity of the data sources and cohort definitions, a summary of patient sources, data types, and analytic purposes was presented in Supplementary Table S12, and the patient enrollment process was illustrated in Supplementary Fig. S9.

Clinical data collection and follow-up

Clinical data recorded from the medical records of participating institutions included age, sex, tumor differentiation, tumor location, CEA, CA199, T stage, N stage, and postoperative chemotherapy.

The clinical endpoint of this study was OS, defined as the time from surgical resection to death or the last follow-up. The final follow-up was conducted in January 2025. Patients were monitored every 6–12 months in the first 2 years and once a year thereafter. All patients were followed up for at least 5 years.

CT image acquisition and image preprocessing process

CT images were obtained at four centers using five CT scanners. All patients underwent contrast-enhanced abdominal CT scans. Detailed acquisition parameters for each scanner were provided in Supplementary Table S13. To reduce noise-related intensity variability caused by different scanners and hospitals in CT images and improve the robustness of feature extraction, the voxel intensities were discretized using a fixed bin width of 25 Hounsfield units (HU). The HU is a standardized quantitative scale in CT imaging that reflects tissue attenuation relative to water and air. Fixed bin-width discretization with a bin width of 25 HU is a commonly adopted preprocessing step in CT-based image analysis, recommended by the Image Biomarker Standardisation Initiative (IBSI), to improve the robustness of image-derived feature extraction by reducing high-frequency intensity fluctuations and minor scanner-dependent variations^45,46,47. Subsequently, all images were resampled to an isotropic voxel spacing of 1 mm × 1 mm × 1 mm (x, y, z) using linear interpolation to standardize spatial resolution across scanners and institutions.

Tumor segmentation and feature extraction

Tumor segmentation was performed on venous-phase CT images. Two radiologists (Z.H.L. and Y.Q., with 5 and 10 years of experience, respectively) manually segmented the tumors using 3D Slicer software (version 5.2.2; https://www.slicer.org/). Mesentery air and pericolonic fat were excluded from the volume of interest. Discrepancies were reviewed and resolved by a third radiologist (Y.L), who has over 20 years of experience. All radiologists were blinded to patients’ survival outcomes.

Subsequently, radiomic features were extracted from the volume of interest (VOI) in venous-phase CT images using the Pyradiomics Python package (version 3.8)⁴⁸, including first-order, shape-based, gray level co-occurrence matrix (GLCM), gray-level size zone matrix (GLSZM), gray level run-length matrix (GLRLM), gray level dependence matrix (GLDM), and neighboring gray tone difference matrix (NGTDM). The specific classification of radiomics features is shown in Supplementary Table S14. In addition, DL-based feature extraction was performed using a three-dimensional convolutional neural network (3D CNN). Considering the limited availability of pretrained models for medical 3D imaging, the Med3D dataset—a large-scale, multi-modal, multi-organ medical imaging resource—was employed for network pretraining⁴⁹. A 3D ResNet18 architecture pretrained on Med3D was adopted as the backbone network, selected for its favorable balance between representational capacity and robustness in small- to moderate-sized medical imaging datasets. Manually delineated tumor regions of interest (ROIs) were used to localize the lesion in three-dimensional space. For each patient, a cubic bounding box fully enclosing the ROI was generated. The resulting 3D volumes were then resampled using linear interpolation to a fixed size of 96 × 96 × 96 voxels at an isotropic resolution of 1 × 1 × 1 mm³, ensuring uniform input dimensions across patients while preserving essential peritumoral contextual information. Prior to network input, voxel intensities within each 3D patch were linearly scaled to a fixed range to improve numerical stability during training. Data augmentation strategies, including random affine transformations, random flips, and random 90° rotations along different axes, were applied during training to reduce overfitting and enhance model generalizability. The 3D ResNet18 backbone was initialized with Med3D-pretrained weights and subsequently fine-tuned using patient outcomes from the training cohort as supervision. Model training was performed for 200 epochs using the Adam optimizer with an initial learning rate of 0.0005 and a binary cross-entropy loss with logits, and a cosine annealing learning rate scheduler. Given that the primary objective of the DL model was feature representation learning rather than direct survival time modeling, OS was formulated as a binary outcome during network training to provide supervision for discriminative feature extraction. Specifically, patients were labeled as deceased or alive at 5 years after surgery, and this binary endpoint was used solely for network supervision rather than for final survival prediction. After fine-tuning, DL features were extracted from the penultimate layer of the 3D ResNet18 network for each patient, yielding a 512-dimensional feature vector^5,50. A total of 1437 radiomics features and 512 DL features were extracted from each patient. Given the standardized image preprocessing in accordance with IBSI recommendations, ICC-based feature robustness assessment, redundancy reduction, and rigorous independent multi-center external validation, no additional feature-level harmonization methods were applied in this study.

To ensure the reproducibility and reliability of radiomic and DL features, 30 CT images were randomly selected for VOI segmentation, and inter-/intra-observer consistency of feature extraction was assessed by intra-class correlation coefficient (ICC) analysis. First, radiologist 1 (Z.H.L) and radiologist 2 (Y.Q) independently delineated the VOI and calculated the interobserver ICC. After 2 months, radiologist 1 re-segmented the VOI to calculate the intra-observer ICC. An ICC value greater than 0.80 was considered to indicate good agreement.

Feature selection and development of the DLRM-RS

The process of feature selection was confined to the training cohort. Before feature selection, z-score normalization was used to standardize the radiomics and DL features. First, features with an intra- and inter-observer ICCs value of > 0.80 were retained to ensure reproducibility. Second, Spearman correlation analysis was performed to reduce redundancy and multicollinearity, using a threshold of |r | > 0.8; when collinearity was detected, only one of the correlated features was retained for further analysis. Third, univariate Cox regression was applied as an initial feature screening step to reduce dimensionality. This analysis was not intended for final statistical inference, and therefore, no formal multiple testing correction was applied at this stage. Subsequent model development incorporated ML algorithms with intrinsic regularization or feature selection mechanisms to further control overfitting and identify the most informative features.

To develop a robust DLRM for OS prediction, we utilized the “Mime” R package (version 1.0, https://github.com/l-magnificence/Mime), which integrates ten machine learning algorithms and 117 algorithm combinations⁵¹. The integrative algorithms included random forest (RSF), elastic network (Enet), stepwise Cox (StepCox), least absolute shrinkage and selection operator (Lasso), partial least squares regression for Cox (plsRcox), supervised principal components (SuperPC), generalized boosted regression models (GBM), survival support vector machine (survival-SVM), ridge regression (Ridge), and Cox model with boosting algorithm (CoxBoost). The precise parameters of the ten machine learning algorithms can be found in the original code and in Supplementary Table S15. Each model was trained using 10-fold cross-validation in the training cohort and evaluated in the internal and two external validation cohorts. To identify a robust and generalizable final model, we summarized predictive performance using the C-index across the training cohort, the internal validation cohort, and the two external validation cohorts. The model achieving the highest mean C-index across these four cohorts was selected as the final DLRM, aiming to balance predictive accuracy and stability across heterogeneous populations. Each patient’s computed risk score from this model was termed the DLRM risk score.

To facilitate comparison with conventional prognostic modeling strategies, two baseline models were also constructed. Specifically, radiomics-only and DL-only models were developed following the same feature selection procedures and were subsequently built using a conventional multivariable Cox regression framework. These baseline models were used to benchmark the incremental prognostic value of the proposed integrative DLRM.

In addition, the optimal cutoff value for the DLRM risk score in the training cohort was determined using the ‘surv_cutpoint’ function from the ‘survminer’ package (version 0.5.0). Based on this threshold, a DLRM-RS was established, categorizing patients into low- and high-risk groups.

Evaluation of the DLRM-RS

A time-dependent ROC curve was used to evaluate the prognostic performance of the DLRM-RS. K-M analysis and the log-rank test were conducted to assess survival differences between the two risk groups. To minimize potential confounding and ensure the robustness of the DLRM-RS, subgroup analyses were performed based on sex, age (≥/ <65 years), tumor location (right colon/rectum/left colon), T stage (T1-2/T3-4), N stage (N0/N+), CEA level, CA199 level, and postoperative chemotherapy.

Development and evaluation of the nomogram model

In the training cohort, univariate and multivariate Cox regression analyses were performed to identify independent prognostic factors for OS among clinical variables and the DLRM-RS. A clinical model was developed based on the significant clinical variables from the multivariate analysis. Subsequently, a combined prognostic nomogram was constructed by incorporating the DLRM-RS with these clinical variables using multivariate Cox regression.

The overall discriminative performance of the clinical model, DLRM-RS model, and the nomogram was evaluated using the C-index. Predictive accuracy at 3 and 5 years was assessed using time-dependent ROC curves, with the AUC calculated for each time point. The C-index values and time-dependent AUCs of the models were compared using a nonparametric bootstrap method with 1000 resamples to estimate 95% confidence intervals and test for statistically significant differences in discrimination. Calibration curves were plotted to assess the agreement between predicted and observed survival probabilities at 3 and 5 years. Furthermore, DCA was performed to evaluate the clinical utility of each model across a range of threshold probabilities.

Metabolomics sample preparation, metabolite identification, and analysis

We collected metabolomics data from 52 CRC patients in Center 3, each of whom provided paired tumors and adjacent normal tissue samples (collected approximately 5 cm from the tumor margin). Tissue processing and ¹H NMR spectroscopy were performed as previously described⁴⁴. Briefly, dried tissue extracts were reconstituted in 550 μL phosphate-buffered saline (PBS, 0.1 M, pH 7.4, in D₂O) together with 50 μL 3-(trimethylsilyl) propionic-2,2,3,3-d₄ acid sodium salt (TSP) in D₂O stock solution. After centrifugation at 10,000 × g for 5 min, 500 μL of the supernatant was transferred into a 5 mm NMR tube for ¹H NMR analysis on a Bruker Avance 600 MHz spectrometer.

The CHENOMX NMR Suite software (Chenomx Inc., Edmonton, Canada) 600 mHz, version 11 was used to identify and quantify the tissue metabolites detected by ¹H NMR by matching the compounds’ peak with its library according to the chemical shift. 52 metabolites’ concentration was quantified, using TSP as the reference compound (shift at 0 ppm). The metabolites’ concentrations were reported in micromolar (μM).

Median normalization was applied to minimize systematic variations across samples, and log₂ transformation was subsequently used to reduce skewness and approximate normality of metabolite concentration distributions. Pattern recognition analysis, including PCA, metabolite heatmap, and OPLS-DA, was carried out to identify key metabolic features. To confirm the results of the established models and avoid overfitting, we performed 999 iterations of the permutation test and CV-ANOVA, and the quality of the OPLS-DA model was assessed using R² and Q². The differentially abundant metabolites were screened (VIP > 1.0 and p < 0.05). Metabolic pathway analysis was conducted via MetaboAnalyst 5.0. Importance and relevance of pathways are indicated by the impact factor (X axis) and -log (p value) (Y axis).

Transcriptomics analysis

To explore the transcriptomic alterations underlying the prognostic stratification defined by the DLRM-RS, we collected CT scans and RNA sequencing TPM data of 19 patients from the TCIA to reveal the molecular mechanisms associated with different prognosis groups. Patients were grouped into high- and low-risk groups defined by the DLRM-RS. To explore and visualize the biological characteristics of different risk groups, GSEA based on GO was performed by using the GSEA software (version 4.0.2, Broad Institute, San Diego, CA, USA)⁵². 1000 permutations were used. Permutation was performed based on gene sets. A weighted enrichment statistic was applied, using the signal-to-noise ratio as a metric for ranking genes. No additional normalization was applied within GSEA. As in the default mode, gene sets smaller than 15 and bigger than 200 genes were excluded from analysis. For visualization of GSEA results, the EnrichmentMap (version 3.5.0)⁵³ application within the Cytoscape environment (version 3.10.3)⁵⁴ was used. Gene sets were considered if they were identified at an FDR < 0.05. For gene-set-similarity filtering, data set edges were set automatically. A combined Jaccard and Overlap metric was used, applying a cutoff of 0.375. For gene set clustering, AutoAnnotate (version 1.5.2) ⁵³ was used by automates running clusterMaker2 to generate clusters. The gene-set-similarity coefficient was utilized for edge weighting.

In addition, GSEA was also employed to investigate functional enrichment differences between high- and low-risk groups defined by the DLRM-RS, based on the GO biological process and KEGG sets. Pathways meeting the threshold of |NES | > 1 and FDR < 0.05 were considered significantly enriched pathways. The CIBERSORT algorithms were employed to evaluate differences in immune cell infiltration between the high- and low-risk groups⁴.

To validate the prognostic relevance of the identified metabolic pathways, we obtained RNA sequencing data (TPM values) and clinical follow-up information for 417 patients from the TCGA cohort. GSVA was performed based on KEGG pathway gene sets to calculate enrichment scores for each sample. For each pathway, patients were stratified into high- and low-score groups using the median GSVA score as the cutoff. Kaplan-Meier survival analysis was then conducted to evaluate differences in OS between the groups.

Statistical analysis

All statistical analyses were conducted using R software (version 4.4.2), Python (version 3.8), R-based MetaboAnalyst 5.0 platform (https://www.metaboanalyst.ca), Cytoscape software (version 3.10.3), GSEA software (version 4.0.2, Broad Institute, San Diego, CA, USA), and SIMCA (version 14.1). Continuous variables were expressed as mean ± standard deviation for normally distributed data or median with interquartile range (IQR) for non-normally distributed data. Categorical variables were presented as frequencies and percentages. Group comparisons for continuous variables were performed using unpaired t-tests or Mann-Whitney U tests as appropriate. Categorical variables were compared using the Chi-square test or Fisher’s exact test. All statistical tests were two-sided, with a p-value < 0.05 considered statistically significant.

Ethics statement

This multicenter retrospective study was approved by the ethics committees of the Shaoxing People’s Hospital (2020-K-Y-143-01), the First Affiliated Hospital of Ningbo University (2025-189A), Cancer Hospital of Shantou University Medical College (ChiCTR2300073613), and the Second Affiliated Hospital of Shantou University Medical College (2020-17). This research was conducted in accordance with the guidelines of the Helsinki Declaration and the principles of Good Clinical Practice. Given the retrospective nature of the study and the use of anonymized data, the requirement for informed consent was waived. The study was conducted and reported in accordance with the CheckList for EvaluAtion of Radiomics research (CLEAR)⁴² and the transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD)⁴³ guidelines to ensure comprehensive and transparent reporting.

Data availability

All data of TCIA and TCGA could be downloaded from The Cancer Imaging Archive (https://www.cancerimagingarchive.net/browse-collections/) and The Cancer Genome Atlas (https://portal.gdc.cancer.gov/). Details of the software, packages, and versions used in the computational workflow are provided in Supplementary Table S16. Other data are not publicly available due to ethical considerations regarding the inclusion of facial information of patients. However, interested parties may be able to obtain access to the data through a reasonable request from the corresponding author.

Code availability

The source codes used in this study are available at https://github.com/Radiology123/CRC-CT-prognosis.git.

References

Bray, F. et al. Global cancer statistics 2022: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J. Clin. 74, 229–263 (2024).
PubMed Google Scholar
Benson, A. B. et al. Colon Cancer, Version 2.2021, NCCN Clinical Practice Guidelines in Oncology. J. Natl. Compr. Cancer Netw. 19, 329–359 (2021).
Article Google Scholar
Benson, A. B. et al. Rectal Cancer, Version 2.2022, NCCN Clinical Practice Guidelines in Oncology. J. Natl. Compr. Cancer Netw. 20, 1139–1167 (2022).
Article CAS Google Scholar
Mazurowski, M. A. Radiogenomics: what it is and why it is important. J. Am. Coll. Radio. 12, 862–866 (2015).
Article Google Scholar
Li, Z. et al. Comparison of clinical, radiomics, deep learning, and fusion models for predicting early recurrence in locally advanced rectal cancer based on multiparametric MRI: a multicenter study. Eur. J. Radio. 189, 112173 (2025).
Article Google Scholar
Lan, T. et al. MRI-based deep learning and radiomics for prediction of occult cervical lymph node metastasis and prognosis in early-stage oral and oropharyngeal squamous cell carcinoma: a diagnostic study. Int J. Surg. 110, 4648–4659 (2024).
Article PubMed PubMed Central Google Scholar
Zhang, M. et al. An interpretable CT-based deep learning model for predicting overall survival in patients with bladder cancer: a multicenter study. NPJ Precis. Oncol. 9, 288 (2025).
Article PubMed PubMed Central Google Scholar
Zhu, J. et al. Sub-regional radiomics combining multichannel 2-dimensional or 3-dimensional deep learning for predicting neoadjuvant chemo-immunotherapy response in esophageal squamous cell carcinoma: a multicenter study. NPJ Precis. Oncol. 9, 248 (2025).
Article CAS PubMed PubMed Central Google Scholar
Zhang, Y. et al. Unraveling the power of radiomics: prediction and exploration of lymph node metastasis in stage T1/2 esophageal squamous cell carcinoma. NPJ Precis. Oncol. 9, 176 (2025).
Article CAS PubMed PubMed Central Google Scholar
Zhang, Z. & Sejdić, E. Radiological images and machine learning: Trends, perspectives, and prospects. Comput Biol. Med. 108, 354–370 (2019).
Article PubMed PubMed Central Google Scholar
Lin, C. Y. et al. Combined model integrating deep learning, radiomics, and clinical data to classify lung nodules at chest CT. Radio. Med. 129, 56–69 (2024).
Article Google Scholar
Zhang, D., Li, Y. N., Li, C. L. & Guo, W. L. Multimodal radiomics and deep learning models for predicting early femoral head deformity in LCPD. Eur. J. Radio. 181, 111793 (2024).
Article Google Scholar
Chen, B., Khodadoust, M. S., Liu, C. L., Newman, A. M. & Alizadeh, A. A. Profiling tumor infiltrating immune cells with CIBERSORT. Methods Mol. Biol. 1711, 243–259 (2018).
Article CAS PubMed PubMed Central Google Scholar
Crombé, A. et al. Integration of pre-treatment computational radiomics, deep radiomics, and transcriptomics enhances soft-tissue sarcoma patient prognosis. NPJ Precis. Oncol. 8, 129 (2024).
Article PubMed PubMed Central Google Scholar
Avanzo, M. et al. Machine and deep learning methods for radiomics. Med. Phys. 47, e185–e202 (2020).
Article PubMed PubMed Central Google Scholar
Poulos, R. C. et al. Strategies to enable large-scale proteomics for reproducible research. Nat. Commun. 11, 3793 (2020).
Article CAS PubMed PubMed Central Google Scholar
Chen H. et al. MALMPS: a machine learning-based metabolic gene prognostic signature for stratifying clinical outcomes and molecular heterogeneity in Stage II/III Colorectal Cancer. Adv Sci (Weinh), e01333 (2025).
Ning, F. L. et al. Identification and initial validation of maximal tumor area as a novel prognostic factor for overall and disease-free survival in patients with resectable colon cancer: a retrospective study. Int J. Surg. 109, 3407–3416 (2023).
Article PubMed PubMed Central Google Scholar
Sinicrope, F. A. et al. Use of deep learning to evaluate tumor microenvironmental features for prediction of colon cancer recurrence. Cancer Res. Commun. 4, 1344–1350 (2024).
Article CAS PubMed PubMed Central Google Scholar
Kleppe, A. et al. A clinical decision support system optimising adjuvant chemotherapy for colorectal cancers by integrating deep learning and pathological staging markers: a development and validation study. Lancet Oncol. 23, 1221–1232 (2022).
Article CAS PubMed Google Scholar
Zhao, Y. et al. NMR and MS reveal characteristic metabolome atlas and optimize esophageal squamous cell carcinoma early detection. Nat. Commun. 15, 2463 (2024).
Article CAS PubMed PubMed Central Google Scholar
De Martino, M., Rathmell, J. C., Galluzzi, L. & Vanpouille-Box, C. Cancer cell metabolism and antitumour immunity. Nat. Rev. Immunol. 24, 654–669 (2024).
Article PubMed PubMed Central Google Scholar
Shi, Z. et al. Butyrate-producing Faecalibacterium prausnitzii suppresses natural killer/T-cell lymphoma by dampening the JAK-STAT pathway. Gut. 74, 557–570 (2025).
Article CAS PubMed PubMed Central Google Scholar
Chaudhuri, O., Cooper-White, J., Janmey, P. A., Mooney, D. J. & Shenoy, V. B. Effects of extracellular matrix viscoelasticity on cellular behaviour. Nature 584, 535–546 (2020).
Article CAS PubMed PubMed Central Google Scholar
Vasudevan, J., Jiang, K., Fernandez, J. G. & Lim, C. T. Extracellular matrix mechanobiology in cancer cell migration. Acta Biomater. 163, 351–364 (2023).
Article PubMed Google Scholar
Wang, K. et al. Extracellular matrix stiffness regulates colorectal cancer progression via HSF4. J. Exp. Clin. Cancer Res 44, 30 (2025).
Article PubMed PubMed Central Google Scholar
Xie, Y. et al. Noninvasive prognostic classification of ITH in HCC with multi-omics insights and therapeutic implications. Sci. Adv. 11, eads8323 (2025).
Article CAS PubMed PubMed Central Google Scholar
Rigillo, G. et al. The NF-Y splicing signature controls hybrid EMT and ECM-related pathways to promote aggressiveness of colon cancer. Cancer Lett. 567, 216262 (2023).
Article CAS PubMed Google Scholar
Peng, J. et al. Three-dimensional multimodal imaging for predicting early recurrence of hepatocellular carcinoma after surgical resection. J. Adv. Res. (2025).
Lian, X. et al. Immunometabolic rewiring in tumorigenesis and anti-tumor immunotherapy. Mol. Cancer 21, 27 (2022).
Article PubMed PubMed Central Google Scholar
Williams, C. J. M. et al. Evaluation of CD3 and CD8 T-Cell Immunohistochemistry for Prognostication and Prediction of Benefit From Adjuvant Chemotherapy in Early-Stage Colorectal Cancer Within the QUASAR Trial. J. Clin. Oncol. 42, 3430–3442 (2024).
Article CAS PubMed PubMed Central Google Scholar
Shang, S. et al. TRIB3 reduces CD8(+) T cell infiltration and induces immune evasion by repressing the STAT1-CXCL10 axis in colorectal cancer. Sci. Transl. Med. 14, eabf0992 (2022).
Article CAS PubMed Google Scholar
Zhao, Y. & Jiang, Q. Roles of the polyphenol-gut microbiota interaction in alleviating colitis and preventing colitis-associated colorectal cancer. Adv. Nutr. 12, 546–565 (2021).
Article CAS PubMed PubMed Central Google Scholar
Pedersen, S. S., Ingerslev, L. R., Olsen, M., Prause, M. & Billestrup, N. Butyrate functions as a histone deacetylase inhibitor to protect pancreatic beta cells from IL-1β-induced dysfunction. FEBS J. 291, 566–583 (2024).
Article CAS PubMed Google Scholar
Wang, X. et al. Fusobacterium nucleatum facilitates anti-PD-1 therapy in microsatellite stable colorectal cancer. Cancer Cell 42, 1729–1746.e1728 (2024).
Article CAS PubMed Google Scholar
Tong, M. et al. An analysis of prognostic risk and immunotherapy response of glioblastoma patients based on single-cell landscape and nitrogen metabolism. Neurobiol. Dis. 211, 106935 (2025).
Article CAS PubMed Google Scholar
Jin, J., Byun, J. K., Choi, Y. K. & Park, K. G. Targeting glutamine metabolism as a therapeutic strategy for cancer. Exp. Mol. Med. 55, 706–715 (2023).
Article CAS PubMed PubMed Central Google Scholar
Nan, D. et al. Glutamine and cancer: metabolism, immune microenvironment, and therapeutic targets. Cell Commun. Signal 23, 45 (2025).
Article PubMed PubMed Central Google Scholar
Kurmi, K. & Haigis, M. C. Nitrogen Metabolism in Cancer and Immunity. Trends Cell Biol. 30, 408–424 (2020).
Article CAS PubMed PubMed Central Google Scholar
Sies, H., Mailloux, R. J. & Jakob, U. Fundamentals of redox regulation in biology. Nat. Rev. Mol. Cell Biol. 25, 701–719 (2024).
Article CAS PubMed PubMed Central Google Scholar
Nakhli, R. et al. Sparse Multi-Modal Graph Transformer with Shared-Context Processing for Representation Learning of Giga-pixel Images. In: 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)) (2023).
Kocak, B. et al. CheckList for EvaluAtion of Radiomics Research (CLEAR): a step-by-step reporting guideline for authors and reviewers endorsed by ESR and EuSoMII. Insights Imaging 14, 75 (2023).
Article PubMed PubMed Central Google Scholar
Collins, G. S., Reitsma, J. B., Altman, D. G. & Moons, K. G. Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): the TRIPOD statement. Bmj. 350, g7594 (2015).
Article PubMed Google Scholar
Cai, R. et al. NMR-based metabolomics combined with metabolic pathway analysis reveals metabolic heterogeneity of colorectal cancer tissue at different anatomical locations and stages. Int. J. Cancer, (2024).
Zwanenburg, A. et al. The Image Biomarker Standardization Initiative: Standardized Quantitative Radiomics for High-Throughput Image-based Phenotyping. Radiology 295, 328–338 (2020).
Article PubMed PubMed Central Google Scholar
Chen, X. et al. Integrated analysis of radiomics, RNA, and clinicopathologic phenotype reveals biological basis of prognostic risk stratification in colorectal cancer. Sci. Bull. 69, 3666–3671 (2024).
Article CAS Google Scholar
Xie, C. et al. Multimodal data integration for biologically-relevant artificial intelligence to guide adjuvant chemotherapy in stage II colorectal cancer. EBioMedicine 117, 105789 (2025).
Article CAS PubMed PubMed Central Google Scholar
van Griethuysen, J. J. M. et al. Computational radiomics system to decode the radiographic phenotype. Cancer Res. 77, e104–e107 (2017).
Article PubMed PubMed Central Google Scholar
Chen, S., Ma, K., Zheng, Y. Med3d: Transfer learning for 3d medical image analysis. arXiv preprint arXiv:190400625, (2019).
Guo, L. et al. Early warning of hepatocellular carcinoma in cirrhotic patients by three-phase CT-based deep learning radiomics model: a retrospective, multicentre, cohort study. EClinicalMedicine 74, 102718 (2024).
Article PubMed PubMed Central Google Scholar
Liu, H. et al. Mime: A flexible machine-learning framework to construct and visualize models for clinical characteristics prediction and feature selection. Comput. Struct. Biotechnol. J. 23, 2798–2810 (2024).
Article CAS PubMed PubMed Central Google Scholar
Subramanian, A. et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc. Natl. Acad. Sci. USA 102, 15545–15550 (2005).
Article CAS PubMed PubMed Central Google Scholar
Reimand, J. et al. Pathway enrichment analysis and visualization of omics data using g:Profiler, GSEA, Cytoscape and EnrichmentMap. Nat. Protoc. 14, 482–517 (2019).
Article CAS PubMed PubMed Central Google Scholar
Shannon, P. et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 13, 2498–2504 (2003).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We acknowledge the BioRender platform for providing the tools to create most of the illustrations in this paper. This work was supported by the National Natural Science Foundation of China (82071973, 82572207, and 82503566), Basic and Applied Basic Research Foundation of Guangdong Province (2020A1515011022, 2023A1515010326, and 2024A1515110018), Guangdong Medical Scientific Research Foundation Program (A2025008), and Key Research Platform and Project of Guangdong University (2022ZDZX2020). The funding sources had no role in the collection, analysis, or interpretation of the data or in the decision to submit the manuscript for publication.

Author information

These authors contributed equally: Zhiheng Li, Rongzhi Cai, Yangyang Qin.

Authors and Affiliations

Department of Radiology, The Second Affiliated Hospital of Shantou University Medical College, Shantou, Guangdong, China
Zhiheng Li, Rongzhi Cai, Xiaoqing Liao, Enqi Wang, Xuanyu Wu, Yan Zhao & Yan Lin
Department of Radiology, Cancer hospital of Shantou University Medical College, Shantou, Guangdong, China
Rongzhi Cai
Department of Radiology, The First Affiliated Hospital of Ningbo University, Ningbo, Zhejiang, China
Yangyang Qin
Central Laboratory, Clinical Research Center, Shantou Central Hospital, Shantou, Guangdong, China
Yan Zhao
Department of Radiology, The Shaoxing People’s Hospital, Shaoxing, Zhejiang, China
Zengxin Lu

Authors

Zhiheng Li
View author publications
Search author on:PubMed Google Scholar
Rongzhi Cai
View author publications
Search author on:PubMed Google Scholar
Yangyang Qin
View author publications
Search author on:PubMed Google Scholar
Xiaoqing Liao
View author publications
Search author on:PubMed Google Scholar
Enqi Wang
View author publications
Search author on:PubMed Google Scholar
Xuanyu Wu
View author publications
Search author on:PubMed Google Scholar
Yan Zhao
View author publications
Search author on:PubMed Google Scholar
Zengxin Lu
View author publications
Search author on:PubMed Google Scholar
Yan Lin
View author publications
Search author on:PubMed Google Scholar

Contributions

Z.H.L., R.C., and Y.Q. contributed equally to this work. Z.H.L. and Y.L. conceived the study. Z.H.L., R.C., and Y.Q. analyzed the data. X.L., E.W., and Y.L. double-checked the data and results. Z.H.L. and Y.L. drafted the manuscript. Z.H.L., X.W., Z.X.L., Y.Z., and Y.L. supervised and coordinated the study. Z.H.L., R.C., Y.Z., Y.Q., and Y.L. revised the final version of the manuscript. Z.H.L., Z.X.L., and Y.L. are guarantors for this study. Z.H.L., Z.X.L., and Y.L. have accessed and verified the data reported in the manuscript. All authors had full access to all the data in the study and accepted the responsibility to submit it for publication. All authors read and approved the final version of the manuscript.

Corresponding author

Correspondence to Yan Lin.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

41698_2026_1331_MOESM1_ESM (download PDF )

41698_2026_1331_MOESM2_ESM (download PDF )

41698_2026_1331_MOESM3_ESM (download PDF )

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

Li, Z., Cai, R., Qin, Y. et al. Integration of radiomics, deep learning, transcriptomics, and metabolomics reveals prognostic risk stratification and underlying biological mechanisms in colorectal cancer. npj Precis. Onc. 10, 155 (2026). https://doi.org/10.1038/s41698-026-01331-2

Download citation

Received: 22 September 2025
Accepted: 06 February 2026
Published: 06 March 2026
Version of record: 20 April 2026
DOI: https://doi.org/10.1038/s41698-026-01331-2

Subjects

Abstract

Similar content being viewed by others

Machine learning-based model for CD4+ conventional T cell genes to predict survival and immune responses in colorectal cancer

Deep representation learning of tissue metabolome and computed tomography annotates NSCLC classification and prognosis

Integrated analysis of single-cell and bulk RNA-sequencing to predict prognosis and therapeutic response for colorectal cancer

Introduction

Results

Patient characteristics

Feature selection and development of the DLRM-RS

DLRM-RS performance across different clinical subgroups

Development and evaluation of the nomogram model

Metabolomic alterations between high- and low-risk groups in CRC

Transcriptomic profiling reveals distinct biological processes between high- and low-risk groups in CRC

Prognostic significance of butanoate metabolism and nitrogen metabolism in CRC

Discussion

Methods

Patients

Clinical data collection and follow-up

CT image acquisition and image preprocessing process

Tumor segmentation and feature extraction

Feature selection and development of the DLRM-RS

Evaluation of the DLRM-RS

Development and evaluation of the nomogram model

Metabolomics sample preparation, metabolite identification, and analysis

Transcriptomics analysis

Statistical analysis

Ethics statement

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Supplementary information

41698_2026_1331_MOESM1_ESM (download PDF )

41698_2026_1331_MOESM2_ESM (download PDF )

41698_2026_1331_MOESM3_ESM (download PDF )

Rights and permissions

About this article

Cite this article

Share this article

Search

Quick links

Machine learning-based model for CD4⁺ conventional T cell genes to predict survival and immune responses in colorectal cancer