Introduction

In the past decade, the development of total mesorectal excision and the widespread use of neoadjuvant chemoradiotherapy have significantly decreased the locoregional recurrence rate of locally advanced rectal cancer (LARC)1,2,3 but 25–40% of LARC patients still experience systemic recurrence after surgery4. To reduce the incidence of systemic recurrence, the guidelines recommend postoperative adjuvant chemotherapy for LARCs with high-risk clinicopathological factors such as T4 tumor stage, poorly differentiated tumors, obstruction or perforation at the time of diagnosis, lymphovascular invasion (LVI), perineural invasion (PNI), positive margins, and fewer than 12 lymph nodes examined after surgery5. Valentini’s nomogram (VN), which includes pathological stage, surgical procedure, and adjuvant chemotherapy status, was developed for prognostic prediction of LARC6 and endorsed by the American Joint Committee on Cancer. However, its performance is still limited (local recurrence, c-index 0.68; distant metastasis, c-index 0.73)7. In addition, whether high-risk patients classified by these clinicopathological factors can benefit from adjuvant chemotherapy after surgery remains controversial8,9. At present, radiological imaging is widely used for early diagnosis and prognostic factor detection due to its advantages of being noninvasive, visible, dynamic, and comprehensive for evaluating tumor lesions. Tumor angiogenesis is also an important indicator for predicting patient prognosis10,11,12. Hence, integrating multidimensional information from clinical-imaging-angiogenesis data to construct a fusion prediction model might provide better performance for postoperative risk stratification and selection of chemotherapy benefit populations.

In fact, imaging evaluation relies on visual assessment by radiologists, and there may be low consistency in diagnostic results among radiologists, especially junior radiologists with limited experience. Spectral detector CT (SDCT) has become a popular research topic due to its ability to not only obtain anatomical information from conventional CT but also provide various quantitative parameter information based on material composition analysis13. However, the majority of studies based on SDCT have used 2D-ROI delineation at the maximum level of tumors14,15,16,17,18. Due to the heterogeneity of LARC and the uneven distribution of neovascularization in tumors, qualified imaging markers must be able to reveal information on all lesions and be stable. Therefore, early evaluation using reproducible 3D-SDCT imaging biomarkers may become an important parameter for improving prognostic prediction.

Angiogenesis is one of the core markers of cancer19. Multiple studies have shown that tumor angiogenesis, such as that of CD34 and CD105, and the level of vascular endothelial growth factor (VEGF) were important indicators for predicting patient prognosis10,11,12but the prognostic value of different angiogenesis parameters in RC remains controversial20,21,22. The identification of optimal angiogenesis biomarkers associated with the prognosis of LARC may provide additional prognostic value for the construction of prognostic models.

In this regard, we analyzed the impact of different ROI selection methods (2D vs. 3D) on the measurement of SDCT imaging biomarkers, explored the complementary prognostic value of different angiogenesis biomarkers, and established an interpretable model based on the optimal ROI protocol for individualized prediction of postoperative risk stratification in LARC patients. We further assessed and compared the prognostic value of our model to that of the VN model and tested its ability to determine the benefits of postoperative adjuvant chemotherapy in LARC patients.

Materials and methods

Patients

All studies complied with the Declaration of Helsinki. This retrospective study was approved by the institutional review boards of Tianjin Medical University Cancer Institute & Hospital (EK20240180), and patients informed consent was waived. All experiments were performed in accordance with relevant guidelines and regulations.

We retrospectively reviewed the data of rectal cancer patients who underwent pretreatment SDCT examination. Retrospective data between March 2019 and March 2021 were used to identify eligible patients according to the following inclusion criteria: (1) biopsy-proven untreated rectal cancer; (2) available pre-treatment pelvic SDCT images from the PACS; (3) standard neoadjuvant therapy followed by surgical resection with histopathological confirmation; and (4) no concurrent malignant tumors or prior cancer treantment. The exclusion criteria were as follows: (1) lost to follow-up within 3 years; or (2) had incomplete records of imaging, clinical and pathological data. The detailed information on preoperative treatment, surgery, and postoperative adjuvant therapy are shown in Supplementary Material 1.

Two rectal cancer cohorts were analyzed: 204 patients were randomly classified into training (n = 143) and validation (n = 61) cohorts at a 7:3 ratio, using the Python package scikit-learn, in Python 3.10.9. Figure 1 presents the flowchart for patient selection and the workflow of this study.

After surgical resection, all patients were followed up every 3–6 months for at least 3 years. The follow-up measurements included the CEA level and a contrast CT/MRI scan. The outcome of this study was 3-year disease-free survival (DFS), which was defined as the period between the date of surgery and the first evidence of clinical recurrence and/or distant metastases, as confirmed by imaging or histopathology. Patients were divided into recurrent and non-recurrent groups based on recurrence status within 3 years.

Fig. 1
Fig. 1
Full size image

Flowchart of patient’s enrollment and the workflow of the study.

SDCT protocols and imaging feature acquisition

Rectal unenhanced and two-phase contrast-enhanced scans were performed by a dual-layer detector spectral CT (IQon Spectral CT, Philips Healthcare) in the supine position with hands above the head. First, under a tube voltage of 120 kVp, nonenhanced scanning was obtained following scout imaging. The nonionic contrast medium iohexol was applied as a contrast agent at a concentration of 1.2 mL/kg and an injection rate of 2.5 mL/s through the median cubital vein. The acquisition times in the venous phase (VP) and the delayed phase (DP) were 60–70 s and 90 s after contrast agent administration, respectively. The acquisition and reconstruction parameters are shown in Supplementary Material 2.

Spectral images were obtained on an IntelliSpace Portal Workstation (ISP version 10, Philips Healthcare) for further investigation. Two radiologists with abundant abdominal expertise (10 years and 12 years) and who were blinded to the clinical data, measured the quantitative imaging biomarkers independently. Two different ROI protocols were used. (1) For 2D-ROI, a round ROI was placed at the greatest axial diameter and adjacent to the upper and lower slices of the rectal cancer lesions, and then the average iodine concentration (IC) values on the iodine-based images were calculated (Fig. S1a,b,e,f). (2) For 3D-ROI, the volume of interest along the entire enhanced tumor margin at all visible tumor levels was outlined (Fig. S1c,d,g,h), and the average IC values were calculated. Additionally, the normalized iodine concentration (NIC) refers to the quotient of the lesion IC divided by the the same layer of external iliac artery IC to minimize patient variability (NIC = IClesion/ICartery).

Clinicopathological parameters

Clinicopathological parameters, including age, sex, hypertension history, family history, histologic grade, pathologic T, N stage, microsatellite instability status, HIF-1α expression, Ki-67 level, LVI, PNI, surgical procedure and postoperative adjunctive history, were collected by reviewing the medical records of the patients (Supplementary Material 3).

Tumor angiogenesis parameters

Tumor angiogenesis was assessed via the CD34-based microvessel density (MVD), CD105-based MVD, and VEGF level.

Immunohistochemistry was performed to stain selected specimens using monoclonal mouse anti-human CD34 and CD105 antibodies and monoclonal rabbit anti-human VEGF antibody (Supplementary Material 4,5).

Model construction

The “xgboost” package in R was used to build the model, with the learning rate set to 0.1 and the objective function specified as binary logistic regression.

Statistical analyses

All the statistical calculations were conducted using SPSS 19.0, MedCalc 20.014 or R 6.1 (http://www.R-project.org). Quantitative variables are expressed as the mean ± SD. The data of the two observers were tested with the interclass correlation coefficient (ICC) and Bland-Ahman plot. Continuous variables were compared using the t-test or Mann-Whitney U tests, while categorical variables were compared using the χ2 or Fisher exact tests, as appropriate. For continuous variables with missing values, use mean, median, or mode for imputation; For categorical variables, use mode interpolation. Cox proportional hazards model was used for univariate and multivariate analyses, effect sizes were reported as hazard ratios (HRs) with 95% confidence intervals (CIs). The DeLong test was adopted to compare the differences between parameters. The optimal cutoff value was determined using X-tile (3.6.1) for dividing patients into high-risk and low-risk groups. Patient stratification in different risk groups was evaluated using Kaplan‒Meier curve analysis and the log-rank test. In order to control bias and achieve pseudo-randomization, propensity score matching (PSM) using R was used to rematch the study samples. To establish an interpretable prediction model, SHapley Additive exPlanation (SHAP) was used to provide an explanation for our predictive model, which calculated the importance of ranking features from the final model23. P < 0.05 was considered to indicate statistical significance.

Results

Patient characteristics

The baseline data of all 204 patients and their tumors are summarized in Table 1. The median follow-up times for DFS were 899.5 days in the training cohort (interquartile range [IQR]: 385.0–1257.0 days) and 950.4 days in the validation cohort (IQR: 499.0–1304.5 days).

Table 1 Clinicopathological parameters in all cohorts.

Sixty-five of the 204 patients experienced disease recurrence, 33 (50.8%) of whom experienced systemic disease recurrence (8 in the lung, 20 in the liver, 2 in the bone and 3 in both the liver and lung), 21 (32.3%) of whom experienced locoregional disease recurrence, and 11 (16.9%) of whom experienced mixed disease. Among them, 20 patients were confirmed by surgery, while the other 45 patients were diagnosed based on radiological characteristics. 137 (67.2%) patients were treated with postoperative adjuvant fluorouracil-based chemotherapy.

Overall, the average CD34-based MVD of all the lesions was 40.19 ± 6.89; for CD105-based MVD, it was 28.25 ± 5.50.

2D- vs. 3D-ROI interobserver agreement

Among the two ROI methods, the 3D-ROI method had the best interobserver agreement (ICC of 0.826–0.960) (Table 2). The Bland-Altman analysis showed that all the imaging features measured by the 3D-ROI method were more concentrated than those measured by the 2D-ROI method, indicating that the 3D-ROI analysis had a smaller consistency interval and better accuracy in repeated measurements by different readers (Fig. 2). Therefore, the average values of 3D-quantitative imaging features calculated by the two radiologists were used for further analysis.

Table 2 The interclass correlation coefficient between the two observers using two different ROI methods.

Predictive factors for DFS

In the univariate analyses of DFS, clinicopathological parameters (histologic grade, pT stage, pN stage, CEA, HIF-1α, LVI, and PNI), SDCT features (NICVP3D and NICDP3D values) and angiogenesis parameters (CD34, CD105, and VEGF) were associated with DFS. According to the multivariate analysis, clinicopathological parameters (PNI, histologic grade), SDCT features (NICVP3D values) and angiogenesis parameters (CD105) were found to be independent predictors in the training cohort (P < 0.05, Table 3).

Table 3 Univariate and multivariate logistic regression analysis for recurrence prediction in training cohort.
Fig. 2
Fig. 2
Full size image

Bland-Altman analysis between the two observers using two different ROI methods. (a) ICVP3D. (b) ICDP3D. (c) NICVP3D. (d) NICDP3D. (e) ICVP2D. (f) ICVP2D. (g) NICVP2D. (h) NICDP2D.

Model construction and comparison

A multidimensional radiological-angiogenesis-clinicopathological integrated model (RACIM) was established based on the above prediction variables (PNI, CD105, histologic grade and NICVP3D values), which predicted the probability of disease recurrence for each individual patient. Multivariate analysis was used to construct a clinical model that included histologic grade, HIF-1α, LVI and PNI; an angiogenesis model that included CD105; and a radiological model that included NICVP3D values. The receiver-operating characteristic (ROC) curves of the different models for the entire cohort are shown in Fig. 3. The ROC curves revealed that the radiological model (NICVP3D) had an AUC of 0.85 (95% CI, 0.78–0.91), a sensitivity of 78.4%, and a specificity of 79.3%. According to the X-tile, the optimal cut-off value of the NICVP3D was identified as 0.345. The combined model achieved excellent predictive performance, with AUCs of 0.95 (95% CI, 0.92–0.98) and 0.93 (95% CI, 0.85-1.00) in the training and validation cohorts, respectively (Table 4). The AUC of the combined model was obviously greater than that of the radiological (P = 0.0004, P = 0.0393), angiogenesis (P < 0.0001, P = 0.0091) and clinical models (P = 0.0471, P = 0.0088) in all cohorts.

Fig. 3
Fig. 3
Full size image

Receiver operating characteristic curve (ROC) analysis for the prediction models in the training (a) and validation cohorts (b).

Table 4 ROC analyses of the different models in the training and validation cohorts.

Additionally, a VN model with pathological stage, surgical procedure, and adjuvant chemotherapy status was also built for comparison. Compared with the VN model (AUC: 0.77, 95% CI: 0.70–0.85; AUC: 0.76, 95% CI: 0.59–0.92; AUC: 0.72, 95% CI: 0.59–0.85), our radiological and RACIM models exhibited superior performance in the training (AUC: 0.85, 95% CI: 0.78–0.91, P = 0.0160; AUC: 0.95, 95% CI: 0.92–0.98, P < 0.0001) and validation (AUC: 0.83, 95% CI: 0.73–0.93, P = 0.4217; AUC: 0.93, 95% CI: 0.85-1.00, P = 0.0428) (Table 4). Moreover, the calibration plots of RACIM model showed that the estimations had good agreement with the actual observations (Fig. 4a,b). The decision curve analysis curves revealed that the RACIM model achieved moderately better net benefit than other models over the relevant threshold range in all cohorts (Fig. 4c, d).

Fig. 4
Fig. 4
Full size image

Calibration curves and decision curves of different models. (a) Calibration curves in training cohort. (b) Calibration curves in validation cohort. (c) Decision curves in training cohort. (d) Decision curves in validation cohort.

Patient risk stratification

We divided patients into high- and low-risk groups according to the X-tile-generated optimum cutoff value (0.389) of the RACIM, which significantly differed in terms of DFS in the training cohort (log-rank test, P < 0.001) (Fig. 5a). Then, we performed the same analyses to stratify patients in the validation cohort to determine the prognostic value of the RACIM. Consistent with the training cohort, significant differences in DFS were observed between the two groups in validation cohort (log-rank test, P = 0.001) (Fig. 5b). Table 5 showed the selected prediction parameters in RACIM-classified high and low-risk groups.

Table 5 Selected prediction parameters in RACIM-classified high and low-risk groups.

To test the ability of the RACIM to identify patients who may benefit from postoperative adjuvant chemotherapy, subgroup analyses of patients receiving adjuvant chemotherapy were further performed. Notably, in the RACIM-classified high-risk group, postoperative adjuvant chemotherapy was significantly associated with a treatment benefit (P = 0.036), while adjuvant chemotherapy did not improve survival in any of the 204 patients (P = 0.400) or in patients with any high-risk clinicopathological features (P = 0.400, Fig. 6).

Fig. 5
Fig. 5
Full size image

The Kaplan Meier survival analysis curve stratified the prognosis of patients according to the RACIM-based classifier. (a) Training cohort. (b) Validation cohort.

Fig. 6
Fig. 6
Full size image

Effect of postoperative adjuvant chemotherapy in different subgroups, which were stratified by the receipt of chemotherapy. (a) All cases group. (b) RACIM-classified high risk group. (c) Any high-risk clinicopathological features group.

Model interpretability with SHAP

In this study, we employed the SHAP algorithm to endow our RACIM with global and local interpretability. As observed in the plot, the SDCT imaging indicator NICVP3D was the most important risk factor, followed by CD105, PNI, and histologic grade (Fig. 7a,b).

Figure 7c,d shows the SHAP model force plot of two male participants who had TNM stage IIIB disease, depicting how NICVP3D, CD105 and clinicopathological characteristics affect the ability of the model to predict recurrence risk at the individual level.

Fig. 7
Fig. 7
Full size image

Model interpretability of the RACIM for the prediction of disease-free survival (DFS) with SHAP in the training cohort. (a) Feature importance plot listing the most significant variables in descending order. (b) Summary plot of the impact of features on model decision-making and the interactions between features in the model. SHAP force plots of two participants with high (c) and low (d) risk of DFS. Yellow dots represent higher eigenvalues and purple dots represent lower eigenvalues.

Discussion

In this study, we first demonstrated that SDCT-derived 3D quantitative biomarkers achieved more satisfactory interobserver accuracy than 2D parameters and could reliably predict DFS in LARC patients. Compared to other angiogenesis parameters (CD34 and VEGF), CD105 was the only independent angiogenesis factor for DFS. We then developed and validated a multidimensional RACIM combining clinicopathological, SDCT and angiogenesis parameters (PNI, histologic grade, NICVP3D, and CD105) for the individualized prediction of DFS and achieved better performance than traditional clinicopathological parameters. More importantly, the proposed RACIM successfully identified patients who were most likely to benefit from postoperative adjuvant therapy.

Over the past few years, many scholars have attempted to identify new biomarkers that can predict patient prognosis and help identify patients who are suitable for postoperative adjuvant chemotherapy in LARCs. Mounting evidence has revealed the predictive value of SDCT features in rectal cancer14,15,16,17,18. Previous studies selected only the maximum level of the tumor for ROI delineation, ignoring tumor heterogeneity and phenotypic variations. In contrast, our study showed that the 3D-ROI method has the best consistency and repeatability, thereby ensuring the reliability and stability of the results. In addition, our NICVP3D model was superior to any other parameter or VN model for predicting recurrence. Through the variance importance plot of the SHAP algorithm, we found that NICVP3D is also the most important feature, confirming the necessity of incorporating imaging features into conventional clinical features for risk stratification and laying the foundation for the large-scale standardized clinical application of SDCT in the future.

Notably, in our study, the NIC was an independent factor for DFS compared to the IC, and the venous-phase NIC outperformed the delay-phase NIC in terms of prognosis, which is consistent with the results of other studies24,25,26,27. The reason for these results may be that the NIC removed the potential influence of individual circulation on the IC, as well as the differences in tumor blood supply during different phases of contrast-enhanced CT. The iodine quantitative imaging biomarkers in the VP could well reflect the microcirculation of tumors and may reflect the distribution of contrast agents in the interstitial space25,27. However, other studies have also reported the predictive value of IC and NICDP in rectal cancer26. We speculated that this difference in the results may be attributable to the use of different ROI selection methods, although further investigations are needed before widespread implementation of these methods in clinical practice.

Angiogenesis is an important component of tumor progression; thus, our study incorporated angiogenesis parameters into the constructed predictive model. Notably, compared to VEGF and CD34, CD105-based MVD was the only independent factor for DFS, confirming that CD105 can be regarded as a better prognostic marker for rectal cancer, which is similar to the findings of a previous study28. CD105, a 180 kDa transmembrane glycoprotein, is a coreceptor of TGF-β and is the key factor in the TGF-β pathway and angiogenesis29,30. In contrast to CD31, CD34 and factor-related antigens, CD105 is highly expressed only in neovascular endothelial cells, while it is almost undetectable in the blood vessels of normal tissues31. Therefore, it is reasonable to explain why the average value of CD105-based MVD was lower than that of CD34-based MVD in the present study.

Another focus of this study was the selection of patients who benefited from adjuvant chemotherapy after surgery. According to our subgroup analyses, we found that patients in the RACIM-based high-risk group achieved a better survival benefit from adjuvant chemotherapy, whereas patients in the low-risk group did not. Furthermore, this study revealed that adjuvant chemotherapy failed to significantly improve survival outcomes in either the overall patient cohort or subgroups stratified by conventional high-risk clinicopathological parameters. These findings reaffirm the inherent limitations of traditional clinicopathological risk assessment in identifying optimal candidates for postoperative chemotherapy. Importantly, our novel RACIM-based classifier demonstrated superior predictive capacity for therapeutic stratification, enabling more personalized adjuvant treatment decisions through precision patient selection.

As a preliminary effort, the study has a few limitations. First, as a retrospective study, it was characterized by unavoidable selection bias. Second, the data were based on small numbers of patients from one center and lacked external validation. For this reason, future studies with more samples are needed to verify the model, and integrated research with antiangiogenic agents and neoadjuvant therapies are promising future directions. Finally, manual segmentation was time-consuming. To our knowledge, the IntelliSpace Portal workstation 12 has achieved semiautomatic volumetric ROI delineation, which will provide convenient technical support for further validation of our results.

Future impact

To translate the promising performance of the RACIM into routine clinical practice, prospective multicenter validation in diverse patient cohorts is essential. This will rigorously confirm its generalizability, stability across different SDCT platforms and operators, and real-world prognostic value. Integration into the clinical workflow could involve incorporating the RACIM score calculation into radiology reporting systems or hospital information systems (HIS/RIS/PACS) upon SDCT examination completion. The visualized risk stratification output could then be readily available within multidisciplinary team meetings, directly informing discussions on adjuvant therapy intensity (e.g., escalating or de-escalating chemotherapy) for individual LARC patients. Furthermore, exploring the development of automated or semi-automated tools for NICVP3D measurement within SDCT would enhance standardization, reduce observer variability, and facilitate the seamless adoption of this quantitative biomarker into future prospective studies and clinical practice, ultimately paving the way for its role in optimizing LARC treatment strategies.

Conclusion

The 3D-SDCT analysis protocol improved interobserver consistency, and the NICVP obtained by this method has the potential for predicting DFS. Moreover, our study highlighted the prognostic value of CD105-based MVD in rectal cancer. Furthermore, the multidimensional RACIM combined with SDCT, angiogenesis and clinicopathological parameters could offer an optional novel approach to improve the postoperative risk stratification in LARC patients. Our prediction model provided individual-level interpretability through the SHAP algorithm, providing a valuable decision-making basis for personalized postoperative therapeutic strategies.