Machine learning prognosis model for locally recurrent rectal cancer patients after radioactive 125I seed implantation

Qin, Yun; Li, Xuemin; Sun, Haitao; Zhao, Wei; Zhu, Lihua; Wang, Junjie; Wang, Hao

doi:10.1038/s41598-025-32579-6

Article
Open access
Published: 17 December 2025

Scientific Reports volume 16, Article number: 2679 (2026)

Abstract

This study aimed to develop and validate a multiscale radiomics prognostic tool for predicting local control (LC) and overall survival (OS) in patients with locally recurrent rectal cancer (LRRC) who underwent CT-guided radioactive ¹²⁵I seed implantation (RISI). A total of 189 LRRC patients treated with RISI were eligible for this exploratory retrospective study and were randomly divided into training and validation sets. Intra- and peri-tumoral handcrafted radiomics features (RFs) were selected using univariate analysis and the LASSO-Cox model, and the deep learning RFs underwent the same procedure. Random survival forest (RSF) and Cox hazard regression (CHR) prognostic models were fitted with bootstrap resampling and evaluated by the concordance index (C-index), integrated Brier score (IBS), and time-dependent area under the curve (tAUC). Among all peritumoral radscores (RS), RSperi1mm and RSperi4mm gave the best prediction of LC and OS in the validation set, respectively. Adding deep learning radscores also improved predictive performance. The combined RSF model outperformed the CHR model for LC prediction, achieving a C-index (95%CI) of 0.78 (0.74–0.84) and an IBS of 0.13 (0.12–0.14). Similar results were observed for OS, with a C-index of 0.76 (0.75–0.77) and an IBS of 0.11 (0.10–0.12). Based on the RSF model predictions, LRRC patients were dichotomized into two significantly different prognostic groups (p < 0.001). The RSF model could provide more accurate LC and OS prediction and clearer prognostic stratification than the CHR model for LRRC patients after RISI treatment.

Introduction

Patients with locally recurrent rectal cancer (LRRC; relapse rate 2.4–10%) still face a poor prognosis and have limited curative therapeutic options^1,2,3. There is no standard treatment strategy for LRRC patients who cannot tolerate re-operation or re-irradiation, so improving their prognosis remains a pressing clinical challenge. CT-guided radioactive ¹²⁵I seed implantation (RISI) has been recommended by the National Comprehensive Cancer Network (NCCN) guideline as a safe salvage strategy⁴. Because the therapeutic response to RISI varies considerably between individuals, personalized and accurate prediction of overall survival (OS) and local control (LC), together with risk stratification, could improve clinical management during follow-up and guide treatment strategies for LRRC patients. Our previous studies reported several conventional dosimetric risk parameters associated with LC after RISI, identified by univariate analysis^5,6, but these parameters inadequately capture the complex interplay of tumor biology and are of limited use for precise prognosis. Identifying reliable imaging biomarkers and constructing robust prognostic tools remain a critical barrier to personalized management of LRRC patients treated with CT-guided RISI, and no such studies have been reported to date.

High-spatial-resolution CT imaging is widely used to detect pelvic recurrence and locate the tumor. Radiomics and machine learning methods extend the traditional medical image analysis framework by extracting and analyzing numerous quantitative texture features from radiologic images to characterize the intrinsic heterogeneity of the lesion and its underlying molecular regulation^7,8,9. Emerging evidence highlights the growing significance of the tumor microenvironment, suggesting that peritumoral radiomics features (RFs) are also promising prognostic biomarkers^10,11. Moreover, RFs extracted automatically by pre-trained convolutional neural networks (CNNs) have shown growing success in monitoring therapeutic effect and evaluating prognosis in many malignancies^12,13,14. However, the prognostic potential of CT-derived handcrafted and deep learning (DL) features extracted from intra- and peri-tumoral regions for predicting LC and OS of LRRC patients after RISI remains unexplored. Our aim was to investigate whether intra- and peri-tumoral radiomics combined with clinical variables can improve personalized predictions.

The random survival forest (RSF) and Cox hazard regression (CHR) models have demonstrated robust predictive performance in a variety of survival analyses^15,16,17. However, it is not yet known whether a machine learning or a traditional statistical model is optimal for LRRC patients. To address these gaps, this study is the first to combine multiscale CT-based radiomics to develop and compare two prognostic models tailored to LRRC patients treated with RISI, with the goal of optimizing surveillance and adjuvant therapy strategies.

Materials and methods

Ethics and patients

This study was conducted in accordance with the Declaration of Helsinki (as revised in 2013). The ethical protocol for this retrospective study was approved by our hospital’s Institutional Review Committee (No. IRB00006761), and informed consent was waived. In addition, the whole process adhered to the METRICS checklist to improve the credibility, reproducibility, and transparency of the study; the completed checklist is provided in Supplementary 1.

Initially, 249 patients diagnosed with LRRC and treated with RISI were recruited, of whom 60 were excluded because of missing CT images or incomplete clinical information. The detailed inclusion and exclusion criteria are shown in Fig. 1. All LRRC patients had received external beam radiotherapy, chemotherapy or surgery before RISI treatment. Complete medical records of all LRRC patients were reviewed to collect the dosimetric characteristics, whose definitions are fully described in Supplementary 2. Finally, 189 eligible LRRC patients were identified through standardized screening protocols at Peking University Third Hospital between December 2015 and December 2023. No additional missing data existed in the 189 analyzed LRRC patients. Among them, 145 patients underwent RISI assisted by 3D-printed non-coplanar template (3D-PNCT) technology, and the remaining 44 patients underwent traditional RISI therapy. The 3D-PNCT-assisted RISI workflow strictly followed the procedure we described previously⁶, as depicted in Fig. S1. The patients were randomly allocated to training and validation sets at a ratio of 7:3. Bootstrap resampling with 1000 repetitions was then applied to the training set (N = 132) to develop and compare the RSF and CHR models, and independent internal validation was conducted on the validation set (N = 57).
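The 7:3 allocation and the bootstrap resampling described above can be illustrated with a short sketch. This is not the authors' code (the study reports using R): the function names `split_cohort` and `bootstrap_sample`, the seed value, and the use of integer patient IDs are assumptions made only for illustration.

```python
import random

def split_cohort(patient_ids, train_fraction=0.7, seed=42):
    """Randomly split patient IDs into training and validation lists."""
    ids = list(patient_ids)
    rng = random.Random(seed)          # hypothetical seed, for reproducibility only
    rng.shuffle(ids)
    n_train = round(train_fraction * len(ids))
    return ids[:n_train], ids[n_train:]

def bootstrap_sample(ids, rng):
    """Draw one bootstrap resample: same size as the input, with replacement."""
    return [rng.choice(ids) for _ in ids]

# 189 eligible patients, represented here by placeholder IDs 0..188
train_ids, valid_ids = split_cohort(range(189))
print(len(train_ids), len(valid_ids))  # 132 57

# One of the 1000 bootstrap repetitions on the training set
rng = random.Random(0)
resample = bootstrap_sample(train_ids, rng)
```

With 189 patients, rounding 0.7 × 189 gives the reported 132 training and 57 validation patients.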

Endpoints and follow-up

We evaluated each patient’s disease status after RISI treatment using routine blood work, biochemical testing, tumor marker analysis, abdominal CT, chest CT, and pelvic MRI. Treatment outcomes were evaluated according to the RECIST guideline (version 1.1)¹⁸. LC and OS were the target variables in the current study. Progression was defined as an increase of 20% or more in the sum of diameters of tumor lesions on available follow-up imaging. LC was defined as the duration from RISI completion to tumor progression within the treated lesion. OS was calculated from the initiation of RISI to the date of death from any cause or the last follow-up. Patients were assessed for LC and OS every 3 months from the time of RISI treatment. Patients without local progression were right-censored at the last follow-up.
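The LC endpoint defined above can be sketched as a small routine that turns a follow-up series into a (time, event) pair. This is an illustrative simplification, not the authors' procedure: it applies only the 20% rule as worded in the text and uses the baseline sum of diameters as the reference, whereas RECIST 1.1 itself refers to the smallest recorded sum and adds an absolute 5 mm criterion. The function names and the toy measurements are hypothetical.

```python
def is_progression(baseline_sum_mm, followup_sum_mm, threshold=0.20):
    """True when the sum of lesion diameters has grown by at least `threshold` (20%)."""
    return followup_sum_mm >= baseline_sum_mm * (1.0 + threshold)

def local_control_record(baseline_sum_mm, followups):
    """Convert follow-up imaging into a survival record.

    followups: list of (months_since_RISI, sum_of_diameters_mm), in time order.
    Returns (time_in_months, event): event=1 at the first progression,
    event=0 (right-censored) at the last follow-up if no progression occurred.
    """
    for months, diameter_sum in followups:
        if is_progression(baseline_sum_mm, diameter_sum):
            return months, 1
    return followups[-1][0], 0

# Toy examples with 3-monthly imaging (values are made up)
progressed = local_control_record(50.0, [(3, 48.0), (6, 62.0), (9, 70.0)])
censored = local_control_record(50.0, [(3, 48.0), (6, 55.0)])
print(progressed, censored)
```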

Image acquisition

All patients underwent a pre-treatment helical pelvic CT scan 2 days before RISI to locate the tumor (Brilliance BigBore, Philips, Amsterdam, Netherlands) with the following standardized imaging protocol: tube voltage, 120 kV; tube current, 325 mA; collimation, 16 × 1.5 mm; beam pitch, 0.938; field of view (FOV), 500 mm; reconstruction slice thickness, 5 mm; rotation time, 0.75 s; and matrix size, 512 × 512.

ROIs segmentation

The recurrent tumors (intratumoral regions) were manually contoured slice by slice by a physician (reader 1) with 10 years of clinical diagnostic experience in the Brachytherapy Treatment Planning System (B-TPS). All delineations underwent rigorous review and refinement by a senior radiologist (reader 2, > 15 years of expertise) to ensure segmentation accuracy before the radiotherapy plan was designed and optimized. Readers 1 and 2 independently contoured target volumes for 50 randomly selected cases to evaluate inter-observer consistency. In addition, reader 1 repeated the contouring for the same 50 patients after a one-week interval to assess intra-observer reproducibility.

To determine the optimal peritumoral range, we used morphological dilation in Python’s SimpleITK library to isotropically expand the GTV by 1–6 mm, generating peritumoral regions of interest (ROIs) for comparative analysis. This process yielded seven distinct ROIs per patient: the original GTV and six peritumoral ROIs at 1 mm intervals. Because tumor recurrence frequently involves the sacrum and rectum, we removed regions containing air and bone tissue, using Hounsfield unit (HU) thresholds of below −200 and above +400, to refine the peritumoral delineation. The study workflow is depicted in Fig. 2.
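The expansion and HU filtering described above can be illustrated without any imaging library. The study used SimpleITK; the sketch below instead works on plain voxel coordinates so that the logic is visible. It assumes a 1 mm isotropic grid, a spherical (Euclidean) structuring element, voxel-wise exclusion of air and bone, and a peritumoral ROI defined as a ring that excludes the tumor itself. All of these choices, and the function name `peritumoral_ring`, are assumptions rather than details confirmed by the text.

```python
from itertools import product

def peritumoral_ring(tumor_voxels, hu, margin_mm, hu_min=-200, hu_max=400):
    """Voxels within margin_mm of the tumor, excluding tumor, air and bone.

    tumor_voxels: set of (z, y, x) indices on a 1 mm isotropic grid.
    hu: dict mapping every image voxel (z, y, x) to its Hounsfield unit.
    """
    r = int(margin_mm)
    # Offsets inside a sphere of radius margin_mm (isotropic expansion)
    offsets = [o for o in product(range(-r, r + 1), repeat=3)
               if sum(c * c for c in o) <= margin_mm ** 2]
    dilated = set()
    for z, y, x in tumor_voxels:
        for dz, dy, dx in offsets:
            voxel = (z + dz, y + dy, x + dx)
            if voxel in hu:                      # stay inside the image
                dilated.add(voxel)
    ring = dilated - set(tumor_voxels)
    # Drop air (< -200 HU) and bone (> +400 HU)
    return {v for v in ring if hu_min <= hu[v] <= hu_max}

# Toy 5 x 5 x 5 image of soft tissue with one bone and one air voxel
hu = {v: 40 for v in product(range(5), repeat=3)}
hu[(2, 2, 3)] = 700      # bone next to the tumor
hu[(2, 3, 2)] = -1000    # air next to the tumor
tumor = {(2, 2, 2)}

ring_1mm = peritumoral_ring(tumor, hu, 1)
ring_2mm = peritumoral_ring(tumor, hu, 2)
print(len(ring_1mm), len(ring_2mm))
```

Calling the function for margins 1 to 6 would give the six peritumoral ROIs that, together with the GTV, make up the seven ROIs per patient.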

Image processing and features extraction

Each patient’s CT images were resampled and normalized prior to feature extraction. Because in-plane resolution varied, the CT voxels were interpolated to 1 × 1 × 1 mm³ using linear interpolation. Subsequently, the gray values were normalized to [0,1] using min-max normalization. The resulting images were used for handcrafted and deep learning feature extraction.
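The two preprocessing steps above can be shown on a one-dimensional intensity profile. This is a minimal sketch under stated assumptions: real CT volumes are resampled in three dimensions, and the function names, the 5 mm input spacing and the sample values are chosen only for illustration.

```python
def resample_linear_1d(samples, spacing_mm, new_spacing_mm=1.0):
    """Linearly interpolate a 1D profile (at least 2 samples) onto a new grid."""
    length_mm = (len(samples) - 1) * spacing_mm
    n_new = int(length_mm / new_spacing_mm) + 1
    out = []
    for i in range(n_new):
        pos = i * new_spacing_mm / spacing_mm        # position in original index units
        left = min(int(pos), len(samples) - 2)       # left neighbour, clamped at the end
        frac = pos - left
        out.append(samples[left] * (1 - frac) + samples[left + 1] * frac)
    return out

def min_max_normalize(values):
    """Rescale values to [0, 1]; a constant input maps to all zeros."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]

# Three samples 5 mm apart, resampled to 1 mm spacing, then normalized
profile = [0.0, 50.0, 100.0]
resampled = resample_linear_1d(profile, spacing_mm=5.0)
normalized = min_max_normalize(resampled)
print(len(resampled), normalized[0], normalized[-1])
```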

Intratumoral and peritumoral radiomics features

The detailed settings for CT image preprocessing and handcrafted RF extraction are provided in Table S1. A total of 1874 quantitative intra- and peri-tumoral handcrafted RFs per patient were initially derived from the original CT images and 7 image preprocessing filters using the open-source PyRadiomics package. The whole feature extraction process adhered to the Image Biomarker Standardization Initiative (IBSI) protocols^19,20.

Deep learning features

We used ResNet and DenseNet as feature extractors, fine-tuned via transfer learning using the TensorFlow framework (v2.13.0) on an NVIDIA GeForce GT 730 graphics processing unit. The architectures of the ResNet-50 (DL₁), ResNet-101 (DL₂), DenseNet121 (DL₃) and DenseNet201 (DL₄) models are presented in Fig. S2. These four commonly used CNNs were pre-trained on the large-scale, well-annotated ImageNet database²¹. Seven consecutive two-dimensional CT slices, centered on the axial slice with the largest lesion area, were selected and resized to 224 × 224 pixels as the network input. Owing to the limited amount of data, we performed data augmentation on each training image, including random translation and rotation. We trained the networks by dividing LRRC patients into two classes, using the median LC and OS times as cut-off values, with sigmoid cross-entropy as the loss function on the training set (n = 132), while an independent validation set monitored performance metrics to trigger early stopping at 200 epochs. We chose the Adam optimization algorithm with an initial learning rate of 0.0003, a batch size of 32 and a random state of 42 for the DL models. For each patient, 2048 features each from DL₁ and DL₂, 1024 from DL₃ and 1920 from DL₄ were extracted from the global average pooling layer. Guided Gradient-weighted Class Activation Mapping (Guided Grad-CAM) was used to visualize the output of the last convolutional layer and aid interpretation of the four DL models, as shown in Fig. S3.
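Two bookkeeping steps in this paragraph, choosing the seven input slices and assigning binary labels at the median time, can be sketched as follows. This is an illustration only: clamping the slice window at the volume boundary, labelling times above the median as 1, and ignoring censoring when forming labels are all assumptions, and the function names and toy values are hypothetical.

```python
from statistics import median

def select_slices(slice_areas, n_slices=7):
    """Indices of n_slices consecutive axial slices centred on the largest lesion area.

    slice_areas: lesion area per axial slice, in slice order.
    The window is shifted inward if the centre lies near the volume boundary.
    """
    n = min(n_slices, len(slice_areas))
    centre = max(range(len(slice_areas)), key=lambda i: slice_areas[i])
    start = max(0, min(centre - n // 2, len(slice_areas) - n))
    return list(range(start, start + n))

def median_labels(times):
    """Binary labels using the median time as cut-off (1 = longer than the median)."""
    cutoff = median(times)
    return [1 if t > cutoff else 0 for t in times], cutoff

# Toy lesion areas (mm^2) for a 10-slice volume; the peak is at index 4
areas = [0, 10, 30, 80, 120, 90, 40, 5, 0, 0]
chosen = select_slices(areas)

# Toy LC times in months
labels, cutoff = median_labels([6, 12, 15, 20, 30])
print(chosen, labels, cutoff)
```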

Feature selection

Z-score normalization was applied to bring values of different magnitudes onto a common scale before the following steps. The repeatability and robustness of handcrafted and deep learning RFs were quantitatively evaluated by the intraclass correlation coefficient (ICC). Features with an ICC above 0.80 were considered stable and entered the subsequent feature selection. Because of the high dimensionality of the features, univariate Cox regression analysis was first performed to screen RFs potentially related to prognosis (p < 0.05). Subsequently, Pearson correlation analysis was used to remove redundant features: only one feature from each pair with a coefficient above 0.9 was retained. To further avoid overfitting and improve model generalizability, LASSO-Cox regression with 5-fold cross-validation was then applied, eliminating redundant RFs by shrinking the coefficients of non-predictive variables to zero, and a maximum of 10 features were retained in each radiomic model. The corresponding radscore (RS) models for predicting LC and OS were built from the most predictive handcrafted and deep learning RFs with nonzero coefficients, derived from the intratumoral ROI, the 6 peritumoral ROIs, and the 4 DL models.
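The filtering stages that precede LASSO-Cox can be sketched in a few lines. This is an illustration under assumptions, not the authors' pipeline: the ICC values and univariate p values are taken as given inputs, the rule for which feature of a correlated pair to keep (the one with the smaller p value) is not stated in the text and is chosen here for concreteness, and the LASSO-Cox fit itself is not implemented. The radscore is shown as the usual weighted sum of selected features, with hypothetical coefficients.

```python
from math import sqrt

def pearson(a, b):
    """Pearson correlation of two equal-length, non-constant sequences."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    return cov / sqrt(var_a * var_b)

def filter_features(features, icc, p_values, icc_min=0.80, p_max=0.05, r_max=0.9):
    """Keep features with ICC > icc_min and p < p_max, then prune pairs with |r| > r_max."""
    candidates = [f for f in features if icc[f] > icc_min and p_values[f] < p_max]
    candidates.sort(key=lambda f: p_values[f])       # most significant first (assumption)
    kept = []
    for f in candidates:
        if all(abs(pearson(features[f], features[k])) <= r_max for k in kept):
            kept.append(f)
    return kept

def radscore(coefficients, patient_features):
    """Weighted sum of a patient's selected feature values."""
    return sum(c * patient_features[name] for name, c in coefficients.items())

# Toy data: f2 is nearly collinear with f1, f3 fails the ICC threshold
features = {"f1": [1, 2, 3, 4, 5], "f2": [2, 4, 6, 8, 10.5],
            "f3": [5, 1, 4, 2, 3], "f4": [1, 3, 2, 5, 4]}
icc = {"f1": 0.95, "f2": 0.90, "f3": 0.70, "f4": 0.85}
p_values = {"f1": 0.01, "f2": 0.02, "f3": 0.001, "f4": 0.03}

selected = filter_features(features, icc, p_values)
score = radscore({"f1": 0.5, "f4": -0.2}, {"f1": 2.0, "f4": 1.0})  # hypothetical coefficients
print(selected, score)
```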

Model construction and statistical analysis

Clinical and dosimetric variables significantly associated with LC and OS (p < 0.05) were screened out by univariate and multivariate Cox regression analysis. These selected characteristics were then used to construct the RSF and CHR prediction models. First, the tune_grid method was used to identify the optimal combination of parameters for the RSF model. Next, we ran the RSF and CHR models with 1000 bootstrap resampling iterations on the training set to increase robustness. Finally, we evaluated the prognostic performance of the RSF and CHR models in the validation set using the concordance index (C-index), integrated Brier score (IBS) and time-dependent area under the curve (tAUC) to choose the better prognostic model. The 95% confidence intervals for each metric were obtained by bootstrap resampling. Details of each R package (R software v4.3.1) are described in Supplementary 3. A two-sided p value below 0.05 was considered statistically significant.
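Of the three metrics named above, the C-index is the simplest to write out. The sketch below implements Harrell's concordance index for right-censored data in plain Python. It is not the R implementation used in the study: tied event times are simply skipped, and the toy data are made up. A higher risk score is assumed to mean a worse expected outcome.

```python
def concordance_index(times, events, risk_scores):
    """Harrell's C-index for right-censored data.

    A pair (i, j) is comparable when patient i had the event and did so
    before patient j's observed time. The pair is concordant when the
    patient with the earlier event also has the higher risk score;
    tied scores count as half.
    """
    concordant = 0.0
    comparable = 0
    n = len(times)
    for i in range(n):
        if not events[i]:
            continue                      # censored patients cannot anchor a pair
        for j in range(n):
            if times[i] < times[j]:
                comparable += 1
                if risk_scores[i] > risk_scores[j]:
                    concordant += 1.0
                elif risk_scores[i] == risk_scores[j]:
                    concordant += 0.5
    return concordant / comparable

# Toy data: months, event indicator (1 = event, 0 = censored), predicted risk
times = [5, 10, 15, 20]
events = [1, 1, 0, 1]
risk = [0.9, 0.6, 0.7, 0.1]
c_index = concordance_index(times, events, risk)
print(c_index)  # 0.8
```

In this example five pairs are comparable and four are concordant, giving 0.8. A value of 0.5 corresponds to random ranking and 1.0 to perfect ranking.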

Ethics approval

The study protocol was approved by the Ethics Committee of Peking University Third Hospital [IRB00006761].

Results

Patient characteristics

Baseline clinical characteristics and dosimetric parameters of LRRC patients in the training and validation sets are listed in Table 1. The distributions of these variables did not differ significantly between the two cohorts (all p > 0.05). The median (95%CI) LC and OS times for the 189 patients in this study were 15.0 (12.5–18.0) months and 19.4 (17.6–21.3) months, respectively. The one- and two-year LC rates were 58.3% (95%CI 51.5–66.2) and 29.5% (95%CI 21.8–39.1), respectively. The corresponding one- and two-year OS rates were 75.6% (95%CI 69.7–82.0) and 37.5% (95%CI 35.2–52.8), respectively. During the follow-up period, 108 (57.1%) LRRC patients had confirmed progressive disease in the treated region and 134 (70.9%) had died by the end of follow-up. Death prior to local progression constitutes a competing risk that may bias LC estimates. In our study (n = 189), no deaths occurred without prior documented local progression, which removes competing-risk concerns for the LC endpoint.
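One- and two-year rates of this kind are commonly read off a Kaplan–Meier curve. The sketch below shows that calculation in plain Python, assuming (the text does not state it for these particular figures) that the reported rates are Kaplan–Meier estimates. The function names and the toy follow-up data are hypothetical.

```python
def kaplan_meier(times, events):
    """Kaplan-Meier curve as a list of (time, survival_probability) at event times."""
    data = sorted(zip(times, events))
    at_risk = len(data)
    survival = 1.0
    curve = []
    i = 0
    while i < len(data):
        t = data[i][0]
        n_events = sum(1 for tt, e in data if tt == t and e)
        n_removed = sum(1 for tt, _ in data if tt == t)   # events plus censored at t
        if n_events:
            survival *= 1.0 - n_events / at_risk
            curve.append((t, survival))
        at_risk -= n_removed
        i += n_removed
    return curve

def survival_at(curve, t):
    """Estimated probability of remaining event-free up to and including time t."""
    prob = 1.0
    for time, s in curve:
        if time <= t:
            prob = s
        else:
            break
    return prob

# Toy follow-up in months (1 = event, 0 = censored)
times = [6, 8, 12, 12, 20, 24]
events = [1, 0, 1, 1, 0, 1]
curve = kaplan_meier(times, events)
one_year = survival_at(curve, 12)
two_year = survival_at(curve, 24)
print(one_year, two_year)
```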

Table 1 Patient characteristics in the training and validation sets.

Result of feature selection

Feature selection on the training data proceeded through a sequential combination of ICC filtering, univariate analysis, and LASSO-Cox selection, as shown in Fig. 3A. After excluding features that did not reach an intra- and inter-observer ICC > 0.80, 1245 handcrafted, 1089 DL1, 1072 DL2, 978 DL3 and 1062 DL4 RFs with high reproducibility were retained for subsequent analysis. Details of the features selected by univariate Cox regression analysis and the LASSO-Cox model are shown in Supplementary 4. Finally, the 10 most useful intratumoral handcrafted RFs were retained for LC prediction, and 9 features were used for OS prediction. Peritumoral handcrafted RFs and DL features associated with LC and OS are illustrated in Fig. 3B.

Performance of radiomics signatures

The performance of all radscores for predicting LC and OS was evaluated in both training and validation cohorts, as detailed in Table 2. Notably, RS_Had attained the highest C-index for LC prediction (0.72, 95%CI 0.63–0.81) in the validation set. Among the peritumoral radscores, the one generated by 1-mm peritumoral expansion (RS_Peri1mm) gave the best LC prediction, with a C-index of 0.70 (95%CI 0.60–0.77) in the validation set. For the prespecified secondary endpoint OS, the radscore based on 4-mm expansion (RS_Peri4mm) showed the highest C-index, 0.64 (95%CI 0.53–0.74). Differences between the peritumoral radscores were confirmed by the DeLong test (all p < 0.05). Therefore, RS_Peri1mm and RS_Peri4mm were retained as the representative peritumoral radscores for LC and OS prediction, respectively. Additionally, RS_DL4 achieved a C-index of 0.67 (95%CI 0.57–0.77) for LC, and RS_DL2 a C-index of 0.63 (95%CI 0.54–0.70) for OS, in the validation set. Kaplan-Meier curves based on RS_Had, RS_Peri1mm and RS_DL4 for LC (Fig. 4a, b, c; p < 0.05) and on RS_Had, RS_Peri4mm and RS_DL2 for OS (Fig. 4a1, b1, c1; p < 0.05) significantly stratified patients by risk in the validation set. All the radscore models showed better predictive performance than the clinical model in predicting LC. The features of RS_Had and RS_DL are presented in Table S2. The handcrafted radiomic features of RS_Peri1mm and RS_Peri4mm, selected by LASSO-Cox analysis, are listed with their corresponding coefficients in Table S3.

Table 2 Predictive performance (C-index) of intratumoral, different peritumoral, and deep learning radscores for LC and OS prediction of the LRRC patients in the training and validation sets. Peri peritumoral, RS radscore, C-index concordance index, CI confidence interval.

Clinical prognosis factors

Multivariate Cox regression in the training set indicated that 3D-PNCT (HR = 0.47, 95%CI 0.26–0.87, p = 0.016), D₉₀ (HR = 0.99, 95%CI 0.99–1.00, p = 0.001) and V₁₀₀ (HR = 0.94, 95%CI 0.90–0.97, p = 0.001) were significantly associated with LC. DM (HR = 1.84, 95%CI 1.27–2.67, p = 0.001), T stage (HR = 0.36, 95%CI 0.21–0.60, p < 0.001), chemotherapy (HR = 0.04, 95%CI 0.01–0.19, p = 0.033), radiotherapy (HR = 2.05, 95%CI 1.11–3.76, p = 0.020), D₉₀ (HR = 1.00, 95%CI 1.00–1.00, p < 0.001) and V₁₀₀ (HR = 0.95, 95%CI 0.91–0.99, p = 0.016) were prognostic predictors associated with survival outcomes (Table S4). The Kaplan–Meier analyses of these independent clinical and dosimetric factors for predicting LC and OS in LRRC patients are shown in Fig. S4 and Fig. S5, respectively (all p < 0.05). The clinical model had C-indices of 0.66 and 0.67 for predicting LC and OS in the validation set, respectively.

Performance and risk stratification of the RSF and CHR model

Figure S6 shows pairwise Spearman correlation matrices between the independent clinical factors and the selected radscores for predicting LC (A) and OS (B) (all p values > 0.05; correlation range -0.38 to 0.74 for LC and -0.28 to 0.74 for OS). This indicates that the correlation-based feature selection procedure successfully mitigated feature redundancy. We then integrated identical risk predictors into the two models to explore their predictive capability for LC and OS. The RSF model was run 1000 times with n_tree = 1000 and nodesize = 4 to predict LC, and with n_tree = 1500 and nodesize = 14 to predict OS. Table 3 shows the IBS and C-index, with 95%CIs, of the RSF and CHR models in the training and validation sets. In the validation set, the traditional CHR model (IBS: 0.16, 95%CI 0.15–0.17; C-index: 0.72, 95%CI 0.63–0.81) showed significantly inferior performance for predicting LC compared with the RSF model (IBS: 0.13, 95%CI 0.12–0.13, p < 0.001; C-index: 0.76, 95%CI 0.64–0.84, p < 0.01). Similarly, the RSF model (IBS: 0.11, 95%CI 0.10–0.12; C-index: 0.75, 95%CI 0.75–0.77) exhibited superior predictive accuracy and greater generalization capability relative to the CHR model (IBS: 0.17, 95%CI 0.13–0.20; C-index: 0.69, 95%CI 0.60–0.77) for OS prediction.

Table 3 The prognostic performance of the RSF and CHR models integrating the same selected radscores and clinical features in predicting LC and OS for LRRC patients after RISI in the training and validation sets. ¹Comparison of the performance of the CHR model with that of the RSF model in the same dataset. Abbreviations: CHR, Cox hazards regression; RSF, random survival forests; CI, confidence interval.

Furthermore, LRRC patients could be stratified into low- and high-risk groups by the median predicted values of the RSF and CHR models (LC threshold: 19.15; OS threshold: 51.80) in the training and validation sets. The RSF predicted values (training: p < 0.001; validation: p < 0.001; Fig. 5a, b) stratified patients for LC more effectively than those of the CHR model (training: p = 0.009; validation: p = 0.18; Fig. 5c, d). The RSF model (p = 0.026, Fig. 5f) also provided more accurate OS prediction and clearer prognostic stratification than the CHR model (p = 0.32, Fig. 5h) for LRRC patients in the validation set. High-risk patients had significantly worse LC and OS than those in the low-risk group. Compared with the single radscore models, the RSF model predictions achieved more statistically significant risk stratification for LC. Figure 6A and B show the stratification of LRRC patients using the RSF model’s median predicted value for LC. The high-risk group had significantly higher progression rates at 1 year (76.1% vs. 1.5%) and 2 years (100.0% vs. 35.9%) than the low-risk group (p < 0.001). Patients without progression at 1 and 2 years had markedly lower predicted scores than those with tumor progression, with mean differences of 26.05 (95%CI 25.68–28.81; p < 0.001) and 27.76 (95%CI 23.71–32.89; p < 0.001), respectively. Additionally, LRRC patients who survived 1 year (mean RSF predicted value: 44.73 vs. 93.29, p < 0.001, Fig. 6C) and 2 years (mean RSF predicted value: 33.98 vs. 72.75, p < 0.001, Fig. 6D) had significantly lower predicted values than those who died.
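The median-based stratification described above amounts to thresholding the model's predicted values. A minimal sketch follows, assuming that the cut-off is derived from training-set predictions and reused on the validation set, that values equal to the threshold fall in the low-risk group, and that event rates are crude proportions rather than Kaplan–Meier estimates. The function names and all numbers are illustrative only.

```python
from statistics import median

def stratify(predicted, threshold):
    """Label each patient 'high' if the predicted value exceeds the threshold, else 'low'."""
    return ["high" if p > threshold else "low" for p in predicted]

def event_rate(groups, events, label):
    """Crude proportion of patients in one risk group who had the event."""
    flags = [e for g, e in zip(groups, events) if g == label]
    return sum(flags) / len(flags)

# Toy predicted values from a fitted model (higher = worse expected outcome)
train_pred = [5.0, 12.0, 19.0, 30.0, 44.0, 60.0]
threshold = median(train_pred)            # cut-off taken from the training set

valid_pred = [10.0, 50.0, 22.0, 35.0]
valid_events = [0, 1, 0, 1]               # 1 = progression observed
groups = stratify(valid_pred, threshold)

print(threshold, groups)
print(event_rate(groups, valid_events, "high"), event_rate(groups, valid_events, "low"))
```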

Moreover, the tAUC values at 1 and 2 years confirmed the better discriminative ability of the RSF model, which consistently exceeded the CHR model in predicting LC and OS in the training and validation sets (p < 0.05). In the validation set, the tAUCs of the RSF model for 1- and 2-year LC prediction were 0.840 (95%CI 0.758–0.928) and 0.888 (95%CI 0.860–0.997), respectively, and those for 1- and 2-year OS prediction were 0.835 (95%CI 0.801–0.952) and 0.761 (95%CI 0.667–0.918), respectively (Table 4). Finally, the LC and OS predicted by the RSF model agreed well with the observed outcomes (Fig. S7). In decision curve analysis (DCA), the RSF model also showed a better overall net benefit than the CHR model across most threshold probabilities (Fig. S8).

Table 4 The discrimination performance of the RSF and CHR models for predicting LC and OS in the training and validation sets. tAUCs of the RSF and CHR models were calculated and compared in the two sets. CHR Cox proportional hazards regression, tAUC time-dependent area under the curve, RSF random survival forests, CI confidence interval. * p < 0.05, the difference between the RSF and CHR models in the validation set reached statistical significance.

Discussion

Accurate prognostic assessment is critical for enabling clinicians to tailor timely and individualized therapeutic strategies for LRRC patients after RISI treatment. Lu et al.⁶ first reported, in 66 patients with LRRC treated by CT-guided RISI, that only D₉₀ > 130 Gy, D₁₀₀ > 55 Gy or V₁₀₀ > 90% significantly prolonged LC time, and that none of the dosimetric parameters affected OS. However, the utility of these parameters in constructing precise prognostic models is inherently limited by the small cohort size and the reliance on univariate analysis. To date, no prognostic model has been established for this population, mainly because relapsed patients suitable for RISI form a small group, which limits the availability of eligible cohorts for robust model development. To address these limitations, we are the first to develop and validate a machine learning-driven multiscale radiomics prognostic model with a relatively large sample size to optimize clinical decision-making for LRRC patients after RISI treatment.

Radiomics captures more comprehensive and valuable prognostic information related to the tumor surrounding environment and is increasingly recognized in predicting the prognosis of locally advanced rectal cancer^22,23, but it has not been explored in LRRC. In contrast to conventional handcrafted intratumoral RFs, pre-trained DL models have been extensively employed for automated RF extraction, capitalizing on their transfer learning capabilities to overcome the data scarcity prevalent in medical imaging research. Xiao et al. confirmed that CT-based DL features from a pre-trained ResNet-50 significantly enhanced prognostic performance in predicting OS among small-cell lung cancer patients¹⁴. Gong et al. also demonstrated that a deep learning signature using the 3D-DenseNet architecture showed better prognostic performance than a radiomic signature for predicting local recurrence-free survival²⁴. Therefore, we adopted transfer learning to extract DL features from four pre-trained models (ResNet-50/101 and DenseNet121/201) to improve prediction performance. The median LC and OS times represent a clinically actionable threshold at which outcomes meaningfully diverge, giving a clinically interpretable risk stratification. Division at the median also ensures a balanced class distribution, which mitigates model bias toward the majority class and enhances training stability in a limited training dataset (n = 132). The binary cross-entropy loss function was employed to measure the discrepancy between the model’s output and the label. This dichotomization has been validated in some deep learning radiomics studies^14,26.

The peritumoral subclinical regions are recognized as biologically informative niches, containing critical biomarkers such as localized inflammatory activity, microinvasive tumor fronts, and stromal remodeling factors, as demonstrated in other cancer types^15,25. We therefore explored the impact of the peritumoral expansion margin by selecting ROIs at 1–6 mm around the tumor, based on previous studies^25,26,27,28, and compared the predictive performance of radscores constructed from the different peritumoral regions for LC and OS prediction. Gu et al. demonstrated that the 4 mm peritumoral expansion radscore yielded the best prediction of recurrence-free survival, with a C-index of 0.74 (95%CI 0.69–0.79)²⁶. Li et al. demonstrated that the combined radscore incorporating the intratumoral and 3 mm peritumoral regions had the best predictive capacity, with a C-index of 0.800 (95%CI 0.681–0.920)²⁷. Pérez‑Morales et al. developed a predictive model integrating peritumoral and intratumoral CT-based radiomic signatures to estimate OS and PFS in lung cancer patients²⁸. These findings underscore that precise selection of region-specific ROIs is critical, and that synergistic integration of multimodal methodologies significantly enhances prognostic performance. Our findings also demonstrated that RS_peri1mm provided the best predictive accuracy for LC, as this 1-mm expansion corresponding to the clinical target volume might preserve more tumor-related biological information. Moreover, RS_peri4mm demonstrated superior predictive performance compared to the intratumoral radscore. Our analysis revealed that peritumoral expansions contain distinct prognostic information and biological heterogeneity.

Machine learning enables precise prognostic prediction by interpreting complex patterns within high-dimensional data²⁹. The traditional CHR model requires the data to meet the proportional hazards assumption, whereas the RSF model is free of this limitation and has shown excellent predictive performance. Dong et al. found that the RSF model outperformed the Cox regression model in predicting overall survival and stratifying patients after lung transplantation, but they relied only on clinical characteristics for modeling without considering other potential factors¹⁷. Nevertheless, machine learning models have not yet been explored in LRRC patients receiving RISI treatment.

In this study, we confirmed for the first time that the multiscale radiomics-based RSF model outperformed the conventional CHR model in predicting LC and OS, based on the C-index, IBS, tAUC, calibration curve, and DCA in the training and internal validation sets. The RSF model consistently provided more accurate predictions of progressive disease and survival at specific time points, including 1 and 2 years, confirming its robust prognostic value and broadening its potential applications. This approach provides a more comprehensive perspective of tumor biology, thereby enhancing the prognostic performance of the model. This performance is primarily attributable to the RSF model’s capability to capture complex nonlinear interactions between the selected predictors and outcomes, consistent with its application in other heterogeneous tumors. In addition, the RSF predicted value achieved more accurate risk classification for LC and OS than the CHR model for LRRC patients, which is important for identifying the population likely to respond better to treatment.

Several limitations of this study merit consideration. First, although the development of our predictive model employed robust internal validation, the study lacks external validation on independent cohorts. This is mainly because datasets for LRRC patients receiving this specialized treatment are scarce (only a few centers have adequate experience) and because of data-sharing restrictions in multicenter studies. More prospective, multicenter studies are needed to explore the relevant issues and mechanisms and to confirm the generalizability of our models. Second, the tumor delineation for the DL model was based on the largest 2D slice, which may not adequately represent the spatial characteristics of the entire tumor. Future investigations should prioritize 3D analysis to characterize full spatial tumor features. Third, we acknowledge that dichotomizing outcomes at the median to train the DL models discards time-to-event information, potentially limiting predictive precision. We will explore alternatives that respect survival timing, for example using the Cox partial log-likelihood as the loss function to train the LC and OS signature-building networks in a larger training cohort. Finally, we will explore potential dosiomics and MRI biomarkers carrying more prognostic information to further improve prediction performance for LRRC patients suitable for RISI therapy.

Conclusions

In conclusion, this study investigated the prognostic value of intratumoral, peritumoral, and deep learning features for predicting LC and OS. The RSF model incorporating multiscale radscores and clinical features may outperform the traditional CHR model in LC and OS prediction and risk stratification. Despite some limitations, this study is the first to introduce an RSF model integrating clinical variables with intra- and peritumoral radiomics for prognostic prediction in LRRC patients after RISI, and the proposed method may have significant potential for future clinical application.

Data availability

The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.

Abbreviations

OS: Overall survival
LC: Local control
LRRC: Locally recurrent rectal cancer
RISI: Radioactive ¹²⁵I seed implantation
RSF: Random survival forest
CHR: Cox hazard regression
RFs: Radiomics features
C-index: Concordance index
IBS: Integrated Brier score
tAUC: Time-dependent area under the curve
NCCN: National Comprehensive Cancer Network
CNN: Convolutional neural network
RS: Radscore
CI: Confidence interval
ICC: Intraclass correlation coefficient
LASSO: Least absolute shrinkage and selection operator
DM: Distant metastasis
GTV: Gross tumor volume
ROI: Region of interest

References

Räsänen, M. et al. Pattern of rectal cancer recurrence after curative surgery. Int. J. Colorectal Dis. 30 (6), 775–785 (2015).
Cai, Y. et al. Prognostic factors associated with locally recurrent rectal cancer following primary surgery (Review). Oncol. Lett. 7 (1), 10–16 (2014).
Wang, H. et al. The dosimetry evaluation of 3D printing non-coplanar template-assisted CT-guided ¹²⁵I seed stereotactic ablation brachytherapy for pelvic recurrent rectal cancer after external beam radiotherapy. J. Radiat. Res. 62 (3), 473–482 (2021).
National Comprehensive Cancer Network (NCCN). The NCCN rectal cancer clinical practice guidelines in oncology (version 2 2020) (2020).
Wang, Z. M. et al. Clinical application of CT-guided ¹²⁵I seed interstitial implantation for local recurrent rectal carcinoma. Radiat. Oncol. 6, 138 (2011).
Wang, L., Wang, H., Jiang, Y. et al. The efficacy and dosimetry analysis of CT-guided ¹²⁵I seed implantation assisted with 3D-printing non-co-planar template in locally recurrent rectal cancer. Radiat. Oncol. 15 (1), 179 (2020).
Gillies, R. J., Kinahan, P. E. & Hricak, H. Radiomics: images are more than pictures, they are data. Radiology 278 (2), 563–577 (2016).
Hatt, M. et al. Radiomics: data are also images. J. Nucl. Med. 60 (Suppl 2), 38S–44S (2019).
Aerts, H. J. et al. Decoding tumour phenotype by noninvasive imaging using a quantitative radiomics approach. Nat. Commun. 5, 4006 (2014).
Hu, Y. et al. Assessment of intratumoral and peritumoral computed tomography radiomics for predicting pathological complete response to neoadjuvant chemoradiation in patients with esophageal squamous cell carcinoma. JAMA Netw. Open. 3 (9), e2015927 (2020).
Lin, C. H. et al. Prognostic value of interim CT-based peritumoral and intratumoral radiomics in laryngeal and hypopharyngeal cancer patients undergoing definitive radiotherapy. Radiother. Oncol. 189, 109938 (2023).
Zhong, L. Z. et al. A deep learning MR-based radiomic nomogram may predict survival for nasopharyngeal carcinoma patients with stage T3N1M0. Radiother. Oncol. 151, 1–9 (2020).
Yihuai, H. et al. Computed tomography-based deep-learning prediction of neoadjuvant chemoradiotherapy treatment response in esophageal squamous cell carcinoma. Radiother. Oncol. 154, 6–13 (2021).
Zheng, X. et al. Predicting overall survival and prophylactic cranial irradiation benefit in small-cell lung cancer with CT-based deep learning: A retrospective multicenter study. Radiother. Oncol. 195, 110221 (2024).
Cao, C. et al. Machine learning-based radiomics analysis for predicting local recurrence of primary dermatofibrosarcoma protuberans after surgical treatment. Radiother. Oncol. 186, 109737 (2023).
Ishwaran, H. et al. Random survival forests for high-dimensional data. Stat. Anal. Data Min. 4, 115–132 (2011).
Tian, D. et al. Machine learning-based prognostic model for patients after lung transplantation. JAMA Netw. Open. 6 (5), e2312022 (2023).
Eisenhauer, E. A. et al. New response evaluation criteria in solid tumours: revised RECIST guideline (version 1.1). Eur. J. Cancer. 45 (2), 228–247 (2009).
Hatt, M. et al. IBSI: an international community radiomics standardization initiative. J. Nucl. Med. 59, 287 (2018).
Aerts, H. J. et al. The image biomarker standardization initiative: standardized quantitative radiomics for high-throughput image-based phenotyping. Radiology 295 (2), 328–338 (2020).
Krizhevsky, A. et al. ImageNet classification with deep convolutional neural networks. Commun. ACM. 60, 84–90 (2017).
Zhang, S. et al. Improving prognosis and assessing adjuvant chemotherapy benefit in locally advanced rectal cancer with deep learning for MRI: A retrospective, multi-set study. Radiother. Oncol. 188, 109899 (2023).
Cui, Y. et al. Prognostic value of multiparametric MRI-based radiomics model: potential role for chemotherapeutic benefits in locally advanced rectal cancer. Radiother. Oncol. 154, 161–169 (2021).
Gong, J. et al. CT-based radiomics nomogram may predict local recurrence-free survival in esophageal cancer patients receiving definitive chemoradiation or radiotherapy: A multicenter study. Radiother. Oncol. 174, 8–15 (2022).
Lin, C. H. et al. Prognostic value of interim CT-based peritumoral and intratumoral radiomics in laryngeal and hypopharyngeal cancer patients undergoing definitive radiotherapy. Radiother. Oncol. 189, 109938 (2023).
Gu, Q. et al. Multiscale deep learning radiomics for predicting recurrence-free survival in pancreatic cancer: A multicenter study. Radiother. Oncol. 205, 110770 (2025).
Li, Q. et al. Intratumoral and peritumoral CT radiomics in predicting prognosis in patients with chondrosarcoma: a multicenter study. Insights into Imaging. 15, 1–9 (2024).
Pérez-Morales, J. et al. Peritumoral and intratumoral radiomic features predict survival outcomes among patients diagnosed in lung cancer screening. Sci. Rep. 10, 10528 (2020).
Rajkomar, A., Dean, J. & Kohane, I. Machine learning in medicine. N. Engl. J. Med. 380 (14), 1347–1358 (2019).

Acknowledgements

We are grateful to all the participants in this study and anonymous reviewers for reading and commenting on the manuscript.

Funding

This work was supported by the Innovation & Transfer Fund of Peking University Third Hospital (Grant No. BYSYZHKC2021113); the National multidisciplinary cooperative diagnosis and treatment capacity building project for major diseases: comprehensive diagnosis and treatment of gastrointestinal tumors; the Beijing Lianying Intelligent Imaging Technology Research Institute Hospital-enterprise Joint Research and Development Platform Fund (No. H79462-07); and the National Natural Science Foundation of China (Grant No. U1867210).

Author information

Authors and Affiliations

School of Physics, Beihang University, Beijing, 100191, China
Yun Qin, Wei Zhao & Lihua Zhu
Department of Radiation Oncology, Peking University Third Hospital, Beijing, 100191, China
Yun Qin, Xuemin Li, Haitao Sun, Junjie Wang & Hao Wang
Cancer Center, Peking University Third Hospital, Beijing, China
Hao Wang

Contributions

YQ, HS, XL and HW were responsible for the collection of CT images and clinical data, and had full access to all of the data in the study. YQ and HW drafted the manuscript. JW was in charge of verifying patients’ implantation plans and directing the writing. QY was responsible for research design and data analysis. HW, WZ, and LZ were responsible for the critical revision of the manuscript for important intellectual content. HS was responsible for the design and production of radioactive seed implantation plans and 3D-PNCT. All authors read and approved the final manuscript.

Corresponding authors

Correspondence to Junjie Wang or Hao Wang.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary Material 1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

About this article

Cite this article

Qin, Y., Li, X., Sun, H. et al. Machine learning prognosis model for locally recurrent rectal cancer patients after radioactive ¹²⁵I seed implantation. Sci Rep 16, 2679 (2026). https://doi.org/10.1038/s41598-025-32579-6

Received: 12 August 2025
Accepted: 10 December 2025
Published: 17 December 2025
Version of record: 21 January 2026
DOI: https://doi.org/10.1038/s41598-025-32579-6
