Abstract
The study investigates the correlation between CD3 T-cell expression levels and cervical cancer (CC) while developing a magnetic resonance (MR) imaging-based radiomics model for preoperative prediction of CD3 T-cell expression levels. Prognostic correlations between CD3D, CD3E, and CD3G gene expressions and various cancers were analyzed using the Cancer Genome Atlas (TCGA) database. Protein–protein interaction (PPI) analysis via the STRING database identified associations between these genes and T lymphocyte activity. Gene Set Enrichment Analysis (GSEA) revealed immune pathway enrichment by categorizing genes based on CD3D expression levels. Correlations between immune checkpoint molecules and CD3 complex genes were also assessed. The study retrospectively included 202 patients with pathologically confirmed early-stage CC who underwent preoperative MRI, divided into training and test groups. Radiomic features were extracted from the whole-lesion tumor region of interest (ROItumor) and from peritumoral regions with 3 mm and 5 mm margins (ROI3mm and ROI5mm, respectively). Various machine learning algorithms, including Support Vector Machine (SVM), Logistic Regression, Random Forest, AdaBoost, and Decision Tree, were used to construct radiomics models based on different ROIs, and diagnostic performances were compared to identify the optimal approach. The best-performing algorithm was combined with intra- and peritumoral features and clinically relevant independent risk factors to develop a comprehensive predictive model. Analysis of the TCGA database demonstrated significant associations between CD3D, CD3E, and CD3G expressions and several cancers, including CC (p < 0.05). PPI analysis highlighted connections between these genes and T lymphocyte function, while GSEA indicated enrichment of immune-related pathways linked to CD3D. Immune checkpoint correlations showed positive associations with CD3 complex genes. Radiomics analysis selected 18 features from ROItumor and ROI3mm across MRI sequences. The SVM algorithm achieved the highest predictive performance for CD3 T-cell expression status, with an area under the curve (AUC) of 0.93 in the training group and 0.92 in the test group. This MR-based radiomics model effectively predicts CD3 expression status in patients with early-stage CC, offering a non-invasive tool for preoperative assessment of CD3 expression, but its clinical utility needs further prospective validation.
Similar content being viewed by others
Introduction
Cervical cancer (CC) is the most prevalent malignancy of the female reproductive system1,2. According to the World Health Organization (WHO), approximately 660,000 new cases and 350,000 deaths from CC occur globally each year1. In China alone, over 110,000 new diagnoses and 61,000 deaths are reported annually2. While traditional treatments such as surgery, radiotherapy, and chemotherapy are effective for early-stage CC, there is a growing need to explore innovative approaches that could further improve patient outcomes3. This highlights the urgent need for innovative clinical treatments. Immunotherapy, including immune checkpoint inhibitors (ICIs), antibody–drug conjugates (ADCs), and targeted therapies, has emerged as a promising approach for advanced and metastatic CC. However, patient responses to these therapies are highly variable4,5. Existing biomarkers such as PD-L1 and MSI-H have significant limitations in predicting immunotherapy outcomes4, underscoring the necessity of developing accurate predictive models to identify potential responders and optimize treatment strategies.
Recent studies increasingly emphasize the immunological roles of T cell subtypes, but their prognostic and therapeutic significance across different cancers remains insufficiently understood6,7. Histopathological evidence frequently demonstrates T lymphocyte infiltration within tumor nests, where these immune cells contribute to the tumor microenvironment and influence cancer progression and treatment outcomes8,9. Single-cell sequencing has revealed significant heterogeneity in T lymphocyte populations across malignancies, correlating with varying responses to radiotherapy, chemotherapy, and immunotherapy8,10. As a result, identifying reliable predictive biomarkers to enhance the efficacy of immunotherapy has become a critical focus in CC research.
At the molecular level, the CD3 complex (comprising CD3D, CD3E, and CD3G) is a pivotal marker for T lymphocyte identification and profiling, making it an essential target for immunological investigations11,12. This study utilized pan-cancer analysis of data from The Cancer Genome Atlas (TCGA) to evaluate the prognostic significance of T lymphocyte infiltration across various malignancies. Despite these advancements, objective and non-invasive methods for assessing T lymphocyte abundance in clinical settings remain unavailable.
Radiomics, a rapidly evolving field in medical imaging, presents a non-invasive and promising methodology for assessing the tumor immune microenvironment13,14,15. By extracting quantitative features from imaging modalities such as computed tomography (CT) and magnetic resonance imaging (MRI), radiomics enables phenotypic characterization of the immune landscape within tumors14. This approach deepens the understanding of tumor-immune interactions and supports the development of personalized therapeutic strategies with the potential to improve clinical outcomes8. Recent studies have increasingly investigated radiomics to identify associations between imaging features and tumor immune markers11,12,14,16. For example, CT-derived radiomic features have demonstrated potential in predicting CTLA4 expression in clear cell renal cell carcinoma17, while MRI-based radiomic features have shown efficacy in estimating CD3 + T lymphocyte levels in glioblastoma18.
Focusing on CC, where T lymphocyte infiltration significantly influences disease progression and therapeutic outcomes, this study leverages immunohistochemistry to detect CD3 as a marker of T cell presence in tumor tissues. A radiomics model was developed to identify imaging features indicative of CD3 expression levels, offering a non-invasive method to evaluate variations in T lymphocyte abundance. These radiomic features provide valuable insights into tumor immune status, aiding in the prediction of immunotherapy responses and enhancing prognostic assessments for patients with CC. In summary, this approach enables more personalized and effective treatment strategies.
Materials and methods
Data acquisition and comprehensive analysis of T lymphocytes in cancer
To investigate the clinical significance of T lymphocytes, CD3D, CD3E, and CD3G were selected as candidate gene markers. Normal tissue data from the GTEx database (https://gtexportal.org/) were integrated with tumor tissue data from The TCGA database (https://cancergenome.nih.gov) for a comprehensive joint analysis. Samples with zero gene expression values were excluded to mitigate potential biases arising from technical artifacts or non-expression states. RNA sequencing data underwent processing and standardization using the Toil pipeline, followed by log2 transformation (value + 1). Analyses included Kaplan–Meier (K-M) survival analysis, clinical parameter evaluation, immune checkpoint correlation analysis, and gene set enrichment analysis (GSEA), conducted via the Xiantao Academic platform with the R package (version 4.2.1; https://www.xiantaozi.com/).
The study analyzed 16 cancer types: bladder urothelial carcinoma (BLCA), breast invasive carcinoma (BRCA), cervical squamous cell carcinoma and endocervical adenocarcinoma (CESC), cholangiocarcinoma (CHOL), colon adenocarcinoma (COADREAD), esophageal carcinoma (ESCA), head and neck squamous cell carcinoma (HNSC), liver hepatocellular carcinoma (LIHC), lung adenocarcinoma and squamous cell carcinoma (LUADLUSC), ovarian serous cystadenocarcinoma (OV), pancreatic adenocarcinoma (PAAD), prostate adenocarcinoma (PRAD), rectum adenocarcinoma (READ), skin cutaneous melanoma (SKCM), stomach adenocarcinoma (STAD), and uterine corpus endometrial carcinoma (UCEC). Kaplan–Meier survival curves for these cancers were plotted, categorizing samples into high- and low-expression groups based on median gene expression levels to reduce subjective bias.
Protein–protein interaction (PPI) analysis utilized the STRING database (https://string-db.org/) with a confidence score threshold > 0.7. GSEA identified significant pathway differences between high- and low-expression groups, with criteria for significance set at a false discovery rate (FDR) < 0.25 and a p-value < 0.05.
Immunomonitoring correlation analysis examined the relationship between CD3D expression and immune checkpoint molecules, with results visualized in heatmaps to elucidate the immune landscape.
Immunohistochemistry (IHC) was performed using the Ventana Benchmark ULTRA automated staining system (Ventana Medical Systems, Tucson, AZ). Staining utilized a mouse monoclonal anti-CD3 antibody (clone 12,730, 1:100 dilution; Santa Cruz Biotechnology), with 3,3′-diaminobenzidine (DAB) as the chromogen. Positive controls confirmed strong CD3 staining in cervical carcinoma tissues, while phosphate-buffered saline (PBS) served as the negative control. Tissue images were scanned using the TEKsqray digital slide scanner (Shenzhen Shengqiang Technology Co., Ltd, China).
Patients and clinical data
This retrospective study received approval from the Institutional Review Board (IRB) of Xiangyang Centre Hospital, with informed consent waived due to its retrospective nature. All methods were performed in accordance with the relevant guidelines and regulations, including the Declaration of Helsinki. Patients with pathologically confirmed early-stage CC treated at the hospital between December 2020 and December 2023 were enrolled. Inclusion criteria comprised: (1) a diagnosis of early-stage CC (FIGO stages IA2-IIA) confirmed by surgical pathology, (2) availability of preoperative multi-parametric MRI (mpMRI) scans, (3) complete pathological data including CD3 expression status, and (4) no history of gynecological malignancies, prior pelvic surgeries, or previous radiotherapy or chemotherapy. Patients were excluded if they had undergone cervical conization, presented low-quality MRI images, or had lesions smaller than 5 mm in diameter on MRI. A total of 202 patients were included in the analysis, comprising 119 with low CD3 expression and 83 with high CD3 expression (Fig. 1). We observed that the FIGO stage and histological grade were correlated with the number of CD3 + T cells (Figure S1A,B). Based on this observation, we determined a cutoff value to distinguish CD3 expression levels, which was subsequently used as the classification criterion in our follow-up studies. High CD3 expression was defined as the presence of at least 20 CD3-positive cells per 1000 square micrometers of tissue (Fig. 2A, B).
Immunohistochemistry and T2W imaging results from patients with high and low CD3 expression. Immunohistochemical results (scale bar: 50 µm) and the original image with a mask highlighting low CD3 expression are shown in (A) and (C), respectively; corresponding images with masks highlighting high CD3 expression are shown in (B) and (D).
Clinical and laboratory data were extracted from electronic medical records, encompassing CD3 expression status, patient age, clinical symptoms, obstetric history, blood tumor biomarkers, and peripheral blood counts.
MRI acquisition and image analysis
All participants underwent mpMRI within 2 weeks prior to surgery, using a 3.0 T MRI scanner. The MRI protocol included axial T2-weighted imaging (T2WI) with fat suppression, diffusion-weighted imaging (DWI), and contrast-enhanced T1-weighted imaging (CE-MRI) sequences. Detailed scanning parameters for each sequence are listed in Table 1.
MRI images were independently reviewed by Radiologist A, an abdominal and pelvic imaging specialist with 5 years of experience. Radiologist A conducted the analysis blinded to clinical and histopathological data to minimize bias. For quantitative assessment, apparent diffusion coefficient (ADC) values were calculated using FireVoxel software (version 4.23; https://www.firevoxel.org/). Additionally, the presence of an intact dark stromal ring and the longest lesion diameter on axial T2WI images were recorded.
Tumor segmentation, feature extraction, and selection
Tumor segmentation was conducted using the Deepwise Multimodal Research Platform (version 2.3, Beijing Deepwise & League of PHD Technology Co., Ltd, Beijing, China; https://keyan.deepwise.com), which enables precise three-dimensional (3D) delineation of tumor regions. Manual 3D regions of interest (ROIs) were delineated on DWI with a b-value of 800 s/mm2, T2WI, and CE-MRI images (Fig. 2C, D). The segmented ROIs from DWI were mapped onto corresponding ADC maps for consistency and analysis. Peritumoral regions with 3 mm (ROI3mm) and 5 mm (ROI5mm) radial thicknesses were automatically generated using the Deepwise platform to facilitate the analysis of peritumoral features.
Initial segmentation was performed by Radiologist A and subsequently reviewed and refined by Radiologist B, an expert with over 10 years of experience in gynecological oncology. To ensure the reproducibility of ROI segmentation, intra-observer agreement was assessed using the intra-class correlation coefficient (ICC). Thirty randomly selected patients underwent repeat segmentation after a 1-month interval, providing an objective measure of reliability and reproducibility.
To standardize datasets, all images were resampled to a uniform resolution of 1 × 1 × 1 mm using B-spline interpolation. Radiomic feature extraction was performed on the segmented regions using the Deepwise platform, encompassing first-order statistics, shape features, and texture features. Texture features included the gray-level co-occurrence matrix (GLCM), gray-level size zone matrix (GLSZM), gray-level run length matrix (GLRLM), and gray-level difference matrix (GLDM). Additionally, features were derived from preprocessed images using methods such as wavelet filtering, Laplacian of Gaussian (LoG) filtering, Gradient, LBP2D (Local Binary Pattern 2D), and nonlinear intensity transformations (e.g., square, square root, logarithm, exponential). A total of 1,502 radiomic features were extracted for each tumor volume, ROI3mm, and ROI5mm across ADC, T2WI, and CE-MRI sequences. Detailed categorization and statistics of radiomic features are provided in Supplementary Table S1.
To ensure the reliability and effectiveness of our radiomics features, we implemented a rigorous three-step feature selection process. Initially, the Intraclass Correlation Coefficient (ICC) with a threshold of > 0.8 was employed to evaluate the reproducibility of features across repeated measurements or observers. Features with high ICC values were retained to ensure robustness and reliability, consistent with best practices in radiomics research. Subsequently, we performed a pairwise Pearson correlation analysis to eliminate highly collinear features (threshold: |r|> 0.8), which helps to reduce feature redundancy and mitigate multicollinearity effects during model training. Finally, the F-test (ANOVA) served as a univariate filter method to assess the discriminatory capacity of each feature in distinguishing outcome classes, preserving features exhibiting significant statistical relevance for classification purposes.
Model construction and evaluation
Predictive models were constructed using five established machine learning algorithms: Support Vector Machine (SVM), Logistic Regression, Random Forest, AdaBoost, and Decision Tree. These models were trained on features extracted from multiple mpMRI sequences, including CE-MRI, T2WI, and ADC images. Given the high dimensionality and complexity of medical imaging data, coupled with the risk of overfitting, SVM was selected as the final algorithm due to its superior performance and robustness.
We performed hyperparameter tuning using grid search on the training set for all five machine learning models evaluated in this study. The best-performing hyperparameter combination for each algorithm was selected based on performance on test group, rather than k-fold cross-validation. And default settings were not used for any of the models. Details of the hyperparameter optimization strategy for five machine learning models are provided in Supplementary Table S2.
To explore the impact of tumor heterogeneity and the peritumoral microenvironment, models were developed using tumor ROIs and varying peritumoral ROIs, specifically the tumor ROI (ROItumor) and peritumoral rings at 3 mm (ROI3mm) and 5 mm (ROI5mm) from the tumor boundary. This approach sought to determine the optimal peritumoral ROI for predictive modeling (Supplementary Table S3).
A clinical model was also constructed by incorporating significant clinical risk factors derived from clinical, laboratory, and conventional imaging characteristics. The top-performing radiomics model was subsequently integrated with the clinical model to form a comprehensive predictive model designed to enhance overall performance. In the radiomics analysis, considering the definition of high CD3 expression, the distribution of FIGO subgroups was regarded as a potential confounding factor. We constructed a distribution table for high and low CD3 expression groups across FIGO IA, IB, and IIA subgroups and evaluated the differences between groups using the chi-square test. Additionally, we explored the impact of incorporating FIGO staging as a variable into the clinical model and compared it with the model without this variable using the Delong test to assess the differences between them.
Model discrimination was evaluated through receiver operating characteristic (ROC) curve analysis, with metrics including area under the curve (AUC), accuracy, sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV). Calibration and decision utility were assessed using calibration curves and Decision Curve Analysis (DCA) for both training and test cohorts. Additionally, SHapley Additive exPlanations (SHAP) values were employed to interpret the contribution of individual features to predictions, providing transparency into the model’s decision-making process.
Statistical analysis
Statistical analyses were conducted using the Deepwise Multimodal Research Platform (version 2.3) and Xiantao Academic website with the R package (version 4.2.1; https://www.xiantaozi.com/). Continuous variables were tested for normality and variance homogeneity, with comparisons made using the Student’s t-test or the Wilcoxon rank-sum test, as appropriate. Categorical variables were compared using the Chi-square test or Fisher’s exact test.
Survival analysis was performed using the Kaplan–Meier method, with log-rank tests for comparing survival curves between groups. The “survival” (version 3.3.1), “survminer” (version 0.4.9), and “ggplot2” (version 3.3.6) R packages were utilized for visualization. PPI analysis was conducted using the STRING database with a confidence score threshold > 0.7 to ensure reliability. GSEA compared CD3D high- and low-expression groups using DESeq2 and the “clusterProfiler” package, identifying significant pathways (FDR < 0.25) and visualizing results with “ggplot2.”
The relationship between CD3D expression and immune checkpoint molecules was assessed via Spearman correlation analysis, with results visualized as a heatmap using “ggplot2.” Statistical significance was set at a p-value < 0.05. All data preprocessing steps were performed in accordance with CLEAR (Checklist for Artificial Intelligence in Medical Imaging) guidelines. The CLEAR checklist is provided in Supplementary 2.
Results
Bioinformatics analysis in the TCGA database
Prognostic analysis of 16 common malignant tumor types in the TCGA database demonstrated significant correlations between the expressions of CD3D, CD3E, and CD3G and cancers such as uterine endometrial cancer, ovarian cancer, CC, cutaneous malignant melanoma, breast cancer, and head and neck squamous cell carcinoma (p < 0.05) (Fig. 3). PPI analysis via the STRING database identified CD3E, CD3G, CD2, CD4, CD8A, CD247, LCK, TRAT1, and ZAP70 as closely related genes to CD3D.
GSEA, categorizing genes into high- and low-expression groups based on CD3D levels, revealed significant enrichment of immune-related pathways associated with CD3D across all tumor types (Fig. 4).
Correlation analysis of CD3D expression with immune checkpoint molecules—including BTLA, CD27, CD274, CTLA4, ICOS, LAG3, TIGIT, TNFRSF4, and TNFRSF9—showed strong positive correlations with CD3D, CD3E, and CD3G (Fig. 5). These results highlight the potential of immune checkpoint inhibitors to modulate T lymphocyte populations and shape the anti-cancer immune microenvironment.
Baseline characteristics of the clinical cohort
The study included 202 patients with early-stage CC, consisting of 119 with low CD3 expression and 83 with high CD3 expression. Patient ages ranged from 26 to 77 years (mean age: 52.3 years). Detailed clinicopathological characteristics are summarized in Table 2.
Within the training cohort, significant differences were observed between low and high CD3 expression groups in terms of the presence of an intact stromal ring and serum SCC levels. Additionally, the distribution of the longest tumor diameter varied significantly between the two groups in both the training (p = 0.02) and test (p = 0.047) cohorts. No other clinicopathological parameters showed significant differences (p > 0.05).
Feature selection
From T2WI images, ICC analysis yielded 1487, 1466, and 1479 features from ROItumor, ROI3mm, and ROI5mm, respectively. For ADCmap images, 1467, 1472, and 1487 features were extracted from ROItumor, ROI3mm, and ROI5mm, respectively. CE-MRI images provided 1480, 1465, and 1484 features from the same regions.
Feature correlation analysis and F-test selection reduced dataset dimensionality, retaining 18 features for the combined ROItumor + ROI3mm model derived from mpMRI sequences (Supplementary Table S4).
Univariate analysis of clinical and imaging characteristics (Table 3) revealed significant differences in platelet (PLT) count and tumor longest diameter between low and high CD3 expression groups within the training cohort (p < 0.05). These variables were included in subsequent multivariate analyses and model development to enhance predictive performance.
Chi-square testing revealed significant differences in CD3-high/CD3-low distribution across FIGO sub-stages (χ2 = 77.919, p < 0.001), with higher CD3 expression observed in High stages (IIA 61.4% vs. IB 38.6% vs. IA 0%) (Supplementary Figure S1B and Table S5). Incorporating FIGO stage into the clinical model minimally affected the prognostic significance of CD3 status (AUC: 0.738 to 0.772, 95% CI 0.693–0.850). Delong’s test showed no significant difference in ROC curve performance between models with and without FIGO stage (p = 0.564, 0.685) (Supplementary Table S6 and S7). Due to these findings and potential multicollinearity between FIGO stage and existing covariates, the original model (without FIGO stage) was retained.
Model construction and evaluation
The performance of machine learning algorithms utilizing mpMRI data is summarized in Table 4 and visualized in Figs. 6A and B. Five algorithms—SVM, Logistic Regression, Random Forest, AdaBoost, and Decision Tree—were assessed for their predictive capabilities. Among these, the SVM model demonstrated superior predictive performance in both training and test cohorts, as indicated by the ROC curves.
Models incorporating ROItumor, ROI3mm, and ROI5mm from CE-MRI, T2WI, and ADC sequences were evaluated across training and test groups (Supplement Table S2). The inclusion of ROI3mm significantly improved predictive efficacy for CD3 expression states compared to models using ROI5mm, leading to the selection of ROItumor and ROI3mm for the final radiomics models to optimize sensitivity and specificity.
Table 5 details the AUC with 95% confidence intervals (CI), accuracy, sensitivity, specificity, PPV, and NPV for the clinical model, radiomics models (ROItumor + ROI3mm from mpMRI sequences), and the comprehensive model integrating radiomic and clinical data. Figures 6C and D depict the ROC curves for these models in both training and test cohorts. Notably, the comprehensive model, which combines radiomic features with clinical parameters, achieved the highest AUC values, reaching 0.93 (95% CI 0.88–0.97) in the training cohort and 0.92 (95% CI 0.84–0.99) in the test cohort.
The influence of model variables was evaluated using SHAP values within the SVM algorithm, as shown in Fig. 7. SHAP analysis identified gradient_glcm_MCC_ADC as the most impactful feature in predicting outcomes, followed by lbp-2D_firstorder_Skewness_T1C, which demonstrated a positive contribution to the model’s predictions. Additional features, such as log-sigma-5-0-mm-3D_firstorder_InterquartileRange_ADC and exponential_firstorder_Minimum_T2W, also contributed significantly, highlighting their importance in the model’s predictive performance.
Assessment of the relative influence of model variables in the SVM algorithm, based on SHAP values. Feature0: wavelet-HH_glszm_GrayLevelNonUniformityNormalized_ADC. Feature1: log-sigma-5–0-mm-3D_firstorder_InterquartileRange_ADC. Feature2: lbp-2D_firstorder_Kurtosis_ADC. Feature3: wavelet-HH_glrlm_LongRunEmphasis_ADC. Feature4: wavelet-LH_glcm_Correlation_ADC. Feature5: wavelet-HL_gldm_DependenceVariance_ADC_3mm. Feature6: gradient_glcm_MCC_ADC. Feature7: lbp-2D_glcm_JointEntropy_ADC. Feature8: lbp-2D_firstorder_InterquartileRange_ADC. Feature9: wavelet-LH_gldm_LargeDependenceLowGrayLevelEmphasis_ADC. Feature10: gradient_glcm_Imc2_ADC. Feature11: log-sigma-5–0-mm-3D_glcm_InverseVariance_T1C. Feature12: exponential_firstorder_Minimum_T2W. Feature13: log-sigma-5–0-mm-3D_glcm_DifferenceVariance_T1C. Feature14: logarithm_glszm_HighGrayLevelZoneEmphasis_T2W. Feature15: lbp-2D_firstorder_Skewness_T1C. Feature16: wavelet-HH_ngtdm_Busyness_ADC. Feature17: lbp-2D_gldm_DependenceVariance_T1C.
Calibration curve analysis and DCA were conducted for the clinical model, radiomics model (ROItumor + ROI3mm from mpMRI sequences), and the comprehensive model, indicating excellent calibration and clinical utility in both training and test cohorts (Fig. 8). Calibration curves revealed a strong alignment between predicted probabilities and actual outcomes, with the comprehensive model showing the best agreement, further validating its reliability. DCA demonstrated the clinical utility of the models across a range of threshold probabilities, with the comprehensive model offering the highest net benefit compared to the radiomics model for threshold probabilities between 0 and 0.4. This finding highlights the comprehensive model’s superior capability to balance sensitivity and specificity, underscoring its robustness and practical value in supporting clinical decision-making.
Discussion
This study developed and validated machine learning models to predict CD3 T cell levels in early-stage CC using radiomic features extracted from tumor regions and 3 mm peritumoral rings on T2WI, ADC, and CE-MRI images. The findings demonstrate a strong association between CD3 complex gene expression and several cancers, including CC, highlighting its potential as a prognostic biomarker. By integrating radiomic features with clinical data, the comprehensive model provides a promising non-invasive method for predicting CD3 expression status. These results underscore the utility of combining radiomics and clinical data to optimize immunotherapy strategies and improve patient outcomes in CC management.
In CC, the extent of T lymphocyte infiltration is a critical indicator of the tumor microenvironment’s immune status and correlates with patient outcomes10,19,20,21. Previous research has shown that higher T lymphocyte levels within the tumor microenvironment enhance cytotoxic activity against tumor cells, thereby improving prognosis10,11,19. Consistent with these findings, the current analysis of TCGA data demonstrates that increased T lymphocyte abundance is associated with better prognoses across various cancers, including CC. Routine immunohistochemical assessments have reliably identified T lymphocytes in cancer tissues12. TCGA analysis has assisted radiomics in screening biomarkers and revealed the correlation between imaging features and tumor biological behavior. Narang et al.18 confirmed that texture features of glioma are associated with CD3 T cell infiltration, with the model’s predictive AUC reaching 0.847. He et al.17 constructed a CT radiomics model to predict CTLA4 expression and survival in ccRCC. TCGA analysis may provide a broader molecular basis, such as immune-related gene expression profiles, enhance the integration of radiomics with immunological biomarkers, highlighting the potential of multimodal approaches to improve prognostic accuracy and guide personalized cancer treatment. The combination of these two studies highlights the potential of multimodal approaches. Building on this knowledge, this study focused on identifying MRI-derived imaging biomarkers reflective of T lymphocyte infiltration in CC, aiming to facilitate pre-treatment prediction of immune status and improve treatment planning.
Traditional radiomic studies have primarily focused on analyzing primary tumors, often neglecting peritumoral regions, which play a critical role in the tumor microenvironment22,23. Advances in peritumoral radiomics have shown its diagnostic and prognostic potential across various systemic diseases22,23,24. Although definitions of the peritumoral region vary, regions spanning 0 to 10 mm from the tumor boundary—particularly 3 mm and 5 mm—are commonly analyzed25,26,27,28. In some cancers, the peritumor region may exhibit more pronounced changes in cellular density, necrosis, or structural organization, which could impact the peritumoral optimal distance for radiomic analysis. In this study, radiomic features extracted from 3 and 5 mm peritumoral rings demonstrated that the 3 mm ring provided superior diagnostic accuracy, as reflected by AUC values. These findings align with growing evidence that the immediate peritumoral microenvironment significantly influences tumor-immune interactions and cancer progression.
Radiomic analysis ultimately identified 18 features from each tumor and the 3 mm peritumoral region to construct a predictive model for CD3 expression levels in CC. These features included six wavelet features, four first-order features, three morphological features, and five additional features. SHAP analysis highlighted GLCM features, first-order statistics, and wavelet transform features as the most influential in the model’s predictions. Among these, gradient_glcm_MCC_ADC had the highest contribution, reflecting tumor boundary clarity and internal structural consistency. GLCM plays a crucial role in quantitatively analyzing tumor heterogeneity by reflecting the distribution and positional relationships of pixels, thereby revealing the spatial irregularity within tumors29,30. Higher GLCM contrast indicates greater spatial heterogeneity, which is linked to microstructural complexity in tumors. Previous studies have also provided some clues that prognostic value of GLCM-based features in oesophageal cancer31. First-order and wavelet-based features captured the biological properties and heterogeneity of the tumors, enhancing the model’s ability to characterize tumor behavior and morphology32,33. Although characteristics from all imaging sequences contribute significantly, ADC-based features predominate in our model, which is probably due to their increased sensitivity in identifying minor yet crucial elements of tumor microenvironment. Previous studies have also reported the prognostic value of ADC-based radiomic features in some tumors, underscoring its potential to encapsulate biologically relevant information15,34.
The comprehensive model incorporated clinical risk factors, including the longest tumor diameter on axial T2WI images and PLT count. Tumors in the high CD3 expression group tended to be larger, with evidence of deeper stromal or parametrial invasion35. This aligns with the 2018 FIGO guidelines, which stress the importance of MRI-based tumor size measurement for accurate staging34. Specifically, the longest diameter on axial T2WI was chosen to correspond with transverse stromal penetration depth36. Elevated PLT counts, occurring in approximately 10% to 57% of cancer patients, are associated with tumor progression and metastasis through various mechanisms37,38. This study identified a negative correlation between elevated PLT counts and CD3 expression levels. This is in agreement with Zhao et al.39, who reported that increased PLT counts may act as a poor prognostic indicator for patients with early-stage CC. Similarly, Shi Jia Xin et al.40 observed higher PLT counts in patients with CC exhibiting lymph node metastasis compared to those without metastasis (p < 0.05) in the training cohort, although this difference was not significant in the validation cohort, aligning with previous reports.
Several limitations warrant consideration. First, the T-lymphocyte analysis relied on public database-derived data, which may be subject to variability in data quality and completeness. Second, the lack of comprehensive survival data and cross-database validation in the CC patient cohort limits the generalizability of the findings. Future studies should aim to develop radiomics models integrating multi-sequence imaging data and clinical parameters from diverse centers. Additionally, the current imaging dataset mainly focuses on early-stage patients and lacks data from advanced stages. Future research plans include increasing the sample size to cover the entire disease course from early to advanced stages, aiming for a more comprehensive perspective and accurate predictive models.
In conclusion, CD3 expression levels correlate significantly with CC staging and prognosis. By integrating intra- and peritumoral multi-sequence MRI radiomic features, tumor size, and PLT counts, this study developed an efficient model for predicting CD3 expression status in preoperative patients with CC. It could aid in identifying patients more likely to benefit from immunotherapy or other immune-targeted treatments, thus informing personalized treatment strategies.
Data availability
The datasets generated during the current study are included in this published article. Further inquiries can be available from the corresponding author.
References
World Health Organization. Cervical cancer: Key facts [EB/OL]. World Health Organization. https://www.who.int/news-room/fact-sheets/detail/cervical-cancer. (2023).
Xia, C. et al. Cancer statistics in China and United States, 2022: Profiles, trends, and determinants. Chin. Med. J. Engl. 135(5), 584–590 (2022).
Marret, G., Borcoman, E. & le Tourneau, C. Pembrolizumab for the treatment of cervical cancer. Expert Opin. Biol. Ther. 19(9), 871–877 (2019).
Ferrall, L. et al. Cervical cancer immunotherapy: Facts and hopes. Clin. Cancer Res. 27(18), 4953–4973 (2021).
Holder, A. M. et al. Defining clinically useful biomarkers of immune checkpoint inhibitors in solid tumours. Nat. Rev. Cancer 24(7), 498–512 (2024).
Zhao, K., Han, D., Tang, L. & Jin, H. CD39 expression in peripheral T cells is associated with clinicopathological characteristics in patients with cervical cancer. Iran. J. Immunol. 20(3), 276–286 (2023).
Heeren, A. M. et al. Neoadjuvant cisplatin and paclitaxel modulate tumor-infiltrating T cells in patients with cervical cancer. Cancer Immunol. Immunother. 68(11), 1759–1767 (2019).
Jia, Q., Wang, A., Yuan, Y., Zhu, B. & Long, H. Heterogeneity of the tumor immune microenvironment and its clinical relevance. Exp. Hematol. Oncol. 11(1), 24 (2022).
Jiang, Y. et al. Noninvasive imaging evaluation of tumor immune microenvironment to predict outcomes in gastric cancer. Ann. Oncol. 31(6), 760–768 (2020).
Yu, L. et al. T cell immunotherapy for cervical cancer: Challenges and opportunities. Front. Immunol. 26(14), 1105265 (2023).
Rowe, J. H. et al. Patients with CD3G mutations reveal a role for human CD3γ in Treg diversity and suppressive function. Blood 131(21), 2335–2344 (2018).
Nedergaard, B. S., Ladekarl, M., Thomsen, H. F., Nyengaard, J. R. & Nielsen, K. Low density of CD3+, CD4+ and CD8+ cells is associated with increased risk of relapse in squamous cell cervical cancer. Br. J. Cancer 97(8), 1135–1138 (2007).
Zhang, R. et al. Predicting preoperative muscle invasion status for bladder cancer using computed tomography-based radiomics nomogram. BMC Med. Imaging 24(1), 98 (2024).
Gillies, R. J., Kinahan, P. E. & Hricak, H. Radiomics: Images are more than pictures, they are data. Radiology 278(2), 563–577 (2016).
Wu, F. et al. Radiomics analysis based on multiparametric magnetic resonance imaging for differentiating early stage of cervical cancer. Front. Med. (Lausanne) 2(11), 1336640 (2024).
Seo, N., Hayakawa, S. & Tokura, Y. Mechanisms of immune privilege for tumor cells by regulatory cytokines produced by innate and acquired immune cells. Semin. Cancer Biol. 12(4), 291–300 (2002).
He, H. et al. Computed tomography-based radiomics prediction of CTLA4 expression and prognosis in clear cell renal cell carcinoma. Cancer Med. 12(6), 7627–7638 (2023).
Narang, S. et al. Tumor image-derived texture features are associated with CD3 T-cell infiltration status in glioblastoma. Oncotarget 8(60), 101244–101254 (2017).
Lippens, L. et al. Immunologic impact of chemoradiation in cervical cancer and how immune cell infiltration could lead toward personalized treatment. Int. J. Cancer 147(2), 554–564 (2020).
Yin, X., Chen, S. & Eisenbarth, S. C. Dendritic cell regulation of T helper cells. Annu. Rev. Immunol. 26(39), 759–790 (2021).
Dong, S., Guo, X., Han, F., He, Z. & Wang, Y. Emerging role of natural products in cancer immunotherapy. Acta Pharm. Sin B 12(3), 1163–1185 (2022).
Duan, Y., Chen, X., Li, W., Li, S. & Zhang, C. Multimodal radiomics and nomogram-based prediction of axillary lymph node metastasis in breast cancer: An analysis considering optimal peritumoral region. J. Clin. Ultrasound. 51(7), 1231–1241 (2023).
Hu, Y. et al. Assessment of intratumoral and peritumoral computed tomography radiomics for predicting pathological complete response to neoadjuvant chemoradiation in patients with esophageal squamous cell carcinoma. JAMA Netw. Open 3(9), e2015927 (2020).
Algohary, A. et al. Combination of peri-tumoral and intra-tumoral radiomic features on bi-parametric MRI accurately stratifies prostate cancer risk: A multi-site study. Cancers (Basel) 12(8), 2200 (2020).
Zhou, J. et al. Diagnosis of benign and malignant breast lesions on DCE-MRI by using radiomics and deep learning with consideration of peritumor tissue. J. Magn. Reson. Imaging 51(3), 798–809 (2020).
Ma, Y., Xu, X., Pang, P. & Wen, Y. A CT-based tumoral and mini-peritumoral radiomics approach: Differentiate fat-poor angiomyolipoma from clear cell renal cell carcinoma. Cancer Manag. Res. 12(13), 1417–1425 (2021).
Li, H. et al. MRI-based multiregional radiomics for preoperative prediction of tumor deposit and prognosis in resectable rectal cancer: A bicenter study. Eur. Radiol. 33(11), 7561–7572 (2023).
Xiao, M. L. et al. Intratumoral and peritumoral MRI radiomics nomogram for predicting parametrial invasion in patients with early-stage cervical adenocarcinoma and adenosquamous carcinoma. Eur. Radiol. 34(2), 852–862 (2024).
Chen, C. et al. Contrast-enhanced CT texture analysis: A new set of predictive factors for small cell lung cancer. Mol. Imaging Biol. 22(3), 745–751 (2020).
Yang, W. L., Zhu, F. & Chen, W. X. Texture analysis of contrast-enhanced magnetic resonance imaging predicts microvascular invasion in hepatocellular carcinoma. Eur. J. Radiol. 156, 110528 (2022).
O’Shea, R. et al. Multicentre validation of CT grey-level co-occurrence matrix features for overall survival in primary oesophageal adenocarcinoma. Eur. Radiol. 34(10), 6919–6928 (2024).
Wu, Y. et al. A multicenter study on preoperative assessment of lymphovascular space invasion in early-stage cervical cancer based on multimodal MR radiomics. J. Magn. Reson. Imaging 58(5), 1638–1648 (2023).
Zheng, Z. et al. Magnetic resonance imaging-based radiomics signature for preoperative prediction of Ki67 expression in bladder cancer. Cancer Imaging 21(1), 65–79 (2021).
Fan, C. et al. Discriminating malignant from benign testicular masses using machine-learning based radiomics signature of appearance diffusion coefficient maps: Comparing with conventional mean and minimum ADC values. Eur. J. Radiol. 148, 110158 (2022).
Haldorsen, I. S., Lura, N., Blaakær, J., Fischerova, D. & Werner, H. M. J. What is the role of imaging at primary diagnostic work-up in uterine cervical cancer?. Curr. Oncol. Rep. 21(9), 77 (2019).
Ren, J. et al. MRI-based radiomics analysis improves preoperative diagnostic performance for the depth of stromal invasion in patients with early stage cervical cancer. Insights Imaging 13(1), 17 (2022).
Zhang, F. et al. Combination of platelet count and mean platelet volume (COP-MPV) predicts postoperative prognosis in both resectable early and advanced stage esophageal squamous cell cancer patients. Tumour Biol. 37(7), 9323–9331 (2016).
Borsig, L. The role of platelet activation in tumor metastasis. Expert Rev. Anticancer Ther. 8(8), 1247–1255 (2008).
Zhao, K., Deng, H., Qin, Y., Liao, W. & Liang, W. Prognostic significance of pretreatment plasma fibrinogen and platelet levels in patients with early-stage cervical cancer. Gynecol. Obstet. Invest. 79(1), 25–33 (2015).
Shi, J. et al. MRI-based peritumoral radiomics analysis for preoperative prediction of lymph node metastasis in early-stage cervical cancer: A multi-center study. Magn. Reson. Imaging 88, 1–8 (2022).
Acknowledgements
We thank Bullet Edits Limited for the linguistic editing and proofreading of the manuscript.
Funding
This study have received funding by the the Hubei Provincial Health Science and Technology Project (Grant Number: WJ2025M068), the Key Research & Development Program of Hubei Province (2023BCB128), and Hubei Provincial Natural Science Foundation of China (2023AFB112), the Hubei Provincial Natural Science Foundation Joint Fund Project (2025AFD884), and Open Fund for the Institute of Gynecological and Obstetric Diseases, Xiangyang Center Hospital (2023MDI05).
Author information
Authors and Affiliations
Contributions
Research integrity, L.F., Q.X.M., X.H.; Research conceptualization, W.F, Y.J.; Data collation, Z.R., L.H.B; Statistics and analysis, W.F., H.C.C.,J.C.F.; Supervise, Q.X.M., X.H.; Verification, L.F., D.L.; Literature and clinical research, J.C.F., L.L.; Manuscript – original draft, Z.R., A.T.; Manuscript – review and editing, L.H.B, A.T.; Funding acquisition, Z.R., W.F., Y.J., X.H.; Approval the submitted version of the manuscript, all authors.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Ethical approval and consent to participate declarations
This retrospective study was authorized by Xiangyang Central Hospital’s Institutional Review Board (approval number 2022-027). And the requirement of informed patient consent was waived.
Study subjects or cohorts overlap
No study subjects or cohorts have been previously reported before.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Zhang, R., Jiang, C., Li, F. et al. Preoperative MRI-based radiomics analysis of intra- and peritumoral regions for predicting CD3 expression in early cervical cancer. Sci Rep 15, 26754 (2025). https://doi.org/10.1038/s41598-025-12162-9
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s41598-025-12162-9










