Introduction

Endometrial cancer (EC) ranks among the most prevalent gynecological cancers, and its incidence is increasing in younger populations1,2. While surgical treatment offers significant survival benefits, 10%–20% of patients with EC still experience poor prognoses, reflected in a 5-year survival rate of less than 20%3. The recent development of targeted therapies has opened new avenues for treating EC. Numerous research efforts have suggested that therapies aimed specifically at human epidermal growth factor receptor-2 (HER2)-positive EC could enhance survival rates among affected patients4,5. As a receptor tyrosine kinase, HER2 is crucial in the processes of tumor development, invasion, and metastasis, primarily because of its amplification and overexpression at the genetic level6. Consequently, it is essential to predict HER2 status accurately before surgery to inform tailored treatment approaches and enhance patient results.

Immunohistochemical (IHC) analysis of biopsy samples remains the gold standard for assessing HER2 status preoperatively7. Nonetheless, this approach has various drawbacks, such as its invasiveness, long procedure time, and significant costs. Moreover, due to limited biopsy size, IHC analysis may fail to comprehensively capture tumor heterogeneity8. Although polymerase chain reaction can accurately detect HER2 gene amplification, it is not routinely used in clinical practice due to its technical complexity, higher cost, and limited ability to reflect intratumoral protein expression heterogeneity. Magnetic resonance imaging (MRI) is frequently employed in diagnosing, staging, and monitoring subsequent treatment of EC9,10,11. One study has indicated that quantitative MRI techniques, such as amide proton transfer and intravoxel incoherent motion, may serve as valuable imaging biomarkers for HER2 status evaluation in EC patients12. However, it was constrained by small sample sizes and single-slice image analysis, which may have led to the loss of crucial tumor-related information. These functional imaging sequences are also not routinely included in standard clinical MRI protocols, potentially increasing scanning time and limiting their widespread clinical adoption. Therefore, developing a non-invasive and practical tool for assessing HER2 status preoperatively remains urgent.

Radiomics involves the extraction of intricate quantitative features from medical imaging modalities, including MRI, which go beyond human visual perception capabilities13. This advanced analysis focuses on the relationships between these extracted imaging features and various clinical, pathological, and genetic features. Therefore, radiomics provides valuable insights into tumor heterogeneity and elucidates their biological behavior14. It has been applied to non-invasively evaluate tumor grading15, gene expression16, and lymph node (LN) status17 in EC. Recently, MRI-based radiomics has shown promising predictive capabilities regarding HER2 status across multiple malignancies, including breast, gastric, and bladder cancers18,19,20. Despite these encouraging findings, its effectiveness for predicting HER2 status in EC remains ambiguous and thus warrants further investigation.

Therefore, we sought to evaluate the viability of utilizing of employing multiparametric MRI-based radiomics to predict HER2 status in EC. We develop a prediction model capable of risk stratification to improve prediction accuracy and investigate its predictive ability for the progression-free survival (PFS).

Materials and methods

The Ethics Review Committee of the Fifth Affiliated Hospital of Wenzhou Medical University approved this study (ethics approval number 2025-136), waiving the requirement for informed consent due to its retroactive character. All procedures adhered to the ethical norms established by the Helsinki Declaration.

Patient selection

This study enrolled 492 patients with EC from three different medical centers. Figure 1 depicts the enrollment process. Patients were selected based on the subsequent criteria: (1) histopathologically confirmed EC diagnosis following surgical resection, (2) clear determination of HER2 status in tumor specimens through IHC, and (3) preoperative MRI examination conducted within two weeks before surgery. Patients were excluded if they met any of the following criteria: (1) maximum tumor diameter of < 1.0 cm, (2) incomplete clinical or pathological data, and (3) prior radiotherapy or curettage before the MRI examination.

Fig. 1
figure 1

The study recruitment pathway for eligible patients.

The 307 patients treated at the Fifth Affiliated Hospital of Wenzhou Medical University (Center 1) from January 2016 to May 2023 were randomly assigned to training (n = 215) and internal validation (n = 92) cohorts at a 7:3 ratio. External validation cohort 1 comprised 64 patients treated at the Third Affiliated Hospital of Wenzhou Medical University (Center 2) from January 2020 to May 2023. External validation cohort 2 comprised 121 patients treated at the Sixth Affiliated Hospital of Wenzhou Medical University (Center 3) from January 2018 to May 2023. Each patient’s HER2 positive (HER2 (+)) or negative (HER2 (−)) status was determined based on their postoperative pathological examination results. The detailed evaluation process is explained in Appendix E1.

MRI examination and radiological assessment

All patients underwent pelvic contrast-enhanced MRI scans at three centers. The imaging protocols employed included T2-weighted imaging (T2WI), diffusion-weighted imaging (DWI), and contrast-enhanced T1-weighted imaging (CE-T1WI) with fat saturation. Detailed information about the MRI scanners and parameters used for acquisition is provided in Table S1.

Two radiologists, possessing 9 and 20 years of experience in MRI interpretation, respectively, independently evaluated the images while remaining unaware of both the clinical and pathological information. The radiological features encompassed tumor size, tumor margin, tumor composition, myometrial invasion depth, LN status, and abnormal ascites, as detailed in Appendix E2.

Follow up

All patients underwent regular monitoring every six months postoperative to assess disease progression. Disease progression was characterized by either local recurrence in the pelvis or the emergence of new metastases in the abdomen or other distant locations, which comprised nodal, peritoneal, or visceral metastases. PFS was defined as the interval from the surgery date to the first instance of disease recurrence, death due to any cause, or the latest follow-up in March 2025.

Tumor segmentation, feature extraction and selection

Figure 2 illustrates the study workflow. Tumor segmentation and imaging preprocessing were conducted before feature extraction, as described in Appendix E3. Initially, 1688 radiomics features were obtained from each whole-tumor volume of interest (VOI) of each MRI sequence for each patient. Appendix E4 contains further information regarding radiomics feature extraction. Next, all extracted radiomics features were standardized using the z-score method. Features were selected in the training cohort, utilizing the T2WI, DWI, and CE-T1WI subsets both individually and in combination. The three-step procedure used for feature selection—reproducibility analysis via interclass correlation coefficients (ICCs), Spearman’s correlation analysis, and least absolute shrinkage and selection operator (LASSO) analysis—is described in Appendix E5. The subsets of radiomics features with the highest predictive value were utilized to construct the radiomics model.

Fig. 2
figure 2

The study workflow.

Radiomics model building

Six machine learning (ML) algorithms, including k-nearest neighbor (KNN), support vector machine (SVM), logistic regression (LR), random forest (RF), naïve Bayes (NB), and extreme gradient boosting (XGBoost), were employed to construct radiomics models that could predict the HER2 status of EC based on the chosen features. The radiomics model with the highest average area under the receiver operating characteristic (ROC) curve (AUC) across the three validation cohorts was deemed the optimal radiomics model and was utilized to calculate the radiomics score (Radscore).

Clinical and fusion nomogram development and validation

In the training cohort, univariate and multivariate LR analyses were conducted to identify clinical predictors of HER2 status in EC, which were utilized to develop the clinical model. A fusion model was established alongside the respective nomogram for predicting HER2 status by integrating the Radscore and clinical predictors through multivariate LR analysis. ROC analysis was employed to assess the predictive capabilities of these models. Metrics such as sensitivity, specificity, and accuracy were computed and visualized in a confusion matrix. The calibration curves were generated using 1000 bootstrap resamples to evaluate the alignment between predicted risks and actual outcomes. It was also conducted decision curve analysis (DCA) to ascertain the clinical relevance.

Statistical analysis

Statistical analyses were performed using R (v 4.1.0) and Python (v 3.9.6). Student’s t-test or Mann–Whitney U-test was used to compare continuous variables across groups, while Chi-square or Fisher’s exact test was used for categorical variables. Subgroup analyses accounting for age, menopausal status, and tumor size were conducted to assess the model’s generalizability. DeLong’s test was employed to compare the AUCs. The cut-off for the risk score was determined by the fusion model within the training cohort through the Youden index. This process enabled the classification of patients into predicted HER2 (−) and HER2 (+) categories. The prognostic significance of the fusion model regarding PFS was evaluated through Kaplan–Meier analysis and log-rank tests across different cohorts. To pinpoint risk factors influencing PFS in EC patients, both uni- and multi-variate Cox regression analyses were performed. A two-tailed P-value of less than 0.05 was deemed statistically significant.

Results

Baseline characteristics of the enrolled patients

This study enrolled 492 patients with EC from three centers. Tables 1 and S2 provide a summary of the clinical and radiological characteristics of all patients. Notable differences in both differentiation and myometrial invasion were found between the HER2 (−) and HER2 (+) groups across all four cohorts (P < 0.05). HER2 (+) was associated with poor differentiation (odds ratio [OR]: 3.036, 95% confidence interval [CI]: 1.508–6.111) and myometrial invasion (OR: 3.621, 95% CI 1.895–6.915), which were subsequently employed to develop the clinical model (Table S3).

Table 1 Comparison of clinical and radiological characteristics between patients with HER2 (−) and HER2 (+) EC.

Radiomics model construction

After conducting the reproducibility analysis, we identified stable radiomics features with ICCs > 0.75. Specifically, 1,411 features were retained from T2WI, 1,428 from DWI, and 1,472 from CE-T1WI (Fig. S1). Subsequent correlation analysis resulted in 42, 47, and 52 radiomics features being retained in these three sequences, respectively. During the final selection step utilizing LASSO analysis, the T2WI, DWI, and CE-T1WI models included 7, 5, and 6 radiomics features, respectively. Ultimately, 10 radiomics features were preserved from multiparametric MRI, comprising 3 from T2WI, 3 from DWI, and 4 from CE-T1WI (Fig. S2). The optimal radiomics feature subsets and their corresponding coefficients are listed in Table S4.

Based on the optimal feature subset from multiparametric MRI, six ML algorithms were employed to construct radiomics models. Hyperparameter optimization for each ML algorithm was conducted using grid search with five-fold cross-validation implemented in Python to identify the best-performing parameter combinations. The grid search was applied to systematically evaluate combinations of key hyperparameters (e.g., C, gamma, max_depth, n_estimators, etc.) within predefined ranges. The optimal hyperparameters for the SVM algorithm were C = 0.6, kernel = ‘rbf’, and gamma = 0.1. The other ML algorithm parameters are detailed in Appendix E6.

The predictive capabilities of the various ML models according to ROC analyses are shown in Table 2 and Fig. S3. The SVM model surpassed the other ML models (Table S5), attaining the highest average AUC of 0.814 and an average accuracy of 79.58% across the validation cohorts (Table 2). In comparison to SVM models that utilized single MRI sequences, the SVM model utilizing multiparametric MRI demonstrated superior AUC values (Fig. S4). Therefore, the SVM-based model using multiparametric MRI was identified as the best radiomics model, and utilized to calculate the Radscore.

Table 2 Predictive performance of various machine learning-based radiomics models for predicting HER2 status of EC in the training, internal validation, and external validation cohorts.

Fusion nomogram development and validation

By incorporating differentiation, myometrial invasion, and the SVM-based Radscore, a fusion model was created. The fusion model showed favorable and robust performance, with an AUC of 0.914 in the training cohort, 0.846 in the internal validation cohort, and 0.809 and 0.865 in the external validation cohorts 1 and 2, respectively (Table 3 and Fig. 3A–D), which were superior to the clinical model in all cohorts (all P < 0.05; Table S6). The confusion matrices of HER2 status classification by different models are shown in Fig. S5. Subgroup analyses based on age, menopausal status, and tumor size further validated the robustness of the fusion model (Table S7 and Fig. S6). The calibration curves showed good consistency between the fusion model-predicted risks and the actual outcomes (Fig. 3E–H). DCA indicated that the fusion model yielded more net benefit compared with the clinical and radiomics models in all cohorts (Fig. 3I–L). Ultimately, the fusion model was effectively visualized as an easy-to-use nomogram, and two instances of the nomogram application are depicted in Fig. 4.

Table 3 Predictive performance of the clinical model, SVM-based radiomics model, and fusion model for predicting HER2 status in the training, internal validation, and external validation cohorts.
Fig. 3
figure 3

The predictive efficacy of the models. (A-D) ROC curves of different models. (E–H) Calibration curves of different models. (I-L) Decision curves of different models.

Fig. 4
figure 4

Two instances in which the nomogram and multiparametric MRI images are used to assess the individual risk of HER2 status in patients with EC. Multiparametric MRI includes axial T2WI, DWI, and CE-T1WI. (A) Patient 1 exemplifies a case where HER2 (+) was correctly predicted. (B) Patient 2 exemplifies a case where HER2 (−) was correctly predicted. The pathological images underwent IHC analysis at 200 × magnification.

Model risk stratification and prognostic value assessment

Figures 5A–C and S7A show the risk score distributions generated by the fusion model for each patient across all cohorts. Next, we investigated the prognostic value of the fusion model across the different cohorts. The PFS of patients with EC used in the survival analyses are listed in Table S8. Individual patients’ progression status and PFS time are shown in Figs. 5D–F and S7B. The patients classified as predicted HER2 (+) demonstrated significantly worse PFS than those classified as predicted HER2 (−) in the internal validation (hazard ratio [HR] = 2.838, 95% CI 1.100–7.324), external validation 1 (HR = 3.195, 95% CI 1.042–9.800), and external validation 2 (HR = 2.913, 95% CI 1.380–6.147) cohorts (Fig. 5G–I). This is consistent with the PFS results of patients with actual HER2 status, that is, the PFS of patients with actual HER2 (+) was also significantly worse than that of patients with actual HER2 (−) (internal validation: HR = 3.026, 95% CI 1.239–7.394; external validation 1: HR = 3.967, 95% CI 1.287–12.235; external validation 2: HR = 3.083, 95% CI 1.462–6.500). A comparable trend was observed in the training cohort (Fig. S7C). The forest plots indicated a significant association between the risk score and PFS across various subgroups in the validation cohorts, including age, menopause status, and tumor size (Fig. 5J). Cox regression analyses also showed that the predicted HER2 status was independently correlated with PFS (HR: 2.166, 95% CI 1.032–4.546, P = 0.041; Fig. 5K and Table S9).

Fig. 5
figure 5

Associations between the fusion model-predicted risk score and PFS. (A–C) Plots show the fusion model risk score and the actual HER2 status of each patient in the validation cohorts. The dotted line represents the optimal cut-off of 0.353 for the risk score. (D–F) The correlation between progression status and survival time for each patient. (G–I) Kaplan–Meier survival curves of PFS according to predicted HER2 status and actual HER2 status in the validation cohorts. (J) Forest plot of Cox regression analysis of the fusion model-predicted risk scores in different subgroups of the validation cohorts. The center of the error bar represents the HR. (K) Forest plot of independent risk factors for PFS.

Discussion

EC is typically characterized by heightened tumor invasiveness and a less favorable prognosis. HER2 is a pivotal predictive biomarker for identifying candidates suitable for targeted therapies. Therefore, precise preoperative assessment of HER2 status holds considerable clinical importance. This study illustrated that radiomics features obtained from multiparametric MRI can successfully predict HER2 status in EC patients. The fusion model, which combined clinical features with SVM-based Radscore, demonstrated exceptional performance. In the training cohort, it predicted the HER2 status of patients with EC with an AUC of 0.914, and in the three validation cohorts, it had AUCs of 0.809–0.865. It is worth noting that only a HER2 prediction model was trained in this study; the risk probability output by this model facilitates the stratification of EC patients into predicted HER2 (+) and HER2 (−) groups. In addition, we verified that EC patients with predicted HER2 (+) had worse PFS than those with predicted HER2 (−).

Radiomics enables the extraction of high-dimensional quantitative features from medical images, facilitating the characterization of intratumoral heterogeneity in morphology, texture, and function14. This heterogeneity reflects variations in cellularity, necrosis, vascularity, and underlying molecular alterations. HER2 status has been shown to influence cell proliferation, angiogenesis, and immune infiltration in breast cancer, reshaping the tumor microenvironment21. These biological changes can be indirectly captured through radiomics features quantifying structural and textural complexity. In this study, radiomics analysis of tumors with different HER2 status revealed distinct feature patterns, suggesting a link between HER2-driven processes and imaging heterogeneity. Feature selection enhanced model interpretability, with wavelet-based features (7/10) being most predictive. These features, which decompose imaging data into multiple spatial frequency components, are sensitive to subtle texture and intensity variations, reflecting increased tissue complexity associated with HER2 positivity. HER2 overexpression promotes cellular proliferation, increased cell density, and disrupted glandular architecture, all contributing to imaging heterogeneity22. Additionally, HER2-driven angiogenesis leads to irregular microvascular patterns, detectable by wavelet-transformed features. Thus, wavelet-based features may serve as proxies for architectural disorganization and vascular abnormalities characteristic of HER2-positive tumors. The feature most strongly correlated with HER2 status was zone entropy from the GLSZM. This feature quantifies the irregularity of gray-level patterns; higher values reflect more heterogeneous textures, potentially indicating structural instability driven by HER2-related changes in the tumor microenvironment.

MRI has emerged as a crucial element in radiomics analysis due to its outstanding resolution for soft tissues, lack of ionizing radiation, and ability to perform multiparametric imaging. Earlier MRI-based radiomics studies mainly focused on EC diagnosis and prognosis23,24,25,26. Their results indicate that multiparametric MRI could offer a broader understanding of tumor heterogeneity. Thus, our study identified multiparametric MRI-based radiomics features to predict HER2 status in EC. Additionally, six ML algorithms were implemented to develop radiomics models that were structured around multiparametric MRI-based radiomics features. Among these ML-based models, the SVM-based radiomics model proved the most effective in predicting HER2 status.

The SVM algorithm is particularly adept at identifying subtle patterns in intricate datasets, primarily because of its ability to minimize classification errors on unseen data, regardless of any prior assumptions regarding the data’s probability distribution27. An important benefit of SVM is its capacity to manage minor nonlinear fluctuations28. Mainenti et al.29 reported that their SVM-based radiomics model offered significant benefits in risk stratification for EC. Similarly, Song et al.23 employed algorithms such as SVM, RF, and LR to identify the microsatellite instability status of EC patients, reporting that the SVM-based model exhibited the best diagnostic results. However, the optimal classification algorithm may depend on the clinical application. Meng et al.30 reported that their RF classifier outperformed other classifiers in predicting EC with a high tumor mutation burden. Moreover, Mao et al.25 showed that a NB model performed exceptionally well in distinguishing type I and II EC preoperatively. Therefore, further research is warranted to identify the most suitable algorithms for specific clinical applications.

Multivariate LR analysis identified low tumor differentiation and deep myometrial invasion as independent risk factors for HER2 (+) status, consistent with prior studies12,31,32. The potential mechanism is that HER2 expression typically activates cellular signaling pathways, promoting cancer cell proliferation, invasion, and migration, thereby exacerbating the tumor’s biological aggressiveness. Nonetheless, the clinical model constructed utilizing clinical predictors demonstrated only moderate efficacy (AUC: 0.635–0.716). This moderate efficacy stems from clinical features only representing visual anatomical data, which do not sufficiently reflect intratumoral heterogeneity. Consequently, we combined clinical attributes with the Radscore to develop a fusion model. The clinical model’s predictive power and classification accuracy were significantly enhanced by the inclusion of the SVM-based Radscore. Notably, subgroup analyses also validated the excellent robustness of the fusion model. These results further emphasize the shortcomings of clinical features while illustrating the significant benefits of multiparametric MRI-based radiomics features in predicting HER2 status in EC.

Our study’s proposed nomogram can serve as an effective tool for classifying patients with EC into two distinct categories: predicted HER2 (−) and HER2 (+). Among the patients with EC, those with predicted HER2 (+) had notably worse PFS than those with predicted HER2 (−). The HER2 status predicted by the fusion nomogram was also recognized as a distinct risk factor for PFS in EC, suggesting that our prediction nomogram for HER2 status could effectively guide the recurrence stratification of patients with EC. For those at high risk, the nomogram could assist in formulating more timely individual treatment strategies, such as targeted therapy. They may also require more regular follow-ups to enhance their prognosis. Consequently, this nomogram may be an effective resource for directing healthcare professionals in managing EC.

There were various restrictions on our study. Firstly, it was retrospective, which carries inherent biases and involves a limited sample size. Larger datasets obtained through multicenter prospective studies are needed to confirm the validity of our predictive model. Secondly, it only included patients with EC whose tumors had a maximum diameter > 1 cm to improve the delineation of tumor boundaries and ensure an adequate pixel size for radiomics analysis. Future investigations should include patients with EC with tumors with smaller diameters to enhance the model’s applicability. Lastly, the manual segmentation process is not only intricate but also requires considerable time; therefore, future studies would benefit from an automated, dependable, and reproducible segmentation technique.

Conclusions

In conclusion, our study comprehensively investigated using multiparametric MRI-based radiomics features to predict HER2 status in EC patients preoperatively. Our proposed fusion nomogram integrating clinical features and the Radscore could be a useful tool to non-invasively determine HER2 status and assess prognosis in patients with EC, offering valuable guidance for personalized treatment regimens and intensive follow-up strategies.