Interpretable model based on MRI radiomics to predict the expression of Ki-67 in breast cancer

Zhang, Li; Du, Qinglin; Shen, Mengyi; He, Xin; Zhang, Dingyi; Huang, Xiaohua

doi:10.1038/s41598-025-97247-1

Download PDF

Article
Open access
Published: 17 April 2025

Interpretable model based on MRI radiomics to predict the expression of Ki-67 in breast cancer

Li Zhang¹^na1,
Qinglin Du¹^na1,
Mengyi Shen¹,
Xin He¹,
Dingyi Zhang¹ &
…
Xiaohua Huang¹

Scientific Reports volume 15, Article number: 13318 (2025) Cite this article

3560 Accesses
7 Citations
Metrics details

Subjects

Abstract

This study aimed to develop an interpretable machine learning model that accurately predicts Ki-67 expression in breast cancer (BC) patients using a combination of dynamic-contrast enhanced magnetic resonance imaging (DCE-MRI) radiomics and clinical-imaging features. A total of 195 BC patients, including 201 lesions, were enrolled retrospectively. These lesions were randomized into training and testing set (7:3). The correlation between clinical-imaging features and Ki-67 expression was analyzed via univariate analysis and binary logistic regression, leading to the development of a Clinical-imaging model. Radiomics features were extracted based on the early and delayed phases of DCE-MRI. These features were screened by Pearson correlation coefficient and recursive feature elimination (RFE). The logistic regression classifier was used to develop the Radiomics model. The clinical imaging and radiomics features were combined to form a Combined model. The Shapley Additive Explanation (SHAP) algorithm was employed to explain the optimal model, and the AUC was used to assess the model’s performance. Ki-67 expression was markedly different from the internal enhancement pattern and necrosis among the imaging features. Compared to the Clinical-imaging model (AUC = 0.682), the AUCs of the Radiomics and the Combined models in the training set were 0.797 and 0.821, respectively. Clinical-imaging, Radiomics, and Combined models had AUCs of 0.666, 0.796, and 0.802 in the test set. Based on the IDI results, the combined model outperformed the Clinical-imaging and Radiomics models in the training set by 11.8% and 2.1%, respectively. They increased by 11% and 1.74% in the test set. SHAP analysis showed that ph2-original-shape-surface volume ratio was the most important feature of the model. Based on clinical-imaging features and DCE-MRI radiomics, the interpretable machine learning model can accurately predict the expression of Ki-67 in BC. Combining the SHAP algorithm with the model improves its interpretability, which may assist clinicians in formulating more accurate treatment strategies.

Prediction of Ki-67 expression in invasive breast cancer with dual-modality radiomics

Article Open access 05 March 2026

Establishment of an interpretable MRI radiomics-based machine learning model capable of predicting axillary lymph node metastasis in invasive breast cancer

Article Open access 18 July 2025

Multiparameter MRI-based model integrating radiomics and deep learning for preoperative staging of laryngeal squamous cell carcinoma

Article Open access 09 May 2025

Introduction

Breast cancer (BC) is the most prevalent malignant tumor in women, and it is the most lethal of all invasive tumors^1,2,3. The course of treatment of BC has been continuously optimized, and their associated fatality rate has been substantially reduced due to the development of precision medicine⁴. In BC, clinicians primarily rely on analyzing traditional biomarkers for treatment decision-making, including progesterone receptor (PR), estrogen receptor (ER), human epidermal growth factor receptor 2 (HER-2), and Ki-67⁵. Studies have found that the expression of Ki-67 is a reliable biomarker of cell proliferative activity⁶, and its expression is closely correlated with the prognosis of BC patients. The relapsing rates and prognosis of patients are typically worse when the Ki-67 level is high^7,8. Further, patients with its expression > 20% show improved responses against neoadjuvant chemotherapy (NAC)⁹. However, in clinical practice, the primary method for detecting Ki-67 is immunohistochemistry (IHC), which necessitates invasive biopsy and may result in bleeding, infection, or metastasis¹⁰. In HER-2-negative BC patients, preoperative biopsy results in a significant increase in Ki-67 expression, thereby deteriorating the prognosis of the disease^11,12. Therefore, it is crucial to establish a timely and non-invasive strategy for predicting the expression of Ki-67 in BC to enhance the outcomes of BC patients.

In detecting BC, dynamic contrast-enhanced magnetic resonance imaging (DCE-MRI) is highly sensitive and provides more details about tissue anatomy and function¹³. With the rapid development of precision quantitative medical imaging technology, radiomics can transform medical images into high-dimensional and quantitative data, thereby identifying the potential correlation between radiomics and biological characteristics¹⁴. Previous studies based on MRI radiomics have explored its value in predicting benign and malignant BC, molecular subtypes, Ki-67 expression and efficacy of neoadjuvant therapy, and all have shown good performance^{15,16,17,18,19,20,21}. However, previous studies have focused on improving the performance of models. Still, the internal working principles of models are challenging to elucidate due to their low transparency, and their interpretability restricts the clinical application of prediction results.

Currently, the SHAP algorithm has been gradually used in medical research, and its interpretability has been explored in the prediction model of pathological grading of pancreatic neuroendocrine tumors, molecular subtypes of BC, and post-acute pancreatitis diabetes^22,23,24. Therefore, based on the SHAP algorithm, this study interpreted the optimal model for predicting the Ki-67 expression in BC. This method can identify the features that have the most substantial influence on the model, and quantify the impact of each feature on single sample when the features are not fully independent. The feature contribution can be visualized to promote doctors’ understanding of the model prediction logic.

Materials and methods

Study cohorts

The ethics committee of the Affiliated Hospital of North Sichuan Medical College approved the retrospective study and exempted patients from informed consent (Accreditation number: 2023ER131-1). All the procedures performed in this study were in accordance with the Declaration of Helsinki (as revised in 2013). The clinical and Imaging data of 290 BC patients admitted to the Affiliated Hospital of North Sichuan Medical College between May 2021 and April 2023 were collected. The inclusion criteria included (1) female patients with BC confirmed by puncture or surgical biopsy; (2) not received any previous treatment for the breast. The exclusion criteria included: (1) Incomplete clinical data; (2) Image artifacts; (3) Incomplete imaging data. Finally, 195 BC patients with 201 lesions were included, including 189 unilateral BC patients and 6 bilateral BC patients (Fig. 1).

Image acquisition

Images were acquired on a 3.0T MRI scanner (Shanghai United Imaging Medical Co., LTD., China) with the 10-channel phased-array breast coil. The chest wall was affixed to the coil, and the breasts were freely hanging within the breast coil while scanning was carried out in the prone state. The scan imaging covered both breasts and armpits. All patients underwent conventional MRI scan, including a transverse T1-weight (T1WI), a transverse fat-suppressed T2 weighted image (FS-T2WI), a transverse DWI, and a transverse dynamic MRI fat-suppressed T1-weighted (DCE-MRI). First, T1WI, T2WI-fs and DWI sequences were acquired. After the first-stage normal scan, Gd-DOTA (0.2 mmol/kg) was injected into the median elbow vein with a high-pressure syringe at a rate of 2.5 mL/s, and 20 ml normal saline was injected at the same rate. DCE-MRI images were obtained in eight phases, each phase was 69 s. The detailed parameters of the scanning sequence are shown in Appendix 1.

Analyses of clinical data and conventional imaging features

The clinicopathological data of the patients were collected, including age, reproductive history, menopausal status, tumor location, T stage before treatment, lymph node metastasis (LN) status, and whether accompanied by benign tumors. Ki-67 was defined as the percentage of nuclear staining positive cells in cancer cells. There was no significant difference in the prognosis between patients with Ki-67 < 14% and those with Ki-67 expression of 14–20%, while the prognosis of patients with Ki-67 > 20% was significantly worse²⁵. Therefore, 20% was used as the boundary in this study, and Ki-67 ≤ 20% was defined as low expression, and > 20% was considered as high expression.

All collected conventional imaging features were evaluated by two radiologists as per the guidelines of the American College of Radiology (ACR) Breast Imaging Reporting and Data System (BI-RADS)²⁶. All included features were as follows: tumor maximum diameter (TMD), type of time-signal intensity curve (TIC), ADC value, mammographic calcification, internal enhancement pattern, type of gland, and necrosis. Among them, TMD was measured on the largest section of the tumor on DCE-MRI, and the average value was obtained after measuring the TMD thrice. According to the DCE-MRI sequence, the range of solid lesions was determined, which was marked (thrice) on the ADC map and averaged. The DCE-MRI images were imported to the medical image processing uWS-MR software (R005, United Imaging Medical, Shanghai, China). In the most apparent enhancement phase, the region of interest (ROI) was defined at the maximum lesion level, covering the whole lesion. Next, the TIC was generated, and the relevant semi-quantitative parameters were derived²⁷. All ROI were blindly examined for the pathological outcomes of the patients by two radiologists, and the inter-observer agreement was evaluated using the inter-class correlation coefficient (ICC).

Extraction of radiomics feature

The lesion was identified by two radiologists with 3 years of experience. They were unaware of the patient’s pathological reports. One radiologist used 3D slicer software to sketch the ROI on the early (ph2) and delayed (ph5) phases of DCE-MRI. Another radiologist randomly picked 1/3 of the patients for sketching to evaluate the inter-observer agreement. Feature extraction was performed through 3D sclicer’s “Pyradiomics” package, with voxels of the segmented ROIs were resampled to 1 × 1 × 1 mm (X-axis, Y-axis, and z-axis) via linear interpolation algorithm to ensure data standardization and comparability. The extracted radiomics features were derived from both the original images and those processed with convolutional filters. Gaussian Laplacian filters (sigma = 0.5, 1.0, 1.5, 2.0) and wavelet filters (LLH, LHL, LHH, HLL, HHL, HLH, HHH and LLL) were used for convolution filters. A total of 1223 radiomics features were extracted for each patient. These included 14 shap features, 18 first-order features, 24 glcm features, 14 gldm features, 16 glrlm features, 16 glszm features, and 5 ngtdm feature.

The radiomics features with ICC ≥ 0.80 were selected. The selected features were standardized by Z-score to enhance the comparability of data from various dimensions after ICC. After that, the Pearson correlation coefficient between features was determined. One of the most closely related characteristics was removed using a threshold of 0.8, and the most significant features were retained via recursive feature elimination (RFE).

Explainable model development

The radiomics model was developed by combining the features gathered from ph2 and ph5. The clinical-imaging features associated with Ki-67 expression were selected using univariate and binary logistic regression analyses, and a Clinical-imaging model was then developed. The combined model was constructed by integrating the selected clinical-imaging features with radiomics features to predict the Ki-67 expression in BC using logistic regression. The SHAP algorithm was used to explain the optimal model. Among these, the logistic regression algorithm uses L2 regularization. Further, the liblinear optimization algorithm suitable for small data sets and multi-classification problems, is used, with the maximum number of iterations set to 10,000 to ensure adequate convergence time for the model. The best regularization strength parameter C is determined by grid search (GS). 5-fold cross-validation was used to evaluate the performance of each parameter combination. The SHAP value of each feature is calculated by determining the marginal contribution of that feature across all possible feature combinations. This is done by analyzing the change in the prediction result when the feature is added or removed, and assigning weights to each marginal contribution. The final model output is then the sum of these individual feature contributions. In this study, the bar plot of feature importance was used to demonstrate the ability of feature importance scores. At the same time, the force diagram and waterfall diagram were used to show the prediction process of a single patient, to help users intuitively understand the effects of features on model prediction, and to improve the interpretability and transparency of the model. Finally, SHAP value heatmaps were used to demonstrate the direction and strength of the influence of features on model predictions.

Statistical analysis

Python 3.9.13 software and SPSS software (version: 25.0) were used for statistical analysis. Independent sample t test was used for continuous variables conforming to normal distribution and described as mean ± standard deviation (SD). Mann–Whitney U test was used for continuous variables not conforming to normal distribution and described as median (quartile). Categorical variables were expressed as frequency and percentage using the chi-square test or Fisher’s exact test. All statistical tests were bilateral, and P < 0.05 was considered statistically significant.

The performance of each model was assessed using the area under the curve (AUC) along with the 95% confidence interval (CI), and a comparison was made of the sensitivity, specificity, precision, and F1 score metrics. Delong test was used to determine whether there was a significant difference in the performance of different models in predicting Ki-67 expression status. The models’ overall improvement was assessed via Integrated Discrimination Improvement (IDI), Calibration curves were used to evaluate the goodness of fit of the model. The net benefit of the test set at different threshold probabilities was quantified by DCA to evaluate the clinical applicability of the model.

Results

Clinical data

The median age of the 195 patients enrolled in this study was 51 years, ranging from 24 to 84 years. They were randomly assigned to the training and test sets. The training set comprised 140 lesions (75 high-expression and 65 low-expression lesions), while the test set contained 61 lesions (42 high-expression and 19 low-expression lesions). The ICC of the semi-quantitative parameters varied from 0.945 to 0.999. Univariate analysis showed that the expression status of Ki-67 was significantly different from TMD, internal enhancement pattern and necrosis in the clinical-imaging features (P < 0.05). In the analysis of binary logistic regression, it was found that internal enhancement mode and necrosis serve as distinct markers of Ki-67 expression in BC after controlling for other variables (P < 0.05) (Table 1). A Clinical-imaging model was developed using logistic regression based on internal enhancement patterns and tumor necrosis.

Table 1 Univariate and binary logistic regression results in the training set.

Full size table

Radiomics analysis and evaluation

A sum of 2446 radiomics features was extracted from phases 2 and 5 of the DCE-MRI sequence, and the features with ICC ≥ 0.8 were retained. A total of 2316 radiomics features were retained, of which 1126 features were retained in ph2 and 1190 features were retained in ph5. To reduce the redundancy between features, the Pearson correlation coefficient between features was calculated, and the features with high correlation were removed with a threshold of 0.8, and 242 features were finally retained. Finally, 12 radiomics features were retained by RFE, including 7 radiomics features for ph2 and 5 radiomics features for ph5, and the Radiomics model was constructed by logistic regression. A Combined model was constructed by combining clinical-imaging features and radiomics features. In the training set, the AUC of Clinical-imaging model, Radiomics model and Combined model were 0.682, 0.797 and 0.821, respectively. In the test set, the AUC were 0.666, 0.796 and 0.802, respectively. The ROC of different models are shown in Fig. 2. The sensitivity and specificity of each model in predicting Ki-67 expression in breast cancer patients are shown in Table 2.

Table 2 Performance of Clinical-imaging, radiomics and combined models in the training and test sets.

Full size table

Delong test showed that there in the training set, there were significant differences in performance between the Clinical-imaging model and Radiomics model (Clinical-imaging vs. Radiomics: P < 0.001), and there was a significant difference in performance between the Clinical-imaging model and the Combined model (Clinical-imaging vs. Combined: P < 0.001). There was no statistically significant difference in performance between the Radiomics model and the Combined model (Radiomics vs. Combined: P = 0.076). In the test set, there was no statistically significant difference in performance between the Clinical-imaging and Radiomics models (Clinical-imaging vs. Radiomics: P = 0.137), but there was a statistically significant difference in performance between the Clinical-imaging and Combined models (Clinical-imaging vs. Combined: P = 0.046). There was no statistically significant difference in performance between the Radiomics and Combined models (Radiomics vs. Combined: P = 0.872). The IDI results revealed that in the training set, the performance of the combined model was improved by 11.8% and 2.1% compared with the Clinical-imaging and Radiomics models, respectively. They increased by 11% and 1.74% in the test set, respectively.

The calibration curves of the Combined model demonstrated good consistency between the predictive outcome and observation in the training and test sets (Fig. 3). Hosmer–Lemeshow test showed that there was no significant difference between the predicted value and the true value (training set: 0.257, test set: 0.447). The DCA plot shows that the net clinical benefits of the Combined model are higher than those of the Clinical-imaging model and, to a large extent, higher than those of the Radiomics model (Fig. 3).

Explainable machine learning model Building

The optimal model was interpreted using the SHAP algorithm, the overall Shapley value of the combined model and the individual Shapley value was calculated, and the weights of the nine most significant features of the model were displayed using the SHAP method (Fig. 4). The figure shows that the radiomics features from ph2 of DCE-MRI contributed more to the model than those from ph5, with the surface volume ratio of the original ph2 images contributing the most. Figure 5 demonstrates the individualized predictive performance of the combined model using SHAP plots for two breast cancer patients, visualizing how MRI-based radiomics and clinical features affect the model’s prediction of Ki-67 expression at an individual level. The results showed that the output value of patient 1 was 2.05, which was predicted to be a high expression (Fig. 5A). A waterfall plot (Fig. 5B) illustrates the patient’s overall decision-making process, with a final predicted probability of 0.886. Figure 5C shows Patient 2 with an output value of − 1.45, predicted by the model as a low expression. The same waterfall diagram (Fig. 5D) was used to show the overall decision-making process of the patient, and its final prediction probability was 0.191. However, the heat map of feature distribution under sample clustering (Fig. 6) shows the influence direction and strength of each feature in all samples in the model. The more brightly colored features in the figure, the greater their contribution in the model. Consistent with the results shown in Fig. 4, the surface volume ratio contributed the most in the model, followed by Imc1 from the second phase of DCE-MRI with Gaussian Laplacian filtering.

Discussion

This study investigated the predictive value of clinical imaging and DCE-MRI dual-phase radiomics features in predicting Ki-67 expression in BC patients. The findings indicated that the Combined model, which combined clinical-imaging and dual-phase radiomics features, outperformed the Clinical-imaging and Radiomics models independently. Simultaneously, this study used the SHAP algorithm to explain the best performance model, visualize the prediction process from individual to overall, ensure the interpretability of the model, assist clinicians in better understanding the decision-making process of the model, and formulate more accurate treatment options for patients.

Recently, many studies have explored the correlation between breast MRI and Ki-67 expression and found that some conventional MRI features are correlated with Ki-67 expression status. Song et al.²⁶ explored the relationship between MRI morphology, kinetic characteristics, ADC value, and Ki-67 expression, and found that Wash-out and intratumoral high signal intensity on T2WI were independent biomarkers of Ki-67 expression status in BC. However, in this study, semi-quantitative parameters were not correlated with the Ki-67 expression, which may be due to the difference in the distribution and quantity of samples between both studies. Surov et al.²⁸ analyzed the relationship between ADC and Ki-67 proliferation index and other histopathological features of BC via a multicenter study, and observed that ADC value could not be used as a surrogate index of BC proliferation activity, which was in line with the findings of this study. Further, this study also revealed that patients with high Ki-67 expression were more likely to have necrosis than those with low Ki-67 expression, which may be related to the proliferation activity of cancer cells. The proliferation activity of BC with elevated Ki-67 expression was considerably higher, and the proliferation rate was excessively rapid. However, neovascularization was insufficient to penetrate the blood flow of the lesion center, resulting in necrosis²⁹. Meanwhile, this study found that patients with elevated Ki-67 levels were susceptible to ring enhancement, similar to a previous study’s findings³⁰.

Radiomics is the process of analyzing medical images to extract high-throughput radiomics features that can accurately represent the overall heterogeneity of lesions. In contrast to conventional imaging features, this method is capable of more accurately reflecting tumor heterogeneity and revealing the possible interaction between radiomics and biological characteristics^14,31. Therefore, based on conventional imaging features, the predictive ability of DCE-MRI radiomics features was also analyzed for Ki-67 expression in BC patients. Wu et al.³² extracted the radiomics features of early and delayed DCE-MRI and demonstrated that the nomogram model based on preoperative DCE-MRI images had good efficacy in predicting lymphatic invasion of invasive BC determined by D2-40. In the early and delayed phases of breast DCE-MRI, the characteristics of signal intensity changes may depict the internal blood vessels, microcirculation, and blood circulation of breast lesions, which is important in the differentiation of various forms of BC^33,34. Based on this, this study extracted radiomics features of early and delayed DCE-MRI images to obtain more hidden information on images to predict the Ki-67 expression status of BC³⁵. Simultaneously, to reduce the redundancy between features in the early and delayed phases, the Pearson correlation test was used to remove redundant features with high correlation to improve the performance of the model. Ma et al.³⁶ extracted the radiomics features of DCE-MRI and found that the quantitative radiomics features of breast tumors extracted by DCE-MRI were related to the expression of Ki-67 in BC. Feng et al.³⁷ presented that the radiomics model based on the whole tumor area of the DCE-MRI parametric map and the ADC map could predict the Ki-67 expression of BC patients (AUC = 0.839). Liu et al.³⁸ established a radiomics model based on deep learning through multi-parametric MRI for preoperative prediction of Ki-67 status, and its model efficiency reached 0.875. Liang et al.³⁹ established a radiomics classifier as per the T2WI to predict Ki-67 expression, and its model efficiency reached 0.740. Zhang et al.¹⁷ extracted radiomics features of bone metastases in patients with primary breast cancer to predict the Ki-67 expression of BC patients. The AUC for the training set was 0.796, while the test set was 0.705. Although previous studies have demonstrated good predictive capabilities for assessing Ki-67 expression in BC, these prediction models are remain a “black box”, and it is challenging to identify the precise function of each feature within the prediction model. Based on this, the SHAP algorithm was used in this study to explain conventional machine learning models at both the local and global levels^40,41. The SHAP algorithm functions by quantifying the marginal contribution of a certain feature when included in this model and visualizing the overall or individual contribution of each feature. In this study, the SHAP algorithm was used to analyze the degree of contribution of different features to the prediction model. The average SHAP value of a feature can reflect the average contribution of a feature in the overall prediction. The larger the SHAP value, the greater the influence of the feature on the prediction result. Through the SHAP bar chart, it was observed that “ph2-original-shape-surface volume ratio” exhibited the highest SHAP value, indicating its substantial contribution to the model. Clinicians can understand the specific contributions of different features in each sample by examining the specific SHAP values, which helps doctors understand how the feature affects the model prediction under different circumstances, thereby enhancing the understanding of the model and facilitating the clinical application of the model^24,34,41.

Limitations

This study still has some limitations. First, due to the limited temporal resolution of the DCE-MRI sampling in this study, only semi-quantitative parameters can be incorporated to explore their relationship with the expression of Ki-67 in BC. We will improve the temporal resolution in subsequent studies to explore the relationship between quantitative parameters and the expression status of Ki-67. Secondly, this study only explored the value of logistic regression algorithm in predicting Ki-67 expression in breast cancer, and the value of other algorithms needs to be further explored in the future and further explained by SHAP algorithm. Moreover, as a retrospective study, the sample size is relatively small, and in the future study, larger sample size and multi-institutional validation are needed to verify the effectiveness of the model. Finally, the tumor was segmented manually, a time-consuming process, characterized by a high degree of interprocessor variability. In the future, semi-automatic or automatic segmentation may be adopted.

Conclusion

This study developed and validated an interpretable machine learning model based on clinical-imaging features and DCE-MRI dual-phase radiomics features. The model can effectively predict the Ki-67 expression of BC patients, and improve clinicians’ understanding of the decision-making process of the model, which is helpful for clinicians to formulate personalized treatment options for patients.

Data availability

The data sets generated and analyzed in this study are not publicly available because of the need for expanded sample size and further research based on these data sets but are available from the corresponding author on reasonable request.

References

Loibl, S., Poortmans, P., Morrow, M., Denkert, C. & Curigliano, G. Breast cancer. Lancet 397, 1750–1769 (2021).
Article CAS PubMed Google Scholar
Cao, W., Chen, H. D., Yu, Y. W., Li, N. & Chen, W. Q. Changing profiles of cancer burden worldwide and in China: A secondary analysis of the global cancer statistics 2020. Chin. Med. J. (Engl). 134, 783–791 (2021).
Article PubMed Google Scholar
Torre, L. A. et al. Global cancer statistics, 2012: Global cancer statistics, 2012. CA Cancer J. Clin. 65, 87–108 (2015).
Article PubMed Google Scholar
Coates, A. S. et al. Tailoring therapies—Improving the management of early breast cancer: St Gallen international expert consensus on the primary therapy of early breast cancer 2015. Ann. Oncol. 26, 1533–1546 (2015).
Article CAS PubMed PubMed Central Google Scholar
Goldhirsch, A. et al. Strategies for subtypes—Dealing with the diversity of breast cancer: Highlights of the St Gallen international expert consensus on the primary therapy of early breast cancer 2011. Ann. Oncol. 22, 1736–1747 (2011).
Article CAS PubMed PubMed Central Google Scholar
Juríková, M., Danihel, Ľ., Polák, Š. & Varga, I. Ki67, PCNA, and MCM proteins: Markers of proliferation in the diagnosis of breast cancer. Acta Histochem. 118, 544–552 (2016).
Article PubMed Google Scholar
Sun, X. & Kaufman, P. D. Ki-67: More than a proliferation marker. Chromosoma 127, 175–186 (2018).
Article CAS PubMed PubMed Central Google Scholar
Yerushalmi, R., Woods, R., Ravdin, P. M., Hayes, M. M. & Gelmon, K. A. Ki67 in breast cancer: Prognostic and predictive potential. Lancet Oncol. 11, 174–183 (2010).
Article CAS PubMed Google Scholar
Ács, B. et al. Ki-67 as a controversial predictive and prognostic marker in breast cancer patients treated with neoadjuvant chemotherapy. Diagn. Pathol. 12, 20 (2017).
Article PubMed PubMed Central Google Scholar
Bilous, M. Breast core needle biopsy: Issues and controversies. Mod. Pathol. 23, S36–S45 (2010).
Article PubMed Google Scholar
Tong, Y. et al. Ki67 increase after core needle biopsy associated with worse disease outcome in HER2-negative breast cancer patients. Sci. Rep. 13, 2489 (2023).
Article ADS CAS PubMed PubMed Central Google Scholar
Li, S., Chen, X. & Shen, K. Association of Ki-67 change pattern after core needle biopsy and prognosis in HR+/HER2– early breast cancer patients. Front. Surg. 9, 905575 (2022).
Article PubMed PubMed Central Google Scholar
Fusco, R., Sansone, M., Maffei, S., Raiano, N. & Petrillo, A. Dynamic contrast-enhanced MRI in breast cancer: A comparison between distributed and compartmental tracer kinetic models. J. Biomed. Graph Comput. 2, 23 (2012).
Google Scholar
Lambin, P. et al. Radiomics: Extracting more information from medical images using advanced feature analysis. Eur. J. Cancer. 48, 441–446 (2012).
Article PubMed PubMed Central Google Scholar
Fan, M. et al. DCE-MRI texture analysis with tumor subregion partitioning for predicting Ki-67 status of Estrogen receptor-positive breast cancers. J. Magn. Reson. Imaging. 48, 237–247 (2018).
Article PubMed Google Scholar
Fan, M. et al. Joint prediction of breast cancer histological grade and Ki-67 expression level based on DCE-MRI and DWI radiomics. IEEE J. Biomed. Health Inf. 24, 1632–1642 (2020).
Article Google Scholar
Zhang, H. et al. Radiomics signatures for predicting the Ki-67 level and HER-2 status based on bone metastasis from primary breast cancer. Front. Cell. Dev. Biol. 11, 1220320 (2023).
Article PubMed Google Scholar
Kayadibi, Y. et al. Radioproteomics in breast cancer: Prediction of Ki-67 expression with MRI-based radiomic models. Acad. Radiol. 29, S116–S125 (2022).
Article PubMed Google Scholar
Lee, J. Y. et al. Radiomic machine learning for predicting prognostic biomarkers and molecular subtypes of breast cancer using tumor heterogeneity and angiogenesis properties on MRI. Eur. Radiol. 32, 650–660 (2022).
Article CAS PubMed Google Scholar
Liu, Z. et al. Radiomics of multiparametric MRI for pretreatment prediction of pathologic complete response to neoadjuvant chemotherapy in breast cancer: A multicenter study. Clin. Cancer Res. 25, 3538–3547 (2019).
Article CAS PubMed Google Scholar
Lafcı, O., Celepli, P., Seher Öztekin, P. & Koşar, P. N. DCE-MRI radiomics analysis in differentiating luminal A and luminal B breast cancer molecular subtypes. Acad. Radiol. 30, 22–29 (2023).
Article PubMed Google Scholar
Zhang, J. et al. Machine learning for post-acute pancreatitis diabetes mellitus prediction and personalized treatment recommendations. Sci. Rep. 13, 4857 (2023).
Article ADS CAS PubMed PubMed Central Google Scholar
Ye, J. Y. et al. A radiomics-based interpretable model to predict the pathological grade of pancreatic neuroendocrine tumors. Eur. Radiol. 34, 1994–2005 (2024).
Article CAS PubMed Google Scholar
Ma, M. et al. Predicting the molecular subtype of breast cancer and identifying interpretable imaging features using machine learning algorithms. Eur. Radiol. 32, 1652–1662 (2022).
Article CAS PubMed Google Scholar
Petrelli, F., Viale, G., Cabiddu, M. & Barni, S. Prognostic value of different cut-off levels of Ki-67 in breast cancer: A systematic review and meta-analysis of 64,196 patients. Breast Cancer Res. Treat. 153, 477–491 (2015).
Article PubMed Google Scholar
Song, S. E. et al. Machine learning with multiparametric breast MRI for prediction of Ki-67 and histologic grade in early-stage luminal breast cancer. Eur. Radiol. 32, 853–863 (2022).
Article PubMed Google Scholar
Wu, M. et al. Relating doses of contrast agent administered to TIC and semi-quantitative parameters on DCE-MRI: Based on a murine breast tumor model. PLoS One 11, e0149279 (2016).
Article PubMed PubMed Central Google Scholar
Surov, A. et al. Can diffusion-weighted imaging predict tumor grade and expression of Ki-67 in breast cancer? A multicenter analysis. Breast Cancer Res. 20, 58 (2018).
Article PubMed PubMed Central Google Scholar
Miller, I. et al. Ki67 is a graded rather than a binary marker of proliferation versus quiescence. Cell. Rep. 24, 1105–1112e5 (2018).
Article CAS PubMed PubMed Central Google Scholar
Mercado, C. L. BI-RADS update. Radiol. Clin. North Am. 52, 481–487 (2014).
Article PubMed Google Scholar
Gillies, R. J., Kinahan, P. E. & Hricak, H. Radiomics: Images are more than pictures, they are data. Radiology 278, 563–577 (2016).
Article PubMed Google Scholar
Wu, Z. et al. Evaluation of lymphatic vessel invasion determined by D2-40 using preoperative MRI-based radiomics for invasive breast cancer. Acad. Radiol. 30, 2458–2468 (2023).
Article PubMed Google Scholar
Kousi, E. et al. Quantitative evaluation of contrast agent uptake in standard fat-suppressed dynamic contrast-enhanced MRI examinations of the breast. Med. Phys. 45, 287–296 (2018).
Article CAS PubMed Google Scholar
Marvaso, G. et al. Can we predict pathology without surgery? Weighing the added value of multiparametric MRI and whole prostate radiomics in integrative machine learning models. Eur. Radiol. 34, 6241–6253 (2024).
Article PubMed Google Scholar
Schnall, M. D. et al. Diagnostic architectural and dynamic features at breast MR imaging: Multicenter study. Radiology 238, 42–53 (2006).
Article PubMed Google Scholar
Ma, W. et al. Breast cancer Ki67 expression prediction by DCE-MRI radiomics features. Clin. Radiol. 73, 909e1–909e5 (2018).
Article Google Scholar
Feng, S. & Yin, J. Radiomics of dynamic contrast-enhanced magnetic resonance imaging parametric maps and apparent diffusion coefficient maps to predict Ki-67 status in breast cancer. Front. Oncol. 12, 847880 (2022).
Article CAS PubMed PubMed Central Google Scholar
Liu, W. et al. Preoperative prediction of Ki-67 status in breast cancer with multiparametric MRI using transfer learning. Acad. Radiol. 28, e44–e53 (2021).
Article PubMed Google Scholar
Liang, C. et al. An MRI-based radiomics classifier for preoperative prediction of Ki-67 status in breast cancer. Acad. Radiol. 25, 1111–1117 (2018).
Article PubMed Google Scholar
Wang, K. et al. Interpretable prediction of 3-year all-cause mortality in patients with heart failure caused by coronary heart disease based on machine learning and SHAP. Comput. Biol. Med. 137, 104813 (2021).
Article PubMed Google Scholar
Liu, C., Elmer, J., Arefan, D., Pease, M. & Wu, S. Interpretable machine learning model for imaging-based outcome prediction after cardiac arrest. Resuscitation 191, 109894 (2023).
Article PubMed Google Scholar

Download references

Acknowledgements

The authors are grateful to all participants in the study for their support.

Funding

This study was funded by Bureau of Science and Technology and Intellectual Property Nanchong City [No. 19SXHZ0429].

Author information

Li Zhang and Qinglin Du contributed equally.

Authors and Affiliations

Department of Radiology, Affiliated Hospital of North Sichuan Medical College, No 1 Maoyuan South Road, Nanchong, 637000, Sichuan, China
Li Zhang, Qinglin Du, Mengyi Shen, Xin He, Dingyi Zhang & Xiaohua Huang

Authors

Li Zhang
View author publications
Search author on:PubMed Google Scholar
Qinglin Du
View author publications
Search author on:PubMed Google Scholar
Mengyi Shen
View author publications
Search author on:PubMed Google Scholar
Xin He
View author publications
Search author on:PubMed Google Scholar
Dingyi Zhang
View author publications
Search author on:PubMed Google Scholar
Xiaohua Huang
View author publications
Search author on:PubMed Google Scholar

Contributions

L.Z., M.S., D.Z., and X.H., collected the data. L.Z. performed the data analysis. L.Z., Q.D. and X.H. were responsible for the methods section. L.Z. and X.H. wrote the article. All the authors reviewed and edited the manuscript.

Corresponding author

Correspondence to Xiaohua Huang.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Material 1 (download DOCX )

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

Zhang, L., Du, Q., Shen, M. et al. Interpretable model based on MRI radiomics to predict the expression of Ki-67 in breast cancer. Sci Rep 15, 13318 (2025). https://doi.org/10.1038/s41598-025-97247-1

Download citation

Received: 11 October 2024
Accepted: 03 April 2025
Published: 17 April 2025
Version of record: 17 April 2025
DOI: https://doi.org/10.1038/s41598-025-97247-1

Keywords

This article is cited by

Ki-67 expression correlates with hepatocellular carcinoma recurrence and is predictable using radiomics features
- Haiyang Nong
- Yongyi Cen
- Ke Ding
Abdominal Radiology (2025)