Noninvasive imaging biomarker reveals invisible microscopic variation in acute ischaemic stroke (≤ 24 h): a multicentre retrospective study

Sun, Kui; Shi, Rongchao; Yu, Xinxin; Wang, Ying; Zhang, Wei; Yang, Xiaoxia; Zhang, Mei; Wang, Jian; Jiang, Shu; Li, Haiou; Kang, Bing; Li, Tong; Zhao, Shuying; Ai, Yu; Qiu, Jianfeng; Wang, Haiyan; Wang, Ximing

doi:10.1038/s41598-025-88016-1

Download PDF

Article
Open access
Published: 30 January 2025

Noninvasive imaging biomarker reveals invisible microscopic variation in acute ischaemic stroke (≤ 24 h): a multicentre retrospective study

Kui Sun¹^na1,
Rongchao Shi³^na1,
Xinxin Yu²^na1,
Ying Wang²,
Wei Zhang⁴,
Xiaoxia Yang⁵,
Mei Zhang⁶,
Jian Wang⁷,
Shu Jiang⁸,
Haiou Li⁹,
Bing Kang²,
Tong Li²,
Shuying Zhao¹⁰,
Yu Ai¹¹,
Jianfeng Qiu¹²,
Haiyan Wang² &
…
Ximing Wang²

Scientific Reports volume 15, Article number: 3743 (2025) Cite this article

3886 Accesses
5 Citations
Metrics details

Subjects

Abstract

To develop and validate non-contrast computed tomography (NCCT)-based radiomics method combines machine learning (ML) to investigate invisible microscopic acute ischaemic stroke (AIS) lesions. We retrospectively analyzed 1122 patients from August 2015 to July 2022, whose were later confirmed AIS by diffusion-weighted imaging (DWI). However, receiving a negative result was reported by radiologists according to the NCCT images. Patients in five institutions (n = 592) were combined to generate training and internal validation sets, remaining in three institutions as external validation sets (n = 204, 53 and 273). Through a series of procedures: head alignment, co-registration of NCCT and DWI, the volume of interest delineation and feature extraction. Multiple ML models (random forest, RF; support vector machine, SVM; logistic regression, LR; multilayer perceptron, MLP) were used to discriminate microscopic AIS and non-AIS. Among 1122 patients included (760 men [67.7%]; median [range] age, 64 [21–96] years). After least absolute shrinkage and selection operator (LASSO) algorithm, 44 optimal features were remained. The radiomics combined ML models were yielded similar mean areas under the receiver operating characteristic curve of 0.808 (95% CI 0.754 to 0.861) for RF, 0.802 (95% CI 0.748 to 0.856) for radial basis kernel function-based SVM, 0.792 (95% CI 0.737 to 0.847) for MLP, 0.792 (95% CI 0.736 to 0.848) for Linear-SVM and 0.787 (95% CI 0.730 to 0.844) for LR, respectively. Combining radiomics with ML models can be an efficient, noninvasive, economical, and reliable technique for evaluating invisible microscopic AIS on NCCT and assisting radiologists to make clinical decisions.

Combining clinical and imaging data for predicting functional outcomes after acute ischemic stroke: an automated machine learning approach

Article Open access 07 October 2023

Development and validation of a machine learning-based prognostic risk stratification model for acute ischemic stroke

Article Open access 23 August 2023

Random forest-based prediction of stroke outcome

Article Open access 12 May 2021

Introduction

Stroke is a global health threat. The 2019 Global Burden of Diseases Study (GBD)¹reported that stroke was the second-leading cause of death and the third-leading cause of death and disability combined worldwide. In 2019, 12.2 million individuals suffered from stroke, 6.55 million deaths occurred, and the global stroke burden increased substantially from 1990 to 2019. Acute ischaemic stroke (AIS) is defined as sudden neurologic dysfunction caused by focal cerebral ischaemia for more than 24 h or proof of acute infarction on brain imaging, regardless of the duration of symptoms². AIS is the primary type of stroke, accounting for approximately 80% of strokes, and is caused by interruption of cerebral blood flow due to arterial occlusion^3,4. Numerous clinical trials have demonstrated the significant clinical benefits of endovascular therapy in AIS patients within 6–24 h of stroke onset^5,6,7,8,9. Thus, early identification of AIS is extremely significant for guiding early clinical treatment and improving patient outcomes, as ‘time is brain.’

Noncontrast computed tomography CT (NCCT) is the first-line imaging modality that is used to evaluate patients with suspected acute stroke. Unfortunately, although NCCT has good diagnostic performance for acute intracranial haemorrhage, it appears to be insufficiently sensitive to ischaemic stroke, especially AIS^3,10,11,12. NCCT AIS findings typically include negative or nonspecific low-density changes, complicating radiologists’ diagnoses. Magnetic resonance imaging (MRI) has excellent advantages in the early detection of AIS, especially diffusion-weighted imaging (DWI) scans. Nevertheless, MRI is less readily available, expensive, difficult for patients, and has longer time costs, which poses a great challenge for most emergency centres. For this high-mortality disease that requires time-sensitive treatment, a technical method that can precisely detect early AIS lesion changes on NCCT must be developed.

In general, quantitatively assessing the brain regions involved in AIS is difficult with NCCT because the variations in density and texture are too subtle to be visually discernible. Radiomics involves analysing and converting medical images into quantitative data and is promising for developing image-driven biomarkers to aid clinical decisions¹³. Machine learning (ML), a branch of artificial intelligence (AI), has been widely used in neuroscience, including for brain tumours, epilepsy, neurodegenerative diseases, and demyelinating diseases^{14,15,16,17,18}. A recent work by Lisowska et al.¹⁹explored the use of context-aware convolutional neural networks for stroke detection. Hu et al.²⁰ evaluated the efficiency of deep learning-based CT perfusion imaging in thrombolytic therapy for acute cerebral infarction with an unknown onset time, demonstrating that the diagnosis effects and image quality were significantly higher in the AI group than in the control group.

Radiomics allows in-depth characterization of phenotypes with distinct lesions, yielding novel predictive indicators. We aimed to develop and validate an NCCT-based radiomics imaging biomarker combined with ML model for detecting early microscopic changes in AIS patients.

Materials and methods

Study design and patient enrolment

In this multicentre study, 1122 eligible patients were retrospectively enrolled from eight cohorts in China. Patients’s details, inclusion and exclusion criteria is shown in Appendix S1. For consistency, the NCCT images of all eligible patients were rereviewed by a radiologist with seven years of experience and a radiologist with twenty years of experience. The patients (n = 592) in cohorts 1 to 5 were combined to generate the main dataset for model fitting, training and parameter tuning, and the remainder of the cohorts were used for independent validation. The baseline characteristics were collected from medical records. The study was approved and the requirement for informed patient consent was waived by the ethical committee of Shandong Provincial Hospital Affiliated to Shandong First Medical University. The study was performed in accordance with the ethical standards as laid down in the 1964 Declaration of Helsinki and its later amendments or comparable ethical standards. Figure 1 is the workflow. The main dataset included 592 patients from cohorts 1 to 5 and was used for radiomics feature selection, signature building, model fitting, training and parameter tuning. Among the selected patients, 397 (67.1%) were male, and 195 (32.9%) were female. The average age was 62.9 years, with the patient ages ranging from 29 to 96 years. A total of 391 (66.0%) patients had hypertension, 155 (26.2%) had dyslipidaemia, 187 (31.6%) had diabetes, 160 (27.0%) had coronary artery disease (CAD), 247 (41.7%) smoked, and 227 (38.3%) drank. The detailed demographic characteristics of all cohorts are shown in Table 1.

Table 1 Baseline characteristics of AIS patients on all cohorts.

Full size table

Delineation and feature extraction

Details of pre-processing is shown in Appendix S2. For all cohorts, we collected NCCT and DWI images for each participant. Two junior radiologists delineated the volume of interest (VOI) of the AIS lesions by using ITK-SNAP software (Version 3.8.0) on the DWI images. The VOI covered regions with high signals to delineate lesions, and one senior radiologist with thirty years of experience reviewed the VOI. As a control, we also sketched a VOI with no abnormal signal area on the contralateral side of the DWI image.

Quantitative radiomics data were extracted from well-registered NCCT images by mapping the VOIs using the PyRadiomics tool package (version 3.0.1). The filters included original, Laplacian of Gaussian (LoG) with various sigma levels (1.0, 2.0, 3.0, 4.0, and 5.0), wavelet, square, square root, logarithm, exponential and gradient. Four classes of grey-level matrices were calculated in three dimensions: the grey-level cooccurrence matrix (GLCM), grey-level run-length matrix (GLRLM), grey-level size zone matrix (GLSZM) and grey-level dependence matrix (GLDM). Each VOI generated 1634 radiomics features.

Feature selection and classifier building

We randomly divided the main dataset into a training set (n = 415) and an internal validation set (n= 177) at a ratio of 7:3. For feature selection, we analyzed and eliminated predictors with zero variance using ANOVA. Then, predictors with multicollinearity were excluded via QR decomposition. Next, the values of the screened features were normalized using the z-score method. The least absolute shrinkage and selection operator (LASSO) regression model is known for its sparsity and anti-over-fitting and is commonly used as a variable selection method in biomedicine^21,22. In our study, the LASSO model was used to select the most predictive features of microscopic AIS in the training dataset. Eventually, these valuable features were generated with the LASSO regression model via ten-fold cross-validation, with one standard error of the minimum penalty coefficient lambda (λ) as the index.

For the ML model, five classifiers were used for prediction in this study: random forest (RF), support vector machine with the linear kernel (Linear-SVM), support vector machine with radial basis function kernel (RBF-SVM), logistic regression (LR) and multilayer perceptron (MLP). The classifiers were fitted and trained on the training set according to the optimal features, and the best parameters were tuned on the internal validation set.

After training, three independent validation cohorts were used to evaluate the performance of the classifiers. The construction of MLP is shown in Appendix S3. The LASSO plot, visualization of MLP training process is shown in Appendix Figure S1.

Statistical analysis

The t test or Mann‒Whitney U tests were used to evaluate the numerical variables. We used the Delong method to determine the area under the receiver operator characteristic (AUROC) and its confidence interval, the area under the precision-recall curve (AUPRC), calibration curve, decision curve analysis (DCA), sensitivity, specificity, accuracy, positive predictive value (PPV), negative predictive value (NPV) and F1-score to assess whether the radiomics features could be used to divide patients into non-AIS and microscopic AIS groups based on NCCT images. The interpretable algorithm SHapley Additive exPlanations (SHAP) was used to calculate the contribution of individual radiomics feature to the ML model predictions.

A p value of < 0.05 was defined as significant in the two-tailed analysis. Statistical analyses were performed with R (version 4.2.1), and MLP model construction was performed with the deep learning framework PyTorch (version 1.11.0) based on Python (version 3.8.0).

Results

Optimal radiomics feature

Initially, 1538 of the 1634 features remained after ANOVA was performed. After the multicollinearity test, 830 features with lower linear correlations were reserved. Ultimately, 44 optimal features were selected according to the LASSO regression model to develop the radiomics signature (Table 2). Of these features, 8 were first-order features, and 36 were higher-order features incorporating 11 GLCM, 12 GLSZM, 3 GLDM, and 10 GLRLM features. Five features originated from the original filter, 20 features were derived from the LoG filters, and 19 features were determined from the wavelet filters.

Table 2 The optimal radiomics features selected by the LASSO model.

Full size table

Predictive performance validated by independent external cohorts

We separately validated the diagnostic performance of each classifier on three independent cohorts. In cohort 6, the RBF-SVM classifier achieved an AUROC of 0.846 (95% CI 0.808 to 0.884), and an AUPRC of 0.854 (95% CI 0.798 to 0.896). The linear SVM model achieved an AUROC of 0.838 (95% CI 0.798 to 0.878), and an AUPRC of 0.856 (95% CI 0.801 to 0.898). The LR model achieved an AUROC of 0.831 (95% CI 0.790 to 0.872), and an AUPRC of 0.840 (95% CI 0.783 to 0.884). The RF model achieved an AUROC of 0.832 (95% CI 0.791 to 0.872), and an AUPRC of 0.846 (95% CI 0.789 to 0.889). The MLP model achieved an AUROC of 0.838 (95% CI 0.799 to 0.878) (Fig. 2a), and an AUPRC of 0.827 (95% CI 0.769 to 0.873) (Fig. 2d).

In cohort 7, the RBF-SVM model achieved an AUROC of 0.805 (95% CI 0.722 to 0.889), and an AUPRC of 0.804 (95% CI 0.675 to 0.890). The linear SVM model achieved an AUROC of 0.775 (95% CI 0.687 to 0.864), and an AUPRC of 0.767 (95% CI 0.636 to 0.862). The LR model achieved an AUROC of 0.775 (95% CI 0.687 to 0.864), and an AUPRC of 0.766 (95% CI 0.634 to 0.861). The RF model achieved an AUROC of 0.818 (95% CI 0.737 to 0.898), and an AUPRC of 0.802 (95% CI 0.674 to 0.889). The MLP model achieved an AUROC of 0.806 (95% CI 0.722 to 0.890) (Fig. 2b), and an AUPRC of 0.819 (95% CI 0.692 to 0.901) (Fig. 2e).

In cohort 8, the RBF-SVM model achieved an AUROC of 0.754 (95% CI 0.714 to 0.795), and an AUPRC of 0.733 (95% CI 0.677 to 0.782). The linear SVM model achieved an AUROC of 0.763 (95% CI 0.723 to 0.803), and an AUPRC of 0.732 (95% CI 0.676 to 0.781). The LR model achieved an AUROC of 0.754 (95% CI 0.713 to 0.795), and an AUPRC of 0.730 (95% CI 0.674 to 0.779). The RF model achieved an AUROC of 0.774 (95% CI 0.735 to 0.813), and an AUPRC of 0.770 (95% CI 0.716 to 0.816). The MLP model achieved an AUROC of 0.731 (95% CI 0.689 to 0.773) (Fig. 2c), and an AUPRC of 0.693 (95% CI 0.636 to 0.745) (Fig. 2f).

The performance of the different models on the training and internal validation sets is shown in Table S1, and the efficacy of different classifiers on three independent validation cohorts is shown in Table 3. A comparison of the AUROC, AUPRC, calibration curve, and DCA of the ML models on the training and internal validation sets is shown in Appendix Figure S2. The calibration curves showed good agreement between the predicted and observed values for the different models in independent cohorts 6–8 (Fig. 3a-c). The DCA results demonstrated the good clinical performance of the models (Fig. 3d-f).

Table 3 The performance of the different classifiers on three independent validation cohorts.

Full size table

Top features ranked by coefficients

For the radiomics signature, we analyzed the features of the top four ranked coefficients, including the LoG (σ = 2 mm) GLCM inverse difference moment normalized (Idmn) feature (coefficient: 0.205), LoG (σ = 1 mm) GLCM Idmn feature (coefficient: 0.193), LoG (σ = 1 mm) GLCM maximum probability feature (coefficient: 0.151), and wavelet (LHL) GLSZM zone entropy feature (coefficient: 0.137) on each independent cohort. We discovered that for these four features, the values of the patients in the microscopic AIS group were generally higher than those of the patients in the non-AIS group. For the LoG (σ = 2 mm) GLCM Idmn feature, p < 0.001 in cohorts 6 and 7 and p = 0.002 in cohort 8. For the LoG (σ = 1 mm) GLCM Idmn feature, p < 0.001 in cohorts 6 and 8 and p = 0.013 in cohort 7. For the LoG (σ = 1 mm) GLCM maximum probability feature, p < 0.001 in cohorts 6 and 8 and p = 0.027 in cohort 7. For wavelet (LHL) GLSZM zone entropy feature, p < 0.001 in cohort 6 and p = 0.033 in cohort 8; however, there were no significant difference in cohort 7, with a p of 0.102 (Fig. 4). A boxplot analysis of the top four features on the training and validation sets is shown in Appendix Figure S3. Moreover, we visualized the top two features with heatmaps. In the No. 1 feature heatmap, the high-signal lesion is red or even dark red, indicating high heterogeneity. The No. 2 feature heatmap displayed mixed heterogeneity in lesions with mixed signals (Fig. 5).

Subgroup analysis and feature contribution calculated by SHAP

We conducted subgroup analyses based on age, gender, and comorbidities (hypertension, dyslipidemia, diabetes, smoking, alcohol consumption, and CAD) to examine differences in predictive performance. Validation using three external independent cohorts (cohorts 6–8) confirmed the model’s robust performance (Figure S4).

For the gender subgroup, the ML model achieved a mean AUROC of 0.829 (range: 0.784–0.831) for males and 0.793 (range: 0.731–0.884) for females. In the age subgroup, participants were divided into two groups based on a threshold of 50 years. The ML model achieved a mean AUROC of 0.808 (range: 0.768–0.842) for those over 50 years and 0.860 (range: 0.813–0.878) for those under 50 years. For the comorbidities subgroup, the ML model achieved mean AUROC of 0.813 (range: 0.772–0.852) for those with hypertension and 0.832 (range: 0.783–0.858) for those without, 0.810 (range: 0.785–0.850) for those with dyslipidemia and 0.818 (range: 0.775–0.847) for those without, 0.833 (range: 0.777–0.862) for those with diabetes and 0.809 (range: 0.777–0.842) for those without, 0.843 (range: 0.810–0.906) for smokers and 0.778 (range: 0.699–0.876) for non-smokers, 0.834 (range: 0.804–0.859) for drinkers and 0.801 (range: 0.761–0.856) for non-drinkers and 0.781 (range: 0.660–0.860) for those with CAD and 0.817 (range: 0.769–0.848) for those without.

The SHAP analysis identified the top three most important features for the model: the original first-order 90Percentile (importance: 0.59), the first-order InterquartileRange from the Laplacian of Gaussian (LoG, σ = 2 mm) transformation (importance: 0.46), and the wavelet-transformed (LLH) first-order InterquartileRange (importance: 0.34) (Figure S5a). The beeswarm plot revealed that higher values of 90Percentile were associated with a lower risk of AIS, whereas higher values of InterquartileRange were linked to a greater likelihood of AIS (Figure S5b).

Discussion

The prognosis of individuals with AIS varies according to the time of diagnosis and intervention, demonstrating the importance of early accurate diagnoses. Considering the scan time and patient cooperation during the scan process, MRI is a less optimal imaging modality than CT. Furthermore, for some emergency centres and low-income countries, MRI is inconvenient and expensive. Specifically, for the Chinese patients of acute cerebral vascular disease, CT is more common than MRI for screening. However, CT also have the flaw of being insensitive to AIS lesions with an onset time of less than 24 h, with an overall sensitivity of 57–71%^23,24. Diseases that can mimic AIS include viral encephalitis, tumors, and white matter lesions. Viral encephalitis is often characterized by a history of infection and alterations in immune cells. Brain tumors typically exhibit mass effects, significant edema, and marked enhancement on contrast-enhanced CT imaging. Cerebral white matter lesions commonly appear as bilateral diffuse hypodense areas within the cerebral hemispheric white matter and are frequently linked to chronic conditions such as long-standing hypertension and hyperlipidemia. Here, we developed and validated an NCCT-based radiomics approach and evaluated its diagnostic value for AIS. The results indicated that the radiomics signature model could distinguish non-AIS and microscopic AIS patients.

The most common mechanism of AIS is embolism. Abnormal cerebral blood flow caused by embolism can lead to persistent brain tissue damage. The progression from reversible injury to irreversible necrosis depends on the magnitude and duration of the reduced blood flow. Ischaemia may occur within minutes in the core of the lesion and develop to the peripheral area. It is estimated that 1.9 million neurons are lost during each minute of ischaemia²⁵. Hence, rapid and accurate diagnosis is crucial for developing urgent treatments to restore blood flow and save neurons.

Current guidelines recommend brain imaging within 30 min of patient presentation to facilitate rapid decision-making about thrombolytic therapy^26,27,28. NCCT is regarded as the most important diagnostic method for differentiating ischaemic stroke from intracerebral haemorrhage. Intracerebral haemorrhage, which appears as a high-density region on CT images, is considered a contraindication of intravenous thrombolytic therapy. Early imaging signs of acute stroke are associated with cellular hypoperfusion and cytotoxic oedema sequelae. The signs of early infarction in CT images include hyperdense artery, decreased grey matter density, cerebral tissue swelling, and sulcal effacement^29,30. However, typical CT signs are rare and present in less than 50% of AIS patients²⁴. The resolution and sensitivity of NCCT are too low to detect prophase changes in ischaemic stroke, especially for identifying infarcts in the posterior cranial fossa and deep brain tissue.

In this study, textural features had the largest contribution to the performance of the ML model for distinguishing microscopic AIS and non-AIS, accounting for ~ 82% (36/44) of the features. Decreased resampling allows more elaborate textural information to be obtained from the images; thus, we resampled the NCCT images to 0.5 × 0.5 × 0.5 mm³. Radiomics features can reflect subtle benign or malignant changes in medical images and reveal regularities that are invisible to radiologists. For example, Idmn indicates local homogeneity in an image. Mabrouk et al. reported that the Idmn values in malignant skin cancer were higher than those in nevi. In our study, the values of the top four features were generally higher in the microscopic AIS group than in the non-AIS group. For the LoG (σ = 2 mm) GLCM Idmn feature, in cohort 6, the median was 0.986 (IQR, 0.982–0.995) in the microscopic AIS group vs. 0.982 (IQR, 0.981–0.984) in the non-AIS group. In cohort 7, the median was 0.985 (IQR, 0.982–0.990) in the microscopic AIS group vs. 0.982 (IQR, 0.981–0.984) in the non-AIS group. In cohort 8, the median was 0.984 (IQR, 0.980–0.990) in the microscopic AIS group vs. 0.982 (IQR, 0.979–0.986) in the non-AIS group. For the LoG (σ = 1 mm) GLCM Idmn feature, in cohort 6, the median was 0.965 (IQR, 0.961–0.983) in the microscopic AIS group vs. 0.961 (IQR, 0.959–0.963) in the non-AIS group. In cohort 7, the median was 0.964 (IQR, 0.961–0.968) in the microscopic AIS group vs. 0.962 (IQR, 0.959–0.965) in the non-AIS group. In cohort 8, the median was 0.966 (IQR, 0.962–0.971) in the microscopic AIS group vs. 0.963 (IQR, 0.960–0.967) in the non-AIS group. For the LoG (σ = 1 mm) GLCM maximum probability feature, in cohort 6, the median was 0.460 (IQR, 0.420–0.502) in the microscopic AIS group vs. 0.425 (IQR, 0.412–0.448) in the non-AIS group. In cohort 7, the median was 0.470 (IQR, 0.434–0.498) in the microscopic AIS group vs. 0.446 (IQR, 0.427–0.470) in the non-AIS group. In cohort 8, the median was 0.483 (IQR, 0.442–0.543) in the microscopic AIS group vs. 0.458 (IQR, 0.430–0.509) in the non-AIS group. The zone entropy feature measures the uncertainty in the distribution of the zone size and grey levels, with higher values indicating more heterogeneity in the texture patterns. For the wavelet (LHL) GLSZM zone entropy feature, in cohort 6, the median was 3.000 (IQR, 2.585–3.462) in the microscopic AIS group vs. 2.750 (IQR, 2.412–3.027) in the non-AIS group. In cohort 7, the median was 2.922 (IQR, 2.322–3.325) in the microscopic AIS group vs. 2.725 (IQR, 2.322–3.000) in the non-AIS group. In cohort 8, the median was 2.750 (IQR, 2.322–3.170) in the microscopic AIS group vs. 2.585 (IQR, 2.322–3.000) in the non-AIS group.

Different ML models have various advantages in distinct tasks, and one model cannot perform all tasks perfectly. We used five types of classifiers and evaluated their performance on this task. According to the results of this work, the models performed well for distinguishing non-AIS and microscopic AIS lesions. Notably, the nonlinear models outperformed the linear models. The mean AUROCs of the RF, RBF-SVM and MLP models were 0.808 (95% CI 0.754 to 0.861), 0.802 (95% CI 0.748 to 0.856), and 0.792 (95% CI 0.737 to 0.847), respectively, in the three independent cohorts. The reason for this result may be that real-world problems are usually nonlinear, so linear models cannot accurately distinguish data with nonlinear distributions. The linear models performed slightly worse than the nonlinear models, with mean AUROCs of 0.792 (95% CI 0.736 to 0.848) for the linear-SVM model and 0.787 (95% CI 0.730 to 0.844) for the LR model. This study has several limitations. Firstly, although it includes data from eight independent centres encompassing 1,122 individuals, its retrospective nature may introduce biases in population selection, feature extraction, and model training. Further exploration is needed to incorporate more comprehensive feature parameters and optimize model selection. Importantly, validation using prospective cohorts is essential to confirm the findings of this study. Secondly, this study did not differentiate between sites of occlusion, as the study population included patients with both anterior and posterior circulatory occlusions. This limitation may impede a deeper understanding of the imaging mechanisms and potential variations in imaging biomarkers associated with different pathophysiological types. To address this, future studies will analyze patients with anterior and posterior circulation occlusions separately to provide greater insight into how these distinct pathophysiological mechanisms influence the model’s performance.

In conclusion, our study demonstrates that combining radiomics with machine learning models can be an efficient, noninvasive, economical, and reliable technique for evaluating early microscopic AIS based on NCCT. Although we used three independent cohorts to validate the results, a larger prospective cohort is still needed. Some challenges, such as the sophistication of the registration methods and the determination of the target volume, should be addressed in further prospective studies. The conception of this method applying in the future can be an automatic segment tool to delineate VOI on CT scan according to the brain’s structural area or functional area, and extract radiomics features to input our model resulting in predictive results. Or, a sliding convolutional kernel (e.g. 3 × 3 × 3 or 5 × 5 × 5) on the brain zone to extract radiomics features sequentially, and input our model to give predictive reference. The generated radiomics heatmap can visualize the heterogeneity zone intuitionally, this strategy can be used to diagnose on CT scan without MRI. Our findings suggest the potential value of noninvasive biomarkers to aid in clinical decision-making.

Data availability

The datasets used for the current study are available from the corresponding author on reasonable request.

References

Collaborators, G. B. D. S. Global, regional, and national burden of stroke and its risk factors, 1990–2019: a systematic analysis for the global burden of Disease Study 2019. Lancet Neurol. 20 (10), 795–820 (2021).
Article MATH Google Scholar
Sacco, R. L. et al. An updated definition of stroke for the 21st Century. Stroke 44 (7), 2064–2089 (2013).
Article PubMed PubMed Central MATH Google Scholar
Feske, S. K. Ischemic stroke. Am. J. Med. 134 (12), 1457–1464 (2021).
Article PubMed MATH Google Scholar
Paul, S. & Candelario-Jalil, E. Emerging neuroprotective strategies for the treatment of ischemic stroke: an overview of clinical and preclinical studies. Exp. Neurol. 335, 113518 (2021).
Article CAS PubMed MATH Google Scholar
Goyal, M. et al. Randomized assessment of rapid endovascular treatment of ischemic stroke. N Engl. J. Med. 372 (11), 1019–1030 (2015).
Article CAS PubMed MATH Google Scholar
Powers, W. J. et al. 2015 American Heart Association/American Stroke Association Focused Update of the 2013 guidelines for the early management of patients with Acute ischemic stroke regarding endovascular treatment: a Guideline for Healthcare professionals from the American Heart Association/American Stroke Association. Stroke 46 (10), 3020–3035 (2015).
Article CAS PubMed MATH Google Scholar
Saver, J. L. et al. Time to treatment with intravenous tissue plasminogen activator and outcome from acute ischemic stroke. JAMA 309 (23), 2480–2488 (2013).
Article CAS PubMed MATH Google Scholar
Saver, J. L. et al. Number needed to treat to benefit and to harm for intravenous tissue plasminogen activator therapy in the 3- to 4.5-hour window: joint outcome table analysis of the ECASS 3 trial. Stroke 40 (7), 2433–2437 (2009).
Article CAS PubMed PubMed Central MATH Google Scholar
Saver, J. L. et al. Time to Treatment with Endovascular Thrombectomy and outcomes from ischemic stroke: a Meta-analysis. JAMA 316 (12), 1279–1288 (2016).
Article PubMed MATH Google Scholar
Chalela, J. A. et al. Magnetic resonance imaging and computed tomography in emergency assessment of patients with suspected acute stroke: a prospective comparison. Lancet 369 (9558), 293–298 (2007).
Article PubMed PubMed Central Google Scholar
Hopyan, J. et al. Certainty of stroke diagnosis: incremental benefit with CT perfusion over noncontrast CT and CT angiography. Radiology 255 (1), 142–153 (2010).
Article PubMed Google Scholar
Wardlaw, J. M. & Mielke, O. Early signs of brain infarction at CT: observer reliability and outcome after thrombolytic treatment–systematic review. Radiology 235 (2), 444–453 (2005).
Article PubMed MATH Google Scholar
Aerts, H. J. et al. Decoding tumour phenotype by noninvasive imaging using a quantitative radiomics approach. Nat. Commun. 5, 4006 (2014).
Article ADS CAS PubMed MATH Google Scholar
An, J. et al. Decreased white matter integrity in mesial temporal lobe epilepsy: a machine learning approach. Neuroreport 25 (10), 788–794 (2014).
Article PubMed MATH Google Scholar
Bhagyashree, S. I. R., Nagaraj, K., Prince, M., Fall, C. H. D. & Krishna, M. Diagnosis of dementia by machine learning methods in epidemiological studies: a pilot exploratory study from south India. Soc. Psychiatry Psychiatr Epidemiol. 53 (1), 77–86 (2018).
Article PubMed Google Scholar
Ion-Margineanu, A. et al. Machine Learning Approach for classifying multiple sclerosis courses by combining Clinical Data with Lesion loads and magnetic resonance metabolic features. Front. Neurosci. 11, 398 (2017).
Article PubMed PubMed Central MATH Google Scholar
Rekik, I., Allassonniere, S., Carpenter, T. K. & Wardlaw, J. M. Medical image analysis methods in MR/CT-imaged acute-subacute ischemic stroke lesion: segmentation, prediction and insights into dynamic evolution simulation models. A critical appraisal. Neuroimage Clin. 1 (1), 164–178 (2012).
Article PubMed PubMed Central MATH Google Scholar
Zacharaki, E. I., Kanas, V. G. & Davatzikos, C. Investigating machine learning techniques for MRI-based classification of brain neoplasms. Int. J. Comput. Assist. Radiol. Surg. 6 (6), 821–828 (2011).
Article PubMed PubMed Central Google Scholar
Lisowska, A., O’Neil, A. & Dilys, V. Context-aware convolutional neural networks for stroke sign detection in non-contrast CT scans. Paper presented at: Annual Conference on Medical Image Understanding and Analysis pp. 494–505.
Hu, M., Chen, N., Zhou, X., Wu, Y. & Ma, C. Deep learning-based computed Tomography Perfusion Imaging to evaluate the effectiveness and safety of thrombolytic therapy for cerebral infarct with unknown time of Onset. Contrast Media Mol. Imaging. 2022, 9684584 (2022).
Article PubMed PubMed Central Google Scholar
Friedman, J., Hastie, T. & Tibshirani, R. Regularization paths for generalized Linear models via Coordinate Descent. J. Stat. Softw. 33 (1), 1–22 (2010).
Article PubMed PubMed Central MATH Google Scholar
Huang, W. et al. Noninvasive imaging of the tumor immune microenvironment correlates with response to immunotherapy in gastric cancer. Nat. Commun. 13 (1), 5095 (2022).
Article ADS CAS PubMed PubMed Central MATH Google Scholar
Mendelson, S. J. & Prabhakaran, S. Diagnosis and management of transient ischemic attack and Acute ischemic stroke: a review. JAMA 325 (11), 1088–1098 (2021).
Article CAS PubMed MATH Google Scholar
Vilela, P. & Rowley, H. A. Brain ischemia: CT and MRI techniques in acute ischemic stroke. Eur. J. Radiol. 96, 162–172 (2017).
Article PubMed MATH Google Scholar
Saver, J. L. Time is brain–quantified. Stroke 37 (1), 263–266 (2006).
Article PubMed MATH Google Scholar
Kelly, A. G., Hellkamp, A. S., Olson, D., Smith, E. E. & Schwamm, L. H. Predictors of rapid brain imaging in acute stroke: analysis of the get with the guidelines-Stroke program. Stroke 43 (5), 1279–1284 (2012).
Article PubMed Google Scholar
Lee, J. S. & Demchuk, A. M. Choosing a Hyperacute Stroke Imaging Protocol for proper patient selection and time efficient endovascular treatment: lessons from recent trials. J. Stroke. 17 (3), 221–228 (2015).
Article PubMed PubMed Central MATH Google Scholar
Liu, L. et al. Chinese Stroke Association guidelines for clinical management of cerebrovascular disorders: executive summary and 2019 update of clinical management of ischaemic cerebrovascular diseases. Stroke Vasc Neurol. 5 (2), 159–176 (2020).
Article PubMed PubMed Central MATH Google Scholar
Radhiana, H., Syazarina, S. O., Shahizon Azura, M. M., Hilwati, H. & Sobri, M. A. Non-contrast computed tomography in Acute Ischaemic Stroke: a Pictorial Review. Med. J. Malaysia. 68 (1), 93–100 (2013).
CAS PubMed Google Scholar
Wu, S. et al. Hyperdense artery sign, symptomatic infarct swelling and effect of alteplase in acute ischaemic stroke. Stroke Vasc Neurol. 6 (2), 238–243 (2021).
Article PubMed MATH Google Scholar

Download references

Acknowledgements

Not applicable.

Funding

This study has received funding from Natural Science Foundation of China (NSFC: 82471978 and 82271993) by Xm.W. The Cancer Prevention and Treatment fund of Shandong Province Natural Science Foundation (ZR2019LZL009) and the Key Research and Development Project of Shandong Province (2021SFGC0104) by Hy.W.

Author information

Kui Sun, Rongchao Shi and Xinxin Yu contributed equally.

Authors and Affiliations

Department of General Surgery, Peking University Third Hospital, Beijing, China
Kui Sun
Department of Radiology, Shandong Provincial Hospital Affiliated to Shandong First Medical University, Jing Wu Road, No. 324, Jinan, 250021, Shandong, China
Xinxin Yu, Ying Wang, Bing Kang, Tong Li, Haiyan Wang & Ximing Wang
Department of Radiology, Beijing Friendship Hospital, Capital Medical University, Beijing, 100050, China
Rongchao Shi
Department of Radiology, Wangjing Hospital of CACMS, Beijing, 100102, China
Wei Zhang
Department of Radiology, The Third People’s Hospital of Datong, Datong, 037000, Shanxi, China
Xiaoxia Yang
Department of Radiology, Shandong First Medical University & Shandong Academy of Medical Sciences, Taian, 271016, Shandong, China
Mei Zhang
Department of Radiology, Jinan Central Hospital Affiliated to Shandong University, Jinan, 250013, Shandong, China
Jian Wang
Department of Radiology, The First Affiliated Hospital of Shandong First Medical University, Shandong Provincial Qianfoshan Hospital, Jinan, 250014, Shandong Province, China
Shu Jiang
Department of Radiology, Cheeloo College of Medicine, Qilu Hospital, Shandong University, Jinan, 250012, Shandong, China
Haiou Li
The National Clinical Research Center for Mental Disorders & Beijing Key Laboratory of Mental Disorders, Beijing Anding Hospital, Capital Medical University, Beijing, 100088, China
Shuying Zhao
Department of Otolaryngology-Head and Neck Surgery, Cheeloo College of Medicine, Shandong Provincial ENT Hospital, Shandong University, Jinan, 250022, China
Yu Ai
Medical Science and Technology Innovation Center, Shandong First Medical University, Shandong Academy of Medical Sciences, Jinan, 250000, China
Jianfeng Qiu

Authors

Kui Sun
View author publications
Search author on:PubMed Google Scholar
Rongchao Shi
View author publications
Search author on:PubMed Google Scholar
Xinxin Yu
View author publications
Search author on:PubMed Google Scholar
Ying Wang
View author publications
Search author on:PubMed Google Scholar
Wei Zhang
View author publications
Search author on:PubMed Google Scholar
Xiaoxia Yang
View author publications
Search author on:PubMed Google Scholar
Mei Zhang
View author publications
Search author on:PubMed Google Scholar
Jian Wang
View author publications
Search author on:PubMed Google Scholar
Shu Jiang
View author publications
Search author on:PubMed Google Scholar
Haiou Li
View author publications
Search author on:PubMed Google Scholar
Bing Kang
View author publications
Search author on:PubMed Google Scholar
Tong Li
View author publications
Search author on:PubMed Google Scholar
Shuying Zhao
View author publications
Search author on:PubMed Google Scholar
Yu Ai
View author publications
Search author on:PubMed Google Scholar
Jianfeng Qiu
View author publications
Search author on:PubMed Google Scholar
Haiyan Wang
View author publications
Search author on:PubMed Google Scholar
Ximing Wang
View author publications
Search author on:PubMed Google Scholar

Contributions

XW, HW, KS, XY conceived and designed the study. KS, RS, YW, WZ, XY, JW, SJ, HL, and TL acquired the data. XW, HW, KS, RS and XY implemented quality control of data. KS, RS and BK did the radiomics and machine learning-based analysis. KS, RS, XY and YW performed the statistical analysis. KS and XY made figures and tables. KS, RS, XY, YW prepared the first draft of the manuscript. KS, RS, YW, MZ, SZ, YA, JQ, HW and XW reviewed the manuscript. HW and XW had direct access to and verified the data. KS, RS and XY contributed equally to this work as first authors. HW and XW contributed equally to this work as co-corresponding author. All authors read and approved the final manuscript.

Corresponding authors

Correspondence to Haiyan Wang or Ximing Wang.

Ethics declarations

Competing interests

The authors declare no competing interests.

Consent for publication

Not applicable.

Ethical approval

The study was approved and the requirement for informed patient consent was waived by the ethical committee of Shandong Provincial Hospital Affiliated to Shandong First Medical University. The study was performed in accordance with the ethical standards as laid down in the 1964 Declaration of Helsinki and its later amendments or comparable ethical standards.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Material 1 (download DOCX )

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

Sun, K., Shi, R., Yu, X. et al. Noninvasive imaging biomarker reveals invisible microscopic variation in acute ischaemic stroke (≤ 24 h): a multicentre retrospective study. Sci Rep 15, 3743 (2025). https://doi.org/10.1038/s41598-025-88016-1

Download citation

Received: 29 August 2024
Accepted: 23 January 2025
Published: 30 January 2025
Version of record: 30 January 2025
DOI: https://doi.org/10.1038/s41598-025-88016-1

Keywords

This article is cited by

Distinct visual biases affect humans and artificial intelligence in medical imaging diagnoses
- Graham A. McLeod
- Emma A. M. Stanley
- Nils D. Forkert
npj Digital Medicine (2025)
Bi-parametric MRI-based quantification radiomics model for the noninvasive prediction of histopathology and biochemical recurrence after prostate cancer surgery: a multicenter study
- Si Yu Wu
- Ying Wang
- Mian Zhang
Abdominal Radiology (2025)

Subjects

Abstract

Similar content being viewed by others

Combining clinical and imaging data for predicting functional outcomes after acute ischemic stroke: an automated machine learning approach

Development and validation of a machine learning-based prognostic risk stratification model for acute ischemic stroke

Random forest-based prediction of stroke outcome

Introduction

Materials and methods

Study design and patient enrolment

Delineation and feature extraction

Feature selection and classifier building

Statistical analysis

Results

Optimal radiomics feature

Predictive performance validated by independent external cohorts

Top features ranked by coefficients

Subgroup analysis and feature contribution calculated by SHAP

Discussion

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Consent for publication

Ethical approval

Additional information

Publisher’s note

Electronic supplementary material

Supplementary Material 1 (download DOCX )

Rights and permissions

About this article

Cite this article

Share this article

Keywords

This article is cited by

Distinct visual biases affect humans and artificial intelligence in medical imaging diagnoses

Bi-parametric MRI-based quantification radiomics model for the noninvasive prediction of histopathology and biochemical recurrence after prostate cancer surgery: a multicenter study

Search

Quick links