Multimodal nomogram integrating deep learning radiomics and hemodynamic parameters for early prediction of post-craniotomy intracranial hypertension

Fu, Zunfeng; Wang, Jing; Shen, Wenyi; Wu, Yanqing; Zhang, Jiajun; Liu, Yan; Wang, Chongqiang; Shen, Yanlin; Zhu, Ye; Zhang, Weifu; Lv, Chunju; Peng, Lin

doi:10.1038/s41598-025-09115-7

Download PDF

Article
Open access
Published: 02 July 2025

Multimodal nomogram integrating deep learning radiomics and hemodynamic parameters for early prediction of post-craniotomy intracranial hypertension

Zunfeng Fu¹^na1,
Jing Wang¹^na1,
Wenyi Shen¹,
Yanqing Wu⁴,
Jiajun Zhang¹,
Yan Liu¹,
Chongqiang Wang¹,
Yanlin Shen⁵,
Ye Zhu⁶,
Weifu Zhang³,
Chunju Lv¹ &
…
Lin Peng²

Scientific Reports volume 15, Article number: 23595 (2025) Cite this article

2075 Accesses
Metrics details

Subjects

Abstract

To evaluate the effectiveness of deep learning radiomics nomogram in distinguishing early intracranial hypertension (IH) following primary decompressive craniectomy (DC) in patients with severe traumatic brain injury (TBI) and to demonstrate its potential clinical value as a noninvasive tool for guiding timely intervention and improving patient outcomes. This study included 238 patients with severe TBI (training cohort: n = 166; testing cohort: n = 72). Postoperative ultrasound images of the optic nerve sheath (ONS) and Spectral doppler imaging of middle cerebral artery (MCASDI) were obtained at 6 and 18 h after DC. Patients were grouped according to threshold values of 15 mmHg and 20 mmHg based on invasive intracranial pressure (ICPi) measurements. Clinical-semantic features were collected, and radiomics features were extracted from ONS images, and Additionally, deep transfer learning (DTL) features were generated using RseNet101. Predictive models were developed using the Light Gradient Boosting Machine (light GBM) machine learning algorithm. Clinical-ultrasound variables were incorporated into the model through univariate and multivariate logistic regression. A combined nomogram was developed by integrating DLR (deep learning radiomics) features with clinical-ultrasound variables, and its diagnostic performance over different thresholds was evaluated using Receiver Operating Characteristic (ROC) curve analysis and decision curve analysis (DCA). The nomogram model demonstrated superior performance over the clinical model at both 15 mmHg and 20 mmHg thresholds. For 15 mmHg, the AUC was 0.974 (95% confidence interval [CI]: 0.953–0.995) in the training cohort and 0.919 (95% CI: 0.845–0.993) in the testing cohort. For 20 mmHg, the AUC was 0.968 (95% CI: 0.944–0.993) in the training cohort and 0.889 (95% CI: 0.806–0.972) in the testing cohort. DCA curves showed net clinical benefit across all models. Among DLR models based on ONS, MCASDI, or their pre-fusion, the ONS-based model performed best in the testing cohorts. The nomogram model, incorporating clinical-semantic features, radiomics, and DTL features, exhibited promising performance in predicting early IH in post-DC patients. It shows promise for enhancing non-invasive ICP monitoring and supporting individualized therapeutic strategies.

Development and validation of a multi-omics hemorrhagic transformation model based on hyperattenuated imaging markers following mechanical thrombectomy

Article Open access 23 May 2025

Noncontrast CT-based deep learning for predicting intracerebral hemorrhage expansion incorporating growth of intraventricular hemorrhage

Article Open access 31 August 2025

A novel nomogram for predicting prolonged disorders of consciousness in severe supratentorial hypertensive intracerebral hemorrhage patients

Article Open access 17 July 2025

Introduction

Accurate early prediction of intracranial hypertension (IH) is critical in the management of brain injury patients, as it can lead to brain herniation and pose a significant risk of irreversible brain damage and death^1,2. Currently, a variety of invasive and non-invasive methods and devices are used for intracranial pressure (ICP) measurement³. Invasive monitoring is achieved by placing devices into the ventricles, brain parenchyma, epidural, subdural, and subarachnoid spaces^1,2. Although invasive ICP monitoring (ICPi) has several disadvantages, such as the risk of infections, bleeding, catheter blockage, and brain parenchymal damage^3,4, it remains the gold standard for ICP detection due to its high accuracy. However, its high cost and technical demands mean that not all medical institutions can perform such monitoring routinely. Non-invasive techniques, such as optic nerve sheath diameter (ONSD) assessment, pupil measurement, transcranial Doppler (TCD), HeadSense technology, and flash visual evoked potential (FVEP), are also used to assess ICP. While ONSD and TCD have demonstrated moderate to good accuracy in related studies^5,6,7, they are not yet able to completely replace invasive ICP monitoring in the short term.

The American Brain Injury Foundation recommends ICP monitoring for all traumatic brain injury (TBI) cases with a Glasgow Coma Scale (GCS) score between 3 and 8, and abnormal CT findings (such as hematoma, contusion, severe swelling, brain herniation, or basal cistern collapse)⁸. However, for patients after initial craniotomy, there is controversy regarding whether to implement invasive ICP monitoring, even if they meet the criteria for ICP monitoring. Proponents argue that invasive monitoring is necessary for maintaining adequate cerebral perfusion pressure (CPP). Without ICP monitoring, it would be difficult to develop and implement CPP-guided treatment protocols. Studies have shown that a significant proportion of TBI patients still have elevated ICP after undergoing decompressive craniectomy (DC), potentially causing CPP to drop to dangerously low levels (< 60 mmHg). Therefore, ICP monitoring is considered essential for these patients, as it provides crucial data for CPP-based therapeutic decisions^9,10. Conversely, opponents argue that routine ICP monitoring is unnecessary. They believe that experienced clinicians can use pressure measurements from the decompression window, combined with CT or MRI scans, to identify recurrent hematomas, fluid accumulation, and changes in the size and shape of the ventricles, thereby estimating ICP levels and conducting a comprehensive ICP assessment. Furthermore, they point out that there is no consensus on the threshold for ICP after DC. Additionally, for severely injured TBI patients, maintaining ICP below 20 mmHg does not offer a significant advantage over treatment protocols based on imaging and clinical examination alone^11,12. Despite the differences in opinion, the importance of maintaining normal ICP in post-DC patients is undeniable, and mature, reliable non-invasive monitoring techniques may hold the key to resolving this debate in the future. In our previous research¹³, our team demonstrated that the combined use of ONSD by ultrasound and transcranial color doppler (TCCD) has good clinical application value, outperforming single indicators in discriminating early IH. However, for patients with mild IH (ICP ≥ 15 mmHg), the predictive sensitivity was only 0.484, (95% confidence interval [CI]:0.358–0.613) through this combined clinical method [Area Under the Curve (AUC):0.732, 95% CI: (0.651–0.822)], which warrants further investigation into optimizing predictive models to improve the monitoring of early mild IH.

Radiomics is a quantitative analytical approach that allows for the extraction and evaluation of radiological features from the entire lesion, enhancing the accuracy and objectivity of disease diagnosis and treatment outcome prediction¹⁴. Deep learning can quantify high-dimensional radiological phenotypes beyond human perception and develop specialized predictive models for diverse clinical applications, making it a valuable non-invasive tool for prognostic assessment¹⁵, disease diagnosis¹⁶, and treatment decision-making¹⁷. Nevertheless, research evidence remains insufficient to establish the feasibility of using deep learning techniques to predict early IH in post-DC patients with TBI. This study aims to develop a deep learning model based on ultrasound images to enhance the accuracy of IH prediction in these patients. Additionally, the study evaluates the predictive value of individual ONS images, individual middle cerebral artery spectral Doppler imaging (MCASDI) images, and their pre-fused images within the comprehensive model.

Materials and methods

Patient population

This retrospective observational study was conducted at the Neuro-Intensive Care Unit (NICU) of the Second Affiliated Hospital of Shandong First Medical University (SFMU) between May 2018 and December 2024 and at the NICU of the Affiliated Hospital of Qilu Medical University between August 2019 and December 2024. Due to the retrospective nature of the study, the Ethics Committee of the Second affiliated Hospital of SFMU waived the need of obtaining informed consent.

A total of 238 patients with TBI who underwent primary DC were included based on predefined inclusion and exclusion criteria (Fig. 1: Flowchart for Selecting the Study Population). The study enrolled adult patients (≥ 18 years old) with TBI who had undergone primary DC and required ICP monitoring within 24 h postoperatively.

Exclusion criteria included patients with poor TCCD window quality that impaired 2D imaging—specifically, cases where the temporal bone window could not be penetrated by ultrasound to detect the MCA, or where vascular boundaries appeared blurred and the signal-to-noise ratio (SNR) and contrast-to-noise ratio (CNR) were reduced, compromising diagnostic reliability. Patients with significant cardiovascular or cerebrovascular conditions, such as severe arrhythmias, critical valvular stenosis, moderate to severe cerebral vasospasm, or ICPi fluctuations exceeding 10 mmHg within 24 h, were also excluded. Eligible patients—176 from the Second Affiliated Hospital of SFMU and 62 from the Affiliated Hospital of Qilu Medical University—were randomly allocated to the training and testing cohorts at a 7:3 ratio.

Treatment methods

In patients with TBI, the decision to perform primary DC is influenced by factors such as the injury mechanism, clinical symptoms, GCS score, and findings on CT imaging. Unilateral DC is typically performed when hematomas are limited to a single hemisphere, whereas bilateral or frontotemporal decompression is preferred in cases involving bilateral hematomas or frontal lobe damage. Surgically, a bone flap approximately 9 × 9 cm is created and extended in an arc to adequately expose the dura mater. Hemostasis is achieved by evacuating hematomas, necrotic brain tissue, and clots. The dura is then either loosely sutured or covered with a synthetic substitute. ICP is continuously monitored via an intraparenchymal sensor or a ventricular catheter linked to an external transducer. The probe placement depends on the lesion location—on the same side for unilateral lesions, or on the right hemisphere in cases of diffuse injury.

ICP management in this study followed the Chinese Expert Consensus on the Monitoring and Management of Elevated ICP. Sedation was maintained using agents such as propofol, titrated based on electroencephalography to achieve burst suppression. When partial neurological evaluation was required, midazolam or dexmedetomidine were administered with individualized dosing. Sedation depth and ICP were continuously monitored, with sedation typically maintained for 24–72 h postoperatively. CPP was kept between 60 and 70 mmHg, adjusted based on optimal CPP, and supported by vasopressors like norepinephrine to maintain MAP. For hyperosmolar therapy, 3% hypertonic saline (2–5 mL/kg over 15–30 min) and mannitol (0.25–1 g/kg over 20–30 min) were used, with serum osmolality (< 320 mOsm/L) and renal function continuously monitored. Hypertonic Saline was preferred during acute ICP elevation (ICP > 20 mmHg), with repeat doses at 4–6 h intervals if necessary. Therapeutic hypothermia (33–35 °C for 24–72 h) was induced gradually (0.25–0.5 °C/h), with rewarming performed slowly (0.1–0.25 °C/h) to avoid rebound ICP elevation. Core temperature, ICP, CPP, and electrolytes were continuously monitored. All interventions were adjusted dynamically based on multimodal monitoring, balancing clinical benefit with risks such as infection or coagulopathy.

In the absence of universally accepted guidelines for IH post-DC, thresholds of 15 mmHg (indicating mild elevation) and 20 mmHg (indicating moderate to severe elevation) were adopted to assess the model’s predictive capability at these levels.

US acquisition, semantic features, and segmentation

ONS imaging was performed twice within 24 h in post-DC patients using the Mindray M9 ultrasound system equipped with an L12-4s linear probe (5–10 MHz; Shenzhen, Guangzhou, China). Patients were positioned in a supine posture, and the probe was gently placed on the closed upper eyelid with standard coupling gel, angled to visualize the ONS in the transverse plane. ONSD measurements were obtained 3 mm posterior to the optic disc, with three measurements taken in the transverse plane for each eye to reduce variability.

To ensure both image quality and safety, the probe frequency was set to 10 MHz, providing an optimal balance between resolution and tissue penetration. The power output was maintained at 20–25%, in accordance with the “As Low As Reasonably Achievable” (ALARA) principle and within recommended safety limits (MI ≤ 0.23, TI ≤ 1 °C, and ISPTA.3 ≤ 50 mW/cm²). The depth was adjusted to 40–45 mm to center the eyeball within the field of view, and the focus was set at 25–30 mm, aligning with the typical position of the optic nerve. To enhance image clarity, the gain was set to 55–60%, ensuring adequate brightness without introducing excessive contrast. For Doppler imaging, a pulse repetition frequency (PRF) of 1.0 kHz was used. A frame rate of 25 frames per second (FPS) was maintained to ensure resolution. All measurements were performed using the same type of probe across patients to maintain consistency and comparability.

Additionally, MCASDI captured by bedside TCCD and assessments were conducted using the same Mindary M9 ultrasound system, equipped with an SP5-1s probe (1–5 MHz). Bilateral MCA evaluation included the measurement of pulse index (PI), peak systolic velocity (PSV), end-diastolic velocity (EDV), and mean velocity (MV). The definitive PI value was determined by averaging the measurements from both sides or selecting the higher value in cases of significant discrepancy.

For TCCD assessments, a phased-array probe with a frequency of 1.6 MHz was used to ensure adequate acoustic penetration through the temporal bone window. The imaging depth was set between 12 and 16 cm to visualize the basal cerebral arteries, and the focal zone was positioned at 5–7 cm, corresponding to the depth of the circle of Willis. The gain was adjusted to 60–70% to achieve optimal image brightness while minimizing background noise. Additionally, a frame rate of 25 FPS was maintained to ensure resolution. The PRF was set to 3 kHz to allow for accurate detection of cerebral blood flow velocities without aliasing.

Two ultrasound specialists (ZJJ and WCQ), who were blinded to the ICPi values and clinical information, independently assessed the ONS and TCCD images. One specialist had 4 years of experience, while the other had 15 years of expertise in ultrasound imaging. Both specialists underwent standardized training for measurements and reached a consensus on their evaluations of the images.

Ultrasound images were preprocessed by converting them from digital imaging and communications in medicine (DICOM) format to a compatible format, resizing them to a uniform 512 × 512 pixel resolution, and applying z-score normalization to standardize pixel intensities. These steps ensured consistent image quality and facilitated accurate feature extraction and model training.

The regions of interest (ROIs) were manually delineated by ultrasound imaging experts using ITK-SNAP (version 3.8.0, http://www.itksnap.org/). The experts were blinded to the ICPi values, and any discrepancies were resolved through consensus discussion. Intra- and inter-observer reproducibility were assessed using the intraclass correlation coefficient (ICC), with values ≥ 0.75 considered to indicate strong reliability. Figure 2 illustrates the flowchart depicting the overall research design.

Radiomics and DTL feature extraction

Radiomic features were extracted from the segmented ROIs using PyRadiomics (version 3.0.1, http://pyradiomics.readthedocs.io). The extracted features were categorized into three primary groups: (I) geometry, (II) intensity, and (III) texture. Geometric features describe the two-dimensional shape of the ONS, while intensity features reflect the statistical distribution of voxel intensities within the ONS. These intensity features capture the first-order statistical distribution of voxel intensities, providing insights into the overall brightness and contrast of the ONS. Texture features, on the other hand, characterize the patterns and spatial distribution of intensities at higher orders, offering further details on the internal structure of the ONS. A comprehensive list and detailed explanation of the extracted radiomic features can be found in Supplementary S1.

The ResNet101 architecture was adapted to develop a convolutional neural network (CNN) for extracting deep transfer learning (DTL) features. To achieve this, separate CNN models were pretrained using ONS images, MCASDI images, and a combined dataset containing both ONS and MCASDI images. After feature extraction, the DTL features from ONS and MCASDI images were fused using feature concatenation, where the extracted feature vectors from each modality were directly combined to form a comprehensive representation. Specifically, the output from the penultimate layer of each pretrained CNN model was concatenated to generate the final DTL feature set, ensuring an integrated representation of both imaging modalities.

To fine-tune ResNet101 and reduce overfitting, we applied a dropout rate of 0.5 and L2 regularization with a coefficient of 0.001. The initial learning rate was set to 0.0001 and adjusted using a cosine annealing strategy. Additionally, we used data augmentation techniques during training, including random horizontal and vertical flipping, slight rotations (± 5°), and brightness adjustments (± 10%), to improve model robustness and generalizability.

Radiomics and DTL feature selection

The z-score normalization technique was applied to standardize all features. A statistical analysis was then performed using the Mann–Whitney U test, with features selected based on their p-values. Only features with a p-value < 0.05 were retained. Spearman’s rank correlation coefficient was calculated to assess the correlation of features with high repeatability. If two features had a correlation coefficient greater than 0.9, one was excluded. The signature construction from the discovery dataset employed Least Absolute Shrinkage and Selection Operator (LASSO) regression. LASSO regression uses a regularization weight, λ, to shrink regression coefficients towards zero, assigning zero to irrelevant features. The optimal value of λ was determined via 5-fold cross-validation, selecting the value that minimized the cross-validation error. The scikit-learn package (version 1.0.2) in Python was utilized for LASSO regression modeling. Subsequently, all selected features were used to train a Light Gradient Boosting Machine (LightGBM), with hyperparameters set as follows: learning rate = 0.1, maximum depth = 6, and number of leaves = 31. To avoid data leakage and ensure the integrity of model evaluation, all feature selection procedures were performed exclusively on the training dataset. The independent testing cohort was strictly reserved for external validation and was not involved in any stage of feature selection or model training.

Statistical analyses

Statistical analyses were performed using MedCalc (version 20.015.0), Python (version 3.7.11; http://www.python.org). Student’s t-test and ANOVA were applied to compare continuous variables, expressed as mean ± standard deviation (χ ± s). The chi-square test or Fisher’s exact test was used to analyze categorical variables, presented as ratios. A p-value of less than 0.05 was considered statistically significant.

Results

Patients’ population and clinical-ultrasonic characteristics

A total of 238 patients, including 159 males and 79 females, participated in the study. At an ICP threshold of 15 mmHg, the incidence of intracranial hypertension (IH) was 33.19%, while at 20 mmHg, it was 26.89%. Among these patients, 166 were randomly assigned to the training set, and 72 to the test set.

No significant differences were observed between the cohorts in terms of age, weight, height, BMI, gender, ejection fraction (EF), MAP, partial pressure of oxygen (PaO₂), partial pressure of carbon dioxide (PaCO₂), heart rate, or respiratory rate, as outlined in Table 1 (ICP threshold of 15 mmHg) and Supplementary Table 1S (ICP threshold of 20 mmHg). Significant differences were observed in PI and ONSD between the normal and elevated ICP groups in both the training and validation sets. Univariate and multivariable logistic regression analyses further identified MCAPI and ONSD as significant predictors (see Table 2 and Supplementary Table 2S). These variables were subsequently incorporated into the clinical predictive model.

Patients with elevated ICP (≥ 15 mmHg) exhibited a higher likelihood of increased PI (odds ratio [OR]: 4.911; 95% CI: 1.777–13.578) and elevated ONSD (OR: 7.320; 95% CI: 3.386–15.841). A similar trend was observed at the 20 mmHg ICP threshold, with increased PI (OR: 4.976; 95% CI: 1.840–13.454) and elevated ONSD (OR: 4.302; 95% CI: 2.204–8.410).

Table 1 Baseline characteristics of patients in cohorts.

Full size table

Table 2 Univariate and multivariable logistic regression analyses for selecting clinical features of model development (15 mmHg).

Full size table

Feature selection and deep learning radiomics model construction

Radiomic features were extracted from the segmented ROIs using the PyRadiomics software (version 3.0.1), yielding a total of 1,561 commonly used features. To ensure reliability, a reproducibility assessment was conducted using intraclass correlation coefficients (ICCs), which confirmed 1,291 and 1,059 features as reliable at thresholds of 15 mmHg and 20 mmHg, respectively. Subsequently, the Mann–Whitney U test identified 1,023 and 906 statistically significant features. Further refinement was performed using the Spearman rank correlation test, narrowing the selection to 280 and 237 features. The Rad score was then computed using a LASSO regression model, which identified 20 and 26 features with nonzero coefficients. The coefficients and mean standard errors (MSE) from 5-fold cross-validation are shown in Figs. 3A and B. Figure 3C presents the coefficient values of the selected nonzero features at the 15 mmHg threshold, while Supplementary Fig. 3S(A–C) displays the corresponding results for 20 mmHg. The radiomics signature model demonstrated robust predictive performance, achieving AUC values of 0.974 (95% CI: 0.952–0.995) and 0.912 (95% CI: 0.832–0.991) in the training and testing cohorts at 15 mmHg, and 0.943 (95% CI: 0.902–0.983) and 0.887 (95% CI: 0.801–0.973) at 20 mmHg, as shown in Figs. 3D and Supplementary 3SD.

To enhance predictive accuracy for early IH in post-DC patients, radiomic features were combined with DTL features. This fusion strategy incorporated DLR features for further analysis. Using the LASSO regression model, 14 and 26 features with nonzero coefficients were selected at thresholds of 15 mmHg and 20 mmHg, respectively. The coefficients and mean standard errors from 5-fold cross-validation are presented in Figs. 4A and B for 15 mmHg and Supplementary Figs. 4SA and SB for 20 mmHg. Figures 4C and Supplementary 4 SC illustrate the coefficient values of the final selected features for 15 mmHg and 20 mmHg, respectively. The DLR-radiomics signature model achieved AUC values of 0.927 (95% CI: 0.896–0.958) and 0.885 (95% CI: 0.779–0.992) in the training and testing cohorts at 15 mmHg, and 0.949 (95% CI: 0.915–0.983) and 0.893 (95% CI: 0.801–0.985) at 20 mmHg, as demonstrated in Fig. 4D and Supplementary 4SD. Decision curve analysis (DCA) for the radiomics and DLR-radiomics signature models is presented in Figs. 3E and 4E for 15 mmHg and Supplementary Figs. 3SE and 4 SE for 20 mmHg.

Clinical model and nomogram model construction

The clinical model demonstrated strong predictive performance, achieving AUC values of 0.868 (95% CI: 0.812–0.923) and 0.807 (95% CI: 0.705–0.909) in the training and testing cohorts at 15 mmHg, and 0.891 (95% CI: 0.842–0.940) and 0.814 (95% CI: 0.691–0.937) at 20 mmHg, respectively, as shown in Fig. 5A and Supplementary Fig. 5SA. DCA confirmed the model’s net benefit in predicting IH, as depicted in Figs. 5B and Supplementary 5SB for 15 and 20 mmHg. Additionally, Fig. 6 and Supplementary 6S present nomograms designed for clinical application, where the total score corresponds to the probability of IH in post-DC patients within 24 h at different thresholds.

Comparison of the clinical model, DTL-radiomics model, and nomogram model

The performance of the clinical, radiomics, DTL, DLR-radiomics, and nomogram models is summarized in Table 3 (15 mmHg) and Supplementary Table 3S (20 mmHg). At the 15 mmHg threshold, AUC values in the training cohort (Fig. 7A) were as follows: 0.860 (95% CI:0.796–0.925) for the clinical model, 0.974 (95% CI:0.952–0.995) for the radiomics model, 0.946 (95% CI:0.908–0.984) for the DTL model, 0.973 (95% CI:0.951–0.994) for the DLR model, and 0.974 (95% CI:0.953–0.995) for the nomogram model. In the testing cohort (Fig. 7D), the AUC values were 0.801 (95% CI:0.689–0.913) for the clinical model, 0.912 (95% CI: 0.832–0.991) for the radiomics model, 0.850 (95% CI:0.747–0.956) for the DTL model, 0.914 (95% CI:0.835–0.994) for the DLR model, and 0.919 (95% CI:0.846–0.993) for the nomogram model. Decision curve analysis (DCA) results for these models are presented in Figs. 7B and E, with similar patterns observed at the 20 mmHg threshold in Supplementary Fig. 7S (A, B, D, E).

Table 3 The performance of five models in the training and testing cohort (15 mmHg).

Full size table

The results indicate that, in the training cohort, the nomogram, DLR, and radiomics models provided greater net clinical benefit in predicting IH compared to the clinical model. The calibration curves (Fig. 7C and F; Supplementary Fig. 7SC and SF) demonstrate good concordance between predicted and observed IH in both the training and testing cohorts. The Hosmer–Lemeshow test results were not statistically significant in either cohort (training: p = 0.386; testing: p = 0.131), supporting the models’ calibration accuracy. Furthermore, the DeLong test showed that the nomogram significantly outperformed the clinical model at the 15 mmHg threshold in both the training (p = 0.000) and testing (p = 0.004) cohorts. At the 20 mmHg threshold, however, the nomogram demonstrated superior performance only in the training cohort (p = 0.003), with no significant improvement observed in the testing cohort (p = 0.525). This difference may be due to the limited additional information contributed by the DLR model when combined with clinical features for predicting moderate to severe IH in the testing cohort.

This study also assessed the performance of DLR models constructed using ONS images, MCASDI images, and their concatenation-based feature fusion. At the 15 mmHg threshold, the DLR(ONS) model [AUC: 0.914 (95% CI:0.835–0.994) ] exhibited comparable predictive performance to the DLR (feature-fused) model ( 0.913, 95% CI:0.832–0.994) and the DLR (MCASDI) model (0.873, 95% CI:0.757–0.989) in the testing cohort. Similarly, at the 20 mmHg threshold, the AUC values in the testing cohort were 0.893 (95% CI:0.801–0.985), 0.889 (95% CI:0.810–0.965) and 0.870 (95% CI:0.751–0.988), respectively. These findings indicate that concatenation-based feature pre-fusion of ONS and MCASDI images did not provide a substantial improvement in predictive performance over the DLR(ONS) model alone. Therefore, all images and tables presented in this study are based on features extracted from ONS images.

Discussion

This study utilized deep learning-based radiomic features to develop a nomogram model for accurately predicting the risk of early IH in post-DC patients. By integrating radiomic and DTL features with clinical parameters, the nomogram model demonstrated stronger performance compared to the clinical model at both 15 mmHg and 20 mmHg thresholds. For 15 mmHg, the AUC was 0.974 (95% CI: 0.953–0.995) in the training cohort and 0.919 (95% CI: 0.845–0.993) in the testing cohort. For 20 mmHg, the AUC was 0.968 (95% CI: 0.944–0.993) in the training cohort and 0.889 (95% CI: 0.806–0.972) in the testing cohort. Furthermore, its capability to assess IH using postoperative ultrasound scans provides valuable support for early intervention and treatment planning.

Previous studies¹⁸ have shown that ultrasound measurement of ONSD is a highly effective method for detecting IH following TBI or other causes. The AUC for this technique is 0.940 (95% CI: 0.91–0.96), whereas TCD demonstrates relatively poor performance in detecting IH, with AUC values ranging from 0.55 to 0.72 across different studies. Notably, both ONSD and TCD-derived parameters hold promise due to their ease of use, bedside feasibility, and cost-effectiveness⁶. However, most prior researches have focused on patients who did not undergo DC after TBI, with limited studies examining post-DC patients. This gap is partly due to the altered intracranial physiology in post-DC patients, which deviates from the classical Monroe–Kellie doctrine and complicates the application of conventional ICP assessment methods. Following DC, intracranial contents protrude through the decompressed bone window, disrupting the original ICP equilibrium and impairing the autoregulatory function of cerebral blood vessels. Consequently, conventional ICP thresholds, such as 20, 22 or 25 mmHg, are no longer applicable to these patients. Furthermore, the clinical utility of TCD-derived parameters, including the PI, EDV of MCA, and non-invasive ICP estimation formulas, has been questioned.

Several studies^19,20 have attempted to correlate post-DC ONSD changes with ICP through observational methods. However, this approach is highly operator-dependent, as ONSD measurements are influenced by factors such as ultrasound probe selection, measurement plane, and patient positioning. These methodological inconsistencies limit the scalability of this technique in clinical practice. Our previous research found that the AUC for predicting IH (≥ 20 mmHg) using ONSD alone was 0.74, and for an ICP threshold of 15 mmHg, it was 0.73¹³. These findings highlight the limitations of relying on a single clinical parameter for non-invasive ICP monitoring in post-DC patients. In the current study, we developed a clinical model that integrates ONSD and TCCD-derived parameters to predict IH. The AUC for predicting IH at 20 mmHg was 0.85, while at 15 mmHg, it was 0.79. Additionally, for the first time, we applied radiomics and deep learning techniques to ultrasound images of the ONS and the MCASDI for non-invasive ICP monitoring in post-DC patients. The nomogram model achieved strong predictive ability, with consistent results across both training and testing datasets.

During the deep learning modeling process, we performed feature pre-fusion using both ONS and MCASDI images. However, the pre-fused model did not outperform the ONS-based DLR model alone. We hypothesize two potential reasons for this outcome. First, ONS images were acquired using a high-resolution linear array probe, providing superior image quality compared to MCASDI images obtained with a lower-frequency convex array probe. The latter is affected by far-field attenuation and poor sound transmission through the temporal window. Additionally, studies have shown that the sensitivity of MCASDI in detecting IH is constrained by its SNR or CNR²¹. Second, the predictive value of certain indices, such as the PI for IH, may decrease in cases of mild IH⁵. Cerebral blood flow autoregulation—mediated by factors such as elevated PCO₂, decreased PO₂²² and the accumulation of cellular metabolites²³—helps maintain MCASDI within normal limits. Consequently, this results in an overlap between normal and mild IH in the DTL process.

In clinical practice, we collect patient data on ONSD and TCCD, with image segmentation performed by experienced physicians. Radiomic and deep learning features are then extracted using PyRadiomics and deep learning models, and these features are integrated into the predictive model. Although the model’s efficiency is not yet sufficient for the direct diagnosis of early IH, it serves as a valuable tool for optimizing treatment strategies. On one hand, if the model predicts a high probability of early IH, it allows clinicians to promptly identify the underlying cause and implement appropriate measures to prevent brain herniation or irreversible damage. On the other hand, for patients initially predicted to have a high likelihood of IH, follow-up evaluations after treatment can help assess whether the risk has decreased, thereby aiding in the evaluation of treatment effectiveness. In summary, our model is expected to provide valuable guidance for personalized decision-making in the management of early IH.

This study has several limitations. First, its retrospective design may introduce potential selection bias; therefore, future research should validate the model’s feasibility through prospective studies. Second, another important limitation of our study is the random allocation of patients between two medical centers without implementing center-based stratification. This approach may introduce potential data leakage and limit our ability to assess the true generalizability across different institutional settings. Future studies will employ leave-one-center cross-validation to better evaluate the model’s generalizability. Third, discrepancies arose due to the subjective nature of boundary delineation during manual segmentation. Additionally, our DTL model did not show a significant advantage over traditional radiomics and clinical models, which may be attributed to the manual delineation of ROIs. Future research should explore the optimal size and range of ROIs to enhance the DTL model’s predictive capability. Fourth, unlike typical TBI cases, post-DC TBI patients do not strictly adhere to the Monro-Kellie doctrine²⁴,, and their ICP thresholds remain undefined. Therefore, additional parameters, such as surgical incision size, operation duration, and blood loss, may be necessary for more precise classification and evaluation. Fifth, this study lacked completely external validation using datasets from other institutions due to the limited availability of post-DC patients with invasive ICP monitoring. We recognize the importance of external validation for evaluating model generalizability, and future work will aim to include multicenter data for this purpose. Finally, this study focused exclusively on post-DC TBI patients and did not include other forms of IH unrelated to DC. Future studies should aim to extract features from different IH subtypes to develop tailored predictive models for each condition. Our findings underscore the potential of integrating DLR features into non-invasive ICP monitoring, particularly through high-quality ONS ultrasound images. Future research should focus on refining multimodal imaging approaches and expanding datasets to improve model robustness and clinical applicability.

Conclusion

Our study introduced and validated an integrated model that combines clinical-ultrasonic features, radiomics, and DTL features for the early detection of IH in post-DC TBI patients at different ICP thresholds using post-DC ultrasound images. This model presents a novel approach to precision medicine, with the potential to enhance clinical decision-making and improve patient outcomes during the critical period. Notably, concatenation-based feature pre-fusion of ONS and MCASDI images did not yield a substantial improvement in predictive performance, suggesting that ONS-derived features alone may be sufficient for early IH prediction.

Data availability

The datasets used and/or analysed during the current study are provided within the supplementary information files.

Abbreviations

ALARA:: As low as reasonably achievable
BMI:: Body mass index
CI:: Confidence interval
CNN:: Convolutional neural networks
CPP:: Cerebral perfusion pressure
DC:: Decompressive craniectomy
DCA:: Decision curve analysis
DICOM:: Digital imaging and communications in medicine
DTL:: Deep transfer learning
DLR:: Deep learning radiomics
EDV:: End-diastolic velocity (MCA)
EF:: Ejection fraction
GCS:: Glasgow coma scale
GLCM:: Gray-level co-occurrence matrix
GLRLM:: Gray-level run-length matrix
GLSZM:: Gray-level size zone matrix
GLDM:: Gray-level dependence matrix
ICC:: Intraclass correlation coefficient
ICP:: Intracranial pressure
IH:: Intracranial hypertension
KNN:: KNEAREST neighbors
ICPi:: Invasive ICP
LASSO:: Least absolute shrinkage and selection operator
Light GBM:: Light Gradient Boosting Machine
MCASDI:: Spectral doppler imaging of middle cerebral artery
MAP:: Mean arterial pressure
MCA:: Middle cerebral artery
MCA_mean:: Mean velocity (MCA)
MCAED:: End-dystolic velocity of MCA MCA: middle cerebral artery
MCAPS:: Peak systolic velocity of MCA
MCAPI:: Pulse index (middle cerebral artery)
NPV:: Negative predictive value
ONS:: Optic nerve sheath
ONSD:: Optic nerve sheath diameter
ORs:: Odds ratios
PaO₂ :: Oxygen partial pressure (Artery)
PaCO₂ :: Carbon dioxide partial pressure (Artery)
PACS:: Picture archiving and communication systems
PI:: Pulse index
PPV:: Positive predictive value
RF:: Random forest
ROC:: Receiver operating characteristic
ROI:: Region of interest
TBI:: Traumatic brain injury
TCD:: Transcranial Doppler
TCCD:: Transcranial color doppler
SNR:: Signal-to-noise ratio
CNR:: Contrast-to-noise ratio
PRF:: Pulse repetition frequency
FPS:: Frames per second

References

Raboel, P. H. et al. M,. Intracranial pressure monitoring: invasive versus non-invasive methods: a review. Crit. Care Res. Pract. 2012, 950393. (2012).
Stevens, A. R. et al. Optical pupillometry in traumatic brain injury: neurological pupil index and its relationship with intracranial pressure through significant event analysis. Brain Inj. 33 (8), 1032–1038 (2019).
Article CAS PubMed Google Scholar
Faria, B. C. D. et al. The use of noninvasive measurements of intracranial pressure in patients with traumatic brain injury: a narrative review. Arq. Neuropsiquiatr. 81 (6), 551–563 (2023).
Article PubMed PubMed Central Google Scholar
Harary, M., Dolmans, R. G. F. & Gormley, W. B. Intracranial pressure Monitoring-Review and avenues for development. Sens. (Basel), 18(2). (2018).
Rasulo, F. A. et al. Transcranial doppler as a screening test to exclude intracranial hypertension in brain-injured patients: the IMPRESSIT-2 prospective multicenter international study. Crit. Care. 26 (1), 110 (2022).
Article PubMed PubMed Central Google Scholar
Robba, C. Measuring optic nerve sheath diameter using ultrasonography for the detection of Non invasive intracranial pressure: what it is and what it is not. Arq. Neuropsiquiatr. 80 (6), 547–549 (2022).
Article PubMed PubMed Central Google Scholar
Robba, C. et al. Multimodal non-invasive assessment of intracranial hypertension: an observational study. Crit. Care. 24 (1), 379 (2020).
Article PubMed PubMed Central Google Scholar
Brain Trauma, F. et al. Guidelines for the management of severe traumatic brain injury. V. Deep vein thrombosis prophylaxis. J. Neurotrauma. 24 (Suppl 1), S32–S36 (2007).
Google Scholar
Demetriades, A. K. Intracranial pressure monitoring after primary decompressive craniectomy: is it useful? Acta Neurochir. (Wien). 159 (4), 623–624 (2017).
Article PubMed Google Scholar
Picetti, E. et al. Intracranial pressure monitoring after primary decompressive craniectomy in traumatic brain injury: a clinical study. Acta Neurochir. (Wien). 159 (4), 615–622 (2017).
Article PubMed Google Scholar
Chesnut, R. M. et al. A trial of intracranial-pressure monitoring in traumatic brain injury. N Engl. J. Med. 367 (26), 2471–2481 (2012).
Article CAS PubMed PubMed Central Google Scholar
Melhem, S., Shutter, L. & Kaynar, A. A trial of intracranial pressure monitoring in traumatic brain injury. Crit. Care. 18 (1), 302 (2014).
Article PubMed PubMed Central Google Scholar
Fu, Z. et al. Post-craniotomy intracranial pressure monitoring: a novel approach combining optic nerve sheath diameter ultrasonography and cervical-cerebral arterial ultrasound. Front. Neurol. 15, 1472494 (2024).
Article PubMed Google Scholar
Lambin, P. et al. Radiomics: the Bridge between medical imaging and personalized medicine. Nat. Rev. Clin. Oncol. 14 (12), 749–762 (2017).
Article PubMed Google Scholar
Song, J. et al. A new approach to predict Progression-free survival in stage IV EGFR-mutant NSCLC patients with EGFR-TKI therapy. Clin. Cancer Res. 24 (15), 3583–3592 (2018).
Article CAS PubMed Google Scholar
Dong, D. et al. Deep learning radiomic nomogram can predict the number of lymph node metastasis in locally advanced gastric cancer: an international multicenter study. Ann. Oncol. 31 (7), 912–920 (2020).
Article CAS PubMed Google Scholar
Jiang, Y. et al. Predicting peritoneal recurrence and disease-free survival from CT images in gastric cancer with multitask deep learning: a retrospective study. Lancet Digit. Health. 4 (5), e340–e50 (2022).
Article CAS PubMed Google Scholar
Fernando, S. M. et al. Diagnosis of elevated intracranial pressure in critically ill adults: systematic review and meta-analysis. BMJ 366, l4225 (2019).
Article PubMed PubMed Central Google Scholar
Wang, J. et al. Ultrasonographic optic nerve sheath diameter correlation with ICP and accuracy as a tool for noninvasive surrogate ICP measurement in patients with decompressive craniotomy. J. Neurosurg. 133 (2), 514–520 (2020).
Article PubMed Google Scholar
Xu, X. et al. Diagnostic value of the combination of ultrasonographic optic nerve sheath diameter and width of crural cistern with respect to the intracranial pressure in patients treated with decompressive craniotomy. Neurocrit Care. 39 (2), 436–444 (2023).
Article MathSciNet PubMed Google Scholar
Rondagh, M. et al. A comparison of ultrafast and conventional spectral doppler ultrasound to measure cerebral blood flow velocity during inguinal hernia repair in infants. J. Clin. Anesth. 92, 111312 (2024).
Article PubMed Google Scholar
Yoshihara, M., Bandoh, K. & Marmarou, A. Cerebrovascular carbon dioxide reactivity assessed by intracranial pressure dynamics in severely head injured patients. J. Neurosurg. 82 (3), 386–393 (1995).
Article CAS PubMed Google Scholar
Gordon, G. R. et al. Brain metabolism dictates the Polarity of astrocyte control over arterioles. Nature 456 (7223), 745–749 (2008).
Article ADS CAS PubMed PubMed Central Google Scholar
Benson, J. C. et al. The Monro-Kellie doctrine: A review and call for revision. AJNR Am. J. Neuroradiol. 44 (1), 2–6 (2023).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We express our gratitude to all colleagues who provided assistance during the course of this study.

Author information

Zunfeng Fu and Jing Wang have contributed equally to this work.

Authors and Affiliations

Department of Ultrasound, The Second Affiliated Hospital of Shandong First Medical University, No. 366 Taishan Street, Taishan District, Tai’an, 271000, Shandong Province, China
Zunfeng Fu, Jing Wang, Wenyi Shen, Jiajun Zhang, Yan Liu, Chongqiang Wang & Chunju Lv
Department of General Practice, The Second Affiliated Hospital of Shandong First Medical University, No. 366 Taishan Street, Taishan District, Tai’an, 271000, Shandong Province, China
Lin Peng
Department of Public Health Management, The Second Affiliated Hospital of Shandong, First Medical University, Tai’an, China
Weifu Zhang
Department of Ultrasound, The Affiliated Hospital of Qilu Medical University, Xintai, China
Yanqing Wu
College of Medical Imaging and Laboratory, Jining Medical University, Jining, China
Yanlin Shen
College of Radiology, Shandong First Medical University, Tai’an, China
Ye Zhu

Authors

Zunfeng Fu
View author publications
Search author on:PubMed Google Scholar
Jing Wang
View author publications
Search author on:PubMed Google Scholar
Wenyi Shen
View author publications
Search author on:PubMed Google Scholar
Yanqing Wu
View author publications
Search author on:PubMed Google Scholar
Jiajun Zhang
View author publications
Search author on:PubMed Google Scholar
Yan Liu
View author publications
Search author on:PubMed Google Scholar
Chongqiang Wang
View author publications
Search author on:PubMed Google Scholar
Yanlin Shen
View author publications
Search author on:PubMed Google Scholar
Ye Zhu
View author publications
Search author on:PubMed Google Scholar
Weifu Zhang
View author publications
Search author on:PubMed Google Scholar
Chunju Lv
View author publications
Search author on:PubMed Google Scholar
Lin Peng
View author publications
Search author on:PubMed Google Scholar

Contributions

ZF.F: Conceptualization, Methodology, Writing – original draft, Writing – review & editing. J.W: Conceptualization, Methodology, Writing – original draft, Writing – review & editing. WY.S: Methodology, Writing – original draft. YQ.W: Formal Analysis, Software, Writing – original draft. JJ. Z: Formal Analysis, Software, Writing – original draft. Y.L: Software, Writing – original draft. CQ.W: Software, Writing – original draft. YL.S: Software, Writing – original draft. Y.Z: Software, Writing – original draft. WF.Z: Investigation, Supervision, Writing – original draft. CJ.L: Conceptualization, Funding acquisition, Project administration, Writing – original draft, Writing – review & editing. L.P: Conceptualization, Project administration, Writing – original draft, Writing – review & editing.

Corresponding authors

Correspondence to Chunju Lv or Lin Peng.

Ethics declarations

Competing interests

The authors declare no competing interests.

Ethical approval and consent to participate

The study was conducted in accordance with the Declaration of Helsinki and approved by the Ethics Committee of the Second affiliated Hospital of Shandong First Medical University (2021-A-037).

Consent for publication

All authors agree to publish this paper.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Material 1

Supplementary Material 2

Supplementary Material 3

Supplementary Material 4

Supplementary Material 5

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

Fu, Z., Wang, J., Shen, W. et al. Multimodal nomogram integrating deep learning radiomics and hemodynamic parameters for early prediction of post-craniotomy intracranial hypertension. Sci Rep 15, 23595 (2025). https://doi.org/10.1038/s41598-025-09115-7

Download citation

Received: 10 April 2025
Accepted: 25 June 2025
Published: 02 July 2025
Version of record: 02 July 2025
DOI: https://doi.org/10.1038/s41598-025-09115-7

Subjects

Abstract

Similar content being viewed by others

Introduction

Materials and methods

Patient population

Treatment methods

US acquisition, semantic features, and segmentation

Radiomics and DTL feature extraction

Radiomics and DTL feature selection

Statistical analyses

Results

Patients’ population and clinical-ultrasonic characteristics

Feature selection and deep learning radiomics model construction

Clinical model and nomogram model construction

Comparison of the clinical model, DTL-radiomics model, and nomogram model

Discussion

Conclusion

Data availability

Abbreviations

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Ethical approval and consent to participate

Consent for publication

Additional information

Publisher’s note

Electronic supplementary material

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Quick links