Machine learning model to predict sepsis in ICU patients with intracerebral hemorrhage

Tang, Lei; Li, Ye; Zhang, Ji; Zhang, Feng; Tang, Qiaoling; Zhang, Xiangbin; Wang, Sai; Zhang, Yupeng; Ma, Siyuan; Liu, Ran; Chen, Lei; Ma, Junyi; Zou, Xuelun; Yao, Tianxing; Tang, Rongmei; Zhou, Huifang; Wu, Lianxu; Yi, Yexiang; Zeng, Yi; Wang, Duolao; Zhang, Le

doi:10.1038/s41598-025-99431-9

Download PDF

Article
Open access
Published: 10 May 2025

Machine learning model to predict sepsis in ICU patients with intracerebral hemorrhage

Lei Tang^{1,2,3,4,5,6,7},
Ye Li^{1,2,3,4,5,6,7},
Ji Zhang¹,
Feng Zhang^{1,2,3,4,5,6,7,8},
Qiaoling Tang^{1,2,3,4,5,6,7},
Xiangbin Zhang^{1,2,3,4,5,6,7},
Sai Wang^{1,2,3,4,5,6,7},
Yupeng Zhang^{1,2,3,4,5,6,7},
Siyuan Ma^{1,2,3,4,5,6,7},
Ran Liu^{1,2,3,4,5,6,7},
Lei Chen^{1,2,3,4,5,6,7},
Junyi Ma^{1,2,3,4,5,6,7},
Xuelun Zou^{1,2,3,4,5,6,7},
Tianxing Yao^{1,2,3,4,5,6,7},
Rongmei Tang^{1,2,3,4,5,6,7},
Huifang Zhou^{1,2,3,4,5,6,7},
Lianxu Wu^{1,2,3,4,5,6,7},
Yexiang Yi^{1,2,3,4,5,6,7},
Yi Zeng⁹,
Duolao Wang¹⁰ &
…
Le Zhang^{1,2,3,4,5,6,7}

Scientific Reports volume 15, Article number: 16326 (2025) Cite this article

7842 Accesses
9 Citations
4 Altmetric
Metrics details

Subjects

Abstract

Patients with intracerebral hemorrhage (ICH) are highly susceptible to sepsis. This study evaluates the efficacy of machine learning (ML) models in predicting sepsis risk in intensive care units (ICUs) patients with ICH. We conducted a retrospective analysis on ICH patients using the MIMIC-IV database, randomly dividing them into training and validation cohorts. We identified sepsis prognostic factors using Least Absolute Shrinkage and Selection Operator (LASSO) and backward stepwise logistic regression. Several machine learning algorithms were developed and assessed for predictive accuracy, with external validation performed using the eICU Collaborative Research Database (eICU-CRD). We analyzed 2,214 patients, including 1,550 in the training set, 664 in the validation set, and 513 for external validation using the eICU-CRD. The Random Forest (RF) model outperformed others, achieving Area Under the Curves (AUCs) of 0.912 in training, 0.832 in internal validation, and 0.798 in external validation. Neural Network and Logistic Regression models recorded training AUCs of 0.840 and 0.804, respectively. ML models, especially the RF model, effectively predict sepsis in ICU patients with ICH, enabling early identification and management of high-risk cases.

A predictive model for sepsis risk in patients with non-traumatic cerebral hemorrhage based on the MIMIC-IV database

Article Open access 09 July 2025

Development of a machine learning-based prediction model for sepsis-associated delirium in the intensive care unit

Article Open access 04 August 2023

Machine learning for the prediction of in-hospital mortality in patients with spontaneous intracerebral hemorrhage in intensive care unit

Article Open access 20 June 2024

Introduction

Intracerebral hemorrhage (ICH) represents about 15% of all stroke cases yet is associated with significant mortality, accounting for roughly 2.8 million deaths globally each year^1,2. Approximately 11–31% of ICH cases lead to infections and long-term disabilities^3,4, often exacerbating into sepsis due to immunosuppression-induced systemic inflammations and metabolic disorders^5,6.

In the intensive care unit (ICU), approximately 25% of ICH patients who develop sepsis succumb within 28 days. Furthermore, sepsis is strongly associated with worsened long-term functional outcomes. These findings highlight the critical importance of early detection and effective management of sepsis in ICH patients⁷. A retrospective cohort study revealed that about 28% of ICH patients would develop sepsis, which poses significant challenges in their clinical management⁸. Sepsis is associated with an increased risk of a systemic infectious encephalopathy called sepsis-associated encephalopathy (SAE)⁹, leading to in-hospital coma events and a higher risk of complications in ICH patients¹⁰.

The initial clinical manifestations of sepsis are not specific, and the disease progresses rapidly^11,12. Currently, there are no effective treatments, highlighting the significance of early detection and appropriate management to mitigate its impact^13,14,15. A comprehensive understanding of the etiology of sepsis after ICH is essential for improving targeted prevention and treatment. However, the underlying cause of sepsis has not been fully elucidated to date¹¹. This complicates the management of ICH-associated sepsis, posing challenges in reducing mortality rates and addressing cognitive complications. In this context, developing predictive models for early sepsis detection and risk factor identification is of great importance for improving early prevention strategies¹⁶.

In recent years, advancements in statistical theory and computer technology have propelled machine learning (ML) into the forefront of medical research, garnering significant attention from clinicians. ML techniques have outperformed traditional methods like logistic regression and Cox regression in disease prediction, as evidenced by comparative studies^17,18. Particularly, neural networks (NNs) have grown substantially in size and sophistication over the last decade, becoming leading tools in ML applications¹⁹. Among various algorithms, random forest (RF) and boosted trees with calibrated probabilities have demonstrated superior performance across multiple metrics²⁰. Previous research has also shown consistent strong performance of RF algorithms on various biomedical data sets^21,22. These algorithms offer several advantages, such as scalability to large data sets and greater robustness compared to other algorithm types²³. We can find considerable effort in the application of ML algorithms in sepsis prediction^24,25,26,27. Despite the demonstrated efficacy of ML in predicting various diseases, its application in predicting sepsis among patients with ICH remains underexplored, with limited data available. This study aims to address this gap by developing and validating multiple ML models to accurately predict the onset of sepsis in ICH patients, striving to determine the most effective model for clinical use.

Methods

Database

This study utilized clinical data from two sources: the Medical Information Mart for Intensive Care (MIMIC)-IV database (version 2.2), containing ICU patient records from Beth Israel Deaconess Medical Center between 2008 and 2019, and the eICU-CRD (Telehealth Intensive Care Unit Collaborative Research Database), a database of over 200,000 ICU admissions across 208 U.S. hospitals during 2014–2015, used for external validation. Data integrity and research compliance were ensured by author Y.L., who completed the National Institutes of Health’s “Protecting Human Research Participants” program (certification number:53244021). The study is registered at Clinical Trials.gov (NCT06326385).

Study population and definitions

Data were extracted using Structured Query Language in PostgreSQL (version 14.6) and the study adhered to STROCSS criteria. Diagnoses from both databases were identified via the International Classification of Diseases, Ninth Revision (ICD-9) and Tenth Revision (ICD-10) codes²⁸. Patients were included if they had a diagnosis of ICH and were admitted to the ICU. Exclusion criteria included pediatric patients (< 18 years), patients with sepsis diagnosed prior to ICU admission, and those with incomplete follow-up data. The primary outcome of this study was the occurrence of sepsis within 28 days after ICU admission, defined according to the Third International Consensus Definition (Sepsis-3) as suspected or confirmed infection and a Sequential Organ Failure Assessment (SOFA) score ≥ 2 points. For patients diagnosed with sepsis, the follow-up time was defined as the interval (in days) from ICU admission to the first documented diagnosis of sepsis. For non-sepsis patients, the follow-up time was defined as the duration from ICU admission to discharge.

Data collection

In this study, we extracted baseline patient data including age, sex, ethnicity, and Body Mass Index (BMI) from the database. ICU admission metrics such as Sequential Organ Failure Assessment (SOFA) and Glasgow Coma Scale (GCS) scores were collected, along with the use of mechanical ventilation (MV), continuous renal replacement therapy (CRRT), peripherally inserted central catheter (PICC), and intracranial pressure (ICP) monitoring. Patients were categorized into craniotomy, minimally invasive surgery (MIS), or non-surgical groups based on the type of cranial intervention received. Vital signs and comorbidities were also recorded at admission. Laboratory tests performed included White Blood Cell count (WBC), anion gap, creatinine, and more. Laboratory measurements taken after the diagnosis of sepsis were excluded from the analysis to avoid bias. The primary study outcome was the incidence of sepsis during the follow-up period.

Statistical analysis

The Shapiro–Wilk test was used to test the normality assumption. Continuous variables were summarized using median interquartile ranges and compared using the Kruskal-Wallis test. Categorical variables were summarized as numbers and percentages and compared using the chi-square test or Fisher’s exact test. Missing values were addressed through multiple imputation (missForest R), with variables missing over 25% transformed into dummy variables to reduce bias. The number and percentage of missing values for each variable have been provided in Supplementary Table 1. To evaluate the impact of sepsis on prognosis, survival curves were plotted to compare the outcomes of ICH patients who developed sepsis and those who did not. The impact of sepsis on mortality was assessed using the log-rank test.

We divided ICH patients into training and testing sets to identify significant predictors. Using the glmnet package, we performed Least Absolute Shrinkage and Selection Operator (LASSO) regression to select non-zero coefficient features, followed by stepwise logistic regression to determine significant variables at P < 0.05. Patients were randomly split into a 7:3 training-to-testing ratio using the caTools package. Logistic regression was used as a baseline model to compare its performance with machine learning approaches (RF and NN). To account for nonlinear effects in continuous variables, restricted cubic splines (RCS) were incorporated into the model using the rms package in R. Knot placement followed standard recommendations, with four knots.

Our analysis included logistic regression, RF, and NN, with the model’s performance assessed via the area under the receiver operating characteristics (ROC) curve using the pROC package. Decision Curve Analysis (DCA) was applied to establish the clinical utility of the models. The chosen model underwent five-fold cross-validation and was evaluated based on metrics such as the area under the ROC curve (AUC), sensitivity, specificity, recall, accuracy, and F1 score²⁹, ensuring robust validation of its predictive capabilities. To further enhance the interpretability of the optimal model, SHapley Additive exPlanations (SHAP) analysis was conducted, providing insights into the contribution of each feature to the predictions.

Results

Demographic and clinical characteristics of ICH patients

We analyzed 2,214 patients who fulfilled the inclusion and exclusion criteria in MIMIC-IV database, with 1550 patients in the training cohort and 664 in the validation cohort. The patient selection process is illustrated in Supplemental Fig. 1. The baseline characteristics of patients upon admission are shown in Table 1. The study found that 53.5% of the participants, totaling 1184 patients, were men. In the group diagnosed with ICH, 813 patients (36.7%) also experienced complications due to sepsis. The prevalence of hypertension was 63.2% in the training cohort and 60.1% in the internal validation cohort. Additionally, the heart disease was present in 46.0% of the training cohort and 46.7% of the internal validation cohort. Diabetes was found in 25.1% and 22.4% of the training and internal validation cohorts, respectively. Acute pneumonia was observed in 10.9% of the training cohort and 9.6% of the validation cohort. The GCS score was consistent across both cohorts at 14 [IQR 11–15]. WBC counts (109/l) averaged 10.25 [IQR 8.40–12.10] in the training cohort and 10.16 [IQR 8.40–12.00] in the internal validation cohort. Average chloride levels (mmol/L) were noted at 103.40 [IQR 102.00-105.13] mmol/L in the training cohort and 103.00 [IQR 101.00-105.25] in the internal validation cohort. Additionally, baseline characteristics of the study cohort grouped by the presence or absence of sepsis are provided in Supplement Table 2. Furthermore, we investigated the impact of sepsis on the prognosis of patients with ICH. The results revealed that sepsis was associated with a significantly higher mortality rate. The survival curve (Supplemental Fig. 2) demonstrates a significant difference between the two groups, with ICH patients who developed sepsis having a markedly higher risk of mortality compared to those without sepsis (log-rank test, p < 0.0001).

Table 1 Baseline characteristics of the study cohort (grouped by training and testing sets).

Full size table

Feature selection and nomogram construction

In our analysis of cohorts from the MIMIC-IV database, sepsis was identified in 813 of the total participants, representing 36.7% of the entire sample. This incidence rate was consistent across different study subsets, with 36.7% of both the training set (569 out of 1,551 patients) and the internal validation set (244 out of 664 patients) diagnosed with sepsis. To identify relevant variables, we used LASSO and backward stepwise logistic regression. The different mean square errors for different log(lambda) ranges are shown in Supplemental Fig. 3. Subsequently, RCS analysis was performed to explore the non-linear relationships of continuous variables in the model. The results demonstrated significant non-linear associations between sepsis and both WBC count (non-linear p < 0.0001) and chloride levels (non-linear p = 0.018), as shown in Supplemental Fig. 4.

Variables with non-zero coefficients from LASSO regression were further screened using stepwise logistic regression, which identified race, gender, acute pneumonia, fluid electrolyte disorders, heart diseases, liver diseases, renal failure, ICP monitoring, invasive ventilation, supplemental oxygen, GCS score, heart rate, chloride, and WBC as independent risk factors for sepsis in ICH patients. Patients undergoing ICP monitoring exhibited a significantly higher likelihood of developing sepsis, with odds 4.12 times greater than those not monitored (OR 4.12; 95% CI: 2.19, 8.00). Similarly, the presence of acute pneumonia in patients increases the odds of sepsis by 3.56 times compared to those without pneumonia (95% CI = 2.75, 4.63). Furthermore, patients with cerebral hemorrhage who also suffer from fluid and electrolyte disorders are 3.56 times more likely to develop sepsis than their counterparts without such disorders (95% CI = 2.75, 4.63). Detailed OR and 95% CI values are shown in Supplemental Table 3. These results were used to create a nomogram to estimate the odds of sepsis in ICU patients with ICH (Fig. 1). For instance, the male patient had liver disease, acute pneumonia, fluid electrolyte disorders, and underwent intracranial pressure monitoring, and invasive ventilation upon admission, with no supplemental oxygen upon admission, and a chloride level of 110 mmol/l. The sum of these points (68) is located on the total points axis. A line is then drawn downward on the axis to determine the probability of developing sepsis (> 90%). The feature importance ranking under the framework of RF algorithm is shown in Fig. 2.

Model performance comparisons and internal validation

We developed three ML models to predict the development of sepsis in patients with ICH after ICU admission. Figure 3 illustrates ROC curves measuring the discrimination of these models. In the training set, the RF model demonstrated the highest predictive performance for sepsis in ICH patients (AUC = 0.912, 95% CI: 0.898–0.927), followed by the NN model (AUC = 0.840, 95% CI: 0.820–0.861) and the LR model (AUC = 0.804, 95% CI: 0.781–0.827).

In internal validation, the RF model achieved the best performance with an AUC of 0.832 (95% CI: 0.801–0.864), outperforming the NN model (AUC = 0.811, 95% CI: 0.777–0.845) and the LR model (AUC = 0.799, 95% CI: 0.763–0.834) for predicting sepsis in patients with ICH. Detailed performance metrics for the three models are presented in Supplemental Table 4. Additionally, we incorporated RCS into the logistic regression model to account for potential nonlinear relationships between independent variables and sepsis risk. The inclusion of RCS terms improved the predictive performance of the LR model. As shown in Supplemental Fig. 5A, the AUC of the RCS-enhanced LR model increased to 0.812 (95% CI: 0.790–0.812) in the training set and 0.803 (95% CI: 0.768–0.803) in the validation set, compared to the original LR model without RCS terms (training set: AUC = 0.804, 95% CI: 0.781–0.827; validation set: AUC = 0.799, 95% CI: 0.763–0.834). Additionally, we incorporated interaction terms between variables significantly associated with the outcome to further enhance the LR model’s predictive performance. As shown in Supplemental Fig. 5B, the inclusion of interaction terms improved the AUC of the LR model to 0.822 (95% CI: 0.800–0.822) in the training set and 0.809 (95% CI: 0.775–0.809) in the validation set.

The RF model achieved the highest F1 scores in both the training set and the internal validation set (0.88 and 0.829, respectively), indicating it has the best discriminative ability. Sensitivity and specificity were also the most balanced. Taken together, the RF model is considered the optimal model. Finally, a DCA curve (Supplemental Fig. 6) demonstrated the RF model provided the highest net benefit and threshold probability, indicating its superior clinical utility compared with the other two models.

External validation

The RF model was validated in an external dataset from the eICU-CRD database using the same data extraction process as the derivation dataset. After screening, we ultimately selected 513 patients from the e-ICU for external validation. In the external validation dataset, the AUC was 0.798 (95% CI 0.606–0.99) (Fig. 4). This indicates that the RF model demonstrated good predictive performance in independent external populations. However, further clinical evaluation is required to assess the robustness of the RF model across different populations.

SHAP analysis

To enhance the interpretability of the model, we conducted SHAP analysis on the optimal model, a RF classifier. The SHAP summary plot (Fig. 5) highlights the most influential features contributing to the prediction of sepsis risk in patients with ICH. The analysis identified fluid electrolyte disorders, WBC count, Supplemental oxygen use, GCS score, and renal failure as the top five contributors to the model’s predictions. Among these, fluid electrolyte disorders and elevated WBC counts demonstrated the strongest association with an increased risk of sepsis, as evidenced by their high SHAP values.

Discussion

To the best of our knowledge, this study represents a novel exploration of sepsis in ICH patients using the MIMIC-IV and eICU-CRD public databases. Our study demonstrates that machine learning models, particularly random forest models, exhibit high accuracy in predicting the onset of sepsis in patients with cerebral hemorrhage, showcasing superior clinical utility compared to alternative models. Compared to the LR algorithm which requires manual selection of independent variables, potentially introducing complex nonlinear relationships and interactions between independent variables into the error of the model, the RF model has several advantages. It efficiently handles missing data and creates effective predictive models by combining weak predictors. Due to its excellent accuracy and performance, the RF algorithm has received increasing attention as a competing alternative to LR analysis for predicting adverse clinical events.

Patients with ICH face a heightened susceptibility to sepsis due to immunosuppression and gut microbiota dysbiosis¹⁶. The combination of ICH and sepsis presents a significant challenge for clinicians, contributing to increased mortality rates and cognitive complications. Both conditions share similar pathophysiological mechanisms involving systemic inflammation and circulatory disturbance, resulting in high mortality and morbidity^{30,31,32,33,34}. Timely risk assessment, prudent antibiotic therapy, and appropriate targeted therapy can reduce the incidence of sepsis³⁵, highlighting the importance of developing reliable predictive models for timely interventions and better management of high-risk patients.

Based on our study findings, we propose several early intervention strategies to mitigate sepsis risk in ICH patients. Integrating the RF model into ICU monitoring systems could enable real-time identification of high-risk patients, allowing for enhanced monitoring and timely interventions. Proactive infection control measures, such as early use of antibiotics, strict aseptic techniques during invasive procedures, and regular microbiological assessments, are crucial for reducing the risk of sepsis. Additionally, personalized interventions targeting key risk factors, including acute pneumonia, electrolyte imbalances, and renal failure, could further enhance patient care. Exploring immune-enhancing therapies to address immunosuppression in ICH patients also holds promise. These strategies underscore the critical importance of timely sepsis diagnosis, as early detection and intervention during the ICU stay can prevent the progression to severe sepsis or septic shock, ultimately optimizing patient care.

LASSO and multivariate logistic regression analysis identified nine clinical characteristic variables for assessing sepsis risk in ICH patients, including race, gender, acute pneumonia, fluid electrolyte disorders, heart disease, liver disease, renal failure, heart rate, GCS score, ICP monitoring, invasive ventilation, supplemental oxygen, chloride, and WBC. Liver disease has been consistently associated with an increased risk of infections, including sepsis, due to impaired synthetic function and detoxification processes^36,37. This predisposition is particularly pronounced in ICU settings where patients are more susceptible to systemic infections³⁸. Similarly, renal failure contributes to sepsis risk through mechanisms such as immune dysfunction and accumulation of uremic toxins, which impair host defenses^39,40,41,42. Furthermore, heart disease impacts sepsis risk through multiple pathways, including reduced tissue perfusion and altered hemodynamic responses, which can precipitate organ dysfunction in critically ill patients^43,44,45,46. In terms of invasive procedures, the use of invasive ventilation and ICP monitoring devices increases the risk of secondary infection, decreases hospital discharge rates, and raises mortality⁴⁷. Aziz et al.⁴⁸ found that invasive mechanical ventilation is identified as an independent predictor of mortality in ICU-treated adults with sepsis. In addition, acute pneumonia is a significant independent predictor of sepsis in patients with ICH. Research indicates that around 43% of individuals with ICH develop acute pneumonia⁴⁹, which is linked to a four-fold increased risk of complications^50,51,52. This aligns with the results of our study. Moreover, evidence has shown that elevated chloride levels independently correlate with mortality in ICU patients with ICH⁵³. When patients have severe sepsis and hyperchloremia, this condition is typically observed upon admission to the ICU. Further, hyperchloremic patients admitted to the ICU experienced elevated chloride levels even after 72 h of admission, and the exacerbation of hyperchloremia was independently associated with all-cause in-hospital mortality^54,55.

The present study has certain limitations, including its retrospective and observational nature, which may introduce selection bias. The data from the MIMIC-IV database come from a single center in the United States, potentially undermining the generalization to other populations. Therefore, external validation against different populations is required to accurately assess the model’s performance. It is important to note that the model should be used as a medical reference only, as other complex clinical factors should also be considered in treatment decisions. Nevertheless, the established model can assist clinicians in the timely management of high-risk ICH patients with sepsis in the ICU.

Conclusion

In conclusion, ML models could be reliable tools for predicting sepsis in ICH patients. Among all prediction models, the RF model proved to be most effective in providing early identification and timely intervention for high-risk ICH patients with sepsis, potentially mitigating disease progression.

Data availability

The raw data for this study were sourced from the MIMIC-IV and eICU-CRD databases, both of which are accessible to the public. Detailed data pertinent to this study can be made available upon request by reaching out to the corresponding author.

References

Watson, N., Bonsack, F. & Sukumari-Ramesh, S. Intracerebral hemorrhage: the effects of aging on brain injury. Front. Aging Neurosci. 14, 859067. https://doi.org/10.3389/fnagi.2022.859067 (2022).
Article CAS PubMed PubMed Central Google Scholar
Spontaneous Intracerebral Hemorrhage. N. Engl. J. Med. 388, 1440. https://doi.org/10.1056/NEJMx230001 (2023).
Ali, M., Lyden, P., Sacco, R. L., Shuaib, A. & Lees, K. R. Natural history of complications after intracerebral haemorrhage. Eur. J. Neurol. 16, 624–630. https://doi.org/10.1111/j.1468-1331.2009.02559.x (2009).
Article CAS PubMed Google Scholar
Lord, A. S. et al. Infection after intracerebral hemorrhage: risk factors and association with outcomes in the Ethnic/racial variations of intracerebral hemorrhage study. Stroke 45, 3535–3542. https://doi.org/10.1161/strokeaha.114.006435 (2014).
Article PubMed PubMed Central Google Scholar
Berger, B., Gumbinger, C., Steiner, T. & Sykora, M. Epidemiologic features, risk factors, and outcome of sepsis in stroke patients treated on a neurologic intensive care unit. J. Crit. Care. 29, 241–248. https://doi.org/10.1016/j.jcrc.2013.11.001 (2014).
Article PubMed Google Scholar
Cheng, Y. et al. Evaluation of intestinal injury, inflammatory response and oxidative stress following intracerebral hemorrhage in mice. Int. J. Mol. Med. 42, 2120–2128. https://doi.org/10.3892/ijmm.2018.3755 (2018).
Article CAS PubMed Google Scholar
Lin, J. et al. Impact and risk factors of sepsis on long-term outcomes after spontaneous intracerebral hemorrhage. Chin. Med. J. (Engl). 135, 1006–1008. https://doi.org/10.1097/cm9.0000000000001954 (2022).
Article CAS PubMed Google Scholar
Adam, N., Kandelman, S., Mantz, J., Chrétien, F. & Sharshar, T. Sepsis-induced brain dysfunction. Expert Rev. Anti Infect. Ther. 11, 211–221. https://doi.org/10.1586/eri.12.159 (2013).
Article CAS PubMed Google Scholar
Gofton, T. E. & Young, G. B. Sepsis-associated encephalopathy. Nat. Rev. Neurol. 8, 557–566. https://doi.org/10.1038/nrneurol.2012.183 (2012).
Article CAS PubMed Google Scholar
Wang, G. et al. [Sepsis associated encephalopathy is an independently risk factor for nosocomial coma in patients with supratentorial intracerebral hemorrhage: a retrospective cohort study of 261 patients]. Zhonghua Wei Zhong Bing Ji Jiu Yi Xue. 28, 723–728. https://doi.org/10.3760/cma.j.issn.2095-4352.2016.08.011 (2016).
Article PubMed Google Scholar
Singer, M. et al. The third international consensus definitions for Sepsis and septic shock (Sepsis-3). Jama 315, 801–810. https://doi.org/10.1001/jama.2016.0287 (2016).
Article CAS PubMed PubMed Central Google Scholar
Cecconi, M., Evans, L., Levy, M. & Rhodes, A. Sepsis and septic shock. Lancet 392, 75–87. https://doi.org/10.1016/s0140-6736(18)30696-2 (2018).
Article PubMed Google Scholar
Angus, D. C. et al. Epidemiology of severe sepsis in the united States: analysis of incidence, outcome, and associated costs of care. Crit. Care Med. 29, 1303–1310. https://doi.org/10.1097/00003246-200107000-00002 (2001).
Article CAS PubMed Google Scholar
Martin, G. S., Mannino, D. M., Eaton, S. & Moss, M. The epidemiology of sepsis in the united States from 1979 through 2000. N. Engl. J. Med. 348, 1546–1554. https://doi.org/10.1056/NEJMoa022139 (2003).
Article PubMed Google Scholar
Thompson, K., Venkatesh, B. & Finfer, S. Sepsis and septic shock: current approaches to management. Intern. Med. J. 49, 160–170. https://doi.org/10.1111/imj.14199 (2019).
Article PubMed Google Scholar
Lin, J., Tan, B., Li, Y., Feng, H. & Chen, Y. Sepsis-Exacerbated brain dysfunction after intracerebral hemorrhage. Front. Cell. Neurosci. 15, 819182. https://doi.org/10.3389/fncel.2021.819182 (2021).
Article CAS PubMed Google Scholar
Hou, N. et al. Predicting 30-days mortality for MIMIC-III patients with sepsis-3: a machine learning approach using XGboost. J. Transl. Med. 18, 462. https://doi.org/10.1186/s12967-020-02620-5 (2020).
Article CAS PubMed PubMed Central Google Scholar
Du, M., Haag, D. G., Lynch, J. W. & Mittinty, M. N. Comparison of the Tree-Based machine learning algorithms to Cox regression in predicting the survival of oral and pharyngeal cancers: analyses based on SEER database. Cancers (Basel). 12 https://doi.org/10.3390/cancers12102802 (2020).
Yu, J. R. et al. Energy efficiency of inference algorithms for clinical laboratory data sets: green artificial intelligence study. J. Med. Internet Res. 24, e28036. https://doi.org/10.2196/28036 (2022).
Article PubMed PubMed Central Google Scholar
R, C. & A, N.-M. in 23rd International Conference on Machine Learning. Pittsburgh, PA, (2006).
Zhang, Y. et al. Empirical study of seven data mining algorithms on different characteristics of datasets for biomedical classification applications. Biomed. Eng. 16, 125. https://doi.org/10.1186/s12938-017-0416-x (2017).
Article PubMed PubMed Central Google Scholar
Harper, P. R. A review and comparison of classification algorithms for medical decision making. Health Policy. 71, 315–331. https://doi.org/10.1016/j.healthpol.2004.05.002 (2005).
Article PubMed Google Scholar
Murphy, K. P. Machine Learning: a Probabilistic Perspective (MIT Press, 2012).
Kam, H. J. & Kim, H. Y. Learning representations for the early detection of sepsis with deep neural networks. Comput. Biol. Med. 89, 248–255. https://doi.org/10.1016/j.compbiomed.2017.08.015 (2017).
Article PubMed Google Scholar
Scherpf, M., Gräßer, F., Malberg, H. & Zaunseder, S. Predicting sepsis with a recurrent neural network using the MIMIC III database. Comput. Biol. Med. 113, 103395. https://doi.org/10.1016/j.compbiomed.2019.103395 (2019).
Article PubMed Google Scholar
Van Steenkiste, T. et al. Accurate prediction of blood culture outcome in the intensive care unit using long short-term memory neural networks. Artif. Intell. Med. 97, 38–43. https://doi.org/10.1016/j.artmed.2018.10.008 (2019).
Article PubMed Google Scholar
Aşuroğlu, T. & Oğul, H. A deep learning approach for sepsis monitoring via severity score Estimation. Comput. Methods Progr. Biomed. 198, 105816. https://doi.org/10.1016/j.cmpb.2020.105816 (2021).
Article PubMed Google Scholar
Mathew, G. et al. STROCSS 2021: strengthening the reporting of cohort, cross-sectional and case-control studies in surgery. Int. J. Surg. 96, 106165. https://doi.org/10.1016/j.ijsu.2021.106165 (2021).
Article PubMed Google Scholar
Yacouby, R. & Axman, D. Probabilistic Extension of Precision, Recall, and F1 Score for More Thorough Evaluation of Classification Models. (2020).
Fu, Y., Liu, Q., Anrather, J. & Shi, F. D. Immune interventions in stroke. Nat. Rev. Neurol. 11, 524–535. https://doi.org/10.1038/nrneurol.2015.144 (2015).
Article CAS PubMed PubMed Central Google Scholar
Westendorp, W. F., Nederkoorn, P. J., Vermeij, J. D., Dijkgraaf, M. G. & van de Beek, D. Post-stroke infection: a systematic review and meta-analysis. BMC Neurol. 11, 110. https://doi.org/10.1186/1471-2377-11-110 (2011).
Article PubMed PubMed Central Google Scholar
Singer, B. H. et al. Bacterial dissemination to the brain in Sepsis. Am. J. Respir. Crit. Care Med. 197, 747–756. https://doi.org/10.1164/rccm.201708-1559OC (2018).
Article CAS PubMed PubMed Central Google Scholar
Kong, Y. & Le, Y. Toll-like receptors in inflammation of the central nervous system. Int. Immunopharmacol. 11, 1407–1414. https://doi.org/10.1016/j.intimp.2011.04.025 (2011).
Article CAS PubMed Google Scholar
Kodali, M. C., Chen, H. & Liao, F. F. Temporal unsnarling of brain’s acute neuroinflammatory transcriptional profiles reveals panendothelitis as the earliest event preceding microgliosis. Mol. Psychiatry. 26, 3905–3919. https://doi.org/10.1038/s41380-020-00955-5 (2021).
Article CAS PubMed Google Scholar
Evans, L. et al. Surviving sepsis campaign: international guidelines for management of sepsis and septic shock 2021. Intensiv. Care Med. 47, 1181–1247. https://doi.org/10.1007/s00134-021-06506-y (2021).
Article PubMed PubMed Central Google Scholar
Bellot, P., Francés, R. & Such, J. Pathological bacterial translocation in cirrhosis: pathophysiology, diagnosis and clinical implications. Liver Int. 33, 31–39. https://doi.org/10.1111/liv.12021 (2013).
Article CAS PubMed Google Scholar
Bartoletti, M. et al. Epidemiology and outcomes of bloodstream infection in patients with cirrhosis. J. Hepatol. 61, 51–58. https://doi.org/10.1016/j.jhep.2014.03.021 (2014).
Article PubMed Google Scholar
Gustot, T. et al. Impact of infection on the prognosis of critically ill cirrhotic patients: results from a large worldwide study. Liver Int. 34, 1496–1503. https://doi.org/10.1111/liv.12520 (2014).
Article PubMed Google Scholar
Schrier, R. W. & Wang, W. Acute renal failure and sepsis. N. Engl. J. Med. 351, 159–169. https://doi.org/10.1056/NEJMra032401 (2004).
Article CAS PubMed Google Scholar
Takasu, O. et al. Mechanisms of cardiac and renal dysfunction in patients dying of sepsis. Am. J. Respir. Crit. Care Med. 187, 509–517. https://doi.org/10.1164/rccm.201211-1983OC (2013).
Article CAS PubMed PubMed Central Google Scholar
Ergin, B., Kapucu, A., Demirci-Tansel, C. & Ince, C. The renal microcirculation in sepsis. Nephrol. Dial. Transpl. 30, 169–177. https://doi.org/10.1093/ndt/gfu105 (2015).
Article CAS Google Scholar
Schor, N. Acute renal failure and the sepsis syndrome. Kidney Int. 61, 764–776. https://doi.org/10.1046/j.1523-1755.2002.00178.x (2002).
Article PubMed Google Scholar
Laupland, K. B., Pasquill, K., Steele, L. & Parfitt, E. C. Burden of bloodstream infection in older persons: a population-based study. BMC Geriatr. 21, 31. https://doi.org/10.1186/s12877-020-01984-z (2021).
Article PubMed PubMed Central Google Scholar
Leng, Y. et al. Sepsis as an independent risk factor in atrial fibrillation and cardioembolic stroke. Front. Endocrinol. (Lausanne). 14, 1056274. https://doi.org/10.3389/fendo.2023.1056274 (2023).
Article PubMed Google Scholar
Walker, A. M. N. et al. Prevalence and predictors of Sepsis death in patients with chronic heart failure and reduced left ventricular ejection fraction. J. Am. Heart Assoc. 7, e009684. https://doi.org/10.1161/jaha.118.009684 (2018).
Article CAS PubMed PubMed Central Google Scholar
Shao, I. Y., Elkind, M. S. V. & Boehme, A. K. Risk factors for stroke in patients with Sepsis and bloodstream infections. Stroke 50, 1046–1051. https://doi.org/10.1161/strokeaha.118.023443 (2019).
Article PubMed PubMed Central Google Scholar
Zhang, H. et al. Risks and features of secondary infections in severe and critical ill COVID-19 patients. Emerg. Microbes Infect. 9, 1958–1964. https://doi.org/10.1080/22221751.2020.1812437 (2020).
Article CAS PubMed PubMed Central Google Scholar
Mohamed, A. K. S., Mehta, A. A. & James, P. Predictors of mortality of severe sepsis among adult patients in the medical intensive care unit. Lung India. 34, 330–335. https://doi.org/10.4103/lungindia.lungindia_54_16 (2017).
Article PubMed PubMed Central Google Scholar
Hannawi, Y., Hannawi, B., Rao, C. P., Suarez, J. I. & Bershad, E. M. Stroke-associated pneumonia: major advances and Obstacles. Cerebrovasc. Dis. 35, 430–443. https://doi.org/10.1159/000350199 (2013).
Article CAS PubMed Google Scholar
Koennecke, H. C. et al. Factors influencing in-hospital mortality and morbidity in patients treated on a stroke unit. Neurology 77, 965–972. https://doi.org/10.1212/WNL.0b013e31822dc795 (2011).
Article PubMed Google Scholar
Teh, W. H. et al. Impact of stroke-associated pneumonia on mortality, length of hospitalization, and functional outcome. Acta Neurol. Scand. 138, 293–300. https://doi.org/10.1111/ane.12956 (2018).
Article CAS PubMed Google Scholar
Wilson, R. D. Mortality and cost of pneumonia after stroke for different risk groups. J. Stroke Cerebrovasc. Dis. 21, 61–67. https://doi.org/10.1016/j.jstrokecerebrovasdis.2010.05.002 (2012).
Article PubMed Google Scholar
Qureshi, A. I. et al. Early hyperchloremia is independently associated with death or disability in patients with intracerebral hemorrhage. Neurocrit Care. 37, 487–496. https://doi.org/10.1007/s12028-022-01514-2 (2022).
Article CAS PubMed Google Scholar
Ditch, K. L., Flahive, J. M., West, A. M., Osgood, M. L. & Muehlschlegel, S. Hyperchloremia, not concomitant hypernatremia, independently predicts early mortality in critically ill Moderate-Severe traumatic brain injury patients. Neurocrit. Care. 33, 533–541. https://doi.org/10.1007/s12028-020-00928-0 (2020).
Article CAS PubMed Google Scholar
Zhou, D. et al. Increase in chloride from baseline is independently associated with mortality in intracerebral hemorrhage patients admitted to intensive care unit: A retrospective study. J. Intensive Med. 2, 274–281. https://doi.org/10.1016/j.jointm.2022.04.002 (2022).
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

The authors gratefully acknowledge the MIMIC-IV and eICU-CRD programs for open access to their database.

Funding

This study was supported by The National Science & Technology Fundamental Resources Investigation Program of China to L.Z. (No.2018FY100900), Central South University Research Programme of Advanced Interdisciplinary Projects in Changsha Studies to L.Z. (No.2023QYJC011) , The Hunan Provincial Natural Science Foundation of China Grant to L.Z. (No.2023JJ60144) and Y.Z. (No.2021JJ30923), The Provincial Science and Technology Innovation Leading Talents Project to L.Z.(No.2021RC4014). Major Science and Technology Projects in Changsha to L.Z. (No.kq2301008), The Changsha Municipal Natural Science Foundation to L.T. (kq2202022), The Hunan Provincial Natural Science Foundation to L.T. (2023JJ60383), Hunan Provincial Health High-Level Talent Scientific Research Project to L.Z. (No.R2023069), The Major Basic Research Projects in Hunan Province to L.Z. (No.2024JC0004), The National Natural Science Foundation of China to L.Z. (No.82471364).

Author information

Authors and Affiliations

Department of Neurology, Xiangya Hospital, Central South University, Jiangxi, Nanchang, 330006, Jiangxi, China
Lei Tang, Ye Li, Ji Zhang, Feng Zhang, Qiaoling Tang, Xiangbin Zhang, Sai Wang, Yupeng Zhang, Siyuan Ma, Ran Liu, Lei Chen, Junyi Ma, Xuelun Zou, Tianxing Yao, Rongmei Tang, Huifang Zhou, Lianxu Wu, Yexiang Yi & Le Zhang
Department of Neurology, Xiangya Hospital, Central South University, Changsha, 410008, Hunan, China
Lei Tang, Ye Li, Feng Zhang, Qiaoling Tang, Xiangbin Zhang, Sai Wang, Yupeng Zhang, Siyuan Ma, Ran Liu, Lei Chen, Junyi Ma, Xuelun Zou, Tianxing Yao, Rongmei Tang, Huifang Zhou, Lianxu Wu, Yexiang Yi & Le Zhang
Multi-Modal Monitoring Technology for Severe Cerebrovascular Disease of Human Engineering Research Center, Changsha, Hunan, China
Lei Tang, Ye Li, Feng Zhang, Qiaoling Tang, Xiangbin Zhang, Sai Wang, Yupeng Zhang, Siyuan Ma, Ran Liu, Lei Chen, Junyi Ma, Xuelun Zou, Tianxing Yao, Rongmei Tang, Huifang Zhou, Lianxu Wu, Yexiang Yi & Le Zhang
Brain Health Center of Hunan Province, Changsha, Hunan, China
Lei Tang, Ye Li, Feng Zhang, Qiaoling Tang, Xiangbin Zhang, Sai Wang, Yupeng Zhang, Siyuan Ma, Ran Liu, Lei Chen, Junyi Ma, Xuelun Zou, Tianxing Yao, Rongmei Tang, Huifang Zhou, Lianxu Wu, Yexiang Yi & Le Zhang
Human Brain Disease Biological Resources Platform of Hunan Province, Changsha, Hunan, China
Lei Tang, Ye Li, Feng Zhang, Qiaoling Tang, Xiangbin Zhang, Sai Wang, Yupeng Zhang, Siyuan Ma, Ran Liu, Lei Chen, Junyi Ma, Xuelun Zou, Tianxing Yao, Rongmei Tang, Huifang Zhou, Lianxu Wu, Yexiang Yi & Le Zhang
National Clinical Research Center for Geriatric Disorders, Xiangya Hospital, Central South University, Changsha, Hunan, China
Lei Tang, Ye Li, Feng Zhang, Qiaoling Tang, Xiangbin Zhang, Sai Wang, Yupeng Zhang, Siyuan Ma, Ran Liu, Lei Chen, Junyi Ma, Xuelun Zou, Tianxing Yao, Rongmei Tang, Huifang Zhou, Lianxu Wu, Yexiang Yi & Le Zhang
FuRong Laboratory, Changsha, 410078, Hunan, China
Lei Tang, Ye Li, Feng Zhang, Qiaoling Tang, Xiangbin Zhang, Sai Wang, Yupeng Zhang, Siyuan Ma, Ran Liu, Lei Chen, Junyi Ma, Xuelun Zou, Tianxing Yao, Rongmei Tang, Huifang Zhou, Lianxu Wu, Yexiang Yi & Le Zhang
Department of Cardiovascular Medicine, Xiangya Hospital, Central South University, Changsha, 410008, Hunan, China
Feng Zhang
Department of Geriatrics, Second Xiangya Hospital, Central South University, Changsha, 410011, Hunan, China
Yi Zeng
Department of Clinical Sciences, Liverpool School of Tropical Medicine, Liverpool, UK
Duolao Wang

Authors

Lei Tang
View author publications
Search author on:PubMed Google Scholar
Ye Li
View author publications
Search author on:PubMed Google Scholar
Ji Zhang
View author publications
Search author on:PubMed Google Scholar
Feng Zhang
View author publications
Search author on:PubMed Google Scholar
Qiaoling Tang
View author publications
Search author on:PubMed Google Scholar
Xiangbin Zhang
View author publications
Search author on:PubMed Google Scholar
Sai Wang
View author publications
Search author on:PubMed Google Scholar
Yupeng Zhang
View author publications
Search author on:PubMed Google Scholar
Siyuan Ma
View author publications
Search author on:PubMed Google Scholar
Ran Liu
View author publications
Search author on:PubMed Google Scholar
Lei Chen
View author publications
Search author on:PubMed Google Scholar
Junyi Ma
View author publications
Search author on:PubMed Google Scholar
Xuelun Zou
View author publications
Search author on:PubMed Google Scholar
Tianxing Yao
View author publications
Search author on:PubMed Google Scholar
Rongmei Tang
View author publications
Search author on:PubMed Google Scholar
Huifang Zhou
View author publications
Search author on:PubMed Google Scholar
Lianxu Wu
View author publications
Search author on:PubMed Google Scholar
Yexiang Yi
View author publications
Search author on:PubMed Google Scholar
Yi Zeng
View author publications
Search author on:PubMed Google Scholar
Duolao Wang
View author publications
Search author on:PubMed Google Scholar
Le Zhang
View author publications
Search author on:PubMed Google Scholar

Contributions

LT and LZ conceived and designed the study. YL, LT, JZ and QT collected the primary data. FZ, SM, and RL performed data analysis. XZ (Xiangbin Zhang) and SW developed analytical tools. YZ (Yupeng Zhang), LC, and JM contributed to data interpretation. XZ (Xuelun Zou), TY, and RT created visualizations and figures. YY and YZ (Yi Zeng) validated the methodology and findings. YL drafted the manuscript with input from all authors. LZ and DW supervised the study and critically revised the manuscript. All authors read and approved the final version of the manuscript.

Corresponding authors

Correspondence to Duolao Wang or Le Zhang.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Material 1 (download DOCX )

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

Tang, L., Li, Y., Zhang, J. et al. Machine learning model to predict sepsis in ICU patients with intracerebral hemorrhage. Sci Rep 15, 16326 (2025). https://doi.org/10.1038/s41598-025-99431-9

Download citation

Received: 15 October 2024
Accepted: 21 April 2025
Published: 10 May 2025
Version of record: 10 May 2025
DOI: https://doi.org/10.1038/s41598-025-99431-9

Keywords

This article is cited by

Hybrid phenotype-guided modeling across algorithm–feature regimes with application to ICU mortality prediction for Acinetobacter baumannii
- Abdelrahman Eid
- Rasha Khayyat
- Hadi Rabee
Scientific Reports (2026)