Introduction

Gastrointestinal bleeding (GIB) is a common and potentially life-threatening emergency in clinical practice; patients frequently present with symptoms such as hematemesis, melena, or hematochezia1. Despite substantial advances in pharmacotherapy, endoscopic techniques, and interventional treatments, rapid and accurate risk stratification remains essential to optimize outcomes and reduce mortality rates2.

Current risk stratification tools typically address upper (UGIB) and lower gastrointestinal bleeding (LGIB) separately3. Regarding UGIB, scoring systems such as the Glasgow–Blatchford, Rockall, and AIMS65 scores have been validated to predict the need for clinical intervention, hospitalization, or suitability for outpatient management4,5,6. However, these systems demonstrate only moderate accuracy for in-hospital mortality prediction, with areas under the receiver operating characteristic curve (AUROC) generally below 0.807. Similarly, tools such as the BLEED score for LGIB show suboptimal predictive accuracy for mortality, with reported AUROCs typically ranging from 0.66 to 0.738,9,10. In clinical practice, distinguishing between UGIB and LGIB at initial presentation is frequently challenging, limiting the clinical utility of these site-specific scores8. Notably, approximately 35% of emergency GIB patients have indeterminate bleeding sites upon initial assessment, further complicating early risk stratification and management8,11. Consequently, there is an unmet need for a more accurate and broadly applicable risk prediction tool that is effective regardless of the bleeding location12.

Nomograms are graphical statistical tools that integrate multiple clinical predictors and have recently emerged as valuable instruments for individualized risk prediction across various medical disciplines13. Their clinical utility derives from the ability to incorporate diverse, patient-specific variables, generating precise and individualized risk estimates. Recent studies have highlighted the successful application of nomograms in emergency and critical care, demonstrating superiority over traditional scoring systems in certain settings. For example, Sharif et al. developed a nomogram for ICU disposition in acute clozapine intoxication14. Lashin et al. constructed a nomogram predicting the need for mechanical ventilation in acutely intoxicated patients15. Moreover, Heba I. Lashin et al. validated a nomogram predicting adverse cardiovascular events in acute cardiotoxic poisoning, further demonstrating the method’s versatility and clinical significance16.

Given the limitations of existing scoring systems and the demonstrated advantages of nomogram-based modeling, we conducted this multicenter retrospective study to develop and validate a nomogram for accurately predicting in-hospital mortality risk in emergency GIB patients, irrespective of bleeding site. The primary objective was to facilitate rapid risk stratification and support clinical decision-making in the emergency department (ED).Moreover, major guidelines recommend the use of validated risk-stratification tools at ED presentation-for UGIB17, pre-endoscopic assessment with the Glasgow-Blatchford score; for LGIB18, the Oakland score-to aid early decision‑making, while acknowledging evidence and implementation limitations. These gaps, together with frequent uncertainty about bleeding site at first contact, highlight the need for practical and generalizable models applicable across GIB presentations. Accordingly, we developed and internally validated a site-agnostic, ED-ready nomogram to estimate in-hospital mortality using variables available at presentation.

Research methods

Study design

This retrospective cohort study involved patients with gastrointestinal bleeding who visited the ED and were subsequently admitted between January 1, 2023, and December 31, 2023. The study was approved by the Ethics Committee of Wuhan Central Hospital (Approval No. WHZXKYL2024-117). Due to the retrospective nature of the study, informed consent was waived by the Ethics Committee of Wuhan Central Hospital. This study was performed in accordance with the principles outlined in the Declaration of Helsinki.

Study participants

Patients meeting the following criteria were included in the study: (1) aged 18 years or older at the time of ED presentation; (2) presented to the ED with symptoms suggestive of gastrointestinal bleeding (such as hematemesis, melena, hematochezia, or coffee-ground vomiting); (3) admitted to the hospital from the ED for inpatient care; and (4) discharged with a final diagnosis of gastrointestinal bleeding confirmed by the attending physician. The exclusion criteria were as follows: (1) incomplete or missing key clinical data necessary for analysis; (2) gastrointestinal bleeding secondary to trauma or postoperative complications; and (3) declined or withdrew from inpatient treatment during hospitalization.

Data collection

Data were collected from the hospital information systems of Wuhan Central Hospital: Nanjing Road Branch, Houhu Branch, and Yangchunhu Branch, from January to December 2023. The emergency triage system included triage information such as the patient’s name, sex, age, chief complaint, diagnosis, mode of arrival, time of arrival, triage level, vital signs, and admitting department. The Hospital Information System contained the patient’s basic information, such as length of stay, surgical procedures, hemostatic interventions and their timing, the types of comorbidities, and discharge status. The patient’s visit number was used to link the data across different systems. The visit number represents the index for a single hospital visit during the patient’s stay. The final baseline data were matched using the data closest to the visit time. In this study, hemostatic procedures included any endoscopic, interventional radiology, or surgical intervention performed to achieve hemostasis for gastrointestinal bleeding during the entire hospitalization following ED admission.

Statistical methods

Data analysis, model construction, and validation were conducted as described by Wang S, Tu J (2020)19. Descriptive statistics were used to summarize the patients’ baseline characteristics. The Kolmogorov–Smirnov test was used to assess the normality of the data. Normally distributed continuous variables are expressed as the mean ± standard deviation (x̄ ± s) and compared using an independent samples t test. Non-normally distributed continuous variables are expressed as medians (Q1, Q3) and compared using the Wilcoxon rank-sum test. Categorical variables are expressed as frequencies and percentages, and comparisons were made using the χ2 test or Fisher’s exact test. Least absolute shrinkage and selection operator (LASSO) regression, suitable for high-dimensional data20, was used for predictor selection and regularization. Logistic regression analysis was employed to develop a predictive model for the risk of in-hospital mortality among patients with gastrointestinal bleeding admitted through the ED, and a nomogram was constructed according to this model. The model’s discriminative performance was evaluated by calculating the area under the curve (AUC), with internal validation performed using 1,000 bootstrap resamples. For further validation, patients were randomly divided into training (70%) and validation (30%) sets. The DeLong method was used to compare the statistical differences in AUCs. The clinical utility of the model was assessed using decision curve analysis (DCA) to quantify the net benefit at different probability thresholds21. Statistical analyses were conducted using EmpowerStats (www.empowerstats.com) and R version 4.2.2 (www.r-project.org). A p value < 0.05 was considered to indicate significance.

Sample size and model complexity

Because this was a retrospective cohort, we included all consecutive eligible patients during the study period (N = 847). A formal a priori sample-ize calculation was not performed. To ensure adequate model development, we limited model complexity and quantified optimism. We targeted ≥ 10 events per effective22,23 parameter given 75 outcome events, and fitted a penalized multivariable logistic regression, with 1,000‑bootstrap internal validation to estimate optimism-corrected performance (AUROC and calibration slope/intercept). In the full cohort, the events-per-parameter for the final model was 15.0; after the 70/30 split used for secondary validation, the training set contained≈52.5 events, corresponding to≈10.5 events-per-parameter. These values are compatible with stable estimation. Details of discrimination and calibration are reported in the Results.

Results

A total of 847 patients participated in this study. The flow of information collection from the target population is shown in Fig. 1. Among these patients, 75 (8.85%) died during their hospitalization after being admitted to the ED. In-hospital deaths were more frequent among older patients (median age 73 years versus 65.5 years among survivors; p < 0.001). Patients who died had lower systolic and diastolic blood pressure, higher heart rates, and higher shock indices per initial vital sign monitoring upon ED admission (p < 0.001). These patients were also more likely to arrive by ambulance (p < 0.001) and to be classified as Emergency Severity Index (ESI) level 1 (p < 0.001). Additionally, the incidence of malignancy was higher among those who died (p < 0.001), and they underwent fewer surgical (p = 0.003) and hemostatic (p < 0.001) procedures. These findings suggest that age, vital signs, mode of arrival, triage level, presence of malignancy, and types of clinical interventions are important factors associated with in-hospital mortality in this patient population. The demographic and clinical characteristics of the study participants are shown in Table 1.

Fig. 1
figure 1

Flow diagram of patient selection for the study.

Table 1 Baseline demographic and clinical characteristics of the study participants.

Of the 21 variables collected from patients, five were selected according to non-zero coefficients in the LASSO regression analysis (Fig. 2): ambulance ED arrival, shock index > 1, admission to ICU, malignancy, and hemostatic procedure. Multivariable logistic regression analysis was performed on these five variables using LASSO regression techniques to develop a predictive model for in-hospital mortality among patients with gastrointestinal bleeding. We constructed a mortality risk prediction model using the aforementioned five predictors (Table 2). The dataset was split into a training set (70%) and a validation set (30%) to assess the model’s generalizability. In the training set, the model’s AUC was 0.8619, with a confidence interval (CI) of [0.7858, 0.9381]; in the validation set, the AUC was 0.8455, with a 95% CI of [0.7871, 0.9039]. These results indicate that the model maintains high and consistent performance across different datasets, demonstrating good statistical stability (Fig. 3). A nomogram was constructed using the training data to visually represent the model’s predictive capability (Fig. 4). To further assess model calibration, calibration curves were plotted for both the training and validation sets (Fig. 5). In the training set, the calibration curve’s slope was 1.02 with an intercept of − 0.03, indicating a high agreement between predicted and observed probabilities. In the validation set, the calibration curve’s slope was 1.05 with an intercept of − 0.05, also showing good calibration. Finally, DCA was used to evaluate the clinical utility of the model (Fig. 6). In the training set, DCA results showed that when the threshold probability was between 0.1 and 0.9, our model significantly improved net benefit compared with the strategies of treating all or none. Specifically, at a threshold probability of 0.1, the model’s net benefit was 0.741 higher than the treat-all strategy and 74.10% higher than the treat-none strategy. In the validation set, DCA similarly indicated that our model outperformed the treat-all and treat-none strategies when the threshold probability was between 0.1 and 0.9. At a threshold probability of 0.1, the model’s net benefit was 0.814 higher than the treat-all strategy and 81.4% higher than the treat-none strategy. These data suggest that our model performs well statistically and may have potential clinical utility in clinical decision-making; however, prospective evaluation is required to confirm impact on outcomes.

Fig. 2
figure 2

Predictor selection using LASSO regression with tenfold cross-validation. (A) Selection of tuning parameter (lambda) by minimum and 1-SE criteria. (B) Coefficient profile against log(lambda). Five nonzero predictors were retained. LASSO, least absolute shrinkage and selection operator; SE, standard error.

Table 2 Logistic regression model and the Odds ratio of predictors.
Fig. 3
figure 3

ROC curves for mortality prediction in the training (A) and validation (B) cohorts.

Fig. 4
figure 4

Nomogram for predicting in-hospital mortality in ED patients with GIB.

Fig. 5
figure 5

Calibration curves for the nomogram in the training (A) and validation (B) cohorts.

Fig. 6
figure 6

Decision curve analysis (DCA) of the nomogram in the training (A) and validation (B) cohorts.

Discussion

In this study, a nomogram was developed to predict in-hospital mortality risk for patients with gastrointestinal bleeding in the ED. This nomogram includes five variables: ambulance ED arrival, shock index > 1, admitted to ICU, malignancy, and hemostatic procedure. The nomogram demonstrated good discriminative ability and calibration, and the DCA suggests potential clinical utility. To our knowledge, most current prediction models for mortality in patients with gastrointestinal bleeding focus on hospitalized UGIB or LGIB patients. Mortality in gastrointestinal bleeding patients admitted through the ED cannot be accurately predicted by emergency physicians, especially when the bleeding location is unclear. In this study, we developed a nomogram predicting all-cause mortality during hospitalization for such patients, incorporating five variables collected from patients admitted through the ED. The variables included in the nomogram were selected through LASSO regression analysis, which is considered superior to univariate analysis for predictor selection24,25.

Additionally, we evaluated the clinical significance of these predictive factors. The factors "arrival by ambulance to the emergency department" and "shock index > 1" reflect the urgency and severity of the patient’s condition and are associated with poorer outcomes in gastrointestinal bleeding patients26,27. The shock index is calculated by dividing the heart rate by the systolic blood pressure, with a normal range considered to be 0.5–0.7. A shock index > 1 has significant potential in predicting short-term adverse outcomes in upper gastrointestinal bleeding patients28. Dogru U, Yuksel et al. found that the shock index can serve as an important quantitative indicator for assessing mortality risk in gastrointestinal bleeding patients29. Emergency medical staff should prioritize patients arriving by ambulance and those with a shock index > 1, as these patients may require more urgent care. It is recommended to establish a fast-track system for such patients, optimize emergency procedures, and reduce the time spent in the ED to ensure timely and prioritized care for high-risk patients.

Admission to the ICU typically indicates that a patient’s condition is very severe, requiring close monitoring and advanced life support. Thus, "admitted to ICU" as an independent risk factor in predictive models reflects the severity of the patient’s condition. Although the ICU provides more medical resources, the mortality rate remains high due to the severity of these patients’ conditions. This severity underscores the importance of early identification and intervention, aiming to take effective measures before the condition worsens. Furthermore, ICU treatment is not solely focused on gastrointestinal bleeding; patients often have other systemic complications, the presence and treatment complexity of which can also affect prognosis30. ICU patients often require invasive procedures, such as mechanical ventilation and central venous catheterization, which can increase the risk of infections and other complications, further affecting prognosis31. While "admitted to ICU" is a significant independent risk factor, it does not necessarily mean that ICU treatment itself leads to higher mortality risk. Rather, it likely reflects the severity of the patient’s condition32. Therefore, for emergency clinical teams, the early identification of high-risk patients who may need ICU admission and proactive intervention before their condition deteriorates may help reduce in-hospital mortality rates.

Patients with co-existing malignancy generally have poorer overall health and more complex, variable conditions. Therefore, “malignancy” as an independent risk factor in predictive models reflects the complexity of the patient’s condition and poor prognosis. Despite the comprehensive treatment and care provided by medical teams, the patient mortality rate remains high due to the invasive nature of tumors and their systemic impact33. This finding highlights the necessity of fully considering the effects of systemic diseases and the importance of early, comprehensive treatment when managing such patients. Additionally, patients with malignancy often have other systemic complications such as anemia, malnutrition, and compromised immune function, in addition to requiring treatment for gastrointestinal bleeding34. The presence of these complications and the complexity of their management can significantly affect the patient’s prognosis. These patients may require multiple treatments, such as chemotherapy and radiation therapy, and the side effects of these treatments can further increase the complexity and risk of treatment35. Although “malignancy” is a significant independent risk factor, malignancy itself does not necessarily directly lead to higher mortality risk. It is more likely a reflection of the deterioration of the patient’s overall health status. Therefore, for patients with both malignancy and gastrointestinal bleeding, early identification of their complex conditions and the implementation of comprehensive treatment measures may help improve their prognosis. In clinical practice, individualized treatment strategies are particularly important for these high-risk patients. Multidisciplinary collaboration, including close coordination between oncology, gastroenterology, and surgery departments, can ensure that patients receive comprehensive and effective treatment, thereby potentially reducing in-hospital mortality rates.

Regarding "hemostatic procedure," the execution of hemostasis surgery in patients with gastrointestinal bleeding is closely related to adverse survival outcomes during hospitalization after being admitted through the ED. The success of hemostasis procedures directly affects bleeding control and the patient’s chance of survival36. Our study results showed that performing hemostasis surgery on gastrointestinal bleeding patients transferred from the ED to inpatient care increased survival rates by 62.76% compared with those observed when the procedure was not performed. This finding also reflects the severity and complexity of bleeding in nonsurgical patients, who may not tolerate surgery due to significant blood loss, difficult-to-control bleeding sites, or other serious comorbidities. Additionally, nonsurgical patients may have advanced tumors or other factors affecting surgical decisions, such as overall health status or tumor biology. Although studies have shown that hemostasis within 24 h significantly reduces mortality37,38, timely and effective hemostasis interventions during hospitalization remain crucial for saving lives in certain high-risk gastrointestinal bleeding patients who cannot be treated within 24 h18,39. For such patients, it is recommended to stabilize their condition as quickly as possible through multidisciplinary collaboration. The risk of bleeding-related complications and mortality can be minimized through a combination of pharmacological, endoscopic, and interventional treatments to achieve early hemostasis.

These five predictive indicators are readily obtainable in clinical settings. Our nomogram demonstrates good discrimination and calibration, and the DCA highlights its potential clinical utility. This freely accessible nomogram may provide emergency clinical teams with a practical tool for the early assessment and management of in-hospital mortality risk among patients admitted with gastrointestinal bleeding through the ED. This nomogram, built from variables available at ED presentation, may assist clinicians and nursing staff in the early identification of higher‑risk patients and in prioritizing monitoring, level of care, and the timing of diagnostic evaluation. Any implementation (e.g., within an electronic health record) should follow external validation, local calibration, and prospective impact assessment. Whether use of the model improves patient outcomes remains to be determined In our cohort, the nomogram achieved AUROC 0.862 in the training set and 0.846 in the validation set, with good calibration (slope ≈1.0; intercept ≈0), indicating acceptable/clinically useful risk estimation at ED presentation, pending external validation. Because several data elements required for established scores were not consistently captured, we did not perform head-to-head comparisons; the model is intended to complement, rather than replace, existing tools.Unlike existing tools such as the Glasgow–Blatchford, Rockall, AIMS65, or BLEED scores, which are site-specific (focusing only on UGIB or LGIB) and generally achieve AUROCs below 0.808,9,10, our model was developed specifically for the emergency department setting and is applicable to all GIB presentations, regardless of bleeding site. With an AUC > 0.84 in both training and validation cohorts, it may offer clinicians a more versatile option for early mortality risk prediction when the bleeding source is uncertain. Any impact on clinical outcomes will require external validation and prospective evaluation. However, whether this approach will lead to improved patient outcomes remains to be confirmed in future prospective studies and real-world clinical implementation. Notably, previous research has demonstrated that validated clinical prediction models can outperform subjective judgment in predicting in-hospital mortality in the emergency setting40. Our nomogram provides objective risk stratification; thus, it may serve as a valuable adjunct to emergency clinical decision-making for patients with gastrointestinal bleeding.

This study has some limitations. First, due to the study’s retrospective nature, the clinical utility of the nomogram requires external validation. Second, we did not integrate the model with existing clinical guidelines and practices. Further research is needed to validate this model. Nevertheless, we developed a model that demonstrated good internal performance on our dataset. Finally, our study results pertain to in-hospital mortality risk following admission from the ED. Further research is needed to determine if this model can predict in-hospital mortality for patients admitted directly from outpatient settings. In addition, as with all observational studies of clinical interventions, our analysis may be subject to confounding by indication and unmeasured confounders. For example, the observed association between hemostatic procedures and improved survival could be influenced by patient selection, as healthier patients are more likely to tolerate and receive such interventions. Therefore, this finding should be interpreted with caution, and further prospective studies are needed to validate our results.

Conclusion

A nomogram model was developed and validated to predict in-hospital mortality risk for patients with GIB in the ED. The model demonstrated good classification performance in the validation set, with an AUC of 0.847, indicating potential clinical utility that requires further confirmation in future studies. By integrating key clinical variables, such as age and shock index, the model provides the emergency clinical team with a potentially useful aid for rapid risk assessment and optimizing treatment decisions, especially for GIB patients with unclear bleeding sites. Despite the limitations associated with single-center, retrospective studies, the model may offer value for improving emergency risk stratification and the prognostic assessment of GIB patients. Future research should include external, multicenter, prospective validation to confirm generalizability and determine its real-world clinical impact.