Introduction

COVID-19 is a current pandemic disease caused by SARS-CoV-2 which is associated with serious health complications worldwide1. As of October 28, 2022, there were 626,337,158 confirmed cases of COVID-19, including 6,566,610 deaths, reported to the WHO2. The severity of COVID-19 infection ranges from asymptomatic manifestations to flu-like symptoms such as fever, headache, dry cough, dyspnea, mayalgia, and joint pain, eventually leading to fatal acute respiratory distress syndrome (ARDS)3,4,5. According to previous studies, the mortality rate for the most serious SARS-CoV-2 infections that require admission to the intensive care unit (ICU) ranges from 8.1 to 30% for hospitalized patients with SARS-CoV-2 pneumonia and up to 16% to 78% for patients who require admission to an ICU with critical care4,6,7,8.

According to previous studies, the COVID-19 outbreak is not uniform across nations, with notable variations in the proportion of serious diseases and case fatality rates9. Multicenter reports emphasize that patient-specific characteristics are important predictors of the presentation and consequences of COVID-19, even though the quality of healthcare services may be a factor in such variations10. Since the WHO officially declared a global pandemic in March 2020, there have been significant efforts to identify prognosticators that clinicians use to evaluate the risk at the early stage of the illness. This has helped to better tailor management strategies, assist decision-making, and promote health for COVID-19 patients by raising the therapeutic response, increasing the diagnostic accuracy, and lowering the case fatality rate11.

Various studies found that advanced age12,13,14,15, male gender13, and co-morbidities13,14,15 such as diabetes mellitus (DM), obesity, systemic hypertension (HTN), renal diseases, coronary artery disease15, and malignancy were risk factors for COVID-19 mortality. Beyond manifestations such as fever, cough16, haemoptysis12, dyspnoea12,16, unconsciousness12, clinical parameters such as elevated neutrophil/lymphocyte ratio12,17, and high levels of creatinine18, elevated lactate dehydrogenase (LDH)12,14,15,18, direct bilirubin12, and alanine amino-transferase (AST)18, which indicate early evidences to the severity of disease, an increased plasma level of biomarkers like d-dimer15,18,19,20, C-Reactive Protein (CRP)21,22, serum ferritin18, Interleukin-6 (IL-6)18, and procalcitonin (PCT) strengthens these findings18,19,20. The efficacy of particular antiviral and targeted immuno-modulatory therapies is still uncertain, therefore risk categorization and mortality prediction offer a logical strategy for allocating medical resources23.

The value of developing a score system for predicting the upcoming prognosis as mortality is vastly important for health care providers, especially doctors, to be more reliable and objective in evaluating their patients rather than being subjective24,25. Additionally, the recent publications in the medical field and beyond focus on the quality of the scoring system model and the methods of formation either through using strict and dedicated factors during statistical analysis steps or using machine learning programs25,26,27,28. Therefore, in order to estimate the probability of mortality in patients with COVID-19 pneumonia, forming a simple custom score could enhance the objective decision-making around treatment selection, as well as teach from trial and error in dealing with such a new disease. Our aim of the study was to develop and validate a simple scoring system to predict in-hospital mortality among patients with COVID-19 pneumonia within the first days of hospitalization, which was based mainly on different presenting symptoms, comorbidities, vital signs, and some laboratory data in addition to some local and affordable treatment options entered into the regression model. The multiple logistic regression model approaches were applied to our set of data to find the best equation that predicts the death probability. Furthermore, to measure the validity of that equation, a receiver operating characteristic curve was applied. Hence, the area under the curve expresses the goodness of that model in the prediction of mortality.

Material and methods

Study design and patient selection

A retrospective cohort study included all data records of patients with positive PCR for SARS-CoV-2 who was admitted to Almaza Military Hospital from January 2020 to the end of December 2021. A flow diagram showed the criteria of selected patients (Fig. 1). Hence, only 1535 patients had been considered as suggested candidates for primary study criteria. All of them had positive oropharangeal PCR tests in addition to CT chest. Those below 18 years old and those with free CT findings were excluded from the study. As well, cases with incomplete data records were also excluded. After informed written consent from all participants for publication of their data, the study has been approved by the Research Ethics Committee of the Faculty of Pharmacy, Tanta University (REC-TP code: TP/RE/012-21P-005) and the ethical committee office of the Medical Military Academy in agreement with the Helsinki Declaration Roles.

Figure 1
figure 1

Flow chart for the study design.

Data collection

The data was extracted from the patient's files and included basic socio-demographic data in addition to the presenting symptoms. Only laboratory data that was conducted in the first 24 h after admission was considered. The evaluation of CT chest and quantifying CT-SS as regarding Yang et al.29, which was performed by two independent radiologists for reliability. The data about treatment protocol as regarding local guidelines was also included.

Outcome

The outcome of desire was in-hospital mortality from COVID-19 pneumonia, which is defined as death during a period of admission to a hospital as a consequence of COVID-19 disease. Expectation of a short-term death after discharge from the hospital is also considered in-hospital mortality30.

Statistical analysis

Building the predictive model and development of scoring system

All the data was collected and coded on an Excel sheet. The normality of data was examined using the Shapiro–Wilk test using SigmaPlot for Windows version 12.5. 0.38 (Systat Software, Inc., UK, 2011). The descriptive statistics were performed using Minitab 17.1.0.0 for Windows (Minitab Inc., 2013, Pennsylvania, USA). In the first step, all data was subjected to univariate analysis. Hence, the comparison between two means was done using an independent t-test, while the frequency comparison was made using the chi-square test. Regarding the Neyman-Pearson theory of classical statistics31,32, the critical value of a significant number was selected by the observer to guarantee that the type II errors were minimized as much as possible and circumvent the interrelationship between type I and type II errors. Additionally, Park, 201333 mentioned that all variables could be implicated to test their correlation with the final outcome, provided that the log equation of fitness was acceptable; however, some authors preferred to select the variable of desire to be enrolled in the logistic equation after one step of simple hypothesis testing (univariate analysis)34,35, consuming that the margin of type II error was not so far away to accept the null hypothesis36. Therefore, in the current models, all factors with a p-value ≤ 0.35 were subjected to the second step, multivariate analysis. During building the logistic regression models, and to avoid multicollinearity, factors with VIF > 5 were eliminated. The goodness of fit for the regression model was performed using the Hosmer and Lemeshow test. Forward selection and backward elimination techniques were applied to select the best predictors for mortality. The Wald statistics numbers for each factor of interest were multiplied by the coefficient and divided by a constant value, and the result was rounded to the nearest integer number, taking into account the sign of the regression coefficient. According to the literature, all numerical data are transformed into categorical groups based on the distribution histogram and cumulative frequency or normal reference. The total score is then calculated from the submission of each individual score.

Score validation

The data for all factors implicated in the scoring system (SOM) was collected from the validation cohort, while the individual score and total score for every participant were calculated automatically in an Excel sheet. The performance of the total score was assessed using ROC curve analysis. The AUC above 0.6 was considered acceptable. The logistic regression analysis was finally performed to calculate the equation of death probability from total SOM. All tests were two-sided and a p-value of less than 0.05 was considered significant.

Institutional review board statement

The study was conducted in accordance with the Declaration of Helsinki, and approved by after approval by the Research Ethics Committee of Faculty of Pharmacy, Tanta University (REC-TP code: TP/RE/012-21P-005) and the ethical committee office of the Medical Military Academy.

Informed consent statement

Informed consent was obtained from all subjects involved in the study.

Results

Patients' characteristics

The total number of data points was 316, and the mortality rate of patients with COVID-19 pneumonia was 28.5%. As shown in Table 1, the basic criteria of both the survival and mortality groups were presented. One third of cases were male, and more than half of them were older than 60 years old. The most frequent comorbidities were DM and HTN, and nearly all patients were presented with cough and dyspnea. However, fever was a significant cardinal sign that correlated with mortality. The duration of complaints in the mortality group was significantly shorter than the survival one, as more than half of them came with a complaint history of less than 4 days. In the mortality group, the CT-SS was significantly higher and more than 2/3 of cases had severe and very severe degrees of CT scores (> 20). Moreover, neutrophil (%), NLR and PLR, as well as AST, urea, creatinine and IL-6 were significantly higher in the mortality group.

Table 1 Socio-demographic data and clinical characteristics.

Predictors of mortality

After selection of all factors in univariate analysis with a p-value ≤ 0.35, multivariate analysis with different approaches was applied to select the most significant factors that affected in-hospital mortality. As shown in Table 2, being elderly (> 60 years), having a shorter duration of complaint, having a high NLR, and having a higher CT-SS (> 20) were all significant independent predictors of mortality (P < 0.05 for all).

Table 2 Predictors of mortality using multivariate logistic regression model with different technique.

Score model formation and its performance

After collecting the predictors from the previous step, a multiple logistic regression analysis were applied to weight the factors and forming the scoring number based on both WS and Coefficient number. As shown in Table 3, two different scores could be used; the first one (SOM-1) including tocilizumab, and the second one (SOM-2) without it. However, both of them showed insignificant differences in discriminating in-hospital mortality (Fig. 2). Because the AUC was 85% and 84%, respectively, with P = 0.001 for both, SOM-1 was chosen for further validation.

Table 3 Development of scoring system based on best predictors for mortality.
Figure 2
figure 2

ROC curve for the proposed score of mortality (SOM) (SOM-1: score of mortality No-1, SOM-2: score of mortality No-2, AUC: area under curve, P considered significant if < 0.05).

Assessment of the new score: validation and prognostic utility

About 327 patients were subjected to score validation; the descriptive statistics of the validation cohort were summarized in Table 4, in which the mortality rate was 28.74%.

Table 4 Characters of validity cohort.

Moreover, Fig. 3a showed the median (IQR) value of SOM-1 was significantly higher in the mortality group; (13.5 (3–18) than the survival one; 2 (− 0.5–7)), P < 0.001. The prognostic utility of SOM-1 was so good; the AUC was 75%, P < 0.001 (Fig. 3b), and at cutoff values of above 5 and 16.5, the sensitivity and specificity were above 90%, respectively (Table 5). The probability of mortality increased with every unit increase in the SOM-1 (Fig. 3c).

Figure 3
figure 3

Score of mortality (SOM) in validating cohort.

Table 5 Performance of SOM-1 in predicting in-hospital mortality.

Discussion

The current study used the observational data from patients with COVID-19 pneumonia to develop a simple predictive model for further building a scoring system that easily calculates the probability of in-hospital mortality. The algorithms of the predictive models were applied to 316 patients and included simply collected historical, clinical, and laboratory data on the day of admission, which gave our model of prediction the capability to be applied in other medical sectors with an affordable set of data collection and less sophisticated investigation. However, so many scoring systems have been developed since the start of the pandemics to predict different outcomes related to COVID-19 disease. Their sensitivity and specificity ranged between 70 and 100%37,38,39, and the most effective study recorded a model with excellent validity (AUC = 93.8%)20.

The present score, besides its easy and manual calculation from simple data, had the capability for prognosis prediction; the utility of the score was very good enough to be accepted, hence the AUC of 85% in the training cohort and still good when applied to the validating cohort (AUC of 75%), which indicates how much the stability of the score in predicting the disease prognosis made the treatment strategy more powerful if it was applied as early as possible for a much better outcome. Moreover, during building the model, we entered medication that was used in local treatment protocol in response to laboratory results to estimate if these factors were implicated in the final outcome or not. We reported that remdesivir and tocilizumab were significantly correlated with in-hospital mortality. Therefore, two scoring systems were calculated with tocilizumab (SOM-1) and without tocilizumab (SOM-2), with an insignificant difference between both of them. The accuracy was 85 and 84%, respectively (Fig. 2). For that reason, SOM-1 was the choice for further external validity. Another concern that made our newly formed score more stable was that we depended on both the coefficient number and Wald statistic number from the logistic regression model to weight the predictors.

The attendance results did not find any surprising factors that affected the mortality except the duration before hospital admission. Hence, we found that patients with a shorter duration of less than 4 days before admission died, which made the explanation more difficult, and it could be related to the state of denial that the patients caught the disease and suffered from silent prolonged hypoxia. Surprisingly, only one study includes this historical item in the predictive model of COVID-19-related mortality. Henderson et al. reported that a shorter time from symptom onset to hospitalization is associated with a more serious disease and higher mortality40. On the other hand, old age was correlated with bad prognosis, which came in consistence with other reports41,42,43,44,45,46.

Despite the fact that many studies have pointed to the importance of male sex and associated comorbidity in poor prognosis45,46,47,48,49, our study did not find that link after multiple filtration of logistic regression modeling, and a recent study supported that50. Furthermore, the severity level of lesion in HRCT was one of the most reliable predictors; thus, it was associated with a poor prognosis, despite the fact that a few studies used that factor to predict in-hospital mortality outcome50,51,52.

Our study introduced some treatment medications like iverzine53,54, sofosbuvir/ledipasvir55, remdesivir, and tocilizumab to be estimated as predictive factors for mortality in a regression model. iverzine was used following our national COVID-19 management guidelines in its old versions53,54, however it is deleted from the recent version56. Only remdesivir and tocilizumab continued to be linked with mortality after multiple filtrations of the predictors. Hence, remdesivir was found to be protective against a bad prognosis, and early administration could inhibit the replication of viruses and decrease the viral load; it could also counter the process of pathology inside the lung parenchyma that finally led to improvement in the lung lesion57.

On the other hand, our data showed that tocilizumab increased the likelihood of mortality two times more, which could be due to limiting the use of that medication in patients with severe disease, so the risk of its link to a bad prognosis became much higher. Additionally, tocilizumab had been prescribed in COVID-19 patients with an established higher level of IL-6, denoting that a cascade of cytokines storm had been started58,59,60. Although a pooled analysis of systematic reviews on tocilizumab and mortality outcomes found that it was not only protective against bad outcomes, it was also significantly linked to post-drug infection, which led to a poor prognosis from super infection61. That fact could explain the present finding and draw attention to the limited power of using biological therapy in treatment. Nevertheless, several newly published randomized controlled trials (RCTs)62,63,64,65,66,67,68,69,70 and meta-analyses71,72,73 of RCTs have investigated the effects of TCZ as an adjunctive therapy in patients with COVID-19 but have reported inconsistent results. Moreover, there are increasing number of newly available studies regarding TCZ treatment for COVID-19. Therefore, there are still limited real-world data about the effect of TCZ on inflammatory activity in COVID-19 patients74.

Limitations

Even though our study was the first of its kind that was proposed in Egypt, which is a developing country with limited resources, it highlighted the ability of simple data to predict the outcome of COVID-19 patients. However, the study showed some limitations; the first was the types of study design; hence, the main issue with retrospective cohort type studies was the effect of confounders, which could influence the final outcome in an unpredicted way. The second limitation was the single-centre study, which increased the demand for further external validation from other centers. Additionally, the third point was that the study excluded younger individuals < 18 years old during recruitment, which may be an area of future interest. Moreover, the score was constructed for only COVID-19 patients, which made the comparison with other alternative scores much more difficult to apply.

Conclusion

The constructed score (EGY.Score) from the observational data could predict the prognosis of patients with COVID-19 pneumonia, which may possibly be used to adjust the management intervention for further gain of a desirable outcome.