Abstract
Background
Monitoring the number of COVID-19 patients in hospital beds was a critical component of Australia’s real-time surveillance strategy for the disease. From 2021 to 2023, we produced short-term forecasts of bed occupancy to support public health decision-making.
Methods
We present a model for forecasting the number of ward and intensive care unit (ICU) beds occupied by COVID-19 cases. The model simulates the stochastic progression of COVID-19 patients through the hospital system and is fit to reported occupancy counts using an approximate Bayesian method. We do not directly model infection dynamics; instead, we take independently produced forecasts of case incidence as an input, enabling our model to be developed independently of the underlying case forecast(s).
Results
Here, we evaluate the performance of 21-day forecasts of ward and ICU occupancy across Australia’s eight states and territories produced between March and September 2022. We find that forecasts are on average biased downwards immediately prior to epidemic peaks and biased upwards post-peak. Forecast performance is best in the jurisdictions with the largest population sizes.
Conclusions
Our forecasts of COVID-19 hospital burden were reported weekly to national decision-making committees to support Australia’s public health response. Our modular approach to forecasting clinical burden allowed our model to be developed independently of the underlying case forecast(s) while still leveraging the performance benefits of an ensemble case forecast.
Plain language summary
During the COVID-19 pandemic, predicting the potential future impact of the disease on hospitals was crucial. In this study, we aimed to predict how many hospital ward and intensive care unit (ICU) beds would be needed for COVID-19 patients in Australia with the aim of supporting public health decision-makers. Our approach used forecasts of new COVID-19 cases as input and factored in real-time information such as how likely cases were to be hospitalised or require ICU care.
We found that, while generally accurate, our forecasts tended to underpredict just before a wave of infections peaked, and overpredict after the peak had passed. Our flexible modelling method could be adapted to predict hospital needs for other infectious diseases in the future, helping to prepare for epidemics of illnesses like influenza or RSV.
Introduction
Throughout 2020–2022, SARS-CoV-2 induced large epidemic waves of infection internationally, with a considerable proportion of these infections requiring medical care. During peak epidemic periods, the demand for hospital beds overwhelmed the capacity of healthcare systems in many settings1,2,3. The number of beds occupied by COVID-19 cases depends upon the number of new patients admitted and the length of stay of these patients—with both quantities being products of the severity of disease and of clinical practice4,5,6,7. Forecasts of hospital occupancy can therefore provide valuable intelligence to support public health decision-making.
Australia’s early COVID-19 experience differed from that of most other countries, with only a small proportion of the population having been infected prior to the widespread uptake of vaccination; by December 2021, over 80% of adults had been vaccinated, and less than 2% of adults had been recorded as infected amidst intensive public health measures8,9. The Omicron variant of SARS-CoV-2 emerged in November 2021, with the Omicron BA.1 lineage inducing major waves of infection across Australia and resulting in at least 17% of the population having been infected by March 202210. We limit our study to the period between March and September 2022, which was defined by two major waves of infection: a wave induced by the Omicron BA.2 lineage, which peaked in March–April 202211; and a wave induced by the Omicron BA.4 and BA.5 lineages, which peaked in late July 202212.
In this work, we describe a model for producing short-term (21-day) forecasts of hospital occupancy. We chose daily bed occupancy as a forecast target—rather than daily admissions—as occupancy more closely relates to the overall capacity of the hospital system. Furthermore, such bed occupancy counts had been collected and publicly reported for each state and territory of Australia on a daily basis since the early stages of the pandemic13. Our forecasting model takes as input an independently produced forecast of daily case incidence (specifically, an ensemble forecast consisting of four component models, each produced by different researchers14), with this incidence then transformed into ward and ICU occupancy counts through a stochastic compartmental model, with the probabilities of hospitalisation and of ICU admission informed by near-real-time data. The duration of time spent in each compartment is informed by censoring-adjusted estimates of patient length of stay15. Simulation outputs are then fit to reported occupancy counts using an Approximate Bayesian Computation approach16.
Under the specifications of the Australian National Disease Surveillance Plan for COVID-1917, we reported forecasts from our model to key national decision-making committees on a weekly basis as part of a national COVID-19 situational assessment programme14. We examine the performance of the forecasts throughout the study period (March–September 2022), both qualitatively—using visual checks—and quantitatively—with the use of formal statistical metrics18,19,20,21. We discuss how the performance of our occupancy forecasts changes with the epidemiological context and how it depends upon the performance of the input case forecasts.
Methods
Summary
We produced forecasts of the number of COVID-19 cases in hospital ward and ICU beds (i.e. the ward and ICU occupancies) on a weekly basis using a bespoke clinical forecasting pipeline (Fig. 1a). We simulated the pathways taken by COVID-19 cases through a hospital as flow through a compartmental model (Fig. 2a). Our clinical forecasting pipeline takes in three primary inputs: an ensemble case forecast, time-varying estimates of key epidemiological parameters (the age distribution of cases, the probability of hospital admission, and the probability of ICU admission), and estimates of patient length of stay. The model outputs are fit to reported occupancy counts across a seven-day window prior to the forecast start date using Approximate Bayesian Computation (ABC)16. We reported the resultant 21-day forecasted counts of ward and ICU occupancy to public health committees on a weekly basis.
a The compartmental model. The probability of transition between Case and Ward and between Ward and ICU was informed by time-varying age-specific estimates; all other probabilities were specified according to age-specific estimates from the multi-state length of stay analysis. The number of occupied ward beds reported by the model is the sum of the individuals in the Ward and Post-ICU ward compartments, and the number of occupied ICU beds is the number of individuals in the ICU compartment.
Compartmental pathways model
Our compartmental model simulates the progression of severe COVID-19 disease and corresponding pathways taken through a hospital (Fig. 2a). The design of this model was informed by COVID-19 clinical progression models previously developed for the Australian health system context22,23,24. In our model, new COVID-19 cases start in the Case compartment according to their date of symptom onset (inferred where not recorded). From this compartment, some fraction of cases are admitted to hospital, according to a (time-varying) probability of case hospitalisation. Hospitalisations start in the Ward compartment, from which a patient can then develop further severe disease and be admitted to ICU, according to a (time-varying) probability of ICU admission. Patients in the ICU compartment can then move to the Post-ICU ward compartment. In addition, across each of the Ward, ICU, and Post-ICU ward compartments, we assume patients have some probability of dying or being discharged. We count the number of occupied ward beds as the number of patients in the Ward and Post-ICU ward compartments, and the number of occupied ICU beds as the number of patients in the ICU compartment.
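The structure of this progression can be sketched for a single patient in R (the language of our pipeline). This is an illustrative sketch only: the transition probabilities and gamma length of stay parameters below are hypothetical placeholders, not the fitted estimates used in the study.

```r
# Illustrative sketch: one patient's pathway through the compartmental model.
# All probabilities and gamma parameters are hypothetical placeholders.
simulate_patient <- function(onset_day) {
  p_hosp <- 0.05; p_icu <- 0.10              # hypothetical probabilities
  los <- function(shape, scale) rgamma(1, shape = shape, scale = scale)
  if (runif(1) > p_hosp) return(NULL)        # case never hospitalised
  admit    <- onset_day + los(2.0, 2.0)      # Case -> Ward delay
  ward_end <- admit + los(1.5, 4.0)          # stay in Ward
  if (runif(1) > p_icu)                      # discharged or died from Ward
    return(list(ward = c(admit, ward_end), icu = NULL, post = NULL))
  icu_end  <- ward_end + los(1.8, 5.0)       # stay in ICU
  post_end <- icu_end + los(1.5, 3.0)        # stay in Post-ICU ward
  list(ward = c(admit, ward_end),            # interval occupying a ward bed
       icu  = c(ward_end, icu_end),          # interval occupying an ICU bed
       post = c(icu_end, post_end))          # Post-ICU ward: also a ward bed
}
```

Daily ward occupancy is then obtained by counting patients whose ward or post intervals cover a given day, and ICU occupancy by counting icu intervals; the production model performs this accounting efficiently at the population scale (see Simulation and inference).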
Length of stay estimates
To simulate the flow of patients through the compartmental model, we need to specify distributional estimates of the duration of time they will spend within a compartment before a transition occurs (i.e. their length of stay), and the probabilities of each particular transition occurring (i.e. transition probabilities). We produced estimates of length of stay and transition probabilities using a multi-state survival analysis approach15. This survival analysis framework allowed us to produce estimates across our compartmental model in near-real-time while accounting for right-censoring, such that we could rapidly incorporate any changes in length of stay or transition probabilities when necessary. Changes in these quantities may have arisen as a consequence of factors such as a new variant exhibiting different clinical severity, changes in clinical practice, or vaccination. Although we did not include these factors as covariates in the survival model, their net effect on length of stay statistics during the study period was captured by producing our length of stay estimates over only recently admitted patients. We estimated length of stay and transition probabilities using hospital data from the state of New South Wales (see ref. 15, Supplementary Methods). We were not able to produce similar estimates for the other states and territories of Australia as the requisite line-listed hospital stay data were not accessible to us or did not exist. The delay distribution for Case to Ward was informed by estimates (not described here) from the FluCAN sentinel hospital surveillance network study25, as appropriate data were not available to estimate this delay in the New South Wales dataset (Supplementary Methods), noting that this delay only affects the relative timing of the occupancy time series. The transition probabilities from the multi-state survival model were used across all transitions in the compartmental model except for the Case to Ward and the Ward to ICU transitions. The transition probabilities for these two transitions were estimated as time-varying (described later), given their substantial impact upon the net occupancy counts. The length of stay and transition probability estimates were provided to the simulation model as bootstrapped samples of gamma distribution shape and scale parameters and multinomial probabilities of transition.
Case incidence
In our compartmental model (Fig. 2a), cases of COVID-19 begin in the Case compartment. As such, we must inform the model with the number of new cases entering this compartment each day: we achieve this through use of a time series of historically reported case incidence concatenated with a trajectory of forecasted case incidence.
We received time series of historical case incidence indexed by date of symptom onset from an external model26. This external model performs imputation of symptom onset dates where they have not been recorded in the data, with the final time series being the count of cases with a (reported or imputed) onset date on each given date. Because this external model did not perform multiple imputation of the symptom onset date, we added noise to capture uncertainty in the case counts via sampling from a negative binomial distribution with a mean of the historical case count and a dispersion of k = 25. This uncertainty was expected to assist the subsequent inference stage by increasing the prior predictive uncertainty. A dispersion value of 25 was selected through visual inspection such that the expected variability in incidence by symptom onset date was captured (noting that the subsequent inference stage was able to further refine this uncertainty where necessary, e.g. rejecting samples where the uncertainty in case incidence was too great or too small).
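This perturbation step can be sketched as follows, using a toy incidence series rather than real data:

```r
# One perturbed sample of an incidence series: each day's count is drawn
# with mean equal to the reported count and dispersion k = 25, giving
# variance mu + mu^2/k.
noisy_incidence <- function(counts, k = 25) {
  rnbinom(length(counts), size = k, mu = counts)
}
set.seed(1)
noisy_incidence(c(120, 154, 180, 210))  # toy series, illustrative only
```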
Our method is agnostic to the case forecasting approach used as input, thus allowing us to couple it with any independently produced forecast of case incidence. Here we used outputs from an ensemble forecast of case incidence, which varied in model composition during the study period (methodologies and summary outputs for the ensemble forecast are publicly available14). A total of four different models were used at various stages: two mechanistic compartmental models, one mechanistic branching process model, and a non-mechanistic time series model (see refs. 14,27,28 for details). Models within the ensemble received ongoing development across the study period in response to changes in our understanding of the epidemiology and biology of the virus14.
Estimation of time-varying parameters
We specified three parameters in the compartmental model of clinical progression as time-varying. For each forecast, we produced estimates stratified by age group a and varying with time t of: the probability of a case being within a certain age group, \({p}_{\text{age}}(a,t)\); the probability of a case being hospitalised, \({p}_{\text{hosp}}(a,t)\); and the probability of a hospitalised case being admitted to ICU, \({p}_{\text{ICU}}(a,t)\). These parameters were chosen to capture phenomena such as changes in case age distribution, changes in case ascertainment, differences in variant virulence and outbreaks of the disease within population subgroups. We defined age groups as 10-year groups from age 0 to 80, followed by a final age group comprising individuals of age 80 and above (i.e. 0–9, 10–19, ..., 80+).
The time-varying parameters were estimated using case data from the National Notifiable Disease Surveillance System (NNDSS), which collates information on COVID-19 cases across the eight states and territories of Australia. For each case in this dataset, we extracted the date of case notification, the recorded symptom onset date, the age of the case, and whether or not the case had been admitted to hospital or ICU. Where symptom onset date was not available, we assumed it to be one day prior to the date of notification (the median delay observed in the data).
For each of the three time-varying parameters, we constructed estimates using a one-week moving-window average, with estimates for time t including all cases with a symptom onset date within the period (t − 7, t]. To capture uncertainty in these time-varying parameters, estimates were produced using bootstrapping (sampling with replacement) from the line-listed data. A total of 50 bootstrapped samples were produced, with each sample consisting of three parameter time series (each stratified by nine age groups for a total of 27 time series). At the simulation and inference stage, each simulation received a single such sample as input, such that correlation between the 27 time series was preserved. We calculated the first parameter \({p}_{\text{age}}(a,t)\), which defines the multinomial age distribution of cases over time, as the proportion of cases within each age group for an estimation window:

$${p}_{\text{age}}(a,t)=\frac{{\sum }_{\tau \in (t-7,\,t]}{n}_{a}(\tau )}{{\sum }_{{a}^{\prime}}{\sum }_{\tau \in (t-7,\,t]}{n}_{{a}^{\prime}}(\tau )},$$

where \({n}_{a}(\tau )\) is the number of cases in age group a with symptom onset at time τ. To calculate the probability of a case being hospitalised and the probability of a hospitalised case being admitted to ICU, we produced estimates with adjustment for right-truncation. Here, right-truncation was present as we used near-real-time epidemiological data and indexed our estimates by date of symptom onset. The most recent symptom onset dates in our estimates thus included cases that would eventually be (but had not yet been) hospitalised (and similarly for cases admitted to hospital, but not yet admitted to ICU). Had we not accounted for this right-truncation, we would have consistently underestimated the probabilities of hospitalisation and ICU admission for the most recent dates. We describe the maximum-likelihood estimation of the hospitalisation and ICU admission parameters in the Supplementary Methods.
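As an illustrative sketch of the bootstrap for the age-distribution parameter (the hospitalisation and ICU parameters additionally require the right-truncation adjustment described above), assuming a hypothetical line-listed data frame `cases` with `onset` (numeric day) and `age_group` columns:

```r
# Moving-window bootstrap estimate of p_age(a, t): proportions over cases
# with symptom onset in (t - 7, t], resampled with replacement. In the full
# pipeline all three parameters are computed from the same resample, so
# that correlation between the 27 time series is preserved.
p_age_bootstrap <- function(cases, t, n_boot = 50) {
  window <- cases[cases$onset > t - 7 & cases$onset <= t, ]
  groups <- sort(unique(cases$age_group))
  replicate(n_boot, {
    idx <- sample(nrow(window), replace = TRUE)
    prop.table(table(factor(window$age_group[idx], levels = groups)))
  })  # returns an age-group x bootstrap-sample matrix of proportions
}
```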
If in a given reporting week we identified a jurisdiction as having unreliable data on hospitalised cases (most often, missing data on cases admitted to hospital or ICU due to data entry delays), we replaced the local estimates with estimates produced from pooled data across all other (reliable) jurisdictions. Changes made in this regard during the study period are listed in the Supplementary Methods.
Simulation and inference
To simulate a single trajectory of ward and ICU occupancy, we sampled: a trajectory of case incidence from the ensemble; a sample of the bootstrapped time-varying parameters (each comprising three time series across nine age groups); and a sample from the bootstrapped length of stay and transition probability estimates. Using these inputs, we performed simulations across the compartmental model (Fig. 2a) independently across each age group and then summed across all age groups to produce total ward and ICU counts for each day. The compartmental model simulates the pathways of patients through the hospital at the population scale with an efficient agent-based approach; we provide details on this algorithm in the Supplementary Methods.
To ensure that trajectories simulated from the clinical pathways model aligned with reported occupancy counts, we introduced a simple rejection-sampling approximate Bayesian method, rejecting trajectories that did not match the true reported occupancy counts within a relative tolerance ϵ across a one-week calibration window. For each simulation with a simulated ward occupancy count \(\hat{W}(t)\) and simulated ICU occupancy count \(\hat{I}(t)\), simulations were rejected where either:

$$\mathop{\max }\limits_{t}\frac{| \hat{W}(t)-W(t)| }{W(t)} > \epsilon \quad {\mbox{or}}\quad \mathop{\max }\limits_{t}\frac{| \hat{I}(t)-I(t)| }{I(t)} > \epsilon ,$$

where W(t) and I(t) were the true reported occupancy counts for each date t in the fitting window, with these counts retrieved from the covid19data.com.au project13. We selected ϵ using a simple stepped threshold algorithm: initialising ϵ at the smallest value in the sequence [0.1, 0.2, 0.3, 0.5, 1, 10], we sampled simulations until 1000 trajectories had been accepted by the model. If 1000 trajectories had not been accepted by the time 100,000 simulations had been performed (i.e. 100 rejections per target output trajectory), we increased ϵ to the next value in the sequence and restarted the sampling procedure. This behaviour was chosen to achieve a good degree of predictive performance while ensuring that reporting deadlines were met (typically less than 24 h from receipt of ensemble forecasts and relevant hospital data), even where the model was otherwise unlikely to capture hospital occupancy at tighter tolerances.
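The structure of this procedure can be sketched as follows; `simulate_occupancy()` is a placeholder for a single draw from the full simulation model, and the acceptance test is one plausible reading of the relative-tolerance criterion above:

```r
# Stepped-tolerance rejection sampler (sketch). `simulate_occupancy()` is a
# placeholder returning list(W = ..., I = ...) of daily ward and ICU counts
# over the calibration window. Zero observed counts would need guarding in
# practice.
abc_sample <- function(W_obs, I_obs, n_target = 1000, max_sims = 100000,
                       eps_ladder = c(0.1, 0.2, 0.3, 0.5, 1, 10)) {
  for (eps in eps_ladder) {
    accepted <- list(); n_sims <- 0
    while (length(accepted) < n_target && n_sims < max_sims) {
      n_sims <- n_sims + 1
      sim <- simulate_occupancy()
      ok_W <- max(abs(sim$W - W_obs) / W_obs) <= eps
      ok_I <- max(abs(sim$I - I_obs) / I_obs) <= eps
      if (ok_W && ok_I) accepted[[length(accepted) + 1]] <- sim
    }
    if (length(accepted) == n_target) return(accepted)  # tolerance sufficed
  }
  accepted  # otherwise, what was accepted at the loosest tolerance
}
```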
We fit simulation outputs over a calibration window defined as the seven days following the start of the 28-day case forecast. This was chosen such that the most up-to-date occupancy data could be used in fitting (typically data as of, or a day prior to, the date clinical forecasts were produced). We could fit the clinical forecast over occupancy data points which were seven days in the future relative to the start of the case forecast for two reasons: the case forecasts were indexed by date of symptom onset and began at the date where a majority (>90%) of cases had experienced symptom onset, adding a delay of 2–3 days; and case forecasts were affected by reporting delays of 3–4 days (whereas occupancy data was not lagged). We did not fit over a larger window as the seven-day window was expected to be sufficient for our purposes, and the computational requirements of model fitting would increase exponentially with a larger window. The forecasts we reported on a weekly basis and examine here are the model outputs across the 21 days following this seven-day fitting window.
We introduced two additional parameters to improve the ability of the model to fit to the reported occupancy counts. These parameters increased variance in the magnitude of the output ward and ICU occupancy count trajectories, reducing the probability of a substantial mismatch between these trajectories and the reported occupancy counts. The first parameter added was H, a modifier on the probability of hospitalisation acting linearly across logit-transformed values:

$$\,{\mbox{logit}}\,({p}_{\,{\mbox{hosp}}\,}^{\prime}(a,t))=\,{\mbox{logit}}\,({p}_{\,{\mbox{hosp}}\,}(a,t))+H.$$

The second parameter added was L, which modified the shape of the length of stay distributions across the transitions out of Case, Ward, and Post-ICU Ward, acting linearly across log-transformed values:

$$\log ({k}^{\prime})=\log (k)+L,$$

where k and k′ denote the baseline and modified gamma shape parameters of the relevant length of stay distributions.
The values of H and L were sampled from normal distribution priors with means of zero and variances \({\sigma }_{\,{\mbox{hosp}}\,}^{2}=0.8\) and \({\sigma }_{\,{\mbox{los}}\,}^{2}=0.5\), respectively. We specified these values to reduce the computational time required while ensuring the output model trajectories had good coverage over the reported occupancy counts. These parameters were changed for some jurisdictions during the study period; see the Supplementary Methods for details.
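A minimal sketch of how prior draws of H and L act on the model inputs (the baseline values here are illustrative only):

```r
# H shifts hospitalisation probabilities on the logit scale; L shifts the
# gamma shape of selected length of stay distributions on the log scale.
p_hosp <- 0.05; shape <- 1.5                  # hypothetical baseline inputs
H <- rnorm(1, mean = 0, sd = sqrt(0.8))       # sigma^2_hosp = 0.8
L <- rnorm(1, mean = 0, sd = sqrt(0.5))       # sigma^2_los  = 0.5
p_hosp_mod <- plogis(qlogis(p_hosp) + H)      # modified admission probability
shape_mod  <- exp(log(shape) + L)             # modified gamma shape
```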
To illustrate the effect of the H and L parameters, we simulated model outputs for an example forecast both with these parameters active and with them fixed at zero (Supplementary Methods). This demonstrates that output trajectories may already align with the reported occupancy counts without the effect of H and L; where this does not occur, however, these parameters enable the recent reported occupancy counts to be well captured by the fitted model outputs (Supplementary Methods).
Performance evaluation
We consider the performance of our forecasts produced between March and September 2022. We produced plots for the visual assessment of forecast performance (Fig. 3a, b and Supplementary Figs. 9–24) which depict all forecasts across the study period with the same presentation of uncertainty as was used in official reporting of the forecasts (with pointwise credible intervals ranging from 20% through to 90% by steps of 10%, and reported occupancy counts overlaid).
a Forecasts of ward occupancy. b Forecasts of ICU occupancy. Credible intervals from 20% through to 90% in 10% increments are displayed in progressively lighter shading. Reported occupancy counts are overlaid. As we produced our forecasts on a weekly basis and each forecast spans three weeks, forecasts are plotted interleaved across three rows; reported occupancy counts are repeated across each row. Forecast start dates are displayed as vertical dashed lines. Note that forecast start date was dependent upon that of the case forecast, and this varied slightly over time (see forecasts 5, 9, 12, and 19). The second week for each forecast (days 8–14) has background shaded in light blue. An identifier for each forecast, 1 through 21, is displayed above each forecast start, and a ^ is displayed where the upper credible intervals of a forecast exceed the y-axis limits. Forecasts for other states and territories are provided in the Supplementary Materials.
To evaluate the overall performance of our forecasts, we calculated continuous ranked probability scores (CRPS) across log-transformed counts of occupancy. The CRPS measures the distributional accuracy of a set of forecasts against the eventual observations, with lower values indicating better performance18. The CRPS is a proper scoring rule: in expectation, a forecast that reports the true probabilities of the underlying process achieves the best (lowest) score. We calculated CRPS over log-transformed counts (specifically, \({x}^{* }={\log }_{e}(x+1)\)) rather than over raw counts, as this has been argued to be more meaningful given the exponential nature of epidemic growth19. This transformation also allows us to interpret the resultant CRPS values as a relative error19, enabling comparison of forecast performance between different settings. We also calculated skill scores of our forecasting model in comparison to a naive random walk model (Supplementary Methods), with results presented in Supplementary Figs. 7 and 8.
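For forecasts represented as samples, the CRPS over transformed counts can be computed via the identity CRPS(F, y) = E|X − y| − 0.5 E|X − X′|; a minimal sketch:

```r
# CRPS over log(x + 1)-transformed counts, computed from forecast samples
# via the sample-based (energy) form of the score.
crps_log <- function(samples, obs) {
  x <- log(samples + 1)
  y <- log(obs + 1)
  mean(abs(x - y)) - 0.5 * mean(abs(outer(x, x, "-")))
}
set.seed(1)
crps_log(samples = rpois(1000, lambda = 50), obs = 55)  # toy example
```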
We calculated forecast bias to examine where the overall performance of our forecast was reduced due to consistent overprediction or underprediction20 (Fig. 5a–h). Forecast bias (as opposed to, for example, estimator bias29) ranges between −1 and 1, with a bias greater than zero indicating overprediction and less than zero indicating underprediction. Bias values of approximately zero are ideal, indicating a forecast that overpredicts as often as it underpredicts (or vice versa).
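A sample-based sketch of this bias measure, using the continuous-forecast form of ref. 20 (which also gives an adjusted form for discrete counts):

```r
# Forecast bias as 1 - 2 * F(y), estimated from samples: values near +1
# indicate consistent overprediction, values near -1 underprediction.
forecast_bias <- function(samples, obs) {
  1 - 2 * mean(samples <= obs)
}
set.seed(1)
forecast_bias(rpois(1000, lambda = 70), obs = 50)  # mass above obs: near +1
```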
We produced plots demonstrating the association between the performance of our ward and ICU occupancy forecasts and the underlying case forecasts used as input. Specifically, we compared the case forecast performance calculated using CRPS to the bias of the ward occupancy forecasts (Fig. 6a–h) and ICU occupancy forecasts (Supplementary Fig. 5), and the bias of the case forecasts to that of the ward occupancy forecasts (Supplementary Fig. 6). These values were calculated across the whole horizon of the respective forecasts; it should be noted that such comparisons are inherently limited due to the lag between onset of symptoms and admission to hospital, i.e. the performance of the case forecast at the 28-day horizon would be expected to have less influence, given those cases are less likely to be hospitalised within the time frame of our simulation.
We produced probability integral transform (PIT) plots to evaluate the calibration of the forecast (Supplementary Fig. 3). Calibration refers to the concordance between the distribution of our forecasts and the eventual distribution of observations21; for example, in a well-calibrated forecast, each decile across the distribution of all forecast predictions should contain ~10% of the eventual observations. Where overlapping intervals contained the eventual observation (typically due to small integer counts, e.g. in smaller population size jurisdictions), we counted each overlapping interval as containing the observation, with these down-weighted such that any given observation only contributed a total count of one.
Version control repositories are available on GitHub for the simulation and inference steps (http://github.com/ruarai/curvemush), the forecasting pipeline (http://github.com/ruarai/clinical_forecasts), and performance evaluation and manuscript figure plotting code (http://github.com/ruarai/clinical_forecasting_paper). Analysis was performed in the R statistical computing environment (version 4.3.2)30. The forecasting pipeline was implemented using the targets package31, with tidyverse packages used for data manipulation32, pracma for numerical solutions of the maximum-likelihood estimates33, and Rcpp for interfacing with the stochastic simulation C++ code. Forecasting performance was evaluated using the fabletools, tsibble, and distributional packages34,35,36.
Ethics
The study was undertaken as urgent public health action to support Australia’s COVID-19 pandemic response. The study used data from the Australian National Notifiable Disease Surveillance System (NNDSS) provided to the Australian Government Department of Health and Aged Care under the National Health Security Agreement for the purposes of national communicable disease surveillance. Non-identifiable data from the NNDSS were supplied to the investigator team for the purposes of provision of epidemiological advice to government; data were securely managed to ensure patient privacy and to ensure the study’s compliance with the National Health and Medical Research Council’s Ethical Considerations in Quality Assurance and Evaluation Activities. Contractual obligations established strict data protection protocols agreed between the University of Melbourne and sub-contractors and the Australian Government Department of Health and Aged Care, with oversight and approval for use in supporting Australia’s pandemic response and for publication provided by the data custodians represented by the Communicable Diseases Network of Australia. The use of these data for these purposes, including publication, was agreed by the Department of Health with the Communicable Diseases Network of Australia. Ethical approval for this study was also provided by The University of Melbourne’s Human Research Ethics Committee (2024-26949-50575-3). Further, as part of this ethics approval, the University of Melbourne’s Human Research Ethics Committee provided a waiver of consent for the use of the case data, as it was believed to be impracticable to contact each individual included in this routinely collected surveillance data and that there was no likely reason that individuals would not consent if asked.
The study used routinely collected patient administration data from the New South Wales (NSW) Patient Flow Portal (PFP). De-identified PFP data were securely managed to ensure patient privacy and to ensure the study’s compliance with the National Health and Medical Research Council’s Ethical Considerations in Quality Assurance and Evaluation Activities. These data were provided for use in this study to support public health response under the governance of Health Protection NSW. The NSW Public Health Act (2010) allows for such release of data to identify and monitor risk factors for diseases and conditions that have a substantial adverse impact on the population and to improve service delivery. Following review, the NSW Ministry of Health determined that this study met that threshold and therefore provided approval for the study to proceed. Approval for publication was provided by the NSW Ministry of Health.
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Results
Visual performance assessments
We examined the qualitative performance of our ward and ICU forecasts through visual assessment using the state of New South Wales as a case study (Fig. 3a, b, respectively). Forecasts 1–3 captured both ward and ICU counts (with observed data falling within the 60% intervals across the 15–21 day horizon) during the early growth phase of Omicron BA.2 in late March and early April 2022. The peak in ward occupancy induced by Omicron BA.2 in late April fell within the central density (50% interval) across the 15–21 day horizons of forecasts produced 2–3 weeks prior to the peak (forecasts 4, 5). Forecasts of the declining phase of the BA.2 epidemic exhibited varied performance. The first forecast in mid-April (forecast 6) underpredicted ward occupancy, though not ICU occupancy. This was followed by two forecasts (forecasts 7, 8) which captured ward occupancy better than forecast 6, although forecast 8 predicted ICU occupancy with insufficient uncertainty (i.e. overconfidence), with observations falling outside of the 80% interval for the majority of time points across the forecast horizon. The subsequent forecast produced in early May (forecast 9) incorrectly predicted that ward and ICU occupancy counts would resurge rather than continue to very slowly decline.
New South Wales forecasts produced during the inter-epidemic period between the BA.2 and BA.4/5 waves in late May and early June consistently underpredicted ward occupancy and marginally underpredicted ICU occupancy (forecasts 11–14). We continued to predict declines in occupancy, with the early growth phase of the BA.4/5 wave not captured in our predictions until late June (forecast 15), almost a month after occupancy had begun to stabilise and then slowly increase. Similar to the BA.2 peak, early forecasts captured the magnitude of the BA.4/5 peak in ward occupancy in mid-July, with observations across days 15–21 of the forecast horizon lying within the 40% interval and the 80% interval for forecasts 17 and 18, respectively. However, these forecasts failed to predict the timing of the peak, instead predicting that ward occupancy would continue to increase into August. Our forecasts only correctly predicted reductions in the occupancy counts once counts had already begun to stabilise in late July (forecasts 19–21), though these still marginally overpredicted ward occupancy counts.
Further plots for the visual assessment of forecast performance for all other jurisdictions are available in the supplementary materials (Supplementary Figs. 9–24).
Quantitative performance
Measured forecast performance varied over the duration of the study period and across Australia’s eight states and territories (Fig. 4a–d). Measuring performance aggregated by forecast horizon (Fig. 4a) shows that the performance of the ward occupancy forecasts generally degraded the further into the future predictions were made (such a reduction in forecasting performance as the time horizon increases is common to many domains18). Ward occupancy performance for the Northern Territory was particularly unstable across all days of the horizon (Fig. 4a). The drop in forecast performance as forecast horizon increased was less apparent for the ICU forecasts (Fig. 4c), likely reflecting the smaller scale of variation in the ICU time series, where the effect of changes in epidemic activity was less visible.
ACT is the Australian Capital Territory, NSW is New South Wales, NT is the Northern Territory, QLD is Queensland, SA is South Australia, TAS is Tasmania, VIC is Victoria, and WA is Western Australia. a, c Forecast performance for ward (top) and ICU (bottom) by forecast horizon. Median performance (in white) and intervals for 50%, 75%, 90%, and 95% density (in purple or green) are displayed. Note the differing x-axis scale across the ward and ICU forecast plots. b, d Summary forecast performance for ward (top) and ICU (bottom) across all forecasting dates. Frequency for each state is displayed as a histogram (in black), and density underneath (in purple or green), with median overlaid (white and black points). States have been ordered according to median forecast performance. Note the differing x-axis scale across the ward and ICU forecast plots. Due to the limited x-axis scales, 14 points are omitted from the ward histogram for the Northern Territory and one point from the ICU histogram for Tasmania.
Median ward occupancy forecast performance averaged across all horizons was best in New South Wales (Fig. 4b), possibly reflecting our use of hospital length of stay estimates derived from New South Wales data rather than local estimates (as the requisite data for other states and territories were not accessible or did not exist). ICU occupancy forecast performance was best in Victoria, followed by New South Wales. The states and territories with smaller populations (Tasmania, the Australian Capital Territory and the Northern Territory) tended to have worse performance for both ward and ICU occupancy forecasts, possibly due to a greater impact of individual-level variation in length of stay where admission counts were low (Supplementary Figs. 9, 10, 13, 14, 19, 20). Although South Australia had a (marginally) worse median ward occupancy forecast CRPS than New South Wales (Fig. 4b), its performance across the 15–21 day horizon (Fig. 4a) was more consistent.
Examining changes in performance of the ward occupancy forecast over the duration of the study period (Fig. 5a–h), we note associations between forecast performance and the epidemiological context, with ward occupancy forecasts often biased downwards during pre-epidemic peak phases, and biased upwards during the post-epidemic peak phases. Results for ICU occupancy forecast performance over time (Supplementary Fig. 4) show similar trends, though here variation in length of stay at the individual-scale likely has a greater influence on performance, given the low (<50) counts for occupancy across most jurisdictions over the study period.
a–h The forecast performance over time across the eight states and territories of Australia. Light blue shading indicates alternating forecast weeks. The true ward occupancy count is displayed at the top of each panel, with vertical lines indicating dates of visually distinct peaks and troughs (dotted and dashed lines, respectively) in the time series. The CRPS and bias of the forecast are displayed below, reflecting the performance of forecasted counts for that date within the 15–21 day forecast horizon. Upwards bias is displayed in magenta and downwards bias in blue. The CRPS is calculated over log-transformed counts. Optimal forecasting performance is achieved where these values are nearest to zero.
We examined how the performance of the ensemble case forecast used as input to our model affected the performance of our ward and ICU forecasts. Averaged across the horizon of each of the forecasts, the mean ward forecast CRPS tended to be lower than that of the corresponding case forecast (Fig. 6a–h). This is expected given that the case forecast is a forecast of incidence, whereas our forecasts are of occupancy (i.e. prevalence), and as such exhibit greater autocorrelation and hence predictability. Comparing the ICU forecast performance to that of the case forecast (Supplementary Fig. 5) yields broadly similar results, although ICU performance in the Australian Capital Territory notably underperforms in comparison to the case forecasts. Bias in the case forecasts tended to be reflected in the ward occupancy forecasts (Supplementary Fig. 6), although this effect is less clear in jurisdictions with smaller populations, such as the Northern Territory (which has a population of approximately 250,000, compared to 8.1 million for New South Wales or 6.5 million for Victoria).
a–h Comparison of forecast performance between the case forecasts and the ward occupancy forecasts. Performance is measured using CRPS over log-transformed counts. Each dot represents performance measured over a 28-day case incidence ensemble forecast and performance measured over a corresponding 21-day occupancy forecast.
In our probability integral transform plots (Supplementary Fig. 3), we observe that forecast calibration varies from good (with the transformed distribution approximately uniform) to poor (with the transformed distribution far from uniform) between states and across the ward and ICU forecasts. Calibration was best for the ward forecasts in South Australia and best for the ICU forecasts in New South Wales. A few forecasts were overconfident, with Northern Territory, Queensland, and Tasmanian ward forecasts and Queensland ICU forecasts having a substantial proportion of observations occurring in the bottom- or top-most intervals. A similar pattern can be observed for the New South Wales ward occupancy forecasts, with a large proportion of observations falling in the top-most interval; this was likely a consequence of a string of underpredicting forecasts from late May through to early July (Fig. 3a, b, forecasts 11–14). The ICU forecasts for South Australia and Victoria had excessive levels of uncertainty, with few observations falling in the outer intervals.
We inspected the relative performance of our model compared to a naive random walk forecasting model (Supplementary Fig. 7). We find that we outperformed the naive model for most states and territories for both ward and ICU occupancy across the forecasting horizon. However, across the 2–3 week horizon, the naive model outperforms our forecasts for ward occupancy in New South Wales, the Australian Capital Territory, and Western Australia, and for ICU occupancy in Western Australia. These results suggest that there may be differences in the inherent difficulty of forecasting occupancy between the different states and territories. For predictions at a very short time horizon (less than 3 days ahead), our model was consistently worse on average than the naive model, likely reflective of our fitting procedure, where we allowed for trajectories to have some degree of error around the recent observed counts. We observe substantial heterogeneity in skill scores between forecasts across the study period (Supplementary Fig. 8). Two forecasts in New South Wales and three in Western Australia had skill scores below −3; these poorly performing forecasts likely had a substantial influence on our overall ward forecasting skill scores for these states. Notably, forecasts for ICU occupancy in Victoria outperformed the naive forecasting model across all forecasts produced.
Discussion
We have presented a model for forecasting the number of patients with COVID-19 in ward and ICU beds. The model simulates the progression of patients through a compartmental model of hospital pathways, with simulations informed by near-real-time epidemiological data and fit to reported bed occupancy counts using Approximate Bayesian Computation. We have evaluated the performance of our forecasting methodology as it was applied and reported to public health decision-makers in the Australian context between March and September 2022 (although forecast outputs were produced between December 2021 and March 2022, we do not consider them in this study as the model received intensive development throughout that period). Our use of an independently produced case forecast as input to the clinical model has allowed us to take advantage of diverse case forecasting methodologies, and we have shown how the performance of our clinical forecasts can be evaluated in terms of the input case forecast performance.
Our results show that forecasting performance was variable over the study period and dependent upon the epidemiological context. The 15–21 day performance of the ward forecasts was poorest across most jurisdictions during the transition from Omicron BA.2 dominance to Omicron BA.4/5 dominance between May and July 2022 (Fig. 5a–h). This reduced performance can be observed in New South Wales from late May until early June (Fig. 3a, forecasts 11–14); by late June (forecast 15), a BA.4/5 transmission advantage was included in the mechanistic case forecasting models14, increasing median predicted occupancy counts but also the uncertainty across these predictions. Forecasting the burden of infectious disease during such variant transition events has previously been noted to be challenging, primarily due to the difficulty in rapidly ascertaining any differences in the biological properties of a new variant and incorporating these into models37,38. These differences could include a change in the virulence of the pathogen, leading to changes in length of stay, probability of hospital admission, or probability of ICU admission. However, in the absence of evidence for a difference in virulence between the Omicron BA.2 and BA.4/5 variants39, it is most likely that improving our clinical forecasts during this period would have required adjustments to the underlying case incidence forecasts.
Accurate prediction near epidemic peaks has previously been recognised to be a particularly difficult problem, both in the context of case incidence forecasts40,41,42 and hospital burden forecasts43,44. In our results, forecasting performance around epidemic peaks varied. Prior to peaks (in the epidemic growth phase), our forecasts generally performed well, although they tended to be biased downwards (Fig. 5a–h). Examining forecasts with start dates in the weeks prior to epidemic peaks (Supplementary Figs. 8 and 9–24), we see that occupancy count at the peak was generally well captured by forecasts produced one or two weeks prior to the point of peak occupancy. Forecasts that were produced three weeks prior to the peak performed comparatively worse, with most predicting that occupancy would continue to grow beyond what eventuated to be the peak. However, at this three-week horizon, forecasts typically had wide credible intervals, which appropriately conveyed the uncertainty of our predictions. During these peak periods, the performance of the forecasts was likely strongly influenced by the underlying case forecast performance. However, it is also possible that proactive changes in clinical practice could have led to reductions in length of stay or hospital admission rates around peak periods5. Such reductions in these key epidemiological parameters could only be captured in our model once they were realised in the data used for estimation. If reductions in length of stay or hospital admission rates occurred across the forecasting horizon, our forecasts would over-predict occupancy.
Previously published forecasting models for COVID-19 clinical burden can be broadly categorised into two groups: statistical models and mechanistic models. Statistical models produce predictions of clinical burden by learning patterns in the observed data, and may take as input only the target time series data45,46,47,48, or may consider regression against other observations such as mobility or historical case incidence47,48,49,50. In contrast, mechanistic forecasting models of clinical burden consider the flow of individuals through an explicitly described model of disease progression and clinical care pathways. Such mechanistic models may capture the entry of individuals into the healthcare system through an embedded model of infection dynamics23,51,52,53,54 or through statistical predictions of the entry process50,55,56. Statistical models may perform as well (or better) than mechanistic models in some situations; however, one notable advantage of the mechanistic modelling approach is in allowing for the effect of changes in epidemiological parameters to be predicted and explained (e.g. a reduction in patient length of stay)57.
Our work is distinguished from similar mechanistic clinical burden forecasting models through its use of an independently produced forecast of case incidence as input. This decoupling of the clinical progression model from the case forecasting models allows for greater separation of concerns since the development of each model can occur independently58. A potential disadvantage of this modular approach is that clinical observations cannot be used to inform the underlying forecasts of infection dynamics, as each model is fit to data separately. Figure 6a–h demonstrate that the quality of our occupancy forecasts depends upon the performance of the input case incidence forecasts (a similar result has been previously reported for a model of hospital admissions47), implying that our use of an ensemble case forecast as input has been advantageous for the performance of our occupancy forecasts, given ensembles have repeatedly been shown to improve case forecasting performance43,47,48,59,60.
Our clinical forecasting model is designed to receive outputs from forecasts of case incidence as a (large) sample of trajectories. However, it has been more common for forecast outputs to be summarised using prediction intervals, which quantify the probability of outcomes falling within certain ranges. Examples of this have included the collaborative ensemble forecasts reported by the US and European COVID-19 forecast hubs43,61. These prediction intervals are incompatible with our methodology as they obscure the underlying autocorrelation in the case incidence time series—if we were to sample from such intervals across each day of the forecast, uncertainty in the cumulative case count would be underestimated. We recommend that collaborative ensemble forecasts of infectious disease report outputs as trajectories where possible, so as to enable the appropriate propagation of uncertainty in further applications (such as that presented here).
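A toy R example illustrates the point: breaking the autocorrelation between days (as sampling independently from daily marginal intervals would) leaves the daily marginal distributions unchanged but substantially understates uncertainty in the cumulative count.

```r
# Trajectories with a shared per-trajectory level versus independent draws
# from each day's marginal distribution. Shuffling within days preserves
# every daily marginal but breaks autocorrelation.
set.seed(1)
n_traj <- 2000; n_days <- 21
level <- rgamma(n_traj, shape = 25, scale = 4)   # per-trajectory level (mean 100)
traj <- matrix(rpois(n_traj * n_days, lambda = rep(level, n_days)),
               nrow = n_traj)                    # autocorrelated trajectories
marginal <- apply(traj, 2, sample)               # shuffle within each day
sd(rowSums(traj))      # spread of cumulative counts using trajectories: large
sd(rowSums(marginal))  # spread after breaking autocorrelation: much smaller
```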
Infectious disease forecasting models often exhibit reduced performance when predicting in low count contexts45,51. In our work, we produced forecasts across low counts of both ward and ICU occupancy, typically during inter-epidemic periods and in jurisdictions with smaller population sizes. The performance of our forecasts as measured through CRPS was worse in these contexts (Fig. 4a–d). However, this is in large part due to the CRPS being calculated over log-transformed counts, effectively making it a measure of relative error. This would be expected to penalise forecasts produced in low count contexts, where small absolute changes can produce large relative differences19. Supplementary Fig. 7 provides further evidence for this, with some jurisdictions with smaller population sizes, such as Tasmania and the Northern Territory, performing particularly well when compared to a naive forecasting model. We also note that the performance of our occupancy forecasts across these low-count contexts may be of lesser importance to public health decision-makers, given they are typically (by definition) distant from capacity constraints.
Since the clinical forecasting model is informed by near-real-time estimates of key quantities such as probability of hospitalisation and length of stay, reasonable forecast performance could be expected in the absence of the approximate Bayesian fitting step. While this occasionally proved to be true in application (e.g. see Supplementary Methods, where the model output without fitting captures the reported ward occupancy counts for the Northern Territory), a few factors may have prevented this from being generally the case: firstly, we used patient length of stay distributions which were fit to data from the state of New South Wales and these distributions may not reflect the clinical practice or realised severity in other jurisdictions; secondly, the compartmental model we used may miss some components of hospital occupancy dynamics, such as outbreaks of COVID-19 within hospitals; thirdly, we assumed that the population which was reported as hospitalised in the case data was the same population as that reported in the hospital occupancy figures, which was not always the case due to differing upstream datasets (e.g. Victoria collected occupancy counts as a separate census of patients62); finally, our near-real-time estimates of ward and ICU admission probability were not adjusted for possible right-truncation due to reporting lags as the date of data entry was not available within the case dataset we had access to.
Our forecasting methodology did not (explicitly) include the effect of vaccination upon the clinical trajectory of a COVID-19 case, as linked case vaccination data were generally unavailable or incomplete during our study period. These data would be of substantial value, allowing for our key epidemiological quantities (i.e. the probability of hospitalisation, probability of ICU admission and length of stay) to be produced with stratification by vaccination status, and potentially enabling the production of more accurate forecasts. Data on infections were also limited, with no large-scale infection survey performed in Australia during our study period63 (although a number of sero-surveys were produced10,64,65,66). Such infection data would allow us to produce estimates of the infection hospitalisation risk, unbiased by changes in the case ascertainment rate.
The measure of hospital burden we chose to forecast—hospital occupancy—has an advantage over incidence measures such as daily hospital admissions since it directly relates to the capacity of the healthcare system. However, it has a few disadvantages of note. Because hospital occupancy is a prevalence measure, it is inherently slower to respond to changes in the epidemic situation than hospital admissions and is, therefore, less useful as an indicator of changes in epidemic activity. It may also be more difficult to measure at the hospital level, given that it requires either accurate accounting of admissions and discharges or recording of individual patient stays. Ideally, both admissions and occupancy would be monitored and reported; in such a context, our model could be easily extended to fit to and report admission counts, given that admission counts are already recorded within our simulations.
Throughout the period for which COVID-19 bed occupancy counts were collected and reported in Australia, no nationally consistent standard specified which COVID-19 cases should be included in the counts. As a result, distinct definitions were created and applied across jurisdictions. For example, during our study period, the state of New South Wales counted any patient in hospital who had been diagnosed with COVID-19 either during their hospital stay or within the 14 days prior to their admission to hospital67. This broad definition had the beneficial effect of reducing false negatives in the counting process but resulted in the inclusion of a large number of individuals who had since recovered from infection and/or whose stay was unrelated to the disease (with this effect then being captured in the estimates of length of stay used in our study). This was in contrast to Victoria, where COVID-19 cases were counted only until a negative test result was received62, reducing false positive inclusions but underestimating the total hospital burden of the disease, given COVID-19 cases may still require hospital care or be isolated for infection control reasons even when they no longer test positive. Although these differences would not be expected to substantially affect the forecast performance given our fitting methodology (which is able to adjust the probability of hospital admission and patient length of stay to account for such biases), the development of standard definitions that could be applied in future epidemics would allow for direct comparison of counts between jurisdictions and simplify modelling efforts.
The modelling framework we have described here is flexible and not inherently tied to COVID-19 hospital occupancy as the forecasting target. In general terms, our method stochastically simulates the convolution of a time series of case incidence into time series of subsequent outcomes. As such, the methodology could be applied to other viral respiratory pathogens that lead to substantial hospital burden, including respiratory syncytial virus (RSV) or the influenza viruses, both of which are currently the focus of international forecasting efforts68,69. Further, our framework could be used to model other infectious disease outcomes, such as absenteeism from the workforce or long-term sequelae. The efficient simulation and inference methodology we present allows for forecasts to be produced within a short turnaround time, with forecasting across the eight states and territories of Australia taking less than one hour on an eight-core virtual machine (AMD EPYC 7702), where approximately one quarter of this time was dedicated to the pre- and post-processing of data and results. Our approach is highly amenable to parallelisation (across both individual simulations and the regions we choose to forecast) and as such would be expected to be suitable in applications where there is a greater number of target regions or outcomes.
We have presented a robust approach for forecasting COVID-19 hospital ward and ICU bed occupancy and have examined the performance of this methodology as applied in the Australian context between March and September 2022. Our forecasting model takes as input an independently produced forecast of daily case incidence. This incidence is then transformed into ward and ICU occupancy counts through a stochastic compartmental model, with the probabilities of hospitalisation and of ICU admission informed by near-real-time data. Our use of independently produced forecasts of case incidence has allowed us to both develop our model independently of the input case forecasting models and take advantage of the performance benefits provided by ensemble case forecasts. Our computationally efficient inference method allowed us to generate forecasts for multiple Australian jurisdictions in near-real-time, enabling the rapid provision of evidence to public health decision-makers.
Data availability
Limited data for reproducing the figures presented in this manuscript are available at OSF (http://osf.io/5e6ma/, DOI: 10.17605/OSF.IO/5E6MA70); this includes all model output forecast trajectories, reported occupancy counts retrieved from covid19data.com.au13, case forecast performance metrics, and Approximate Bayesian Computation diagnostic plots produced in the course of producing occupancy forecasts. The complete line-listed case dataset is not publicly available; for access to the raw data, a request must be submitted to the Australian Government Department of Health and Aged Care, which will be assessed by a data committee independent of the authorship group.
Code availability
All code is available archived at OSF (http://osf.io/5e6ma/, DOI: 10.17605/OSF.IO/5E6MA70). All R code was run using R version 4.1 or greater. Changes to the model which occurred throughout the study period (which were limited to jurisdiction-specific modifications to \({\sigma }_{\,{\mbox{hosp}}\,}^{2}\) and \({\sigma }_{\,{\mbox{los}}\,}^{2}\) and a correction for New South Wales case data not including cases detected via rapid antigen test) are described in the Supplementary Methods.
References
Kadri, S. S. et al. Association between caseload surge and COVID-19 survival in 558 U.S. hospitals, March to August 2020. Ann. Intern. Med. 174, 1240–1251 (2021).
Fong, K. J., Summers, C. & Cook, T. M. NHS hospital capacity during COVID-19: overstretched staff, space, systems, and stuff. BMJ 385, e075613 (2024).
Dale, C. R. et al. Surge effects and survival to hospital discharge in critical care patients with COVID-19 during the early pandemic: a cohort study. Crit. Care 25, 70 (2021).
Warrillow, S. et al. ANZICS guiding principles for complex decision making during the COVID-19 pandemic. Crit. Care Resusc. 22, 98–102 (2020).
Varney, J., Bean, N. & Mackay, M. The self-regulating nature of occupancy in ICUs: stochastic homoeostasis. Health Care Manag. Sci. 22, 615–634 (2019).
Maslo, C. et al. Characteristics and outcomes of hospitalized patients in South Africa during the COVID-19 omicron wave compared with previous waves. JAMA 327, 583–584 (2022).
Nyberg, T. et al. Comparative analysis of the risks of hospitalisation and death associated with SARS-CoV-2 omicron (B.1.1.529) and delta (B.1.617.2) variants in England: a cohort study. Lancet 399, 1303–1312 (2022).
Department of Health and Aged Care. Australia’s COVID-19 Vaccine Rollout (Technical report, 2022).
Shearer, F. M. et al. Estimating the impact of test-trace-isolate-quarantine systems on SARS-CoV-2 transmission in Australia. Epidemics 47, 100764 (2024).
Machalek, D. et al. Seroprevalence of SARS-CoV-2-specific antibodies among Australian blood donors, February–March 2022 (Technical report, Australian COVID-19 Serosurveillance Network, 2022).
COVID-19 National Incident Room Surveillance Team. COVID-19 Australia: Epidemiology Report 62 Reporting period ending 5 June 2022. Commun. Dis. Intell. 46, 39 (2022).
COVID-19 Epidemiology and Surveillance Team. COVID-19 Australia: Epidemiology Report 67 Reporting period ending 23 October 2022. Commun. Dis. Intell. 46, 80 (2022).
O’Brien, J. et al. covid19data.com.au. https://www.covid19data.com.au/.
Shearer, F. et al. Series of weekly COVID-19 epidemic situational assessment reports submitted to the Australian Government Department of Health Office of Health Protection from April 2020 to December 2023. Technical report (2024).
Tobin, R. J. et al. Real-time analysis of hospital length of stay in a mixed SARS-CoV-2 Omicron and Delta epidemic in New South Wales, Australia. BMC Infect. Dis. 23, 28 (2023).
Sunnåker, M. et al. Approximate Bayesian computation. PLoS Comput. Biol. 9, e1002803 (2013).
Communicable Diseases Network Australia. Australian National Disease Surveillance Plan for COVID-19 (Technical report, Australian Government Department of Health, 2022).
Hyndman, R. J. & Athanasopoulos, G. Forecasting: Principles and Practice 3rd edn (OTexts, 2021).
Bosse, N. I. et al. Scoring epidemiological forecasts on transformed scales. PLoS Comput. Biol. 19, e1011393 (2023).
Funk, S. et al. Assessing the performance of real-time epidemic forecasts: a case study of Ebola in the Western Area region of Sierra Leone, 2014-15. PLoS Comput. Biol. 15, e1006785 (2019).
Gneiting, T., Balabdaoui, F. & Raftery, A. E. Probabilistic forecasts, calibration and sharpness. J. R. Stat. Soc. Series B Stat. Methodol. 69, 243–268 (2007).
Moss, R. et al. Coronavirus disease model to inform transmission-reducing measures and health system preparedness, Australia. Emerg. Infect. Dis. 26, 2844–2853 (2020).
Price, D. J. et al. Early analysis of the Australian COVID-19 epidemic. Elife 9, e58785 (2020).
Conway, E. et al. COVID-19 vaccine coverage targets to inform reopening plans in a low incidence setting. Proc. Royal Soc. B 290, 20231437 (2023).
Cheng, A. C. et al. Influenza epidemiology in patients admitted to sentinel Australian hospitals in 2019: the Influenza Complications Alert Network (FluCAN). Commun. Dis. Intell. 46, 14 (2022).
Golding, N. et al. A modelling approach to estimate the transmissibility of SARS-CoV-2 during periods of high, low, and zero case incidence. eLife 12, e78089 (2023).
Moss, R. et al. Forecasting COVID-19 activity in Australia to support pandemic response: May to October 2020. Sci. Rep. 13, 8763 (2023).
Golding, N. et al. Situational assessment of COVID-19 in Australia—Technical Report 22 May 2022 (Technical report, August 2022).
Walther, B. A. & Moore, J. L. The concepts of bias, precision and accuracy, and their use in testing the performance of species richness estimators, with a literature review of estimator performance. Ecography 28, 815–829 (2005).
R Core Team. R: A Language and Environment for Statistical Computing (R Foundation for Statistical Computing, 2023).
Landau, W. M. The targets R package: a dynamic make-like function-oriented pipeline toolkit for reproducibility and high-performance computing. J. Open Source Softw. 6, 2959 (2021).
Wickham, H. et al. Welcome to the tidyverse. J. Open Source Softw. 4, 1686 (2019).
Borchers, H. W. pracma: Practical Numerical Math Functions, R package (CRAN, 2023).
O’Hara-Wild, M., Hyndman, R. J. & Wang, E. fabletools: Core Tools for Packages in the ‘fable’ Framework, R package version 0.3.4 (2023).
Wang, E., Cook, D. & Hyndman, R. J. A new tidy data structure to support exploration and modeling of temporal data. J. Comput. Graph. Stat. 29, 466–478 (2020).
O’Hara-Wild, M., Kay, M. & Hayes, A. distributional: Vectorised Probability Distributions, R package version 0.3.2 (2023).
Park, S. W. et al. The importance of the generation interval in investigating dynamics and control of new SARS-CoV-2 variants. J. R. Soc. Interface 19, 20220173 (2022).
Keeling, M. J. & Dyson, L. A retrospective assessment of forecasting the peak of the SARS-CoV-2 Omicron BA.1 wave in England. PLoS Comput. Biol. 20, e1012452 (2024).
Wolter, N. et al. Clinical severity of SARS-CoV-2 Omicron BA.4 and BA.5 lineages compared to BA.1 and Delta in South Africa. Nat. Commun. 13, 5860 (2022).
Castro, M., Ares, S., Cuesta, J. A. & Manrubia, S. The turning point and end of an expanding epidemic cannot be precisely forecast. Proc. Natl. Acad. Sci. USA. 117, 26190–26196 (2020).
Reich, N. G., Tibshirani, R. J., Ray, E. L. & Rosenfeld, R. On the Predictability of COVID-19 (International Institute of Forecasters, accessed 23 February 2024); https://forecasters.org/blog/2021/09/28/on-the-predictability-of-covid-19/.
Bracher, J. et al. A pre-registered short-term forecasting study of COVID-19 in Germany and Poland during the second wave. Nat. Commun. 12, 5173 (2021).
Sherratt, K. et al. Predictive performance of multi-model ensemble forecasts of COVID-19 across European nations. Elife 12, e81916 (2023).
Manley, H. et al. Combining models to generate consensus medium-term projections of hospital admissions, occupancy and deaths relating to COVID-19 in England. R. Soc. Open Sci. 11, 231832 (2024).
Panaggio, M. J. et al. Gecko: a time-series model for COVID-19 hospital admission forecasting. Epidemics 39, 100580 (2022).
Olshen, A. B. et al. COVIDNearTerm: a simple method to forecast COVID-19 hospitalizations. J. Clin. Transl. Sci. 6, e59 (2022).
Meakin, S. et al. Comparative assessment of methods for short-term forecasts of COVID-19 hospital admissions in England at the local level. BMC Med. 20, 86 (2022).
Paireau, J. et al. An ensemble model based on early predictors to forecast COVID-19 health care demand in France. Proc. Natl. Acad. Sci. USA. 119, e2103302119 (2022).
Klein, B. et al. Forecasting hospital-level COVID-19 admissions using real-time mobility data. Commun. Med. 3, 25 (2023).
Goic, M., Bozanic-Leal, M. S., Badal, M. & Basso, L. J. COVID-19: Short-term forecast of ICU beds in times of crisis. PLoS ONE 16, e0245272 (2021).
Overton, C. E. et al. EpiBeds: Data informed modelling of the COVID-19 hospital burden in England. PLoS Comput. Biol. 18, e1010406 (2022).
Grodd, M. et al. Retrospektive Evaluation eines Prognosemodells für die Bettenbelegung durch COVID-19-Patientinnen und -Patienten auf deutschen Intensivstationen [Retrospective evaluation of a forecasting model for bed occupancy by COVID-19 patients in German intensive care units] (June 2023).
Moghadas, S. M. et al. Projecting hospital utilization during the COVID-19 outbreaks in the United States. Proc. Natl. Acad. Sci. USA. 117, 9122–9126 (2020).
Garcia-Vicuña, D., Esparza, L. & Mallor, F. Hospital preparedness during epidemics using simulation: the case of COVID-19. Cent. Eur. J. Oper. Res. 30, 213–249 (2022).
Heins, J., Schoenfelder, J., Heider, S., Heller, A. R. & Brunner, J. O. A scalable forecasting framework to predict COVID-19 hospital bed occupancy. INFORMS J. Appl. Anal. 52, 508–523 (2022).
Deschepper, M. et al. Prediction of hospital bed capacity during the COVID-19 pandemic. BMC Health Serv. Res. 21, 468 (2021).
Funk, S. & King, A. A. Choices and trade-offs in inference with infectious disease models. Epidemics 30, 100383 (2019).
Laplante, P. A. What Every Engineer Should Know About Software Engineering (CRC Press, 2007).
Pinson, P. Comparing Ensemble Approaches For Short-term Probabilistic COVID-19 Forecasts in the U.S.(International Institute of Forecasters, accessed 23 July 2023); https://forecasters.org/blog/2020/10/28/comparing-ensemble-approaches-for-short-term-probabilistic-covid-19-forecasts-in-the-u-s/.
Cramer, E. Y. et al. Evaluation of individual and ensemble probabilistic forecasts of COVID-19 mortality in the United States. Proc. Natl. Acad. Sci. USA. 119, e2113561119 (2022).
Cramer, E. Y. et al. The United States COVID-19 Forecast Hub dataset. Sci. Data 9, 462 (2022).
Victorian Department of Health and Victorian Agency for Health Information. COVID-19 Daily Capacity and Occupancy Register (Technical report, October 2021).
Shearer, F. M. et al. Opportunities to strengthen respiratory virus surveillance systems in Australia: lessons learned from the COVID-19 response. Commun. Dis. Intell. 48, 47 (2024).
Machalek, D. et al. Seroprevalence of SARS-CoV-2-specific antibodies among Australian blood donors: Round 2 update (Technical report, Australian COVID-19 Serosurveillance Network, July 2022).
Machalek, D. et al. Seroprevalence of SARS-CoV-2-specific antibodies among Australian blood donors: Round 3 update (Technical report, Australian COVID-19 Serosurveillance Network, November 2022).
Koirala, A. et al. The seroprevalence of SARS-CoV-2-specific antibodies in Australian children: a cross-sectional study. PLoS ONE 19, e0300555 (2024).
Health Protection New South Wales. New South Wales COVID-19 weekly data overview, epidemiological week 9 (Technical report, March 2022).
Mathis, S. M. et al. Evaluation of FluSight influenza forecasting in the 2021-22 and 2022-23 seasons with a new target laboratory-confirmed influenza hospitalizations. Nat. Commun. 15, 6289 (2024).
Infectious Disease Dynamics Group at Johns Hopkins University (US RSV Forecast Hub, accessed 22 January 2025); https://rsvforecasthub.org/.
Tobin, R. J. et al. A Modular Approach to Forecasting COVID-19 Hospital Bed Occupancy: Supplementary Data (OSF, November 2023); https://doi.org/10.17605/OSF.IO/5E6MA.
Acknowledgements
Our forecasts used surveillance data reported by the interim Australian CDC (previously the Office of Health Protection), Department of Health and Aged Care, on behalf of the Communicable Diseases Network Australia (CDNA) as part of the nationally coordinated response to COVID-19. We additionally used surveillance data provided by the New South Wales Department of Health. We thank public health staff in state and territory health departments, in the Australian Government Department of Health and Aged Care, and in state and territory public health laboratories. We thank members of CDNA for their feedback and perspectives on the results of the analyses. This work was directly funded by the Australian Government Department of Health and Aged Care. Additional support was provided by the National Health and Medical Research Council of Australia through its Investigator Grant Schemes (FMS Emerging Leader Fellowship, 2021/GNT2010051).
Author information
Authors and Affiliations
Contributions
R.J.T. and C.R.W. developed the agent-based forecasting methodology. Occupancy forecast outputs were produced by R.J.T. as part of weekly situational assessment reporting. R.M. contributed to the methodology for forecast performance evaluation and provided the performance evaluation of the case incidence forecasts. J.M.M., D.J.P., and F.M.S. provided supervision and funding and contributed to conceptualisation. The manuscript was drafted by R.J.T., D.J.P., and F.M.S., and all authors contributed to proofreading and editing.
Corresponding authors
Ethics declarations
Competing interests
The authors declare the following competing interests: this study was funded by the Australian Government Department of Health and Aged Care. The forecasts of hospital bed occupancy contained in this manuscript were reported on a weekly basis to the Department of Health and Aged Care (Office of Health Protection, now interim Australian CDC) as part of the nationally coordinated response to COVID-19.
Peer review
Peer review information
Communications Medicine thanks Marlon Grodd and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Tobin, R.J., Walker, C.R., Moss, R. et al. A modular approach to forecasting COVID-19 hospital bed occupancy. Commun Med 5, 349 (2025). https://doi.org/10.1038/s43856-025-01086-0
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s43856-025-01086-0