Abstract
Several studies found an association between COVID-19 incidence, cumulated over the first pandemic wave, and the risk of death for infected individuals. They attributed this association to hospital overload. We studied this association across the French departments using 82,467 serological samples and a hierarchical Bayesian model with spatial smoothing. In high-incidence areas, we hypothesized that hospital overload would increase infection fatality rate (IFR) without increasing infection hospitalization rate (IHR). The analyses were adjusted for intensive care beds per capita, age of the population, and diabetes prevalence (as a surrogate for obesity). We found that increasing departmental incidence from 3 to 9% rose IFR from 0.42 to 1.14% (difference 0.72%, 95% CI 0.49–1.01%), and IHR from 1.66 to 3.61% (difference 1.94%, 95% CI 1.18–2.80%). An increase in incidence from 6 to 12% in people under 60 was associated with an increased proportion of people over 60 among those infected, from 11.6 to 17.4% (difference 5.8%, 95% CI 2.9–8.8%). Higher incidence increased the risk of death for infected individuals and their risk of hospitalization by the same magnitude. These findings could be explained by a higher age among infected individuals in high-incidence areas, rather by than hospital overload.
Similar content being viewed by others
Introduction
The first wave of the COVID-19 pandemic led to episodes of hospital overload in many parts of the world, requiring the urgent provision of additional hospital beds to prevent excess mortality1,2. Temporary units were set up, but suspected of being less effective than permanent units (although these comparisons were impeded by confounders such as admission criteria)3,4,5. Indeed, hospital overload can manifest in several ways. First, it can appear as a lack of hospital bed availability, resulting in a lower proportion of infected individuals being hospitalized. Second, it can lead to a decrease in the quality of care due to the urgent addition of extra beds with potentially inadequate facilities, equipment, or staff. Hospital overload thus has multiple aspects, making it difficult to measure using a single indicator.
Outside any pandemic context, several studies have already demonstrated an association between hospital overload and mortality among admitted patients6,7. During the first wave of the COVID-19 pandemic, the consequences of these episodes of hospital overload on the risk of death for infected persons have mainly been studied through the number of confirmed COVID-19 cases, using the case fatality rate (CFR)8,9,10. However, CFR analysis is limited by spatial and temporal variations in the case detection rate (the proportion of new cases that are detected)11,12. In these studies, hospital overload was notably expressed as the ratio between the number of COVID-19 cases and baseline hospital resources, a measure that does not distinguish between situations of high incidence and those of insufficient hospital resources at baseline8.
To overcome the limitations of the cases-based approach, several studies used serological data and estimated infection fatality rates (IFRs)13,14,15,16. IFR is defined as the number of deaths attributed to COVID-19 during a given period divided by the number of infected individuals over the same period (incidence), and corresponds to the risk of death for infected persons. These studies compared IFRs across countries and age groups, but did not explore the impact of hospital overload.
Built on the serosurvey SAPRIS-SERO, the present work aimed to estimate the effect of COVID-19 incidence on IFR at the scale of the French departments during the first pandemic wave in the population over 20, and to explore the role of hospital overload in this relation. As incidence (understood as a cumulative incidence over the first wave) was reported to range from 3 to 9% in the French regions (administrative subdivisions gathering several departments), the consequences of an incidence shift from 3 to 9% on IFR were the main object of this study17,18. Evidence for hospital overload with shortage of hospital beds was sought by examining the effect of COVID-19 incidence on infection hospitalization rate (IHR), which is the proportion of infected individuals being hospitalized for COVID-19. The age of the infected individuals may also play a role in the effect of incidence on IFR. Indeed, it has been suggested that older persons could be under-represented in the population of infected people at the early stages of the epidemic19. To determine if infected individuals were actually younger in departments with low incidence, the dependence between COVID-19 incidence in people under 60 and the proportion of individuals over 60 among those infected was investigated.
Methods
SAPRIS-SERO study
This study used the data of the SAPRIS-SERO serosurvey, previously described18,20. SAPRIS-SERO was built on SAPRIS (“SAnté, Perception, pratiques, Relations et Inégalités Sociales en population générale pendant la crise COVID-19”), a cohort whose inclusions began in March 2020, which studied epidemiological and sociological aspects of the COVID-19 epidemic in France21. The adult participants of SAPRIS were recruited from three cohorts based on the general population (without particular selection on a disease):
-
The cohort NutriNet-Santé focused on nutrition, with online follow-up. It included 170,000 participants at the start of the study in 200922.
-
The cohort CONSTANCES was set up in 2012 and included 204,973 adults, selected to be a representative sample of the French adult population23.
-
E3N/E4N is a multi-generational adult cohort including 113,000 persons: the women recruited at the start of the study (1990), their children, and the fathers of these children24.
All participants from the initial cohorts who had regular internet access and were still being followed in 2020 were invited to participate in the SAPRIS study, which involved self-administered questionnaires during the first wave21. These questionnaires covered demographic information and the history of SARS-CoV-2 testing by RT-PCR. A total of 93,610 SAPRIS participants were over 20, completed the questionnaires, and resided in metropolitan France. These participants were then invited to join the SAPRIS-SERO study by collecting a single dried-blood spot sample themselves. The samples were sent to a virology laboratory (Unité des virus émergents, Marseille, France) for serological analysis using the commercial ELISA test (Euroimmun, Lübeck, Germany), which detects anti-SARS-CoV-2 IgG antibodies targeting the S1 domain of the spike protein. The ELISA assays performed on dried-blood spot samples demonstrated a sensitivity of 98.1 to 100% and a specificity of 99.3 to 100% when compared to conventional serum assays as a standard25,26.
External data
The results of two independent French seroprevalence studies were used as prior distributions for national seroprevalence (metropolitan France)17,27. The results of a diagnostic study concerning the ELISA test (IgG anti-S1 from Euroimmun) were used as prior distributions for sensitivity and specificity28.
The French population structure by age and by administrative department came from the census of January first, 2020 (Insee, Institut national de la statistique et des études économiques)29. The data about COVID-19-related hospitalizations during the first semester (before July first, 2020) by administrative department were obtained from the SI-VIC database, the exhaustive national inpatient surveillance system used during the pandemic30. The data about general population mortality (including deaths occurring in nursing homes) attributed to COVID-19 during the first semester (before July first, 2020) were obtained from the CépiDc (Centre d’épidémiologie sur les causes médicales de décès), online (open data) or directly31. Raw diabetes prevalence in French departments in 2019 (pre-pandemic) was provided by Santé Publique France (open data), based on an exhaustive monitoring of anti-diabetic drugs use (Système national des données de santé)32. The number of ICU (intensive care unit) beds per inhabitant in 2019 (pre-pandemic) was obtained from the DREES (Direction de la recherche, des études, de l’évaluation et des statistiques)33.
Identification of causal effects
The relation between incidence and IFR is confounded, as the determinants of IFR may share socio-economic causes with incidence at the scale of departments. Typically, wealthier departments could have a population which travels more (possibly increasing incidence) and which is healthier (decreasing IFR), participating in a spurious negative association between incidence and IFR. Thus, IFR was adjusted (by conditioning and averaging, as described in the Statistical model section) for the main determinants of COVID-19 outcome: prevalence of diabetes (as a surrogate for obesity), proportion of the population over 60, and number of intensive care beds per inhabitant8,9,34,35. Prevalence of diabetes was chosen as a surrogate for obesity because it is easier to quantify precisely (through data on the sale of diabetes medication).
Causal graph. The variables are considered at the departmental scale. The effect of X on Y can be estimated by adjusting on {Diab., Age, Beds}. X: COVID-19 incidence. Y: IFR (infection fatality rate) or IHR (infection hospitalization rate). Diab.: Prevalence of diabetes. Beds: Number of intensive care beds per inhabitant. Age: Proportion of population over 60. Dashed arrows represent the effects of unmeasured confounders C.
Figure 1 features a causal graph representing departmental incidence (X), IFR or IHR (Y), unobserved socio-economic variables (C), and the determinants of COVID-19 outcome (according to8,9,34,35). Our most critical assumptions, which are included in the graph, are:
-
Age and diabetes are the main individual risk factors for COVID-19 severity, influencing hospitalization and mortality.
-
The determinants of IFR and IHR at the departmental scale act through the prevalence of these individual risk factors, through incidence, or through the number of intensive care beds per inhabitant (the latter having a potentially decisive role for IFR but acting as a surrogate for hospital beds when considering IHR).
Given this causal graph (Fig. 1), the set {Diab., Age, Beds} satisfies the back-door criterion relative to (X, Y) and allows for estimating the causal effect of X on Y despite the presence of unobserved socio-economic variables, using the back-door adjustment formula (Eq. 1)36. Let \(P(\text {IFR}|do(\text {inc}=x))\) denote this causal effect, which is the distribution of departmental IFRs if incidence (inc) was artificially set to the value x for all the departments, without any other modification. With \(z_j\) denoting the vector of covariates for a department j, and because the distribution of these covariates in the 95 departments of metropolitan France is known (\(P(z_j) = \frac{1}{95}\) for all j),

The target of this study was the average causal effect of an incidence shift from 3 to 9%:
This average causal effect corresponds to the expected difference in IFR when artificially setting incidence to 9% versus 3% in a department of metropolitan France (without changing anything else than incidence). The same formula was used to identify the average causal effect of this incidence shift on IHR.
Statistical model
We used a Bayesian statistical framework to leverage multiple sources of data and to account for uncertainty surrounding the latent variables when used in regressions (such as incidence and IFR). An overview of the model is featured in Fig. 2, where the equations of the model are referenced next to the variables.
Overview of the model. The blue and red rectangles represent the exposure and outcome of the main analysis, respectively. The numbers indicate the equations associated with the variables (see the Model section). IFR: Infection fatality rate. IHR: Infection hospitalization rate.
In the remainder of this section, prior distributions are not always explicitly written. If so, the latter are uniform. Age groups are indexed by the letter i (\(i = 0\) for individuals aged 20 to 59, and \(i = 1\) for individuals aged over 60). The departments are indexed by the letter j, ranging from 1 to 95.
For an age group i and a department j, the participants without a positive RT-PCR nor missing data on department contributed to the estimation of seroprevalence \(s_{i,j}\), considering \(N_{i,j}\) the number of these participants, and \(y_{i,j}\) the number of positive serological tests:
Seroprevalence at the scale of metropolitan France (\(\text {sero}_{\text {France}}\)) was obtained by post-stratification from \(s_{i,j}\) and \(\text {pop}_{i,j}\), the size of the population corresponding to this group:
Seroprevalence estimates from other surveys were incorporated using beta distributions:
For an age group i and a department j, incidence (cumulated over the first semester) was denoted \(p_{i,j}\). Seroprevalence \(s_{i,j}\) was linked to incidence \(p_{i,j}\) and to the sensitivity (Se) and specificity (Sp) of the serological test:
Prior distributions for sensitivity and specificity originated from28:
The participants with a positive RT-PCR contributed to the likelihood of sensitivity. With \(N_{\text {se}}\) and \(y_{\text {se}}\) the number of total and positive (respectively) serological tests in this group,
Incidence \(p_{i,j}\) was modeled on the logit scale. As it was suggested that older persons could be under-represented in the population of infected persons at the early stages of the epidemic19, every department had a unique log-odds ratio for age over 60 (\(\beta _j\)), possibly influenced by its intercept \(\alpha _j\) (logit of incidence in the 20-59 in the department j) through a linear regression with intercept \(\mu _{\text {age}}\), slope \(b_{\text {age}}\) and standard deviation \(\sigma _{\text {age}}\) (hierarchical modeling, \(\beta _j\) is a random effect):

The \(\alpha _j\) departmental intercepts entailed spatial auto-correlation through an ICAR (intrinsic conditional auto-regressive) component \(\phi _j\)(as described and implemented in this reference37), associated with an overall intercept \(\mu _{\alpha }\) and a scale parameter \(\sigma _{\phi }\) representing the amount of spatial correlation:
The proportion \(\text {age}_{\text {infected,j}}\) of persons over 60 among those infected in a department j was reconstructed from \(p_{i,j}\) and from \(\text {age}_{\text {pop},j} = \frac{\text {pop}_{1,j}}{\text {pop}_{1,j} + \text {pop}_{0,j}}\) (the proportion of persons above 60 in the population of the department j):
For a given incidence in the persons under 60, we estimated an expected proportion of persons over 60 among those infected for a department with the same age structure as metropolitan France. This expected proportion was reconstructed from the coefficients \(\mu _{age}\), \(\beta _{age}\), and \(\sigma _{age}\) (Eq. 9), according to a procedure described in Supplementary Information 1 (“Computation of expectations”). This analysis aimed to illustrate the dependence between incidence in people under 60 and incidence in those over 60, and its possible consequences for IFR.
The K participants with a positive RT-PCR and no missing data concerning the department contributed directly to incidence. With \(\text {IS}_k\) being the infection status of the participant k (\(\text {IS}_k\) is always equal to 1 in this positive RT-PCR group),
Departmental incidence \(\text {inc}_j\) was obtained by post-stratification from \(p_{i,j}\) and \(\text {pop}_{i,j}\):
For a department j, the counts of deaths (\(\text {D}_j\)) and hospitalizations (\(\text {H}_j\)) of the first semester were modeled with Poisson regressions:

Departmental IFRs and IHRs were modeled as logistic functions of linear predictors \(g_\text {IFR}(j)\) and \(g_\text {IHR}(j)\), ranging respectively from 0 to 5% and from 0 to 10%:
These linear predictors included a departmental random intercept, a slope representing the role of incidence, and coefficients associated with the covariates (prevalence of diabetes, number of intensive care beds per inhabitant and proportion of the population over 60). With \(\text {diab}_j\), \(\text {age}_{\text {pop},j}\), and \(\text {beds}_j\) being the covariates for the department j,

The coefficients of Eq. (16) were used to compute \(E[\text {IFR}|do(\text {inc}=x)]\), and subsequently the average causal effect of incidence on IFR, as described in Supplementary Information 1 (“Reconstruction of expectations” section). The same procedure was used to compute the effect of incidence on IHR.
Algorithm and software
The data management was done using R version 4.3.1, and the modeling was performed with Stan (R package cmdstanr version 0.5.3), which implements Hamiltonian Monte Carlo (HMC)38,39. The models for the random coefficients (\(\beta _j\), \(\mu _{\text {IFR}_j}\) and \(\mu _{\text {IFR}_j}\)) employed non-centered parameterizations to improve HMC convergence40. The Monte Carlo sampling consisted of 8 chains of 2,000 iterations each (including 1,000 warm up iterations). Trace plots, \({\hat{R}}\) statistics and effective Monte Carlo sample sizes provided by Stan were used to assess convergence. Posterior predictive checks are provided in Supplementary Information 2. The model’s code (in Stan) is provided in Supplementary Code 1 and in a GitHub repository (https://github.com/bglemain/does-hospital-overload).
Ethical approval and consent to participate
Ethical approval and written or electronic informed consent were obtained from each participant before enrollment in the original cohort. The SAPRIS-SERO study was approved by the Sud-Mediterranée III ethics committee (approval 20.04.22.74247) and electronic informed consent was obtained from all participants for dried blood spot testing. The study was registered (#NCT04392388). All methods were performed in accordance with the relevant guidelines and regulations.
Results
Participants
The study included 82,467 persons with a serological test, living in metropolitan France, and over 20 years old. All serological samples were collected between May and November 2020 (the detail of this timing is provided as Supplementary Fig. 1). Among the participants, 319 reported a positive RT-PCR. These latter had a mean age of 52 years, 29% of them were males, and the median time elapsed between RT-PCR and dried blood sampling was 111 days (IQR: 68-128). The participants with a positive RT-PCR were considered infected. On the other hand, 82,148 participants reported no positive RT-PCR (no RT-PCR or a negative RT-PCR). These participants had a mean age of 58 years, and 35% of them were males. Comorbidity data were available for 77,547 participants (94.0%). Within the cohort, 5.9% of individuals reported having diabetes, 5.3% reported having cancer, and 3.6% reported having coronary heart disease. A detailed flowchart, a table with the number of samples for each department and age group, and the detail of the age of the participants are provided in Supplementary Fig. 2, Supplementary Table 1, and Supplementary Table 2 (respectively).
Departmental incidence, IFR, and IHR
The model estimated an incidence of 6.8% over the first wave in metropolitan France, with a 95% credible interval (95% CI) of 6.4 to 7.2%. Figure 3
COVID-19 departmental incidence (cumulated over the first wave) in metropolitan France.This map was created using the R package maps: https://cran.r-project.org/web/packages/maps/index.html..
displays a map of departmental incidence, showing that north-east France was the most affected area. Indeed, departmental incidence ranged from 2.5% (95% CI 1.8–3.4%), in Ariège (a south-west department), to 13.6% (95% CI 12.1–15.2%), in Haut-Rhin (a north-east department).
The overall infection fatality rate in metropolitan France was 0.94% (95% CI 0.89–0.99%) and infection hospitalization rate was 3.28% (95% CI 3.10–3.46%). IFR ranged from 0.27% (95% CI 0.21–0.33%), in Haute-Garonne (a south-west department with low COVID-19 incidence), to 1.97% (95% CI 1.74–2.22%), in Haut-Rhin (the department with the highest COVID-19 incidence, located in the north-east of France). IHR ranged from 1.02% (95% CI 0.72–1.42%), in Tarn-et-Garonne, to 5.92% (95% CI 5.09–6.79%), in Territoire de Belfort. Supplementary Tables 3 and 4 provide exhaustive departmental estimates (incidence, IFR, IHR) and characteristics (covariates, counts of deaths and hospitalizations), respectively.
Effect of incidence on IFR and IHR
Effect of departmental incidence on infection fatality rate (IFR) and on infection hospitalization rate (IHR). The points represent the mean posterior departmental estimates. Black line and gray zone: Posterior mean and 95% CI of the expected adjusted departmental IFR given incidence (or expected causal effect of incidence on IFR: see Eq. 1 of the Methods). The same description applies to IHR. The covariates are represented relative to their medians. Pop. over 60: Proportion of adult population over 60. ICU beds per 1,000 inhab.: Number of intensive care beds per 1,000 inhabitants.
Figure 4 illustrates the association between departmental incidence and adjusted IFR, or adjusted IHR (adjustment for the proportion of persons over 60 in the population, for the prevalence of diabetes, and for the number of intensive care beds per inhabitant). An incidence of 3% was associated with an adjusted IFR of 0.42% (95% CI 0.33–0.52%), and an incidence of 9% was associated with an adjusted IFR of 1.14% (95% CI 0.95–1.39%). The absolute difference (equivalent to the average causal effect of an incidence shift from 3 to 9%) was 0.72% (95% CI 0.49–1.01%).
An incidence of 3% was associated with an adjusted IHR of 1.66% (95% CI 1.30–2.06%), and an incidence of 9% was associated with an adjusted IHR of 3.61% (95% CI 3.05–4.28%). The absolute difference was 1.94% (95% CI 1.18–2.80%).
Complementary results (univariate analysis, role of the confounders) are provided in Supplementary Information 3. For IFR, the only covariate whose parameter had a 95% credible interval that did not include zero was diabetes, which was positively associated with IFR. For IHR, none of the covariates had a parameter whose 95% credible interval excluded zero.
Association between incidence and age of infected persons
Association between incidence in people under 60 and the proportion of people over 60 among those infected. The points represent the mean posterior departmental estimates. Black line and gray zone: Posterior mean and 95% CI of the expected proportion of persons over 60 among those infected (for a department with the same age structure as metropolitan France).
As illustrated in Fig. 5, a shift in incidence in the persons under 60 from 6 to 12% (typical observed values) was associated with an increase in the expected proportion of persons over 60 among those infected for a department with the same age structure as metropolitan France, from 11.6% (95% CI 9.6–13.6%) to 17.4% (95% CI 15.5–19.5%). The absolute difference was 5.8% (95% CI 2.9–8.8%).
Discussion
This study explored the role of hospital overload on the risk of death for COVID-19 patients (IFR) using data collected in France following the first pandemic wave. We found that a higher departmental incidence was associated with a higher adjusted IFR, corresponding to a causal effect of incidence on IFR. This effect could possibly be explained by the age of the infected persons, as we found a higher proportion of people aged over 60 among those infected in high-incidence departments. The role of hospital overload was explored through the analysis of the probability of hospitalization when infected (IHR). In case of hospital overload with a lack of beds, we would have expected a decrease in the proportion of infected individuals who are hospitalized. In case of hospital overload with sufficient additional beds available but a decrease in the quality of care, we would have expected IHR to remain stable with incidence (since infected individuals could have been hospitalized without restriction). On the contrary, we found that an incidence shift from 3 to 9% increased IHR by the same magnitude as IFR (by a factor of two to three), consistent with the increase in the age of infected individuals with incidence (since both IHR and IFR increase with age)41.
Previous studies have compared IFR between countries and have found that age-specific IFR was the main predictor of country-level IFR13. However, IFR varied strongly between countries (by a factor of more than 30 in13), even after accounting for the age of the infected individuals15,16. In particular, European countries faced higher IFRs than expected13,14,15, suggesting the importance of other determinants of IFR than age and health-care capacity13. On another note, some of these studies have found a positive association between the number of ICU beds per capita and COVID-19 mortality9. It is possible that the wealth of some countries, through international mobility and age of their populations, acted as a confounding factor in this relationship. In our study, hospital overload was represented by the number of ICU beds per capita at baseline and by COVID-19 incidence, while adjusting for the population’s age. That way, focusing on a single country constituted a strength of our study, as it eliminated the influence of any country-level confounder (heterogeneity in populations, healthcare systems, mitigation policies, etc.), by design. Furthermore, the association between departmental incidence and IFR was not explained by the covariates we used: baseline hospital resources (measured as the number of intensive care beds per inhabitant), proportion of the population over 60, or prevalence of diabetes (used as a proxy for obesity). This association between incidence and adjusted IFR was also suggested in one international comparison (see Fig. 3 of16).
Some other studies found a positive association between a higher case fatality rate (CFR) and hospital overload in the US and in France4,8. In these studies, hospital overload was measured either by the number of hospitalizations for COVID-19 or by the ratio between COVID-19 cases and hospital resources, such as ICU beds or nursing staff. In one of these studies, CFR was standardized for the age of the population, but not for the age of the infected individuals4. These findings may be explained by the role of incidence. Indeed, hospital overload increases with incidence, as does the age of infected individuals (and consequently, IFR). Incidence could therefore be a confounder for hospital overload and IFR.
An important strength of this study was the Bayesian statistical framework, which accounted for uncertainty in the latent variables used in the regressions (such as incidence or IFR). Indeed, underestimation of incidence results in overestimation of IFR (since the number of COVID-19 related deaths is known), and overestimation of incidence leads to underestimation of IFR. Thus, sampling variation contributes to a spurious negative association between incidence and IFR, as evidenced in the posterior predictive checks (see Supplementary Information 2). Considering these latent variables as known during the regression step would therefore have led to biased results.
A limitation of our study was that focusing on a single country could reduce external validity, particularly for countries on other continents or with different levels of development. A second limitation relates to the causal assumptions we have made, as they result in a simplified view of the relation between the variables. We used the prevalence of diabetes as a surrogate for obesity because it is easier to collect data on (through diabetes medication), and not all risk factors for COVID-19 death were considered (like immunosuppression). However, these assumptions enabled formal causal reasoning using graphs, which is not yet widely practiced in health research42. This formalism contributes to greater clarity regarding the questions asked and the mechanisms under study.
Another limitation concerns the temporal dynamics of the epidemic within age groups, which is an important factor in interpreting our results. Our approach relies on serological data collected at the end of the wave, which does not allow for a direct analysis of these dynamics. Therefore, our interpretation is based on findings from the literature and on consistent results from our study at the geographical scale, namely the association between cumulative incidence and the age distribution of infected individuals19.
A mediation analysis of the effect of incidence on IFR, using the age of infected individuals as a mediator, could have provided insights into the effect of hospital overload by estimating the direct effect of incidence. Despite several modeling attempts, our algorithms did not converge: the available data did not allow us to perform this analysis.
Finally, we investigated the issue of hospital overload using a population-based approach (as opposed to a hospital-centered approach). A limitation of this approach is the inclusion of nursing home residents in IFR estimates. Indeed, for ethical reasons, very elderly individuals with loss of autonomy may not be admitted to intensive care in case of severe COVID-19, even in the absence of hospital overload. Moreover, nearly half of COVID-19-related deaths during the first pandemic wave in France occurred in nursing homes43. This phenomenon could thus limit the ability to identify an association between hospital overload and the risk of death among infected individuals at the population level. Moreover, our cohort did not include nursing home residents, so the estimation of incidence in this population was based on the incidence among individuals of the same age living in private households.
Conversely, several studies have focused on the risk of in-hospital death among infected individuals44,45,46. These studies have found an association with hospital burden (population incidence or number of hospital admissions), regardless of the age of hospitalized patients. However, they could not account for the fact that, at the same age, individuals with mild forms of COVID-19 are more likely to be hospitalized in the absence of hospital overload. This approach, therefore, introduces a selection bias, highlighting the need to also consider a population-based perspective.
In conclusion, this study found that a higher incidence increased the risk of death among individuals infected with COVID-19. However, the mechanism of this effect could be largely related to a shift in the profile of infected individuals as incidence changes. Indeed, the departments with lower incidence tended to have a younger infected population (to be distinguished from the age of the general population of the department). These findings prompt a reinterpretation of studies that have observed an association between the number of COVID-19 cases in a specific location and the risk of death among infected individuals, as the age of infected persons was generally not considered. It is important to emphasize that, while we did not find epidemiological evidence implicating hospital overload in increasing the risk of death among COVID-19 patients, this effect has already been demonstrated in hospital-level studies44,45. However, our results warn against hasty interpretations of population-level studies.
Data availability
The data of this study are under the protection of health data regulation, set by the French National Commission on Informatics and Liberty (Commission Nationale de l’Informatique et des Libertés, CNIL). The data can be made available upon reasonable request to fabrice.carrat@iplesp.upmc.fr, after a consultation with the steering committee of the SAPRIS-SERO study. The French law forbids us to provide free access to SAPRIS-SERO data; access could however be given by the steering committee after legal verification of the use of the data. Please, feel free to come back to us should you have any additional question.
Code availability
The model’s code (in Stan) is provided in Supplementary Code 1 and in a GitHub repository (https://github.com/bglemain/does-hospital-overload).
Change history
19 March 2026
A Correction to this paper has been published: https://doi.org/10.1038/s41598-026-42364-8
References
Candel, F. J. et al. Temporary hospitals in times of the COVID pandemic. An example and a practical view. Rev. Esp. Quimioter. 34, 280. https://doi.org/10.37201/req/041.2021 (2021).
Lefrant, J.-Y. et al. A national healthcare response to intensive care bed requirements during the COVID-19 outbreak in France. Anaesth. Crit. Care Pain Med. 39, 709–715. https://doi.org/10.1016/j.accpm.2020.09.007 (2020).
Jimenez, J. V. et al. Outcomes in Temporary ICUs Versus Conventional ICUs: An Observational Cohort of Mechanically Ventilated Patients With COVID-19-Induced Acute Respiratory Distress Syndrome. Crit. Care Explor. 4, e0668. https://doi.org/10.1097/CCE.0000000000000668 (2022).
Souris, M. & Gonzalez, J.-P. COVID-19: Spatial analysis of hospital case-fatality rate in France. PLoS One 15, e0243606. https://doi.org/10.1371/journal.pone.0243606 (2020).
Zappella, N. et al. Temporary ICUs during the COVID-19 pandemic first wave: Description of the cohort at a French centre. BMC Anesthesiol. 22, 310. https://doi.org/10.1186/s12871-022-01845-9 (2022).
Sprivulis, P. C., Da Silva, J.-A., Jacobs, I. G., Frazer, A. & Jelinek, G. A. The association between hospital overcrowding and mortality among patients admitted via Western Australian emergency departments. Med. J. Aust. 184, 208–212. https://doi.org/10.5694/j.1326-5377.2006.tb00416.x (2006).
Miró, O. et al. Decreased health care quality associated with emergency department overcrowding. Eur. J. Emerg. Med. 6, 105–107. https://doi.org/10.1097/00063110-199906000-00003 (1999).
Janke, A. T. et al. Analysis of hospital resource availability and COVID-19 mortality across the United States. J. Hosp. Med. 16, 211–214. https://doi.org/10.12788/jhm.3539 (2021).
Sen-Crowe, B., Sutherland, M., McKenney, M. & Elkbuli, A. A closer look into global hospital beds capacity and resource shortages during the COVID-19 pandemic. J. Surg. Res. 260, 56–63. https://doi.org/10.1016/j.jss.2020.11.062 (2021).
Dudel, C. et al. Monitoring trends and differences in COVID-19 case-fatality rates using decomposition methods: Contributions of age structure and age-specific fatality. PLoS One 15, e0238904. https://doi.org/10.1371/journal.pone.0238904 (2020).
Lau, H. et al. Evaluating the massive underreporting and undertesting of COVID-19 cases in multiple global epicenters. Pulmonology 27, 110–115. https://doi.org/10.1016/j.pulmoe.2020.05.015 (2021).
Borgdorff, M. W. New measurable indicator for tuberculosis case detection. Emerg. Infect. Dis. 10, 1523. https://doi.org/10.3201/eid1009.040349 (2004).
COVID-19 Forecasting Team. Variation in the COVID-19 infection-fatality ratio by age, time, and geography during the pre-vaccine era: A systematic analysis. Lancet399, 1469–1488, https://doi.org/10.1016/S0140-6736(21)02867-1 (2022).
O’Driscoll, M. et al. Age-specific mortality and immunity patterns of SARS-CoV-2. Nature 590, 140–145. https://doi.org/10.1038/s41586-020-2918-0 (2021).
Pezzullo, A. M., Axfors, C., Contopoulos-Ioannidis, D. G., Apostolatos, A. & Ioannidis, J. Age-stratified infection fatality rate of COVID-19 in the non-elderly population. Environ. Res. 216, 114655. https://doi.org/10.1016/j.envres.2022.114655 (2023).
Ioannidis, J. Infection fatality rate of COVID-19 inferred from seroprevalence data. Bull. World Health Organ. 99, 19-33F. https://doi.org/10.2471/BLT.20.265892 (2021).
Le, V. et al. Prevalence of SARS-CoV-2 antibodies in France: Results from nationwide serological surveillance. Nat. Commun. 12, 3025. https://doi.org/10.1038/s41467-021-23233-6 (2021).
Carrat, F. et al. Antibody status and cumulative incidence of SARS-CoV-2 infection among adults in three regions of France following the first lockdown and associated risk factors: A multicohort study. Int. J. Epidemiol. 50, 1458–1472. https://doi.org/10.1093/ije/dyab110 (2021).
Tran Kiem, C. et al. SARS-CoV-2 transmission across age groups in France and implications for control. Nat. Commun. 12, 6895. https://doi.org/10.1038/s41467-021-27163-1 (2021).
Carrat, F. et al. Age, COVID-19-like symptoms and SARS-CoV-2 seropositivity profiles after the first wave of the pandemic in France. Infection 50, 257–262. https://doi.org/10.1007/s15010-021-01731-5 (2022).
Carrat, F. et al. Incidence and risk factors of COVID-19-like symptoms in the French general population during the lockdown period: A multi-cohort study. BMC Infect. Dis. 21, 169. https://doi.org/10.1186/s12879-021-05864-8 (2021).
Hercberg, S. et al. The Nutrinet-Santé Study: A web-based prospective study on the relationship between nutrition and health and determinants of dietary patterns and nutritional status. BMC Public Health 10, 242. https://doi.org/10.1186/1471-2458-10-242 (2010).
Zins, M. & Goldberg, M. The French CONSTANCES population-based cohort: Design, inclusion and follow-up. Eur. J. Epidemiol. 30, 1317–1328. https://doi.org/10.1007/s10654-015-0096-4 (2015).
Clavel-Chapelon, F. & E3N Study Group. Cohort Profile: The French E3N Cohort Study. Int J Epidemiol44, 801–809, https://doi.org/10.1093/ije/dyu184 (2015).
Morley, G. L. et al. Sensitive detection of SARS-CoV-2-specific antibodies in dried blood spot samples. Emerg. Infect. Dis. 26, 2970–2973. https://doi.org/10.3201/eid2612.203309 (2020).
Zava, T. T. & Zava, D. T. Validation of dried blood spot sample modifications to two commercially available COVID-19 IgG antibody immunoassays. Bioanalysis 13, 13–28. https://doi.org/10.4155/bio-2020-0289 (2021).
Warszawski, J. et al. Trends in social exposure to SARS-Cov-2 in France. Evidence from the national socio-epidemiological cohort-EPICOV. PLoS One 17, e0267725. https://doi.org/10.1371/journal.pone.0267725 (2022).
Otter, A. D. et al. Implementation and extended evaluation of the euroimmun anti-SARS-CoV-2 IgG Assay and Its Contribution to the United Kingdom’s COVID-19 Public Health Response. Microbiol. Spectr. 10, e0228921. https://doi.org/10.1128/spectrum.02289-21 (2022).
Populations légales 2020 Recensement de la population Régions, départements, arrondissements, cantons et communes. https://www.insee.fr/fr/statistiques/6683031?sommaire=6683037.
Données hospitalières relatives à l’épidémie de COVID-19 (SIVIC). https://www.data.gouv.fr/fr/datasets/donnees-hospitalieres-relatives-a-lepidemie-de-covid-19/.
Covid-19 - Inserm-CépiDc. https://opendata.idf.inserm.fr/cepidc/covid-19/.
Géodes - Santé publique France. https://geodes.santepubliquefrance.fr/#c=home.
La Statistique annuelle des établissements (SAE) | Direction de la recherche, des études, de l’évaluation et des statistiques. https://drees.solidarites-sante.gouv.fr/sources-outils-et-enquetes/00-la-statistique-annuelle-des-etablissements-sae.
COVID-ICU Group on behalf of the REVA Network and the COVID-ICU Investigators. Clinical characteristics and day-90 outcomes of 4244 critically ill adults with COVID-19: A prospective cohort study. Intensive Care Med47, 60–73, https://doi.org/10.1007/s00134-020-06294-x (2021).
Brown, P. A. Country-level predictors of COVID-19 mortality. Sci. Rep. 13, 9263. https://doi.org/10.1038/s41598-023-36449-x (2023).
Pearl, J. Causal diagrams for empirical research. Biometrika 82, 669–688. https://doi.org/10.2307/2337329 (1995) (2337329.).
Morris, M. et al. Bayesian hierarchical spatial models: Implementing the Besag York Mollié model in stan. Spat Spatiotemporal Epidemiol. 31, 100301. https://doi.org/10.1016/j.sste.2019.100301 (2019).
R Core Team. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria (2023).
Carpenter, B. et al. Stan: A Probabilistic Programming Language. J. Stat. Softw. 76, 1. https://doi.org/10.18637/jss.v076.i01 (2017).
Papaspiliopoulos, O. & Roberts, G. Non-Centered Parameterisations for Hierarchical Models and Data Augmentation. Bayesian Statis. 7, 307–326 (2003).
Glemain, B. et al. Estimating SARS-CoV-2 infection probabilities with serological data and a Bayesian mixture model. Sci. Rep. 14, 9503. https://doi.org/10.1038/s41598-024-60060-3 (2024).
Tennant, P. et al. Use of directed acyclic graphs (DAGs) to identify confounders in applied health research: Review and recommendations. Int. J. Epidemiol. 50, 620–632. https://doi.org/10.1093/ije/dyaa213 (2021).
Loganathan, K., Leroy, P., Elbaz, P., Grimfeld, A. & Mouaffak, F. Incidence and outcomes of COVID-19 first wave pandemic in a French nursing home with residents suffering from severe mental illnesses. Psychiatry Res. 309, 114398. https://doi.org/10.1016/j.psychres.2022.114398 (2022).
Asch, D. A. et al. Variation in US Hospital Mortality Rates for Patients Admitted With COVID-19 During the First 6 Months of the Pandemic. JAMA Intern. Med. 181, 471–478. https://doi.org/10.1001/jamainternmed.2020.8193 (2021).
Soria, A. et al. The high volume of patients admitted during the SARS-CoV-2 pandemic has an independent harmful impact on in-hospital mortality from COVID-19. PLoS One 16, e0246170. https://doi.org/10.1371/journal.pone.0246170 (2021).
Guillon, A. et al. Case fatality inequalities of critically ill COVID-19 patients according to patient-, hospital- and region-related factors: A French nationwide study. Ann. Intensive Care 11, 127. https://doi.org/10.1186/s13613-021-00915-4 (2021).
Acknowledgements
The authors warmly thank all the volunteers of the Constances, E3N-E4N, and NutriNet-Santé cohorts. We thank the staff of the Constances, E3N-E4N and NutriNet-Santé cohorts that have worked with dedication and engagement to collect and manage the data used for this study and to ensure continuing communication with the cohort participants. We thank the CEPH-Biobank staff for their adaptability and the quality of their work. We thank all the members of the SAPRIS-SERO study group.
Funding
SAPRIS-SERO study: ANR (Agence Nationale de la Recherche, #ANR-10-COHO-06), Fondation pour la Recherche Médicale (#20RR052-00), Inserm (Institut National de la Santé et de la Recherche Médicale, #C20-26). The sponsor and funders facilitated data acquisition but did not participate in the study design, analysis, interpretation or drafting. Cohorts funding: The CONSTANCES Cohort Study is supported by the Caisse Nationale d’Assurance Maladie (CNAM), the French Ministry of Health, the Ministry of Research, the Institut national de la santé et de la recherche médicale. CONSTANCES benefits from a grant from the French National Research Agency [grant number ANR-11-INBS-0002] and is also partly funded by MSD, AstraZeneca, Lundbeck and L’Oreal. The E3N-E4N cohort is supported by the following institutions: Ministère de l’Enseignement Supérieur, de la Recherche et de l’Innovation, INSERM, University Paris-Saclay, Gustave Roussy, the MGEN, and the French League Against Cancer. The NutriNet-Santé study is supported by the following public institutions: Ministère de la Santé, Santé Publique France, Institut National de la Santé et de la Recherche Médicale (INSERM), Institut National de la Recherche Agronomique (INRAE), Conservatoire National des Arts et Métiers (CNAM) and Sorbonne Paris Nord. The CEPH-Biobank is supported by the << Ministère de l’Enseignement Supérieur, de la Recherche et de l’Innovation >> .
Author information
Authors and Affiliations
Consortia
Contributions
F.C., N.L., C.A., W.G., P.M, and B.G. conceived and designed the study. B.G. implemented the model and wrote the manuscript. All authors (BG, CA, WG, PM, XL, MZ, GS, MT, JFD, NL, FC) reviewed and edited the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Ethics approval and consent to participate
Ethical approval and written or electronic informed consent were obtained from each participant before enrollment in the original cohort. The SAPRIS-SERO study was approved by the Sud-Mediterranée III ethics committee (approval 20.04.22.74247) and electronic informed consent was obtained from all participants for dried blood spot testing. The study was registered (#NCT04392388). All methods were performed in accordance with the relevant guidelines and regulations.
Consent for publication
Participants can not be identified on the basis of this article.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
The original online version of this Article was revised: The original version of this Article contained an error in the author list, where the members of the consortium were added as authors in the author list.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Glemain, B., Assaad, C., Ghosn, W. et al. Revisiting the link between COVID-19 incidence and infection fatality rate during the first pandemic wave. Sci Rep 15, 15638 (2025). https://doi.org/10.1038/s41598-025-99078-6
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s41598-025-99078-6
Keywords
This article is cited by
-
Optimal pandemic control strategies and cost-effectiveness of COVID-19 non-pharmaceutical interventions in the United States
BMC Global and Public Health (2025)







