Causal inference concepts can guide research into the effects of climate on infectious diseases

Barrero Guevara, Laura Andrea; Kramer, Sarah C.; Kurth, Tobias; Domenech de Cellès, Matthieu

doi:10.1038/s41559-024-02594-3

Download PDF

Analysis
Open access
Published: 25 November 2024

Causal inference concepts can guide research into the effects of climate on infectious diseases

Nature Ecology & Evolution volume 9, pages 349–363 (2025)Cite this article

10k Accesses
4 Citations
8 Altmetric
Metrics details

Subjects

Abstract

A pressing question resulting from global warming is how climate change will affect infectious diseases. Answering this question requires research into the effects of weather on the population dynamics of transmission and infection; elucidating these effects, however, has proved difficult due to the challenges of assessing causality from the predominantly observational data available in epidemiological research. Here we show how concepts from causal inference—the sub-field of statistics aiming at inferring causality from data—can guide that research. Through a series of case studies, we illustrate how such concepts can help assess study design and strategically choose a study’s location, evaluate and reduce the risk of bias, and interpret the multifaceted effects of meteorological variables on transmission. More broadly, we argue that interdisciplinary approaches based on explicit causal frameworks are crucial for reliably estimating the effect of weather and accurately predicting the consequences of climate change.

Perspectives on climate change and infectious disease outbreaks: is the evidence there?

Article Open access 20 July 2024

Climate warming and influenza dynamics: the modulating effects of seasonal temperature increases on epidemic patterns

Article Open access 25 February 2025

Identifying outbreak risk factors through case-controls comparisons

Article Open access 30 May 2025

Main

A key question ensuing from global warming is how climate change may impact the population dynamics of infectious diseases^1,2,3. Indeed, observations of large climatic variability in the distribution and seasonality of multiple infectious diseases worldwide—including major causes of death like malaria⁴, cholera⁵ and influenza⁶—suggest that many pathogens are sensitive to environmental conditions such that climate change could modify their ecology and epidemiology. Accordingly, predictive studies, based on numerical simulations combining models of global climate and infectious diseases under different scenarios of greenhouse gas emissions, suggest that climate change will affect many infections. These include infections with indirect transmission through intermediate, climate-sensitive stages involving a vector, such as mosquito-borne diseases like malaria and dengue, or the environment, including water-borne diseases like cholera. All of these infections are predicted to shift their geographical range under continued global warming^7,8,9. Fewer studies have focused on directly transmitted pathogens, but it has been suggested that climate change could also alter the transmission dynamics of respiratory syncytial viruses in the United States and Mexico¹⁰ and varicella zoster viruses in Mexico¹¹. Although such predictions cannot yet be evaluated, earlier research has already documented the impact of past climate warming³, for example, the increased altitudinal range of malaria in the highlands of Ethiopia and Colombia¹² and the increased risk of Vibrio disease in Northern Europe, coinciding with the warming of the Baltic Sea’s surface¹³.

A prerequisite to predicting the long-term consequences of climate change is to elucidate the effect of weather on infectious diseases. Even though effects of weather on infection dynamics (and the resulting ‘calendar of epidemics’¹⁴) have long been observed, there are persisting uncertainties about the direct causes and mechanisms for even well-researched pathogens like influenza viruses^15,16. Perhaps the most robust evidence for this effect is afforded by experimental studies, which demonstrate that environmental variables like temperature and humidity tightly modulate transmission parameters (such as pathogen survival time or infectivity) of viral^17,18, bacterial^19,20 and parasitic^21,22 infections. Although such evidence is useful for population-based research (in particular for postulating causal environmental factors), it remains too limited to estimate the causal impact of weather at the scale of human populations for at least three reasons. First, the endpoints measured in experimental studies—such as pathogen survival time—can be challenging to translate into meaningful epidemiological quantities, such as transmissibility. Second, because of differences in infection biology between species, the results from animal studies may not generalize to humans²³. Third, experimental studies cannot recapitulate all the mechanisms whereby weather affects infection, especially those operating at the population level—for example, weather causes behavioural changes in people, resulting in seasonal changes in social contacts²⁴. Hence, observational studies remain necessary to estimate the multifaceted effects of weather on human infectious diseases.

However, a well-known shortcoming of observational studies is their tendency to misidentify causes because found associations do not always imply causation for observational data. This problem is also expected when inferring the effect of weather, which is characterized by meteorological variables generally highly correlated with one another and potentially many other seasonal causes of infectious diseases. Here we discuss and demonstrate how causal inference—a methodological framework aiming at inferring causes from observed data—offers a principled approach to tackle these issues and strengthen evidence in observational research^25,26.

Causal inference in climate–infectious disease research

Causal inference frameworks and their tools are increasingly used to analyse data and guide study design in epidemiology²⁷ and beyond^28,29, and may also be useful for experimental research²⁹. The impact of such tools is illustrated by the fact that causal frameworks like target emulation trials may provide evidence as robust as that from randomized trials³⁰, thereby expanding the scope of observational research for answering causal questions.

Despite these advances, the use of causal methods—in the form of mechanistic models or statistical models based on causal reasoning—remains limited in the field of weather– or climate–infectious disease research³¹. To assess how regularly causal methods are used in these fields, we re-analysed 33 studies previously assessed in a review^{32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63,64}. All of these studies used time series regression models to evaluate the association between weather and cases of dengue, influenza, cholera or malaria cases (Supplementary Table 1). Although more causally principled methods for time series analysis exist^65,66, the standard time series design remains widely used, as evidenced by numerous recent applications for SARS-CoV-2⁶⁷. Four of the 33 studies had an explicitly predictive objective and did not address causality^38,41,43,58. Of the remaining 29 studies that addressed causality, only one derived the statistical model from explicit causal assumptions⁴⁰. By contrast, the other 28 studies neither explicitly mentioned causal reasoning to formulate their research question nor used causal graphs for study design or statistical analysis. Based on this assessment, we conclude that applying causal inference methods may help strengthen the evidence in this field.

Here we aim to demonstrate how using a causal inference framework can improve research on the effects of weather and climate on infectious diseases. Our focus is on the conceptual aspects, while we recommend other studies for methodological details^65,68,69. Throughout our analysis, we illustrate the different causal inference concepts through a series of short case studies (‘vignettes’), all based on a causal inference framework described below.

Results

A causal inference framework for a model of infectious disease transmission

A causal inference framework can be broadly defined as a systematic approach to identifying the causal effects of an input variable (for example, a weather variable) on an outcome variable of interest (for example, infectious disease risk)^70,71. In this framework, a central first step is to explicitly represent cause-and-effect relationships using a causal diagram called a directed acyclic graph (DAG; see glossary Fig. 1)⁷². The benefits of this representation are double: conceptual, as they help investigators clarify their hypotheses and assumptions and highlight potential sources of bias; and methodological, as given a DAG, causal inference theory prescribes a set of rules to determine the form of a model (if any) that can answer the causal question of interest. In a DAG, one should also carefully consider the measurement process to distinguish between observed and unobserved variables. This distinction is central for infectious diseases, as variables that are typically unobserved—such as population immunity, often portrayed as the ‘dark matter’ of epidemics⁷³—sensitively control transmission dynamics. We illustrate the application of a causal inference framework with a simple mathematical model representing the population-level dynamics of an acute infection spread by direct contact between susceptible and infected hosts. Although we do not focus on a specific pathogen in the rest of this analysis, this model—known as Susceptible–Infected–Recovered–Susceptible (SIRS) in the field of infectious disease modelling⁷⁴—may be considered realistic for respiratory viruses with short generation times, like influenza viruses⁷⁵, respiratory syncytial viruses⁷⁶ and SARS-CoV-2⁷⁶.

**Fig. 1: Glossary of causal inference concepts.**

To describe the effect of weather on infection dynamics, we incorporated an environmental model representing the joint causal effect—dictated by physical laws^77,78—of ambient air temperature (Te) and dew point temperature (a measure of absolute humidity) on relative humidity (RH). We then assumed a direct, negative effect of temperature and relative humidity on transmission (β)—that is, transmission decreased as either climatic variable increased. Finally, we incorporated an observation model representing the causal link between the true and observed incidence rates, assuming a surveillance system with perfectly specific but incompletely sensitive case detection, resulting in case under-reporting. Of note, we assumed this observation model was completely random, and we thus did not consider the potential bias resulting from systematic differences between the true and the observed incidence rates^79,80,81. Strategic simplifying assumptions (in particular, of a discrete-time model with fixed generation time and time step of 1 week) then allowed us to represent the full model with a simple DAG (Fig. 2). We focus on this representation instead of the conventional compartmental model diagram used in infectious disease modelling because DAGs more concisely convey causal concepts. Still, we note that both representations are causal diagrams (see ref. ⁸² for a more extensive discussion of the correspondence between both diagrams). Full model details—including equations, numerical implementation and further discussion of our assumptions about the effects of weather—can be found in Methods.

**Fig. 2: Causal graph for the illustrative transmission model.**

In the rest of this analysis, we use this DAG and its underlying model to illustrate four causal inference concepts: descendants and measurement bias (vignette 1); natural experiments (vignette 2); confounders and confounding bias (vignette 3); and mediators (vignette 4). In so doing, we emphasize two key points: first, causal inference frameworks are useful—and, indeed, required—to assess the effects of weather on infectious diseases; second, transmission models are valuable to encapsulate this framework, as they specify explicit causal mechanisms for the observed and unobserved variables that underlie infectious disease data.

Causal inference concepts–illustrations with four vignettes

Vignette 1 on descendants, measurement bias and the intricate association between environmental variables and incidence rate

Time series regression analysis of observed incidence rates is a frequent study design in environmental epidemiology^68,83. The implicit assumption of such studies is that statistical quantities derived from regression models—typically, regression coefficients—will accurately capture the causal effect of meteorological variables. However, as shown in our causal graph (Fig. 2) and as expected in practice, the causal chain relating weather to observed incidence rates is indirect and complex. A key issue arises from the fact that, although weather directly affects the transmission rate, the observed incidence rate is located two causal links downstream from the transmission rate; in causal inference language, the latter rate is described as a descendant (that is, a consequence; Fig. 1) of the former. Of these two causal links, the one relating the transmission and incidence rates may be challenging to recapitulate with regression models because unobserved variables like the size of the population susceptible to infection (inversely related to the population, or herd, immunity, which controls epidemic thresholds) induce nonlinearities that may result in marked dissimilarities between these two rates^69,74. Hence, measurement bias—that is, the bias arising from non-random differences between the targeted (here, transmission) and the observed endpoints (here, observed incidence; Fig. 1)⁸⁴—may distort causal inference from time series regression models. This potential bias has been recognized in environmental epidemiology, as reflected in recommendations to include additional covariates for capturing temporal variations in population immunity or other long-term trends⁶⁸. However, because of the above complexities, such additions, depending on the underlying causal structure and available information, are not guaranteed to reduce measurement bias.

To illustrate, we generated model simulations for a pathogen with low, medium or high transmissibility (basic reproduction number of 1.25, 2.5 or 5, respectively), with meteorological data from a temperate climate (Lübeck, Germany; Supplementary Table 2) resulting in a seasonally forced transmission rate with a single peak every winter (Fig. 3a). Under the medium-transmissibility scenario (Fig. 3b, middle panel), the epidemiological dynamic displayed annual periodicity, with winter seasonality in the incidence rate that broadly matched that of the transmission rate (Spearman’s correlation coefficient: r_s = 0.63). In marked contrast, lower transmissibility resulted in biennial epidemics showing little correlation with the seasonal transmission rate (r_s = 0.07; Fig. 3b, top panel). This phenomenon—called sub-harmonic resonance⁷⁴—resulted from the higher susceptibility threshold needed to trigger epidemics and the longer time required to replenish the pool of susceptible individuals (via births and waning immunity) to exceed that threshold. Finally, the opposite phenomenon of super-harmonic resonance was observed in the high-transmissibility scenario, which resulted in biannual epidemics (Fig. 3b, bottom panel). These simple numerical experiments illustrate the complex dynamic of infectious diseases and the potent but sometimes counter-intuitive footprint that weather—or, for that matter, any other source of seasonal forcing—can have on this dynamic⁶⁹.

**Fig. 3: Measurement bias and the intricate association between environmental variables and incidence rate (vignette 1).**

Next, we generated 100 replicate time series of observed incidence rates for each scenario to assess the reliability of time series regression models. For every replicate, as a control, we first fitted a negative binomial generalized additive model (GAM) with 1-week-lagged weather variables and the susceptible and infected population sizes as covariates and the observed incidence rate as endpoint—the true candidate model for our application, as we show in the Methods. As expected, the causal effects of temperature and relative humidity on the transmission rate were estimated, on average, without bias for this model (Supplementary Fig. 1). In practical applications, however, the susceptible and infected population sizes would be unobserved. Therefore, we next fitted a comparable model with a flexible smooth of time to try to capture variations in these unobserved variables. As shown in Fig. 3c, because of measurement bias, estimation performance was, overall, poor. For low transmissibility, the causal effect of temperature on the transmission rate was estimated with a substantial bias (mean absolute bias (AB): 0.08, 40% relative error in comparison to the actual value of −0.2) and imprecision (mean standard error of estimates across simulations (SE): 0.07). The bias was even more substantial in the medium- (mean AB: 0.16) and high-transmissibility scenarios (mean AB: 0.22). Because relative humidity had a high correlation with temperature but lower variability, its estimated effect was marred with large uncertainty, which exceeded, on average, the absolute effect size in every scenario (mean SE: 0.21–0.35, to be compared with the true effect size of −0.2; Supplementary Fig. 1).

In an additional analysis, we tested time series regression models of the effective reproduction number, another outcome that can be considered to assess the effects of weather. The corresponding DAG (Supplementary Fig. 2) shows that this outcome depends on only one unobserved variable, while the incidence rate depends on two (Fig. 2). As a result, we found that time series regression generally performed better for this outcome, even though the bias remained very large in all scenarios (Supplementary Fig. 1). We note that this better performance may not generalize to all settings. In practice, the effective reproduction number is not directly observable and must first be reconstructed from incidence time series⁸⁵. In our deliberately simple application, we assumed a small amount of noise but no systematic bias in this reconstruction, an assumption that may be too optimistic for more realistic models.

Although not intended to be exhaustive, this simple simulation study echoes earlier discussions of measurement bias as a major concern for study designs based on time series regression, particularly when population immunity varies over fast time scales^68,83. Relating to the central thread of this analysis, we note that causal reasoning—particularly the causal diagram representing the effect of weather—allowed us to identify, a priori, the relevant theoretical issues and propose simple numerical experiments to assess their practical relevance. This vignette thus illustrates the value of causal reasoning not only as a methodological tool but also as a theoretical tool to guide study design and analysis.

Vignette 2 on climate variability as natural experiments to estimate the individual effect of meteorological variables

Owing predominantly to latitudinal gradients in solar radiation and other factors like altitude and proximity to the sea, the Earth displays a large variability of climates⁸⁶. This variability is reflected in the Köppen–Geiger system, which classifies worldwide climates into 5 main types and 30 sub-types based on seasonal averages of precipitation and temperature⁸⁷. Because these different climates exhibit diverse seasonal patterns of variation in weather variables and correlations between them, they may be regarded as a range of ‘natural experiments,’ conceptually equivalent to manipulating specific weather variables to identify their causal effects. More broadly, the strategy of leveraging randomness that occurs naturally in observed data (that is, quasi-experiments; Fig. 1) is increasingly advocated for when inferring causality in predominantly observational research fields like economics²⁸, ecology⁸⁸ and epidemiology⁸⁹. As an example of such quasi-experiments, previous studies analysed large-scale, irregular oceanic phenomena such as the El Niño–Southern Oscillation to evaluate the effects of ‘climate change-like shocks’^90,91.

Of particular interest for environmental epidemiological research is the contrast between tropical climates (where temperature generally varies little and thus only slightly affects relative humidity) and temperate climates (where the opposite is typically observed), which may be leveraged to isolate the effects of temperature and relative humidity. To test this hypothesis, we used our illustrative model to run a range of numerical experiments in a pair of locations where this contrast was marked: Lübeck, Germany (53.9° N latitude, coefficient of variation (CV; standard deviation/mean) of temperature CV(Te) = 0.59, CV(RH) = 0.07, r_s(Te, RH) = −0.48) and Bogotá, Colombia (4.7° N latitude, CV(Te) = 0.04, CV(RH) = 0.07, r_s(Te, RH) = −0.10). By back-fitting our transmission model to 100 replicate time series of observed incidence rates it generated, we gauged how well we could estimate the effects of temperature and relative humidity (as well as other model parameters that would be unknown in real-world applications; Methods) in the two climates. In Lübeck, as expected for a climate characterized by low RH variability and large RH–Te correlation, the effect of temperature was estimated with more accuracy than that of relative humidity (mean AB of 0.02 and 0.09, respectively; Fig. 4). Of note, these results are reminiscent of those of vignette 1, except that the parameters estimated in this vignette originated from the true causal transmission model (Fig. 2) and, therefore, did not suffer from measurement bias. In Bogotá and its climate with low Te variability and almost null RH–Te correlation, the opposite result held, with higher accuracy for relative humidity than for temperature (mean AB of 0.04 and 0.05, respectively).

**Fig. 4: Climate variability as natural experiments to estimate the individual effect of meteorological variables (vignette 2).**

This simulation study thus suggests the scope for strategic choices regarding a study’s location, where the local climate’s properties can help estimate the effect of the weather variable of causal interest. Whenever more data are available, an alternative strategy is to use multilevel models, which provide a principled way to pool information while modelling variation across multiple locations⁸⁴. Multilevel extensions are now routine for standard regression models but more challenging for the complex—typically nonlinear, stochastic and partially observed⁹²—models needed to capture infectious disease dynamics. Nevertheless, recent statistical advances permit the estimation of such multilevel models^93,94, opening an avenue for large-scale dynamical modelling studies that harness information from multiple natural experiments in different climates. More broadly, this vignette underscores the critical importance of considering the causal mechanisms underlying weather dynamics.

Vignette 3 on confounding bias and how climate variability can masquerade as spatial spread

Spatial heterogeneities are commonly observed for infectious diseases^{95,96,97,98,99}. Such heterogeneities can result from two broad classes of mechanisms, depending on whether they involve spatial interactions through the movement of individuals (that is, spatial spread) or spatial variation in some other variable (for example, climate)¹⁰⁰. For example, spatial variations in climate across multiple locations may result in spatial covariance between these locations, even in the absence of spatial spread. This common cause of spatial variability may thus result in spurious associations that confound the estimated effect of spatial spread on observed incidence—that is, confounding bias (Fig. 1; see also Supplementary Fig. 3 for a DAG illustrating the problem in two locations).

To illustrate, we considered a scenario with seasonal transmission forced by weather but no spatial spread in two distinct countries (Supplementary Table 2): one with a definite latitudinal gradient in climate (Colombia¹⁰¹) and another with little spatial variability in climate (Spain). We simulated the resulting dynamics of our transmission model and assessed epidemic synchrony¹⁰² across various locations in these two countries. In Colombia, the simulated incidence displayed diverse seasonal patterns that followed a latitudinal gradient broadly matching that of the climate (Fig. 5b). In contrast, the low climatic variability in Spain resulted in tightly synchronous epidemics across the locations (Supplementary Fig. 4b). Hence, despite the absence of mechanisms causing spatial spread in our model, the shared effect of climate between locations resulted in marked spatial correlation in observed incidence, up to ~250 km in Colombia and more extensively throughout Spain (Fig. 5d and Supplementary Fig. 4d).

**Fig. 5: Confounding bias and how climate variability can masquerade as spatial diffusion in Colombia (vignette 3).**

To further characterize this spatial correlation, we estimated the speed of the—spurious—travelling wave under the incorrect assumption of spatial spread being the sole cause of spatial heterogeneity⁹⁶. Speed estimates were near infinite in Spain (Supplementary Fig. 4a,c), suggesting either confounding by climate or extremely strong coupling between the locations⁷⁴. In Colombia, however, the speed was estimated at 218 km per month (95% credible interval (CI): 121–444 km per month; Fig. 5a,c), a value consistent with that documented for real travelling waves—for example, 110–320 km per month for pertussis during 1951–2010 in the United States⁹⁸ and 150 km per month for dengue during 1983–1997 in Thailand¹⁰³.

To assess whether a statistical approach can address this confounding bias, we fitted a series of Gaussian process models to estimate the covariance in observed incidence rates between all locations in each country (Methods). As a control, we first tested the true model implied by the DAG, which included the sizes of the susceptible and infected populations, in addition to 1-week-lagged weather variables as covariates. As expected, this model correctly identified the absence of spatial spread, with the maximum covariance between locations being negligible (η (95% CI) = 0.01 (0–0.02) in Colombia and 0 (0–0.02) in Spain; Fig. 5e and Supplementary Fig. 4e left panel). Next, we fitted models with smooths of time to try to capture the variations in the susceptible and infected population sizes, which would be unobserved in practice. We found that a model omitting weather variables (that is, the confounded model) estimated a spurious spatial covariance between the locations (η (95% CI) = 0.09 (0.05–0.16) in Colombia and 0.38 (0.22–0.70) in Spain; Fig. 5e and Supplementary Fig. 4e middle panel). Furthermore, a model including the weather variables also led to biased estimates of spatial covariance (η (95% CI) = 0.11 (0.06–0.18) in Colombia and 0.39 (0.23–0.74) in Spain; Fig. 5e and Supplementary Fig. 4e right panel). This finding thus re-emphasizes the difficulty of analysing incidence data because of measurement bias (vignette 1) and suggests the need for explicit models for capturing the unobserved variables.

Next, we thus designed a simple transmission model with climate forcing and spatial spread (described by a single coupling parameter τ) between two locations. Using the data generated from the model with no spatial spread (Fig. 5d), we estimated the value of the coupling parameter (Methods). Here, the model including the weather variables—and thus controlling for the shared weather between the two locations—correctly revealed the absence of spatial spread (Fig. 5f and Supplementary Fig. 4f, with all confidence intervals of the maximum likelihood estimate for τ including 0). Although used for illustration here, this deliberately simple model could be extended to include more realistic features of spatial spread, such as multiple locations with different population sizes, potentially resulting in source–sink dynamics between urban and rural areas⁹⁵. More generally, this vignette highlights the importance of integrating explicit causal models with transmission models to disentangle the mechanisms underlying spatial heterogeneities.

Vignette 4 on mediation and the direct and indirect causal effects of temperature on transmission

Weather is the consequence of a complex web of causally related environmental variables that can affect pathogen transmission through multiple pathways⁶⁵. For instance, as illustrated in our simple causal graph and supported by experimental evidence¹⁸, both temperature and relative humidity may directly affect transmission. However, because temperature also impacts relative humidity, its total causal effect may comprise a direct effect (Te → β) and an indirect effect mediated by relative humidity (Te → RH → β). Importantly, but counter-intuitively, the direct and indirect effects may act in opposite directions depending on the causal relationship between the parent variable (for example, temperature) and its mediator (for example, relative humidity; Fig. 1) in the environmental model.

To illustrate how these two effects play out in different settings, we simulated our model in Lübeck, Germany, and Pasto, another Colombian city with relative humidity variability higher than in Bogotá. Owing to climatic differences causing temperature to be more variable than relative humidity in Lübeck (CV(Te) = 0.59, CV(RH) = 0.07) but less variable in Pasto (1.4° N latitude, CV(Te) = 0.07, CV(RH) = 0.19), we hypothesized the total impact of temperature would differ between the two cities. In each city, we considered two scenarios. In the first scenario, we set the two climatic parameters (see legend of Fig. 2) to their baseline values to capture the total effect of temperature—that is, the direct effect and the indirect effect mediated by relative humidity. In the second scenario, we set the climatic parameter of temperature to 0 to capture only its indirect effect, mediated by relative humidity.

In both cities, because of a combination of two negative effects (of temperature on relative humidity and relative humidity on transmission), higher temperature increased transmission through the indirect pathway (r_s(Te, β) = 0.91 in Pasto and 0.72 in Lübeck; Fig. 6a,b). In Lübeck, however, the higher temperature variability caused the direct effect to outweigh this indirect effect so that, overall, transmission decreased as temperature increased (r_s(Te, β) = −0.99; Fig. 6a). By contrast, the higher variability in relative humidity reversed the total effect in Pasto, where transmission increased with temperature (r_s(Te, β) = 0.80; Fig. 6b). Despite this overall positive effect, adjusting for relative humidity revealed the negative direct effect of temperature in Pasto (partial Spearman’s rank correlation coefficient: r_s(Te, β|RH) = −0.56; Fig. 6b). Of note, echoing the results of vignette 1, the effects of temperature on the observed incidence rates were less definite because of measurement bias (Supplementary Fig. 5). Hence, despite identical causal mechanisms, climatic differences resulted in divergent effects of temperature in the two cities.

**Fig. 6: Mediation and the direct and indirect causal effects of temperature on transmission (vignette 4).**

These conceptual insights have practical implications for interpreting the association between environment and transmission rates. Specifically, when evaluating the effect of temperature in a DAG similar to that in Fig. 2, models adjusting for relative humidity would identify the direct effect of both variables. In contrast, models without adjustment would only identify the total effect of temperature. The lack of clear causal frameworks may thus lead to misinterpreting model outputs, a risk described by earlier research as the ‘table 2 fallacy’¹⁰⁴. Hence, this vignette re-emphasizes the critical importance of causal reasoning and careful interpretation when probing the effect of climate on infection dynamics.

Discussion

Here we aimed to show how causal inference concepts—such as descendants and mediators, confounding and measurement biases, and quasi-experiments—can guide research into the effects of climate on infectious diseases. Through a series of case studies, we illustrated how such concepts could help assess study design (vignette 1); strategically choose a study’s location to achieve the set-up of a natural experiment (vignette 2); evaluate the risk of confounding bias (vignette 3); and interpret the direct and—sometimes paradoxical—indirect effects of meteorological variables on transmission (vignette 4). In addition, we showed that transmission models offer a principled and parsimonious tool to capture infectious disease dynamics and encapsulate causal frameworks. More broadly, seconding earlier calls in the epidemiological field²⁷, we argue that such frameworks are necessary for inferring the effect of weather and subsequently predicting the consequences of climate change on infectious diseases.

Because of this study’s conceptual focus, we sidestepped the many methodological technicalities that inevitably arise in practice. In addition to mere data problems (for example, a mismatch between the time scales of observed data and infection dynamics), the effect of weather on infectious diseases can be more intricate than our simple model suggests. First, variables other than temperature and relative humidity may directly affect transmission, such that disentangling their direct and indirect effects may be more challenging than in vignette 4. For example, if one assumes a direct effect of dew point temperature on transmission in the DAG represented in Fig. 2, then a causal mediation analysis would be required to estimate these different effects for all the environmental variables. Second, because of interindividual variability in the period separating exposure from infectiousness (that is, the latent period), the effect of weather on transmission is expected to be lag-distributed, resulting in a more complex causal diagram than Fig. 2. In this case, a standard compartmental model diagram may be more suited to depict the causal processes, with the added benefit that such diagrams can explicitly represent interactions and interference⁸². Third, this effect may be non-continuous and non-monotonic, as illustrated by recent experimental evidence showing a V-shape threshold association between relative humidity and survival time of coronaviruses^18,105. Fourth, although the causal graph of Fig. 2 is realistic for pathogens causing acute, directly transmitted infections (such as respiratory viruses), the weather may have multiple effects on pathogens with more complex relationships with their human hosts. Such multiple effects are, for example, expected for invasive bacteria with prolonged carrier states (like the pneumococcus¹⁰⁶), with weather affecting not only transmission of carriage but also progression from carriage to invasive disease^107,108. Finally, the poor correlation between indoor and outdoor meteorological variables (especially observed for temperature and relative humidity¹⁰⁹) has led to discussions about which measure of weather is more appropriate for causal inference, with indoor data argued to represent the bulk of weather exposures^16,110. This problem may be viewed as another form of measurement bias and treated in a causal framework by modelling the causal link between indoor variables and their outdoor counterparts, with some recent research in this direction¹¹⁰. More generally, our simple causal framework could be similarly extended to tackle the other complexities listed above, as well as the potential biases illustrated in the vignettes simultaneously.

In conclusion, the expanding field of causal inference offers opportunities to strengthen evidence derived from observed data. This study thus presents an early effort to integrate this field with infectious disease epidemiology and climatology, with the ultimate research aims of elucidating how climate affects pathogens and predicting the consequences of climate change. Given that climate is an ever-present component of the environment, this research will also advance our understanding of the ecology of infectious diseases.

Methods

Model formulation

Meteorological model

The daily average records of dew point temperature (Td, expressed in °C) and ambient temperature (Te, expressed in °C) were extracted using the WeatherData function in Mathematica¹¹¹. The data covered the period 2013–2022 in multiple weather stations located near the major cities of Colombia, Spain and Germany (Supplementary Table 1). Because the transmission model had a time step of 1 week, we calculated weekly averages of Te and Td for inclusion as covariates in the model. In case of missing daily records for a given week, we calculated the weekly average based on the records observed within that week in other years.

The relative humidity (RH, defined as the actual amount of water moisture in the air compared with the total amount that the air can hold at a given temperature) was then calculated using the formula^77,78:

$$\normalsize \normalsize \normalsize \log {{\mathrm{RH}}}=\frac{\alpha {\mathrm{{Td}}}}{\lambda +{\mathrm{Td}}}-\frac{\alpha {\mathrm{Te}}}{\lambda +{\mathrm{Te}}}$$

where α = 17.625 and λ = 243.04 °C are the revised Magnus coefficients¹¹². This association can be understood intuitively as follows: relative humidity increases as the absolute moisture in the air (quantified by Td) increases, while it decreases as ambient temperature increases (because of the physical property that the maximum moisture air can hold increases exponentially with temperature). To verify the adequacy of this formula, we also extracted actual RH records from the weather stations: the agreement between predictions and measurements was excellent (>90% correlation in all the locations considered), even though the predicted RH was overestimated at low ambient temperatures in temperate climates, such as in Germany.

In the following, we denote by Te_t and RH_t the weekly time series of ambient temperature and relative humidity, $\overline{{{\mathrm{Te}}}}$ and $\overline{{{\mathrm{RH}}}}$ their temporal averages (over the entire time series), and ${{{\mathrm{Te}}}}_{{\mathrm{t}}}^{{\prime} }=\frac{{\mathrm{T{e}}}_{{\mathrm{t}}}}{\overline{{{\mathrm{Te}}}}}$ and ${{{\mathrm{RH}}}}_{{\mathrm{t}}}^{{\prime} }=\frac{{\mathrm{R{H}}}_{{\mathrm{t}}}}{\overline{{{\mathrm{RH}}}}}$ their renormalized values.

Transmission model

To illustrate the different causal inference concepts, we formulated a discrete-time SIRS⁷⁴ model with the transmission rate β_t forced by the two climatic variables:

$${\log \beta }_{{\mathrm{t}}}=\log \beta +{\delta }_{{{\mathrm{Te}}}}({{{\mathrm{Te}}}}_{{\mathrm{t}}}^{{\prime} }-1)+{\delta }_{{{\mathrm{RH}}}}({{{\mathrm{RH}}}}_{{\mathrm{t}}}^{{\prime} }-1)$$

where β is the average transmission rate, δ_Te the effect of ambient temperature and δ_RH the effect of relative humidity. Based on experimental evidence on respiratory viruses^18,113, we assumed a small negative effect of both climatic variables: ${\delta }_{{{\mathrm{Te}}}}={\delta }_{{{\mathrm{RH}}}}=-0.2$. In other words, we assumed that transmission decreased as either climatic variable increased.

To derive the equations of the discrete-time model, we first write the system of ordinary differential equations for the continuous-time model:

$$\frac{{{\mathrm{d}}S}}{{{\mathrm{d}}t}}=\mu N\,+\alpha R\,-\,(\lambda (t)+\mu )S$$

$$\frac{{{\mathrm{d}}I}}{{{\mathrm{d}}t}}=\lambda (t)S-(\gamma +\mu )I$$

$$\frac{{{\mathrm{d}}R}}{{{\mathrm{d}}t}}=\gamma I-(\alpha +\mu )R$$

where $\lambda (t)=\beta (t)I(t)/N$ is the force of infection, N is the population size, μ is the birth/death rate, γ⁻¹ is the generation time, and α⁻¹ is the average duration of protection. All parameters are listed in Supplementary Table 3. Assuming a fixed time step Δt = 1 week and a fixed generation time equal to this time step (γ⁻¹ = Δt = 1 week), we discretized the system of ordinary differential equations (using the approximation $\frac{{{\mathrm{d}}X}}{{{\mathrm{d}}t}}\approx \frac{{X}_{t+\Delta t}-{X}_{t}}{\Delta t}$) to get the equations of the discrete-time model:

$${S}_{t+1}={S}_{t}+\mu N+\alpha {R}_{t}-({\lambda }_{t}+\mu ){S}_{t}$$

$${I}_{t+1}={\lambda }_{t}{S}_{t}-\mu {I}_{t}$$

$${R}_{t+1}={R}_{t}+{I}_{t}-(\alpha +\mu ){R}_{t}$$

For all simulations, we initialized the state variables to their equilibrium values for the model with no seasonal forcing (${\delta }_{{{\mathrm{Te}}}}={\delta }_{{{\mathrm{RH}}}}=0$). For the discrete-time model, these equilibria are given by ${S}^{* }=\frac{N}{{R}_{0}}$, ${I}^{* }=\frac{\alpha +\mu }{\alpha +\mu +1}(N-{S}^{* })$ and ${R}^{* }=N-{S}^{* }-{I}^{* }$, where ${R}_{0}=\frac{\beta }{\mu +1}$ is the basic reproduction number.

Observation model

To complete the model formulation, we specified a stochastic observation model to generate observed data from the transmission model’s outputs. Let

$${C}_{t}={\lambda }_{t-1}{S}_{t-1}={\beta }_{t-1}\frac{{I}_{t-1}}{N}{S}_{t-1}$$

represent the (true) incidence rate (Fig. 2), defined as the weekly number of new cases. We then used a negative binomial (NB) model to sample the observed incidence rate:

$${{C}_{t}^{({\mathrm{O}})}} \sim {\mathrm {NB}}({\mu }_{t}=\bar{\rho }{C}_{t},{\rho }_{{\mathrm{k}}})$$

where $\bar{\rho }$ is the mean reporting probability and ${\rho }_{{\mathrm{k}}}$ the reporting over-dispersion, representing extra variability in the mean reporting probability.

Complete model and causal graph

The complete model thus consisted of the discrete-time transmission model and the observation model described above. Because of our simplifying assumptions (in particular, a fixed generation time and an immediate effect of climate on transmission), this model was exactly represented by the causal graph displayed in the main text (Fig. 2).

Numerical implementation

The model was implemented in the R package pomp¹¹⁴, operating in R version 4.4.1¹¹⁵. All figures were created with the R package ggplot¹¹⁶, and the data for the maps was obtained from Natural Earth (https://www.naturalearthdata.com). Other packages used for specific vignettes are cited below.

Vignette 1 on descendants and measurement bias

Simulation details

The simulations for this vignette were based on climatic data in Lübeck, Germany, a location with a temperate oceanic climate (Köppen–Geiger classification: Cfb) characterized by large seasonal variability in temperature (CV: 0.59), little variability in relative humidity (CV: 0.07) and marked correlation between the two (r_s = −0.48). The model was simulated for three different values of the basic reproduction number (1.25, 2.5 and 5) and an average duration of immunity of 1 year; the other parameters were fixed to the values indicated in Supplementary Table 3.

Regression model for time series of observed cases

To identify a candidate regression model for the variable ${C}_{t}^{({\mathrm{O}})}$ (observed incidence rate), we first log-transformed the variable C_t (true incidence rate):

$$\log {C}_{t}=\log \left({\beta }_{t-1}\frac{{I}_{t-1}}{N}{S}_{t-1}\right)=C+{\delta }_{{{\mathrm{Te}}}}{{{\mathrm{Te}}}}_{t-1}^{{\prime} }+{\delta }_{{{\mathrm{RH}}}}{{{\mathrm{RH}}}}_{t-1}^{{\prime} }+\log \left({I}_{t-1}{S}_{t-1}\right)$$

where C is a constant. This equation, alongside the negative binomial observation model connecting ${C}_{t}^{({\mathrm{O}})}$ and ${C}_{t}$, shows that a natural candidate model for ${C}_{t}^{({\mathrm{O}})}$ is a negative binomial regression model with log-link, and ${{{\mathrm{Te}}}}_{t-1}^{{\prime} }$ and ${{{\mathrm{RH}}}}_{t-1}^{{\prime} }$ as covariates. In practice, the variable ${I}_{t-1}{S}_{t-1}$ is unobserved but may be captured by including a function of time as a covariate⁶⁸. Here, we fitted a negative binomial GAM¹¹⁷ that included the climatic covariates and a smooth of time to capture temporal variations in this variable. To give the regression model enough flexibility to capture these variations (which may occur over fast time scales; see Fig. 3), we set the basis dimension of the smooth to 50, or approximately 1 degree of freedom per 10 weeks of data. As a control, we also verified that the exact model with $\log ({S}_{t-1}{I}_{t-1})$ as a covariate yielded, on average, unbiased estimates of ${\delta }_{{{\mathrm{Te}}}}$ and ${\delta }_{{{\mathrm{RH}}}}$ (Supplementary Fig. 1). All the regression models were fitted using the mgcv package (version 1.8-42) in R¹¹⁷.

Regression models for time series of effective reproduction numbers

By definition, the time-varying effective reproduction number ${R}_{{\mathrm{e}},t}$ equals:

$${R}_{{\mathrm{e}},t}=\frac{{R}_{0,t}{S}_{t}}{N}\approx \frac{{\beta }_{t}{S}_{t}}{N}$$

where ${R}_{0,t}=\frac{{\beta }_{t}}{(\mu +1)}$ is the time-varying basic reproduction number and $\mu \ll 1$ (per week) is the death rate. Taking logs, we find that:

$$\log {R}_{{\mathrm{e}},t}=C+{\delta }_{{{\mathrm{Te}}}}{{{\mathrm{Te}}}}_{t}^{{\prime} }+{\delta }_{{{\mathrm{RH}}}}{{{\mathrm{RH}}}}_{t}^{{\prime} }+\log ({S}_{t})$$

where C is a constant. The corresponding DAG, shown in Supplementary Fig. 2, differs from the DAG for the incidence rate (Fig. 2) in that the effective reproduction number depends on just one unobserved variable (${S}_{t}$), while the incidence rate depends on two (${S}_{t}$ and ${I}_{t}$).

To derive an estimator for ${R}_{{\mathrm{e}},t}$, we first express it as a function of the incidence rates ${C}_{t}$. Specifically (see ‘Model formulation’), ${C}_{t+1}={\lambda }_{t}{S}_{t}=\frac{{\beta }_{t}{I}_{t}{S}_{t}}{N}\approx {R}_{{\mathrm{e}},t}{I}_{t}$ and ${I}_{t}={C}_{t}-\mu {I}_{t-1}\approx {C}_{t}$ (neglecting mortality). Hence, we find the following equation for ${R}_{{\mathrm{e}},t}$:

$${R}_{{\mathrm{e}},t}\approx \frac{{C}_{t+1}}{{C}_{t}}$$

In other words, the effective reproduction number is approximately equal to the epidemic growth rate, as expected intuitively because of our assumption of a 1-week fixed generation time.

In practice, because of under-reporting, the true values of ${R}_{{\mathrm{e}},t}$ are unobserved. However, estimation is possible via renewal models that back-calculate ${R}_{{\mathrm{e}},t}$ values from the generation time distribution and the observed incidence rates. Denoting by ${R}_{{\mathrm{e}},t}^{({\mathrm{O}})}$ this estimator, we generated it as follows:

$$\log{R}_{{\mathrm{e}},t}^{({\mathrm{O}})}\sim{N}(\mu =\log{R}_{{\mathrm{e}},t},\sigma =0.1)$$

In other words, we assumed a small dose of noise (approximately 10% around the true value) but no systematic bias in the estimation ${R}_{{\mathrm{e}},t}$.

We then fitted normal (N) regression models of $\log {R}_{{\mathrm{e}},t}^{({\mathrm{O}})}$ (outcomes) with ${{{\mathrm{Te}}}}_{t}^{{\prime} }$, ${{{\mathrm{RH}}}}_{t}^{{\prime} }$ and a smooth of time as covariates (Supplementary Fig. 1). As a control, we also verified that the true regression model with ${{{\mathrm{Te}}}}_{t}^{{\prime} }$, ${{{\mathrm{RH}}}}_{t}^{{\prime} }$ and $\log {S}_{t}$ as covariates yielded, on average, unbiased estimates (Supplementary Fig. 1).

Vignette 2 on climate variability as natural experiments

Simulation details

The simulations for this vignette were based on climatic data in Bogotá, Colombia, and Lübeck, Germany. In marked contrast to Lübeck’s climate (see above), Bogotá’s climate is classified as warm and temperate (Köppen classification: Csb), with little seasonality in temperature because of proximity to the Equator (CV: 0.04) but larger variability in relative humidity (CV: 0.07) and decoupling between the two climatic variables (r_s = −0.1).

To introduce model misspecification during the estimation of model parameters, we implemented a stochastic transmission model where the deterministic transmission rate β_t was multiplied at every time step by gamma white noise with mean 1 and standard deviation 0.02 (ref. ¹¹⁸). The complete model was, therefore, fully stochastic, with noise affecting both the transmission and the observation models. The model parameters were set as follows: basic reproduction number of 1.25, average duration of immunity of 1 year and other parameters fixed to the values indicated in Supplementary Table 3.

Parameter estimation protocol

The following six model parameters were assumed unknown and estimated from the data: basic reproduction number (${R}_{0}$), waning immunity rate ($\alpha$), mean reporting probability ($\bar{\rho }$), reporting over-dispersion (${\rho }_{{\mathrm{k}}}$) and climatic parameters (${\delta }_{{{\mathrm{Te}}}}$ and ${\delta }_{{{\mathrm{RH}}}}$). To generate the synthetic data for estimation, we first generated 100 replicate time series of observed weekly cases (${C}_{t}^{({\mathrm{O}})}$) from the fully stochastic model. For every replicate time series, we then fitted the misspecified model—with a deterministic transmission model and stochastic observation model—using trajectory matching⁹². Specifically, we used the Nelder–Mead algorithm¹¹⁹ (initialized at the true parameter values) to maximize the log-likelihood and identify the maximum likelihood parameter estimates. All the parameters were estimated on an unconstrained scale using log (parameters ${R}_{0}$, $\alpha$ and ${\rho }_{{\mathrm{k}}}$) or logit (parameter $\bar{\rho }$) transformations.

Vignette 3 on confounding bias

Simulation details

The simulations for this vignette were based on climatic data from 15 weather stations in Spain and 19 in Colombia, located near the major cities of both countries. Continental Spain exhibits relatively uniform temperate climates with consistent seasonal variations of temperature and humidity seasonality across the country. In contrast, Colombia displays a range of tropical climates, with diverse seasonal patterns of precipitation along a latitudinal gradient¹⁰¹. The models were simulated without incorporating spatial spread between the locations and with a basic reproduction number of 2.5 and an average duration of immunity of 2 years to achieve yearly epidemics in all the locations. The other parameters were fixed to the values indicated in Supplementary Table 3.

Epidemic synchrony and assessment of spatial spread

To assess spatial synchrony between locations, we estimated the non-parametric (cross-) correlation function (NCF) of the simulated time series using the NCF package¹⁰⁰. We estimated the 95% confidence intervals using 500 bootstraps.

To diagnose the risk of confounding bias caused by climate, we estimated the speed of the potential travelling wave that would have resulted from spatial diffusion. To do so, we first estimated the difference in epidemic peak timing as the lag (in weeks), maximizing the cross-correlation function between every location and a reference location (Riohacha for Colombia and Gijón for Spain). We then regressed this difference against the geographical distance between locations and estimated the speed wave as the inverse of the regression coefficient.

To further evaluate how climate can confound the estimate of spatial spread between locations, we fitted negative binomial Gaussian process models, that assumes a joint multivariate normal (MVN) distribution, to the time series of observed incidence from all locations in Colombia:

$$\left(\begin{array}{c}\log {C}_{t}^{\left({\mathrm{O}}\right)\left(1\right)}\\ \log {C}_{t}^{\left({\mathrm{O}}\right)\left(2\right)}\\ \cdots \\ \log {C}_{t}^{\left({\mathrm{O}}\right)\left(19\right)}\end{array}\right)\sim{\mathrm{MVN}}\left(\left[\begin{array}{c}\log {{\rm{\mu }}}_{t}^{\left(1\right)}\\ \log {{\rm{\mu }}}_{t}^{\left(2\right)}\\ \cdots \\ \log {{\rm{\mu }}}_{t}^{\left(19\right)}\end{array}\right],K\right)$$

The Gaussian process captures the covariance K^(ij) between the incidence of any pair of locations i and j as a function of the geographic distance between locations, defined by the exponential quadratic kernel:

$${K}^{\,({ij})}=\,{\eta }^{2}\mathrm{exp}\left(-\frac{{||}{x}^{\left(i\right)}-{x}^{\left(j\right)}{{||}}^{2}}{2{\varphi }^{2}}\right)\,$$

where ||x⁽ⁱ⁾ − x^(j)|| is the Euclidean distance between two locations, η is the maximum covariance between locations and $\varphi$ is a characteristic distance that controls the spatial scale over which the covariance varies. Based on the proposed ‘true’ and ‘smooth’ regression models described in the methods for vignette 1, we fitted models including the size of the infected and susceptible population or a smooth of time with or without the environmental variables. For example, for the true model, we wrote $\log {\mu }_{t}^{(1)}\approx \log ({S}_{t-1}^{(1)}{I}_{t-1}^{(1)})+{{{\mathrm{Te}}}}_{t-1}^{{(1)}^{{\prime} }}+{{{\mathrm{RH}}}}_{t-1}^{{(1)}^{{\prime} }}$ in location 1. Finally, we calculated the correlation matrix between the incidence of all locations from the estimated covariance matrix K. All models were fitted using the brms package¹²⁰ with four Markov chains, each run for 10,000 iterations, assuming uninformative priors.

Two-location transmission model with spatial diffusion

As purely statistical approaches resulted in confounded estimations of spatial spread, we moved on to estimate spatial spread with transmission models. We extended our climate-forced SIRS model to include spatial diffusion by dividing the population into two coupled locations, for i = 1,2 (refs. ^121,122):

$${{S}^{(i)}_{t+1}}={{S}^{(i)}_{t}}+\mu {N}^{(i)}+\alpha {{R}^{(i)}_{t}}-({{\lambda }^{(i)}_{t}}+\mu ){{S}^{(i)}_{t}}$$

$${{I}^{(i)}_{t+1}}={{\lambda }^{(i)}_{t}}{{S}^{(i)}_{t}}-\mu {{I}^{(i)}_{t}}$$

$${{R}^{(i)}_{t+1}}={{R}^{(i)}_{t}}+{{I}^{(i)}_{t}}-(\alpha +\mu ){{R}^{(i)}_{t}}$$

with the force of infection in each location i given by:

$${\lambda }^{(i)}(t)=\,\frac{{\beta }^{(i1)}{I}^{(1)}(t)}{{N}^{(1)}}+\frac{{\beta }^{(i2)}{I}^{(2)}(t)}{{N}^{(2)}}\,$$

We assumed a symmetric scenario where the transmission rate is the same within patches (${\beta }^{({ij})}=\beta$ if i = j) but differs between patches (${\beta }^{({ij})}=\tau \beta$ if $i\ne j$, where $0\le \tau \le 1$ is the coupling strength). In this case, each location had the same population (${N}^{(1)}={N}^{(2)}=N$), and the endemic equilibria were independent of the location, given by ${S}^{* }=\frac{N}{{R}_{0}}$, ${I}^{* }=\frac{\alpha +\mu }{\alpha +\mu +1}(N-{S}^{* })$, and ${R}^{* }=N-{S}^{* }-{I}^{* }$, where ${R}_{0}=\frac{\beta (1+\tau )}{\mu +1}$.

Parameter estimation protocol

As in vignette 2, we assumed the same six parameters unknown and estimated from the simulated data without spatial spread. We fitted the two-location transmission model with spatial diffusion for every location and a reference location (Riohacha for Colombia and Gijón for Spain) using trajectory matching⁹². Again, we used the Nelder–Mead algorithm¹¹⁹ (initialized at the true parameter values) to maximize the log-likelihood and identify the maximum likelihood parameter estimates. Then, we generated likelihood profiles to estimate the spatial spread parameter $\tau$ by varying the parameter while maximizing the likelihood over the remaining parameters to obtain likelihood-ratio-test-based confidence intervals. All the parameters were estimated on an unconstrained scale using log (parameters ${R}_{0}$, $\alpha$ and ${\rho }_{{\mathrm{k}}}$) or logit (parameter $\bar{\rho }$ and $\tau$) transformations.

Vignette 4 on mediation, direct and indirect causal effects

Simulation details

The simulations in this vignette were based on climatic data from Pasto, Colombia, and Lübeck, Germany. As described before, in Lübeck, temperature displays larger seasonal variability than relative humidity. In contrast, in Pasto, seasonal variability in relative humidity is larger (CV: 0.19) than in temperature (CV: 0.07).

As temperature affects humidity, the total causal effect of temperature comprises a direct effect and an indirect effect mediated by relative humidity. Thus, we simulated a model representing the total effect of temperature (with climatic parameters to ${\delta }_{{{\mathrm{Te}}}}$ = −0.2 and ${\delta }_{{{\mathrm{RH}}}}$ = −0.2) and another model representing only the indirect effect of temperature (with climatic parameters fixed to ${\delta }_{{{\mathrm{Te}}}}$ = 0 and ${\delta }_{{{\mathrm{RH}}}}$ = −0.2). For these simulations, we fixed the other parameters as follows: basic reproduction number of 1.25, average duration of immunity of 1 year and other parameters as indicated in Supplementary Table 3.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

All data are available at https://github.com/DomenechLab/Causality_Seasonality and stored in Edmond, the open research data repository of the Max Planck Society, at https://doi.org/10.17617/3.9CWN7W. All weather data were extracted using the WeatherData function in Mathematica (https://reference.wolfram.com/language/ref/WeatherData.html), with the weather station names indicated in Supplementary Table 2.

Code availability

All R programming codes are available at https://github.com/DomenechLab/Causality_Seasonality and stored in Edmond, the open research data repository of the Max Planck Society, at https://doi.org/10.17617/3.9CWN7W.

References

Our Risk for Infectious Diseases is Increasing Because of Climate Change (National Center for Emerging and Zoonotic Infectious Diseases, 2021).
Impact of Climate Change on Infectious Diseases and Antimicrobial Resistance – Part 1 of the German Status Report on Climate Change and Health 2023 (Robert Koch Institute and Statistisches Bundesamt, 2023); https://www.rki.de/EN/Content/Health_Monitoring/Health_Reporting/GBEDownloadsJ/JHealthMonit_2023_S3_Status_report_climate_change_health_part1.html
Cissé, G. et al. in Climate Change 2022: Impacts, Adaptation and Vulnerability (eds Pörtner, H. O. et al.) 1041–1170 (Cambridge Univ. Press, 2022).
The Global Health Observatory Estimated Number of Malaria Deaths (World Health Organization, 2024); https://www.who.int/data/gho/data/indicators/indicator-details/GHO/estimated-number-of-malaria-deaths
Cholera Worldwide Overview: Geographical Distribution of Cholera Cases Reported Worldwide (European Centre for Disease Prevention and Control, 2024); https://www.ecdc.europa.eu/en/all-topics-z/cholera/surveillance-and-disease-data/cholera-monthly
Global Influenza Programme: Burden of Disease (World Health Organization, 2024); https://www.who.int/teams/global-influenza-programme/surveillance-and-monitoring/burden-of-disease
Colón-González, F. J. et al. Projecting the risk of mosquito-borne diseases in a warmer and more populated world: a multi-model, multi-scenario intercomparison modelling study. Lancet Planet. Health 5, e404–e414 (2021).
Article PubMed PubMed Central Google Scholar
Escobar, L. E. et al. A global map of suitability for coastal Vibrio cholerae under current and future climate conditions. Acta Trop. 149, 202–211 (2015).
Article PubMed Google Scholar
Kruger, S. E., Lorah, P. A. & Okamoto, K. W. Mapping climate change’s impact on cholera infection risk in Bangladesh. PLoS Glob. Public Health 2, e0000711 (2022).
Article PubMed PubMed Central Google Scholar
Baker, R. E. et al. Epidemic dynamics of respiratory syncytial virus in current and future climates. Nat. Commun. 10, 5512 (2019).
Article CAS PubMed PubMed Central Google Scholar
Baker, R. E., Mahmud, A. S. & Metcalf, C. J. E. Dynamic response of airborne infections to climate change: predictions for varicella. Clim. Change 148, 547–560 (2018).
Article Google Scholar
Siraj, A. S. et al. Altitudinal changes in malaria incidence in highlands of Ethiopia and Colombia. Science 343, 1154–1158 (2014).
Article CAS PubMed Google Scholar
Baker-Austin, C. et al. Emerging Vibrio risk at high latitudes in response to ocean warming. Nat. Clim. Change 3, 73–77 (2012).
Article Google Scholar
Martinez, M. E. The calendar of epidemics: seasonal cycles of infectious diseases. PLoS Pathog. 14, e1007327 (2018).
Article PubMed PubMed Central Google Scholar
Shaman, J. & Kohn, M. Absolute humidity modulates influenza survival, transmission, and seasonality. Proc. Natl Acad. Sci. USA 106, 3243–3248 (2009).
Article CAS PubMed PubMed Central Google Scholar
Marr, L. C., Tang, J. W., Van Mullekom, J. & Lakdawala, S. S. Mechanistic insights into the effect of humidity on airborne influenza virus survival, transmission and incidence. J. R. Soc. Interface 16, 20180298 (2019).
Article CAS PubMed PubMed Central Google Scholar
Lowen, A. C., Mubareka, S., Steel, J. & Palese, P. Influenza virus transmission is dependent on relative humidity and temperature. PLoS Pathog. 3, 1470–1476 (2007).
Article CAS PubMed Google Scholar
Morris, D. H. et al. Mechanistic theory predicts the effects of temperature and humidity on inactivation of SARS-CoV-2 and other enveloped viruses. Elife 10, e65902 (2021).
Article CAS PubMed PubMed Central Google Scholar
Huq, A., West, P. A., Small, E. B., Huq, M. I. & Colwell, R. R. Influence of water temperature, salinity, and pH on survival and growth of toxigenic Vibrio cholerae serovar 01 associated with live copepods in laboratory microcosms. Appl. Environ. Microbiol. 48, 420–424 (1984).
Article CAS PubMed PubMed Central Google Scholar
Jusot, J.-F. et al. Airborne dust and high temperatures are risk factors for invasive bacterial disease. J. Allergy Clin. Immunol. 139, 977–986.e2 (2017).
Article PubMed PubMed Central Google Scholar
Paaijmans, K. P. et al. Influence of climate on malaria transmission depends on daily temperature variation. Proc. Natl Acad. Sci. USA 107, 15135–15139 (2010).
Article CAS PubMed PubMed Central Google Scholar
Bayoh, M. N. & Lindsay, S. W. Effect of temperature on the development of the aquatic stages of Anopheles gambiae sensu stricto (Diptera: Culicidae). Bull. Entomol. Res. 93, 375–381 (2003).
Article CAS PubMed Google Scholar
van der Worp, H. B. et al. Can animal models of disease reliably inform human studies? PLoS Med. 7, e1000245 (2010).
Article PubMed PubMed Central Google Scholar
Willem, L., Van Kerckhove, K., Chao, D. L., Hens, N. & Beutels, P. A nice day for an infection? Weather conditions and social contact patterns relevant to influenza transmission. PLoS ONE 7, e48695 (2012).
Article CAS PubMed PubMed Central Google Scholar
Hernán, M. A. & Robins, J. M. Causal Inference: What If (Chapman & Hall/CRC, 2020).
Neal, B. Introduction to Causal Inference from a Machine Learning Perspective (2020); https://www.bradyneal.com/Introduction_to_Causal_Inference-Dec17_2020-Neal.pdf
Kurth, T. Continuing to advance epidemiology. Front. Epidemiol. 1, 782374 (2021).
Article PubMed PubMed Central Google Scholar
Liu, T., Ungar, L. & Kording, K. Quantifying causality in data science with quasi-experiments. Nat. Comput. Sci. 1, 24–32 (2021).
Article PubMed PubMed Central Google Scholar
Collazo, A., Kuhn, H.-G., Kurth, T., Piccininni, M. & Rohmann, J. L. Rethinking animal attrition in preclinical research: expressing causal mechanisms of selection bias using directed acyclic graphs. J. Cereb. Blood Flow Metab. https://doi.org/10.1177/0271678X241275760 (2024).
Hernán, M. A., Wang, W. & Leaf, D. E. Target trial emulation: a framework for causal inference from observational data. JAMA 328, 2446–2447 (2022).
Article PubMed Google Scholar
Liang, L. & Gong, P. Climate change and human infectious diseases: a synthesis of research findings from global and spatio-temporal perspectives. Environ. Int. 103, 99–108 (2017).
Article PubMed Google Scholar
Kim, Y.-M., Park, J.-W. & Cheong, H.-K. Estimated effect of climatic variables on the transmission of Plasmodium vivax malaria in the Republic of Korea. Environ. Health Perspect. 120, 1314–1319 (2012).
Article PubMed PubMed Central Google Scholar
Jusot, J.-F. & Alto, O. Short term effect of rainfall on suspected malaria episodes at Magaria, Niger: a time series study. Trans. R. Soc. Trop. Med. Hyg. 105, 637–643 (2011).
Article PubMed Google Scholar
Haque, U. et al. The role of climate variability in the spread of malaria in Bangladeshi highlands. PLoS ONE 5, e14341 (2010).
Article CAS PubMed PubMed Central Google Scholar
Xiao, D. et al. Spatiotemporal distribution of malaria and the association between its epidemic and climate factors in Hainan, China. Malar. J. 9, 185 (2010).
Article PubMed PubMed Central Google Scholar
Olson, S. H. et al. Links between climate, malaria, and wetlands in the Amazon Basin. Emerg. Infect. Dis. 15, 659–662 (2009).
Article PubMed PubMed Central Google Scholar
Hashizume, M., Terao, T. & Minakawa, N. The Indian Ocean Dipole and malaria risk in the highlands of western Kenya. Proc. Natl Acad. Sci. USA 106, 1857–1862 (2009).
Article CAS PubMed PubMed Central Google Scholar
Teklehaimanot, H. D., Schwartz, J., Teklehaimanot, A. & Lipsitch, M. Weather-based prediction of Plasmodium falciparum malaria in epidemic-prone regions of Ethiopia II. Weather-based prediction systems perform comparably to early detection systems in identifying times for interventions. Malar. J. 3, 44 (2004).
Article PubMed PubMed Central Google Scholar
Teklehaimanot, H. D., Lipsitch, M., Teklehaimanot, A. & Schwartz, J. Weather-based prediction of Plasmodium falciparum malaria in epidemic-prone regions of Ethiopia I. Patterns of lagged weather effects reflect biological mechanisms. Malar. J. 3, 41 (2004).
Article PubMed PubMed Central Google Scholar
Abeku, T. A. et al. Effects of meteorological factors on epidemic malaria in Ethiopia: a statistical modelling approach based on theoretical reasoning. Parasitology 128, 585–593 (2004).
Article CAS PubMed Google Scholar
Hii, Y. L., Zhu, H., Ng, N., Ng, L. C. & Rocklöv, J. Forecast of dengue incidence using temperature and rainfall. PLoS Negl. Trop. Dis. 6, e1908 (2012).
Article PubMed PubMed Central Google Scholar
Gomes, A. F., Nobre, A. A. & Cruz, O. G. Temporal analysis of the relationship between dengue and meteorological variables in the city of Rio de Janeiro, Brazil, 2001–2009. Cad. Saude Publica 28, 2189–2197 (2012).
Article PubMed Google Scholar
Lowe, R. et al. The development of an early warning system for climate-sensitive disease risk with a focus on dengue epidemics in southeast Brazil. Stat. Med. 32, 864–883 (2013).
Article PubMed Google Scholar
Hashizume, M., Dewan, A. M., Sunahara, T., Rahman, M. Z. & Yamamoto, T. Hydroclimatological variability and dengue transmission in Dhaka, Bangladesh: a time-series study. BMC Infect. Dis. 12, 98 (2012).
Article PubMed PubMed Central Google Scholar
Earnest, A., Tan, S. B. & Wilder-Smith, A. Meteorological factors and El Niño Southern Oscillation are independently associated with dengue infections. Epidemiol. Infect. 140, 1244–1251 (2012).
Article CAS PubMed Google Scholar
Pham, H. V., Doan, H. T. M., Phan, T. T. T. & Minh, N. N. T. Ecological factors associated with dengue fever in a Central Highlands province, Vietnam. BMC Infect. Dis. 11, 172 (2011).
Article PubMed PubMed Central Google Scholar
Pinto, E., Coelho, M., Oliver, L. & Massad, E. The influence of climate variables on dengue in Singapore. Int. J. Environ. Health Res. 21, 415–426 (2011).
Article PubMed Google Scholar
Shang, C.-S. et al. The role of imported cases and favorable meteorological conditions in the onset of dengue epidemics. PLoS Negl. Trop. Dis. 4, e775 (2010).
Article PubMed PubMed Central Google Scholar
Chen, S.-C. et al. Lagged temperature effect with mosquito transmission potential explains dengue variability in southern Taiwan: insights from a statistical analysis. Sci. Total Environ. 408, 4069–4075 (2010).
Article CAS PubMed Google Scholar
Tipayamongkholgul, M., Fang, C.-T., Klinchan, S., Liu, C.-M. & King, C.-C. Effects of the El Niño–Southern Oscillation on dengue epidemics in Thailand, 1996–2005. BMC Public Health 9, 422 (2009).
Article PubMed PubMed Central Google Scholar
Lu, L. et al. Time series analysis of dengue fever and weather in Guangzhou, China. BMC Public Health 9, 395 (2009).
Article CAS PubMed PubMed Central Google Scholar
Johansson, M. A., Dominici, F. & Glass, G. E. Local and global effects of climate on dengue transmission in Puerto Rico. PLoS Negl. Trop. Dis. 3, e382 (2009).
Article PubMed PubMed Central Google Scholar
Thammapalo, S., Chongsuwiwatwong, V., McNeil, D. & Geater, A. The climatic factors influencing the occurrence of dengue hemorrhagic fever in Thailand. Southeast Asian J. Trop. Med. Public Health 36, 191–196 (2005).
PubMed Google Scholar
Hashizume, M. et al. The Indian Ocean Dipole and cholera incidence in Bangladesh: a time-series analysis. Environ. Health Perspect. 119, 239–244 (2011).
Article PubMed Google Scholar
Rajendran, K. et al. Influence of relative humidity in Vibrio cholerae infection: a time series model. Indian J. Med. Res. 133, 138–145 (2011).
CAS PubMed PubMed Central Google Scholar
Hashizume, M., Faruque, A. S. G., Wagatsuma, Y., Hayashi, T. & Armstrong, B. Cholera in Bangladesh: climatic components of seasonal variation. Epidemiology 21, 706–710 (2010).
Article PubMed Google Scholar
Paz, S. Impact of temperature variability on cholera incidence in southeastern Africa, 1971–2006. Ecohealth 6, 340–345 (2009).
Article PubMed Google Scholar
Constantin de Magny, G. et al. Environmental signatures associated with cholera epidemics. Proc. Natl Acad. Sci. USA 105, 17676–17681 (2008).
Article CAS PubMed PubMed Central Google Scholar
Martinez-Urtaza, J. et al. Emergence of Asiatic Vibrio diseases in South America in phase with El Niño. Epidemiology 19, 829–837 (2008).
Article PubMed Google Scholar
Luque Fernández, M. A. et al. Influence of temperature and rainfall on the evolution of cholera epidemics in Lusaka, Zambia, 2003–2006: analysis of a time series. Trans. R. Soc. Trop. Med. Hyg. 103, 137–143 (2009).
Article PubMed Google Scholar
Hashizume, M. et al. The effect of rainfall on the incidence of cholera in Bangladesh. Epidemiology 19, 103–110 (2008).
Article PubMed Google Scholar
Huq, A. et al. Critical factors influencing the occurrence of Vibrio cholerae in the environment of Bangladesh. Appl. Environ. Microbiol. 71, 4645–4654 (2005).
Article CAS PubMed PubMed Central Google Scholar
Hu, W. et al. Did socio-ecological factors drive the spatiotemporal patterns of pandemic influenza A (H1N1)? Environ. Int. 45, 39–43 (2012).
Article PubMed Google Scholar
Jusot, J.-F., Adamou, L. & Collard, J.-M. Influenza transmission during a one-year period (2009–2010) in a Sahelian city: low temperature plays a major role. Influenza Other Respi. Viruses 6, 87–89 (2012).
Article Google Scholar
Runge, J. et al. Inferring causation from time series in Earth system sciences. Nat. Commun. 10, 2553 (2019).
Article PubMed PubMed Central Google Scholar
Deyle, E. R., Maher, M. C., Hernandez, R. D., Basu, S. & Sugihara, G. Global environmental drivers of influenza. Proc. Natl Acad. Sci. USA 113, 13081–13086 (2016).
Article CAS PubMed PubMed Central Google Scholar
Moazeni, M., Rahimi, M. & Ebrahimi, A. What are the effects of climate variables on COVID-19 pandemic? A systematic review and current update. Adv. Biomed. Res. 12, 33 (2023).
Article PubMed PubMed Central Google Scholar
Imai, C., Armstrong, B., Chalabi, Z., Mangtani, P. & Hashizume, M. Time series regression model for infectious disease and weather. Environ. Res. 142, 319–327 (2015).
Article CAS PubMed Google Scholar
Metcalf, C. J. E. et al. Identifying climate drivers of infectious disease dynamics: recent advances and challenges ahead. Proc. R. Soc. B 284, 20170901 (2017).
Article PubMed PubMed Central Google Scholar
Pearl, J. Causality: Models, Reasoning, and Inference (Cambridge Univ. Press, 2009).
Pearl, J. 3. The foundations of causal inference. Sociol. Methodol. 40, 75–149 (2010).
Article Google Scholar
Digitale, J. C., Martin, J. N. & Glymour, M. M. Tutorial on directed acyclic graphs. J. Clin. Epidemiol. 142, 264–267 (2022).
Article PubMed Google Scholar
Mina, M. J. et al. A Global lmmunological Observatory to meet a time of pandemics. Elife 9, e58989 (2020).
Article PubMed PubMed Central Google Scholar
Keeling, M. J. & Rohani, P. Modeling Infectious Diseases in Humans and Animals (Princeton Univ. Press, 2008).
Kramer, S. C. & Shaman, J. Development and validation of influenza forecasting for 64 temperate and tropical countries. PLoS Comput. Biol. 15, e1006742 (2019).
Article CAS PubMed PubMed Central Google Scholar
Weber, A., Weber, M. & Milligan, P. Modeling epidemics caused by respiratory syncytial virus (RSV). Math. Biosci. 172, 95–113 (2001).
Article CAS PubMed Google Scholar
Singh, P. Relative Humidity Calculator (Omni Calculator, 2022); https://www.omnicalculator.com/physics/relative-humidity
Lawrence, M. G. The relationship between relative humidity and the dewpoint temperature in moist air: a simple conversion and applications. Bull. Am. Meteorol. Soc. 86, 225–234 (2005).
Article Google Scholar
Johndrow, J., Ball, P., Gargiulo, M. & Lum, K. Estimating the number of SARS-CoV-2 infections and the impact of mitigation policies in the United States. Harv. Data Sci. Rev. 11, 202–224 (2017).
Google Scholar
Osthus, D., Hickmann, K. S., Caragea, P. C., Higdon, D. & Del Valle, S. Y. Forecasting seasonal influenza with a state-space SIR model. Ann. Appl. Stat. 11, 202–224 (2017).
Article PubMed PubMed Central Google Scholar
van Smeden, M., Lash, T. L. & Groenwold, R. H. H. Reflection on modern methods: five myths about measurement error in epidemiological research. Int. J. Epidemiol. 49, 338–347 (2020).
Article PubMed Google Scholar
Ackley, S. F. et al. Compartmental model diagrams as causal representations in relation to DAGs. Epidemiol. Methods 6, 20060007 (2017).
Article PubMed PubMed Central Google Scholar
Imai, C. & Hashizume, M. A systematic review of methodology: time series regression analysis for environmental factors and infectious diseases. Trop. Med. Health 43, 1–9 (2015).
Article PubMed Google Scholar
McElreath, R. Statistical Rethinking: A Bayesian Course with Examples in R and Stan 2nd edn (Chapman and Hall/CRC, 2020).
Nash, R. K., Nouvellet, P. & Cori, A. Real-time estimation of the epidemic reproduction number: scoping review of the applications and challenges. PLOS Digit. Health 1, e0000052 (2022).
Google Scholar
Madeleine, C. T. & Simon, J. M. Climate Information For Public Health Action (Routledge, 2018).
Beck, H. E. et al. Present and future Köppen–Geiger climate classification maps at 1-km resolution. Sci Data 5, 180214 (2018).
Article PubMed PubMed Central Google Scholar
Butsic, V., Lewis, D. J., Radeloff, V. C., Baumann, M. & Kuemmerle, T. Quasi-experimental methods enable stronger inferences from observational data in ecology. Basic Appl. Ecol. 19, 1–10 (2017).
Article Google Scholar
Sanderson, E. et al. Mendelian randomization. Nat. Rev. Methods Primers 2, 6 (2022).
Article CAS PubMed PubMed Central Google Scholar
Hsiang, S. M., Meng, K. C. & Cane, M. A. Civil conflicts are associated with the global climate. Nature 476, 438–441 (2011).
Article CAS PubMed Google Scholar
Pascual, M., Rodó, X., Ellner, S. P., Colwell, R. & Bouma, M. J. Cholera dynamics and El Niño-Southern Oscillation. Science 289, 1766–1769 (2000).
Article CAS PubMed Google Scholar
King, A. A., Nguyen, D. & Ionides, E. L. Statistical inference for partially observed Markov processes via the R package pomp. J. Stat. Softw. 69, 1–43 (2016).
Article Google Scholar
Lavielle, M. Mixed Effects Models for the Population Approach: Models, Tasks, Methods and Tools (CRC Press, 2014).
Bretó, C., Ionides, E. L. & King, A. A. Panel data analysis via mechanistic models. J. Am. Stat. Assoc. 115, 1178–1188 (2019).
Article PubMed PubMed Central Google Scholar
Grenfell, B. T., Bjørnstad, O. N. & Kappey, J. Travelling waves and spatial hierarchies in measles epidemics. Nature 414, 716–723 (2001).
Article CAS PubMed Google Scholar
Martinez-Bakker, M., King, A. A. & Rohani, P. Unraveling the transmission ecology of polio. PLoS Biol. 13, e1002172 (2015).
Article PubMed PubMed Central Google Scholar
Viboud, C. et al. Synchrony, waves, and spatial hierarchies in the spread of influenza. Science 312, 447–451 (2006).
Article CAS PubMed Google Scholar
Choisy, M. & Rohani, P. Changing spatial epidemiology of pertussis in continental USA. Proc. R. Soc. B 279, 4574–4581 (2012).
Article PubMed PubMed Central Google Scholar
Barrero Guevara, L. A. et al. Delineating the seasonality of varicella and its association with climate in the tropical country of Colombia. J. Infect. Dis. 228, 674–683 (2023).
Article PubMed PubMed Central Google Scholar
Bjørnstad, O. N., Ims, R. A. & Lambin, X. Spatial population dynamics: analyzing patterns and processes of population synchrony. Trends Ecol. Evol. 14, 427–432 (1999).
Article PubMed Google Scholar
Urrea, V., Ochoa, A. & Mesa, O. Seasonality of rainfall in Colombia. Water Resour. Res. 55, 4149–4162 (2019).
Article Google Scholar
Bjørnstad, O. N. Epidemics: Models and Data Using R (Springer, 2018).
Cummings, D. A. T. et al. Travelling waves in the occurrence of dengue haemorrhagic fever in Thailand. Nature 427, 344–347 (2004).
Article CAS PubMed Google Scholar
Westreich, D. & Greenland, S. The table 2 fallacy: presenting and interpreting confounder and modifier coefficients. Am. J. Epidemiol. 177, 292–298 (2013).
Article PubMed PubMed Central Google Scholar
Yang, W., Elankumaran, S. & Marr, L. C. Relationship between humidity and influenza A viability in droplets and implications for influenza’s seasonality. PLoS ONE 7, e46789 (2012).
Article CAS PubMed PubMed Central Google Scholar
Weiser, J. N., Ferreira, D. M. & Paton, J. C. Streptococcus pneumoniae: transmission, colonization and invasion. Nat. Rev. Microbiol. 16, 355–367 (2018).
Article CAS PubMed PubMed Central Google Scholar
Opatowski, L. et al. Assessing pneumococcal meningitis association with viral respiratory infections and antibiotics: insights from statistical and mathematical models. Proc. R. Soc. B 280, 20130519 (2013).
Article PubMed PubMed Central Google Scholar
Domenech de Cellès, M. et al. Unraveling the seasonal epidemiology of pneumococcus. Proc. Natl Acad. Sci. USA 116, 1802–1807 (2019).
Article PubMed PubMed Central Google Scholar
Nguyen, J. L. & Dockery, D. W. Daily indoor-to-outdoor temperature and humidity relationships: a sample across seasons and diverse climatic regions. Int. J. Biometeorol. 60, 221–229 (2016).
Article PubMed Google Scholar
Verheyen, C. A. & Bourouiba, L. Associations between indoor relative humidity and global COVID-19 outcomes. J. R. Soc. Interface 19, 20210865 (2022).
Article CAS PubMed PubMed Central Google Scholar
WeatherData: Wolfram Language Function (Wolfram Research, 2014); https://reference.wolfram.com/language/ref/WeatherData.html
Alduchov, O. A. & Eskridge, R. E. Improved magnus form approximation of saturation vapor pressure. J. Appl. Meteorol. Climatol. 35, 601–609 (1996).
Article Google Scholar
Prussin, A. J. II et al. Survival of the enveloped virus Phi6 in droplets as a function of relative humidity, absolute humidity, and temperature. Appl. Environ. Microbiol. 84, e00551-18 (2018).
Article CAS PubMed PubMed Central Google Scholar
King, A. A. et al. pomp: statistical inference for partially-observed Markov processes. GitHub https://kingaa.github.io/pomp/ (2023).
R Core Team. R: a language and environment for statistical computing. The R Foundation https://www.R-project.org/ (2023).
Wickham, H. ggplot2: Elegant Graphics for Data Analysis (Springer-Verlag, 2016).
Wood, S. N. Generalized Additive Models: An Introduction with R (CRC Press/Taylor & Francis Group, 2017).
He, D., Ionides, E. L. & King, A. A. Plug-and-play inference for disease dynamics: measles in large and small populations as a case study. J. R. Soc. Interface 7, 271–283 (2010).
Article PubMed Google Scholar
Nelder, J. A. & Mead, R. A simplex method for function minimization. Comput. J. 7, 308–313 (1965).
Article Google Scholar
Bürkner, P.-C. brms: an R package for Bayesian multilevel models using Stan. J. Stat. Softw. 80, 1–28 (2017).
Article Google Scholar
Lloyd, A. L. & May, R. M. Spatial heterogeneity in epidemic models. J. Theor. Biol. 179, 1–11 (1996).
Article CAS PubMed Google Scholar
Brauer, F., Castillo-Chavez, C. & Feng, Z. Spatial structure in disease transmission models. Math. Models Epidemiol. 69, 457 (2019).
Article Google Scholar

Download references

Acknowledgements

This work was funded by the Max Planck Society through the core funding of M.D.d.C.’s Max Planck Research Group at the Max Planck Institute for Infection Biology.

Funding

Open access funding provided by Max Planck Society.

Author information

Authors and Affiliations

Max Planck Institute for Infection Biology, Infectious Disease Epidemiology Group, Campus Charité Mitte, Berlin, Germany
Laura Andrea Barrero Guevara, Sarah C. Kramer & Matthieu Domenech de Cellès
Institute of Public Health, Charité–Universitätsmedizin Berlin, Berlin, Germany
Laura Andrea Barrero Guevara & Tobias Kurth

Authors

Laura Andrea Barrero Guevara
View author publications
Search author on:PubMed Google Scholar
Sarah C. Kramer
View author publications
Search author on:PubMed Google Scholar
Tobias Kurth
View author publications
Search author on:PubMed Google Scholar
Matthieu Domenech de Cellès
View author publications
Search author on:PubMed Google Scholar

Contributions

L.A.B.G. and M.D.d.C. conceptualized the project and designed the methods. Model implementation and analyses were carried out by L.A.B.G. and M.D.d.C. L.A.B.G. and M.D.d.C. wrote the paper, with support from S.C.K. and T.K. M.D.d.C. supervised the project. All authors approved the final version of the paper.

Corresponding author

Correspondence to Matthieu Domenech de Cellès.

Ethics declarations

Competing interests

T.K. reports outside the submitted work, having received research grants from the Gemeinsamer Bundesausschuss (G-BA—Federal Joint Committee, Germany). He has also received personal compensation from Eli Lilly and Company, Novartis, the BMJ and Frontiers. M.D.d.C. received postdoctoral funding (2017–2019) from Pfizer and consulting fees from GSK. The other authors declare no competing interests.

Peer review

Peer review information

Nature Ecology & Evolution thanks Rachel Baker, Paul Ferraro and David Fisman for their contribution to the peer review of this work. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Supplementary Methods (Review), Supplementary Tables 1–3 and Supplementary Figs. 1–5.

Reporting Summary

Peer Review File

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Barrero Guevara, L.A., Kramer, S.C., Kurth, T. et al. Causal inference concepts can guide research into the effects of climate on infectious diseases. Nat Ecol Evol 9, 349–363 (2025). https://doi.org/10.1038/s41559-024-02594-3

Download citation

Received: 12 February 2024
Accepted: 31 October 2024
Published: 25 November 2024
Issue date: February 2025
DOI: https://doi.org/10.1038/s41559-024-02594-3

This article is cited by

Sex-specific effects of antagonistic coevolution: insights from an insect host and a bacterial pathogen coevolution system
- Neetika Ahlawat
- Manas Geeta Arun
- Prasad Nagaraj Guru
Evolutionary Ecology (2025)

Subjects

Abstract

Similar content being viewed by others

Perspectives on climate change and infectious disease outbreaks: is the evidence there?

Climate warming and influenza dynamics: the modulating effects of seasonal temperature increases on epidemic patterns

Identifying outbreak risk factors through case-controls comparisons

Main

Causal inference in climate–infectious disease research

Results

A causal inference framework for a model of infectious disease transmission

Causal inference concepts–illustrations with four vignettes

Vignette 1 on descendants, measurement bias and the intricate association between environmental variables and incidence rate

Vignette 2 on climate variability as natural experiments to estimate the individual effect of meteorological variables

Vignette 3 on confounding bias and how climate variability can masquerade as spatial spread

Vignette 4 on mediation and the direct and indirect causal effects of temperature on transmission

Discussion

Methods

Model formulation

Meteorological model

Transmission model

Observation model

Complete model and causal graph

Numerical implementation

Vignette 1 on descendants and measurement bias

Simulation details

Regression model for time series of observed cases

Regression models for time series of effective reproduction numbers

Vignette 2 on climate variability as natural experiments

Simulation details

Parameter estimation protocol

Vignette 3 on confounding bias

Simulation details

Epidemic synchrony and assessment of spatial spread

Two-location transmission model with spatial diffusion

Parameter estimation protocol

Vignette 4 on mediation, direct and indirect causal effects

Simulation details

Reporting summary

Data availability

Code availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Peer review

Peer review information

Additional information

Supplementary information

Supplementary Information

Reporting Summary

Peer Review File

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Sex-specific effects of antagonistic coevolution: insights from an insect host and a bacterial pathogen coevolution system

Search

Quick links