Introduction

Extreme events are becoming more severe1. This holds in particular for heat extremes: rising mean temperatures increase the severity, frequency and duration of heatwaves. These changes may additionally be modulated by circulation changes and land-atmosphere feedbacks2. It is well known that in regions of increasing soil moisture-temperature coupling, temperature variability is amplified such that extreme heat responds stronger to climate change than average temperatures2,3,4,5.

Attribution studies have found an influence of anthropogenic climate change on a large number of recent heat extremes, including the devastating events in Canada in 20216, Northern India in 20227 and the Western Mediterranean in 20238. Such disastrous heatwaves are very rare events with return periods of hundreds of years even in present climate, and among the most severe events ever recorded9,10.

The rareness of these events is not well represented by the typical analyses conducted in climate change studies until recently: widely used extreme event indicators11 consider events occurring once a year or more often, and even analyses explicitly quantifying return levels typically select return periods of only 20 years or below12,13. This choice may often be motivated by the limited availability of long time series to robustly estimate very extreme events, both from observations and model simulations. A large fraction of our knowledge about extreme events is therefore based on the study of moderate extreme events, and applying this knowledge to statements about changes in very extreme events therefore tacitly assumes that the latter respond in a similar way to climate change than the former.

Over recent years, researchers have therefore started to study very extreme or record breaking events with return periods of hundred years or beyond, focusing on the analysis of individual events14,15,16 or investigating multi-model projections17,18,19,20. None of these studies, however, has investigated whether and to what extent very extreme heat events may respond differently to climate change compared to moderate extreme events. Ignoring such differences would have dramatic consequences for climate risk assessments, as projections and attribution statements would misrepresent the influence of climate change on the most impactful events.

There is evidence for exactly such differential climate change signals of moderate and very extreme heat events. While most studies investigate the role of land-atmosphere coupling for seasonal mean conditions2,21,22, the coupling has been shown to be particularly relevant for intense heat events14,23,24,25. In fact, the strength of land-atmosphere coupling is not constant but varies with weather26,27. Hence, episodic coupling during extreme heat events, in the following referred to as event coupling, may be different—stronger or weaker—from seasonal mean coupling, and trends in event coupling may be different from trends in seasonal mean soil moisture-temperature coupling.

The generation of multiple single model initial conditions large ensembles (SMILES) over the last decade28 offers new possibilities to sample many extreme events19,29, and thus to robustly study also very extreme events.

Here, we employ three different SMILES to assess to what extent changes in very extreme heat events differ from changes in moderate heat extremes, and to identify the underlying mechanisms. For our analysis, we consider daily maximum temperatures TX. Extremes of TX are a well-established heat index4,19,30 and a useful proxy for heatwaves, as the hottest temperatures typically build up during long heatwaves31,32. We represent mean temperature conditions as the summer (annual in the Tropics) average of TX, moderate extremes as discussed above by the 2-year return levels of TX, and very extreme events by their 200-year return level. In the following we refer to these indicators as mean, moderate extreme and very extreme heat.

The choice of 200 years balances the need to consider, on the one hand, the most extreme events, but on the other hand allows sampling several such events with a SMILE, and thus to gain robust statistics without having to extrapolate and thus potentially create statistical artefacts. We have chosen the three SMILES available offering daily data for characterising the underlying processes while providing a sufficient number of ensemble members. We refer to these as the MPI, CSIRO and MIROC SMILES. For details on the methodology see the Methods section.

The spatial patterns of return level changes differ substantially between the models considered. To avoid smoothing out physical effects, we therefore consider each model separately as a pausible and physically self-consistent storyline of future changes33,34.

Results

Projected changes in moderate and very extreme heat

The spatial patterns of moderate extreme and very extreme heat over land are mainly controlled by latitude, continentality and elevation, and strongly resemble the pattern of the corresponding means (Fig. 1a, c and e for the MPI SMILE, and Table 1, pattern correlations above 0.97). Their climatic changes, however, are markedly different (panels b, d and f). Change patterns in the 2-year return level closely resemble those in mean TX (pattern correlations between 0.80 and 0.90 depending on the model, Table 1), although the amplitude of changes is stronger in many regions. This amplification is consistent with previous findings2. But change patterns in mean and moderately extreme temperatures are much weaker correlated with spatial patterns of changes in the very extreme heat (pattern correlations between 0.02 and 0.56). Major regions of strong changes are substantially shifted: for instance, while strong changes in 2-year return levels can be found over the Great Plains of North America, the strongest changes in 200-year return levels over North America can be found several hundred kilometers further east. Similarly, the strongest changes in 2-year return levels over Eurasia can be found along a band ranging from the Balkans and the coast of the Black Sea to central Asia. The strongest changes in the 200-year return levels, however, are shifted northward, starting in Central Europe. Patterns in changes of very extreme heat are essentially uncorrelated with changes in the mean (for two models, and only weakly correlated for one model; pattern correlations between 0.02 and 0.40).

Fig. 1: Daily maximum temperature and its changes.
figure 1

a Present day (1990–2014) summer mean (annual mean for 20S to 20N) daily maximum temperature TX and b its projected changes (2076–2100 vs. 1990–2014) under the SSP5-8.5 scenario. Corresponding present day values and changes for the 2-year (c, d) and 200-year (e, f) return levels of TX. Results are shown for MPI-ESM1-2-LR. The corresonding results for the other two SMILES can be found in the supplementary material, Figs. S1 and S2. Changes in hatched regions are small compared to interannual variability (see Methods for detail).

Table 1 Similarity of temperature climatologies and changes

The orthogonality of the change patterns is highlighted by considering the differences between the changes in 200-year and 2-year return levels: over distinct regions, very extreme heat respond differently to climate change compared to moderate extreme events. Pronounced dipole-like patterns emerge for all considered models, but the spatial patterns are markedly different between different models (Fig. 2a, c and e; maps for further return periods are shown in Supplementary Fig. S3). For instance, the MPI SMILE shows a strong amplification of changes in the 200-year return levels over Eastern North America and a band ranging from France via Eastern Europe into Central Asia. Here, the changes in 200-year return levels are often some 4 K stronger than those in 2-year return levels. In contrast, over the North American Great Plains and a band from the Balkans into Mongolia, the changes in the very extreme heat are typically 2–3 K weaker than those in moderate extremes. Also the other two SMILES show distinct patterns of regionally amplified and dampened responses of very extreme heat, albeit with substantially displaced spatial features. These differences highlight large uncertainties in the representation of changes in very extreme heat events.

Fig. 2: Difference in changes of extreme temperature return levels and changes in event coupling.
figure 2

a, c, e Difference between changes (2076–2100 vs. 1990–2014) in 200-year and 2-year return levels of daily maximum temperature TX for MPI-ESM1-2-LR (top), ACCESS-ESM1-5 (middle) and MIROC6 (bottom). Results for further return levels are shown in Supplementary Fig. S3. b, d, f Changes in the corresponding soil moisture-temperature coupling strength. Differences/changes in hatched regions are not significant at the 95% level (see Methods for detail).

The role of changes in event soil moisture-temperature coupling

Soil moisture-temperature coupling plays a key role in the amplification of extreme temperatures2,35: in an energy limited regime, soil moisture values are so high that evaporation is dominated by the available energy and not limited by soil moisture. In a dry regime, soil moisture and thus evaporation are low. In both cases, the coupling between temperature and soil moisture is weak. In a transitional regime, soil moisture limits evapotranspiration, but is sufficiently high to generate a coupling: decreasing soil moisture in turn decreases evaporation, thereby increases sensible heat fluxes and ultimately temperature35. Increases in the summer coupling strength therefore amplify temperature variability such that heat extremes typically warm stronger than mean temperatures in a warming climate2,3,4,21,36. The SMILES we consider reproduce the present-day coupling strength (highest values in the lower mid-latitudes and subtropics, Supplementary Fig. S4) as well as the amplifications of changes in moderate extreme events (e.g., over the Americas, Europe and Southern Africa, Supplementary Fig. S5) found in previous studies2,4, although with strong displacements in spatial patterns across individual models.

While research so far has focused on quantifying the change of soil moisture-temperature coupling in a climatological sense, there is evidence that the coupling is particularly relevant for very severe heat events14,23,24,25. In fact, it has recently been highlighted that land-atmosphere coupling varies with weather26, and climate in many locations is composed of days with and without active coupling27. We argue that this event character of land-atmosphere coupling is particularly relevant for heatwaves. During heatwaves soil moisture typically decreases over time, such that the coupling may change from an energy limited regime into a transitional regime, or from a transitional regime into a dry regime. Thus, we hypothesise that the coupling strength for the hottest days of the year is different from summer mean coupling.

We therefore specifically analyse event coupling between annual maxima TXX of daily maximum temperatures and latent heat fluxes at the occurrence days of these maxima (see Methods section for details). Projected changes in this event coupling, in particular in the mid-latitudes, closely follow the patterns of differences in the changes of very and moderate extreme events (Fig. 2b, d and e). Increases in event coupling strength amplify the climate change signal of the very extreme heat events, decreases dampen them. The robustness of the effect across models corroborates that changes in event soil moisture-temperature coupling are indeed an important physical mechanism for modulations of changes in very extreme heat.

Changes in event coupling during heat events behave differently compared to summer mean coupling (see Methods section for definitions). While both exhibit broadly similar zonal mean behaviour (Fig. 3a), the regional patterns may differ substantially (see Supplementary Fig. S4 and the discussion of spatial patterns of driving processes in the following section). Also projected changes between mean and event coupling differ substantially (Fig. 3b, compare also Fig. 2, right column). While models more or less agree on changes in mean coupling, uncertainties in event coupling changes are substantial and differ even in sign. The role of changes in event coupling for amplifying changes in very extreme heat is strongest in the mid-latitudes. There, it explains some 40% of the differential changes in extreme return levels (Fig. 3c), while its role is insignificant in the Tropics and high latitudes.

Fig. 3: Latitudinal dependences.
figure 3

Latitudinally averaged present-day coupling strength (a, negative values depict active coupling), projected changes in coupling strength (b) and pattern correlation between projected changes in coupling strength and the difference in changes of 200- and 2-year return levels as shown in Fig. 2c. Solid lines depict event coupling, dashed lines seasonal mean coupling. Latitudinal average pattern correlations are calculated across a full latitudinal circle over a latitudinal band of approximately 20. No results are shown for regions south of 40 south because ice-free land areas are very small and only very few grid boxes enter the calculations. Shaded areas in (a) and (b) mark approximate 95% confidence intervals; grey area in (c) marks non-significant pattern correlations (see Methods for detail).

Driving processes

Seneviratne et al.35 have presented a conceptual model to explain changes in seasonal soil moisture-temperature coupling. They argue that a drying in an energy limited regime leads to enhanced variability (e.g., in Europe), whereas further drying in a transitional regime leads to reduced variability (e.g., in the southeast US). Apart from studies on land surface influences37 and decadal SST variability38, further research on the causes of coupling changes is sparse.

Here, we transfer Seneviratne’s model to event coupling, and expand it to situations of increasing soil moisture (Fig. 4). Based on this model, we identify potentially relevant hydrometeorological drivers and their changes. We hypothesise that regional differences in changing coupling strength are mainly caused by changes in soil moisture during heat events. As discussed above, three regimes exist: weak coupling for both low and high soil moisture values, and strong coupling in a transitional regime of medium soil moisture values. Thus soil moisture-temperature coupling increases when soil moisture values increase from dry to a transitional regime, or decrease from wet to a transitional regime. Vice versa, soil moisture-temperature coupling decreases when soil moisture values decrease to dry, or increase to wet conditions. Such changes can be caused by both changes in rainfall amounts as well as by increases in evapotranspiration, caused by changes mainly in temperature.

Fig. 4: Changes in coupling strength.
figure 4

Conceptual figure of coupling strength as a function of soil moisture with wilting point θwilt and the boundary θcrit between a soil moisture and energy limited regime. Below, for several example regions, projected changes according to the MPI-ESM1-2-LR are given (arrows; blue: decrease in coupling strength, red: increase). These examples are also given in the main text when discussing Fig. 5 in Section “Driving processes”.

Regions of strong changes in event coupling strength and thus strong amplification or dampening of changes in very extreme heat (Fig. 2a, b for MPI model) coincide with regions of intermediate to high event coupling strength in present climate (Fig. 5a for MPI model). Yet event soil moisture values in present climate vary strongly across these regions (panel b). Where soil moisture values are high, but decrease in a warming climate (panel c), coupling strength increases—see Central Europe and the Eastern US as prominent examples. But also when soil moisture values are low and increase in a warming climate, such as in Central Africa, the coupling strength increases. In contrast, when soil moisture values are already low and further decrease (such as in the Balkans and US Great Plains), and when soil moisture values are relative high and further increase in a warming climate (e.g., around Beijing), the coupling strength decreases (see also Fig. 4). These changes in soil moisture can, to a large extent, be explained by changes in summer precipitation patterns (panel d).

Fig. 5: Driving processes.
figure 5

a Present (1990–2014) event soil moisture-temperature coupling strength (strong negative values indicate strong coupling), b present (1990–2014) total soil moisture during heat events, c changes in total soil moisture during heat events (d) and summer (annual mean for 20S to 20N) precipitation changes (2076–2100 vs. 1990–2014). Results are shown for MPI-ESM1-2-LR. The corresonding results for the other two SMILES can be found in the supplementary material, Figs. S6 and S7. Values in hatched regions of (a) are not significantly smaller than zero at the 95% siginficance level; changes in hatched regions in (c) and (d) are small compared to interannual variability (see Methods for detail).

These findings highlight the role of planetary-scale dynamic and thermodynamic changes in modulating changes in soil moisture coupling and thus changes in very extreme heat: the overall poleward shift of summer precipitation can be explained by a poleward shift of the summer jet streams and the accompanying storm tracks39,40. Regionally, processes such as changes in the frequency of atmospheric ridges in the western United States and Canada during summer41,42, or changes in the land-sea temperature contrast and lapse rate over the Mediterranean43 affect changes in summer precipitation patterns. In some continental regions, of course, soil moisture-precipitation coupling may play an additional role35.

The subtle interplay between event coupling strength and event soil moisture values in present climate, as well as changes in soil moisture, and the role of atmospheric dynamics explain the large uncertainty in the projections of very extreme heat across the different considered models: a small displacement in the patterns of one of these variables may have strong implications for changes of the coupling strength and may even revert the sign of these changes.

Discussion

The magnitude of very extreme heat events can change markedly different to moderate extreme events. In some regions we find amplifications of changes in the 200-year return levels by more than 2.5 K compared to changes in 2-year return levels, in other regions a dampening by similar amounts. Thus, in some regions the climate change signal in very extreme heat essentially doubles compared to moderate extreme events.

This modulation of changes in very extreme heat is not only relevant for strong warming scenarios, but emerges already for near term changes (Fig. 6). For instance, the strong dipole in the change over Central Europe is clearly visible in the near term projections as well. Similar changes are of course expected for similar levels of global warming such as for moderate scenarios by the end of the century.

Fig. 6: Difference in changes of extreme temperature return levels for near future projections.
figure 6

Difference between changes in 200-year and 2-year return levels of daily maximum temperature TX for near future climate (2036–2060 vs. 1990–2014) for MPI-ESM1-2-LR. The corresonding results for the other two SMILES can be found in the supplementary material, Figs. S8 and S9. Differences in hatched regions are not significant at the 95% level (see Methods for detail).

Until recently, discovering differences between the changes of moderate and very extreme events would have been impossible because of a lack of observed or simulated data (Fig. 7). Sub-sampling the large ensemble to 25-year periods results in huge uncertainties in return level estimates already for climatological return levels in present and future climate (grey lines in panels a and c). These values become even larger when considering climatic changes of return levels (panels b and d): robust estimates of changes in very extreme heat are impossible. Some realisations show a stronger change in very extreme heat, some a weaker change compared to moderate extremes for both considered grid boxes. Only when analysing all ensemble members jointly, the uncertainties are reduced such that the modulations of changes in very extreme heat emerge: over Northern Germany, these events change stronger than moderate extremes; over the Balkans the opposite is simulated.

Fig. 7: Relevance of SMILES.
figure 7

a, c Return level plots for present (blue, 1990–2014) and future (red, 2076–2100) climate based on the full ensemble of MPI-ESM1-2-LR at a grid box in Northern Germany (a) and the Balkans (c). The grey lines depict estimates based on individual ensemble members. The two black vertical lines show 2-year and 200-year return periods. b, d. corresponding difference between future and present day return levels.

Our findings are of course contingent upon the performance of the chosen climate models. CMIP6 models roughly capture observed patterns of soil moisture44 and soil-atmosphere coupling45, but the spread across models is large45,46. Nevertheless, we find a robust physical mechanism across the different models: the modulation of changes in very extreme heat can be explained by changes in event coupling strength, which in turn can be traced to changes in summer precipitation. More research is needed to fully understand the underlying mechanisms, in particular because present-day biases affect the climate change signal22.

An encouraging result of this study is that in regions with the strongest event soil moisture-temperature coupling, this coupling strength is projected to remain constant or decrease. In turn, changes in very extreme heat will not be higher than changes in moderate extremes but potentially even weaker. But a concerning finding is that some regions will emerge as hot spots of changes in very extreme heat. These regions are often not the regions of the hottest extremes in present climate, but will experience much stronger changes than anticipated based on studies of moderate extreme events. Ignoring the amplification would lead to a dramatic underestimation of future heat risks. In turn, societies would likely not adapt adequately and thus be unprepared. Robustly identifying these regions of strong amplification is therefore crucial for climate risk assessments and adaptation planning. Our findings also suggest that attribution studies for very extreme heat events should incorporate SMILES to be able to better estimate changes in the tail behaviour.

Unfortunately, our assessment is limited by huge model uncertainties: while, from a global perspective, the models we analysed produce qualitatively similar results, they differ regionally even in sign. In the mid-term this calls for making the relevant daily-scale variables available for more SMILES to better quantify uncertainties. In the near-term, a rough estimate may be gained by estimating changes in soil moisture-temperature coupling during the hottest days of a year across the full CMIP6 model ensemble. These changes may serve as a proxy for modulations of changes in very extreme heat. Also, emergent constraints and benchmarking against observations45 should be considered to reduce projection uncertainties. Nevertheless, some region-specific conclusions can already be drawn from our assessment, considering the identified physical drivers: most regions facing a strong amplification of changes in very extreme heat experience an energy limited wet regime in present climate and expect future decreases in event soil moisture and summer precipitation. Thus, societies in regions with such characteristics should carefully consider the possibility of substantial increases in very extreme heat events well beyond the usually projected changes.

Overall, our results are in line with an emerging consensus across recent studies16,33,47,48: to improve climate risk assessments, we cannot rely on analysing traditional extreme indices, but need to specifically address the very extreme events. Dedicated research is required exploiting new opportunities such as SMILES29, storylines33 and ensemble boosting16. Given the limitations of climate models, credibility of such studies can only be established by disentangling the processes governing changes in the very extremes, similar to research on recent events14,15,49.

Methods

Climate model simulations

To sample very rare extreme events and to obtain robust estimates of 200-year return levels, we employ all available SMILES providing the required variables at a daily resolution50. These are 51 (present) and 30 (future) realisations of the MPI-ESM1-2-LR (1.8 horizontal resolution in the atmosphere) of the Max Planck Institute of Meteorology50, 32 (present) and 40 (future) realisations of the ACCESS-ESM1-5 (1.88 × 1.25) developed by an Australian consortium51, and 50 (present and future) realisations of the MIROC6 (1.4) developed by the Japanese team MIROC52. For future projections we have considered the worst-case scenario SSP5-8.5 because it represents a high signal-to-noise ratio and is available for all models providing the relevant data. We consider three 25-year periods representing present climate (1990–2014), near future (2036–2060), and the end of the century (2076–2100).

Temperature return levels

For the temperature analyses, we have considered daily maximum temperatures TX. For mean changes, we calculated summer means (June to August in the Northern Hemisphere, December to February in the Southern Hemisphere, annual means in the Tropics, i.e., between 20 S and 20 N). Moderate extreme events have been represented by 2-year return levels, very extreme events by 200-year return levels. Return levels have been estimated from a Generalised Extreme Value distribution, fitted to annual maxima TXX of daily maximum temperatures. Note that block maxima are not required to be sampled from continuous time series, but from i.i.d. block maxima. This condition is fulfilled for block maxima from a single model arising under the same forcings. We therefore combined block maxima from all ensemble members. Over a 25-year period, the total number of maxima entering the fits is thus 25 times the number of ensemble members for a given model. For instance, for the MPI SMILE we had 30 × 25 = 750 maxima in the future period. For comparison, we estimated return levels as averages across the results for individual ensemble members. The results are virtually indistinguishable (not shown). The return level curves based on a single initial condition (grey lines in Fig. 7) were calculated from 25 maxima.

Coupling strength

We calculate seasonal mean coupling strength as the summer (annual in the Tropics) correlation of 7-day running mean surface latent heat flux (a measure of evapotranspiration) and daily maximum temperature. This approach loosely follows the widely-used definition from the GLACE project2. To speed up computations, we calculate these correlations separately for each ensemble member and subsequently average.

Event soil moisture-temperature coupling during heat events is measured correspondingly as the correlation between all (over a 25-year period and all ensemble members) annual maxima of Tx and surface latent heat flux on the occurrence days of the annual maxima in Tx. The changes in event coupling strength are calculated on the days of all annual maxima of Tx and thus do not consider any information about the severity of the events. These changes, therefore, cannot be an effect but only a cause of the differential temperature changes.

Uncertainty assessment

Climate response uncertainties have been considered as comprehensively as currently possible by using three different SMILES. The role of sampling uncertainty is relatively small because the SMILES provide many ensemble members to robustly estimate most relationships. Thus all significance tests (apart from where pattern correlations are considered) have a very high sensitivity given the high number of data points (25 years  × number of ensemble members), i.e., also small changes emerge as significiant. To assess the relevance of derived changes, we therefore compare projected changes with interannual variability where the latter can meaningfully be defined. For changes in temperature (Fig. 1 second column) and soil moisture (Fig. 5c), 100% of interannual variability have been considered as threshold. Because of the lower signal-to-noise ratio, for precipitation changes the threshold has been chosen as 25% of interannual variability (Fig. 5d).

The significance of differences in return level changes (Fig. 2 first column and Fig. 6) has been assessed as follows. First, we derived standard errors \(s{d}_{200,2}^{p,f}\) of present-day and future 200 and 2-year return levels \({z}_{200,2}^{p,f}\) from their respective maximum likelihood estimates. From these, we calculated the standard error \(s{d}_{\Delta 200,2}^{\Delta p,f}\) of differences in return level changes \(({{{{\rm{rl}}}}}_{200}^{f}-{{{{\rm{rl}}}}}_{200}^{p})-({{{{\rm{rl}}}}}_{2}^{f}-{{{{\rm{rl}}}}}_{2}^{p})\)). We then conducted a significance test for zero difference under the assumption that

$$\frac{\left({{{{\rm{rl}}}}}_{200}^{f}-{{{{\rm{rl}}}}}_{200}^{p}\right)-\left({{{{\rm{rl}}}}}_{2}^{f}-{{{{\rm{rl}}}}}_{2}^{p}\right)}{{{{{\rm{sd}}}}}_{\Delta 200,2}^{\Delta p,f}}$$
(1)

is approximately standard normal distributed. Similarily, we have derived 95% confidence intervals of return levels and their changes from the return level standard errors (Fig. 7).

The significance of the coupling strength (Fig. 5a) has been assessed based on the assumption that

$$t=\frac{r}{\sqrt{\frac{1-{r}^{2}}{n-2}}}$$
(2)

follows a t-distribution with n–2 degrees of freedom. In a similar vein, changes in coupling strength (i.e., correlation, Fig. 2 second column) have been assessed for signficance53. First we transformed the correlations rp,f in present and future climate using the Fisher z-transformation as

$${z}^{p,f}=\frac{1}{2}\ln \left(\frac{1+{r}^{p,f}}{1-{r}^{p,f}}\right)$$
(3)

and derived standard errors as

$${{{{\rm{se}}}}}_{{z}_{p}-{z}_{f}}=\sqrt{\frac{1}{{n}^{p}-3}+\frac{1}{{n}^{f}-3}}.$$
(4)

We then conducted the significance test assuming that the test statistic

$$\frac{{z}^{f}-{z}^{p}}{{{{{\rm{se}}}}}_{{z}_{p}-{z}_{f}}}$$
(5)

is approximately standard normal distributed.

For the latitudinal dependencies, i.e., zonal averages across latitudinal bands, the effective number of spatial degrees of freedom has been estimated as54

$${N}_{{{{\rm{eff}}}}}=\frac{2{\mu }^{2}}{{{{{\rm{sd}}}}}^{2}}$$
(6)

where μ and sd are sample mean and standard deviation of the field average E of the squared gridbox time series xt,i

$${E}_{t}={\sum}_{i=1}^{N}{x}_{t,i}^{2}$$
(7)

with i = 1. . . N gridboxes. For seasonal mean and event properties we have considered Neff of seasonal mean Tx and annual maxima Txx, respectively. Neff depends on both the spatial correlations and the number of land gridboxes in a considered latitudinal band.

For the zonal means of coupling strength, we have first derived approximate gridbox 95% confidence intervals for invididual realisations from standard errors of the gridbox correlations, approximated as55

$${{{\rm{sd}}}}=\frac{1-{r}^{2}}{\sqrt{n-3}}.$$
(8)

We then derived confidence intervals for the average across the full ensemble and the latitudinal band, accounting for the effective number of degrees of freedom (Fig. 3a). From these, we have further derived corresponding confidence intervals for the changes in coupling strength (Fig. 3b). For the latitudinally dependent pattern correlation we have assessed the significance in the same way as for the coupling strength, but accounting for the effective number of degrees of freedom (Fig. 3c).

All signficance tests have been conducted with a significance level α = 0.05.