Large biases in the frequency of water limitation across Earth system models

Giardina, Francesco; Padrón, Ryan S.; Stocker, Benjamin D.; Schumacher, Dominik L.; Seneviratne, Sonia I.

doi:10.1038/s43247-025-02426-7

Download PDF

Article
Open access
Published: 16 June 2025

Large biases in the frequency of water limitation across Earth system models

Communications Earth & Environment volume 6, Article number: 469 (2025) Cite this article

3141 Accesses
1 Citations
18 Altmetric
Metrics details

Subjects

Abstract

Water availability limits evapotranspiration on land, shaping the energy balance, land carbon uptake, and climate extremes. Despite its importance, Earth system models struggle to capture where and how often water-limited conditions occur. Here we investigate the representation of water limitation and its link to land water storage capacity in simulations from the Sixth Coupled Model Intercomparison Project (CMIP6) driven by consistent observational atmospheric forcing. Using observations of remotely sensed solar-induced vegetation fluorescence and terrestrial water storage, together with ecosystem flux data, we find that CMIP6 models overestimate the frequency of water limitation by 14% over land and 26% in the tropics. Model overestimation occurs over 57% of the land area, and 78% in the tropics. These too frequent water-limited conditions are not conclusively linked to a potential underestimation of land water storage capacity in the models, hinting at gaps in how ESMs represent rooting depths, plant water uptake, and plant water-use strategies. Our study highlights the need for model development in these areas, with implications for projections of future climate on land.

Global patterns of water storage in the rooting zones of vegetation

Article Open access 09 February 2023

Rising water-use efficiency in European grasslands is driven by increased primary production

Article Open access 27 March 2023

Observations indicate regionally misleading wetting and drying trends in CMIP6

Article Open access 17 October 2024

Introduction

Climate projections are based on a variety of Earth system model (ESM) simulations compiled in model intercomparison projects¹. The accuracy of these simulations is key for progress in climate science and eventually affects the implementation of climate policies globally. The sixth phase of the Coupled Model Intercomparison Project (CMIP6) substantially contributed to the physical science basis of the Sixth Assessment Report (AR6) by the Intergovernmental Panel on Climate Change (IPCC)^1,2. This phase includes the most advanced ESMs, simulating historical and future climates based on greenhouse gas and aerosol concentration scenarios outlined in the Shared Socioeconomic Pathways (SSP)³. Nonetheless, continuous efforts are needed to keep improving multi-model ensembles, as some models have been shown to not fully align with observational evidence or theoretical understanding^4,5,6,7,8.

The land water available for evapotranspiration (ET)—here referred to as active land water (ALW)—links the global energy, water, and carbon cycles, and is thus key for accurate climate projections^9,10,11. ALW denotes the combined water available in the soil profile—including surface water, soil moisture from both upper and deeper layers, accessible groundwater, and rock moisture – that collectively contributes to plant transpiration and surface evaporation. ALW primarily directs the available energy (net radiation) at the land surface, towards the evaporation of water^9,12. This process not only affects the water cycle but also modulates the turbulent heat fluxes, thus impacting the climate system as a whole¹³. ALW also acts as a reservoir for precipitation and radiation anomalies, maintaining stability in the climate system^12,14. Moreover, given that plants regulate photosynthesis and transpiration in response to water availability, ALW influences the global carbon cycle⁹.

When ALW drops below a critical threshold, the evaporative fraction of net radiation decreases, leading to an increase in the sensible heat fraction and ultimately in air temperature^12,13. Once ALW is below that threshold, terrestrial vegetation can no longer maintain sufficient transpiration and evaporation, effectively entering a water-limited regime. Water limitation is estimated to affect evapotranspiration in 30% to 60% of the Earth’s land surface for most of the year¹⁵, and it is an important factor in the exacerbation of heat extremes. However, our understanding of the frequency of water limitation and its effects on ecosystems under climate change remains limited¹². This is reflected in the representation of land-atmosphere interactions in climate models^16,17. Recent research has suggested an overestimation of future warming across CMIP6 models⁵, plausibly connected to other documented biases in soil moisture⁶ through land-atmosphere interactions. This uncertainty affects our ability to determine whether we can limit global warming below the targets outlined in the Paris Agreement. It is therefore crucial to constrain these model ensembles with other evidence, such as historical trends and current climate observations.

Using simulations from the land surface model component of nine ESMs available within CMIP6¹⁸ driven with observed atmospheric forcing, we show that the frequency of water-limited conditions for evapotranspiration is generally overestimated across models. Furthermore, we analyze how biases in the frequency of water limitation relate to active land water storage capacity (ALWSC) within the models. We conclude by discussing the reasons behind these model biases and the potential implications for climate projections on land.

Results and discussion

Biases in the frequency of water limitation

We focus on the land-hist experiment within the Land Model Intercomparison Project (LMIP), which consists of simulations of the land component of ESMs from CMIP6, driven by observational atmospheric forcing. Because LMIP simulations are forced with observational data, differences in the frequency of water limitation cannot stem from variations in precipitation or incoming radiation. This is not the case when using CMIP6 simulations from the historical scenario, where precipitation and radiation can differ among models. To quantify water constraints on evapotranspiration and photosynthesis, we study the evaporative fraction of net radiation (EF) as a function of total-column soil moisture from all nine ESMs available within LMIP. The variable ‘total soil moisture’ includes moisture from all soil layers^2,19. On the other hand, given the challenges of directly observing EF and total soil moisture on a global scale, we use normalized remotely sensed solar-induced fluorescence (SIF) and TWS from GRACE to identify when vegetation is affected by insufficient water availability¹¹. For models and observations, we first derive the critical water limitation threshold θ_crit at every grid cell, and then compute the amount of time each grid cell remains beneath θ_crit (see Methods).

We find that the frequency of water limitation is on average overestimated in CMIP6 models compared to observations by 14% over land, and up to 26% in the tropics (Figs. 1 and S1). Conversely, models often underrepresent the occurrence of water limitation in the Northeastern US and in central and northern Europe, and generally at high latitudes in the Northern Hemisphere. Grouping our results by IPCC regions generally supports these findings, with many regions showing an overestimation of frequency of water limitation in the models. The most affected regions include the Congo rainforest, Southern Africa, Australia, the Amazon, South and Southeast South America, and Eastern Central Asia (Fig. 2). We additionally analyze water limitation frequency when substituting GRACE TWS data with total soil moisture from two products (GLWS2.0²⁰ and GLDAS_CLSM025_DA1_D^19,21,22) that assimilate GRACE TWS and exclude snow and water bodies on land (Fig. S2). Here, too, we find a similar overestimation of water limitation frequency by the models. Our results are also consistent when using CMIP6 simulations from the historical scenario instead of those from the land-hist experiment (Fig. S3).

**Fig. 1: Global maps of frequency of water limitation.**

**Fig. 2: Comparison of the frequency of water limitation across different IPCC regions based on LMIP-CMIP6 model simulations and observations.**

The model overestimation of the occurrence of water limitation is confirmed when performing the same analysis with FLUXNET2015 observations, complemented with soil moisture simulated with a bucket-type soil water balance model driven by in-situ measurements²³ (Figs. 3 and S4 and S5). For many ESMs, even a typically humid biome as the Amazon can experience water limitation for more than 30% of the days in a year (Fig. 3g, h, i, j). This pattern of water limitation is similar to that observed in a dry Mediterranean savanna (US-Ton, Fig. 3k–o). ESMs also overestimate the frequency of water limitation at multiple FLUXNET2015 stations in Australia, China, Italy, and the USA (Fig. S4). We use daily data from LMIP-CMIP6 (available only for four models) and we compare the ESM output to the FLUXNET2015 observations lying within the same grid cell (see Methods). The footprint of an eddy-covariance flux tower is much smaller than an ESM grid cell, and this could be a source of bias.

**Fig. 3: Analysis of water limitation at three selected eddy-covariance sites.**

These findings raise the question of whether the overestimation of the frequency of water-limited conditions stems from too little ALW available in the models, or from intrinsic model assumptions about how water availability influences land–atmosphere fluxes. In the next section, we compare ALWSC in models versus observational proxies, to assess whether biases in water storage capacity may be at the root of the discrepancies in the frequency of water limitation.

Modeled and observed active land water storage capacity

There are no direct observations of ALWSC that could be used to evaluate the models. The maximum cumulative water deficit (CWD_max), based on the annual maximum cumulative difference between evapotranspiration (ET) and precipitation (P)^24,25,26 can be considered as a proxy for ALWSC, as it captures the depletion of ALW. To calculate the CWD with CMIP6 data, we directly used ET and P from LMIP-CMIP6 models. For the observational reference (referred to as S_CWDX), we use the 80-year extreme CWD from a previous study²⁷, determined using ET data derived from thermal infrared remote sensing via the Atmosphere-Land Exchange Inverse (ALEXI) product^24,25 and precipitation reanalysis data from WATCH-WFDEI²⁶. We find a model underestimation of ALWSC, especially in the wet tropics (Fig. 4). Grouping our results by IPCC regions reveals several regions where ALWSC is underestimated, i.e., South Asia, Central and South America, and Sub-Saharan Africa (Fig. S6). Using historical CMIP6 simulations instead of those from the land-hist experiment yields similar results (Fig. S7). It is important to note that S_CWDX can be inaccurately low in areas that are not typically water-limited, as it relies on ET estimates derived from thermal infrared remote sensing. In addition, S_CWDX accounts for snow accumulation and melt²⁷, whereas this is not accounted for in the CWD estimate of the models, contributing to the differences in high-latitude regions.

**Fig. 4: Comparison of active land water storage capacity (ALWSC) from LMIP-CMIP6 simulations against observations-based estimates.**

As an alternative proxy for ALWSC, we also use the maximum annual depletion of total-column soil moisture (ΔSM_max) (see Methods). This is straightforward to obtain for the models, but difficult to derive from observations. We assume that the annual depletion in terrestrial water storage (ΔTWS) in observations from the Gravity Recovery and Climate Experiment (GRACE) is comparable to the depletion in total soil moisture from CMIP6 models^10,11,28 (Fig. S8a, b). However, we note that this assumption is particularly problematic in regions with large annual variability in snow (e.g., high latitudes) or water bodies on land (e.g., the wet tropics, which contain large river basins), given that they contribute to the ΔTWS signal^29,30. We thus additionally estimate ΔSM_max using two products (GLWS2.0²⁰ and GLDAS_CLSM025_DA1_D^19,21,22) that assimilate GRACE TWS to simulate total soil moisture while excluding snow and water bodies on land (Fig. S8c–f). Results from GRACE ΔTWS_max suggest a much larger ALWSC, particularly in the Amazon, than what is represented in the CMIP6 models (Fig. S8a, b), whereas when compared to GLDAS_CLSM025_DAI_D (Fig. S8c, d) and GLWS2.0 (Fig. S8e, f), the bias in CMIP6 models is much smaller.

Overall, given the limitations of the observational estimates, it remains difficult to conclude how well ALWSC is represented in the ESMs. It is also conceivable that, during dry periods, the land component of the ESMs responds too strongly to water stress, reducing ET and thereby limiting further soil moisture depletion, which would decrease our model estimate of ALWSC. Thus, the overly frequent water-limited conditions in the models may not necessarily stem from an underestimation of the ALWSC; rather, they likely relate to how plant-available water is represented in the models, for instance, through assumptions about rooting depth or soil moisture stress functions. This is consistent with other studies pointing to a general overreliance on shallow rather than deep soil moisture in models^31,32 and the stronger drying trends in projections of surface compared to deep soil moisture³³.

Potential causes of biases between CMIP6 models and observations

Our analysis shows that CMIP6 models overestimate the occurrence of water limitation (Figs. 1 and 2), particularly in the tropics. The overestimation of the time under water limitation is consistent with previous studies suggesting that models underestimate ET in the tropics⁷, and across most regions during dry periods^31,34,35,36.

Europe and North America emerge among the least biased regions when compared to observations (Figs. 1 and 2). This is probably due to the large availability of ground-based observations to constrain ESMs in these areas compared to the rest of the world. On the other hand, the Amazon and, in general, the wet tropics, are subject to the largest biases (Figs. 1 and 4). This supports previous findings that ESMs tend to overestimate water stress in the Amazon and do not adequately capture the positive sensitivity to atmospheric aridity in its most humid regions³⁷ and at locations with a shallow water table³⁸. This also potentially reflects the inadequate representation of tropical forest root traits in global models³⁹. Given the key role of the Amazon for the global water and carbon cycles^40,41, it is crucial to improve model accuracy when representing this region, also because the response of tropical rainforests to water limitation is one of the main uncertainties in ESMs⁴².

Among the nine LMIP-CMIP6 models available for this study, CESM2 has the most realistic representation of the soil-plant-atmosphere continuum, being the only model that represents water stress based on leaf water potential^31,43,44. It explicitly accounts for plant hydraulics and calculates water potentials in soil, roots, stems, and leaves⁴³. This enables plants in CESM2 to draw more water for transpiration from deeper soil layers compared to other ESMs⁴³. This may explain why CESM2 is the model with the lowest raw bias (Figs. S1 and S9). CNRM-ESM2-1 and UKESM1-0-LL are among the models with the highest linear fit for the spatial pattern (R²) and lowest absolute bias (Figs. S1 and S9). This stronger performance may be linked to their more detailed treatment of land processes, including dynamic seasonality of leaf area index (LAI), interactive vegetation cover, and land use change, rather than relying on a fixed annual LAI cycle⁴⁵. EC-Earth3-Veg ranks as one of the least accurate models in terms of water limitation (Fig. S1), even though it estimates ALWSC comparatively well (Fig. S9). This discrepancy likely stems from its limited accuracy in simulating EF (Fig. 3). MIROC6 has the highest overestimation of water limitation (Fig. S1), despite being the only model that overestimates ALWSC (Fig. S9). This is probably due to the model lacking a representation of terrestrial carbon-cycle processes and relying on prescribed vegetation properties (Table S1). These last two examples (with EC-Earth3-Veg and MIROC6) suggest that biases in water limitation are not necessarily related to biases in ALWSC. Overall, an accurate representation of water potentials, soil water uptake profiles, and LAI appears relevant for improving biases in water-limitation frequency and ALWSC.

Implications for predicting future climate on land

Our global comparison of state-of-the-art ESMs to observational estimates reveals an overestimation of water-limiting conditions. This bias hampers the model representations of both regional and global water cycles. For example, the Amazon is characterized by a high precipitation recycling ratio, as about one-third of the rainfall has previously evaporated from the Amazon itself⁴⁶. In this and other regions strongly reliant on terrestrial ET, exaggerated ET suppression could result in excessive drought self-intensification and self-propagation^12,47. Given that precipitation projections are uncertain due to both internal climate variability and the reliance on parameterizations at subgrid-scales¹, it is crucial to improve model fidelity of ET to prevent an amplification of this uncertainty through unrealistic land–atmosphere interactions. As the global hydrological cycle intensifies in response to our warming climate^48,49,50, biases in soil moisture-limitation of ET are likely to disproportionately affect the reliability of future projections.

Owing to the fundamental role of ALW in modulating not only water but also energy and carbon fluxes, model biases in water use and limitation propagate beyond the hydrological cycle. Soil moisture–temperature feedbacks are known to amplify hot extremes in most land areas⁵¹, which also emerges clearly in climate projections^52,53. In fact, it has recently been shown that across much of Europe, air temperature increases are outpaced by even stronger soil temperature trends, suggesting that “the heat comes from below”⁵⁴. It is challenging to reliably quantify the role of soil moisture–temperature coupling in a changing climate, but for certain regions, such as the wet tropics, including Amazonia, CMIP6 simulations point to a strong contribution of land feedbacks to extreme heat⁵⁵. Recent work, again based on CMIP6 model experiments, indicates that strong land-atmosphere coupling will become more widespread under increasing atmospheric CO₂, suggesting an amplification of future climate sensitivity to such feedbacks⁵⁶. These findings distinctly rely on the ability of the CMIP6 multi-model ensemble to adequately capture the interactions between land and atmosphere, yet our results indicate systematic deficiencies with respect to how the land surface models make use of the available subsurface water and how they respond to drought conditions. As such, targeted efforts to improve the representation of these processes in climate models would likely enable more accurate projections of hot and dry extremes.

We remark that in certain regions, e.g., Eastern North America, Northern Europe, and India, the analyzed CMIP6 models underestimate the frequency of water limitation (Fig. 1). Consequently, in those regions, increases in both the occurrence and magnitude of future heatwaves could be underestimated by current state-of-the-art ESMs. Individual hot and dry events can undo several years’ worth of net carbon uptake at regional scales⁵⁷, and global soil moisture variability has been shown to dictate the strength of the terrestrial carbon sink^10,11,58, which in turn largely governs the fraction of anthropogenic CO₂ emissions remaining in the atmosphere. Due to this inherent link between land carbon sequestration and climate extremes, model improvements of both subsurface water utilization and limitation could also reduce the intermodel uncertainty of carbon uptake and hence long-term climate projections.

In this study, we identify an overestimation of water limitation frequency across CMIP6 models compared to observations, and analyze how it relates to ALWSC, indicating a promising avenue for upcoming model development. Our analysis illustrates the challenges ESMs face in accurately capturing the specificities of the land water cycle, with implications for the simulated land water, energy, and carbon fluxes. Future work to refine land surface models is poised to benefit from a simulation environment that offers observational constraints to attribute model biases⁵⁹, from novel proximal remote sensing techniques⁶⁰, and from model outputs at higher temporal resolutions⁶¹. Insights from these developments can improve how models represent plant uptake and use of water across biomes and seasons.

Methods

Data sets

This study investigates how water limitation of evapotranspiration and ALWSC are represented across nine CMIP6 models (Table S1)². We select these nine CMIP6 models because they were the largest set for which all required variables were available from the ETH Zurich CMIP6 next generation archive⁶². Our focus is on the land-hist experiment within the Land Model Intercomparison Project (LMIP), which consists of global land-only offline simulations driven with observational atmospheric forcing over a historical interval, improving snow and soil moisture estimates. Sharing the same configuration of historical simulations of the parent model within CMIP6, the land-hist experiment is conceived for diagnosing systematic biases within the land component of ESMs¹⁸.

We use the CMIP6 Land Model Intercomparison Project (LMIP) land-hist experiments to ensure that all models are forced by the same, observation-based meteorological inputs. This design removes a major source of inter-model variability—differing precipitation and incoming radiation—and helps us focus on how each land surface model responds to water limitation and regulates evapotranspiration (ET) and subsurface water storage. Although land-hist runs do not include land-to-atmosphere coupling, they do incorporate oceanic moisture fluxes implicitly through reanalysis-based forcing data. In contrast, fully coupled CMIP historical simulations include land–atmosphere–ocean feedbacks but have model-specific atmospheric forcing that can diverge substantially from observations, complicating the attribution of ET and water-storage biases to the land model itself. However, we should be aware that forcing LSMs with observed meteorological conditions can introduce other issues, because these models are typically tuned to run with their ‘native’ coupled atmosphere—one that may carry biases relative to observations (even disregarding internal variability). By using LMIP land-hist, we can more directly compare model performance against reference datasets under a common and realistic atmospheric forcing framework. As illustrated in the Supplementary Figs. (e.g., Figs. S3 and S7), our comparisons with the fully coupled CMIP6 historical runs show broadly consistent results, yet reinforce the added clarity in bias attribution when atmospheric forcing is fixed in the LMIP setup.

To benchmark CMIP6 models, we use several observational datasets. We use SIF from version 2.6 of the Global Ozone Monitoring Experiment-2 (GOME-2)⁶³ as a proxy of photosynthetic activity (Fig. 1), consistent with previous studies^41,63,64. Monthly means are calculated, retaining days when the effective cloud fraction is <30%⁴¹. SIF is a complementary process of photosynthesis, and it is thus directly related to the photosynthetic rate⁶⁵. In addition, we use total water storage (TWS) data from the Gravity Recovery and Climate Experiment (GRACE)²⁹. TWS accounts for soil moisture, groundwater, surface water, snow, and ice. To complement our analysis, we use eddy-covariance data from the FLUXNET2015 dataset⁶⁶ together with soil moisture simulated with a bucket-type soil water balance model driven by in-situ measurements²³. In Figs. S2 and S8, we also use two data products that assimilate GRACE TWS, namely: GLDAS_CLSM025_DA1_D^19,21,22 and GLWS2.0²⁰. The key advantage of the GRACE dataset lies in its foundation on mass balance principles, ensuring its water balance aligns with that of CMIP6 models. Both CMIP6 models and GRACE operate on this principle, providing consistency in their approach to water balance, despite the CMIP6 models likely not capturing all physical processes contributing to land water storage variations. We use CMIP6 data from the ETH Zürich CMIP6 next generation (CMIP6ng) archive⁶², which adds extra validation for processed variables and consistency among files from different sources. We retained pixels with vegetated land using a global land cover dataset from MODIS⁶⁷. To group the vegetated land pixels of the world in meaningful climatic regions (Fig. 2), we use the fourth version of the IPCC WGI reference regions⁶⁸. Although the non-CMIP6 data products had higher spatial resolution, all datasets were resampled to the CMIP6 grid using area-weighted averaging. For land-cover data, after averaging, CMIP6 grid cell was classified as vegetated when more than 50% of its underlying high-resolution pixels were classified as vegetation. All analyses were performed using R Statistical Software⁶⁹. To access all code and R packages used in this study, please refer to our published repository on GitHub and Zenodo (see “Data Availability” section).

Determining water limitation thresholds globally with monthly data

We studied the evaporative fraction (EF) as a function of the total-column soil moisture (SM, variable ‘mrso’) using monthly data from CMIP6 models. EF was calculated as the ratio of latent heat flux to net radiation:

$${EF}=\frac{{latent\; heat\; flux}}{{R}_{n}}=\frac{{{{\rm{hfls}}}}}{\left({{{\rm{rsds}}}}-{{{\rm{rsus}}}}\right)+({{{\rm{rlds}}}}-{{{\rm{rlus}}}})}$$

(1)

where “hfls” (W m⁻²) is latent heat flux from CMIP6 and “rsds,” “rsus,” “rlds,” and “rlus” were respectively incoming and outgoing shortwave radiation and incoming and outgoing longwave radiation (W m⁻²), also from CMIP6.

We retained data from all months with R_n > 75 W m⁻² to focus on the growing season, effectively removing colder winter months at high latitudes. We then fitted a segmented linear regression with one breakpoint (i.e., “linear-plus-plateau model”^70,71) to the EF vs SM relationship at each pixel, using R package “segmented”⁷². The pixel-specific estimate of the breaking point θ_crit was determined by least-square fit; its value represents the SM threshold up to which EF increases linearly as a function of SM (water-limited regime)^9,70,71. The percentage of time under SM limitation was calculated as the ratio of the number of months with SM < θ_crit divided by the total number of months (Fig. S10b). Note that some pixels were excluded from the analysis (gray areas in Fig. 1), given that too little data points (fewer than 25, i.e., at least 2 years of data at monthly resolution) remained to fit the linear regression after applying the growing season filter. The global observational map shown in Fig. 1 was created with GOME-2 SIF⁶³ data and TWS data from GRACE²⁹ as a proxy of ALW. We focused on monthly data from the growing season by retaining months with SIF values greater than or equal to half of the pixel-specific SIF maximum¹¹. This filter mainly excludes months when vegetation is not active at high latitudes, similarly to the R_n filter applied to the models. To derive a metric comparable to EF, we use SIF data divided by its pixel-specific maximum value, as in previous studies^11,73. We then proceeded to calculate the frequency of water limitation as described above.

Total-column soil moisture and TWS values were scaled to the 0–1 unit interval using pixel-specific min-max normalization to allow for direct comparison between GRACE and CMIP6 datasets. Both model and observational analyses were limited to the period from January 2007 to December 2014 (8 years), based on the availability of the observational and modeled datasets.

We do not extend our analysis of water limitation to the intercept and slope from the segmented regression given the different variables used for the models and observations, as well as the high sensitivity of both the intercept and slope to the underlying assumptions of the segmented regression and the quantity of data points included in the analysis.

Determining water limitation thresholds at flux tower locations with daily data

We repeated the EF vs SM analysis outlined in the preceding section at the site-scale, using FLUXNET2015 daily data at selected sites (Figs. 3 and S4 and S5). We calculated EF using FLUXNET2015 data as ${EF}=\frac{{latent\; heat\; flux}}{{R}_{n}}$. Due to inconsistencies of measured soil moisture at several FLUXNET2015 sites³⁶, we simulated soil moisture at eddy-covariance locations with SPLASH, a bucket-type soil water balance model based on a Priestley-Taylor formulation for ET estimation, with water-holding capacity set to 220 mm^23,74. Soil moisture values were scaled to the 0–1 unit interval using pixel-specific min-max normalization to allow for direct comparison between FLUXNET and CMIP6 datasets. We focused on the growing season by retaining site-days with observed gross primary productivity (GPP) equal or greater than half of the site-specific maximum. We extracted EF and SM data at FLUXNET2015 locations using daily datasets from 2000 to 2014. We used daily LMIP-CMIP6 data (available only for models UKESM1-0-LL, IPSL-CM6A-LR, EC-Earth3-Veg, and CNRM-ESM2-1) and focused on the grid cells corresponding to the FLUXNET2015 sites for comparison. We determined the critical threshold θ_crit and calculated the percentage of days when SM was less than θ_crit relative to the total number of days. When the decrease in EF from its maximum value to the y-axis intercept was less than 0.3, we assigned a status of no water limitation (NA) to avoid misinterpreting noise as water limitation.

Estimating active land water storage capacity

We estimated ALWSC as the maximum cumulative water deficit (CWD_max). For the observational benchmark, we used CWD_max derived from an ET estimate based on thermal infrared remote sensing²⁷, and precipitation from WATCH-WFDEI data. For the CMIP6 models, we derived the CWD as the annual cumulative difference in evapotranspiration (ET) and precipitation (P) at the monthly resolution, focusing on continuous dry periods, i.e., periods where the difference P–ET was negative^27,36. We also assessed ALWSC by computing the long-term maximum annual soil moisture depletion in CMIP6 models, by using the total-column soil moisture (variable “mrso”) at the monthly resolution. This LMIP variable includes moisture from all soil layers in the model. In each grid cell, we estimated the maximum depletion of total-column soil moisture (ΔSM_max) by first calculating the difference between the highest and lowest total-column soil moisture monthly values in every year (Fig. S10a). We then identify the greatest annual difference across all analyzed years:

$$\Delta S{M}_{\max }={\max \left({\max \left({SM}\right)}_{{year}}-{\min \left({SM}\right)}_{{year}}\right)}_{{all} \, {years}}$$

(2)

For the maps in Fig. S8, we calculate the long-term maximum annual soil moisture depletion with estimates from GRACE, GLDAS_CLSM025_DA1_D^19,21,22, and GLWS2.0²⁰. The calculation was performed for the years 2003–2014, when data were simultaneously available for the used products and CMIP6. GRACE was converted from cm to mm, whereas total-column soil moisture was already available in Kg m⁻² (equivalent to mm H₂O). To visualize regional biases in CMIP6 predictions, we grouped the results of Fig. 4 by IPCC climate reference regions⁶⁸. We determined the mean of the ALWSC across all points within each region, using CMIP6 data. We then compared to the corresponding observational data for the same region (Fig. S6).

Data availability

All data used in this study are openly available: all intermediate data are available on Github: https://github.com/fgiardin/water_biases_CMIP6 and from the Zenodo Digital Repository: https://doi.org/10.5281/zenodo.10810324 under a CC-BY 4.0 license. LMIP-CMIP6 data (see Table S1 and Methods for details of the experiments and run IDs): https://esgf-node.llnl.gov/search/cmip6/ or directly from the ETH Zurich CMIP6 next generation archive: https://zenodo.org/records/3734128; GOME-2 SIF: https://avdc.gsfc.nasa.gov/pub/data/satellite/MetOp/GOME_F; MODIS land cover data: https://lpdaac.usgs.gov/products/mcd12c1v006/; Ecosystem fluxes and meteorological data: https://fluxnet.org/data/fluxnet2015-dataset/; Global estimates of maximum CWD determined from ALEXI and WATCH-WFDEI data, augmented using an extreme value distribution: https://zenodo.org/records/5515246; GRACE land data: https://grace.jpl.nasa.gov/data/get-data/jpl_global_mascons/; GLDAS_CLSM025_DA1_D data product: https://disc.gsfc.nasa.gov/datasets/GLDAS_CLSM025_DA1_D_2.2/summary; GLWS2.0 data product: https://doi.org/10.1594/PANGAEA.954742.

Code availability

All computer code that supports this study is available on Github: https://github.com/fgiardin/water_biases_CMIP6 and from the Zenodo Digital Repository: https://doi.org/10.5281/zenodo.10810324 under a CC-BY 4.0 license.

References

Seneviratne, S. I. et al. Weather and Climate Extreme Events in a Changing Climate. In Climate Change 2021: The Physical Science Basis. Contribution of Working Group I to the Sixth Assessment Report of the Intergovernmental Panel on Climate Change 1513–1766 (Cambridge University Press, Cambridge, United Kingdom and New York, NY, USA, 2021). https://doi.org/10.1017/9781009157896.013.
Eyring, V. et al. Overview of the coupled model intercomparison project phase 6 (CMIP6) experimental design and organization. Geosci. Model Dev. 9, 1937–1958 (2016).
Article Google Scholar
Meinshausen, M. et al. The shared socio-economic pathway (SSP) greenhouse gas concentrations and their extensions to 2500. Geosci. Model Dev. 13, 3571–3605 (2020).
Article CAS Google Scholar
Tebaldi, C. & Knutti, R. The use of the multi-model ensemble in probabilistic climate projections. Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences 365, 2053–2075 (2007).
Tokarska, K. B. et al. Past warming trend constrains future warming in CMIP6 models. Sci. Adv. 6, 9549–9567 (2020).
Article Google Scholar
Qiao, L., Zuo, Z. & Xiao, D. Evaluation of soil moisture in CMIP6 simulations. J. Clim. 35, 779–800 (2022).
Article Google Scholar
Wang, Z., Zhan, C., Ning, L. & Guo, H. Evaluation of global terrestrial evapotranspiration in CMIP6 models. Theor. Appl. Climatol. 143, 521–531 (2021).
Article Google Scholar
Fu, Z. et al. Global critical soil moisture thresholds of plant water stress. Nat. Commun. 15, 1–13 (2024).
Google Scholar
Seneviratne, S. I. et al. Investigating soil moisture-climate interactions in a changing climate: a review. Earth Sci. Rev. 99, 125–161 (2010).
Article CAS Google Scholar
Humphrey, V. et al. Sensitivity of atmospheric CO2 growth rate to observed changes in terrestrial water storage. Nature 560, 628–631 (2018).
Article CAS Google Scholar
Green, J. K. et al. Large influence of soil moisture on long-term terrestrial carbon uptake. Nature 565, 476–479 (2019).
Article CAS Google Scholar
Miralles, D. G., Gentine, P., Seneviratne, S. I. & Teuling, A. J. Land–atmospheric feedbacks during droughts and heatwaves: state of the science and current challenges. Ann. N. Y Acad. Sci. 1436, 19–35 (2019).
Article Google Scholar
Budyko, M. I. Climate and Life (Academic Press, 1974).
Humphrey, V., Gudmundsson, L. & Seneviratne, S. I. Assessing global water storage variability from GRACE: trends, seasonal cycle, subseasonal anomalies and extremes. Surveys Geophys. 37, 357–395 (2016).
Schwingshackl, C., Hirschi, M. & Seneviratne, S. I. Quantifying spatiotemporal variations of soil moisture control on surface energy balance and near-surface air temperature. J. Clim. 30, 7105–7124 (2017).
Article Google Scholar
García-García, A., Cuesta-Valero, F. J., Beltrami, H. & Smerdon, J. E. Characterization of air and ground temperature relationships within the CMIP5 historical and future climate simulations. J. Geophys. Res. Atmos.124, 3903–3929 (2019).
Article Google Scholar
Sippel, S. et al. Refining multi-model projections of temperature extremes by evaluation against land-Atmosphere coupling diagnostics. Earth Syst. Dyn. 8, 387–403 (2017).
Article Google Scholar
Van Den Hurk, B. et al. LS3MIP (v1.0) contribution to CMIP6: the land surface, snow and soil moisture model intercomparison project - aims, setup and expected outcome. Geosci. Model Dev. 9, 2809–2832 (2016).
Article Google Scholar
Save, H., Bettadpur, S. & Tapley, B. D. High-resolution CSR GRACE RL05 mascons. J. Geophys. Res. Solid Earth 121, 7547–7569 (2016).
Article Google Scholar
Gerdener, H., Kusche, J., Schulze, K., Döll, P. & Klos, A. The global land water storage data set release 2 (GLWS2.0) derived via assimilating GRACE and GRACE-FO data into a global hydrological model. J. Geod. 97, 1–18 (2023).
Article Google Scholar
Li, B. et al. Global GRACE data assimilation for groundwater and drought monitoring: advances and challenges. Water Resour. Res. 55, 7564–7586 (2019).
Article Google Scholar
Rodell, M. et al. The global land data assimilation system. Bull. Am. Meteorol. Soc. 85, 381–394 (2004).
Article Google Scholar
Davis, T. W. et al. Simple process-led algorithms for simulating habitats (SPLASH v.1.0): robust indices of radiation, evapotranspiration and plant-available moisture. Geosci. Model Dev. 10, 689–708 (2017).
Article Google Scholar
Hain, C. R. & Anderson, M. C. Estimating morning change in land surface temperature from MODIS day/night observations: applications for surface energy balance modeling. Geophys. Res. Lett. 44, 9723–9733 (2017).
Article Google Scholar
Anderson, M. C., Norman, J. M., Diak, G. R., Kustas, W. P. & Mecikalski, J. R. A two-source time-integrated model for estimating surface fluxes using thermal infrared remote sensing. Remote Sens Environ. 60, 195–216 (1997).
Article Google Scholar
Weedon, G. P. et al. The WFDEI meteorological forcing data set: WATCH Forcing Data methodology applied to ERA-Interim reanalysis data. Water Resour. Res. 50, 7505–7514 (2014).
Article Google Scholar
Stocker, B. D. et al. Global patterns of water storage in the rooting zones of vegetation. Nat. Geosci. 16, 250–256 (2023).
CAS Google Scholar
Liu, L. et al. Increasingly negative tropical water–interannual CO2 growth rate coupling. Nature 618, 755–760 (2023).
Article CAS Google Scholar
Landerer, F. W. & Swenson, S. C. Accuracy of scaled GRACE terrestrial water storage estimates. Water Resour. Res. 48, 4 (2012).
Humphrey, V., Rodell, M. & Eicker, A. Using satellite-based terrestrial water storage data: a review. Surv. Geophys.44, 1489–1517 (2023).
Article Google Scholar
Zhao, M., A, G., Liu, Y. & Konings, A. G. Evapotranspiration frequently increases during droughts. Nat. Clim. Chang 12, 1024–1030 (2022).
Article Google Scholar
Dong, J., Lei, F. & Crow, W. T. Land transpiration-evaporation partitioning errors responsible for modeled summertime warm bias in the central United States. Nat. Commun. 13, 1–8 (2022).
CAS Google Scholar
Berg, A., Sheffield, J. & Milly, P. C. D. Divergent surface and total soil moisture projections under global warming. Geophys. Res. Lett. 44, 236–244 (2017).
Article Google Scholar
Ukkola, A. M. et al. Land surface models systematically overestimate the intensity, duration and magnitude of seasonal-scale evaporative droughts. Environ. Res. Lett. 11, 104012 (2016).
Article Google Scholar
Teuling, A. J., Seneviratne, S. I., Williams, C. & Troch, P. A. Observed timescales of evapotranspiration response to soil moisture. Geophys. Res. Lett. 33, 0–4 (2006).
Article Google Scholar
Giardina, F., Gentine, P., Konings, A. G., Seneviratne, S. I. & Stocker, B. D. Diagnosing evapotranspiration responses to water deficit across biomes using deep learning. N. Phytol. 240, 968–983 (2023).
Article Google Scholar
Green, J. K., Berry, J., Ciais, P., Zhang, Y. & Gentine, P. Amazon rainforest photosynthesis increases in response to atmospheric dryness. Sci. Adv. 6, 47 (2020).
Costa, F. R. C., Schietti, J., Stark, S. C. & Smith, M. N. The other side of tropical forest drought: do shallow water table regions of Amazonia act as large-scale hydrological refugia from drought? New Phytol. https://doi.org/10.1111/nph.17914 (2022).
Pagán, B., Maes, W., Gentine, P., Martens, B. & Miralles, D. Exploring the potential of satellite solar-induced fluorescence to constrain global transpiration estimates. Remote Sens11, 413 (2019).
Article Google Scholar
Pan, Y. et al. A large and persistent carbon sink in the world’s forests. Science 333, 988–993 (2011).
Article CAS Google Scholar
Giardina, F. et al. Tall Amazonian forests are less sensitive to precipitation variability. Nat. Geosci. 11, 405–409 (2018).
Article CAS Google Scholar
Huntingford, C. et al. Simulated resilience of tropical rainforests to CO2-induced climate change. Nat. Geosci. 6, 268–273 (2013).
Article CAS Google Scholar
Kennedy, D. et al. Implementing plant hydraulics in the community land model version 5. J. Adv. Model Earth Syst. 11, 485–513 (2019).
Article Google Scholar
Lawrence, D. M. et al. The Community land model version 5: description of new features, benchmarking, and impact of forcing uncertainty. J. Adv. Model Earth Syst. 11, 4245–4287 (2019).
Article Google Scholar
Boucher, O. et al. Presentation and evaluation of the IPSL-CM6A-LR climate model. J. Adv. Model Earth Syst. 12, e2019MS002010 (2020).
Article Google Scholar
Dominguez, F. et al. Amazonian moisture recycling revisited using WRF with water vapor tracers. J. Geophys. Res. Atmos. 127, e2021JD035259 (2022).
Article Google Scholar
Schumacher, D. L., Keune, J., Dirmeyer, P. & Miralles, D. G. Drought self-propagation in drylands due to land–atmosphere feedbacks. Nat. Geosci. 15, 262–268 (2022).
Article CAS Google Scholar
Allen, M. R. & Ingram, W. J. Constraints on future changes in climate and the hydrologic cycle. Nature 419, 224–232 (2002).
Koutsoyiannis, D. Revisiting the global hydrological cycle: is it intensifying?. Hydrol. Earth Syst. Sci. 24, 3899–3932 (2020).
Article Google Scholar
Akhoudas, C. H. et al. Isotopic evidence for an intensified hydrological cycle in the Indian sector of the Southern Ocean. Nat. Commun. 14, 1234567890 (2023).
Article Google Scholar
Mueller, B. & Seneviratne, S. I. Hot days induced by precipitation deficits at the global scale. Proc. Natl. Acad. Sci. USA 109, 12398–12403 (2012).
Article CAS Google Scholar
Vogel, M. M. et al. Regional amplification of projected changes in extreme temperatures strongly controlled by soil moisture-temperature feedbacks. Geophys. Res. Lett. 44, 1511–1519 (2017).
Article Google Scholar
Vogel, M. M., Zscheischler, J. & Seneviratne, S. I. Varying soil moisture-atmosphere feedbacks explain divergent temperature extremes and precipitation projections in central Europe. Earth Syst. Dyn. 9, 1107–1125 (2018).
Article Google Scholar
García-García, A. et al. Soil heat extremes can outpace air temperature extremes. Nat. Clim. Chang 13, 1237–1241 (2023).
Article Google Scholar
Dirmeyer, P. A., Sridhar Mantripragada, R. S., Gay, B. A. & Klein, D. K. D. Evolution of land surface feedbacks on extreme heat: adapting existing coupling metrics to a changing climate. Front. Environ. Sci. 10, 949250 (2022).
Article Google Scholar
Hsu, H. & Dirmeyer, P. A. Soil moisture-evaporation coupling shifts into new gears under increasing CO2. Nat. Commun. 14, 1–CO9 (2023).
Article Google Scholar
Ciais, P. et al. Europe-wide reduction in primary productivity caused by the heat and drought in 2003. Nature 437, 529–533 (2005).
Article CAS Google Scholar
Padrón, R. S., Gudmundsson, L., Liu, L., Humphrey, V. & Seneviratne, S. I. Drivers of intermodel uncertainty in land carbon sink projections. Biogeosciences 19, 5435–5448 (2022).
Article Google Scholar
Abramowitz, G. et al. On the predictability of turbulent fluxes from land: PLUMBER2 MIP experimental description and preliminary results. https://doi.org/10.5194/EGUSPHERE-2023-3084 (2024).
Pierrat, Z. A. et al. Proximal remote sensing: an essential tool for bridging the gap between high-resolution ecosystem monitoring and global ecology. New Phytol. https://doi.org/10.1111/NPH.20405 (2025).
Findell, K. L. et al. Accurate assessment of land–atmosphere coupling in climate models requires high-frequency data output. Geosci. Model Dev. 17, 1869–1883 (2024).
Article Google Scholar
Brunner, L., Hauser, M. & Lorenz, R. & Beyerle, U. The ETH Zurich CMIP6 next generation archive: technical documentation. https://doi.org/10.5281/zenodo.3734128 (2020).
Joiner, J. et al. Global monitoring of terrestrial chlorophyll fluorescence from moderate-spectral-resolution near-infrared satellite measurements: methodology, simulations, and application to GOME-2. Atmos. Meas. Tech. 6, 2803–2823 (2013).
Article Google Scholar
Frankenberg, C. et al. New global observations of the terrestrial carbon cycle from GOSAT: patterns of plant fluorescence with gross primary productivity. Geophys. Res. Lett. 38, 1–6 (2011).
Article Google Scholar
Porcar-Castell, A. et al. Linking chlorophyll a fluorescence to photosynthesis for remote sensing applications: Mechanisms and challenges. J. Exp. Bot. 65, 4065–4095 (2014).
Article CAS Google Scholar
Pastorello, G. et al. The FLUXNET2015 dataset and the ONEFlux processing pipeline for eddy covariance data. Sci. Data 7, 225 (2020).
Article Google Scholar
Friedl, M. A. et al. MODIS Collection 5 global land cover: algorithm refinements and characterization of new datasets. Remote Sens Environ. 114, 168–182 (2010).
Article Google Scholar
Iturbide, M. et al. An update of IPCC climate reference regions for subcontinental analysis of climate model data: definition and aggregated datasets. Earth Syst. Sci. Data 12, 2959–2970 (2020).
Article Google Scholar
R Core Team. R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. URL https://www.R-project.org/ (2023).
Fu, Z. et al. Critical soil moisture thresholds of plant water stress in terrestrial ecosystems. Sci. Adv. 8, 7827 (2022).
Article Google Scholar
Fu, Z. et al. Uncovering the critical soil moisture thresholds of plant water stress for European ecosystems. Glob. Chang Biol. 28, 2111–2123 (2022).
Article CAS Google Scholar
Muggeo, V. M. R. Estimating regression models with unknown break-points. Stat. Med. 22, 3055–3071 (2003).
Article Google Scholar
Giardina, F., Seneviratne, S. I., Liu, J., Stocker, B. D. & Gentine, P. Groundwater rivals aridity in determining global photosynthesis. Preprint at https://doi.org/10.21203/RS.3.RS-3793488/V1 (2024).
Orth, R., Koster, R. D. & Seneviratne, S. I. Inferring soil moisture memory from streamflow observations using a simple water balance model. J. Hydrometeorol. 14, 1773–1790 (2013).
Article Google Scholar

Download references

Acknowledgements

The authors thank the providers of the data sets used in this study. In particular, we want to acknowledge the FLUXNET community for their role in making the FLUXNET2015 dataset globally available.

Funding

Open access funding provided by Swiss Federal Institute of Technology Zurich.

Author information

Authors and Affiliations

Institute for Atmospheric and Climate Science, Department of Environmental Systems Science, ETH Zurich, Zürich, Switzerland
Francesco Giardina, Ryan S. Padrón, Dominik L. Schumacher & Sonia I. Seneviratne
Swiss Federal Institute for Forest, Snow and Landscape Research WSL, Birmensdorf, Switzerland
Ryan S. Padrón
Institute of Geography, University of Bern, Bern, Switzerland
Benjamin D. Stocker
Oeschger Centre for Climate Change Research, University of Bern, Bern, Switzerland
Benjamin D. Stocker

Authors

Francesco Giardina
View author publications
Search author on:PubMed Google Scholar
Ryan S. Padrón
View author publications
Search author on:PubMed Google Scholar
Benjamin D. Stocker
View author publications
Search author on:PubMed Google Scholar
Dominik L. Schumacher
View author publications
Search author on:PubMed Google Scholar
Sonia I. Seneviratne
View author publications
Search author on:PubMed Google Scholar

Contributions

F.G. wrote the main manuscript in collaboration with R.S.P. and D.L.S.; F.G. prepared figures; F.G. performed the analysis in collaboration with R.S.P.; F.G. designed the study with contributions from R.S.P., B.D.S., D.L.S. and S.I.S.; F.G., R.S.P., B.D.S., D.L.S. and S.I.S. reviewed and edited the manuscript.

Corresponding author

Correspondence to Francesco Giardina.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Communications Earth and Environment thanks A Al-Yaari and Laura Jensen and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Primary Handling Editors: Rodolfo Nóbrega and Alireza Bahadori. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Transparent Peer Review file

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Giardina, F., Padrón, R.S., Stocker, B.D. et al. Large biases in the frequency of water limitation across Earth system models. Commun Earth Environ 6, 469 (2025). https://doi.org/10.1038/s43247-025-02426-7

Download citation

Received: 24 July 2024
Accepted: 29 May 2025
Published: 16 June 2025
DOI: https://doi.org/10.1038/s43247-025-02426-7