A Global-Scale Time Series Dataset for Groundwater Studies within the Earth System

Bäthge, Annemarie; Vargas, Claudia Ruz; Lischeid, Gunnar; Collenteur, Raoul; Cuthbert, Mark; Fleckenstein, Jan; Flörke, Martina; de Graaf, Inge; Gnann, Sebastian; Hartmann, Andreas; Huggins, Xander; Moosdorf, Nils; Wada, Yoshihide; Wagener, Thorsten; Reinecke, Robert

doi:10.1038/s41597-026-06966-1

Download PDF

Data Descriptor
Open access
Published: 09 March 2026

A Global-Scale Time Series Dataset for Groundwater Studies within the Earth System

Scientific Data volume 13, Article number: 401 (2026) Cite this article

3964 Accesses
7 Altmetric
Metrics details

Subjects

Abstract

Groundwater is a central component of the Earth system. However, our understanding of how it is dynamically interlinked with the atmosphere, hydrosphere, cryosphere, biosphere, geosphere, and anthroposphere remains limited. In the pursuit of understanding groundwater dynamics across diverse global settings, we present GROW (the global-scale integrated GROundWater package). This analysis-ready, quality-controlled dataset combines depth to groundwater and level time series from 55 countries, 91% from North America, India, Europe, and Australia, with associated Earth system variables. The dataset contains >200,000 time series with either daily, monthly, or yearly temporal resolution, accompanied by 36 time series or static attributes of meteorological, hydrological, geophysical, vegetation, and anthropogenic variables (e.g., precipitation, drainage density, rock type, NDVI, land use). 34 data flags regarding well features (e.g., coordinates and country), as well as time series characteristics (e.g., gap fraction or autocorrelation), facilitate quick data filtering. GROW provides a foundation for understanding large-scale groundwater processes in space and time, as well as for calibrating and evaluating models that simulate groundwater dynamics within the Earth system.

Multi-dimensional scaling for space-time transformation to achieve sustainable planning and management of water resource under changing land use pattern

Article Open access 07 January 2025

Appraisal of groundwater susceptibility to pollution using hybrid framework of stable isotopes and indexed approach

Article Open access 18 December 2025

Constraining the response of continental-scale groundwater flow to climate change

Article Open access 16 March 2022

Background & Summary

Access to groundwater is pivotal for humans and ecosystems that depend on this freshwater source^1,2,3. Groundwater is the largest store of unfrozen freshwater on Earth, accounting for approximately 99% of the Earth’s accessible freshwater³. It is estimated to contribute between 19% (PCR-GLOBWB, period 2000–2015⁴) and 22% (WaterGAP v2.2e, period 1991–2019⁵) of total global water withdrawal. Beyond its role as a source for anthropogenic water consumption, groundwater plays a key role for biodiversity. It sustains various ecosystems, especially phreatophytic vegetation in drylands, surface waters, and wetlands⁶. Saccò et al.² estimated that 75% of habitable land area is ecologically interconnected with groundwater. Ecosystems that rely on groundwater, also called groundwater-dependent ecosystems⁷, provide multiple benefits. They regulate climate, floods, water quality, host biodiversity and have recreational and cultural value^2,8.

While critical to humans and ecosystems, groundwater accessibility is threatened by human-driven changes in hydrological storages and fluxes. Among others, groundwater accessibility is limited by the depth of the water table. When water tables decline, groundwater may become inaccessible for anthropogenic water supply⁹ and groundwater-dependent ecosystems⁶. Rohde et al.⁶ showed that 53% of groundwater-dependent ecosystems in drylands are at risk from declining regional water tables. One of the biggest threats is over-abstraction, which has led to widespread groundwater depletion^3,10,11. Based on estimations for 2010, 70% of global groundwater abstraction was attributed to irrigation¹. The pressure that irrigation exerts on groundwater accessibility was highlighted by Jasechko et al.¹². They¹² found that aquifer systems with water tables in rapid decline are most common in agricultural drylands. Consequently, it is not surprising that the water scarcity hotspots in the world occur in arid and semiarid regions where groundwater withdrawal for irrigation is prevalent (i.e., Pakistan, northeastern China, and the Middle East)^11,13. Humans further impact water tables through land use changes, e.g., urbanization, wetland loss, and conversion to agricultural land^11,14,15. This, in turn, changes infiltration¹⁶, interception, evapotranspiration¹⁷, runoff¹⁴ and recharge^11,17. Additionally, climate change and its consequences alter the water cycle in manifold ways (e.g., soil moisture depletion¹⁸, sea level rise^18,19, melting of glaciers, and thawing of permafrost¹⁷). However, the quantification of these impacts on groundwater systems, and their variation in space and time, remain underevaluated^3,20,21,22.

To better understand how these changes may affect water table dynamics, we must investigate groundwater as an integral part of the Earth system^23,24,25,26. Studying groundwater not in isolation, but together with environmental variables within the Earth system (here called Earth system variables), reveals their interactions with and influence on groundwater. In this way, controls behind water table dynamics can be addressed and their respective importance compared²⁴. Groundwater’s embeddedness within the Earth system suggests that its dynamics may be shaped more by the configuration of multiple Earth system variables across these interconnected systems than by individual system properties²⁷. We understand that the Earth system is not only composed of natural components like the atmosphere, geosphere, hydrosphere, cryosphere, and biosphere²⁸ but also includes the anthroposphere¹⁴.

In Table 1, we present relationships of groundwater within different Earth system components that have been identified as crucial for quantitative groundwater dynamics. The selection of the Earth system variables is rooted in existing literature and limited due to data availability. To provide a clear overview, the Earth system variables are categorized into six components: Atmosphere (1), Geosphere (2), Hydrosphere (3), Cryosphere (4), Biosphere (5), and Anthroposphere (6).

Table 1 Overview of prominent relationships that highlight groundwater’s interconnectedness within the Earth system.

Full size table

The general understanding of the complex interconnectedness of groundwater within the Earth system is still incomplete^17,20,29. The knowledge gaps include a lack of observations in environments like high-latitudes^17,30 and regions without agricultural influence²⁷, a lack of knowledge regarding environmental characteristics like the spatial heterogeneity of the subsurface hydraulic conductivity^31,32 and insufficient knowledge about measurement errors³³. Groundwater modeling is constrained by the uncertainty regarding processes and state variables, particularly evident for input and boundary conditions. Other restricting factors in modelling are the oversimplification of processes and the lack of clarity regarding the optimal implementation^29,33,34. These limitations contribute to highly uncertain groundwater simulations^17,34.

In situ measurements are key to closing these knowledge gaps^17,23,35, because they provide the closest approximation of ‘ground truth’. In combination with variables that are driving and influencing groundwater dynamics, new groundwater-related relationships can be uncovered^17,29,36. This enhances our integrated process understanding of groundwater within the Earth system³⁶ and, consequently, can lead to improved conceptual/perceptual models. Other than that, global models could highly benefit from global time-varying groundwater datasets for parametrization and model evaluation³⁴. A dataset built for these purposes should preferably:

(1)
be a global-scale dataset that captures the natural variability of the included variables, which enables better intercomparison between regions³⁷ and helps reveal large-scale processes and connections^3,11,23,38. It can be used to develop, calibrate, and evaluate large-scale hydrological, land-surface, and climate models^29,35,39;
(2)
contain time series data that provide insights into temporal patterns (e.g., seasonality), extreme events, regime shifts and cause-effect lags^{3,14,34,36,40};
(3)
combine groundwater observations with associated variables of the Earth system that can be used to explain groundwater dynamics^23,30,41. Huggins et al.³⁶ emphasized that there is an underrealized potential in combining existing groundwater and Earth system data to uncover relationships that have not yet been recognized;
(4)
include metadata to improve comparability and the assessment of data uncertainty according to best practices in large-sample datasets^20,29,37;
(5)
be standardized, harmonized and freely available according to the FAIR principles to enhance applicability and accessibility^36,37,42. The use of data is further incentivized by ‘analysis-ready’ data. A consistent temporal resolution, gap-filling and data characterizing flags significantly reduce preprocessing time for users.

A dataset that fulfills most of these aspects, but has a focus on streamflow data, is CARAVAN³⁹. It combines national collections of measured daily streamflow time series with meteorological forcing data from ERA5-Land⁴³ and catchment attributes from HydroAtlas⁴⁴. In situ groundwater time series are not included in CARAVAN. There are datasets that have been developed to understand quantitative groundwater dynamics. Regional sets of groundwater level time series accompanied with additional environmental information exist, e.g., for Chile²⁶, Switzerland⁴⁵ or the Baltic countries⁴⁶. But a comparison of these datasets without extensive preprocessing is difficult as they are standardized differently and contain different variables. At the global scale, static datasets such as Fan et al.³⁸ with depth to groundwater and Moeck et al.³⁰ with groundwater recharge provide steady-state groundwater records for many locations worldwide. The Global Groundwater Monitoring Network (GGMN)⁴⁷ contains groundwater level or depth time series from different regions of the world, but lacks an extensive set of associated Earth system variables. Consequently, past studies that analyzed observational data have either focused on steady-state data^48,49, which can only be used to a limited extent to understand temporal patterns, or have concentrated on regional^41,50,51,52 to continental scales^53,54,55. The results of regional to continental studies^52,55 can help us to understand global-scale patterns like atmospheric-ocean oscillations (e.g., climate teleconnections such as El Niño–Southern Oscillation)⁵². A global-scale dataset can provide even further insights as teleconnections, global interlinkages²⁹, and the diversity of environmental settings can be investigated with a consistent method. To our knowledge, Jasechko et al.¹² were the first to analyze temporally dynamic groundwater level data on a global scale in relation to two influencing variables: Aridity index and land fraction under cultivation. They focused on time series with yearly resolution. This leaves potential for more comprehensive studies, integrating more timescales and a wider set of Earth system variables.

Here, we present GROW (the global-scale integrated GROundWater package)⁵⁶, a quality-controlled dataset that accompanies groundwater depth and level time series from around the world with associated Earth system variables. GROW⁵⁶ is designed to enable large-sample spatiotemporal groundwater analysis without further preprocessing, making it ‘analysis-ready.’ The groundwater data included in GROW⁵⁶ have been sourced from the Global Groundwater Monitoring Network⁴⁷ and Groundwater Observations Repository⁵⁷ by the International Groundwater Resources Assessment Centre (IGRAC). The source data were temporally harmonized to either a daily, monthly, or yearly resolution to ensure equal time step intervals. Per time series, gap fraction and single gap lengths were constrained to stay below certain thresholds (10%). Remaining gaps were linearly filled. Records that indicate measurement errors were flagged. Duplicate time series and wells were discarded. Overall, GROW⁵⁶ contains 204,292 time series. 85% of the time series have a yearly resolution, 9% have a monthly resolution and 6% have a daily resolution. 51% of the time series are at least 10 years long. Covering all major Earth system components, the groundwater data are accompanied by 36 groundwater-associated variables (time series and attributes) that are categorized into: atmosphere (n = 6), geosphere (10), hydrosphere (2), cryosphere (4), biosphere (5) and anthroposphere (9) (see also Table 1). 34 data flags related to well features (e.g., location coordinates, country), as well as time series characteristics (e.g., time series length, trend direction, autocorrelation, total gap fraction), enable targeted filtering.

With GROW⁵⁶, we offer a dataset to the global community that can be used to understand spatiotemporal groundwater dynamics in the Earth system. Cumulative effects of multiple controls on groundwater time series can be studied in space and time. Processes conceptualized for specific environmental settings can be transferred to regions with no data but the same environmental conditions. GROW⁵⁶ offers great potential for tools like machine learning^35,51,58,59 and principal component analysis⁶⁰, which can be used to uncover hidden connections and test conceptual assumptions in large datasets. In addition to this, the harmonized groundwater time series in GROW⁵⁶ provide a ready-to-use calibration and validation dataset for global hydrological, land-surface, or climate modeling. We not only encourage hydrologists to use the dataset but also experts and scientists from other fields because groundwater dynamics feed back into other storages and fluxes of the Earth system. It should be noted that the dataset shows a spatial bias, with most wells located in North America (51% of the wells), India (17%), Europe (13%), and Australia (10%). Additional biases occur toward arid (26%) and temperate (58%) climates, low elevations (62% of the wells are located below 200 m altitude), and a higher anthropogenic impact (67% of the wells are located on anthropogenically used land). Furthermore, uncertainties from the utilized data products are inherited by the GROW dataset⁵⁶. The limited global representativeness and data uncertainties have implications for analysis. They are discussed in the Technical Validation section. Guidance on addressing these limitations is provided in the Usage Notes section.

Methods

Groundwater time series

The in situ groundwater time series in GROW⁵⁶ were derived from the global groundwater datasets Global Groundwater Monitoring Network (GGMN)⁴⁷ and Groundwater Observations Repository⁵⁷ hosted by the International Groundwater Resources Assessment Centre (IGRAC). For the GGMN dataset⁴⁷, IGRAC collects and shares monitoring groundwater data like groundwater level, groundwater quality parameters, and groundwater-related metadata from national and subnational authorities. This data is regularly updated. In addition, IGRAC maintains the Groundwater Observations Repository⁵⁷ that contains groundwater data from research studies and other sources, only added once. IGRAC standardizes the format of the data obtained from the provider and enriches it with descriptive data flags but does not further alter the groundwater records. These source datasets were used because they are currently the most comprehensive and publicly accessible collection of observed groundwater time series. Among others, groundwater time series from the study of Jasechko et al.¹² are partially included in Groundwater Observations Repository⁵⁷ and make up 86% of latter dataset. The two datasets together contain 251,396 time series (as of August 2025) and have heterogeneous temporal scales and reference point elevations. The groundwater records of a time series are either given as depth from the ground elevation to groundwater (86% in GROW⁵⁶), depth from the top of the well to groundwater (<1%), or groundwater level elevation above mean sea level (14%). We do not label the groundwater data as hydraulic head because of the following reasons. The depth to groundwater records would need to be transferred to groundwater level elevation, but the ground elevation is only specified by the original data provider for 24% of the groundwater depth time series in GROW⁵⁶. Other than that, the information on whether the aquifer is confined or unconfined is only provided for 6% of the time series in GROW⁵⁶. An estimation is not possible with the available data. For readability, we use ‘water table’ as a term in the rest of the article when referring to both ‘depth to groundwater’ and ‘groundwater level’ records. This does not affect the published data, which contains groundwater information for both reference points (either depth to groundwater or groundwater level). The reference point per time series is given in the attributes table of the final dataset. For clarity, each reference point is assigned its own column in the groundwater time series table. In addition to the time series, the GGMN⁴⁷ and Groundwater Observations⁵⁷ provide static attributes for each well, including the reference point, coordinates, country, data license, confinement, and more.

The preprocessing of the time series data consists of eleven steps (Table 2, steps 1–11). In addition, attribute data were checked for duplicates and erroneous coordinates (Table 3, steps 12–14). The individual preprocessing steps are described below. Tables 2, 3 provide an overview of the number and percentage of discarded or flagged time series and the data flags created during preprocessing. The Supplements Tables S.1-1, S.1-4 describe the effects of preprocessing on time series characteristics. Examples of discarded or flagged time series are provided in the Supplements Figures S.1-1–S.1-5.

1.
Discarding time series with fewer than 2 records

Empty records and duplicates are removed. In this study, time series are defined as containing a minimum of two records at different timestamps. This criterion is checked at the start and after every preprocessing step in which data are aggregated or removed.
2.
Reconciling records to ensure that only one groundwater reference point elevation is provided per time series

Records with more than one reference point in the same time series exist. 3.8% of the time series contain both depth to groundwater and groundwater level information for every timestamp. To ensure consistency and comparability, depth to groundwater records are removed from these time series, because we prefer groundwater level records. They have the same reference height (elevation above mean sea level) and can be compared directly between different wells.
3.
Removing numeric placeholders for missing records

Because the entries ‘- 999’ and’- 9,999’ are common to mark missing records and are below realistic value ranges, they are removed from groundwater level time series. Ground elevation at the well location and top of the well elevation which is ‘-999’ or ‘-9,999’ is replaced with ‘NA’ (Not available). Both data flags are provided by the original data provider for 24% (ground elevation) and <1% (top of the well elevation) of the wells.
4.
Temporal aggregation of time series to either daily, monthly, or yearly resolution

In the original data, a timestamp is provided for each record. However, it is unknown whether the water table was measured at that specific timestamp or if it represents the mean over a specific day, month, or year. For more straightforward analysis, the temporal resolution was harmonized to ensure equal time step intervals. The time series were aggregated to either daily, monthly, or yearly means. The rationale applied to this time step aggregation was: When a time series is aggregated, there should not be more than 10% of the time series with data gaps (missing days, months or years). Consequently, 90% of the time series should be at least in that certain resolution. Specifically, this means that 90% of the time intervals between the records should be at least daily to be classified as daily time series. In case of monthly time series, we decided that 90% of the time intervals should be at least 40 days. We made this decision because many time series have a loose monthly measurement schedule between 20 and 50 days. These would probably be classified as yearly time series with a threshold of 31 days although such a loose schedule will not necessarily cause gaps in the monthly data. We chose 40 days as a compromise so that only a record on the 22^nd day of a month or later followed by a 40 day time interval to the next record will lead to a gap (1 missing month). This also means that for monthly data more than 10% gaps are still possible in case of the described example. Every other time series where the 90th percentile of the time step intervals between the records is larger than 40 days is aggregated to a yearly resolution. For transparency, the number of records aggregated to daily, monthly, or yearly means is flagged in cases where time series do not solely contain the first day of the month [year] (e.g., 2010-01-01) or the zeroth hour of the day (e.g., 2010-01-01 00:00:00). In latter case, we assume that the records already represent aggregated means of multiple records whose number is unknown (aggregated_from_n_values = ‘NA’).
5.
Capping the gap fraction and gap length

GROW⁵⁶ provides an additional data field for every time series where gaps are linearly filled. To minimize the impact of the gap-filling on the statistical characteristics of the time series, the total gap fraction and gap length are capped accordingly. As the total gap fraction is determining most of the thresholds in the preprocessing (temporal aggregation, single gap length, plateau length), we performed a sensitivity analysis on the gap fraction and tested 10%, 20% and 30%. Based on this, we selected 10% because this threshold guarantees the highest data quality while resulting in only minor variations in processing outcomes and dataset characteristics across the different threshold settings. The total number of time series after preprocessing (steps 1–10) differed by fewer than 200. The proportions of the three temporal resolutions (daily, monthyl, yearly) varied by no more than 4%. With a threshold of 10%, there are 4% fewer daily or monthly time series after preprocessing compared to a threshold of 30%. The percentage of data lost due to capping of gap fraction and gap lengths varied by less than 1%. The full results of the sensitivity analysis are presented in the Supplements Table S.1-4.
1. a.
  The total allowed gap fraction is 10%. For time series that exceed this threshold, the longest sequence of years with less than 10% gaps is extracted for daily and monthly data. For yearly data, only the sequence without gaps is extracted.
2. b.
  The maximum allowed gap length is derived from the total gap fraction of 10%. We selected the thresholds so that the gap lengths are not longer than 10% of a month (daily data), year (monthly data), or the total time series length (yearly data). Consequently, the maximum allowed gap length is 3 days for daily data, 1 month for monthly data, 0 gaps for yearly data up to 9 years, 1 year for yearly data between 10 and 19 years long, and 2 years for time series that are 20 years or longer. To account for natural variance within time series, the allowed gap length can be smaller when a time series exhibits high scatter. Based on the autocorrelation of every time series, the allowed gap between two steps is determined to be the maximum time lag for which the autocorrelation is equal to or larger than 0.6 (spearman rank correlation coefficient⁶¹), considering only time lags up to the point where the correlation coefficient becomes negative for the first time. Still, the maximum gap length is the upper limit. If there is no autocorrelation exceeding 0.6 within the considered time lags, no gaps are allowed.The sequence is extracted in which the gaps do not exceed the individual gap length threshold, and the preprocessing is continued.
6.
Flagging negative depth to groundwater records

Depth to groundwater records that are negative, indicating that the hydraulic head is above ground, naturally exist in the case of flowing artesian wells⁶² or riparian zones⁶³. However, we cannot manually check all 2,528 wells with negative values to separate plausible data from potential measurement errors. Therefore, time series with some negative records and time series with solely negative records were flagged.
7.
Flagging autocorrelation of time series

The autocorrelation is calculated again.
8.
Flagging outliers and change points with DBSCAN

Outliers (noise or spikes⁶⁴) and change points (breaks - sudden increases or decreases of a variable⁶⁴) in time series can result from sensor failures. However, this can also be attributed to the natural variability of highly dynamic aquifers, weather extremes, or anthropogenic pumping. We flagged time series that potentially contain outliers and/or change points. The purpose of the flag is to be a first point of reference for anomalies (i.e. measurement errors, human impacts, extreme events). For actual time series repair, knowledge about the study area and visual inspection to validate detected outliers are highly recommended^59,60,65. Users who want to correct the time series can use the flag for prioritization. Instead of correction, the flagged time series could also be filtered out. We implemented an outlier and change point detection with the DBSCAN algorithm⁶⁶ as in Nolte et al.⁵⁹ DBSCAN is a density-based clustering method with noise⁶⁶. The algorithm groups the data based on their groundwater values in different clusters and noise. Every time series that contains one cluster and noise or multiple clusters, is flagged to contain outliers and/or change points. The single records assigned to noise or a minority cluster are flagged as outliers as well. The two determining parameters in DBSCAN are set to Eps = 2x standard deviation of time series and MinPts = 0.5x total number of records in time series. Eps defines the maximum distance between two data records to be considered neighbours. MinPts determines the number of neighbours required for a data record to be considered as core point⁶⁶.
9.
Flagging uninterrupted sequences of static water table values as potential measurement error

Uninterrupted sequences of the exact same water table (plateaus) are flagged in each time series with their plateau length. Like the flagging of outliers and change points, this indicates possible measurement errors. Plateaus are treated as potential data gaps and deemed problematic when they exceed thresholds for the maximum allowed gap length. Accordingly, sequences of the exact same water table in sequential time steps are flagged as plateaus starting from 4 days for daily data, 2 months for monthly data, 2 years for yearly data shorter than 20 years and 3 years for yearly data that is 20 years or longer.
10.
Calculating Mann-Kendall trend direction and Sen’s slope

The trend direction and the Sen’s slope, in case of a significant trend (p-value < 0.05), were derived for each time series. If the time series was flagged to be autocorrelated (see step 4), the Hamed and Rao Modified Mann-Kendall test was used to perform the trend analysis. Here, a variance correction is applied to account for serial autocorrelation⁶⁷. Otherwise, the original Mann-Kendall test from the python package pyMannKendall was utilized⁶⁸. Independent from the reference point, an increasing trend is associated with a rising water table and vice versa. Therefore, the trend direction of time series containing depth to groundwater records was switched (increasing < - > decreasing) so that, for example, a rising water table (decreasing depth) is flagged as ‘increasing’. Additionally, the sign of the trend slope is switched for these cases. It should be noted that the effect size and significance of a test should be critically scrutinized because of its dependence on sample size⁶⁹.
11.
Adding further data flags

Additional flags were generated per time series and added to the well attributes (see Table 2). That are: the mean and median water table per time series, the first and last date of a time series, the length of the time series in years and the median number of records in the original data that were aggregated to one record in GROW⁵⁶.
12.
Removing duplicates in well attributes

There are three types of duplicates apparent in the dataset. Duplicate by well ID and country (a), duplicate by location and groundwater time series (with different ID; b), and duplicate from the Jasechko et al.’s¹² dataset with same data source but altered attributes and records (c):
1. a.
  To assign the attributes to the time series a unique key in every table is necessary. As time series and attributes in the source data were sorted in different folders by country, the well ID and country can be used to merge the data. Consequently, duplicate records with duplicate ID from the same country are both deleted as it is not feasible to resolve which IDs should be assigned to each time series.
2. b.
  Furthermore, wells with the same coordinates, starting date, ending date and mean water table under two different IDs were identified as duplicates. In this case, only one well record was removed.
3. c.
  108,328 wells of Jasechko et al.’s study¹² are part of the groundwater source dataset Groundwater Observation Repository⁵⁷. A subset of them overlaps with wells from the GGMN⁴⁷ as they come from the same original data provider. However, the time series published by Jasechko et al.¹² were modified. That means that they are not identical with the original data from the provider. Well coordinates were truncated, different well ID’s were used and the time series were aggregated to a yearly resolution. Thus, the mean water table as well as the start and end date could deviate from the duplicate. For this reason, the detection method in (b), above, was not applicable for those cases. To detect and remove these duplicates, the latitude and longitude coordinates were rounded to one decimal place and the mean water table was rounded to zero decimal places. Afterwards, duplicates within these three parameters that originate from different organizations (data origin) were filtered. Within this subset, all duplicates originating from Jasechko et al.¹² were removed (9,712 wells). We rounded the coordinates and the mean water table very strongly because we would rather remove more time series that are no duplicates than keep real duplicates. This rounding was only applied to find duplicates and is not applied to the final dataset.
13.
Checking for coordinates outside the plausible range

The coordinates of the well location are projected in WGS 84. In this coordinate system, the latitude must be between −90° and 90°, and the longitude between −180° and 180°. We searched for outliers to remove them from the dataset because a manual data correction is not feasible for such a large dataset. No outliers were found.
14.
Removing empty attribute columns

34 columns that were either empty or contained only one test entry were removed.
15.
Trimming time series based on kept attributes

Time series whose well attributes were lost during the preprocessing (steps 12-13) were removed.
Table 2 Overview of preprocessing steps for the groundwater time series with the number and percentage of discarded/flagged time series and the flags created per step.
Full size table
Table 3 Overview of preprocessing steps for the groundwater attributes with the number and percentage of discarded/flagged wells and the flags created per step.
Full size table

Earth system variables

The set of Earth system variables added to GROW⁵⁶ are based on a selection of processes most relevant to water table dynamics (see Table 1). To expand the groundwater time series data with these variables, data products were identified that fulfil the following criteria. (1) The data product must be published under a license that permits modification and redistribution. (2) Instead of using multiple regional data products per variable, only global data products were considered to enable comparability within GROW⁵⁶. (3) When choosing between different data products, preference was given to data products based on the following priorities (in decreasing order): Greater temporal coverage, outperforming comparable data products as indicated in the literature, higher temporal resolution, and higher spatial resolution. (4) Furthermore, variables and data products were chosen that could either be downloaded automatically or did not need extensive preprocessing.

In total, thirty-six Earth system variables were selected to complement the groundwater attributes and time series. The added Earth system variables were derived from observation-based data, reanalysis data, or modelled data. Table 4 (attributes) and Table 5 (time series) give an overview of every variable and its source. Additional information about data access, spatial or temporal resolution, original units, and record coverage is provided in the Supplements Tables S.2-1, S.2-2. The variables are organized by the six Earth system components: Atmosphere, Hydrosphere, Geosphere, Biosphere, Cryosphere, and Anthroposphere (Table 1).

Table 4 Overview of all added Earth system attributes (no temporal dimension), their unit and data source.

Full size table

Table 5 Overview of all added time-varying Earth system variables, their unit, and data source.

Full size table

It is desirable to associate groundwater time series with precipitation and potential evapotranspiration records as comprehensive as possible as these are widely recognized to be the most important, large-scale, non-anthropogenic variables driving groundwater dynamics^12,45,48,54. To increase temporal coverage and account for some uncertainty in precipitation and potential evapotranspiration data products, two complementary products per variable were added to GROW⁵⁶, each one as separate column. For the MSWEP 3-hour precipitation data product⁷⁰ multiple precipitation data sources (gauge-measurements, satellite, or reanalysis) were merged. Per location that data source with the highest quality was selected. MSWEP has a high temporal resolution and can cover the most recent groundwater records as it is almost updated in real-time with a latency of 3 hours. Following this, we selected this product due to its high data quality standards, high temporal resolution, and coverage of the most recent records. However, it does not cover records before 1979. Early groundwater records are therefore supplemented with monthly precipitation data from GPCC⁷¹, which can cover records before 1901 (begins in 1891) and have a higher spatial resolution (0.25°) in comparison to other products that start before 1901 like Observed Land Surface Precipitation Data (2.5°)⁷². Both datasets cover 99.99% of the groundwater time series, while MSWEP only achieves a record coverage of 98% (Supplements Table S.2-2). Regarding potential evapotranspiration, the ERA-5 Land dataset⁴³ covers the longest time span (1950-now) but is known to overestimate potential evapotranspiration^73,74, especially for arid regions⁴³. To account for this uncertainty, we included another product derived from a different data source. The Global Land Evaporation Amsterdam Model (GLEAM4) was selected because it calculates potential evapotranspiration with Penman’s equation⁷⁵ unlike ERA5-Land. Latter’s potential evapotranspiration is simulated with a land surface model ‘for agricultural land as if it is well watered and assuming that the atmosphere is not affected by this artificial surface condition’⁴³.

To merge Earth system variables with each groundwater well, the value of the raster pixel or polygon containing the well’s location is extracted and associated with that well. This means that the variables are not measured at the explicit location of a well, but are, due to the nature of raster or polygon data, averaged over a certain region. With this in mind, the results should be understood as local to regional characteristics, representative for the respective spatial resolutions of the Earth system variables (with a spatial resolution range from 3 arc seconds to 0.5°, Supplements Tables S.2-1, S.2-2). Since the well coordinates are projected in WGS 84, the used data products were reprojected to WGS 84 if they were initially available in a different coordinate system. The static variables (no temporal dimension) were added as columns in the attributes table. The time-varying variables were first aggregated to the temporal scale of each individual groundwater time series and merged by ID and date to the time series table.

Some variables were only available at monthly (precipitation from GPCC) or yearly resolutions (water withdrawal, land use fractions and days per year with snow cover). We include these data also for daily and monthly time series as they can be used to split the dataset based on thresholds. Therefore, these products are combined with daily [monthly] groundwater data so that each day [month] within a specific month [year] is matched with the corresponding value for that month [year].

The time series with different temporal resolutions are stored in a single table, all with the same unit of quantity (mm/year) for parameters such as precipitation. This enables the straightforward derivation of further aggregations across different time series resolutions, subsets, and statistics, regardless of the temporal resolution. Because 85% of the time series have a yearly resolution, quantitative units are provided for an annual period (mm/year). This means that monthly and daily time series also have units of mm/year.

In the following, we briefly describe the processing steps needed for a few variables that require additional processing:

The aquifer type [porous, fractured, porous/fractured, karst and water_body] was estimated based on the World Karst Aquifer Map (WOKAM)⁷⁶ and the GLiM⁷⁷. Wells, which are located in a WOKAM-karstifiable region, are assigned ‘karst’ as aquifer type. For the rest of the wells the following classification based on GLiM rock types was applied. The aquifer type is porous when the rock type is unconsolidated sediments (GLiM class: su). As aquifers in mixed sedimentary rocks and siliciclastic sedimentary rocks (GLiM class: sm,ss) can be porous or fractured, wells within those rock types are assigned ‘porous/fractured’. The class ‘water_body’ is directly transferred from GLiM’s class ‘water bodies’. The rest is classified as 'fractured'.
Drainage density was calculated by dividing the sum of all river lengths in a catchment by the area of that catchment. HydroRIVERS⁷⁸ and BasinATLAS (level 9)⁴⁴ were utilized for that purpose. Both datasets were reprojected into the global metric coordinate system World-Eckert-IV before calculating the fraction of river lengths sum per basin area.
The (average) days per year with snow cover were derived from the daily ERA5-Land^43,79 snow cover data. For each year and each well, the number of days with snow cover >0 m was calculated and added to the time series table. The variable ‘days per year with snow cover’ was set to NA if any data were missing for that year. Otherwise, missing values would have been interpreted as 0 days, which would have biased the results. The multi-year average was added to the attributes table.
The main land use type in the attributes table is that land use whose average fraction over time is the highest.

Data Records

The final GROW dataset⁵⁶ consists of two files: A table containing time series data and a table with static attribute data. Both can be downloaded as either a comma-separated values (CSV) or parquet file from Zenodo: https://doi.org/10.5281/zenodo.15149480 and used under a Creative Commons Attribution Non-Commercial ShareAlike 4.0 International License (CC-BY-NC-SA 4.0). Some groundwater time series and Earth system variables are published under an individual license, but none of them is more restrictive than CC-BY-NC-SA 4.0. Please note the full license information provided in the Readme file on Zenodo. A dataset documentation on Zenodo (Readme file) provides descriptions of every variable/column for both tables. To subset the data, the attributes table can be filtered based on user-specified criteria, and afterwards the time series table can be filtered for the remaining well IDs in the attributes table. A total of 34 data flags are available to help with this filtering (Table 6). With data flags like the average water table (‘groundwater_mean_m’), the number of values that were aggregated to one water table record (‘aggregated_from_n_values’) or water table records that were detected to be potential outliers/change points (‘outliers_change_points’), users can select data based on their individual (quality) needs. For example, a data subset is possible in which yearly data was either not aggregated (always the first day of the year) or aggregated from at least four values. To provide another example, a user can decide to drop all time series that contain outliers and change points (7% of the time series) or plateaus (11% of the time series). A best-practice script for data selection can be viewed on GitHub (https://github.com/EarthSystemModelling/GROW/blob/main/usage_example.py) or Zenodo: https://doi.org/10.5281/zenodo.15149480.

Table 6 List of all data flags that are added to the time series and attributes table.

Full size table

Data Overview

A map showing the global distribution of the wells included in GROW⁵⁶ (Fig. 1a) highlights the geographic patterns in data availability. This mapping reveals a bias to locations within North America (51% of the wells), India (17% of the wells), Europe (13% of the wells), and Australia (10% of the wells), which contain 91% of the data records (Supplements Figure S.1–6). With a median depth to groundwater of 8 m, GROW⁵⁶ represents shallow groundwater (here defined with depth to groundwater below 10 m like in Cuthbert et al.⁵⁴). 34% of the time series are between 10 and 19 years long (70,302). 17% of the time series are 20 years and longer (34,504) with a maximum length of 135 years. Time series of 20 years and longer are more common to have a yearly resolution (91%) compared to all time series (85%). On the other hand, short time series of 1 to 4 years have the highest fraction of daily (9%) and monthly (14%) time series among the length classes (Fig. 1d).

The characteristics of the Earth system variables in GROW⁵⁶ differ from their global distributions (Fig. 1e,f). The global distributions are derived from all pixel values in the respective raster data with area-weighting to correct for the area distortion of the WGS 84 coordinate system. The area-weighting is described in more detail in the Supplements section S2. Based on the Koeppen-Geiger climate classification⁸⁰, the wells in GROW⁵⁶ are disproportionately located in temperate (55%–80% in GROW⁵⁶ vs. 25% globally) and arid (9%–28% in GROW⁵⁶ vs. 17% globally) climates (Fig. 1e). The main land use in GROW⁵⁶ is characterized by a higher anthropogenic influence compared to the global land use distribution (Fig. 1f). While the proportion of urban areas is below 1% globally, it is very prominent in the yearly resolution data with 15%. Compared to global data, the proportion of agriculturally used land (pastures and cropland) is higher in GROW⁵⁶ by 23%, 26%, and 32% in the daily, monthly, and yearly data. The wells are located at lower elevations (median ground elevation of 105 m in comparison to 366 m globally; Supplements Figure S.2-3). The distributions of the rest of the Earth system variables in GROW⁵⁶ compared to their global distributions are given in the Supplements section S2.

Technical Validation

Quality control of groundwater data

The preprocessing of the groundwater time series ensures that the data are harmonized, gap-limited, and checked for potential measurement errors, incorrect coordinates, and duplicates. In total, 47,104 of 251,396 time series (19%) were discarded due to not meeting the quality criteria (Tables 2, 3). 9% of the time series were rejected because they were empty or contained only one record. The largest data loss in the attribute tables was due to duplicates originating from Jasechko et al.¹² (4%). Examples of discarded time series are given in the Supplements section S1. Of the remaining groundwater time series, 13,751 contain gaps (7% of all time series in GROW⁵⁶). 34% of these time series are in a monthly resolution, and 36% are in a daily resolution. Therefore, higher-resolution time series were found to possess incomplete records more often compared to time series with yearly time steps. The median gap fraction of the incomplete time series is 1.7%. Yearly data have the highest median gap fraction (4.2%), and daily data have the lowest (0.1%). The median gap lengths for daily, monthly and yearly data are 1 day, month, or year long. The total gap fraction and single gap lengths per well do not exceed the defined thresholds. A figure with the distribution of the gap fraction and mean gap length per time series and temporal resolution is displayed in the Supplements Figure S.1-7.

Uncertainty sources

The observed groundwater time series as well as the Earth system variables contain data gaps and uncertainties that propagate to GROW⁵⁶. In the following, we give an overview with examples of uncertainty sources in the dataset. Uncertainties in groundwater observations arise from a lack of knowledge about metadata, the exact measurement time, and spatial data bias. Metadata such as measurement method (not reported at all), borehole construction details (only drilling depth provided for 5% of wells), and aquifer confinement (available for 6%), are largely incomplete across the dataset. Dependent on the measurement method, uncertainties associated with the groundwater data and the susceptibility to certain measurement errors differ. Reading precision is lower for manual measurements, which are mainly performed with dip meters. On the other hand, automated pressure measurements hold the risk of sensor failures and internal clock drifts. Following, errors like spikes, water table drifts and erroneous timestamps are more likely with automated measurements⁶⁴. The point of head error is further influenced by construction details like screen length or borehole inclination⁶⁴. As the measurement methods and construction details, except for drilling depth and top of the well elevation, are unknown for the GROW wells, the quality of the groundwater data cannot be assessed and compared⁶⁴. The lack of knowledge about confinement hinders a correct interpretation of the water table records as water table head. Further, it is impossible to determine whether water table records above the ground are measurement errors or belong to artesian wells. Knowing whether an aquifer is confined or unconfined is important for groundwater analysis as many theoretical concepts only apply to either unconfined or confined conditions. Other than that, the interpretation of water table changes should be adapted in regard to confinement. For example, storage changes lead initially to stronger head declines in confined compared to unconfined aquifers⁸¹. As the groundwater time series are provided by different data holders, the preprocessing of the data is challenging. For example, it is unclear whether the timestamp of a record indicates the exact day of measurement or an already aggregated mean for the respective period (daily, monthly, or yearly). Due to a lack of groundwater observations, the dataset is not able to represent the globally possible value ranges of all variables (Fig. 1). Groundwater observations are recorded at relatively shallow depths less than 10 m and in places with high anthropogenic impact (agriculture as primary land use) because wells are built where water is cost-effectively available (shallow), needed (high anthropogenic impact), and where funds for recording and sharing observations are available (wealthy countries)⁸². Groundwater time series are not available for many water-scarcity hotspots like Pakistan, northeastern China and the Middle East^11,13. This suggests that the calculated proportion of decreasing trends in GROW⁵⁶ (22%) is likely underestimating the actual global decreasing trends. On the other hand, the high anthropogenic impact might skew the trend direction to more decreasing trends. Less data is available for processes related to high altitudes, tropics, polar zones, snow accumulation, glaciers, and permafros. Therefore, it could be more difficult to achieve statistical significance in an analysis regarding those processes. A bias of in situ data towards an anthropogenic impact and high-income countries is a common problem in hydrological research^83,84. Along comes a higher proportion of research and data sets originating from regions with more in situ data and lower risk of adverse impacts. This inhibits a truly global perspective^36,83. GROW⁵⁶ makes the unavailability of in situ measurements in certain regions and further implications on groundwater analysis visible.

An additional source of uncertainty in GROW⁵⁶ originates from the Earth system variables that were combined with the observed groundwater data. Uncertainties in the modelled variables arise from the oversimplification of Earth system processes and differing assumptions on how they are implemented^20,24,33,34. Especially, human impacts are complex and difficult to quantify^35,85. This is emphasized for the water withdrawal product in GROW⁵⁶, which was derived from a multi-model ensemble⁸⁶. Wada et al.⁸⁶ found substantial differences between the three global models simulating water withdrawal trends for the industrial sector. Among other things, the authors attributed the discrepancy to differing model approaches (e.g., the distinction between electricity and manufacturing water use). Another source of uncertainty is input data^31,33,85,87. For example, GLEAM4 primarily uses reanalysis and satellite data as simulation input⁷⁵, which pass their uncertainties on to the model output. Finally, a lack of observations^48,88, the non-applicability of local methods that are based on expert knowledge, and even computational limits^29,88 inhibit comprehensive calibration and evaluation methods. Notably, field measurements of actual evapotranspiration⁸⁹ and actual water withdrawal⁸⁶ are scarce. There are also no satellite observations as both parameters cannot be directly measured with the sensors^89,90.

There are other sources of uncertainty coming from Earth system variables that are not modelled. Satellite-based data products, such as AVHRR NDVI⁹¹ or MERIT DEM⁹², have the advantage of wide temporal and spatial coverage⁹³. However, satellite observations are no direct point measurements but an average characteristic over a particular spatial resolution. In this sense, the MERIT DEM lacks detailed topography, such as small lakes or levees⁹². Although reanalysis products (e.g., ERA5-Land⁹⁴) seem less uncertain than modelled data because they have incorporated observations, they contain their own pitfalls: They are sensitive to data availability, measurement errors, and spatial as well as temporal representativeness. Therefore, data-scarce regions and times are less reliable⁹³. The same applies to gauge-based data products like GPCC, where observation gaps are interpolated⁹³. Uncertainties in categorical data like geological maps (e.g., GLiM) are not characterized by magnitude of deviation but by ambiguities in group classification. In that case, wrong mapping and deviations between regional geological maps are examples of uncertainty sources⁹⁵. Furthermore, structural details like fault zones are neglected on the global scale⁹⁶, and there is no mapping of vertical heterogeneity^77,96,97.

Lastly, the combination of point observations with gridded data creates its own uncertainty: Commensurability errors occur when point observations are compared with grid values that represent several square kilometers. Naturally, extreme events are often not fully captured in these contexts. The greater the mismatch in (spatial) scale, the higher the commensurability error^29,93.

Here, we present a first static version of GROW⁵⁶, marking the commencement of a long-term initiative. In this initiative, we envision to develop a dynamic version of the dataset⁵⁶ that continually grows and benefits from data additions of the global community. The addition of groundwater time series, especially from regions that are underrepresented, reduces the spatial bias and increases global representativeness in such a dataset. We invite the global groundwater community to contribute data to IGRAC’s GGMN⁴⁷ and Groundwater Observations Repository⁵⁷. It is currently the only dynamically growing global dataset of groundwater time series that offers the option to add a comprehensive set of metadata.

Usage Notes

We envision GROW⁵⁶ to be used for integrated groundwater analysis that addresses the drivers behind groundwater dynamics. Here, we provide guidance and usage examples. The groundwater information is either given as water table depth or level dependent on the time series. To derive both for the same time series, the ground elevation from either the provider or the MERIT DEM⁹² can be used. This is described in more detail in the Readme file on Zenodo (https://doi.org/10.5281/zenodo.15149480).

Although the gaps in the dataset incentivize analysis of well-represented environmental settings (e.g., arid or temperate climates and low elevations; see Fig. 1), we encourage the inclusion of underrepresented settings in analyses using GROW⁵⁶. To reduce bias, users could trim the dataset to approximate the global distribution of one or multiple Earth system variables. The descriptive statistics comparing the distribution of the Earth system variables in GROW⁵⁶ with their respective global distributions provide a reference for this (Supplements section S2).

GROW⁵⁶ can help to understand groundwater dynamics within local to regional contexts captured by the data products from which the Earth system variables originate (see Tables 4, 5). Site-specific analysis should be carried out with caution. For example, localized pumping conditions cannot be investigated with the Earth system variables ‘Total water withdrawal for domestic and industrial use’. They represent modelled total water withdrawals in the 0.5° raster pixel in which the well is located. With this dataset and the limited metadata, actual human disturbance in GROW⁵⁶ can only be derived from the groundwater time series. We are not aware of an algorithm that can adequately do that for global-scale data. A method for regional data was presented by Lehr & Lischeid⁶⁰, but local knowledge and visual control were still necessary. We suggest using land use and the regional total water withdrawal to pre-filter data for further analysis. One can suspect that wells with the land use class ‘forests_natural_vegetation’ and 0 m³ regional water withdrawal are more likely to be undisturbed.

A benefit of the dataset is that 36 explanatory variables are provided, which originate from different geodata formats and resolutions. These variables can be used for clustering which may result in groups that resemble hydrological response units⁹⁸ for groundwater. Such a clustering can potentially explain different groundwater behavior around the world, for example, as in Nolte et al.⁵⁹ and Chávez et al.⁵¹.

GROW⁵⁶ can additionally be used for model calibration and validation. We do not recommend aggregating the groundwater time series per raster cell for calibration or validation because 1) the aquifers might not be structurally connected, and 2) this creates time series that do not exist. For example, different trends and extreme points may get neutralized. The latter is especially probable as groundwater dynamics can vary greatly locally^51,59. Additionally, 3) aggregated time series might have different temporal resolutions and lengths. Instead, comparing the local groundwater observations with modelled raster data has the potential to reveal how well a model can represent certain wells within that raster.

Data availability

The final GROW dataset⁵⁶, containing an attributes table and a time series table, as well as a dataset documentation can be downloaded on Zenodo: https://doi.org/10.5281/zenodo.15149480 as either a CSV or parquet file. GROW⁵⁶ itself is published under Creative Commons Attribution Non-Commercial ShareAlike 4.0 International License (CC-BY-NC-SA 4.0). An example of use that demonstrates how the data are subset and prepared is given on GitHub (https://github.com/EarthSystemModelling/GROW/blob/main/usage_example.py) and Zenodo (https://doi.org/10.5281/zenodo.15149480).

Code availability

The version of the code used for the preparation of the dataset⁵⁶ is available from Zenodo (https://doi.org/10.5281/zenodo.15149480) and can be viewed and downloaded on GitHub: https://github.com/EarthSystemModelling/GROW). Moreover, a usage example is provided in both repositories.

References

Margat, J. & van der Gun, J. Groundwater around the World: A Geographic Synopsis. (CRC Press, Boca Raton London New York Leiden, 2013).
Saccò, M. et al. Groundwater is a hidden global keystone ecosystem. Glob. Change Biol. 30 (2024).
United Nations (UN). The United Nations World Water Development Report 2022: Groundwater Making the Invisible Visible. (UNESCO, Paris, 2022).
Sutanudjaja, E. H. et al. PCR-GLOBWB 2: a 5 arcmin global hydrological and water resources model. Geosci. Model Dev. 11, 2429–2453 (2018).
Article ADS Google Scholar
Müller Schmied, H. et al. The global water resources and use model WaterGAP v2.2e: description and evaluation of modifications and new features. Geosci. Model Dev. 17, 8817–8852 (2024).
Article ADS Google Scholar
Rohde, M. M. et al. Groundwater-dependent ecosystem map exposes global dryland protection needs. Nature 632, 101–107 (2024).
Article ADS CAS PubMed PubMed Central Google Scholar
Kløve, B. et al. Groundwater dependent ecosystems. Part I: Hydroecological status and trends. Environ. Sci. Policy 14, 770–781 (2011).
Article Google Scholar
Howard, J. K., Dooley, K., Brauman, K. A., Klausmeyer, K. R. & Rohde, M. M. Ecosystem services produced by groundwater dependent ecosystems: a framework and case study in California. Front. Water 5, 1115416 (2023).
Article Google Scholar
Jasechko, S. & Perrone, D. Global groundwater wells at risk of running dry. Science 372, 418–421 (2021).
Article ADS CAS PubMed Google Scholar
Bierkens, M. F. P. & Wada, Y. Non-renewable groundwater use and groundwater depletion: a review. Environ. Res. Lett. 14, 063002 (2019).
Article ADS Google Scholar
Scanlon, B. R. et al. Global water resources and the role of groundwater in a resilient water future. Nat. Rev. Earth Environ. 4, 87–101 (2023).
Article ADS Google Scholar
Jasechko, S. et al. Rapid groundwater decline and some cases of recovery in aquifers globally. Nature 625, 715–721 (2024).
Article ADS CAS PubMed PubMed Central Google Scholar
Wada, Y., Wisser, D. & Bierkens, M. F. P. Global modeling of withdrawal, allocation and consumptive use of surface water and groundwater resources. Earth Syst. Dyn. 5, 15–40 (2014).
Article ADS Google Scholar
Abbott, B. W. et al. A water cycle for the Anthropocene. Hydrol. Process. 33, 3046–3052 (2019).
Article ADS Google Scholar
Minea, I., Boicu, D., Negm, A., Zelenakova, M. & Demirci, M. Editorial: Impact of climate changes on groundwater resources. Front. Environ. Sci. 13, 1557374 (2025).
Article Google Scholar
Haaf, E., Giese, M., Reimann, T. & Barthel, R. Data‐Driven Estimation of Groundwater Level Time‐Series at Unmonitored Sites Using Comparative Regional Analysis. Water Resour. Res. 59, e2022WR033470 (2023).
Article ADS Google Scholar
Kuang, X. et al. The changing nature of groundwater in the global water cycle. Science 383, 0630 (2024).
Article Google Scholar
Seo, K.-W. et al. Abrupt sea level rise and Earth’s gradual pole shift reveal permanent hydrological regime changes in the 21st century. Science 387, 1408–1413 (2025).
Article ADS CAS PubMed Google Scholar
Konikow, L. F. Contribution of global groundwater depletion since 1900 to sea-level rise: GROUNDWATER DEPLETION. Geophys. Res. Lett. 38 (2011).
Condon, L. E. et al. Global Groundwater Modeling and Monitoring: Opportunities and Challenges. Water Resour. Res. 57, e2020WR029500 (2021).
Article ADS Google Scholar
Reinecke, R. et al. Uncertainty of simulated groundwater recharge at different global warming levels: a global-scale multi-model ensemble study. Hydrol. Earth Syst. Sci. 25, 787–810 (2021).
Article ADS CAS Google Scholar
Intergovernmental Panel On Climate Change (IPCC). Climate Change 2021 – The Physical Science Basis: Working Group I Contribution to the Sixth Assessment Report of the Intergovernmental Panel on Climate Change. (Cambridge University Press, 2023).
Dubois, E. & Larocque, M. Contribution of standardized indexes to understand groundwater level fluctuations in response to meteorological conditions in cold and humid climates. J. Hydrol. 634, 131105 (2024).
Article Google Scholar
Gleeson, T., Cuthbert, M., Ferguson, G. & Perrone, D. Global Groundwater Sustainability, Resources, and Systems in the Anthropocene. Annu. Rev. Earth Planet. Sci. 48, 431–463 (2020).
Article ADS CAS Google Scholar
Mukherjee, A., Jha, M. K., Kim, K.-W. & Pacheco, F. A. L. Groundwater resources: challenges and future opportunities. Sci. Rep. 14, 28540, s41598-024-79936–5 (2024).
Venegas-Quiñones, H. L. et al. Development of Groundwater Levels Dataset for Chile since 1970. Sci. Data 11, 170 (2024).
Article PubMed PubMed Central Google Scholar
Huggins, X., Gleeson, T., Villholth, K. G., Rocha, J. C. & Famiglietti, J. S. Groundwaterscapes: A Global Classification and Mapping of Groundwater’s Large‐Scale Socioeconomic, Ecological, and Earth System Functions. Water Resour. Res. 60, e2023WR036287 (2024).
Article ADS Google Scholar
Rockström, J. et al. Safe and just Earth system boundaries. Nature 619, 102–111 (2023).
Article ADS PubMed PubMed Central Google Scholar
Gleeson, T. et al. GMD perspective: The quest to improve the evaluation of groundwater representation in continental- to global-scale models. Geosci. Model Dev. 14, 7545–7571 (2021).
Article ADS Google Scholar
Moeck, C. et al. A global-scale dataset of direct natural groundwater recharge rates: A review of variables, processes and relationships. Sci. Total Environ. 717, 137042 (2020).
Article CAS PubMed Google Scholar
Moges, E., Demissie, Y., Larsen, L. & Yassin, F. Review: Sources of Hydrological Model Uncertainties and Advances in Their Analysis. Water 13, 28 (2020).
Article Google Scholar
Tarasova, L., Gnann, S., Yang, S., Hartmann, A. & Wagener, T. Catchment characterization: Current descriptors, knowledge gaps and future opportunities. Earth-Sci. Rev. 252, 104739 (2024).
Article Google Scholar
Beven, K. J. & Cloke, H. L. Comment on “Hyperresolution global land surface modeling: Meeting a grand challenge for monitoring Earth’s terrestrial water” by Eric F. Wood et al. Water Resour. Res. 48, 2011WR010982 (2012).
Article Google Scholar
Reinecke, R. et al. Uncertainty in model estimates of global groundwater depth. Environ. Res. Lett. https://doi.org/10.1088/1748-9326/ad8587 (2024).
Article Google Scholar
Hasan, F., Medley, P., Drake, J. & Chen, G. Advancing Hydrology through Machine Learning: Insights, Challenges, and Future Directions Using the CAMELS, Caravan, GRDC, CHIRPS, PERSIANN, NLDAS, GLDAS, and GRACE Datasets. Water 16, 1904 (2024).
Article Google Scholar
Huggins, X. et al. A review of open data for studying global groundwater in social–ecological systems. Environ. Res. Lett. 20, 093002 (2025).
Article ADS Google Scholar
Addor, N. et al. Large-sample hydrology: recent progress, guidelines for new datasets and grand challenges. Hydrol. Sci. J. 65, 712–725 (2020).
Article CAS Google Scholar
Fan, Y., Li, H. & Miguez-Macho, G. Global patterns of groundwater table depth. Science 339, 940–3 (2013).
Article ADS CAS PubMed Google Scholar
Kratzert, F. et al. Caravan - A global community dataset for large-sample hydrology. Sci. Data 10, 61 (2023).
Article PubMed PubMed Central Google Scholar
Cuthbert, M. O., Gleeson, T., Bierkens, M. F. P., Ferguson, G. & Taylor, R. G. Defining Renewable Groundwater Use and Its Relevance to Sustainable Groundwater Management. Water Resour. Res. 59 (2023).
Gelsinari, S. et al. Nonstationary recharge responses to a drying climate in the Gnangara Groundwater System, Western Australia. J. Hydrol. 633, 131007 (2024).
Article Google Scholar
Wilkinson, M. D. et al. The FAIR Guiding Principles for scientific data management and stewardship. Sci. Data 3, 160018 (2016).
Article PubMed PubMed Central Google Scholar
Muñoz-Sabater, J. et al. ERA5-Land: a state-of-the-art global reanalysis dataset for land applications. Earth Syst. Sci. Data 13, 4349–4383 (2021).
Article ADS Google Scholar
Linke, S. et al. Global hydro-environmental sub-basin and river reach characteristics at high spatial resolution. Sci. Data 6, 283 (2019).
Article PubMed PubMed Central Google Scholar
Collenteur, R. A., Moeck, C., Schirmer, M. & Birk, S. Analysis of nationwide groundwater monitoring networks using lumped-parameter models. J. Hydrol. 626, 130120 (2023).
Article Google Scholar
Jemeļjanova, M. et al. Modeling hydraulic heads with impulse response functions in different environmental settings of the Baltic countries. J. Hydrol. Reg. Stud. 47, 101416 (2023).
Article Google Scholar
IGRAC. The Global Groundwater Monitoring Network (GGMN). IGRAC https://doi.org/10.58154/6Z0Y-DA34 (2025).
Berghuijs, W. R. et al. Groundwater recharge is sensitive to changing long-term aridity. Nat. Clim. Change 14, 357–363 (2024).
Article ADS Google Scholar
Fan, Y. Groundwater in the E arth’s critical zone: Relevance to large‐scale patterns and processes. Water Resour. Res. 51, 3052–3069 (2015).
Article ADS Google Scholar
Boutt, D. F. Assessing hydrogeologic controls on dynamic groundwater storage using long‐term instrumental records of water table levels. Hydrol. Process. 31, 1479–1497 (2017).
Article ADS Google Scholar
Chávez García Silva, R. et al. Multi-decadal groundwater observations reveal surprisingly stable levels in southwestern Europe. Commun. Earth Environ. 5, 387 (2024).
Article ADS Google Scholar
Rust, W., Holman, I., Bloomfield, J., Cuthbert, M. & Corstanje, R. Understanding the potential of climate teleconnections to project future groundwater drought. Hydrol. Earth Syst. Sci. 23, 3233–3245 (2019).
Article ADS Google Scholar
MacDonald, A. M. et al. Mapping groundwater recharge in Africa from ground observations and implications for water security. Environ. Res. Lett. 16, 034012 (2021).
Article ADS Google Scholar
Cuthbert, M. O. et al. Observed controls on resilience of groundwater to climate variability in sub-Saharan Africa. Nature 572, 230–234 (2019).
Article ADS CAS PubMed Google Scholar
Scanlon, B. R. et al. Linkages between GRACE water storage, hydrologic extremes, and climate teleconnections in major African aquifers. Environ. Res. Lett. 17, 014046 (2022).
Article ADS Google Scholar
Bäthge, A. et al. GROW: A Global-Scale Time Series Dataset for Groundwater Studies within the Earth System. Zenodo https://doi.org/10.5281/ZENODO.15149480 (2026).
IGRAC. Groundwater Observations Repository. Retrieved from https://ggis.un-igrac.org/view/repository/ (2025).
Wagener, T. et al. On doing hydrology with dragons: Realizing the value of perceptual models and knowledge accumulation. WIREs Water 8, e1550 (2021).
Article ADS Google Scholar
Nolte, A., Haaf, E., Heudorfer, B., Bender, S. & Hartmann, J. Disentangling coastal groundwater level dynamics in a global dataset. Hydrol. Earth Syst. Sci. 28, 1215–1249 (2024).
Article ADS Google Scholar
Lehr, C. & Lischeid, G. Efficient screening of groundwater head monitoring data for anthropogenic effects and measurement errors. Hydrol. Earth Syst. Sci. 24, 501–513 (2020).
Article ADS Google Scholar
Spearman, C. The Proof and Measurement of Association between Two Things. Am. J. Psychol. 15, 72 (1904).
Article Google Scholar
Jiang, X.-W., Cherry, J. & Wan, L. Flowing wells: terminology, history and role in the evolution of groundwater science. Hydrol. Earth Syst. Sci. 24, 6001–6019 (2020).
Article ADS Google Scholar
Lyon, S. W., Grabs, T., Laudon, H., Bishop, K. H. & Seibert, J. Variability of groundwater levels and total organic carbon in the riparian zone of a boreal catchment. J. Geophys. Res. 116, G01020 (2011).
ADS Google Scholar
Rau, G. C. et al. Error in hydraulic head and gradient time-series measurements: a quantitative appraisal. Hydrol. Earth Syst. Sci. 23, 3603–3629 (2019).
Article ADS Google Scholar
Retike, I. et al. Rescue of groundwater level time series: How to visually identify and treat errors. J. Hydrol. 605, 127294 (2022).
Article Google Scholar
Ester, M., Kriegel, H.-P., Sander, J. & Xu, X. A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise. Proc Second Int Conf Knowl Discov Data Min. KDD-96 226–231 (1996).
Hamed, K. H. & Ramachandra Rao, A. A modified Mann-Kendall trend test for autocorrelated data. J. Hydrol. 204, 182–196 (1998).
Article ADS Google Scholar
Hussain, M. & Mahmud, I. pyMannKendall: a python package for non parametric Mann Kendall family of trend tests. J. Open Source Softw. 4, 1556 (2019).
Article ADS Google Scholar
Lin, M., Lucas, H. C. & Shmueli, G. Research Commentary —Too Big to Fail: Large Samples and the p -Value Problem. Inf. Syst. Res. 24, 906–917 (2013).
Article Google Scholar
Beck, H. E. et al. MSWEP V2 Global 3-Hourly 0.1° Precipitation: Methodology and Quantitative Assessment. Bull. Am. Meteorol. Soc. 100, 473–500 (2019).
Article ADS Google Scholar
Schneider, U., Hänsel, S., Finger, P., Rustemeier, E. & Ziese, M. GPCC Full Data Monthly Version 2022 at 0.25°: Monthly Land-Surface Precipitation from Rain-Gauges built on GTS-based and Historic Data: Globally Gridded Monthly Totals. Global Precipitation Climatology Centre (GPCC) at Deutscher Wetterdienst https://doi.org/10.5676/DWD_GPCC/FD_M_V2022_025 (2022).
Dai, A., Fung, I. Y. & Del Genio, A. D. Surface Observed Global Land Precipitation Variations during 1900–88. J. Clim. 10, 2943–2962 (1997).
Article ADS Google Scholar
Singer, M. B. et al. Hourly potential evapotranspiration at 0.1° resolution for the global land surface from 1981-present. Sci. Data 8, 224 (2021).
Article PubMed PubMed Central Google Scholar
Xu, C., Wang, W., Hu, Y. & Liu, Y. Evaluation of ERA5, ERA5-Land, GLDAS-2.1, and GLEAM potential evapotranspiration data over mainland China. J. Hydrol. Reg. Stud. 51, 101651 (2024).
Article Google Scholar
Miralles, D. G. et al. GLEAM4: global land evaporation and soil moisture dataset at 0.1° resolution from 1980 to near present. Sci. Data 12, 416 (2025).
Article PubMed PubMed Central Google Scholar
Chen, Z. et al. World Karst Aquifer Map (WHYMAP WOKAM). BGR, IAH, KIT, UNESCO https://doi.org/10.25928/B2.21_SFKQ-R406 (2017).
Hartmann, J. & Moosdorf, N. Global Lithological Map Database v1.0 (gridded to 0.5° spatial resolution). PANGAEA https://doi.org/10.1594/PANGAEA.788537 (2012).
Lehner, B. & Grill, G. Global river hydrography and network routing: baseline data and new approaches to study the world’s large river systems. Hydrol. Process. 27, 2171–2186 (2013).
Article ADS Google Scholar
Copernicus Climate Change Service. ERA5-Land post-processed daily statistics from 1950 to present. ECMWF https://doi.org/10.24381/CDS.E9C9C792 (2024).
Koeppen, W. & Geiger, R. Handbuch Der Klimatologie. (Gebrüder Borntraeger, Berlin, 1936).
Alley, W. M., Reilly, T. E. & Franke, O. L. Sustainability of Ground-Water Resources. (U.S. Dept. of the Interior, U.S. Geological Survey, Denver, 1999).
Fan, Y. et al. Hillslope Hydrology in Global Change Research and Earth System Modeling. Water Resour. Res. 55, 1737–1772 (2019).
Article ADS Google Scholar
Stein, L. et al. Wealth Over Woe: Global Biases in Hydro‐Hazard Research. Earths Future 12, e2024EF004590 (2024).
Article ADS Google Scholar
Krabbenhoft, C. A. et al. Assessing placement bias of the global river gauge network. Nat. Sustain. 5, 586–592 (2022).
Article PubMed PubMed Central Google Scholar
Döll, P., Douville, H., Güntner, A., Müller Schmied, H. & Wada, Y. Modelling Freshwater Resources at the Global Scale: Challenges and Prospects. Surv. Geophys. 37, 195–221 (2016).
Article ADS Google Scholar
Wada, Y. et al. Modeling global water use for the 21st century: the Water Futures and Solutions (WFaS) initiative and its approaches. Geosci. Model Dev. 9, 175–222 (2016).
Article ADS Google Scholar
Müller Schmied, H. et al. Variations of global and continental water balance components as impacted by climate forcing uncertainty and human water use. Hydrol. Earth Syst. Sci. 20, 2877–2898 (2016).
Article ADS Google Scholar
Yoshida, T. et al. Inference of Parameters for a Global Hydrological Model: Identifiability and Predictive Uncertainties of Climate‐Based Parameters. Water Resour. Res. 58, e2021WR030660 (2022).
Article ADS Google Scholar
Ezenne, G. I., Tanner, J. L., Asoiro, F. U. & Obalum, S. E. An overview of uncertainties in evapotranspiration estimation techniques. J. Agrometeorol. 25 (2023).
Benaraba, N., Touati, F., Benyahia, S. & Yebdri, D. Jointly estimating recharge and groundwater withdrawals of the NWSAS by inverting GRACE/GRACE-FO gravity data. Hydrol. Sci. J. 67, 2215–2231 (2022).
Article Google Scholar
Vermote, E. & NOAA CDR Program. NOAA Climate Data Record (CDR) of AVHRR Normalized Difference Vegetation Index (NDVI), Version 5. NOAA National Centers for Environmental Information https://doi.org/10.7289/V5ZG6QH9 (2018).
Yamazaki, D. et al. A high‐accuracy map of global terrain elevations. Geophys. Res. Lett. 44, 5844–5853 (2017).
Article ADS Google Scholar
Sun, Q. et al. A Review of Global Precipitation Data Sets: Data Sources, Estimation, and Intercomparisons. Rev. Geophys. 56, 79–107 (2018).
Article ADS Google Scholar
Muñoz Sabater, J. et al. ERA5-land post-processed daily-statistics from 1950 to present. Copernicus Climate Change Service (C3S) Climate Data Store (CDS). https://doi.org/10.24381/cds.e9c9c792 (2024).
Article Google Scholar
Hartmann, J. & Moosdorf, N. The new global lithological map database GLiM: A representation of rock properties at the Earth surface. Geochem. Geophys. Geosystems 13, 2012GC004370 (2012).
Article Google Scholar
Gleeson, T., Moosdorf, N., Hartmann, J. & Van Beek, L. P. H. A glimpse beneath earth’s surface: GLobal HYdrogeology MaPS (GLHYMPS) of permeability and porosity. Geophys. Res. Lett. 41, 3891–3898 (2014).
Article ADS Google Scholar
Huscroft, J., Gleeson, T., Hartmann, J. & Börker, J. Compiling and Mapping Global Permeability of the Unconsolidated and Consolidated Earth: GLobal HYdrogeology MaPS 2.0 (GLHYMPS 2.0). Geophys. Res. Lett. 45, 1897–1904 (2018).
Article ADS Google Scholar
Flügel, W. Delineating hydrological response units by geographical information system analyses for regional hydrological modelling using PRMS/MMS in the drainage basin of the River Bröl, Germany. Hydrol. Process. 9, 423–436 (1995).
Article ADS Google Scholar
Wu, W.-Y. et al. Divergent effects of climate change on future groundwater availability in key mid-latitude aquifers. Nat. Commun. 11, 3710 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Cuthbert, M. O. et al. Global patterns and dynamics of climate–groundwater interactions. Nat. Clim. Change 9, 137–141 (2019).
Article ADS Google Scholar
West, C., Rosolem, R., MacDonald, A. M., Cuthbert, M. O. & Wagener, T. Understanding process controls on groundwater recharge variability across Africa through recharge landscapes. J. Hydrol. 612, 127967 (2022).
Article Google Scholar
Barron, O. V. et al. Climatic controls on diffuse groundwater recharge across Australia. Hydrol. Earth Syst. Sci. 16, 4557–4570 (2012).
Article ADS Google Scholar
Jasechko, S. et al. The pronounced seasonality of global groundwater recharge. Water Resour. Res. 50, 8845–8867 (2014).
Article ADS Google Scholar
Gnann, S. et al. The Influence of Topography on the Global Terrestrial Water Cycle. Rev. Geophys. 63, e2023RG000810 (2025).
Article ADS Google Scholar
Jasechko, S., Kirchner, J. W., Welker, J. M. & McDonnell, J. J. Substantial proportion of global streamflow less than three months old. Nat. Geosci. 9, 126–129 (2016).
Article ADS CAS Google Scholar
Winter, T. C. Ground Water and Surface Water: A Single Resource. (US Geological Survey, Denver, Col., 1999).
Hartmann, A., Goldscheider, N., Wagener, T., Lange, J. & Weiler, M. Karst water resources in a changing world: Review of hydrological modeling approaches: KARST WATER RESOURCES PREDICTION. Rev. Geophys. 52, 218–242 (2014).
Article ADS Google Scholar
Kim, J. H. & Jackson, R. B. A Global Analysis of Groundwater Recharge for Vegetation, Climate, and Soils. Vadose Zone J. 11, vzj2011.0021RA (2012).
Article Google Scholar
Winter, T. C. The Role of Ground Water in Generating Streamflow in Headwater Areas and in Maintaining Base Flow. JAWRA J. Am. Water Resour. Assoc. 43, 15–25 (2007).
Article ADS Google Scholar
Walvoord, M. A. & Kurylyk, B. L. Hydrologic Impacts of Thawing Permafrost—A Review. Vadose Zone J. 15, 1–20 (2016).
Article Google Scholar
Vincent, A., Violette, S. & Aðalgeirsdóttir, G. Groundwater in catchments headed by temperate glaciers: A review. Earth-Sci. Rev. 188, 59–76 (2019).
Article ADS Google Scholar
Hood, J. L. & Hayashi, M. Characterization of snowmelt flux and groundwater storage in an alpine headwater basin. J. Hydrol. 521, 482–497 (2015).
Article ADS Google Scholar
Burgess, S. S. O., Adams, M. A., Turner, N. C., White, D. A. & Ong, C. K. Tree roots: conduits for deep recharge of soil water. Oecologia 126, 158–165 (2001).
Article ADS PubMed Google Scholar
Ilstedt, U. et al. Intermediate tree cover can maximize groundwater recharge in the seasonally dry tropics. Sci. Rep. 6, 21930 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Wang, Z., Yang, Y., Chen, G., Wu, J. & Wu, J. Variation of lake-river-aquifer interactions induced by human activity and climatic condition in Poyang Lake Basin, China. J. Hydrol. 595, 126058 (2021).
Article Google Scholar
Fan, Y., Miguez-Macho, G., Jobbágy, E. G., Jackson, R. B. & Otero-Casal, C. Hydrologic regulation of plant rooting depth. Proc. Natl. Acad. Sci. 114, 10572–10577 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Batelaan, O., De Smedt, F. & Triest, L. Regional groundwater discharge: phreatophyte mapping, groundwater modelling and impact analysis of land-use change. J. Hydrol. 275, 86–108 (2003).
Article ADS Google Scholar
Karger, D. N. et al. Climatologies at high resolution for the earth’s land surface areas CHELSA V2.1 (current). EnviDat https://doi.org/10.16904/ENVIDAT.228.V2.1 (2021).
Meybeck, M., Kummu, M. & Dürr, H. H. Global hydrobelts and hydroregions. PANGAEA https://doi.org/10.1594/PANGAEA.806957 (2013).
Amatulli, G. Geomorpho90m - Global high-resolution geomorphometry layers. PANGAEA https://doi.org/10.1594/PANGAEA.899135 (2019).
Huscroft, J., Gleeson, T., Hartmann, J. & Börker, J. Compiling and mapping global permeability of the unconsolidated and consolidated Earth: GLobal HYdrogeology MaPS 2.0 (GLHYMPS 2.0). Borealis https://doi.org/10.5683/SP2/TTJNIU (2018).
Article Google Scholar
Gleeson, T. GLobal HYdrogeology MaPS (GLHYMPS) of permeability and porosity. Borealis https://doi.org/10.5683/SP2/DLGXYO (2018).
Article Google Scholar
Simons, G., Koster, R. & Droogers, P. HiHydroSoil v2.0 - A high resolution soil map of global hydraulic properties. https://www.futurewater.nl/wp-content/uploads/2020/10/HiHydroSoil-v2.0-High-Resolution-Soil-Maps-of-Global-Hydraulic-Properties.pdf (2020).
Huggins, X. et al. Data from: Overlooked risks and opportunities in groundwatersheds of the world’s protected areas. Borealis https://doi.org/10.5683/SP3/P3OU3A (2023).
Article Google Scholar
Volkholz, J. & Ostberg, S. ISIMIP3a landuse input data. ISIMIP Repository https://doi.org/10.48364/ISIMIP.571261.3 (2024).
Article Google Scholar
Huggins, X., Gleeson, T., Villholth, K. G., Rocha, J. C. & Famiglietti, J. S. Data and code from: Groundwaterscapes: A global classification and mapping of groundwater’s large-scale socioeconomic, ecological, and Earth system functions. Borealis https://doi.org/10.5683/SP3/MFYCWV (2024).
Article Google Scholar
Vermote, E. & NOAA CDR Program. NOAA Climate Data Record (CDR) of VIIRS Normalized Difference Vegetation Index (NDVI), Version 1. NOAA National Centers for Environmental Information https://doi.org/10.25921/GAKH-ST76 (2022).
Wada, Y. et al. ISIMIP3a water abstraction input data. ISIMIP Repository https://doi.org/10.48364/ISIMIP.228996 (2022).

Download references

Acknowledgements

IG acknowledges funding from the European Research Council (ERC) Starting Grant (GROW: No. 10104110). TW acknowledges support from the Alexander von Humboldt Foundation in the framework of the Alexander von Humboldt Professorship endowed by the German Federal Ministry of Educationand Research (BMBF). GROW⁵⁶ contains modified Copernicus Climate Change Service information 2024. Neither the European Commission nor ECMWF is responsible for any use that may be made of the Copernicus information or data it contains. The authors are grateful to the ISIMIP3a project (https://protocol.isimip.org/#/ISIMIP3a) that provided multi-model ensemble means of domestic and industrial water withdrawal and the land use fractions. Further, we want to thank Annika Nolte for her methodical groundwork, including the provision of her Python scripts, and insights on the use of the DBSCAN algorithm on groundwater time series, Sara Nazari for providing data that were used in earlier stages of GROW⁵⁶ and her advice on data products and data processing, Lina Stein for her advice on data products and an early paper draft, and Emmanuel Nyenah for his advice on data processing.

Funding

Open Access funding enabled and organized by Projekt DEAL.

Author information

Authors and Affiliations

Institute of Geography, Johannes Gutenberg-University Mainz, Mainz, Germany
Annemarie Bäthge & Robert Reinecke
International Groundwater Resources Assessment Centre (IGRAC), Delft, The Netherlands
Claudia Ruz Vargas
Research Area 4: ‘Simulation & Data Science’, Leibniz Centre for Agricultural Landscape Research, Müncheberg, Germany
Gunnar Lischeid
Institute of Environmental Science and Geography, University of Potsdam, Potsdam, Germany
Gunnar Lischeid & Thorsten Wagener
Department Water Resources and Drinking Water, Eawag, Dübendorf, Switzerland
Raoul Collenteur
School of Earth and Environmental Sciences, Cardiff University, Cardiff, UK
Mark Cuthbert
Department of Hydrogeology, Helmholtz Center for Environmental Research - UFZ, Leipzig, Germany
Jan Fleckenstein
Institute of Engineering Hydrology and Water Resources Management, Ruhr University Bochum, Bochum, Germany
Martina Flörke
Earth Systems and Global Change Group, Wageningen University and Research, Wageningen, the Netherlands
Inge de Graaf
Chair of Hydrology, University of Freiburg, Freiburg, Germany
Sebastian Gnann
Institute of Groundwater Management, Technical University of Dresden, Dresden, Germany
Andreas Hartmann
Institute for Resources, Environment, and Sustainability, University of British Columbia, Vancouver, BC, Canada
Xander Huggins
High Meadows Environmental Institute, Princeton University, Princeton, NJ, USA
Xander Huggins
Stockholm Resilience Centre, Stockholm University, Stockholm, Sweden
Xander Huggins
Leibniz Centre for Tropical Marine Research (ZMT), Bremen, Germany
Nils Moosdorf
Institute of Geosciences, Kiel University, Kiel, Germany
Nils Moosdorf
Biological and Environmental Science and Engineering Division, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
Yoshihide Wada

Authors

Annemarie Bäthge
View author publications
Search author on:PubMed Google Scholar
Claudia Ruz Vargas
View author publications
Search author on:PubMed Google Scholar
Gunnar Lischeid
View author publications
Search author on:PubMed Google Scholar
Raoul Collenteur
View author publications
Search author on:PubMed Google Scholar
Mark Cuthbert
View author publications
Search author on:PubMed Google Scholar
Jan Fleckenstein
View author publications
Search author on:PubMed Google Scholar
Martina Flörke
View author publications
Search author on:PubMed Google Scholar
Inge de Graaf
View author publications
Search author on:PubMed Google Scholar
Sebastian Gnann
View author publications
Search author on:PubMed Google Scholar
Andreas Hartmann
View author publications
Search author on:PubMed Google Scholar
Xander Huggins
View author publications
Search author on:PubMed Google Scholar
Nils Moosdorf
View author publications
Search author on:PubMed Google Scholar
Yoshihide Wada
View author publications
Search author on:PubMed Google Scholar
Thorsten Wagener
View author publications
Search author on:PubMed Google Scholar
Robert Reinecke
View author publications
Search author on:PubMed Google Scholar

Contributions

A.B.: conceptualization, investigation, methodology, data curation, validation, visualization, writing – original draft, writing – review and editing. R.R.: supervision, conceptualization, preparing the original draft, writing, review, and editing. C.R.V.; G.L.; N.M.: conceptualization, methodology, writing – review and editing. RC: conceptualization, advertising, writing- review and editing. M.C.; J.F., I.G.; S.G.; A.H.; X.H.; Y.W.; T.W.: conceptualization, writing – review and editing. M.F.: writing – review and editing.

Corresponding author

Correspondence to Annemarie Bäthge.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information for 'A Global-Scale Time Series Dataset for Groundwater Studies within the Earth System' (download PDF )

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Bäthge, A., Vargas, C.R., Lischeid, G. et al. A Global-Scale Time Series Dataset for Groundwater Studies within the Earth System. Sci Data 13, 401 (2026). https://doi.org/10.1038/s41597-026-06966-1

Download citation

Received: 07 July 2025
Accepted: 24 February 2026
Published: 09 March 2026
Version of record: 17 March 2026
DOI: https://doi.org/10.1038/s41597-026-06966-1