Background & Summary

Isotope hydrology uses stable and radioactive isotopes —both naturally occurring and artificially produced— as tools to characterize and understand water dynamics within the hydrological cycle. Stable isotopes are naturally incorporated in the water molecule, and their isotopic abundance varies with the occurrence of fractionation and mixing processes. Consequently, they can be used as a traceable fingerprint to investigate specific processes related to the quantity and quality of water resources1,2,3,4. These applications include assessing water balance5,6,7,8,9, determining aquifer recharge origin10,11,12,13, evaluating water mixing processes14,15,16,17,18, estimating water residence times19,20,21, understanding groundwater-surface water interactions22,23,24,25, identifying anthropogenic impacts26,27,28,29, and assessing the vulnerability and sanitation of water resources30,31,32.

Previous isotope hydrology studies conducted in Cuba have primarily focused on specific case studies related to groundwater salinization, mainly in the western region of the country33,34,35,36,37. However, descriptions of meteoric waters are limited in both spatial and temporal domains, due to heterogeneous sampling conditions in terms of frequency and collection methods. As a result, the local meteoric water lines (LMWLs) reported exhibit considerable discrepancies, as a result of fragmented time series and spatial limitations35.

Here, we present a novel and unique dataset, derived from a nationwide study in Cuba, aiming to characterize the isotopic composition of meteoric waters and evaluate the factors influencing its spatiotemporal variability across the island. It encompassed precipitation data from monthly composite samples (N = 526) between January 2017 and December 2021, as well as event-based samples (N = 111) from July 2019 to August 2021. The event-based samples included regular (non-cyclonic) precipitation events and those associated with two tropical cyclones (TCs) that made landfall in Cuba: tropical storm Elsa (July 2021) and hurricane Ida (August 2021).

Methods

Study area

Cuba, the largest archipelago in the Insular Caribbean, is surrounded by the waters of the North Atlantic Ocean and the Caribbean Sea (Fig. 1). The climate is influenced by high solar radiation, trade winds, and high humidity from the North Atlantic Subtropical Anticyclone38. According to the Köppen-Geiger classification, the predominant climate is tropical savanna (Aw), followed by tropical monsoon (Am) and tropical rainforest (Af)39. The climate regime is represented by two main seasons: a rainy season (from May to October) and a dry season (from November to April). Prevailing winds are from the ENE throughout the year. Due to its location, Cuba is affected by both tropical and extra-tropical circulations, with the most significant changes linked to disturbances in the tropical circulation, such as easterly waves and TCs, occurring primarily between June and November40,41.

Fig. 1
Fig. 1The alternative text for this image may have been generated using AI.
Full size image

The inset shows the location of Cuba in the Insular Caribbean. Sampling stations are represented by bold-empty diamonds and numbered as described in the table, as part of the National Network for Isotope Hydrology in Precipitations (RNHIP).

Sampling

Rainfall monitoring was conducted by the National Network for Isotope Hydrology in Precipitations (RNHIP), integrated within the supporting framework of 21 stations from the Meteorological Service Network of the National Institute of Meteorology in Cuba (INSMET), and 3 stations from the National Network for Environmental Radiological Surveillance (Fig. 1). Sampling strategies included the collection of: 1) monthly composite samples (nationwide) using PALMEX RS-2 collectors42, from January 2017 to December 2021; and 2) event-based samples (in Cienfuegos) using a set of ground fixed exchangeable passive collectors to cover full event and intra-event sampling (30 min intervals), from July 2019 to August 2021. The event-based samples encompassed both non-cyclonic events and those associated with two TCs that made landfall at the southwestern coast of Cuba.

The TCs sampling covered:

  1. 1)

    Tropical storm, Elsa (TS Elsa), from the 5:30 p.m. July 4 to 4:55 p.m. July 5. Landfall in Cuba as TS at 08:00 a.m. on July 5, 2021 (UTC-4 time zone)43.

  2. 2)

    Transition from TS to category 1 hurricane, Ida (TS-H1 Ida), from 10:15 a.m. and 3:15 p.m. on August 27, 2021. Landfall in Cuba as a H1 at 2:00 p.m. UTC-4 on August 27, 2021 (UTC-4 time zone)44.

Stable isotopes

Water stable isotopes (δ18O and δ2H) were determined at three laboratories, using two main analytical techniques. The analytical contribution of each laboratory to the final dataset is divided as follows:

Laser absorption spectroscopy (LAS)

  1. 1)

    At the Isotope Hydrology Laboratory of the International Atomic Energy Agency (IAEA) in Vienna (24%) using 2140-i (Picarro, Santa Clara, CA, USA), DLT‐100 (Los Gatos Research, San Jose, CA, USA) and TIWA‐24 d (Los Gatos Research) water isotopes analyzers45.

  2. 2)

    At the Stable Isotope Research Group, Chemistry Department, National University of Costa Rica (47%), using two water isotope analyzers, L2120‐i (Picarro, USA) and LWIA-45-EP (Los Gatos Research, San Jose, CA, USA)46.

    Isotope ratio mass spectrometry (IRMS)

  3. 3)

    At the Environmental Isotopes Laboratory of the CEAC (29%). Performed by Continuous Flow IRMS, using gas-water equilibrium method47 with the analytical configuration MultiFlow-Isoprime (Elementar Analyse Systems GmbH, Germany). Results were corrected to the VSMOW-SLAP scale using the two-point normalization method48,49. Two in-house standards LIA-CEAC 1 (δ18O: 3.31‰ ± 0.35‰; δ2H: 12.60‰ ± 1.2‰) and SNOW (δ18O: −23.78‰ ± 0.33‰, δ2H: −179.77‰ ± 0.81‰) were used for normalization purpose; and a third LIA-CEAC 2 (δ18O: −5.00‰ ± 0.22‰; δ2H: −32.92‰ ± 1.28‰), was used as a control in-house standard.

Stable isotope ratios are reported in delta notation (per mil ‰), calculated from the expression δ18O or δ2H = (Rs/Rstd - 1) * 1000. Where R is the isotope ratio 18O/16O or 2H/1H from a sample (s) or standard (std) relative to the Vienna Standard Mean Ocean Water (VSMOW) reference standard. All the values and uncertainties reported were calibrated against internationally certified reference materials VSMOW2 and SLAP250.

Deuterium excess was calculated as: d-excess = δ2H – 8 * δ18O, as described in Dansgaard, 196451. This second-order parameter was used to assess the influence of moisture recycling, transport52,53, and the occurrence of kinetic fractionation in low-humidity conditions during below-cloud evaporation54.

Metadata and supplementary variables

Metadata reported at each RNHIP station included location (latitude N and longitude W, in decimal degrees WGS84), elevation (meters above sea level, masl), distance from the coast (shortest distance in km), and sampling date/time frame (sampling period, season, start and end date). Monthly sampling included supplementary variables, such as cumulative rainfall amount (mm), air temperature (°C), relative humidity (%), and atmospheric pressure (hPa). These last three parameters were reported as monthly means based on synoptical observations every three hours at each of the stations according to the regular procedures for meteorological monitoring of the INSMET. Event-based sampling included collection time (min), rainfall amount (mm), rainfall intensity (mm/h), and rainfall classification (light, moderate, and heavy precipitation)55.

Additionally, the Sequential No. of samples by station is reported as metadata. This number represents the sequential number attributed to each entry of the primary sampling record at each station, and constitutes an evidence of the non-homogeneous representativity of the data. The missing values corresponded to samples rejected in the case of: 1) missing samples; 2) quality control of the samples before analysis (too low volume, unsealed or broken flasks, or any evidence of sample perturbation); and 3) quality control of the isotope data (evidence of strong evaporation, outliers, and replicates with significant differences unresolved by reanalysis).

Back-trajectories

To address the moisture source origin of the event-based samples, 72 h back-trajectories (BTs) were computed using the Hybrid Single Particle Lagrangian Integrated Trajectory (HYSPLIT) model56 and the Global NOAA-NCEP/NCAR reanalysis data files57. BTs were run at 3 h intervals starting at ground levels of 50, 500, and 1000 m for each rainfall event. This information was used to determine moisture source origin and transport paths. The main BTs regions were classified as the North Atlantic Ocean (NAO), Eastern Caribbean Sea (ECS), and Continental/Gulf of Mexico (CGM). Spatial trajectory clustering estimations and maps were produced using the OpenAir R package58.

Data Records

The dataset is available at https://doi.org/10.6084/m9.figshare.26061526.v259 . It consists of an Excel file (.xlsx) named “IsotopeDataBaseCubaPrecip10.06.2024_v2”, containing four spreadsheets:

Spreadsheet 1. Monthly Composite Stations: corresponds to 25 inputs characterizing each “monthly composite” monitoring station from the RNHIP, incorporated in 10 columns of categorical and numerical data, including a station ID (see detailed explanation in the original dataset description, referenced above.

Spreadsheet 2. Monthly Composite Data: corresponds to 531 independent observations obtained from the monthly composite monitoring of precipitations of the RNHIP, between January 2017 and December 2021. It is structured in 16 columns covering: station identification; sampling accompanying information; isotopic composition of samples; and data source.

Spreadsheet 3. Event-Based Data: corresponds to 111 independent observations obtained from event-based monitoring at 3 stations in Cienfuegos province, between July 2019 and August 2021. It is structured in 17 columns covering: sample identification; detailed time frame of the sampling, sampling accompanying information; isotopic composition of samples; and data source.

Spreadsheet 4. Calculations: contains basic calculations for specific parameters that can be used for the hydrological interpretation and further usage of the dataset in specific contexts or case studies.

Technical Validation

Isotopic composition of monthly precipitations in Cuba

Monthly results are based on the regular monitoring carried out by the RNHIP (Fig. 1), within a period of 4 years from January 2017 to December 2021. This dataset (Spreadsheet 1), presents 526 results of dual isotopes δ18O and δ2H from monthly composite samples of meteoric waters in Cuba, representing 63% of the total samples collected during the period (840 samples). δ18O results ranged between −9.59‰ and 2.03‰, with an arithmetic mean of −2.42‰ (±1.89‰), and a volume-weighted mean of −3.24‰. δ2H results ranged between −64.42‰ and 18.00‰, with an arithmetic mean of −8.32‰ (±14.64‰), and a volume-weighted mean of −14.89‰. The d-excess, varied between 0.57‰ and 23.38‰, with an arithmetic mean of 11.07‰ (±4.07‰). Rainfall amount ranged from 0.07 mm to 737.89 mm, with an average value of 121.80 mm (±110.06 mm).Other variables were also monitored and included in the dataset (air temperature, atmospheric pressure, and relative humidity).

Local Meteoric Water Lines (LMWLs) resulting from this dataset were calculated form all the monthly data (N = 526) using two different regression methods: 1) Ordinary Least Square Regression (OLSR)60; and 2) Precipitation Amount Weighted Least Squares Regression (PWLSR)61. These lines are based on rainfall isotopic data obtained at different physical-geographical conditions in the country (location, orography, climate, and vapor source origin of precipitations), covering a 4-year study period (2017 to 2021) (Fig. 2). Descriptive statistics for regression lines can be found in Table 1.

Fig. 2
Fig. 2The alternative text for this image may have been generated using AI.
Full size image

Dual isotope plot of meteoric waters in Cuba (input data from January 2017 to December 2021, N = 526). Blue and red lines denote the calculated LMWL using the OLSR60 and PWLSR61 methods, respectively. The blue and red empty symbols denote the arithmetic and volume-weighted mean values.

Table 1 Summary of descriptive statisticians on regressions lines calculated for the isotopic composition of meteoric waters in Cuba.

Spatial and temporal variations can be assessed by regions and seasonality (rainy and dry seasons). As classified in the dataset, the regional grouping used a physical-geographic regionalization based on the political-administrative limits between provinces: 1) Western region, from Pinar del Río to Matanzas, 2) Central region from Cienfuegos to Camagüey, 3) Eastern region from Las Tunas to Guantánamo. Regional and seasonal grouping of δ18O values are presented in Fig. 3 for statistical intercomparison purpose among groups

Fig. 3
Fig. 3The alternative text for this image may have been generated using AI.
Full size image

Box plots of δ18O compositions for monthly composite samples, grouped by regions and seasonality (blue = rainy and orange = dry). Input data from January 2017 to December 2021, N = 526.

Figure 4 presents a spatial variability analysis across 12 stations, representing the main regions. Physical-geographical representativeness of the data was also considered, including different elevations at each region, and distance from the coastline. The minimum temporal representativity for stations selected to include in this analysis was at least 2 years of data. Overall, no spatial trends or notable differences were observed among stations, exhibiting an homogeneous distribution of the mean isotopic composition of precipitation across the island. Notably, more variability was observed towards the central-western regions. Further statistical analysis are needed to test homoscedasticity among groups.

Fig. 4
Fig. 4The alternative text for this image may have been generated using AI.
Full size image

Box plots showing the spatial δ18O (‰) variability in monthly composite samples, grouped by stations (input data from January 2017 to December 2021, n = 362), and regions (Western: 1–5; Central: 11–18; Eastern: 1–25).

Temporal variations are presented as integrated monthly plots and time series sequence for the monitoring period (Figs. 5, 6, respectively). Both plots showed temporal trends between the dry (November to April) and rainy (May to October) seasons. Notably, during the rainy season, particularly lower δ18O values were observed on specific samples from July, August, and October. These samples align with the LMWLs calculated from the dataset and do not exhibit high d-excess values. The location and sampling period corresponding to each of this data revealed a clear association with the landfall of TCs, such as tropical storms Nate and Philippe (October 201762,63), hurricane Michael (October 201864), tropical storm Elsa (July 202143), and hurricane Ida (August 202144).

Fig. 5
Fig. 5The alternative text for this image may have been generated using AI.
Full size image

Monthly isotopic composition of meteoric waters in Cuba (δ18O and d-excess). The dashed grey lines enclose the TCs season between June and November37. Extreme depleted values are identified by date and location. Input data from January 2017 to December 2021, N = 526.

Fig. 6
Fig. 6The alternative text for this image may have been generated using AI.
Full size image

Mean isotopic composition of meteoric waters in Cuba (δ18O and d-excess), shown as the monthly mean time sequence for the period studied (input data from January 2017 to December 2021, N = 526). Dashed grey line square represents the time frame defined as the TCs season in the Insular Caribbean38.

Isotopic composition of event-based precipitations in Cuba

The event-based monitoring was carried out between July 2019 and August 2021, encompassing non-cyclonic rainfall and those associated with the occurrence of TCs. These samples were collected as a full event (n = 43) and intra-event 30-minute time intervals (n = 68). From BTs (Fig. 7), three main groups were defined according to their moisture source origin, 51% originating in the NAO and 49% from the ECS. Only one trajectory was associated with the CGM origin.

Fig. 7
Fig. 7The alternative text for this image may have been generated using AI.
Full size image

Air mass back-trajectories clusters for event-based samples. (A) detailed clustering by moisture source origin (a simplified grouping was considered for the dataset analysis: CGM, NAO, and ECS, see the section for Methods); (B) d-excess distribution for the computed back-trajectories.

Rainfall amounts in full events ranged from 0.95 mm and 121.59 mm (rainfall intensity between 0.76 mm/h and 127.98 mm/h). The isotopic composition exhibited a wide variation compared to the range of values obtained for monthly composite samples, see Fig. 8A. δ18O ranged from −12.78‰ to 0.97‰, (with a mean value −3.58‰ ± 2.94‰), and δ2H in a range between −89.15‰ and 16.5‰ (with a mean value −18.93‰ ± 22.93‰). The d-excess was in a range between 1.4‰ and 16.46‰ (with a mean value of 9.74‰ ± 4.03‰).

Fig. 8
Fig. 8The alternative text for this image may have been generated using AI.
Full size image

Isotopic composition of meteoric waters from event-based samples from regular rainfall, and TCs. (A) Dual isotope plot (δ18O vs. δ2H) showing the nationwide LMWLs, and corresponding OLSR lines for regular rainfall (green), and TCs (orange and blue). The grey shaded area denotes the typically dual isotope space ranges for monthly composite samples. (B) δ18O vs. d-excess plot for event-based data (showing data range as square and distribution grouping as ellipses). Samples are coloured and shape-coded by rainfall intensity and moisture source origin, respectively. TS = tropical storm; H1 = category 1 hurricane; CGM, NAO, and ECS: see the Methods section for details.

Intra-event monitoring revealed rainfall amount and intensity ranging between 0.53 mm and 37.66 mm, and 1.10 mm/h and 150.63 mm/h, respectively. The isotopic composition exhibited the broader range observed in the dataset, compared to both full event and monthly composite samples. δ18O ranged between −16.18‰ and 0.82‰, (with a mean value of −5.47‰ ± 4.22‰), and δ2H ranged from −115.23‰ to 14.64‰ (with a mean value of −34.98‰ ± 34.43‰). The d-excess ranged from −1.65‰ and 16.35‰ (with a mean value of 8.81‰ ± 4.29‰).

Figure 8A illustrates how the isotopic composition of the rainfall associated with TCs was more depleted compared to monthly composite samples. The slope differences for both TC events, with respect to the country LMWL, are presented in Table 2 as Δa61. A notable grouping of the values was observed, highlighted as coloured squares (isotopic range) and ellipses (data grouping) in Fig. 8B. These groups are evident based on d-excess values. Nevertheless, no significant relationship was found between sample grouping and other variables (e.g. moisture source origin, rainfall amount, and rainfall intensity). In such a case, further analysis should be conducted, considering microphysical conditions during the formation and precipitation of raindrops, and the meteorological evolution of these phenomena.

Table 2 Summary of descriptive statisticians on regressions lines calculated for the isotopic composition of meteoric waters event-based monitoring, and their difference with respect to the LMWL presented for Cuba.

Key drivers controlling the isotopic composition of meteoric waters in Cuba

The assessment of the potential parameters modulating the isotopic composition of Cuban meteoric waters is a necessary analysis to understand temporal and spatial variations within the island. As reported in previous studies, the most common correlations in tropical regions and small islands, are: 1) amount effect65,66, 2) altitude effect51,67,68, 3) synoptic or seasonality effects65,69,70, 4) moisture source origin46,71,72, 4) rainfall type (Convective vs. Stratiform)54,73,74, and 5) tropical cyclones excursion72,75,76.

In general terms, all the parameters considered in this dataset showed a relatively weak correlation with the monthly δ18O compositions, see Fig. 9. While the so-called “amount effect” is described as the typical controlling factor for tropical islands51,65, in this study, rainfall amount explains only 25% of the variability observed (Fig. 9A). Similar results were obtained for relative humidity (22%, Fig. 9B), and no influence was observed from other potential effects or drivers such as altitude and temperature (Fig. 9C,D, respectively). This apparent lack of influence on air temperature was expected, considering the study area’s latitudinal location and the relatively homogeneous temperature gradient compared to extratropical latitudes. Nevertheless, seasonality variations should be further investigated, as they appear to significantly affect the variability of the isotopic composition of the meteoric waters in Cuba, as well as the occurrence of heavy rainfalls such as those associated to TCs.

Fig. 9
Fig. 9The alternative text for this image may have been generated using AI.
Full size image

Linear regressions of the δ18O compositions in monthly composite samples vs. potential controlling effects: (A) Rainfall amount; (B) Relative Humidity; (C) Elevation above sea level; and (D) Air temperature at the sampling station level.

Usage Notes

To facilitate the use of this dataset, a detailed description of the structure and calculations are available online at figshare59.

This unique dataset includes monthly composite, full event, and intra-event isotopic compositions of meteoric waters from Cuba. Previous data in this region has been scarce, truncated, limited, or non-existent at such a spatial scale. The dataset is particularly useful for improving our understanding of regional atmospheric processes, extreme rainfall events, and terrestrial water partitioning. Other specific potential uses can be described as:

  1. 1)

    Isoscape mapping

  2. 2)

    Isotope-enable modeling

  3. 3)

    Isotope-enabled forecasting

  4. 4)

    Paleoclimate reconstruction