Record-breaking rainfall: a stochastic approach for its prediction

Chen, Mengzhu; Mahto, Shanti Shwarup; He, Xiaogang; Jun, Changhyun; Paschalis, Athanasios; Peleg, Nadav; Mascaro, Giuseppe; Fatichi, Simone

doi:10.1038/s44304-025-00148-6

Download PDF

Article
Open access
Published: 29 October 2025

Record-breaking rainfall: a stochastic approach for its prediction

Mengzhu Chen¹,
Shanti Shwarup Mahto^1,2,
Xiaogang He¹,
Changhyun Jun³,
Athanasios Paschalis^4,5,
Nadav Peleg^6,7,
Giuseppe Mascaro⁸ &
…
Simone Fatichi¹

npj Natural Hazards volume 2, Article number: 98 (2025) Cite this article

2960 Accesses
Metrics details

Subjects

Abstract

Extreme rainfall events that break previous records are occurring more frequently worldwide, leading to severe flooding and infrastructure damage. Conventional flood design approaches, based on extreme value analysis (EVA) of limited historical data, often fail to anticipate such unprecedented extremes. Here, we present a stochastic approach that leverages the Advanced Weather Generator (AWE-GEN) to simulate a large ensemble of 100-year hourly rainfall time series, explicitly accounting for internal climate variability. By excluding the record-breaking event year during calibration, we assess the ability of our proposed method to reproduce unseen record-breaking events. We evaluated this approach using data from 2703 rain stations across nine countries. Our results show that the stochastic approach captures record-breaking events more reliably than EVA, achieving success rates exceeding 85% for 3–12-hour durations at a 100-year return period threshold. This framework provides a more robust way for estimating rainfall extremes and supports the design of resilient infrastructure under deep uncertainty.

Introduction

Floods continue to pose a major global threat, causing significant loss of life and extensive damage to infrastructure, property, and agriculture. In recent decades, severe flooding driven by record-breaking rainfall events has become more frequent worldwide^1,2. For example, in July 2021, Zhengzhou, China, experienced an unprecedented rainstorm, leading to catastrophic urban flooding³. The city recorded an hourly maximum rainfall of 201.9 mm, breaking the national record previously held since 1975⁴. Similarly, in August 2017, record-breaking rainfall associated with Hurricane Harvey led to severe flash flooding in Houston, USA, inundating over 300,000 structures and impacting approximately half a million vehicles⁵. High-impact events of this magnitude were previously considered highly improbable, but they are occurring with increasing regularity.

At present, the predominant approach for designing flood protection systems and water-related infrastructure relies on extreme value analysis (EVA), which involves fitting a theoretical statistical distribution, typically a generalized extreme value (GEV) distribution, to historical extreme records⁶. While EVA is mathematically robust and has been widely used in engineering practice, it is constrained by two fundamental assumptions: that the statistical properties of extremes remain stationary over time, and that the historical record adequately represents the full range of future possibilities.

Three major challenges undermine these assumptions. First, the climate is nonstationary^7,8,9,10, and climate change is altering the frequency and intensity of extreme rainfall events. Global warming enhances atmospheric moisture-holding capacity and convective intensity, increasing the likelihood of record-breaking rainfall^11,12,13,14. Second, different extreme rainfall events may belong to different statistical populations because of the presence of multiple rainfall-generating mechanisms^15,16. This heterogeneity violates the assumption that all extremes belong to a single statistical population. However, extensions such as the two-component extreme value (TCEV) distribution^17,18 can account for this heterogeneity by modeling two populations of extremes originated from different physical processes. Third, even in a stationary climate with a distribution of events from the same population, internal climate variability¹⁹ can lead to the occurrence of record-breaking events that are significantly larger than any previously observed on records. This is due to the inherently stochastic nature of the climate system, which implies that even relatively long observational records (e.g., >30-year records) may fail to capture very rare but physically possible extreme events. In this study, we focus on the second and third challenges, as the first has been extensively addressed in the literature on rainfall extremes in a changing climate^{10,20,21,22,23,24,25}.

We illustrate the problem investigated here with an example. During Hurricane Harvey in August 2017, a rain gauge in northwest Houston, USA, recorded 408.4 mm of rainfall in 24 h, as shown in Fig. 1. This event significantly exceeded all other observed extremes by a considerable margin. While the exact return period of this event is unknown, its probability of occurrence was likely underestimated by the conventional EVA approach. The GEV distribution fitted to historical annual precipitation maxima provided lower estimates for 100–300-year return periods and failed to account for the magnitude of this hurricane-induced event, even when considering the 5–95th confidence intervals for a 100-year return period. Given that hydraulic infrastructures are typically designed based on return periods of 100 or 200 years, the existing infrastructures were largely unprepared for an event of this magnitude. While illustrative of a single case, this example motivates a broader systematic investigation of record-breaking extreme rainfall events and underscores the limitations of conventional EVA methodology in assessing the risks posed by such extremes.

**Fig. 1: Observed extreme rainfall for a 24-h duration and fitted GEV distribution (excluding the record-breaking event during the fitting) for a station located northwest of Houston, Texas.**

To enhance the robustness and resilience of engineering design to avoid flood damage, there is a growing need for methodologies that can account for internal climate variability and diverse rainfall-generating mechanisms. In this study, we investigate whether a stochastic weather generator can serve as a more reliable tool for estimating record-breaking rainfall events compared to conventional EVA. Specifically, we use the Advanced Weather Generator (AWE-GEN) to simulate the stochastic variability of the precipitation process by generating a large ensemble of 100-year-long hourly synthetic rainfall time series (see “Methods”). Unlike conventional EVA, which relies solely on the “tail” part of historical records, our approach considers the full distribution, including both tail and non-tail parts. This allows the reproduction of a broad range of rainfall statistics beyond extremes, including the explicit representation of internal climate variability and different rainfall-generating mechanisms by using rainfall statistics computed over different months.

The proposed stochastic approach based on AWE-GEN is evaluated using hourly rainfall data from 2703 stations across nine countries, spanning a range of climates and storm types. The geographical distribution of these stations is shown in Fig. 2. Among them, we have identified 429 stations that experienced record-breaking rainfall events for various durations (1, 3, 6, 12, and 24 h), highlighted in green in Fig. 2. Details on identifying record-breaking rainfall events and the proposed stochastic approach are provided in the “Methods” section. In the following sections, we present the performance of the new approach relative to conventional EVA and discuss the implications for flood risk estimation and infrastructure design under uncertainty.

Fig. 2: Spatial distribution of the 2703 quality-controlled rain stations used in this study, located across the United States, Belgium, Germany, Switzerland, the United Kingdom, South Korea, Japan, Singapore, and New Zealand.

Results

Simulation of unseen record-breaking rainfall events

The performance of the stochastic AWE-GEN approach compared to the conventional GEV-based EVA method in simulating extreme rainfall across five durations (1, 3, 6, 12, and 24 h) at ten representative stations is showcased in Fig. 3. The left column illustrates five stations where AWE-GEN fails to reproduce a rainfall magnitude equal to the record-breaking events within the 5–95th percentiles for a 100-year return period; the right column showcases five stations where instead the proposed approach successfully captures the record-breaking events for each duration. Even though the exact value of the return period of the record-breaking rainfall event is unknown, as discussed more in detail later on, with success we refer to cases where the magnitude of the record-breaking event was captured by the 5–95th percentile range of AWE-GEN simulations for a 100-year return period, while failure denotes cases where the event exceeds this range (see “Methods”). As a 100-year period is an arbitrary decision, other target return periods (50 and 200 years) are also used as definitions of success and failure cases.

**Fig. 3: Comparison of AWE-GEN simulations and GEV distributions in capturing record-breaking rainfall events across five durations (1, 3, 6, 12, and 24 h) for ten representative stations.**

In all ten cases, the record-breaking events fall outside the 5–95% bootstrap confidence intervals of the GEV fits, demonstrating that the underestimation is not only due to a poor fitting of the GEV curve but persists even when accounting for parameter uncertainty. It reveals a critical limitation of conventional EVA methods: the tail behavior estimated from limited records of annual maxima often fails to anticipate the magnitude of unprecedented very large extremes.

In contrast, the AWE-GEN approach provides a broader probabilistic envelope that incorporates internal climate variability, enabling it to simulate a more realistic range of extremes. For instance, in Fig. 3j, corresponding to the Houston station affected by Hurricane Harvey, the observed 24-h rainfall (408.4 mm) is successfully captured within the 5–95th percentile range of the AWE-GEN ensemble. For the same event, the GEV distribution assigns a return period of approximately 320 years; thus, the magnitude of such an event will be underestimated using conventional 100- or 200-year design criteria. This could lead to the underdesign of hydraulic structures and flood prevention measures, posing risks to infrastructure and public safety.

Another illustrative example occurred in October 2016, when Hurricane Matthew brought heavy rainfall to central and eastern North Carolina, which resulted in major flooding. More than 600 roads were closed and nearly 99,000 structures were affected by floodwaters²⁶. The station of Fayetteville, North Carolina, recorded a 12-h record-breaking rainfall event of 307 millimeters during Hurricane Matthew (Fig. 3h). The GEV method estimates the return period of this event at 6000 years. Furthermore, a clear discrepancy emerges between the historical observations and the GEV distribution tail. The GEV distribution systematically underestimates the high return levels, even when accounting for its 5–95% confidence intervals. By contrast, the AWE-GEN approach successfully reproduces this extreme event. This suggests that a stochastic weather generator, by accounting for a larger number of statistics and mostly internal variability, offers a more flexible and robust representation of the range of potential unseen extremes.

However, not all record-breaking events are successfully captured by AWE-GEN. For example, a station located on the northern coast of Puerto Rico experienced unprecedented heavy rainfall during Hurricane Maria in September 2017 (Fig. 3i). As a powerful Category 4 hurricane, Hurricane Maria brought catastrophic flooding and landslides to Puerto Rico, with extreme rainfall reaching 547 mm within 24 h at this station. This event represents a significant outlier in the historical rainfall record. While it lies within the broader 1–99th percentile range of the AWE-GEN simulations, it exceeds the upper boundary of the 5–95th percentile range for a 100-year return period. The GEV method also fails to represent this event within conventional return periods, assigning a return period of about 1000 years. These limitations of traditional EVA highlight the challenges of modeling high-impact outliers that deviate significantly from precedent events on record. Conversely, there is a value in using stochastic rainfall generators like AWE-GEN that are better equipped to simulate unseen but possible record-breaking rainfall events.

Success rate of the stochastic rainfall generator

After illustrating AWE-GEN potential in Fig. 3, we provide a systematic analysis of results across all selected record-breaking rainfall events and stations. Figure 4 summarizes the success rates in capturing record-breaking events across different durations (1, 3, 6, 12, and 24 h) using three approaches: (a) the proposed stochastic AWE-GEN approach; (b) the GEV distribution fitted to historical observations; and (c) the GEV distribution fitted to 100 synthetic 100-year-long realizations generated by AWE-GEN. The success rate is quantified at the 100-year return period for three percentile ranges: 10–90th, 5–95th, and 1–99th. To examine whether the poor performance of the GEV-based EVA method is primarily due to data limitations, the third approach (Fig. 4c) increases data availability by extending the rainfall time series length through synthetic realizations (see “Methods”). We also present the success rate calculated based on the 50-year and 200-year return periods for the three approaches in Supplementary Fig. 1, while, as expected, exact percentages of successes are larger using a target 200 years return period and lower using a target 50-year return period, differences are not particularly pronounced and most importantly they do not modify the relative performance of AWE-GEN and EVA based approaches.

The AWE-GEN approach (Fig. 4a) consistently achieves high success rates across all durations and percentile ranges, demonstrating its robustness in capturing unseen extreme rainfall events. For example, using the 5–95th percentile range, the AWE-GEN method achieves success rates of 58% (1 h), 87% (3 h), 97% (6 h), 93% (12 h), and 76% (24 h), which are considerably higher than those obtained with the conventional GEV method (Fig. 4b), which only achieve success rates of 5%, 1%, 2%, 4%, and 2% respectively. This pattern remains consistent across all percentile ranges considered.

By fitting the GEV distribution to the large ensemble of 100-year-long synthetic realizations generated by AWE-GEN (Fig. 4c), we effectively eliminate the issue of limited observational data. If the GEV method were only limited by data availability, its success rates applied to synthetic data should be comparable to those of the AWE-GEN approach. The GEV method fitted to the 100 synthetic realizations (Fig. 4c) improves considerably (Fig. 4b) but still underperforms by far the success rate obtained with the stochastic weather generator (Fig. 4a). For example, at the 5–95th percentile range, the success rates of the GEV method fitted with synthetic realizations are 21% (1 h), 58% (3 h), 57% (6 h), 45% (12 h), and 27% (24 h).

These findings suggest that the conventional GEV method suffers from inherent methodological limitations in capturing record-breaking rainfall events, beyond the commonly acknowledged constraint of limited observational data. While increasing the dataset size through synthetic realizations improves the success rates to some extent, the GEV framework still falls short in capturing the stochastic nature of unseen extreme rainfall events. In contrast, the results confirm that AWE-GEN is capable of simulating record-breaking rainfall events with a high degree of success, particularly for mid-to-long durations in the range of 3–24 h, but it still captures more than 50% of record-breaking events also at the hourly scale. This makes the proposed approach a valuable tool for assessing risks associated with record-breaking precipitation extremes. Given the dominant role of stochastic variability for future projections of station-scale rainfall extremes under an uncertain future^20,27, such an approach is likely to capture most of the extremes also in a non-stationary climate. However, the model skill in simulating short-duration 1-h unseen extremes is much less than for other durations, which demands further enhancements to improve model structure at these temporal scales or for combinations of different stochastic rainfall models²⁸.

Theoretically, the greater the extremity of an outlier event, the more challenging it is to capture its magnitude. To explore whether the degree of extremity of record-breaking events correlates with the success rate of the AWE-GEN approach, we analyze the distribution of the ratio of maximum to second maximum rainfall for both success and failure cases across all durations in Fig. 5. The ratio represents the degree of extremity of a record-breaking event in the observed historical record. Failure cases generally display higher ratios, with greater medians compared to success cases for all durations. This indicates that failure cases are generally associated with more extreme outliers, where the record-breaking rainfall event significantly exceeds the historical pattern. This pattern is particularly pronounced for longer durations, such as 12 and 24 h. In contrast, for shorter durations (i.e., 1 h and 3 h), success and failure cases exhibit more overlap in their ratio distributions, suggesting that the performance of the AWE-GEN approach at shorter timescales is less related to the degree of event extremity.

**Fig. 5: Violin plots showing the distribution of the ratio between maximum and second maximum rainfall for AWE-GEN simulation success and failure cases across five durations (1, 3, 6, 12, and 24 h).**

Interestingly, while the failure cases are generally associated with higher ratios due to more extreme outliers, the success cases also show the presence of some high outliers across all durations. These outliers in the success cases indicate that the AWE-GEN approach has the capacity (although not always) to simulate extremes that even substantially exceed the subsequently observed record-breaking extremes.

Limitations of conventional EVA in predicting record-breaking events

We further assess whether the potential underestimation of the conventional EVA approach in estimating record-breaking rainfall events is a true underestimation or is simply because these events have very large return periods and thus EVA estimates are indeed correct. We compared the return periods estimated for all the identified record-breaking rainfall events, inverting different extreme value distributions (GEV, Gumbel, and the two-parameter Fréchet) with the theoretical return periods of these events. In this analysis, the theoretical return periods are derived directly using the same number of stations and record length as observations and are an accurate approximation of the real ensemble distribution of return periods of the analyzed record-breaking events (see Methods and Supplementary Fig. 4).

From this analysis, we can see that the GEV, despite being widely used for EVA of rainfall, consistently overestimates return periods of record-breaking events (Fig. 6). This overestimation is evident in the wide range of return periods produced by GEV, with most values often extending far beyond theoretically expected return periods. Such overestimation undermines the reliability of GEV for accurately quantifying the risks associated with record-breaking precipitation events in the tail of the distribution, as their return periods will be largely overestimated or conversely the magnitude underestimated for a given return period. For practical applications, these inflated return periods could lead to under-preparedness in flood risk management and infrastructure design. For example, record-breaking events that are likely to occur over relatively shorter timeframes (a century or so) are misrepresented as events with extremely low probabilities.

Fig. 6: Boxplots of return periods (years) for all identified record-breaking rainfall events, derived by inverting different extreme value distributions fitted to the data: generalized extreme value (GEV), Gumbel, and Fréchet.

While the Gumbel and Fréchet distributions are special cases of the GEV family, in hydrological practice it is common to use them rather than the GEV. This is often done either for historical consistency, simplicity in engineering design, or based on assumptions about the tail behavior of the data. For example, many engineering guidelines still recommend the Gumbel distribution for design purposes^29,30. Our analysis shows that the Gumbel distribution, while exhibiting a narrower range of return periods than GEV, still overestimates return periods relative to the theoretical expectations, with a higher median. Its overestimation is less severe than GEV but remains problematic. In contrast, the two-parameter Fréchet distribution demonstrates the closest alignment with theoretical return periods. Its narrower interquartile range and lower median values suggest that the Fréchet distribution provides a reasonable probabilistic assessment of return periods associated with record-breaking events. Similarly, Papalexiou and Koutsoyiannis (2013), by analyzing over 15,000 global rainfall records to evaluate which extreme value distribution best fits annual maximum daily rainfall globally, found that the Fréchet distribution consistently outperformed other distributions³¹. The heavy-tail characteristic of this distribution better captures the variability and extremity of rainfall events.

Our results reinforce the idea that using the Fréchet distribution or other distributions explicitly tailored to better capture the tail of the extreme values, such as the TCEV distribution^17,18 would be essential to associate the right probability to record-breaking rainfall events originating from different rainfall-generating mechanisms¹⁶. However, the practical application of such models generally requires that the parameters for the two (or more) populations be estimated reliably from enough samples in the precipitation data, which may not always be the case. It should be noted that in any single station, the magnitude of these record-breaking events will still be underestimated using conventional return periods of 100–200 years for design, even using the Fréchet distribution, as the true return periods of these record-breaking events are estimated around 500–5000 years (see the range of theoretical return periods in Fig. 6).

While our study applies a consistent GEV across thousands of stations for objective comparison, we acknowledge that some practitioners might use a broader set of approaches in operational hydrology and engineering, including comparing multiple statistical distributions (e.g., Gumbel, Fréchet, and generalized Pareto), and regional frequency analysis^29,32,33. Guidelines such as the flood estimation handbook (FEH)³⁴ recommend the use of a range of distributions and the careful assessment of model fit, threshold selection, and data independence. Our pragmatic use of EVA was chosen to ensure methodological consistency across a large and diverse dataset and still reflects the approach used by many engineers worldwide.

Discussion

Overall, the GEV and Gumbel distributions tend to overestimate return periods for record-breaking rainfall events, with the GEV distribution showing the most pronounced overestimation. This limitation highlights the risks of relying solely on conventional EVA for infrastructure design and flood prevention. The Fréchet distribution, with its closer alignment to theoretical values, may offer a more reliable alternative, especially in contexts requiring a proper approximation of the heavy tails of the distribution. However, the stochastic approach presented here provides further advantages over traditional EVA methods. Although this approach comes with slightly higher implementation complexity and is less accessible for some practitioners, its improved performance demonstrates the value of robust stochastic modeling techniques that can explicitly account for internal climate variability and diverse rainfall-generating mechanisms. While user-friendly weather and rainfall generators are becoming more widely available³⁵, we suggest that future work focus on developing even more accessible tools and practical guidelines to facilitate wider adoption by practitioners.

Importantly, our results show that the stochastic approach is substantially less sensitive to the exclusion of extreme events from the calibration dataset compared to conventional extreme value approaches. This reduced sensitivity is particularly valuable in the real world, where future unprecedented rainfall events are inevitably absent from historical records. By leveraging the entire rainfall time series, the stochastic framework offers a more reliable foundation for predicting unseen extremes.

In summary, our findings highlight the limitations of conventional EVA, which consistently underestimates the magnitude of record-breaking rainfall events, even when using extended synthetic datasets and considering wide confidence intervals. This underperformance is not only due to limited data availability but also reflects methodological constraints in capturing the full spectrum of rainfall variability. In contrast, the proposed stochastic weather generator approach demonstrates a more realistic capacity for simulating record-breaking rainfall events, particularly for mid-to-long durations between 3 and 12 h. Success rates for these durations consistently exceeded 95% within the 1–99th percentile range and 85% within the 5–95th percentile range. This probabilistic framework enables a statistical representation of the variability inherent in the precipitation process, enhancing the reliability of risk estimates for very rare events. As such, it provides a more adaptive and robust foundation for flood risk estimation and infrastructure planning in an uncertain climate, with direct implications for engineering practice and long-term resilience.

Methods

Observational data

Observed hourly rainfall time series from 3520 rain stations across multiple countries, including the United States, Belgium, Germany, Switzerland, the United Kingdom, South Korea, Japan, Singapore, and New Zealand, were collected from national meteorological and/or hydrological agencies (Table S1). Stations were selected based on a minimum record length of ten years, with an average record length of 44 years. Although national meteorological agencies conducted preliminary quality checks on rain gauge observations, certain records still contain anomalously high values or unrealistically long dry periods, along with missing data. To ensure that the recorded rainfall extremes are reliable and not caused by measurement errors, equipment malfunctions, or data processing mistakes, additional quality-control procedures were developed and applied to these multi-sourced hourly rainfall datasets. The quality control procedures are provided in Supplementary Information (Text S1). Ultimately, 2703 rain stations passed the quality-control procedures and were used in the formal analysis. Figure 2 illustrates the geographical distribution of the 2703 quality-controlled stations, highlighting the stations that experienced record-breaking rainfall events (see definition in the next section).

To characterize the climatic diversity of the study domain, we classified each station using the Köppen–Geiger climate classification³⁶, based on its geographic coordinates. Specifically, we identified 315 stations in arid and semi-arid climates, 1295 in temperate climates, 925 in continental climates, 157 in tropical climates and 11 in polar climates. A detailed analysis of success rates by climate group is presented in Supplementary Fig. 2. Due to limited representation in tropical and polar climates, our analysis focused on three dominant groups with sufficient sample size: Arid/Semi-arid, Temperate, and Continental.

Identification of record-breaking rainfall events

Record-breaking rainfall events were systematically identified using hourly rainfall data from the 2703 quality-controlled rain stations. The identification involved the following steps:

(1)
Extraction of annual maxima: Annual maximum rainfall for five durations (1, 3, 6, 12, and 24 h) at each station was computed using a moving window method, ensuring the largest rainfall event for each duration was recorded for every year of available data.
(2)
Quantification of extreme event magnitudes: For each station and each duration, the maximum and the second maximum annual rainfall events were extracted from the series of annual maxima. The ratio between the maximum event and the second maximum event (Max1/Max2) was then calculated. This ratio represents the degree to which a certain record-breaking event exceeds the second-highest event in the record, serving as a quantifiable measure of extremity.
(3)
Threshold selection for record-breaking events: Stations with ratios in the top 5% (95th percentile) of the cumulative distribution function (CDF) of Max1/Max2 across all gauges were identified as experiencing significant record-breaking events (Fig. 7). These thresholds range approximately from 1.51 to 1.57, depending on the rainfall duration, implying that the magnitude of record-breaking events is at least 50% larger than the second-largest maxima on record.

**Fig. 7: Cumulative distribution functions (CDFs) of the ratio between observed maximum and second maximum rainfall events for five durations (1, 3, 6, 12, and 24 h).**

As a result, this procedure identified 135 stations for each duration that experienced record-breaking rainfall events, resulting in a total of 675 selections. Because some stations experienced record-breaking events across multiple durations, the final number of unique selected stations is 429, with an average record length of 36.2 years, as highlighted in green in Fig. 2.

Stochastic weather generator

The AWE-GEN (Advanced WEather GENerator) is an hourly weather generator designed to simulate time series of various weather variables such as precipitation, cloud cover, air temperature, and incoming shortwave radiation, by combining both stochastic approaches and some level of process representation, thus offering a flexible framework for weather simulation going beyond purely statistical methods³⁷. We only use the precipitation module of AWE-GEN, which is based on the Neyman–Scott Rectangular Pulse (NSRP) model^28,38. The NSRP model is well-suited for capturing the temporal clustering and variability of rainfall, particularly between seasonal and hourly scales, while it starts to degrade at sub-hourly scales²⁸. It allows for the stochastic generation of rainfall events by considering storm arrivals, rain cells, and their timing within the storm and associated durations and intensities of rain cells, making it particularly effective for modeling extreme rainfall events. In this framework, the precipitation amount at a given time is given by the overlapping of all rain cells covering that specific time window which might include rain cells of the same storm or even different storms. Model parameters are calibrated by fitting several rainfall statistics (e.g., mean, coefficient of variation, lag-1 autocorrelation, skewness, and frequency of precipitation) at different durations (1, 6, 24, and 72 h), but there is no specific fitting associated with extreme events, contrary to other approaches³⁹. A full description of the Neyman–Scott Rectangular Pulse model and associated parameter estimation is available in previous references^{37,38,40,41,42,43}. AWE-GEN also has a module to deal with inter-annual variability, ensuring a realistic representation of precipitation variability from hourly to decades. The parameters of the weather generator are calibrated independently for the twelve months to account for seasonality and were estimated from the observed hourly rainfall data from the selected stations with record-breaking rainfall events but excluding the entire year containing the record-breaking event. In such a way, the model is blind to this occurrence. For a comprehensive description of the AWE-GEN model structure and parameterization, refer to Fatichi et al.³⁷. The ability of AWE-GEN to realistically replicate the observed rainfall time series is shown in Supplementary Figs. 5 and 6.

Conventional extreme value analysis

Conventional hydrological design and risk assessment typically rely on EVA to statistically represent rare events by fitting an appropriate probability distribution^6,31,44,45. Extreme rainfall time series for various durations are generally constructed using block maxima over the observational period. The GEV distribution⁴⁶ is the most widely adopted and recommended distribution for extreme rainfall frequency analysis when data are obtained using the annual block maxima approach^21,47,48. The CDF of the GEV is expressed as:

$$F\left({x|}\mu ,\sigma ,\xi \right)=\exp \left[-{\left(1+\xi \left(\frac{x-\mu }{\sigma }\right)\right)}^{-1/\xi }\right]$$

(1)

where $\mu$, $\sigma$, and $\xi$ represent the location, scale, and shape parameters, respectively. Specifically, $\mu$ determines the center of the distribution, $\sigma$ describes the dispersion of data around $\mu$, and $\xi$ governs the tail behavior of the distribution. Depending on the shape parameters $\xi$, the Gumbel ($\xi =0$), Fréchet ($\xi > 0$), and Weibull ($\xi < 0$) distributions can be derived as special cases. The GEV parameters were estimated using L-moments, preferred for their robustness against skewness and outliers, especially advantageous for limited sample sizes^31,32,49.

To quantify the uncertainty in the GEV estimates, confidence intervals were calculated using a bootstrap resampling approach, generating 10³ bootstrap samples by resampling the original data with replacement. This provides robust uncertainty bounds around the fitted GEV distribution, facilitating a comparison with results from the stochastic weather generator. In this study, both the conventional EVA and the AWE-GEN rainfall generator are used under the assumption of stationarity, as is standard practice.

Comparison of weather generator vs extreme value analysis

To assess the capability of the proposed stochastic approach in simulating unprecedented record-breaking extremes, we tested both AWE-GEN and GEV-based methods using observed rainfall data from the selected stations (see section “Observational data”). For both approaches, parameters were estimated using all available records excluding the year of the record-breaking event, simulating a scenario in which the extreme event had not yet recorded. This allowed us to evaluate whether both approaches could successfully identify the record-breaking event as a potential future rainfall extreme. For the proposed stochastic approach, an ensemble of 100 synthetic realizations, each comprising a 100-year-long hourly rainfall time series was generated with AWE-GEN. These 100 realizations represent equiprobable stochastic replicates of the rainfall time series and thus explicitly consider internal climate variability in rainfall occurrence.

To determine if the limited performance of the GEV-based EVA method was primarily due to data limitations, an additional comparison was performed. The GEV distribution was fitted separately to each of the 100 synthetic rainfall realizations generated by AWE-GEN. By evaluating the ability of these GEV fits to capture the record-breaking events, we assessed whether increased data availability (in this case from synthetic realizations) improved the performance of the conventional EVA approach, or if its limitations persisted despite having longer or much longer data series.

Additionally, to evaluate whether the degree of extremity of record-breaking events influences the ability of AWE-GEN to capture them, we classified the stations into two groups: (1) success cases, where AWE-GEN simulations captured the event within their 5–95th percentile range at a given return period, and (2) failure cases, where the record-breaking event exceeded the 95th percentile. We then analyzed the distribution of Max1/Max2 for both success and failure cases through violin plots.

When analyzing a large number of stations (2703) with extensive observational data, it is expected that some observed maxima will have very large return periods, potentially much larger than the value assigned with the Weibull plotting position formula⁵⁰. The distribution of these very large return periods is a function of the number of stations and recorded data length. To provide a robust benchmark for evaluating return periods estimated by the GEV and two widely used extreme value distributions in engineering practice (Gumbel and two parameters-Fréchet distribution), we generated theoretical distributions of return periods associated with record-breaking events. Specifically, for each station and observational record length, we randomly generated return periods based on their definition, i.e., as the inverse of exceedance probability, with exceedance probability drawn uniformly between 0 and 1 for the same number of years for each station in the database. We then selected only the top 5% of stations where a return period for a station exceeds the second-highest return period by the largest ratio, mirroring the criteria used to identify record-breaking events in the actual data. Although it is impossible to precisely assign return periods to individual observed events, this stochastic approach provides a realistic theoretical distribution of return periods across all the stations-years used, enabling meaningful comparison with return periods obtained from inverting the extreme value distributions. To account for randomness in the return period generation process, we repeated the random generation procedure twenty times and confirmed that the resulting theoretical return-period distributions remained stable across replicates (Supplementary Fig. 4).

Data availability

The observed hourly rainfall data was obtained from multiple national meteorological and/or hydrological agencies, the access websites are provided in Supplementary Table 1. Most of these datasets can be downloaded from the websites, and some of them are not publicly available but can be provided upon reasonable request to the authors, subject to the terms and conditions of the original data providers.

Code availability

The MATLAB code of the stochastic weather generator AWE-GEN is available at https://hyd.ifu.ethz.ch/research-data-models/awe-gen.html.

References

de Vries, I., Sippel, S., Zeder, J., Fischer, E. & Knutti, R. Increasing extreme precipitation variability plays a key role in future record-shattering event probability. Commun. Earth Environ. 5, 482 (2024).
Article Google Scholar
Lehmann, J., Coumou, D. & Frieler, K. Increased record-breaking precipitation events under global warming. Clim. Change 132, 501–515 (2015).
Article Google Scholar
Nie, Y. & Sun, J. Moisture sources and transport for extreme precipitation over Henan in July 2021. Geophys. Res. Lett. 49, e2021GL097446 (2022).
Article Google Scholar
Duan, R., Huang, G., Zhou, X., Lu, C. & Tian, C. Record-breaking heavy rainfall around Henan Province in 2021 and future projection of extreme conditions under climate change. J. Hydrol. 625, 130102 (2023).
Article Google Scholar
Zhang, W., Villarini, G., Vecchi, G. A. & Smith, J. A. Urbanization exacerbated the rainfall and flooding caused by Hurricane Harvey in Houston. Nature 563, 384–388 (2018).
Article CAS Google Scholar
Katz, R. W., Parlange, M. B. & Naveau, P. Statistics of extremes in hydrology. Adv. Water Resour. 25, 1287–1304 (2002).
Article Google Scholar
Milly, P. C. et al. Stationarity is dead: whither water management? Science 319, 573–574 (2008).
Article CAS Google Scholar
Serinaldi, F. & Kilsby, C. G. Stationarity is undead: uncertainty dominates the distribution of extremes. Adv. Water Resour. 77, 17–36 (2015).
Article Google Scholar
Yilmaz, A. G., Imteaz, M. A. & Perera, B. J. C. Investigation of non-stationarity of extreme rainfalls and spatial variability of rainfall intensity–frequency–duration relationships: a case study of Victoria, Australia. Int. J. Climatol. 37, 430–442 (2017).
Article Google Scholar
Schlef, K. E. et al. Incorporating non-stationarity from climate change into rainfall frequency and intensity–duration–frequency (IDF) curves. J. Hydrol. 616, 128757 (2023).
Article Google Scholar
Allan, R. P. & Soden, B. J. Atmospheric warming and the amplification of precipitation extremes. Science 321, 1481–1484 (2008).
Article CAS Google Scholar
O’Gorman, P. A. & Schneider, T. The physical basis for increases in precipitation extremes in simulations of 21st-century climate change. Proc. Natl Acad. Sci. USA 106, 14773–14777 (2009).
Article Google Scholar
Prein, A. F. et al. The future intensification of hourly precipitation extremes. Nat. Clim. Change 7, 48–52 (2017).
Article Google Scholar
Fowler, H. J. et al. Anthropogenic intensification of short-duration rainfall extremes. Nat. Rev. Earth Environ. 2, 107–122 (2021).
Article Google Scholar
Smith, J. A., Villarini, G. & Baeck, M. L. Mixture distributions and the hydroclimatology of extreme rainfall and flooding in the eastern United States. J. Hydrometeorol. 12, 294–309 (2011).
Article Google Scholar
Alshehri, M., Mascaro, G. & Kunkel, K. E. On the generating mechanisms of daily precipitation in the conterminous United States: climatology, trends, and associated marginal and extreme distributions. J. Hydrometeorol. 25, 1895–1914 (2024).
Article Google Scholar
Rossi, F., Fiorentino, M. & Versace, P. Two-component extreme value distribution for flood frequency analysis. Water Resour. Res. 20, 847–856 (1984).
Article Google Scholar
Rulfová, Z., Buishand, A., Roth, M. & Kyselý, J. A two-component generalized extreme value distribution for precipitation frequency analysis. J. Hydrol. 534, 659–668 (2016).
Article Google Scholar
Deser, C., Phillips, A., Bourdette, V. & Teng, H. Uncertainty in climate change projections: The role of internal variability. Clim. Dyn. 38, 527–546 (2012).
Article Google Scholar
Fatichi, S. et al. Uncertainty partition challenges the predictability of vital details of climate change. Earths Future 4, 240–251 (2016).
Article Google Scholar
Ragno, E., AghaKouchak, A., Cheng, L. & Sadegh, M. A generalized framework for process-informed nonstationary extreme value analysis. Adv. Water Resour. 130, 270–282 (2019).
Article Google Scholar
Kourtis, I. M. & Tsihrintzis, V. A. Update of intensity–duration–frequency (IDF) curves under climate change: a review. Water Supply 22, 4951–4974 (2022).
Article Google Scholar
Chen, M., Papadikis, K., Jun, C. & Macdonald, N. Linear, nonlinear, parametric and nonparametric regression models for nonstationary flood frequency analysis. J. Hydrol. 616, 128772 (2023).
Article Google Scholar
Marra, F., Koukoula, M., Canale, A. & Peleg, N. Predicting extreme sub-hourly precipitation intensification based on temperature shifts. Hydrol. Earth Syst. Sci. 28, 375–393 (2024).
Article Google Scholar
Peleg, N. et al. A simple and robust approach for adapting design storms to assess climate-induced changes in flash flood hazard. Adv. Water Resour. 193, 104823 (2024).
Article Google Scholar
Musser, J. W., Watson, K. M. & Gotvald, A. J. Characterization of peak streamflows and flood inundation at selected areas in North Carolina following Hurricane Matthew, October 2016. US Geol. Surv. Open-File Rep. 2017–1047 https://doi.org/10.3133/ofr20171047 (2017).
Peleg, N., Molnar, P., Burlando, P. & Fatichi, S. Exploring stochastic climate uncertainty in space and time using a gridded hourly weather generator. J. Hydrol. 571, 627–641 (2019).
Article Google Scholar
Paschalis, A., Molnar, P., Fatichi, S. & Burlando, P. On temporal stochastic modeling of precipitation, nesting models across scales. Adv. Water Resour. 63, 152–166 (2014).
Article Google Scholar
Ball, J. et al. (eds) Australian Rainfall and Runoff: A Guide to Flood Estimation (Commonwealth of Australia, 2019).
Department of Irrigation and Drainage Malaysia. Urban Stormwater Management Manual for Malaysia (MSMA) 2nd edn (DID Malaysia, 2012).
Papalexiou, S. M. & Koutsoyiannis, D. Battle of extreme value distributions: A global survey on extreme daily rainfall. Water Resour. Res. 49, 187–201 (2013).
Article Google Scholar
Hosking, J. R. M. & Wallis, J. R. Regional Frequency Analysis: An Approach Based on L-Moments (Cambridge Univ. Press, 1997).
Kjeldsen, T. R., Jones, D. A. & Bayliss, A. C. Improving the FEH statistical procedures for flood frequency estimation. Environment Agency (2008).
Institute of Hydrology. Flood Estimation Handbook (5 vols) (Institute of Hydrology, Wallingford, 1999).
De Luca, D. L. & Petroselli, A. STORAGE (STOchastic RAinfall GEnerator): a user-friendly software for generating long and high-resolution rainfall time series. Hydrology 8, 76 (2021).
Article Google Scholar
Beck, H. E. et al. High-resolution (1 km) Köppen–Geiger maps for 1901–2099 based on constrained CMIP6 projections. Sci. Data 10, 724 (2023).
Article Google Scholar
Fatichi, S., Ivanov, V. Y. & Caporali, E. Simulation of future climate scenarios with a weather generator. Adv. Water Resour. 34, 448–467 (2011).
Article Google Scholar
Cowpertwait, P., Isham, V. & Onof, C. Point process models of rainfall: developments for fine-scale structure. Proc. R. Soc. A 463, 2569–2587 (2007).
Article Google Scholar
Beneyto, C., Aranda, J. Á & Francés, F. Exploring the uncertainty of weather generators’ extreme estimates in different practical available information scenarios. Hydrol. Sci. J. 68, 1203–1212 (2023).
Article Google Scholar
Cowpertwait, P. S. A Poisson-cluster model of rainfall: some high-order moments and extreme values. Proc. R. Soc. Lond. A 454, 885–898 (1998).
Article Google Scholar
Cowpertwait, P. S. A spatial–temporal point process model of rainfall for the Thames catchment, UK. J. Hydrol. 330, 586–595 (2006).
Article Google Scholar
Cowpertwait, P. S. P., Kilsby, C. G. & O’Connell, P. E. A space–time Neyman–Scott model of rainfall: Empirical analysis of extremes. Water Resour. Res. 38, 6-1–6-11 (2002).
Cowpertwait, P. S. P., O’Connell, P. E., Metcalfe, A. V. & Mawdsley, J. A. Stochastic point process modelling of rainfall. I. Single-site fitting and validation. J. Hydrol. 175, 17–46 (1996).
Article Google Scholar
Coles, S. An Introduction to Statistical Modeling of Extreme Values (Springer, 2001).
Koutsoyiannis, D. Statistics of extremes and estimation of extreme rainfall: I. Theoretical investigation. Hydrol. Sci. J. 49, 575–590 (2004).
Article Google Scholar
Jenkinson, A. F. The frequency distribution of the annual maximum (or minimum) values of meteorological elements. Q. J. R. Meteorol. Soc. 81, 158–171 (1955).
Article Google Scholar
Cooley, D. Extreme value analysis and the study of climate change: a commentary on Wigley 1988. Clim. Change 97, 77–83 (2009).
Article Google Scholar
He, X., Pan, M., Wei, Z., Wood, E. F. & Sheffield, J. A global drought and flood catalogue from 1950 to 2016. Bull. Am. Meteorol. Soc. 101, E508–E535 (2020).
Article Google Scholar
Mascaro, G. Comparison of local, regional, and scaling models for rainfall intensity–duration–frequency analysis. J. Appl. Meteorol. Climatol. 59, 1519–1536 (2020).
Article Google Scholar
Cunnane, C. Unbiased plotting positions—A review. J. Hydrol. 37, 205–222 (1978).
Article Google Scholar

Download references

Acknowledgements

Mengzhu Chen and Simone Fatichi acknowledge the financial support from PUB, Singapore National Water Agency, through the Competitive Funding for Water Research (CWR) for the project “Climate and Land-Use Changes: Effects on Nutrient Inputs to Singapore Reservoirs (CLUE)" (Award Number: CWR-2102-0009). Giuseppe Mascaro thanks the support from the National Science Foundation (NSF) Award 2221803: “Collaborative Research: CAS-Climate: Improving Nonstationary Intensity-Duration-Frequency Analysis of Extreme Precipitation by Advancing Knowledge on the Generating Mechanisms.”

Author information

Authors and Affiliations

Department of Civil and Environmental Engineering, National University of Singapore, Singapore, Singapore
Mengzhu Chen, Shanti Shwarup Mahto, Xiaogang He & Simone Fatichi
Department of Geoinformatics, Central University of Jharkhand, Ranchi, India
Shanti Shwarup Mahto
School of Civil, Environmental and Architectural Engineering, College of Engineering, Korea University, Seoul, Republic of Korea
Changhyun Jun
Department of Civil and Environmental Engineering, University of Cyprus, Nicosia, Republic of Cyprus
Athanasios Paschalis
Department of Civil and Environmental Engineering, Imperial College London, London, UK
Athanasios Paschalis
Institute of Earth Surface Dynamics, University of Lausanne, Lausanne, Switzerland
Nadav Peleg
Expertise Center for Climate Extremes, University of Lausanne, Lausanne, Switzerland
Nadav Peleg
School of Sustainable Engineering and the Built Environment, Arizona State University, Tempe, AZ, USA
Giuseppe Mascaro

Authors

Mengzhu Chen
View author publications
Search author on:PubMed Google Scholar
Shanti Shwarup Mahto
View author publications
Search author on:PubMed Google Scholar
Xiaogang He
View author publications
Search author on:PubMed Google Scholar
Changhyun Jun
View author publications
Search author on:PubMed Google Scholar
Athanasios Paschalis
View author publications
Search author on:PubMed Google Scholar
Nadav Peleg
View author publications
Search author on:PubMed Google Scholar
Giuseppe Mascaro
View author publications
Search author on:PubMed Google Scholar
Simone Fatichi
View author publications
Search author on:PubMed Google Scholar

Contributions

M.C., X.H., and S.F. developed the research idea and concept. M.C. conducted the data analysis, wrote the manuscript with support from S.F., and created the figures. S.S.M. contributed to data preprocessing and quality control. C.J., A.P., N.P., and G.M. reviewed and edited the manuscript. S.F. supervised the research, designed the methodology, and contributed to the interpretation of results. All authors discussed the results and reviewed the final version of the manuscript.

Corresponding author

Correspondence to Mengzhu Chen.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Updated_Supplementary Infro_ (download PDF )

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

Chen, M., Mahto, S.S., He, X. et al. Record-breaking rainfall: a stochastic approach for its prediction. npj Nat. Hazards 2, 98 (2025). https://doi.org/10.1038/s44304-025-00148-6

Download citation

Received: 11 June 2025
Accepted: 30 September 2025
Published: 29 October 2025
Version of record: 29 October 2025
DOI: https://doi.org/10.1038/s44304-025-00148-6