Accelerating flood warnings by 10 hours: the power of river network topology in AI-enhanced flood forecasting

Wang, Hongjun; Chen, Jiyuan; Zheng, Yinqiang; Song, Xuan

doi:10.1038/s44304-025-00083-6


Article
Open access
Published: 09 June 2025


npj Natural Hazards volume 2, Article number: 45 (2025)


Abstract

The increasing frequency and intensity of floods, exacerbated by climate change, necessitate the development of accurate and timely flood forecasting models. Although AI-based approaches have demonstrated promise, the effectiveness of graph neural networks (GNNs) in modeling the intricate dynamics of river networks remains contested. Despite the natural alignment between river topology and graph-based structures, recent studies reveal that GNNs often fail to fully utilize this structural information. This research aims to identify the underlying factors contributing to this limitation. Our analysis reveals that the tree-like configuration of river networks leads to over-squashing in GNNs, a problem caused by high resistance distances between nodes. To address this, we propose a novel method that transforms the topological graph into a dense, reachability-based graph, which reduces resistance distances. Empirical results demonstrate that GNNs applied to the transformed graph outperform EA-LSTM, particularly in predicting rare and extreme flood events. Furthermore, the incorporation of graph information significantly enhances long-term forecasting capabilities: on average, GNN predictions of water levels at a 24-h horizon using the dense graph match the accuracy of EA-LSTM’s 14-h forecasts. These findings highlight the potential of graph-based methodologies to improve flood prediction systems and contribute to more effective early warning mechanisms.


Introduction

The global hydrological cycle is undergoing significant anthropogenic alterations, with far-reaching implications for flood risk and water resource management^1,2,3,4. Refs. ^5,6 highlighted the increasing risk of extreme precipitation events due to climate change, while refs. ^7,8 projected a substantial increase in global flood risk under future climate scenarios.

The increasing frequency and complexity of flood events have spurred significant developments in flood forecasting methodologies. Traditional approaches integrate meteorological inputs with hydrological processes to simulate rainfall-runoff dynamics and channel flows⁹. However, uncertainties in precipitation forecasts often limit the accuracy of these models^10,11. Recent advances in machine learning have introduced new possibilities, with LSTM networks demonstrating particular promise in capturing both linear and non-linear memory effects in hydrological time series modeling¹². Convolutional Neural Networks (CNNs), though originally developed for image processing, have also proven effective in predicting hydrological time series^13,14.

Feature selection plays a crucial role in improving model performance, with recent work¹⁵ introducing innovative approaches using hypergraph Laplacian-based semi-supervised discriminant analysis. This method effectively captures higher-order relationships while maintaining computational efficiency, which is particularly valuable for complex hydrological systems. Additionally, advances in flow optimization, such as the work by Abdollahi et al.¹⁶ combining particle swarm optimization with knapsack algorithms, have important implications for modeling water flow dynamics in river networks.

Predicting streamflow in ungauged basins continues to pose significant challenges, particularly for infrastructure projects related to flood management^17,18,19, hydropower generation²⁰, and agricultural water resources²¹. In response to this pressing need, the International Association of Hydrological Sciences (IAHS) launched the “Prediction in ungauged basins” (PUBs) initiative, which ran from 2003 to 2013²², aiming to advance methodologies for streamflow forecasting in ungauged catchments. Researchers have developed and tested a range of statistical and physical models^23,24,25,26,27, although substantial room for improvement remains.

Recent advancements in streamflow simulation and regionalization have been notably driven by LSTM-based models²⁸, which excel at capturing temporal dependencies. Among these, the Entity-Aware LSTM (EA-LSTM) has shown exceptional performance by integrating both meteorological and catchment data^29,30. Despite these strides, limitations persist, particularly in accounting for river network topology, as highlighted in recent studies³¹. This shortcoming is partly attributed to the constraints of benchmark datasets such as CAMELS-x³², which do not inherently represent the hydrological connectivity between upstream and downstream locations.

While graph neural networks (GNNs) offer promising solutions for modeling network structures, they often encounter challenges with over-squashing when dealing with large-scale or complex graphs. This phenomenon, where information from distant nodes becomes excessively compressed during message passing, has been extensively studied. Recent work has proposed various solutions, including graph rewiring methods based on curvature³³, spectral expansion theory³⁴, and attention mechanisms^35,36. These advances in addressing over-squashing are particularly relevant for hydrological applications, where preserving information flow across river networks is crucial.

The introduction of the LamaH-CE dataset³⁷, which incorporates topological data, offers new research opportunities to address this gap. However, early studies³⁸ have revealed that even with the application of graph neural networks to river network graphs, the influence of topology on predictive performance remains surprisingly insignificant. This counterintuitive finding, which suggests that river network graphs may offer no performance advantage over simple multilayer perceptrons (MLPs), raises important questions about the utility of topological information in flood forecasting models. In ref. ³⁸, GNN architectures, including GCN³⁹, GAT⁴⁰, and GCNII⁴¹, were employed for flood forecasting using the LamaH-CE dataset. By experimenting with different adjacency matrices, the study assessed the impact of varying topological definitions on model performance. Results showed that even when edges were entirely removed, the GNN performed comparably to an MLP, with no tangible benefit from the inclusion of weighted edges that represent physical hydrological relationships. Given the inherent spatial correlations between upstream and downstream stations due to the fluid nature of water, this finding seems counterintuitive. The study in ref. ³⁸, however, does not fully explain the underlying causes of this topological ineffectiveness, prompting further exploration in this paper.

In this work, we investigate the underperformance of river network topology in GNNs, specifically from the perspective of over-squashing⁴², a phenomenon where distant nodes’ signals become too compressed as they traverse the graph. We propose a simple yet physically motivated solution to mitigate over-squashing, and our empirical results reveal that the role of river network topology has been underestimated, particularly for long-term forecasting and the prediction of rare, large spike floods. While previous approaches have attempted to incorporate river network topology into flood forecasting models, they have been limited by the inherent constraints of tree-like structures in information propagation. Our novel dense graph transformation approach fundamentally reimagines how we represent river networks in GNNs, enabling more effective capture of both local and long-range dependencies in water flow dynamics. These extreme flood events, often driven by the rapid convergence of nearby rivers, represent a case where GNNs’ potential can be fully realized. We hope our findings will encourage further exploration of graph-based approaches in the development of more effective flood warning systems.

Results

Problem formulation

Let ${\mathcal{G}}=(V,E,A)$ represent a stream network, where V and E denote the sets of nodes (gauges) and edges (flow directions), respectively. The matrix A represents the adjacency matrix of the stream network ${\mathcal{G}}$. We define the gauge signal matrix ${X}_{(t)}\in {{\mathbb{R}}}^{N\times C}$ for ${\mathcal{G}}$, where C represents the dimensionality of the features, N = ∣V∣ is the number of vertices, and X_(t) denotes the observations of the spatial network ${\mathcal{G}}$ at time step t. The flood forecasting task aims to learn a multi-step prediction function f based on past observations:

$$f(({X}_{(t-\alpha )},{X}_{(t-\alpha +1)},\ldots ,{X}_{(t-1)}),{\mathcal{G}})\to ({X}_{(t)},{X}_{(t+1)},\ldots ,{X}_{(t+\beta )})$$

where α represents the input length of past time step observations, and β denotes the number of future steps to be predicted.
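The formulation above can be made concrete with a small windowing sketch. It assumes the gauge signals are stored as a NumPy array of shape (T, N, C); the function name `make_windows` and the toy data are hypothetical and are not taken from the authors' code.

```python
import numpy as np


def make_windows(series, alpha, beta):
    """Slice a (T, N, C) gauge signal array into (input, target) pairs.

    Inputs cover steps t-alpha .. t-1 and targets cover steps t .. t+beta,
    following the prediction function f above, so each target holds
    beta + 1 time steps.
    """
    total_steps = series.shape[0]
    inputs, targets = [], []
    for t in range(alpha, total_steps - beta):
        inputs.append(series[t - alpha:t])
        targets.append(series[t:t + beta + 1])
    return np.stack(inputs), np.stack(targets)


# Hypothetical toy data: 10 time steps, 3 gauges, 1 feature.
toy = np.arange(10 * 3 * 1, dtype=float).reshape(10, 3, 1)
x, y = make_windows(toy, alpha=4, beta=2)
print(x.shape, y.shape)
```

Each sample pairs α past observations of all N gauges with the following β + 1 steps; the graph ${\mathcal{G}}$ itself is static and would be passed to the model separately.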

Analysis of the failure of river topology in GNNs

The topological structure of river networks is a core subject of study in hydromorphology. This structure can be precisely described as a directed acyclic graph, specifically a spanning tree^43,44. As illustrated in Fig. 1, river networks exhibit a distinctive tree-like structure, where each node (except the outlet) has exactly one downstream connection, while potentially having multiple upstream connections⁴⁵. In Fig. 2a, we elaborate on the topological structure of the LamaH-CE network.

**Fig. 1: Visual exploration and quantitative analysis of river network topology.**

**Fig. 2: Performance of mainstream GNNs, evaluated by Nash–Sutcliffe efficiency across different types of adjacency matrices (dense, topological, and isolated) over varying prediction horizons.**

The hierarchical structure of river networks bears a significant resemblance to the over-squashing issue^33,46 observed in GNNs. Over-squashing is a key theoretical and practical challenge in GNNs, referring to the phenomenon where information from distant nodes becomes excessively compressed and distorted during message passing. This issue stems from the bottlenecks in the graph’s topology, which hinder the flow of information between distant nodes. Theoretically, over-squashing can be quantified using the resistance distance⁴², a metric in graph theory that quantifies the relationship between nodes by modeling the graph as an electrical network, calculating the equivalent resistance between nodes to reflect the overall structure and connectivity. Figure 2b compares the distribution characteristics of resistance distances between dense and topological graphs. The blue line represents the dense graph, showing a higher frequency at smaller resistance distances, while the green line represents the topological graph, exhibiting a prominent peak around a resistance distance of 1. The inset on the right uses a simplified network structure to illustrate how the graph topology influences the distribution of resistance distances. We highlight the concept of a “bottleneck,” where central nodes in the topological graph create bottlenecks, whereas the dense graph displays a more uniform connectivity, leading to generally lower resistance distances.

Mathematically, the effective resistance between two nodes u and v can be expressed as:

$${R}_{u,v}={({1}_{u}-{1}_{v})}^{{\rm {T}}}{L}^{+}({1}_{u}-{1}_{v})\tag{1}$$

where L⁺ is the pseudoinverse of the graph Laplacian matrix, and 1_u, 1_v are indicator vectors for nodes u and v, respectively. In river networks, this resistance distance becomes particularly significant at confluence points, where information from multiple upstream tributaries must be compressed into fixed-dimensional node vectors. The tree-like structure inherently results in high resistance distances between nodes in different branches, directly impacting the ability of GNNs to propagate information effectively.
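As a minimal numerical sketch of Eq. (1), the snippet below computes all-pairs effective resistance from the Laplacian pseudoinverse. It assumes an undirected, unit-weight graph, which simplifies the directed river network; the 4-gauge chain and the fully connected graph are hypothetical toy examples, not LamaH-CE data.

```python
import numpy as np


def effective_resistance(adj):
    """All-pairs effective resistance of an undirected weighted graph."""
    adj = np.asarray(adj, dtype=float)
    laplacian = np.diag(adj.sum(axis=1)) - adj
    l_pinv = np.linalg.pinv(laplacian)
    diag = np.diag(l_pinv)
    # Expanding Eq. (1): R[u, v] = L+[u, u] + L+[v, v] - 2 * L+[u, v].
    return diag[:, None] + diag[None, :] - 2.0 * l_pinv


# Hypothetical chain of 4 gauges: 0 - 1 - 2 - 3 (tree-like, one path only).
chain = np.zeros((4, 4))
for i in range(3):
    chain[i, i + 1] = chain[i + 1, i] = 1.0

# Hypothetical fully connected graph on the same 4 gauges.
dense = np.ones((4, 4)) - np.eye(4)

r_chain = effective_resistance(chain)
r_dense = effective_resistance(dense)
print(r_chain[0, 3], r_dense[0, 3])
```

On the chain, the resistance between the two end gauges equals the path length (3), whereas in the fully connected graph it drops to 0.5, which illustrates why adding paths between nodes lowers resistance distances.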

The impact on GNN performance can be quantified through the bound on the Jacobian matrix of node features:

$$\frac{\partial {h}_{u}^{(r)}}{\partial {x}_{v}}\le {(2\alpha \beta )}^{r}\frac{{d}_{{{max}}}}{2}\left(\frac{2}{{d}_{{{min}}}}\right)\left(\frac{r+1+{\mu }^{r+1}}{1-\mu }-{R}_{u,v}\right)\tag{2}$$

where r is the number of layers, α and β are model parameters, and μ is related to the eigenvalues of the normalized adjacency matrix. This bound demonstrates how higher resistance distances (R_u,v) directly limit the influence of upstream features on downstream predictions.

From a physical perspective, this limitation is particularly problematic for flood forecasting, where accurate predictions require preserving detailed information about upstream conditions and their complex interactions. During flood events, the rapid convergence of water from multiple tributaries creates complex dynamics that need to be captured in the model. However, the tree-like structure’s inherent high resistance distances make it difficult for GNNs to maintain and propagate this crucial information effectively.

To address these limitations, we propose transforming the original tree-like topology into a dense graph structure, as illustrated in Fig. 2d, e. This transformation reduces effective resistance between nodes while preserving the physical meaning of river network relationships. The dense graph enables more direct information flow between hydrologically connected locations, allowing the model to better capture both local and long-range dependencies in water flow dynamics.

Mitigating over-squashing with dense graph

Figure 2 presents a comparative analysis of six prominent graph neural network models (ChebNet⁴⁷, GAT⁴⁰, GraphSAGE⁴⁸, GCNII⁴¹, GCN³⁹, and GIN⁴⁹), evaluating their Nash–Sutcliffe efficiency (NSE) across prediction horizons of 1–24 h and three adjacency matrix types (dense, topological, and isolated), with EA-LSTM²⁹ as the baseline. The results, displayed in six subplots (a–f), show a consistent decrease in NSE as the prediction horizon extends, with dense adjacency matrices consistently outperforming their topological and isolated counterparts as well as EA-LSTM. The performance disparity in Fig. 2 becomes even more pronounced in long-term forecasting, underscoring the significant advantage of graph-based information for extended flood prediction horizons. We also compare the lead time of GNN predictions with that of EA-LSTM predictions. Specifically, we evaluate the 24-h NSE performance of the GNNs and compare it to the prediction errors of EA-LSTM at different time intervals. The results show that the GAT model demonstrates the strongest long-term forecasting capability, maintaining superior performance within a 13.3-h lead time. On average, the GNN models exhibit a 10-h lead-time advantage compared to EA-LSTM. The ability of GNN models to leverage graph structures enables more accurate and reliable long-term forecasts, highlighting their superiority in capturing complex spatiotemporal dependencies compared to traditional methods.
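One plausible way to turn such a comparison into a lead-time figure is sketched below: find the horizon at which the baseline's NSE drops to the GNN's 24-h NSE, then subtract it from 24 h. The linear interpolation, the function name `matching_horizon`, and all numbers are illustrative assumptions; they are made-up values, not results from this study, and the authors' exact procedure may differ.

```python
def matching_horizon(horizons, baseline_nse, target_nse):
    """Horizon (in hours) at which the baseline NSE falls to target_nse.

    Uses linear interpolation between adjacent horizons and assumes the
    baseline NSE decreases with horizon. Returns None if never reached.
    """
    for i in range(len(horizons) - 1):
        n0, n1 = baseline_nse[i], baseline_nse[i + 1]
        if n0 >= target_nse > n1:
            frac = (n0 - target_nse) / (n0 - n1)
            return horizons[i] + frac * (horizons[i + 1] - horizons[i])
    return None


# Made-up illustrative values (NOT results from the paper).
horizons = [1, 6, 12, 18, 24]
baseline_nse = [0.95, 0.90, 0.80, 0.70, 0.60]  # hypothetical baseline curve
gnn_nse_24h = 0.75                             # hypothetical GNN score at 24 h

equivalent = matching_horizon(horizons, baseline_nse, gnn_nse_24h)
lead_time_advantage = 24 - equivalent
print(equivalent, lead_time_advantage)
```

With these toy values the baseline reaches the GNN's 24-h accuracy at roughly 15 h, giving an advantage of about 9 h; the same calculation with measured NSE curves would yield the kind of lead-time figure reported above.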

When is it best to use river topology?

Here we examine the factors influencing the efficacy of river topology in flood forecasting. Figure 3a and b present a comparative analysis of six graph neural network models (ChebNet, GAT, GraphSAGE, GCNII, GCN, and GIN) with isolated adjacency and EA-LSTM in hydrological modeling. The performance metric utilized is the Nash-Sutcliffe Efficiency coefficient, plotted against observed discharge. Each model’s performance is evaluated under two adjacency conditions: dense and isolated, represented by blue and black lines respectively, with shaded areas indicating uncertainty bounds. A key finding that emerges from Fig. 3a and b is the consistent superiority of dense adjacency configurations, particularly at higher observed discharge levels. This advantage becomes more pronounced as the observed discharge approaches and exceeds 4000 units, where all models exhibit a general trend of performance decline. Notably, the dense adjacency (blue lines) maintains higher NSE values compared to isolated adjacency (black lines) in this critical high-discharge regime. This phenomenon is especially evident in models such as GCNII and GCN, which demonstrate enhanced robustness to high discharge conditions when utilizing dense adjacency structures. We also identified another interesting phenomenon: models utilizing attention mechanisms tend to perform better in highly extreme scenarios. This suggests a potential trade-off in flood forecasting, where river topology has a greater impact in the most extreme cases, while in more typical scenarios, employing attention to capture the relationships between different observation stations may be a more effective approach. In conclusion, Fig. 3a and b suggest that graph topology may better capture the complex interrelationships within hydrological systems during extreme events or high-flow scenarios, which holds significant implications for improving the accuracy and reliability of hydrological predictions.
The consistent performance advantage of dense adjacency across all evaluated models underscores its potential as a crucial consideration in the design and application of graph neural networks for hydrological modeling, particularly when dealing with extreme or high-magnitude discharge events. Our findings regarding river topology in flood forecasting present some interesting contrasts with Kirschstein and Sun³⁸. While their work suggested limitations, our results indicate that river topology information can be effectively utilized for flood forecasting, especially for long-term predictions and large-scale, sudden flood events. Interestingly, our analysis of the model’s behavior when jointly learning edge weights revealed a more complex picture than previously understood: these weights showed no clear correlation with either constant weights or the physical weightings from the dataset. Furthermore, our experiments with dense adjacency matrices yielded an unexpected finding: the GAT maintained strong performance and even exceeded other GNNs for mid-term predictions, suggesting that the relationship between network topology and forecasting accuracy may be more nuanced than initially theorized.

**Fig. 3: Performance comparison of graph neural networks on LamaH-CE dataset under different connectivity scenarios.**

Case study on rare large spiked flood

Figure 4 compares the performance of GCN with dense adjacency and isolated adjacency in flood forecasting, focusing on two gauge stations (Gauge #122 and Gauge #303) across 3- and 12-h forecast horizons. The results reveal that GCN with dense adjacency and isolated adjacency perform comparably in short-term predictions, effectively tracking actual values, especially near flood peaks. However, in long-term predictions, the GCN model using a dense graph structure outperforms the isolated graph model, particularly in capturing the magnitude and timing of flood peaks, as seen in Gauge #122. This suggests that the dense graph structure leverages spatial correlations more effectively, especially over extended time horizons, by integrating hydrological relationships between upstream and downstream stations. The growing divergence in model performance over longer prediction windows underlines the importance of graph structure in hydrological forecasting tasks, offering insights for optimizing GCN architectures in predicting extreme hydrological events such as large-scale floods. Future research should focus on enhancing GCN graph designs to improve forecasting accuracy across different temporal scales.

**Fig. 4: Comparison of GCN performance with dense versus isolated adjacency matrices for flood forecasting at selected gauge stations.**

Discussion

Our study challenges previous assumptions concerning the role of river network topology in flood forecasting³⁸ and offers novel insights into the application of hydrological modeling. The key findings of our research have significant implications for enhancing flood prediction accuracy, particularly for long-term forecasts and extreme events⁵⁰. This discrepancy with previous findings can be attributed to the over-squashing phenomenon^33,34,51 inherent in GNNs when applied to the tree-like structure of river networks. By addressing this issue through the use of dense graph structures, we were able to unlock the potential of topological information in hydrological modeling, aligning with recent advancements in graph representation learning⁵².

The superior performance of dense adjacency configurations, especially at higher observed discharge levels, underscores the importance of comprehensive spatial relationships in capturing the complex dynamics of river systems during extreme events. This finding is particularly crucial given the increasing frequency and severity of floods due to climate change⁵³. The ability to more accurately predict large-scale, sudden flood events could significantly enhance early warning systems and disaster preparedness strategies³⁰. Our observation that models utilizing attention mechanisms perform better in highly extreme scenarios suggests a potential trade-off in flood forecasting approaches. While river topology appears to have a greater impact in the most extreme cases, attention-based methods may be more effective in capturing relationships between different observation stations under typical conditions⁵⁴. This insight opens up new avenues for developing hybrid models that can adapt to varying hydrological conditions⁵⁵.

The consistent performance advantage of dense adjacency across all evaluated GNN architectures highlights the robustness of this approach. It suggests that the benefits of comprehensive graph-based information transcend specific model architectures, pointing to a fundamental improvement in how we represent and process hydrological data⁵⁶. However, our study is not without limitations. The focus on the LamaH-CE dataset, while comprehensive, may limit the generalizability of our findings to other geographical regions with different river network characteristics⁵⁷. Additionally, while we have demonstrated the advantages of dense graph structures, the optimal method for constructing these graphs in various hydrological contexts remains an open question⁵⁸.

Future research should explore the application of our findings to diverse river systems and climatic conditions to validate their broader applicability. There is also potential for developing more sophisticated graph construction methods that balance the benefits of dense connectivity with the physical realities of river networks. Furthermore, investigating the integration of our approach with traditional physical models could lead to hybrid systems that combine data-driven insights with established hydrological principles.

In conclusion, our work not only advances the field of AI-based flood forecasting but also bridges a gap between graph theory and hydrology. By demonstrating the significant role of properly structured topological information in predicting extreme hydrological events, we open new pathways for enhancing the resilience of communities in the face of increasing flood risks. As climate change continues to alter hydrological patterns globally, the development of more accurate and adaptable flood forecasting models becomes ever more critical. Our findings provide a foundation for such advancements, potentially contributing to more effective flood management strategies and reduced societal impacts of these natural disasters⁵⁹.

Methods

Algorithm 1: Dense graph transformation for river networks

Require: Adjacency matrix $A\in {{\mathbb{R}}}^{N\times N}$, distance matrix $D\in {{\mathbb{R}}}^{N\times N}$, RBF kernel parameter σ > 0

Ensure: Dense adjacency matrix ${\mathcal{D}}\in {{\mathbb{R}}}^{N\times N}$

Initialize ${\mathcal{D}}\leftarrow {0}_{N\times N}$ {Create an empty dense adjacency matrix}
for i = 1 to N do
  for j = 1 to N do
    if i ≠ j then
      Compute the topological distance d_i,j using D
      Calculate reachability score using the RBF kernel: ${{\mathcal{D}}}_{i,j}\leftarrow \exp \left(-\frac{{d}_{i,j}^{2}}{2{\sigma }^{2}}\right)$
    else
      ${{\mathcal{D}}}_{i,j}\leftarrow 0$ {No self-loops in the dense graph}
    end if
  end for
end for
Normalize ${\mathcal{D}}$ to ensure row-wise sum equals 1: ${{\mathcal{D}}}_{i,j}\leftarrow \frac{{{\mathcal{D}}}_{i,j}}{\mathop{\sum }\nolimits_{k = 1}^{N}{{\mathcal{D}}}_{i,k}}$
return ${\mathcal{D}}$
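A vectorized NumPy sketch of Algorithm 1 is given below for illustration. It is not the authors' released implementation: the function name `dense_graph`, the convention of marking unreachable pairs with infinite distance, and the default choice of σ as the standard deviation of finite off-diagonal distances are assumptions made here.

```python
import numpy as np


def dense_graph(dist, sigma=None):
    """Row-normalized RBF reachability matrix from pairwise stream distances.

    dist[i, j] is the topological (stream) distance between gauges i and j.
    Assumption: unreachable pairs are marked with np.inf and get weight 0.
    """
    dist = np.asarray(dist, dtype=float)
    n = dist.shape[0]
    off_diag = ~np.eye(n, dtype=bool)
    if sigma is None:
        # Assumption: sigma is the std of the finite off-diagonal distances.
        sigma = dist[off_diag & np.isfinite(dist)].std()
    dense = np.exp(-dist ** 2 / (2.0 * sigma ** 2))
    np.fill_diagonal(dense, 0.0)  # no self-loops in the dense graph
    row_sums = dense.sum(axis=1, keepdims=True)
    # Guard against isolated gauges whose row sums to zero.
    return dense / np.where(row_sums > 0, row_sums, 1.0)


# Hypothetical stream distances (km) for three gauges along one channel.
toy_dist = np.array([
    [0.0, 1.0, 3.0],
    [1.0, 0.0, 2.0],
    [3.0, 2.0, 0.0],
])
weights = dense_graph(toy_dist, sigma=2.0)
print(weights)
```

Nearer gauges receive larger weights than distant ones, and each row sums to 1, so the matrix can be used directly as a normalized adjacency in message passing.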

Input data

LArge-SaMple DAta for Hydrology and Environmental Sciences for Central Europe (LamaH-CE)³⁷ is a comprehensive large-sample dataset specifically designed for hydrology and environmental sciences research in Central Europe. This dataset encompasses an area of ~170,000 square kilometers, including the entirety of Austria and upstream regions of its neighboring countries, covering a total of 859 catchments. LamaH-CE provides high temporal resolution hydrometeorological time series data, available at daily and hourly scales, which include streamflow and 15 meteorological variables. In addition, the dataset contains over 60 attributes describing catchment characteristics, encompassing aspects such as topography, climate, hydrology, land use, vegetation, soil, and geology.

Data preprocessing

Following the data preprocessing of Kirschstein and Sun³⁸, the process consists of the following steps:

1. Discharge data screening: Stations reporting negative discharge values are flagged, and any station exhibiting such values is excluded from further analysis to eliminate erroneous measurements.
2. Temporal completeness assessment: Stations are retained only if they provide continuous hourly discharge data for the entire study period (2000–2017). This ensures temporal consistency and completeness across the dataset.
3. Network connectivity preservation: A recursive algorithm identifies all upstream connections for each gauging station, ensuring a comprehensive representation of the basin’s hierarchical structure and preserving the accuracy of the river network topology. Stations failing the quality control criteria are removed, but a “bypass” mechanism is introduced to maintain the overall network connectivity:
   (a) Edge reallocation: The incoming and outgoing edges of the removed station are identified.
   (b) New edge creation: Direct connections between upstream and downstream stations are established to maintain the continuity of flow in the network.
   (c) Attribute aggregation: Physical attributes, such as channel distance and elevation differences, are aggregated to preserve the key physical characteristics of the river network following the station removal.

Through the application of inverse depth-first search and filtering algorithms, a connected subgraph consisting of 358 stations is extracted. The dataset is then normalized using Z-score standardization to ensure consistency in input features for subsequent modeling.
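The bypass mechanism in step 3 can be sketched as follows. This is an illustrative reading, not the preprocessing code of ref. ³⁸: the edge dictionary layout, the function name `remove_station`, and the choice to aggregate channel distance by summation are assumptions.

```python
def remove_station(edges, station):
    """Remove a station and reconnect its neighbors with bypass edges.

    edges maps (upstream, downstream) -> channel distance.
    Assumption: the distance of a bypass edge is the sum of the two
    edges it replaces.
    """
    # (a) Edge reallocation: find edges entering and leaving the station.
    incoming = {u: d for (u, v), d in edges.items() if v == station}
    outgoing = {v: d for (u, v), d in edges.items() if u == station}
    kept = {e: d for e, d in edges.items() if station not in e}
    # (b) New edge creation and (c) attribute aggregation.
    for up, d_in in incoming.items():
        for down, d_out in outgoing.items():
            kept[(up, down)] = d_in + d_out
    return kept


# Hypothetical network: A and C drain into B, which drains into D.
toy_edges = {("A", "B"): 2.0, ("C", "B"): 3.0, ("B", "D"): 4.0}
bypassed = remove_station(toy_edges, "B")
print(bypassed)
```

After removing B, gauges A and C connect directly to D with the accumulated channel distance, so the upstream-downstream ordering of the network is preserved.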

Models

In previous discussions, traditional approaches to constructing river networks have largely relied on topology-based graphs. These graphs are typically constructed according to predefined river topologies, with adjacency matrices derived from stream lengths between nodes i and j. However, the hierarchical and dendritic structure of rivers often results in excessively large resistance distances between two points, making accurate modeling difficult.

To quantify this challenge, we here introduce the concept of effective resistance⁶⁰. Given a graph’s adjacency matrix A and degree matrix D, we have the random walk Laplacian as: ${L}_{rw}=I-{D}_{out}^{-1}A,$ where D_out is the out-degree matrix. The effective resistance between two nodes, u and v, is given by

$${R}_{u\leftarrow v}={\left(\frac{1}{\sqrt{{d}_{u}^{{{out}}}}}{1}_{u}-\frac{1}{\sqrt{{d}_{v}^{{{out}}}}}{1}_{v}\right)}^{{\rm {T}}}{L}_{rw}^{+}\left(\frac{1}{\sqrt{{d}_{u}^{{{out}}}}}{1}_{u}-\frac{1}{\sqrt{{d}_{v}^{{{out}}}}}{1}_{v}\right),\tag{3}$$

where ${d}_{u}^{out}$ and ${d}_{v}^{out}$ are the out-degrees of nodes u and v, respectively, and ${L}_{rw}^{+}$ denotes the Moore–Penrose pseudoinverse of the random walk Laplacian. Here, 1_u and 1_v are indicator vectors with a value of 1 at the uth and vth positions, respectively, and zeros elsewhere.

From Eq. (3), it is clear that the out-degrees of the two nodes play a crucial role in determining the effective resistance R_u←v. However, in river networks with hierarchical structures, if a path from v to u exists, the only way to further reduce the effective resistance is to add additional paths between v and u. Therefore, we adopt a dense graph modeling approach that defines the accessibility matrix based on connectivity, as shown in Fig. 1. By establishing reachability-based distances, this approach significantly increases the number of paths between nodes and thereby lowers the effective resistance.

In this study, we employ a distance-based approach for graph construction, termed the Dense graph ${\mathcal{D}}$. This method computes the topological distance between two nodes and applies a Gaussian radial basis function (RBF) to quantify the spatial relationships between them. Specifically, the kernel function is defined as: ${{\mathcal{D}}}_{i,j}=\exp \left(-\frac{\| {d}_{i,j}{\| }^{2}}{2{\sigma }^{2}}\right)\in [0,1],$ where d_i,j represents the stream lengths between nodes i and j, and σ is the standard deviation of distances. This kernel function amplifies the signals from relatively closer neighbors while attenuating signals from distant nodes, effectively reducing noise introduced by high-degree nodes and ensuring a more stable graph representation for predictive modeling. The detailed procedure is shown in Algorithm 1.

Experimental setup

The study uses chronologically distinct train-test splits for cross-validation: consecutive years within 2010–2015 serve as training sets, and 2016–2017 is consistently used as the test set. The prediction task is formulated as forecasting discharge 24 h ahead given 24 h of historical data. Six GNN architectures are compared: ChebNet⁴⁷, GAT⁴⁰, GraphSAGE⁴⁸, GCNII⁴¹, GCN³⁹, and GIN⁴⁹, each employing 3 layers and a 32-dimensional latent space. The experiment explores four different adjacency matrix definitions (isolated, topology, dense, and learned). The loss function is the mean absolute error (MAE). For optimization, the Adam algorithm is employed with an initial learning rate of 2 × 10⁻³ and weight decay of 10⁻⁴, balancing efficient convergence with regularization. To adapt the learning process over time, a MultiStepLR scheduler is implemented, reducing the learning rate by half at epochs 1, 50, and 80, facilitating both initial rapid learning and fine-tuning in later stages. To mitigate the risk of exploding gradients, gradient clipping is applied with a maximum norm of 5.0. The evaluation metric is a weighted Nash–Sutcliffe efficiency³⁸, weighted by the relevancy score. This experimental setup enables a systematic evaluation of the impact of graph structural information on river discharge prediction, with a focus on hydrologically significant events.
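For reference, the standard (unweighted) Nash–Sutcliffe efficiency can be computed as in the sketch below. The relevancy-score weighting of ref. ³⁸ is not reproduced here because its exact form is not given in this section; the function name `nse` and the sample values are hypothetical.

```python
import numpy as np


def nse(observed, predicted):
    """Standard Nash-Sutcliffe efficiency (unweighted).

    Returns 1 for a perfect prediction and 0 for a model that always
    predicts the mean of the observations.
    """
    observed = np.asarray(observed, dtype=float)
    predicted = np.asarray(predicted, dtype=float)
    residual = np.sum((observed - predicted) ** 2)
    variance = np.sum((observed - observed.mean()) ** 2)
    return 1.0 - residual / variance


# Hypothetical discharge values for illustration only.
obs = [1.0, 2.0, 3.0]
perfect = nse(obs, [1.0, 2.0, 3.0])
mean_only = nse(obs, [2.0, 2.0, 2.0])
print(perfect, mean_only)
```

Values below 0 indicate that the model performs worse than predicting the observed mean, which is why NSE is a common yardstick for comparing forecasts across horizons.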

Data availability

No datasets were generated or analyzed during the current study.

Code availability

The source code for reproducing the findings in this paper is available at https://github.com/Dreamzz5/FloodGNNs.

References

Boulange, J., Hanasaki, N., Yamazaki, D. & Pokhrel, Y. Role of dams in reducing global flood exposure under climate change. Nat. Commun. 12, 417 (2021).
Nearing, G. et al. Global prediction of extreme floods in ungauged watersheds. Nature 627, 559–563 (2024).
Wing, O. E. et al. Inequitable patterns of US flood risk in the Anthropocene. Nat. Clim. Change 12, 156–162 (2022).
Shu, E. G. et al. Integrating climate change induced flood risk into future population projections. Nat. Commun. 14, 7870 (2023).
Milly, P. C. D., Wetherald, R. T., Dunne, K. A. & Delworth, T. L. Increasing risk of great floods in a changing climate. Nature 415, 514–517 (2002).
Prein, A. F. et al. The future intensification of hourly precipitation extremes. Nat. Clim. Change 7, 48 (2016).
Hirabayashi, Y. et al. Global flood risk under climate change. Nat. Clim. Change 3, 816 (2013).
Winsemius, H. C. et al. Global drivers of future river flood risk. Nat. Clim. Change 6, 381 (2015).
Bartholmes, J. & Todini, E. Coupling meteorological and hydrological models for flood forecasting. Hydrol. Earth Syst. Sci. 9, 333–346 (2005).
Speight, L. J., Cranston, M. D., White, C. J. & Kelly, L. Operational and emerging capabilities for surface water flood forecasting. Wiley Interdiscip. Rev.: Water 8, e1517 (2021).
Krzysztofowicz, R. The case for probabilistic forecasting in hydrology. J. Hydrol. 249, 2–9 (2001).
Le, X.-H., Ho, H. V., Lee, G. & Jung, S. Application of long short-term memory (LSTM) neural network for flood forecasting. Water 11, 1387 (2019).
Wang, H.-z et al. Deep learning based ensemble approach for probabilistic wind power forecasting. Appl. Energy 188, 56–70 (2017).
Shi, X. et al. Convolutional LSTM network: a machine learning approach for precipitation nowcasting. Adv. Neural Inf. Process. Syst 28, 802–810 (2015).
Sheikhpour, R., Berahmand, K., Mohammadi, M. & Khosravi, H. Sparse feature selection using hypergraph Laplacian-based semi-supervised discriminant analysis. Pattern Recognit. 157, 110882 (2025).
Abdollahi, S., Deldari, A., Asadi, H., Montazerolghaem, A. & Mazinani, S. M. Flow-aware forwarding in SDN datacenters using a knapsack-PSO-based solution. IEEE Trans. Netw. Serv. Manag. 18, 2902–2914 (2021).
Grill, G. et al. Mapping the world’s free-flowing rivers. Nature 569, 215–221 (2019).
Abbott, B. W. et al. Human domination of the global water cycle absent from depictions and perceptions. Nat. Geosci. 12, 533–540 (2019).
Plate, E. J. Flood risk and flood management. J. Hydrol. 267, 2–11 (2002).
Robinson, P. J. Climate change and hydropower generation. Int. J. Climatol. 17, 983–996 (1997).
Yang, D., Yang, Y. & Xia, J. Hydrological cycle and water resources in a changing world: a review. Geogr. Sustain. 2, 115–122 (2021).
Sivapalan, M. et al. IAHS decade on predictions in ungauged basins (PUB), 2003–2012: shaping an exciting future for the hydrological sciences. Hydrol. Sci. J. 48, 857–880 (2003).
Castiglioni, S. et al. Smooth regional estimation of low-flow indices: physiographical space based interpolation and top-kriging. Hydrol. Earth Syst. Sci. 15, 715–727 (2011).
Skoien, J. O. & Blöschl, G. Spatiotemporal topological kriging of runoff time series. Water Resour. Res. 43, W09405 (2007).
Farmer, W. H. Ordinary kriging as a tool to estimate historical daily streamflow records. Hydrol. Earth Syst. Sci. 20, 2721–2735 (2016).
Wagener, T., Gupta, H. V. & Wheater, H. S. Rainfall-runoff Modelling in Gauged and Ungauged Catchments (World Scientific, 2004).
Wagener, T. & Wheater, H. S. Parameter estimation and regionalization for continuous rainfall-runoff models including uncertainty. J. Hydrol. 320, 132–154 (2006).
Hochreiter, S. & Schmidhuber, J. Long short-term memory. Neural Comput. 9, 1735–1780 (1997).
Kratzert, F. et al. Towards learning universal, regional, and local hydrological behaviors via machine learning applied to large-sample datasets. Hydrol. Earth Syst. Sci. 23, 5089–5110 (2019).
Kratzert, F. et al. Toward improved predictions in ungauged basins: exploiting the power of machine learning. Water Resour. Res. 55, 11344–11354 (2019).
Kratzert, F. et al. Large-scale river network modeling using graph neural networks. In EGU General Assembly Conference Abstracts, EGU21–13375 (Vienna: European Geosciences Union, 2021).
Addor, N., Newman, A. J., Mizukami, N. & Clark, M. P. The CAMELS data set: catchment attributes and meteorology for large-sample studies. Hydrol. Earth Syst. Sci. 21, 5293–5313 (2017).
Black, M., Wan, Z., Nayyeri, A., & Wang, Y. Understanding over-squashing and bottlenecks on graphs via curvature. In Proceedings of the 40th International Conference on Machine Learning 2528–2547 (2023).
Karhadkar, K., Banerjee, P. K. & Montúfar, G. FoSR: first-order spectral rewiring for addressing oversquashing in GNNs. In International Conference on Learning Representations 11790 (2023).
Wu, Z. et al. Representing long-range context for graph neural networks with global attention. Adv. Neural Inf. Process. Syst. 34, 13266–13279 (2021).
Ying, C. et al. Do transformers really perform badly for graph representation? Adv. Neural Inf. Process. Syst. 34, 28877–28888 (2021).
Klingler, C., Schulz, K. & Herrnegger, M. LamaH | Large-sample data for hydrology and environmental sciences for Central Europe. Earth Syst. Sci. Data Discuss. 2021, 1–46 (2021).
Kirschstein, N. & Sun, Y. The merit of river network topology for neural flood forecasting. In Proceedings of the 41st International Conference on Machine Learning 24713–24725 (2024).
Kipf, T. N. & Welling, M. Semi-supervised classification with graph convolutional networks. In International Conference on Learning Representations (Toulon, France, 2017).
Velickovic, P. et al. Graph attention networks. Stat 1050, 10–48550 (2017).
Chen, M., Wei, Z., Huang, Z., Ding, B. & Li, Y. Simple and deep graph convolutional networks. In Proceedings of the 37th International Conference on Machine Learning (eds Daumé III, H. & Singh, A.) 1725–1735 (PMLR, 2020).
Black, M., Wan, Z., Nayyeri, A. & Wang, Y. Understanding oversquashing in GNNs through the lens of effective resistance. In International Conference on Machine Learning (eds Krause A. et al.) 2528–2547 (PMLR, 2023).
Rodriguez-Iturbe, I. & Rinaldo, A. Fractal River Basins: Chance and Self-organization (Cambridge University Press, 1997).
Rinaldo, A., Rigon, R., Banavar, J. R., Maritan, A. & Rodriguez-Iturbe, I. Evolution and selection of river networks: statics, dynamics, and complexity. Proc. Natl Acad. Sci. USA 111, 2417–2424 (2014).
Dodds, P. S. & Rothman, D. H. Scaling, universality, and geomorphology. Annu. Rev. Earth Planet. Sci. 28, 571–610 (2000).
Alon, U. & Yahav, E. On the bottleneck of graph neural networks and its practical implications. In International Conference on Learning Representations (2020).
Defferrard, M., Bresson, X. & Vandergheynst, P. Convolutional neural networks on graphs with fast localized spectral filtering. Adv. Neural Inf. Process Syst 29, 3844–3852 (2016).
Hamilton, W., Ying, J. & Leskovec, J. Inductive representation learning on large graphs. Adv. Neural Inf. Process. Syst 30, 1024–1034 (2017).
Xu, K., Hu, W., Leskovec, J. & Jegelka, S. How powerful are graph neural networks? In International Conference on Learning Representations (2018).
IPCC. Climate Change 2021: The Physical Science Basis. Contribution of Working Group I to the Sixth Assessment Report of the Intergovernmental Panel on Climate Change (Cambridge University Press, 2021).
Nguyen, K. et al. Revisiting over-smoothing and over-squashing using Ollivier–Ricci curvature. In International Conference on Machine Learning (eds Krause, A. et al.) 25956–25979 (PMLR, 2023).
Wu, Z. et al. A comprehensive survey on graph neural networks. IEEE Trans. Neural Netw. Learn. Syst. 32, 4–24 (2020).
Tabari, H. Climate change impact on flood and extreme precipitation increases with water availability. Sci. Rep. 10, 13768 (2020).
Karpatne, A. et al. Theory-guided data science: a new paradigm for scientific discovery from data. IEEE Trans. Knowl. Data Eng. 29, 2318–2331 (2017).
Reichstein, M. et al. Deep learning and process understanding for data-driven earth system science. Nature 566, 195–204 (2019).
Addor, N. et al. A ranking of hydrological signatures based on their predictability in space. Water Resour. Res. 54, 8792–8812 (2018).
Shen, C. A transdisciplinary review of deep learning research and its relevance for water resources scientists. Water Resour. Res. 54, 8558–8593 (2018).
Willard, J., Jia, X., Xu, S., Steinbach, M. & Kumar, V. Integrating scientific knowledge with machine learning for engineering and environmental systems. ACM Comput. Surv. 55, 1–37 (2022).
United Nations. Transforming Our World: The 2030 Agenda for Sustainable Development (United Nations, 2015).
Lovász, L. Random walks on graphs. In Combinatorics, Paul Erdős is Eighty (eds Miklós, D. et al.) Vol. 2, 4 (Budapest: János Bolyai Mathematical Society, 1993).

Acknowledgements

The authors would like to thank the Research Institute of Trustworthy Autonomous Systems, Southern University of Science and Technology, for supporting the completion of this study.

Author information

Authors and Affiliations

The University of Tokyo, Tokyo, Japan
Hongjun Wang & Yinqiang Zheng
Southern University of Science and Technology, Shenzhen, China
Hongjun Wang & Xuan Song
The Hong Kong Polytechnic University, Hong Kong, China
Jiyuan Chen
Jilin University, Changchun, China
Xuan Song

Authors

Hongjun Wang
Jiyuan Chen
Yinqiang Zheng
Xuan Song

Contributions

H.W. and J.C. conceived and designed the research. H.W. conducted the experiments and analyzed the results. J.C. and X.S. developed the methodology and performed the data analysis. Y.Z. supervised the research and provided critical feedback. H.W. wrote the original draft. All authors contributed to reviewing and editing the manuscript.

Corresponding authors

Correspondence to Yinqiang Zheng or Xuan Song.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Wang, H., Chen, J., Zheng, Y. et al. Accelerating flood warnings by 10 hours: the power of river network topology in AI-enhanced flood forecasting. npj Nat. Hazards 2, 45 (2025). https://doi.org/10.1038/s44304-025-00083-6

Received: 20 December 2024
Accepted: 17 March 2025
Published: 09 June 2025
DOI: https://doi.org/10.1038/s44304-025-00083-6
