Interpretable flash flood susceptibility mapping in Yarlung Tsangpo River Basin using H2O Auto-ML

He, Fei; Liu, Suxia; Mo, Xingguo; Wang, Zhonggen

doi:10.1038/s41598-024-84655-y

Download PDF

Article
Open access
Published: 11 January 2025

Interpretable flash flood susceptibility mapping in Yarlung Tsangpo River Basin using H2O Auto-ML

Fei He^1,2,
Suxia Liu^1,2,
Xingguo Mo^1,2 &
…
Zhonggen Wang^1,3

Scientific Reports volume 15, Article number: 1702 (2025) Cite this article

4183 Accesses
20 Citations
13 Altmetric
Metrics details

Subjects

Abstract

Flash flood susceptibility mapping is essential for identifying areas prone to flooding events and aiding decision-makers in formulating effective prevention measures. This study aims to evaluate the flash flood susceptibility in the Yarlung Tsangpo River Basin (YTRB) using multiple machine learning (ML) models facilitated by the H2O automated ML platform. The best-performing model was used to generate a flash flood susceptibility map, and its interpretability was analyzed using the Shapley Additive Explanations (SHAP) tree interpretation method. The results revealed that the top four models, including both single and ensemble models, demonstrated high accuracy in the tests. The flash flood susceptibility map generated by the best-performing eXtreme Randomized Trees (XRT) model showed that 8.92%, 12.95%, 15.42%, 31.34%, and 31.37% of the study area exhibited very high, high, moderate, low, and very low flash flood susceptibility, respectively, with approximately 74.9% of the historical flash floods occurring in areas classified as moderate to very high susceptibility. The SHAP plot identified topographic factors as the primary drivers of flash floods, with the importance analysis ranking the most influential factors in such descending order as DEM, topographic wetness index, topographic position index, normalized difference vegetation index, and average multi-year precipitation. This study demonstrates the benefits of interpretable machine learning, which can provide guidance for flash flood mitigation.

Future flood susceptibility mapping under climate and land use change

Article Open access 11 April 2025

The comparative study of machine learning agent models in flood forecasting for tidal river reaches

Article Open access 31 May 2025

Leveraging multi-source data and teleconnection indices for enhanced runoff prediction using coupled deep learning models

Article Open access 27 April 2025

Introduction

The accumulation of heavy precipitation typically occurs within a short period of time, exceeding the drainage capacity of mountain streams and slopes and thus causing flash floods^1,2. Flash floods not only may lead to grave consequences such as causalities, they may also cause damage to property, including private houses, farmland, and infrastructure³. Flash floods pose a significant threat to public safety and economic activities^4,5. Flash floods are characterized by their suddenness, intensity, destructiveness, and short duration^6,7, making them one of the most dangerous types of floods^8,9.

The assessment of flash flood susceptibility is a crucial aspect of flash flood research, typically developed based on two types of models¹⁰: (1) Parametric models (physically-based models) such as Hydrologic Engineering Center’s River Analysis System (HEC-RAS)¹¹, Hydrologic Engineering Center’s Hydrologic Modeling System (HEC-HMS)¹², and Soil and Water Assessment Tool (SWAT)¹³. These models are theoretically robust in scientific terms; however, they face challenges in practical applications, such as high computational demands, difficulties in parameter calibration, and high development and maintenance costs¹¹. Moreover, issues like scale effects and other limiting factors can affect the accuracy of these models^14,15. (2) Non-parametric models (data-driven models), including Random Forest (RF)¹⁶, Support Vector Machine (SVM)¹⁷, Logistic Regression (LR)¹⁸, and Artificial Neural Networks (ANN)¹⁹. These models do not rely on a detailed description of the physical process²⁰, but generally use a certain algorithmic structure, and then adjust their parameters to the data (training), as well as their complexity by optimizing the bias-variance trade-off, so that the fitted model generalizes well to new data²⁰. Such methods have achieved significant results in practical applications^21,22. Additionally, researchers are also exploring methods to integrate physical models with data-driven models to achieve more accurate predictive results²³.

The development of ML models, as well as hybrid models (hybrid models are defined as models that integrate multiple machine learning algorithms)²⁴, has sparked increased interest in model interpretability^24,25,26,27. Although model interpretability has been a concern for a long time and depends on the types of models used, many ML models have remained difficult to understand due to their complexity (e.g., complex structures and parameters²⁸, non-linear interactions²⁹, and the “black box” nature³⁰), leading to significant challenges in explaining their predictive outcomes. However, the H2O Automated ML (Auto-ML) platform provides access to a range of predictive models, automated model selection, and model parameter (hyper-parameters) tuning as well as integrates methods for model interpretability. This constitutes an advancement for making decisions about the suitability of the modelling process³¹.

Flash floods typically result from a combination of factors²⁵. Particularly in the YTRB, flash floods are frequent due to the region’s complex terrain and weather patterns, which make prediction particularly challenging³². Research on the YTRB has strongly focused on the response of precipitation, runoff, and land use to climate change^33,34,35. In terms of flood research, the main focus has been on flood mechanisms such as glacial lake outbursts and localized debris flows³⁶, while flash flood susceptibility has not been extensively investigated. Flash flood susceptibility research will not only help to understand the influencing factors of flash floods, but also provide a reference for policy formulation and enhanced local flash flood prevention and mitigation³⁷.

This study explores the use of auto-ML for flash flood susceptibility mapping in the YTRB, responding to the urgent need for these tools in the region, and conducts interpretability analysis of the results. The innovation of the study is reflected in the following aspects: the introduction of advanced Auto-ML techniques to enhance model efficiency and accuracy; the integration of interpretability analysis methods to make the model results transparent and credible; and, the provision of quick, accurate and up-to-date flash flood susceptibility mapping for the YTRB. Based on this, our research objectives are: (1) to identify and evaluate the influencing factors of flash floods; (2) to select and evaluate machine learning models; (3) to conduct flash flood susceptibility mapping; and (4) to perform interpretability analysis on the results of the optimal model.

Materials and methods

Study area

The YTRB (82°00′ E to 97°07′ E, 28°00′ N to 31°16′ N) is situated in the southern part of the Tibetan Plateau, covering a total area of approximately 240,000 km² (Fig. 1). The topography of the basin is characterized by higher elevations in the west, north and south, with lower elevations in the east and middle. The average altitude exceeds 4000 m³⁸. The main valley of the YTRB follows an east-west fault zone, giving the basin an elongated shape. Situated in this tectonic fault zone, the YTRB experiences frequent earthquakes and active geological processes³⁶. Major tributaries include the Lhasa River, Parlung Tsangpo, Yigong Tsangpo, Lhoka Tsangpo, Nizequ, and Nianchu River³⁹.

The eastern region of the YTRB receives over 900 mm of annual precipitation, while the western region gets less than 100 mm, making it arid. The spatial and temporal distribution of precipitation within the basin is extremely uneven⁴⁰, the occurrence time and intensity of extreme precipitation range from 25 to 50 mm/day³⁸. Vegetation in the basin includes alpine meadow and steppe in the upper reaches, shrub lands and forests in the middle, and coniferous and broadleaf forests in the lower reaches. The southeastern area, influenced by the tropical monsoon, is dominated by broadleaf forests. The main soil types are mountainous shrub-steppe soils and alpine steppe soils.

Data

Flash flood inventory data

Flash flood inventory records document a series of events caused by intense rainfall⁴¹. Analyzing historical flash flood data is essential for mitigating flood-related damage and enhancing preventative measures⁴². The flash flood inventory data used in this study covers the period from 1980 to 2019, primarily sourced from the China Flash Flood Investigation Project, with supplementary information from online news media and government records. The data attributes include the geographic location, year, and month of occurrence of the disasters (Table 1). Since the 1980s, the YTRB has experienced 605 documented flash floods (Fig. 1). Additionally, we utilized ArcGIS 10.8 software to randomly generate 605 non-flash flood points within the study area. Historical flash floods data suggests that geographic and environmental factors contribute to the susceptibility of certain areas to flash floods, indicating that regions with similar attributes may also experience such events.

Table 1 Partial data of historical flash flood points.

Full size table

Factors influencing flash floods

An adequate selection of factors influencing flash floods is crucial for predicting flash flood susceptibility. We selected 20 factors (Table 2) based on previous studies^{43,44,45,46,47}. These factors are categorized into meteorological and surface factors. Meteorological factors include annual maximum 3-hourly precipitation (3-H-P), annual maximum 6-hourly precipitation (6-H-P), annual maximum 12-hourly precipitation (12-H-P), annual maximum 24-hourly precipitation (24-H-P), and average multi-year precipitation (AP). The long-term average precipitation is indicative of the overall soil humidity in the region. Spatial interpolation was applied to annual mean precipitation data from meteorological stations for the years 1980–2015 (Fig. 2a). In contrast, extreme precipitation indices highlight the potential for intense, short-duration downpours. The maximum 3-hourly precipitation values were derived from China Meteorological Forcing Dataset (CMFD), for the years 2011–2015 (Fig. 2b).

Surface factors include slope (SL), aspect (AS), plan curvature (PLC), profile curvature (PRC), terrain ruggedness index (TRI), topographic position index (TPI), topographic wetness index (TWI), stream power index (SPI), river density (RD), distance from river (DFR), soil type (ST), lithology (LI), normalized difference vegetation index (NDVI), and land use/land cover (LULC). All factors influencing flash floods were resampled to a spatial resolution of 1 km.

Table 2 Description of factors influencing flash floods.

Full size table

Topography plays a crucial role in influencing the speed and direction of surface runoff, with steep slopes facilitating the rapid downhill flow of water⁴⁸. Utilizing 90-meter spatial resolution STRM DEM data, the study area was cropped to its boundaries (Fig. 2c). Aspect (Fig. 2d) affects the direction of surface runoff⁴⁹. The PLC and PRC (Fig. 2e and f) determine the direction and speed of runoff, which influences the accumulation or dispersion of water on the terrain. The TPI measures the relative elevation and terrain variability at specific points (Fig. 2g). The TWI indicates how the topography influences the water flow paths and water accumulation and soil saturation potential (Fig. 2h), demonstrating the impact of the terrain on moisture retention⁵⁰. The SPI reveals the layout of the river system and its potential water conveyance capacity (Fig. 2i)⁵¹. Indices such as RD and DFR indicate the hydrological status of the location (hydrological status refers to characteristics such as the distribution, flow, and availability of water bodies in the region) as shown in Fig. 2j and k⁵². ST (Fig. 2l) can affect surface runoff by affecting infiltration. For example, clayey soils may restrict infiltration during heavy rainfall events. The LI can affect water infiltration, so geologic structures that are susceptible to erosion can exacerbate flooding during heavy rainfall events (Fig. 2m)². The NDVI is extensively used as a vegetation condition index to effectively differentiate between healthy land cover and areas that are either unhealthy or lacking vegetation (Fig. 2n)². The Changes of LULC affects the permeability of water (Fig. 2o), thereby affecting surface runoff⁵³. The methods for calculating the TPI, TWI and SPI indices are as follows:

$$TPI=Z - {Z_{mean}}$$

(1)

Where Z is the elevation value at a specific point, and $~{Z_{mean}}$ is the average elevation of the surrounding points. Positive TPI values typically indicate ridges, negative values indicate valleys, and values near zero indicate flat areas.

$$TWI=ln\left( {\frac{A}{{tan\left( \beta \right)}}} \right)$$

(2)

Where A is the water flow accumulation, and β is the local slope angle in radians. TWI is used to identify areas that may be saturated due to water flow accumulation, often wetlands or valleys. A higher TWI value indicates greater potential for saturation.

$$SPI=A \times tan\left( \beta \right)$$

(3)

The SPI is used to estimate the erosive power of water flow at a given point. It is a function of water flow accumulation and the steepness of the slope. Higher SPI values indicate stronger erosive potential.

Methods

Based on the initially selected flash flood influencing factors, this study involves processes such as factor selection, ML model training, model evaluation, flash flood susceptibility mapping, and model interpretation. A flowchart of the approach used in this study is shown in Fig. 3.

Correlation and multi-collinearity analysis

Spearman’s rank correlation coefficient is a non-parametric statistical method ideal for analyzing data that does not require a normal distribution or linear relationship. It evaluates the monotonic relationship between variables by ranking the data, which enhances its robustness to outliers. Its simplicity in calculation makes it particularly well-suited for ordinal data, especially when the conditions for traditional parametric tests are not met⁵⁴. As such, it is commonly employed to assess the correlation between each pair of different factors.

Multi-collinearity refers to the condition where multiple independent variables exhibit high correlation with each other, potentially causing instability in statistical models and leading to inaccurate parameter estimates. This, in effect, compromises the model’s predictive accuracy and interpretative clarity. Therefore, it is essential to assess and address multicollinearity when selecting factors that may influence flash floods. Multi-collinearity analysis is evaluated using Variance Inflation Factors (VIFs), where:

1.
A VIF of 1 indicates no multi-collinearity.
2.
A VIF between 1 and 5 suggests a moderate level of multi-collinearity, which is usually acceptable.
3.
While a VIF above 5 indicates strong multi-collinearity⁵⁵.

H2O Auto-ML platform

H2O is an open-source, distributed ML platform that automates the training and testing of multiple ML models efficiently⁵⁶. It employs a combination of rapid stochastic search and layered integration to attain results. The H2O Auto-ML workflow encompasses several key stages such as data preprocessing, features engineering (the process of selecting, modifying, or creating new features from raw data to improve the performance of ML models), hyper-parameter tuning, model performance evaluation, and interpretability⁵⁷.

H2O Auto-ML constructs multiple types of ML models, including Gradient Boosting Machines (GBM), Distributed Random Forest (DRF), Generalized Linear Models (GLM), deep learning architectures, eXtreme Randomized Trees (XRT), and more. Following the individual training of all these models, H2O Auto-ML combines them as Stacked Ensemble All Models (SEAM) and Stacked Ensemble Best Family (SEBF) to make predictions.

Model performance evaluation

The sample data were divided into training (70%) and testing (30%) datasets. Generic statistical metrics were selected to assess the accuracy of the predictions.

Sensitivity, which measures the proportion of actual positive cases (e.g., instances of flash floods) that the model correctly identifies, is defined as:

$${\text{Sensitivity~}}=\frac{{{\text{TP}}}}{{{\text{TP}}+{\text{FN}}}}$$

(4)

where true positive (TP) is the number of correctly predicted positive cases, and false negative (FN) is the number of incorrectly predicted negative cases.

Specificity, which measures the proportion of actual negative cases (e.g., instances of no flash floods) that the model correctly identifies, is defined as:

$${\text{Specificity~}}=\frac{{{\text{TN}}}}{{{\text{TN}}+{\text{FP}}}}$$

(5)

where true negative (TN) is the number of correctly predicted negative cases, and false positive (FP) is the number of incorrectly predicted positive cases.

Accuracy, which measures the overall correctness of the model predictions, combining both positive and negative cases, is defined as:

$${\text{Accuracy~}}=\frac{{{\text{TN}}+{\text{TP}}}}{{{\text{TN}}+{\text{FN}}+{\text{TP}}+{\text{FP}}}}$$

(6)

Precision, which measures the proportion of predicted positive cases that are actually positive, indicating the reliability of positive predictions, is defined as:

$${\text{Precision~}}=\frac{{TP}}{{FP+TP}}$$

(7)

The area under the Receiver Operating Characteristic (ROC) curve (AUC), which summarizes in a single metric a model’s overall performance after considering multiple decision boundaries. The closer the ROC curve is to the top-right corner of the plot the better is the model performance. AUC values range from 0.5 to 1, indicating random guessing or perfect discrimination between binary classes, respectively.

Model interpretability and Shapley Additive explanations (SHAP) method

Model interpretability refers to the extent to which the relationship between input factors and output predictions generated by ML models is easy to understand. The SHAP value assigns an importance score to each factor by determining its contribution to the model’s prediction. It represents a factor’s average marginal contribution across all possible factor subsets. This provides an objective measure of how much each factor influences the model’s prediction for flash flood susceptibility⁵⁸. In the context of H2O Auto-ML’s flash flood model, the SHAP value, ${\phi _i}$, for the i-th factor is calculated as follows:

$$\phi _{i} = \mathop \sum \limits_{{S \subseteq N\backslash \left\{ i \right\}}} \frac{{\left| S \right|!\left( {M - \left| S \right| - 1} \right)!}}{{M!}}\left[ {f_{X} \left( {S \cup \left\{ i \right\}} \right) - f_{X} \left( S \right)} \right]$$

(8)

Where N is the set of all factors, and S is a subset of N, excluding factor i. M is the total number of factors, and ${f_X}\left( S \right)$ represents the model’s output when only the factors in subset S are considered.

The predicted output $f\left( x \right)$ for any given input x is then explained as:

$$f\left( x \right)={\phi _0}+\mathop \sum \limits_{{i=1}}^{M} {\phi _i}$$

(9)

Where ${\phi _0}~$represents the baseline contribution to the solution, which is the average predicted output when no factors are considered, and ${\phi _i}$ the factors’ contribution relative to the baseline.

SHAP enhances both local and global interpretability of the H2O Auto-ML model for flash flood susceptibility mapping. Locally, it explains individual predictions by showing how each factor influences the model’s output. Globally, it highlights the most important factors by averaging SHAP values across multiple samples, revealing the factors that consistently impact predictions. Additionally, SHAP assists in factor selection by ranking key factors, facilitating more efficient model optimization and reducing complexity without sacrificing accuracy.

Results

Spearman correlation and multi-collinearity analyses

This study computed Spearman’s coefficient for each pair of factors influencing flash floods, as shown in Fig. 4. The sign of the Spearman’s coefficient determines the direction of correlation between the factors influencing flash flood. The Spearman correlation coefficient ranges from − 1 to 1, with values closer to 0 indicating weaker correlations between the factors influencing flash flood. The analysis reveals that the strongest correlation is between TRI and SL, with a coefficient of 0.98, followed by a negative correlation between RD and DFR, with a coefficient of -0.9. The factors 3-H-P, 6-H-P, 12-H-P, and 24-H-P also exhibit relatively strong positive correlations, with coefficients ranging from 0.71 to 0.88.

The results of the multi-collinearity analysis are presented in Table 3, showing that the smallest VIF is associated to the AS factor and the largest to the 12-H-P factor. The factors 6-H-P, 12-H-P, 24-H-P, SL, and TRI, have VIFs above the threshold of 5 and are excluded from the list of predictors to be used in the ML models of flash flood susceptibility, leaving a final number of 15 factors to be considered.

Table 3 VIF values for factors influencing flash floods.

Full size table

Model validation and performance evaluation

The learning curves of the models show that the number of iterations for the XRT and DRF models is 36 and 35, respectively (Fig. 5a and b), indicating that these individual models have higher operational efficiency. In contrast, the SEAM and SEBF models required 103 and 122 iterations, respectively (Fig. 5c and d). Stacked models require a longer iteration time to achieve the desired results due to more extensive model training, cross-validation hyper-parameter tuning, and model fusion step—combining the predictions from the multiple base models using an additional ML model, the meta-learner.

In the training set results, the XRT model demonstrated the best performance, with an AUC value of 0.9671, followed by DRF (0.9669), SEAM (0.9668), and SEBF (0.9665). In the testing set results (Table 4). The sensitivity, which indicates the number of correctly classified floods, was high with values of 0.938 and 0.955 for XRT and SEAM, respectively. Specificity, which refers to the model’s ability to correctly identify non-flash floods, was high for XRT and SEAM, at 0.938 and 0.937, respectively. In contrast, SEBF had the lowest specificity, at 0.887. Accuracy, which indicates the proportion of all predictions where the model correctly predicted flash floods and no flash floods, was highest for XRT and DRF at 0.938, followed by SEAM (with an accuracy of 0.932), and lowest for SEBF (0.924). Precision indicates the percentage of events predicted as flash floods by the model. The highest accuracy (0.938) was found for XRT and SEAM.

Table 4 Evaluation of model performance (testing set results).

Full size table

In model testing, the AUC values of the four ML models were high (Fig. 6), indicating their high performance in predicting flash flood susceptibility. In fact, the differences between the highest and lowest AUC values among the four models was only 0.006. By comparing at the AUC values for the training set and those for the test set, the XRT was selected for flash flood susceptibility mapping and interpretability analysis.

Flash flood susceptibility mapping

For each raster cell, the XRT model trained with 15 factors influencing flash floods was used to estimate flood susceptibility. The results were then classified using the natural breaks method, which is widely used in similar studies^44,59. The flash flood susceptibility was classified into five levels: very low, low, moderate, high, and very high. As shown in Fig. 7 and 8.92% and 12.95% of the study area fall into very high and high flood susceptibility levels, respectively; 31.37% and 31.34% of the area fall into very low and low flood susceptibility levels, respectively; and the remaining 15.42% of the area falls into the moderate flood susceptibility level.

In the flash flood susceptibility map, the western and central parts of the basin present very high and high susceptibility levels, while the eastern regions have fewer areas with such flash flood susceptibility. When overlaying the (represented by points in the map), 74.9% of historical flash floods occurred in areas classified as moderate to very high flash flood susceptibility.

Analysis of model interpretability

The global importance of the factors influencing flash floods was evaluated by computing the mean absolute SHAP values across all the observations. The SHAP summary plot in Fig. 8, illustrates the positive and negative impacts of the factors on the prediction outcomes. It identified the DEM as the most important factor, whereas LULC is the least important factor. The DEM emerged as the dominant factor possibly due to the YTRB’s location in the southeast of the Qinghai-Tibet Plateau, which is characterized by complex topography and significant elevation differences. In such a geographic environment, changes in elevation have a direct impact on a variety of natural processes such as surface water flow and climate. Concurrently, changes in altitude lead to significant variations in vegetation type and coverage, which affect the infiltration and evaporation of surface water. As such, the importance of LULC is lower, likely because of the limited impact of human activities and minimal land-use changes in remote and high-altitude areas, rendering the effect of land-use change on hydrological processes less significant than other factors. Overall, the five main factors influencing flash floods are all topography-related (DEM, TWI, TPI, AS, and SPI), indicating that topography is dominant. This was followed by the meteorological factor 3-H-P and surface factor NDVI. Hydrological factors such as river network density and river network distance have limited impact on flash floods.

In many cases, the rankings of factors in the SHAP plot and factor importance plot (Fig. 9) exhibit comparable patterns. However, the SHAP value offers a local interpretation of individual prediction outcomes, whereas the factor importance provides an overall perspective. The first three dominant factors (DEM, TWI, and TPI) are consistent between the two plots, which reinforces the idea that topographic factors are the most important factors influencing flash floods in this study region. Similarly, LULC and hydrological factors are found to have minor influence according to Fig. 9. However, in contrast with the SHAP plot, this figure ranks multi-year average precipitation higher than the 3-H-P.

Discussion

The exploration of the impact of main influencing factors on flash flood susceptibility

In Ma et al.’s study on flash floods in Yunnan Province, China, precipitation, topography, and flood prevention measures were identified as major influencing factors⁶⁰. However, our study, using the same precipitation dataset, indicates that precipitation is not a primary factor for flash floods in the YTRB. This may be due to the higher precipitation levels and the frequency of extreme rainfall compared to the central and western regions of the YTRB. In the YTRB, the central and western regions experience lower annual precipitation, have numerous river valleys, and more high-susceptibility areas. In contrast, the eastern region has higher annual precipitation, deeply incised canyons, and fewer high-susceptibility areas, indicating that the topography may inhibit the occurrence of flash floods, thereby emphasizing the role of terrain in flood events.

We further explore the mechanisms by which the main influencing factors affect flash floods. Partial Dependence Plots (PDPs), which are suitable for analyzing the relationship between the magnitude of a factor (x-axis) and its impact on predictions (y-axis), are used to investigate specific influences. As depicted in Fig. 10a, the susceptibility to flash floods is not well explained by terrain altitudes exceeding 5000 m. Within the range of 1300–4000 m, the contribution of terrain altitude to explain flash flood susceptibility is notably high in this region. Figure 10b indicates that regions with high TWI values better explain flash flood susceptibility, which can be attributed to increased water accumulation. Figure 10c illustrates that as the TPI increases, the probability of flash flood occurrence exhibits a pattern of rapid escalation followed by a rapid decline. A TPI that falls within the range of – 1.5 to 0.6 significantly contribute to explain flash flood susceptibility. Figure 10d reveals that the NDVI contribution to explain flash flood susceptibility is lower when its value is either exceptionally high or low. Figure 10e shows that when the influence factor (AP) is within the range of 350–500 mm, the model’s average prediction result, which represents flash flood susceptibility, is at a relatively high level. Figure 10f shows that the plane curvature is within a limited range, when the value is higher, its contribution to explain flash floods susceptibility decreases. These findings underscore the complex interplay between topographic, climatic, and vegetation factors in determining flash flood susceptibility.

The implication of the research to local stakeholders

In the YTRB, the middle and lower reaches are covered with fertile soils and well-irrigated forests, making it home to approximately 50% of the population of the Tibet Autonomous Region, including areas such as Shigatse, Lhasa, Lokha County, and Nyingchi³⁸. However, due to the basin’s unique geographical location and complex natural conditions, it often faces threats from natural disasters such as flash floods and landslides. Considering the results of this study and the costs of flash flood prevention, the following recommendations are proposed:

1.
Use the flash flood susceptibility maps to inform zoning decisions that better protect communities and property.
2.
Screen communities and assets at risk for further studying and for developing mitigation measures, including relocation and safe evacuation routes.
3.
Contribute to the development of flash flood risk indicators and potentially to the development of early warning systems.

The limitation of the proposed methodology and future research.

The dataset utilized in this study is constrained by historical limitations in flood statistics within China and the geographical context of the YTRB, which has influenced the results. Our findings indicate that impact factors exhibit variability in both spatial and temporal resolutions, with the spatial resolution of extreme precipitation being particularly in need of enhancement. To address temporal variability, we predominantly employed data from approximately 2015; however, data limitations hindered our ability to thoroughly analyze spatiotemporal dynamics.

Overall, the investigation of flash flood susceptibility in the YTRB, especially in relation to heavy rainfall, poses significant challenges due to difficulties in data acquisition, modeling uncertainties, and the intricate nature of influencing factors. Despite these challenges, this study provides a solid basis for future research in flash flood susceptibility. Future studies should prioritize addressing the discrepancies in spatial and temporal resolutions of various influencing factors, the generalization of flash flood susceptibility models to other regions, and account for the temporal variability of these factors.

Conclusion

In this study, we employed H2O Auto-ML for the inaugural generation of a flash flood susceptibility map for the YTRB. The analysis incorporated 605 flash flood occurrences and utilized 15 influencing factors. To assess the correlations among flood condition parameters, multiple collinearity diagnostic tests (e.g., VIF) and Spearman correlation matrices were conducted for the selection of influencing factors. Utilizing the H2O Auto-ML platform, we developed and validated flood susceptibility models with training and validation datasets, identifying key contributing factors through the SHAP interpretability algorithm. The results indicated that both individual algorithms (e.g., XRT; DRF) and ensemble algorithms (e.g., SEAM; SEBF) based on H2O Auto-ML achieved high performance. With AUC values exceeding 0.97, the XRT model demonstrated the superior performance. In the flash flood susceptibility map generated by the XRT model, areas classified as highly susceptible or above constituted 21.87% of the total area, predominantly distributed in the western and central regions of the YTRB. Furthermore, 74.9% of historical flash flood occurrences were situated in regions classified from moderate to extremely high flood susceptibility. Terrain factors such as DEM, TWI, and TPI proved to be more effective in this study than other considered factors, offering valuable insights for managing population distribution in the YTRB region.

Overall, the XRT model based on H2O Auto-ML offers significant advantages, including rapid computation speed, automatic parameter optimization, and enhanced interpretability, facilitating the prompt development of flash flood susceptibility maps and associated analyses. Fast and accurate flash flood susceptibility mapping constitutes a critical contribution to risk management. This study serves as a reference for other researchers to continue advancing this type of techniques, and it will assist regional planners and local decision-makers in formulating strategies to mitigate the consequences associated with flash flood disasters.

Data availability

The data will be provided based on a request to the corresponding author, Suxia Liu (liusx@igsnrr.ac.cn).

References

Costache, R. et al. Flash flood hazard using deep learning based on H2O R package and fuzzy-multicriteria decision-making analysis. J. Hydrol. 609, 563. https://doi.org/10.1016/j.jhydrol.2022.127747 (2022).
Taherizadeh, M., Niknam, A., Nguyen-Huy, T., Mezösi, G. & Sarli, R. Flash flood-risk areas zoning using integration of decision-making trial and evaluation laboratory, GIS-based analytic network process and satellite-derived information. Nat. Hazards. 118, 2309–2335. https://doi.org/10.1007/s11069-023-06089-5 (2023).
Article Google Scholar
Pham, N. T. T., Nong, D. & Garschagen, M. Natural hazard’s effect and farmers’ perception: perspectives from flash floods and landslides in remotely mountainous regions of Vietnam. Sci. Total Environ. 759, 142656. https://doi.org/10.1016/j.scitotenv.2020.142656 (2021).
Article CAS PubMed Google Scholar
Othman, A., El-Saoud, W. A., Habeebullah, T., Shaaban, F. & Abotalib, A. Z. Risk assessment of flash flood and soil erosion impacts on electrical infrastructures in overcrowded mountainous urban areas under climate change. Reliab. Eng. Syst. Saf. 236, 109302. https://doi.org/10.1016/j.ress.2023.109302 (2023).
Article Google Scholar
Yin, J. B. et al. Large increase in global storm runoff extremes driven by climate and anthropogenic changes. Nat. Commun. 9, 4389. https://doi.org/10.1038/s41467-018-06765-2 (2018).
Gao, D. et al. Modelling and validation of flash flood inundation in drylands. J. Geogr. Sci. 34, 185–200. https://doi.org/10.1007/s11442-024-2201-7 (2024).
Article MATH Google Scholar
Yang, Z. L. et al. Meta-analysis and visualization of the literature on early identification of flash floods. Remote Sens. 14, 3313. https://doi.org/10.3390/rs14143313 (2022).
Diakakis, M. et al. Proposal of a flash flood impact severity scale for the classification and mapping of flash flood impacts. J. Hydrol. 590, 125452. https://doi.org/10.1016/j.jhydrol.2020.125452 (2020).
Ding, L. S. et al. A survey of remote sensing and geographic information system applications for flash floods. Remote Sens. 13, 1818. https://doi.org/10.3390/rs13091818 (2021).
Schweidtmann, A. M., Zhang, D. & Stosch, M. A review and perspective on hybrid modeling methodologies. Digit. Chem. Eng. 10, 100136. https://doi.org/10.1016/j.dche.2023.100136 (2024).
Costabile, P., Costanzo, C., Ferraro, D. & Barca, P. Is HEC-RAS 2D accurate enough for storm-event hazard assessment? Lessons learnt from a benchmarking study based on rain-on-grid modelling. J. Hydrol. 603, 126962. https://doi.org/10.1016/j.jhydrol.2021.126962 (2021).
Article Google Scholar
Khan, I. R., Elmahdy, S. I., Rustum, R., Khan, Q. & Mohamed, M. M. Floods modeling and analysis for Dubai using HEC-HMS model and remote sensing using GIS. Sci. Rep. 14, 4586. https://doi.org/10.1038/s41598-024-74736-3 (2024).
Article CAS Google Scholar
Soomro, S. E. H. et al. River flood susceptibility and basin maturity analyzed using a coupled approach of geo-morphometric parameters and SWAT model. Water Resour. Manag. 36, 2131–2160. https://doi.org/10.1007/s11269-022-03127-y (2022).
Article MATH Google Scholar
Merz, R., Parajka, J. & Blöschl, G. Scale effects in conceptual hydrological modeling. Water Resour. Res. 45, 7485. https://doi.org/10.1029/2009wr007872 (2009).
Article MATH Google Scholar
Dottori, F. & Todini, E. Testing a simple 2D hydraulic model in an urban flood experiment. Hydrol. Process. 27, 1301–1320. https://doi.org/10.1002/hyp.9370 (2013).
Article ADS MATH Google Scholar
Chen, W. et al. Modeling flood susceptibility using data-driven approaches of naive Bayes tree, alternating decision tree, and random forest methods. Sci. Total Environ. 701, 134979. https://doi.org/10.1016/j.scitotenv.2019.134979 (2020).
Liu, J. et al. Assessment of flood susceptibility mapping using support vector machine, logistic regression and their ensemble techniques in the Belt and Road region. Geocarto Int. 37, 9817–9846. https://doi.org/10.1080/10106049.2022.2025918 (2022).
Article ADS MATH Google Scholar
Al-Juaidi, A. E. M., Nassar, A. M. & Al-Juaidi, O. E. M. Evaluation of flood susceptibility mapping using logistic regression and GIS conditioning factors. Arab. J. Geosci. 11, 785. https://doi.org/10.1007/s12517-018-4095-0 (2018).
Article MATH Google Scholar
Costache, R. et al. Novel hybrid models between bivariate statistics, artificial neural networks and boosting algorithms for flood susceptibility assessment. J. Environ. Manage. 265, 110485. https://doi.org/10.1016/j.jenvman.2020.110485 (2020).
Pichler, M. & Hartig, F. Machine learning and deep learning—a review for ecologists. Methods Ecol. Evol. 14, 994–1016. https://doi.org/10.1111/2041-210x.14061 (2023).
Article MATH Google Scholar
Al-Ruzouq, R. et al. Flood susceptibility mapping using a novel integration of multi-temporal sentinel-1 data and eXtreme deep learning model. Geosci. Front. 15, 101780. https://doi.org/10.1016/j.gsf.2024.101780 (2024).
Rahmati, O. et al. Development of novel hybridized models for urban flood susceptibility mapping. Sci. Rep. 10, 129387. https://doi.org/10.1038/s41598-020-69703-7 (2020).
Mo, Z. B., Shi, R. Y. & Di, X. A physics-informed deep learning paradigm for car-following models. Transp. Res. C-Emer. 130, 103240. https://doi.org/10.1016/j.trc.2021.103240 (2021).
Band, S. S. et al. Flash Flood susceptibility modeling using New approaches of Hybrid and Ensemble Tree-based machine learning algorithms. Remote Sens. 12, 3568. https://doi.org/10.3390/rs12213568 (2020).
Article ADS MATH Google Scholar
Habibi, A., Delavar, M. R., Sadeghian, M. S., Nazari, B. & Pirasteh, S. A hybrid of ensemble machine learning models with RFE and Boruta wrapper-based algorithms for flash flood susceptibility assessment. Int. J. Appl. Earth Obs Geoinf. 122, 103401. https://doi.org/10.1016/j.jag.2023.103401 (2023).
Article Google Scholar
Pradhan, B., Lee, S., Dikshit, A. & Kim, H. Spatial flood susceptibility mapping using an explainable artificial intelligence (XAI) model. Geosci. Front. 14, 10162. https://doi.org/10.1016/j.gsf.2023.101625 (2023).
Wang, M. et al. An XGBoost-SHAP approach to quantifying morphological impact on urban flooding susceptibility. Ecol. Indic. 156, 111137. https://doi.org/10.1016/j.ecolind.2023.111137 (2023).
Qi, D. & Majda, A. J. Using machine learning to predict extreme events in complex systems. Proc. Natl. Acad. Sci. USA. 117, 52–59. https://doi.org/10.1073/pnas.1917285117 (2020).
Kang, J., Zhang, B. & Dang, A. A novel geospatial machine learning approach to quantify non-linear effects of land use/land cover change (LULCC) on carbon dynamics. Int. J. Appl. Earth Obs Geoinf. 128, 103712. https://doi.org/10.1016/j.jag.2024.103712 (2024).
Article Google Scholar
Rudin, C. Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead. Nat. Mach. Intell. 1, 206–215. https://doi.org/10.1038/s42256-019-0048-x (2019).
Article PubMed PubMed Central MATH Google Scholar
Luo, J. Y. et al. Prediction of biological nutrients removal in full-scale wastewater treatment plants using H2O automated machine learning and back propagation artificial neural network model: optimization and comparison. Bioresour Technol. 390, 129842. https://doi.org/10.1016/j.biortech.2023.129842 (2023).
Article CAS PubMed MATH Google Scholar
Wang, N., Cheng, W. M., Wang, B. X., Liu, Q. Y. & Zhou, C. H. Geomorphological regionalization theory system and division methodology of China. J. Geogr. Sci. 30, 212–232. https://doi.org/10.1007/s11442-020-1724-9 (2020).
Article MATH Google Scholar
Ye, X. Y., Guo, Y. H., Wang, Z. G., Liang, L. F. & Tian, J. Y. Extensive evaluation of four satellite precipitation products and their hydrologic applications over the Yarlung Zangbo River. Remote Sens. 14, 3350. https://doi.org/10.3390/rs14143350 (2022).
Article ADS Google Scholar
Luo, J. et al. Study of the intensity and driving factors of land use/cover change in the Yarlung Zangbo River, Nyang Qu River, and Lhasa River region, Qinghai-Tibet Plateau of China. J. Arid Land. 14, 411–425. https://doi.org/10.1007/s40333-022-0093-x (2022).
Article Google Scholar
Li, C. Y. et al. Runoff variations affected by climate change and human activities in Yarlung Zangbo River, southeastern tibetan Plateau. Catena 230, 107184. https://doi.org/10.1016/j.catena.2023.107184 (2023).
Article MATH Google Scholar
An, B. S. et al. Process, mechanisms, and early warning of glacier collapse-induced river blocking disasters in the Yarlung Tsangpo Grand Canyon, southeastern tibetan Plateau. Sci. Total Environ. 816, 151653. https://doi.org/10.1016/j.scitotenv.2021.151652 (2022).
Chen, Y. G., Zhang, X. Y., Yang, K. J., Zeng, S. Y. & Hong, A. Y. Modeling rules of regional flash flood susceptibility prediction using different machine learning models. Front. Earth Sci. 11, 1117004. https://doi.org/10.3389/feart.2023.1117004 (2023).
Article ADS Google Scholar
Sang, Y. F. et al. Precipitation variability and response to changing climatic condition in the Yarlung Tsangpo River basin, China. J. Geophys. Res. Atmos. 121, 8820–8831. https://doi.org/10.1002/2016jd025370 (2016).
Article ADS MATH Google Scholar
Li, S. et al. The evolution of Yarlung Tsangpo River: constraints from the age and provenance of the Gangdese conglomerates, southern Tibet. Gondwana Res. 41, 249–266. https://doi.org/10.1016/j.gr.2015.05.010 (2017).
Article ADS CAS Google Scholar
Dong, W. H. et al. Summer rainfall over the southwestern Tibetan Plateau controlled by deep convection over the Indian subcontinent. Nat. Commun. 7, 10925. https://doi.org/10.1038/ncomms10925 (2016).
Article ADS CAS PubMed PubMed Central MATH Google Scholar
Vennari, C., Parise, M., Santangelo, N. & Santo, A. A database on flash flood events in Campania, southern Italy, with an evaluation of their spatial and temporal distribution. Nat. Hazards Earth Syst. Sci. 16, 2485–2500. https://doi.org/10.5194/nhess-16-2485-2016 (2016).
Article ADS Google Scholar
Bilasco, S. et al. Flash flood risk assessment and mitigation in digital-era governance using unmanned aerial vehicle and GIS spatial analyses case study: small river basins. Remote Sens. 14, 2481. https://doi.org/10.3390/rs14102481 (2022).
Al-Areeq, A. M. et al. Computational machine learning approach for flood susceptibility assessment integrated with remote sensing and GIS techniques from Jeddah, Saudi Arabia. Remote Sens. 14, 5515. https://doi.org/10.3390/rs14215515 (2022).
Arabameri, A. et al. Flash flood susceptibility modelling using functional tree and hybrid ensemble techniques. J. Hydrol. 587, 125007. https://doi.org/10.1016/j.jhydrol.2020.125007 (2020).
Pandey, M. et al. Flood susceptibility modeling based on new hybrid intelligence model: optimization of XGboost model using GA metaheuristic algorithm. Adv. Space Res. 69, 3301–3318. https://doi.org/10.1016/j.asr.2022.02.027 (2022).
Article ADS MATH Google Scholar
Liu, J. F., Liu, K. & Wang, M. A. Residual neural network integrated with a hydrological model for global flood susceptibility mapping based on remote sensing datasets. Remote Sens. 15, 2447. https://doi.org/10.3390/rs15092447 (2023).
Gharakhanlou, N. M. & Perez, L. Flood susceptible prediction through the use of geospatial variables and machine learning methods. J. Hydrol. 617, 129121. https://doi.org/10.1016/j.jhydrol.2023.129121 (2023).
Bettoni, M. et al. Land use effects on surface runoff and soil erosion in a southern Alpine valley. Geoderma 435, 116505. https://doi.org/10.1016/j.geoderma.2023.116505 (2023).
Jaafarzadeh, M. S., Tahmasebipour, N., Haghizadeh, A., Pourghasemi, H. R. & Rouhani, H. Groundwater recharge potential zonation using an ensemble of machine learning and bivariate statistical models. Sci. Rep. 11, 5587. https://doi.org/10.1038/s41598-021-85205-6 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Riihimäki, H., Kemppinen, J., Kopecky, M. & Luoto, M. Topographic wetness index as a proxy for soil moisture: the importance of flow-routing algorithm and grid resolution. Water Resour. Res. 57, eWR029871. https://doi.org/10.1029/2021WR029871 (2021).
Bui, Q. T. et al. Verification of novel integrations of swarm intelligence algorithms into deep learning neural network for flood susceptibility mapping. J. Hydrol. 581, 124379. https://doi.org/10.1016/j.jhydrol.2019.124379 (2020).
Giovannettone, J., Copenhaver, T., Burns, M. & Choquette, S. A statistical approach to mapping flood susceptibility in the lower connecticut river valley region. Water Resour. Res. 54, 7603–7618. https://doi.org/10.1029/2018wr023018 (2018).
Article ADS Google Scholar
Roy, P. et al. Threats of climate and land use change on future flood susceptibility. J. Clean. Prod. 272, 122757. https://doi.org/10.1016/j.jclepro.2020.122757 (2020).
Sedgwick, P. Spearman’s rank correlation coefficient. BMJ 349, 748. https://doi.org/10.1136/bmj.g7528 (2014).
Khuri, A. I. Introduction to linear regression analysis. Int. Stat. Rev. 81, 318–319. https://doi.org/10.1111/insr.12020_10 (2013).
Article ADS MATH Google Scholar
LeDell, E. & Poirier, S. H2O automl: Scalable automatic machine learning. In Proceedings of the AutoML Workshop at ICML (2020).
Chen, H. et al. Toward an improved ensemble of multi-source daily precipitation via joint machine learning classification and regression. Atmos. Res. 304, 107385. https://doi.org/10.1016/j.atmosres.2024.107385 (2024).
Lundberg, S. M. & Lee, S. I. A unified approach to interpreting model predictions. Adv. Neur Inf. Process. Syst. 30, 475 (2017).
MATH Google Scholar
Tang, X. Z., Li, J. F., Liu, M. N., Liu, W. & Hong, H. Y. Flood susceptibility assessment based on a novel random Naive Bayes method: a comparison between different factor discretization methods. Catena 190, 104536. https://doi.org/10.1016/j.catena.2020.104536 (2020).
Article MATH Google Scholar
Ma, M. H. et al. XGBoost-based method for flash flood risk assessment. J. Hydrol. 598, 126382. https://doi.org/10.1016/j.jhydrol.2021.126382 (2021).

Download references

Acknowledgements

This study was funded by the Second Tibetan Plateau Scientific Expedition and Research (STEP) Program of China (Grant Nos. 2019QZKK0903，2019QZKK0403) and project of National Key Research and Development Program of China (Grant No. 2022YFF0801804).

Author information

Authors and Affiliations

Key Laboratory of Water Cycle and Related Land Surface Processes, Institute of Geographic Sciences and Natural Resources Research (IGSNRR), Chinese Academy of Sciences (CAS), Beijing, 100101, China
Fei He, Suxia Liu, Xingguo Mo & Zhonggen Wang
College of Resources and Environment/Sino-Danish College, University of Chinese Academy of Sciences (UCAS), Beijing, 100190, China
Fei He, Suxia Liu & Xingguo Mo
National Institute of Natural Hazards, Ministry of Emergency Management of the People’s Republic of China, Beijing, 100085, China
Zhonggen Wang

Authors

Fei He
View author publications
Search author on:PubMed Google Scholar
Suxia Liu
View author publications
Search author on:PubMed Google Scholar
Xingguo Mo
View author publications
Search author on:PubMed Google Scholar
Zhonggen Wang
View author publications
Search author on:PubMed Google Scholar

Contributions

S. L conceptualized the study. F. H drafted the literature review, developed the methodology, and interpreted the results. S. L, X. M, and Z. W reviewed and provided critical revisions to the manuscript. All authors have given consent for submission.

Corresponding author

Correspondence to Suxia Liu.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

He, F., Liu, S., Mo, X. et al. Interpretable flash flood susceptibility mapping in Yarlung Tsangpo River Basin using H2O Auto-ML. Sci Rep 15, 1702 (2025). https://doi.org/10.1038/s41598-024-84655-y

Download citation

Received: 07 August 2024
Accepted: 25 December 2024
Published: 11 January 2025
Version of record: 11 January 2025
DOI: https://doi.org/10.1038/s41598-024-84655-y

Keywords

This article is cited by

Machine learning model optimization for flood susceptibility zonation over the Kosi megafan, Himalayan foreland basin, India
- Aman Arora
- Purna Durga G
- Alireza Arabameri
Scientific Reports (2025)
Uncovering Spatiotemporal Urban Flood Dynamics: An Explainable GeoAI Approach to Land Cover Change Over Two Decades
- Abdulwaheed Tella
- Izni Zahidi
- Quoc Bao Pham
Earth Systems and Environment (2025)
Multi-dimensional assessment of flood susceptibility drivers in the urban watershed of Guwahati
- Ishita Afreen Ahmed
- Swapan Talukdar
- Atiqur Rahman
Acta Geophysica (2025)
Analysis of a comprehensive inventory of rainfall-induced landslides and multi-factor coupling mechanisms in Eastern Guangdong, China, in August 2018
- Chenchen Xie
- Chong Xu
- Huiran Gao
Environmental Earth Sciences (2025)
Interpretable machine learning for flood susceptibility mapping in the metropolitan region of São Paulo, Southeast Brazil
- Enner Alcântara
- Yasmim Carvalho Guimarães
- José Roberto Mantovani
Discover Geoscience (2025)

Subjects

Abstract

Similar content being viewed by others

Introduction

Materials and methods

Study area

Data

Flash flood inventory data

Factors influencing flash floods

Methods

Correlation and multi-collinearity analysis

H2O Auto-ML platform

Model performance evaluation

Model interpretability and Shapley Additive explanations (SHAP) method

Results

Spearman correlation and multi-collinearity analyses

Model validation and performance evaluation

Flash flood susceptibility mapping

Analysis of model interpretability

Discussion

The exploration of the impact of main influencing factors on flash flood susceptibility

The implication of the research to local stakeholders

Conclusion

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

This article is cited by

Search

Quick links