Introduction

The accumulation of heavy precipitation typically occurs within a short period of time, exceeding the drainage capacity of mountain streams and slopes and thus causing flash floods1,2. Flash floods not only may lead to grave consequences such as causalities, they may also cause damage to property, including private houses, farmland, and infrastructure3. Flash floods pose a significant threat to public safety and economic activities4,5. Flash floods are characterized by their suddenness, intensity, destructiveness, and short duration6,7, making them one of the most dangerous types of floods8,9.

The assessment of flash flood susceptibility is a crucial aspect of flash flood research, typically developed based on two types of models10: (1) Parametric models (physically-based models) such as Hydrologic Engineering Center’s River Analysis System (HEC-RAS)11, Hydrologic Engineering Center’s Hydrologic Modeling System (HEC-HMS)12, and Soil and Water Assessment Tool (SWAT)13. These models are theoretically robust in scientific terms; however, they face challenges in practical applications, such as high computational demands, difficulties in parameter calibration, and high development and maintenance costs11. Moreover, issues like scale effects and other limiting factors can affect the accuracy of these models14,15. (2) Non-parametric models (data-driven models), including Random Forest (RF)16, Support Vector Machine (SVM)17, Logistic Regression (LR)18, and Artificial Neural Networks (ANN)19. These models do not rely on a detailed description of the physical process20, but generally use a certain algorithmic structure, and then adjust their parameters to the data (training), as well as their complexity by optimizing the bias-variance trade-off, so that the fitted model generalizes well to new data20. Such methods have achieved significant results in practical applications21,22. Additionally, researchers are also exploring methods to integrate physical models with data-driven models to achieve more accurate predictive results23.

The development of ML models, as well as hybrid models (hybrid models are defined as models that integrate multiple machine learning algorithms)24, has sparked increased interest in model interpretability24,25,26,27. Although model interpretability has been a concern for a long time and depends on the types of models used, many ML models have remained difficult to understand due to their complexity (e.g., complex structures and parameters28, non-linear interactions29, and the “black box” nature30), leading to significant challenges in explaining their predictive outcomes. However, the H2O Automated ML (Auto-ML) platform provides access to a range of predictive models, automated model selection, and model parameter (hyper-parameters) tuning as well as integrates methods for model interpretability. This constitutes an advancement for making decisions about the suitability of the modelling process31.

Flash floods typically result from a combination of factors25. Particularly in the YTRB, flash floods are frequent due to the region’s complex terrain and weather patterns, which make prediction particularly challenging32. Research on the YTRB has strongly focused on the response of precipitation, runoff, and land use to climate change33,34,35. In terms of flood research, the main focus has been on flood mechanisms such as glacial lake outbursts and localized debris flows36, while flash flood susceptibility has not been extensively investigated. Flash flood susceptibility research will not only help to understand the influencing factors of flash floods, but also provide a reference for policy formulation and enhanced local flash flood prevention and mitigation37.

This study explores the use of auto-ML for flash flood susceptibility mapping in the YTRB, responding to the urgent need for these tools in the region, and conducts interpretability analysis of the results. The innovation of the study is reflected in the following aspects: the introduction of advanced Auto-ML techniques to enhance model efficiency and accuracy; the integration of interpretability analysis methods to make the model results transparent and credible; and, the provision of quick, accurate and up-to-date flash flood susceptibility mapping for the YTRB. Based on this, our research objectives are: (1) to identify and evaluate the influencing factors of flash floods; (2) to select and evaluate machine learning models; (3) to conduct flash flood susceptibility mapping; and (4) to perform interpretability analysis on the results of the optimal model.

Materials and methods

Study area

The YTRB (82°00′ E to 97°07′ E, 28°00′ N to 31°16′ N) is situated in the southern part of the Tibetan Plateau, covering a total area of approximately 240,000 km2 (Fig. 1). The topography of the basin is characterized by higher elevations in the west, north and south, with lower elevations in the east and middle. The average altitude exceeds 4000 m38. The main valley of the YTRB follows an east-west fault zone, giving the basin an elongated shape. Situated in this tectonic fault zone, the YTRB experiences frequent earthquakes and active geological processes36. Major tributaries include the Lhasa River, Parlung Tsangpo, Yigong Tsangpo, Lhoka Tsangpo, Nizequ, and Nianchu River39.

The eastern region of the YTRB receives over 900 mm of annual precipitation, while the western region gets less than 100 mm, making it arid. The spatial and temporal distribution of precipitation within the basin is extremely uneven40, the occurrence time and intensity of extreme precipitation range from 25 to 50 mm/day38. Vegetation in the basin includes alpine meadow and steppe in the upper reaches, shrub lands and forests in the middle, and coniferous and broadleaf forests in the lower reaches. The southeastern area, influenced by the tropical monsoon, is dominated by broadleaf forests. The main soil types are mountainous shrub-steppe soils and alpine steppe soils.

Fig. 1
figure 1

The geographical location of the YTRB in the Tibetan Plateau, China. This figure was created using ArcGIS 10.8 software.

Data

Flash flood inventory data

Flash flood inventory records document a series of events caused by intense rainfall41. Analyzing historical flash flood data is essential for mitigating flood-related damage and enhancing preventative measures42. The flash flood inventory data used in this study covers the period from 1980 to 2019, primarily sourced from the China Flash Flood Investigation Project, with supplementary information from online news media and government records. The data attributes include the geographic location, year, and month of occurrence of the disasters (Table 1). Since the 1980s, the YTRB has experienced 605 documented flash floods (Fig. 1). Additionally, we utilized ArcGIS 10.8 software to randomly generate 605 non-flash flood points within the study area. Historical flash floods data suggests that geographic and environmental factors contribute to the susceptibility of certain areas to flash floods, indicating that regions with similar attributes may also experience such events.

Table 1 Partial data of historical flash flood points.

Factors influencing flash floods

An adequate selection of factors influencing flash floods is crucial for predicting flash flood susceptibility. We selected 20 factors (Table 2) based on previous studies43,44,45,46,47. These factors are categorized into meteorological and surface factors. Meteorological factors include annual maximum 3-hourly precipitation (3-H-P), annual maximum 6-hourly precipitation (6-H-P), annual maximum 12-hourly precipitation (12-H-P), annual maximum 24-hourly precipitation (24-H-P), and average multi-year precipitation (AP). The long-term average precipitation is indicative of the overall soil humidity in the region. Spatial interpolation was applied to annual mean precipitation data from meteorological stations for the years 1980–2015 (Fig. 2a). In contrast, extreme precipitation indices highlight the potential for intense, short-duration downpours. The maximum 3-hourly precipitation values were derived from China Meteorological Forcing Dataset (CMFD), for the years 2011–2015 (Fig. 2b).

Surface factors include slope (SL), aspect (AS), plan curvature (PLC), profile curvature (PRC), terrain ruggedness index (TRI), topographic position index (TPI), topographic wetness index (TWI), stream power index (SPI), river density (RD), distance from river (DFR), soil type (ST), lithology (LI), normalized difference vegetation index (NDVI), and land use/land cover (LULC). All factors influencing flash floods were resampled to a spatial resolution of 1 km.

Table 2 Description of factors influencing flash floods.

Topography plays a crucial role in influencing the speed and direction of surface runoff, with steep slopes facilitating the rapid downhill flow of water48. Utilizing 90-meter spatial resolution STRM DEM data, the study area was cropped to its boundaries (Fig. 2c). Aspect (Fig. 2d) affects the direction of surface runoff49. The PLC and PRC (Fig. 2e and f) determine the direction and speed of runoff, which influences the accumulation or dispersion of water on the terrain. The TPI measures the relative elevation and terrain variability at specific points (Fig. 2g). The TWI indicates how the topography influences the water flow paths and water accumulation and soil saturation potential (Fig. 2h), demonstrating the impact of the terrain on moisture retention50. The SPI reveals the layout of the river system and its potential water conveyance capacity (Fig. 2i)51. Indices such as RD and DFR indicate the hydrological status of the location (hydrological status refers to characteristics such as the distribution, flow, and availability of water bodies in the region) as shown in Fig. 2j and k52. ST (Fig. 2l) can affect surface runoff by affecting infiltration. For example, clayey soils may restrict infiltration during heavy rainfall events. The LI can affect water infiltration, so geologic structures that are susceptible to erosion can exacerbate flooding during heavy rainfall events (Fig. 2m)2. The NDVI is extensively used as a vegetation condition index to effectively differentiate between healthy land cover and areas that are either unhealthy or lacking vegetation (Fig. 2n)2. The Changes of LULC affects the permeability of water (Fig. 2o), thereby affecting surface runoff53. The methods for calculating the TPI, TWI and SPI indices are as follows:

$$TPI=Z - {Z_{mean}}$$
(1)

Where Z is the elevation value at a specific point, and \(~{Z_{mean}}\) is the average elevation of the surrounding points. Positive TPI values typically indicate ridges, negative values indicate valleys, and values near zero indicate flat areas.

$$TWI=ln\left( {\frac{A}{{tan\left( \beta \right)}}} \right)$$
(2)

Where A is the water flow accumulation, and β is the local slope angle in radians. TWI is used to identify areas that may be saturated due to water flow accumulation, often wetlands or valleys. A higher TWI value indicates greater potential for saturation.

$$SPI=A \times tan\left( \beta \right)$$
(3)

The SPI is used to estimate the erosive power of water flow at a given point. It is a function of water flow accumulation and the steepness of the slope. Higher SPI values indicate stronger erosive potential.

Fig. 2
figure 2

Thematic map of factors influencing flash flood in the YTRB: meteorological factors: (a) 3-H-P, (b) AP; Surface factors: (c) DEM, (d) As, (e) PLC, (f) PRC, (g) TPI, (h) TWI, (i) SPI, (j) RD, (k) DFR, (l) ST, (m) LI, (n) NDVI, (o) LULC. This figure was created using ArcGIS 10.8 software.

Methods

Based on the initially selected flash flood influencing factors, this study involves processes such as factor selection, ML model training, model evaluation, flash flood susceptibility mapping, and model interpretation. A flowchart of the approach used in this study is shown in Fig. 3.

Fig. 3
figure 3

Flowchart for predicting flash flood susceptibility based on the H2O Auto-ML.

Correlation and multi-collinearity analysis

Spearman’s rank correlation coefficient is a non-parametric statistical method ideal for analyzing data that does not require a normal distribution or linear relationship. It evaluates the monotonic relationship between variables by ranking the data, which enhances its robustness to outliers. Its simplicity in calculation makes it particularly well-suited for ordinal data, especially when the conditions for traditional parametric tests are not met54. As such, it is commonly employed to assess the correlation between each pair of different factors.

Multi-collinearity refers to the condition where multiple independent variables exhibit high correlation with each other, potentially causing instability in statistical models and leading to inaccurate parameter estimates. This, in effect, compromises the model’s predictive accuracy and interpretative clarity. Therefore, it is essential to assess and address multicollinearity when selecting factors that may influence flash floods. Multi-collinearity analysis is evaluated using Variance Inflation Factors (VIFs), where:

  1. 1.

    A VIF of 1 indicates no multi-collinearity.

  2. 2.

    A VIF between 1 and 5 suggests a moderate level of multi-collinearity, which is usually acceptable.

  3. 3.

    While a VIF above 5 indicates strong multi-collinearity55.

H2O Auto-ML platform

H2O is an open-source, distributed ML platform that automates the training and testing of multiple ML models efficiently56. It employs a combination of rapid stochastic search and layered integration to attain results. The H2O Auto-ML workflow encompasses several key stages such as data preprocessing, features engineering (the process of selecting, modifying, or creating new features from raw data to improve the performance of ML models), hyper-parameter tuning, model performance evaluation, and interpretability57.

H2O Auto-ML constructs multiple types of ML models, including Gradient Boosting Machines (GBM), Distributed Random Forest (DRF), Generalized Linear Models (GLM), deep learning architectures, eXtreme Randomized Trees (XRT), and more. Following the individual training of all these models, H2O Auto-ML combines them as Stacked Ensemble All Models (SEAM) and Stacked Ensemble Best Family (SEBF) to make predictions.

Model performance evaluation

The sample data were divided into training (70%) and testing (30%) datasets. Generic statistical metrics were selected to assess the accuracy of the predictions.

Sensitivity, which measures the proportion of actual positive cases (e.g., instances of flash floods) that the model correctly identifies, is defined as:

$${\text{Sensitivity~}}=\frac{{{\text{TP}}}}{{{\text{TP}}+{\text{FN}}}}$$
(4)

where true positive (TP) is the number of correctly predicted positive cases, and false negative (FN) is the number of incorrectly predicted negative cases.

Specificity, which measures the proportion of actual negative cases (e.g., instances of no flash floods) that the model correctly identifies, is defined as:

$${\text{Specificity~}}=\frac{{{\text{TN}}}}{{{\text{TN}}+{\text{FP}}}}$$
(5)

where true negative (TN) is the number of correctly predicted negative cases, and false positive (FP) is the number of incorrectly predicted positive cases.

Accuracy, which measures the overall correctness of the model predictions, combining both positive and negative cases, is defined as:

$${\text{Accuracy~}}=\frac{{{\text{TN}}+{\text{TP}}}}{{{\text{TN}}+{\text{FN}}+{\text{TP}}+{\text{FP}}}}$$
(6)

Precision, which measures the proportion of predicted positive cases that are actually positive, indicating the reliability of positive predictions, is defined as:

$${\text{Precision~}}=\frac{{TP}}{{FP+TP}}$$
(7)

The area under the Receiver Operating Characteristic (ROC) curve (AUC), which summarizes in a single metric a model’s overall performance after considering multiple decision boundaries. The closer the ROC curve is to the top-right corner of the plot the better is the model performance. AUC values range from 0.5 to 1, indicating random guessing or perfect discrimination between binary classes, respectively.

Model interpretability and Shapley Additive explanations (SHAP) method

Model interpretability refers to the extent to which the relationship between input factors and output predictions generated by ML models is easy to understand. The SHAP value assigns an importance score to each factor by determining its contribution to the model’s prediction. It represents a factor’s average marginal contribution across all possible factor subsets. This provides an objective measure of how much each factor influences the model’s prediction for flash flood susceptibility58. In the context of H2O Auto-ML’s flash flood model, the SHAP value, \({\phi _i}\), for the i-th factor is calculated as follows:

$$\phi _{i} = \mathop \sum \limits_{{S \subseteq N\backslash \left\{ i \right\}}} \frac{{\left| S \right|!\left( {M - \left| S \right| - 1} \right)!}}{{M!}}\left[ {f_{X} \left( {S \cup \left\{ i \right\}} \right) - f_{X} \left( S \right)} \right]$$
(8)

Where N is the set of all factors, and S is a subset of N, excluding factor i. M is the total number of factors, and \({f_X}\left( S \right)\) represents the model’s output when only the factors in subset S are considered.

The predicted output \(f\left( x \right)\) for any given input x is then explained as:

$$f\left( x \right)={\phi _0}+\mathop \sum \limits_{{i=1}}^{M} {\phi _i}$$
(9)

Where \({\phi _0}~\)represents the baseline contribution to the solution, which is the average predicted output when no factors are considered, and \({\phi _i}\) the factors’ contribution relative to the baseline.

SHAP enhances both local and global interpretability of the H2O Auto-ML model for flash flood susceptibility mapping. Locally, it explains individual predictions by showing how each factor influences the model’s output. Globally, it highlights the most important factors by averaging SHAP values across multiple samples, revealing the factors that consistently impact predictions. Additionally, SHAP assists in factor selection by ranking key factors, facilitating more efficient model optimization and reducing complexity without sacrificing accuracy.

Results

Spearman correlation and multi-collinearity analyses

This study computed Spearman’s coefficient for each pair of factors influencing flash floods, as shown in Fig. 4. The sign of the Spearman’s coefficient determines the direction of correlation between the factors influencing flash flood. The Spearman correlation coefficient ranges from − 1 to 1, with values closer to 0 indicating weaker correlations between the factors influencing flash flood. The analysis reveals that the strongest correlation is between TRI and SL, with a coefficient of 0.98, followed by a negative correlation between RD and DFR, with a coefficient of -0.9. The factors 3-H-P, 6-H-P, 12-H-P, and 24-H-P also exhibit relatively strong positive correlations, with coefficients ranging from 0.71 to 0.88.

Fig. 4
figure 4

Spearman correlation analysis of flash flood influencing factors. Red indicating a positive correlation and blue indicating an inverse correlation.

The results of the multi-collinearity analysis are presented in Table 3, showing that the smallest VIF is associated to the AS factor and the largest to the 12-H-P factor. The factors 6-H-P, 12-H-P, 24-H-P, SL, and TRI, have VIFs above the threshold of 5 and are excluded from the list of predictors to be used in the ML models of flash flood susceptibility, leaving a final number of 15 factors to be considered.

Table 3 VIF values for factors influencing flash floods.

Model validation and performance evaluation

The learning curves of the models show that the number of iterations for the XRT and DRF models is 36 and 35, respectively (Fig. 5a and b), indicating that these individual models have higher operational efficiency. In contrast, the SEAM and SEBF models required 103 and 122 iterations, respectively (Fig. 5c and d). Stacked models require a longer iteration time to achieve the desired results due to more extensive model training, cross-validation hyper-parameter tuning, and model fusion step—combining the predictions from the multiple base models using an additional ML model, the meta-learner.

Fig. 5
figure 5

Model learning curves (a: XRT, b:DRF, c:SEAM, and d: SEBF).

In the training set results, the XRT model demonstrated the best performance, with an AUC value of 0.9671, followed by DRF (0.9669), SEAM (0.9668), and SEBF (0.9665). In the testing set results (Table 4). The sensitivity, which indicates the number of correctly classified floods, was high with values of 0.938 and 0.955 for XRT and SEAM, respectively. Specificity, which refers to the model’s ability to correctly identify non-flash floods, was high for XRT and SEAM, at 0.938 and 0.937, respectively. In contrast, SEBF had the lowest specificity, at 0.887. Accuracy, which indicates the proportion of all predictions where the model correctly predicted flash floods and no flash floods, was highest for XRT and DRF at 0.938, followed by SEAM (with an accuracy of 0.932), and lowest for SEBF (0.924). Precision indicates the percentage of events predicted as flash floods by the model. The highest accuracy (0.938) was found for XRT and SEAM.

Table 4 Evaluation of model performance (testing set results).

In model testing, the AUC values of the four ML models were high (Fig. 6), indicating their high performance in predicting flash flood susceptibility. In fact, the differences between the highest and lowest AUC values among the four models was only 0.006. By comparing at the AUC values for the training set and those for the test set, the XRT was selected for flash flood susceptibility mapping and interpretability analysis.

Fig. 6
figure 6

Comparison of the performance of four ML models (XRT, DRF, SEAM, and SEBF) using ROC curves and AUC values on the test set.

Flash flood susceptibility mapping

For each raster cell, the XRT model trained with 15 factors influencing flash floods was used to estimate flood susceptibility. The results were then classified using the natural breaks method, which is widely used in similar studies44,59. The flash flood susceptibility was classified into five levels: very low, low, moderate, high, and very high. As shown in Fig. 7 and 8.92% and 12.95% of the study area fall into very high and high flood susceptibility levels, respectively; 31.37% and 31.34% of the area fall into very low and low flood susceptibility levels, respectively; and the remaining 15.42% of the area falls into the moderate flood susceptibility level.

In the flash flood susceptibility map, the western and central parts of the basin present very high and high susceptibility levels, while the eastern regions have fewer areas with such flash flood susceptibility. When overlaying the (represented by points in the map), 74.9% of historical flash floods occurred in areas classified as moderate to very high flash flood susceptibility.

Fig. 7
figure 7

Flash flood susceptibility map of the study region by the XRT model.

Analysis of model interpretability

The global importance of the factors influencing flash floods was evaluated by computing the mean absolute SHAP values across all the observations. The SHAP summary plot in Fig. 8, illustrates the positive and negative impacts of the factors on the prediction outcomes. It identified the DEM as the most important factor, whereas LULC is the least important factor. The DEM emerged as the dominant factor possibly due to the YTRB’s location in the southeast of the Qinghai-Tibet Plateau, which is characterized by complex topography and significant elevation differences. In such a geographic environment, changes in elevation have a direct impact on a variety of natural processes such as surface water flow and climate. Concurrently, changes in altitude lead to significant variations in vegetation type and coverage, which affect the infiltration and evaporation of surface water. As such, the importance of LULC is lower, likely because of the limited impact of human activities and minimal land-use changes in remote and high-altitude areas, rendering the effect of land-use change on hydrological processes less significant than other factors. Overall, the five main factors influencing flash floods are all topography-related (DEM, TWI, TPI, AS, and SPI), indicating that topography is dominant. This was followed by the meteorological factor 3-H-P and surface factor NDVI. Hydrological factors such as river network density and river network distance have limited impact on flash floods.

Fig. 8
figure 8

Summary plot of SHAP values for the XRT model.

In many cases, the rankings of factors in the SHAP plot and factor importance plot (Fig. 9) exhibit comparable patterns. However, the SHAP value offers a local interpretation of individual prediction outcomes, whereas the factor importance provides an overall perspective. The first three dominant factors (DEM, TWI, and TPI) are consistent between the two plots, which reinforces the idea that topographic factors are the most important factors influencing flash floods in this study region. Similarly, LULC and hydrological factors are found to have minor influence according to Fig. 9. However, in contrast with the SHAP plot, this figure ranks multi-year average precipitation higher than the 3-H-P.

Fig. 9
figure 9

Factor importance for the XRT model.

Discussion

The exploration of the impact of main influencing factors on flash flood susceptibility

In Ma et al.’s study on flash floods in Yunnan Province, China, precipitation, topography, and flood prevention measures were identified as major influencing factors60. However, our study, using the same precipitation dataset, indicates that precipitation is not a primary factor for flash floods in the YTRB. This may be due to the higher precipitation levels and the frequency of extreme rainfall compared to the central and western regions of the YTRB. In the YTRB, the central and western regions experience lower annual precipitation, have numerous river valleys, and more high-susceptibility areas. In contrast, the eastern region has higher annual precipitation, deeply incised canyons, and fewer high-susceptibility areas, indicating that the topography may inhibit the occurrence of flash floods, thereby emphasizing the role of terrain in flood events.

We further explore the mechanisms by which the main influencing factors affect flash floods. Partial Dependence Plots (PDPs), which are suitable for analyzing the relationship between the magnitude of a factor (x-axis) and its impact on predictions (y-axis), are used to investigate specific influences. As depicted in Fig. 10a, the susceptibility to flash floods is not well explained by terrain altitudes exceeding 5000 m. Within the range of 1300–4000 m, the contribution of terrain altitude to explain flash flood susceptibility is notably high in this region. Figure 10b indicates that regions with high TWI values better explain flash flood susceptibility, which can be attributed to increased water accumulation. Figure 10c illustrates that as the TPI increases, the probability of flash flood occurrence exhibits a pattern of rapid escalation followed by a rapid decline. A TPI that falls within the range of – 1.5 to 0.6 significantly contribute to explain flash flood susceptibility. Figure 10d reveals that the NDVI contribution to explain flash flood susceptibility is lower when its value is either exceptionally high or low. Figure 10e shows that when the influence factor (AP) is within the range of 350–500 mm, the model’s average prediction result, which represents flash flood susceptibility, is at a relatively high level. Figure 10f shows that the plane curvature is within a limited range, when the value is higher, its contribution to explain flash floods susceptibility decreases. These findings underscore the complex interplay between topographic, climatic, and vegetation factors in determining flash flood susceptibility.

Fig. 10
figure 10

Partial dependence plot of XRT model for six factors influencing flash floods: (a)DEM; (b) TWI; (c) TPI; (d) NDVI; (e) AP; and (f) PLC.

The implication of the research to local stakeholders

In the YTRB, the middle and lower reaches are covered with fertile soils and well-irrigated forests, making it home to approximately 50% of the population of the Tibet Autonomous Region, including areas such as Shigatse, Lhasa, Lokha County, and Nyingchi38. However, due to the basin’s unique geographical location and complex natural conditions, it often faces threats from natural disasters such as flash floods and landslides. Considering the results of this study and the costs of flash flood prevention, the following recommendations are proposed:

  1. 1.

    Use the flash flood susceptibility maps to inform zoning decisions that better protect communities and property.

  2. 2.

    Screen communities and assets at risk for further studying and for developing mitigation measures, including relocation and safe evacuation routes.

  3. 3.

    Contribute to the development of flash flood risk indicators and potentially to the development of early warning systems.

The limitation of the proposed methodology and future research.

The dataset utilized in this study is constrained by historical limitations in flood statistics within China and the geographical context of the YTRB, which has influenced the results. Our findings indicate that impact factors exhibit variability in both spatial and temporal resolutions, with the spatial resolution of extreme precipitation being particularly in need of enhancement. To address temporal variability, we predominantly employed data from approximately 2015; however, data limitations hindered our ability to thoroughly analyze spatiotemporal dynamics.

Overall, the investigation of flash flood susceptibility in the YTRB, especially in relation to heavy rainfall, poses significant challenges due to difficulties in data acquisition, modeling uncertainties, and the intricate nature of influencing factors. Despite these challenges, this study provides a solid basis for future research in flash flood susceptibility. Future studies should prioritize addressing the discrepancies in spatial and temporal resolutions of various influencing factors, the generalization of flash flood susceptibility models to other regions, and account for the temporal variability of these factors.

Conclusion

In this study, we employed H2O Auto-ML for the inaugural generation of a flash flood susceptibility map for the YTRB. The analysis incorporated 605 flash flood occurrences and utilized 15 influencing factors. To assess the correlations among flood condition parameters, multiple collinearity diagnostic tests (e.g., VIF) and Spearman correlation matrices were conducted for the selection of influencing factors. Utilizing the H2O Auto-ML platform, we developed and validated flood susceptibility models with training and validation datasets, identifying key contributing factors through the SHAP interpretability algorithm. The results indicated that both individual algorithms (e.g., XRT; DRF) and ensemble algorithms (e.g., SEAM; SEBF) based on H2O Auto-ML achieved high performance. With AUC values exceeding 0.97, the XRT model demonstrated the superior performance. In the flash flood susceptibility map generated by the XRT model, areas classified as highly susceptible or above constituted 21.87% of the total area, predominantly distributed in the western and central regions of the YTRB. Furthermore, 74.9% of historical flash flood occurrences were situated in regions classified from moderate to extremely high flood susceptibility. Terrain factors such as DEM, TWI, and TPI proved to be more effective in this study than other considered factors, offering valuable insights for managing population distribution in the YTRB region.

Overall, the XRT model based on H2O Auto-ML offers significant advantages, including rapid computation speed, automatic parameter optimization, and enhanced interpretability, facilitating the prompt development of flash flood susceptibility maps and associated analyses. Fast and accurate flash flood susceptibility mapping constitutes a critical contribution to risk management. This study serves as a reference for other researchers to continue advancing this type of techniques, and it will assist regional planners and local decision-makers in formulating strategies to mitigate the consequences associated with flash flood disasters.