Introduction

The Bay of Bengal (BoB) region is highly susceptible to tropical cyclone-driven storm-tide hazards due to its high population density and low-lying coastal zone. While the region accounts for only 7.8% of the total number of recorded surge events globally, the events that do occur are extremely strong, with an average of five surge events per decade exceeding 5 m1. These events imperil the estimated 170 million people in the region who live below 10 m of elevation2. Particularly at risk is the Northern BoB surrounding the Ganges-Brahmaputra-Meghna Delta (GBM), consisting of the Country of Bangladesh and the Indian State of West Bengal, where the complex hydrodynamics of the river delta and the risk of high-intensity tropical cyclones (TCs) greatly increase the risk of compound flooding3.

In the next 25–30 years, coastal populations in the region—which includes large cities such as Kolkata, Chennai, and Visakhapatnam on the east coast of India; and Chittagong in Bangladesh—are expected to continue to grow4,5. Supporting these growing populations will require the construction of new infrastructure such as transportation networks, telecommunication services, electricity generation, and healthcare services. Failures and disruptions to this critical infrastructure can cascade, causing further suffering and damage in the wake of extreme storms6,7. At the same time as this population growth and infrastructure construction, the storm-tide hazard in the BoB will be exacerbated by higher-intensity cyclones8, subsiding river deltas9,10, and rising sea levels.

Accurately predicting storm-tide risk under present and future conditions is critical to provide policy makers and risk managers with the information they require to both protect existing assets (power plants, hospitals, etc.) and to inform the siting of additional infrastructure required to support a growing population base. In the BoB, this aim is complicated by several factors, including the strong riverine influence of the GBM system, rapidly evolving deltaic topobathy, and limited long-term observations of coastal water levels11 in the region. This complex marginal sea necessitates physics-based modeling approaches that are capable of resolving nonlinear interactions between oceanic, hydrologic, and atmospheric processes. Such nonlinear phenomena, such as tide-surge interaction, can greatly alter storm-tides and are best resolved using physics-based models.

To address this need, a substantial body of work advancing understanding of tides, storm surge, and coastal flooding in the BoB can be found in the literature. For example12, showcases an improved bathymetric dataset for the GBM delta, validating their new topographic/bathymetric dataset with tidally-forced hydrodynamic simulations of the region and showing the importance of accurately resolving the complex coastal region. Additional studies have examined specific storm events, performing hindcasts of some of the most devastating TCs to hit the region [refs. 13,14,15,16, among others]. Complementary works have analyzed flood return periods in the region under both present and future climate conditions17,18,19,20,21,22, while others have examined the interaction of different processes—e.g., the interactions between riverine discharge, wind-driven surge, and rainfall23; or the contribution of wave radiation stress on storm-surge on the East Coast of India24. Clearly, assessing coastal hazards in this highly populated and developing region is an active area of research.

These aforementioned studies underscore the importance of nonlinear coastal and hydrologic processes, however, they generally focus on specific events, use relatively few ensemble members in their extreme value analyses, or focus only on specific interactions/drivers of storm-surge. For example20, incorporated tidal forcing in their simulation of several thousand synthetic TCs; however, only one tidal stage per storm was accounted for, while21 excluded tidal forcing in order to isolate the impact of changes in storm behavior at the mid-century. These design choices limit the ability to attribute storm-surge to different drivers and do not allow for great confidence in long-term (>100-year) return levels.

In this study, we present a high-resolution, fully coupled ADvanced CIRCulation (ADCIRC) and Simulated WAves Nearshore (SWAN)25,26,27 modeling framework of the BoB. Using this model, we simulate both present-day and mid-century climate conditions using one historical and four future climate realizations each containing a millennia’s worth of tropical cyclones. For each storm, multiple tidal and riverine perturbations are applied, for future climate scenarios mean sea level is accounted for, and the effects of waves are incorporated through the coupling to SWAN in order to sample nonlinear interactions. Projections of mean sea level rise are also considered for the future climate scenarios. The resulting ensemble is used to estimate long-term tropical cyclone-driven storm-tide water levels at sites of critical coastal infrastructure in the BoB. Storm-tide is defined here as the total water level resulting from the combined effects of meteorologically driven surge, astronomical tides, riverine discharge, and mean sea level. Long-term return levels are estimated via fitting of the Generalized Pareto Distribution (GPD) with a peaks-over-threshold (POT) extreme value approach. In addition to the full storm ensemble, singly-forced simulations of tides, meteorology-only storm-surge, and riverine discharge are performed and used to assess the relative importance of nonlinear processes such as tide-surge interaction and wave-related contributions. This is accomplished via a regression-based sensitivity analysis that is intended for diagnostic attribution rather than event-level prediction and allows for the relative importance of various physical processes in estimating storm-tide hazards. While we apply this methodology specifically to sites of existing or proposed nuclear power plants in the region, it is applicable to any coast and region where storm-tide risk assessment would be useful. Furthermore, the storm-tide return levels reported hold true for the areas immediately surrounding the points of interest.

The remainder of this manuscript is structured as follows. First, we present the results of both our sensitivity analysis and calculated return levels for long return periods (Results). Next, we discuss the results of this study as well as future paths of inquiry presented by our findings (“Discussion”). Detailed information on model setup and methods, namely the hydrodynamic model, including mesh generation, model forcing, and validation, as well as an explanation for how return levels were calculated, are explained in “Methods”.

Results

This section presents the results from the two complementary analyses designed to examine tropical cyclone-driven coastal flood risk—expressed in terms of extreme storm-tide water levels—in the Bay of Bengal. Specifically, we (1) diagnose the importance of nonlinear interactions among storm-tide drivers using a regression-based comparison to linear superposition, and (2) estimate long-period storm-tide return levels under present and mid-century scenarios using extreme value analysis. The computational domain with topo/bathy and sites of interest labeled is shown in Fig. 1. All of the analyses and results are based on event-maximum storm-tides. In this study, “storm-tide” is defined as the maximum simulated total water level at a site of interest and includes the effects of meteorologically driven surge, astronomical tides, wave radiation stress, riverine discharge, and mean sea level offset. To compute this value, we take the maximum water level over all mesh nodes within 5 km of each site.

Fig. 1: Computational mesh covering the Bay of Bengal with sites of interest labeled.
Fig. 1: Computational mesh covering the Bay of Bengal with sites of interest labeled.The alternative text for this image may have been generated using AI.
Full size image

a Computational mesh domain with reactor sites indicated by red diamonds and labeled with numbers; bd Insets of high-resolution regions with flood plains. River inflow boundary conditions are denoted by the red solid lines.

The synthetic storms used are generated using the Synthetic Tropical cyclOne geneRation Model (STORM) algorithm28. The millenia corresponding to present-day conditions statistically matches samples from the International Best Track Archive for Climate Stewardship (IBTrACS)29,30, while the four millenia for future climate scenarios are drawn from four mid-century climate models from the Climate Model Intercomparison Project version 6 (CMIP6)31,32. Each synthetic storm is simulated using a fully coupled ADCIRC+SWAN model of the BoB. Coupling to SWAN allows wave effects (via wave radiation stresses) to be physically represented in all simulations used for extreme value analysis. Nonlinear interactions between different forcing mechanisms are sampled by simulating each storm 12 times, with four tidal phases—low tide, high tide, maximum flood current, maximum ebb curent—at landfall and three riverine discharge states—minimum, maximum, and mean mean monthly calculated from reanalysis from the Global Flood Awareness System (GloFAS)33. Mid-century storms additionally have a spatially-varying mean sea level offset interpolated from Deltares Global Tide and Surge Model version 3.0 (GTSM)34 enforced at the open ocean boundary. In addition to the full ensemble simulations, additional auxiliary simulations were performed to estimate the relative importance of nonlinear interactions through comparison of the full ensemble results to linear superposition of the auxiliary simulations. These include simulations of each individual storm in a “baseline” configuration forced only by meteorological fields and mean mean monthly riverine discharge as well as five single-forcing simulations to isolate the individual contributions of tides, riverine discharge, and mean sea level:

  • Tide-only forcing with mean mean monthly riverine discharge (present climate)

  • Tide-only forcing with mean mean monthly riverine discharge and mid-century mean sea level offset

  • Minimum mean monthly riverine discharge

  • Mean mean monthly riverine discharge

  • Maximum mean monthly riverine discharge

These auxiliary simulations are not part of the storm ensemble, but are used exclusively to perform the sensitivity analysis described in “Sensitivity analysis”.

The long-term return levels reported in “Extreme Value Analysis” are estimated for each climate scenario using the full ensemble simulations and a Generalized Pareto Distribution (GPD) with Peaks Over Threshold (POT) approach. Uncertainty is quantified via bootstrap resampling. Detailed descriptions of the ADCIRC+SWAN model, forcing sources, and statistical methodology are found in “Methods”.

Sensitivity analysis

To quantify the relative contributions of the nonlinear interactions among storm surge, tides, riverine discharge, and mean sea level, we compare the total water levels from the full storm ensemble to estimates based on linear superposition. Differences between the linear superposition and the full ensemble water levels are interpreted as the combined effect of nonlinear physical processes such as tide-surge interaction and wave setup. These effects are estimated using a regression framework defined as:

$$\begin{array}{rcl}{\eta }_{twl} & = & {\eta }_{baseline}^{* }+{\eta }_{tide}+{\eta }_{river}+{\eta }_{msl}+{\alpha }_{1}{\eta }_{baseline}^{* }\\ & & +{\alpha }_{2}{\eta }_{baseline}^{* }{\eta }_{tide}+{\alpha }_{3}{\eta }_{baseline}^{* }{\eta }_{river}+{\alpha }_{4}{\eta }_{baseline}^{* }{\eta }_{msl}\\ & & +{\alpha }_{5}{\eta }_{tide}{\eta }_{river}+{\alpha }_{6}{\eta }_{tide}{\eta }_{msl}+{\alpha }_{7}{\eta }_{river}{\eta }_{msl}\end{array}$$
(1)

Where \({\eta }_{baseline}^{* }={\eta }_{baseline}-{\eta }_{msl}\), in which ηbaseline is the water level from the storm’s baseline simulation and ηmsl is the water level due to mean sea level; ηriver is the water level due to the riverine discharge; and αi is a coefficient found by performing a least squares regression at a location of interest for all storms in the ensemble. ηmsl is calculated as the zero-frequency harmonic constituent from the yearlong tide-only simulations. Because the baseline simulations include the mean mean monthly riverine discharge, ηriver is calculated from the river-only simulations as ηriver = ηriver,flowstateηriver,meanflow (i.e., ηriver = 0 for ηtwl perturbations with mean riverine discharge). Note that in this formulation it is assumed that ηtwl is equal to the linear superposition of ηbaseline, ηtide, and ηriver plus the nonlinear interactions between surge, tide, and riverine forces as well as the \({\alpha }_{1}{\eta }_{baseline}^{* }\) term, which we assume is a proxy for unresolved processes such as wave setup, which is not included in the baseline simulations.

In order to assess the relative importance of each α term in Eq. (1), we use the method of standardized coefficients. By standardizing both the dependent and independent variables prior to performing the least squares regression, we can calculate so-called beta coefficients. By combining the terms in Eq. (1) without α coefficients on the left hand side and standardizing:

$$\begin{array}{l}\begin{array}{rcl}\overline{\varDelta \eta } & = & {\beta }_{1}\overline{{\eta }_{baseline}^{* }}+{\beta }_{2}\overline{{\eta }_{baseline}^{* }{\eta }_{tide}}+{\beta }_{3}\overline{{\eta }_{baseline}^{* }{\eta }_{river}}+{\beta }_{4}\overline{{\eta }_{baseline}^{* }{\eta }_{msl}}\\ & & +{\beta }_{5}\overline{{\eta }_{tide}{\eta }_{river}}+{\beta }_{6}\overline{{\eta }_{tide}{\eta }_{msl}}+{\beta }_{7}\overline{{\eta }_{river}{\eta }_{msl}}\end{array}\end{array}$$
(2)

Where,

$$\begin{array}{rcl}\begin{array}{rcl}\Delta \eta & = & {\eta }_{{twl}}-{\eta }_{{baseline}}^{\ast }-{\eta }_{{tide}}-{\eta }_{{river}}-{\eta }_{{msl}}\\ \bar{Y} & = & \frac{Y-\frac{1}{N}{\sum }_{i=1}^{N}{Y}_{i}}{\sigma (Y)}\\ N & = & {\rm{Number}}\,{\rm{of}}\,{\rm{samples}}\\ \sigma (Y) & = & {\rm{Standard}}\,{\rm{deviation}}\,{\rm{of}}\,Y\\ {\beta }_{i} & = & {\rm{Standardized}}\,{\rm{coefficient}}\end{array}\end{array}$$

The resulting β coefficients provide a sense of the relative importance of each nonlinear term in Eq. (1). Positive (negative) β values indicate that a given interaction term tends to increase (decrease) peak storm-tide relative to linear superposition.

Table 1 shows the correlation coefficients of various estimates of ηtwl. As the table shows, the performance of the “Full Fit” (Eq. (1)) outperforms both the pure linear superposition (Fit 1), as well as Fit 2, which incorporates unresolved physics such as wave setup by including \({\alpha }_{1}{\eta }_{baseline}^{* }\), i.e., \({\eta }_{twl}={\eta }_{baseline}^{* }+{\eta }_{tide}+{\eta }_{river}+{\eta }_{msl}+{\alpha }_{1}{\eta }_{baseline}^{* }\). The degradation seen between Fits 1 and 2 confirms that wave setup is not the sole contributor to larger storm surge when comparing ηtwl and ηbaseline, and that other nonlinear terms are required to accurately assess the contribution of various forces on storm-tides in the BoB.

Table 1 Correlation coefficients (R2) of estimates of ηtwl

The beta-coefficients of performing a least squares regression on Eq. (2) for each location of interest over all scenarios, storms, and perturbations are shown in Table 2 and Fig. 2. The magnitude of the coefficient provides a sense of the relative importance of each interaction term.

Fig. 2: Standardized β coefficients denoting relative importance of interaction terms on storm-tides.
Fig. 2: Standardized β coefficients denoting relative importance of interaction terms on storm-tides.The alternative text for this image may have been generated using AI.
Full size image

Relative importance of various nonlinear interaction terms on storm-tides based on the regression analysis described by Eq. (2). Positive (negative) β coefficients indicate that a particular interaction term increases (decreases) storm-tides. The magnitude of β provides a sense of the relative importance of an interaction term. Interaction terms are: a Wave-setup; b tide-surge; c river-surge; d mean sea level-surge; e tide-river; f tide-mean sea level; g river-mean sea level.

Table 2 Standardized coefficients βi for the terms in Eq. (2)

The effect of wave setup—approximated by the \({\beta }_{1}\overline{{\eta }_{baseline}^{* }}\) term—is generally larger for the locations on the east coast of India (Madras Atomic Power Station and Kovvada Atomic Power Project) as well as for locations towards the center/east of the GBM Delta (Gangamati and Mazher Char). On the other hand, the stations further west and upriver (Galachipa, Tazumuddin, Boyar Char) have smaller β1 coefficients and the effect of wave setup on total water levels. The outlier of this pattern is Kuakata, located around 10 km west of Gangamati, but with a significantly lower β1 coefficient.

The β coefficient associated with tide-surge interaction (β2) has a large magnitude for every site except Mazher Char. Mazher Char is located further inland and deeper in the GBM delta, likely resulting in less interaction between tides and surge. The remaining sites, however, all have β2 coefficients with large magnitudes, highlighting the importance of not only including tides in simulations of TC-induced flooding but also that altering the tidal stage with respect to landfall can greatly change the resulting surge.

In more concrete terms, the impact of wave setup and the nonlinear interactions between tides, meteorologically driven storm surge, and riverine flow often result in different return levels at sites of interest. Figure 3 shows the change in 1000- and 5000-year return levels for all five climate realizations as compared to the full model ensemble for both the linear superposition fit (Fit 1), as well as when the effects of waves are approximated by adding \({\alpha }_{1}{\eta }_{baseline}^{* }\) (Fit 2). While the effect of the nonlinear interactions between surge/tides/river currents varies depending on location and the climate model used, using linear superposition can result in return levels up to 25% lower than the full model ensemble. Conversely, by including only the effect of wave setup (superposition+waves), return levels up to 35% higher than the full model ensemble are calculated. These higher return levels are especially noticeable on the east coast of India at the Madras atomic power station and Kovvada atomic power projects. The reason that including only waves increases these return levels so drastically is that it does not capture the tide-surge interaction, which has a lowering effect on water levels in the region. It is vital to highlight that the purpose of this analysis is diagnostic rather than predictive. The regression framework is used only to rank the relative importance of the interaction terms and is not an attempt to reconstruct individual storm events.

Fig. 3: Comparison between 1000- and 5000-year return levels computed using the “full” model ensembles, linear superposition, and linear superposition + α1ηbaseline.
Fig. 3: Comparison between 1000- and 5000-year return levels computed using the “full” model ensembles, linear superposition, and linear superposition + α1ηbaseline.The alternative text for this image may have been generated using AI.
Full size image

Left column shows the % difference between superposition (orange “X” markers) and superposition + α1ηbaseline (green “X” markers) compared to the full ensemble 1000-year return levels. The right column shows the same for 5000-year return levels. a and b show difference at the Madras Atomic Power Station; c and d at the Kovvada Atomic Power Project; e and f at Gangamati; i and j at Boyar Char; k and l at Tazumuddin; m and n at Galachipa; and o and p at Kuakata.

Extreme value analysis

Storm-tide return levels at each site of interest are estimated using the full storm ensemble, and the extreme value analysis framework detailed in “Extreme value analysis methodology”. Figure 4 shows the return level curves for return periods of 10–5000 years for the five climate realizations. Shaded bands indicate the 90% confidence interval. Figure 5 shows the 1000- and 5000-year return levels at sites of interest for both the present climate and the mean of the four return levels calculated from the CMIP6 scenarios.

Fig. 4: Calculated storm-tide return levels from the full model ensemble at sites of interest under all five climate realizations.
Fig. 4: Calculated storm-tide return levels from the full model ensemble at sites of interest under all five climate realizations.The alternative text for this image may have been generated using AI.
Full size image

Return levels as calculated from the full model ensemble for: a Madras Atomic Power Station; b Kovvada Atomic Power Project; c Gangamati; d Mazher Char; e Boyar Char; f Tazumuddin; g Galachipa; and h Kuakata. Solid lines show the storm-tide return levels calculated from the full model ensemble using the Generalized Pareto Distribution with Peaks Over Threshold for present climate (dark blue); cnrm (orange); cmcc (green); ecearth (light blue); and hadgem (brown). 90% confidence intervals calculated via bootstrapping are denoted by the background fill. Point data shows the (mean) empirical return level calculated from the full model ensemble.

Fig. 5: 1000- and 5000-year storm-tide return levels at sites of interest for present and (mean) future climate as calculated via the full model ensemble.
Fig. 5: 1000- and 5000-year storm-tide return levels at sites of interest for present and (mean) future climate as calculated via the full model ensemble.The alternative text for this image may have been generated using AI.
Full size image

a, b show the 1000-year (5000-year) storm-tide return levels at sites of interest as calculated from the full model ensemble. c, d show the 1000-year (5000-year) mean storm-tide return levels across the four future climate scenarios (cnrm, cmcc, eceaerth, hadgem) as calculated from the full model ensemble.

Generally, long-term return levels decrease in the GBM Delta but increase along the east coast of India. In the GBM Delta, Boyar Char (Fig. 4e), and Tazumuddin (Fig. 4f) both exhibit lower—but still substantial—flood levels for return periods longer than 500 years. Further west, Galachipa (Fig. 4g), Gangamati (Fig. 4c), and Kuakata (Fig. 4h) see more modest but still noticeable decreases in long-term return levels, with the Hadgem scenario at Gangamati predicting an increased return level for return periods greater than ~2000 years. On the east coast of India, both the Madras Atomic Power Station (Fig. 4a), and Kovvada Atomic Power Project (Fig. 4b) see an increase in return level for all future climate scenarios. Finally, Mazher Char (Fig. 4d), located further inland in the middle of the GBM delta sees reductions in return levels under future conditions akin to those seen at the Tazumuddin and Boyar Char to the east.

While, generally speaking, the return levels on the GBM delta under these four future climate projections decrease, they are still substantial. For example, at Tazumuddin, the 1000-year return level under present climate conditions is around 8 m of storm-tide, while the mean of the four future climate scenarios predict a 1000-year return level of approximately 5 m. With the exception of Mazher Char, the sites on the GBM delta all see decreased return levels for both 1000- and 5000-year return periods of around 30% which, while substantial, still means that the 1000- and 5000-year storms would produce around 5 and 6.5 m of storm-tide respectively at Boyar Char, Tazumuddin, Galachipa, Gangamati, and Kuakata under mid-century conditions. In stark contrast, the Madras Atomic Power Station and Kovvada Atomic Power Project see increases in 1000-year (5000-year) return levels of 30 cm (50 cm) and 80 cm (1.5 m) or 16% (24%) and 47% (78%), respectively.

Across the GBM delta, projected decreases in return levels are consistent with changes in the STORM synthetic cyclone climatology, including fewer storms in several future realizations and shifts in genesis/track density toward the eastern Indian coast35. These changes reduce the frequency of high-intensity landfalling cyclones in the delta region.

To contextualize these changes in return levels, major historical cyclones in the region were simulated, and results were compared to our calculated return levels. Table 3 shows the equivalent return periods for the present and future scenarios of the largest historical storm surge at each reactor site based on these simulations. Cyclone Sidr was a category 4 TC which made landfall in Bangladesh in November of 2007. One of the deadliest storms to hit Bangladesh, Sidr is estimated to have resulted in at least 3447 deaths and caused $2.31 billion worth of damage36. The storm-tide caused by Sidr at Gangamati, Boyar Char, Tazumuddin, Galachipa, and Kuakata is the largest from the historical simulations and equivalent to a 70- to 210-year event under present climate conditions at those sites. Based on the future scenarios from the STORM dataset, storm-tides as extreme as those caused by Sidr would remain as uncommon or become far less common under future climate, with return periods ranging from 70 to over 3000 years. This aligns with the analysis from this study as well as observations of decreased cyclone intensity in the GBM delta from ref. 35.

Table 3 Return periods of the most extreme modeled storm-tide from historical storms at each study site under all scenarios

The Kovvada Atomic Power Project faces an increased future risk from storm-tide-induced flooding. The 1.71 m surge at Kovvada was caused by the category 4 Cyclone Hudhud in October of 2014, which, based on this study, corresponds to a 1190-year storm under present climate conditions. Located approximately 70 km north of Vishakhapatnam, which saw an observed storm-tide of approximately 2 m during Hudhud, the Kovvada site can expect this level of surge to have a return period between 60 and 350 years under future climate conditions. The remaining two sites (Mazher Char and the Madras Atomic Power Station) do not appear to have drastically changing return periods/levels under future climate conditions; however, the maximum surges from the historical storms simulated in this study (Thane and Aila) were not the result of particularly powerful storms. Cyclone Thane was a category 2 storm which occurred in 2011, making landfall approximately 100 km south of the Madras Atomic Power Station while Cyclone Aila maxed out at category 1 winds and made landfall approximately 150 km to the west of Mazher Char.

Discussion

In this study, we estimate storm-tide return periods on the order of hundreds to thousands of years at sites of critical infrastructure in the BoB. Using the STORM dataset, we simulated a thousand years of TCs under five different climate conditions and quantified the influence of tides, riverine discharge, mean sea level rise, and their nonlinear interactions on extreme coastal water levels. We show that using simple linear superposition to estimate the effects of flooding due to storm-driven surge, tides, and riverine flow gives between −20% to +40% differences in the estimates of long-term return levels (1000–5000 years). Commensurate with other studies24, we find that the impact of wave runup has the largest relative impact on increasing storm-tides as compared to linear superposition—especially along the east coast of India—the nonlinear interactions between tides and surge; mean sea levels and surge; and tides and mean sea levels are substantial and must be considered. While the numerical cost to simulate an ensemble of conditions for each storm is far from negligible, the potential difference in predicted return level has the potential to impact both the design of and decisions made regarding critical infrastructure. Moreover, reduced-order models, like the linear model described in “Sensitivity analysis”, offer a potential pathway to simulate much larger storm ensembles.

Of particular interest is the finding that—based on the scenarios used by STORM—the eastern GBM delta sees a lower risk from TC-induced flooding under future scenarios, while the eastern coast of India sees increases in risk due to storm-tides. These regional contrasts, even within the relatively small BoB, highlight the need for region-specific flood risk assessments when siting and designing coastal infrastructure.

The projected decrease in long-term return levels in the GBM delta is consistent with changes in synthetic storm occurrence and intensity in the STORM dataset. As noted in ref. 35, shifts in cyclogenesis in the BoB cause tropical cyclone to drift west, towards the eastern coast of India. This shift reduces the intensity that cyclones are able to attain prior to landfall as well as decreases the number of cyclones which make landfall in the GBM delta. In contrast, the east coast of India sees increased positive contributions to storm-tide from wave setup and a weaker negative tide-surge interaction than on the delta, contributing to increased storm-tide hazard at the mid-century.

Consistent with ref. 35, the STORM dataset finds that the GBM is one of two regions which have lower wind speeds under future climate scenarios. Other CMIP6 scenarios (or other algorithms for creating synthetic storms) could yield different return level predictions in the region22. The same holds true for the eastern coast of India, where we find generally higher risk due to storm-driven surge under the future climate. Other inputs to the ADCIRC+SWAN model also introduce uncertainty in the output. As described in “Methods”, riverine inflow was calculated from the historic record and does not take into account potential changes in the behavior of tributaries to the GBM delta that may have occurred by mid-century. If, for example, riverine flow were to drastically increase due to large amounts of precipitation, the calculated return levels at locations near the river could change. Additionally, the effects of land subsidence—which was not included in this study—would doubtless change return levels, particularly in the northern and eastern part of the BoB in Bangladesh, where subsidence rates of 1–7 mm per year have been observed10. More accurate predictions of riverine flow, as well as changes in mean sea level, and higher fidelity bathymetry, which accounts for future land subsidence, would all increase confidence in predicted return levels.

Future directions of inquiry for this work could include finding numerically efficient methods to simulate the remaining ~45,000 years of STORM synthetic cyclones. Running a larger number of storms would greatly increase confidence in the return levels and allow for more meaningful prediction of up to the 10,000-year storm. Such approaches include reducing the number of perturbations of each storm via analysis of the β coefficients calculated in “Sensitivity analysis”. For example, the small values of β3 at every site of interest indicate that there is little interaction between riverine flow and storm-tide, indicating that eliminating riverine flow perturbations may not drastically change predicted return levels. Additionally, more refined machine learning methodologies could be applied to the large amount of data already generated in this study and could be used to create an “emulator” of the full ADCIRC+SWAN model, which would allow for more rapid simulation of large amounts of storms. In addition to the remainder of the STORM dataset, other synthetic track datasets for the region, e.g., ref. 22, CMIP6 climate products, and reanalysis/forecasting products such as ERA537 could be used to force the numerical model. By using a more diverse set of wind products, more confidence in the predicted return levels will exist. Perhaps the most impactful improvement that could be made would be the inclusion of land subsidence in future climate simulations.

Our results highlight the importance of incorporating region-specific, high-resolution modeling frameworks into flood risk assessments for coastal infrastructure. As changing conditions result in different flood risks in coastal zones, the importance of integrating these findings into infrastructure design and disaster preparedness strategies becomes paramount. The demonstrated variability in flood risks across even the relatively small region of the BoB highlights the importance of localized analyses to account for nonlinear interactions between tides, storm surge, riverine discharge, and sea level rise.

Methods

Hydrodynamic model

For this study, version 56 of the ADCIRC model25,26,38 was used to perform hydrodynamic simulations of storm-tide and flooding. ADCIRC is a finite-element-based model that solves the Generalized Wave Continuity Equation (GWCE)39, a reformulation of the shallow water equations, on unstructured, triangle-based computational meshes. ADCIRC is widely used in modeling both tides and storm surge in coastal zones [e.g., refs. 40,41,42, among others]. In order to model wave-current interaction and the resultant wave setup, ADCIRC is coupled with the SWAN model27,43. SWAN is a spectral wave model that solves the action-balance equation. Both ADCIRC and SWAN are run on the same unstructured computational mesh.

ADCIRC simulations used a time step of 5 s, temporal weighting in the GWCE of 0.05 s−1, and the Smagorinsky closure model44,45 with a lateral mixing coefficient of 0.2. SWAN was simulated using a time step of 10 min and coupled to ADCIRC at every SWAN time step. SWAN has additional dimensions for wave frequency and direction. The frequency range is discretized on the interval \(\left[0.031384,1.420416\right]\) Hz using 41 bins with 36 directional bins (i.e., 10 degree spacing).

Model forcing

In order to capture nonlinear effects caused by the interaction of different water-level drivers, we run ADCIRC + SWAN forced with meteorology (surface winds and atmospheric pressure), astronomical tides, riverine discharge, and sea level rise. In this section, we detail our implementations of each of these forcings within the hydrodynamic model.

The STORM data set consists of 10,000 years of synthetic TCs. Each storm consists of the location of the center of the storm, the minimum atmospheric pressure, the maximum wind speed, and the radius of maximum winds at three-hourly time intervals. Timing information is limited to a year (from 0 to 9999) and the month during which the storm occurred. For present conditions (pc), STORM samples from the International Best Track Archive for Climate Stewardship (IBTrACS)29, covering the years 1980–2018. The resulting synthetic TC tracks match the statistical distribution of the 38 years of observed TCs30. Mid-century (2015–2050) scenarios, apply STORM to four different CMIP6 models: CMCC-CM2-VHR4 (cmcc), CNRM-CM6-1-HR (cnrm), EC-Earth3P-HR (ecearth), and HAdGEM3-GC31-HM (hadgem). Each model has a corresponding set of synthetic TCs32. The STORM dataset was subset to only include cyclones whose tracks come within 150 km of any floodplain node of the computational mesh. Following this subsetting, the pc scenario included 283 storms; cmcc included 175 storms; cnrm included 163 storms; ecearth included 363 storms; and hadgem included 233 storms. Figure 6 shows both the tracks of the subset of STORM as well as the relative frequency of the maximum category of each storm.

Fig. 6: Visualization of STORM synthetic tropical cyclone tracks used in the full model ensemble.
Fig. 6: Visualization of STORM synthetic tropical cyclone tracks used in the full model ensemble.The alternative text for this image may have been generated using AI.
Full size image

ae show the subset of STORM tracks for pc, cmcc, cnrm, hadgem, and ecearth climate scenarios, respectively. Each individual track is plotted with the white-to-red dots indicating the severity of the storm at that point. f shows the relative frequency of the maximum storm category on the Saffir–Simpson scale of the subset of storms.

Surface wind and atmospheric pressure fields are reproduced from STORM track data using a parametric cyclone model. For this study, the model described in ref. 46 (hereafter CLE15) was used due to its superior performance as described in ref. 47. Background winds fields are calculated with the model of ref. 48. In addition to the values of maximum velocity (Vmax), radius of maximum winds (Rmax), and minimum pressure (Pmin) provided by the STORM dataset, the CLE15 model has free parameters ck, cd, and wcool. ck and cd are the exchange coefficients of enthalpy and momentum respectively and, in this study, are calculated from the parametric relationship \({c}_{k}/{c}_{d}=0.00055{V}_{m}^{2}-0.0259{V}_{m}+0.763\)49. wcool is the magnitude of the radiative-subsidence rate in the free troposphere and has been estimated from climatology to be ≈2 mm s−1. The LC12 “background” wind field accounts for the translation component of surface wind from a moving TC48. determined that a background wind reduction factor of 0.55 and a rotation of 20 best matched observed wind fields from TCs.

Tidal forcing in the model comes from both the equilibrium tide forced at every node in the model and from enforcing tidal boundary conditions at the open ocean boundary in the south of the domain. Tidal elevations for the boundary are obtained from the TPXO9-atlas version 250. Both the boundary condition and the equilibrium tide were forced using the major 8 constituents (M2, Q1, O1, P1, K1, N2, S2, and K2).

In order to account for the impact of tidal stage when a TC makes landfall, each member of the STORM dataset was run at four different tidal stages: (1) high tide at landfall; (2) low tide at landfall; (3) maximum along-storm velocity at landfall; and (4) minimum along-storm velocity at landfall. These tidal conditions were calculated at the nearest computational node to the point where the TC makes landfall in the model. This methodology helps to account for the nonlinear impacts that tides can have on storm surge51.

The effect of riverine discharge on water levels is accounted for by including two major rivers as inflow conditions at the model boundary. These two rivers are the Meghna and Padma rivers. Due to the location of the northern model boundary, this condition captures the flow from the three major tributaries of the GBM delta. Riverine discharge is from the Global Flood Awareness System (GloFAS)33. GloFAS data covering the time period 1979 through 2022 at daily frequency were averaged on a monthly basis (mean monthly flow). For each month, three different flow conditions were considered: (1) maximum mean monthly flow; (2) mean mean monthly flow; and (3) minimum mean monthly flow.

In order to account for the impact of mean sea level rise on coastal water levels, a persistent offset derived from the Deltares Global Tide and Surge Model (GTSM) version 3.034 was applied at the open ocean boundary of the model. For the pc scenario, no mean sea level adjustment was applied since the bathymetry from which the numerical mesh was created is referenced to present-day mean sea level. For future conditions, the mid-century (2050) mean sea level rise was interpolated from the GTSM output and added at the open ocean boundary. The mean sea level (ηmsl) in Eqs (1) and (2) is calculated as the zero-phase constituent from the auxiliary simulations resulting from this additional offset at the boundary. It is critical to note that the GTSM model does not include the effects of land subsidence on mean sea levels10. estimates that the GBM delta is subsiding at rates between 1 and 7 mm per year, which would affect flooding levels in the region.

Computational mesh development and setup

The computational mesh used in this study was created using OceanMesh2D52, a MATLAB toolbox designed to create unstructured meshes for coastal domains. As can be seen in Fig. 1, the model covers the majority of the BoB, including the eastern coast of India and the Ganges River Delta. The mesh resolution ranges from 25 km in the deep-ocean to 500 m in coastal zones and on floodplains. OceanMesh2D determines target resolution via a number of parameters designed to ensure that the mesh adequately resolves bathymetric/topographic features. A “one-step” procedure was used to generate the mesh, wherein both the ocean and floodplain portions of the mesh are created simultaneously in order to ensure a smooth transition from ocean to land within the computational domain. Bathymetry for the model was interpolated primarily from GEBCO202053. Additionally, the oceanic portion of the bathymetric dataset described in ref. 12 was included to improve results along the coast of Bangladesh. Topography was interpolated from the SRTM15+ dataset54. Bathymetry and topography were modified via a “pit-filling” algorithm to help with the numerical stability of the wetting and drying algorithm.

The model uses parameterizations of both boundary layer dissipation and internal tide dissipation. Bottom friction is calculated using Manning’s n converted to bottom-drag coefficients using the relationship cd = (gn2)/H1/3, where cd is the drag coefficient, g is acceleration due to gravity, n is Manning’s n coefficient, and H is the total water column height. Internal tide dissipation is included using a modified55 parameterization that is a function of local bathymetric slopes and density gradients56,57. Bathymetric slopes are calculated from GEBCO2020 data using a cell-averaged approach.

Hydrodynamic solutions are often sensitive to dissipation, motivating the determination of an approximately optimal spatial distribution of coefficients for internal tide and bottom friction57. To this end, we divided the model domain into independent regions for Manning’s n and internal tide, where the vertices in each region are assigned a single value for the coefficient. 100 Manning’s n regions were generated using a k-means clustering algorithm, which grouped mesh vertices based on location, bathymetric depth (>0 m), and seabed lithology58,59. Separately, 10 internal tide dissipation regions were generated based on location, bathymetric depth (>1 km), and bathymetric slope. Using the Stochastic reduced-order Models with Python (SROMPy) library60, 100 randomized samples of combinations of Manning’s n and internal tide coefficient values were generated. For each sample, a tide-only simulation was performed, and the mean discrepancy (Eq. (3)) compared to tide gauges in the region was assessed. By using a functional minimization technique from SciPy61, the optimal (given the arrangement of regions and tidal error metric) friction coefficients were estimated. Note that overland Manning’s n values were excluded from this process and instead calculated from the Global Land Cover Characterization database62 based off of recommended values from the National Land Cover Database63.

Model validation

In order to assess the skill of the numerical model in accurately capturing extreme water levels, hindcasts assessing the accuracy of both tides and surge caused by historical cyclones were performed.

For tidal validation, a tide-only simulation forced with the 8 major constituents was performed. The model was given 10 days to “spin-up” and a harmonic analysis was performed using UTide64 on 354 days of water levels at stations. Comparisons are made between modeled tidal amplitudes and phases and those from the following observations: pelagic DART buoy 2340165; coastal tide gauges from the Global Extreme Sea Level Analysis dataset version 366 (GESLAv3); and coastal stations used in the validation of the models described in refs. 12,15. In order to assess both the amplitude and phase errors simultaneously, the root mean square discrepancy67 (D) was calculated for the dominant lunar constituent (M2); a combination of four major constituents (M2, S2, K1, and O1); and the combination of all 8 major constituents. D is calculated as:

$$D=\sqrt{0.5\mathop{\sum }\limits_{k}\left[{\left({A}_{o}^{k}\right)}^{2}+{\left({A}_{m}^{k}\right)}^{2}-2{A}_{o}^{k}{A}_{m}^{k}\cos \left({\theta }_{o}^{k}-{\theta }_{m}^{k}\right)\right]}$$
(3)

Where A and θ are amplitude and phase, the superscript k denotes the kth constituent, and the subscripts o and m denote observed and modeled, respectively. Mean values across all stations are 14.68, 16.74, and 14.71 cm for M2, 4 constituents, and 8 constituents, respectively. The 4 constituent error value is larger than the 8 constituent because of the inclusion of tide gauges where only 4 constituents are publicly available. When only stations with observational data for all 8 constituents are included in the calculation of error statistics the mean error values are 12.29, 14.09, and 14.71 cm. Full error statistics across all stations can be found in Table S1. Figure S1 shows scatter plots of the amplitude and phase errors.

Model simulations are compared against water level observations for TCs in the historical record (Fig. 7c). The best track data for each storm were retrieved from the Joint Typhoon Warning Center (JTWC) Best Track Archive. Like the STORM dataset, the parameters available in the best track archive can be used to compute wind and pressure fields using the CLE15+LC12 model. Each storm was simulated using the TPXO9-atlas tidal forcing and GloFAS riverine forcing consistent with the time of the storm. Both event maximum storm-tide—defined as the total water level incorporating the impact of meteorology, tides, and riverine flow—and storm surge—defined as the total water level minus the tidal water level, also called a non-tidal residual—of the model are compared to observations (Fig. 7a, b). Root mean square errors (RMSE) for maximum storm-tide and storm surge are 71 and 66 cm, respectively. Observations where the storm center came within 50–150 km of the station, as well as the extreme surges, are fairly well represented. The negative mean bias is mostly associated with further afield observations—e.g., storm-tide at Galachipa during Cyclone Amphan—where the parametric TC winds are too weak. Parametric models better capture peak winds near the center of a storm but do not necessarily resolve the external winds. When these same historical storms are simulated using a reanalysis wind product such as ERA537, stations that are further from the center of the storm are better captured (see Fig. S2). At the same time, ERA5 underestimates high-intensity TC winds and hence some of the larger water levels that we capture with the parametric model. Overall, given that the tidal results of the model are comparable to others for the region12,15, and that the extreme surges are relatively well captured, we believe that these results provide adequate confidence in the model and, subsequently, the projections of long-term flood return periods.

Fig. 7: Validation data for the Bay of Bengal model comparing modeled and observed storm-tides and storm-surges.
Fig. 7: Validation data for the Bay of Bengal model comparing modeled and observed storm-tides and storm-surges.The alternative text for this image may have been generated using AI.
Full size image

a Modeled versus observed event-maximum storm-tides for stations with available observations in the model domain. b Modeled versus observed event-maximum storm-surges for stations with available data in the model domain. c Station/Storm observation availability matrix. The number labels in (a, b) correspond to the values in (c). For example, point (1) in both (a, b) shows the storm-tide (storm-surge) at Chittagong caused by Cyclone Aila. The colors of points in (a, b) denote the minimum distance from the storm track to the station.

Extreme value analysis methodology

Each 1000 year period (pc, cmcc, cnrm, ecearth, and hadgem) was sampled separately with replacement 200 times. The peak storm-tide was taken as the highest water level from the ensemble member within 5 km of each site of interest. For each sample, one of the 12 storm perturbations (excluding the baseline case) was selected. Thus, for example, sample number 1 of the pc scenario might include: storm 1 with low tidal stage and minimum riverine flow, storm 2 with maximum along-storm tidal flow with maximum riverine flow, …, and storm 283 with high tidal stage and mean riverine flow. Sample number 12 of the hadgem scenario might include: storm 1 with high tidal stage and maximum riverine flow, storm 2 with low tidal stage and minimum riverine flow, …, and storm 233 with minimum along-storm tidal flow and maximum riverine flow. The resulting samples were individually fit using a Generalized Pareto Distribution (GPD) with peaks over threshold (POT)68,69. The GPD is commonly used to model the heavy tails of environmental extremes, including TC-driven coastal flooding68,70. The cumulative distribution function takes the form:

$$F(x)=\left\{\begin{array}{lc}1-{(1-k\frac{x-\mu }{\sigma })}^{1/k} & \,\text{for}\,k\ne 0\\ 1-{e}^{-\frac{x-\mu }{\sigma }} & \,\text{for}\,k=0\end{array}\right.$$
(4)

where,

$$\begin{array}{rcl}k & = & \,\text{shape parameter}\\ \mu & = & \,\text{location parameter}\\ \sigma & = & \,\text{scale parameter}\end{array}$$

To ensure that the GPD is properly fit to the tail of the distribution, POT is used to select only values in the sample greater than a given threshold, tμ. The best methodology for threshold selection is a widely discussed topic, with approaches ranging from graphical methods, to various goodness of fit tests, to parameter stability analyses68,71. Due to the large number of samples to be fit in this study, the automated methodology of ref. 72 was adopted. This approach uses the modified Anderson Darling (\({A}_{R}^{2}\)) test73 and p-value minimization to find the “best” threshold. \({A}_{R}^{2}\) is calculated as:

$${A}_{R}^{2}=\frac{n}{2}-\mathop{\sum }\limits_{i=1}^{n}\left[\left(2-\frac{2i-1}{n}\right)\log \left(1-{z}_{i}\right)+2{z}_{i}\right]$$
(5)

where xi are sorted samples x1x2xn, n is the number of samples and zi = F(xi) are the probabilities that xi was drawn from the parametric distribution F(x) fit to the sample using the L-Moments method74. This method of estimating the shape, location, and scale parameters of a GPD is highly computationally efficient, greatly reducing the time required to fit large numbers of thresholds. Shape, location, and scale parameters are estimated as:

$$\begin{array}{rcl}k & = & (3{t}_{3}-1)/(1+{t}_{3})\\ \sigma & = & {I}_{2}(1-k)(2-k)\\ \mu & = & {I}_{1}-\sigma /(1-k)\\ {I}_{1} & = & {b}_{0}\\ {I}_{2} & = & 2{b}_{1}-{b}_{1}\\ {I}_{3} & = & 6{b}_{2}-6{b}_{1}+b0\\ {t}_{3} & = & \frac{{I}_{3}}{I+2}\\ {b}_{k} & = & \frac{1}{n}\mathop{\sum }\limits_{i=k+1}^{n}\frac{(i-1)(i-2)\cdots (i-k)}{(n-1)(n-2)\cdots (n-k)}{x}_{i}\end{array}$$

To account for the effect of sample size on the parametric fit, the complement of the p-value (1 − p)—where p is defined as the probability that a value greater than or equal to \({A}_{R}^{2}\) was drawn from a parametric distribution given shape parameter k and number of samples n—is minimized in favor of minimizing \({A}_{R}^{2}\). As shown in ref. 72, the distribution of \({A}_{R}^{2}\) calculated from a parametric distribution is insensitive to scale and location parameters. p-values were calculated via bootstrapping for \(n\in \left[5,363\right]\) and \(k\in \left[-3.00\right.\), − 2.99, − 2.98, …, 2.98, 2.99, \(\left.3.00\right]\).

The procedure used to calculate return levels for each scenario can be summarized as:

  • Create 200 samples of ηmax of the 1000 year period, randomly selecting a storm perturbation with replacement.

  • For each sample j, sort the maximum water levels ηmax,i such that ηmax,1ηmax,2ηmax,nstorms where nstorms is the number of storms for that scenario.

  • Identify the potential thresholds (tk) for sample j as the unique values of ηmax,i.

  • Perform a GPD fit using L-Moments for each tk on the n values ηmax, itk.

  • Calculate \({A}_{R}^{2}\) and 1 − p for each tk. \({A}_{R}^{2}\) and 1 − p are functions of the estimated shape parameter and the number of points n used to perform the fit.

  • Select tk (and corresponding GPD parameters) that minimizes 1 − p.

  • From the 200 samples, calculate mean return periods and confidence intervals (via non-parametric bootstrapping) from each GPD fit.

Figure S3 shows an example of fitting to one individual sample (steps 2–6).