Toward skillful forecasting of super El Niño events using a diffusion-based westerly wind burst parameterization

Ji, Chaopeng; Mu, Mu; Qin, Bo; Lian, Tao; Yuan, Shijin; Feng, Jie; Song, Xunshu; Wei, Yuntao; Dai, Guokun; Wang, Jinyu; Fang, Xianghui

doi:10.1038/s41612-025-01158-x

Download PDF

Article
Open access
Published: 19 July 2025

Toward skillful forecasting of super El Niño events using a diffusion-based westerly wind burst parameterization

Chaopeng Ji^1,2,3,
Mu Mu^1,2,3,
Bo Qin^1,2,3,
Tao Lian^4,5,6,
Shijin Yuan^7,8,9,
Jie Feng⁴,
Xunshu Song⁴,
Yuntao Wei^1,2,3,
Guokun Dai^1,2,3,
Jinyu Wang^1,2,3 &
…
Xianghui Fang^1,2,3

npj Climate and Atmospheric Science volume 8, Article number: 273 (2025) Cite this article

1884 Accesses
4 Citations
Metrics details

Subjects

Abstract

Forecasting super El Niño remains challenging, partly due to poor representation of westerly wind bursts (WWBs). We developed an artificial intelligence-based denoising diffusion probabilistic model (DDPM) to skillfully parameterize WWBs, capturing their joint modulation by oceanic and atmospheric processes. The DDPM-based scheme effectively captures observed WWBs’ characteristics (e.g., frequency, intensity, and spatial center). When implemented in the Community Earth System Model, it outperforms both the control (CTRL, without WWBs parameterization) and conventional warm pool eastern edge (WPEE)-dependent parameterization in predicting intensity and seasonal phase-locking for super El Niños (1982/83, 1997/98, 2015/16). This improvement stems from DDPM’s realistic WWBs representation, correcting CTRL and WPEE’s biases of overly weak and westward-shifted winds during El Niño growth. Consequently, DDPM produces more realistic eastern Pacific sea surface temperature anomaly warming patterns. These findings underscore WWB's accuracy as key to super El Niño prediction and demonstrate machine learning’s potential for WWB's parameterization.

Early signs of strong El Niño–Southern Oscillation episodes

Article Open access 18 May 2025

Diverse NPMM conditions deviate the 2023/24 El Niño from the 1997/1998 and 2015/2016 extreme El Niño events

Article Open access 31 March 2025

Change of El Niño onset location around 1970

Article Open access 05 July 2024

Introduction

El Niño is the strongest interannual climate signal on Earth¹, driving significant global impacts on weather, ecosystems, and economies^2,3. Among these, super (or extreme) El Niño events have attracted significant attention due to their broader spatial extent and stronger amplitude of sea surface temperature (SST) anomalies, which lead to more severe meteorological and climatic disasters^4,5. However, considerable uncertainty remains in forecasting the amplitude of super El Niño events^6,7. The uncertainties in forecasting super El Niño events stem from multiple sources, including (but not limited to) complex inter-basin interactions among the Indian Ocean, Atlantic Ocean, and Pacific Ocean⁸, the nonlinear response of the atmosphere to oceanic processes⁹, and high-frequency atmospheric stochastic forcing¹⁰. Notably, a key manifestation of this stochastic forcing is westerly wind bursts (WWBs). Extensive research from observational, modeling, and theoretical perspectives has consistently demonstrated that WWBs play a pivotal role in shaping super El Niño evolution by injecting wind energy into the western and central Pacific^11,12,13. This process enhances eastward zonal current anomalies and generates eastward-propagating downwelling equatorial Kelvin waves, which deepen the thermocline and facilitate the eastward migration of the warm pool^14,15,16. Collectively, these dynamical mechanisms contribute to surface warming in the central and eastern equatorial Pacific, leading to the development of El Niño. For instance, Lian and Chen (2021) demonstrated that the strong and persistent WWBs observed in March 1997 were a necessary condition for the onset of the super 1997 El Niño event. Similarly, numerous studies have highlighted that the contrasting intensities of the 2014/15 and 2015/16 El Niño events were largely driven by the frequency and intensity of WWBs^17,18,19. Thus, the ability to realistically simulate WWBs during super El Niño development is vital for predicting the spatiotemporal evolution of super El Niño^20,21.

However, many numerical models exhibit significant biases in simulating WWBs^22,23,24. This underscores the necessity for the widespread application of WWB's parameterization schemes to thoroughly investigate their impacts on the diversity and predictability of El Niño^13,25,26,27. For example, studies indicated that the occurrence of WWBs is modulated by oceanic conditions, manifesting as a type of semi-stochastic (or multiplicative) noise of El Niño^14,28,29. This motivated Gebbie et al.³⁰ to develop a now widely used semi-stochastic parameterization scheme for WWBs where WWB occurrence is influenced by SST (see “Methods” section). Beyond this success, subsequent studies also highlighted the importance of atmospheric internal variability in influencing the formation of WWBs^31,32,33. For instance, the convective phase of the Madden-Julian Oscillation³⁴ (MJO) is frequently associated with an increased likelihood of WWBs^33,35, and Lian et al.³¹ highlighted that nearly 70% of WWBs are closely linked to tropical cyclone (TC). Thus, it is essential to consider oceanic and atmospheric variabilities simultaneously to capture the complexity of WWBs comprehensively.

In recent years, artificial intelligence (AI) has been widely used in atmospheric science and achieved remarkable advancements^36,37,38. It offers innovative approaches for parameterizing WWBs. For example, building on the foundational work of Gebbie et al.³⁰, You et al.³⁹ developed a neural WWBs parameterization scheme leveraging AI techniques, incorporating oceanic and atmospheric variables as predictors. However, these parameterization schemes enforce fixed spatiotemporal structures on WWBs and are limited to deterministic representations of only a few physical parameters. Additionally, since WWBs are only one component of high-frequency zonal wind (HFZW) anomalies, the WWB models may not objectively reproduce the uncertainty of the spatial-temporal evolution of WWBs comprehensively without including the realistically full spectral components of HFZW anomalies.

Moreover, given the impact of WWBs on El Niño forecasts, as well as the unpredictable nature of WWBs on the seasonal forecasting timescale, it is more crucial to estimate their occurrence likelihood throughout the season than to predict their exact timing⁴⁰. As an attempt, Ji et al.²⁷ developed an El Niño ensemble forecasting framework based on WWBs ensemble forecasting, which improves the forecasting skills of El Niño since it better accounts for interactions across different timescales than the widely used initial condition-based framework. However, the WWB's parameterization scheme used by Ji et al.²⁷ considers only the role of the ocean. Developing a probability parameterization scheme for WWBs that simultaneously accounts for both oceanic and atmospheric processes is essential for capturing their associated uncertainties in the super El Niño forecast. In this paper, we aim to develop a more skillful parameterization for WWBs based on the Denoising Diffusion Probabilistic Model (DDPM, see “Methods” section), a state-of-the-art generative framework in AI. The DDPM-based parameterization is then integrated online into the Community Earth System Model (CESM) to systematically investigate the influence of WWBs on the prediction of super El Niño events.

Results

Evaluation of the new DDPM-based WWBs parameterization

First, we evaluated the simulation performance of WWBs across four DDPM-based parameterization schemes with different conditional physical variable combinations (i.e., SST anomalies (SSTA), outgoing longwave radiation anomalies (OLRA), and sea level pressure anomalies (SLPA)) designed to capture the spatiotemporal characteristics and stochastic nature of WWBs (see “Methods” section). As shown in Fig. 1a, e, the [SSTA] generates almost no WWBs. This discrepancy may arise because SSTA evolves relatively slowly, making it challenging for AI models to establish a robust mapping between slow-varying SSTA and rapidly varying HFZW anomalies. This limitation is corroborated by the power spectrum of HFZW anomalies (Fig. S1), which reveals that the HFZW anomalies generated by [SSTA] are predominantly characterized by low-frequency variability. As a result, these models fail to accurately capture the high-frequency, episodic nature of WWBs. With the OLRA included in [SSTA, OLRA], both the WWB intensity (NCWI below, see “Methods” section) and numbers increase substantially (Fig. 1a, b). However, their longitudinal center (LonCen, see “Methods” section) is too concentrated in the western Pacific to capture the widespread character in the observation (Fig. 1c, f), which was argued to be important in inducing the El Niño diversity⁴¹. On this basis, including SLPA as another input parameter in [SSTA, OLRA, SLPA] further refines the simulation details of the WWBs numbers and NCWI (Fig. 1a, b), supporting the finding of Lian et al.²³ that WWBs are closely associated with TCs. Besides, the [SSTA, OLRA, SLPA] significantly improves the simulation of WWBs occurrence probability in each month (Fig. 1d), which is crucial for WWBs and El Niño ensemble forecasting²⁷.

**Fig. 1: Comparative analysis of WWB physical characteristics in DDPM simulations with different parameterizations.**

Moreover, given the extensive research emphasizing the regulatory effects of SSTA on WWBs^28,42, we trained [OLRA, SLPA] using OLRA and SLPA as constraints to further investigate the respective roles of oceanic and atmospheric processes in WWBs. The results indicate that compared to the [SSTA, OLRA, SLPA], the [OLRA, SLPA] simulates fewer and weaker WWBs (Fig. 1a, b), with a slight bias in the simulated monthly occurrence probability of WWBs (Fig. 1d). Additionally, the LonCen of WWBs in the [OLRA, SLPA] are more concentrated in the western Pacific (Fig. 1c, g), showing an obvious bias compared to observations. These results suggest that in the AI model, SSTA may provide a conducive environment for WWBs occurrence and regulate their central location, while the frequency and intensity of WWBs are primarily dominated by atmospheric internal variability. In other words, the regulatory effect of SSTA on WWBs requires the cooperation of atmospheric internal variability to better characterize the various physical attributes of WWBs, which is consistent with the findings of Liang et al.⁴³ in numerical models. We also compared the [OLRA, SLPA] and [SSTA, OLRA, SLPA] regarding WWBs’ maximum amplitude, latitudinal center, zonal range, duration, monthly frequency, and numbers of each year between 2011 and 2022. The [SSTA, OLRA, SLPA] demonstrated superior simulation capability in all these aspects (Figs. S2–S3).

Furthermore, we compared the performance of the [SSTA, OLRA, SLPA] with the traditional warm pool eastern edge (WPEE, see “Methods” section)-dependent WWBs parameterization³⁰ in capturing key characteristics of WWBs, including their LonCen, monthly occurrence accuracy, numbers, and NCWI. As illustrated in Fig. 2, the [SSTA, OLRA, SLPA] significantly outperforms the WPEE-based approach in representing the physical features of WWBs. This result underscores the importance of incorporating atmospheric variability into the parameterization of WWBs^35,39 and the high efficiency of the diffusion-based AI scheme in doing this. By better capturing the state-dependent nature of WWBs and their interactions with large-scale air-sea processes, the [SSTA, OLRA, SLPA] is believed to provide a more robust framework for understanding and predicting the role of WWBs in super El Niño events.

**Fig. 2: Differences in simulated WWB characteristics between DDPM and WPEE parameterizations.**

Embedding DDPM-based WWBs parameterization into CESM and forecasting of super El Niño

The evaluation above demonstrates that the [SSTA, OLRA, SLPA] achieves the best performance in simulating WWBs among the diffusion-based frameworks. Therefore, in the subsequent forecast experiments focusing on super El Niño events, we exclusively coupled the [SSTA, OLRA, SLPA] with the CESM, which is compared with the one that adopts the traditional WPEE-dependent WWBs parameterization. For convenience, the super El Niño forecast experiments incorporating the two parameterizations are termed as “DDPM” and “WPEE” in the following text, respectively (see “Methods” section).

Figure 3 presents the observed, control (CTRL, CESM without WWBs parameterization), WPEE, and DDPM forecast experiments for the Niño3.4 index over a 12-month lead time, initialized in February and May (i.e., before and end of the boreal spring season) of 1982, 1997, and 2015, respectively. As shown, the CTRL consistently underestimates the intensity of super El Niño events, aligning with findings from previous studies^44,45. In the WPEE experiment, the inclusion of WWBs partially improves the predicted El Niño intensity, but the overall underestimation persists. In contrast, the DDPM experiment demonstrated significant improvements in predicting the intensity of super El Niño events. Notably, the spread of ensemble members (the green shade in Fig. 3) effectively encompassed observations, indicating the reliability of the DDPM approach. This improvement stems from the DDPM’s more accurate representation of WWBs (Fig. 2), which better captures their critical role in establishing the super El Niño events^46,47. Additionally, while observations indicate that all three El Niño events peak in December (i.e., seasonal phase-locking), the CTRL and WPEE exhibit significant biases in capturing this character. When initialized in February, CTRL and WPEE experiments exhibit a distinctive double-peak evolution: an initial Niño3.4 peak at 6-month lead time (summer) followed by weakening and subsequent re-intensification to a second peak (winter) at lead 12 months, contrasting with the observed steady intensification toward a single December peak (Fig. 3a–c). Notably, we emphasize that in the 1997 case, all three experimental configurations reproduce the observed December peak in El Niño development (Fig. 3b). For May initializations (post-spring when air-sea coupling is better established), while the double-peak feature weakens, the predicted peaks still lag observations by approximately one month (Fig. 3d–f). These errors reflect common seasonal phase-locking prediction biases prevalent in many complex climate models^48,49,50. Notably, the DDPM experiment efficiently overcomes all these shortcomings and accurately predicts the evolution of all these super El Niño events. This hints that improved representation of high-frequency atmospheric processes like WWBs in climate models may help to mitigate the seasonal phase-locking bias of El Niño. Moreover, forecast experiments for the 1994/1995 and 2009/2010 moderate El Niño events showed DDPM’s improvement was slightly smaller than for super El Niño, but still yielded the best overall forecast results (Fig. S4).

**Fig. 3: Observed and forecasted Niño3.4 index.**

To better illustrate the spatiotemporal evolution, Fig. 4 presents the SSTA along the equatorial (5°S–5°N mean) Pacific from observations and the three forecast experiments (taking the 1997/98 event as an example, the results for the other two events are similar, as illustrated in Figs. S5–S8). The observed SSTA exhibits a broad spatial distribution spanning the central and eastern Pacific (Fig. 4a, e). In contrast, the major warming in the CTRL experiment is primarily confined to the far eastern Pacific, with only limited westward extension (Fig. 4b, f). Consequently, the SSTAs in the Niño3.4 region (black dashed boxes) are significantly underestimated compared to observations. Additionally, although manifesting weak warming within the Niño3.4 region, the CTRL exhibits a distinct seasonal double-peak structure (i.e., summer and winter), markedly diverging from observations (Fig. 4a, b, e, f). These findings align with the phase-locking biases identified in Fig. 3. The WPEE experiment produces a more westward-extended warming pattern compared to CTRL (Fig. 4c, g), exhibiting broader spatial coverage across the eastern Pacific. While this configuration partially mitigates the characteristic double-peak bias in the Niño3.4 index evolution (Fig. 3), its impact remains substantially limited—the simulated SSTA in the Niño3.4 region still exhibits pronounced underestimation compared to observations. Furthermore, the slower westward propagation of SSTA, combined with the unrealistic weakening of anomalies in the far eastern Pacific, causes a delayed peak in the Niño3.4 region relative to observations, leading to seasonal phase-locking biases in forecasts (Fig. 4f, g). In contrast, the DDPM experiment demonstrates notable improvements over CTRL and WPEE in terms of both the spatial distribution and intensity of SSTA (Fig. 4d, h).

**Fig. 4: Observed and forecasted monthly mean SSTA.**

Mechanisms of DDPM in improving super El Niño prediction

The results above demonstrate the significant advantages of the DDPM experiment in improving the prediction of super El Niño events. To understand the underlying physical mechanisms, Fig. 5 illustrates the spatiotemporal evolution of observed and predicted monthly mean westerly wind stress and sea surface height anomalies (SSHA). It is evident that, compared to observations, the westerly wind stress anomalies in the CTRL and WPEE experiments are weaker and more confined to the western Pacific. This results in a reduced propagation time for the upwelling Rossby waves (excited by westerly wind stress) to reflect at the western boundary as Kelvin waves, thereby more rapidly counteracting the eastward-propagating downwelling Kelvin waves⁵¹. Consequently, the SSHA and SSTA in the eastern Pacific are significantly weakened, as the positive feedback has less time to amplify them⁴¹. Although the westerly wind stress anomalies in CTRL and WPEE gradually strengthen and shift eastward with the development of SSTA, the anomalies show a systematic westward displacement compared to observations. This systematic bias fundamentally impairs the models’ capacity to sustain the Bjerknes feedback, leading to biases in the prediction of El Niño’s seasonal phase-locking and intensity. In contrast, the DDPM experiment, with its stronger and more eastward-shifted westerly wind stress anomalies that closely align with observations, facilitates the accumulation of downwelling Kelvin waves in the eastern Pacific, manifesting as an enhancement of positive SSHA in this region. This process intensifies positive SSHA in the region, thereby strengthening the Bjerknes feedback⁵² and accelerating the amplification of SSTA. Simultaneously, the continuous growth of SSTA further increases the probability of WWBs occurrence, creating a positive feedback loop that amplifies the SSTA. The improved representation of these dynamical interactions in the DDPM framework highlights its superior ability to capture the key mechanisms driving super El Niño events, leading to more accurate predictions of their seasonal phase-locking and intensity. These results demonstrate that improved representation of WWBs’ characteristics contributes to reducing model systematic biases, thereby improving the forecast skill for super El Niño events.

**Fig. 5: Same as Fig. 4, but for SSHA (units: m).**

Discussion

This study first developed a skillful parameterization scheme for WWBs using an AI-based diffusion model, which effectively reproduces multiple observed physical attributes of WWBs. Based on this scheme, we further revealed that WWBs, as a form of multiplicative noise on the interannual timescale, originate from oceanic states that provide the background conditions for WWBs formation and regulate their central location. Meanwhile, internal atmospheric processes (e.g., MJO, TC) influence the intensity and frequency of WWBs. Therefore, WWBs cannot be solely attributed to SSTA regulation in the AI model as in previous parameterization schemes^26,30. This is because SSTA regulation of WWBs requires coordination with internal atmospheric variability.

We then incorporate DDPM-based parameterization, along with a conventional WWBs parameterization dependent on the WPEE, into the CESM to conduct ensemble forecasting experiments for three historical super El Niño events. The results indicate that the DDPM scheme significantly improves the prediction of El Niño intensity and mitigates the seasonal phase-locking bias in comparison to the CTRL and WPEE experiments. This improvement primarily stems from the ability of the DDPM scheme to better characterize WWB occurrences, thereby mitigating the systematic biases in the CTRL and WPEE experiments, which tend to produce weaker and more westward-displaced westerly wind stress anomalies. As a result, SSTAs in the eastern Pacific are better maintained and progressively develop westward, exhibiting spatiotemporal evolution and intensity that closely align with observations.

Our results highlight the critical role of accurately representing WWBs in improving super El Niño predictions, while demonstrating the efficacy of AI-based approaches in addressing this longstanding challenge. Furthermore, as emphasized in the introduction, the predictability limit of WWBs on interannual timescales implies that any deterministic forecast of WWBs inherently carries considerable uncertainty, which inevitably propagates into super El Niño predictions. Our findings suggest that ensemble forecasting may serve as an effective strategy for addressing the interactions between phenomena across different timescales, such as WWBs and El Niño, thereby improving the reliability of climate predictions.

Methods

Data

The National Oceanic and Atmospheric Administration daily Optimum Interpolation Sea Surface Temperature (OISST⁵³) and daily OLR spanning 1981–2022 were used⁵⁴. Daily 10-m zonal wind and SLP data were obtained from the National Centers for Environmental Prediction–National Center for Atmospheric Research (NCEP-NCAR) Reanalysis 2 Project⁵⁵, spanning 1979–2022. Additionally, daily SST, OLR, SLP, and 10-m zonal wind data from the European Centre for Medium-Range Weather Forecasts Reanalysis v5 (ERA5⁵⁶) were also used, covering the same period (1979–2022). Daily anomalies within our analysis were defined as deviations from the 30-year climatological mean (1981–2010). Then, a 60-day high-pass filter was applied to the 10-m zonal wind anomalies to further isolate the high-frequency components. The observational data for monthly SST, SSH, and wind stresses were sourced from the Global Ocean Data Assimilation System (GODAS⁵⁷).

Mathematical formulation of DDPM

DDPMs are a class of generative models designed to model the gradual transformation of data from a simple, known distribution, such as Gaussian noise, into more complex distributions, like those found in real-world atmospheric states⁵⁸. DDPM has been successfully applied in various domains, including image synthesis⁵⁹, and natural language processing⁶⁰. More recently, their application in atmospheric science has gained attention, particularly in ensemble forecasting, due to their probabilistic framework for forecasting^61,62.

The fundamental concept underlying DDPM is to simulate a forward diffusion process in which data is incrementally corrupted by noise, which is then followed by a reverse denoising process where the model iteratively learns to remove the noise to eventually reconstruct the original data. This reverse denoising process can subsequently be employed to generate new data, beginning with pure noise and denoising it through iterative refinement.

1.
Forward Diffusion Process

The forward diffusion process gradually adds noise to the data over a series of discrete time steps, effectively converting the data distribution ${x}_{0}$ into a simple Gaussian distribution ${x}_{T}(T\to +\infty )$. This process is often modeled as a Markov chain, where the data at each time step $t$ is conditioned only on the data at the previous time step $t-1$:
$$q({x}_{t}|{x}_{t-1})={\mathcal{N}}\left({x}_{t};\sqrt{1-{\beta }_{t}}{x}_{t-1},{\beta }_{t}{\boldsymbol{I}}\right)$$
(1)
where ${x}_{t}$ represents the data at time step $t$; ${\beta }_{t}$ is a small positive scalar (linear interpolation from 0.0001 to 0.02) that controls the variance of the noise added at each step. ${\mathcal{N}}{\mathscr{(}}\cdot ;\mu ,\sum )$ denotes a Gaussian distribution with mean $\mu$ and covariance $\sum$; ${\boldsymbol{I}}$ is the identity matrix.

The cumulative effect of this forward process over $T$ steps can be described by:
$$q({x}_{t}|{x}_{0})={\mathcal{N}}{\mathscr{(}}{x}_{t};{\sqrt{\bar{\alpha }}}_{t}{x}_{0},{(1-\bar{\alpha }}_{t}){\boldsymbol{I}})$$
(2)
where ${\alpha }_{t}=1-{\beta }_{t}$ and ${\bar{\alpha }}_{t}={\prod }_{s=1}^{t}{\alpha }_{s}$

This formulation indicates that ${x}_{t}$ is a noisy version of the original data ${x}_{0}$, with noise increasing as $t$ approaches $T$:
$${x}_{t}={\sqrt{\bar{\alpha }}}_{t}{x}_{0}+\sqrt{{1-\bar{\alpha }}_{t}}{\xi }_{t}$$
(3)
Where ${\xi }_{t} \sim {\mathcal{N}}{\mathscr{(}}0,{\bf{I}})$.
2.
Reverse Diffusion Process

The reverse diffusion process aims to recover the original data from the noisy data ${x}_{t}$ by learning a model that approximates the reverse Markov chain. When ${\beta }_{t}$ is small enough, the inverse process is also a Gaussian distribution:
$${p}_{\theta }({x}_{t-1}|{x}_{t})={\mathcal{N}}\left({x}_{t-1};{\mu }_{\theta }\left({x}_{t},t\right),\sum _{\theta }({x}_{t},t)\right)$$
(4)
where ${\mu }_{\theta }({x}_{t},t),{\sum }_{\theta }({x}_{t},t)$ are functions (often parameterized by neural networks) that predict the mean and variance of ${x}_{t-1}$ given ${x}_{t}$ and the time step $t$. $\theta$ represents the parameters of the model. The goal of training is to learn the parameters $\theta$ such that the reverse process accurately inverts the forward process, ultimately leading to the reconstruction of the original data ${x}_{0}$.
3.
Training Objective

It is not practical to directly calculate the distribution of the inverse operation of adding noise for all data ${p}_{\theta }({x}_{t-1}|{x}_{t})$. However, if a training set is used as input ${x}_{0}$, it allows us to approximate ${p}_{\theta }({x}_{t-1}|{x}_{t})$ effectively:

$$q({x}_{t-1}|{x}_{t},{x}_{0})=q({x}_{t}|{x}_{t-1},{x}_{0})\frac{q({x}_{t-1}|{x}_{0})}{q({x}_{t}|{x}_{0})}$$

(5)

where $q({x}_{t-1}|{x}_{t},{x}_{0})$ shows the inverse operation of adding noise and its mean and variance need to be determined. $q({x}_{t}|{x}_{t-1},{x}_{0})={\mathcal{N}}\left({x}_{t};\sqrt{1-{\beta }_{t}}{x}_{t-1},{\beta }_{t}{\boldsymbol{I}}\right)$ denotes the distribution of the added noise. Since ${x}_{0}$ is known, we have:

$$q({x}_{t}|{x}_{0})={\mathcal{N}}{\mathscr{(}}{x}_{t};\sqrt{{\bar{\alpha }}_{t}}{x}_{0},{(1-\bar{\alpha }}_{t}){\boldsymbol{I}})$$

(6)

$$q({x}_{t-1}|{x}_{0})={\mathcal{N}}{\mathscr{(}}{x}_{t-1};\sqrt{{\bar{\alpha }}_{t-1}}{x}_{0},{(1-\bar{\alpha }}_{t-1}){\boldsymbol{I}})$$

(7)

Substituting Eqs. (6) and (7) into Eq. (5), we can get:

$$q({x}_{t-1}|{x}_{t},{x}_{0})={\mathcal{N}}\left({x}_{t-1};{\widetilde{\mu }}_{t}\left({x}_{t},{x}_{0}\right),{\widetilde{\beta }}_{t}{\bf{I}}\right)$$

(8)

where ${\widetilde{\mu }}_{t}$ and ${\widetilde{\beta }}_{t}$ represent the mean and variance of $q({x}_{t-1}|{x}_{t},{x}_{0})$, respectively.

$${\widetilde{\mu }}_{t}=\frac{1}{\sqrt{{\alpha }_{t}}}\left({x}_{t}-\frac{1-{\alpha }_{t}}{\sqrt{1-{\bar{\alpha }}_{t}}}{\xi }_{t}\right)$$

(9)

$${\widetilde{\beta }}_{t}=\frac{1-{\bar{\alpha }}_{t-1}}{1-{\bar{\alpha }}_{t}}{\beta }_{t}$$

(10)

Here, ${\alpha }_{t}$, ${\beta }_{t}$, ${\bar{\alpha }}_{t}$, and ${\bar{\alpha }}_{t-1}$ are all known parameters, while only ${\xi }_{t}$ is unknown. Thus, in training the reverse process of the neural network, the core objective is to predict the noise added in the forward process at each step.

So, this objective can be expressed as:

$${\mathcal{L}}={{\mathbb{E}}}_{q\left({x}_{0}\right),{\xi }_{t}{\mathscr{ \sim }}{\mathcal{N}}\left(0,{\bf{I}}\right),t}\left[{||}{\xi }_{t}-{\xi }_{\theta }\left({x}_{t},t\right)|{|}^{2}\right]$$

(11)

where ${\xi }_{\theta }\left({x}_{t},t\right)$ is the neural network’s estimate of the added noise ${\xi }_{t}$, and $t$ is a random time step chosen during training. ${{\mathbb{E}}}_{q({x}_{0})}$ is the mathematical expectation of $q({x}_{0})$.

DDPM-based parameterization of WWBs

We adopted the DDPM to construct a new WWBs parameterization with high spatiotemporal complexity. Its framework is illustrated in Fig. 6. The forward chain indicated by black arrows in Fig. 6a is typically designed to map a complex data distribution into a standard Gaussian distribution by gradually adding noise, which is also the distribution shift learning process from practical data to Gaussian. The reverse chain, as red arrows in Fig. 6a, gradually turns Gaussian distributions into practical data distributions, which is regarded as the generation process. During the implementation, the reverse chain reconstructs the data by predicting the noise added during the forward chain and progressively denoising it step by step, as illustrated in the two boxes in Fig. 6a. Figure 6b displays the network structure we used in DDPM with detailed data flow and tensor operators, which contains two kinds of inputs for noised data and conditions, outputting the noise present in the data. Conditions represent a constraint, ensuring that the generated data maintains physical consistency. This network is designed with state-of-the-art neural blocks, including ConvNeXt block⁶³ and Swin-Transformer block⁶⁴, as well as our designed Down-/Up-Sampling block; the detailed architectures of internal modules are exhibited in Fig. 7.

As highlighted in the introduction, El Niño, MJO, and TC events play significant roles in modulating the occurrence of WWBs. We selected various combinations of SSTA, OLRA, and SLPA as conditions for the DDPM, as they effectively capture the activity states of El Niño, MJO, and TC, analogous to You et al.³⁹. We trained four distinct model configurations utilizing varying conditions: (1) SSTA, (2) SSTA and OLRA, (3) SSTA, OLRA, and SLPA, and (4) OLRA and SLPA, denoted as [SSTA], [SSTA, OLRA], [SSTA, OLRA, SLPA], [OLRA, SLPA], respectively. For example, SSTA serves as a constraint for the [SSTA], guiding the initial random noise to transform into daily HFZW anomalies at the corresponding time via the reverse process. The output variable for all model configurations was the daily HFZW anomalies. The spatial domain of the predictors and output was the tropical Pacific region (120°E–80°W, 30°S–30°N), with a spatial resolution of 2.5° × 2.5°. The remaining model configurations adopt a comparable methodology. We use NCEP and ERA5 data from 1979 to 2010 for training, and ERA5 data from 2011 to 2022 as test sets. For these four models, we generate 20 ensemble members to assess their simulation performance in simulating WWBs.

Traditional WPEE-dependent WWBs parameterization scheme

The traditional WWBs parameterization scheme, initially developed by Gebbie et al.³⁰, captures the multiplicative noise characteristics of WWBs by linking their occurrence probability (${p}_{{wwb}}$) to the position of the WPEE. This approach has been widely adopted in studies investigating the dynamical impacts of WWBs on El Niño^{27,65,66,67,68}. In this parameterization scheme, ${p}_{{wwb}}$ and the spatiotemporal distribution of their associated wind stress anomalies (${\tau }_{{wwb}}\left(x,y,t\right)$) are defined as follows:

$${p}_{{wwbs}}\left(t\right)=\frac{{p}_{1}}{2}\left\{\tanh \left(\frac{(w{p}_{{edge}}\left(t\right)-180)\pi }{60}\right)\right\}+1$$

(12)

$${\tau }_{{wwb}}\left(x,y,t\right)=A\exp \left(-\frac{{\left(t-{T}_{0}\right)}^{2}}{{T}^{2}}{-\frac{\left(x-{x}_{0}\right)}{{L}_{x}^{2}}}^{2}{-\frac{\left(y-{y}_{0}\right)}{{L}_{y}^{2}}}^{2}\right)$$

(13)

where ${{wp}}_{{edge}}$ is the WPEE, defined as the longitude of the 28.5 °C isotherm; $t$ is the considered time. A WWB event was initiated only when ${p}_{{wwb}}$ was greater than a random number. The meanings and specific values of the parameters in Eqs. (12) and (13) are summarized in Table 1 below, consistent with those used in Chen et al.⁶⁶. While the parameters in Eq. (13) are presently treated as deterministic, future work could explore integrating stochastic processes to better capture WWB variability. Such refinements might improve the parameterization’s skill and its utility for El Niño forecasting, though this extension lies beyond the scope of the present study.

Table 1 The physical meanings and values of each parameter of the WWB parameterization

Full size table

Definitions of WWBs and Niño index

The definition of a WWB event follows Ji et al.²⁷, where the threshold for WWB detection is defined as three times the mean standard deviation of HFZW anomalies over the 5°S–5°N, 120°E–80°W region, consistent with Seiki et al.²⁴. The WWBs thresholds for observations and the four model configurations, i.e., [SSTA], [SSTA, OLRA], [SSTA, OLRA, SLPA], and [OLRA, SLPA], are 5 m/s, 4.6 m/s, 4.2 m/s, 4.9 m/s, and 4.8 m/s, respectively. Additionally, previous studies have demonstrated that the cumulative WWBs intensity (CWI) and WWBs’ longitudinal center (LonCen) are the key physical attributes affecting El Niño dynamics. For example, Chen et al.⁶⁹ noted that the strong CWI was a crucial factor in the occurrence of the super El Niño in 2015. Moreover, the central position of WWBs significantly affects the annual cycle and diversity of El Niño^41,70. Therefore, we primarily evaluate the DDPM’s ability to characterize these two features. The CWI, LonCen, of WWBs are defined as follows^35,69:

$$\left(\mathrm{LonCen},\mathrm{LatCen}\right)=\frac{\iiint \left({lon}\left(x\right),{lat}\left(y\right)\right)\cdot {u}_{10}\left(x,y,t\right){dxdydt}}{\iiint {u}_{10}\left(x,y,t\right){dxdydt}}$$

(14)

$${\rm{CWI}}=\iiint {u}_{10}\left(x,y,t\right){dxdydt}$$

(15)

where the integral covers the whole spatiotemporal domain of a WWB event. The term ${lon}\left(x\right),{lat}\left(y\right)$ represent the spatial position of the longitude and latitude of ${u}_{10}$, with $x$, $y$, and $t$ representing longitude, latitude, and WWB durations, respectively. We normalized the CWI by dividing it by the standard deviation, hereafter termed NCWI. Besides, the maximum WWB amplitude is defined as the maximum value within the WWB spatiotemporal region, and the zonal range is defined as the range between the farthest and nearest points within this region. The duration is defined as the interval between the first and last day that meets the WWB definition. The latitude center (LatCen) of WWBs is defined as Eq. (14). The Niño3.4 index is defined as the area-averaged SSTA over the region 5°N–5°S, 120°–170°W.

Relative Operating Characteristic curve

The Relative Operating Characteristic (ROC) curve is commonly utilized to evaluate the performance of probabilistic forecast models. When assessing the occurrence of an event, the model’s predictions are verified against actual outcomes, resulting in one of the following categories⁷¹: true positive (TP), false negative (FN), false positive (FP), or true negative (TN). Based on these results, a binary contingency table (Table 2) can be obtained:

Table 2 Contingency table

Full size table

The hit rate (HR) and false-alarm rate (FR) are defined as:

$${HR}=\frac{{TP}}{{TP}+{FN}}$$

(16)

$${FR}=\frac{{FP}}{{FP}+{TN}}$$

(17)

Here, the ROC curve is used to evaluate the simulation accuracy of monthly occurrences of WWBs. In the observational data (2011–2022), each month was assigned a binary WWBs occurrence probability (1 if WWBs occurred, 0 otherwise). For each WWB parameterization, the predicted probability was defined as the proportion of ensemble members that predict WWB occurrence in a given month. Given a specific probability threshold (e.g., 0.5)—if the forecasted probability exceeds this threshold, a WWB event is predicted to occur; otherwise, it is predicted not to occur. Using varying probability thresholds, we calculated corresponding HR and FR pairs. The ROC curve was then generated by plotting HR against FR across all thresholds. The closer the ROC curve approaches the top-left corner of the coordinate plane, the higher the predictive accuracy of the simulation.

CESM

CESM version 1.2.2 is employed in this study for our forecast experiments. CESM is one of the most widely used fully coupled climate models, encompassing comprehensive components for the atmosphere, ocean, land, land ice, and sea ice⁷². Its ability to realistically simulate key features of El Niño variability and complexity has made it a cornerstone in El Niño-related research^46,73,74. In this study, the atmospheric component was represented by the Community Atmosphere Model 4, configured with a horizontal resolution of approximately 0.9° × 1.25° (f09) and a 26-layer hybrid sigma-pressure vertical coordinate system. The oceanic processes were simulated using the Parallel Ocean Program 2 model, which features a horizontal resolution of roughly 1.1°× (0.54°–1°) (gx1v6) and 60 vertical layers. Additionally, the modeling framework incorporated several other critical components: the Community Land Model, the Los Alamos National Laboratory Sea Ice Model, the Community Ice Sheet Model, and the River Transport Model.

Design of super El Niño forecast experiments

Using the analysis fields derived from Song et al.⁷⁴, we conducted three forecast experiments (Table 3) for three historical super El Niño events (1982/83, 1997/98, and 2015/16). The experiments were initialized in February and May of the El Niño development year, with each forecast lead time of 12 months.

Table 3 Forecast experiments

Full size table

We conducted a control forecast (CTRL) using the default configuration of the CESM as a baseline. Moreover, as highlighted in the introduction, the inherent predictability limit of WWBs on interannual timescales implies that deterministic forecasts of WWBs are subject to significant uncertainties, which inevitably affect predictions of super El Niño events. To address this, utilizing the stochastic nature of WWB's parameterization, we generated a 10-member ensemble forecast for both the DDPM and WPEE forecast experiments. This ensemble approach highlights the effectiveness of capturing multi-timescale interactions, such as those between WWBs and El Niño²⁷, while emphasizing the importance of accurately representing WWB physical characteristics to enhance super El Niño forecast skills. Notably, given that the CESM inherently captures high-frequency atmospheric variability, we employed the online low-pass filtering method developed by Lian and Chen (2021) to remove high-frequency zonal wind stress components from the model before incorporating WWBs. This approach ensures numerical integration stability and mitigates potential impacts on model climate drift⁶⁷.

Data availability

All data used in this study are publicly available online. OISST and OLR are accessed at https://psl.noaa.gov/data/gridded/data.noaa.oisst.v2.highres.html⁵³ and https://psl.noaa.gov/data/gridded/data.olrcdr.interp.html⁵⁴, respectively. NCEP data are freely available at https://psl.noaa.gov/data/gridded/data.ncep.reanalysis2.html⁵⁵. ERA5 data are freely available at https://www.ecmwf.int/en/forecasts/datasets/browse-reanalysis-datasets⁵⁶. The GODAS data are obtained from https://psl.noaa.gov/data/gridded/data.godas.html. Information and the source code for data analysis are available from the Matrix Laboratory (MATLAB 2023). *Version R2023a* [Software]. Natick, Massachusetts: The MathWorks Inc. (https://www.mathworks.com).

Code availability

The CESM source codes are publicly available at https://www.cesm.ucar.edu/models/releases. The source code of the DDPM is available at https://zenodo.org/records/15655250.

References

Neelin, J. D. et al. ENSO theory. J. Geophys. Res. Oceans 103, 14261–14290 (1998).
Article Google Scholar
Alexander, M. A. et al. The atmospheric bridge: the influence of ENSO teleconnections on air–sea interaction over the global oceans. J. Clim. 15, 2205–2231 (2002).
Article Google Scholar
McPhaden, M. J., Zebiak, S. E. & Glantz, M. H. ENSO as an Integrating Concept in Earth. Sci. Sci. 314, 1740–1745 (2006).
CAS Google Scholar
Glantz, M. H. Currents of Change: Impacts of El Niño and La Niña on Climate and Society (Cambridge University Press, 2001).
L’Heureux, M. L. et al. Observing and predicting the 2015/16 El Niño. Bull. Am. Meteorol. Soc. 98, 1363–1382 (2017).
Article Google Scholar
Fang, X. & Chen, N. Quantifying the predictability of ENSO complexity using a statistically accurate multiscale stochastic model and information theory. J. Clim. 36, 2681–2702 (2023).
Article Google Scholar
Mu, M. & Ren, H.-L. Enlightenments from researches and predictions of 2014–2016 super El Niño event. Sci. China Earth Sci. 60, 1569–1571 (2017).
Article Google Scholar
Fan, H., Wang, C., Yang, S. & Zhang, G. Coupling is key for the tropical Indian and atlantic oceans to boost super El Niño. Sci. Adv. 10, eadp2281 (2024).
Article Google Scholar
Srinivas, G. et al. Dominant contribution of atmospheric nonlinearities to ENSO asymmetry and extreme El Niño events. Sci. Rep. 14, 8122 (2024).
Article CAS Google Scholar
Shi, L., Alves, O., Hendon, H. H., Wang, G. & Anderson, D. The role of stochastic forcing in ensemble forecasts of the 1997/98 El Niño. J. Clim. 22, 2526–2540 (2009).
Article Google Scholar
Hu, S. & Fedorov, A. V. The extreme El Niño of 2015–2016: the role of westerly and easterly wind bursts, and preconditioning by the failed 2014 event. Clim. Dyn. 52, 7339–7357 (2019).
Article Google Scholar
Fedorov, A. V., Hu, S., Lengaigne, M. & Guilyardi, E. The impact of westerly wind bursts and ocean initial state on the development, and diversity of El Niño events. Clim. Dyn. 44, 1381–1401 (2015).
Article Google Scholar
Chen, N., Fang, X. & Yu, J.-Y. A multiscale model for El Niño complexity. Npj Clim. Atmos. Sci. 5, 1–13 (2022).
Article Google Scholar
Fedorov, A. V. The response of the coupled tropical ocean–atmosphere to westerly wind bursts. Q. J. R. Meteorol. Soc. 128, 1–23 (2002).
Article Google Scholar
Lopez, H. & Kirtman, B. P. WWBs, ENSO predictability, the spring barrier and extreme events. J. Geophys. Res. Atmos. 119, 114–10,138 (2014).
Article Google Scholar
Yu, S. & Fedorov, A. V. The role of westerly wind bursts during different seasons versus ocean heat recharge in the development of extreme El Niño in climate models. Geophys. Res. Lett. 47, e2020GL088381 (2020).
Article Google Scholar
Levine, A., Jin, F. F. & McPhaden, M. J. Extreme noise–extreme El Niño: how state-dependent noise forcing creates El Niño–La Niña asymmetry. J. Clim. 29, 5483–5499 (2016).
Article Google Scholar
Chiodi, A. M. & Harrison, D. E. Observed El Niño SSTA development and the effects of easterly and westerly wind events in 2014/15. J. Clim. 30, 1505–1519 (2017).
Article Google Scholar
Puy, M. et al. Influence of westerly wind events stochasticity on El Niño amplitude: the case of 2014 vs. 2015. Clim. Dyn. 52, 7435–7454 (2019).
Article Google Scholar
Hu, R. et al. Predicting the 2023/24 El Niño from a multi-scale and global perspective. Commun. Earth Environ. 5, 1–8 (2024).
Article Google Scholar
Lian, T., Wang, J., Chen, D., Liu, T. & Wang, D. A strong 2023/24 El Niño is staged by tropical Pacific Ocean heat content buildup. Ocean-Land-Atmos. Res. 2, 0011 (2023).
Article Google Scholar
Dellaripa, E. M. R., DeMott, C., Cui, J. & Maloney, E. D. Evaluation of equatorial westerly wind events in the Pacific Ocean in CMIP6 models. J. Clim. 37, 5953–5971 (2024).
Article Google Scholar
Lian, T. et al. Westerly wind bursts simulated in CAM4 and CCSM4. Clim. Dyn. 50, 1353–1371 (2018).
Article Google Scholar
Seiki, A. Westerly wind bursts and their relationship with ENSO in CMIP3 models. J. Geophys. Res. Atmos. 116, D03303 (2011).
Article Google Scholar
Chen, N. & Fang, X. A simple multiscale intermediate coupled stochastic model for El Niño diversity and complexity. J. Adv. Model. Earth Syst. 15, e2022MS003469 (2023).
Article Google Scholar
Hayashi, M. & Watanabe, M. ENSO complexity induced by state dependence of westerly wind events. J. Clim. 30, 3401–3420 (2017).
Article Google Scholar
Ji, C., Mu, M., Fang, X. & Tao, L. Improving the forecasting of El Niño amplitude based on an ensemble forecast strategy for westerly wind bursts. J. Clim. 36, 8675–8694 (2023).
Article Google Scholar
Eisenman, I., Yu, L. & Tziperman, E. Westerly wind bursts: ENSO’s tail rather than the dog?. J. Clim. 18, 5224–5238 (2005).
Article Google Scholar
Seiki, A. & Takayabu, Y. N. Westerly wind bursts and their relationship with intraseasonal variations and ENSO. Part I: statistics. Mon. Weather Rev. 135, 3325–3345 (2007).
Article Google Scholar
Gebbie, G., Eisenman, I., Wittenberg, A. & Tziperman, E. Modulation of westerly wind bursts by sea surface temperature: a semistochastic feedback for ENSO. J. Atmos. Sci. 64, 3281–3295 (2007).
Article Google Scholar
Lian, T. et al. Linkage between westerly wind bursts and tropical cyclones. Geophys. Res. Lett. 45, 431–11,438 (2018).
Article Google Scholar
Liang, Y. & Fedorov, A. V. Linking the Madden–Julian oscillation, tropical cyclones and westerly wind bursts as part of El Niño development. Clim. Dyn. 57, 1039–1060 (2021).
Article Google Scholar
Puy, M., Vialard, J., Lengaigne, M. & Guilyardi, E. Modulation of equatorial Pacific westerly/easterly wind events by the Madden–Julian oscillation and convectively-coupled Rossby waves. Clim. Dyn. 46, 2155–2178 (2016).
Article Google Scholar
Madden, R. A. & Julian, P. R. Description of global-scale circulation cells in the tropics with a 40–50 day period. J. Atmos. Sci. 29, 1109–1123 (1972).
Article Google Scholar
Feng, J. & Lian, T. Assessing the relationship between MJO and equatorial Pacific WWBs in observations and CMIP5 models. J. Clim. 31, 6393–6410 (2018).
Article Google Scholar
Mu, M., Qin, B. & Dai, G. A commentary of “Artificial intelligence models bring new breakthroughs in global accurate weather forecasting”: top 10 scientific advances of 2023, China. Fundam. Res. 4, 690–692 (2024).
Article Google Scholar
Mu, M., Qin, B. & Dai, G. The predictability study of weather and climate events related to artificial intelligence models. Adv. Atmos. Sci. 41, 1005–1025 (2024).
Google Scholar
Qin, B. et al. The first kind of predictability problem of El Niño predictions in a multivariate coupled data-driven model. Q. J. R. Meteorol. Soc. 150, 5452–5471 (2024).
Article Google Scholar
You, L., Tan, X. & Tang, Y. Construction of deep-learning based WWBs parameterization for ENSO prediction. Atmos. Res. 289, 106770 (2023).
Article Google Scholar
L’Heureux, M. L. et al. ENSO prediction. In El Niño Southern Oscillation in a Changing Climate 227–246 (American Geophysical Union (AGU), 2020).
Fang, X., Dijkstra, H., Wieners, C. & Guardamagna, F. An overlooked aspect concerning the effect of the spatial pattern of zonal wind stress anomalies on El Niño evolution and diversity. Clim. Dyn. 62, 7037–7047 (2024).
Article Google Scholar
Gebbie, G. & Tziperman, E. Predictability of SST-modulated westerly wind bursts. J. Clim. 22, 3894–3909 (2009).
Article Google Scholar
Liang, Y., Fedorov, A. V. & Haertel, P. Intensification of westerly wind bursts caused by the coupling of the Madden-Julian oscillation to SST During El Niño Onset and Development. Geophys. Res. Lett. 48, e2020GL089395 (2021).
Article Google Scholar
Landsea, C. W. & Knaff, J. A. How much skill was there in forecasting the very strong 1997–98 El Niño?. Bull. Am. Meteorol. Soc. 81, 2107–2120 (2000).
Article Google Scholar
Ineson, S. et al. Predicting El Niño in 2014 and 2015. Sci. Rep. 8, 10733 (2018).
Article Google Scholar
Lian, T. & Chen, D. The essential role of early-spring westerly wind burst in generating the centennial extreme 1997/98 El Niño. J. Clim. https://doi.org/10.1175/JCLI-D-21-0010.1 (2021).
Wang, J. et al. Suppressive MJO in April 2014 downgraded the 2014/15 El Niño. J. Clim. 37, 3377–3391 (2024).
Article Google Scholar
Chen, H.-C. & Jin, F.-F. Simulations of ENSO phase-locking in CMIP5 and CMIP6. J. Clim. 34, 5135–5149 (2021).
Article Google Scholar
Chen, H.-C. & Jin, F.-F. Fundamental behavior of ENSO phase locking. J. Clim. 33, 1953–1968 (2020).
Article Google Scholar
Liao, H., Wang, C. & Song, Z. ENSO phase-locking biases from the CMIP5 to CMIP6 models and a possible explanation. Deep Sea Res. Part II Top. Stud. Oceanogr. 189–190, 104943 (2021).
Article Google Scholar
Rydbeck, A. V., Jensen, T. G. & Flatau, M. Characterization of intraseasonal Kelvin waves in the equatorial Pacific Ocean. J. Geophys. Res. Oceans 124, 2028–2053 (2019).
Article Google Scholar
Bjerknes, J. Atmospheric teleconnections from the equatorial Pacific. Mon. Weather Rev. 97, 163–172 (1969).
Article Google Scholar
Huang, B. et al. Improvements of the daily optimum interpolation sea surface temperature (DOISST) version 2.1. J. Clim. 34, 2923–2939 (2021).
Article Google Scholar
Liebmann, B. & Smith, C. A. Description of a complete (interpolated) outgoing longwave radiation dataset. Bull. Am. Meteorol. Soc. 77, 1275–1277 (1996).
Google Scholar
Kalnay, E. et al. The NCEP/NCAR 40-year reanalysis project. Bull. Am. Meteorol. Soc. 77, 437–472 (1996).
Article Google Scholar
Hersbach, H. et al. The ERA5 global reanalysis. Q. J. R. Meteorol. Soc. 146, 1999–2049 (2020).
Article Google Scholar
Behringer, D. W., Ji, M. & Leetmaa, A. An improved coupled model for ENSO prediction and implications for ocean initialization. Part I: The ocean data assimilation system. Mon. Weather Rev. 126, 1013–1021 (1998).
Article Google Scholar
Ho, J., Jain, A. & Abbeel, P. Denoising diffusion probabilistic models. In Advances in Neural Information Processing Systems (eds. Larochelle, H., Ranzato, M., Hadsell, R., Balcan, M. F. & Lin, H.) 33, 6840–6851 (Curran Associates, Inc., 2020).
Bhunia, A. K. et al. Person Image Synthesis via Denoising Diffusion Model. In 2023 IEEE/CVF Conference on Computer Visionand Pattern Recognition (CVPR) 5968–5976 https://doi.org/10.1109/CVPR52729.2023.00578 (IEEE, Vancouver, BC, Canada, 2023).
Lovelace, J., Kishore, V., Wan, C., Shekhtman, E. & Weinberger, K. Q. Latent diffusion for language generation. Adv. Neural Inf. Process. Syst. 36, 56998–57025 (2023).
Google Scholar
Li, L., Carver, R., Lopez-Gomez, I., Sha, F. & Anderson, J. Generative emulation of weather forecast ensembles with diffusion models. Sci. Adv. 10, eadk4489 (2024).
Article Google Scholar
Nai, C. et al. Reliable precipitation nowcasting using probabilistic diffusion models. Environ. Res. Lett. 19, 034039 (2024).
Article Google Scholar
Liu, Z. et al. A ConvNet for the 2020 s. In 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 11966–11976 https://doi.org/10.1109/CVPR52688.2022.01167 (2022).
Liu, Z. et al. Swin Transformer: hierarchical vision transformer using shifted windows. In 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 9992–10002 https://doi.org/10.1109/ICCV48922.2021.00986 (2021).
Lian, T., Chen, D., Tang, Y. & Wu, Q. Effects of westerly wind bursts on El Niño: a new perspective. Geophys. Res. Lett. 41, 3522–3527 (2014).
Article Google Scholar
Chen, D. et al. Strong influence of westerly wind bursts on El Niño diversity. Nat. Geosci. 8, 339–345 (2015).
Article CAS Google Scholar
Tan, X. et al. A study of the effects of westerly wind bursts on ENSO based on CESM. Clim. Dyn. 54, 885–899 (2020).
Article Google Scholar
Tan, X. et al. Effects of semistochastic westerly wind bursts on ENSO predictability. Geophys. Res. Lett. 47, e2019GL086828 (2020).
Article Google Scholar
Chen, L., Li, T., Wang, B. & Wang, L. Formation mechanism for 2015/16 super El Niño. Sci. Rep. 7, 2975 (2017).
Article Google Scholar
Li, A., Ji, C. & Fang, X. A new insight on El Niño diversity: decadal variability in westerly wind bursts. Atmos. Sci. Lett. 26, e1301 (2025).
Article Google Scholar
Forecast verification. In Statistical Methods in the Atmospheric Sciences 4th edn (ed. Wilks, D. S.) xv https://doi.org/10.1016/B978-0-12-815823-4.09991-0 (Elsevier, 2019).
Hurrell, J. W. et al. The community earth system model: a framework for collaborative research. Bull. Am. Meteorol. Soc. 94, 1339–1360 (2013).
Article Google Scholar
Liu, T., Song, X., Tang, Y., Shen, Z. & Tan, X. ENSO predictability over the past 137 years based on a CESM ensemble prediction system. J. Clim. 35, 763–777 (2022).
Article Google Scholar
Song, X. et al. A new nudging scheme for the current operational climate prediction system of the National Marine Environmental Forecasting Center of China. Acta Oceanol. Sin. 41, 51–64 (2022).
Article Google Scholar

Download references

Acknowledgements

The research of C.J., Bo Qin, M.M., S.Y., X.F., Y.W., G.D., and J.W. are supported by the National Natural Science Foundation of China (Grant Nos. 42288101, 42405147, 42192564, and U2142211), the China National Postdoctoral Program for Innovative Talents (BX20230071), the Ministry of Science and Technology of the People’s Republic of China (Grant No. 2020YFA0608802), the National Key Scientific and Technological Infrastructure project “Earth System Science Numerical Simulator Facility” (EarthLab) and the Academician Workstation of AP-TCRC.

Author information

Authors and Affiliations

Key Laboratory of Polar Atmosphere-Ocean-Ice System for Weather and Climate, Ministry of Education, Department of Atmospheric and Oceanic Sciences and Institute of Atmospheric Sciences, Fudan University, Shanghai, China
Chaopeng Ji, Mu Mu, Bo Qin, Yuntao Wei, Guokun Dai, Jinyu Wang & Xianghui Fang
Shanghai Key Laboratory of Ocean-land-atmosphere Boundary Dynamics and Climate Change, Fudan University, Shanghai, China
Chaopeng Ji, Mu Mu, Bo Qin, Yuntao Wei, Guokun Dai, Jinyu Wang & Xianghui Fang
Shanghai Frontiers Science Center of Atmosphere-Ocean Interaction, Shanghai, China
Chaopeng Ji, Mu Mu, Bo Qin, Yuntao Wei, Guokun Dai, Jinyu Wang & Xianghui Fang
State Key Laboratory of Satellite Ocean Environment Dynamics, Second Institute of Oceanography, Ministry of Natural Resources, Hangzhou, China
Tao Lian, Jie Feng & Xunshu Song
Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Zhuhai, China
Tao Lian
School of Oceanography, Shanghai Jiao Tong University, Shanghai, China
Tao Lian
School of Computer Science and Technology, Tongji University, Shanghai, China
Shijin Yuan
National Key Laboratory of Autonomous Intelligent Unmanned Systems, Tongji University, Shanghai, China
Shijin Yuan
Frontiers Science Center for Intelligent Autonomous Systems, Ministry of Education of China, Shanghai, China
Shijin Yuan

Authors

Chaopeng Ji
View author publications
Search author on:PubMed Google Scholar
Mu Mu
View author publications
Search author on:PubMed Google Scholar
Bo Qin
View author publications
Search author on:PubMed Google Scholar
Tao Lian
View author publications
Search author on:PubMed Google Scholar
Shijin Yuan
View author publications
Search author on:PubMed Google Scholar
Jie Feng
View author publications
Search author on:PubMed Google Scholar
Xunshu Song
View author publications
Search author on:PubMed Google Scholar
Yuntao Wei
View author publications
Search author on:PubMed Google Scholar
Guokun Dai
View author publications
Search author on:PubMed Google Scholar
Jinyu Wang
View author publications
Search author on:PubMed Google Scholar
Xianghui Fang
View author publications
Search author on:PubMed Google Scholar

Contributions

C.J., B.Q. and X.F. designed the project. C.J., B.Q., and J.F. contributed the model codes. C.J. and B.Q. conducted model experiments. C.J., B.Q. and X.F. wrote the manuscript. C.J., B.Q., M.M., T.L., S.Y., J.F., X.S., Y.W., G.D. and J.W. discussed the results and reviewed the manuscript.

Corresponding authors

Correspondence to Bo Qin or Xianghui Fang.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supporting Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

Ji, C., Mu, M., Qin, B. et al. Toward skillful forecasting of super El Niño events using a diffusion-based westerly wind burst parameterization. npj Clim Atmos Sci 8, 273 (2025). https://doi.org/10.1038/s41612-025-01158-x

Download citation

Received: 14 April 2025
Accepted: 30 June 2025
Published: 19 July 2025
Version of record: 19 July 2025
DOI: https://doi.org/10.1038/s41612-025-01158-x

This article is cited by

AI-Enabled conditional nonlinear optimal perturbation enhances ensemble prediction of extreme El Niño events
- Lu Zhou
- Rong-Hua Zhang
- Lingjiang Tao
npj Climate and Atmospheric Science (2025)

Subjects

Abstract

Similar content being viewed by others

Early signs of strong El Niño–Southern Oscillation episodes

Diverse NPMM conditions deviate the 2023/24 El Niño from the 1997/1998 and 2015/2016 extreme El Niño events

Change of El Niño onset location around 1970

Introduction

Results

Evaluation of the new DDPM-based WWBs parameterization

Embedding DDPM-based WWBs parameterization into CESM and forecasting of super El Niño

Mechanisms of DDPM in improving super El Niño prediction

Discussion

Methods

Data

Mathematical formulation of DDPM

DDPM-based parameterization of WWBs

Traditional WPEE-dependent WWBs parameterization scheme

Definitions of WWBs and Niño index

Relative Operating Characteristic curve

CESM

Design of super El Niño forecast experiments

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Additional information

Supplementary information

Supporting Information.

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

AI-Enabled conditional nonlinear optimal perturbation enhances ensemble prediction of extreme El Niño events

Search

Quick links