Introduction

In Japan, South Korea, Taiwan, the Philippines, and other southeast Asian coastal regions, tropical cyclones (TCs) are the most costly and deadly of natural disasters1. A successful prediction of the TC frequency in a season at least a few months ahead could help reduce the socio-economic losses through necessary mitigation measures and can benefit a range of industries, including insurance, agriculture, and tourism. Real-time seasonal forecasts of TC activity are provided by some agencies2. Such a research stream is becoming critically important in the low-latitude northwestern Pacific, where the typhoon intensity is expected to increase due to ongoing global warming3 and the impacts of the natural year-to-year variability are also expected to be more serious.

Previous works identified several potential sources of the seasonal predictability of TC frequency over the western North Pacific. The most famous phenomenon is El Niño–Southern Oscillation (ENSO)1,4,5,6,7. In El Niño years, TC over the western North Pacific tends to be more intense and longer-lived than in La Niña years7. However, TC frequency in the western North Pacific does not change significantly between El Niño and La Niña years8. On the other hand, El Niño events display a diverse range of amplitudes, triggers, spatial patterns, and life cycles9. TC over the western North Pacific may be sensitive to this event-to-event diversity10. Because of the limited events, it is difficult to show high confidence in such assertion. However, some studies tried to show the possibility. For example, El Niño Modoki11 or Central Pacific (CP)-type El Niño12 can induce substantial increases in TC duration, intensity, and frequency over the western North Pacific13,14,15. In the positive Pacific Meridional Mode (PMM)16 phase, although which cannot be interpreted as an independent phenomenon from CP-type El Niño17, more TCs tend to occur over the western North Pacific18,19. Contributions from the Indian Ocean were also shown. TC activity over the western North Pacific could be suppressed by the tropical Indian Ocean warming20,21. The warm/cold eastern Indian Ocean could suppress/promote the TC genesis over the western North Pacific22. Also, the warm (cold) northern Indian Ocean could cause fewer (more) TCs forming north of 10°N and more (fewer) TCs forming south of 10°N over the western North Pacific in boreal summer23. In the summer following the spring subtropical Indian Ocean Dipole24, more TCs tend to form over the western North Pacific25. The TC activity could be remarkably enhanced near South China coastal areas during negative IOD autumns26. The extremely low TC activity during September–October 2018 may be attributed to a positive Indian Ocean Dipole (IOD)27. A combination of an El Niño Modoki and a positive IOD28, as in 1994, can enhance seasonal TC activity over the South China Sea29. A dipole SST anomaly in the Indo-Pacific warm pool can influence the total TC genesis number over the western North Pacific30.

Although the current status of the different models used to predict seasonal TC activity shows good levels of skill before the start of the climatological periods of peak typhoon activity2, there remains room for improvement. To the best of our knowledge, an exploration of how to reduce the large uncertainty of seasonal prediction of TC activity by a large-ensemble climate model has not yet been presented, thus it is the focus of this study. In this study, we have analyzed the reforecast outputs by the 108-ensemble-member SINTEX-F2 seasonal prediction system from the nine initialized dates (1st–9th) May and August of 1982–202231,32 to find predictable events, explore the origin of the success, and, hopefully, find potential room for improvement in the seasonal predictions by analyzing the co-variability of the inter-ensemble member anomalies. Although the spatial resolution of our system (T106) is relatively coarser than that of the other operational systems2, the 108-member system has an advantage in finding a predictable signal against unpredictable atmospheric noise on a seasonal timescale and possible co-variability patterns influencing predictions of TC frequency. Besides, the reforecast period in this study (1982–2022) is much longer than the 16-year period 2003–2018 in the previous work2.

Results

Skill assessment

First, we checked the climatological fields of TC frequency in the prediction model. We tuned some parameters to detect TC in the model because its horizontal resolution (T106) was not enough to resolve TC realistically (see TC detection in Methods). We also used the Okubo–Weiss–Zeta (OWZ) scheme, which is an innovative direct TC detection and tracking scheme (see TC detection in Methods). The averaged TC frequency in June–August (JJA) was overestimated in the ensemble mean prediction relative to the JRA55 reanalysis data (Fig. 1a–c), although the model underestimated the TC frequency before the tuning. We also found that the positive bias in the OZW detection (Supplementary Fig. 1a–c and TC detection in Methods). To understand the cause of this bias, we have analyzed the Genesis Potential Index (GPI) defined by Emanuel and Nolan33. The GPI in JJA was also overestimated in the ensemble mean prediction relative to the JRA55 reanalysis data (Supplementary Fig. 2a–c). We estimated the relative contributions of the four components; the absolute vorticity at 850 hPa, the relative humidity at 700 hPa, the maximum potential intensity, and the vertical wind shear between 850 and 200 hPa, to the bias by recalculating the GPI in the model by replacing the model outputs one by one with the reanalysis data for the 4 components (Supplementary Fig. 2d–g). We found that the maximum potential intensity was critical. Murakami et al. 34 showed that maximum potential intensity is a major factor of GPI in the western North Pacific, and convective available potential energy (CAPE) plays a major role in the change of maximum potential intensity. Considering that CAPE is approximately controlled by sea surface temperature (SST), it is suggested that the positive TC frequency bias is due to SST to some extent. Since the model has the positive SST bias around Okinawa and Taiwan (Supplementary Fig. 3), it could cause the positive TC frequency bias there.

Fig. 1: Climatological tropical cyclone frequency.
figure 1

a Climatological tropical cyclone (TC) frequency in June–August (JJA) from the JRA55 reanalysis data (numbers in 5°×5° grid.). The target region (120°E–130°E and 20°N–30°N) is shown by a black box. (b) Same as (a), but for the prediction issued in early May by the SINTEX-F2 (108-ensemble mean). c (b) minus (a). (d, e, f) Same as (a, b, f), but for the standard deviation of year-to-year variations of TC frequency. g–l Same as (a–d), but for September–November (SON), with the prediction issued in early August.

The ensemble mean of standard deviations of year-to-year variations of TC frequency by an individual ensemble member was also larger than the standard deviation of the reanalysis data in some areas (Fig. 1e–g and Supplementary Fig. 1d–f for the OWZ detection), although the standard deviation of year-to-year variations of ensemble mean TC frequency in the ensemble mean was much smaller than the standard deviation of the reanalysis data (figure not shown). The standard deviation divided by the climatological TC frequency showed that the model slightly underestimated the TC variability relative to the mean (Supplementary Fig. 4a–c).

Next, we assessed the prediction skills for the JJA prediction issued in early May: the correlation skill of the ensemble-mean prediction, the signal-to-noise ratio (which indicates potential predictability, see the definition in Methods), and the Symmetric extremal dependence index35 (SEDI) (see the definition in Methods) skill for probabilistic prediction of the extreme positive tail with more than one standard deviation (approximately 16%). The probability threshold is 20%. Although the skills were limited (Fig. 2a–c), the correlation skill was statistically significant above the 95% significance level around Okinawa and Taiwan (Fig. 2a), which includes the Japanese Exclusive Economic Zone. Therefore, we would like to focus on that region (120°E–130°E and 20°N–30°N). The similar feature was also seen even if we used the OWZ detection (see Supplementary Fig. 5).

Fig. 2: Seasonal prediction skill of tropical cyclone frequency.
figure 2

a Correlation skill for prediction of TC frequency anomalies in JJA issued in early May by the 1982–2022 reforecast experiments by the SINTEX-F2 system (108-ensemble mean). Areas where the values are statistically significant above the 95% significance level are shaded. The target region (120°E–130°E and 20°N–30°N) is shown by a red box. b Same as a, but for signal-to-noise ratio (standard deviation of ensemble mean divided by ensemble spread for 1982–2022). c Same as a, but for symmetric extremal dependence index (SEDI) skill for the extreme positive tail with more than one standard deviation (approximately 16%). The probability threshold is 20%. Areas where the values are statistically significant beyond the standard errors above the 95% significance level are dotted. df Same as (a-c), but for SON issued in early August.

There is not a large bias in the target region in the September–November (SON) mean TC frequency (Fig. 1g–i) and its variability (Fig. 1j–l). The SON prediction issued in early August was further challenging relative to JJA (Fig. 2d–f) and could not find the predictable area. The seasonality is consistent with Ogata et al.13, who used a 50-km atmospheric model, and Takaya et al.20,21, who used a dynamical seasonal prediction system (a 110-km atmospheric component).

As shown in the time series of TC frequency anomalies in JJA over the target area (Fig. 3), the TC active (quiescent) summers in 2018 and 1994 (1983 and 1998) were well predicted. As discussed above, there is a positive bias in the TC frequency (absolute values) and the variability (Supplementary Fig. 6).

Fig. 3: Time series of anomalous tropical cyclone frequency in the target region.
figure 3

Time series of TC frequency anomalies in JJA averaged over 120°E-130°E and 20°N-30°N; shown by a red box in Fig. 2a from the JRA55 reanalysis data (gray bar), the prediction issued on early May of each year by the SINTEX-F2 system (red cross: 108-ensemble mean; orange dot: each ensemble member). The unit of the y-axis is tropical cyclone frequency, i.e. counted TCs. We have highlighted the ensemble mean of 2018, 1994, 1983 and 1998 by blue boxes.

TC active summer in 2018

Eighteen TCs were observed over the western North Pacific in JJA 2018, which was ranked the second most active summer since 197927. Although the model could not capture the landfalling TC frequency anomalies around Japan in JJA 2018, it successfully predicted the positive anomalies in the target area (Fig. 4a, b). The model correctly predicted the evolution of a positive IOD, a CP El Niño, and a positive PMM at the time (Fig. 4d, e). The suppressed convection in the eastern tropical Indian Ocean (Fig. 4g) could excite the northward branch of the meridional circulation and then lead the low-level cyclonic flow over the target region (Fig. 4j) with the associated divergent flow from the eastern pole of the positive IOD into the south of the target region, the exit region of the monsoonal westerly jet around 10°–20°N (Fig. 4m). This pattern was similar to the Pacific-Japan (PJ) teleconnection pattern36, but the location was a little shifted westward37. The target region was also located in the exit region of the Trades and showed zonal asymmetries in the climatological mean field, which could enhance barotropic energy conversion from the mean flow to the PJ-associated anomalies in the lower troposphere38. The low-level cyclonic circulation could favor a TC frequency increase in the target region. Those patterns were mostly predicted by the ensemble mean prediction, although it showed some errors: the location of the negative OLR anomalies in the low-latitude northwestern Pacific was about 10° shifted southward (Fig. 4h), the low-level cyclonic flow over the target region further extended eastward (Fig. 4k), and the large-scale low-level convergence in the North Pacific was overestimated (Fig. 4n).

Fig. 4: Anomalies and ensemble co-variability for JJA 2018.
figure 4

a TC frequency anomalies in JJA 2018 from the JRA55 reanalysis data in 5°×5° grid. (b) Same as (a), but for the prediction issued in early May 2018 by the SINTEX-F2 system (108-ensemble mean). c Inter-ensemble member correlation in the 108-member prediction between the TC frequency anomalies in the target area (120°E-130°E and 20°N-30°N) and the TC frequency anomalies for JJA of 2018 issued in early May 2018 by the SINTEX-F2. Values above 0.25, which are statistically significant above the 99% significance level, are shaded. d, e, f Same as (a, b, c), but for SST (°C). Dotted in e are area where the signal-to-noise ratio is above 1. (g, h, i) Same as (a, b, c), but for OLR (W m-2). (j, k, l) Same as (a, b, c), but for GH (m) and wind speed (m s-1) at 850 hPa. (m, n, o) Same as (a, b, c), but for velocity potential (10-6 m2 s-1) and divergent wind speed (m s-1) at 850 hPa. In all panels, the target area, the Dipole Mode Index (DMI, the SST anomaly difference between the western pole off East Africa (50°E–70°E, 10°S–10°N) and the eastern pole off Sumatra (90°E–110°E, 10°S–Equator), introduced by Saji et al.28, and the Niño3.4 (170°W–120°W, 5°S–5°N) regions are shown by black boxes, respectively.

The inter-ensemble correlation between the TC frequency anomalies in the target region and them in the other areas could indicate the TC frequency anomalies in the target region had no linear relationship with the TC frequency anomalies in the other regions (Fig. 4c). Interestingly, the prediction of the TC frequency anomalies in the target region was significantly linked with the prediction of the positive IOD, particularly the eastern pole (Fig. 4f): ensemble members that predicted stronger positive IOD tend to predict higher TC frequency in the target region. Although the differences among ensemble members due to large atmospheric internal variability have been considered to be unpredictable noise, the inter-ensemble correlation analyses could support that the teleconnection from the positive IOD commonly appears in ensemble members to some extent, as mentioned above for the observed/reanalysis data and the ensemble mean (Fig. 4i, l), although the inter-ensemble correlations with the associated velocity potential anomalies were not statistically significant (Fig. 4o). Our finding of the IOD contribution can add a new insight to the previous work by Qian et al.39, who showed that the 2018 TC active season in the North Pacific was primarily caused by warming in the subtropical Pacific (PMM) and secondarily by warming in the tropical Pacific (CP El Nino).

We also investigated the TC active (quiescent) summers in 1994 (1983 and 1998). The 1994 (1998) summer could suggest the positive (negative) IOD contribution, although the inter-ensemble correlations were too small to support it. The 1983 summer could suggest the negative PMM contribution. The details are shown in Supplementary Figs. 7-9a-o.

Statistical analysis

The correlation between the eastern pole of the DMI and the TC frequency anomalies around Okinawa and Taiwan for JJA of 1982–2022 was positive in the observational and reanalysis datasets, which was statistically significant above the 90% significance level (Fig. 5a). This is in agreement with a previous work22. The relationships were also well captured by the seasonal prediction system (Fig. 5b). The feature was also seen when we used the OWZ detection (Supplementary Fig. 10a, b). The correlations between the eastern pole of the DMI and the circulation anomalies also support the IOD teleconnection patterns discussed in the 2018 case study (Fig. 6): suppressed convection in the eastern tropical Indian Ocean enhanced divergent wind from the eastern tropical Indian Ocean to the Okinawa and Taiwan areas. That helped to generate low pressure in the target area, which was favorable for the TC frequency. This is also consistent with the previous study40, which showed that the positive IOD composite analysis also exhibits a low-level cyclonic circulation anomaly in the target region, although it is not statistically significant.

Fig. 5: Correlation map between the eastern pole of DMI and TC frequency in JJA of 1982–2022.
figure 5

a Correlation map between the eastern pole of DMI from the observation and the TC frequency anomalies in JJA of 1982–2022 from the JRA55 reanalysis data in 5°×5° grid. b Same as (a), but for the prediction issued in early May by the SINTEX-F2 system (108-ensemble mean). In all panels, the target area is shown by a black box. Values above 0.25, which are statistically significant above the 90% significance level, are shaded.

Fig. 6: Correlation map with the eastern pole of the DMI in JJA of 1982–2022.
figure 6

a Correlation map between the eastern pole of the DMI and the SST anomalies in JJA of 1982–2022 from the observational data. b Same as (a), but for the prediction issued in early May by the SINTEX-F2 system (108-ensemble mean). c, d Same as (a, b), but for OLR. (e, f) Same as (a, b), but for GH and wind speed at 850 hPa. g, h Same as (a, b), but for velocity potential and divergent wind speed at 850 hPa. Values above 0.25, which are statistically significant above the 90% significance level, are shaded. In all panels, the target area, the DMI, and the Niño3.4 regions are shown by black boxes, respectively.

Discussions

We found that a positive IOD played a key role in the seasonal predictability of the 2018 summer TC frequency around Okinawa and Taiwan by the dynamical prediction system. The 108-member system had an advantage in finding the predictable signal against the large unpredictable noise and the possible IOD teleconnection via the inter-member co-variability analyses. By reducing the uncertainty of the IOD prediction, we could reduce the uncertainty of the TC frequency prediction around Okinawa and Taiwan and increase the signal. The IOD contributions to the predictability were also seen in the correlation analyses in 1982–2022 and some case studies in 1994 and 1998.

Interestingly, the signals, the ensemble-mean predictions, were relatively strong for the four well-predicted summers in 1983, 1994, 1998, and 2018. When the signal is relatively strong for the coming TC season, we could add information that the prediction is relatively reliable41 although the sample size was limited to discuss the statistical skill scores. For the four summers, the IOD and/or PMM could provide a window of opportunity to issue a much more confident forecast than average skill estimates would suggest. Further careful assessments of extreme climate events and their drivers are necessary to utilize windows of opportunity for stakeholders to issue a range of plausible predictions, trustworthy and actionable early warnings with real-time forecasts42.

The PJ pattern was clear for the summer of 1994 and 1983 (Supplementary Figs. 7 and 8), so we have calculated inter-ensemble member correlation in the 108-member prediction between the sea level pressure at Hengchun of Taiwan and the OLR anomalies (Supplementary Fig. 11a, c). The sea level pressure at Hengchun of Taiwan was used to the PJ index43. Supplementary Fig. 6a and c suggested the PJ pattern in 1994 and 1983. However, we could not find the co-variability between the PJ pattern and the tropical climate variations such as ENSO and IOD (Supplementary Fig. 11b, d). For the summer of 1983, we could find the co-variability between the PJ pattern and the local SST over the western North Pacific (Supplementary Fig. 11d). This is partly consistent with the previous study44, which showed that the PJ pattern can self-aggregate over the Taiwan region, which could be less related to IOD or ENSO. Note that the correlation between the sea level pressure at Hengchun of Taiwan and the SST anomalies for JJA of 1982–2022 in the observations suggested that the weak contribution of the eastern pole of IOD to the PJ pattern (Supplementary Fig. 11e), although the correlation with the OLD anomalies did not suggest it (Supplementary Fig. 11g). The ensemble mean predictions overestimated the statistical relationship with ENSO and IOD (Supplementary Fig. 11f, h), which may be due to the fact that the ensemble mean did not include the noise. Further studies are needed to investigate possible contributions of the self-sustained PJ pattern to the seasonal predictability of the TC frequency anomalies over the western North Pacific.

The presented results are based only on a single system. In the next step, the robustness should be investigated further by including the results of multi-model systems. Particularly, further studies with high-resolution models45,46,47,48 are necessary because the horizontal resolution of the SINTEX-F (T106) was not enough to realistically resolve TC. Such research streams are underway49,50,51.

There could be differences between the detected TC from the reanalysis data and the observed TC. We have calculated the correlation between the detected TC frequency anomalies and the Japan Meteorological Agency (JMA) Best Track Data for JJA and SON of 1982–2022 (Supplementary Fig. 12). Since the correlations were statistically significant above the 95% significance level in the target area (Supplementary Fig. 12a, c), we did the comparison with the detected tropical cyclones from the JRA55 reanalysis data. Note that the correlations in JJA are slightly higher in the target area relatively to the OWZ detection (Supplementary Fig. 12a, b), while the correlations in SON by the OWZ detection are slightly higher in the target area (Supplementary Fig. 12c, d). Although the results might be sensitive to the TC detection methods, the robustness of the IOD contribution to the predictability of the summer TC frequency around Okinawa and Taiwan could be ensured to some extent. Further fine-tuning of the TC detection method might be helpful to improve the prediction skill of the SINTEX-F. However, manually tuning is typically a time-consuming. Machine learning techniques may help addressing it. We would like to challenge it as a future research topic.

The 5°grid interpolation in this study might be weak for constructing relationship between TC frequency and tropical climate modes. Some previous studies52,53 induced some new interpolation methods, which could enhance the stability of the statistical analysis. We would like to study the sensitivity of the interpolation methods as a future research topic.

Methods

Reforecast experiments

The SINTEX-F2 seasonal prediction system is based on a global ocean-atmosphere-land-sea ice coupled model54,55 with a surface and subsurface oceanic initialization scheme56,57 developed under the EU-Japan collaborative framework. This system adopts a relatively simple initialization scheme based only on the nudging of the SST data56 and a three-dimensional variational ocean data assimilation (3DVAR) method by taking three-dimensional observed ocean temperature and salinity data into account57. The atmospheric component of the SINTEX-F2 has a horizontal resolution of 1.125° (T106) with 31 vertical levels, while the oceanic component has a horizontal resolution of about 0.5° × 0.5° with 31 vertical levels. To determine the anomalies, we have removed the model monthly climatology for the period 1991–2020 at each lead time.

Observational and reanalysis datasets

To evaluate the prediction results, we used the JRA55 reanalysis data58,59 for TC detection, the NOAA OISSTv2 high-resolution version60 for SST, the NOAA Interpolated Outgoing Longwave Radiation (OLR)61 for OLR, and the NCEP/NCAR reanalysis data62 for other atmospheric fields. The monthly anomalies were derived through deviations from the monthly climatology calculated by averaging the monthly data from 1991 to 2020.

TC detection

We used TempestExtremes63,64,65 to detect TC from the six hourly reanalysis data: 1) search for candidates as minima in the sea level pressure field; 2) eliminate if a more intense minimum exists within a great-circle-distance of 10.0°; 3) enforce a closed contour, requiring an increase at least 50 Pa within 5° of the candidate node, and a decrease in 300 hPa air temperature of 0.1 K within 5° of the node within 1.0° of the candidate with maximum air temperature; 4) candidates are then stitched in time to form paths, 4a) with a maximum distance between candidates of 6.0° (great-circle-distance), 4b) consisting of at least 8 candidates per path (48 hours), 4c) with a maximum gap size of 2 (12 hours, most consecutive timesteps with no associated candidate), and 4 d) with a minimum distance between endpoints of 20 degrees. Once the complete set of TC paths has been computed, total TC is counted over each 5° grid cell in JJA and SON for each year. We tuned some parameters to detect TC in the model because the horizontal resolution ( ~ T106) was not enough to resolve it realistically (too weak and too short lifespan) in the procedures 3) and 4 d) as follows: 3) enforce a closed contour requiring an increase of 50 Pa within 10 degrees of the candidate and a decrease of 0.1 K within 10° of the node within 2.0° of the candidate with maximum air temperature; in 4 d) with a minimum distance between endpoints of 10 degrees. To ensure the robustness of the results, we also used the OWZ detection66 by adapting DetectNodes code and StitchNodes code67 into TempestExtremes63,64,65. The OWZ scheme is an innovative direct TC detection and tracking scheme that utilizes only tropospheric variables. It is based on evaluating the OWZ, defined as

$${OWZ}=\max \left\{\frac{{\zeta }^{2}-({E}^{2}+{F}^{2})}{{\zeta }^{2}},0\right\}\times \eta \times {sign}(f),$$
(1)

where f is the Coriolis parameter, η is the absolute vorticity, ζ is relative vorticity, E =∂u/∂x + ∂υ/∂y is the stretching deformation, and F = ∂υ/∂x + ∂u/∂y is the shearing deformation. We first identified local maxima of OWZ at 850hPa. Candidates for which a stronger OWZ maximum exists within 5° great-circle-distance are eliminated. Next, only those candidates that satisfy the following six conditions within a distance of 2° great-circle-distance of that maximum are retained (OWZ at 850 hPa ≥ 5 ×10−5 s−1; OWZ at 500 hPa ≥ 4 ×10−5 s−1; relative humidity at 950 hPa ≥ 70%; relative humidity at 700 hPa ≥ 50%; specific humidity at 950 hPa ≥ 10 g kg−1; vws (vertical wind shear between 200 and 850 hPa) ≤ 25 m s−1). Consecutive TC points are stitched together when they lie within a maximum distance of 5° great-circle-distance from one another, allowing for a maximum 24-hour gap. Additional core thresholds must be reached for at least 48 hours (OWZ at 850 hPa ≥ 6 ×10−5 s−1; OWZ at 500 hPa ≥ 5 ×10−5 s−1; relative humidity at 950 hPa ≥ 85%; relative humidity at 700 hPa ≥ 70%; specific humidity at 950 hPa ≥ 14 g kg−1; vws ≤ 12.5 m s−1). Finally, tracks that do not reach tropical storm intensity (10-m zonal wind speed = 16 m s−1) for at least one time step are filtered out (see the details in Bourdin et al.67).

Inter-ensemble correlation analysis

The inter-ensemble correlation (i.e., correlation in the ensemble phase space) can measure how two predictions co-vary over the ensemble phase space and indicate the linear relationship as a number between -1 and 1. It provides useful insights into possible precursors and teleconnection patterns related to a climate event32,68,69,70,71,72. Here, we calculate inter-ensemble correlation coefficients between the target anomalies and horizontal maps of anomalous fields for each grid point among the 108 ensemble members with the SINTEX-F2 system for the target season as

$$R(x,y)=\frac{\displaystyle \frac{1}{ne}{\sum }_{e=1}^{ne}(X(e)-\overline{X})(Y(x,y,e)-\overline{Y(x,y)})}{\sqrt{\displaystyle \frac{1}{ne}{\sum }_{e=1}^{ne}{(X(e)-\overline{X})}^{2}}\sqrt{\displaystyle \frac{1}{ne}{\sum }_{e=1}^{ne}{(Y(x,y,e)-\overline{Y(x,y)})}^{2}}}$$

Here \({ne}\) is the ensemble size: 108, \(X(e)\) is predictions of the target index as a function of ensemble member (\(e\)), \(\mathop{X}\limits^{\bar{} }\) is the ensemble mean of \(X\left(e\right)\), \(Y(x,y,e)\) is predictions of the other variables and a function of two-dimensional horizontal (\(x,y\)), and ensemble member (\(e\)), \(\mathop{Y(x,y)}\limits^{\bar{} }\) is the ensemble mean of \(Y(x,y,e)\), respectively. In this analysis, the large ensemble size has an advantage in finding significant co-variability patterns when the signal-to-noise ratio of the target is generally low73.

Signal-to-noise ratio

We assessed the potential predictability by dividing the predicted variability into signal (S) and unpredictable noise (N) components. Here, S is estimated as the ensemble mean of the 108 members. N is estimated as the ensemble spread, which provides a measure of the uncertainties among the predictions. By calculating the signal-to-noise (SN) ratio, we measured robustness of the prediction of the signal as well as the potential predictability in a given month32.

Symmetric extremal dependence index

Symmetric extremal dependence index35 (SEDI) is a skill score suitable for extreme events. It provides meaningful results for rare events where the hit rate and false alarm rate approach zero. It is defined for a binary event and therefore requires a threshold to be set. In this study, the extreme positive tail was set by one standard deviation and the probability threshold is 20%.