Hybrid Wavelet–ML models for regional drought forecasting in Norway

Tuğrul, Türker; Oruç, Sertaç; Hall, Jessica Louise; Şenocak, Ali Ulvi Galip; Hınıs, Mehmet Ali

doi:10.1038/s41598-025-22416-1

Download PDF

Article
Open access
Published: 04 November 2025

Hybrid Wavelet–ML models for regional drought forecasting in Norway

Scientific Reports volume 15, Article number: 38573 (2025) Cite this article

2578 Accesses
3 Citations
1 Altmetric
Metrics details

Subjects

Abstract

Drought is a natural disaster that often remains unnoticed until ecosystem impacts become severe. Therefore, monitoring and detecting droughts are important research topics. Consequently, drought indices with different focuses, such as precipitation or soil moisture, have been developed. Yet, the utility of the indices is limited before the beginning of the drought. To overcome this shortcoming, drought forecasting and providing decision-makers with an early warning to mitigate the effects is an important research topic. This study aims to take on the forecasting of the droughts with its novelty on the spatial focus, Norway (Drammen, Hamar, and Lillehammer). We forecast the Effective Drought Index (EDI) across spatially diverse Norwegian regions without hydrological constraints. To achieve this, we have utilized precipitation data between 1980 and 2025 and trained our machine learning models, namely, Support Vector Machine (SVM), Multi-layer Perceptron (MLP), Extreme Gradient Boosting (XGboost), Long-Short Term Memory network (LSTM), and Categorical Boosting Algorithm (Catboost). Moreover, the latent feature space is extended by wavelet transformation (WT). The innovative aspect of this study and its contribution to the literature is its novel application of the WT to some algorithms. Furthermore, unlike the literature, EDI was chosen as the drought index in this study, further increasing its innovative nature. Our results indicate that long short-term memory networks enhanced by wavelet transformation provide the best forecasts. Here, the best performance, LSTMW-M04, is achieved over Drammen (r = 0.9765, NSE = 0.9510, KGE = 0.8641, PI = 0.3211, and RMSE = 0.2207). Although LSTM is already an innovative and successful algorithm, we have further improved the model performance. This result will help decision-makers in a future drought study with both the model input structure and the algorithm used.

Introduction

Droughts are considered as one of the most severe and complex forms of natural hazards; these climate related events cause catastrophic consequences which have an impact on ecosystems, food security and agriculture worldwide¹. Droughts are a variable type of natural hazard that take shape in many forms and are commonly identified by their intensity, frequency and length². Meteorological drought, one form of drought, is routinely defined by a significant reduction in precipitation over a specific amount of time³. When such conditions endure, the resulting lack of moisture gradually reduces soil water availability, which in turn disrupts crop growth and agricultural productivity. This progression illustrates how sustained meteorological drought can trigger agricultural drought, establishing a clear chain of impact between atmospheric conditions and land-based consequences⁴. Another form of drought is “hydrological drought”⁵ which is the result of long-term imbalances in water flow and access to water, caused by periods of below average precipitation. These often last more than nine months and can severely impact the behavior of rivers, lakes and groundwater systems⁶. A reduction in precipitation can lead to environmental disturbances that can impair socioeconomic activities⁷.

Due to the continuous intensification of climate change, the frequency, length and severity of droughts are predicted to continually increase, leading to heightened risks to ecosystems as well as to humanity^8,9. The threat posed by droughts is a matter of global concern; although some regions are affected more severely than others, the increase of drought intensity worldwide highlights the urgent need for universally coordinated responses^7,10. Undoubtedly, there is a need to develop multitiered water management strategies⁷ as many communities are vulnerable due to increasing challenges to their food security, livelihoods and economic stability^11,12.

Norway is often perceived as a country with an abundant water supply due to its rivers, lakes and glacial reserves which contributes both to the country’s natural beauty and to its development of energy infrastructure. Although this is the case, over the last few years, there has been an increase in the frequency of dry spells and summer droughts in southeastern and central regions of Norway¹³. This pattern is part of a heightened probability for larger water deficits in southeastern Norway whereby local water resources are put under pressure by prolonged periods of low precipitation and a consequent reduction in soil moisture¹⁴. This trend has created notable challenges for agriculture, ecosystems and water management in the region. Consequently, a nation long regarded as having a stable and abundant supply of water is now increasingly confronting periods of drought. There is a need for localized drought monitoring and forecasting that ensures consistent and reliable data collection, identifying drought patterns and formulating coherent data sets to be utilised.

There are many different drought indices in existence, that have been developed to allow for the monitoring of droughts. The Standardized Precipitation Index (SPI) and the Palmer Drought Severity Index (PDSI) are both used to help quantify the severity, length and spatial extent of moisture deficits. The SPI largely focuses on precipitation changes over fluctuating time scales whilst the PDSI uses both precipitation and temperature data to evaluate the cumulative impact on soil moisture. This makes the PDSI a useful index to understanding agricultural and hydrological drought^14,15. However, the Effective Drought Index (EDI) is unique in that it offers an advantage by capturing both the duration and intensity of precipitation deficits¹⁶, which ensures its usefulness in the context of short - to medium-term drought detection. Developed in the late 1990s by Byun and Wilhite¹⁶, the EDI was a response to the perceived limitations of current drought indices. The EDI goes beyond the SPI and PDSI as it uses daily precipitation data as well as incorporating the cumulative precipitation deviation from the mean over time. EDI can be calculated both daily and monthly. Its greatest advantage over other drought indices is its greater resolution in identifying droughts, allowing it to produce more meaningful results than other indices. One of its greatest advantages is its ability to consider one-year calculations, particularly in its daily calculations. Many researchers in the literature have cited its reliability and ability to identify significant droughts. In addition, the need for a single dataset in its calculations increases the interest of researchers in EDI¹⁷.

Machine learning (ML) and deep learning (DL) have become useful tools in hydrometeorological forecasting because of the increase in the computational power and the data availability. It is argued that these new approaches are outperforming traditional, statistical methods because of their ability to model complex, nonlinear and high-dimensional relationships between climactic variables¹⁸. ML and DL models can capture subtle patterns and temporal dependencies that may have been overlooked by conventional models, which improves predictive accuracy and reliability¹⁹.

Ensemble learning techniques are well-suited for environmental datasets. These include eXtreme Gradient Boosting (XGBoost), and Categorical Boosting (CatBoost). They are well suited to environmental datasets as they can handle structured tabular data as well as managing missing values^20,21,22. Building on this, Support Vector Machines (SVMs) are efficient when used in high-dimensional spaces. These have been used successfully in classification and regression tasks related to drought monitoring²³. Networks such as Multi-Layer Perceptrons (MLP) and Long Short-Term Memory (LSTM) are effective tools for capturing temporal dependencies in sequential data. This is essential for time series-based drought prediction²⁴.

By combining these models together, a strong foundation is created that can in turn be used to develop data-centered and regionally adaptive drought forecasting systems. This is particularly evident when combined with drought indices such as the Effective Drought Index (EDI) as this joint approach accounts for both short- and long-term precipitation deficits¹⁶. The integration of drought indices has the potential to inform early warning systems as well as water resource management in the face of increasing climate instability.

In recent years there has been an increase in the use of machine learning (ML) and deep learning (DL) techniques in drought prediction. Scholars argue that these newer methods have demonstrated substantial improvements in terms of accuracy and responsiveness, when compared to traditional statistical approaches¹⁸. Many studies have utilized ML/DL models with standard drought indices, such as SPI, but there is a noticeable lack of research that centers on the EDI combined with ML/DL methods. This is particularly evident in the context of Norway.

Many researchers are using machine learning methods to predict future EDI scenarios. EDI is attracting attention because it offers more effective solutions than SPI. Piri et al.²⁵ developed drought prediction models for the Iran region using various drought indices, including EDI. In their study, they used SVM as the machine learning method and utilized optimization techniques. Another study on the future prediction of EDI is by Deo et al.²⁶. In their study, the Artificial Neural Networks (ANN), SVM and the Extreme Learning Machine (ELM) were used as machine learning methods and WT was used as the data preprocessing method. Another study is of Deo and Şahin²⁷. They used EDI with machine methods, Extreme Learning Machine (ELM) and Artificial Neural Networks (ANN), without pre-processing data. When examining studies used not only for EDI but also for SPI estimation, models were generally developed using two or three machine learning techniques. Furthermore, a single scenario was generally applied to the model inputs^23,28,29. In this study, however, four different model input structures were used. Considering the literature, it was observed that the number of previous studies using LSTM, MLP, Xgboost, Catboost and SVM algorithms simultaneously are limited. Additionally, the innovative aspect of this study is enhanced using WT. This study aims to fill this gap in the literature.

Norway’s diverse climatic zones, differing topography and hydrological characteristics necessitate regionally specific drought forecasting models. However, the majority of the existing literature on ML- and DL-based drought prediction has focused on regions that are prone to severe water scarcity, such as North America, South Asia and parts of the Mediterranean²³. There has been little exploration of this in a Norwegian context. This is a notable point when developing models that are specifically tailored to individual climates that include the agricultural identity of different regions of the country. In many data-based modeling studies in the literature, region of interest selected as areas suffering from drought^30,31,32. This study examines regions of Norway that have so far remained largely unaffected by severe droughts and, as a result, have been neglected in most research. Yet, these areas are not immune and could experience both positive and negative impacts from drought. For example, some agricultural lands covered by glaciers may be restored to agriculture due to drought. Based on all these factors, this study selected specific regions in Norway that further enhances the innovative nature of this article.

This study aims to fill this research gap. This shall be done by introducing the Effective Drought Index (EDI) into a Norwegian context. As noted previously, the EDI has been used in many other geographical locations as an appropriate indicator of both short- and long-term drought conditions. However, its application in a Norwegian context is limited, the integration of EDI with ML and DL techniques is still widely underexplored. This is particularly so in the context of regionally specific forecasting which is needed in Norway. This research utilizes and compares a suite of advanced ML and DL algorithms: XGBoost, CatBoost, Support Vector Machines (SVM), Multi-Layer Perceptrons (MLP), and Long Short-Term Memory (LSTM) networks, evaluates their relative performance and explores how these methods can be calibrated to Norway’s diverse climatic and agricultural zones.

This study’s unique contribution is found in its regional specificity by applying predictive models across three climatically distinct cities - Lillehammer, Hamar and Drammen. By doing this, the study adds to the scientific field by advancing the scientific understanding of drought dynamics in high-latitude regions as well as advancing the practical capacity for localized, anticipatory water management systems. The insights from this study will be particularly of interest and value for sectors such as agriculture, energy and municipal planning as these requireupdated and accurate drought forecasts to inform adaptive decision-making. Thus, this study will fill a methodological and geographic gap in the literature, laying the groundwork for futureresearch on Nordic drought resilience. This research also offers an applicable and usable framework for similar analyses in other vulnerable, climate-sensitive regions.

Methodology

This work presents an analysis of EDI data derived from three distinct locations of Norway. The analysis used a range of machine and deep learning methods implemented within different model architectures. In this study, the algorithms implemented include Long-Short Term Memory network (LSTM), Support Vector Machine (SVM), Extreme Gradient Boosting (XGBoost), Cat Boosting (CatBoost), Multilayer Perceptrons (MLPs).

Upon completing the analyses, wavelet transformation (WT) was paired to enhance the performance of the results.

Study region and data

The study considers three diverse regions across Norway, each characterized by distinct geographical and climatic conditions that influence their drought forecasting needs (Fig. 1). These regions include Lillehammer, Hamar, and Drammen were selected to contrast inland continental and coastal settings. Lillehammer and Hamar lie in the interior (Gudbrandsdal and Hedmark) where relatively dry continental climate supporting grain and dairy production with Hamar, surrounding areas being among Norway’s most fertile agricultural lands. Drammen situated southwest of Oslo along the Drammensfjord located in southeastern coastal regions, experience milder climate and nearby fertile lowlands, supporting horticulture, vegetable farming, and grain cultivation; however, while not a particular concern for Drammen, though past land-use policies have led to concerns about soil erosion from intensified arable farming as with many urbanizing areas^{33,34,35,36,37}. Monthly precipitation was used to calculate EDI data of these three cities are given in Table 1 together with the coordinates of the stations. Across these regions, agriculture remains largely small-scale and part-time and shifting climatic patterns with urban development shaping the future of agricultural production systems and land use.

Missing data were minimal (0.6%) and filled by linear interpolation. Data normalization was carried out by min-max procedure (Eq. 1).

$$\:X^{\prime\:}=\frac{X-{X}_{min}}{{X}_{max}-{X}_{min}}$$

(1)

Table 1 Data statistics.

Full size table

Effective drought index (EDI)

The Effective Drought Index (EDI) is first introduced by Byun and Wilhite¹⁶. This index quantifies drought (or wetness) on a daily (or monthly) basis by comparing current effective precipitation to its climatological mean and standard deviation. Compared to other drought indices that rely on long accumulation windows (e.g., SPI), EDI responds quickly to emerging dry or wet spells while still incorporating prior moisture conditions.

Effective Precipitation (EP).

For a target month (day) t, each month’s (day’)s precipitation in the antecedent window is weighted inversely by its temporal distance (i.e., more recent rainfall has greater influence).

If $\:P_{t-i}$ is the precipitation that occurred i months before t:

$$\:E{P}_{t}={\sum\:}_{\left\{i=1\right\}}^{\left\{N\right\}}\frac{P_{t-i}}{i}$$

(2)

where N is the chosen memory length (commonly 365 days for daily EDI or 12 months for monthly EDI). This weighting scheme mimics soil-moisture recession, giving emphasis to recent rainfall.

Climatological Mean and Standard Deviation of EP.

Compute long-term statistics for the same calendar day (or month) over the full record:

$$\:{EP}_{m}=\frac{1}{Y}{\sum\:}_{\left\{y=1\right\}}^{\left\{Y\right\}}{EP}_{m,y\:}$$

(3)

$$\:{\sigma\:}_{EP,m}=\sqrt{\frac{1}{Y-1}{\sum\:}_{\left\{y=1\right\}}^{\left\{Y\right\}}{\:}_{{\left({{EP}_{m,y\:}-\:EP}_{m\:}\right)}^{2}}}$$

(4)

where EP_{m, y} is the effective precipitation for calendar month m in year y and Y is the number of years.

EDI Standardization

$$\:{EDI}_{t}=\frac{{{EP}_{t\:}-\:EP}_{m\:}}{{\sigma\:}_{EP,m}}$$

(5)

Long-Short term memory network (LSTM)

The Long Short-Term Memory (LSTM) model is an improvement over the standard Recurrent Neural Network (RNN) that effectively can address the vanishing gradients problem. Proposed by Hochreiter and Schmidhuber³⁷, LSTM incorporates mechanisms that allow it to process information across extended time periods. The strength of the LSTM lies in its ability to maintain long-term data sequences through improved memory components, such as memory cells and various gates, making it more efficient for handling extended sequences compared to standard RNNs³⁸.

The architecture of an LSTM network includes a sequence input (SI) layer which is essential for feeding time series data into the network. This layer connects to the LSTM layer, composed of units that feature an input gate, a forget gate, a cell with a self-recurrent connection, and an output gate. These components together control the information by adding or removing it as necessary³⁷. To optimize the LSTM parameters and minimize the loss function, various algorithms like stochastic gradient descent (SGD), Root Mean Square Propagation (RMSProp), and Adaptive Moments (Adam) can be employed. Like RNNs, LSTMs compute a mapping from an input sequence x to an output sequence y by calculating the network unit activations using the following equations iteratively from t = 1 to t = τ with initial values $\:{C}_{o}$ = 0 and $\:{h}_{o}$ = 0. The main difference between an LSTM and a traditional recursive network is that an LSTM cell has a unique memory component C_t at each time step. This cell state, along with the weighted input x_t and the previous output h_(t−1) is used to calculate the new hidden layer output and cell state:

$$i_t = \gamma(W_i x_t + U_ih_{t-1} + b_i )$$

(6)

$$f_t = \gamma(W_fx_t + U_fh_{t-1} + b_f )$$

(7)

$$o_t = \gamma(W_o x_t + U_oh_{t-1} + b_o )$$

(8)

$$\widetilde{C_t}= tanh (W_c x_t + U_ch_{t-1} + b_c)$$

(9)

$$C_t= f_t\odot C_{t-1} + i_t \odot \widetilde{C_t}$$

(10)

$$h_t= o_t \odot tanh (C_t)$$

(11)

where C_(t−1) and C_t represent the previous and current cell memories, respectively. W_i: input gates, W_f: forget gate, and W_o: output gates. U_i, U_f, and U_o represent the weight matrices connecting the input, forget, and output gates to the hidden layer respectively. Similarly, b_i, b_f, and b_o are the bias vectors for the input, forget, and output gates. The (γ) denotes a logistic sigmoid function that is used as a non-linear activation function applied element-wise. The vectors i_t, f_t, o_t, and C_t correspond to the input, forget, output gates, and the cell state at any given time stamp t and those are dimensionally equivalent to the cell output vector, h_t. The operation $\odot$ symbolizes the element-wise multiplication between two vectors. For a more detailed understanding of LSTM networks and their mechanisms, Kratzert et al.²⁴, Zhang et al.³⁹, Dikshit et al.⁴⁰, and Wang et al.⁴¹ can provide in-depth discussions and analyses.

Support vector machine (SVM)

The support vector machine (SVM) was developed in 1992 by Boser et al.⁴², and since then it is commonly employed for classification and regression tasks. Support Vector Machine (SVM) is a supervised learning approach which distinguishes itself by providing a singular, optimal solution for a specific dataset. Conversely, other algorithms may yield several answers in the same context. This characteristic makes SVMs a preferred technique in mitigating overfitting problems by employing a kernel function in nonlinear scenarios to establish decision boundaries⁴³. Further, its adaptability has been thoroughly examined, with numerous adjustments producing favorable results^44,45,46.

In the context of regression issues, SVM is designated as Support Vector Regression (SVR)^47,48. The principal objective of SVM is to reduce statistical learning mistakes and improve the model’s stability and robustness⁴⁹. Gunn⁵⁰, Vapnik⁵¹, and Panahi et al.⁵² provides a brief description of the theory behind support vector regression.

The selection of kernel function affects the performance of SVM models. These functions include; linear, polynomial, radial basis function (RBF), sigmoid, or Gaussian. In this study the Gaussian kernel was selected because of its substantial contribution to model performance. Three critical parameters directly affect the model’s performance with the Gaussian kernel: the scale parameter (γ), the regularization constant (C), and epsilon (ε), as articulated by Belayneh et al.⁵³. The parameters were automatically tuned in MATLAB to improve the model’s efficacy. The mathematical formulation of SVM is presented in Eq. (12). The relationship between input and output variables can be distinguished from the equation;

$$\:f\left(x\right)=\left(w,\phi\:\left(x\right)\right)+b$$

(12)

where f(x) denotes a high dimensional feature space, w represents a weight of the output variable, and b referred as the bias term.

Extreme gradient boosting (XGBoost)

XGBoost is a powerful machine learning algorithm that has gained significant traction in various fields due to its exceptional predictive performance, particularly in scenarios involving large datasets and complex patterns. Due to the fundamental difference between the boosting and bagging approaches⁵⁴, as a boosting algorithm, where each consequent tree aims to improve the previous ones’ forecast, XGBoost is developed to have lower bias but higher variance when compared to bagging-based models like random forest (RF). Moreover, while still depending on the hyperparameter set, XGBoost is an enhancement of traditional gradient boosting frameworks, incorporating optimizations such as parallel computation, cache awareness, and regularization to mitigate overfitting and enhance model robustness⁵⁵.

Moreover, XGBoost has shown remarkable efficacy in tackling imbalanced (i.e., long-tailed) datasets, a common challenge in atmospheric parameters like wind, precipitation and temperature. For example, Senocak et al.⁵⁶, underlines the XGBoost performance for predicting the daily total precipitation where the dataset includes underrepresented (i.e., extreme) events. Again, in the realm of environmental science, XGBoost was employed to predict PM2.5 concentrations effectively, showcasing its versatility across different contexts^20,57.

The scalability of XGBoost also contributes to its popularity. By employing a novel sparsity-aware algorithm and weighted quantile sketch for efficient tree learning, it allows practitioners to handle larger datasets without compromising on performance²⁰. Its architecture supports parallel computing, making it not only faster but also more efficient, which is crucial in real-time applications like operational weather forecasting^22,58.

Categorical boosting algorithm (CatBoost)

CatBoost, an innovative gradient boosting algorithm developed by Yandex²¹, is gaining prominence in machine learning for its unique capability to efficiently handle categorical features. This capability and other design decisions such as ordered boosting leads to significant initial advantages over other traditional gradient boosting techniques such as XGBoost⁵⁹. In a more detailed perspective, CatBoost’s ordered boosting provides a mitigation against the case where the traditional boosting approaches train their models with the entire dataset and this may result in over-optimism in model performance due to possible feature leakage to consequent trees²¹.

Literature indicates that CatBoost exhibits superior performance in various topics of atmospheric research. For instance, a study indicates that CatBoost successfully improve the performances of the best performing numerical weather prediction forecast by up to 72% for atmospheric parameters including precipitation, temperature, and wind over a complex topography spanning across ten different Koppen climate zones and ocean areas. As another instance, a paper focusing on the sub-tropical and sub-humid regions of India compared the performance of CatBoost with various ML methodologies including XGBoost over their predictive performance for predicting weekly pan-evaporation and underlined the performance of the CatBoost⁶⁰.

CatBoost’s architecture contributes to its rapid training and prediction capabilities. This is partly due to its implementation of a symmetric decision tree structure, which allows for faster completion of gradient calculations and enhanced accuracy while mitigating overfitting through the use of mirrored nodes^60,61,62.

Multilayer perceptrons (MLPs)

Multilayer Perceptrons (MLPs) is a widely used neural network model architecture in various fields that require modeling and forecasting time series. The MLP is a feedforward neural network ANN with input, hidden, and output layers⁶³. Each layer contains an activation function, which expresses the quantity of output based on the input data mathematically^63,64. They are widely employed across disciplines, including medical diagnostics, such as predicting heart diseases, where MLPs analyze patient data⁶⁵, and in engineering for applications like fault detection in systems or time series forecasting of energy inputs^66,67. MLPs use backpropagation-based learning algorithms, adjusting weights across layers to reduce the output errors so that they can effectively model complex and handle the non-linear nature of the relationships^66,68,69. Furthermore, advancements in optimization techniques, such as Particle Swarm Optimization and Genetic Algorithm integrations, further increase MLP performance in training processes by overcoming local minima and accelerating convergence issues⁷⁰. Their functionality also extends to social sciences, where MLPs assist customer satisfaction predicting research in business environments⁷¹, indicating their versatility across fields.

Discrete wavelet transformation

Wavelet Transform is typically presented in two versions in the literature as Continuous Wavelet Transform (CWT) and as Discrete Wavelet Transform (DWT). However, because of the computational complexities associated with the implementation of CWT, DWT is frequently preferred^72,73,74. The Discrete Wavelet Transform (DWT) offers an alternative to Fourier transform, decomposing time series data into sub-signals across different frequency components by employing a signal processing, so that the extraction of specific features is enabled^75,76. It offers a time-frequency analysis of a signal by employing a mathematical function to deconstruct it in the time domain.

The Discrete Wavelet Transform employs a wavelet function, $\:{\psi\:}_{y}$ (t), referred to as the “mother wavelet,” which differentiates among various frequencies. It functions at several scales ($\:{s}_{0}$) and is temporally localized ($\:{\tau\:}_{0}$). The calculation of mother wavelet is presented in Eq. 13:

$$\:{\psi\:}_{m,n}\left(t\right)=\frac{1}{{s}_{0}^{m}}\psi\:\left\{\frac{t-{n{\tau\:}_{0}s}_{0}^{m}}{{s}_{0}^{m}}\right\}$$

(13)

here m and n indicate controlling parameters of scale and time. The most common selections for the parameters S₀ and τ₀ are 2 and 1, respectively. Based on Mallat’s theory⁷⁷, the Discrete Wavelet Transform (DWT) can decompose a signal into its inverse DWT, resulting in a sequence of approximation and detail signals that are linearly independent. Here, S₀ refers to the step of precision expansion, while τ₀ denotes the location parameter for the DWT applied to a discrete time series x_i, where each x_i, occurs at a discrete time i. The inverse DWT, as described by Mallat⁷⁷, is expressed in Eq. (14), which outlines the reconstruction process from these independent signals.

$$\:x\left(t\right)=T+\sum\limits_{m=1}^{M}\sum\limits_{t=0}^{{2}^{M-m-1}}{W}_{m,n}{2}^{-\frac{m}{2}}\psi\:\left({2}^{-m}t-n\right)$$

(14)

where $\:{W}_{m,n}{2}^{-\frac{m}{2}}{\sum\:}_{t=0}^{N-1}\psi\:\left({2}^{-m}t-n\right)x\left(t\right)$ is the wavelet coefficient for the discrete wavelet at scale $\:s={2}^{m}$ and $\:\tau\:={2}^{m}$. 5 level detailed studies were chosen and employed in wavelet transform in this study since we obtained improved model results. The calculation of the level (L) is based on the Eq. 15.

$$\:L=int\left(N\right)$$

(15)

where L is the level of the decomposition and N is the number of runs.

In this study, Daubechies45 was preferred among wavelet types such as Haar, Daubechies, and Biorthogonal because it positively affects the model performance.

WT is a data preprocessing method used to estimate the dominant frequency in time series. With this method, the most dominant part of the time series is determined over a certain temporal period. Determining the temporal coverage greatly facilitates forecasting of the time series, because the dominant period in the temporal coverage has an impact over the entire series. Many researchers emphasize that model results have improved, especially with the use of WT in machine learning^78,79,80. To combine WT with machine learning methods, analyses are performed after separating the time series into detailed and mother wavelet components at different levels of decomposition. In these analyses, the uniformity of the data type generally has a positive impact on the model results.

Model performance assessment

The evaluation of model performance was conducted using five recognized statistical metrics: the correlation coefficient (r), root mean square error (RMSE), Nash-Sutcliffe efficiency (NSE), Kling–Gupta efficiency (KGE), and Performance Index (PI). These metrics are defined in Eqs. 16, 17, 18, 19, 20 and 21 respectively.

$$\:r=\frac{{\sum\:}_{i=1}^{N}{(x}_{pi}-\underset{\_}{{x}_{p}})\left({x}_{oi}-\underset{\_}{{x}_{o}}\right)}{\sqrt{{\sum\:}_{i=1}^{N}{{(x}_{pi}-\underset{\_}{{x}_{p}})}^{2}}*\sqrt{{\sum\:}_{i=1}^{N}{\left({x}_{oi}-\underset{\_}{{x}_{o}}\right)}^{2}}}$$

(16)

$$\:RMSE=\sqrt{\frac{1}{N}{\sum\:}_{i=1}^{N}{\left({x}_{oi}-{x}_{pi}\right)}^{2}}$$

(17)

$$\:NSE=1-\left[\frac{{\sum\:}_{i=1}^{N}{\left({x}_{oi}-{x}_{pi}\right)}^{2}}{{\sum\:}_{i=1}^{N}{\left({x}_{oi}-\underset{\_}{{x}_{o}}\right)}^{2}}\right]$$

(18)

$$\:KGE=1-\sqrt{{\left(r-1\right)}^{2}+{\left(a-1\right)}^{2}+{\left(\beta\:-1\right)}^{2}}$$

(19)

$$\:\beta\:=\frac{{x}_{p}}{{x}_{o}},\:\:a=\frac{{\sigma\:}_{{x}_{p}}}{{\sigma\:}_{{x}_{o}}}$$

(20)

$$\:PI=\frac{RMSE}{\left|{{x}_{p}}\right|}/(1+r)$$

(21)

Model structure

In this study, analyses were conducted using machine learning algorithms. To further investigate model performance and enhance the article’s innovative approach, four different model structures were created by using cross-correlation. The structures of the generated models (time-lagged based on EDI) are shown in Table 2.

Table 2 Structure of models (Input lags and forecast target).

Full size table

Results

In this study, monthly rainfall data obtained from three different regions in Norway were first used to calculate the EDI, and then these values were used to make forward predictions using a series of machine learning algorithms. In the analyses, SVM, LSTM, MLP, XGBoost, and CatBoost machine learning algorithms were used. The results obtained from the analyses have been enhanced with wavelet transformation. All the results of the analyses are shown in Table 3. In this table, the best results are shown in bold.

Table 3 The results of all models for the regions.

Full size table

When examining the results obtained for the Drammen region, in analyses without wavelet transformation, the best performance metrics were achieved in LSTM-M03 (r = 0.7708, NSE = 0.5876, KGE = 0.5956, PI = 1.0397, and RMSE = 0.6402). While LSTM yields the best results in this category, SVM-M02 follows closely in performance (r = 0.7641, NSE = 0.5424, KGE = 0.3228, PI = 1.0994, and RMSE = 0.6743). Although the performance metrics of these two models are close to each other, the analysis results with LSTM are ahead of SVM. Therefore, in analyses conducted without wavelet transformation, LSTM has demonstrated effective performance compared to other algorithms. In the results obtained with MLP, it was generally found that the NSE (and sometimes KGE) performance metrics were negative. When compared to other methods, MLP is the algorithm with the lowest performance metrics. It is stated by many researchers in the literature that wavelet transformation generally improves the model results^43,81,82,83. In this study, wavelet transformation has been applied to all models at 5 detail levels. When these results are examined, just like in the analysis conducted without wavelet transformation, the most successful algorithm has been LSTMW. The performance values of LSTMW-M04 are r = 0.9765, NSE = 0.9510, KGE = 0.8641, PI = 0.3211, and RMSE = 0.2207. Although the most successful algorithm before the wavelet transformation is the same (LSTM), the different input data in the model inputs affects the model performance metrics. In other words, before the wavelet transformation, the most successful input structure for LSTM was M03, while after the wavelet transformation, the most successful model input structure was determined to be M04. After the wavelet transformation, significant improvements (almost 100% for some models) were detected for most of the models. As before the wavelet transformation, the second most successful model after the wavelet transformation is SVMW-M03. It has surpassed the other models with the results obtained using LSTM and SVM. It should also be noted that in the analyses conducted with the wavelet transform of MLP, performance metrics such as NSE and r were found to change from negative to positive.

Another region analyzed in this study is Hamar. When examining the results obtained from Hamar, it was found that, as in the Drammen region, the most effective results were achieved with LSTM and SVM in the analysis conducted without wavelet transformation. The performance metrics of LSTM-M02 are r = 0.7222, NSE = 0.5047, KGE = 0.4672, PI = 1.0999, and RMSE = 0.7015 while SVM-M02’s performance metrics are r = 0.7177, NSE = 0.4752, KGE = 0.3267, PI = 1.1352, and RMSE = 0.7221. These two algorithms are ahead of other machine learning algorithms and models in terms of performance metrics. In this category, although very close results were obtained with the analysis using Catboost-M01, SVM-M02 could not match its performance metrics. The algorithm that performed the worst in this class was MLP. The NSE values of all models except MLP-M03 are negative. Improvements were observed in the results of all models after the wavelet transformation. After the transformation, LSTMW-M03 exhibited the best performance, followed by SVMW-M03 in terms of performance. One of the most notable results after the wavelet transformation is that all the negative values of MLP have become positive. In the wavelet-transformed analysis conducted in this region, the most successful results were obtained with the input structure of M03.

Finally, the region analyzed is Lillehammer. In the analyses conducted before the wavelet transformation, the LSTM algorithm, like in other regions, showed superior performance compared to other models. When the model input structures were examined, it was observed that M02 was successful here. The performance values of LSTM-M02 are r = 0.7232, NSE = 0.51505, KGE = 0.5358, PI = 1.0677, RMSE = 0.6941. When comparing the performance metrics of the most successful models in other regions, it has been determined that the values are almost similar to each other. One of the notable results here is that while the second most successful result in other regions is generally achieved with SVM, in Lillehammer, the second most successful performance is achieved with Catboost-M03. In SVM-M03, this model is the second in terms of performance. Again, some parameters of the MLP values obtained within this region have been determined as negative. In the analyses conducted after the wavelet transformation, contrary to other regions, the most successful result was obtained with the SVMW-M03 model. In other regions, it should be noted that the most successful algorithm after the wavelet transform was LSTM. This result has differed from other regions. As in other regions, it has been determined that some NSE values, which were initially identified as negative in the MLP in this region, became positive after the wavelet transformation.

To compare the model results more effectively with each other, the best models in terms of performance metrics were identified and these are shown in the Violin diagram in Fig. 2.

When Fig. 2 is examined for Drammen, it has been determined that the models that show the most similarity to the observed values are LSTMW-M04 and SVMW-M03. Although these two models emerged as the best performers, when examined in detail and considering the average and median values, the most similar model is SVMW-M03. In the statistical results, however, this model is behind LSTMW-M04 in terms of performance. Here, the difference between the statistical results and those obtained from the Violin diagram is noteworthy. In the analysis conducted for Hamar, it was determined that the models most like the observed values were SVMW-M03 and LSTMW-M03. Again, there are very small differences here as well. But it should also be noted that the statistical results of these models are ahead of other models in terms of performance. Therefore, from this perspective, all the results for both regions overlap with each other. A similar situation exists within the Lillehammer region. The performance metrics of the SVM and LSTM algorithms after wavelet transformation are superior to those of the other models. Therefore, within this region, both the statistical results and the visual results are alike one another. In all regions, the morphological differences between the values obtained from the wavelet transformation of the MLP and the observed values are apparent. As a result, in the analyses, the prediction results of MLPW are significantly behind those of the other algorithms. This situation is the same in the statistical results.

In Fig. 3, the most successful models in each group are shown in the Taylor diagram. Through this diagram, the model with the best predictive power can be determined by comparing the models. In the analysis conducted for the Drammen region, LSTMW-M04 is the model closest to the observation value. Since the model closest to the observed value in the Taylor diagram exhibited the best performance, it was determined that the most successful model for Drammen is LSTMW-M04. In the Hamar region as well, the most successful model was determined to be LSTMW-M03. In these two regions, the algorithm that follows the most successful models is SVMW. The results obtained from here overlap with the statistical results. In the Lillehammer region, it has been stated in the statistical results that SVMW-M03 shows superiority over LSTM, unlike other regions. The same situation is observed in the Taylor diagram. For this reason, the most successful model for this region has been SVMW. One of the remarkable findings of this study is that this visual comparison method yielded clearer results compared to the Violin diagram.

In Fig. 4, another visual method, the box-plot graph, is compared with the normal distribution through hybridization. Again, the most successful models were selected within their own group, and the best model for each region was determined. As a result of the analysis conducted for Drammen, it was determined that the model consistent with the observed values was SVMW with LSTMW. Since all the predicted and observed values are shown here, they were compared not based on visual similarity but rather on average, outliers, and median values. Considering all these factors, it has been determined that the SVMW-M03 and LSTMW-M04 models have superiority over the other models. In the Hamar region, considering the average, median, and outlier parameters of the observation values, SVMW and LSTMW have shown superiority over the other models. In statistical methods as well, the performance metrics obtained with these two methods are superior to those of the other models. Therefore, the results obtained from this method are similar to those obtained from statistical methods. In the Lillehammer region, however, this situation is somewhat different. Although the superiority of these two models over other methods is obvious, it has been determined that effective results in predicting outliers were not achieved in the analysis conducted with LSTMW. For this reason, the most successful algorithm and model in this region is SVMW-M03. All the results obtained with this analysis method overlap with the statistical results.

Figure 5 shows the ridge diagram of the most successful models. According to the results obtained from this diagram, for the Drammen region, the models that are most successful in predicting peak points and are visually most similar to the observed values are SVMW-M03 and LSTMW-M02. In the Hamar region as well, the most successful models are the LSTMW-M03 and SVMW-M03 models, just like in Drammen. In these visuals, the shape differences and peak points are the most important factors in identifying the most successful model. The most successful algorithms for the Lillehammer region are SVMW-M03 and LSTMW-M02. All the results obtained from here overlap with the statistical results.

Figure 6 shows the Bland-Altman and Error Box plots for Drammen. When these graphs, created based on observation values, are examined, the small distances between the difference values within the limits of the Bland-Altman graphs, due to their basis on error values, indicate the best model to us. Therefore, in the Drammen region, the boundaries corresponding to the ± 1.96 significance level (limits of agreement) in the LSTMW-M04 model provide the best performance compared to the other models. This makes this model the best-performing model. In Fig. 7, the time series representation and scatter diagram of this model are provided. When this figure is examined, it has been determined that the observed values generally overlap with the predicted values.

In Fig. 8, Bland-Altman and Error Box plots for Hamar are shown. Bland-Altman graphs were created based on the observed values for each model. To enhance the model comparison, an Error box plot has been drawn. The model with the shortest distance between the difference values created based on the ± 1.96 significance levels (limits of agreement) offers the best model performance. Accordingly, the best result for the Hamar region was obtained with the LSTMW-M03 model. Additionally, when examining the Error box plot, it was concluded that the error rate was lowest for all values except for the outliers. The same result was obtained in both graphs. The time series and scatter diagram for this model are shown in Fig. 9. When this figure is examined, it has been identified that the observed values generally overlap with the predicted values.

Figure 10 shows the Bland-Altman and Error Box plots for Lillehammer. In the analyses conducted for this region, the best performance was exhibited by SVMW-M03. Although the LSTMW-M03 model showed similar results, it was determined that SVM exhibited superior performance with small differences, and this situation remained unchanged in the Error Box graph. The results obtained from these graphs, which are based on observation values, overlap with the statistical results. The time series and scatter diagram belonging to SVMW-M03, which has the best performance metrics in Lillehammer, are shown in Fig. 11. When this graph is examined, the observed values generally overlap with the predicted values.

For each region, we examined the mean difference between the top-performing model’s predictions and the test-set data. We used one-way ANOVA and also applied the Kruskal–Wallis test. In all cases, the p-values were greater than 0.05; therefore, H₀ (no difference) was not rejected (Table 4).

Table 4 Statistical significance (ANOVA / Kruskal–Wallis) for the top-performing models.

Full size table

Discussion

This study calculated monthly rainfall and EDI values obtained from Drammen, Hamar, and Lillehammer in the Norway region, and subsequently created different input structures with these values to be used in machine learning methods. These input structures were analyzed using SVM, LSTM, MLP, XGBoost, and CatBoost machine learning algorithms, and the obtained results were enhanced with wavelet transformation. 70% of the dataset was used for training, and 30% for testing. In this study, the performance metrics used to compare the models were derived from the test data.

The findings obtained from the study are consistent with the results of similar studies in the literature. Some of these include: Tuğrul et al.⁸¹ obtained precipitation data from a meteorological station near the Apa Dam located in the Konya Closed Basin, first calculating the SPI values for the region, and then attempting to predict these values using machine learning. They used machine learning methods such as SVM and LSTM. To strengthen the model results, they utilized the wavelet transform technique. In their findings, they emphasized that both methods showed superior performance and that the model results improved after the wavelet method in their study. In this study, improvements in model results were also detected using the wavelet method. Danandeh Mehr et al.⁸⁴ conducted analyses using the SPEI data calculated from two different stations in Ankara, employing LSTM, Genetic Programming (GP), Convolutional Neural Network (CNN), and ANN methods. They have strengthened their analyses by hybridizing some of these methods. In their findings, they stated that the CNN-LSTM method outperformed other methods and that LSTM also performed well on its own. In this study, wavelet transformation, referred to as a data preprocessing method or hybrid method, was used. Additionally, good performance metrics were achieved in the results obtained with both LSTM and LSTMW. As a result, the findings obtained from the two studies overlap with each other. In another study where LSTM was used for forward-looking drought prediction, it is Taylan’s⁸⁵ work. In Taylan’s study, some stations located in the Sakarya Basin first calculated SPI values using the precipitation data they obtained and then conducted forward-looking forecasting studies with the help of LSTM. In the study, different input structures were determined using autocorrelation, and analyses were conducted based on the most suitable model input structure. In this study, unlike Taylan⁸⁵, the most suitable input structure was determined using cross-correlation, and model diversity was created. Just like in this study, it is mentioned in the findings of Taylan⁸⁵ that good performance metrics were achieved with LSTM. Coşkun and Citakoglu⁸⁶ conducted a study comparing the performance of ELM and LSTM. In their study, they first calculated the SPI using data obtained from the Sakarya station and then conducted analyses using machine learning methods. They have stated in their findings that LSTM yields better results compared to ELM. In this study, it has been determined that LSTM is more effective compared to other methods.

One of the most important points that distinguishes this study from others in the literature is the use of EDI as input data. Making drought predictions using machine learning has become one of the quite popular topics recently. In these studies, researchers generally prefer SPI or SPEI as the drought index^{43,87,88,89,90,91}. In this study, EDI was preferred, unlike the studies in the literature. Because EDI can provide better drought resolutions compared to SPI¹⁷. With LSTM, not only droughts but also the prediction of hydrological and meteorological parameter is being made in the literature. In the findings obtained from these studies, it is generally stated that the model performances are at a satisfactory level^92,93.

In data-driven machine learning methods, data preprocessing processes generally improve model performance. These can be optimization techniques as well as WT. When these studies in the literature are examined, the analyses generally show positive impacts on model performance metrics^43,88,94. However, in some studies, especially in tree-based models, performance metrics may be negatively affected⁹⁴. In WT methods, parameters that can affect model performance metrics vary depending on the wavelet level and wavelet type. The daubechies45 wavelet type commonly yields the most effective results. Many researchers also prefer wavelet as a method^94,95,96.

Finally, we provide a table of hyper-parameters for all the machine learning algorithms we used in this study in Table 5. The values in this table have been adjusted to best predict the model outcome. Analysis results can be evaluated by trying different parameter values.

Table 5 Hyperparameters for all algorithms.

Full size table

Conclusion

In this study, the results obtained from different machine learning methods were presented using four different model input structures. The most significant results obtained are expressed below:

WT improved performance metrics across all algorithms and nearly all input configurations, demonstrating its positive effect.
For Drammen, the most successful result both before and after the wavelet was detected using the LSTM algorithm. Additionally, it has been determined that the most successful model input structure for this region is M04. In a modeling study to be conducted here, it is recommended to use the LSTMW-M04 model input structure.
In the analyses conducted in the Drammen region with MLP, the r and KGE performance metrics were determined to be negative without wavelet transformation, but after wavelet transformation, these performance metrics became positive. MLP is the algorithm that performs the worst in this region.
In the analysis conducted without wavelet transformation in the Hamar region, the best performance metrics were obtained in the LSTM’s M02 model. As in Drammen, the best performance in this region was also detected in LSTM, while the worst performance was detected in MLP. Additionally, SVM follows LSTM in terms of performance.
In Hamar, the CatBoost algorithm demonstrated performance close to SVM, creating one of the striking results.
In Hamar, the model inputs for the most successful models were obtained with M03. In any analysis conducted in this region, this input structure should be used.
The CatBoost and XGBoost algorithms did not achieve the performance metrics of LSTM and SVM either before or after the wavelet transformation.
In Lillehammer, unlike other regions, the most successful result before the wavelet transformation was achieved with LSTM-M02, while the second most successful result was obtained with Catboost-M03. Additionally, the most successful result after the wavelet transformation was obtained with SVMW-M03. In other regions, the most successful results before wavelet transformation are generally obtained with LSTM, whereas here, obtaining them with SVM is one of the most striking results of the study. This also shows that the algorithm used can yield different results in different regions.
In the study, the most effective performance in all regions was achieved in the Drammen region with LSTMW-M04. This algorithm and model input structure should be used for future forecasting models in this region.
Overall, LSTM wavelet transformation has provided superiority over other methods.
The results obtained from the wavelet transform of MLP exhibited the weakest performance in all regions.
Although MLP is a powerful one, MLP results are generally negative and low due to the high nonlinearity in the datasets. Our claim here is that WT improves model performance. With WT, seasonality, trends, and extreme points in the dataset are more clearly differentiated and accurately estimated.

With this study, the most effective models and algorithms for detecting future droughts in the working areas of Drammen, Hamar, and Lillehammer in Norway have been identified. In a future modeling study to be conducted in the region, the results and findings obtained from this study will serve as a guiding reference for the upcoming work. Additionally, this study is limited regionally in terms of the algorithms used, the data used, and the data preprocessing method.

This study will contribute to future regional agricultural policies and policies for water-based energy production facilities, as these policies are directly affected by drought. Therefore, the most appropriate drought model for each region has been determined here to help predict droughts and assist decision-makers.

The data obtained from the study area, the machine learning techniques used, and the data preprocessing methods applied can be mentioned among the limitations for this study. Future studies could improve model performance and obtain more effective results by using different data preprocessing methods and adding different meteorological and hydrological parameters to the input data such as temperature, streamflow.

Data availability

The original contributions presented in the study are included in the article. The raw data supporting the conclusions of this article will be made available by the corresponding authors upon reasonable request.

References

Ionita, M., Nagavciuc, V. & Scholz, P. Dima Long-term drought intensification over Europe driven by the weakening trend of the Atlantic meridional overturning circulation. J. Hydrology: Reg. Stud. 42, 101176 (2022).
Google Scholar
Cavus, Y. & Stahl, K. Aksoy drought intensity–duration–frequency curves based on deficit in precipitation and streamflow for water resources management. Hydrol. Earth Syst. Sci. 27, 3427–3445 (2023).
Article ADS Google Scholar
Spinoni, J. et al. Mazzeschi A new global database of meteorological drought events from 1951 to 2016. J. Hydrology: Reg. Stud. 22, 100593 (2019).
Google Scholar
Gevaert, A. I. & Veldkamp, T. I. E. Ward the effect of climate type on timescales of drought propagation in an ensemble of global hydrological models. Hydrol. Earth Syst. Sci. 22, 4649–4665 (2018).
Article ADS Google Scholar
Wang, F. et al. Hussain comprehensive evaluation of hydrological drought and its relationships with meteorological drought in the yellow river basin, China. J. Hydrol. 584, 124751 (2020).
Article Google Scholar
Kazemi Garajeh, M. et al. Mirzaei impact of Long-Term drought on surface water and water balance variations in iran: insights from Highland and lowland regions. Remote Sens. 16, 3636 (2024).
Article ADS Google Scholar
Nations, U. The Sustainable Development Goals Report 2022 (Cambridge University Press, 2022).
Van Loon, A. F. Hydrological drought explained. WIREs Water. 2, 359–392 (2015).
Article Google Scholar
IPCC. Climate Change 2021: the Physical Science basis. Contribution of Working Group I To the Sixth Assessment Report (Cambridge University Press, 2021).
Dai, A. Drought under global warming: a review. WIREs Clim. Change. 2, 45–65 (2011).
Article Google Scholar
Jain, S., Srivastava, A., Khadke, L. & Chatterjee, U. Elbeltagi Global-scale water security and desertification management amidst climate change. Environ. Sci. Pollut. Res. 31, 58720–58744 (2024).
Article Google Scholar
FAO, IFAD, UNICEF, WFP and WHO. The State of Food Security and Nutrition in the World 2022. Repurposing food and agricultural policies to make healthy diets more affordable. https://doi.org/10.4060/cc0639en (FAO, Rome, 2022).
Hanssen-Bauer, I. et al. A. Sandø climate in Norway 2100–a knowledge base for climate adaptation. NCCS Rep. 1, 52 (2017).
Google Scholar
Wilson, D. & Hisdal, H. D. Lawrence has streamflow changed in the nordic countries? – Recent trends and comparisons to hydrological projections. J. Hydrol. 394, 334–346 (2010).
Article ADS Google Scholar
Palmer, W. C. Meteorological drought. US. Weather Bureau Res. Paper. 45, 1–58 (1965).
Google Scholar
Byun, H. R. Wilhite objective quantification of drought severity and duration. J. Clim. 12, 2747–2756 (1999).
Article ADS Google Scholar
Dogan, S. & Berktay, A. Singh comparison of multi-monthly rainfall-based drought severity indices, with application to semi-arid Konya closed basin, Turkey. J. Hydrol. 470–471, 255–268 (2012).
Article Google Scholar
Mosavi, A. & Ozturk, P. K.-w. Chau flood prediction using machine learning models. Literature Review Water. 10, 1536 (2018).
Google Scholar
Prodhan, F. A., Zhang, J., Hasan, S. S., Pangali, T. P. & Sharma H.P. Mohana A review of machine learning methods for drought hazard monitoring and forecasting: current research trends, challenges, and future research directions. Environ. Model. Softw. 149, 105327 (2022).
Article Google Scholar
Chen, T. & Guestrin, C. XGBoost: A Scalable Tree Boosting System. in Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. San Francisco, California, USA: Association for Computing Machinery. (2016).
Prokhorenkova, L., Gusev, G., Vorobev, A., Dorogush, A. V. & Gulin, A. Catboost: Unbiased boosting with categorical features. in Proceedings of the 32nd International Conference on Neural Information Processing Systems, Montréal, 3-8 December 2018. 6639–6649 (2018).
Ke, G. et al. Lightgbm: A highly efficient gradient boosting decision tree. Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems, Long Beach, 4-9 December 2017. 3146–3154 (2017).
Pande, C. B. et al. Tolche A novel machine learning models for meteorological drought forecasting in the semi-arid climate region. Appl. Water Sci. 15, 121 (2025).
Article ADS Google Scholar
Kratzert, F., Klotz, D., Brenner, C. & Schulz, K. Herrnegger Rainfall–runoff modelling using long Short-Term memory (LSTM) networks. Hydrol. Earth Syst. Sci. 22, 6005–6022 (2018).
Article ADS Google Scholar
Piri, J., Abdolahipour, M. & Keshtegar, B. Advanced machine learning model for prediction of drought indices using hybrid SVRRSM. Water Resour. Manage. 37, 683–712. https://doi.org/10.1007/s11269-022-03395-8 (2023).
Article Google Scholar
Deo, R. C., Tiwari, M. K., Adamowski, J. F. & Quilty, J. M. Forecasting effective drought index using a wavelet extreme learning machine (W-ELM) model. Stoch. Environ. Res. Risk Assess. 31, 1211–1240. https://doi.org/10.1007/s00477-016-1265-z (2017).
Article Google Scholar
Deo, R. C. & Şahin, M. Application of the extreme learning machine algorithm for the prediction of monthly Effective Drought Index in eastern Australia. Atmos. Res. 153, 512–525 (2015).
Article Google Scholar
Sseguya, F. Jun drought quantification in Africa using remote sensing, Gaussian Kernel, and machine learning. Water 16, 2656 (2024).
Article Google Scholar
Kumar, R. R. et al. Leveraging artificial intelligence for meteorological drought modelling and forecasting. J. Indian Soc. Probab. Stat. https://doi.org/10.1007/s41096-025-00247-7 (2025).
Waqas, M., Humphries, U. W., Wangwongchai, A. & Dechpichai, P. Ahmad potential of artificial intelligence-based techniques for rainfall forecasting in thailand: a comprehensive review. Water 15, 2979 (2023).
Article Google Scholar
Waqas, M., Humphries, U. W. & Hlaing, P. T. Wangwongchai & P. Dechpichai advancements in daily precipitation forecasting: a deep dive into daily precipitation forecasting hybrid methods in the tropical climate of Thailand. MethodsX. 12, 102757 (2024).
Article PubMed PubMed Central Google Scholar
Waqas, M. & Humphries, U. W. Hlaing time series trend analysis and forecasting of climate variability using deep learning in Thailand. Results Eng. 24, 102997 (2024).
Article Google Scholar
Brunstad, R., Gaasland, I. & E. Vårdal agriculture as a provider of public goods: a case study for Norway. Agric. Econ. 13, 39–49 (1995).
Article Google Scholar
Lundekvam, H. & Romstad, E. Øygarden agricultural policies in Norway and effects on soil erosion. Environ. Sci. Policy. 6, 57–67 (2003).
Article Google Scholar
Scott, D., Steiger, R. & Dannevig, H. Aall climate change and the future of the Norwegian alpine ski industry. Curr. Issues Tourism. 23, 2396–2409 (2020).
Article Google Scholar
Vinge, H. Farmland conversion to fight climate change? Resource hierarchies, discursive power and ulterior motives in land use politics. J. Rural Stud. 64, 20–27 (2018).
Article Google Scholar
Hochreiter, S. Schmidhuber long Short-Term memory. Neural Comput. 9, 1735–1780 (1997).
Article CAS PubMed Google Scholar
Mikolov, T., Joulin, A., Chopra, S. & Mathieu, M. & M.A. Ranzato Learning longer memory in recurrent neural networks. arXiv preprint arXiv:1412.7753, (2014).
Zhang, J., Zhu, Y., Zhang, X. & Ye, M. Yang developing a long Short-Term memory (LSTM) based model for predicting water table depth in agricultural areas. J. Hydrol. 561, 918–929 (2018).
Article ADS Google Scholar
Dikshit, A. & Pradhan, B. Huete an improved SPEI drought forecasting approach using the long short-term memory neural network. J. Environ. Manage. 283, 111979 (2021).
Article PubMed Google Scholar
Wang, T., Tu, X., Singh, V. P., Chen, X. & Lin, K. Zhou drought prediction: insights from the fusion of LSTM and multi-source factors. Sci. Total Environ. 902, 166361 (2023).
Article CAS PubMed Google Scholar
Boser, B. E., Guyon, I. M. & Vapnik, V. N. A training algorithm for optimal margin classifiers. in Proceedings of the fifth annual workshop on Computational learning theory. (1992).
Oruc, S. & Tugrul, T. Hinis beyond traditional metrics: exploring the potential of hybrid algorithms for drought characterization and prediction in the Tromso Region, Norway. Appl. Sci. 14, 7813 (2024).
Article CAS Google Scholar
Achite, M., Katipoglu, O. M., Şenocak, S., Elshaboury, N. & Bazrafshan, O. H.Y. Dalkılıç modeling of meteorological, agricultural, and hydrological droughts in semi-arid environments with various machine learning and discrete wavelet transform. Theoret. Appl. Climatol. 154, 413–451 (2023).
Article ADS Google Scholar
Katipoğlu, O. M., Yeşilyurt, S. N. & Dalkılıç, H. Y. Akar application of empirical mode decomposition, particle swarm optimization, and support vector machine methods to predict stream flows. Environ. Monit. Assess. 195, 1108 (2023).
Article PubMed Google Scholar
Saha, S., Saha, A., Hembram, T. K. & Kundu, B. Sarkar novel ensemble of deep learning neural network and support vector machine for landslide susceptibility mapping in Tehri region, Garhwal himalaya. Geocarto Int. 37, 17018–17043 (2022).
Article ADS Google Scholar
Muller, K. R., Mika, S., Ratsch, G. & Tsuda, K. Scholkopf an introduction to kernel-based learning algorithms. IEEE Trans. Neural Networks. 12, 181–201 (2001).
Article ADS CAS PubMed Google Scholar
Abumohsen, M., Owda, A. Y. & Owda, M. & A. Abumihsan Hybrid machine learning model combining of CNN-LSTM-RF for time series forecasting of Solar Power Generation. e-Prime-Advances in Electrical Engineering, Electronics and Energy. 9, 100636 (2024).
Belayneh, A., Adamowski, J. & Khalil, B. Ozga-Zielinski Long-term SPI drought forecasting in the Awash river basin in Ethiopia using wavelet neural network and wavelet support vector regression models. J. Hydrol. 508, 418–429 (2014).
Article ADS Google Scholar
Gunn, S. R. Support vector machines for classification and regression Citeseer (1997).
Vapnik, V. N. An overview of statistical learning theory. IEEE Trans. Neural Networks. 10, 988–999 (1999).
Article ADS CAS PubMed Google Scholar
Panahi, M., Sadhasivam, N., Pourghasemi, H. R. & Rezaie, F. Lee Spatial prediction of groundwater potential mapping based on convolutional neural network (CNN) and support vector regression (SVR). J. Hydrol. 588, 125033 (2020).
Article Google Scholar
Belayneh, A. & Adamowski, J. Khalil Short-term SPI drought forecasting in the Awash river basin in Ethiopia using wavelet transforms and machine learning methods. Sustainable Water Resour. Manage. 2, 87–101 (2016).
Article Google Scholar
Sutton, C. D. Classification and regression trees, bagging, and boosting. Handb. Stat. 24, 303–329 (2005).
Article Google Scholar
Abba, S. I., Benaafi, M., Usman, A. G., Ozsahin, D. U. & Tawabini, B. I.H. Aljundi groundwater modelling and GIS-based vulnerability mapping coupled with evolutionary metaheuristic optimization in the Eastern Coast of Saudi Arabia. Earth Sci. Inf. 18, 45 (2024).
Article ADS Google Scholar
Senocak, A. U. G., Yilmaz, M. T., Kalkan, S. & Yucel, I. Amjad an explainable two-stage machine learning approach for precipitation forecast. J. Hydrol. 627, 130375 (2023).
Article Google Scholar
Pan, B. Application of XGBoost algorithm in hourly PM2. 5 concentration prediction. in IOP conference series: earth and environmental science. IOP publishing. (2018).
Qian, Q. F. & Jia, X. J. & H. Lin Machine Learning Models for the Seasonal Forecast of Winter Surface Air Temperature in North America. Earth and Space Science. 7, eEA001140 (2020). (2020).
Szczepanek, R. Daily streamflow forecasting in mountainous catchment using XGBoost, LightGBM and catboost. Hydrology 9, 226 (2022).
Article Google Scholar
Vishwakarma, D. K. et al. Vinayak evaluation of catboost method for predicting weekly Pan evaporation in subtropical and Sub-Humid regions. Pure. appl. Geophys. 181, 719–747 (2024).
Article ADS Google Scholar
Dorogush, A. V. & Ershov, V. & A. Gulin CatBoost: gradient boosting with categorical features support. arXiv preprint arXiv:1810.11363, (2018).
Hao, Y. et al. Yu A machine learning model for predicting blood concentration of quetiapine in patients with schizophrenia and depression based on real-world data. Br. J. Clin. Pharmacol. 89, 2714–2725 (2023).
Article CAS PubMed Google Scholar
Agbasi, J. C. Egbueri prediction of potentially toxic elements in water resources using MLP-NN, RBF-NN, and ANFIS: a comprehensive review. Environ. Sci. Pollut. Res. 31, 30370–30398 (2024).
Article CAS Google Scholar
Aryadoust, V. Goh predicting listening item difficulty with Language complexity measures: A comparative data mining study. CaMLA Work Pap. 2, 1–16 (2014).
Google Scholar
Al Bataineh, A. Manacek MLP-PSO hybrid algorithm for heart disease prediction. J. Personalized Med. 12, 1208 (2022).
Article Google Scholar
Mohammadi, M., Jamshidi, S., Rezvanian, A. & Gheisari, M. Kumar advanced fusion of MTM-LSTM and MLP models for time series forecasting: an application for forecasting the solar radiation. Measurement: Sens. 33, 101179 (2024).
Google Scholar
Orrù, P. F., Zoccheddu, A., Sassu, L., Mattia, C. & Cozza, R. Arena machine learning approach using MLP and SVM algorithms for the fault prediction of a centrifugal pump in the oil and gas industry. Sustainability 12, 4776 (2020).
Article ADS Google Scholar
Amor, N. & Noman, M. T. Petru prediction of functional properties of nano TiO 2 coated cotton composites by artificial neural network. Sci. Rep. 11, 12235 (2021).
Article CAS PubMed PubMed Central Google Scholar
Du, K. L., Leung, C. S. & Mow, W. H. Swamy perceptron: Learning, generalization, model selection, fault tolerance, and role in the deep learning era. Mathematics 10, 4730 (2022).
Article Google Scholar
Rather, S. A. P.S. Bala A hybrid constriction coefficient-based particle swarm optimization and gravitational search algorithm for training multi-layer perceptron. Int. J. Intell. Comput. Cybernetics. 13, 129–165 (2020).
Article Google Scholar
Amalia, S. & Deborah, I. I.N. Yulita comparative analysis of classification algorithm: random Forest, SPAARC, and MLP for airlines customer satisfaction. Sinergi 26, 213–222 (2022).
Article Google Scholar
Kisi, O. Wavelet regression model as an alternative to neural networks for river stage forecasting. Water Resour. Manage. 25, 579–600 (2011).
Article Google Scholar
Alessio, S. M. Discrete wavelet transform (DWT). Digital signal processing and spectral analysis for scientists: concepts and applications: Springer. 645–714. (2015).
Arts, L. P. Van Den Broek the fast continuous wavelet transformation (fCWT) for real-time, high-quality, noise-resistant time–frequency analysis. Nat. Comput. Sci. 2, 47–58 (2022).
Article CAS PubMed PubMed Central Google Scholar
Sang, Y. F. A review on the applications of wavelet transform in hydrology time series analysis. Atmos. Res. 122, 8–15 (2013).
Article Google Scholar
Maheswaran, R. Khosa comparative study of different wavelets for hydrologic forecasting. Comput. Geosci. 46, 284–295 (2012).
Article ADS Google Scholar
Mallat, S. G. A theory for multiresolution signal decomposition: the wavelet representation. IEEE Trans. Pattern Anal. Mach. Intell. 11, 674–693 (2002).
Article ADS Google Scholar
Seo, Y. & Choi, Y. Choi river stage modeling by combining maximal overlap discrete wavelet transform, support vector machines and genetic algorithm. Water 9, 525 (2017).
Article Google Scholar
Zhou, T. & Wang, F. Yang comparative analysis of ANN and SVM models combined with wavelet preprocess for groundwater depth prediction. Water 9, 781 (2017).
Article ADS Google Scholar
Quilty, J. & Adamowski, J. Boucher A stochastic data-driven ensemble forecasting framework for water resources: A case study using ensemble members derived from a database of deterministic wavelet‐based models. Water Resour. Res. 55, 175–202 (2019).
Article ADS Google Scholar
Tuğrul, T. & Hınıs, M. A. Oruç comparison of LSTM and SVM methods through wavelet decomposition in drought forecasting. Earth Sci. Inf. 18, 139 (2025).
Article ADS Google Scholar
Tuğrul, T. Hinis performance enhancement of models through discrete wavelet transform for streamflow forecasting in Çarşamba River, Türkiye. J. Water Clim. Change. 16, 736–756 (2025).
Article Google Scholar
Tuğrul, T. & Oruç, S. Hınıs transforming wind data into insights: A comparative study of stochastic and machine learning models in wind speed forecasting. Appl. Sci. 15, 3543 (2025).
Article Google Scholar
Danandeh Mehr, A., Rikhtehgar Ghiasi, A. & Yaseen, Z. M. Sorman & L. Abualigah A novel intelligent deep learning predictive model for meteorological drought forecasting. J. Ambient Intell. Humaniz. Comput. 14, 10441–10455 (2023).
Article Google Scholar
Taylan, E. D. An Approach for Future Droughts in Northwest Türkiye: SPI and LSTM Methods. Sustainability. 16, 6905 (2024).
Coşkun, Ö. Citakoglu prediction of the standardized precipitation index based on the long short-term memory and empirical mode decomposition-extreme learning machine models: the case of Sakarya, Türkiye. Phys. Chem. Earth Parts A/B/C. 131, 103418 (2023).
Article Google Scholar
Moghaddasi, M., Moradi, M., Mohammadi, M. & Ghaleni Jamei enhancing multi-temporal drought forecasting accuracy for iran: integrating an innovative hidden pattern identifier, recursive feature elimination, and explainable ensemble learning. J. Hydrology: Reg. Stud. 59, 102382 (2025).
Google Scholar
Oruc, S. & Hinis, M. A. Tugrul evaluating performances of LSTM, SVM, GPR, and RF for drought prediction in norway: A wavelet decomposition approach on regional forecasting. Water 16, 3465 (2024).
Article Google Scholar
Pande, C. B. et al. Forecasting of SPI and meteorological drought based on the artificial neural network and M5P model tree. Land 11, 2040 (2022).
Article Google Scholar
Pande, C. B. et al. Elbeltagi forecasting of meteorological drought using ensemble and machine learning models. Environ. Sci. Europe. 36, 160 (2024).
Article Google Scholar
Sadrtdinova, R., Perez, G. A. C. & D.P. Solomatine improved drought forecasting in Kazakhstan using machine and deep learning: a non-contiguous drought analysis approach. Hydrol. Res. 55, 237–261 (2024).
Article Google Scholar
Anshuka, A., Chandra, R., Buzacott, A. J. V. & Sanderson, D. Ogtrop Spatio Temporal hydrological extreme forecasting framework using LSTM deep learning model. Stoch. Env. Res. Risk Assess. 36, 3467–3485 (2022). van.
Article Google Scholar
Liu, J., Xu, T. & Lu VMDI-LSTM-ED, C. A novel enhanced decomposition ensemble model incorporating data integration for accurate non-stationary daily streamflow forecasting. J. Hydrol. 653, 132769 (2025).
Article Google Scholar
Tuğrul, T. Hinis improvement of drought forecasting by means of various machine learning algorithms and wavelet transformation. Acta Geophys. 73, 855–874 (2025).
Article ADS Google Scholar
Zhang, Y., Ji, T., Li, M. & Wu, Q. H. Application of discrete wavelet transform for identification of induction motor stator inter-turn short circuit. in 2015 IEEE Innovative Smart Grid Technologies-Asia (ISGT ASIA). IEEE. (2015).
Matsunaga, Y. et al. Spatio-temporal hierarchy in the dynamics of a minimalist protein model. J. Chem. Phys. 139(21), 215101. https://doi.org/10.1063/1.4834415 (2013).
Article ADS CAS Google Scholar

Download references

Acknowledgements

This research received no external funding. APC was supported by UiT, the Arctic University of Norway.

Funding

Open access funding provided by UiT The Arctic University of Norway (incl University Hospital of North Norway)

Author information

Authors and Affiliations

Technology Faculty, Civil Engineering Department, Gazi University, Ankara, Türkiye
Türker Tuğrul
The Center for Sámi Studies, UiT Norges Arktiske Universitet, Tromsø, N-9037, Norway
Sertaç Oruç
Faculty of Engineering and Natural Sciences, Department of Civil Engineering, Ankara Yıldırım Beyazıt University, Ankara, Türkiye
Sertaç Oruç & Ali Ulvi Galip Şenocak
The Arctic Youth Network and The Foundation for Law and International Affairs, Washington, USA
Jessica Louise Hall
Faculty of Engineering, Civil Engineering, Aksaray University, Aksaray, Türkiye
Mehmet Ali Hınıs

Authors

Türker Tuğrul
View author publications
Search author on:PubMed Google Scholar
Sertaç Oruç
View author publications
Search author on:PubMed Google Scholar
Jessica Louise Hall
View author publications
Search author on:PubMed Google Scholar
Ali Ulvi Galip Şenocak
View author publications
Search author on:PubMed Google Scholar
Mehmet Ali Hınıs
View author publications
Search author on:PubMed Google Scholar

Contributions

Conceptualization, S.O. and M.A.H.; Formal analysis, T.T.; Methodology, T.T., S.O. and M.A.H.; Supervision, S.O. and M.A.H.; Visualization, T.T, S.O. and M.A.H.; Writing – original draft, T.T., S.O. A.U.G.S and J.L.H.; Writing – review & editing, T.T., S.O., J.L.H., A.U.G.S and M.A.H.

Corresponding author

Correspondence to Sertaç Oruç.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Tuğrul, T., Oruç, S., Hall, J.L. et al. Hybrid Wavelet–ML models for regional drought forecasting in Norway. Sci Rep 15, 38573 (2025). https://doi.org/10.1038/s41598-025-22416-1

Download citation

Received: 28 June 2025
Accepted: 29 September 2025
Published: 04 November 2025
Version of record: 04 November 2025
DOI: https://doi.org/10.1038/s41598-025-22416-1