A power quality disturbances classification method based on multi-modal parallel feature extraction

Tong, Zhanbei; Zhong, Jianwei; Li, Jiajun; Wu, Jianjun; Li, Zhenwei

doi:10.1038/s41598-023-44399-7

Download PDF

Article
Open access
Published: 17 October 2023

A power quality disturbances classification method based on multi-modal parallel feature extraction

Zhanbei Tong¹^na1,
Jianwei Zhong¹^na1,
Jiajun Li²,
Jianjun Wu² &
…
Zhenwei Li²

Scientific Reports volume 13, Article number: 17655 (2023) Cite this article

5441 Accesses
8 Citations
Metrics details

Subjects

Abstract

Power quality disturbance (PQD) is an important problem affecting the safe and stable operation of power system. Traditional single modal methods not only have a large number of parameters, but also usually focus on only one type of feature, resulting in incomplete information about the extracted features, and it is difficult to identify complex and diverse PQD types in modern power systems. In this regard, this paper proposes a multi-modal parallel feature extraction and classification model. The model pays attention to both temporal and spatial features of PQD, which effectively improves classification accuracy. And a lightweight approach is adopted to reduce the number of parameters of the model. The model uses Long Short Term Memory Neural Network (LSTM) to extract the temporal features of one-dimensional temporal modes of PQD. At the same time, a lightweight residual network (LResNet) is designed to extract the spatial features of the two-dimensional image modality of PQD. Then, the two types of features are fused into multi-modal spatio-temporal features (MSTF). Finally, MSTF is input to a Support Vector Machine (SVM) for classification. Simulation results of 20 PQD signals show that the classification accuracy of the multi-modal model proposed in this paper reaches 99.94%, and the parameter quantity is only 0.08 MB. Compared with ResNet18, the accuracy of the proposed method has been improved by 2.55% and the number of parameters has been reduced by 99.25%.

Multi-modal fusion fault diagnosis for high-voltage transformers based on STFT-ResBIGRUNet

Article Open access 28 November 2025

A hybrid AI framework for identification of power quality disturbances in electrical network

Article Open access 02 April 2026

Power system security and protection considering the integration of new energy power plants

Article Open access 08 October 2025

Introduction

The environmental pollution caused by traditional fossil energy sources has forced mankind to develop cleaner energy sources. To promote sustainable development strategies, modern power systems use a lot of renewable energy represented by wind and solar energy¹. However, the random and intermittent nature of renewable energy sources has triggered a series of PQD problems, resulting in transformer overheating, decreased product quality, reduced equipment life, and false tripping of circuit breakers, which affect the safety and stability of the power system². Therefore, the problem of power quality needs to be addressed urgently. An important step in improving PQD problems is to accurately identify the type of PQD³.

The recognition of PQD consists of two parts: feature extraction and classification. Common feature extraction methods include wavelet transform^4,5, empirical mode decomposition^6,7, variational mode decomposition⁸, S transform^9,10, etc. According to the characteristics of the appropriate classification algorithm, such as: support vector machine^11,12,13,14, decision tree¹⁵, artificial neural networ^16,17, etc. Literature¹⁸ proposed using discrete wavelet to extract PQD characteristics , then combining artificial bee colony and particle swarm optimization to select optimal features, and finally using probabilistic neural network to realize PQD classification. Literature¹⁹, the cuckoo algorithm is improved to optimize the penalty factor, relaxation variable and feature number of multi-class SVM, which improves the classification accuracy to a certain extent. Literature²⁰, for the PQD of distributed new energy generators, the pyramid algorithm without wavelet transform is used to extract features, and then the random gradient lifting tree is used for classification, which effectively improves the classification accuracy. These methods have their own characteristics and have made significant contributions to the research of PQD classification. However, the artificially extracted features of the signals make the classification results susceptible to human interference.

With the advancement of artificial intelligence technology, some new methods have provided assistance in solving the classification problem of PQD^21,22. The deep learning methods represented by convolutional neural networks (CNN) transform time-series signals into two-dimensional images, and then automatically extract spatial features from the images²³. Literature²⁴ used continuous wavelets to convert PQD into color images, and then used Bayesian CNN for classification. This method achieves certain results, but ignores the influence of some temporal features. Methods represented by recurrent neural networks (RNNs) extract temporal features from one-dimensional PQD signals and then perform classification²⁵. Literature²⁶ proposes introducing a dual attention mechanism in Bi-LSTM to increase the weight of important features, reducing computational complexity and improving accuracy. This approach effectively extracts the temporal features of the signal, but does not focus on the impact of spatial features on classification. Literature²⁷ proposes a hybrid neural network model to convert PQD into a perturbed image, using CNN to automatically extract spatial features of the image, and then inputting the temporal features into a gated recurrent unit (GRU) for classification. This method extracts the spatial-temporal characteristics of the PQD signal, but may result in the loss of some temporal features during image conversion. Although the above methods can eliminate the interference of human factors, these single-modality models can easily lead to varying degrees of feature loss during feature extraction, which can have an impact on PQD classification. Meanwhile, in order to achieve better classification results, these deep learning models continuously increase the depth of the model by stacking, which makes the number of parameters of the model increase significantly and improves computational difficulty.

In recent years, inspired by the multisensory (visual and auditory) perception of the world by humans, research on classification methods has gradually shifted from unimodal to multi-modal domains²⁸. Multimodal data fusion aims to combine different distributed and different types of data in a single space, including images, audio, and measurement signals²⁹. Currently, it is mostly used in medical diagnostics, acoustics, and vision^30,31,32, research in PQD recognition suffers from the problems of feature extraction methods and a large number of model parameters³³. Multimodal information is obtained by fusing different modalities of information, and its amount of information exceeds that of a single modality. In previous studies, most of them adopted a single mode approach, that is, extracting features from one-dimensional signals or two-dimensional images. However, the types of PQDs in modern power systems are complex and diverse, and most single modal methods are prone to feature loss when extracting features, resulting in an inability to fully grasp the characteristics of the signal³⁴. To address the above issues, this paper combines these types of data and proposes a PQD classification method based on multimodality LResNet-LSTM parallel feature extraction. The model simultaneously inputs the one-dimensional time-series signal and two-dimensional disturbance images of the PQD. LSTM and LResNet are used to extract the temporal and spatial features, which are then fused into a multimodal spatiotemporal feature (MSTF), and finally input into SVM for classification.

Based on the above research, this paper proposes a PQD classification method based on multi-modal LResNet-LSTM parallel feature extraction. The model uses LResNet and LSTM to extract spatial features and temporal features in parallel, and then fuses them into MSTF, which are finally input into SVM for classification.

The main contributions of this paper are as follows:

This paper proposes a PQD classification model with multi-modal parallel feature extraction. The model designs spatial feature extraction (SFE) module and temporal feature extraction (TFE) module based on LResNet and LSTM respectively. The multi-modal feature fusion (MFF) module fuses the features extracted by SFE and TFE in parallel into MSTF to obtain more comprehensive feature information.
A Light ResNet is designed based on the idea of residuals and depth-separable convolution. Compared to the traditional ResNet18, its structure is simpler and the number of parameters is greatly reduced. The use of Swish activation functions optimizes the classification performance of the model. After being fused with the temporal features extracted by LSTM, the accuracy of the model reached 99.94
This article uses Cache to solve the problem of speed mismatch between SFE and TFE modules when running in parallel.
Unlike traditional deep learning models that improve classification accuracy by increasing depth, the proposed model in this paper not only improves classification accuracy, but also achieves the light weight of the model.

The work in this paper is organized as follows: “Multi-modal PQD classification model” section proposes a multi-modal PQD classification model. “Simulation experiments” section verifies the effectiveness of the method through simulation experiments. “Conclusion” section is the conclusion.

Multi-modal PQD classification model

PQD classification model framework

In this paper, we propose a multi-modal PQD classification model based on LResNet-LSTM, which mainly consists of a parallel feature extraction (PFE) module, an MFF module and a classification module. The model framework is shown in Fig. 1.

Parallel Feature Extraction: PFE consists of two submodules: SFE and TFE. The SFE first transforms PQD signals into 2D disturbance images and then extracts spatial features using LResNet. TFE normalizes the PQD signal and extracts temporal features using LSTM. After this, TFE stores the features in the Cache and waits for SFE to complete its operation.
Multi-modal feature fusion: This module splices the extracted spatial features and temporal features to obtain MSTF and outputs them to the next module.
Classification: This module uses a better performing SVM as a classifier, first inputting MSTF for training, and then inputting the test set into the trained SVM for classification.

Parallel feature extraction module

Due to the fact that LSTM needs to calculate the previous result before calculating the next result when processing sequences, it is unable to directly combine LSTM with LResNet into a parallel computing model. Therefore, we need to split the PFE module into two separate submodules: SFE and TFE. The two sub-modules can be used to extract the spatial features and temporal characteristics of the PQD signals respectively.

SFE module

SFE module consists of two parts: image conversion and feature extraction. Image conversion is done by gramian angular field (GAF), which maps the PQD signal into the polar coordinate system using the GAF method and converts it into a scrambled image by coding³⁵. This method has been widely used to convert 1D signals into 2D images. The conversion process is as follows.

Step 1::: The timing signal x(t) is mapped into the polar coordinate system. Where, is the angle and r is the radius.
$$\begin{aligned} {\left\{ \begin{array}{ll}\varphi =\textrm{arccos}(x) \\ r=\frac{t_i}{N},t_i\in N \end{array}\right. } \end{aligned}$$
(1)
Step 2::: Convert to Gram matrix.
$$\begin{aligned} G=\left[ \cos \left( \varphi _{i}+\varphi _{j}\right) \right] =\left[ \begin{array}{ccc} \cos \left( \varphi _{1}+\varphi _{2}\right) &{} \cdots &{} \cos \left( \varphi _{1}+\varphi _{n}\right) \\ \vdots &{} \ddots &{} \vdots \\ \cos \left( \varphi _{n}+\varphi _{1}\right) &{} \cdots &{} \cos \left( \varphi _{n}+\varphi _{n}\right) \end{array}\right] \end{aligned}$$
(2)
Step 3::: The imagesc function in MATLAB is called to implement the image conversion. Each element in the matrix specifies a pixel color in the image.

This article designed a lightweight residual network that focuses on the inherent relationships in the spatial domain, aiming to extract key information from images. The structure of LResNet is shown in Fig. 2. In the graph, BN refers to batch normalization and Swish is an activation function. ReLU is a commonly used activation function in neural networks. It is a left-saturated function with a derivative of 1, which prevents the gradient from decaying as quickly as the sigmoid function does, providing the advantage of speeding up training and overcoming gradient extinction. However, when the ReLu input is negative, the output is always 0, resulting in no activation. Swish overcomes the problem that ReLu is invalid when the input is negative³⁶. The Swish functional expression is shown in Eq. (3).

$$\begin{aligned} f(x)=x\cdot \textrm{sigmoid}(\beta ,x) \end{aligned}$$

(3)

Where $\beta $ is a constant.

PQD image generated through convolution operation 112 × 112 × 64 feature diagram. The maximum pooling layer compresses the input feature map and extracts the main features, while enhancing the robustness of the model to some extent. To achieve a lightweight model, this paper uses two deep separable convolutions (GConv) to construct a residual block, which has only 1.73% of the parameter of a standard convolutional residual block. Multiply the feature map from the residual module by the feature map from the previous layer to obtain a new feature map to prevent model degradation. Then, a convolutional layer is used to extract local features of the image. Finally, use GAP to integrate the spatial information on each channel, obtaining a 1 × 1 × 128 spatial feature vectors.

Compared to the traditional ResNet18, LResNet has a simpler structure and 99.25% fewer parameters. Its performance meets the requirements of extracting spatial features for this paper.

TFE module

The PQD signal is fed into the TFE module to extract temporal features, and the first requires normalization of the raw data. In this paper, min-max normalization is used to map the perturbed signal x to between [0,1]. The formula is Eq. (4).

$$\begin{aligned} x^{*}=\frac{x-min(x)}{max(x)-min(x)} \end{aligned}$$

(4)

Then input the normalized data into LSTM for training and extract temporal features. LSTM is a neural network that is improved to solve the problem of gradient vanishing in recurrent neural networks^37,38. The input gate of LSTM is used to read data, while the forgetting gate discards useless information and preserves valid information. The output gate transmits effective information to the next moment.

The input gate of LSTM is used to read the data and the forgetting gate discards the useless information and keeps the valid information. The output gate delivers valid information to the next moment. The calculation formula are shown in Eqs. (5)–(10)

$$\begin{aligned} f_t=\sigma (W_f [h_{t-1},x_t]+b_f) \end{aligned}$$

(5)

$$\begin{aligned} i_t=\sigma (W_i [h_{t-1},x_t]+b_i) \end{aligned}$$

(6)

$$\begin{aligned} g_t=\textrm{tanh} (W_g [h_{t-1},x_t]+b_g) \end{aligned}$$

(7)

$$\begin{aligned} o_t=\sigma (W_o [h_{t-1},x_t]+b_o) \end{aligned}$$

(8)

$$\begin{aligned} C_t=f_t C_{t-1}+i_t g_t \end{aligned}$$

(9)

$$\begin{aligned} h_t=o_t \textrm{tanh} (C_t) \end{aligned}$$

(10)

At time t, $x_t$ is the input of time t, $\sigma $ is sigmoid function, $f_t$ is forgetting gate, $i_t$ is input gate, $g_t$ is output of tanh function, $o_t$ is output gate, $C_{t-1}$ is the carrier of the previous round of global information, $h_{t-1}$ is the intermediate state output of the previous round. $C_t$ is the carrier of this round of global information, $h_t$ is the intermediate state output of this round. $W_f$, $W_i$, $W_g$, $W_o$ are the weights of the corresponding symbols, $b_f$, $b_i$, $b_g$, $b_o$ are the bias of the corresponding symbols.

Due to the different operating speeds of LSTM and LResNet, the time required for the TFE and SFE modules to complete feature extraction is not synchronized. Therefore, the module that completes the extraction first needs to wait for the other module to complete its operation before being input into the MFF layer for feature fusion. Experiments have shown that the TFE module runs faster, and the extracted temporal features need to be stored in memory before the SFE module completes the spatial feature extraction. Usually, data is stored in main memory, waiting for the CPU to issue a call instruction, and then the data is input into the CPU to execute the subsequent program. However, the CPU access to main memory is slow. To speed up the call, we use a cache between the CPU and main memory to store the timing characteristics. When the SFE completes its operation, the CPU reads the timing characteristics data directly from the Cache at high speed, speeding up the program’s execution.

MFF and classification

The PQD signal uses SFE to extract a size of 1 × 1 × 128 spatial features, extracted using TFE with a size of 1 × 1 × 128 temporal features. Directly concatenate and fuse two sets of feature vectors to obtain 1 × 1 × 256 is shown in Fig. 3. This multimodal feature vector preserves the spatiotemporal feature information of the PQD signal to the maximum extent through direct concatenation.

SVM is a machine learning algorithm that can handle high-dimensional data, overcome dimensionality catastrophe, has better robustness and interpretability, and has better generalization ability to provide more reliable results. Therefore, SVM is chosen as the classifier in this paper.SVM maps multimodal perturbation features into a high dimensional space by means of a kernel function and searches for an optimal hyperplane in that space to classify the PQD³⁹. The classification problem can be transformed into a quadratic programming problem. The objective function and constraints is shown in Eq. (11).

$$\begin{aligned} \left\{ \begin{array}{l} \min \frac{1}{2}\Vert \omega \Vert ^{2}+C \sum _{i=1}^{l} \xi _{i} \\ \text{ s.t. } y_{i}\left( \omega x_{i}+b\right) \ge 1-\xi _{i}, i=1,2, \cdots , l \end{array}\right. \end{aligned}$$

(11)

In Eq. (11), C is the penalty factor, $\xi _i$ is the relaxation term, $x_i$ is the training data, $y_i$ is the classification label, $\omega $ is the weight matrix, and b is the polarization parameter. The linear kernel function $K(x_i,x_j)=|\varphi (x_i),\varphi (x_j)|$ is used to find the optimal hyperplane through global search, and the optimal classification decision function is obtained as shown in Eq. (12).

$$\begin{aligned} f(x)=sgn\sum _{i=1,j=1}^{n}a_{i}^{*} y_j K(xi,x_j)+b^{*} \end{aligned}$$

(12)

In Eq. (12), $a_{i}^{*} $ and $b^{*} $ are solutions of the above formula, n is the number of training samples, $x_j$ is the input vector, $y_j$ is the corresponding expectation.

Simulation experiments

PQD dataset

The mathematical model of the PQD signal is based on IEEE 1159-2019 standard⁴⁰. The mathematical model of a single disturbance is shown in Table 1. A single disturbance superimposes each other to form a composite disturbance. Using MATLAB to call the rand function to generate 20 types of power quality signals with random amplitude and random disturbance occurrence time within the range of parameters and at a sampling frequency of 3 kHz, and add 30 dB white noise to simulate noise interference during the acquisition process. The PQD signal includes normal voltage (S1), sag (S2), swell (S3), interruption (S4), harmonics (S5), transient oscillation (S6), flicker (S7), and transient pulse (S8), sag + oscillation (S9), swell + oscillation (S10), flicker + oscillation (S11), sag + harmonic (S12), swell + harmonic (S13), flicker + harmonic (S14), sag + pulse (S15), swell + pulse (S16), oscillation + harmonic (S17), sag + harmonic + oscillation (S18), swell + harmonic + oscillation (S19), flicker + harmonic + oscillation (S20). There are 300 samples for each type of power quality signal, for a total of 6000 sets of sample data. The sample data is divided into a training set and a testing set at 7:3, and 10% of the training set is taken out for validation.

Table 1 Mathematical model of power quality signal.

Full size table

Evaluation indicators

In this paper, accuracy recall rate, precision rate, F1 score and parameters are used as evaluation metrics. The calculation formula is shown in Eqs. (13)–(15).

$$\begin{aligned} Recall=\frac{TP}{TP+FN} \end{aligned}$$

(13)

$$\begin{aligned} Precision=\frac{TP}{TP+FP} \end{aligned}$$

(14)

$$\begin{aligned} F1{\text{-}}score=\frac{2\times Precision \times Recall}{Precision+Recall} \end{aligned}$$

(15)

Where TP denotes true positive. TN means true negative. FP means false positive. FN means false negative.

Simulation analysis

The two sub-modules of the multi-modal model only need simple training, and then the output of the prediction intermediate layer is fused through the MFF layer, and finally input into the SVM to realize PQD classification. The SFE module converts the PQD signal into an image using GAF, and the conversion result is shown in Fig. 4.

Set LResNet training for 1 round with a batch size of 30 and a learning rate of 0.001. The number of neurons of LSTM is set to 128, the maximum number of rounds is 30, the batch size is 30, the initial learning rate is 0.001, and the learning rate decreases by a factor of 0.1 every 10 rounds. Both modules use the ‘adam’ optimizer. Using cross entropy as the loss function, the mathematical formula for the function is shown in Eq. (16).

$$\begin{aligned} C=\frac{1}{n} \sum _{x}^{} [yln(a)+(1-y)ln(1-a)] \end{aligned}$$

(16)

In Eq. (16), y is the expected output and a is the actual output.

The confusion matrix is introduced to show the classification results of the model, as shown in Fig. 5.

In Fig. 5, only one S2 (red box) was incorrectly identified as S20 in the test set, and the rest of the types were accurate. To verify the role of each module, ablation experiments were done in this paper, and the experimental protocol and classification results are shown in Table 2.

Table 2 Ablation experimental results.

Full size table

As can be seen from Table 2, it can be seen that the correct recall rate of Scheme 3 has increased by 6.22% compared to Scheme 2, and the recall rate of Scheme 5 has increased by 0.11% compared to Scheme 4. This verifies that the model using the Swish activation function is better than the model using the ReLu activation function. The correct rates when using LSTM and LResNet alone are only 76.94% and 86.44% due to insufficient feature information extracted by a single modality. The fusion of the features of the two modalities resulted in a significant increase in the correctness rate, which reached 99.94%. Therefore, multi-modal features contain more information than single modal features and can fully grasp the characteristics of PQD signals. And the model does not add too many parameters, only 0.08 MB.

In order to reflect the advantages of multi-modal models, nine single modal models are built for comparison in this paper, namely GRU, AlexNet, GoogLeNet, Xception, ResNet18, ResNet50, ResNet101, EfficientNet-B0, MobileNetV2 and ShuffleNetV1. The same data set was used for the experiments. Calculate the evaluation indicators of the model according to formulas (13–15), and the comparison results are shown in Table 3.

Table 3 Comparison of multi-modal and single modal model.

Full size table

In Table 3, the accuracy of LResNet-LSTM-SVM improved by 22.94%, 16.22%, 23.38%, 4.33%, 2.55%, 4.72%, 3.72%, 1.83%, 1.61% and 1.50%, respectively, over the comparison model. Compared with GRU, the model’s parameter size only increased by 0. 03 MB. The number of parameters in this model is only 0.18%, 1.40%, 0.42%, 0.75%, 0.36%, 0.20%, 2.07%, 3.48% and 5.71% of the other 9 deep learning models. Therefore, the proposed multi-modal model in this paper not only has higher classification accuracy than the single-modal model, but also has less parameters and achieves the lightweight of the model.

Real data validation

In order to further validate the feasibility of the method, this paper validates the method using a set of real signals as inputs. The dataset is provided by the Kaggle public database and includes six categories of power quality signals, namely normal voltage (S1), sag (S2), harmonics (S5), transient pulse (S8), sag + oscillations (S9), and sag + harmonics (S12). There are 600 samples for each signal type. The confusion matrix of the classification results is shown in Fig. 6.

As can be seen in Fig. 6, 2 groups of S1 are incorrectly identified as S8 due to the influence of noise, 2 groups of S2 are identified as S8 due to smaller amplitude of voltage drop and noise interference, 11 groups of S8 are identified as S1 due to smaller amplitude of pulse, 1 group of S8 is incorrectly identified as S5 due to larger number of pulses in the sample, and 1 group of S9 is identified as S1 due to smaller amplitude of voltage drop and oscillation. Although there is a certain difference between real data and simulated data, the model of this paper still achieves 99.53% classification accuracy, which is only 0.41% less than the simulation results, thus verifying the effectiveness of the method.

Conclusion

For the problem of PQD classification, this paper proposes a PQD classification model based on multimodal LResNet-LSTM parallel feature extraction.

The model proposed in this article consists of three modules: PFE, MFF, and classification. The two sub-modules SFE and TFE of PFE are utilized to extract spatial and temporal features in parallel. Then merge the two types of features into a MSTF. Finally, the MSTF is input into SVM for classification. The model can recognize 20 types of PQDs with an accuracy rate of 99.94% and a parameter size of only 0. 08MB.
A simple-structured Light ResNet was designed based on residuals. Unlike traditional ResNet18, the residual block of LResNet uses two deep separable convolutions, greatly reducing the number of parameters in the model. And LResNet uses the Swish activation function instead of the original ReLu, which optimizes the classification performance of the model.
This article uses a high-speed cache to address data storage problems caused by asynchronous execution of SFE and TFE, but the capacity of high-speed cache is limited and unsuitable for large-scale data storage. Therefore, we are considering adopting a more appropriate method to control the operation of both modules in the future, enabling them to complete feature extraction simultaneously, and avoiding data transfers and reads.
Unlike traditional deep learning models that improve classification accuracy by increasing depth, the proposed model in this paper not only improves classification accuracy, but also reduces the number of parameters in the model.

Data availability

The simulation dataset generated during and analysed during the current study are available from the corresponding author on reasonable request. Real dataset: https://www.kaggle.com/datasets/aswarthnarayanacv/power-quality-distribution-dataset-2.

References

Bullich-Massagué, E. et al. Power plant control in large-scale photovoltaic plants: Design, implementation and validation in a 9.4 mw photovoltaic plant. IET Renew. Power Gener. 10, 50–62. https://doi.org/10.1049/iet-rpg.2015.0113 (2016).
Article Google Scholar
Khosravi, N. et al. Improvement of power quality parameters using modulated-unified power quality conditioner and switched-inductor boost converter by the optimization techniques for a hybrid AC/DC microgrid. Sci. Rep. 12, 21675. https://doi.org/10.1038/s41598-022-26001-8 (2022).
Article ADS PubMed PubMed Central Google Scholar
Wang, K. & Xi, Y. A new method of power quality disturbance classification based on deep belief network. In Journal of Physics: Conference Series Vol. 1827, 012021 https://doi.org/10.1088/1742-6596/1827/1/012021 (2021).
Chen, S., Li, Z., Pan, G. & Xu, F. Power quality disturbance recognition using empirical wavelet transform and feature selection. Electronics 11, 174. https://doi.org/10.3390/electronics11020174 (2022).
Article Google Scholar
Chamchuen, S., Siritaratiwat, A., Fuangfoo, P., Suthisopapan, P. & Khunkitti, P. Adaptive salp swarm algorithm as optimal feature selection for power quality disturbance classification. Appl. Sci. 11, 5670. https://doi.org/10.3390/app11125670 (2021).
Article CAS Google Scholar
Li, H., Yi, B., Li, Q., Ming, J. & Zhao, Z. Evaluation of DC power quality based on empirical mode decomposition and one-dimensional convolutional neural network. IEEE Access 8, 34339–34349. https://doi.org/10.1109/ACCESS.2020.2974571 (2020).
Article Google Scholar
Lopez-Ramirez, M. et al. EMD-based feature extraction for power quality disturbance classification using moments. Energies 9, 565. https://doi.org/10.3390/en9070565 (2016).
Article Google Scholar
Fu, L. et al. Power quality disturbance recognition using VMD-based feature extraction and heuristic feature selection. Appl. Sci. 9, 4901. https://doi.org/10.3390/app9224901 (2019).
Article Google Scholar
Zhong, T. et al. Power quality disturbance recognition based on multiresolution S-transform and decision tree. IEEE Access 7, 88380–88392. https://doi.org/10.1109/ACCESS.2019.2924918 (2019).
Article Google Scholar
Tang, Q., Qiu, W. & Zhou, Y. Classification of complex power quality disturbances using optimized S-transform and kernel SVM. IEEE Trans. Ind. Electron. 67, 9715–9723. https://doi.org/10.1109/TIE.2019.2952823 (2019).
Article Google Scholar
Yılmaz, A. et al. An improved automated PQD classification method for distributed generators with hybrid SVM-based approach using un-decimated wavelet transform. Int. J. Electr. Power Energy Syst. 136, 107763. https://doi.org/10.1016/j.ijepes.2021.107763 (2022).
Article Google Scholar
Lin, W.-M., Wu, C.-H., Lin, C.-H. & Cheng, F.-S. Classification of multiple power quality disturbances using support vector machine and one-versus-one approach. In 2006 International Conference on Power System Technology, 1–8, https://doi.org/10.1109/ICPST.2006.321956 (2006).
Li, M., Zhang, J., Song, J., Li, Z. & Lu, S. A clinical-oriented non-severe depression diagnosis method based on cognitive behavior of emotional conflict. IEEE Trans. Comput. Soc. Syst. 10, 131–141. https://doi.org/10.1109/TCSS.2022.3152091 (2022).
Article CAS Google Scholar
Chen, H., Wang, T., Chen, T. & Deng, W. Hyperspectral image classification based on fusing s³-PCA, 2D-SSA and random patch network. Remote Sens. 15, 3402. https://doi.org/10.3390/rs15133402 (2023).
Article ADS Google Scholar
Mahela, O. P., Shaik, A. G., Khan, B., Mahla, R. & Alhelou, H. H. Recognition of complex power quality disturbances using s-transform based ruled decision tree. IEEE Access 8, 173530–173547. https://doi.org/10.1109/ACCESS.2020.3025190 (2020).
Article Google Scholar
Bhagat, A., Nimkar, S., Dongre, K. & Ali, S. Power quality disturbance detection and classification using artificial neural network based wavelet. Int. J. Comput. Intell. Res. 13, 2043–2064 (2017).
Google Scholar
Zhang, Y., Zhang, Y. & Zhou, X. Classification of power quality disturbances using visual attention mechanism and feed-forward neural network. Measurement 188, 110390. https://doi.org/10.1016/j.measurement.2021.110390 (2022).
Article Google Scholar
Chamchuen, S., Siritaratiwat, A., Fuangfoo, P., Suthisopapan, P. & Khunkitti, P. High-accuracy power quality disturbance classification using the adaptive ABC-PSO as optimal feature selection algorithm. Energies 14, 1238. https://doi.org/10.3390/en14051238 (2021).
Article Google Scholar
Mehedi, I. M. et al. Optimal feature selection using modified cuckoo search for classification of power quality disturbances. Appl. Soft Comput. 113, 107897. https://doi.org/10.1016/j.asoc.2021.107897 (2021).
Article Google Scholar
Yılmaz, A., Küçüker, A. & Bayrak, G. Automated classification of power quality disturbances in a SOFC &PV-based distributed generator using a hybrid machine learning method with high noise immunity. Int. J. Hydrogen Energy 47, 19797–19809. https://doi.org/10.1016/j.ijhydene.2022.02.033 (2022).
Article CAS Google Scholar
Das, S. R., Mishra, A. K., Ray, P. K., Salkuti, S. R. & Kim, S.-C. Application of artificial intelligent techniques for power quality improvement in hybrid microgrid system. Electronics 11, 3826. https://doi.org/10.3390/electronics11223826 (2022).
Article Google Scholar
Abd Jamlus, N. U. I., Shahbudin, S. & Kassim, M. Power quality disturbances classification analysis using residual neural network. In 2022 IEEE 18th International Colloquium on Signal Processing & Applications (CSPA), 442–447, https://doi.org/10.1109/CSPA55076.2022.9782013 (2022).
Han, Y., Feng, Y., Yang, P., Xu, L. & Zalhaf, A. S. An efficient algorithm for atomic decomposition of power quality disturbance signals using convolutional neural network. Electric Power Syst. Res. 206, 107790. https://doi.org/10.1016/j.epsr.2022.107790 (2022).
Article Google Scholar
Ekici, S., Ucar, F., Dandil, B. & Arghandeh, R. Power quality event classification using optimized Bayesian convolutional neural networks. Electr. Eng. 103, 67–77 (2021).
Article Google Scholar
Dawood, Z. & Babulal, C. Red deer optimized recurrent neural network for the classification of power quality disturbance. Electr. Eng. https://doi.org/10.1007/s00202-022-01701-6 (2023).
Article Google Scholar
Khetarpal, P., Nagpal, N., Siano, P. & Al-Numay, M. Power quality disturbance signal segmentation and classification based on modified BI-LSTM with double attention mechanism. Authorea Preprints https://doi.org/10.22541/au.167865037.70684326/v1 (2023).
Yiğit, E., Özkaya, U., Öztürk, Ş, Singh, D. & Gritli, H. Automatic detection of power quality disturbance using convolutional neural network structure with gated recurrent unit. Mobile Inf. Syst. 1–11, 2021. https://doi.org/10.1155/2021/7917500 (2021).
Article Google Scholar
Sindi, H., Nour, M., Rawa, M., Öztürk, Ş & Polat, K. A novel hybrid deep learning approach including combination of 1D power signals and 2D signal images for power quality disturbance classification. Expert Syst. Appl. 174, 114785. https://doi.org/10.1016/j.eswa.2021.114785 (2021).
Article Google Scholar
Gao, J., Li, P., Chen, Z. & Zhang, J. A survey on deep learning for multimodal data fusion. Neural Comput. 32, 829–864. https://doi.org/10.1162/neco_a_01273 (2020).
Article MathSciNet PubMed MATH Google Scholar
Kong, Z., Zhang, C., Lv, H., Xiong, F. & Fu, Z. Multimodal feature extraction and fusion deep neural networks for short-term load forecasting. IEEE Access 8, 185373–185383. https://doi.org/10.1109/ACCESS.2020.3029828 (2020).
Article Google Scholar
Zhang, H. et al. Detection of coronary artery disease using multi-modal feature fusion and hybrid feature selection. Physiol. Meas. 41, 115007. https://doi.org/10.1088/1361-6579/abc323 (2020).
Article Google Scholar
Mehmood, A. Robust multimodal heartbeat detection using hybrid neural networks. IEEE Access 8, 82201–82214. https://doi.org/10.1109/ACCESS.2020.2990607 (2020).
Article Google Scholar
Baltrušaitis, T., Ahuja, C. & Morency, L.-P. Multimodal machine learning: A survey and taxonomy. IEEE Trans. Pattern Anal. Mach. Intell. 41, 423–443. https://doi.org/10.1109/TPAMI.2018.2798607 (2018).
Article PubMed Google Scholar
Sindi, H., Nour, M., Rawa, M., Öztürk, Ş & Polat, K. An adaptive deep learning framework to classify unknown composite power quality event using known single power quality events. Expert Syst. Appl. 178, 115023. https://doi.org/10.1016/j.eswa.2021.115023 (2021).
Article Google Scholar
Heping, P., Wenxiong, M., Yong, W., Le, L. & Zhong X. Identification method for power quality disturbances in distribution network based on transfer learning. Arch. Electr. Eng. https://doi.org/10.24425/aee.2022.141682 (2022).
Article Google Scholar
Ramachandran, P., Zoph, B. & Le, Q. V. Searching for activation functions. arXiv preprintarXiv:1710.05941 https://doi.org/10.48550/arXiv.1710.05941 (2017).
Turović, R. et al. An end-to-end deep learning method for voltage sag classification. Energies 15, 2898. https://doi.org/10.3390/en15082898 (2022).
Article Google Scholar
Wang, Q., Liang, X. & Qin, S. Research on power quality disturbance analysis and identification based on LSTM. Energy Rep. 8, 709–718. https://doi.org/10.1016/j.egyr.2022.09.170 (2022).
Article Google Scholar
Mengistu, E. et al. Utilization of Stockwell transform, support vector machine and D-STATCOM for the identification, classification and mitigation of power quality problems. Sustainability 15, 6007. https://doi.org/10.3390/su15076007 (2023).
Article Google Scholar
IEEE. Recommended Practice for Monitoring Electric Power Quality. 1159 (IEEE, 2019).

Download references

Author information

These authors contributed equally: Zhanbei Tong and Jianwei Zhong.

Authors and Affiliations

College of Intelligent Systems Science and Engineering, Hubei Minzu University, Enshi, 445000, China
Zhanbei Tong & Jianwei Zhong
State Grid Hubei Electric Power Company, Enshi Power Supply Company, Enshi, 445000, China
Jiajun Li, Jianjun Wu & Zhenwei Li

Authors

Zhanbei Tong
View author publications
Search author on:PubMed Google Scholar
Jianwei Zhong
View author publications
Search author on:PubMed Google Scholar
Jiajun Li
View author publications
Search author on:PubMed Google Scholar
Jianjun Wu
View author publications
Search author on:PubMed Google Scholar
Zhenwei Li
View author publications
Search author on:PubMed Google Scholar

Contributions

Z.T. conceived the experiment(s), Z.T. and J.Z. conducted the experiment(s), Z.T., J.Z. and J.W. analysed the results, Z.L. revised the manuscript. All authors reviewed the manuscript.

Corresponding author

Correspondence to Jianwei Zhong.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Tong, Z., Zhong, J., Li, J. et al. A power quality disturbances classification method based on multi-modal parallel feature extraction. Sci Rep 13, 17655 (2023). https://doi.org/10.1038/s41598-023-44399-7

Download citation

Received: 23 June 2023
Accepted: 07 October 2023
Published: 17 October 2023
Version of record: 17 October 2023
DOI: https://doi.org/10.1038/s41598-023-44399-7

This article is cited by

A multimodal deep learning model with differential evolution-based optimized features for classification of power quality disturbances
- Md Nurul Islam
Journal of Electrical Systems and Information Technology (2025)
Deep transfer learning approach for the classification of single and multiple power quality disturbances
- Uvesh Sipai
- Rajendrasinh Jadeja
- Kharizi Khin Ram
Scientific Reports (2025)
A novel amalgamation of pre-processing technique and CNN model for accurate classification of power quality disturbances
- Prity Soni
- Pankaj Mishra
- Debasmita Mondal
Electrical Engineering (2025)