Abstract
Power quality disturbance (PQD) is an important problem affecting the safe and stable operation of power system. Traditional single modal methods not only have a large number of parameters, but also usually focus on only one type of feature, resulting in incomplete information about the extracted features, and it is difficult to identify complex and diverse PQD types in modern power systems. In this regard, this paper proposes a multi-modal parallel feature extraction and classification model. The model pays attention to both temporal and spatial features of PQD, which effectively improves classification accuracy. And a lightweight approach is adopted to reduce the number of parameters of the model. The model uses Long Short Term Memory Neural Network (LSTM) to extract the temporal features of one-dimensional temporal modes of PQD. At the same time, a lightweight residual network (LResNet) is designed to extract the spatial features of the two-dimensional image modality of PQD. Then, the two types of features are fused into multi-modal spatio-temporal features (MSTF). Finally, MSTF is input to a Support Vector Machine (SVM) for classification. Simulation results of 20 PQD signals show that the classification accuracy of the multi-modal model proposed in this paper reaches 99.94%, and the parameter quantity is only 0.08 MB. Compared with ResNet18, the accuracy of the proposed method has been improved by 2.55% and the number of parameters has been reduced by 99.25%.
Similar content being viewed by others
Introduction
The environmental pollution caused by traditional fossil energy sources has forced mankind to develop cleaner energy sources. To promote sustainable development strategies, modern power systems use a lot of renewable energy represented by wind and solar energy1. However, the random and intermittent nature of renewable energy sources has triggered a series of PQD problems, resulting in transformer overheating, decreased product quality, reduced equipment life, and false tripping of circuit breakers, which affect the safety and stability of the power system2. Therefore, the problem of power quality needs to be addressed urgently. An important step in improving PQD problems is to accurately identify the type of PQD3.
The recognition of PQD consists of two parts: feature extraction and classification. Common feature extraction methods include wavelet transform4,5, empirical mode decomposition6,7, variational mode decomposition8, S transform9,10, etc. According to the characteristics of the appropriate classification algorithm, such as: support vector machine11,12,13,14, decision tree15, artificial neural networ16,17, etc. Literature18 proposed using discrete wavelet to extract PQD characteristics , then combining artificial bee colony and particle swarm optimization to select optimal features, and finally using probabilistic neural network to realize PQD classification. Literature19, the cuckoo algorithm is improved to optimize the penalty factor, relaxation variable and feature number of multi-class SVM, which improves the classification accuracy to a certain extent. Literature20, for the PQD of distributed new energy generators, the pyramid algorithm without wavelet transform is used to extract features, and then the random gradient lifting tree is used for classification, which effectively improves the classification accuracy. These methods have their own characteristics and have made significant contributions to the research of PQD classification. However, the artificially extracted features of the signals make the classification results susceptible to human interference.
With the advancement of artificial intelligence technology, some new methods have provided assistance in solving the classification problem of PQD21,22. The deep learning methods represented by convolutional neural networks (CNN) transform time-series signals into two-dimensional images, and then automatically extract spatial features from the images23. Literature24 used continuous wavelets to convert PQD into color images, and then used Bayesian CNN for classification. This method achieves certain results, but ignores the influence of some temporal features. Methods represented by recurrent neural networks (RNNs) extract temporal features from one-dimensional PQD signals and then perform classification25. Literature26 proposes introducing a dual attention mechanism in Bi-LSTM to increase the weight of important features, reducing computational complexity and improving accuracy. This approach effectively extracts the temporal features of the signal, but does not focus on the impact of spatial features on classification. Literature27 proposes a hybrid neural network model to convert PQD into a perturbed image, using CNN to automatically extract spatial features of the image, and then inputting the temporal features into a gated recurrent unit (GRU) for classification. This method extracts the spatial-temporal characteristics of the PQD signal, but may result in the loss of some temporal features during image conversion. Although the above methods can eliminate the interference of human factors, these single-modality models can easily lead to varying degrees of feature loss during feature extraction, which can have an impact on PQD classification. Meanwhile, in order to achieve better classification results, these deep learning models continuously increase the depth of the model by stacking, which makes the number of parameters of the model increase significantly and improves computational difficulty.
In recent years, inspired by the multisensory (visual and auditory) perception of the world by humans, research on classification methods has gradually shifted from unimodal to multi-modal domains28. Multimodal data fusion aims to combine different distributed and different types of data in a single space, including images, audio, and measurement signals29. Currently, it is mostly used in medical diagnostics, acoustics, and vision30,31,32, research in PQD recognition suffers from the problems of feature extraction methods and a large number of model parameters33. Multimodal information is obtained by fusing different modalities of information, and its amount of information exceeds that of a single modality. In previous studies, most of them adopted a single mode approach, that is, extracting features from one-dimensional signals or two-dimensional images. However, the types of PQDs in modern power systems are complex and diverse, and most single modal methods are prone to feature loss when extracting features, resulting in an inability to fully grasp the characteristics of the signal34. To address the above issues, this paper combines these types of data and proposes a PQD classification method based on multimodality LResNet-LSTM parallel feature extraction. The model simultaneously inputs the one-dimensional time-series signal and two-dimensional disturbance images of the PQD. LSTM and LResNet are used to extract the temporal and spatial features, which are then fused into a multimodal spatiotemporal feature (MSTF), and finally input into SVM for classification.
Based on the above research, this paper proposes a PQD classification method based on multi-modal LResNet-LSTM parallel feature extraction. The model uses LResNet and LSTM to extract spatial features and temporal features in parallel, and then fuses them into MSTF, which are finally input into SVM for classification.
The main contributions of this paper are as follows:
-
This paper proposes a PQD classification model with multi-modal parallel feature extraction. The model designs spatial feature extraction (SFE) module and temporal feature extraction (TFE) module based on LResNet and LSTM respectively. The multi-modal feature fusion (MFF) module fuses the features extracted by SFE and TFE in parallel into MSTF to obtain more comprehensive feature information.
-
A Light ResNet is designed based on the idea of residuals and depth-separable convolution. Compared to the traditional ResNet18, its structure is simpler and the number of parameters is greatly reduced. The use of Swish activation functions optimizes the classification performance of the model. After being fused with the temporal features extracted by LSTM, the accuracy of the model reached 99.94
-
This article uses Cache to solve the problem of speed mismatch between SFE and TFE modules when running in parallel.
-
Unlike traditional deep learning models that improve classification accuracy by increasing depth, the proposed model in this paper not only improves classification accuracy, but also achieves the light weight of the model.
The work in this paper is organized as follows: “Multi-modal PQD classification model” section proposes a multi-modal PQD classification model. “Simulation experiments” section verifies the effectiveness of the method through simulation experiments. “Conclusion” section is the conclusion.
Multi-modal PQD classification model
PQD classification model framework
In this paper, we propose a multi-modal PQD classification model based on LResNet-LSTM, which mainly consists of a parallel feature extraction (PFE) module, an MFF module and a classification module. The model framework is shown in Fig. 1.
-
Parallel Feature Extraction: PFE consists of two submodules: SFE and TFE. The SFE first transforms PQD signals into 2D disturbance images and then extracts spatial features using LResNet. TFE normalizes the PQD signal and extracts temporal features using LSTM. After this, TFE stores the features in the Cache and waits for SFE to complete its operation.
-
Multi-modal feature fusion: This module splices the extracted spatial features and temporal features to obtain MSTF and outputs them to the next module.
-
Classification: This module uses a better performing SVM as a classifier, first inputting MSTF for training, and then inputting the test set into the trained SVM for classification.
Multi-modal PQD classification model.
Parallel feature extraction module
Due to the fact that LSTM needs to calculate the previous result before calculating the next result when processing sequences, it is unable to directly combine LSTM with LResNet into a parallel computing model. Therefore, we need to split the PFE module into two separate submodules: SFE and TFE. The two sub-modules can be used to extract the spatial features and temporal characteristics of the PQD signals respectively.
SFE module
SFE module consists of two parts: image conversion and feature extraction. Image conversion is done by gramian angular field (GAF), which maps the PQD signal into the polar coordinate system using the GAF method and converts it into a scrambled image by coding35. This method has been widely used to convert 1D signals into 2D images. The conversion process is as follows.
- Step 1::
-
The timing signal x(t) is mapped into the polar coordinate system. Where, is the angle and r is the radius.
$$\begin{aligned} {\left\{ \begin{array}{ll}\varphi =\textrm{arccos}(x) \\ r=\frac{t_i}{N},t_i\in N \end{array}\right. } \end{aligned}$$(1) - Step 2::
-
Convert to Gram matrix.
$$\begin{aligned} G=\left[ \cos \left( \varphi _{i}+\varphi _{j}\right) \right] =\left[ \begin{array}{ccc} \cos \left( \varphi _{1}+\varphi _{2}\right) &{} \cdots &{} \cos \left( \varphi _{1}+\varphi _{n}\right) \\ \vdots &{} \ddots &{} \vdots \\ \cos \left( \varphi _{n}+\varphi _{1}\right) &{} \cdots &{} \cos \left( \varphi _{n}+\varphi _{n}\right) \end{array}\right] \end{aligned}$$(2) - Step 3::
-
The imagesc function in MATLAB is called to implement the image conversion. Each element in the matrix specifies a pixel color in the image.
This article designed a lightweight residual network that focuses on the inherent relationships in the spatial domain, aiming to extract key information from images. The structure of LResNet is shown in Fig. 2. In the graph, BN refers to batch normalization and Swish is an activation function. ReLU is a commonly used activation function in neural networks. It is a left-saturated function with a derivative of 1, which prevents the gradient from decaying as quickly as the sigmoid function does, providing the advantage of speeding up training and overcoming gradient extinction. However, when the ReLu input is negative, the output is always 0, resulting in no activation. Swish overcomes the problem that ReLu is invalid when the input is negative36. The Swish functional expression is shown in Eq. (3).
Where \(\beta \) is a constant.
Structure diagram of LResNet.
PQD image generated through convolution operation 112 × 112 × 64 feature diagram. The maximum pooling layer compresses the input feature map and extracts the main features, while enhancing the robustness of the model to some extent. To achieve a lightweight model, this paper uses two deep separable convolutions (GConv) to construct a residual block, which has only 1.73% of the parameter of a standard convolutional residual block. Multiply the feature map from the residual module by the feature map from the previous layer to obtain a new feature map to prevent model degradation. Then, a convolutional layer is used to extract local features of the image. Finally, use GAP to integrate the spatial information on each channel, obtaining a 1 × 1 × 128 spatial feature vectors.
Compared to the traditional ResNet18, LResNet has a simpler structure and 99.25% fewer parameters. Its performance meets the requirements of extracting spatial features for this paper.
TFE module
The PQD signal is fed into the TFE module to extract temporal features, and the first requires normalization of the raw data. In this paper, min-max normalization is used to map the perturbed signal x to between [0,1]. The formula is Eq. (4).
Then input the normalized data into LSTM for training and extract temporal features. LSTM is a neural network that is improved to solve the problem of gradient vanishing in recurrent neural networks37,38. The input gate of LSTM is used to read data, while the forgetting gate discards useless information and preserves valid information. The output gate transmits effective information to the next moment.
The input gate of LSTM is used to read the data and the forgetting gate discards the useless information and keeps the valid information. The output gate delivers valid information to the next moment. The calculation formula are shown in Eqs. (5)–(10)
At time t, \(x_t\) is the input of time t, \(\sigma \) is sigmoid function, \(f_t\) is forgetting gate, \(i_t\) is input gate, \(g_t\) is output of tanh function, \(o_t\) is output gate, \(C_{t-1}\) is the carrier of the previous round of global information, \(h_{t-1}\) is the intermediate state output of the previous round. \(C_t\) is the carrier of this round of global information, \(h_t\) is the intermediate state output of this round. \(W_f\), \(W_i\), \(W_g\), \(W_o\) are the weights of the corresponding symbols, \(b_f\), \(b_i\), \(b_g\), \(b_o\) are the bias of the corresponding symbols.
Due to the different operating speeds of LSTM and LResNet, the time required for the TFE and SFE modules to complete feature extraction is not synchronized. Therefore, the module that completes the extraction first needs to wait for the other module to complete its operation before being input into the MFF layer for feature fusion. Experiments have shown that the TFE module runs faster, and the extracted temporal features need to be stored in memory before the SFE module completes the spatial feature extraction. Usually, data is stored in main memory, waiting for the CPU to issue a call instruction, and then the data is input into the CPU to execute the subsequent program. However, the CPU access to main memory is slow. To speed up the call, we use a cache between the CPU and main memory to store the timing characteristics. When the SFE completes its operation, the CPU reads the timing characteristics data directly from the Cache at high speed, speeding up the program’s execution.
MFF and classification
The PQD signal uses SFE to extract a size of 1 × 1 × 128 spatial features, extracted using TFE with a size of 1 × 1 × 128 temporal features. Directly concatenate and fuse two sets of feature vectors to obtain 1 × 1 × 256 is shown in Fig. 3. This multimodal feature vector preserves the spatiotemporal feature information of the PQD signal to the maximum extent through direct concatenation.
Multimodal feature fusion.
SVM is a machine learning algorithm that can handle high-dimensional data, overcome dimensionality catastrophe, has better robustness and interpretability, and has better generalization ability to provide more reliable results. Therefore, SVM is chosen as the classifier in this paper.SVM maps multimodal perturbation features into a high dimensional space by means of a kernel function and searches for an optimal hyperplane in that space to classify the PQD39. The classification problem can be transformed into a quadratic programming problem. The objective function and constraints is shown in Eq. (11).
In Eq. (11), C is the penalty factor, \(\xi _i\) is the relaxation term, \(x_i\) is the training data, \(y_i\) is the classification label, \(\omega \) is the weight matrix, and b is the polarization parameter. The linear kernel function \(K(x_i,x_j)=|\varphi (x_i),\varphi (x_j)|\) is used to find the optimal hyperplane through global search, and the optimal classification decision function is obtained as shown in Eq. (12).
In Eq. (12), \(a_{i}^{*} \) and \(b^{*} \) are solutions of the above formula, n is the number of training samples, \(x_j\) is the input vector, \(y_j\) is the corresponding expectation.
Simulation experiments
PQD dataset
The mathematical model of the PQD signal is based on IEEE 1159-2019 standard40. The mathematical model of a single disturbance is shown in Table 1. A single disturbance superimposes each other to form a composite disturbance. Using MATLAB to call the rand function to generate 20 types of power quality signals with random amplitude and random disturbance occurrence time within the range of parameters and at a sampling frequency of 3 kHz, and add 30 dB white noise to simulate noise interference during the acquisition process. The PQD signal includes normal voltage (S1), sag (S2), swell (S3), interruption (S4), harmonics (S5), transient oscillation (S6), flicker (S7), and transient pulse (S8), sag + oscillation (S9), swell + oscillation (S10), flicker + oscillation (S11), sag + harmonic (S12), swell + harmonic (S13), flicker + harmonic (S14), sag + pulse (S15), swell + pulse (S16), oscillation + harmonic (S17), sag + harmonic + oscillation (S18), swell + harmonic + oscillation (S19), flicker + harmonic + oscillation (S20). There are 300 samples for each type of power quality signal, for a total of 6000 sets of sample data. The sample data is divided into a training set and a testing set at 7:3, and 10% of the training set is taken out for validation.
Evaluation indicators
In this paper, accuracy recall rate, precision rate, F1 score and parameters are used as evaluation metrics. The calculation formula is shown in Eqs. (13)–(15).
Where TP denotes true positive. TN means true negative. FP means false positive. FN means false negative.
Simulation analysis
The two sub-modules of the multi-modal model only need simple training, and then the output of the prediction intermediate layer is fused through the MFF layer, and finally input into the SVM to realize PQD classification. The SFE module converts the PQD signal into an image using GAF, and the conversion result is shown in Fig. 4.
Set LResNet training for 1 round with a batch size of 30 and a learning rate of 0.001. The number of neurons of LSTM is set to 128, the maximum number of rounds is 30, the batch size is 30, the initial learning rate is 0.001, and the learning rate decreases by a factor of 0.1 every 10 rounds. Both modules use the ‘adam’ optimizer. Using cross entropy as the loss function, the mathematical formula for the function is shown in Eq. (16).
In Eq. (16), y is the expected output and a is the actual output.
SFE module disturbance image conversion.
The confusion matrix is introduced to show the classification results of the model, as shown in Fig. 5.
Classification results of multi-modal LResNet-LSTM-SVM.
In Fig. 5, only one S2 (red box) was incorrectly identified as S20 in the test set, and the rest of the types were accurate. To verify the role of each module, ablation experiments were done in this paper, and the experimental protocol and classification results are shown in Table 2.
As can be seen from Table 2, it can be seen that the correct recall rate of Scheme 3 has increased by 6.22% compared to Scheme 2, and the recall rate of Scheme 5 has increased by 0.11% compared to Scheme 4. This verifies that the model using the Swish activation function is better than the model using the ReLu activation function. The correct rates when using LSTM and LResNet alone are only 76.94% and 86.44% due to insufficient feature information extracted by a single modality. The fusion of the features of the two modalities resulted in a significant increase in the correctness rate, which reached 99.94%. Therefore, multi-modal features contain more information than single modal features and can fully grasp the characteristics of PQD signals. And the model does not add too many parameters, only 0.08 MB.
In order to reflect the advantages of multi-modal models, nine single modal models are built for comparison in this paper, namely GRU, AlexNet, GoogLeNet, Xception, ResNet18, ResNet50, ResNet101, EfficientNet-B0, MobileNetV2 and ShuffleNetV1. The same data set was used for the experiments. Calculate the evaluation indicators of the model according to formulas (13–15), and the comparison results are shown in Table 3.
In Table 3, the accuracy of LResNet-LSTM-SVM improved by 22.94%, 16.22%, 23.38%, 4.33%, 2.55%, 4.72%, 3.72%, 1.83%, 1.61% and 1.50%, respectively, over the comparison model. Compared with GRU, the model’s parameter size only increased by 0. 03 MB. The number of parameters in this model is only 0.18%, 1.40%, 0.42%, 0.75%, 0.36%, 0.20%, 2.07%, 3.48% and 5.71% of the other 9 deep learning models. Therefore, the proposed multi-modal model in this paper not only has higher classification accuracy than the single-modal model, but also has less parameters and achieves the lightweight of the model.
Real data validation
In order to further validate the feasibility of the method, this paper validates the method using a set of real signals as inputs. The dataset is provided by the Kaggle public database and includes six categories of power quality signals, namely normal voltage (S1), sag (S2), harmonics (S5), transient pulse (S8), sag + oscillations (S9), and sag + harmonics (S12). There are 600 samples for each signal type. The confusion matrix of the classification results is shown in Fig. 6.
Confusion matrix for real data classification results.
As can be seen in Fig. 6, 2 groups of S1 are incorrectly identified as S8 due to the influence of noise, 2 groups of S2 are identified as S8 due to smaller amplitude of voltage drop and noise interference, 11 groups of S8 are identified as S1 due to smaller amplitude of pulse, 1 group of S8 is incorrectly identified as S5 due to larger number of pulses in the sample, and 1 group of S9 is identified as S1 due to smaller amplitude of voltage drop and oscillation. Although there is a certain difference between real data and simulated data, the model of this paper still achieves 99.53% classification accuracy, which is only 0.41% less than the simulation results, thus verifying the effectiveness of the method.
Conclusion
For the problem of PQD classification, this paper proposes a PQD classification model based on multimodal LResNet-LSTM parallel feature extraction.
-
The model proposed in this article consists of three modules: PFE, MFF, and classification. The two sub-modules SFE and TFE of PFE are utilized to extract spatial and temporal features in parallel. Then merge the two types of features into a MSTF. Finally, the MSTF is input into SVM for classification. The model can recognize 20 types of PQDs with an accuracy rate of 99.94% and a parameter size of only 0. 08MB.
-
A simple-structured Light ResNet was designed based on residuals. Unlike traditional ResNet18, the residual block of LResNet uses two deep separable convolutions, greatly reducing the number of parameters in the model. And LResNet uses the Swish activation function instead of the original ReLu, which optimizes the classification performance of the model.
-
This article uses a high-speed cache to address data storage problems caused by asynchronous execution of SFE and TFE, but the capacity of high-speed cache is limited and unsuitable for large-scale data storage. Therefore, we are considering adopting a more appropriate method to control the operation of both modules in the future, enabling them to complete feature extraction simultaneously, and avoiding data transfers and reads.
-
Unlike traditional deep learning models that improve classification accuracy by increasing depth, the proposed model in this paper not only improves classification accuracy, but also reduces the number of parameters in the model.
Data availability
The simulation dataset generated during and analysed during the current study are available from the corresponding author on reasonable request. Real dataset: https://www.kaggle.com/datasets/aswarthnarayanacv/power-quality-distribution-dataset-2.
References
Bullich-Massagué, E. et al. Power plant control in large-scale photovoltaic plants: Design, implementation and validation in a 9.4 mw photovoltaic plant. IET Renew. Power Gener. 10, 50–62. https://doi.org/10.1049/iet-rpg.2015.0113 (2016).
Khosravi, N. et al. Improvement of power quality parameters using modulated-unified power quality conditioner and switched-inductor boost converter by the optimization techniques for a hybrid AC/DC microgrid. Sci. Rep. 12, 21675. https://doi.org/10.1038/s41598-022-26001-8 (2022).
Wang, K. & Xi, Y. A new method of power quality disturbance classification based on deep belief network. In Journal of Physics: Conference Series Vol. 1827, 012021 https://doi.org/10.1088/1742-6596/1827/1/012021 (2021).
Chen, S., Li, Z., Pan, G. & Xu, F. Power quality disturbance recognition using empirical wavelet transform and feature selection. Electronics 11, 174. https://doi.org/10.3390/electronics11020174 (2022).
Chamchuen, S., Siritaratiwat, A., Fuangfoo, P., Suthisopapan, P. & Khunkitti, P. Adaptive salp swarm algorithm as optimal feature selection for power quality disturbance classification. Appl. Sci. 11, 5670. https://doi.org/10.3390/app11125670 (2021).
Li, H., Yi, B., Li, Q., Ming, J. & Zhao, Z. Evaluation of DC power quality based on empirical mode decomposition and one-dimensional convolutional neural network. IEEE Access 8, 34339–34349. https://doi.org/10.1109/ACCESS.2020.2974571 (2020).
Lopez-Ramirez, M. et al. EMD-based feature extraction for power quality disturbance classification using moments. Energies 9, 565. https://doi.org/10.3390/en9070565 (2016).
Fu, L. et al. Power quality disturbance recognition using VMD-based feature extraction and heuristic feature selection. Appl. Sci. 9, 4901. https://doi.org/10.3390/app9224901 (2019).
Zhong, T. et al. Power quality disturbance recognition based on multiresolution S-transform and decision tree. IEEE Access 7, 88380–88392. https://doi.org/10.1109/ACCESS.2019.2924918 (2019).
Tang, Q., Qiu, W. & Zhou, Y. Classification of complex power quality disturbances using optimized S-transform and kernel SVM. IEEE Trans. Ind. Electron. 67, 9715–9723. https://doi.org/10.1109/TIE.2019.2952823 (2019).
Yılmaz, A. et al. An improved automated PQD classification method for distributed generators with hybrid SVM-based approach using un-decimated wavelet transform. Int. J. Electr. Power Energy Syst. 136, 107763. https://doi.org/10.1016/j.ijepes.2021.107763 (2022).
Lin, W.-M., Wu, C.-H., Lin, C.-H. & Cheng, F.-S. Classification of multiple power quality disturbances using support vector machine and one-versus-one approach. In 2006 International Conference on Power System Technology, 1–8, https://doi.org/10.1109/ICPST.2006.321956 (2006).
Li, M., Zhang, J., Song, J., Li, Z. & Lu, S. A clinical-oriented non-severe depression diagnosis method based on cognitive behavior of emotional conflict. IEEE Trans. Comput. Soc. Syst. 10, 131–141. https://doi.org/10.1109/TCSS.2022.3152091 (2022).
Chen, H., Wang, T., Chen, T. & Deng, W. Hyperspectral image classification based on fusing s3-PCA, 2D-SSA and random patch network. Remote Sens. 15, 3402. https://doi.org/10.3390/rs15133402 (2023).
Mahela, O. P., Shaik, A. G., Khan, B., Mahla, R. & Alhelou, H. H. Recognition of complex power quality disturbances using s-transform based ruled decision tree. IEEE Access 8, 173530–173547. https://doi.org/10.1109/ACCESS.2020.3025190 (2020).
Bhagat, A., Nimkar, S., Dongre, K. & Ali, S. Power quality disturbance detection and classification using artificial neural network based wavelet. Int. J. Comput. Intell. Res. 13, 2043–2064 (2017).
Zhang, Y., Zhang, Y. & Zhou, X. Classification of power quality disturbances using visual attention mechanism and feed-forward neural network. Measurement 188, 110390. https://doi.org/10.1016/j.measurement.2021.110390 (2022).
Chamchuen, S., Siritaratiwat, A., Fuangfoo, P., Suthisopapan, P. & Khunkitti, P. High-accuracy power quality disturbance classification using the adaptive ABC-PSO as optimal feature selection algorithm. Energies 14, 1238. https://doi.org/10.3390/en14051238 (2021).
Mehedi, I. M. et al. Optimal feature selection using modified cuckoo search for classification of power quality disturbances. Appl. Soft Comput. 113, 107897. https://doi.org/10.1016/j.asoc.2021.107897 (2021).
Yılmaz, A., Küçüker, A. & Bayrak, G. Automated classification of power quality disturbances in a SOFC &PV-based distributed generator using a hybrid machine learning method with high noise immunity. Int. J. Hydrogen Energy 47, 19797–19809. https://doi.org/10.1016/j.ijhydene.2022.02.033 (2022).
Das, S. R., Mishra, A. K., Ray, P. K., Salkuti, S. R. & Kim, S.-C. Application of artificial intelligent techniques for power quality improvement in hybrid microgrid system. Electronics 11, 3826. https://doi.org/10.3390/electronics11223826 (2022).
Abd Jamlus, N. U. I., Shahbudin, S. & Kassim, M. Power quality disturbances classification analysis using residual neural network. In 2022 IEEE 18th International Colloquium on Signal Processing & Applications (CSPA), 442–447, https://doi.org/10.1109/CSPA55076.2022.9782013 (2022).
Han, Y., Feng, Y., Yang, P., Xu, L. & Zalhaf, A. S. An efficient algorithm for atomic decomposition of power quality disturbance signals using convolutional neural network. Electric Power Syst. Res. 206, 107790. https://doi.org/10.1016/j.epsr.2022.107790 (2022).
Ekici, S., Ucar, F., Dandil, B. & Arghandeh, R. Power quality event classification using optimized Bayesian convolutional neural networks. Electr. Eng. 103, 67–77 (2021).
Dawood, Z. & Babulal, C. Red deer optimized recurrent neural network for the classification of power quality disturbance. Electr. Eng. https://doi.org/10.1007/s00202-022-01701-6 (2023).
Khetarpal, P., Nagpal, N., Siano, P. & Al-Numay, M. Power quality disturbance signal segmentation and classification based on modified BI-LSTM with double attention mechanism. Authorea Preprints https://doi.org/10.22541/au.167865037.70684326/v1 (2023).
Yiğit, E., Özkaya, U., Öztürk, Ş, Singh, D. & Gritli, H. Automatic detection of power quality disturbance using convolutional neural network structure with gated recurrent unit. Mobile Inf. Syst. 1–11, 2021. https://doi.org/10.1155/2021/7917500 (2021).
Sindi, H., Nour, M., Rawa, M., Öztürk, Ş & Polat, K. A novel hybrid deep learning approach including combination of 1D power signals and 2D signal images for power quality disturbance classification. Expert Syst. Appl. 174, 114785. https://doi.org/10.1016/j.eswa.2021.114785 (2021).
Gao, J., Li, P., Chen, Z. & Zhang, J. A survey on deep learning for multimodal data fusion. Neural Comput. 32, 829–864. https://doi.org/10.1162/neco_a_01273 (2020).
Kong, Z., Zhang, C., Lv, H., Xiong, F. & Fu, Z. Multimodal feature extraction and fusion deep neural networks for short-term load forecasting. IEEE Access 8, 185373–185383. https://doi.org/10.1109/ACCESS.2020.3029828 (2020).
Zhang, H. et al. Detection of coronary artery disease using multi-modal feature fusion and hybrid feature selection. Physiol. Meas. 41, 115007. https://doi.org/10.1088/1361-6579/abc323 (2020).
Mehmood, A. Robust multimodal heartbeat detection using hybrid neural networks. IEEE Access 8, 82201–82214. https://doi.org/10.1109/ACCESS.2020.2990607 (2020).
Baltrušaitis, T., Ahuja, C. & Morency, L.-P. Multimodal machine learning: A survey and taxonomy. IEEE Trans. Pattern Anal. Mach. Intell. 41, 423–443. https://doi.org/10.1109/TPAMI.2018.2798607 (2018).
Sindi, H., Nour, M., Rawa, M., Öztürk, Ş & Polat, K. An adaptive deep learning framework to classify unknown composite power quality event using known single power quality events. Expert Syst. Appl. 178, 115023. https://doi.org/10.1016/j.eswa.2021.115023 (2021).
Heping, P., Wenxiong, M., Yong, W., Le, L. & Zhong X. Identification method for power quality disturbances in distribution network based on transfer learning. Arch. Electr. Eng. https://doi.org/10.24425/aee.2022.141682 (2022).
Ramachandran, P., Zoph, B. & Le, Q. V. Searching for activation functions. arXiv preprintarXiv:1710.05941 https://doi.org/10.48550/arXiv.1710.05941 (2017).
Turović, R. et al. An end-to-end deep learning method for voltage sag classification. Energies 15, 2898. https://doi.org/10.3390/en15082898 (2022).
Wang, Q., Liang, X. & Qin, S. Research on power quality disturbance analysis and identification based on LSTM. Energy Rep. 8, 709–718. https://doi.org/10.1016/j.egyr.2022.09.170 (2022).
Mengistu, E. et al. Utilization of Stockwell transform, support vector machine and D-STATCOM for the identification, classification and mitigation of power quality problems. Sustainability 15, 6007. https://doi.org/10.3390/su15076007 (2023).
IEEE. Recommended Practice for Monitoring Electric Power Quality. 1159 (IEEE, 2019).
Author information
Authors and Affiliations
Contributions
Z.T. conceived the experiment(s), Z.T. and J.Z. conducted the experiment(s), Z.T., J.Z. and J.W. analysed the results, Z.L. revised the manuscript. All authors reviewed the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Tong, Z., Zhong, J., Li, J. et al. A power quality disturbances classification method based on multi-modal parallel feature extraction. Sci Rep 13, 17655 (2023). https://doi.org/10.1038/s41598-023-44399-7
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s41598-023-44399-7
This article is cited by
-
A multimodal deep learning model with differential evolution-based optimized features for classification of power quality disturbances
Journal of Electrical Systems and Information Technology (2025)
-
Deep transfer learning approach for the classification of single and multiple power quality disturbances
Scientific Reports (2025)
-
A novel amalgamation of pre-processing technique and CNN model for accurate classification of power quality disturbances
Electrical Engineering (2025)








