Abstract
Transformers, as critical components of power networks, are subjected to various mechanical and electrical stresses under different loading conditions. Their windings may experience minor, recurring faults that are difficult to detect in the early stages before they become apparent. Therefore, early prediction and diagnosis of these faults are of utmost importance. In the power industry, Frequency Response Analysis (FRA) is widely used for transformer fault diagnosis. However, one of the main challenges of this method is the complex interpretation of its results. This paper addresses this challenge by presenting advanced data visualization techniques for interpreting FRA results and diagnosing transformer winding faults. To achieve this, three independent methods—Factor Analysis, Fuzzy Clustering Analysis, and Principal Component Analysis—are employed. Each method demonstrates outstanding performance due to its unique characteristics: Factor Analysis identifies complex faults by uncovering hidden factors; Fuzzy Clustering Analysis detects combined fault conditions by handling uncertainty; and Principal Component Analysis enhances interpretability by reducing data dimensionality. Based on these techniques, a two-stage identification model is developed. In the first stage, the distinction between healthy and faulty conditions is made, and in the second stage, fault classification under faulty conditions is performed. Experimental results show that the proposed techniques can effectively extract various frequency response features and identify faults with high accuracy. After implementing the proposed techniques on a transformer, the need for expertise in fault diagnosis and classification is significantly reduced. This approach helps engineers and operators interpret the results more simply and efficiently.
Introduction
Background
Transformers are crucial components of power distribution and transmission networks, ensuring that consumers receive high-quality, smooth, and reliable power. As key components of interconnected power networks, they are among the most important assets. They are vulnerable to various external and internal faults. A significant portion of power transformer failures is caused by internal winding defects, the most detrimental of which are axial displacement (AD), radial deformation (RD), and short-circuit (SC) faults1. These faults may result in catastrophic transformer failures, imposing significant costs on electricity providers through power network outages, high repair costs, potential fires, and even casualties. Therefore, it is crucial to identify internal faults in transformers at an early stage to prevent unexpected outages or costly secondary failures. Accumulated faults should be assessed before they lead to transformer failure. Thus, early-stage winding fault detection using sensitive methods is essential2.
Literature survey
In recent years, numerous techniques for detecting winding deformations have been proposed. The primary techniques include the voltage-current locus diagram (VCL)3,4, low-voltage impulse (LVI)5, short-circuit impedance (SCI)6, ultra-wideband (UWB) antenna7, and the FRA method8-9. Among these, FRA is the most commonly used because it relies on an output-to-input transfer function (TF) analysis, which has proven effective in accurately and sensitively detecting electrical and mechanical faults in power transformers10. Furthermore, FRA has become the most widely used fault diagnosis technique among commercial methods due to its straightforward, non-destructive, economical, and rapid procedure11. As a result, many studies have concentrated on the challenges in implementing, interpreting, and reproducing FRA responses. Although the theory and technique of FRA measurement are well standardized and developed, the practical interpretation of frequency responses remains challenging and calls for expert-level knowledge12. Because each transformer has a unique structure and a correspondingly unique frequency response, it is not feasible to develop a universal method for interpreting FRA results13. As a result, expert-based visual interpretation of FRA remains the prevailing practice.
As shown in Fig. 1, existing literature on FRA interpretation is classified into four main categories: knowledge-based methods, mathematical models, pattern/template-based methods, and graphical models. Methods that use expert classifiers to identify faults belong to the first category. In these techniques, necessary frequency response features are extracted and then fed to classifiers. Reference14 enhanced the classification performance by combining SVM, PSO, and GA algorithms. This approach was successfully tested on a physical transformer model. However, these techniques require extensive data. Another important algorithm in unsupervised learning is the fuzzy inference system, which also falls under this category.
The second category consists of analytical models, detailed models, and adaptive models. In the detailed model approach, each winding section is represented by a different circuit element in the circuit model15. To apply this model, changes in the circuit elements must first be mapped to the corresponding alterations in the transformer structure. The circuit model is then modified with the element variations to examine the changes in the FRA trace16. Further studies have used the finite element method to simulate the geometrical dimensions of transformers to better approximate real transformer operation conditions17. However, the conversion process introduces additional errors to the model. The main limitation of the circuit model is that incorporating certain faults is difficult.
The third category is pattern/template-based. A substantial number of fault instances are required to effectively train the neural networks in wavelet- and neural network-based techniques proposed by various researchers18-19. For instance, the neural network described in20 is not well-suited for the limited data available on transformer winding defects, and may converge to local optima rather than the global solution. When the amount of data is below a certain threshold, classifiers may not be adequately trained, leading to erroneous results. These methods have been combined with other approaches, including numerical index-based21-22 and algorithmic estimation methods20,23. In23, M. Bigdeli employed the frequency-amplitude and phase characteristics of the TF for classifying winding faults using an SVM algorithm.
The last category comprises graphical models, including fault trees, bond graphs, and diagrams. The logical structure of decision trees performs multiple binary classifications at each level by comparing two classes, thereby enhancing system performance. Reference24 developed a diagnostic decision tree that can both isolate faults and identify failure modes based on detailed model data. Bond graph techniques are used to analyze signatures/signals with significant random components and to assess the similarity between two signals.
FRA signatures exhibit many uncertain properties in addition to changes in minima and maxima caused by resonant frequencies. These characteristics reflect alterations in both the type and condition of the winding. Researchers recommend using statistical methods as a more robust approach to differentiate between healthy and faulty FRA signatures25,26. The most significant numerical indices are presented in27. These indicators, which offer improved fault diagnosis capability, can be derived by computing them for various fault conditions and comparing them to the healthy state. Using statistical criteria is more straightforward than estimating model parameters, which is prone to errors and challenges. Several statistical parameters, including maximum absolute difference, spectrum deviation, and correlation coefficient (CC), have been proposed to quantify differences between FRA measurements28,29. Recent studies30-31 demonstrate increased sensitivity when using mathematical indices within appropriate frequency ranges for fault diagnosis. Thus, distinguishing between disturbances and actual failures is essential when using indices. However, numerical indices require extensive data to establish threshold levels for each fault type and severity.
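As an illustration of the numerical indices mentioned above, the following minimal sketch computes the correlation coefficient (CC) and the maximum absolute difference between a healthy reference trace and a measured trace. The function names and the five-point magnitude values are hypothetical and serve only to show the arithmetic; they are not data from this study, and no fault thresholds are implied.

```python
import math


def correlation_coefficient(ref, meas):
    """Pearson correlation between two equally sampled FRA magnitude traces."""
    if len(ref) != len(meas) or len(ref) < 2:
        raise ValueError("traces must have the same length (at least 2 points)")
    mean_ref = sum(ref) / len(ref)
    mean_meas = sum(meas) / len(meas)
    cov = sum((a - mean_ref) * (b - mean_meas) for a, b in zip(ref, meas))
    var_ref = sum((a - mean_ref) ** 2 for a in ref)
    var_meas = sum((b - mean_meas) ** 2 for b in meas)
    # Note: a perfectly flat trace (zero variance) would make this undefined.
    return cov / math.sqrt(var_ref * var_meas)


def max_abs_difference(ref, meas):
    """Largest point-wise magnitude deviation between the two traces."""
    return max(abs(a - b) for a, b in zip(ref, meas))


# Hypothetical magnitudes in dB at five common frequency points.
healthy = [-10.0, -20.0, -35.0, -25.0, -15.0]
measured = [-10.5, -21.0, -33.0, -27.0, -15.5]

cc = correlation_coefficient(healthy, measured)
dmax = max_abs_difference(healthy, measured)
print(f"CC = {cc:.4f}, max |difference| = {dmax:.1f} dB")
```

A CC close to 1 together with a small maximum difference suggests that the two traces are nearly identical; how far either index must move before a fault is declared is exactly the threshold problem noted above.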
The main goal of this research is to enhance the interpretation of FRA results by employing new statistical methods. To address the aforementioned shortcomings, fuzzy clustering analysis (FCA), factor analysis (FA), and principal component analysis (PCA) are employed to detect transformer winding faults using FRA. These methods are used to first determine the probability of a fault occurring based on variations in the FRA trace; subsequently, the specific type of fault is categorized graphically using this approach. Identifying the fault type enables proper assessment and appropriate corrective action to be taken.
PCA and FA methods use the correlation matrix to transform a set of features into lower-dimensional sets called principal components and factors, respectively. Similarly, FCA utilizes the degree of similarity between features to group them into lower-dimensional clusters. The proposed method offers the advantages of low computational complexity and minimal effort required to determine classifier characteristics.
Research innovations
The key innovations of this study are summarized as follows:
- This study employs three independent methods, Factor Analysis (FA), Fuzzy Clustering Analysis (FCA), and Principal Component Analysis (PCA), to interpret Frequency Response Analysis (FRA) results. Each method contributes distinct strengths to the diagnosis of transformer winding faults.
- FA uncovers complex faults by identifying hidden factors and latent patterns in the data. This method is particularly effective for detecting faults that conventional approaches might miss.
- FCA is adept at managing uncertainty and detecting combined fault conditions, especially in scenarios where the boundaries between different fault states are ambiguous. This method proves valuable for diagnosing concurrent or overlapping faults.
- PCA enhances interpretability by reducing data dimensionality and filtering out noise while retaining key information. This enables clearer and more efficient identification of fault patterns.
- The developed two-stage identification model first distinguishes between healthy and faulty conditions, then classifies the specific fault type in the second stage. This approach enhances fault diagnosis accuracy.
- Advanced data visualization techniques are employed to independently present the results of each method in a visual format. This significantly simplifies result interpretation for end-users.
- Implementation of the proposed techniques substantially reduces the requirement for expert knowledge in fault diagnosis and classification. This enables engineers and operators to interpret results more efficiently.
- The reliability of each method has been validated using actual transformer data and practical experiments, confirming their effectiveness in real-world applications.
FRA concept and experimental setup
The frequency response is represented by the transfer function (TF) plot as a function of the input excitation frequency. Any physical alteration to the active components of a power transformer modifies the characteristics of its equivalent electrical circuit, consequently altering the frequency response. This principle forms the basis of FRA for transformers. In practice, impedance or admittance functions, or voltage ratios between specified terminals (e.g., end-to-end measurements), are commonly used to assess winding frequency responses. Transformer FRA can be performed using either the Sweep FRA (SFRA) or Low Voltage Impulse (LVI) method32. Both approaches involve applying excitation signals to the winding and monitoring the response signals to obtain the transformer’s frequency response characteristics. While both SFRA and Impulse FRA (IFRA) utilize excitation signals, SFRA employs a sinusoidal sweep signal, whereas IFRA uses an impulse signal – yet the outcomes are equivalent.
Currently, precise interpretation of FRA data remains challenging. Visual inspection23 remains the predominant method for FRA interpretation. This is because specific frequency ranges correlate with particular transformer faults; that is, different fault types manifest in distinct frequency ranges of the FRA trace. Following Chinese power industry standards33, FRA signatures are divided into three frequency bands: low (1–100 kHz), mid (100–600 kHz), and high (600–1000 kHz), with analyses conducted for each range. However, this method requires skilled experts and comprehensive knowledge of how various winding defects affect each frequency range, as false positives and false negatives may occur.
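The band division described above can be sketched in a few lines. This is a minimal illustration, assuming the trace is available as parallel lists of frequencies (Hz) and magnitudes (dB); the function name `split_into_bands` and the sample points are hypothetical, and the band edges are passed as a parameter so that other limits can be substituted.

```python
# Band edges in Hz, following the three ranges quoted in the text above.
BANDS_HZ = {
    "low": (1e3, 100e3),
    "mid": (100e3, 600e3),
    "high": (600e3, 1000e3),
}


def split_into_bands(freqs_hz, magnitudes_db, bands=BANDS_HZ):
    """Group (frequency, magnitude) points by band.

    Each band is treated as half-open [lower, upper); only the overall
    upper limit is inclusive. Points outside every band are dropped.
    """
    top = max(upper for _, upper in bands.values())
    grouped = {name: [] for name in bands}
    for f, mag in zip(freqs_hz, magnitudes_db):
        for name, (lower, upper) in bands.items():
            if lower <= f < upper or f == upper == top:
                grouped[name].append((f, mag))
                break
    return grouped


# Hypothetical sample points; 500 Hz lies below the low band and is dropped.
freqs = [500.0, 1e3, 50e3, 100e3, 300e3, 600e3, 1e6]
mags = [-5.0, -8.0, -20.0, -30.0, -42.0, -35.0, -28.0]
trace_bands = split_into_bands(freqs, mags)
for name, points in trace_bands.items():
    print(name, len(points))
```

Treating the intervals as half-open is a design choice made here so that a point on a shared edge (for example 100 kHz) is assigned to exactly one band.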
Evaluating the effectiveness of intelligent classifiers requires establishing a database of transformers in both good and faulty condition (with varying fault intensities). For this study, mechanical and electrical faults were artificially induced and simulated in different locations of windings at various severity levels in a high-voltage laboratory. To identify short-circuit (SC), axial displacement (AD), and radial deformation (RD) faults, FRA measurements were performed on a 1200 kVA transformer. The transformer features round-shaped windings and a round core. Its high-voltage (HV) and low-voltage (LV) windings consist of 70 disks (each with 80 turns) and a continuous layer (with 112 turns), respectively. The transformer’s insulating system comprises Kraft paper and mineral oil.
To create SC, AD, and RD faults, the leads were removed from the transformer to allow easy access to the internal turns while enabling external fault simulation. All measurements were conducted using an Omicron FRANEO 800 analyzer (Bode 100) with a precision of approximately 90 dB and a maximum amplification factor of 40 dB. The network analyzer’s tracking generator provided the measurement system’s reference signal: a 5-volt alternating voltage. SFRA measurements were obtained at 1,280 frequency points ranging from 100 Hz to 1 MHz.
Theory of suggested approaches for FRA interpretation
This research employs three statistical methods—PCA, FA, and FCA—to identify power transformer faults. The theoretical foundations of these methods and their respective implementation algorithms for fault detection are elaborated below.
Principal component analysis
PCA is a widely-used multivariate technique that transforms a set of correlated features into a set of linearly uncorrelated features called principal components. The most significant features are captured by the first few components in this transformation34. As a dimensionality reduction technique, PCA projects high-dimensional data onto a lower-dimensional space by retaining only the first principal components, thereby reducing the data size. According to Kaiser’s criterion, the number of eigenvalues exceeding one from the correlation matrix determines the number of significant principal components.
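Suppose that the goal is to study p random variables \(X_{1},\dots,X_{p}\). We consider the vector \(\mathbf{X}\) as follows:
\[ \mathbf{X} = (X_{1}, X_{2}, \dots, X_{p})^{T}. \]
We define the mean vector and the covariance matrix of the vector \(\mathbf{X}\) as follows:
\[ \boldsymbol{\mu} = E(\mathbf{X}) = (\mu_{1}, \mu_{2}, \dots, \mu_{p})^{T} \]
and
\[ \Sigma = \mathrm{Cov}(\mathbf{X}) = E\left[(\mathbf{X}-\boldsymbol{\mu})(\mathbf{X}-\boldsymbol{\mu})^{T}\right] = \begin{pmatrix} \sigma_{11} & \cdots & \sigma_{1p} \\ \vdots & \ddots & \vdots \\ \sigma_{p1} & \cdots & \sigma_{pp} \end{pmatrix}, \]
where \(\mu_{i}\) is the mean of the random variable \(X_{i}\) and \(\sigma_{ij}\) is the covariance between the random variables \(X_{i}\) and \(X_{j}\). Assume \(\lambda_{1}\ge\lambda_{2}\ge\dots\ge\lambda_{p}\) are the eigenvalues and \(\mathbf{e}_{1},\dots,\mathbf{e}_{p}\) are the corresponding eigenvectors of the matrix \(\Sigma\). Then the ith principal component (\(Y_{i}\)) is computed by:
\[ Y_{i} = \mathbf{e}_{i}^{T}\mathbf{X} = e_{i1}X_{1} + e_{i2}X_{2} + \dots + e_{ip}X_{p}, \qquad i = 1,\dots,p. \]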
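The PCA steps above, combined with Kaiser's criterion, can be sketched with NumPy as follows. This is a minimal illustration rather than the implementation used in this study (which relied on R and Minitab): the function name `pca_kaiser` and the synthetic data are assumptions. The features are standardized first, so the covariance matrix of the standardized data coincides with the correlation matrix to which Kaiser's criterion is applied.

```python
import numpy as np


def pca_kaiser(data):
    """PCA on the correlation matrix, keeping components with eigenvalue > 1.

    data: array of shape (n_observations, p_features).
    Returns all eigenvalues (descending), the retained eigenvectors,
    and the scores of the observations on the retained components.
    """
    data = np.asarray(data, dtype=float)
    # Standardize each feature (zero mean, unit sample variance).
    z = (data - data.mean(axis=0)) / data.std(axis=0, ddof=1)
    corr = np.corrcoef(data, rowvar=False)
    # eigh is appropriate because the correlation matrix is symmetric.
    eigvals, eigvecs = np.linalg.eigh(corr)
    order = np.argsort(eigvals)[::-1]          # sort descending
    eigvals, eigvecs = eigvals[order], eigvecs[:, order]
    keep = eigvals > 1.0                       # Kaiser's criterion
    scores = z @ eigvecs[:, keep]              # Y_i = e_i^T X for kept i
    return eigvals, eigvecs[:, keep], scores


# Synthetic example: two independent signals, each observed twice with noise.
rng = np.random.default_rng(0)
a = rng.normal(size=200)
b = rng.normal(size=200)
data = np.column_stack([
    a, a + 0.1 * rng.normal(size=200),
    b, b + 0.1 * rng.normal(size=200),
])
eigvals, kept_vecs, scores = pca_kaiser(data)
print("eigenvalues:", np.round(eigvals, 3))
print("retained components:", kept_vecs.shape[1])
```

Because the four synthetic features carry only two independent signals, two eigenvalues exceed 1 and the remaining two are close to zero, which mirrors the situation reported later where only the first two eigenvalues are greater than 1.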
Factor analysis
Similar to PCA, factor analysis (FA) is a widely-used multivariate approach that transforms multiple correlated features into a smaller set of features known as factors. The initial factors in this transformation capture the most significant information from the original dataset34. In contrast to PCA, FA focuses on exploring correlations between features, where features within the same factor have higher correlations while those across different factors have lower correlations. As with PCA, FA reduces high-dimensional data to fewer dimensions by retaining only the most significant factors, thereby achieving data compression. The number of factors is selected using the same method as PCA. A FA structure with m factors (where m ≤ p) can be expressed as:
\[ \mathbf{X} - \boldsymbol{\mu} = L\,\mathbf{F} + \boldsymbol{\epsilon}, \]
such that
\[ E(\mathbf{F}) = \mathbf{0}, \qquad \mathrm{Cov}(\mathbf{F}) = I_{m}, \]
and
\[ E(\boldsymbol{\epsilon}) = \mathbf{0}, \qquad \mathrm{Cov}(\boldsymbol{\epsilon}) = \Psi, \qquad \mathrm{Cov}(\boldsymbol{\epsilon}, \mathbf{F}) = 0, \]
where \(\mathbf{F}\) is the \(m \times 1\) factors vector, \(L\) is the \(p \times m\) loading matrix and \(\boldsymbol{\epsilon}\) is the \(p \times 1\) error vector.
The FA structure can be represented element-wise as
\[ X_{i} - \mu_{i} = l_{i1}F_{1} + l_{i2}F_{2} + \dots + l_{im}F_{m} + \epsilon_{i}, \qquad i = 1,\dots,p, \]
where \(l_{ij}\) is called the loading of \(X_{i}\) on the factor \(F_{j}\).
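For orthogonal FA, it can be proved that
\[ \Sigma = \mathrm{Cov}(\mathbf{X}) = LL^{T} + \Psi \]
and
\[ \mathrm{Cov}(\mathbf{X}, \mathbf{F}) = L, \]
such that
\[ \Psi = \mathrm{diag}(\psi_{1}, \dots, \psi_{p}). \]
Therefore,
\[ \sigma_{ii} = \mathrm{Var}(X_{i}) = l_{i1}^{2} + l_{i2}^{2} + \dots + l_{im}^{2} + \psi_{i} \]
and
\[ \sigma_{ik} = \mathrm{Cov}(X_{i}, X_{k}) = l_{i1}l_{k1} + l_{i2}l_{k2} + \dots + l_{im}l_{km}, \qquad i \ne k. \]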
The primary goal of FA is to determine the loading values. Different techniques, such as maximum likelihood (ML) and the principal component technique, can be used to compute the matrices L and \(\Psi\). To determine the matrix L, the principal component technique decomposes the matrix \(\Sigma\) using its eigenvalues and eigenvectors. The maximum likelihood method instead maximizes the likelihood function to estimate the matrices L and \(\Psi\). Once the loading values have been estimated, loading plots can be drawn. Loading plots have several applications, including:
- Investigating the correlations between features.
- Classifying and categorizing features.
- Determining the number of factors m.
In a loading plot, the correlation (\(r\)) of two features is indicated by the angle (\(\theta\)) between them (Fig. 2). An angle of \(\theta = 90^{\circ}\) suggests that the two features are uncorrelated (\(r = 0\)). The case \(\theta = 0^{\circ}\) corresponds to an exact positive linear relationship and the case \(\theta = 180^{\circ}\) corresponds to an exact negative linear relationship.
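To make the principal component technique for estimating L and \(\Psi\) concrete, the following worked example applies it to two standardized features with correlation \(r > 0\), retaining a single factor (m = 1). It is an illustrative sketch only: the value r = 0.8 is an assumed number chosen for the arithmetic and is not taken from the measurements in this study.

```latex
\[
R=\begin{pmatrix}1 & r\\ r & 1\end{pmatrix},\qquad
\lambda_{1}=1+r,\ \ \mathbf{e}_{1}=\tfrac{1}{\sqrt{2}}(1,\,1)^{T},\qquad
\lambda_{2}=1-r,\ \ \mathbf{e}_{2}=\tfrac{1}{\sqrt{2}}(1,\,-1)^{T}.
\]
\[
L=\sqrt{\lambda_{1}}\,\mathbf{e}_{1}=\sqrt{\tfrac{1+r}{2}}\,(1,\,1)^{T},\qquad
LL^{T}=\tfrac{1+r}{2}\begin{pmatrix}1&1\\ 1&1\end{pmatrix},\qquad
\Psi=\mathrm{diag}\!\left(R-LL^{T}\right)=\tfrac{1-r}{2}\,I_{2}.
\]
\[
r=0.8:\qquad l_{11}=l_{21}=\sqrt{0.9}\approx 0.949,\qquad
\psi_{1}=\psi_{2}=0.1,\qquad
l_{11}^{2}+\psi_{1}=0.9+0.1=1.
\]
```

Both features load strongly on the same factor, so they would be plotted close together in a loading plot. The diagonal of \(LL^{T}+\Psi\) reproduces the unit variances exactly, while the fitted off-diagonal value (0.9) differs from r = 0.8 by the part of the correlation structure that a one-factor model leaves unexplained.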
Fuzzy clustering analysis
Clustering35 is a powerful data analysis tool in data mining. Among clustering techniques, soft clustering algorithms36,37 have recently gained popularity, as studies have shown that these methods outperform conventional hard clustering algorithms38,39. Unlike hard clustering, soft clustering allows every point to belong to multiple clusters with varying membership degrees (probabilities). Among these, Fuzzy C-means (FCM) clustering40 is the most widely-used soft clustering technique. Consider n observations from a p-dimensional vector \(\mathbf{X} = (X_{1}, \dots, X_{p})^{T}\), represented as:
\[ \mathbf{X}_{i} = (X_{i1}, X_{i2}, \dots, X_{ip})^{T}, \qquad i = 1,\dots,n. \]
The aim is to group the dataset into a small number of clusters \(C_{1},\dots,C_{k}\). Let \(\mathbf{c}_{1},\dots,\mathbf{c}_{k}\) be the centroids of the members of \(C_{1},\dots,C_{k}\), and let \(u_{ij}\) be the probability of membership of \(\mathbf{X}_{i}\) in the cluster \(C_{j}\). In FCM, we minimize
\[ J_{m} = \sum_{i=1}^{n}\sum_{j=1}^{k} u_{ij}^{m}\,\lVert \mathbf{X}_{i} - \mathbf{c}_{j} \rVert^{2}, \qquad m > 1, \]
where \(\lVert\cdot\rVert\) is any norm used to measure the similarity between \(\mathbf{X}_{i}\) and \(\mathbf{c}_{j}\), and m is the fuzzification degree (not to be confused with the number of factors in FA). At each iteration, the membership probabilities and the centroids are updated as follows:
\[ u_{ij} = \frac{1}{\sum_{l=1}^{k}\left(\dfrac{\lVert \mathbf{X}_{i} - \mathbf{c}_{j} \rVert}{\lVert \mathbf{X}_{i} - \mathbf{c}_{l} \rVert}\right)^{\frac{2}{m-1}}} \]
and
\[ \mathbf{c}_{j} = \frac{\sum_{i=1}^{n} u_{ij}^{m}\,\mathbf{X}_{i}}{\sum_{i=1}^{n} u_{ij}^{m}}. \]
The procedure will stop when
\[ \lVert U^{(s+1)} - U^{(s)} \rVert < \delta, \]
where \(0<\delta<1\) is the termination tolerance and \(U^{(s)} = [u_{ij}^{(s)}]\) is the matrix of membership probabilities at iteration s.
It should be noted that the number of clusters is determined using the silhouette index.
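The FCM iteration described above can be sketched in plain Python as follows. This is a minimal illustration under stated assumptions, not the implementation used in this study: the function name `fuzzy_c_means`, the Euclidean norm, the max-norm used for the stopping rule, the random initialization, and the six sample points are all choices made here for the example.

```python
import math
import random


def fuzzy_c_means(points, k, m=2.0, delta=1e-5, max_iter=300, seed=0):
    """Fuzzy C-means with Euclidean distance.

    Returns (centroids, memberships); memberships[i][j] is the degree to
    which points[i] belongs to cluster j, and each row sums to 1.
    """
    rng = random.Random(seed)
    n, p = len(points), len(points[0])
    # Random initial memberships, each row normalized to sum to 1.
    u = []
    for _ in range(n):
        row = [rng.random() + 1e-9 for _ in range(k)]
        total = sum(row)
        u.append([v / total for v in row])

    centroids = []
    for _ in range(max_iter):
        # Centroid update: weighted mean with weights u_ij^m.
        centroids = []
        for j in range(k):
            w = [u[i][j] ** m for i in range(n)]
            w_sum = sum(w)
            centroids.append(
                [sum(w[i] * points[i][d] for i in range(n)) / w_sum
                 for d in range(p)]
            )
        # Membership update from the distances to the new centroids.
        new_u = []
        for i in range(n):
            dist = [math.dist(points[i], c) for c in centroids]
            if min(dist) == 0.0:
                # The point coincides with a centroid: full membership there.
                hit = dist.index(0.0)
                row = [1.0 if j == hit else 0.0 for j in range(k)]
            else:
                exponent = 2.0 / (m - 1.0)
                row = [
                    1.0 / sum((dist[j] / dist[l]) ** exponent for l in range(k))
                    for j in range(k)
                ]
            new_u.append(row)
        # Stopping rule: largest change in any membership below delta.
        change = max(
            abs(new_u[i][j] - u[i][j]) for i in range(n) for j in range(k)
        )
        u = new_u
        if change < delta:
            break
    return centroids, u


# Hypothetical two-dimensional points forming two well-separated groups.
points = [(0.0, 0.1), (0.1, 0.0), (0.05, 0.05),
          (5.0, 5.1), (5.1, 5.0), (5.05, 5.05)]
centroids, memberships = fuzzy_c_means(points, k=2)
labels = [row.index(max(row)) for row in memberships]
print(labels)
```

Assigning each point to the cluster with its largest membership, as done for `labels`, turns the soft result into a hard partition; points with nearly equal memberships are the borderline cases that fuzzy clustering is meant to expose.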
A comparative summary across PCA, FA, and FCA is as follows:
- PCA, FA, and FCA are three independent multivariate techniques that can be employed in the fault detection process.
- The PCA method, using covariance matrix decomposition, enables the extraction of principal components with the highest variance. By calculating eigenvalues and eigenvectors, this method represents data in a lower-dimensional space. Its computational simplicity and high execution speed are prominent features that make it suitable for preliminary data analysis.
- The FA method, by modeling latent factors, allows for the examination of complex relationships between variables. This technique, by considering measurement error and using a factor loading matrix, can identify hidden structures in data. Its robustness against noisy data and ability to work with highly correlated variables are among its key advantages.
- The FCA method, leveraging fuzzy set concepts, enables data classification under uncertainty. By assigning membership degrees and optimizing the objective function, this method demonstrates high flexibility in analyzing complex patterns. Although its computational complexity is higher, its ability to handle ambiguous and borderline data gives it a distinct advantage.
Each of these methods has unique characteristics that make them suitable for different analytical scenarios. PCA, with its focus on dimensionality reduction and computational simplicity; FA, with its ability to uncover hidden relationships and robustness against noise; and FCA, with its capability to operate under uncertainty, provide researchers with a comprehensive set of analytical tools.
Implementation of diagnostic procedure
Figure 3 illustrates the proposed clustering-based framework for Frequency Response Analysis (FRA). The systematic implementation methodology encompasses the following key phases:
Phase 1: data acquisition and preparation
- Perform FRA scans (100 Hz to 1 MHz) using the FRANEO 800 analyzer.
- Record data for healthy and faulty transformers.
- Import datasets and validate signal integrity.
- Generate comparative plots for the low band (100 Hz-100 kHz), mid band (100–600 kHz), and high band (600–1000 kHz).
Phase 2: feature extraction
The system employs three parallel methods for extracting key features:
1. PCA Method:
   - Calculation of principal components through covariance matrix decomposition.
   - Selection of components with eigenvalues greater than 1 (Kaiser criterion).
   - Retention of components covering at least 95% of data variance.
2. FA Method:
   - Modeling of latent factors by examining factor loadings.
   - Validation using Kaiser-Meyer-Olkin (KMO) test (value > 0.6).
   - Selection of factors with loadings greater than 0.7.
3. FCM Method:
   - Implementation of fuzzy clustering with fuzzification degree m = 2.
   - Calculation of membership degrees and cluster center updates.
   - Process termination upon convergence (change in the membership matrix smaller than the tolerance δ, with 0 < δ < 1).
Phase 3: fault detection
Each method employs specific fault detection criteria:
- PCA: Deviations exceeding 3σ in principal components and significant variance changes.
- FA: Changes exceeding 25% in factor loadings and residual increases.
- FCM: Cluster center displacements and membership distribution anomalies.
Phase 4: decision making
- Final output specifying fault type (SC/AD/RD).
- Graphical results presentation.
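The 3σ criterion listed for PCA in Phase 3 can be sketched as a simple check of new principal component scores against the spread of the healthy reference scores. The function name `three_sigma_flags` and all score values below are hypothetical and chosen only to illustrate the rule; how the reference scores are collected in practice is not specified here.

```python
import statistics


def three_sigma_flags(healthy_scores, new_scores):
    """Flag scores lying more than three standard deviations from the
    mean of the healthy reference scores (one principal component)."""
    mu = statistics.fmean(healthy_scores)
    sigma = statistics.stdev(healthy_scores)  # sample standard deviation
    return [abs(score - mu) > 3.0 * sigma for score in new_scores]


# Hypothetical first-component scores of repeated healthy measurements.
healthy_pc1 = [0.10, -0.05, 0.02, -0.08, 0.04, 0.00, -0.03, 0.06]
# Hypothetical scores of three new measurements to be screened.
candidates = [0.03, 0.95, -0.40]

flags = three_sigma_flags(healthy_pc1, candidates)
print(flags)
```

In this example only the first candidate stays within the healthy spread; the other two would be passed on to the fault classification stage.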
Case study
Power transformer data set
The study utilizes a three-phase 1200 kVA power transformer with a voltage rating of 20/0.4 kV (D/Yg connection), representing a typical distribution transformer configuration commonly employed in power systems. The transformer features distinct winding designs for high and low voltage sides to facilitate comprehensive fault analysis. The high-voltage winding comprises 70 interleaved disks with 80 turns per disk (totaling 5600 turns), utilizing round conductors with an inner diameter of 987 mm and outer diameter of 1086 mm. In contrast, the low-voltage winding employs a continuous layer design with 112 turns, featuring round conductors of 823 mm and 891 mm inner and outer diameters respectively. The core structure measures 2033 mm in height and 3785 mm in length, with winding heights of 1154 mm (HV) and 1249 mm (LV). The transformer’s electrical characteristics include a 2.431% impedance at 50 Hz frequency, with Kraft paper and mineral oil serving as the primary insulation materials. To systematically evaluate fault detection capabilities, ten distinct severity levels of three fundamental fault types (axial displacement, radial deformation, and short circuit) were artificially induced at strategic locations across all three phases (A, B, and C). The complete parametric details of these fault configurations are comprehensively documented in Tables 1, 2 and 3 to ensure reproducibility and facilitate comparative analysis.
a) Short Circuit Fault Simulation: This study implemented ten distinct levels of inter-disk short circuit faults at specified disk pairs in the HV winding. The investigated disk pairs included 11–12, 13–15, 18–20, 21–24, 25–26, 27–29, and 32–35, with additional analysis of combined faults at disks 11–12/25–26 and 18–20/27–29. Table 1 shows these simulations:
b) Axial Displacement: Ten progressive levels of axial displacement faults were systematically simulated by displacing the HV winding relative to the LV winding in precise 6.25 mm increments, corresponding to 1–5% of the total winding height (1154 mm). This resulted in displacement magnitudes ranging from 12.5 mm (1.08%) to 62.5 mm (5.41%), covering both minor misalignments and severe deformations observed in practical scenarios. Table 2 shows how these faults were created.
c) Radial Deformation: This study systematically investigated radial deformation (RD) by introducing ten distinct fault levels through controlled mechanical deformation of the disk winding. The simulation encompassed various deformation patterns, including single-axis (Fig. 4a), dual-axis opposed (Fig. 4b), three-axis (Fig. 4c), and four-axis symmetric (Fig. 4d) configurations, with the angular position fixed at θ = 45° for standardization. The deformation severity was quantified using the ratio d/R (Eq. 20), where d represents the radial bending magnitude (d = R - R₁) and R denotes the original average radius. The complete parameter sets for all test cases, including detailed geometric specifications and deformation patterns, are documented in Table 3, while Fig. 4a–d visually illustrates the various deformation modes.
The power transformer’s dimensions, specifications, and capacity are given in Table 4.
FRA simulation results
The impacts of RD, AD, and SC faults on the transformer’s FRA waveforms are illustrated in Figs. 5(a)–(c) for ten levels of each fault. This study employs an OMICRON analyzer to conduct the FRA measurements. Although the FRA changes are visible in Fig. 5, their analysis is highly challenging. Additionally, low-level fault recognition poses a challenge for conventional FRA. However, the proposed method automates the interpretation process and can be readily applied to FRA, as described below.
Three proposed methods simulations and results
This section reports the results of PCA, FA, and FCA for detecting transformer winding faults. The analysis was performed using R software version 3.6.1 and Minitab version 18. Subsection C.1 presents the PCA results, while Subsections C.2 and C.3 provide the FA and FCA results, respectively.
C.1. Results of PCA to diagnose winding faults
This section presents the results of using PCA to diagnose winding faults in the transformer. The eigenvalues of the correlation matrix for the low-frequency variables are shown on the left side of Fig. 6a. As shown, only the first two values are greater than 1. The right side of Fig. 6a illustrates that these variables can be classified into two categories: (1) healthy, AD, and RD systems, and (2) SC systems. Consequently, at low frequencies, the RD and AD data are similar to those of a healthy system, whereas PCA does not confirm this similarity for SC data. Similarly, the eigenvalues of the mid-frequency correlation matrix are presented on the left side of Fig. 6b, where again only the first two values exceed 1.
The right side of Fig. 6b shows that these variables can be classified into two categories: (1) healthy, SC, and RD systems, and (2) AD systems. Consequently, at mid-frequency, the SC and RD data are similar to those of a healthy system, whereas PCA does not confirm this similarity for AD data. The left side of Fig. 6c displays the eigenvalues of the high-frequency correlation matrix, where only the first two values exceed 1. The right side of Fig. 6c demonstrates that these variables can again be classified into two categories: (1) healthy, SC, and AD systems, and (2) RD systems. At high frequencies, the SC and AD data are similar to those of a healthy system, while PCA does not establish this similarity for RD data.
C.2. Results of FA to indicate winding faults
The results of FA to detect transformer winding faults are reported in this section. Figures 7a-c show that these variables can be categorized into two groups across all frequency ranges. At low frequencies (Fig. 7a), the variables separate into: (1) healthy, AD, and RD systems, and (2) SC systems. Consequently, the AD and RD data show similarity with healthy system data, while FA does not confirm this similarity for SC cases.
In the mid-frequency range (Fig. 7b), the variables divide into: (1) healthy, SC, and RD systems, and (2) AD systems. Here, the SC and RD data match healthy system data, whereas FA does not establish this correspondence for AD cases. At high frequencies (Fig. 7c), the classification yields: (1) healthy, SC, and AD systems, and (2) RD systems. While the SC and AD data correlate with healthy system data, FA does not demonstrate this correlation for RD cases.
C.3. Results of FCA to indicate winding faults
The findings of FCA to detect transformer winding defects are provided in this section. As illustrated in Figs. 8a-c, the variables can be classified into two groups according to their frequency characteristics. At low frequencies (Fig. 8a), the classification reveals: (1) healthy, AD, and RD systems, and (2) SC systems. Accordingly, the AD and RD values closely match those of healthy systems, while FCA does not demonstrate this correspondence for SC cases. In the medium frequency range (Fig. 8b), the groups comprise: (1) healthy, SC, and RD systems, and (2) AD systems. Here, the SC and RD values align with healthy system values, whereas FCA fails to establish this relationship for AD data. At high frequencies (Fig. 8c), the classification yields: (1) healthy, SC, and AD systems, and (2) RD systems. While the SC and AD measurements correspond to healthy system values, FCA does not confirm this similarity for RD data.
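Across the three methods, the results above point to the same pattern: SC cases separate from the healthy group in the low band, AD cases in the mid band, and RD cases in the high band. A minimal sketch of how this pattern could drive the two-stage decision is given below. The function name, the input format (one boolean per band stating whether the sample separates from the healthy group), and the treatment of separation in several bands as several suspected fault types are all assumptions made for illustration, not the decision logic of this study.

```python
# Band in which each fault type separates from the healthy group,
# as read from the PCA, FA, and FCA results reported above.
BAND_TO_FAULT = {"low": "SC", "mid": "AD", "high": "RD"}


def two_stage_diagnosis(separated_from_healthy):
    """Two-stage decision from per-band separation flags.

    separated_from_healthy: dict mapping "low"/"mid"/"high" to True when
    the sample falls outside the healthy group in that band.
    Stage 1 returns "healthy" or "faulty"; stage 2 lists the suspected
    fault types for a faulty sample.
    """
    suspected = [
        BAND_TO_FAULT[band]
        for band in ("low", "mid", "high")
        if separated_from_healthy.get(band, False)
    ]
    if not suspected:
        return "healthy", []
    return "faulty", suspected


print(two_stage_diagnosis({"low": False, "mid": False, "high": False}))
print(two_stage_diagnosis({"low": True, "mid": False, "high": False}))
print(two_stage_diagnosis({"low": False, "mid": True, "high": True}))
```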
Comparative analysis
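In this section, we provide a comparative analysis of our proposed methods against several alternative methods, including random forest (RF)41, artificial neural network (ANN)42, gradient boosting (GB)43, and decision tree (DT)44. The evaluation is based on four key performance metrics (precision, recall, F1-score, and accuracy), defined by the following formulas:
\[ \mathrm{Precision} = \frac{TP}{TP + FP}, \qquad \mathrm{Recall} = \frac{TP}{TP + FN}, \]
\[ \mathrm{F1\text{-}score} = \frac{2 \times \mathrm{Precision} \times \mathrm{Recall}}{\mathrm{Precision} + \mathrm{Recall}}, \]
and
\[ \mathrm{Accuracy} = \frac{TP + TN}{TP + TN + FP + FN}, \]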
where TP, FP, FN, and TN denote true positives, false positives, false negatives, and true negatives, respectively. As can be seen in Table 5, our data visualization approaches outperformed the comparative methods on most performance metrics. The FCA approach achieves the highest accuracy, 98.9%, outperforming all other methods.
FA, PCA, and RF perform similarly, with accuracies of 96.6%, 96.6%, and 96.4%, respectively. Although RF performs on par with PCA and FA, we recommend PCA and FA over RF because they are visual approaches. Overall, the comparative analysis indicates that the proposed visualization approaches match or outperform the alternative methods on most evaluated metrics, which highlights their potential as reliable and effective solutions for FRA interpretation.
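For reference, the four metrics used in this comparison can be computed from confusion-matrix counts as in the minimal sketch below. The counts in the example are made up for illustration and are not the values behind Table 5; the function name is likewise hypothetical.

```python
def classification_metrics(tp, fp, fn, tn):
    """Precision, recall, F1-score, and accuracy from confusion counts."""
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    f1 = 2 * precision * recall / (precision + recall)
    accuracy = (tp + tn) / (tp + fp + fn + tn)
    return {
        "precision": precision,
        "recall": recall,
        "f1": f1,
        "accuracy": accuracy,
    }


# Made-up counts for a binary healthy/faulty decision on 100 samples.
metrics = classification_metrics(tp=45, fp=5, fn=3, tn=47)
for name, value in metrics.items():
    print(f"{name}: {value:.4f}")
```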
Discussion
FRA is a cost-effective, accurate, and non-destructive technique for rapid assessment of transformers’ mechanical integrity. However, interpreting FRA results is not yet automated. This study proposes a novel SFRA-based methodology to automate fault detection and interpretation. The proposed approach was tested on a three-phase 50 Hz, 1.2 MVA, 20/0.4 kV transformer. Various electrical faults (SCs) and mechanical faults (AD and RD) were artificially simulated at multiple levels in transformer windings for FRA testing. The FRA signatures were divided into three frequency bands: low (100 Hz-100 kHz), mid (100–600 kHz), and high (600–1000 kHz). To overcome the interpretation challenge, the automatic detection module simultaneously employs three graphical methods (FA, FCA, and PCA) to analyze FRA results for detecting and classifying RD, AD, and SC defects. These multivariate techniques reduce high-dimensional data complexity:
1. PCA and FA transform multiple features into principal components and factors, respectively.
2. FCA, unlike PCA and FA, focuses on feature similarities rather than correlations.
3. Within-cluster feature values are highly similar, while between-cluster values show significant divergence.
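The similarity-based grouping described in the list above can be sketched with the standard fuzzy c-means update (alternating center and membership updates). This is a minimal sketch under stated assumptions: the function name `fuzzy_c_means`, the fuzzifier `m = 2`, the random initialization, and the toy points are all hypothetical, and the FCA implementation used in this study may differ.

```python
import math
import random
from typing import List, Sequence, Tuple


def fuzzy_c_means(
    points: Sequence[Sequence[float]],
    n_clusters: int,
    m: float = 2.0,        # fuzzifier (> 1); assumed value
    n_iter: int = 100,
    tol: float = 1e-6,
    seed: int = 0,
) -> Tuple[List[List[float]], List[List[float]]]:
    """Return (centers, memberships); each membership row sums to 1."""
    rng = random.Random(seed)
    n, dim = len(points), len(points[0])

    # Random initial memberships, normalized per point.
    u = []
    for _ in range(n):
        row = [rng.random() + 1e-9 for _ in range(n_clusters)]
        total = sum(row)
        u.append([v / total for v in row])

    centers: List[List[float]] = []
    for _ in range(n_iter):
        # Center update: mean of the points weighted by u_ik ** m.
        centers = []
        for k in range(n_clusters):
            w = [u[i][k] ** m for i in range(n)]
            total = sum(w)
            centers.append(
                [sum(w[i] * points[i][d] for i in range(n)) / total
                 for d in range(dim)]
            )

        # Membership update: inverse-distance weighting with exponent 2/(m-1).
        max_change = 0.0
        for i in range(n):
            dists = [math.dist(points[i], c) for c in centers]
            if any(d == 0.0 for d in dists):
                # Point coincides with a center: full membership there.
                raw = [1.0 if d == 0.0 else 0.0 for d in dists]
            else:
                raw = [d ** (-2.0 / (m - 1.0)) for d in dists]
            total = sum(raw)
            new_row = [v / total for v in raw]
            max_change = max(
                max_change, max(abs(a - b) for a, b in zip(new_row, u[i]))
            )
            u[i] = new_row
        if max_change < tol:
            break
    return centers, u
```

Unlike a hard clustering, each sample keeps a degree of membership in every cluster, so a sample lying between two fault groups is visible as a row with no dominant membership.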
The model demonstrates three key capabilities: simulating diverse fault types for FRA interpretation, facilitating easy detection of various faults, and eliminating expert-dependent interpretation of frequency responses, thereby reducing subjective judgments.
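The division of an FRA trace into the three frequency bands given in this section (low 100 Hz to 100 kHz, mid 100 to 600 kHz, high 600 to 1000 kHz) can be sketched as follows. The function name `split_into_bands` and the treatment of the shared band edges (each edge frequency is assigned to the upper band, and the 1000 kHz endpoint is kept in the high band) are assumptions of this sketch; the text does not specify them.

```python
from typing import Dict, List, Sequence, Tuple

# Band edges in Hz, taken from the text.
BANDS: Dict[str, Tuple[float, float]] = {
    "low": (100.0, 100e3),
    "mid": (100e3, 600e3),
    "high": (600e3, 1000e3),
}


def split_into_bands(
    freqs_hz: Sequence[float], magnitudes_db: Sequence[float]
) -> Dict[str, List[Tuple[float, float]]]:
    """Group (frequency, magnitude) samples by frequency band.

    Samples outside 100 Hz to 1000 kHz are dropped.
    """
    if len(freqs_hz) != len(magnitudes_db):
        raise ValueError("freqs_hz and magnitudes_db must have equal length")

    out: Dict[str, List[Tuple[float, float]]] = {name: [] for name in BANDS}
    for f, mag in zip(freqs_hz, magnitudes_db):
        for name, (lo, hi) in BANDS.items():
            # Half-open interval [lo, hi); the top edge of the last band
            # is included (assumption, see lead-in).
            if lo <= f < hi or (name == "high" and f == hi):
                out[name].append((f, mag))
                break
    return out
```

Per-band features (for example, statistical indices comparing a measured trace with a reference trace) could then be computed on each group separately before applying FA, FCA, or PCA.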
Conclusion
Interpreting transformer FRA results is challenging and has traditionally relied on error-prone human experts. This study proposes, for the first time, three automatic graphical clustering methodologies for interpreting FRA measurements across different frequency ranges. To validate these methods, a series of tests was conducted on an actual transformer. The required data were obtained through FRA measurements performed with an FRANEO 800 analyzer on both healthy and faulty transformers. The assessed faults included SC, RD, and AD defects. The measured FRA characteristics were analyzed across three sub-frequency bands. The proposed clustering techniques were validated by experimental FRA measurements obtained from artificial fault simulations. Key findings include:
1. The clustering results match the original FRA label distribution, demonstrating the method’s applicability for processing FRA data.
2. Different winding fault types form distinct clusters with clear boundaries, effectively separating the three types of winding deformation faults.
3. The optimal frequency bands for diagnosing RD, AD, and SC faults are high, medium, and low frequencies, respectively.
4. The proposed methods accurately assess fault severity.
Future research directions include:
1. Creating models to forecast faults based on FRA data and statistical methods could enhance preventive maintenance.
2. Exploring integration with IoT and smart systems could enable automated, intelligent fault detection.
3. Investigating these techniques in various transformer types would help generalize the findings.
4. Studying temperature, humidity, and pollution effects on FRA results could improve detection accuracy.
5. Additional experiments under diverse real-world conditions would strengthen the results.
Data availability
The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.
References
1. Moradzadeh, A., Pourhossein, K., Mohammadi-Ivatloo, B. & Mohammadi, F. Locating inter-turn faults in transformer windings using isometric feature mapping of frequency response traces. IEEE Trans. Ind. Inf. 17 (10), 6962–6970 (2021).
2. Guan, S. et al. Power transformer fault diagnosis method based on multi source signal fusion and fast spectral correlation. Sci. Rep. 15, 6984. https://doi.org/10.1038/s41598-025-91428-8 (2025).
3. Seifi, A. et al. A novel method mixed power flow in transmission and distribution systems by using master-slave splitting method. Electr. Power Compon. Syst. 36 (11), 1141–1149. https://doi.org/10.1080/15325000802084380 (2008).
4. Yao, C. et al. Improved online monitoring method for transformer winding deformations based on the Lissajous graphical analysis of voltage and current. IEEE Trans. Power Del. 30 (4), 1965–1973 (2015).
5. Khalili Senobari, R., Sadeh, J. & Borsi, H. Frequency response analysis (FRA) of transformers as a tool for fault detection and location: A review. Electr. Power Syst. Res. 155, 172–183 (2018).
6. Rao, T. M., Mitra, S. & Pramanik, S. A novel estimation methodology for multi-resonance equivalent inductance of transformer winding for inter-turn short-circuit fault detection. Electr. Power Syst. Res. 231, 110359. https://doi.org/10.1016/j.epsr.2024.110359 (2024).
7. Kavousi-Fard, A. et al. Optimal probabilistic reconfiguration of smart distribution grids considering penetration of plug-in hybrid electric vehicles. J. Intell. Fuzzy Syst. 29 (5), 1847–1855. https://doi.org/10.3233/IFS-151663
8. Zhao, X. et al. Enhanced detection of power transformer winding faults through 3D FRA signatures and image processing techniques. Electr. Power Syst. Res. 242, 111433. https://doi.org/10.1016/j.epsr.2025.111433 (2025).
9. Guan, S., Yang, H. & Wu, T. Transformer fault diagnosis method based on TLR-ADASYN balanced dataset. Sci. Rep. 13, 23010. https://doi.org/10.1038/s41598-023-49901-9 (2023).
10. Li, P. et al. Diagnosis of interturn faults of voltage transformer using excitation current and phase difference. Eng. Fail. Anal. 134, 105979. https://doi.org/10.1016/j.engfailanal.2021.105979 (2022).
11. Zhongyong Zhao, S. et al. Interpretation of transformer winding deformation fault by the spectral clustering of FRA signature. Int. J. Electr. Power Energy Syst. 130 (2021).
12. Mohammad Hamed, S. et al. Investigating the applicability of the finite integration technique for studying the frequency response of the transformer winding. Int. J. Electr. Power Energy Syst. 110, 411–418 (2019).
13. Vosoogh, M. A novel modification approach based on MTLBO algorithm for optimal management of renewable micro-grids in power systems. J. Intell. Fuzzy Syst. 27 (1), 465–473. https://doi.org/10.3233/IFS-131014 (2014).
14. Abbasi, A. R. Investigation of simultaneous effect of demand response and load uncertainty on distribution feeder reconfiguration. IET Gener. Transm. Distrib. 14 (8), 1438–1449. https://doi.org/10.1049/iet-gtd.2019.0854 (2020).
15. Senobari, R. et al. Frequency response analysis of transformers as a tool for fault detection and location: A review. Electr. Power Syst. Res. 155, 172–183 (2018).
16. Rahimpour, E. & Tenbohlen, S. Experimental and theoretical investigation of disc space variation in real high-voltage windings using transfer function method. IET Electr. Power Appl. 4 (6), 451–461 (2010).
17. Goodarzi, S. et al. Tight convex relaxation for TEP problem: a multiparametric disaggregation approach. IET Gener. Transm. Distrib. 14 (14), 2810–2817. https://doi.org/10.1049/iet-gtd.2019.1270 (2020).
18. Ghanizadeh, A. J. & Gharehpetian, G. B. ANN and cross-correlation based features for discrimination between electrical and mechanical defects and their localization in transformer winding. IEEE Trans. Dielectr. Electr. Insul. 21 (5), 2374–2382 (2014).
19. Guan, S., Wu, T. & Yang, H. Research on transformer fault diagnosis method based on ACGAN and CGWO-LSSVM. Sci. Rep. 14, 17676. https://doi.org/10.1038/s41598-024-68141-z (2024).
20. Bigdeli, M., Vakilian, M. & Rahimpour, E. A probabilistic neural network classifier based method for transformer winding fault identification through its transfer function measurement. Int. Trans. Electr. Energy Syst. 23 (3), 392–404 (2013).
21. Abdolmohammad, D. et al. Multi-objective dynamic generation and transmission expansion planning considering capacitor bank allocation and demand response program constrained to flexible-securable clean energy. Sustain. Energy Technol. Assess. 47, 101469. https://doi.org/10.1016/j.seta.2021.101469 (2021).
22. Cui, J. et al. Intelligent fault diagnosis and operation condition monitoring of transformer based on multi-source data fusion and mining. Sci. Rep. 15, 7606. https://doi.org/10.1038/s41598-025-91862-8 (2025).
23. Bigdeli, M., Vakilian, M. & Rahimpour, E. Transformer winding faults classification based on transfer function analysis by support vector machine. IET Electr. Power Appl. 6 (5), 268–276 (2012).
24. Li, Z. et al. Fault diagnosis of transformer windings based on decision tree and fully connected neural network. Energies 14 (6), 1531 (2021).
25. Mahvi, M., Behjat, V. & Mohseni, H. Analysis and interpretation of power auto-transformer winding axial displacement and radial deformation using frequency response analysis. Eng. Fail. Anal. 113 (2020).
26. Mahmoudi, M. R. et al. Diagnosis and clustering of power transformer winding fault types by cross-correlation and clustering analysis of FRA results. IET Gener. Transm. Distrib. 12 (19), 4301–4309 (2018).
27. Bigdeli, M. et al. Intelligent classifiers in distinguishing transformer faults using frequency response analysis. IEEE Access 9, 13981–13991 (2021).
28. Samimi, M. et al. Evaluation of numerical indices for the assessment of transformer frequency response. IET Gener. Transm. Distrib. 11 (1), 218–227 (2017).
29. Seifi, A. R. et al. Unified electrical and thermal energy expansion planning with considering network reconfiguration. IET Gener. Transm. Distrib. 9 (6), 592–601. https://doi.org/10.1049/iet-gtd.2014.0196 (2015).
30. Jianqiang et al. The actual measurement and analysis of transformer winding deformation fault degrees by FRA using mathematical indicators. Electr. Power Syst. Res. 184 (2020).
31. Chiradeja, P. & Ngaopitakkul, A. Winding-to-ground fault location in power transformer windings using combination of discrete wavelet transform and back-propagation neural network. Sci. Rep. 12, 20157. https://doi.org/10.1038/s41598-022-24434-9 (2022).
32. Seifi, A. R. et al. Considering cost and reliability in electrical and thermal distribution networks reinforcement planning. Energy 84, 25–35. https://doi.org/10.1016/j.energy.2015.01.113 (2015).
33. Professional standard of the People’s Republic of China. Frequency Response Analysis on Winding Deformation of Power Transformers, DL/T911-2004 (2004).
34. Johnson, R. A. & Wichern, D. W. Applied Multivariate Statistical Analysis (Prentice-Hall, 2002).
35. Parvin, H., Beigi, A. & Mozayani, N. A clustering ensemble learning method based on the ant colony clustering algorithm. Int. J. Appl. Comput. Math. 11 (2), 286–302 (2012).
36. Bagherinia, A., Minaei-Bidgoli, B., Hossinzadeh, M. & Parvin, H. Reliability-based fuzzy clustering ensemble. Fuzzy Sets Syst. (2020).
37. Bagherinia, A. et al. Elite fuzzy clustering ensemble based on clustering diversity and quality measures. Appl. Intell. 49 (5), 1724–1747 (2019).
38. Mojarad, M. et al. A fuzzy clustering ensemble based on cluster clustering and iterative fusion of base clusters. Appl. Intell. 49 (7), 2567–2581 (2019).
39. Nazari, A., Dehghan, A., Nejatian, S., Rezaie, V. & Parvin, H. A comprehensive study of clustering ensemble weighting based on cluster quality and diversity. Pattern Anal. Appl. 22 (1), 133–145 (2019).
40. Dunn, J. C. A fuzzy relative of the ISODATA process and its use in detecting compact well-separated clusters. J. Cybern. 3 (3), 32–57 (1973).
41. Wang, S. et al. Diagnosis of AD and DSV winding faults based on FRA method and random forest algorithm. In 2023 IEEE 4th International Conference on Electrical Materials and Power Equipment (ICEMPE), Shanghai, China, 1–4. https://doi.org/10.1109/ICEMPE57831.2023.10139451 (2023).
42. Tahir, M. & Tenbohlen, S. Transformer winding condition assessment using feedforward artificial neural network and frequency response measurements. Energies 14 (11), 3227. https://doi.org/10.3390/en14113227 (2021).
43. Çuhadaroğlu, H. & Uyaroğlu, Y. Detection of transformer faults: AI-supported machine learning application in sweep frequency response analysis. Energies 18 (10), 2481. https://doi.org/10.3390/en18102481 (2025).
44. Li, Z. H. et al. Fault diagnosis of transformer windings based on decision tree and fully connected neural network. Energies 14 (6), 1531. https://doi.org/10.3390/en14061531 (2021).
Author information
Contributions
Abdollah Hosseini: took part in methodology, software, validation, formal analysis, investigation, resources, data curation, writing – review & editing. Ali Abbasi: involved in conceptualization, methodology, software, validation, formal analysis, investigation, supervision, project administration, resources, data curation, writing – review & editing. Ali Reza Abbasi: took part in methodology, validation, formal analysis, investigation, supervision, data curation, writing – review & editing the manuscript. Mohammadreza Mahmoudi: took part in validation, investigation, data curation, writing – review & editing the manuscript.
Ethics declarations
Competing interests
The authors declare no competing interests.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Hosseini, A., Abbasi, A., Abbasi, A.R. et al. Transformer windings defects identification using frequency response analysis and advanced data visualization techniques. Sci Rep 15, 40595 (2025). https://doi.org/10.1038/s41598-025-24207-0
DOI: https://doi.org/10.1038/s41598-025-24207-0










