Introduction

Against the backdrop of the global energy transition today, wind energy, as a core component of clean and renewable energy, relies on the performance improvement of permanent magnet synchronous wind turbine (PMSWT) for efficient utilization1,2. Owing to advantages such as high-power density and reliable operation, PMSWT are widely applied in medium and high-power wind power systems3,4. In countries with advanced wind power industries, the installed capacity of PMSWT has been increasing year by year, accompanied by continuous growth in investment in technological research and development5,6,7,8. Furthermore, with the advancement of smart grid construction, PMSWT are increasingly integrated into intelligent grid management systems, which improves the stability of their power output and grid-friendliness, providing a solid guarantee for large-scale, efficient, and stable wind power generation9,10. However, the interaction between the rotor structure of PMSWT and stator slots tends to generate CT; during load operation, TR is also formed due to the superposition of armature reaction. Both not only increase the operating vibration and noise of the generator but also reduce power quality, thereby restricting its application in high-precision wind power systems11,12 Therefore, how to coordinately suppress CT and TR while considering the overall performance of the motor has become a key technical challenge that major research institutions and enterprises are striving to overcome13,14,15,16.

In17, the generator’s CT was reduced through the design of unequal pole piece widths and unequal spacing, with reductions of 50% and 42% achieved respectively, but permanent magnet utilization was not considered. In18, CT was maximally reduced by 66.29% by optimizing the flux barrier shape and rotor slotting. However, this approach only targeted a single structural parameter and did not involve coordinated optimization of TR. In19, a binary evolutionary algorithm was used for topology optimization, which considered both mass and TR but failed to integrate specific structural improvements, leading to low optimization efficiency. In20 and21, the motor structure was modified in both studies, and a multi-objective genetic algorithm was adopted for parameter optimization to reduce TR and CT, but the designed motors were only applicable to electric vehicles. In summary, existing research on wind turbine mostly focuses on single structure or single algorithm, lacking a global design approach of structural improvement - parameter stratification - multi-objective coordinated optimization, making it difficult to simultaneously meet the comprehensive requirements of low disturbance, high utilization, and high performance.

To address the above shortcomings, this paper proposes an integrated scheme of structural improvement-parameter stratification coordinated optimization. First, three improved rotor structures are designed, and the optimal basic structure is selected through comparison of electromagnetic characteristics. Second, based on parameter correlation stratification, a hierarchical strategy combining the improved multi-objective slap swarm algorithm (IMOSSA) - parameter sweep method is adopted to optimize the first-layer and second-layer parameters respectively. Finally, multi-objective coordinated optimization of CT suppression, TR reduction, permanent magnet consumption reduction, and torque density improvement is achieved, providing a new method for the high-performance design of IRSP-PMSWT.

Generator topological structure

This paper takes a 10.75 MW IRSP-PMSWT as an example. The basic parameters of the generator are shown in Table 1.

Table 1 Basic parameters of the generator.

Figure 1 shows the schematic diagrams of four generator structures. IRSP-PMSWT is the most traditional inner-rotor salient-pole permanent-magnet synchronous wind turbine. Type A is the uniform Halbach array IRSP-PMSWT. Type B is the non-uniform three segment Halbach array IRSP-PMSWT. Type C is the non-uniform two segment Halbach array IRSP-PMSWT.

To ensure the rationality of the comparative analysis, the basic parameters of each generator, including its dimensions and electrical parameters, are set according to Table 1, while the generator structure parameters are shown in Table 2.

Fig. 1
figure 1

Schematic diagram of the generator structure.

In Fig. 2, \({\beta _1}\) represents the central angle occupied by the main pole, and .\({\beta _2}\). represents the central angle occupied by the auxiliary pole, where \({{\rm P}_1}=\frac{{{\beta _1}}}{{{\beta _1}+{\beta _2}}}\)and \({{\rm P}_2}=\frac{{{\beta _2}}}{{{\beta _1}+{\beta _2}}}\). Figure 2a shows the structure of the Type A permanent magnet, with \({\beta _1}={\beta _2}\). Figure 2b shows the structure of the Type B permanent magnet, where the auxiliary pole is divided into two equal parts. Figure 2c shows the structure of the Type C permanent magnet, with \({\beta _1}>{\beta _2}\).

Table 2 Generator structure parameters.
Fig. 2
figure 2

Schematic diagram of the permanent magnet structure.

Electromagnetic characteristic analysis

Analysis of air gap magnetic density (AGMD)

The vector diagram of the AGMD is shown in Fig. 3.

$${B_{\text{r}}}={B_x}\cos \theta +{B_y}\sin \theta$$
(1)

In the formula, \({B_{\text{r}}}\) represents the radial AGMD, \({B_x}\) represents the component of the AGMD along the x axis, \({B_y}\) represents the component of the AGMD along the y axis, and \(\theta\) represents the included angle between the radial AGMD \({B_{\text{r}}}\) and the x axis.

Fig. 3
figure 3

Vector diagram of AGMD.

The expression for the total harmonic distortion (THD) of the AGMD waveform is

$$THD=\frac{{\sqrt {B_{{r2}}^{2}+B_{{r3}}^{2}+...+B_{{ri}}^{2}} }}{{{B_{r1}}}} \times 100\%$$
(2)

In the formula, \({B_{ri}}\) is the amplitude of the i-th harmonic of the radial AGMD, where \(i=2,3,...,10\);\({B_{r1}}\) is the amplitude of the fundamental wave of the radial air-gap magnetic flux density.

Fig. 4
figure 4

Waveform diagram of the AGMD.

Fig. 5
figure 5

Waveform diagram of the radial AGMD.

Fig. 6
figure 6

Fourier decomposition results of the radial AGMD.

Fig. 7
figure 7

Waveform of electromagnetic torque.

The Halbach array makes use of the different magnetization directions of the permanent magnets to enhance the magnetic field on one side of the air gap and weaken it on the other side, thus effectively increasing the magnetic density in the useful area of the air gap. Compared with the two-segment array, the non-uniform three-segment Halbach array can better regulate the magnetic field, increase the air-gap magnetic density, and reduce the harmonic content. Figure 4 shows the AGMD waveforms of four types of generator structures. From the figure, it can be seen that the root mean square (RMS) value of Type B is the largest, which is 0.587 T, while that of Type A is the smallest, being 0.543 T. The RMS values of Type C and IRSP-PMSWT are 0.582 T and 0.555 T respectively. Figure 5 shows the radial component of the air-gap magnetic flux density. The value of IRSP-PMSWT is 0.550 T, that of Type A is 0.539 T, Type B is 0.584 T, and Type C is 0.579 T. Figure 6 presents the Fourier decomposition results of the radial AGMD. For all four structures, the odd order harmonics of the 1st, 3rd, 5th, 7th, and 9th orders are large, while the even-order harmonics are small. The THD of IRSP-PMSWT is 15.23%. For Type A, the amplitudes of the 1st, 3rd, and 7th harmonics are the smallest, but the amplitudes of the 5th and 9th harmonics are quite large. Its THD reaches 24.21%, which is 11.98% higher than that of the traditional structure. The THD of Type B is 10.86%, 4.37% smaller than that of the traditional structure. It has the smallest THD and the best sinusoidal waveform. The THD of Type C is 14.04%, 1.19% smaller than that of the traditional structure.

Electromagnetic torque analysis

Electromagnetic torque is a key physical quantity of a generator and is closely related to the output power of the generator. The electromagnetic TR can reflect the quality of the generator’s operation22,23. The non-uniform three-segment Halbach array makes the interaction among the stator current, the magnetic field of the permanent magnets, and the rotor magnetic resistance more sufficient, thus generating a larger electromagnetic torque. The non-uniform three-segment Halbach array can make better use of the salient-pole effect of the rotor, enabling the magnetic resistance torque to be more fully exerted. Figure 7 shows the electromagnetic torque waveforms of the four structures. Further analysis of Fig. 7 yields Fig. 8, which shows the average electromagnetic torque, and Fig. 9, which shows the electromagnetic TR. Compared with the traditional structure, the structures of Type B and Type C are improved in terms of both electromagnetic torque and TR, while the Type A structure shows the opposite, with worse performance in both electromagnetic torque and TR.

Fig. 8
figure 8

Average electromagnetic torque.

Fig. 9
figure 9

Electromagnetic TR.

Analysis of CT

The CT is generated by the interaction between the magnetic field produced by the permanent magnets and the teeth and slots of the stator core. When the permanent magnets of the rotor are aligned with the teeth of the stator, the magnetic resistance of the air-gap magnetic circuit is relatively small, and the magnetic permeance is relatively large. When the permanent magnets of the rotor are in the middle position of the stator slots, the magnetic resistance of the air-gap magnetic circuit is relatively large, and the magnetic permeance is relatively small. As the rotor keeps rotating, the air-gap magnetic permeance fluctuates periodically between large and small values. The magnetic field energy of the permanent magnets is closely related to the air-gap magnetic permeance. The periodic change of the magnetic permeance will cause the magnetic field energy of the permanent magnets to change periodically as well, and this energy change leads to the generation of the CT. Through finite element simulation, the CT of the four structures are shown in Fig. 10.

Fig. 10
figure 10

Waveform diagram of CT.

The maximum value of the CT of IRSP-PMSWT is 0.85\(\:\text{k}\text{N}\cdot\:\text{m}\). The CT of Type A, Type B, and Type C are 1.13 \(\:\text{k}\text{N}\cdot\:\text{m}\), 0.625 \(\:\text{k}\text{N}\cdot\:\text{m}\), and 0.81 \(\:\text{k}\text{N}\cdot\:\text{m}\) respectively. The CT performance of the Type B structure is better than that of the other structures. Through the above analysis of the four structures, the Type B structure is selected for further optimization in this paper.

MOSSA-multi-objective hierarchical optimization based on slap behaviour

The For the multi variable, strongly coupled, and non-linear mathematical programming problem in the design of PMWT, many studies have done a great deal of work on aspects such as stator slot shape optimization and rotor permanent magnet optimization. However, the screening and analysis of the influence of the stator and rotor on the generator’s performance and efficiency are not thorough enough, and there is a lack of analysis from the perspective of simultaneous optimization of both the stator and rotor24,25,26.

The multi-objective slap swarm algorithm (MOSSA) is inspired by the group swimming behavior of slaps in the ocean. In this algorithm, the slap group is divided into leaders and followers. The followers update their positions based on the positions of the leaders and the previous individuals27. In this way, the coordinated swimming of the group is achieved, and the algorithm approaches the overall optimal solution while performing single objective optimization28. Although MOSSA can avoid local optima to some extent, in certain cases, it may still converge prematurely to a locally optimal non-dominated solution set, thus missing the global optimal solution or better non-dominance29,30. To minimize the occurrence of this phenomenon, this paper proposes an improvement to MOSSA by integrating the adaptive weight factor and the Levy flight perturbation strategy into MOSSA, namely AWFLFPS-MOSSA.

Adaptive weight factor theory

In the initial stage of iteration, in order to enhance the global search ability of the algorithm and effectively avoid the risk of getting trapped in local optimal solutions, the position update range of the leader needs to be set relatively large31. As the iteration progresses and enters the later stage, to ensure the accuracy of the search results, it is necessary to shrink the position update range of the leader32,33. Regarding followers, in the early stage of iteration, the influence of the previous - generation leader on them should be emphasized; while in the later stage of iteration, this influence needs to be gradually weakened. Based on the above considerations, we introduce an adaptive weight factor.

$$\left\{ {\begin{array}{*{20}{c}} {{\omega _1}={\omega _{\hbox{max} 1}}+({\omega _{\hbox{min} 1}} - {\omega _{\hbox{max} 1}}) \times {B^2}} \\ {{\omega _2}={\omega _{\hbox{min} 2}}+({\omega _{\hbox{max} 2}} - {\omega _{\hbox{min} 2}}) \times {B^2}} \end{array}} \right.$$
(3)
$$B=\frac{n}{N}$$
(4)

In the formula, \(\omega\) is the adaptive weight factor; \({\omega _{\hbox{max} 1}}\) and \({\omega _{\hbox{max} 2}}\) are the maximum values of the adaptive weight factor, which are taken as 1.5 and 0.7 respectively. \({\omega _{\hbox{min} 1}}\) and \({\omega _{\hbox{min} 2}}\) are the minimum values of the adaptive weight factor, which are taken as 1 and 0.5 respectively. B is the iteration weight factor, n is the current number of iterations, and N is the total number of iterations.

Levy flight perturbation

Levy flight perturbation refers to using the characteristics of Levy flight to perturb the current solution, thereby increasing the diversity of solutions and preventing the algorithm from prematurely converging to local optimal solutions34,35,36. To generate random step sizes that follow the Levy distribution, the Mantegna algorithm is commonly used. This algorithm generates the Levy flight step size in the following way:

First, define two random variables u and v that follow the normal distribution.

$$u\sim N\left( {0,\sigma _{u}^{2}} \right),v\sim N\left( {0,\sigma _{v}^{2}} \right)$$
(5)
$${\sigma _u}={\left[ {\frac{{\Gamma (1+\beta )\sin (\pi \beta /2)}}{{\Gamma ((1+\beta )/2){2^{(\beta - 1)/2}}\beta }}} \right]^{1/\beta }}$$
(6)
$${\sigma _v}=1$$
(7)

\(\beta\) is the index parameter of Levy flight, which is usually taken between (0, 2). The Levy flight step size s is

$$s=\frac{u}{{|v{|^{1/\beta }}}}$$
(8)

The position of the leader in the original MOSSA is…

$$x_{j}^{i}=\left\{ {\begin{array}{*{20}{c}} {{F_j}+{c_1}\left( {(u{b_j} - n{b_j}){c_2}+n{b_j}} \right)}&{{c_3} \geqslant 0.5} \\ {{F_j} - {c_1}\left( {(u{b_j} - n{b_j}){c_2}+n{b_j}} \right)}&{{c_3}<0.5} \end{array}} \right.$$
(9)
$${c_1}=2{e^{ - {{(\frac{{4{\text{t}}}}{T})}^2}}}$$
(10)

In the formula, \(x_{j}^{i}\) represents the position of the ith leader in the jth dimension, \({F_j}\) represents the position of the food source in the j-th dimensional space; \({c_2}\) and \({c_3}\) are random coefficients that maintain the search space, and both take values between [0, 1], where \({c_2}\) determines the search step-size of the current spatial position, and \({c_3}\) determines the search direction; \({c_1}\) is an important parameter for balancing the global exploration and local exploitation stages; t is the current number of iterations, and T is the total number of iterations.

Position of the followers:

$$y_{j}^{i}=\frac{1}{2}\left( {x_{j}^{i}+x_{j}^{{i - 1}}} \right)$$
(11)

In the formula, \(y_{j}^{i}\) represents the position of the i-th follower in the j-th dimension.

The position of the leader in the improved AWFLFPS-MOSSA is

$$x_{j}^{i}=\left\{ {\begin{array}{*{20}{c}} {{F_j}+{c_1}{\omega _1}\left( {(u{b_j} - n{b_j}){c_2}s+n{b_j}} \right)}&{{c_3} \geqslant 0.5} \\ {{F_j} - {c_1}{\omega _1}\left( {(u{b_j} - n{b_j}){c_2}s+n{b_j}} \right)}&{{c_3}<0.5} \end{array}} \right.$$
(12)

The position of the followers in the improved AWFLFPS-MOSSA becomes

$$y_{j}^{i}={\omega _2}x_{j}^{i}+\left( {1 - {\omega _2}} \right)x_{j}^{{i - 1}},i \geqslant 2$$
(13)

Algorithm testing

Fig. 11
figure 11

Algorithm testing results.

Fig. 12
figure 12

Multi-objective optimization process.

The DTLZ2 function is used to evaluate the performance of the MOSSA and AWFLFPS-MOSSA algorithms. The MOSSA and AWFLFPS-MOSSA algorithms are each run 30 times repeatedly. The convergence and diversity of the algorithms are judged by introducing the Generation Distance (GD) and Spacing metric (SP)37,38,39,40.

$$GD=\frac{{\sqrt {\sum\limits_{{i=1}}^{{|A|}} {L_{i}^{2}} } }}{{|A|}}$$
(14)
$$SP=\sqrt {\frac{1}{{|A|}}\sum\limits_{{i=1}}^{{|A|}} {{{({L_i} - \overline {L} )}^2}} }$$
(15)

In the formula, A is the approximate Pareto solution set obtained by the algorithm, \({L_i}\) is the Euclidean distance from the i-th solution in A to the nearest solution in the Pareto optimal solution set, \(\left| A \right|\) is the number of solutions in A, and \(\overline {L}\) is the average of the minimum Euclidean distances of all individuals.

The algorithm testing results are shown in Fig. 11. The means and standard deviations of GD and SP of AWFLFPS-MOSSA are smaller than those of MOSSA. It can be seen that AWFLFPS-MOSSA performs better than MOSSA in terms of convergence and diversity. The optimization process based on AWFLFPS-MOSSA is shown in Fig. 12.

Selection of optimization objectives and optimization parameters

To further reduce the CT of the generator, minimize the TR, enhance the operational stability of the generator, and extend its service life, this study selects the peak value of CT \({T_{cog}}\), the average electromagnetic torque \({T_{avg}}\), the TR \({T_r}\), the volume of the permanent magnet \({V_{pm}}\), and the fundamental wave amplitude of the no-load back EMF \({E_m}\) as the optimization objectives.

Table 3 Parameters to be screened and optimized and their constraints.

The parameters in Table 3 are used as the parameters to be screened and optimized for the correlation analysis of the optimization objectives. The highly correlated parameters are selected as the first layer optimization parameters, and the AWFLFPS-MOSSA is used for multi-objective optimization. The remaining parameters are used as the second layer optimization parameters, and the single-objective parameter scanning optimization is carried out. The objective function is

$$\hbox{min} \left\{ {\begin{array}{*{20}{c}} {\begin{array}{*{20}{c}} {{g_1}(x)={T_{cog}}} \\ {{g_2}(x)= - {T_{avg}}} \end{array}} \\ {{g_3}(x)={T_r}} \\ {\begin{array}{*{20}{c}} {{g_4}(x)= - {E_m}} \\ {{g_5}(x)={V_{pm}}} \end{array}} \end{array}} \right.$$
(16)
Fig. 13
figure 13

Schematic diagram of the stator slot.

Figure 13 is a schematic diagram of the stator slot. The relationship between the stator slot width \({B_{s1}}\) and \({B_{s2}}\) is

$$\frac{{{B_{s2}}}}{{{B_{s1}}}}=k$$
(17)

In the formula, \(\:\text{k}\) is the proportionality coefficient, with a value of 1.6.

Parameter correlation analysis

For the parameters with a linear relationship, this paper selects Pearson correlation analysis to analyse the parameters. The correlation coefficient is obtained by calculating the ratio of the covariance between the parameters to the product of the standard deviations of the variables41,42.

$$r=\frac{{\sum\limits_{{i=1}}^{s} {({x_i} - \bar {x})} ({y_i} - \bar {y})}}{{\sqrt {\sum\limits_{{i=1}}^{s} {{{({x_i} - \bar {x})}^2}} } \sqrt {\sum\limits_{{i=1}}^{s} {{{({y_i} - \bar {y})}^2}} } }}$$
(18)

In the formula, r is the Pearson correlation coefficient, x and y are two parameters, \(\overline {x}\) and \(\overline {y}\) are the mean values, and s is the number of parameters.

For the parameters with a non-linear relationship, this paper selects the Spearman correlation analysis to analyse the parameters. The correlation coefficient is reflected through the rank relationship between the two parameters.

$${r_s}=1 - \frac{{6\sum\limits_{{i=1}}^{s} {d_{i}^{2}} }}{{s({s^2} - 1)}}$$
(19)
$${d_i}={x_i} - {y_i}$$
(20)

In the formula, \({r_s}\)is the Spearman correlation coefficient, and \({d_i}\) is the larger difference between the corresponding ranks of parameters x and y.

Finally, the correlation coefficients between the parameters in Table 3 and the peak value of CT \({T_{{\text{cog}}}}\), the fundamental wave amplitude of no load back EMF\({E_m}\), the average electromagnetic torque \({T_{avg}}\), and the TR \({T_r}\)are obtained as shown in Fig. 14.

Fig. 14
figure 14

Correlation coefficients.

Since \({V_{pm}}\) is only related to \({\beta _1}\) and \({H_m}\), the Spearman correlation coefficient for \({V_{pm}}\) is not calculated in Fig. 14. The calculation formula for \({V_{pm}}\) is as follows:

$${V_{pm}}=\frac{{\pi {D_{si}}{\beta _1}{H_m}{L_a}}}{{2p}}$$
(21)

In the formula, \({D_{si}}\) is the inner diameter of the stator core, \({L_a}\) is the length of the core, and p is the number of pole pairs.

In order to comprehensively measure the correlation between each optimization parameter and the optimization target, the comprehensive correlation degree between each parameter and the optimization target was calculated. The calculation formula is as follows.

$${\rho _{mi}}=\frac{{\sum\limits_{{j=1}}^{n} {\left| {{\rho _{ij}}} \right|} }}{u}$$
(22)

In the formula, \({\rho _{mi}}\) represents the comprehensive correlation degree of each optimization parameter, \({\rho _{ij}}\) is the correlation coefficient of the ith optimization parameter with respect to the jth optimization target, and u is the number of optimization targets.

Fig. 15
figure 15

Comprehensive correlation coefficients.

According to the criteria for judging the strength of correlation, when \(\left| {{\rho _{mi}}} \right| \leqslant 0.3\), it is considered to have a low correlation. As can be seen from Fig. 15, parameters \({P_1}\), \({H_{\text{m}}},\delta\) and \({B_{{\text{s}}1}}\) have relatively high correlations. Therefore, they are used as the first layer optimization parameters for multi-objective optimization. Parameters \(\theta ,{H_{s0}}\) and \(B{}_{{{\text{s}}0}}\) have low correlations and are used as the second layer optimization parameters.

Polynomial response surface model based on response surface method

The Box Behnken Design (BBD) response surface method is used to establish a polynomial response surface model for the first layer optimization. The 29 sets of experimental data corresponding to the four optimization parameters are shown in Table 4.

The formula for the second order polynomial response surface model is as follows:

$$y={Q_0}+\sum\limits_{{i=1}}^{z} {{Q_i}} {x_i}+\sum\limits_{{i=1}}^{z} {{Q_{ii}}} x_{i}^{2}+\sum\limits_{{1 \leqslant i<j \leqslant z}} {{Q_{ij}}} {x_i}{x_j}+\epsilon$$
(23)

In the formula, \({x_i}\) and \({x_j}\) represent independent variables;\({Q_0}\) is the constant term, \({Q_i}\) is the coefficient of the first order term; \({Q_{ii}}\) is the coefficient of the second order term; \({Q_{ij}}\) is the coefficient of the interaction term; ε is the random error term.

The formula for the third order polynomial response surface model is as follows:

$$\begin{aligned} y& ={Q_0}+\sum\limits_{{i=1}}^{z} {{Q_i}} {x_i}+\sum\limits_{{i=1}}^{z} {{Q_{ii}}} x_{i}^{2}+\sum\limits_{{1 \leqslant i<j \leqslant k}} {{Q_{ij}}} {x_i}{x_j}+\sum\limits_{{i=1}}^{z} {{Q_{iii}}} x_{i}^{3} \\ & \quad +\sum\limits_{{1 \leqslant i<j \leqslant k}} {\sum\limits_{{i=1}}^{z} {{Q_{iji}}} } {x_i}{x_j}{x_i}+\sum\limits_{{1 \leqslant i<j<m \leqslant z}} {{Q_{ijm}}} {x_i}{x_j}{x_m}+\epsilon \end{aligned}$$
(24)

In the formula, \({Q_{iii}}\) is the coefficient of the cubic term, which is used to describe the cubic effect of the independent variable \({x_i}\); \({Q_{iji}}\) is the interaction term coefficient, representing the interaction effect among the independent variables \({x_i}\), \({x_j}\), and \({x_i}\); \({Q_{ijm}}\) is also an interaction term coefficient, indicating the interaction effect among the independent variables \({x_i}\), \({x_j}\), and \({x_m}\).

Table 4 Data of optimization parameters and optimization targets.

Finally, the target equations are obtained as shown in Eq. (25).

$$\left\{ {\begin{array}{*{20}{l}} \begin{aligned} {T_{avg}} & =2.276 - 59.405{P_1} - 0.704{H_m}+2.912\delta - 0.764{B_{s1}}+0.588{P_1}{B_{s1}} \hfill \\ & \quad - 0.03456{H_m}\delta +0.0064{H_m}{B_{s1}} \\ & \quad - 0.065\delta {B_{s1}}+24.64{P_1}^{2}+0.010H_{m}^{2}+0.054{\delta ^2}+0.019B_{{s1}}^{2} \hfill \\ \end{aligned} \\ \begin{aligned} {T_r}& =50.453+101.707{P_1} - 32.568{H_m} - 101.300\delta \\ & \quad +34.766{B_{s1}}+98.423{P_1}{H_m}+137.328{P_1}\delta - 112.278{P_1}{B_{s1}} \hfill \\ & \quad - 0.168\delta {B_{s1}} - 186.995{P_1}^{2}+25.037{\delta ^2}+0.053B_{{s1}}^{2} \\ & \quad - 73.726{P_1}^{2}{H_m}+85.898{P_1}^{2}{B_{s1}} - 33.545{P_1}{\delta ^2} \hfill \\ \end{aligned} \\ \begin{aligned} {T_{cog}} & =266404.5 - 814000.5{P_1} - 24.516{H_m} - 97515.45\delta +337.005{B_{s1}} \hfill \\ & \quad +293350.5{P_1}\delta - 574.875{P_1}{B_{s1}}+627961.5{P_1}^{2}+179.775{\delta ^2} - 222462{P_1}^{2}\delta , \hfill \\ \end{aligned} \\ \begin{aligned} {E_m} & =120.6 - 2429{P_1} - 23.45{H_m}+122.6\delta - 30.54{B_{s1}}+7.119{P_1}{H_m} \\ & \quad - 17.14{P_1}\delta +27.72{P_1}{B_{s1}} - 1.402{H_m}\delta +0.231{H_m}{B_{s1}} \hfill \\ & \quad - 2.684\delta {B_{s1}}+908.3{P_1}^{2}+0.262H_{m}^{2}+2.682{\delta ^2}+0.694B_{{s1}}^{2}. \hfill \\ \end{aligned} \end{array}} \right.$$
(25)

To determine whether the model fitting effect is close to the true value, this paper introduces the coefficient of determination \({R^2}\).

First, calculate the total sum of squares \(SST\).

$$SST=\sum\limits_{{i=1}}^{n} {{{({y_i} - \bar {y})}^2}}$$
(26)

In the formula, \({y_i}\) represents the observed values of the dependent variable, \(\bar {y}\)represents the mean value of the dependent variable, and n represents the number of dependent variables.

Then calculate the sum of squares of regression.

$$SSR=\sum\limits_{{i=1}}^{n} {{{({{\hat {y}}_i} - \bar {y})}^2}}$$
(27)

In the formula, \({\hat {y}_i}\)represents the predicted values of the dependent variable.

Coefficient of determination \({R^2}\)

$${R^2}=\frac{{SSR}}{{SST}}$$
(28)
Fig. 16
figure 16

Response surface.

By calculation, the coefficients of determination for cogging torque\({T_{cog}}\), average electromagnetic torque\({T_{a\nu g}}\), TR \({T_r}\), and the fundamental amplitude of no load back EMF\({E_m}\) are 0.985, 0.981, 0.996, and 0.999 respectively, all of which are very close to 1. This indicates that the constructed model is reasonable. Figure 16 shows the response surface model fitted according to Eq. (25).

First layer parameter optimization

During the implementation of the program in Matrix Laboratory R2021a (https://www.mathworks.com/), both the population size and the number of iteration generations are set to 200. The population size is the same as the number of iterations, both taking the value of 200. The variables are \({\beta _1}\),\({H_m}\),\(\delta\)and \({B_{s1}}\). Optimization is performed on the functions \({T_{cog}}\),\({T_{avg}}\),\({T_r}\),\({E_m}\)and \({V_{pm}}\) respectively. The obtained Pareto front is shown in Fig. 17. In this figure, the volume of the small spheres corresponds to \({V_{pm}}\), and the depth of the sphere colour represents \({E_m}\). Finally, the simulation results of Type B are used as the constraint conditions for the optimization objectives to screen the obtained solutions. The constraint formulas are as follows:

$$\left\{ {\begin{array}{*{20}{l}} {{T_{a\nu g}}>322.05{\text{kN}} \cdot {\text{m}},}&{} \\ {{T_r}<18.74\% ,}&{} \\ {{T_{cog}}<625{\text{N}} \cdot {\text{m}},}&{} \\ {{E_m}>940{\text{V}},}&{} \\ {{V_{pm}}<1303.9c{m^2}}&{} \end{array}} \right.$$
(29)

Combine Eq. (29) and Fig. 17 to screen the solutions. To obtain the optimal solution, perform normalization and weighting operations on the screened solutions. Normalization usually maps the values of variables to the interval [0, 1], and the calculation formula is as follows:

Fig. 17
figure 17

Pareto front.

$${x^{\prime}}=\frac{{x - {x_{\hbox{min} }}}}{{{x_{\hbox{max} }} - {x_{\hbox{min} }}}}$$
(30)
$$G={f_1}T_{{avg}}^{\prime }+{f_2}T_{{cog}}^{\prime }+{f_3}T_{r}^{\prime }+{f_4}E_{m}^{\prime }+{f_5}V_{{pm}}^{\prime }$$
(31)

In the formula, x represents the original variable, \({x_{\hbox{min} }}\) represents the minimum value of the variable, and \({x_{\hbox{max} }}\) represents the maximum value of the variable. In this paper, the normalized values of \({T_{cog}}\), \({T_{avg}}\),\({T_r}\),\({E_m}\), and \({V_{pm}}\) are \(T_{{cog}}^{\prime }\), \(T_{{avg}}^{\prime }\), \(T_{r}^{\prime }\), \(E_{m}^{\prime }\), and \(V_{{pm}}^{\prime }\) respectively. Among them, the larger \({T_{avg}}\) and \({E_m}\) are, the better, so positive normalization is carried out. For the generator, the smaller \({T_{cog}}\), \({T_r}\), and \({V_{pm}}\) are, the better. Therefore, reverse normalization is adopted. The weight coefficients corresponding to \({T_{cog}}\), \({T_{avg}}\), \({T_r}\), \({E_m}\), and \({V_{pm}}\) are \({f_1}\), \({f_2}\), \({f_3}\), \({f_4}\), and \({f_5}\) respectively. In this paper, the weight coefficients are taken as 0.2, 0.25, 0.25, 0.1, and 0.2 respectively. The comparison of the parameter values of the first layer before and after optimization is shown in Table 5.

Table 5 Comparison before and after the first layer parameter optimization.

Second layer parameter optimization

On the basis of the first layer optimization, parameter scanning is carried out on \({T_{cog}}\) and \({T_r}\) with \(\theta\), \({H_{{\text{s}}0}}\), and \({B_{{\text{s}}0}}\) as variables.

Fig. 18
figure 18

Three-dimensional diagram of the changing trends of \({T_{avg}}\)and \({T_r}\) with \({H_{s0}}\) and \({B_{s0}}\).

As can be seen from Fig. 18a, the overall trend of \({T_{avg}}\) is that it decreases as \({H_{s0}}\) increases, and it first increases and then decreases as C increases. The maximum value occurs at \({B_{s0}}\) = 9 mm and \({H_{s0}}\) = 1.5 mm. As can be seen from Fig. 18b, the overall trend of \({T_r}\) is that it increases as \({H_{s0}}\) increases and decreases as \({B_{s0}}\) increases. The minimum value occurs at \({B_{s0}}\)= 12 mm and \({H_{s0}}\) = 1.5 mm. For \({H_{s0}}\), the optimal point \({H_{s0}}\) = 1.5 mm is selected. For \({B_{s0}}\), \({T_{avg}}\) is given priority consideration. Therefore, finally, \({B_{s0}}\) = 9 mm and \({H_{s0}}\) = 1.5 mm are selected.

As can be seen from Fig. 19, when \({B_{s0}}\) = 9 mm and \({H_{s0}}\)= 1.5 mm, the overall trend of \({T_{avg}}\) is that it first increases and then decreases as \(\theta\) increases, and the maximum value is around \(\theta\) = 50°. The overall trend of \({T_r}\)is that it decreases as \(\theta\)increases, and the minimum value occurs when \(\theta\)= 75°. Considering the comprehensive situations of \({T_{avg}}\)and \({T_r}\), finally \(\theta\) = 55° is selected. After the second layer optimization, the values of the second layer parameters are obtained as shown in Table 6.

Fig. 19
figure 19

Shows the variation of \({T_{avg}}\) and \({T_r}\) with \(\theta\).

Table 6 Comparison before and after the second layer parameter optimization.

Although the convergence speed of the AWFLFPS-MOSSA algorithm has been improved based on the MOSSA algorithm, the overall computational complexity has increased due to the incorporation of the adaptive weight factor and Levy flight perturbation. Moreover, although the AWFLFPS-MOSSA algorithm coordinates and optimizes the parameters of the stator and rotor simultaneously, when the algorithm iterates to the neighborhood of the global optimum, the long-distance jump of the algorithm may cause the parameters to suddenly jump out of the high-precision convergence region, forcing the algorithm to re-explore the low-quality solution space. Sensitive parameters such as the pole arc coefficients of the main and auxiliary magnetic poles will mutate during the iteration of the AWFLFPS-MOSSA algorithm, leading to the distortion of the air-gap magnetic density waveform and an increase in the torque ripple, which results in the need to spend additional time for re-solving.

Analysis of optimization results

A comparative analysis is conducted among the optimized Type B, the pre-optimized Type B, Type A, and ISRSP-PMSWT. When a current excitation is applied to the generator, the image of the variation of the average electromagnetic torque \({T_{avg}}\)of the generator with the current angle under the rated current is shown in Fig. 20. When the current angle is 130°, the generator has the maximum average output torque. Figure 21 shows the waveform diagram of the electromagnetic torque. Further analysis of Fig. 21 yields Fig. 22, which depicts the average electromagnetic torque, and Fig. 23, which shows the electromagnetic TR. It can be seen from the figures that the optimized Type B generator has a maximum average electromagnetic torque of 322.7 kNm and a minimum TR of 6.8%. The electromagnetic torque characteristics of the Type B generator is superior to those of the other three structures.

Fig. 20
figure 20

Curve of average electromagnetic torque \({T_{avg}}\)versus current angle.

Fig. 21
figure 21

Comparison diagram of electromagnetic torque waveforms.

Fig. 22
figure 22

Comparison diagram of average electromagnetic torques.

Fig. 23
figure 23

Comparison diagram of electromagnetic TR.

The comparison diagram of the CT of the four types of generators is shown in Fig. 24. After optimization, the peak value of the CT of the generator is reduced. The CT of the optimized Type B is 376.49 N·m, that of the non-optimized Type B is 625 N·m, that of the IRSP-PMSWT is 850 N·m, and that of Type A is 1130 N·m.

Fig. 24
figure 24

Comparison diagram of CT.

Fig. 25
figure 25

Comparison diagram of no load back EMF.

It can be seen from Fig. 25, the comparison diagram of no load back EMF, that the waveform of the optimized Type B generator is closer to a smooth sine wave, indicating an improved waveform. The waveform distortion rate is significantly reduced.

Conclusion

In this paper, improvements are made based on the structure of the IRSP-PMSWT. Type A uniform Halbach magnetization structure, Type B non-uniform three segment Halbach magnetization structure, and Type C non-uniform two segment Halbach magnetization structure is proposed. First, electromagnetic characteristic analyses are carried out for these four structures, highlighting the advantages of the three structures of Type A, Type B, and Type C. Subsequently, the Pearson and Spearman methods are used to calculate the correlations between parameters, and the parameters are stratified. For the first layer parameters, the improved slap swarm algorithm AWFLFPS-MOSSA is employed for multi-objective optimization; for the second layer parameters, parameter scanning is used for optimization. Finally, the simulation results show that compared with the traditional structure generator, the outer rotor permanent magnet synchronous generator with the non-uniform Halbach array proposed in this paper reduces the usage of permanent magnets, increases the torque density, enhances the fundamental wave amplitude of the no load back EMF, reduces the harmonic distortion rate, and decreases the TR and CT by 15.07% and 55.71% respectively. The electromagnetic performance of the generator is significantly improved.