Introduction

The electric drive system (EDS), as the core of new energy vehicles, is pivotal to the overall driving safety and operational reliability. As EDS continues to evolve toward higher power density, increased rotational speeds, elevated voltage platforms, and greater integration, the intensified load under multi-physical field coupling exacerbates performance degradation and fault occurrence, thereby constraining further improvements in the reliability of new energy vehicles1. Furthermore, the EDS comprises multiple components, including the motor, reducer, and controller, which are susceptible to various failure modes such as fatigue and aging2. Traditional reliability assessment methods, which rely on failure times and specific distributions, account for a single failure mode and are mainly applicable to steady-state operating conditions3. Under complex and dynamic real-world environments, the above approach fails to adequately capture the degradation behaviors and predict the remaining useful life (RUL) of EDS. Therefore, conducting research on EDS degradation modeling and RUL prediction is of paramount importance for enhancing driving safety and extending system longevity.

With the rapid advancement of prognostics and health management technology, real-time monitoring, diagnostics, and prediction of a system’s health status and RUL enable effective assessment of safety and reliability under actual operating conditions4. Such capabilities facilitate enhanced product health management and optimized maintenance strategies. Extensive research on RUL prediction has been conducted by scholars worldwide, primarily encompassing physics-based modeling, artificial intelligence-based methods, and statistical data-driven approaches5. Physics-based approaches require explicit knowledge of failure mechanisms and typically establish degradation models based on the physics of failure6. Representative models include the Paris law for fatigue crack propagation7, the Basquin equation for fatigue life prediction8, the Coffin-Manson equation for low-cycle fatigue life9, and the Arrhenius equation for aging life prediction10. For instance, Zhao11 constructed a Paris-based degradation model using gear monitoring data and applied Bayesian updating to model parameter distributions, achieving accurate RUL prediction. Vega-Garita12 provides an overview of the aging mechanisms in lithium-ion batteries and methods for lifetime prediction, while also exploring parameter control strategies to reduce aging rates and enhance the lifespan of battery packs. However, physics-based methods depend heavily on extensive real-world failure data, which is challenging to obtain during vehicle operation and incurs significant costs. In contrast, artificial intelligence-based RUL prediction methods eliminate the need for explicit degradation models13, leveraging machine learning algorithms to establish an end-to-end mapping from monitoring data to RUL14. Li15 enhanced prediction performance by integrating knowledge-driven insights with a CNN-LSTM model. However, artificial intelligence-based approaches often neglect the underlying physical failure mechanisms, rendering the prediction process less interpretable, with model accuracy heavily reliant on the quantity and quality of training data.

Statistical data-driven approaches are primarily grounded in degradation modeling, employing prior parameter estimation and posterior distribution updates to derive the probability distribution function of the RUL16. A prerequisite for degradation modeling is the construction of health indicators (HIs) that characterize system performance degradation. Current HIs construction methods include physics-based health indicators (PHIs) and virtual health indicators (VHIs)17. PHIs are grounded in explicit failure mechanisms and reflect degradation features closely associated with failure processes, examples include direct physical quantities such as crack length and wear volume, as well as indirect quantities such as vibration and internal resistance. For instance, Feng18 utilized crack length and depth to characterize the performance degradation of welded plates, while Xu19 assessed wear in external gear pumps using parameters such as rotational speed, pressure, and flow. However, these physical quantities are difficult to obtain in large-scale real-world operations. In contrast, VHIs are constructed through data fusion techniques. Li20 developed a virtual health indicator based on frequency-domain similarity and transfer learning, which effectively identified both the health states and degradation progression of bearings. Qin21 generated an HI with evident degradation trends from multi-dimensional features of vibration signals by incorporating a variational autoencoder model. Nevertheless, a key challenge persists in the ambiguous correlation between VHIs and physical degradation patterns, which compromises RUL prediction accuracy. For establishing degradation models based on HIs, common empirical models include linear22, exponential23, and power-law models. Additionally, stochastic processes such as the Gamma process24, Inverse Gaussian process25, and Wiener process26 are widely applied. The Gamma and Inverse Gaussian processes are suitable for modeling continuous monotonic degradation data27, whereas the Wiener process is better suited for non-monotonic degradation processes but assumes stable operating environments and constant degradation rates. For the subject of this study, degradation modeling must align with actual degradation patterns to ensure the precision and effectiveness of RUL predictions.

Previous studies typically assume that the exceedance of a single performance degradation indicator beyond its threshold leads to a single failure mode. For complex systems involving coupled multiple failure modes, researchers have integrated multiple degradation indicators into a composite health indicator as a system performance evaluation metric, establishing a unified failure threshold to guide life prediction and maintenance decision-making28. The failure that occurs when the fused hi exceeds the composite threshold is referred to as a fusion failure mode29. For instance, when the weighted fusion of multiple degradation indicators in a gas turbine exceeds the total failure threshold, the turbine is considered to have failed30. Ye31 proposed a RUL prediction method based on parallel multi-attention and contrastive fusion across multi-source domains, capable of extracting composite fault features from the main reducer and validating the method’s effectiveness using full life cycle vibration data. Zhang et al.32 analyzed multiple performance indicators to classify different fault types and developed corresponding RUL prediction models. Under fusion failures, they introduced a weighting strategy to account for the contributions of different indicators to the overall system health state. Li et al.33 further proposed a generalized cumulative degradation model for wind turbines, in which root mean square values of vibration signals and variations in rotational speed are used to characterize degradation processes. These studies provide valuable insights and references for constructing health indicators and modeling degradation in EDS.

Moreover, for complex systems exhibiting nonlinear degradation, the RUL prediction process is subject to various uncertainties, including measurement uncertainty in sample data, random uncertainty arising from individual differences, and parameter uncertainty in degradation modeling, all of which ultimately influence the prediction outcomes34,35. Niu36 applied the Monte Carlo method, randomly sampling values from uncertain parameter estimation intervals, conducting multiple RUL prediction simulations, and adopting the average as the final predicted value. Zhang37 leveraged prior knowledge from physical models alongside statistical patterns from empirical data to develop a random effects Inverse Gaussian process model for tools, employing Bayesian methods for estimating and updating unknown parameters, deriving the RUL probability density function, and quantifying prediction uncertainty. Therefore, a central focus of this study is to incorporate multiple failure modes in EDS to construct realistic health indicators, and enable RUL prediction with uncertainty quantification across diverse user scenarios. The main contributions of this study are as follows:

(1) A novel method is proposed for constructing composite health indicators that integrates multidimensional damage contribution features of the EDS, with the slope of linear fitting utilized to characterize the variability in degradation rates across different users.

(2) Through multi-prototype bench life testing, the degradation trajectories and inherent uncertainties of EDS are elucidated. By accounting for the differences in degradation rates between accelerated and unaccelerated spectrum, nonlinear degradation trajectories are generated, and the corresponding degradation models are established.

(3) The expectation-maximization (EM) parameter estimation and Bayesian update methods are employed to predict the RUL and quantify uncertainty for EDS across various service stages, demonstrating applicability across diverse user scenarios.

This study primarily presents a nonlinear degradation modeling and RUL prediction method for complex systems with multiple failure modes. The rest of this study is organized as follows: Sect. ❝Composite health indicator construction❞ focuses on the construction of composite health indicators. Section ❝Nonlinear degradation modeling❞ details the nonlinear degradation modeling. Section ❝Model construction and degradation trajectory generation❞ discusses the RUL prediction model. Section ❝Parameter estimation with the EM algorithm❞ provides case studies and comparative analysis, and Sect. ❝Parameter updates based on bayesian theory❞ concludes the study. The overall research content and implementation framework are illustrated in Fig. 1. The research is divided into three main stages, each highlighted in a different color to clearly illustrate the processes of composite health indicator construction, nonlinear degradation modeling, and remaining useful life prediction for the electric drive system.

Fig. 1
figure 1

Flow chart of the proposed research content and implementation framework.

Composite health indicator construction

HIs as representative parameters for evaluating the degradation state of systems or components, play a critical role in quantifying failure thresholds, monitoring service conditions, and supporting predictive maintenance. In complex systems with multiple components, the risk loads and failure mechanisms vary significantly among components under stochastic operating conditions. Therefore, constructing composite health indicators that effectively characterize the overall degradation state of complex systems is essential.

Consider a system comprising core components such as shafts, gears, bearings, stators, and rotors, which are subject to multiple failure modes, including fatigue failure of rotating mechanical components, aging of stator windings, and fatigue of rotor magnetic bridges. Lifetime analysis of these components can be performed based on their respective failure mechanisms38.

For mechanical components like shafts and gears, the Basquin equation is used to establish the relationship between stress and fatigue life. Equation (1) illustrates the shear stress caused by the torque on the motor shaft39.

$$\tau =\frac{{16T}}{{\pi {D^3}}}$$
(1)

where \(\tau\) is shear stress, T is motor torque, D is the motor shaft section diameter.

For gear components in EDS, the contact fatigue life of tooth surface caused by contact stress \({\sigma _H}\) and the bending fatigue life of tooth root generated by bending stress \({\sigma _F}\) are mostly considered. The above stress calculation approach can be determined by referring to ISO 633640. For the calculation of bearing life, the basic rating life \({L_{10}}\) of the bearing can be obtained based on the Lundberg-Palmgren theory41, which can be denoted as:

$${L_{10}}={(\frac{C}{P})^{\frac{{10}}{3}}}$$
(2)

where C is the basic dynamic load rating, P is the equivalent dynamic load. The calculation equation of equivalent dynamic load P is calculated as:

$$P={f_p}(X{F_r}+Y{F_a})$$
(3)

where \({f_p}\) is the load factor, X is the radial load factor, Y is the axial load factor, \({F_r}\) is the radial force, \({F_a}\) is the axial force.

To account for the risk of fatigue failure caused by significant centrifugal forces due to speed, the maximum stress of the bridge in the permanent magnet synchronous motor is calculated using the equivalent center of mass principle42, as follows:

$${\sigma _{\hbox{max} }}={\alpha _\sigma }R_{0}^{2}{\omega ^2}{\rho _{eq}}$$
(4)

where \({\alpha _\sigma }\) is stress concentration factor, \({R_0}\) is rotor equivalent radius (m), \({\rho _{eq}}\) is rotor equivalent density (kg/m^2), \(\omega\) is motor speed (rad/s).

The insulation aging failure of stator winding, which may occur during continuous high-temperature operation, can be addressed by using the Arrhenius equation to describe the aging life43.

$$\ln L=\frac{{{E_a}}}{{kT}} - G$$
(5)

where L represents the average insulation life (in hours), T represents the temperature of the insulation material (in Kelvin), k represents the Boltzmann constant (in eV/K), and Ea and G represent parameters related to the insulation material.

Under stochastic user conditions, the number of load cycles experienced by each component can be determined using methods such as the rainflow cycle counting or the joint distribution counting methods44. Rainflow cycle counting is typically used to extract alternating stress cycles for shaft-like components, while the joint distribution counting is more suitable for rotating parts such as gears and bearings, as it considers the impact of rotational speed on the cycle count. Assuming the operational data is divided into n segments, the damage incurred by component j in segment i can be calculated using Palmgren-Miner rule as follows:

$${D_{ji}}=\sum\nolimits_{{k=1}}^{{{N_k}}} {\frac{{{r_{i\_k}}}}{{{N_{i\_k}}}}}$$
(6)

where \({N_k}\) represents the number of stress levels, \({r_{i\_k}}\) denotes the rainflow cycle counting or rotational rainflow counting for segment i at stress level k, and \({N_{i\_k}}\) is the fatigue life at stress level k for segment i.

Assuming the failure threshold of component j over its full life cycle (e.g., 300,000 km) is \({\delta _j}\), the damage contribution \({d_{ji}}\) for component j during operational segment i is defined as:

$${d_{ji}}={D_{ji}}/{\delta _j}$$
(7)

where \(i=1,2,...,n\), \(j=1,2,...,m\).

To evaluate the overall damage contribution of the system under different failure modes, the entropy weight method is introduced to determine the weights of various factors during information fusion. The entropy weight method assigns weights based on the inherent attributes of the original data45, using information entropy to represent the degree of dispersion of an indicator. A smaller entropy value indicates greater dispersion of the indicator and, consequently, more valuable information. Assuming the normalized damage contribution matrix under multiple failure modes is \(d_{{ji}}^{z}\), the information entropy \({E_j}\) for the j-th indicator is defined as:

$${E_j}= - \frac{1}{{\ln n}}\sum\limits_{{i=1}}^{n} {{p_{ji}}} \cdot \ln ({p_{ji}})$$
(8)
$${p_{ji}}=d_{{ji}}^{z}/\sum\limits_{{i=1}}^{n} {d_{{ji}}^{z}}$$
(9)

where \({E_j} \in [0,1]\). Since entropy values and weights are inversely related, the weight coefficient for each feature is defined as:

$${\omega _j}=(1 - {E_j})/\sum\limits_{{j=1}}^{m} {( 1 - {E_j})}$$
(10)

In addressing the construction of degradation trajectory under coupled multiple failure modes, the entropy weight method is initially employed to fuse damage contribution metrics from multiple components, generating the composite health indicator and a linear degradation trajectory. Subsequently, this trajectory is refined based on actual degradation patterns observed in bench testing, yielding a nonlinear composite health indicator for the electric drive system. Let \({x_{p\_i}}\) represent the composite health indicator for the p-th (\(p=1,2, \cdots ,U\)) user during the i-th operational segment. The damage contribution of component j for the p-th user in segment i is denoted as \({d_{pj}}(i)\), while the weight coefficient \({\omega _j}\) reflects the proportional impact of the component j. The composite health indicator is expressed as:

$${x_{p\_i}}={{\mathbf{D}}_p}(i) \cdot {\mathbf{W}}=\sum\limits_{{j=1}}^{m} {{d_{pj}}(i) \cdot {\omega _j}}$$
(11)

where \({{\mathbf{D}}_p}(i)=[{d_{p1}}(i),{d_{p2}}(i), \cdots ,{d_{pm}}(i)]\),\({{\mathbf{D}}_p}(i) \in {\varvec{R}^{1 \times m}}\) denotes the damage contribution vector of the m-th component for the p-th user, and \({\mathbf{W}}={[{\omega _1},{\omega _2}, \cdots ,{\omega _m}]^{\text{T}}}\), \({\mathbf{W}} \in {\varvec{R}^{m \times 1}}\)represents the vector of weight coefficients.

Nonlinear degradation modeling

Model construction and degradation trajectory generation

Assuming that the cumulative impact of operating condition cycles on system degradation is negligible, and that degradation accumulates linearly across service cycles, the linear equation characterizing the degradation state of user p is formulated as follows:

$${y_{p\_i}}={k_p} \cdot \sum\nolimits_{{i=1}}^{n} {{x_{p\_i}}} +{b_p}$$
(12)

where \({k_p}\) represents the linear degradation rate of different users.

For complex systems such as the electric drive system in NEVs, multiple failure modes with components emerge under the influence of multi-physical field loads such as mechanical stress, thermal stress, and electromagnetic stress. the interaction of these failure modes leads to a nonlinear degradation trend in the system. Since actual degradation trajectories over the full life cycle are difficult to obtain, researchers typically rely on bench accelerated durability tests to monitor vibration signal features that reflect the evolution of degradation states. However, a key challenge lies in the fact that degradation rates observed during accelerated testing are significantly higher than those experienced under normal user conditions. Therefore, developing a method to transfer the degradation patterns observed in experimental conditions to actual operational environments is critical for ensuring prediction accuracy.

Assuming that the RMS of vibration signals monitored during the full life cycle in bench tests follows an exponential distribution, the generalized exponential model is formulated as:

$${f_i}=a\exp (b \cdot {s_i})$$
(13)

where \({s_i}(i=1,2,\; \cdots ,n)\) represents the RMS of the vibration signals, while a and b are fitting parameters.

To determine the fitting parameters a and b in the exponential model, the least squares method is applied. The objective function S is formulated to minimize the sum of squared residuals between the observed values and the predicted values. The parameters a and b are estimated using the following equation:

$$S=\hbox{min} \left( {\sum\limits_{{i=1}}^{c} {{{({f_i} - a{e^{b{s_i}}})}^2}} } \right)$$
(14)
$$b=\frac{{c\sum\limits_{{i=1}}^{c} {{s_i}} {f_i} - \sum\limits_{{i=1}}^{c} {{s_i}} \sum\limits_{{i=1}}^{c} {{f_i}} }}{{c\sum\limits_{{i=1}}^{c} {s_{i}^{2}} - {{(\sum\limits_{{i=1}}^{c} {{s_i}} )}^2}}}$$
(15)
$$a=\frac{1}{c}\left( {\sum\limits_{{i=1}}^{c} {{f_i}} - b\sum\limits_{{i=1}}^{c} {{s_i}} } \right)$$
(16)

To address the heterogeneity in user operating conditions and empirical observations from bench testing regarding degradation patterns, we develop a nonlinear exponential function to formulate the system degradation trajectory:

$${f_{p\_i}}=a\exp ({k_i}\delta \cdot t) \cdot {y_{p\_i}}+\varepsilon$$
(17)

where \({f_{p\_i}}\) represents the composite degradation indicator, a represents the model’s initial random parameter, \({k_i}\) is the degradation slope in the linear model, which characterizes the variation in degradation rates across different users. \(\delta\)is the correction factor that represents the relationship between the degradation rate under experimental conditions and that under user conditions, and \(\varepsilon \sim {\rm N}(0,{\sigma ^2})\) is the random noise in the degradation process.

To determine \(\delta\), bench tests are performed under both accelerated durability conditions and non-accelerated user conditions, with vibration signals from the electric drive system monitored during the tests. These signals are used to evaluate the acceleration factor m between the accelerated test spectrum and the user’s non-accelerated spectrum. The degradation trajectories of the electric drive system under both accelerated and non-accelerated spectra are illustrated in Fig. 2.

Fig. 2
figure 2

Schematic of degradation trajectories under accelerated and non-accelerated spectra.

Assuming the degradation equation under the accelerated test spectrum is \({f_{\text{A}}}(t)\) and the degradation equation under the non-accelerated user spectrum is \({f_{\text{B}}}(t)\), the acceleration factor and the coefficient \(\delta\)can be expressed as follows:

$$\left\{ \begin{gathered} \ln ({f_{\text{A}}}(t))=\ln ({a_{\text{A}}})+{b_{\text{A}}}t \hfill \\ \ln ({f_{\text{B}}}(t))=\ln ({a_{\text{B}}})+{b_{\text{B}}}t \hfill \\ \frac{{\ln ({f_{\text{A}}}(t+\Delta )) - \ln ({f_{\text{A}}}(t))}}{{\ln ({f_{\text{B}}}(t+\Delta )) - \ln ({f_{\text{B}}}(t))}}=\frac{{{b_{\text{A}}}}}{{{b_{\text{B}}}}}=m \hfill \\ \delta ={b_{\text{B}}}={b_{\text{A}}}/m \hfill \\ \end{gathered} \right.$$
(18)

Based on the nonlinear degradation trajectory, the user degradation model is constructed. Considering the inherent uncertainty introduced by the use uncertainty of different users and the performance difference of the prototype in the degradation process of the electric drive system, the exponential model is used to describe the cumulative process of the degradation state of the electric drive system. Assuming that the discrete time monitoring point is \({t_s},s=1,2, \cdots ,L\), the degradation amount at time \({t_s}\) can be described as:

$$Y({t_s})=\varphi +\theta \exp (\beta {t_s}+\varepsilon ({t_s}) - \frac{{{\sigma ^2}}}{2})$$
(19)

where \(\varphi\) is the constant representing the intercept of the model, \(\theta\)and \(\beta\) are random parameters that determine the slope of the model and captures the variability in usage conditions and individual differences among users. It’s assumed that \(\ln \theta\) follows a normal distribution with mean \({\mu _1}\) and variance \(\sigma _{1}^{2}\), \(\ln \theta \sim N({\mu _1},\sigma _{1}^{2})\),\(\beta \sim N({\mu _2},\sigma _{2}^{2})\). The \(\varepsilon ({t_s})\) are the stochastic error terms which express inherent uncertainties due to monitoring errors on the degradation data from the test machine, \(\varepsilon ({t_s})\sim N(0,{\sigma ^2})\) and \(\varepsilon (0)=0\). Assuming they are mutually independent and identically distributed, \({{ - {\sigma ^2}} \mathord{\left/ {\vphantom {{ - {\sigma ^2}} 2}} \right. \kern-0pt} 2}\) is defined such that the expected value of the degradation state \(Y({t_s})\) equals\(E[Y({t_s})|\theta ,\beta ]=\varphi +\theta \exp (\beta {t_s})\).

$$\begin{gathered} F({t_s})=\ln (Y({t_s}) - \varphi ) \hfill \\ {\text{ }}=\ln \theta - \frac{{{\sigma ^2}}}{2}+\beta {t_s}+\varepsilon ({t_s}) \hfill \\ {\text{ }}=\lambda +\beta {t_s}+\varepsilon ({t_s}) \hfill \\ \end{gathered}$$
(20)

where\(\lambda =\ln \theta - {{{\sigma ^2}} \mathord{\left/ {\vphantom {{{\sigma ^2}} 2}} \right. \kern-0pt} 2}\), \(\lambda \sim N({\mu _1} - {{{\sigma ^2}} \mathord{\left/ {\vphantom {{{\sigma ^2}} 2}} \right. \kern-0pt} 2},\sigma _{1}^{2})\),\(\mu _{1}^{\prime }={\mu _1} - {{{\sigma ^2}} \mathord{\left/ {\vphantom {{{\sigma ^2}} 2}} \right. \kern-0pt} 2}\). such that \(\lambda\) follows a Gaussian distribution. When considering \(\varphi =0\), the degenerate quantity\(F({t_s})=\ln (Y({t_s})\).

Parameter Estimation with the EM algorithm

In nonlinear degradation models with unknown parameters \({\mathbf{\Theta }}=\{ \mu _{1}^{\prime },\sigma _{1}^{2},{\mu _2},\sigma _{2}^{2},{\sigma ^2}\}\), prior to updating the model parameters, it is essential to integrate prior distributions with historical monitoring data for parameter estimation, which subsequently updates the posterior distribution of the stochastic parameters. Considering the randomness of latent variables \(\lambda\) and \(\beta\) as historical data updates, the expectation-maximization (EM) algorithm is employed to estimate the parameter set \({\mathbf{\Theta }}\)46. First, the expectation step computes the expected value using the current estimates of the latent variables. Next, the maximization step updates the parameter estimates by maximizing the complete log-likelihood function. Define the degradation indicator at time \({t_s}\) as \({{\mathbf{F}}_{1:s}}=\{ F({t_1}),F({t_2}), \cdots ,F({t_s})\}\), with the estimated parameter is \({{\mathbf{\Theta }}_s}=\{ \mu _{{1,s}}^{\prime },\sigma _{{1,s}}^{2},{\mu _{2,s}},\sigma _{{2,s}}^{2},\sigma _{s}^{2}\}\). after the n-th iteration, the parameter estimate is represented as \({\mathbf{\hat {\Theta }}}_{s}^{{(n)}}=\{ \hat {\mu }_{{1,s}}^{{'(n)}},\hat {\sigma }_{{1,s}}^{{2(n)}},\hat {\mu }_{{2,s}}^{{(n)}},\hat {\sigma }_{{2,s}}^{{2(n)}},\hat {\sigma }_{s}^{{2(n)}}\}\). In the expectation step, the complete log-likelihood function can be expressed as:

$$\begin{gathered} L({{\mathbf{\Theta }}_s}|{\mathbf{\hat {\Theta }}}_{s}^{{(n)}})={E_{\lambda ,\beta |{F_{1:s}},{\mathbf{\hat {\Theta }}}_{s}^{{(n)}}}}[ln({{\mathbf{F}}_{1:s}},\lambda ,\beta |{{\mathbf{\Theta }}_s})] \hfill \\ =\ln p(\begin{array}{*{20}{c}} {{{\mathbf{F}}_{1:s}}} \end{array}| \lambda ,\beta ,{{\mathbf{\Theta }}_s})+\ln p(\lambda ,\beta |{{\mathbf{\Theta }}_s}) \hfill \\ = - \frac{{s+2}}{2}{\text{ln}}2{\pi ^2} - \frac{s}{2}{\text{ln}}\sigma _{s}^{2} - \sum\limits_{{j=1}}^{s} {\frac{{{{({F_j} - \lambda - \beta {t_j})}^2}}}{{2\sigma _{s}^{2}}}} \hfill \\ - \frac{1}{2}{\text{ln}}\sigma _{{1,s}}^{2} - \frac{1}{2}{\text{ln}}\sigma _{{2,s}}^{2} - \frac{{{{(\lambda - \mu _{{1,s}}^{\prime })}^2}}}{{2\sigma _{{1,s}}^{2}}} - \frac{{{{(\beta - {\mu _{2,s}})}^2}}}{{2\sigma _{{2,s}}^{2}}} \hfill \\ \end{gathered}$$
(21)

By maximizing the complete log-likelihood function, the estimated value after the (n + 1)-th iteration is obtained as follows:

$${\mathbf{\hat {\Theta }}}_{s}^{{(n+1)}}=\operatorname{argmax} L({{\mathbf{\Theta }}_s}|{\mathbf{\hat {\Theta }}}_{s}^{{(n)}})$$
(22)

The first-order partial derivative with respect to parameter \({{\mathbf{\Theta }}_s}\) is calculated based on Eq. (23) as follows:

$$\frac{{\partial L({{\mathbf{\Theta }}_s}|{\mathbf{\hat {\Theta }}}_{s}^{{(n)}})}}{{\partial {{\mathbf{\Theta }}_s}}}=0$$
(23)

The estimated result for parameter \({\mathbf{\hat {\Theta }}}_{s}^{{(n+1)}}\)after the (n + 1)-th iteration is given by:

$$\hat {\sigma }_{s}^{{2(n+1)}}=\frac{1}{s}\sum\limits_{{j=1}}^{s} {(F_{j}^{2} - 2{F_j}({\mu _{\lambda ,s}}+{\mu _{\beta ,s}})+\mu _{{\lambda ,s}}^{2}+\sigma _{{\lambda ,s}}^{2}} +2{t_j}({\rho _s}{\sigma _{\lambda ,s}}{\sigma _{\beta ,s}}+{\mu _{\lambda ,s}}{\mu _{\beta ,s}})+t_{j}^{2}(\mu _{{\beta ,s}}^{2}+\sigma _{{\beta ,s}}^{2}))$$
(24)
$$\left\{ \begin{gathered} \hat {\mu }_{{1,s}}^{{'(n+1)}}{\text{=}}{\mu _{\lambda ,s}} \hfill \\ \hat {\sigma }_{{1,s}}^{{2(n+1)}}{\text{=}}\sigma _{{\lambda ,s}}^{2} \hfill \\ \hat {\mu }_{{2,s}}^{{(n+1)}}{\text{=}}{\mu _{\beta ,s}} \hfill \\ \hat {\sigma }_{{2,s}}^{{2(n+1)}}{\text{=}}\sigma _{{\beta ,s}}^{2} \hfill \\ \end{gathered} \right.$$
(25)

where \({\mu _{\lambda ,s}}\) and \(\sigma _{{\lambda ,s}}^{2}\) represent the mean and variance of the posterior distribution of parameter \(\lambda\), while \({\mu _{\beta ,s}}\) and \(\sigma _{{\beta ,s}}^{2}\) denote the mean and variance of the posterior distribution of parameter \(\beta\), and \({\rho _s}\) is the correlation coefficient. when the estimation error \(\left\| {\left. {{\mathbf{\hat {\Theta }}}_{s}^{{(n+1)}} - {\mathbf{\hat {\Theta }}}_{s}^{{(n)}}} \right\|} \right.\) falls below a predefined threshold, the parameter estimation results at time \({t_s}\)is output.

Parameter updates based on bayesian theory

For the parameter updates of \(\lambda\) and \(\beta\), the posterior distribution is computed using Bayesian theory based on prior distributions and historical observations. Given that \(\lambda\), \(\beta\), and \(\varepsilon ({t_s})\) are mutually independent and \(\varepsilon ({t_1}),\varepsilon ({t_2}), \cdots ,\varepsilon ({t_{\text{L}}})\) is an independently and identically distributed random variable, the Bayesian process utilizes the degradation index \({{\mathbf{F}}_{1:s}}=[{F_1},{F_2}, \cdots ,{F_s}]\) as prior information to perform posterior estimation of the model parameters, enabling real-time updates of the joint posterior distribution of the random parameter \((\lambda ,\beta )\). The joint probability density function of \({{\mathbf{F}}_{1:s}}\) is given as follows:

$$p({F_1},{F_2}, \ldots ,{F_s}|\lambda ,\beta )={\left( {\frac{1}{{\sqrt {2\pi {\sigma ^2}} }}} \right)^s} \times \exp \left( { - \sum\limits_{{j=1}}^{s} {\left( {\frac{{{{({F_j} - \lambda - \beta {t_j})}^2}}}{{2{\sigma ^2}}}} \right)} } \right)$$
(26)

where the joint posterior distribution of \(\lambda\) and \(\beta\) given \({{\mathbf{F}}_{1:s}}=[{F_1},{F_2}, \cdots ,{F_s}]\) follows a Gaussian distribution, \(\lambda ,\beta |{{\mathbf{F}}_{1:s}}\sim N({\mu _{\lambda ,s}},\sigma _{{\lambda ,s}}^{2},{\mu _{\beta ,s}},\sigma _{{\beta ,s}}^{2},{\rho _s})\).

Assuming the prior joint bivariate normal distribution of the random parameter \((\lambda ,\beta )\) is \(p(\lambda ,\beta )\), the posterior joint density function is expressed as:

$$\begin{gathered} p(\lambda ,\beta |{{\mathbf{F}}_{1:s}}) \propto p({{\mathbf{F}}_{1:s}}|\lambda ,\beta ) \cdot p(\lambda ,\beta ) \hfill \\ \propto \frac{1}{{2\pi {\sigma _{\lambda ,s}}{\sigma _{\beta ,s}}\sqrt {1 - \rho _{s}^{2}} }} \times \exp \left\{ {\left. {\frac{1}{{2(1 - \rho _{s}^{2})}} \cdot \left[ {\frac{{{{(\lambda - {\mu _{\lambda ,s}})}^2}}}{{\sigma _{{\lambda ,s}}^{2}}} - 2{\rho _s}\frac{{(\lambda - {\mu _{\lambda ,s}})(\beta - {\mu _{\beta ,s}})}}{{\sigma _{{\lambda ,s}}^{2}\sigma _{{\beta ,s}}^{2}}}+\frac{{{{(\beta - {\mu _{\beta ,s}})}^2}}}{{\sigma _{{\beta ,s}}^{2}}}} \right]} \right\}} \right. \hfill \\ \end{gathered}$$
(27)

Through algebraic manipulation, the expression for parameter \({\mu _{\lambda ,s}},\sigma _{{\lambda ,s}}^{2},{\mu _{\beta ,s}},\sigma _{{\beta ,s}}^{2},{\rho _s}\) in Eqs. (28), (29), (30), (31), and (32) is derived as follows:

$${\mu _{\lambda ,s}}=\frac{{\left( {\sum\limits_{{j=1}}^{s} {{F_j}} \sigma _{1}^{2}+\mu _{1}^{\prime }{\sigma ^2}} \right)\left( {\sum\limits_{{j=1}}^{s} {t_{j}^{2}} \sigma _{2}^{2}+{\sigma ^2}} \right) - \left( {\sum\limits_{{j=1}}^{s} {{t_j}} \sigma _{1}^{2}} \right)\left( {\sum\limits_{{j=1}}^{s} {{F_j}} {t_j}\sigma _{2}^{2}+{\mu _2}{\sigma ^2}} \right)}}{{\left( {s\sigma _{1}^{2}+{\sigma ^2}} \right)\left( {\sum\limits_{{j=1}}^{s} {t_{j}^{2}} \sigma _{2}^{2}+{\sigma ^2}} \right) - \left( {\sum\limits_{{j=1}}^{s} {{t_j}} \sigma _{2}^{2}} \right)\left( {\sum\limits_{{j=1}}^{s} {{t_j}} \sigma _{1}^{2}} \right)}}$$
(28)
$${\mu _{\beta ,s}}=\frac{{\left( {s\sigma _{1}^{2}+{\sigma ^2}} \right)\left( {\sum\limits_{{j=1}}^{s} {{F_j}} {t_j}\sigma _{2}^{2}+{\mu _2}{\sigma ^2}} \right) - \left( {\sum\limits_{{j=1}}^{s} {{t_j}} \sigma _{2}^{2}} \right)\left( {\sum\limits_{{j=1}}^{s} {{F_j}} \sigma _{1}^{2}+\mu _{1}^{\prime }{\sigma ^2}} \right)}}{{\left( {s\sigma _{1}^{2}+{\sigma ^2}} \right)\left( {\sum\limits_{{j=1}}^{s} {t_{j}^{2}} \sigma _{2}^{2}+{\sigma ^2}} \right) - \left( {\sum\limits_{{j=1}}^{s} {{t_j}} \sigma _{2}^{2}} \right)\left( {\sum\limits_{{j=1}}^{s} {{t_j}} \sigma _{1}^{2}} \right)}}$$
(29)
$$\sigma _{{\lambda ,s}}^{2}=\frac{{{{\bar {\sigma }}^2}}}{{\sigma _{2}^{2}}}\frac{{\sum\limits_{{j=1}}^{s} {t_{j}^{2}} \sigma _{2}^{2}+{\sigma ^2}}}{{\left( {s\sigma _{1}^{2}+{\sigma ^2}} \right)\left( {\sum\limits_{{j=1}}^{s} {t_{j}^{2}} \sigma _{2}^{2}+{\sigma ^2}} \right) - {{\left( {\sum\limits_{{j=1}}^{s} {{t_j}} } \right)}^2}\sigma _{1}^{2}\sigma _{2}^{2}}}$$
(30)
$$\sigma _{{\beta ,s}}^{2}=\frac{{{{\bar {\sigma }}^2}}}{{\sigma _{1}^{2}}}\frac{{s\sigma _{1}^{2}+{\sigma ^2}}}{{\left( {s\sigma _{1}^{2}+{\sigma ^2}} \right)\left( {\sum\limits_{{j=1}}^{s} {t_{j}^{2}} \sigma _{2}^{2}+{\sigma ^2}} \right) - {{\left( {\sum\limits_{{j=1}}^{s} {{t_j}} } \right)}^2}\sigma _{1}^{2}\sigma _{2}^{2}}}$$
(31)
$${\rho _s}=\frac{{ - {\sigma _1}{\sigma _2}\sum\limits_{{j=1}}^{s} {{t_j}} }}{{\sqrt {s\sigma _{1}^{2}+{\sigma ^2}} \sqrt {\sigma _{2}^{2}\sum\limits_{{j=1}}^{s} {t_{j}^{2}} +{\sigma ^2}} }}$$
(32)

where \({\bar {\sigma }^2}={\sigma ^2}\sigma _{1}^{2}\sigma _{2}^{2}\), \({\mu _{\lambda ,s}}\), and \(\sigma _{{\lambda ,s}}^{2}\) denote the mean and variance of the posterior distribution of parameter \(\lambda\), while \({\mu _{\beta ,s}}\) and \(\sigma _{{\beta ,s}}^{2}\) represent the mean and variance of the posterior distribution of parameter \(\beta\).

Remaining useful life prediction model

Model construction

Based on the degradation model parameter estimation and updates, the degradation degree \(F({t_s}+t)\) at time \({t_s}+t\) can be predicted using the historical degradation data \({{\mathbf{F}}_{1:s}}\), and \(F({t_s}+t)=\lambda +\beta ({t_s}+t)+\varepsilon ({t_s}+t) - {\sigma ^2}/2\). As the predicted degradation degree follows a Gaussian distribution, its mean and variance are given by the following expressions:

$$\tilde {\mu }({t_s}+t)={\mu _\lambda }+{\mu _\beta }({t_s}+t) - \frac{{{\sigma ^2}}}{2}$$
(33)
$${\tilde {\sigma }^2}({t_s}+t)=\sigma _{\lambda }^{2}+{({t_s}+t)^2}\sigma _{\beta }^{2}+{\sigma ^2}+2{\rho _s}({t_s}+t){\sigma _\lambda }{\sigma _\beta }$$
(34)

The RUL L is defined as the time between the degradation degree reaching the given failure threshold \(\delta\). Since \(F({t_s})\) is the degradation model after a logarithmic transformation, L should satisfy \(F(L+{t_s})=\ln \delta\). Therefore, the conditional cumulative distribution function of the RUL can be expressed as:

$$\begin{gathered} {R_{L|{{\mathbf{F}}_{1:s}}}}(t)=P(L \leqslant t|{F_1},{F_2}, \cdots {F_s}) \hfill \\ =P(F({t_s}+t) \geqslant \ln \delta |{F_1},{F_2}, \cdots {F_s}) \hfill \\ =P\left( {Z \geqslant \frac{{\ln \delta - \tilde {\mu }({t_s}+t)}}{{\sqrt {{{\tilde {\sigma }}^2}({t_s}+t)} }}} \right)={\mathbf{\Phi }}(g(t)) \hfill \\ \hfill \\ \end{gathered}$$
(35)
$$g(t)=\frac{{\tilde {\mu }(t+{t_s}) - \ln \delta }}{{\sqrt {{{\tilde {\sigma }}^2}({t_s}+t)} }}$$
(36)

where Z is a random variable following the standard normal distribution, and \({\mathbf{\Phi }}(\cdot )\) is the cumulative distribution function of the standard normal distribution.

Since the RUL is non-negative (\(L \geqslant 0\)), the actual conditional cumulative distribution function is given by:

$$\begin{gathered} {R_{L|{{\mathbf{F}}_{1:s}}}}(t)=P(L \leqslant t|{F_1},{F_2}, \cdots {F_s},L \geqslant 0) \hfill \\ =\frac{{P\{ 0 \leqslant L \leqslant t|{F_1},{F_2}, \cdots {F_s}\} }}{{P\{ L \geqslant 0|{F_1},{F_2}, \cdots {F_s}\} }} \hfill \\ =\frac{{{\mathbf{\Phi }}(g(t)) - {\mathbf{\Phi }}(g(0))}}{{1 - {\mathbf{\Phi }}(g(0))}} \hfill \\ \end{gathered}$$
(37)

Further derivation leads to the conditional probability density function of the RUL as:

$${f_{L|{F_1},{F_2}, \cdots {F_s},L \geqslant 0}}(t)=\frac{{\psi (g(t)){g^\prime }(t)}}{{1 - {\mathbf{\Phi }}(g(0))}}$$
(38)

In Eq. (38), \(\psi (\cdot )\) represents the probability density function of a standard normal distribution. For the RUL estimation, based on Eq. (33), the predicted mean \(\tilde {\mu }({t_s}+t)\) from the degradation state is used in place of \(F(L+{t_s})\). Then, combining the failure threshold \(F(L+{t_s})=\ln \delta\), the expected RUL can be expressed as:

$$RU{L_s}=\frac{{\ln \delta - {\mu _{\lambda ,s}}+{\sigma ^2}/2}}{{{\mu _{\beta ,s}}}} - {t_s}$$
(39)

Evaluation metrics

To comprehensively evaluate the prediction performance of the proposed degradation modeling and RUL prediction framework, three evaluation metrics are employed: root mean square error (RMSE), mean absolute error (MAE), and confidence interval coverage (CIC). These metrics jointly assess both the accuracy and reliability of the predicted results.

RMSE quantifies the average magnitude of prediction errors by penalizing larger deviations more heavily. it provides an overall measure of the prediction accuracy and is expressed as:

$${\text{RMSE}}=\sqrt {\frac{1}{N}\sum\nolimits_{{i=1}}^{N} {{{({{\hat {y}}_i} - {y_i})}^2}} }$$
(40)

where \({y_i}\) denotes the true value, \({\hat {y}_i}\) represents the predicted value, and N is the number of samples.

MAE captures the average absolute difference between predicted and true values, reflecting the robustness of the model against outliers. the metric is defined as:

$${\text{MAE}}=\frac{1}{N}\sum\nolimits_{{i=1}}^{N} {\left| {{{\hat {y}}_i} - {y_i}} \right|}$$
(41)

CIC evaluates the reliability of the predicted confidence intervals by measuring the proportion of true values that fall within the estimated bounds. It is calculated as:

$${\text{CIC}}=\frac{1}{N}\sum\nolimits_{{i=1}}^{N} {F({y_i} \in [{L_i},{U_i}])}$$
(42)

where \([{L_i},{U_i}]\) represents the lower and upper bounds of the prediction interval for the i-th sample, and \(F( \cdot )\) is the indicator function.

In this study, the pseudocode for the nonlinear degradation modeling and RUL prediction algorithm for the electric drive system is presented as Algorithm 1. The algorithm incorporates input data including user operational data, bench test data, model initialization parameters, and failure thresholds, integrating failure physics models for different components to enable RUL predictions across various service stages under real-world user scenarios.

figure a

.

Case study

Degradation trajectory modeling based on bench test correction

The datasets used in this study consists of operational data from 30 vehicles of the same model, covering one year of driving behavior and road conditions from different users. The schematic representation of user time-domain data is illustrated in Fig. 3. Furthermore, statistics are compiled for each user, including total operating time, annual mileage, cumulative damage contribution, and damage contribution rate, as illustrated in Fig. 4.

Fig. 3
figure 3

Schematic diagram of user time-domain data

Fig. 4
figure 4

Statistical characteristics of different user data.

In Fig. 4, the cumulative damage contribution is defined as the ratio of a user’s composite health indicator to the total composite health indicator across all users. The damage contribution rate is calculated by dividing a user’s cumulative damage contribution by their total mileage.

The composite health indicator integrates the effects of multiple failure modes and reflects the overall degradation trend of the system. This study employs the entropy weight method to linearly fuse damage contribution features across different failure modes. Additionally, a comparative analysis of weighting method variations is conducted, utilizing the coefficient of variation method and the CRITIC weighting method to construct composite health indicators47. Linear fitting is applied to the resulting composite health indicators, yielding the slope of the linear fitting equation for each user under different methods, which represents the linear degradation rate, as depicted in Fig. 5. The result reveals that the degradation rates of different users are highly consistent across the various methods, indicating that the choice of weighting method has a minimal impact on the composite health indicator.

Fig. 5
figure 5

Comparison of composite health indicator results across different weighting methods; (a) Entropy weight method; (b) Coefficient of variation method; (c) CRITIC weighting method; (d) Comparison of linear degradation rates.

How to refine the aforementioned composite health indicator into a degradation trajectory that aligns with actual degradation patterns is of paramount importance. To address this, an accelerated durability test of the EDS is conducted, during which vibration signals from the test prototype are monitored to characterize the evolution of its degradation state. Furthermore, a comparative analysis between the degradation rates under user non-accelerated conditions and bench accelerated conditions is performed to establish an equivalence relationship, which is then used to correct the nonlinear degradation model under real-world user conditions.

For the accelerated durability test of the electric drive system, a load spectrum is developed based on real-world user operation data. Simulated load signals are applied to both ends of the electric drive assembly using the dynamometer to replicate typical user conditions. The test bench setup and a single-cycle load spectrum are shown in Fig. 5.

Fig. 6
figure 6

Electric drive system test bench and load spectrum; (a) Test bench; (b) Durability test load spectrum.

Each cycle lasts 12 h and is repeated 100 cycles to replicate the equivalent damage corresponding to 300,000 km of user operation, resulting in a total test duration of 1,200 h. Throughout the test, key parameters such as rotational speed, torque, voltage, current, oil temperature, and vibration are continuously monitored to track the degradation process.

In this study, the vibration signals of the prototype monitored during the bench tests are used to indirectly represent its degradation trajectory. The RMS of the vibration signals from each test cycle is extracted to establish the system’s degradation model. Considering the impact of inherent uncertainties, such as material properties, manufacturing processes, and assembly errors, on the lifespan of the prototype, accelerated durability tests are conducted on three identical prototypes to quantify these uncertainties. As the number of cycles increases, the degradation trends and fitting models for each prototype are shown in Figs. 7 and 8, and 9. The RMS values of the vibration signals from the three prototypes are normalized to the [0,1] range, and an exponential model is used to fit the RMS values of the vibration signals. The overall degradation exhibits nonlinear accumulation, and the degradation rate increases non-stationarily. The fitting formula and the 95% confidence interval of the fitting parameters are as follows:

$$\left\{ \begin{gathered} {f_1}(x)={a_1} \cdot \exp ({b_1} \cdot x)=0.1501 \cdot \exp (0.0182 \cdot x) \hfill \\ {f_2}(x)={a_2} \cdot \exp ({b_2} \cdot x)=0.0964 \cdot \exp (0.0237 \cdot x) \hfill \\ {f_3}(x)={a_3} \cdot \exp ({b_3} \cdot x)=0.1536 \cdot \exp (0.0186 \cdot x) \hfill \\ {a_1} \in [0.1222,0.1780];{b_1} \in [0.0163,0.0201] \hfill \\ {a_2} \in [0.0715,0.1214];{b_2} \in [0.0216,0.0258] \hfill \\ {a_3} \in [0.1271,0.1801];{b_3} \in [0.0168,0.0204] \hfill \\ \end{gathered} \right.$$
(43)

Based on the normal distribution characteristics of the fitting parameters, the Monte Carlo method is used to randomly sample the distribution of the fitting parameters, generating 30 degradation datasets. The initial values are normalized, and the degradation indicator distribution model corresponding to the full life cycle is used to quantify the inherent uncertainty of the prototype, as shown in Fig. 10. The results indicate that the accelerated durability test effectively defines the range of inherent uncertainty in the random fluctuations of the degradation indicator. However, in real-world user applications, it is crucial to consider the correlation between the acceleration factor of the accelerated test spectrum and the actual user load spectrum.

Fig. 7
figure 7

The degradation trajectory fitting of prototype 1

Fig. 8
figure 8

The degradation trajectory fitting of prototype 2.

Fig. 9
figure 9

The degradation trajectory fitting of prototype 3

Fig. 10
figure 10

The quantification of the inherent uncertainty of prototypes.

Given that the degradation rate under real-world operating conditions is significantly lower than that in accelerated tests, a dual-prototype comparative test is conducted in this study. Prototype A is subjected to the user’s non-accelerated load spectrum, while prototype B is exposed to the accelerated test load spectrum. The total test duration under the accelerated spectrum is 875 h, aimed at replicating the damage target corresponding to 300,000 km of user operation. The accelerated spectrum consists of 350 cycles, with vibration signals recorded every 10 cycles. Vibration acceleration signals in the Y-axis of the bearing, Y-axis of the differential, and Z-axis of the gearbox are monitored simultaneously, as shown in Fig. 11.

To quantify the relationship between the degradation rates of the accelerated and user spectra, the acceleration factor is defined, as the ratio of the linear fit slope of the vibration signal under the accelerated spectrum to the linear fit slope of the vibration signal under the non-accelerated load spectrum48. The comparison of the three-axis vibration signals is shown in Figs. 12 and 13, and 14. The average acceleration factor of the three directions is 8.7, which is used as the overall acceleration factor and substituted into Eq. (18) to correct the degradation model parameters. The standard deviation ratios for vibration acceleration (VA) of the three directions are also calculated, and by combining the standard deviation of the failure threshold in the accelerated test from Fig. 8, the random error term under user conditions is derived to follow \(N(0,{0.0146^2})\).

Fig. 11
figure 11

Test bench schematic of the EDS.

Fig. 12
figure 12

Comparison of VA at the bearing Y-direction.

Fig. 13
figure 13

Comparison of VA at the differential Y-direction.

Fig. 14
figure 14

Comparison of VA at the gearbox Z-direction.

To account for the variability in degradation rates among different users, the overall user failure threshold is defined based on the damage target replicated in the bench test. The user sample set is selected corresponding to the 50th percentile of cumulative damage contribution rates from Fig. 4. Using damage equivalence and factor extrapolation, the user failure threshold \({\delta _T}=1.65\) at 50th percentile is determined. Subsequently, nonlinear degradation trajectories incorporating random errors are generated for different users, as shown in Fig. 15.

Fig. 15
figure 15

Nonlinear degradation trajectories of different users.

Model parameter Estimation and dynamic updating

This section selects four representative users corresponding to the 25th, 50th, 75th, and 95th percentiles of the cumulative damage rate distribution (damage rate - cumulative distribution function, Ds-CDF) to perform model parameter estimation. Based on the nonlinear degradation trajectories derived in Sect. 5.1, it is essential to first specify the prior distribution of the model parameters before performing parameter estimation and dynamic updating.

In this study, the user at the 50th percentile of the Ds-CDF is selected as the target subject. For the prior distribution parameters \({\mathbf{\Theta }}=\{ \mu _{1}^{\prime },\sigma _{1}^{2},{\mu _2},\sigma _{2}^{2},{\sigma ^2}\}\), the initial ranges are determined based on historical degradation data and bench test results, with \(\mu _{1}^{\prime } \in [0.05,1]\),\(\sigma _{1}^{2} \in [0.01,0.5]\),\({\mu _2} \in [0.05,1]\),\(\sigma _{2}^{2} \in [0.01,0.5]\), \({\sigma ^2} \in [0.001,0.01]\). From each range, 10 evenly spaced sample points are selected, resulting in a total of 100,000 parameter sets. Each parameter set is used as input for model parameter estimation and Bayesian updating, followed by RUL prediction. To evaluate the influence of different hyperparameter combinations on the prediction results, the prediction accuracy is assessed using the RMSE. The optimal parameter combination is selected when the RMSE reached its minimum value. Among the 100,000 tested combinations, the best set is found to be \(\mu _{1}^{\prime }=0.05\), \(\sigma _{1}^{2}=0.5\), \({\mu _2}=0.05\), \(\sigma _{2}^{2}=0.01\), \({\sigma ^2}=0.002\).

To further analyze the influence of individual parameters, the control variable method is applied. When keeping \(\sigma _{1}^{2},\sigma _{2}^{2},{\sigma ^2}\) constant and varying the initial mean values of parameters \(\lambda\) and \(\beta\), the corresponding prediction errors are shown in Fig. 16. Similarly, by keeping \(\mu _{1}^{\prime },{\mu _2},{\sigma ^2}\) constant and varying the initial variances of parameters \(\lambda\) and \(\beta\), the sensitivity of variance selection is analyzed as shown in Fig. 17. These results indicate that the choice of prior parameter values significantly affects RUL prediction performance. In particular, the sensitivity analysis of the initial mean values of the parameters shows that the lowest RMSE is achieved when \(\mu _{1}^{\prime }=0.05\), \({\mu _2}=0.05\). For sensitivity analysis of the initial variance values of the parameters, the lowest RMSE is observed when \(\sigma _{1}^{2}=0.5\), \(\sigma _{2}^{2}=0.01\). Since the noise variance \({\sigma ^2}\) is closely related to the actual degradation process, its prior information is derived from user degradation trajectories, and the optimal value is identified as \({\sigma ^2}=0.002\). Additionally, for other users, the aforementioned method is similarly applied to conduct a sensitivity analysis of the model’s prior distribution parameters, ensuring the accuracy of RUL predictions.

Fig. 16
figure 16

Sensitivity analysis of the initial mean values of parameters \(\lambda\) and \(\beta\).

Fig. 17
figure 17

Sensitivity analysis of the initial variance values of parameters \(\lambda\) and \(\beta\).

In addition, the influence of the EM iteration threshold \(\omega\) on RUL prediction is examined. The iteration stopping criterion is defined as \(\left\| {\left. {{\mathbf{\hat {\Theta }}}_{s}^{{(n+1)}} - {\mathbf{\hat {\Theta }}}_{s}^{{(n)}}} \right\|} \right.<\omega\), with the threshold range set to [1E-10,1E-02]. The RMSE variation under different thresholds is presented in Fig. 18. Results show that large values of \(\omega\) lead to higher prediction errors, while decreasing \(\omega\)results in progressively lower RMSE values. when the threshold reaches 1E-06, the RMSE stabilizes at its minimum level. Hence, the iteration threshold in this study is set to \(\omega ={\text{1E-06}}\).

Fig. 18
figure 18

Variation trend of RMSE under different iteration thresholds.

The user’s full life cycle data is divided into 100 operational cycles. For each cycle, model parameters are estimated, including the mean \(\mu _{1}^{\prime }\) and variance \(\sigma _{1}^{2}\) of parameter \(\lambda\), the mean \({\mu _2}\) and variance \(\sigma _{2}^{2}\) of parameter \(\beta\), and the estimated correlation coefficient \({\rho _s}\) between parameters \(\lambda\) and \(\beta\). The true values of the parameters are defined as the converged values of the degradation model parameters for the target user. As more historical degradation data becomes available, the model parameters are dynamically updated, with the results shown in Fig. 19. It is observed that as the number of iterations increases, the variation in parameter estimates gradually decreases and tends toward stability. In particular, the variances of parameters \(\lambda\) and \(\beta\) show minimal changes after approximately 30 iterations. Therefore, a logarithmic scale is used to present the variance update process, allowing for a clearer illustration of the convergence behavior.

Fig. 19
figure 19

Parameter estimation results for the target user’s degradation model; (a) Mean \(\mu _{1}^{\prime }\) estimation of parameter \(\lambda\); (b) Variance \(\sigma _{1}^{2}\) estimation of parameter \(\lambda\); (c) Mean \({\mu _2}\) estimation of parameter \(\beta\); (d) Variance \(\sigma _{2}^{2}\) estimation of parameter \(\beta\); (e) Correlation coefficient \({\rho _s}\) estimation between parameters \(\lambda\) and \(\beta\).

The final degradation model parameters for the target user are as follows: \(\mu _{1}^{\prime }=0.0237\), \(\sigma _{1}^{2}={\text{1}}{\text{.869E-04}}\), \({\mu _2}=0.0103\), \(\sigma _{2}^{2}={\text{6}}{\text{.9213E-08}}\), \({\rho _s}={\text{-0}}{\text{.8684}}\). Following the above analytical procedure, parameter estimation is performed for four representative users at the 25th, 50th, 75th, and 95th percentiles of cumulative damage rate. The parameters of the exponential degradation model for each user are iteratively updated, and the final estimation results are summarized in Table 1.

Table 1 Estimated parameters of the exponential degradation model for different users.

Remaining useful life prediction and result evaluation

To elucidate the evolution of degradation states, this study examines the degradation process across the full user life cycle by selecting the 25th, 50th, 75th, and 95th percentiles of service cycles. It analyzes the performance degradation of the electric drive system and the dynamic changes in RUL predictions under different service cycles, as shown in Fig. 20.

Fig. 20
figure 20

RUL prediction results of the EDS at different service stages; (a) RUL prediction at the 25th percentile service cycle; (b) RUL prediction at the 50th percentile service cycle; (c) RUL prediction at the 75th percentile service cycle; (d) RUL prediction at the 95th percentile service cycle.

Figure 20 illustrates the evolution of the composite health indicator for the electric drive system, the RUL predictions, and the 95% confidence intervals across four service cycles. At the 25th percentile service cycle, limited historical degradation data result in larger errors in parameter estimation, leading to wider confidence intervals and significant discrepancies between the predicted and actual RUL values. As service cycles accumulate, the degradation model parameters converge toward optimal values, the confidence intervals for the nonlinear composite health indicator narrow, and the RUL probability density distribution becomes more concentrated, reducing the error between predicted and actual RUL values. These findings confirm the effectiveness of the degradation model and parameter estimation method in modeling the composite health indicator and predicting RUL for the electric drive system.

To thoroughly analyze the RUL predictions across different service cycles, the RUL probability density functions are calculated for each cycle using Eq. (38). The expected RUL values are derived from Eq. (39) and compared with actual RUL values, as shown in Fig. 21a. The RUL probability density functions for all service cycles encompass the actual RUL values of the target user, with the distributions becoming increasingly concentrated as service cycles progress. Comparison of predicted and actual RUL values reveals that, in earlier cycles, limited historical data lead to inaccurate parameter estimation, resulting in significant deviations. However, after the 20th percentile service cycle, predicted values closely align with actual values. Furthermore, the prediction error, defined as the difference between predicted and actual values, is presented in Fig. 21b. Beyond the 20th percentile, the prediction error stabilizes, with relative errors consistently within 10%.

Fig. 21
figure 21

Comparison of RUL Prediction results and relative error; (a) Comparison between predicted and actual RUL values; (b) Relative error.

The RUL prediction results for the four representative users are shown in Fig. 22, including comparisons between actual and predicted RUL values, as well as the corresponding RUL probability density distributions.

Fig. 22
figure 22

RUL prediction results for users under different cumulative damage rates; (a) The user at the 25th percentile of the Ds-CDF; (b) The user at the 50th percentile of the Ds-CDF; (c) The user at the 75th percentile of the Ds-CDF; (d) The user at the 95th percentile of the Ds-CDF.

As illustrated in Fig. 22, the RUL probability density distributions for different users at specific service cycles consistently encompass the actual RUL values. In the early service stages, the predicted RUL mean exhibits noticeable deviations from the actual values. However, as historical data accumulate, the model parameter estimates stabilize, leading to closer alignment between predicted and actual RUL values in later service stages, with probability density distributions becoming more concentrated compared to earlier stages.

Comparative analysis

To validate the effectiveness of the proposed method, comparative experiments are conducted using the classical wiener process model and a CNN-LSTM deep learning model on the same degradation dataset. Prediction performance is evaluated in terms of RMSE, MAE, and CIC.

The Wiener process model characterizes degradation dynamics according to drift and diffusion terms, with parameters estimated online from degradation increments. RUL is predicted under the assumption of an inverse Gaussian distribution, where the expected value serves as the point estimate. The confidence bounds \([{F^{ - 1}}(\alpha /2),{F^{ - 1}}(1 - \alpha /2)]\) are computed using the inverse cumulative distribution function, with the significance level set to 0.05, corresponding to a 95% confidence interval.

The CNN-LSTM deep learning model integrates the strengths of convolutional neural networks and long short-term memory networks, enabling the simultaneous capture of spatial features and temporal dependencies during degradation processes. The model architecture comprises convolutional layers, batch normalization layers, activation layers, pooling layers, LSTM layers, and fully connected layers. The input to the model includes features representing the current degradation state, supplemented by statistical feature parameters such as the mean, variance, and extrema of the electric drive system’s rotational speed, torque, and temperature over the service cycle. Hyperparameters for this model have been optimized through sensitivity analysis, with the number of convolutional filters set to 32, the number of LSTM units to 30, the convolutional kernel size to 10, the initial learning rate to 0.001, and the learning rate decay factor to 0.1. To mitigate overfitting, a dropout ratio of 0.3 is incorporated. During training, degradation data from 26 users are utilized, with the aforementioned four target users employed for prediction. Given that the CNN-LSTM model inherently produces point predictions, the standard deviation of the prediction error \({\sigma _{{\text{res}}}}\) is derived through residual analysis, assuming the residuals approximate a normal distribution. Confidence intervals for the predicted values are then constructed using the quantiles of this normal distribution, defined as \([\hat {y} - {z_{1 - \alpha /2}} \cdot {\sigma _{{\text{res}}}},\hat {y}+{z_{1 - \alpha /2}} \cdot {\sigma _{{\text{res}}}}]\).

Figure 23 presents the comparison of RUL prediction results across the three models for the four target users. As observed, the wiener model exhibits large fluctuations and significant deviations from the ground truth. The CNN-LSTM model is able to track the degradation trend more effectively, but with occasional overestimation or underestimation. In contrast, the proposed method produces prediction curves closely aligned with the true RUL, demonstrating superior robustness and accuracy across different users.

Fig. 23
figure 23

Comparison of RUL prediction expected values across different models; (a) The user at the 25th percentile of the Ds-CDF; (b) The user at the 50th percentile of the Ds-CDF; (c) The user at the 75th percentile of the Ds-CDF; (d) The user at the 95th percentile of the Ds-CDF.

To further quantify prediction accuracy and uncertainty, a comparative analysis of evaluation metrics across different models for various users is presented, as detailed in Table 2.

Table 2 Comparative results of evaluation metrics for each user across different models.

The Wiener model, which relies solely on assumptions of linear drift and Gaussian noise, exhibits significant prediction variability when degradation trajectories display pronounced nonlinearity, resulting in notably higher RMSE and MAE values compared to other models, alongside a lower CIC metric. In contrast, the CNN-LSTM model demonstrates superior RMSE and MAE performance across different stages compared to the Wiener model, with CIC values ranging from 0.87 to 0.91, indicating robust fitting capabilities for complex nonlinear degradation data. However, this model is limited to point predictions and lacks explicit modelling of prediction uncertainty. The proposed method in this study addresses these limitations by explicitly incorporating degradation mechanisms and constructing nonlinear degradation trajectories during the modelling process. By integrating an exponential degradation model with EM parameter estimation and Bayesian updates, it effectively models and predicts complex nonlinear degradation processes. The results reveal a high degree of consistency between predicted and actual RUL values, with RMSE and MAE maintained at low levels, and a CIC metric exceeding 0.93, significantly outperforming both the Wiener and CNN-LSTM models.

Conclusion

This study introduces an innovative framework for nonlinear degradation modeling and RUL prediction tailored to electric drive system with multiple failure modes. The conclusions are as follows:

(1) Leveraging actual operational data collected from the EDS, this study correlates the failure physics characteristics of core EDS components and employs the entropy weight method to construct a composite health indicator for the system. Accelerated durability bench testing of EDS is conducted, with comparative experiments between accelerated and unaccelerated spectra, establishing nonlinear degradation trajectories and models under user-specific scenarios.

(2) Building on the foundation of nonlinear degradation modeling, a RUL prediction model is proposed, integrating EM parameter estimation with Bayesian updates. This model enables dynamic RUL prediction and uncertainty quantification across multiple users and various service stages. The case study reveals that prediction errors stabilize within 10% after approximately 20% of the service life.

(3) Comparative experiments involving multiple algorithmic models are performed, revealing that the Wiener model exhibits significant prediction variability and notable deviations from actual values. The CNN-LSTM model demonstrates robust fitting capabilities for nonlinear degradation data but is subject to fluctuations involving overestimation and underestimation. In contrast, the method proposed in this study explicitly incorporates degradation mechanisms and nonlinear degradation modeling, exhibiting enhanced robustness and precision across different users and degradation stages.

Overall, the proposed methods not only improve the accuracy of health state monitoring and RUL prediction for electric drive system but also lay the groundwork for broader applications in industries involving complex, multi-component systems. Future research can expand on this approach by incorporating additional failure modes, enhancing the model’s scalability, and exploring its integration with other condition monitoring technologies to further advance predictive maintenance strategies.