Introduction

Supercapacitors (SCs) are widely deployed in various systems because of their high power density, long cycle life, and operability over a wide temperature range1,2,3,4. However, the performance of SCs is constrained by irreversible aging, which threatens their reliable operation and leads to unpredictable system failures5,6,7. State of health (SOH), an important indicator for managing health and avoiding system failures, has received much attention, and its prediction methods have developed rapidly. The challenge of SOH prediction is twofold: first, it is difficult to construct an accurate failure model that simulates a real, complex degradation process involving multiple physical and chemical reactions; second, many important parameters are difficult to measure and require expensive test equipment. Therefore, SOH prediction methods usually focus on capturing the internal state from easily measured parameters8.

The SOH can be predicted mainly by two different approaches: model-based and data-driven. Model-based methods use complex mathematical equations to simulate degradation mechanisms, necessitating a deep understanding of electrochemical principles. Various filtering models have been applied in previous research on SOH prediction9,10. For example, Mejdoubi et al.11 proposed a particle filter (PF) model to estimate the SOH of SCs, in which the capacitance and resistance could be predicted accurately under various conditions by considering the aging temperature and voltage. Walker et al.12 compared the PF model with the Kalman filter (KF) model for predicting the remaining useful life (RUL) of a lithium-ion battery, and found the PF model to be superior. Mohamed et al.13 proposed an enhanced mutated PF (EMPF) model for the SOH prediction of a lithium-ion battery and showed that the model can effectively capture dynamic system behaviors. These studies capture part of the dynamic behaviors of the system by establishing accurate mathematical models. However, such prediction methods have the following problems14: (1) the physical and chemical processes in energy storage devices are very complex, so establishing physical models of RUL and SOH for energy storage devices is quite difficult; (2) these methods cannot accurately simulate the real attenuation process, because the attenuation trajectory is affected by the surrounding environment and load conditions, which are constantly changing; and (3) model-based methods are semi-empirical, so the results depend on the knowledge of the researchers and the quality of the experimental equipment.

Compared with model-based methods, data-driven methods focus on finding relationships within the data and make no assumptions about the RUL degradation of the device. Because they avoid building accurate electrochemical models, data-driven methods have attracted significant attention and have gradually become the mainstream for SOH prediction15. Various machine learning methods have been proposed for predicting SOH, for example, linear or nonlinear regression models16,17, support vector machines18,19,20,21, artificial neural networks22,23, long short-term memory algorithms24,25, and so on. The goal of these models is the same: to find the mapping from input features to SOH. However, the mapping from charge–discharge cycles learned by these conventional data-driven methods is over-simplistic because of the uncertainty of vectors in the degradation process26. Compared to conventional data-driven methods, Gaussian process regression (GPR), a data-driven SOH prediction method, offers adaptive hyper-parameter acquisition and is easy to implement. Because of the non-parametric characteristic27 of GPR, the model can be calibrated naturally according to data requirements. Furthermore, GPR not only gives a specific output, but also provides confidence for the predicted results, enabling informed decisions. According to the research of Chen et al.28, GPR is well suited to solving complex regression problems, including high-dimensional and nonlinear predictions. Therefore, GPR has attracted great attention for the prediction of SOH, which is a complex nonlinear regression problem. For example, Liu et al.29 applied linear and quadratic polynomial mean functions with a compound covariance function composed of squared exponential and periodic covariance functions to predict the SOH of self-recharging lithium-ion batteries (the first 100 cycles for training, the remaining 68 cycles for testing). Richardson et al.26 used a power function as the mean function and analyzed prediction results with different covariance functions, including squared exponential, periodic, and Matérn covariance functions (approximately the first 72 cycles for training, the remaining 108 cycles for testing). Yang et al.27 selected four input features based on charging curves, analyzed these features with gray relational analysis, and proposed a compound covariance function called double squared exponential for SOH prediction (the first 80 cycles for training, approximately the remaining 90 cycles for testing). These pioneering studies, with satisfactory prediction accuracy, are based on explicit functions (i.e., functions with a specified equation) in GPR methods, and mainly focus on the selection of input features, mean functions, and covariance functions. In these studies, at least 40% of the data set is used as training data, which helps improve SOH prediction accuracy; however, finding the most suitable configuration is time-consuming. In addition, once the system changes, the previously selected input features, mean function, and covariance function are not necessarily applicable anymore, meaning that researchers must find different configurations for different systems. To solve these problems and obtain GPR with higher efficiency and accuracy, numerous methods have been proposed in recent years to improve the performance of GPR. For example, Li et al. combined GPR with other regression methods and proposed a multi-time-scale framework for predicting the SOH and RUL of lithium-ion batteries30; Wang et al. used a modified kernel GPR algorithm to simulate the degradation process of batteries31; and Hu et al. designed a dual GPR model for the SOH and RUL prognosis of battery packs32. These studies all aim at improving the accuracy and efficiency of GPR methods in the prognosis of SOH and RUL, illustrating the necessity of modifying GPR to obtain better performance.

In this study, we present a more general method based on implicit mean and covariance functions in GPR. Specifically, this study follows this procedure for SOH prediction: (1) a batch of SCs undergoes cyclic charge and discharge tests (called the preliminary data set), after which the capacitance of each cycle is collected to calculate the mean value of each cycle (regarded as the implicit mean function) and the covariance between each pair of cycles (regarded as the implicit covariance function)33; (2) the explicit and implicit mean and covariance functions are used to perform GPR predictions, respectively; and (3) SOH predictions with reduced cycle data are discussed. The workflow of SOH prediction with the GPR-implicit function model is shown in Fig. 1, where the blue parts correspond to traditional SOH prediction with explicit functions and the green parts correspond to the GPR-implicit function model proposed in this study. The compound mean and covariance functions contain the explicit and implicit functions simultaneously. Since the test set contains 22 SCs and each SC produces an RMSE and a MAPE, we use the average root mean square error (Average RMSE) and average mean absolute percentage error (Average MAPE) to reasonably evaluate the models (see “Evaluation” section for details). In this study, the predicted Average RMSE is 0.0056 F and the Average MAPE is 0.6% after introducing the implicit function, where only the first 5% of the cycles are used as training data to predict the remaining 95% of the cycles. Compared with previous studies based on explicit functions, the GPR-implicit function model decreases the prediction error by more than three times. We further use fewer cycles as training data, that is, 1% of the cycles, to predict the remaining 99% of the cycles, which also produces reasonable prediction errors (Average RMSE of 0.0094 F and Average MAPE of 1.01%). Although only a univariate model, capacitance vs. cycle, is applied, the high prediction accuracy and good robustness of the GPR-implicit function model with less training data highlight the ability of implicit functions to model the real SOH of SCs. The method proposed in this study is applicable not only to SOH prediction of SCs, but also to other energy storage devices such as lithium-ion batteries.

Figure 1

The workflow of SOH prediction. The blue parts correspond to the conventional methods of SOH prediction, where explicit functions are used in GPR. The green parts correspond to the GPR-implicit function model presented in this study, where the implicit mean and covariance functions are derived from the preliminary data set.

Data preprocessing

The research framework is shown in Fig. S1 of the Supplementary Materials (SM). As shown in Fig. S1a, the supercapacitors undergo cycles of charging and discharging, which provides a preliminary data set for the subsequent Gaussian process predictions. Then, based on the preliminary data set, we predict the SOH of supercapacitors by different Gaussian process regressions (i.e., with different mean or covariance functions), and obtain their RMSE and MAPE as the criteria for judging prediction performance. This comparison shows the precision advantage of GPR with implicit functions. Figure S1b shows the comparison of accuracy when using different proportions of data as the training set, where the GPR with an implicit function has a higher accuracy than conventional GPR even when the proportion of the training set is reduced. More details can be found in the “Results and discussion” section.

Definition of SOH

SOH is an indicator characterizing the degree of aging of SCs, providing users with basic information on health status and thereby ensuring stable operation of energy storage devices (i.e., SCs or lithium-ion batteries). There are two widely adopted definitions of SOH: one based on the increase of equivalent series resistance (ESR), and the other based on the decrease of capacitance34,35. The SOH in this study refers to the capacitance-ratio definition, expressed in Eq. (1), where SOHi represents the SOH value of the i-th cycle, Ci is the capacitance of the i-th cycle, and CR is the rated capacitance of the SC. As the number of cycles increases, the SC ages because of irreversible reactions during the charge and discharge processes, leading to a decrease in capacitance. The rated capacitance of the SCs used in this study is 1 F; therefore, SOHi and Ci are numerically equal.

$$ \begin{array}{*{20}c} {SOH_{i} = \frac{{C_{i} }}{{C_{R} }}.} \\ \end{array} $$
(1)

Experimental data and analysis

In this work, the cycle data of 88 SCs are obtained; each SC is cycled 10,000 times with a charging and discharging policy in a temperature-controlled environment (28 °C). All SCs are discharged at a constant current of 20 mA. To conform to actual charging processes, we apply two different charging policies: the first is a 20 mA constant-current charging policy, and the second is a stepped charging policy with a current of 15 mA from 1 to 1.85 V, 10 mA from 1.85 to 2.36 V, and 5 mA from 2.36 to 2.7 V.

The preliminary data set refers to the data of previous SCs tested before a new SC is cycled. For example, a user who has tested 10 SCs can use the cycle data of these 10 SCs as the preliminary data set to predict the SOH of the 11th SC. Figure 2 shows the SOH trajectories of the preliminary data set used in this study, most of which intersect each other and follow a similar aging rate: from fast to slow. Such aging behavior is different from the observed battery aging behavior36,37. To better understand the data from a statistical point of view, the inset of Fig. 2 presents the capacitance distribution at the 500th cycle, which is remarkably close to a Gaussian distribution. Such Gaussian distributions can also be observed at other cycles, which inspires us to use the GPR-implicit function model for predicting the SOH of SCs, in line with the GPR assumption that each predicted output (i.e., capacitance) follows a Gaussian distribution and any pair of outputs obeys a joint Gaussian distribution. These concepts are detailed in the subsequent sections.

Figure 2

SOH trajectories of SCs as a function of cycle number, calculated from the preliminary data set. The inset shows the distribution of capacitance at the 500th cycle, which approximates a Gaussian distribution.

Different from the explicit functions widely used in GPR methods, the implicit function, derived from the preliminary data set, does not have a specific equation. To achieve high-precision SOH prediction with appropriate implicit functions, we calculate the mean and standard deviation of capacitance for each cycle based on the attenuation curves of the preliminary data set of 66 SCs. Figure 3 shows the mean capacitance (blue curve) and its positive and negative standard deviations (blue shaded area) as a function of cycle number, where the decay goes from fast to slow with increasing cycle number, the same behavior as in Fig. 2. The standard deviation in Fig. 3 quantifies the dispersion of the capacitance of each cycle in Fig. 2, which can be used to calculate the implicit covariance function in GPR. Specifically, the average capacitance at the 500th cycle calculated from the 66 SCs is 0.943 F, with a standard deviation of 0.015 F, shown as the black point in Fig. 3. Therefore, the implicit mean and covariance functions can be obtained from the preliminary data set for high-precision SOH predictions.

Figure 3

Mean capacitance as a function of cycle number, calculated from the preliminary data set. The black point indicates that the mean capacitance at the 500th cycle is 0.943 F. The blue shaded area spans mean − σ to mean + σ, where σ is the standard deviation quantifying the capacitance dispersion of a given cycle and is used to calculate the implicit covariance function in GPR.

Method

GPR is a nonparametric stochastic-process method based on probability38. Unlike most machine learning algorithms, which give only a specific output, GPR gives the probability distribution of an output based on Bayesian theory. The key idea behind Bayesian theory is that the posterior probability can be updated from the prior probability after observing an event. Such a result lends credibility to the prediction from a probabilistic perspective and provides more adequate prediction information.

GPR

The goal of GPR is to learn the mapping from an input vector x to some Gaussian distribution f(x), given the observed input–output data points \(D = \left\{ {\left( {x_{i} ,y_{i} } \right)} \right\}_{i = 1}^{n}\), where n is the total number of training data points, \(x_{i}\) is the i-th input vector, and \(y_{i}\) is the i-th output. In this study, \(x_{i}\) refers to the i-th cycle number, and \(y_{i}\) refers to the i-th capacitance.

To obtain the mapping, a joint distribution among these distributions should be considered. That is, the properties of the multivariate distribution \(f\left( {\mathbf{x}} \right)\) are completely determined by a mean function m(x) and a covariance function \(\kappa \left( {{\mathbf{x}}, {\mathbf{x^{\prime}}}} \right)\), which is denoted as

$$ \begin{array}{*{20}c} {f\left( {\mathbf{x}} \right)\sim {\mathcal{GP}}\left( {m\left( {\mathbf{x}} \right),\kappa \left( {{\mathbf{x}},{\mathbf{x^{\prime}}}} \right)} \right),} \\ \end{array} $$
(2)

where m(x) and \(\kappa \left( {{\mathbf{x}}, {\mathbf{x^{\prime}}}} \right)\) are denoted by

$$ \begin{array}{*{20}c} {\left\{ {\begin{array}{*{20}l} {m\left( {\mathbf{x}} \right) = E\left[ {f\left( {\mathbf{x}} \right)} \right]} \hfill \\ {\kappa \left( {{\mathbf{x}},{\mathbf{x^{\prime}}}} \right) = E\left[ {\left( {f\left( {\mathbf{x}} \right) - m\left( {\mathbf{x}} \right)} \right)\left( {f\left( {{\mathbf{x^{\prime}}}} \right) - m\left( {{\mathbf{x^{\prime}}}} \right)} \right)} \right]} \hfill \\ \end{array} } \right..} \\ \end{array} $$
(3)

In a basic GPR, the mean function is usually set to zero, and the covariance function is usually composed of a squared exponential (SE) covariance function and noise function, as shown in Eq. (4).

$$ \begin{array}{*{20}c} {\kappa \left( {{\mathbf{x}},{\mathbf{x^{\prime}}}} \right) = \kappa_{SE} \left( {{\mathbf{x}},{\mathbf{x^{\prime}}}} \right) + \kappa_{n} \left( {{\mathbf{x}},{\mathbf{x^{\prime}}}} \right).} \\ \end{array} $$
(4)

\(\kappa_{SE} \left( {{\mathbf{x}},{\mathbf{x^{\prime}}}} \right)\) denotes the n × m matrix of covariances evaluated at all pairs of training and test points, where x contains n training points and x′ contains m test points. In practice, the covariance between two points, \(x_{i}\) and \(x_{j}\), can be calculated by

$$ \begin{array}{*{20}c} {k_{ij} = \sigma_{f}^{2} \exp \left( { - \frac{{\left( {x_{i} - x_{j} } \right)^{2} }}{{2l^{2} }}} \right),} \\ \end{array} $$
(5)

where \(\sigma_{f}\) and l are hyper-parameters controlling the vertical scale and length scale of the Gaussian function, respectively. Many types of covariance functions exist, and most have their own application scenarios. These covariance functions can also be combined with each other by affine transformations to produce a compound function for discovering more subtle data rules. In essence, a covariance function describes the correlation strength between two inputs and ultimately affects the predicted probability distribution through the joint distribution. This means the choice of covariance function plays a crucial role in the final prediction result.
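As an illustration, the SE covariance of Eq. (5) can be evaluated for all pairs of cycle numbers in a few lines of NumPy. This is a minimal sketch rather than the authors' implementation; the function name `se_kernel` and the hyper-parameter values are our own.

```python
import numpy as np

def se_kernel(x1, x2, sigma_f, length):
    """Squared-exponential covariance of Eq. (5) between two 1-D input arrays.

    Returns the len(x1) x len(x2) matrix with entries
    k_ij = sigma_f^2 * exp(-(x1_i - x2_j)^2 / (2 * length^2)).
    """
    sq_dist = (x1[:, None] - x2[None, :]) ** 2   # pairwise squared distances
    return sigma_f ** 2 * np.exp(-sq_dist / (2.0 * length ** 2))

# Example: covariance between 500 training cycles and 9500 test cycles.
x_train = np.arange(1, 501, dtype=float)
x_test = np.arange(501, 10001, dtype=float)
K_star = se_kernel(x_train, x_test, sigma_f=1.0, length=200.0)   # shape (500, 9500)
```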

In a regular regression problem, the output \(y\) of the model consists of two parts as follows:

$$ \begin{array}{*{20}c} {y = f\left( x \right) + \varepsilon ,} \\ \end{array} $$
(6)

where \(f\left( x \right)\) represents the actual rule and \(\varepsilon\) is invisible noise. In GPR, \(f\left( x \right)\) is the Gaussian process we expect to obtain. Here, the observed y is also a Gaussian process, restricted by the joint Gaussian distribution. Therefore, when calculating the n-dimensional symmetric positive-definite covariance matrix of the training points \( \kappa \left( {{\mathbf{x}},{\mathbf{x}}} \right)\), the noise function must be considered, and is denoted as

$$ \begin{array}{*{20}c} {\kappa_{n} \left( {{\mathbf{x}},{\mathbf{x}}} \right) = \sigma_{n}^{2} I,} \\ \end{array} $$
(7)

where \(\sigma_{n}\) is a hyper-parameter of the noise and I is an n-dimensional identity matrix.

After the covariance function is determined, the corresponding hyper-parameters,\( \Theta = \left[ {\sigma_{f}^{2} , l, \sigma_{n}^{2} } \right]\), are optimized by maximizing the log-likelihood function with training points, which is defined as

$$ L = \log p\left( {{\mathbf{y}}{|}{\mathbf{x}}, \Theta } \right){ } = - \frac{1}{2}\log \left( {\det \left( {\kappa \left( {{\mathbf{x}},{\mathbf{x}}} \right) + \sigma_{n}^{2} {\mathbf{I}}} \right)} \right) - \frac{1}{2}{\text{y}}^{T} \left[ {\kappa \left( {{\mathbf{x}},{\mathbf{x}}} \right) + \sigma_{n}^{2} {\mathbf{I}}} \right]^{ - 1} {\mathbf{y}} - \frac{n}{2}{\text{log}}2\pi . $$
(8)
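A sketch of this training step, assuming the SE covariance of Eq. (5): the negative of Eq. (8) is minimized over the log hyper-parameters with SciPy, using a Cholesky factorization for the determinant and inverse. The starting values and the arrays `x_train`, `y_train` are illustrative.

```python
import numpy as np
from scipy.linalg import cho_factor, cho_solve
from scipy.optimize import minimize

def neg_log_likelihood(log_theta, x, y):
    """Negative of the log-likelihood L in Eq. (8).

    log_theta = log([sigma_f^2, l, sigma_n^2]); optimizing on a log scale keeps
    the hyper-parameters positive without explicit constraints.
    """
    sigma_f2, length, sigma_n2 = np.exp(log_theta)
    K = sigma_f2 * np.exp(-(x[:, None] - x[None, :]) ** 2 / (2.0 * length ** 2))
    K += sigma_n2 * np.eye(len(x))               # kappa(x, x) + sigma_n^2 I
    c, low = cho_factor(K)                       # Cholesky for stable det/solve
    alpha = cho_solve((c, low), y)               # [kappa(x, x) + sigma_n^2 I]^{-1} y
    log_det = 2.0 * np.sum(np.log(np.diag(c)))
    return 0.5 * (log_det + y @ alpha + len(x) * np.log(2.0 * np.pi))

# res.x holds the optimized log hyper-parameters Theta.
res = minimize(neg_log_likelihood, x0=np.log([1.0, 100.0, 0.01]),
               args=(x_train, y_train), method="L-BFGS-B")
```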

Based on the above implementation, the training phase of GPR is completed. Then, the posterior distribution of the test points is derived from the prior distribution, the joint distribution, and the training points through the Bayesian formula to perform the regression prediction. Specifically, for a given set of inputs \({\mathbf{x}}^{*}\) of test points, we assume that their corresponding outputs \({\mathbf{f}}^{*}\) also follow a Gaussian distribution and obey a joint Gaussian distribution with the training points. This relationship can be described as

$$ \begin{array}{*{20}c} {\left( {\begin{array}{*{20}c} {\mathbf{y}} \\ {{\mathbf{f}}^{*} } \\ \end{array} } \right)\sim {\mathcal{N}}\left( {{\mathbf{0}},\left( {\begin{array}{*{20}c} {\kappa \left( {{\mathbf{x}},{\mathbf{x}}} \right) + \sigma_{n}^{2} {\mathbf{I}}} & {\kappa \left( {{\mathbf{x}},{\mathbf{x}}^{*} } \right)} \\ {\kappa \left( {{\mathbf{x}}^{*} ,{\mathbf{x}}} \right)} & {\kappa \left( {{\mathbf{x}}^{*} ,{\mathbf{x}}^{*} } \right)} \\ \end{array} } \right)} \right).} \\ \end{array} $$
(9)

The posterior distribution is derived as

$$ \begin{array}{*{20}c} {{\mathbf{f}}^{*} {|}{\mathbf{x}},{\mathbf{y}},{\mathbf{x}}^{*} \sim {\mathcal{N}}\left( {\overline{{{\mathbf{f}}^{*} }} ,{\text{cov}}\left( {{\mathbf{f}}^{*} } \right)} \right).} \\ \end{array} $$
(10)

This means that when \({\mathbf{x}},{\mathbf{y}},{\mathbf{x}}^{*}\) are known, \({\mathbf{f}}^{*}\) obeys a multivariate Gaussian distribution with mean vector \(\overline{{{\mathbf{f}}^{*} }}\) and covariance matrix \({\text{cov}}\left( {{\mathbf{f}}^{*} } \right)\), where

$$ \begin{array}{*{20}c} {\overline{{{\mathbf{f}}^{*} }} = \kappa \left( {{\mathbf{x}}^{*} ,{\mathbf{x}}} \right)\left[ {\kappa \left( {{\mathbf{x}},{\mathbf{x}}} \right) + \sigma_{n}^{2} {\mathbf{I}}} \right]^{ - 1} {\mathbf{y}},} \\ \end{array} $$
(11)
$$ \begin{array}{*{20}c} {cov\left( {{\mathbf{f}}^{*} } \right) = \kappa \left( {{\mathbf{x}}^{*} ,{\mathbf{x}}^{*} } \right) - \kappa \left( {{\mathbf{x}}^{*} ,{\mathbf{x}}} \right)\left[ {\kappa \left( {{\mathbf{x}},{\mathbf{x}}} \right) + \sigma_{n}^{2} {\mathbf{I}}} \right]^{ - 1} \kappa \left( {{\mathbf{x}},{\mathbf{x}}^{*} } \right).} \\ \end{array} $$
(12)

When a specific predicted value is required, the mean vector \(\overline{{{\mathbf{f}}^{*} }}\) is used as the output of the GPR model. Meanwhile, the matrix \({\text{cov}}\left( {{\mathbf{f}}^{*} } \right)\) is used to determine the confidence area, with the variance of each output extracted from the matrix diagonal.
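Equations (11) and (12) translate directly into code. The following sketch reuses the `se_kernel` helper and the arrays defined above; it is an illustrative zero-mean implementation, not the authors' code.

```python
import numpy as np

def gpr_predict(x_tr, y_tr, x_te, sigma_f, length, sigma_n):
    """Zero-mean GPR posterior: mean from Eq. (11), covariance from Eq. (12)."""
    K = se_kernel(x_tr, x_tr, sigma_f, length) + sigma_n ** 2 * np.eye(len(x_tr))
    K_s = se_kernel(x_te, x_tr, sigma_f, length)      # kappa(x*, x)
    K_ss = se_kernel(x_te, x_te, sigma_f, length)     # kappa(x*, x*)
    mean = K_s @ np.linalg.solve(K, y_tr)             # Eq. (11)
    cov = K_ss - K_s @ np.linalg.solve(K, K_s.T)      # Eq. (12)
    return mean, cov

mean, cov = gpr_predict(x_train, y_train, x_test, 1.0, 200.0, 0.1)
std = np.sqrt(np.diag(cov))   # per-point standard deviation for the confidence area
```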

The input \(x_{i}\) and output \(f(x_{i})\) in this article are both one-dimensional scalars, where \(x_{i}\) is a time series, that is, the cycle number26,29, rather than physical features14,27,39,40, and the output is the corresponding discharge capacitance. However, when needed, a similar study can be performed with high-dimensional inputs to obtain better results, although this requires some feature engineering.

Improved GPR

In most cases, especially for an interpolation prediction problem (where the test inputs lie between the training inputs), setting the mean function to 0 is acceptable because of the flexibility of the GP to model the real mean41. However, when the test input is far from the training input, as in a prediction problem based on a time series, the prediction ability of GPR is limited, and the prediction performance declines rapidly as the distance increases. In this case, most covariance functions can hardly extract effective information, so the final prediction is mainly determined by the mean function, which is set as a constant of 0. For example, in the basic GPR mentioned above, when the distance between \(x_{i}\) (training input) and \(x_{j}\) (test input) is relatively large, the covariance term \(k_{ij}\) is almost zero, resulting in a predicted output near zero.

If the mean function is adjusted so that it captures some rules embedded in the model, the prediction performance can be improved. Most existing studies focus on parametric function fitting, where the mean function is obtained by fitting the training set with, for example, a linear function, a quadratic function, or another degradation formula. However, for energy storage devices such as batteries and SCs, a parametric function describing the degradation process is difficult to determine and can hardly describe the entire degradation process accurately, since degradation is a complex nonlinear process. Here, the mean function is improved by prior knowledge from the preliminary data set, rather than by curve fitting. Specifically, for a given input \(x_{i}\), the mean function is

$$ \begin{array}{*{20}c} {{\text{m}}_{pre} {(}x_{i} {) = }\frac{1}{N}\mathop \sum \limits_{{k \in D_{p} }} y_{i}^{\left( k \right)} ,} \\ \end{array} $$
(13)

where \(D_{p}\) is the preliminary data set, \(N\) is the size of the preliminary data set, and \(y_{i}^{\left( k \right)}\) is the observed output at the i-th point of the k-th SC. The mean function remains unchanged once the preliminary data set is determined.
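A minimal sketch of Eq. (13), assuming the preliminary data set is stored as a hypothetical array `Y_pre` with one row of observed capacitances per SC:

```python
import numpy as np

# Y_pre: hypothetical (N, n_cycles) array, one row per SC in D_p.
m_pre = Y_pre.mean(axis=0)    # Eq. (13): average capacitance at each cycle

def mean_pre(x):
    """Implicit mean function: preliminary-set average at (1-based) cycle numbers x."""
    return m_pre[np.asarray(x, dtype=int) - 1]
```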

By adjusting the mean function in this way, the posterior distribution can similarly be derived from the prior distribution. The prior distribution is denoted as

$$ \begin{array}{*{20}c} {\left( {\begin{array}{*{20}c} {\mathbf{y}} \\ {{\mathbf{f}}^{*} } \\ \end{array} } \right)\sim {\mathcal{N}}\left( {\left( {\begin{array}{*{20}c} {{\text{m}}_{pre} \left( {\mathbf{x}} \right)} \\ {{\text{m}}_{pre} \left( {{\mathbf{x}}^{*} } \right)} \\ \end{array} } \right),\left( {\begin{array}{*{20}c} {\kappa \left( {{\mathbf{x}},{\mathbf{x}}} \right) + \sigma_{n}^{2} {\mathbf{I}}} & {\kappa \left( {{\mathbf{x}},{\mathbf{x}}^{*} } \right)} \\ {\kappa \left( {{\mathbf{x}}^{*} ,{\mathbf{x}}} \right)} & {\kappa \left( {{\mathbf{x}}^{*} ,{\mathbf{x}}^{*} } \right)} \\ \end{array} } \right)} \right).} \\ \end{array} $$
(14)

And the posterior distribution is

$$ \begin{array}{*{20}c} {{\mathbf{f}}^{*} {|}{\mathbf{x}},{\mathbf{y}},{\mathbf{x}}^{*} \sim {\mathcal{N}}\left( {\overline{{{\mathbf{f}}^{*} }} ,{\text{cov}}\left( {{\mathbf{f}}^{*} } \right)} \right).} \\ \end{array} $$
(15)

This means that when \({\mathbf{x}},{\mathbf{y}},{\mathbf{x}}^{*}\) are known, \({\mathbf{f}}^{*}\) obeys a multivariate Gaussian distribution with mean vector \(\overline{{{\mathbf{f}}^{*} }}\) and covariance matrix \({\text{cov}}\left( {{\mathbf{f}}^{*} } \right)\), where

$$ \begin{array}{*{20}c} {\overline{{{\text{f}}^{*} }} = {\text{m}}_{pre} \left( {{\mathbf{x}}^{*} } \right) + \kappa \left( {{\mathbf{x}}^{*} ,{\mathbf{x}}} \right)\left[ {\kappa \left( {{\mathbf{x}},{\mathbf{x}}} \right) + \sigma_{n}^{2} {\mathbf{I}}} \right]^{ - 1} \left( {{\mathbf{y}} - {\text{m}}_{pre} \left( {\mathbf{x}} \right)} \right),} \\ \end{array} $$
(16)
$$ \begin{array}{*{20}c} {cov\left( {{\mathbf{f}}^{*} } \right) = \kappa \left( {{\mathbf{x}}^{*} ,{\mathbf{x}}^{*} } \right) - \kappa \left( {{\mathbf{x}}^{*} ,{\mathbf{x}}} \right)\left[ {\kappa \left( {{\mathbf{x}},{\mathbf{x}}} \right) + \sigma_{n}^{2} {\mathbf{I}}} \right]^{ - 1} \kappa \left( {{\mathbf{x}},{\mathbf{x}}^{*} } \right),} \\ \end{array} $$
(17)

where \({\text{cov}}\left( {{\mathbf{f}}^{*} } \right)\) is the same as in Eq. (12). More details can be found in Ref. 37.

The prior knowledge includes not only the mean but also the covariance matrix. The traditional covariance matrix is calculated by an explicit covariance function, such as the SE covariance function mentioned above. However, the covariance function is also difficult to determine, and it requires more modeling effort than the mean function because it plays a larger role in GPR. Since the mean function is obtained from the preliminary data set, a similar procedure can be used for the covariance function. For any two inputs \(x_{i}\) and \(x_{j}\), the covariance term is defined as

$$ k_{ij} = {\text{m}}\left( {x_{i} ,x_{j} } \right) - {\text{m}}\left( {x_{i} } \right){\text{m}}\left( {x_{j} } \right) = \frac{1}{N}\mathop \sum \limits_{{k \in D_{p} }} y_{i}^{\left( k \right)} y_{j}^{\left( k \right)} - \frac{1}{{N^{2} }}\mathop \sum \limits_{{k \in D_{p} }} y_{i}^{\left( k \right)} \mathop \sum \limits_{{k \in D_{p} }} y_{j}^{\left( k \right)} . $$
(18)

A covariance function composed of these terms is called \(\kappa_{pre}\), where \({\text{m}}\left( {x_{i} ,x_{j} } \right)\) denotes the mean of the product \(y_{i} y_{j}\) over the preliminary data set. At this point, the final covariance function can be extended as follows:

$$ \begin{array}{*{20}c} {\kappa \left( {{\mathbf{x}},{\mathbf{x^{\prime}}}} \right) = \kappa_{pre} \left( {{\mathbf{x}},{\mathbf{x^{\prime}}}} \right) + \kappa_{SE} \left( {{\mathbf{x}},{\mathbf{x^{\prime}}}} \right) + \kappa_{n} \left( {{\mathbf{x}},{\mathbf{x^{\prime}}}} \right).} \\ \end{array} $$
(19)

The acquisition process of the above-mentioned mean and covariance functions is shown in Fig. S2, which corresponds to the green part of Fig. 1.
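The following sketch shows how Eqs. (18) and (19) might be assembled from the same hypothetical `Y_pre` array and `m_pre` vector introduced above; it is illustrative rather than the authors' implementation.

```python
import numpy as np

def kappa_pre(xi, xj, Y_pre, m_pre):
    """Implicit covariance of Eq. (18) between (1-based) cycle numbers xi and xj.

    Y_pre: (N, n_cycles) capacitance matrix of the preliminary data set;
    m_pre: per-cycle mean from Eq. (13).
    """
    i = np.asarray(xi, dtype=int) - 1
    j = np.asarray(xj, dtype=int) - 1
    second_moment = Y_pre[:, i].T @ Y_pre[:, j] / Y_pre.shape[0]  # (1/N) sum_k y_i y_j
    return second_moment - np.outer(m_pre[i], m_pre[j])

def kappa_total(xi, xj, Y_pre, m_pre, sigma_f, length, sigma_n):
    """Compound covariance of Eq. (19): kappa_pre + kappa_SE + noise."""
    x1 = np.asarray(xi, dtype=float)
    x2 = np.asarray(xj, dtype=float)
    K = kappa_pre(xi, xj, Y_pre, m_pre)
    K += sigma_f ** 2 * np.exp(-(x1[:, None] - x2[None, :]) ** 2 / (2.0 * length ** 2))
    if np.array_equal(x1, x2):
        K += sigma_n ** 2 * np.eye(len(x1))   # Eq. (7): noise only between identical inputs
    return K
```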

Evaluation

The GPR model is evaluated using RMSE, MAPE, and confidence intervals. RMSE and MAPE quantitatively evaluate the predicted capacitance error, reflecting the model accuracy. Given a certain cycle c as the dividing point, cycle c and the cycles before it form the training phase, and the cycles after c form the test phase. RMSE and MAPE apply only to the test points, and are defined as

$$ \begin{array}{*{20}c} {{\text{RMSE}}^{\left( k \right)} = \sqrt {\frac{1}{m}\mathop \sum \limits_{i = c + 1}^{c + m} \left( {y_{i}^{\left( k \right)} - \hat{y}_{i}^{\left( k \right)} } \right)^{2} } ,} \\ \end{array} $$
(20)
$$ \begin{array}{*{20}c} {{\text{MAPE}}^{\left( k \right)} = \frac{1}{m}\mathop \sum \limits_{i = c + 1}^{c + m} \left| {\frac{{y_{i}^{\left( k \right)} - \hat{y}_{i}^{\left( k \right)} }}{{y_{i}^{\left( k \right)} }}} \right| \times 100\% ,} \\ \end{array} $$
(21)

where \({\mathrm{RMSE}}^{(k)}\) and \({\mathrm{MAPE}}^{(k)}\) are the RMSE and MAPE of the k-th SC, respectively, and m is the number of points in the test phase. Each test SC produces an RMSE and a MAPE. However, an accidental error in a single SC may prevent an accurate evaluation of the model. Therefore, to further improve the evaluation ability of the metrics, the Average RMSE and Average MAPE are proposed, defined as

$$ \begin{array}{*{20}c} {{\text{Average }}\;{\text{RMSE}} = \frac{1}{M}\mathop \sum \limits_{k = 1}^{M} {\text{RMSE}}^{\left( k \right)} ,} \\ \end{array} $$
(22)
$$ \begin{array}{*{20}c} {{\text{Average}}\;{\text{ MAPE}} = \frac{1}{M}\mathop \sum \limits_{k = 1}^{M} {\text{MAPE}}^{\left( k \right)} ,} \\ \end{array} $$
(23)

where M is the number of test SCs, which equals 22 in this study.
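A small sketch of Eqs. (20)–(23); the per-SC arrays `y_obs` and `y_hat` are hypothetical placeholders for the observed and predicted test-phase capacitances.

```python
import numpy as np

def rmse(y_true, y_pred):
    """Eq. (20): root-mean-square error over the m test cycles of one SC."""
    return np.sqrt(np.mean((y_true - y_pred) ** 2))

def mape(y_true, y_pred):
    """Eq. (21): mean absolute percentage error over the test cycles."""
    return np.mean(np.abs((y_true - y_pred) / y_true)) * 100.0

# Eqs. (22)-(23): average the per-SC errors over the M = 22 test SCs.
avg_rmse = np.mean([rmse(y_obs[k], y_hat[k]) for k in range(22)])
avg_mape = np.mean([mape(y_obs[k], y_hat[k]) for k in range(22)])
```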

Since the output of GPR is a probability distribution, confidence intervals can be used to measure the model’s credibility. A confidence interval provides an interval estimate of the predicted value of a test point, and represents the probability that the predicted value falls within a given interval. A 95% confidence interval is usually adopted, and the bounds for a Gaussian distribution are defined as42

$$ \begin{gathered} {\text{Upper bound}} = \mu + 1.96\sigma \hfill \\ {\text{Lower bound}} = \mu - 1.96\sigma , \hfill \\ \end{gathered} $$
(24)

where \(\mu\) and \(\sigma\) are the mean and standard deviation of the Gaussian distribution, respectively.
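In code, the band follows directly from the posterior `mean` and `std` computed in the prediction sketch above; `y_test` is a hypothetical array of observed test capacitances used to check the empirical coverage.

```python
import numpy as np

upper = mean + 1.96 * std   # Eq. (24), upper bound
lower = mean - 1.96 * std   # Eq. (24), lower bound
coverage = np.mean((y_test >= lower) & (y_test <= upper))  # fraction inside the band
```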

Results and discussion

An SC labeled No. 10 is randomly selected from the test SCs, and the corresponding prediction result is analyzed. This randomly selected SC is representative, since most of the SCs have a similar SOH trend. Cycle 500 is selected in most models as the boundary between training data and test data, implying that only the first 5% of the data (the first 500 cycles) are used to predict the remaining 95% of the cycles. The data set is divided into three groups: preliminary data set, training data set, and test data set, where the preliminary data set comes from the previous SCs tested before a new SC is cycled. The training data set and test data set come from the 5% training cycles and the remaining 95% test cycles of a selected SC, respectively. In Table S1, seven models with different mean and covariance functions are compared: (1) Model 1: an explicit logarithmic mean function \({\text{m}}\left( x \right) = A{\text{log}}x + B\), with covariance function \(\kappa_{SE} \left( {{\mathbf{x}},{\mathbf{x^{\prime}}}} \right)\); (2) Model 2: a mean function obtained from the preliminary data set, \({\text{m}}\left( x \right) = {\text{m}}_{pre} \left( x \right)\), with the same covariance function as Model 1; (3) Model 3: a mean function that is the sum of those of Models 1 and 2, \({\text{m}}\left( x \right) = A{\text{log}}x + B + {\text{m}}_{pre} \left( x \right)\), with the same covariance function as Model 1; (4) Model 4: the same mean function as Model 1, but with a different covariance function \(\kappa_{pre} + \kappa_{SE}\); (5) Model 5: the same mean function as Model 2, with the covariance function \(\kappa_{pre} + \kappa_{SE}\); (6) Model 6: the same mean function as Model 3, with the covariance function \(\kappa_{pre} + \kappa_{SE}\); and (7) Model 7: the same mean and covariance functions as Model 5, but using only the first 1% of the cycles to predict the remaining 99%.

In Table S1, the prediction result of Model 1 is the worst, with an Average RMSE of 0.021 F and an Average MAPE of 2.18%. The prediction result of Model 2 is the best among the first three, with an Average RMSE of 0.0139 F and an Average MAPE of 1.53%. The Average RMSE and Average MAPE of Model 3 are 0.0161 F and 1.72%, respectively. The results show that, with the same covariance function, the mean function consisting entirely of an implicit function achieves the lowest prediction error. Figure 4 shows the predicted capacitances and errors for Models 1, 2, and 3, where the covariance function \(\kappa_{SE} \left( {{\mathbf{x}},{\mathbf{x^{\prime}}}} \right)\) is the same and the mean functions differ. The dotted line indicates the 500th cycle. Before the 500th cycle, the GPR predictions are in good agreement with our experiment. However, as the cycle number increases, the difference between the GPR prediction and the experiment grows, and the confidence interval widens. Figure 4 shows that the first 5% of the data generate excellent prediction results for the first 500 cycles, but poorer results for the remaining 95% of the cycles. As seen in Fig. 4b,d,f, the prediction errors increase with the cycle number, which is consistent with the characteristic of the GPR model: the farther a test point is from the training points, the larger the overall error. These results highlight the ability of the GPR-implicit function model to describe the degradation of SCs, especially for the first 5% of the data.

Figure 4

The capacitance predictions of three GPR models with different mean functions and the same covariance function, where the first 5% of the cycles are used for training. (a,c,e) are the predicted capacitances of an SC for Models 1, 2, and 3, while (b,d,f) denote the prediction errors (observed − mean). The dotted line indicates the 500th cycle. The green and blue curves represent the observed and predicted mean capacitance, while the green dots and blue shaded areas denote the training data and confidence interval, respectively. The selection of ± 1.96 for the confidence interval is based on Eq. (24).

The optimized covariance functions in Models 1, 2, and 3 are based on the first 500 cycles only. The degradation in the first 500 cycles differs considerably from that in the remaining 9500 cycles; therefore, it is hard to capture the interaction between inputs and obtain a high SOH prediction accuracy. Rather than considering only the first 500 cycles, Models 4, 5, and 6 consider all cycles of an SC with prior knowledge from the preliminary data set by introducing the implicit function \(\kappa_{pre}\) into the covariance function. Among Models 4, 5, and 6, the prediction accuracy of Model 4 is the worst, with an Average RMSE of 0.0174 F and an Average MAPE of 1.77%. The prediction accuracy of Model 5 is the best, with an Average RMSE of 0.0056 F and an Average MAPE of 0.60%. The Average RMSE and Average MAPE of Model 6 are 0.0121 F and 1.26%, respectively. Compared with Models 1, 2, and 3, the prediction errors of Models 4, 5, and 6 decrease significantly after the introduction of the implicit function: Model 4 outperforms Model 1, Model 5 outperforms Model 2, and Model 6 outperforms Model 3, implying that the implicit function \(\kappa_{pre}\) in the covariance function provides sufficient prior knowledge and strongly captures the interactions between inputs.

In addition to the GPR models discussed in this study, we also compare several non-Gaussian data-driven methods as benchmark models, including two extrapolation models and three machine learning models (Auto Regression, Support Vector Machine, and Random Forest); see Note 1 of the SM. Among these GPR and benchmark models, Model 5 is still the best performer, which further demonstrates the accuracy of implicit functions.

Figure 5 shows the predicted capacitances of an SC as functions of cycle number based on Models 4, 5, and 6. In Fig. 5a,c,e, the predicted mean capacitances (blue curves) based on Models 4, 5, and 6 are closer to the observed green curves, and the confidence intervals are smaller than the corresponding areas in Fig. 4. Figure 5 demonstrates that Models 4, 5, and 6, which consider all cycles of an SC with prior knowledge from the preliminary data set, are superior to Models 1, 2, and 3, which only consider the first 500 cycles. In Table S1, the best model is Model 5, with the lowest Average RMSE of 0.0056 F and Average MAPE of 0.6%. This model uses the implicit function \({\text{m}}_{pre} \left( x \right)\) as the mean function and \(\kappa_{pre} + \kappa_{SE}\) (implicit function + explicit function) as the covariance function. Models 8 and 9 are from the pioneering studies by Yang et al.24 and Liu et al.22, with predicted RMSEs of 0.0167 F and 0.0332 F, respectively; their higher prediction errors relative to Model 5 further indicate that the implicit functions \(\kappa_{pre} \left( {{\mathbf{x}},{\mathbf{x^{\prime}}}} \right)\) and \({\text{m}}_{pre} \left( x \right)\) can significantly enhance the SOH prediction accuracy. Furthermore, the above studies usually use at least 40% of the cycles as training data for SOH prediction, whereas Models 1–6 only use the first 5% of the cycles (the first 500 cycles for training, the remaining 9500 for testing), exhibiting a strong and accurate long-term ability of GPR to predict the SOH of energy storage devices and to alleviate the data scarcity problem.

Figure 5

The predictions of the GPR models with different mean functions and compound covariance functions, where the first 5% cycles are used for training. The dotted line indicates the 500th cycle. (a,c,e) are the predicted capacitances of an SC for Models 4, 5, and 6, while (b,d,f) denote the predicted errors (observed − mean). The green and blue curves represent the observed and predicted mean capacitance, while the green dots and blue shaded areas denote the predicted training data and confidence interval, respectively. The selection of ± 1.96 for confidence interval is based on Eq. (24).

As a case study, Fig. 5 shows the strength of the implicit mean and covariance functions for SOH prediction of a selected SC. To investigate the model performance more thoroughly, we predict the SOH of all test SCs and obtain their RMSEs and MAPEs (the values are given in the SM). The predicted Average RMSE and Average MAPE over all SCs are 0.0056 F and 0.60%, respectively.

In Table S1, Model 5 performs best with the first 5% of the cycles used for training, which demonstrates the strength of the implicit function for SOH prediction. To further test the capacity of the implicit function, Model 7 performs the SOH prediction using only the first 1% of the cycles as training data, with the same mean and covariance functions as Model 5. As shown in Fig. 6, the dotted lines indicate the 100th cycle, the first 1% of the total SC cycles. For Model 7, the predicted RMSE is 0.0156 F and the MAPE is 1.69%, and the Average RMSE and Average MAPE over all SCs are 0.0094 F and 1.01%, respectively (Table S1). Although less information is provided, the GPR-implicit function model can still capture the SOH trend, and its performance is better than that of all models except Model 5, which uses the first 500 cycles. Compared with Model 5, the prediction error of Model 7 is slightly larger, but the training data amount to only 1% of the cycles, indicating that the method does not require a large amount of training data and can ease the problem of data scarcity in SOH prediction.

Figure 6

The prediction results of Model 7, where only 1% of the cycles are used for training. The dotted line indicates the 100th cycle. (a) The predicted capacitance for Model 7 as a function of cycle number. (b) The prediction error (observed − mean) for Model 7. The curves and blue shaded areas denote the predicted training data and confidence interval, respectively.

In addition, models with kernel functions other than the SE function \(\kappa_{SE} \left( {{\mathbf{x}},{\mathbf{x^{\prime}}}} \right)\) (Models 10–15) are applied as benchmark models to illustrate the universality of the method reported in this paper. Two kernel functions are chosen: the Matérn function and the rational quadratic function (RQ function), both of which are common kernel functions in SOH prediction43. The formula of the Matérn function is as follows:

$$ \begin{array}{*{20}c} {\kappa_{MA} \left( {{\mathbf{x}},{\mathbf{x^{\prime}}}} \right) = \sigma_{f}^{2} \frac{{2^{1 - \gamma } }}{{\Gamma \left( \gamma \right)}}\left( {\frac{{\sqrt {2\gamma } \left| {x_{i} - x_{j} } \right|}}{l}} \right)^{\gamma } {\mathcal{R}}_{\gamma } \left( {\frac{{\sqrt {2\gamma } \left| {x_{i} - x_{j} } \right|}}{l}} \right),} \\ \end{array} $$
(25)

where γ is a hyper-parameter reflecting the smoothness and \({\mathcal{R}}_{\gamma }\) represents the modified Bessel function. The Matérn function with \(\gamma = 2.5\) is widely used as a kernel function of GPR43; therefore, the value of \(\gamma\) is set to 2.5.

Along with the SE and Matérn functions, the RQ function is also a popular kernel function for GPR. The formula of the RQ function is as follows:

$$ \begin{array}{*{20}c} {\kappa_{RQ} \left( {{\mathbf{x}},{\mathbf{x^{\prime}}}} \right) = \delta_{RQ}^{2} \left( {1 + \frac{{\left( {x_{i} - x_{j} } \right)^{2} }}{{2\alpha l_{RQ}^{2} }}} \right)^{ - \alpha } ,} \\ \end{array} $$
(26)

where α reflects the relative weight of scale changes, and \(\delta_{RQ}\) and \(l_{RQ}\) are hyper-parameters controlling the vertical scale and length scale of the RQ function, respectively.
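For reference, both kernels can be sketched in NumPy/SciPy under the same conventions as the SE kernel above; `nu` plays the role of γ in Eq. (25), and the function names are our own.

```python
import numpy as np
from scipy.special import gamma, kv   # kv: modified Bessel function of the second kind

def matern_kernel(x1, x2, sigma_f, length, nu=2.5):
    """Matern covariance (Eq. 25) with smoothness nu (gamma in the text)."""
    r = np.abs(x1[:, None] - x2[None, :])
    scaled = np.sqrt(2.0 * nu) * r / length
    scaled = np.where(scaled == 0.0, 1e-10, scaled)   # avoid 0 * inf at r = 0
    k = sigma_f ** 2 * (2.0 ** (1.0 - nu) / gamma(nu)) * scaled ** nu * kv(nu, scaled)
    return np.where(r == 0.0, sigma_f ** 2, k)        # exact variance on the diagonal

def rq_kernel(x1, x2, sigma_rq, length_rq, alpha):
    """Rational-quadratic covariance (Eq. 26)."""
    sq_dist = (x1[:, None] - x2[None, :]) ** 2
    return sigma_rq ** 2 * (1.0 + sq_dist / (2.0 * alpha * length_rq ** 2)) ** (-alpha)
```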

The kernel functions of Models 10, 11, and 12 include the Matérn function (\(\kappa_{MA} \left( {{\mathbf{x}},{\mathbf{x^{\prime}}}} \right)\)), and the kernel functions of Models 13, 14, and 15 include the rational quadratic function (\(\kappa_{RQ} \left( {{\mathbf{x}},{\mathbf{x^{\prime}}}} \right)\)). The Average RMSE and Average MAPE of these six models are shown in Table S1. Comparing the errors of these models further demonstrates the superiority of GPR with implicit functions: (1) Models 10 and 11 share the same mean function; the kernel function of Model 11 is \(\kappa_{pre} \left( {{\mathbf{x}},{\mathbf{x^{\prime}}}} \right) + \kappa_{MA} \left( {{\mathbf{x}},{\mathbf{x^{\prime}}}} \right)\) (including the implicit function), whereas that of Model 10 is \(\kappa_{MA} \left( {{\mathbf{x}},{\mathbf{x^{\prime}}}} \right)\) (without the implicit function). As shown in Table S1, the Average RMSE and Average MAPE of Model 11 (0.0323 F, 3.17%) are much lower than those of Model 10 (0.0715 F, 7.11%), demonstrating the higher performance of GPR with the implicit function in the kernel. (2) Models 11 and 12 share the same kernel function; the mean function of Model 11 is \({\text{m}}_{pre} \left( x \right)\) (an implicit function), whereas that of Model 12 is \(A{\text{log}}x + B\) (an explicit function). Similarly, the Average RMSE and Average MAPE of Model 11 are much lower than those of Model 12 (0.1148 F, 11.19%), demonstrating the higher performance of GPR with the implicit function as the mean function.

The comparison among Models 10, 11, and 12 discussed above illustrates that GPRs with the Matérn function in the kernel conform to the conclusions of this paper. A similar comparison can be made among Models 13, 14, and 15, whose kernel functions are based on the RQ function. The better performance of Model 14 (Average RMSE = 0.0150 F, Average MAPE = 1.59%) compared with Model 13 (Average RMSE = 0.0201 F, Average MAPE = 2.12%) and Model 15 (Average RMSE = 0.0221 F, Average MAPE = 2.27%) demonstrates that the modified functions also apply well to the RQ-based GPR method.

Conclusion

Accurate SOH prediction can capture the working state of SCs and help avoid failures in advance. This work proposes an implicit function learning method to predict the SOH of SCs based on Gaussian process regression, where a preliminary data set is incorporated in the mean and covariance functions, and only 5% of the cycles are used as training data to predict the remaining 95% of the cycles. The prediction errors of the GPR-implicit function method, an Average RMSE of 0.0056 F and an Average MAPE of 0.60%, are much lower than those of traditional SOH predictions with only explicit functions. The present work demonstrates that GPR-implicit functions can effectively complement or replace GPR-explicit functions for SOH prediction, with the advantages of high precision and low data requirements. We further investigate SOH prediction using fewer cycles (1% of the cycles) as training data, and obtain reasonable prediction errors, indicating that GPR-implicit function learning has a strong ability to predict SOH.

In the future, several extensions can be explored to improve the prediction performance of the GPR-implicit function method, including (1) using more comprehensive feature descriptors such as voltage, current, and temperature, or their mathematical transformations; (2) using different explicit functions, for example, an exponential function as the mean function and the Matérn function as the covariance function; (3) optimizing the coefficients of the explicit and implicit functions; and (4) extending the method from simple to complex systems: since the attenuation process of supercapacitors is relatively simple, the proposed method is applied to supercapacitors first, and we will update and optimize the algorithm to adapt it to the attenuation processes of more complex electrical systems, including various batteries. We hope that this work can promote the study of SOH prediction for supercapacitors and other energy storage devices.