Deep belief rule based photovoltaic power forecasting method with interpretability

Han, Peng; He, Wei; Cao, You; Li, YingMei; Zhang, YunYi

doi:10.1038/s41598-022-18820-6

Download PDF

Article
Open access
Published: 24 August 2022

Deep belief rule based photovoltaic power forecasting method with interpretability

Peng Han¹,
Wei He^1,2,
You Cao²,
YingMei Li¹ &
…
YunYi Zhang¹

Scientific Reports volume 12, Article number: 14467 (2022) Cite this article

2243 Accesses
17 Citations
3 Altmetric
Metrics details

Subjects

Abstract

Accurate prediction of photovoltaic (PV) output power is of great significance for reasonable scheduling and development management of power grids. In PV power generation prediction system, there are two problems: the uncertainty of PV power generation and the inexplicability of the prediction result. The belief rule base (BRB) is a rule-based modeling method and can deal with uncertain information. Moreover, the modeling process of BRB has a certain degree of interpretability. However, rule explosion and the inexplicability of the optimized model limit the modeling ability of BRB in complex systems. Thus, a PV output power prediction model is proposed based on a deep belief rule base with interpretability (DBRB-I). In the DBRB-I model, the deep BRB structure is constructed to solve the rule explosion problem, and inefficient rules are simplified by a sensitivity analysis of the rules, which reduces the complexity of the model. Moreover, to ensure that the interpretability of the model is not destroyed, a new optimization method based on the projection covariance matrix adaptation evolution strategy (P-CMA-ES) algorithm is designed. Finally, a case study of the prediction of PV output power is conducted to illustrate the effectiveness of the proposed method.

The development of CC-TF-BiGRU model for enhancing accuracy in photovoltaic power forecasting

Article Open access 21 April 2025

Multi-label machine learning for power forecasting of a grid-connected photovoltaic solar plant over multiple time horizons

Article Open access 23 September 2025

Evaluating machine learning models comprehensively for predicting maximum power from photovoltaic systems

Article Open access 28 March 2025

Introduction

Photovoltaic (PV) power generation has developed rapidly due to its clean energy characteristics. However, PV power output is affected by external uncertain factors, such as solar irradiance, voltage, module temperature, and ambient temperature. Its power output has fluctuation, randomness and uncertainty¹. These characteristics will cause many problems for the safe operation and reasonable dispatch of the power grid². Therefore, an accurate PV power forecasting system is crucial for the grid dispatch center to make decisions such as storage requirements, scheduling arrangements, and balancing supply and demand in the power market³.

In current research, three main methods can be found in PV power prediction: physical models, statistical models and hybrid approaches⁴. The physical model is used to calculate the main design parameters of the PV power generation system. Chen et al. proposed a spatio-temporal (ST) PV power nowcasting method with predictor preselection; this method enables fast and accurate PV nowcasts in different scenarios⁵. Wen et al. proposed a novel multistep forecasting (MSF) scheme, which can make multiple predictions while maintaining the same high temporal resolution⁶. Moreover, Saint-Drenan et al. proposed an empirical method for the parameterization of PV power plants for power prediction; this model makes full use of the information commonly available in PV plants⁷. Almeida et al. used a mathematical model with multiple parameters to predict PV power generation, but prediction accuracy relies on multiple mathematical models capable of describing PV system parameters⁸. The accuracy of the PV power prediction physical model relies on accurate meteorological data and complete PV battery information. However, due to the incomplete parameters provided by manufacturers and the limited prediction accuracy of numerical weather prediction (NWP), the modeling accuracy based on physical models is limited^4,9.

The statistical model is established by learning the law of observational data. For exapmple, Miao et al. proposed a Markov chain model to evaluate solar power generation performance; this method fully investigates the effects of seasonal solar profile patterns and PV module types on solar power generation¹⁰. Massidda et al. used multilinear adaptive regression splines and NWP to predict PV power output, and the model has regression coefficients that are easy to interpret¹¹. Moreover, Rodriguez et al. proposed an artificial neural network (ANN) to predict the amount of solar energy generated by PV generators, which effectively solves the complexity of control in solar energy systems¹². Yagli et al. used 68 machine learning models to automate hourly sun forecasts, and the comparative analysis proves that the tree-based method performs well in terms of overall results¹³. However, statistical models based on data-driven methods are black-box models whose output results are not interpretable. The hybrid method refers to the combination of two different methods. Halabi et al. proposed a hybrid adaptive neuro-fuzzy inference system model for the efficient prediction of solar radiation, and the hybrid model can accurately predict solar radiation according to different meteorological parameters, such as sunshine hours and temperature¹⁴. Barman et al. proposed a season-specific method for short-term load forecasting based on hybrid firefly algorithm-support vector machine (FA-SVM); the method takes into account seasonal effects and has excellent predictive power in all cases¹⁵. Hybrid physics and data-driven modeling approaches provide high accuracy in forecasting PV power generation forecasting techniques⁴. Moreover, most of the research focuses on point prediction^1,5,9, the result of point prediction is a specific value at a certain prediction moment, and the prediction result obtained by point prediction is intuitive compared with probability prediction. Intuitive prediction results can help grid operators make faster decisions about grid dispatch.

In PV power prediction system, two problems need to be solved to construct an accurate and reliable prediction system. First, PV power generation is affected by many meteorological factors, such as solar irradiance, ambient temperature, and module temperature^16,17. Due to the inherent uncertainty of meteorology, PV power generation is uncertain, fluctuating and intermittent². These stochastic behaviors create many problems for the power dispatching and security of the power grid³. Thus, the constructed PV power prediction system needs to have the ability to deal with uncertain information. Second, PV power generation is the main energy source of the power grid. The main requirements for grid management are reliability and efficiency, and grid operators need fast and accurate knowledge of the power supply¹⁸. Therefore, PV power output needs to be forecasted in an interpretable way, which can help grid operators make reasonable decisions and make operators trust the model¹⁹. BRB can effectively solve the above problems, and BRB has nonlinear modelling ability, which can effectively deal with the coexistence of uncertain information. BRB based on the IF–THEN rule modelling method can describe the modelling process and decision-making process in linguistic terms²⁰. Moreover, BRB has a transparent reasoning process based on the evidence reasoning (ER) algorithm, and grid operators can directly touch and access the model²¹.

However, there are three problems in the PV power output prediction system based on BRB. First, PV power generation is affected by many attributes, but the number of BRB rules is generated as the Cartesian product of the attribute reference values, and the number of BRB rules increases exponentially, which causes the problem of rule explosion. Moreover, in a PV power generation system based on BRB, the input attributes vary with seasons and geographic locations. Thus, expert knowledge needs to be reluctantly redesigned as new attribute information is added to the model²². Second, the initial BRB model constructed by experts is subjective, so the BRB model needs to be optimized through observational data. However, the interpretability of the BRB model will be destroyed due to the randomness of the algorithm²³. Third, there are many inefficient rules in the BRB's rule base, which reduces the readability and interpretability of the BRB model. Thus, it is important to implement rule reduction on the rule base in a reliable way²³. Hence, a new prediction model deep belief rule base with interpretability (DBRB-I) is proposed to solve the above problems. The DBRB-I model is composed of multiple Sub-BRBs in a deep structure, which effectively solves the rule explosion problem and weak extensibility. The DBRB-I model uses a new optimization method based on the projection covariance matrix adaptation evolution strategy (P-CMA-ES) algorithm to ensure the interpretability of the model after optimization. Moreover, the DBRB-I model implements rule reduction in a reliable manner by performing a sensitivity analysis on the initial BRB model.

The main contributions of this paper include the following: (1) The DBRB-I model is proposed to predict photovoltaic power output in an interpretable manner. (2) A new optimization algorithm with interpretability is designed. (3) A sensitivity analysis method is proposed to implement rule reduction.

The remainder of this paper is organized as follows. In “Problem description” section, the problems of PV power generation systems are formulated, and a new prediction model based on DBRB-I is proposed. The interpretability of the DBRB-I model is introduced in “The DBRB-I model interpretability” section, including interpretability criteria and interpretability constraints. In “Sensitivity analysis” section, sensitivity analysis methods are introduced. The reasoning process and optimization process of the DBRB-I model are given in “Reasoning process and optimization process of the DBRB-I model” section. Then, a case study is conducted to verify the effectiveness of the proposed model in “Case study” section. This paper is concluded in “Conclusion” section.

Problem description

Before starting the work in this paper, a hypothesis needs to be formulated. The initial modeling assumption for the interpretability model is that expert knowledge is reliable. Expert knowledge is obtained through the analysis of the mechanism operation of PV power generation systems and the accumulation of long-term knowledge. Although expert knowledge has certain limitations in accuracy, the direction of expert knowledge must be correct. Therefore, the initial model of this paper is constructed with reliable and reasonable expert knowledge.

In “Problems with prediction systems” section, the problem of predicting PV output power is formulated. Then, in “Construction of the DBRB-I model” section, a prediction model of PV output power based on DBRB-I was constructed.

Problems with prediction systems

To build a PV output power prediction model based on DBRB-I, the following three problems need to be solved.

Problem 1

Build a new extensible BRB model. Expert knowledge is crucial in the BRB expert system, which can effectively ensure the interpretability and accuracy of the model. However, due to the complexity of PV power generation systems, the number of rules increases exponentially with the number of attribute referential values. This has led to the problem of rule explosion, which reduces the interpretability and readability of the model. Meanwhile, input attributes vary with season and geographic location in PV power generation system, and weak extendability can cause expert knowledge to be reluctant to be redesigned when new attribute information is used. Therefore, the first question is how to build a new BRB model, which has good extendability and can avoid the combination explosion problem.

$$y = f(x,C,\Omega ,Ek)$$

(1)

where $x$ is the input data of the PV power generation system,$C$ is the set of interpretability constraints,$\Omega$ is the model parameter,$Ek$ is the expert knowledge introduced into the model, and $y$ is the predicted result of PV power generation systems.

Problem 2

Interpretability constraints are designed. BRB is a rule-based modelling method that can easily understand the modelling process of the model. However, the interpretability of the model optimization process will be destroyed due to the randomness of the optimization algorithm. Thus, the second problem is how to design effective interpretability constraints to ensure that the interpretability of the model is not destroyed.

$$Interpretability:\left\{ {C|C_{1} ,C_{2} ,...C_{m} } \right\}$$

(2)

$$\Omega = optimize(x,y,P,C)$$

(3)

where $P$ is the set of parameters in the optimization process.

Problem 3

Improve the simplicity of the rules base. The rule base in BRB is generated by the attribute reference value in the form of Descartes. Some redundant rules and inefficient rules exist in the rules base of BRB, which will reduce the accuracy and readability of the model. Therefore, the third question is how to reasonably and transparently remove redundant rules and inefficient rules in the BRB rule base.

$$g = \left| {MSE_{SA} - MSE_{{({\text{initial}})}} } \right|$$

(4)

where $g$ is the error fluctuation of the model,$MSE_{SA}$ is the accuracy of the model after sensitivity analysis, and $MSE_{{({\text{initial}})}}$ is the accuracy of the model built with expert knowledge.

Construction of the DBRB-I model

BRB is composed of a series of belief rules, and the kth rule can be described as follows:

$$\begin{gathered} IF \, x_{1} {\text{ is A}}_{1}^{k} \wedge x_{2} {\text{ is A}}_{2}^{k} \wedge \cdots \wedge x_{{T_{k} }} {\text{ is A}}_{{T_{k} }}^{k} , \, \hfill \\ {\text{THEN }}y \, is\{(D_{1} {,}\beta_{1,k} {),(}D_{2} {,}\beta_{2,k} {),}...{,(}D_{N} {,}\beta_{N,k} )\} ,(\sum\limits_{n = 1}^{N} {\beta_{n,k} \le 1} {),} \hfill \\ {\text{with rule weight }}\theta_{k} ,k \in \{ 1,2,...,L\} . \hfill \\ and{\text{ attribute weight }}\delta_{1} ,\delta_{2} ,...,\delta_{i} ,i \in \{ 1,2,...,T_{i} \} \hfill \\ in{\text{ C}}_{1} ,{\text{C}}_{2} ,...,{\text{C}}_{n} \hfill \\ \end{gathered}$$

(5)

where $x_{1} ,x_{2} ,...,x_{{T_{k} }}$ is the antecedent attribute of the PV power prediction system.$A_{1}^{k} ,A_{2}^{k} ,...,A_{{T_{k} }}^{k}$ is a series of reference values for the antecedent attribute $x_{1} ,x_{2} ,...,x_{{T_{k} }}$.$T_{k}$ is the number of attributes in the kth rule.$D_{1} ,D_{2} ,...,D_{N}$ are the consequences, and $\beta_{1,k} ,\beta_{2,k} ,...,\beta_{N,k}$ are their corresponding belief degrees.$\theta_{k}$ is the weight of the kth belief rule.$\delta_{1} ,\delta_{2} ,...,\delta_{i}$ is the weight of the ith attribute.$L$ is the number of rules.${\text{C}}_{1} ,{\text{C}}_{2} ,...,{\text{C}}_{n}$ is the interpretability constraint of the model. The DBRB-I model is composed of multiple Sub-BRBs in a deep structure. The modelling process of the DBRB-I model based on the PV power generation system is shown in Fig. 1. First, the attributes of PV power generation systems are trended analysed and sorted by correlation to changes in results. Second, the rules of the initial BRB constructed from expert knowledge are simplified through sensitivity analysis. Finally, each Sub-BRBn of DBRB-I is optimized by an optimization algorithm with interpretability.

The DBRB-I model interpretability

To maintain the interpretability of the DBRB-I model, it is crucial to construct interpretability constraints for PV power generation systems. Cao et al. constructed a general BRB interpretability criterion²³. Thus, the DBRB-I model should conform to the general interpretability BRB criterion. Moreover, to make the DBRB-I model more interpretable, this paper focuses on the interpretability criteria7, 8. The interpretability of the DBRB-I model is shown in Fig. 2.

Criterion 7::: The simplicity of the rule base.

The modelling method based on IF–THEN rules can make the model structure clear and easy to understand. However, the number of rules grows exponentially in complex systems of PV power generation, which leads to the explosion of rules in the BRB system. Too many rules can reduce the readability of the model and reduce model interpretability.

Thus, to improve the interpretability and readability of the DBRB-I model, the rules in the system should be reasonably reduced. A reasonable number of rules can improve the accuracy of the BRB model²⁴. The initial BRB constructed by expert knowledge can be effectively analysed to determine which rules are redundant and which are efficient rules in systems. Therefore, an effective way to reduce rules is to perform sensitivity analysis on the initial BRB model.

Criterion 8::: The optimized belief rule satisfies the PV power generation system.

Belief rules can provide a clear semantic description between the input and output of the PV power generation system, which is the main manifestation of the interpretability of the DBRB-I model²⁵. Expert knowledge can be introduced into the model as a parameter by belief rules. Thus, the prediction results of the model will be convincing by the expert. However, to obtain better accuracy, the model is optimized in a large search domain. Inevitably, many incorrect rules that are contrary to the actual system will be generated. For example, in the PV power generation predicting system, the belief distribution of the output results are {(Excellent, 0.4353), (Good, 0,0944), (Middle, 0.1098), (Low, 0.3605)}, which means the “Excellent” support of the system's power generation is 0.4353 and the “Low” support is 0.3605. This belief distribution is unrealistic, and a reasonable belief distribution should not give high confidence to conflicting results, as shown in Fig. 3.

Thus, in a PV power generation system, to ensure the reasonableness of the prediction results, the interpretability constraints are as follows:

$$\begin{gathered} \beta_{k} \sim U_{k} { (}k = 1,2,...,L{)} \hfill \\ U_{k} \in \{ \{ \beta_{1} \le \beta_{2} \le \cdots \le \beta_{n} \} \hfill \\ \quad or\{ \beta_{1} \ge \beta_{2} \ge \cdots \ge \beta_{n} \} \hfill \\ \quad or\{ \beta_{1} \le \cdots \le max(\beta_{1} ,\beta_{2} ,...\beta_{n} ) \ge \cdots \ge \beta_{n} \} \} \hfill \\ \end{gathered}$$

(6)

where $U_{k}$ is the interpretability constraint in the kth rule. For different systems, the interpretability constraints will be different, but it should be noted that the operating mechanism and common sense of the actual system need to be satisfied²⁵. Moreover, a reasonable belief distribution shape should be monotonic or convex.

BRB is a rule-based modelling method, and the relationship between the input and output of the model is traceable. Thus, the interpretability of the model structure is the intrinsic feature of BRB. Due to limited expert knowledge, the initial BRB model built by experts does not meet the needs of the actual system, and the model needs to be optimized by observation data. However, optimization algorithms have randomness, which destroys the interpretability of BRB models. Therefore, the following constraints are designed to preserve the interpretability of the BRB model, and the feasible region for DBRB-I model optimization is shown in Fig. 4.

Constraint 1::: Effective use of expert knowledge.

Expert knowledge is obtained through the analysis of the actual system and the experience of long-term accumulation. It is one of the important sources of model interpretability²⁶. The optimization process of the interpretable BRB model is a local search process based on the initial judgement of experts²². Thus, expert knowledge is converted into parameters and brought into the initial population of the optimization algorithm, which can provide guidance for the optimization process and effectively extract useful information from the search space²⁷.

$$mean^{(g)} = \left\{ {\begin{array}{*{20}c} {Ek,\,\,\,\,\,\,\,\,\,if\,\,\,\,\,\,{\text{ g = 1}}} \\ {mean^{(g)} ,\,\,if\,\,\,\,\,{\text{g}} \ne {1}} \\ \end{array} } \right.$$

(7)

Constraint 2::: The optimized parameters meet the judgement of experts.

Compared with the black box model, such as the backpropagation neural network (BPNN) and support vector machine (SVM), the parameters of the BRB model are of practical significance, which makes users more trust this model. However, the physical significance of the optimized BRB model parameters may be lost. For example, the initial rule weight is given 0.9 by experts, but the optimized rule weight is 0.005. Experts believe that this rule is crucial, but the results of the model after optimization are inconsistent with the judgment of the expert. This will lead to the reduction of experts' trust in the model. Thus, the key parameters of the BRB model need to be guaranteed to be used effectively. To solve this problem, the parameters of the BRB model are constrained as follows:

$$\begin{gathered} P_{lp} \le P \le P_{up} : \{ \theta_{{lp_{k} }} \le \theta_{k} \le \theta_{{up_{k} }} \quad k \in \{ 1,2,...,L\} . \hfill \\ \, \delta_{{lp_{i} }} \le \delta_{i} \le \delta_{{up_{i} }} \quad n \in \{ 1,...,N\} . \hfill \\ \, \beta_{{lp_{{_{k,n} }} }} \le \beta_{{_{{_{k,n} }} }} \le \beta_{{up_{{_{k,n} }} }} \quad i,m \in \{ 1,2,...,T_{k} \} .\} \hfill \\ \end{gathered}$$

(8)

where $P_{lp}$ is the minimum value of the parameter and $P_{up}$ is the maximum value of the parameter. Effective constraints for the parameters of the BRB model can avoid modelling accuracy reduction due to overoptimization.

Constraint 3::: Local optimization process based on expert knowledge.

The interpretability of the optimization process reflects the optimization in a local search domain judged by experts for interpretability BRBs²². Thus, to enhance the interpretability of the model, an interpretability constraint is designed.

Interpretability constraints on individuals of the initial population by introducing Euclidean distance. The Euclidean distance reflects the straight-line distance between two points in space. Constrain the distance between the individual and the expert knowledge of the algorithm, which further realizes the optimization of the local search domain based on the initial judgment of the expert and enhances the interpretability of the optimization.

$$\begin{aligned} \rho (x_{n} ,x_{n}^{^{\prime}} ) & = \sqrt {(x_{1} - x_{1}^{^{\prime}} )^{2} + (x_{2} - x_{2}^{^{\prime}} )^{2} + \cdots + (x_{n} - x_{n}^{^{\prime}} )^{2} } \hfill \\ & \quad \rho (x_{n} ,x_{n}^{^{\prime}} ) \le d \hfill \\ \end{aligned}$$

(9)

$\rho (x_{n} ,x_{n}^{^{\prime}} )$ is the Euclidean distance between the individuals of the initial population and expert knowledge. $d$ is the parameter of the distance, and the value is determined by the expert.

BRB is a rule-based modeling method that conforms to the method of human knowledge expression. Through expert knowledge and observational data, the reasoning process and modelling process of the BRB model can be easily understood. Moreover, BRB has a transparent reasoning process based on the ER algorithm, and decision makers and users can directly touch and access the model. Thus, interpretability is an intrinsic feature of BRB models. However, any algorithm has randomness, and this randomness destroys the interpretability of the BRB model. Therefore, it is necessary to design the above interpretability constraints to protect the interpretability of the BRB model²³.

Sensitivity analysis

An appropriate BRB model structure and parameters are crucial to improve the accuracy and interpretability of the predicted PV output power. In the literature²⁴, Zhang et al. demonstrated that a suitable number of rules can improve the accuracy of the BRB model. Moreover, the simplicity of the BRB rule base can improve the readability of the model²³. Therefore, there are many methods of rule reduction^28,29. However, reasonable and transparent reduction rules are important for interpretable BRB models. Sensitivity analysis is a great way to simplify the rule base in an interpretable way.

Sensitivity analysis (SA) refers to analysing the direct influence of model input parameters and model results³⁰. SA can identify key parameters of the model, which can help users improve the prediction accuracy of the model. There are two main methods of sensitivity analysis: global sensitivity analysis (GSA) and local sensitivity analysis (LSA). GSA refers to the study of the effect of two or more parameters changing together on the model parameters. However, GAS is computationally expensive³⁰. LSA refers to exploring changes in the response of a model by changing one parameter of the model while keeping the other parameters constant. The advantages of local sensitivity analysis are simplicity and ease of understanding.

Thus, this paper uses LSA to simplify the structure of the BRB model. The input parameter of the LSA is the rule weight, and the sensitivity of the rule is reflected by the fluctuation of the model error g; that is, the greater the mean square error (MSE), the greater the sensitivity of the rule. Through LSA, users can learn which rules are important and which rules are inefficient. Finally, the inefficient rules are simplified, which simplifies the BRB model and improves the model readability.

$$g = \left| {MSE_{SA} - MSE_{{({\text{initial}})}} } \right|.$$

(10)

Reasoning process and optimization process of the DBRB-I model

In “The reasoning process of the DBRB-I model” section, the reasoning process of the DBRB-I model is introduced. Then, in “Optimization process of the DBRB-I model” section, the optimization process of the DBRB-I model is introduced.

The reasoning process of the DBRB-I model

The reasoning process of BRB is based on the evidential reasoning (ER) algorithm. The ER algorithm with transparency and reliability is described as follows:

Step 1 Different forms of input information are transformed into belief distributions.
$$S(x_{i} ) = \{ (A_{i,j} ,\alpha_{i,j} ),i = 1,...,M;j = 1,...,J_{i} \}$$
(11)
$$a_{i,j} = \left\{ \begin{gathered} \frac{{A_{i,j + 1} - x_{i} }}{{A_{i,j + 1} - A_{i,j} }}, \, j = k,{\text{ if }}A_{i,j} \le x_{i} \le A_{i,j + 1} \hfill \\ \begin{array}{*{20}l} {\frac{{x_{i} - A_{i,j} }}{{A_{i,j + 1} - A_{i,j} }}, \, j = k + 1 \, } \\ {0, \, j = 1,...,J_{i} ,j \ne k,k + 1} \\ \end{array} \hfill \\ \end{gathered} \right.$$
(12)
where $a_{i,j}$ is the matching degree between the input information and the reference value $A_{i,j}$.
Step 2 The activation weight of the kth belief rule is calculated.
$$w_{k} = \frac{{\theta_{k} \prod\limits_{i = 1}^{{M_{k} }} {\left( {a_{i,j}^{k} } \right)} \overline{{^{{\delta_{i} }} }} }}{{\left( {\sum\limits_{l = 1}^{K} {\theta_{l} } \prod\limits_{i = 1}^{{M_{l} }} {(a_{i,j}^{l} )} \overline{{^{{\delta_{i} }} }} } \right)}},\,\,\,\overline{{\delta_{i} }} = \frac{{\delta_{i} }}{{\mathop {max}\limits_{{i = 1,...,M_{k} }} \{ \delta_{i} \} }}$$
(13)
Step 3 The belief degree of the inference output is generated by the analytical ER algorithm.
$$\beta_{n} = \frac{{\mu \times \left[ {\prod\limits_{i = 1}^{L} {\left( {\omega_{l} \beta_{n,l} + 1 - \omega_{l} \sum\limits_{i = 1}^{N} {\beta_{i,l} } } \right)} - \prod\limits_{l = 1}^{L} {\left( {1 - \omega_{l} \sum\limits_{i = 1}^{N} {\beta_{i,l} } } \right)} } \right]}}{{1 - \mu \times \left[ {\prod\limits_{l = 1}^{L} {\left( {1 - \omega_{l} } \right)} } \right]}}$$
(14)
$$\mu = \frac{1}{{\sum\limits_{n = 1}^{N} {\prod\limits_{l = 1}^{L} {\left( {\omega_{l} \beta_{n,l} + 1 - \omega_{l} \sum\limits_{i = 1}^{N} {\beta_{i,l} } } \right) - (N - 1)\prod\limits_{l = 1}^{L} {\left( {1 - \omega_{l} \sum\limits_{i = 1}^{N} {\beta_{i,l} } } \right)} } } }}$$
(15)
Step 4 The final belief distribution of the inference output is expressed as follows:
$$S(A^{^{\prime}} ) = \{ (D_{n} ,\beta_{n} );n = 1,...,N\}$$
(16)
$$u(S(A^{^{\prime}} )) = \sum\limits_{j = 1}^{N} {u(D_{j} )\beta_{j} }$$
(17)
where $A^{^{\prime}}$ is the input vector of the actual system,$u(D_{j} )$ is the utility of the $D_{j}$.$u({\text{S}}(A^{\prime}))$ is the final expected utility.

Optimization process of the DBRB-I model

In current research, the projection covariance matrix adaptation evolution strategy (P-CMA-ES) optimization algorithm is one of the effective algorithms and has also been applied to the research of different BRBs²³. The P-CMA-ES optimization algorithm has the following advantages: (1) It has good optimization performance. (2) The algorithm has a fast convergence speed and strong robustness. (3) It has the advantages of rotation invariance and spread rotation invariance. Thus, the P-CMA-ES algorithm is used to optimize the DBRB-I model in this paper.

However, the P-CMA-ES algorithm generates new solutions that will destroy DBRB-I model interpretability. Therefore, to maintain the interpretability of the DBRB-I model, interpretability constraints are added to the original P-CMA-ES algorithm. The newly modified P-CMA-ES optimization algorithm is shown in Fig. 5. The pseudocode of the optimization method is given in Algorithm 1, and the specific process is as follows:

Step 1 (Construct the objective function) To improve the prediction accuracy of the DBRB-I model, the parameters of the model are optimized through the training data. Therefore, the objective optimization function of the DBRB-I model is described as follows:
$$\begin{gathered} \min MSE(\Omega ) \hfill \\ s.t. \, \sum\limits_{n = 1}^{N} {\beta_{{_{{_{k,n} }} }} } = 1 \hfill \\ \, \theta_{{lp_{k} }} \le \theta_{k} \le \theta_{{up_{k} }} \quad k \in \{ 1,2,...,L\} . \hfill \\ \, \delta_{{lp_{i} }} \le \delta_{i} \le \delta_{{up_{i} }} \quad {\text{ n}} \in \{ 1,...,N\} . \hfill \\ \, \beta_{{lp_{{_{k,n} }} }} \le \beta_{{_{{_{k,n} }} }} \le \beta_{{up_{{_{k,n} }} }} \quad i,m \in \{ 1,2,...,T_{k} \} . \hfill \\ \end{gathered}$$
(18)
where $MSE( \cdot )$ is the degree of difference between the predicted value of the PV system and the true value, which can be described as follows:
$$MSE = \frac{1}{n}\sum\limits_{i = 1}^{n} {\left( {y_{i} - \hat{y}_{i} } \right)^{2} }$$
(19)
where $n$ is the number of training data.$y_{i}$ is the true value of the PV power generation system.

$\hat{y}_{i}$ is the predicted value of the DBRB-I model.

Step 2 (Initial operation) Initial population size $\lambda$, the initial mean value $mean^{0} = \Omega^{0} (\theta,\delta ,\beta )$, the offspring population size $\tau$, the initial step size $\varepsilon^{0}$ and the initial covariance matrix $C^{0}$.
Step 3 (Sampling operation) Through interpretability constraints 1, 2, and 3, generate the initial population by
$${\text{Constraints1}}:\,\,\Omega_{k}^{g + 1} = mean^{(g)} + \varepsilon^{(g)} N(0,C^{(g)} ),k = 1,2,...,\lambda$$
(20)
$$mean^{(g)} = \left\{ {\begin{array}{*{20}l} {EK, \, if{\text{ g = 1}}} \\ {mean^{(g)},if{\text{ g}} \ne {1}} \\ \end{array} } \right.$$
(21)

Interpretability constraint 1 incorporates expert knowledge into the initial population of the model, and expert knowledge can play a guiding role in the model optimization process, which improves the model optimization process. Moreover, interpretability constraint 1 enables the starting point of the optimization to be close to the optimal solution of the model.

$${\text{Constraints2}}:\begin{array}{*{20}c} {\Omega_{k}^{(g + 1)} \Leftarrow P = mean^{(g)} + \varepsilon^{(g)} N\left( {0,C^{(g)} } \right)} & {} \\ \begin{gathered} P_{lp} \le P \le P_{up} : \{ \theta_{{lp_{k} }} \le \theta_{k} \le \theta_{{up_{k} }} \hfill \\ \delta_{{lp_{i} }} \le \delta_{i} \le \delta_{{up_{i} }} \hfill \\ \beta_{{lp_{{_{k,n} }} }} \le \beta_{{_{{_{k,n} }} }} \le \beta_{{up_{{_{k,n} }} }} \hfill \\ \end{gathered} & \begin{gathered} \, k \in \{ 1,2,...,L\} . \hfill \\ n \in \{ 1,...,N\} . \hfill \\ i,m \in \{ 1,2,...,T_{k} \} .\} \hfill \\ \end{gathered} \\ \end{array}$$

(22)

Interpretability constraint 2 guarantees that the parameters do not lose their physical meaning during optimization, which maintains the interpretability of the model.

$${\text{Constraints}}\,{3}:\,\,\begin{array}{*{20}c} {\rho (\Omega,EK) = \sqrt {(x_{1} - x_{1}^{^{\prime}} )^{2} + (x_{2} - x_{2}^{^{\prime}} )^{2} + \cdots + (x_{n} - x_{n}^{^{\prime}} )^{2} } } \\ {\rho (x_{n},x_{n}^{^{\prime}} ) \le d} \\ \end{array}$$

(23)

Interpretability constraint 3 guarantees that the optimization process is a local optimization based on expert knowledge for the interpretability model, which further enables the optimized parameters to have good similarity to the expert knowledge.

Step 4 (Constraint operation) Through interpretability criterion 8, the rules that do not meet the actual system are adjusted.
$$\begin{gathered} \Omega_{k}^{(g + 1)} \Leftarrow \beta_{k}^{(g + 1)} = mean^{(g)} + \varepsilon^{(g)} N(0,C^{(g)} ) \hfill \\ \beta_{k}^{(g + 1)} \sim C_{8},k = 1,2,...,\lambda \hfill \\ \end{gathered}$$
(24)
where $\Omega_{k}^{(g + 1)}$ is the kth solution in the (g + 1)th generation, which may not satisfy the belief distribution of the actual system.$\beta_{k}^{(g + 1)}$ is the newly generated belief distribution satisfying the interpretability criterion 8.$\Leftarrow$ is the replacement operation.
Step 5 (Projection operation) To satisfy the equality constraint, the projection operation maps candidates back into the feasible region hyperplane.
$$A_{e} \Omega_{k}^{(g + 1)} (1 + n_{e} \times (j - 1):n_{e} \times j) = 1,j = 1,2,...,N + 1$$
(25)

The projection operation is implemented as follows:

$$\begin{gathered} \Omega_{k}^{(g + 1)} (1 + n_{e} \times (j - 1):n_{e} \times j) = \Omega_{k}^{(g + 1)} (1 + n_{e} \times (j - 1):n_{e} \times j) \hfill \\ - A_{e}^{T} \times (A_{e} \times A_{e}^{T} )^{ - 1} \times \Omega_{k}^{(g + 1)} (1 + n_{e} \times (j - 1):n_{e} \times j) \times A_{e} \hfill \\ \end{gathered}$$

(26)

Step 6 (Selection operation) Calculate the MSE value of population individuals and sort them. The process is described as follows:
$$MSE(\Omega_{ \, 1:\lambda }^{(g + 1)} ) \le MSE(\Omega_{ \, 2:\lambda }^{(g + 1)} ) \le \cdots \le MSE(\Omega_{ \, i:\lambda }^{(g + 1)} ) \le \cdots \le MSE(\Omega_{ \, \lambda :\lambda }^{(g + 1)} )$$
(27)

The optimal subgroup is updated as follows:

$$mean^{(g + 1)} = \sum\nolimits_{i = 1}^{\mu } {\omega_{i} \Omega_{ \, i:\lambda }^{(g + 1)} }$$

(28)

Step 7 (Adapting operation) Update the search covariance matrix, the evolution path of the covariance matrix, the search step size and the evolution path through the most subgroup strategy.
$$\begin{aligned} C^{(g + 1)} & = (1 - c_{1} - c_{2} ) \cdot C^{(g)} + c_{1} p_{c}^{(g + 1)} \left( {p_{c}^{(g + 1)} } \right)^{T} \\ & \,\,\,\, + c_{2} \sum\limits_{i = 1}^{\tau } {\omega_{i} } \left( {\frac{{\Omega_{ \, i:\lambda }^{(g + 1)} - mean^{(g)} }}{{\varepsilon^{(g)} }}} \right)\left( {\frac{{\Omega_{ \, i:\lambda }^{(g + 1)} - mean^{(g)} }}{{\varepsilon^{(g)} }}} \right)^{T} \\ \end{aligned}$$
(29)
$$\begin{aligned} p_{c}^{(g + 1)} & = (1 - c_{c} ) \cdot p_{c}^{(g)} + \sqrt {c_{c} (2 - c_{c} )} \cdot \left( {\sum\nolimits_{i = 1}^{\tau } {w_{i}^{2} } } \right)^{ - 0.5} \\ \quad\,\,\,\,\, \cdot \left( {{\text{mean}}^{(g + 1)} - {\text{mean}}^{(g)} } \right)/\varepsilon^{(g)} \\ \end{aligned}$$
(30)
$$\varepsilon^{g + 1} = \varepsilon^{g} \exp \left( {\frac{{c_{\sigma } }}{{d_{\sigma } }}\left( {\frac{{\left\| {p_{ \, \sigma }^{g + 1} } \right\|}}{{E\left\| {N(0,1)} \right\|}} - 1} \right)} \right)$$
(31)
$$\begin{gathered} p_{\sigma }^{(g + 1)} = (1 - c_{c} ) \cdot p_{ \, \sigma }^{(g)} + \sqrt {c_{c} (2 - c_{c} )} \cdot \left( {\sum\nolimits_{i = 1}^{\tau } {w_{i}^{2} } } \right)^{ - 0.5} \hfill \\ \quad\, \cdot \left( {C^{(g)} } \right)^{ - 0.5} \cdot \left( {mean^{(g + 1)} - mean^{(g)} } \right)/\varepsilon^{(g)} \hfill \\ \end{gathered}$$
(32)

Case study

In “Description of the dataset” section, the dataset is described. The initial DBRB-I model is constructed in “Construct the initial BRB by expert knowledge” section. Then, in “Sensitivity analysis of the initial DBRB-I model” section, a sensitivity analysis of the rules of the DBRB-I model is presented. The optimized DBRB-I model is described in “The optimized DBRB-I model” section. In “Discussion of the interpretability of the DBRB-I model” section, the interpretability of the DBRB-I model is discussed.

The increase in the proportion of PV power generation will increase the difficulty of power grid scheduling. When the proportion exceeds 15%, it may cause paralysis of the grid system³¹. An interpretable model can provide grid operators with some reference for grid scheduling and management. Thus, it is very important to predict PV power in a reliable, safe and interpretable way³¹.

Description of the dataset

The PV power dataset is obtained from AI Studio. The dataset is the operation data of PV modules in China in 2018. The data collection interval was 10 min, and the data were desensitized. This paper selects a week of data from January 1, 2018, to January 7, 2018, for the experiment, as shown in Fig. 6. A total of 280 data samples are used for training, and 140 data samples are used for testing.

Construct the initial BRB by expert knowledge

PV power generation systems use solar energy to generate electricity, and their output power is strongly affected by solar irradiance. At the same time, voltage, ambient temperature and module temperature are important factors that affect PV output efficiency³². Then, the dataset is normalized, and the relationship between each attribute and output power is trend analysed, as shown in Fig. 7. The irradiance has the largest relationship with the output power, while the ambient temperature has the smallest relationship with the output power, and the most obvious part is marked with a black curve. Thus, the order of importance to the output power attributes of the PV power generation system is as follows: irradiance, voltage, module temperature, and ambient temperature. The initial DBRB-I model is shown in Fig. 8.

In practical engineering, the selection of the reference value requires the judgment of expert knowledge to select the interval range with typical significance and then combine the data statistics to give accurate results³³. Thus, normalized data usually use four semantic values to describe the attributes and the state of the system, that is, “Excellent”, “Good”, “Middle”, and “Low”. The reference values are given in Tables 1 and 2. Moreover, the beliefs of the initial models Sub-BRB1, Sub-BRB2 and Sub-BRB3 are shown in “Appendix A”.

Table 1 The attribute reference value.

Full size table

Table 2 The output power of the PV power generation system.

Full size table

Sensitivity analysis of the initial DBRB-I model

To meet the needs of the PV power generation system, the model error g = 0.001. A sensitivity analysis of the rules for the initially constructed Sub-BRB1, Sub-BRB2 and Sub-BRB3 is shown in Figs. 9, 10, and 11, respectively. Rules 3, 4, 5, 7, 8, 9, 12, 13, 14, and 16 in Sub-BRB1 have little effect on the system. These rules can be considered inefficient rules of the system. However, rules 1, 2, 6, 10, 11, and 15 have a large impact on the system and satisfy the model error g.

The purpose of sensitivity analysis is to remove inefficient rules, which will reduce the complexity of the model and increase the readability of the model by describing the PV system with fewer rules. Moreover, the difficulty of optimization is reduced, and the effect of optimization is improved. Table 3 shows the effective rules of each sub-BRB of DBRB-I.

Table 3 The number of DBRB-I model rules.

Full size table

To further prove the redundancy of inefficient rules, the rule activation weights of DBRB-I are analyzed. Figures 12, 13, and 14 show the activation weights of each rule for Sub-BRB1, Sub-BRB2, and Sub-BRB3, respectively. As seen from Fig. 11 (the blue curve represents that the activation weight of the rule is too small, the green curve represents that the rule is not activated, the black curve represents that the rule is very important, and the purple curve represents that the rule has little effect on the model), the activation weight of rules 3, 12, and 14 is too small, it is difficult for experts to judge whether rule 3 has an effect on the system, which reduces the user's understanding of the model, and rule 3 will reduce the interpretability of the model. Thus, rule 3 is classified as an inefficient rule. Moreover, rules 4, 8, 9, and 13 have no effect on the model, which shows that these rules are redundant. Reducing these rules does not affect the accuracy of the model but improves the interpretability of the model. The number of times rule 6 is activated in the system shows that rule 6 is very important to the system. Rule 6 is activated many times in the system, which shows that rule 6 has a great influence on the system. Finally, although the activation weight of rule 16 is relatively large, the number of activations is too small, and it has little effect on the model, so rule 16 is also classified as an inefficient rule.

To prove the reliability and rationality of expert knowledge and whether the DBRB-I model can correctly reduce inefficient rules, the sensitivity analysis of the Sub-BRB1, Sub-BRB2 and Sub-BRB3 rules after DBRB-I model training is shown in Figs. 15, 16 and 17, respectively. Table 4 gives the effective rules for the trained DBRB-I model to satisfy the threshold g. Through the comparison of Tables 3 and 4, the inefficient rules of the DBRB-I model after training are consistent with the judgment of expert knowledge. Reasonable rule reduction for the initial DBRB-I can reduce the complexity of the model and improve the optimization process of the model. In this paper, the initial DBRB-I is constructed from expert knowledge, and the expert knowledge is reliable. Only rule 12 in Sub-BRB2 is inconsistent with the judgment of the initial DBRB-I model. However, in Fig. 10, rule 12 has a certain effect on the initial model but does not meet the threshold g. Moreover, as shown in Fig. 13, the partial activation weight of rule 12 is too small, and experts cannot judge whether this rule works, which will reduce the interpretability of the model. Therefore, rule 12 is considered an inefficient rule in the initial model.

Table 4 The number of trained DBRB-I model rules.

Full size table

The optimized DBRB-I model

In this paper, for interpretability criterion 8 and interpretability constraint 1,2, as shown in “Appendix A”, the d of interpretability constraint 3 is 1.4, 1.4, 1.9, respectively. The DBRB-I model with 20 rules is named DBRB-I(20), the DBRB-I model with 41 rules is named DBRB-I(41), and the DBRB-I model without interpretability constraints is named DBRB. The optimized parameters of the DBRB-I(20) model are shown in “Appendix B”.

In “Transparency and traceability of the DBRB-I model” section, transparency and traceability of the DBRB-I model are demonstrated. The DBRB-I model can meet the judgment of experts in “The DBRB-I model conforms to expert judgment” section. Then, in “Rule reduction under different conditions” section, the rule reductions under different threshold g are given. The interpretability constraints of the optimized DBRB-I model are analyzed in “Analysis of interpretability constraints of the DBRB I model” section. In “Analysis of convergence of the DBRB I model” section, the convergence of the DBRB-I model is analyzed. The accuracy of the DBRB-I model is analyzed in ’Analysis of the accuracy of different DBRB I models“ section. Then, in “Robustness Analysis of the DBRB I model” section, the robustness of the DBRB-I model is analyzed. Skill scores are calculated in “Analysis of DBRB-I model skill scores” section.

Transparency and traceability of the DBRB-I model

The transparency and traceability of the DBRB-I(20) model are shown in Fig. 18. Taking the 38th and 64th test data as an example, the true values of the 38th and 64th test data are 0.3479 and 0.3653, respectively. In the DBRB-I(20) model, the belief level of 0.4 is continuously improved, while the other belief levels continuously lower, which is practical and understandable by the user. This shows that the DBRB-I(20) model can continuously improve the accuracy of the model. Moreover, every test data point is a clear, transparent and traceable process.

The DBRB-I model conforms to expert judgment

The belief distribution of each rule of Sub-BRB3 is shown in Fig. 19. In rules 2, 3, 5, 6, 7, 8, and 9, the optimized belief distribution is close to the expert knowledge, which shows that Sub-BRB3 improves accuracy while maintaining interpretability. In rules 1 and 4, the overall belief distribution of each rule is consistent with the actual system and can be trusted by users. Moreover, it can be seen that the belief distributions in the optimized DBRB-I(20) are close to the initial DBRB-I(20), which indicates that these parameters are locally optimized based on the initial judgement of experts.

Rule reduction under different conditions

The prediction results of the DBRB-I(20) model are shown in Fig. 20. The DBRB-I(20) model can predict the entire change trend of the PV power generation system, but it does not have the ability to accurately predict the output result of “excellent”. The reason for the poor prediction effect is that the rules with less influence of the model are simplified. An effective way to improve the accuracy of the DBRB-I(20) model is to lower the threshold g for rule reduction and allow more rules to participate in the model.

To prove that reducing the rule reduction threshold g can improve the accuracy of the model, therefore, the threshold g = 0, only the rules that have no effect on the system are reduced for the DBRB-I model. Table 5 gives the number of rules for DBRB-I(41). As shown in Fig. 21, the DBRB-I(41) model can accurately predict the PV power generation system in an interpretable manner. However, DBRB-I(41) has 41 rules, and DBRB-I(20) has 20 rules; that is, the DBRB-I(20) model is more readable than DBRB-I(41).

Table 5 The DBRB-I model rules.

Full size table

To further prove that removing inefficient rules can improve the accuracy of the model, a comparison between the DBRB(48) model and the DBRB(41) model is shown in Fig. 22. There are 7 fewer rules and 35 fewer optimization parameters in the entire optimization process, which saves more computing resources for DBRB(41) and improves the optimization process of the DBRB(41) model. The MSEs of DBRB(41) and DBRB(48) are 2.81E-5 and 4.83E-4, respectively. Figure 22 shows the comparison between DBRB(41), DBRB(48) and the atcual value. It can be seen that the prediction effect of DBRB(41) is better than that of DBRB(48). This proves that reducing inefficient rules can improve the accuracy of the model. When inefficient rules exist in the system, it will increase the burden on the optimization process of the model and reduce the interpretability of the model. Therefore, choosing appropriate rules is crucial for the model.

Analysis of interpretability constraints of the DBRB-I model

The effectiveness of interpretability constraints 1 and 3 is demonstrated in Fig. 23. The blue and red curves are the modified P-CMAES algorithm and the original P-CMAES algorithm, respectively. Compared with the original P-CMAES algorithm, the modified P-CMAES algorithm is more suitable for the optimization process of the interpretability model. The starting point of the optimization process of the modified algorithm starts from the vicinity of the expert knowledge so that the initial population of the algorithm will carry some characteristics of the expert knowledge information. Moreover, the optimization process of the DBRB-I model is a local search process based on the initial judgment of expert knowledge. This proves that the optimization process of the modified algorithm can maintain a good balance between model interpretability and modelling accuracy.

For the DBRB-I model, the expert knowledge accumulated from the PV power generation system plays a crucial role in the interpretability of the model. The parameters obtained by expert knowledge are closer to the optimal point in the feasible domain search space; that is, expert knowledge can provide a guiding direction for the optimization process. Therefore, putting expert knowledge into the initial population of the algorithm can improve the optimization process of the model. Furthermore, interpretability constraint 3 further realizes that the optimization process of the DBRB-I model is a local search domain based on expert judgment.

Belief rule base is an intelligent expert system that combines the characteristics of expert system and data-driven model²¹. Experts can build the initial BRB model according to their experience and domain knowledge, which effectively reduces the difficulty of parameter optimization and improves the interpretability of the model²³. Moreover, the model parameters are further optimized by observation data, which enables the BRB model to achieve good accuracy. Thus, the DBRB-I model is a hybrid model that combines knowledge and data. The initial DBRB-I model is reasonably constructed by expert knowledge, so that the model has good interpretability. Through observational data, an optimization algorithm with interpretability constraints is used for optimization, enabling fine-tuning of parameters based on expert knowledge. Therefore, the DBRB-I model is able to maintain a good balance between interpretability and accuracy.

Analysis of convergence of the DBRB-I model

The convergence rate of the DBRB-I(20) model is shown in Fig. 24, which shows that DBRB-I (20) has a faster convergence rate and the starting point of the optimization is closer to the optimal solution. The reasons for this phenomenon are as follows: First, DBRB-I(20) reduces the inefficient rules and improves the optimization process of the model. Second, the DBRB-I(20) model has interpretability constraints that limit the solution space. Moreover, DBRB-I(20) can make the starting point of the optimization process close to the optimal solution through interpretability constraint 1. However, the model convergence accuracy is limited due to the reduction of inefficiency rules and the increase of interpretability constraints for the DBRB-I(20) model. Moreover, DBRB(41) reduces redundant rules in the model and reduces the complexity of the model, so DBRB(41) converges faster than DBRB(48). Due to the use of multiple interpretability constraints in the optimization process, the model has good performance in interpretability and accuracy, but the DBRB-I model shows a longer execution time, as shown in Appendix Table B4.

Analysis of the accuracy of different DBRB-I models

Figure 25 shows the comparison between the DBRB-I(20) model and the number of rules per sublayer of DBRB(48). Table 6 shows the comparison of the DBRB-I model with other models. Although the prediction accuracy of the DBRB-I(20) model is not as good as that of DBRB(48), the DBRB-I(20) model is interpretable. The interpretability of DBRB-I(20) is as follows: (1) The belief distribution of the optimized model satisfies the PV power generation system, while DBRB(48) has no such capability. (2) The optimization of the DBRB-I(20) model is carried out in the local search domain judged by experts, which indicates that the optimized parameters of the DBRB-I(20) model are fine-tuned on the basis of expert knowledge; that is, the DBRB-I(20) model can be used in interpretability and precision are well balanced. However, DBRB(48) is optimized in the entire solution space, and the optimized parameters will conflict with expert knowledge. 3) The number of rules for the DBRB-I(20) model is 20, while the number of rules for DBRB(48) is 48. The DBRB-I(20) model reduces the complexity of the system by using fewer rules, the readability of the model is improved, and the interpretability of the model is enhanced.

Table 6 Comparison of model accuracy.

Full size table

Robustness analysis of the DBRB-I model

A comparison of the prediction results of the DBRB-I model, the popular long short-term memory (LSTM) model and the deep boltzmann machine (DBN) model is shown in Fig. 26^34,35,36. To verify the robustness of the proposed DBRB-I model, 20 repeated optimization processes are conducted. Although the accuracy of the DBRB-I model is similar to that of LSTM and DBN, the DBRB-I model is an interpretable model, while LSTM and DBN have no such ability. Moreover, the robustness analysis of the model is shown in Table 7. The MSE standard deviation of DBRB-I is smaller than that of LSTM and DBN, which means the robustness of the DBRB-I model is stronger than LSTM and DBN. In practical engineering, the DBRB-I model is more suitable for reliable and safe systems²³.

Table 7 Robustness analysis of the model.

Full size table

Analysis of DBRB-I model skill scores

To further verify the prediction effect of the DBRB-I model, benchmarking is also performed by skill score, which is defined as follows³⁷:

$$Skill\,\,score = 1 - \frac{{error_{proposed} }}{{error_{reference} }}$$

(33)

where $error_{proposed}$ is the error of the proposed model and $error_{reference}$ is the error of the reference model.

Through the above analysis, when the reference model selects the LSTM model, the DBRB-I (41) skill score = 0.20; when the reference model selects the DBN model, the DBRB-I(41) skill score = 0.11. Although the accuracy improvement of the DBRB-I(41) model is not large, the DBRB-I(41) model is an interpretable model. The interpretability of the DBRB-I model compared with LSTM and DBN is as follows: (1) The DBRB-I model is a rule-based modeling method, which can describe the modeling process of the system in a language semantic way. (2) The DBRB-I model has a transparent inference engine, which makes the internal structure clear and transparent and can be directly accessed by users. (3) The DBRB-I model can incorporate expert knowledge and system mechanisms, which can better help people understand and trust the model.

Discussion of the interpretability of the DBRB-I model

As an interpretable model, DBRB-I has the characteristics of a transparent reasoning process, process interpretability, and traceability of results. Power grid operators can directly access the model, but the internal structure of the data-driven black-box model is invisible. Moreover, DBRB-I can identify key parameters of the PV power generation system, and experts can further improve their expert knowledge by analyzing the key parameters. The sensitivity analysis for each rule of Sub-BRB3 is shown in Fig. 27. The parts drawn with black circles clearly show that there are jumps in the system, which can have a huge impact on the PV system when the rule weights are within a certain interval. Therefore, to ensure that the system can maintain a balance between accuracy and interpretability, interpretability constraint 2 is necessary. For the higher sensitivity rules 1, 2, 4, 5, and 7, grid operators should analyze the hidden mechanisms in detail as feedback on the actual situation³⁸. For the less sensitive rules 3, 6, 8, and 9, the modelling process should be further improved, which can improve the modelling accuracy³⁸.

The reasoning process and modelling process of the DBRB-I(20) model can be accessed by decision makers and users. Therefore, DBRB-I as an interpretability model can effectively analyse the system. Each rule of Sub-BRB1, Sub-BRB2, and Sub-BRB3 of DBRB-I(20) is shown in Fig. 28. As shown in Figure a, Sub-BRB1 has low sensitivity to the belief level G but high sensitivity to E and L. The reason for this is that the irradiance, voltage and output power data of the model are not well fitted at the belief level G, as shown in Fig. 6, which is consistent with reality. As shown in Figure b, Sub-BRB2 has low sensitivity to belief level E but high sensitivity to G, L, and M. The reason for this is that the module temperatures of the system do not reach the belief level of E, as shown in Fig. 6. As shown in Figure c, Sub-BRB3 combines Sub-BRB1 and Sub-BRB2 and is sensitive to belief levels E, G, M, and L. This further demonstrates the effectiveness of the DBRB-I(20) model.

Conclusion

In this paper, a new PV power generation prediction model based on deep belief rule base with interpretability (DBRB-I) is proposed. In the DBRB-I model, first, the attributes of the PV power generation system are trended toward analysis, and the Sub-BRB is constructed through the correlation between the attributes and the results. Second, sensitivity analysis of the initial DBRB-I model constructed by experts, which reduces inefficient rules and redundant rules to reduce model complexity. Finally, the simplified DBRB-I model is optimized by an interpretability optimization algorithm.

There are three innovations in this paper. A PV power generation prediction model is proposed based on the DBRB-I model. The DBRB-I model consists of multiple Sub-BRBs in a deep structure, which effectively solves the problem of rule explosion and weak scalability of BRBs. To ensure that the interpretability of the model after optimization is not destroyed, a new optimization method is designed. Moreover, to improve the readability of PV power generation prediction models, a transparent and reliable rule reduction method sensitivity analysis is proposed. A case study of a PV power generation system is used to verify the validity of the proposed model. The results show that the DBRB-I model can maintain a good balance between interpretability and accuracy.

However, the local sensitivity analysis method used in this paper has limitations. This method neglects the mutual influence between uncertain parameters, which will interfere with the decision-making results. Therefore, how to use global sensitivity analysis deserves further study. Moreover, more benchmark datasets should be used and how to adequately analyze and interpret the sensitivity of a model in future research.

Data availability

The datasets generated and analysed during the current study are available in the AI Studio repository, https://aistudio.baidu.com/aistudio/datasetdetail/147402.

References

Wang, K. J., Qi, X. X. & Liu, H. D. Photovoltaic power forecasting based LSTM- convolutional network. Energy 189, 116225 (2020).
Article Google Scholar
Antonanzas, J. et al. Review of photovoltaic power forecasting. Sol. Energy 136, 78–111 (2016).
Article ADS Google Scholar
Eseye, A. T., Zhang, J. H. & Zheng, D. H. Short-term photovoltaic solar power forecasting using a hybrid Wavelet-PSO-SVM model based on SCADA and Meteorological information. Renewable Energy 118, 357–367 (2017).
Article Google Scholar
Halabi, L. M., Mekhilef, S. & Hossain, M. Performance evaluation of hybrid adaptive neuro-fuzzy inference system models for predicting monthly global solar radiation. Appl. Energy 213, 47–261 (2018).
Article Google Scholar
Chen, X. Y., Du, Y., Lim, E. G., Wen, H. Q. & Jiang, L. Sensor network based PV power nowcasting with spatio-temporal preselection for grid-friendly control. Appl. Energy 255, 113760 (2019).
Article Google Scholar
Wen, H. R. et al. Deep learning based multistep solar forecasting for PV ramp-rate control using sky images. IEEE Trans. Ind. Inf. 17(2), 1397–1406 (2021).
Article Google Scholar
Saint-Drenan, Y. M. et al. An empirical approach to parameterizing photovoltaic plants for power forecasting and simulation. Sol. Energy 120, 479–493 (2015).
Article ADS Google Scholar
Almeida, M. P., Muoz, M., de la Parra, I. & Perpinan, O. Comparative study of PV power forecast using parametric and nonparametric PV models. Sol. Energy 155, 854–866 (2017).
Article ADS Google Scholar
Wang, K. J., Qi, X. X. & Liu, H. D. A comparison of day-ahead photovoltaic power forecasting models based on deep learning neural network. Appl. Energy 251, 113315 (2019).
Article Google Scholar
Miao, S. W., Ning, G. T., Gu, Y. Z., Yan, J. H. & Ma, B. T. Markov Chain model for solar farm generation and its application to generation performance evaluation. J. Clean. Prod. 186, 905–917 (2018).
Article Google Scholar
Massidda, L. & Marrocu, M. Use of multilinear adaptive regression splines and numerical weather prediction to forecast the power output of a PV plant in Borkum Germany. Sol. Energy 146, 141–149 (2017).
Article ADS Google Scholar
Rodriguez, F., Fleetwood, A., Galarza, A. & Fontan, L. Predicting solar energy generation through artificial neural networks using weather forecasts for microgrid control. Renew. Energy 126, 855–864 (2018).
Article Google Scholar
Yagli, G. M., Yang, D. Z. & Srinivasan, D. Automatic hourly solar forecasting using machine learning models. Renew. Sustain. Energy Rev. 105, 487–498 (2019).
Article Google Scholar
Halabi, L. M., Mekhilef, S. & Hossain, M. Performance evaluation of hybrid adaptive neuro-fuzzy inference system models for predicting monthly global solar radiation. Appl. Energy 213, 247–261 (2018).
Article Google Scholar
Barman, M. & Choudhury, N. B. D. Season specific approach for short-term load forecasting based on hybrid FA-SVM and similarity concept. Energy 174, 886–896 (2019).
Article Google Scholar
Das, U. K. et al. Forecasting of photovoltaic power generation and model optimization: A review. Renew. Sustain. Energy Rev. 81(1), 912–928 (2018).
Article Google Scholar
Baskarad, T., Kuzle, I. & Holjevac, N. Photovoltaic system power reserve determination using parabolic approximation of frequency response. IEEE Trans. Sustain. Energy. 12(4), 3175–3184 (2021).
Google Scholar
D’Andrea, E. & Lazzerini, B. A hierarchical approach to multiclass fuzzy classifiers. Exp. Syst. Appl. 40(9), 3828–3840 (2013).
Article Google Scholar
Toubeau, J. F., Bottieau, J., Wang, Y. & Vallee, F. Interpretable probabilistic forecasting of imbalances in renewable-dominated electricity systems. IEEE Trans. Sustain. Energy. 13(2), 1267–1277 (2021).
Article ADS Google Scholar
Feng, Z. C. et al. A new belief rule base model with attribute reliability. IEEE Trans. Fuzzy Syst. 27(5), 903–916 (2018).
Article Google Scholar
Zhou, Z. J., Hu, G. Y., Hu, C. H., Wen, C. L. & Chang, L. L. A survey of belief rule-base expert system. IEEE Trans. Syst. Man Cybernetics-Syst. 51(8), 4944–4958 (2021).
Article Google Scholar
Cao, Y., Zhou, Z. J., Hu, C. H., Tang, S. W. & Wang, J. A new approximate belief rule base expert system for complex system modelling. Decis. Support Syst. 150, 113558 (2021).
Article Google Scholar
Cao, Y., Zhou, Z. J., Hu, C. H., He, W. & Tang, S. W. On the interpretability of belief rule-based expert systems. IEEE Trans. Fuzzy Syst. 29(11), 3489–3503 (2021).
Article Google Scholar
Zhang, A., Gao, F., Yang, M. & Bi, W. H. A new rule reduction and training method for extended belief rule base based on DBSCAN algorithm. Int. J. Approx. Reason. 119, 20–39 (2020).
Article MathSciNet MATH Google Scholar
Zhou, Z. J. et al. New health-state assessment model based on Belief Rule Base with interpretability. Sci. China (Inf. Sci.) 64(7), 15 (2021).
Google Scholar
Zhou, Z. J. et al. Interpretability and development of rule-based modelling methods. Acta Autom. Sin. 47(6), 1201–1216 (2021).
MATH Google Scholar
Ramachandran, A., Gupta, S., Rana, S., Li, C. & Venkatesh, S. Incorporating expert prior in bayesian optimization via space warping. Knowl.-Based Syst. 195, 105663 (2020).
Article Google Scholar
Yang, L. H., Wang, Y. M., Lan, Y. X., Chen, L. & Fu, Y. G. A data envelopment analysis (DEA)-based method for rule reduction in extended belief-rule-based systems. Knowl.-Based Syst. 123, 174–187 (2017).
Article Google Scholar
Gao, F., Zhang, A., Bi, W. H. & Ma, J. W. A greedy belief rule base generation and learning method for classification problem. Appl. Soft Comput. 98, 106856 (2020).
Article Google Scholar
Lai, X., Wang, S. Y., Ma, S. D., Xie, J. Y. & Zheng, Y. J. Parameter sensitivity analysis and simplification of equivalent circuit model for the state of charge of lithium-ion batteries. Electrochim. Acta 330, 135239 (2019).
Article Google Scholar
Cui, Y., Sun, Y. C. & Chang, Z. L. Research progress on short-term solar photovoltaic power generation forecasting methods. Resour. Sci. 35(7), 8 (2013).
Google Scholar
Dong, L., Zhou, W. P., Zhang, P., Liu, G. Y. & Li, W. D. Short-term probability prediction of photovoltaic power generation based on dynamic Bayesian network. Proc. CSEE. 33(S1), 38–45 (2013).
Google Scholar
Feng, Z. C. et al. A new safety assessment method based on belief rule base with attribute reliability. IEEE/CAA J. Autom. Sin. 8(11), 1774–1785 (2021).
Article Google Scholar
Agga, A., Abbou, A., Labbadi, M. & EI Houm, Y. Short-term self consumption PV plant power production forecasts based on hybrid CNN-LSTM ConvLSTM models. Renew. Energy. 177, 101–112 (2021).
Article Google Scholar
Ahmed, R., Sreeram, V., Togneri, R., Datta, A. & Arif, M. D. Computationally expedient Photovoltaic power Forecasting: A LSTM ensemble method augmented with adaptive weighting and data segmentation technique. Energy Convers. Manage. 258, 115563 (2022).
Article Google Scholar
Chang, G. W. & Lu, H. J. Integrating gray data preprocessor and deep belief network for day-ahead PV power output forecast. IEEE Trans. Sustain. Energy. 11(1), 185–194 (2020).
Article ADS Google Scholar
Yang, D. Z. A guideline to solar forecasting research practice: Reproducible, operational, probabilistic or physically-based, ensemble, and skill (ROPES). J. Renew. Sustain. Energy. 11(2), 22701 (2019).
Article Google Scholar
Chang, L. L. & Zhang, L. M. Explainable data-driven optimization for complex systems with non-preferential multiple outputs using belief rule base. Appl. Soft Comput. 110, 107581 (2021).
Article Google Scholar

Download references

Acknowledgements

This work was supported in part by the Postdoctoral Science Foundation of China under Grant No. 2020M683736, in part by the Natural Science Foundation of Heilongjiang Province of China under Grant No. LH2021F038, in part by the innovation practice project of college students in Heilongjiang Province under Grant Nos. 202010231009, 202110231024, and 202110231155, and in part by the basic scientific research business expenses scientific research projects of provincial universities in Heilongjiang Province Grant Nos. XJGZ2021001.

Author information

Authors and Affiliations

Harbin Normal University, Harbin, 150025, China
Peng Han, Wei He, YingMei Li & YunYi Zhang
Rocket Force University of Engineering, Xi’an, 710025, China
Wei He & You Cao

Authors

Peng Han
View author publications
Search author on:PubMed Google Scholar
Wei He
View author publications
Search author on:PubMed Google Scholar
You Cao
View author publications
Search author on:PubMed Google Scholar
YingMei Li
View author publications
Search author on:PubMed Google Scholar
YunYi Zhang
View author publications
Search author on:PubMed Google Scholar

Contributions

Conceptualization, H.P. and H.W; methodology, H.P. and Y.M.; investigation, C.Y.; writing—original draf preparation, H.P. and Y.M.; writing—review and editing, C.Y. and Y.Y. All authors have read and agreed to the published version of the manuscript.

Corresponding authors

Correspondence to Wei He or YingMei Li.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Han, P., He, W., Cao, Y. et al. Deep belief rule based photovoltaic power forecasting method with interpretability. Sci Rep 12, 14467 (2022). https://doi.org/10.1038/s41598-022-18820-6

Download citation

Received: 14 May 2022
Accepted: 19 August 2022
Published: 24 August 2022
Version of record: 24 August 2022
DOI: https://doi.org/10.1038/s41598-022-18820-6

This article is cited by

An interpretable health state assessment method for aerospace equipment based on belief rule base with fuzzy credibility factor
- Zongjun Zhang
- Haifeng Wan
- Wei He
The Journal of Supercomputing (2025)
A new interpretable belief rule base model with step-length convergence strategy for aerospace relay health state assessment
- Xiuxian Yin
- Bing Xu
- Wei He
Scientific Reports (2023)

Subjects

Abstract

Similar content being viewed by others

The development of CC-TF-BiGRU model for enhancing accuracy in photovoltaic power forecasting

Multi-label machine learning for power forecasting of a grid-connected photovoltaic solar plant over multiple time horizons

Evaluating machine learning models comprehensively for predicting maximum power from photovoltaic systems

Introduction

Problem description

Problems with prediction systems

Problem 1

Problem 2

Problem 3

Construction of the DBRB-I model

The DBRB-I model interpretability

Sensitivity analysis

Reasoning process and optimization process of the DBRB-I model

The reasoning process of the DBRB-I model

Optimization process of the DBRB-I model

Case study

Description of the dataset

Construct the initial BRB by expert knowledge

Sensitivity analysis of the initial DBRB-I model

The optimized DBRB-I model

Transparency and traceability of the DBRB-I model

The DBRB-I model conforms to expert judgment

Rule reduction under different conditions

Analysis of interpretability constraints of the DBRB-I model

Analysis of convergence of the DBRB-I model

Analysis of the accuracy of different DBRB-I models

Robustness analysis of the DBRB-I model

Analysis of DBRB-I model skill scores

Discussion of the interpretability of the DBRB-I model

Conclusion

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Supplementary Information.

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

An interpretable health state assessment method for aerospace equipment based on belief rule base with fuzzy credibility factor

A new interpretable belief rule base model with step-length convergence strategy for aerospace relay health state assessment

Search

Quick links