Introduction

In the current context of a socially conscious economy, sustainability trends have attracted the interest of regulators and researchers. Sustainability is reflected in corporate social responsibility (CSR) activities that integrate environmental, social, and governance (ESG) aspects into decision-making processes (Sassen et al., 2016). These CSR activities can be evaluated based on ESG performance scores (Chollet and Sandwidi, 2018; Nofsinger et al., 2019), which explicitly integrate governance aspects (Gillan et al., 2021).

In this sense, a rich literature has developed on the effect of ESG scores on corporate performance (Lee and Faff, 2009; Oikonomou et al., 2012; Albertini, 2013; Lisin et al., 2022) and banks (Paltrinieri et al., 2020; Aracil et al., 2021; SN Azmi et al., 2023). This impact has generally been considered positive concerning the banking sector’s profitability (Buallay, 2019). The interest in the banking sector stems from its importance in sustainable development due to its intermediation role, which allows financial resources to be mobilized to achieve sustainable goals (Yip and Bocken, 2018). In addition, several external shocks have affected the banking sector, most notably the 2008 financial crisis (Forcadell and Aracil, 2017), digitalization (Del Gaudio et al., 2021; Chinoda and Kapingura, 2023), and facilitating the sustainable digital transformation of banking institutions (Forcadell et al., 2020).

The attention paid to the impact of ESG commitments on the banking sector, especially after the 2008 financial crisis, has been the origin of several studies on the relationship between ESG performance and banking risk or stability (Srivastav and Hagendorff, 2016; Chollet and Sandwidi, 2018; Chiaramonte et al., 2022; Neitzert and Petras, 2022; Aevoae et al., 2023; Galletta et al., 2023). The literature on the nexus between ESG scores and bank stability mainly reveals a theoretical debate between the shareholder view of the firm (Friedman, 1970), which sees these ESG practices as a cost and, therefore, an overinvestment, and the stakeholder view (Freeman, 1984), which sees them more as an ethical obligation to mitigate bank risk. Both views suggest opposite effects. Despite the existence of recent literature, major theoretical gaps remain in examining the nexus between ESG scores and the probability of bank failure (DZ Huang, 2022), and bank stability. Indeed, there is still no consensus on the nature, intensity, and direction of the relationship, given the complexity of the institutional context in which ESG activities take place. This complexity is due to organizational constraints and the diversity of individual decision-makers (DZX Huang, 2021). Furthermore, the nexus between ESG and banking stability remains unclear despite recent empirical literature. It requires further exploration, including consideration of non-linearity. To our knowledge, the study by Lupu et al. (2022) is the only one to test this aspect empirically. Finally, previous empirical studies on the impact of ESG commitments on bank stability have not considered the evolution of banking activities in an increasingly digitized environment, except for the Salah Mahdi et al. (2023) study.

Against this backdrop, this paper addresses the following central question: How do changes in ESG scores affect bank stability in the digital era? In addition to this main question, there are other equally important ones. These include the following: What is the link between the impact of digitization on bank stability and ESG activities? Does the stability of banks in the digital era depend on the level of their ESG scores?

In doing so, this paper highlights its contribution to empirical research. First, this article aims to test the hypothesis that the relationship is characterized by non-linearity and contingent upon the level of ESG scores, complementing the recent literature that has shown the positive impact of high ESG scores on bank stability (Chiaramonte et al., 2022), reducing systemic distress (Aevoae et al., 2023), and enabling a better predictive ability of bank distress (Citterio and King, 2023). Second, to account for the impact of the digital age, this study also examines the impact of ICT diffusion and endowment on bank stability, controlling for ESG scores. To our knowledge, this is the first study to link the impact of digitization on bank stability to ESG activities. Third, this article highlights its contribution to the empirical literature from a methodological point of view, opting for the Panel Smooth Transition AutoRegressive (hereafter, PSTAR) modeling approaches to capture the nuances and regime change of the nexus between ESG scores and banking stability in the digital era for a group of European banks operating in 15 European countries between 2005 and 2022. This relatively long period encompasses both the 2008 global financial crisis and the European 2010–2012 sovereign debt crisis, providing insight into the nexus between ESG and stability in the digital age, particularly in the context of crises.

The European context is important for several reasons. First, the European banking sector is a major component of the global financial system (Batten et al., 2022), so its stability is important not only for the region but also for all global economies, given the potential contagion effects outside Europe (Gabrieli and Salakhova, 2019; Teply and Klinger, 2019). Second, in the European banking context, engagement in ESG activities is a priority, especially for the European Banking Authority, which requires ESG disclosure in a regulatory framework aimed at improving the sustainability of the European financial system (Bruno and Lagasio, 2021). In addition, there are several countries in the region whose banks have been pioneers in sustainable development. Finally, the focus on the EU is important because it allows us to identify the possible impact of digitization and ESG ratings on bank stability in the presence of a mature banking sector.

The results highlight three key findings. First, the empirical results confirm the non-linear nexus between ESG performance and banking stability in the advent of the digital age. Second, higher ESG scores are associated with a lower risk of bank failure. Finally, European banks with low ESG scores fail to avoid the fragility caused by investments in technological infrastructure.

The subsequent section is structured in the following sequence: in section “Theoretical background and hypothesis”, the theoretical and empirical literature, on the influence of ESG scores and digitization on bank stability, are checked. The PSTAR modeling approach and associated data are presented in the section “Econometric modeling and variable processing”. Descriptive analysis of variables and the model employed are the subjects of the section “Empirical analysis”. The primary estimation results are discussed in section “Discussion”. Section “Conclusion and policy recommendations” concludes with the policy implications.

Theoretical background and hypothesis

This section aims to distinguish between two relationships. Firstly, it reports on the theoretical debate regarding the relationship between digitization and bank stability. This debate is based on two opposing hypotheses: ‘innovation-stability’ and ‘innovation-fragility’. Secondly, it aims to deepen the analysis of the relationship between ESG performance and bank stability, referring to existing theories and empirical investigations.

Digitization and bank stability

The digitization of the financial sector, thanks to information and communication technologies (ICT), is one of the defining phenomena of today’s financial world (Kasri et al., 2022). Indeed, ICTs are profoundly reshaping the economic landscape, eliminating information asymmetries thanks to big data (Kosmidou et al., 2017; Guérineau and Léon, 2019), and enabling the rapid spread of financial innovations. The development of digital capabilities is accompanied by a rise in customer expectations and the entry of fintech companies into the banking ecosystem. As a result, competition is intensifying, and the dynamics of financial services, in the broadest sense, within the banking ecosystem are becoming completely different (Zavolokina et al., 2017). Consequently, in the digital age, the competitiveness and viability of banks are becoming increasingly complex, stimulating scientific research into the impacts of digitization on bank stability.

The beneficial influence of ICT on financial institutions’ profitability and operational effectiveness (DeYoung et al., 2007; Ciciretti et al., 2009; Scott et al., 2017), in line with creative destruction Schumpeter’s theory of (Schumpeter, 1934), which considers technological progress and innovation as the primary drivers of business performance, has not prevented a theoretical discussion on the impact of digitization on banking stability. This debate is based on two opposing hypotheses “innovation stability” and “innovation fragility.”

According to the first hypothesis, the technological innovation that characterizes the digital age translates into improved bank stability. Several empirical studies supported this view. Del Gaudio et al. (2021), who study the impact of ICT on bank stability in EU countries, find a positive effect of ICT diffusion, endowment, and infrastructure. They find empirical evidence of a positive relationship between the reduction of bank failure risk and ICT diffusion and endowment. They show that the reduced fragility of European banks resulted from improved performance and information dissemination. Kasri et al. (2022), in their study of the banking sector in Indonesia, using the ARDL model, asserted the existence of a positive short-term relationship between digital payments and bank stability with unidirectional causality. Similarly, Hasan et al. (2012), in their study of banks in 27 European countries, found a positive relationship between efficient electronic payment services and bank performance and stability.

According to the second hypothesis, the technological innovation that characterizes the digital era makes the banking sector more fragile. Indeed, empirical studies have shown that the ICT behind financial innovation can increase systemic risks to financial stability and bank soundness (Uddin et al., 2020). Numerous empirical studies have highlighted the distortions introduced by financial innovation and its contribution to significant risk-taking, which was reported as a primary contributing factor to the 2007–2008 global financial crisis (Rainey and Ibikunle, 2012). Moreover, according to Beck et al. (2016), financial innovation can also lead to fragility in banks because of greater profitability volatility, especially in nations with more stringent regulatory environments and larger securities markets.

It should be noted that some studies have found mixed results, as in the case of Syed et al. (2022), which included one developed and one developing country. Using ARDL estimation, this study examined the influence of digital financial services on banking stability and efficiency in India and the US. Their results show that the effects vary by country’s level of development. Specifically, the “innovation-stability” hypothesis holds only in the case of the US in the short and long terms. However, in the short-term, the Indian banking sector supports the “innovation-fragility” hypothesis.

A synthesis of the empirical literature reveals two significant gaps. First, the extant studies tend to concentrate on the influence of financial innovation or digital financial inclusion resulting from digitization on stability. As far as we are aware, only the study by Del Gaudio et al. (2021) has analyzed the impact of digitization on bank stability through ICT diffusion, endowment, and infrastructure. Second, all studies assumed the linearity of the relationship without considering possible non-linearity. Following in the footsteps of Del Gaudio et al. (2021), this research attempts to present new findings while exploring the non-linear relationship between digitization and bank stability.

Based on the above discussions, we put forth the following hypotheses for consideration:

H1. The diffusion of ICT has a positive impact on bank stability.

H2. The technological endowment of the banking sector positively impacts bank stability.

ESG performance and bank stability

The relationship between ESG performance and bank stability is a topic of theoretical debate. The classical view (Friedman, 1970) supports a negative relationship, while the stakeholder view (Freeman, 1984) confirms a positive impact of ESG scores on bank stability.

In line with the classical view, the sole responsibility of banks is to maximize profits. From this perspective, engaging in ESG activities diverts scarce bank resources away from maximizing shareholder wealth and increasing banking value (Barnea and Rubin, 2010). In this line of thinking, ESG investments, according to the managerial opportunism argument, are a waste of resources and negatively affect the bank’s performance (Barnea and Rubin, 2010). According to Barnea and Rubin (2010), these investments can be considered agency costs, in the sense that managers, concerned with their reputation, make them to the detriment of shareholders’ interests. From this perspective, investors perceived these firms as riskier. This view of overinvestment thus predicts a positive (negative) relationship between bank risk (stability) and the ESG score.

In line with the overinvestment perspective, we can formulate the hypothesis as follows:

H3. The relationship between ESG scores and bank stability is negative.

Conversely, the stakeholder theory suggests that involvement in ESG practices leads to investments in ESG activities, which generate moral capital that can act as an insurance mechanism to mitigate both operational (Galletta et al., 2023) and ESG risks. Thus, the more a bank engages in ESG activities, the more it demonstrates greater transparency, enabling it to garner greater support from stakeholders (W. Azmi et al., 2021).

Such signals improve the bank’s reputation (Forcadell and Aracil, 2017) and attract more customers (McWilliams and Siegel, 2011). The benefits of reputational capital allow banks to finance their operations from stable sources, thereby reducing risk and consequently improving bank stability (Gangi et al., 2019). From this perspective, banks need to become more involved in ESG practices to enhance the image of the banking sector, particularly in the wake of the 2007–2008 global financial crisis, which led to customer distrust. Moreover, it is thanks to this reputational effect that the cost of capital and cash flows allow ESG activities to have a favorable influence on bank stability (W. Azmi et al., 2021).

The empirical literature has shown that most findings are consistent with the vision of stakeholder theory. Indeed, Aevoae et al. (2023) in their study of the influence of changes in ESG scores on the systemic risk found conclusions consistent with stakeholder theory. They documented the positive impact of ESG scores on reducing banks’ contribution to systemic risk. In other words, the higher the ESG score, the lower the banks’ contribution to system-wide difficulties. These findings are consistent with those of Scholtens and van’t Klooster (2019). Aevoae et al. (2023) analysis of the different sub-pillars of ESG showed that only the governance pillar has a significant positive impact. Their results are consistent with previous studies (Kiesel and Lücke, 2019; Chiaramonte et al., 2022). It should be noted, however, that the literature finds a significant positive impact of the social pillar on reducing banks’ risk (Chiaramonte et al., 2022). Similarly, the empirical literature shows that the more banks address environmental issues, the lower the risk of bank failure (Gangi et al., 2019). This effect is even stronger in times of crisis (Chiaramonte et al., 2022). In the same vein, Neitzert and Petras (2022), who analyzed the impact of ESG practices on banking risk and explored its factors of influence on a sample of banks worldwide between 2002 and 2018, showed that a high ESG score for banks is likely to reduce the risk of bank failure. However, this study notes that decomposing this overall effect by considering sub-pillars separately shows the importance of environmental engagement in mitigating and managing banks’ risks, which is consistent with the findings of Gangi et al. (2019) and Chiaramonte et al. (2022).

This study by Chiaramonte et al. (2022), which addresses the European banking sector, could be considered a useful reference for our study. Considering a group of banks in 21 European nations from 2005 to 2017, Chiaramonte et al. (2022) find that the impact of the ESG score on banking stability is positive. They find a stabilizing effect and a reduction in the fragility of banks with high ESG scores. These findings are consistent with those of Di Tommaso and Thornton (2020), who support the negative relationship between ESG engagement and risk-taking.

Di Tommaso and Thornton (2020) examine this relationship between ESG scores and the risk-taking of 81 European banks active in 19 nations after the global financial crisis. They show that the effect of high ESG scores on risk-taking, while negative, remains relatively modest for both low-risk and high-risk banks. Their results remain consistent with stakeholder theory regarding the positive impact of ESG practices on banking stability (risk reduction). Nevertheless, when examining the effect of ESG scores on bank value, they find a negative relationship, with high ESG scores reducing bank value. Such a conclusion is consistent with the view of “overinvestment” associated with ESG activities. They thus show that, in terms of bank value, the indirect positive effect of reduced risk-taking resulting from these ESG activities cannot compensate for their direct negative impact on bank value. The extant empirical review on the relationship between banks’ engagement in ESG activities and bank stability shows that most researchers find a positive effect of high ESG scores on risk reduction, consistent with stakeholders’ views. Considering the above discussion, we develop the hypothesis to be tested:

H4. The effect of European banks’ ESG scores on bank stability is positive.

The non-linear relationship between ESG performance and bank stability

Although a growing body of empirical evidence shows that ESG activities can reduce bank risk and improve bank stability, the issue still requires further investigation. Indeed, the evolving nature of ESG, as well as advances in digital technologies linked to big data and the potential of new empirical models, will allow new evidence to enrich the debate (Gillan et al., 2021). The empirical studies previously cited have all assumed a linear relationship between ESG scores and bank stability. However, as pointed out by Korinth and Lueg (2022), non-linear relationships should also be considered (Nollet et al., 2016; Barnett and Salomon, 2012). According to Barnett and Salomon (2012), engagement in social responsibility activities initially leads to a decrease in the firm’s financial performance, but subsequently, increased investment in social responsibility leads to an increase in financial performance. In this sense, their results confirm the hypothesis of a U-shaped relationship between social and financial performance. This shape of the relationship is also supported by the study of Nollet et al. (2016). However, Nollet et al. (2016) specify that low engagement in social responsibility activities is better for the firm’s financial performance compared to moderate engagement. Furthermore, high social performance is associated with the highest financial performance. They find that involvement in social responsibility activities only pays off once a certain threshold is reached.

Based on these studies of the relationship between ESG performance and firm financial performance, and by translating an improvement in financial performance into a reduction (improvement) in risk (stability), it is possible to predict a non-linear relationship between ESG performance and risk (or stability). The Korinth and Lueg (2022) study, which examines the relationship between ESG scores and risk on the German stock market for the 100 largest listed companies in the post-crisis period from 2012 to 2019, fits into this perspective. They show that the relationship between ESG scores and risk is inverse and non-linear. In the same vein, Lupu et al. (2022) studied the impact of ESG scores on the financial stability of 110 European banks between 2010 and 2022 using the cross-quantilogram approach. Their results show that the ESG performance of listed European banks influences their financial stability, which is consistent with the results of Tóth et al. (2021). Furthermore, Lupu et al. (2022) support the existence of a non-linear relationship between ESG performance and the financial stability of European banks. Their methodology analyzes the dependence on the lower and upper percentiles and demonstrates that ESG scores around their 60% percentile exhibit low levels of instability.

To fully comprehend the nexus between ESG scores and bank stability, it is imperative to give due consideration to the digital environment in which banks operate. All previous studies on this relationship have neglected the effect of ESG performance on the nexus between digitization and bank stability. Based on stakeholder theory, the more a bank engages in ESG activities, the more it demonstrates greater transparency and thus receives greater support from stakeholders (W. Azmi et al., 2021). This support is even more important with the digitalization of the banking sector, which protects stakeholders by reducing the risk of mismanagement (Mu et al., 2023). Salah Mahdi et al. (2023) argue that the digitization of the banking sector could improve the financial stability of banks by promoting ESG activities. On the other hand, the negative effects of digitization may be offset by the reputation generated by engaging in ESG activities (Forcadell et al., 2020), thus ensuring that bank stability is maintained. In the same vein, Stefanovic et al. (2021), studying the impact of digitization and sustainability on bank performance, argue that banks engaged in both digitization and ESG practices are profitable and, therefore, more stable even in times of crisis.

Based on the above discussion, we propose the following hypothesis:

H5. The relationship between ESG scores and bank stability is non-linear.

H6: The impact of digitization on bank stability depends on the level of banks’ ESG scores.

Figure 1 presents the conceptual model of the study.

Fig. 1
figure 1

Conceptual model.

The present study is similar to Lupu et al. (2022) and Salah Mahdi et al. (2023) in its focus but differs in its approach. Like Lupu et al. (2022), we investigate the relationship between ESG performance and European financial stability in the banking sector, but our study differs in two keyways. The study considers the digital environment in which European banks operate to capture the impact of ESG performance on the nexus between digitization and bank stability. The methodological approach is different, using the PSTAR model to estimate ESG score thresholds and their impact on bank stability. Finally, while the study by Salah Mahdi et al. (2023) focuses on Fintech, the present study examines the effect of ESG performance on the impact of ICT diffusion and endowment on bank stability to account for the digital era in which European banks operate.

Econometric modeling and variable processing

The paper introduces Panel Smooth Transition AutoRegressive (PSTAR) modeling approaches to capture the nuances and regime change of the relationship between bank stability and ESG scores in the digital era, following the presentation of the econometric model and the data. Diagnostic tests will be conducted to evaluate the assumptions and the model’s appropriateness in order to guarantee the results’ validity and reliability. These tests identify potential issues, including stationarity and dependency tests, and implement any required modifications.

Econometric model and variables description

The following general form of a linear model is assumed for the subject relationship:

$${{\rm{BZ}}\_{\rm{Score}}}_{{\rm{it}}}={{\rm{\beta }}}_{0}+{{\rm{\beta }}}_{1}{{\rm{IUI}}}_{{\rm{it}}}+{{\rm{\beta }}}_{2}{{\rm{MCS}}}_{{\rm{it}}}+{{\rm{\beta }}}_{3}{{\rm{ATM}}}_{{\rm{it}}}+{{\rm{\beta }}}_{4}{{\rm{ESG}}}_{{\rm{it}}}+\,{{\rm{\beta }}}_{5}{{\rm{NIM}}}_{{\rm{it}}}+{{\rm{\beta }}}_{6}{{\rm{ROA}}}_{{\rm{it}}}+{{\rm{\beta }}}_{7}{{\rm{CIR}}}_{{\rm{it}}}+{{\rm{\beta }}}_{8}{{\rm{GDPG}}}_{{\rm{it}}}+{{\rm{\lambda }}}_{{\rm{i}}}+{{\rm{\mu }}}_{{\rm{t}}}+{{\rm{\varepsilon }}}_{{\rm{it}}}$$

where the subscripts “\({\rm{i}}\)” and “\({\rm{t}}\)” denote the bank and time, respectively. The bank-specific effect is captured by \({{\rm{\lambda }}}_{{\rm{i}}}\), the relevant time effect is taken into consideration by \({{\rm{\mu }}}_{{\rm{t}}}\), and the random error term is represented by \({{\rm{\varepsilon }}}_{{\rm{it}}}\). We provide a comprehensive summary of all variables in Table 1.

Table 1 Model variables.

The current study covers the years 2005–2022. It included a panel of 15 European countries (France, Germany, Belgium, Denmark, Czechia, the Netherlands, Finland, Spain, Italy, Poland, Portugal, Norway, the United Kingdom, Switzerland, and Sweden). The selection of countries is not arbitrary. As the objective of this study is to examine the influence of ESG scores on bank stability in the digital era, it is based on the 2022 GII. What’s more, this diversified sample encompasses nations that exemplify the disparate rankings associated with this criterion, with every nation situated within the top 30. In addition, the selection of banks is a function of data availability.

Moreover, this diversified sample comprises nations that are illustrative of the diverse rankings to this criterion, all of which are situated within the top 30. Furthermore, the selection of banks is contingent upon the availability of pertinent data.

This study excludes countries with less than 18 years of bank observations and countries with insufficient observations to obtain reasonable variable estimates.

PSTAR model

In line with the research conducted by Helali and Kalai (2021), Bouattour et al. (2024), and Kalai et al. (2024), this study builds upon the work of Gonzalez et al. (2017), who developed regime change models that can be enhanced through the Panel Smooth Transition Regression (PSTR) modeling. This model’s contribution is analogous to the transitioning of time series from rapid transition approaches to gradual transition approaches.

A two-tier PSTR model is therefore satisfied by the process (\({y}_{{it}},t\in Z,i\in Z\)) but only if:

$${y}_{{it}}={{\rm{\beta }}}_{0}+{\beta }_{1}^{{\prime} }{X}_{{it}}+{\beta }_{2}^{{\prime} }{X}_{{it}}G\left({q}_{{it}}{{;}}\,\gamma {{;}}\,c\right)+{\lambda }_{i}+{\mu }_{t}+{\varepsilon }_{{it}}$$

where \(G\left({q}_{{it}};\gamma {;c}\right)\) represents the transition function about the transition variable \({q}_{{it}}\), the threshold parameter \(c\), and a smoothing coefficient \(\gamma\). The matrix of \(k\) exogenous variables, \({X}_{{it}}=\left({X}_{it}^{1},\ldots ,{X}_{it}^{k}\right)\) does not contain any delayed explanatory variables, \(\beta =\left({\beta }_{1},\ldots ,{\beta }_{k}\right)\) and \({\varepsilon }_{{it}} \sim \aleph \left(0,{\sigma }^{2}\right)\). The index \(i=1,\ldots ,N\) denotes the individual dimension, while the index \(t=1,\ldots ,T\) denotes the temporal dimension.

Like the growth model of time series, the continuous transition function \(G\left({q}_{{it}};\gamma {;c}\right)\) modifies the indicator function of the Panel Threshold Regression (PTR) modeling. This function is differentiable on the interval \(\left(\mathrm{0,1}\right)\). The change permits the gradual transition process from one regime to another. Two distinct theories can be posited to explain the PSTR model. It can initially be viewed as an approach that is bound by two extreme schemes and contains an infinite number of schemes. In this study, we examine the coefficients that have the potential to fluctuate based on the units and the time period under consideration. The second area of focus is the commentary on the non-linear PSTAR technique, which entails a slow alternation between two homogenous and linear extreme regimes.

Ideally, a speed-to-speed transition function can be achieved by utilizing a variety of transition functions that are continuous and derivable over the range of \(\left(\mathrm{0,1}\right)\). Gonzalez et al. (2017) recommend the establishment of a logistic transition function of order \(m\) in this scenario:

$$G\left({q}_{{it}}{{;}}\,\gamma {{;}}\,c\right)={\left[1+\exp \left(-\gamma \mathop{\prod }\limits_{j=1}^{n}\left({q}_{{it}}-{c}_{j}\right)\right)\right]}^{-1},\gamma \,> \,0\,{\rm{and}}\,{c}_{1}\le {c}_{2}\le \ldots \le {c}_{m}$$

where \(c\) is an m-dimensional vector of location parameters denoted by \(c={\left({c}_{1},\ldots ,{c}_{m}\right)}^{{\prime} }\). The slope parameter \(\gamma\) is the determinant of the regularity of the transitions. The restrictions \(\gamma \,> \,0\) and \({c}_{1}\le {c}_{2}\le \ldots \le {c}_{m}\) are enforced for the purpose of identification. In practical application, it is typically adequate to take into account m = 1 or m = 2, as these values accommodate frequently observed variations in the parameters.

It is crucial to assess linearity, or if the regime change effect is statistically significant, before estimating the PSTR model. A methodology for evaluating the null linearity hypothesis in comparison to a PSTR model was developed by Gonzalez et al. (2017). In the context of the PSTR model with unknown nuisance parameters under the null hypothesis \({H}_{0}:{\beta }_{2}=0\), the conventional testing approach is not standard. Gonzalez et al. (2017) proposed an alternative method to assess the linearity hypothesis \({H}_{0}:\gamma =0\). This involves substituting a first-order Taylor expansion around \(\gamma =0\) for \(G\left({q}_{{it}};\gamma {;c}\right)\) and resetting, leading to the auxiliary regression as presented below.

$${y}_{{it}}={{\rm{\beta }}}_{0}+{\beta }_{1}^{{\prime} * }{X}_{{it}}+{\beta }_{2}^{{\prime} * }{X}_{{it}}G\left({q}_{{it}}{{;}}\,\gamma {{;}}\,c\right)+{\varepsilon }_{{it}}^{* }$$

where \({\beta }_{1}^{{\prime} * }\) are inversely proportional to the slope parameter \(\gamma\); \({\varepsilon }_{it}^{* }={\varepsilon }_{{it}}+{R}_{1}{\beta }_{2}^{{\prime} * }{X}_{{it}}\), where \({R}_{1}\) is the remaining Taylor’s development. Consequently, the evaluation of \({H}_{0}:\gamma =0\) is equivalent to the evaluation of \({H}_{0}^{* }:{\beta }_{2}^{{\prime} * }=0\).

The following statistics are defined for each test: Wald, likelihood ratio and Fisher:

$$\begin{array}{lll}L{M}_{w}&=&\frac{{TN}\left({{SSR}}_{0}\,-\,{{SSR}}_{1}\right)}{{{SSR}}_{0}};\\{LR}&=&-2\left[{log} \left({{SSR}}_{0}\right)-{log} \left({{SSR}}_{1}\right)\right];\\L{M}_{F}&=&\frac{{TN}\left({{SSR}}_{0}\,-\,{{SSR}}_{1}\right)/K}{{{SSR}}_{0}/\left({TN}\,-\,N\,-\,K\right)}\end{array}$$

In the context of a linear panel model with individual effects, \({{SSR}}_{0}\) represents the residual sum of squares, while \({{SSR}}_{1}\) represents the residual sum of squares of the non-linear panel model with two regimes. If the null hypothesis is valid, the Wald Lagrange multiplier (\(L{M}_{w}\)) and the likelihood ratio (\({LR}\)) statistics will exhibit a chi-square distribution with k degrees of freedom, where k represents the number of explanatory variables. In other ways, the Fisher Lagrange multiplier (\(L{M}_{F}\)) statistic follows a two-degree Chi-square distribution.

Furthermore, if we assume the existence of two regimes (\(h=1\) and \(h=2\)) and consider the potential presence of an autoregressive issue, our fundamental linear model can be expressed in Panel Smooth Transition AutoRegression (PSTAR) specification as follows:

$$\begin{array}{l}{{\rm{BZ}}\_{\rm{Score}}}_{{\rm{it}}}=\left(\mathop{\sum}\limits_{j=1}^{p}{\rho}_{1j}{\scriptstyle{1}\atop}{{\rm{BZ}}\_{\rm{Score}}}_{{it}-j}+{\beta }_{0}{\scriptstyle{1}\atop}+{\beta}_{1}{\scriptstyle{1}\atop}{{\rm{IUI}}}_{{\rm{it}}}+{\beta}_{2}{\scriptstyle{1}\atop}{{\rm{MCS}}}_{{\rm{it}}}+{\beta }_{3}^{\scriptstyle{1}\atop}{{\rm{ATM}}}_{{\rm{it}}}\right.\\ \qquad\qquad\qquad\left.+\,{\beta}_{4}^{\scriptstyle{1}\atop}{{\rm{ESG}}}_{{\rm{it}}}+{\beta }_{5}{\scriptstyle{1}\atop}{{\rm{NIM}}}_{{\rm{it}}}+{\beta }_{6}^{1}{{\rm{ROA}}}_{{\rm{it}}}+{\beta }_{7}{\scriptstyle{1}\atop}{{\rm{CIR}}}_{{\rm{it}}}+{\beta }_{8}{\scriptstyle{1}\atop}{{\rm{GDPG}}}_{{\rm{it}}}+{\varepsilon }_{1{it}}\right)\\\qquad\qquad\qquad+\left(\mathop{\sum}\limits_{j=1}^{p}{\rho}_{1j}{\scriptstyle{2}\atop}{{\rm{BZ}}\_{\rm{Score}}}_{{it}-j}+{\beta }_{0}{\scriptstyle{2}\atop}+{\beta }_{1}{\scriptstyle{2}\atop}{{\rm{IUI}}}_{{\rm{it}}}+{\beta }_{2}{\scriptstyle{2}\atop}{{\rm{MCS}}}_{{\rm{it}}}+{\beta }_{3}{\scriptstyle{2}\atop}{{\rm{ATM}}}_{{\rm{it}}}\right.\\\left. \qquad\qquad\qquad+\,{\beta }_{4}{\scriptstyle{2}\atop}{{\rm{ESG}}}_{{\rm{it}}}+{\beta }_{5}{\scriptstyle{2}\atop}{{\rm{NIM}}}_{{\rm{it}}}+{\beta }_{6}{\scriptstyle{2}\atop}{{\rm{ROA}}}_{{\rm{it}}}+{\beta }_{7}{\scriptstyle{2}\atop}{{\rm{CIR}}}_{{\rm{it}}}+{\beta }_{8}{\scriptstyle{2}\atop}{{\rm{GDPG}}}_{{\rm{it}}}+{\varepsilon }_{2{it}}\right)\\ \qquad\qquad\qquad\times\, G\left({{ESG}}_{{it}-1}{{;}}\,\gamma \,{{;}}c\right)\end{array}$$

The coefficients of the lagged Banking Z_Score (\({{BZ\_Score}}_{{it}-j}\)) are \({\rho }_{1j}^{1}\) and \({\rho }_{1j}^{2}\) at each regime, \(h=1\) or \(h=2\).

Empirical analysis

Data analyze

Concerning Table 2, we describe the main variables in our model. We restrict ourselves to the variables BZ_Score, IUI, MCS, ATM, and ESG.

Table 2 Descriptive analysis.

The BZ_Score variable has an overall mean of 29.959 with a standard deviation of 75.458, indicating considerable heterogeneity and dispersion in our distribution (CV = 2.519 > 1). The 1854 observations all lie between −322.080 and 1943.306. This distribution shows a rightward skewness (skewness > 0) and a marked leptokurtosis (kurtosis > 0). Overall, the distribution rejects the null hypothesis of normality (JB p-value < 5%) and shows a significant serial correlation (BB p-value < 5%).

The variable IUI exhibits an overall mean of 67.459 with a standard deviation of 24.625, indicating relatively low heterogeneity in our distribution (CV = 0.365 < 1). All values lie between 7.285 and 99.058. The sample distribution shows a slight left skewness (skewness < 0) and a marked leptokurtosis (kurtosis > 0). Overall, the distribution did not follow the null hypothesis of normality (JB p-value < 5%) and showed significant serial correlation (BB p-value < 5%).

The MCS variable displays an average value of 112.282, accompanied by a relatively low standard deviation of 24.565, indicating a distribution that is not highly heterogeneous (CV = 0.25 < 1). All the values of this variable are between 17,523 and 192,073. The sample distribution is weakly skewed to the left (skewness < 0) and leptokurtic (kurtosis >0). Overall, the distribution rejects the null hypothesis of normality (JB p-value < 5%) and exhibits strong serial correlation (BB p-value < 5%).

The ATM variable shows comparatively little heterogeneity in our distribution (CV = 0.4 < 1), with an average value of 80.451 and a standard deviation of 32.232. The range of all values is 19.386–194.63. There is a noticeable leptokurtosis (kurtosis > 0) and a modest rightward skewness (skewness > 0) in the sample distribution. Overall, there is a substantial serial correlation (BB p-value < 5%), and the distribution does not fit the null hypothesis of normality (JB p-value < 5%).

The mean value of the ESG variable is 49.028, and its standard deviation is a quite modest 18.967, suggesting that our distribution is not too diverse (CV = 0.387 < 1). This variable’s values are all between 0.000 and 95.533. The distribution of the sample is leptokurtic (kurtosis > 0) and somewhat skewed to the left (skewness < 0). Overall, the distribution shows a substantial serial correlation (BB p-value < 5%) and deviates from the assumption of normalcy (JB p-value < 5%).

At the bottom of Table 2, all the coefficients are well below 0.8, which is below the limit recommended by Kennedy (1985), above which serious problems of multicollinearity between the explanatory variables may arise. In addition, the independent variables were found to be weakly correlated. Consequently, the problem of multicollinearity does not arise in our situation. The correlation matrix provided above reveals that there are indeed correlations among the exogenous variables, potentially indicating the presence of multicollinearity. This issue could be addressed by exploring alternative models that incorporate less strongly related data. Notably, all variables demonstrate a positive and significant correlation with the BZ_Score variable, except for the ESG and CIR variables, which exhibit a negative and statistically significant correlation. Additionally, the Variance Inflation Factor (VIF) test yields a low value of 1.24, confirming the absence of significant multicollinearity.

Unit root tests are of crucial importance in panel data analysis to check whether the data can be used in stationary time series models. If unit root behavior is identified, it may be necessary to consider methods such as differencing to make the data stationary before continuing with the analysis. In this context, we intend to present the results of the first- and second-generation unit root tests.

This phase will involve the interpretation of the unit root tests for two generations. Based on the notion that residuals are not influenced by individual differences, the first generation of tests was developed. This allows for the calculation of statistical distributions for the tests and the acquisition of asymptotic or semi-asymptotic normal distributions. From this perspective, any correlations between individuals are regarded as nuisance parameters. In the majority of cases, the assumption of independence is removed in the second generation of tests, which are more recent. Rather than viewing individual correlations as nuisance parameters, these tests suggest that these co-movements be taken advantage of to establish new test statistics, thereby completely reversing the previously adopted perspective. The null hypothesis of the two generations of tests infers the existence of a unit root in the series, with the exception of the Hadri (2000) LM test, which assumes the absence of a unit root.

Table 3 presents the first-generation unit root tests of Levin et al. (2002) (LLC), Im et al. (2003) (IPS), and Hadri (2000) (LM), which indicate the stationarity of all model variables in the first difference. Moreover, the stationarity of all variables in the first difference is observed in the second-generation studies of M. H. Pesaran (2003) and M. H. Pesaran (2007), as illustrated in Table 4. This implies that the data have been transformed in a manner that ensures their behavior is time-independent, at least until the initial difference.

Table 3 First-generation unit root tests.
Table 4 Second-generation unit root tests.

The unit root test with break, as originally proposed by Karavias and Tzavalis (2014) and summarized in Table 5, was implemented to enhance our findings. The purpose of this is to verify the stability of the data series under various analysis methods and to guarantee that the conclusions operated from Pesaran’s tests are robust. According to Table 5, the results confirm that all the variables are stationary in the first difference with the presence of breaks linked to economic and financial shocks, namely the global financial crisis and the European sovereign debt crisis (2008–2012) and the COVID-19 pandemic crisis, which began in early 2020. In this case, it is necessary to test the cointegration between the variables in the model. The existence of breaks offers empirical proof of the correlation between ESG and stability in the digital era, particularly in the context of crises.

Table 5 Break unit root test.

The presence of several structural breaks in the variables suggests that the relationship will not be stable over the 2005–2022 study period. Thus, it is essential to choose the regime-switching model in this article.

The following section will analyze cross-sectional dependence by employing a variety of tests, including Friedman (1937), Breusch and Pagan (1980), Frees (1995, 2004), M. H. Pesaran (2004, 2006, 2015), and M. H. Pesaran et al. (2008) tests. Designed to ascertain the existence of a dependency relationship among individuals, these assessments are described in Table 6. The existence of a dependency among individuals is confirmed by the majority of these tests, as all of the associated probabilities are less than 1%, as the results indicate. Furthermore, the hypothesis of homogeneity is rejected at the 5% significance level when the p-values of the homogeneity tests are less than 0.05. In addition, the homogeneity test conducted by Pesaran and Yamagata (2008) provides evidence of heterogeneous coefficients.

Table 6 Cross-sectional dependence & homogeneity tests.

Implemented to evaluate the cross-sectional independence of the residuals in a fixed-effects regression model, the Breusch and Pagan (1980) test, thereby providing additional support for the aforementioned findings. The Wald statistic was used to evaluate groupwise heteroscedasticity in the residues of the regression model with the fixed effects, and the Breusch and Pagan (1979) and White (1980) tests were implemented upon this foundation. The residuals demonstrate robust cross-sectional independence, as described in Table 7, which underscores the existence of substantial individual heteroscedasticity.

Table 7 Autocorrelation and Heteroscedasticity tests.

According to the heteroscedasticity test results, the coefficients are associated with a probability that is less than 1% (chi2 statistic = 3.5e + 06) and statistically significant (p-value = 0.000). Consequently, we reject the homoscedasticity hypothesis (H0). Additionally, the autocorrelation test provides evidence of the model’s strong serial autocorrelation, which is significant (p-value = 0.000) and less than 1% (chi2 = 12625.733). Consequently, we reject the H0 hypothesis. These outcomes reinforce our earlier observations regarding the interdependence between individuals and underscore the necessity of considering such heteroscedasticity in our analytical framework.

It is crucial to investigate the existence of a cointegration relationship between the variables under examination and to look at the dependence test because, in the first difference, the majority of the variables in question are predominantly stationary. These outcomes reinforce our earlier observations regarding the interdependence between individuals and underscore the necessity of considering such heteroscedasticity in our analytical framework. According to SN Azmi et al. (2023), Pedroni (1999) is the first to introduce cointegration tests in a panel model configuration. The Kao (1999), Pedroni (2004), and Westerlund (2007) tests were acknowledged as first-generation panel cointegration tests, which resolved the issue of limited observations and handled slope variations and intercepts among panel members. Yet, they overlook the existence of cross-sectional interdependence between individuals. The literature has suggested several panel cointegration tests that consider cross-sectional dependence. The test of second-generation panel cointegration proposed by Persyn and Westerlund (2008) accounted for the presence of cross-sectional dependence among individuals. This test is also useful in cases where the variables have different integration levels, provided that the dependent variable is not stationary at that level, i.e., I(0).

We can conclude that there is at least one cointegrating relationship for all of the variables in our model based on the findings of Table 8, which presents the results of various cointegration tests of the first and second generations. All these tests have a probability lower than the 5% threshold.

Table 8 First- and second-generation cointegration tests.

PSTAR analysis

We proceeded to re-estimate our fundamental model using the PSTAR strategy described above. To address the issue of serial autocorrelation, initially identified in Table 7, we employed the Chi-square statistic, which yielded a value of 1751.124 with a zero p-value. This confirms the existence of a single lag in the model, a PSTAR (p = 1). The main non-linearity results are shown in Table 9. These results demonstrate the existence of a non-linear effect of the ESG on the BZ_Score with an optimal order “m” equal to 1. Additionally, there are two optimal thresholds (r = 2).

Table 9 PSTAR linearity tests.

Table 10 findings reveal two optimal thresholds of 6.975 and 33.018, suggesting that the PSTAR model’s transition parameter, with values of 18.557 and 975.09, influences the smooth transition. These results suggest the existence of a non-linear relationship between BZ_Score and ESG. In addition, the Akaike and Schwarz information criteria are respectively equivalent to 8.108 and 8.183, and the RSS (Residual Sum of Squares) value is 7.6e + 7.

Table 10 PSTAR optimal threshold.

After the application of linearity tests, we move on to estimate the PSTAR model. Table 11 displays the results. This table shows three regimes, with the first regime representing an ESG lower than 33.314, the second regime representing an ESG framed between 33.314 and 63.348, and the last regime representing an ESG greater than 63.348.

Table 11 PSTAR estimation.

In the first regime (385 observations), a 1% increase in the ESG score decreases the BZ_Score by 0.245%. In the second regime (970 observations), a 1% increase in the ESG reduces the BZ_Score by 0.171%. In the third regime (499 observations), where the ESG score is higher than 63.348%, a 1% increase in the ESG score increases the BZ_Score by 0.173%.

The two transition functions of the ESG score, are in Fig. 2 and Fig. 3 below.

Fig. 2
figure 2

BZ_Score first estimated transition function of ESG.

Fig. 3
figure 3

BZ_Score second estimated transition function of ESG.

From these two figures, it is important to note that transitions between regimes are abrupt. The high value indicates that the ESG score has an immediate influence on stability. The continuous effect of the ESG score on stability suggests that changes in favor of ESG commitment have a significant and lasting influence.

Robustness tests: Granger-causality tests

In order to consolidate our findings on the link between ESG scores and banking stability in the digital era, we perform robustness tests based on the new Granger-causality tests, namely Dumitrescu and Hurlin (2012) and Juodis et al. (2021).

Table 12 presents the causal relationship between the model variables of Dumitrescu and Hurlin (2012). The results show a bi-directional causality between NIM, ROA, IUI, MCS, ATM, ESG, and BZ_Score, and a unidirectional relationship move from CIR and GDPG to BZ_Score.

Table 12 Dumitrescu and Hurlin (2012) causality test.

The bi-directional relationship found between engagement in ESG activities, and the stability of European banks is consistent with previous literature (Ben Abdallah et al., 2020; Gaies and Jahmane, 2022). Ben Abdallah et al. (2020) support the classical view that such engagement leads to higher costs that alter bank soundness. On the other hand, bank soundness favors sustainable development investments. In the same vein, Gaies and Jahmane (2022) confirm this bi-directional relationship, pointing out that the stability of European banks encourages engagement in ESG activities, which in turn positively affects their stability.

The estimates of the Granger-causality test of Juodis et al. (2021) are presented in Table 13 to consolidate the results below, which are nearly identical. The majority of the variables are causal for the variable BZ_Score, with the exception of ATM and GDPG, as determined by the jackknife estimator in demi-panel and the transversal estimation of variance with correction of heteroscedasticity.

Table 13 Juodis et al. (2021) causality test.

Discussion

To move away from the long-standing but unsuccessful quest to prove that the relationship between ESG performance and bank stability is either always positive or always negative, this article attempts to develop a contingent perspective that specifies under which conditions the impact is positive or negative, based on the hypothesis of non-linearity of the relationship. To this end, two hypotheses are put forward. In line with the overinvestment perspective, H3 (The relationship between ESG scores and bank stability is negative), and in line with the stakeholder perspective, H4 (The effect of European banks’ ESG scores on bank stability is positive). In addition, to account for the digital environment in which banks operate, this study has put forward two hypotheses on the relationship between digitization and bank stability, H1 (The diffusion of ICT has a positive impact on bank stability) and H2 (The technological endowment of the banking sector positively impacts bank stability). Finally, to account for the effect of ESG performance on the relationship between digitization and bank stability, hypothesis H6 (The impact of digitization on bank stability depends on the level of banks’ ESG scores) was issued.

In order to address the research question, we will execute a two-level analysis of the results. Initially, the interpretation will concentrate on the influence of ESG performance on bank stability in order to either confirm or disprove our research hypotheses (H3, H4, and H5). Secondly, the analysis will concentrate on the interpretation of the impact of digitization on bank stability in order to either confirm or refute our research hypotheses (H1 and H2) and to ascertain whether this effect is contingent upon the ESG score of the banks (H6).

ESG scores and bank stability

Regarding the impact of engagement in ESG activities on bank stability, the study found a negative impact in the first two regimes. Only in the third regime, when ESG scores reach 63.348 and above, does their effect on bank stability shift from weakening to stabilization. These results confirm the non-linear relationship between engagement in ESG activities and bank stability, supporting hypothesis H5. The results of the study are consistent with previous literature suggesting a non-linear relationship (Barnett and Salomon, 2012; Korinth and Lueg, 2022) and, more specifically, with Lupu et al. (2022), who, it should be recalled, support the existence of a non-linear relationship between ESG performance and the financial stability of European banks. The present study, therefore, provides new empirical evidence challenging the assumption of linearity in the relationship between ESG performance and bank stability supported by most of the empirical literature (Gangi et al., 2019; Di Tommaso and Thornton, 2020; Chiaramonte et al., 2022; Neitzert and Petras, 2022).

Regarding the negative effect of ESG scores on bank stability in the first two regimes, the results show that this negative effect becomes weaker in the second regime and then becomes positive in the third regime. These results are consistent with those of Barnett and Salomon (2012), who argue that engagement in social responsibility activities initially leads to a decrease in financial performance, but subsequently, increased investment in social responsibility leads to an increase in financial performance. By translating an improvement in financial performance into a reduction (improvement) in risk (stability), it becomes possible to explain the sequence of negative to positive effects of ESG performance on bank stability.

Therefore, in the absence of a sufficient ESG score (i.e., below 63.348) to transform their ESG investment into a financial return, banks with low ESG performance obtain a negative return on their ESG expenditure, translating into a negative impact on stability. Such results are in line with the classic view of overinvestment, which maintains a negative relationship between ESG performance and financial performance (Friedman, 1970) and consequently alters the bank’s stability. From this perspective, ESG investments are a waste of resources, acting negatively on the bank’s performance (Barnea and Rubin, 2010), resulting in increased risk and, therefore, bank fragility. These investments can be seen as agency costs in the sense that managers, concerned about their reputation, make them to the detriment of shareholders’ interests. From this point of view, these banks are considered riskier by investors. The results of the first two regimes thus confirm hypothesis H3. However, as banks invest more and exceed the ESG score of 33.314, the negative impact on bank stability becomes smaller (Regime 2). In other words, before adequate ESG scores (i.e., scores below 63.348), engagement in ESG activities continues to alter bank stability, but to a lesser extent.

In contrast, banks with ESG scores equal to or above 63.348 can transform ESG investments into financial returns, thereby strengthening their stability. Although banks with higher ESG scores have higher costs, these costs are offset by the positive financial returns generated by these additional ESG investments, in line with previous literature (Barnett and Salomon, 2012). The more a bank engages in ESG activities, the more it demonstrates greater transparency, enabling it to garner greater support from stakeholders (W Azmi et al., 2021). Furthermore, these results are close to those of Lupu et al. (2022), who show that high levels of stability are recorded when ESG values are around their 60% percentile. This positive effect of ESG score on bank stability is in line with stakeholder theory, which is consistent with previous literature highlighting that high ESG scores are associated with lower bank insolvency risk (Aevoae et al., 2023; Neitzert and Petras, 2022; Cerqueti et al., 2021; Scholtens and van’t Klooster, 2019). Thus, hypothesis H4 is only retained if ESG scores reach the threshold of 63.348 and above.

Overall, our results suggest that critics and proponents of banks’ commitment to social responsibility are, to some extent, right. Indeed, low or moderate ESG performance (Regimes 1 and 2) is consistent with the conventional view, while high ESG performance (Regime 3) is consistent with the stakeholder view. Thus, our results support Barnett’s (2007) theoretical argument that social responsibility commitment enables a stakeholder influence capacity that enhances the ability to convert ESG investments into financial returns and, consequently, improves bank stability. The accumulation of this stakeholder influence capacity means that the benefits of ESG performance increase at a higher rate than the costs, enabling a shift from a negative relationship between ESG performance and bank stability to a positive one.

The role of ESG performance in the relationship between digitization and bank stability

The digital era in which European banks operate favors their stability through the spread of ICT. This evidence follows from the results presented above (Table 13), showing that ICT diffusion in terms of the Internet (IUI) and mobile devices (MCS) has a positive impact on bank stability in all three regimes. These results are consistent with previous literature supporting the “innovation-stability” hypothesis in studies concerning the European banking sector (Del Gaudio et al., 2021), thus confirming the first H1 research hypothesis.

In contrast, the technological endowment of the European banking sector, as reflected by ATM penetration, had a negative impact on bank stability in the first regime, which corresponds to an ESG score below 33.314. This result is in line with the literature supporting the “innovation-fragility” vision (Instefjord, 2005; Fostel and Geanakoplos, 2012; Gennaioli et al., 2012), highlighting the distortions introduced by digitization and its contribution to significant risk-taking by banks. On the other hand, the impact of the technological endowment of the European banking sector on bank stability becomes positive for the second regime, where the ESG score is framed between 33.314 and 63.348, and for the last regime, where the ESG score is equal to or above 63.348. Thus, the endowment of technological infrastructure, which reflects the ICT investments of European banks, keeps them away from the risk of insolvency. This positive effect is in line with the extensive literature supporting the “innovation-stability” vision (Hasan et al., 2012; Del Gaudio et al., 2021; Kasri et al., 2022).

This ATM variable, which reflects access to financial services through ICT investments by European banks, with a negative impact on bank stability in the first regime, invalidates research hypothesis H2. However, in the other two regimes, hypothesis H2 is confirmed. Therefore, an interpretation based on the threshold variable is required.

According to the results, European banks with a low ESG score (i.e., below 33.314) are unable to avoid the fragility caused by investments in technological infrastructure. Only in the second regime (i.e., an ESG score between 33.314 and 63.348), do banks manage to avoid the risk of insolvency. In other words, banks with a low commitment to ESG activities become riskier in the digital age. These results are consistent with the literature, which argues that digitization could improve banks’ financial stability by promoting social responsibility (Forcadell et al., 2020; Stefanovic et al., 2021; Salah Mahdi et al., 2023), thus confirming hypothesis H6.

Finally, the study’s findings are consistent with previous literature and provide additional empirical insights into the relationship between ESG performance and bank stability in the digital age. Nevertheless, the discrepancies in the present study’s sample, in terms of both country and bank selection, preclude a direct comparison with previous literature. This calls for further research in this area.

Conclusion and policy recommendations

Unlike researchers who have studied the relationship between bank stability and ESG performance, this study integrates the digital environment in which the banking system evolves to capture the relationship between ESG performance, digitization, and bank stability. From a methodological standpoint, this paper opted for a regime-switching model to test the hypothesis of a non-linearity of the relationship not yet considered in previous studies. Estimates from the PSTAR model revealed evidence, for the occurrence of three regimes with different characteristics. The first regime is characterized by low ESG scores, the second by moderate ESG scores, and the last by high ESG scores.

Overall, our results suggest that critics and proponents of banks’ commitment to social responsibility are, to some extent, right. Indeed, low or moderate ESG performance (Regime 1 and Regime 2) is congruent with the classical perspective, while high ESG score (Regime 3) is consistent with the stakeholder view. So, our results back up Barnett’s (2007) theory that a commitment to social responsibility gives stakeholders more power, which makes it easier to turn ESG investments into financial returns and, in turn, makes banks more stable. The accumulation of this stakeholder influence capacity means that the benefits of ESG score increase at a rate that exceeds the costs, enabling a shift from a negative relationship between ESG performance and bank stability to a positive one. Additionally, banks with a low commitment to ESG activities become riskier in the digital age. These results are consistent with the literature, which argues that digitization could improve banks’ financial stability by promoting social responsibility (Forcadell et al., 2020; Stefanovic et al., 2021; Salah Mahdi et al., 2023).

From an empirical point of view, this study makes three important contributions. First, this paper has provided empirical evidence that the relationship is characterized by non-linearity and contingent upon the level of ESG scores, complementing the recent literature showing the positive impact of high ESG scores on bank stability. Second, to consider the impact of the digital age, this study also examines the impact of ICT diffusion and endowment on bank stability, controlling for ESG scores. To our knowledge, this is the first study to link the impact of digitization on bank stability to ESG activities. Third, this article has highlighted its contribution to the empirical literature from a methodological point of view by opting for the PSTAR model, which allowed us to estimate ESG score thresholds and their impact on the stability of European banks.

In terms of important implications, it is possible to distinguish between those for European banks and those for the Union’s policymakers. Indeed, given the positive impact of ICT on bank stability, European banks need to embrace digitization as a means of better exploiting the opportunities offered by the single market by relaxing the subsidiary-based model and opting for the free provision of services. Furthermore, knowing the threshold for a positive impact of ESG scores on bank stability should encourage them to improve their ESG performance by adopting a digital strategy based on a business model overhaul and improved organizational sustainability. On the policy side, the findings suggest that Europe, and more specifically, the European Commission, needs to ensure that the overall framework for regulating and supervising bank stability is adapted to the changes brought about by digitization and the quest for sustainability. While remaining neutral, policymakers need to strengthen the regulatory infrastructure to support their joint evolution.

Despite these important implications, this study has limitations that suggest areas for future research. Indeed, considering ESG sub-pillars would enable us to capture the relative importance of each pillar in the relationship between ESG performance and bank stability in the digital age. In addition, considering other variables, such as banks’ investments in digital technologies, would provide a more robust basis for understanding the relationship between ESG performance, bank stability, and banks’ digital transformation.