Introduction

The renewable energy sector has grown dramatically over the last decade, accompanied by increased market capitalization and investment flows. The “Energy Transition Investment Trends 2024" report by BloombergNEF indicates a 17% rise in global investments in the low-carbon energy transition in 2023, totaling around $1.77 trillion (BloombergNEF, 2024). In tandem, the International Energy Agency’s (IEA) “Renewables 2023" publication reveals a nearly 50% increase in global annual renewable capacity additions, reaching a record 510 gigawatts (IEA, 2023). This is the fastest growth rate in two decades and the 22nd consecutive year of record-setting additions (IEA, 2023). These trends highlight the sector’s resiliency and critical role in promoting economic growth and innovation to ensure a sustainable future. According to the IEA, while the world is on track to more than double renewable energy capacity by 2030, further support is required to meet the net-zero emissions objective by mid-century (IEA, 2024).

Despite this expansion, the clean energy stock markets have experienced increased volatility and risk, with sharp price fluctuations and market corrections. The sector experienced substantial declines in the early phases of the COVID-19 pandemic in 2020 (Liu et al., 2022), as global markets responded to economic instability. The sector’s fast rebound, fueled by investor expectations of a green shift in global energy regulations, was followed by further volatility into 2021 (NS Energy, 2021). For example, the WilderHill Clean Energy Index (ECO), which tracks the performance of companies in the renewable energy and conservation sectors, has undergone major corrections after reaching record highs (WilderShares LLC, 2022a). This volatility continued into 2022, influenced by market forces such as supply-chain issues, inflation concerns, and geopolitical tensions (WilderShares LLC, 2022b). The persistent volatility highlights the importance of investors using accurate risk assessment and management tools to efficiently navigate the complexity of the clean energy market.

The burgeoning field of clean energy equities has garnered substantial scholarly attention, particularly in relation to traditional energy and financial assets like oil and technology stocks. Existing literature extensively examines mean returns, volatility, tail risk spillovers, and price forecasting in this domain (e.g., Dutta et al., 2020a; Inchauspe et al., 2015; Managi and Okimoto, 2013; Reboredo et al., 2017; Sadorsky, 2012a, 2022; Saeed et al., 2021). However, despite the high-risk profile associated with clean energy investments (e.g., Ortas and Moneva, 2013; Rezec and Scholtens, 2017), there remains a critical shortcoming in the literature regarding effective risk management strategies, particularly concerning hedging techniques and volatility risk mitigation (e.g., Ahmad et al., 2018; Gustafsson et al., 2022). Notably, the prediction of tail risks in clean energy equities has been under-explored, with limited examination of how distributional features and volatility dynamics affect forecasting accuracy (Pradhan and Tiwari, 2021; Zhuo et al., 2023). Furthermore, prior research has not rigorously investigated the performance of predictive models across different investment strategies specifically distinguishing between long and short positions, nor has it compared the efficacy of various tail risk models like Value at Risk (VaR) and Expected Shortfall (ES).

These research gaps are particularly significant because modeling tail risk during market stress remains a major challenge in financial econometrics. The 2008 financial crisis exposed the limitations of standard VaR models and prompted extensive research into more robust approaches (Berger and Missong, 2014; Chen et al., 2012), including comparative studies aimed at identifying optimal VaR specifications for turbulent periods (Abad et al., 2016; Louzis et al., 2014; Tran and Tran, 2023). Notably, Halbleib and Pohlmeier (2012) showed that, although individual VaR models often failed during crises, their performance could be improved through data-driven forecast combination techniques, highlighting the importance of addressing model misspecification risk. Rather than proposing new methodologies, this study contributes to the literature by systematically applying established models to clean energy assets, an area still underexplored in risk modeling. Specifically, we investigate whether carefully tailoring model specifications for long and short positions can serve as an alternative path to improving VaR forecasts, parallel to the improvements achieved by combination methods in the existing VaR literature (Chiu et al., 2010; Taylor, 2020). In doing so, we extend robust post-crisis frameworks to address the unique risks of clean energy investments.

Addressing these deficiencies is crucial for several interconnected reasons. As the clean energy sector experiences unprecedented growth and attracts significant investment flows, accurate risk assessment and management have become increasingly vital (Demiralay et al., 2023). The absence of robust, sector-specific risk models creates fundamental vulnerabilities whereby investors systematically underestimate potential losses, resulting in suboptimal capital allocation that hinders sectoral development. This concern becomes particularly acute for tail risk, i.e. the probability of rare but severe, high-impact events. Empirical evidence demonstrates that clean energy asset returns exhibit pronounced “fat tails" and negative skewness, indicating that extreme losses occur with significantly greater frequency than predicted by standard normal distributions (Zhang et al., 2023a). Traditional risk frameworks that assume normality systematically underestimate both the probability and magnitude of worst-case scenarios, exposing investors to substantial downside risk (Sheikh and Qiao, 2010). Consequently, developing models capable of capturing these non-normal characteristics represents a central methodological challenge, particularly as regulatory frameworks evolve to support sustainable investments, creating urgent demand for sophisticated risk measures in this inherently volatile sector.

The asymmetric nature of volatility in clean energy markets further compounds these modeling challenges and necessitates increasingly nuanced analytical approaches (Zhang et al., 2023). Financial markets exhibit a well-documented phenomenon whereby negative news generates disproportionately larger increases in future volatility compared to positive news of equivalent magnitude-a relationship known as the leverage effect (Black, 1976). This asymmetric volatility response proves particularly pronounced within the clean energy sector, which demonstrates heightened sensitivity to adverse regulatory announcements, supply chain disruptions, and technological failures (Hassan, 2023; Zhao, 2020). Risk models that fail to incorporate this asymmetric behavior will systematically miscalculate volatility escalation following negative market events, thereby leaving long-position investors inadequately prepared for subsequent market turbulence. These empirical realities provide direct motivation for employing asymmetric volatility models specifically designed to capture leverage effects (Engle and Ng, 1993).

The distinction between long and short position risk profiles represents another critical dimension that has received insufficient attention in existing literature. Long positions, predominantly held by institutional investors optimistic about clean energy prospects, require protection against sudden market downturns. Conversely, short positions, employed by hedge funds and risk managers for speculative or hedging purposes, face qualitatively different exposures, including potential unlimited losses during rapid price appreciation. Despite these divergent risk characteristics, the majority of financial risk literature maintains an implicit long-only focus, concentrating on modeling downside movements while leaving short position dynamics under-explored. This constitutes a critical gap, as models optimized for downside risk may prove ill-suited for quantifying upside risk exposures (Angelidis and Degiannakis, 2005). The present research addresses this limitation by proposing a dual-risk framework that independently assesses model performance for both strategies, providing market participants with a more complete and practically relevant risk management toolkit.

Overall, this study fills these significant gaps by making the following three innovative contributions:

First, we introduce and validate a dual-risk framework for the clean energy sector. To our knowledge, this is the first study to systematically model and backtest tail risk for both long and short positions in clean energy equities. By demonstrating that optimal risk models differ significantly between these two strategies, we challenge the conventional one-size-fits-all approach and provide more targeted, practical insights for a wider range of market participants, from long-term investors to hedge funds.

Second, we improve the methodology of risk modeling by tailoring volatility models to the unique characteristics of clean energy stocks. We refine a set of established conditional volatility models chosen to represent symmetric, asymmetric, and highly persistent volatility structures, by integrating sophisticated distributional assumptions that more accurately capture the observed skewness and fat tails in clean energy returns. This methodological enhancement increases the predictive power of these models for this specific, under-researched asset class.

Third, we provide robust empirical validation across multiple market regimes. Our analysis is not limited to a single market environment but draws on over a decade of data, including major market upheavals like the 2008 financial crisis and the COVID-19 pandemic. By rigorously backtesting our models through these periods of extreme stress, we offer strong evidence of their reliability and real-world applicability, providing actionable insights for investors, risk managers, and policymakers navigating this volatile sector.

The remainder of the article is organized as follows. Section 2 provides a literature review. Section 3 outlines the models and evaluation methods. Section 4 explains the data and presents the empirical findings. Section 5 concludes.

Literature review

The intricate interdependencies between clean energy stock prices and other financial assets are pivotal to understanding their risk profiles. Pioneering work in this domain reveals that renewable energy equities are significantly influenced by technology shares and oil prices, suggesting a unidirectional impact (Henriques and Sadorsky, 2008). Kumar et al. (2012) substantiates this by revealing a stronger linkage with technology equities over oil prices. The 2008 financial crisis exacerbated the correlation between oil prices and the valuation of renewable and technology companies (Sadorsky, 2012a). This trend continued, with Inchauspe et al. (2015) observing a significant influence of the MSCI World index and technology shares on renewable energy investments post-2007, albeit with an increasing influence of oil prices. In contrast, Dutta et al. (2020a) emphasizes an increased sensitivity of green investments to oil market volatility rather than price fluctuations. Moreover, leveraging advanced machine learning techniques, Sadorsky (2022) has identified technical indicators and market volatility measures as key predictors of clean energy stock prices, highlighting the nuanced relationships these stocks have with technology and oil assets.

The second strand of literature examines extreme risk spillovers in the renewable energy sector. Tan et al. (2021) uses a spillover index and multivariate quantile models to uncover asymmetric and time-varying risk relationships between oil and clean energy equities, identifying the oil market as a net recipient of volatility. Zhang et al. (2023) builds on volatility models that link short-term market risk to long-term macroeconomic trends, emphasizing the importance of asymmetric market reactions and extreme events. Studies by Di Febo et al. (2021) and Saeed et al. (2021) contribute to the discussion of tail risk dynamics, highlighting the significance of considering market asymmetry during times of stress. Other researchers such as Hassan (2023), Xia et al. (2019), and Fu et al. (2022) investigated how external factors like energy security and macroeconomic indicators affect the volatility of clean energy stock during volatile market periods. Naeem and Arfaoui (2023) and Syuhada et al. (2024) used conditional risk models to explore systemic risk and interconnectedness in energy markets, while Zhang et al. (2023b) and Chen et al. (2022) found increased interconnectedness during extreme market conditions.

From a risk management perspective, the high volatility of renewable energy stocks has attracted considerable academic interest. Sadorsky (2012b) and Ortas and Moneva (2013) examined the key risk drivers for renewable energy companies, with Ortas and Moneva (2013) identifying a structural shift in risk levels during the financial crisis. More recently, Lehnert (2023) and Ghosh et al. (2022) have drawn parallels with historical market bubbles, suggesting that clean energy stocks may be subject to similar market dynamics. From a hedging viewpoint, Ahmad et al. (2018) and Gustafsson et al. (2022) investigated optimal hedges for clean energy stocks, with the latter finding that traditional energy metals are not effective hedges for clean energy equities. Moreover, Troster et al. (2020), Pradhan and Tiwari (2021) and Zhuo et al. (2023) have made notable contributions by using the VaR or ES techniques to assess market risk, with the research by Zhuo et al. (2023) highlighting the superior performance of conditional volatility models, particularly during the COVID-19 pandemic. Despite the extensive research, gaps remain. The analysis by Troster et al. (2020), Pradhan and Tiwari (2021) and Zhuo et al. (2023) focused on long positions while ignoring the impact of short positions on risk modeling. The reliance on traditional conditional volatility models by Pradhan and Tiwari (2021) and Zhuo et al. (2023) fails to account for critical volatility aspects such as leverage effect and long memory, which are necessary for accurate tail risk forecasting. Furthermore, these studies lack a comparative study across different market stress levels, as well as direct comparisons between VaR and ES projections, both of which are crucial for determining the relative usefulness of these risk measures. This research gap serves as the foundation for our investigations.

Methods and models

Value at Risk (VaR) serves as a pivotal risk metric mandated by the Basel Committee on Banking Supervision (BCBS) in 1996 for quantifying the potential maximum loss a financial portfolio may incur over a specified time horizon, typically one day, at a chosen confidence level. The formula for VaR at a confidence level \(\left(1-\alpha \right)\) is \({\rm{Pr}}\,\left({r}_{t+h}\le {{\rm{VaR}}}_{t+h}^{\alpha }\right)=\alpha\) where rt+h denotes the return of the portfolio at time t + h. Expected Shortfall (ES), also known as Conditional VaR, extends the VaR concept by not only quantifying the potential loss but also providing the expected size of loss given that the VaR threshold has been breached. The BCBS has recommended a shift in the regulatory framework from using a 99% confidence level VaR to a 97.5% confidence level ES for a more nuanced quantification of risk BCBS (2016) .

The logarithmic daily return of a financial asset, denoted by rt, is modeled as follows:

$${r}_{t}={\mu }_{t}+{\varepsilon }_{t}=\left({c}_{0}+{c}_{1}{r}_{t-1}\right)+{\varepsilon }_{t}$$
(1)
$${\varepsilon }_{t}={\sigma }_{t}{z}_{t}$$
(2)

Here, the conditional mean, μt, is modeled as a first-order autoregressive AR(1) process to account for any serial correlation in the returns (Troster et al., 2020). In this mean equation, c0 represents the constant term (intercept), and c1 is the autoregressive coefficient that captures the effect of the previous day’s return rt−1 on the current day’s return. The term εt is the residual at time t, which is composed of the conditional volatility, σt, and an independent and identically distributed (i.i.d.) random variable, zt, with a mean of zero and variance of one.

For the purpose of one-day ahead risk assessment, the VaR and ES are estimated using the following expressions (Brio et al., 2020):

$${\,\text{VaR}\,}_{t}^{\alpha }={\hat{\mu }}_{t}+{\hat{\sigma }}_{t}{\hat{F}}_{z}^{-1}(\alpha )$$
(3)
$${\,\text{ES}\,}_{t}^{\alpha }={\hat{\mu }}_{t}+{\hat{\sigma }}_{t}{\hat{S}}_{z}(\alpha )$$
(4)

Here, \({\hat{\mu }}_{t}\) and \({\hat{\sigma }}_{t}\) represent the predicted values for the daily conditional mean and conditional volatility, respectively. \({\hat{F}}_{z}^{-1}(\alpha )\) signifies the inverse cumulative distribution function (CDF) of the standardized innovations at the α quantile, and \({\hat{S}}_{z}(\alpha )\) represents the expected value of z given that z is less than its α quantile.

As demonstrated by Equations (3) and (4), the accuracy of one-day-ahead VaR and ES forecasts depends critically on the precise estimation of two key components: the conditional volatility σt and the quantile of the standardized innovations \({F}_{z}^{-1}(\alpha )\). The volatility forecast scales the overall risk estimate, capturing the time-varying magnitude of market fluctuations, while the innovation distribution determines the shape of the tail risk, particularly in extreme market conditions. Inaccurate volatility modeling leads to risk estimates that are consistently biased, while incorrect distributional assumptions result in fundamental miscalculations of tail probabilities. The following subsections detail the specific models used to estimate these two primary drivers of tail risk.

Volatility models

A core feature of financial returns is the phenomenon of volatility clustering, where periods of high volatility tend to be followed by more high volatility, and calm periods are followed by calm periods (Engle, 1982). The Generalized Autoregressive Conditional Heteroskedasticity (GARCH) family of models is specifically designed to capture this time-varying nature of financial risk (Bollerslev, 1986).

The decision to utilize GARCH-type models for volatility analysis in clean energy stocks is anchored in both theoretical and empirical considerations. They are widely recognized for their ability to capture stylized facts of financial time series, such as volatility clustering and leptokurtosis (Bollerslev, 1986; Engle, 1982). These features are particularly pertinent in the clean energy sector, where returns are often characterized by rapid technological change, policy uncertainty, and shifting investor sentiment (Athari and Kirikkaleli, 2025; Song et al., 2019). Moreover, GARCH models offer a practical balance between flexibility and parsimony, delivering reliable performance while remaining computationally efficient and interpretable for practitioners (Francq and Zakoian, 2019). This transparency and theoretical foundation are crucial for risk management, particularly when compared to advanced machine learning techniques, which often function as “black boxes" with high data requirements and increased risk of overfitting (Alessi and Savona, 2021; Wang, 2024). Given their extensive validation and widespread adoption, GARCH-type models provide a robust and comparable framework for our analysis (Christoffersen, 2012).

In this study, we employ three distinct GARCH specifications to test competing hypotheses about the volatility dynamics characterizing clean energy stocks. The standard GARCH model (Bollerslev, 1986) serves as our foundational benchmark, assuming symmetric volatility responses. To test for the leverage effect, we employ the Glosten, Jagannathan, and Runkle (GJR) GARCH specification (Glosten et al., 1993), which allows for asymmetric volatility responses to positive and negative news. Additionally, to investigate whether volatility shocks exhibit exceptional persistence, we use the Integrated GARCH (IGARCH) model (Engle and Bollerslev, 1986), which allows for near-permanent effects of market disturbances. The specific mathematical formulations for each model are detailed in the following subsections.

Standard GARCH

The GARCH model, introduced by Bollerslev (1986), is a widely used framework for modelling volatility in financial time series. The GARCH(1,1) model, in particular, is characterized by its ability to capture phenomenon of volatility clustering, where large changes in asset prices are often followed by further large changes, and periods of small price changes are followed by further small changes. The GARCH(1,1) model for the conditional variance of a time series rt is given by:

$${\sigma }_{t}^{2}=\omega +\alpha {\varepsilon }_{t-1}^{2}+\beta {\sigma }_{t-1}^{2}$$
(5)

Where ω is a constant term, and α and β are parameters to be estimated, with α + β < 1, ensuring that the conditional variance \({\sigma }_{t}^{2}\) remains positive and stationary. The persistence of volatility is captured by the sum of α and β. Specifically, as the sum α + β approaches 1, the decay rate of the autocorrelation associated with the conditional variance σt decreases toward zero. This implies that shocks to volatility have an increasingly persistent effect.

GJRGARCH

The GJRGARCH model is an extension of the standard GARCH model that allows for asymmetric effects of past shocks on volatility (Glosten et al., 1993). This model is particularly useful when the impact of negative and positive shocks on future volatility is different, a feature known as the leverage effect. This asymmetry is particularly relevant for clean energy equities, which are highly susceptible to regulatory announcements and technological developments (Li, 2023; Zhao et al., 2018). The GJRGARCH(1,1) model is specified as follows:

$${\sigma }_{t}^{2}=\omega +\left[\alpha +\gamma I\left({\varepsilon }_{t-1} < 0\right)\right]{\varepsilon }_{t-1}^{2}+\beta {\sigma }_{t-1}^{2}$$
(6)

where \(I\left(\cdot \right)\) is an indicator function that takes the value of 1 if εt−1 < 0 and 0 otherwise, and γ captures the additional impact of negative shocks on future volatility. If γ > 0, negative shocks increase volatility more than positive shocks.

IGARCH

The IGARCH model (Engle and Bollerslev, 1986) is a special case of the GARCH(1,1) model in which the sum of α and β is constrained to equal one, implying a unit root in the GARCH process and thus persistence of shocks to the variance over an infinite horizon. This specification is well-suited for clean energy markets, which often experience protracted periods of heightened volatility during policy debates and technology development cycles (Athari and Kirikkaleli, 2025). The IGARCH(1,1) model is specified as follows:

$${\sigma }_{t}^{2}=\omega +\alpha {\varepsilon }_{t-1}^{2}+\beta {\sigma }_{t-1}^{2}$$
(7)

where α + β = 1. This specification suggests that shocks to the conditional variance have a persistent effect on future volatility levels.

Innovation models

After accounting for time-varying volatility with a GARCH model, the remaining standardized residuals (zt in Equation (2)) are known as innovations. The distributional assumption made about these innovations is critical for accurately forecasting tail risks like VaR and ES. Although GARCH-type models with normal innovations can generate data with unconditionally fat tails, they are insufficient to account for all of the unconditional leptokurtosis and skewness observed in financial return series (Kuang, 2021). This is particularly important for clean energy markets, which are often subject to extreme events such as sudden price jumps or collapses driven by policy changes and technological breakthroughs.

To address this empirical reality, our study follow Kuang (2022) to evaluates four distinct innovation distributions, each representing a different approach to capturing the true characteristics of clean energy stock returns: The Normal distribution serves as the conventional benchmark, despite its likely mis-specification for financial data. The skewed Student’s t-distribution provides a flexible parametric framework to simultaneously capture both asymmetry and heavy tails Fernandez and Steel (1998). The Filtered Historical Simulation (FHS) offers a non-parametric alternative that makes no distributional assumptions, instead utilizing the empirical distribution of historical innovations directly Barone-Adesi et al. (1999). The Cornish-Fisher expansion represents a semi-parametric compromise, adjusting normal distribution quantiles based on the observed higher-order moments of the innovation series Favre and Galeano (2002). The mathematical specifications and risk forecasting equations for each approach are detailed in the subsequent subsections.

Normal distribution

The normal distribution assumption is both the simplest and the most commonly used for financial return innovations, mainly because of its analytical convenience. Assuming that the standardized residual zt adheres to a standard normal distribution, its cumulative distribution function delineated as follows:

$$\Phi (x)=\frac{1}{\sqrt{2\pi }}\mathop{\int}\nolimits_{\!\!-\infty }^{x}{e}^{-{t}^{2}/2}$$
(8)

The next days’ risk forecasts are provided as below (Brio et al., 2020):

$${\,\text{VaR}\,}_{t,N}^{\alpha }={\hat{\mu }}_{t}+{\hat{\sigma }}_{t}{\Phi }^{-1}(\alpha )$$
(9)
$${\,\text{ES}\,}_{t,N}^{\alpha }={\hat{\mu }}_{t}+{\hat{\sigma }}_{t}\frac{1}{\alpha }\phi \left[{\Phi }^{-1}(\alpha )\right]$$
(10)

In this context, ϕ represents the probability density function (pdf) of a standard normal distribution, while Φ−1(α) denotes the corresponding quantile for an arbitrary value of α.

Skewed student distribution

The skewed Student’s t-distribution, as proposed by Fernandez and Steel (1998), accommodates both skewness and excess kurtosis in the distribution of innovations. The quantile function for the skewed Student’s t-distribution can be written as below (Lambert and Laurent, 2001):

$${c}_{\alpha ,\nu ,\xi }^{skst}=\left\{\begin{array}{ll}\left\{\frac{1}{\xi }{c}_{\alpha ,\nu }^{st}\left[\frac{\alpha }{2}\left(1+{\xi }^{2}\right)\right]-m\right\}/s,\,\qquad\,\text{if}\,\ \alpha < \frac{1}{1+{\xi }^{2}}\\ \left\{-\xi {c}_{\alpha ,\nu }^{st}\left[\frac{1-\alpha }{2}\left(1+{\xi }^{-2}\right)\right]-m\right\}/s,\,\qquad\,\text{if}\,\ \alpha \ge \frac{1}{1+{\xi }^{2}}\end{array}\right.$$
(11)

where \({c}_{\alpha ,\nu ,\xi }^{skst}\) represents the αth quantile of the skewed student distribution with unit variance, characterized by ν > 2 degrees of freedom and an asymmetric parameter ξ > 0. The term \({c}_{\alpha ,\nu }^{st}\) denotes the quantile function of the standardized Student-t density function. In addition, m and s stand for the non-standardized skewed student distribution’s mean and standard deviation, respectively. Consequently, the next days’ risk forecasts are provided as follows:

$${\,\text{VaR}\,}_{t,skst}^{\alpha }={\hat{\mu }}_{t}+{\hat{\sigma }}_{t}{c}_{\alpha ,\nu ,\xi }^{skst}$$
(12)
$${\,\text{ES}\,}_{t,skst}^{\alpha }={\hat{\mu }}_{t}+{\hat{\sigma }}_{t}{\mathbb{E}}\left[z| z < {c}_{\alpha ,\nu ,\xi }^{skst}\right]$$
(13)

Filtered historical simulation

The Filtered Historical Simulation (FHS) method is a semi-parametric technique Barone-Adesi et al. (1999). This approach uses a non-parametric estimator for the residual distribution and parametric models for the mean and volatility dynamics. Subsequently, the next days’ risk forecasts are provided as follows:

$${\,\text{VaR}\,}_{t,HS}^{\alpha }={\hat{\mu }}_{t}+{\hat{\sigma }}_{t}{\hat{H}}^{-1}(\alpha )$$
(14)
$${\,\text{ES}\,}_{t,HS}^{\alpha }={\hat{\mu }}_{t}+{\hat{\sigma }}_{t}{\mathbb{E}}\left[z| z < {\hat{H}}^{-1}(\alpha )\right]$$
(15)

where the α quantile of the empirical distribution of the standardized residual zt is indicated by \({\hat{H}}^{-1}(\alpha )\).

Cornish-Fisher expansion

The Cornish-Fisher expansion, as proposed by Favre and Galeano (2002), is a semi-parametric method that modifies the quantiles of a normal distribution to incorporate skewness and kurtosis. The α quantile of a distribution is defined as below.

$${\delta }_{{\rm{CF}}}(\alpha )=Z(\alpha )+\frac{1}{6}\left(Z{(\alpha )}^{2}-1\right)S+\frac{1}{24}\left(Z{(\alpha )}^{3}-3Z(\alpha )\right)K-\frac{1}{36}\left(2Z{(\alpha )}^{3}-5Z(\alpha )\right){S}^{2}$$
(16)

where Z(α) represents the standard normal distribution’s α quantile. Additionally, S and K symbolize the innovation distribution’s excess kurtosis and skewness, respectively. The risk forecasts for the upcoming day are shown below:

$${\,\text{VaR}\,}_{t,CF}^{\alpha }={\mu }_{t}+{\hat{\sigma }}_{t}{\delta }_{{\rm{CF}}}(\alpha )$$
(17)
$${\,\text{ES}\,}_{t,CF}^{\alpha }={\hat{\mu }}_{t}+{\hat{\sigma }}_{t}{\mathbb{E}}\left[z| z < {\delta }_{{\rm{CF}}}(\alpha )\right]$$
(18)

Backtesting

This study adheres to the analytical framework of Kuang (2022) by employing a dual approach to backtesting methods. We integrate conventional VaR backtesting techniques with advanced ES backtesting methods, which provide a more nuanced evaluation of model performance in capturing tail risk. This comprehensive approach is further complemented by the evaluation of loss functions, which serve as a quantitative measure to compare the relative efficacy of various forecasting models. By using these robust methodologies, our analysis aims to provide a rigorous assessment of the predictive capabilities of models, ensuring a more accurate and reliable risk management strategy for investors and practitioners in the clean energy sector.

Value-at-risk

The Actual over Expected (AE) Exceedance Ratio is a simple backtesting metric that compares the number of exceedances to the expected number under the null hypothesis of model adequacy. A ratio close to one indicates a well-calibrated model, while a ratio significantly above or below than one suggests over- or underestimation of risk by the model, respectively. More strict backtesting procedures are based on the statistical tests described below.

The Unconditional Coverage Test, denoted as LRuc by Kupiec (1995), evaluates the congruence of exceedance proportions with anticipated levels over a specified time frame. This test assumes that exceedances are independently and identically distributed, an assumption that is often not met in financial returns. Under the null hypothesis that the model provides correct unconditional coverage, the LRuc test statistic is asymptotically distributed as a chi-squared \(\left({\chi }^{2}\right)\) variable with one degree of freedom.

The Conditional Coverage Test, proposed by Christoffersen (1998) and referred to as LRcc, combines the Unconditional Coverage Test with a test for exceedance independence. This provides a joint test statistic capable of identifying both correct coverage and temporal independence of exceedances. The resulting LRcc test statistic follows an asymptotic chi-squared \(\left({\chi }^{2}\right)\) distribution with two degrees of freedom under the null hypothesis of correct conditional coverage.

Finally, the Dynamic Quantile (DQ) test, introduced by Engle and Manganelli (2004), uses a regression-based methodology to assess whether VaR exceedances can be predicted on the basis of their previous values or other data points. This provides a direct assessment of correct conditional coverage and is adept at incorporating dynamic elements into quantile predictions. The DQ test statistic is also asymptotically distributed as a chi-squared \(\left({\chi }^{2}\right)\) variable, with the degrees of freedom equal to the number of included lags and other explanatory variables in the test regression (excluding the intercept).

Expected shortfall

The Exceedance Residuals (ER) test, as introduced by McNeil and Frey (2000), serves as a prevalent method for evaluating ES models. This test quantifies the ER as the difference between the actual return and the ES forecast when a VaR breach occurs. The ER test operates under the premise that a well-calibrated risk model should result in an ER with an expected value of zero. To validate this, the bootstrap method, pioneered by Efron and Tibshirani (1993), is utilized. This method employs a one-tailed test to assess whether a negative mean ER suggests a systematic bias in the ES forecasts, indicating that the model consistently underestimates the ES.

The ES regression (ESR) test, proposed by Bayer and Dimitriadis (2022), approaches the evaluation of ES forecasts by treating the conditional ES as a linear function of returns. In this framework, returns are the dependent variable, and the ES forecasts, which include an intercept term, are the independent variables. For accurate specification of ES forecasts, the intercept and slope parameters should ideally be zero and one, respectively. To further refine the assessment, Bayer and Dimitriadis (2022) recommend an Intercept ESR test. This test focuses on the intercept term, ensuring that the ES forecasts are not consistently biased low. By constraining the slope parameter to one, the method isolates the intercept for evaluation, providing a targeted test for the accuracy of ES predictions.

MCS tests

The Model Confidence Set (MCS) approach, as developed by Hansen et al. (2011), is employed in this study to evaluate and compare the performance of various forecasting models. The MCS method is a statistical tool designed to identify a subset of models that outperform others at a specified confidence level. In this research, MCS testing is conducted at a 90% confidence level, utilizing different loss functions to assess the predictive accuracy of risk forecasts. Models that exhibit lower average loss levels over the projection period are deemed to have superior performance, providing a more nuanced evaluation than individual statistical tests.

The Quasi-Likelihood (QLIKE) loss function is utilized to evaluate the accuracy of volatility predictions, with daily squared returns serving as a surrogate for realized volatility (RV). The QLIKE function is asymmetric, penalizing under-forecasting more severely than over-forecasting. This property, along with its robustness against noise in the volatility proxy, as demonstrated by Patton (2011), makes it a preferred choice over the Mean Squared Error (MSE) for assessing volatility forecasts. The QLIKE function’s reduced sensitivity to outliers further enhances its suitability for evaluating volatility models. The QLIKE function is defined as:

$${\text{QLIKE}}_{t}=\frac{{\text{RV}}_{t}}{{\hat{\sigma }}_{t}^{2}}-\ln \left(\frac{{\text{RV}}_{t}}{{\hat{\sigma }}_{t}^{2}}\right)-1$$
(19)

In the context of quantile forecast evaluation, the Tick Loss Function (TLF) is employed. This technique, derived from quantile regression as proposed by Koenker (1978), is an asymmetric loss function that imposes substantial penalties on observations that fall below the VaR threshold. The TLF’s design ensures that the evaluation of VaR forecasts is sensitive to the magnitude of underestimations, making it a valuable tool for assessing the reliability of quantile forecasts in risk management applications.

$${\,\text{TLF}\,}_{t}^{\alpha }=\left(\alpha -{I}_{{r}_{t} < {\text{VaR}}_{t}^{\alpha }}\right)\left({r}_{t}-{\,\text{VaR}\,}_{t}^{\alpha }\right)$$
(20)

To evaluate the joint prediction of VaR and ES, we use the Fissler-Ziegel Loss (FZL) function proposed by Fissler and Ziegel (2016). The FZL function is a novel loss function that measures the joint performance of VaR and ES forecasts. It rewards models that correctly predict the loss severity when the VaR threshold is exceeded. The FZL function is defined as:

$${\,\text{FZL}}_{t}^{\alpha }=\frac{1}{\alpha {\text{ES}}_{t}^{\alpha }}{I}_{{r}_{t} < {\text{VaR}}_{t}^{\alpha }}\left({r}_{t}-{\,\text{VaR}}_{t}^{\alpha }\right)+\frac{{\text{VaR}}_{t}^{\alpha }}{{\text{ES}\,}_{t}^{\alpha }}+\log \left(-{\,\text{ES}\,}_{t}^{\alpha }\right)-1$$
(21)

Empirical analysis

Data and estimation

This study focuses on two prominent clean energy stock indices: the PowerShares WilderHill Clean Energy Portfolio (PBW) and the Invesco Global Clean Energy ETF (PBD). These indices were chosen for three primary reasons. First, both indices are designed to encapsulate the entire clean energy sector, covering a wide range of technologies and services within this industry. This comprehensive representation facilitates a more precise depiction of the sector’s performance and trends. Second, as ETFs, PBW and PBD offer high liquidity, making them accessible to a diverse range of investors, including individual and institutional investors. This accessibility ensures a robust and active trading market for research purposes. Third, their long history in tracking the performance of the clean energy sector provides valuable insights into the risk characteristics and market dynamics of this sector.

Daily price data for PBW and PBD has been sourced from investing.com, spanning from the inception of each index until December 22, 2023. Specifically, PBW has approximately 4700 data points since March 7, 2005, while PBD has around 4100 data points since June 15, 2007. Data integrity checks confirmed that the raw daily price series are complete, with no missing values requiring imputation. The continuous compounded daily returns for both PBW and PBD are calculated by taking the natural logarithm of the difference between consecutive daily spot prices, multiplied by 100, denoted as \({R}_{t}=\left[ln{P}_{t}-ln{P}_{t-1}\right]* 100\).

The daily price changes and percentage returns for PBW and PBD are shown in Figs. 1 and 2, respectively. Visual inspection of these plots reveals no anomalous data points, such as those arising from data entry errors, that would warrant removal or adjustment. Consequently, the unfiltered return series are used for all subsequent analysis. A central focus of this study is the distinct risk profiles of long and short investment positions. It is important to clarify how the same return distribution is used to evaluate these opposing risks. The tail risk for a long position corresponds to the left tail of the return distribution, where large negative returns represent significant losses. Conversely, the risk for a short position, which profits from price declines, arises from large positive returns. Therefore, the tail risk for a short position corresponds to the right tail of the distribution. This study explicitly models both tails to investigate whether the statistical properties governing extreme negative returns are the same as those governing extreme positive returns, a distinction crucial for comprehensive risk management.

Fig. 1: PBW daily price and return.
figure 1

The graphs illustrate the daily price and return series for PBW, spanning from March 7, 2005 to December 22, 2023. Additionally, the return density graph is provided. The forecast windows of the three sub-samples are delineated by dashed lines.

This distinction becomes particularly relevant when examining the historical volatility patterns of these indices. Both indices have exhibited substantial volatility, marked by significant peaks and troughs, indicative of broader market dynamics and the specific challenges faced by the clean energy sector. In particular, they experienced significant declines during the 2008 financial crisis and the initial phases of the COVID-19 pandemic. The more pronounced decline during the 2008 financial crisis was attributed to the systemic nature of the economic downturn and the sector’s vulnerability to investor sentiment and credit availability. The COVID 19 pandemic caused an initial sharp decline, followed by a rapid recovery as the market anticipated a green economic resurgence. However, the subsequent decline may be influenced by evolving market priorities, supply chain disruptions, and the temporary competitiveness of traditional energy sources due to oil price volatility. The inherent volatility of the clean energy market, characterized by significant fluctuations, underlines the need for robust risk management strategies by both market participants and regulators.

The PBW and PBD models are calibrated using a 500-observation estimation window to generate one-step-ahead VaR and ES forecasts for long and short positions on day 501. The choice of a 500-observation window for model estimation in the PBW and PBD indices is driven by two objectives: firstly, to ensure an adequate number of data points for accurate parameter estimation; secondly, to retain the period of the 2008 financial crisis for subsequent forecast evaluation. This approach is reinforced by a robustness check using a window of 1000 observations, which produces qualitatively similar results. Following this initial estimation, the window is moved forward by one day, and the model is re-estimated to predict day 502. This process is repeated until the dataset has been fully exhausted, resulting in 500 observations for model estimation and over 3500 for out-of-sample forecasting evaluation. The rolling window approach is utilized to incorporate the latest market information and eliminate outdated data, thereby enhancing the models’ adaptability to current market conditions.

In order to evaluate the performance of the model under various market conditions, we divided the dataset into three sub-samples based on the observed market movements in Figs. 1 and 2. The start and end dates for each sample period can be found in Table 1. These sub-samples represent significant market events such as sharp declines during the 2008 financial crisis and COVID-19 pandemic, robust recoveries, and sustained declines after initial recoveries. Each sub-sample contains 500 observations for model estimation and 250 for out-of-sample forecasting, allowing a comprehensive analysis of the models’ forecasting capabilities during periods of high volatility. The sub-sample periods are indicated by dashed lines in the price histories of PBW and PBD, as shown in the figures above. For PBD, we can’t evaluate the out-of-sample forecast in Sample I because there are not enough data points to estimate the model.

Fig. 2: PBD daily price and return.
figure 2

The graphs illustrate the daily price and return series for PBD, spanning from June 15, 2007 to December 22, 2023. Additionally, the return density graph is provided. The forecast windows of the three sub-samples are delineated by dashed lines.

Table 1 Data samples.

Table 2 presents the summary statistics for the PBW and PBD return series across the full dataset and the three sub-sample periods. The mean daily returns for both indices are slightly negative during the Full sample period, reflecting a slight underperformance. However, the mean returns are notably negative in Sample I and Sample III, indicating periods of market downturns, while Sample II exhibits a positive mean return, indicating a period of market recovery. The largest standard deviation is observed in Sample I, which is consistent with the increased market volatility during the 2008 financial crisis. Interestingly, the lowest return is recorded in Sample II, which can be attributed to the market’s reaction to the initial stages of the COVID-19 pandemic, where investor sentiment was particularly pessimistic despite the eventual market recovery. The rejection of the normality assumption by the Jarque-Bera test, due to the high kurtosis and negative skewness, underlines the need for models that take these non-normal characteristics into account. The ADF test confirms the stationarity of the return series, while the ARCH portmanteau test reveals significant volatility clustering, emphasizing the importance of capturing these features for accurate tail risk forecasting.

Table 2 Summary statistics.

Table 3 presents the estimation results for the GARCH, GJRGARCH, and IGARCH models, each specified with an AR(1) mean equation and a skewed t-distribution for the innovations. For both the PBW and PBD indices, the significant AR(1) coefficient suggests the presence of autocorrelation in the return series.

Table 3 Volatility estimation.

The results reveal several key features of clean energy volatility dynamics. First, for both the GARCH and GJRGARCH models, the sum of the ARCH and GARCH coefficients α + β is close to one, signifying a high degree of volatility persistence. Second, the leverage parameter γ in the GJRGARCH model is positive and significant for both indices. This confirms a pronounced leverage effect, wherein negative shocks have a more substantial impact on future volatility than positive shocks of the same magnitude. To account for the high persistence observed in the data, the IGARCH model was selected. While a Fractionally Integrated GARCH (FIGARCH) model Baillie et al. (1996) was initially considered for its ability to capture long-memory effects, preliminary analysis showed a long-memory parameter close to 0.99. Given that this indicates a very slow decay of shocks, the more parsimonious IGARCH model was chosen as a robust specification to account for this near-unit-root behavior in the volatility process. This is particularly relevant for clean energy stocks, whose volatility is influenced by both immediate market news and long-term trends in technology and regulation.

The estimated parameters for the skewed t-distribution confirm the presence of significant fat tails and negative skewness in the return distributions for both indices. Importantly, diagnostic tests on the standardized residuals from all estimated models show no remaining significant autocorrelation or ARCH effects at a lag of 20, indicating that the chosen model specifications are well-defined and have successfully captured the conditional dynamics of the data.

Results

We first assess the accuracy of volatility forecasts for the PBW and PBD indices based on the GARCH, IGARCH, and GJRGARCH models, which include the innovations based on the normal and skewed t-distributions. The parameters are estimated by quasi-maximum likelihood estimation (QMLE). This method is standard practice in financial econometrics as it provides consistent and asymptotically normal estimators even if the specified distribution of the innovations is incorrect, provided the mean and variance equations are correctly specified (Bollerslev and Wooldridge, 1992; Hamilton, 2020). The squared daily returns are used as a reference to evaluate the forecast accuracy. According to Table 4, the QLIKE loss function, with lower values indicating higher precision, reveals that the GJRGARCH, particularly with skewed t-innovation, provides the most accurate volatility forecasts for both indices. This model’s enhanced performance is attributed to its ability to differentiate between the effects of positive and negative returns on volatility, aligning with market behavior during various market trends. Overall, the models with skewed t innovation perform better than those with normal innovation, suggesting that non-normal innovation should be taken into account for more accurate volatility forecasting in clean energy stock markets.

Table 4 Volatility forecasts evaluation.

The performance of the VaR and ES forecasts for PBW and PBD is assessed at the 97.5% and 99% confidence levels, employing a robust set of evaluation criteria. These include AE ratios, p-values for various backtesting tests, and the MaxAD metric (McAleer and Medeiros, 2008), which measures the maximum absolute deviation from VaR forecasts. The MCS test at a 90% confidence level evaluates the mean values of the Tick and FZL functions, providing a stringent assessment of model accuracy. The VaR and ES forecasts’ mean and standard deviations, along with backtesting outcomes, are detailed in Tables 5 and 6 for long positions in PBW and PBD over the entire out-of-sample forecast periods. A more granular analysis of sub-sample periods at the 97.5% confidence level is given in Tables 7 and 8 for PBW and PBD, respectively. Due to the small sample sizes in these sub-samples, which can affect the power of p-value tests, we consider the AE ratio, ADMax, and loss function values as the main indicators of model performance. The key insights for long positions are as follows.

Table 5 Full sample VaR and ES results for long PBW positions.
Table 6 Full sample VaR and ES Results for Long PBD Positions.
Table 7 Sub-sample VaR and ES Results for Long PBW Positions at 97.5% Confidence.
Table 8 Sub-sample VaR and ES Results for Long PBD Positions at 97.5% Confidence.

First, the efficacy of various innovation distributions within the GARCH model framework is critical to accurate forecasting. Models based on the normal distribution consistently underperform, as evidenced by elevated AE ratios and frequent rejection of VaR and ES p-value tests, indicating potential model misspecification. In the context of high confidence level VaR predictions, the historical simulation approach is outperformed by both the Cornish-Fisher expansions and the skewed t. The skewed t distribution, when evaluated using the loss functions, is found to offer the most accurate combined forecasts for VaR and ES. Conversely, the Cornish-Fisher expansion, while characterized by the highest mean and standard deviation for VaR and ES forecasts, consistently delivers the most conservative estimates of tail risk. This is attributed to the lowest AE ratio and the smallest Mean Absolute Deviation (MAD) observed in the model’s performance. The conservative nature of the Cornish-Fisher expansion suggests its suitability for risk-averse investors or institutions that prioritize the minimization of potential losses during periods of market stress.

Second, the IGARCH model, which incorporates long-term memory into volatility, produces higher VaR and ES forecasts over the entire sample period for both PBW and PBD. During the 2008 Global Financial Crisis (GFC) period, the IGARCH model outperforms the GJRGARCH model in terms of AE ratio, tick loss, and FZL metrics. This suggests that it provides a more accurate representation of tail risk during this period of extreme market stress. This can be attributed to the IGARCH model’s ability to adapt to the slow decay of shocks, which is particularly pertinent during prolonged periods of market instability. However, despite its performance during the GFC, the GJRGARCH model generally produces more accurate VaR and ES forecasts. The integration of the GJRGARCH model with skewed t innovation outperforms all other models based on the evaluation of tick and FZL function. This improved performance is further corroborated by the results of the QLIKE test, which indicates that it generates the most accurate volatility forecasts.

Third, a comparative analysis of the sub-sample periods, with key descriptive statistics presented in Table 2 and detailed backtesting results in Table 7, reveals distinct risk characteristics across different market regimes. The GFC (Sample I) exhibits the highest overall market volatility, with the largest standard deviations in daily returns and VaR/ES forecasts, alongside significantly higher AE ratios indicating frequent VaR breaches and sustained systemic risk. In contrast, the COVID-19 period (Sample II) presents a different risk profile characterized by sharp, V-shaped market reactions rather than sustained volatility. While overall volatility was lower than the GFC, this period featured the most extreme single-day negative return and larger maximum deviations from VaR forecasts (ADMax), indicating a crisis defined by singular, extreme shocks rather than persistent high volatility. Sample III shows market adaptation to a new risk environment where mean VaR and ES forecasts remain high but with significantly lower standard deviations, suggesting more stable day-to-day risk fluctuations due to market resilience. These differences highlight the importance of using risk models robust to different crisis types-whether prolonged, systemic downturns or sharp, sudden shocks.

Next, we assess the prediction accuracy for short positions in PBW and PBD, as shown in Tables 9 and 10. The difference in prediction accuracy between forecasting short position for PBW and PBD are less pronounced than those for long positions. The IGARCH model produces more conservative VaR and ES forecasts than the GARCH and GJRGARCH models for both indices. At the 99% confidence level, the IGARCH model combined with a skewed t distribution yields the most accurate forecasts, as assessed by the tick and FZL functions. On the other hand, the GJRGARCH model, which surpasses the GARCH and IGARCH models in forecasting long positions, does not show equivalent accuracy for VaR and ES at the 99% confidence level. This discrepancy may be due to the asymmetric response of the GJRGARCH model to market shocks, which is less relevant for short positions.

Table 9 Full sample VaR and ES Results for Short PBW Positions.
Table 10 Full sample VaR and ES Results for Short PBD Positions.

Then, we delve into the comparative analysis of VaR and ES forecasts to enhance our understanding of risk assessment for the PBW and PBD indices. We adhere to the methodology proposed by Patra (2021) by calculating the ES to VaR ratio at a 99% confidence level, which serves as a critical metric for evaluating the effectiveness of risk models. The results, presented in Table 11, reveal a nuanced picture of risk sensitivity across different market conditions. The ratio, which stands at approximately 1.15 for a normal distribution of innovations, escalates dramatically for non-normal distributions, varying among models and time-periods. Notably, the third sample period exhibits the highest ratios, suggesting that market conditions during this period were particularly conducive to a more pronounced risk profile. The study further uncovers that the ES consistently outperforms the VaR when estimated by the GJRGARCH model with skewed t-innovation, surpassing it by over 40% for both long and short positions. This large gap indicates that the ES measure has higher sensitivity to market volatility, which can provide a more reliable risk assessment in uncertain periods. Additionally, as shown in Table 12, the transition from VaR to ES, which is a shift from a 99.5% to a 97.5% confidence level, leads to a modest yet notable increase in risk prediction, particularly under normal innovation scenarios. The increase ranges from 2 to 3% for long positions in PBW and PBD indices estimated by GJRGARCH model, and a similar increase for short positions estimated by IGARCH model. These results demonstrate that skewed t distribution can be used to capture potential losses beyond VaR threshold.

Table 11 ES99 vs VaR99 ratio.
Table 12 ES97.5 vs VaR99 ratio.

The VaR forecast plots presented in Figs. 3 and 4 provide valuable insights into the dynamic risk profiles of clean energy ETFs and the comparative performance of different GARCH specifications. Several important patterns emerge from the visual analysis of the one-day-ahead VaR forecasts at 99% confidence across the out-of-sample forecast period. Most notably, the GARCH with normal innovation (GARCH-Normal) model presented in the top row (Panels A and B) is visually the poorest performer across both indices. The dense clustering of red violation dots indicates that this baseline model systematically underestimates risk for both long and short positions, particularly during periods of high market volatility, underscoring the inadequacy of assuming normal distributions when modeling extreme risk events in the volatile clean energy sector.

Fig. 3: Comparison of 99% VaR Forecasts for the PBW Index.
figure 3

The figure displays daily realized returns (grey line) against 99% VaR forecasts from three different models. The left column (A, C, E) shows the performance for a long position. The right column (B, D, F) shows the performance for a short position. Each row corresponds to a different model: GARCH-Normal (top), GJRGARCH-st (middle), and IGARCH-st (bottom). VaR violations, where losses exceeded the VaR estimate, are marked with red dots.

Fig. 4: Comparison of 99% VaR Forecasts for the PBD Index.
figure 4

The figure displays daily realized returns (grey line) against 99% VaR forecasts from three different models. The left column (A, C, E) shows the performance for a long position. The right column (B, D, F) shows the performance for a short position. Each row corresponds to a different model: GARCH-Normal (top), GJRGARCH-st (middle), and IGARCH-st (bottom). VaR violations, where losses exceeded the VaR estimate, are marked with red dots.

A detailed examination of model performance by position type reveals distinct patterns. For long positions (left column, Panels A, C, E), the GJRGARCH with skewed-t innovation model (Panel C) demonstrates visibly superior performance, with its VaR forecast line (in blue) being more adaptive to changes in market volatility,capturing the asymmetric response to negative shocks and resulting in noticeably fewer violation dots compared to the GARCH-Normal model. Conversely, for short positions (right column, Panels B, D, F), the IGARCH with skewed t innovation model (Panel F) emerges as the most effective specification, with its VaR forecast line (in purple) providing more appropriate risk boundaries for the upside return spikes characteristic of short-selling risk, leading to the fewest violations across both indices. These visual findings confirm that the optimal model for managing downside risk in long positions (GJRGARCH-st) differs from the optimal model for managing upside risk in short positions (IGARCH-st), thereby validating the necessity of a tailored, dual-model approach for comprehensive risk management in clean energy investments.

Discussion

This study offers valuable insights into risk forecasting for the volatile clean energy stock market, with findings that both reinforce and extend key themes from the broader VaR literature. The key results demonstrating the necessity of non-normal innovations and the divergent model requirements for long and short positions provide a clear pathway toward improving the predictive accuracy of VaR and ES for clean energy sector stocks, which is critical for both investors and regulators.

First, our study finds that non-normal innovations, specifically the skewed t distribution, outperform standard methods such as filtered historical simulation and the Cornish-Fisher expansion. The skewed t distribution excels at capturing both skewness and kurtosis, which is particularly important in the clean energy sector, where returns frequently exhibit asymmetric behavior and fat tails. This characteristic stems primarily from the sector’s heightened sensitivity to rapid technical improvements and legislative shifts. By offering a more flexible and robust estimation approach, especially for the smaller sample sizes common in emerging markets, this distribution enhances the reliability of risk forecasts. The result aligns with a significant stream of literature emphasizing the importance of accounting for skewness in VaR forecasting of equity indices (Abad et al., 2016; Chen et al., 2012; Kuang, 2021). By demonstrating the superiority of the skewed t-distribution for clean energy assets, a sector defined by high uncertainty and frequent policy-driven shocks, our study validates these general findings within a new and economically significant domain.

Second, the disparity in optimal models for long and short positions represents this study’s primary contribution to the literature on forecast improvement. While the challenge of producing reliable VaR forecasts during market stress has led researchers to explore various enhancement strategies, such as data-driven forecast combination techniques (Chiu et al., 2010; Halbleib and Pohlmeier, 2012), our research introduces a complementary perspective: forecast improvement through ex-ante model tailoring. This disparity can be attributed to the sector’s asymmetric price dynamics and risk characteristics. For long positions, the GJRGARCH model outperforms due to its ability to capture the leverage effect, whereby negative shocks have a greater influence on volatility than positive shocks of equivalent magnitude. This is especially important in the renewable energy sector, where adverse developments, such as regulatory rollbacks, can substantially impact investor sentiment. In contrast, the IGARCH model is more successful for short positions, reflecting the persistent nature of volatility during upward trends, which are frequently driven by positive technology breakthroughs or supportive policy announcements. These findings suggest that a single, universally optimal model may not exist even within a specific asset class, and that tailoring models to the directional nature of risk exposure represents a crucial yet underexplored avenue for enhancing forecast accuracy.

Finally, the variability in VaR and ES projections underscores the dynamic nature of the clean energy stock market, characterized by pronounced volatility responses to global economic disruptions, technological developments, and policy changes (Attarzadeh and Balcilar, 2022; Zhang et al., 2023). As investor sentiment, technological progress, and macroeconomic factors evolve, risk managers must remain adaptable to shifting risk profiles. Our research demonstrates that employing both ES and VaR, consistent with the Basel regulatory shift toward Expected Shortfall for capturing tail risks (Taylor, 2020), facilitates a more comprehensive assessment of tail risks. This dual approach proves crucial for regulatory authorities seeking to refine capital adequacy requirements and risk management guidelines. These findings highlight the necessity for regulatory frameworks to be responsive to the distinct volatility characteristics and systemic risks inherent in clean energy investments. As the sector expands, continued risk model research and adaption will be critical for ensuring financial stability and supporting the industry’s long-term development goals, particularly given the complex interplay between climate risks and financial stability considerations (Dietz et al., 2016).

Conclusion

This study contributes to the understanding of risk dynamics in clean energy investments by addressing the persistent challenge of tail risk modeling. By conducting a comprehensive empirical analysis focused on the unique characteristics of clean energy stocks, we demonstrate that a tailored, dual-model approach is essential for accurate risk management. Our key findings show that the GJRGARCH-skewed t model is best suited to long positions, whilst the IGARCH-skewed t model shines at short positions.

These findings have significant and actionable implications for risk management and portfolio optimization in the clean energy sector. For portfolio and risk managers, the practical benefits are multifaceted. First, adopting superior model specifications like the GJRGARCH-st for long positions leads to more accurate VaR and ES forecasts. This enhanced accuracy enables more efficient capital allocation, ensuring that capital reserves set aside to cover potential losses are neither inadequate (exposing the firm to unexpected risk) nor excessive (constraining profitable investment opportunities). The improved forecasting precision can translate into substantial cost savings, as financial institutions typically hold billions in regulatory capital that could be more productively deployed if risk estimates were more accurate.

Second, the daily risk forecasts from these dynamic models enable more responsive and proactive risk management strategies. For instance, a spike in forecasted VaR could trigger tactical position reductions, dynamic hedging adjustments, or the implementation of protective derivatives strategies, allowing investors to preemptively mitigate risk exposure in response to evolving market conditions. This real-time risk monitoring capability is particularly valuable in the clean energy sector, where regulatory announcements, technological breakthroughs, or policy shifts can rapidly alter risk profiles.

Furthermore, these models significantly enhance portfolio optimization decisions across multiple dimensions. By providing more accurate measures of volatility and tail risk-critical inputs for risk-adjusted performance metrics such as the Sharpe ratio, modified Sharpe ratio, and Sortino ratio, investors can make better-informed strategic asset allocation decisions regarding clean energy exposure. This improved risk measurement also facilitates more sophisticated portfolio construction techniques, such as risk parity strategies or maximum diversification approaches, which rely heavily on accurate volatility estimates.

From a regulatory and systemic risk perspective, our results underscore the inadequacy of standardized, one-size-fits-all risk models in capturing the nuanced risk characteristics of emerging sectors. The empirical finding that long and short positions require fundamentally different modeling approaches supports the ongoing regulatory shift towards more sophisticated internal models for calculating capital requirements under Basel and similar frameworks. As sustainable investments become increasingly integral to the global financial system, accurate risk measurement in clean energy assets becomes crucial not only for individual firm stability but also for broader financial system resilience and the successful transition to a sustainable economy.

Despite these contributions, our study has limitations. First, our analysis focuses on a small number of renewable energy indices, which may limit the generalizability of our findings. Future research should look at a broader range of clean energy assets, such as individual equities and sub-industry indexes, to validate and expand on our findings. Second, our study evaluated several competing parametric and non-parametric distributions for the innovations within the GARCH framework. A further refinement could be achieved by employing Extreme Value Theory (EVT) (McNeil and Frey, 2000), a specialized approach designed to directly model the extreme tails of the standardized residuals. While powerful, the application of EVT requires a very long time series to ensure the reliable estimation of tail parameters. As more historical data becomes available, integrating EVT to model the innovations’ tails would be a valuable and logical extension of our work. Third, while our focus on GARCH-type models yields valuable insights, investigating alternative methodology, such as machine learning approaches, could improve risk forecasting in this dynamic sector. Finally, the study’s reliance on historical backtesting may fail to adequately account for future market changes caused by regulatory reforms or unexpected economic upheavals. Integrating scenario analysis or stress testing could be beneficial in future investigations.

In conclusion, the insights from this research are both timely and relevant as the world moves towards sustainable energy investments. By refining traditional risk models to better encapsulate the complexities of clean energy stocks, we offer a more precise and adaptable framework for risk assessment in this critical sector. As clean energy markets expand, our findings will help to guide investment decisions, strengthen financial resilience, and contribute to the overarching aims of sustainable development.