0
Views
0
CrossRef citations to date
0
Altmetric
Econometrics

A comparative VaR analysis between low-frequency and high-frequency conditional EVT models during COVID-19 crisis

ORCID Icon, &
Article: 2377495 | Received 23 Aug 2023, Accepted 03 Jul 2024, Published online: 19 Jul 2024

Abstract

The aim of this paper is to assess whether the availability of high-frequency data enhances the accuracy of extreme market risk estimation in comparison to low-frequency data by using Value-at-risk (VaR) and Expected shortfall (ES). The sample data used for analysis comprised the daily closing stock prices and 5-minute intraday stock prices of DJIA, FTSE100, BOVESPA, and MERVAL Index from 2014 to 2022. The data analysis was done to compare the performance of two-stages hybrid methods called conditional EVT that combined the GARCH, RV and HAR specification models with the EVT approach. To assess the accuracy of the VaR forecasts, out-of-sample VaR forecast was backtested by using unconditional coverage (UC) and conditional coverage (CC) tests. The VaR backtesting procedure also incorporated the utilization loss function which are the regulatory loss function (RLF) and the firm’s loss function (FLF). The accuracy of the forecasted ES was backtested by using the generalized breach indicator (GBI) method. The findings of this research emphasized that high-frequency conditional EVT, incorporating the HAR specification outperformed the low-frequency conditional EVT in predicting market risk during periods characterized by extreme returns. Based on the VaR and ES measure, the HAR-EVT typed models are the best performance model compared to the GARCH-EVT and RV-EVT typed models during both crisis and non-crisis periods. This research study contributes to the current literature on the forecasting ability of risk models by concentrating on the hybrid model of long-memory models (FIEGARCH, RV-FIEGARCH and HAR-FIEGARCH) for with the EVT approach.

Impact statement

The analysis compares the performance of two-stages hybrid methods called conditional EVT that combined the GARCH, RV and HAR specification models with the EVT approach during non-crisis and crisis COVID-19 pandemic. Based on the VaR and ES measure, the HAR-EVT typed models are the best performance model compared to the GARCH-EVT and RV-EVT typed models. The findings of this research study are also beneficial to market participants, especially those who engage in high-frequency trading to utilize the RV and HAR models as well the application of the EVT model to measure the potential risks and detect any vulnerabilities in financial systems especially during a sudden market crash.

1. Introduction

The degree of stock market volatility should raise significant concerns among different stakeholders, including investors, regulators, and financial institutions, to prevent potential economic challenges in the future. Historical records indicate that fluctuations in stock prices during periods of market turbulence can exert a considerable influence on financial downturns, as evidenced by events like the catastrophic Black Monday in October 1987 and Black Friday in October 1989. These unexpected and rare historical occurrences instilled concerns about economic recession and had repercussions on numerous countries across the globe. Another notable extreme event that caused a devastating financial crisis was the Asian financial crisis of 1997. This crisis has impacted financial markets in terms of extreme price movements which lead to the depreciation in currency and a crash in the stock market globally. This crisis had heightened volatility not only in Asian counties as well as other nations which hesitate foreign investors to invest in the Asian market. Due to that, most of the banks and financial institutions are collapsed which gives severe losses for creditors and investors. Further to that, the global financial crises 2007–2008 crisis stands as the most terrifying crisis. This crisis derived from a subprime mortgage lending crisis in 2007 and quickly accelerated into a worldwide banking crisis with the collapse of Lehman Brothers investment bank in September 2008. Regardless of substantial efforts to sustain the fallout, the damage proved uncontrollable, plunging the global economy into a deep recession. On December 31, 2019, the emergence of the coronavirus (COVID-19) began in China and subsequently spread to 220 countries and territories worldwide. On March 11, 2020, the World Health Organization (WHO) officially declared it a global pandemic (Kickbusch et al., Citation2020). The emergence of COVID-19 cases had a profound impact on the financial market. The rapid spread of the virus and the subsequent global lockdown measures implemented to contain it led to significant disruptions in economic activities and investor sentiment. The uncertainty surrounding the duration and severity of the pandemic caused widespread volatility and instability in financial markets as reported by Ullah (Citation2022). The global stock markets experienced sharp declines as investors reacted to the escalating health crisis and its potential economic implications. The concerns regarding a worldwide economic downturn, diminished consumer expenditure, and disrupted supply chains led to significant selloffs and declines in the financial markets. There are recent studies by (Basuony et al., Citation2022; Hui & Chan, Citation2022; Rakshit & Neog, Citation2022) that found the impact of COVID-19 in global stock price by exhibited large volatility and risk uncertainty. The impact of financial crises on the stock markets as discussed above gives motivation to this study to include the recent COVID-19 period as the sample data for analysis. This choice is due to the absence of prior research regarding the efficiency of the high-frequency conditional EVT in forecasting the stock market risk during the COVID-19 period.

The analysis of this study includes the application of the heterogeneous market hypothesis (HMH) introduced by Dacorogna et al. (Citation2001). According to HMH, investors follow the nonlinear trading behaviours of heterogeneous market participants where it is impossible for one price to reflect everyone’s information even when each investor receives similar information as claimed by the efficient market hypothesis (EMH) introduced by Fama (Citation1965). Due to the heterogeneity perspective, there are various trading strategies which produce uncertainty of risk or volatility cascade and created the long memory property in the financial markets as mentioned by Cheong et al. (Citation2016). Based on the HMH framework, the stock market returns were analyzed by considering how the market participants react to past information or news at different time horizons (daily, weekly and monthly). This paper considers the recent model introduced by researchers that incorporates the HMH framework called the heterogeneous autoregressive (HAR) model. The HAR model is an extension of the autoregressive conditional heteroskedasticity (ARCH) model that considers the long memory property observed in financial time series. The HAR model also aligns with the HMH theory because it considers the varying responses of different investor groups by incorporating lagged volatility measures at different time scales (short-term, medium-term, and long-term).

Based on previous research done in the risk forecasting field, the efficacy of risk estimation models selected are impacted by various elements, such as the type of risk model used, the type of data used (daily or intraday data) and the specific economics conditions under examination. Therefore, these two elements will be included in this study to see the efficiency of the low-frequency and high-frequency models in forecasting the stock market risk. With the rise of high-frequency data, the stock market now offers more information to investors. In today’s context, data can be gathered within seconds, such as 1-second intraday stock prices, contrasting with the past when data was collected at the close of the trading day. As the data undergoes frequent updates, it provides more recent and accurate information on market movements, thereby offering an advanced depiction of the current market conditions. During a financial crisis period, the quickest and most updated information is important so that investors can respond to any market volatility and uncertainty in the fastest way. The advantages of this new high-frequency data give motivation to this study to apply the high-frequency data (intraday data) to see whether it outperformed the low-frequency data (daily data) in forecasting the stock market risk especially during the financial crisis.

Previous literature has established the effectiveness of using the GARCH model in forecasting stock market risk based on low-frequency data (daily returns) in financial time series analysis. The GARCH model, with its ability to capture volatility clustering and persistence, has been widely employed and found to be suitable for modeling and forecasting volatility in various financial markets. At present, the accessibility of high-frequency data has spurred researchers to delve into alternative risk forecasting models, moving beyond the conventional GARCH model. Given that researchers have had sustained attention with the existence of long memory in daily returns, it is also imperative to examine the presence of long memory in intraday returns, especially regarding the RV and HAR specifications which give another motivation of this study. It is known that during periods of market turbulence, the financial stock market is vulnerable to extreme fluctuations, making the measurement of market risk a paramount concern for risk-averse investors. In this research, the utilization of high-frequency models based on the RV and HAR specifications are applied with the EVT approach. By employing the hybrid approach of high-frequency models with EVT, it assumed to improve the estimation of tail risk by more accurately capturing extreme market events. While the HAR model focuses on the volatility cascade across different investment horizons, the statistical framework of EVT facilitates the modeling and analysis by focusing on the extreme returns of tail distributions.

The hybrid method that combining any low-frequency or high frequency models with the EVT approach is generally called the conditional EVT model. Previous research on the conditional EVT model has been proven to enhance the ability to forecast and assess risk during turbulent market conditions, however mostly the research used daily returns as the volatility proxy. Due to the emergence of high-frequency data, researchers have started to explore how the intraday returns based on the RV specification help to improve the risk forecast. Further to the introduction of the HMH theory researchers try to combine the HAR model with the EVT approach. The finding by Louzis et al. (Citation2007) found that the combined HAR-GARCH-EVT model, which incorporates the conditional heteroscedasticity of the HAR errors, demonstrates superior performance overall and produces precise estimates of VaR that effectively minimize Basel II regulatory capital requirements. The model’s accuracy is evident in both the full out-of-sample period and the specific crisis period of 2007–2009. Further research has been done on the accuracy of the HAR-EVT models in forecasting the stock market risk. The availability of high-frequency data gives another area of research on the efficiency of the low-frequency conditional EVT models compared to the high-frequency conditional EVT models. Researchers found that mostly, by comparing the high-frequency data, the conditional EVT performed better. The adoption of realized volatility as the volatility estimator of high-frequency data shows better prediction than the daily returns. Numerous studies have been conducted to substantiate the benefits of incorporating the RV specification as a hybrid model alongside the EVT approach. The current research also employs the RV specification derived from the HAR model in the context of the conditional EVT model; however, this area of study remains relatively unexplored. This study aims to contribute to the current literature on the forecasting ability of risk models by concentrating on the long-memory models of FIEGARCH, RV-FIEGARCH and HAR-FIEGARCH models with the EVT approach which has not been explored yet. The findings of the study will contribute to the grasp of how different type of data frequencies can influence stock market risk forecasting and provide accurate models to be used in any risk management practices and investment strategies.

Despite the advantage of high-frequency data researchers argue on whether it’s worth it to use high-frequency data due to its higher cost such as a study done by Lyócsa et al. (Citation2021). Instead of the higher cost, it also requires more time to analyze compared to the low-frequency data. For investors and traders with large capital, the advantages of using this high-frequency data may outweigh the costs. The ability of this high-frequency data to quickly respond to market fluctuations will benefit them in increasing the capital invested and minimizing the potential loss. However, for other small investors and traders, the expenses associated with acquiring and analyzing high-frequency data will be the main barrier. In such instances, the low-frequency data, which is generally more affordable and accessible will be chosen. Some researchers which rely on low-frequency data will miss opportunities offered by high-frequency data especially in the current big data era where the growth of big data application database solutions and analytics is anticipated to expand by $12 billion in 2027 (Djuraskovic, Citation2023). The decision to utilize high-frequency data should be based on a detail evaluation of the potential advantages and disadvantages, as well as an understanding of how the data will be employed and what specific trading strategies or investment objectives it will support. The cost issue gives additional motivation of this study to explore whether the potential benefits of employing high-frequency data in data-rich markets outweigh or hold value when contrasted with low-frequency data. The question that lingers is whether this cost will ultimately yield substantial value in the context of modeling and forecasting financial markets.

2. Literature review

Volatility forecasting through the application of EVT has gained considerable attention among researchers. EVT provides a framework for analyzing extreme events and tail behavior in financial markets. By focusing on extreme observations, EVT enables the estimation of tail probabilities and extreme quantiles, which are crucial for volatility forecasting. EVT stands out as the optimal choice for analyzing the tail distribution characterized by extreme events. Extensive studies have been done to study the accuracy of the EVT approach compares other alternatives models in forecasting the stock market risk. A study done by Mcneil and Frey (Citation2000) found that the hybrid model GARCH-EVT is effective to deals with asymmetric tail distributions on the S&P Index. A further research study by Paul and Sharma (Citation2018) found that EVT is reliable in estimating the VaR and ES compared to other models that are not combined with EVT. A study done by Soltane et al. (Citation2012) supports that the combination of the GARCH model and EVT outperforms the conventional standalone EVT model in predicting stock market volatility during extreme events. Another research conducted by Altun and Tatlidil (Citation2015) which analyzed the ISE-100, S&P-500, and Nikkei-225 indexes, found that the GARCH-EVT model outperforms compared to standalone GARCH model. Additionally, research done by Youssef et al. (Citation2015) found that the FIAPARCH-EVT model performs better in predicting one-day-ahead VaR compared to FIGARCH-EVT and HYGARCH-EVT. Another research done by Tabasi et al. (Citation2019) shown the superiority of the GARCH-EVT model over the simple skewed-t GARCH model in estimating stock market risk during the capital depression period of the Tehran Stock Exchange. A study done by Omari et al. (Citation2020) found that the GARCH-EVT model significantly improved VaR forecasting on the daily returns data of twelve major international stock indices. Further research done by Roy (Citation2022) found that the GARCH-EVT models outperformed other models in forecasting intraday VaR during the COVID-19 pandemic. Additionally, study done by Jin et al. (Citation2022) found that the GARCH-EVT-copula approach is the best to measure the VaR and ES before and during the COVID-19 pandemic. Recent study by Zhao et al. (Citation2023) found that the GARCH-EVT-Copula-CoVaR model is accurate in forecasting the spillover risk impact of global oil prices on Chinese sectoral stock markets.

The new revolution of high-frequency data has encouraged researchers to use high frequency data that is based on intraday data to forecast the stock market risk. The realized volatility becomes new latent volatility used to estimate the VaR and ES instead of the daily returns’ volatility. New recent research used the combination realized volatility models and the EVT approach and the result shown that it has more advantages and reliable estimates compared to low-frequency models that used daily returns series. A study done by Watanabe (Citation2012) examines the efficacy of the realized GARCH model in the US equity market. Subsequently, (Paul & Sharma, Citation2018) extend the analysis of Watanabe (Citation2012) and conclude that the realized-GARCH EVT models demonstrate superior forecasting performance in estimating VaR and ES Another research done by Zhang and Zhang (Citation2016) showed that the EGARCH model combined with the EVT approach does very well in predicting a critical loss for precious metal markets. According to Tabasi et al. (Citation2019) the GARCH-EVT model enables a correct VaR estimation also in this market and outperforms the simple GARCH model with Student’s t and normal distributions for residuals. As discussed earlier, the HAR model, which captures the heterogeneity of market participants’ trading behaviors, has shown promise in capturing volatility dynamics at different time scales. The finding by Huang et al. (Citation2016) demonstrates that the HAR-GARCH specification more effectively captures the persistent memory patterns in volatility when contrasted with the conventional GARCH model. Another research done by Tao et al. (Citation2018) discovered that the HAR-RV models surpasses its conventional counterparts in VaR performance. The finding by Zahid et al. (Citation2022) shows that the HAR-GARCH model exhibited superior forecasting precision for Bitcoin volatility when contrasted with alternative realized volatility models. Recent study by Maaz et al. (Citation2023) show that the GARCH-EVT is superior in predicting the risk of high-frequency returns data at 15-minute intervals in the metal market.

As discussed in the Introduction section, the HAR model can capture the heterogeneity of market participants’ trading behaviors and creates a volatility cascade throughout the short to long-term investment period. By combining the HAR model with EVT, it assumes to add prediction of rare and extreme market events at different levels of market volatilities including short-term, medium-term, and long-term. Previously research by Louzis et al. (Citation2007) found that the asymmetric HAR-EVT model is the best model to forecast the VaR. Another finding by Sojka (Citation2015) shows that HAR-RV, HAR-RV-J and ARFIMA are accurate to forecast the daily and intraday returns with comparable results where the daily returns scored the minimum value of RLF while the intraday returns scored the minimum value of FLF. Furthermore, Bee et al. (Citation2016) forecast the intraday returns including the HAR-J and EVT approach but the result found that the GARCH-EVT based on the daily returns is better than the high-frequency based filter. Next, Liu et al. (Citation2022) forecast the out-of-sample VaR of the CSI300 index by combining the HARQ model and EVT and found that the HARQ-EVT models outperform several conventional HAR-type models in predicting the VaR of the Chinese stock market. A recent finding by James and Leung (Citation2023) shows the capabilities of the generalized quantile random forests (GQRF) model to other risk models, including the GARCH-EVT and HAR-EVT models. Their findings indicated that the GQRF risk model consistently provides dependable VaR and ES forecasts.

3. Materials and methods

3.1. Sample data

The dataset of this study includes daily returns (low-frequency data) and 5-min intraday returns (high-frequency data) spanning from January 2014 to April 2022 for four selected global stock markets namely DJIA Index and FTSE Index (developed markets) and BVSP Index and MERVAL Index (emerging markets). To examine the effectiveness of risk models in both regular and extreme situations, the period of the COVID-19 pandemic was specifically selected. The analysis was then carried out on two distinct sub-sample periods: the normal period spanning from January 2014 to December 2019, and the crisis period covering January 2020 to April 2022. The data analysis is divided into preliminary analysis, in-sample empirical analysis and out-of sample for VaR and ES forecast evaluation. Given Pt is the closing stock prices, the logarithmic daily returns of the chosen stock markets were computed by measuring the disparity between the logarithmic price levels on two consecutive days using the following formula: (1) {rt}t1T=log(PtPt1).(1)

The 5-minute logarithmic realized variance, logRVt was calculated based on two steps. Firstly, the 5-min stock prices were transformed into one-day trading returns by using the R-programming code based on EquationEquation (1). Given m is the total 5-min stock prices with i = 2,3,4,5, 6,…m. Next, the one-day trading returns was transformed into RVt by using the formula: (2) RVtd=i=2m|Rt,i|2,(2) where t = 1,2, 3…n and where n represents the total number of observations. The RVt was then transformed into logRVt. The reason of choosing logRVt  compared to raw RVt is due to the facts that the distribution of logRVt  is closer to a normal distribution and the conditional heteroskedasticity is significantly lowered in log-RV as mentioned in a study done by Jafry et al. (Citation2020). The log-returns of rt was used in the in-sample empirical analysis by using the GARCH, EGARCH and FIEGARCH models (low-frequency models) while the log-realized variance, logRV was used in the RV and HAR models (high-frequency models). The residuals series generated from each low and high-frequency models will be used in the second stage of data analysis that combined with the EVT approach. These two stages combination method is called the conditional EVT method.

3.2. The conditional EVT model

In this study, the conditional EVT method introduced by Mcneil and Frey (Citation2000) has been applied where the combination of GARCH, RV and HAR specifications with the EVT approach has been tested. The first stage of the analysis involved the filtering process of the original data series by using GARCH, EGARCH and FIEGARCH models (low-frequency data) along with the filtering process by using the RV and HAR types of models (for high-frequency data). This streaming process aimed to generate standardized residuals (innovations) that resemble iid distribution which is the requirement for the EVT approach where the filtered return series are assumed to be free from series correlation and heteroscedasticity effects. After the streaming process, the filtered return series obtained from the streaming process was utilized to calculate the EVT parameters. This study utilized the peak-over-threshold (POT) method where extreme value observations surpassing a specified threshold are considered, irrespective of the clustering tendencies of the time series. The thresholds generated from the POT method will be used in estimating the Generalized Pareto Distribution (GPD) distributions. By using the GPD parameters, the VaR and ES quantiles are calculated which next will be used in the estimation of the one-day VaR and ES forecast.

3.3. The Peak-Over-threshold (POT) method

Generally, there are two methods in EVT used to model the extreme returns which are the Block Maxima (BM) method use a generalized extreme value (GEV) distribution and the POT method that based on the GPD distribution. In this analysis the Block Maxima (BM) method will not be tested because it is not suitable for analyzing time series data due to the presence of volatility clustering, which results in extreme values being clustered together. Consequently, this study utilized the POT method, where extreme value observations that surpass a specified threshold are considered, irrespective of the clustering tendencies of the time series. In the POT method, given the logarithmic return series of a stock index, denoted as ={x1,x2,.,xt}. The possible values of x represent the exceedance generated form extreme values that surpasses a specified threshold u. The cumulative probability function of y, expressed as Fu(y) is given by: (3) Fu(y)=Pr(Xuy|X>u).(3)

The conditional probability in EquationEquation (10) can be simplified into: (4) Fu(y)=F(x)F(u)1F(u),(4) where y=xu is the magnitude of exceedance and x=y+u is the exceedance.

3.4. The GPD distribution

One of the distributions used in EVT is called the GPD distribution. According to Balkema and Haan (Citation1974) and Pickands (Citation1975) the cumulative probability function Fu(y) happens when a threshold u is sufficiently high. This function will converge to GPD distribution defined as: (5) Gξψ(y)={1(1+ξyψ)1ξ, if ξ0,1eyψ, if ξ=0,(5)

The GPD parameters ψ (scale parameter) and ξ (shape parameter) were estimated by using the maximum likelihood method. Since  Fu(y)Gξψ(y) introduced by Embrechts et al. (Citation2013) at a sufficiently high threshold u, EquationEquation (3) and Equation(4) are combined to be: (6) F(x)=(1F(u))Gξψ(y)+F(u).(6)

Assumed F(u)=nkn, the function F(x) can be simplified as: (7) F(x)=(1kn)[1+ξxuψ]1ξ,(7) where n denotes the total number of observations and k represents the number of exceedances.

3.5. The VaR and ES quantiles

When considering the cumulative distribution function F(.), at a specific probability q, the estimation of VaR can be expressed as: (8) VaRq=F1(1q),(8) where F1 (.) is the inverse function of F(.). By inverting EquationEquation (8), the VaRq is given by: (9) VaRq=xq=u+ψξ[(1qk/n)ξ1].(9)

Since VaR solely focuses on the distribution quantile and neglecting extreme losses beyond the VaR level, valuable information regarding the tails of the underlying distributions may be disregarded. To address this limitation, suggested the adoption of expected shortfall (ES) denoted as: (10) ESq=E(X|X>VaRq).(10)

Based on (Mcneil & Frey, Citation2000), the value of ES can be estimated as: (11) ESq=VaRq1ξ+ψξu1ξ.(11)

3.6. The forecasted VaR

The 1-ahead forecasted of conditional mean μ̂t+1  and conditional variance σ̂t+1  are forecasted by using the GARCH and HAR specifications. The 1-ahead conditional mean μ̂t+1  is given by: (12) μ̂t+1=â0+i=1pâ1rti+1+j=1qb̂jεtj+1 ,(12) while conditional variance σ̂t+1 is forecasted by using the GARCH (1,1), EGARCH (1,1), FIEGARCH (1,1) and HAR models. The 1-ahead VaR and ES is calculated by the POT method into the standardized residuals series Zt by using the formula in EquationEquation (9) and Equation(11). The 1-ahead conditional VaR based on the conditional EVT, VaRqt+1  is given by: (13) VaRqt+1=μt+1+σt+1VaRq,(13) while the one-step-ahead forecasted ES, Esqt+1 is given by: (14) ESqt+1=μt+1+σt+1ESq.(14)

3.7. Backtesting procedure

To assess the VaR performance of each model, a two-stage backtesting procedure is presented. In the initial stage of backtesting, statistical tests are applied, including the unconditional coverage test developed by Kupiec (Citation1995) and conditional coverage tests introduced by Christoffersen (Citation1998). The Kupiec likelihood-ratio (LR) test evaluates the VaR models based on the violation ratio (VR), which is defined as follows: (15) VR=NpH,(15) where N=t=T+1T+HIr(rt<VaRt|t11p) and pH are the theoretical and the actual number of violations. A violation is said to be violated if the realized return value lies lower than the estimated VaR(rt<VaRt|t11p). The LR test statistic is given as: (16) LR=2lnf̂N(1f̂)HNpN(1p)HN.(16)

The unconditional coverage test which also known as the Kupiec POF test is used to check whether the violations’ proportion obtained, p̂ is significantly different from the expected proportion, p. Given that the probability of exceedance is constant, the number of VaR violations, T1=t=1TIt follows a binomial distribution where T  is the total number of observations. The VaR (p) measure is said to be accurate if the unconditional coverage p̂=T1T is equal to p% and the null hypothesis, H0 is given by H0=p̂=p. The likelihood ratio statistics used to perform Kupiec POF test is given by: (17) LRUC=2(log(p̂T1(1p̂)TT1)log(pT1(1p)TT1)).(17)

This study also utilized the conditional coverage test to backtest the VaR by examining the joint evaluation of unconditional coverage and serial independence. The test statistics combine the sum of the individual test statistics for unconditional coverage and serial independence, denoted as LRCC=LRUC+LRIND. The LRIND assesses whether exceptions are statistically independent of one another. The conditional coverage joint test, LRCC=LRUC+LRIND mainly examine the percentage of exception whether it is statistically equal to the actual proportion, p and whether the exceptions are statistically independent of based on the independence’s indicator. Therefore, the null hypothesis of the actual proportion, p and statistically independence is given by H0:π01=π11=p. The test statistics of the conditional coverage test is given by: (18) LRCC=2(log(π̂01T01(1π̂01)T00π̂11T11(1π̂11)T10)log(pT01+T11(1p)T00+T10)).(18)

In addition, the models that passed the UC and CC test will be tested again by using the Regulatory Loss Function (RLF) and the Firm’s Loss Function (FLF). The RLF function, as proposed in the work of Lopez (Citation1998), quantifies the disparity between the predicted VaR and the actual returns, while considering a collection of regulatory restrictions defined by: (19) RLF={1+(VaRtrt)2 if rt<VaRt, 0 if rtVaRt.(19)

The loss function proposed by Sarma et al. (Citation2003), known as the FLF (Firm Loss Function), imposes a penalty on regular days by considering the potential cost linked to the capital allocated by the company for the purpose of risk management, as expressed by: (20) FLF={(VaRtrt)2if rt<VaRt,βVaRtif rtVaRt.(20)

To validate the ES through backtesting, the utilization of a traffic light approach was implemented, as introduced by the Basel Committee on Banking Supervision (Citation1996). This study applied the same approach to backtest the ES introduced by Costanzino and Curran (Citation2015) to evaluate the adequacy of the ES by measuring the number of breaches of the ES threshold over a given investment period. In this method, a breach occurs when the actual loss exceeds the ES estimate, and the method measures the severity of the breach by considering the difference between the actual loss and the ES estimate. The larger value of conditional indicates more severe breaches. A condition of zero indicates no breaches, while a positive generalized breach (GB) value indicates that the ES estimate was exceeded on average. The null hypothesis, H0 stated that the ES estimate model is accurate if the expected losses calculated from the ES estimate are consistent with the actual losses incurred. If the actual losses exceed the expected losses by a significant amount, then the H0 is rejected, indicating that the ES estimate is inaccurate and needs to be revised.

4. Results

4.1. Preliminary analysis

The skewness coefficients in and indicate negative values for daily returns and positive values for intraday returns. The Jarque–Bera statistics indicate that the data deviates from normality for all stock market indices. The value of kurtosis greater than three shows that the daily returns for all stock market series have heavier tails compared to a normal distribution. In , the logRVt  for all stock markets also exhibit the excess value of kurtosis. However, the kurtosis value for daily returns, rt is higher compared to the logRVt  which mean the tail of low-frequency data is fatter than the tails of high-frequency data.

Table 1. Summary statistics of daily log-returns, rt.

Table 2. Summary statistics of intraday returns, logRVt.

In terms of the graphical illustration, the QQ plot depicted in shows that there is an existence of heavy-tailed in the data series. The depiction of daily log-returns denoted as rt  reveals an asymmetrical perspective of tail distribution, with notably greater deviations observed in the realm of extreme negative returns compared to the expectations of a Gaussian distribution. In contrast, for the logRVt , the Q-Q plot exhibits a right-skewed curve, implying that the predominance of extremes primarily lies within the right tail. The clear concavity in both daily and intraday returns is the strong signal of the heavy tailed presence in all data series, even though the intraday series displays stronger volatility periodicity patterns compared to the daily return series.

Figure 1. Q-Q Plot of daily returns, rt  and intraday returns, logRVt.

Figure 1. Q-Q Plot of daily returns, rt  and intraday returns, log RVt.

4.2. Empirical analysis

In specific applications of EVT such as modeling exceptional occurrences in finance, the assumption of independence and identical distribution (iid) is appropriate. Because the EVT assumes iid observations, the original return series cannot be applied directly but must be filtered to make sure it follows an iid distribution. In this study, the standardized residuals filtered from GARCH, EGARCH and HAR model were tested by using the ARCH-LM, Q and Q2 statistics to make sure that it follows iid observations. The findings are displayed in and which indicate that the ARCH-LM, Q, and Q2 tests yield insignificant results. This implies that the filtered standardized residuals series does not demonstrate serial dependence, in other words, free from serial correlation and conditional heteroskedasticity. Therefore, these filtered standardized residuals series are now assumed to be iid, making them suitable to be used in the EVT analysis. The filtered standardized residual series were estimated from January 1, 2014, to December 31, 2019, representing the non-crisis period, as well as the crisis years from January 1, 2020, to April 30, 2022.

Table 3. The heteroskedasticity and serial correlation test during the non-crisis period.

Table 4. The heteroskedasticity and serial correlation test during the crisis period.

In the second phase of data analysis utilizing EVT, the POT method was employed. This involved using the GPD distribution to estimate the tails of the data series. The in-sample filtered standardized residual series were estimated using conditional EVT models from January 1, 2014, to December 31, 2019, representing the non-crisis period, as well as the crisis years from January 1, 2020, to April 30, 2022. The threshold, u for each residual was determined by utilizing the following mean excess function (MEF) given by: (21) MEF(u)=i=1n(Xiu)i=1nI(Xi>u),(21) where n represents the total number of in-sample observations. The selection of each threshold u occurs when the MEF plot reaches a satisfactory level of linearity. The range of possible values for all chosen u in this study were analyzed by using the R-programming code. The GPD parameters, ξ and ψ, were estimated for each selected u by using the maximum likelihood method. The resulting u, the number of exceedances (k), and the estimated parameters ξ and ψ are presented in and . The unconditional VaRq and ESq are calculated by using the formula given in EquationEquations (9) and Equation(11) at 95% and 99% confidence level.

Table 5. The GPD parameter estimates during non-crisis period (2014–2019).

Table 6. The GPD parameter estimates during crisis period (2020–2022).

4.3. Backtesting VaR

The assessment of various volatility models for daily and intraday returns, in both normal and crisis times, has relied on the utilization of the violation ratio (VR) as an evaluative measure. According to the guidelines provided by Danielsson (Citation2011), a VR value within the range of [0.8,1.2] indicates a good forecast. However, the VR values outside this range suggest an imprecise or not good model. Nevertheless, the VR within [0.5,0.8] or [1.2,1.5] is still acceptable. Based on the results presented in , most of the HAR-type models combined with EVT are precise and acceptable to be used in the VaR forecast. Next, and reveal the VaR backtesting results by using UC and CC test. Based on the result, during normal and crisis period, all high-frequency HAR models accepted the null hypothesis at a 5% significance level which implies that the corresponding high-frequency VaR models have adequate forecasting ability. The low-frequency models of GARCH-EVT and EGARCH-EVT reject the null hypothesis at 5% significance level which means the low-frequency models combined with EVT are not suitable to forecast the VaR during both normal and crisis period.

Table 7. The VaR violation ratios of the Conditional-EVT Model.

Table 8. The UC and CC test statistics of the Conditional-EVT Model during normal period.

Table 9. The UC and CC test statistics of the Conditional-EVT Model during crisis period.

Since HAR type-models passed all the first stage backtesting procedure by using the statistical test of the UC and CC test, these models will be tested again by using the RLF and FLF lost function on the VaR accuracy. In , during the normal period, the HAR-EVT model exhibits better performance than alternative HAR models in forecasting VaR. Conversely, in the period of crisis, the HAR-EGARCH-EVT model demonstrated the most superior performance by achieving the lowest values for both the RLF and FLF loss functions.

Table 10. The RLF and FLF values.

4.4. Backtesting ES

Based on the results obtained in , the low-frequency conditional EVT models have produced a larger value of GB, which indicates that there are severe breaches in the data. Additionally, the ES estimate is inaccurate based on the significant p-value. If the p-value is significant, the H0 is rejected, suggesting that the low-frequency conditional EVT models are not adequate for accurately estimating the ES during a non-crisis period. However, during the crisis period, it shows that the low-frequency conditional EVT models are accurate to be used in estimating the ES. The high-frequency conditional EVT models generate lower values of the GB which suggests that the actual losses are closer to the expected losses calculated from the ES estimate, indicating that the ES estimate is more accurate. The bold figure shows the accurate ES measure by using the HAR-type models.

Table 11. The backtesting procedure of expected shortfall.

5. Discussion

The result provided in the previous section shows that for VaR forecast, the result of the the first stage basktesting procedure reveal that the HAR-EVT typed-models outperformed other risk models that apply the GARCH and RV specification with EVT. Furthermore, the result from the second stage of the bactesting procedure shown that among all hybrid version of HAR-EVT models, the HAR-EGARCH-EVT model exhibits superior performance in forecasting the VaR during both normal and crisis period as summarized in .

Table 12. The best performance models based on the VaR forecast.

On the other hand, based on the ES forecast, during the non-crisis period, based on samples of DJIA and FTSE indices, the result shows insignificant values, suggesting that the HAR specification models combined with EVT are adequate in estimating the ES during the non-crisis period. This result is consistent with the VaR forecast result where HAR-EVT typed models outperformed other models in estimating the risk. However, during the crisis period, GARCH-EVT, EGARCH-EVT and FIEGARCH-EVT show insignificant values and suitable to estimate expected shortfall with higher severity of the breach. There is no breach detected in the HAR-EVT typed models during the crisis period. The result is not consistent with the VaR forecast. The reason might be because the small sample of sizes used to forecast the ES during the crisis period make it difficult to detect the number of breaches by using HAR specification that based on monthly and weekly data.

This research findings also provide practitioners and policymakers with the best volatility forecast model to be used during any extreme market sudden event. There are unique features possessed by high-frequency data but lacking in low-frequency data which make it more suitable to be used in the current volatility estimation model. The findings of this research study are also beneficial to market participants, especially those who engage in high-frequency trading to utilize the RV and HAR models as well the application of the EVT model to measure the potential risks and detect any vulnerabilities in financial systems especially during a sudden market crash. In the context of economic sustainability, the research findings help to provide an accurate risk model which will help in reducing the unanticipated risk in the stock market and maintain the stability of financial systems thus sustaining the global economics. Furthermore, this research study will provide insights into the superiority of high-frequency data in comparison to low-frequency data. An increase in high-frequency research has the potential to propel future technological advancements, particularly in artificial intelligence (AI) and machine learning. This surge could notably bolster the processing of high-frequency data by streamlining and automating the analysis procedures. Larger data volumes can be handled in a more efficient and precise manner, obtaining more valuable information and trends faster than traditional methods. Over time, the prospect of diminishing the overall cost associated with processing high-frequency data is highly probable. This reduction could make data processing more accessible and affordable for smaller-scale and individual investors.

6. Conclusion

The utilization of daily returns for risk prediction through EVT has experienced a significant transformation, transitioning from the traditional EVT approach to the more advanced hybrid method called a conditional EVT model. The research on how the conditional EVT outperforms other models is still ongoing until 2023 with different hybrid methods among all available risk models. The emergence of high-frequency data contributes to another area of research on the efficiency of the low-frequency conditional EVT models compared to the high-frequency conditional EVT models. Researchers discovered that, for the most part, when high-frequency data was compared, the conditional EVT exhibited superior performance as discussed earlier in the literature review. Many investigations have been carried out to validate the advantages of integrating the RV specification into a hybrid model alongside the EVT method. The result from this study highlighted that high-frequency conditional EVT models can surpass low-frequency conditional EVT models in risk forecasting the VaR and ES. The summary result suggests that all models that combined the HAR specification with EVT have passed all first stage VaR backtesting procedure during both non-crisis and crisis periods which concludes that the combination of the HAR model and EVT approach can produce precise predictions of market risk, especially during sudden market crashes such as the COVID-19 pandemic period.

This research study contributes to the theoretical and practical implications in terms of its findings. Theoretically, this study differs from previous research and contributes to current literature on the forecasting ability of risk models by concentrating on the hybrid model of long-memory models (FIEGARCH, RV-FIEGARCH and HAR-FIEGARCH) for various types of data (daily and intraday returns) with the EVT approach. This research filled the gap by introducing the exponential variant of the fractionally integrated RV model, known as the RV-FIEGARCH model that not only encompasses the long-memory aspect but also accounts for the asymmetric effects in the return series. In addition to this, the research study also implemented the HAR framework, by utilizing the daily, weekly, and monthly data within the exponential fractionally integrated RV model, referred to as the HAR-RV-FIEGARCH model, to assess its potential in enhancing the accuracy of forecasting particularly during periods of crisis and stability. The result of this research verifies that the HAR specifications combined with the EVT approach including the HAR-FIEGARCH-EVT outperformed other risk models during periods of for both developed and emerging markets.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Additional information

Notes on contributors

Nor Azliana Aridi

Nor Azliana Aridi is a senior lecturer at the Faculty of Management, Multimedia University (MMU) in Cyberjaya, Malaysia. She specializes in teaching Mathematics and Statistics to both foundation and undergraduate students. She holds a bachelor’s degree in mathematical science from International Islamic University Malaysia (IIUM) obtained in 2007 and a master’s degree in mathematics from the National University of Malaysia (UKM) obtained in 2009. Recently in March 2024, she completed her PhD in Management, specializing in Financial Econometrics at MMU. Nor Azliana is an active researcher, frequently presenting at international conferences and publishing journal papers. Her research interests focus on financial econometrics, and she is proficient in using various statistical software, including R, EViews, OxMetrics, and SPSS. Additionally, she has been successful in securing internal research grants to support her work. In her role at MMU, she also takes on administrative and management tasks, contributing to her versatility and well-rounded skill set within the academic community.

Tan Siow Hooi

Tan Siow Hooi is an Associate Professor at Faculty of Management, Multimedia University. She received her PhD degree majoring in Financial Economics from Faculty of Economics and Management, Universiti Putra Malaysia. She has many years of experience teaching a variety of economics and econometrics courses at both the undergraduate and graduate levels. Her research profile is interdisciplinary. She has published widely on a number of issues in the field of behavioural finance, tourism and cultural heritage, as well as environmental responsibility. Topics of interest in recent years include predictability of financial markets, technology for elderly, placed-based education and heritage. Her articles have been published in Web of Science SSCI indexed journals, including The North American Journal of Economics and Finance, Journal of Behavioural Finance, Journal of Environmental Management, Annals of Tourism Research, Current Issues in Tourism, Tourism Management, Tourism Economics, among many others. She has extensively used a wide range of statistical packages, which include but are not limited to EViews, STATA, Matlab, R, SPSS, AMOS, Excel and Google Spreadsheet in her research works as well as in statistical laboratory for the teaching of multivariate data analysis, time series analysis and forecasting, and econometrics. She is a HRDF-certified trainer for Train-the-Trainer (TTT) and Alibaba Certified Global E-Commerce Talent (GET).

Chin Wen Cheong

Chin Wen Cheong is affiliated to Xiamen University, Malaysia. Chin Wen Cheong is currently providing services as Associate Professor in School of Mathematics and Physics. He obtained his bachelor’s degree in Physics from University of Malaya, Malaysia (1998), master’s degree in Applied Statistics from University of Putra Malaysia, Malaysia (2003) and Ph. D Degree (Statistics) from National University Malaysia, Malaysia (2011). He has authored and co-authored multiple peer-reviewed scientific papers and presented works at many national and international conferences. His contributions have acclaimed recognition from honourable subject experts around the world. Chin Wen Cheong is actively associated with different societies and academies. His academic career is decorated with several reputed awards and funding. His research interests include time series and risk management.

References

  • Altun, E., & Tatlidil, H. (2015). A comparison of extreme value theory with heavy - tailed distributions in modeling daily VaR. Journal of Finance and Investment Analysis, 4, 1–5.
  • Balkema, A. A., & Haan, L. D. (1974). Residual life time at great age. The Annals of Probability, 2(5), 792–804. https://doi.org/10.1214/aop/1176996548
  • Basel Committee on Banking Supervision. (1996). Supervisory framework for the use of “backtesting” in. conjunction with the internal models approach to market risk capital requirements. Bank for International Settlements, Basle, Report nr. 22, http://www.bis.org
  • Basuony, M. A., Bouaddi, M., Ali, H., & EmadEldeen, R. (2022). The effect of COVID-19 pandemic on global stock markets: Return, volatility, and bad state probability dynamics. Journal of Public Affairs, 22(S1), e2761. https://doi.org/10.1002/pa.2761
  • Bee, M., Dupuis, D. J., & Trapin, L. (2016). Realizing the extremes: Estimation of tail-risk measures from a high-frequency perspective. Journal of Empirical Finance, 36, 86–99.
  • Cheong, C. H., Cherng, L. M., & Yap, G. L. C. (2016). Heterogeneous market hypothesis evaluations using various jump-robust realized volatility. Romanian Journal of Economic Forecasting, XIX(4), 50–64.
  • Christoffersen, P. F. (1998). Evaluating interval forecasts. International Economic Review, 39(4), 841–862. https://doi.org/10.2307/2527341
  • Costanzino, N., & Curran, M. (2015). Backtesting general spectral risk measures with application to expected shortfall. Available at SSRN 2514403.
  • Dacorogna, M., Mller, U., Olsen, R., & Pictet, O. (2001). Defining efficiency in heterogeneous markets. Quantitative Finance, 1(2), 198–201. https://doi.org/10.1088/1469-7688/1/2/602
  • Danielsson, J. (2011). Financial risk forecasting: The theory and practice of forecasting market risk with implementation in R and Matlab. John Wiley & Sons.
  • Djuraskovic, O. (2023). Big data statistics 2023: How much data is in the world? First Site Guide. https://firstsiteguide.com/big-data-stats/
  • Embrechts, P., Klüppelberg, C., & Mikosch, T. (2013). Modelling extremal events: for insurance and finance (Vol. 33). Springer Science & Business Media.
  • Fama, E. (1965). The behavior of stock-market prices. The Journal of Business, 38(1), 34–105. https://doi.org/10.1086/294743
  • Huang, Z., Liu, H., & Wang, T. (2016). Modeling long memory volatility using realized measures of volatility: A realized HAR GARCH model. Economic Modelling, 52(Part B), 812–821. https://doi.org/10.1016/j.econmod.2015.10.018
  • Hui, E. C. M., & Chan, K. K. K. (2022). How does Covid-19 affect global equity markets? Financial Innovation, 8(1), 25. https://doi.org/10.1186/s40854-021-00330-5
  • Jafry, N. H. A., Razak, R. A., & Ismail, N. (2020). Dependence measure of five-minutes returns compared to daily returns using static and dynamic copulas. Sains Malaysiana, 49(8), 2023–2034.
  • James, R., & Leung, J. W. Y. (2023). Forecasting financial risk using quantile random forests. Available at SSRN 4324603.
  • Jin, F., Li, J., & Li, G. (2022). Modeling the linkages between Bitcoin, gold, dollar, crude oil, and stock markets: A GARCH-EVT-copula approach. Discrete Dynamics in Nature and Society, 2022, 1–10. https://doi.org/10.1155/2022/8901180
  • Kickbusch, I., Leung, G. M., Bhutta, Z. A., Matsoso, M. P., Ihekweazu, C., & Abbasi, K. (2020). Covid-19: How a virus is turning the world upside down. BMJ, 369, m1336. https://doi.org/10.1136/bmj.m1336
  • Kupiec, P. H. (1995). Techniques for verifying the accuracy of risk measurement models. The Journal of Derivatives, 3(2), 73–84. https://doi.org/10.3905/jod.1995.407942
  • Liu, G., Yu, C. P., Shiu, S. N., & Shih, I. T. (2022). The efficient market hypothesis and the fractal market hypothesis: Interfluves, fusions, and evolutions. SAGE Open, 12(1), 215824402210821. https://doi.org/10.1177/21582440221082137
  • Lopez, J. A. (1998). Methods for evaluating value-at-risk estimates. Economic Policy Review, 4(3), 119–991124.
  • Louzis, D. P., Sissinis, S. X., & Refenes, A. (2007). Stock index Value-at Risk forecasting: A realized volatility extreme value theory approach. Economics Bulletin, 32(1), 981–991.
  • Lyócsa, S., Molnár, P., & Výrost, T. (2021). Stock market volatility forecasting: Do we need high-frequency data? International Journal of Forecasting, 37(3), 1092–1110. https://doi.org/10.1016/j.ijforecast.2020.12.001
  • Maaz, K., Mrestyal, K., & Muhammad, I. (2023). Estimating value-at-risk and expected shortfall of metal commodities: Application of GARCH-EVT method. Journal of Risk Management in Financial Institutions, Henry Stewart Publications, 16(2), 189–199.
  • Mcneil, A. J., & Frey, R. (2000). Estimation of tail-related risk measures for heteroscedastic financial time series: an extreme value approach. Journal of Empirical Finance, 7(3-4), 271–300. https://doi.org/10.1016/S0927-5398(00)00012-8
  • Omari, C., Mundia, S., & Ngina, I. (2020). Forecasting value-at-risk of financial markets under the global pandemic of COVID-19 using conditional extreme value theory. Journal of Mathematical Finance, 10(04), 569–597. https://doi.org/10.4236/jmf.2020.104034
  • Paul, S., & Sharma, P. (2018). Quantile forecasts using the realized GARCH-EVT approach. Studies in Economics and Finance, 35(4), 481–504. https://doi.org/10.1108/SEF-09-2016-0236
  • Pickands, J. (1975). Statistical inference using extreme order statistics. The Annals of Statistics, 3(1), 119–131.
  • Rakshit, B., & Neog, Y. (2022). Effects of the COVID-19 pandemic on stock market returns and volatilities: evidence from selected emerging economies. Studies in Economics and Finance, 39(4), 549–571. https://doi.org/10.1108/SEF-09-2020-0389
  • Roy, A. (2022). Efficacy of GARCH-EVT model in intraday risk management: Evidence from severely pandemic-affected countries. Global Business Review, 097215092211048. https://doi.org/10.1177/09721509221104848
  • Sarma, M., Thomas, S., & Shah, A. (2003). Selection of value-at-risk models. Journal of Forecasting, 22(4), 337–358. https://doi.org/10.1002/for.868
  • Sojka, B. B. (2015). Daily VaR forecasts with realized volatility and GARCH Models. Argumenta Oeconomica, 34(1), 157–173.
  • Soltane, H. B., Karaa, A., & Bellalah, M. (2012). Conditional VaR using GARCH-EVT approach: forecasting volatility in Tunisian financial market. Journal of Computations & Modelling, 2(2), 95–115.
  • Tabasi, H., Yousefi, V., Tamošaitienė, J., & Ghasemi, F. (2019). Estimating conditional value at risk in the Tehran stock exchange based on the extreme value theory using GARCH models. Administrative Sciences, 9(2), 40. https://doi.org/10.3390/admsci9020040
  • Tao, Q., Wei, Y., Liu, J., & Zhang, T. (2018). Modeling and forecasting multifractal volatility established upon the heterogeneous market hypothesis. International Review of Economics & Finance, 54, 143–153. https://doi.org/10.1016/j.iref.2017.08.003
  • Ullah, S. (2022). Impact of COVID-19 pandemic on financial markets: A global perspective. Journal of the Knowledge Economy, 14(2), 982–1003. https://doi.org/10.1007/s13132-022-00970-7
  • Watanabe, T. (2012). Quantile forecasts of financial returns using realized Garch models. Japanese Economic Review, 63(1), 68–80. https://doi.org/10.1111/j.1468-5876.2011.00548.x
  • Youssef, M., Belkacem, L., & Mokni, K. (2015). Value-at-risk estimation of energy commodities: A long-memory GARCH–EVT approach. Energy Economics, 51, 99–110. https://doi.org/10.1016/j.eneco.2015.06.010
  • Zahid, M., Iqbal, F., Raziq, A., & Sheikh, N. (2022). Modeling and forecasting the realized volatility of bitcoin using realized HAR-GARCH-type models with jumps and inverse leverage effect. Sains Malaysiana, 51(3), 929–942. https://doi.org/10.17576/jsm-2022-5103-25
  • Zhang, Z., & Zhang, H. K. (2016). The dynamics of precious metal markets VaR: A GARCH-EVT approach. Journal of Commodity Markets, 4(1), 14–27. https://doi.org/10.1016/j.jcomm.2016.10.001
  • Zhao, J., Cui, L., Liu, W., & Zhang, Q. (2023). Extreme risk spillover effects of international oil prices on the Chinese stock market: A GARCH-EVT-Copula-CoVaR approach. Resources Policy, 86(Part B), 104142. https://doi.org/10.1016/j.resourpol.2023.104142