Full article: Singular Conditional Autoregressive Wishart Model for Realized Covariance Matrices

Formulae display: $MathJax Logo$ ?Mathematical formulae have been encoded as MathML and are displayed in this HTML version using MathJax in order to improve their display. Uncheck the box to turn MathJax off. This feature requires Javascript. Click on a formula to zoom.

Abstract

Realized covariance matrices are often constructed under the assumption that richness of intra-day return data is greater than the portfolio size, resulting in nonsingular matrix measures. However, when for example the portfolio size is large, assets suffer from illiquidity issues, or market microstructure noise deters sampling on very high frequencies, this relation is not guaranteed. Under these common conditions, realized covariance matrices may obtain as singular by construction. Motivated by this situation, we introduce the Singular Conditional Autoregressive Wishart (SCAW) model to capture the temporal dynamics of time series of singular realized covariance matrices, extending the rich literature on econometric Wishart time series models to the singular case. This model is furthermore developed by covariance targeting adapted to matrices and a sector wise BEKK-specification, allowing excellent scalability to large and extremely large portfolio sizes. Finally, the model is estimated to a 20-year long time series containing 50 stocks and to a 10-year long time series containing 300 stocks, and evaluated using out-of-sample forecast accuracy. It outperforms the benchmark models with high statistical significance and the parsimonious specifications perform better than the baseline SCAW model, while using considerably less parameters.

Keywords:

1 Introduction

The covariance matrix of asset returns plays a key role in several financial applications, such as portfolio allocation, risk management and option pricing. It is well-documented that this quantity changes over time, why describing and understanding its temporal dynamics is fundamental to financial decision making. A typical approach is to capture this evolution in discrete time by applying multivariate GARCH-type models, summarized in Bauwens, Laurent, and Rombouts (Citation2006), where the conditional covariance matrix is determined by past observations of daily returns. Another classic method is to use multivariate stochastic volatility-type models, reviewed in Asai, McAleer, and Yu (Citation2006), in where the covariance matrix process is assumed to be random.

During the last two decades, increased availability of asset price data on high frequencies has paved the way for numerous novel approaches in this area. Many of them are built upon the notion of realized covariance, where the daily return covariance matrix is estimated by a large number of intra-day returns, on for example 5 or 10 min intervals (see e.g., Andersen et al. Citation2003; Barndorff-Nielsen and Shephard Citation2004). Modeling the time series dynamics for realized covariance matrices in discrete time has become a large branch in the econometric literature. A popular approach is to assume the underlying stochastic process to be Wishart, a well-studied distribution that ensures positive-definiteness almost surely. For example, the Wishart Autoregressive (WAR) model introduced in Gouriéroux, Jasiak, and Sufana (Citation2009), assumes realized covariances are conditionally distributed as noncentral Wishart, where the noncentrality parameter is described by historical realized covariance matrices. The High-Frequency-Based Volatility (HEAVY) model presented in Noureldin, Shephard, and Sheppard (Citation2012) and the Conditional Autoregressive Wishart (CAW) model introduced by Golosnoy, Gribisch, and Liesenfeld (Citation2012) rely on the centralized Wishart distribution, where the scale matrix parameters are determined by past observations. A central Wishart distribution is also considered in Jin and Maheu (Citation2012), but here the scale matrix is decomposed into either multiplicative components or additive components determined by sample means of historical realized covariances. The General Conditional Autoregressive Wishart (GCAW) model is proposed in Yu, Li, and Ng (Citation2017), parameterized with both a noncentral parameter as in the WAR model and a scale matrix as in CAW model. In Anatolyev and Kobotaev (Citation2018), the CAW model is extended to the Conditional Threshold Autoregressive Wishart (CTAW) model with the aim to include the effects of price asymmetry on future realized covariances. Goodness-of-fit tests for models driven by an underlying central Wishart distribution, such as the CAW model, is presented in Alfelt, Bodnar, and Tyrcha (Citation2020).

All of the models discussed above assume a realized covariance matrix that is positive definite. This can be ensured as long as the number, n, of intra-day returns used to compute the realized covariance matrix is larger than or equal to the number, m, of assets included into the portfolio. Regarding small and moderately sized portfolios or reasonably liquid assets, this relation can often be justified. However, in many applications it is of interest to consider portfolios of large dimensions, containing perhaps 50, 100 or even 1000 assets (see e.g., Rubio, Mestre, and Palomar Citation2012; Hautsch, Kyj, and Malec Citation2015; Ledoit and Wolf Citation2017; Ao, Yingying, and Zheng Citation2019; Cai et al. Citation2020; Bodnar et al. Citation2021; Ding, Li, and Zheng Citation2021; Bodnar, Bodnar, and Parolya Citation2022; Bodnar, Okhrin, and Parolya 2022). Furthermore, available data for the portfolio assets might be restricted, due to for example low liquidity resulting in a few price quotes per day. In addition, there might be limits to how high of an intra-day return sample frequency that is suitable, in presence of so-called market microstructure noise (see, e.g., Aït-Sahalia and Yu 2009). Any combination of these factors might result in a situation where m > n, and hence daily realized covariance matrices that are singular. Finally, Jacod and Podolskij (Citation2013) derived an asymptotic test for inferring the rank of multivariate volatility processes.

To solve the problem with the singularity of the covariance matrix, one can consider data of higher frequency and/or can make use of the ultra high frequency data (see e.g., Engle Citation2000; Aït-Sahalia, Mykland, and Zhang Citation2011; Christensen, Oomen, and Podolskij Citation2014). On the other side, in spite of the broad availability of ultra high-frequency data, there are several reasons for possibly using moderate intra-daily sample sizes for the purpose of realized covariance estimation, namely market microstructure noise and zeros. Both of the issues are largely present due to illiquidity of stocks traded on financial markets.

The microstructure noise can directly be incorporated in the model for stock prices (see e.g., Bandi and Russell Citation2006, Citation2008; Bibinger et al. Citation2014, Citation2019), while the illiquidity issue related to nonsynchronous trading can be treated as a missing-value problem (see e.g., Corsi, Peluso, and Audrino Citation2015; Shephard and Xiu Citation2017; Buccheri et al. Citation2021), where the idea is to use all available price data and to perform the estimation of model parameters by employing the Kalman filter recursion together with the ordinary least squares estimation or quasi maximum likelihood estimation. To this end, the presence of zeros in the price process, that is, the periods where the price of a stock does not change, is another source that influences the properties of the realized covariance matrices and the way how they are constructed. The issue has recently been discussed in a number of literature studies. While Bandi, Pirino, and Renò (Citation2017) and Bandi et al. (Citation2020a, Citation2020b) consider the case of continuous-time modeling, the results in the discrete-time case can be found in Catania, Mari, and de Magistris (Citation2020) and Sucarrat and Grønneberg (Citation2020).

The above mentioned approaches deal with modeling the price process. Although the developed methods suggest an efficient way how to deal with the microstructure noise, zeros and nonsynchronous trading, the issues might become severe when the dimension of the holding portfolio becomes large and as such high-dimensional realized covariance matrices should be constructed from the intra-day data. To reduce the impact of nonsynchronous trading and the presence of zeros in the prices, one can reduce the sample frequency. As such the realized covariance matrices might become singular by construction and new econometric models should be developed. This is the aim of this article, namely to focus on time series models of singular realized covariance matrices directly, and to extend the family of econometric autoregressive Wishart models to the singular case by introducing the Singular Conditional Autoregressive Wishart (SCAW) model to describe such time series. It is based on the assumption that realized covariance matrices follow a conditional singular Wishart distribution, described in, for example, Srivastava (Citation2003) and Bodnar and Okhrin (Citation2008), where the scale matrix parameter is determined by historical observations in an autoregressive fashion similar to the BEKK-specification of Engle and Kroner (Citation1995), alike for example the CAW model in Golosnoy, Gribisch, and Liesenfeld (Citation2012). This specification ensures positive definiteness and allows us to directly estimate the model parameter with the maximum likelihood method. Furthermore, parameter-based conditions for weak stationarity of the model are deduced. To this end, SCAW model coincides with the multivariate GARCH process based on the BEKK specification when the intra-day data are replaced by daily asset returns, that is, when n = 1.

Since the singular case is closely related to portfolios of large dimensions, a challenge in this setting is to capture the temporal dynamics of the time series, while simultaneously maintaining a parsimonious model that can be feasibly estimated. To deal with this scaling challenge, two novel approaches are introduced. The first one regards covariance targeting (see e.g., Pedersen and Rahbek Citation2014), where the approach of Noureldin, Sheppard, and Shephard (Citation2014) is adapted to the matrix case. It concerns standardizing the time series by its unconditional mean, which allows implementing straightforward conditions on the model parameters such that positive definiteness is maintained also under a covariance targeting regime. This method circumvents estimating the large number of parameters present in the constant matrix of the BEKK-specification, greatly increasing estimation feasibility. The second approach uses the similarity of assets that belong to the same market sector. This specification assumes that the parameters governing temporal dynamics of the matrix time series are homogeneous for assets of the same sector. As such, the number of parameters does not depend on the number of portfolio assets, but rather of the number of market sectors these assets stem from. Combining these approaches results in a model that is well equipped for implementation on large or extremely large portfolios. In addition, an extension using the heterogeneous autoregressive (HAR) specification, adapted from Corsi (Citation2009), is applied to the SCAW model. This approach considers long-time memory dependence by including historical realized covariance matrices on longer horizons, such as weekly or monthly.

In the empirical part of the article the SCAW model with various specifications is estimated to a time series of 50 and 300 assets traded on the National Association of Securities Dealers Automated Quotations (NASDAQ) over 20 and 10 years, respectively. It is evaluated by several out-of-sample forecast measures and benchmarked against similarly specified Multivariate GARCH models and two DCC-type models extended to model large-dimensional dynamic covariance matrices (see Engle Citation2002; Engle, Ledoit, and Wolf Citation2019; Ledoit and Wolf Citation2020; De Nard et al. Citation2020).

The results of the empirical study reveal that the SCAW model outperforms the benchmark models with high statistical significance for the vast majority of the forecasts measures. Moreover, it suggests that the presented sectorwise parameterization, scalar parameterization and HAR-extension each can be useful, outperforming the original SCAW model in out-of-sample forecast accuracy, despite having substantially fewer parameters.

The rest of the article is organized as follows. Section 2 introduces the SCAW model and presents its stochastic properties. In Section 3, covariance targeting, the sectorwise specification, and the HAR-extension are introduced. Section 4 governs the estimation procedure for the SCAW model with its various specifications. The empirical application is presented in Section 5, while Section 6 concludes. Proofs of the obtained theoretical results and some tables with the results of the empirical study can be found in the supplementary materials.

2 Singular Conditional Autoregressive Wishart (SCAW) Model

Let $R_{t}$ be an m × m realized covariance matrix, constructed using n intra-day return vectors recorded during day t. In addition, suppose that the number of intra-day return vectors used in the computation of $R_{t}$ is less than the dimension of these vectors, such that n < m. As a result, $R_{t}$ is a singular matrix by construction. Furthermore, let ${R_{t}}$ be a discrete time series of such matrices, and let $F_{t}$ denote the filtration associated with ${R_{t}}$ . Now, assume that conditioned on $F_{t - 1}, R_{t}$ follows a singular Wishart distribution of dimension m. That is,(1) $R_{t} | F_{t - 1} \sim S W_{m} (n, S_{t} / n),$ (1) where $S W_{m} (ν, Σ)$ denote the singular Wishart distribution with degrees of freedom ν and symmetric, positive-definite scale matrix $Σ$ of dimension m × m. In addition, since $E [R_{t} | F_{t - 1}] = S_{t}$ , the matrix $S_{t}$ is the conditional mean matrix of ${R_{t}}$ . Note that the singularity of $R_{t}$ stems from the degrees of freedom n being lower than the matrix dimension m, while the conditional mean matrix, $S_{t}$ , is assumed to be nonsingular.

Now, let $R_{t}$ be partitioned as(2) $R_{t} = [\begin{matrix} R_{11, t} & R_{12, t} \\ R_{21, t} & R_{22, t} \end{matrix}],$ (2) where $R_{11, t}$ is an n × n nonsingular matrix, $R_{12, t}$ is $n \times (m - n), R_{21, t} = R_{12, t}^{'}$ and $R_{22, t}$ is $(m - n) \times (m - n)$ with $R_{22, t} = R_{21, t} R_{11, t}^{- 1} R_{12, t}$ . That any singular, symmetric matrix can be partitioned this way is shown by, for example, Harville (Citation1997, Lemma 9.2.2). Consequently, in accordance with Srivastava (Citation2003) regarding the singular Wishart distribution, the conditional density for $R_{t}$ is given by(3) $\begin{matrix} f (R_{t} | F_{t - 1}) = \frac{π^{n (n - m) / 2}}{2^{m n / 2} Γ_{n} (n / 2) | S_{t} / n |^{n / 2}} | R_{11, t} |^{(n - m - 1) / 2} \\ exp (- \frac{1}{2} tr ({(S_{t} / n)}^{- 1} R_{t})) \\ = \frac{π^{n (n - m) / 2} n^{m - n / 2}}{2^{m n / 2} Γ_{n} (n / 2) | S_{t} |^{n / 2}} | R_{11, t} |^{(n - m - 1) / 2} \\ exp (- \frac{n}{2} tr (S_{t}^{- 1} R_{t})), \end{matrix}$ (3) where $Γ_{n} (\cdot)$ denotes the multivariate gamma function (see e.g., Gupta and Nagar Citation2000). In addition, the conditional covariance matrix of $R_{t}$ consists of the following elements(4) $cov [r_{i j, t}, r_{k l, t} | F_{t - 1}] = \frac{1}{n} (s_{i k, t} s_{j l, t} + s_{i l, t} s_{j k, t}),$ (4) for $i, j, k, l = 1, \dots, m$ , where $r_{i j, t}$ and $s_{i j, t}$ denotes the element on row i and column j of $R_{t}$ and $S_{t}$ , respectively.

The conditional mean matrix $S_{t}$ , which is measurable by $F_{t - 1}$ , captures the time series dynamics of singular realized covariance matrices ${R_{t}}$ . In the following it is assumed that(5) $S_{t} = CC' + \sum_{i = 1}^{p} B_{i} S_{t - i} B_{i}^{'} + \sum_{j = 1}^{q} A_{j} R_{t - j} A_{j}^{'},$ (5) where we will denote the lag order of the model by (p, q) and $A_{j}, B_{i}, C$ are m × m parameter matrices for $i = 1, \dots, p$ and $j = 1, \dots, q$ where C is lower-triangular with strictly positive diagonal elements. This form is similar to the BEKK specification of Engle and Kroner (Citation1995) in the multivariate GARCH case, which is also adapted for the CAW model in Golosnoy, Gribisch, and Liesenfeld (Citation2012). It ensures that $S_{t}$ is symmetric and positive definite as long as the initial matrices $S_{0}, S_{- 1}, \dots, S_{- p + 1}$ are symmetric and positive semidefinite. It is notable that the conditional covariance matrix in GARCH-BEKK(p, q) coincides with the expression presented in (5) with $R_{t - j} = x_{t - j} x_{t - j}^{'}$ , where $x_{t}$ is the one day return vector of day t. As such, the proposed SCAW model (1) and (5) is a generalization of the GARCH-BEKK(p, q) process, where the GARCH-BEKK(p, q) model is a special case corresponding to n = 1. Furthermore, the specification of the SCAW(p, q) process is similar to the CAW(p, q) model suggested in Golosnoy, Gribisch, and Liesenfeld (Citation2012) with the difference that the SCAW(p, q) process models singular realized covariance matrices, while Golosnoy, Gribisch, and Liesenfeld (Citation2012) consider nonsingular ones.

In the article we will further consider different structures of the parameter matrices $A_{j}, B_{i}, C, j = 1, \dots, q, i = 1, \dots, p$ . Since large dimensional cases will generally be discussed, the matrices $A_{j}, B_{i}, C$ need to be specified parsimoniously such that estimation of the model remains feasible. If for example one allows $A_{j}, B_{i}$ to be general m × m matrices and C lower-triangular, the model (5) will consist of $m (m + 1) / 2 + (p + q) m^{2}$ parameters. With a large dimensional case, such as m = 50 and $p = q = 2$ , this results in 11275 parameters, a formidable estimation exercise indeed.

2.1 Stochastic Properties of the SCAW Model

In this section we will present conditions under which the matrix-variate process ${R_{t}}$ is weakly stationary. As with the CAW(p, q) model in Golosnoy, Gribisch, and Liesenfeld (Citation2012), the stochastic properties of the SCAW(p, q) model defined in (1) and (5) are derived using the VARMA representation of the model. The proofs of the results presented in this section can be found in the supplement.

Let $vec (\cdot)$ be the vectorization operator and let $vech (\cdot)$ be the half-vectorization operator. The symbol $D_{m}$ denotes the duplication matrix, while $L_{m}$ stands for the elimination matrix.Footnote¹ We define $r_{t} = vech (R_{t}), s_{t} = vech (S_{t}), c = vech (CC'),$ such that the vector representation of (5) is(6) $s_{t} = c + \sum_{i = 1}^{p} ℬ_{i} s_{t - i} + \sum_{j = 1}^{q} 𝒜_{j} r_{t - j},$ (6) where $𝒜_{j}$ and $ℬ_{i}$ are k × k matrices, with $k = m (m + 1) / 2$ given by $𝒜_{j} = L_{m} (A_{j} \otimes A_{j}) D_{m}, ℬ_{i} = L_{m} (B_{i} \otimes B_{i}) D_{m},$ where the symbol ⊗ denotes the Kroenecker product. Furthermore, $r_{t}$ can be written as(7) $r_{t} = E [r_{t} | F_{t - 1}] + v_{t} = s_{t} + v_{t},$ (7) where $v_{t}$ is a martingale difference sequence such that $E [v_{t}] = 0 and E [v_{t} v_{s}^{'}] = 0, \forall s \neq t .$

Plugging in (7) into (6), the SCAW(p, q) model can be written with the following VARMA(max( $p, q), p$ ) representation:(8) $r_{t} = c + \sum_{i = 1}^{\max (p, q)} (𝒜_{i} + ℬ_{i}) r_{t - i} + v_{t} - \sum_{j = 1}^{p} ℬ_{j} v_{t - j},$ (8) where $𝒜_{i} = ℬ_{j} = 0$ for $i > q, j > p$ . From (8) the conditions for weak stationarity of $R_{t}$ can be obtained. First, we derive a condition for the existence of the unconditional expectation of the SCAW(p, q) process, given by the following proposition.

Proposition 1.

The unconditional expectation of the SCAW(p, q) model is finite if and only if all eigenvalues of the matrix(9) $Ψ_{1} = \sum_{i = 1}^{max (p, q)} (𝒜_{i} + ℬ_{i})$ (9) are less than 1 in modulus. In that case the unconditional expectation is given by(10) $E [r_{t}] = \bar{r} = {(I_{k} - \sum_{i = 1}^{max (p, q)} (𝒜_{i} + ℬ_{i}))}^{- 1} c .$ (10)

EquationEquation (8)(8) $r_{t} = c + \sum_{i = 1}^{\max (p, q)} (𝒜_{i} + ℬ_{i}) r_{t - i} + v_{t} - \sum_{j = 1}^{p} ℬ_{j} v_{t - j},$ (8) can also be represented as an infinite vector moving average time series by (see, e.g., sec. 11.3 and 11.4 in Lütkepohl Citation2005)(11) $r_{t} = \bar{r} + \sum_{i = 0}^{\infty} Φ_{i} v_{t - i}, where$ (11) (12) $Φ_{i} = - ℬ_{i} + \sum_{j = 1}^{i} (𝒜_{j} + ℬ_{j}) Φ_{i - j}, i, j = 1, 2, \dots,$ (12) (13) $Φ_{0} = I_{m} .$ (13)

Moreover, given that they exist, the autocovariances of $r_{t}$ can then be expressed as(14) $E [(r_{t} - \bar{r}) (r_{t - τ} - \bar{r})'] = \sum_{i = 0}^{\infty} Φ_{i + τ} E [v_{t} v_{t}^{'}] Φ_{i}^{'} .$ (14)

Let(15) $\begin{matrix} Ω = \frac{1}{n} (L_{m} \otimes L_{m}) [I_{m^{2}} \otimes (I_{m^{2}} + K_{m m})] \\ (I_{m} \otimes K_{m m} \otimes I_{m}) (D_{m} \otimes D_{m}), \end{matrix}$ (15) where $K_{m m}$ is the commutation matrix.Footnote² Then the following holds.

Proposition 2.

The unconditional second moment of the SCAW(p, q) model is finite if and only if all eigenvalues of the matrix(16) $Ψ_{2} = \sum_{i = 1}^{\infty} (Φ_{i} \otimes Φ_{i}) Ω$ (16) are less than 1 in modulus. In that case the second moment is given by(17) $vec (E [r_{t} r_{t}^{'}]) = (Ω + I_{k^{2}}) {(I_{k^{2}} - \sum_{i = 1}^{\infty} (Φ_{i} \otimes Φ_{i}) Ω)}^{- 1} vec (\bar{r} \bar{r}'),$ (17) with $\bar{r}$ given by (10).

Proposition 3.

Given that the unconditional second moments of the SCAW(p, q) model exist, the autocovariance matrix at lag τ is given by $\begin{matrix} vec (E [(r_{t} - \bar{r}) (r_{t - τ} - \bar{r})']) = \sum_{i = 0}^{\infty} (Φ_{i + τ} \otimes Φ_{i}) Ω \\ {( I_{k^{2}} - \sum_{i = 1}^{\infty} (Φ_{i} \otimes Φ_{i}) Ω )}^{ - 1} vec (\bar{r} \bar{r}') . \end{matrix}$

As such, the process ${R_{t}}$ under the SCAW(p, q) model defined by (1)–(5) is weakly stationary if and only if the eigenvalues of the matrix (16) are less than 1 in modulus.

3 Parameterization

As mentioned in Section 2, when the dimension of ${R_{t}}$ grows large, it is important to parameterize $S_{t}$ in (5) parsimoniously, in order to maintain feasible estimation. Simultaneously, the specification must be rich enough to capture the time series dynamics observed in data. This section discusses several parameterizations that can be applied to this end.

3.1 Covariance Targeting

The constant term $CC'$ in (5) consists of $m (m + 1) / 2$ parameters, rapidly increasing the estimation burden as the portfolio size grows. One approach to reduce the number of parameters is to consider covariance targeting, an extension of the idea of variance targeting (see Engle and Mezrich Citation1996), where the constant term $CC'$ is consistently estimated (see Pedersen and Rahbek Citation2014) as follows. Let $R_{t} = S_{t} + V_{t}$ , where $V_{t}$ is a martingale difference, s.t. $E [V_{t}] = 0$ . Further denote the unconditional mean of ${R_{t}}$ as $E [R_{t}] = \bar{S}$ . Then we can write (5) as $\begin{matrix} S_{t} = CC' + \sum_{i = 1}^{p} B_{i} S_{t - i} B_{i}^{'} + \sum_{j = 1}^{q} A_{j} R_{t - j} A_{j}^{'} \\ R_{t} - V_{t} = CC' + \sum_{i = 1}^{p} B_{i} (R_{t - i} - V_{t - i}) B_{i}^{'} + \sum_{j = 1}^{q} A_{j} R_{t - j} A_{j}^{'} . \end{matrix}$

Taking unconditional expectations we obtain $\begin{matrix} E [R_{t}] = CC' + \sum_{i = 1}^{p} B_{i} E [R_{t}] B_{i}^{'} + \sum_{j = 1}^{q} A_{j} E [R_{t}] A_{j}^{'} \\ \bar{S} = CC' + \sum_{i = 1}^{p} B_{i} \bar{S} B_{i}^{'} + \sum_{j = 1}^{q} A_{j} \bar{S} A_{j}^{'} . \end{matrix}$ such that(18) $CC' = \bar{S} - \sum_{i = 1}^{p} B_{i} \bar{S} B_{i}^{'} - \sum_{j = 1}^{q} A_{j} \bar{S} A_{j}^{'} .$ (18)

The idea is then to replace $CC'$ in (5) by the expression (18), and to estimate $\bar{S}$ by the sample mean of the process. This specification determines the constant term $CC'$ by the persistence parameters $A_{j}$ and $B_{i}$ , such that $k = m (m + 1) / 2$ parameters less needs to be estimated in the model.

In order to ensure that the expression (18) is positive-definite, particular restrictions on the parameter matrices $A_{j}$ and $B_{i}$ must be imposed, and in general it is difficult to specify such conditions. One approach to circumvent this issue is considered in Noureldin, Sheppard, and Shephard (Citation2014) in the case of an ARCH model, where the original series of return vectors is rotated by its unconditional mean, to create a standardized series of returns particularly suitable to model with covariance targeting. In this article we adapt this approach to singular realized covariance matrices in order to obtain a parsimonious parameterization. To this end, apply the eigenvalue decomposition to the unconditional expectation such that $\bar{S} = P Λ P',$ where P is a matrix with eigenvectors of $\bar{S}$ as columns and $Λ$ is a diagonal matrix with the eigenvalues of $\bar{S}$ as diagonal entries. Note that although ${R_{t}}$ is a series of singular matrices, its unconditional mean $\bar{S}$ is nonsingular and as such all the eigenvalues in $Λ$ are positive. Further note that the symmetric square root of $\bar{S}$ is ${\bar{S}}^{1 / 2} = P Λ^{1 / 2} P'$ and that ${\bar{S}}^{- 1 / 2} = P Λ^{- 1 / 2} P'$ , since P is an orthogonal matrix.

Next we define the standardized realized covariance as(19) $E_{t} = {\bar{S}}^{- 1 / 2} R_{t} ({\bar{S}}^{- 1 / 2})' = P Λ^{- 1 / 2} P' R_{t} P Λ^{- 1 / 2} P' .$ (19) which has expected value $E [E_{t}] = {\bar{S}}^{- 1 / 2} E [R_{t}] ({\bar{S}}^{- 1 / 2})' = {\bar{S}}^{- 1 / 2} \bar{S} ({\bar{S}}^{- 1 / 2})' = I_{m} .$

Similarly, define $G_{t} = {\bar{S}}^{- 1 / 2} S_{t} ({\bar{S}}^{- 1 / 2})'$ , such that(20) $E_{t} \sim S W_{m} (n, G_{t} / n)$ (20)

due to the affine transformation property of the singular Wishart distribution (see Theorem 2 of Bodnar, Mazur, and Okhrin Citation2014). As such, we will model $G_{t}$ with an specification equivalent to (5) given by $G_{t} = \tilde{C} \tilde{C}' + \sum_{i = 1}^{p} {\tilde{B}}_{i} G_{t - i} {\tilde{B}}_{i}^{'} + \sum_{j = 1}^{q} {\tilde{A}}_{j} E_{t - j} {\tilde{A}}_{j}^{'} .$

Note that since $E_{t}$ follows a conditional singular Wishart distribution and the specification of $G_{t}$ is equivalent to that of $S_{t}$ , all results in Section (2.1) applies to the process ${E_{t}}$ as well, with regards to parameters ${\tilde{A}}_{j}$ and ${\tilde{B}}_{i}$ . Moreover, by applying the covariance targeting technique described in (18) with $E [E_{t}] = I_{m}$ we get(21) $\begin{matrix} G_{t} = (I_{m} - \sum_{i = 1}^{p} {\tilde{B}}_{i} {\tilde{B}}_{i}^{'} - \sum_{j = 1}^{q} {\tilde{A}}_{j} {\tilde{A}}_{j}^{'}) + \sum_{i = 1}^{p} {\tilde{B}}_{i} G_{t - i} {\tilde{B}}_{i}^{'} \\ + \sum_{j = 1}^{q} {\tilde{A}}_{j} E_{t - j} {\tilde{A}}_{j}^{'} . \end{matrix}$ (21)

The restrictions on the persistence parameters ${\tilde{A}}_{j}$ and ${\tilde{B}}_{i}$ needed to ensure the positive definiteness of $G_{t}$ are easily obtained for several parameterizations, as discussed below. In the following, the covariance targeting SCAW model described by (19), (20), and (21) will be referred to as SCAW_CT (p, q).

Furthermore, since $S_{t} = {\bar{S}}^{1 / 2} G_{t} ({\bar{S}}^{1 / 2})'$ and $E_{t} = {\bar{S}}^{- 1 / 2} R_{t} ({\bar{S}}^{- 1 / 2})'$ the model (21) for the standardized series ${E_{t}}$ implies the following equalities(22) $A_{j} = {\bar{S}}^{1 / 2} {\tilde{A}}_{j} ({\bar{S}}^{- 1 / 2})'$ (22) (23) $B_{i} = {\bar{S}}^{1 / 2} {\tilde{B}}_{i} ({\bar{S}}^{- 1 / 2})'$ (23) (24) $CC' = {\bar{S}}^{1 / 2} (I_{m} - \sum_{i = 1}^{p} {\tilde{B}}_{i} {\tilde{B}}_{i}^{'} - \sum_{j = 1}^{q} {\tilde{A}}_{j} {\tilde{A}}_{j}^{'}) ({\bar{S}}^{1 / 2})'$ (24) for the parameterization in (5), modeling the nonstandardized series ${R_{t}}$ .

Moreover, as discussed in Noureldin, Sheppard, and Shephard (Citation2014), there are several ways to parameterize the conditional mean, in this model described by (21): scalar and diagonal specification of ${\tilde{A}}_{j}$ and ${\tilde{B}}_{i}$ , as well as specifications with common persistence or with orthogonal parameter matrices. In this presentation we will focus on the scalar and the diagonal specification, such that ${\tilde{A}}_{j}$ and ${\tilde{B}}_{i}$ are all diagonal matrices, with the additional condition that the first element of each parameter matrix is positive, in order to ensure model identification. As such, the constant term in (21) will be positive-definite if and only if (see Engle and Kroner Citation1995)(25) $\sum_{j = 1}^{q} {\tilde{a}}_{j, l l}^{2} + \sum_{i = 1}^{p} {\tilde{b}}_{i, l l}^{2} < 1, l = 1, \dots, m,$ (25) where ${\tilde{a}}_{j, l l}$ is the l:th diagonal element of ${\tilde{A}}_{j}$ and ${\tilde{b}}_{i, l l}$ is the l:th diagonal element of ${\tilde{B}}_{i}$ . Conditions for the other specifications mentioned above can be obtained correspondingly. The diagonal parameterization of (21) results in $m (p + q)$ parameters, which is substantially lower than the $m (m + 1) / 2 + (p + q) m^{2}$ parameters in the original specification (5), particularly for large dimensional cases. In the example with m = 50 and $p = q = 2$ above, the diagonal model suggested thus results in 200 parameters, instead of the 11,275 parameters in the original specification, making estimation much more feasible.

Hence, instead of modeling the series ${R_{t}}$ with conditional means ${S_{t}}$ directly, the above approach instead models the standardized realized covariances ${E_{t}}$ with the conditional means ${G_{t}}$ . In turn, this model implies ${S_{t}}$ to be specified as (5) with parameters obtained as (22)–(24). Note that while ${\tilde{A}}_{j}$ and ${\tilde{B}}_{i}$ are diagonal, the implied parameters for ${S_{t}}, A_{j}$ and $B_{i}$ , are in general not, since the transformations (22)–(24) do not necessarily result in diagonal matrices. This does indeed suggest a rich dynamic for the original series of realized covariance matrices, as discussed in Noureldin, Sheppard, and Shephard (Citation2014) in the equivalent ARCH case. However, it does not mean that the specification ${S_{t}}$ results in an entirely general BEKK model, since its parameters are constrained by the unconditional mean $\bar{S}$ .

3.2 Sectorwise Parameterization

Prices for assets that belong to the same market sector tend to exhibit some level of similarity in price movements (see e.g., King Citation1966; Chan, Lakonishok, and Swaminathan Citation2007). To incorporate this feature, we introduce a model specification that assumes that covariance dynamics are homogeneous within market sectors. For this sectorwise parameterization, we define the diagonal elements of the parameter matrices ${\tilde{A}}_{j}$ and ${\tilde{B}}_{i}, j = 1, \dots, q, i = 1, \dots, p$ , in (21) as ${\tilde{a}}_{l l, j} = {\tilde{a}}_{k k, j}$ and ${\tilde{b}}_{l l, i} = {\tilde{b}}_{k k, i}$ if asset l and asset k belong to the same market sector. The number of parameters for this specification is as such $s (p + q)$ , where s denote the number of sectors that the considered assets belong to. Hence, the number of parameters for this approach is independent of process dimension m, which makes it an attractive modeling candidate when very large asset portfolios are considered.

We note that the sectorwise parameterization is applied to the model formulated for the standardized realized covariance matrices ${E_{t}}$ . Such a model specification is motivated by the observation that a high average correlation is present between the elements of $E_{t}$ and $R_{t}$ with the same indices (0.75 in the empirical illustration with m = 50 stocks and 0.56 in the empirical study with m = 300, both presented in Section 5), while the average correlation between the elements of two matrices with different indices as well as the average correlation between their absolute values are close to zero. As such, the sectorwise parameterization introduced in the model for ${E_{t}}$ roughly corresponds to the sectorwise parameterization in the case of the model for ${R_{t}}$ .

3.3 HAR Extension

To account for the high persistence in volatility processes, we also adapt the SCAW model with a heterogeneous autoregressive (HAR) extension, as proposed by Corsi (Citation2009) in the univariate case and implemented by Golosnoy, Gribisch, and Liesenfeld (Citation2012) in a matrix-variate version. Such an approach considers the long-memory dependence in daily volatility by including lagged realized covariances observed on longer horizons, like weekly and monthly. Consequently, for this specification, we define the conditional process mean $G_{t}$ as(26) $G_{t} = (I_{m} - \sum_{j = 1}^{q} {\tilde{A}}_{j} {\tilde{A}}_{j}^{'} - \tilde{D} \tilde{D}') + \sum_{j = 1}^{q} {\tilde{A}}_{j} E_{t - j} {\tilde{A}}_{j}^{'} + \tilde{D} E_{t - 1}^{(h)} \tilde{D}',$ (26) where $E_{t}^{(h)} = \sum_{j = q}^{h} E_{t - j}$ . Further, we define $\tilde{D}$ as a diagonal matrix with sectorwise parameterization as described above. As such, (26) can be specified in terms of (21), but we denote it with a separate parameter matrix $\tilde{D}$ for ease of interpretation.

4 Estimation

Similar to Noureldin, Sheppard, and Shephard (Citation2014), we apply a two-step estimation procedure in order to obtain the parameter estimates of the considered model (21). Given a sample of the realized covariance process, ${R_{t}}_{1 \leq t \leq T}$ , a method of moments approach is first used to estimate the unconditional mean of the process $\bar{S}$ , as(27) $\hat{\bar{S}} = \frac{1}{T} \sum_{t = 1}^{T} R_{t} .$ (27)

The estimate $\hat{\bar{S}}$ is then decomposed into estimates $\hat{P}$ and $\hat{Λ}$ . From these estimates, a standardized series is obtained in correspondence to (19) as $E_{t} = \hat{P} {\hat{Λ}}^{- 1 / 2} \hat{P}' R_{t} \hat{P} {\hat{Λ}}^{- 1 / 2} \hat{P}',$ consistent with the approach in Noureldin, Sheppard, and Shephard (Citation2014). In the second step, we estimate the diagonal parameter matrices ${\tilde{A}}_{j}$ and ${\tilde{B}}_{i}, i = 1, \dots, p, j = 1 \dots, q$ , in (21), by the maximum likelihood method. Similarly to Golosnoy, Gribisch, and Liesenfeld (Citation2012), in order to ensure the positivity of the first diagonal element in each of the parameter matrices, the square roots of these values are estimated. To enforce the condition (25), the diagonal elements of ${\tilde{A}}_{j}$ are specified according to the following function, for $l = 1, \dots, m$ ,(28) ${\tilde{a}}_{l l, j} = {\begin{matrix} a_{l l, j}^{*} & if s_{l} < 1 \\ \frac{a_{l l, j}^{*} (1 - ϵ)}{s_{l}} & if s_{l} \geq 1 \end{matrix}$ (28) (29) ${\tilde{b}}_{l l, i} = {\begin{matrix} b_{l l, i}^{*} & if s_{l} < 1 \\ \frac{b_{l l, i}^{*} (1 - ϵ)}{s_{l}} & if s_{l} \geq 1 \end{matrix},$ (29) where $s_{l} = \sum_{j = 1}^{q} {\tilde{a}}_{j, l l}^{2} + \sum_{i = 1}^{p} {\tilde{b}}_{i, l l}^{2}$ and ϵ is positive and close to zero. As such, we define the argument vector to the log-likelihood function as $ψ' = (ψ'_{a}, ψ'_{b})$ with $ψ'_{a} = (\sqrt{a_{11, 1}^{*}}, a_{22, 1}^{*}, \dots, a_{l l, q}^{*})$ and $ψ'_{b} = (\sqrt{b_{11, 1}^{*}}, b_{22, 1}^{*}, \dots, b_{l l, q}^{*})$ . Furthermore, since by (20), $E_{t}$ follows a singular Wishart distribution, the log-likelihood obtains directly from the density (3) as(30) $\begin{matrix} L (ψ) = \sum_{t = 1}^{T} [c + \frac{n}{2} ln | G_{t} | + \frac{n - m - 1}{2} ln | E_{11, t} | \\ - \frac{n - m - 1}{2} tr (G_{t}^{- 1} E_{t})], \end{matrix}$ (30) where(31) $c = \frac{n (n - m)}{2} ln (π) + (m - \frac{n}{2}) ln n - ln Γ_{p} (\frac{n}{2}) .$ (31)

Finding the vector ψ that maximizes the log-likelihood function (30) can then be done by applying numerical optimization techniques.

5 Empirical Application

5.1 Data and Estimation

The SCAW model presented in Section 2 is applied to analyze the daily realized covariance matrices of 50 assets traded at National Association of Securities Dealers Automated Quotations (NASDAQ) from mid 1997 to mid 2017, and 300 assets traded in the same market between mid 2007 to mid 2017. The assets have been classified with an associated market sector following the NASDAQ sector classification (as e.g., Litimi, BenSaïda, and Bouraoui Citation2016; BenSaïda Citation2017). Furthermore, the assets have been selected such that the sample sector distribution is proportional to the sector distribution of assets traded at NASDAQ for the considered time period. The realized covariance matrix of these assets, for trading day t, is constructed as $R_{t} = \sum_{i}^{n} x_{t, i} x_{t, i}^{'}$ , where $x_{t, i}$ is the $m \times 1$ return vector obtained for the i:th 10-minute interval of day t between 09:30 and 16:00. In turn, this results in n = 39 return vectors, such that the rank of the m × m matrix $R_{t}$ is 39, making it a singular matrix. The sample period for the time series with 50 assets starts 2nd of June 1997 and ends 15th of June 2017, resulting in about 20 years of data, and 4994 trading days. The time series with 300 assets starts 27th of June 2007 and ends 15th of June 2017, resulting in about 10 years of data, and 2498 trading days. As such, the considered series covers two exceptionally volatile time periods: the so-called Dot-com bubble, which had its peak around the year 2000, and the global financial crisis of 2007–2008.

and summarize statistics for the realized variance in each of the 12 market sectors in the two samples. According to these statistics, the energy sector experiences the largest average variance, while assets in the financial sector are the most right skewed and leptokurtic.

Table 1 Summary statistics for the realized variance (multiplied by 10⁴) of 50 assets in each of the 12 market sectors considered.

Download CSV Display Table

Table 2 Summary statistics for the realized variance (multiplied by 10⁴) of 300 assets in each of the 12 market sectors considered.

Download CSV Display Table

In the medium dimension case of m = 50, the models discussed in Section 5.2 have been estimated with a rolling window approach. Every 20 trading days (corresponding to roughly one month) in the time series sample, the models are re-estimated using the last 500 trading days (roughly two years). These estimates are then used to make 1-step ahead forecasts for the coming 20 trading days.

Regarding the large dimension case of m = 300, the models are instead estimated with a fixed window approach. The parameters of the models are estimated on the first 80% (1998 trading days) of the time series sample, while the forecast accuracy of the models are evaluated on the last 20% (500 trading days) of the sample.

The parameters of the considered models are estimated as described in Section 4, where $ϵ = 10^{- 7}$ is used in EquationEquations (28)(28) ${\tilde{a}}_{l l, j} = {\begin{matrix} a_{l l, j}^{*} & if s_{l} < 1 \\ \frac{a_{l l, j}^{*} (1 - ϵ)}{s_{l}} & if s_{l} \geq 1 \end{matrix}$ (28) and Equation(29)(29) ${\tilde{b}}_{l l, i} = {\begin{matrix} b_{l l, i}^{*} & if s_{l} < 1 \\ \frac{b_{l l, i}^{*} (1 - ϵ)}{s_{l}} & if s_{l} \geq 1 \end{matrix},$ (29) . Such a value of ϵ has only a minor impact on the resulting estimators, since only 0.8% of s_l values across all models are larger than 1.

5.2 Models

To study these data using the suggested SCAW model, the estimation and forecasting are performed using the various model specifications discussed in Section 3. The following SCAW-parameterizations are considered:

SCAW_CT (p, q): Parameter matrices ${\tilde{A}}_{j}$ and ${\tilde{B}}_{i}, j = 1, \dots, q, i = 1, \dots, p$ of (21) are diagonal.
SCAW-SCALAR(p, q): Parameter matrices ${\tilde{A}}_{j}$ and ${\tilde{B}}_{i}, j = 1, \dots, q, i = 1, \dots, p$ of (21) are proportional to the identity matrix.
SCAW-SS_CT (p, q): Parameter matrices ${\tilde{A}}_{j}$ and ${\tilde{B}}_{i}, j = 1, \dots, q, i = 1, \dots, p$ of (21) are diagonal. Further, ${\tilde{a}}_{l l, j} = {\tilde{a}}_{k k, j}$ and ${\tilde{b}}_{l l, i} = {\tilde{b}}_{k k, i}$ if asset l and asset k belongs to the same sector.
SCAW-SS-HAR_CT (q, h): Parameter matrices ${\tilde{A}}_{j}$ and $\tilde{D}$ of (26) are diagonal. Further, ${\tilde{a}}_{l l, j} = {\tilde{a}}_{k k, j}$ and ${\tilde{d}}_{l l} = {\tilde{d}}_{k k}$ if asset l and asset k belongs to the same sector.
SCAW-SCALAR-HAR(q, h): Parameter matrices ${\tilde{A}}_{j}$ and $\tilde{D}$ of (26) are proportional to the identity matrix.

Similarly to Golosnoy, Gribisch, and Liesenfeld (Citation2012), Multivariate GARCH models fitted to daily return data are used as forecast accuracy benchmarks to the parameterizations described above. To have comparable results, the MGARCH benchmark models follow equivalent specifications and are consequently denoted MGARCH_CT (p, q), MGARCH-SCALAR(p, q), MGARCH-SS_CT (p, q), MGARCH-SS-HAR_CT (q, h), and MGARCH-SCALAR-HAR(q, h). As discussed in Section 2, the Multivariate GARCH model with BEKK-specification can be thought of as a special case of the SCAW model, with the number of intra-day returns (and matrix rank) n = 1. Apart from MGARCH, two additional models applicable to singular realized covariance matrices, the DCC-S and DCC-NL models discussed in Engle, Ledoit, and Wolf (Citation2019), are included as benchmarks.

5.3 Forecasting

The models discussed in Section 5.2 are evaluated by out-of-sample forecast accuracy. As described in Section 5.1, for m = 50 the considered models are evaluated with a rolling window approach throughout the sample, while for the m = 300 sample, a fixed window technique is applied. For each of the models, the $l$ -step-ahead forecast is computed recursively as(32) $E [R_{t + l} | F_{t}] = E [S_{t + l} | F_{t}] = P Λ^{1 / 2} P' E [G_{t + l} | F_{t}] P Λ^{1 / 2} P', with$ (32) (33) $\begin{matrix} E [G_{t + l} | F_{t}] = (I_{m} - \sum_{i = 1}^{p} {\tilde{B}}_{i} {\tilde{B}}_{i}^{'} - \sum_{j = 1}^{q} {\tilde{A}}_{j} {\tilde{A}}_{j}^{'}) + \\ + \sum_{i = 1}^{p} {\tilde{B}}_{i} E [G_{t + l - i} | F_{t}] {\tilde{B}}_{i}^{'} \\ + \sum_{j = 1}^{q} {\tilde{A}}_{j} E [E_{t + l - j} | F_{t}] {\tilde{A}}_{j}^{'}, \end{matrix}$ (33) (34) $E [E_{t + l - j} | F_{t}] = E [G_{t + l - j} | F_{t}],$ (34) where the parameter matrices are estimated as described in Section 4. The specifications SCAW-SS-HAR_CT (q, h) and SCAW-SCALAR-HAR(q, h) are computed similarly, employing that they can be represented in the form of (21).

The forecast accuracy of ${\hat{R}}_{t + l} = E [R_{t + l} | F_{t}]$ is evaluated using several measures. First, the average Frobenius norm of the $l$ -step-ahead forecast error is computed as(35) ${FN}_{l} = \frac{1}{T_{l}} \sum_{t} | | {\hat{R}}_{t + l} - R_{t + l} | |,$ (35) where $T_{l}$ is the sample-size for $l$ -step-ahead forecasts and $| | M | |$ denote the Frobenius norm of the matrix M. Further, in practice, one is often interested in applying covariance matrix forecasts in a portfolio setting. As such, we also compute mean squared error of the standard deviation of an equally weighted (EW) portfolio, a popular portfolio in financial literature (see DeMiguel, Garlappi, and Uppal Citation2009), using the obtained covariance forecast and the realized covariance: ${SD}_{EW, l} = \frac{1}{T_{l}} \sum_{t} \frac{1}{m^{2}} {(\sqrt{1' {\hat{R}}_{t + l} 1} - \sqrt{1' R_{t + l} 1})}^{2} .$

Another important portfolio is the global minimum variance (GMV) portfolio (see Frahm and Memmel Citation2010; Glombeck Citation2014; Bodnar, Parolya, and Schmid Citation2018; Bodnar et al. Citation2019; Ding, Li, and Zheng Citation2021). This portfolio has the lowest risk of all possible portfolios of risky assets, and its weight vector is solely determined by the covariance matrix of asset returns. We employ the l-step ahead forecast of the covariance matrix ${\hat{R}}_{t + l}$ produced by each model for the computation of the weights of the GMV portfolio expressed as(36) ${\hat{w}}_{t + l} = {\hat{R}}_{t + l}^{- 1} 1 / (1' {\hat{R}}_{t + l}^{- 1} 1) .$ (36)

As a performance measure of the constructed portfolio for different models, we use the standard deviation of the GMV portfolio of future time periods given by ${SD}_{GMV, l} = \frac{1}{T_{l}} \sum_{t} \sqrt{{\hat{w}}_{t + l}^{'} R_{t + l} {\hat{w}}_{t + l}} .$

The quantity ${SD}_{GMV, l}$ measures the average forecasted standard deviation of the GMV portfolios in the out-of-sample period. Hence, ${SD}_{EW, l}$ and ${SD}_{GMV, l}$ illustrate the ability of each model to forecast different things. The former is a measure of the squared difference between the predicted standard deviation of the equally weighted portfolio, and the observed standard deviation of the equally weighted portfolio. The latter measures the predicted standard deviation of the GMV portfolio. For this measure, more accurate predictions will result in lower values, and the minimum value is obtained when inserting ${\hat{R}}_{t + l} = R_{t + l}$ into (36). An accurate prediction of this quantity has considerable economic value, since the standard deviation of the GMV portfolio is a key input value in many financial applications.

In addition, to further asses the properties of the portfolios constructed using the GMV weights predicted by the models, ${\hat{w}}_{t + l}$ , we measure the empirical variance, turnover and leverage they produce. Hence, we introduce the following measures $\begin{matrix} {EGMV}_{var, l} = \frac{1}{T_{l}} \sum_{t} {({\hat{w}}_{t + l} r_{t + l} - (\sum_{t} {\hat{w}}_{t + l} r_{t + l}) / T_{l})}^{2}, \\ {EGMV}_{tu, l} = \frac{1}{T_{l}} \sum_{t} ‖ {\hat{w}}_{t + l} - {\hat{w}}_{t + l - 1} ‖, \\ {EGMV}_{lev, l} = \frac{1}{T_{l}} \sum_{t} \sum_{i = 1}^{m} | {\hat{w}}_{i, t + l} |, \end{matrix}$ where $r_{t}$ is the observed return vector at time t and ${\hat{w}}_{i, t}$ is the ith element of ${\hat{w}}_{t}$ .

Finally, it is relevant to see if the difference in a forecast accuracy measure between the SCAW model and its benchmarks is statistically significant. To this end, a two-sided paired t-test is applied to the sample of terms in ${FN}_{l}, {SD}_{EW, l}, {SD}_{GMV, l}, {EGMV}_{tu, l}$ or ${EGMV}_{lev, l}$ for the SCAW model, and to the sample of terms for the same measure in the equivalent MGARCH specification, the DCC-S model, and the DCC-NL model, respectively. Similarly, the measure ${EGMV}_{var, l}$ is evaluated using the ${HAC}_{PW}$ method described in Ledoit and Wolf (Citation2011). Significance level 0.05 is used for all the applied tests. Beside the two-sided paired t-test and ${HAC}_{PW}$ method used to compare the SCAW models and its benchmark, we also construct the 90% Hansen’s model confidence set (Hansen, Lunde, and Nason Citation2011) with respect to the several forecast measures, namely, ${FN}_{l}, {SD}_{EW, l}$ , and ${SD}_{GMV, l}$ .

5.4 Results

and –S.3 in the supplementary materials summarize the forecasts performance of the models discussed in Section 5.2, with rolling window for forecast horizon $l = 1$ and with fixed window for forecast horizons $l = 1, 5, 10$ . For each measure, the most favorable value in each column is highlighted bold.

Table 3 Summary of the rolling window forecasts for each respective model for 20 years of data (mid 1997 to mid 2017) on 50 assets proportionally distributed among the 12 sectors in NASDAQ classification, evaluated on a monthly basis.

Display Table

To have a robust forecast evaluation we run a rolling window estimation (see e.g., de Brito, Medeiros, and Ribeiro Citation2018; Archakov, Hansen, and Lunde Citation2020; De Nard et al. Citation2020), available in , for each respective model evaluated on a monthly basis based on 50 assets. The forecasts are then evaluated toward the observed realized covariances with the following measures: Frobenius norm ( ${FN}_{l}$ ), discrepancy in the standard deviation of the equally weighted portfolio ( ${SD}_{EW, l}$ ), forecasted standard deviation of the GMV portfolio ( ${SD}_{GMV, l}$ ), variance of the empirical return computed from the forecasted weights of the GMV portfolio ( ${EGMV}_{var, l}$ ), the turnover ( ${EGMV}_{tu, l}$ ), and the leverage ( ${EGMV}_{lev, l}$ ). For ${SD}_{EW, l}, {SD}_{GMV, l}$ , and ${EGMV}_{var, l}$ the annualized values are provided. Each SCAW model’s forecast is compared with the forecast of the equivalent MGARCH model, the DCC-S model and the DCC-NL model. As can be seen, with respect to each measure, proposed models, such as, SCAW-SCALAR(2,2) and SCAW-SS_CT (1,2) outperform the others, where the most favorable value in each column is emphasized in bold. Overall, the MGARCH models tend to be superior in terms of ${EGMV}_{tu, l}$ . These models tend to have smaller $A_{j}$ parameters and, hence, their predictions tend to vary less with data. Thus, it is not surprising the turnover is also lower. For this particular case, the SCAW models tend to have a lower ${EGMV}_{lev, l}$ and outperformed the other benchmark models with respect to this criteria.

Models in the upper panel are also compared with the equivalent MGARCH model, the DCC-S model and the DCC-NL model through a pairwise forecast test. For brevity, we report these result in the same table with the labels M, S, and N, where each of the letters indicates statistical significance at 5% or less in favor of the SCAW model when compared with MGARCH, DCC-S and DCC-NL, respectively. The table also presents reward to risk through the Sharpe ratio, presented in the last column. Here again, our proposed models appear to outperform the benchmarks

Next, we evaluate the suggested models forecast ability on high-dimensional data and compare it with the results obtained for the benchmark models. –S.3 in the supplementary materials depict fixed window forecasts for each respective model for 10 years of data (mid 2007 to mid 2017) on 300 assets proportionally distributed among the 12 sectors in NASDAQ classification and evaluated on a monthly basis. In this application, models with parameterizations adapted to large portfolio sizes are included. For the evaluation, the model parameters are estimated on first 80% of the observations, and evaluated on the last 20% observations, at different forecasting horizons, namely $l = 1, 5$ , and 10, toward the observed realized covariance with the following performance measures: Frobenius norm ( ${FN}_{l}$ ), discrepancy in the standard deviation of the equally weighted portfolio ( ${SD}_{EW, l}$ ), forecasted variance of the GMV portfolio ( ${SD}_{GMV, l}$ ), variance of the empirical return computed from the forecasted weights of the GMV portfolio ( ${EGMV}_{var, l}$ ), the turnover ( ${EGMV}_{tu, l}$ ), and the leverage ( ${EGMV}_{lev, l}$ ). For ${SD}_{EW, l}, {SD}_{GMV, l}$ , and ${EGMV}_{var, l}$ the annualized values are presented. Each SCAW model’s forecast is compared to the forecast of the equivalent MGARCH model, the DCC-S model and the DCC-NL model. As can be seen, with respect to each performance measure, results at different forecast horizons suggest that models from the group SCAW-SS_CT and SCAW-SCALAR outperform the other competitors, where the most favorable value in each column is emphasized bold.

In the case of a high-dimensional portfolio, the MGARCH models tend again to be superior in terms of the turnover. The DCC-NL model possesses the lowest ${EGMV}_{lev, l}$ . Finally, the SCAW models are also compared to the equivalent MGARCH model, the DCC-S model and the DCC-NL model by using a pairwise forecast test. The labels M, S, and N have the same meaning as in . We observe that the proposed SCAW models statistically outperform the benchmark approaches in almost all of the considered cases. To this end, the table also present reward to risk defined by the Sharpe ratio in the last column. Here, the proposed models outperform the benchmarks at the shorter horizon, while the DCC-NL is found to beat the other models at the longer forecast horizons $(l = 5, 10)$ .

To further assess the strength of proposed models, we consider model confidence set of Hansen, Lunde, and Nason (Citation2011) among the different methodologies for the rolling and the fixed forecasts at different forecast horizons ( $l = 1, 5, 10)$ . provides ranking among models for the MCS approach based on three different loss functions, namely ${FN}_{l}, {SD}_{EW, l}$ , and ${SD}_{GMV, l}$ , in the case of data consisting of 50 stocks. The notation ‘-’ indicates the absence of the model in the corresponding confidence set with respect to the considered loss function, while the numbers are the ranking of the models, which are included in the 90% confidence set. We observe that that none of the benchmark models managed to outperform the suggested SCAW models.

Table 4 Ranking among models computed by using the MCS test (see Hansen, Lunde, and Nason Citation2011) with three loss functions ( ${FN}_{l}, {SD}_{EW, l}$ , and ${SD}_{GMV, l}$ , respectively) computed for rolling window forecasting in the case of data consisting of 50 stocks.

Display Table

summarizes ranking among models obtained by the fixed window forecasting at horizons $l = 1, 5, 10$ in the case of the high-dimensional data consisting of 300 stocks. For each horizon the ranking obtained via the same three loss functions ${FN}_{l}, {SD}_{EW, l}$ , and ${SD}_{GMV, l}$ as presented and separated by the slash symbol in respective order. Also, the notation ‘-’ states that the model does not belong to the confidence set, while the numbers specifies the ranks of the model within the confidence set. For a shorter horizon, quite a few benchmark models lie in the confidence set, which are outperformed by the SCAW models showing higher ranks. For longer forecasting horizons $l = 5, 10$ , none of the benchmark models appeared in the confidence set and the proposed SCAW-type models clearly show better performance for all considered loss functions.

Table 5 Ranking among models computed by using the MCS test (see Hansen, Lunde, and Nason Citation2011) with three loss functions ( ${FN}_{l}$ / ${SD}_{EW, l}$ / ${SD}_{GMV, l}$ ) computed for fixed window estimation with forecasting horizon $l = 1, 5, 10$ in the case of data consisting of 300 stocks.

Display Table

In general, it is noteworthy that any specification of SCAW-SCALAR model turns out to be best with respect to ${SD}_{EW, l}, {SD}_{GMV, l}$ , and ${EGMV}_{var, l}$ measures at different forecasting setup (rolling or fixed), while the SCAW-SS models are found to be best w.r.t. to the ${FN}_{l}$ measure. This implies that different SCAW models may be suitable, depending on the application at hand. Finally, it is notable that the specifications SCAW-SS_CT (1, 2) and $SCAW - SCALAR (2, 2)$ most of the times outperform both SCAW_CT (0, 1) and SCAW_CT (1, 1) in terms of ${FN}_{l}, {SD}_{EW, l}, {SD}_{GMV, l}$ , and ${EGMV}_{var, l}$ for portfolio sizes m = 50 and m = 300, despite using a lower number of parameters (36 and 4 vs. 50 and 100, respectively). This suggests that the sectorwise and scalar approaches introduced in Section 3.2 indeed can be useful, especially in the high-dimensional case: they reduce the number of model parameters and outperform the diagonally parameterized model in most of the cases.

To summarize, the SCAW approach performs very well comparing to the considered benchmark approaches in terms of out-of-sample forecast accuracy for the time period studied. Among the various SCAW-specifications, the sectorwise parameterization, SCAW-SS_CT (p, q), the sectorwise parameterization with an HAR-extension, SCAW-SS-HAR_CT (q, h) and the scalar specification, SCAW-SCALAR(p, q), display the most favorable results. The performance of these specifications are important, since the number of parameters in these approaches are independent of the number of assets m. As such, these parameterizations are likely to be a feasible when very large asset portfolios are considered.

6 Conclusion

In this article, we present the Singular Conditional Autoregressive Wishart (SCAW) model to capture the temporal dynamics for time series of singular realized covariance matrices. The model employs a BEKK-type specification, thus, ensuring positive definitiveness, and allowing for straight forward estimation through the maximum likelihood method. Since the case of singular realized covariance is closely related to large portfolio dimensions, we also introduce methods to maintain parsimony in large dimensions. First, a covariance targeting approach adapted to the matrix case is presented. Second, we propose a sectorwise specification, using asset homogeneity within market sectors. As an additional extension, the well-established HAR-approach is adapted to our model. These approaches results in a model well adapted for large or extremely large portfolio sizes.

The SCAW model is estimated to 50 stocks in a time series covering 20 years and to 300 stocks covering 10 years, and evaluated out-of-sample with Multivariate GARCH models of similar specifications and the DCC-type model recently suggested in the literature to model large-dimensional dynamic covariance matrices. This study reveals that the SCAW models outperform the benchmark models in the vast majority of the forecast accuracy measures, with high statistical significance. Furthermore, it suggests that the SCAW-SCALAR(p, q) specifications, where parameter matrices are proportional to the identity matrix, the sectorwise specification and its HAR-extension show great promise, greatly improving the out-of-sample performances in relation to the baseline fully parameterized SCAW model, while using only a fraction of parameters. This empirical finding becomes very important by noting that the number of parameters in the parsimonious specifications does not depend on the portfolio size and thus can provide a very useful alternative to the general specification of the SCAW model, especially in the high-dimensional case where the portfolio size m is large.

Future venues of research include extension of the SCAW model by for example the MIDAS-extension employed in Golosnoy, Gribisch, and Liesenfeld (Citation2012), or an adaptation including the leverage effect in the spirit of Anatolyev and Kobotaev (Citation2018).

Supplemental material

Supplemental Material

Download PDF (324.3 KB)

Acknowledgments

The authors thank Professor Christian Hansen, the Associate Editor, and two anonymous Reviewers for their comments and suggestions which have improved the presentation of the article. We gratefully acknowledge the comments and the discussion from the participants at the Workshop on Financial Econometrics 2019 (Örebro University School of Business) and at the International Conference on Computational and Financial Econometrics 2019 (University of London).

Supplementary Materials

The supplementary materials contain proofs of the propositions in Section 2.1, as well as additional Tables with forecast results to complement the results in Section 5.4

Additional information

Funding

Taras Bodnar was partially supported by the Swedish Research Council (VR) via the project Bayesian Analysis of Optimal Portfolios and Their Risk Measures. The computations and data storage were enabled by resources provided by the Swedish National Infrastructure for Computing (SNIC) at HPC2N partially funded by the Swedish Research Council through grant agreement no. 2018-05973. Farrukh Javed acknowledges financial support from the project “Models for macro and financial economics after the financial crisis” (Dnr: P18-0201) funded by the Jan Wallander and Tom Hedelius. Foundation.

Notes

1 The matrices

D_{m}

and

L_{m}

are defined as the matrices which satisfy the following equalities

vec (A) = D_{m} vech (A)

and

vech (A) = L_{m} vec (A)

for a symmetric matrix A, respectively (see, e.g., Harville Citation1997).

2 It is defined by the following equality $K_{m m} vec (A) = vec (A')$ for any m × m matrix A (see Harville Citation1997)

References

Alfelt, G., Bodnar, T., and Tyrcha, J. (2020), “Goodness-of-Fit Tests for Centralized Wishart Processes,” Communications in Statistics – Theory and Methods, 49, 5060–5090.
Web of Science ®Google Scholar
Anatolyev, S., and Kobotaev, N. (2018), “Modeling and Forecasting Realized Covariance Matrices with Accounting for Leverage,” Econometric Reviews, 37, 114–139.
Web of Science ®Google Scholar
Andersen, T. G., Bollerslev, T., Diebold, F. X., and Labys, P. (2003), “Modeling and Forecasting Realized Volatility,” Econometrica, 71, 579–625. DOI: 10.1111/1468-0262.00418.
Web of Science ®Google Scholar
Ao, M., Yingying, L., and Zheng, X. (2019), “Approaching Mean-Variance Efficiency for Large Portfolios,” The Review of Financial Studies, 32, 2890–2919. DOI: 10.1093/rfs/hhy105.
Web of Science ®Google Scholar
Archakov, I., Hansen, P. R., and Lunde, A. (2020), “A Multivariate Realized GARCH Model,” available at https://sites.google.com/site/peterreinhardhansen/research-papers/amultivariaterealizedgarchmodel
Google Scholar
Asai, M., McAleer, M., and Yu, J. (2006), “Multivariate Stochastic Volatility: A Review,” Econometric Reviews, 25, 145–175. DOI: 10.1080/07474930600713564.
Web of Science ®Google Scholar
Aït-Sahalia, Y., Mykland, P. A., and Zhang, L. (2011), “Ultra High Frequency Volatility Estimation with Dependent Microstructure Noise,” Journal of Econometrics, 160, 160–175. DOI: 10.1016/j.jeconom.2010.03.028.
Web of Science ®Google Scholar
Aït-Sahalia, Y., and Yu, J. (2009), “High Frequency Market Microstructure Noise Estimates and Liquidity Measures,” Annals of Applied Statistics, 3, 422–457.
Web of Science ®Google Scholar
Bandi, F. M., Kolokolov, A., Pirino, D., and Renò, R. (2020a), “Realized Moments: Identification and Pricing,” working paper.
Google Scholar
Bandi, F. M., Kolokolov, A., Pirino, D., and Renò, R. (2020b), “Zeros,” Management Science, 66, 3466–3479.
Web of Science ®Google Scholar
Bandi, F. M., Pirino, D., and Renò, R. (2017), “Excess Idle Time,” Econometrica, 85, 1793–1846. DOI: 10.3982/ECTA13595.
Web of Science ®Google Scholar
Bandi, F. M., and Russell, J. R. (2006), “Separating Microstructure Noise from Volatility,” Journal of Financial Economics, 79, 655–692. DOI: 10.1016/j.jfineco.2005.01.005.
Web of Science ®Google Scholar
Bandi, F. M., and Russell, J. R. (2008), “Microstructure Noise, Realized Variance, and Optimal Sampling,” The Review of Economic Studies, 75, 339–369.
Web of Science ®Google Scholar
Barndorff-Nielsen, O. E., and Shephard, N. (2004), “Econometric Analysis of Realized Covariation: High Frequency Based Covariance, Regression, and Correlation in Financial Economics,” Econometrica, 72, 885–925.
Web of Science ®Google Scholar
Bauwens, L., Laurent, S., and Rombouts, J. V. (2006), “Multivariate GARCH Models: A Survey,” Journal of Applied Econometrics, 21, 79–109. DOI: 10.1002/jae.842.
Web of Science ®Google Scholar
BenSaïda, A. (2017), “Herding Effect on Idiosyncratic Volatility in U.S. Industries,” Finance Research Letters, 23, 121–132.
Web of Science ®Google Scholar
Bibinger, M., Hautsch, N., Malec, P., and Reiß, M. (2014), “Estimating the Quadratic Covariation Matrix from Noisy Observations: Local Method of Moments and Efficiency,” The Annals of Statistics, 42, 1312–1346.
Web of Science ®Google Scholar
Bibinger, M., Hautsch, N., Malec, P., and Reiss, M. (2019), “Estimating the Spot Covariation of Asset Prices-Statistical Theory and Empirical Evidence,” Journal of Business & Economic Statistics, 37, 419–435.
Web of Science ®Google Scholar
Bodnar, O., Bodnar, T., and Parolya, N. (2022), “Recent Advances in Shrinkage-Based High-dimensional Inference,” Journal of Multivariate Analysis, 188, 104826. DOI: 10.1016/j.jmva.2021.104826.
Web of Science ®Google Scholar
Bodnar, T., Dmytriv, S., Okhrin, Y., Parolya, N., and Schmid, W. (2021), “Statistical Inference for the Expected Utility Portfolio in High Dimensions,” IEEE Transactions on Signal Processing, 69, 1–14. DOI: 10.1109/TSP.2020.3037369.
Web of Science ®Google Scholar
Bodnar, T., Dmytriv, S., Parolya, N., and Schmid, W. (2019), “Tests for the Weights of the Global Minimum Variance Portfolio in a High-dimensional Setting,” IEEE Transactions on Signal Processing, 67, 4479–4493.
Web of Science ®Google Scholar
Bodnar, T., Mazur, S., and Okhrin, Y. (2014), “Distribution of the Product of Singular Wishart Matrix and Normal Vector,” Theory of Probability and Mathematical Statistics, 91, 1–15.
Google Scholar
Bodnar, T., and Okhrin, Y. (2008), “Properties of the Singular, Inverse and Generalized Inverse Partitioned Wishart Distributions,” Journal of Multivariate Analysis, 99, 2389–2405.
Web of Science ®Google Scholar
Bodnar, T., Okhrin, Y., and Parolya, N. (2022), “Optimal Shrinkage-based Portfolio Selection in High Dimensions,” Journal of Business & Economic Statistics, to appear, DOI: 10.1080/07350015.2021.2004897.
Google Scholar
Bodnar, T., Parolya, N., and Schmid, W. (2018), “Estimation of the Global Minimum Variance Portfolio in High Dimensions,” European Journal of Operational Research, 266, 371–390. DOI: 10.1016/j.ejor.2017.09.028.
Web of Science ®Google Scholar
Buccheri, G., Bormetti, G., Corsi, F., and Lillo, F. (2021), “A Score-driven Conditional Correlation Model for Noisy and Asynchronous Data: An Application to High-Frequency Covariance Dynamics,” Journal of Business & Economic Statistics, 39, 920–936.
Web of Science ®Google Scholar
Cai, T. T., Hu, J., Li, Y., and Zheng, X. (2020), “High-dimensional Minimum Variance Portfolio Estimation based on High-frequency Data,” Journal of Econometrics, 214, 482–494.
Web of Science ®Google Scholar
Catania, L., Mari, R. D., and de Magistris, P. S. (2020), “Dynamic Discrete Mixtures for High-Frequency Prices,” Journal of Business & Economic Statistics, 40, 559–577.
Web of Science ®Google Scholar
Chan, L. K., Lakonishok, J., and Swaminathan, B. (2007), “Industry Classifications and Return Comovement,” Financial Analysts Journal, 63, 56–70. DOI: 10.2469/faj.v63.n6.4927.
Web of Science ®Google Scholar
Christensen, K., Oomen, R. C., and Podolskij, M. (2014), “Fact or Friction: Jumps at Ultra High Frequency,” Journal of Financial Economics, 114, 576–599. DOI: 10.1016/j.jfineco.2014.07.007.
Web of Science ®Google Scholar
Corsi, F. (2009), “A Simple Approximate Long-Memory Model of Realized Volatility,” Journal of Financial Econometrics, 7, 174–196.
Web of Science ®Google Scholar
Corsi, F., Peluso, S., and Audrino, F. (2015), “Missing in Asynchronicity: A Kalman-em Approach for Multivariate Realized Covariance Estimation,” Journal of Applied Econometrics, 30, 377–397.
Web of Science ®Google Scholar
de Brito, D., Medeiros, M., and Ribeiro, R. (2018), “Forecasting Large Realized Covariance Matrices: The Benefits of Factor Models and Shrinkage,” working paper.
Google Scholar
De Nard, G., Engle, R., Ledoit, O., and Wolf, M. (2020), “Large Dynamic Covariance Matrices: Enhancements based on Intraday Data,” working paper.
Google Scholar
DeMiguel, V., Garlappi, L., and Uppal, R. (2009), “Optimal Versus Naive Diversification: How Inefficient is the 1/n Portfolio Strategy?” The Review of Financial Studies, 22, 1915–1953.
Web of Science ®Google Scholar
Ding, Y., Li, Y., and Zheng, X. (2021), “High Dimensional Minimum Variance Portfolio Estimation Under Statistical Factor Models,” Journal of Econometrics, 222, 502–515.
Web of Science ®Google Scholar
Engle, R. F. (2000), “The Econometrics of Ultra-High-Frequency Data,” Econometrica, 68, 1–22.
Web of Science ®Google Scholar
Engle, R. F. (2002), “Dynamic Conditional Correlation: A Simple Class of Multivariate Generalized Autoregressive Conditional Heteroskedasticity Models,” Journal of Business & Economic Statistics, 20, 339–350.
Web of Science ®Google Scholar
Engle, R., and Mezrich, J. (1996), “GARCH for Groups,” Risk, 9, 36–40.
Google Scholar
Engle, R. F., and Kroner, K. F. (1995), “Multivariate Simultaneous Generalized ARCH,” Econometric Theory, 11, 122–150. DOI: 10.1017/S0266466600009063.
Web of Science ®Google Scholar
Engle, R. F., Ledoit, O., and Wolf, M. (2019), “Large Dynamic Covariance Matrices,” Journal of Business & Economic Statistics, 37, 363–375.
Web of Science ®Google Scholar
Frahm, G., and Memmel, C. (2010), “Dominating Estimators for Minimum-Variance Portfolios,” Journal of Econometrics, 159, 289–302. DOI: 10.1016/j.jeconom.2010.07.007.
Web of Science ®Google Scholar
Glombeck, K. (2014), “Statistical Inference for High-dimensional Global Minimum Variance Portfolios,” Scandinavian Journal of Statistics, 41, 845–865.
Web of Science ®Google Scholar
Golosnoy, V., Gribisch, B., and Liesenfeld, R. (2012), “The Conditional Autoregressive Wishart Model for Multivariate Stock Market Volatility,” Journal of Econometrics, 167, 211–223.
Web of Science ®Google Scholar
Gouriéroux, C., Jasiak, J., and Sufana, R. (2009), The Wishart Autoregressive Process of Multivariate Stochastic Volatility,” Journal of Econometrics, 150, 167–181.
Web of Science ®Google Scholar
Gupta, A. K., and Nagar, D. K. (2000), Matrix Variate Distributions, Boca Raton, FL: CRC Press.
Google Scholar
Hansen, P. R., Lunde, A., and Nason, J. M. (2011), “The Model Confidence Set,” Econometrica, 79, 453–497.
Web of Science ®Google Scholar
Harville, D. (1997), Matrix Algebra from Statistician’s Perspective, New York: Springer.
Google Scholar
Hautsch, N., Kyj, L. M., and Malec, P. (2015), “Do High-Frequency Data Improve High-Dimensional Portfolio Allocations?” Journal of Applied Econometrics, 30, 263–290. DOI: 10.1002/jae.2361.
Web of Science ®Google Scholar
Jacod, J., and Podolskij, M. (2013), “A Test for the Rank of the Volatility Process: The Random Perturbation Approach,” The Annals of Statistics, 41, 2391–2427.
Web of Science ®Google Scholar
Jin, X., and Maheu, J. M. (2012), “Modeling Realized Covariances and Returns,” Journal of Financial Econometrics, 11, 335–369. DOI: 10.1093/jjfinec/nbs022.
Web of Science ®Google Scholar
King, B. F. (1966), “Market and Industry Factors in Stock Price Behavior,” The Journal of Business, 39, 139–190. DOI: 10.1086/294847.
Web of Science ®Google Scholar
Ledoit, O., and Wolf, M. (2011), Robust Performances Hypothesis Testing With the Variance, Wilmott, 2011, 86–89.
Google Scholar
Ledoit, O., and Wolf, M. (2017), “Nonlinear Shrinkage of the Covariance Matrix for Portfolio Selection: Markowitz Meets Goldilocks,” The Review of Financial Studies, 30, 4349–4388.
Web of Science ®Google Scholar
Ledoit, O., and Wolf, M. (2020), “The Power of (non-)linear Shrinking: A Review and Guide to Covariance Matrix Estimation,” working paper.
Google Scholar
Litimi, H., BenSaïda, A., and Bouraoui, O. (2016), Herding and Excessive Risk in the American Stock Market: A Sectoral Analysis,” Research in International Business and Finance, 38, 6–21. DOI: 10.1016/j.ribaf.2016.03.008.
Web of Science ®Google Scholar
Lütkepohl, H. (2005), The New Introduction to Multiple Time Series Analysis, Berlin: Springer.
Google Scholar
Noureldin, D., Shephard, N., and Sheppard, K. (2012), Multivariate High-Frequency-based Volatility (HEAVY) Models,” Journal of Applied Econometrics, 27, 907–933. DOI: 10.1002/jae.1260.
Web of Science ®Google Scholar
Noureldin, D., Sheppard, K., and Shephard, N. (2014), Multivariate Rotated ARCH Models,” Journal of Econometrics, 179, 16–30. DOI: 10.1016/j.jeconom.2013.10.003.
Web of Science ®Google Scholar
Pedersen, R. S., and Rahbek, A. (2014), “Multivariate Variance Targeting in the BEKK-GARCH Model,” The Econometrics Journal, 17, 24–55. DOI: 10.1111/ectj.12019.
Web of Science ®Google Scholar
Rubio, F., Mestre, X., and Palomar, D. P. (2012), “Performance Analysis and Optimal Selection of Large Minimum Variance Portfolios Under Estimation Risk,” IEEE Journal of Selected Topics in Signal Processing, 6, 337–350. DOI: 10.1109/JSTSP.2012.2202634.
Web of Science ®Google Scholar
Shephard, N., and Xiu, D. (2017), “Econometric Analysis of Multivariate Realised qml: Estimation of the Covariation of Equity Prices Under Asynchronous Trading,” Journal of Econometrics, 201, 19–42.
Web of Science ®Google Scholar
Srivastava, M. (2003), “Singular Wishart and Multivariate Beta Distributions,” The Annals of Statistics, 31, 1537–1560. DOI: 10.1214/aos/1065705118.
Web of Science ®Google Scholar
Sucarrat, G., and Grønneberg, S. (2020), “Risk Estimation with a Time-Varying Probability of Zero Returns,” Journal of Financial Econometrics, 20, 278–309.
Web of Science ®Google Scholar
Yu, P. L., Li, W., and Ng, F. (2017), “The Generalized Conditional Autoregressive Wishart Model for Multivariate Realized Volatility,” Journal of Business & Economic Statistics, 35, 513–527.
Web of Science ®Google Scholar

Singular Conditional Autoregressive Wishart Model for Realized Covariance Matrices

Abstract

1 Introduction

2 Singular Conditional Autoregressive Wishart (SCAW) Model

2.1 Stochastic Properties of the SCAW Model

3 Parameterization

3.1 Covariance Targeting

3.2 Sectorwise Parameterization

3.3 HAR Extension

4 Estimation

5 Empirical Application

5.1 Data and Estimation

Table 1 Summary statistics for the realized variance (multiplied by 10⁴) of 50 assets in each of the 12 market sectors considered.

Table 2 Summary statistics for the realized variance (multiplied by 10⁴) of 300 assets in each of the 12 market sectors considered.

5.2 Models

5.3 Forecasting

5.4 Results

Table 3 Summary of the rolling window forecasts for each respective model for 20 years of data (mid 1997 to mid 2017) on 50 assets proportionally distributed among the 12 sectors in NASDAQ classification, evaluated on a monthly basis.

Table 4 Ranking among models computed by using the MCS test (see Hansen, Lunde, and Nason Citation2011) with three loss functions ( ${FN}_{l}, {SD}_{EW, l}$ , and ${SD}_{GMV, l}$ , respectively) computed for rolling window forecasting in the case of data consisting of 50 stocks.

6 Conclusion

Supplemental Material

Acknowledgments

Supplementary Materials

References

Information for

Open access

Opportunities

Help and information

Singular Conditional Autoregressive Wishart Model for Realized Covariance Matrices

Abstract

1 Introduction

2 Singular Conditional Autoregressive Wishart (SCAW) Model

2.1 Stochastic Properties of the SCAW Model

3 Parameterization

3.1 Covariance Targeting

3.2 Sectorwise Parameterization

3.3 HAR Extension

4 Estimation

5 Empirical Application

5.1 Data and Estimation

Table 1 Summary statistics for the realized variance (multiplied by 104) of 50 assets in each of the 12 market sectors considered.

Table 2 Summary statistics for the realized variance (multiplied by 104) of 300 assets in each of the 12 market sectors considered.

5.2 Models

5.3 Forecasting

5.4 Results

Table 3 Summary of the rolling window forecasts for each respective model for 20 years of data (mid 1997 to mid 2017) on 50 assets proportionally distributed among the 12 sectors in NASDAQ classification, evaluated on a monthly basis.

Table 4 Ranking among models computed by using the MCS test (see Hansen, Lunde, and Nason Citation2011) with three loss functions (FNl, SDEW,l, and SDGMV,l, respectively) computed for rolling window forecasting in the case of data consisting of 50 stocks.

Table 5 Ranking among models computed by using the MCS test (see Hansen, Lunde, and Nason Citation2011) with three loss functions (FNl/SDEW,l/SDGMV,l) computed for fixed window estimation with forecasting horizon l=1,5,10 in the case of data consisting of 300 stocks.

6 Conclusion

Supplemental Material

Acknowledgments

Supplementary Materials

Additional information

Funding

Notes

References

Related research

To cite this article:

Download citation

Your download is now in progress and you may close this window

Login or register to access this feature

Information for

Open access

Opportunities

Help and information

Keep up to date

Table 1 Summary statistics for the realized variance (multiplied by 10⁴) of 50 assets in each of the 12 market sectors considered.

Table 2 Summary statistics for the realized variance (multiplied by 10⁴) of 300 assets in each of the 12 market sectors considered.

Table 4 Ranking among models computed by using the MCS test (see Hansen, Lunde, and Nason Citation2011) with three loss functions ( ${FN}_{l}, {SD}_{EW, l}$ , and ${SD}_{GMV, l}$ , respectively) computed for rolling window forecasting in the case of data consisting of 50 stocks.