1,432
Views
7
CrossRef citations to date
0
Altmetric
Articles

State-Varying Factor Models of Large Dimensions

&
Pages 1315-1333 | Published online: 17 Jun 2021
 

Abstract

This article develops an inferential theory for state-varying factor models of large dimensions. Unlike constant factor models, loadings are general functions of some recurrent state process. We develop an estimator for the latent factors and state-varying loadings under a large cross-section and time dimension. Our estimator combines nonparametric methods with principal component analysis. We derive the rate of convergence and limiting normal distribution for the factors, loadings, and common components. In addition, we develop a statistical test for a change in the factor structure in different states. We apply the estimator to the U.S. Treasury yields and S&P500 stock returns. The systematic factor structure in treasury yields differs in times of booms and recessions as well as in periods of high market volatility. State-varying factors based on the VIX capture significantly more variation and pricing information in individual stocks than constant factor models.

Supplementary Materials

The online appendix collects the proofs and additional results that support the main text. The additional theoretical results include a detailed description of special cases and related models and an extension to noisy and misspecified state processes. We also provide an estimator for the number of factors. The additional empirical results consider alternative state processes and discuss the choice of tuning parameters. We also study a portfolio application of our state-varying factors. The extensive simulation section compares the performance relative to alternative latent factor models that allow for time-variation and studies the choice of bandwidth and number of factors with cross-validation arguments. Lastly, we collect the detailed proofs for all the theoretical statements.

Acknowledgments

We thank Yacine Aït-Sahalia, Daniele Bianchi, Frank Diebold, Kay Giesecke, Eric Ghysels, Lisa Goldberg and seminar and conference participants at Stanford, the NBER-NSF Time-Series Conference, the Society for Financial Econometrics, the North American Summer Meeting of the Econometric Society, the European Meeting of the Econometric Society, the Western Mathematical Finance Conference, the SIAM Conference on Financial Mathematics & Engineering and INFORMS for helpful comments.

Notes

1 Extensions of the constant loading model include sparse and interpretable latent factors in Pelger and Xiong (Citation2020), estimation from incomplete datasets in Xiong and Pelger (Citation2020) and including additional moments to estimate weak factors as in Lettau and Pelger (Citation2020a,b).

2 Pelger (Citation2020) provided empirical evidence for time-variation in latent factor models that is related to recessions.

3 Fan et al.’s (2016) model loadings as nonlinear functions of time-varying features of the cross-sectional units. Their estimation approach applies PCA to the data matrix that is projected in the cross-section on the subject-specific covariates. We also apply PCA to a projected data matrix, but our projection is applied in the time dimension.

4 In this article, we consider a scalar state process and leave the extension to multivariate state processes to future research. We expect multivariate state processes to lead to lower convergence rates due to the “curse of dimensionality” inherent in higher dimensional kernel projections. An additional challenge is that many multivariate state processes do not have the recurrence property.

5 Wang et al. (Citation2019) also study a state-dependent latent factor model. Their focus is the estimation of a large-dimensional state-dependent covariance matrix, while we derive the inferential theory for the factors and conditional loadings. Ma et al. (Citation2020) developed a high-dimensional asset pricing test for conditional factor model with time-varying loadings and a large number of test assets.

6 Generalized correlations or canonical correlations have been studied in Anderson (Citation1958), Yuan and Bentler (Citation2000), Bai and Ng (Citation2006), Pelger (Citation2019), and Andreou et al. (Citation2019).

7 Our test differs from the existing tests, such as those of Breitung and Eickmeier (Citation2011), Chen, Dolado, and Gonzalo (Citation2014), Han and Inoue (Citation2015), and Yamamoto and Tanaka (Citation2015), which check the stability of the moments of factor loadings or common factors, but do not take invertible transformations into account. Our test takes a “micro” view to compare loadings in any two states, while Su and Wang (Citation2017) takes a “global” view to test whether loadings change in the whole time dimension. In the context of asset pricing, Ma et al. (Citation2020) focused on testing alpha, which is the first moment, while our test of loadings resembles testing beta, which is related to second moments.

8 See Diebold, Piazzesi, and Rudebusch (Citation2005), Diebold and Li (Citation2006), Cochrane and Piazzesi (Citation2005), and Cochrane (2009).

9 Breitung and Eickmeier (Citation2011) developed three test statistics for structural breaks. Chen, Dolado, and Gonzalo (Citation2014) study the detection of large breaks in loadings through a two-stage procedure. Han and Inoue’s (Citation2015) test for structural breaks by studying the stability in the second moments. Yamamoto and Tanaka (Citation2015) generalized Breitung and Eickmeier’s (Citation2011) test. Cheng, Liao, and Schorfheide (Citation2016) proposed a test where both the factor loadings and the number of factors change simultaneously. Baltagi, Kao, and Wang (Citation2020) and Bai, Han, and Shi (Citation2020) estimated structural breaks with pseudo factors, Ma and Su (Citation2018) proposed a three-step approach with local estimates and Barigozzi, Cho, and Fryzlewicz (Citation2018) estimated structural breaks with wavelets. Chen, Wang, and Wu (Citation2020) study the detection and inference of change points when factors are nonstationary and loadings are nonparametric functions of characteristics.

10 Compared with the convergence rate of the estimated loadings in the constant loading model, our convergence rate is slower by a polynomial factor while the convergence rate of the structural break models is slower by a logarithmic factor (e.g., Bai, Han, and Shi Citation2020; Chen, Wang, and Wu Citation2020).

11 In contrast to other time-varying factor models, for example, Bates et al. (Citation2013), Cheng, Liao, and Schorfheide (Citation2016), and Su and Wang (Citation2017), our model directly incorporates the driving forces for the changes in loadings. Park et al. (Citation2009) study a similar semiparametric factor model but require cross-sectional variation in the loadings to come from observable covariates, this means they estimate a function Λ(Cit) of observable covariates Cit, where Λ(.)Rr is the same function for all i. They apply B-splines to estimate the unknown loading function and estimate the factors with a Newton–Raphson algorithm. Our approach is based on a simple-to-implement PCA method, which allows us to derive an inferential theory.

12 Many common kernels satisfy this assumption: (i) Gaussian kernel K(u)=12πexp(u22). (ii) Uniform kernel K(u)=121(|u|1). (iii) Epanechnikov kernel k(u)=34(1u2)1(|u|1). (iv) Biweight kernel (k(u)=1516(1u2)21(|u|1)). (v) Triweight kernel (k(u)=3532(1u2)31(|u|1))). (vi) Many higher order kernels obtained by multiplying them by a higher order polynomial in u 2.

13 We can relax the stationarity assumption of St, if St is smooth and we have a growing number of observations in a Section 3 upweights observations in a local window.

14 Su and Wang (Citation2017) provided a “global” test for the constancy of factor loadings over time. Similar arguments could be applied to our framework. The proof would go through with some modification about controlling the bias from using data in other states. In a similar spirit, Kong (Citation2018) provided a global test in a high-frequency setup.

15 In order to study Λ¯(s1) and Λ¯(s2), we need to redefine Hsl=(Fsl)FslT(sl)Λ(sl)Λ¯(sl)N(Vrsl)1,Hslp(Qsl)1, where Qsl=Vsl(Υsl)ΣF|sl1/2, and Vsl are eigenvalues of ΣF|sl1/2ΣΛ(sl)ΣF|sl1/2, Υsl is the corresponding eigenvector matrix such that (Υsl)TΥsl=Ir. Under the same assumptions as Theorem 4, the asymptotic distribution of Λ¯i(sl) is Th(Λ¯i(sl)(Hsl)Λi(sl))dN(0,(Vsl)1QslΦis(Qsl)T(Vsl)1), where Φis is the same as the Φis in Theorem 4. Let λli=Λi(sl) and vli=(Hsl)ThT(sl)(1Nk=1Nλlkλlk)((Fsl)eisl), then we have Th(Λ¯i(sl)(Hsl)Λi(sl))=vli+op(1) under the same assumptions as in Theorem 4.

16 Vsl here is the same as the Vsl in Theorem 3, since the eigenvalues of ΣΛ(s)1/2ΣF|sΣΛ(s)1/2 are the same as those of ΣF|sl1/2ΣΛ(sl)ΣF|sl1/2.

17 Here, we denote by vec(·) the vectorization operator. Inevitably, the matrix ΣB,B is singular due to the symmetric nature of the covariance and a proper formulation uses vech operators and elimination matrices.

18 Here, we use the following result:

Lemma 1. Let Λ1RN×k1 and Λ2RN×k2. Assume Nmax(k1,k2),rank(Λ1)=k1 and rank(Λ2)=k2, let k=min(k1,k2), then we have ρk.

19 Note that our generalized correlation test statistic would also work when the dimensions of the loading spaces change with the state.

20 The squared error of the nonparametric method is Op(max(1N,1Th,h2)). In order for the results in Section 5 and 6 to hold, we have Nh,Th,Nh20 and Th30. This gives us a guideline for selecting the bandwidth h in the simulation and empirical studies and suggests range of 0.1 to 0.5. The appendix (supplementary material) collects the results for various bandwidths and shows that our findings are robust to the choice of h.

21 In Figure IA.15 in the appendix (supplementary material), we compare the estimation results of our state-varying factor model with the local time-varying model of Su and Wang (Citation2017) under the same simulation setup. Our state-varying factor model can recover the correct functional form while the local window estimator fails.

22 Although we correct for the bias, the empirical distribution is still slightly shifted to the left. Our bias correction term only takes into account the dominant bias term. We believe that correcting for higher order bias terms can correct the remaining minor bias. Note that the remaining minor bias makes our test statistic more conservative, that is, we are more likely to reject the null hypothesis.

23 See Diebold, Piazzesi, and Rudebusch (Citation2005), Diebold and Li (Citation2006), Cochrane and Piazzesi (Citation2005) and Cochrane (2009).

24 Our test-statistic is not a global test for changes in the factor structure, but aims at comparing two specific states. In order to use our results for a global test the p-values would need to be adjusted to account for multiple hypothesis testing.

Log in via your institution

Log in to Taylor & Francis Online

PDF download + Online access

  • 48 hours access to article PDF & online version
  • Article PDF can be downloaded
  • Article PDF can be printed
USD 61.00 Add to cart

Issue Purchase

  • 30 days online access to complete issue
  • Article PDFs can be downloaded
  • Article PDFs can be printed
USD 123.00 Add to cart

* Local tax will be added as applicable

Related Research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.