1,812
Views
6
CrossRef citations to date
0
Altmetric
Theory and Methods

Learning Latent Factors From Diversified Projections and Its Applications to Over-Estimated and Weak Factors

&
Pages 909-924 | Received 01 Mar 2020, Accepted 25 Sep 2020, Published online: 20 Nov 2020
 

Abstract

Estimations and applications of factor models often rely on the crucial condition that the number of latent factors is consistently estimated, which in turn also requires that factors be relatively strong, data are stationary and weakly serially dependent, and the sample size be fairly large, although in practical applications, one or several of these conditions may fail. In these cases, it is difficult to analyze the eigenvectors of the data matrix. To address this issue, we propose simple estimators of the latent factors using cross-sectional projections of the panel data, by weighted averages with predetermined weights. These weights are chosen to diversify away the idiosyncratic components, resulting in “diversified factors.” Because the projections are conducted cross-sectionally, they are robust to serial conditions, easy to analyze and work even for finite length of time series. We formally prove that this procedure is robust to over-estimating the number of factors, and illustrate it in several applications, including post-selection inference, big data forecasts, large covariance estimation, and factor specification tests. We also recommend several choices for the diversified weights. Supplementary materials for this article are available online.

Supplementary Materials

The supplementary material contains all the technical proofs.

Notes

1 We show in the proof that (MF̂F̂M) and F̂F̂ are both invertible with probability approaching one. So PF̂M and PF̂ are well defined asymptotically.

2 Suppose {γi:iN} are cross-sectionally conditionally weakly dependent given Z=(zi:iN). Then ||1Niϕ(zi)γi||2=OP(X)1Ni||ϕ(zi)||2=oP(1) given the assumption that X:=maxi1Nj||E[γiγj|Z]||=oP(1), which holds if γi are conditionally weakly correlated. Then νmin(1NWΛ)νmin(A)||1Niϕ(zi)γi||coP(1). In addition, νmax(H)=νmax(1NWΛ)νN[νmax(A)+oP(1)]νNCνNCc1νmin(A)νNCc1[νmin(1NWΛ)+oP(1)]νN2Cc1νmin(H).

3 A simple calculation would only yield NTσ1N2||AW||||Σ̂uΣu||||W||OP(1) but not necessarily oP(1).

Additional information

Funding

Jianqing Fan’s research is supported by NSF grants DMS-1662139 and DMS-1712591.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.