2,646
Views
1
CrossRef citations to date
0
Altmetric
Articles

A Robust Approach to Heteroscedasticity, Error Serial Correlation and Slope Heterogeneity in Linear Models with Interactive Effects for Large Panel Data

, ORCID Icon, &

Abstract

In this article, we propose a robust approach against heteroscedasticity, error serial correlation and slope heterogeneity in linear models with interactive effects for large panel data. First, consistency and asymptotic normality of the pooled iterated principal component (IPC) estimator for random coefficient and homogeneous slope models are established. Then, we prove the asymptotic validity of the associated Wald test for slope parameter restrictions based on the panel heteroscedasticity and autocorrelation consistent (PHAC) variance matrix estimator for both random coefficient and homogeneous slope models, which does not require the Newey-West type time-series parameter truncation. These results asymptotically justify the use of the same pooled IPC estimator and the PHAC standard error for both homogeneous-slope and heterogeneous-slope models. This robust approach can significantly reduce the model selection uncertainty for applied researchers. In addition, we propose a Lagrange Multiplier (LM) test for correlated random coefficients with covariates. This test has nontrivial power against correlated random coefficients, but not for random coefficients and homogeneous slopes. The LM test is important because the IPC estimator becomes inconsistent with correlated random coefficients. The finite sample evidence and an empirical application support the reliability and the usefulness of our robust approach.

1 Introduction

In recent years, the increasing availability of panel data with both a large cross-sectional dimension N and a large time series dimension T has necessitated the development of statistical methods to exploit the rich information they contain, but it has also created technical challenges. In particular, the control of cross-sectional dependence, parameter heterogeneity and serial dependence has been the main focus of the literature. Among other things, the modeling of the cross-sectional dependence by interactive effects has been widely adopted in recent years.

In particular, the past decade has seen significant development in estimation and inferential methods for linear models with interactive effects for large panel data. Two estimation approaches are particularly popular. The first involves eliminating the interactive effects from both the error term and the regressors. Representative methods for static models include the Common Correlated Effects (CCE) approach proposed by Pesaran (Citation2006), the principal component (PC) approach investigated by Westerlund and Urbain (Citation2015), the maximum likelihood (ML) approach of Bai and Li (2014), and the two-step instrumental variable (2SIV) approach proposed by Cui et al. (Citation2021), among others. The second approach asymptotically eliminates the interactive effects from the error term only. The representative method is the iterative principal components (IPC) estimator of Bai (Citation2009), further developed by Moon and Weidner (Citation2015), and Bai and Liao (Citation2017), among many others.

Concerning the NT-consistent estimation and the associated inference against error serial correlation and heteroscedasticity, the CCE, PC, 2SIV, and IPC approaches typically permit idiosyncratic errors to be serially correlated and heteroscedastic (see, Remark 1 in Westerlund and Urbain Citation2015, Assumptions 2.1 and 2.2 in Cui et al. Citation2021, Remark 8 in Bai Citation2009), whereas the ML approach generally does not allow such a degree of robustness (see Assumption B in Bai and Li 2014). The CCE, PC, and IPC estimators, however, will have asymptotic biases, essentially due to the endogeneity arising from estimating the interactive effects. Note that the properties of the asymptotic biases in the CCE and PC estimators and those of the IPC estimator are different, because of the difference in their approaches for eliminating the interactive effects. These asymptotic biases should be controlled, particularly for inference, either analytically or numerically by, for example, jackknife subsampling. For the latter, see Fernández and Weidner (Citation2016) and Westerlund (Citation2018).

For testing linear restrictions on the slope coefficients, most existing work routinely employs the Newey and West (Citation1987) type heteroscedasticity and autocorrelation consistent (HAC) variance-covariance estimator, which requires time-series parameter truncation. This includes the CCE and the PC estimators (see Westerlund Citation2018), as well as the IPC estimator (see Remark 8 in Bai Citation2009). The 2SIV estimator of Cui et al. (Citation2021) and other related papers, including Greenaway-McGrevy, Han, and Sul (Citation2012), Bonhomme and Manresa (Citation2015), and Fernández and Weidner (Citation2016), employ the Hansen (Citation2007) type panel HAC (PHAC) variance-covariance estimator without the parameter truncation. In this article we employ this PHAC estimator.Footnote1

One of the attractive features of the CCE, PC, and 2SIV approaches is that they permit estimation and inference for the models with (cross-sectionally) heterogeneous slopes. For such models, the parameter of interest is often the population average of slope coefficients, which is typically estimated as an average of estimators of cross-section specific slopes. This is called a mean-group (MG) estimator. For the MG estimators based on the CCE, PC, and 2SIV approaches, see Pesaran (Citation2006) and Cui et al. (Citation2021). Interestingly, as shown by Pesaran (Citation2006) and Reese and Westerlund (Citation2018), if the cross-sectional variation of slopes is independent of the regressors (i.e., if slopes are random coefficients), the aforementioned pooled CCE and PC estimators are N-consistent to the population average of slope coefficients. Pesaran (Citation2006) proposes a nonparametric estimator of the variance-covariance matrix of the N-consistent pooled CCE estimator. This estimator uses the sample variance-covariance matrix of cross-section specific slope estimates. The evidence therein has shown that this estimator behaves very well in finite samples.Footnote2

On the other hand, the IPC approaches typically do not permit models with heterogeneous slopes.Footnote3 In this article we investigate asymptotic properties of the pooled IPC estimator for the models with heterogeneous slopes assuming the regressors are subject to a factor structure. To the best of our knowledge, this is new to the literature. In this article, we show that the pooled IPC estimator is N-consistent for the models with heterogeneous slopes (and confirm it is NT-consistent for the models with homogeneous slopes), when the regressors have the factor structure. We also prove that the use of the PHAC variance-covariance matrix estimator for the pooled IPC estimator is asymptotically justified for the models with heterogeneous slopes, as well as for the models with homogeneous slopes. Our findings essentially imply that the inference based on the pooled IPC estimator and the associated PHAC variance estimator is asymptotically valid for both homogeneous slope and heterogeneous slope models.

Building upon these novel asymptotic results, we propose a robust approach against heteroscedasticity, error serial correlation and slope heterogeneity in linear models with interactive effects for large panel data. The basic idea behind this approach is classical. For the models with homogeneous slopes and homoscedastic and uncorrelated idiosyncratic errors, the pooled estimator is expected to be most efficient. Otherwise, an alternative estimator, such as a suitable generalized least squares (GLS) estimator, can be more efficient; see Bai and Liao (Citation2017) for the extension of the IPC estimator in this direction. This potential efficiency gain hinges on the correct specification of, in our context, error serial correlation, heteroscedasticity and slope heterogeneity, and achieving such specifications simultaneously does not seem easy in practice. In this kind of situation, the use of the pooled estimator, together with a robust variance-covariance matrix estimator for inference, is very likely to have practical appeal.Footnote4

Another main contribution of this article is a novel test for correlation and dependency between the random coefficients and covariates (hereafter called correlated random coefficients). For the proposed robust approach to work, the slope heterogeneity, if any, has to be essentially independent of the covariates (i.e., random coefficients). The proposed test is designed to detect departures from this independence and slope homogeneity. Note that the existing slope homogeneity tests for large panel data, such as Pesaran and Yamagata (Citation2008) and Su and Chen (Citation2013), among others, have power against random coefficients as well as correlated random coefficients, and these are not suitable for our purpose.

Here, we briefly discuss a related study, Galvao and Kato (Citation2014), which considers estimation and inference of pseudo-true parameters under some model misspecifications, such as inconsistent estimators in the fixed effect models with correlated random coefficients (see sec. 5.1.2 in Galvao and Kato Citation2014). Their theoretical results are closely related to ours in that the convergence rate of the pooled estimator changes from NT to N under correlated random coefficients. It is important to emphasize that this article deals with consistent estimation of true parameters of interest and the associated inference in the models with interactive effects, which is not explicitly considered in Galvao and Kato (Citation2014). In addition, in contrast to Galvao and Kato (Citation2014), our approach advocates a method that enables empirical researchers to statistically identify misspecified models (i.e., correlated random coefficient models) that can result in inconsistent estimation.

We have examined the finite sample performance of the estimators, the tests of parameter restrictions using the PHAC variance estimator, and the LM test for correlated random coefficients. The results show that the size of the proposed robust Wald test with the bias-corrected IPC estimator is sufficiently close to the nominal level in both slope homogeneity and slope heterogeneity, and that the LM test for correlated random coefficients has correct size under both slope homogeneity and random coefficients while exhibiting high power for correlated random coefficients. The practical usefulness of our robust approach is illustrated by applying the proposed methods to analyze the Feldstein–Horioka puzzle, presented in Feldstein and Horioka (Citation1980).

The article is organized as follows. The asymptotic properties of the IPC estimator and the associated robust Wald test statistic based on the PHAC variance-covariance estimator are investigated in Section 2. In Section 3, we introduce the LM test statistic for correlated random coefficients and study its limiting distribution. The finite sample performance of the IPC estimators, the robust Wald test and the LM test for correlated random coefficients is investigated using the Monte Carlo method in Section 4. An empirical application is provided in Section 5. Section 6 contains some concluding remarks. Proofs of the main results, additional discussions and experimental results are relegated to the supplementary materials.

Notations: we denote the largest eigenvalues of the N × N matrix A=(aij) by μmax(A), its trace by tr(A)=i=1Naii, its Frobenius norm by ||A||=tr(AA). The projection matrix on A is PA=A(AA)1A and MA=IPA. Δ is a generic positive constant large enough, δNT2=min{N,T}. We use N,T to denote that N and T pass to infinity jointly. We use () to represent () up to a positive constant factor. For any positive sequences an and bn that converge to some points or diverge as n, we write anbn if anbn and anbn.

2 Panel Data Models with Interactive Effects

We consider the following panel data model with possibly heterogeneous coefficients,(1) yit=xitβi+λi0ft0+εit,(i=1,2,,N ;t=1,2,,T ),(1) (2) xit=Γi0gt0+vit,(2) where xit is a k×1 vector of regressors, ft0 and gt0 denote r1×1 and r2×1 vectors of latent factors, respectively. Correspondingly, their factor loadings are λi0 and Γi0=(γ1i0,,γki0). εit and vit are the idiosyncratic disturbance terms.

If we stack the EquationEquations (1) and Equation(2) over t, we have(3) yi=Xiβi+F0λi0+εi ,(3) (4) Xi=G0Γi0+Vi,(4) where yi=(yi1,,yiT),Xi=(xi1,,xiT),F0=(f10,,fT0),G0=(g10,,gT0), εi=(εi1,,εiT) and Vi=(vi1,,viT). Supposing random coefficients(5) βi=β0+ηi,(5) we can rewrite the model (3) using the process (4) as(6) yi=Xiβ0+W0ϑi0+Viηi+εi,(6) where W0=(G0,F0) and ϑi0=(ηiΓi0,λi0). The results in this article will hold when F0 and G0 share common factors (say, W0=G0F0 and F0G00). However, for simplicity, we suppose F0G0=0 and rank(W0)=r1+r2.

Noting that for the IPC estimator the interactive effects F0λi0 for the models with homogeneous slopes and W0ϑi0 for the models with heterogeneous slopes are extracted from the error vector yiXiβi0, we employ the following common expression to describe both homogeneous-slope and heterogeneous-slope models:(7) yi=Xiβ0+ui, with ui=H0ϕi0+ei,(7) where {H0,ϕi0,ei}={F0,λi0,εi} and r=r1 for the models with homogeneous slopes and {H0,ϕi0,ei}={W0,ϑi0,Viηi+εi} and r=r1+r2 for the models with heterogeneous slopes. Hereafter, we use EquationEquation (7) to describe both homogeneous-slope and homogeneous-slope models.

Controlling the interactive effects, H0ϕi0, typically results in asymptotic biases of the NT-consistent CCE, PC, and IPC estimators, as shown by Pesaran (Citation2006), Bai (Citation2009), and Westerlund and Urbain (Citation2015), among others.

As discussed in the introduction section, in this article we focus on the iterative principal component (IPC) estimator proposed by Bai (Citation2009), which estimates the interactive effects in ui by the PC method and the slope coefficient β0 by the least-squares method, iteratively.

In particular, define the least squares objective functionSSR(β,H)=1NTi=1N(yiXiβHϕi)(yiXiβHϕi)subject to the constraints HH/T=Ir and i=1Nϕiϕi being diagonal.

The least squares estimator for (β,H), denoted by (β̂,Ĥ), is the IPC estimator, which is the solution to the following nonlinear equations:(8) β̂=(i=1NXiMH^Xi)1(i=1NXiMH^yi)[1NTi=1N(yiXiβ̂)(yiXiβ̂)]H^=H^VNT(8) where VNT is a diagonal matrix that contains the r largest eigenvalues of the matrix in the square brackets in decreasing order. Given (β̂,H^), we can estimate ϕi byϕ̂i=1TĤ(yiXiβ̂).

The number of factors, r, and an initial estimator of β0, should be chosen to obtain the IPC estimator. In this section and the next, we suppose that r and the initial estimator of β0 are given, and the procedure to choose them is discussed in Section 4.1.

Remark 1.

Permitting different sets of interactive effects in Xi and ui is important, because the IPC estimator extracts the interactive effects in ui only, which does not necessarily project out the factors in Xi. Nevertheless, our results in this article hold when Xi and ui share the same set of interactive effects. See Norkutė et al. (Citation2021) and Cui et al. (Citation2021) for further discussions.

Remark 2.

The factor model for the regressors specified in (2) has been widely employed in the literature: see Pesaran (Citation2006), Bai and Li (2014), Westerlund and Urbain (Citation2015), Li, Cui, and Lu (Citation2020), Norkutė et al. (Citation2021), among many others. For the IPC estimators, typically no factor structure in the regressors is imposed. However, only the model with homogeneous slopes is considered; see Bai (Citation2009) and Moon and Weidner (Citation2015), among others. Importantly, our new results shown below will reveal that, the pooled IPC estimator remains consistent for the random coefficient models with interactive effects if the regressors have a factor structure as in (2).

For the asymptotic analysis of the estimator, we impose the following assumptions.

Assumption 1

(idiosyncratic error in y). (i) εit is distributed independently across i; (ii) E(εit)=0 and E|εit|8Δ; (iii) T1s=1Tt=1TE|εisεit|Δ; (iv) E|N1/2i=1N [εisεitE(εisεit)]|4Δ for each (s, t); (v) N1T2i=1N t=1Ts=1Tr=1Tw=1T|cov(εisεit,εirεiw)|Δ; (vi) Ωε,i=E(εiεi) is positive definite and its largest eigenvalue is bounded, uniformly every i and T.

Assumption 2

(idiosyncratic error in x). Let vlit be the lth element of vit and vl,i=(vli1,,vliT). Then we assume that (i) vit is independently distributed across i and independent of {εjs} for 1jN and 1sT; (ii) E(vit)=0 and E||vit||8Δ; (iii) T1s=1Tt=1TE|vlisvlit|Δ; (iv) E|N1/2i=1N[vlisvlitE(vlisvlit)]|4Δ for every l, s, and t; (v) N1T2i=1Nt=1Ts=1Tr=1Tw=1T |cov(vlisvlit,vlirvliw)|Δ; (vi) the largest eigenvalue of E(vl,ivl,i) is bounded uniformly for every l, i, and T.

Assumption 3

(random coefficients). is independent across i, and is independent of εit, vit,λi0 and Γi0; E(ηi)=0 and E||ηi||4Δ; E(ηiηi)=Ωη,i is a fixed positive definite matrix uniformly over i.

Assumption 4

(factor components). (i) Let wt0=(ft0,gt0). E||wt0||4Δ and T1t=1Twt0wt0pΣw=[ΣfΣfgΣgfΣg], which is positive definite and wt0 is independent of vit and εit; (ii) E||Γi0||4Δ, E||λi0||4Δ, N1i=1NΓi0Γi0pΣΓ,N1i=1NΓi0ηiηiΓi0pΣγη and N1i=1Nλi0λi0pΣλ, where ΣΓ,Σγη, and Σλ are positive definite matrices. Γi0 and λi0 are independent of vit and εit.

Assumption 5

(identification and variance matrices). We assume that the four matricesA0=limN,T1NTi=1NE(ViVi),A1=plimN,T1NTi=1NZiMH0Zi,C0=limN,T1NT2i=1NE(ViViηiηiViVi),B1=plimN,T1NTi=1NZiMH0Ωε,iMH0Ziare fixed and positive definite, where Zi=XiN1j=1Nϕi0 ϒϕ01ϕj0Xj, ϒϕ0=N1i=1Nϕi0ϕi0.

Assumption 6

(Central Limit Theorem). When N,Tsuch that T/Nρ(0,Δ], 1NTi=1NZiMH0εidN(0,B1).

Assumptions 1 and 2 are very similar to Assumption in Bai (Citation2009), though, εit and vit are independent of each other and independent over i. Assumption 3 is a typical assumption for random coefficient models with interactive effects (see, e.g., Assumption HET in Reese and Westerlund Citation2018). Assumption 4 implies that the factor loadings Γi0 and λi0 are allowed to be correlated with each other. Assumption 5 is for identification of the parameters and for estimation of the variance-covariance matrices. Assumption 6 is imposed following Bai (Citation2009) for the asymptotic normality of the pooled IPC estimator for the models with homogeneous slopes.

In what follows, we investigate asymptotic properties of the IPC estimator for the models with homogeneous and heterogeneous slopes, assuming that the regressors have a factor structure. As discussed in the introduction section, the results for the models with heterogeneous slopes are new additions to the literature. We first derive asymptotic expressions of the pooled IPC estimator, β̂, for the models with homogeneous and heterogeneous slopes in Proposition 1. These confirm the presence of asymptotic bias in β̂ for the models with homogeneous slopes, as shown by Bai (Citation2009) and Cui et al. (Citation2021), among others. After introducing an asymptotic expression for the bias-corrected IPC estimator, which is common to the models with homogeneous and heterogeneous slopes in Proposition 2, we present asymptotic normality of the bias-corrected IPC estimator for both models in Theorem 1. Finally, in Theorem 2, we show the asymptotic validity of the robust Wald test for linear constraints on β0, which is based on the same statistic for both homogeneous-slope and heterogeneous-slope models.

A similar line of discussion in Bai (Citation2009) and Cui et al. (Citation2021) for the model with homogeneous slopes, and in Pesaran (Citation2006) for the case of heterogeneous slopes provides the following asymptotic expressions:

Proposition 1.

Suppose that Assumptions 1–5 hold.

  1. For the model with homogeneous slopes, we have(9) β̂β0=(1NTi=1NZiMH0Zi)11NTi=1NZiMH0εi+ 1NξNT+1TζNT+Op(δNT3),(9)

    where the bias terms are given byξNT=(1NTi=1NZiMH0Zi)11NTi=1Nt=1TZiH0(H0H0)1ϒϕ01ϕi0E(eit2),ζNT=(1NTi=1NZiMH0Zi)11N2i=1Nj=1NXiMH0E(ejej)H0(H0H0)1ϒϕ01ϕi0.

  2. For the model with heterogeneous slopes, we have(10) β̂β0=(1NTi=1NViVi)11NTi=1NViViηi+Op(δNT2).(10)

A few remarks regarding Proposition 1 are warranted. Proposition 1(a) is essentially the same result for the models with the homogeneous slopes shown in Bai (Citation2009) and Cui et al. (Citation2021). Because of the estimation of the interactive effects, Xi has to be replaced by Zi, and the bias terms ξNT and ζNT are present. Under our assumptions ξNT and ζNT are Op(1), because Zi can be correlated with ϕi0 and E(ejej) is not necessarily proportional to IT. Proposition 1(b) reveals that the pooled IPC estimator is N-consistent and asymptotically free from the effects of interactive effects. See a similar result for pooled CCE and PC estimators in Pesaran (Citation2006) and Reese and Westerlund (Citation2018).

To derive an asymptotically unbiased estimator of β0 in the case of homogeneous slope, we should estimate the bias terms. Following Bai (Citation2009) and Moon and Weidner (Citation2015), we propose an analytically bias corrected estimator, which is defined by(11) β˜=β̂1Nξ̂NT1Tζ̂NT,(11) whereξ̂NT=(1NTi=1NẐiMĤẐi)11NT2i=1Nt=1TẐiĤϒ̂ϕ1ϕ̂iêit2ζ̂NT=(1NTi=1NẐiMĤẐi)1Ni=1NXiMHΩĤTϒ̂ϕ1ϕ̂i with Ẑi=XiN1j=1Nϕ̂iϒ̂ϕ1ϕ̂jXj,ϒ̂ϕ=N1i=1Nϕ̂iϕ̂i,(12) XiMHΩĤT=1TNj=1N[t=1Têjt2x^ith^t+s=1St=s+1T(1sS+1)êjtêj,ts(x^ith^ts+x^i,tsh^t)](12) and êit being the tth element of êi=yiXiβ̂Ĥϕ̂i.

Following Bai’s (Citation2009) suggestion we employ the Newey and West (Citation1987) type estimator of ζNT as shown in EquationEquation (12). In the Monte Carlo experiment in Section 4, we set S=T1/4.

Remark 3.

Recently, jackknife bias-correction, a numerical way of asymptotically eliminating the asymptotic biases, was proposed and applied in Fernández and Weidner (Citation2016) and Westerlund (Citation2018), among others. The bias terms in EquationEquation (11) are Op(N1) and Op(T1), which indicates that the jackknife bias-correction of Dhaene and Jochmans (Citation2015) by splitting the time-series and cross-section dimensions would be applicable for β̂. We provide the procedure for this jackknife bias-correction and examine its finite sample performance in Section 4.

Now consider the robust approach against slope heterogeneity. Such robustness may be achieved when the leading terms in EquationEquations (9) and Equation(10) share a common expression, and the estimated asymptotic bias terms are bounded for the models with heterogeneous slopes. Indeed, we prove that for the models with heterogeneous slopes 1NTi=1NZiMH0Zi=1NTi=1NViVi+op(1), 1NTi=1NZiMH0ui=1NTi=1NViViηi+op(1), ζ̂NT=Op(1) and ζ̂NT=Op(1), and for the models with homogeneous slopes 1NTi=1NZiMH0ui=1NTi=1N ZiMH0εi under Assumptions 1–5; see the supplementary materials.

The following proposition shows a common stochastic representation of the bias-corrected IPC estimator for the models with homogeneous and heterogeneous slopes.

Proposition 2.

Suppose that Assumptions 1–5 hold. Then, we have(13) β˜β0=(1NTi=1NZiMH0Zi)1× 1NTi=1NZiMH0ui+Op(δNTϵ)(13) where ϵ = 3 for the models with homogeneous slopes and ϵ = 2 for the models with heterogeneous slopes, as (N,T) and T/Nρ(0,Δ].

Equipped with Assumption 6, we are ready to present the asymptotic normality of the bias-corrected estimator β˜.

Theorem 1.

Suppose that Assumptions 1–6 hold and T/Nρ(0,Δ].

  1. For the model with homogeneous slopes, we haveNT(β˜β0)dN(0,A11B1A11).

  2. For the model with heterogeneous slopes, we haveN(β˜β0)dN(0,A01C0A01),

    where A0,A1,B1, and C0 are defined in Assumption 5.

Let us now turn our attention to the estimation of the asymptotic variance-covariance matrix of β˜. Define the panel heteroscedasticity and autocorrelation consistent (PHAC) variance-covariance matrix estimator by(14) Σ̂β˜=(i=1NẐiMĤẐi)1(i=1NẐiMĤûiûiMĤẐi)×(i=1NẐiMĤẐi)1,(14) where ûi=yiXiβ̂. Note that this PHAC estimator does not require Newey and West (Citation1987) type time-series parameter truncation.

One of the highlights of this article is the revelation of the asymptotic validity of the PHAC variance estimator for both homogeneous-slope and heterogeneous-slope models with interactive effects.Footnote5 This may be intuitively understood by the following discussion, with the help of Propositions 1 and 2 and Theorem 1. For the models with homogeneous slopes, EquationEquations (9) and Equation(13) yieldNT(β˜β0)=(1NTi=1NZiMH0Zi)1×1NTi=1NZiMH0ui+op(1)=(1NTi=1NZiMH0Zi)1×1NTi=1NZiMH0εi+op(1).

From this, we can easily infer that NTΣ̂β˜pA11B1A11. For the models with heterogeneous slopes, EquationEquations (10) and Equation(13) give(15) N(β˜β0)=(1NTi=1NZiMH0Zi)1×1NTi=1NZiMH0ui+op(1)=(1NTi=1NViVi)1×1NTi=1NViViηi+op(1),(15) from which, we can conjecture that NΣ̂β˜pA01C0A01.

In the following theorem, we formally establish the limit distribution of the Wald test statistic for linear restrictions on β0, based on the bias-corrected IPC estimator β˜ and the PHAC variance estimator Σ̂β˜.

Theorem 2.

Consider testing m linearly independent restrictions on β0, H0:β0=r against H1:β0r, where is an m × k fixed matrix of full row rank. Consider the model (7) and the Wald statistic(16) W˜NT=(β˜r)(Σ̂β˜)1(β˜r)(16) where β˜ and Σ̂β¯ are defined by (11) and (14), respectively. Suppose that Assumptions 1–6 hold. Then, under the null hypothesis, for the models with homogeneous slopes (ηi=0 for all i) and for the models with heterogeneous slopes, W˜NTdχm2, as (N,T) and T/Nρ(0,Δ].

To our knowledge, this is the first article which shows the consistency of the PHAC estimator in the random coefficients models with interactive effects for large panel data. As discussed in the introduction section, the use of the PHAC variance estimator for the models with homogeneous slopes with large N and T is proposed by Hansen (Citation2007) for the fixed effects models, and considered by Greenaway-McGrevy, Han, and Sul (Citation2012), Vogelsang (Citation2012), Fernández and Weidner (Citation2016), and Cui et al. (Citation2021) for the models with homogeneous slopes, among others.

Remark 4.

Our approach is also robust against mixtures of homogeneous and heterogeneous slopes.Footnote6 To illustrate this, consider the model without common components and the case in which the k slopes are partitioned in such a way that k=k1+k2, where βi=(β1i,β20),β1i=β10+η1i,E(η1i)=0 and var(η1i)=Ω1i, with β0=(β10,β20). The expansion of the pooled ordinary least squares (OLS) estimator β̂=(β̂1,β̂2) givesN(β̂β)=(i=1NXiXiNT)11NTi=1NXiui=(i=1NXiXiNT)11Ni=1N(XiX1iT)η1i+ Op(1/T)dN(0,A1CA1)where A=plimN,T(NT)1i=1NXiXi, which is accordingly partitioned asA=[A11A12A21A22],andC=plimN,T1Ni=1N(XiX1iT)η1iη1i(X1iXiT), which are assumed to be fixed and positive definite. Also note that the convergence rate of (β̂1 and) β̂2 is N, as the variation of β̂ is dominated by that of η1i.

Now consider a special case in which plimTX1iX2i=0. Then, a similar discussion gives N(β̂1β10)dN (0,A111C11A111) and NT(β̂2β20)dN(0,A221B22A221), where C11=plimN,T(NT2)1i=1NX1iX1iη1iη1i X1iX1i and B22=plimN,T(NT)1i=1NX2iεiεiX2i. Therefore, the asymptotic normality of β̂, the consistency of the PHAC estimator and the asymptotic validity of Wald test hold with mixtures of homogeneous and heterogeneous slopes.

3 LM Test for Correlated Random Coefficients with Covariates

As discussed earlier, the proposed robust approach is asymptotically justified for the models with homogeneous slopes and for random coefficient models, in which the slope heterogeneity is generally independent of the variations of the regressors (see Assumption 3). In particular, when the heterogeneous coefficients are correlated with the defactored regressor matrix, MH0Xi, the approach may not work. To see this, consider the bias-corrected IPC estimator β˜ defined by EquationEquation (11) for the models with heterogeneous slopes. EquationEquation (15) gives β˜β0(i=1NViVi)1i=1NViViηi. If E(ηi|Vi)0, which violates the random coefficient assumption (in Assumption 3), E[ViViE(ηi|Vi)] is not necessarily a zero vector, and this may render β˜ inconsistent, in general. Note that even if we had nonzero correlations between ηi and the factor loadings, the slope estimator would remain consistent so long as the regressor matrix is defactored, because the leading term in EquationEquation (15) does not contain the factor loadings. Therefore, we focus on the correlated random coefficients caused by E(ηi|Vi)0.

There are studies which propose tests to detect slope heterogeneity for large panel data. However, most of them have focused only on testing the slope heterogeneity, not the dependence of the slope heterogeneity on the regressors. Such examples are found in Pesaran and Yamagata (Citation2008) and Su and Chen (Citation2013), among others. These existing tests are not suitable for our robust approach, because they have power against the random coefficient models as well as the correlated random coefficient models.

In view of this, we propose a novel Lagrange Multiplier (LM) or Score test of dependence of random coefficients on covariates. The proposed test is designed to detect departures from the assumption of independence between slope heterogeneity and the regressors, or departures from the assumption of slope homogeneity toward slope heterogeneity dependent on the regressors. Hereafter, for simplicity, we call it the LM test for correlated random coefficients (CRC). Specifically, we extend the test of correlated random effects proposed in (Wooldridge Citation2010, chap. 11.7.4) for short panels in the fixed effects models. Therein, using a similar approach taken by Mundlak (Citation1978) and Chamberlain (Citation1980), the correlated heterogeneity is modeled as a linear function of within average of the regressors, βi=μ+θx¯i+ωi, where x¯i=T1t=1Txit and ωi is a mean-zero random vector which is uncorrelated with x¯i. From Equationequations (4) and Equation(5), it is easily seen that the error term ui for the model of omitted slope heterogeneity will contain the term Xiβi and this motivates his test of correlated random coefficients, which examines the significance of Xix¯i. As pointed out by (Wooldridge Citation2010, p. 386), this test does not have power against uncorrelated slope heterogeneity, which is the desirable property for our approach. We extend the test by considering the models with interactive effects for large N and T and by permitting the alternatives, E(ηi|Vi), to be a nonlinear function of Vi.

To motivate the LM test for correlated random coefficients, suppose that the random part of the coefficients, ηi (see EquationEquation (5)), is decomposed into correlated and uncorrelated parts:(17) ηi=ψ(Vi)+ωi,(17) where E[ψ(Vi)|Vi]0 but E[ωi|Vi]=0. Various functional forms of ψ(.) can be entertained. As an illustration, suppose the lth element of ψ(Vi) to be ψli=1Tt=1T{f(vlit)E[f(vlit)]}, where the function f(.) satisfies certain regularity conditions. Then, the qth-order Maclaurin series expansion of f(vlit)E[f(vlit)] gives (Tex translation failed), where θp=1p!f(p)(0),f(p)(.) is the pth-order derivative of f(.), and Rq is the reminder term. For testing purposes, examining whether the first few terms of the expansion, say g(q) terms, are zero or not may suffice. This motivates our test of correlated random coefficients (defined below) for the null hypothesis,(18) H0:θ=0 against H1:θ0,(18) where θ=(θ1,,θg).Footnote7

Having discussed a process of the correlated random coefficients, let us turn our attention to the test statistic. Following on from the above discussion, we define the infeasible score of the LM test for the models with homogeneous (respectively, heterogeneous) slopes as 1NTi=1NLiMH0ui (resp. 1NTi=1NLiMH0ui), where Li=XiT(ΞiΞ),(19) Ξi=(v¯i(1),v¯i(2),,v¯i(g)),(19)

Ξ=limNN1i=1NE[Ξi], with v¯i(p)=(v¯1i(p),v¯2i(p),, v¯ki(p)),v¯li(p)=T1t=1Tvlitp,vlitp is the pth power of the (t,l) element of Vi,p=1,,g. Observe that the direct alternative model of ψ(Vi) for this score is T(ΞiΞ)θ, where θ is the g×1 constant vector.

To construct the associated feasible score, Li, H0, and ui should be replaced by their sample counterparts, which are L^i=XiT(ΞiΞ¯), Ĥ, and u˜=yiXiβ˜, respectively, where Ξ̂i=(v¯̂i(1),v¯̂i(2),,v¯̂i(g)),v¯̂i(p)=(v¯̂1i(p),v¯̂2i(p),,v¯̂ki(p)),v¯̂li(p)=T1t=1Tv̂litp with V̂i=MĤXi=(v̂lit),p=1,,g. The resulting feasible score for the models with homogeneous (resp. heterogeneous) slopes is 1NTi=1NL̂iMH^u˜i (respectively, 1NTi=1NL̂iMH^u˜i). Let us sketch the derivation of the LM test statistic next. For the models with homogeneous slopes, a familiar expansion gives u˜i=uiXi(β˜β0), which yields1NTi=1NL̂iMH^u˜i=1NTi=1NL̂iMH^ui+1NTi=1NL̂iMH^XiNT(β˜β0).

For a reason similar to why β̂ has asymptotic bias terms, the first term of the right hand side of the above equation will have asymptotic bias terms. For the second term, demeaning Li and Xi is required for a reason similar to why the asymptotic variance of β̂ involves ZiMH0Zi rather than XiMH0Xi. Define such a demeaned Li, Li=LiN1j=1Nϕi0ϒϕ01ϕj0Lj. For the LM test, we impose the following assumption, which is similar to Assumption 6:

Assumption 7.

When N,Tsuch that T/Nρ(0,Δ], 1NTi=1NLiMH0εidN(0,B2), where B2=plimN,T1NTi=1NLiMH0Ωε,iMH0Li, which is positive definite.

For the slope heterogeneous case, the use of the same score scaled by 1T will be justified. For more details, see proof of Theorem 3 in the supplementary materials.

Consequently, the LM test statistic for correlated random coefficients, which is common to the model with homogeneous slopes and the random coefficient model, is defined by(20) LMCRC(g)=ŝΣ̂ss1ŝ(20) where ŝ=ŝN1ξ̂NTT1ζ̂NT with ŝ=i=1NL̂iMH^u˜i,ξNT=1NT2i=1Nt=1TL̂iĤϒ̂ϕ1ϕ̂iêit2, ζNT=1Ni=1NLiMHΩĤTϒ̂ϕ1ϕ̂i,

L̂i=L̂iN1j=1Nϕ̂iϒ̂ϕ1ϕ̂jL̂j, LiMHΩĤT is computed analogously to the computation of XiMHΩĤT (see EquationEquation (12)), and Σ̂ss=i=1NK̂iMĤu˜iu˜iMĤK̂i withK̂i=L̂iẐi(i=1NẐiMĤẐi)1i=1NẐiMĤL̂i.

Now we are ready to present the results of the limiting distribution of the LM test statistic.

Theorem 3.

Consider the model (7). Suppose that Assumptions 1–7 hold. Then, under the null hypothesis specified in (18), LMCRC(g)dχg2, as N,T and T/Nρ(0,Δ], for both homogeneous-slope and heterogeneous-slope models.

In practice, the degree of polynomial, g, for the statistic LMCRC(g), should be chosen. In order to approximate the function ψ(Vi) to the extent that the LM test has sufficient power, a small value of g, such as two, may be sufficient. The experimental results reported below support the choice of g = 2. In addition, the asymptotic local power analysis, which is provided in the supplementary materials, suggests that the LM test has nontrivial power when ||θ||1NT and that the LM test associated with the pth column of the test matrix Li has nontrivial power only against the pth term in the expansion of ψ(Vi), that is, H1:θp0. The experimental results shown below support these analytical results.

When the null hypothesis of random coefficient models is rejected in favor of the alternatives, it is preferable to employ estimators which are consistent to β0. For the estimation of the models with interactive effects for large panel data, the CCE and 2SIV mean group estimators and the ML estimators proposed by Pesaran (Citation2006), Cui et al. (Citation2021), and Li, Cui, and Lu (Citation2020), among others, would be possible choices.

4 Monte Carlo Experiments

In this section we investigate the finite sample performance of our robust approach against slope heterogeneity, error serial correlation and heteroscedasticity. We consider the performance of the (analytically) bias-corrected IPC estimator defined by (11), which is denoted as BC-IPC. In addition, as discussed in Remark 3, we introduce a jackknife bias-corrected IPC estimator, which is denoted as JK-IPC. To evaluate accuracy of the bias-correction, the IPC estimator without bias-corrections is considered as well. In particular, we examine biases, standard deviations and root mean square errors (RMSEs) of these estimators, and empirical size of the (Wald) test for linear restrictions of slope coefficients. Furthermore, performance of the LM test for correlated random coefficients is examined.

Before introducing our experimental design, we discuss how to choose the initial estimator and the number of factors r for the IPC estimation in Section 4.1, then the procedure for computing the jackknife bias-corrected IPC estimator in Section 4.2.

4.1 Initial Value for the IPC Estimator and the Number of Factors

In practice, to compute the IPC estimator, an initial (consistent) estimator of β0 and the number of factors in ui must be given. Exploiting the factor structure in Xi and ui, we propose to employ the consistent PC estimator of Westerlund and Urbain (Citation2015) and Reese and Westerlund (Citation2018) as the initial estimator of β0, which is defined as β̂PC=(i=1NXiMŴXi)1i=1NXiMŴyi, where Ŵ is a T×r1+r2̂ PC estimator of a span of W0=(F0,G0). r1+r2̂ and Ŵ are estimated using {Xi,yi}i=1N, applying the Eigenvalue Ratio (ER) or the Growth Ratio (GR) estimators proposed by Ahn and Horenstein (Citation2013), and the PC method, respectively. The estimate of the number of factors in ui,r̂, is obtained using the residual, ûPCi=yiXiβ̂PC, applying the ER or GR methods.Footnote8 In the experiment below, we set the maximum number of factors to six to estimate r1+r2 and r. Alternatively, the CCE or 2SIV estimator of Pesaran (Citation2006) and Cui et al. (Citation2021) can be used as an initial estimator of β0, which is not pursued in this article.

4.2 Jackknife Bias Correction

In Sections 2 and 3, the analytical bias correction of the IPC estimator β̂ is considered. Instead, as discussed in Remark 3, subsampling methods can be employed for bias corrections. Specifically, we consider the split panel jackknife in both the cross-section and time dimensions, which is proposed in Fernández and Weidner (Citation2016).

To define the jackknife bias-corrected IPC estimator, assuming that N and T are even numbers for simplicity, consider the index set, N={1,,N}, which is divided into two subgroups, N1={1,,N/2} and N2={(N/2)+1,,N}. Similarly define T={1,,T},T1={1,,T/2} and T2={(T/2)+1,,T}. Let us denote the IPC estimator as β̂N,T(=β̂), in order to explicitly express the dependence on the (sub)sample, (i,t)(N,T). Then, we define the jackknife bias-corrected IPC estimator as β̂JK=(β̂TS+β̂CS)β̂N,T, where β̂TS=2β̂N,Tβ̂N,T1+β̂N,T22 and β̂CS=2β̂N,Tβ̂N1,T+β̂N2,T2. For more details, see the supplementary materials and Section 3.2 in Fernández and Weidner (Citation2016).

4.3 Experimental Design

Consider the following data generating process:(21) yit=l=1kxlitβli0+f1t0λ1i0+f2t0λ2i0+σε,itεit,i=1,2,,N;t=1,2,,T(21) where λsi0iidN(0,1),fst0=ρffs,t10+1ρf2ϵf,st,ϵf,stiidN(0,1) with fs00iidN(0,1) for s=1,2,3,εit=ρεεit1+1ρε2ϵε,it,ϵε,itiidN(0,1) with εi0iidN(0,1), and(22) σε,it=(κε,iκε,t)1/2, κε,iiidU(0.5,1.5) and κε,t=0.5+t/T.(22)

The regressors xlit,l=1,2,,k, are generated as(23) xlit=f1t0γl1i0+f3t0γl3i0+0.3σv,itvlit,(23) where vlit=ρvvlit1+1ρv2ϖlit. Note that f1t0 is common to both yit and xlit, whereas f2t0 and f3t0 are unique to yit and xlit, respectively. ϖlit is generated as ϖlit=(ϖlitc)/2c,ϖlitiidχc2 and vli0=(vli0c)/2c,vli0iidχc2 with c=6.Footnote9 The factor loadings in xlit are generated as(24) γlsi0=0.7λsi0+(10.72)1/2φlsi,(24)

φlsiiidN(0,1) for l=1,,k and s=1,3, so that they are correlated with factor loadings in yit .(25) σv,it=(κv,iκv,t)1/2,κv,iiidU(0.5,1.5) and κv,t=4.5+t/T.(25)

Finally we have(26) βli=βl0+σηηli,(26) (27) ηli=ρxηψli+1ρxη2ωli(27)

ωliiidN(0,1) for l=1,,k and ψli(P)=p=1qθpz˜li,p/ p=1qθp2, where P={p:θp0,pI} with I={1,2,,q} and θp{0,1},z˜li,p=zli,pz¯l,pszl,pwith z¯l,p=N1i=1Nzli,p,szl,p2=(N1)1i=1N (zli,pz¯l,p)2 and zli,p=T1t=1Tvlitp. Observe that the variance of βli is controlled to be ση2. θp selects the term z˜li,p. Specifically, in the experiment, we consider ψli(1,2,3,4)=p=14z˜li,p/4 and ψli(p)=z˜li,p,p=1,,4. It is interesting to examine the impact of such linear and highly nonlinear relationships on the estimators and the power of the LM tests.

We set k = 2 (two regressors) for all experiments. Before the estimation, the data is within-transformed and cross-sectionally demeaned, to make the results invariant to the inclusion of (additive) individual effects and time effects. For parameter values, we set (β10,β20)=(1,1).

We consider seven experimental designs as outlined in . Designs 1–7 are considered to investigate the finite sample behavior of the slope estimator, the associated robust Wald test and the LM test for correlated random coefficients.

Table 1 Experimental designs.

Design 1 corresponds to the model with homogeneous slopes (ση=0) and Design 2 to the model with random coefficients (ση=0.5 and ρxη=0). Designs 3-7 correspond to models with heterogeneous slopes that depend on the regressors (ση=0.5 and ρxη=0.5) in different ways. These designs are considered in order to investigate the impact of such dependence on the estimators and the power properties of the LM test. In Design 3, the slope heterogeneity contains a fourth-degree polynomial of the defactored regressors (ψli(1,2,3,4)), which is highly nonlinear. In Designs 4–7, the slope heterogeneity contains ψli(1),ψli(2),ψli(3), and ψli(4), respectively. In the article we present the results for Designs 1 to 3, with the rest in the supplementary materials.

We consider all combinations of N = 50, 100, 200 and T=25,50,100,200. Throughout the experiment, we set ρf=0.5,ρε=0.5 and ρv=0.5. All tests are conducted at 5% significance level. All experimental results are based on 2000 replications.

4.4 Results

In order to compute the IPC estimator, it is necessary to estimate the number of factors in the error term, ui. The frequencies of the number of factors estimated by the ER and GR methods proposed by Ahn and Horenstein (Citation2013) are reported in ; see Section 4.1 for the estimation procedure. Note that, as summarized in , r is two in the models with homogeneous slopes (Design 1) and three in the models with random coefficients (Design 2). The results in suggest that ER and GR estimate r precisely enough for all designs, though the ER estimator is slightly more accurate than the GR, especially when T is not large. Based on this result, the experimental results reported hereafter are based on the r estimated by the ER method.

Table 2 Frequency of the estimated number of factors in the error term by the ER and GR methods.

Given the estimated number of factors, we can proceed to the estimation of β0 and associated inferences based on the robust PHAC standard errors. In this experiment, we consider three different estimators of β0: the IPC estimator without bias correction (IPC), the analytically bias-corrected IPC estimator (BC-IPC), and the jackknife bias-corrected IPC estimator (JK-IPC). summarizes the performance of these IPC estimators for Designs 1–3. The “ER” signifies the dependence of the estimators on the ER estimation of r. In the table, the bias (Bias), standard deviation (SD), root mean square error (RMSE) of the estimates for β10, and the size (in percent) of the robust Wald test for H0:β10=1 (Size), are reported. Bias, SD, and RMSE are all multiplied by 100. The results for β20 are not reported because they are qualitatively very similar.

Table 3 Finite sample properties of the IPC estimators of β10 and the associated robust Wald test with interactive effects, heteroscedastic and serially correlated idiosyncratic errors.

First, we discuss the results for Design 1 (the model with homogeneous slopes). As can be seen in the table, the IPC estimator (without bias-correction) suffers from a large bias. As a result, the size of the robust Wald test deviates significantly from the nominal level. In contrast, the BC-IPC estimator with the analytical bias correction and the JK-IPC estimator with the jackknife bias correction successfully reduce the bias. The BC-IPC estimator and the JK-IPC estimator perform comparably. However, when the sample size is small (e.g., T = 25 or N = 50), the jackknife method can reduce the bias more effectively than the analytical bias correction. Meanwhile, the jackknife bias correction is likely to lead to greater sampling variability than the analytical bias correction. The standard deviation of the JK-IPC estimator is uniformly larger than the standard deviation of the BC-IPC estimator. As a result, the BC-IPC estimator uniformly outperforms the JK-IPC estimator in terms of the mean squared error criterion.Footnote10 Reflecting this small sample bias, there is a mild size distortion in the robust Wald test for small T or small N, which tends to disappear as N and T increase.

The results for Design 2 (random coefficient models) in are qualitatively similar to those for Design 1. However, reflecting the fact that the IPC estimators are less efficient for random coefficient models, there is a moderate small sample bias in the BC-IPC and JK-IPC estimators, which dissipates as N increases. Due to this bias, there is a moderate size distortion in the robust Wald test for small N, but it quickly decreases as the sample size increases.

Next, let us look at the results for Design 3, in which the heterogeneity of the slopes depends on the regressors. The bias of the IPC, BC-IPC, and JK-IPC estimators is large and persistent for any sample size. As a consequence, the size distortions of the robust Wald tests are large and increase as the sample size rises. This implies the importance of the LM test for correlated random coefficients in our robust approach.

reports the size and power of the LM test for the dependence of the random coefficients on the regressors. We note that in our alternative model (Design 3), the slope heterogeneity contains a fourth-degree polynomial of the defactored regressors (i.e., ψli(1,2,3,4)), which is highly nonlinear. In this experiment, we consider LMCRC(1) and LMCRC(2) tests, which are defined in Equationequation (20). Recall that we recommend the test using the LMCRC(2) statistic, because it is expected to have nontrivial power against a sufficiently broad class of alternative models, while the LMCRC(1) test may not be sufficiently powerful against certain alternatives (see discussions in Section 3). The results reported in show that the size of the LMCRC(1) and LMCRC(2) tests is very close to the nominal level for Design 1 (the model with homogeneous slopes) and Design 2 (random coefficient models). In the results for Design 3 (correlated random coefficients), we find that the power of the LMCRC(2) test is high and rises as N and T increase, while the LMCRC(1) test is much less powerful. in the supplementary materials reports the power of the LMCRC(1) and LMCRC(2) tests for Designs 4–7. Therein it is found that the LMCRC(2) test remains as powerful across Designs 4–7 as in Design 3, whilst the LMCRC(1) test is substantially less powerful in Design 5 (ψli(2)) and Design 7 (ψli(4)). These findings are in line with the results of our asymptotic local power analysis and they support our claim that the LMCRC(2) test is powerful against a broad class of alternative models of correlated coefficients.

Table 4 Size and power of the LMCRC(1) and LMCRC(2) tests of correlated random coefficient.

5 An Empirical Application: Feldstein–Horioka Puzzle

In this section, we present an empirical illustration of the proposed method. Specifically, we analyze the Feldstein–Horioka (F–H) puzzle, which has attracted recent attention in the literature.

Feldstein and Horioka (Citation1980) found that in OECD countries, the long-run average of the national savings rate is significantly correlated with the long-run average of the domestic investment rate. This empirical finding seems to conflict with macroeconomic theory, which predicts that the determinants of saving and investment are different and that, assuming perfect international capital mobility, the investment decisions in a country should not be constrained by domestic saving. This is known as the F–H puzzle, and is included as one of the six major puzzles in international macroeconomics by Obstfeld and Rogoff (Citation2000).

Several econometric methods have been applied to investigate the F–H puzzle, including instrumental variable methods, cointegration regressions and the standard fixed effect models.Footnote11 Recently, Giannone and Lenza (Citation2010) and Ginama, Hayakawa, and Kanemi (Citation2018) analyzed the F–H puzzle by considering a panel regression model with interactive effects.

In this section, we apply our approach to analyze the F–H puzzle by using the OECD panel dataset covering 1968–1996 with N = 24 and T=29, as used in Ginama, Hayakawa, and Kanemi (Citation2018). Specifically, we estimate the following model:(IitYit)=βi(SitYit)+λi0ft0+εit,where Iit , Sit , and Yit denote investment, gross savings, and GDP for country i and year t, respectively. This specification is more general than considered in the literature in that the coefficient is allowed to be heterogeneous and interactive effects are included.

We now describe how we proceed when using our approach. Our interest lies in estimating the population mean of βi . First, we need to determine the number of factors in the error term for the model in which the slope heterogeneity (if any) is ignored. Here, we follow the procedure described in Section 4.1. Having estimated the number of factors in the error term, we conduct the LM test for correlated random coefficients, which is proposed in Section 3. If the null hypothesis is not rejected, we proceed to statistical inference using the robust PHAC standard errors; otherwise we need to consider an alternative approach, as described at the end of Section 3.

provides the estimation results. As a comparison, we computed a jackknife bias-corrected CCEP estimator proposed by Westerlund (Citation2018) and the associated nonparametric standard error of (Pesaran Citation2006, eq. (69)) for random coefficient models.Footnote12 With regard to the number of factors, both r̂GR and r̂ER suggest that the number of factors is one. The LM test for correlated random coefficients, LMCRC(2), does not reject the null hypothesis of random coefficients or homogeneous slopes, which validates our robust approach. Note that the PHAC standard error is asymptotically valid for the models with homogeneous slopes or random coefficients. Meanwhile, the nonparametric standard error used for the CCEP estimator is developed primarily for the models with random coefficients, and a different standard error may have to be employed for the models with homogeneous slopes; see Theorem 4 in Pesaran (Citation2006). Nevertheless, the BC-IPC and JK-CCEP estimation results show that the savings-GDP ratio is highly significant in the regression of the investment-GDP ratio, indicating the existence of the F–H puzzle.

Table 5 Estimation results for the Feldstein–Horioka puzzle based on OECD data covering 1968–1996, (N = 24, T = 29).

6 Concluding Remarks

In this article, we have proposed a robust approach against heteroscedasticity, error serial correlation and slope heterogeneity in linear models with interactive effects for large panel data. First, consistency and asymptotic normality of the pooled iterated principal component (IPC) estimator for the models with random coefficients and homogeneous slopes have been established. Then, we have proved the asymptotic validity of the associated Wald test for slope parameter restrictions based on the panel heteroscedasticity and autocorrelation consistent (PHAC) variance matrix estimator for the models with random coefficients and homogeneous slopes, which does not require the Newey-West type time-series parameter truncation. These results asymptotically justify the use of the same pooled IPC estimator and the PHAC standard error for both homogeneous-slope and heterogeneous-slope models. This robust approach can significantly reduce the model selection uncertainty for applied researchers. In addition, we have proposed a Lagrange Multiplier (LM) test for correlated random coefficients with covariates. This test has nontrivial power against correlated random coefficients, but not for random coefficients and homogeneous slopes. The LM test is important because the IPC estimator becomes inconsistent with correlated random coefficients. The finite sample evidence and an empirical application support the usefulness and reliability of our robust approach.

We have examined finite sample performance of the estimators, the tests of linear restrictions on the slope parameters, and the LM tests for correlated random coefficients. We have examined the finite sample performance of the estimators, the tests of parameter restrictions using the PHAC variance estimator, and the LM test for correlated random coefficients. The results show that the size of the proposed robust Wald test with the bias-corrected IPC estimator is sufficiently close to the nominal level in both slope homogeneity and slope heterogeneity, and that the LM test for correlated random coefficients has correct size under both slope homogeneity and random coefficients while exhibiting high power for correlated random coefficients. The finite sample evidence, together with the empirical application for the Feldstein–Horioka puzzle, support the reliability and the usefulness of the proposed robust approach.

It is well recognized that bootstrapping in general can provide more accurate inference; see, for example, Hall (Citation1992). Recently, there has been a rapid growth in the study of bootstrap methods for factor models; see Gonçalves and Perron (Citation2014, Citation2020), among others. This is an interesting research direction, as applying bootstrap methods to panel data models with interactive effects can lead to more precise inferences.

The proposed approach investigated for the IPC estimator is most likely to be applicable for pooled estimators in other approaches, including the Common Correlated Effects (CCE) estimator of Pesaran (Citation2006), the principal component (PC) estimator investigated by Westerlund and Urbain (Citation2015), and the two-step instrumental variable (2SIV) estimator proposed by Cui et al. (Citation2021), among others. In particular, for these estimators the factors in the regressors and in the error term are extracted, whilst the IPC estimator only exploits the information of the factors in the error term. Studying such applications seems to be an interesting extension.

As emphasized in the article, when the null hypothesis of random coefficient models is rejected in favor of the alternatives, it is preferable to employ estimators which are consistent for the models with correlated random coefficients. In this situation, the mean group estimators based on the CCE, 2SIV and ML approaches proposed by Pesaran (Citation2006), Cui et al. (Citation2021), and Li, Cui, and Lu (Citation2020), among others, seem to be possible choices. Investigating inferential methods for such estimators under correlated random coefficients seems to be an intriguing future research theme.

Supplemental material

Supplemental Material

Download Zip (375.1 KB)

Acknowledgments

The authors thank Christian Hansen (the editor), the associate editor, and two anonymous referees for their valuable comments and suggestions. We are grateful to Milda Norkutė, Joakim Westerlund and the participants of the 2019 Asia Meeting of the Econometric Society for their helpful comments on the earlier version of this article.

Supplementary Materials

The supplementary material consists of online appendices that discuss the experimental results with proofs of key results and additional discussion, and computational codes that replicate the experimental and empirical results.

Additional information

Funding

This work was supported by JSPS KAKENHI (grant numbers 20H01484, 20H05631, 20K20760, 21H00700, and 21H04397).

Notes

1 In the fixed T panel data analysis, the PHAC estimator is proposed by Arellano (1987) and widely used in the related literature.

2 In this article, we propose to use the PHAC estimator for the N-consistent estimator, which is different from Pesaran (Citation2006) and Reese and Westerlund (Citation2018). See Section 2.

3 One important exception is Ando and Bai (2017), in which shrinkage estimators for sparse heterogeneous slopes in the models with interactive effects are considered.

4 In this article, we do not consider the model with the group-wise homogeneous slope model, which is considered by Su and Ju (2018).

5 See Galvao and Kato (Citation2014) for a similar discussion in misspecified dynamic panel data models.

6 We do not consider cross-sectional and/or time-series structural breaks in βi which is beyond the scope of this article.

7 For testing H0:θp=0 for p=1,,, a nonparametric regression approach (e.g., sieve or spline regressions) could be employed.

8 Alternatively, we may use the criteria proposed by Bai and Ng (2002).

9 We also considered the case in which ϖlit is generated from normal distribution. The results are not reported because they were qualitatively very similar.

10 A similar superior performance of analytical bias-correction over jackknife bias-correction is found in Fernández and Weidner (Citation2016).

11 See sec. 1 of Ginama, Hayakawa, and Kanemi (Citation2018) for a brief literature survey.

12 For the IPC estimation, the data are within-transformed and cross-sectionally demeaned, while for the CCEP estimation the data are within transformed only.

References

  • Ahn, S., and Horenstein, A. (2013), “Eigenvalue Ratio Test for the Number of Factors,” Econometrica, 81, 1203–1227.
  • Ando, T., and Bai, J. (2017), “Clustering Huge Number of Financial Time Series: A Panel Data Approach with High-Dimensional Predictors and Factor Structures,” Journal of the American Statistical Association, 112, 1182–1198. DOI: 10.1080/01621459.2016.1195743.
  • Arellano, M. (1987), “Computing Robust Standard Errors for Within Group Estimators,” Oxford Bulletin of Economics and Statistics, 49, 431–434. DOI: 10.1111/j.1468-0084.1987.mp49004006.x.
  • Bai, J. (2009), “Panel Data Models with Interactive Fixed Effects,” Econometrica, 77, 1229–1279.
  • Bai, J., and Li, K., (2014), “Theory and Methods of Panel Data Models with Interactive Effects,” Annals of Statistics, 42, 142–170.
  • Bai, J., and Liao, Y. (2017), “Inferences in Panel Data with Interactive Effects Using Large Covariance Matrices,” Journal of Econometrics, 200, 59–78. DOI: 10.1016/j.jeconom.2017.05.014.
  • Bai, J., and Ng, S. (2002), “Determining the Number of Factors in Approximate Factor Models,” Econometrica, 70, 191–221. DOI: 10.1111/1468-0262.00273.
  • Bonhomme, S., and Manresa, E. (2015), “Grouped Patterns of Heterogeneity in Panel Data,” Econometrica, 83, 1147–1184. DOI: 10.3982/ECTA11319.
  • Chamberlain, G. (1980), “Analysis of Covariance with Qualitative Data,” Review of Economic Studies, 47, 225–238. DOI: 10.2307/2297110.
  • Cui, G., Norkute, M., Sarafidis, V., and Yamagata, T. (2021), “Two-Stage Instrumental Variable Estimation of Linear Panel Data Models with Interactive Effects,” Econometrics Journal, 25, 340–361. DOI: 10.1093/ectj/utab029.
  • Dhaene, G., and Jochmans, K. (2015), “Split-Panel Jackknife Estimation of Fixed-Effect Models,” Review of Economic Studies, 82, 991–1030. DOI: 10.1093/restud/rdv007.
  • Feldstein, M. and Horioka, C. Y. (1980), “Domestic Saving and International Capital Flows,” Economic Journal, 90, 314–329. DOI: 10.2307/2231790.
  • Fernández, V., and Weidner, M. (2016), “Individual and Time Effects in Nonlinear Panel Models with Large N, T,” Journal of Econometrics, 192, 291–312. DOI: 10.1016/j.jeconom.2015.12.014.
  • Galvao, A. F., and Kato, K. (2014), “Estimation and Inference for Linear Panel Data Models Under Misspecification When both n and T are Large,” Journal of Business and Economic Statistics, 32, 285–309. DOI: 10.1080/07350015.2013.875473.
  • Giannone, D., and Lenza, M. (2010), “The Feldstein–Horioka Fact,” in NBER International Seminar on Macroeconomics (Vol. 6), eds. R. Clarida and F. Giavazzi, pp. 103–117, Chicago, IL: University of Chicago Press.
  • Ginama, I., Hayakawa, K., and Kanemi, T. (2018), “Examining the Feldstein–Horioka Puzzle Using Common Factor Panels and Interval Estimation,” Japan and the World Economy, 48, 11–21. DOI: 10.1016/j.japwor.2018.06.001.
  • Gonçalves, S., and Perron, B. (2014), “Bootstrapping Factor-Augmented Regression Models,” Journal of Econometrics, 182, 156–173. DOI: 10.1016/j.jeconom.2014.04.015.
  • Gonçalves, S., and Perron, B. (2020), “Bootstrapping Factor Models with Cross Sectional Dependence,” Journal of Econometrics, 218, 476–495.
  • Greenaway-McGrevy, R., Han, C., and Sul, D. (2012), “Asymptotic Distribution of Factor Augmented Estimators for Panel Regression,” Journal of Econometrics, 169, 48–45. DOI: 10.1016/j.jeconom.2012.01.003.
  • Hall, P. (1992), The Bootstrap and Edgeworth Expansion, New York: Springer.
  • Hansen, C. (2007), “Asymptotic Properties of a Robust Variance Matrix Estimator for Panel Data When t is Large,” Journal of Econometrics, 141, 597–620. DOI: 10.1016/j.jeconom.2006.10.009.
  • Li, K., Cui, G., and Lu, L. (2020), “Efficient Estimation of Heterogeneous Coefficients in Panel Data Models with Common Shocks,” Journal of Econometrics, 216, 327–353. DOI: 10.1016/j.jeconom.2019.08.011.
  • Moon, H. R., and Weidner, M. (2015), “Linear Regression for Panel with Unknown Number of Factors as Interactive Fixed Effects,” Econometrica, 83, 1543–1579. DOI: 10.3982/ECTA9382.
  • Mundlak, Y. (1978), “On the Pooling of Time Series and Cross Section Data,” Econometrica, 46, 69–85. DOI: 10.2307/1913646.
  • Newey, W. K., and West, K. D. (1987), “A Simple Positive Semi-Definite, Heteroskedasticity and Autocorrelation Consistent Covariance Matrix,” Econometrica, 55, 703–708. DOI: 10.2307/1913610.
  • Norkutė, M., Sarafidis, V., Yamagata, T., and Cui, G. (2021), “Instrumental Variable Estimation of Dynamic Linear Panel Data Models with Defactored Regressors and a Multifactor Error Structure,” Journal of Econometrics, 220, 416–446. DOI: 10.1016/j.jeconom.2020.04.008.
  • Obstfeld, M., and Rogoff, K. (2000), “The Six Major Puzzles in International Macroeconomics: Is There a Common Cause?,” NBER Macroeconomics Annual, 15, 339–390. DOI: 10.1086/654423.
  • Pesaran, M. H. (2006), “Estimation and Inference in Large Heterogeneous Panels with a Multifactor Error Structure,” Econometrica, 74, 967–1012. DOI: 10.1111/j.1468-0262.2006.00692.x.
  • Pesaran, M. H., and Yamagata, T. (2008), “Testing Slope Homogeneity in Large Panels,” Journal of Econometrics, 142, 50–93. DOI: 10.1016/j.jeconom.2007.05.010.
  • Reese, S., and Westerlund, J. (2018), “Estimation of Factor-Augmented Panel Regressions with Weakly Influential Factors,” Econometric Reviews, 37, 401–465. DOI: 10.1080/07474938.2015.1106758.
  • Su, L. and Chen, Q. (2013), “Testing Homogeneity in Panel Data Models with Interactive Fixed Effects,” Econometric Theory, 29, 1079–1135. DOI: 10.1017/S0266466613000017.
  • Su, L., and Ju, G. (2018), “Identifying Latent Grouped Patterns in Panel Data Models with Interactive Fixed Effects,” Journal of Econometrics, 206, 554–573. DOI: 10.1016/j.jeconom.2018.06.014.
  • Vogelsang, T. J. (2012), “Heteroskedasticity, Autocorrelation, and Spatial Correlation Robust Inference in Linear Panel Models with Fixed-Effects,” Journal of Econometrics, 166, 303–319. DOI: 10.1016/j.jeconom.2011.10.001.
  • Westerlund, J. (2018), “CCE in Panels with General Unknown Factors,” Econometrics Journal, 21, 264–276. DOI: 10.1111/ectj.12110.
  • Westerlund, J., and Urbain, J. P. (2015), “Cross-Sectional Averages versus Principal Components,” Journal of Econometrics, 185, 372–377. DOI: 10.1016/j.jeconom.2014.09.014.
  • Wooldridge, J. (2010), Econometric Analysis of Cross Section and Panel Data, Cambridge, MA: MIT Press.