Sequential and efficient GMM estimation of dynamic short panel data models: Econometric Reviews: Vol 40, No 10

Abstract

This paper considers generalized method of moments (GMM) and sequential GMM (SGMM) estimation of dynamic short panel data models. The efficient GMM, motivated by quasi maximum likelihood (QML), can avoid the use of many instrumental variables (IVs) in estimation. It can be as asymptotically efficient as the maximum likelihood estimator (MLE) when disturbances are normal, and can be more efficient than the QML estimator when disturbances are not normal. The SGMM, which also incorporates many IVs, generalizes the minimum distance estimation originated in Hsiao et al. By focusing on the estimation of the parameters of interest, the SGMM reduces the computational burden caused by nuisance parameters such as the variances of disturbances. It is asymptotically as efficient as the corresponding GMM. In particular, the SGMM based on QML scores can generate a closed-form root estimator for the dynamic parameter, which is asymptotically as efficient as the QML estimator. Nuisance parameters can also be estimated efficiently by an additional SGMM step if they are of interest.

Acknowledgements

We are grateful to the Editor Esfandiar Maasoumi, Co-Editor Tong Li, and two anonymous referees for their valuable and helpful comments.

Notes

1 In the following, the ML or QML estimates for fixed effects DPD models all refer to those based on first differenced equations of the dependent variable.

2 Here, “stationarity” refers to the situation in which the process started a long time ago.

3 For stationary random effects DPD models, the quasi log likelihood function can be decomposed as a sum of the quasi log likelihood function of within equations and that of between equations (Lee and Yu, 2020). We use the decomposition to derive simple moment vectors, which can yield GMM estimators that are asymptotically as efficient as ML estimators under normal disturbances, but can be more efficient relative to QML estimators.

4 On the other hand, if $tr (F_{T}^{'} A_{T}^{'} H_{T} F_{T}^{'} A_{T}^{'} H_{T})$ is not equal to zero, $Var (Q_{n T}^{'} e_{n T})$ would not necessarily be equal to $σ_{v 0}^{2} E [Q_{n T}^{'} (H_{T} \otimes I_{n}) Q_{n T}] .$

5 If the disturbances are not normal, from the proof of Theorem 3(iii), the asymptotic variance of the IV estimate ${\hat{θ}}_{1, i v}$ is equal to that of an optimal GMM estimator. We show that there is no best GMM under non-normal disturbances in the supplementary file, so there is no best IV under non-normal disturbances.

6 Initial consistent parameter estimates for various models considered in this paper are given in the supplementary file.

7 As in Hsiao et al. (2002), when the process ${y_{i t}}$ starts from a finite past, γ₀ can be 1. We thank an anonymous referee for pointing this out.

8 When T goes to infinity, the second component is dominated by the first one, so that the asymptotic precision of the MLE is asymptotically equal to that of the best IV estimate. The best IV estimation is possible by ignoring the first row of $H_{T} (ω)$ or simply replacing it by $H_{T} (2)$ with 2 replacing $ω .$ The approximation or replacement will be good when T becomes large.

9 Recall that $Δ Y_{n T} = {[Δ Y_{n 1}^{'}, \dots, Δ Y_{n T}^{'}]}^{'}$ and $Δ Y_{n, T - 1} = {[0, Δ Y_{n 1}^{'}, \dots, Δ Y_{n, T - 1}^{'}]}^{'} .$
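
The stacking in this note can be sketched in a few lines of Python. This is only an illustration: the function name `stack_differences` and the list-of-lists layout (one inner list of n first differences per period) are assumptions made here, not notation from the paper.

```python
def stack_differences(dy):
    """Stack first differences by period.

    dy[t] is the length-n list for period t + 1 (hypothetical layout).
    Returns (current, lagged): `current` stacks all T blocks, and `lagged`
    stacks a zero block followed by the first T - 1 blocks, so the two
    vectors line up period by period.
    """
    n = len(dy[0])
    current = [value for block in dy for value in block]
    lagged = [0.0] * n + [value for block in dy[:-1] for value in block]
    return current, lagged


# Example with n = 2 units and T = 3 periods.
current, lagged = stack_differences([[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]])
```

The zero block at the top of the lagged vector reflects that no lagged difference enters the first-period equation.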

10 We note that, in contrast to the later sequential GMM estimation, these moments are quadratic in $e_{n T}$ but not quadratic in the parameter γ, because $B_{T} (γ)$ is nonlinear in γ.

11 See the supplementary file for a proof.

12 We thank a referee for raising this issue.

13 We may show that ${(B_{ω T} B_{T}^{- 1})}^{s} = - B_{T} J_{T} B_{T}^{'} .$ See the proof of Theorem 3 in the supplementary file.

14 If the disturbances are not normally distributed, we show in the supplementary file that the limiting variance of the GMM estimator ${\hat{θ}}_{2, gmm}$ has a lower bound by the generalized Schwarz inequality, but the lower bound cannot be achieved. The reason is that $D_{T, T + 1} B_{T}^{'} C_{j T}^{s} B_{T} D_{T, T + 1}$ needs to be a diagonal matrix for some $C_{j T}^{s},$ but this cannot be the case given the specific form of $D_{T, T + 1} .$

15 If the number of best moments is just identified, and the score vector and the best moment vector are linear transformations of each other, then their estimators would be the same. In this exactly identified moments case, the best GMM estimator would not have an asymptotic gain.
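
The first claim in this note can be checked in one line. As an illustrative sketch, write $s_{n} (θ)$ for the score vector and $g_{n}^{*} (θ)$ for the best moment vector, both of the same dimension as θ, and assume they are related by a nonsingular matrix $A$ (these symbols are introduced here for illustration only):

$$g_{n}^{*} (θ) = A \, s_{n} (θ), \quad \det A \neq 0 \quad \Longrightarrow \quad g_{n}^{*} (\hat{θ}) = 0 \iff s_{n} (\hat{θ}) = 0 .$$

In the just identified case the GMM estimator sets the moment vector to zero whatever the weighting matrix, so it solves the same equations as the score-based estimator and the two estimators coincide.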

16 The explicit expression of $E (\frac{1}{n} \frac{\partial ln L_{w} (θ_{0})}{\partial θ} \frac{\partial ln L_{w} (θ_{0})}{\partial θ'})$ can be derived in the same way as that of the variance matrix of $\sqrt{n} g_{n T} (θ_{20}),$ so we omit it for simplicity. We can see that it does not depend on n.

17 Instead of moments based on the score vector, one may use the best moments in (2.25), $g_{n T}^{*} (θ_{2}) = (\begin{matrix} ι_{n T}^{'} (H_{T}^{- 1} \otimes I_{n}) e_{n T} (θ_{1}) \\ ι_{n T}^{'} (F_{T}^{'} H_{T}^{- 1} \otimes I_{n}) e_{n T} (θ_{1}) \\ e_{n T}^{'} (θ_{1}) (B_{T}^{'} (ω) C_{1 T}^{*} B_{T} (ω) \otimes I_{n}) e_{n T} (θ_{1}) \\ e_{n T}^{'} (θ_{1}) (B_{T}^{'} (ω) C_{2 T}^{*} B_{T} (ω) \otimes I_{n}) e_{n T} (θ_{1}) \end{matrix}),$ to obtain an SGMM estimate of γ. But because these moments over-identify γ, the corresponding SGMM estimator would not have a tractable explicit expression. Such an SGMM estimation approach will be considered in a subsequent section on models with exogenous regressors. The moments in (2.25) could be regarded as a special case of the estimation with $ι_{n T}$ as a regressor vector.

18 The estimate of κ for given γ and ω is ${[ι_{n T}^{'} (H_{T}^{- 1} (ω) \otimes I_{n}) ι_{n T}]}^{- 1} ι_{n T}^{'} (H_{T}^{- 1} (ω) \otimes I_{n}) (Δ Y_{n T} - γ Δ Y_{n, T - 1}) = \frac{1}{n} l_{n}^{'} Δ Y_{n 1} + \frac{1}{n} \sum_{t = 2}^{T} (1 - \frac{t - 1}{T}) l_{n}^{'} (Δ Y_{n t} - γ Δ Y_{n, t - 1}),$ which does not depend on ω.
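
The claim that this estimate does not depend on ω can be checked numerically. The sketch below is illustrative only and rests on two assumptions not stated in this note: that $H_{T} (ω)$ is tridiagonal with diagonal (ω, 2, ..., 2) and off-diagonal entries -1, and that $ι_{n T}$ equals $l_{n}$ in the first period block and zero elsewhere. All function names are hypothetical.

```python
import random


def h_matrix(T, omega):
    """Assumed H_T(omega): tridiagonal, diagonal (omega, 2, ..., 2), off-diagonals -1."""
    H = [[0.0] * T for _ in range(T)]
    for t in range(T):
        H[t][t] = omega if t == 0 else 2.0
        if t > 0:
            H[t][t - 1] = H[t - 1][t] = -1.0
    return H


def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    m = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for c in range(m):
        p = max(range(c, m), key=lambda r: abs(M[r][c]))
        M[c], M[p] = M[p], M[c]
        for r in range(c + 1, m):
            f = M[r][c] / M[c][c]
            for k in range(c, m + 1):
                M[r][k] -= f * M[c][k]
    x = [0.0] * m
    for r in range(m - 1, -1, -1):
        x[r] = (M[r][m] - sum(M[r][k] * x[k] for k in range(r + 1, m))) / M[r][r]
    return x


def period_means(dy, gamma):
    """Cross-sectional means of dY_t - gamma * dY_{t-1}, with a zero lag in period 1."""
    n = len(dy[0])
    means = []
    for t in range(len(dy)):
        lag = dy[t - 1] if t > 0 else [0.0] * n
        means.append(sum(y - gamma * l for y, l in zip(dy[t], lag)) / n)
    return means


def kappa_gls(dy, gamma, omega):
    """GLS form: only the first row of H^{-1} matters under the assumed iota_nT."""
    T = len(dy)
    w = solve(h_matrix(T, omega), [1.0] + [0.0] * (T - 1))  # first row of H^{-1}
    return sum(wt * m for wt, m in zip(w, period_means(dy, gamma))) / w[0]


def kappa_closed_form(dy, gamma):
    """Simplified form: period s gets weight 1 - (s - 1) / T, so period 1 gets weight 1."""
    T = len(dy)
    means = period_means(dy, gamma)
    return sum((1 - t / T) * means[t] for t in range(T))


random.seed(0)
dy = [[random.gauss(0.0, 1.0) for _ in range(50)] for _ in range(5)]  # T = 5, n = 50
gap = abs(kappa_gls(dy, 0.5, 1.5) - kappa_closed_form(dy, 0.5))
```

Under these assumptions the first row of $H_{T}^{- 1} (ω)$ is proportional to (T, T - 1, ..., 1), with a factor that depends on ω but cancels in the ratio, which is why the two forms should agree up to rounding error for any admissible ω.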

19 In our case, the concentration works on the solution of scores, so it is a method of elimination and substitution in solving a system of equations.

20 The SGMM in Jin and Lee (2021) is asymptotically equivalent to the approach in Trognon and Gouriéroux (1990) applied to the GMM, which is derived by a first order Taylor expansion of the moment vector at the nuisance parameter estimator.

21 Root estimators for spatial autoregressive models are considered in Jin and Lee (2012).

22 The consistent estimation of ${plim}_{n \to \infty} \frac{\partial g_{n T, γ} (γ_{0}, τ_{0})}{\partial τ'} {(\frac{\partial g_{n T, τ} (γ_{0}, τ_{0})}{\partial τ'})}^{- 1}$ by ${\hat{C}}_{n T, γ}$ would not have an asymptotic influence on the moment equation due to its role as coefficients for linear combinations of valid moments.

23 The probability limit of $s_{n T, 4}$ depends on ω₀, γ₀, T and $σ_{v 0}^{2},$ and it can be positive or negative.

24 This estimate can be further simplified to $\frac{1}{n} l_{n}^{'} Δ Y_{n 1} + \frac{1}{n} \sum_{t = 2}^{T} (1 - \frac{t - 1}{T}) l_{n}^{'} (Δ Y_{n t} - γ Δ Y_{n, t - 1}),$ which does not depend on ω. Using the form ${[ι_{n T}^{'} (H_{T}^{- 1} (ω) \otimes I_{n}) ι_{n T}]}^{- 1} ι_{n T}^{'} (H_{T}^{- 1} (ω) \otimes I_{n}) (Δ Y_{n T} - γ Δ Y_{n, T - 1})$ simplifies the presentation of the concentrated moments.

25 As in Section 2, we can also directly follow the approach in Jin and Lee (2021) to construct an SGMM estimator of γ using moment conditions derived from the QML first order conditions. On the other hand, we do not use $g_{n T} (δ, α_{1})$ to construct an SGMM estimator of only γ due to an identification issue. As shown below, by using the concentrated moments derived from the QML first order conditions, we can have closed-form roots of γ and investigate which root is consistent.

26 κ no longer exists, and the mean of $Δ Y_{n 1}$ is zero.

27 Detailed Monte Carlo results for other values of γ₀ (0.2, 0.8, 0.9) are presented in the supplementary file.

28 Owing to space limitations, we present the details of simulation results under non-normal disturbances for $γ_{0} = 0.2, γ_{0} = 0.5, γ_{0} = 0.8, γ_{0} = 0.9$ in the supplementary file.

Additional information

Funding

Fei Jin gratefully acknowledges the financial support from the National Natural Science Foundation of China (No. 71973030 and No. 71833004) and Program for Innovative Research Team of Shanghai University of Finance and Economics. Jihai Yu gratefully acknowledges the financial support from the National Natural Science Foundation of China (No. 71925006 and No. 92046021) and support from the Center for Statistical Science of Peking University and the Key Laboratory of Mathematical Economics and Quantitative Finance (Peking University), the Ministry of Education.
