297
Views
0
CrossRef citations to date
0
Altmetric
Research Article

Nonlinear Spatial Dynamic Panel Data Models with Endogenous Dominant Units: An Application to Share Data

ORCID Icon, &
Published online: 12 Apr 2024
 

Abstract

This article develops a nonlinear spatial dynamic panel data model with one particularly interesting application to a structural interaction model for share data. To account for effects from dominant (popular) units, the spatial weights matrix in our model can allow for unbounded column sums. To account for heterogeneity, our model includes two-way fixed effects and heteroscedastic errors. We further consider the potential endogeneity of the spatial weight matrix constructed from socioeconomic distance. We investigate the quasi-maximum likelihood estimator (QMLE), generalized methods of moments estimator (GMME), and root estimator (RTE), and establish their consistency and asymptotic normality based on the near epoch dependence (NED) framework. The RTE can derive a relatively computationally simple and closed-form solution without evaluating the QMLE’s Jacobian matrix as well as the iterations by GMME. We consider both n,T, and the strength of the dominant units is equal to 1 when T. For the purpose of empirical analysis, we derive the marginal effects and their limiting distributions based on the proposed estimators. In an empirical application, we apply our model to China’s prefecture city-level data, revealing significant spillover effects of the tertiary industry share. These findings suggest that the development of the tertiary sector in large cities can foster its growth in small cities.

Acknowledgments

The authors thank to the editor, associate editor, and three referees for their constructive comments, and Yahui Chen and Xingbai Xu for their helpful discussions.

Disclosure Statement

The authors report there are no competing interests to declare.

Notes

1 The model can also handle the continuous data within a specific open interval, such as (1,1). Section S5.2 of the supplement gives more discussions.

2 Since the IV estimation is a special case of the GMM estimation, related discussions are in Section S2.1 of the supplement.

3 The RTE of the SAR model is discussed in Jin and Lee (Citation2012) and Jin and Lee (2020). For the dynamic panel data model, the RTE is discussed in Jin, Lee, and Yu (Citation2021).

4 As discussed in Footnote 5 in Qu, Lee, and Yu (Citation2017), the spatial lag term MntZnt could be specified in the Ynt or Znt equations as long as Mnt is exogenous. But we cannot allow for endogenous Mnt . If Mnt is the endogenous Wnt , (2.1) and (2.3) become a highly nonlinear system and we have possible multiple solutions of Ynt that is generated by the system. Additionally, there is no research on Znt being a nonlinear VAR process with spatial dependence currently. One closest study is a linear spatial VAR (SpVAR) model with time-invariant exogenous weights proposed by Yang and Lee (Citation2021). In summary, the study of a nonlinear (SpVAR) model with time-varying exogenous/endogenous matrices is sufficient for another new study and is worth considering in the future. Due to these complicated issues, we do not cover the exogenous/endogenous weights matrix in (2.3).

5 In the simulation part, we illustrate the results of directly estimating the model without considering the “endogeneity” of spatial weights, especially for estimating λ0 and ρ0.

6 Some studies treat FEs as estimators, especially in the discrete nonlinear panel data model. For example, Fernández-Val and Vella (Citation2011) and Fernández-Val and Lee (Citation2013), while our approach is to eliminate FEs. Unlike the linear model of Lee and Yu (Citation2010a), the model in this article is nonlinear and does not exist in a reduced form, and hence the transformation matrices Fn,n1 and FT,T1 mentioned in Lee and Yu (Citation2010a) cannot be applied. Therefore, we use the direct approach to consistently estimate the two-way FEs and concentrate them out from the log-likelihood function, which requires both n and T to be large. Details are in Footnote 8. When T is finite, the GMME based on moments after eliminating the two-way FEs can estimate parameters consistently when 0δ<1. However, when δ = 1 and T is finite, the probability of the related statistics converges to their expectation is O(1/T), which will not satisfy the ST-NED condition, as O(1/T)0 is required. To satisfy the ST-NED condition and 0δ1, we need both n and T are large. Additionally, we may treat time FEs as additional regressors under finite T. From Table S25 of the supplement, we find that there is no significant difference between the two approaches. However, as T increases, especially when T = 20, the estimators obtained from the time dummy approach may have larger root mean squared errors. Due to the fact that T = 18 in our empirical application, compared with the time dummy approach, the method of eliminating FEs may obtain more robust estimates.

7 Thus, Yn1=F(λ0Wn1Yn1+γ0Yn0+ρ0Wn0Yn0+X1n0β0+cn0+α10ln+Vn0) which can be correlated with two-way FEs and weighting matrix. The endogeneity is modeled in Assumption 1(i).

8 Since L=diagt=1T(nt) and the derivatives of ZL do not include any parameters, we exclude them. From the first-order condition of (3.2), we have ĉn=1Tt=1Tsnt(θ̂qml)snt(θ̂qml), α̂t=1nln((snt(θ̂qml)1Th=1Tsnh(θ̂qml))), ĉ2n=1Tt=1Ts2nt(θ̂qml)s2nt(θ̂qml) and α̂2t=1nln((s2nt(θ̂qml)1Th=1Ts2nh(θ̂qml))), where snt(θ)=YntTntκ and s2nt(θ)=ZntKntϰ. Therefore, the consistent estimation of FEs based on the likelihood function requires large n and large T. Plugging the estimated FEs into (3.2), we have (3.3).

9 To derive κ̂rt, we need to evaluate Gnt(λ) based on an initial consistent estimator λ˜, which is also discussed in Jin and Lee (2020).

10 Note that κ̂rt is affected by the estimated ϵ˜L=ϵL(τ˜), which is evaluated by the initial consistent estimator τ˜. The C(α)-type moment proposed by Jin and Lee (2020) and Jin and Lee (2021) can eliminate the asymptotic impact on κ̂rt, which is currently studied on another working paper by Chen et al. (Citation2024).

11 Assumption 6(ii) assumes that the regressors in deviation form from cross section and time means are linearly independent so that there is no issue of multicollinearity. Of course, implicitly all the exogenous variables cannot be either time or cross-sectional invariant variables. If any of them is invariant in any dimension, the corresponding individual and/or time FEs would absorb them. Except for the invariant cases, we note that we allow variables in X1nt and X2nt to overlap so that there could be common variables. Theoretically and empirically, X1nt and X2nt could even be the same as identification is not an issue here. This is so, because the parameters in the equations Znt can be identified and consistently estimated solely from (2.3), and X1nt and the control variable is not linearly dependent, even though X1nt and X2nt might be so.

12 From Kuersteiner and Prucha (Citation2020) and Jeong and Lee (Citation2021), we can treat (ξl,ϵl) as the part of vl such that ξl is the strict exogenous part while ϵl is the endogenous part.

13 Metropolitan areas generally consist of a densely populated urban core and its surrounding territories, sharing industry, infrastructure, and labor markets (Mills Citation1967).

14 In 2019, the added value of the tertiary industry accounted for 53.9% of GDP.

15 The classification is based on the “Notice on Adjusting the Criteria for the Classification of Urban Sizes” issued by the State Council of China in November 2014, using the urban permanent population as the statistical caliber.

16 An example is Wuhan in Hubei province affects Xinyang in Henan province, their distance is about 208km.

17 To further check the reasonableness of the specification of weight matrix, we assume that the unidirectional impact of these four types of cities is D →C →B →A and repeat the estimation. We find that λ is insignificant in all specifications, which verifies the weight setting before, that is, only large cities have spillover effects on small cities. Results are shown in Tables S32–S33 of the supplement.

18 Following Lee, Yang, and Yu (Citation2022), νm(t) can be estimated by ν̂m(t)=lnm(t)lnn. From Table 1, we find that δ̂=0.802,ν̂m=0.245 and δ̂g=0.905. The estimation procedure of δ̂g is in Claim S1 of the supplement. We also assume that types A and B are both dominant units, the results are in Table S28 of the supplement.

Additional information

Funding

Xi Qu gratefully acknowledges the National Natural Science Foundation of China (No. 72222007, No. 71973097, and No. 72031006).

Log in via your institution

Log in to Taylor & Francis Online

PDF download + Online access

  • 48 hours access to article PDF & online version
  • Article PDF can be downloaded
  • Article PDF can be printed
USD 61.00 Add to cart

Issue Purchase

  • 30 days online access to complete issue
  • Article PDFs can be downloaded
  • Article PDFs can be printed
USD 123.00 Add to cart

* Local tax will be added as applicable

Related Research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.