667
Views
3
CrossRef citations to date
0
Altmetric
Articles

Applying a Population Flow–Based Spatial Weight Matrix in Spatial Econometric Models: Conceptual Framework and Application to COVID-19 Transmission Analysis

ORCID Icon, ORCID Icon & ORCID Icon
Pages 2266-2286 | Received 06 Jul 2021, Accepted 25 Feb 2022, Published online: 16 Jun 2022
 

Abstract

This article proposes a novel method for constructing an asymmetric spatial weight matrix and applies it to improve spatial econometric modeling. As opposed to traditional spatial weight matrices that simply consider geographic or economic proximity, the spatial weight matrix proposed in this study is based on large-volume daily population flow data. It can more accurately reflect the socioeconomic interactions between cities over any given period. To empirically test the validity and accuracy of this proposed spatial weight matrix, we apply it to a spatial econometric model that analyzes COVID-19 transmission in Mainland China. Specifically, this matrix is used to address spatial dependence in outcome and explanatory variables and to calculate the direct and indirect effects of all predictors. We also propose a practical framework that combines instrumental variable regressions and a Hausman test to validate the exogeneity of this matrix. The test result confirms its exogeneity; hence, it can produce consistent estimates in our spatial econometric models. Moreover, we find that spatial econometric models using our proposed population flow–based spatial weight matrix significantly outperform those using the traditional inverse distance weight matrix in terms of goodness of fit and model interpretation, thus providing more reliable results. Our methodology not only has implications for national epidemic control and prevention policies but can also be applied to a wide range of research to better address spatial autocorrelation issues. Key Words: COVID-19 transmission, endogeneity, population flow, spatial dependence (autocorrelation), spatial weight matrix.

本文提出了构建非对称空间权重矩阵的新方法,并用于改进空间计量经济建模。不同于传统的仅考虑地理或经济邻近性的空间权重矩阵,本文提出的空间权重矩阵以海量的日人口流量数据为基础,能更准确地反映任何时期城市之间的社会经济互动。为了验证本文的空间权重矩阵的有效性和准确性,我们将其应用于中国大陆COVID-19疾病传播分析的空间计量模型。具体的,该矩阵考虑了结果变量和解释变量的空间相关性,计算了各预测因子的直接和间接影响。我们还提出了一个应用框架,可以结合工具变量回归和Hausman检验,验证该矩阵的外生性。验证结果证实了矩阵的外生性,并能够在空间计量经济模型中提供稳定的估计。此外,我们发现,在拟合优度和模型解释方面,采用基于人口流动的空间权重矩阵的空间计量模型,明显优于基于传统的反距离权重矩阵的空间计量模型,能提供更可靠的结果。我们的方法不仅有益于国家流行病控制和预防政策,也可用于其它研究,能更好地解决空间自相关性的问题。

En este artículo se propone un método novedoso para construir una matriz asimétrica de pesos espaciales, la cual se aplica para mejorar la modelización econométrica espacial. A diferencia de las tradicionales matrices de ponderación espacial que apenas consideran la proximidad geográfica o económica, la matriz de ponderación espacial que se propone en el estudio se basa en datos de flujo de población diario de gran volumen. Esta puede reflejar con mayor precisión las interacciones socioeconómicas entre ciudades, durante un determinado período. Para poner a prueba empíricamente la validez y precisión de la matriz de pesos espaciales propuesta, la aplicamos a un modelo econométrico espacial que analiza la propagación del COVID-19 en la China continental. Específicamente, esta matriz se usa para abordar la dependencia espacial de las variables de resultado y las explicativas, y para calcular los efectos directos e indirectos de todos los predictores. Proponemos, además, una enmarcación práctica que combina regresiones de variables instrumentales y una prueba de Hausman para validar la exogeneidad de la matriz. El resultado de la prueba confirma su exogeneidad; entonces, puede producir estimaciones consistentes en nuestros modelos econométricos espaciales. Además, descubrimos que los modelos econométricos espaciales que usen nuestra matriz propuesta de pesos espaciales basada en flujo de población superan de modo significativo a los que usan la matriz convencional de ponderación de distancia inversa, en términos de buen acoplamiento e interpretación del modelo, generando así resultados de mayor confiabilidad. Nuestra metodología no solo tiene implicaciones para las políticas nacionales de control y prevención de epidemias, sino que también tiene aplicabilidad en una amplia gama de investigaciones con las que se aborden mejor los problemas de autocorrelación espacial.

Acknowledgment

This work was supported by the Natural Science Foundation of Guangdong Province [Grant Number: 2021A1515011250].

Notes

1 The moving-out indexes and moving-in indexes are based on the travel intensity between specific city pairs. For example, the moving-out index of Beijing to Tianjin refers to the volume of population flow traveling from Beijing to Tianjin. According to Baidu’s metadata, population flow from city i to city j is considered to be the move-out index for city i and the move-in index for city j.

2 The inverse distance weighted interpolation follows the equation: Corresponding Percentageij=1Distanceij2*(1001AAccurate Daily  Moving_Out Indexia)1V1Distanceiv2, where Distanceij is the Euclidean distance between city i and city j, A is the number of destination cities with accurate move-out indexes for origin city i, and V is the number of destination cities without accurate move-out indexes for origin city i.

3 According to Vega and Elhorst (Citation2013), there are three other types of spatial econometric models: (1) the SLX (spatial lag of X) model that includes spatial interactions of explanatory variables, (2) the SAC model that includes a spatially lagged dependent and a spatially correlated error term, and (3) the general nesting spatial model that includes all three types of spatial interaction effects.

4 During Period I, population flow between cities in China was not disrupted because it was only announced on 20 January that COVID-19 could be transmitted human to human and no further warnings on travel risks were announced until 23 January. Moreover, Baidu did not publicly release daily population flow data until 11 January. Therefore, in the models for Period I, the spatial weight matrix is built based on the population flow volume during Period I. In the models for Period II, we still use the population flow volume during Period I to account for the fourteen-day incubation period of the virus.

5 Note that the exogeneity of our population flow–based W depends on what outcome variable is used in the model. In our models, daily COVID-19 case number is the outcome variable. In other models using socioeconomic variables as outcome variables, the exogeneity of our population flow–based W might be affected.

6 According to DXY.cn, the COVID-19 infection data they published were reported by thirty-two provincial-level Health Commissions in China.

7 To convert the two indexes into the actual volume of person movements in and out of each city, we use the daily number of people traveling into and out of Hong Kong provided by the Hong Kong Immigration Department to calibrate and calculate the number of people to which each moving-in index and moving-out index unit corresponds. Using these data, we estimate that one index unit in the move-in index and move-out index corresponds to 71,121 person movements. This estimated converting factor is constant across all cities and is used to calculate the actual daily volume of population inflow and outflow of each city.

8 Kriging is a type of statistical technique for optimal spatial prediction that has been used widely in meteorological applications, agriculture, geosciences, and many other disciplines due to its minimized prediction error. Compared with classical kriging methods, the empirical Bayesian kriging is more robust because it accounts for the errors introduced by the estimation of the Semivariogram model (Krivoruchko and Gribov Citation2019).

9 After January 24, different levels of prevention and control measures (i.e., the shutdown of public transport and public places, the lockdown of residential buildings and neighborhoods, and the setup of checkpoints to control the population entering the city) were implemented by most of the cities, especially for cities in Hubei province and some cities with relatively more confirmed cases like Wenzhou, Hangzhou, and Harbin (Fang, Wang, and Yang Citation2020).

10 Based on our original population flow–based SWM (before normalization), we calculate the value of each element in symmetric population flow–based SWM as wij=12(Moving_Out Indexij+Moving_Out Indexji)=wji, where the wij is the spatial weight of unit i toward unit j, wji is the spatial weight of unit j toward unit i, Moving_Out Indexij is the index that reflects the volume of population traveling from city i to city j, and Moving_Out Indexji is the index that reflects the volume of population traveling from city j to city i.

11 The gravity model used in this article can be expressed as follows: wij= GDPiαGDPjβPopiφPopjωDistanceijτ, where the wij is the spatial weight of unit i toward unit j, GDPi (GDPj) is the GDP of city i (city j), Popi (Popj) is the total population of city i (city j), and Distanceij is the  geographic distance between city i and city j. We first select city pairs with accurate move-out indexes in our data set to estimate α, β, φ, ω, and τ based on the preceding equation. We then use the estimation results to impute move-out indexes for city pairs without accurate move-out indexes.

Additional information

Notes on contributors

Pengyu Zhu

PENGYU ZHU is an Associate Professor in the Division of Public Policy at the Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong SAR. E-mail: [email protected]. His research interests include big data and urban planning, sustainable transportation, economic development policy, housing and land use policy, and migration and employment.

Jiarong Li

JIARONG LI is a PhD student in the Urban Governance and Design Thrust at Hong Kong University of Science and Technology (Guangzhou), Nansha, Guangzhou, 511400, Guangdong, China. She is also a research assistant at the Center for Applied Social and Economic Research at Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong. E-mail: [email protected]. Her research interests include spatial data analysis, urban transportation, and spatial econometrics.

Yuting Hou

YUTING HOU is an assistant professor in the Department of Building and Real Estate at the Hong Kong Polytechnic University, Kowloon, Hong Kong SAR. E-mail: [email protected]. Her research interests include the interactions between land use and transportation, urban and regional economics, and applied spatial analysis.

Log in via your institution

Log in to Taylor & Francis Online

PDF download + Online access

  • 48 hours access to article PDF & online version
  • Article PDF can be downloaded
  • Article PDF can be printed
USD 53.00 Add to cart

Issue Purchase

  • 30 days online access to complete issue
  • Article PDFs can be downloaded
  • Article PDFs can be printed
USD 312.00 Add to cart

* Local tax will be added as applicable

Related Research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.