13,798
Views
33
CrossRef citations to date
0
Altmetric
Articles

The impact of regional inequality on economic growth: a spatial econometric approach

ORCID Icon & ORCID Icon
Pages 687-702 | Received 09 Feb 2019, Published online: 21 Apr 2021

ABSTRACT

This paper investigates the relationship between economic growth and regional income inequality in a spatial econometric perspective. The role of space in the measure of inequality is discussed, and a new theoretical model that relates inequality with economic growth is introduced. The proposed model extends a spatial Mankiw–Romer–Weil specification by introducing regional income inequality as a determinant of economic growth. The measure of inequality proposed as a covariate in the model is derived by a spatial decomposition of the Gini index. An empirical analysis focused on European Union NUTS-2 regions is carried out to illustrate the model.

INTRODUCTION

There is a body of literature examining the relationship between inequality and economic growth. Using individual income data, the impact of inequality on growth performances has been mainly investigated through cross-sectional growth regressions, where the average growth rate of income per capita is regressed on initial inequality and several other control variables (Alesina & Rodrik, Citation1994; Persson & Tabellini, Citation1994). Using cross-section analysis, Chen (Citation2003) assessed the existence of an inverted ‘U’-shaped relationship between initial income distribution and long-run economic growth. Using panel data, Li and Zou (Citation1998) and Forbes (Citation2000) proposed fixed-effects estimates. Fixed-effects and generalized method of moments (GMM) estimations of the relationship between inequality and growth have been performed by Panizza (Citation2002). An attempt to separate long- and short-run influences of inequality on growth has been made by Partridge (Citation2005).

Depending on the data and the methodologies used, the empirical literature reported conflicting results about the effects of inequality on economic growth. A negative impact of inequality on growth has been mainly supported by empirical cross-country studies (Alesina & Rodrik, Citation1994; Persson & Tabellini, Citation1994), while several other studies derived a positive relationship (Forbes, Citation2000; Li & Zou, Citation1998). Panizza (Citation2002) found a negative relationship between inequality and growth, which, however, is not robust to small changes in data or econometric specifications.

A different approach has been proposed in some contributions focused on regional disparities, their determinants, and their relations with growth and the level of economic development (Lessmann, Citation2014; Lessmann & Seidel, Citation2017; Petrakos et al., Citation2005). Furthermore, the inequality in the distribution of economic activities has been linked to both geographical characteristics and variables related to the level of development by Henderson et al. (Citation2018).

The need to assess the impact of inequality on growth in a spatial perspective has been highlighted in the recent literature (Martino & Perugini, Citation2008; Rey & Janikas, Citation2005). Along this research line, Janikas and Rey (Citation2008) proposed a spatial analysis that allows for simultaneous interactions between regional growth and inequality. A spatially augmented model which allows intra-regional disparities to be one of the determinants of regional economic growth has been proposed by De Dominicis (Citation2014).

Following these contributions, in this paper we introduce a new theoretical growth model that relates economic growth with regional inequality in a spatial perspective. Based on the spatially augmented Mankiw–Romer–Weil (MRW) (Mankiw et al., Citation1992) model introduced by Fischer (Citation2011), the proposed specification allows the impact of regional inequality on the growth process to be assessed. The model includes a regional inequality measure as a covariate. This measure is derived by the spatial decomposition of the Gini index introduced by Rey and Smith (Citation2013). In addition to verifying the impact of inequality on growth in a regional economy, the proposed model allows the extent to which regional growth is influenced by inequality in neighbouring regions to be assessed.

Our approach is different in some important aspects from De Dominicis (Citation2014), who also investigated this issue. First, while the theoretical model developed by De Dominicis is based on the specification given in Ertur and Koch (Citation2007), our model extends the framework proposed by Fischer (Citation2011).

Besides, our approach defines a theoretical model where the impact of human capital on growth is explicitly considered. Conversely, in De Dominicis (Citation2014), the impact of human capital is only empirically estimated without any theoretical implication. For this reason, our theoretical model can be considered as a methodological extension of the De Dominicis approach.

Finally, while De Dominicis (Citation2014) considered the within component of the Theil index (Theil, Citation1967) as a measure of regional inequality, we use the within component of the Gini index and its spatial decomposition proposed by Rey and Smith (Citation2013). Specifically, we introduce as an inequality variable in our model the non-spatial component of the within-region inequality, which expresses the component of intra-regional inequality that is not influenced by the spatial interactions among regional units. This definition allows us to consider the idiosyncratic component of inequality, isolating the spatial component of intra-regional inequality whose impact could alter the real effect of inequality on regional growth.

The proposed model is illustrated through an empirical analysis focused on European Union (EU) NUTS-2 regions over the period 2003–16. EU regions represent interesting units of analysis. In fact, the persistence of inequalities across EU regions, exacerbated by the global financial crisis, requires the definition of regional policies aimed at enhancing growth and regional equity. Most of the EU funds and programmes are specifically delivered for less developed regions to promote the convergence of these regions toward the richer ones. Considering the relationship between inequality and growth facilitates the assessment of the role that disparities within regional economies play in the convergence process. Within-region disparities could thus become a variable on which regional policies could be targeted to promote growth and convergence. The introduction of spatial effects in the analysis entails considering both spatial similarities and instabilities across space, facilitating a study of disparities and reduction policies more anchored to the characteristics of the spatial units under consideration.

Accounting for the peculiarities of regional economies could offer useful insights into the major socioeconomic trends that have interested the EU in recent decades. In fact, the higher internal disparities characterizing the new accession member states could impact on their catch-up process toward more developed regional economies. Assessing such dynamics also offers useful indications to address policies targeted to different regional contexts.

The rest of the paper is organized as follows. Next, we discuss the general theoretical growth model and our specification. We then present an empirical application concerning European NUTS-2 regions. Finally, we provide some concluding remarks and future research agenda.

A THEORETICAL MODEL OF ECONOMIC GROWTH

Analysing the relationship between regional inequality and economic growth in a spatial perspective requires an appropriate model specification. In fact, regional growth could be influenced by inequality within the regional economy as well as by inequality in neighbouring regions. This spatial dependence effect is so included in the proposed model that we derive theoretically.

Our model specification is obtained as an extension of the spatial MRW model developed by Fischer (Citation2011). However, while Fischer considers as the dependent variable in the model the level of regional gross domestic product (GDP) at the end of the period, we focus on the GDP growth rate, testing for the β-convergence hypothesis.

In our proposal, the spatially augmented MRW is modified by introducing a measure of regional inequality among the covariates. Specifically, as in De Dominicis (Citation2014), in our model the investment in physical capital is expressed as a function of inequality. This assumption allows the impact of inequality on growth to be assessed, and is based upon the idea that higher inequality among regions stimulates higher investments from richer regions. Moreover, according to standard economic theories, saving and investment are assumed to be identical and is a fairly typical result that the redistribution of resources from rich to poor tends to lower the aggregate rate of saving in an economy (Thorbecke & Charumilind, Citation2002). This argument found support in Kaldor (Citation1956) who highlighted that a high initial inequality of income favours high saving, because of the higher propensity to save that rich people have with respect to poor people and, thus, it stimulates higher capital accumulation. On the same line, Galor (Citation2000) argues that high income inequality has a positive impact on savings, resulting in a higher physical capital accumulation. Furthermore, a positive relationship between inequality and aggregate saving has been recently reported by Koo and Song (Citation2016) who found a greater and statistically more significant effect for richer and more developed countries.

In this paper we assume that each regional economy is characterized by a Cobb–Douglas production function that, for region i at time t, is specified as follows: (1) Yit=AitCitαCHitαHLit1αCαH(1) where Y is the output; A is the level of technology; C is the physical capital; H is the human capital; and L is the labour. The exponents αC and αH are the output elasticities with respect to physical capital and human capital, respectively. These output elasticities are positive constants (i.e., αC,αH>0) such that αC+αH<1, hence the production function has decreasing returns to both types of capital.

Denoted by siC and siH, which are the fractions of output Yit invested in physical capital and human capital, respectively; and by δ, which is a rate of depreciation that is assumed to be constant and identical for both physical and human capital, the dynamic equations for C and H are given by: (2) C˙it=siCYitδCit(2) (3) H˙it=siHYitδHit(3) where the dots over the variables indicate their time-derivatives. The dynamic equation for labour and technology are L˙it=niLit and A˙it=gAit, where ni is the working population growth rate; and g is a constant growth rate for technology. Now define the per worker quantities for output, physical capital and human capital as yit=Yit/Lit, cit=Cit/Lit and hit=Hit/Lit, respectively. The production function (1) in per worker terms becomes: (4) yit=AitcitαChitαH(4) Introducing spatial externalities, the level of technology, for each region i, is expressed as follows: (5) Ait=ΩtcitϕChitϕHj=1nAjtγwij(5) According to equation (5), the function Ait depends on four terms. The first term, Ωt, represents the proportion of technological progress that, by assumption, is exogenous and identical for all regions. It is defined by Ωt=Ω0eτ, where τ is the exogenous technological progress growth rate; and Ω0 is its initial level. The next two terms imply that the level of technology for each region i depends on both the level of physical capital per worker, cit, and the level of human capital per worker, hit. This implies that investments in both physical and human capital increase not only capital stocks but also the level of technology through knowledge spillovers. The strength of these externalities is determined by the parameters ϕC, with 0ϕC<1, and ϕH, with 0ϕH<1, respectively. The last term implies that the level of technology of region i also depends on the level of technology of its neighbour regions j, forj=1,2,,N. The degree of technological interdependence among regions depends on the scalar parameter γ, with 0γ<1, and on the proximity between regions that is summarized by the matrix W. The (i,j)-th element of W, wij, is such that 0wij1, with wij=0 for i=j, and j=1Nwij=1. For regions that are neighbours of region i, wij>0, for i,j=1,2,,N.

The dynamic equations for physical capital per worker and human capital per worker (equations 2 and 3) become: (6) c˙it=siCyit(δ+ni)cit(6) (7) h˙it=siHyit(δ+ni)hit(7) Besides, we assume that the fraction of output that is invested in physical capital is a function of a measure of inequality, that is, siC=f(Gi)=Ineqi, where Gi is a measure of inequality for region i. This assumption implies that higher inequality among regions should stimulate higher investments from richer regions.

Following this assumption, equation (6) can be rewritten as: (8) c˙it=Ineqiyit(δ+ni)cit(8) Equations (8) and (7) imply that the physical capital output and human capital output are constant so that: (9) ciyi=Ineqi(δ+g+ni)(9) (10) hiyi=siH(δ+g+ni)(10) where the asterisk denotes the steady-state value of the variable. The steady-state conditions for the physical capital and human capital variables can be derived by equations (9) and (10), respectively. Substituting these expressions in equation (4) and taking the logarithm, after some algebra, we obtain: (11) lnyi=11ηlnΩt+αC+ϕC1ηlnIneqi+αH+ϕH1ηlnsiHη1ηln(δ+g+ni)αC1ηγj=1NwijlnIneqiαH1ηγj=1NwijlnsjH+αC+αH1ηγj=1Nwijln(δ+g+nj)+1αCαH1ηγj=1Nwijlnyj(11) where η=αC+αH+ϕC+ϕH. In order to define the convergence of output per worker in a given region to the region’s steady-state value, we consider a log-linearization of equations (8) and (7) around steady states, which lead to a system of linear differential equations that can be solved by using the technique developed by Ertur and Koch (Citation2007). Postulating that the gap of region i relative to its own steady state is proportional to the corresponding gap for region j, it follows that: (12) dlnyitdt=gλi[lnyitlnyi](12) where g denotes the constant growth rate of physical and human capital; and λi is the speed of convergence. Solving equation (12) for lnyit, and subtracting from both sides the output per worker at some initial date, lnyi0, we obtain the following expression, written in a matrix notation: (13) G=gtιDy0+Dy(13) where G=yty0, with yt the N×1 vector of logarithms of output per worker at time t; and y0 the N×1 vector of logarithms of the initial level of output per worker, ι is a N×1 vector of ones, D is a N×N diagonal matrix with (1eλit) elements on the main diagonal, and y is the N×1 vector of logarithms of output per worker at steady state. Following the rationale of Ertur and Koch (Citation2007), rewriting equation (11) in a matrix notation we have: (14) y=(IρW)1[11ηΩ+αC+ϕC1ηIn+αH+ϕH1ηSαCγ1ηWInαHγ1ηWS](14) where ρ=γ(1αCαH)1η, In and S are the N×1 vectors of the logarithms of inequality measures and saving rates of human capital (i.e., lnsiH), respectively, divided by the effective rate of depreciation (i.e., by ln(δ+g+ni)). Substituting (14) in (13), and pre-multiplying both sides by the inverse of D(IρW)1, after rearranging terms, we obtain: (15) G=gtιρgtDWD1ι+11ηDΩDy0+αC+ϕC1ηDIn+αH+ϕH1ηDS+ρDWy0αCγ1ηDWInαHγ1ηDWSρDWD1G(15) Equation (15) can be rewritten for region i as follows: (16) lnyitlnyi0=Δi(1eλit)lnyi0+(1eλit)αC+ϕC1ηlnIneqi+(1eλit)αH+ϕH1ηlnsiH(1eλit)η1ηln(δ+g+ni)+(1eλit)1αCαH1ηγj=1Nwijlnyj0(1eλit)αC1ηγj=1NwijlnIneqj(1eλit)αH1ηγj=1NwijlnsjH+(1eλit)αC+αH1ηγj=1Nwijln(δ+g+nj)+(1eλit)1αCαH1ηγj=1N11eλitwij(lnyitlnyi0)(16) where: Δi=gtρgt(1eλit)j=1Nwij1eλjt+1eλit1ηlnΩt.Equation (16) states that the growth rate of output per worker, in region i, depends negatively on its initial level, positively on inequality and on human capital investment rate, and negatively on the population growth rate. The growth rate of output per worker in region i also depends on the same variables in the neighbouring regions j, identified by wij>0, because of technological interdependence (Ertur & Koch, Citation2007). Specifically, it is negatively influenced by inequality and human capital investment rate in neighbouring regions and is positively influenced by their working population growth rate. As a further evidence in equation (16), the initial level of output per worker in neighbouring regions has a positive influence on the growth rate of output per worker in region i. Finally, the last term in equation (16) represents the rate of growth of output per worker in the neighbour regions and reflects the spatial autocorrelation process that is implied by the technological interdependence.

MODEL SPECIFICATION

The inequality variable

Our approach represents an extension of the spatial MRW introduced by Fischer (Citation2011), where an inequality measure is included among the explanatory variables. In our model specification the inequality variable expresses the intra-regional inequality, that is, the level of internal inequality characterizing each regional unit.

Specifically, in our proposal we define and use an inequality index that is based on the non-spatial component of the within-region Gini index.

The Gini index G can be decomposed in three terms, such as the within-region inequality Gwithin, the between-region inequality Gbetween, and a residual or interaction term R (Mookherjee & Shorrocks, Citation1982; Shorrocks & Wan, Citation2005). While the between-group component captures the interregional inequality, and thus measures the distance between the average GDP of each region, the within-group term expresses the distance between the GDP values observed for spatial units that belong to the same region. The latter component thus expresses the inequality in each regional economy and could be assumed as influencing the economic growth rate in each region (see also De Dominicis, Citation2014).

Considering N NUTS-2 regions labelled as i=1,2,,N with respective mean income and population share μi and pi, the within-term of the Gini index can be expressed as: (17) Gwithin=i=1Npi2μiμGi(17) where μ represents the mean of the overall income distribution; and Gi is the Gini index for region i. The index in (17) represents a kind of average of the inequality values within each regional unit, and is fully specified by information about population share, mean income and inequality values.

To compute the Gini index Gi for each NUTS-2 region i, it is needed to consider the value of income yl of each l NUTS-3 sub-region that belongs to the specific NUTS-2 region i, withl=1,2,,Mi . The Gini index Gi is thus defined as: (18) Gi=l=1Miz=1Mi|ylyz|2Mi2y¯i(18) where: y¯i=l=1Miyl/Mi.

To define our measure of inequality, we apply the spatial decomposition of the Gini index into its neighbour and non-neighbour components introduced by Rey and Smith (Citation2013). Other contributions in this direction have been proposed by Arbia (Citation2001), Márquez et al. (Citation2019) and Panzera and Postiglione (Citation2020).

Denoted by vlz, the generic element of the spatial weight matrix V, which summarizes the proximity between the spatial units at NUTS-3 level, the Gini index in (18) can be expressed as follows (Rey & Smith, Citation2013): (19) Gi=l=1Miz=1Mivlz|ylyz|2Mi2y¯i+l=1Miz=1Mi(1vlz)|ylyz|2Mi2y¯i=NGi+NNGi(19) where, for each NUTS-2 region i, the first term on the right-hand side represents the neighbour component (NGi), and the second one represents the non-neighbour component (NNGi). This decomposition reveals that the Gini index nests a measure of spatial autocorrelation. As the spatial autocorrelation increases, the NNGi component should grow relatively to the NGi component, since values similarity in space would be in effect. The result is the opposite in the presence of negative spatial autocorrelation, since the difference between the variate values observed for non-neighbour pairs would be smaller than that between neighbour pairs. In either case, the degree of spatial autocorrelation can be assessed by comparing the relative contribution of these two components (Rey & Smith, Citation2013). According to this interpretation, the NNGi component could be considered as the spatial component of Gi, which changes in the same direction as the positive spatial autocorrelation relationship. The non-spatial component of inequality can be thus identified and measured by the NGi component. The identification of the spatial and non-spatial components of inequality allows a better understanding of the phenomenon and can offer useful insights for assessing the impact of inequality on regional growth.

Equation (17) can be thus rewritten as: (20) Gwithin=i=1Npi2μiμNGi+i=1Npi2μiμNNGi=NGwithin+NNGwithin(20) where NGi and NNGi indicate the neighbour and non-neighbour components, respectively, of the Gini index within region i, with i=1,2,,N. In our model we consider, for each region i, the non-spatial inequality (Ineqi=pi2μiμNGi, the summands of the first term on the right side of equation 20). Since population-weighted inequality indices are considered, instead of assuming the GDP per worker as representative of the whole region, we account for how many individuals this GDP per worker represents (Mookherjee & Shorrocks, Citation1982). Hence, weighting by population facilitates accounting for regional sizes that, especially in the European context, present relevant differences (Portnov & Felsenstein, Citation2010). This approach also appears consistent with the approach proposed by De Dominicis (Citation2014), which quantifies intra-regional inequality in the EU using the within component of the Theil index that implicitly includes the weighting by population.

The non-neighbour component introduced in the model represents the component of inequality that is not influenced by spatial autocorrelation in the GDP values. However, the spatial interactions among regional economies are included in the model by considering the spatial lag of the inequality variable.

The empirical model

Assuming parameter homogeneity (λi=λ for all i), the empirical counterpart of the model given in equation (16) can be expressed, at a given time, for region i, as follows: (21) gi=β0+β1lnyi0+β2lnIneqi0+β3lnsiH+β4ln(δ+g+ni)+ρj=1Nwijgj+ρ1j=1Nwijlnyj0+ρ2j=1NwijlnIneqi+ρ3j=1NwijlnsiH+ρ4j=1Nwijln(δ+g+nj)+ϵi(21) where: gi=1Tln(yityi0)with T denotes the number of time periods under investigation; yit and yi0 express the GDP values per worker at the final and initial periods, respectively; and the inequality variable is expressed as: lnIneqi0=ln(pi02μi0μ0NGi0),where the subscript indicates, for all the quantities, that they refer to the initial time period. Considering the initial level of inequality is in line with the specification proposed by De Dominicis (Citation2014), and also appears consistent with the assumption about the relation between initial inequality and investments on which the theoretical model is based (Galor, Citation2000; Kaldor, Citation1956; Koo & Song, Citation2016).

The parameter β0 is a constant: ϵiN(0,σ2),as are the error terms: β1=1eλTT β2=1eλTTαC+ϕC1η β3=1eλTTαH+ϕH1η,β4=1eλTTη1η,ρ1=1eλTT(1αCαH)1ηγ ρ2=1eλTTαC1ηγ,ρ3=1eλTTαH1ηγ,ρ4=1eλTTαC+αH1ηγ

and ρ=γ(1αCαH)1η.

The specification in (21) is known in literature as the spatial Durbin model (SDM) (LeSage & Pace, Citation2009). The SDM introduces a spatial lag on the dependent variable as well as on all the explanatory variables.

The linear specification in (21) is supported by the theoretical model proposed and developed in the previous section. When the empirical specifications are not supported by the theory, the unknown functional form represents a potential source of misspecification. This issue could be addressed by adopting a semiparametric framework, which is generally more flexible with respect to parametrical models. Semiparametric specifications that combine non-linearity with the simultaneous treatment of spatial dependence and spatial heterogeneity have been proposed in the recent literature (Basile & Gress, Citation2005; McMillen, Citation2012). A penalized-spline SDM has been introduced by Basile et al. (Citation2014). This model represents an important tool to identify the functional form of the relationship between the response variable and its predictors, and even in the presence of structured model can be useful to distinguish between substantive spatial dependence (i.e., spatial spillovers) and spatial dependence generated by omitted variables.

The substantive spatial dependence occurs when the variable of interest at one location is influenced by its values at other locations and is captured by spatial weight matrices commonly specified according to physical, economic or social distances. Conversely, the spatial dependence generated from a number of latent factors is interpreted as a residual dependence that affects the error term (Anselin & Rey, Citation1991).

Common factor structures, that consider interaction effects among the error terms, are increasingly used in recent models developed to deal with cross-sectional dependence in a panel data setting (Bailey et al., Citation2016; Pesaran, Citation2015). In a cross-sectional setting, the discrimination between substantive and residual spatial dependence has been mainly based on tests on parameter restriction, such as the common factor test (Anselin, Citation1988). According to the common factor hypothesis an SDM specification can be simplified by a spatial error model, which includes the spatial interactions among the error terms. This hypothesis is thus formulated by imposing a parameter restriction on the SDM specification.Footnote1

A further issue related to spatial data pertains to the simultaneous treatment of spatial dependence and spatial heterogeneity. The identification of spatial regimes could be a suitable alternative to account for the heterogeneity in the model parameters (Durlauf & Johnson, Citation1995; Quah, Citation1997). Clusters of regions that share similar structural characteristics and growth paths could be identified by some a priori criteria, like the initial level of GDP per capita or the belonging to some geographical zones, or using endogenous methods (Durlauf & Johnson, Citation1995; Postiglione et al., Citation2010, Citation2013). The application of exogenous criteria for the identification of clusters of EU regions has been advised by Ertur et al. (Citation2006) and a classification of European regions based on their level of development has been recently proposed by Iammarino et al. (Citation2019). The classification of regions according to their level of development is relevant in a policy perspective, being the basis for determining the regional eligibility for structural and investment funds. Our exogenous identification of spatial regimes is based on the 2007–13 funding period, in which EU regional policy consisted of three objectives, such as convergence, regional competitiveness, and employment and European territorial cooperation. The regions covered by the convergence objective correspond to the less developed regions, with per capita GDP less than 75% of the EU average. In the proposed empirical application, we focus on the latter classification to identify clusters of regional units that potentially differ in their growth paths.

The identification of clusters of regions based on the convergence objective allows to control for specific characteristics that the regions with similar development patterns share.

Furthermore, different regional growth conditions are introduced in our empirical model, by including dummy variables that facilitate considering the presence of capital cities in the regions as well as the cross-border effects. The presence of the capital city in the region could represent a growth promoting factor since these cities usually concentrate a number of relevant economic and social activities, being the administrative and economic centre of the country (Butkus et al., Citation2018). Conversely, being located on borders could represent a barrier to regional economic growth. Border regions represent a peculiar regional typology (Eurostat, Citation2019). They could be classified into two main categories such as internal border regions and external border regions. While internal border regions identify regional units that are located on borders between EU member states and/or European Free Trade Association (EFTA) countries, the external border regions correspond to those regions that are located on borders between EU member states and non-member countries (or countries outside the EFTA). The border typology is usually identified relying on land borders (Eurostat, Citation2019), but different definitions, including both land and maritime borders, are possible (Capello et al., Citation2018). Our analysis is focused on the land border typology, and both the internal and the external border regions are defined at the NUTS-2 level (see the empirical application). Border regions are typically characterized by a geographically peripheral location, that is far from the power and administrative centre of the country, and close to regions belonging to other countries. These areas could suffer from the border effect, that consists in limited cross-country economic interactions due to differences in culture, regulations, and business norms and higher transaction costs between countries (Capello et al., Citation2018). A number of empirical analyses focused on European countries revealed the presence of border effects also for regions that are located on EU internal borders, despite the market integration (see, among others, Kashiha et al., Citation2016). The impact of international borders has been mostly analysed with reference to trade relations, and measured in term of lost trade (Anderson & van Wincoop, Citation2003; McCallum, Citation1995). Recent contributions focused on the relation between spatial inequality within countries and trade have been proposed by Rodríguez-Pose (Citation2012), and Ezcurra and Rodríguez-Pose (Citation2013, Citation2014). Border effects have been linked to economic growth in Capello et al. (Citation2018) that developed a model including a dummy variable to identify border regions. The authors found that the growth performances in internal border regions do not significantly differ from those of non-border regions, even if the former appear to be unable to exploit their resources as efficiently as the other EU regions. These results could be explained through the positive effects produced by the cross-border cooperation programmes supported by the European Commission.

The empirical model (21) is augmented with the aforementioned dummy variables, yielding to the following specification: (22) gi=β0+β1lnyi0+β2lnIneqi0+β3lnsiH+β4ln(δ+g+ni)+β5CCi+β6IBi+β7EBi+ρj=1Nwijgj+ρ1j=1Nwijlnyj0+ρ2j=1NwijlnIneqi+ρ3j=1NwijlnsiH+ρ4j=1Nwijln(δ+g+nj)+ϵi(22) where CCi denotes the capital city dummy variable, such that CCi=1 if the capital city is located in the region i and CCi=0 otherwise; IBi is the internal border dummy variable, such that IBi=1 if unit i is an internal border region and IBi=0 otherwise; and EBi represents the external border dummy variable, with EBi=1 if unit i is an external border region and IBi=0otherwise. Equation (22) represents our empirical model that will be estimated in our illustration.

Possible bidirectional causal effects between inequality and regional growth are not investigated in our analysis, which has the main objective of investigating the role of regional disparities in driving economic growth. For a recent contribution on assessing the impact of regional income levels on inequality using spatial econometrics tools, see Artelaris and Petrakos (Citation2016). In our proposal, the link between regional growth and within-region inequality is derived by the developed theoretical model, which is empirically tested in the following section.

EMPIRICAL ANALYSIS

To test the model (22), we focus on data related to 245 EU NUTS-2 regions from 22 EU countries.Footnote2 The source of the data is Eurostat’s general and regional statistics database; the period under investigation ranges from 2003 to 2016.

In model (22), the dependent variable, gi, expresses the natural logarithm of the average growth rate of GDP per worker. The inequality variable, lnIneqi0, is represented by the natural logarithm of the neighbour component of Gwithin (that is considered as the part of inequality not determined by spatial dependence). For each of 245 NUTS-2 region, Gwithin of the Gini index is calculated using GDP values per worker related to 1255 NUTS-3 regionsFootnote3 belonging to the aforementioned NUTS-2 units and it is decomposed into its neighbour and non-neighbour components according to equation (20). The resulting inequality variable is referred to the initial time period (i.e., 2003).

The human capital variable, lnsiH, is measured as the natural logarithm of the percentage of population aged from 25 to 64 years who attained tertiary education (levels 5–8 of the International Standard Classification of Education (ISCED) 2011 classification). This percentage is considered with reference to 2005, which is the early period for which data on this variable are available for the considered regions.

Note that considering the explanatory variables at the beginning of the period under investigation, or as close as possible to it, helps to mitigate problems connected to endogeneity (Crespo Cuaresma et al., Citation2014).

The variable ni is measured by the natural logarithm of the average (over the period 2003–16) growth rate of the working population. The term (δ+g) is supposed to be equal to 0.05, following a common assumption in the literature (Mankiw et al., Citation1992).

The proximity relationships among regional units are summarized by the spatial weight matrix that, in our empirical analysis is constructed according to the k-nearest neighbours, with k=7Gwithin. Thek-nearest neighbour criterion is built considering the Euclidean distance between the centroid of each spatial unit i and the centroids of all other units ji, for i,j=1,2,..,N. The k closest units to i define the set of its neighbours. The choice of k=7 allows to avoid the presence of non-connected regions. In fact, as highlighted by Le Gallo and Ertur (Citation2003), EU regions have on average five to six contiguous neighbours. Different values of k, as k=5and k=9, have been considered in the definition of the proximity matrix. However, these different specifications of the proximity matrix did not give rise to significant differences in the results.Footnote4

The main summary statistics of the dependent and independent variables are reported in . All variables are not expressed in natural logarithm terms, and the values of the inequality variable are not weighted by the population shares.

Table 1. Summary statistics by variable.

As additional variables in our model we consider dummy variables that indicate whether the region includes the national capital city, and if it is an internal or external border region. In our analysis, border regions are identified at the NUTS-2 level. Around the 48% of the regions under consideration are internal border areas (i.e., 117 regions), while around the 10% of these regions (i.e., 24 regions) are located on external borders. A map of border regions is presented in . Dark grey indicates the border regions.

Figure 1. Internal border (a) and external border (b) European Union NUTS-2 regions.

Note: Dark grey indicates the border regions.

Figure 1. Internal border (a) and external border (b) European Union NUTS-2 regions.Note: Dark grey indicates the border regions.

The variables under consideration are modelled through the SDM specification that is supported by the developed theoretical model. As a robustness check, this specification is compared with the spatial lag and the spatial error specifications. To this end, we use the Lagrange multiplier (LM) tests, in their classic and robust versions, which are based on the residuals of the ordinary least squares (OLS) model (Elhorst, Citation2010). According to the approach introduced by Elhorst (Citation2010), when, based on the LM tests, the spatial lag, the spatial error model or both these specifications are preferable to the OLS model, the SDM specification should be estimated. Moreover, when these spatial models are estimated by maximum likelihood (ML), likelihood ratio (LR) tests can subsequently be used to verify two different hypotheses. The first hypothesis concerns whether the SDM can be simplified by a spatial lag model; the second hypothesis concerns whether the SDM could be simplified by a spatial error specification. The latter hypothesis corresponds to the common factor hypothesis. Results for LM and LR tests are reported in .

Table 2. Lagrange multiplier (LM) tests on spatial dependence and spatial error autocorrelation, likelihood ratio (LR) tests on parameter restrictions.

Results for the classic LM tests indicate that the OLS model should be rejected in favour of both the spatial specifications, and, similarly, both the robust LM tests are highly significant. These results indicate that the SDM specification should be estimated. Results for the LRerr test show that the common factor hypothesis is rejected and, similarly, the LRlag test is significant. These results confirm the appropriateness of the SDM specification.

The ML estimates for the SDM specification are reported in . The standard OLS estimates of the model are also presented, together with some diagnostics and performance measures. As shown, the coefficients associated with the initial level of GDP per worker, the capital city and the internal border dummy variables are significant in the non-spatial model. Furthermore, the signs of the coefficients are as expected for all explanatory variables, with the only exception of the working population growth rate. The coefficient associated with the capital city dummy variable expresses the differences in growth between the regions where the national capital cities are located and the other regions, given the other characteristics. The coefficient estimate is positive and statistically significant indicating the presence of a slight difference in the growth rate of GDP per worker between the regions where the capital cities are located and other regions, with the first ones overperforming with respect to the second ones. Similar considerations apply to the coefficient associated with the internal border dummy variable that indicates a growth rate of GDP per worker slightly higher in the internal border areas than in the other regional units. The coefficient associated with the external border dummy is negative, but it is not statistically significant. The goodness of fit of the model measured in term of R2 is around 0.60, and the Moran’s I test is highly significant, giving evidence of the presence of spatial dependence.

Table 3. Estimates for ordinary least squares (OLS) and the spatial Durbin model (SDM).

Moving to the SDM specification, all coefficients associated with the independent variables have the expected sign, with the only exception of the coefficient associated with the spatial lag of the inequality variable. The coefficient associated with this variable is not statistically significant as well as the coefficient associated with the inequality variable and the spatial lag of the initial level of GDP per worker. The coefficients associated with the dummy variables in the model are all significant. The positive sign of the coefficients associated with the capital city and the internal border dummy variables reveal that both these categories of regions overperform the other regions in the GDP per worker growth rate. The negative sign of the coefficient associated with the external border dummy variable shows that being located on the external border negatively impacts on the regional growth performances.

A lower value of the Akaike information criterion (AIC) indicates an improvement in the goodness of fit of the spatial specification with respect to the non-spatial one. Both the specifications indicate that a convergence process is taking place among the considered regions, but the inclusion of spatial effects tends to reduce the estimated speed of convergence.

As pointed out by LeSage and Pace (Citation2009), in order to correctly interpret the SDM coefficients, the computation of direct, indirect and total impacts is needed. In fact, because of the spatial dependence effect that is incorporated in the model, a change in an explanatory variable for a single spatial unit has a direct impact on the dependent variable at the same location and could indirectly affect the dependent variable at different locations. The average direct impact provides a summary measure of the impact arising from changes in the i-th observation of a covariate on the dependent variable of the i-th region, and also includes feedback influences. The average indirect impact reflects spatial spillovers. The sum of the average direct and indirect impacts measures the average total impact. reports the average direct, indirect, and total impacts that would arise from changing each explanatory variable in the SDM specification. A set of 2000 Monte Carlo (MC) draws is used to produce the impact estimates along with inferences regarding their statistical significance.

Table 4. Average impacts for the spatial Durbin model (SDM).

A comparison between the average direct impacts in and the parameter estimates in reveals that these estimates have the same signs and are similar in magnitude. The slight differences between the two sets of estimates are due to feedback effects (Fischer, Citation2011).

The average indirect impact estimates for the human capital and the population growth rate are significant and have the expected signs. The average indirect impact associated with the initial level of GDP per worker is negative and significant, while the average indirect impact estimate for the inequality variable is positive but not significant. The average total impact estimates are significant only for the initial level of GDP per worker.

To account for spatial heterogeneity, we estimate the SDM model for two different clusters of regions corresponding to regions that are covered by the convergence objective and regions that are not covered by the convergence objective. A spatial map of these two clusters is shown in . Dark grey indicates regions that are covered by the convergence objective. Light grey is used for non-convergence regions.

Figure 2. Convergence objective regions (dark grey) and non-convergence objective regions (light grey) identified according to the European Union's Cohesion Policy (2007–13).

Figure 2. Convergence objective regions (dark grey) and non-convergence objective regions (light grey) identified according to the European Union's Cohesion Policy (2007–13).

Estimation results for the identified spatial clusters are reported in . As shown, some differences in the parameter estimates emerge between the two clusters of regions. The coefficient associated with the initial level of GDP per worker is negative and significant for both clusters, indicating that a convergence process is taking place in both these groups of regions. However, the speed of convergence for less developed regions is higher with respect to that reported for the more developed regions. This result indicates that economic growth proceeds faster in less developed regions than in other regions, and this circumstance could be partly attributed to the presence of EU funding in support of these regions. The coefficient associated with the inequality variable is positive and significant only for the cluster of less developed regions. This result indicates that, for this group of regions, inequality positively impacts on the process of economic growth. This result is consistent with the empirical findings in De Dominicis (Citation2014). As a further result, we obtain that the human capital positively impacts on growth, but this effect is only significant for more developed regions. The coefficient associated with the population growth rate is negative and significant for both clusters of regions. The coefficients associated with the dummy variables reveal that the presence of the national capital city has a significant positive impact on regional growth for both more developed and less developed regions. Being located on the internal borders appears to influence regional growth positively, but this effect is significant only for more developed regions. Conversely, being located on the external border appears as detrimental for growth, but this negative impact is only significant for less developed regions. All coefficients associated with the spatially lagged explanatory variables have the expected sign in the cluster of regions not covered by the convergence objective. However, the coefficients associated with the spatial lag of the initial level of GDP per worker and with the spatial lag of the inequality variable are not significant. For less developed regions, the coefficient associated with the spatially lagged inequality variable is positive and statistically significant, revealing that the inequality observed in neighbour regions positively impact on growth in a particular region. As further results, we can note that the spatial autocorrelation parameter is positive and highly significant, and the AIC reveals that the goodness of fit of the model is improved in the heterogeneous model, with respect to the homogeneous one. These results point out the importance of accounting for both the heterogeneity and the spatial dependence of observations.

Table 5. Parameter estimates for spatial regimes, convergence objective versus non-convergence objective regions.

In order to evaluate the significance of the identified spatial clusters, we can use the Chow test (Chow, Citation1960). The null hypothesis of the test is that the vector of parameters is constant. This test is implemented for all model parameters jointly (overall stability test) as well as for each parameter separately (individual stability test). Results for the overall stability test reveal that the spatial Chow test is highly significant, giving support to the idea of considering spatial regimes to control for spatial heterogeneity. Concerning the individual stability tests, we can observe that tests on the equality of the coefficients associated with the initial level of GDP per worker, with the inequality variable and with the capital city dummy variable are significant. This evidence shows that the spatial heterogeneity present in the considered regions is mainly due to the joint effect of these variables.

For the correct interpretation of the SDM coefficients we refer to direct, indirect and total impacts that, for both the identified spatial clusters, are reported in . For both groups of regions, the average direct impacts have the same sign as the parameter estimates (). The results reveal that the human capital is a growth promoting factor in more developed regions, while does not have a significant direct effect in less developed regions. Conversely, inequality positively impacts on growth in less developed regions, while does not have a significant direct effect on growth in more developed areas. For more developed regions, the average indirect effects reveal significant spatial spillovers for the human capital variable (positive indirect effect) and for the population growth rate (negative indirect effect). In regions covered by the convergence objective, significant spatial spillover effects are associated with the initial level of GDP per worker (negative indirect effect), the inequality variable (positive indirect effect), and the population growth rate (negative indirect effect).

Table 6. Average impacts for spatial regimes, convergence objective versus non-convergence objective regions.

These empirical results suggest that that differences in the level of development of regions could determine differences in the factors driving regional growth. In fact, the positive impact of inequality on regional growth that is assumed in our theoretical model is only verified for less developed regions. Modelling spatial heterogeneity in the proposed model allows to appreciate the differences between the convergence-objective and the non-convergence objective regions, and to confirm the assumed relationship between inequality and regional growth relatively to the first group.

The positive relationship between inequality and growth is significant at the earlier stages of development because of the main role played by physical capital accumulation on regional growth (Galor & Moav, Citation2004). Hence, the mechanism according to higher inequality stimulates higher saving and physical capital accumulation, and thus growth, is verified for less developed regions. Conversely, in more developed economies, the human capital accumulation appears to replace physical capital accumulation as a growth promoting factor (Galor & Moav, Citation2004). Our empirical results support this evidence, revealing a significant and positive effect of human capital on growth only for transition and more developed regions.

Our empirical results partly confirm empirical findings in the previous literature. Like in De Dominicis (Citation2014) our empirical evidence shows the growth promoting effect of a relatively concentrated distribution of GDP at lower level of development. Unlike De Dominicis (Citation2014), we found a positive and significant spillover effect associated with the inequality variable for less developed regions. This implies that, at lower levels of development, the initial internal disparities in neighbour regions have a positive impact on economic growth in a particular region. Our results confirm the empirical evidence in De Dominicis (Citation2014) about the role of inequality in more developed regions. In fact, like in the De Dominicis (Citation2014), our empirical results indicate, for this group of regions, not significant impacts of inequality on regional growth. This suggests that initial regional disparities in more advanced regional economies do not exert a role in enhancing regional growth. However, our analysis provides new evidence for less developed regions, by isolating the actual inequality (i.e., the component of inequality that is not influenced by spatial dependence) from the component of inequality that is influenced by regional interdependencies driven by geographical proximity.

The different impacts of inequality on growth for different groups of regions suggest the existence of several factors underlying the distinctive growth experiences. In fact, the differences in the economic convergence process experienced by regions result from their different levels of development and from the interactions among policy and institutional aspects, as well as among historical and geographical factors characterizing the regional units. Raising the awareness about these differences among regions could be useful in promoting place-based policies, which focus on the peculiarities and the specific needs of territories.

CONCLUSIONS

In this paper we developed a theoretical model that relates inequality and regional growth, explicitly taking into account the spatial interactions among regions. The proposed model extends the spatial MRW model developed by Fischer (Citation2011), introducing inequality among the explanatory variables. The physical capital investment rate is expressed as a function of inequality, based on the assumption that higher inequality enhances investments from richer regions, while the redistribution of resources from rich to poor regions, determines a decreasing of savings and investments (De Dominicis, Citation2014).

The proposed specification includes the spatial lag of the exogenous variables as well as the spatial lag of the endogenous variable and is referred to as the spatial Durbin model (LeSage & Pace, Citation2009). The inequality variable included in the model specification is derived by the spatial decomposition of the Gini index proposed by Rey and Smith (Citation2013).

Our theoretical model states that the growth rate of output in a regional economy depends positively on inequality in the region and negatively on inequality in neighbouring regions. By taking into account the spatial interactions among regions, the predictions of our model allow a better understanding of the role played by geographical location and spatial externalities in regional growth and in its link with income inequality.

The proposed model is tested focusing on 245 NUTS-2 regions across 22 European countries, over the period 2003–16. Results from test on parameter restrictions confirm the appropriateness of the proposed specification, and the empirical estimates mainly support the theoretical model. The coefficient associated with the inequality variable is not significant when considering the homogeneous model. Assuming two different spatial regimes, identified according to the eligibility of regions for EU funds under the convergence objective, reveals differences in the growth process as well as in the growth promoting factors. Specifically, we found that in less developed regions the converge process is taking place at a faster rate than that obtained for the more developed regions. This result could be explained by the EU policy that support convergence in poorer areas, but also by the combination of different factors that exert a growth promoting effect. Specifically, the initial internal disparities appear to enhance growth in less developed regions, while do not play a significant role in wealthy regional economies. In this latter group of regions, the investment in human capital has a positive impact on growth, while it does not have a significant effect on growth at lower level of development. These differences emerge only when spatial similarity is considered jointly with regional peculiarities, and confirm the relevance of place-based policies, differentiated according to the specific needs of regional economies. The identification of two separate clusters, corresponding to different levels of regional development, facilitate a deeper understanding of the impact of inequality on growth in different regional economies. A further step, in the future research, might consist in introducing these differences in the definition of the proximity relationships. This would imply assigning different spatial weights to neighbour regions according to their level of development.

To proper extract the influence of neighbour regions, such proximity matrices could be also defined differentiating among the neighbour units that belong to the same or a different country with respect to the region under consideration. Defining proximity structures based on these criteria would allow to summarize in a single matrix specific effects that in our model are introduced by using dummy variables and spatial clustering.

DISCLOSURE STATEMENT

No potential conflict of interest was reported by the author(s).

Additional information

Funding

This study was financed by the project H2020 IMAJINE, which received funding from the European Union's Horizon 2020 Research and Innovation Programme [grant agreement number 726950]. This paper reflects only the authors' view. The European Commission is not responsible for any use that may be made of the information the paper contains.

Notes

1. Given the SDM specification Y=ρWY+Xβ+WXθ+ϵ, with ρ, β and θ denoting the parameters associated with the spatially lagged endogenous explanatory variable, with the exogenous explanatory variable and with the spatially lagged exogenous explanatory variables, respectively, the spatial error specification is derived by imposing the following parameter restriction: θ=ρβ.

2. The NUTS-2 regions that, because of missing data, are excluded by the analysis are the five regions belonging to Denmark, two regions of Germany: Chemnitz and Leipzig; two regions of Spain: Ciudad Autonoma de Ceuta and Ciudad Autonoma de Melilla; the five overseas Departments in France; Cyprus; the two regions of Slovenia; the two regions of Croatia; the seven regions of Hungary; and the one region of the UK: Highlands and Islands. The region of London is partitioned into Inner London and Outer London, according to the 2010 NUTS classification. For the identification of all other NUTS-2 regions, we refer to the 2013 NUTS classification. Luxembourg is excluded from the analysis because of the coincidence between the NUTS-2 and NUTS-3 levels of classification, which does not allow the calculation of the within-component of inequality.

3. Among the EU NUTS-2 regions there are some regions composed by a relatively large number of NUTS-3 regions, and other regions composed by only one NUTS-3 region. For the latter regions, the Gini index assumes a value of 0. In order to include also these regions in our analysis, we follow the same approach proposed by De Dominicis (Citation2014), and when a NUTS-2 region is composed by only one NUTS-3 region, we assign to the region a value of the Gini index equal to the average of the Gini index computed for the NUTS-2 regions in the same country.

4. Estimation results obtained for these different specifications of the proximity matrix are available from the authors upon request.

REFERENCES