3,319
Views
3
CrossRef citations to date
0
Altmetric
Original Articles

A trade hierarchy of cities based on transport cost thresholds

ORCID Icon, ORCID Icon & ORCID Icon
Pages 1359-1376 | Received 01 Nov 2020, Published online: 17 Sep 2021

ABSTRACT

Empirical evidence has been lacking to explain trade agglomerations within countries. Starting with a novel micro-database of road freight shipments between Spanish municipalities for the period 2003–07, we break down city (municipal) trade flows into the extensive and intensive margins and assess trade frictions and trade concentration relying on a unique generalized transport cost measure and three internal borders: NUTS-5 (municipal), NUTS-3 (provincial) and NUTS-2 (regional). We find a stark accumulation of trade flows up to a transport cost value of €189 (approximately 170 km) and conclude that this high density is not due to administrative borders effects but to significant changes in the trade-to-transport costs relationship. To support this hypothesis, we propose and conduct an endogenous Chow test to identify significant thresholds at which trade flows change structurally with transport costs. These breakpoints allow us to split the sample when controlling for internal borders, and to define trade market areas corresponding to specific transport costs that consistently reveal an urban hierarchy of cities. The results provide clear evidence with which to corroborate the predictions of Central Place Theory.

INTRODUCTION

In the absence of reliable data, intra-national trade patterns are in general neither well understood nor well characterized. The lack of highly granular micro-data on inter-regional trade flows deters empirical analyses on why trade agglomerations occur in some places and not in others within a country.Footnote1 Since the first empirical and seminal studies (Hillberry & Hummels, Citation2003; Wolf, Citation1997, Citation2000) up to more recent works (Andresen, Citation2010; Capello et al., Citation2018b; Martínez-San Román et al., Citation2017), there is a growing literature pointing out an internal home-bias effect as, inter alia, one of the key determinants of the location of economic activity within countries and intra-national trade agglomerations.

The causes and explanations for this intra-country effect are multiple. Whereas some studies focus on historical and cultural barriers to trade (Capello et al., Citation2018a; Nitsch & Wolf, Citation2013; Tubadji & Nijkamp, Citation2015; Wrona, Citation2018), hub-and-spoke structures (Gallego et al., Citation2015), transport-mode competitions within a country (Llano et al., Citation2017), or even the impediments raised by business networks (Garmendia et al., Citation2012), others emphasize unexpected increases in the internal border effect over the last decades (Crafts & Klein, Citation2014; Martínez-San Román et al., Citation2017). Not only these empirical factors but also prominent theoretical set-ups have been cited to ascribe the internal home-bias effect either to the fragmentation of global production chains in areas near international borders (Yi, Citation2010) or to spatial frictions between cities (Behrens et al., Citation2017).

Within this framework, the study by Hillberry and Hummels (Citation2008) sheds light on the appearance of an internal home bias at the municipal level once accurate inter-regional trade flows and very precise measures of internal distance are used. Even if this trade concentration holds over short distances in their study, the authors conclude that the internal home bias is a reductio ad absurdum of the one observed at the international level. Departing from this finding, we argue that the internal home-bias effect not only is real but also results from breakpoint changes in the trade-to-transport costs relationship that shapes a series of trade market areas around a country’s largest trading cities.

More specifically, we find discontinuous changes between trade and transport costs that map successive and distance-increasing market areas which, as a result, give rise to a hierarchy of cities as predicted by Central Place Theory (CPT) (Hsu, Citation2012; Hsu et al., Citation2014; Mulligan et al., Citation2012; Parr, Citation2002; Tabuchi & Thisse, Citation2011; Taylor & Hoyler, Citation2021; Zhu et al., Citation2021).

To arrive at these conclusions, we have relied on two unique and very detailed databases: (1) a panel database (2003–07) on individual trade flows for Spanish municipalities; and (2) an additional bilateral panel database with the precise and realistic transport costs associated with these trade flows. With these databases we propose and adopt a new methodological approach within the trade literature to determine spatial breakpoints in trade flows caused by the changing nature of transport costs. The new method resorts to a stability econometric (Chow) test for structural changes, first introduced by Berthelemy and Varoudakis (Citation1996). We apply this test to assess whether trade flows (structurally) change beyond a certain transport cost breakpoint. This information allows us to split internal distances by thresholds and introduce them into a standard gravity set-up. This new definition of internal distances is the basis, on one hand, to define cities’ market areas and, on the other, to accurately measure the internal home bias at different spatial levels for either the overall value of trade or its margins. Whereas we interpret the former under the insights of CPT and the consequent hierarchy of cities, we show that the latter leads to an overestimation of the (‘illusory’) internal border effect when transport costs are not correctly controlled for. Indeed, we argue that the arbitrary use of administrative boundaries as a spatial limitation to collect data and account for border effects becomes an aggregation artefact (Hillberry, Citation2002a; Llano-Verduras et al., Citation2011) that leads to misleading conclusions about the impediments to intra-national trade flows.

This paper contributes to and links with the literatures on intra-national trade flows and their associated system of city mapping. We show that it is not internal administrative borders that effectively hamper trade, but the change in the relationship between the trade flows and transport costs that generates the border effects achieved within a country. This is consistent with the removal of intra-national administrative impediments to trade through single-market agreements. We also contribute to the empirical study of trade-market areas, as intra-national economic boundaries depending on either their geographical reach (Löffler, Citation1998; Sohn & Licheron, Citation2018) or their shape throughout a hierarchy of cities (Hsu, Citation2012; Hsu et al., Citation2014; Taylor & Hoyler, Citation2021), and provide empirical evidence for the prominent and theoretical literature on trading cities (Anas & Xiong, Citation2003; Behrens et al., Citation2017; Cavailhes et al., Citation2007; Mori & Wrona, Citation2018). Finally, we develop further methodological approaches for highly granular gravity-trade models (Head & Mayer, Citation2014) and their crucial debate on the treatment of non-linear transport costs (Abbate et al., Citation2012; Gallego & Llano, Citation2014).

After reviewing the contributions of the study, we now outline our sequential research strategy. The first step compiles a novel database of road freight shipments consisting of micro-data for individual shipments between Spanish municipalities. This statistical information, drawn from the Spanish Road Freight Transportation Survey (RFTS), allows us to calculate the bilateral value of trade and decompose it into the extensive and intensive margins. Next, for each bilateral flow we compute the actual monetary transport cost, or generalized transport cost (GTC), which corresponds to the minimum economic cost between any origin and destination. This is the sum of distance-related costs (e.g., fuel, toll, tires) and time-related costs (e.g., salaries, insurance, taxes) for each route between any pair of Spanish municipalities. To calculate these GTC, we use the programming techniques available in Geographic Information Systems (Arc/GIS) which optimize for the lowest cost routes using the existing and digitized road networks for the period 2003–07. In contrast to all previous studies, which use the standard and non-monetary transport cost proxies of distance or travel time, we introduce a real euro and time-varying measure of the spatial frictions affecting trade.

Subsequently, we analyse transport costs and border impediments for the three intra-national administrative borders in Spain, that is, NUTS-5 (municipal), NUTS-3 (provincial) and NUTS-2 (regional), using the Poisson pseudo-maximum likelihood (PPML) estimator to control for pervasive zero-valued trade flows within the gravity model (Santos Silva & Tenreyro, Citation2006, Citation2010, Citation2011). The results show that municipal borders have a stronger impact on trade flows and extensive and intensive trade margins than the results reported by, among others, Hillberry and Hummels (Citation2008). Provinces and regions, meanwhile, have a much weaker impact than the literature suggests (Requena & Llano, Citation2010). Thanks to the PPML estimator, we show that this spatial trade pattern is explained by the over-concentration of trade flows over short distances; therefore, the internal effect of administrative borders tends to be underestimated for short distances and overstressed for long ones.

To explain these results further, we introduce to the trade literature the method developed by Berthelemy and Varoudakis (Citation1996) for endogenous economic growth models. They propose a stability Chow test to detect specific breakpoints shaping different structural models in their data. We adopt this test to detect differences in the trade-to-transport costs relationship and find that only breakpoints arise in the total value of trade and its extensive margin, not the intensive one, pointing out that the spatial concentration of trade flows is mainly driven by the extensive margin. Once we divide our transport costs measure into subsamples (distance intervals) and determine the existence of a series of statistically significant breakpoints, we split our PPML regressions by these new transport cost thresholds. This stands in contrast with, for example, Eaton and Kortum (Citation2002) and Anderson and Yotov (Citation2012), who segment trade flows by arbitrary distances. Thanks to these statistically defined transport-cost thresholds, we conclude both that the internal border effect is neither unique nor has a single non-linear quadratic impact on trade, as the literature emphasizes. Indeed, we argue that it is precisely the existence of such thresholds that truly raise internal impediments to trade, which eventually spill over several and consecutive administrative boundaries. To the best of our knowledge, this is the first analysis using a structural-break methodology for the trade-to-transport cost relationship in the trade literature.

Lastly, since the density of trade flows is geographically localized in short-distance thresholds and mainly driven by the extensive margin, we show that the resulting agglomeration economies shape a series of nested trade-market areas around increasingly large cities, giving rise to an urban hierarchy whose mapping coincides with the postulates of the CPT. Thus, we highlight that the so-called internal border effect is the consequence of higher order cities that act as supply centres for either their metropolitan areas or surrounding cities of lower rank, a relatively unexplored relationship in the empirical literature on urban areas. These empirical regularities are promising for future research linking trade and CPT.

The remainder of the paper is structured as follows. The next section explains the databases and the decomposition of trade flows. The third section presents the econometric analysis. The fourth section discusses the structural breakpoint methodology and its application to the gravity model of trade. The interpretation of our results in light of CPT and its associated hierarchy of cities is performed in the fifth section. The last section draws the main conclusions.

TRADE FLOWS, EXTENSIVE AND INTENSIVE MARGINS, AND TRANSPORT COSTS

Trade value data and the extensive and intensive margins

To build our value trade data we rely on a micro-database on freight flows by road: the Road Freight Transportation Survey (RFTS), compiled by the transport division of the Spanish Ministry of Transport, Mobility and Urban Agenda for the period 2003–07. This database is based on a randomly surveyed sample of freight companies and independent truckers with vehicles weighing more than 3.5 tons, which together account for almost 85% of all internal freight flows. It includes vehicle and shipment characteristics such as the type of commodity shipped, the number of tons carried by the truck, the number of shipments, and the distance travelled between any origin i and destination j, registered at the NUTS-5 municipal level.Footnote2 For the 2003–07 period, the database contains more than 1,890,000 records (empty shipments included) which, on average, involve 7178 municipalities as origins and 7913 municipalities as destinations. However, most of them are municipalities with little relevance in terms of population and trade volumes. Therefore, we consider municipalities with more than 10,000 inhabitants in the period 2003–07. The final database consists of 633 municipalities whose freight trade volume by road represents 75.5% of the total freight flows.

Thanks to the RFTS micro-database we calculate the total value of trade and subsequently break it down into the extensive and the intensive margins. Adopting Hillberry and Hummels’ (Citation2008) proposal, the total value of trade between each origin–destination (Tijt) at time t can be decomposed as follows (dropping subscript t to avoid notational clutter): (1) Tij=NijPQ¯ij(1) where Nij represents the total number of shipments (i.e., the extensive margin); and PQ¯ij is the average value per shipment (i.e., intensive margin) composed of the mean of values of prices (P¯ij) and tons (Q¯ij).Footnote3

maps the total value of trade (in logarithms) for the 633 municipalities. The largest trade flows occur in areas with the highest levels of economic activity (Madrid, the Mediterranean coast and the Basque Country), whereas less populated areas (south-west and north-west) register trade only around largest cities. This spatial pattern is consistent with previous evidence on the concentration of Spanish international exporters (Ramos & Moral-Benito, Citation2017).

Figure 1. Logarithm of total value of trade for the 633 municipalities.

Note: Graduated map with averages for the period 2003–07.

Source: Authors’ own elaboration from the Spanish Road Freight Transportation Survey (RFTS) data.

Figure 1. Logarithm of total value of trade for the 633 municipalities.Note: Graduated map with averages for the period 2003–07.Source: Authors’ own elaboration from the Spanish Road Freight Transportation Survey (RFTS) data.

Generalized transport cost (GTC): advantages over distance and time proxies

Another novel aspect of this study is the use of a monetary measure of transport cost that improves the proxies normally used in the literature, mainly geographical distance and travel time. This variable stands for a GTC definition corresponding to the least-cost itinerary between an origin and a destination, as discussed by Combes and Lafourcade (Citation2005), Zofío et al. (Citation2014) and Persyn et al. (Citation2020).

In particular, we have calculated the specific GTCs that match the inter- and intra-municipal trade flows represented by door-to-door shipments. Unlike distance and travel time, the GTC differentiates economic costs related to both distance and time cost, and accounts for changes in transport operating costs, particularly those related to the transport service market (e.g., fuel prices) and regulatory and institutional conditions (e.g., mostly related to wages in the labour market).

In the GTCs methodology, it is assumed that the road freight firms providing the transportation service between any two locations minimize the economic costs. For a given origin i and destination j in year t, a least cost algorithm is implemented using the Arc/GIS Network Toolbox. The solution yields the optimal itinerary between them accounting for both physical and economic attributes.

We make use of a set of digitized road networks for 2003–07 containing information about these attributes for every road arc, a (or segment) in a given year, t. As for the physical attributes, the main characteristics are capacity, gradient and congestion (upon which the arc speed depends). Regarding economic costs, these are related to the arc distance and travel time: dat and tat = dat/sat, where sat is the maximum speed in the arc, bounded by the legal limit. All economic costs are referred to the most representative vehicle (e.g., 80% of all transport services are performed with a 40 ton articulated truck), and vary by location l and time period t, depending on where the arcs are located (e.g., rates for every tolled road are included, and yearly updated). The economic distance unit costs in time t, denoted by ekt, that is, euros/km, include the following variables: (1) fuel costs, denoted by fuelit (i.e., it is assumed that the tank is refilled in origin i where the transport firm is located) – for long trips the average fuel price at origin and destination is used; (2) toll costs, tolllt; (3) accommodation and allowance costs, accomallowlt; (4) tire costs, tireit; and (5) vehicle maintenance and repairing operating costs, repmantit. Taking into account these operating costs, the total distance cost corresponding to an itinerary Iijt is: DistCijt=aIijt(kekt)dat=aIijt(fuelit+tolllt+accomallowlt+tireit+repmantlt)dat.Likewise, the economic unit costs associated with time, denoted by eht, that is, euros/h, include the following variables (when the location subscript is omitted, it indicates that these are nationwide prices): (1) labour costs; labit; (2) financial costs associated with amortization, amortt; (3) financing, fint; (4) insurance costs, inst; (5) taxes, taxit; and (6) indirect costs, indit, associated with other administration overhead, operating expenses and commercial costs. Given the driving time for an arc: tat, the overall cost associated with travel the whole length of an itinerary is: TimeCijt=(leht)(aIijttat)=(leht)(aIijtdatsat)==(labit+amortt+fint+inst+taxit+indit)(aIijtdatsat).With this information we solve for the GTC, which is the solution to the following minimization problem that identifies the least cost route Iijt in period t, among the set of itineraries Iijt: GTCijt=minIijtIijt(DistCijt+TimeCijt)==aIijt(kekt)dat+(leht)(aIijttat)=(leht)(aIijtdatsat),and whose associated optimal distance and time values are dijt=aIijtdat and tijt=aIijttat.Footnote4

Throughout we argue that a municipal and time-variant measure of transport costs, such as the GTC, is more suitable for trade flows analyses than distance and travel time because it is the only measure that captures simultaneous improvements in road transport infrastructure as well as changing regulations in time. This is portrayed in , which shows the average value for each measure of transport cost for the period 2003–07 and the individual years, plus their growth rates. The GTC is the measure with the highest variation with a significant decline over the years. Zofío et al. (Citation2014) show that both economic cost reductions and road infrastructure improvements drive this reduction (particularly through time-related costs). Physical distance remains mainly unaffected by road improvements over the years. Although it may constitute a good proxy for transport costs in cross–sectional studies, its lack of time variability makes it certainly inadequate in panel databases where transport costs are expected to change. Correspondingly, whereas the travel time proxy for transports costs captures the improvements in road infrastructure, it cannot account for changes in operating (economic) costs.

Table 1. Transport costs changes: averages for pairs of municipalities.

Additionally, to reinforce the need for a GTC definition instead of its distance and time proxies, we regress the growth rate of trade flows (ΔTijt) and their margins (ΔNijt,ΔPQ¯ij) on the growth rates of transport costs (ΔCostijt) – representing either GTC, distance or travel time – and including also a year variant and origin–destination invariant fixed effects: (2) ΔTijt=β0+β1ΔCostijt+yeart+ηi+ηj+ϵijt(2) shows the results. As expected, distance and travel time do not have significant effects either on the growth rates of trade flows or on their margins. By contrast, the GTC has a negative and significant impact on the three dependent variables. That is, reductions in the GTC – such as those related to economic cost reductions resulting from improved truck efficiency and reductions in fuel prices and salaries – lead to an increase in trade flows.

Table 2. Ordinary least squares (OLS) regressions in growth rates.

Trade densities on transportation costs

We now resort to non-parametric estimation (kernel regressions) to study spatial trade distribution using the GTC. presents the kernel regressions for each component in equation (1), respectively.Footnote5 Total value of trade – marginally lower in 2003 than in 2007 as a result of the Spanish economic boom – falls sharply in density as transport costs increase (a). This pattern is driven by the number of shipments (extensive margin) (b) dropping rapidly for all years up to the point in which an approximate GTC value is reached. The average value per shipment (intensive margin) (c), on the other hand, shows an upward trend even with increasing transport costs. This pattern is associated with the price component of the trade flows (see Figure A1 in the supplemental data online) as we discuss below in the fourth section.

Figure 2. Kernel regressions of trade variables on generalized transport costs (GTCs) for pair of municipalities.

Figure 2. Kernel regressions of trade variables on generalized transport costs (GTCs) for pair of municipalities.

ECONOMETRIC SPECIFICATION: TRADE FRICTIONS

In this section we first explore the effects of trade frictions on the total value of trade and its margins in a standard gravity model using the PPML estimator.Footnote6 We propose a set of regressions in equation (3) for ij municipal pairs (NUTS-5) relying on the trade decompositions presented in equation (1). We include trade flows (Tijt) and their margins (Nijt, PQ¯ijt) in levels. Moreover, transport costs are in levels and take a quadratic functional form (GTCijt and GTCSqijt) to capture non-linear effects of transport costs on trade flows (Combes et al., Citation2005) as well as increasing returns to scale in transportation activities (i.e., size of the cargo being transported), coupled with increasing returns to distance as larger batches require larger vehicles (McCann, Citation2001, Citation2005). To accurately isolate the effect of municipal border, the contiguity variable is calculated as a first-order queen contiguity that takes the value of 1 when i and j share a border, but also when flows take place within the same municipality (Hillberry, Citation2002b).

For internal administrative boundaries, we consider three dummy border variables, which are codified as follows: NUTS-5 (municipalities) is set to 1 when flows are performed within the same municipality, and to 0 otherwise; NUTS-3 (provinces) takes the value of 1 when flows are in the same province, but not in the same municipality; and NUTS-2 (regions) captures flows between two municipalities that are located in different provinces but in the same NUTS-2 region. Finally, time, origin and destination fixed effects are included.Footnote7 Thus, the final specification to estimate takes the form: (3) Xijt=exp[β0+β1GTCijt+β2GTCSqijt+β3Contiguityijt+β4NUTS5ijt+β5NUTS3ijt+β6NUTS2ijt+yeart+ηi+ηj]×ϵijt,(3) where Xijt summarizes the trade flows (Tijt) and their margins (Nijt, PQijt¯). shows the results. GTC coefficients exhibit the expected negative sign in the linear component and signal negligible increasing returns in transport (i.e., an infinitesimally positive and significant value in the quadratic term). For internal borders, trade within the same municipality (NUTS-5) is much greater than inter-municipal trade flows. Indeed, the extensive margin achieves a higher NUTS-5 coefficient than the intensive one, indicating that the extensive margin is what truly drives intra-municipal trade flows on average. Other administrative levels (NUTS-3 and NUTS-2) reduce their importance, especially regional flows. It turns out that border effects for relatively long-distance flows between regions are quite reduced or almost negligible, reinforcing the role of local markets and the extensive margin in trade concentrations over short distances. Finally, our results for the border coefficients and R2 get correspondingly better statistical significance and goodness-of-fit than those previously obtained in the literature (Hillberry & Hummels, Citation2008). Given the number of properties satisfied by the PPML estimators over their ordinary least squares (OLS) counterparts, we are able to accurately capture and explain the trade patterns over very short distances. Indeed, the use of granular measures for trade flows and transport costs reduces the impact of higher regional borders, showing the relevance of the municipal border. That is, these estimations confirm the ‘illusory effect’ of regional borders once lower administrative limits are controlled for. Moreover, they indicate a non-disruption of the market for large administrative levels, in contrast to other studies for Spain (Garmendia et al., Citation2012; Requena & Llano, Citation2010) and China (Poncet, Citation2005).Footnote8

Table 3. Poisson pseudo-maximum likelihood (PPML) estimations with generalized transport cost (GTC) and internal borders.

Moving forward, we also study trade dynamics in our panel database. We re-estimate equation (3) in a cross-sectional analysis for 2003 and 2007 independently. Focusing on the relevant results, reports only the coefficients for administrative borders. All administrative levels reflect the same pattern for total trade and its margins, that is, there exists a decreasing trend between 2003 and 2007.Footnote9 Note that the NUTS-5 border in 2003 becomes even larger than in the panel data regressions, whereas the almost null (non-significant) impediments to trade of the NUTS-2 border are confirmed. These findings reflect the time-varying behaviour of the internal home-bias effect, that is, internal borders do not constantly hamper the same amount of trade. Contrary to previous evidence for the US case (Martínez-San Román et al., Citation2017), Spain experienced a decreasing internal border effect leading to a more trade integration at least in the period 2003–07.

Table 4. Cross-section regressions for 2003 and 2007.

STRUCTURAL TRADE PATTERNS: GTC THRESHOLDS AND THE GRAVITY MODEL

Breakpoints in the trade flows-to-transport cost relationship

Previous regressions provide an estimate of the internal border effect qualified by the non-linear effect of transport costs and its customary quadratic second-order effect. Nevertheless, this specification does not consider the possibility of having alternative trade frictions once we control for structural changing patterns in trade flows. Visual inspection of the kernel regressions in reveals at least two different trade flows–transport cost structures around a critical value in the interval €110–190, where their non-linear nature is clearly visible. This value implies that in very short distances trade flows are extremely sensitive to relatively low transport costs, particularly in their extensive margin, whereas they are almost independent (‘becoming flat’) for larger GTCs. Indeed, for GTC values larger than the previous range, further and multiple GTC thresholds may coexist at different levels. That is, we should consider and assess the existence of multiple GTC values at which trade flows (structurally) change once they reach one (or several) transport cost thresholds or breakpoints.

As remarked by Abbate et al. (Citation2012), previous studies (Anderson & Yotov, Citation2012; Eaton & Kortum, Citation2002; Gallego & Llano, Citation2014) propose different distance intervals to determine the magnitude of trade barriers, although they arbitrarily define these intervals. On the contrary, we rely on statistical methods to identify these intervals endogenously. We follow Berthelemy and Varoudakis (Citation1996) and apply successive F-tests on stability (Chow tests) that allow us to assess the existence of structural breakpoints in the trade-to-transport cost relationship. Thanks to these non-arbitrary breakpoints we split our transport cost variable into thresholds of distance (GTCs) that can be later considered in our gravity specification in equation (3).

More specifically, we perform a series of Chow tests to check the stability of the relationship between our two variables of interest, trade flows and transport costs. In this context, the Chow test works as follows. First, it divides the sample of trade flows and transport costs (GTC) into two j subsamples, j = 1 and 2, of size nj. Second, it defines a quasi-log-likelihood function such as QL = j=12njln(σj2). The (Chow) F-stability test detects a GTC breakpoint (valued in monetary terms, €) when the QL function is maximized for two subsamples. In this specific breakpoint, the F-test rejects the null hypothesis (at the 1% and 5% significance levels) of structural stability between trade flows and transport costs.

Note that the Chow test is performed using our specification in equation (3) in which the trade flows are the dependent variable, whereas transport costs (GTC) constitute the threshold variable upon which the breakpoints are detected. Once the first GTC breakpoint is detected using the Chow test, we perform a second test for the subsamples below and above the first GTC breakpoint previously detected. We follow the same logic for the additional subsamples that emerge as further GTC breakpoints appear. Hence, successive Chow tests are applied multiple times in sequential order to check for the existence of successive thresholds.Footnote10 In what follows, sorting the GTC breakpoints from the smallest to the greatest value we obtain the different thresholds defining the extension of successive market areas.

To simplify our stability analysis, we apply this Chow test taking means of our variables in the period 2003–07. This way, we get a cross-sectional version of our panel database.Footnote11 As stated, the econometric specification in equation (3) represents our baseline model. Nevertheless, we do not include the quadratic term of the GTC as we aim to determine non-linear relationships by means of transport cost breakpoints. Therefore, including the quadratic term of GTC would distort our analysis on the stability between trade and transports costs.

shows the GTC breakpoints obtained for the total value of trade and the extensive margin. Unsurprisingly, the test fails to detect breakpoints for the intensive margin as its components do not drive reductions in trade as shown in the kernel regressions (c). However, we confirm the existence of multiple GTC thresholds across the full spectrum of distances, providing strong evidence that trade flows are driven by the extensive margin, especially for short and medium GTCs – the first and second breakpoints for the total value of trade go hand in hand with those from the extensive margin. Moreover, trade flows are highly concentrated at low transport cost values (around €189 and €233), but over longer distances the difference between breakpoints becomes larger, indicating a decline in transport costs as an impediment to trade flows.

Table 5. Generalized transport cost (GTC) breakpoints for the total value of trade and extensive margin: averages for the period 2003–07.

shows geographical evidence of the existence of these breakpoints by mapping the specific GTC values obtained for the two largest Spanish municipalities: Barcelona and Madrid.Footnote12 The Arc/GIS Network Toolbox allows us to calculate the exact coordinates corresponding to the maximum GTC distance for the type of road and its specific attributes (capacity, gradient, congestion, etc.). Each GTC breakpoint defines an isocost ring whose shape varies with the accessibility of the road network. We refer to the market area comprised within these market isocost rings as natural trade areas because they are defined in monetary transport costs by resorting to the statistical features underlying the Chow structural test.

Figure 3. Natural trade areas using generalized transport cost (GTC) breakpoints.

Note: Averages for the period 2003–07.

Figure 3. Natural trade areas using generalized transport cost (GTC) breakpoints.Note: Averages for the period 2003–07.

These maps provide a representation of the existence of an urban hierarchical system in terms of trade areas. From these maps several aspects emerge: (1) the first breakpoint refers to the supply centre between the main city and its metropolitan area and other nearby satellite cities; (2) the second and third breakpoints successively reach some important cities (provincial capitals); (3) the fourth breakpoint appears as very relevant because it joins Madrid and Barcelona with other Spanish cities that are wealthy in terms of income and trade (mainly Valencia and Zaragoza); and (4) the last breakpoint directly links Madrid and Barcelona, indicating that trade flows overlap for the two largest Spanish cities. Also, we observe that Barcelona and Madrid have stronger links to other cities densely surrounded by highways. This is the case especially with Madrid, whose road network-centrality allows for larger trade areas reaching all regions in Spain. Finally, note that the geographical extension of the first breakpoint spans over municipal (NUTS-5) and provincial (NUTS-3) borders. It confirms that using administrative boundaries to collect and infer internal border effects could be misleading in its conclusions.

With these findings we argue that the methodology based on the Chow structural and stability test is a promising procedure to determine trade areas, once a correct definition of the dependent and explanatory variables is made available to the analysis. To our knowledge, there is no previous empirical representation of cities’ market areas based on actual trade flows and monetary transport costs (Löffler, Citation1998). More interestingly, we show in the fifth section that these empirical regularities are in line with the predictions of Lösch and Christaller’s model, whose analytical framework constitutes the core of CPT and gives rise to so-called urban hierarchical systems.

Accounting for GTC thresholds in the gravity model

Once the breakpoints capturing structural changes in the trade-to-transport cost relationship are obtained, we split our GTC distance variable into these GTC thresholds. We perform a set of independent PPML regressions for each GTC threshold using the specification in equation (3), while dropping from each GTC interval the administrative borders in which no observations are recorded. For instance, the transport cost interval of €285–513 (between breakpoints 3 and 4) has no trade flows either at the intra-municipal level or between contiguous municipalities and, therefore, these variables are dropped in the regressions for this interval. We follow the same rationale for the rest of the transport costs intervals. In fact, we contend that this is the correct approach to capture the effects of transport costs and administrative borders in such highly granular trade flows. To give support to this argument, we again perform a general specification using equation (3) and controlling for the standard quadratic non-linear GTC instead of the distance-thresholds approach. As we show in what follows, when using this continuous approach administrative borders overestimate the border effect over short distances.

and show the PPML regression for the total value of trade and the extensive margin for each independent GTCs threshold. For the total value of trade, the GTC is highly penalizing over short distances, but its negative impact decreases as the transport costs increase. It confirms that, first, the negative linear effect of frictions on trade flows varies over the spectrum of distances, and, second, it indicates that transport costs are not as detrimental to trade as normally thought – a result related to the existence of economies of distance and scale on transportation (McCann, Citation2001). Total trade flows even show a weak positive relationship with transport costs in the range €189–285. This particular result illustrates that total trade is partially driven by the intensive margin (c) and, despite growing GTCs, may present steep increases in the value of this margin. It is consistent with the existence of substitution effects in favour of higher price goods (Alchian & Allen, Citation1964; Hummels & Skiba, Citation2014), which also corresponds to higher mean price levels dominating the total value of trade (see equation A2 and Figure A1c in the supplemental data online). This result sheds light on some specific features of the trade–transport cost relationship that remain normally unexplored in the literature.

Table 6. Pseudo-maximum likelihood (PPML) estimations by generalized transport cost (GTC) thresholds: total value of trade, averages for the period 2003–07.

Table 7. Pseudo-maximum likelihood (PPML) estimations by generalized transport cost (GTC) thresholds: extensive margin, averages for the period 2003–07.

The impact of internal borders is very negative for the total value of trade and its extensive margin before the first threshold is reached, especially the (municipal) NUTS-5 border as in . NUTS-3 and NUTS-2 again reduce their negative impact as GTC increases, except for the €513–706 GTC interval (€582–655 for the extensive margin), where the regional border has its highest impact on trade flows as a result of being the only border within the interval (extensive margin). Finally, the coefficients for trade frictions in the ‘general specification’ are greater than those obtained for each GTC threshold. Therefore, we conclude that gravity models without transport cost-thresholds do not properly capture, or even overestimate, the impact of transport costs and internal borders.

EXTENSIVE MARGIN, TRADE AREAS AND THE HIERARCHICAL SYSTEM OF CITIES

The GTC breakpoints and the accumulation of trade flows driven by the extensive margin give rise to a series of empirical regularities matching the theoretical insights of CPT and its associated urban hierarchies à la Christaller. Reciprocally, trade in the intensive margin, whose density remains constant as transport costs increase (c), reflects persistent trade interactions between high-ranked cities, belonging to the spatial network of locations existing in Spain. The study of these horizontal interactions has been termed as Central Flow Theory (CFT) (Taylor & Derudder, Citation2016; Zhu et al., Citation2021) and complements CPT. Whereas the latter focuses on local relationships within the enclosed limits of precise market areas (i.e., the urban centre and its hinterlands), CFT emphasizes the network relationships between high-ranking cities. Both fields of research are experiencing a revival thanks to a new series of studies (Hsu, Citation2012; Hsu et al., Citation2014; Mulligan et al., Citation2012; Parr, Citation2002; Tabuchi & Thisse, Citation2011; Taylor & Hoyler, Citation2021).

Based on CPT, one expects market areas of influence whose geographical reach is driven by transport costs, that is, consumers and firms locate in places where they can be supplied by different cities, and taking into account the transport costs in which they incur because of their consumption or production processes. As a reflection of Von Thünen’s land–rent model, the trade-off between scale economies and transportation costs leads to the emergence of central places, which demand low value-added and homogeneous (standardized) goods from surrounding locations within its market area, while supplying them with a variety of more sophisticated goods. In these models, cities serve those locations for which consumers are willing to cover the transports costs of having the goods shipped to them. Within a given market area, CPT predicts the cooperation between vertically ranked cities located within the same market areas. However, for high-ranked cities, their demand schedules result in larger market areas that, while exhibiting a decreasing density of trade as transport costs increase, come eventually into spatial competition with other cities, that is, their respective market areas overlap (Taylor, Citation2012).

This prediction by CFT across high-ranked cities is shown in our data for Madrid and Barcelona (). Using our trade database, we compare in the kernel distributions of trade by distance for the two largest Spanish cities. It is clear that there exists spatial competition between both cities, although they present interesting differences in their trade patterns. While Barcelona spreads along the geography, that is, it presents a larger trade area, Madrid shows a higher density of trade in short distances. That is because the metropolitan area of both cities leads them to supply the surrounding cities in different ways. To meet intermediate production processes and final demand, Madrid has to reach nearby large cities with a high volume of trade by road, while Barcelona uses its port as a supplying hub for its surrounding cities.

Figure 4. Kernel regressions between total value of trade and generalized transport costs (GTCs) for the municipalities of Madrid and Barcelona.

Figure 4. Kernel regressions between total value of trade and generalized transport costs (GTCs) for the municipalities of Madrid and Barcelona.

In a stylized version of the CPT model, these cities’ market areas are spatially represented by a lattice of nested hexagons whose radii of influence are determined by the geographical extension of trade flows, which are decreasing in the GTC. Therefore, it is the trade-to-transport cost relationship evidenced through the Chow test and its associated breakpoints that determines, for different cities, the geographical extension of their market areas. In turn, this implies that cities exhibiting high trade densities across longer distances will rank at the upper echelons of the settlement hierarchy, shipping a large range of goods to many commercial partners (corresponding to the extensive margin), and presenting larger GTC thresholds, whereas small cities trade few goods with a small number of cities. Specifically, an urban hierarchy à la Christaller emerges because when considering a range of different goods, few cities can serve high-order items (specialized goods) over long distances, exhibiting longer GTCs thresholds. Meanwhile, as the order-scale of items reduces (low-order items), smaller cities provide more standardized products. In this sense:

the most central location in the entire system provides all of its goods and services thereby satisfying the so called exhaustive principle. But, moving down through this functional continuum (of goods), other locations on the landscape are sufficiently well located to provide some, but not all, goods that are provided at the most central location.

(Mulligan et al., Citation2012, p. 404)

That is, in the hierarchy of cities represented by the CPT, we should expect one or few cities (rank 1) to harbour the largest market areas and supply a full range of products, a second group of large cities (rank 2) to serve a huge variety of commodities within a still relatively large market area, and a lower tier of medium and small-size cities (ranks 3 and 4) to be geographically scattered across the space and trading fewer products with smaller or even insignificant market areas (Tabuchi & Thisse, Citation2011). Since the density of trade flows highly correlates with the number of varieties (goods) being traded and the number of trading partners, we rely on these data to perform a cluster analysis that classifies cities into ranking positions depending on the values of these two variables. This approach concurs with the assumptions of CPT and can be traced back to Christaller himself (Preston, Citation2009; Taylor & Hoyler, Citation2021; Zhu et al., Citation2021).

Empirical studies testing urban hierarchy systems based on CPT have focused only on urban population patterns and administrative borders, even when theoretical predictions and the underlying assumptions relate it to the magnitude and composition of trade flows. The reason to rely on the population proxy for trade flows is the scarcity of intercity trade data. The underlying assumption is that population size is not only a good proxy of a city market area, but also implies a more diversified demand that supports the production of a wider range of products. Thanks to our highly granular trade flows, we can now test these predictions following the aforementioned approach that relies on actual trade data. decomposes the extensive margin of trade and shows the distributions of the Spanish municipalities in 2003 and 2007 according to the number of different commodities traded by each municipality and the number of municipalities (trading partners) with which trade takes place, that is, Nijk, as defined in equation A1 and Figure A1a in the supplemental data online. Data are expressed as a percentage over the total number of municipalities. As observed, the largest number of municipalities trade between 10 and 50 commodities (10 <k 50) with a set of 10–50 municipalities (10 <j 50). The same follows for intervals greater than 50 commodities and 50 municipalities. More interestingly, these results shed light on a hierarchy of cities. The main diagonal characterizes various municipalities: a vast majority of them trade few commodities with few trading partners (these municipalities are subsequently classified as rank 4 and 3 locations by our cluster analysis), another group of municipalities trades an increasing number commodities with more municipalities (rank 2), and finally a group of only few municipalities trade a huge number of commodities (more than 100) with a large number partners (more than 100) (rank 1).

Table 8. Distribution of municipalities by different products and trading partners: Nijk. Percentage of municipalities in each interval, 2003 and 2007.

sheds light on the urban hierarchy system. As it can be observed, the main diagonal characterizes different types of cities in the lines expressed above. We rely on cluster analysis to group these municipalities getting further evidence of the hierarchy of cities.Footnote13 The group classification of cities obtained with the cluster analysis is reported in , with an overwhelming predominance of medium and small municipalities (ranks 3 and 4) and a skewness distribution for the largest ones (ranks 1 and 2).

Table 9. Municipalities grouped by clusters: averages for the period 2003–07.

confirms the theoretical insights of CPT by means of kernel regressions for each group of clustered municipalities. Those belonging to a higher rank present higher trade densities across all distances (GTCs), especially over short ones. Moreover, the elasticity of shipments to GTCs is lower for all transport costs, that is, in the short distances ranks 3 and 4 cities are not as sensitive to GTCs as higher ranked cities. The overall concentration of trade flows in short distances observed in is clearly driven by these ranks 1 and 2 municipalities whose higher kernels envelop those of lower ranks.

Figure 5. Kernel regressions between total trade and generalized transport costs (GTCs) for each pair of municipalities grouping them by clusters.

Note: Averages for the period 2003–07.

Figure 5. Kernel regressions between total trade and generalized transport costs (GTCs) for each pair of municipalities grouping them by clusters.Note: Averages for the period 2003–07.

Furthermore, presents the results of applying the Chow test methodology to each of these four city ranks. We report the identified breakpoints (GTCij in €), along with the average value of trade observed at each breakpoint (Tij), and the value of trade per euro of transport cost (Tij/GTCij). The results clearly show that higher ranked cities present larger market areas by trading more for any given GTC threshold or, equivalently, that a given value of trade is achieved at lower GTCs for higher ranked cities. Note that the method identifies a single breakpoint for rank 4 cities because their trade values over short distances are much lower.

Table 10. Generalized transport cost (GTC) breakpoints by city rank: averages for the period 2003–07.

We map the Spanish urban hierarchy from these results. To our knowledge, is the first illustration of a hierarchy based on trade flows, differentiating municipalities by their rank. We show the market areas corresponding to the breakpoints reported in for rank 1 municipalities (Madrid, Barcelona, Valencia and Seville). The first threshold (BP(1):€94), covers their metropolitan areas, whereas their second and third breakpoints reach rank 2 or even rank 3 municipalities, as predicted by CPT. There is an additional breakpoint at the value of €665 that we do not show to ease the readability of the map This last breakpoint represents the distance between the rank 1 cities, particularly Madrid and Barcelona, emphasizing the results illustrated in .

Figure 6. Urban hierarchy and natural trade areas for rank 1 municipalities.

Note: Plotted are the first, second and third generalized transport cost (GTC) breakpoints. The fourth breakpoint is equal to e665.

Figure 6. Urban hierarchy and natural trade areas for rank 1 municipalities.Note: Plotted are the first, second and third generalized transport cost (GTC) breakpoints. The fourth breakpoint is equal to e665.

Moreover, taking as reference , if we portrayed the rings corresponding to the market areas of lower ranked cities based on the density of trade values by GTC thresholds, this would show that their (smaller) market areas are nested (or ‘enveloped’) within the (larger) market areas of higher ranked cities. For instance, the second breakpoint for rank 1 cities, equal to €188, and presenting an average value of trade of 16.33 (in logs), is greater than the first breakpoint for rank 2 cities, €144, whose value of trade is smaller at 15.67 (in logs). This relationship holds as we move towards higher breakpoints and city ranks. In this regard, resembles the classical lattice of nested hexagons represented in Christaller’s diagram. Nevertheless, we recall that Christaller’s model constitutes a stylized representation of precise geometrical relationships and, therefore, the actual size of real market areas may not match exactly the patterns of a hexagon lattice. First, markets areas may overlap, that is, following Taylor (Citation2012) cities compete for commercial influence, as shown above for Madrid and Barcelona. Second, low ranked market areas may fall within two high-ranked market areas, a situation that in Christaller’s idealized model does not occur. Third, it is also worth noticing that the geographical extension of the detected breakpoints exceeds internal administrative borders, particularly NUTS-5 (municipal) and NUTS-3 (provincial) territorial units, that is, trading activity spans several administrative levels. Overall, in light of these results, we contend that the high agglomeration of trade flows at the municipal level obtained in is mostly driven by the trading activity of ranks 1 and 2 cities.

CONCLUSIONS

We compiled two unique and granular databases of municipal trade flows and monetary transport costs to assess trade agglomerations around specific areas (municipalities). Relying on the gravity model, we focus on the role played by trade frictions and intra-national borders to show that regional borders have a much lower or even negligible impact on trade, whereas NUTS-5 borders and surrounding areas concentrate the largest share of trade flows. In contrast to previous studies, we argue that this trade agglomeration over short distances has nothing to do with (unreal) border impediments, but arises as a result of transport cost thresholds that shape trade market areas around large municipalities, thereby defining a hierarchical system of cities.

To confirm this idea, we apply the endogenous Chow test to the trade literature, allowing us to determine these specific breakpoints at which trade flows change structurally with transport costs. With this approach, we correctly measure the impact of internal borders and confirm the high density of trade at GTC values below €189 (equivalent to approximately 170 km or 110 min). Finally, we map these GTC breakpoints showing the emergence of an organized system of cities in terms of trade market areas in which trade agglomerations are driven by the largest municipalities in the urban hierarchy. We highlight that these results provide empirical evidence for the insights of the CPT and call for future research on the spatial trade distribution of the Spanish cities.

Supplemental material

Supplemental Material

Download PDF (280.3 KB)

DISCLOSURE STATEMENT

No potential conflict of interest was reported by the authors.

Additional information

Funding

This work was supported by MINECO [grant number ECO2013-46980-P]; and the Ministerio de Educacion, Cultura y Deporte [grant number FPU-2010].

Notes

1. Although the lack of official databases on intra-national trade flows discourages empirical trade analysis, recent and prominent studies provide estimates that apply context-specific techniques to survey data (Wolf, Citation2000; Hillberry, Citation2002a; Hillberry & Hummels, Citation2008), use other official data sources at the national level (Poncet, Citation2005; Yilmazkuday, Citation2011, Citation2012) or rely on regional input–output databases (Thissen et al., Citation2019).

2. For a full description of the elaboration of the trade database, see Appendix A1 in the supplemental data online.

3. Appendix A2 in the supplemental data online explores a second-level decomposition for the extensive and intensive margins.

4. For a complete presentation of GTC methods, their calculation over time in real terms and the definition of suitable variation indices, see Zofío et al. (Citation2014) for the Spanish case and Persyn et al. (Citation2020) for the whole European Union region.

5. We use the Gaussian kernel estimator with n=100 points and allow it to calculate and employ the optimal bandwidth.

6. The PPML method accounts for the fact that the sample of municipalities includes a large number of zeros, offering estimators that are more efficient than their OLS counterparts. This method identifies and eventually drops regressors that may cause the non-existence of (pseudo-) maximum likelihood estimates. Therefore, it presents several advantages for trade data in light of the problems posed by the presence of numerous zeros, and in the presence of dummies (see also Head & Mayer, Citation2014). In addition, the PPML can resolve the bias caused by heteroscedasticity, serial correlated error and multicollinearity due to a high correlation among determinants in the gravity equation and the origin–destination fixed effects.

7. We control for time, origin and destination fixed effects to recover estimates on time-invariant controls, although our estimates remain robust to the inclusion of origin–year and destination–year fixed effects. We acknowledge that further developments of the PPML estimator allow us to include time-invariant dyadic fixed effects (Correia et al., Citation2020), but thanks to the inclusion of our main variable of interest, the bilateral GTC, plus the other dyadic variables (contiguity and administrative borders), we are able to capture bilateral characteristics between all pairs of municipalities.

8. Appendix A3 in the supplemental data online discusses equivalent PPML regressions for the second level of the trade flows decomposition.

9. We perform a set of mean tests and reject the null hypotheses of equal border coefficients between 2003 and 2007.

10. As argued by Diallo (Citation2014), the advantage of this endogenous version of the Chow stability test is that it is not required to provide in advance a breakpoint at which we suspect the existence of a structural change in the stability between our variables of interests. Instead, it endogenously determines the exact (GTC) breakpoint at which the dependent variable (trade flows) changes structurally with respect to our GTC variable. This test is available in STATA thanks to Diallo (Citation2014) (see also Berthelemy & Varoudakis, Citation1996, for an application of the Chow test in the economic growth literature).

11. We choose to convert our panel database for the period 2003–07 into a cross-sectional version of it based on averages of our trade and transport costs variables because applying the Chow test for the individual years 2003–07 gives approximately similar GTC breakpoints as those finally reported in .

12. The GTC values in the maps are the same in as those in , but expressed in distance-equivalent terms, so as to be able to plot them with Arc/GIS software. The GTCs represented are those obtained for the complete sample of 633 municipalities, but city-specific breakpoints could be also provided by the authors upon request.

13. We base our municipal cluster analysis on the averages for the number of commodities and the number of trading partners for each municipality in the period 2003–07. We use the K-means (centroids) algorithm and obtain the four groups portrayed after 25 iterations.

 

REFERENCES