1,643
Views
0
CrossRef citations to date
0
Altmetric
Articles

The Impact of Road Development on Household Welfare in Rural Papua New Guinea

ORCID Icon, ORCID Icon, &
Pages 933-953 | Received 15 May 2020, Accepted 24 Feb 2023, Published online: 03 Apr 2023

Abstract

In this paper we evaluate the impact of road development on household welfare in rural Papua New Guinea (PNG) between 1996 and 2010, using two geocoded cross-sectional national household surveys and corresponding road maps. We make use of time-variation in road surface type and condition as recorded in PNG’s National Road Asset Management System, focusing on routes that connect rural households to urban areas. To tackle endogenous placement of road infrastructure programs, we employ a correlated random effects model that controls for the location-specific average road quality over the period of analysis. We also use a newly developed generalised quantile regression method to investigate whether road works favour the poor. Our estimates show that better roads to nearest towns lead to higher consumption levels and housing quality, and to less reliance on subsistence farming. The effects are stronger among poor and remote households.

1. Introduction

Road access is one of the key elements necessary for rural economic development, generally offering the sole means for connecting people to markets and public services. Better roads connect rural households to goods and labour markets, providing a greater variety and lower prices of essential inputs and consumption goods, as well as higher prices and demand for local products (Gibson & Rozelle, Citation2003). Increased market access may raise local productivity and wages, and enable the transformation from subsistence agriculture to growing cash crops and to non-agricultural activities, thus diversifying household income sources (Aggarwal, Citation2018; Mu & van de Walle, Citation2011). Better roads may also attract financial service providers, facilitating agricultural investments and consumption smoothing (Binswanger, Khandker, & Rosenzweig, Citation1993), and enhance access to and quality of services like schools and hospitals (Bell & van Dillen, Citation2014).

Each of these factors suggests that better roads lead to higher average household consumption, which is confirmed by several studies (Knox, Daccache, & Hess, Citation2013). The distributional effects of better roads, however, are less understood, and the relative benefits to poor households are unclear. If better-off households are more able to compensate a lack of good roads –say, because they can better smooth out their consumption during seasons where roads are not usable, or through access to four-wheel drives – the poor would experience a relatively higher productivity gain from improved roads. Conversely, non-poor households may benefit more from better roads, since they might be able to scale up agricultural production more easily, or because the poor may be kept from road utilization due to transport costs. The effects of roads on other development outcomes beside consumption are also ambiguous. Non-agricultural employment may increase or decrease as a result of better roads, depending on the relative change in agricultural productivity. School enrolment may increase or decrease, depending on how infrastructure changes education returns and the opportunity cost of schooling (Adukia, Asher, & Novosad, Citation2020).

In this paper, we investigate the effect of road quality on household welfare in rural Papua New Guinea (PNG) between 1996 and 2010. PNG is a good case study ground for the effects of rural infrastructure development, due to its limited road access and high variability in road quality. In 2009, the country had a road density of 56 km per 1000 km2, which is very low compared to other countries in the region (neighbouring Indonesia had 250 km per 1,000 km2). In the same year, only 13% of roads were sealed, while the majority of roads were gravel or dirt roads.

We combine two household surveys with censuses as well as GIS datasets of the road network from the time the household surveys were administered. The road map data contains measures of surface type and condition for each road section of 100 m. Together with the location of households, we can calculate the distance to the nearest road, and the quality and length of the shortest route leading to the nearest town. This allows us to construct a set of road variables that captures quality at a very granular level, considering not just the section proximate to the households but arguably the most important road for each household in its entirety.

We use six indicators for household welfare: consumption, poverty status, household engagement in subsistence farming, wage employment, housing quality, and school enrolment. Our results suggest that access to better roads increases consumption and reduces poverty by facilitating the transition out of subsistence farming and into more market-based agriculture. Access to better roads leads to a higher probability of living in a house made of non-traditional material, and greater likelihood that children are enrolled in school. We also find that disadvantaged and more remote households benefit comparatively more from upgraded roads.

To identify the impact of road quality on the route that leads to the nearest town, we use a correlated random effects model to account for endogeneity of road quality. Following the approach by Mundlak (Citation1978), we correct for unobserved location-specific effects by including the average of the road quality variables obtained from the two maps. The resulting estimator is identical to the within estimator, despite the absence of panel data. Our results generally point to beneficial effects of sealing roads. The estimates suggest that, between 1996 and 2010, upgrading 1 km of the route leading to the nearest town from dirt to sealed road surface increased average household consumption by about 3.2%, raised the chance households lived in a house with a high-quality roof by about 1.3 percentage points, reduced the probability a household relied on subsistence farming by 0.5 percentage points, and increased the likelihood for a school-aged child to be enrolled in school by 1.4 percentage points.

We also explore effect heterogeneity. Our model is run on several population subgroups to see how the effect of road quality on consumption and poverty varies by remoteness of the household, its education level, and its demographic characteristics. We find that the effects on consumption and poverty are at least twice as high for households living more than 30 km from the nearest town, when compared to those living closer than 30 km. In addition, we apply a generalised quantile regression estimator (Powell, Citation2020) to investigate how the effects of road infrastructure vary across the consumption distribution. This procedure allows us to examine how different consumption quantiles are affected while also accounting for covariates, making the results comparable to the results from our base specification. Our estimates weakly indicate that the effect of upgrading dirt roads is higher for the poorest households, suggesting that road works can be considered anti-poverty measures in the case of rural PNG.

The remainder of the paper is structured as follows. In section 2 we discuss the related literature on roads and household welfare, and the associated methodological challenges. In section 3 we give the country context of the study and explain the data we use. Section 4 outlines the estimation techniques we use. In section 5 we present and discuss estimation results. In section 6 we offer some concluding points.

2. Literature

Estimating the impact of roads is complicated by the fact that government decisions about where to construct new roads, or whether to rehabilitate or upgrade existing ones, are likely correlated with areas’ growth and other development achievements. These decisions are often made based on unobserved factors like expected traffic volume, local productivity, investment cost, and political benefits of placing roads in particular areas – all factors that may also affect household welfare directly (see e.g. Burgess, Jedwab, Miguel, Morjaria, & Padró i Miquel, Citation2015).

Existing research on the impact of transport infrastructure has used a variety of approaches to address this endogeneity problem. Instrumental variable estimation – which requires an exogenous variable that affects road development but has no direct effect on the outcome variable of interest – is one approach. Gibson and Rozelle (Citation2003), who also study the impacts of road infrastructure in PNG, use the year in which the national highway system penetrated the district as an instrument. A related method uses the distance from a hypothetical connection between historical centres, such as straight lines or routes predicted based on geography, as an instrument. Areas close to the connection have a higher chance of being connected to the road network, while this distance is arguably not a causal factor for economic development. This approach has been followed to obtain impact estimates of transport infrastructure in China (Banerjee, Duflo, & Qian, Citation2020) and Nepal (Shrestha, Citation2020). Gertler, Gonzalez-Navarro, Gračner, & Rothenberg (Citation2022) instrument district-level road maintenance investments in Indonesia by interacting province budgets with characteristics from other districts, exploiting the fact that resource allocation from the central government to provinces follows a strict rule set. Regression discontinuity designs have been used to study road impacts where investment decisions were made based on whether localities exceeded a threshold at some priority measure. Examples include program evaluations in Sierra Leone (Casaburi, Glennerster, & Suri, Citation2013) and India (Adukia et al., Citation2020, Asher & Novosad, Citation2020; Aggarwal, Citation2018). A technique often used for binary road variables is difference-in-differences estimation, sometimes combined with propensity score matching to allow the common trend assumption to hold conditional on covariates (Lokshin & Yemtsov, Citation2005; Mu & van de Walle, Citation2011).

Where panel data is available, another approach to address potential endogeneity in road works is to use village or household fixed effects (Gibson & Olivia, Citation2010; Khandker, Bakht, & Koolwal, Citation2009; Khandker & Koolwal, Citation2010) to assess the impact of road investments over the period covered by the panel. The fixed effects approach accounts for endogeneity caused by time-invariant characteristics of the location. The availability of multiple time periods further allows instrumentation using lagged outcomes (Dercon, Gilligan, Hoddinott, & Woldehanna, Citation2009; Khandker & Koolwal, Citation2011). The studies mentioned here all require that the same households are interviewed repeatedly over a time span sufficiently long to measure the impact of road investments. However, our study only uses repeated cross-sectional data, in conjunction with repeated measurements for road variables. Using the Mundlak (Citation1978) estimator, we apply a correlated random effects model to incorporate village fixed effects. That makes our approach applicable in a broad range of settings where panel data is not available.

Only a few studies have examined heterogeneity in the impacts of roads. Dercon et al. (Citation2009) find that access to an all-weather road in rural Ethiopia between 1994 and 2004 reduced poverty by 6.9 percentage points. They find no evidence of heterogeneity of this effect across household characteristics like size of landholdings, livestock holdings, or literacy of the household head. However, their estimates show the effect on consumption growth is larger for households with landholdings of at least a hectare and a literate household head. Dercon, Hoddinott, and Woldehanna (Citation2012) obtain complementary results, finding that remoteness from towns and poor roads are among the factors most associated with chronic poverty. Khandker et al. (Citation2009) investigate how households in Bangladesh profited from road improvement projects. They predict that villages next to an improved road experience a poverty reduction of 3–6 percentage points over 4 years. The impact on household expenditure is higher for lower expenditure quintiles in this study, suggesting that road investments are pro-poor. However, using a larger dataset and controlling for other investment programs, Khandker and Koolwal (Citation2010) find the opposite pattern. Mu and van de Walle (Citation2011) find positive and significant average effects of rural road rehabilitation on local market development in Vietnam. The authors note a tendency for poorer localities to have higher impacts due to lower levels of initial market development. A replication study by Nguyen (Citation2019) confirms these results.

3. Data and context

With a population of roughly 8.9 million people in 2020, PNG is the largest and most populated country of the Pacific region. During the years covered by the data used in this study (1996–2010), PNG’s real per capita growth was moderate – averaging 2.6% per year – and poverty incidence even grew by two percentage points to roughly 40% of the population (Gibson, Citation2012). During that time period, the share of PNG’s population living in rural areas has remained constant at around 87% according to the 2011 census, of which around 90% largely relied on (semi-)subsistence agriculture (see ).

Table 1. Summary statistics for the analysis sample

Development and maintenance of PNG’s road network suffered during the two decades following independence, when funding for road maintenance fell by half (Kwa, Howes, & Lin, Citation2010). Government expenditure on infrastructure per capita reached its minimum in 2001, but large and sustained increases in funding only began in 2010 (Dornan, Citation2016). Beside low government revenues, several other factors have made it difficult to build and maintain roads during this period: limited road management capacity in the private sector due to unsteady provision of maintenance contracts, competition for construction equipment and skilled engineers between resource extraction enterprises and the Department of Works, and disputes with owners of land proximate to road works (Lucius, Citation2010). Alongside these practical obstacles to implementation, a lack of political will due to the low visibility of road investments, as well as high levels of corruption contributed to the insufficient maintenance of the road network (Dornan, Citation2016). Lastly, PNG’s climate and geography, with its rugged topography, seismic activity, intense weathering, and high seasonal rainfall in many regions – especially in the densely populated agricultural heartland of the Highlands region – make road construction and maintenance challenging (Stead, Citation1990).

Two nationally representative cross-sectional household surveys conducted in PNG – the 1996 Papua New Guinea Household Survey (PNGHS 96) and the 2009–2010 Household Income and Expenditure Survey (HIES 09/10) – provide the primary source of data on household consumption and other indicators used in this study. The 1996 household survey included a nationally representative sample of 830 rural households in 73 census units (villages). The 2009 survey collected information from 2208 rural households in 125 villages. We also make use of variables from the PNG Census 2000 Community Profile System (CPS 2000), which contains information on the location and population of all villages and towns from the census of 2000. This allows us to locate all the villages in the two surveys as points on a map.Footnote1 The HIES 09/10 includes GPS coordinates of all surveyed households, allowing us to calculate fairly precise household-specific distances to the nearest road. In addition, we make use of the Papua New Guinea Resource Information System (PNGRIS) of the PNG National Agricultural Research Institute (Bryan & Shearman, Citation2008). This spatial database contains information on elevation, climate, and other biophysical characteristics which we include as control variables in our analysis.

For data on the status of road infrastructure over time, we rely on the road information data bank and the geographical information system of the Road Asset Management System (RAMS). The RAMS project, initiated in 1998 by the PNG Department of Works (DoW), was designed to provide a road database and analytical tool to inform policy makers about road needs and economic efficiency of investments in the road network (Jusi, Mumu, Jarvenpaa, Neausemale, & Sangrador, Citation2003). We link initial RAMS data – based on road surveys conducted between 1999 and 2001 – to the PNGHS 96 (we refer to the combined data as the 2000 map). Due to continued underinvestment in the transport sector, certain dimensions of the RAMS – particularly its traffic counts (vital to estimating a road’s value) – were not updated after 2001. However, the provincial works managers of the DoW were given financial support to update data on road quality, and data collected by DoW provide the basis for our dataset of roads in 2009. The road system consisted of roughly 26,000 km of roads in 2009. We link this dataset to the HIES 09/10 and refer to it as the 2009 map.Footnote2

Both the 2000 and 2009 road maps include detailed information for each road segment. Most importantly, surface type (sealed, gravel, dirt) and condition (good, fair, poor) are given for all segments. The 2009 road map has much more extensive coverage of the road network – covering an additional 14,000 km of roads not included in the 2000 map, consisting mostly of gravel or dirt feeder roads. However, within our study period the focus of road works in PNG was on maintenance and upgrading, and reportedly no new roads were constructed during this period (World Food Programme & Logistics Cluster 2011; also confirmed in discussions with informants at the DoW). This leads us to conclude that the higher density of roads depicted in the 2009 map is a result of an improvement of the information contained in the map, rather than the construction of new roads.

and map the roads, distinguished by surface type, comprising identified stretches of the national network in 2000 and 2009. A comparison of the maps shows that most of the missing road segments in 2000 are classified as dirt roads in 2009. This is partly because the additional roads on the 2009 map are made up almost entirely of provincial roads, which were more likely to have a dirt surface than national roads. In the online appendix we show descriptive statistics of the road network for both years, as well as transition matrices of surface type and condition for those roads included in both maps. They do not reveal an apparent trend in road status, in that the length of roads upgraded or improved is roughly offset by roads that deteriorated.

Figure 1. Roads by surface type in 2000.

Figure 1. Roads by surface type in 2000.

Figure 2. Roads by surface type in 2009.

Figure 2. Roads by surface type in 2009.

Using data from the road maps and household surveys, we construct variables indicating the length, surface types and condition of the road leading to the nearest town for the sampled households. We consider the shortest route to the nearest town from the stretch of road that is closest to the household. This route may not be a perfect representation for the roads used by surveyed households – as sometimes the nearest town may not be the most relevant – but we believe that it is a useful heuristic nonetheless. For this route, we calculate the distance for each type of surface and road condition.Footnote3 For our analysis, we focus on households that are connected to a town by a road. We exclude households that are located more than 15 km from any road as it seems unlikely that they use the nearest road regularly, making road quality unlikely to have a significant impact. Furthermore, we consider only towns that had more than 1000 inhabitants according to the census 2011 and are within 5 km of the nearest road.Footnote4 Taken together, these restrictions mean that about 20% of the rural census units in the two surveys are dropped from our analysis. The villages left out are mostly on small islands, in the deep interior in the west of the Momase region, or at the coast of Western province (all of which have a very low population density).

Due to the fact that our road map of 2000 is less detailed than the 2009 map, we have missing information on surface type and condition for some of the road segments of 2000. For the routes used in the analysis, this information was lacking for 24% of the total distance. For that reason, we drop observations where all road segments leading to the nearest town have unknown characteristics for 2000. This leaves a total of 20% of the total distance unknown for 2000. For the remaining observations, we simply assign the segments with unknown characteristics for 2000 the same characteristics as for 2009. We conduct sensitivity checks to examine the implications of this decision, which we discuss in section 5.

provides summary statistics for our sample. It includes geographic and road access variables that were merged with the household survey data. Both household surveys include sections that allow calculation of per adult-equivalent household expenditure – based on a closed interval recall method in the PNGHS 96, and on consumption diaries in the HIES 09/10 – as well as regional poverty lines based on the cost of locally consumed foods.Footnote5 Across the two household surveys, average per adult-equivalent consumption decreased and poverty incidence increased slightly. School enrolment increased by more than 14 percentage points, and average years of schooling of adults increased by almost one year. Households in the 2009/2010 HIES were slightly older (higher average age of household members), smaller (with nearly one person less, on average), and lived closer to the nearest town, compared to the households in the PNGHS 96. These differences could indicate demographic changes but are more likely caused by differences in the sampling schemes for the two surveys. also shows the distances by surface type for the routes leading from the sampled households to the nearest town, as measured by us using the road maps and household locations.

We use six different outcome variables in our regressions of road quality. The first one is the same one as used in Gibson and Rozelle (Citation2003): the logarithm of real yearly consumption per adult-equivalent.Footnote6 We divide yearly consumption by the respective regional rural poverty lines to calculate real consumption. A similar outcome variable is poverty status. We include it as our second outcome to examine specifically how the probability of being poor is affected by road infrastructure.

In addition to consumption, we are also interested in the effects of infrastructure on rural employment and structural transformation. One common change observed among rural households as a result of improved access to markets is reduced dependence on subsistence farming. Therefore, our third outcome is whether members of the household reported engaging in subsistence farming in the days prior to the survey date.Footnote7 Another indicator of structural change is having a wage job. Accordingly, we use a dummy for whether at least one member of the household is employed in a non-agricultural sector as fourth outcome variable. We expect that better roads improve off-farm earning opportunities and therefore reduce the necessity for subsistence agriculture and increase the likelihood of formal employment.

Another indicator of material well-being is housing quality.Footnote8 Traditional dwellings, with walls made of bush material and with grass or leaf roofs, are cheap to come by but have a relatively low service life. So, investments in non-traditional housing materials may signify not only an immediate improvement of living conditions but also an intention to stay in a given location more permanently. Due to the lack of credible and intertemporally comparable housing value estimates, we chose to use a dummy indicating whether the house has a good roof, i.e. a roof made of metal, tiles, or cement. If improved road access leads to better opportunities and living conditions and reduces transportation costs, we would expect more good roofs due to better access to roofing materials as well as higher demand for them.

Lastly, we examine the school enrolment ratio of children at school age. Our hypothesis is that school enrolment increases with better road infrastructure due to easier access to schools for both children and teachers.

4. Estimation

To estimate the causal relationship between the quality of the road infrastructure and our outcomes of interest, we propose a linear model of the form (1) yijt=βRjt+γDj+δXijt+μj+πpt+εijt,(1) where yijt is an outcome for household i in village j at time t, Rjt and Dj are vectors of variables related to road infrastructure, Xijt is a vector of exogenous control variables (at the household- and the village level), μj denotes unobserved, time-invariant heterogeneity at the village level, πpt is a province time fixed effect, and εijt is an independent disturbance term.

We include two types of road infrastructure variables. Dj includes the Euclidean distance to the nearest road and the distance on that road leading to the closest town with a 2010 population above 1,000 people. Since we assume no new roads were added between 1996 and 2010, these distance variables are time-invariant. Rjt captures the combined length of road segments of a particular type on the route between surveyed households and the nearest town. Since these distances add up to the total distance travelled on that road to the nearest town, which is already included in Dj, we leave out the lowest quality category of road type. This means that the coefficients in β give the welfare impact of changing a kilometre of road from the lowest quality type to the other types. Road segments are upgraded, left to deteriorate, or remain the same type over time, so Rjt is time-varying.

Since there is no overlap in sampled villages between the two surveys, we cannot difference out the term μj, and treating it as a random effect uncorrelated with all independent variables might lead to a biased estimate of β. Instead, we use the correlated random effects approach introduced by Mundlak (Citation1978). The idea is to substitute μj with the mean of Rjt over t, R¯j*, and an independent village random effect, ωj: (2) μj=αR¯j*+ωj.(2) Rjt is uncorrelated with ωj+εijt. This combined disturbance term is also independent between nearby localities, as any spatial correlation is captured by πpt and R¯j*. Since distance variables Dj are time-invariant, they cannot be included in the model of μj and are assumed to be conditionally exogenous. The model is estimated using OLS and takes into account the survey design, i.e. sampling weights,Footnote9 stratum divisions, and standard error clustering by village. To reduce collinearity between the endogenous variables and their respective intertemporal averages, we estimate EquationEquation (1) using centred terms RjtR¯j* instead of Rjt to reflect variables in terms of their deviations from averages over time.

We include several location-specific control variables. Among these are the geoclimatic variables used in Gibson and Rozelle (Citation2003) – namely, altitude (in metres), a dummy for whether the slope is above 10 degrees, a dummy indicating that the land is subject to flooding, a dummy indicating that rainfall deficits are rare, and annual rainfall (in metres). To control for the economic importance of the nearest town, we also include the logarithm of its population as measured in the census closest to the survey year.Footnote10 Population numbers could be endogenous, say, because more productive areas could enhance household welfare as well as faster population growth. To account for this potential source of endogeneity, we include the average of log population in EquationEquation (2).

We also include some household-level controls. This demands caution, however, since changes in road access could alter household characteristics. An example for such an endogenous household variable is the sector of work, which could change as a result of new opportunities created by improved road access, and thus cannot be included. We select a parsimonious set of variables that describes the composition and education level of adults in the household (see ). We report results with and without these household controls for EquationEquation (1). Province-time fixed effects are included to control for differences in outcomes due to unobserved factors such as local economic conditions, and the ability and political will to build, maintain, and upgrade roads. They constitute particularly effective controls since the roads to the nearest town, while not having geographical point locations, are for almost all observations located within the same province or at least have the largest part in the province as the corresponding villages. In addition, the province level is where decisions about road rehabilitation, maintenance, and upgrading for non-national roads are being made.

With regard to road types, we explore specifications with different levels of detail. A simple way to capture road quality is to consider only the surface type, i.e. whether a road is sealed, gravelled, or a dirt track. A more detailed categorization of road segments includes surface type together with road condition, i.e. whether a road is in good, fair, or poor shape. In this paper, we only present the results of the analysis using surface type. The more detailed model is consistent with the simpler model but shows no clear effects of road condition, likely due to a lack of statistical power as well as differences in the way road condition was assessed across time and between provinces. The results using road condition can be found in the online appendix.

Our basic model specification rests on the assumption that unobserved factors which both contribute to the respective outcomes and are correlated with road infrastructure are location-specific and fixed over the period between the two surveys. This by itself is a daring presumption given the 13-year period between surveys. Although overall rural economic output and poverty have stagnated over the study period, some areas may have gained or lost in population or economic importance in those years, potentially affecting infrastructure as well as household welfare. We attempt to capture changes in local economic and political conditions by including province-time dummies and year-specific town population numbers. The household level covariates also serve as indicators for economic and demographic conditions at a given time. With these controls, we believe that unobserved time-varying factors are not a cause for concern.

A caveat is that we cannot model selective migration that occurs in response to changes in road quality. This is a potential source of bias for most studies on the benefits of infrastructure; even when panel data is available, the whereabouts of individual migrating household members are seldom recorded. The actual effects of rural road quality on migration are unclear, with some evidence pointing to reduced out-migration (Castaing Gachassin, Citation2013), while other recent studies find no significant effects (Aggarwal, Citation2018; Asher & Novosad, Citation2020). In the online appendix, we look at correlations between our road variables and indicators of migration in PNG, which all turn out small and statistically insignificant. Given this, we believe that selective migration does not bias our estimates to a degree that would invalidate them.

In addition to the average welfare effects of rural roads, we are also interested in whether rural roads affect all households in the same manner. One open question is whether high education levels are complementary to road infrastructure, or whether it is mostly low-skilled labour that becomes more productive through better roads. Other sources of effect heterogeneity might be the gender and age of household members. For example, additional opportunities created by roads may help empower women and thereby have a larger effect on their consumption. Poor quality or lacking road infrastructure may trap older people and diminish their prospects more than those of young people due to physical constraints on walking long distances or transiting rough roads. The marginal effects of road quality may also differ between households who live close to a town and those who do not. To investigate whether this type of impact heterogeneity exists, we follow Dercon et al. (Citation2009) and divide our sample in two subsamples to estimate the model separately by subsample. We define the subsamples on the basis of: (i) whether the road distance to the nearest town is more than 30 km, (ii) whether the average years of education for household members at least 18 years old is larger than 4 years, (iii) whether the household head is female, and (iv) whether the ratio of household members above 50 exceeds 30%.

Last of all, we explore effect heterogeneity across the distribution of real consumption, which the impact estimates on poverty status do not capture. The effect of roads on poverty is driven by relatively few households around the poverty line. However, we are also interested in how relatively poor households are affected by infrastructure compared to relatively well-off households. To this end, we employ quantile regressions.

The principal idea of quantile regression is that unlike in the least squares regression framework, it is not the conditional expectation but a conditional quantile of the outcome that is a linear function of the covariates. The τth quantile of consumption yijt conditional on road quality Rjt is expressed in the linear quantile function q(Rjt,τ)=β(τ)Rjt. Each possible outcome can be related to this function: (3) yijt=q(Rjt,Uijt*), Uijt*U(0,1),(3) where Uijt* is a non-separable disturbance term normalised to a standard uniform distribution. Uijt* determines the rank of the outcome within the conditional distribution and is what causes heterogeneity in outcomes conditional on Rjt.

Khandker et al. (Citation2009) and Khandker and Koolwal (Citation2010) use quantile regressions in their studies of road infrastructure. Similarly to our estimation above, they include correlated random effect terms as covariates in their quantile regressions to account for unobserved heterogeneity. The issue with this approach is that including control variables in a quantile regression model alters its interpretation. For instance, the τth consumption quantile for households with good roads is not the same as the τth quantile for households with good roads and low levels of education. Using a similar reasoning, including the correlated random effect terms – and thereby, implicitly, an approximation to the location-specific fixed effect – yields an interpretation different from the one associated with model (3). On the other hand, not conditioning on controls and the correlated random effect terms likely creates biased estimates. It requires the assumption that Uijt* is independent of Rjt, which is quite strong, and likely holds only conditionally on covariates.

To circumvent this problem, we use a generalised quantile regression (GQR) model, as introduced by Powell (Citation2020). Here, outcomes are modelled by the same quantile function as in (3), but the disturbance term Uijt* is itself dependent on covariates Cijt. GQR jointly estimates β(τ) and the conditional quantile P(yijtRjtβ(τ)|Cijt). The covariates are thus used to predict the position in the conditional outcome distribution. This way, including control variables in the model does not alter the causal interpretation of the quantile impact estimates of road quality.

For our model, we focus on the distances by surface type Rjt, while the time-invariant distances to the nearest road and to the nearest town, Dj, are included in the set of covariates Cijt. The main reason is that the more variables are conditioned on, the lower is the variance of the conditional outcome distributions, and the lower is the difference between quantile effects. So, if we considered the outcome distribution conditional on both Rjt and Dj, we would not expect to see much effect heterogeneity. As further control variables we include everything we controlled for in our most detailed specification above, which includes location- and household-specific control variables, province-time dummies, and the Mundlak terms R¯j*. For estimation, we use the Stata routine genqreg for all quantiles from 1 to 99. To obtain 95% confidence intervals we apply a cluster bootstrap at the village level.

5. Results

Our main estimation results, based on EquationEquation (1), are summarised in . For each outcome variable, we estimate three specifications. Specification 1 is a linear regression of the model specified in EquationEquation (1), without household level controls and without Mundlak terms. Specification 2 includes household-level control variables. Specification 3 in addition contains the Mundlak terms. The first two specifications allow us to examine the effect that time-varying control variables have on the main coefficients. As these lead to considerable improvements in fit while leaving the coefficients of road-related variables nearly unchanged, we are not worried about them being endogenous to the model. Furthermore, the Mundlak terms in specification 3 are jointly statistically significant for four of the six outcomes. Accordingly, the third specification is our preferred one. For the analyses by subgroups and the quantile regressions we use this one as well.

Table 2. Impact of road type and distances on consumption and poverty status

Table 3. Impact of road type and distances on subsistence farming and wage employment

Table 4. Impact of road type and distances on having a good roof and school enrolment

To account for the fact that we test multiple hypotheses, we compute sharpened False Discovery Rate (FDR) q-values using the two-stage procedure described in Anderson (Citation2008), in addition to p-values. The q-values (in square brackets) indicate the smallest level at which the coefficient in question would be significant, given the p-values of the simultaneously tested hypotheses. We apply this correction using all road quality variables and all outcomes together, but separately for each specification.

shows the impact of road condition variables on log real per adult-equivalent consumption and poverty status. The results indicate that improved road conditions have a positive impact on consumption. An upgrade of 1 km of dirt road to sealed road on the route to the nearest town leads to a 3.2% increase in household consumption. Upgrading 1 km of gravel road to sealed road increases consumption by 2.2%. The difference between dirt and gravel roads is positive but not significant. Similarly, transforming a kilometre of dirt road to sealed road reduces the probability to be poor by 1.3 percentage points, an upgrade to gravel road reduces it by about 1.1 percentage points. The effects of sealing dirt roads on consumption and poverty hold up to the scrutiny of FDR q-values, while the effects of gravelling dirt roads have q-values slightly above 10%.

presents estimates of the impact of road type on the likelihood of engagement in subsistence farming and in wage employment. The point estimates of the effects on these two outcomes indicate that better quality roads facilitate the structural transformation from subsistence farming to economic activities that are more integrated into local markets. In particular, a 1-km change of road surface from gravel to sealed reduces the probability that a member of the household engages in subsistence farming by around 0.6 percentage points. For wage employment, the picture is less clear. It appears that turning gravel into sealed roads reduces the probability that a household member has a wage job. But the Mundlak terms in specification 3 are jointly insignificant with a p-value of 0.93, so including them might only drive up coefficient variability without reducing bias. Indeed, when leaving the Mundlak terms out, the effects break down to zero. In summary, better roads drive people out of subsistence farming and presumably toward cultivation of cash crops, as opposed to non-agricultural employment.

shows the effects on the likelihood of having a good roof and of school enrolment of children between 7 and 17. We find clear signs that improvements in roads lead to investment in housing. A 1-km increase in gravel versus dirt roads raises the probability of having a good roof by around 1.3 percentage points. The same holds for sealed versus dirt roads, even though the latter result is marginally insignificant. These findings are unsurprising given the high transportation costs of tiles or corrugated sheet metal. Finally, upgrading 1 km of dirt road on the route to the nearest town to sealed or gravel appears to increase the probability of a school-aged child to be enrolled in school by 1.4 and 1.6 percentage points, respectively. However, the FDR q-values for this last result are slightly above the 10% significance threshold, making it a somewhat speculative finding.

The magnitude of some of the effects per kilometre of changed surface type appears quite large, given a median of 29 km to the nearest town for our sample. This can in part be attributed to the long observation period of 14 years, which may have compounded the differences in outcomes between households with better and worse roads. More importantly, the estimates are identified over a road system with relatively few changes over the years. For 81% of the part that we observe for both years, the surface type remained unchanged (see the transition matrix in in the online appendix). The marginal effects of changes in surface type are likely somewhat diminishing: road planners can be expected to maintain and upgrade road segments with an eye to necessity or expected gains. The high magnitude of our findings thus seems to confirm the underfunded state of PNG's road policy at the time.

Another reason for a possible bias of our estimates away from zero lies in the way we handle road segments with missing road surface in the 2000 map. Since we assign the same surface type as the corresponding segments in the 2009 map to the missing segments, it is possible that actual changes along the route are biased downwards, leading to an overestimation of marginal effects. We repeat all the estimations but gradually exclude households with more than 50%, 20%, 5%, and 0% of segments missing on their shortest route to the nearest town, thereby respectively reducing the estimation sample to about 85%, 63%, 42%, and 20% of its original size. For all the outcomes and all but the last sample reduction, the results remain qualitatively the same and remain statistically significant, with most estimates even further away from zero than the original. Regression tables are omitted here but are available on request. The exercise indicates that our treatment of missing segments does not have a major effect on the results.

Next, we discuss the estimates disaggregated by subgroups of households to study heterogeneous effects of road quality. We only consider the models of consumption and poverty, since these are the key outcome variables of this paper. The results are reported in and , with the subsample defined in the top row. In the third row from the bottom of the tables, we report estimated correlation coefficients between the outcome variables and subset identifiers. In the last two rows, we provide p-values of Wald tests of the equality of the surface material coefficients between the two subsamples, and of the same for the coefficients of all road variables, respectively.

Table 5. Impact of road type and distances on log real per adult-equivalent consumption by subgroups

Table 6. Impact of road type and distances on poverty status by subgroups

We find clear differences when effects are estimated for households distinguished by distance to the nearest town. Households living farther than 30 km to the nearest town benefit more from dirt road upgrades than those living closer, both in terms of consumption as well as the likelihood of being poor. It appears that for shorter trips road quality plays a minor role, while for longer distances the travel cost differences between better and worse roads may prove pivotal for the decision to make a trip more often, with measurable consequences for material wellbeing. A short distance to the nearest road, on the other hand, is more relevant to those households living closer to the nearest town. Furthermore, it appears that the likelihood of being poor is less affected by road upgrades for female-led households than for male-led ones, and that households with a comparatively higher share of older household members seem to benefit more from better roads, though the differences in surface coefficients are not jointly significant. Since low consumption and poverty status are positively correlated with remoteness and relatively more older household members, these results also indicate that poorer groups benefit more from better roads.

Last of all, we present the results of the generalised quantile regressions examining the effect of road quality on log per adult-equivalent consumption. shows the marginal effects of the difference of 1 km of road between dirt and gravel, gravel and sealed, and dirt and sealed, for each quantile from 1 to 99. The confidence bands come from our cluster bootstrap. All the control variables from the prior regression models as well as the Mundlak terms are included.

Figure 3. Road type coefficients by consumption quantile.

Figure 3. Road type coefficients by consumption quantile.

The effect of the difference between a kilometre of sealed and dirt road is on average relatively larger for the lowest 20 quantiles and relatively lower for the highest 20. The average effect on log consumption for the former is 0.041, while it is 0.007 for the latter. Similarly, the effect of the difference between gravel and dirt road is decreasing along the income distribution, with an average effect of 0.030 for the poorest 20 quantiles and 0.001 for the richest 20. However, the large confidence bands generated by the bootstrapping procedure mean that these trends are not statistically significant. We interpret the results from the GQR model as limited evidence that upgrading dirt roads disproportionately benefits the poorest households in PNG and may thus be regarded a pro-poor policy measure.

6. Conclusion

In this paper, we examine the effects of the quality of PNG’s road network on rural household welfare over a 13-year period. We find evidence that upgrading roads leads to improvements in household welfare, housing quality, and school enrolment. The effects on consumption and poverty are higher for disadvantaged and more remote households. This finding complements the argument by Gibson and Rozelle (Citation2003) that due to the sparse road network and the remoteness of many poor households, infrastructure spending may be one of the few feasible targeted antipoverty measures in PNG. Our results also show that upgrading roads supports the structural transformation of households away from subsistence farming and toward market-oriented activities. This is consistent with the inhomogeneous impact on consumption and suggests that connecting rural households to local markets particularly benefits smallholder farmers.

Our identification strategy makes only use of existing administrative road inventory data in combination with repeated cross-sectional household surveys. These kinds of datasets are available for many other countries as well. The appeal of our method thus lies in its simplicity and its low data requirements. It easily lends itself for replication elsewhere, and again in PNG after the next national household survey has been conducted.

Supplemental material

Supplemental Material

Download PDF (249.1 KB)

Acknowledgements

The authors wish to thank John Gibson for his assistance with the survey data and his useful comments during early phases of the research. In addition, we thank Bryant Allen for providing access to the PNGRIS database.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Data availability statement

The authors commit to make the data and do-files available on request.

Additional information

Funding

This work was supported by the Asian Development Bank [Regional project 46185 ‘Developing Impact Evaluation Methodologies, Approaches, and Capacities in Selected Developing Member Countries’, subproject 2].

Notes

1 Since the PNGHS 96 used sampling based on census units from the census of 1990, on which data is unavailable, we first had to recode the 1990 census units. For this, we relied on the census unit names listed in Gibson and Rozelle (Citation1998) as well as the generous help of staff at the National Statistical Office of PNG. The HIES 09/10 was sampled from the census of 2000, which made locating the census units straightforward.

2 The two road datasets offer slightly different spatial representations of common road segments, with positional differences ranging up to several hundred metres. To ensure that our analysis is not influenced by differences in the coverage and spatial representation of the road network across the two years, we use the representation of the 2009 map for both maps. That means that we have to include the information on surface type and road condition of 2000 in the 2009 road map. We match roads based on road section IDs, and where those are missing, on spatial proximity. It should also be mentioned that we cannot make use of the most detailed survey of the national road network to date, the Comprehensive Visual Road Condition Survey, which was collected in 2014 by the Papua New Guinea-Australia Transport Sector Support Program (TSSP) together with the DoW. Due to the sudden heavy rise in national road investments starting in 2011, we believe that the conditions in this new survey do not adequately reflect the conditions around the time the HIES 09/10 was conducted.

3 Details on the construction of this variable, as well as a discussion of remote sensing as an alternative to identify road quality and surface type, can be found in the appendix of Edmonds, Wiegand, Koomen, Pradhan, and Andrée (Citation2018).

4 The latter criterion leads to the exclusion of three settlements that can only be reached by water. All towns with more than 1000 inhabitants in 2011 were already towns in 2000.

5 We construct per capita expenditure as well as regional poverty lines as explained in Gibson and Rozelle (Citation1998) and Gibson (Citation2012). Particularly, we use the revised consumption figures, poverty lines, and sampling weights for the PNGHS 96 explained in Gibson (Citation2012) to make expenditure and poverty comparable between the two surveys. For the HIES 09/10, Gibson (Citation2012) suggests three different consumption figures. Due to evidence of diary fatigue, we use the figure based on the shortest time horizon (7 days). The poverty lines we use take the cost of a locally consumed food basket and add the non-food spending of households whose food expenditures exactly meet this cost (Lanjouw & Lanjouw, Citation2001).

6 Like Gibson and Rozelle (Citation2003), we assign children aged between 0 and 6 years a weight of 0.5, while children older than 6 years as well as adults are assigned a weight of 1.

7 The definitions vary slightly between survey rounds. For the PNGHS 96, the variable indicates that in the two weeks prior to the survey, at least one household member engaged in the production of sago, bananas, corn, sweet potato, cassava, taro, or other fresh fruits or vegetables without selling them. For the HIES 09/10, the variable indicates that in the week prior to the survey, at least one household member engaged in agricultural production for own consumption. The means of both variables are very close, as shown in .

8 Measures of housing quality were left out in the construction of the consumption figures (Gibson, Citation2012), so are not part of the first two outcome variables.

9 The sampling weights used in the regressions of consumption and poverty status are person-specific, those in the regressions of schooling are children-specific, and those in the other regression are household-specific.

10 We take the population size for 1996 from the 2000 census, and the population size for 2009/10 from the 2011 census.

References

  • Adukia, A., Asher, S., & Novosad, P. (2020). Educational investment responses to economic opportunity: Evidence from Indian road construction. American Economic Journal: Applied Economics, 12(1), 348–376.
  • Aggarwal, S. (2018). Do rural roads create pathways out of poverty? Evidence from India. Journal of Development Economics, 133, 375–395. doi:10.1016/j.jdeveco.2018.01.004
  • Anderson, M. L. (2008). Multiple inference and gender differences in the effects of early intervention: A reevaluation of the abecedarian, perry preschool, and early training projects. Journal of the American Statistical Association, 103(484), 1481–1495. doi:10.1198/016214508000000841
  • Asher, S., & Novosad, P. (2020). Rural roads and local economic development. American Economic Review, 110(3), 797–823. doi:10.1257/aer.20180268
  • Banerjee, A., Duflo, E., & Qian, N. (2020). On the road: Access to transportation infrastructure and economic growth in China. Journal of Development Economics, 145, 102442–36. doi:10.1016/j.jdeveco.2020.102442
  • Bell, C., & van Dillen, S. (2014). How does India’s rural roads program affect the grassroots? Findings from a survey in Orissa. Land Economics, 90(2), 372–394. doi:10.3368/le.90.2.372
  • Binswanger, H. P., Khandker, S. R., & Rosenzweig, M. R. (1993). How infrastructure and financial institutions affect agricultural output and investment in India. Journal of Development Economics, 41(2), 337–366. doi:10.1016/0304-3878(93)90062-R
  • Bryan, J., & Shearman, P. (2008). Papua New Guinea resource information system handbook (3rd ed.). Port Moresby, Papua New Guinea: University of Papua New Guinea.
  • Burgess, R., Jedwab, R., Miguel, E., Morjaria, A., & Padró I Miquel, G. (2015). The value of democracy: Evidence from road building in Kenya. American Economic Review, 105(6), 1817–1851. doi:10.1257/aer.20131031
  • Casaburi, L., Glennerster, R., & Suri, T. (2013). Rural roads and intermediated trade: Regression discontinuity evidence from Sierra Leone (Working Paper). Cambridge, MA: Harvard University Department of Economics Working Paper.
  • Castaing Gachassin, M. (2013). Should I stay or should I go? The role of roads in migration decisions. Journal of African Economies, 22(5), 796–826. doi:10.1093/jae/ejt004
  • Dercon, S., Gilligan, D. O., Hoddinott, J., & Woldehanna, T. (2009). The impact of agricultural extension and roads on poverty and consumption growth in fifteen Ethiopian villages. American Journal of Agricultural Economics, 91(4), 1007–1021. doi:10.1111/j.1467-8276.2009.01325.x
  • Dercon, S., Hoddinott, J., & Woldehanna, T. (2012). Growth and chronic poverty: Evidence from rural communities in Ethiopia. Journal of Development Studies, 48(2), 238–253. doi:10.1080/00220388.2011.625410
  • Dornan, M. (2016). The political economy of road management reform: Papua New Guinea’s national road fund. Asia & the Pacific Policy Studies, 3(3), 443–457. doi:10.1002/app5.142
  • Edmonds, C., Wiegand, M., Koomen, E., Pradhan, M., & Andrée, B. P. J. (2018). Assessing the impact of road development on household welfare in rural Papua New Guinea. In N. Yoshino, M. Helble, & U. Abidhadjaev (Eds.), Financing infrastructure in Asia and the Pacific: Capturing impacts and new sources (pp. 189–235). Tokyo: Asian Development Bank Institute.
  • Gertler, P. J., Gonzalez-Navarro, M., Gračner, T., & Rothenberg, A. D. (2022). Road maintenance and local economic development: Evidence from Indonesia’s highways (NBER Working Paper No. w30454). Cambridge, MA: National bureau of economic research (NBER).
  • Gibson, J. (2012). Papua New Guinea poverty profile. Based on the household income and expenditure survey (Technical Report). Port Moresby, Papua New Guinea: Department of National Planning and Monitoring.
  • Gibson, J., & Olivia, S. (2010). The effect of infrastructure access and quality on non-farm enterprises in rural Indonesia. World Development, 38(5), 717–726. doi:10.1016/j.worlddev.2009.11.010
  • Gibson, J., & Rozelle, S. (1998). Results of the household survey component of the 1996 poverty assessment for papua new Guinea (Technical Report). Port Moresby, Papua New Guinea: Department of National Planning and Monitoring.
  • Gibson, J., & Rozelle, S. (2003). Poverty and access to roads in Papua New Guinea. Economic Development and Cultural Change, 52(1), 159–185. doi:10.1086/380424
  • Jusi, P., Mumu, R., Jarvenpaa, S., Neausemale, B., & Sangrador, E. Jr. (2003). Road asset management system implementation in Pacific region: Papua New Guinea. Transportation Research Record: Journal of the Transportation Research Board, 1819(1), 323–332. doi:10.3141/1819b-41
  • Khandker, S. R., Bakht, Z., & Koolwal, G. B. (2009). The poverty impact of rural roads: Evidence from Bangladesh. Economic Development and Cultural Change, 57(4), 685–722. doi:10.1086/598765
  • Khandker, S. R., & Koolwal, G. B. (2010). How infrastructure and financial institutions affect rural income and poverty: Evidence from Bangladesh. The Journal of Development Studies, 46(6), 1109–1137. doi:10.1080/00220380903108330
  • Khandker, S. R., & Koolwal, G. B. (2011). Estimating the long-term impacts of rural roads: A dynamic panel approach (Policy Research Working Paper). Washington, DC: World Bank.
  • Knox, J., Daccache, A., & Hess, T. (2013). What is the impact of infrastructural investments in roads, electricity and irrigation on agricultural productivity? CEE review 11-007. Collaboration for Environmental Evidence (CEE) Syntheses. http://www.environmentalevidence.org/wp-content/uploads/2014/05/CEE11-007.pdf (last accessed 6 December 2016).
  • Kwa, E., Howes, S., & Lin, S. (2010). Review of the PNG–Australia Development Cooperation Treaty (1999). Canberra: Department of Foreign Affairs and Trade, Australian Government.
  • Lanjouw, J. O., & Lanjouw, P. (2001). How to compare apples and oranges: Poverty measurement based on different definitions of consumption. Review of Income and Wealth, 47(1), 25–42. doi:10.1111/1475-4991.00002
  • Lokshin, M., & Yemtsov, R. (2005). Has rural infrastructure rehabilitation in georgia helped the poor? The World Bank Economic Review, 19(2), 311–333. doi:10.1093/wber/lhi007
  • Lucius, D. (2010). Civil works capacity constraints (Technical Report). Manila: Asian Development Bank.
  • Mu, R., & van de Walle, D. (2011). Rural roads and local market development in Vietnam. Journal of Development Studies, 47(5), 709–734. doi:10.1080/00220381003599436
  • Mundlak, Y. (1978). On the pooling of time series and cross section data. Econometrica, 46(1), 69–85. doi:10.2307/1913646
  • Nguyen, C. V. (2019). Impacts of rural roads on household welfare in Vietnam: Evidence from a replication study. Journal of Economics and Development, 21(1), 83–112.
  • Powell, D. (2020). Quantile treatment effects in the presence of covariates. The Review of Economics and Statistics, 102(5), 994–1005. doi:10.1162/rest_a_00858
  • Shrestha, S. A. (2020). Roads, participation in markets, and benefits to agricultural households: Evidence from the topography-based highway network in Nepal. Economic Development and Cultural Change, 68(3), 839–864. doi:10.1086/702226
  • Stead, D. (1990). Engineering geology in Papua New Guinea: A review. Engineering Geology, 29(1), 1–29. doi:10.1016/0013-7952(90)90079-G
  • World Food Programme and Logistics Cluster. (2011). Papua New Guinea emergency preparedness: Operational logistics contingency plan part 2 – existing response capacity and overview of logistics situation (Technical Report). Rome, Italy: World Food Programme (WFP). http://reliefweb.int/sites/reliefweb.int/files/resources/PNG%20Logistics%20CP%20-%20Part%202%20-%20Existing%20Response%20Capacity%20and%20Logistics%20Overview.pdf (last accessed 21 December 2016)