79
Views
0
CrossRef citations to date
0
Altmetric
Research Article

Explaining central government’s tax revenue categories through the Bradley-Terry Regression Trunk model

, & ORCID Icon
Received 20 Mar 2022, Accepted 13 May 2024, Accepted author version posted online: 16 May 2024
Accepted author version

Abstract

The Bradley-Terry Regression Trunk (BTRT) model combines the log-linear Bradley-Terry model, including subject-specific covariates, with a particular tree-based model, the so-called regression trunk. It aims to consider simultaneously the main effects and the interaction effects of covariates on data expressed as paired comparisons. We apply this model to financial data expressed as rankings and then transformed into paired comparisons. Tax revenues differentiated by category represent the statistical units of the analysis (i.e., taxes on income, social security contributions, taxes on property, and taxes on goods and services). We combine data from OECD, World Bank, and IMF databases for the year 2018 to investigate the effect size of socio-economic covariates and their interaction on the composition of tax revenues for a set of 100 countries worldwide. We also present a comparison with a more established method proposed in tax determinants literature and with two alternative models used for matched pairs. Finally, we discuss the implications of reported results for stakeholders and policymakers.

Disclaimer

As a service to authors and researchers we are providing this version of an accepted manuscript (AM). Copyediting, typesetting, and review of the resulting proofs will be undertaken on this manuscript before final publication of the Version of Record (VoR). During production and pre-press, errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal relate to these versions also.

1 Introduction

Public finance studies the government’s role in the economic system (Gruber, 2004), emphasizing the importance of maintaining a balanced budget by managing the gap between public revenues and expenditures. This balance became crucial after the 2007 financial crisis in the US, which later escalated into a sovereign debt crisis. The objective of public authorities is to achieve a balance between revenues and expenditures to ensure positive outcomes and prevent negative ones (Jain, 1989). Examining revenues and expenditures separately shows their effect on a country’s Gross Domestic Product (GDP): government spending directly affects GDP, whereas taxation has a contrasting effect. Nonetheless, the complex interaction between tax revenues and public spending with economic factors is not fully understood. Some studies have investigated the relationships between taxation and government expenditure. Manage and Marlow (1986) demonstrated that taxation leads to expenditure at the state level, with this causality becoming bidirectional in the short term. Anderson et al. (1986) found that government spending leads to increased taxation.

The marked interest in understanding the determinants of tax revenues led to the pivotal question of the sources of variance in tax revenues among countries. Kaldor (1963) observed that tax revenues are lower in underdeveloped countries than in developed ones, a disparity attributed to the capability of populations to pay taxes from their income surplus beyond basic subsistence needs (Boukbech et al., 2019). Emerging economies, therefore, face limitations in converting national income into tax revenues to fulfill community needs without inciting severe social unrest. It follows logically that a country’s ability to generate tax resources is positively correlated with its level of development, suggesting that more developed nations have a higher capacity to levy taxes (Brun and Diakite, 2016).

Within the discussed context, we present a modeling approach to analyze the relationship between tax revenues and government expenditure, focusing on tax revenue components and the diverse characteristics of countries. We employ the Bradley-Terry Regression Trunk (BTRT) model (Baldassarre et al., 2022) to examine the impact of socio-economic covariates and their interactions on tax revenue composition across 100 countries globally. The most significant covariates are identified using the Bradley-Terry-Luce Lasso method and are utilized in the BTRT model to group countries based on the BTRT’s output. BTRT facilitates the incorporation into the model of country-specific fiscal system heterogeneity and covariate interactions. Notably, the Environmental Performance Index (EPI) is employed as an indicator of a country’s development level and economic growth. Our findings reveal that EPI significantly influences tax revenue composition by interacting with government gross debt, health expenditure, and employment rate. The BTRT algorithm selects EPI as the primary splitting covariate. This interaction notably enhances the model’s fit, highlighting the critical role of EPI in delineating tax revenue structures.

The paper is organized as follows. Section 2 presents previous literature. Section 3 presents the original data set. Section 4 reports the key features of both the BTRT model and algorithm. Section 5 focuses on variable selection. Empirical results are shown in Section 6. Next, the BTRT model is compared with a more established model in tax determinants literature (Section 6.1), as well as with both the basic log-linear Bradley-Terry model without subject-specific covariates and the Bradley-Terry tree model (Section 6.2). Finally, Section 7 ends the paper with final remarks and advices to policymakers.

2 Background and motivations

The relationship between socioeconomic explanatory variables and tax revenues (response variable) has been investigated by analyzing cross-sectional or panel data. The goal has been to find the determinants of tax revenues. Main findings suggest the most important explanatory variables are the per capita GDP (Gupta, 2007; Pessino and Fenochietto, 2010), the peculiarities of the industries reproduced in the sectoral composition of GDP (Chelliah, 1971; Chelliah et al., 1975; Tait et al., 1979; Piancastelli, 2001; Karagoz, 2013), external factors like the levels of Foreign Direct Investment (FDI), trade (Cassou, 1997; Gupta, 2007; Bird et al., 2008), and public debt (Teera and Hudson, 2004), as well as policy makers’ choices like exchange rate, inflation rules (a.k.a. Keynes-Oliveira-Tanzi effect) and financial-fiscal policies (Tanzi, 1989; Brun and Diakite, 2016). Other studies analyze the role of government efficiency and institutional factors like political stability and political and civil rights (Bird et al., 2008; Martin and Uribe Teran, 2010). On the social side, the impact of the educational level (as a share of public expenditure on education), illiteracy rate, and population growth on tax revenues have been studied (Wallace and Bahl, 2005; Piancastelli, 2001; Pessino and Fenochietto, 2010). Accountability and civil and political rights are also considered determinants of tax revenues (Bird et al., 2008; Martin and Uribe Teran, 2010). Lastly, corruption, entry regulations, and the rule of law play an important role in explaining differences in tax revenues (Bird et al., 2014).

Different methodological approaches are utilized for similar purposes. Feltenstein and Cyan (2012) apply the dynamic general equilibrium model, whilst other studies use econometric techniques. Lotz and Morss (1967) proposed the first cross-sectional study on international tax ratios: the tax effort concept is introduced and it is demonstrated that per capita income and trade share are determinants of the tax share. Studies analyzing panel data utilize static fixed and random effect models based on the generalized method of moments for dynamic panels. Apart from these, Pessino and Fenochietto (2010) introduces a panel version of a stochastic tax frontier model.

As for the sample size, Teera and Hudson (2004) and Pessino and Fenochietto (2010) consider large samples of countries selected for the geographical location or the income level. Other studies are based on a restricted sample of countries. Based on the idea that the research for tax determinants may be inadequate if a heterogeneous group of countries is observed, as the composition of tax revenues can be different in low, middle, and high-income countries, Castro and Camarillo (2014) analyze the 34 countries that are part of the Organization for Economic Co-operation and Development (OECD) only and, for them, consider lagged values of the tax revenues over the 10 years 2001-2011.

Our empirical analysis is aimed at finding the tax revenue determinants taking into account the heterogeneity of countries, with a focus on the relationship between tax revenues and government expenditure. The design of the empirical study is peculiar and differs from those previously presented. Following the OECD classification, we study the determinants of tax revenues by decomposing them into four categories: taxes on income, social security contributions - SSC, taxes on property, and taxes on goods. This decomposition-based approach represents one of the major advantages of this analysis: the disaggregation of tax revenues allows the identification of different effects of the same covariate on different tax revenue categories. A single socio-economic characteristic could affect in opposite direction the size of two or more different tax revenue categories. In addition, the tax categories are paired compared on their size, which means that the tax revenue categories are first ordered according to their size and next compared to each other. To quantify for each country the effects of socioeconomic covariates on the size of each specific tax revenues category, the variable tax revenues is re-coded into a set of categorical variables (i.e., tax revenues are transformed into rankings and next into paired comparisons) to apply the Bradley-Terry model for matched pairs (Bradley and Terry, 1952) using the log-linear formulation with subject-specific covariates introduced in Dittrich et al. (1998).

The use of paired comparisons is reasonable when comparing countries with different fiscal systems. The OECD tax revenues classification defines a common ground where data from different countries are compared. In line with this, paired comparisons work on sizes, and not on numerical values of tax revenues, thus facilitating the comparability between countries. Most of the studies on tax revenue determinants use panel data since they assume that tax revenues change over time. Since the order of tax revenues by categories is more stable, our study utilize cross-sectional data and estimates tax determinants through paired comparisons in a cross-country dataset for a single year. We integrate data from OECD, International Monetary Fund (IMF), and the World Bank, and focus on 2018. The heterogeneity among countries is included in the BTRT model also concerning the covariates: their values are expressed as a percentage of GDP. Government expenditure is decomposed following the Classification Of the Functions Of Government (COFOG) to capture the effect of each type of government expenditure on each category of tax revenues.

To model possible interaction effects in addition to the main effects we resort to BTRT. It allows us to define interaction effects when no a-priori hypotheses are available, and produces a small tree, called a trunk, representing a fair compromise between a straightforward interpretation of the interaction effects and an easy-to-read partition of countries based on their socioeconomic characteristics and the order of their tax revenues. Thus, our study focuses on finding interaction effects on tax determinants in a data-driven manner.

Few works in public finance studies focus on paired comparison data. Dittrich et al. (2000) analyze rank-ordered preference through Bradley-Terry type models with categorical subject-specific covariates. The (complete) rankings are transformed into paired comparisons on which a tree-based log-linear model is estimated. Data partitioning considers model deviance and a forward selection and backward elimination procedure. This approach is well suited for hypothesis-based modeling. However, when no a priori hypotheses are known it requires the arbitrary introduction of higher-order interactions. Instead, Strobl et al. (2011) propose a tree-based classifier where paired comparisons are treated as response variables in a Bradley-Terry tree (BTtree) model. BTtree defines interactions when no a priori hypothesis is known based on a model-based recursive partitioning approach (Zeileis et al., 2008): Splits are selected assessing the instability of the parameters of the basic Bradley-Terry model. The tree returns in each terminal node the preference scales deriving from the object-related parameters. Nevertheless, BTtree lacks of information about how the subject-specific covariates affect the judges’ preferences. This semi-parametric model does not return beta coefficients neither for the main effects nor for the interaction effects.

BTRT is preferable to BTtree and Dittrich’s model as it accounts for the data-driven definition of interaction effects between subject-specific covariates. It results in a completely parametric model: each covariate that enters the main effect part of the model and each interaction effect has an associated regression parameter. Thus, the interpretation of results is highly facilitated as it derives both from the inspection of the tree and the interpretation of the estimated regression parameters.

3 Data

We consider 100 countries and their associated tax revenues by category for the year 2018. To ensure consistency across countries and granularity of tax revenue categories (OECD, 2018a), data are taken from the Global Revenue Statistics Database (OECD, 2018b), which utilizes the OECD classification of taxes. Taxes are classified by their base in: income and profits (heading 1000), compulsory Social Security Contribution (SSC, head. 2000), payroll and workforce (head. 3000), property (head. 4000), goods and services (head. 5000), other taxes (head. 6000). All categories are expressed in terms of the level of taxation through the tax-to-GDP ratio, computed by the ratio of the nominal tax revenue of a country and its nominal GDP for the year. This ratio is used in cross-country research studies where the aim is to compare tax levels across countries with different degrees of development.

The numerical variables of tax revenue categories are transformed into rankings assigning values from 1 to 6 to each tax category: the category generating the highest (lowest) tax revenue is assigned a value of 1 (6). Rankings are first transformed into paired comparisons between tax categories and next into consensus ranking, which represents the best compromise between a set of rankings and corresponds to that ranking in the permutation space that maximizes the sum of correlations between itself and all the other rankings. Operationally, consensus rankings maximize the extended correlation coefficient τx (Emond and Mason, 2002). The R package ConsRank (D’Ambrosio et al., 2019) is used to compute τx , and the result obtained for the tax categories is: (goods and services > income and profits > compulsory SSCs > property > workforce = other taxes). The categories taxes on the workforce and other taxes are both ranked in the last position. In the original data, in most cases, they present values equal to 0. For this reason, we do not consider them. Thus, our analysis covers 100 countries and 4 tax revenue categories.

Initial data on tax revenues are integrated with other socio-economic covariates, in prevalence expressed in terms of GDP, that explain variations in tax revenues and are obtained by combining data from IMF, OECD, and World Bank databases. These covariates are (see Table 9 in the Appendix): current account balance; employment and unemployment rate; government gross debt; government net lending/borrowing; gross fixed capital formation; change in the gross domestic product (in %); gross national savings; the volume of exports and imports of goods and services; account ownership at a financial institution; subsidies and other transfers; interest payments; compensation of employees; value-added of agriculture, services, industry, and manufacturing; population density; final consumption expenditure; banking non-performing loans; claims on central government; households consumption; government consumption; trade volume; environmental performance index; government expenditure for military, education, health, and other (as a residual category). The last covariate is obtained by subtracting military expenses, education expenses, and healthcare expenses from the total government expenditure and is intended as a proxy of the other public expenditure items included in the COFOG classification.

The variable is location (OECD database) is used to assign the H = 100 countries to four geo-economic areas: OECD countries (36), Africa (29), Asia (15), and South America (21). This variable is used to impute missing values for the other covariates. Missing values are replaced with the median value of the covariate computed in the specific area (see Table 9 in the Appendix). Observed countries for each area are displayed in the Appendix (Fig. A1 to A4) together with the tax revenues composition for each country.

4 Road to the Bradley-Terry Regression Trunk (BTRT)

Bradley and Terry (1952) proposed a method to derive a latent preference scale from paired comparison data when no natural measuring scale is available. This model has been successfully used in Dittrich et al. (2006), Choisel and Wickelmaier (2007) and Rodriguez Montequin et al. (2020).

Hereinafter, the terms rankings and orderings are used interchangeably. Rankings are numerical vectors that assign values to objects based on their size. Usually, they represent preferences expressed by H individuals (or judges). In our study, they are intended as orders of magnitude of tax revenues differentiated by tax category for H countries. Instead, orderings are nominal lists of objects that express the same concept as rankings.

The paired comparison method splits the ordering process into a series of evaluations carried out on two objects at a time. To address the problem of determining the scale values of a set of objects on a preference continuum that is not directly observable, each pair is compared and a decision is made based on which of the two objects is preferred. When the number of objects is no , the number of paired comparisons is equal to (no2).

The basic Bradley-Terry (BT) model computes the probability π(ij)i that an object i is ranked above another object j (Agresti, 1990) as(1) π(ij)i=πiπi+πj,(1) where πi and πj are non-negative worth parameters describing the location of objects on the overall ranking scale. When i=1noπi=1 is imposed, πi and πj can be regarded as probabilities: for example, πi is the probability that the object i is ranked above all the other objects by considering all the rankings expressed by or related to H individuals.

The BT model is expressed as a quasi-symmetry logistic model with parameters λiO (2) logit(π(ij)i)=log(π(ij)iπ(ij)j)=λiOλjO,(2) where λiO and λjO are object-parameters related to πi ’s in Eq. (1) by λiO=12ln(πi). The superscript O refers to object-specific parameters. The estimated probability in Eq. (2) is computed as π̂(ij)i=exp(λ̂iOλ̂jO)1+exp(λ̂iOλ̂jO), and π̂(ij)i=12 if λiO=λjO (the two objects are tied). Eq. (2) is equivalent to Eq. (1), and the identifiability of these two models requires a restriction on the parameters related to the last object no , such as λnoO=0 and inoπi = 1.

Sinclair (1982) extended Eq.(1) introducing a log-linear version (LLBT): in comparing object i with object j, the Poisson-distributed random variables y(ij)i and y(ij)j are the number of times in which the objects i and j are ranked above the other objects, respectively. In total, 2(no2) expected counts are estimated in a design matrix that is used to synthesize the information about all the considered rankings. Sometimes, it is reasonable to assume that the rankings associated with judges or individuals depend on their features. For example, the preferences expressed by judges usually depend on their characteristics. This issue characterizes the present study as well: the tax revenues composition of a country h and their order is associated with the country’s socio-economic structure. In the BT model literature, these features are called subject-specific covariates to differentiate them from the object-specific covariates and subject-object-specific covariates (Schauberger, 2015).

The LLBT model accounts for multiple subject-specific covariates. The BT model can include categorical or continuous covariates. Considering numerical covariates only, as in our application, the LLBT for the h-th judge and objects i and j is(3) logm(y(ij)i,h)=μij,h+y(ij)i,h(λi,hOλj,hO).(3)

LLBT with subject-specific covariates requires a contingency table for each judge, or individual, and each covariate value; λi,hO can be re-parameterized through a linear formulation(4) λi,hO=λiO+p=1Pβipxp,h,(4) where xp,h is the continuous covariate xp (p=1,,P) observed for individual h, and βip is the effect of xp on the location of the object i in a ranking h. The intercept λiO is the location of the object i in the overall consensus ranking, which includes the rankings associated with all the H individuals.

The deviance of the model is computed as the deviance of a fitted Poisson regression(5) D=2h=1Hyij,h×log(yij,hŷij,h),(5) where yij,h is the value of the comparison ij for a judge h, and ŷij,h is the predicted value based on the estimated model parameters.

4.1 Combining STIMA, regression trunk, and BTRT

The LLBT model with subject-specific covariates is expressed for ordinal data through a regression model for paired comparisons. The aim is to estimate in a data-driven way the main effects part of the model and, if present, the interaction effects part by implementing the Simultaneous Threshold Interaction Modeling Algorithm (STIMA) (Dusseldorp et al., 2010; Conversano and Dusseldorp, 2017) within the BTRT model. STIMA utilizes the regression trunk methodology (Dusseldorp and Meulman, 2004). Interaction effects are considered when two or more covariates do not combine additively (de González and Cox, 2007), or when they have a combined effect over and above their additive combination (Cohen et al., 2022, p.257). STIMA simultaneously estimates both main and interaction effects by growing a small tree called a regression trunk, which is next pruned to avoid overfitting. The covariates can interact with each other so that the interaction effects are considered as a particular kind of non-additivity. The STIMA’s recursive partitioning algorithm is based on the integration of Generalized Linear Models (GLM) (McCullagh and Nelder, 1989) and Classification And Regression Trees (CART) (Breiman et al., 1984).

The BTRT model combines the LLBT model with subject-specific covariates (Eq. 4) with STIMA, when the response derives from the outcome of paired comparisons. The estimated object-parameters λ̂i,h for the object i and the individual h in the node t are1 (6) λ̂i,h=λ̂i+p=1Pβ̂i,pxp,h+t=1T1β̂i,P+tI{(x1,h,,xP,h)t}(6)

The main effect part is p=1Pβ̂i,pxp,h, whilst t=1T1β̂i,P+tI{(x1,h,,xP,h)t} are the interaction effects estimated for individuals in each terminal node T. The interaction terms are T – 1 since one terminal node is treated as the reference group. The intercept λ̂i quantifies the overall location of the object i for all the individuals of the trunk. To estimate the intercept in each terminal node it is necessary to add β̂i,P+t to λ̂i. The overall number of parameters includes the number of intercepts λi , plus all the main effects βi,P and the interaction effects βi,P+t. Since one object and one terminal node are considered as reference categories, the number of model parameters is (no1)+[P×(no1)]+[(T1)×(no1)].

4.2 The BTRT algorithm

The BTRT algorithm, described in Algorithm 1, follows the STIMA algorithm with some adjustments due to pairwise comparisons.

**** Algorithm 1 about here ***

First, a design matrix composed of n=no×(no1)×H rows is defined. Indeed, the total number of rows is 2×(no×(no1)/2)×H, i.e., the product between the number of comparing objects, that is 2 when ties are not allowed, the number of paired comparisons (no×(no1)/2), and the number of individuals H. If ties are considered, the model can be extended by incorporating undecidedness parameters. Table 1 is an example of a design matrix with two subjects, three objects, and one continuous subject-specific covariate.

**** Table 1 about here ***

A generalized linear model with a log link and Poisson distribution is fitted to the root node to compute the main effects βp . The splitting procedure is applied to the root node, and the resulting child nodes, by searching for a threshold interaction between the P covariates assuming they have a combined effect on the objects’ order besides their individual (main) effect. The best candidate split maximizes the decrease in deviance of a Poisson regression model when moving from a parent node to the two possible child nodes. Once found, the best split defines a dichotomous variable zijp,t* that updates the indicator function I(·) in Eq. (6). Next, all regression coefficients are re-estimated. The number of dichotomous variables zijp,t* is equal to the number of splits leading to node t.

Two main approaches to growing the BTRT trunk are presented in Baldassarre et al. (2022): One Split Only (OSO) and Multiple Splitting (MS). In OSO, all the covariates are used as splitting covariates only once. In MS, each covariate can be used more than once to split a node. Generally, MS performs better than OSO in reducing deviance, although OSO is more appropriate when the goal is obtaining an easier-to-read partition. For our analysis, results suggest MS represents a fair compromise between goodness-of-fit and interpretation. The selected trunk shown in Sec. 6 presents few splits and fits the data satisfactorily.

The BTRT stopping criterion is based on the prior definition of a minimum number of individuals (minimum bucket size) in a node. In the default BTRT implementation, it is set to 5 but it can be changed based on the desired depth of the trunk.

Once the entire trunk is grown, pruning based on V-fold cross-validation and the c·SE rule is applied. The cross-validation deviance Dcv for a Poisson distribution and its associated standard error SEcv are computed in each step of the splitting procedure2. Dcv initially follows a decreasing trend and then begins to grow after a certain number of splits. The c·SE rule works as follows: Let t*[1,T] be the size of the regression trunk with the lowest Dcv , say Dt*cv. The best size of the BTRT trunk t** corresponds to the minimum value of t such that Dt**cvDt*cv+c·SEt*cv, where c[0,1]. The optimal choice of the pruning parameter c has been investigated in Baldassarre et al. (2022): the proportion of times BTRT captures a true interaction effect is higher than 0.8 when 0.5<c1, and it is advised to increase c as the number of objects, rankings, and the effect size of the covariates on the orderings increase. In general, the higher the value of c, the more weight is given to the standard error so that the selected trunk has a reduced size. In this case, since BTRT includes main effects and interaction effects within the same model, it would be advisable to choose conservative values of c (i.e., 0.5<c1).

5 Bradley-Terry-Luce Lasso for covariates selection

We consider a wide set of covariates as initial candidate variables for BTRT. However, one of the main drawbacks of applying the BT model with subject-specific covariates is the number of parameters to estimate. This drawback is accentuated when the BTRT model is used: in addition to the intercepts for each tax category, also main effects and interaction effects between covariates are included. This issue is solved by following a penalized likelihood approach instead of ordinary ML estimation. In this way, it is possible to first consider a high number of candidate covariates, and next, reduce the number of involved parameters by selecting the relevant ones only.

Lasso regression (Tibshirani, 1996) is commonly used for variable selection in presence of numerical covariates. In case of paired comparisons the penalty terms used in standard Lasso have to adapt the specific structure of the model (Schauberger and Tutz, 2017), and the method for variable selection that adapts to paired comparison is the Bradley-Terry-Luce Lasso (BTLL) (Schauberger, 2015). We use BTLL to select the most influential country-specific covariates to be included in BTRT. In BTLL, the BT model is reinterpreted according to Luce’s axiom (Luce, 1959) of independence from irrelevant alternatives. It states that other objects do not influence the decision between two objects. BTLL is based on the maximization of the penalized log-likelihood:(7) lp(ξ)=l(ξ)ΛJ(ξ),(7) where l(ξ) is the log-likelihood of the vector of model parameters (ξ),J(ξ) is a penalty term and Λ is a tuning parameter that quantifies the penalty degree. If Λ = 0, lp(ξ) reduces to the standard maximum likelihood estimate. In BTLL, the penalty for the set xp of P subject-specific covariates that share the same effect size on the response is(8) ϕP(βi,1,,βi,P)=p=1Pi<j|βi,pβj,p|(8)

If Λ, all the effect sizes of xp are merged into a single one, and xp is removed from the model as all its effect sizes tend to zero. Instead, when Λ = 0 BTLL reduces to BT with subject-specific covariates. In general, for a finite value of Λ some of the covariates are removed, whilst the remaining are still identified in the model. The penalty terms ϕP can be weighted based on the adaptive lasso principle (Zou, 2006), and next combined as(9) J(ξ)=l=1LϕlPl,(9) where ψl represents penalty-specific weights. Thus, all penalty terms Pl are combined into a unique penalty J(ξ) controlled by the tuning parameter Λ. The comparability of different penalty terms requires two conditions: 1) all subject-specific covariates have to be scaled to allow the comparison of their effect sizes; 2) the weights ψl have to be assigned according to the number of penalties and free parameters they include (Bondell and Reich, 2009; Oelker et al., 2015). The penalty is interpreted as a Lasso-type fusion penalty rather than a simple Lasso (Schauberger and Tutz, 2017). Similar approaches have been used in GLM (Bondell and Reich, 2009; Gertheiss and Tutz, 2010), and in paired comparison models (Masarotto and Varin, 2012; Tutz and Schauberger, 2015). Since the information matrix used in penalized likelihood approaches does not lead to standard errors or confidence intervals, we apply bootstrap to the cross-validated model to compute confidence intervals that account for the additional variance originating from the model selection procedure.

To select covariates for BTRT we use the BTLLasso R package (Schauberger and Tutz, 2019), which penalizes the Fisher scoring with a L1 penalty. The optimal tuning parameters Λ is selected through 10-fold cross-validation. For the main effect part of the BTRT model (root node), BTLL returns the cross-validated Λ as the optimal value for the size of penalties associated with the βi,p s; 13 out of 30 subject-specific covariates with an effect size equal to zero at least on two tax revenue categories are removed from the model. Descriptive statistics for the remaining covariates are reported in Table 2.

**** Table 2 about here ***

The final dataset includes H = 100 countries, no = 4 tax revenue categories, and 17 covariates. Interestingly, the covariates related to public expenditure remain in the model as the size of their main effect on the composition of tax revenues can not be overlooked.

The major drawback of our approach is the computational cost of the bootstrap procedure. In addition, BTLL selects the best covariates by only considering their main effects on each tax revenues category. Nevertheless, it still constitutes the more appropriate covariate selection procedure when dealing with the Bradley-Terry model.

6 Results

Figure 1 and Table 3 describe the BTRT pruned regression trunk. The CART’s node numbering criterion is used: each terminal node Rt is the region of the predictor space obtained from the recursive partitioning procedure. For each Rt , the number of countries H, the consensus ranking C and the correlation coefficient τx between rankings are reported. C is computed maximizing τx inside the nodes. In Figure 1, it represents a summary measure for each Rt that results from the minimization of distances between preferences without considering the characteristics of the countries (the subject-specific covariates).

**** Table 3 about here ***

**** Figure 1 about here ***

BTRT confirms EPI (x13) is a key covariate as it is selected to split the first node: 76 countries are assigned to node 2 (EPI0.7) and 24 to node 3 (EPI>0.7). The second splitting covariate is the government health spending (x4) leading to nodes 4 and 5 that are further split according to EPI and Employment rate (x1), respectively, to obtain terminal nodes R2 to R5. The employment rate is a useful indicator of current labour market conditions as it is unaffected by voluntary changes in labour force participation. The unemployment rate is influenced by the labour force size, decreasing when workers stop job-seeking and potentially increasing during labour market recovery as more individuals join the labour force. The third splitting covariate is Gross debt (x2) which splits node 3 into terminal nodes R1 and R6. Thus, BTRT finds a first-order interaction between EPI and Gross debt (R1 and R6), another first-order interaction between EPI and Health spending (R2 and R3), and a third-order interaction between EPI, Health spending and Employment rate (R4 and R5). Overall, in 5 terminal nodes τx0.7 indicates a broad consensus. R2, R3, and R5 show a similar consensus ranking: for them, taxes on goods have a substantial effect on the composition of tax revenues. In R1, R4 and R6 taxes on income are the most influential item of tax revenues. Ties are observed in R1 and R4.

Table 4 summarizes the BTRT pruning procedure. Results of model6 (mod6) are reported even if the pruning indicates model5 (mod5) as the final model. Adding another split to the mod5 trunk causes the deviance D and the cross-validation deviance Dcv to decrease. The Dcv values are mean values of deviances computed on each row of the design matrix, thus they are much smaller than the D values, computed as a sum (Eq. 5). The c·SE rule with c = 0.5 leads to prune mod6 since 0.3565+0.5×0.0014=0.3572>0.3567. Thus, the final trunk corresponds to mod5 with five splits and T = 6 terminal nodes.

**** Table 4 about here ***

Figure 2 shows the world map with the final breakdown of the countries in the regions R1,,R6. Most of the OECD countries with higher EPI and gross public debt are in R1 (reference region). R2 includes 31 countries with lower EPI and health expenditure. Besides Turkey, they are all located in Asia and Africa. R3 includes 15 countries located on different continents (e.g. Egypt, Morocco, Mongolia, and Peru) that have poor EPI but spend more than 6% of GDP on health. R4 consists of 5 countries, all located in South Africa except Tunisia, characterized by low EPI and employment rates, but with health spending larger than 6% of GDP. Countries in R5, in prevalence located in South America and Central Africa, share the same characteristics of those in R4 but have a higher labour force participation rate. Finally, R6 includes the most virtuous countries in terms of EPI and public budget management.

**** Figure 2 about here ***

In general, Figure 2 provides a proxy for groups of countries that follow a common fiscal policy. For example, R1 includes all OECD member countries with fairly similar fiscal policies (e.g. high levels of taxation on income, goods and services, and social security contribution; lower taxation on property). As discussed in Caselli and Reynaud (2020), Buera et al. (2011) and Giuliano et al. (2013), the intuition is that tax reforms in neighbouring countries affect the adoption of reforms through equal tax burdens and imitation effects.

**** Table 5 about here ***

Besides the inspection of terminal nodes, BTRT provides the estimated regression coefficients for each tax revenue. These are reported in Table 5 for taxes on income (o1), compulsory SSC (o2), and taxes on property (o3). For the reference category taxes on goods (o4) they are set to zero. Some interesting considerations emerge:

as for o1 (taxes on income), military spending (x14) and exports (x4) have a strong impact on its size. The more is the level of x14 or x4, the lower the probability that taxes on income are higher than taxes on goods (o4, reference category). On the contrary, imports (x5) and health expenditures (x16) have a positive effect on o1. Then, for taxes on income, all the coefficients of the interaction effects except for R6 (OECD members) have a negative sign.

As for SSC (o2), savings (x3), EPI (x13), and other spending (x17) have a positive and strong impact on its size. For instance, the higher x13, the lower the log odds that SSC is higher than taxes on goods. EPI has a positive and high impact on o2, despite the negative effect observed for o1. In most developed countries SSCs are likely higher than taxes on goods. In addition, government consumption (x12) has a strong and negative effect on SSC. Finally, the interaction terms for R4 (R6) has a positive (negative) effect on o2;

Taxes on property (o3) has a strong tendency to be the last ranked object as it shows the lowest value for the estimated intercept. As for the main effects, export levels (x4) and government consumption (x12) have a strong and negative effect on the size of taxes on the property. On the contrary, military spending (x14) and other spending (x17) have a positive impact on o3. All the interaction effects have a positive impact on the comparison between taxes on property and taxes on goods and services.

**** Table 6 about here ***

Table 6 reports for each terminal node the worth parameters πi (Eq. 6) and the mean value of the incidence of each tax category for all the countries inside the node. The worth parameters are computed based on the relationship between πi and λi . They are more informative than the consensus ranking C. For instance, the regions R2 and R3 in Figure 1 show the same consensus ranking. These orderings are in line with the order of the πi s in Table 6, but with a relevant difference: in R2, SSC and taxes on property are very close to each other, while in R3 the difference among them is larger. The πi s provide additional information about the composition of the estimated ordering given the characteristics of the countries falling in a terminal node. Note that OECD countries (most of them in green in Fig. 2 and belonging to R1) are characterized by the highest EPI and government gross debt. Compared to countries in other terminal nodes, they show a more balanced composition of tax revenues, with direct income taxes and social contributions dominating. Taxes on goods and services play a secondary role, whilst property taxes make up a residual share of government revenues. This interpretation is useful for researchers and policy-makers: a country with high gross debt and high environmental performances is likely to have tax revenue compositions similar to that of OECD countries.

6.1 Comparing BTRT results with tax determinants literature

Results presented in Section 6 are compared with those obtainable from the model specified in Gupta (2007), whose study is characterized by the same objectives and structure of data and represents a milestone in the tax determinants literature, and is based on the study of Lotz and Morss (1967), one of the first attempts to estimate tax determinants.

Lotz and Morss (1967) estimate the tax ratio for 52 developing countries using Ordinary Least Squares (OLS) and including the per capita income and openness to foreign trade as covariates. Their dataset is cross-sectional since tax ratios are computed as the average ratio for the years 1963-1965. The estimated tax ratios are compared, to create groups of countries with similar tax performance. Similarly, Gupta (2007) examines the main determinants of central government tax revenue using panel data for 105 developing countries over 25 years. The dependent variable is the total central government tax revenue (as a percentage of GDP). As in our analysis, Gupta focuses on the impact of debt, agricultural value-added, and trade on the determination of tax revenues. In addition, he utilizes the disaggregation of tax revenue categories as covariates, whereas in BTRT these are transformed them into paired comparisons. Moreover, in line with most of the studies on tax determinants, Gupta investigates the effect of socio-economic covariates on total tax revenue through a panel regression with fixed and random effects that include some institutional factors as additional covariates. Instead, BTRT is orientated towards the investigation of the relationship between tax revenue category and government expenditures by function (COFOG). To achieve comparability of results, the model proposed in Gupta (2007) is applied as follows: a) The dependent variable is the logarithm of the total central government tax revenue instead of tax revenue categories. The logarithmic term accounts for the heterogeneity of total tax revenues across countries. As confirmed by BTRT, low-income countries typically have tax revenues between 10% and 20% of GDP, while for high-income countries it rises to 40% (see also Besley and Persson (2014)); b) OLS is estimated on the same set of countries used for BTRT; c) All the data refer to the year 2018; d) The OLS model includes the same set of covariates used for BTRT (Table 2).

Beside the main effects, an exhaustive search of all the possible cross-product interaction effects up to the third order is performed. The selected OLS model is the one minimizing the Bayesian Information Criterion (BIC). It is compared with BTRT (Table 5) to highlight the additional information supplied by BTRT when interpreting the interaction effects.

**** Table 7 about here ***

Table 7 shows the OLS estimated coefficients. To understand how disaggregation of tax revenues by category improves interpretation of the interaction effects, values reported in Table 7 is compared with Table 5. For both OLS and BTRT the null hypothesis that the correct model includes only the main effects part is rejected in favor of the model with interaction effects (for OLS the test statistic is χ42=36.09; p < .000001, whilst for BTRT we get χ152=141.81; p < .000001). Important differences leading to different findings arise when interpreting the estimated coefficients. Gupta (2007) reports that central government gross debt has a positive impact on total tax revenue. This result is confirmed in the estimated OLS (Table 7, β̂3=0.09; p = 0.04) but β̂3 is borderline significant. This positive effect is due to the behaviour of countries with the burden of future loan repayments, which induces policymakers to raise tax rates. BTRT reports the same result in terms of the direction of the causal effect for all tax categories. For x3, the increase in gross debt has a positive and significant effect on social security contributions (o2).

Consistent with Lotz and Morss (1967), Gupta (2007) outlines the positive relationship between openness and revenue performance. Having an efficient tax system is critical, especially for countries noncompetitive in exports and natural resources (Karagoz, 2013). The OLS estimated coefficients for exports and imports are not significant. On the contrary, BTRT confirms the findings of Karagoz (2013): there is a negative (positive) relationship between exports (imports) and some tax categories: exports (x4) influence negatively taxes on income and property, whilst imports (x5) influence positively taxes on income only.

As for the effect of agriculture value-added on tax revenue, Gupta (2007) reports a strong negative impact (in line with Leuthold (1991); Stotsky and WoldeMariam (1997), and Eltony et al. (2001)). Likewise, Agbeyegbe et al. (2006) find a negative association between tax revenue and agricultural share, but not in Sub-Saharan countries where agricultural products are extensively exported and in turn increase tax revenues. However, also for these countries, the sign of the agricultural sector share is negative when income tax revenue is used as the response variable. OLS results in Table 7 show this negative association ( β̂8=4.58; p = 0.00). The unique cross-product interaction of the third order found by OLS involves agriculture value added together with government consumption and military spending. It indicates a relevant decrease in tax revenues ( β̂21=1334.55; p = 0.00), at the net of the positive effect on tax revenues of agriculture combined with either government consumption or military spending (see estimated coefficient of the second-order interactions). The same finding is not reported by BTRT: agriculture (x7) hurts the taxes on income only. On the supply side, BTRT shows that for a country with a large agricultural sector, it is difficult or politically infeasible to raise taxes, especially if the agricultural sector is mostly subsistence or if it is organized on a small scale basis (Agbeyegbe et al., 2006). On the demand side, if the agricultural sector is leading, there is no need for high expenditures for the country. The need for public goods and services, which in most cases leads to raised tax rates, tends to be relatively urban-based (Gupta, 2007).

The EPI index, used as a proxy for the country and human development, has a positive effect in OLS ( β̂13=0.35; p = 0.00). In Gupta (2007), the GDP per capita is used as a proxy for development and has a positive effect on tax revenue. This finding is consistent with Chelliah (1971) and demonstrates that a high level of development corresponds to a higher capacity to collect and pay taxes. BTRT shows similar results, with one unique difference: the proxy for the degree of development (x13) has a positive impact on every tax revenue category except for income taxes. This result demonstrates that taxes from broad-based sources such as VAT and income taxes are lower in case of greater access to other forms of revenue (Besley and Persson, 2014). BTRT provides additional findings from threshold interactions induced by EPI (from β̂i,R2 to β̂i,R6 in Table 5). In R2 and R3, low values of EPI interact with health spending to reduce taxes on income. Conversely, in R4 and R5 the average value of EPI interacts with health spending and employment rate to increase taxes on social security contributions (and reduce taxes on income when the employment rate is high). In R6, countries with high levels of EPI and reduced gross debt experience a reduction in taxes on social security contributions.

As for government expenditure by categories, OLS reports a positive association for education spending, health spending and other spending. In the main effects part their β̂ have the largest positive impact on the total tax revenue. For example, an increase in health expenditure is compensated by an increase in total tax revenue of about 2.42%. BTRT does not always estimate the same positive effect: in the main effects part, a consistent increase in taxes on property is caused by an increase in military spending (x14). Interestingly, the interaction effects induced by β̂i,R4 to β̂i,R5 indicate that higher health spending induces an increase in taxes on social security contributions (and a decrease in taxes on income) in countries with reduced (increased) employment rate and EPI below 0.70. The positive associations involving health spending indicate that high public health spending leads to lower household consumption of health services, which in turn leads to higher disposable income and, consequently, to a wider tax base for income tax (Jiang, 2021).

It is possible to explore the BTRT results from a different perspective by focusing on the threshold interaction effects that lead to the terminal nodes of the trunk. Additional information derived from BTRT is obtained from the worth parameters estimated in each terminal node, which are conditioned on a specific country-specific covariate. The worth parameters allow us to investigate how the composition of tax revenues varies as the magnitude of a covariate changes in a group of countries with similar socioeconomic characteristics. To demonstrate that this additional information is useful for policymakers, we fix all the covariates at the mean value observed in each terminal node and compute the worth parameters for each tax category as the size of agricultural value-added to GDP varies. This corresponds to simulating agricultural sector size values within the range [0,1].

**** Figure 3 about here ***

Figure 3 shows the results for each terminal node. It highlights some common features of all the groups identified by the terminal nodes. In each node, as the value added by the agricultural sector increases relative to GDP, income tax revenue (black line) decreases. This decrease is consistent with the BTRT estimated coefficients for the share of the agricultural sector. Moreover, in all terminal nodes except for R1 (third row, bottom right), when the contribution of the agricultural sector to the economy is high, countries prefer taxes on goods and services (blue line), whilst resources from property taxes and social contributions are lower (red and green lines, respectively). The only case when this situation does not occur is for countries belonging to R1 (countries belonging to OECD). Here, when the agricultural sector is important, other things being equal, governments prefer to raise resources through social contributions, whilst goods and services and property taxes have approximately the same degree of preference. Finally, countries in R6 (third line, bottom left), which are those more virtuous in terms of debt management and environmental performance, raise their resources mainly through income taxes until the share of the agricultural sector is lower than about 80% of its maximum value. If this share exceeds 80% most of the fiscal resources of central governments come from taxes on goods and services.

The analysis of the composition of tax revenues for different groups of countries based on the size of their agricultural sector can be repeated for any covariate. Once the final trunk and the groups of countries with similar tax revenues have been identified, BTRT further investigates how the composition of government budget revenues varies as long as a country-specific covariate varies within the same group. This investigation can be done by comparing the groups of countries in the terminal nodes of the tree against each other.

To conclude, BTRT is preferable to OLS because of its tree-based structure: an OLS coefficient is not case-specific, as it refers to a conditional expectation. Thus, it is more interesting to estimate specific coefficients for different subsets of observations. In this respect, BTRT identifies automatically a homogeneous subset of cases corresponding to groups of countries with which the same set of coefficients and similar orders of tax revenues are associated. This homogeneous subset identification helps in explaining the structure of, and the differences between, the fiscal and economic systems of the subsets of countries.

6.2 Comparing BTRT with LLBT and BTtree

BTRT is compared with LLBT (Sinclair, 1982) without country-specific covariates and BTtree (Strobl et al., 2011). LLBT is similar to the BT model of Dittrich et al. (2000).

Both models are implemented in the prefmod R package (Hatzinger and Dittrich, 2012), which does not allow for the inclusion of numerical subject-specific covariates. Thus, these models assume the covariates do not affect the order of magnitude of the tax revenue categories. Thus, their main aim is to find a consensus ranking based on a probabilistic approach. Moreover, LLBT is not based on recursive partitioning, thus different results based on the characteristics of individuals are not available and only the average value of λ for each individual is computed. The LLBT’s estimated worth parameters (Table 8) are: π1=0.33,π2=0.11,π3=0.04, and π4=0.50. The consensus ranking in the root node of the BTRT trunk reproduces the same ordering of the items: Table 8 ( 4th object) in the first position; taxes on income ( 1stobject) in the second position; taxes on property ( 3rd object) in the third position and the SSC ( 2nd object) in the last position.

**** Table 8 about here ***

The BTtree model, implemented in the psychotree R package (Strobl et al., 2011), is instead capable of integrating recursive partitioning and the BT model: splits are found semi-parametrically searching for instability of the basic BT model object parameters.

**** Figure 4 about here ***

The tree obtained from BTtree (Figure 4) consists of two terminal nodes, with EPI being the splitting covariate. Results align with those derived from BTRT. BTtree does not identify interactions among covariates. After fitting a BT model, it verifies whether the item order established in the root node remains consistent across all observations. Covariates that introduce instability into the item order, and thus into the λ̂, are prioritized. Employing EPI as the initial split covariate allows for the generation of two distinct orders: For nations where EPI0.661, income tax revenues are inferior to those from goods. In contrast, for EPI > 0.661, the situation inversely applies. Additionally, in the left child node, income taxes occupy the second position, while taxes on goods take precedence. Conversely, in the right child node, this order is reversed.

Interaction effects in BTRT allow to further differentiation of the subgroups based on the ordering of their income by taxation. Thus, the final output of BTtree contains less information than those contained in BTRT. BTtree provides the preference scales in each group of the partition that derive from the order of object-related parameters, but it does not specify how the subject-specific covariates affect the judges’ preferences. Therefore, this semi-parametric model returns beta coefficients neither for the main effects nor for the interaction effects. In addition, as pointed out in Strobl et al. (2011), the testing procedure for the split search is challenging (Wiedermann et al., 2021). BTtree is based on the M-fluctuation test which is a score-based procedure (Zeileis and Hornik, 2007), whereas BTRT is based on the easy-to-compute decrease in deviance.

7 Discussion

This study innovatively examined global tax revenue categories, diverging from traditional analyses, to assess how countries’ characteristics influence their tax revenue composition. A specialized dataset was created by merging information from the OECD, IMF, and World Bank for 2018. Tax revenues for 100 countries were classified into four categories and ranked. The Bradley-Terry Regression Trunk (BTRT) model, a novel probabilistic method for analyzing preference rankings, was applied. BTRT allows for an assessment of tax categories across a broad range of countries through pairwise comparisons. The BTRT model segments individuals by considering their tax revenue rankings and the causal link between these rankings and individual characteristics. It operates on the premise that the hierarchy of tax revenues within a country is influenced by its socio-economic traits. Thus, for each country, determinants of tax revenue were modeled as covariates. Our findings added to the discourse on the interplay between tax revenues and government spending by examining four key government expenditure categories: military, education, health, and a residual group encompassing all other types of government spending.

In our application, BTRT partitioned 100 countries into six groups based on several socio-economic covariates. It identified important interactions between these covariates, including both first-order interactions and complex interactions. Moreover, BTRT computed coefficients that helped in ordering tax revenues, shedding light on the impact of significant covariates and their interactions. Unlike traditional Bradley-Terry models which focus on preference data, BTRT offers a more nuanced understanding by probabilistically assessing the causal relationships between covariates and the outcomes in paired comparisons, facilitating the grouping of data into homogenous clusters from the model’s terminal nodes. This approach provides a more actionable analysis compared to standard Bradley-Terry models. Compared to Strobl’s semi-parametric BTtree, BTRT provides precise estimates of the impact of subject-specific covariates in paired comparisons. Our empirical results revealed several advantages of BTRT: it accommodates a diverse array of countries and fiscal systems thanks to its paired comparison structure; it enables the identification of varied effects of covariates on different tax revenue categories; its fully parametric nature facilitates the detection of threshold interactions, enhancing model fit; and its tree structure offers a clear visualization of countries grouped by similar fiscal revenue characteristics.

In conclusion, this paper presents a public finance application that provides valuable insights for policymakers, exploring the impact of economic shocks on tax revenue structures and offering guidance for countries looking to modify their tax compositions. The study suggests that changes in a country’s socio-economic characteristics can influence tax revenue amounts without necessitating new tax policy regulations, which are often slow to implement. The paper also discusses the limitations of discretionary fiscal policy due to recognition and implementation delays, advocating for the effectiveness of automatic stabilizers that respond immediately to economic changes. Using the BTRT model, the study identifies comparable fiscal policies across diverse fiscal systems by analyzing socioeconomic traits.

Disclaimer

The authors declare they have no conflict of interest and the findings, interpretations, and conclusions expressed herein do not necessarily reflect the view of the institutions of affiliation.

Algorithm

Algorithm 1: Bradley-Terry Regression Trunkinput: X (matrix of covariates), Y paired comparisons;

1 initialize: estimate the main effects model in the root node of the trunk: λ̂i,h=λ̂i+p=1Pβ̂i,pxp,h;

2 while splitting rule causes a decrease in model deviance or sample size within each child is greater or equal to 5 do

3 Split search for each current node tc ;

4 for p=1P do

5 find the dichotomous variable zijp,t that minimizes the local loglikelihood deviance of the model;

6 end

7 detect zijp,t* as the quantity that minimizes the global loglikelihood deviance of the model;

8 create child nodes tc+1 and tc+2;

9 zijp,t* updates the indicator function I{(x1,h,,xP,h)}t and the model including the last threshold interaction effect is re-estimated:λ̂i,h=λ̂i+i=1Pβ̂i,pxp,h+t=1T1β̂i,P+tI{(x1,h,,xP,h)t}

10 endoutput: Bradley-Terry Regression Trunk

Notes

1 The superscript O used in Eq. 4 is intentionally left out to lighten notation.

2 See Eq. 14 and Eq. 15 in Baldassarre et al. (2022).

Fig. A1 OECD: tax revenues composition by country

Fig. A1 OECD: tax revenues composition by country

Fig. A2 Africa: tax revenues composition by country

Fig. A2 Africa: tax revenues composition by country

Fig. A3 Asia: tax revenues composition by country

Fig. A3 Asia: tax revenues composition by country

Fig. A4 South America: tax revenues composition by country

Fig. A4 South America: tax revenues composition by country

Table 1 Design matrix with two judges/individuals, three objects, and one continuous subject-specific covariate.

Table 2 Descriptive statistics for the subject-specific covariates selected by the BTLL model.

Table 3 Pruned BTRT regression trunk.

Table 4 Pruned BTRT regression trunk: 10-fold cross-validation results.

Table 5 BTRT estimated coefficients.

Table 6 Mean worth parameters πi for each tax revenue category in each terminal node of the pruned regression trunk.

Table 7 OLS estimated coefficients with interaction effects

Table 8 Log-linear Bradley-Terry model without subject-specific covariates: results

Table 9 Descriptive statistics for the subject-specific covariates.

Figure 1 Pruned regression trunk.

Notes: In each terminal node R1,,R6 and in the root note, the number of countries H, the consensus ordering C, and the correlation coefficient τx are shown. Taxes are named as follows: I - Taxes on income, S - Social security contributions, P - Taxes on property, and G - Taxes on goods. When two tax categories are in brackets, it means they are in a tie.

Figure 1 Pruned regression trunk.Notes: In each terminal node R1,…,R6 and in the root note, the number of countries H, the consensus ordering C, and the correlation coefficient τx are shown. Taxes are named as follows: I - Taxes on income, S - Social security contributions, P - Taxes on property, and G - Taxes on goods. When two tax categories are in brackets, it means they are in a tie.

Figure 2 Countries’ distribution map in each region (terminal node) R1,,R6, where R1 is the reference region in the BTRT model.

Figure 2 Countries’ distribution map in each region (terminal node) R1,…,R6, where R1 is the reference region in the BTRT model.

Figure 3 Mean worth parameters πi conditioned to the share of agricultural sector to GDP.

Notes: The worth parameters are computed for R1,,R6 by fixing all covariates (except for the share of the agricultural sector) to the average level of the group. The order of graphs within the figure follows the trunk in Figure 1: R2 top left, R3 top right, R4 center left, R5 center right, R6 bottom left, R1 bottom right. The x-axis shows simulated values of the covariate agricultural share within the standardized range [0,1]. Finally, for each plot, the mean values of the covariates resulting from the splitting procedure of the regression trunk are reported. For the sake of clarity, the mean values of all the other country-specific covariates are not reported.

Figure 3 Mean worth parameters πi conditioned to the share of agricultural sector to GDP.Notes: The worth parameters are computed for R1,…,R6 by fixing all covariates (except for the share of the agricultural sector) to the average level of the group. The order of graphs within the figure follows the trunk in Figure 1: R2 top left, R3 top right, R4 center left, R5 center right, R6 bottom left, R1 bottom right. The x-axis shows simulated values of the covariate agricultural share within the standardized range [0,1]. Finally, for each plot, the mean values of the covariates resulting from the splitting procedure of the regression trunk are reported. For the sake of clarity, the mean values of all the other country-specific covariates are not reported.

Figure 4 Output of BTtree.

Notes: H is the number of countries, I = taxes on income, S = social security contributions, P = taxes on property, and G = taxes on goods and services. The plot shows the estimated worth parameters in both terminal nodes.

Figure 4 Output of BTtree.Notes: H is the number of countries, I = taxes on income, S = social security contributions, P = taxes on property, and G = taxes on goods and services. The plot shows the estimated worth parameters in both terminal nodes.
Supplemental material

Supplemental Material

Download PDF (161.3 KB)

References

  • Agbeyegbe, T. D., J. Stotsky, and A. WoldeMariam (2006). Trade liberalization, exchange rate changes, and tax revenue in sub-saharan africa. Journal of Asian Economics 17 (2), 261–284.
  • Agresti, A. (1990). Categorical data analysis. A Wiley-Interscience publication. Wiley.
  • Anderson, W., M. S. Wallace, and J. T. Warner (1986). Government spending and taxation: What causes what? Southern Economic Journal 52 (3), 630–639.
  • Baldassarre, A., E. Dusseldorp, A. D’Ambrosio, M. d. Rooij, and C. Conversano (2022). The bradley–terry regression trunk approach for modeling preference data with small trees. Psychometrika, 1–23.
  • Besley, T. and T. Persson (2014). Why do developing countries tax so little? Journal of economic perspectives 28 (4), 99–120.
  • Bird, R. M., J. Martinez-Vazquez, and B. Torgler (2008). Tax effort in developing countries and high income countries: The impact of corruption, voice and accountability. Economic Analysis and Policy 38 (1), 55–71.
  • Bird, R. M., J. Martinez-Vazquez, and B. Torgler (2014). Societal institutions and tax effort in developing countries. Annals of Economics and Finance 15 (1), 185–230.
  • Bondell, H. D. and B. J. Reich (2009). Simultaneous factor selection and collapsing levels in anova. Biometrics 65 (1), 169–177.
  • Boukbech, R., A. Bousselhami, and E. Ezzahid (2019, January). Determinants of Tax Revenues: Evidence From a Sample of Lower Middle Income Countries. Applied Economics and Finance 6 (1), 11–20.
  • Bradley, R. A. and M. E. Terry (1952). Rank analysis of incomplete block designs: I. the method of paired comparisons. Biometrika 39 (3/4), 324–345.
  • Breiman, L., J. H. Friedman, R. A. Olshen, and C. J. Stone (1984). Classification and Regression Trees. Wadsworth.
  • Brun, J. and M. Diakite (2016). Tax potential and tax effort: An empirical estimation for non-resource tax revenue and vat’s revenue. Working Papers 201610, CERDI.
  • Buera, F. J., A. Monge-Naranjo, and G. E. Primiceri (2011). Learning the wealth of nations. Econometrica 79 (1), 1–45.
  • Caselli, F. and J. Reynaud (2020). Do fiscal rules cause better fiscal balances? a new instrumental variable strategy. European Journal of Political Economy 63, 101873.
  • Cassou, S. P. (1997). The link between tax rates and foreign direct investment. Applied Economics 29 (10), 1295–1301.
  • Castro, G. and D. Camarillo (2014). Determinants of tax revenue in oecd countries over the period 2001-2011. Contaduria y Administracion 59 (3), 35–59.
  • Chelliah, R. H., H. J. Baas, and M. R. Kelly (1975). Tax ratios and tax effort in developing countries, 1969-71. IMF Staff Papers 1975 (001), A006.
  • Chelliah, R. J. (1971). Trends in taxation in developing countries. IMF Staff Papers 1971 (002), A002.
  • Choisel, S. and F. Wickelmaier (2007, January). Evaluation of multichannel reproduced sound: scaling auditory attributes underlying listener preference. J Acoust Soc Am 121, 388–400.
  • Cohen, J., P. Cohen, S. G. West, and L. S. Aiken (2022). Applied Multiple Regression/Correlation Analysis for the Behavioral Sciences. Routledge.
  • Conversano, C. and E. Dusseldorp (2017, 10). Modeling threshold interaction effects through the logistic classification trunk. Journal of Classification 34, 399–426.
  • D’Ambrosio, A., C. Iorio, M. Staiano, and R. Siciliano (2019). Median constrained bucket order rank aggregation. Comput. Stat. 34 (2), 787–802.
  • de González, A. B. and D. R. Cox (2007). Interpretation of interaction: A review. The Annals of Applied Statistics 1 (2), 371 – 385.
  • Dittrich, R., B. Francis, R. Hatzinger, and W. Katzenbeisser (2006, 02). Modelling dependency in multivariate paired comparisons: A log-linear approach. Mathematical Social Sciences 52, 197–209.
  • Dittrich, R., R. Hatzinger, and W. Katzenbeisser (1998). Modelling the effect of subject-specific covariates in paired comparison studies with an application to university rankings. Journal of the Royal Statistical Society: Series C (Applied Statistics) 47 (4), 511–525.
  • Dittrich, R., W. Katzenbeisser, and H. Reisinger (2000, 02). The analysis of rank ordered preference data based on bradley-terry type models. Or Spektrum 22, 117–134.
  • Dusseldorp, E., C. Conversano, and B. J. V. Os (2010). Combining an additive and tree-based regression model simultaneously: Stima. Journal of Computational and Graphical Statistics 19 (3), 514–530.
  • Dusseldorp, E. and J. J. Meulman (2004). The regression trunk approach to discover treatment covariate interaction. Psychometrika 69 (3), 355–374.
  • Eltony, N. et al. (2001). The determinants of tax effort in Arab countries. Arab Planning Institute.
  • Emond, E. J. and D. W. Mason (2002). A new rank correlation coefficient with application to the consensus ranking problem. Journal of Multi-Criteria Decision Analysis 11 (1), 17–28.
  • Feltenstein, A. and M. Cyan (2012). A Computational General Equilibrium Approach to Sectoral Analysis for Tax Potential: An Application to Pakistan. International Center for Public Policy Working Paper Series, at AYSPS, GSU paper1226, International Center for Public Policy, Andrew Young School of Policy Studies, Georgia State University.
  • Gertheiss, J. and G. Tutz (2010). Sparse modeling of categorial explanatory variables. The Annals of Applied Statistics 4 (4), 2150–2180.
  • Giuliano, P., P. Mishra, and A. Spilimbergo (2013). Democracy and reforms: evidence from a new dataset. American Economic Journal: Macroeconomics 5 (4), 179–204.
  • Gruber, J. ([2005, 2004]). Public finance and public policy. New York, NY: Worth Publishers, [2005, 2004] [copyright]2005.
  • Gupta, A. S. (2007). Determinants of tax revenue efforts in developing countries. Working Papers 07/184, International Monetary Fund.
  • Hatzinger, R. and R. Dittrich (2012, 04). prefmod: An r package for modeling preferences based on paired comparisons, rankings, or ratings. JSS Journal of Statistical Software 48.
  • Jain, P. ([1989]). Economics of Public Finance. Dehli, India: Atlantic Publishers.
  • Jiang, T. Y. (2021). Drelationship between government expenditures on health and residents’ consumption: New evidence from china based on the bootstrap rolling-window causality test. Frontiers in public health 9, 710147.
  • Kaldor, N. (1963). Taxation for economic development. The Journal of Modern African Studies 1 (1), 7–23.
  • Karagoz, K. (2013). Determinants of tax revenue: Does sectorial composition matter? Journal of Finance, Accounting and Management 4 (2), 50–63.
  • Leuthold, J. H. (1991). Tax shares in developing economies a panel study. Journal of development Economics 35 (1), 173–185.
  • Lotz, J. R. and E. R. Morss (1967). Measuring tax effort in developing countries. IMF Staff Papers 1967 (003), A004.
  • Luce, R. D. (1959). Individual Choice Behavior: A Theoretical analysis. New York, NY, USA: Wiley.
  • Manage, N. and M. L. Marlow (1986). The causal relation between federal expenditures and receipts. Southern Economic Journal 52 (3), 617–629.
  • Martin, F. and C. Uribe Teran (2010, 01). Economic and institutional determinants of fiscal effort in latin america. Investigacion Economica 69, 85–113.
  • Masarotto, G. and C. Varin (2012). The ranking lasso and its application to sport tournaments. The Annals of Applied Statistics 6 (4), 1949–1970.
  • McCullagh, P. and J. Nelder (1989). Generalized Linear Models, Second Edition. Chapman and Hall/CRC Monographs on Statistics and Applied Probability Series. Chapman & Hall.
  • OECD (2018a). Constructing the global revenue statistics database. Technical paper, Organization for Economic Co-operation and Development.
  • OECD (2018b). Global revenue statistics database. Technical report, Organization for Economic Co-operation and Development.
  • Oelker, M.-R., W. Pößnecker, and G. Tutz (2015). Selection and fusion of categorical predictors with l0-type penalties. Statistical Modelling 15 (5), 389–410.
  • Pessino, C. and R. Fenochietto (2010, 11). Determining countries’ tax effort. Hacienda Publica Espanola 195, 65–87.
  • Piancastelli, M. (2001). Measuring the tax effort of developed and developing countries: Cross country panel data analysis. IPEA Working Paper (818).
  • Rodriguez Montequin, V., J. M. Villanueva Balsera, M. Diaz Piloneta, and C. Alvarez Perez (2020). A bradley-terry model-based approach to prioritize the balance scorecard driving factors: The case study of a financial software factory. Mathematics 8 (2).
  • Schauberger, G. and G. Tutz (2017). Subject-specific modelling of paired comparison data: A lasso-type penalty approach. Statistical Modelling 17 (3), 223–243.
  • Schauberger, G. and G. Tutz (2019). BTLLasso: A common framework and software package for the inclusion and selection of covariates in Bradley-Terry models. Journal of Statistical Software 88 (9), 1–29.
  • Schauberger, G. J. (2015). Regularization methods for item response and paired comparison models. Cuvillier Verlag.
  • Sinclair, C. D. (1982). Glim for preference.
  • Stotsky, M. J. G. and M. A. WoldeMariam (1997). Tax effort in sub-Saharan Africa. International Monetary Fund.
  • Strobl, C., F. Wickelmaier, and A. Zeileis (2011). Accounting for individual differences in bradley-terry models by means of recursive partitioning. Journal of Educational and Behavioral Statistics 36 (2), 135–153.
  • Tait, A. A., W. L. Gratz, and B. J. Eichengreen (1979). International comparisons of taxation for selected developing countries, 1972-1976. IMF Staff Papers 1979 (001), A005.
  • Tanzi, V. (1989). The impact of macroeconomic policies on the level of taxation and the fiscal balance in developing countries. IMF Staff Papers 1989 (003), A005.
  • Teera, J. M. and J. Hudson (2004). Tax performance: a comparative study. Journal of International Development 16 (6), 785–802.
  • Tibshirani, R. (1996). Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society: Series B (Methodological) 58 (1), 267–288.
  • Tutz, G. and G. Schauberger (2015). A penalty approach to differential item functioning in rasch models. Psychometrika 80 (1), 21–43.
  • Wallace, S. and R. W. Bahl (2005). Public financing in developing and transition countries. Public Budgeting & Finance 25 (4S), 83–98.
  • Wiedermann, W., U. Frick, and E. C. Merkle (2021). Detecting heterogeneity of intervention effects in comparative judgments. Prevention science (Mar 9).
  • Zeileis, A. and K. Hornik (2007). Generalized m-fluctuation tests for parameter instability. Statistica Neerlandica 61 (4), 488–508.
  • Zeileis, A., T. Hothorn, and K. Hornik (2008). Model-based recursive partitioning. Journal of Computational and Graphical Statistics 17 (2), 492–514.
  • Zou, H. (2006). The adaptive lasso and its oracle properties. Journal of the American Statistical Association 101 (476), 1418–1429.