Asymptotic behavior of encompassing test for independent processes: Case of linear and nearest neighbor regressions

Abstract

The encompassing test has been well developed for fully parametric modeling. In this study, we are interested in encompassing tests for parametric and nonparametric regression methods. We consider linear regression for parametric modeling and nearest neighbor regression for nonparametric methods. We establish asymptotic normality of the encompassing statistic associated with the encompassing hypotheses for the linear parametric method and the nonparametric nearest neighbor regression estimate. We also obtain a convergence rate that depends only on the number of neighbors $k$ , whereas for the kernel method it depends on the number of observations $n$ and the bandwidth $h_{n}$ . The two rates coincide when $h_{n} = k / n$ . Moreover, the asymptotic variance of the encompassing statistic associated with kernel regression depends on the density, which is not the case for the nearest neighbor regression estimate.

PUBLIC INTEREST STATEMENT

Regression techniques are widely used for quantitative analysis in many fields, such as economic and financial modeling. They are a useful tool for identifying factors that may explain the evolution of a variable of interest. In economic modeling, for example, the evolution of Gross Domestic Product (GDP) may be affected by many variables, such as the interest rate, inflation, the exchange rate and sentiment indicators. Researchers or experts may face several admissible models from parametric and/or nonparametric regression methods. An encompassing test can help detect redundant models among the admissible ones. The findings of this study contribute to encompassing tests between linear and nearest neighbor regression estimates.

1. Introduction

Encompassing tests belong to the model selection step. They are used to detect redundant models among admissible models. In that case, an encompassing model is intended to account for the results found by the encompassed model. Theoretical developments on encompassing tests can be found in Mizon (1984), Gouriéroux and Monfort (1995) and Florens et al. (1996). For their application, we refer readers to the general-to-specific computer-based model selection procedure of Hendry and Doornik (1994).

Recently, Bontemps et al. (2008) developed an encompassing test for linear parametric against kernel nonparametric regression methods. They provide asymptotic normality of the associated encompassing statistics under the independent and identically distributed (i.i.d.) hypothesis. As stated in Hendry et al. (2008), the work of Bontemps et al. (2008) is the initial treatment of encompassing tests for functional parameters based on nonparametric methods.

We extend this result to the nearest neighbor regression method, which has been claimed to be more flexible than the kernel method. A further motivation is its use in applications, as in Nowman and Saltoglu (2003), Guégan and Huck (2005), Ferrara et al. (2010), Guégan and Rakotomarolahy (2010), and Puspitasari and Rustam (2018), among others.

In the next section, we provide an overview of the encompassing test. We then establish asymptotic normality for various encompassing statistics associated with linear parametric and nearest neighbor regression methods. Finally, we conclude.

2. Encompassing test for independent processes

This section introduces the encompassing test and then builds the corresponding encompassing hypothesis. Given two regression models $M_{1}$ and $M_{2}$ , we are interested in knowing whether model $M_{1}$ can account for the results of model $M_{2}$ . In other words, we want to know whether $M_{1}$ encompasses $M_{2}$ , written $M_{1} E M_{2}$ for short. Such a hypothesis is tested using the notion of encompassing test.

Generally speaking, model $M_{1}$ encompasses model $M_{2}$ if the parameter $θ_{M_{2}}$ of the latter model can be expressed as a function of the parameter $θ_{M_{1}}$ of the former. More precisely, let $Δ (θ_{M_{1}})$ be the pseudo-true value of $θ_{M_{2}}$ on $M_{1}$ . In general, the pseudo-true value is defined as the probability limit (plim) of ${\hat{θ}}_{M_{2}}$ on $M_{1}$ . For more discussion of the pseudo-true value associated with the KLIC,¹ we refer to Sawa (1978) and Govaerts et al. (1994). The encompassing statistic is given by the difference between ${\hat{θ}}_{M_{2}}$ and $Δ ({\hat{θ}}_{M_{1}})$ scaled by a coefficient $a_{n}$ .

Let $S = (Y, X, Z)$ be a zero-mean random process taking values in $R \times R^{d} \times R^{q}$ , where $d, q \in N^{*}$ . For $x \in R^{d}$ and $z \in R^{q}$ , we consider the two models $M_{1}$ and $M_{2}$ defined as follows:

M_{1} : m (x) = E [Y | X = x] \quad \text{and} \quad M_{2} : g (z) = E [Y | Z = z] .

(1)

In addition, the general unrestricted model is given by $r (x, z) = E [Y | X = x, Z = z]$ . Following the encompassing test for functional parameters in Bontemps et al. (2008), we have the null hypothesis:

H : E [Y | X = x, Z = z] = E [Y | X = x] .

This null states that $M_{1}$ is the owner model; $M_{2}$ serves to validate this statement and is called the rival model. We test this hypothesis $H$ through the following implicit encompassing hypothesis:

H^{*} : E [E [Y | X = x] | Z = z] = E [Y | Z = z] .

The following homoskedasticity condition is assumed throughout this work:

\mathrm{Var} [Y | X = x, Z = z] = σ^{2} .

(2)

Moreover, a necessary condition for the encompassing test concerns the errors of both models: the intended encompassing model $M_{1}$ should have a smaller standard error than the encompassed model $M_{2}$ .

Consider a sample of size $n$ , $s_{i} = (y_{i}, x_{i}, z_{i})$ for $i = 1, \dots, n$ , as a realization of the random process $S = (Y, X, Z)$ . We suppose that the $s_{i}$ , $i = 1, \dots, n$ , are i.i.d. Then, for given functional estimates $m_{n}$ and $g_{n}$ of the functions $m$ and $g$ , respectively, we have the following encompassing statistic:

{\hat{δ}}_{m_{n}, g_{n}} = g_{n} - \hat{G} (m_{n}),

where $\hat{G} (m_{n})$ is an estimate of the pseudo-true value, associated with $g_{n}$ on $H$ , given by the left-hand side of the hypothesis $H^{*}$ . Bontemps et al. (2008) provided asymptotic normality of this encompassing statistic $\hat{δ}$ by considering the kernel regression estimate as the nonparametric method. This result can be extended to the nearest neighbor regression estimate, though under different assumptions.

For the nearest neighbor regression estimate, we consider the representation in Mack (1981): the $k$ nearest neighbor (or $k$ -NN) estimate $g_{n}$ of $g$ is given by:

g_{n} (z) = \frac{\frac{1}{n R_{n}^{q}} \sum_{i = 1}^{n} w (\frac{z - Z_{i}}{R_{n}}) Y_{i}}{\frac{1}{n R_{n}^{q}} \sum_{i = 1}^{n} w (\frac{z - Z_{i}}{R_{n}})},

where $R_{n}$ is the distance, according to the Euclidean norm in $R^{q}$ , from $z$ to its $k (n)$ -th nearest neighbor, and $w (u)$ is a bounded, non-negative weight function satisfying

\int w (u) d u = 1 \quad \text{and} \quad w (u) = 0 \ \text{for} \ | u | \geq 1.

(3)
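As a rough illustration of the estimator above, the following Python sketch evaluates $g_{n} (z)$ at a single point. The function name `knn_regress`, the use of NumPy, and the default uniform weight applied to the norm $| u |$ are assumptions made for this example; they are not taken from Mack (1981). The normalising factors $\frac{1}{n R_{n}^{q}}$ cancel in the ratio, so they are omitted.

```python
import numpy as np

def knn_regress(z, Z, Y, k, w=None):
    """Hypothetical helper: k-NN estimate g_n(z) in the ratio form above.

    Z is an (n, q) array of regressors, Y an (n,) array of responses,
    z a point in R^q and k the number of neighbors (k >= 2).
    w is a radial weight taking |u| and vanishing for |u| >= 1.
    """
    if k < 2:
        raise ValueError("k must be at least 2")
    Y = np.asarray(Y, dtype=float)
    Z = np.asarray(Z, dtype=float).reshape(len(Y), -1)
    z = np.asarray(z, dtype=float).reshape(-1)
    dist = np.linalg.norm(Z - z, axis=1)      # Euclidean distances to z
    R_n = np.sort(dist)[k - 1]                # distance to the k-th nearest neighbor
    if R_n == 0.0:
        raise ValueError("k-th neighbor coincides with z; R_n is zero")
    if w is None:
        # Assumed default: uniform weight, zero for |u| >= 1 as in relation (3).
        w = lambda r: (r < 1.0).astype(float)
    weights = w(dist / R_n)
    return float(np.sum(weights * Y) / np.sum(weights))
```

With the uniform weight and no ties, the $k$ -th neighbor itself sits at $| u | = 1$ and receives zero weight, so the estimate averages the $k - 1$ strictly closer observations.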

To establish the asymptotic distribution of ${\hat{δ}}_{m_{n}, g_{n}}$ , we need some assumptions. The following assumptions ensure asymptotic normality and are taken from Mack (1981). To lighten notation, the function $f$ denotes a marginal, conditional or joint density according to the variables in its arguments.

The first assumption concerns the joint density of the pair $(Y, Z)$ .

Assumption 1. The function $χ_{β} (z) = \int y^{β} f (z, y) d y$ is bounded and continuous at $z$ for $β = 0, 1, 2,$ and continuously differentiable in a neighborhood of $z$ for $β = 0, 1$ .

The following assumption concerns conditions on the moments up to order three of the variable of interest.

Assumption 2. $E [| Y |^{3}] < \infty$ , $V a r [Y | Z = z] > 0$ and $f (z) > 0$ .

The last assumption states conditions on the relationship between the number of neighbors $k$ and the sample size $n$ .

Assumption 3. $k = n^{α}$ with $0 < α < \frac{4}{4 + d}$ .

When assumptions 1–3 hold and relation (3) is satisfied, Mack (1981) established the asymptotic normality of the centered $k$ -NN regression estimate $g_{n}$ . Moreover, under assumption 3, the bias of this $k$ -NN regression estimate vanishes asymptotically.

We proceed in the same way when model $M_{1}$ is estimated by the $k$ -NN regression method. In the rest of the paper, $N (μ, v)$ denotes the normal distribution with mean $μ$ and variance $v$ . We now present the asymptotic normality of the encompassing statistic.

3. Asymptotic normality of the encompassing statistic

In general, $M_{1}$ or $M_{2}$ can be estimated using nonparametric or parametric regression methods. We can encounter the following four situations: $M_{1}$ and $M_{2}$ are both estimated parametrically, $M_{1}$ and $M_{2}$ are both estimated nonparametrically, $M_{1}$ is estimated nonparametrically and $M_{2}$ parametrically and $M_{1}$ is estimated parametrically and $M_{2}$ nonparametrically.

For developments on the asymptotic behavior of the encompassing statistic in the fully parametric case, i.e. when the two models $M_{1}$ and $M_{2}$ both have a parametric specification, we refer readers to Gouriéroux et al. (1983) and Mizon and Richard (1986), among others. For a recent discussion of the encompassing test in the fully parametric case, Bontemps et al. (2008) is a good reference.

Next, we will study the completely nonparametric case.

3.1. Nonparametric specification for $M_{1}$ and $M_{2}$

We consider the case where the two models $M_{1}$ and $M_{2}$ defined in (1) are estimated using the nonparametric nearest neighbor regression method. To test the hypothesis “ $M_{1}$ encompasses $M_{2}$ ”, we establish asymptotic normality of the associated encompassing statistic.

Theorem 3.1. Assume that assumptions 1–3 and relations (2) and (3) hold. Then under $H$ , we have:

\sqrt{k - 1} {\hat{δ}}_{m_{n}, g_{n}} (z) \to N (0, c \, \mathrm{Var} (ϵ | Z = z) \int w^{2} (u) d u) \quad \text{in distribution as} \ n \to \infty,

where $ϵ_{i} = Y_{i} - m (x_{i})$ for $i = 1, \dots, n$ are the residuals from model $M_{1}$ and $c = \frac{π^{q / 2}}{Γ ((q + 2) / 2)}$ is the volume of unit ball in $R^{q}$ with $Γ (.)$ the gamma function.
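To make the constants in Theorem 3.1 concrete, here is a small Python sketch that evaluates $c$ and the asymptotic variance $c \, \mathrm{Var} (ϵ | Z = z) \int w^{2} (u) d u$ . The function names are hypothetical, and the conditional variance and $\int w^{2} (u) d u$ are assumed to be supplied by the user. For the uniform weight $w = 1 / c$ on the unit ball, $\int w^{2} (u) d u = 1 / c$ , so the variance reduces to $\mathrm{Var} (ϵ | Z = z)$ .

```python
import math

def unit_ball_volume(q):
    """Volume c of the unit ball in R^q: pi^(q/2) / Gamma((q + 2) / 2)."""
    return math.pi ** (q / 2) / math.gamma((q + 2) / 2)

def asymptotic_variance(q, cond_var, int_w2):
    """Hypothetical helper: c * Var(eps | Z = z) * integral of w^2.

    cond_var and int_w2 are inputs the caller must provide or estimate.
    """
    return unit_ball_volume(q) * cond_var * int_w2

# Sanity values: c = 2 for q = 1, pi for q = 2 and 4*pi/3 for q = 3.
volumes = [unit_ball_volume(q) for q in (1, 2, 3)]
```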

Proof of Theorem 3.1

The proof is based on decomposing the encompassing statistic into two parts: a nearest neighbor regression term and a bias-type term. First, we denote:

W (\frac{z - Z_{i}}{R_{n}}) = \frac{\frac{1}{n R_{n}^{q}} w (\frac{z - Z_{i}}{R_{n}})}{\frac{1}{n R_{n}^{q}} \sum_{i = 1}^{n} w (\frac{z - Z_{i}}{R_{n}})} .

We write down our encompassing statistic by replacing our estimates $g_{n}$ and $\hat{G} (m_{n})$ at a given point $z$ , and we have:

\begin{aligned} \sqrt{k - 1} {\hat{δ}}_{m_{n}, g_{n}} (z) = \sqrt{k - 1} \sum_{i = 1}^{n} W (\frac{z - Z_{i}}{R_{n}}) Y_{i} - \sqrt{k - 1} \sum_{i = 1}^{n} W (\frac{z - Z_{i}}{R_{n}}) m_{n} (x_{i}) \\ = \sqrt{k - 1} \sum_{i = 1}^{n} W (\frac{z - Z_{i}}{R_{n}}) ϵ_{i} + \sqrt{k - 1} \sum_{i = 1}^{n} W (\frac{z - Z_{i}}{R_{n}}) (m (x_{i}) - m_{n} (x_{i})) = A + B, \end{aligned}

where $A$ is the first expression on the right-hand side of the equality. It is a $k$ -NN regression of $ϵ_{i} = Y_{i} - m (x_{i})$ on $Z_{i}$ scaled by the coefficient $\sqrt{k - 1}$ , which acts as the convergence rate as $n$ goes to infinity. Using Mack (1981), under assumptions 1–3 and when relation (3) holds, we have:

A \to N (0, c \, \mathrm{Var} (ϵ | Z = z) \int w^{2} (u) d u) \quad \text{in distribution as} \ n \to \infty .

Next, for the second expression $B$ , we can bound by taking its supremum with respect to $x_{i}$ and then we get:

\begin{aligned} | B | & \leq \sup_{x_{i}} \sqrt{k - 1} | m_{n} (x_{i}) - m (x_{i}) | \\ & \leq \sup_{x_{i}} \sqrt{k - 1} | m_{n} (x_{i}) - E [m_{n} (x_{i})] | + \sup_{x_{i}} \sqrt{k - 1} | E [m_{n} (x_{i})] - m (x_{i}) | = B_{1} + B_{2}. \end{aligned}

(4)

When using the expression of the bias, Theorem 1 in [2], $B_{2}$ becomes:

B_{2} = (\sup_{x_{i}} A (x_{i})) {(\frac{k}{n})}^{\frac{2}{d}} \sqrt{k - 1} + o ({(\frac{k}{n})}^{\frac{2}{d}}) \sqrt{k - 1} + O (\frac{1}{k}) \sqrt{k - 1},

where $A (.)$ is a function depending only on $x_{i}$ , whose expression can be found in Mack (1981). From Assumption 3, $B_{2}$ then vanishes as $n \to \infty$ . It remains to show that $B_{1}$ also goes to zero. This can be achieved using the result of Mukerjee (1993), an extension of Cheng’s work (Cheng, 1984). We remark that as the number of neighbors $k$ increases, the weights given to the neighbors decrease; rewriting $m_{n} (x_{i})$ , we have the following equivalence:

m_{n} (x_{i}) = \frac{\sum_{j = 1}^{n} K (\frac{x_{i} - X_{j}}{R_{i}}) Y_{j}}{\sum_{j = 1}^{n} K (\frac{x_{i} - X_{j}}{R_{i}})} ≅ \sum_{j = 1}^{n} \frac{c_{j}}{k} Y_{j},

where $K (.)$ is a given weight function which satisfies condition (3), $c_{j}$ is a bounded weight equal to zero when $j$ is larger than the number of neighbors and $R_{i}$ is the distance between $x_{i}$ and its $k$ -th nearest neighbor. Denoting ${\tilde{m}}_{n} (x_{i}) = \sum_{j = 1}^{n} \frac{c_{j}}{\sqrt{k}} Y_{j}$ , Theorem 2.1 in Mukerjee (1993) gives:

B_{1} = \sup_{x_{i}} | {\tilde{m}}_{n} (x_{i}) - E [{\tilde{m}}_{n} (x_{i})] | = O (\frac{1}{θ_{n}}) + O (n^{- \frac{r - 1}{r}}),

with $r > 1$ and $θ_{n}$ a positive sequence which tends to zero as $n \to \infty$ . So $| B |$ converges to zero in probability, as $B_{1}$ does. This completes the proof of the theorem.

Next, we will consider the mixed situation where the owner model has parametric specification and the rival is from nonparametric method.

3.2. Parametric modelling for $M_{1}$ vs nonparametric specification for $M_{2}$

In this section, we consider the case that model $M_{1}$ is a linear parametric model and $M_{2}$ is estimated by nearest neighbor regression technique. Therefore, the hypothesis $H$ will have linear parametric specification. The encompassing statistic associated to the null $M_{1} E M_{2}$ can be rewritten as follows:

{\hat{δ}}_{β, g} (z) = g_{n} (z) - {\hat{G}}_{L} (\hat{β}) (z),

(5)

where ${\hat{G}}_{L} (\hat{β})$ is an estimate of the pseudo-true value $G_{L} (β) (z)$ associated with $g_{n}$ on $H$ , and is defined as $G_{L} (β) (z) = β' E [X | Z = z]$ .

We estimate the rival model $M_{2}$ using $k$ -NN regression method where the owner model $M_{1}$ is still with linear parametric specification. The following theorem provides the asymptotic normality of the encompassing statistic introduced in relation (5).

Theorem 3.2. Assume that assumptions 1–3, relations (2) and (3) hold.

Then under $H$ , we get:

\sqrt{k - 1} {\hat{δ}}_{β, g} (z) \to N (0, Σ) \quad \text{in distribution as} \ n \to \infty,

where $Σ = c σ^{2} \int w^{2} (u) d u$ and $c$ is the volume of the unit ball in $R^{q}$ .

Proof of Theorem 3.2.

When the owner model $M_{1}$ is the parametric linear regression and the rival model $M_{2}$ is the $k$ -NN regression, we write the encompassing statistic as follows:

\begin{aligned} \sqrt{k - 1} {\hat{δ}}_{β, g} (z) = \sqrt{k - 1} (g_{n} (z) - {\hat{G}}_{L} (\hat{β}) (z)) = \sqrt{k - 1} (\sum_{i = 1}^{n} W (\frac{z - Z_{i}}{R_{n}}) Y_{i} - \sum_{i = 1}^{n} \tilde{W} (\frac{z - Z_{i}}{{\tilde{R}}_{n}}) \hat{β}' X_{i}) \\ = \sqrt{k - 1} \sum_{i = 1}^{n} W (\frac{z - Z_{i}}{R_{n}}) (Y_{i} - β' X_{i}) + \sqrt{k - 1} \sum_{i = 1}^{n} \tilde{W} (\frac{z - Z_{i}}{{\tilde{R}}_{n}}) (β - \hat{β})' X_{i} \\ + \sqrt{k - 1} \sum_{i = 1}^{n} W (\frac{z - Z_{i}}{R_{n}}) β' X_{i} - \sqrt{k - 1} \sum_{i = 1}^{n} \tilde{W} (\frac{z - Z_{i}}{{\tilde{R}}_{n}}) β' X_{i} = N_{1} + N_{2} + N_{3} - N_{4}, \end{aligned}

(6)

where $\tilde{W} (\frac{z - Z_{i}}{{\tilde{R}}_{n}}) = \frac{\frac{1}{n {\tilde{R}}_{n}^{q}} \tilde{w} (\frac{z - Z_{i}}{{\tilde{R}}_{n}})}{\frac{1}{n {\tilde{R}}_{n}^{q}} \sum_{i = 1}^{n} \tilde{w} (\frac{z - Z_{i}}{{\tilde{R}}_{n}})}$ is the weight associated to the nearest neighbor regression of ${\hat{β}}^{'} X_{i}$ on $Z_{i}$ and ${\tilde{R}}_{n}$ is the distance from $z$ to its ${\tilde{k}}^{t h}$ neighbor.

We remark that $Y_{i}$ and $\hat{β}' X_{i}$ , the fitted values of $Y_{i}$ , share the same $Z_{i}$ nearest to $z$ , in which case $N_{3} - N_{4} = 0$ . Otherwise, this holds asymptotically: $Y_{i}$ and $\hat{β}' X_{i}$ have the same $Z_{i}$ nearest to $z$ when $k$ and $\tilde{k}$ tend to infinity. Thus, $N_{3}$ is asymptotically equivalent to $N_{4}$ .

The first expression is $N_{1} = \sqrt{k - 1} \sum_{i = 1}^{n} W (\frac{z - Z_{i}}{R_{n}}) ϵ_{i}$ , with $ϵ_{i} = Y_{i} - β' X_{i}$ . Under the assumptions of Theorem 3.2, and using the result in Mack (1981), we have:

N_{1} \to N (0, Σ) \quad \text{in distribution as} \ n \to \infty,

where $Σ = c . σ^{2} \int w^{2} (u) d u$ .

For $N_{2} = (β - \hat{β})' \sqrt{k - 1} \sum_{i = 1}^{n} \tilde{W} (\frac{z - Z_{i}}{{\tilde{R}}_{n}}) X_{i}$ , under the assumptions of Theorem 3.2, we know that $\sqrt{n} (β - \hat{β})$ converges in distribution to a normal law $Z$ with mean zero. The remaining part of $N_{2}$ is $\frac{\sqrt{k - 1}}{\sqrt{n}} \sum_{i = 1}^{n} \tilde{W} (\frac{z - Z_{i}}{{\tilde{R}}_{n}}) X_{i}$ , which converges in distribution to zero. Thus, by Slutsky’s theorem, $N_{2}$ tends to zero in distribution. $\boxempty$

We will consider the last case where the owner model $M_{1}$ is a nonparametric method and the rival model $M_{2}$ is a linear parametric model.

3.3. Nonparametric specification for $M_{1}$ vs parametric modelling for $M_{2}$

We now consider the owner model $M_{1}$ to be estimated using a $k$ -NN nonparametric regression and the rival model $M_{2}$ to be a linear parametric method. Therefore, the encompassing statistic associated to the null $M_{1} E M_{2}$ is given by:

{\hat{δ}}_{m, γ} = \hat{γ} - \hat{γ} (m_{n}),

(7)

where $\hat{γ} (m_{n})$ is an estimate of the pseudo-true value $γ (m)$ associated with $\hat{γ}$ on $H$ , which is defined by $γ (m) = (E [Z Z'])^{- 1} E [Z m]$ . We estimate the unknown conditional mean $m$ associated to the model $M_{1}$ using $k$ -NN regression estimate. We state in the following theorem the asymptotic normality of the encompassing statistic in relation (7). For precision, we use the assumptions introduced in previous section for $k$ -NN regression estimate $m_{n}$ instead of $g_{n} .$

Theorem 3.3. Assume that relations (2) and (3), assumptions 1–3, and the regularity conditions in linear regression are satisfied.

Then under $H$ , we get:

\sqrt{n} {\hat{δ}}_{m, γ} \to N (0, Ω) \quad \text{in distribution as} \ n \to \infty,

where $Ω = σ^{2} (E [Z Z'])^{- 1}$ .

Proof of Theorem 3.3.

When the functional parameter estimate $m_{n}$ is the $k$ -NN regression estimate, we rewrite the associated encompassing statistic as follows:

\begin{aligned} \sqrt{n} {\hat{δ}}_{m, γ} = \sqrt{n} (\hat{γ} - \hat{γ} (m_{n})) = \sqrt{n} ({(\frac{1}{n} \sum_{i = 1}^{n} Z_{i} Z'_{i})}^{- 1} (\frac{1}{n} \sum_{i = 1}^{n} Z_{i} Y_{i}) - {(\frac{1}{n} \sum_{i = 1}^{n} Z_{i} Z'_{i})}^{- 1} (\frac{1}{n} \sum_{i = 1}^{n} Z_{i} m_{n} (x_{i}))) \\ = \sqrt{n} {(\frac{1}{n} \sum_{i = 1}^{n} Z_{i} Z'_{i})}^{- 1} (\frac{1}{n} \sum_{i = 1}^{n} Z_{i} (Y_{i} - m (x_{i}))) + \sqrt{n} {(\frac{1}{n} \sum_{i = 1}^{n} Z_{i} Z'_{i})}^{- 1} (\frac{1}{n} \sum_{i = 1}^{n} Z_{i} (m (x_{i}) - m_{n} (x_{i}))) \\ = L_{1} + L_{2}, \end{aligned}

(8)

where $L_{1}$ is the first expression on the right-hand side of equality (8). It corresponds to the linear regression of the error $ϵ$ (with $ϵ_{i} = Y_{i} - m (x_{i})$ ) on $Z$ .

Under the i.i.d. assumption of Theorem 3.3, $L_{1}$ converges in distribution to $Z$ , where $Z$ is normally distributed with mean zero and variance $Ω = σ^{2} (E [Z Z'])^{- 1}$ . For the second expression $L_{2}$ , we bound it by taking the supremum with respect to $x_{i}$ , which gives $| L_{2} | \leq \sqrt{n} S_{n} D_{n} \sup \{ (m (x_{i}) - m_{n} (x_{i})), x_{i} \in R^{d} \}$ , where $S_{n} = \frac{1}{n} \sum_{i = 1}^{n} | Z_{i} |$ and $D_{n} = (\frac{1}{n} \sum_{i = 1}^{n} Z_{i} Z'_{i})^{- 1}$ . We remark that $\sqrt{n} \sup \{ (m (x_{i}) - m_{n} (x_{i})), x_{i} \in R^{d} \}$ is asymptotically equivalent to the bound of $| B |$ in Equation (4), which converges to zero in probability. Thus, the product also vanishes by Slutsky’s theorem. This completes the proof.

4. Illustration

In this section, we illustrate our theoretical results on real data. We focus on socio-economic determinants of life expectancy. As explanatory variables for life expectancy at birth, we consider the Gross National Income per capita, the Gross Domestic Product per capita and the government health expenditure per capita, all in US dollars. The impact of these variables on life expectancy at birth has long been analyzed in the literature; for regression analyses, see Hussain (2002) and Ali and Ahmad (2014). We use cross-sectional data for 169 countries in 2017, collected from the United Nations and the World Health Organization websites. To start our empirical study, we compute some basic statistics.

The highest life expectancy is 84 years, in Japan, while the lowest is around 52 years, in the Central African Republic. The mean life expectancy is 72 years. Moreover, the median value of 73.69 indicates that around 84 countries have a life expectancy above 73 years, well beyond typical retirement ages.

Among the socio-economic variables, Luxembourg, Switzerland and the USA have the highest GDP, income and health expenditure per capita, respectively. Burundi has the lowest GDP and income per capita. The Democratic Republic of the Congo registers the lowest government spending on health care. These variables share some common features: the median of each variable is around fifteen times its minimum and around one fifteenth of its maximum. They are also highly dispersed. We now analyze their relationship with life expectancy.

We now compute the correlation coefficients between life expectancy and the predictor variables.

From Table 2, life expectancy has a high positive correlation with each explanatory variable. Such correlations indicate that higher GDP, income or health expenditure is associated with longer life expectancy. This preliminary analysis motivates further statistical and econometric analysis of the relationship between life expectancy and the three socio-economic variables. We will use the linear and the nearest neighbor regression methods. In the sequel, we work with variables that are demeaned and scaled by the factor $\frac{1}{\mathrm{Max} - \mathrm{Min}}$ .
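The preprocessing step just described can be sketched as follows. This is a minimal example assuming NumPy and a hypothetical helper name `demean_and_scale`; the sample values are invented for illustration and are not taken from the data set.

```python
import numpy as np

def demean_and_scale(x):
    """Subtract the sample mean, then multiply by 1 / (max - min)."""
    x = np.asarray(x, dtype=float)
    spread = x.max() - x.min()
    if spread == 0.0:
        raise ValueError("variable is constant; max - min is zero")
    return (x - x.mean()) / spread

# Invented values for illustration only.
scaled = demean_and_scale([0.0, 5.0, 10.0])
```

After this transformation each variable has zero mean and a range of exactly one, which is consistent with the zero-mean setting assumed for $S = (Y, X, Z)$ in Section 2.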

Table 1. Summary statistics

Table 2. Correlation between life expectancy and the socio-economic variables

For the linear regression, we explain life expectancy at birth $Y$ by health expenditure per capita $X$ , gross income per capita $Z$ and GDP per capita $W$ . Considering several combinations of these explanatory variables, we summarize below the regression coefficient estimates, with their standard errors in parentheses.

\begin{aligned} & M_{1} : Y_{i} = \underset{(0.02)}{0.8} X_{i} + {\hat{u}}_{i}^{1}, \quad M_{2} : Y_{i} = \underset{(0.006)}{0.76} Z_{i} + {\hat{u}}_{i}^{2}, \quad M_{3} : Y_{i} = \underset{(0.008)}{0.89} W_{i} + {\hat{u}}_{i}^{3}, \\ & M_{4} : Y_{i} = - \underset{(0.21)}{0.26} X_{i} + \underset{(0.18)}{0.96} Z_{i} + {\hat{u}}_{i}^{4}, \quad M_{5} : Y_{i} = \underset{(0.19)}{0.02} X_{i} + \underset{(0.18)}{0.86} W_{i} + {\hat{u}}_{i}^{5}, \\ & M_{6} : Y_{i} = - \underset{(0.21)}{0.28} X_{i} + \underset{(0.44)}{1.24} Z_{i} - \underset{(0.46)}{0.32} W_{i} + {\hat{u}}_{i}^{6}, \end{aligned}

(9)

where ${\hat{u}}^{j}$ is an estimate of the error term of model $M_{j}$ , $j = 1, \dots, 6$ .

The coefficients of models $M_{1}$ , $M_{2}$ and $M_{3}$ are all significant. In contrast, models $M_{4}$ and $M_{6}$ reduce to model $M_{2}$ , since only the coefficient estimate of $Z$ is significant in them (those of $X$ and $W$ are not). Similarly, $M_{5}$ reduces to $M_{3}$ , as the coefficient estimate of $X$ is not significant. We then focus our analysis on models $M_{1}$ , $M_{2}$ and $M_{3}$ and proceed to their diagnostics. Results are reported in Table 3.

Table 3. Regression diagnostics

From Table 3, we accept the homoscedasticity of the residuals and their non-correlation with the predictors. In addition, the residuals of the three models have zero mean. Thus, our three models meet the standard assumptions of linear regression.

$M_{1}$ , $M_{2}$ and $M_{3}$ are non-nested models. Thus, the choice of one model will be based on the encompassing test. A necessary condition is that the encompassing model should fit better than the encompassed model. Therefore, the encompassing model is expected to have a smaller error variance than its rival. The standard errors of models $M_{i}$ , $i = 1, 2, 3$ are $σ_{1} = 0.192$ , $σ_{2} = 0.179$ and $σ_{3} = 0.182$ , respectively. Hence, among the three models, $M_{1}$ has the worst fit and $M_{2}$ has the best fit. We report in Table 4 various encompassing tests associated with models $M_{1}$ , $M_{2}$ and $M_{3}$ .

Table 4. Encompassing tests for models $M_{1}$ , $M_{2}$ and $M_{3}$

From Table 4, we accept the nulls $M_{2} E M_{3}$ and $M_{3} E M_{1}$ , that is, $M_{2}$ encompasses $M_{3}$ and $M_{3}$ encompasses $M_{1}$ . In contrast, we reject $M_{3} E M_{2}$ and $M_{1} E M_{3}$ , so there is no mutual encompassing. Thus, we retain model $M_{2}$ , which also has the smallest standard error. We now re-examine the link between life expectancy and the explanatory variables using nearest neighbor regression.

For the $k$ -NN regression of life expectancy, we need to specify the weighting function $w (\cdot)$ and to estimate the parameter $k$ . Two weighting functions are most commonly used in the literature: the exponential function $\frac{\exp (- \| z - Z_{(i)} \|^{2})}{\sum_{j = 1}^{k} \exp (- \| z - Z_{(j)} \|^{2})}$ , with $(Z_{(i)})_{i = 1, \dots, k}$ the $k$ observations nearest to $z$ , and the uniform function $\frac{1}{k}$ . We consider both weighting functions.
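The two weighting schemes can be sketched in Python as below. The function name `knn_weights`, its arguments and the use of NumPy are assumptions for this illustration; the code simply normalises $\exp (- \| z - Z_{(i)} \|^{2})$ over the $k$ nearest observations, or assigns $\frac{1}{k}$ to each of them.

```python
import numpy as np

def knn_weights(z, Z, k, kind="uniform"):
    """Hypothetical helper returning (indices, weights) of the k nearest points.

    kind="uniform" gives weight 1/k to each neighbor;
    kind="exponential" gives exp(-||z - Z_(i)||^2), normalised to sum to one.
    """
    Z = np.asarray(Z, dtype=float)
    Z = Z.reshape(Z.shape[0], -1)
    z = np.asarray(z, dtype=float).reshape(-1)
    sq_dist = np.sum((Z - z) ** 2, axis=1)    # squared Euclidean distances
    nearest = np.argsort(sq_dist)[:k]         # indices of the k nearest points
    if kind == "uniform":
        weights = np.full(k, 1.0 / k)
    elif kind == "exponential":
        raw = np.exp(-sq_dist[nearest])
        weights = raw / raw.sum()
    else:
        raise ValueError("kind must be 'uniform' or 'exponential'")
    return nearest, weights
```

A $k$ -NN prediction at $z$ is then the weighted sum of the responses at the returned indices, for example `np.sum(weights * Y[nearest])` for a NumPy array `Y`.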

Assumption 3 states that the number $k$ should satisfy $1 < k = n^{α} < n^{\frac{4}{4 + d}}$ , for $n$ observations and $d$ explanatory variables. Then, as $n = 169$ , the maximum values for $k$ are 60, 30, and 18 for $d = 1$ , $d = 2$ and $d = 3$ , respectively. We estimate this parameter $k$ by minimizing the root mean squared error (RMSE). Results are summarized in Table 5, where we keep the notation already used in linear regression: $X$ for health expenditure per capita, $Z$ for gross income per capita and $W$ for GDP per capita.
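The upper bounds on $k$ quoted above follow directly from Assumption 3 and can be checked with a few lines of Python. The helper name `max_neighbors` is hypothetical; it returns the largest integer strictly below $n^{\frac{4}{4 + d}}$ .

```python
import math

def max_neighbors(n, d):
    """Largest integer k with k < n ** (4 / (4 + d)), as in Assumption 3."""
    bound = n ** (4.0 / (4.0 + d))
    return math.ceil(bound) - 1

# For n = 169 the bounds are roughly 60.6, 30.6 and 18.8 for d = 1, 2, 3.
limits = [max_neighbors(169, d) for d in (1, 2, 3)]
```

In a search for $k$ by RMSE minimisation, these values would serve as the upper end of the grid for each number of regressors.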

Table 5. Specification of $k$ -NN regression estimates

Among models $M_{7}$ to $M_{11}$ , model $M_{10}$ has the lowest standard error. We also remark that the standard errors of models $M_{9}$ and $M_{10}$ are very close. We will check whether model $M_{10}$ can account for the results of the other models and whether there is mutual encompassing between $M_{10}$ and $M_{9}$ . We now compute the following standardized encompassing statistic using the result of Theorem 3.1:

δ_{s} = \frac{\sqrt{k - 1} \hat{δ}}{\sqrt{\frac{π^{q / 2}}{Γ ((q + 2) / 2)} \, \mathrm{Var} (ϵ | Z = z) \int w^{2} (u) d u}}

where $\hat{δ}$ is the $k$ -NN regression of the residuals $ϵ$ of the owner model on the explanatory variables $Z$ of the rival model. Results are reported in Table 6.
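A minimal Python sketch of this standardization and of the comparison with the normal critical value is given below. The function names are hypothetical, `asy_var` stands for the whole denominator variance $c \, \mathrm{Var} (ϵ | Z = z) \int w^{2} (u) d u$ and is assumed to be computed or estimated beforehand, and the numbers in the example are invented.

```python
import math

def standardized_knn_statistic(delta_hat, k, asy_var):
    """sqrt(k - 1) * delta_hat divided by the asymptotic standard deviation."""
    if k < 2 or asy_var <= 0.0:
        raise ValueError("need k >= 2 and a positive asymptotic variance")
    return math.sqrt(k - 1) * delta_hat / math.sqrt(asy_var)

def accept_encompassing(stat, critical=1.96):
    """Two-sided decision at the 5% level: accept the null if |stat| < 1.96."""
    return abs(stat) < critical

# Invented inputs: delta_hat = 0.1, k = 26 neighbors, asymptotic variance 0.04.
stat = standardized_knn_statistic(0.1, 26, 0.04)
decision = accept_encompassing(stat)
```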

Table 6. Encompassing tests for models $M_{7}$ to $M_{11}$

Values in Table 6 are all less than 1.96 in absolute value, except for $M_{9} E M_{10}$ . We accept the null hypotheses $M_{10} E M_{8}$ , $M_{8} E M_{7}$ , $M_{10} E M_{9}$ and $M_{10} E M_{11}$ . In other words, $M_{10}$ can account for the information content of the other models. As $M_{9}$ does not encompass $M_{10}$ , there is no mutual encompassing. Thus, we can retain model $M_{10}$ among all $k$ -NN regression models.

The next illustration concerns the encompassing test between nonparametric and parametric regression techniques in Theorem 3.3, with the null hypothesis that the nearest neighbor regression $M_{10}$ encompasses the linear regression $M_{2}$ . Under this null, we have the following statistic from Theorem 3.3:

δ_{S} = \frac{\sqrt{n} \hat{δ}}{\sqrt{\hat{Ω}}} = \frac{\sqrt{n} {(\sum_{i = 1}^{169} Z_{i}^{2})}^{- 1} (\sum_{i = 1}^{169} {\hat{e}}_{i} Z_{i})}{\sqrt{{\hat{σ}}^{2} {(\frac{1}{n} \sum_{i = 1}^{169} Z_{i}^{2})}^{- 1}}} = \frac{\sum_{i = 1}^{169} {\hat{e}}_{i} Z_{i}}{\sqrt{{\hat{σ}}^{2} (\sum_{i = 1}^{169} Z_{i}^{2})}},

(10)

where $\hat{Ω}$ is an estimate of the asymptotic variance $Ω$ , the ${\hat{e}}_{i}$ are the residuals of model $M_{10}$ and ${\hat{σ}}^{2}$ is a $k$ -NN regression estimate of the conditional variance $σ^{2} = \mathrm{Var} (Y | X = x, Z = z)$ .
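The last expression in (10) is simple to evaluate once the residuals and the variance estimate are available. The sketch below assumes NumPy, a scalar regressor $Z$ as in (10), and a hypothetical function name; the inputs in the example are invented and do not reproduce the value reported in the paper.

```python
import numpy as np

def np_vs_linear_statistic(residuals, Z, sigma2_hat):
    """Statistic (10): sum(e_i * Z_i) / sqrt(sigma2_hat * sum(Z_i ** 2)).

    residuals: residuals e_i of the nonparametric owner model,
    Z: scalar regressor of the linear rival model,
    sigma2_hat: estimate of the conditional variance (supplied by the caller).
    """
    e = np.asarray(residuals, dtype=float)
    Z = np.asarray(Z, dtype=float)
    if sigma2_hat <= 0.0:
        raise ValueError("sigma2_hat must be positive")
    return float(np.sum(e * Z) / np.sqrt(sigma2_hat * np.sum(Z ** 2)))

# Invented inputs for illustration only.
example_stat = np_vs_linear_statistic([1.0, -1.0, 2.0], [1.0, 1.0, 1.0], 3.0)
```

The factor $\sqrt{n}$ does not appear in the code because it cancels between the numerator and the denominator of (10).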

The absolute value of the standardized encompassing statistic $δ_{S} = - 0.01$ is less than $1.96$ . Therefore, we accept the null hypothesis at the $5 \%$ risk level, i.e. the nearest neighbor regression $M_{10}$ encompasses the linear regression $M_{2}$ . We conclude that we may retain the $k$ -NN regression of life expectancy on health expenditure and income.

5. Conclusion

We know that the different approaches to encompassing tests in the literature provide different results. We have considered the encompassing test in an asymptotic framework, which is in line with the encompassing principle presented in the introduction. The work has been conducted for parametric and nonparametric regression techniques.

As stated in Hendry et al. (2008), the work of Bontemps et al. (2008) is the initial treatment of encompassing tests for functional parameters based on nonparametric methods. We have extended that work to the nearest neighbor functional parameter estimate under the i.i.d. assumption. Using linear and nearest neighbor regressions as estimators of conditional expectations, we have established asymptotic normality of the associated encompassing statistics for independent processes.

Comparing the convergence rate of the encompassing statistic for the $k$ -NN regression estimate with that for kernel regression obtained by Bontemps et al. (2008), the former depends only on the number of neighbors $k$ , while the latter depends on the number of observations $n$ and the bandwidth $h_{n}$ . The two rates coincide when $h_{n} = k / n$ .

Moreover, the asymptotic variance of the encompassing statistic obtained by Bontemps et al. (2008) for kernel regression depends on the density, which is not the case for the nearest neighbor regression estimate.

The development of encompassing tests for nonparametric methods opens new research directions in theory as well as in practice.

Acknowledgments

The author thanks the anonymous referees and the Editor Professor Hiroshi Shiraishi.

Additional information

Funding

The author received no direct funding for this research.

Notes on contributors

Patrick Rakotomarolahy

Patrick Rakotomarolahy is an assistant professor in the department of mathematics and their applications at Fianarantsoa University. He completed his BSc and MSc in applied mathematics and received his doctorate from the Panthéon-Sorbonne Paris 1 University. His current research is in statistical model selection and in modeling macroeconomic and financial variables. He focuses especially on model selection between parametric and nonparametric techniques. This study is in line with this direction, as the findings on the asymptotic behavior of encompassing tests allow us to detect redundant models.

Notes

1. Kullback-Leibler Information Criterion

References

Ali, A., & Ahmad, K. (2014). The impact of socio-economic factors on life expectancy in sultanate of Oman: An empirical analysis. Middle-East Journal of Scientific Research, 22(2), 218–12. https://www.idosi.org/mejsr/mejsr22(2)14/8.pdf
Bontemps, C., Florens, J. P., & Richard, J. F. (2008). Parametric and non-parametric encompassing procedures. Oxford Bulletin of Economics and Statistics, 70(1), 751–780. https://doi.org/10.1111/j.1468-0084.2008.00529.x
Cheng, P. E. (1984). Strong consistency of nearest neighbor regression function estimators. Journal of Multivariate Analysis, 15(1), 63–72. https://doi.org/10.1016/0047-259X(84)90067-8
Ferrara, L., Guégan, D., & Rakotomarolahy, P. (2010). GDP nowcasting with ragged-edge data: A semi-parametric modeling. Journal of Forecasting, 29(1–2), 186–199. https://doi.org/10.1002/for.1159
Florens, J. P., Hendry, D. F., & Richard, J. F. (1996). Encompassing and specificity. Econometric Theory, 12(4), 620–656. https://doi.org/10.1017/S0266466600006964
Gouriéroux, C., & Monfort, A. (1995). Testing, encompassing, and simulating dynamic econometric models. Econometric Theory, 11(2), 195–228. https://doi.org/10.1017/S0266466600009142
Gouriéroux, C., Monfort, A., & Trognon, A. (1983). Testing nested or non-nested hypotheses. Journal of Econometrics, 21(1), 83–115. https://doi.org/10.1016/0304-4076(83)90121-5
Govaerts, B., Hendry, D. F., & Richard, J. F. (1994). Encompassing in stationary linear dynamic models. Journal of Econometrics, 63(1), 245–270. https://doi.org/10.1016/0304-4076(93)01567-6
Guégan, D., & Huck, N. (2005). On the use of nearest neighbors in finance. Revue De Finance, 26(2), 67–86. https://www.cairn.info/revue-finance-2005-2-page-67.htm#
Guégan, D., & Rakotomarolahy, P. (2010). A short note on the nowcasting and the forecasting of Euro-area GDP using non-parametric techniques. Economics Bulletin, 30(1), 508–518. http://www.accessecon.com/Pubs/EB/2010/Volume30/EB-10-V30-I1-P46.pdf
Hendry, D. F., & Doornik, J. A. (1994). Modelling linear dynamic econometric systems. Scottish Journal of Political Economy, 41(1), 1–33. https://doi.org/10.1111/j.1467-9485.1994.tb01107.x
Hendry, D. F., Marcellino, M., & Mizon, G. E. (2008). Encompassing. Oxford Bulletin of Economics and Statistics, Guest Editor Introduction. Special Issue
Hussain, A. R. (2002). Life expectancy in developing countries: A cross-section analysis. The Bangladesh Development Studies, 28(1/2), 161–178.
Mack, Y. P. (1981). Local Properties of k-NN Regression Estimates. SIAM Journal on Algebraic and Discrete Methods, 2(3), 311–323. https://doi.org/10.1137/0602035
Mizon, G. E. (1984). The encompassing approach in econometrics. In D. F. Hendry & K. F. Wallis (Eds.), Econometrics and quantitative economics (pp. 135–172). Blackwell Publishers.
Mizon, G. E., & Richard, J. F. (1986). The encompassing principle and its application to non-nested hypothesis tests. Econometrica, 54(3), 657–678. https://doi.org/10.2307/1911313
Mukerjee, H. (1993). Nearest neighbor regression with heavy-tailed errors. Annals of Statistics, 21(2), 681–693. https://doi.org/10.1214/aos/1176349144
Nowman, B., & Saltoglu, B. (2003). Continuous time and nonparametric modelling of U.S. interest rate models. International Review of Financial Analysis, 12(1), 25–34. https://doi.org/10.1016/S1057-5219(02)00123-0
Puspitasari, D. A., & Rustam, Z. (2018). Application of SVM-KNN using SVR as feature selection on stock analysis for indonesia stock exchange. AIP Conference Proceedings 2023, Bali, Indonesia, 020207; https://doi.org/10.1063/1.5064204.
Sawa, T. (1978). Information criteria for discriminating among alternative regression models. Econometrica, 46(6), 1273–1292. https://doi.org/10.2307/1913828
