Abstract
The power-expected-posterior (PEP) prior is used in this paper to compare nested linear models. The asymptotic behaviour of the method is investigated for different values of the power parameter of the prior. Focus is given to the consistency of the Bayes factor when comparing the full model $M_p$ with a generic submodel $M_\ell$. In each case, we allow the true generating model to be either $M_p$ or $M_\ell$, and we keep the dimension of $M_\ell$ fixed, while the dimension of $M_p$ can be either fixed or grow with $n$, where $n$ denotes the sample size.
1. Introduction
Pérez and Berger (2002) developed priors for objective Bayesian model comparison through the device of 'imaginary training samples'. The expected-posterior prior (EPP) for the parameters of a model $M_\ell$ is the expectation of the posterior distribution of those parameters given imaginary observations $y^*$ of size $n^*$. The expectation is taken with respect to a suitable probability measure of a reference model $M_0$, while the posterior distribution is computed via Bayes's theorem starting from a default, typically improper, prior. One of the advantages of using EPPs is that the impropriety of the baseline priors causes no indeterminacy in the computation of Bayes factors. On the other hand, EPPs depend on the training sample size $n^*$; in variable selection problems in particular, imaginary design matrices must also be introduced under each competing model, and the resulting prior therefore further depends on this choice (for a detailed discussion of this issue, see Fouskakis, Ntzoufras, & Draper, 2015). The selection of a minimal training sample has been proposed (see, for example, Berger & Pericchi, 2004) to make the information content of the prior as small as possible, and this is an appealing idea. But even under this set-up, the resulting prior can be influential when the sample size $n$ is not much larger than the total number of parameters of the full model (see Fouskakis et al., 2015).
The power-expected-posterior (PEP) prior, introduced by Fouskakis et al. (2015), is an objective prior which amalgamates ideas from the power prior (Ibrahim & Chen, 2000), the expected-posterior prior (Pérez & Berger, 2002) and the unit-information-prior approach of Kass and Wasserman (1995) to simultaneously (a) produce a minimally informative prior and (b) diminish the effect of training samples under the EPP methodology. The main idea is to substitute the likelihood in the EPP by a density-normalised version of a power-likelihood. Fouskakis et al. (2015) and Fouskakis and Ntzoufras (2016b) studied the PEP priors in detail for the variable selection problem in Gaussian regression models. In the first paper, they introduced the PEP prior by considering as parameters of interest both the coefficients of the model and the error variance, while in the second paper they studied the conditional version of PEP, named PCEP, where only the coefficients are treated as parameters of interest and the error variance as a common nuisance parameter. Here we focus on the former case. Under this approach, for every model $M_\ell$ in $\mathcal{M}$ (the set of all models under consideration) the sampling distribution is specified by (1) $Y_n \mid X_\ell, \beta_\ell, \sigma^2_\ell; M_\ell \sim N_n(X_\ell \beta_\ell,\, \sigma^2_\ell I_n)$, where $Y_n$ is a vector containing the responses for all $n$ subjects, $X_\ell$ is an $n \times k_\ell$ design matrix containing the values of the explanatory variables of $M_\ell$ in its columns, $I_n$ is the $n \times n$ identity matrix, $\beta_\ell$ is a vector of length $k_\ell$ summarising the effects of the covariates in model $M_\ell$ on the response, and $\sigma^2_\ell$ is the error variance for model $M_\ell$. Finally, by $p$ we denote the total number of explanatory variables under consideration and by $M_p$ the full model, including all $p$ covariates.
Furthermore, we denote by $\pi^N_\ell(\beta_\ell, \sigma^2_\ell)$ the baseline prior for the parameters of model $M_\ell$. Here we use the independence Jeffreys prior (or reference prior) as the baseline prior distribution. Hence, for any $M_\ell \in \mathcal{M}$, we have (2) $\pi^N_\ell(\beta_\ell, \sigma^2_\ell) = c_\ell / \sigma^2_\ell$, where $c_\ell$ is an unknown normalising constant.
We assume that in $\mathcal{M}$ there exists a model $M_0$, with parameters $\beta_0$ and $\sigma^2_0$, sampling distribution of the form (1) and baseline prior of the form (2), which is nested in each of the remaining models, and we consider it as the reference model. This is the typical case in the variable selection problem studied in this paper. Given then a set of imaginary data $y^*$ of size $n^*$ and a positive power parameter δ, used essentially to regulate the contribution of the imaginary data to the 'final' prior, we introduce the density-normalised power-likelihood under model $M_\ell$, given by (3) $f(y^* \mid \beta_\ell, \sigma^2_\ell, \delta; M_\ell) = \dfrac{f(y^* \mid \beta_\ell, \sigma^2_\ell; M_\ell)^{1/\delta}}{\int f(y^* \mid \beta_\ell, \sigma^2_\ell; M_\ell)^{1/\delta}\, dy^*} = N_{n^*}(X^*_\ell \beta_\ell,\, \delta\,\sigma^2_\ell I_{n^*})$. The above density-normalised power-likelihood is still a normal distribution, with variance inflated by a factor of δ; here $X^*_\ell$ denotes the imaginary design matrix under model $M_\ell$. In a similar manner, under the reference model, the density-normalised power-likelihood takes the form of (3), but using now the likelihood of $M_0$.
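The variance-inflation property just stated can be checked numerically in the one-dimensional case. The following sketch (plain NumPy; the values of μ, σ and δ are arbitrary illustrations, not taken from the paper) raises a normal density to the power 1/δ, renormalises it on a grid, and compares the result with the $N(\mu, \delta\sigma^2)$ density.

```python
import numpy as np

mu, sigma, delta = 1.5, 2.0, 4.0  # illustrative values only

# Wide, fine grid so that the numerical normalisation is accurate
y = np.linspace(mu - 40.0, mu + 40.0, 200001)
dy = y[1] - y[0]

def normal_pdf(x, m, s2):
    return np.exp(-(x - m) ** 2 / (2.0 * s2)) / np.sqrt(2.0 * np.pi * s2)

# Power-likelihood f(y)^{1/delta}, density-normalised on the grid
powered = normal_pdf(y, mu, sigma ** 2) ** (1.0 / delta)
powered /= powered.sum() * dy

# Claimed closed form: a normal density with variance inflated by delta
target = normal_pdf(y, mu, delta * sigma ** 2)

print(bool(np.max(np.abs(powered - target)) < 1e-6))  # prints: True
```

The agreement is exact up to numerical integration error, since raising the Gaussian kernel to the power 1/δ multiplies the exponent's denominator by δ, and the normalising constant is restored by the division.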
In order to apply the PEP methodology, the density-normalised power-likelihood (3) is used to evaluate, under the imaginary data and the baseline prior, the prior predictive distribution of each model, as well as the posterior distribution of the parameters of model $M_\ell$, given in (4), where (5) is the prior predictive distribution of model $M_\ell$ for $y^*$.
Finally, the imposed prior for the parameters of any model $M_\ell \in \mathcal{M}$ has the form given in (6). The default choice for δ is to set it equal to $n^*$, i.e. the sample size of the imaginary data, so that the overall information of the imaginary data in the posterior is equal to one data point. Furthermore, setting $n^* = n$ and, consequently, $X^*_\ell = X_\ell$ simplifies significantly the overwhelming computations required when considering all possible 'minimal' training samples (Pérez & Berger, 2002), while it also avoids the complicated issue (in some cases) of defining the size of the minimal training samples (Berger & Pericchi, 2004). In addition, under the choice $n^* = n$, the PEP prior remains relatively non-informative even for models with dimension close to the sample size $n$, while the effect on the evaluation of each model is minimal, since the resulting Bayes factors are robust over different values of $n^*$. Detailed information about the default specifications of the PEP prior is provided in Fouskakis et al. (2015). Finally, the null model (with no explanatory variables) is a standard choice for the reference model in regression problems; see, for example, Pérez and Berger (2002). In the above definition of the PEP prior, the power parameter can also be model dependent, denoted then by $\delta_\ell$.
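For concreteness, the construction described above can be written out. The following is a sketch of the standard PEP formulas, in the form given by Fouskakis et al. (2015), using the notation introduced here (with $\theta_\ell = (\beta_\ell, \sigma^2_\ell)$); it stands in for the equations (4)–(6) referenced in the text.

```latex
% Posterior of the parameters of M_l given the imaginary data y* (cf. Eq. (4)):
\pi^{N}_{\ell}(\theta_\ell \mid y^{*}; \delta)
   = \frac{f(y^{*} \mid \theta_\ell, \delta; M_\ell)\, \pi^{N}_{\ell}(\theta_\ell)}
          {m^{N}_{\ell}(y^{*} \mid \delta)},
\quad \text{with prior predictive (cf. Eq. (5))}
\quad m^{N}_{\ell}(y^{*} \mid \delta)
   = \int f(y^{*} \mid \theta_\ell, \delta; M_\ell)\, \pi^{N}_{\ell}(\theta_\ell)\, d\theta_\ell .

% PEP prior: the posterior above, averaged over imaginary data generated
% from the prior predictive of the reference model M_0 (cf. Eq. (6)):
\pi^{PEP}_{\ell}(\theta_\ell)
   = \int \pi^{N}_{\ell}(\theta_\ell \mid y^{*}; \delta)\, m^{N}_{0}(y^{*} \mid \delta)\, dy^{*} .
```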
Fouskakis and Ntzoufras (2016a) proved the consistency of the Bayes factor under the PEP methodology, with the independence Jeffreys as baseline prior, for Gaussian linear models, under very mild conditions on the design matrix, when the dimension of each model is fixed, the size of the training sample is equal to the sample size $n$ and the power parameter is also set equal to $n$. In a similar manner as in Fouskakis and Ntzoufras (2016a), when comparing the full model $M_p$ to a reduced model $M_\ell$, the Bayes factor under the PEP prior is given by (7), with $RSS_j$ denoting the residual sum of squares of model $M_j$ ($j \in \{\ell, p\}$). For large $n$, we can approximate the Bayes factor given in (7) as in (8) if $p$ is a fixed constant, and as in (9) if $p$ increases to infinity with $n$, at a suitable rate (for a detailed proof of (8) and (9) see Innocent, 2016).
In the rest of the paper, we denote by $M_T$ the 'true' model and by $H_j = X_j (X_j^\top X_j)^{-1} X_j^\top$ the hat matrix of model $M_j$ (see Casella, Girón, Martínez, & Moreno, 2009). Since the reduced model $M_\ell$ is nested in the full model $M_p$, we have that $RSS_p \le RSS_\ell$.
Finally, the following results hold, as $n$ increases, with respect to the distribution and the limiting behaviour of the statistic (see Girón, Moreno, & Casella, 2010):
If and :
When sampling from model , the distribution of the statistic is the central beta distribution and
When sampling from model , the distribution of the statistic is the non-central beta distribution and with
If and with
When sampling from model , the distribution of the statistic is the central beta distribution and
When sampling from model the distribution of the statistic is the non-central beta distribution and where
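The limiting behaviour described above can be illustrated by simulation. The sketch below assumes the statistic in question is the ratio $RSS_p/RSS_\ell$ (as in Girón et al., 2010); for nested Gaussian linear models this ratio follows, under the reduced model, a $\mathrm{Beta}\big((n-p)/2,\,(p-\ell)/2\big)$ distribution with mean $(n-p)/(n-\ell)$, which tends to 1 as $n$ grows. The model dimensions and coefficients are illustrative choices, not values from the paper.

```python
import numpy as np

rng = np.random.default_rng(0)
n, ell, p = 200, 2, 6  # illustrative sample size and model dimensions

def rss(X, y):
    """Residual sum of squares of the least-squares fit of y on X."""
    beta_hat = np.linalg.lstsq(X, y, rcond=None)[0]
    return float(np.sum((y - X @ beta_hat) ** 2))

X = rng.standard_normal((n, p))
X[:, 0] = 1.0  # intercept column
beta_reduced = np.array([1.0, 0.5, 0.0, 0.0, 0.0, 0.0])  # only M_ell's effects non-zero

ratios = []
for _ in range(2000):
    y = X @ beta_reduced + rng.standard_normal(n)  # data generated under M_ell
    ratios.append(rss(X, y) / rss(X[:, :ell], y))  # RSS_p / RSS_ell
ratios = np.array(ratios)

# Under M_ell: RSS_p/RSS_ell ~ Beta((n-p)/2, (p-ell)/2), mean (n-p)/(n-ell)
print(round(float(ratios.mean()), 3), round((n - p) / (n - ell), 3))
```

Repeating the experiment with data generated under the full model shows the ratio concentrating strictly below 1, in line with the non-central case described above.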
In this paper, we examine the consistency of the Bayes factor for nested normal linear models under the PEP methodology, using the pair of models $M_\ell$ and $M_p$. The number of parameters of the simpler model $M_\ell$ is always fixed, while that of the full model $M_p$ is allowed to grow with the sample size. We investigate the effect of the power parameter δ by examining four different scenarios. In each case, the 'true' model is set equal to either $M_\ell$ or $M_p$.
2. Bayes factor consistency under power-expected-posterior priors
In what follows we set the size of the training sample equal to the sample size $n$, as in Fouskakis et al. (2015).
2.1. When the power δ = n
First, we consider the case where the power parameter is set equal to the sample size $n$, and study the consistency when the dimension $p$ of the full model is either a fixed constant or grows to infinity.
Then (7) becomes (10).
2.1.1. When the dimensions of both models are fixed
Theorem 2.1
Let the sample size $n$ increase, remaining strictly greater than the dimension of the full model $M_p$. Furthermore, suppose that the dimensions of both models under consideration are fixed non-negative natural numbers, with $\ell < p$. Then, under the stated condition, when sampling from model $M_j$, where $j$ is either $\ell$ or $p$, we have:
Proof.
For $\delta = n$, (8) becomes (11).
(a) Suppose that the Reduced Model is true
Using the asymptotic results given in Section 1, (11) becomes (12). Since $p$ and $\ell$ are constants and $n$ goes to infinity, the stated limit follows. Thus, the Bayes factor of the full model $M_p$ versus the reduced model $M_\ell$ is consistent under the reduced model $M_\ell$.
(b) Suppose that the Full Model is true
Using the asymptotic results given in Section 1, (11) becomes (13), and the stated limit follows as $n \to \infty$. Therefore, the Bayes factor of the full model $M_p$ versus the reduced model $M_\ell$ is consistent when sampling from the full model $M_p$.
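The two cases of the proof can be mimicked numerically. Since the exact PEP expressions are not reproduced here, the sketch below uses a BIC-type surrogate for $2\log BF_{p\ell}$, namely $n\log(RSS_\ell/RSS_p) - (p-\ell)\log n$ (an assumption of this illustration: it shares the leading-order behaviour of consistent Bayes factors for fixed dimensions, not the exact constants of (11)). Under $M_\ell$ the first term stays bounded in probability while the penalty grows, so the surrogate diverges to $-\infty$; under $M_p$ the first term grows linearly in $n$.

```python
import numpy as np

rng = np.random.default_rng(1)
ell, p = 2, 6  # illustrative fixed model dimensions

def log_bf_surrogate(n, beta_true):
    """BIC-type surrogate for 2*log BF(M_p vs M_ell): n*log(RSS_ell/RSS_p) - (p-ell)*log(n)."""
    X = rng.standard_normal((n, p))
    X[:, 0] = 1.0  # intercept column
    y = X @ beta_true + rng.standard_normal(n)
    def rss(Z):
        b = np.linalg.lstsq(Z, y, rcond=None)[0]
        return float(np.sum((y - Z @ b) ** 2))
    return n * np.log(rss(X[:, :ell]) / rss(X)) - (p - ell) * np.log(n)

beta_reduced = np.array([1.0, 0.5, 0.0, 0.0, 0.0, 0.0])  # M_ell true
beta_full = np.array([1.0, 0.5, 0.5, 0.5, 0.5, 0.5])     # M_p true

for n in (100, 1000, 10000):
    print(n,
          round(log_bf_surrogate(n, beta_reduced), 1),  # drifts to -infinity: against M_p
          round(log_bf_surrogate(n, beta_full), 1))     # grows to +infinity: for M_p
```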
2.1.2. When the dimension of the full model grows with $n$
Theorem 2.2
Let $\delta = n$ and suppose that the reduced model $M_\ell$ has a fixed number of parameters as the sample size $n$ increases, while in the full model $M_p$ the number of parameters increases with $n$ at the stated rate. Then:
When sampling from model
When sampling from model for some function given by .
Proof.
By substituting these choices of δ and $p$ into (9), we obtain (14).
(a) Suppose that the Reduced Model is true
Using the asymptotic results given in Section 1, (14) becomes the expression above, so that for large values of $p$ we have (15). In both cases, for large $p$, we get the stated limit. Thus the Bayes factor of the full model $M_p$ against the reduced model $M_\ell$ is consistent under the reduced model $M_\ell$.
(b) Suppose that the Full Model is true
Using the asymptotic results given in Section 1, (14) becomes the expression above, so that for large $p$ the stated approximation holds. Solving the resulting equation for ε and using the function defined in the statement of the theorem, we conclude that the Bayes factor of the full model $M_p$ versus the reduced model $M_\ell$ is consistent under the full model $M_p$ if and only if the stated condition holds, when $r$ is large and goes to infinity.
2.2. When the power
Second, we consider the case where the power parameter takes the value specified above, and study the consistency when the dimension $p$ of the full model is either a fixed constant or grows to infinity. Then (7) becomes:
2.2.1. When the dimensions of both models are fixed
Let the sample size $n$ increase, remaining strictly greater than the dimension of the full model $M_p$. Furthermore, suppose that the dimensions of both models under consideration are fixed non-negative natural numbers, with $\ell < p$.
For this choice of δ, (8) becomes an expression which, since $p$ and $\ell$ are fixed constants, reduces for large values of $n$ to (16). Working as in the proof of Theorem 2.1, we conclude that the Bayes factor of the full model $M_p$ versus the reduced model $M_\ell$ is consistent when sampling from either model.
2.2.2. When the dimension of the full model grows with $n$
Theorem 2.3
Let δ be as specified above and suppose that the reduced model $M_\ell$ has a fixed number of parameters as the sample size $n$ increases, while in the full model $M_p$ the number of parameters increases with $n$ at the stated rate. Then:
When sampling from model
When sampling from model for some function given by
Proof.
By substituting these choices of δ and $p$ into (9), we obtain (17).
(a) Suppose that the Reduced Model is true
Using the asymptotic results given in Section 1, (17) becomes the expression above, so that for large values of $p$ the stated approximation holds. In both cases, for large $p$, we get the stated limit. Thus the Bayes factor of the full model $M_p$ against the reduced model $M_\ell$ is consistent under the reduced model $M_\ell$.
(b) Suppose that the Full Model is true
Using the asymptotic results given in Section 1, (17) becomes (18), so that for large $p$ the stated approximation holds. Working as in the proof of Theorem 2.2, we conclude that the Bayes factor of the full model $M_p$ versus the reduced model $M_\ell$ is consistent under the full model $M_p$ if and only if the stated condition holds, when $r$ is large and goes to infinity.
2.3. When the power δ equals the dimension of the full model
Third, we consider the case where the power is equal to the dimension of the full model $M_p$, and study the consistency when the dimension of the full model is either a fixed constant or grows to infinity.
Under this set-up, (7) becomes:
2.3.1. When the dimensions of both models are fixed
Theorem 2.4
Let δ equal the dimension of the full model, let the sample size $n$ increase, remaining strictly greater than the dimension of $M_p$, and suppose that the dimensions of both models under consideration are fixed non-negative natural numbers, with $\ell < p$. Then, when sampling from model $M_j$, where $j$ is either $\ell$ or $p$, we have:
Proof.
For this choice of δ, (8) becomes (19). Then we consider the following two cases.
(a) Suppose that the Reduced Model is true
Using the asymptotic results given in Section 1, (19) becomes the expression above. Since $p$ and $\ell$ are constants, the stated limit follows. Thus, the Bayes factor of the full model $M_p$ versus the reduced model $M_\ell$ is inconsistent under the reduced model $M_\ell$.
(b) Suppose that the Full Model is true
Using the asymptotic results given in Section 1, (19) becomes the expression above, and the stated limit follows. Therefore, the Bayes factor of the full model $M_p$ versus the reduced model $M_\ell$ is consistent when sampling from the full model $M_p$.
2.3.2. When the dimension of the full model grows with $n$
Theorem 2.5
Let δ equal the dimension of the full model and suppose that the reduced model $M_\ell$ has a fixed number of parameters as the sample size $n$ increases, while in the full model $M_p$ the number of parameters increases with $n$ at the stated rate. Then:
When sampling from model
When sampling from model for some function given by .
Proof.
By substituting these choices of δ and $p$ into (9), we obtain (20).
(a) Suppose that the Reduced Model is true
Using the asymptotic results given in Section 1, (20) becomes the expression above, so that for large values of $p$ we have (21). In both cases, for large $p$, we get the stated limit. Thus the Bayes factor of the full model $M_p$ against the reduced model $M_\ell$ is consistent under the reduced model $M_\ell$.
(b) Suppose that the Full Model is true
Using the asymptotic results given in Section 1, (20) becomes the expression above, so that for large $p$ the stated approximation holds. Solving the resulting equation for ε and using the function defined in the statement of the theorem, we conclude that the Bayes factor of the full model $M_p$ versus the reduced model $M_\ell$ is consistent under the full model $M_p$ if and only if the stated condition holds, when $r$ is large and goes to infinity.
2.4. When the power δ is a fixed constant
Finally, we consider the case where the power parameter is set equal to a fixed non-negative constant δ, and study the consistency when the dimension $p$ of the full model is either a fixed constant or grows to infinity.
Then (7) becomes (22).
2.4.1. When the dimensions of both models are fixed
Theorem 2.6
Let the sample size $n$ increase, remaining strictly greater than the dimension of the full model $M_p$. Furthermore, suppose that the dimensions of both models under consideration are fixed non-negative natural numbers, with $\ell < p$. Then, under the stated condition, when sampling from model $M_j$, where $j$ is either $\ell$ or $p$, we have:
Proof.
For fixed δ, (8) becomes (23). Then we consider the following two cases.
(a) Suppose that the Reduced Model is true
Using the asymptotic results given in Section 1, (23) becomes the expression above. Since $p$ and $\ell$ are constants, if δ is large we obtain the first stated limit, while if δ is not large we obtain the second. Thus, the Bayes factor of the full model $M_p$ versus the reduced model $M_\ell$ is consistent under the reduced model $M_\ell$ only for large values of δ.
(b) Suppose that the Full Model is true
Using the asymptotic results given in Section 1, (23) becomes the expression above, and the stated limit follows. Therefore, the Bayes factor of the full model $M_p$ versus the reduced model $M_\ell$ is consistent when sampling from the full model $M_p$.
2.4.2. When the dimension of the full model grows with $n$
Theorem 2.7
Let δ be a fixed constant and suppose that the reduced model $M_\ell$ has a fixed number of parameters as the sample size $n$ increases, while in the full model $M_p$ the number of parameters increases with $n$ at the stated rate. Then:
When sampling from model for a continuous and decreasing function .
When sampling from model for a continuous function
Proof.
By substituting these choices of δ and $p$ into (9), we obtain (24).
(a) Suppose that the Reduced Model is true
Using the asymptotic results given in Section 1, (24) becomes the expression above. We consider the following cases:
If then Thus for any r>1
If for large values of p we get Then if
Thus, the Bayes factor of the full model $M_p$ versus the reduced model $M_\ell$ is consistent under the reduced model $M_\ell$ if and only if the power δ satisfies the stated condition.
(b) Suppose that the Full Model is true
Using the asymptotic results given in Section 1, (24) becomes the expression above. We consider the following cases:
If then and for large values of δ we have while if δ is not large
If , for large value p we have Then if
Thus, the Bayes factor of the full model $M_p$ versus the reduced model $M_\ell$ is inconsistent under the full model $M_p$ either in the first case, or in the second case when δ is small.
3. Summary and conclusions
In this paper, we examined the asymptotic behaviour of the power-expected-posterior methodology when comparing nested normal linear models. Emphasis was given to the consistency of the Bayes factor of the full model $M_p$ versus a generic submodel $M_\ell$. The number of parameters of the simpler model $M_\ell$ was always kept fixed, while that of the full model $M_p$ was allowed to grow with the sample size. We investigated the effect of the prior power parameter δ by examining four different scenarios. In each case, the 'true' model was set equal to either $M_\ell$ or $M_p$. Tables 1–3 summarise our findings.
Table 1. Consistency of when model has dimension and .
Table 2. Consistency of when model has dimension and .
Table 3. Consistency of when model has dimension and .
The consistency properties of the power-expected-posterior (PEP) prior Bayes factors are eminently reasonable, assuming that we are sampling from one of the candidate models. The Bayes factor is always consistent for fixed dimensions of the candidate models; and even in the difficult situation in which the alternative model can grow with the sample size, in the settings described in Tables 1–3 the PEP Bayes factor is consistent, unless the alternative model is extremely close to the null model, in which case, we conjecture, the lack of consistency is not a critical issue, at least for prediction purposes.
Disclosure statement
No potential conflict of interest was reported by the author(s).
Additional information
Notes on contributors
D. Fouskakis
D. Fouskakis is an Associate Professor in the Department of Mathematics, at the National Technical University of Athens, in Greece. He is also the Director of the Stats Lab at the same University. His research mostly focuses on Bayesian model and variable selection, on objective priors and on stochastic optimization methods.
J. K. Innocent
J. K. Innocent received a Ph.D. in Mathematics at the University of Puerto Rico, USA, in 2016. He is currently back in Haiti, where he teaches mathematics and statistics courses at university level. His main research areas are Bayesian statistics, statistical analysis, biostatistics and epidemiology.
L. Pericchi
L. Pericchi is a Full Professor in the Department of Mathematics of the University of Puerto Rico, Río Piedras, USA. He is also the Director of the Center of Biostatistics and Bioinformatics of the College of Natural Sciences. His research is in the theory and applications of statistics, with emphasis on the Bayesian approach.
References
- Berger, J., & Pericchi, L. (2004). Training samples in objective Bayesian model selection. Annals of Statistics, 32, 841–869. doi: 10.1214/009053604000000229
- Casella, G., Girón, F. J., Martínez, M. L., & Moreno, E. (2009). Consistency of Bayesian procedures for variable selection. Annals of Statistics, 37, 1207–1228. doi: 10.1214/08-AOS606
- Fouskakis, D., & Ntzoufras, I. (2016a). Limiting behaviour of the Jeffreys power-expected-posterior Bayes factor in Gaussian linear models. Brazilian Journal of Probability and Statistics, 30, 299–320. doi: 10.1214/15-BJPS281
- Fouskakis, D., & Ntzoufras, I. (2016b). Power-conditional-expected priors: Using g-priors with random imaginary data for variable selection power-conditional-expected priors. Journal of Computational and Graphical Statistics, 25, 647–664. doi: 10.1080/10618600.2015.1036996
- Fouskakis, D., Ntzoufras, I., & Draper, D. (2015). Power-expected-posterior priors for variable selection in Gaussian linear models. Bayesian Analysis, 10, 75–107. doi: 10.1214/14-BA887
- Girón, F. J., Moreno, E., & Casella, G. (2010). Consistency of objective Bayes factors as the model dimension grows. Annals of Statistics, 38, 1937–1952. doi: 10.1214/09-AOS754
- Ibrahim, J. G., & Chen, M. H. (2000). Power prior distributions for regression models. Statistical Science, 15, 46–60. doi: 10.1214/ss/1009212673
- Innocent, J. K. (2016). Bayes factors consistency for nested linear models with increasing dimensions (Unpublished doctoral dissertation). University of Puerto Rico.
- Kass, R. E., & Wasserman, L. (1995). A reference Bayesian test for nested hypotheses and its relationship to the Schwarz criterion. Journal of the American Statistical Association, 90, 928–934. doi: 10.1080/01621459.1995.10476592
- Pérez, J. M., & Berger, J. O. (2002). Expected-posterior prior distributions for model selection. Biometrika, 89, 491–512. doi: 10.1093/biomet/89.3.491