Abstract

The Lomax distribution is an important member in the distribution family. In this paper, we systematically develop an objective Bayesian analysis of data from a Lomax distribution. Noninformative priors, including probability matching priors, the maximal data information (MDI) prior, Jeffreys prior and reference priors, are derived. The propriety of the posterior under each prior is subsequently validated. It is revealed that the MDI prior and one of the reference priors yield improper posteriors, and the other reference prior is a second-order probability matching prior. A simulation study is conducted to assess the frequentist performance of the proposed Bayesian approach. Finally, this approach along with the bootstrap method is applied to a real data set.

1. Motivation

A random variable is said to follow a Lomax distribution if its density function has the form $f (x; α, β) = \frac{α}{β} {(1 + \frac{x}{β})}^{- (α + 1)}, x > 0,$ (1) where $α > 0$ is the shape parameter and $β > 0$ is the scale parameter. The distribution was originally introduced in Lomax (1954) for the analysis of business failure data. Since then, the Lomax model has been widely applied in many other fields. For example, Atkinson and Harrison (1978) utilized the Lomax distribution to model personal wealth data; Bain and Engelhardt (1992) found that the Lomax distribution provided a good model for biomedical problems, such as survival time following a heart transplant; Holland et al. (2006) applied the Lomax distribution to model the distribution of the sizes of computer files on servers; and Marshall and Olkin (2007) showed that the Lomax distribution can be applied as a lifetime distribution. For some extensions of the Lomax distribution, the reader is referred to Nayak (1987), Roy and Gupta (1996), Nadarajah (2005), Lemonte and Cordeiro (2013), and Kang et al. (2021), among others.
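As a minimal sketch of the model in (1), the following Python functions evaluate the density, the distribution function obtained by integrating (1), and draw samples by inverting that distribution function. The function names and the use of the standard-library random generator are our own illustrative choices, not part of the original paper.

```python
import math
import random

def lomax_pdf(x, alpha, beta):
    """Density (1): (alpha/beta) * (1 + x/beta)^(-(alpha + 1)) for x > 0."""
    if x <= 0:
        return 0.0
    return (alpha / beta) * (1.0 + x / beta) ** (-(alpha + 1.0))

def lomax_cdf(x, alpha, beta):
    """Distribution function from integrating (1): 1 - (1 + x/beta)^(-alpha)."""
    if x <= 0:
        return 0.0
    return 1.0 - (1.0 + x / beta) ** (-alpha)

def lomax_sample(n, alpha, beta, rng):
    """Inverse-CDF sampling: x = beta * ((1 - u)^(-1/alpha) - 1), u ~ U[0, 1)."""
    return [beta * ((1.0 - rng.random()) ** (-1.0 / alpha) - 1.0) for _ in range(n)]
```

For instance, `lomax_sample(20, 3.0, 1.0, random.Random(0))` draws a sample of size 20 with $(α, β) = (3, 1)$.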

For the Lomax model (1), there is no closed-form expression for the classical maximum likelihood estimator (MLE). More importantly, as pointed out by Deville (2016), the MLE does not exist if the sample coefficient of variation $C V_{n} < 1$, a condition that was also analyzed in detail in Chakraborty (2019). In addition, a simulation study indicates that the probability of $C V_{n} < 1$ is not negligible. For example, when $(α, β) = (3, 1)$ and $n = 20$, the empirical probability of $C V_{n} < 1$ is as high as 0.25. In such cases, the MLE method is not applicable.
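The probability of $C V_{n} < 1$ can be approximated by simulation. The sketch below is our own illustration rather than the authors' code: it assumes that the sample variance uses the $n - 1$ divisor, and the number of replications and the seed are arbitrary.

```python
import math
import random

def sample_cv(xs):
    """Sample coefficient of variation: standard deviation divided by mean.
    Assumption: the variance uses the n - 1 divisor."""
    n = len(xs)
    mean = sum(xs) / n
    var = sum((x - mean) ** 2 for x in xs) / (n - 1)
    return math.sqrt(var) / mean

def prob_cv_below_one(alpha, beta, n, reps, seed=0):
    """Monte Carlo estimate of P(CV_n < 1) for Lomax(alpha, beta) samples."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(reps):
        # Inverse-CDF draws from the Lomax distribution.
        xs = [beta * ((1.0 - rng.random()) ** (-1.0 / alpha) - 1.0) for _ in range(n)]
        if sample_cv(xs) < 1.0:
            hits += 1
    return hits / reps

# Setting quoted in the text: (alpha, beta) = (3, 1) and n = 20.
p_hat = prob_cv_below_one(alpha=3.0, beta=1.0, n=20, reps=5000)
```

The value of `p_hat` depends on the seed and on the variance convention, so it should be compared with the figure quoted above only approximately.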

Thus, it is natural to consider Bayesian estimation for the parameters $(α, β)$ in model (1). In the Bayesian paradigm, the specification of a prior distribution is one of the most important problems. We first consider the following vague prior for $θ = (α, β)$: $π_{G} (θ) \propto \frac{1}{β} α^{τ - 1} e^{- α},$ (2) where $τ > 0$ is a hyperparameter. That is to say, the 'marginal' prior for α is a gamma distribution. It is shown that the posterior distribution $π_{G} (θ ∣ X)$ is proper for any $n \geq 1$ (see the Appendix for the proof), where $X = (X_{1}, X_{2}, \dots, X_{n})$ is the sample. To assess the sensitivity of the corresponding Bayesian estimation with respect to the hyperparameter τ, we conduct a sensitivity analysis here.

The values of $(α, β)$ are set as $(2, 1.5)$, and τ is set as 1, 5 and 9, respectively. The empirical square root of the mean squared error ($\sqrt{M S E}$) and the coverage probability (CP) of the Bayesian estimation with respect to the sample size n are shown in Figure 1. It can be observed that both the MSE and the CP under the prior $π_{G}$ are very sensitive to the choice of the hyperparameter τ, which makes it difficult to specify $π_{G}$ in applications. Of course, if one is interested in making an optimal decision based on one's own beliefs, such a (subjective) prior could be appropriate.

Figure 1. Square root of the mean squared error and coverage probability of the Bayesian estimation of α based on the gamma priors $π_{G}$ with $τ = 1$ (circle), $τ = 5$ (cross), and $τ = 9$ (diamond). Panel (a) is for the square root of mean squared error, and panel (b) is for the coverage probability.

Based on the above motivation, we propose an objective Bayesian analysis for the Lomax model in this paper, particularly when there is little prior information on the parameters. As we know, one of the most appealing features of the objective Bayesian analysis is to use noninformative priors. In the work of Ferreira et al. (2016, 2020), Jeffreys prior and independent Jeffreys prior (i.e., one of the reference priors) were considered. In this paper, we propose a more systematic and deeper analysis based on an extensive class of objective priors including probability matching priors, the maximal data information (MDI) prior, Jeffreys prior and reference priors.

The remainder of this paper is organized as follows. In Section 2, noninformative priors, including probability matching priors, the MDI prior, Jeffreys prior and reference priors are derived. Moreover, the posterior propriety under each prior is validated. In Section 3, a simulation study is conducted to evaluate the frequentist properties of Bayesian estimates based on the noninformative priors. In Section 4, the proposed Bayesian approach is applied to analyze a real data set. Some concluding remarks are given in Section 5.

2. Noninformative priors and their properties

In this section, we derive some important noninformative priors for the parameters $(α, β)$ , which contain probability matching priors, the MDI prior, Jeffreys prior and reference priors.

2.1. Probability matching priors

The rationale behind a probability matching prior is that a noninformative prior should provide inferences that are similar to those obtained from a frequentist perspective, such as in terms of credible versus confidence intervals. In this perspective, a probability matching prior is a prior such that the posterior coverage probability of a Bayesian credible interval matches the corresponding frequentist coverage probability (Consonni et al., 2018).

Given a prior $π (\cdot)$ for the parameters $(ϕ, φ)$, suppose that ϕ is the parameter of interest, and $ϕ^{(1 - γ)} (π (\cdot), X)$ is the $(1 - γ)$-th percentile of the marginal posterior distribution of ϕ. Then, $π (\cdot)$ is called a second-order probability matching prior if $P (ϕ \leq ϕ^{(1 - γ)} (π (\cdot), X)) = 1 - γ + o (n^{- 1})$ holds for all $γ \in (0, 1)$; see Datta and Mukerjee (2004) for more details.

For the parameters α and β in the Lomax model (1), we have the following theorem.

Theorem 2.1

(a)	When α is the parameter of interest and β is the nuisance parameter, the second-order probability matching prior has the form $π_{M_{1}} (θ) \propto F_{1} (α) \cdot G_{1} (β),$ (3) where $F_{1} (α) \propto \exp \left\{ - \int \frac{(c_{1} + 3) α + 2 c_{1} + 3}{α (α + 1)} d α \right\}$, $G_{1} (β) \propto β^{c_{1}}$, and $c_{1}$ is an arbitrary constant.
(b)	When β is the parameter of interest and α is the nuisance parameter, the second-order probability matching prior is given by $π_{M_{2}} (θ) \propto F_{2} (α) \cdot G_{2} (β),$ (4) where $F_{2} (α) \propto \exp \left\{ - \int \frac{(c_{2} + 3) α^{2} + 3 (c_{2} + 2) α + 2 c_{2} + 2}{α^{2} (α + 2)} d α \right\}$, $G_{2} (β) \propto β^{c_{2}}$, and $c_{2}$ is an arbitrary constant.
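
As an illustration of part (a), consider the special case $c_{1} = - 1$ (a worked example of our own, not part of the original statement). Then $G_{1} (β) \propto β^{- 1}$, and the integrand in $F_{1}$ simplifies by partial fractions: $\begin{aligned} \frac{(c_{1} + 3) α + 2 c_{1} + 3}{α (α + 1)} \Big|_{c_{1} = - 1} & = \frac{2 α + 1}{α (α + 1)} = \frac{1}{α} + \frac{1}{α + 1}, \\ F_{1} (α) & \propto \exp \left\{ - \log α - \log (α + 1) \right\} = \frac{1}{α (α + 1)} . \end{aligned}$ Hence $π_{M_{1}} (θ) \propto \frac{1}{β α (α + 1)}$ for this choice of $c_{1}$, which has the same form as the reference prior (8) in Theorem 2.4.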

2.2. The MDI prior

Lindley (1956) applied the Shannon entropy to develop an information theoretic analysis of the structure of Bayesian modelling. This prompted work on defining the least informative prior distribution based on some definitions of the amount of information. Zellner (1977) proposed an important noninformative prior, which is called the MDI prior. Zellner (1977) proved that using this prior could emphasize the information in the likelihood function. Therefore, the information in the prior is weak compared with that in the data (Ramos et al., 2018).

For the Lomax model (1), we have the following result, the proof of which is deferred to the Appendix.

Theorem 2.2

(a)	The MDI prior for the parameters $θ = (α, β)$ is given by $π_{M} (θ) \propto \frac{α}{β e^{\frac{1}{α}}} .$ (5)
(b)	For any $n \geq 1$ , the posterior distribution under $π_{M} (θ)$ is improper.
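
The closed form behind (5), namely $E [\log f (X)] = \log (α / β) - (α + 1) / α$ from the Appendix, can be checked numerically. The following sketch compares it with a Monte Carlo average; the function names, the number of draws and the seed are our own assumptions.

```python
import math
import random

def mdi_log_kernel(alpha, beta):
    """Closed form H(alpha, beta) = log(alpha/beta) - (alpha + 1)/alpha."""
    return math.log(alpha / beta) - (alpha + 1.0) / alpha

def mdi_log_kernel_mc(alpha, beta, draws=200_000, seed=1):
    """Monte Carlo estimate of E[log f(X)] for X ~ Lomax(alpha, beta)."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(draws):
        # Inverse-CDF draw, then the log-density at that draw.
        x = beta * ((1.0 - rng.random()) ** (-1.0 / alpha) - 1.0)
        total += math.log(alpha / beta) - (alpha + 1.0) * math.log1p(x / beta)
    return total / draws

def mdi_prior_unnormalized(alpha, beta):
    """Prior (5) up to a constant: alpha / (beta * exp(1/alpha))."""
    return alpha / (beta * math.exp(1.0 / alpha))
```

Since $\exp \{H (α, β)\} = e^{- 1} \cdot α / (β e^{1 / α})$, the two kernels differ only by the constant factor $e^{- 1}$, which is immaterial for an improper prior.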

2.3. Jeffreys prior

Jeffreys prior is probably the most popular noninformative prior among practitioners. According to Jeffreys (1961), Jeffreys prior is proportional to the square root of the determinant of the Fisher information matrix. Besides being parametrization invariant, Jeffreys prior enjoys many optimality properties in the absence of nuisance parameters. It maximizes the asymptotic divergence between the prior and the posterior under several different metrics. However, Jeffreys prior also has some potential drawbacks. In particular, in the multidimensional case, its use may lead to incoherence and paradoxes. See Consonni et al. (2018) for more discussion.

For the Lomax model (1), Jeffreys prior for the parameters $θ = (α, β)$ has the following form: $π_{J} (θ) \propto \frac{1}{β (α + 1) \sqrt{α (α + 2)}} .$ (6) It was shown in Ferreira et al. (2020) that, for any $n \geq 1$, the posterior distribution under $π_{J} (θ)$ is proper.
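
Formula (6) can be cross-checked against the matrix $S$ given in the proof of Theorem 2.4 in the Appendix, which is the inverse of the Fisher information matrix up to a constant. Since $\det I = 1 / \det S$ for a $2 \times 2$ matrix, the square root of $1 / \det S$ should reproduce (6). The sketch below is our own consistency check, with hypothetical function names.

```python
import math

def inverse_fisher(alpha, beta):
    """Entries of S from the Appendix (inverse Fisher information, up to a constant)."""
    s11 = alpha**2 * (alpha + 1.0) ** 2
    s12 = alpha * beta * (alpha + 1.0) * (alpha + 2.0)
    s22 = beta**2 * (alpha + 1.0) ** 2 * (alpha + 2.0) / alpha
    return s11, s12, s22

def jeffreys_from_s(alpha, beta):
    """sqrt(det I) computed as 1 / sqrt(det S)."""
    s11, s12, s22 = inverse_fisher(alpha, beta)
    return 1.0 / math.sqrt(s11 * s22 - s12**2)

def jeffreys_closed_form(alpha, beta):
    """Prior (6) up to a constant."""
    return 1.0 / (beta * (alpha + 1.0) * math.sqrt(alpha * (alpha + 2.0)))
```

Algebraically, $\det S = α β^{2} (α + 1)^{2} (α + 2)$, so the two functions agree up to floating-point error.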

By Theorem 2.1, we have the following theorem.

Theorem 2.3

Regardless of whether α or β is the parameter of interest, $π_{J} (θ)$ is not a second-order probability matching prior.

2.4. Reference priors

Reference analysis uses information-theoretical concepts to precisely define the objective prior, which should be maximally dominated by the data, in the sense of maximizing the missing information on the parameters (Berger et al., 2009). The original formulation of reference priors, introduced in Bernardo (1979), was largely informal. Berger and Bernardo (1992) gave more precise definitions of the sequential reference process in continuous multiparameter problems. In addition, a rigorous general definition of reference priors was formally given in Berger et al. (2009) for one block of parameters.

As we know, reference priors separate the parameters into different ordering groups of interest. For the ordering group $\{β, α\}$, it was shown in Ferreira et al. (2020) that the reference prior is $π_{R_{1}} (θ) \propto \frac{1}{α β} .$ (7) Furthermore, for any $n \geq 1$, the posterior distribution under the reference prior $π_{R_{1}} (θ)$ is improper.

For the ordering group $\{α, β\}$, we have the following theorem.

Theorem 2.4

(a)	The reference prior under the ordering group $\{α, β\}$ is given by $π_{R_{2}} (θ) \propto \frac{1}{β α (α + 1)} .$ (8)
(b)	For n = 1, the posterior distribution under the reference prior $π_{R_{2}} (θ)$ is improper; while for $n \geq 2$ , the posterior under $π_{R_{2}} (θ)$ is proper.
(c)	The prior $π_{R_{2}} (θ)$ is a second-order probability matching prior while $π_{R_{1}} (θ)$ is not.

The proof of Theorem 2.4 is also deferred to the Appendix. It follows from Theorems 2.2–2.4 that only Jeffreys prior $π_{J}$ and the reference prior $π_{R_{2}}$ enable posterior inferences. However, $π_{R_{2}}$ is a second-order probability matching prior while $π_{J}$ is not. In this sense, $π_{R_{2}}$ is recommended for potential users. This recommendation is also supported by the following numerical studies.

3. Simulation study

To evaluate the frequentist performance of the Bayesian estimation based on $π_{J}$ and $π_{R_{2}}$, we simulate data from the Lomax model (1) with different true values of the parameters α and β and different sample sizes n. Then, posterior samples are drawn from the joint posterior distribution of α and β by using the random-walk Metropolis algorithm in Roberts et al. (1997). For each chain, 50000 draws are generated after 5000 burn-in samples. By thinning the chain, keeping every 10th draw, a final chain of 5000 values is obtained. In order to make the estimation more robust, we take the posterior median as the Bayesian estimator for each parameter. The process is replicated 5000 times. Thus, we can obtain estimated mean squared errors and coverage probabilities of credible intervals (CIs).

The empirical results of the MSE and CP for the 95% CIs are listed in Table 1, where the estimated probabilities that the sample coefficient of variation $C V_{n}$ is less than 1 are also reported. From Table 1, the following observations can be made.
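
A minimal sketch of such a sampler for the posterior under $π_{R_{2}}$ is given below. It is not the authors' implementation: proposing on the scale $(\log α, \log β)$ with the corresponding Jacobian term, the Gaussian step size, the starting point and the function names are all our own assumptions; only the burn-in, thinning and posterior-median choices follow the description above.

```python
import math
import random

def log_posterior_r2(alpha, beta, xs):
    """Log of prior (8) times the Lomax likelihood, up to an additive constant."""
    n = len(xs)
    log_prior = -math.log(beta) - math.log(alpha) - math.log(alpha + 1.0)
    log_lik = n * math.log(alpha / beta) - (alpha + 1.0) * sum(
        math.log1p(x / beta) for x in xs
    )
    return log_prior + log_lik

def rw_metropolis(xs, n_keep=5000, burn_in=5000, thin=10, step=0.3, seed=0):
    """Random-walk Metropolis on (log alpha, log beta); returns thinned draws."""
    rng = random.Random(seed)
    la, lb = 0.0, 0.0  # start at alpha = beta = 1 (arbitrary choice)
    # The target on the log scale includes the Jacobian term la + lb.
    cur = log_posterior_r2(math.exp(la), math.exp(lb), xs) + la + lb
    draws = []
    for it in range(burn_in + n_keep * thin):
        pa = la + rng.gauss(0.0, step)
        pb = lb + rng.gauss(0.0, step)
        prop = log_posterior_r2(math.exp(pa), math.exp(pb), xs) + pa + pb
        # Accept with probability min(1, exp(prop - cur)).
        if math.log(1.0 - rng.random()) < prop - cur:
            la, lb, cur = pa, pb, prop
        if it >= burn_in and (it - burn_in) % thin == 0:
            draws.append((math.exp(la), math.exp(lb)))
    return draws

def posterior_median(values):
    """Median of a list of posterior draws."""
    s = sorted(values)
    m = len(s) // 2
    return s[m] if len(s) % 2 else 0.5 * (s[m - 1] + s[m])
```

With the default arguments the chain has 5000 burn-in iterations followed by 50000 iterations thinned by 10, which gives 5000 retained draws. In practice the step size would need tuning to the data at hand.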

As is expected, the MSEs of the Bayesian estimators decrease as the sample size increases. Meanwhile, the CPs of the 95% CIs approach the nominal level of 0.95.
The larger the value of α is, the higher the probability $P (C V_{n} < 1)$ . For each parameter, the larger the true value is, the larger the corresponding MSE.
According to both the MSE and CP, the performance of the Bayesian estimators under the reference prior $π_{R_{2}}$ is much better than that under Jeffreys prior $π_{J}$ . In fact, this is because $π_{R_{2}}$ is a second-order probability matching prior while $π_{J}$ is not.

Table 1. Empirical MSEs and CPs (within parentheses) of Bayesian estimators based on the priors $π_{J}$ and $π_{R_{2}}$ .

4. Real data analysis

Now we apply the proposed Bayesian approach to analyze a sample of computer file sizes (in bytes) for 269 files with the *.ini extension on a Windows-based personal computer. The data are available on the website http://web.uvic.ca/~dgiles/downloads/data. The data were also analyzed by Holland et al. (2006) and Ferreira et al. (2016), where the Lomax distribution was shown to fit these data appropriately.

For comparison purposes, the parametric bootstrap and the Bayesian approaches based on the Jeffreys prior $π_{J}$ and the reference prior $π_{R_{2}}$ are included here. The parametric bootstrap is based on the MLE, which can be used here because the sample coefficient of variation $C V_{n}$ is greater than 1. The estimators along with the corresponding standard deviation (SD) and 95% confidence/credible interval (CI) for α and β are listed in Table 2. It can be seen from Table 2 that the Bayesian estimates of β are much more accurate than those of the parametric bootstrap according to the SD and the width of the CI. In addition, the performances of the two Bayesian estimates are close to each other, although the prior $π_{R_{2}}$ behaves slightly better than $π_{J}$. Finally, we note that our results are close to those in Ferreira et al. (2016) with respect to Jeffreys prior.

Table 2. Summary of the parametric bootstrap and the Bayesian estimates.

5. Concluding remarks

In this paper, objective Bayesian methods are developed to make inferences on the parameters of a Lomax distribution. Compared with the existing literature, our contribution lies in the following points. First, we consider a larger class of noninformative priors, which includes probability matching priors, the MDI prior and both reference priors. Second, it is revealed that one of the reference priors is a second-order probability matching prior while Jeffreys prior is not. Third, we clarify that the MLE does not exist if the sample coefficient of variation $C V_{n} < 1$, and also consider the probability of this phenomenon in the simulation study. As a result, it is feasible to use objective Bayesian analysis for the Lomax distribution in practice.

Acknowledgments

The authors would like to thank the Editor, the Associate Editor and the anonymous Reviewers for their valuable comments and suggestions on earlier versions of this paper.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Additional information

Funding

The work is supported by the National Social Science Foundation of China (Grant No. 21BTJ034).

References

Atkinson, A. B., & Harrison, A. J. (1978). Distribution of personal wealth in Britain. Cambridge University Press.
Bain, L. J., & Engelhardt, M. (1992). Introduction to probability and mathematical statistics. PWS-KENT Publishing Company.
Berger, J. O., & Bernardo, J. M. (1992). Ordered group reference priors with application to the multinomial problem. Biometrika, 79(1), 25–37. https://doi.org/10.1093/biomet/79.1.25
Berger, J. O., Bernardo, J. M., & Sun, D. C. (2009). The formal definition of reference priors. The Annals of Statistics, 37(2), 905–938. https://doi.org/10.1214/07-AOS587
Bernardo, J. M. (1979). Reference posterior distributions for Bayesian inference (with discussion). Journal of the Royal Statistical Society: Series B (Methodological), 41(2), 113–147. https://doi.org/10.1111/j.2517-6161.1979.tb01066.x
Chakraborty, T. (2019). An analysis of the maximum likelihood estimates for the Lomax distribution. arXiv:1911.12612v2, 1–14.
Consonni, G., Fouskakis, D., Liseo, B., & Ntzoufras, I. (2018). Prior distributions for objective Bayesian analysis. Bayesian Analysis, 13(2), 627–679. https://doi.org/10.1214/18-BA1103
Datta, G. S., & Mukerjee, R. (2004). Probability matching priors: Higher order asymptotics. Lecture Notes in Statistics. Springer.
Deville, Y. (2016). Renext: Renewal method for extreme values extrapolation (p. 25). https://cran.r-project.org/web/packages/Renext/Renext.pdf
Ferreira, P. H., Gonzales, J. F. B., Tomazella, V. L. D., Ehlers, R. S., Louzada, F., & Silva, E. B. (2016). Objective Bayesian analysis for the Lomax distribution. arXiv:1602.08450v1, 1–19.
Ferreira, P. H., Ramos, E., Ramos, P. L., Gonzales, J. F. B., Tomazella, V. L. D., Ehlers, R. S., Silva, E. B., & Louzada, F. (2020). Objective Bayesian analysis for the Lomax distribution. Statistics & Probability Letters, 159, Article 108677. https://doi.org/10.1016/j.spl.2019.108677
Holland, O., Golaup, A., & Aghvami, A. H. (2006). Traffic characteristics of aggregated module downloads for mobile terminal reconfiguration. IEEE Proceedings: Communications, 153(5), 683–690. https://doi.org/10.1049/ip-com:20045155
Jeffreys, H. (1961). Theory of probability (3rd ed.). Oxford University Press.
Kang, S. G., Lee, W. D., & Kim, Y. (2021). Posterior propriety of bivariate Lomax distribution under objective priors. Communications in Statistics – Theory and Methods, 50(9), 2201–2209. https://doi.org/10.1080/03610926.2019.1662049
Lemonte, A. J., & Cordeiro, G. M. (2013). An extended Lomax distribution. Statistics, 47(4), 800–816. https://doi.org/10.1080/02331888.2011.568119
Lindley, D. V. (1956). On a measure of the information provided by an experiment. The Annals of Mathematical Statistics, 27(4), 986–1005. https://doi.org/10.1214/aoms/1177728069
Lomax, K. (1954). Business failures: Another example of the analysis of failure data. Journal of the American Statistical Association, 49(268), 847–852. https://doi.org/10.1080/01621459.1954.10501239
Marshall, A. W., & Olkin, I. (2007). Life distributions: Structure of nonparametric, semiparametric, and parametric families. Springer.
Nadarajah, S. (2005). Sums, products, and ratios for the bivariate Lomax distribution. Computational Statistics & Data Analysis, 49(1), 109–129. https://doi.org/10.1016/j.csda.2004.05.003
Nayak, T. K. (1987). Multivariate Lomax distribution: Properties and usefulness in reliability theory. Journal of Applied Probability, 24(1), 170–177. https://doi.org/10.2307/3214068
Peers, H. W. (1965). On confidence sets and Bayesian probability points in the case of several parameters. Journal of the Royal Statistical Society: Series B (Methodological), 27(1), 9–16. https://doi.org/10.1111/j.2517-6161.1965.tb00581.x
Ramos, P. L., Louzada, F., & Ramos, E. (2018). Posterior properties of the Nakagami-m distribution using non-informative priors and applications in reliability. IEEE Transactions on Reliability, 67(1), 105–117. https://doi.org/10.1109/TR.24
Roberts, G. O., Gelman, A., & Gilks, W. R. (1997). Weak convergence and optimal scaling of random walk Metropolis algorithms. Annals of Applied Probability, 7(1), 110–120. http://doi.org/10.1214/aoap/1034625254
Roy, D., & Gupta, R. P. (1996). Bivariate extension of Lomax and finite range distributions through characterization approach. Journal of Multivariate Analysis, 59(1), 22–33. https://doi.org/10.1006/jmva.1996.0052
Zellner, A. (1977). Maximal data information prior distributions. In A. Aykac & C. Brumat (Eds.), New developments in the applications of Bayesian methods (pp. 211–232). North-Holland.

Appendix

Proof of the posterior propriety of $π_{G}$

The joint posterior density of $(α, β)$ based on the prior $π_{G}$ is given by $\begin{aligned} π_{G} (α, β ∣ x_{1}, \dots, x_{n}) & \propto π_{G} (θ) \cdot L (x_{1}, \dots, x_{n} ∣ α, β) \\ & = β^{- 1} α^{τ - 1} e^{- α} {(\frac{α}{β})}^{n} \prod_{i = 1}^{n} {(1 + \frac{x_{i}}{β})}^{- (α + 1)} . \end{aligned}$ Denote $x_{m} = \min \{x_{1}, x_{2}, \dots, x_{n}\}$. Then we have $\begin{aligned} \int_{0}^{+ \infty} \int_{0}^{+ \infty} π_{G} (α, β ∣ x_{1}, \dots, x_{n}) d β d α & \leq \int_{0}^{+ \infty} α^{n + τ - 1} e^{- α} \int_{0}^{+ \infty} β^{- (n + 1)} {(1 + \frac{x_{m}}{β})}^{- n (α + 1)} d β d α \\ & \propto \int_{0}^{+ \infty} α^{n + τ - 1} e^{- α} B (n α, n) d α \\ & \propto \int_{0}^{+ \infty} α^{n + τ - 1} e^{- α} \frac{1}{n α (n α + 1) \cdots (n α + n - 1)} d α, \end{aligned}$ where $B (a, b)$ is the Beta function. Let $g_{1} (α) = \frac{α^{n + τ - 1} e^{- α}}{n α (n α + 1) \cdots (n α + n - 1)} .$ Then, it can be seen that $\begin{aligned} g_{1} (α) & = O (α^{n + τ - 2}), \quad α \to 0, \\ g_{1} (α) & = O (α^{τ - 1} e^{- α}), \quad α \to + \infty . \end{aligned}$ Thus, $\int_{0}^{+ \infty} g_{1} (α) d α < \infty$ for any $n \geq 1$. Consequently, the posterior distribution under $π_{G}$ is proper.
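
The step from the inner integral over β to the Beta function is used repeatedly in this Appendix without being written out. A short derivation (our own reconstruction, using the substitution $t = β / x_{m}$) reads $\begin{aligned} \int_{0}^{+ \infty} β^{- (n + 1)} {(1 + \frac{x_{m}}{β})}^{- n (α + 1)} d β & = x_{m}^{- n} \int_{0}^{+ \infty} t^{- (n + 1)} {(1 + \frac{1}{t})}^{- n (α + 1)} d t \\ & = x_{m}^{- n} \int_{0}^{+ \infty} t^{n α - 1} (1 + t)^{- (n α + n)} d t \\ & = x_{m}^{- n} B (n α, n) . \end{aligned}$ Since n is a positive integer, $B (n α, n) = \frac{Γ (n α) Γ (n)}{Γ (n α + n)} = \frac{(n - 1) !}{n α (n α + 1) \cdots (n α + n - 1)}$, which gives the product in the denominator up to the constant $(n - 1) !$.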

Proof of Theorem 2.1

According to Peers (1965), the second-order probability matching prior $π_{M_{1}} (θ)$ satisfies the following partial differential equation: $\frac{\partial}{\partial α} \left\{ α (α + 1) π_{M_{1}} (α, β) \right\} + \frac{\partial}{\partial β} \left\{ β (α + 2) π_{M_{1}} (α, β) \right\} = 0,$ the solution of which is given by formula (3). Similarly, the second-order probability matching prior $π_{M_{2}} (θ)$ is such that $\frac{\partial}{\partial α} \left\{ α^{\frac{3}{2}} (α + 2)^{\frac{1}{2}} π_{M_{2}} (α, β) \right\} + \frac{\partial}{\partial β} \left\{ α^{- \frac{1}{2}} β (α + 1) (α + 2)^{\frac{1}{2}} π_{M_{2}} (α, β) \right\} = 0,$ and the solution of this equation is given by formula (4).
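
The first equation above, for the case where α is the parameter of interest, can be checked numerically for the priors (6), (7) and (8). The sketch below is our own illustration: it evaluates the left-hand side by central differences, and the step size and function names are assumptions.

```python
import math

def prior_r2(a, b):
    """Reference prior (8), up to a constant."""
    return 1.0 / (b * a * (a + 1.0))

def prior_r1(a, b):
    """Reference prior (7), up to a constant."""
    return 1.0 / (a * b)

def prior_jeffreys(a, b):
    """Jeffreys prior (6), up to a constant."""
    return 1.0 / (b * (a + 1.0) * math.sqrt(a * (a + 2.0)))

def matching_residual(prior, a, b, h=1e-5):
    """Central-difference value of the left-hand side of the first equation."""
    f = lambda u, v: u * (u + 1.0) * prior(u, v)  # term differentiated in alpha
    g = lambda u, v: v * (u + 2.0) * prior(u, v)  # term differentiated in beta
    d_alpha = (f(a + h, b) - f(a - h, b)) / (2.0 * h)
    d_beta = (g(a, b + h) - g(a, b - h)) / (2.0 * h)
    return d_alpha + d_beta
```

At any point $(α, β)$ the residual is numerically zero for $π_{R_{2}}$, whereas it equals $1 / β$ for $π_{R_{1}}$ and $1 / (β \sqrt{α} (α + 2)^{3 / 2})$ for $π_{J}$, in line with Theorems 2.3 and 2.4(c) for the case where α is of interest.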

Proof of Theorem 2.2

(a) In the light of Zellner (1977), the MDI prior for $θ = (α, β)$ has the following form: $π_{M} (α, β) \propto \exp \{H (α, β)\},$ where $H (α, β) = E [\log f (X)]$, and $f (x)$ is the density function of the Lomax distribution. Note that $H (α, β) = \int_{0}^{+ \infty} \log \left\{ \frac{α}{β} {(1 + \frac{x}{β})}^{- (α + 1)} \right\} \cdot \frac{α}{β} {(1 + \frac{x}{β})}^{- (α + 1)} d x = \log (\frac{α}{β}) - \frac{α + 1}{α} .$ It follows that the MDI prior is $π_{M} (θ) \propto \exp \left\{ \log (\frac{α}{β}) - \frac{α + 1}{α} \right\} \propto \frac{α}{β e^{\frac{1}{α}}} .$

(b) The joint posterior density of $(α, β)$ based on $π_{M}$ is $\begin{aligned} π_{M} (α, β ∣ x_{1}, \dots, x_{n}) & \propto π_{M} (θ) \cdot L (x_{1}, \dots, x_{n} ∣ α, β) \\ & = \frac{α}{β e^{\frac{1}{α}}} {(\frac{α}{β})}^{n} \prod_{i = 1}^{n} {(1 + \frac{x_{i}}{β})}^{- (α + 1)} . \end{aligned}$ Denote $x_{M} = \max \{x_{1}, x_{2}, \dots, x_{n}\}$. Then, we have $\begin{aligned} \int_{0}^{+ \infty} \int_{0}^{+ \infty} π_{M} (α, β ∣ x_{1}, \dots, x_{n}) d β d α & \geq \int_{0}^{+ \infty} α^{n + 1} e^{- \frac{1}{α}} \int_{0}^{+ \infty} β^{- (n + 1)} {(1 + \frac{x_{M}}{β})}^{- n (α + 1)} d β d α \\ & \propto \int_{0}^{+ \infty} α^{n + 1} e^{- \frac{1}{α}} B (n α, n) d α \\ & \propto \int_{0}^{+ \infty} α^{n + 1} e^{- \frac{1}{α}} \frac{1}{n α (n α + 1) \cdots (n α + n - 1)} d α . \end{aligned}$ Let $g_{2} (α) = α^{n + 1} e^{- \frac{1}{α}} \frac{1}{n α (n α + 1) \cdots (n α + n - 1)} .$ Then, it can be seen that $g_{2} (α) \sim \frac{α}{n^{n}} \to + \infty$ as $α \to + \infty$. Thus, $\int_{0}^{+ \infty} g_{2} (α) d α = \infty$ for any $n \geq 1$. Consequently, the posterior distribution $π_{M} (α, β ∣ x_{1}, \dots, x_{n})$ is improper.

Proof of Theorem 2.4

(a) Let S be the inverse of the Fisher information matrix I. Then, up to a constant, $S = \begin{pmatrix} α^{2} (α + 1)^{2} & α β (α + 1) (α + 2) \\ α β (α + 1) (α + 2) & β^{2} α^{- 1} (α + 1)^{2} (α + 2) \end{pmatrix} .$ Following the notation in Bernardo (1979), it holds that $h_{1} = \frac{1}{α^{2} (α + 1)^{2}}, \quad h_{2} = \frac{α}{β^{2} (α + 2)} .$ Now we select a sequence of compact sets $Ω_{l} = [c_{1 l}, d_{1 l}] \times [c_{2 l}, d_{2 l}]$ for $(α, β)$, $l = 1, 2, \dots$, such that $c_{1 l}, c_{2 l} \to 0$ and $d_{1 l}, d_{2 l} \to + \infty$ as $l \to \infty$. Then, $π_{2}^{l} (β ∣ α) = \frac{| h_{2} |^{1 / 2} 1_{[c_{2 l}, d_{2 l}]} (β)}{\int_{c_{2 l}}^{d_{2 l}} | h_{2} |^{1 / 2} d β} := \frac{k_{1}}{β} 1_{[c_{2 l}, d_{2 l}]} (β),$ where $1_{[a, b]} (\cdot)$ refers to the indicator function on the interval $[a, b]$, and $k_{1} = (\log d_{2 l} - \log c_{2 l})^{- 1}$ is a constant. Note that $h_{1}$ is independent of β. It follows that $E_{1}^{l} (\log | h_{1} | ∣ α) = \int_{c_{2 l}}^{d_{2 l}} \log | h_{1} | \cdot \frac{k_{1}}{β} d β = \log | h_{1} | .$ Subsequently, $\begin{aligned} π_{1}^{l} (α, β) & = \frac{π_{2}^{l} (β ∣ α) \cdot \exp \left\{ \frac{1}{2} E_{1}^{l} (\log | h_{1} | ∣ α) \right\} 1_{[c_{1 l}, d_{1 l}]} (α)}{\int_{c_{1 l}}^{d_{1 l}} \exp \left\{ \frac{1}{2} E_{1}^{l} (\log | h_{1} | ∣ α) \right\} d α} \\ & = \frac{π_{2}^{l} (β ∣ α) \cdot | h_{1} |^{\frac{1}{2}} \cdot 1_{[c_{1 l}, d_{1 l}]} (α)}{\int_{c_{1 l}}^{d_{1 l}} | h_{1} |^{\frac{1}{2}} d α} \\ & := \frac{k_{2}}{β α (α + 1)} 1_{[c_{1 l}, d_{1 l}]} (α) 1_{[c_{2 l}, d_{2 l}]} (β), \end{aligned}$ where $k_{2} = k_{1} \cdot (\log \frac{d_{1 l}}{d_{1 l} + 1} - \log \frac{c_{1 l}}{c_{1 l} + 1})^{- 1}$ is a constant.

Let $(α^{*}, β^{*})$ be an inner point of $Ω_{l}$. Then, the reference prior under the ordering group $\{α, β\}$ is given by $π_{R_{2}} (θ) = \lim_{l \to \infty} \frac{π_{1}^{l} (α, β)}{π_{1}^{l} (α^{*}, β^{*})} \propto \frac{1}{β α (α + 1)} .$

(b) Let $π_{R_{2}} (α, β ∣ x_{1}, \dots, x_{n})$ be the posterior density based on the prior $π_{R_{2}} (θ)$. Then, $π_{R_{2}} (α, β ∣ x_{1}, \dots, x_{n}) \propto \frac{1}{β α (α + 1)} {(\frac{α}{β})}^{n} \prod_{i = 1}^{n} {(1 + \frac{x_{i}}{β})}^{- (α + 1)} .$ (A1) When $n \geq 2$, we have $\begin{aligned} \int_{0}^{+ \infty} \int_{0}^{+ \infty} π_{R_{2}} (α, β ∣ x_{1}, \dots, x_{n}) d β d α & \propto \int_{0}^{+ \infty} \int_{0}^{+ \infty} \frac{1}{β α (α + 1)} {(\frac{α}{β})}^{n} \prod_{i = 1}^{n} {(1 + \frac{x_{i}}{β})}^{- (α + 1)} d β d α \\ & \leq \int_{0}^{+ \infty} \frac{α^{n - 1}}{α + 1} B (n α, n) d α \\ & \propto \int_{0}^{+ \infty} \frac{α^{n - 1}}{α + 1} \cdot \frac{1}{n α (n α + 1) \cdots (n α + n - 1)} d α . \end{aligned}$ Denote $g_{3} (α) = \frac{α^{n - 1}}{α + 1} \cdot \frac{1}{n α (n α + 1) \cdots (n α + n - 1)} .$ Then, $g_{3} (α) = O (α^{n - 2})$ as $α \to 0$, and $g_{3} (α) = O (α^{- 2})$ as $α \to + \infty$. It follows that $\int_{0}^{+ \infty} g_{3} (α) d α < \infty$ for $n \geq 2$, which implies that the posterior is proper for $n \geq 2$.

When $n = 1$, it follows from (A1) that $π_{R_{2}} (α, β ∣ x_{1}) \propto \frac{1}{β^{2} (α + 1)} {(1 + \frac{x_{1}}{β})}^{- (α + 1)} .$ Note that $\int_{0}^{+ \infty} \int_{0}^{+ \infty} \frac{1}{β^{2} (α + 1)} {(1 + \frac{x_{1}}{β})}^{- (α + 1)} d β d α \propto \int_{0}^{+ \infty} \frac{1}{α (α + 1)} d α = \infty .$ Thus, we have $\int_{0}^{+ \infty} \int_{0}^{+ \infty} π_{R_{2}} (α, β ∣ x_{1}) d β d α = \infty,$ which shows that $π_{R_{2}} (α, β ∣ x_{1})$ is improper for $n = 1$.

Bayesian analysis for the Lomax model using noninformative priors
