Full article: The generalized weighted Lindley distribution: Properties, estimation, and applications

Formulae display: $MathJax Logo$ ?Mathematical formulae have been encoded as MathML and are displayed in this HTML version using MathJax in order to improve their display. Uncheck the box to turn MathJax off. This feature requires Javascript. Click on a formula to zoom.

Abstract

In this study, a three-parameter lifetime distribution namely generalized weighted Lindley (GLW) distribution is proposed. The GLW distribution is an useful generalization of the weighted Lindley distribution which accommodates increasing, decreasing, decreasing-increasing-decreasing, bathtub, and unimodal hazard rate making it a flexible model for reliable data. A significant account of mathematical properties for this distribution is presented. Different estimation procedures are discussed such as maximum likelihood estimators, method of moments, ordinary and weighted least-squares, percentile, maximum product of spacings, and minimum distance estimators. The estimators are compared by extensive numerical simulations. Finally, two data-sets are analyzed for illustrative purposes proving that the GWL outperforms several other three-parameter lifetime distributions.

Keywords:

Public Interest Statement

We have proposed and presented a probability distribution called generalized weighted Lindley (WL) distribution. This distribution is an useful generalization of the WL distribution which accommodates increasing, decreasing, decreasing-increasing-decreasing, bathtub, and unimodal hazard rate. A significant account of mathematical properties for this distribution was presented. Different estimation procedures were proposed and compared by extensive numerical simulations. We believe that new distribution will allow the users to describe different data-sets obtaining a better predictive performance in comparison with other usual distributions.

1. Introduction

In recent years, several new extensions of the exponential distribution have been introduced in the literature for describing real problems. Ghitany, Atieh, and Nadarajah (Citation2008) investigated different properties of the Lindley distribution and outlined that in many cases the Lindley distribution outperforms exponential distribution. Since then, many generalizations of the Lindley distribution have been introduced such as generalized Lindley (Zakerzadeh & Dolati, Citation2009), extended Lindley (Bakouch, Al-Zahrani, Al-Shomrani, Marchi, & Louzada, Citation2012), exponential Poisson Lindley (Barreto-Souza & Bakouch, Citation2013), and Power Lindley (Ghitany, Al-Mutairi, Balakrishnan, & Al-Enezi, Citation2013) distribution.

Ghitany, Alqallaf, Al-Mutairi, and Husain (Citation2011) introduced a new class of weighted Lindley (WL) distribution adding more flexibility to the Lindley distribution. Let T be a random variable with a WL distribution. Then probability density function (p.d.f) is given by(1) $\begin{matrix} f (t | λ, ϕ) = \frac{λ^{ϕ + 1}}{(λ + ϕ) Γ (ϕ)} t^{ϕ - 1} (1 + t) e^{- λ t}, \end{matrix}$ (1)

for all $t > 0$ , $ϕ > 0$ and $λ > 0$ and $Γ (ϕ) = \int_{0}^{\infty} e^{- x} x^{ϕ - 1} d x$ is the gamma function. One of its peculiarities is that the hazard function can have an increasing $(ϕ \geq 1)$ or bathtub $(0 < ϕ < 1)$ shape. Different properties and estimation methods for this model were presented by Mazucheli, Louzada, and Ghitany (Citation2013), Ali (Citation2015), Wang and Wang (Citationin press), Al-Mutairi, Ghitany, and Kundu (Citation2015).

In this study, a new lifetime distribution family is proposed which is a direct generalization of the WL distribution. The p.d.f is given by(2) $\begin{matrix} f (t | ϕ, λ, α) = \frac{α λ^{α ϕ}}{(λ + ϕ) Γ (ϕ)} t^{α ϕ - 1} (λ + {(λ t)}^{α}) e^{- {(λ t)}^{α}}, \end{matrix}$ (2)

for all $t > 0$ , $ϕ > 0, λ > 0$ and $α > 0$ . Important probability distributions can be obtained from the GWL distribution as the WL distribution ( $α = 1$ ) , Power Lindley distribution ( $ϕ = 1$ ) and the Lindley distribution ( $ϕ = 1$ and $α = 1$ ). Due to this relationship, such model could also be named as weighted power Lindley or generalized power Lindley distribution.

Torabi, Falahati-Naeini, and Montazeri (Citation2014) discussed a class of distribution with four parameters which is a generalization of the proposed model. Such distribution includes the generalized WL, generalized gamma (GG) distribution, gamma and Weibull, among others. The main difference of this study lies in the fact that the proposed three-parameter distribution has a simple structure with less computational issues. In this way, the behavior of the p.d.f and the hazard function can be studied. This model has different forms of hazard function such as increasing, decreasing, bathtub, unimodal, or decreasing-increasing-decreasing shape making the GWL distribution a flexible model for reliable data. Moreover, a significant account of mathematical properties for the new distribution is provided.

The inferential procedures for the parameters of GLW distribution are presented considering different methods such as maximum likelihood estimators (MLE), methods of moments (ME), ordinary least-squares estimation (OLSE), weighted least-squares estimation (WLSE), maximum product of spacings (MPS), Cramer-von Mises type minimum distance (CME), Anderson–Darling (ADE) and right-tail Anderson–Darling (RADE). The performance of these estimation procedures are compared using extensive numerical simulations. Finally, two data-sets are analyzed for illustrative purposes proving that the GWL outperforms several usual three-parameter lifetime distributions such as the GG distribution (Stacy, Citation1962), the generalized Weibull (GW) distribution (Mudholkar, Srivastava, & Kollia, Citation1996), the generalized exponential-Poisson (GEP) distribution (Barreto-Souza & Cribari-Neto, Citation2009), and the exponentiated Weibull (EW) distribution (Mudholkar, Srivastava, & Freimer, Citation1995).

The results of this paper are organized as follows. Section 2 provides a significant account of mathematical properties for the new distribution. Section 3 presents the eight estimation methods which are considered. In the Section 4, a simulation study is presented in order to identify the most efficient procedure. Section 5 illustrates the proposed methodology in two real data-sets. Section 6 summarizes the present work.

2. Generalized weighted Lindley distribution

The generalized WL distribution (2) can be expressed as a two-component mixture $\begin{matrix} f (t | ϕ, λ, α) = p f_{1} (t | ϕ, λ, α) + (1 - p) f_{2} (t | ϕ, λ, α) \end{matrix}$

where $p = λ / (λ + ϕ)$ and $T_{j} \sim GG (ϕ + j - 1, λ, α)$ , for $j = 1, 2$ , i.e. $f_{j} (t | λ, ϕ)$ has GG distribution, given by(3) $\begin{matrix} f_{j} (t | ϕ, λ, α) = \frac{α}{Γ (ϕ + j - 1)} λ^{α (ϕ + j - 1)} t^{α (ϕ + j - 1) - 1} e^{- {(λ t)}^{α}} . \end{matrix}$ (3)

The behavior of the p.d.f. (2) when $t \to 0$ and $t \to \infty$ are, respectively, given by $\begin{matrix} f (0) = \{\begin{matrix} \infty, & if α ϕ < 1 \\ \frac{α λ^{2}}{(λ + ϕ) Γ (ϕ)}, & if α ϕ = 1 \\ 0, & if α ϕ > 1 \end{matrix}, f (\infty) = 0 . \end{matrix}$

Figure gives examples of the shapes of the density function for different values of $ϕ, λ$ and $α$ .

Figure 1. Density function shapes for GWL distribution considering different values of $ϕ, λ$ and $α$ .

The cumulative distribution function from the GWL distribution is given by(4) $\begin{matrix} F (t | ϕ, λ, α) = \frac{γ [ϕ, {(λ t)}^{α}] (λ + ϕ) - {(λ t)}^{α ϕ} e^{- {(λ t)}^{α}}}{(λ + ϕ) Γ (ϕ)} \end{matrix}$ (4)

where $γ [y, x] = \int_{0}^{x} w^{y - 1} e^{- w} d w$ is the lower incomplete gamma function.

2.1. Moments

Many important features and properties of a distribution can be obtained through its moments such as mean, variance, kurtosis, and skewness. In this section, important moment functions such as the moment-generating function, r-th moment, r-th central moment, among others are presented.

Theorem 2.1

For the random variable T with GWL distribution, the moment-generating function is given by(5) $\begin{matrix} M_{X} (t) = \sum_{r = 0}^{\infty} \frac{t^{r}}{λ^{r} r!} \frac{(\frac{r}{α} + ϕ + λ) Γ (\frac{r}{α} + ϕ)}{(λ + ϕ) Γ (ϕ)} . \end{matrix}$ (5)

Proof

Note that, the moment-generating function from GG distribution (3) is given by $\begin{matrix} M_{X, j} (t) = \sum_{r = 0}^{\infty} \frac{t^{r}}{r!} \frac{Γ (\frac{r}{α} + ϕ + j - 1)}{λ^{r} Γ (ϕ + j - 1)} . \end{matrix}$

Since the GWL distribution (2) can be expressed as a two-component mixture, we have $\begin{matrix} M_{X} (t) & = E [e^{t X}] = \int_{0}^{\infty} e^{t x} f (x | ϕ, λ, α) d x = p M_{X, 1} (t) + (1 - p) M_{X, 2} (t) \\ = \frac{λ}{(λ + ϕ)} \sum_{r = 0}^{\infty} \frac{t^{r}}{r!} \frac{Γ (\frac{r}{α} + ϕ)}{λ^{r} Γ (ϕ)} + \frac{ϕ}{(λ + ϕ)} \sum_{r = 0}^{\infty} \frac{t^{r}}{r!} \frac{Γ (\frac{r}{α} + ϕ + 1)}{λ^{r} Γ (ϕ + 1)} \\ = \frac{1}{(λ + ϕ)} \sum_{r = 0}^{\infty} \frac{t^{r}}{r!} \frac{λ Γ (\frac{r}{α} + ϕ)}{λ^{r} Γ (ϕ)} + \frac{1}{(λ + ϕ)} \sum_{r = 0}^{\infty} \frac{t^{r}}{r!} \frac{(\frac{r}{α} + ϕ) Γ (\frac{r}{α} + ϕ)}{λ^{r} Γ (ϕ)} \\ = \sum_{r = 0}^{\infty} \frac{t^{r}}{λ^{r} r!} \frac{(\frac{r}{α} + ϕ + λ) Γ (\frac{r}{α} + ϕ)}{(λ + ϕ) Γ (ϕ)} . \end{matrix}$ $□$

Corollary 2.2

For the random variable T with GWL distribution, the r-th moment is given by(6) $\begin{matrix} μ_{r} = E [T^{r}] = \frac{(\frac{r}{α} + ϕ + λ) Γ (\frac{r}{α} + ϕ)}{(λ + ϕ) λ^{r} Γ (ϕ)} . \end{matrix}$ (6)

Proof

Note that, $μ_{r} = M_{X}^{(r)} (0) = \frac{d^{n} M_{X} (0)}{d t^{n}}$ and the result follows. $□$

Corollary 2.3

For the random variable T with GWL distribution, the r-th central moment is given by(7) $\begin{matrix} M_{r} & = E {[T - μ]}^{r} = \sum_{i = 0}^{r} (\begin{matrix} r \\ i \end{matrix}) {(- μ)}^{r - i} E [T^{i}] \\ = \sum_{i = 0}^{r} (\begin{matrix} r \\ i \end{matrix}) {(- \frac{(\frac{1}{α} + ϕ + λ) Γ (\frac{1}{α} + ϕ)}{λ (λ + ϕ) Γ (ϕ)})}^{r - i} \frac{(\frac{i}{α} + ϕ + λ) Γ (\frac{i}{α} + ϕ)}{(λ + ϕ) λ^{i} Γ (ϕ)} . \end{matrix}$ (7)

Corollary 2.4

A random variable T with GWL distribution has the mean and variance respectively given by(8) $\begin{matrix} μ & = \frac{(\frac{1}{α} + ϕ + λ) Γ (\frac{1}{α} + ϕ)}{λ (λ + ϕ) Γ (ϕ)}, \end{matrix}$ (8) (9) $\begin{matrix} σ^{2} & = \frac{λ (λ + ϕ) (\frac{2}{α} + ϕ + λ) Γ (\frac{2}{α} + ϕ) - {(\frac{1}{α} + ϕ + λ)}^{2} Γ {(\frac{1}{α} + ϕ)}^{2}}{λ^{2} {(λ + ϕ)}^{2} Γ {(ϕ)}^{2}} . \end{matrix}$ (9)

Proof

From (6) and considering $r = 1$ we have $μ_{1} = μ$ . The second result follows from (7) considering $r = 2$ with some algebraic manipulations. $□$

Another moment function that can be easily achieved for GWL distribution and plays an important role in information theory is given by(10) $\begin{matrix} E [log (T)] = \frac{(ψ (ϕ) - α log λ + {(λ + ϕ)}^{- 1})}{α} . \end{matrix}$ (10)

2.2. Survival properties

In this section, we present the survival, hazard, and mean residual life (MRL) function for the GWL distribution. The survival function of T is given by(11) $\begin{matrix} S (t | ϕ, λ, α) = \frac{Γ [ϕ, {(λ t)}^{α}] (λ + ϕ) + {(λ t)}^{α ϕ} e^{- {(λ t)}^{α}}}{(λ + ϕ) Γ (ϕ)} \end{matrix}$ (11)

where $Γ (x, y) = \int_{0}^{x} w^{y - 1} e^{- x} d w$ is called upper incomplete gamma. The hazard function is given as(12) $\begin{matrix} h (t | ϕ, λ, α) = \frac{f (t | ϕ, λ, α)}{S (t | ϕ, λ, α)} = \frac{α λ^{α ϕ} t^{α ϕ - 1} (λ + {(λ t)}^{α}) e^{- {(λ t)}^{α}}}{Γ [ϕ, {(λ t)}^{α}] (λ + ϕ) + {(λ t)}^{α ϕ} e^{- {(λ t)}^{α}}} . \end{matrix}$ (12)

The behavior of the hazard function (12) when $t \to 0$ and $t \to \infty$ are, respectively, given by $\begin{matrix} h (0) = \{\begin{matrix} \infty, & if α ϕ < 1 \\ \frac{α λ^{2}}{(λ + ϕ) Γ (ϕ)}, & if α ϕ = 1 \\ 0, & if α ϕ > 1 \end{matrix} and h (\infty) = \{\begin{matrix} 0, & if α ϕ < 1 \\ λ, & if α ϕ = 1 \\ \infty, & if α ϕ > 1 . \end{matrix} \end{matrix}$

Theorem 2.5

The hazard rate function h(t) of the GWL distribution is increasing, decreasing, bathtub, unimodal, or decreasing-increasing-decreasing shaped.

Proof

The theorem proposed by Glaser (Citation1980) is not easily applied in the GLW distribution. Since the hazard rate function (12) is complex, we considered the following cases:

(1)	Let $α = 1$ , then GWL distribution reduces to the WL distribution. Ghitany et al. (Citation2011) proved that the hazard function is bathtub-shaped (increasing) if $0 < ϕ < 1$ $(ϕ > 0)$ , for all $λ > 0$ .
(2)	Let $ϕ = 1$ , then GWL distribution reduces to the PL distribution. Considering $β = λ^{α}$ , Ghitany et al. (Citation2013) proved that the hazard function is

increasing when $\{0 < α \geq 1, β > 0\}$ ;
decreasing when $\{0 < α \leq \frac{1}{2}, β > 0\}$ or $\{\frac{1}{2} < α < 1, β \geq {(2 α - 1)}^{2} {(4 α (1 - α))}^{- 1}\}$ ;
decreasing-increasing-decreasing if $\{\frac{1}{2} < α < 1, 0 < β < {(2 α - 1)}^{2} {(4 α (1 - α))}^{- 1}\}$ .

(3)

Let $α = 2$ and $λ = 1$ , from Glaser’s theorem (Glaser, Citation1980), the hazard rate function is decreasing shaped (unimodal) for $0 < ϕ < 1$ $(ϕ > 1)$ .

□

These properties make the GWL distribution a flexible model for reliable data. Figure gives examples of the shapes of the hazard function for different values of $ϕ, λ$ and $α$ .

Figure 2. Hazard function shapes for GWL distribution and considering different values of $ϕ, λ$ and $α$ .

The MRL has been widely used in survival analysis and represents the expected additional lifetime given that a component has survived until time t.

Proposition 2.6

The MRL function $r (t | ϕ, λ, α)$ of the GWL distribution is given by(13) $\begin{matrix} r (t | ϕ, λ, α) = \frac{(ϕ + \frac{1}{α} + λ) Γ (ϕ + \frac{1}{α}, {(λ t)}^{α}) - λ t (λ + ϕ) Γ (ϕ, {(λ t)}^{α})}{λ [(λ + ϕ) Γ (ϕ, {(λ t)}^{α}) + {(λ t)}^{α ϕ} e^{- {(λ t)}^{α}}]} . \end{matrix}$ (13)

Proof

Note that $\begin{matrix} r (t | ϕ, λ, α) & = \frac{1}{S (t)} \int_{t}^{\infty} y f (y | λ, ϕ) d y - t \\ = \frac{1}{S (t)} [p \int_{t}^{\infty} y f_{1} (y | λ, ϕ) d y + (1 - p) \int_{x}^{\infty} y f_{2} (y | λ, ϕ) d y] - t \\ = \frac{(ϕ + \frac{1}{α} + λ) Γ (ϕ + \frac{1}{α}, {(λ t)}^{α}) - λ t (λ + ϕ) Γ (ϕ, {(λ t)}^{α})}{λ [(λ + ϕ) Γ (ϕ, {(λ t)}^{α}) + {(λ t)}^{α ϕ} e^{- {(λ t)}^{α}}]} . \end{matrix}$ $□$

The behavior of the MRL function when $t \to 0$ and $t \to \infty$ are, respectively, given by $\begin{matrix} r (0) = \frac{1}{λ ((λ + ϕ) Γ (ϕ))} and r (\infty) \{\begin{matrix} \infty, & if α < 1 \\ \frac{1}{λ}, & if α = 1 \\ 0, & if α > 1 . \end{matrix} \end{matrix}$

2.3. Entropy

In information theory, entropy has played a central role as a measure of uncertainty associated with a random variable. Shannon’s entropy is one of the most important metrics in information theory. For the GWL distribution, Shannon’s entropy can be obtained by solving(14) $\begin{matrix} H_{S} (ϕ, λ, α) = - \int_{0}^{\infty} log (\frac{α λ^{α ϕ} t^{α ϕ - 1} (λ + {(λ t)}^{α}) e^{- {(λ t)}^{α}}}{(λ + ϕ) Γ (ϕ)}) f (t | ϕ, λ, α) d t . \end{matrix}$ (14)

Proposition 2.7

A random variable T with GWL distribution has Shannon’s entropy given by(15) $\begin{matrix} H_{S} (ϕ, λ, α) & = log (λ + ϕ) + log Γ (ϕ) - log α - log λ - \frac{ϕ (1 + ϕ + λ)}{(λ + ϕ)} \\ - \frac{ψ (ϕ) (α ϕ - 1)}{α} - \frac{(α ϕ - 1)}{α (λ + ϕ)} - \frac{η (ϕ, λ)}{(λ + ϕ) Γ (ϕ)} \end{matrix}$ (15)

where $\begin{matrix} η (ϕ, λ) = \int_{0}^{\infty} (λ + y) log (λ + y) y^{ϕ - 1} e^{- y} d y = \int_{0}^{1} (λ - log u) log (λ - log u) {(- log u)}^{ϕ - 1} d u . \end{matrix}$

Proof

From the Equation (14), we have(16) $\begin{matrix} H_{S} (ϕ, λ, α) & = - log α - α ϕ log λ + log (λ + ϕ) + log (Γ (ϕ)) + λ^{α} E [T^{α}] \\ - (α ϕ - 1) E [log T] - E [log (λ + {(λ T)}^{α})] . \end{matrix}$ (16)

Note that $\begin{matrix} E [log (λ + {(λ T)}^{α})] = \int_{0}^{\infty} {log (λ + (λ T)}^{α} \frac{α λ^{α ϕ} t^{α ϕ - 1} (λ + {(λ t)}^{α}) e^{- {(λ t)}^{α}}}{(λ + ϕ) Γ (ϕ)} d t, \end{matrix}$

using the change of variable $y = {(λ t)}^{α}$ and after some algebra $\begin{matrix} E [log (λ + {(λ T)}^{α})] & = \frac{1}{(λ + ϕ) Γ (ϕ)} \int_{0}^{\infty} (λ + y) log (λ + y) y^{ϕ - 1} e^{- y} d y \\ = \frac{η (ϕ, λ)}{(λ + ϕ) Γ (ϕ)} . \end{matrix}$

From Equations (6) and (10), we can easily find the solution of $E [T^{α}]$ and $E [log T]$ and the result as follows. $□$

Another popular entropy measure is proposed by Renyi (Citation1961). Some recent applications of the Renyi entropy can be seen in Popescu and Aiordachioaie (Citation2013). If T has the probability density function (1) then Renyi entropy is defined by(17) $\begin{matrix} \frac{1}{1 - ρ} log \int_{0}^{\infty} f^{ρ} (x) d x . \end{matrix}$ (17)

Proposition 2.8

A random variable T with GWL distribution, has the Renyi entropy given by(18) $\begin{matrix} H_{R} (ρ) = \frac{(ρ - 1) (log α + log λ) - ρ (log (λ + ϕ) + log Γ (ϕ)) - log (δ (ρ, ϕ, λ, α))}{1 - ρ} \end{matrix}$ (18)

where $δ (ρ, ϕ, λ, α) = \int_{0}^{\infty} y^{\frac{ρ ϕ - ρ + 1 - α}{α}} {(λ + y)}^{ρ} e^{- ρ y} d y$ .

Proof

The Renyi entropy is given by $\begin{matrix} H_{R} (ρ) & = \frac{1}{1 - ρ} log (\frac{α^{ρ} λ^{ρ}}{{(λ + ϕ)}^{ρ} Γ {(ϕ)}^{ρ}} \int_{0}^{\infty} {(λ t)}^{α ρ (ϕ - \frac{1}{α})} {(λ + {(λ t)}^{α})}^{ρ} e^{- ρ {(λ t)}^{α}} d t) \\ = \frac{1}{1 - ρ} log (\frac{α^{ρ} λ^{ρ}}{{(λ + ϕ)}^{ρ} Γ {(ϕ)}^{ρ}} \int_{0}^{\infty} y^{\frac{ρ ϕ - ρ + 1 - α}{α}} {(λ + y)}^{ρ} e^{- ρ y} d y) \\ = \frac{1}{1 - ρ} log (\frac{α^{ρ} λ^{ρ}}{{(λ + ϕ)}^{ρ} Γ {(ϕ)}^{ρ}} δ (ρ, ϕ, λ, α)) \end{matrix}$

and with some algebra the proof is completed. $□$

2.4. Lorenz curves

The Lorenz curve (Bonferroni, Citation1930) is a well-known measure used in reliability, income inequality, life testing and renewal theory. The Lorenz curve for a non-negative T random variable is given through the consecutive plot of $\begin{matrix} L (F (t)) = \frac{\int_{0}^{t} x f (x) d x}{\int_{0}^{\infty} x f (x) d x} = \frac{1}{μ} \int_{0}^{t} x f (x) d x . \end{matrix}$

Proposition 2.9

The Lorenz curve for the GWL distribution is $\begin{matrix} L (p) = \frac{(\frac{1}{α} + ϕ + λ) γ (ϕ + \frac{1}{α}, {(λ F^{- 1} (p))}^{α}) - {(λ F^{- 1} (p))}^{α ϕ - 1} e^{- {(λ F^{- 1} (p))}^{α}}}{(\frac{1}{α} + ϕ + λ) Γ [\frac{1}{α} + ϕ]} \end{matrix}$

where $F^{- 1} (p) = t_{p}$ .

3. Methods of estimation

In this section, we present eight different estimation methods for the parameters $ϕ, λ$ and $α$ of the GWL distribution.

3.1. Maximum likelihood estimation

The maximum likelihood method has been widely used due to its better asymptotic properties. The estimates are obtained by maximizing the likelihood function. Let $T_{1}, \dots, T_{n}$ be a random sample where $T \sim GWL (ϕ, λ, α)$ , the likelihood function is given by(19) $\begin{matrix} L (ϕ, λ, α; t) = \frac{α^{n} λ^{n α ϕ}}{(λ + ϕ) Γ {(ϕ)}^{n}} \{\prod_{i = 1}^{n} t_{i}^{α ϕ - 1}\} \prod_{i = 1}^{n} (λ + {(λ t_{i})}^{α}) exp \{- λ^{α} \sum_{i = 1}^{n} t_{i}^{α}\} . \end{matrix}$ (19)

The log-likelihood function $l (ϕ, λ, α; t) = log L (ϕ, λ, α; t)$ is given by(20) $\begin{matrix} l (ϕ, λ, α; t) & = n log α + n α ϕ log λ - n log (λ + ϕ) - n log Γ (ϕ) + (α ϕ - 1) \sum_{i = 1}^{n} log (t_{i}) \\ + \sum_{i = 1}^{n} log (λ + {(λ t_{i})}^{α}) - λ^{α} \sum_{i = 1}^{n} t_{i}^{α} . \end{matrix}$ (20)

From the expressions $\frac{\partial}{\partial ϕ} l (ϕ, λ, α; t) = 0$ , $\frac{\partial}{\partial λ} l (ϕ, λ, α; t) = 0$ , $\frac{\partial}{\partial α} l (ϕ, λ, α; t) = 0$ , the likelihood equations are(21) $\begin{matrix} n \hat{α} log (\hat{λ}) + \hat{α} \sum_{i = 1}^{n} log (t_{i}) & = \frac{n}{\hat{λ} + \hat{ϕ}} + n ψ (\hat{ϕ}) \end{matrix}$ (21) (22) $\begin{matrix} \frac{n \hat{α} \hat{ϕ}}{\hat{λ}} + \sum_{i = 1}^{n} \frac{1 + \hat{α} {\hat{λ}}^{\hat{α} - 1} t_{i}^{\hat{α}}}{\hat{λ} + {(t_{i})}^{\hat{α}}} & = \hat{α} {\hat{λ}}^{\hat{α} - 1} \sum_{i = 1}^{n} t_{i}^{\hat{α}} + \frac{n}{\hat{λ} + \hat{ϕ}} \end{matrix}$ (22)

and(23) $\begin{matrix} \frac{n}{\hat{α}} + n \hat{ϕ} log (\hat{λ}) + \hat{ϕ} \sum_{i = 1}^{n} log (t_{i}) + \sum_{i = 1}^{n} \frac{{(\hat{λ} t_{i})}^{\hat{α}} log (\hat{λ} t_{i})}{\hat{λ} + {(\hat{λ} t_{i})}^{\hat{α}}} = {\hat{λ}}^{\hat{α}} \sum_{i = 1}^{n} {t_{i}}^{\hat{α}} log (\hat{λ} t_{i}), \end{matrix}$ (23)

where $ψ (k) = \frac{\partial}{\partial k} log Γ (k) = \frac{Γ^{'} (k)}{Γ (k)}$ . Numerical methods such as Newton-Rapshon are required to find the solution of the nonlinear system. Note that from (21) and (23) and after some algebra we have(24) $\begin{matrix} {\hat{α}}_{MLE} & = \frac{1}{(n log (\hat{λ}) + \sum_{i = 1}^{n} log (t_{i}))} (\frac{n}{\hat{λ} + \hat{ϕ}} + n ψ (\hat{ϕ})) \end{matrix}$ (24) (25) $\begin{matrix} {\hat{ϕ}}_{MLE} & = \frac{({\hat{λ}}^{\hat{α}} \sum_{i = 1}^{n} {t_{i}}^{\hat{α}} log (\hat{λ} t_{i}) - \sum_{i = 1}^{n} \frac{{(\hat{λ} t_{i})}^{\hat{α}} log (\hat{λ} t_{i})}{\hat{λ} + {(\hat{λ} t_{i})}^{\hat{α}}} - \frac{n}{\hat{α}})}{(n log (\hat{λ}) + \sum_{i = 1}^{n} log (t_{i}))} . \end{matrix}$ (25)

Under mild conditions, the maximum likelihood estimates (MLEs) are asymptotically normal distributed with a joint multivariate normal distribution given by $\begin{matrix} ({\hat{ϕ}}_{MLE}, {\hat{λ}}_{MLE}, {\hat{α}}_{MLE}) \sim N_{3} [(ϕ, λ, α), I^{- 1} (ϕ, λ, α))] as n \to \infty . \end{matrix}$

where $I (ϕ, λ, α)$ is the Fisher information matrix is given as(26) $\begin{matrix} I (ϕ, λ, α) = [\begin{matrix} I_{ϕ, ϕ} (ϕ, λ, α) & I_{ϕ, λ} (ϕ, λ, α) & I_{ϕ, α} (ϕ, λ, α) \\ I_{ϕ, λ} (ϕ, λ, α) & I_{λ, λ} (ϕ, λ, α) & I_{λ, α} (ϕ, λ, α) \\ I_{ϕ, α} (ϕ, λ, α) & I_{λ, α} (ϕ, λ, α) & I_{α, α} (ϕ, λ, α) \end{matrix}], \end{matrix}$ (26)

and the elements of the matrix are given in Appendix 2.

3.2. Moments estimators

The method of moments is one of the oldest methods used for estimating parameters in statistical models. The moments estimators (MEs) of the GLW distribution can be obtained by equating the first three sample moments $\bar{x} = \frac{1}{n} \sum_{i = 1}^{n} t_{i}$ , $\frac{1}{n} \sum_{i = 1}^{n} t_{i}^{2}$ and $\frac{1}{n} \sum_{i = 1}^{n} t_{i}^{3}$ with the theoretical moments $\begin{matrix} \frac{1}{n} \sum_{i = 1}^{n} t_{i} = \frac{(\frac{1}{α} + ϕ + λ) Γ (\frac{1}{α} + ϕ)}{(λ + ϕ) λ Γ (ϕ)} \frac{1}{n} \sum_{i = 1}^{n} t_{i}^{2} = \frac{(\frac{2}{α} + ϕ + λ) Γ (\frac{2}{α} + ϕ)}{(λ + ϕ) λ^{2} Γ (ϕ)} \\ and \frac{1}{n} \sum_{i = 1}^{n} t_{i}^{3} = \frac{(\frac{3}{α} + ϕ + λ) Γ (\frac{3}{α} + ϕ)}{(λ + ϕ) λ^{3} Γ (ϕ)} . \end{matrix}$

Therefore, the ME ${\hat{ϕ}}_{M E}$ , ${\hat{λ}}_{M E}$ and ${\hat{α}}_{M E}$ , can be obtained by solving the non-linear equations $\begin{matrix} \frac{(\frac{j}{α} + ϕ + λ) Γ (\frac{j}{α} + ϕ)}{(λ + ϕ) λ^{j} Γ (ϕ)} - \frac{1}{n} \sum_{i = 1}^{n} t_{i}^{j} = 0, j = 1, 2, 3 . \end{matrix}$

3.3. Ordinary and weighted least-square estimate

Let $t_{(1)}, t_{(2)}, \dots, t_{(n)}$ be the order statistics (the same notation is assumed for the next subsections) of the random sample of size n from $F (t | ϕ, λ, α)$ . The least square estimators ${\hat{ϕ}}_{LSE}$ , ${\hat{λ}}_{LSE}$ and ${\hat{α}}_{LSE}$ can be obtained by minimizing $\begin{matrix} V (ϕ, λ, α) = \sum_{i = 1}^{n} {[F (t_{(i)} | ϕ, λ, α) - \frac{i}{n + 1}]}^{2} \end{matrix}$

with respect to $ϕ, λ$ and $α$ . Equivalently, the estimates can be obtained by solving the non-linear equations $\begin{matrix} \sum_{i = 1}^{n} [F (t_{(i)} | ϕ, λ, α) - \frac{i}{n + 1}] Δ_{j} (t_{(i)} | ϕ, λ, α) = 0, j = 1, 2, 3 \end{matrix}$

where(27) $\begin{matrix} Δ_{1} (t_{(i)} | ϕ, λ, α) & = \frac{\partial}{\partial ϕ} F (t_{(i)} | ϕ, λ, α), Δ_{2} (t_{(i)} | ϕ, λ, α) = \frac{\partial}{\partial λ} F (t_{(i)} | ϕ, λ, α) \\ and Δ_{3} (t_{(i)} | ϕ, λ, α) = \frac{\partial}{\partial α} F (t_{(i)} | ϕ, λ, α) . \end{matrix}$ (27)

Note that the solution of $Δ_{i}$ for $i = 1, 2, 3$ involves partial derivatives of the lower incomplete gamma function. However, this can be easily achieved numerically with high precision.

The weighted least-squares estimates (WLSEs), ${\hat{ϕ}}_{WLSE}$ , ${\hat{λ}}_{WLSE}$ and ${\hat{α}}_{WLSE}$ , can be obtained by minimizing $\begin{matrix} W (ϕ, λ, α) = \sum_{i = 1}^{n} \frac{{(n + 1)}^{2} (n + 2)}{i (n - i + 1)} {[F (t_{(i)} | ϕ, λ, α) - \frac{i}{n + 1}]}^{2} . \end{matrix}$

These estimates can also be obtained by solving the non-linear equations $\begin{matrix} \sum_{i = 1}^{n} \frac{{(n + 1)}^{2} (n + 2)}{i (n - i + 1)} [F (t_{(i)} | ϕ, λ, α) - \frac{i}{n + 1}] Δ_{j} (t_{(i)} | ϕ, λ, α) = 0, j = 1, 2, 3, \end{matrix}$

where $Δ_{1} (\cdot | ϕ, λ, α)$ , $Δ_{2} (\cdot | ϕ, λ, α)$ and $Δ_{3} (\cdot | ϕ, λ, α)$ are given in (27).

3.4. Method of maximum product of spacings

The MPS method is a powerful alternative to MLE for the estimation of unknown parameters of continuous univariate distributions. Proposed by Cheng and Amin (Citation1979,Citation1983), this method was also independently developed by Ranneby (Citation1984) as an approximation to the Kullback–Leibler information measure. Cheng and Amin (Citation1983) proved desirable properties of the MPS such as asymptotic efficiency, invariance, and more importantly, the consistency of maximum product of spacing estimators holds under more general conditions than for MLEs.

Let $D_{i} (ϕ, λ, α) = F (t_{(i)} | ϕ, λ, α) - F (t_{(i - 1)} | ϕ, λ, α)$ , for $i = 1, 2, \dots, n + 1,$ be the uniform spacings of a random sample from the GWL distribution, where $F (t_{(0)} | ϕ, λ, α) = 0$ and $F (t_{(n + 1)} | ϕ, λ, α) = 1 .$ Clearly $\sum_{i = 1}^{n + 1} D_{i} (ϕ, λ, α) = 1$ . The MPS estimates ${\hat{ϕ}}_{MPS}$ , ${\hat{λ}}_{MPS}$ and ${\hat{α}}_{MPS}$ are obtained by maximizing the geometric mean of the spacings(28) $\begin{matrix} G (ϕ, λ, α) = {[\prod_{i = 1}^{n + 1} D_{i} (ϕ, λ, α)]}^{\frac{1}{n + 1}} \end{matrix}$ (28)

with respect to $ϕ$ , $λ$ and $α$ , or, equivalently, by maximizing the logarithm of the geometric mean of sample spacings(29) $\begin{matrix} H (ϕ, λ, α) = \frac{1}{n + 1} \sum_{i = 1}^{n + 1} log D_{i} (ϕ, λ, α) . \end{matrix}$ (29)

The estimates ${\hat{ϕ}}_{M P S}$ , ${\hat{λ}}_{MPS}$ and ${\hat{α}}_{MPS}$ of the parameters $ϕ$ , $λ$ and $α$ can be obtained by solving the nonlinear equations(30) $\begin{matrix} \frac{1}{n + 1} \sum_{i = 1}^{n + 1} \frac{1}{D_{i} (ϕ, λ, α))} [Δ_{j} (t_{(i)} | ϕ, λ, α) - Δ_{j} (t_{(i - 1)} | ϕ, λ, α)] = 0, j = 1, 2, 3, \end{matrix}$ (30)

where $Δ_{1} (\cdot | ϕ, λ, α)$ , $Δ_{2} (\cdot | ϕ, λ, α)$ and $Δ_{3} (\cdot | ϕ, λ, α)$ are given respectively in (27). Note that if $t_{(i + k)} = t_{(i + k - 1)} = \dots = t_{(i)}$ then $D_{i + k} (ϕ, λ, α) = D_{i + k - 1} (ϕ, λ, α) = \dots = D_{i} (ϕ, λ, α) = 0$ . Therefore, the MPS estimators are sensitive to closely spaced observations, especially ties. When the ties are due to multiple observations, $D_{i} (ϕ, λ, α)$ should be replaced by the corresponding likelihood $f (t_{(i)}, ϕ, λ, α)$ since $t_{(i)} = t_{(i - 1)}$ .

Under mild conditions for the GWL distribution, the MPS estimators are asymptotically normal distributed with a joint trivariate normal distribution given by $\begin{matrix} ({\hat{ϕ}}_{MPS}, {\hat{λ}}_{MPS}, {\hat{α}}_{MPS}) \sim N_{3} [(ϕ, λ, α), I^{- 1} (ϕ, λ, α))] as n \to \infty . \end{matrix}$

3.5. The Cramer-von Mises minimum distance estimators

The Cramer-von Mises estimator is a type of minimum distance estimators (also called maximum goodness-of-fit estimators) and is based on the difference between the estimate of the cumulative distribution function and the empirical distribution function (Luceño, Citation2006).

Macdonald (Citation1971) motivated the choice of the CME estimators providing empirical evidence that the bias of the estimator is smaller than the other minimum distance estimators. The Cramer-von Mises estimates ${\hat{ϕ}}_{CME}$ , ${\hat{λ}}_{CME}$ and ${\hat{α}}_{CME}$ of the parameters $ϕ$ , $λ$ and $α$ are obtained by minimizing(31) $\begin{matrix} C (ϕ, λ, α) = \frac{1}{12 n} + \sum_{i = 1}^{n} {(F (t_{(i)} | ϕ, λ, α) - \frac{2 i - 1}{2 n})}^{2}, \end{matrix}$ (31)

with respect to $ϕ$ , $λ$ and $α$ . These estimates can also be obtained by solving the nonlinear equations: $\begin{matrix} \sum_{i = 1}^{n} (F (t_{(i)} | ϕ, λ, α) - \frac{2 i - 1}{2 n}) Δ_{j} (t_{(i)} | ϕ, λ, α) = 0, j = 1, 2, 3, \end{matrix}$

where $Δ_{1} (\cdot | ϕ, λ, α)$ , $Δ_{2} (\cdot | ϕ, λ, α)$ and $Δ_{3} (\cdot | ϕ, λ, α)$ are given respectively in (27).

3.6. The Anderson–Darling and Right-tail Anderson–Darling estimators

Another type of minimum distance estimator is based on ADE statistic and is known as ADE estimator. The ADE estimates ${\hat{ϕ}}_{ADE}, {\hat{λ}}_{ADE}$ and ${\hat{α}}_{ADE}$ of the parameters $ϕ, λ$ and $α$ are obtained by minimizing, with respect to $ϕ$ , $λ$ and $α$ , the function(32) $\begin{matrix} A (ϕ, λ, α) = - n - \frac{1}{n} \sum_{i = 1}^{n} (2 i - 1) (log F (t_{(i)} | ϕ, λ, α) + log S (t_{(n + 1 - i)} | ϕ, λ, α)) . \end{matrix}$ (32)

These estimates can also be obtained by solving the nonlinear equations $\begin{matrix} \sum_{i = 1}^{n} (2 i - 1) [\frac{Δ_{j} (t_{(i)} | ϕ, λ, α)}{F (t_{(i)} | ϕ, λ, α)} - \frac{Δ_{j} (t_{(n + 1 - i)} | ϕ, λ, α)}{S (t_{(n + 1 - i)} | ϕ, λ, α)}] = 0, j = 1, 2, 3 . \end{matrix}$

The Right-tail ADE estimates ${\hat{ϕ}}_{RADE}, {\hat{λ}}_{RADE}$ and ${\hat{α}}_{RADE}$ of the parameters $ϕ, λ$ and $α$ are obtained by minimizing the function(33) $\begin{matrix} R (ϕ, λ, α) = \frac{n}{2} - 2 \sum_{i = 1}^{n} F (t_{i : n} | ϕ, λ, α) - \frac{1}{n} \sum_{i = 1}^{n} (2 i - 1) log S (t_{n + 1 - i : n} | ϕ, λ, α) . \end{matrix}$ (33)

with respect to $ϕ$ , $λ$ and $α$ . These estimates can also be obtained by solving the nonlinear equations: $\begin{matrix} - 2 \sum_{i = 1}^{n} Δ_{j} (t_{i : n} | ϕ, λ, α) + \frac{1}{n} \sum_{i = 1}^{n} (2 i - 1) \frac{Δ_{j} (t_{n + 1 - i : n} | ϕ, λ, α)}{S (t_{n + 1 - i : n} | ϕ, λ, α)} = 0, j = 1, 2, 3 . \end{matrix}$

where $Δ_{1} (\cdot | ϕ, λ, α)$ , $Δ_{2} (\cdot | ϕ, λ, α)$ and $Δ_{3} (\cdot | ϕ, λ, α)$ are given respectively in (27).

4. Simulation study

In this section, an intensive simulation study is presented to compare the efficiency of the estimation procedures for parameters of the GWL distribution. The following procedure was adopted:

(1)	Generate pseudo-random values from the $GWL (ϕ, λ, α)$ with size n.
(2)	Using the values obtained in step 1, calculate $\hat{ϕ}$ , $\hat{λ}$ and $\hat{α}$ via 1-MLE, 2-MPS, 3-ADE, 4-RTADE, 5-LSE, 6-WLSE, 7-ME, 8-CME.
(3)	Repeat the steps 1 and 2 N times.
(4)	Using $\hat{θ} = (\hat{ϕ}, \hat{λ}, \hat{α})$ and $θ = (ϕ, λ, α)$ , compute the mean relative estimates (MRE) $\sum_{j = 1}^{N} \frac{{\hat{θ}}_{i, j} / θ_{i}}{N}$ and the mean square errors (MSE) $\sum_{j = 1}^{N} \frac{{({\hat{θ}}_{i, j} - θ_{i})}^{2}}{N}$ , for $i = 1, 2, 3$ .

Considering this approach, the most efficient estimation method will have MREs closer to one and MSEs closer to zero. The results were computed using the software R using the seed 2015 to generate the pseudo-random values. The initial values considered were the same values used to generate the random samples. The chosen values to perform this procedure were

N = 10, 000

and

n = (50, 60, \dots, 250)

. For reasons of space, we have presented the results only for

θ = (2, 0.5, 0.1)

. However, the following results are similar for other choices of

θ

Figure 3. Proportion of failure from N simulated samples, considering different values of n using the following estimation method 1-MLE, 2-MPS, 3-ADE, 4-RTADE, 5-LSE, 6-WLSE, 7-ME, 8-CME.

Figure 4. MREs, MSEs related from the estimates of $ϕ = 0.5, λ = 0.7$ and $α = 1.5$ for N simulated samples, considering different values of n obtained using the following estimation method 1-MLE, 2-MPS, 3-ADE, 4-RTADE.

Figure 5. (left panel) the TTT-plot, (middle panel) the fitted survival superimposed to the empirical survival function and (right panels) the hazard function adjusted by GWL distribution.

Figure 6. (left panel) the TTT-plot, (middle panel) the fitted survival superimposed to the empirical survival function and (right panels) the hazard function adjusted by GWL distribution.

For this comparison to be meaningful, the estimation procedures need to be performed under same conditions. However, for some particular samples and estimation methods, the numerical techniques do not work well in finding the parameter estimates. Therefore, a rate study is presented to verify the frequency of convergence of the numerical solutions. This procedure is carried out by counting the number of times each estimation fails in finding the numerical solution. In Figure we present the proportion of failure from each method.

From Figure , the MLE, LSE, WLSE, ME, and the CME estimators fail in finding the parameter estimates for a significant number of samples. Therefore, such methods are not recommended for estimation of the GLW parameters. Hereafter, we consider the MPS, ADE, RADE estimators due to their better computational stability. The MLE is considered only for illustrative purposes since it is the most used estimation method. Figure presents the MREs, MSEs for the estimates of $ϕ, λ$ and $α$ using the MLE, MPS, ADE, RADE with N simulated samples and different values of $θ = (2, 0.5, 0.1)$ and n. The horizontal lines in both figures correspond to MREs and MSEs being one and zero, respectively.

From these results, the MSE of the MLE, MPS, ADE, and RADE estimators tend to zero for large n and also, as expected, the values of MREs tend to one, i.e. the estimates are consistent and asymptotically unbiased for the parameters. For small sample sizes, the MLE has the largest MSEs. The MPS has smaller MSEs with MREs closer to one for almost all values of n. Additionally, the MPS, ADE, and RADE estimators were the only methods that were able to find $\hat{ϕ}, \hat{λ}$ and $\hat{α}$ for all the $2 \times 10^{6}$ generated samples. Therefore, combining all results with the good properties of the MPS method such as consistency, asymptotic efficiency, normality and invariance, we conclude that the MPS estimators are a highly competitive method compared to the maximum likelihood for estimating the parameters of the GWL distribution.

5. Application

In this section, we compare the GWL distribution with other three-parameter lifetime distributions considering two data-sets, the first with bathtub hazard rate and the other with the increasing hazard function. The following lifetime distributions were considered. The GG distribution with p.d.f given by $\begin{matrix} f (t) = α Γ {(ϕ)}^{- 1} β^{α ϕ} t^{α ϕ - 1} e^{- {(β t)}^{α}} \end{matrix}$

where $β > 0, ϕ > 0$ and $α > 0$ . The GW distribution where the p.d.f is $\begin{matrix} f (t) = {(α ϕ)}^{- 1} {(t / ϕ)}^{1 / α - 1} {(1 - λ {(t / ϕ)}^{1 / α})}^{1 / λ - 1} \end{matrix}$

where $λ \in R, ϕ > 0$ and $α > 0$ . The GEP distribution with p.d.f given by $\begin{matrix} f (t) = (α β ϕ / {(1 - e^{- ϕ})}^{α}) e^{- ϕ - β t + ϕ exp (- β t)} {(1 - e^{- ϕ + ϕ exp (- β t)})}^{α - 1} \end{matrix}$

where $β > 0, ϕ > 0$ and $α > 0$

The EW distribution with p.d.f $\begin{matrix} f (t) = α ϕ β^{- 1} {(t / β)}^{α - 1} exp (- {(t / β)}^{α}) {(1 - exp (- {(t / β)}^{α}))}^{ϕ - 1} \end{matrix}$

where $β > 0, ϕ > 0$ and $α > 0$ .

The TTT-plot (total time on test) is considered in order to verify the behavior of the empirical hazard function (Barlow & Campo, Citation1975). The TTT-plot is obtained through the plot of [r / n, G(r / n)] where $\begin{matrix} G (r / n) = (\sum_{i = 1}^{r} t_{i} + (n - r) t_{(r)}) / \sum_{i = 1}^{n} t_{i}, r = 1, \dots, n, i = 1, \dots, n \end{matrix}$

and $t_{(i)}$ is the statistical order. If the curve is concave (convex), the hazard function is increasing (decreasing). On the other hand, when it starts convex and then becomes concave (concave and then convex) the hazard function has bathtub (inverse bathtub) shape.

The goodness of fit is checked considering the Kolmogorov–Smirnov (KS) test. This procedure is based on the KS statistic $D_{n} = sup |F_{n} (t) - F (t; ϕ, λ, α)|$ , where $sup t$ is the supremum of the set of distances, $F_{n} (t)$ is the empirical distribution function and $F (t; α, β, λ)$ is c.d.f. A hypothesis test is conducted at the $5 %$ level of significance to test whether or not the data come from $F (t; α, β, λ)$ . In this case, the null hypothesis is rejected if the returned p-value is smaller than 0.05.

To carry out the model selection, the following discrimination criterion methods are adopted: AIC (Akaike information criteria) and AICc (Corrected Akaike information criterion) computed, respectively, by $A I C = - 2 l (\hat{θ}; t) + 2 k$ and $A I C c = A I C + 2 k (k + 1) {(n - k - 1)}^{- 1}$ , where k is the number of parameters to be fitted and $\hat{θ}$ is estimation of $θ$ . For a set of candidate models for $t$ , the best one provides the minimum values.

5.1. Lifetimes data

Aarset (Citation1987) presents the data-set (see Table ) related to the lifetime in hours of 50 devices on test

Table 1. Lifetimes data (in hours) related to a device on test

Download CSV Display Table

Figure shows (left panel) the TTT-plot, (middle panel) the fitted survival superimposed to the empirical survival function and (right panels) the hazard function adjusted by GWL distribution. Table presents the AIC and AICc criteria and the p-value from the KS test for all fitted distributions considering the Aarset dataset.

Table 2. Results of AIC and AICc criteria and the p-value from the KS test for all fitted distributions considering the Aarset dataset

Download CSV Display Table

Table 3. MPS estimates, standard-error and $95 %$ CI for $ϕ, λ$ and $α$

Display Table

Table 4. January average flows (m $^{3}$ /s) of the Cantareira system

Display Table

Table 5. Results of AIC and AICc criteria and the p-value from the KS test for all fitted distributions considering the data-set related to the january average flows (m $^{3}$ /s) of the Cantareira system

Display Table

Table 6. ML estimates, standard-error and $95 %$ CI for $ϕ, λ$ and $α$

Display Table

Comparing the empirical survival function with the adjusted distributions, it can be observed that the GWL distribution is as a better fit. This result is also confirmed from the AIC and AICC (see Table ) since GWL distribution has the minimum values and also the p-values returned from the KS test are greater than 0.05. It should be emphasized that considering a significance level of $5 %$ , the others models are not able to fit the proposed data. Table displays the MPS estimates, standard errors, and the confidence intervals (CI) for $ϕ, λ$ and $α$ of the GWL distribution.

In this section, we consider the ML estimator showing that both MPS or MLE could be used successfully in applications. Figure shows (left panel) the TTT-plot, (middle panel) the fitted survival superimposed to the empirical survival function and (right panels) the hazard function adjusted by GWL distribution. Table presents the AIC and AICc criteria and the p-value from the KS test for all fitted distributions considering the data-set related to the January average flows (m $^{3}$ /s) of the Cantareira system.

From the empirical survival function and the adjusted distributions, it can be observed that the GWL distribution is better. This result is also confirmed from AIC and AICC since GWL distribution has the minimum values and the p-values returned from the KS test are greater than 0.05. Table displays the ML estimates, standard errors, and the CI for $ϕ, λ$ and $α$ of the GWL distribution.

5.2. Average flows data

The study of average flows has been proved to be of high importance to protect and maintain aquatic resources in streams and rivers (Reiser, Wesche, & Estes, Citation1989). In this section, we consider a real data-set related to the average flows (m $^{3}$ /s) of the Cantareira system during January at São Paulo city in Brazil. It is worth mentioning that the Cantareira system provides water to 9 million people in the São Paulo metropolitan area. The data-set available in Table was obtained from the National Water Agency from 1930 to 2012.

6. Concluding remarks

To summarize, we have proposed a three-parameter lifetime distribution. The GLW distribution is a straightforward generalization of the WL distribution proposed by Ghitany et al. (Citation2011), which accommodates increasing, decreasing, decreasing-increasing-decreasing, bathtub, and unimodal hazard rate making the GWL distribution a flexible model for reliable data. The mathematical properties of this distribution are also discussed.

The estimation procedures for the parameters of GWL distribution are also derived considering eight estimation methods. Since it is not feasible to compare these methods theoretically, we have presented an extensive simulation study in order to identify the most efficient procedure. We observed that the MLE, ME, LSE, WLSE, and the CME estimators fail in finding the parameter estimates for a significant number of samples. The simulations showed that the MPS (maximum product of spacing) is the most efficient method for estimating the parameters of the GWL distribution in comparison to its competitors. Finally, two data-sets were analyzed for illustrative purposes proving that the GWL distribution outperforms several usual three parameter lifetime distributions.

Acknowledgements

We are grateful to the Editorial Board and the reviewers for their valuable comments and suggestions which has improved the manuscript.

Additional information

Funding

The research was partially supported by CNPq, FAPESP, and CAPES of Brazil.

Notes on contributors

P.L. Ramos

P.L. Ramos holds a BSc degree in Statistics and an MSc in Applied and Computational Mathematics from the São Paulo State University, Brazil. He is currently reading for his PhD in Statistics at the Institute for Mathematical Science and Computing, University of São Paulo (USP), Brazil. His main research interests are in survival analysis, Bayesian inference, classical inference, and probability distribution theory.

F. Louzada

F. Louzada is a professor of Statistics at the Institute for Mathematical Science and Computing, University of So Paulo (USP), Brazil. He received his PhD degree in Statistics from the University of Oxford, UK, his MSc degree in Computational Mathematics from USP, Brazil, and his BSc degree in Statistics from UFSCar, Brazil. His main research interests are in survival analysis, data mining, Bayesian inference, classical inference, and probability distribution theory.

References

Aarset, M. V. (1987). How to identify a bathtub hazard rate. IEEE Transactions on Reliability, 36, 106–108.
Web of Science ®Google Scholar
Ali, S. (2015). On the bayesian estimation of the weighted lindley distribution. Journal of Statistical Computation and Simulation, 85, 855–880.
Web of Science ®Google Scholar
Al-Mutairi, D., Ghitany, M., & Kundu, D. (2015). Inferences on stress-strength reliability from weighted lindley distributions. Communications in Statistics-Theory and Methods, 44, 4096–4113.
Web of Science ®Google Scholar
Bakouch, H. S., Al-Zahrani, B. M., Al-Shomrani, A. A., Marchi, V. A., & Louzada, F. (2012). An extended lindley distribution. Journal of the Korean Statistical Society, 41, 75–85.
Web of Science ®Google Scholar
Barlow, R. E., & Campo, R. A. (1975). Total time on test processes and applications to failure data analysis (Technical report). Berkeley, CA: DTIC Document.
Google Scholar
Barreto-Souza, W., & Bakouch, H. S. (2013). A new lifetime model with decreasing failure rate. Statistics, 47, 465–476.
Web of Science ®Google Scholar
Barreto-Souza, W., & Cribari-Neto, F. (2009). A generalization of the exponential-poisson distribution. Statistics & Probability Letters, 79, 2493–2500.
Web of Science ®Google Scholar
Bonferroni, C. (1930). Elementi di statistica generale. Firenze: Seeber.
Google Scholar
Cheng, R. & Amin, N. (1979). Maximum product of spacings estimation with application to the lognormal distribution (Mathematical Report 79-1). Cardiff: University of Wales IST.
Google Scholar
Cheng, R., & Amin, N. (1983). Estimating parameters in continuous univariate distributions with a shifted origin. Journal of the Royal Statistical Society. Series B (Methodological), 45, 394–403.
Web of Science ®Google Scholar
Ghitany, M., Al-Mutairi, D., Balakrishnan, N., & Al-Enezi, L. (2013). Power lindley distribution and associated inference. Computational Statistics & Data Analysis, 64, 20–33.
Web of Science ®Google Scholar
Ghitany, M., Alqallaf, F., Al-Mutairi, D., & Husain, H. (2011). A two-parameter weighted lindley distribution and its applications to survival data. Mathematics and Computers in Simulation, 81, 1190–1201.
Web of Science ®Google Scholar
Ghitany, M., Atieh, B., & Nadarajah, S. (2008). Lindley distribution and its application. Mathematics and Computers in Simulation, 78, 493–506.
Web of Science ®Google Scholar
Glaser, R. E. (1980). Bathtub and related failure rate characterizations. Journal of the American Statistical Association, 75, 667–672.
Web of Science ®Google Scholar
Luceño, A. (2006). Fitting the generalized pareto distribution to data using maximum goodness-of-fit estimators. Computational Statistics & Data Analysis, 51, 904–917.
Web of Science ®Google Scholar
Macdonald, P. (1971). An estimation procedure for mixtures of distribution. Journal of the Royal Statistical Society. Series B (Methodological), 33, 326–329.
Web of Science ®Google Scholar
Mazucheli, J., Louzada, F., & Ghitany, M. (2013). Comparison of estimation methods for the parameters of the weighted lindley distribution. Applied Mathematics and Computation, 220, 463–471.
Web of Science ®Google Scholar
Mudholkar, G. S., Srivastava, D. K., & Freimer, M. (1995). The exponentiated weibull family: A reanalysis of the bus-motor-failure data. Technometrics, 37, 436–445.
Web of Science ®Google Scholar
Mudholkar, G. S., Srivastava, D. K., & Kollia, G. D. (1996). A generalization of the weibull distribution with application to the analysis of survival data. Journal of the American Statistical Association, 91, 1575–1583.
Web of Science ®Google Scholar
Popescu, T. D., & Aiordachioaie, D. (2013). Signal segmentation in time-frequency plane using renyi entropy-application in seismic signal processing. In 2013 Conference on Control and Fault-Tolerant Systems (SysTol) (pp. 312–317). Nice: IEEE.
Google Scholar
Ranneby, B. (1984). The maximum spacing method. An estimation method related to the maximum likelihood method. Scandinavian Journal of Statistics, 11, 93–112.
Web of Science ®Google Scholar
Reiser, D. W., Wesche, T. A., & Estes, C. (1989). Status of instream flow legislation and practices in north america. Fisheries, 14, 22–29.
Web of Science ®Google Scholar
Renyi, A. (1961). On measures of entropy and information. In Fourth Berkeley Symposium on Mathematical Statistics and Probability, 1, 47–561.
Google Scholar
Stacy, E. W. (1962). A generalization of the gamma distribution. The Annals of Mathematical Statistics, 33, 1187–1192.
Google Scholar
Torabi, H., Falahati-Naeini, M., & Montazeri, N. (2014). An extended generalized lindley distribution and its applications to lifetime data. Journal of Statistical Research of Iran, 11, 203–222.
Google Scholar
Wang, M., & Wang, W. (in press). Bias-corrected maximum likelihood estimation of the parameters of the weighted lindley distribution. Communications in Statistics-Simulation and Computation, 46, 530–545.
Web of Science ®Google Scholar
Zakerzadeh, H., & Dolati, A. (2009). Generalized lindley distribution. Journal of Mathe-matical Extension, 3, 1–17
Google Scholar

Appendix 1

Appendix

The elements of the Fisher information matrix are

\begin{matrix} I_{ϕ, ϕ} & = - E [\frac{\partial l (θ; t)}{\partial ϕ^{2}}] = - \frac{1}{{(λ + ϕ)}^{2}} + ψ^{'} (θ) \\ I_{ϕ, λ} & = - E [\frac{\partial l (θ; t)}{\partial ϕ \partial λ}] = - \frac{α}{λ} + \frac{1}{{(λ + ϕ)}^{2}} \\ I_{ϕ, α} & = - E [\frac{\partial l (θ; t)}{\partial ϕ \partial α}] = \frac{- α log (λ) - ψ (ϕ) + α log (λ) - {(λ + ϕ)}^{- 1}}{α} \\ I_{λ, λ} & = - E [\frac{\partial l (θ; t)}{\partial λ^{2}}] = \frac{α ϕ}{λ^{2}} + (α - 1) λ^{α - 2} (ψ (ϕ) - α log (λ) + {(λ + ϕ)}^{- 1}) \\ + E [\frac{α T^{α} λ^{α - 2} ((α - 2) λ - {(λ T)}^{α})}{(λ + {(λ T)}^{α})}] - \frac{1}{{(λ + ϕ)}^{2}} \\ I_{α, α} & = - E [\frac{\partial l (θ; t)}{\partial α^{2}}] = \frac{ϕ (λ + ϕ + 1) (ψ {(ϕ)}^{2} + ψ (ϕ))}{α^{2} (λ + ϕ)} + \frac{1}{α^{2}} \\ + \frac{2 (λ + 2 ϕ + 1) ψ (ϕ) + 2}{α^{2} (λ + ϕ)} - E [\frac{λ {(λ T)}^{α} log {(λ T)}^{2}}{(λ + {(λ T)}^{α})}] \\ I_{α, λ} & = - E [\frac{\partial l (θ; t)}{\partial α \partial λ}] = - \frac{ϕ}{λ} + \frac{λ (1 + ϕ ψ (ϕ)) + ϕ (1 + (ϕ + 1) ψ (ϕ + 1))}{λ (λ + ϕ)} \\ - E [\frac{(1 + α λ^{α - 1} T^{α}) {(λ T)}^{α} log (λ T)}{{(λ + {(λ T)}^{α})}^{2}}] + \frac{(ϕ + λ + 1 - \frac{1}{α}) Γ (ϕ + 1 - \frac{1}{α})}{(λ + ϕ) Γ (ϕ)} \\ - E [\frac{(α λ^{α - 1} T^{α} log (λ T) + {(λ T)}^{α - 1})}{(λ + {(λ T)}^{α})}] \end{matrix}

The generalized weighted Lindley distribution: Properties, estimation, and applications

Abstract

Public Interest Statement

1. Introduction

2. Generalized weighted Lindley distribution

2.1. Moments

2.2. Survival properties

2.3. Entropy

2.4. Lorenz curves

3. Methods of estimation

3.1. Maximum likelihood estimation

3.2. Moments estimators

3.3. Ordinary and weighted least-square estimate

3.4. Method of maximum product of spacings

3.5. The Cramer-von Mises minimum distance estimators

3.6. The Anderson–Darling and Right-tail Anderson–Darling estimators

4. Simulation study

5. Application

5.1. Lifetimes data

Table 1. Lifetimes data (in hours) related to a device on test

Table 2. Results of AIC and AICc criteria and the p-value from the KS test for all fitted distributions considering the Aarset dataset

Table 3. MPS estimates, standard-error and $95 %$ CI for $ϕ, λ$ and $α$

Table 4. January average flows (m $^{3}$ /s) of the Cantareira system

Table 5. Results of AIC and AICc criteria and the p-value from the KS test for all fitted distributions considering the data-set related to the january average flows (m $^{3}$ /s) of the Cantareira system

Table 6. ML estimates, standard-error and $95 %$ CI for $ϕ, λ$ and $α$

5.2. Average flows data

6. Concluding remarks

Acknowledgements

Notes on contributors

P.L. Ramos

F. Louzada

Related Research Data

References

Appendix 1

Appendix

Information for

Open access

Opportunities

Help and information

The generalized weighted Lindley distribution: Properties, estimation, and applications

Abstract

Public Interest Statement

1. Introduction

2. Generalized weighted Lindley distribution

2.1. Moments

2.2. Survival properties

2.3. Entropy

2.4. Lorenz curves

3. Methods of estimation

3.1. Maximum likelihood estimation

3.2. Moments estimators

3.3. Ordinary and weighted least-square estimate

3.4. Method of maximum product of spacings

3.5. The Cramer-von Mises minimum distance estimators

3.6. The Anderson–Darling and Right-tail Anderson–Darling estimators

4. Simulation study

5. Application

5.1. Lifetimes data

Table 1. Lifetimes data (in hours) related to a device on test

Table 2. Results of AIC and AICc criteria and the p-value from the KS test for all fitted distributions considering the Aarset dataset

Table 3. MPS estimates, standard-error and 95% CI for ϕ,λ and α

Table 4. January average flows (m3/s) of the Cantareira system

Table 5. Results of AIC and AICc criteria and the p-value from the KS test for all fitted distributions considering the data-set related to the january average flows (m3/s) of the Cantareira system

Table 6. ML estimates, standard-error and 95% CI for ϕ,λ and α

5.2. Average flows data

6. Concluding remarks

Acknowledgements

Additional information

Funding

Notes on contributors

P.L. Ramos

F. Louzada

Related Research Data

References

Appendix 1

Appendix

Related research

To cite this article:

Download citation

Your download is now in progress and you may close this window

Login or register to access this feature

Information for

Open access

Opportunities

Help and information

Keep up to date

Table 3. MPS estimates, standard-error and $95 %$ CI for $ϕ, λ$ and $α$

Table 4. January average flows (m $^{3}$ /s) of the Cantareira system

Table 5. Results of AIC and AICc criteria and the p-value from the KS test for all fitted distributions considering the data-set related to the january average flows (m $^{3}$ /s) of the Cantareira system

Table 6. ML estimates, standard-error and $95 %$ CI for $ϕ, λ$ and $α$