1,986
Views
3
CrossRef citations to date
0
Altmetric
STATISTICS

A New Exponentiated Generalized Linear Exponential Distribution: Properties and Application

&
Article: 1953233 | Received 14 Mar 2021, Accepted 04 Jul 2021, Published online: 29 Jul 2021

Abstract

A new exponentiated generalized linear exponential distribution (NEGLED) is introduced, which poses increasing, decreasing, bathtub-shaped, and constant hazard rate. Its various mathematical properties such as moments, quantiles, order statistics, hazard rate function (HRF), stress–strength parameter, etc. are derived. Five distributions, exponential distribution (ED), generalized linear exponential distribution (GLED), Rayleigh distribution (RD), Weibull distribution (WD), and generalized linear failure rate distribution (GLFRD) were used for comparison with NEGLED model using a dataset of blood cancer patients. Estimation of the parameters using the maximum likelihood estimation (MLE) method was obtained and to evaluate their performance a simulation study has been carried out. Finally, a dataset of 40 Leukemia patients was analysed for illustrative purpose proving that the NEGLED outperforms compared distributions.

PUBLIC INTEREST STATEMENT

Understanding the need for complex data in all sciences fields, the extension of existing distributions is necessary and timely. We have introduced a new probability distribution known as New Exponentiated Generalized Linear Exponential distribution. This distribution extends the Generalized Linear Exponential Distribution, which accommodates increasing, decreasing, bathtub shaped, and constant hazard rate. Proposed distribution models appear superior than Exponential, Weibull, Rayleigh, Generalized Linear Exponential, and Generalized Failure Rate distributions for the Leukemia dataset.

1. Introduction

Distribution theory plays a vital role in modelling lifetime data not only in life insurance but also in various fields like reliability, queuing theory, and other related areas. For illustration, the standard distributions including Normal, Gamma, and Weibull distributions have attracted wide attention among scientists and attracted very important applications in every branch of science, engineering, technology, demography, etc. These conventional distributions may not provide a satisfactory fit to the real datasets in some cases. Non-monotone hazard rate, for example, cannot be modelled using the above distributions. The Normal distribution has only increasing hazard rate while the Weibull and Gamma distributions show the increasing, decreasing, and constant hazard rate. Therefore, the distributions are modified or extended in the literature for further use. In this article, we use the same method to generate NEGLED, which was used by Gupta et al. (Citation1998) to introduce exponentiated exponential distribution.

Suppose α is any positive constant and Y be a random variable with distribution function FY(y). Then (FY(y))α, α>0 is known as exponentiated distribution and FY(y) is baseline distribution. For instance,

FY(y)=1eθyeθy+1α;θ,y>0,

is known as exponentiated Teissier distribution (ETD), see Sharma et al. (Citation2020). In recent time, many distributions are extended to the class of the exponentiated distributions. Sharma et al. (Citation2020) introduced ETD, this new generation made the Teissier distribution compatible with increasing, decreasing and bathtub shape hazard rate. Exponentiated Weibull distribution (EWD) was introduced by Mudholkar and Srivastava (Citation1993) to make WD compatible with non-monotone hazard rates. Later, Nassar and Eissa (Citation2003) again studied the EWD and gave some new statistical measures. Louzada et al. (Citation2014) proposed the exponentiated generalized Gamma distribution. S. Lee and Kim (Citation2019) introduced exponentiated generalized Pareto distribution. For other families of the exponentiated distributions and related studies, see Agarwal et al. (Citation2020), Biçer (Citation2019), C.-S. Lee and Tsai (Citation2017), De Andrade and Zea (Citation2018), Elbatal et al. (Citation2013), Handique et al. (Citation2019), Ghosh et al. (Citation2019), Louzada et al. (Citation2014), Mahmoud and Alam (Citation2010), Okasha and Kayid (Citation2016), Rajchakit et al. (Citation2021), Sarhan et al. (Citation2013), Shakhatreh et al. (Citation2016) Tian et al. (Citation2014a), Tian et al. (Citation2014b), and Wu et al. (Citation2021).

The linear exponential distribution (LED) constitutes the constant or increasing hazard rate shape and decreasing or unimodal density function (Sarhan and Kundu, Citation2009), which is unable to model the phenomenon with non-monotone, decreasing, and bathtub shape hazard rates. Bathtub-shaped hazard rates are very common in reliability studies and researchers use them extensively. Mahmoud and Alam (Citation2010) generalized LED to make it compatible with decreasing, increasing and bathtub shaped hazard rate and denoted by GLED(α,β,γ,λ). The cumulative distribution function of GLED is given by

FY(y)=1e(α2y2+βyγ)λI(Λ,)(y);α,λ>0,β,γ0,

where I(Λ,)(y)=1 if y>Λ, 0, otherwise and Λ=β+β2+2αγα. Further, Tian et al. (Citation2014a) gave new generalization of LED. Meanwhile, GLED does not provide reasonable fit to modelling phenomenon with bimodal density and constant hazard rate. We have introduced a five parameter NEGLED model, which is generalization of GLED (Mahmoud and Alam, Citation2010). The proposed model in this study provides increasing, decreasing, constant and bathtub shape hazard rate. It has right-skewed, unimodal and bimodal density function. Also, NEGLED includes LED, generalized exponential distribution (GED) (Gupta and Kundu, Citation1999), GLED (Sarhan and Kundu, Citation2009, Mahmoud and Alam, Citation2010; and Tian et al., Citation2014a), generalized Rayleigh distribution (GRD) (Kundu and Raqab, Citation2005), exponentiated generalized linear exponential distribution (EGLED) (Sarhan et al., Citation2013), and many other well-known distributions as sub-models which are extensively used in modelling lifetime datasets. It is easy to discuss various statistical properties of the GED, GLED, GRD, etc. on a single platform through NEGLED. Due to flexibility of NEGLED model, one can anticipate its application in different areas of research.

In Section 2, we have introduced the proposed distribution and some of its reliability expressions such as survival function, HRF, and reversed HRF are derived. Section 3 provides the statistical characteristics, i.e. raw moments, quantiles, and order statistics of the newly proposed distribution. The stress–strength parameter that measures the reliability of the component has been discussed for NEGLED in the same section. In Section 4, MLE of unknown parameters for the proposed distribution has been derived, and their performance was evaluated using simulation. For simulation, different sample sizes have been considered. A real-life application is also discussed in Section 5.

2. The NEGLED

The probability density function (PDF) of NEGLED with parameter vector =(θ1,θ2,θ3,θ4,λ) is given by

f(x;)=λθ4(θ1+θ2x)θ1x+θ22x2θ3θ41eθ1x+θ22x2θ3θ4
(2.1) ×1eθ1x+θ22x2θ3θ4λ1,x>δ,(2.1)

where θ2,θ4,λ>0; θ1,θ30 and δ=θ1+θ12+2θ2θ3θ2. The θ1 and θ2 are the scale parameters, θ4 is the shape parameter, and λ is the exponentiation parameter. The nature of the truncation parameter δ depends on θ3, δ>0(=0) if θ30(=0). The PDF defined in (2.1) can also be written in simplified manner as

f(x;)=λθ4ξ (x)ξθ41(x)eξθ4(x)1eξθ4(x)λ1,x>δ,

where ξ(x):=ξ(x;θ1,θ2,θ3)=θ1x+θ22x2θ3 and ξ (x):=θ1+θ2x.

The corresponding cumulative distribution function (CDF) of NEGLED is expressed in the following form

(2.2) F(x;)=1eθ1x+θ22x2θ3θ4λ,x>δ.(2.2)

It may be notice that if λN (the set of natural numbers), then (2.2) denotes the CDF of the largest order statistic having a random sample of size λ from GLED(Ω), where Ω=(θ1,θ2,θ3,θ4). As a result, NEGLED() can be used to describe a parallel system with λ components, each of which is distributed independently as GLED(Ω). In actuarial science, it can also be considered as the distribution function of independently distributed λ insured that has GLED(Ω) as individual distribution. The proposed distribution includes several known distributions as special cases, some of the most widely used distribution are shown in .

Table 1. Some well-known distributions derived from NEGLED()

The survival function, S(x), HRF, h(x), and the reversed HRF, r(x) for NEGLED () is given by (2.3), (2.4), and (2.5), respectively

(2.3) S(x;)=F(x;)=1F(x;)=11eξθ4(x)λ,x>δ,(2.3)
(2.4) h(x;)=f(x;)F(x;)=λθ4ξ (x)ξθ41(x)eξθ4(x)1eξθ4(x)λ111eξθ4(x)λ,x>δ,(2.4)

and

(2.5) r(x;)=f(x;)F(x;)=λθ4ξ (x)ξθ41(x)eξθ4(x)1eξθ4(x)λ11eξθ4(x)λ,x>δ.(2.5)

It is immediate from the that the density of NEGLED can be decreasing, decreasing-increasing type, unimodal or bimodal depending upon the different values of the parameters.

Figure 1. Density plots of NEGLED model at different values of for λ>1

Figure 1. Density plots of NEGLED model at different values of ∇ for λ>1

Figure 2. Density plots of NEGLED model at different values of for λ<1

Figure 2. Density plots of NEGLED model at different values of ∇ for λ<1

Figure 3. Density plots of NEGLED model at different values of

Figure 3. Density plots of NEGLED model at different values of ∇

Figure 4. Density plots of NEGLED model at different values of

Figure 4. Density plots of NEGLED model at different values of ∇

From the , we observe the hazard rate of NEGLED model can be decreasing, increasing, constant or bathtub shaped. Therefore, NEGLED can be used to model the phenomena with constant or non-monotone hazard rates.

Figure 5. The HRFs of NEGLED model at different values of for λ>1

Figure 5. The HRFs of NEGLED model at different values of ∇ for λ>1

Figure 6. The HRFs of NEGLED model at values of different for λ<1

Figure 6. The HRFs of NEGLED model at values of different ∇ for λ<1

Figure 7. The HRFs of NEGLED model at values of different

Figure 7. The HRFs of NEGLED model at values of different ∇

It can easily be shown that through specific parametric substitution, one can get the HRF for ED, RD, GRD, LED, and WD from (2.4). Since GLED is a sub-model of NEGLED() that has the same characteristics of increasing, bathtub-shaped or constant among others HRF for specific values of the parameters. Therefore, in dealing with a diverse range of hazard functions to analyse lifetime data, the NEGLED model demonstrates greater versatility than current literature models.

3. Statistical properties of NEGLED

In this section, we discuss various statistical characteristics of proposed NEGLED model. First, we begin with quantile function as well as random sample generation from NEGLED model. Later on, moments, stress–strength parameters, and order statistics will be discussed consecutively.

3.1. Quantile function and random sample generation

The quantile function which represents the inverse of CDF is given by

Q(q)=F1(q).

Mathematically, the quantile function of NEGLED() can be written as

(3.1) Q(q)=θ1+θ12+2θ2θ3+ln(1q1λ)1θ4θ2,(3.1)

where q Uniform(0,1). The median of NEGLED model can be derived by putting q=0.5 in EquationEquation (3.1). To generate random sample from NEGLED() model, one can generate it by using the quantile function given in EquationEquation (3.1) and a random sample from Uniform(0, 1) distribution. For example, the random sample generation of size n from NEGLED, first generate a random sample u1,u2,,un (say) of size n from Uniform(0, 1) distribution. Now replace q with ui as given in the following formula

xi=θ1+θ12+2θ2θ3+ln(1ui1λ)1θ4θ2;i=1,2,,n.

3.2. Moments

In pragmatic sciences, moments are important tools for statistical analysis. It can be used to investigate a distribution’s most essential properties (e.g. tendency, dispersion, skewness, and kurtosis). The expression for the moment generating function (MGF), variance, and the rth moment of NEGLED model have been derived in this sub-section.

Theorem 3.1. For the random variable X with NEGLED(), i.e. XNEGLED(), the rth raw moment of X is given by

μr()=λi=0rj=0m=0riri2jλ1(1)i+mθ1i2j(θ12+2θ2θ3)ri2jθ2rj
×γjθ4+1,(m+1)θ12+2θ2θ32θ2θ4(m+1)jθ4+1+2ri2j(θ12+2θ2θ3)jθ2r+i2+j
(3.2) ×Γri2θ4jθ4+1,(m+1)θ12+2θ2θ32θ2θ4(m+1)ri2θ4jθ4+1.(3.2)

Proof. We have

μ r()=δxrf(x;)dx
=λθ4δxr(θ1+θ2x)θ1x+θ22x2θ3θ41eθ1x+θ22x2θ3θ4
×1eθ1x+θ22x2θ3θ4λ1dx.

Substituting θ1x+θ22x2θ3θ4=u, we get θ4(θ1+θ2x)θ1x+θ22x2θ3θ41dx=du.

Since

x=θ1+θ12+2θ2θ3+2θ2u1/θ4θ2,

we have

μr()=λ0θ1+θ12+2θ2θ3+2θ2u1/θ4θ2reu1euλ1du
=λ0i=0rri(θ1)i(θ12+2θ2θ3+2θ2u1/θ4)ri2θ2reu1euλ1du
=λ0i=0rri(θ1)iθ2r1+θ12+2θ2θ32θ2u1/θ4ri2(2θ2u1/θ4)ri2eu1euλ1du.

Suppose ζ=2θ2θ12+2θ2θ3, it is easy to confirm the below expressions

|ζu1/θ4|<1if0<u<ζθ4|(ζu1/θ4)1|<1ifζθ4<u<.

So, we have

μr()=λi=0rri(θ1)iθ2r0ζθ41+1ζu1/θ4ri2(2θ2u1/θ4)ri2eum=0λ1(1)memudu
+λi=0rri(θ1)iθ2rζθ41+1ζu1/θ4ri2(2θ2u1/θ4)ri2eum=0λ1(1)memudu
=λi=0rj=0m=0riri2jλ1(1)i+mθ1i2j(θ12+2θ2θ3)ri2jθ2rj
×γjθ4+1,(m+1)θ12+2θ2θ32θ2θ4(m+1)jθ4+1+2ri2j(θ12+2θ2θ3)jθ2r+i2+j
×Γri2θ4jθ4+1,(m+1)θ12+2θ2θ32θ2θ4(m+1)ri2θ4jθ4+1,

where Γ(s,t) and γ(s,t), respectively, denotes the upper incomplete gamma function and lower incomplete gamma function.□

Remark 3.1. The rth raw moment, with shape parameter 2θ4 and scale parameter 2θ2, of the Weibull distribution can be found by putting θ1=θ3=0 and λ=1 in (3.2).

Lemma 3.1. Let XNEGLED(). Then we have

(3.3) j=0ri=02jrj2ji(1)rj(θ12+2θ2θ3)rjθ1i2rθ2r2j+iμ2ji()=λm=0λ1(1)mΓrθ4+1(m+1)rθ4+1.(3.3)

Proof.

Eθ1X+θ22X2θ3r=Eξr(X)
=θ4λδξ (x)ξr+θ41(x)eξθ4(x)1eξθ4(x)λ1dx

Substituting u=ξθ4(x), on taking derivative we get du=θ4ξ (x)ξθ41(x)dx. So,

Eθ1X+θ22X2θ3r=0urθ4eu1euλ1du
(3.4) =λm=0λ1(1)mΓrθ4+1(m+1)rθ4+1.(3.4)

Using binomially expansion in the left hand side of (3.4), we have

Ej=0ri=02jrj2ji(1)rj(θ12+2θ2θ3)rjθ1i2rθ2r2j+iX2ji=λm=0λ1(1)mΓrθ4+1(m+1)rθ4+1
j=0ri=02jrj2ji(1)rj(θ12+2θ2θ3)rjθ1i2rθ2r2j+iμ2ji()=λm=0λ1(1)mΓrθ4+1(m+1)rθ4+1,

which proves the result.□

Theorem 3.2. The variance of NEGLED model is derived as follow

(3.5) Var(X)=σ2=2θ2λm=0λ1(1)mΓ(1θ4+1)(m+1)1θ4+1θ1μ+θ3θ22μ2,(3.5)

where μ is the mean of X, which can be derived by substituting r=1 in (3.2).

Proof. Simply placing r=1 in (3.3), we get

μ2()=2θ2λm=0λ1(1)mΓ1θ4+1(m+1)1θ4+1θ1μ+θ3,

where μ is the mean of the random variable X. Also, it is easy to get μ 2() on putting r=1 in (3.4) as follow

Eθ1X+θ22X2θ3=λm=0λ1(1)mΓ1θ4+1(m+1)1θ4+1
θ1μ+θ22μ2()θ3=λm=0λ1(1)mΓ1θ4+1(m+1)1θ4+1
μ2()=2θ2λm=0λ1(1)mΓ1θ4+1(m+1)1θ4+1θ1μ+θ3.

So, the variance of NEGLE() distribution is derived as follow

Var(X)=μ 2()μ2
=2θ2λm=0λ1(1)mΓ1θ4+1(m+1)1θ4+1θ1μ+θ3θ22μ2.

Theorem 3.3. Suppose X has a NEGLED() distribution, then the MGF, i.e. MX(s), of X is given by

MX(s)=λr=0i=0rj=0m=0riri2jλ1(1)i+mθ1itrr!2j(θ12+2θ2θ3)ri2jθ2rj
×γjθ4+1,(m+1)θ12+2θ2θ32θ2θ4(m+1)jθ4+1+2ri2j(θ12+2θ2θ3)jθ2r+i2+j
×Γri2θ4jθ4+1,(m+1)θ12+2θ2θ32θ2θ4(m+1)ri2θ4jθ4+1.

Proof. By definition of the MGF of X, we have

MX(s)=E[esX]
=Er=0(sX)rr!
=r=0srμ r()r!
=λr=0i=0rj=0m=0riri2jλ1(1)i+mθ1itrr!2j(θ12+2θ2θ3)ri2jθ2rj
×γjθ4+1,(m+1)θ12+2θ2θ32θ2θ4(m+1)jθ4+1+2ri2j(θ12+2θ2θ3)jθ2r+i2+j
×Γri2θ4jθ4+1,(m+1)θ12+2θ2θ32θ2θ4(m+1)ri2θ4jθ4+1.

3.3. Stress–strength parameter

Let random stress and strength of the component are denoted by X and Y, respectively. Then R=P(Y<X) is known as the stress–strength parameter, which describes the measure of component’s reliability. Let XNEGLED(θ1,θ2,θ3,θ4,λ1) and YNEGLED(θ1,θ2,θ3,θ4,λ2) be two independent random variables. Then

(3.6) R=P(Y<X)=λ1λ1+λ2.(3.6)

Proof.

P(YX)=δδxf(y;θ1,θ2,θ3,θ4,λ2)dyf(x;θ1,θ2,θ3,θ4,λ1)dx
=λ1λ2θ42δξ (x)ξθ41(x)eξθ4(x)1eξθ4(x)λ11
×δxξ (y)ξθ41(y)eξθ4(y)1eξθ4(y)λ21dydx.

Consider the transformation u=eξθ4(y), which implies that du=θ4ξ (y)ξθ41(y)eξθ4(y)dy. So

P(YX)=λ1λ2θ4δξ(x)ξθ41(x)eξθ4(x)1eξθ4(x)λ11eξθ4(x)1(1u)λ21dudx
=λ1θ4δξ (x)ξθ41(x)eξθ4(x)1eξθ4(x)λ1+λ21dx.

Now, take s=1eξθ4(x), we get ds=θ4ξ (x)ξθ4(x)ξ(x)eξθ4(x)dx. So

P(Y<X)=λ101(1s)λ1+λ21ds
=λ1λ1+λ2.

3.4. Order statistics

Let X(1)X(2)X(m) denote the order statistics of the random sample X1,X2,,Xm from NEGLED() model. Then, using the standard formula of the PDF of kth order statistics see Arnold et al. (Citation1992)), the PDF of the kth order statistic X(k) is

fX(k)(x;)=m!(k1)!(mk)!λθ4ξ(x)ξθ41(x)eξθ4(x)1eξθ4(x)λk1
×11eξθ4(x)λmk;x>δ.

Thus, the PDF of X(1) (the smallest order statistic) is

fX(1)(x;)=mθ4λξ(x)ξθ41(x)eξθ4(x)1eξθ4(x)λ111eξθ4(x)λm1,xδ,

and the PDF of X(m) (the largest order statistic) is

fX(m)(x;)=mθ4λξ (x)ξθ41(x)eξθ4(x)1eξθ4(x)mλ1,x>δ.

Theorem 3.4. Let XkNEGLED(θ1,θ2,θ3,θ4,λk) for 1k be independent r.v.’s. Then X(m)NEGLED(θ1,θ2,θ3,θ4,k=1mλk).

The proof of Theorem 3.4 is simple and hence omitted.

The joint PDF of X(k) and X(l) is now calculated using the standard formula of the joint PDF of two order statistics (see Arnold et al., Citation1992) as

fX(k)X(l)(xk,xl;)=Mθ42λ2ξ(xk)ξ(xl)(ξ(xk)ξ(xl))θ41eξ4θ(xk)ξ4θ(xl)1eξθ4(xk)λk1
×1eξθ4(xl)λ11eξθ4(xl)λ1eξθ4(xk)λlk111eξθ4(xk)λml,

where M=m!(k1)!(lk1)!(ml)!. Then, for NEGLED, the joint density of X(1) and X(m) becomes of the form

fX(1)X(m)(x1,xm;)=m(m1)θ42λ2ξ(x1)ξ(xm)(ξ(x1)ξ(xm))θ41eξ4θ(x1)ξ4θ(xm)
×1eξθ4(x1)1eξθ4(xm)λ11eξθ4(xm)λ1eξθ4(x1)λm2.

4. Statistcal inference

Now, we discuss the estimation of the model parameters by using the method of maximum likelihood estimation. Let draw a random sample of size m, i.e. x=(x1,x2,,xm), from NEGLED(). The likelihood function L(;x) for is given by

L(;x)=k=1mf(xk;)=θ4mλmk=1mξ (xk)ξθ41(xk)eξ4θ(xk)1eξ4θ(xk)λ1

and corresponding log-likelihood functions l(;x) of above equation is

l(;x)=mlnθ4+mlnλ+k=1mlnξ (xk)+(θ41)k=1mlnξ(xk)k=1mξ4θ(xk)
+(λ1)k=1mln1eξ4θ(xk).

First, differentiate the log-likelihood function with respect to unknown parameters and equate it to 0. The normal equations are given as

l(;x)θ1=k=1m1θ2xk+θ1+2(θ41)k=1mxkθ2xk2+2θ1xk2θ3θ42θ41×
(4.1) k=1mxk(θ2xk2+2θ1xk2θ3)θ411λeθ22xk2+θ1xkθ34θ1eθ22xk2+θ1xkθ34θ=0,(4.1)
l(;x)θ2=k=1mxkθ2xk+θ1+(θ41)k=1mxk2θ2xk2+2θ1xk2θ3θ42θ4×
(4.2) k=1mxk2(θ2xk2+2θ1xk2θ3)θ411λeθ22xk2+θ1xkθ34θ1eθ22xk2+θ1xkθ34θ=0,(4.2)
l(;x)θ3=2(θ41)k=1m1θ2xk2+2θ1xk2θ3+θ42θ41×
(4.3) k=1m(θ2xk2+2θ1xk2θ3)θ411λeθ22xk2+θ1xkθ34θ1eθ22xk2+θ1xkθ34θ=0,(4.3)
l(;x)θ4=mθ4+k=1mlnθ22xk2+θ1xkθ312θ4k=1m(θ2xk2+2θ1xk2θ3)θ4
(4.4) ×lnθ22xk2+θ1xkθ31λeθ22xk2+θ1xkθ34θ1eθ22xk2+θ1xkθ34θ=0,(4.4)
(4.5) l(;x)λ=mλ+k=1mln1eθ22xk2+θ1xkθ34θ=0.(4.5)

Since the MLEs of NEGLED() cannot be obtained in a closed form, one can use iterative procedures like Newton–Raphson method to compute them. It would be impossible to determine the exact distributions of the MLE’s of the parameters due to lack of closed form solution.

The solution to the aforementioned non-linear EquationEquations (4.1)–(Equation4.5) are determined using simulation in R software, assuming asymptotic distribution based on large sample approximations. In our case, NEGLED() asymptotically follows N5(,B(ˆ)). Where ˆ=(θˆ1,θˆ2,θˆ3,θˆ4,λˆ) is the vector of MLE’s and denotes the mean vector meanwhile B(ˆ) denotes the dispersion matrix. Particularly, based on a sample of size m, as m, we have (ˆ)dN5(0,B(ˆ)), where 0=(0,0,0,0,0)  and B(ˆ)=W1(ˆ), the inverse of the observed information matrix W(ˆ)=(akp) for 1k,p5. Suppose (θ1,θ2,θ3,θ4,λ)=(ϑ1,ϑ2,ϑ3,ϑ4,ϑ5). Then, for 1k<p5, we obtain akp=2l(x;ˆ)ϑˆkϑˆp, akk=2l(x;ˆ)ϑˆk2 and akp=apk. Also, Cov(ϑˆk,ϑˆp)=dkp, Var(ϑˆk)=dkk and dkp=dpk, where B(ˆ)=(dkp). Thus, from normal EquationEquations (4.1)–(Equation4.5), we can calculate the elements of B(ˆ). The quantity ϑˆk±zα/2SE(ϑˆk) represents the 100(1α)% confidence interval of ϑk, where zα/2 denotes the upper α/2-th percentile of the standard normal distribution and α denotes the level of significance.

To study various properties of MLE, the estimates of parameters of NEGLED model are derived using simulation. Samples of size 20(20)100 are considered with iteration of 10,000 from NEGLED(5.2,2.5,7.8,1.6,0.7) using optim command in R software. The bias, standard error, and coverage length (length of 95% confidence interval) for the MLE of each parameter are evaluated for each case. Findings are presented in the .

Table 2. Bias, Standard error (SE), and Coverage length (CL) for MLEs from NEGLED model

We noticed that perhaps the standard errors, coverage lengths, and absolute biases of each of the θˆ1, θˆ2, θˆ3, θˆ4 and λˆ MLE’s decrease as increasing the size of the sample, see . This indicates that the MLE method provides good estimates of the parameters for the NEGLED model.

Further, we estimate the stress–strength parameter i.e. R. In the context of the reliability of a system, it is very important to study the system performance referred to as the stress–strength parameter. The system will only survive if the applied stress is lesser than the strength. In practise, a good design is one in which the strength is always greater than the expected stress. In the statistical sciences, inferring the stress–strength parameter from a complete or censored sample has piqued the interest of many scientists over years, and the challenge of estimating R under various scenarios has been extensively researched. Many research on the inference of stress–strength parameter R from various perspective have recently been published in the literature. For example, half logistic distribution (Ratnam et al., Citation2000), Burr type X distribution (Kim et al., Citation2000), and normal distribution (Guo and Krishnamoorthy (Citation2004), Barbiero (Citation2011)).

Let we draw two independent random samples, i.e. x=(x1,,xm) and y=(y1,,yn), from NEGLED(θ1,θ2,θ3,θ4,λ1) and NEGLED(θ1,θ2,θ3,θ4,λ2) of sizes m and n respectively. Then the log-likelihood function l(Υ;x,y) of Υ=(θ1,θ2,θ3,θ4,λ1,λ2) is

l(Υ;x,y)=(m+n)ln(θ4λ)+k=1mlnξ (xk)+(θ41)k=1mlnξ(xk)k=1mξ4θ(xk)
+(λ11)k=1mln1eξ4θ(xk)+k=1nlnξ (yk)+(θ41)k=1nlnξ(yk)
k=1nξ4θ(yk)+(λ21)k=1nln1eξ4θ(yk).

EquationEquations (4.6)–(Equation4.11) below are the normal equations for the log-likelihood function l(Υ;x,y).

l(Υ;x,y)θ1=k=1m1θ2xk+θ1+2(θ41)k=1mxkθ2xk2+2θ1xk2θ3
θ42θ41k=1mxk(θ2xk2+2θ1xk2θ3)θ411λ1eθ22xk2+θ1xkθ34θ1eθ22xk2+θ1xkθ34θ
+k=1n1θ2yk+θ1+2(θ41)k=1nykθ2yk2+2θ1yk2θ3θ42θ41
(4.6) ×k=1nyk(θ2yk2+2θ1yk2θ3)θ411λ2eθ22yk2+θ1ykθ34θ1eθ22yk2+θ1ykθ34θ=0,(4.6)
l(Υ;x,y)θ2=k=1mxkθ2xk+θ1+(θ41)k=1mxk2θ2xk2+2θ1xk2θ3
θ42θ4k=1mxk2(θ2xk2+2θ1xk2θ3)θ411λ1eθ22xk2+θ1xkθ34θ1eθ22xk2+θ1xkθ34θ
+k=1nykθ2yk+θ1+(θ41)k=1nyk2θ2yk2+2θ1yk2θ3θ42θ4
(4.7) ×k=1nyk2(θ2yk2+2θ1yk2θ3)θ411λ2eθ22yk2+θ1ykθ34θ1eθ22yk2+θ1ykθ34θ=0,(4.7)
l(Υ;x,y)θ3=2(θ41)k=1m1θ2xk2+2θ1xk2θ3
+θ42θ41k=1m(θ2xk2+2θ1xk2θ3)θ411λ1eθ22xk2+θ1xkθ34θ1eθ22xk2+θ1xkθ34θ
2(θ41)k=1n1θ2yk2+2θ1yk2θ3+θ42θ41k=1n(θ2yk2+2θ1yk2θ3)θ41
(4.8) ×1λ2eθ22yk2+θ1ykθ34θ1eθ22yk2+θ1ykθ34θ=0,(4.8)
l(Υ;x,y)θ4=m+nθ4+k=1mlnθ22xk2+θ1xkθ3+k=1nlnθ22yk2+θ1ykθ3
12θ4k=1m(θ2xk2+2θ1xk2θ3)θ4lnθ22xk2+θ1xkθ3
×1λ1eθ22xk2+θ1xkθ34θ1eθ22xk2+θ1xkθ34θ12θ4k=1n(θ2yk2+2θ1yk2θ3)θ4
(4.9) ×lnθ22yk2+θ1ykθ31λ2eθ22yk2+θ1ykθ34θ1eθ22yk2+θ1ykθ34θ=0,(4.9)
(4.10) l(Υ;x,y)λ1=mλ1+k=1mln1eθ22xk2+θ1xkθ34θ=0,(4.10)
(4.11) l(Υ;x,y)λ2=nλ2+k=1nln1eθ22yk2+θ1ykθ34θ=0.(4.11)

The non-linear EquationEquations (4.6)–(Equation4.11) can also be solved using iterative procedures like discuss earlier. Thus the MLE of stress–strength parameter i.e. R is

Rˆ=λˆ1λˆ1+λˆ2,

where the MLEs of λ1 and λ2 are denoted by λˆ1 and λˆ2, respectively. The general scenario when different multiple parameters of NEGLED model are considered, in such cases, we can compute R but to obtain a closed form is difficult.

5. Statistical data analysis

In the present section, to elucidate the application of NEGLED model, we considered a dataset of 40 patients suffering from Leukemia (a type of blood cancer). Also, the log-likelihood values, Akaike information criterion (AIC) values, log-likelihood ratio (LR) test statistic, and Kolmogorov–Smirnov (KS) test statistic are calculated. These values will help to test the goodness-of-fit of the NEGLED model compare to more familiar distribution models, namely GLFRD, GLED, RD, WD, and ED. At last, a graphical representation provides the empirical and estimated survival functions of the NEGLED, GLFRD, GLED, RD, WD, and ED models for Leukemia dataset.

shows the dataset of the lifetime (in days) of 40 patients suffering from Leukemia from one of the Ministry of Health Hospitals in Saudi Arabia, studied by Abouammoh et al. (Citation1994). Taking into account the computational ease, all the data points was divided by 100 in . Six distribution models GLFRD, GLED, RD, WD, ED along with NEGLED are considered for fitting the dataset. To implement the LR test GLFRD, GLED, RD, WD, and ED have been considered as the null distributions, meanwhile, the NEGLED model has been taken as the alternative distribution. Furthermore, let H = 0 and H = 1 denotes the rejection and acceptance of the null hypotheses respectively. presents the MLEs of the parameters, KS measurements and associated p-values for the Leukemia dataset. And furnishes the AIC values, log-likelihood values, H values and LR test statistic for the compared distribution models. Additionally, gives the simple quartile summary of Leukemia data along with quartile summary based on NEGLED model.

Table 3. A dataset of lifetimes (in days) for 40 patients suffering from leukemia type blood cancer

Table 4. The MLEs of the parameters, KS measurements and associated p-values for the leukemia data

Table 5. Information criteria for the leukemia data

Table 6. Quartile summary of the leukemia dataset

The approximate 95% confidence intervals for θ1,θ2,θ3,θ4, and λ are (0.2594, 0.2877), (0.2703, 0.2768), (0.1638, 0.3833), (−1.0103, 1.5575), and (−0.1156, 0.6628), respectively. The observed Fisher information matrix for Leukemia data under NEGLED is given by

B(ˆ)=76573.8109626317.71665803.845268.92993615.4586626317.71665633446.524934581.3303645.857416481.08735803.845234581.33031500.634837.1054769.37668.9299645.857537.10542.109929.26603615.458616481.0873769.376029.2660534.3603.

In , KS test statistic values accompanying their p-values are shown for various modelling distributions. Again from the , for all distribution models except the ED, the p-values corresponding to the KS test statistics are higher than α=0.05 level of significance. It is therefore clearly evident that at 5% level of significance we reject ED and none of the five models GLFRD, GLED, RD, WD, and NEGLED are rejected at the considerable level of significance. The NEGLED model is best model in the sense that it has the largest p-value among all the used models here to fit the Leukemia dataset.

Comparing AIC values from , we mention that the NEGLED model has the smallest AIC value among all the considered distribution models. Therefore, the NEGLED model is chosen as the model with the best fit among all the distributions considered. Moreover, provides the proof in the support of NEGLED model for the given dataset compare to the all considered models. As we can see the theoretical reliability function of the NEGLED model is better fitted to empirical reliability function.

Figure 8. The empirical and estimated survival functions of the NEGLE, GLFRD, GLED, RD, WD, and ED models focused on the data in

Figure 8. The empirical and estimated survival functions of the NEGLE, GLFRD, GLED, RD, WD, and ED models focused on the data in Table 3

The log-likelihood value of NEGLED model is largest compare to the considered models which indicates the best fit of NEGLED to the given dataset, see . At 5% significance level the χ0.052 critical values for 1, 2, 3, and 4 d.f. are 3.841, 5.991, 7.815, and 9.488, respectively. Next, the LR test statistic for all the models are greater than χ0.052 critical values for corresponding d.f., see . Consequently, at 5% level of significance, we reject all the null hypotheses i.e. GLFRD, GLED, RD, WD, and ED. Considering all the above results, we may conclude that the NEGLED model is superior competitor for lifetime datasets than the ED, RD, WD, GLFRD, and GLED models.

6. Conclusion

In this article, a new distribution named NEGLED has been introduced which generalizes the GLED model studied by Mahmoud and Alam (Citation2010) and several other well-known distributions. We investigated some statistical properties of the proposed distribution like HRF, quantile function, random sample generation, moments, stress–strength parameter, and order relations. To illustrate the MLEs behavior with increasing sample size, MLE and inference for the NEGLED model using simulation were obtained. It was found that the MLE method provides good estimates for the NEGLED model. At last, using the proposed distribution and some well-known distributions, a real-life dataset is fitted. It was found that, compared to the other distributions (ED, RD, WD, GLED, and GLFRD), the NEGLED model offers a better fit to the Leukemia dataset. Therefore, accounting the flexibility of PDF and HRF, the NEGLED model can be utilized as an effective model for lifetime data applications.

Acknowledgements

We are very thankful to the Editorial board and the reviewers for their valuable comments and suggestions which helped to improve the manuscript significantly.

Disclosure statement

No potential conflict of interest was reported by the authors.

Additional information

Funding

This work was supported by the No direct funding.

Notes on contributors

Neeraj Poonia

Neeraj Poonia obtained his M.Sc. in Statistics (2017) from the Central University of Punjab, Bathinda, India. He is currently pursuing his Ph.D. in Statistics in the School of Basic Sciences at the Indian Institute of Technology Mandi, Himachal Pradesh, India. His research interest includes probability distribution theory and applied statistics.

Sarita Azad

Sarita Azad obtained her Ph.D. in Mathematics (2008) from Delhi University and the Indian Institute of Science, India. She is currently working as an assistant professor in the School of Basic Sciences at the Indian Institute of Technology Mandi, Himachal Pradesh, India. Her area of research includes climate change modelling, statistical data analysis, time series analysis and forecasting, and distribution theory.

Refereces