460
Views
0
CrossRef citations to date
0
Altmetric
Articles

On some aspects of a bivariate alternative zero-inflated logarithmic series distribution

ORCID Icon &
Pages 130-143 | Received 10 Feb 2022, Accepted 07 Feb 2023, Published online: 04 Mar 2023

Abstract

In this paper, we discuss some important aspects of the bivariate alternative zero-inflated logarithmic series distribution (BAZILSD) of which the marginals are the alternative zero-inflated logarithmic series distributions of Kumar and Riyaz (2015. An alternative version of zero-inflated logarithmic series distribution and some of its applications. Journal of Statistical Computation and Simulation, 85(6), 1117–1127). We study some important properties of the distribution by deriving expressions for its probability mass function, factorial moments, conditional probability generating functions, and recursion formulae for its probabilities, raw moments and factorial moments. The parameters of the BAZILSD are estimated by the method of maximum likelihood and certain test procedures are also considered. Further certain real-life data applications are cited for illustrating the usefulness of the model. A simulation study is conducted for assessing the performance of the maximum likelihood estimators of the parameters of the BAZILSD.

1. Introduction

Bivariate discrete distributions have received much attention in the literature. For example, see Ghosh and Balakrishnan (Citation2015), Hassan and El-Bassiouni (Citation2013), Kemp (Citation2013), Kumar (Citation2008), Kocherlakota and Kocherlakota (Citation1992) and references therein. Due to the extensive applications of logarithmic series distribution in various areas of scientific research especially in biology, ecology, meteorology, etc., the bivariate logarithmic series distribution (BLSD) is of particular interest. Chapter 7 of Kocherlakota and Kocherlakota (Citation1992) is fully devoted to the BLSD. Subrahmaniam (Citation1966) defined the BLSD through the following probability generating function (pgf) (1.1) A(t1,t2)=log(1θ1t1θ2t2θ3t1t2)log(1θ1θ2θ3),(1.1) in which θ1>0, θ2>0 and θ30 such that θ1+θ2+θ3<1. An important drawback of the BLSD in practical point of view is that it excludes the (0, 0)-th observation from its support. To overcome this difficulty, Kumar and Riyaz (Citation2014) considered a class of bivariate distribution namely the ‘bivariate zero-inflated logarithmic series distribution (BZILSD)’ through the following probability mass function (pmf), for any non-negative integers m and n, θ1>0, θ2>0 and θ30 such that θ1+θ2+θ31. (1.2) f(m,n)=δθ1mθ2nr=0min(m,n)Drr!(mr)!(nr)!(θ3θ1θ2)r,(1.2) where Dr=Πj=0m+nr1(j+1)2(j+2) and δ=[F2,1(1,1;2;θ1+θ2+θ3)]1 in which F2,1(a,b;c;z) is the Gauss hypergeometric function (cf. Mathai & Haubold, Citation2008).

Kumar and Riyaz (Citation2013) considered the zero-inflated logarithmic series distribution (ZILSD) through the following pgf, in which A=θ[ln(1θ)]1 with θ(0,1). (1.3) G1(t)=AF2,1(1,1;2;θt),(1.3) or equivalently, (1.4) G1(t)=At1ln(1θt).(1.4) Kumar and Riyaz (Citation2015) considered another zero-inflated logarithmic series distribution, which they termed as ‘the alternative zero-inflated logarithmic series distribution (AZILSD)’, through the following pmf, for x=0,1,2,, (1.5) g(x)=BF2,1(1+x,1+x;2+x;α)θx1+x,(1.5) in which B=[ln(1θα)]1(θ+α), θ>0, α1 and |θ+α|<1 such that θα. The pgf of the AZILSD with pmf (1.5) is (1.6) G2(t)=BF2,1(1,1;2;θt+α)(1.6) or equivalently, (1.7) G2(t)=Bln(1θtα)(θt+α).(1.7) Kumar and Riyaz (Citation2017) studied an extended version of AZILSD and its important properties. Kumar and Riyaz (Citation2016) considered an order k version of AZILSD and studied its important applications.

Through this paper, we consider a bivariate version of the AZILSD through the name ‘the bivariate alternative zero-inflated logarithmic series distribution’ or, in short ‘the BAZILSD’, and discuss some of its important aspects. In Section 2, we derive the BAZILSD as a bivariate random sum distribution of independent and identically distributed bivariate Bernoulli random variables and show that the marginal distributions of the BAZILSD are AZILSD. We obtain expressions for its pmf, mean, covariance, factorial moments and conditional pgfs which are included in Section 2. In Section 3, we derive certain recursion formulae for probabilities, raw moments and factorial moments of the BAZILSD. In Section 4, we describe the estimation of the parameters of the BAZILSD by method of maximum likelihood and certain test procedures are suggested. And in Section 5, we illustrate the usefulness of the BAZILSD through fitting the distribution to certain real-life data sets. In Section 6, a brief simulation study is conducted for examining the performance of the maximum likelihood estimators of the parameters of the BAZILSD.

It is important to note that the BAZILSD possesses a bivariate random sum structure as shown in Section 2. Certain bivariate random sum distributions are studied in the literature. For example, see Kumar (Citation2007, Citation2013). The random sum structure arises in several areas of scientific research particularly in actuarial science, agricultural science, biological science and physical science. Chapter 9 of Johnson et al. (Citation2005) fully devoted to univariate random sum distributions.

For simplicity in the notations, we adopt the following notations throughout in the manuscript. (1.8) Rj(θ)=F2,1(1+j,1+j;2+j;θ+α),(1.8) (1.9) Λ=R01(θ),(1.9) (1.10) hj=Rj(θ1t1+θ2t2+θ3t1t2+α),(1.10) (1.11) ψj=Rj(θ),(1.11) (1.12) βj=Rj(0),(1.12) (1.13) and H(t1,t2)=ΛR0(θ1t1+θ2t2+θ3t1t2+α).(1.13)

2. A genesis and some properties of the BAZILSD

First, we derive the BAZILSD in the following and discuss some of its properties.

Consider the sequence {Yn=(Y1n,Y2n);n1} of independent and identically distributed bivariate Bernoulli random vectors, each with pgf P(t1,t2)=λ1t1+λ2t2+λ3t1t2,in which λj=θjθ, j=1,2,3 with θ=θ1+θ2+θ3 such that θ1>0, θ2>0 and θ30. Let X be a non-negative integer valued random variable having AZILSD with pgf (1.6), in which θ=θ1+θ2+θ3. Assume that {Yn:n1} and X’s are independent. Define Sn=(S1n,S2n), for each n0 in which (S10,S20)=(0,0) and Srm=j=1mYrj, for r=1,2 and m1. Set SX=n=0SnI[X=n] where I[X=n] denotes the indicator function of an event [X=n]. Then the pgf of SX is (2.1) H(t1,t2)=G2{P(t1,t2)}=ΛF2,1(1,1;2;θ1t1+θ2t2+θ3t1t2+α),(2.1) where Λ is defined in (1.9).

We call a distribution with pgf (2.1) ‘the bivariate alternative zero-inflated logarithmic series distribution’ or, in short ‘the BAZILSD’. Clearly when α=0, the pgf given in (2.1) reduces to the following pgf of the BZILSD with pmf (1.2). (2.2) B(t1,t2)=F2,1(1,1;2;θ1t1+θ2t2+θ3t1t2)F2,1(1,1;2;θ1+θ2+θ3),(2.2) which shows that the proposed bivariate model of the AZILSD can be considered as a more flexible model in practical point of view compared to the BZILSD. Further, it can be noted that the marginals of the BAZILSD are AZILSD whereas the marginals of the BZILSD are not ZILSD.

Proposition 2.1.

If V=(V1,V2) follows the BAZILSD, then the marginal distribution of Vj for j=1,2 is AZILSD with pgf given below. HV1(t)=ΛF2,1[1,1;2;(θ1+θ3)t+θ2+α]and HV2(t)=ΛF2,1[1,1;2;(θ2+θ3)t+θ1+α].

The proof follows from the fact that HV1(t)=H(t,1) and HV2(t)=H(1,t).

Proposition 2.2.

The pgf of the conditional distribution of V1 given V2=v is the following: for any non-negative integer v, (2.3) HV1|V2(t)=(θ2+θ3tθ2+θ3)vF2,1(1+v,1+v;2+v;θ1t+α)F2,1(1+v,1+v;2+v;θ1+α).(2.3)

Proof:

For any non-negative integer v, assume that P(V2=v)>0. Now, we have the following partial derivatives of order (0,v) of H(t1,t2) with respect to t2 evaluated at (t1,t2)=(t,0). (2.4) H(0,v)(t,0)=Λ(θ2+θ3t)v(j=0v1Dj)Rv(θ1t),(2.4) where for j=0,1,2,, (2.5) Dj=(j+1)2(j+2)(2.5) and Rj(t) is defined in (1.8).

Now, applying the formula for the conditional pgf in terms of partial derivatives of the joint pgf developed by Subrahmaniam (Citation1966), we obtain the conditional pgf of V1 given V2=v as HV1|V2=v(t)=H(0,v)(t,0)H(0,v)(1,0)=(θ2+θ3t)v(θ2+θ3)vRv(θ1t)Rv(θ1),which implies (2.3) in the light of (1.8).

Remark 2.1.

The conditional distribution of V1 given V2=v as given in (2.3) can be written as HV1|V2(t)=HZ1(t)HZ2(t) where HZ1(t) is the pgf of a binomial random variable with parameters z1 and p=θ3θ2+θ3 and HZ2(t) is the pgf of a random variable following the AZILSD with parameters v, θ1 and α. Thus clearly, the conditional distribution V1 given V2=v is the distribution of the sum of two independent random variables Z1 and Z2.

By using Remark 2.1, we obtain the following proposition.

Proposition 2.3:

Let V=(V1,V2) follow the BAZILSD with pgf (2.1). Then (2.6) E(V1|V2=v)=vθ3(θ2+θ3)+θ1DvRv+1(θ1+α)Rv(θ1+α),(2.6) (2.7) Var(V1|V2=v)=vθ2θ3(θ2+θ3)2+θ1DvRv2(θ1+α)[Dv+1Rv(θ1+α)Rv+2(θ1+α)θ1+Rv(θ1+α)Rv+1(θ1+α)DvRv+12(θ1+α)θ1].(2.7)

Remark 2.2:

By a similar approach, for any non-negative integer v with P(V1=v)>0, we can obtain the conditional pgf of V2 given V1=v by interchanging θ1 and θ2 in (2.3). Therefore, it is evident that comments similar to those in Remark 2.1 are valid regarding conditional distribution of V2 given V1=v and the explicit expression for E(V2|V1=v) and Var(V2|V1=v) can be obtained by interchanging θ1 and θ2 in the right hand side expressions of (2.6) and (2.7) respectively.

Proposition 2.4 :

Let V=(V1,V2) follow the BAZILSD with pgf (2.1) and let m,n be any non-negative integers. The pmf f(m,n) and the (m,n)-th factorial moment μ[m,n] of the BAZILSD are (2.8) f(m,n)=Λθ1mθ2nr=0min(m,n)βm+nr(α)Drr!(mr)!(nr)!(θ3θ1θ2)r,(2.8) (2.9) μ[m,n]=Λm!n!(θ1+θ3)m(θ2+θ3)nr=0min(m,n)Drψm+nrr!(mr)!(nr)!ξr,(2.9) where Dr is defined in (1.2), for j=1,2,, ψj, βj(α)’s are defined in (1.11) and (1.12) and ξ=θ3(θ1+θ3)(θ2+θ3).

Proof :

In order to obtain the probability mass function of the BAZILSD, we need the following derivatives of H(t1,t2), in which m is a non-negative integer. (2.10) H(m,0)(t1,t2)=(i=0m1Di)(θ1+θ3t2)mΛhm(t1,t2),(2.10) where (2.11) hj(t1,t2)=F2,1(1+j,1+j;2+j;θ1t1+θ2t2+θ3t1t2+α),j=0,1,2,.(2.11) The following derivatives are needed in the sequel, in which 0ir and j1. (2.12) i(θ1+θ3t2)mt2i=m!(mi)!θ3i(θ1+θ3t2)mi,(2.12) (2.13) jhm(t1,t2)t2j=(i=mm+j1Di)(θ2+θ3t1)jhm+j(t1,t2).(2.13) Differentiating both sides of (2.10) n times with respect to t2 and applying (2.12) and (2.13), we get the following. (2.14) H(m,n)(t1,t2)=(i=0m1Di)Λr=0n(nr)r(θ1+θ3t2)mt2rnrhm(t1,t2)t2nr=(i=0m1Di)Λr=0min(m,n)(nr)m!(mr)!θ3r(θ1+θ3t2)mr×(i=mm+nr1Di)(θ2+θ3t1)nrhm+nr(t1,t2).(2.14) By putting (t1,t2)=(0,0) in (2.14) and by dividing m!n!, we get (2.8). By putting (t1,t2)=(1,1) in (2.14), we get (2.9).

Proposition 2.5 :

Let V=(V1,V2) follow the BAZILSD with pgf (2.1). Then we have the following, in which δj=ψjψ0, (2.15) E(V1)=D0δ1(θ1+θ3),(2.15) (2.16) E(V2)=D0δ1(θ2+θ3),(2.16) and (2.17) Cov(V1,V2)=D0(D1δ2D0δ12)(θ1+θ3)(θ2+θ3)+D0δ1θ3,(2.17) where D0 and D1 are given in (2.5).

The proof follows from (2.9) in the light of the relations: E(V1)=μ[1,0],E(V2)=μ[0,1]andCov(V1,V2)=μ[1,1]μ[1,0]μ[0,1].

Proposition 2.6.

Let V=(V1,V2) follow the BAZILSD with pgf (2.1). Then U=V1+V2 follows the modified AZILSD studied by Kumar and Riyaz (Citation2013).

The proof follows from the fact that the pgf of V1+V2 is HU(t)=H(t,t)=ΛF2,1[1,1;2;(θ1+θ2)t+θ3t2+α].

3. Recursion formulae

In this section, we develop certain recursion formulae for probabilities, raw moments and factorial moments. Let V=(V1,V2) be a random vector with pgf (2.1). For the sake of computational simplicity, we define u_+i=(1+i,1+i;2+i), for i=0,1,2,. Now we have the following from (2.1) in which f(m,n;u_)=P(V1=m,V2=n), for m,n0, (3.1) H(t1,t2)=m=0n=0f(m,n;u_)t1mt2n=ΛF2,1(1,1;2;θ1t1+θ2t2+θ3t1t2+α).(3.1) Now we obtain the following propositions.

Proposition 3.1

The probability mass function f(m,n;u_) of the BAZILSD satisfies the following recurrence formulae, in which δj is defined in Proposition 2.5. (3.2) (m+1)f(m+1,0;u_)=D0δ1θ1f(m,0;u_+1),m0,(3.2) (3.3) (m+1)f(m+1,n;u_)=D0δ1[θ1f(m,n;u_+1)+θ3f(m,n1;u_+1)],m0,n1,(3.3) (3.4) (n+1)f(0,n+1;u_)=D0δ1θ2f(0,n;u_+1),n0,(3.4) (3.5) (n+1)f(m,n+1;u_)=D0δ1[θ2f(m,n;u_+1)+θ3f(m1,n;u_+1)],m1,n0.(3.5)

Proof :

From (2.10) with m=1, we have the following. (3.6) H(1,0)(t1,t2)=ΛD0(θ1+θ3t2)h1(t1,t2).(3.6) On differentiating both sides of (3.1) with respect to t1, we have (3.7) H(1,0)(t1,t2)=m=0n=0mf(m,n;u_)t1m1t2n=m=0n=0(m+1)f(m+1,n;u_)t1mt2n.(3.7) From (3.1), we also have the following. (3.8) F2,1(2,2;3;θ1t1+θ2t2+θ3t1t2+α)=ψ1m=0n=0f(m,n;u_+1)t1mt2n.(3.8) Now by using (3.7) and (3.8) in (3.6) we get (3.9) m=0n=0(m+1)f(m+1,n;u_)t1mt2n=D0δ1[θ1m=0n=0f(m,n;u_+1)t1mt2n+θ3m=0n=0f(m,n;u_+1)t1mt2n+1].(3.9) On equating the coefficient of t1mt20 on both sides of (3.9), we get (3.2). By equating the coefficient of t1mt2n on both sides of (3.9), we get the relation (3.3). We omit the proof of relations (3.4) and (3.5) as it is similar to that of relations (3.2) and (3.3).

Proposition 3.2 :

Two recurrence formulae for the (m,n)-th raw moment μm,n(u_) of the BAZILSD are the following, for m,n0. (3.10) μm+1,n(u_)=D0δ1θ1j=0m(mj)μmj,n(u_+1)+D0δ1θ3j=0mk=0n(mj)(nk)μmj,nk(u_+1),(3.10) (3.11) μm+1,n+1(u_)=D0δ1θ2k=0n(nj)μm,nk(u_+1)+D0δ1θ3j=0mk=0n(mj)(nk)μmj,nk(u_+1).(3.11)

Proof :

The characteristic function φ(t1,t2) of the BAZILSD with pgf (2.1) is the following. For (t1,t2) in R2 and i=1, (3.12) φ(t1,t2)=H(eit1,eit2)=ΛF2,1[1;1;2;γ(t_;θ_)]=m=0n=0μm,n(u_)(it1)m(it2)nm!n!,(3.12) where γ(t_;θ_)=γ(t1,t2;θ1,θ2,θ3,α) =θ1eit1+θ2eit2+θ3ei(t1+t2)+α.

On differentiating (3.12) with respect to t1 we get, (3.13) D0ΛF2,1[2;2;3;γ(t_;θ_)]{i(θ1+θ3eit2)eit1}=m=1n=0iμm,n(u_)(it1)m1(it2)n(m1)!n!.(3.13) In the light of (3.12), we have the following from (3.13). D0δ1θ1m=0n=0(it1)m(it2)nm!n!eit1+D0δ1θ3m=0n=0(it1)m(it2)nm!n!eit1eit2=m=1n=0μm,n(u_)(it1)m1(it2)n(m1)!n!.Now, on expanding exponential functions, rearranging the term and by using standard properties of double sum we obtain the following. (3.14) m=0n=0μm+1,n(u_)(it1)m(it2)nm!n!=D0δ1m=0n=0(it1)m(it2)nm!n![θ1j=0m(mj)μmj,n(u_+1)+θ3j=0mk=0n(mj)(nk)μmj,nk(u_+1)].(3.14) On equating coefficients of (it1)m(it2)nm!n! on both sides of (3.14), we get the relation (3.10). A similar procedure will give (3.11).

Proposition 3.3

: The (m,n)-th order factorial moment μ[m,n](u_) of the BAZILSD satisfies the following recurrence formulae, for m,n0, in which μ[0,0](u_)=1. (3.15) μ[m+1,n](u_)=D0δ1(θ1+θ3)μ[m,n](u_+1)+D0δ1θ3nμ[m,n1](u_+1),(3.15) (3.16) μ[m,n+1](u_)=D0δ1(θ2+θ3)μ[m,n](u_+1)+D0δ1θ3mμ[m1,n](u_+1).(3.16)

Proof:

Let V=(V1,V2) be a random vector having the BAZILSD with pgf H(t1,t2) as given in (3.1). Then the factorial moment generating function F(t1,t2) of the BAZILSD is (3.17) F(t1,t2)=H(1+t1,1+t2)=ΛF2,1[1,1;2;η(t_;θ_)]=m=0n=0μ[m,n](u_)t1mt2nm!n!,(3.17) where η(t_;θ_)=η(t1,t2;θ1,θ2,θ3,α) =θ1+θ2+θ3+(θ1+θ3)t1+(θ2+θ3)t2+θ3t1t3+α.

On differentiating (3.16) with respect to t1, we get F(t1,t2)t1=[(θ1+θ3)+θ3t2]D0F2,1[2,2;3;η(t_;θ_)].In the light of (3.17), we can write this as (3.18) m=0n=0μ[m+1,n](u_)t1mt2nm!n!=D0δ1[(θ1+θ3)m=0n=0μ[m,n](u_+1)t1mt2nm!n!+θ3m=0n=0μ[m,n](u_+1)t1mt2n+1m!n!].(3.18) Equating the coefficient of t1mt2nm!n! on both sides of (3.18), we get (3.15). Similar procedures will lead to (3.16).

4. Estimation and testing

In this section, we discuss the estimation of the parameters θ1, θ2, θ3 and α of the BAZILSD by the method of method maximum likelihood and construct certain test procedures for testing the significance of the additional parameter α of the BAZILSD.

4.1. Maximum likelihood estimation

Let a(m,n) be the frequency of the (m,n)-th cell of a bivariate data. Let y be the highest value of m observed and z be the highest value of n observed. Then the likelihood function of the sample is (4.1) L=m=0yn=0z[f(m,n)]a(m,n),(4.1) where f(m,n) is the pmf of the BAZILSD as given in (2.8). Taking logarithm on both sides of (4.1), we get (4.2) logL=m=0yn=0za(m,n)[logΛ+logΩ(m,n;θ1,θ2,θ3,α)],(4.2) where Λ is given in (1.9), Ω(m,n;θ1,θ2,θ3,α)=r=0min(m,n)βm+nr(α)Drθ1mr(mr)!θ2nr(nr)!θ3rr!,and Dr is defined in Proposition 2.4.

Let θˆ1, θˆ2, θˆ3 and αˆ denote the maximum likelihood estimators of the parameters θ1, θ2, θ3 and α of the BAZILSD. On differentiating (4.2), partially with respect to the parameters θ1, θ2, θ3 and α, respectively, and equating to zero, we get the following likelihood equations, in which Φ(θ1,θ2,θ3,α)=m=0yn=0za(m,n)logΛ=m=0yn=0za(m,n)logR01(θ)=m=0yn=0za(m,n)R02(θ)R0(θ)=m=0yn=0za(m,n)R02(θ)D0R1(θ),in the light of Rj(θ)=DjRj+1(θ), where Dj and Rj(θ) are defined in (2.5) and (1.8), respectively. (4.3) (4.4) Φ(θ1,θ2,θ3,α)+m=0yn=0za(m,n)r=0min(m,n)βm+nr(α)Drθ1mr1(mr1)!θ2nr(nr)!θ3rr!Ω(m,n;θ1,θ2,θ3,α)=0,Φ(θ1,θ2,θ3,α)+m=0yn=0za(m,n)r=0min(m,n)βm+nr(α)Dθ1mr(mr)!θ2nr1(nr1)!θ3rr!Ω(m,n;θ1,θ2,θ3,α)=0,(4.3) (4.4) (4.5) Φ(θ1,θ2,θ3,α)+m=0yn=0za(m,n)r=0min(m,n)βm+nr(α)Drθ1mr(mr)!θ2nr(nr)!θ3r1(r1)!Ω(m,n;θ1,θ2,θ3,α)=0,(4.5) and (4.6) Φ(θ1,θ2,θ3,α)+m=0yn=0za(m,n)r=0min(m,n)(m+nr+1)2βm+nr+1(α)Drθ1mr(mr)!θ2nr(nr)!θ3rr!(m+nr+2)Ω(m,n;θ1,θ2,θ3,α)=0.(4.6) Now on solving these likelihood equations (4.3)–(4.6) by using some mathematical software such as MATHLAB, MATHCAD, MATHEMATICA, etc., one can obtain the maximum likelihood estimators of the parameters θ1, θ2, θ3 and α.

4.2. Testing of the hypothesis

For testing the hypothesis H0:α=0 against the alternative hypothesis H1:α0, we construct the generalized likelihood ratio test (GLRT) and Rao’s efficient score test (REST) as follows.

In case of (GLRT), the test statistic is (4.7) 2logλ=2[logL(Ωˆ_;x)logL(Ωˆ_;x)],(4.7) where Ωˆ_ is the maximum likelihood estimator of Ω_=(θ1,θ2,α) with no restrictions, and Ωˆ_ is the maximum likelihood estimator of Ω_ when α=0. The test statistic 2logλ given in (4.7) is asymptotically distributed as Chi-square with one degree of freedom. For details, see Rao (Citation1973).

In case of (REST), the following test statistic can be used. (4.8) S=Tϕ1T,(4.8) where T=(T1,T2,T3,T4) and ϕ=(Irs)4×4 are the Fisher information matrices in which Ti and Irs for i=1,2,3,4 and r,s=1,2,3,4 are as given in the Appendix. The test statistic given in (4.8) follows Chi-square distribution with one degree of freedom (see Rao, Citation1973).

5. Applications

For numerical applications, we consider two real-life data sets of which the first data set is from MitchelL and Paulson (Citation1981), which consists of the number of aborts by 109 aircrafts in two consecutive six months of one year period and the second data set, taken from Partrat (Citation1993), is the yearly frequency of hurricanes affecting tropical cyclones in two zones belonging to the North Atlantic coastal states in the USA. We have fitted the BZILSD, the BAZILSD and the bivariate Poisson distribution (BPD) to these data sets by the method of the maximum likelihood estimates of the parameter of the models. For the first data set, the maximum likelihood estimates (MLES) of the parameters in case of the BZILSD are θˆ1=0.75, θˆ2=0.17 and θˆ3=0.01, those in case of the BAZILSD are θˆ1=0.65, θˆ2=0.23, θˆ3=0.04 and αˆ=0.02, and those in case of the BPD are λˆ1=0.67 λˆ2=0.47and λˆ3=0.01. For the second data set, the MLES of the parameters in case of the BZILSD are θˆ1=0.55, θˆ2=0.36 and θˆ3 = 0.02, those in case of the BAZILSD are θˆ1=0.35, θˆ2=0.31, θˆ3=0.04 and αˆ=0.01, and those in case of the BPD are λˆ1=0.62, λˆ2=0.61 and λˆ3=0.01. The computed values of the expected frequencies of the BZILSD, the BAZILSD and the BPD are all presented in the Tables and .

Table 1. Observed frequencies and computed values of expected frequencies of the BZILSD, the BAZILSD and the BPD by method of maximum likelihood for the first data set.

Table 2. Observed frequencies and computed values of expected frequencies of the BZILSD, the BAZILSD and the BPD by method of maximum likelihood for the second data set.

(In each cell, the first row represents the observed frequency, the second row represents theoretical frequency of the BZILSD, the third row represents theoretical frequency of BAZILSD and the last row represents theoretical frequency of BPD).

(In each cell, the first row represents the observed frequency, the second row represents theoretical frequency of the BZILSD, the third row represents theoretical frequency of BAZILSD and the last row represents theoretical frequency of BPD).

The goodness of fit is applied to the first data set in case of the BAZILSD in nine categories [such as (0,0), (0,1), (0,2), (0, 3 and above); (1,0), (1, 1 and above); (2,0), (2, 1 and above) and (3,0 and above)], that in case of the BZILSD in eight categories [such as (0,0), (0,1), (0,2), (0, 3 and above); (1,0), (1, 1 and above); (2, 0 and above) and (3,0 and above)] and that in case of the BPD in seven categories [such as (0,0), (0,1 and above); (1,0), (1, 1 and above); (2, 0), (2, 1 and above); (3,0 and above)]. In the second data set, in case of the BAZILSD the goodness of fit is applied in seven categories [such as (0,0), (0,1), (0, 2 and above); (1,0), (1, 1 and above); (2, 0 and above) and (3,0 and above)], that in case of the BZILSD there are seven categories [such as (0,0), (0,1), (0, 2 and above); (1,0), (1, 1 and above) and (2, 0), (2,1 and above)] and that in case of the BPD in seven categories [such as (0,0), (0,1), (0, 2 and above); (1,0), (1, 1 and above); (2, 0), (2, 1 and above)]. The computed values of the Chi-square statistic and P in case of both the models – BZILSD, BAZILSD and BPD for data set 1 and data set 2 are all presented in Table . Based on the values of Chi-square statistic and P, it can be observed that BAZILSD gives a better fit to both data sets compared to the existing models – the BZILSD and the BPD.

Table 3. The computed Chi-square value and P value while fitting the models – BZILSD, BAZILSD and BPD for the Data set 1 and Data set 2.

Table contains the computed values of logL(Ωˆ_;x),logL(Ωˆ_;x) and the GLRT statistic for the BAZILSD in case of for both the data sets. We have also computed the values of S based on (4.8) for the BAZILSD in the case of first data set as S1 and for the BAZILSD in the case of second data set S2 as given below. S1=(1.583.287.8212.57) [0.080.040.050.010.040.060.010.040.050.010.060.020.010.040.020.04] (1.583.287.8212.57)=6.26,S2=(0.131.295.597.96) [0.400.260.140.030.260.180.080.020.050.080.060.0080.010.020.0080.02] (0.131.295.597.96)=4.98.Since the critical value for the test at 5% level of significance and one degree of freedom is 3.84, the null hypothesis that H0:α=0 is rejected in both the above cases in respect of GLRT and REST.

Table 4. The computed the values of logL(Ωˆ_;x),logL(Ωˆ_;x) and the generalized likelihood ratio test statistic under H0.

6. Simulation

It is quite difficult to examine the theoretical performance of the estimators of different parameters of the BAZILSD obtained by the method of maximum likelihood. So we have attempted a simulation study for assessing the performance of the estimators. We have simulated three data sets of sample size 150, 300 and 600 in both the positively correlated and negatively correlated situations of the BAZILSD by using Markov chain Monte Carlo (MCMC) procedure, and considered 200 replications in each case. We have considered the following two sets of parameters: (i) θ1=0.4361, θ2=0.2679, θ3=0.1905, α=0.0110 (positively correlated) and (ii) θ1=0.0847, θ2=0.0439, θ3=0.0216, α=0.0112 (negatively correlated) as initial values of the parameters while simulating the data sets. The computed values of the bias and standard errors in case of each of the estimators are given Table . From Table , it can be observed that both the bias and standard errors of the estimators of the parameters are in decreasing order as the sample size increases.

Table 5. Bias and standard errors (within parenthesis) of the estimators of the parameters θ1,θ2, θ3 and α of the BAZILSD for the simulated data sets.

Acknowledgements

The authors are grateful to the Editor-in-Chief and the anonymous Referees for their valuable comments on an earlier version which helped to improve the quality of this article.

References

  • Ghosh, I., & Balakrishnan, N. (2015). Study of incompatibility or near compatibility of bivariate discrete conditional probability distributions through divergence measures. Journal of Statistical Computation and Simulation, 85(1), 117–130. https://doi.org/10.1080/00949655.2013.806509
  • Hassan, M. Y., & El-Bassiouni, M. Y. (2013). Modelling Poisson marked point processes using bivariate mixture transition distributions. Journal of Statistical Computation and Simulation, 83(8), 1440–1452. https://doi.org/10.1080/00949655.2012.662683
  • Johnson, N. L., Kemp, A. W., & Kotz, S. (2005). Univariate discrete distributions. 3rd ed. Wiley.
  • Kemp, A. W. (2013). New discrete Appell and Humbert distributions with relevance to bivariate accident data. Journal of Multivariate Analysis, 113, 2–6. https://doi.org/10.1016/j.jmva.2011.08.011
  • Kocherlakota, S., & Kocherlakota, K. (1992). Bivariate discrete distributions. Marcel Dekker.
  • Kumar, C. S. (2007). Some properties of bivariate generalized hypergeometric probability distribution. Journal of the Korean Statistical Society, 36, 349–355. http://koreascience.or.kr/article/JAKO200734515966569.page?&lang=ko.
  • Kumar, C. S. (2008). A unified approach to bivariate discrete distributions. Metrika, 67(1), 113–121. https://doi.org/10.1007/s00184-007-0125-8
  • Kumar, C. S. (2013). The bivariate confluent hypergeometric series distribution. Economic Quality Control, 28(2), 23–30. https://doi.org/10.1515/eqc-2013-0009
  • Kumar, C. S., & Riyaz, A. (2013). On the zero-inflated logarithmic series distribution and its modification. Statistica, 73(4), 477–492. https://doi.org/10.6092/issn.1973-2201/4498
  • Kumar, C. S., & Riyaz, A. (2014). On a bivariate version of zero-inflated logarithmic series distribution and its applications. Journal of Combinatorics, Information and System Science, 39(4), 249–262.
  • Kumar, C. S., & Riyaz, A. (2015). An alternative version of zero-inflated logarithmic series distribution and some of its applications. Journal of Statistical Computation and Simulation, 85(6), 1117–1127. https://doi.org/10.1080/00949655.2013.867347
  • Kumar, C. S., & Riyaz, A. (2016). An order k version of the alternative zero-inflated logarithmic series distribution and its applications. Journal of Applied Statistics, 43(14), 2681–2695. https://doi.org/10.1080/02664763.2016.1142949
  • Kumar, C. S., & Riyaz, A. (2017). On some aspects of a generalized alternative zero-inflated logarithmic series distribution. Communications in Statistics – Simulation and Computations, 46(4), 2689–2700. https://doi.org/10.1080/03610918.2015.1057287
  • Mathai, A. M., & Haubold, H. J. (2008). Special functions for applied scientists. Springer.
  • MitchelL, C. R., & Paulson, A. S. (1981). A new bivariate negative binomial distribution. Naval Research Logistics Quarterly, 28(3), 359–374. https://doi.org/10.1002/nav.3800280302
  • Partrat, C. (1993). Compound model for two dependent kinds of clam. XXIVe ASTIN Colloquium.
  • Rao, C. R. (1973). Linear statistical inference and its applications. John Wiley.
  • Subrahmaniam, K. (1966). A test for intrinsic correlation in the theory of accident proneness. Journal of the Royal Statistical Society B, 35(1), 131–146. https://doi.org/10.1111/j.2517-6161.1966.tb00631.

Appendix

The entries of T for the computations of the test statistic in case of REST are as given below. T1=1nlogLθ1=1n(Φ(θ1,θ2,θ3,α)+m=0yn=0za(m,n)r=0min(m,n)βm+nr(α)Drθ1mr1(mr1)!θ2nr(nr)!θ3rr!Ω(m,n;θ1,θ2,θ3,α)),T2=1nlogLθ2=1n(Φ(θ1,θ2,θ3,α)+m=0yn=0za(m,n)r=0min(m,n)βm+nr(α)Drθ1mr(mr)!θ2nr1(nr1)!θ3rr!Ω(m,n;θ1,θ2,θ3,α)),T3=1nlogLθ2=1n(Φ(θ1,θ2,θ3,α)+m=0yn=0za(m,n)r=0min(m,n)βm+nr(α)Drθ1mr(mr)!θ2nr(nr)!θ3r1(r1)!Ω(m,n;θ1,θ2,θ3,α))and T4=1nlogLθ4=1n[Φ(θ1,θ2,θ3,α)]+1n(m=0yn=0za(m,n)r=0min(m,n)(m+nr+1)2βm+nr+1(α)Drθ1mr(mr)!θ2nr(nr)!θ3rr!(m+nr+2)Ω(m,n;θ1,θ2,θ3,α))in which Ω(m,n;θ1,θ2,θ3,α) and Φ(θ1,θ2,θ3,α)are defined in Equations (4.2) and (4.3).

The entries of Irs for the computations of the test statistic in case of REST are as given below. For r,s=1,2,3 and 4, Irs’s are given below in which η(θ1,θ2,θ3,α)=m=0yn=0za(m,n)[D0D2R01(θ)R2(θ)D02[R01(θ)]2[R1(θ)]2. I11=log2Lθ12=η(θ1,θ2,θ3,α)+m=0yn=0za(m,n)r=0min(m,n)βm+nr(α)Drθ1mr2(mr2)!θ2nr(nr)!θ3rr!Ω(m,n;θ1,θ2,θ3,α)m=0yn=0za(m,n)[r=0min(m,n)βm+nr(α)Drθ1mr1(mr1)!θ2nr(nr)!θ3rr!]2[Ω(m,n;θ1,θ2,θ3,α)]2, I12=I21=log2Lθ1θ2=η(θ1,θ2,θ3,α)+m=0yn=0za(m,n)r=0min(m,n)βm+nr(α)Drθ1mr1(mr1)!θ2nr1(nr1)!θ3rr!Ω(m,n;θ1,θ2,θ3,α)m=0yn=0za(m,n)r=0min(m,n)βm+nr(α)Drθ1mr1(mr1)!θ2nr(nr)!θ3rr![Ω(m,n;θ1,θ2,θ3,α)]2×r=0min(m,n)βm+nr(α)Drθ1mr(mr)!θ2nr1(nr1)!θ3rr!, I13=I31=log2Lθ1θ3=η(θ1,θ2,θ3,α)+m=0yn=0za(m,n)r=0min(m,n)βm+nr(α)Drθ1mr1(mr1)!θ2nr(nr)!θ3r1(r1)!Ω(m,n;θ1,θ2,θ3,α)m=0yn=0za(m,n)r=0min(m,n)βm+nr(α)Drθ1mr1(mr1)!θ2nr(nr)!θ3rr![Ω(m,n;θ1,θ2,θ3,α)]2×r=0min(m,n)βm+nr(α)Drθ1mr(mr)!θ2nr(nr)!θ3r1(r1)!, I14=I41=log2Lθ1α=η(θ1,θ2,θ3,α)+m=0yn=0za(m,n)r=0min(m,n)(m+nr+1)2βm+nr+1(α)Drθ1mr1(mr1)!θ2nr(nr)!θ3rr!(m+nr+2)Ω(m,n;θ1,θ2,θ3,α)m=0yn=0za(m,n)r=0min(m,n)βm+nr(α)Drθ1mr1(mr1)!θ2nr(nr)!θ3rr![Ω(m,n;θ1,θ2,θ3,α)]2×r=0min(m,n)(m+nr+1)2βm+nr+1(α)Drθ1mr(mr)!θ2nr(nr)!θ3rr!(m+nr+2), I22=log2Lθ22=η(θ1,θ2,θ3,α)+m=0yn=0za(m,n)r=0min(m,n)βm+nr(α)Drθ1mr(mr)!θ2nr2(nr2)!θ3rr!Ω(m,n;θ1,θ2,θ3,α)m=0yn=0za(m,n)[r=0min(m,n)βm+nr(α)Drθ1mr(mr)!θ2nr1(nr1)!θ3rr!]2[Ω(m,n;θ1,θ2,θ3,α)]2, I23=I32=log2Lθ2θ3=η(θ1,θ2,θ3,α)+m=0yn=0za(m,n)r=0min(m,n)βm+nr(α)Drθ1mr(mr)!θ2nr1(nr1)!θ3r1(r1)!Ω(m,n;θ1,θ2,θ3,α)m=0yn=0za(m,n)r=0min(m,n)βm+nr(α)Drθ1mr(mr)!θ2nr1(nr1)!θ3rr![Ω(m,n;θ1,θ2,θ3,α)]2×r=0min(m,n)βm+nr(α)Drθ1mr(mr)!θ2nr(nr)!θ3r1(r1)!, I24=I42=log2Lθ2α=η(θ1,θ2,θ3,α)+m=0yn=0za(m,n)r=0min(m,n)(m+nr+1)2βm+nr+1(α)Drθ1mr(mr)!θ2nr1(nr1)!θ3rr!(m+nr+2)Ω(m,n;θ1,θ2,θ3,α)m=0yn=0za(m,n)r=0min(m,n)βm+nr(α)Drθ1mr(mr)!θ2nr1(nr1)!θ3rr![Ω(m,n;θ1,θ2,θ3,α)]2×r=0min(m,n)(m+nr+1)2βm+nr+1(α)Drθ1mr(mr)!θ2nr(nr)!θ3rr!(m+nr+2), I33=log2Lθ32=η(θ1,θ2,θ3,α)+m=0yn=0za(m,n)r=0min(m,n)βm+nr(α)Drθ1mr(mr)!θ2nr(nr)!θ3r2(r2)!Ω(m,n;θ1,θ2,θ3,α)m=0yn=0za(m,n)[r=0min(m,n)βm+nr(α)Drθ1mr(mr)!θ2nr(nr)!θ3r1(r1)!]2[Ω(m,n;θ1,θ2,θ3,α)]2, I34=I43=log2Lθ3α=η(θ1,θ2,θ3,α)+m=0yn=0za(m,n)r=0min(m,n)(m+nr+1)2βm+nr+1(α)Drθ1mr(mr)!θ2nr(nr)!θ3r1(r1)!(m+nr+2)Ω(m,n;θ1,θ2,θ3,α)m=0yn=0za(m,n)r=0min(m,n)βm+nr(α)Dθ1mr(mr)!θ2nr(nr)!θ3r1(r1)![Ω(m,n;θ1,θ2,θ3,α)]2×r=0min(m,n)(m+nr+1)2βm+nr+1(α)Drθ1mr(mr)!θ2nr(nr)!θ3rr!(m+nr+2) and I44=log2Lθ32=η(θ1,θ2,θ3,α)+m=0yn=0za(m,n)r=0min(m,n)(m+nr+1)2(m+nr+2)2βm+nr+2(α)Drθ1mr(mr)!θ2nr(nr)!θ3rr!(m+nr+2)(m+nr+3)Ω(m,n;θ1,θ2,θ3,α)m=0yn=0za(m,n)(r=0min(m,n)(m+nr+1)2βm+nr+2(α)Drθ1mr(mr)!θ2nr(nr)!θ3rr!)2(m+nr+2)2[Ω(m,n;θ1,θ2,θ3,α)]2.