289
Views
0
CrossRef citations to date
0
Altmetric
Applied & Interdisciplinary Mathematics

Actuarial measures, aggregate loss models, and insurance applications of the sine exponentiated Burr XII distribution

ORCID Icon | (Reviewing editor:)
Article: 2318885 | Received 10 Nov 2023, Accepted 11 Feb 2024, Published online: 26 Feb 2024

ABSTRACT

The appropriate use of trigonometric functions in distribution theory can not be over-emphasized. In this paper, an extension of the exponentiated Burr XII distribution is proposed via the sine-Gfamily of distributions. This distribution is known as the sine exponentiated Burr XII distribution. The density is decreasing and right skewed with varying degree of peakedness whereas the hazard rate is decreasing and unimodal shaped. Mathematical properties such as the quantile function, moments, incomplete moments, moment generating function, and order statistics are derived. Also, actuarial measures such as value at risk, tail value at risk, and tail variance are derived and studied. The results show that the actuarial measures of the SEBXII distribution increase with increasing confidence level an indication that the SEBXII distribution is heavy-tailed. Under collective risk model, the mean and variance of the aggregate loss distribution where the frequency distribution for the claim count is Poisson are derived and studied. Estimates of the expected payout and the variance of the payout for the aggregate loss are obtained for varying parameter values. Five estimation approaches are used namely the maximum likelihood, least squares, weighted least squares, percentile, and Anderson-Darling are employed in examining the behavior of the proposed distribution. Simulations are carried for each of the estimation method. The results show that in most of the estimation methods, the AB and MSE approaches zero as the sample size increases, an indication that the estimators are consistent. That is the estimators of the proposed distribution are consistent for all the estimation methods. The usefulness of the proposed distribution is demonstrated with two insurance loss datasets. The results show that the proposed distribution provide the best parametric fit for the two datasets. The proposed distribution is flexible and capable of modeling heavy-tailed datasets.

1. Introduction

The modeling of random processes that are often observed in applied sciences remain a growing concern for researchers to investigate. Some researchers have proposed many families of distributions in the literature in an attempt to improve the flexibility of existing distributions. For example see Zografos and Balakrishman (Citation2009); Eugene et al. (Citation2002); Gupta and Kundu (Citation2001); Cordeiro and de Castro (Citation2011).

New distributions based on trigonometric functions are gaining popularity. These distributions are mostly flexible by making use of trigonometric functions curvature properties. Without changing any existing parameters, additional power is particularly acquired in the tails of the distribution (Chesneau & Artault, Citation2021; Chesneau & Jamal, Citation2019; Chesneau et al., Citation2019; Jamal et al., Citation2021; Mahmood et al., Citation2019; Souza et al., Citation2019). Trigonometric functions have several uses in many fields, including mathematics, engineering, medicine, and finance. Gilbert (Citation1895) first used the sine distribution to calculate the impact angles that meteorites had on the moon’s craters.

The sine-G family of distributions was introduced by Souza (Citation2015). Let F(x) be the cumulative distribution function (CDF) and f(x) be the probability density function (PDF). Then, if a random variable X follows the sine-G family of distributions, its CDF is designated by

(1) F(x)=sin(π2G(x)),xR,(1)

with corresponding PDF given by

(2) f(x)=π2g(x)cos(π2G(x)),xR.(2)

It is desirable to know that the sine transformation can greatly increase the flexibility of F(x). This family of distributions has the advantages of being straightforward and having the same number of parameters for F(x) and G(x), preventing over-parameterization. As a result, F(x) can, by extension, increase G(x)’s flexibility. Similar works can be found in Famoyi et al. (Citation2021); Isa et al. (Citation2022); Abonongo et al. (Citation2022); Abonongo et al. (Citation2023); Souza (Citation2015); Souza et al. (Citation2019), and Chesneau and Jamal (Citation2019).

The Burr system of distributions include 12 types of cumulative distribution functions that yields various density shapes (Burr, Citation1942, Citation1968, Citation1973; Burr & Cislak, Citation1968; Hatke, Citation1949; Rodriguez, Citation1977). Alizadeh et al. (Citation2017) proposed and studied the Odd Burr power Lindley distribution. The attractiveness of this family of distributions for model fitting is that it combines a simple mathematical expression for cumulative distribution function with coverage in the skewness-kurtosis plane. The Burr XII distribution has received a lot of attention in modeling right skewed and heavty-tailed data. Some of the extensions of the Burr XII distribution can be found in Alizadeh et al. (Citation2017); Cordeiro et al. (Citation2018) and Gad et al. (Citation2019). Kumar et al. (Citation2017) introduced the exponentiated Burr XII (EBXII) distribution. Let G(x) be the CDF and g(x) be the PDF of the EBXII distribution. A random variable X, is said to follow the EBXII distribution if the CDF is given by

(3) G(x)=(1(1+xα)θ)β,x>0,α>0,β>0,θ>0,(3)

where α, θ and β are shape parameters.

The PDF is given by

(4) g(x)=αβθxα1(1+xα)(θ+1)(1(1+xα)θ)β1,x>0,(4)

where α, θ and β are as defined earlier.

Additionally, due to the rigidity and unsatisfactory performance of the exponential, gamma, and Weibull distributions, Dutta and Perry (Citation2006) disregarded them in their study on loss distributions. They concluded that one would need to use a model that is flexible enough. To develop flexible and useful distributions in the field of risk management, insurance, actuarial sciences, climate science, extreme value analysis, and portfolio optimization, it is essential to build on existing distributions or to develop new ones. In the literature, Ahmad et al. (Citation2021) proposed a new exponential-X family of distributions with the new exponential Weibull (NE-Weibull) as a special distribution. Also, the new beta power transformed family of distributions with the new beta transformed Weibull model capable of modeling heavy-tailed dataset was proposed by Ahmad et al. (Citation2022). The exponential T-X family of distributions was introduced by Ahmad et al. (Citation2021). They proposed the exponential T-X Weibull (ETX-Weibull) model and study it properties with applications to insurance data. Nevertheless, Lone et al. (Citation2023) examined the trade share dataset using a mixture of two log-Bilal distributions (MLBDs). Sindhu et al. (Citation2023) proposed and studied a novel nonhomogeneous Poisson method on the basis of the new power function distribution. Also, Shafiq et al. (Citation2023) proposed the unit Gumbel Type-II (UG-II) model and explored few statistical properties. The Shanker distribution was extended by Abushal et al. (Citation2023) where they proposed the 2-component mixture of the Shanker model (2-CMSM). A significant feature of the 2-CMSM hazard rate function is that it has increasing, decreasing and upside down bathtub shapes.

Furthermore, from the literature, most existing distributions have a lot of parameters introduced so as to make them flexibility. This leads to computational difficulties and over-parameterization amongst others. Also, the EBXII distribution has not yet receive much attention in terms of enhancing its flexibility with the same number of parameters. This is an attempt to extend the EBXII distribution by the use of the sine function. In this vain, the main goal of this paper is to extend/improve the flexibility of the EBXII distribution without introducing additional parameter(s) via trigonometric functions. Therefore, the SEBXII is proposed and studied to enhance the flexibility and tail behavior of the EBXII distribution. The density of the SEBXII is decreasing and right skewed with varying degree of peakedness whereas the hazard rate is decreasing and unimodal shaped. With these desirable properties it is hoped that the SEBXII distribution will be a good candidate in modeling data. This distribution is a contribution on distributions based on trigonometric transformation. The density and hazard rate plots of the distribution are desirable. Also, the VaR, TVaR, and TV plots show that the SEBXII distribution is capable of modeling heavy tailed datasets. The usefulness of the proposed distribution is demonstrated with two insurance loss datasets.

The rest of the paper is organized as follows: Section 2 presents the SEBXII distribution. The mathematical properties of the distribution is presented in Section 3. The aggregate loss model is presented in Section 4. In Section 5, the estimation methods are presented. Monte Carlo simulations are carried out in Section 6. In Section 7, the applications of the SEBXII distribution is illustrated using two real-life datasets. The assumptions and limitations of the study are discussed in Section 8. Finally, the conclusion is presented in Section 9.

2. Sine exponentiated Burr XII distribution

In this section, the sine exponentiated Burr XII (SEBXII) distribution is proposed and studied. Considering EquationEquation (3) as the baseline distribution in EquationEquation (1), the SEBXII distribution is obtained with CDF given by

(5) F(x)=sin[π2(1(1+xα)θ)β],x>0,α>0,β>0,θ>0,(5)

where α, θ and β are shape parameters.

Remark 1. When α = 1, one will obtain sine exponentiated Lomax (SELo) distribution-(New). When θ = 1, one will obtain sine exponentiated log-logistic (SELL) distribution-(New). Also, when β = 1, one will obtain the sine-Burr XII distribution (Isa et al., Citation2022).

The related PDF of the SEBXII distribution is given by

(6) f(x)=παβθ2xα1(1+xα)(θ+1)(1(1+xα)θ)β1×cos[π2(1(1+xα)θ)β],x>0.(6)

The corresponding hazard rate function is given by

(7) h(x)=παβθxα1(1+xα)(θ+1)(1(1+xα)θ)β1×cos[π2(1(1+xα)θ)β]2[1sin[π2(1(1+xα)θ)β]],x>0.(7)

The plots for the density and hazard rate functions of the SEBXII distribution are shown in Figure . The density is decreasing and right skewed with varying degree of peakedness whereas the hazard rate is decreasing and unimodal shaped. This is a demonstration of the flexibility of the SEBXII distribution.

Figure 1. Plots for the density (a) and hazard rate (b) functions of the SEBXII distribution.

Figure 1. Plots for the density (a) and hazard rate (b) functions of the SEBXII distribution.

Before deriving the mathematical properties of the SEBXII distribution, it is desirable to obtain the linear representation of the PDF. Employing Taylor series expansion of the cosine function; cosx=i=0(1)ix2i(2i)!. Hence, from EquationEquation (6),

cos[π2(1(1+xα)θ)β]=i=0(1)i(π/2)2i2i×(1(1+xα)θ)2.

This implies that

f(x)=αβθi=0(1)i(π/2)2i2ixα1(1+xα)(θ+1)×(1(1+xα)θ)2+β1.

Considering

(1(1+xα)θ)2+β1=j=0(1)j(2+β1j)×(1+xα),

the linear representation of the PDF of the SEBXII distribution becomes

(8) f(x)=αβθj=0i=0Φijxα1(1+xα)(θ+1),(8)

where Φij=(1)i+j(π/2)2i+12i!(2+β1j).

3. Mathematical properties

In this section, some mathematical properties of the SEBXII distribution are derived.

3.1. Quantile function

The quantile function is used in obtaining random samples from a probability distribution. The quantile function of the SEBXII distribution is given by

xp=[(1(2πarcsin(p))1/β)1/θ1]1/α,p(0,1).

3.2. Moments and incomplete moments

The moments are important in estimating measures of variation like the variance, standard deviation, coefficient of variation, mean deviation, median deviation, kurtosis, skewness amongst others.

The rth non-central moment by definition is given by

μr=0xrf(x)dx.

This implies

μr=αβθj=0i=0Φij0xrxα1(1+xα)(+(θ+1))dx.

After some algebra, simplification and making use of the identity B(s,t)=01xs1(1x)t1dx, one obtains

(9) μr=βθj=0i=0ΦijB(2+rαθ,1+rα),α>r,(9)

where B(.,.) is the beta function with r=1,2,3,

The values for the first four moments, standard deviation (σ), coefficient of variation (CV), coefficient of skewness (CS), and coefficient of kurtosis (CK) of the SEBXII distribution for four set of parameter values: I=(α = 1.82, β = 0.83, θ = 5.3), II=(α = 2.6, β = 0.14, θ = 1.82), III=(α = 1.1, β = 0.6, θ = 2.31), and IV=(α = 3.7, β = 0.31, θ = 4.52) are shown in Table via numerical integration. The standard deviation (σ), coefficient of variation (CV), coefficient of skewness (CS), and coefficient of kurtosis (CK) are defined as σ=μ2(μ1)2, CV=σμ1, CS=μ33μ1μ2+2(μ1)3σ3, and CK=μ44μ1μ3+6(μ1)2μ23(μ1)4σ4 respectively.

Table 1. First four moments, σ, CV, CS, and CK of the SEBXII distribution

The rth incomplete moment by definition is given by

(10) Mr(x)=0yxrf(x)dx.(10)

Substituting EquationEquation (8) into EquationEquation (10), one obtains

Mr(x)=αβθj=0i=0Φij0yxrxα1(1+xα)(+(θ+1))dx,

and solving, one obtains

Mr(x)=βθj=0i=0ΦijB((1+xα)1;2+rαθ,1+rα),α>r,

where B(.;.,.) is the incomplete beta function with r=1,2,3,

3.3. Moment generating function

The moment generating function (MGF) of a random variable X by definition is given as, Mx(t)=E(etx), if it exists. Using series expansion, Mx(t)=r=0trr!μr. Thus,

Mx(t)=r=0trr!βθj=0i=0ΦijB(2+rαθ,1+rα),α>r.

Therefore, the MGF of the SEBXII distribution is given by

Mx(t)=βθtrr!r=0ji=0ΦijB(2+rαθ,1+rα),α>r.

3.4. Actuarial measures

Actuarial measures such as value at risk (VaR), tail value at risk (TVaR), and tail variance (TV) of the SEBXII distribution are derived and studied.

3.4.1. Value at risk

The VaR is a benchmark for evaluating market risk. It is also called the quantile premium principle or quantile risk measure. It is usually expressed with a confidence level q (usually 90%, 95% or 99%), and represents the percentage of loss in portfolio value that will be equaled or exceeded only X percent of the time. The value at risk of the SEBXII distribution is given by

VaR(x)=[(1(2πarcsin(q))1/β)1/θ1]1/α,q(0,1).

3.4.2. Tail value at risk

The TVaR is used in estimating the risk beyond a certain probability level. The TVaR of random variable X is defined as

(11) TVaR(x)=11qVaRxf(x)dx.(11)

Substituting EquationEquation (8) into EquationEquation (11) will yield

TVaR(x)=αβθ1qj=0i=0ΦijVaRxxα1(1+xα)(θ+1)dx.

After some algebraic manipulations, one obtains the TVaR of the SEBXII distribution as

TVaR(x)=βθ1qj=0i=0ΦijB((1+VaRα)1;2+1αθ,1+1α)

3.4.3. Tail variance

The TV is important in estimating the risk at the tails. By definition, the TV of a random variable X given by

TV(x)=E(X2|X>xq)(TVaR(x))2.

Considering,

E(X2|X>xq)=αβθ1qj=0i=0ΦijVaRx2xα1(1+xα)(+(θ+1))dx.

On solving,

TV(x)=βθ1qj=0i=0ΦijB((1+VaRα)1;2+2αθ,1+2α)(TVaR(x))2.

Table shows the simulation results for the actuarial measures for n = 200 and different set of parameter values. The results show that the actuarial measures of the SEBXII distribution increase with increasing confidence level an indication that the SEBXII distribution is leptokurtic and heavy-tailed. This is confirmed by the plots in Figures where the plots are all increasing at varying parameter values for VaR, TVaR, and TV.

Figure 2. VaR plots for SEBXII distribution.

Figure 2. VaR plots for SEBXII distribution.

Figure 3. TVaR plots for SEBXII distribution.

Figure 3. TVaR plots for SEBXII distribution.

Figure 4. TV plots for SEBXII distribution.

Figure 4. TV plots for SEBXII distribution.

Table 2. Simulation results of actuarial measures for n = 200

The simulation steps are as follows:

  1. Random sample of size n = 200 are generated from the SEBXII distribution and parameters estimated via maximum likelihood method.

  2. 1000 repetitions are made to calculate the VaR, TVaR and TV for the SEBXII distribution.

3.5. Order statistics

Let X1,X2,,Xn be a sample of size n from the SEBXII distribution and X1:nX2:nXn:n denote the order statistics of the sample.

The PDF of the ith order statistics is defined as

(12) fi:n(x)=n!(i1)!(ni)!f(x)k=0ni(1)k(nik)(F(x))k+i1.(12)

Substituting EquationEquation (3) and EquationEquation (4) into EquationEquation (12), will give

(13) fi:n(x)=n!αβθ2(i1)!(ni)!xα1(1+xα)(θ+1)×(1(1+xα)θ)β1cos[π2(1(1+xα)θ)β].(13)

The PDF of the nth order statistics is defined as

(14) fn:n(x)=n[F(x)]n1f(x).(14)

Substituting EquationEquation (3) and EquationEquation (4) into EquationEquation (14), the PDF of the nth order statistics becomes

(15) fn:n(x)=nπαβθ2xα1(1+xα)(θ+1)(1(1+xα)θ)β1×cos[π2(1(1+xα)θ)β]×sin[π2(1(1+xα)θ)β]n1.(15)

The PDF of the first order statistics is given by

(16) f1:n(x)=n[1F(x)]n1f(x).(16)

Substituting EquationEquation (3) and EquationEquation (4) into EquationEquation (16), the PDF of the first order statistics becomes

(17) f1:n(x)=nπαβθ2xα1(1+xα)(θ+1)(1(1+xα)θ)β1×cos[π2(1(1+xα)θ)β]×[1sin[π2(1(1+xα)θ)β]]n1.(17)

4. Aggregate loss models under collective risk theory

In this section, the aggregate loss models of the SEBXII distribution are presented. Suppose N claims are experienced in the past and each unit xi is the claim size which is independent identical distributed SEBXII with parameter α, β, and θ. The aggregate loss S is defined as S=x1+x2+x3++xN. If the individual loss amounts xi are independent on the annual loss frequency N, then the PDF of the aggregate loss is defined as

fs(s)=i=1Pr(N=i)fxi(s),

where fxi(s) is the ith fold convolution of S. The mean and variance of the aggregate loss distribution are defined as

E(S)=E(N)E(X)

and

V(S)=E(N)V(X)+(E(X))2V(N)

respectively.

Suppose the frequency of the loss follows the Poisson distribution with parameter λ, the mean and variance of the aggregate loss distribution can be written as

(18) E(S)=λβθj=0i=02iΦijB(2+1αθ,1+1α)(18)

and

(19) V(S)=λβθj=0i=02iΦij[B(2+2αθ,1+2α)](19)

respectively.

Making use of EquationEquation (18) and EquationEquation (19) the estimates of the mean (expected payout) and the variance of the payout for the aggregate loss are obtained. These are presented in Table .

Table 3. Mean and variance of aggregate loss of SEBXII under the Poisson distribution with λ = 5

5. Estimation methods

In this section, five methods of estimation are presented on how to obtain the parameters of the SEBXII distribution either by maximizing or minimizing the function under consideration. These are the maximum likelihood (MLE), least-squares (LS), weighted least-squares (WLS), percentile (PC), and Anderson-Darling (AD).

5.1. Maximum likelihood estimation

The maximum likelihood estimation (MLE) is obtained by maximizing the log-likelihood function. Let x1,x2,,xn be n random samples from the SEBXII distribution, then, the log-likelihood function which is a n × 1 parameter vector Δ=(α,β,θ)T is given by

(20) (Δ)=n(log(παβθ2))+(α1)i=1nlog(xi)(θ+1)×i=1nlog(1+xiα)+(β1)i=1nlog(1(1+xiα)θ)+i=1nlog(cos[π2(1(1+xα)θ)β]).(20)

Equating the score functions (differentiating EquationEquation (20) with respect to each of the parameter) to zero and solving the resulting system of equations numerically, the maximum likelihood estimates of the parameters are obtained.

5.2. Method of Anderson-Darling

Let x1,x2,,xn be n random sample from the SEBXII distribution, the Anderson-Darling (AD) estimation is obtained by minimizing the following function:

(21) AD(Δ)=n1n1=1n(2i1)[logF(x(i);α,β,θ)+logS(x(n+1i);α,β,θ)]=n1n[log(sin[π2(1(1+x(i)α)θ)β])+log(1sin[π2(1(1+x(n+1i)α)θ)β])].(21)

5.3. Method of percentile

Let x1,x2,,xn be n random sample from the SEBXII distribution, and ui=i/(n+1). The percentile (PC) estimation of the SEBXII distribution is obtained by minimizing the following function:

(22) PC(Δ)=i=1n[xi[(1(2πarcsin(ui))1/β)1/θ1]1/α]2.(22)

5.4. Method of least square and weighted least square

Let x1,x2,,xn be n random sample from the SEBXII distribution, the least square (LS) estimation of the SEBXII is obtained by minimizing the following function:

(23) LS(Δ)=i=1n[F(x(i),α,β,θ)in+1]2(23)
=i=1n[sin[π2(1(1+x(i)α)θ)β]in+1]2.

The weighted least square (WLS) estimation of the parameters of the SEBXII distribution is obtained by minimizing the following function:

(24) WLS(Δ)=i=1n(n+1)2(n+2)i(ni+1)[F(x(i),α,β,θ)in+1]2=i=1n(n+1)2(n+2)i(ni+1)×[sin[π2(1(1+x(i)α)θ)β]in+1]2.(24)

6. Monte Carlo simulation

In this section, simulation study to assess the performance of the five estimation methods is carried out. The nlminb function in the R program is used in the simulation. The function uses the L-BFGS-B optimization method. The simulation steps are as follows:

  1. Generate N = 1000 sample of size n=25,100,$\break$200,350,500 from the quantile function of the SEBXII distribution.

  2. Compute the MLE, AD, LS, WLS, and PC parameter estimates of the sample obtained in (i).

  3. For each parameter estimate, calculate the absolute baise (AB) and mean square error (MSE) defined as

    AB=1Ni=1N(Δ^iΔi)

    and

    MSE=1Ni=1N(Δ^iΔi)2

    for Δ=(α,β,θ), respectively.

  4. Steps (i) and (iii) are repeated for two sets of parameter values; I=(α=1.02,β=0.53,θ=2.23) and II=(α=0.65,β=2.46,θ=0.82).

Tables and show the simulation results of the five estimation methods. For most of the estimation methods, the AB and MSE approaches zero as the sample size increases, an indication that the estimators are consistent. Also, for smaller sample size, PC performed better for α and θ, while MLE performed better for β. For larger sample size, PC performed better for α, while MLE performed better for β and θ.

Table 4. Simulation results of AB and MSE for α = 1.02, β = 0.53, θ = 2.23

Table 5. Simulation results of AB and MSE for α = 0.65, β = 2.46, θ = 0.82

7. Applications

In this section, the usefulness and flexibility of the SEBXII distribution using insurance loss datasets are demonstrated. The performance of the SEBXII distribution is compared with the exponentiated Burr XII (Kumar et al., Citation2017), Burr XII (Burr, Citation1942), sine Burr XII (Isa et al., Citation2022), modified Weibull distribution (Sarhan & Zaindin, Citation2009), and flexible Lomax (Ijaz et al., Citation2020) distributions. The performance of the distributions about providing proper parametric fit to the dataset is compared using the Akaike information criterion (AIC), Bayesian information criterion (BIC), Cramér-von Misses (W), Anderson-Darling (A) and Kolmogorov Smirnov (K-S) statistics. The distribution with the least of these measures provide a reasonable fit to the dataset. The density function of exponentiated Burr XII (EBXII), Burr XII (BXII), sine Burr XII (SBXII), modified Weibull distribution (MWD), and flexible Lomax (FL) distributions are as follows:

f(x)=αβθxα1(1+xα)(θ+1)(1(1+xα)θ)β1,x>0,α,β,θ>0.f(x)=ckxc1(1+xc)(k+1),x>0,c,k>0.f(x)=π2abxa1(1+xa)b+1cos(π2(11(1+xa)b)),x>0,a,b>0.f(x)=(α+βγxγ1)eαxβxγ,x>0,α,β,γ>0.f(x)=acbcxc1(1+(xb)c)(a+1),x>0,a,b,c>0.

7.1. Automobile collision data

The first dataset consist of severity; the average amount of claims (in pounds sterling) adjusted for inflation, of automobile collisions in the United Kingdom. It is available in the R package “insuranceData” (Wolny-Dominiak & Trzesiok, Citation2014).

The descriptive statistics of the automobile collision data as shown in Table is evident that the data is heavy-tailed and right skewed. The box plot, histogram, violin plot, and kernel density plot of the data is shown in Figure . It confirms the results in Table . The hazard rate function of the automobile collision dataset is increasing since the curve is above the 45% degree line, this is shown in Figure .

Figure 5. Box plot, histogram, violin plot, and kernel density of the automobile collision data.

Figure 5. Box plot, histogram, violin plot, and kernel density of the automobile collision data.

Figure 6. TTT plot of the automobile collision dataset.

Figure 6. TTT plot of the automobile collision dataset.

Table 6. Descriptive statistics of automobile collision data

The MLEs, their standard errors (SEs) and the values of AIC, BIC, K-S, W*, and A* measures are shown in Table . The SEBXII distribution has the least of the information criteria and goodness-of-fit measures, and thus provides the best parametric fit to the automobile collision dataset compared to the other competing distributions.

Table 7. Mles, SEs, information criteria, goodness-of-fit measures for automobile collision data

Figure shows the empirical, fitted CDF and density of the SEBXII distribution for the automobile collision data. The plot shows that the SEBXII distribution provides a good parametric fit to the dataset.

Figure 7. Empirical, fitted CDF and density of the SEBXII distribution for automobile collision dataset.

Figure 7. Empirical, fitted CDF and density of the SEBXII distribution for automobile collision dataset.

7.2. Catastrophe data

The second dataset consist of the cost associated with catastrophe disaster in Australia from 1967 to 2014. The normed cost in millions of 2014 Australian dollars (AUD), computed as the inflated cost using consumer price index. This dataset is available in CASdatasets package (Dutang & Charpentier, Citation2020) of the R software. The descriptive statistics in Table shows that the data is heavy-tailed and right skewed. Figure shows the box plot, histogram, violin, and kernel density plot of the data which confirms the results in Table .

Figure 8. Box plot, histogram, violin plot, and kernel density of the catastrophe data.

Figure 8. Box plot, histogram, violin plot, and kernel density of the catastrophe data.

Table 8. Descriptive statistics of catastrophe data

The hazard rate function of the catastrophe dataset is decreasing since the curve is convex below the 45% degree line, this is shown in Figure .

Figure 9. TTT plot of the catastrophe dataset.

Figure 9. TTT plot of the catastrophe dataset.

The MLEs, their standard errors (SEs) and the values of AIC, BIC, K-S, p-value, W*, and A* measures are shown in Table . The SEBXII distribution has the least of the information criteria and goodness-of-fit measures, and thus provides the best parametric fit to the automobile the catastrophe dataset compared to the other competing distributions.

Table 9. Mles, SEs, information criteria, goodness-of-fit measures for catastrophe data

Figure shows the empirical, fitted CDF and density of the SEBXII distribution for the catastrophe data. The plot shows that the SEBXII distribution provides a good parametric fit to the dataset.

Figure 10. Empirical, fitted CDF and density of the SEBXII distribution for catastrophe dataset.

Figure 10. Empirical, fitted CDF and density of the SEBXII distribution for catastrophe dataset.

8. Assumptions and limitations

In this paper, it is assumed that the EBXII distribution is a baseline distribution in the sine-G family of distributions. The limitations of this study has to do with choosing initial values to include in the simulations so as achieve convergence.

9. Conclusion

Recent studies have shown the potentials of trigonometric functions in proposing new distributions. This technique is flexible due to its desirable properties and mostly avoids over-parameterization. In this paper, an extension of the exponentiated Burr XII distribution is introduced via the sine-G family of distributions. This distribution is known as the sine exponentiated Burr XII distribution. The plots of the density can be decreasing and right skewed with varying degree of peakedness whereas the hazard rate plots can be decreasing and unimodal shaped. Mathematical properties such as the quantile function, moments, incomplete moments, moment generating function, and order statistics are derived. Actuarial measures such as value at risk, tail value at risk, and tail variance are derived and studied. The simulated results of the actuarial measures show that these measures increase with increasing confidence level an indication that the SEBXII distribution is heavy-tailed. Also, under collective risk model, the mean and variance of the aggregate loss distribution where the frequency distribution for the claim count is Poisson are derived and studied. Estimates of the expected payout and the variance of the payout for the aggregate loss are obtained for varying parameter values. To estimate the parameters of the SEBXII distribution, the following five different estimation approaches are used; maximum likelihood (MLE), least squares (LS), weighted least squares (WLS), percentile (PC), and Anderson-Darling (AD). The results of the Monte Carlo simulations show that the estimators of the SEBXII distribution are consistent under all the estimation methods. The usefulness of the SEBXII distribution is demonstrated with two insurance loss datasets namely; the automobile collision data and catastrophe data. The performance of the SEBXII distribution is compared with other competing distributions. It is evident that the SEBXII distribution provide the best parametric fit under the two datasets. This distribution is flexible and can be considered as a good candidate in modeling heavy-tailed datasets in insurance, finance, and other related fields.

Acknowledgements

The author wish to thank the editor and reviewers for their comments that helped in improving this paper.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Funding

The author received no direct funding for this research.

Data availability statement

The data that support the findings of this study are available in the CASdatasets and insuranceData packages of the R software.

Additional information

Notes on contributors

John Abonongo

John Abonongo is a lecturer at the Department of Statistics and Actuarial Science, School of Mathematical Sciences, C. K. Tedam University of Technology and Applied Sciences, Navrongo, Ghana. He has published several research articles in several reputable journals. His research areas: Financial time series, distribution theory, stochastic theory, financial derivative and pricing, actuarial mathematics and Applications.

References

  • Abonongo, J., Mwaniki, I. J., & Aduda, J. A. (2022). Sine F-Loss family of distribution: Properties and insurance applications. Applied Mathematics and Information Sciences, 16(5), 835–851.
  • Abonongo, J., Mwaniki, I. J., & Aduda, J. A. (2023). A new family of F-Loss distributions: Properties and applications. Advances and Applications in Statistics, 87(1), 81–117. https://doi.org/10.17654/0972361723030
  • Abushal, T. A., Sindhu, T. N., Lone, S. A., Hassan, M. K. H., & Shafiq, A. (2023). Mixture of Shanker distributions: Estimation, simulation, and application. Axiom, 12(3), 231. https://doi.org/10.3390/axioms12030231
  • Ahmad, Z., Mahmoudi, E., Alizadeh, M., Roozegar, R., & Afify, A. Z. (2021). The exponential T-X family of distributions: Properties and an application to insurance data. Journal of Mathematics, 2021, 18. https://doi.org/10.1155/2021/3058170
  • Ahmad, Z., Mahmoudi, E., Roozegar, R., Alizadeh, M., & Afify, A. Z. (2021). A new exponential X family: Modeling extreme value data in the finance sector. Mathematical Problems in Engineering, 2021, 14. https://doi.org/10.1155/2021/8759055
  • Ahmad, Z., Mahoudi, E., & Alizadeh, M. (2022). Modeling insurance using a new beta power transformed family of distributions. Communications in Statistics-Simulations and Computation, 51(8), 4470–4491. https://doi.org/10.1080/03610918.2020.1743859
  • Alizadeh, M., Altun, E., & Ozel, G. (2017). Odd burr power Lindley distribution with properties and applications. Gazi University Journal of Sciences, 30(3), 139–159.
  • Burr, I. W. (1942). Cumulative frequency functions. The Annals of Mathematical Statistics, 13(2), 215–232. https://doi.org/10.1214/aoms/1177731607
  • Burr, I. W. (1968). On a general system of distributions, iii. The sample range. Journal of the American Statistical Association, 63(322), 636–643. https://doi.org/10.1080/01621459.1968.11009282
  • Burr, I. W. (1973). Parameters for a general system of distributions to match a grid of α 3 and α 4. Communications in Statistics, 2(1), 1–21. https://doi.org/10.1080/0361/0927/3088/27052
  • Burr, I. W., & Cislak, P. J. (1968). On a general system of distributions: I. its curved-shaped characteristics, ii. The sample median. Journal of the American Statistical Association, 63(322), 627–635. https://doi.org/10.1080/01621459.1968.11009281
  • Chesneau, C., & Artault, A. (2021). On a comparative study on some trigonometric classes of distributions by the analysis of practical datasets. Journal of Nonlinear Modelling Anal, 3(2), 225–262.
  • Chesneau, C., Bakouch, H. S., & Hussain, T. (2019). A new class of probability distributions via cosine and sine functions with applications. Communications in Statistics - Simulation & Computation, 48(8), 2287–2300. https://doi.org/10.1080/03610918.2018.1440303
  • Chesneau, C., & Jamal, F. (2019). The sine kumaraswamy-G family of distributions. HAL, hal-02120197.
  • Cordeiro, G. M., & de Castro, M. (2011). A new family of generalized distribution. Journal of Statistical Computation and Simulation, 81(7), 883–898. https://doi.org/10.1080/00949650903530745
  • Cordeiro, G. M., Yousof, H. M., Ramires, T. G., & Ortega, E. (2018). The burr XII system of densities: Properties, regression model and application. Journal of Statistical Computation and Simulation, 88(3), 432–456. https://doi.org/10.1080/00949655.2017.1392524
  • Dutang, C., & Charpentier, A. (2020). Package CASdatasets.
  • Dutta, K., & Perry, J. (2006). A tale of tails: An empirical analysis of loss distribution models for estimating operational risk capital. Tech. rep., Federal Reserve Bank of Boston, Working Paper No. 06-13.
  • Eugene, N., Lee, C., & Famoye, F. (2002). The beta-normal distribution and its applications. Communications in Statistics-Theory and Methods, 31(4), 497–512. https://doi.org/10.1081/STA-120003130
  • Famoyi, A., Algarni, A., & Almarashi, A. M. (2021). Sine inverse Lomax generated family of distributions with applications. Mathematical Problems in Engineering, 2021, 1–11. https://doi.org/10.1155/2021/1267420
  • Gad, A. M., Hamedani, G. G., Salehabadi, S. M., & Yousof, H. M. (2019). The burr XII-Burr XII distribution: Mathematical properties and characterizations. Pakistan Journal of Statistics, 35(3), 229–248.
  • Gilbert, G. K. (1895). The moon’s face; a study of the origin of its features. Bulletin of the Philosophical Society of Washington, 12, 241–249 doi:.
  • Gupta, R. D., & Kundu, D. (2001). Generalized exponential distribution: Different method of estimations. Journal of Statistical Computation and Simulation, 69(4), 315–337. https://doi.org/10.1080/00949650108812098
  • Hatke, M. A. (1949). A certain cumulative probability function. Annals of Mathematical Statistics, 20(3), 261–463. https://doi.org/10.1214/aoms/1177730002
  • Ijaz, M., Asim, M., & Khali, A. (2020). Flexible Lomax Distribution. Songlanakarin Journal of Science and Technology, 42(5), 1125–1134.
  • Isa, A. M., Ali, B., & Zannah, U. (2022). Sine Burr XII distribution: Properties and application to real data set. Arid Zone Journal of Basic and Applied Research, 1(3), 48–58. https://doi.org/10.55639/607lkji
  • Jamal, F., Chesneau, C., & Aidi, K. (2021). The sine extended odd Fréchet-G family of distribution with applications to complete and censored data. Mathematica slovaca, 71(4), 961–982. https://doi.org/10.1515/ms-2021-0033
  • Kumar, D., Saran, J., & Jain, N. (2017). The exponentiated burr XII distribution: Moments and estimation based on lower record values. Sri Lankan Journal of Applied Statistics, 18(1), 1–17. https://doi.org/10.4038/sljastats.v18i1.7930
  • Lone, S. A., Sunhu, T. N., Anwar, S., Hassan, M. K. H., Alsahli, S. A., & Abushal, T. A. (2023). On construction and estimation of mixture of Log-Bilal distributions. Axioms, 12(3), 309. https://doi.org/10.3390/axioms12030309
  • Mahmood, Z., Chesneau, C., & Tahir, M. H. (2019). A new sin-G family of distributions: Properties and applications. Bulletin of Computational Applied Mathematics, 7(1), 53–81.
  • Rodriguez, N. (1977). A guide to the Burr type probability function. Biometrika, 64(1), 129–134. https://doi.org/10.1093/biomet/64.1.129
  • Sarhan, A. M., & Zaindin, M. (2009). Modified Weibull distribution. Applied Sciences, 11(2009), 123–136.
  • Shafiq, A., Sindhu, T. N., Hussain, Z., Mazucheli, J., & Alves, B. (2023). A flexible problem model for proportion data: Unit Gumbel type-II distribution, development, properties, different method of estimations and applications. Austrain Journal of Statistics, 52(2), 116–140. https://doi.org/10.17713/ajs.v52i2.1407
  • Sindhu, T. N., Anwar, S., Hassan, M. K. H., Lone, S. A., Abushal, T. A., & Shafiq, A. (2023). An analysis of the new reliability model based on bathtub-shaped failure rate distribution with application to failure data. Mathematics, 11(4), 842. https://doi.org/10.3390/math11040842
  • Souza, L. (2015). New Trigonometric Classes of Probabilistic Distributions. Ph.D. thesis,
  • Souza, L., Junior, W. R. O., de Brito, C. C. R., Chesneau, C., Ferreira, T. A. E., & Soares, L. (2019). General properties for the cos-G class of distribution with applications. Eurasian Bulling Math, 2(2), 63–79.
  • Wolny-Dominiak, A., & Trzesiok, M. (2014). Package insuranceData.
  • Zografos, K., & Balakrishman, N. (2009). On families of beta and generalized gamma-generated distributions and associated inference. Statistical Methodology, 6(4), 344–362. https://doi.org/10.1016/j.stamet.2008.12.003