943
Views
1
CrossRef citations to date
0
Altmetric
Articles

Stochastic loss reserving using individual information model with over-dispersed Poisson

, &
Pages 114-128 | Received 20 Aug 2020, Accepted 01 Mar 2021, Published online: 19 Mar 2021

Abstract

For stochastic loss reserving, we propose an individual information model (IIM) which accommodates not only individual/micro data consisting of incurring times, reporting developments, settlement developments as well as payments of individual claims but also heterogeneity among policies. We give over-dispersed Poisson assumption about the moments of reporting developments and payments of every individual claims. Model estimation is conducted under quasi-likelihood theory. Analytic expressions are derived for the expectation and variance of outstanding liabilities, given historical observations. We utilise conditional mean square error of prediction (MSEP) to measure the accuracy of loss reserving and also theoretically prove that when risk portfolio size is large enough, IIM shows a higher prediction accuracy than individual/micro data model (IDM) in predicting the outstanding liabilities, if the heterogeneity indeed influences claims developments and otherwise IIM is asymptotically equivalent to IDM. Some simulations are conducted to investigate the conditional MSEPs for IIM and IDM. A real data analysis is performed basing on real observations in health insurance.

1. Introduction

In the background of stochastic reserving, loss reserving is referred to a procedure to predict incurred outstanding liabilities in general insurance companies. It is well known that chain-ladder method proposed by Mack (Citation1993) and its related versions can be easily performed by using pencil and paper because of the simple aggregate data structure called run-off triangle and hence are popular in practice. However, as England and Verrall (Citation2002) mentioned, the advantages of aggregate data models are at the cost of prediction accuracy because of information loss caused by simply aggregating individual or micro data, which records incurring time, reporting time, settlement time as well as payment processes of individual claims. In risk management of insurance companies, with the modern computer technology, it is urgent for actuaries to explore the usage of related information to improve the accuracy in predicting the liabilities, which also attracts increasing interests of many scholars from actuarial science. Antonio and Plat (Citation2014), Pigeon et al. (Citation2013Citation2014) demonstrated by an empirical analysis that loss reserving based on individual data had more prediction accuracy than aggregate models. Huang, Qiu, Wu, Zhou (Citation2015), Huang, Qiu, Wu (Citation2015) and Huang et al. (Citation2016) revealed that individual loss reserving had more accuracy than methods using aggregate data in sense that the former produced a smaller mean square error.

A small stream of earlier literature about IDMs, for example, Arjas (Citation1989) and Norberg (Citation1993Citation1999) formulated a probabilistic framework for the developments of individual claims. Most recently, Yu and He (Citation2016) modelled the individual claim development process by marked Cox processes (also known as double stochastic processes). As we all know, it is challenging to acquire analytic expressions for the moments of outstanding liabilities under continuous-time IDMs. Perhaps partly for this reason, there is a great deal of work that has been done under discrete-time IDMs, see, e.g., Pigeon et al. (Citation2013Citation2014), Verrall et al. (Citation2010), Huang, Qiu, Wu, Zhou (Citation2015), Huang, Qiu, Wu (Citation2015) and Huang et al. (Citation2016). Zhao and Zhou (Citation2010) considered the R-delays so as to predict the incurred but not settled outstanding liabilities. Unfortunately, IDMs also confront information loss caused by neglecting individual information, i.e., information from policy or policyholder. It is not clear so far how much accuracy in predicting the outstanding liabilities is sacrificed, when the individual information is neglected. In the present paper, we will explore how much improvement in the accuracy that will be measured by conditional MSEP can be achieved by incorporating the useful individual information into modelling under discrete time framework similar as Huang, Qiu, Wu, Zhou (Citation2015), Huang, Qiu, Wu (Citation2015) and Huang et al. (Citation2016). Besides, we avoid the strong Poisson distribution assumption for the number of individual claims assumed in Huang, Qiu, Wu, Zhou (Citation2015), Huang, Qiu, Wu (Citation2015) and Huang et al. (Citation2016) and instead extend to weak assumptions about the first two moments so that parameters estimation can be conducted under quasi-likelihood theory (cf. McCullagh & Nelder, Citation1989).

The conditional MSEP is broadly used to compare different models for loss reserving. It is well known that the conditional MSEP is the sum of process variance caused by the randomness of outstanding liabilities and estimation error originating from uncertainty of parameters estimators. It is theoretically feasible to estimate it by bootstrap method. There were some examples which discussed the MSEP under collective models – for instance, Mack (Citation1993), Mack (Citation2000) (comparing three methods – Benktander, Bornhuetter–Ferguson and chain-ladder under the criteria of MSEP), Alai et al. (Citation2009Citation2010) (Bornhuetter–Ferguson method under generalised linear model) and Wüthrich and Merz (Citation2008) (comprehensive summary of the details of methods based on aggregate data). Besides, Lindholm et al. (Citation2020) introduced a semi-analytic approximation method to estimate the conditional MSEP, where the method is illustrated by loss reserving based on aggregate data. Examples that have applied the approximation method are Wahl (Citation2019), who computed explicit moments of outstanding liabilities by applying discretisation scheme under the framework of Antonio and Plat (Citation2014), and Wahl et al. (Citation2019) who modelled individual data on aggregate level. In the present paper, we also use the approximation method for the MSEP, which is derived under IIM, because of its simplification.

The paper is organised as follows. In Section 2, we describe the data structure and display the mathematical expression of outstanding liabilities caused by a risk portfolio at a given evaluation date. In Section 3, we separately model reporting developments, settlement delays and payments of claims and in each part, we formulate the model assumptions as well as estimation for the model parameters. Section 4 mainly derives the formulas of loss reserve and conditional variance of outstanding liabilities given historical observations, and studies the improvement of accuracy achieved by IIM with respect to IDM. Section 5 reports some simulation results and a real data analysis. Section 6 concludes the paper with a few remarks.

2. Data structure

Claim events incurred by some policy are usually reported to the insurer in some time periods (reporting delays) after their occurrence and the reported claims are finally settled with some time lags (settlement delays) between their reports and final settlements. Before going further, it is necessary to discuss the supports of reporting and settlement delays. In the following assumption, we assume that there exists maximum reporting delay Dr and settlement delay Ds. Actually, there are basically two cases for the supports of the delays: finite and infinite. It would be a known priori (generally read from the items of the insurance contracts) if the supports are finite or infinite before any loss reserving is taken care of. Even for the case the delays take unrestricted values, if the probability to take values over certain limits is quite small, one can safely assume a capped delays by cutting off the tails with probability small enough. As a result, the assumption of capped delays is reasonable in many real insurance businesses, especially for such insurance without very much high claims payments. An example is the general health insurance. The assumption of capped delays has been extensively adopted in such traditional methods as chain-ladder algorithm. If the tails cannot be safely cut off, however, the models such as the one proposed in Crevecoeur et al. (Citation2019) or some others would be more suitable. From the statistical point of view, for their distributions to be reasonably estimated with observations over a finite number of years, at least one of the two assumptions is necessary: they take only a finite number of values with arbitrary probabilities (but subject to normalisation) or countably infinitely many values but with their distribution functions identified by finite many parameters. Whatever the case, the number of unknown parameters that need to be estimated must be finite. Here the former is taken, whereas Crevecoeur et al. (Citation2019), for example, took the latter.

Then we specify the data structure used in our model. It is in discrete time version as, e.g., Huang, Qiu, Wu (Citation2015) did. Typically, the data for modelling is organised through periods with fixed length such as 1 year, one season or 1 month depending on lines of business. Conventionally, those periods are referred to ‘(accident) years’. This is also a way widely adopted by insurers to predict the incurred outstanding liabilities in practice. Specifically, the whole observation horizon is made of n accident years and loss reserving is evaluated at the end of nth accident year. In year i, i=1,2,,n, there are mi insurance policies, each of which is coded by (i,k), k=1,2,,mi.

Every individual (i,k) is associated with a random risk exposure rik and d-dimensional vector of covariate xik whose first entry is 1 and other entries indicating the individual information/features that influence the developments of individual claims. The developments of claims incurred by individual (i,k) are detailed as follows.

  1. The reporting developments of claims are recorded by Nikur, u=0,1,,Dr, where Nikur is the number of claims which are incurred in year i and reported in year i + u.

  2. For Nikur claims, u=0,1,,Dr, their settlement developments are tracked by Nikuv, v=0,1,,Ds, where Nikuv is the number of claims which are reported in year i + u and settled in year i + u + v.

  3. Payments for each claim are assumed to be paid for only once at its final settlement. For Nikuv claims, u=0,1,,Dr, v=0,1,,Ds, we use Yikuvl, l=1,2,,Nikuv to record corresponding payments.

Then the random element associated with individual (i,k) is denoted by {rik,xik;{Nikur;(Nikuv;(Yikuvl)l=1Nikuv)v=0Ds}u=0Dr},i=1,,n,k=1,,mi, which are i.i.d. from the population {r,x;{Nur;(Nuv;(Yuvl)l=1Nuv)v=0Ds}u=0Dr},which can be considered as a complete observation of a representative policy in year i.

Following conventional terms, a claim, which has been reported to the insurer but not settled, is known as RBNS claim and a claim, which has been incurred but not reported to the insurer, is known as IBNR claim. For accident year i, the individual observed data is as follows.

  1. The reporting developments of a representative policy in year i are truncated in sense that we can only observe Fr={N0r,N1r,,NDirr}, where (1) Dir=Dr(ni),(1) represents the largest reporting delays of the reported claims in accident year i.

  2. For Nur reported claims, u=0,1,,Dir, their settlement developments are censored in sense that we can observe {Nu0,Nu1,,NuDi+us,Nurbns}, where (2) Di+us=Ds(niu),(2) is the largest settlement delays of settled claims with reporting delay u in accident year i and the number Nurbns:=v=niu+1DsNuv, which is the number of RBNS claims with reporting delays u. Note that Nurbns=0 if niuDs. Denote by Fs=u=0Dir{Nu0,Nu1,,NuDi+us,Nurbns}.

  3. For Nuv settled claims, u=0,1,,Dir,v=0,1,,Di+us, the observed payments for them are gathered in set {Yuv1,Yuv2,,YuvNuv}. Denote Fp=u=0Dirv=0Di+us{Yuv1,Yuv2,,YuvNuv}.

Then individual observation Fo is the union of {r,x}, Fr, Fs and Fp, that is Fo={r,x}FrFsFpand the historical observations of all policies in the portfolio, denoted by Fuo, is just the union of policy-specified observation that is Fuo=i=1nk=1miFiko, where Fiko is the policy-specified realisations of Fo in year i that is Fiko={rik,xik}FikrFiksFikp.

It is well known that RBNS and IBNR claims of the risk portfolio naturally result in outstanding liabilities to the insurer. Specifically, the total of future payments for all the RBNS and IBNR claims can be represented as (3) R:=i=1nRirbns+i=1nRiibnr,(3) where Rirbns=k=1miu=0Dirv=niu+1Dsl=1NikuvYikuvlandRiibnr=k=1miu=ni+1Drv=0Dsl=1NikuvYikuvl,are RBNS and IBNR liabilities incurred in year i, respectively. Thoroughly, we take the convention j=j1j2=0 if j1>j2.

3. Model specification

This section separately specifies the models for the reporting developments, settlement developments and payments of claims. In each part, we first give model assumptions and then detail the parameter estimations under both IIM and IDM. The model assumptions in this section are all given under the condition that risk exposure r and covariates x are known.

3.1. Modelling reporting developments of claims

Model assumption for reporting developments of claims is given as follows. It is mainly about the first and second moments of reporting developments of claims. The assumption involves vectors of parameters β,π1,π2,,πDr, which are all d-dimensional vector.

Assumption 3.1

For an individual with (r,x), assume that Nur, u=0,1,,Dr are independent, E[Nur|r,x]=rλu and Var(Nur|r,x)=ϕrλu, where λu=λpu with λ=exp(xβ) and pu(π;x)=exp(xπu)j=0Drexp(xπj), π0=0 as well as π=(π1,π2,,πDr).

Remark 3.1

In order to make π be reasonably estimated, the condition n>Dr is necessary.

By independence among policies and assumption above, one can construct the quasi-likelihood function of reported claims as follows, (4) Qr(β,π)=1ϕi=1nk=1miu=0Dir(Nikurlogλikurikλiku),(4) where λiku is policy-specified quantities of λu that is λiku=λikpiku=exp(xikβ)exp(xikπu)j=0Drexp(xikπj).One can refer to McCullagh and Nelder (Citation1989) for more details about quasi-likelihood theory. Similar to maximum likelihood estimation, parameters (β,π) can be estimated by maximising Qr(β,π) with respect to the parameters. Denote by λ=vec(rikλiku, i=1,,n, k=1,,mi, u=0,,Dir) and stack Nikurs as a vector Nr such that entry Niku is corresponding to rikλiku in vector λ. The quasi-score function, i.e., partial derivatives of Qr(β,π) with respect to the parameters is (5) Qr(β,π):=Qr(β,π)(β,π)=1ϕXrdiag(λ)1(Nrλ),(5) where Xr=λ(β,π). To determine the block entries of Xr, one needs the unit vector δs with 1 at component s (any positive integer) and δ0=0, of which dimensions can be read from context, and the following partial derivatives rikλiku(β,π)=rikλiku(1δupik)xik,where pik=(pik1,pik2,,pikDr) and ⊗ is the Kronecker product.

The covariance matrix of Qr(β,π), which is also the negative expected value of Qr(β,π)(β,π), is (6) Ir(β,π)=1ϕXrdiag(λ)1Xr.(6) The parameters (β,π) are estimated by Newton–Raphson with Fisher scoring starting with initials (βold,πold) and updating estimated parameters in the following way: (βnew,πnew)=(βold,πold)+(X0rdiag(λ0)1X0r)1X0rdiag(λ0)1(Nrλ0),where X0r and λ0 are obtained by replacing (β,π) with (βold,πold). Write the estimated parameters as (βˆ,πˆ). To estimate dispersion parameter ϕ, we adopt conventional method–moment estimation that is, ϕˆ=1u=0Dri=1numi(Dr+1)p×u=0Dri=1nuk=1mi(Nikurrikλˆiku)2rikλˆiku,where λˆikus are plug-in estimates of λiku that is λˆiku=exp(xikβˆ)exp(xikπˆu)j=0Drexp(xikπˆj).IDM considers that policy's feature information has no effect on reporting developments that is the coefficients of x1,x2,,xd1 are thought to be zero. Obviously, IDM is a misspecified model if the feature information indeed influence those developments. Therefore, in IDM, λ and pu are thought to keep fixed among all policies and then λu is same for the policies. By maximising function Qr in (Equation4) with respect to λu, one can obtain that (7) λˆu=N~urr(u),u=0,1,,Dr,(7) where N~ur=i=1nui=1miNikur representing total number of reported claims with reporting delay u and r(u)=i=1nui=1mirik meaning total exposures in the first nu years.

3.2. Modelling settlement delays

In IIM, the settlement developments of individual claims after their reporting to the insurer have the following assumption. The assumption involves vectors of parameters ρ1,ρ2,,ρDr, which are all d-dimensional vector.

Assumption 3.2

Assume that given Nur, (Nu0,Nu1,,NuDs) follows multinomial distribution with parameters Nur and (q0,,qDs), where qv(ρ;x)=exp(xρv)j=0Dsexp(xρj),v=0,,Ds, with ρ0=0, as well as ρ=(ρ1,ρ2,,ρDs) and the tuples (Nu0,Nu1,,NuDs), u=0,,Dr are independent.

Remark 3.2

Similar to the condition in Remark 3.1, the condition n>Ds is necessary to make ρ be reasonably estimated. Therefore, it is enough to assume n>max(Dr,Ds).

For Nur (uDir) reported claims of representative policy in year i, one can only observe Nu0,Nu1,,Nu,Di+us and Nurbns:=v=niu+1DsNuv (the number of RBNS claims with settlement delays no less than niu), where Nurbns=0 if uniDs. According to the assumption above, the individual log-likelihood of settlement developments is (8) Qios(ρ)=v=0Disu=0Di+vrNuvlogqv+u=0DirNurbnslogQ¯niu,(8) where Q¯v:=s=v+1Dsqs is the tail probability of settlement delays no less than v. Obviously, an alternative form of term in the last term in the first line of (Equation8) is u=(niDs+1)+Dir. Further, if we write Nvs=u=0Di+vrNuv, which means number of settled claims with settlement delay v, (Equation8) becomes (9) Qios(ρ)=v=0DisNvslogqv+u=0DirNurbnslogQ¯niu.(9) To estimate ρ by Newton–Raphson with Fisher scoring, we need the identities in the following proposition.

Proposition 3.1

The gradient of Qios(ρ) with respect to ρ is Qios(ρ)ρ=[Nis+u=0DirNurbnsQ¯niu(0q¯niu)Nrq]x,and conditional expectation of Hessian matrix of Qios(ρ) given (r,x) is (10) E[2Qios(ρ)ρρ|r,x]=r[u=0niDsλu(diag(q)qq)+v=(niDr)+(Ds1)(ni)λniv×(diag(qv)qvqvqvq¯vq¯vqvQvQ¯vq¯vq¯v)]xx,(10) where Nis=(N1s,N2s,,NDiss), Nr=u=0DirNur and q=(q1,q2,,qDs),qv=(q1,q2,,qv),q¯v=(qv+1,qv+2,,qDs).

We estimate ρ by maximising overall log-likelihood function Qs(ρ) which is the summation of individual log-likelihood Qikios(ρ), that is ρˆ is obtained as follows: ρˆ=argmaxρQs(ρ),where Qs(ρ)=v=0Dsi=1nvk=1miNikvslogqikv+u=0Dri=(nuDs)++1nuk=1miNikurbnslogQ¯ik,niu.To obtain ρˆ, similar as previous section, we use Newton–Raphson with Fisher scoring which needs the following gradients Qs(ρ) and its covariance matrix Is(ρ), where (11) Qs(ρ):=Qs(ρ)ρ=i=1nk=1miQikios(ρ)ρ,Is(ρ):=i=1nk=1miE[2Qikios(ρ)ρρ|rik,xik].(11)

In IDM, similar as λu in the section above, probabilities (q0,,qDs) are thought to keep fixed among all policies that is (q0,,qDs) is independent of x. By MLE again, we have (12) qˆ0=hˆ0 and qˆv=hˆvs=0v1(1hˆs),v=0,1,,Ds,(12) where hˆv=N~vst=vDsN~ts+t=v+1DsGt with N~vs=i=1nvk=1miNikvsandvGv=i=1nv+1k=1miNik,niv+1rbns.

3.3. Modelling claim payments

We give some assumptions about payments of individual claims as follows. The assumptions involve a (d+Dr+Ds)-dimensional vector of parameters γ.

Assumption 3.3

Claim payments Yuvl, u=0,,Dr, v=0,,Ds, l=1,,Nuv are independent, independent of Nuv; u=0,,Dr, v=0,,Ds and also assume that conditional mean and variance satisfy E[Yuvl|x]=μuv,Var(Yuvl|x)=ϕpμuvwith μuv=exp(xuvγ), where xuv=(x,δu,δv) is a (d+Dr+Ds)-dimensional vector of covariates.

Arrange all settled payments of the risk portfolio into the set {(Yl,x~l),l=1,2,,Nts}, where x~l is covariate associated with payments Yl and Nts is the total number of settled claims. Construct quasi-likelihood by independence among policies and assumption above, (13) Qp(γ)=1ϕpl=1Nts(Yllogμlμl),(13) where μl=exp(x~lγ). Denote μ=(μ1,,μNts) and Y=(Y1,,YNts). The quasi-score function–partial derivatives of Qp(γ) with respect to the parameters is (14) Q˙p(γ):=Qp(γ)γ=1ϕpX~(Yμ),(14) where X~=(x~1,,x~Nts).

The covariance matrix of Q˙(γ), which is also the negative expected value of a˙Q(γ)/γ, is (15) Ip=1ϕpX~diag(μ)X~.(15) The parameters γ are estimated by iteratively re-weighted least square (IRLS) algorithm, which is as follows,

  1. Initialise γˆ=γ0 such that μˆl=exp(x~lγˆ) and μˆ=(μˆ1,μˆ2,,μˆNts), where γ0 is usually zero vector.

  2. Compute adjusted response zl=ylμˆl+X~γˆ.

  3. Update γˆ by what follows, γˆ=(X~diag(μˆ)X~)1X~diag(μˆ)Z,where Z=(z1,z2,,zNts), and then μˆl=exp(x~lγˆ).

To estimate dispersion parameter ϕp, we also adopt conventional method–moment estimation that is, ϕˆp=1Nts(p+Dr+Ds)l=1Nts(Ylμˆl)2μˆl,where μˆl=exp(x~lγˆ).

In IDM, the coefficients of covariates about individual features are considered to be zero, i.e., γ1==γd1=0, and μls only depended on reporting and settlement delays, which means it just needs to estimate γID:=(γ0,γd,,γd1+Dr,,γd1+Dr+Ds) by the similar procedure as stated above. Therefore, estimator γˆID for γID is a maximiser of the Qp, which is the function of γID that is under γ1==γd1=0, and μl in Equation (Equation13) is independent of individual information and only takes one of the following forms: (16) μuvID=exp((1,δu,δv)γID),u=0,1,,Dr,v=0,1,,Ds.(16) Then the estimate of μuvID under IDM is denoted by μˆuv:=exp((1,δu,δv)γˆID), which is a policy-free estimate.

4. Prediction for outstanding liabilities

In this section, the terminologies ‘loss reserve’ and ‘loss reserving’ are precisely specified, measurement of accuracy of loss reserving is then discussed and we also shows the improvement of accuracy of loss reserving basing on IIM with respect to IDM.

4.1. Loss reserve and loss reserving

Recalling the total outstanding liability R defined in (Equation3), by ‘loss reserve’, we refer to the projection (17) Rm=Rm(θ)=E[R|Fuo](17) of R on the observations Fuo by the evaluation date n, where the subscript ‘m’ indicates portfolio size, since loss reserve is based on specific risk portfolio. One can see that Rm is a function of unknown parameters θ:=(β,π,ρ,γ) and hence it needs to be estimated.

To derive moments about outstanding liabilities R and conditional variance of R, the following quantities are needed. For u=0,1,,Dr,v=0,1,,Ds, denote by (18) μ~uv=t=vDsqtμutt=vDsqtandμ~uvs=t=vDsqtμut2t=vDsqt,(18) where μ~uv is conditional moment of claim payments given x, reporting delays u and settlement delays no less than v, so that corresponding policy-specified quantities are (19) μ~ikuv=t=vDsqiktμikutt=vDsqiktandμ~ikuvs=t=vDsqiktμikut2t=vDsqikt.(19) Then we derive the following theorem which provides formulas to compute not only the loss reserve Rm but also variance of outstanding liabilities R given observations Fuo.

Theorem 4.1

Under the model formulated by Assumptions 3.1–3.3, the loss reserve is (20) Rm(θ)=v=1Dsu=0Dvrk=1mnvu+1N(nvu+1)kurbnsμ~(nvu+1)kuv+u=1Dri=nu+1nk=1mirikλikuμ~iku0,(20) and the variance of R given observations Fuo is Var(R|Fuo)=v=1Dsu=0Dvrk=1mnvu+1N(nvu+1)kurbns×(μ~nvu+1,kuvsQ¯nvu+1,k,v1μ~nvu+1,kuvsQ¯nvu+1,k,v1μ~nvu+1,kuv2+ϕpμ~nvu+1,kuv)+u=1Dri=nu+1nk=1mirikλiku×(μ~iku0s+(ϕ1)μ~iku02+ϕpμ~iku0).

It can be clearly seen that loss reserve Rm depends on not only the information from observed data in terms of the number of RBNS claims and policy's feature information but also unknown parameters θ, which results in the need for estimating Rm. Accordingly, the term ‘loss/claims reserving’ is used for certain reasonable estimate of the loss reserve. Formally, after getting certain reasonable estimates θˆ of the unknown parameters from the observed data, as, for example, what has been done in the previous section, we have the following theorem.

Theorem 4.2

By loss reserving we refer to the (random) quantity (21) RˆII=v=1Dsu=0Dvrk=1mnvu+1N(nvu+1)kurbnsμ~ˆ(nvu+1)kuv+u=1Dri=nu+1nk=1mirikλˆikuμ~ˆiku0,(21) where μ~ˆikuvs and λˆikus are obtained by substituting unknown parameters with their estimates.

According to the theorem above, it is easy to obtain loss reserving under IDM by simply replacing μˆikuvs and λˆikus in (Equation21) with policy-free estimates μˆuvs and λˆus, respectively. Specifically, to distinguish two different estimates for reserve, we use symbol RˆID to indicate loss reserving under IDM, which is (22) RˆID=v=1Dsu=0DvrN~nvu+1,urbnsμ~ˆuv+u=1Drr[u]λˆuμ~ˆu0,(22) where N~nvu+1,urbns=k=1mnvu+1Nnvu+1,kurbns, r[u]=i=nu+1nk=1mirik and μ~ˆuv=t=vDsqˆtμˆutt=vDsqˆt.

4.2. Measurement of prediction accuracy

It is essential to measure accuracy of loss reserving and especially accuracy improvement of loss reserving by considering useful individual information with respect to the one without this information. To measure the prediction accuracy of some reserve estimate Rˆ, which is Fuo measurable, a natural idea is conditional mean square error of prediction (MSEP) which is defined as (23) MSEP(R,Rˆ)=E[(RRˆ)2|Fuo]=Var(R|Fuo)+(E[R|Fuo]Rˆ)2.(23) For loss reserving RˆII, which includes individual information, and RˆID without individual information, their MSEPs are MSEP(R,RˆII) and MSEP(R,RˆID), respectively. To measure the difference in prediction accuracy of RˆII and RˆID, we use the following ratio: (24) Mr=MSEP(R,RˆII)MSEP(R,RˆID)=Var(R|Fuo)+(E[R|Fuo]RˆII)2Var(R|Fuo)+(E[R|Fuo]RˆID)2.(24) It is well known that individual information model performs better in terms of prediction accuracy than individual data model, if Mr<1, but it is hard to compute Mr with unknown parameters. Fortunately, we can compare Mr and number 1 when portfolio size m is large enough. It is notable that individual data model is nested in individual information model. Then we have the following theorem under some regular conditions (see Van der Vaart, Citation2000), which illustrates the advantages of individual information model over individual data model.

Theorem 4.3

When portfolio size m tends to infinity, MrP1, where P means converging in probability, if the individual data model is true, that is the coefficients of x1,x2,,xd1 are zero. Otherwise, (25) 1m(RˆIDRm(θ))PΔ=v=1Dsu=0Dvrκnvu+1E[rλuQ¯v1(μ~ˇuvμ~uv)]+u=1Dri=nu+1nκiE[rλu(μ~ˇu0μ~u0)],(25) where μ~ˇuv=s=vDsqˇsμˇus/s=vDsqˇs with (26) qˇv=hˇvs=0v1(1hˇs),hˇv=i=1nvu=0Di+vrκiE[rλuqv]i=1nvu=0Di+vrκiE[rλuQ¯v1],μˇuv=exp((1,δu,δv)γˇID),andγˇID=ArgmaxγIDu=0Drv=0Dsi=1nuvκiE×[rλuqv(μuvlogμuvIDμuvID)],μuvID in (16),(26) and if the asymptotic bias Δ0, MrP0.

The theorem above shows that IIM is asymptotically equivalent to IDM, if IDM is true and otherwise the former has higher prediction accuracy than the latter when portfolio size is large enough. One can intuitively understand that as portfolio size tends to infinity, both models can capture all the information included in observations when IDM holds true, since IIM is a generalised version of IDM. However, individual data model fails to capture the effects of policy's feature information and thus leads to greater bias when IIM holds true.

An important issue one concerns is how much prediction accuracy of loss reserving RˆII can be improved, if IIM holds true, in a fixed risk portfolio that is one cares about actual value of Mr under true IIM. However, there are unknown parameters θ in Var(R|Fuo) and E[R|Fuo]. An approximation method that comes to one's mind is substituting estimated parameters θˆ to them, which however needs to further take estimation error of θˆ into account. We directly use the method named semi-analytical approximation for MSEP(R,Rˆ) (One can refer to Lindholm et al. (Citation2020) for more details), which is also discussed in Wahl (Citation2019) under micro data model. Then the approximations for MSEP(R,RˆII) and MSEP(R,RˆID) are (27) MSEPˆ(R,RˆII)=Var(R|Fuo)(θˆ)+Rm(θˆ)Covˆ(θˆ)Rm(θˆ),MSEPˆ(R,RˆID)=Var(R|Fuo)(θˆ)+Rm(θˆ)Covˆ(θˆ)Rm(θˆ)+(RˆIIRˆID)2,(27) so that (28) Mˆr=Var(R|Fuo)(θˆ)+Rm(θˆ)Covˆ(θˆ)Rm(θˆ)Var(R|Fuo)(θˆ)+Rm(θˆ)Covˆ(θˆ)Rm(θˆ)+(RˆIIRˆID)2,(28) where Rm(θˆ) is the gradient of loss reserve Rm(θ) with respect to θ computed at θˆ and Covˆ(θˆ) is asymptotic covariance of θˆ. It is easily known that Covˆ(θˆ)=diag(ϕˆ(Xˆrdiag(λˆ)1Xˆr)1,(Is(ρˆ))1,ϕˆp(X~X~)1),

where Xˆr and λˆ are plug-in estimates and Is(ρˆ) is obtained by inserting ρˆ into Is(ρ) in Equation (Equation11). One can refer to Chapter 9 in McCullagh and Nelder (Citation1989) for more details.

Proposition 4.4

The gradient of Rm(θ) with respect to (β,π) is Rm(θ)(β,π)=u=1Dri=nu+1nk=1mirikλikuμ~iku0(1δupik)xik,the gradient of Rm(θ) with respect to ρ is Rm(θ)ρ=v=1Dsu=0Dvrk=1miuvNiuvkurbnsQ¯iuvk,v1(0qiku,v1mu)xiuvk+u=1Dri=nu+1nk=1mirikλiku×[diag(qik)μ¯iku0μ~iku0qik]xik,where iuv=nvu+1, qikuvmu=diag(q¯ikv)μ¯ikuvμ~iku,v+1qikv,and μ¯ikuv=(μiku,v+1,,μikuDs), and the gradient of Rm(θ) with respect to γ is Rm(θ)γ=v=1Dsu=0Dvrk=1mnvu+1N(nvu+1)kurbns×μ~˙(nvu+1)kuv+u=1Dri=nu+1nk=1mirikλikuμ~˙iku0,where μ~˙ikuv=t=vDsqiktμikutxikutt=vDsqikt.

5. Simulations and real data analysis

Reported in this section include the results from a few small simulations conducted to further investigate Mr. A real data in health insurance was also analysed to show the application of IIM and the accuracy improvement by using IIM with respect to IDM in practice.

5.1. Simulation

In this simulation, the risk exposures associated with every individuals were drawn from the uniform distribution on [0,1], the covariates were produced by multivariate standard normal distribution and we simulated the random developments of claims for a fixed risk portfolio. In each run, we directly compute Mr according to Equation (Equation24) so that we can know how much accuracy is improved by using IIM with respect to IDM under the fixed risk portfolio.

Because there are only assumptions about mean and variance for reporting developments and payments of claims, we need additional distributional assumptions to generate them, which arise as follows. First, for individual reporting developments Nurs, we generated them by the additional assumption which says that Nurϕ follows Poisson distribution with mean rλuϕ. Second, for individual payments Yl, similarly, we generated them by assuming that Ylϕp follows Poisson distribution with mean μlϕp. Each run in the simulation was conducted with the setting: n = 5, Dr=2, Ds=2, a risk portfolio size m=(10000,10,000,10,000,10,000,10,000), i.e., 10, 000 policies in each year, and any combination of parameters which varied according to the setting in the following two examples.

Example 5.1

Dimension d = 3 and the parameters varied in an auxiliary parameter t ranging in [1,1] by step 0.01 as

  • Parameters for reporting developments: β=(0.5,t,2t),π1=(1,t,t),π2=(1,t,2t),and ϕ=2.

  • Parameters for settlement developments: ρ1=(0.1,0.2t,0.3t) and ρ2=(0.1,0.2t,0.3t).

  • Parameters for payments: γ=(5,0.2t,0.4t,0.1,0.6,0.2,0.8) and ϕp=1.5.

Covariates were produced by bivariate standard normal distribution in this example.

Example 5.2

Dimension d = 4 and parameters varied over t ranging in [1,1] by step 0.01 as

  • Parameters for reporting developments: β=(2,0.2t,0.8t,0.5t),π1=(2,t,3t,2t),π2=(1,2t,t,2t),and ϕ=3.

  • Parameters for settlement developments: ρ1=(0.3,0.1t,0.5t,0.2t),ρ2=(0.2,0.3t,0.7t,0.4t).

  • Parameters for payments: γ=(3,0.6t,0.2t,0.7t,0.3,0.2,0.5,0.4) and ϕp=2.5.

Covariates were produced by ternary standard normal distribution in this example.

In each run, we estimated loss reserve by both IIM and IDM using the simulated data that is we computed RˆII by Equation (Equation21) as well as RˆID by (Equation22) and true parameters were used to compute Var(R|Fuo) and E[R|Fuo] according to Theorem 4.1. Then we computed Mr by inserting the computed RˆII, RˆID, Var(R|Fuo) and E[R|Fuo] into Equation (Equation24). At last, we plotted the simulated results in Figure .

Figure 1. The simulated Mr over varying coefficients of covariates. (a) Example 5.1 and (b) Example 5.2.

Figure 1. The simulated Mr over varying coefficients of covariates. (a) Example 5.1 and (b) Example 5.2.

We obtained the results consistent with Theorem 4.3 from the simulations above.

  1. When the coefficients of x1,x2,,xd1 approach zero, most Mrs are close to real number 1 that is loss reserving by IIM almost has the same accuracy as that by IDM.

  2. When those coefficients are away from zero, Mr tends to be zero that is the prediction accuracy of loss reserving by IIM is greatly improved with respect to IDM.

5.2. Real data analysis

In this section, we analysed a dataset, which was collected by a commercial insurance company in China. The dataset recorded writing and expiring dates of policies, individual information, see Table , and developments of reported claims between 1/1/2019 and 8/31/2019.

Table 1. The individual information in real data analysis.

To visualise the effects of individual information on the developments of claims, for example, the histograms of reporting and settlement delays measured in days were provided under a few combinations of covariate values including gender, geographical location and age, as presented in Figures  and . It was strongly proposed that the individual information had impacts on the distributions of reporting and settlement delays.In the dataset, all the reporting delays were not more than 150 days (5 months). By China Banking and Insurance Regulatory Commission, the reported claims in health insurance are generally required to be settled within 2 months if no disagreement exists. It is appropriate to take 1 month as the time unit (‘accident year’ in previous sections). Thus the maximum reporting and settlement delays were safely set to Dr=5 and Ds=3 (the real data supported this assumption).

Figure 2. Histograms of reporting delays (in days): (a) Female, Region III, age 9–20; (b) Male, Region I, age 45–50; (c) Male, Region VI, age 20–40; (d) Male, Region III, age >55.

Figure 2. Histograms of reporting delays (in days): (a) Female, Region III, age 9–20; (b) Male, Region I, age 45–50; (c) Male, Region VI, age 20–40; (d) Male, Region III, age >55.

Figure 3. Histograms of settlement delays (in days): (a) Female, Region III, age 9–20; (b) Male, Region I, age 45–50; (c) Male, Region VI, age 20–40; (d) Male, Region III, age >55.

Figure 3. Histograms of settlement delays (in days): (a) Female, Region III, age 9–20; (b) Male, Region I, age 45–50; (c) Male, Region VI, age 20–40; (d) Male, Region III, age >55.

To illustrate the proposed model for loss reserving, evaluation date was set as 8/31/2019. That is, we worked with n = 8, Dr=5 and Ds=3 (months). There are four factors organised into eight features x1,,x8, as shown in Table . Besides, reporting and settlement delays, which were regarded as factors to model claim payments as Assumption 3.2 formulated, were respectively organised into five features x9, x10,,x13 and three features x14,x15,x16.

The estimated parameters for the reporting developments under IIM, their standard errors and p-values of significance test were displayed in Table , while the corresponding estimated results under IDM, i.e., λˆu, u=0,1,,5 in (Equation7) are (0.0218,0.0341,0.0133,0.0058,0.0034,0.0022),respectively. Besides, the estimated dispersion parameter ϕˆ=1.9433. These results in () provide obvious evidence that individual information has effects on the reporting developments of claims in sense that most covariates associated with individual information are significant at significance level 0.05.

Similar results for settlement developments and payments are listed in Tables  and . These results also provide obvious evidence that individual information has effects on settlement developments and payments of claims. Besides, the estimated dispersion parameter ϕˆp=15467.1 and the estimates under IDM are (qˆ0,qˆ1,qˆ2,qˆ3)=(0.6435,0.3121,0.0363,0.0081),γˆID=(8.3055,0.4535,0.5830,0.6227,0.58380.4557,0.4204,0.6139,0.8472).

Table 2. Estimated parameters for reporting developments, their standard errors and p-values.

Table 3. Estimated parameters for settlements developments, their standard errors and p-values.

Table 4. Estimated parameters γˆ for payments, their standard errors and p-values.

In Table , the columns with names ‘IBNR’, ‘RBNS’ and ‘Loss reserving’ correspond to estimates of IBNR reserve, RBNS reserve and total loss reserve, respectively. The square roots of approximated conditional MSEPs under IIM and IDM are in the fourth column of Table . The rightmost column in this table showed the computed Mˆr by (Equation28). We can see that loss reserving by IIM provides more stable prediction of outstanding liabilities than that by IDM since the former has smaller conditional MSEP and after incorporating useful individual information into loss reserving, the prediction accuracy is greatly increased by 77.63%.

6. Conclusion

This paper explored the improvement of accuracy in predicting outstanding liabilities, which are incurred by general insurance companies, by incorporating useful individual information into modelling. The reporting developments and payments of individual claims were given weak assumptions about their first two moments and modelled under quasi-likelihood theory, while settlement delays were modelled by multinomial logistic regression. Based on the model specification, loss reserve and conditional variance of outstanding liabilities were derived, which were further used to compute loss reserving and conditional MSEP. It was theoretically proved that loss reserving incorporating useful individual information shows higher accuracy than that under IDM, where the accuracy is measured by the conditional MSEP, when portfolio size is large enough. The conclusion is also supported by the simulations and real data analysis.

Table 5. Reserving, accuracy of prediction and accuracy improvement of IIM with respect to IDM.

While the proposed model is basically a parametric model in statistical context, some one may be concerned with the limitation that the model is subjective and thus question its robustness in practical applications. Regarding this aspect, a possible next step is to study this problem under a nonparametric framework. Especially, it is more interesting to model the dependence of claims development on individual information by machine learning (including deep learning).

Disclosure statement

No potential conflict of interest was reported by the author(s).

Additional information

Funding

This work was supported by the Natural Science Foundation of China (71771089), the Shanghai Philosophy and Social Science Foundation (2015BGL001), the National Social Science Foundation Key Program of China (17ZDA091) and China Scholarship Council (201906140045).

Notes on contributors

Zhigao Wang

Zhigao Wang is a Ph.D. candidate in Statistics at East China Normal University.

Xianyi Wu

Xianyi Wu is a professor in School of Statistics at East China Normal University.

Chunjuan Qiu

Chunjuan Qiu is an associate professor in School of Statistics at East China Normal University.

References

Appendix

Proof

Proof of Proposition 3.1

To derive the following gradient and Hessian matrix, we need the identities qvρ=qv(δvq)x, which gives that qρ=(diag(q)qq)x,logqρ=(IDsq1Ds)x. Then we have the following gradient according to formulas above, which is Qios(ρ)=[v=0DisNvs(δvq)+u=0DirNurbns((0,q¯niu)Q¯niuq)]x=[Nis+u=0DirNurbnsQ¯niu(0q¯niu)(v=0DisNvs+u=0DirNurbns)q]x.By some algebraic computation, it follows that 2Qios(ρ)ρρ=[(v=0DisNvs+u=0DirNurbns)(diag(q)qq)+u=0DirNurbns×(000diag(q¯niu)q¯niuq¯niuQ¯niu)]xx.Because v=0DisNvs+u=0DirNurbns is just the number of those reported claims incurred in accident year i, E[v=0DisNvs+u=0DirNurbns|r,x]=ru=0Dirλu.Observe further that Nurbns=0 for inDs and 0uniDs. Therefore, (A1) E[2Qios(ρ)ρρ|r,x]=r[u=0Dirλu(diag(q)qq)u=(niDs+1)+Dirλu×(000diag(q¯niu)q¯niuq¯niuQ¯niu)]xx.(A1)

Let v = niu and note that diag(q)qq(000diag(q¯v)q¯vq¯vQ¯v)=(diag(qv)qvqvqvq¯vq¯vqvQvQ¯vq¯vq¯v).Then, Equation (EquationA1) gives rise to the desired result.

Proof

Proof of Theorem 4.1

By (Equation3), the loss reserve can be computed as E(R|Fuo)=E[Rrbns|Fuo]+E[Ribnr|Fuo].According to Assumption 3.2, for a representative policy in year i, given Nurbns with niDs+1uDir, (Nu,niu+1,,NuDs) follows multinomial distribution with parameters Nurbns and 1Q¯niu(qniu+1,,qDs). Then by Assumption 3.3, the RBNS loss reserve is E[Rrbns|Fuo]=i=1nk=1miu=(niDs+1)+DirE[v=niu+1Dsl=1NikuvYikuvl|Fuo]=i=1nk=1miu=(niDs+1)+DirNikurbnsv=niu+1DsqikvμikuvQ¯ik,niu=v=1Dsu=0Dvrk=1mnvu+1N(nvu+1)kurbnsμ~(nvu+1)kuv.It can be easily proved that IBNR claims are independent of historical observation Fuo by Assumption 3.1–3.3. Hence, IBNR loss reserve is computed by E[Ribnr|Fuo]=E[i=1nk=1miu=ni+1Drv=0Dsl=1NikuvYikuvl|Fuo]=i=nDr+1nk=1miu=ni+1Drv=0DsE[l=1NikuvYikuvl]=u=1Dri=nu+1nk=1mipikurikexp(xikβ)μ~iku0.According to independence assumptions in Assumptions 3.1–3.3, the developments of RBNS claims are independent of developments of IBNR claims, which results in the independence between Rrbns and Ribnr. Then the variance of R given Fuo is Var(R|Fuo)=Var(Rrbns|Fuo)+Var(Ribnr|Fuo). First, for vniu+1, we compute Var(l=1NuvYuvl|Eo)=Var(E[l=1NuvYuvl|Nuv,Eo])+E[Var(l=1NuvYuvl|Nuv,Eo)]=μuv2Var(Nuv|Fo)+ϕpμuvE[Nuv|Fo]=Nurbns[μuv2qv(1qv)Q¯niu2+ϕpμuvqvQ¯niu].For v1,v2niu+1, compute Cov(l=1Nuv1Yuv1l,l=1Nuv2Yuv2l|Fo) which is equal to Cov(E[l=1Nuv1Yuv1l|Nuv1,Nuv2,Fo],E[l=1Nuv2Yuv2l|Nuv1,Nuv2,Fo])+E[Cov(l=1Nuv1Yuv1l,l=1Nuv2Yuv2l|Nuv1,Nuv2,Fo)],which can be computed as follows: Cov(l=1Nuv1Yuv1l,l=1Nuv2Yuv2l|Fo)=μuv1μuv2Cov(Nuv1|Fo],Nuv2|Fo])=Nurbnsμuv1μuv2qv1qv2Q¯niu2.Then by independence among policies and Assumptions 3.2 and 3.3, the variance of RBNS loss reserve given Fuo is Var(Rrbns|Fuo)=i=1nk=1miVar(u=(niDs+1)+Dirv=niu+1Dsl=1NikuvYikuvl|Fuo)=i=1nk=1miu=(niDs+1)+DirNikurbns(v=niu+1Dsqikvμikuv2Q¯ik,niu2(v=niu+1DsqikvμikuvQ¯ik,niu)2+ϕpv=niu+1DsqikvμikuvQ¯ik,niu)=v=1Dsu=0Dvrk=1mnvu+1Nnvu+1,kurbns(μ~nvu+1,kuvsQ¯nvu+1,k,v1μ~nvu+1,kuvsQ¯nvu+1,k,v1μ~nvu+1,kuv2+ϕpμ~nvu+1,kuv).Because IBNR claims are independent of historical observation Fuo, variance of IBNR loss reserve given Fuo is computed by Var(Ribnr|Fuo)=Var(i=1nk=1miu=ni+1Drv=0Dsl=1NikuvYikuvl)=i=nDr+1nk=1miu=ni+1DrVar(E[v=0Dsl=1NikuvYikuvl|Niku])+E[Var(v=0Dsl=1NikuvYikuvl|Niku)]=u=1Dri=nu+1nk=1mipikurikexp(xikβ)(μ~iku0s+(ϕ1)μ~iku02+ϕpμ~iku0).

Proof

Proof of Theorem 4.3

Expand Rm(θˆ) about true parameters θ by Taylor expansion. Then we have 1m(RˆIIRm)=1mRm(θ))θ(θˆθ)+op(θˆθm).One knows that μ˙uv:=μuvγ=μuvxuv. Write μuv=(μu,v+1,μu,v+2,,μu,Ds) and μ˙uv=μuvγ, u=0,1,,Dr and v=0,,Ds1. To compute the partial derivative in the Taylor expansion above, we need the following partial derivatives: [purexp(xβ)](β,π)=(1δup)(purexp(xβ)x),μ~uvρ=[0,μu,v1(1Q¯v121Q¯v1diag(q¯v1)1Q¯v12q¯v1q¯v1)]xu,v1,μ~u0ρ=[(diag(q)qq)μu0q0μu0q]x,and μ~uvγ=1Q¯v1μu,v1γq¯v1,v1,μ~u0γ=(μu0γ,μu0γ)(q0,q).By the law of large numbers, it can be proved that 1mRm(θ))θa.s.g, where g=(g1,g2,g3), where denoting Muv={(0,μuv(diag(q¯v)1Q¯vq¯vq¯v)),v>0(diag(q¯0)1Q¯0q¯0q¯0)μu0,v=0, (A2) g1=u=1Dri=nu+1nκiE[μ~u0(1δup)(rλux)],g2=u=0DrE[(v=0Ds1κnuvMuv+i=nu+1nκi×[(diag(q)qq)μu0q0μu0q]v=0Ds1)(rλux)],g3=u=0DrE[(v=0Ds1κnuvμ˙uvq¯v+i=nu+1nκi(μ˙u0,μ˙u0)(q0,q)v=0Ds1)(rλuxuv)].(A2)

It is well known that θˆPθ under some regular conditions and hence 1m(RˆIIRm)P0. Besides, Var(R|Fuo)ma.s.VR=v=1Dsu=0Dvrκnvu+1×E[rλuQ¯v1(μ~uvsQ¯v1μ~uv2+ϕpμ~uv)]+u=1Dri=nu+1nκiE[rλu(μ~u0s+(ϕ1)μ~u02+ϕpμ~u0)].If individual data model hold true, one can similarly prove that 1m(RˆIDRm)P0. Therefore, MrP1 in this case. If individual information model holds true, we can easily prove that RˆIDRmm is asymptotically biased, which results from the following arguments. The law of large numbers readily gives hˆva.s.hˇv and μˆuva.s.μˇuv. Further, we have qˆva.s.qˇv:=hˇvs=0v1(1hˇs) and μ~ˆuva.s.μ~ˇuv:=s=vDsqˇsμˇus/s=vDsqˇs. We have RˆIDRmm=RˆIDRˇIDm+RˇIDRmm, where RˇID=v=1Dsu=0Dvrk=1mnuv+1Nnuv+1,kurbnsμ~ˇuv+u=1Dri=nu+1nk=1mirikλˇuμ~ˇu0 and then RˇIDRm=v=1Dsu=0Dvrk=1mnuv+1Nnuv+1,kurbns(μ~ˇuvμ~nuv+1,kuv)+u=1Dri=nu+1nk=1mi(rikλˇuμ~ˇu0rikλikuμ~iku0).Apparently, RˆIDRˇIDma.s.0 and by the law of large numbers and some simple algebra operations, we show that 1m(RˇIDRm)a.s.Δ=v=1Dsu=0Dvrκnvu+1E[rλuQ¯v1(μ~ˇuvμ~uv)]+u=1Dri=nu+1nκiE[rλu(μ~ˇu0μ~u0)].Therefore, if asymptotic bias Δ is not zero, MrP0. Then we complete the proof.