Formulae display: $MathJax Logo$ ?Mathematical formulae have been encoded as MathML and are displayed in this HTML version using MathJax in order to improve their display. Uncheck the box to turn MathJax off. This feature requires Javascript. Click on a formula to zoom.

Abstract

For stochastic loss reserving, we propose an individual information model (IIM) which accommodates not only individual/micro data consisting of incurring times, reporting developments, settlement developments as well as payments of individual claims but also heterogeneity among policies. We give over-dispersed Poisson assumption about the moments of reporting developments and payments of every individual claims. Model estimation is conducted under quasi-likelihood theory. Analytic expressions are derived for the expectation and variance of outstanding liabilities, given historical observations. We utilise conditional mean square error of prediction (MSEP) to measure the accuracy of loss reserving and also theoretically prove that when risk portfolio size is large enough, IIM shows a higher prediction accuracy than individual/micro data model (IDM) in predicting the outstanding liabilities, if the heterogeneity indeed influences claims developments and otherwise IIM is asymptotically equivalent to IDM. Some simulations are conducted to investigate the conditional MSEPs for IIM and IDM. A real data analysis is performed basing on real observations in health insurance.

Keywords:

1. Introduction

In the background of stochastic reserving, loss reserving is referred to a procedure to predict incurred outstanding liabilities in general insurance companies. It is well known that chain-ladder method proposed by Mack (Citation1993) and its related versions can be easily performed by using pencil and paper because of the simple aggregate data structure called run-off triangle and hence are popular in practice. However, as England and Verrall (Citation2002) mentioned, the advantages of aggregate data models are at the cost of prediction accuracy because of information loss caused by simply aggregating individual or micro data, which records incurring time, reporting time, settlement time as well as payment processes of individual claims. In risk management of insurance companies, with the modern computer technology, it is urgent for actuaries to explore the usage of related information to improve the accuracy in predicting the liabilities, which also attracts increasing interests of many scholars from actuarial science. Antonio and Plat (Citation2014), Pigeon et al. (Citation2013, Citation2014) demonstrated by an empirical analysis that loss reserving based on individual data had more prediction accuracy than aggregate models. Huang, Qiu, Wu, Zhou (Citation2015), Huang, Qiu, Wu (Citation2015) and Huang et al. (Citation2016) revealed that individual loss reserving had more accuracy than methods using aggregate data in sense that the former produced a smaller mean square error.

A small stream of earlier literature about IDMs, for example, Arjas (Citation1989) and Norberg (Citation1993, Citation1999) formulated a probabilistic framework for the developments of individual claims. Most recently, Yu and He (Citation2016) modelled the individual claim development process by marked Cox processes (also known as double stochastic processes). As we all know, it is challenging to acquire analytic expressions for the moments of outstanding liabilities under continuous-time IDMs. Perhaps partly for this reason, there is a great deal of work that has been done under discrete-time IDMs, see, e.g., Pigeon et al. (Citation2013, Citation2014), Verrall et al. (Citation2010), Huang, Qiu, Wu, Zhou (Citation2015), Huang, Qiu, Wu (Citation2015) and Huang et al. (Citation2016). Zhao and Zhou (Citation2010) considered the R-delays so as to predict the incurred but not settled outstanding liabilities. Unfortunately, IDMs also confront information loss caused by neglecting individual information, i.e., information from policy or policyholder. It is not clear so far how much accuracy in predicting the outstanding liabilities is sacrificed, when the individual information is neglected. In the present paper, we will explore how much improvement in the accuracy that will be measured by conditional MSEP can be achieved by incorporating the useful individual information into modelling under discrete time framework similar as Huang, Qiu, Wu, Zhou (Citation2015), Huang, Qiu, Wu (Citation2015) and Huang et al. (Citation2016). Besides, we avoid the strong Poisson distribution assumption for the number of individual claims assumed in Huang, Qiu, Wu, Zhou (Citation2015), Huang, Qiu, Wu (Citation2015) and Huang et al. (Citation2016) and instead extend to weak assumptions about the first two moments so that parameters estimation can be conducted under quasi-likelihood theory (cf. McCullagh & Nelder, Citation1989).

The conditional MSEP is broadly used to compare different models for loss reserving. It is well known that the conditional MSEP is the sum of process variance caused by the randomness of outstanding liabilities and estimation error originating from uncertainty of parameters estimators. It is theoretically feasible to estimate it by bootstrap method. There were some examples which discussed the MSEP under collective models – for instance, Mack (Citation1993), Mack (Citation2000) (comparing three methods – Benktander, Bornhuetter–Ferguson and chain-ladder under the criteria of MSEP), Alai et al. (Citation2009, Citation2010) (Bornhuetter–Ferguson method under generalised linear model) and Wüthrich and Merz (Citation2008) (comprehensive summary of the details of methods based on aggregate data). Besides, Lindholm et al. (Citation2020) introduced a semi-analytic approximation method to estimate the conditional MSEP, where the method is illustrated by loss reserving based on aggregate data. Examples that have applied the approximation method are Wahl (Citation2019), who computed explicit moments of outstanding liabilities by applying discretisation scheme under the framework of Antonio and Plat (Citation2014), and Wahl et al. (Citation2019) who modelled individual data on aggregate level. In the present paper, we also use the approximation method for the MSEP, which is derived under IIM, because of its simplification.

The paper is organised as follows. In Section 2, we describe the data structure and display the mathematical expression of outstanding liabilities caused by a risk portfolio at a given evaluation date. In Section 3, we separately model reporting developments, settlement delays and payments of claims and in each part, we formulate the model assumptions as well as estimation for the model parameters. Section 4 mainly derives the formulas of loss reserve and conditional variance of outstanding liabilities given historical observations, and studies the improvement of accuracy achieved by IIM with respect to IDM. Section 5 reports some simulation results and a real data analysis. Section 6 concludes the paper with a few remarks.

2. Data structure

Claim events incurred by some policy are usually reported to the insurer in some time periods (reporting delays) after their occurrence and the reported claims are finally settled with some time lags (settlement delays) between their reports and final settlements. Before going further, it is necessary to discuss the supports of reporting and settlement delays. In the following assumption, we assume that there exists maximum reporting delay $D^{r}$ and settlement delay $D^{s}$ . Actually, there are basically two cases for the supports of the delays: finite and infinite. It would be a known priori (generally read from the items of the insurance contracts) if the supports are finite or infinite before any loss reserving is taken care of. Even for the case the delays take unrestricted values, if the probability to take values over certain limits is quite small, one can safely assume a capped delays by cutting off the tails with probability small enough. As a result, the assumption of capped delays is reasonable in many real insurance businesses, especially for such insurance without very much high claims payments. An example is the general health insurance. The assumption of capped delays has been extensively adopted in such traditional methods as chain-ladder algorithm. If the tails cannot be safely cut off, however, the models such as the one proposed in Crevecoeur et al. (Citation2019) or some others would be more suitable. From the statistical point of view, for their distributions to be reasonably estimated with observations over a finite number of years, at least one of the two assumptions is necessary: they take only a finite number of values with arbitrary probabilities (but subject to normalisation) or countably infinitely many values but with their distribution functions identified by finite many parameters. Whatever the case, the number of unknown parameters that need to be estimated must be finite. Here the former is taken, whereas Crevecoeur et al. (Citation2019), for example, took the latter.

Then we specify the data structure used in our model. It is in discrete time version as, e.g., Huang, Qiu, Wu (Citation2015) did. Typically, the data for modelling is organised through periods with fixed length such as 1 year, one season or 1 month depending on lines of business. Conventionally, those periods are referred to ‘(accident) years’. This is also a way widely adopted by insurers to predict the incurred outstanding liabilities in practice. Specifically, the whole observation horizon is made of n accident years and loss reserving is evaluated at the end of nth accident year. In year i, $i = 1, 2, \dots, n$ , there are $m_{i}$ insurance policies, each of which is coded by $(i, k)$ , $k = 1, 2, \dots, m_{i}$ .

Every individual $(i, k)$ is associated with a random risk exposure $r_{i k}$ and d-dimensional vector of covariate $x_{i k}$ whose first entry is 1 and other entries indicating the individual information/features that influence the developments of individual claims. The developments of claims incurred by individual $(i, k)$ are detailed as follows.

The reporting developments of claims are recorded by $N_{i k u}^{r}$ , $u = 0, 1, \dots, D^{r}$ , where $N_{i k u}^{r}$ is the number of claims which are incurred in year i and reported in year i + u.
For $N_{i k u}^{r}$ claims, $u = 0, 1, \dots, D^{r}$ , their settlement developments are tracked by $N_{i k u v}$ , $v = 0, 1, \dots, D^{s}$ , where $N_{i k u v}$ is the number of claims which are reported in year i + u and settled in year i + u + v.
Payments for each claim are assumed to be paid for only once at its final settlement. For $N_{i k u v}$ claims, $u = 0, 1, \dots, D^{r}, v = 0, 1, \dots, D^{s}$ , we use $Y_{i k u v l}$ , $l = 1, 2, \dots, N_{i k u v}$ to record corresponding payments.

Then the random element associated with individual $(i, k)$ is denoted by $\begin{aligned} {r_{i k}, x_{i k}; {N_{i k u}^{r}; (N_{i k u v}; (Y_{i k u v l})_{l = 1}^{N_{i k u v}})_{v = 0}^{D^{s}}}_{u = 0}^{D^{r}}}, \end{aligned}$ $i = 1, \dots, n, k = 1, \dots, m_{i}$ , which are i.i.d. from the population $\begin{aligned} {r, x; {N_{u}^{r}; (N_{u v}; (Y_{u v l})_{l = 1}^{N_{u v}})_{v = 0}^{D^{s}}}_{u = 0}^{D^{r}}}, \end{aligned}$ which can be considered as a complete observation of a representative policy in year i.

Following conventional terms, a claim, which has been reported to the insurer but not settled, is known as RBNS claim and a claim, which has been incurred but not reported to the insurer, is known as IBNR claim. For accident year i, the individual observed data is as follows.

The reporting developments of a representative policy in year i are truncated in sense that we can only observe $F^{r} = {N_{0}^{r}, N_{1}^{r}, \dots, N_{D_{i}^{r}}^{r}}$ , where (1) $\begin{aligned} D_{i}^{r} = D^{r} \land (n - i), \end{aligned}$ (1) represents the largest reporting delays of the reported claims in accident year i.
For $N_{u}^{r}$ reported claims, $u = 0, 1, \dots, D_{i}^{r}$ , their settlement developments are censored in sense that we can observe ${N_{u 0}, N_{u 1}, \dots, N_{u D_{i + u}^{s}}, N_{u}^{r b n s}}$ , where (2) $\begin{aligned} D_{i + u}^{s} = D^{s} \land (n - i - u), \end{aligned}$ (2) is the largest settlement delays of settled claims with reporting delay u in accident year i and the number $N_{u}^{r b n s} := \sum_{v = n - i - u + 1}^{D^{s}} N_{u v}$ , which is the number of RBNS claims with reporting delays u. Note that $N_{u}^{r b n s} = 0$ if $n - i - u \geq D^{s}$ . Denote by $F^{s} = ⋃_{u = 0}^{D_{i}^{r}} {N_{u 0}, N_{u 1}, \dots, N_{u D_{i + u}^{s}}, N_{u}^{r b n s}}$ .
For $N_{u v}$ settled claims, $u = 0, 1, \dots, D_{i}^{r}, v = 0, 1, \dots, D_{i + u}^{s}$ , the observed payments for them are gathered in set ${Y_{u v 1}, Y_{u v 2}, \dots, Y_{u v N_{u v}}}$ . Denote $F^{p} = ⋃_{u = 0}^{D_{i}^{r}} ⋃_{v = 0}^{D_{i + u}^{s}} {Y_{u v 1}, Y_{u v 2}, \dots, Y_{u v N_{u v}}}$ .

Then individual observation $F^{o}$ is the union of ${r, x}$ , $F^{r}$ , $F^{s}$ and $F^{p}$ , that is $\begin{aligned} F^{o} = {r, x} \cup F^{r} \cup F^{s} \cup F^{p} \end{aligned}$ and the historical observations of all policies in the portfolio, denoted by $F^{u o}$ , is just the union of policy-specified observation that is $F^{u o} = ⋃_{i = 1}^{n} ⋃_{k = 1}^{m_{i}} F_{i k}^{o}$ , where $F_{i k}^{o}$ is the policy-specified realisations of $F^{o}$ in year i that is $\begin{aligned} F_{i k}^{o} = {r_{i k}, x_{i k}} \cup F_{i k}^{r} \cup F_{i k}^{s} \cup F_{i k}^{p} . \end{aligned}$

It is well known that RBNS and IBNR claims of the risk portfolio naturally result in outstanding liabilities to the insurer. Specifically, the total of future payments for all the RBNS and IBNR claims can be represented as (3) $\begin{aligned} R := \sum_{i = 1}^{n} R_{i}^{r b n s} + \sum_{i = 1}^{n} R_{i}^{i b n r}, \end{aligned}$ (3) where $\begin{aligned} R_{i}^{r b n s} & = \sum_{k = 1}^{m_{i}} \sum_{u = 0}^{D_{i}^{r}} \sum_{v = n - i - u + 1}^{D^{s}} \sum_{l = 1}^{N_{i k u v}} Y_{i k u v l} and \\ R_{i}^{i b n r} & = \sum_{k = 1}^{m_{i}} \sum_{u = n - i + 1}^{D^{r}} \sum_{v = 0}^{D^{s}} \sum_{l = 1}^{N_{i k u v}} Y_{i k u v l}, \end{aligned}$ are RBNS and IBNR liabilities incurred in year i, respectively. Thoroughly, we take the convention $\sum_{j = j_{1}}^{j_{2}} \cdot = 0$ if $j_{1} > j_{2}$ .

3. Model specification

This section separately specifies the models for the reporting developments, settlement developments and payments of claims. In each part, we first give model assumptions and then detail the parameter estimations under both IIM and IDM. The model assumptions in this section are all given under the condition that risk exposure r and covariates $x$ are known.

3.1. Modelling reporting developments of claims

Model assumption for reporting developments of claims is given as follows. It is mainly about the first and second moments of reporting developments of claims. The assumption involves vectors of parameters $β, π_{1}, π_{2}, \dots, π_{D^{r}}$ , which are all d-dimensional vector.

Assumption 3.1

For an individual with $(r, x),$ assume that $N_{u}^{r},$ $u = 0, 1, \dots, D^{r}$ are independent, $E [N_{u}^{r} | r, x] = r λ_{u}$ and $V a r (N_{u}^{r} | r, x) = ϕ r λ_{u},$ where $λ_{u} = λ p_{u}$ with $λ = \exp (x^{'} β)$ and $p_{u} (π; x) = \frac{\exp (x^{'} π_{u})}{\sum_{j = 0}^{D^{r}} \exp (x^{'} π_{j})}$ , $π_{0} = 0$ as well as $π^{'} = (π_{1}^{'}, π_{2}^{'}, \dots, π_{D^{r}}^{'})$ .

Remark 3.1

In order to make $π$ be reasonably estimated, the condition $n > D^{r}$ is necessary.

By independence among policies and assumption above, one can construct the quasi-likelihood function of reported claims as follows, (4) $\begin{aligned} Q^{r} (β, π) = \frac{1}{ϕ} \sum_{i = 1}^{n} \sum_{k = 1}^{m_{i}} \sum_{u = 0}^{D_{i}^{r}} (N_{i k u}^{r} \log λ_{i k u} - r_{i k} λ_{i k u}), \end{aligned}$ (4) where $λ_{i k u}$ is policy-specified quantities of $λ_{u}$ that is $\begin{aligned} λ_{i k u} = λ_{i k} p_{i k u} = \exp (x_{i k}^{'} β) \cdot \frac{\exp (x_{i k}^{'} π_{u})}{\sum_{j = 0}^{D^{r}} \exp (x_{i k}^{'} π_{j})} . \end{aligned}$ One can refer to McCullagh and Nelder (Citation1989) for more details about quasi-likelihood theory. Similar to maximum likelihood estimation, parameters $(β, π)$ can be estimated by maximising $Q^{r} (β, π)$ with respect to the parameters. Denote by $λ = v e c (r_{i k} λ_{i k u}, i = 1, \dots, n, k = 1, \dots, m_{i}, u = 0, \dots, D_{i}^{r})$ and stack $N_{i k u}^{r}$ s as a vector $N^{r}$ such that entry $N_{i k u}$ is corresponding to $r_{i k} λ_{i k u}$ in vector $λ$ . The quasi-score function, i.e., partial derivatives of $Q^{r} (β, π)$ with respect to the parameters is (5) $\begin{aligned} \nabla Q^{r} (β, π) := \frac{\partial Q^{r} (β, π)}{\partial (β^{'}, π^{'})^{'}} = \frac{1}{ϕ} X^{r'} d i a g (λ)^{- 1} (N^{r} - λ), \end{aligned}$ (5) where $X^{r} = \frac{\partial λ}{\partial (β^{'}, π^{'})}$ . To determine the block entries of $X^{r}$ , one needs the unit vector $δ_{s}$ with 1 at component s (any positive integer) and $δ_{0} = 0$ , of which dimensions can be read from context, and the following partial derivatives $\begin{aligned} r_{i k} \cdot \frac{\partial λ_{i k u}}{\partial (β^{'}, π^{'})} = r_{i k} λ_{i k u} {(\begin{matrix} 1 \\ δ_{u} - p_{i k} \end{matrix})}^{'} \otimes x_{i k}^{'}, \end{aligned}$ where $p_{i k} = (p_{i k 1}, p_{i k 2}, \dots, p_{i k D^{r}})^{'}$ and ⊗ is the Kronecker product.

The covariance matrix of $\nabla Q^{r} (β, π)$ , which is also the negative expected value of $\frac{\partial \nabla Q^{r} (β, π)}{\partial (β^{'}, π^{'})}$ , is (6) $\begin{aligned} I^{r} (β, π) = \frac{1}{ϕ} X^{r'} d i a g (λ)^{- 1} X^{r} . \end{aligned}$ (6) The parameters $(β, π)$ are estimated by Newton–Raphson with Fisher scoring starting with initials $(β^{o l d}, π^{o l d})$ and updating estimated parameters in the following way: $\begin{aligned} (β^{n e w'}, π^{n e w'})^{'} = (β^{o l d'}, π^{o l d'})^{'} \\ + (X_{0}^{r'} d i a g (λ_{0})^{- 1} X_{0}^{r})^{- 1} X_{0}^{r'} d i a g (λ_{0})^{- 1} (N^{r} - λ_{0}), \end{aligned}$ where $X_{0}^{r}$ and $λ_{0}$ are obtained by replacing $(β, π)$ with $(β^{o l d}, π^{o l d})$ . Write the estimated parameters as $(\hat{β}, \hat{π})$ . To estimate dispersion parameter ϕ, we adopt conventional method–moment estimation that is, $\begin{aligned} \hat{ϕ} & = \frac{1}{\sum_{u = 0}^{D^{r}} \sum_{i = 1}^{n - u} m_{i} - (D^{r} + 1) p} \\ \times \sum_{u = 0}^{D^{r}} \sum_{i = 1}^{n - u} \sum_{k = 1}^{m_{i}} \frac{(N_{i k u}^{r} - r_{i k} {\hat{λ}}_{i k u})^{2}}{r_{i k} {\hat{λ}}_{i k u}}, \end{aligned}$ where ${\hat{λ}}_{i k u}$ s are plug-in estimates of $λ_{i k u}$ that is $\begin{aligned} {\hat{λ}}_{i k u} = \exp (x_{i k}^{'} \hat{β}) \cdot \frac{\exp (x_{i k}^{'} {\hat{π}}_{u})}{\sum_{j = 0}^{D^{r}} \exp (x_{i k}^{'} {\hat{π}}_{j})} . \end{aligned}$ IDM considers that policy's feature information has no effect on reporting developments that is the coefficients of $x_{1}, x_{2}, \dots, x_{d - 1}$ are thought to be zero. Obviously, IDM is a misspecified model if the feature information indeed influence those developments. Therefore, in IDM, λ and $p_{u}$ are thought to keep fixed among all policies and then $λ_{u}$ is same for the policies. By maximising function $Q^{r}$ in (Equation4(4) $\begin{aligned} Q^{r} (β, π) = \frac{1}{ϕ} \sum_{i = 1}^{n} \sum_{k = 1}^{m_{i}} \sum_{u = 0}^{D_{i}^{r}} (N_{i k u}^{r} \log λ_{i k u} - r_{i k} λ_{i k u}), \end{aligned}$ (4) ) with respect to $λ_{u}$ , one can obtain that (7) $\begin{aligned} {\hat{λ}}_{u} = \frac{{\tilde{N}}_{u}^{r}}{r_{(u)}}, u = 0, 1, \dots, D^{r}, \end{aligned}$ (7) where ${\tilde{N}}_{u}^{r} = \sum_{i = 1}^{n - u} \sum_{i = 1}^{m_{i}} N_{i k u}^{r}$ representing total number of reported claims with reporting delay u and $r_{(u)} = \sum_{i = 1}^{n - u} \sum_{i = 1}^{m_{i}} r_{i k}$ meaning total exposures in the first n−u years.

3.2. Modelling settlement delays

In IIM, the settlement developments of individual claims after their reporting to the insurer have the following assumption. The assumption involves vectors of parameters $ρ_{1}, ρ_{2}, \dots, ρ_{D^{r}}$ , which are all d-dimensional vector.

Assumption 3.2

Assume that given $N_{u}^{r},$ $(N_{u 0}, N_{u 1}, \dots, N_{u D^{s}})$ follows multinomial distribution with parameters $N_{u}^{r}$ and $(q_{0}, \dots, q_{D^{s}}),$ where $\begin{aligned} q_{v} (ρ; x) = \frac{\exp (x^{'} ρ_{v})}{\sum_{j = 0}^{D^{s}} \exp (x^{'} ρ_{j})}, v = 0, \dots, D^{s}, \end{aligned}$ with $ρ_{0} = 0$ , as well as $ρ^{'} = (ρ_{1}^{'}, ρ_{2}^{'}, \dots, ρ_{D^{s}}^{'})$ and the tuples $(N_{u 0}, N_{u 1}, \dots, N_{u D^{s}}), u = 0, \dots, D^{r}$ are independent.

Remark 3.2

Similar to the condition in Remark 3.1, the condition $n > D^{s}$ is necessary to make $ρ$ be reasonably estimated. Therefore, it is enough to assume $n > max (D^{r}, D^{s})$ .

For $N_{u}^{r}$ ( $u \leq D_{i}^{r}$ ) reported claims of representative policy in year i, one can only observe $N_{u 0}, N_{u 1}, \dots, N_{u, D_{i + u}^{s}}$ and $N_{u}^{r b n s} := \sum_{v = n - i - u + 1}^{D^{s}} N_{u v}$ (the number of RBNS claims with settlement delays no less than n−i−u), where $N_{u}^{r b n s} = 0$ if $u \leq n - i - D^{s}$ . According to the assumption above, the individual log-likelihood of settlement developments is (8) $\begin{aligned} Q^{i o s} (ρ) = \sum_{v = 0}^{D_{i}^{s}} \sum_{u = 0}^{D_{i + v}^{r}} N_{u v} \log q_{v} + \sum_{u = 0}^{D_{i}^{r}} N_{u}^{r b n s} \log {\bar{Q}}_{n - i - u}, \end{aligned}$ (8) where ${\bar{Q}}_{v} := \sum_{s = v + 1}^{D^{s}} q_{s}$ is the tail probability of settlement delays no less than v. Obviously, an alternative form of term in the last term in the first line of (Equation8(8) $\begin{aligned} Q^{i o s} (ρ) = \sum_{v = 0}^{D_{i}^{s}} \sum_{u = 0}^{D_{i + v}^{r}} N_{u v} \log q_{v} + \sum_{u = 0}^{D_{i}^{r}} N_{u}^{r b n s} \log {\bar{Q}}_{n - i - u}, \end{aligned}$ (8) ) is $\sum_{u = (n - i - D^{s} + 1)_{+}}^{D_{i}^{r}}$ . Further, if we write $N_{v}^{s} = \sum_{u = 0}^{D_{i + v}^{r}} N_{u v}$ , which means number of settled claims with settlement delay v, (Equation8(8) $\begin{aligned} Q^{i o s} (ρ) = \sum_{v = 0}^{D_{i}^{s}} \sum_{u = 0}^{D_{i + v}^{r}} N_{u v} \log q_{v} + \sum_{u = 0}^{D_{i}^{r}} N_{u}^{r b n s} \log {\bar{Q}}_{n - i - u}, \end{aligned}$ (8) ) becomes (9) $\begin{aligned} Q^{i o s} (ρ) = \sum_{v = 0}^{D_{i}^{s}} N_{v}^{s} \log q_{v} + \sum_{u = 0}^{D_{i}^{r}} N_{u}^{r b n s} \log {\bar{Q}}_{n - i - u} . \end{aligned}$ (9) To estimate $ρ$ by Newton–Raphson with Fisher scoring, we need the identities in the following proposition.

Proposition 3.1

The gradient of $Q^{i o s} (ρ)$ with respect to $ρ$ is $\begin{aligned} \frac{\partial Q^{i o s} (ρ)}{ρ} \\ = [N_{i}^{s} + \sum_{u = 0}^{D_{i}^{r}} \frac{N_{u}^{r b n s}}{{\bar{Q}}_{n - i - u}} (\begin{matrix} 0 \\ {\bar{q}}_{n - i - u} \end{matrix}) - N^{r} q] \otimes x, \end{aligned}$ and conditional expectation of Hessian matrix of $Q^{i o s} (ρ)$ given $(r, x)$ is (10) $\begin{aligned} E [\frac{\partial^{2} Q^{i o s} (ρ)}{\partial ρ ρ^{'}} | r, x] = r [\sum_{u = 0}^{n - i - D^{s}} λ_{u} (d i a g (q) - q q^{'}) \\ + \sum_{v = (n - i - D^{r})_{+}}^{(D^{s} - 1) \land (n - i)} λ_{n - i - v} \\ \times (\begin{array}{cc} d i a g (q_{v}) - q_{v} q_{v}^{'} & - q_{v} {\bar{q}}_{v}^{'} \\ - {\bar{q}}_{v} q_{v}^{'} & \frac{Q_{v}}{{\bar{Q}}_{v}} {\bar{q}}_{v} {\bar{q}}_{v}^{'} \end{array})] \otimes x x^{'}, \end{aligned}$ (10) where $N_{i}^{s} = (N_{1}^{s}, N_{2}^{s}, \dots, N_{D_{i}^{s}}^{s})^{'},$ $N^{r} = \sum_{u = 0}^{D_{i}^{r}} N_{u}^{r}$ and $\begin{aligned} q & = (q_{1}, q_{2}, \dots, q_{D^{s}})^{'}, q_{v} = (q_{1}, q_{2}, \dots, q_{v})^{'}, \\ {\bar{q}}_{v} & = (q_{v + 1}, q_{v + 2}, \dots, q_{D^{s}})^{'} . \end{aligned}$

We estimate $ρ$ by maximising overall log-likelihood function $Q^{s} (ρ)$ which is the summation of individual log-likelihood $Q_{i k}^{i o s} (ρ)$ , that is $\hat{ρ}$ is obtained as follows: $\begin{aligned} \hat{ρ} = \underset{ρ}{\arg max} Q^{s} (ρ), \end{aligned}$ where $\begin{aligned} Q^{s} (ρ) & = \sum_{v = 0}^{D^{s}} \sum_{i = 1}^{n - v} \sum_{k = 1}^{m_{i}} N_{i k v}^{s} \log q_{i k v} \\ + \sum_{u = 0}^{D^{r}} \sum_{i = (n - u - D^{s})_{+} + 1}^{n - u} \sum_{k = 1}^{m_{i}} N_{i k u}^{r b n s} \log {\bar{Q}}_{i k, n - i - u} . \end{aligned}$ To obtain $\hat{ρ}$ , similar as previous section, we use Newton–Raphson with Fisher scoring which needs the following gradients $\nabla Q^{s} (ρ)$ and its covariance matrix $I^{s} (ρ)$ , where (11) $\begin{aligned} \nabla Q^{s} (ρ) & := \frac{\partial Q^{s} (ρ)}{\partial ρ} = \sum_{i = 1}^{n} \sum_{k = 1}^{m_{i}} \frac{\partial Q_{i k}^{i o s} (ρ)}{ρ}, \\ I^{s} (ρ) & := \sum_{i = 1}^{n} \sum_{k = 1}^{m_{i}} E [\frac{\partial^{2} Q_{i k}^{i o s} (ρ)}{\partial ρ ρ^{'}} | r_{i k}, x_{i k}] . \end{aligned}$ (11)

In IDM, similar as $λ_{u}$ in the section above, probabilities $(q_{0}, \dots, q_{D^{s}})$ are thought to keep fixed among all policies that is $(q_{0}, \dots, q_{D^{s}})$ is independent of $x$ . By MLE again, we have (12) $\begin{aligned} {\hat{q}}_{0} = {\hat{h}}_{0} and {\hat{q}}_{v} = {\hat{h}}_{v} \prod_{s = 0}^{v - 1} (1 - {\hat{h}}_{s}), v = 0, 1, \dots, D^{s}, \end{aligned}$ (12) where ${\hat{h}}_{v} = \frac{{\tilde{N}}_{v}^{s}}{\sum_{t = v}^{D^{s}} {\tilde{N}}_{t}^{s} + \sum_{t = v + 1}^{D^{s}} G_{t}}$ with $\begin{aligned} {\tilde{N}}_{v}^{s} = \sum_{i = 1}^{n - v} \sum_{k = 1}^{m_{i}} N_{i k v}^{s} and v G_{v} = \sum_{i = 1}^{n - v + 1} \sum_{k = 1}^{m_{i}} N_{i k, n - i - v + 1}^{r b n s} . \end{aligned}$

3.3. Modelling claim payments

We give some assumptions about payments of individual claims as follows. The assumptions involve a $(d + D^{r} + D^{s})$ -dimensional vector of parameters $γ$ .

Assumption 3.3

Claim payments $Y_{u v l},$ $u = 0, \dots, D^{r},$ $v = 0, \dots, D^{s}, l = 1, \dots, N_{u v}$ are independent, independent of $N_{u v}; u = 0, \dots, D^{r}, v = 0, \dots, D^{s}$ and also assume that conditional mean and variance satisfy $\begin{aligned} E [Y_{u v l} | x] = μ_{u v}, V a r (Y_{u v l} | x) = ϕ^{p} μ_{u v} \end{aligned}$ with $μ_{u v} = \exp (x_{u v}^{'} γ),$ where $x_{u v} = (x^{'}, δ_{u}^{'}, δ_{v}^{'})^{'}$ is a $(d + D^{r} + D^{s})$ -dimensional vector of covariates.

Arrange all settled payments of the risk portfolio into the set ${(Y_{l}, {\tilde{x}}_{l}), l = 1, 2, \dots, N^{t s}}$ , where ${\tilde{x}}_{l}$ is covariate associated with payments $Y_{l}$ and $N^{t s}$ is the total number of settled claims. Construct quasi-likelihood by independence among policies and assumption above, (13) $\begin{aligned} Q^{p} (γ) = \frac{1}{ϕ^{p}} \sum_{l = 1}^{N^{t s}} (Y_{l} \log μ_{l} - μ_{l}), \end{aligned}$ (13) where $μ_{l} = \exp ({\tilde{x}}_{l}^{'} γ)$ . Denote $μ = (μ_{1}, \dots, μ_{N^{t s}})^{'}$ and $Y = (Y_{1}, \dots, Y_{N^{t s}})^{'}$ . The quasi-score function–partial derivatives of $Q^{p} (γ)$ with respect to the parameters is (14) $\begin{aligned} {\dot{Q}}^{p} (γ) := \frac{\partial Q^{p} (γ)}{\partial γ} = \frac{1}{ϕ^{p}} {\tilde{X}}^{'} (Y - μ), \end{aligned}$ (14) where $\tilde{X} = ({\tilde{x}}_{1}, \dots, {\tilde{x}}_{N^{t s}})^{'}$ .

The covariance matrix of $\dot{Q} (γ)$ , which is also the negative expected value of $\partial \dot{} Q (γ) / \partial γ^{'}$ , is (15) $\begin{aligned} I^{p} = \frac{1}{ϕ^{p}} {\tilde{X}}^{'} d i a g (μ) \tilde{X} . \end{aligned}$ (15) The parameters $γ$ are estimated by iteratively re-weighted least square (IRLS) algorithm, which is as follows,

Initialise $\hat{γ} = γ_{0}$ such that ${\hat{μ}}_{l} = \exp ({\tilde{x}}_{l}^{'} \hat{γ})$ and $\hat{μ} = ({\hat{μ}}_{1}, {\hat{μ}}_{2}, \dots, {\hat{μ}}_{N^{t s}})$ , where $γ_{0}$ is usually zero vector.
Compute adjusted response $z_{l} = y_{l} - {\hat{μ}}_{l} + \tilde{X} \hat{γ}$ .
Update $\hat{γ}$ by what follows, $\begin{aligned} \hat{γ} = ({\tilde{X}}^{'} d i a g (\hat{μ}) \tilde{X})^{- 1} {\tilde{X}}^{'} d i a g (\hat{μ}) Z, \end{aligned}$ where $Z = (z_{1}, z_{2}, \dots, z_{N^{t s}})$ , and then ${\hat{μ}}_{l} = \exp ({\tilde{x}}_{l}^{'} \hat{γ})$ .

To estimate dispersion parameter $ϕ^{p}$ , we also adopt conventional method–moment estimation that is, $\begin{aligned} {\hat{ϕ}}^{p} = \frac{1}{N^{t s} - (p + D^{r} + D^{s})} \sum_{l = 1}^{N^{t s}} \frac{(Y_{l} - {\hat{μ}}_{l})^{2}}{{\hat{μ}}_{l}}, \end{aligned}$ where ${\hat{μ}}_{l} = \exp ({\tilde{x}}_{l}^{'} \hat{γ})$ .

In IDM, the coefficients of covariates about individual features are considered to be zero, i.e., $γ_{1} = \dots = γ_{d - 1} = 0$ , and $μ_{l}$ s only depended on reporting and settlement delays, which means it just needs to estimate $γ^{I D} := (γ_{0}, γ_{d}, \dots, γ_{d - 1 + D^{r}}, \dots, γ_{d - 1 + D^{r} + D^{s}})^{'}$ by the similar procedure as stated above. Therefore, estimator ${\hat{γ}}^{I D}$ for $γ^{I D}$ is a maximiser of the $Q^{p}$ , which is the function of $γ^{I D}$ that is under $γ_{1} = \dots = γ_{d - 1} = 0$ , and $μ_{l}$ in Equation (Equation13(13) $\begin{aligned} Q^{p} (γ) = \frac{1}{ϕ^{p}} \sum_{l = 1}^{N^{t s}} (Y_{l} \log μ_{l} - μ_{l}), \end{aligned}$ (13) ) is independent of individual information and only takes one of the following forms: (16) $\begin{aligned} μ_{u v}^{I D} & = \exp ((1, δ_{u}^{'}, δ_{v}^{'}) γ^{I D}), \\ u & = 0, 1, \dots, D^{r}, v = 0, 1, \dots, D^{s} . \end{aligned}$ (16) Then the estimate of $μ_{u v}^{I D}$ under IDM is denoted by ${\hat{μ}}_{u v} := \exp ((1, δ_{u}^{'}, δ_{v}^{'}) {\hat{γ}}^{I D})$ , which is a policy-free estimate.

4. Prediction for outstanding liabilities

In this section, the terminologies ‘loss reserve’ and ‘loss reserving’ are precisely specified, measurement of accuracy of loss reserving is then discussed and we also shows the improvement of accuracy of loss reserving basing on IIM with respect to IDM.

4.1. Loss reserve and loss reserving

Recalling the total outstanding liability R defined in (Equation3(3) $\begin{aligned} R := \sum_{i = 1}^{n} R_{i}^{r b n s} + \sum_{i = 1}^{n} R_{i}^{i b n r}, \end{aligned}$ (3) ), by ‘loss reserve’, we refer to the projection (17) $\begin{aligned} R_{m} = R_{m} (θ) = E [R | F^{u o}] \end{aligned}$ (17) of R on the observations $F^{u o}$ by the evaluation date n, where the subscript ‘ $m$ ’ indicates portfolio size, since loss reserve is based on specific risk portfolio. One can see that $R_{m}$ is a function of unknown parameters $θ := (β^{'}, π^{'}, ρ^{'}, γ^{'})^{'}$ and hence it needs to be estimated.

To derive moments about outstanding liabilities R and conditional variance of R, the following quantities are needed. For $u = 0, 1, \dots, D^{r}, v = 0, 1, \dots, D^{s}$ , denote by (18) $\begin{aligned} {\tilde{μ}}_{u v} = \frac{\sum_{t = v}^{D^{s}} q_{t} μ_{u t}}{\sum_{t = v}^{D^{s}} q_{t}} and {\tilde{μ}}_{u v}^{s} = \frac{\sum_{t = v}^{D^{s}} q_{t} μ_{u t}^{2}}{\sum_{t = v}^{D^{s}} q_{t}}, \end{aligned}$ (18) where ${\tilde{μ}}_{u v}$ is conditional moment of claim payments given $x$ , reporting delays u and settlement delays no less than v, so that corresponding policy-specified quantities are (19) $\begin{aligned} {\tilde{μ}}_{i k u v} = \frac{\sum_{t = v}^{D^{s}} q_{i k t} μ_{i k u t}}{\sum_{t = v}^{D^{s}} q_{i k t}} and {\tilde{μ}}_{i k u v}^{s} = \frac{\sum_{t = v}^{D^{s}} q_{i k t} μ_{i k u t}^{2}}{\sum_{t = v}^{D^{s}} q_{i k t}} . \end{aligned}$ (19) Then we derive the following theorem which provides formulas to compute not only the loss reserve $R_{m}$ but also variance of outstanding liabilities R given observations $F^{u o}$ .

Theorem 4.1

Under the model formulated by Assumptions 3.1–3.3, the loss reserve is (20) $\begin{aligned} R_{m} (θ) & = \sum_{v = 1}^{D^{s}} \sum_{u = 0}^{D_{v}^{r}} \sum_{k = 1}^{m_{n - v - u + 1}} N_{(n - v - u + 1) k u}^{r b n s} {\tilde{μ}}_{(n - v - u + 1) k u v} \\ + \sum_{u = 1}^{D^{r}} \sum_{i = n - u + 1}^{n} \sum_{k = 1}^{m_{i}} r_{i k} λ_{i k u} {\tilde{μ}}_{i k u 0}, \end{aligned}$ (20) and the variance of R given observations $F^{u o}$ is $\begin{aligned} V a r (R | F^{u o}) & = \sum_{v = 1}^{D^{s}} \sum_{u = 0}^{D_{v}^{r}} \sum_{k = 1}^{m_{n - v - u + 1}} N_{(n - v - u + 1) k u}^{r b n s} \\ \times (\frac{{\tilde{μ}}_{n - v - u + 1, k u v}^{s}}{{\bar{Q}}_{n - v - u + 1, k, v - 1}} \\ - {\tilde{μ}}_{n - v - u + 1, k u v}^{2} + ϕ^{p} {\tilde{μ}}_{n - v - u + 1, k u v}) \\ + \sum_{u = 1}^{D^{r}} \sum_{i = n - u + 1}^{n} \sum_{k = 1}^{m_{i}} r_{i k} λ_{i k u} \\ \times ({\tilde{μ}}_{i k u 0}^{s} + (ϕ - 1) {\tilde{μ}}_{i k u 0}^{2} + ϕ^{p} {\tilde{μ}}_{i k u 0}) . \end{aligned}$

It can be clearly seen that loss reserve $R_{m}$ depends on not only the information from observed data in terms of the number of RBNS claims and policy's feature information but also unknown parameters $θ$ , which results in the need for estimating $R_{m}$ . Accordingly, the term ‘loss/claims reserving’ is used for certain reasonable estimate of the loss reserve. Formally, after getting certain reasonable estimates $\hat{θ}$ of the unknown parameters from the observed data, as, for example, what has been done in the previous section, we have the following theorem.

Theorem 4.2

By loss reserving we refer to the (random) quantity (21) $\begin{aligned} {\hat{R}}_{I I} & = \sum_{v = 1}^{D^{s}} \sum_{u = 0}^{D_{v}^{r}} \sum_{k = 1}^{m_{n - v - u + 1}} N_{(n - v - u + 1) k u}^{r b n s} {\hat{\tilde{μ}}}_{(n - v - u + 1) k u v} \\ + \sum_{u = 1}^{D^{r}} \sum_{i = n - u + 1}^{n} \sum_{k = 1}^{m_{i}} r_{i k} {\hat{λ}}_{i k u} {\hat{\tilde{μ}}}_{i k u 0}, \end{aligned}$ (21) where ${\hat{\tilde{μ}}}_{i k u v}$ s and ${\hat{λ}}_{i k u}$ s are obtained by substituting unknown parameters with their estimates.

According to the theorem above, it is easy to obtain loss reserving under IDM by simply replacing ${\hat{μ}}_{i k u v}$ s and ${\hat{λ}}_{i k u}$ s in (Equation21(21) $\begin{aligned} {\hat{R}}_{I I} & = \sum_{v = 1}^{D^{s}} \sum_{u = 0}^{D_{v}^{r}} \sum_{k = 1}^{m_{n - v - u + 1}} N_{(n - v - u + 1) k u}^{r b n s} {\hat{\tilde{μ}}}_{(n - v - u + 1) k u v} \\ + \sum_{u = 1}^{D^{r}} \sum_{i = n - u + 1}^{n} \sum_{k = 1}^{m_{i}} r_{i k} {\hat{λ}}_{i k u} {\hat{\tilde{μ}}}_{i k u 0}, \end{aligned}$ (21) ) with policy-free estimates ${\hat{μ}}_{u v}$ s and ${\hat{λ}}_{u}$ s, respectively. Specifically, to distinguish two different estimates for reserve, we use symbol ${\hat{R}}_{I D}$ to indicate loss reserving under IDM, which is (22) $\begin{aligned} {\hat{R}}_{I D} = \sum_{v = 1}^{D^{s}} \sum_{u = 0}^{D_{v}^{r}} {\tilde{N}}_{n - v - u + 1, u}^{r b n s} {\hat{\tilde{μ}}}_{u v} + \sum_{u = 1}^{D^{r}} r_{[u]} {\hat{λ}}_{u} {\hat{\tilde{μ}}}_{u 0}, \end{aligned}$ (22) where ${\tilde{N}}_{n - v - u + 1, u}^{r b n s} = \sum_{k = 1}^{m_{n - v - u + 1}} N_{n - v - u + 1, k u}^{r b n s}$ , $r_{[u]} = \sum_{i = n - u + 1}^{n} \sum_{k = 1}^{m_{i}} r_{i k}$ and ${\hat{\tilde{μ}}}_{u v} = \frac{\sum_{t = v}^{D^{s}} {\hat{q}}_{t} {\hat{μ}}_{u t}}{\sum_{t = v}^{D^{s}} {\hat{q}}_{t}} .$

4.2. Measurement of prediction accuracy

It is essential to measure accuracy of loss reserving and especially accuracy improvement of loss reserving by considering useful individual information with respect to the one without this information. To measure the prediction accuracy of some reserve estimate $\hat{R}$ , which is $F^{u o}$ measurable, a natural idea is conditional mean square error of prediction (MSEP) which is defined as (23) $\begin{aligned} M S E P (R, \hat{R}) & = E [(R - \hat{R})^{2} | F^{u o}] \\ = V a r (R | F^{u o}) + (E [R | F^{u o}] - \hat{R})^{2} . \end{aligned}$ (23) For loss reserving ${\hat{R}}_{I I}$ , which includes individual information, and ${\hat{R}}_{I D}$ without individual information, their MSEPs are $M S E P (R, {\hat{R}}_{I I})$ and $M S E P (R, {\hat{R}}_{I D})$ , respectively. To measure the difference in prediction accuracy of ${\hat{R}}_{I I}$ and ${\hat{R}}_{I D}$ , we use the following ratio: (24) $\begin{aligned} M^{r} & = \frac{M S E P (R, {\hat{R}}_{I I})}{M S E P (R, {\hat{R}}_{I D})} \\ = \frac{V a r (R | F^{u o}) + (E [R | F^{u o}] - {\hat{R}}_{I I})^{2}}{V a r (R | F^{u o}) + (E [R | F^{u o}] - {\hat{R}}_{I D})^{2}} . \end{aligned}$ (24) It is well known that individual information model performs better in terms of prediction accuracy than individual data model, if $M^{r} < 1$ , but it is hard to compute $M^{r}$ with unknown parameters. Fortunately, we can compare $M^{r}$ and number 1 when portfolio size m is large enough. It is notable that individual data model is nested in individual information model. Then we have the following theorem under some regular conditions (see Van der Vaart, Citation2000), which illustrates the advantages of individual information model over individual data model.

Theorem 4.3

When portfolio size m tends to infinity, $M^{r} \overset{P}{\to} 1,$ where $\overset{P}{\to}$ means converging in probability, if the individual data model is true, that is the coefficients of $x_{1}, x_{2}, \dots, x_{d - 1}$ are zero. Otherwise, (25) $\begin{aligned} \frac{1}{m} ({\hat{R}}_{I D} - R_{m} (θ)) \overset{P}{\to} Δ \\ = \sum_{v = 1}^{D^{s}} \sum_{u = 0}^{D_{v}^{r}} κ_{n - v - u + 1} E [r λ_{u} {\bar{Q}}_{v - 1} ({\overset{ˇ}{\tilde{μ}}}_{u v} - {\tilde{μ}}_{u v})] \\ + \sum_{u = 1}^{D^{r}} \sum_{i = n - u + 1}^{n} κ_{i} E [r λ_{u} ({\overset{ˇ}{\tilde{μ}}}_{u 0} - {\tilde{μ}}_{u 0})], \end{aligned}$ (25) where ${\overset{ˇ}{\tilde{μ}}}_{u v} = \sum_{s = v}^{D^{s}} {\overset{ˇ}{q}}_{s} {\overset{ˇ}{μ}}_{u s} / \sum_{s = v}^{D^{s}} {\overset{ˇ}{q}}_{s}$ with (26) $\begin{aligned} {\overset{ˇ}{q}}_{v} & = {\overset{ˇ}{h}}_{v} \prod_{s = 0}^{v - 1} (1 - {\overset{ˇ}{h}}_{s}), \\ {\overset{ˇ}{h}}_{v} & = \frac{\sum_{i = 1}^{n - v} \sum_{u = 0}^{D_{i + v}^{r}} κ_{i} E [r λ_{u} q_{v}]}{\sum_{i = 1}^{n - v} \sum_{u = 0}^{D_{i + v}^{r}} κ_{i} E [r λ_{u} {\bar{Q}}_{v - 1}]}, \\ {\overset{ˇ}{μ}}_{u v} & = \exp ((1, δ_{u}^{'}, δ_{v}^{'}) {\overset{ˇ}{γ}}^{I D}), and \\ {\overset{ˇ}{γ}}^{I D} & = \underset{γ^{I D}}{Argmax} \sum_{u = 0}^{D^{r}} \sum_{v = 0}^{D^{s}} \sum_{i = 1}^{n - u - v} κ_{i} E \\ \times [r λ_{u} q_{v} (μ_{u v} \log μ_{u v}^{I D} - μ_{u v}^{I D})], μ_{u v}^{I D} i n (16), \end{aligned}$ (26) and if the asymptotic bias $Δ \neq 0,$ $M^{r} \overset{P}{\to} 0$ .

The theorem above shows that IIM is asymptotically equivalent to IDM, if IDM is true and otherwise the former has higher prediction accuracy than the latter when portfolio size is large enough. One can intuitively understand that as portfolio size tends to infinity, both models can capture all the information included in observations when IDM holds true, since IIM is a generalised version of IDM. However, individual data model fails to capture the effects of policy's feature information and thus leads to greater bias when IIM holds true.

An important issue one concerns is how much prediction accuracy of loss reserving ${\hat{R}}_{I I}$ can be improved, if IIM holds true, in a fixed risk portfolio that is one cares about actual value of $M^{r}$ under true IIM. However, there are unknown parameters $θ$ in $V a r (R | F^{u o})$ and $E [R | F^{u o}]$ . An approximation method that comes to one's mind is substituting estimated parameters $\hat{θ}$ to them, which however needs to further take estimation error of $\hat{θ}$ into account. We directly use the method named semi-analytical approximation for $M S E P (R, \hat{R})$ (One can refer to Lindholm et al. (Citation2020) for more details), which is also discussed in Wahl (Citation2019) under micro data model. Then the approximations for $M S E P (R, {\hat{R}}_{I I})$ and $M S E P (R, {\hat{R}}_{I D})$ are (27) $\begin{aligned} \hat{M S E P} (R, {\hat{R}}_{I I}) & = V a r (R | F^{u o}) (\hat{θ}) \\ + \nabla R_{m} (\hat{θ})^{'} \hat{C o v} (\hat{θ}) \nabla R_{m} (\hat{θ}), \\ \hat{M S E P} (R, {\hat{R}}_{I D}) & = V a r (R | F^{u o}) (\hat{θ}) \\ + \nabla R_{m} (\hat{θ})^{'} \hat{C o v} (\hat{θ}) \nabla R_{m} (\hat{θ}) \\ + ({\hat{R}}_{I I} - {\hat{R}}_{I D})^{2}, \end{aligned}$ (27) so that (28) $\begin{aligned} {\hat{M}}^{r} = \frac{V a r (R | F^{u o}) (\hat{θ}) + \nabla R_{m} (\hat{θ})^{'} \hat{C o v} (\hat{θ}) \nabla R_{m} (\hat{θ})}{\begin{matrix} V a r (R | F^{u o}) (\hat{θ}) + \nabla R_{m} (\hat{θ})^{'} \hat{C o v} (\hat{θ}) \nabla R_{m} (\hat{θ}) \\ + ({\hat{R}}_{I I} - {\hat{R}}_{I D})^{2} \end{matrix}}, \end{aligned}$ (28) where $\nabla R_{m} (\hat{θ})$ is the gradient of loss reserve $R_{m} (θ)$ with respect to $θ$ computed at $\hat{θ}$ and $\hat{C o v} (\hat{θ})$ is asymptotic covariance of $\hat{θ}$ . It is easily known that $\begin{aligned} \hat{C o v} (\hat{θ}) = d i a g (\hat{ϕ} ({\hat{X}}^{r'} d i a g (\hat{λ})^{- 1} {\hat{X}}^{r})^{- 1}, (I^{s} (\hat{ρ}))^{- 1}, {\hat{ϕ}}^{p} ({\tilde{X}}^{'} \tilde{X})^{- 1}), \end{aligned}$

where ${\hat{X}}^{r}$ and $\hat{λ}$ are plug-in estimates and $I^{s} (\hat{ρ})$ is obtained by inserting $\hat{ρ}$ into $I^{s} (ρ)$ in Equation (Equation11(11) $\begin{aligned} \nabla Q^{s} (ρ) & := \frac{\partial Q^{s} (ρ)}{\partial ρ} = \sum_{i = 1}^{n} \sum_{k = 1}^{m_{i}} \frac{\partial Q_{i k}^{i o s} (ρ)}{ρ}, \\ I^{s} (ρ) & := \sum_{i = 1}^{n} \sum_{k = 1}^{m_{i}} E [\frac{\partial^{2} Q_{i k}^{i o s} (ρ)}{\partial ρ ρ^{'}} | r_{i k}, x_{i k}] . \end{aligned}$ (11) ). One can refer to Chapter 9 in McCullagh and Nelder (Citation1989) for more details.

Proposition 4.4

The gradient of $R_{m} (θ)$ with respect to $(β^{'}, π^{'})^{'}$ is $\begin{aligned} \frac{\partial R_{m} (θ)}{\partial (β^{'}, π^{'})^{'}} \\ = \sum_{u = 1}^{D^{r}} \sum_{i = n - u + 1}^{n} \sum_{k = 1}^{m_{i}} r_{i k} λ_{i k u} {\tilde{μ}}_{i k u 0} (\begin{matrix} 1 \\ δ_{u} - p_{i k} \end{matrix}) \otimes x_{i k}, \end{aligned}$ the gradient of $R_{m} (θ)$ with respect to $ρ$ is $\begin{aligned} \frac{\partial R_{m} (θ)}{\partial ρ} & = \sum_{v = 1}^{D^{s}} \sum_{u = 0}^{D_{v}^{r}} \sum_{k = 1}^{m_{i_{u v}}} \frac{N_{i_{u v} k u}^{r b n s}}{{\bar{Q}}_{i_{u v} k, v - 1}} (\begin{matrix} 0 \\ q_{i k u, v - 1}^{m u} \end{matrix}) \otimes x_{i_{u v} k} \\ + \sum_{u = 1}^{D^{r}} \sum_{i = n - u + 1}^{n} \sum_{k = 1}^{m_{i}} r_{i k} λ_{i k u} \\ \times [d i a g (q_{i k}) {\bar{μ}}_{i k u 0} - {\tilde{μ}}_{i k u 0} q_{i k}] \otimes x_{i k}, \end{aligned}$ where $i_{u v} = n - v - u + 1$ , $\begin{aligned} q_{i k u v}^{m u} = d i a g ({\bar{q}}_{i k v}) {\bar{μ}}_{i k u v} - {\tilde{μ}}_{i k u, v + 1} q_{i k v}, \end{aligned}$ and ${\bar{μ}}_{i k u v} = (μ_{i k u, v + 1}, \dots, μ_{i k u D^{s}})^{'}$ , and the gradient of $R_{m} (θ)$ with respect to $γ$ is $\begin{aligned} \frac{\partial R_{m} (θ)}{\partial γ} & = \sum_{v = 1}^{D^{s}} \sum_{u = 0}^{D_{v}^{r}} \sum_{k = 1}^{m_{n - v - u + 1}} N_{(n - v - u + 1) k u}^{r b n s} \\ \times {\dot{\tilde{μ}}}_{(n - v - u + 1) k u v} \\ + \sum_{u = 1}^{D^{r}} \sum_{i = n - u + 1}^{n} \sum_{k = 1}^{m_{i}} r_{i k} λ_{i k u} {\dot{\tilde{μ}}}_{i k u 0}, \end{aligned}$ where ${\dot{\tilde{μ}}}_{i k u v} = \frac{\sum_{t = v}^{D^{s}} q_{i k t} μ_{i k u t} x_{i k u t}}{\sum_{t = v}^{D^{s}} q_{i k t}} .$

5. Simulations and real data analysis

Reported in this section include the results from a few small simulations conducted to further investigate $M^{r}$ . A real data in health insurance was also analysed to show the application of IIM and the accuracy improvement by using IIM with respect to IDM in practice.

5.1. Simulation

In this simulation, the risk exposures associated with every individuals were drawn from the uniform distribution on $[0, 1]$ , the covariates were produced by multivariate standard normal distribution and we simulated the random developments of claims for a fixed risk portfolio. In each run, we directly compute $M^{r}$ according to Equation (Equation24(24) $\begin{aligned} M^{r} & = \frac{M S E P (R, {\hat{R}}_{I I})}{M S E P (R, {\hat{R}}_{I D})} \\ = \frac{V a r (R | F^{u o}) + (E [R | F^{u o}] - {\hat{R}}_{I I})^{2}}{V a r (R | F^{u o}) + (E [R | F^{u o}] - {\hat{R}}_{I D})^{2}} . \end{aligned}$ (24) ) so that we can know how much accuracy is improved by using IIM with respect to IDM under the fixed risk portfolio.

Because there are only assumptions about mean and variance for reporting developments and payments of claims, we need additional distributional assumptions to generate them, which arise as follows. First, for individual reporting developments $N_{u}^{r}$ s, we generated them by the additional assumption which says that $\frac{N_{u}^{r}}{ϕ}$ follows Poisson distribution with mean $\frac{r λ_{u}}{ϕ}$ . Second, for individual payments $Y_{l}$ , similarly, we generated them by assuming that $\frac{Y_{l}}{ϕ^{p}}$ follows Poisson distribution with mean $\frac{μ_{l}}{ϕ^{p}}$ . Each run in the simulation was conducted with the setting: n = 5, $D^{r} = 2$ , $D^{s} = 2$ , a risk portfolio size $m = (10000, 10, 000, 10, 000, 10, 000, 10, 000),$ i.e., 10, 000 policies in each year, and any combination of parameters which varied according to the setting in the following two examples.

Example 5.1

Dimension d = 3 and the parameters varied in an auxiliary parameter t ranging in $[- 1, 1]$ by step 0.01 as

Parameters for reporting developments: $\begin{aligned} β & = (- 0.5, - t, 2 t)^{'}, π_{1} = (1, t, t)^{'}, \\ π_{2} & = (- 1, - t, - 2 t)^{'}, \end{aligned}$ and $ϕ = 2$ .
Parameters for settlement developments: $ρ_{1} = (0.1, 0.2 t, - 0.3 t)^{'}$ and $ρ_{2} = (- 0.1, - 0.2 t, 0.3 t)^{'}$ .
Parameters for payments: $γ = (5, 0.2 t, 0.4 t, 0.1, 0.6, 0.2, 0.8)^{'}$ and $ϕ^{p} = 1.5$ .

Covariates were produced by bivariate standard normal distribution in this example.

Example 5.2

Dimension d = 4 and parameters varied over t ranging in $[- 1, 1]$ by step 0.01 as

Parameters for reporting developments: $\begin{aligned} β & = (2, 0.2 t, - 0.8 t, 0.5 t)^{'}, \\ π_{1} & = (2, - t, 3 t, - 2 t)^{'}, π_{2} = (1, 2 t, - t, - 2 t)^{'}, \end{aligned}$ and $ϕ = 3$ .
Parameters for settlement developments: $\begin{aligned} ρ_{1} & = (0.3, 0.1 t, - 0.5 t, 0.2 t)^{'}, \\ ρ_{2} & = (- 0.2, - 0.3 t, 0.7 t, 0.4 t)^{'} . \end{aligned}$
Parameters for payments: $γ = (3, 0.6 t, - 0.2 t, 0.7 t, 0.3, 0.2, - 0.5, 0.4)^{'}$ and $ϕ^{p} = 2.5$ .

Covariates were produced by ternary standard normal distribution in this example.

In each run, we estimated loss reserve by both IIM and IDM using the simulated data that is we computed ${\hat{R}}_{I I}$ by Equation (Equation21(21) $\begin{aligned} {\hat{R}}_{I I} & = \sum_{v = 1}^{D^{s}} \sum_{u = 0}^{D_{v}^{r}} \sum_{k = 1}^{m_{n - v - u + 1}} N_{(n - v - u + 1) k u}^{r b n s} {\hat{\tilde{μ}}}_{(n - v - u + 1) k u v} \\ + \sum_{u = 1}^{D^{r}} \sum_{i = n - u + 1}^{n} \sum_{k = 1}^{m_{i}} r_{i k} {\hat{λ}}_{i k u} {\hat{\tilde{μ}}}_{i k u 0}, \end{aligned}$ (21) ) as well as ${\hat{R}}_{I D}$ by (Equation22(22) $\begin{aligned} {\hat{R}}_{I D} = \sum_{v = 1}^{D^{s}} \sum_{u = 0}^{D_{v}^{r}} {\tilde{N}}_{n - v - u + 1, u}^{r b n s} {\hat{\tilde{μ}}}_{u v} + \sum_{u = 1}^{D^{r}} r_{[u]} {\hat{λ}}_{u} {\hat{\tilde{μ}}}_{u 0}, \end{aligned}$ (22) ) and true parameters were used to compute $V a r (R | F^{u o})$ and $E [R | F^{u o}]$ according to Theorem 4.1. Then we computed $M^{r}$ by inserting the computed ${\hat{R}}_{I I}$ , ${\hat{R}}_{I D}$ , $V a r (R | F^{u o})$ and $E [R | F^{u o}]$ into Equation (Equation24(24) $\begin{aligned} M^{r} & = \frac{M S E P (R, {\hat{R}}_{I I})}{M S E P (R, {\hat{R}}_{I D})} \\ = \frac{V a r (R | F^{u o}) + (E [R | F^{u o}] - {\hat{R}}_{I I})^{2}}{V a r (R | F^{u o}) + (E [R | F^{u o}] - {\hat{R}}_{I D})^{2}} . \end{aligned}$ (24) ). At last, we plotted the simulated results in Figure .

Figure 1. The simulated $M^{r}$ over varying coefficients of covariates. (a) Example 5.1 and (b) Example 5.2.

We obtained the results consistent with Theorem 4.3 from the simulations above.

When the coefficients of $x_{1}, x_{2}, \dots, x_{d - 1}$ approach zero, most $M^{r}$ s are close to real number 1 that is loss reserving by IIM almost has the same accuracy as that by IDM.
When those coefficients are away from zero, $M^{r}$ tends to be zero that is the prediction accuracy of loss reserving by IIM is greatly improved with respect to IDM.

5.2. Real data analysis

In this section, we analysed a dataset, which was collected by a commercial insurance company in China. The dataset recorded writing and expiring dates of policies, individual information, see Table , and developments of reported claims between 1/1/2019 and 8/31/2019.

Table 1. The individual information in real data analysis.

Display Table

To visualise the effects of individual information on the developments of claims, for example, the histograms of reporting and settlement delays measured in days were provided under a few combinations of covariate values including gender, geographical location and age, as presented in Figures and . It was strongly proposed that the individual information had impacts on the distributions of reporting and settlement delays.In the dataset, all the reporting delays were not more than 150 days (5 months). By China Banking and Insurance Regulatory Commission, the reported claims in health insurance are generally required to be settled within 2 months if no disagreement exists. It is appropriate to take 1 month as the time unit (‘accident year’ in previous sections). Thus the maximum reporting and settlement delays were safely set to $D^{r} = 5$ and $D^{s} = 3$ (the real data supported this assumption).

Figure 2. Histograms of reporting delays (in days): (a) Female, Region III, age 9–20; (b) Male, Region I, age 45–50; (c) Male, Region VI, age 20–40; (d) Male, Region III, age >55.

Figure 3. Histograms of settlement delays (in days): (a) Female, Region III, age 9–20; (b) Male, Region I, age 45–50; (c) Male, Region VI, age 20–40; (d) Male, Region III, age >55.

To illustrate the proposed model for loss reserving, evaluation date was set as 8/31/2019. That is, we worked with n = 8, $D^{r} = 5$ and $D^{s} = 3$ (months). There are four factors organised into eight features $x_{1}, \dots, x_{8}$ , as shown in Table . Besides, reporting and settlement delays, which were regarded as factors to model claim payments as Assumption 3.2 formulated, were respectively organised into five features $x_{9}$ , $x_{10}, \dots, x_{13}$ and three features $x_{14}, x_{15}, x_{16}$ .

The estimated parameters for the reporting developments under IIM, their standard errors and p-values of significance test were displayed in Table , while the corresponding estimated results under IDM, i.e., ${\hat{λ}}_{u}$ , $u = 0, 1, \dots, 5$ in (Equation7(7) $\begin{aligned} {\hat{λ}}_{u} = \frac{{\tilde{N}}_{u}^{r}}{r_{(u)}}, u = 0, 1, \dots, D^{r}, \end{aligned}$ (7) ) are $\begin{aligned} (0.0218, 0.0341, 0.0133, 0.0058, 0.0034, 0.0022), \end{aligned}$ respectively. Besides, the estimated dispersion parameter $\hat{ϕ} = 1.9433$ . These results in () provide obvious evidence that individual information has effects on the reporting developments of claims in sense that most covariates associated with individual information are significant at significance level 0.05.

Similar results for settlement developments and payments are listed in Tables and . These results also provide obvious evidence that individual information has effects on settlement developments and payments of claims. Besides, the estimated dispersion parameter ${\hat{ϕ}}^{p} = 15467.1$ and the estimates under IDM are $\begin{aligned} ({\hat{q}}_{0}, {\hat{q}}_{1}, {\hat{q}}_{2}, {\hat{q}}_{3}) = (0.6435, 0.3121, 0.0363, 0.0081), \\ {\hat{γ}}^{I D} = (8.3055, 0.4535, 0.5830, 0.6227, 0.5838 0.4557, \\ 0.4204, 0.6139, 0.8472)^{'} . \end{aligned}$

Table 2. Estimated parameters for reporting developments, their standard errors and p-values.

Display Table

Table 3. Estimated parameters for settlements developments, their standard errors and p-values.

Display Table

Table 4. Estimated parameters $\hat{γ}$ for payments, their standard errors and p-values.

Display Table

In Table , the columns with names ‘IBNR’, ‘RBNS’ and ‘Loss reserving’ correspond to estimates of IBNR reserve, RBNS reserve and total loss reserve, respectively. The square roots of approximated conditional MSEPs under IIM and IDM are in the fourth column of Table . The rightmost column in this table showed the computed ${\hat{M}}^{r}$ by (Equation28(28) $\begin{aligned} {\hat{M}}^{r} = \frac{V a r (R | F^{u o}) (\hat{θ}) + \nabla R_{m} (\hat{θ})^{'} \hat{C o v} (\hat{θ}) \nabla R_{m} (\hat{θ})}{\begin{matrix} V a r (R | F^{u o}) (\hat{θ}) + \nabla R_{m} (\hat{θ})^{'} \hat{C o v} (\hat{θ}) \nabla R_{m} (\hat{θ}) \\ + ({\hat{R}}_{I I} - {\hat{R}}_{I D})^{2} \end{matrix}}, \end{aligned}$ (28) ). We can see that loss reserving by IIM provides more stable prediction of outstanding liabilities than that by IDM since the former has smaller conditional MSEP and after incorporating useful individual information into loss reserving, the prediction accuracy is greatly increased by $77.63 %$ .

6. Conclusion

This paper explored the improvement of accuracy in predicting outstanding liabilities, which are incurred by general insurance companies, by incorporating useful individual information into modelling. The reporting developments and payments of individual claims were given weak assumptions about their first two moments and modelled under quasi-likelihood theory, while settlement delays were modelled by multinomial logistic regression. Based on the model specification, loss reserve and conditional variance of outstanding liabilities were derived, which were further used to compute loss reserving and conditional MSEP. It was theoretically proved that loss reserving incorporating useful individual information shows higher accuracy than that under IDM, where the accuracy is measured by the conditional MSEP, when portfolio size is large enough. The conclusion is also supported by the simulations and real data analysis.

Table 5. Reserving, accuracy of prediction and accuracy improvement of IIM with respect to IDM.

Display Table

While the proposed model is basically a parametric model in statistical context, some one may be concerned with the limitation that the model is subjective and thus question its robustness in practical applications. Regarding this aspect, a possible next step is to study this problem under a nonparametric framework. Especially, it is more interesting to model the dependence of claims development on individual information by machine learning (including deep learning).

Disclosure statement

No potential conflict of interest was reported by the author(s).

Additional information

Funding

This work was supported by the Natural Science Foundation of China (71771089), the Shanghai Philosophy and Social Science Foundation (2015BGL001), the National Social Science Foundation Key Program of China (17ZDA091) and China Scholarship Council (201906140045).

Notes on contributors

Zhigao Wang

Zhigao Wang is a Ph.D. candidate in Statistics at East China Normal University.

Xianyi Wu

Xianyi Wu is a professor in School of Statistics at East China Normal University.

Chunjuan Qiu

Chunjuan Qiu is an associate professor in School of Statistics at East China Normal University.

References

Alai, D. H., Merz, M., & Wüthrich, M. V. (2009). Mean square error of prediction in the Bornhuetter–Ferguson claims reserving method. Annals of Actuarial Science, 4(1), 7–31. https://doi.org/https://doi.org/10.1017/S1748499500000580
Google Scholar
Alai, D. H., Merz, M., & Wüthrich, M. V. (2010). Prediction uncertainty in the Bornhuetter–Ferguson claims reserving method: Revisited. Annals of Actuarial Science, 5(1), 7–7. https://doi.org/https://doi.org/10.1017/S1748499510000023
Google Scholar
Antonio, K., & Plat, R. (2014). Micro-level stochastic loss reserving for general insurance. Scandinavian Actuarial Journal, 2014(7), 649–669. https://doi.org/https://doi.org/10.1080/03461238.2012.755938
Web of Science ®Google Scholar
Arjas, E. (1989). The claims reserving problem in non-life insurance: Some structural ideas. ASTIN Bulletin: The Journal of the IAA, 19(2), 139–152. https://doi.org/https://doi.org/10.2143/AST.19.2.2014905
Google Scholar
Crevecoeur, J., Antonio, K., & Verbelen, R. (2019). Modeling the number of hidden events subject to observation delay. European Journal of Operational Research, 277(3), 930–944. https://doi.org/https://doi.org/10.1016/j.ejor.2019.02.044
Web of Science ®Google Scholar
England, P. D., & Verrall, R. J. (2002). Stochastic claims reserving in general insurance. British Actuarial Journal, 8(3), 443–544. https://doi.org/https://doi.org/10.1017/S1357321700003809
Google Scholar
Huang, J., Qiu, C., & Wu, X. (2015). Stochastic loss reserving in discrete time: Individual vs. aggregate data models. Communications in Statistics-Theory and Methods, 44(10), 2180–2206. https://doi.org/https://doi.org/10.1080/03610926.2014.976473
Web of Science ®Google Scholar
Huang, J., Qiu, C., Wu, X., & Zhou, X. (2015). An individual loss reserving model with independent reporting and settlement. Insurance: Mathematics and Economics, 64(1), 232–245. https://doi.org/https://doi.org/10.1016/j.insmatheco.2015.05.010
Google Scholar
Huang, J., Wu, X., & Zhou, X. (2016). Asymptotic behaviors of stochastic reserving: Aggregate versus individual models. European Journal of Operational Research, 249(2), 657–666. https://doi.org/https://doi.org/10.1016/j.ejor.2015.09.039
Web of Science ®Google Scholar
Lindholm, M., Lindskog, F., & Wahl, F. (2020). Estimation of conditional mean squared error of prediction for claims reserving. Annals of Actuarial Science, 14(1), 93–128. https://doi.org/https://doi.org/10.1017/S174849951900006X
Web of Science ®Google Scholar
Mack, T. (1993). Distribution-free calculation of the standard error of chain ladder reserve estimates. ASTIN Bulletin: The Journal of the IAA, 23(2), 213–225. https://doi.org/https://doi.org/10.2143/AST.23.2.2005092
Google Scholar
Mack, T. (2000). Credible claims reserves: The Benktander method. ASTIN Bulletin: The Journal of the IAA, 30(2), 333–347. https://doi.org/https://doi.org/10.2143/AST.30.2.504639
Google Scholar
McCullagh, P., & Nelder, J. A. (1989). Generalized linear models (2nd ed). Chapman and Hall.
Google Scholar
Norberg, R. (1993). Prediction of outstanding liabilities in non-life insurance 1. ASTIN Bulletin: The Journal of the IAA, 23(1), 95–115. https://doi.org/https://doi.org/10.2143/AST.23.1.2005103
Google Scholar
Norberg, R. (1999). Prediction of outstanding liabilities II. Model variations and extensions. ASTIN Bulletin: The Journal of the IAA, 29(1), 5–25. https://doi.org/https://doi.org/10.2143/AST.29.1.504603
Google Scholar
Pigeon, M., Antonio, K., & Denuit, M. (2013). Individual loss reserving with the multivariate skew normal framework. ASTIN Bulletin: The Journal of the IAA, 43(3), 399–428. https://doi.org/https://doi.org/10.1017/asb.2013.20
Web of Science ®Google Scholar
Pigeon, M., Antonio, K., & Denuit, M. (2014). Individual loss reserving using paid-incurred data. Insurance: Mathematics and Economics, 58(2), 121–131. https://doi.org/https://doi.org/10.1016/j.insmatheco.2014.06.012
Google Scholar
Van der Vaart, A. W. (2000). Asymptotic Statistics. Cambridge University Press.
Google Scholar
Verrall, R. J., Nielsen, J. P., & Jessen, A. H. (2010). Prediction of RBNS and IBNR claims using claim amounts and claim counts. Astin Bulletin, 40(2), 871–887. https://doi.org/https://doi.org/10.2143/AST.40.2.2061139
Web of Science ®Google Scholar
Wahl, F. (2019). Explicit moments for a class of micro-models in non-life insurance. Insurance: Mathematics and Economics, 89(7), 140–156. https://doi.org/https://doi.org/10.1016/j.insmatheco.2019.10.001
Google Scholar
Wahl, F., Lindholm, M., & Verrall, R. (2019). The collective reserving model. Insurance: Mathematics and Economics, 87(7), 34–50. https://doi.org/https://doi.org/10.1016/j.insmatheco.2019.04.003
Google Scholar
Wüthrich, M. V., & Merz, M. (2008). Stochastic Claims Reserving Methods in Insurance. John Wiley and Sons.
Google Scholar
Yu, X., & He, R. (2016). Individual claims reserving models based on marked Cox processes. Chinese Journal of Applied Probability and Statistics 32(2), 201-219. http://aps.ecnu.edu.cn/EN/Y2016/V32/I2/201
Google Scholar
Zhao, X., & Zhou, X. (2010). Applying copula models to individual claim loss reserving methods. Insurance: Mathematics and Economics, 46(2), 290–299. https://doi.org/https://doi.org/10.1016/j.insmatheco.2009.11.001
Web of Science ®Google Scholar

Appendix

Proof

Proof of Proposition 3.1

To derive the following gradient and Hessian matrix, we need the identities

\frac{\partial q_{v}}{\partial ρ} = q_{v} (δ_{v} - q) \otimes x

, which gives that

\begin{aligned} \frac{\partial q^{'}}{\partial ρ} = (d i a g (q) - q q^{'}) \otimes x, \frac{\partial \log q^{'}}{\partial ρ} = (I_{D^{s}} - q 1_{D^{s}}^{'}) \otimes x . \end{aligned}

Then we have the following gradient according to formulas above, which is

\begin{aligned} \nabla Q^{i o s} (ρ) \\ = [\sum_{v = 0}^{D_{i}^{s}} N_{v}^{s} (δ_{v} - q) + \sum_{u = 0}^{D_{i}^{r}} N_{u}^{r b n s} (\frac{(0, {\bar{q}}_{n - i - u}^{'})^{'}}{{\bar{Q}}_{n - i - u}} - q)] \otimes x \\ = [N_{i}^{s} + \sum_{u = 0}^{D_{i}^{r}} \frac{N_{u}^{r b n s}}{{\bar{Q}}_{n - i - u}} (\begin{matrix} 0 \\ {\bar{q}}_{n - i - u} \end{matrix}) \\ - (\sum_{v = 0}^{D_{i}^{s}} N_{v}^{s} + \sum_{u = 0}^{D_{i}^{r}} N_{u}^{r b n s}) q] \otimes x . \end{aligned}

By some algebraic computation, it follows that

\begin{aligned} \frac{\partial^{2} Q^{i o s} (ρ)}{\partial ρ ρ^{'}} \\ = - [(\sum_{v = 0}^{D_{i}^{s}} N_{v}^{s} + \sum_{u = 0}^{D_{i}^{r}} N_{u}^{r b n s}) (d i a g (q) - q q^{'}) \\ + \sum_{u = 0}^{D_{i}^{r}} N_{u}^{r b n s} \\ \times (\begin{array}{cc} 0 & 0 \\ 0 & d i a g ({\bar{q}}_{n - i - u}) - \frac{{\bar{q}}_{n - i - u} {\bar{q}}_{n - i - u}^{'}}{{\bar{Q}}_{n - i - u}} \end{array})] \otimes x x^{'} . \end{aligned}

Because

\sum_{v = 0}^{D_{i}^{s}} N_{v}^{s} + \sum_{u = 0}^{D_{i}^{r}} N_{u}^{r b n s}

is just the number of those reported claims incurred in accident year i,

\begin{aligned} E [\sum_{v = 0}^{D_{i}^{s}} N_{v}^{s} + \sum_{u = 0}^{D_{i}^{r}} N_{u}^{r b n s} | r, x] = r \sum_{u = 0}^{D_{i}^{r}} λ_{u} . \end{aligned}

Observe further that

N_{u}^{r b n s} = 0

for

i \leq n - D^{s}

and

0 \leq u \leq n - i - D^{s} .

Therefore,

(A1)

\begin{aligned} E [\frac{\partial^{2} Q^{i o s} (ρ)}{\partial ρ ρ^{'}} | r, x] \\ = - r [\sum_{u = 0}^{D_{i}^{r}} λ_{u} (d i a g (q) - q q^{'}) \\ - \sum_{u = (n - i - D^{s} + 1)_{+}}^{D_{i}^{r}} λ_{u} \\ \times (\begin{array}{cc} 0 & 0 \\ 0 & d i a g ({\bar{q}}_{n - i - u}) - \frac{{\bar{q}}_{n - i - u} {\bar{q}}_{n - i - u}^{'}}{{\bar{Q}}_{n - i - u}} \end{array})] \otimes x x^{'} . \end{aligned}

(A1)

Let v = n−i−u and note that $\begin{aligned} d i a g (q) - q q^{'} - (\begin{array}{cc} 0 & 0 \\ 0 & d i a g ({\bar{q}}_{v}) - \frac{{\bar{q}}_{v} {\bar{q}}_{v}^{'}}{{\bar{Q}}_{v}} \end{array}) \\ = (\begin{array}{cc} d i a g (q_{v}) - q_{v} q_{v}^{'} & - q_{v} {\bar{q}}_{v}^{'} \\ - {\bar{q}}_{v} q_{v}^{'} & \frac{Q_{v}}{{\bar{Q}}_{v}} {\bar{q}}_{v} {\bar{q}}_{v}^{'} \end{array}) . \end{aligned}$ Then, Equation (EquationA1(A1) $\begin{aligned} E [\frac{\partial^{2} Q^{i o s} (ρ)}{\partial ρ ρ^{'}} | r, x] \\ = - r [\sum_{u = 0}^{D_{i}^{r}} λ_{u} (d i a g (q) - q q^{'}) \\ - \sum_{u = (n - i - D^{s} + 1)_{+}}^{D_{i}^{r}} λ_{u} \\ \times (\begin{array}{cc} 0 & 0 \\ 0 & d i a g ({\bar{q}}_{n - i - u}) - \frac{{\bar{q}}_{n - i - u} {\bar{q}}_{n - i - u}^{'}}{{\bar{Q}}_{n - i - u}} \end{array})] \otimes x x^{'} . \end{aligned}$ (A1) ) gives rise to the desired result.

Proof

Proof of Theorem 4.1

By (Equation3(3) $\begin{aligned} R := \sum_{i = 1}^{n} R_{i}^{r b n s} + \sum_{i = 1}^{n} R_{i}^{i b n r}, \end{aligned}$ (3) ), the loss reserve can be computed as $\begin{aligned} E (R | F^{u o}) = E [R^{r b n s} | F^{u o}] + E [R^{i b n r} | F^{u o}] . \end{aligned}$ According to Assumption 3.2, for a representative policy in year i, given $N_{u}^{r b n s}$ with $n - i - D^{s} + 1 \leq u \leq D_{i}^{r}$ , $(N_{u, n - i - u + 1}, \dots, N_{u D^{s}})$ follows multinomial distribution with parameters $N_{u}^{r b n s}$ and $\frac{1}{{\bar{Q}}_{n - i - u}} (q_{n - i - u + 1}, \dots, q_{D^{s}})$ . Then by Assumption 3.3, the RBNS loss reserve is $\begin{aligned} E [R^{r b n s} | F^{u o}] \\ = \sum_{i = 1}^{n} \sum_{k = 1}^{m_{i}} \sum_{u = (n - i - D^{s} + 1)_{+}}^{D_{i}^{r}} E [\sum_{v = n - i - u + 1}^{D^{s}} \sum_{l = 1}^{N_{i k u v}} Y_{i k u v l} | F^{u o}] \\ = \sum_{i = 1}^{n} \sum_{k = 1}^{m_{i}} \sum_{u = (n - i - D^{s} + 1)_{+}}^{D_{i}^{r}} N_{i k u}^{r b n s} \frac{\sum_{v = n - i - u + 1}^{D^{s}} q_{i k v} μ_{i k u v}}{{\bar{Q}}_{i k, n - i - u}} \\ = \sum_{v = 1}^{D^{s}} \sum_{u = 0}^{D_{v}^{r}} \sum_{k = 1}^{m_{n - v - u + 1}} N_{(n - v - u + 1) k u}^{r b n s} {\tilde{μ}}_{(n - v - u + 1) k u v} . \end{aligned}$ It can be easily proved that IBNR claims are independent of historical observation $F^{u o}$ by Assumption 3.1–3.3. Hence, IBNR loss reserve is computed by $\begin{aligned} E [R^{i b n r} | F^{u o}] & = E [\sum_{i = 1}^{n} \sum_{k = 1}^{m_{i}} \sum_{u = n - i + 1}^{D^{r}} \sum_{v = 0}^{D^{s}} \sum_{l = 1}^{N_{i k u v}} Y_{i k u v l} | F^{u o}] \\ = \sum_{i = n - D^{r} + 1}^{n} \sum_{k = 1}^{m_{i}} \sum_{u = n - i + 1}^{D^{r}} \sum_{v = 0}^{D^{s}} E [\sum_{l = 1}^{N_{i k u v}} Y_{i k u v l}] \\ = \sum_{u = 1}^{D^{r}} \sum_{i = n - u + 1}^{n} \sum_{k = 1}^{m_{i}} p_{i k u} r_{i k} \exp (x_{i k}^{'} β) {\tilde{μ}}_{i k u 0} . \end{aligned}$ According to independence assumptions in Assumptions 3.1–3.3, the developments of RBNS claims are independent of developments of IBNR claims, which results in the independence between $R^{r b n s}$ and $R^{i b n r}$ . Then the variance of R given $F^{u o}$ is $\begin{aligned} V a r (R | F^{u o}) = V a r (R^{r b n s} | F^{u o}) + V a r (R^{i b n r} | F^{u o}) . \end{aligned}$ First, for $v \geq n - i - u + 1$ , we compute $\begin{aligned} V a r (\sum_{l = 1}^{N_{u v}} Y_{u v l} | E^{o}) \\ = V a r (E [\sum_{l = 1}^{N_{u v}} Y_{u v l} | N_{u v}, E^{o}]) \\ + E [V a r (\sum_{l = 1}^{N_{u v}} Y_{u v l} | N_{u v}, E^{o})] \\ = μ_{u v}^{2} V a r (N_{u v} | F^{o}) + ϕ^{p} μ_{u v} E [N_{u v} | F^{o}] \\ = N_{u}^{r b n s} [μ_{u v}^{2} \frac{q_{v} (1 - q_{v})}{{\bar{Q}}_{n - i - u}^{2}} + ϕ^{p} μ_{u v} \frac{q_{v}}{{\bar{Q}}_{n - i - u}}] . \end{aligned}$ For $v_{1}, v_{2} \geq n - i - u + 1$ , compute $C o v (\sum_{l = 1}^{N_{u v_{1}}} Y_{u v_{1} l}, \sum_{l = 1}^{N_{u v_{2}}} Y_{u v_{2} l} | F^{o})$ which is equal to $\begin{aligned} C o v (E [\sum_{l = 1}^{N_{u v_{1}}} Y_{u v_{1} l} | N_{u v_{1}}, N_{u v_{2}}, F^{o}], \\ E [\sum_{l = 1}^{N_{u v_{2}}} Y_{u v_{2} l} | N_{u v_{1}}, N_{u v_{2}}, F^{o}]) \\ + E [C o v (\sum_{l = 1}^{N_{u v_{1}}} Y_{u v_{1} l}, \sum_{l = 1}^{N_{u v_{2}}} Y_{u v_{2} l} | N_{u v_{1}}, N_{u v_{2}}, F^{o})], \end{aligned}$ which can be computed as follows: $\begin{aligned} C o v (\sum_{l = 1}^{N_{u v_{1}}} Y_{u v_{1} l}, \sum_{l = 1}^{N_{u v_{2}}} Y_{u v_{2} l} | F^{o}) \\ = μ_{u v_{1}} μ_{u v_{2}} C o v (N_{u v_{1}} | F^{o}], N_{u v_{2}} | F^{o}]) \\ = - N_{u}^{r b n s} μ_{u v_{1}} μ_{u v_{2}} \frac{q_{v_{1}} q_{v_{2}}}{{\bar{Q}}_{n - i - u}^{2}} . \end{aligned}$ Then by independence among policies and Assumptions 3.2 and 3.3, the variance of RBNS loss reserve given $F^{u o}$ is $\begin{aligned} V a r (R^{r b n s} | F^{u o}) \\ = \sum_{i = 1}^{n} \sum_{k = 1}^{m_{i}} V a r (\sum_{u = (n - i - D^{s} + 1)_{+}}^{D_{i}^{r}} \sum_{v = n - i - u + 1}^{D^{s}} \sum_{l = 1}^{N_{i k u v}} Y_{i k u v l} | F^{u o}) \\ = \sum_{i = 1}^{n} \sum_{k = 1}^{m_{i}} \sum_{u = (n - i - D^{s} + 1)_{+}}^{D_{i}^{r}} N_{i k u}^{r b n s} (\frac{\sum_{v = n - i - u + 1}^{D^{s}} q_{i k v} μ_{i k u v}^{2}}{{\bar{Q}}_{i k, n - i - u}^{2}} \\ - {(\frac{\sum_{v = n - i - u + 1}^{D^{s}} q_{i k v} μ_{i k u v}}{{\bar{Q}}_{i k, n - i - u}})}^{2} + ϕ^{p} \frac{\sum_{v = n - i - u + 1}^{D^{s}} q_{i k v} μ_{i k u v}}{{\bar{Q}}_{i k, n - i - u}}) \\ = \sum_{v = 1}^{D^{s}} \sum_{u = 0}^{D_{v}^{r}} \sum_{k = 1}^{m_{n - v - u + 1}} N_{n - v - u + 1, k u}^{r b n s} (\frac{{\tilde{μ}}_{n - v - u + 1, k u v}^{s}}{{\bar{Q}}_{n - v - u + 1, k, v - 1}} \\ - {\tilde{μ}}_{n - v - u + 1, k u v}^{2} + ϕ^{p} {\tilde{μ}}_{n - v - u + 1, k u v}) . \end{aligned}$ Because IBNR claims are independent of historical observation $F^{u o}$ , variance of IBNR loss reserve given $F^{u o}$ is computed by $\begin{aligned} V a r (R^{i b n r} | F^{u o}) \\ = V a r (\sum_{i = 1}^{n} \sum_{k = 1}^{m_{i}} \sum_{u = n - i + 1}^{D^{r}} \sum_{v = 0}^{D^{s}} \sum_{l = 1}^{N_{i k u v}} Y_{i k u v l}) \\ = \sum_{i = n - D^{r} + 1}^{n} \sum_{k = 1}^{m_{i}} \sum_{u = n - i + 1}^{D^{r}} V a r (E [\sum_{v = 0}^{D^{s}} \sum_{l = 1}^{N_{i k u v}} Y_{i k u v l} | N_{i k u}]) \\ + E [V a r (\sum_{v = 0}^{D^{s}} \sum_{l = 1}^{N_{i k u v}} Y_{i k u v l} | N_{i k u})] \\ = \sum_{u = 1}^{D^{r}} \sum_{i = n - u + 1}^{n} \sum_{k = 1}^{m_{i}} p_{i k u} r_{i k} \exp (x_{i k}^{'} β) ({\tilde{μ}}_{i k u 0}^{s} \\ + (ϕ - 1) {\tilde{μ}}_{i k u 0}^{2} + ϕ^{p} {\tilde{μ}}_{i k u 0}) . \end{aligned}$

Proof

Proof of Theorem 4.3

Expand $R_{m} (\hat{θ})$ about true parameters $θ$ by Taylor expansion. Then we have $\begin{aligned} \frac{1}{m} ({\hat{R}}_{I I} - R_{m}) = \frac{1}{m} \frac{\partial R_{m} (θ))}{\partial θ^{'}} (\hat{θ} - θ) + o_{p} (\frac{‖ \hat{θ} - θ ‖}{m}) . \end{aligned}$ One knows that ${\dot{μ}}_{u v} := \frac{\partial μ_{u v}}{\partial γ} = μ_{u v} x_{u v}$ . Write $μ_{u v} = (μ_{u, v + 1}, μ_{u, v + 2}, \dots, μ_{u, D^{s}})^{'}$ and ${\dot{μ}}_{u v} = \frac{\partial μ_{u v}^{'}}{\partial γ}$ , $u = 0, 1, \dots, D^{r}$ and $v = 0, \dots, D^{s} - 1$ . To compute the partial derivative in the Taylor expansion above, we need the following partial derivatives: $\begin{aligned} \frac{\partial [p_{u} r \exp (x^{'} β)]}{\partial (β^{'}, π^{'})^{'}} = (\begin{matrix} 1 \\ δ_{u} - p \end{matrix}) \otimes (p_{u} r \exp (x^{'} β) x), \\ \frac{\partial {\tilde{μ}}_{u v}}{\partial ρ} = [0, μ_{u, v - 1}^{'} (\frac{1}{{\bar{Q}}_{v - 1}} d i a g ({\bar{q}}_{v - 1}) \\ - {\frac{1}{{\bar{Q}}_{v - 1}^{2}} {\bar{q}}_{v - 1} {\bar{q}}_{v - 1}^{'})]}^{'} \otimes x_{u}, v \geq 1, \\ \frac{\partial {\tilde{μ}}_{u 0}}{\partial ρ} = [(d i a g (q) - q q^{'}) μ_{u 0} - q_{0} μ_{u 0} q] \otimes x, and \\ \frac{\partial {\tilde{μ}}_{u v}}{\partial γ} = \frac{1}{{\bar{Q}}_{v - 1}} \frac{\partial μ_{u, v - 1}^{'}}{\partial γ} {\bar{q}}_{v - 1}, v \geq 1, \\ \frac{\partial {\tilde{μ}}_{u 0}}{\partial γ} = (\frac{\partial μ_{u 0}}{\partial γ}, \frac{\partial μ_{u 0}^{'}}{\partial γ}) (q_{0}, q^{'})^{'} . \end{aligned}$ By the law of large numbers, it can be proved that $\frac{1}{m} \frac{\partial R_{m} (θ))}{\partial θ^{'}} \overset{a . s .}{\to} g$ , where $g = (g_{1}^{'}, g_{2}^{'}, g_{3}^{'})^{'}$ , where denoting $\begin{aligned} M_{u v} = {\begin{cases} {(0, μ_{u v}^{'} (d i a g ({\bar{q}}_{v}) - \frac{1}{{\bar{Q}}_{v}} {\bar{q}}_{v} {\bar{q}}_{v}^{'}))}^{'}, & v > 0 \\ (d i a g ({\bar{q}}_{0}) - \frac{1}{{\bar{Q}}_{0}} {\bar{q}}_{0} {\bar{q}}_{0}^{'}) μ_{u 0}, & v = 0, \end{cases} \end{aligned}$ (A2) $\begin{aligned} g_{1} & = \sum_{u = 1}^{D^{r}} \sum_{i = n - u + 1}^{n} κ_{i} E [{\tilde{μ}}_{u 0} (\begin{matrix} 1 \\ δ_{u} - p \end{matrix}) \otimes (r λ_{u} x)], \\ g_{2} & = \sum_{u = 0}^{D^{r}} E [(\sum_{v = 0}^{D^{s} - 1} κ_{n - u - v} M_{u v} + \sum_{i = n - u + 1}^{n} κ_{i} \\ \times [(d i a g (q) - q q^{'}) μ_{u 0} - q_{0} μ_{u 0} q]) \otimes (r λ_{u} x)], \\ g_{3} & = \sum_{u = 0}^{D^{r}} E [(\sum_{v = 0}^{D^{s} - 1} κ_{n - u - v} {\dot{μ}}_{u v} {\bar{q}}_{v} \\ + \sum_{i = n - u + 1}^{n} κ_{i} ({\dot{μ}}_{u 0}, {\dot{μ}}_{u 0}) (q_{0}, q^{'})^{'}) \otimes (r λ_{u} x_{u v})] . \end{aligned}$ (A2)

It is well known that $\hat{θ} \overset{P}{\to} θ$ under some regular conditions and hence $\frac{1}{m} ({\hat{R}}_{I I} - R_{m}) \overset{P}{\to} 0$ . Besides, $\begin{aligned} \frac{V a r (R | F^{u o})}{m} \overset{a . s .}{\to} V_{R} \\ = \sum_{v = 1}^{D^{s}} \sum_{u = 0}^{D_{v}^{r}} κ_{n - v - u + 1} \\ \times E [r λ_{u} {\bar{Q}}_{v - 1} (\frac{{\tilde{μ}}_{u v}^{s}}{{\bar{Q}}_{v - 1}} - {\tilde{μ}}_{u v}^{2} + ϕ^{p} {\tilde{μ}}_{u v})] \\ + \sum_{u = 1}^{D^{r}} \sum_{i = n - u + 1}^{n} κ_{i} E [r λ_{u} ({\tilde{μ}}_{u 0}^{s} + (ϕ - 1) {\tilde{μ}}_{u 0}^{2} + ϕ^{p} {\tilde{μ}}_{u 0})] . \end{aligned}$ If individual data model hold true, one can similarly prove that $\frac{1}{m} ({\hat{R}}_{I D} - R_{m}) \overset{P}{\to} 0$ . Therefore, $M^{r} \overset{P}{\to} 1$ in this case. If individual information model holds true, we can easily prove that $\frac{{\hat{R}}_{I D} - R_{m}}{m}$ is asymptotically biased, which results from the following arguments. The law of large numbers readily gives ${\hat{h}}_{v} \overset{a . s .}{\to} {\overset{ˇ}{h}}_{v}$ and ${\hat{μ}}_{u v} \overset{a . s .}{\to} {\overset{ˇ}{μ}}_{u v}$ . Further, we have ${\hat{q}}_{v} \overset{a . s .}{\to} {\overset{ˇ}{q}}_{v} := {\overset{ˇ}{h}}_{v} \prod_{s = 0}^{v - 1} (1 - {\overset{ˇ}{h}}_{s})$ and ${\hat{\tilde{μ}}}_{u v} \overset{a . s .}{\to} {\overset{ˇ}{\tilde{μ}}}_{u v} := \sum_{s = v}^{D^{s}} {\overset{ˇ}{q}}_{s} {\overset{ˇ}{μ}}_{u s} / \sum_{s = v}^{D^{s}} {\overset{ˇ}{q}}_{s}$ . We have $\begin{aligned} \frac{{\hat{R}}_{I D} - R_{m}}{m} = \frac{{\hat{R}}_{I D} - {\overset{ˇ}{R}}_{I D}}{m} + \frac{{\overset{ˇ}{R}}_{I D} - R_{m}}{m}, \end{aligned}$ where ${\overset{ˇ}{R}}_{I D} = \sum_{v = 1}^{D^{s}} \sum_{u = 0}^{D_{v}^{r}} \sum_{k = 1}^{m_{n - u - v + 1}} N_{n - u - v + 1, k u}^{r b n s} {\overset{ˇ}{\tilde{μ}}}_{u v} + \sum_{u = 1}^{D^{r}} \sum_{i = n - u + 1}^{n} \sum_{k = 1}^{m_{i}} r_{i k} {\overset{ˇ}{λ}}_{u} {\overset{ˇ}{\tilde{μ}}}_{u 0}$ and then $\begin{aligned} {\overset{ˇ}{R}}_{I D} - R_{m} \\ = \sum_{v = 1}^{D^{s}} \sum_{u = 0}^{D_{v}^{r}} \sum_{k = 1}^{m_{n - u - v + 1}} N_{n - u - v + 1, k u}^{r b n s} ({\overset{ˇ}{\tilde{μ}}}_{u v} - {\tilde{μ}}_{n - u - v + 1, k u v}) \\ + \sum_{u = 1}^{D^{r}} \sum_{i = n - u + 1}^{n} \sum_{k = 1}^{m_{i}} (r_{i k} {\overset{ˇ}{λ}}_{u} {\overset{ˇ}{\tilde{μ}}}_{u 0} - r_{i k} λ_{i k u} {\tilde{μ}}_{i k u 0}) . \end{aligned}$ Apparently, $\frac{{\hat{R}}_{I D} - {\overset{ˇ}{R}}_{I D}}{m} \overset{a . s .}{\to} 0$ and by the law of large numbers and some simple algebra operations, we show that $\begin{aligned} \frac{1}{m} ({\overset{ˇ}{R}}_{I D} - R_{m}) \overset{a . s .}{\to} Δ \\ = \sum_{v = 1}^{D^{s}} \sum_{u = 0}^{D_{v}^{r}} κ_{n - v - u + 1} E [r λ_{u} {\bar{Q}}_{v - 1} ({\overset{ˇ}{\tilde{μ}}}_{u v} - {\tilde{μ}}_{u v})] \\ + \sum_{u = 1}^{D^{r}} \sum_{i = n - u + 1}^{n} κ_{i} E [r λ_{u} ({\overset{ˇ}{\tilde{μ}}}_{u 0} - {\tilde{μ}}_{u 0})] . \end{aligned}$ Therefore, if asymptotic bias Δ is not zero, $M^{r} \overset{P}{\to} 0$ . Then we complete the proof.

Stochastic loss reserving using individual information model with over-dispersed Poisson

Abstract

1. Introduction

2. Data structure

3. Model specification