538
Views
3
CrossRef citations to date
0
Altmetric
Articles

Optimal mean-variance reinsurance and investment strategy with constraints in a non-Markovian regime-switching model

, &
Pages 214-227 | Received 03 Jun 2019, Accepted 25 Nov 2019, Published online: 30 Jan 2020

Abstract

This paper is devoted to study the proportional reinsurance/new business and investment problem under the mean-variance criterion in a continuous-time setting. The strategies are constrained in the non-negative cone and all coefficients in the model except the interest rate are stochastic processes adapted the filtration generated by a Markov chain. With the help of a backward stochastic differential equation driven by the Markov chain, we obtain the optimal strategy and optimal cost explicitly under this non-Markovian regime-switching model. The cases with one risky asset and Markov regime-switching model are considered as special cases.

1. Introduction

Insurers usually control their risks via some business activities, such as investing in a financial market, purchasing reinsurance, and acquiring new business. The problem of finding the optimal reinsurance/new business and investment strategies has been investigated in a vast of literature under various criteria, among which minimising ruin probability, maximising expected utility of terminal wealth, and minimising the mean-variance cost are most popular.

Browne (Citation1995) first studied the optimal investment problem for an insurer aiming to maximise the terminal utility function and minimise the ruin probability under a diffusion risk model. Under a similar diffusion model, Promislow and Young (Citation2005) obtained the optimal investment and quota-share reinsurance strategies minimising the probability of ruin. Yang and Zhang (Citation2005) derived the optimal reinsurance-investment strategy maximising the expected utility of terminal wealth under a jump-diffusion model. Chen, Li, and Li (Citation2010) studied the reinsurance-investment problem with a dynamic value-at-risk (VaR) constraint and got the optimal strategy minimising the probability of ruin via the dynamic programming technique and Lagrange multiplier method. More recent work on minimising ruin probability and maximising utility includes Yi, Li, Viens, and Zeng (Citation2013), Liang and Bayraktar (Citation2014) and Xu, Zhang, and Yao (Citation2017), among others.

Since the pioneer work of Markowitz (Citation1952), the mean-variance optimisation problem has become a key topic in modern portfolio selection theory. By embedding the original mean-variance problem into a linear-quadratic (LQ, for short) control problem, Li and Ng (Citation2000) and Zhou and Li (Citation2000) derived the pre-commitment optimal solution to the dynamic mean-variance problem in a multi-period model and a continuous-time model, respectively. Bäuerle (Citation2005) studied the optimal proportional reinsurance problem under the benchmark and the mean-variance criterion. Bai and Zhang (Citation2008) considered the optimal reinsurance/new business and investment problem with no-shorting constraint under the mean-variance criterion. They considered both classical and diffusion risk models in the Markovian framework and solved the problem through the viscosity solution to the Hamilton–Jacobi–Bellman equation. Similarly, with the help of stochastic LQ control theory and viscosity solution, Bi (Citation2013) studied the optimal investment and reinsurance problem for an insurer under the mean-variance criterion with non-negative constraint on the strategies in a jump-diffusion financial market. By using the martingale method, Bi, Meng, and Zhang (Citation2014) investigated the mean-variance optimal investment and reinsurance problem with bankruptcy prohibition. For more recent work on mean-variance reinsurance/new business and investment problem, we refer the reader to Shen and Zeng (Citation2015), Zeng, Li, and Gu (Citation2016), Wang, Wang, and Wei (Citation2019), etc.

In this paper, we study the mean-variance reinsurance/new business and investment problem under a non-Markovian regime-switching model. It is well-known that in the Markov regime-switching model the coefficients are deterministic functions of a Markov chain. Due to its flexibility, the Markov regime-switching model is usually used to capture the business cycle and changes in the environment, etc. Zhang and Siu (Citation2012) studied the optimal proportional reinsurance and investment problem with no short-selling constraint in a Markovian regime-switching models. Chen and Yam (Citation2013) considered the mean-variance reinsurance-investment problem under a regime-switching model by using the similar method in Zhou and Yin (Citation2003). In contrast to the Markov regime-switching model, the coefficients are stochastic processes adapted to the filtration generated by a Markov chain (or jointly by a Markov chain and a Brownian motion, see Siu & Shen, Citation2017) in the non-Markovian regime-switching model. The advantage of this kind of model is that it may capture the path-dependence and memory effect in the financial market, since the parameters depend on not only the current state but also the historical information of the Markov chain. Under non-Markovian regime-switching models, Shen, Wei, and Zhao (Citation2018) investigated the mean-variance asset-liability management problem; Wang and Wei (Citation2019) studied the mean-variance portfolio selection problem via mean-field formulation.

We assume that the insurer can purchase proportional reinsurance and access a financial market consisting of a riskless asset and multiple risky assets. The claim process of the insurer and the price processes of the risky assets are correlated and modelled by drifted and geometric Brownian motions, respectively. The coefficients in the model depend on the historical information of a Markov chain. Similar to Bai and Zhang (Citation2008), we impose the non-negative constraint on the retention level and the investment strategy. Since our model is non-Markovian, the viscosity method used in their paper fails. Instead, we follow the method proposed by Hu and Zhou (Citation2005) which studied a stochastic LQ control problem with control constrained in a cone. By using a backward stochastic differential equation (BSDE, for short) driven by the Markov chain, we obtain the optimal strategy and efficient frontier in closed-form. We also study the Markov regime-switching model as a special case, where the BSDE degenerates to a system of ordinary differential equations (ODE, for short). It is worthy of noting that under the Markov regime-switching model, Chen and Yam (Citation2013) gave a condition under which the optimal reinsurance and investment strategy without constraints are indeed non-negative. We show that our results are consistent with theirs.

The remainder of this paper is organised as follows. In Section 2, we introduce some notations and formulate the mean-variance problem. In Section 3, we show the main results of the paper. In Section 4, we consider two special cases with one risky asset and Markovian regime-switching model. Finally, Section 5 concludes the paper.

2. The model

2.1. Notation

Let [0,T] be a fixed time duration, where 0<T<+. Let (Ω,F,F,P) be a filtered probability space on which an (n+1)-dimensional standard Brownian motion W():=(W0(),W1(),W2(),,Wn()) and an irreducible continuous-time Markov chain α() with finite states are defined. Here, the filtration F{Ft}0tT is the augmentation under P of the natural filtration generated by W() and α(). We also assume that the Brownian motion and the Markov chain are independent of each other.

Without loss of generality, let M={e1,,em} be the state pace of α(), where ei=(0,,1,,0), i=1,,m are the ith unit column vectors in Rm. Let Q(t)=(qij(t))m×m be the Q-matrix of the Markov chain α(t) at time t. We assume that the entries in Q() are uniformly bounded and continuous. Denote by G{Gt}0tT the augmentation under P of the natural filtration generated by the Markov chain α().

From Elliott, Aggoun, and Moore (Citation2008, Appendix B), we have the following semi-martingale representation of the Markov chain α(): (1) α(t)=α(0)+0tQ(s)α(s)ds+M(t),(1) where M() is a martingale valued in Rm.

Similar to Cohen and Elliott (Citation2008), we define ψ(t):=diagQ(t)α(t)Qtdiagα(t)diagα(t)Qt. For a matrix A with proper dimension, we denote Aα(t)2:=Tr(Aψ(t)A).

Throughout this paper, we will use the same notations as in Hu and Zhou (Citation2005). We list here for the reader's convenience. Denote the inverse of a non-singular square matrix A by A1 and the n-dimensional identity matrix by In. The transpose and the norm of any vector or matrix A are denoted by A and |A|=i,jaij2, respectively. If AR+k×l, it means that A is a k×l-dimensional real matrix in which all entries are non-negative. We denote the set of symmetric n×n (square) matrices by Sn. We list the following spaces about random variables or stochastic processes involved in this paper. For H=F,G, a positive integral number k, R=Rk,R+k,R+k{0},Sn, etc., let LH2s,t;R:=φ:Ω×s,tRφstφτ2dτis H-adapted and Estφτ2dτ<,SH2s,t;R:=φ:Ω×s,tRφsupτs,tφτ2is H-adapted, RCLL andEsupτs,tφτ2<,LG,α2s,t;R:=φ:Ω×s,tRφstφτατ2dτis H-adapted and Estφτατ2dτ<,LHs,t;R:=φ:Ω×s,tRφesssupτ,ωs,t×Ωis H-adapted and esssupτ,ωs,t×Ωφτ<,LH2Ω;Cs,t;R:=φ:Ω×s,tRφsupτs,tφτ2is H-adapted, continuous andEsupτs,tφτ2<.ALH2(s,t;Sk) is called uniformly positive definite, if there exists a deterministic constant c>0 such that A(τ,ω)>cIn for a.e. τ[s,t] and P-a.s.. For any real number we define xy:=max{x,y}, xy:=min{x,y}, and especially, x+:=x0 and x:={x0}.

2.2. Mean-variance reinsurance and investment problem

Assume that the insurer is allowed to invest the surplus into a financial market, consisting of a risk-free asset and n risky assets. The price of the risk-free asset S0() satisfies (2) dS0t=rtS0tdt,t[0,T],S00=s0>0,(2) where the interest rate r()>0 is a deterministic, uniformly bounded, scalar-valued function. For k=1,2,,n, the price of the kth risky asset Sk() is given by (3) dSkt=Sktμktdt+l=1nσkltdWlt,t[0,T],Sk0=sk>0,(3) where μk()(>r()) is the expected return rate of the kth risky asset and σk():=(σk1(),,σkn())R+n{0} is the volatility rate. β(t):=(μ1(t)r(t),,μn(t)r(t)) is usually called the risk premium vector at time t. Usually, β()R+n{0} are more popular and reasonable for every investor or insurer. For convenience, denote by σ():=(σ1(),,σn())=(σkl())n×nRn×n in following.

Following Promislow and Young (Citation2005), we model the claim process C(t) according to a Brownian motion with drift as (4) dCt=atdtb¯tdWt,(4) where a(), b¯():=(b0(),b1(),,bn()) are bounded G-adapted stochastic processes. We assume that a()>0, b0()>0, b():=(b1(),,bn())R+n. In this paper, the claim process C() and the risky assets are correlated while they are independent to each other in Chen and Yam (Citation2013).Footnote1

We assume that the premium is paid continuously at rate c0()=(1+θ())a() with safety loading θ()>0. Then before introducing reinsurance and investment, the surplus process U() is given by (5) dUt=c0tdtdCt=θtatdt+b¯tdWt.(5) To control the original insurance risk, we assume the insurer can purchase proportional reinsurance or acquire new business as described in Bäuerle (Citation2005) and Bai and Zhang (Citation2008). Let the retention level for the original insurer/ceded company at time t be q(t)[0,1). Then for the claim Y arriving at time t, the part of the claim the insurer pays is q(t)Y and that paid by the reinsurance/ceded company is (1q(t))Y. In this paper, we only consider the cheap reinsurance. Then, the insurer pays reinsurance premiums continuously at rate c1(t)=(1+θ(t))a(t)(1q(t)). After the purchase of reinsurance, the surplus process U(t) becomes (6) dUt=c0tdtqtdC(t)c1tdt=atθtqtdt+qtb¯tdW(t).(6) Since q(t)[1,+) can be interpreted as acquiring new business, we restrict the reinsurance strategy q(t)[0,+) in this paper.

Let πk(t) be the amount of the insurer's wealth invested in the kth risky asset at time t. In this paper, short-selling of the risky assets is not allowed, i.e. it must be satisfied that πk(t)0,t[0,T], k=1,,n. The process π():=(π1(),,πn())R+n is called an investment portfolio of the insurer at the risk financial market. Then the joint strategy of the reinsurance and risk investment can be described by a (n+1)-dimensional stochastic process, denoted by u():=(q(),π()). Let X():=Xu():=X(q,π)() be the wealth of the insurer, who adopts the reinsurance strategy q() and the investment portfolio π(). Given an investment portfolio π(), the amount of the insurer's wealth invested in the free-risk asset can be determined by π0():=X()k=1nπk(). Therefore, for an initial wealth x0>0, the dynamics of the wealth process X() is given by the following stochastic differential equation: (7) dXt=rtXt+utBtdt+utΣtdWt,t[0,T],X0=x0>0,(7) where Σt:=b0tbt0σtRn+1×n+1andBt:=atθt,βtRn+1.

Assumption 2.1

θ(),a(),b0(),μk(),σkl()LG([0,T];R+{0}), bk()LG([0,T];R+),k,l=1,,n, are dt×dP-a.s. predictable processes. σ() is uniformly non-degenerate, i.e. there exists a constant δ>0 such that (8) σ(t)σtδIn, t[0,T],P-a.s.(8)

Remark 2.1

By some elementary matrix operations, we can show the following results under Assumption 2.1:

  1. For all t[0,T], σ(t)σ(t) and Σ(t)Σ(t) are positive definite;

  2. There are two positive constants ρˆ and ρˇ such that for a.e. t[0,T] and P-a.s. (9) ρˆυ2υΣtΣtυρˇv2, υRn+1.(9)

Definition 2.2

A strategy u()=(q(),π()) is admissible if q,πLF20,T;R+×LF20,T;R+n. We denote by A the set of all admissible strategies.

Similar to Hu and Zhou (Citation2005), we consider the following mean-variance optimisation problem.

Definition 2.3

The mean-variance reinsurance and investment problem is a constrained stochastic optimisation problem such that for any given zx0e0Tr(t)dt: (10) minmizeJMVx0,u:=EX(T)z2=EX(T)2z2,subject toEX(T)=z,uA.(10) Moreover, the problem is called feasible if there is at least one strategy satisfying all the constraints of (Equation10). The problem is called finite if it is feasible and the infimum JMV(x0,u()) is finite.

Remark 2.2

The restriction of the targeted payoff zx0e0Tr(t)dt is natural as the latter can always be achieved by putting all the money in the bank and conducting proportional reinsurance policies with the retention ratio q(t)=0 for all t[0,T] (i.e. the insurer transfers all of the claim to the reinsurer).

Similar to Hu and Zhou (Citation2005, Theorem 6.1), if we choose υ()=(a()θ(),0) and adopt a family of admissible strategies uδ():=δυ(),δ0, by adjusting the coefficient δ0, it is easy to get the following proposition about the feasibility of the problem.

Proposition 2.4

The mean-variance reinsurance-investment problem (Equation10) is always feasible for every z[x0e0Tr(t)dt,+).

3. Solution to the problem

3.1. Preliminaries

In this section, some mathematical preliminaries are presented. In the subsequent analysis, a vital technical tool is Tanaka's formula.

Lemma 3.1

Tanaka's formula

Let X(t) be a continuous semi-martingale. Then dX+(t)=1(X(t)>0)dX(t)+12dL(t),dX(t)=1(X(t)0)dX(t)+12dL(t), where L() is an increasing continuous process, called the local time of X() at 0, satisfying (11) 0tX(s)dL(s)=0,P-a.s.(11)

For (t,υ(t))[0,T]×Rn+1, define (12) H+t,υt:=υtΣtΣtυt+2υtBt,Ht,υt:=υtΣtΣtυt2υtBt,(12) Let (13) H+t:=minυR+n+1H+t,υt,Ht:=minυtR+n+1Ht,υt.(13) (14) ν+t:=ζ+t,ξ+t:=argminvtR+n+1H+t,vt,νt:=ζt,ξt:=argminvtR+n+1Ht,vt,(14) where ζ+(),ζ()R+ and ξ+(),ξ()R+n. For the convenience of analysis, υ() is sometimes partitioned as υ()=(κ(),υ1()) where κ()R+ and υ1()=(υ1(),,υn())R+n.

Lemma 3.2

Under Assumption 2.1. we have for a.e. t[0,T] and P-a.s. (15) H+tH+t,ν+t=0,ν+t=0(15) and (16) H0tHtinfvtR+n+1{0},vt2C1ρˆHt,vtBt2ρˇ,(16) where H0(t):=B(t)(Σ(t)Σ(t))1B(t) and C1:=n+11/2esssupt,ω0,T×Ωatθt,μ1trt,,μntrt.

Proof.

Since B(t)R+n+1{0} and Σ(t)Σ(t) is positive definite (see Remark 2.1), it is easy to find that H+(t)=minυR+n+1H+(t,υ)0 and the equality holds if and only if υ(t)=0. Therefore (Equation15) is obtained.

For the second part of the lemma, firstly by noting B()R+n+1{0}, υ()R+n+1 and (Equation9), we obtain for a.e. t[0,T] and P-a.s., υtBt=υtBt=atθtκtl=1n+l=1nμltrtυltn+1a2tθ2tκ2t+l=1nμltrt2υl2tC1υt, where the last inequality is from that C1=n+11/2esssupt,ω0,T×Ωatθt,μ1trt,,μntrt>0. If |υ(t)|>2C1/ρˆ, we have for a.e. t[0,T] and P-a.s., Ht,υt=υtΣtΣtυt2υtBt=υtΣtΣtυt2υtBtρˆυt2C1ρˆυt>0 and infυ(t)R+n+1,|υ(t)|>2C1/ρˆHt,υtinf|υ(t)|>2C1/ρˆHt,υt>0. But H(t)H(t,0)=0. Thus the minimum value H(t) will be obtained somewhere in the bounded range {υ(t)R+n+1||υ(t)|2C1/ρˆ}, that is to say Htinfυ(t)R+n+1,|υ(t)|2C1/ρˆHt,υt. Since (17) Ht,υt=υtΣtΣtυt2υtBt=ΣtυtΣ1tBt2H0tH0t,for υtRn+1,(17) it is not difficult to get that H(t)H0(t). Moreover, by (Equation9), we have for a.e. t[0,T] and P-a.s. Ht,υt=υtΣtΣtυt2υtBtρˇυtvt2υtBt=ρˇυtBtρˇ2Bt2ρˇ Therefore (18) Ht=minυR+n+1Ht,vtHt,Btρˇ=Bt2ρˇ<0.(18) Moreover, considering H(t,v(t))|v(t)=0=0, we obtain (Equation16). Consequently, by recalling Assumption 2.1, H(t) is finite.

Remark 3.1

If the claim process is independent of the risky assets, i.e. b(t)=0, it is easy to see Ht,vt=q2tκtatθtqt2+υtβtσtσt1σt2a2tθ2tq2tβtσtσt1βta2tθ2tq2t+βtσtσt1βt. Then Ht=a2tθ2tq2t+βtσtσt1βt,νt=ζt,ξt=atθtqt,σtσt1βt. Given ν(), we consider the SDE (19) dYt=rtYt+YtνtBtdt+YtνtΣtdWt,t[0,T],Y0=y0.(19)

Lemma 3.3

SDE (Equation19) has a unique solution (20) Yt=y0+e0tr(s)dsy0φte0tr(s)dsLF2Ω;C0,T;R,(20) where (21) φt:=exp0tνsBs12νsΣsΣsνsds+0tνsΣsdWs.(21)

Proof.

Firstly, we introduce two linear SDEs as follows (22) dY+(t)=rtY+tdt,t[0,T],Y+0=y0+,(22) and (23) dY(t)=rtνtBtYtdtYtνtΣsdWt,t[0,T],Y0=y0.(23) It is easy to get the unique solution to (Equation22) Y+t:=y0+e0trsds. Considering Assumption 2.1, it is easy to obtain that r()ν()B()LG(0,T;R) and ν()Σ()LG(0,T;R). It is well-known that the unique continuous F-adapted solution to (Equation23) is given by (24) Yt:=y0exp0trs+νsBs12νsΣsΣsνsds+0tνsΣsdWs.(24) Define Yt:=Y+tYt. The fact that Y+(t)0, Y(t)0 and Y+(t)Y(t)=0 implies that Y+t=Y+(t),Yt=Y(t), t0,T. By applying Itô formula to Y(t):=Y+(t)Y(t), the resulted SDE is exactly (Equation19).

If we can prove the uniqueness of the solution, then the proof of the lemma will be accomplished. To this end, first suppose that Y1() and Y2() are two continuous adapted solution to SDE (Equation19). Denote Yˆ():=Y1()Y2(). Now in order to generate a linear SDE for Yˆ(), the following procedure was implemented. Set γ+t:=Y1+tY2+tY1tY2t1{Y1tY2t},γt:=Y1tY2tY1tY2t1{Y1tY2t}. Then Yˆ() is a continuous adapted solution to the following linear SDE: dYˆt=rt+γtνsBsYˆtdt+γtνsΣsYˆtdWt,t[0,T],Yˆ0=0. Therefore, Yˆ()=0, which implies that Y()=Y+()Y() is the unique the solution to SDE (Equation19).

Next, we introduce the following backward stochastic differential equation driven by the Markov chain α() (25) dPt=ft,Ptdt+ΨtdMt,t[0,T]P(T)=1,(25) where f(t,P(t)):=(2r(t)+H(t))P(t).

By Lemma 3.2 and Cohen and Elliott (Citation2010, Theorem 3.11), we have the following result:

Lemma 3.4

There exists a unique solution (P(),Ψ())SG2(0,T;R)×LG,α2(0,T;Rm) to the BSDE (Equation25) such that (26) Pt=e2tTr(s)dsEtetTH(s)ds,t0,T,(26) and K1P()K2, for some constants K1,K2>0.

3.2. Main results

In this section, we give explicit solutions to the mean-variance reinsurance-investment problem (Equation10) in terms of the solution to the BSDE (Equation25).

It is well-known that the Lagrange multiplier method is the standard method to cope with the constraint equation EX(T)=z on the mean-variance problem (Equation10). By inserting the Lagrange multiplier λR, we can consider the following cost functional: (27) Jx0,u,λ:=EXT2z22λX(T)z=EXTλ2λz2,λR.(27) Similar to Hu and Zhou (Citation2005), we may first solve the following unconstrained problem parameterized by the Lagrange multiplier λR: (28) minmizeJx0,u,λ=EXTλ2λz2subject toEXT=z,uA.(28) Define J(x0,λ):=infu()AJ(x0,u(),λ).

Theorem 3.5

Let (P(),Ψ())SG2(0,T;R)×LG,α2(0,T;Rm) be the solution to BSDE (Equation25). Then the strategy (29) ut=νtX(t)λetTr(s)ds(29) is optimal for the problem (Equation28). Moreover, the corresponding cost is (30) Jx0,λ=e20Tr(s)dsx0λe0Tr(s)ds+2+P0x0λe0Tr(s)ds2λz2.(30)

Proof.

First note that Lemma 3.4 ensures that (Equation25) has an unique bounded, uniformly positive solution (P(),Ψ())SG2(0,T;R)×LG,α2(0,T;Rm). Let X() be the solution to (Equation7) under an arbitrary admissible strategy u()=(q(),π()). Noting that the admissible strategy satisfies u()LF2(0,T;Rn+1), by the theory of stochastic differential equation (SDE, for short), there is a unique solution X()LF2(Ω;C(0,T;R)) for the SDE (Equation7).

Set Y(t)X(t)λetTr(s)ds. By Itô's formula and (Equation7), we have (31) dY(t)=r(t)Y(t)+utBtdt+utΣtdWt,t[0,T],Y0=x0λe0Tr(s)ds.(31) Therefore, the cost function (Equation27) can be written as Jy0,u=EY(T)2λz2. Applying Tanaka's formula (3.1) to Y(), we have (32) dY+t=r(t)Y+(t)+1Y(t)>0utBtdt+1(Y(t)>0)utΣtdWt+12dL(t),(32) and (33) dYt=r(t)Y(t)1(Y(t)0)utBtdt1(Y(t)<0)utΣtdWt+12dL(t),(33) where L() is the local time of Y() at 0. Applying Itô's formula, we get (34) dY+t2=2r(t)Y+t2+2Y+(t)utBt+1Y(t)>0utΣtΣtutdt+2Y+tutΣtdWt,(34) where we have used the fact that Y+(t)dL(t)=0 by virtue of (Equation11). Similarly, we have (35) dYt2=2r(t)Yt22Y(t)utΣtΣt+1(Y(t)0)utΣtΣtutdt2YtutΣtdWt.(35) Applying Itô's formula to e2tTr(s)ds(Y+(t))2, we have (after some reorganisation) (36) de2tTrsdsY+t2=e2tTr(s)ds2Y+(t)utΣt+1(Y(t)>0)utΣtΣtutdt+2e2tTrsdsY+tutΣtdWt.(36) Similarly, we can derive (37) dP(t)Yt2=H(t)P(t)Yt22P(t)Y(t)utBt+1(Y(t)0)P(t)utΣtΣtutdt2P(t)YtutΣtdWt+Y(t)2ΨtdM(t).(37) Next, we define, for any integer n¯1, the following stopping time ιn¯ as follows (38) ιn¯:=inft00t2e2tTrsds×Y+susΣt2ds0T+0t2P(s)YsusΣt2ds+0TYs4Ψsαs2dsn¯T,(38) where inf:=T. Obviously, ιn¯, n¯1, is an increasing sequence of stopping times converging to T almost surely.

Summing (Equation36) and (Equation37), we get (39) de2tTr(s)dsY+t2+P(t)Yt2=1(Y(t)>0)e2tTrsds+1(Y(t)0)Pt×utΣtΣtut+2e2tTrsdsY+(t)PtY(t)×utBtHtPtYt2dt+2e2tTrsdsY+(t)PtY(t)×utΣtdWt+Yt2ΨtdMt.(39) Integrating the above formula from 0 to ιn¯, and then taking expectation, after arranging we get (40) Ee2ιn¯TrsdsY+ιn¯2+P(ιn¯)Yιn¯2=Ee20Tr(s)dsY+02+P0Y02+E0ιn¯ϕYt,utdt,(40) where (41) ϕYt,ut:=1(Y(t)>0)e2tTr(s)ds+1(Y(t)0)P(t)×utΣtΣtut+2e2tTrsdsY+(t)P(t)Y(t)×utBtH(t)P(t)Yt2.(41) Now let us send n¯. In addition, noticing that Y()LF2(Ω;C(0,T;R)), X()LF2(Ω;C(0,T;R)), and K1P()K2, applying the dominated convergence theorem, from (Equation27) we get (42) Jx0,u,λ=limn¯Ee2ιn¯TrsdsY+ιn¯2+P(ιn¯)Yιn¯2λz2=Ee20Trsdsy0+2+P0y02+E0TϕYt,ut)dtλz2.(42) The next step is to show that ϕ(Y(t),u(t)))0 for any t[0,T]. Specifically, the analysis is as follows.

If Y(t)>0 for some t, then set u(t)=Y(t)v(t). In this case, notice u(t)R+n+1 if only if υ(t)R+n+1. Fixing t given before, substituting Y(t)v(t) for u(t), and noticing the definition of H+(t), then ϕYt,ut=e2tTrsdsutΣtΣtut+2Y+(t)utBt=e2tTrsdsvtΣtΣtvt+2vtBtY2t=e2tTrsdsH+t,vtY2te2tTrsdsH+tY2t=0, where the equality holds at v(t)=ν+(t)=(0,0), t[0,T]. Then, the equality in the above holds if and only if the strategy takes ut=0R+n+1. If Y(t)<0 for some t, then set u(t)=Y(t)v(t). In this case, ϕYt,ut=PtutΣtΣtut2P(t)YtutBtH(t)PtYt2=PtvtΣtΣtvtY2t2PtvtBtY2tH(t)P(t)Y2t=PtvtΣtΣtvt2vtBtH(t)Y2t=PtHt,vtHtY2t0, where the equality holds at ut=YtνtR+n+1. Finally, when Y(t)=0, then ϕYt,ut=PtutΣtΣtut0 with the equality if and only if u(t)=0.

According to the above analysis, we have ϕYt,utϕYt,ut=0. Combining the above analysis and (Equation42), we find that for all admissible strategies u(t)A, (43) Jx0,ut,λEe20Trsdsy0+2+P0y02λz2(43) with the equality sign holding if and only if the strategy u(t) adopts u(t) as follows (44) ut=Y+tν+t+Ytνt=Ytνt,(44) i.e. the expression (Equation29). Thus (Equation30) follows from that Jx0,λ=Jx0,ut,λ=e20Trsdsy0+2+P0y02λz2, where y0=x0λe0Tr(s)ds.

According to the definition of the admissible strategy, we need to show that the strategy u() defined by (Equation29) should belong to LF2(0,T;Rn+1) if it is admissible. Noting the definition of ν(), and Assumption 2.1 and Remark 2.1, we have (45) νtvtR+m+1vt2C1ρˆ.(45) Therefore ν() is uniformly bounded.

Now, under the state feedback strategy (Equation44), the stochastic differential equation (Equation31) is written as (46) dY(t)=r(t)Y(t)+YtνtBtdt+YtνtΣtdW(t),t[0,T],(46) with Y(0)=y0. By Lemma 3.3, (Equation46) indeed admits a unique solution Y()LF2(Ω;C(0,T;R)). Recalling that ν+()=0, it is easy to see that u()=Y()ν()LF2(0,T;R+n+1).

Remark 3.2

It is easy to find that (47) supλRJx0,λ=supλ[x0e0Tr(s)ds,+)Jx0,λ,(47) from (Equation30).

Now, we can show the optimal strategy and efficient frontier of the problem (Equation10).

Theorem 3.6

Let (P(),Ψ())SG2(0,T;R)×LG2(0,T;Rm) be the unique solution to the BSDE (Equation25). Then the efficient strategy corresponding to zx0e0Tr(s)ds, as a feedback of the wealth process, for the problem (Equation10) is (48) ut=λetTr(s)dsXtνt,(48) where (49) λ:=zx0P0e0Tr(s)ds1P0e20Tr(s)ds=zx0e0Tr(s)dsEe0THsds1Ee0TH(s)ds.(49) Moreover, the efficient frontier is (50) VarXT=P0e20Tr(s)ds1P0e20Tr(s)ds×EXTx0e0Tr(s)ds2,EXTx0e0Tr(s)ds.(50)

Proof.

It follows from (Equation16) that 1P(0)e20Tr(s)ds=1E(e0TH(s)ds)>0. Thus λ in (Equation49) is well defined.

First, we intend to directly solve the problem (Equation10) for z=x0e0Tr(s)ds. Let X() be the wealth process corresponding to the efficient strategy u(). By linearity, it is easy to see that X()=X0()+X1() where X0() and X1() are given by as follows, respectively (51) dX0(s)=X0(s)r(s)ds,s0,T,X00=x0>0,(51) and (52) dX1s=rsX1s+ussBsds+ussΣsdWs,s0,T,X10=0.(52) Applying Itô's formula, for s[0,T], we obtain de0sr(τ)dτX1(s)=e0sr(τ)dτusBsds+e0sr(τ)dτus×ΣsdWs. Integrating from 0 to T and taking expectation, it yields that (53) X1t=0testr(τ)dτusBsds+0testr(τ)dτusΣsdWs.(53) Noting X0(t):=x0e0tr(s)ds we have (54) Xt=X0t+X1t=x0e0tr(s)ds+0testr(τ)dτusBsds+0testr(τ)dτusΣsdWs(54) and (55) EXT=x0e0Tr(s)ds+E0TetTr(s)dsutBtdt.(55) So if z=x0e0Tr(s)ds, under the constraint EX(T)=z in (Equation10), by (Equation55), we immediately obtain that the corresponding efficient strategy u(t)0 which means that all the wealth to be put in the bank (i.e. the risk-free asset) and all the risk of the insurance business to be passed to the reinsurer. By (Equation54) and u(t)=0, X(t)=x0e0tr(s)ds. Obviously it is easy to get the corresponding variance VarX(T)=0.

Putting z=x0e0Tr(s)ds, X(t)=x0e0tr(s)ds into (Equation49) and (Equation48), we can easily obtain that λ=x0e0Tr(s)ds and u(t)=0. Moreover, by substituting x0e0Tr(s)ds for EX(T) in (Equation50), we have VarX(T)=0. That is to say that, according to Definition 2.3, (Equation50) and (Equation48) are indeed the efficient frontier and the efficient strategy corresponding to z=x0e0Tr(s)ds. In the following, we only consider for any fixed z>x0e0Tr(s)ds.

As described in Hu and Zhou (Citation2005)), by applying the duality theorem, we have for z>x0e0Tr(s)ds, JMVx0=infq,πLF20,T;R+n+1Jx0,q(),π()=supλRinfq,πLF20,T;R+n+1Jx0,q(),π(),λ=supλRJx0,λ=supλx0e0Tr(s)ds,+Jx0,λ, where the last equality is from (Equation47). Recalling (Equation30), by making use of method of completing the square, we have, for any λ[x0e0Tr(s)ds,+), Jx0,λ=P0e20Tr(s)ds1λ22x0P0e0Tr(s)dszλ+P0x02z2=P0e20Tr(s)ds1×λzx0P0e0Tr(s)ds1P0e20Tr(s)ds2+P0e20Tr(s)ds1P0e20Tr(s)ds×zx0e0Tr(s)ds2. Considering (Equation16), under the constrained condition z=EX(T), it is apparent that for zx0e0Tr(s)ds, VarXT=JMVx0=supλx0e0Tr(s)ds,+Jx0,λ=Jx0,λ=P0e20Tr(s)ds1P0e20Tr(s)ds×EXTx0e0Tr(s)ds2, where λ:=(zx0P(0)e0Tr(s)ds)/(1P(0)e20Tr(s)ds)(x0e0Tr(s)ds,+). As H(s)<0, we also can get for z>x0e0Tr(s)ds, y0=x0λe0Tr(s)ds=x0zx0P0e0Tr(s)ds1P0e20Tr(s)dse0Tr(s)ds=ze0Tr(s)dsx01Ee0THsds0. Then for z>x0e0Tr(s)ds, considering (Equation20) in Lemma 3.3, we have (56) X(t)λetTr(s)ds=Yt=y0+e0trsdsy0φt=y0φt=y0φt0.(56) By Theorem 3.5 and substituting λ for λ in (Equation29), we have ut=νtX(t)λetTr(s)ds=νtλetTr(s)dsX(t). The proof is completed.

4. Two special cases

4.1. One risky asset

In this section, we consider n=1, i.e. there is only one risky asset. Then (Equation3), (Equation4) and (Equation7) are rewritten as follows dS1t=S1tμ1tdt+σ11tdW1t,t[0,T],S10=s1>0, dCt=atdtb0t,b1tdWt, and dXt=rtXt+utBtdt+utΣtdWt,t[0,T],X0=x0>0, where W(t)=(W0(t),W1(t)). In this case, we have, H0t=BtΣtΣt1Bt=atθtb0tb1tμ1trtb0tσ11t2+μ1trt2σ112t>0. From Proposition 3.2, we have H+(t)=H+(t,ν+(t))=0, where ν+(t)=(0,0). In the next Lemma 4.1, we get the minimum value H(t) and ν(t)=(ζ(t),ξ(t)) for the case with n = 1.

Lemma 4.1

  1. If b1(t)>0, t[0,T], then (57) H=μ1rσ112,ν=0,μ1rσ112,if 0<θ<b1μ1raσ11,H=H0,ν=aθσ11b1μ1rb02σ11,b02+b12μ1rb1aθσ11b02σ112,if b1μ1raσ11θ<b12+b02μ1rb1aσ11,H=a2θ2b12+b02,ν=aθb12+b02,0,if θb12+b02μ1rb1aσ11,(57) where we have suppressed the variable t;

  2. If b1(t)=0, t[0,T], we have for (58) Ht=H0t,νt=atθtb02t,μ1trtσ112t.(58)

Proof.

To get the the infimum of H+(t,υ) for υ:=(κ,υ1)R+2, we consider the the following optimisation problem minHt,υ=b0κaθb0+b1(μ1r)b0σ112+b1κ+σ11υ1μ1rσ112,s.t.κ0,υ10. By Karush–Kuhn–Tucker conditions, it holds that υR+2,λ:=λ1,λ2R+2,Ht,υκ+λ1=0,Ht,υυ1+λ2=0,λ1κ=0,λ2υ1=0, i.e. κ0,υ10,λ10,λ20,2b02+b12κ+b1σ11υ1aθλ1=0,2b1σ11κ+σ112υ1μ1rλ2=0,λ1κ=0,λ2υ1=0. It is easy to get υ=0,μ1rσ112,λ=2b1μ1rσ11aθ,0,if aθσ11b1μ1r,υ=aθσ11b1μ1rb02σ11,b02+b12μ1rb1aθσ11b02σ112,λ=0,0,if aθσ11>b1μ1randb1aθσ11<b02+b12μ1r,υ=aθb02+b12,0,λ=0,2aθb1σ11b02+b12μ1r,if aθb1σ11μ1rb02+b12. The proof is completed.

Through applying Lemma 4.1 to Theorem 3.6 and 4.2, we can obtain the explicit expression of the optimal strategy in the case of one risk-free asset.

4.2. Markovian regime-switching model

In this section, we consider the Markovian regime-switching model, that is to say we suppose that all of random coefficients mentioned before in the paper are functions of the state of the Markov chain. To be precise, let θt=θ~t,αt,νt=ν~t,αt,at=a~t,α,b¯(t)=b¯~t,αt,Bt=B~t,αt,σ(t)=σ~t,αt, where θ~(,), ν~(,),a~(,),b¯~(,),B~(,),σ~(,), are deterministic and bounded. Then other notations are changed correspondingly, for instance, Σ(t)=Σ~(t,α(t)) and Ht,υt=H~t,αt,υt:=υtΣ~t,αtΣ~t,αtυt2υtB~t,αt,t,αt,υt[0,T]×M×Rn+1. For the BSDE (Equation25), then we have ft,P(t)=f~t,αt,P(t):=2r(t)+H~t,αtPt, where H~t,αt:=minυtAH~t,αt,υt,t,αt0,T×M. In the following, to ease the presentation, sometimes we simplify the notations slightly by omitting the tilde character above those deterministic bounded functions.

By Cohen and Szpruch (Citation2012), there exists a unique function F(t):=(F1(t),,Fm(t)) such that (59) Pt,Ψt=αtFt,Ft,(59) and (60) Ftt=gt,FtQtFt,t0,T,FT=1,(60) where Ft(t)=(F1,t(t),,Fm,t(t))=(dF1(t)/dt,,dFm(t)/dt) and gt,Ft=ft,e1,F1t,,ft,em,Fmt=diag2rt+Ht,e1,,2rt+Ht,emFt. System (Equation60) is a homogeneous linear system of variable-coefficient first-order ordinary differential equations with continuous coefficients. Then, by the classical theory in the ODEs (see, e.g. Walter, Citation1998, P. 162), indeed there is exactly one solution to the system (Equation60). If r(), H(,ei), Q(), i=1,,m are constants. Let Θ:=(Θij)m×m, where Θij:=qij,ji,2rH,ei,j=i. Then system (Equation60) has the following unique solution Ft=kckvkeλkt, where λk, vk are the eigenvalues and corresponding eigenvectors of Θ and the constants ck is determined by kckvkeλkT=1.

Applying (Equation59) to (Equation48) and (Equation50) in Theorem 3.6, it easily to get the following result.

Theorem 4.2

The efficient strategy corresponding to zx0e0Tr(s)ds, as a feedback of the wealth process, for the problem (Equation10) is (61) ut,Xt=zx0α0F0e0Tr(s)ds1α0F0e20Tr(s)ds×etTr(s)dsX(t)zx0α0F0e0Tr(s)ds1α0F0e20Tr(s)dsνt,(61) where νt,ei:=argminυtAHt,ei,υt,t,i0,T×M and the efficient frontier (62) VarXT=α0F0e20Tr(s)ds1α0F0e20Tr(s)ds×EXTx0e0Tr(s)ds2.(62)

Remark 4.1

For the case with the one risky asset (i.e. n = 1), by Theorem 4.2, the efficient strategy corresponding to z=EX(T)x0e0Tr(s)ds is (Equation61) with ν=0,μ1rσ112,if aθσ11b1μ1r,b02+b12μ1rb1aθσ11b02σ112aθσ11b1μ1rb02σ11,b02+b12μ1rb1aθσ11b02σ112,if aθσ11>b1μ1rand b1aθσ11<b02+b12μ1r,aθb12+b02,0,if aθb1σ11μ1rb02+b12, and the efficient frontier is (63) VarXT=α0F0e20Tr(s)ds1α0F0e20Tr(s)ds×zx0e0Tr(s)ds2.(63)

Remark 4.2

If the interest rate in Chen and Yam (Citation2013) is deterministic and b()=0 in our paper, then the models in our paper are the same. In this case, H,ei=H0,ei=μ1,eir2σ112,ei+a,eiθ,eib0,eib1,eiμ1,eirb0,eiσ11,ei2 and the optimal strategy obtained in Chen and Yam (Citation2013) is given by (Equation61) with νt=atθtb02t,μ1trtσ112t and Ftt=diag2rtH0t,e1,,2rtH0t,emQtFt,t0,T,FT=1. Under the deterministic interest rate, the assumption in Chen and Yam (Citation2013, Theorem 5.1), which becomes e0Tr0sdszx0e0Tr0sdsα0F0e20Tr0sds1<0, always holds. It follows from Remark 4.1 that our results are the same.

5. Conclusion

We have investigated an optimal proportional reinsurance and investment problem for an insurer under the mean-variance criterion. We assumed that the claim process of the insurer and the prices of risky assets are correlated and the coefficients (except the interest rate) in the model are stochastic processes adapted to the filtration generated by a Markov chain. Such a non-Markovian model can capture the path-dependence and memory effects in the financial market. Furthermore, we considered the non-negative constraint on the reinsurance and investment strategies. By solving a unconstrained optimisation problem parameterized by the Lagrange multiplier, we obtained the optimal strategy in terms of the unique solution to a BSDE driven by the Markov chain.

Some relevant problems are worthy of being discussed in further. First, we can assume that the interest rate is also a stochastic process. Second, we may consider the non-cheap reinsurance. Third, we only studied the pre-commitment strategy in this paper and hope to investigate the time-consistent equilibrium strategy in our future works.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Additional information

Funding

This work was supported by the 111 Project [grant number B14019] and the National Natural Science Foundation of China [grant numbers 11571113, 11601157, 11601320].

Notes on contributors

Liming Zhang

Liming Zhang is a Ph.D. candidate, School of Statistics, East China Normal University.

Rongming Wang

Dr Rongming Wang holds a Ph.D. from East China Normal University. He is now a professor at East China Normal University. His research interests include financial risk management, insurance actuarial and mathematical finance.

Jiaqin Wei

Dr Jiaqin Wei holds a Ph.D. from East China Normal University. He is now a research professor at East China Normal University. His research interests include actuarial science and mathematical finance.

Notes

1 For example, Credit Default Swap (CDS) is a popular credit derivative to enhance the credit ratings of the reference risky assets. Thus, the claim processes of insurers providing CDS protections are related to the financial risks.

References

  • Bai, L., & Zhang, H. (2008). Dynamic mean-variance problem with constrained risk control for the insurers. Mathematical Methods of Operations Research, 68(1), 181–205. doi: 10.1007/s00186-007-0195-4
  • Bäuerle, N. (2005). Benchmark and mean-variance problems for insurers. Mathematical Methods of Operations Research, 62(1), 159–165. doi: 10.1007/s00186-005-0446-1
  • Bi, J., & Guo, J. (2013). Optimal mean-variance problem with constrained controls in a jump-diffusion financial market for an insurer. Journal of Optimization Theory and Applications, 157(1), 252–275. doi: 10.1007/s10957-012-0138-y
  • Bi, J., Meng, Q., & Zhang, Y. (2014). Dynamic mean-variance and optimal reinsurance problems under the no-bankruptcy constraint for an insurer. Annals of Operations Research, 212(1), 43–59. doi: 10.1007/s10479-013-1338-z
  • Browne, S. (1995). Optimal investment policies for a firm with a random risk process: Exponential utility and minimizing the probability of ruin. Mathematics of Operations Research, 20(4), 937–958. doi: 10.1287/moor.20.4.937
  • Chen, S., Li, Z., & Li, K. (2010). Optimal investment–reinsurance policy for an insurance company with var constraint. Insurance: Mathematics and Economics, 47(2), 144–153.
  • Chen, P., & Yam, S. C. P. (2013). Optimal proportional reinsurance and investment with regime-switching for mean–variance insurers. Insurance: Mathematics and Economics, 53(3), 871–883.
  • Cohen, S. N., & Elliott, R. J. (2008). Solutions of backward stochastic differential equations on Markov chains. Communications on Stochastic Analysis, 2(2), 251–262. doi: 10.31390/cosa.2.2.05
  • Cohen, S. N., & Elliott, R. J. (2010). Comparisons for backward stochastic differential equations on Markov chains and related no-arbitrage conditions. The Annals of Applied Probability, 20(1), 267–311. doi: 10.1214/09-AAP619
  • Cohen, S. N., & Szpruch, L. (2012). On Markovian solutions to Markov chain bsdes. Numerical Algebra, Control and Optimization, 2(2), 257–269. doi: 10.3934/naco.2012.2.257
  • Elliott, R. J., Aggoun, L., & Moore, J. B. (2008). Hidden Markov models: Estimation and control. New York, NY: Springer.
  • Hu, Y., & Zhou, X. Y. (2005). Constrained stochastic LQ control with random coefficients, and application to portfolio selection. SIAM Journal on Control and Optimization, 44(2), 444–466. doi: 10.1137/S0363012904441969
  • Li, D., & Ng, W. L. (2000). Optimal dynamic portfolio selection: Multiperiod mean-variance formulation. Mathematical Finance, 10(3), 387–406. doi: 10.1111/1467-9965.00100
  • Liang, Z., & Bayraktar, E. (2014). Optimal reinsurance and investment with unobservable claim size and intensity. Insurance: Mathematics and Economics, 55, 156–166.
  • Markowitz, H. (1952). Portfolio selection. The Journal of Finance, 7(1), 77–91.
  • Promislow, S. D., & Young, V. R. (2005). Minimizing the probability of ruin when claims follow Brownian motion with drift. North American Actuarial Journal, 9(3), 110–128. doi: 10.1080/10920277.2005.10596214
  • Shen, Y., Wei, J., & Zhao, Q. (2018). Mean–variance asset–liability management problem under non-Markovian regime-switching models. Applied Mathematics and Optimization. doi:10.1007/s00245-018-9523-8.
  • Shen, Y., & Zeng, Y. (2015). Optimal investment–reinsurance strategy for mean–variance insurers with square-root factor process. Insurance: Mathematics and Economics, 62, 118–137.
  • Siu, T. K., & Shen, Y. (2017). Risk-minimizing pricing and esscher transform in a general non-Markovian regime-switching jump-diffusion model. Discrete and Continuous Dynamical Systems-series B, 22(7), 2595–2626. doi: 10.3934/dcdsb.2017100
  • Walter, W. (1998). Ordinary differential equations. New York, NY:Springer-Verlag.
  • Wang, H., Wang, R., & Wei, J. (2019). Time-consistent investment-proportional reinsurance strategy with random coefficients for mean–variance insurers. Insurance: Mathematics and Economics, 85, 104–114.
  • Wang, T., & Wei, J. (2019). Mean–variance portfolio selection under a non-Markovian regime-switching model. Journal of Computational and Applied Mathematics, 350, 442–455. doi: 10.1016/j.cam.2018.10.040
  • Xu, L., Zhang, L., & Yao, D. (2017). Optimal investment and reinsurance for an insurer under Markov-modulated financial market. Insurance: Mathematics and Economics, 74, 7–19.
  • Yang, H., & Zhang, L. (2005). Optimal investment for insurer with jump-diffusion risk process. Insurance: Mathematics and Economics, 37(3), 615–634.
  • Yi, B., Li, Z., Viens, F. G., & Zeng, Y. (2013). Robust optimal control for an insurer with reinsurance and investment under Heston's stochastic volatility model. Insurance: Mathematics and Economics, 53(3), 601–614.
  • Zeng, Y., Li, D., & Gu, A. (2016). Robust equilibrium reinsurance-investment strategy for a mean–variance insurer in a model with jumps. Insurance: Mathematics and Economics, 66, 138–152.
  • Zhang, X., & Siu, T. K. (2012). On optimal proportional reinsurance and investment in a Markovian regime-switching economy. Acta Mathematica Sinica, English Series, 28(1), 67–82. doi: 10.1007/s10114-012-9761-7
  • Zhou, X. Y., & Li, D. (2000). Continuous-time mean-variance portfolio selection: A stochastic LQ framework. Applied Mathematics and Optimization, 42(1), 19–33. doi: 10.1007/s002450010003
  • Zhou, X. Y., & Yin, G. (2003). Markowitz's mean-variance portfolio selection with regime switching: A continuous-time model. SIAM Journal on Control and Optimization, 42(4), 1466–1482. doi: 10.1137/S0363012902405583

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.