858
Views
0
CrossRef citations to date
0
Altmetric
Article

Probability weighting functions obtained from Hong Kong index option market

, &
Pages 1922-1943 | Received 22 Oct 2017, Accepted 09 Oct 2018, Published online: 27 Jul 2019

Abstract

In this paper we estimate the pricing kernel from the Hong Kong index option market and obtain the empirical probability weighting functions based on the rank-dependent expected utility. The empirical pricing kernel is estimated semi-parametrically as the ratio of the risk-neutral and objective densities. We employ a two-step estimation procedure to estimate the objective and risk-neutral densities under a consistent parametric framework of the non-affine generalised autoregressive conditional heteroskedasticity (G.A.R.C.H.) diffusion model. In the first step, we develop a continuous particle filters-based maximum likelihood estimation method to estimate the objective parameters of the G.A.R.C.H. diffusion model using the Hang Seng Index (H.S.I.) returns. In the second step of our estimation, we depart from the usual pure calibration approach and use the H.S.I. option prices to estimate the risk-neutral parameters of the G.A.R.C.H. diffusion model by constraining certain parameters to be consistent with the time-series behaviour of H.S.I. returns. Based on the estimated objective and risk-neutral parameters, the objective and risk-neutral densities are obtained by inverting the corresponding characteristic functions. Empirical results indicate that the empirical pricing kernel estimated from the Hong Kong index option market is non-monotonic and the estimated probability weighting functions are S-shaped, which implies that investors underweight small probability events and overweight large ones.

JEL CLASSIFICATIONS:

1. Introduction

The behaviour of market investors has always been in focus in the literature on financial economics. Naturally, it involves the pricing kernel (Rosenberg & Engle, Citation2002) or stochastic discount factor (Cochrane, Citation2001). In standard economic theory, the pricing kernel is monotonically decreasing in investor wealth or market return and corresponds to a positive risk aversion function. However, there has been a lot of discussion about the reliability of this theory. In the past decade, a large number of empirical studies have provided evidence of a non-monotonically decreasing pricing kernel, which has been referred to as the ‘pricing kernel puzzle’ or ‘risk aversion puzzle’ (Aït-Sahalia & Lo, Citation2000; Jackwerth, Citation2000; Rosenberg & Engle, Citation2002). Beare and Schmidt (Citation2016) and Golubev et al. (Citation2014) provide further empirical evidence of a non-monotonic pricing kernel by conducting formal statistical tests. Using data on an exchange traded fund replicating the S&P 500 index, Figlewski and Malik (Citation2014) also confirm the pricing kernel puzzle. Recently, Cuesdeanu and Jackwerth (Citation2016) use a novel test and forward looking information only to confirm the presence of the pricing kernel puzzle in the S&P 500 index option market.

Many researchers try to explain the pricing kernel puzzle with several approaches, including investors’ heterogeneous beliefs (Detlefsen et al., Citation2007; Bakshi et al., Citation2010; Ziegler, Citation2007), misspecification of the underlying state space (Chabi-Yo, Citation2012; Chabi-Yo et al., Citation2008; Christoffersen et al., Citation2013), ambiguity aversion (Gollier, Citation2011), investors sentiment (Barone-Adesi et al., Citation2017), etc. In this paper we consider a pricing kernel based on the rank-dependent expected utility of Quiggin (Citation1993), one of the most important generalisations of expected utility theory, with a probability weighting function. We show that the rank-dependent expected utility with the probability weighting function is able to explain the properties of the empirical pricing kernel estimated from the Hong Kong index option market.

In the last decades, there has been a large amount of literature on the estimation of the pricing kernel. A number of earlier papers estimate the pricing kernel using aggregate consumption data (Chapman, Citation1997; Hansen & Jagannathan, Citation1991); problems with imprecise measurement of aggregate consumption can weaken the empirical results of these papers. Thus, many authors have used the option prices to estimate the pricing kernel. This approach avoids the use of aggregate consumption data. Among others, Rosenberg and Engle (Citation2002) emphasise the advantages of option prices over consumption data. Based on the option prices, three types of estimation approaches have been proposed, including the parametric approach (Audrino & Meier, Citation2012; Rosenberg & Engle, Citation2002), nonparametric approach (Aït-Sahalia & Lo, Citation2000; Belomestny et al., Citation2017; Härdle et al., Citation2015; Jackwerth, Citation2000; Song & Xiu, Citation2016) and semiparametric approach (Chernov, Citation2003; Detlefsen et al., Citation2007). In this paper, we follow a semiparametric approach to derive the pricing kernel and construct the implied probability weighting function by estimating the ratio of the risk-neutral and objective densities. The advantage of the semiparametric approach is that it avoids the use of parametric pricing kernel specification, which imposes a strict structure on the kernel so that it too restrictive to account for the dynamics of the risk preference, and bandwidth selection, which influences the shape of the pricing kernel. The semiparametric approach for estimating the pricing kernel is flexible and simple to implement.

Previous econometrics studies are concerned with deriving the empirical pricing kernel by estimating the objective and risk-neutral densities separately, and mainly relying on the discrete-time generalised autoregressive conditional heteroskedasticity (G.A.R.C.H.) model of Bollerslev (Citation1986, Citation1987) or/and affine stochastic volatility model of Heston (Citation1993). Our estimation procedure is based on the objective and risk-neutral densities and these distributions are derived with a consistent parametric stochastic volatility framework of a non-affine G.A.R.C.H. diffusion model. From these densities we construct the corresponding pricing kernel. The G.A.R.C.H. diffusion model is a non-affine stochastic volatility model, which is a weak limit of discrete-time G.A.R.C.H.(1,1) model and has been found to capture the dynamics of the financial time series much better than the affine Heston stochastic volatility model. A number of recent papers have provided strong evidence for the G.A.R.C.H. diffusion model, not only for returns data but also for options data (Christoffersen et al., Citation2010; Kaeck & Alexander, Citation2013; Wu et al., Citation2012, forthcoming). Thus, the model is well suited for our estimation of the pricing kernel, and hence the probability weighting function.

In this paper, the objective and risk-neutral densities are derived by estimating the objective and risk-neutral parameters of the G.A.R.C.H. diffusion model in a way that maintains the internal consistency of the objective and risk-neutral measures. To achieve this goal, we employ a two-step estimation procedure. In the first step, we develop a continuous particle filters-based maximum likelihood estimation method to estimate the objective parameters of the G.A.R.C.H. diffusion model using the Hong Kong Hang Seng Index (H.S.I.) returns over a long time span. The continuous particle filters-based maximum likelihood estimation method is easy to implement and can be used to estimate the nonlinear models that include unobservable state variables efficiently (Christoffersen et al., Citation2010; Duan & Fulop, Citation2009; Malik & Pitt, Citation2011; Pitt et al., Citation2014). In the second step of our estimation, we depart from the usual pure calibration approach and use the H.S.I. option prices to estimate the risk-neutral parameters of the G.A.R.C.H. diffusion model by constraining certain parameters to be consistent with the time-series behaviour of H.S.I. returns. More precisely, the volatility of variance and the leverage parameters should be equal under the objective and risk-neutral measures (Broadie et al., Citation2007). We impose this constraint for both pragmatic and theoretical reasons. First, there is little disagreement in the literature over these parameter values. Second, joint estimation using both option and underlying asset prices is a computationally demanding task. Based on the estimated objective and risk-neutral parameters, the objective and risk-neutral densities can be obtained by inverting the corresponding characteristic functions of the G.A.R.C.H. diffusion model. The pricing kernel is finally obtained as the ratio of the risk-neutral and objective densities.

The probability weighting functions have been studied extensively in the experimental literature over the past decades (Barberis & Huang, Citation2008; Polkovnichenko, Citation2005; Prelec, Citation1998; Shefrin & Statman, Citation2000; Tversky & Kahneman, Citation1992). Empirical papers investigating probability weighting functions include Chabi-Yo and Song (Citation2013), Dierkes (Citation2009, Citation2013), Kliger and Levy (Citation2009), Polkovnichenko and Zhao (Citation2013) and Wang (Citation2017). In these papers, the authors obtain the probability weighting functions mainly from option prices. The main difference between our paper and theirs lies mainly in the use of a consistent parametric framework of the popular non-affine G.A.R.C.H. diffusion model for estimation. To the best of our knowledge, the G.A.R.C.H. diffusion model has not been used to estimate the probability weighting functions. Also, different from previous empirical studies that investigate probability weighting functions mainly focus on the U.S. S&P 500 index option market, this paper aims to investigate the probability weighting functions empirically for the Hong Kong index option market. We estimate the pricing kernel from the H.S.I. options and obtain the empirical probability weighting functions based on the rank-dependent expected utility. We then employ the estimate of the probability weighting functions to examine characteristic of investors’ decision weights in the Hong Kong stock market.

This paper contributes to the existing literature in several ways. First, to ensure consistency between the objective and risk-neutral measures which is crucial for obtaining reasonable results, a two-step estimation procedure for the popular non-affine G.A.R.C.H. diffusion model is developed. Second, the empirical non-monotonic pricing kernel and S-shaped probability weighting functions are obtained from the Hong Kong index option market. Third, it reveals that investors in the Hong Kong stock market underweight small probability events (tail events) and overweight large ones. Finally, the S-shaped probability weighting function with a utility function exhibiting constant relative risk aversion (C.R.R.A.) based on the rand-dependent utility explains the non-monotonicity of the pricing kernel.

The rest of the paper is organised as follows. In Section 2, we describe the theoretical link between the pricing kernel, probability weighting function and risk-neutral and objective densities. In Section 3, we present under the objective measure the G.A.R.C.H. diffusion model and derive the corresponding system under the risk-neutral measure, which serves as the basis for the estimation of the objective and risk-neutral densities. We describe the two-step estimation procedure used to estimate the objective and risk-neutral densities in Section 4. Section 5 discusses the empirical results obtained from the H.S.I. option market, and Section 6 concludes.

2. Pricing kernel and probability weighting function

In this section, we present a theoretical link between the pricing kernel and probability weighting function based on the rank-dependent expected utility. See Polkovnichenko and Zhao (Citation2013) for detailed discussion of the properties of the pricing kernel with probability weighting function.

Consider an index as a proxy for all wealth in the economy, let ST be the future price of the index, and St be the current price of the index. In the absence of arbitrage, there exists one positive random variable Mt,T that prices the index (1) St=EP[Mt,T(ST)ST|Ft](1) where EP[·|Ft] is the expectation with respect to the objective measure P conditional on the information set Ft available at time t, and Mt,T is the projection of the pricing kernel into ST, which has the same pricing implications as the original one (Rosenberg & Engle, Citation2002).

According to the risk-neutral valuation principal, the price St of the index with payoff ST can be equivalently represented as (2) St=EQ[erτST|Ft](2) where EQ[·|Ft] is the expectation with respect to the risk-neutral measure Q conditional on the information set Ft available at time t, r is the risk-free interest rate, and τ=Tt.

Assuming that pt,T(ST) is the objective density (the probability density function under the objective measure P) of ST, and qt,T(ST) is the risk-neutral density (the probability density function under the risk-neutral measure Q) of ST. From EquationEquation (2), we have (3) St=R+erτxqt,T(x)dx=R+erτqt,T(x)pt,T(x)xpt,T(x)dx=EP[erτqt,T(ST)pt,T(ST)ST|Ft](3)

Comparing EquationEquations (1) and Equation(3), we get (4) Mt,T(ST)=erτqt,T(ST)pt,T(ST)(4)

It is obvious from EquationEquation (4) that we can obtain the empirical pricing kernel by estimating the ratio of the risk-neutral and objective densities. Reasonable estimates for the pricing kernel in EquationEquation (4) should be non-increasing in ST, thereby implying a risk-averse investor. However, many studies document a humpy pricing kernel that it might be increasing in some range of the market returns (Bakshi et al., Citation2010; Rosenberg & Engle, Citation2002). Under expected utility theory, it is difficult to understand the reasons for the shape of the pricing kernel. In this paper, we relax the assumption of expected utility theory, and attempt to explain the non-monotonic pricing kernel (or pricing kernel puzzle) under the rank-dependent expected utility of Quiggin (Citation1993), one of the most important extensions of expected utility theory.

Under rank-dependent expected utility, instead of the true cumulative distribution function, Pt,T(ST), the representative investor uses the following distorted one to make an investment decision (5) P˜t,T(ST)=w(Pt,T(ST))(5) where w(·): [ 0,1 ]  [ 0,1 ]  is the probability weighting function, satisfies w(0)=0 and w(1)=1. Moreover, w is nonlinear, differentiable, continuous, and non-decreasing.

Given the distribution function Pt,T, the rank-dependent expected utility is calculated as (6) U=u(ST)dw(Pt,T)=E[u(ST)z(Pt,T)](6) where z(Pt,T)=w(Pt,T).

Following Polkovnichenko and Zhao (Citation2013), under the assumption of complete markets, we have (7) Mt,T(ST)=u(ST)z(Pt,T)(7)

It is obvious from EquationEquation (7) that the pricing kernel Mt,T is linked to the derivative of the probability weighting function z(Pt,T)=w(Pt,T) via the marginal utility u(ST).

Considering the influence of the initial wealth, we use RTST/St as a proxy for the gross return on the total investor wealth and assume that the initial wealth is one, we can rewrite the EquationEquations (4) and Equation(7) as (8) Mt,T(RT)=erτqt,T(RT)pt,T(RT)(8)

and (9) Mt,T(RT)=u(RT)z(Pt,T)(9) where pt,T and qt,T now are the objective and risk-neutral densities of RT, and Pt,T now is the cumulative distribution function of RT and Pt,T=pt,T.

For any given return RT0, assume that Pt,T(RT0)=Pt,T0, according to EquationEquations (8) and Equation(9), we have (10) w(Pt,T0)=0Pt,T0z(Pt,T)dPt,T=c0RT0Mt,T(RT)u(RT)pt,T(RT)dRT=cerτ0RT0qt,T(RT)u(RT)dRT(10) where c=erτ(0qt,T(RT)u(RT)dRT)1 is the normalising constant such that w(1)=1.

3. The model

We adopt the non-affine G.A.R.C.H. diffusion model to characterise the dynamics of the underlying asset prices (H.S.I.), and form the basis for the estimation of the objective and risk-neutral densities.

In the G.A.R.C.H. diffusion model, the dynamics under the objective measure P of the underlying asset price, St, and the associated variance, Vt, are assumed to be given by (11) dSt=μStdt+VtStdW1,tP(11) (12) dVt=κP(θPVt)dt+σVtdW2,tP(12) where μ is the mean of the underlying asset returns, θP is the long-run mean of variance, κP is the mean reversion rate of variance, σ is the volatility of variance, and W1,tP and W2,tP are two standard Brownian motions with Corrt(dW1,tP,dW2,tP)=ρ. The correlation parameter, ρ, is typically found to be negative, which captures the well-known ‘leverage effect’ originated by Black (Citation1976). That is, when asset prices decrease or returns are negative, the firm becomes more risky due to an increase in its debt-equity ratio, leading to an increase in its volatility (see also Christie (Citation1982)).

The G.A.R.C.H. diffusion model in EquationEquations (11) and Equation(12) has attracted a great deal of attention in recent years in the financial econometrics literature. A number of papers have shown that the model can provide realistic volatility dynamics and good option valuation performance (Christoffersen et al., Citation2010; Kaeck & Alexander, Citation2013; Wu et al., Citation2012; Yang & Kanniainen, Citation2017).

Following Chernov and Ghysels (Citation2000), we assume that the dynamics of the underlying asset prices have the same form under the risk-neutral measure Q as under the objective measure P, and the dynamics of (St, Vt) under the risk-neutral measure Q are of the form (13) dSt=rStdt+VtStdW1,tQ(13) (14) dVt=κQ(θQVt)dt+σVtdW2,tQ(14) where r is the risk-free interest rate, W1,tQ and W2,tQ are two standard Brownian motions under the risk-neutral measure with Corrt(dW1,tQ,dW2,tQ)=ρ. As prior studies constrain κQθQ=κPθP, here we specify a more flexible risk-neutral dynamics in that we allow κQθQκPθP, which implies a more flexible variance risk premium (15) λ(Vt)=κPθPκQθQ+(κQκP)Vt(15) that enhances the model flexibility to fit the market option prices.

According to Wu et al. (Citation2012), the characteristic function for the log price XT=logST under the objective measure P is given by (16) ft,TP(Xt,Vt,τ;ϕ)=eC(τ;ϕ)+D(τ;ϕ)Vt+iϕXt(16) where C(τ;ϕ)=iϕμτ12σ2θP[κPθP12ρσiϕ(θP)3/2]×[2log(2d(dg)(1edτ)2d)+(dg)τ]+12σ2(θP)2[16gζ2(d2g2)2log2d(dg)(1edτ)2d4τζ2(d2g2)+4τζ2(dg)2edτ8ζ2(d+g)(1edτ)(d+g)2(dg)[2d(dg)(1edτ)]]D(τ;ϕ)=2ζ(1edτ)2d(dg)(1edτ)

and ζ=12iϕ(iϕ1)d=g24σ2ζθPg=κP32ρσiϕ(θP)1/2

The characteristic function for XT under the risk-neutral measure Q denoted by ft,TQ is analogous to the objective characteristic function ft,TP, which can be obtained by replacing the objective parameters in EquationEquation (16), μ, κP and θP, with the risk-neutral parameters, r, κQ and θQ.

Then the objective and risk-neutral densities for RT=ST/St can be obtained by inverting the characteristic functions ft,TP and ft,TQ, respectively. Specifically, we have (17) pt,T(RT)=1RTp̂t,T(logRT+logSt)(17) (18) qt,T(RT)=1RTq̂t,T(logRT+logSt)(18) where (19) p̂t,T(XT)=12πReiϕXTft,TP(Xt,Vt,τ;ϕ)dϕ(19) (20) q̂t,T(XT)=12πReiϕXTft,TQ(Xt,Vt,τ;ϕ)dϕ(20)

The integrals in EquationEquations (19) and Equation(20) can be easily computed by using some numerical methods.

Based on the theoretical links between the pricing kernel, probability weighting function and objective and risk-neutral densities in EquationEquations (8) and Equation(10), the pricing kernel and probability weighting function can finally be obtained.

4. Estimation methodology

For computing the objective and risk-neutral densities by EquationEquations (17) and Equation(18), we still need to estimate the objective and risk-neutral parameters of the G.A.R.C.H. diffusion model. In this section, we describe how to estimate the objective and risk-neutral parameters of the G.A.R.C.H. diffusion model in a way that maintains the internal consistency of the objective and risk-neutral measures. We employ a two-step estimation procedure.

First, we take the stabilising transformation Xt=logSt,ht=logVt. By Itô’s lemma, we have (21) dXt=(μ12eht)dt+eht/2dW1,tP(21) (22) dht=(κPθPehtκP12σ2)dt+σdW2,tP(22)

In the empirical literature, the above continuous-time model must be discretised to facilitate the parameter estimation. A simple Euler discretisation leads to the following discrete-time stochastic processes (23) yt=(μ12eht)Δt+eht/2Δtεt(23) (24) ht=ht1+(κPθPeht1κP12σ2)Δt+σΔtηt(24) where yt=XtXt1 is the log return of underlying asset, Δt is the time interval,1 and εt=(W1,tPW1,t1P)/Δt,ηt=(W2,tPW2,t1P)/Δt. It can be shown that εt and ηt are independent and identically distributed (i.i.d.) standard normal random variables with Corrt(εt,ηt)=ρ.

It can be seen that EquationEquations (23) and Equation(24) constitute a nonlinear and non-Gaussian state-space model that cannot be estimated using the standard Kalman filter. To overcome this problem, we adopt the continuous particle filters-based maximum likelihood estimation method to estimate the model parameters (objective parameters). The log likelihood of the model is given by (25) logL(ΘP)=logp(y1,,yT|ΘP)=t=0T1logp(yt+1|Ft;ΘP)(25) via prediction decomposition, where ΘP=(μ,κP,θP,σ,ρ) are the objective parameters of the G.A.R.C.H. diffusion model. In the above equation, the predictive density (likelihood) p(yt+1|Ft;ΘP) can be written as (26) p(yt+1|Ft;ΘP)=p(yt+1|ht+1;ΘP)p(ht+1|Ft;ΘP)dht+1(26)

The expression in EquationEquation (26) is crucial for the maximum likelihood estimation via particle filters. In fact, the predictive density p(yt+1|Ft;ΘP) can be approximated by using Monte Carlo method, that is, (27) p̂(yt+1|Ft;ΘP)=1Ni=1Np(yt+1|ht+1i;ΘP)(27) where ht+1i, i=1,,N, sampled from the predictive density p(ht+1|Ft;ΘP) via particle filters, which is a sequential Monte Carlo technique using simulated samples to represent prediction and filtering distributions. Updating from the prediction distribution to the filtering distribution is carried out by using the Bayesian rule. Specifically, we have (28) p(ht+1|Ft+1;ΘP)p(yt+1|ht+1;ΘP)p(ht+1|Ft;ΘP)=p(yt+1|ht+1;ΘP)p(ht+1|ht;ΘP)p(ht|Ft;ΘP)dht(28) where p(ht+1|Ft+1;ΘP) is the filtering density. The principle of Bayesian updating implies that the density of the state conditional on all available information can be constructed by combining a prior with a likelihood, recursive implementation of which forms the basis for particle filtering. The particle filtering algorithm thus propagates and updates the samples or ‘particles’ via EquationEquation (28), which can be approximated by (29) p̂(ht+1|Ft+1;ΘP)p(yt+1|ht+1;ΘP)1Nt=1Np(ht+1|hti;ΘP)(29) where hti, i=1,,N, are the equally weighted samples or ’particles’ from the density p(ht|Ft;ΘP).

In order to sample from the density of EquationEquation (29), we use the sampling importance resampling (S.I.R.) algorithm of Gordon et al. (Citation1993). However, the resampling step in the standard S.I.R. algorithm creates discontinuity in the likelihood function that is not conducive to numerical optimisation and statistical inference, and makes estimation by maximum likelihood problematic. To overcome this problem, we adopt the continuous S.I.R. (C.S.I.R.) scheme proposed by Malik and Pitt (Citation2011) to compute the likelihood function and conduct the maximum likelihood estimation.

The C.S.I.R. algorithm is outlined below:

Given samples from htip(ht|Ft;ΘP),i=1,,N.

  1. Sample h˜t+1ip(ht+1|hti;ΘP),i=1,,N, using EquationEquation (24).

  2. Calculate normalised weights, (30) πt+1i=ωt+1ii=1Nωt+1i,i=1,,N(30) where ωt+1i=p(yt+1|h˜t+1i;ΘP)={2πexp(h˜t+1i)Δt}1/2×exp{12[yt+1(μ12exp(h˜t+1i))Δt]2/(exp(h˜t+1i)Δt)}

  3. Resample from a continuous empirical distribution implied by the set of {(h˜t+1i,πt+1i)|i{1,,N}} to obtain the equally weighted filtering samples ht+1ip(ht+1|Ft+1;Θ),i=1,,N (For details see Appendix A).

Based on the above C.S.I.R. algorithm, the prediction likelihood may be estimated as (31) p̂(yt+1|Ft;ΘP)=1Ni=1Np(yt+1|h˜t+1i;ΘP)=1Ni=1Nωt+1i(31) where ωt+1i,i=1,,N, are simply the unnormalised weights calculated in Step 2 of C.S.I.R. algorithm. The resulting likelihood function in EquationEquation (31) is a smooth function of parameters. Then, the log likelihood can be estimated by (32) logL̂(ΘP)=t=0T1logp̂(yt+1|Ft;ΘP)=t=0T1log(1Ni=1Nωt+1i)(32)

Note that the log likelihood in EquationEquation (32) will not be unbiased. According to Malik and Pitt (Citation2011), we may correct the original (biased) log likelihood as unbiased one by (33) logL˜(ΘP)=t=0T1(logμωt+1+12σωt+12Nμωt+12)(33) where μωt+1=1Ni=1Nωt+1i,σωt+12=1N1i=1N(ωt+1iμωt+1)2

Given the log likelihood approximation logL˜, the maximum likelihood estimates of the model parameters (objective parameters) can be obtained by (34) Θ̂P=argmaxΘPlogL˜(ΘP)(34)

Based on the above estimated parameters, the spot variances can be obtained using the C.S.I.R. filtering algorithm.

Next, we turn to the estimation of risk-neutral parameters ΘQ=(κQ,θQ,σ,ρ). We impose consistency between the objective and risk-neutral measures in estimation. Specifically, we let the volatility of variance, σ, and the leverage parameters, ρ, be equal under the objective and risk-neutral measures. As the parameters σ and ρ have been estimated in the first step of our estimation, there are only two remaining risk-neutral parameters (κQ and θQ) need to be estimated in the second step of estimation. We optimise over κQ and θQ to fit the observed option prices. Specifically, to obtain the estimates of the parameters κQ and θQ, we minimise squared differences of model and market option prices, that is, (35) (κ̂Q,θ̂Q)=argminj=1M(CjCj(ΘQ))2(35) where M is the number of option contracts, Cj is the market option price of contract j, and Cj(ΘQ) is the corresponding model option price which can be calculated by using the fast Fourier transform approach based on the risk-neutral characteristic function of the G.A.R.C.H. diffusion model (Carr & Madan, Citation1999).

5. Empirical results

In contrast to many previous studies that have focused mainly on the S&P 500 index option market, we investigate in this paper the empirical pricing kernel and probability weighting functions by focusing on the H.S.I. option market. The H.S.I. serves as an approximation to the Hong Kong economy, and it can be used as a proxy for market portfolio.

5.1. The data

We use daily H.S.I. returns to estimate the objective G.A.R.C.H. diffusion model and to compute the objective density. The sample period is from 4 January 2000 to 4 October 2017, which is chosen to achieve a long sample. The sample size is 4379 for the index returns. The time-series and Q–Q plots of the H.S.I. returns are presented in . We can observe from that the H.S.I. returns exhibit time-varying volatility and volatility clustering during the sample period. Summary statistics for the H.S.I. returns are shown in . It can be seen that the H.S.I. returns are skewed and leptokurtic. Jarque–Bera statistics suggests that the assumption of normality is rejected for the H.S.I. return series, which also can be confirmed from the Q–Q plot of the index returns in . The results suggest the stochastic volatility model, such as the G.A.R.C.H. diffusion model, may be appropriate for modelling the dynamics of the H.S.I.

Table 1. Summary statistics of the H.S.I. returns.

To estimate the risk-neutral parameters of the G.A.R.C.H. diffusion model, we use the H.S.I. option data which is presented in . The option data are selected as the most actively traded option contracts with maturity about one month on 4 October 2017. Finally, we use the annualised the 1-month Hong Kong Interbank Offer Rate as a proxy for the risk-free interest rate. All of the data are obtained from the Wind Database of China.

Table 2. Selected H.S.I. option data on 4 October 2017.

5.2. Estimation results

Based upon the data on the H.S.I. returns, the objective parameters of the G.A.R.C.H. diffusion model can be estimated by adopting the C.S.I.R.-based maximum likelihood estimation method described in Section 4. reports the estimation results. Our results show that, under the objective measure, the long-run mean of the variance is θP=0.0522, with a mean-reversion speed of κP=2.3044. The estimate of the ‘leverage effect’ parameter ρ is significantly negative, indicating that the return and the variance processes are negatively correlated during the sample period, a well-known empirical fact.

Table 3. Estimation results.

Based on the estimated (objective) parameters of the G.A.R.C.H. diffusion model, the spot variances can be estimated via the C.S.I.R. filtering algorithm. shows the filtered variances. Specially, the spot variance on 4 October 2017 is 0.0185.

Using the H.S.I. option data in , the risk-neutral parameters of the G.A.R.C.H. diffusion model can be estimated. The estimates are reported in . It can be seen from the table that, under the risk-neutral measure, the long-run mean of the variance is θQ=0.0180, with a mean-reversion speed of κQ=0.1990, which both are obviously lower than the corresponding estimates under the objective measure. The relative root mean square error (R.R.M.S.E.) for the G.A.R.C.H. diffusion model is 0.3106, which is obviously lower than that of Black–Scholes model in which the R.R.M.S.E. for the Black–Scholes model is 0.5847. Thus, the improvement in R.R.M.S.E. offered by the G.A.R.C.H. diffusion model over the classical Black–Scholes model is 46.88%.

With the estimated objective and risk-neutral parameters of the G.A.R.C.H. diffusion model, we can use EquationEquations (17) and Equation(18) to compute the objective and risk-neutral densities of RTST/St, respectively. We plot the estimates of objective and risk-neutral densities for one-month time horizon τ=Tt=1/12 in . It can be seen that there are obvious discrepancies in the estimation results of the objective and risk-neutral densities.

The estimates of objective and risk-neutral densities allow us to estimate the empirical pricing kernel by using EquationEquation (4). displays our estimate of the pricing kernel. It can be seen from the figure that the estimated pricing kernel is not monotonically decreasing, but exhibits a hump around the gross return RT=ST/St=1. This is not in accordance with the classical economic theory and referred to as the ‘pricing kernel puzzle’.

Using the estimated objective and risk-neutral densities, we can construct the probability weighting function for given utility function. We use the standard C.R.R.A. utility functions u(R)=R1γ1γ for γ = 0 (linear utility), γ = 1 (logarithmic utility), and γ = 2 (power utility). We present estimates of the probability weighting function w(Pt,T) for one-month time horizon in . It can be seen from the figure that the probability weighting function estimates have the S-shaped forms, implying that investors in the Hong Kong stock market underweight small probability events (tail events) and overweight large ones. The probability weighting functions obtained from Hong Kong index option market are different from those obtained from the U.S. index option market, which typically have the inverse-S shape (see Polkovnichenko & Zhao, Citation2013). The results call for further efforts to integrate the models that can account for S-shaped probability weighting in portfolio theory, asset pricing and risk management.

6. Conclusion

The study of the probability weighting function has been the focus of the financial economics literature. The probability weighting function has been extensively employed to model investor behaviour in financial markets. It is informative about the tail risk or crash risk of return dynamics and can provide explanation for tail risk premium. It can also be used to construct investor sentiment toward tail events and to construct probability weighting measures of tail events to capture investors’ decision weights toward tail events. In addition, the probability weighting function can be used to understand asset price dynamics and equity premium puzzle.

In this paper, we semi-parametrically estimate the pricing kernel from the Hong Kong index option market and obtain the empirical probability weighting functions based on the rank-dependent expected utility, using the non-affine G.A.R.C.H. diffusion model in a way that maintains the internal consistency of the objective and risk-neutral measures. Our results show that the empirical pricing kernel estimated from the Hong Kong index option market is non-monotonic, deviating from expected utility theory, and that the estimated probability weighting functions are S-shaped, which implies that investors underweight small probability events (tail events) and overweight large ones. The S-shaped probability weighting function with a utility function exhibiting C.R.R.A. based on the rand-dependent utility can explain the non-monotonicity of the pricing kernel. The results point to theoretical models with S-shaped probability weighting functions as a promising direction to understand asset price dynamics and further to explore implications for many economic issues, such as portfolio choices, asset pricing, and risk management.

Acknowledgements

We would like to thank the editor and two anonymous referees for their insightful and helpful comments and suggestions that greatly improved the paper.

Disclosure statement

No potential conflict of interest was reported by the author.

Additional information

Funding

This study was supported by the National Natural Science Foundation of China under Grant Nos. 71501001, 71401033 and 71431008; MOE (Ministry of Education in China) Project of Humanities and Social Sciences under Grant No. 14YJC790133; China Postdoctoral Science Foundation under Grant No. 2015M580416; Foreign Visiting Scholar Program for Excellent Youth Scholars in Universities under Grant No. gxfx2017031; Southern Jiangsu Capital Markets Research Center under Grant No. 2017ZSJD020.

Notes

1 Assuming that there are 250 trading days per year, then the time interval Δt for one trading day is 1/250 year, and in this case the discretisation bias of Euler scheme is expected to be negligible.

References

  • Aït-Sahalia, Y., & Lo, A. W. (2000). Nonparametric risk management and implied risk aversion. Journal of Econometrics, 94, 9–51.
  • Audrino, F., & Meier, P. (2012). Empirical pricing kernel estimation using a functional gradient descent algorithm based on splines. Working Paper, University of St. Gallen.
  • Bakshi, G., Madan, D., & Panayotov, G. (2010). Returns of claims on the upside and the viability of U-shaped pricing kernels. Journal of Financial Economics, 97(1), 130–154. doi: 10.1016/j.jfineco.2010.03.009
  • Barberis, N., & Huang, M. (2008). Stocks as lotteries: The implications of probability weighting for security prices. American Economic Review, 98(5), 2066–2100. doi: 10.1257/aer.98.5.2066
  • Barone-Adesi, G., Mancini, L., & Shefrin, H. (2017). Estimating sentiment, risk aversion, and time preference from behavioral pricing kernel theory. Working Paper, University of Lugano.
  • Beare, B. K., & Schmidt, L. D. (2016). An empirical test of pricing kernel monotonicity. Journal of Applied Econometrics, 31(2), 338–356. doi: 10.1002/jae.2422
  • Belomestny, D., Härdle, W. K., & Krymova, E. (2017). Sieve estimation of the minimal entropy martingale marginal density with application to pricing kernel estimation. International Journal of Theoretical and Applied Finance, 20(6), 1–21. doi: 10.1142/S0219024917500418
  • Black, F. (1976). Studies of stock price volatility changes. In Proceedings of the 1976 meeting of business and economic statistics section. American Statistical Association, pp. 177–181.
  • Bollerslev, T. (1986). Genearlized autoregressive conditional heteroskedasticity. Journal of Econometrics, 31(3), 307–327. doi: 10.1016/0304-4076(86)90063-1
  • Bollerslev, T. (1987). A conditionally heteroskedastic time series model for speculative prices and rates of return. The Review of Economics and Statistics, 69(3), 542–547. doi: 10.2307/1925546
  • Broadie, M., Chernov, M., & Johannes, M. (2007). Model specification and risk premia: Evidence from futures options. The Journal of Finance, 62(3), 1453–1490.
  • Carr, P., & Madan, D. (1999). Option valuation using the fast Fourier transform. The Journal of Computational Finance, 2(4), 61–73. doi: 10.21314/JCF.1999.043
  • Chabi-Yo, F. (2012). Pricing kernels with stochastic skewness and volatility risk. Management Science, 58(3), 624–640. doi: 10.1287/mnsc.1110.1424
  • Chabi-Yo, F., Garcia, R., & Renault, E. (2008). State dependence can explain the risk aversion puzzle. Review of Financial Studies, 21(2), 973–1011. doi: 10.1093/rfs/hhm070
  • Chabi-Yo, F., & Song, Z. G. (2013). Recovering the probability weights of tail events with volatility risk from option prices. Working Paper, Ohio State University.
  • Chapman, D. (1997). Approximating the asset pricing kernel. Journal of Finance, 52(4), 1383–1410. doi: 10.1111/j.1540-6261.1997.tb01114.x
  • Chernov, M. (2003). Empirical reverse engineering of the pricing kernel. Journal of Econometrics, 116(1-2), 329–364. doi: 10.1016/S0304-4076(03)00111-8
  • Chernov, M., & Ghysels, E. (2000). A study towards a unified approach to the joint estimation of objective and risk neutral measures for the purpose of options valuation. Journal of Financial Economics, 56(3), 407–458. doi: 10.1016/S0304-405X(00)00046-5
  • Christie, A. A. (1982). The stochastic behavior of common stock variances. Journal of Financial Economics, 10(4), 407–432. doi: 10.1016/0304-405X(82)90018-6
  • Christoffersen, P., Heston, S., & Jacobs, K. (2013). Capturing option anomalies with a variance-dependent pricing kernel. Review of Financial Studies, 26(8), 1963–2006. doi: 10.1093/rfs/hht033
  • Christoffersen, P., Jacobs, K., & Mimouni, K. (2010). Volatility dynamics for the S&P 500: Evidence from realized volatility, daily returns, and option prices. The Review of Financial Studies, 23(8), 3141–3189.
  • Cochrane, J. H. (2001). Asset pricing. New Jersey: Princeton University Press.
  • Cuesdeanu, H., & Jackwerth, J. C. (2016). The pricing kernel puzzle in forward looking data. Working Paper, University of Konstanz.
  • Detlefsen, K., Härdle, W., & Moro, R. (2007). Empirical pricing kernels and investor preferences. Working Paper, Universite de Provence.
  • Dierkes, M. (2009). Option-implied risk attitude under rank-dependent utility. Working Paper, University of Münster.
  • Dierkes, M. (2013). Probability weighting and asset prices. Working Paper, University of Münster.
  • Duan, J. C., & Fulop, A. (2009). Estimating the structural credit risk model when equity prices are contaminated by trading noises. Journal of Econometrics, 150(2), 288–296. doi: 10.1016/j.jeconom.2008.12.003
  • Figlewski, S., & Malik, M. F. (2014). Options on leveraged ETFs: A window on investor heterogeneity. Working Paper, New York University.
  • Gollier, C. (2011). Portfolio choices and asset prices: The comparative statics of ambiguity aversion. The Review of Economic Studies, 78(4), 1329–1344. doi: 10.1093/restud/rdr013
  • Golubev, Y., Härdle, W., & Timofeev, R. (2014). Testing monotonicity of pricing kernels. AStA Advances in Statistical Analysis, 98(4), 305–326. doi: 10.1007/s10182-014-0225-5
  • Gordon, N. J., Salmond, D. J., & Smith, A. F. (1993). Novel approach to nonlinear/non-Gaussian Bayesian state estimation. IEE Proceedings F Radar and Signal Processing, 140(2), 107–113. doi: 10.1049/ip-f-2.1993.0015
  • Hansen, L. P., & Jagannathan, R. (1991). Implications of security market data for models of dynamic economies. Journal of Political Economy, 99, (2), 225–262. doi: 10.1086/261749
  • Härdle, W. K., Okhrin, Y., & Wang, W. (2015). Uniform confidence bands for pricing kernels. Journal of Financial Econometrics, 13(2), 376–413. doi: 10.1093/jjfinec/nbu002
  • Heston, S. L. (1993). A closed-form solution for options with stochastic volatility with applications to bond and currency options. Review of Financial Studies, 6(2), 327–343. doi: 10.1093/rfs/6.2.327
  • Jackwerth, J. (2000). Recovering risk aversion from option prices and realized returns. Review of Financial Studies, 13(2), 433–451. doi: 10.1093/rfs/13.2.433
  • Kaeck, A., & Alexander, C. (2013). Stochastic volatility jump-diffusions for European equity index dynamics. European Financial Management, 19(3), 470–496.
  • Kliger, D., & Levy, O. (2009). Theories of choice under risk: Insights from financial markets. Journal of Economic Behavior and Organization, 71(2), 330–346. doi: 10.1016/j.jebo.2009.01.012
  • Malik, S., & Pitt, M. K. (2011). Particle filters for continuous likelihood evaluation and maximization. Journal of Econometrics, 165(2), 190–209. doi: 10.1016/j.jeconom.2011.07.006
  • Pitt, M. K., Malik, S., & Doucet, A. (2014). Simulated likelihood inference for stochastic volatility models using continuous particle filtering. Annals of the Institute of Statistical Mathematics, 66(3), 527–552. doi: 10.1007/s10463-014-0456-y
  • Polkovnichenko, V. (2005). Household portfolio diversification: A case for rank-dependent preferences. Review of Financial Studies, 18(4), 1467–1502. doi: 10.1093/rfs/hhi033
  • Polkovnichenko, V., & Zhao, F. (2013). Probability weighting functions implied in options prices. Journal of Financial Economics, 107(3), 580–609. doi: 10.1016/j.jfineco.2012.09.008
  • Prelec, D. (1998). The probability weighting function. Econometrica, 66(3), 497–527. doi: 10.2307/2998573
  • Quiggin, J. (1993). Generalized expected utility theory: The rank-dependent model. Norwell, MA, Dordrecht, the Netherlands: Kluwer Academic Publishers.
  • Rosenberg, J., & Engle, R. F. (2002). Empirical pricing kernels. Journal of Financial Economics, 64(3), 341–372. doi: 10.1016/S0304-405X(02)00128-9
  • Shefrin, H., & Statman, M. (2000). Behavioral portfolio theory. Journal of Financial and Quantitative Analysis, 35(2), 127–151. doi: 10.2307/2676187
  • Song, Z. G., & Xiu, D. C. (2016). A tale of two option markets: Pricing kernels and volatility risk. Journal of Econometrics, 190(1), 176–196.
  • Tversky, A., & Kahneman, D. (1992). Advances in prospect theory: Cumulative representation of uncertainty. Journal of Risk and Uncertainty, 5(4), 297–323. doi: 10.1007/BF00122574
  • Wang, B. (2017). Probability weighting and asset prices: Evidence from mergers and acquisitions. Working Paper, Fordham University.
  • Wu, X. Y., Ma, C. Q., & Wang, S. Y. (2012). Warrant pricing under GARCH diffusion model. Economic Modelling, 29(6), 2237–2244. doi: 10.1016/j.econmod.2012.06.020
  • Wu, X. Y., Zhou, H. L., & Wang, S. Y. (Forthcoming). Estimation of market prices of risks in G.A.R.C.H. diffusion model. Economic Research-Ekonomska Istraživanja, doi: 10.1080/1331677X.2017.1421989
  • Yang, H., & Kanniainen, J. (2017). Jump and volatility dynamics for the S&P 500: Evidence for infinite-activity jumps with non-affine volatility dynamics from stock and option markets. Review of Finance, 21(2), 811–844. doi: 10.1093/rof/rfw001
  • Ziegler, A. (2007). Why does implied risk aversion smile? Review of Financial Studies, 20(3), 859–904. doi: 10.1093/rfs/hhl023

Appendix A. Continuous stratified resampling

In the standard S.I.R. algorithm, the resampling is based on the discontinuous empirical distribution function F̂(h)=i=1NπiI(hhi)

where hi, i=1,,N, are sorted in ascending order, and i=1Nπi=1, I(z) is an indicator function, satisfying I(z)={1,z>00,z0

To produce samples of the state variables in a continuous way, Malik and Pitt (Citation2011) approximate F̂(h) by continuous empirical distribution function F˜(h), which is given by F˜(h)=ω0I(h1<h)+ωNI(hN<h)+i=1N1ωiGi(hhihi+1hi)

where ω0=π1/2,ωN=πN/2,ωi=(πi+1+πi)/2,i=1,2,,N1, and Gi(z) is a monotonically non-decreasing distribution function on [0,1], such that Gi(z)={0,z<0z,0z11,z>1

The above defined continuous distribution function F˜(h) is easy to invert and thus sampling from it becomes very simple and quick. It can be shown that as N,F˜(h)F̂(h)F(h), with F(h) being the true distribution function. In practice the difference between F˜(h) and F̂(h) becomes negligible for moderate N, typically we choose N = 500.

Continuous stratified resampling algorithm

First, we need to generate a single uniform uUID(0,1), and propagate sorted uniforms given by uj=j1N+uN,j=1,2,,N

Then, we sample the index corresponding to the region which are sorted as r1,r2,rN, and also produce a new set of uniforms, u1*,u2*,uN*, according to the algorithm given below.

set s=0,j=1;

for i=0:N

s=s+ωi;

while (ujs     jN)

rj=i;

uj*=(uj(sωi))/ωi;

j=j+1;

end

end

For the selected regions r1,r2,rN, we set the new set of uniforms u1*,u2*,uN* as hj*={h1,rj=0hN,rj=N(hrj+1hrj)×uj*+hrj,others

This produces as sample h1*,h2*,,hN* from continuous distribution function F˜(h).

Figure 1. Time-series and Q–Q plots of the H.S.I. returns. Source: Own calculation.

Figure 1. Time-series and Q–Q plots of the H.S.I. returns. Source: Own calculation.

Figure 2. Filtered variances of the H.S.I. returns. Source: Own calculation.

Figure 2. Filtered variances of the H.S.I. returns. Source: Own calculation.

Figure 3. Estimates of objective and risk-neutral densities (O.D. and R.N.D.). Source: Own calculation.

Figure 3. Estimates of objective and risk-neutral densities (O.D. and R.N.D.). Source: Own calculation.

Figure 4. Empirical pricing kernel. Source: Own calculation.

Figure 4. Empirical pricing kernel. Source: Own calculation.

Figure 5. Estimates of probability weighting functions. Source: Own calculation.

Figure 5. Estimates of probability weighting functions. Source: Own calculation.