1,508
Views
3
CrossRef citations to date
0
Altmetric
Articles

Can a Machine Correct Option Pricing Models?

, , &
 

Abstract

We introduce a novel two-step approach to predict implied volatility surfaces. Given any fitted parametric option pricing model, we train a feedforward neural network on the model-implied pricing errors to correct for mispricing and boost performance. Using a large dataset of S&P 500 options, we test our nonparametric correction on several parametric models ranging from ad-hoc Black–Scholes to structural stochastic volatility models and demonstrate the boosted performance for each model. Out-of-sample prediction exercises in the cross-section and in the option panel show that machine-corrected models always outperform their respective original ones, often by a large extent. Our method is relatively indiscriminate, bringing pricing errors down to a similar magnitude regardless of the misspecification of the original parametric model. Even so, correcting models that are less misspecified usually leads to additional improvements in performance and also outperforms a neural network fitted directly to the implied volatility surface.

Disclosure Statement

The authors report there are no competing interests to declare.

t=1,,Tσ(t,m,τ)σp(t,m,τ)σBS(t,m,τ)=a0ϵp(t,m,τ)=σ(t,m,τ)σp(t,m,τ)t=1,,Tj=1,,nt{σ(t,mj,t,τj,t), j=1,,nt}t=1Tσ̂p(t,mj,t,τj,t)ϵ̂p(t,mj,t,τj,t)=σ(t,mj,t,τj,t)σ̂p(t,mj,t,τj,t)1Tt=1T1ntj=1nt[ϵ̂p(t,mj,t,τj,t)f(mj,t,τj,t)]2,σ̂p(t,mj,t,τj,t)+f̂(mj,t,τj,t)yt=(y1,t,,yq,t)1Tt=1T1ntj=1nt[ϵ̂p(t,mj,t,τj,t)f(yt,mj,t,τj,t)]2,σ̂p(t,mj,t,τj,t)+f̂(yt,mj,t,τj,t)ytf(.)xi,t=(mi,t,τi,t),f(xi,t)xi,t=(yt,mi,t,τi,t),f(xi,t)f:R(q+2)Rd0=q+2Acknowledgments

We would like to thank the Associate Editor, two anonymous referees and conference participants at Econometric and Big Data Analyses of Global Economy, Financial Markets and Economic Policies for useful comments and suggestions.

Notes

1 The implied volatility surface represents the implied volatility of each option as a function of its moneyness and time to maturity. The implied volatility of a given option is the volatility parameter that makes the Black and Scholes (Citation1973) formula deliver the observed option price.

2 See Glad (Citation1998), Fan and Ullah (Citation1999), and Fan, Wu, and Feng (Citation2009) for the advantages of a parametrically guided nonparametric approach.

3 More recently, Ait-Sahalia, Li, and Li (Citation2021) reverse this logic by linking observed shape characteristics of the implied volatility surface to the coefficients of stochastic volatility models, while Bandi, Fusari, and Renò (Citation2021) expand the characteristic function of the underlying asset process to price short-maturity options and study the relation between equity characteristics and sources of structural risk.

4 Essentially, the formula is obtained by noticing that the option can be dynamically hedged by buying and/or selling the underlying security financing with a risk-free bond.

5 There are several iterative methods available to solve for CBS1, including the Newton–Raphson method, the bisection method and the Brent method.

6 Dumas, Fleming, and Whaley (Citation1998) choose a quadratic specification due to the parabolic shape of implied volatilities in the cross-section and to favor a parsimonious model.

7 Specifying the regression as a function of the strike price instead of moneyness leads to essentially the same results.

8 Since our main goal is prediction, we do not impose the Feller condition in the estimation.

9 Provided that we allow f(.) to be composed by a constant plus other functions of moneyness and maturity, which is generally the case.

10 Note that, for l = 1, the previous layer l – 1 is the input layer 0, where z0=xi,t and d0=2.

11 In this sense, “deep learning” refers to the use of a deep neural network, where “deep” usually means that L2.

12 We use the implementation of this method available in Matlab, with its default values.

13 See, for instance, Hutchinson, Lo, and Poggio (Citation1994), Rubinstein (Citation1994), Bakshi, Cao, and Chen (Citation1997), Ait-Sahalia and Lo (Citation1998), Dumas, Fleming, and Whaley (Citation1998), Duffie, Pan, and Singleton (Citation2000), Garcia and Gençay (Citation2000), Ait-Sahalia and Duarte (Citation2003), Fan and Mancini (Citation2009), Andersen, Fusari, and Todorov (Citation2015), Carr and Wu (Citation2016), and Ait-Sahalia, Li, and Li (Citation2021).

14 More specifically, observations with zero volume, with price lower than 1/8 or violating the usual no-arbitrage conditions are dropped.

15 For PM settled options, the time to expiration is the number of days between the trade date and the expiration date, while for AM settled options, it is the number of days between the dates less one.

16 For the rare case that there is no such pair of ATM call and put options for a given time to maturity, we use the dividend yield provided by OptionMetrics.

17 See, for instance, Ait-Sahalia and Lo (Citation1998), Fan and Mancini (Citation2009), and Andersen, Fusari, and Todorov (Citation2015).

18 See, among others, Bakshi, Cao, and Chen (Citation1997), Dumas, Fleming, and Whaley (Citation1998), and Fan and Mancini (Citation2009).

19 For instance, Gonçalves and Guidolin (Citation2006) propose a vector autoregressive approach to model the dynamics of the parameters of the AHBS model and predict their values in the future.

20 In unreported results available upon request, we also test Heston using the model-implied expectation at t of future variance at t + h, Et[Vt+h]=v¯̂+eκ̂h/252(V̂tv¯̂), instead of V̂t. We find that the predictive performance improves only marginally, without changing the qualitative nature of our results.

21 As a sanity check, we also conducted our tests with a standalone neural network and the results are identical to those of correcting the BS model. To avoid redundancy and save space, we omit these results.

22 Results for other network architectures and 5- and 21-day ahead predictions are qualitatively similar.

23 We thank an anonymous referee for this suggestion.

24 The in-sample training set consists of 604,749 options over 503 days, while the out-of-sample testing set consists of 662,442 options over 375 days.

26 That is, we use neural networks with three hidden layers with 32, 16, and 8 neurons, respectively.

27 The BS model is estimated as the constant implied volatility that minimizes the IVMSE in the in-sample option panel. The predictions of the BS model out-of-sample for any option are simply the estimated constant implied volatility.

28 More specifically, the level of the implied volatility surface is the average implied volatility for short-term ATM options, the term structure is the difference between the average implied volatility of long- and short-term ATM options, the skew is defined as the difference between the average implied volatility of short-term OTM put and OTM call options, and the skew term structure is the difference between the long- and short-term skew, where the long-term skew is defined analogously to the short-term skew.

29 More recently, Ait-Sahalia, Li, and Li (Citation2021) propose stochastic volatility models designed to fit directly the shape characteristics of the implied volatility surface.

30 The same comment would apply to a standalone neural network, which is equivalent to correcting the BS model. This further highlights the usefulness of our model-guided approach.

31 See, for instance, the estimation of the Heston model in the option panel described in Section 2.3, where we estimate the structural parameters ξ=(v¯,κ,σv,ρ) which are fixed over time.

Additional information

Funding

Fan’s research was supported by NSFC grant No.71991470/71991471.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.