Short-time near-the-money skew in rough fractional volatility models

Abstract

We consider rough stochastic volatility models where the driving noise of volatility has fractional scaling, in the ‘rough’ regime of Hurst parameter $H < 1 / 2$ . This regime recently attracted a lot of attention both from the statistical and option pricing point of view. With focus on the latter, we sharpen the large deviation results of Forde-Zhang [Asymptotics for rough stochastic volatility models. SIAM J. Financ. Math., 2017, 8(1), 114–145] in a way that allows us to zoom-in around the money while maintaining full analytical tractability. More precisely, this amounts to proving higher order moderate deviation estimates, only recently introduced in the option pricing context. This in turn allows us to push the applicability range of known at-the-money skew approximation formulae from CLT type log-moneyness deviations of order $t^{1 / 2}$ (works of Alòs, León & Vives and Fukasawa) to the wider moderate deviations regime.

1. Introduction

Since the groundbreaking work of Gatheral et al. (2014), the past two years have brought about a gradual shift in volatility modeling, leading away from classical diffusive stochastic volatility models towards so-called rough volatility models. The term was coined in Gatheral et al. (2014) and Bayer et al. (2016), and it essentially describes a family of (continuous-path) stochastic volatility models where the driving noise of the volatility process has Hölder regularity lower than Brownian motion, typically achieved by modeling the fundamental noise innovations of the volatility process as a fractional Brownian motion with Hurst exponent (and hence Hölder regularity) $H < 1 / 2$. Here, we would also like to mention pioneering work on asymptotics for rough volatility models in Alòs et al. (2007) and Fukasawa (2011). A major appeal of such rough volatility models lies in the fact that they effectively capture several stylized facts of financial markets both from a statistical (Gatheral et al. 2014; Bennedsen et al. 2016) and an option-pricing point of view (Bayer et al. 2016). In particular, with regard to the latter point of view, a widely observed empirical phenomenon in equity markets is the 'steepness of the smile on the short end': as time to maturity becomes small, the empirical implied volatility skew follows a power law with negative exponent, and thus becomes arbitrarily large near zero. While standard stochastic volatility models with continuous paths struggle to capture this phenomenon, predicting instead a constant at-the-money implied volatility skew on the short end (Gatheral 2011), models in the fractional stochastic volatility family (and more specifically so-called rough volatility models) constitute a class well-tailored to fit empirical implied volatilities for short-dated options.

Typically, the popularity of asset pricing models hinges on the availability of efficient numerical pricing methods. In the case of diffusions, these include Monte Carlo estimators, PDE discretization schemes, asymptotic expansions and transform methods. With fractional Brownian motion being the prime example of a process beyond the semimartingale framework, most currently prevalent option pricing methods (particularly the ones assuming semimartingality or Markovianity) may not easily carry over to the rough setting. In fact, the memory property (aka non-Markovianity) of fractional Brownian motion rules out PDE methods, heat kernel methods and all related methods involving a Feynman-Kac-type Ansatz. Previous work has thus focused on finding efficient Monte Carlo simulation schemes (Bayer et al. 2016; Bennedsen et al. 2017; Bayer et al. 2017) or, in the special case of the Rough Heston model, on an explicit formula for the characteristic function of the log-price (see El Euch and Rosenbaum 2016), which makes pricing in this particular model amenable to Fourier-based methods. In our work, we rely on small-maturity approximations of option prices. This is a well-studied topic for which we mention (with no claim to completeness) a number of works, either based on large deviations or on a central limit type scaling regime, that inspired this work: Alòs et al. (2007), Fukasawa (2011), Deuschel et al. (2014a), Deuschel et al. (2014b) and Fukasawa (2017), also Medvedev and Scaillet (2003, 2007), Osajima (2007, 2015), Guennoun et al. (2014), Mijatović and Tankov (2016) and especially Forde and Zhang (2017).

Rather recently, Friz et al. (2018) introduced another regime called moderately-out-of-the-money (MOTM), which, in a sense, navigates between the two regimes mentioned above by rescaling the strike with respect to the time to maturity. This approach has various advantages. On the one hand, it reflects the market reality that as time to maturity approaches zero, strikes with acceptable bid-ask spreads tend to move closer to the money (see Friz et al. 2018 for more details). On the other hand, it allows us to zoom in on the term structure of implied volatility around the money at a high resolution scale. To be more specific, our paper adds to the existing literature in two ways. First, we obtain a generalization of the Osajima energy expansion (Osajima 2015) to a non-Markovian case, and using the new expansion, we extend the analysis of Friz et al. (2018) to the case where the volatility is driven by a rough $(H < 1 / 2)$ fractional Brownian motion. Indeed, Laplace approximation methods on Wiener space in the spirit of Azencott (1982, 1985), Ben Arous (1988) and Bismut (1984) can be adapted to the present context, so that our analysis builds upon this framework in a fractional setting. Unlike many other works in this field, we do not rely on density expansions. Second, using a version of the 'rough Bergomi model' (Bayer et al. 2016), we demonstrate numerically that our implied volatility asymptotics capture very well the geometry of the term structure of implied volatility over a wide array of maturities, extending up to a year.

The paper is organized as follows. In Section 2 we set the scene, describing the class of models included in our framework ((1) and (2)) and recalling some known results ((4) and (7)), which are the starting point of our analysis. Most importantly, we argue that for small-time considerations it suffices to restrict our attention to a class of stochastic volatility models of the form (3) with a volatility process driven by a Gaussian Volterra process such as in (2). We formulate general assumptions on the Volterra kernel (Assumptions 2.1 and 2.5) and on the function σ in (3) (Assumption 2.4) under which our results are valid. In Section 3 we gather our main results, concerning a higher order expansion of the energy (Theorem 3.1), and a general expansion formula for the corresponding call prices. We derive the classical Black-Scholes expansion for the call price, using the latter result. In addition, in Section 3 we formulate moderate deviation expansions, which allow us to derive the corresponding asymptotic formulae for implied volatilities and implied volatility skews. Finally, Section 4 displays our simulation results. Sections 5–7 are devoted to proofs of the energy expansion, the price expansion and the moderate deviations expansion, respectively. In the appendix, we have collected some auxiliary lemmas, which are used in different sections.

2. Exposition and assumptions

We consider a rough stochastic volatility model, normalized to $r = 0$ and $S_{0} = 1$, of the form suggested by Forde and Zhang (2017): $\frac{d S_{t}}{S_{t}} = σ ({\hat{B}}_{t}) \, d (\bar{ρ} W_{t} + ρ B_{t}) .$ (1) Here $(W, B)$ are two independent standard Brownian motions, $ρ \in (- 1, 1)$ is a correlation parameter, and ${\bar{ρ}}^{2} = 1 - ρ^{2}$. Then $\bar{ρ} W + ρ B$ is another standard Brownian motion which has constant correlation ρ with the factor B, which drives the stochastic volatility $σ_{\mathrm{stoch}} (t, ω) := σ ({\hat{B}}_{t} (ω)) \equiv σ (\hat{B}) .$ Here $σ (\cdot)$ is some real-valued function, typically smooth but not bounded, and we denote by $σ_{0} := σ (0)$ the spot volatility, with $\hat{B}$ a Gaussian (Volterra) process of the form ${\hat{B}}_{t} = \int_{0}^{t} K (t, s) \, d B_{s}, \quad t \geq 0,$ (2) for some kernel K, which shall be further specified in Assumptions 2.1 and 2.5 below. The log-price $X_{t} = \log (S_{t})$ satisfies $d X_{t} = - \frac{1}{2} σ^{2} ({\hat{B}}_{t}) \, d t + σ ({\hat{B}}_{t}) \, d (\bar{ρ} W + ρ B), \quad X_{0} = 0.$ (3) Recall that by Brownian scaling, for fixed $t > 0$, $(B_{t s}, W_{t s})_{s \geq 0} \overset{\mathrm{law}}{=} ε (B_{s}, W_{s})_{s \geq 0},$ where $ε \equiv ε (t) \equiv t^{1 / 2}$. As a direct consequence, classical short-time SDE problems can be analyzed as small-noise problems on a unit time horizon. For our analysis, it will also be crucial to impose such a scaling property on the Gaussian process $\hat{B}$ (more precisely, on the kernel K in (2)) driving the volatility process in our model:

Assumption 2.1

Small time self-similarity

There exists a number $t_{0}$ with $0 < t_{0} \leq 1$ and a function $t \mapsto \hat{ε} = \hat{ε} (t),$ $0 \leq t \leq t_{0},$ such that $({\hat{B}}_{t s} : 0 \leq s \leq t_{0}) \overset{l a w}{=} (\hat{ε} {\hat{B}}_{s} : 0 \leq s \leq t_{0}) .$

In fact, we will always have $\hat{ε} \equiv \hat{ε} (t) \equiv t^{H} = ε^{2 H},$ which covers the examples of interest, in particular standard fractional Brownian motion $\hat{B} = B^{H}$ or Riemann-Liouville fBM with explicit kernel $K (t, s) = \sqrt{2 H} \, {|t - s|}^{H - 1 / 2}$. (This is very natural, even from a general perspective of self-similar processes, see Lamperti 1962.)

We insist that no (global) self-similarity of $\hat{B}$ is required, as only $\hat{B} |_{[0, t]}$ for arbitrarily small t matters.

Remark 2.2

It should be possible to replace the fractional Brownian motion by a certain fractional Ornstein-Uhlenbeck process in the results obtained in this paper. Intuitively, this replacement creates a negligible perturbation (for $t ≪ 1$) of the fBm environment. A similar situation was in fact encountered in Cass and Friz (2010), where fractional scaling at times near zero was important. To quantify the perturbation, the authors of Cass and Friz (2010) introduced an easy-to-verify coupling condition (see Corollary 2 in Cass and Friz 2010). It should be possible to employ a version of this condition in the present paper to justify the replacement mentioned above. We will however not pursue this point further here.

Remark 2.3

Throughout this article, one can consider a classical (Markovian, diffusion) stochastic volatility setting by taking $K \equiv 1$ , or equivalently $H \equiv 1 / 2$ , by simply ignoring all hats ( $\hat{\cdot}$ ) in the sequel. In particular then, $\hat{ε} / ε \equiv 1$ in all subsequent formulae.

General facts on large deviations of Gaussian measures on Banach spaces (Deuschel and Stroock 1989) such as the path space $C ([0, 1], \mathbb{R}^{3})$ imply that a large deviation principle holds for the triple $\{\hat{ε} (W, B, \hat{B}) : \hat{ε} > 0\}$, with speed ${\hat{ε}}^{2}$ and rate function $\begin{cases} \frac{1}{2} {∥h∥}_{H_{0}^{1}}^{2} + \frac{1}{2} {∥f∥}_{H_{0}^{1}}^{2}, & f, h \in H_{0}^{1} \text{ and } \hat{f} = K \dot{f}, \\ + \infty, & \text{otherwise}, \end{cases}$ (4) where $K \dot{f} (t) := \int_{0}^{t} K (t, s) \dot{f} (s) \, d s$ for $f \in H_{0}^{1}$, the space of absolutely continuous paths with $L^{2}$ derivative $H_{0}^{1} := \{f : [0, 1] \to \mathbb{R} \text{ continuous} \mid {∥f∥}_{H_{0}^{1}}^{2} := \int_{0}^{1} {|\dot{f} (s)|}^{2} \, d s < \infty, \ f (0) = 0\} .$ (5)

This enables us to derive a large deviations principle for X in (3): the (local) small-time self-similarity property of $\hat{B}$ (Assumption 2.1) implies that $X_{t} \overset{\mathrm{law}}{=} X_{1}^{ε}$, where $d X_{t}^{ε} = σ (\hat{ε} {\hat{B}}_{t}) \, ε \, d (\bar{ρ} W_{t} + ρ B_{t}) - \frac{1}{2} ε^{2} σ^{2} (\hat{ε} {\hat{B}}_{t}) \, d t, \quad X_{0}^{ε} = 0.$ For what follows, it will be convenient to consider a rescaled version of (3), $d {\hat{X}}_{t}^{ε} \equiv d \left(\frac{\hat{ε}}{ε} X_{t}^{ε}\right) = σ (\hat{ε} {\hat{B}}_{t}) \, \hat{ε} \, d (\bar{ρ} W_{t} + ρ B_{t}) - \frac{1}{2} ε \hat{ε} σ^{2} (\hat{ε} {\hat{B}}_{t}) \, d t, \quad {\hat{X}}_{0}^{ε} = 0.$

Under a linear growth condition on the function σ, Forde and Zhang (2017) use the extended contraction principle to establish a large deviations principle for $({\hat{X}}_{1}^{ε})$ with speed ${\hat{ε}}^{2}$. More precisely, with $ϕ_{1} (h, f) := Φ_{1} (h, f, \hat{f}) = \int_{0}^{1} σ (\hat{f}) \, d (\bar{ρ} h + ρ f),$ (6) the rate function is given by $\begin{aligned} I (x) & = \inf_{h, f \in H_{0}^{1}} \left\{\frac{1}{2} \int_{0}^{1} {\dot{h}}^{2} \, d t + \frac{1}{2} \int_{0}^{1} {\dot{f}}^{2} \, d t : ϕ_{1} (h, f) = x\right\} \\ & = \inf_{f \in H_{0}^{1}} \left\{\frac{1}{2} \frac{{(x - ρ ⟨σ (\hat{f}), \dot{f}⟩)}^{2}}{{\bar{ρ}}^{2} ⟨σ^{2} (\hat{f}), 1⟩} + \frac{1}{2} \int_{0}^{1} {\dot{f}}^{2} \, d t\right\}, \end{aligned}$ (7) where $⟨\cdot, \cdot⟩$ denotes the inner product on $L^{2} ([0, 1], d t)$. Several other proofs (under varying assumptions on σ) have appeared since (Jacquier et al. 2017; Bayer et al. 2017; Gulisashvili 2017).

As a matter of fact, this paper relies on moderate, rather than large, deviations, as emphasized in (iiic) below. To this end, we make the following assumption.

Assumption 2.4

(i) (Positive spot vol) Assume $σ : \mathbb{R} \to \mathbb{R}$ is smooth with $σ_{0} := σ (0) > 0$.
(ii) (Roughness) The Hurst parameter H satisfies $H \in (0, 1 / 2]$.
(iiia) (Martingality) The price process $S = \exp X$ is a martingale.
(iiib) (Short-time moments) $\forall m < \infty \ \exists t > 0 : E (S_{t}^{m}) < \infty$.

While condition (iiia) hardly needs justification, we emphasize that conditions (iiia-b) are only used to the extent that they imply condition (iiic) given below (which thus may replace (iiia-b) as an alternative, if more technical, assumption). The reason we point this out explicitly is that all the conditions (iiia-c) are implicit (growth) conditions on the function $σ (\cdot)$. For instance, (iiia-b) was seen to hold under a linear growth assumption (Forde and Zhang 2017; Gulisashvili 2017), whereas the log-normal volatility case (think of $σ (x) = e^{x}$) is complicated. Martingality, for instance, requires $ρ \leq 0$ and there is a critical moment $m^{*} = m^{*} (ρ)$, even when $ρ < 0$. See Sin (1998), Jourdain (2004) and Lions and Musiela (2007) for the case $H = 1 / 2$ and the forthcoming work (Friz and Gassiat 2018) for the general rough case $H \in (0, 1 / 2]$. We view (iiic) simply as a more flexible condition that can hold in situations where (iiib) fails.

(iiic) (Call price upper moderate deviation bound) For every $β \in (0, H)$, every fixed $x > 0$, and ${\hat{x}}_{ε} := x ε^{1 - 2 H + 2 β}$, $E [(e^{X_{1}^{ε}} - e^{{\hat{x}}_{ε}})^{+}] \leq \exp \left(- \frac{x^{2} + o (1)}{2 σ_{0}^{2} ε^{4 H - 4 β}}\right) .$

This condition is reminiscent of the 'upper part' of the large deviation estimate obtained in Forde and Zhang (2017), $E [(e^{X_{1}^{ε}} - e^{x ε^{1 - 2 H}})^{+}] = \exp \left(- \frac{I (x) + o (1)}{ε^{4 H}}\right) .$ (8) In fact, if one formally applies this with x replaced by $x ε^{2 β}$, followed by Taylor expanding the rate function, $I (x ε^{2 β}) \sim \frac{1}{2} I^{″} (0) x^{2} ε^{4 β} = \frac{1}{2 σ_{0}^{2}} x^{2} ε^{4 β},$ one readily arrives at the estimate (iiic). Unfortunately, $o (1) = o_{x} (1)$ in (8), that is, the error term depends on x, which is a serious obstacle in making this argument rigorous. Instead, we will give a direct argument (Lemma 7.1) to see how (iiia-b) implies (iiic).

In the sequel, we will use another mild assumption on the kernel.

Assumption 2.5

The kernel K has the following properties

(i) ${\hat{B}}_{t} = \int_{0}^{t} K (t, s) \, d B_{s}$ has a continuous (in t) version on $[0, 1]$.
(ii) $\forall t \in [0, 1] : \int_{0}^{t} K (t, s)^{2} \, d s < \infty$.

Note that the Riemann-Liouville kernel $K (t, s) = \sqrt{2 H} (t - s)^{γ}$ , $γ = H - 1 / 2$ satisfies Assumption 2.5.

Remark 2.6

Assumption 2.5 implies that the Cameron-Martin space $\mathcal{H}$ of $\hat{B}$ is given by the image of $H_{0}^{1}$ under K, i.e. $\mathcal{H} = \{K \dot{f} \mid f \in H_{0}^{1}\} .$ See Lemma 5.3 and Remark 5.4 for more details. A reference and also a sufficient condition for Assumption 2.5 (i) can be found e.g. in Decreusefond (2005, Section 3).

3. Main results

The following result can be seen as a non-Markovian extension of work by Osajima (2015). The statement here is a combination of Theorem 5.10 and Proposition 5.14 below. Recall that $σ_{0} = σ (0)$ represents spot-volatility. We also set $σ_{0}^{'} \equiv σ^{'} (0)$.

Theorem 3.1

Energy expansion

The rate function (or energy) I in (7) is smooth in a neighborhood of $x = 0$ (at-the-money) and it is of the form $I (x) = \frac{1}{σ_{0}^{2}} \frac{x^{2}}{2} - \left(6 ρ \frac{σ_{0}^{'}}{σ_{0}^{4}} \int_{0}^{1} \int_{0}^{t} K (t, s) \, d s \, d t\right) \frac{x^{3}}{3!} + O (x^{4}) .$
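As an illustration of Theorem 3.1, the following Python sketch evaluates the cubic Taylor polynomial of the energy for the Riemann-Liouville kernel $K (t, s) = \sqrt{2 H} (t - s)^{H - 1 / 2}$. For this kernel the double integral can be computed by hand, $\int_{0}^{1} \int_{0}^{t} K (t, s) \, d s \, d t = \sqrt{2 H} / ((H + 1 / 2) (H + 3 / 2))$; this closed form is our own elementary computation and is not stated in the text. The function names and argument names are hypothetical.

```python
import math


def k11_riemann_liouville(H):
    """Closed form of int_0^1 int_0^t sqrt(2H) (t - s)^(H - 1/2) ds dt.

    Assumption: Riemann-Liouville kernel; other kernels need their own integral.
    """
    return math.sqrt(2.0 * H) / ((H + 0.5) * (H + 1.5))


def energy_cubic(x, sigma0, dsigma0, rho, H):
    """Cubic Taylor polynomial of I(x) at x = 0 (the O(x^4) term is dropped).

    sigma0 = sigma(0) and dsigma0 = sigma'(0) are the spot volatility and its slope.
    """
    second = 1.0 / sigma0 ** 2  # I''(0)
    third = -6.0 * rho * dsigma0 / sigma0 ** 4 * k11_riemann_liouville(H)  # I'''(0)
    return second * x ** 2 / 2.0 + third * x ** 3 / 6.0
```

For $H = 1 / 2$ the kernel is identically one and the double integral reduces to $1 / 2$, which gives a quick sanity check of the helper.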

The next result is an exact representation of call prices, valid in a non-Markovian generality, and amenable to moderate- and large-deviation analysis (Theorem 3.4 below).

Theorem 3.2

Pricing formula

For a fixed log-strike $x \geq 0$ and time to maturity $t > 0,$ set $\hat{x} := (ε / \hat{ε}) x,$ where $ε = t^{1 / 2}$ and $\hat{ε} = t^{H} = ε^{2 H},$ as before. Then we have $c (\hat{x}, t) = E [{(\exp (X_{t}) - \exp \hat{x})}^{+}] = e^{- I (x) / {\hat{ε}}^{2}} e^{(ε / \hat{ε}) x} J (ε, x),$ (9) where $J (ε, x) := E \left[e^{- (I^{'} (x) / {\hat{ε}}^{2}) {\hat{U}}^{ε}} \left(\exp \left(\frac{ε}{\hat{ε}} {\hat{U}}^{ε}\right) - 1\right) e^{I^{'} (x) R_{2}^{ε}} 1_{\{{\hat{U}}^{ε} \geq 0\}}\right]$ and ${\hat{U}}^{ε}$ is a random variable of the form ${\hat{U}}^{ε} = \hat{ε} g_{1} + {\hat{ε}}^{2} R_{2}^{ε}$ (10) with $g_{1}$ a centred Gaussian random variable, explicitly given in equation (38) below, and $R_{2}^{ε}$ a (random) remainder term, in the sense of a stochastic Taylor expansion in $\hat{ε},$ see Lemma 6.2 for more details.

Example 3.3

Black-Scholes model

We fix volatility $σ (\cdot) \equiv σ > 0$ and $H = 1 / 2$, so that $\hat{ε} = ε$ and all $\hat{\cdot}$ can be omitted. The energy is given by $I (x) = x^{2} / (2 σ^{2})$ and $U^{ε} = ε g_{1} + ε^{2} R_{2}^{ε} \equiv ε σ W_{1} - ε^{2} σ^{2} / 2$, with $R_{2}^{ε} = R_{2} \equiv - σ^{2} / 2$ independent of ε. Moreover, $\begin{aligned} J (ε, x) & = E [e^{- (I^{'} (x) / ε^{2}) U^{ε}} (e^{U^{ε}} - 1) e^{I^{'} (x) R_{2}} 1_{\{U^{ε} \geq 0\}}] \\ & = E [e^{- (I^{'} (x) / ε) g_{1}} (e^{ε g_{1} - ε^{2} σ^{2} / 2} - 1) 1_{\{g_{1} \geq ε σ^{2} / 2\}}] \\ & = E [e^{- α W_{1}} (e^{ε σ W_{1} - (ε σ)^{2} / 2} - 1) 1_{\{W_{1} \geq ε σ / 2\}}] \\ & = e^{- (ε σ)^{2} / 2} M (- α + ε σ) - M (- α) \end{aligned}$ (11) with $α := I^{'} (x) σ / ε = (1 / σ) (x / ε)$ and, in terms of the standard Gaussian cdf Φ, $M (β) := E [e^{β W_{1}} 1_{\{W_{1} \geq ε σ / 2\}}] = e^{β^{2} / 2} Φ \left(β - \frac{ε σ}{2}\right) .$ Using the expansion $Φ (- y) = \frac{1}{y \sqrt{2 π}} e^{- y^{2} / 2} (1 - y^{- 2} + \dots)$ as $y \to \infty$, one deduces, for fixed $x > 0$, the asymptotic relation, as $ε \to 0$, $J (ε, x) \sim \frac{e^{- x / 2}}{\sqrt{2 π}} \frac{ε^{3} σ^{3}}{x^{2}} .$ (12)

We will be interested (cf. Theorem 3.4) in replacing x by $\tilde{x} = x ε^{2 β} \to 0$ for $β > 0$. This gives $\tilde{α} = (1 / σ) (x / ε^{1 - 2 β})$, and the above analysis, now based on $\tilde{α} \to \infty$, remains valid for β in the 'moderate' regime $β \in [0, 1 / 2)$, and we obtain $\forall x > 0, \ β \in [0, 1 / 2) : \quad J (ε, x ε^{2 β}) \sim \frac{1}{\sqrt{2 π}} \frac{ε^{3 - 4 β} σ^{3}}{x^{2}} .$ (13)

Let us point out, for the sake of completeness, that a similar expansion is not valid for $β > 1 / 2$. To see this, first note that (9) implies that $J (ε, x) |_{x = 0}$ is precisely the ATM call price with time $t = ε^{2}$ from expiration. Well-known ATM asymptotics then imply that $J (ε, x) |_{x = 0} \sim (1 / \sqrt{2 π}) ε σ$ as $ε \to 0$. These asymptotics are unchanged in the case of $o (t^{1 / 2}) = o (ε)$ out-of-moneyness ('almost-at-the-money' in the terminology of Friz et al. 2018), which readily implies $\forall x > 0, \ β > 1 / 2 : \quad J (ε, x ε^{2 β}) \sim \frac{1}{\sqrt{2 π}} ε σ = \mathrm{const} \times ε .$ At last, we have the borderline case $β = 1 / 2$, or $\tilde{x} = x ε$. From e.g. Muhle-Karbe and Nutz (2011, Theorem 3.1), we see that $c (x ε, ε^{2}) \sim a (x; σ) ε$ with positive constant $a (x; σ)$. A look at (9) then reveals $\forall x > 0 : \quad J (ε, x ε) \sim a (x; σ) \, ε \, e^{x^{2} / (2 σ^{2})} = \mathrm{const} \times ε .$

For the call price expansion in the large / moderate deviations regime, $β \in [0, 1 / 2)$, the polynomial-in-ε behavior of (13) implies that the J-term in the pricing formula is negligible on the moderate / large deviation scale, in the sense that for any $θ > 0$, we have $ε^{θ} \log J (ε, x ε^{2 β}) \to 0$ as $ε \to 0$. Consequently, with $k_{t} = k t^{β}$, for $t = ε^{2}$, $k > 0$, $β \in [0, 1 / 2)$, we get the 'moderate' Black-Scholes call price expansion, $- \log c_{B S} (k_{t}, t) = \frac{1}{t^{1 - 2 β}} \frac{k^{2}}{2 σ^{2}} (1 + o (1)) \text{ as } t ↓ 0.$
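The Black-Scholes computations above are easy to check numerically. The following Python sketch (standard library only) evaluates the closed form in the last line of (11), compares it with the leading-order behaviour (12), and recovers the Black-Scholes call price through the pricing formula (9). The function names and the parameter values are hypothetical choices for illustration; the parameters are chosen so that $α = 20$, which is large enough for (12) to be accurate to within about one percent.

```python
import math


def norm_cdf(y):
    """Standard Gaussian cdf, written with erfc so that far left tails stay accurate."""
    return 0.5 * math.erfc(-y / math.sqrt(2.0))


def j_exact(eps, x, sigma):
    """Last line of (11): exp(-(eps*sigma)^2/2) * M(-alpha + eps*sigma) - M(-alpha)."""
    alpha = x / (sigma * eps)

    def m(b):
        # M(b) = exp(b^2 / 2) * Phi(b - eps*sigma/2)
        return math.exp(0.5 * b * b) * norm_cdf(b - 0.5 * eps * sigma)

    return math.exp(-0.5 * (eps * sigma) ** 2) * m(-alpha + eps * sigma) - m(-alpha)


def j_asymptotic(eps, x, sigma):
    """Leading-order term (12), for fixed x > 0 as eps -> 0."""
    return math.exp(-0.5 * x) / math.sqrt(2.0 * math.pi) * (eps * sigma) ** 3 / x ** 2


def bs_call(x, t, sigma):
    """Black-Scholes call with S0 = 1, r = 0, log-strike x and maturity t."""
    s = sigma * math.sqrt(t)
    d1 = -x / s + 0.5 * s
    return norm_cdf(d1) - math.exp(x) * norm_cdf(d1 - s)


eps, x, sigma = 0.02, 0.2, 0.5  # hypothetical values, alpha = x / (sigma * eps) = 20
ratio = j_exact(eps, x, sigma) / j_asymptotic(eps, x, sigma)

# Pricing formula (9) with H = 1/2: c = exp(-I(x)/eps^2) * exp(x) * J(eps, x)
energy = x ** 2 / (2.0 * sigma ** 2)
price_from_9 = math.exp(-energy / eps ** 2) * math.exp(x) * j_exact(eps, x, sigma)
price_bs = bs_call(x, eps ** 2, sigma)
```

Here `ratio` is slightly below one, the gap being of order $α^{- 2}$ as suggested by the next term in the expansion of Φ, and `price_from_9` agrees with `price_bs` up to floating-point cancellation.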

While the above can be confirmed by elementary analysis of the Black–Scholes formula, the following theorem exhibits it as an instance of a general principle. See Friz et al. (2018) for a general diffusion statement.

Theorem 3.4

Moderate Deviations

In the rough volatility regime $H \in (0, 1 / 2],$ consider log-strikes of the form $k_{t} = k t^{1 / 2 - H + β}$ for a constant $k \geq 0$.
(i) For $β \in (0, H)$ and every $θ > 0,$ we have $- \log c (k_{t}, t) = \frac{I^{′′} (0)}{t^{2 H - 2 β}} \frac{k^{2}}{2} + O (t^{3 β - 2 H}) + O (t^{- θ}) \text{ as } t ↓ 0.$
(ii) For $β \in (0, \frac{2}{3} H)$ and every $θ > 0,$ we have $- \log c (k_{t}, t) = \frac{I^{′′} (0)}{t^{2 H - 2 β}} \frac{k^{2}}{2} + \frac{I^{′′′} (0)}{t^{2 H - 3 β}} \frac{k^{3}}{6} + O (t^{4 β - 2 H}) + O (t^{- θ}) \text{ as } t ↓ 0.$
Moreover, $\begin{aligned} I^{′′} (0) & = \frac{1}{σ_{0}^{2}}, \\ I^{′′′} (0) & = - 6 ρ \frac{σ_{0}^{'}}{σ_{0}^{4}} \int_{0}^{1} \int_{0}^{t} K (t, s) \, d s \, d t = - 6 ρ \frac{σ_{0}^{'}}{σ_{0}^{4}} ⟨ K 1, 1 ⟩, \end{aligned}$ where $⟨\cdot, \cdot⟩$ is the inner product in $L^{2} ([0, 1])$.

Remark 3.5

In principle, further terms (of order $t^{i β - 2 H}$, $i = 4, 5, \dots$) can be added to this expansion of log call prices, given that the energy has sufficient regularity, see Theorem 3.6. We also note that, for small enough β, the error term $O (t^{- θ})$ can be omitted. In any case, one can replace the additive error bounds by (cruder) ones, where the right-most term in the expansion is multiplied by $(1 + o (1))$, as was done in Friz et al. (2018).

Proof of Theorem 3.4

We apply Theorem 3.2 with $\hat{x} = k_{t} = k t^{1 / 2 - H + β}$, i.e. with $x = k t^{β} = k ε^{2 β}$. In particular, with $\hat{ε} = t^{H}$ and $ε = t^{1 / 2}$, we get $c (k_{t}, t) = e^{- I (x) / {\hat{ε}}^{2}} e^{(ε / \hat{ε}) x} J (ε, k ε^{2 β}) .$ The technical Proposition 7.3 asserts that, for fixed $k > 0$, the factor J is negligible in the sense that, for every $θ > 0$, $ε^{θ} \log J (ε, k ε^{2 β}) \to 0 \text{ as } ε \to 0.$ The theorem now follows immediately from the Taylor expansion of $I (x)$ around $x = 0$ (see Theorem 3.1), plugging in $x = k t^{β}$. Indeed, replacing $I (x)$ by the Taylor jet seen in (i), (ii) leads exactly to an error term $O (t^{3 β - 2 H})$, resp. $O (t^{4 β - 2 H})$.

Fix real numbers k>0, $0 < H < \frac{1}{2}$ , $0 < β < H$ , and an integer $n \geq 2$ . For every $t > 0$ , set $k_{t} = k t^{1 / 2 - H + β},$ and denote $φ_{n, H, β, θ} (t) = max \{t^{2 H - 2 β - θ}, t^{(n - 1) β}\} .$ Here, $θ > 0$ can be arbitrarily small. It is clear that for all small t and θ small enough, $\begin{aligned} φ_{n, H, β, θ} (t) = t^{2 H - 2 β - θ} \Leftrightarrow 2 H - 2 β \leq (n - 1) β \\ \Leftrightarrow \frac{2 H}{n + 1} \leq β, \end{aligned}$ while $φ_{n, H, β, θ} (t) = t^{(n - 1) β} \Leftrightarrow 2 H - 2 β > (n - 1) β \Leftrightarrow β < \frac{2 H}{n + 1} .$ The following statement provides an asymptotic formula for the implied variance.

Theorem 3.6

Suppose $0 < β < 2 H / n$ and $θ > 0$ small enough. Then as $t \to 0$ (and for $k > 0$), $σ_{impl} (k_{t}, t)^{2} = \sum_{j = 0}^{n - 2} \frac{(- 1)^{j} 2^{j}}{I^{′′} (0)^{j + 1}} {\left(\sum_{i = 3}^{n} \frac{I^{(i)} (0)}{i!} k^{i - 2} t^{(i - 2) β}\right)}^{j} + O (φ_{n, H, β, θ} (t)) .$ (14) The $O$-estimate in (14) depends on n, H, β, θ, and k. It is uniform on compact subsets of $[0, \infty)$ with respect to the variable k.

Remark 3.7

Using the multinomial formula, we can represent the expression on the right-hand side of (14) in terms of certain powers of t. However, the coefficients become rather complicated.

Remark 3.8

Let an integer $n \geq 2$ be fixed, and suppose we would like to use only the derivatives $I^{(i)} (0)$ for $2 \leq i \leq n$ in formula (14) to approximate $σ_{impl} (k_{t}, t)^{2}$. Then, the optimal range for β is the following: $2 H / (n + 1) \leq β < 2 H / n$. On the other hand, if β is outside of the interval $[2 H / (n + 1), 2 H / n)$, more derivatives of the energy function at zero may be needed to get a good approximation of the implied variance in formula (14).
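The main term of formula (14) is straightforward to evaluate once the derivatives of the energy at zero are known. The following Python sketch does so for a user-supplied list of derivatives; the function name and its calling convention are hypothetical, and the $O (φ_{n, H, β, θ} (t))$ remainder is simply dropped.

```python
import math


def implied_variance(k, t, beta, derivs):
    """Main term of (14), without the O-remainder.

    derivs = [I''(0), I'''(0), ..., I^(n)(0)], so that n = len(derivs) + 1.
    """
    n = len(derivs) + 1
    i2 = derivs[0]
    # Inner sum over i = 3..n of I^(i)(0) / i! * k^(i-2) * t^((i-2)*beta);
    # it is empty (equal to zero) when n = 2.
    inner = sum(
        derivs[i - 2] / math.factorial(i) * k ** (i - 2) * t ** ((i - 2) * beta)
        for i in range(3, n + 1)
    )
    # Outer sum over j = 0..n-2 of (-2)^j / I''(0)^(j+1) * inner^j
    return sum((-2.0) ** j / i2 ** (j + 1) * inner ** j for j in range(n - 1))
```

With only $I^{′′} (0) = σ_{0}^{- 2}$ supplied ($n = 2$) the function returns $σ_{0}^{2}$. Supplying $I^{′′′} (0)$ as well ($n = 3$) adds the first-order correction $- \frac{1}{3} I^{′′′} (0) \, I^{′′} (0)^{- 2} \, k t^{β}$.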

We will next derive from Theorem 3.6 several asymptotic formulas for the implied volatility. In the next corollary, we take n=2.

Corollary 3.9

As $t \to 0,$ (15) $σ_{impl} (k_{t}, t) = σ_{0} + O (φ_{2, H, β, θ} (t)) .$ (15)

Corollary 3.9 follows from Theorem 3.6 with n=2, the equality (16) $I^{′′} (0) = σ_{0}^{- 2}$ (16) given in Theorem 3.4, and the Taylor expansion $\sqrt{1 + h} = 1 + O (h)$ as $h \to 0$ .

In the next corollary, we consider the case where n=3.

Corollary 3.10

Suppose $β < 2 H / 3$ . Then, as $t \to 0,$ (17) $σ_{impl} (k_{t}, t) = σ_{0} + ρ \frac{σ_{0}^{'}}{σ_{0}} ⟨ K 1, 1 ⟩ k t^{β} + O (φ_{3, H, β, θ} (t)) .$ (17)

Corollary 3.10 follows from Theorem 3.6 with n=3, formula (16), the equality $I^{′′′} (0) = - 6 ρ \frac{σ_{0}^{'}}{σ_{0}^{4}} ⟨ K 1, 1 ⟩$ (18) (see Theorem 3.4), and the expansion $\sqrt{1 + h} = 1 + \frac{1}{2} h + O (h^{2})$ as $h \to 0$.

Using Corollary 3.10, we establish the following implied volatility skew formula in the moderate deviation regime.

Corollary 3.11

Let $0 < H < \frac{1}{2},$ $0 < β < \frac{2}{3} H,$ and fix y,z>0 with $y \neq z$ . Then as $t \to 0,$ (19) $\frac{σ_{impl} (y t^{1 / 2 - H + β}, t) - σ_{impl} (z t^{1 / 2 - H + β}, t)}{(y - z) t^{1 / 2 - H + β}} \sim ρ \frac{σ_{0}^{'}}{σ_{0}} ⟨ K 1, 1 ⟩ t^{H - 1 / 2} .$ (19)
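To make the power law in (19) concrete, the following Python sketch evaluates the first-order implied volatility of Corollary 3.10 and the resulting skew for the Riemann-Liouville kernel. The closed form used for $⟨ K 1, 1 ⟩$ is our own elementary computation for that kernel, the remainder terms are dropped, and all function names and parameter values are hypothetical.

```python
import math


def k11_riemann_liouville(H):
    """<K1, 1> for K(t, s) = sqrt(2H) (t - s)^(H - 1/2), computed in closed form."""
    return math.sqrt(2.0 * H) / ((H + 0.5) * (H + 1.5))


def implied_vol_first_order(k, t, beta, sigma0, dsigma0, rho, H):
    """Main terms of (17) at log-strike k * t^(1/2 - H + beta); O-term dropped."""
    return sigma0 + rho * dsigma0 / sigma0 * k11_riemann_liouville(H) * k * t ** beta


def skew_leading_order(t, sigma0, dsigma0, rho, H):
    """Right-hand side of (19): rho * sigma0'/sigma0 * <K1, 1> * t^(H - 1/2)."""
    return rho * dsigma0 / sigma0 * k11_riemann_liouville(H) * t ** (H - 0.5)


def finite_difference_skew(y, z, t, beta, sigma0, dsigma0, rho, H):
    """Left-hand side of (19), with implied volatilities taken from (17)."""
    scale = t ** (0.5 - H + beta)
    iv_y = implied_vol_first_order(y, t, beta, sigma0, dsigma0, rho, H)
    iv_z = implied_vol_first_order(z, t, beta, sigma0, dsigma0, rho, H)
    return (iv_y - iv_z) / ((y - z) * scale)


params = dict(sigma0=0.2, dsigma0=0.3, rho=-0.7, H=0.1)  # hypothetical values
for t in (0.04, 0.01, 0.0025):
    print(t, skew_leading_order(t, **params))
```

Dividing the maturity by four multiplies the leading-order skew by $4^{1 / 2 - H}$, so for small H the skew steepens markedly as $t \to 0$.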

Remark 3.12

Corollary 3.11 complements earlier works of Alòs et al. (2007) and Fukasawa (2011, 2017). For instance, the following formula can be found in Fukasawa (2017, p. 6), see also Fukasawa (2011, p. 14): $\frac{σ_{impl} (y t^{1 / 2}, t) - σ_{impl} (z t^{1 / 2}, t)}{(y - z) t^{1 / 2}} \sim ρ C (H) \frac{σ_{0}^{'}}{σ_{0}} t^{H - 1 / 2} .$ (20) In formula (20), we employ the notation used in the present paper. Our analysis shows that the applicability range of skew approximation formulas is by no means restricted to the Central Limit Theorem type log-moneyness deviations of order $t^{1 / 2}$. It also includes the moderate deviations regime of order $t^{1 / 2 - H + β}$. The latter rate is clearly $≫ t^{1 / 2}$ as $t \to 0$.

Remark 3.13

Symmetry

Write $Φ_{1} (W, B, \hat{B}; ρ; σ)$ for the ‘Itô-type map’ $Φ_{1} (W, B, \hat{B}) := \int_{0}^{1} σ (\hat{B}) d (\bar{ρ} W + ρ B) .$ It equals, in law, $Φ_{1} (W, - B, - \hat{B}; - ρ; σ (- \cdot))$ , and indeed all our formulae are invariant under this transformation. In particular, the skew remains unchanged when the pair $(ρ, σ_{0}^{'})$ is replaced by $(- ρ, - σ_{0}^{'})$ .

4. Simulation results

We verify our theoretical results numerically with a variant of the rough Bergomi model (Bayer et al. 2016) which fits nicely into the general rough volatility framework considered in this paper. As before, the model has been normalized such that $S_{0} = 1$ and r=0. We let $(W, B)$ be two independent Brownian motions and $ρ \in (- 1, 1)$ with ${\bar{ρ}}^{2} = 1 - ρ^{2}$ such that $Z = \bar{ρ} W + ρ B$ is another Brownian motion having constant correlation ρ with B. For some spot volatility $σ_{0}$ and volatility of volatility parameter η, we then assume the following dynamics for some asset S: $\frac{d S_{t}}{S_{t}} = σ ({\hat{B}}_{t}) d Z_{t}$ (21) $σ (x) = σ_{0} \exp (\frac{1}{2} η x)$ (22) where $\hat{B}$ is a Riemann-Liouville fBM given by ${\hat{B}}_{t} = \sqrt{2 H} \int_{0}^{t} | t - s |^{H - 1 / 2} d B_{s} .$ The approach taken for the Monte Carlo simulations of the quantities we are interested in is the one initially explored in the original rough Bergomi pricing paper (Bayer et al. 2016). That is, we exploit the joint Gaussianity of $(Z, \hat{B})$ and use the well-known Cholesky method to simulate their joint paths on some discretization grid $D$ . With (22) being an explicit function of the rough driver, an Euler discretisation of the Itô SDE (21) on $D$ then yields estimates for the price paths.

The Cholesky algorithm critically hinges on the availability and explicit computability of the joint covariance matrix of $(Z, \hat{B})$ whose terms we readily compute below.

Lemma 4.1

For convenience, define constants $γ = \frac{1}{2} - H \in [0, \frac{1}{2})$ and $D_{H} = \sqrt{2 H} / (H + \frac{1}{2})$ and define an auxiliary function $G : [1, \infty) \to R$ by $G (x) = 2 H \left(\frac{1}{1 - γ} x^{- γ} + \frac{γ}{1 - γ} x^{- (1 + γ)} \frac{1}{2 - γ} \, {}_{2} F_{1} (1, 1 + γ, 3 - γ, x^{- 1})\right)$ (23) where ${}_{2} F_{1}$ denotes the Gaussian hypergeometric function (Olver et al. 2010). Then the joint process $(Z, \hat{B})$ has zero mean and covariance structure governed by $\begin{aligned} Var [{\hat{B}}_{t}] & = t^{2 H}, & & \text{for } t \geq 0, \\ Cov [{\hat{B}}_{s}, {\hat{B}}_{t}] & = t^{2 H} G (s / t), & & \text{for } s > t > 0, \\ Cov [{\hat{B}}_{s}, Z_{t}] & = ρ D_{H} \left(s^{H + 1 / 2} - (s - \min (t, s))^{H + 1 / 2}\right), & & \text{for } t, s \geq 0, \\ Cov [Z_{t}, Z_{s}] & = \min (t, s), & & \text{for } t, s \geq 0. \end{aligned}$
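The covariance structure of Lemma 4.1 can be turned into a sampler along the following lines. This is a minimal pure-Python sketch under stated assumptions, not the pricer used for the figures: the hypergeometric function is approximated by a truncated Gauss series (adequate only while the ratio of neighbouring grid times stays away from 1, so a library routine would be preferable on fine grids), and all function names are hypothetical.

```python
import math
import random


def hyp2f1_series(a, b, c, z, terms=2000):
    """Truncated Gauss series for 2F1(a, b; c; z); an assumption valid for 0 <= z < 1."""
    total, term = 1.0, 1.0
    for n in range(terms):
        term *= (a + n) * (b + n) / ((c + n) * (n + 1)) * z
        total += term
    return total


def G(x, H):
    """Auxiliary function (23) with gamma = 1/2 - H, for x >= 1."""
    g = 0.5 - H
    tail = hyp2f1_series(1.0, 1.0 + g, 3.0 - g, 1.0 / x)
    return 2.0 * H * (x ** (-g) / (1.0 - g)
                      + g / (1.0 - g) * x ** (-(1.0 + g)) * tail / (2.0 - g))


def joint_covariance(times, H, rho):
    """Covariance matrix of (Z_{t_1..t_n}, Bhat_{t_1..t_n}) as in Lemma 4.1."""
    n = len(times)
    d_h = math.sqrt(2.0 * H) / (H + 0.5)
    cov = [[0.0] * (2 * n) for _ in range(2 * n)]
    for i, s in enumerate(times):
        for j, t in enumerate(times):
            lo, hi = min(s, t), max(s, t)
            cov[i][j] = lo  # Cov[Z_s, Z_t] = min(s, t)
            if hi > lo:     # Cov[Bhat_s, Bhat_t] = lo^{2H} G(hi / lo)
                cov[n + i][n + j] = lo ** (2 * H) * G(hi / lo, H)
            else:           # Var[Bhat_t] = t^{2H}
                cov[n + i][n + j] = lo ** (2 * H)
            # Cov[Bhat_s, Z_t], stored in both off-diagonal blocks
            cross = rho * d_h * (s ** (H + 0.5) - (s - min(s, t)) ** (H + 0.5))
            cov[n + i][j] = cross
            cov[j][n + i] = cross
    return cov


def cholesky(a):
    """Lower-triangular factor L with L L^T = a (a symmetric positive definite)."""
    n = len(a)
    low = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            acc = a[i][j] - sum(low[i][k] * low[j][k] for k in range(j))
            low[i][j] = math.sqrt(acc) if i == j else acc / low[j][j]
    return low


def sample_path(times, H, rho, rng):
    """One joint Gaussian sample of (Z, Bhat) on the grid `times` (all > 0)."""
    low = cholesky(joint_covariance(times, H, rho))
    xi = [rng.gauss(0.0, 1.0) for _ in range(len(low))]
    joint = [sum(low[i][k] * xi[k] for k in range(i + 1)) for i in range(len(low))]
    n = len(times)
    return joint[:n], joint[n:]


times = [0.25, 0.5, 0.75, 1.0]
z_path, bhat_path = sample_path(times, H=0.3, rho=-0.7, rng=random.Random(0))
```

Given such samples, an Euler step for (21) multiplies $σ ({\hat{B}}_{t_{i}})$ by the increment of Z over each grid cell. The Cholesky factor depends only on the grid and on $(H, ρ)$ , so in practice it would be computed once and reused across all Monte Carlo samples.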

Numerical simulations confirm the theoretical results obtained in the last section. In particular, as can be seen in Figure 1, the asymptotic formula (17) for the implied volatility captures very well the geometry of the term structure of implied volatility, with particularly good results for higher H and worsening results as $H ↓ 0$ . Quite surprisingly, despite being an asymptotic formula, it seems to be fairly accurate over a wide array of maturities extending up to a single year.

Figure 1. Illustration of the term structure of implied volatility of the Modified Rough Bergomi model in the moderate deviations regime with time-varying log-strike $k_{t} = 0.4 t^{β}$ . Depicted are the asymptotic formula (17) (dashed line) and an estimate based on $N = 10^{8}$ samples of a MC Cholesky Option Pricer (solid line) with 500 time steps. Model parameters are given by spot vol $σ_{0} \approx 0.2557$ , vvol $η = 0.2928$ and correlation parameter $ρ = - 0.7571$ .

5. Proof of the energy expansion

Consider $\begin{aligned} d X & = - \frac{1}{2} σ^{2} (Y) d t + σ (Y) d (\bar{ρ} W + ρ B), & X_{0} & = 0, \\ d Y & = d \hat{B}, & Y_{0} & = 0, \end{aligned}$ where ${\hat{B}}_{t} = \int_{0}^{t} K (t, s) d B_{s}$ for a fixed Volterra kernel (recall (3) in the previous section). We study the small noise problem $(X^{ε}, Y^{ε})$ where $(W, B, \hat{B})$ is replaced by $(ε W, ε B, \hat{ε} \hat{B})$ . The following proposition roughly says that $P (X_{1}^{ε} \approx \frac{ε}{\hat{ε}} x) \approx \exp (- \frac{I (x)}{{\hat{ε}}^{2}}) .$

Proposition 5.1

Forde and Zhang 2017

Under suitable assumptions (cf. Section 2), the rescaled process $((\hat{ε} / ε) X_{1}^{ε} : ε \geq 0)$ satisfies an LDP (with speed ${\hat{ε}}^{2}$ ) and rate function $\begin{aligned} I (x) & = \inf_{f \in H_{0}^{1}} \left[\frac{{(x - ρ G (f))}^{2}}{2 {\bar{ρ}}^{2} F (f)} + \frac{1}{2} E (f)\right] \\ & \equiv \inf_{f \in H_{0}^{1}} I_{x} (f) \equiv I_{x} (f^{x}), \end{aligned}$ (24) where $\begin{aligned} G (f) & = \int_{0}^{1} σ ((K \dot{f}) (s)) {\dot{f}}_{s} d s \equiv ⟨σ (K \dot{f}), \dot{f}⟩ \equiv ⟨σ (\hat{f}), \dot{f}⟩, \\ F (f) & = \int_{0}^{1} σ {((K \dot{f}) (s))}^{2} d s \equiv ⟨σ^{2} (K \dot{f}), 1⟩ \equiv ⟨σ^{2} (\hat{f}), 1⟩, \\ E (f) & = \int_{0}^{1} {|\dot{f} (s)|}^{2} d s \equiv ⟨\dot{f}, \dot{f}⟩ . \end{aligned}$

The rest of this section is devoted to analysis of the function I as defined in (24). First, we derive the first order optimality condition for the above minimization problem.

Proposition 5.2

First order optimality condition

For any $x \in R$ we have at any local minimizer $f = f^{x}$ of the functional $I_{x}$ in (24) that $\begin{aligned} f_{t}^{x} = {} & \frac{ρ (x - ρ G (f^{x})) \{⟨σ (K {\dot{f}}^{x}), 1_{[0, t]}⟩ + ⟨σ^{'} (K {\dot{f}}^{x}) {\dot{f}}^{x}, K 1_{[0, t]}⟩\}}{{\bar{ρ}}^{2} F (f^{x})} \\ & + \frac{{(x - ρ G (f^{x}))}^{2}}{{\bar{ρ}}^{2} F^{2} (f^{x})} ⟨(σ σ^{'}) (K {\dot{f}}^{x}), K 1_{[0, t]}⟩, \end{aligned}$ (25) for all $t \in [0, 1]$ .

Proof.

We denote $a \approx b$ whenever $a = b + o (δ)$ for a small parameter δ. We expand $\begin{aligned} E (f + δ g) & \approx E (f) + 2 δ ⟨\dot{f}, \dot{g}⟩ \\ F (f + δ g) & \approx F (f) + δ ⟨{(σ^{2})}^{'} (K \dot{f}), K \dot{g}⟩ \\ G (f + δ g) & \approx G (f) + δ \{⟨σ (K \dot{f}), \dot{g}⟩ + ⟨σ^{'} (K \dot{f}) \dot{f}, K \dot{g}⟩\} \end{aligned}$ If $f = f^{x}$ is a minimizer then $δ \mapsto I_{x} (f + δ g)$ has a minimum at $δ = 0$ for all g. We expand $\begin{aligned} I_{x} (f + δ g) = \frac{{(x - ρ G (f + δ g))}^{2}}{2 {\bar{ρ}}^{2} F (f + δ g)} + \frac{1}{2} E (f + δ g) \\ \approx \frac{{(x - ρ G (f) - δ ρ \{⟨σ (K \dot{f}), \dot{g}⟩ + ⟨σ^{'} (K \dot{f}) \dot{f}, K \dot{g}⟩\})}^{2}}{2 {\bar{ρ}}^{2} [F (f) + δ ⟨{(σ^{2})}^{'} (K \dot{f}), K \dot{g}⟩]} \\ + \frac{1}{2} E (f) + δ ⟨\dot{f}, \dot{g}⟩ \\ \approx \frac{\begin{matrix} {(x - ρ G (f))}^{2} - δ 2 ρ (x - ρ G (f)) \\ \{⟨σ (K \dot{f}), \dot{g}⟩ + ⟨σ^{'} (K \dot{f}) \dot{f}, K \dot{g}⟩\} \end{matrix}}{2 {\bar{ρ}}^{2} F (f) [1 + \frac{δ}{F (f)} ⟨{(σ^{2})}^{'} (K \dot{f}), K \dot{g}⟩]} \\ + \frac{1}{2} E (f) + δ ⟨\dot{f}, \dot{g}⟩ \\ \approx \frac{\begin{matrix} {(x - ρ G (f))}^{2} - δ 2 ρ (x - ρ G (f)) \\ \{⟨σ (K \dot{f}), \dot{g}⟩ + ⟨σ^{'} (K \dot{f}) \dot{f}, K \dot{g}⟩\} \end{matrix}}{2 {\bar{ρ}}^{2} F (f)} \\ - \frac{{(x - ρ G (f))}^{2}}{2 {\bar{ρ}}^{2} F (f)} \frac{δ}{F (f)} ⟨{(σ^{2})}^{'} (K \dot{f}), K \dot{g}⟩ + \frac{1}{2} E (f) \\ + δ ⟨\dot{f}, \dot{g}⟩ . \end{aligned}$ As a consequence, we must have, for $f = f^{x}$ and every $\dot{g} \in L^{2} [0, 1]$ $\begin{aligned} 0 & = \frac{d}{d δ} {\{I_{x} (f + δ g)\}}_{δ = 0} \\ = - \frac{ρ (x - ρ G (f)) \{⟨σ (K \dot{f}), \dot{g}⟩ + ⟨σ^{'} (K \dot{f}) \dot{f}, K \dot{g}⟩\}}{{\bar{ρ}}^{2} F (f)} \\ - \frac{{(x - ρ G (f))}^{2}}{{\bar{ρ}}^{2} F^{2} (f)} ⟨(σ σ^{'}) (K \dot{f}), K \dot{g}⟩ + ⟨\dot{f}, \dot{g}⟩ . \end{aligned}$ Recall $f_{0}^{x} = 0$ , any x. 
We now test with $\dot{g} = 1_{[0, t]}$ for a fixed $t \in [0, 1]$ and obtain $\begin{aligned} f_{t}^{x} & = \frac{ρ (x - ρ G (f^{x})) \{⟨σ (K {\dot{f}}^{x}), 1_{[0, t]}⟩ + ⟨σ^{'} (K {\dot{f}}^{x}) {\dot{f}}^{x}, K 1_{[0, t]}⟩\}}{{\bar{ρ}}^{2} F (f^{x})} \\ + \frac{{(x - ρ G (f^{x}))}^{2}}{{\bar{ρ}}^{2} F^{2} (f^{x})} ⟨(σ σ^{'}) (K {\dot{f}}^{x}), K 1_{[0, t]}⟩ . \end{aligned}$

5.1. Smoothness of the energy

Having formally identified the first order condition for minimality in (24), we will now show that the energy $x \mapsto I (x)$ is a smooth function. More precisely, we will use the implicit function theorem to show that the minimizing configuration $f^{x}$ is a smooth function in x (locally at x=0). As $I_{x}$ is a smooth function, too, this will imply smoothness of $x \mapsto I_{x} (f^{x}) = I (x)$ , at least in a neighborhood of 0.

As the Cameron-Martin space $H$ of the process $\hat{B}$ continuously embeds into $C ([0, 1])$ , K maps $H_{0}^{1}$ continuously into $C ([0, 1])$ , i.e. there is a constant C>0 such that for any $f \in H_{0}^{1}$ we have (26) ${∥K \dot{f}∥}_{\infty} \leq C {∥f∥}_{H_{0}^{1}} .$ (26) This result will follow from

Lemma 5.3

Let $(V_{t} : 0 \leq t \leq 1)$ be a continuous, centred Gaussian process and $H$ its Cameron-Martin space. Then we have the continuous embedding $H ↪ C [0, 1]$ . That is, for some constant C, ${∥h∥}_{\infty} \leq C {∥h∥}_{H} .$

Proof.

By a fundamental result of Fernique, applied to the law of V as Gaussian measure on the Banach space $(C [0, 1], ∥ \cdot ∥_{\infty})$ , the random variable $∥ V ∥_{\infty}$ has Gaussian integrability. In particular, $σ^{2} := E ({∥V∥}_{\infty}^{2}) < \infty .$ On the other hand, a generic element $h \in H$ can be written as $h_{t} = E [V_{t} Z]$ where Z is a centred Gaussian random variable with variance $∥ h ∥_{H}^{2}$ , see, e.g. Friz and Hairer (2014, page 150). By Cauchy–Schwarz, $|h_{t}| \leq E {[|V_{t}|^{2}]}^{1 / 2} {∥h∥}_{H} \leq σ {∥h∥}_{H},$ and we conclude by taking the supremum of the left-hand side over $t \in [0, 1]$ .

Remark 5.4

Assume V is of Volterra form, i.e. $V_{t} = \int_{0}^{t} K (t, s) d B_{s}$ . Then it can be shown (see Decreusefond 2005, Section 3) that $H$ is the image of $L^{2}$ under the map $K : \dot{f} \mapsto \hat{f} := (t \mapsto \int_{0}^{t} K (t, s) {\dot{f}}_{s} d s)$ and $∥ K \dot{f} ∥_{H} = ∥ \dot{f} ∥_{L^{2}}$ . In particular then, applying the above with $h = K \dot{f} \in H$ , gives ${∥K \dot{f}∥}_{\infty} \leq C {∥K \dot{f}∥}_{H} = C {∥\dot{f}∥}_{L^{2}} = C {∥f∥}_{H_{0}^{1}} .$

5.1.1. The uncorrelated case

We start with the case $ρ = 0$ as the formulas are much simpler in this case.

By Proposition 5.2, any local optimizer $f = f^{x}$ of the functional $I_{x} : H_{0}^{1} \to R$ in the uncorrelated case $ρ = 0$ satisfies for any $t \in [0, 1]$ $f_{t} = \frac{x^{2}}{F^{2} (f)} ⟨(σ σ^{'}) (K \dot{f}), K 1_{[0, t]}⟩ .$ We define a map $H : H_{0}^{1} \times R \to H_{0}^{1}$ by (27) $H (f, x) (t) := f_{t} - \frac{x^{2}}{F^{2} (f)} ⟨(σ σ^{'}) (K \dot{f}), K 1_{[0, t]}⟩ .$ (27) Hence, for given $x \in R$ , any local optimizer f must solve $H (f, x) = 0$ . As one particular solution is given by the pair $(0, 0)$ , we are in the realm of the implicit function theorem. We need to prove that

$(f, x) \mapsto H (f, x)$ is locally smooth (in the sense of Fréchet);
$D H (f, x) := (\partial / \partial f) H (f, x)$ is invertible in $(0, 0)$ .

Note that invertibility should hold for x small enough, as $D H (f, x) = {id}_{H_{0}^{1}} - x^{2} R$ for some R, which is invertible (by a Neumann series argument) for sufficiently small x, provided that R has bounded norm.

Remark 5.5

The method of proof in this section is purely local in $H_{0}^{1}$ . Hence, we only really need smoothness of σ locally around 0. Note, however, that stochastic Taylor expansions used in Section 6 will actually require global smoothness of σ.

Lemma 5.6

The functions $F : H_{0}^{1} \to R$ and $R_{1} : H_{0}^{1} \to C ([0, 1])$ defined by $R_{1} (f) (t) := ⟨(σ σ^{'}) (K \dot{f}), K 1_{[0, t]}⟩, t \in [0, 1],$ are smooth in the sense of Fréchet.

Proof.

For $N \geq 1$ we note that the Gateaux derivative of F satisfies $D^{N} F (f) \cdot (g_{1}, \dots, g_{N}) = \int_{0}^{1} \frac{d^{N}}{d x^{N}} σ^{2} (K \dot{f}) K {\dot{g}}_{1} \dots K {\dot{g}}_{N} d s .$ By Lemma 5.3, we can bound $\begin{aligned} ∣D^{N} F (f) \cdot (g_{1}, \dots, g_{N})∣ & \leq const \int_{0}^{1} ∣K {\dot{g}}_{1} (s)∣ \dots ∣K {\dot{g}}_{N} (s)∣ d s \\ & \leq const {∥K {\dot{g}}_{1}∥}_{\infty} \dots {∥K {\dot{g}}_{N}∥}_{\infty} \\ & \leq const C^{N} {∥g_{1}∥}_{H_{0}^{1}} \dots {∥g_{N}∥}_{H_{0}^{1}}, \end{aligned}$ for $const = {∥(d^{N} / d x^{N}) σ^{2}∥}_{\infty}$ . Thus, $D^{N} F (f)$ is a multi-linear form on $H_{0}^{1}$ with operator norm $∥D^{N} F (f)∥ \leq {∥(d^{N} / d x^{N}) σ^{2}∥}_{\infty} C^{N}$ independent of f. As $f \mapsto D^{N} F (f)$ is continuous, we conclude that $D^{N} F (f)$ as given above is, in fact, a Fréchet derivative.

Let us next consider the functional $R_{1}$ . Note that $(D^{N} R_{1} (f) \cdot (g_{1}, \dots, g_{N})) (t) = ⟨s_{N} (K \dot{f}) K {\dot{g}}_{1} \dots K {\dot{g}}_{N}, K 1_{[0, t]}⟩$ for $s_{N} (x) := (d^{N} / d x^{N}) σ (x) σ^{'} (x)$ . Hence, Assumption 2.5 implies that $\begin{aligned} {∥D^{N} R_{1} (f) \cdot (g_{1}, \dots, g_{N})∥}_{H_{0}^{1}}^{2} \\ = \int_{0}^{1} {(\int_{t}^{1} s_{N} ((K \dot{f}) (s)) \prod_{i = 1}^{N} (K {\dot{g}}_{i}) (s) K (s, t) d s)}^{2} d t \\ \leq {∥s_{N}∥}_{\infty}^{2} \prod_{i = 1}^{N} {∥K {\dot{g}}_{i}∥}_{\infty}^{2} \int_{0}^{1} \int_{t}^{1} K (s, t)^{2} d s d t \\ \leq {∥s_{N}∥}_{\infty}^{2} C^{2 N} \prod_{i = 1}^{N} {∥g_{i}∥}_{H_{0}^{1}}^{2} \int_{0}^{1} \int_{0}^{s} K (s, t)^{2} d t d s \\ \leq {∥s_{N}∥}_{\infty}^{2} C^{2 N} \int_{0}^{1} \int_{0}^{s} K (s, t)^{2} d t d s \prod_{i = 1}^{N} {∥g_{i}∥}_{H_{0}^{1}}^{2} . \end{aligned}$ We see that the multi-linear map $D^{N} R_{1} (f)$ has operator norm bounded by $∥D^{N} R_{1} (f)∥ \leq {∥s_{N}∥}_{\infty} C^{N} \sqrt{\int_{0}^{1} \int_{0}^{s} K (s, t)^{2} d t d s},$ independent of f. From continuity of $f \mapsto D^{N} R_{1} (f)$ , it follows that $D^{N} R_{1} (f)$ is the N'th Fréchet derivative.

Theorem 5.7

Zero correlation

Assuming $ρ = 0,$ the energy $I (x)$ (as defined in (24)) is smooth in a neighborhood of x=0.

Proof.

By construction, we have $D H (f, x) = {id}_{H_{0}^{1}} - x^{2} A (f)$ for $A : H_{0}^{1} \to L (H_{0}^{1}, H_{0}^{1})$ defined by $A (f) := R_{1} (f) \otimes D F^{- 2} (f) + F^{- 2} (f) D R_{1} (f) .$ Here, $(R_{1} (f) \otimes D F^{- 2} (f)) \cdot g = \underset{\in R}{\underset{⏟}{(D F^{- 2} (f) \cdot g)}} \underset{\in H_{0}^{1}}{\underset{⏟}{R_{1} (f)}} .$ As verified above, H is smooth in the sense of Fréchet. Trivially, $D H (0, 0) = {id}_{H_{0}^{1}}$ is invertible and $H (0, 0) = 0$ . Therefore, the implicit function theorem implies that there are open neighborhoods U and V of $0 \in H_{0}^{1}$ and $0 \in R$ , respectively, and a smooth map $x \mapsto f^{x}$ from V to U such that $H (f^{x}, x) \equiv 0$ and $f^{x}$ is unique in U with this property.

For the energy, we prove that $I (x) = I_{x} (f^{x})$ in a neighborhood of x=0. First of all, we show that $f^{x}$ is indeed a minimizer. If not, there is a function $g \in H_{0}^{1}$ with $I_{x} (g) < I_{x} (f^{x})$ . For small enough x such a g must be inside a ball with radius ε around $0 \in H_{0}^{1}$ , as $I_{x} (g) \geq \frac{1}{2} {∥g∥}_{H_{0}^{1}}^{2}$ and $lim_{x \to 0} I_{x} (f^{x}) = 0$ . Then note that for any $g \in H_{0}^{1}$ with $g \neq 0$ , $D^{2} I_{0} (0) \cdot (g, g) = {∥g∥}_{H_{0}^{1}}^{2} > 0,$ where $D^{2} I_{x} (f)$ denotes the second derivative of $f \mapsto I_{x} (f)$ . By continuity, $D^{2} I_{x} (f)$ stays positive definite for $(x, f)$ in a neighborhood of $(0, 0)$ . As noted, for x small enough, both g and $f^{x}$ (and the line connecting them) lie in this neighborhood. For $h := g - f^{x}$ , Taylor's formula with integral remainder gives $\begin{aligned} I_{x} (g) - I_{x} (f^{x}) = {} & D I_{x} (f^{x}) \cdot h \\ & + \int_{0}^{1} (1 - t) D^{2} I_{x} (f^{x} + t h) \cdot (h, h) d t > 0, \end{aligned}$ since $D I_{x} (f^{x}) \cdot h = 0$ and $D^{2} I_{x} (f^{x} + t h) \cdot (h, h) > 0$ . This contradicts the assumption that $I_{x} (g) < I_{x} (f^{x})$ , and we conclude that $f^{x}$ is, indeed, a minimizer of $I_{x}$ , implying that $I (x) = I_{x} (f^{x})$ locally.

Finally, as $x \mapsto f^{x}$ is smooth and $(f, x) \mapsto I_{x} (f) = \frac{x^{2}}{2 F (f)} + \frac{1}{2} {∥f∥}_{H_{0}^{1}}^{2}$ is smooth, we see that $x \mapsto I (x) = I_{x} (f^{x})$ is smooth in a neighborhood of 0. (Note that this argument relies on $σ (0) \neq 0$ , implying that $F (f) \neq 0$ for f in a neighborhood of 0.)

Remark 5.8

Classical counter-examples in the context of the direct method of calculus of variations show that the step of verifying the existence of a minimizer should not be taken too lightly. For instance, the functional $J (u) := \int_{0}^{1} [(u^{'} (s)^{2} - 1)^{2} + u (s)^{2}] d s$ does not have a minimizer in $H_{0}^{1}$ , but J can be made arbitrarily close to 0 by choosing piecewise-linear functions u with slope $∣u^{'}∣ = 1$ oscillating around 0. We refer to any text book on calculus of variations. In the situation above, local ‘convexity’ in the sense of a positive definite second derivative prevents this phenomenon. An alternative method of proof for the existence of a minimizer is to show that J is (lower semi-) continuous in the weak sense.
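The counter-example can be checked numerically. The sketch below (function name and grid size are illustrative choices) evaluates J on the sawtooth $u_{n} (s) = dist (s, \frac{1}{n} Z)$ , which has $u_{n} (0) = u_{n} (1) = 0$ and slope $\pm 1$ almost everywhere, so that only the term $\int u_{n}^{2}$ contributes and $J (u_{n}) = 1 / (12 n^{2}) \to 0$ , although $J (u) = 0$ would require both $u \equiv 0$ and $∣u^{'}∣ = 1$ .

```python
def J_zigzag(n: int, cells: int = 2000) -> float:
    """J(u_n) for the sawtooth u_n(s) = dist(s, (1/n) Z), by midpoint quadrature.

    `cells` (even) is the number of quadrature cells per tooth, so the kinks
    of u_n fall on cell boundaries.
    """
    period = 1.0 / n
    m = n * cells
    total = 0.0
    for k in range(m):
        s = (k + 0.5) / m
        r = s % period
        u = min(r, period - r)                      # sawtooth value
        slope = 1.0 if r < period / 2 else -1.0     # u_n'(s) = +1 or -1
        total += (slope ** 2 - 1.0) ** 2 + u ** 2   # integrand of J
    return total / m


for n in (1, 2, 4, 8):
    print(n, J_zigzag(n), 1.0 / (12 * n * n))
```

The two printed columns agree up to quadrature error, illustrating that the infimum 0 is approached but never attained.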

5.1.2. The general case

In the general case (cf. Proposition 5.2), we define the function $H : H_{0}^{1} \times R \to H_{0}^{1}$ by $\begin{aligned} H (f, x) (t) := {} & f_{t} - \frac{ρ (x - ρ G (f)) \{⟨σ (K \dot{f}), 1_{[0, t]}⟩ + ⟨σ^{'} (K \dot{f}) \dot{f}, K 1_{[0, t]}⟩\}}{{\bar{ρ}}^{2} F (f)} \\ & - \frac{{(x - ρ G (f))}^{2}}{{\bar{ρ}}^{2} F^{2} (f)} ⟨(σ σ^{'}) (K \dot{f}), K 1_{[0, t]}⟩ \\ = {} & f_{t} - \frac{ρ (x - ρ G (f))}{{\bar{ρ}}^{2} F (f)} (R_{2} (f) (t) + R_{3} (f) (t)) \\ & - \frac{{(x - ρ G (f))}^{2}}{{\bar{ρ}}^{2} F (f)^{2}} R_{1} (f) (t), \end{aligned}$ (28) where $R_{2}, R_{3} : H_{0}^{1} \to H_{0}^{1}$ are defined by $R_{2} (f) (t) := ⟨σ (K \dot{f}), 1_{[0, t]}⟩,$ (29) $R_{3} (f) (t) := ⟨σ^{'} (K \dot{f}) \dot{f}, K 1_{[0, t]}⟩,$ (30) for $t \in [0, 1]$ . By (25), any local minimizer $f^{x}$ satisfies $H (f^{x}, x) = 0$ , and (28) reduces to (27) for $ρ = 0$ .

One easily checks that G, $R_{2}$ , $R_{3}$ are smooth in the Fréchet sense.

Lemma 5.9

The functions $G : H_{0}^{1} \to R,$ $R_{2} : H_{0}^{1} \to H_{0}^{1}$ and $R_{3} : H_{0}^{1} \to H_{0}^{1}$ are smooth in Fréchet sense.

Proof.

The proof of smoothness is clear. We report the actual derivatives. For G we get $\begin{aligned} D^{N} G (f) & \cdot (g_{1}, \dots, g_{N}) = ⟨σ^{(N)} (K \dot{f}) \dot{f}, \prod_{i = 1}^{N} K {\dot{g}}_{i}⟩ \\ + \sum_{k = 1}^{N} ⟨σ^{(N - 1)} (K \dot{f}), {\dot{g}}_{k} \prod_{i \neq k} K {\dot{g}}_{i}⟩ . \end{aligned}$ For $R_{2}$ and, respectively, $R_{3}$ , we obtain $\begin{aligned} (D^{N} R_{2} (f) \cdot (g_{1}, \dots, g_{N})) (t) \\ = \int_{0}^{t} σ^{(N)} ((K \dot{f}) (s)) \prod_{i = 1}^{N} (K {\dot{g}}_{i}) (s) d s, \end{aligned}$ and $\begin{aligned} (D^{N} R_{3} (f) \cdot (g_{1}, \dots, g_{N})) (t) \\ = ⟨σ^{(N + 1)} (K \dot{f}) \dot{f} K 1_{[0, t]}, \prod_{i = 1}^{N} K {\dot{g}}_{i}⟩ \\ + \sum_{k = 1}^{N} ⟨σ^{(N)} (K \dot{f}) K 1_{[0, t]}, {\dot{g}}_{k} \prod_{i \neq k} K {\dot{g}}_{i}⟩ . \end{aligned}$

Theorem 5.10

Let σ be smooth with $σ (0) \neq 0$ . Then the energy $I (x)$ as defined in (24) is smooth in a neighborhood of x=0.

Proof.

The proof is similar to the proof of Theorem 5.7. In fact, the only difference is in establishing invertibility of $D H (0, 0)$ and the existence of a minimizer.

Note that (28) contains three terms. The derivative of the first term ( $f \mapsto f$ ) is always equal to ${id}_{H_{0}^{1}}$ . For the second term, we note that ${(x - ρ G (f))|}_{x = 0, f = 0} = 0.$ Hence, the only non-vanishing contribution to the derivative of the second term evaluated in direction $g \in H_{0}^{1}$ at x=0, f=0 and $t \in [0, 1]$ is $\begin{aligned} \frac{ρ^{2} D G (0) \cdot g}{{\bar{ρ}}^{2} F (0)} (R_{2} (0) + R_{3} (0)) (t) & = \frac{ρ^{2} σ_{0} g (1)}{{\bar{ρ}}^{2} σ_{0}^{2}} (σ_{0} t + 0) \\ & = \frac{ρ^{2}}{{\bar{ρ}}^{2}} g (1) t . \end{aligned}$ For the same reason, the derivative of the third term at $(f, x) = (0, 0)$ vanishes entirely. Hence, $(D H (0, 0) \cdot g) (t) = g (t) + \frac{ρ^{2}}{{\bar{ρ}}^{2}} g (1) t .$ It is easy to see that $g \mapsto D H (0, 0) \cdot g$ is invertible. Indeed, let us construct the pre-image $g = D H (0, 0)^{- 1} \cdot h$ of some $h \in H_{0}^{1}$ . At t=1 we have $\frac{{\bar{ρ}}^{2} + ρ^{2}}{{\bar{ρ}}^{2}} g (1) = h (1),$ implying $g (1) = {\bar{ρ}}^{2} h (1)$ . For $0 \leq t < 1$ , we then get $\begin{aligned} g (t) + \frac{ρ^{2}}{{\bar{ρ}}^{2}} g (1) t & = g (t) + \frac{ρ^{2}}{{\bar{ρ}}^{2}} {\bar{ρ}}^{2} h (1) t \\ & = g (t) + ρ^{2} h (1) t = h (t), \end{aligned}$ or $g (t) = h (t) - ρ^{2} h (1) t$ .
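The explicit inverse can be sanity-checked on a grid. In the following minimal sketch (the helper names and the test function h are hypothetical), paths are represented by their values on a grid ending at t=1, the operator $g \mapsto g (t) + (ρ^{2} / {\bar{ρ}}^{2}) g (1) t$ is applied to the candidate pre-image $g (t) = h (t) - ρ^{2} h (1) t$ , and the result is compared with h.

```python
import math


def apply_DH(g, ts, rho):
    """(DH(0,0) g)(t) = g(t) + rho^2 / (1 - rho^2) * g(1) * t on a grid with ts[-1] == 1."""
    ratio = rho ** 2 / (1.0 - rho ** 2)
    return [gt + ratio * g[-1] * t for gt, t in zip(g, ts)]


def invert_DH(h, ts, rho):
    """Pre-image from the proof: g(t) = h(t) - rho^2 * h(1) * t."""
    return [ht - rho ** 2 * h[-1] * t for ht, t in zip(h, ts)]


ts = [k / 10 for k in range(11)]
h = [math.sin(3.0 * t) + t * t for t in ts]   # any path with h(0) = 0
g = invert_DH(h, ts, rho=-0.7)
max_err = max(abs(a - b) for a, b in zip(apply_DH(g, ts, rho=-0.7), h))
print(max_err)
```

The reported error is at the level of floating-point rounding, in line with the algebra in the proof; note that the construction needs $|ρ| < 1$ so that ${\bar{ρ}}^{2} > 0$ .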

For existence of the minimizer, note that $D^{2} I_{0} (0) \cdot (g, g) = \frac{ρ^{2}}{{\bar{ρ}}^{2}} g (1)^{2} + {∥g∥}_{H_{0}^{1}}^{2},$ which is again positive definite.

Remark 5.11

Though only formulated in terms of ‘smoothness’, it is easy to show that $σ \in C^{k}$ implies that $I \in C^{k - 1}$ (locally at 0).

5.2. Energy expansion

Having established smoothness of the energy I as well as of the minimizing configuration $x \mapsto f^{x}$ locally around x=0, we can proceed with computing the Taylor expansion of $f^{x}$ around x=0. We will once more rely on the first order optimality condition given in Proposition 5.2. Plugging the Taylor expansion of $f^{x}$ into $I_{x}$ will then give us the local Taylor expansion of $I (x)$ .

5.2.1. Expansion of the minimizing configuration

Theorem 5.12

We have $\begin{aligned} f_{t}^{x} & = α_{t} x + β_{t} \frac{x^{2}}{2} + O (x^{3}), \\ α_{t} & = \frac{ρ}{σ_{0}} t, \\ β_{t} & = 2 \frac{σ_{0}^{'}}{σ_{0}^{3}} [ρ^{2} ⟨K 1, 1_{[0, t]}⟩ + ⟨K 1_{[0, t]}, 1⟩ - 3 ρ^{2} t ⟨K 1, 1⟩] . \end{aligned}$

Remark 5.13

Non-Markovian transversality

In the RL-fBM case, $K (t, s) = \sqrt{2 H} | t - s |^{γ}$ with $γ = H - 1 / 2$ , one computes $⟨1, K 1_{[0, t]}⟩ = \frac{\sqrt{2 H}}{(1 + γ) (2 + γ)} \{1 - {(1 - t)}^{2 + γ}\} \in C^{1} [0, 1] .$ Interestingly, the transversality condition known from the Markovian setting ( $q_{1} = 0$ , which readily translates to ${\dot{f}}_{1}^{x} = 0$ there) remains valid here (for $ρ = 0$ ), at least to order $x^{2}$ , in the sense that ${\dot{f}}_{t}^{x} \approx {\dot{β}}_{t} \frac{x^{2}}{2} = (const) {(1 - t)}^{1 + γ} x^{2},$ which vanishes at t=1.

Proof of Theorem 5.12

First order expansion:

Up to the order needed in order to get the first order term, we have $\begin{aligned} f_{t}^{x} & = α_{t} x + O (x^{2}), \\ {\dot{f}}_{t}^{x} & = {\dot{α}}_{t} x + O (x^{2}), \\ σ (K {\dot{f}}^{x}) & = σ_{0} + σ_{0}^{'} K \dot{α} x + O (x^{2}), \\ σ^{'} (K {\dot{f}}^{x}) & = σ_{0}^{'} + σ_{0}^{″} K \dot{α} x + O (x^{2}), \\ F (f^{x}) & = ⟨ σ^{2} (K {\dot{f}}^{x}), 1 ⟩ = σ_{0}^{2} + O (x), \\ G (f^{x}) & = ⟨ σ (K {\dot{f}}^{x}), {\dot{f}}^{x} ⟩ = ⟨σ_{0}, \dot{α}⟩ x + O (x^{2}) . \end{aligned}$ Therefore, $\begin{aligned} ⟨ σ (K {\dot{f}}^{x}), 1_{[0, t]} ⟩ & = σ_{0} t + O (x), \\ ⟨ σ^{'} (K {\dot{f}}^{x}) {\dot{f}}^{x}, K 1_{[0, t]} ⟩ & = O (x), \\ ⟨ σ σ^{'} (K {\dot{f}}^{x}), K 1_{[0, t]} ⟩ & = O (1), \\ x - ρ G (f^{x}) & = (1 - ρ σ_{0} α_{1}) x + O (x^{2}), \\ (x - ρ G (f^{x}))^{2} & = O (x^{2}) . \end{aligned}$ This yields for the first order term in (25) $α_{t} = \frac{ρ (1 - ρ σ_{0} α_{1})}{{\bar{ρ}}^{2} σ_{0}} t .$ Setting t=1, we get $α_{1} = \frac{ρ}{{\bar{ρ}}^{2} σ_{0}} - \frac{ρ^{2}}{{\bar{ρ}}^{2}} α_{1},$ which is solved by $α_{1} = ρ / σ_{0}$ . Inserting this term back into the equation for $α_{t}$ , we get $α_{t} = \frac{ρ}{σ_{0}} t .$ (31)

Second order expansion:

Using (31) and the ansatz $f_{t}^{x} = α_{t} x + \frac{1}{2} β_{t} x^{2} + O (x^{3})$ , we re-compute the relevant terms appearing in (25). We have $σ (K {\dot{f}}^{x} (s)) = σ_{0} + σ_{0}^{'} \frac{ρ}{σ_{0}} (K 1) (s) x + O (x^{2})$ and analogously for σ replaced by $σ^{'}$ , $σ σ^{'}$ . This implies $\begin{aligned} ⟨σ (K {\dot{f}}^{x}), 1_{[0, t]}⟩ & = σ_{0} t + σ_{0}^{'} \frac{ρ}{σ_{0}} ⟨K 1, 1_{[0, t]}⟩ x + O (x^{2}), \\ ⟨σ^{'} (K {\dot{f}}^{x}) {\dot{f}}^{x}, K 1_{[0, t]}⟩ & = ρ \frac{σ_{0}^{'}}{σ_{0}} ⟨K 1_{[0, t]}, 1⟩ x + O (x^{2}), \\ ⟨σ σ^{'} (K {\dot{f}}^{x}), K 1_{[0, t]}⟩ & = σ_{0} σ_{0}^{'} ⟨K 1_{[0, t]}, 1⟩ + O (x) . \end{aligned}$ Using the notation introduced earlier, we have $\begin{aligned} F (f^{x}) & = σ_{0}^{2} + 2 σ_{0}^{'} ρ ⟨K 1, 1⟩ x + O (x^{2}), \\ G (f^{x}) & = ρ x + \left(\frac{1}{2} σ_{0} β_{1} + ρ^{2} \frac{σ_{0}^{'}}{σ_{0}^{2}} ⟨K 1, 1⟩\right) x^{2} + O (x^{3}) . \end{aligned}$ This directly implies $\begin{aligned} x - ρ G (f^{x}) & = {\bar{ρ}}^{2} x - ρ \left(\frac{1}{2} σ_{0} β_{1} + ρ^{2} \frac{σ_{0}^{'}}{σ_{0}^{2}} ⟨K 1, 1⟩\right) x^{2} + O (x^{3}), \\ {(x - ρ G (f^{x}))}^{2} & = {\bar{ρ}}^{4} x^{2} - 2 {\bar{ρ}}^{2} ρ \left(\frac{1}{2} σ_{0} β_{1} + ρ^{2} \frac{σ_{0}^{'}}{σ_{0}^{2}} ⟨K 1, 1⟩\right) x^{3} + O (x^{4}) . \end{aligned}$ We next compute some auxiliary terms appearing in (25). Let $\begin{aligned} N_{1} := {} & ρ (x - ρ G (f^{x})) \left(⟨σ (K {\dot{f}}^{x}), 1_{[0, t]}⟩ + ⟨σ^{'} (K {\dot{f}}^{x}) {\dot{f}}^{x}, K 1_{[0, t]}⟩\right) \\ = {} & ρ {\bar{ρ}}^{2} σ_{0} t x + \left[ρ^{2} {\bar{ρ}}^{2} \frac{σ_{0}^{'}}{σ_{0}} (⟨K 1, 1_{[0, t]}⟩ + ⟨K 1_{[0, t]}, 1⟩) - ρ^{4} \frac{σ_{0}^{'}}{σ_{0}} t ⟨K 1, 1⟩ - \frac{1}{2} ρ^{2} σ_{0}^{2} t β_{1}\right] x^{2} + O (x^{3}) . \end{aligned}$ The corresponding denominator is ${\bar{ρ}}^{2} F (f^{x})$ . Using the formula $\frac{a_{1} x + a_{2} x^{2} + O (x^{3})}{b_{0} + b_{1} x + O (x^{2})} = \frac{a_{1}}{b_{0}} x + \frac{a_{2} b_{0} - a_{1} b_{1}}{b_{0}^{2}} x^{2} + O (x^{3}),$ we obtain $\begin{aligned} \frac{N_{1}}{{\bar{ρ}}^{2} F (f^{x})} = {} & \frac{ρ}{σ_{0}} t x + \left[ρ^{2} \frac{σ_{0}^{'}}{σ_{0}^{3}} (⟨K 1, 1_{[0, t]}⟩ + ⟨K 1_{[0, t]}, 1⟩) - \left(\frac{ρ^{4}}{{\bar{ρ}}^{2}} + 2 ρ^{2}\right) \frac{σ_{0}^{'}}{σ_{0}^{3}} t ⟨K 1, 1⟩ - \frac{1}{2} \frac{ρ^{2}}{{\bar{ρ}}^{2}} β_{1} t\right] x^{2} \\ & + O (x^{3}) . \end{aligned}$ (32) For the second term in (25), let $\begin{aligned} N_{2} := {} & {(x - ρ G (f^{x}))}^{2} ⟨(σ σ^{'}) (K {\dot{f}}^{x}), K 1_{[0, t]}⟩ \\ = {} & {\bar{ρ}}^{4} σ_{0} σ_{0}^{'} ⟨K 1_{[0, t]}, 1⟩ x^{2} + O (x^{3}) . \end{aligned}$ The corresponding denominator is ${\bar{ρ}}^{2} F (f^{x})^{2} = {\bar{ρ}}^{2} σ_{0}^{4} + O (x)$ . Hence, $\frac{N_{2}}{{\bar{ρ}}^{2} F (f^{x})^{2}} = {\bar{ρ}}^{2} \frac{σ_{0}^{'}}{σ_{0}^{3}} ⟨K 1_{[0, t]}, 1⟩ x^{2} + O (x^{3}) .$ (33) Combining (32) and (33), we get $\begin{aligned} f_{t}^{x} = {} & \frac{ρ}{σ_{0}} t x + \left[ρ^{2} \frac{σ_{0}^{'}}{σ_{0}^{3}} (⟨K 1, 1_{[0, t]}⟩ + ⟨K 1_{[0, t]}, 1⟩) - \frac{ρ^{4}}{{\bar{ρ}}^{2}} \frac{σ_{0}^{'}}{σ_{0}^{3}} t ⟨K 1, 1⟩ \right. \\ & \left. - \frac{1}{2} \frac{ρ^{2}}{{\bar{ρ}}^{2}} β_{1} t - 2 ρ^{2} \frac{σ_{0}^{'}}{σ_{0}^{3}} t ⟨K 1, 1⟩ + {\bar{ρ}}^{2} \frac{σ_{0}^{'}}{σ_{0}^{3}} ⟨K 1_{[0, t]}, 1⟩\right] x^{2} + O (x^{3}) . \end{aligned}$ We shall next compute $β_{1}$ . Taking the second order terms on both sides and letting t=1, we obtain $\begin{aligned} \frac{1}{2} β_{1} = {} & ρ^{2} \frac{σ_{0}^{'}}{σ_{0}^{3}} 2 ⟨K 1, 1⟩ - \frac{ρ^{4}}{{\bar{ρ}}^{2}} \frac{σ_{0}^{'}}{σ_{0}^{3}} ⟨K 1, 1⟩ \\ & - \frac{1}{2} \frac{ρ^{2}}{{\bar{ρ}}^{2}} β_{1} - 2 ρ^{2} \frac{σ_{0}^{'}}{σ_{0}^{3}} ⟨K 1, 1⟩ + {\bar{ρ}}^{2} \frac{σ_{0}^{'}}{σ_{0}^{3}} ⟨K 1, 1⟩ . \end{aligned}$ Moving $β_{1}$ to the other side with $1 + ρ^{2} / {\bar{ρ}}^{2} = 1 / {\bar{ρ}}^{2}$ and collecting terms on the right hand side, we arrive at $\begin{aligned} \frac{1}{2} \frac{1}{{\bar{ρ}}^{2}} β_{1} & = \frac{σ_{0}^{'}}{σ_{0}^{3}} ⟨K 1, 1⟩ \left(2 ρ^{2} - \frac{ρ^{4}}{{\bar{ρ}}^{2}} - 2 ρ^{2} + {\bar{ρ}}^{2}\right) \\ & = \frac{1 - 2 ρ^{2}}{{\bar{ρ}}^{2}} \frac{σ_{0}^{'}}{σ_{0}^{3}} ⟨K 1, 1⟩ . \end{aligned}$ We conclude that $β_{1} = 2 (1 - 2 ρ^{2}) \frac{σ_{0}^{'}}{σ_{0}^{3}} ⟨K 1, 1⟩ .$ Substituting this value of $β_{1}$ back into the second order coefficient of $f_{t}^{x}$ , we obtain $β_{t} = 2 \frac{σ_{0}^{'}}{σ_{0}^{3}} [ρ^{2} ⟨K 1, 1_{[0, t]}⟩ + ⟨K 1_{[0, t]}, 1⟩ - 3 ρ^{2} t ⟨K 1, 1⟩] .$
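For the reader's convenience, the final substitution can be spelled out as follows. The abbreviations $c := σ_{0}^{'} / σ_{0}^{3}$ , $A_{t} := ⟨K 1, 1_{[0, t]}⟩$ , $B_{t} := ⟨K 1_{[0, t]}, 1⟩$ and $κ := ⟨K 1, 1⟩$ are introduced here only for brevity and are not used elsewhere. With $β_{1} = 2 (1 - 2 ρ^{2}) c κ$ , the second order coefficient of $f_{t}^{x}$ reads $\begin{aligned} \frac{1}{2} β_{t} & = ρ^{2} c (A_{t} + B_{t}) + {\bar{ρ}}^{2} c B_{t} - \left(\frac{ρ^{4}}{{\bar{ρ}}^{2}} + \frac{ρ^{2} (1 - 2 ρ^{2})}{{\bar{ρ}}^{2}} + 2 ρ^{2}\right) c κ t \\ & = c \left[ρ^{2} A_{t} + (ρ^{2} + {\bar{ρ}}^{2}) B_{t}\right] - \left(\frac{ρ^{2} (1 - ρ^{2})}{{\bar{ρ}}^{2}} + 2 ρ^{2}\right) c κ t \\ & = c \left[ρ^{2} A_{t} + B_{t} - 3 ρ^{2} κ t\right], \end{aligned}$ using $ρ^{2} + {\bar{ρ}}^{2} = 1$ twice. As a consistency check, at t=1 one has $A_{1} = B_{1} = κ$ , so that $β_{1} = 2 c (1 - 2 ρ^{2}) κ$ , in agreement with the value computed above.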

5.2.2. Energy expansion in the general case

Now we compute the Taylor expansion of $I (x)$ as defined in Proposition 5.1. We start with the second term. Plugging in the optimal path $f_{t}^{x} = α_{t} x + \frac{1}{2} β_{t} x^{2} + O (x^{3})$ (and using $⟨\dot{β}, 1⟩ = β_{1}$ as $β_{0} = 0$ ) we obtain $\frac{1}{2} ⟨{\dot{f}}^{x}, {\dot{f}}^{x}⟩ = \frac{1}{2} \frac{ρ^{2}}{σ_{0}^{2}} x^{2} + \frac{1}{2} \frac{ρ}{σ_{0}} β_{1} x^{3} + O (x^{4}) .$ Inserting $β_{1} = 2 (1 - 2 ρ^{2}) (σ_{0}^{'} / σ_{0}^{3}) ⟨K 1, 1⟩$ into the above formula for $(x - ρ G (f^{x}))^{2}$ , we get ${(x - ρ G (f^{x}))}^{2} = {\bar{ρ}}^{4} x^{2} - 2 {\bar{ρ}}^{4} ρ \frac{σ_{0}^{'}}{σ_{0}^{2}} ⟨K 1, 1⟩ x^{3} + O (x^{4}) .$ Recall the denominator $2 {\bar{ρ}}^{2} F (f^{x}) = 2 {\bar{ρ}}^{2} σ_{0}^{2} + 4 {\bar{ρ}}^{2} σ_{0}^{'} ρ ⟨K 1, 1⟩ x + O (x^{2}) .$ Using the expansion of a fraction $\frac{a_{2} x^{2} + a_{3} x^{3} + O (x^{4})}{b_{0} + b_{1} x + O (x^{2})} = \frac{a_{2}}{b_{0}} x^{2} + \frac{a_{3} b_{0} - a_{2} b_{1}}{b_{0}^{2}} x^{3} + O (x^{4}),$ we obtain $\begin{aligned} \frac{{(x - ρ G (f^{x}))}^{2}}{2 {\bar{ρ}}^{2} F (f^{x})} = {} & \frac{{\bar{ρ}}^{4}}{2 {\bar{ρ}}^{2} σ_{0}^{2}} x^{2} \\ & + \frac{(- 2 {\bar{ρ}}^{4} ρ \frac{σ_{0}^{'}}{σ_{0}^{2}} ⟨K 1, 1⟩) 2 {\bar{ρ}}^{2} σ_{0}^{2} - {\bar{ρ}}^{4} (4 {\bar{ρ}}^{2} σ_{0}^{'} ρ ⟨K 1, 1⟩)}{4 {\bar{ρ}}^{4} σ_{0}^{4}} x^{3} + O (x^{4}) \\ = {} & \frac{{\bar{ρ}}^{2}}{2 σ_{0}^{2}} x^{2} - 2 {\bar{ρ}}^{2} ρ \frac{σ_{0}^{'}}{σ_{0}^{4}} ⟨K 1, 1⟩ x^{3} + O (x^{4}) . \end{aligned}$ We note that $\begin{aligned} \frac{1}{2} \frac{ρ}{σ_{0}} β_{1} - 2 {\bar{ρ}}^{2} ρ \frac{σ_{0}^{'}}{σ_{0}^{4}} ⟨K 1, 1⟩ & = ((1 - 2 ρ^{2}) - 2 (1 - ρ^{2})) ρ \frac{σ_{0}^{'}}{σ_{0}^{4}} ⟨K 1, 1⟩ \\ & = - ρ \frac{σ_{0}^{'}}{σ_{0}^{4}} ⟨K 1, 1⟩ . \end{aligned}$ Adding both terms, we arrive at the following expansion.

Proposition 5.14

The energy expansion to third order gives $I (x) = \frac{1}{2 σ_{0}^{2}} x^{2} - ρ \frac{σ_{0}^{'}}{σ_{0}^{4}} ⟨K 1, 1⟩ x^{3} + O (x^{4}) .$
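As a sanity check on the bookkeeping behind Proposition 5.14, the following Python sketch recombines the intermediate coefficients of Section 5.2.2 in exact rational arithmetic. The function name `energy_coefficients` and the sample parameter values are illustrative assumptions, not part of the paper; the argument `K11` stands for the number $⟨K 1, 1⟩$ and `dsigma0` for $σ_{0}^{'}$ .

```python
from fractions import Fraction


def energy_coefficients(rho, sigma0, dsigma0, K11):
    """Return (c2, c3) such that I(x) = c2*x^2 + c3*x^3 + O(x^4).

    The intermediate quantities are copied from the derivation in the text;
    this function only recombines them.
    """
    rho_bar2 = 1 - rho**2  # \bar{rho}^2
    beta1 = 2 * (1 - 2 * rho**2) * dsigma0 / sigma0**3 * K11

    # (1/2) <f', f'> = e2 x^2 + e3 x^3 + O(x^4)
    e2 = Fraction(1, 2) * rho**2 / sigma0**2
    e3 = Fraction(1, 2) * rho / sigma0 * beta1

    # numerator (x - rho G)^2 = a2 x^2 + a3 x^3 + O(x^4)
    a2 = rho_bar2**2
    a3 = -2 * rho_bar2**2 * rho * dsigma0 / sigma0**2 * K11
    # denominator 2 \bar{rho}^2 F = b0 + b1 x + O(x^2)
    b0 = 2 * rho_bar2 * sigma0**2
    b1 = 4 * rho_bar2 * dsigma0 * rho * K11

    # expansion of the fraction, as stated in the text
    q2 = a2 / b0
    q3 = (a3 * b0 - a2 * b1) / b0**2
    return e2 + q2, e3 + q3


# Hypothetical sample values, chosen only for illustration.
rho, sigma0, dsigma0, K11 = Fraction(-7, 10), Fraction(1, 5), Fraction(3, 10), Fraction(4, 5)
c2, c3 = energy_coefficients(rho, sigma0, dsigma0, K11)
print(c2 == 1 / (2 * sigma0**2), c3 == -rho * dsigma0 / sigma0**4 * K11)
```

Working with `Fraction` avoids floating-point noise, so the two comparisons test the algebraic identities themselves rather than a numerical approximation of them.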

5.2.3. Energy expansion for the Riemann-Liouville kernel

Let us specialize the energy expansion given in Proposition 5.14 to the Riemann-Liouville fBm. Set $γ = H - \frac{1}{2}$ and recall that the kernel K takes the form $K (t, s) = (t - s)^{γ}$ . We get $(K 1) (t) = \int_{0}^{t} K (t, s) d s = \int_{0}^{t} (t - s)^{γ} d s = \frac{t^{1 + γ}}{1 + γ} .$ The key term $⟨K 1, 1⟩$ appearing in the energy expansion now gives $\begin{aligned} ⟨K 1, 1⟩ = \int_{0}^{1} (K 1) (t) d t = \int_{0}^{1} \frac{t^{1 + γ}}{1 + γ} d t = \frac{1}{(1 + γ) (2 + γ)} \\ = \frac{1}{(H + 1 / 2) (H + 3 / 2)} . \end{aligned}$ Plugging the above formula into the energy expansion, we obtain the energy expansion for the Riemann-Liouville fractional Brownian motion $I (x) = \frac{1}{2 σ_{0}^{2}} x^{2} - \frac{ρ}{(H + 1 / 2) (H + 3 / 2)} \frac{σ_{0}^{'}}{σ_{0}^{4}} x^{3} + O (x^{4}) .$ For completeness, let us also fully describe the time-dependence of the second order term $β_{t}$ in the expansion of the optimal trajectory $f_{t}^{x}$ . Unlike for the first order term, we no longer have a linear movement here. Indeed $\begin{aligned} ⟨K 1, 1_{[0, t]}⟩ = \int_{0}^{t} (K 1) (s) d s = \int_{0}^{t} \frac{s^{1 + γ}}{1 + γ} d s \\ = \frac{t^{2 + γ}}{(1 + γ) (2 + γ)}, \end{aligned}$ (34) $\begin{aligned} ⟨K 1_{[0, t]}, 1⟩ & = \frac{1}{(1 + γ) (2 + γ)} (1 - (1 - t)^{2 + γ}) . \end{aligned}$ (35)
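The closed forms for $⟨K 1, 1⟩$ , (34) and (35) can be cross-checked numerically. The sketch below is a minimal illustration in plain Python: the helper names, the midpoint rule, and the sample values of H and t are assumptions made here. The inner integral $(K 1_{[0, t]}) (s) = \int_{0}^{\min (s, t)} (s - r)^{γ} d r$ is evaluated in closed form, and only the outer integral is approximated.

```python
def K_applied_to_indicator(s, t, gamma):
    """(K 1_[0,t])(s) for the kernel K(s, r) = (s - r)^gamma, in closed form."""
    lower = max(s - t, 0.0)  # equals s - min(s, t)
    return (s ** (1.0 + gamma) - lower ** (1.0 + gamma)) / (1.0 + gamma)


def midpoint_rule(fun, a, b, n=20000):
    """Composite midpoint rule for the integral of fun over [a, b]."""
    h = (b - a) / n
    return h * sum(fun(a + (i + 0.5) * h) for i in range(n))


def numerical_pairings(H, t):
    """Approximate <K1,1>, <K1,1_[0,t]> and <K1_[0,t],1> by quadrature."""
    gamma = H - 0.5
    K1 = lambda s: K_applied_to_indicator(s, 1.0, gamma)  # (K 1)(s) on [0, 1]
    return (
        midpoint_rule(K1, 0.0, 1.0),
        midpoint_rule(K1, 0.0, t),
        midpoint_rule(lambda s: K_applied_to_indicator(s, t, gamma), 0.0, 1.0),
    )


def closed_form_pairings(H, t):
    """The formulas stated in the text, including (34) and (35)."""
    gamma = H - 0.5
    c = 1.0 / ((1.0 + gamma) * (2.0 + gamma))
    return c, c * t ** (2.0 + gamma), c * (1.0 - (1.0 - t) ** (2.0 + gamma))


# Hypothetical sample values H = 0.1, t = 0.3.
for num, exact in zip(numerical_pairings(0.1, 0.3), closed_form_pairings(0.1, 0.3)):
    print(f"{num:.6f}  {exact:.6f}")
```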

6. Proof of the pricing formula

Fix $x \geq 0$ and $\hat{x} = (ε / \hat{ε}) x$ where $ε = t^{1 / 2}$ and $\hat{ε} = t^{H} = ε^{2 H}$ . We have $\begin{aligned} c (\hat{x}, t) & = E {(\exp (X_{t}) - \exp \hat{x})}^{+} \\ = E {(\exp (X_{1}^{ε}) - \exp \hat{x})}^{+} \\ = E {(\exp (\frac{ε}{\hat{ε}} {\hat{X}}_{1}^{ε}) - \exp (\frac{ε}{\hat{ε}} x))}^{+} \end{aligned}$ where we recall $\begin{aligned} {\hat{X}}_{1}^{ε} \equiv \frac{\hat{ε}}{ε} X_{1}^{ε} = \int_{0}^{1} σ (\hat{ε} \hat{B}) \hat{ε} d (\bar{ρ} W + ρ B) \\ - \frac{1}{2} ε \hat{ε} \int_{0}^{1} σ {(\hat{ε} {\hat{B}}_{t})}^{2} d t . \end{aligned}$ Consider a Cameron-Martin perturbation of ${\hat{X}}_{1}^{ε}$ . That is, for a Cameron-Martin path $(h, f) \in H_{0}^{1} \times H_{0}^{1}$ , consider the measure change corresponding to the transformation $\hat{ε} (W, B) ⇝ \hat{ε} (W, B) + (h, f)$ (transforming the Brownian motions to Brownian motions with drift). This yields the Girsanov density $\begin{aligned} G_{ε} = \exp (- \frac{1}{\hat{ε}} \int_{0}^{1} {\dot{h}}_{s} d W_{s} - \frac{1}{\hat{ε}} \int_{0}^{1} {\dot{f}}_{s} d B_{s} \\ - \frac{1}{2 {\hat{ε}}^{2}} \int_{0}^{1} ({\dot{h}}_{s}^{2} + {\dot{f}}_{s}^{2}) d s) . \end{aligned}$ (36) Under the new measure, ${\hat{X}}_{1}^{ε}$ becomes ${\hat{Z}}_{1}^{ε}$ , where $\begin{aligned} {\hat{Z}}_{1}^{ε} = \int_{0}^{1} σ (\hat{ε} {\hat{B}}_{t} + {\hat{f}}_{t}) [\hat{ε} d (\bar{ρ} W_{t} + ρ B_{t}) + d (\bar{ρ} h_{t} + ρ f_{t})] \\ - \frac{1}{2} ε \hat{ε} \int_{0}^{1} σ (\hat{ε} {\hat{B}}_{t} + {\hat{f}}_{t})^{2} d t . \end{aligned}$

Definition 6.1

For fixed $x \geq 0$ , write $(h, f) \in K^{x}$ if $Φ_{1} (h, f, \hat{f}) = x$ . Call such $(h, f)$ admissible for arrival at log-strike x. Call $(h^{x}, f^{x})$ the cheapest admissible control, which attains $I (x) = inf_{h, f \in H_{0}^{1}} \{\frac{1}{2} \int_{0}^{1} {\dot{h}}^{2} d t + \frac{1}{2} \int_{0}^{1} {\dot{f}}^{2} d t : Φ_{1} (h, f, \hat{f}) = x\},$ where we recall that $\hat{f} = K \dot{f}$ and $Φ_{1} (h, f, \hat{f}) = \int_{0}^{1} σ (\hat{f}) d (\bar{ρ} h + ρ f) .$

For any Cameron-Martin path $(h, f)$ , the perturbed random variable ${\hat{Z}}_{1}^{ε}$ admits a stochastic Taylor expansion with respect to $\hat{ε}$ .

Lemma 6.2

Fix $(h, f) \in K^{x}$ and define ${\hat{Z}}_{1}^{ε}$ accordingly. Then ${\hat{Z}}_{1}^{ε} = x + \hat{ε} g_{1} + {\hat{ε}}^{2} R_{2}^{ε},$ (37) where $g_{1}$ is a Gaussian random variable, given explicitly by $g_{1} = \int_{0}^{1} \{σ ({\hat{f}}_{t}) d (\bar{ρ} W_{t} + ρ B_{t}) + σ^{'} ({\hat{f}}_{t}) {\hat{B}}_{t} d (\bar{ρ} h_{t} + ρ f_{t})\},$ (38) and $\begin{aligned} R_{2}^{ε} & = \int_{0}^{1} σ^{'} ({\hat{f}}_{t}) {\hat{B}}_{t} d (\bar{ρ} W_{t} + ρ B_{t}) - \frac{1}{2} \frac{ε}{\hat{ε}} \int_{0}^{1} σ (\hat{ε} {\hat{B}}_{t} + {\hat{f}}_{t})^{2} d t \\ + \frac{1}{2 {\hat{ε}}^{2}} \int_{0}^{\hat{ε}} \int_{0}^{1} σ^{″} (ζ {\hat{B}}_{t} + {\hat{f}}_{t}) {\hat{B}}_{t}^{2} \\ \times [\hat{ε} d (\bar{ρ} W_{t} + ρ B_{t}) + d (\bar{ρ} h_{t} + ρ f_{t})] (\hat{ε} - ζ) d ζ . \end{aligned}$ (39)

Proof.

By a stochastic Taylor expansion for the controlled process ${\hat{Z}}_{t}^{ε}$ with control $(h, f) \in K^{x}$ as in Definition 6.1 and thanks to $σ \in C^{2}$ , we have at $t = 1$ $\begin{aligned} {\hat{Z}}_{1}^{ε} & = \int_{0}^{1} σ (\hat{ε} \hat{B} + \hat{f}) [\hat{ε} d (\bar{ρ} W + ρ B) + d (\bar{ρ} h + ρ f)] \\ - \frac{1}{2} ε \hat{ε} \int_{0}^{1} σ (\hat{ε} {\hat{B}}_{t} + {\hat{f}}_{t})^{2} d t \\ = \int_{0}^{1} σ (\hat{f}) d (\bar{ρ} h + ρ f) + \hat{ε} \int_{0}^{1} \{σ (\hat{f}) d (\bar{ρ} W + ρ B) \\ + σ^{'} (\hat{f}) \hat{B} d (\bar{ρ} h + ρ f)\} \\ + {\hat{ε}}^{2} \int_{0}^{1} σ^{'} ({\hat{f}}_{t}) {\hat{B}}_{t} d (\bar{ρ} W_{t} + ρ B_{t}) \\ - \frac{1}{2} ε \hat{ε} \int_{0}^{1} σ (\hat{ε} {\hat{B}}_{t} + {\hat{f}}_{t})^{2} d t \\ + \frac{1}{2} \int_{0}^{\hat{ε}} \int_{0}^{1} σ^{″} (ζ {\hat{B}}_{t} + {\hat{f}}_{t}) {\hat{B}}_{t}^{2} \\ \times [\hat{ε} d (\bar{ρ} W_{t} + ρ B_{t}) + d (\bar{ρ} h_{t} + ρ f_{t})] (\hat{ε} - ζ) d ζ . \end{aligned}$ Collecting terms in powers of $\hat{ε}$ , with the random variable $g_{1}$ as in (38) (and recalling that $\hat{ε} ε \in O ({\hat{ε}}^{2})$ ), we have ${\hat{Z}}_{1}^{ε} = \int_{0}^{1} σ (\hat{f}) d (\bar{ρ} h + ρ f) + \hat{ε} g_{1} + O ({\hat{ε}}^{2}) .$ Furthermore, since $(h, f) \in K^{x}$ , by the definition of $Φ_{1}$ it holds that $\int_{0}^{1} σ (\hat{f}) d (\bar{ρ} h + ρ f) = x .$ This proves the statement (37), and the statement that $g_{1}$ is Gaussian is immediate from the form (38).

Finally, we determine an explicit form of the Girsanov density $G_{ε}$ in (36) when $(h, f) = (h^{x}, f^{x})$ is chosen as the cheapest admissible control (cf. Definition 6.1). Similarly to classical works of Azencott, Ben Arous and others, see, for instance, Ben Arous (Citation1988), we show that the stochastic integrals in the exponent of $G_{ε}$ are proportional to the first order term $g_{1}$ (with factor $I^{'} (x)$ ) when evaluated at the minimizing configuration $(h^{x}, f^{x})$ .

Lemma 6.3

We have $\int_{0}^{1} {\dot{h}}_{t}^{x} d W_{t} + \int_{0}^{1} {\dot{f}}_{t}^{x} d B_{t} = I^{'} (x) g_{1} .$

Proof.

See Lemma A.2.
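Lemma 6.3 can be verified by hand in the simplest special case of constant volatility, $σ \equiv σ_{0}$ , which is an assumption made only for this illustration. In that case $Φ_{1} (h, f, \hat{f}) = σ_{0} (\bar{ρ} h_{1} + ρ f_{1})$ , the cheapest admissible control has constant slopes ${\dot{h}}^{x} = \bar{ρ} x / σ_{0}$ and ${\dot{f}}^{x} = ρ x / σ_{0}$ (by Cauchy–Schwarz), so that $I (x) = x^{2} / (2 σ_{0}^{2})$ , and (38) reduces to $g_{1} = σ_{0} (\bar{ρ} W_{1} + ρ B_{1})$ since $σ^{'} \equiv 0$ . The Python sketch below, with hypothetical function and variable names, checks the identity of the lemma pathwise for random draws of $(W_{1}, B_{1})$ .

```python
import math
import random


def lemma_6_3_sides(x, sigma0, rho, W1, B1):
    """Both sides of Lemma 6.3 under the assumption sigma == sigma0 (constant)."""
    rho_bar = math.sqrt(1.0 - rho**2)
    h_dot = rho_bar * x / sigma0  # constant slope of h^x
    f_dot = rho * x / sigma0      # constant slope of f^x
    # admissibility: Phi_1 = sigma0 * (rho_bar * h_1 + rho * f_1) must equal x
    assert math.isclose(sigma0 * (rho_bar * h_dot + rho * f_dot), x)
    lhs = h_dot * W1 + f_dot * B1               # int h' dW + int f' dB
    g1 = sigma0 * (rho_bar * W1 + rho * B1)     # (38) with sigma' = 0
    I_prime = x / sigma0**2                     # derivative of x^2 / (2 sigma0^2)
    return lhs, I_prime * g1


rng = random.Random(0)
for _ in range(3):
    W1, B1 = rng.gauss(0.0, 1.0), rng.gauss(0.0, 1.0)
    print(lemma_6_3_sides(0.4, 0.2, -0.7, W1, B1))
```

For non-constant σ the minimizer is no longer linear and the identity requires the first-order optimality condition, which is what Lemma A.2 provides.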

With these preparations in place, we are now ready to prove the pricing formula from Section 3.

Proof of Theorem 3.2

With a Girsanov factor (all integrals on $[0, 1]$ ) $G_{ε} = e^{- 1 / \hat{ε} \int \dot{h} d W - \frac{1}{\hat{ε}} \int \dot{f} d B - \frac{1}{2 {\hat{ε}}^{2}} \int ({\dot{h}}^{2} + {\dot{f}}^{2}) d t}$ and (evaluated at the minimizer) $G_{ε} |_{*} = e^{- I (x) / {\hat{ε}}^{2}} e^{- I^{'} (x) g_{1} (ω) / \hat{ε}},$ we have, setting ${\hat{U}}^{ε} := {\hat{Z}}_{1}^{ε} - x = \hat{ε} g_{1} + {\hat{ε}}^{2} R_{2}^{ε}$ $\begin{aligned} c (\hat{x}, t) & = E [{(\exp (\frac{ε}{\hat{ε}} {\hat{Z}}_{1}^{ε}) - \exp (\frac{ε}{\hat{ε}} x))}^{+} G_{ε} |_{*}] \\ = e^{ε / \hat{ε} x} E [{(\exp (\frac{ε}{\hat{ε}} {\hat{U}}^{ε}) - 1)}^{+} G_{ε} |_{*}] \\ = e^{- I (x) / {\hat{ε}}^{2}} e^{ε / \hat{ε} x} E [{(\exp (\frac{ε}{\hat{ε}} {\hat{U}}^{ε}) - 1)}^{+} e^{- I^{'} (x) g_{1} / \hat{ε}}] \\ = e^{- I (x) / {\hat{ε}}^{2}} e^{ε / \hat{ε} x} E [(\exp (\frac{ε}{\hat{ε}} {\hat{U}}^{ε}) - 1) e^{- (I^{'} (x) / {\hat{ε}}^{2}) {\hat{U}}^{ε}} \\ \times e^{I^{'} (x) R_{2}^{ε}} 1_{{\hat{U}}^{ε} \geq 0}] . \\ = e^{- I (x) / {\hat{ε}}^{2}} e^{ε / \hat{ε} x} J (ε, x) . \end{aligned}$

7. Proof of the moderate deviation expansions

In Section 2, we pointed out that (iiic) is exactly what one gets from (call price) large deviations (8), if heuristically applied to $x ε^{2 β}$ . We now give a proper derivation based on moderate deviations.

Lemma 7.1

Assume (iiia-b) from Assumption 2.4. Then an upper moderate deviation estimate holds both for calls and digital calls. That is, we have

For every $β \in (0, H),$ and every fixed $x > 0,$ and ${\hat{x}}_{ε} := x ε^{1 - 2 H + 2 β},$ $E [(e^{X_{1}^{ε}} - e^{{\hat{x}}_{ε}})^{+}] \leq \exp (- \frac{x^{2} + o (1)}{2 σ_{0}^{2} ε^{4 H - 4 β}})$

and also $P [X_{1}^{ε} > {\hat{x}}_{ε}] \leq \exp (- \frac{x^{2} + o (1)}{2 σ_{0}^{2} ε^{4 H - 4 β}}) .$ (40)

Proof.

Recall that $σ (\cdot)$ is smooth but possibly unbounded, and recall ${\hat{x}}_{ε} := x ε^{1 - 2 H + 2 β}$ . In the case $β = 0$ and $H = 1 / 2$ , a large deviation principle (LDP) for $(X_{1}^{ε} \hat{ε} / ε)$ is readily reduced, via exponential equivalence, to an LDP for the family of stochastic Itô integrals given by $\int σ (\hat{ε} \hat{B}) \hat{ε} d Z$ for some Brownian motion Z, ρ-correlated with B. There are then many ways to establish an LDP for this family. A particularly convenient one, which requires no growth restriction on σ, uses continuity of stochastic integration with respect to the rough path $(B, Z, \int B d Z) = (B, Z, \int \hat{B} d Z)$ in suitable metrics, for which an LDP is known (Friz and Hairer Citation2014, Ch 9.3). It was pointed out in Bayer et al. (Citation2017) that a similar reasoning is possible when $H < 1 / 2$ : the rough path is then replaced by a ‘richer enhancement’ of $(B, Z)$ , the precise size of which depends on H, for which again one has an LDP. A moderate deviation principle (MDP) for $(X_{1}^{ε} \hat{ε} / ε)$ is an LDP for $(ε^{- 2 β} X_{1}^{ε} \hat{ε} / ε)$ for $β \in (0, H)$ . This can be reduced to an LDP, with $\bar{ε} := ε^{- 2 β} \hat{ε} = ε^{2 H - 2 β}$ , for $ε^{- 2 β} \int_{0}^{1} σ (\hat{ε} \hat{B}) \hat{ε} d Z = \int_{0}^{1} σ (\hat{ε} \hat{B}) \bar{ε} d Z \equiv \int_{0}^{1} σ_{ε} (\bar{ε} \hat{B}) \bar{ε} d Z$ with speed ${\bar{ε}}^{2}$ . Since $σ_{ε} (\cdot) \equiv σ (ε^{2 β} \cdot)$ converges (with all derivatives) locally uniformly to the constant function $σ_{0}$ , one checks that the above is exponentially equivalent to the (Gaussian) family given by $σ_{0} \bar{ε} Z_{1}$ , with law $N (0, σ_{0}^{2} {\bar{ε}}^{2}) = N (0, σ_{0}^{2} ε^{4 H - 4 β})$ , which gives (40), even with equality. (By localization, exponential equivalence can again be done for σ without growth restrictions.)

We have not yet used either of the assumptions (iiia-b). These become important in order to extend estimate (40) to the case of genuine call payoffs. We can follow here a well-known argument (e.g. Forde and Jacquier Citation2009; Pham Citation2010; Forde and Zhang Citation2017) with the ‘moderate’ caveat to carry along a factor $ε^{2 β}$ . In fact, this follows precisely the argument of Forde and Zhang (Citation2017) where the authors carry along a factor $\hat{ε} / ε = ε^{2 H - 1}$ . (This provides a unified view on rough and moderate deviations.) The remaining details then follow essentially ‘Appendix C. Proof of Corollary 4.13., part (ii) upper bound’ of Forde and Zhang (Citation2017), noting perhaps that the authors use their assumptions to show validity of what we simply assumed as condition (iiib), and also that one works with the quadratic rate function $\frac{1}{2} I^{″} (0) x^{2} = x^{2} / (2 σ_{0}^{2})$ throughout.
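The Gaussian family $σ_{0} \bar{ε} Z_{1}$ appearing in the proof makes the moderate deviation rate easy to observe numerically. The following Python sketch is an illustration only, with hypothetical function names and sample values: it evaluates ${\bar{ε}}^{2} \log P [σ_{0} \bar{ε} Z_{1} > x]$ exactly via the complementary error function and compares it with the limit $- x^{2} / (2 σ_{0}^{2})$ .

```python
import math


def scaled_log_tail(x, sigma0, eps_bar):
    """eps_bar^2 * log P[sigma0 * eps_bar * Z > x] for a standard normal Z."""
    tail = 0.5 * math.erfc(x / (sigma0 * eps_bar * math.sqrt(2.0)))
    return eps_bar**2 * math.log(tail)


def moderate_deviation_rate(x, sigma0):
    """Limit of scaled_log_tail as eps_bar -> 0, i.e. minus the quadratic rate function."""
    return -x**2 / (2.0 * sigma0**2)


# Hypothetical sample values x = 1, sigma0 = 1.
for eps_bar in (0.4, 0.2, 0.1, 0.05):
    print(eps_bar, scaled_log_tail(1.0, 1.0, eps_bar), moderate_deviation_rate(1.0, 1.0))
```

The gap to the limit shrinks like ${\bar{ε}}^{2} | \log \bar{ε} |$ , which is the $o (1)$ correction in the exponent of (40). For much smaller $\bar{ε}$ the tail probability underflows in double precision, so the sketch stops at $\bar{ε} = 0.05$ .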

Remark 7.2

By an easy argument similar to ‘Appendix C. Proof of Corollary 4.13., part (i) lower bound’ of Forde and Zhang (Citation2017) one sees that validity of the call price upper bound (iiic) implies the corresponding digital call price upper bound (40). For this reason, we only emphasized (iiic) but not (40) in Section 2.

In a classical work Azencott (Citation1982) (see also Azencott Citation1985; Ben Arous Citation1988, Théorème 2) obtained asymptotic expansions of functionals of Laplace type on Wiener space, of the type ‘ $E [\exp (- F (X^{ε}) / ε^{2})]$ ’, for small noise diffusions $X^{ε}$ . This refines the large deviation (equivalently: Laplace) principle of Freidlin–Wentzell for small noise diffusions. In a nutshell, for fixed $X_{0} = x$ , Azencott gets expansions of the form $e^{- c / ε^{2}} (α_{0} + α_{1} ε \dots)$ . His ideas (used by virtually all subsequent works in this direction) are a Girsanov transform, to make the minimizing path ‘typical’, followed by localization around the minimizer (justified by a good large deviation principle), and finally a local (stochastic Taylor) type analysis near the minimizer. None of these ingredients rely on the Markovian structure (or, relatedly, PDE arguments). As a consequence (and motivation for this work) such expansions were also obtained in the (non-Markovian) context of rough differential equations driven by fractional Brownian motion (Inahama Citation2013; Baudoin and Ouyang Citation2015) with $H < 1 / 2$ .

And yet, our situation is different in the sense that call price Wiener functionals do not fit the form studied by Azencott and others, nor can we in fact expect a similar expansion: Example 3.3 gives a Black-Scholes call price expansion of the form constant times $e^{- c / ε^{2}} (ε^{3} + \dots)$ . Azencott's ideas are nonetheless very relevant to us: we already used the Girsanov formula in Theorem 3.2 in order to have a tractable expression for J. It thus ‘only’ remains to carry out the localization and do some local analysis.

Proposition 7.3

Let $x > 0$ and $β \in (0, H)$ . Then the factor J is negligible in the sense that, for every $θ > 0,$ $ε^{θ} \log J (ε, x ε^{2 β}) \to 0 \quad \text{as } ε \to 0.$

Proof.

Step 1. Localization. Write $x_{ε} := x ε^{2 β}, {\hat{x}}_{ε} := x_{ε} ε^{1 - 2 H} = x ε^{1 - 2 H + 2 β}$ . By definition, $E [(e^{X_{1}^{ε}} - e^{{\hat{x}}_{ε}})^{+}] e^{I (x_{ε}) / {\hat{ε}}^{2}} e^{- {\hat{x}}_{ε}} = J (ε, x_{ε}) .$ Fix $x, δ > 0$ and write $δ_{ε} = δ ε^{2 β}$ . We claim that (the positive quantity) $J (ε, x_{ε}) - J_{δ_{ε}} (ε, x_{ε}) = e^{I (x_{ε}) / {\hat{ε}}^{2}} e^{- {\hat{x}}_{ε}} E [(e^{X_{1}^{ε}} - e^{{\hat{x}}_{ε}}) 1_{{\hat{X}}_{1}^{ε} > x_{ε} + δ_{ε}}]$ (41) is exponentially small, in the sense that, for some $c > 0$ and ${\bar{ε}}^{2} = ε^{4 H - 4 β}$ , $J (ε, x_{ε}) - J_{δ_{ε}} (ε, x_{ε}) = O (e^{- c / {\bar{ε}}^{2}}) .$ There is a battle here between the exploding factor $e^{I (x_{ε}) / {\hat{ε}}^{2}}$ , with exponent $\frac{I (x_{ε})}{{\hat{ε}}^{2}} \sim \frac{I^{″} (0) {(x_{ε})}^{2}}{2 {\hat{ε}}^{2}} = \frac{I^{″} (0) x^{2}}{2 ε^{4 H - 4 β}},$ and on the other hand $E [(e^{X_{1}^{ε}} - e^{{\hat{x}}_{ε}}) 1_{{\hat{X}}_{1}^{ε} > x_{ε} + δ_{ε}}] \leq \exp (- \frac{(x + δ)^{2} + o (1)}{2 σ_{0}^{2} ε^{4 H - 4 β}}) ,$ where the given estimate is an easy consequence of Lemma 7.1. Since $I^{″} (0) = 1 / σ_{0}^{2}$ , we see that the last factor ‘exponentially over-compensates’ the rest, so that the difference is indeed exponentially negligible.

Step 2. Upper bound. For any $x > 0$ , recall that ${\hat{U}}^{ε, x} = {\hat{U}}^{ε}$ decomposes into a Gaussian random variable $g_{1} = g_{1}^{x}$ and a remainder $R_{2}^{ε, x} = R_{2}^{ε}$ . In order to control this remainder without imposing a boundedness assumption on $σ (\cdot)$ , we crucially use a ‘localized remainder tail estimate’ as given in Proposition 7.4 below. We have, for any $ε \in (0, 1]$ , $\begin{aligned} J_{δ} (ε, x) & = E [e^{- (I^{'} (x) / {\hat{ε}}^{2}) {\hat{U}}^{ε}} (\exp (\frac{ε}{\hat{ε}} {\hat{U}}^{ε}) - 1) e^{I^{'} (x) R_{2}^{ε}} 1_{{\hat{U}}^{ε} \in [0, δ_{ε}]}] \\ \leq (e^{δ} - 1) E [e^{- I^{'} (x) / \hat{ε} g_{1}^{x}}; {\hat{U}}^{ε, x} \in [0, δ]] . \end{aligned}$ (42) To proceed, recall ${\hat{ε}}^{- 1} g_{1}^{x} = {\hat{ε}}^{- 2} {\hat{U}}^{ε, x} - R_{2}^{ε, x}$ so that, for any $κ > 0$ , $\begin{aligned} e^{- (I^{'} (x) / \hat{ε}) g_{1}^{x}} & = e^{- (I^{'} (x) / \hat{ε}) g_{1}^{x}} 1_{{|\hat{ε} \hat{B}|}_{\infty; [0, 1]} \geq κ} \\ + e^{- (I^{'} (x) / {\hat{ε}}^{2}) {\hat{U}}^{ε, x}} e^{I^{'} (x) R_{2}^{ε, x}} 1_{{|\hat{ε} \hat{B}|}_{\infty; [0, 1]} < κ} . \end{aligned}$ Since $I^{'} (x) > 0$ for small enough $x > 0$ , it follows that $- (I^{'} (x) / {\hat{ε}}^{2}) {\hat{U}}^{ε, x} \leq 0$ on the event $\{ {\hat{U}}^{ε, x} \in [0, δ] \}$ , which leads us to $\begin{aligned} J_{δ} (ε, x) & \leq (e^{δ} - 1) E [e^{- (I^{'} (x) / \hat{ε}) g_{1}^{x}}; | \hat{ε} \hat{B} |_{\infty; [0, 1]} \geq κ] \\ + (e^{δ} - 1) E [e^{I^{'} (x) R_{2}^{ε, x}}; | \hat{ε} \hat{B} |_{\infty; [0, 1]} < κ] \\ \leq (e^{δ} - 1) \sqrt{E [e^{- (2 I^{'} (x) / \hat{ε}) g_{1}^{x}}]} \sqrt{P [| \hat{ε} \hat{B} |_{\infty; [0, 1]} \geq κ]} \\ + (e^{δ} - 1) C \end{aligned}$ where, by Proposition 7.4, the constant $C = C (κ)$ is uniform in small ε and x. The square-root terms are computed, respectively estimated (via Fernique), by $\exp (\frac{(I^{'} (x))^{2} V (g_{1}^{x})}{{\hat{ε}}^{2}}) \times \exp (- c κ^{2} / {\hat{ε}}^{2})$ for some $c > 0$ which depends on the law of B (hence H), but is uniform in ε and x. Hence, for x small enough, the resulting exponent $(I^{'} (x))^{2} V (g_{1}^{x}) - c κ^{2}$ is negative, which is more than enough to conclude the upper bound.

Step 3. Lower bound. Write $E_{δ, κ} [\cdot] = E [\cdot 1_{{\hat{U}}^{ε, x} \in [0, δ_{ε}]} 1_{| \hat{ε} \hat{B} |_{\infty; [0, 1]} < κ}]$ and estimate $\begin{aligned} E_{δ, κ} [e^{- (I^{'} (x) / {\hat{ε}}^{2}) {\hat{U}}^{ε} / 2} {(\exp (\frac{ε}{\hat{ε}} {\hat{U}}^{ε}) - 1)}^{1 / 2}] \\ = E_{δ, κ} [e^{- (I^{'} (x) / {\hat{ε}}^{2}) {\hat{U}}^{ε} / 2} {(\exp (\frac{ε}{\hat{ε}} {\hat{U}}^{ε}) - 1)}^{1 / 2} e^{I^{'} (x) R_{2}^{ε} / 2} \\ \times e^{- I^{'} (x) R_{2}^{ε} / 2}] \\ \leq J_{δ} {(ε, x)}^{1 / 2} E_{δ, κ} {[e^{- I^{'} (x) R_{2}^{ε}}]}^{1 / 2} \end{aligned}$ where we used Cauchy–Schwarz and discarded the event $\{ | \hat{ε} \hat{B} |_{\infty; [0, 1]} < κ \}$ . The localized remainder estimate provides an upper bound on $E_{δ, κ} [e^{- I^{'} (x) R_{2}^{ε}}]$ , uniformly over small (enough) ε and x.

It then suffices to get a suitable lower bound of the left-hand side above. Indeed, for $u \in [0, {\hat{ε}}^{2} η] = [0, ε^{4 H} η]$ , with η small enough, not dependent on ε, $u \mapsto (e^{(ε / \hat{ε}) u} - 1)^{1 / 2} e^{- (I^{'} (x) / {\hat{ε}}^{2}) u / 2} \geq γ {(\frac{ε}{\hat{ε}} u)}^{1 / 2}$ (43) for a constant $γ > 0$ which can also be taken uniformly in small $x, ε$ . Then estimate $\begin{aligned} E_{δ, κ} [(e^{(ε / \hat{ε}) {\hat{U}}^{ε}} - 1)^{1 / 2} e^{- (I^{'} (x) / 2 {\hat{ε}}^{2}) {\hat{U}}^{ε}}] \\ \geq γ ε^{1 / 2 - H} E [| {\hat{U}}^{ε} |^{1 / 2} 1_{{\hat{U}}^{ε} \in [0, {\hat{ε}}^{2} η]} 1_{{| \hat{ε} \hat{B} |}_{\infty; [0, 1]} < κ}] . \end{aligned}$ As a quick sanity check, pretend zero remainder so that ${\hat{U}}^{ε} = \hat{ε} g_{1}$ : dropping further the (exponentially close to probability one) event $\{ | \hat{ε} \hat{B} |_{\infty; [0, 1]} < κ \}$ , a Gaussian computation then shows that we are left with ( $γ ε^{1 / 2 - H}$ times ${\hat{ε}}^{1 / 2}$ times) $E [| g_{1} |^{1 / 2}; g_{1} \in [0, \hat{ε}]] \sim (\mathrm{const}) {\hat{ε}}^{3 / 2} .$ In general, set $V^{ε} = {\hat{U}}^{ε} / \hat{ε} = g_{1} + \hat{ε} R_{2}^{ε}$ , so that (with $E_{κ}$ as defined in the Notes) $E_{κ} [| {\hat{U}}^{ε} |^{1 / 2}; {\hat{U}}^{ε} \in [0, {\hat{ε}}^{2} η]] = {\hat{ε}}^{1 / 2} E_{κ} [{|V^{ε}|}^{1 / 2}; V^{ε} \in [0, \hat{ε} η]] .$ At this stage, it is difficult to treat $\hat{ε} R_{2}^{ε}$ as a perturbation of $g_{1}$ since, on the given event $\{ V^{ε} \in [0, \hat{ε} η] \}$ , all terms are of order $\hat{ε}$ . We can solve this issue by realizing that we can replace, throughout, x by $x_{ε} = x ε^{2 β}$ . Since $I^{'} (x_{ε}) \sim (\mathrm{const}) x_{ε}$ , we see from (43) that in the above estimate the event ${\hat{U}}^{ε} \in [0, {\hat{ε}}^{2} η] = [0, ε^{4 H} η]$ (resp. $V^{ε} \in [0, \hat{ε} η] = [0, ε^{2 H} η]$ ) can be replaced by ${\hat{U}}^{ε} \in [0, ε^{4 H - 2 β} η]$ (resp. $V^{ε} \in [0, ε^{2 H - 2 β} η]$ ), possibly with an insignificantly modified constant η. It is now straightforward to show that the behavior of $E_{κ} [| V^{ε} |^{1 / 2}; V^{ε} \in [0, ε^{2 H - 2 β} η]]$ is of the same order as $E [g_{1}^{1 / 2}; g_{1} \in [0, ε^{2 H - 2 β} η]]$ ; the correct behavior (i.e. a positive power of $ε$ ) is obtained by spelling out the (Gaussian) integral.
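The Gaussian integral invoked in the sanity check above can be spelled out numerically. For a centered Gaussian g with variance v and a small window $[0, a]$ , the density is nearly constant on the window, so $E [g^{1 / 2}; g \in [0, a]] \approx \frac{2}{3} a^{3 / 2} / \sqrt{2 π v}$ , a positive power of a. The Python sketch below, whose function names and sample values are assumptions made for illustration, compares a midpoint-rule evaluation with this leading order.

```python
import math


def truncated_sqrt_moment(a, v, n=20000):
    """E[sqrt(g); 0 <= g <= a] for g ~ N(0, v), by the composite midpoint rule."""
    h = a / n
    total = 0.0
    for i in range(n):
        u = (i + 0.5) * h
        total += math.sqrt(u) * math.exp(-u * u / (2.0 * v))
    return total * h / math.sqrt(2.0 * math.pi * v)


def leading_order(a, v):
    """(2/3) a^{3/2} times the Gaussian density at 0, valid as a -> 0."""
    return (2.0 / 3.0) * a**1.5 / math.sqrt(2.0 * math.pi * v)


# Hypothetical variance v = 1; the ratio should approach 1 as a decreases.
for a in (1.0, 0.1, 0.01):
    print(a, truncated_sqrt_moment(a, 1.0) / leading_order(a, 1.0))
```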

Proposition 7.4

Localized remainder tail estimate

For every $κ > 0,$ there exist $c_{1}, c_{2} > 0$ such that, for all r and uniformly in small $ε, x$ , we have $P [|R_{2}^{ε}| > r, | \hat{ε} \hat{B} |_{\infty; [0, 1]} < κ] \leq c_{1} \exp (- c_{2} r) .$

Proof.

We decompose ${\hat{ε}}^{2} R_{2}^{ε} = M^{ε} + N^{ε}$ in terms of the (local) martingale $M^{ε} := \hat{ε} \int_{0}^{\cdot} [σ (\hat{ε} \hat{B} + \hat{f}) - σ (\hat{f})] d [\bar{ρ} W + ρ B]$ and the (bounded variation) process $\begin{aligned} N^{ε} := \int_{0}^{\cdot} [σ (\hat{ε} \hat{B} + \hat{f}) - σ (\hat{f}) - σ^{'} (\hat{f}) \hat{ε} \hat{B}] d [\bar{ρ} h + ρ f] \\ - \frac{1}{2} ε \hat{ε} \int_{0}^{\cdot} σ^{2} (\hat{ε} \hat{B} + \hat{f}) d t . \end{aligned}$ Let $τ^{ε, κ}$ be the stopping time when $\hat{ε} \hat{B}$ first leaves the uniform ball of radius κ. Then $M_{t}^{κ, ε} := M_{t \land τ^{ε, κ}}^{ε}$ still yields a (local) martingale. The point is that $\{ | \hat{ε} \hat{B} |_{\infty; [0, 1]} < κ \} = \{ τ^{ε, κ} > 1 \}$ . On this event, $M^{ε} |_{[0, 1]} = M^{κ, ε} |_{[0, 1]}$ and we can thus replace $M^{ε}$ , in the definition of the remainder, by $M^{κ, ε}$ . Let $K = K^{κ, x}$ be the κ-fattening of $\{ {\hat{f}}_{t} : 0 \leq t \leq 1 \}$ (recall $f = f^{x}$ and $\hat{f} = K \dot{f}$ ); then, for $t \in [0, 1]$ , $d {[M^{κ, ε}]}_{t} / d t = {\hat{ε}}^{2} (σ (\hat{ε} {\hat{B}}_{t} + {\hat{f}}_{t}) - σ ({\hat{f}}_{t}))^{2} \leq {\hat{ε}}^{4} {∥σ^{'}∥}_{\infty; K}^{2} | {\hat{B}}_{t} |^{2} .$ Clearly, we can replace K by ${\tilde{K}}^{κ}$ which contains all $K^{κ, x}$ for small x. To summarize, we have, on the event $\{ | \hat{ε} \hat{B} |_{\infty; [0, 1]} < κ \}$ , $R_{2}^{ε} (\cdot) = {\hat{ε}}^{- 2} M^{κ, ε} + {\hat{ε}}^{- 2} N^{ε}$ with $[{\hat{ε}}^{- 2} M^{κ, ε}] = O (| \hat{B} |_{\infty; [0, 1]}^{2})$ and, as seen by a similar (but easier) reasoning, ${\hat{ε}}^{- 2} N^{ε} = O (| \hat{B} |_{\infty; [0, 1]}^{2})$ , always for fixed $κ > 0$ , but uniformly in small ε (equivalently, $\hat{ε}$ ) and small $x > 0$ . This clearly shows that ${\hat{ε}}^{- 2} N^{ε}$ has exponential tails. The same is true for the martingale part, whose bracket is $O ({Gaussian}^{2})$ . This is exactly the situation for the ‘model’ martingale increment $2 \int_{0}^{1} B d B = B_{1}^{2} - 1$ which clearly has exponential tails. To make this rigorous, recall that Gaussian resp. exponential tails are characterized by $O (\sqrt{p})$ resp. $O (p)$ -growth of the $L^{p}$ -norms. The statement is then an easy consequence of the sharp (upper) BDG constant (Carlen and Kree Citation1991), known to be $O (\sqrt{p})$ .
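The exponential tail of the ‘model’ martingale increment $B_{1}^{2} - 1$ can be made fully explicit. Since $P [Z^{2} > s] = \mathrm{erfc} (\sqrt{s / 2})$ for a standard normal Z, and $\mathrm{erfc} (y) \leq e^{- y^{2}}$ for $y \geq 0$ , one admissible choice of constants in a bound of the form $c_{1} \exp (- c_{2} r)$ is $c_{1} = e^{1 / 2}$ , $c_{2} = 1 / 2$ . These constants are worked out here for illustration and are not taken from the paper. The Python sketch below, with hypothetical function names, compares the exact tail with this bound.

```python
import math


def model_martingale_tail(r):
    """Exact P[|B_1^2 - 1| > r] for B_1 ~ N(0, 1) and r >= 0."""
    upper = math.erfc(math.sqrt((1.0 + r) / 2.0))  # P[B_1^2 > 1 + r]
    # P[B_1^2 < 1 - r]; this event is empty once r >= 1
    lower = math.erf(math.sqrt((1.0 - r) / 2.0)) if r < 1.0 else 0.0
    return upper + lower


def exponential_bound(r, c1=math.exp(0.5), c2=0.5):
    """Candidate bound c1 * exp(-c2 * r); the default constants are an assumption."""
    return c1 * math.exp(-c2 * r)


for r in (0.0, 0.5, 1.0, 5.0, 20.0):
    print(r, model_martingale_tail(r), exponential_bound(r))
```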

8. Proof of the implied volatility expansion

With Theorem 3.2 in place, we now turn to the proof of the implied volatility expansion, formulated in Theorem 3.6.

Proof of Theorem 3.6

We will use an asymptotic formula for the dimensionless implied variance $V_{t}^{2} = t σ_{impl} (k_{t}, t)^{2}, t > 0,$ obtained in Gao and Lee (Citation2014). It follows from the first formula in Remark 7.3 in Gao and Lee (Citation2014) that $V_{t}^{2} - \frac{k_{t}^{2}}{2 L_{t}} = O (\frac{k_{t}^{2}}{L_{t}^{2}} (k_{t} + | \log k_{t} | + \log L_{t})), t \to 0,$ (44) where $L_{t} = - \log c (k_{t}, t)$ , $t > 0$ .

We will need the following formula that was established in the proof of Theorem 3.4: $L_{t} = \frac{I (k t^{β})}{t^{2 H}} + O (t^{- θ})$ (45) as $t \to 0$ , for all $k \geq 0$ and $β \in [0, H)$ and any $θ > 0$ . Let us first assume $2 H / (n + 1) \leq β < 2 H / n$ . Using the energy expansion, we obtain from (45) that $\begin{aligned} L_{t} & = \sum_{i = 2}^{n} \frac{I^{(i)} (0)}{i!} k^{i} t^{i β - 2 H} + O (t^{- θ}) = \frac{I^{′′} (0)}{2} k^{2} t^{2 β - 2 H} \\ \times [1 + \sum_{i = 3}^{n} \frac{2 I^{(i)} (0)}{i! I^{′′} (0)} k^{i - 2} t^{(i - 2) β} + O (t^{2 H - 2 β - θ})] \end{aligned}$ (46) as $t \to 0$ . The second term in the brackets on the right-hand side of (46) disappears if $n = 2$ .

Remark 8.1

Suppose $n \geq 2$ and $2 H / (n + 1) \leq β < 2 H / n$ . Then formula (46) is optimal. Next, suppose $n \geq 2$ and $0 < β < 2 H / (n + 1)$ . In this case, there exists $m \geq n + 1$ such that $2 H / (m + 1) \leq β < 2 H / m$ , and hence (46) holds with m instead of n. However, we can replace m by n, by making the error term worse. It is not hard to see that the following formula holds for all $n \geq 2$ and $0 < β < 2 H / (n + 1)$ : $\begin{aligned} L_{t} & = \sum_{i = 2}^{n} \frac{I^{(i)} (0)}{i!} k^{i} t^{i β - 2 H} + O (t^{(n + 1) β - 2 H}) = \frac{I^{′′} (0)}{2} k^{2} t^{2 β - 2 H} \\ \times [1 + \sum_{i = 3}^{n} \frac{2 I^{(i)} (0)}{i! I^{′′} (0)} k^{i - 2} t^{(i - 2) β} + O (t^{(n - 1) β})] \end{aligned}$ (47) as $t \to 0$ , provided we choose θ small enough.

Let us continue the proof of Theorem 3.6. Since $k_{t} \approx t^{1 / 2 - H + β}$ and $L_{t} \approx t^{2 β - 2 H}$ as $t \to 0$ , (44) implies that $V_{t}^{2} = \frac{k^{2} t^{1 - 2 H + 2 β}}{2 L_{t}} + O (t^{1 + 2 H - 2 β - θ}), t \to 0.$ (48) Next, using the Taylor formula for the function $u \mapsto 1 / (1 + u)$ , and setting $u = \sum_{i = 3}^{n} \frac{2 I^{(i)} (0)}{i! I^{′′} (0)} k^{i - 2} t^{(i - 2) β} + O (t^{2 H - 2 β - θ}),$ we obtain from (46) that $(2 L_{t})^{- 1} = \frac{t^{2 H - 2 β}}{k^{2} I^{′′} (0)} [\sum_{j = 0}^{n - 2} (- 1)^{j} u^{j} + O (u^{n - 1})]$ as $t \to 0$ . It follows from $2 H / (n + 1) \leq β < 2 H / n$ that $(n - 1) β \geq 2 H - 2 β$ , and hence $\begin{aligned} (2 L_{t})^{- 1} & = \frac{t^{2 H - 2 β}}{k^{2} I^{′′} (0)} [\sum_{j = 0}^{n - 2} (- 1)^{j} u^{j}] + O (t^{4 H - 4 β - θ}) \\ = \frac{t^{2 H - 2 β}}{k^{2} I^{′′} (0)} [\sum_{j = 0}^{n - 2} (- 1)^{j} {(\sum_{i = 3}^{n} \frac{2 I^{(i)} (0)}{i! I^{′′} (0)} k^{i - 2} t^{(i - 2) β})}^{j}] \\ + O (t^{4 H - 4 β - θ}) \end{aligned}$ as $t \to 0$ . Now, (48) gives $\begin{aligned} V_{t}^{2} & = \frac{t}{I^{′′} (0)} [\sum_{j = 0}^{n - 2} (- 1)^{j} {(\sum_{i = 3}^{n} \frac{2 I^{(i)} (0)}{i! I^{′′} (0)} k^{i - 2} t^{(i - 2) β})}^{j}] \\ + O (t^{1 + 2 H - 2 β - θ}) \end{aligned}$ as $t \to 0$ . Finally, by canceling a factor of t in the previous formula, we obtain formula (14) for $2 H / (n + 1) \leq β < 2 H / n$ . The proof in the case where $β \leq 2 H / (n + 1)$ is similar. Here we take into account Remark 8.1. This completes the proof of Theorem 3.6.
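To make the structure of formula (14) concrete, the following Python sketch evaluates its main term $\sum_{j = 0}^{n - 2} \frac{(- 1)^{j} 2^{j}}{I^{′′} (0)^{j + 1}} (\sum_{i = 3}^{n} \frac{I^{(i)} (0)}{i!} k^{i - 2} t^{(i - 2) β})^{j}$ from a table of derivatives of I at zero, ignoring the error term. The function name, the dictionary convention for the derivatives, and the sample parameters are assumptions made for illustration. For $n = 3$ , inserting $I^{′′} (0) = 1 / σ_{0}^{2}$ and $I^{(3)} (0) = - 6 ρ (σ_{0}^{'} / σ_{0}^{4}) ⟨K 1, 1⟩$ from Proposition 5.14 reduces the sum to $σ_{0}^{2} + 2 ρ σ_{0}^{'} ⟨K 1, 1⟩ k t^{β}$ , which the sketch confirms.

```python
import math


def implied_variance_main_term(I_derivs, k, t, beta, n):
    """Main term of formula (14); I_derivs[i] holds I^{(i)}(0) for i = 2..n."""
    I2 = I_derivs[2]
    inner = sum(
        I_derivs[i] / math.factorial(i) * k ** (i - 2) * t ** ((i - 2) * beta)
        for i in range(3, n + 1)
    )
    # j runs over 0, ..., n - 2; for n = 2 the inner sum is empty and only j = 0 remains
    return sum((-1) ** j * 2**j / I2 ** (j + 1) * inner**j for j in range(n - 1))


# Hypothetical parameters; K11 is <K1,1> for the Riemann-Liouville kernel.
sigma0, dsigma0, rho, H = 0.2, 0.3, -0.7, 0.1
K11 = 1.0 / ((H + 0.5) * (H + 1.5))
I_derivs = {2: 1.0 / sigma0**2, 3: -6.0 * rho * dsigma0 / sigma0**4 * K11}
k, t, beta = 0.5, 0.01, 0.05  # beta chosen in [2H/4, 2H/3), matching n = 3

value = implied_variance_main_term(I_derivs, k, t, beta, n=3)
reduced = sigma0**2 + 2.0 * rho * dsigma0 * K11 * k * t**beta
print(value, reduced)
```

With a negative correlation ρ and positive $σ_{0}^{'}$ , the first-order correction is negative for positive k, in line with a downward-sloping skew.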

Acknowledgments

Two referees are thanked for their useful comments. We further thank Martin Forde for valuable feedback.

Disclosure statement

No potential conflict of interest was reported by the authors.

Additional information

Funding

We gratefully acknowledge financial support through DFG research grants FR2943/2 and BA5484/1 (C. Bayer, P.K. Friz, B. Stemper), European Research Council Grant CoG-683164 (P.K. Friz), and SNF Early Postdoc Mobility Grant 165248 (B. Horvath) respectively.

Notes

† More terms in the expansion of Φ are needed.

† Note that expressions for the exact same scenario have been computed before in the original pricing paper (Bayer et al. Citation2016), yet in that version the expression for the autocorrelation of the fBM $\hat{B}$ was incorrect. We compute and state here all the relevant terms for the sake of completeness.

† The Python 3 code used to run the simulations can be found at github.com/RoughStochVol.

† More precisely, since neither σ nor its derivatives need to be bounded, we need to actually work with a local version of the above estimate, for instance by replacing the max with a sup over a compact set containing $\{ (K \dot{f}) (t) : 0 \leq t \leq 1 \}$ .

† Write $E_{κ}$ for the expected value restricted to the event $\{ | \hat{ε} \hat{B} |_{\infty; [0, 1]} < κ \}$ .

References

Alòs, E., León, J.A. and Vives, J., On the short-time behavior of the implied volatility for jump-diffusion models with stochastic volatility. Finance Stoch., 2007, 11(4), 571–589. doi: 10.1007/s00780-007-0049-1
Azencott, R., Formule de Taylor stochastique et développement asymptotique d'intégrales de Feynman. In Seminar on Probability, XVI, Supplement, volume 921 of Lecture Notes in Math., pp. 237–285, 1982 (Springer: Berlin-New York).
Azencott, R., Petites perturbations aléatoires des systèmes dynamiques: développements asymptotiques. Bull. Sci. Math., 1985, 109(3), 253–308.
Baudoin, F. and Ouyang, C., On small time asymptotics for rough differential equations driven by fractional Brownian motions. In Large Deviations and Asymptotic Methods in Finance, edited by P.K. Friz, J. Gatheral, A. Gulisashvili, A. Jacquier, and J. Teichmann, pp. 413–438, 2015 (Springer International Publishing: Cham).
Bayer, C., Friz, P.K., Gassiat, P., Martin, J. and Stemper, B., A regularity structure for rough volatility. Preprint, 2017. arXiv:1710.07481.
Bayer, C., Friz, P.K. and Gatheral, J., Pricing under rough volatility. Quant. Finance, 2016, 16(6), 887–904. doi: 10.1080/14697688.2015.1099717
Ben Arous, G., Méthodes de Laplace et de la phase stationnaire sur l'espace de Wiener. Stochastics, 1988, 25(3), 125–153. doi: 10.1080/17442508808833536
Bennedsen, M., Lunde, A. and Pakkanen, M.S., Decoupling the short- and long-term behavior of stochastic volatility. Preprint, 2016. arXiv:1610.00332.
Bennedsen, M., Lunde, A. and Pakkanen, M.S., Hybrid scheme for Brownian semistationary processes. Finance Stoch., 2017, 21(4), 931–965. doi: 10.1007/s00780-017-0335-5
Bismut, J.-M., Large Deviations and the Malliavin Calculus. Progress in Mathematics, Vol. 45. 1984 (Birkhäuser Boston, Inc.: Boston, MA).
Carlen, E. and Kree, P., Estimates on iterated stochastic integrals. Ann. Probab., 1991, 19(1), 354–368. doi: 10.1214/aop/1176990549
Cass, T. and Friz, P., Densities for rough differential equations under Hörmander's condition. Ann. Math., 2010, 171(3), 2115–2141. doi: 10.4007/annals.2010.171.2115
Decreusefond, L., Stochastic integration with respect to Volterra processes. Ann. Inst. H. Poincaré Probab. Statist., 2005, 41(2), 123–149. doi: 10.1016/j.anihpb.2004.03.004
Deuschel, J.-D., Friz, P.K., Jacquier, A. and Violante, S., Marginal density expansions for diffusions and stochastic volatility I: Theoretical foundations. Comm. Pure Appl. Math., 2014a, 67(1), 40–82. doi: 10.1002/cpa.21478
Deuschel, J.-D., Friz, P.K., Jacquier, A. and Violante, S., Marginal density expansions for diffusions and stochastic volatility II: Applications. Comm. Pure Appl. Math., 2014b, 67(2), 321–350. doi: 10.1002/cpa.21483
Deuschel, J.-D. and Stroock, D.W., Large Deviations, Vol. 137, 1989 (Academic Press: Boston, MA).
El Euch, O. and Rosenbaum, M., The characteristic function of rough Heston models. Preprint, 2016. To appear in Math. Finance.
Forde, M. and Jacquier, A., Small-time asymptotics for implied volatility under the Heston model. Int. J. Theoret. Appl. Finance, 2009, 12(06), 861–876. doi: 10.1142/S021902490900549X
Forde, M. and Zhang, H., Asymptotics for rough stochastic volatility models. SIAM J. Financ. Math., 2017, 8(1), 114–145. doi: 10.1137/15M1009330
Friz, P.K. and Gassiat, P., Martingality and moments for lognormal rough volatility. In preparation, 2018.
Friz, P.K., Gerhold, S. and Pinter, A., Option Pricing in the Moderate Deviations Regime. Math. Finance, 2018, 28(3), 962–988. doi: 10.1111/mafi.12156
Friz, P. and Hairer, M., A Course on Rough Paths, 2014 (Springer: Cham).
Fukasawa, M., Asymptotic analysis for stochastic volatility: martingale expansion. Finance Stoch., 2011, 15(4), 635–654. doi: 10.1007/s00780-010-0136-6
Fukasawa, M., Short-time at-the-money skew and rough fractional volatility. Quant. Finance, 2017, 17(2), 189–198. doi: 10.1080/14697688.2016.1197410
Gao, K. and Lee, R., Asymptotics of implied volatility to arbitrary order. Finance Stoch., 2014, 18(2), 349–392. doi: 10.1007/s00780-013-0223-6
Gatheral, J., The Volatility Surface: A Practitioner's Guide, 2011 (John Wiley & Sons: Hoboken, NJ).
Gatheral, J., Jaisson, T. and Rosenbaum, M., Volatility is rough. Preprint, 2014. To appear in Quant. Finance.
Guennoun, H., Jacquier, A. and Roome, P., Asymptotic behaviour of the fractional Heston model. Preprint, 2014. arXiv:1411.7653.
Gulisashvili, A., Large deviation principle for Volterra type fractional stochastic volatility models. ArXiv e-prints, October 2017. To appear in SIAM J. Financ. Math.
Inahama, Y., Laplace approximation for rough differential equation driven by fractional Brownian motion. Ann. Probab., 2013, 41(1), 170–205. doi: 10.1214/11-AOP733
Jacquier, A., Pakkanen, M.S. and Stone, H., Pathwise large deviations for the Rough Bergomi model. ArXiv e-prints, June 2017.
Jourdain, B., Loss of martingality in asset price models with lognormal stochastic volatility. Int. J. Theoret. Appl. Finance, 2004, 13, 767–787.
Lamperti, J., Semi-stable stochastic processes. Trans. Am. Math. Soc., 1962, 104(1), 62–78. doi: 10.1090/S0002-9947-1962-0138128-7
Lions, P.-L. and Musiela, M., Correlations and bounds for stochastic volatility models. Ann. Inst. H. Poincaré Anal. Non Linéaire, 2007, 24(1), 1–16.
Medvedev, A. and Scaillet, O., A simple calibration procedure of stochastic volatility models with jumps by short term asymptotics. Preprint, 2003. Available at SSRN 477441.
Medvedev, A. and Scaillet, O., Approximation and calibration of short-term implied volatilities under jump-diffusion stochastic volatility. Rev. Financ. Stud., 2007, 20(2), 427–459. doi: 10.1093/rfs/hhl013
Mijatović, A. and Tankov, P., A new look at short-term implied volatility in asset price models with jumps. Math. Finance, 2016, 26(1), 149–183. doi: 10.1111/mafi.12055
Muhle-Karbe, J. and Nutz, M., Small-time asymptotics of option prices and first absolute moments. J. Appl. Probab., 2011, 48(4), 1003–1020. doi: 10.1239/jap/1324046015
Olver, F.W.J., Lozier, D.W., Boisvert, R.F. and Clark, C.W. (Eds.), NIST Handbook of Mathematical Functions, 2010 (Cambridge University Press: New York).
Osajima, Y., The asymptotic expansion formula of implied volatility for dynamic SABR model and FX hybrid model. Preprint, 2007. Available at SSRN 965265.
Osajima, Y., General asymptotics of Wiener functionals and application to implied volatilities. In Large Deviations and Asymptotic Methods in Finance, edited by P.K. Friz, J. Gatheral, A. Gulisashvili, A. Jacquier, and J. Teichmann, pp. 137–173, 2015 (Springer International Publishing: Cham).
Pham, H., Large deviations in mathematical finance, 2010. Available online at: https://www.lpsm.paris/pageperso/pham/GD-finance.pdf.
Sin, C.A., Complications with stochastic volatility models. Adv. Appl. Probab., 1998, 30(1), 256–268. doi: 10.1239/aap/1035228003

Appendix. Auxiliary lemmas

In this appendix we state and prove some auxiliary lemmas, which are used in preparing the proof of Theorem 3.2. We start with a technical lemma that justifies the derivation.

Lemma A.1

Assume $σ (\cdot) > 0$ and $| ρ | < 1$ . Then $K^{x}$ is a Hilbert manifold near any $h := (h, f) \in K^{x} \subset H := H_{0}^{1} \times H_{0}^{1}$ .

Proof.

Similar to Bismut (Citation1984, p. 25), we need to show that $D ϕ_{1} (h)$ is surjective, where $ϕ_{1} : H \to \mathbb{R}$ is given by $ϕ_{1} (h) = ϕ_{1} (h, f) = \int_{0}^{1} σ (\hat{f}) d (\bar{ρ} h + ρ f) .$ From $\begin{aligned} ϕ_{1} (h + δ h^{'}) & = \int_{0}^{1} σ (\hat{f} + δ {\hat{f}}^{'}) d (\bar{ρ} h + ρ f + δ (\bar{ρ} h^{'} + ρ f^{'})) \\ & = ϕ_{1} (h) + δ \int_{0}^{1} σ (\hat{f}) d (\bar{ρ} h^{'} + ρ f^{'}) \\ & \quad + δ \int_{0}^{1} σ^{'} (\hat{f}) {\hat{f}}^{'} d (\bar{ρ} h + ρ f) + o (δ) , \end{aligned}$ the functional derivative $D ϕ_{1} (h)$ can be computed explicitly. In fact, even the computation $⟨D ϕ_{1} (h), (h^{'}, 0)⟩ = \bar{ρ} \int_{0}^{1} σ (\hat{f}) d h^{'}$ is sufficient to guarantee surjectivity of $D ϕ_{1} (h)$ : since $σ > 0$ and $\bar{ρ} \neq 0$ (as $| ρ | < 1$ ), the choice $h^{'} (t) = t$ gives a nonzero value, and a nonzero linear functional into $\mathbb{R}$ is surjective.
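The derivative formula used in this proof can be checked numerically on a discretization. The following Python sketch is an illustration only and is not the authors' code. It assumes the Riemann-Liouville kernel $K (t, s) = \sqrt{2 H} (t - s)^{H - 1 / 2}$ for $\hat{f} = K \dot{f}$ , an exponential volatility function, $\bar{ρ} = \sqrt{1 - ρ^{2}}$ , and hypothetical parameter values; all function names are invented for this example.

```python
import numpy as np

def volterra_matrix(n, hurst):
    """Matrix M with (M @ g)[i] = int_0^{t_i} K(t_i, s) g(s) ds, t_i = (i + 1) / n,
    for piecewise-constant g and the ASSUMED kernel K(t, s) = sqrt(2H) (t - s)^(H - 1/2)."""
    dt = 1.0 / n
    i = np.arange(n)[:, None]
    j = np.arange(n)[None, :]
    a = hurst + 0.5
    lo = np.maximum(i + 1 - j, 0) * dt  # t_i - s_j, clipped at 0 for cells beyond t_i
    hi = np.maximum(i - j, 0) * dt      # t_i - s_{j+1}, clipped likewise
    # exact integral of the kernel over each grid cell
    return np.sqrt(2.0 * hurst) / a * (lo**a - hi**a)

def phi1(hdot, fdot, K, rho, sigma):
    """Discretized phi_1(h, f) = int_0^1 sigma(f_hat) d(rho_bar h + rho f)."""
    dt = 1.0 / len(hdot)
    rho_bar = np.sqrt(1.0 - rho**2)
    return np.sum(sigma(K @ fdot) * (rho_bar * hdot + rho * fdot)) * dt

def dphi1(hdot, fdot, hdot_p, fdot_p, K, rho, sigma, dsigma):
    """Directional derivative of phi_1 at (h, f) in direction (h', f'), per the expansion above."""
    dt = 1.0 / len(hdot)
    rho_bar = np.sqrt(1.0 - rho**2)
    f_hat, f_hat_p = K @ fdot, K @ fdot_p
    first = np.sum(sigma(f_hat) * (rho_bar * hdot_p + rho * fdot_p)) * dt
    second = np.sum(dsigma(f_hat) * f_hat_p * (rho_bar * hdot + rho * fdot)) * dt
    return first + second

# Hypothetical model choices, for illustration only.
n, hurst, rho = 200, 0.1, -0.7
sigma = lambda x: 0.2 * np.exp(0.5 * x)
dsigma = lambda x: 0.1 * np.exp(0.5 * x)

K = volterra_matrix(n, hurst)
rng = np.random.default_rng(0)
hdot, fdot, hdot_p, fdot_p = rng.standard_normal((4, n))

# Central finite difference versus the analytic directional derivative.
delta = 1e-6
fd = (phi1(hdot + delta * hdot_p, fdot + delta * fdot_p, K, rho, sigma)
      - phi1(hdot - delta * hdot_p, fdot - delta * fdot_p, K, rho, sigma)) / (2 * delta)
exact = dphi1(hdot, fdot, hdot_p, fdot_p, K, rho, sigma, dsigma)

# Surjectivity check: direction (h', 0) with h'(t) = t, i.e. hdot' = 1.
surj = dphi1(hdot, fdot, np.ones(n), np.zeros(n), K, rho, sigma, dsigma)
print(abs(fd - exact), surj)
```

In this discretized setting the finite-difference quotient and the analytic expression should agree up to rounding, and the value in the direction $(h^{'}, 0)$ with $h^{'} (t) = t$ is strictly positive, which mirrors the surjectivity argument.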

We now give the proof of Lemma 6.3, which determines the form of the Girsanov measure change (36), $\begin{aligned} G_{ε} & = \exp \Big( - \frac{1}{\hat{ε}} \int_{0}^{1} {\dot{h}}_{s} d W_{s} - \frac{1}{\hat{ε}} \int_{0}^{1} {\dot{f}}_{s} d B_{s} \\ & \qquad - \frac{1}{2 {\hat{ε}}^{2}} \int_{0}^{1} ({\dot{h}}_{s}^{2} + {\dot{f}}_{s}^{2}) d s \Big) , \end{aligned}$ for the minimizing configuration.

Lemma A.2

(i) Any optimal control $h^{0} = (h^{x}, f^{x}) \in K^{x}$ is a critical point of $h = (h, f) \mapsto - I (ϕ_{1}^{h}) + \frac{1}{2} {∥h∥}_{H}^{2};$ (ii) it holds that $\int_{0}^{1} {\dot{h}}^{x} d W + \int_{0}^{1} {\dot{f}}^{x} d B = I^{'} (x) g_{1} .$

Proof.

(Step 1) Write $h = (h, f)$ and $ϕ_{1} (h) = ϕ_{1} (h, f) = \int_{0}^{1} σ (\hat{f}) d (\bar{ρ} h + ρ f) .$ Let $h^{0} = (h^{x}, f^{x}) \in K^{x}$ be an optimal control. Then $\mathrm{Ker} \, D ϕ_{1} (h^{0}) = T_{h^{0}} K^{x} = \{h \in H^{1} : D ϕ_{1} (h) = 0\} .$ (This requires $K^{x}$ to be a Hilbert manifold near $h^{0}$ , as was seen in the last lemma.)

(Step 2) For fixed $h \in H$ , define $u (t) := - I (ϕ_{1}^{h^{0} + t h}) + \frac{1}{2} {∥h^{0} + t h∥}_{H}^{2} \geq 0 ,$ with equality at $t = 0$ (since $x = ϕ_{1}^{h^{0}}$ and $I (x) = \frac{1}{2} ∥ h^{0} ∥_{H}^{2}$ ) and non-negativity for all $t$ , because $h^{0} + t h$ is an admissible control for reaching $\tilde{x} = ϕ_{1}^{h^{0} + t h}$ (so that $I (\tilde{x}) = \inf \{\dots\} \leq \frac{1}{2} ∥ h^{0} + t h ∥_{H}^{2}$ ).

(Step 3) We note that $\dot{u} (0) = 0$ is a consequence of $u \in C^{1}$ near 0, $u (0) = 0$ and $u \geq 0$ . In other words, $h^{0}$ is a critical point for $H^{1} ∋ h \mapsto - I (ϕ_{1}^{h}) + \frac{1}{2} {∥h∥}_{H}^{2} .$

(Step 4) The functional derivative of this map at $h^{0}$ must hence be zero. In particular, for all $h \in H$ ,

$\begin{aligned} 0 & \equiv - I^{'} (ϕ_{1}^{h^{0}}) ⟨D ϕ_{1} (h^{0}), h⟩ + ⟨h^{0}, h⟩ \\ & = - I^{'} (x) ⟨D ϕ_{1} (h^{0}), h⟩ + ⟨h^{0}, h⟩ . \end{aligned}$

(Step 5) With $h^{0} = (h^{x}, f^{x})$ and $h = (h, f)$ , $\begin{aligned} ⟨D ϕ_{1} (h^{0}), h⟩ & = {\frac{d}{d ε} \Big|}_{ε = 0} \int_{0}^{1} σ ({\hat{f}}^{x} + ε \hat{f}) d (\bar{ρ} h^{x} + ρ f^{x} + ε (\bar{ρ} h + ρ f)) \\ & = \int_{0}^{1} σ ({\hat{f}}^{x}) d (\bar{ρ} h + ρ f) + \int_{0}^{1} σ^{'} ({\hat{f}}^{x}) \hat{f} d (\bar{ρ} h^{x} + ρ f^{x}) . \end{aligned}$

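By continuous extension, replace $h = (h, f)$ by $(W, B)$ above and note that $⟨D ϕ_{1} (h^{0}), (W, B)⟩ = g_{1}$ , since indeed $g_{1} = \int_{0}^{1} σ ({\hat{f}}_{t}) d (\bar{ρ} W_{t} + ρ B_{t}) + \int_{0}^{1} σ^{'} ({\hat{f}}_{t}) {\hat{B}}_{t} d (\bar{ρ} h_{t} + ρ f_{t})$ , with $(h, f) = (h^{x}, f^{x})$ in this expression. Likewise, $⟨h^{0}, (W, B)⟩ = \int_{0}^{1} {\dot{h}}^{x} d W + \int_{0}^{1} {\dot{f}}^{x} d B$ , so the identity of Step 4 yields $\int_{0}^{1} {\dot{h}}^{x} d W + \int_{0}^{1} {\dot{f}}^{x} d B = I^{'} (x) g_{1} .$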
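As a sanity check of part (ii), consider the constant-volatility (Black-Scholes) case $σ \equiv σ_{0} > 0$ . The computation below is a sketch under two assumptions not restated in this appendix: $\bar{ρ} = \sqrt{1 - ρ^{2}}$ , and $I (x)$ is the minimal energy $\frac{1}{2} ∥ h ∥_{H}^{2}$ among controls with $ϕ_{1} (h) = x$ , as used in Step 2. By the Cauchy-Schwarz inequality the minimizer has constant derivatives, and since $σ^{'} \equiv 0$ the second term of $g_{1}$ vanishes.

```latex
\begin{aligned}
\phi_1(h, f) &= \sigma_0 \bigl(\bar\rho\, h_1 + \rho\, f_1\bigr), \\
I(x) &= \inf\Bigl\{ \tfrac12 \|(h, f)\|_H^2 : \phi_1(h, f) = x \Bigr\}
      = \frac{x^2}{2 \sigma_0^2},
  \qquad I'(x) = \frac{x}{\sigma_0^2}, \\
\dot h^x_t &\equiv \frac{\bar\rho\, x}{\sigma_0},
  \qquad \dot f^x_t \equiv \frac{\rho\, x}{\sigma_0}, \\
g_1 &= \sigma_0 \bigl(\bar\rho\, W_1 + \rho\, B_1\bigr), \\
\int_0^1 \dot h^x \, dW + \int_0^1 \dot f^x \, dB
  &= \frac{x}{\sigma_0} \bigl(\bar\rho\, W_1 + \rho\, B_1\bigr)
   = I'(x)\, g_1 .
\end{aligned}
```

Both sides agree, and the energy of the minimizer is $\frac{1}{2} (\bar{ρ}^{2} + ρ^{2}) x^{2} / σ_{0}^{2} = x^{2} / (2 σ_{0}^{2})$ , consistent with the stated value of $I (x)$ .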
