Full article: Optimal trade execution for Gaussian signals with power-law resilience

Formulae display: $MathJax Logo$ ?Mathematical formulae have been encoded as MathML and are displayed in this HTML version using MathJax in order to improve their display. Uncheck the box to turn MathJax off. This feature requires Javascript. Click on a formula to zoom.

Abstract

We characterize the optimal signal-adaptive liquidation strategy for an agent subject to power-law resilience and zero temporary price impact with a Gaussian signal, which can include e.g an OU process or fractional Brownian motion. We show that the optimal selling speed $u_{t}^{*}$ is a Gaussian Volterra process of the form $u^{*} (t) = u^{0} (t) + \bar{u} (t) + \int_{0}^{t} k (u, t) d W_{u}$ on $[0, T)$ , where $k (\cdot, \cdot)$ and $\bar{u}$ satisfy a family of (linear) Fredholm integral equations of the first kind which can be solved in terms of fractional derivatives. The term $u^{0} (t)$ is the (deterministic) solution for the no-signal case given in Gatheral et al. [Transient linear price impact and Fredholm integral equations. Math. Finance, 2012, 22, 445–474], and we give an explicit formula for $k (u, t)$ for the case of a Riemann-Liouville price process as a canonical example of a rough signal. With non-zero linear temporary price impact, the integral equation for $k (u, t)$ becomes a Fredholm equation of the second kind. These results build on the earlier work of Gatheral et al. [Transient linear price impact and Fredholm integral equations. Math. Finance, 2012, 22, 445–474] for the no-signal case, and complement the recent work of Neuman and Voß[Optimal signal-adaptive trading with temporary and transient price impact. Preprint, 2020]. Finally we show how to re-express the trading speed in terms of the price history using a new inversion formula for Gaussian Volterra processes of the form $\int_{0}^{t} g (t - s) d W_{s}$ , and we calibrate the model to high frequency limit order book data for various NASDAQ stocks.

Keywords:

1. Introduction

A critical problem for algorithmic traders is how to optimally split a large trade so as to minimize trading costs and market impact. The seminal article of Almgren and Chriss (Citation2001) formulates this problem as trade-off between expected execution cost and risk; more specifically, they assume the stock price is a martingale and execution costs are linear in the trading rate and the choice of risk criterion is variance. Under these assumptions, there is a well known closed-form analytical solution for the optimal selling speed which is deterministic.

More recently, authors have begun to relax the martingale assumption of Almgren-Chriss to incorporate the effect of signals. In particular, Cartea and Jaimungal (Citation2016) provide empirical evidence of the impact of order flow on NASDAQ stocks, and propose a model of order flow for an investor who executes a large order when market order-flow from all agents, including the investor's own trades, has a permanent price impact (see also Section 7.3 in Cartea et al. Citation2015). Cartea and Jaimungal (Citation2016) derive a closed-form solution for the optimal strategy where the rate of trading depends on the expectation of future order flow. Cartea et al. (Citation2018) show that volume imbalance is an effective predictor of the sign of future market orders, and how trading signals arising from order flow can be used to execute large orders and make markets. More recently Kalsi et al. (Citation2020) and Cartea et al. (Citation2020) use signals as inputs to the signature of the market to devise trading algorithms.

For the case of zero signal with a general impact function G, the optimal trading strategy is deterministic and satisfies $\int_{0}^{T} G (| t - v |) d X_{v} = λ$ , which is a Fredholm integral equation of the first kind. The constant λ has to be chosen so as to enforce the liqudation condition $X_{T} = 0$ , and Gatheral et al. (Citation2012) prove existence in this case if G is non-constant, non-decreasing, convex and integrable at zero. The Fredholm equation can be solved explicitly for the case of exponential and power law impact. For the former, the solution is well known from Obizhaeva and Wang (Citation2013) and consists of a block (i.e. an impulse response) sell trade at time zero and at the final maturity, with continuous selling in between proportional to the resilience parameter ρ (see also Example 2.12 in Gatheral et al. Citation2012). For the case of power law impact, the integral equation reduces to the well known Abel integral equation which also has an explicit solution which is U-shaped and symmetric, c.f. Section 2.2 in Curato et al. (Citation2017). The Fredholm equation becomes a weakly singular Urysohn equation of the first kind if the temporary price impact component is non-linear, i.e. the price paid per unit stock is $S_{t} + \int_{0}^{t} G (t - s) f ({\dot{X}}_{s}) d t$ for some non-linear impact function f, and X is assumed to be absolutely continuous (see Dang Citation2014, Curato et al. Citation2017 for more on this, and numerical schemes for solving such non-linear integral equations).

Belak et al. (Citation2020) derive the optimal trading strategy for a linear price impact model with a partial liquidation penalty of the form $Γ X_{T}^{2}$ for $Γ > 0$ , when the stock price is a general unspecified semimartingale. Using a similar variational argument to Bank et al. (Citation2017), they show that $(X_{t}, {\dot{X}}_{t})$ satisfies a coupled linear Forward-Backward Stochastic Differential Equation (FBSDE), which can be re-written in a matrix form and solved explicitly using the same trick that is used to compute the solution for a standard OU process. The Belak et al. (Citation2020) argument can be very easily adapted to deal with the infinite penalty case $Γ = \infty$ by simply replacing the vector $(\frac{Γ}{λ}, - 1)$ with $(1, 0)$ , but one would need to verify admissibility of the solution.

More recently, Neuman and Voß (Citation2020) consider the problem of optimal trade execution under exponential resilience i.e. $G (t - s) = c o n s t . \times e^{- ρ (t - s)}$ , with a general square integrable semi-martingale price process and: (i) a non-zero temporary price impact and (ii) a finite quadratic penalty for non-liquidation. The solution is shown to satisfy a system of four coupled linear FBSDEs in $X_{t}$ , $u_{t}$ , $Y_{t} = \int_{0}^{t} e^{- ρ (t - s)} u_{s} d s$ and an auxiliary process $Z_{t}$ . These can solved explicitly in terms of the matrix exponential function using similar arguments to Belak et al. (Citation2020), to find that the optimal selling speed (in feedback form) is affine-linear in the current inventory $X_{t}$ and $Y_{t}$ .

Lorenz and Schied (Citation2013) show that for exponential resilience with zero temporary price impact and semimartingale price process, optimal trading strategies $(X_{t})_{t \in [0, T]}$ with bounded variation do not exist in general. Hence one has to enlarge the space of admissible strategies to the class of all semimartingales, which includes processes with non-zero quadratic variation. In this setting, Theorem 2.6 in Lorenz and Schied (Citation2013) computes the optimal $X_{t}$ (with the surprising result that if the drift is not absolutely continuous then the expected profit/loss is infinite, although such trading strategies with infinite variation will of course incur infinite transaction costs in the real world). For the well behaved case when the drift is absolutely continuous, they give an explicit formula for $X_{t}$ which includes martingale terms, which minimizes the modified cost functional in Lemma 2.5 in Lorenz and Schied (Citation2013) involving quadratic variation terms. Moreover, the process X is Gaussian if the stock price process is Gaussian. Theorem 2.6 in Lorenz and Schied (Citation2013) extends the classical Obizhaeva and Wang (Citation2013) solution for the no-signal case (see above).

In this article we compute an explicit solution for the optimal signal-adaptive liquidation strategy for a trader subject to power-law resilience and a Gaussian signal with zero temporary price impact, which is obtained as the solution to a Forward-Backward Stochastic Integral Equation (FBSIE). The natural choice for the admissible space of strategies turns out to be intimately related to the Fractional Gaussian Field (FGF) with covariance equal to G which lives in the space of tempered distributions, and the optimal trading speed is a Gaussian Volterra process of the form $u^{0} (t) + \bar{u} (t) + \int_{0}^{t} k (u, t) d W_{t}$ , where $u^{0} (t)$ is the (deterministic) solution for the non-signal case and k satisfies a family of Fredholm integral equations of the first kind (and $\bar{u} (t)$ also satisfies a single Fredholm equation of the first kind) all of which can be solved explicitly using the known solution given in e.g. Chakrabarti and George (Citation1994), or more symbolically in terms of the adjoint of the square root of the linear operator associated with G. This generalizes the earlier work of Gatheral et al. (Citation2012) for the no-signal case, and complements the recent work of Neuman and Voß (Citation2020) and has the advantage over (Neuman and Voß Citation2020) that we impose the full liquidation constraint $X_{T} = 0$ .

The layout of the article is as follows: Section 2.1 derives the first order optimality condition for a general signal $ξ_{t}$ , Section 2.2 contains the main Theorem 2.2 which specializes Section 2.1 to the case of Gaussian signals, Section 2.3 recalls the known solution for the special case of zero signal which is also relevant to Theorem 2.2, Section 2.4 computes the expected profit/loss for the trading strategy in Theorem 2.2 and Section 2.5 re-writes the optimal solution in Theorem 2.2 in a more natural/practical way in terms of the observable price process itself (and may be of independent interest). Section 3.1 describes the most interesting and relevant example of price process to consider for Theorem 2.2 (namely a rough Gaussian Volterra process) with numerical simulations, and Section 3.2 makes a minor addition to the setup in Section 2.2 with the addition of the usual temporary price impact term. Finally Section 4 calibrates the model to real limit order book data for Apple, Cisco and Vodafone stocks using a discretized version of the model with difference equations.

2. The model setup

We work on a probability space $(Ω, F, P)$ throughout, with a filtration $(F_{t})_{t \geq 0}$ which satisfies the usual conditions, and $E_{t} (\cdot)$ will denote $E (\cdot | F_{t})$ . We consider an agent subject to transient price impact where the execution price for an asset at time t is (1) $S_{t} = P_{t} + \int_{0}^{t} G (t - s) d X_{s},$ (1) where $X_{t} = X_{0} - \int_{0}^{t} u_{s} d s$ is the number of shares held at time t, which we assume is absolutely continuous in t so $u_{t}$ is the selling speed, and P is some $F_{t}$ -progressively measurable process P with $E (P_{t}^{2}) < \infty$ for all $t \in [0, T]$ (which we refer to as the unaffected price process). $\int_{0}^{t} G (t - s) d X_{s}$ represents the cumulative effect of our trading activities on the current stock price, and G is the decay kernel, which characterizes resilience of price impact between trades.

From here on we assume that $G (t) = c t^{- γ}$ for $γ \in (0, 1)$ for some constant c>0.

We set $ξ_{t} := E_{t} (P_{T} - P_{t}) .$ Then a natural criterion is to maximize the agent's expected profit/loss at T: $\begin{aligned} V (u) & = E (\int_{0}^{T} (P_{t} - \int_{0}^{t} G (t - s) u_{s} d s) u_{t} d t + P_{T} X_{T}) \\ = E (\int_{0}^{T} (P_{t} - \int_{0}^{t} G (t - s) u_{s} d s) u_{t} d t \\ + P_{T} (X_{0} - \int_{0}^{T} u_{t} d t)) \\ = E (P_{T} X_{0}) + E (\int_{0}^{T} (P_{t} - P_{T} \\ - \int_{0}^{t} G (t - s) u_{s} d s) u_{t} d t) \end{aligned}$ over $U_{0}^{X_{0}}$ , where $U_{0}^{x}$ denote the space of $F_{t}$ -progressively measurable processes u such that $X_{T} = x - \int_{0}^{T} u_{t} d t = 0$ (i.e. we must liquidate all inventory by time T) such that $E (\int_{0}^{T} | u_{t} (P_{t} - P_{T}) | d t) < \infty$ and $E (\int_{0}^{T} \int_{0}^{t} | G (t - s) u_{s} u_{t} | d s d t) < \infty$ .

One can in principle add additional penalty terms to our performance criterion (the most common being a quadratic inventory penalty of the form $c o n s t . \times \int_{0}^{T} X_{t}^{2} d t$ to penalize large positions before T) but our optimal solution is already rather complicated to compute, so we leave the details of this for future works. We also remind the reader that since we are imposing full liquidation, we implicitly already have an infinite penalty here for non-liquidation.

Remark 2.1

From Fubini's theorem, we know that $u \in U_{0}^{x}$ also implies that $\int_{0}^{T} E (| u_{t} (P_{t} - P_{T}) |) d t < \infty$ and $\int_{0}^{T} \int_{0}^{t} E (| G (t - s) u_{s} u_{t} |) d t < \infty$ .

From Fubini's theorem and the definition of $U_{0}^{X_{0}}$ , we can re-write $V (u)$ as (2) $\begin{aligned} V (u) & = E (P_{T} X_{0}) + \int_{0}^{T} E ((P_{t} - P_{T}) u_{t}) d t \\ - E (\int_{0}^{t} G (t - s) u_{s} d s u_{t} d t) \\ = E (P_{T} X_{0}) - \int_{0}^{T} E (u_{t} ξ_{t}) d t \\ - E (\int_{0}^{t} G (t - s) u_{s} d s u_{t} d t) \\ (u s i n g t h e t o w e r p r o p e r t y) \\ = X_{0} E (P_{T}) - E (\int_{0}^{T} (ξ_{t} + \int_{0}^{t} G (t - s) u_{s} d s) u_{t} d t), \end{aligned}$ (2) where we have used Fubini again in the final line, since $\begin{aligned} \int_{0}^{T} E (| u_{t} ξ_{t} |) d t & = \int_{0}^{T} E (| u_{t} E_{t} (P_{T} - P_{t}) |) d t \\ = \int_{0}^{T} E (| E_{t} (u_{t} (P_{T} - P_{t})) |) d t \\ (b y c o n d i t i o n a l J e n s e n) \\ \leq \int_{0}^{T} E (E_{t} (| u_{t} (P_{T} - P_{t}) |)) d t \\ = \int_{0}^{T} E (| u_{t} (P_{T} - P_{t}) |) d t, \end{aligned}$ which is finite for $u \in U_{0}^{X_{0}}$ (see Remark 2.1). Since $X_{0} E (P_{T})$ is independent of u, for convenience we henceforth work with the modified functional: (3) $\tilde{V} (u) = - E (\int_{0}^{T} (ξ_{t} + \int_{0}^{t} G (t - s) u_{s} d s) u_{t} d t) .$ (3) Note that we do not assume that S is a semimartingale (as is usually assumed in the literature).

2.1. The first order condition for the optimizer

We now establish the first order optimality condition for an optimal trading strategy using variational and convexity arguments, similar to Section 5 in Bank et al. (Citation2017).

Theorem 2.1

A sufficient condition for $u \in U_{0}^{X_{0}}$ to be an optimal trading strategy is that u satisfies the Forward-Backward Stochastic Integral equation (FBSIE): (4) $ξ_{t} + E_{t} (\int_{0}^{T} G (| t - v |) u_{v} d v) = M_{t} a . s .$ (4) for $t \in [0, T]$ for some martingale M such that $X_{T} = 0$ .

Remark 2.2

Note that (Equation4(4) $ξ_{t} + E_{t} (\int_{0}^{T} G (| t - v |) u_{v} d v) = M_{t} a . s .$ (4) ) by itself does not uniquely determine the optimal u, we need the additional terminal condition $X_{T} = 0$ as well (see e.g. Lemma 5.2(ii)) in Bank et al. (Citation2017) and equation (3.5) in Belak et al. (Citation2020) for qualitatively similar results for different problems).

Proof.

Let $L = {u \in A : ⟨ u, u ⟩_{G} < \infty}$ , where $⟨ u, v ⟩_{G} := E (\int_{0}^{T} u_{t} \int_{0}^{T} v_{s} G (| t - s |) d s d t)$ and $A$ is the space of $F_{t}$ -progressively measurable processes.

Perturbing u to $u + ε u^{1}$ with $u^{1} \in U_{0}^{0}$ (i.e. a round trip so $\int_{0}^{T} u_{t}^{1} d t = 0$ ) we find that (5) $\begin{aligned} \tilde{V} (u + ε u^{1}) \\ = - E (\int_{0}^{T} (ξ_{t} + \int_{0}^{t} (u_{s} + ε u_{s}^{1}) G (t - s) d s) \\ \times (u_{t} + ε u_{t}^{1}) d t) \\ = \tilde{V} (u) - ε E (\int_{0}^{T} ξ_{t} u_{t}^{1} d t + \int_{0}^{T} u_{t}^{1} \int_{0}^{t} u_{s} G (t - s) d s d t \\ + \int_{0}^{T} u_{t} \int_{0}^{t} u_{s}^{1} G (t - s) d s d t)) \\ - ε^{2} E (\int_{0}^{T} u_{t}^{1} \int_{0}^{t} u_{s}^{1} G (t - s) d s d t) \\ = \tilde{V} (u) + \tilde{V} (ε u^{1}) - ε E (\int_{0}^{T} u_{t}^{1} \int_{0}^{t} u_{s} G (t - s) d s d t) \\ - ε E (\int_{0}^{T} u_{t} \int_{0}^{t} u_{s}^{1} G (t - s) d s d t) \\ = \tilde{V} (u) + \tilde{V} (ε u^{1}) - ε E (\int_{0}^{T} u_{t}^{1} \int_{0}^{t} u_{s} G (t - s) d s d t) \\ - ε E (\int_{0}^{T} u_{s} \int_{0}^{s} u_{t}^{1} G (s - t) d s d t) \\ = \tilde{V} (u) + \tilde{V} (ε u^{1}) - ε E (\int_{0}^{T} u_{t}^{1} \int_{0}^{t} u_{s} G (t - s) d s d t) \\ - ε E (\int_{0}^{T} u_{t}^{1} \int_{t}^{T} u_{s} G (s - t) d s d t) \\ = \tilde{V} (u) + \tilde{V} (ε u^{1}) - ε E (\int_{0}^{T} u_{t}^{1} \int_{0}^{T} u_{s} G (| t - s |) d s d t) \\ = \tilde{V} (u) + \tilde{V} (ε u^{1}) - ε ⟨ u^{1}, u ⟩_{G} . \end{aligned}$ (5) From the definition of $U_{0}^{X_{0}}$ above, we know that $u \in U_{0}^{X_{0}}$ implies that $E (\int_{0}^{T} \int_{0}^{t} G (t - s) u_{s} u_{t} d s d t) = ∥ u ∥_{G}^{2} < \infty$ .

The $O (ε)$ component of (Equation5(5) $\begin{aligned} \tilde{V} (u + ε u^{1}) \\ = - E (\int_{0}^{T} (ξ_{t} + \int_{0}^{t} (u_{s} + ε u_{s}^{1}) G (t - s) d s) \\ \times (u_{t} + ε u_{t}^{1}) d t) \\ = \tilde{V} (u) - ε E (\int_{0}^{T} ξ_{t} u_{t}^{1} d t + \int_{0}^{T} u_{t}^{1} \int_{0}^{t} u_{s} G (t - s) d s d t \\ + \int_{0}^{T} u_{t} \int_{0}^{t} u_{s}^{1} G (t - s) d s d t)) \\ - ε^{2} E (\int_{0}^{T} u_{t}^{1} \int_{0}^{t} u_{s}^{1} G (t - s) d s d t) \\ = \tilde{V} (u) + \tilde{V} (ε u^{1}) - ε E (\int_{0}^{T} u_{t}^{1} \int_{0}^{t} u_{s} G (t - s) d s d t) \\ - ε E (\int_{0}^{T} u_{t} \int_{0}^{t} u_{s}^{1} G (t - s) d s d t) \\ = \tilde{V} (u) + \tilde{V} (ε u^{1}) - ε E (\int_{0}^{T} u_{t}^{1} \int_{0}^{t} u_{s} G (t - s) d s d t) \\ - ε E (\int_{0}^{T} u_{s} \int_{0}^{s} u_{t}^{1} G (s - t) d s d t) \\ = \tilde{V} (u) + \tilde{V} (ε u^{1}) - ε E (\int_{0}^{T} u_{t}^{1} \int_{0}^{t} u_{s} G (t - s) d s d t) \\ - ε E (\int_{0}^{T} u_{t}^{1} \int_{t}^{T} u_{s} G (s - t) d s d t) \\ = \tilde{V} (u) + \tilde{V} (ε u^{1}) - ε E (\int_{0}^{T} u_{t}^{1} \int_{0}^{T} u_{s} G (| t - s |) d s d t) \\ = \tilde{V} (u) + \tilde{V} (ε u^{1}) - ε ⟨ u^{1}, u ⟩_{G} . \end{aligned}$ (5) ) can be re-written as (6) $\begin{aligned} - E (\int_{0}^{T} ξ_{t} u_{t}^{1} d t + \int_{0}^{T} u_{t}^{1} \int_{0}^{t} u_{s} G (t - s) d s d t \\ + \int_{0}^{T} u_{t} \int_{0}^{t} u_{s}^{1} G (t - s) d s d t) \\ = - E (\int_{0}^{T} ξ_{t} u_{t}^{1} d t + \int_{0}^{T} u_{t}^{1} \int_{0}^{t} u_{s} G (t - s) d s d t \\ + \int_{0}^{T} u_{s}^{1} \int_{s}^{T} G (t - s) u_{t} d t d s) \\ = - E (\int_{0}^{T} ξ_{t} u_{t}^{1} d t + \int_{0}^{T} u_{t}^{1} [\int_{0}^{t} u_{s} G (t - s) d s d t \\ + \int_{t}^{T} u_{s} G (s - t) d s] d t) \\ = - E (\int_{0}^{T} u_{t}^{1} (ξ_{t} + \int_{0}^{T} u_{s} G (| t - s |) d s) d t) \\ = - E (\int_{0}^{T} u_{t}^{1} [ξ_{t} + E_{t} (\int_{0}^{T} u_{s} G (| t - s |) d s)] d t) . \end{aligned}$ (6) Now assume that (Equation4(4) $ξ_{t} + E_{t} (\int_{0}^{T} G (| t - v |) u_{v} d v) = M_{t} a . s .$ (4) ) is satisfied which implies $M_{t} := ξ_{t} + E_{t} (\int_{0}^{T} G (| t - s |) u_{s} d s) = E_{t} (\int_{0}^{T} G (| T - v |) u_{v} d v)$ . Then we see that (7) $\begin{aligned} E (\int_{0}^{T} u_{t}^{1} M_{t} d t) \\ = E (\int_{0}^{T} u_{t}^{1} (ξ_{t} + E_{t} (\int_{0}^{T} u_{s} G (| t - s |) d s)) d t) . \end{aligned}$ (7) The second term on the right in (Equation7(7) $\begin{aligned} E (\int_{0}^{T} u_{t}^{1} M_{t} d t) \\ = E (\int_{0}^{T} u_{t}^{1} (ξ_{t} + E_{t} (\int_{0}^{T} u_{s} G (| t - s |) d s)) d t) . \end{aligned}$ (7) ) is just $⟨ u, u_{1} ⟩_{G}$ , which we know is finite from Lemma A.1, and the first term on the right is also finite from the definition of $U_{0}^{0}$ . The following observations will be needed in what follows:

$\int_{0}^{T} E (| u_{t}^{1} ξ_{t} |) d t = \int_{0}^{T} E (| E_{t} (u_{t}^{1} (P_{T} - P_{t})) |) d t \leq \int_{0}^{T} E (E_{t} (| u_{t}^{1} (P_{T} - P_{t}) |)) d t = \int_{0}^{T} E (| u_{t}^{1} (P_{T} - P_{t}) |) d t$ ,which is finite for $u^{1} \in U_{0}^{0}$ (see the definition of $U_{0}^{x}$ and Remark 2.1)
Similarly $\int_{0}^{T} E (| u_{t}^{1} E_{t} (\int_{0}^{T} G (| t - v |) u_{v} d v) |) d t = \int_{0}^{T} E (| E_{t} (u_{t}^{1} (\int_{0}^{T} G (| t - v |) u_{v} d v)) |) d t \leq \int_{0}^{T} E (E_{t} (| u_{t}^{1} (\int_{0}^{T} G (| t - v |) u_{v} d v) |)) d t = \int_{0}^{T} E (| u_{t}^{1} (\int_{0}^{T} G (| t - v |) u_{v} d v) |) d t \leq ⟨ | u^{1} |, | u | ⟩_{G}$ , which is finite by Lemma A.1 since $| u |$ and $| u^{1} |$ are in $U_{0}^{X_{0}}$ and $U_{0}^{0}$ respectively, which implies they are also in $L$ .

Then using that $M_{t} = ξ_{t} + E_{t} (\int_{0}^{T} u_{s} G (| t - s |) d s)$ and the two bullet points immediately above, we can apply Fubini and the tower property to say that $\begin{aligned} E (\int_{0}^{T} u_{t}^{1} M_{t} d t) & = E (\int_{0}^{T} u_{t}^{1} E_{t} (M_{T}) d t) \\ = E (\int_{0}^{T} E_{t} (u_{t}^{1} M_{T}) d t) \\ = \int_{0}^{T} E (E_{t} (u_{t}^{1} M_{T})) d t \\ = \int_{0}^{T} E (u_{t}^{1} M_{T}) d t \\ = E (M_{T} \int_{0}^{T} u_{t}^{1} d t) \\ = 0, \end{aligned}$ since $u^{1}$ is a round trip. Thus (Equation6(6) $\begin{aligned} - E (\int_{0}^{T} ξ_{t} u_{t}^{1} d t + \int_{0}^{T} u_{t}^{1} \int_{0}^{t} u_{s} G (t - s) d s d t \\ + \int_{0}^{T} u_{t} \int_{0}^{t} u_{s}^{1} G (t - s) d s d t) \\ = - E (\int_{0}^{T} ξ_{t} u_{t}^{1} d t + \int_{0}^{T} u_{t}^{1} \int_{0}^{t} u_{s} G (t - s) d s d t \\ + \int_{0}^{T} u_{s}^{1} \int_{s}^{T} G (t - s) u_{t} d t d s) \\ = - E (\int_{0}^{T} ξ_{t} u_{t}^{1} d t + \int_{0}^{T} u_{t}^{1} [\int_{0}^{t} u_{s} G (t - s) d s d t \\ + \int_{t}^{T} u_{s} G (s - t) d s] d t) \\ = - E (\int_{0}^{T} u_{t}^{1} (ξ_{t} + \int_{0}^{T} u_{s} G (| t - s |) d s) d t) \\ = - E (\int_{0}^{T} u_{t}^{1} [ξ_{t} + E_{t} (\int_{0}^{T} u_{s} G (| t - s |) d s)] d t) . \end{aligned}$ (6) ) is zero, so (Equation4(4) $ξ_{t} + E_{t} (\int_{0}^{T} G (| t - v |) u_{v} d v) = M_{t} a . s .$ (4) ) is a sufficient condition for u to be a local optimizer. Moreover, using the Plancherel identity, we can re-write the expectation in the $O (ε^{2})$ term in (Equation5(5) $\begin{aligned} \tilde{V} (u + ε u^{1}) \\ = - E (\int_{0}^{T} (ξ_{t} + \int_{0}^{t} (u_{s} + ε u_{s}^{1}) G (t - s) d s) \\ \times (u_{t} + ε u_{t}^{1}) d t) \\ = \tilde{V} (u) - ε E (\int_{0}^{T} ξ_{t} u_{t}^{1} d t + \int_{0}^{T} u_{t}^{1} \int_{0}^{t} u_{s} G (t - s) d s d t \\ + \int_{0}^{T} u_{t} \int_{0}^{t} u_{s}^{1} G (t - s) d s d t)) \\ - ε^{2} E (\int_{0}^{T} u_{t}^{1} \int_{0}^{t} u_{s}^{1} G (t - s) d s d t) \\ = \tilde{V} (u) + \tilde{V} (ε u^{1}) - ε E (\int_{0}^{T} u_{t}^{1} \int_{0}^{t} u_{s} G (t - s) d s d t) \\ - ε E (\int_{0}^{T} u_{t} \int_{0}^{t} u_{s}^{1} G (t - s) d s d t) \\ = \tilde{V} (u) + \tilde{V} (ε u^{1}) - ε E (\int_{0}^{T} u_{t}^{1} \int_{0}^{t} u_{s} G (t - s) d s d t) \\ - ε E (\int_{0}^{T} u_{s} \int_{0}^{s} u_{t}^{1} G (s - t) d s d t) \\ = \tilde{V} (u) + \tilde{V} (ε u^{1}) - ε E (\int_{0}^{T} u_{t}^{1} \int_{0}^{t} u_{s} G (t - s) d s d t) \\ - ε E (\int_{0}^{T} u_{t}^{1} \int_{t}^{T} u_{s} G (s - t) d s d t) \\ = \tilde{V} (u) + \tilde{V} (ε u^{1}) - ε E (\int_{0}^{T} u_{t}^{1} \int_{0}^{T} u_{s} G (| t - s |) d s d t) \\ = \tilde{V} (u) + \tilde{V} (ε u^{1}) - ε ⟨ u^{1}, u ⟩_{G} . \end{aligned}$ (5) ) (up to a minus sign) as $\begin{aligned} E (\int_{0}^{T} u_{t}^{1} \int_{0}^{T} u_{s}^{1} G (| t - s |) d s d t) \\ = E (\int_{- \infty}^{\infty} u_{t}^{1} \int_{- \infty}^{\infty} u_{s}^{1} G (| t - s |) d s d t) \\ = E (\int_{- \infty}^{\infty} {\hat{u}}^{1} (k) \bar{\hat{u^{1}} (k)} \hat{G} (k) d k) \\ = E (\int_{- \infty}^{\infty} | {\hat{u}}^{1} (k) |^{2} \hat{G} (k) d k) \geq 0, \end{aligned}$ where we are setting $u^{1} \equiv 0$ outside $[0, T]$ , and $\hat{G} (k) = c_{γ} | k |^{γ - 1}$ for some constant $c_{γ}$ ; hence $\tilde{V} (u + ε u^{1})$ is concave in ϵ, so any local optimizer is a global optimizer.

2.2. Gaussian signals

We now assume that $ξ_{t}$ is a Gaussian Volterra process of the form (8) $ξ_{t} = \bar{ξ} (t) + \int_{0}^{t} K_{ξ} (u, t) d W_{u}$ (8) for some deterministic function $\bar{ξ} (t)$ , where W is a standard Brownian motion and $\int_{0}^{t} K_{ξ} (u, t)^{2} d u < \infty$ for all $t \in [0, T]$ and $F_{t} = F_{t}^{W}$ . Given that $ξ_{T} = E_{T} (P_{T} - P_{T}) = 0$ is a Normal random variable with zero mean and zero variance, we see that (9) $\bar{ξ} (T) = K_{ξ} (u, T) = 0$ (9) for all $u \in [0, T]$ . Let (10) $\begin{aligned} k (u, t) & = \frac{1}{c | T - u |^{1 - γ}} G_{1}^{- 1} (- K_{ξ} (u, u + (T - u) (\cdot)) \\ - λ_{1} (u)) (\frac{t - u}{T - u}) a n d \\ λ_{1} (u) & = - \frac{1}{{\bar{c}}_{γ}} \int_{0}^{1} G_{1}^{- 1} (K_{ξ} (u, u + (T - u) (\cdot))) (s) d s \end{aligned}$ (10) where ${\bar{c}}_{γ} = \frac{2^{\frac{1}{2} (3 - Γ)} π^{\frac{5}{4}} (T - u) Γ (\frac{1}{2} (3 + γ)) \sec (\frac{1}{2} π γ)}{(1 + γ) Γ (\frac{1}{2} (1 - γ))^{\frac{3}{2}} \sqrt{Γ (\frac{1}{2} γ)} Γ (1 + γ)}$ , and the operator $G_{1}$ is defined by (11) $(G_{1} φ) (t) := \int_{0}^{1} φ (s) G (t - s) d s .$ (11) $G_{1}^{- 1} (f)$ for a general function f has an explicit form which is stated and used in the proof of Theorem 2.2.

We let $X^{0} (t) = X_{0} - \int_{0}^{t} u^{0} (s) d s$ denote the (deterministic) solution to the same problem but with no signal (see Subsection 2.3 for the explicit solution for $X^{0}$ ).

We now state the main result of the article:

Theorem 2.2

If $K_{ξ}$ is such that $\int_{0}^{\cdot} k (v, \cdot) d W_{v} \in U_{0}^{0}$ , then the optimal trading strategy $X^{*}$ is given by $d X_{t}^{*} = d X^{0} (t) - \hat{u} (t) d t$ , where $\hat{u} (t) = \bar{u} (t) + \int_{0}^{t} k (v, t) d W_{v}$ is a Gaussian Volterra process on $[0, T)$ and $k (u, \cdot)$ and $\bar{u} (t)$ are the unique solutions to the following Fredholm integral equations of the first kind: (12) $\begin{aligned} - K_{ξ} (u, t) & = \int_{u}^{T} G (| t - v |) k (u, v) d v + λ (u) \end{aligned}$ (12) (13) $\begin{aligned} - \bar{ξ} (t) & = \int_{0}^{T} G (| t - v |) \bar{u} (v) d v + λ_{2} \end{aligned}$ (13) where the first equation holds for each $u \in [0, T]$ fixed and all $t \in [u, T]$ , and the function $λ (u)$ and the constant $λ_{2}$ are chosen (uniquely) to ensure that $E (X_{T}^{2}) = 0$ , for which the following two conditions are necessary and sufficient: (14) $\begin{aligned} \int_{u}^{T} k (u, t) d t = 0 f o r a l l u \in [0, T], \int_{0}^{T} \bar{u} (v) d v = 0 . \end{aligned}$ (14) $d \hat{X} (t) = - \hat{u} (t) d t$ is the optimal solution to the round trip problem, i.e. for the case $X_{0} = 0$ .

Proof.

We break up the proof into multiple parts.

Deriving the Fredholm equation. We first assume $X_{0} = 0$ (at the end of the proof we show how to extend to the general case with case $X_{0} \neq 0$ ). Since $\hat{u}$ has to be adapted, we guess that ${\hat{u}}_{t} = \bar{u} (t) + \int_{0}^{t} k (v, t) d W_{v}$ , so $E_{t} ({\hat{u}}_{v}) = \bar{u} (v) + \int_{0}^{t \land v} k (u, v) d W_{u}$ . Then from (Equation4(4) $ξ_{t} + E_{t} (\int_{0}^{T} G (| t - v |) u_{v} d v) = M_{t} a . s .$ (4) ) we see that $\begin{aligned} 0 & = ξ_{t} + E_{t} (\int_{0}^{T} (G (| t - v |) - G (| T - v |)) {\hat{u}}_{v} d v) \\ = \bar{ξ} (t) + \int_{0}^{t} K_{ξ} (u, t) d W_{u} + \int_{0}^{T} (G (| t - v |) \\ - G (| T - v |)) \bar{u} (v) d v + \int_{0}^{T} (G (| t - v |) \\ - G (T - v)) \int_{0}^{t \land v} k (u, v) d W_{u}) d v \\ = \int_{0}^{t} [K_{ξ} (u, t) + \int_{u}^{T} k (u, v) (G (| t - v |) \\ - G (T - v)) d v] d W_{u} + \bar{ξ} (t) + \int_{0}^{T} (G (| t - v |) \\ - G (| T - v |)) \bar{u} (v) d v . \end{aligned}$ Then we see that this is zero for all $t \in [0, T]$ a.s. if and only if (15) $\begin{aligned} - K_{ξ} (u, t) & = \int_{u}^{T} k (u, v) (G (| t - v |) - G (T - v)) d v \end{aligned}$ (15) (16) $\begin{aligned} - \bar{ξ} (t) & = \int_{0}^{T} (G (| t - v |) - G (| T - v |)) \bar{u} (v) d v \end{aligned}$ (16) are satisfied for all u, t with $0 \leq u \leq t \leq T$ .
Enforcing the liquidation condition. Now consider a solution $k (u, \cdot)$ to (Equation12(12) $\begin{aligned} - K_{ξ} (u, t) & = \int_{u}^{T} G (| t - v |) k (u, v) d v + λ (u) \end{aligned}$ (12) ) for all $u \in [0, T]$ , where $λ (u)$ will be chosen to ensure that $E (X_{T}^{2}) = 0$ , and we will see that this implies that $k (u, \cdot)$ satisfies (Equation15(15) $\begin{aligned} - K_{ξ} (u, t) & = \int_{u}^{T} k (u, v) (G (| t - v |) - G (T - v)) d v \end{aligned}$ (15) ) and (Equation16(16) $\begin{aligned} - \bar{ξ} (t) & = \int_{0}^{T} (G (| t - v |) - G (| T - v |)) \bar{u} (v) d v \end{aligned}$ (16) ) for all $u \in [0, T]$ as well. Setting $\hat{u} (t) = \bar{u} (t) + \int_{0}^{t} k (v, t) d W_{v}$ we see that $\begin{aligned} X_{t} & = - \int_{0}^{t} \bar{u} (v) d v - \int_{0}^{t} \int_{0}^{s} k (v, s) d W_{v} d s \\ = - \int_{0}^{t} \bar{u} (v) d v - \int_{0}^{t} \int_{v}^{t} k (v, s) d s d W_{v} \end{aligned}$ so in particular (17) $\begin{aligned} X_{T} & = - \int_{0}^{T} \bar{u} (v) d v - \int_{0}^{T} \int_{0}^{t} k (v, t) d W_{v} d t \\ = - \int_{0}^{T} \bar{u} (v) d v - \int_{0}^{T} \int_{v}^{T} k (v, t) d t d W_{v} . \end{aligned}$ (17) Consequently, to impose that $E (X_{T}^{2}) = 0$ , we see that both equations in (Equation14(14) $\begin{aligned} \int_{u}^{T} k (u, t) d t = 0 f o r a l l u \in [0, T], \int_{0}^{T} \bar{u} (v) d v = 0 . \end{aligned}$ (14) ) must hold, the first of which determines $λ (u)$ and second determines the constant $λ_{2}$ (below we will show that $λ (u)$ and $λ_{2}$ are uniquely determined using operator formalism and we give an explicit formula in (Equation21(21) $\begin{aligned} λ (u) = - \frac{\int_{u}^{T} G_{1}^{- 1} (K_{ξ} (u, u + (T - u) (\cdot))) (\frac{t - u}{T - u}) d t}{\int_{u}^{T} G_{1}^{- 1} (1) (\frac{t - u}{T - u}) d t} . \end{aligned}$ (21) )). Then setting t = T in (Equation12(12) $\begin{aligned} - K_{ξ} (u, t) & = \int_{u}^{T} G (| t - v |) k (u, v) d v + λ (u) \end{aligned}$ (12) ) and using that $K_{ξ} (u, T) = 0$ (from (Equation9(9) $\bar{ξ} (T) = K_{ξ} (u, T) = 0$ (9) )), we see that $0 = \int_{u}^{T} G (| T - v |) k (u, v) d v + λ (u)$ so (Equation15(15) $\begin{aligned} - K_{ξ} (u, t) & = \int_{u}^{T} k (u, v) (G (| t - v |) - G (T - v)) d v \end{aligned}$ (15) ) is indeed satisfied. Similarly using that $\bar{ξ} (T) = 0$ (from (Equation9(9) $\bar{ξ} (T) = K_{ξ} (u, T) = 0$ (9) )) we find that $\int_{0}^{T} G (| T - v |) \bar{u} (v) d v + λ_{2} = 0$ , so (Equation12(12) $\begin{aligned} - K_{ξ} (u, t) & = \int_{u}^{T} G (| t - v |) k (u, v) d v + λ (u) \end{aligned}$ (12) ) implies (Equation16(16) $\begin{aligned} - \bar{ξ} (t) & = \int_{0}^{T} (G (| t - v |) - G (| T - v |)) \bar{u} (v) d v \end{aligned}$ (16) ).
Explicit computation of $λ (u)$ and $λ_{2}$ . We now transform (Equation12(12) $\begin{aligned} - K_{ξ} (u, t) & = \int_{u}^{T} G (| t - v |) k (u, v) d v + λ (u) \end{aligned}$ (12) ) so the range of integration is $[0, 1]$ . To this end, we first re-write (Equation12(12) $\begin{aligned} - K_{ξ} (u, t) & = \int_{u}^{T} G (| t - v |) k (u, v) d v + λ (u) \end{aligned}$ (12) ) in the form $c \int_{u}^{T} \frac{g (v)}{| x - v |^{γ}} d v = \tilde{f} (x)$ where $g (v) = k (u, v)$ and $\tilde{f} (x) = - K_{ξ} (u, x) - λ (u)$ and let $w = \frac{v - u}{T - u}$ , so $d w = \frac{d v}{T - u}$ , then we can re-write this as $\begin{aligned} c (T - u) \int_{0}^{1} \frac{g ((T - u) w + u)}{| x - (T - u) w - u |^{γ}} d w & = c (T - u) \\ \int_{0}^{1} \frac{g_{1} (w)}{| x - (T - u) w - u |^{γ}} d w = \tilde{f} (x) \end{aligned}$ where $g_{1} (w) = g ((T - u) w + u)$ , where our notation is chosen so as to be consistent with that used in Chakrabarti and George (Citation1994). Now let $x - u = (T - u) x^{'}$ to obtain (18) $\begin{aligned} c & (T - u) \int_{0}^{1} \frac{g_{1} (w)}{| (T - u) x^{'} - (T - u) w |^{γ}} d w \\ = c | T - u |^{1 - γ} \int_{0}^{1} \frac{g_{1} (w)}{| x^{'} - w |^{γ}} d w \\ = \tilde{f} (u + (T - u) x^{'}) \end{aligned}$ (18) which we can re-write more succinctly as (19) $G_{1} g_{1} = \frac{\tilde{f} (u + (T - u) (\cdot))}{c | T - u |^{1 - γ}},$ (19) where $G_{1}$ is the operator defined in (Equation11(11) $(G_{1} φ) (t) := \int_{0}^{1} φ (s) G (t - s) d s .$ (11) ). Then from (Equation12(12) $\begin{aligned} - K_{ξ} (u, t) & = \int_{u}^{T} G (| t - v |) k (u, v) d v + λ (u) \end{aligned}$ (12) ) and the linearity of $G_{1}^{- 1}$ , we see that (20) $\begin{aligned} k (u, t) & = g (t) \\ = \frac{1}{c | T - u |^{1 - γ}} G_{1}^{- 1} \tilde{f} (u, u + (T - u) (\cdot)) \\ \times (\frac{t - u}{T - u}) \\ = \frac{1}{c | T - u |^{1 - γ}} G_{1}^{- 1} (- K_{ξ} (u, u + (T - u) (\cdot)) \\ - λ (u)) (\frac{t - u}{T - u}) . \end{aligned}$ (20) Integrating from t = u to T and using that $\int_{u}^{T} k (u, t) d t = 0$ for all $u \in [0, T]$ and moving the $λ (u)$ term to the other side and canceling terms, we see that $\begin{aligned} \int_{u}^{T} G_{1}^{- 1} (- K_{ξ} (u, u + (T - u) (\cdot))) (\frac{t - u}{T - u}) d t \\ = \int_{u}^{T} G_{1}^{- 1} (λ (u)) (\frac{t - u}{T - u}) d t, \end{aligned}$ so by the linearity of $G_{1}^{- 1}$ , we see that (21) $\begin{aligned} λ (u) = - \frac{\int_{u}^{T} G_{1}^{- 1} (K_{ξ} (u, u + (T - u) (\cdot))) (\frac{t - u}{T - u}) d t}{\int_{u}^{T} G_{1}^{- 1} (1) (\frac{t - u}{T - u}) d t} . \end{aligned}$ (21) Moreover, from Example 2.30 in Gatheral et al. (Citation2012), we know that (22) $G_{1}^{- 1} (1) (s) = \frac{c_{γ}}{(s (1 - s))^{\frac{1}{2} (1 - γ)}},$ (22) where $c_{γ} = [2^{γ - 1} Γ (\frac{1}{2} - \frac{1}{2} γ) Γ (\frac{1}{2} γ) / \sqrt{π}]^{- \frac{1}{2}}$ . Then $\int_{u}^{T} G_{1}^{- 1} (1) (\frac{t - u}{T - u}) d t = {\bar{c}}_{γ} (T - u)$ (where ${\bar{c}}_{γ}$ is defined in the statement of the Theorem), so $λ (u)$ simplifies to $\begin{aligned} λ (u) & = - \frac{1}{{\bar{c}}_{γ}} \frac{1}{T - u} \int_{u}^{T} G_{1}^{- 1} (K_{ξ} (u, u + (T - u) (\cdot))) \\ \times (\frac{t - u}{T - u}) d t \\ = - \frac{1}{{\bar{c}}_{γ}} \int_{0}^{1} G_{1}^{- 1} (K_{ξ} (u, u + (T - u) (\cdot))) (s) d s . \end{aligned}$ Similarly we find that $λ_{2} = - \frac{1}{{\bar{c}}_{γ}} \int_{0}^{1} G_{1}^{- 1} (\bar{ξ} (T (\cdot))) (s) d s$ and $\bar{u} (t) = \frac{1}{c T^{1 - γ}} G_{1}^{- 1} (- \bar{ξ} (T (\cdot)) - λ_{2}) (\frac{t}{T})$ and note that u = 0 in these last two formulae.
Decomposing $G_{1}$ and explicit computation of $G_{1}^{- 1}$ . From Example 9.2 (see also Example 6.2) in Porter and Stirling (Citation1990), setting $ν = γ$ we know that $G_{1}$ can be decomposed as $G_{1} = T T^{*}$ , where $T$ is the Volterra-type operator defined by $(T φ) (t) = \int_{0}^{t} κ (s, t) φ (s) d s$ and $κ (s, t) = c_{ν} (\frac{t}{s})^{(1 - γ) / 2} (t - s)^{- \frac{1}{2} (1 + γ)}$ for some constant $c_{ν}$ depending on ν, and $T^{*}$ is its adjoint given by $(T^{*} φ) (t) = \int_{s}^{T} κ (s, t) φ (t) d t$ (see e.g. the start of Appendix A of Forde and Zhang (Citation2017) to see why $T^{*}$ takes this form). Then we can further re-write $T$ as $T = B^{- 1} I_{ν} B$ , where B is the bounded operator on $L^{2}$ which multiplies functions by $t^{- (1 - ν) / 2}$ and $I_{ν}$ is the Riemann-Liouville operator $(I_{ν} φ) (t) := \int_{0}^{t} (t - s)^{- \frac{1}{2} (1 + γ)} φ (s) d s = \frac{1}{Γ (1 - r)} I^{r}$ where $r = \frac{1}{2} - \frac{1}{2} γ$ so $I_{ν}^{- 1} = Γ (1 - r) D^{r}$ , where $I^{r}$ and $D^{r}$ are the fractional derivative operators of order r. Summing this up, we can re-write (Equation18(18) $\begin{aligned} c & (T - u) \int_{0}^{1} \frac{g_{1} (w)}{| (T - u) x^{'} - (T - u) w |^{γ}} d w \\ = c | T - u |^{1 - γ} \int_{0}^{1} \frac{g_{1} (w)}{| x^{'} - w |^{γ}} d w \\ = \tilde{f} (u + (T - u) x^{'}) \end{aligned}$ (18) ) as $T T^{*} g_{1} = h_{1}$ for some function $h_{1}$ , which has solution $g_{1} = {T^{*}}^{- 1} (T^{- 1} h_{1}) .$ To compute $(T^{*})^{- 1}$ , we note that $(φ, T ψ) = (φ, B^{- 1} I_{ν} B ψ) = (B^{- 1} φ, I_{ν} B ψ) = (I_{ν}^{*} B^{- 1} φ, B ψ) = (B I_{ν}^{*} B^{- 1} φ, ψ)$ , so $T^{*} = B I_{ν}^{*} B^{- 1}$ , and we know how to invert B and $I_{ν}^{*}$ .
Practical computation of $k (u, t)$ . We can read off the solution to (Equation18(18) $\begin{aligned} c & (T - u) \int_{0}^{1} \frac{g_{1} (w)}{| (T - u) x^{'} - (T - u) w |^{γ}} d w \\ = c | T - u |^{1 - γ} \int_{0}^{1} \frac{g_{1} (w)}{| x^{'} - w |^{γ}} d w \\ = \tilde{f} (u + (T - u) x^{'}) \end{aligned}$ (18) ) more explicitly from Chakrabarti and George (Citation1994), with $f (x_{1}) = \frac{\tilde{f} (x^{'})}{| T - u |^{1 - γ}}$ and their a = b = c, for which the explicit solution is given in equations (3.14a) and (3.14b) in Chakrabarti and George (Citation1994) which we can re-write in our variables as $\begin{aligned} k (u, t) & = - t^{\bar{γ} + μ - 1} \frac{\sin^{2} (π \bar{γ})}{π^{2}} \frac{d}{d t} \\ \times \int_{t}^{1} \frac{1}{(s - t)^{\bar{γ}}} \int_{0}^{s} \frac{v^{- \bar{γ}} h (v)}{(s - v)^{1 - \bar{γ}}} d v w h e r e \\ h (t) & = \frac{t^{1 - γ}}{b} \frac{d}{d t} \int_{0}^{t} \frac{f (y)}{(x - y)^{1 - γ}} d y \end{aligned}$ and $μ = γ$ , $α + γ = 1$ , $- λ = \frac{π}{\sin (π (1 - γ))} + π \cot (π (1 - γ))$ and $\bar{γ}$ satisfies $| λ | = π \cot (π \bar{γ})$ with $0 < \bar{γ} < \frac{1}{2}$ (note $\bar{γ}$ here is the γ parameter in Chakrabarti and George (Citation1994) and our γ is the μ parameter in Chakrabarti and George (Citation1994).
Remark 2.3
For the case commonly considered where $γ = \frac{1}{2}$ , the α-parameter in Chakrabarti and George (Citation1994) is $1 - γ = \frac{1}{2}$ and their λ parameter is $- (a π / (b \sin (π α) - π \cot (π α) - = - π$ so their γ parameter is $\frac{1}{4}$ (which we call $γ_{1}$ to distinguish from our γ parameter).
If two distinct solutions exist to (Equation20(20) $\begin{aligned} k (u, t) & = g (t) \\ = \frac{1}{c | T - u |^{1 - γ}} G_{1}^{- 1} \tilde{f} (u, u + (T - u) (\cdot)) \\ \times (\frac{t - u}{T - u}) \\ = \frac{1}{c | T - u |^{1 - γ}} G_{1}^{- 1} (- K_{ξ} (u, u + (T - u) (\cdot)) \\ - λ (u)) (\frac{t - u}{T - u}) . \end{aligned}$ (20) ), then we must have a non-zero solution φ to $G_{1} φ = 0$ , so in particular $\int_{[0, 1]} \int_{[0, 1]} φ (s) φ (t) G (| t - s |) d s d t = ⟨ φ, G_{1} φ ⟩_{L^{2}} = 0$ . But from Plancherel's theorem we know this quantity is equal to $\begin{aligned} \int_{[0, T]} \int_{[0, T]} φ (s) φ (t) G (| t - s |) d s d t \\ = \int_{- \infty}^{\infty} | \hat{φ} (k) |^{2} \hat{G} (k) d k = c o n s t . \times ∥ φ ∥_{H^{- \frac{1}{2} γ}}^{2}, \end{aligned}$ where $\hat{G} (k) = c_{γ} | k |^{γ - 1} > 0$ is the Fourier transform of G (see Appendix for the exact formula) for some constant $c_{γ} > 0$ , and $∥ . ∥_{H^{- s}}$ denotes the norm on the homogenous fractional Sobolev space of order $- s < 0$ (see Appendix for details, and references on this). Hence we cannot have two distinct solutions to (Equation20(20) $\begin{aligned} k (u, t) & = g (t) \\ = \frac{1}{c | T - u |^{1 - γ}} G_{1}^{- 1} \tilde{f} (u, u + (T - u) (\cdot)) \\ \times (\frac{t - u}{T - u}) \\ = \frac{1}{c | T - u |^{1 - γ}} G_{1}^{- 1} (- K_{ξ} (u, u + (T - u) (\cdot)) \\ - λ (u)) (\frac{t - u}{T - u}) . \end{aligned}$ (20) ) in $H^{- γ / 2}$ .
Extending to the general case $X_{0} \neq 0$ . For $X_{0} \neq 0$ , we can easily verify that $X^{0} (t) + {\hat{X}}_{t}$ satisfies (Equation4(4) $ξ_{t} + E_{t} (\int_{0}^{T} G (| t - v |) u_{v} d v) = M_{t} a . s .$ (4) ) (since the equation is linear in u), i.e. we can decompose the general solution as the (deterministic) no-signal solution plus the round trip solution (again see next subsection for details of how to compute $X^{0}$ ).

Remark 2.4

Note that $\bar{u} \equiv 0$ if $\bar{ξ} \equiv 0$ , since from the uniqueness part at the end of the proof, we know the solution to the Fredholm equation is unique.

Remark 2.5

If we replace W with an Itô process of the form $M_{t} = \int_{0}^{t} σ_{s}^{2} d W_{s}$ then the stochastic integral part of (Equation17(17) $\begin{aligned} X_{T} & = - \int_{0}^{T} \bar{u} (v) d v - \int_{0}^{T} \int_{0}^{t} k (v, t) d W_{v} d t \\ = - \int_{0}^{T} \bar{u} (v) d v - \int_{0}^{T} \int_{v}^{T} k (v, t) d t d W_{v} . \end{aligned}$ (17) ) will be replaced by $\int_{0}^{T} \int_{v}^{T} k (v, t) d t σ_{v} d W_{v}$ , whose variance is $E (\int_{0}^{T} (\int_{v}^{T} k (v, t) d t)^{2} σ_{v}^{2} d v) = \int_{0}^{T} (\int_{v}^{T} k (v, t) d t)^{2} E (σ_{v}^{2}) d v$ . Then if $E (σ_{v}^{2}) > 0$ for all v we still require that $\int_{v}^{T} k (v, t) d t = 0$ and (formally at least) Theorem 2.3 still holds if the proposed trading strategy is admissible. A potentially interesting example which falls in this framework is an affine driftless Rough-Heston model-type process for P of the form $P_{t} = P_{0} + c \int_{0}^{t} (t - s)^{H - \frac{1}{2}} \sqrt{P_{s}} d W_{s}$ , which also has the advantage that P is non-negative (we defer the details for future research).

2.3. The zero-signal case

For the case of power-law impact where $G (t) = c t^{- γ}$ for $γ \in (0, 1)$ , the optimal selling speed with no-signal satisfies (23) $\int_{0}^{T} G (| t - v |) u^{0} (v) d v = λ,$ (23) where λ is the unique constant which ensures that $X_{T} = X_{0} - \int_{0}^{T} u^{0} (t) d t = 0$ , and setting t = T we see that $\int_{0}^{T} (G (| t - v |) - G (| T - v |)) u^{0} (v) d v = 0$ which is consistent with (Equation4(4) $ξ_{t} + E_{t} (\int_{0}^{T} G (| t - v |) u_{v} d v) = M_{t} a . s .$ (4) ) for the case of zero signal. We can re-write (Equation23(23) $\int_{0}^{T} G (| t - v |) u^{0} (v) d v = λ,$ (23) ) using operator formalism as $G u^{0} = λ$ where $G φ (\cdot) := \int_{0}^{T} G (| (\cdot) - v |) φ (v) d v$ , so λ satisfies $X_{0} - λ \int_{0}^{T} G^{- 1} (1) (t) d t = 0$ and the solution is given by $u^{0} (t) = \frac{c_{1}}{(t (T - t))^{\frac{1}{2} (1 - γ)}}$ for some constant $c_{1}$ (see Example 2.30 in Gatheral et al. Citation2012, Curato et al. Citation2017).

2.4. Computing the expected optimal profit/loss

If $\bar{ξ} (t) \equiv 0$ , the expected profit/loss from the optimal trading strategy in Theorem 2.2 is $\begin{aligned} V (\hat{u}) & = E (P_{T} X_{0}) - E (\int_{0}^{T} (ξ_{t} + \int_{0}^{t} G (t - s) {\hat{u}}_{s} d s) {\hat{u}}_{t} d t) \\ = E (P_{T} X_{0}) - E (\int_{0}^{T} \int_{0}^{t} K_{ξ} (s, t) d W_{s} \\ \times (u^{0} (t) + \int_{0}^{t} k (u, t) d W_{u}) d t) \\ - E (\int_{0}^{T} \int_{0}^{t} G (t - s) (u^{0} (s) + \int_{0}^{s} k (u, s) d W_{u}) \\ \times (u^{0} (t) + \int_{0}^{t} k (v, t) d W_{v}) d s d t) \\ = E (P_{T} X_{0}) - \int_{0}^{T} \int_{0}^{t} K_{ξ} (u, t) k (u, t) d u d t) \\ - \int_{0}^{T} \int_{0}^{t} G (t - s) \int_{0}^{s} k (u, s) k (u, t) d u d s d t \\ - \int_{0}^{T} \int_{0}^{t} G (t - s) u^{0} (s) u^{0} (t) d s d t . \end{aligned}$ where the final line gives the contribution from $u^{0}$ . We can easily adapt this expression to include the case of a general non-zero $\bar{ξ} (t)$ but the expression will be a lot messier due to the squared terms. We have found Monte Carlo to be the most efficient way to compute this triple integral in practice, which is what was used to compute the right plot in figure .

Figure 1. On the left we have plotted the optimal inventory $X_{t}^{*}$ in Theorem 2.2 when $P_{t} = σ \int_{0}^{t} (t - s)^{H - \frac{1}{2}} d W_{s}$ is a Riemann-Liouville process using (Equation26(26) $\begin{aligned} k (u, t) & = - (2 c π^{\frac{3}{2}} τ^{\frac{3}{2}} {\bar{u}}^{\frac{1}{4}} Γ (H))^{- 1} \cdot [\frac{τ^{\frac{3}{2} + H} σ Γ (\frac{1}{4}) Γ (H_{\frac{1}{4}})}{w^{\frac{1}{4}} (u - t)} \\ + H_{\frac{1}{4}} τ^{\frac{1}{2} + H} {\bar{u}}^{- \frac{3}{4} + H} σ Γ (\frac{1}{4}) (- B (\bar{u}, - H_{\frac{1}{4}}, \frac{3}{4}) \\ + \frac{Γ (\frac{3}{4}) Γ (- H_{\frac{1}{4}})}{Γ (\frac{1}{2} - H)}) Γ (H_{\frac{1}{4}}) \\ + \frac{\sqrt{2 π} Γ (H) (τ^{\frac{1}{2} + H} σ + τ λ_{1} (u))}{w^{\frac{1}{4}}}], \end{aligned}$ (26) ) with $H = \frac{2}{3}$ , $σ = 1$ , c = 1 and $γ = .5$ and $X_{0} = 0$ , and in the middle we have plotted $u_{t}^{*}$ (blue) and $ξ_{t}$ (in red). On the right, as a sanity check, we have plotted the expected profit/loss for α times the optimal trading speed, as a function of α (which we see is correctly maximized close to $α = 1$ , the small numerical error is there because we have to estimate the triple integral in (24) with Monte Carlo).

Figure 1. On the left we have plotted the optimal inventory Xt∗ in Theorem 2.2 when Pt=σ∫0t(t−s)H−12dWs is a Riemann-Liouville process using (Equation26(26) k(u,t)=−(2cπ32τ32u¯14Γ(H))−1⋅τ32+HσΓ14Γ(H14)w14(u−t)+H14τ12+Hu¯−34+HσΓ14−Bu¯,−H14,34+Γ34Γ−H14Γ(12−H)ΓH14+2πΓ(H)(τ12+Hσ+τλ1(u))w14,(26) ) with H=23, σ=1, c = 1 and γ=.5 and X0=0, and in the middle we have plotted ut∗ (blue) and ξt (in red). On the right, as a sanity check, we have plotted the expected profit/loss for α times the optimal trading speed, as a function of α (which we see is correctly maximized close to α=1, the small numerical error is there because we have to estimate the triple integral in (24) with Monte Carlo).

2.5. Re-expressing the trading speed in terms of the price history

At the moment our optimal selling speed is expressed as $u_{t} = \int_{0}^{t} k (u, t) d W_{u}$ , but it is more natural and useful to re-express $u_{t}$ in terms of P itself. To this end, let $Z_{t} = \int_{0}^{t} g (s, t) d W_{s}$ , and we seek a function $h (\cdot, \cdot)$ such that $h (t, t) Z_{t} - \int_{0}^{t} h_{s} (s, t) Z_{s} d s = W_{t}$ . Then we see that $\begin{aligned} h (t, t) Z_{t} - \int_{0}^{t} h_{s} (s, t) Z_{s} d s \\ = h (t, t) \int_{0}^{t} g (u, t) d W_{u} - \int_{0}^{t} h_{s} (s, t) \int_{0}^{s} g (u, s) d W_{u} d s \\ = h (t, t) \int_{0}^{t} g (u, t) - \int_{0}^{t} \int_{u}^{t} h_{s} (s, t) g (u, s) d s d W_{u}, \end{aligned}$ where $h_{s} (\cdot, \cdot)$ denotes the partial derivative of h with respect to the first argument. Hence to find an inversion formula, we need to solve the integral equation $h (t, t) g (u, t) - \int_{u}^{t} h_{s} (s, t) g (u, s) d s = 1.$ If $g (s, t) = g (t - s)$ with $g \in L^{2}$ and we guess that $h (s, t) = h (t - s)$ , then the equation takes the special form $h (0) g (t - u) + \int_{u}^{t} h^{'} (t - s) g (s - u) d s = 1 .$ Setting $\tilde{s} = s - u$ , we can re-write this as $h (0) g (t - u) + \int_{0}^{t - u} h^{'} (t - (u + \tilde{s})) g (\tilde{s}) d \tilde{s} = 1,$ and replacing t−u with t we can further re-write as $h (0) g (t) + \int_{0}^{t} h^{'} (t - \tilde{s}) g (\tilde{s}) d \tilde{s} = h (0) g (t) + h^{'} * g = 1 .$ Then taking the Laplace transform, we have $h (0) \hat{g} + \hat{(h^{'})} \hat{g} = h (0) \hat{g} + (λ \hat{h} - h (0)) \hat{g} = \frac{1}{λ},$ so we see that (24) $\hat{h} = \frac{1}{λ^{2} \hat{g}} .$ (24) Hence if $P_{t} = \int_{0}^{t} g (t - u) d W_{u}$ for some $g \in L^{2}$ then $ξ_{t} = \int_{0}^{t} K_{ξ} (u, t) d W_{u}$ with $K_{ξ} (u, t) = g (T - u) - g (t - u)$ , and from the preceding computations we have the inversion formula $W_{t} = h (t, t) P_{t} - \int_{0}^{t} h_{s} (s, t) P_{s} d s$ and recall that ${\hat{u}}_{t} = \int_{0}^{t} k (u, t) d W_{u}$ (where $k (\cdot, \cdot)$ depends on $K_{ξ}$ via the Fredholm eq (Equation12(12) $\begin{aligned} - K_{ξ} (u, t) & = \int_{u}^{T} G (| t - v |) k (u, v) d v + λ (u) \end{aligned}$ (12) ), and hence on g itself) so we now see how $\hat{u}$ depends solely on the (unaffected) stock price history $(P_{u})_{0 \leq u \leq t}$ , which gives us our signal-adaptive optimal selling speed.

We can compute h explicitly for the case when $g (t) = t^{H - \frac{1}{2}} e^{- θ t}$ for $H \in (0, 1)$ , $θ > 0$ for which we find that (25) $\begin{aligned} h (t) = \frac{\begin{matrix} e^{- θ t} t^{- \frac{1}{2} - H} [2 - e^{t θ} (1 + 2 H + 2 t θ) \\ (E_{\frac{3}{2} + H} (t θ) - (t θ)^{\frac{1}{2} + H} Γ (- \frac{1}{2} - H))] \end{matrix}}{2 θ Γ (- \frac{1}{2} - H) Γ (\frac{1}{2} + H)} . \end{aligned}$ (25) where $E_{n} (z) = \int_{1}^{\infty} \frac{e^{- z t}}{t^{n}} d t$ . $H = \frac{1}{2}$ corresponds to the OU process for which $h (t) = 1 + θ t$ , and $θ = 0$ corresponds to the Riemann-Liouville process for which $h (t) = \frac{t^{\frac{1}{2} - H}}{Γ (\frac{3}{2} - H) Γ (\frac{1}{2} + H)}$ (see next section).

3. Examples and extensions of the main model

3.1. Rough signals

If $P_{t} = σ \int_{0}^{t} (t - s)^{H - \frac{1}{2}} d W_{s}$ (i.e. a Riemann-Liouville process) for $H \in (0, 1)$ and $γ = \frac{1}{2}$ and $\bar{ξ} (t) \equiv 0$ for simplicity, then clearly $ξ_{t} = E_{t} (P_{T} - P_{t}) = \int_{0}^{t} ((T - s)^{H - \frac{1}{2}} - (t - s)^{H - \frac{1}{2}}) d W_{s}$ and (after some lengthy Mathematica computations) we find that (26) $\begin{aligned} k (u, t) & = - (2 c π^{\frac{3}{2}} τ^{\frac{3}{2}} {\bar{u}}^{\frac{1}{4}} Γ (H))^{- 1} \cdot [\frac{τ^{\frac{3}{2} + H} σ Γ (\frac{1}{4}) Γ (H_{\frac{1}{4}})}{w^{\frac{1}{4}} (u - t)} \\ + H_{\frac{1}{4}} τ^{\frac{1}{2} + H} {\bar{u}}^{- \frac{3}{4} + H} σ Γ (\frac{1}{4}) (- B (\bar{u}, - H_{\frac{1}{4}}, \frac{3}{4}) \\ + \frac{Γ (\frac{3}{4}) Γ (- H_{\frac{1}{4}})}{Γ (\frac{1}{2} - H)}) Γ (H_{\frac{1}{4}}) \\ + \frac{\sqrt{2 π} Γ (H) (τ^{\frac{1}{2} + H} σ + τ λ_{1} (u))}{w^{\frac{1}{4}}}], \end{aligned}$ (26) where $H_{\frac{1}{4}} = H + \frac{1}{4}$ , $τ = T - u$ , $w = \frac{T - t}{T - u}$ , $\bar{u} = \frac{t - u}{T - u}$ and $B (z, a, b) = \int_{0}^{z} t^{a - 1} (1 - t)^{b - 1} d t$ denotes the incomplete Beta function, and enforcing the liquidation condition $\int_{u}^{T} k (u, t) d t = 0$ we find that $λ (u) = - Υ τ^{H - \frac{1}{2}}$ where Υ is given by $σ \frac{\begin{matrix} π^{2} c s c (θ π) + Γ (ω_{-}) [2 H Γ {(\frac{3}{4})}^{2} Γ (H) - H \sqrt{π} Γ (- \frac{1}{4}) Γ (θ) \\ - π \cos (H π) c s c (θ π) Γ (ω_{+}) + \sqrt{π} Γ (- \frac{1}{4}) Γ (\frac{5}{4} + H)] \end{matrix}}{2 H Γ {(\frac{3}{4})}^{2} Γ (ω_{-}) Γ (H)}$ with $θ = \frac{1}{4} + H$ and $ω_{\pm} = \frac{1}{2} \pm H$ (see numerical simulations above and overleaf). Note that we have not rigourously verified that this strategy is admissible which would be extremely difficult to check (figures and ).

Figure 2. Non-Round trip case: from left to right (with $X_{0} = .25$ and the same parameters as above) we see (i) the optimal buying speed with no-signal (ii) $X_{t}^{*}$ with no signal.

Figure 3. On the left we see the optimal selling speed with non-zero signal (blue) and the no-signal optimal speed (grey) and on the right we see $X_{t}^{*}$ with non-zero signal (blue) and zero signal (grey), for the same parameters and simulated Brownian motion as figure .

Remark 3.1

H can be efficiently estimated from a time series using maximum likelihood methods (see Chang Citation2014 for explicit formulae) or using convolutional neural networks (see Stone Citation2020).

3.2. Temporary price impact

If we add a temporary price impact term $η \dot{} X_{t} = - η u_{t}$ on the right hand side of (Equation1(1) $S_{t} = P_{t} + \int_{0}^{t} G (t - s) d X_{s},$ (1) ), then we incur an additional $η u_{t}^{2}$ term in (Equation3(3) $\tilde{V} (u) = - E (\int_{0}^{T} (ξ_{t} + \int_{0}^{t} G (t - s) u_{s} d s) u_{t} d t) .$ (3) ), and a standard first order variational analysis of this expression leads to the following modified (Equation4(4) $ξ_{t} + E_{t} (\int_{0}^{T} G (| t - v |) u_{v} d v) = M_{t} a . s .$ (4) ): $ξ_{t} + 2 η u_{t}^{*} + E_{t} (\int_{0}^{T} G (| t - v |) u_{v}^{*} d v) = M_{t}$ for some martingale M to be determined such that $X_{T} = 0$ as before. Then using the same ansatz $u_{t} = \int_{0}^{t} k (u, t) d W_{u}$ , we can readily verify that (Equation12(12) $\begin{aligned} - K_{ξ} (u, t) & = \int_{u}^{T} G (| t - v |) k (u, v) d v + λ (u) \end{aligned}$ (12) ) changes to $\begin{aligned} - K_{ξ} (u, t) & = 2 η k (u, t) + \int_{u}^{T} G (| t - v |) k (u, v) d v + λ (u) - \bar{ξ} (t) \\ = 2 η \bar{u} (t) + \int_{0}^{T} G (| t - v |) \bar{u} (v) d v + λ_{2} \end{aligned}$ where $λ (u)$ and $λ_{2}$ are again chosen to ensure that $X_{T} = 0$ , and this is now a Fredholm equation of the second kind, for $u \in [0, T]$ fixed.

4. Calibrating the model to real limit order book data

To calibrate the price impact model in equation (Equation1(1) $S_{t} = P_{t} + \int_{0}^{t} G (t - s) d X_{s},$ (1) ) we employ the order flow of all market participants, the transaction prices weighted by volume, and the unaffected price process. We then look for parameters that best fit the data. In (Equation1(1) $S_{t} = P_{t} + \int_{0}^{t} G (t - s) d X_{s},$ (1) ) we refer to $P_{t}$ as the unaffected price process, and $d X_{t} = - u_{t} d t$ is the instantaneous trading of the agent. Let ${\tilde{P}}_{t} = P_{t} - \int_{0}^{t} G (t - s) d Y_{s}$ be the “observable unaffected price”, where $d Y_{t} = v_{t} d t$ and Y is the cumulative instantaneous trading of all other market participants excluding the agent. Then (Equation1(1) $S_{t} = P_{t} + \int_{0}^{t} G (t - s) d X_{s},$ (1) ) changes to (27) $S_{t} = {\tilde{P}}_{t} + \int_{0}^{t} G (t - s) d Z_{s}$ (27) where $d Z_{s} = d X_{s} + d Y_{s} = (u_{s} + v_{s}) d s$ captures the order flow of the entire market (see Cartea and Jaimungal Citation2016).

Given the previous decomposition, we show how to estimate the parameters that appear in the decay kernel G. Let Θ be the parameter space associated with G. For example, in the power-law impact case, in which $θ = (c, γ)$ , the parameter space is $Θ = R^{+} \times (0, 1)$ . Take $θ \in Θ$ and consider a discretized version of (Equation27(27) $S_{t} = {\tilde{P}}_{t} + \int_{0}^{t} G (t - s) d Z_{s}$ (27) ) given by $S_{t_{n}} \approx {\tilde{P}}_{t_{n - 1}} + \sum_{i = 1}^{n} G^{θ} (t_{n} - t_{i - 1}) (u_{t_{i}} + v_{t_{i}}) Δ$ where $0 = t_{0} < t_{1} < \dots < t_{n}$ , and $Δ = t_{i} - t_{i - 1}$ for $i \in {1, 2, \dots, n}$ . The quantity $(u_{t_{i}} + v_{t_{i}}) Δ$ represents the volume traded in $[t_{i - 1}, t_{i})$ by all market participants. The observable unaffected price ${\tilde{P}}_{t_{n - 1}}$ can be taken to be the mid-price of the asset at time $t_{n - 1}$ , and $S_{t_{n}}$ is the volume-weighted average price of all transactions in $[t_{i - 1}, t_{i})$ .

Fix a given calibration horizon T (for example, one day of trading), let $t_{0} < t_{1} < \dots < t_{N}$ be a fixed time grid, where $t_{0} = 0$ and $t_{N} = T$ (for example, one minute intervals throughout the day), let $(S_{t_{i}})_{1 \leq i \leq N}$ be the observed volume-weighted transaction prices,Footnote¹ and let $(V_{t_{i}})_{0 \leq i \leq N}$ be the volume traded by all market participants. For instance, for $i \in {1, 2, \dots, N}$ , $V_{t_{i}} = (u_{t_{i}} + v_{t_{i}}) Δ$ . Finally, let $({\tilde{P}}_{t_{i}})_{0 \leq i \leq N - 1}$ be the mid-price sampled at times $t_{0} < t_{1} < \dots < t_{N - 1}$ . We assume our observations have noise, that is to say $S_{t_{n}} = {\tilde{P}}_{t_{n - 1}} + \sum_{i = 1}^{n} G^{θ} (t_{n} - t_{i - 1}) V_{t_{i}} + ϵ_{n},$ where $(ϵ_{n})_{n \in N}$ is a collection of independent and identically distributed normal random variables. We take the estimator $\hat{θ}$ of θ to be the parameters that minimize the residual sum of squares, in other words, (28) $\hat{θ} = {a r g m i n}_{θ \in Θ} \sum_{n = 1}^{N} {(S_{t_{n}} - {\tilde{P}}_{t_{n}} - \sum_{i = 1}^{n} G^{θ} (t_{n} - t_{i}) V_{t_{i}})}^{2} .$ (28) Next, we test the calibration method in (Equation28(28) $\hat{θ} = {a r g m i n}_{θ \in Θ} \sum_{n = 1}^{N} {(S_{t_{n}} - {\tilde{P}}_{t_{n}} - \sum_{i = 1}^{n} G^{θ} (t_{n} - t_{i}) V_{t_{i}})}^{2} .$ (28) ). We employ limit order book (LOB) data from VOD, AAPL, and CSCO trading in NASDAQ from 2 December 2019 to 31 January 2020. The data comprise all of the updates in the best prices, quantities, and trades. We take the time intervals to be spaced by one minute, and we set $[0, T]$ to be from 10:00 am to 2:00 pm. We calibrate the parameters $(c, γ)$ in $R^{+} \times (0, 1)$ for the power-law impact case $G^{θ} (t) = c t^{- γ}$ , and we refer to the estimates as $\hat{c}$ and $\hat{γ}$ . We observe that over the two months of data, the mean value (and standard deviation) of the estimate $\hat{γ}$ was 0.384 (0.104) for VOD, 0.440 (0.125) for AAPL, and 0.493 (0.104) for CSCO. Similarly, the mean value (and standard deviation) of the estimate $\hat{c}$ was 0.0015 (0.0004) for VOD, 0.0028 (0.0007) for AAPL, and 0.0009 (0.0004) for CSCO. For an alternate approach to the calibration of parameters under transient market impact, see Busseti and Lillo (Citation2012).

Acknowledgments

We thank Alex Schied for helpful discussions.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Notes

1 We define

S_{0} = P_{0}

. If there are no transactions in a given interval

[t_{i - 1}, t_{i})

for

i \in {1, 2, \dots, N}

, we define

S_{t_{i}} = S_{t_{i - 1}}

. Otherwise,

S_{t_{i}}

is the volume-weighted trade price over all trading carried in

[t_{i - 1}, t_{i})

References

Almgren, R. and Chriss, N., Optimal execution of portfolio transactions. J. Risk, 2001, 3, 5–50. doi: https://doi.org/10.21314/JOR.2001.041
Google Scholar
Bank, P., Soner, H.M. and Voß, M., Hedging with temporary price impact. Math. Financ. Econ., 2017, 11(2), 215–239. doi: https://doi.org/10.1007/s11579-016-0178-4
Web of Science ®Google Scholar
Belak, C., Muhle-Karbe, J. and Ou, K., Liquidation in target zone models. Market Micro. Liquid., 2020, 4(3 Article ID 1950010
Web of Science ®Google Scholar
Bierme, H., Durieu, O. and Wang, Y., Generalized random fields and Lévy's continuity theorem on the space of tempered distributions. Preprint, 2017.
Google Scholar
Busseti, E. and Lillo, F., Calibration of optimal execution of financial transactions in the presence of transient market impact. J. Stat. Mech. Theory Exp., 2012, 2012, Article ID P09010. doi: https://doi.org/10.1088/1742-5468/2012/09/P09010
Google Scholar
Cartea, A. and Jaimungal, S., Incorporating order-flow into optimal execution. Math. Financ. Econ., 2016, 10(3), 339–364. doi: https://doi.org/10.1007/s11579-016-0162-z
Web of Science ®Google Scholar
Cartea, A., Jaimungal, S. and Penalva, J., Algorithmic and High-Frequency Trading, 2015 (Cambridge University Press: Cambridge).
Google Scholar
Cartea, A., Donnelly, R. and Jaimungal, S., Enhancing trading strategies with order book signals. Appl. Math. Finance, 2018, 25(1), 1–35. doi: https://doi.org/10.1080/1350486X.2018.1434009
Google Scholar
Cartea, A., Perez Arribas, I. and Sánchez-Betancourt, L., Optimal execution of foreign securities: A double-execution problem with signatures and machine learning. Preprint, 2020.
Google Scholar
Chakrabarti, A. and George, A.J., A formula for the solution of general Abel integral equation. Appl. Math. Lett., 1994, 7(2), 87–90. doi: https://doi.org/10.1016/0893-9659(94)90037-X
Web of Science ®Google Scholar
Chang, Y.C., Efficiently implementing the maximum likelihood estimator for hurst exponent. Math. Probl. Eng., 2014, 2014, Article ID 490568.
Web of Science ®Google Scholar
Curato, G., Gatheral, J. and Lillo, F., Optimal execution with nonlinear transient market impact. Quant. Finance, 2017, 17(1), 41–54. doi: https://doi.org/10.1080/14697688.2016.1181274
Web of Science ®Google Scholar
Dang, N.-M., Optimal execution with transient impact, 2014. Available at SSRN 2183685.
Google Scholar
Duchon, J., Robert, R. and Vargas, V., Forecasting volatility with the multifractal random walk model. Math. Finance, 2012, 22(1), 83–108. doi: https://doi.org/10.1111/j.1467-9965.2010.00458.x
Web of Science ®Google Scholar
Duplantier, D., Rhodes, R., Sheffield, S. and Vargas, V., Renormalization of critical Gaussian multiplicative chaos and KPZ relation. Comm. Math. Phys., August, 2014, 330(1), 283–330. doi: https://doi.org/10.1007/s00220-014-2000-6
Web of Science ®Google Scholar
Duplantier, D., Rhodes, R., Sheffield, S. and Vargas, V., Log-correlated Gaussian fields: An overview. In Geometry, Analysis and Probability, pp. 191–216, August, 2017.
Google Scholar
Forde, M. and Smith, B., The conditional law of the Bacry-Muzy and Riemann-Liouville log-correlated Gaussian fields and their GMC, via Gaussian Hilbert and fractional Sobolev spaces. Stat. Prob. Lett., June, 2020, 161, Article 108732. doi: https://doi.org/10.1016/j.spl.2020.108732
Google Scholar
Forde, M. and Zhang, H., Asymptotics for rough stochastic volatility models. SIAM J. Financ. Math., 2017, 8, 114–145. doi: https://doi.org/10.1137/15M1009330
Google Scholar
Forde, M., Fukasawa, M., Gerhold, S. and Smith, B., The Rough Bergomi model as H→0 – skew flattening/blow up and non-Gaussian rough volatility. Preprint, 2020.
Google Scholar
Gatheral, J., Schied, A. and Slynko, A., Transient linear price impact and Fredholm integral equations. Math. Finance, 2012, 22, 445–474. doi: https://doi.org/10.1111/j.1467-9965.2011.00478.x
Web of Science ®Google Scholar
Janson, S., Gaussian Hilbert Spaces, 2009 (Cambridge University Press: Cambridge).
Google Scholar
Kalsi, J., Lyons, T. and Arribas, I.P., Optimal execution with rough path signatures. SIAM J. Financ. Math., 2020, 11(2), 470–493. doi: https://doi.org/10.1137/19M1259778
Google Scholar
Lorenz, C. and Schied, A., Drift dependence of optimal trade execution strategies under transient price impact. Finance Stoch., 2013, 17, 743–770. doi: https://doi.org/10.1007/s00780-013-0211-x
Web of Science ®Google Scholar
Neuman, E. and Voß, M., Optimal signal-adaptive trading with temporary and transient price impact. Preprint, 2020.
Google Scholar
Obizhaeva, A. and Wang, J., Optimal trading strategy and supply/demand dynamics. J. Finance Markets, 2013, 16(1), 1–32. doi: https://doi.org/10.1016/j.finmar.2012.09.001
Web of Science ®Google Scholar
Porter, D. and Stirling, D.S.G., Integral Equations: A Practical Treatment from Spectral Theory to Applications, 1990 (Cambridge University Press: Cambridge).
Google Scholar
Stone, H.M.C, Calibrating rough volatility models: A convolutional neural network approach. Quant. Finance, 2020, 20(3), 379–392. doi: https://doi.org/10.1080/14697688.2019.1654126
Web of Science ®Google Scholar

Appendix

Recall that $⟨ u, v ⟩_{G} = E (\int_{0}^{T} \int_{0}^{T} u_{s} v_{t} G (| t - s |) d s d t)$ .

Lemma A.1

Let $u, v \in U$ such that $∥ u ∥_{G}$ and $∥ v ∥_{G}$ are finite. Then $⟨ u, v ⟩_{G} < \infty$ .

Proof.

We first consider a deterministic function φ in the Schwarz space $S$ with $s u p p (φ) \subseteq [0, T]$ (φ will be replaced with a random $u \in U_{0}^{X_{0}}$ below once we have the required machinery in place). Using Plancherel's theorem, we see that $\begin{aligned} ⟨ φ, φ ⟩_{G} & = \int_{0}^{T} φ (t) \int_{0}^{T} φ (s) G (| t - s |) d s d t \\ = \int_{- \infty}^{\infty} φ (t) \int_{- \infty}^{\infty} φ (s) G (| t - s |) d s d t \\ = \int_{- \infty}^{\infty} \hat{φ} (k) \bar{\hat{φ} (k)} \hat{G} (k) d k \\ = \int_{- \infty}^{\infty} | \hat{φ} (k) |^{2} \hat{G} (k) d k \geq 0, \end{aligned}$ where $\hat{G} (k) = c_{γ} | k |^{γ - 1}$ is the Fourier transform of G, for some constant $c_{γ} > 0$ . Thus $⟨ \cdot, \cdot ⟩_{G}$ is a positive semi-definite bilinear form on $S$ . Using similar arguments to equation (8) in Forde and Smith (Citation2020), we can also show $⟨ \cdot, \cdot ⟩_{G}$ is continuous on the Schwarz space $S (R)$ . Hence by Minlos's theorem, $e^{- \frac{1}{2} ⟨ φ, φ ⟩_{G}} = E (e^{i ⟨ φ, Z ⟩})$ is the characteristic functional of the Fractional Gaussian Field (FGF) Z with covariance function $G (| t - s |) = c | t - s |^{- γ}$ which lives in the space of tempered distributions $S^{'}$ (see e.g. pg 8 of Janson Citation2009, and Duplantier et al. Citation2017 and Appendix A in Forde et al. Citation2020 for more details) which is the dual of the Schwartz space $S$ (see e.g. Section 2.2 in Duplantier et al. Citation2014 and Theorem 2.1 in Bierme et al. Citation2017). Moreover, $S$ is a Montel space and thus is reflexive, i.e. $(S^{'})^{'}$ is isomorphic to $S$ using the canonical embedding of S into its bi-dual $(S^{'})^{'}$ .

Proceeding as in Forde and Smith (Citation2020), we now let $\bar{F}$ denote the Hilbert space equal to the $L^{2} (S, F_{T}, P)$ closure of $F = {Z (φ) : φ \in S, s u p p (φ) \subseteq [0, T]}$ where $F_{T} = σ ((Z_{u})_{0 \leq u \leq T})$ .

In order to characterize $\bar{F}$ , we first note that $E ((Z, φ)^{2}) = \int_{0}^{T} \int_{0}^{T} G (| t - s |) φ (s) φ (t) d s d t .$ We also know that $\begin{aligned} \int_{0}^{T} \int_{0}^{T} G (| t - s |) φ (s) ψ (t) d s d t & = E (\int_{- \infty}^{\infty} \hat{φ} (k) \bar{\hat{ψ}} (k) \hat{G} (k) d k) \\ = c_{γ} ⟨ φ, ψ ⟩_{H^{- \frac{1}{2} (1 - γ)}} \end{aligned}$ where $\hat{G} (k) = c_{γ} | k |^{γ - 1}$ for some constant $c_{γ}$ , and $H^{s}$ denotes the homogenous fractional Sobolev space of order s (see e.g. page 5 in Duchon et al. Citation2012 for definitions). Thus, setting $s = \frac{1}{2} (1 - γ)$ , the following two inner products on the linear space $S$ of Schwarz functions are equivalent and hence generate the same topologies on $S$ :

$⟨ φ, ψ ⟩_{H^{- s}} := \int_{- \infty}^{\infty} | k |^{- 2 s} \hat{φ} (k) \bar{\hat{ψ}} (k) d k$ (i.e. the standard inner product on $H^{- s}$ )
$⟨ φ, ψ ⟩ := E [Z (φ) Z (ψ)] = \int_{0}^{T} \int_{0}^{T} φ (s) ψ (t) G (| t - s |) d s d t$ .

We now make the following observations:

Let $φ \in H^{- s}$ , with $s u p p (φ) \subseteq [0, T]$ . $S$ is dense in $H^{- s}$ , so there exists a sequence $φ_{n} \in S$ with $s u p p (φ_{n}) \subseteq [0, T]$ such that $∥ φ_{n} - φ ∥_{H^{- s}} \to 0$ , and φ is a Cauchy sequence in $H^{- s}$ so (by the equivalence of norms) $Z (φ_{n})$ is a Cauchy sequence in $\bar{F}$ , and thus converges to some Y in $\bar{F}$ . This defines $Z (φ) := Y$ as a continuous linear extension of Z from $S$ to the larger space $H^{- s}$ , which we will also often write as $\int φ (t) Z_{t} d t$ . To check that $Z (φ)$ is uniquely specified, consider two such sequences $φ_{n}$ and $φ_{n}^{'}$ . Then from the triangle inequality $∥ φ_{n} - φ_{n}^{'} ∥_{H^{- s}} \leq ∥ φ_{n} - φ ∥_{H^{- s}} + ∥ φ - φ_{n}^{'} ∥_{H^{- s}} \to 0$ and thus (by the equivalence of norms) we have $∥ Z (φ_{n}) - Z (φ_{n}^{'}) ∥_{L^{2} (S, F_{T}, P)} = ∥ Z (φ_{n}) - Z (φ_{n}^{'}) ∥_{\bar{F}} \to 0 .$
Conversely, for any $Z \in \bar{F}$ , there exists a sequence $φ_{n} \in S$ such that $Z (φ_{n})$ converges to $Z \in L^{2} (S, F_{T}, P)$ , so $φ_{n}$ is a Cauchy sequence with respect to the second norm defined above, and hence also a Cauchy sequence with respect to the $H^{- s}$ norm (by the equivalence of the two norms). $H^{- s}$ is a Hilbert space so Cauchy sequences in $H^{- s}$ converge i.e. there exists a φ in $H^{- s}$ such that $φ_{n} \to φ \in H^{- s}$ .

Thus we have shown that $\bar{F} = {Z (φ) : φ \in H^{- s}, s u p p (φ) \subseteq [0, T]},$

where we are using the extension of Z to $H^{- s}$ on the right hand side here as defined in the first bullet point above. Moreover, we can now extend the inner product to $H^{- s}$ as $\begin{aligned} ⟨ φ, ψ ⟩ & = lim_{n \to \infty} E [Z (φ_{n}) Z (ψ_{n})] \\ = lim_{n \to \infty} \int_{0}^{T} \int_{0}^{T} φ_{n} (s) ψ_{n} (t) G (| t - s |) d s d t \end{aligned}$ where $φ_{n}, φ_{n} \in S$ and $φ_{n} \to φ$ in $H^{- s}$ and $ψ_{n} \to ψ$ in $H^{- s}$ .

Finally, to prove the lemma, if $u \in U_{0}^{X_{0}}$ and $E (\int_{0}^{T} \int_{0}^{T} u_{s} u_{t} G (| t - s |) d s d t) < \infty$ , then $\int_{0}^{T} \int_{0}^{T} u_{s} u_{t} G (| t - s |) d s d t < \infty$ a.s., so $u \in H^{- s}$ a.s. Then if we assume the field Z is independent of u then $\begin{aligned} ⟨ u, v ⟩_{G} & = E ((Z, u) (Z, v)) \leq E ((Z, u)^{2})^{\frac{1}{2}} E ((Z, v)^{2})^{\frac{1}{2}} \\ = E (E ((Z, u)^{2} | u))^{\frac{1}{2}} E (E ((Z, v)^{2})^{\frac{1}{2}}) \\ = E {(\int_{0}^{T} \int_{0}^{T} u_{s} u_{t} G (| t - s |) d s d t)}^{\frac{1}{2}} \\ \times E {(\int_{0}^{T} \int_{0}^{T} v_{s} v_{t} G (| t - s |) d s d t)}^{\frac{1}{2}} \\ < \infty \end{aligned}$ as required.

Optimal trade execution for Gaussian signals with power-law resilience

Abstract

1. Introduction

2. The model setup