Abstract
We study optimal control for mean-field forward–backward stochastic differential equations with payoff functionals of mean-field type. Sufficient and necessary optimality conditions in terms of a stochastic maximum principle are derived. As an illustration, we solve an optimal portfolio problem under mean-field risk minimization.
1. Introduction
Since the seminal work of Lasry and Lions [Citation1], which introduced mean-field game theory for the analysis of differential games with infinitely many players, mean-field games have attracted a lot of attention, and forward/backward stochastic differential equations of mean-field type are used extensively as dynamics (see, e.g., Huang et al. [Citation2], Xu and Zhang [Citation3] and Xu and Shi [Citation4]). In Huang [Citation5], the author studies a linear–quadratic game with a major player and a large number of minor players. The dynamics of the major player are influenced by an aggregation of all minor players (mean-field coupling), whereas the minor players’ dynamics depend on the control of the major player in addition to their individual controls as well as the mean-field coupling, i.e., a system of partially control-coupled forward stochastic differential equations (SDEs). This work (Ref. [Citation5]) was generalized to the non-linear case in Nourian and Caines [Citation6]. In all previously mentioned works, the authors find ε-Nash equilibria for mean-field games, where each player plays a game against the aggregation of the other players (the mass). In the present paper, the setting is different. We consider a mean-field type control problem where the goal is to find an optimal control via a stochastic maximum principle. The mass, or the laws of the state processes, is not frozen; it varies with the control. Thus, finding an optimal control will also yield optimal laws. Furthermore, in our control problem we consider a controlled partially coupled forward–backward SDE of mean-field type (MF-FBSDE) as dynamics, which is a novel contribution. We also use the Sobolev space of random measures introduced in Agram et al. [Citation7–9], in which the Fréchet derivative with respect to the measure can be taken directly.
This is a new approach compared to what is standard in the literature, where the Wasserstein metric space for measures and the lifting technique introduced by Lions [Citation10] are used to differentiate a function of a measure.
Existence of solutions to fully coupled MF-FBSDEs was studied by Carmona and Delarue [Citation11] under a Lipschitz assumption on the coefficients, but no uniqueness result was proven. Bensoussan et al. [Citation12] prove existence and uniqueness for a fully coupled MF-FBSDE by assuming Lipschitz and monotonicity conditions. Recently, Djehiche and Hamadene [Citation13] proved the same results under weaker monotonicity assumptions and without the non-degeneracy condition on the forward equation.
The purpose of our work is to derive necessary and sufficient optimality conditions, in terms of a stochastic maximum principle, for admissible controls u which maximize a cost functional of the form , for some functions , under dynamics governed by MF-FBSDEs. More specifically, we consider the coupled system , for some functions and a Brownian motion B(t); here M(t) and N(t) denote the marginal laws of X and Y, respectively. As an application, we consider a risk minimization control problem. More precisely, we want to minimize the risk , where is a convex risk measure defined by means of backward stochastic differential equations of mean-field type (MF-BSDEs). Let us recall what we mean by a convex risk measure:
Definition 1.1.
A convex risk measure is a map ρ that satisfies the following properties:
(Convexity) ρ(λξ + (1 − λ)η) ≤ λρ(ξ) + (1 − λ)ρ(η) for all positions ξ, η and all λ ∈ [0, 1].
(Monotonicity) If ξ ≤ η, then ρ(ξ) ≥ ρ(η).
(Translation invariance) ρ(ξ + a) = ρ(ξ) − a for all ξ and all constants a.
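A standard concrete example (not from this paper) is the entropic risk measure ρ(ξ) = (1/γ) log E[exp(−γξ)], which satisfies all three properties; the following Python sketch, with an assumed risk-aversion parameter γ and simulated positions, checks them numerically.

```python
import numpy as np

rng = np.random.default_rng(0)
xi = rng.normal(0.0, 1.0, size=200_000)   # a simulated financial position
eta = rng.normal(0.5, 2.0, size=200_000)  # a second position
gamma = 2.0                               # risk-aversion parameter (assumed)

def rho(x):
    """Entropic risk measure: rho(x) = (1/gamma) * log E[exp(-gamma * x)]."""
    return np.log(np.mean(np.exp(-gamma * x))) / gamma

lam = 0.3
# Convexity: rho(lam*xi + (1-lam)*eta) <= lam*rho(xi) + (1-lam)*rho(eta)
assert rho(lam * xi + (1 - lam) * eta) <= lam * rho(xi) + (1 - lam) * rho(eta) + 1e-12
# Monotonicity: xi <= xi + 1 pointwise, so rho(xi) >= rho(xi + 1)
assert rho(xi) >= rho(xi + 1.0)
# Translation invariance: rho(xi + a) = rho(xi) - a, exactly (up to rounding)
a = 0.7
assert abs(rho(xi + a) - (rho(xi) - a)) < 1e-10
```

Convexity here follows from Hölder's inequality applied to the empirical measure, so the check holds exactly and not merely up to Monte Carlo error.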
The construction of risk measures from solutions of BSDEs is as follows: assume that in the driver of the above MF-BSDE and that is convex for all t. Then defines a convex risk measure. This shows how crucial the choice of the functional g is. Through this connection, the problem of risk minimization is equivalent to stochastic optimal control of MF-FBSDEs, as shown in Øksendal and Sulem [Citation14] for the non-mean-field case. The rest of the paper is organized as follows. In Section 2, we give some mathematical background. In Section 3, we study stochastic optimal control of MF-FBSDEs, where sufficient and necessary optimality conditions are derived. In the last section, we construct a dynamic risk measure by means of an MF-BSDE and then solve an associated risk minimization problem.
2. Generalities
Let B be a one-dimensional Brownian motion defined on a complete filtered probability space (Ω, F, {F_t}_{t≥0}, P). The filtration {F_t} is assumed to be the P-augmented filtration generated by B.
Definition 2.1.
Let be the set of integers.
• Let be the space of random measures μ on equipped with the norm (2.1) where is the Fourier transform of the measure μ, i.e., . We endow with the inner product , where and denote the Fourier transforms of the measures μ and η, respectively. Then is a pre-Hilbert space for each k. Let be the union (inductive limit) of
• We denote by the set of all deterministic elements of
We give some examples:
Example 2.2
(Measures). Let us give some examples of measures in and :
Suppose that , the unit point mass at . Then and
and hence
Suppose , where . Then and by the Riemann–Lebesgue lemma, , i.e., is continuous and when . In particular, is bounded on and hence
Suppose that μ is any finite positive measure on . Then and
and hence
Next, suppose is random. Then is a random measure in . Similarly, if is random, then is a random measure in
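The behaviour of these Fourier transforms can be checked numerically. The sketch below is an illustration (with a Gaussian density standing in for a generic integrable f): for a point mass the transform has constant modulus 1, while for an absolutely continuous measure it vanishes at infinity, as the Riemann–Lebesgue lemma predicts.

```python
import numpy as np

# Fourier transform muhat(y) = integral of exp(-i*x*y) mu(dx), on a grid of y.
y = np.linspace(-50.0, 50.0, 201)

# (i) Unit point mass at x0: muhat(y) = exp(-i*x0*y), so |muhat(y)| = 1 for all y.
x0 = 1.3
muhat_point = np.exp(-1j * x0 * y)
assert np.allclose(np.abs(muhat_point), 1.0)

# (ii) Absolutely continuous measure dmu = f(x) dx with a Gaussian density f:
# here muhat(y) = exp(-y**2/2), which vanishes at infinity (Riemann-Lebesgue).
x = np.linspace(-12.0, 12.0, 4001)
dx = x[1] - x[0]
f = np.exp(-x**2 / 2) / np.sqrt(2 * np.pi)
muhat_ac = (np.exp(-1j * np.outer(y, x)) * f).sum(axis=1) * dx

assert abs(muhat_ac[len(y) // 2] - 1.0) < 1e-6   # muhat(0) = total mass = 1
assert abs(muhat_ac[0]) < 1e-6                   # |muhat(-50)| is nearly 0
```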
We denote by U a nonempty convex subset of and by the set of U-valued -progressively measurable processes with for all ; these are the admissible control processes.
We will also use the following spaces:
is the set of -valued -adapted càdlàg processes such that
is the set of -valued -adapted processes such that
denotes the set of absolutely continuous functions
is the set of bounded linear functionals equipped with the operator norm
is the set of -adapted stochastic processes such that
is the set of -adapted stochastic processes such that
We recall now the notion of differentiability which will be used in the sequel.
Let be two Banach spaces with norms respectively, and let
We say that F has a directional derivative (or Gateaux derivative) at in the direction if
We say that F is Fréchet differentiable at if there exists a continuous linear map such that , where is the action of the linear operator A on h. In this case we call A the gradient (or Fréchet derivative) of F at v and we write
If F is Fréchet differentiable at v with Fréchet derivative then F has a directional derivative in all directions and
In particular, note that if F is a linear operator, then for all v.
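As a small numerical illustration of these notions (a hypothetical example with V = R^n and F(v) = ||v||^2, not an object from the paper), the Gâteaux difference quotient can be compared with the action of the Fréchet derivative, which here is h -> 2<v, h>:

```python
import numpy as np

def F(v):
    """F(v) = ||v||^2 on R^n; its Frechet derivative acts as h -> 2<v, h>."""
    return float(np.dot(v, v))

rng = np.random.default_rng(1)
v = rng.normal(size=5)
h = rng.normal(size=5)

# Gateaux (directional) derivative as a limit of difference quotients:
quotients = [(F(v + eps * h) - F(v)) / eps for eps in (1e-3, 1e-5, 1e-7)]
frechet = 2.0 * float(np.dot(v, h))      # action of the gradient on h
assert abs(quotients[-1] - frechet) < 1e-5

# For a linear functional G(v) = <a, v>, the derivative is G itself:
a = rng.normal(size=5)
G = lambda w: float(np.dot(a, w))
assert abs((G(v + 1e-6 * h) - G(v)) / 1e-6 - G(h)) < 1e-6
```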
3. Optimal control problem
Here we denote by the law of X(t) at time t and by the law of Y(t) at time t. We assume that our system is governed by the following coupled MF-FBSDE:
The MF-SDE is given by (3.1) for functions which are assumed to be -measurable, with initial value
The couple satisfies the MF-BSDE (3.2) where is -adapted and is -measurable.
It follows from the definition of the norm (2.1) that where and are random variables that follow the distributions and , respectively.
Assume that (C is a constant that may change from line to line)
(A1) there exists C > 0, such that
for all for all fixed
for all for all fixed
(A2) there exists C > 0 such that, for all fixed and all solutions of Equation (3.1), we have
for all
for all
Proposition 3.1.
Under Assumptions (A1) and (A2), the MF-FBSDE (3.1)–(3.2) admits a unique solution
Since the system is partially coupled, i.e., the forward equation does not depend on the solution of the backward one, we can solve the system sequentially: we first find the solution X(t) of the MF-SDE (3.1) and then plug it into the backward Equation (3.2), which we then solve.
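This solve-forward-then-backward structure also makes the system numerically tractable. As a hedged illustration of the forward step only, the sketch below simulates a linear MF-SDE with hypothetical coefficients (not the paper's), approximating the mean-field term E[X(t)] by the empirical mean of N interacting particles, and compares the terminal mean with the ODE it solves in the large-N limit.

```python
import numpy as np

rng = np.random.default_rng(2)
N, T, n_steps = 10_000, 1.0, 200
dt = T / n_steps
a, b, sigma, u = -0.5, 0.3, 0.2, 0.1     # hypothetical coefficients and control

# Particle system for dX = (a*X + b*E[X] + u) dt + sigma dB, X(0) = 1,
# with the law of X(t) entering only through its mean, estimated empirically.
X = np.full(N, 1.0)
for _ in range(n_steps):
    m = X.mean()                          # empirical stand-in for E[X(t)]
    X = X + (a * X + b * m + u) * dt + sigma * rng.normal(0.0, np.sqrt(dt), N)

# In the large-N limit, m(t) = E[X(t)] solves m' = (a + b) m + u, hence
m_exact = (1.0 + u / (a + b)) * np.exp((a + b) * T) - u / (a + b)
assert abs(X.mean() - m_exact) < 0.05
```

The tolerance accounts for both the Euler discretization bias and the Monte Carlo error of the empirical mean.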
Our aim is to maximize the performance functional of the form over all admissible controls, for functions and
Now, we can define the Hamiltonian by (3.3)
Remark 3.2.
For ease of notation, we drop the dependence on all variables except the time and write . Moreover, we will use
We assume that
(A3) are continuously differentiable with bounded partial derivatives w.r.t. all the variables.
For with corresponding solution define, whenever solutions exist, and and by the adjoint equations:
The BSDE for the unknown processes (3.4)
The MF-BSDE for the unknown processes (3.5)
The forward SDE (3.6) and (3.7)
Remark 3.3.
The real-valued linear FBSDE system (3.4) and (3.6) has a unique solution by Proposition 3.1, since the coefficients satisfy condition (A3). However, Equation (3.5) is equivalent to the degenerate BSDE
We take conditional expectation to obtain
Similarly, a solution of (3.7) is given by
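To illustrate how taking conditional expectations resolves such degenerate linear BSDEs, consider a generic scalar example with hypothetical coefficients alpha, beta and a deterministic terminal value (so the martingale part vanishes and Z = 0); the closed form obtained by conditioning can be checked against a backward Euler scheme:

```python
import numpy as np

# Linear BSDE dY = -(alpha*Y + beta) dt + Z dB with deterministic terminal
# value Y(T) = c: conditional expectation gives Z = 0 and the closed form
# Y(0) = exp(alpha*T) * c + beta * (exp(alpha*T) - 1) / alpha.
alpha, beta, c, T = 0.4, 1.5, 2.0, 1.0   # illustrative values
n_steps = 10_000
dt = T / n_steps

Y = c
for _ in range(n_steps):                 # integrate backward from t = T to 0
    Y = Y + (alpha * Y + beta) * dt

Y_closed = np.exp(alpha * T) * c + beta * (np.exp(alpha * T) - 1.0) / alpha
assert abs(Y - Y_closed) < 1e-3
```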
Before stating and proving sufficient and necessary conditions of optimality, we need the following result, which is Lemma 2.3 in Agram and Øksendal [Citation7].
Lemma 3.4.
Suppose that X(t) is an Itô process of the form where are adapted processes.
Then the map is absolutely continuous.
It follows that is differentiable for a.e. t. In the following, we will use the notation
In fact, it is proven in [Citation7] that if then
3.1. Sufficient optimality conditions
We state and prove a verification theorem.
Theorem 3.5.
Suppose that with corresponding solutions to Equations (3.1), (3.2), (3.4), (3.5), (3.6) and (3.7), respectively. Suppose that are concave functions P-a.s. for each . Moreover,
P-a.s. for all t. Then is an optimal control.
Proof.
We show that for an arbitrary u and a fixed optimal
We introduce first the following notation and and
From the definition of the Hamiltonian (3.3), we have and (3.8)
We use the concavity of h and , as well as the boundary values of Equations (3.4), (3.5), (3.6) and (3.7): (3.9)
Applying the Itô formula to and yields the following duality relations: (3.10) (3.11) (3.12)
Concavity of ψ gives (3.13)
By the concavity of H, we obtain (3.14)
Finally, by substituting the duality relations (3.10), (3.11), (3.12) and (3.13) into (3.8) and using the estimates (3.9) and (3.14), we obtain
Using the tower property and the fact that u(t) is -adapted, the desired result follows; thus is optimal.□
3.2. Necessary optimality conditions
Given an arbitrary but fixed control , we define (3.15)
Note that the convexity of U and guarantees that . We denote by and by the solution processes corresponding to and , respectively.
For each and all bounded -measurable random variables the process belongs to
In general, if is a process depending on , we define the operator D on K by (3.16) whenever the derivative exists.
Define the following derivative processes such that (3.17) and (3.18)
Remark 3.6.
Equations (3.17) and (3.18) form a linear FBSDE with bounded coefficients; hence, by Proposition 3.1, they have a unique solution.
Theorem 3.7.
Let be the optimal control and let be the corresponding solutions to Equations (3.17), (3.18), (3.4), (3.5), (3.6) and (3.7), respectively. Then the following statements are equivalent:
for all bounded
for all
Proof
We first prove Theorem 3.7 by assuming (i) and showing (ii).
Substituting f(t) and using the chain rule, we obtain and
We apply the Itô formula to and , and then take expectations, to obtain the following duality relations:
By substituting the derived duality relations and the partial derivatives of f(t), the desired result follows. The proof can be reversed to prove the converse implication; we omit the details.□
4. Mean-field risk minimization
4.1. Mean-field dynamic risk measure
In this section, we are interested in a particular class of MF-BSDEs of the following form: (4.1) where
We assume that the generator is -adapted, uniformly Lipschitz and concave, and the terminal condition
Definition 4.1.
Define by , where is a component of the solution of the MF-BSDE (4.1) with terminal horizon T, terminal condition ξ and driver f. Then is a dynamic risk measure induced by the MF-BSDE (4.1).
We may remark that the driver f depends linearly on Y and its expected value, and nonlinearly on Z. This is interpreted as a market with interest rates We can reformulate this as a problem with a driver independent of Y and by discounting the financial position ξ. We assume that the instantaneous interest rates r(t) and are deterministic. We denote by the corresponding discounted risk measure.
Define the discounted process
Then Y^r, with driver and terminal value , is part of the solution of the associated BSDE. Accordingly, we also obtain a discounted risk measure
This discounted risk measure is translation invariant: because F^r does not depend on Y, we have for and
Similarly, we can show, for each , that is translation invariant.
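The effect of discounting on translation invariance can already be seen in the simplest linear case (an illustration with a driver containing no Z-term and a constant rate, not the paper's general driver):

```python
import numpy as np

# Illustrative linear case: with driver f = -r*Y and no Z-dependence, the risk
# of a position xi at time 0 is rho(xi) = E[-xi * exp(-r*T)], and the
# discounted measure rho_r(xi) = exp(r*T) * rho(xi) = E[-xi] satisfies
# rho_r(xi + a) = rho_r(xi) - a for every constant a.
rng = np.random.default_rng(3)
r, T = 0.05, 2.0
xi = rng.normal(1.0, 0.5, size=100_000)

def rho(x):          # undiscounted risk at t = 0
    return np.mean(-x * np.exp(-r * T))

def rho_r(x):        # discounted risk measure
    return np.exp(r * T) * rho(x)

a = 0.8
assert abs(rho_r(xi + a) - (rho_r(xi) - a)) < 1e-10
# The undiscounted measure only satisfies a "discounted" translation property:
assert abs(rho(xi + a) - (rho(xi) - a * np.exp(-r * T))) < 1e-10
```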
4.2. Optimal portfolio with mean-field risk minimization
Consider a financial market with two investment possibilities:
A safe, or risk-free, asset with unit price
A risky asset with unit price
Let be a self-financing portfolio invested in the risky asset at time t. We want to minimize the risk of the terminal value of the wealth process corresponding to a portfolio π, which satisfies the linear SDE (4.2) such that , where satisfies the MF-BSDE (4.3)
Here we assume that are given deterministic functions and is some given concave function. We want to find such that
Define the Hamiltonian H that corresponds to our problem by
The couple solves the following BSDE and satisfies
The equation for is given by the forward SDE (4.4) and satisfies
The first order necessary optimality condition gives
where we denote , and so on. Since for all t, P-a.s., we obtain (4.5) which implies ; this, together with Equation (4.4), yields
From (4.5), we get
For example, if we choose (4.6)
That is
Substituting the expression for above into the MF-BSDE (4.3), we obtain (4.7)
Consequently , and thus (4.8)
Define to be the solution of the linear SDE
or explicitly (4.9)
By the Girsanov change-of-measure theorem, there exists an equivalent local martingale measure Q such that , where is the Radon–Nikodym derivative of Q with respect to P on
Substituting (4.8) and (4.9) into (4.7), we have
Taking the expectation, now with respect to the new measure Q, we get (4.10) where is the entropy of Q with respect to P.
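As a numerical sanity check of the Girsanov density and the entropy term, consider a hypothetical constant market price of risk θ (a simplification; the paper's θ may be time-dependent):

```python
import numpy as np

# For constant theta, dQ/dP = Gamma(T) = exp(-theta*B(T) - theta^2*T/2).
# Then E_P[Gamma(T)] = 1, and the relative entropy of Q with respect to P is
# H(Q|P) = E_Q[log Gamma(T)] = theta^2 * T / 2.
rng = np.random.default_rng(4)
theta, T, n = 0.6, 1.0, 2_000_000
BT = rng.normal(0.0, np.sqrt(T), size=n)          # samples of B(T) under P
Gamma = np.exp(-theta * BT - 0.5 * theta**2 * T)  # Radon-Nikodym density

assert abs(Gamma.mean() - 1.0) < 5e-3             # E_P[Gamma] = 1
entropy = np.mean(Gamma * np.log(Gamma))          # E_Q[.] = E_P[Gamma * .]
assert abs(entropy - 0.5 * theta**2 * T) < 1e-2   # = theta^2 * T / 2
# Under Q, B(t) + theta*t is a Brownian motion, so E_Q[B(T)] = -theta*T:
assert abs(np.mean(Gamma * BT) - (-theta * T)) < 1e-2
```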
Having obtained the optimal value of , we can recover the corresponding optimal terminal wealth
Summarizing, we have the following conclusion:
Theorem 4.2.
Suppose that (4.6) holds. Then the minimal risk of our problem is given by (4.10).
References
- Lasry, J.-M., Lions, P.-L. (2007). Mean field games. Jpn. J. Math. 2(1):229–260. DOI: 10.1007/s11537-007-0657-8.
- Huang, J., Wang, S., Wu, Z. (2016). Backward mean-field linear–quadratic–Gaussian (LQG) games: Full and partial information. IEEE Trans. Automat. Contr. 61(12):3784–3796. DOI: 10.1109/TAC.2016.2519501.
- Xu, R., Zhang, F. (2020). ε-Nash mean-field games for general linear–quadratic systems with applications. Automatica. 114:108835. DOI: 10.1016/j.automatica.2020.108835.
- Xu, R., Shi, J. (2019). ε-Nash mean-field games for linear–quadratic systems with random jumps and applications. Int. J. Control. DOI: 10.1080/00207179.2019.1651940.
- Huang, M. (2010). Large-population LQG games involving a major player: the Nash certainty equivalence principle. SIAM J. Control Optim. 48(5):3318–3353. DOI: 10.1137/080735370.
- Nourian, M., Caines, P. E. (2013). ε-Nash mean field game theory for nonlinear stochastic dynamical systems with major and minor agents. SIAM J. Control Optim. 51(4):3302–3331. DOI: 10.1137/120889496.
- Agram, N., Øksendal, B. (2019). Model uncertainty stochastic mean-field control. Stoch. Anal. Appl. 37(1):36–56. DOI: 10.1080/07362994.2018.1499036.
- Agram, N., Øksendal, B. (2019). Stochastic control of memory mean-field processes. Appl. Math. Optim. 79(1):181–204. DOI: 10.1007/s00245-017-9425-1.
- Agram, N., Bachouch, A., Øksendal, B., Proske, F. (2019). Singular control optimal stopping of memory mean-field processes. SIAM J. Math. Anal. 51(1):450–468. DOI: 10.1137/18M1174787.
- Lions, P.-L. (2014). Cours au Collège de France: Théorie des jeux à champs moyens.
- Carmona, R., Delarue, F. (2015). Forward–backward stochastic differential equations and controlled McKean–Vlasov dynamics. Ann. Probab. 43(5):2647–2700. DOI: 10.1214/14-AOP946.
- Bensoussan, A., Yam, S. C. P., Zhang, Z. (2015). Well-posedness of mean-field type forward–backward stochastic differential equations. Stoch. Process. Appl. 125(9):3327–3354. DOI: 10.1016/j.spa.2015.04.006.
- Djehiche, B., Hamadene, S. (2019). Mean-field backward–forward stochastic differential equations and nonzero sum stochastic differential games. arXiv preprint arXiv:1904.06193
- Øksendal, B., Sulem, A. (2015). Risk minimization in financial markets modeled by Itô–Lévy processes. Afr. Mat. 26(5–6):939–979. DOI: 10.1007/s13370-014-0248-9.