Abstract
We consider impulse control of stochastic functional differential equations (SFDEs) driven by Lévy processes under an additional -Lipschitz condition on the coefficients. Our results, which are first derived for a general stochastic optimization problem over infinite horizon impulse controls and then applied to the case of a controlled SFDE, apply to the infinite horizon as well as the random horizon settings. The methodology employed to show existence of optimal controls is a probabilistic one based on the concept of Snell envelopes.
1. Introduction
The standard stochastic impulse control problem is an optimal control problem that arises when an operator controls a dynamical system by intervening on the system at a discrete set of stopping times. Generally, an intervention can be represented by an element in the control set U which we assume to be a compact subset of .
In impulse control the control law thus takes the form , where is a sequence of times when the operator intervenes on the system and is the impulse with which the operator affects the system at time . The standard impulse control problem in infinite horizon can be formulated as finding a control that maximizes

(1)

where is a constant referred to as the discount factor, is an -valued controlled stochastic process that jumps at interventions (e.g. by setting for some deterministic function Γ) and the deterministic functions and give the running reward and the intervention costs, respectively. The quantity thus represents the cost incurred by applying the impulse at time when the state is x.
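To make the objective in (1) concrete, the following sketch evaluates a discounted impulse reward of this form along a single discretized, deterministic path. Everything supplied by the caller (the running reward f, the cost c, the jump map Γ, the mean-reverting drift) is a hypothetical stand-in for illustration, not an object from this article:

```python
import math

def impulse_reward(f, c, Gamma, x0, rho, taus, betas, T, dt=0.01):
    """Approximate, along one deterministic path, a discounted impulse
    reward of the form (1): the integral of e^{-rho*t} f(X_t) dt minus the
    summed discounted intervention costs c(X_{tau-}, beta), where the state
    jumps to Gamma(X_{tau-}, beta) at each intervention time tau."""
    x, t, reward, k = x0, 0.0, 0.0, 0
    while t < T:
        # apply any impulse scheduled at or before the current grid time
        while k < len(taus) and taus[k] <= t:
            reward -= math.exp(-rho * taus[k]) * c(x, betas[k])
            x = Gamma(x, betas[k])
            k += 1
        reward += math.exp(-rho * t) * f(x) * dt
        x += -0.5 * x * dt  # hypothetical mean-reverting uncontrolled drift
        t += dt
    return reward
```

For instance, with a running penalty f(x) = -|x|, a flat cost and Γ(x, b) = b, a single intervention resetting the state to the origin beats doing nothing.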
As impulse control problems appear in a vast number of real-world applications (see e.g. [19,23] for applications in finance and [2,5] for applications in energy), considerable attention has been given to various types of problems where the control is of impulse type. In the standard Markovian setting, where solves a stochastic differential equation (SDE) driven by a Lévy process on , the relation to quasi-variational inequalities has frequently been exploited to find optimal controls (see the seminal work in [3] or turn to [21] for a more recent textbook). In the non-Markovian framework an impulse control problem in finite horizon was solved in [8] by utilizing the link between optimal stopping and reflected BSDEs (originally discovered in [13]), while considering the reward functional where is now a random (and not necessarily Markovian) field and the controlled process takes the particular form , with L an (exogenous) non-controlled process, under the assumption that U is a finite set. Also relevant is the treatment of multi-modes optimal switching problems in a non-Markovian setting in [11].

In [16] the original work of [8] was extended to incorporate delivery lag by setting for a fixed . As in [22], the work in [16] is based on the assumption that . This work was later extended to the infinite horizon setting in [10]. Notable is also the recent work on finite horizon impulse control of SFDEs driven by a Brownian motion in [17].
In the present article we take a different approach to all the above-mentioned works by considering the abstract reward functional

(2)

where the terminal reward φ maps controls to values on the real line and is measurable with respect to , where is the Borel σ-field of , with , for and is the σ-field of a complete probability space . The intervention cost c is also assumed to be a -measurable map, in addition to being bounded from below by a deterministic positive function. We consider the partial information setting and assume that we observe the system through a filtration of sub-σ-fields of and thus restrict our attention to -adapted controls.
To indicate the applicability of the results we consider the special case when

(3)

and

(4)

where solves an impulsively controlled stochastic functional differential equation (SFDE) driven by a Lévy process under an additional -type Lipschitz condition on the coefficients of the SFDE. Furthermore, we will see that the results easily extend to problems with a random horizon, which allows us to model aspects such as default in financial applications. We thus extend the result in [17] on the one hand by considering a more general driving noise, and on the other by considering both the infinite and the random horizon settings. Our treatment of the random horizon problem also motivates the exploration of partial information, as optimal controls may be fundamentally different in the partial information setting.
The main contributions of the present work are twofold. First, we show that the problem of maximizing J has a solution under certain assumptions on φ and c, summarized in the definition of an admissible reward pair, by finding an optimal control in terms of a family of interconnected value processes. We refer to this family of processes as a verification family. Furthermore, we give a set of conditions under which the reward pair defined by (3)–(4) is admissible.
The remainder of the article is organized as follows. In the next section we state the problem, set the notation used throughout the article and detail the set of assumptions that are made. In particular, we introduce the notion of an admissible reward pair. Then, in Section 3 a verification theorem is derived. This verification theorem is an extension of the verification theorem for the multi-modes optimal switching problem with memory developed in [24] and presumes the existence of a verification family. In Section 4 we show that, under the assumptions made, there exists a verification family whenever is an admissible reward pair, thus proving existence of an optimal control for the impulse control problem with the cost functional J defined in (2). Then, in Section 5 we show that a type of impulse control problem for controlled SFDEs satisfies the conditions on φ and c prescribed in the definition of an admissible reward pair, both in the infinite and random horizon settings. Finally, in the appendix, we recall some results, such as those on the Snell envelope, that are useful when showing existence of optimal controls.
2. Preliminaries
Let be a complete probability space and a filtration of sub-σ-fields of satisfying the usual conditions, in addition to being quasi-left continuous. We assume that is the trivial σ-field and define .
Throughout, we will use the following notations. Let:

be the σ-algebra on of -progressively measurable subsets.

For , be the set of all finite, -valued, -measurable, càdlàg processes such that and let be the subset of processes that are quasi-left upper semi-continuous (see Appendix 1 for a definition of quasi-left continuity).

be the set of all -stopping times and for each we let be the corresponding subset of stopping times τ such that , -a.s. Furthermore, we let (resp. ) be the subset of (resp. ) with all stopping times τ for which .

For each , be the set of all -measurable random variables taking values in U.

be the set of all controls , where (the intervention times) is a non-decreasing sequence of -stopping times, (the interventions) and is the (random, -measurable) number of interventions.

denote the subset of for which is -a.s. finite on compacts (i.e. ) and for all we let be the set of all controls truncated at k interventions and set .

For a random interval A (i.e. a set of the type , , or for some ), (and resp. ) be the subset of (and resp. ) with , -a.s. for . When the interval is for some we use the shorthand (and resp. ).

be the subset of with all finite sequences and for we let .

, with and , where n is possibly infinite, denote a generic element of .

For and , we introduce the composition, denoted by °, defined as . For , we define the truncation to interventions as .

The composition operator ° be extended to controls by setting , where , whenever and .

for and set .
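As an informal illustration of how impulse sequences can be manipulated, the sketch below represents a control as a time-sorted list of (time, impulse) pairs and implements a composition and a truncation operation. The article's precise formula for ° is not reproduced above, so the concatenation rule used here is only an assumed reading, and both function names are hypothetical:

```python
def compose(u, v):
    """Hypothetical sketch of a composition u ° v of two impulse controls,
    each a time-sorted list of (time, impulse) pairs.  ASSUMPTION: we keep
    the interventions of u and append those of v occurring strictly after
    u's last intervention time; the paper's actual definition may differ."""
    if not u:
        return list(v)
    t_last = u[-1][0]
    return list(u) + [(t, b) for (t, b) in v if t > t_last]

def truncate(u, k):
    """Truncation of a control to its first k interventions."""
    return list(u[:k])
```

With this reading, composing a control ending at time 1 with one intervening at times 0.5 and 2 keeps only the later intervention of the second control.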
Furthermore, we introduce the following set:
Definition 2.1
We let be the set of all -measurable maps such that the collection is uniformly integrable and (outside of a -null set) we have, for all :

(i) The limit exists;

(ii) ;

(iii) .

Furthermore, let be the set of all such that for any predictable stopping time and any announcing sequence with we have , -a.s.

The sets and will play an important role in the characterization of optimal controls.
2.1. Problem formulation
With the notations above, the problem we deal with is characterized by the following objects:

a complete probability space .

A -measurable map .

A -measurable map .
To obtain existence of optimal controls we need to make some assumptions on the involved objects. The assumptions that we will use are summarized in the definition of what we refer to as an admissible reward pair:
Definition 2.2
We call the pair an admissible reward pair if, for some p>2:

(i) The terminal reward φ and the intervention cost c are both right-continuous in the intervention times (uniformly in the interventions) and satisfy the following bounds:

(a) .

(b) and for all , -a.s., where is a deterministic, continuous, non-increasing and positive function, i.e. , whenever .

(ii) For every and every and T>0, there are maps , and such that for all and we have and , -a.s. (with a -null exception set that can be chosen independently of b).

(iii) We have as .
The conditions in the above definition are mainly standard assumptions for infinite horizon stochastic impulse control problems translated to our setting (see e.g. [10]). Condition (i.a), together with positivity of the intervention cost c in (i.b), implies that the expected maximal reward is finite. Condition (iii) implies that the future has diminishing impact on the total reward and can be seen as a generalization of the deterministic discounting applied in (1). We show below that the boundedness of the intervention costs from below by a positive function, together with (i.a), implies that, with probability one, the optimal control (whenever it exists) can only make a finite number of interventions within any compact time interval.
Remark 2.1
Note that we may hide part of the intervention cost within the function φ, which implies that, similarly to the setting in [20], we can handle problems with negative intervention costs as long as a type of martingale condition is satisfied.
Recall the reward functional given by (2). The problem we deal with can be formulated as:
Problem 2.1
Find , such that
(5)

when is an admissible reward pair.
Throughout Sections 3–4 we will thus assume that is an admissible reward pair, before we state a set of conditions under which we are able to show that a particular of the form (3)–(4) is an admissible reward pair.

As a step in solving Problem 2.1 we need the following proposition, which is a reduction result for impulse control problems.
Proposition 2.3
Suppose there is a such that for all . Then is an optimal control for (5), i.e. for all .
Proof.
Pick . Then there is a such that with , where . Furthermore, by positivity of the intervention costs, for all , by Definition 2.2.(i). However, again by Definition 2.2.(i.a) we have . Hence, is dominated by the strategy of doing nothing and the assertion follows.
2.2. Relevant properties
We note the following properties:
Lemma 2.4
(a) If (resp. ), then and are also in (resp. ).

(b) If then there is a -measurable càdlàg process, , of class [D], such that , -a.s. for each . If then is quasi-left upper semi-continuous.

(c) If , then is -measurable and càdlàg.

(d) If is a sequence in that converges uniformly to some h (outside of a -null set) then .
Proof.
Moving on to property b), we let and note that for
, there is a
such that
-a.s., by Corollary A.5 in Appendix 3. By Theorem A.6 we can define the sequence of càdlàg processes
as
Now, we let
and then recursively define
for
. Then,
is a non-decreasing sequence of càdlàg processes and
-a.s., for all
and all
. Furthermore, for
we let
and get that
Since
we have, by quasi-left continuity of the filtration and uniform integrability that
Now, as
, it follows by Definition 2.1.(ii) that
,
-a.s., as
. We note that
is an increasing, uniformly bounded, sequence of
-measurable càdlàg processes. The sequence, thus, converges to a
-measurable process,
. It remains to show that
is càdlàg, quasi-left upper semi-continuous and that it agrees with
on stopping times.
To show that the limit is càdlàg we let
We note that
has left and right limits and that, furthermore,
. Now, if
then
for some
-measurable
, but then we would have
contradicting the fact that
is right continuous. We conclude that
is a non-increasing sequence with
a
-adapted càdlàg process and
For
we know that there is a non-increasing sequence
, with
a
-measurable r.v. taking values in
such that
Now, as
we have
. It follows that
-a.s., and in particular we find that
,
-a.s. as
. This gives that for any sequence
and
,
and
Letting l tend to infinity we find that
Similarly we get existence of left-limits and by the uniform integrability property imposed on members of
we conclude that
is a càdlàg,
-measurable process of class [D].
For , let
be a non-increasing sequence of stopping times in
(the subset of
with all stopping times taking values in the countable set
) such that
. We may, for example, set
. Since
is countable we have
(6)
-a.s. Now, by right-continuity we get that
. Letting
be a sequence of maximizers for the right-hand side of (Equation6
(6)
(6) ) at times
we get
-a.s. Moreover, for each
there is a subsequence
such that
Since U is compact, there is a subsequence
such that
converges to some
and so we have
Now,
-a.s., by Definition 2.1.(ii) and
-a.s., by the upper semi-continuity declared in Definition 2.1.(iii). Further, as
we conclude that
-a.s. On the other hand, there is a
such that
,
-a.s., and we have
-a.s. This implies that the limit exists with
,
-a.s., establishing that
,
-a.s. Finally, if
then quasi-left upper semi-continuity of
is immediate from Definition 2.1 and (b) follows.
Property (c) follows similarly by noting that for any we can choose such that and we can choose the sequence such that .
Concerning the last property we note that for each we can, by uniform convergence, choose a
-a.s. finite
such that
for all
. Then,
and property (i) in Definition 2.1 follows as
was arbitrary. The remaining properties follow similarly.
3. A verification theorem
Our approach to finding a solution to Problem 2.1 is based on deriving an optimal control under the assumption that a specific family of processes exists, and then showing that this family does indeed exist. We will refer to any such family of processes as a verification family. Before making precise the concept of a verification family, we introduce the notion of consistency:
Definition 3.1
We refer to a family of processes as being consistent if, for each , the map given by is -measurable and, for each and each , we have , -a.s.
We are now ready to state the definition of a verification family:
Definition 3.2
We define a verification family to be a consistent family of càdlàg supermartingales such that for each :

(a) The family satisfies the recursion

(7)

(b) The family is uniformly bounded in the sense that

(c) The map belongs to .

(d) , as .
The purpose of the present section is to reduce the solution of Problem 2.1 to showing existence of a verification family. This is accomplished by the following verification theorem (the proof of which follows along the lines of the proof of Theorem 2 in [9]):
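The Snell envelope machinery that drives the verification argument is easiest to see in a deterministic discrete-time toy, where conditional expectation disappears and the envelope reduces to a running maximum computed by backward recursion; the optimal stopping time is then the first time the envelope touches the reward, mirroring the definition of the intervention times in (8). This is only a cartoon of the general theory:

```python
def snell_envelope(reward):
    """Discrete-time Snell envelope of a deterministic reward sequence:
    the smallest supermartingale dominating the reward.  In this
    deterministic toy (no conditional expectation) the envelope is
    Y_t = max_{s >= t} reward_s, built by backward recursion, and the
    optimal stopping time is the first t with Y_t == reward_t."""
    n = len(reward)
    Y = [0.0] * n
    Y[n - 1] = reward[n - 1]
    for t in range(n - 2, -1, -1):
        # stop now or continue: the backward dynamic-programming step
        Y[t] = max(reward[t], Y[t + 1])
    tau = next(t for t in range(n) if Y[t] == reward[t])
    return Y, tau
```

In the stochastic setting the maximum over the future is replaced by a conditional expectation under the filtration, which is what the results in the appendix supply.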
Theorem 3.3
Assume that there exists a verification family and let:

(i) the sequence be given by

(8)

using the convention that , with and set ;

(ii) the sequence be defined recursively as a measurable selection of

(9)
Then is an optimal control for (5) in the sense that
. Moreover, the family is unique (i.e. there is at most one verification family, up to indistinguishability of the maps
and
) and
(where
).
Proof.
The proof is divided into three steps where we first, in Step 1, show that for any we have
(10)
-a.s. for
. Following this, in Step 2, we show that
. Then in Step 3 we show that
is the optimal control, establishing (i) and (ii). A straightforward generalization to arbitrary initial conditions
then gives that
(11) by which uniqueness follows. Below we refer to the properties of a verification family in Definition 3.2 simply as properties (a), (b), (c) and (d).
Step 1 We start by showing that for each the recursion (7) can be written in terms of a
-stopping time and that the inner supremum is attained,
-a.s. In particular, this will imply the existence of a maximizer in (9). From (7) and consistency we note that
is the smallest supermartingale that dominates
(12) By Property (c) and Definition 2.2.(ii) we have that the map
belongs to
. It thus follows from Lemma 2.4.(b) that
is a càdlàg process of class [D] that is quasi-left upper semi-continuous on
. Furthermore, by Property (d) and positivity of the intervention costs we note that
,
-a.s. By Theorem A.1.(iii) in Appendix 2 and consistency we conclude that for any
, the stopping time
given by
is such that:
Now, since
, the map
is
-measurable and u.s.c. on
for some
-null set
. Corollary A.5 of Appendix 3 and consistency then implies that there is a
such that
-a.s., and in particular (10) holds. As mentioned above, this also implies the existence of a
-measurable
satisfying (9).
Step 2 We now show that . We start by noting that Y is the Snell envelope of
and by Step 1 we thus have (since
is trivial) that
Moving on we pick
and note that
. But then, by Step 1, we have that
By induction we get that for each
we have,
(13) Now, arguing as in the proof of Proposition 2.3 and using Property (b) we find that
. To show that the right-hand side of (13) equals we note that (13) can be rewritten as
which gives
(14) for all
. For the first term on the right-hand side we note that for any T>0 we have
where we have used Hölder's inequality to arrive at the last inequality. As
is an admissible reward pair, Definition 2.2.(iii) gives that the second term can be made arbitrarily small by choosing T sufficiently large and Definition 2.2.(i) implies that the first term tends to zero as
for all finite T, since
We thus conclude that the first term on the right-hand side in (14) tends to zero as .
For the second term we note that letting in (13) and using Property (b) and Definition 2.2.(i.a) we find that
converges increasingly to a limit in
as
. Hence, the second term on the right-hand side of (14) also tends to zero as
.
Conditioning on in the third term of the right-hand side of (14) and noting that
is
-measurable, we find that, similarly to the above case, we have for any
that
where the second term can be made arbitrarily small by Property (d) and we conclude that
.
Step 3 It remains to show that the strategy is optimal. To do this we pick any other strategy
. By Step 2 and the definition of
in (7) we have
but in the same way
-a.s. Repeating this procedure K times gives
Now, we have
where the right-hand side tends to zero as
by repeating the argument in Step 2, which is possible since
. We conclude that
for all
and it follows by Proposition 2.3 that
is an optimal control for Problem 2.1.
4. Existence of the verification family
Theorem 3.3 presumes existence of the verification family . To obtain a satisfactory solution to Problem 2.1, we thus need to establish that a verification family exists. This is the topic of the present section. We will follow the standard existence proof, which goes by applying a Picard iteration (see [5,11,15]). We first show that there exists a sequence of consistent families of processes that satisfy the recursion

(15)

and

(16)

for . Then, we show that the limit family obtained by letting is a verification family.
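The Picard iteration over the number of allowed interventions can be mimicked in a toy discrete-time, finite-horizon problem: Y^0 is the value with no interventions and each further iterate allows one more. The dynamics, rewards and costs below are hypothetical stand-ins, and the monotone improvement in k is the only feature shared with the recursion (15)–(16):

```python
from functools import lru_cache

def value_with_k_interventions(k, x0, impulses, f, cost, gamma, T):
    """Toy analogue of the iterates Y^k: the optimal discounted reward
    over T discrete steps from x0 when at most k interventions
    (instantaneous state resets at a fixed cost) are allowed.  Y^0 allows
    none; Y^{k+1} improves on Y^k by permitting one extra intervention."""

    @lru_cache(maxsize=None)
    def V(t, x, rem):
        if t == T:
            return 0.0
        # continue without intervening: collect f(x) and keep the state
        stay = f(x) + gamma * V(t + 1, x, rem)
        if rem == 0:
            return stay
        # intervene: pay the cost, jump to b, collect f(b) this step
        act = max(-cost + f(b) + gamma * V(t + 1, b, rem - 1)
                  for b in impulses)
        return max(stay, act)

    return V(0, x0, k)
```

By construction the values are non-decreasing in k, which is the discrete shadow of the monotone convergence established below.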
Proposition 4.1
There is a sequence of consistent families of càdlàg supermartingales such that for each :

(a) The sequence satisfies the recursion (15)–(16).

(b) There is a K>0 (that does not depend on k) such that,

(c) For each , the map belongs to .

(d) , as , uniformly in k.
The proof of Proposition 4.1 will be based on two lemmas and the following induction hypothesis:
Hypothesis (VF.k). There is a sequence of consistent families of càdlàg supermartingales such that for
and
:
The relation (15) holds for and (16) holds for .
The map
belongs to
.
We note that Hypothesis VF.k lacks properties (b) and (d) of Proposition 4.1. In the following two lemmas we show that these are implicit.
Lemma 4.2
Assume that Hypothesis VF.k holds for some . Then, the sequence of families of processes
is well defined and uniformly bounded in the sense that there is a K>0 (that does not depend on k) such that,
Furthermore, for each
(whenever it is well defined) the collection
of random variables is uniformly integrable.
Proof.
We note that under Hypothesis VF.k, the sequence of families exists and is uniquely defined up to indistinguishability for each
by repeated application of Theorem A.1 in Appendix 2. By the definition of
and positivity of the intervention costs we have that for any
,
For
, we define the càdlàg supermartingale
and the stopping times
for all
. Then, by Definition 2.2.(i.a) we have
In particular, by right-continuity this implies that
or
where C does not depend on v and k. The first assertion now follows as
Concerning the second claim, note that for each
and each
, repeating the proof of Corollary A.5 in Appendix 3 with
instead of g we find that there is a
such that
Now,
where the right-hand side is bounded, uniformly in
and
, by the above in combination with Definition 2.2.(i.b) and Doob's maximal inequality.
We also have the following diminishing future impact property:
Lemma 4.3
Assume again that Hypothesis VF.k holds for some . Then,
as
, uniformly in k.
Proof.
By the properties of the essential supremum and positivity of the intervention costs we have for every ,
This implies that
The desired result now follows by a similar argument to the one used in the proof of Lemma 4.2 and Definition 2.2.(iii).
Proof of Proposition 4.1
First, note that by Definition 2.2.(ii) there is a such that for
we have
(17) for all
,
-a.s. The statement, thus, holds for k=0.
Moving on, we assume that VF.k holds for some . But then, by Lemmas 4.2 and 4.3, we can apply reasoning similar to that in the proof of Theorem 3.3 to find that
is a càdlàg supermartingale with
(18)

By Definition 2.2.(ii) it follows that there is a consistent family satisfying (18) such that
and we conclude that VF.k+1 holds as well. By induction this extends to all
.
The objective in the remainder of this section is to show that the limit family that we get when letting in
is a verification family.
Proposition 4.4
For each , the limit
, exists as an increasing pointwise limit,
-a.s.
Proof.
Since we have that
,
-a.s. Moreover, by Proposition 4.1 the sequence is bounded
-a.s., thus, it converges
-a.s. for all
.
To assess the type of convergence that we have for the sequence , we introduce a sequence of families of processes corresponding to a truncation of the time interval. For each T>0 and
, we define the consistent family
of càdlàg supermartingales as
for all
with
. Then,
Lemma 4.5
The sequence satisfies:

(i) .

(ii) For each there is a such that , with

(iii) There is a -a.s. finite -measurable random variable ξ and a constant q>0 such that -a.s. for each .
Proof.
The inequality in (i) follows from noting that .
For the second statement we note that by Lemma 2.4.(c), the process is
-measurable and càdlàg. Now, each
can be decomposed as
with
and
, which implies that
(19) where
and
are the number of interventions in
and
, respectively. We thus define the sets
and have by (19) that
for
. Furthermore, as
and
we find that
for all
.
Now, let
(recalling our convention that ) and pick
such that
Then, by right continuity we have
where
We thus only need to show that there is a T>0 such that
for all
. We have,
where the right-hand side is independent of k and tends to 0 as
by Definition 2.2.(iii). We thus conclude that there is a
such that
for all
.
Concerning the third statement, we note that for , we have for each
and all
, that
and similarly
for all
(where the inequalities hold
-a.s.). Now, arguing as in the proof of Lemma 4.2 we have
and we conclude that there is a
-null set
such that for each
we have
.
For , let
be such that
We note that on
we have
and get that for
(in the remainder of the proof
denotes a generic
-null set), we have
and we conclude that
for all
.
Now, for all we have,
where we have introduced
corresponding to the truncation
of
. As the truncation only affects the performance of the controller when
we have
Applying Hölder's inequality we get that for
,
with
. Since
, there is thus a
-a.s. finite
-measurable r.v.
such that (for all τ and β) we have
Since,
was arbitrary we can choose β such that
-a.s. and by right-continuity the last statement follows as
was arbitrary.
Proposition 4.6
For each , we have
as
, outside of a
-null set.
Proof.
By Lemma 4.5.(ii) there exist for each , a
and a measurable set
with
such that
for all
and
. Furthermore, by Lemma 4.5.(iii) there is a
-a.s. finite r.v., ξ, such that
Combining these and taking the limit as
we find that
on
for some
-null set
. Now, as
was arbitrary the statement follows.
We are now ready to show that a verification family exists, establishing the existence of optimal controls for Problem 2.1.
Proposition 4.7
A verification family exists.
Proof.
Letting we have by Proposition 4.6 that
converges uniformly in
to
as
(outside of a
-null set). Since
by Proposition 4.1.(c), we have by Lemma 2.4.(d) that . In particular, we conclude that property (c) in the definition of a verification family holds for the limit family .
Moreover, for each and
we have that
,
-a.s., as
and we conclude by consistency of
for each
that
,
-a.s., implying consistency of
.
We treat each of the remaining properties separately:
(a) By the above and (b) of Lemma 2.4 we have that is a càdlàg, quasi-left upper semi-continuous process of class [D]. In particular we note that
is càdlàg. Applying (iv) of Theorem A.1 in Appendix 2 then gives
(b) By Proposition 4.1 we have that
is uniformly bounded in k. From this it follows immediately that
.
(d) We have that
where the first term on the right-hand side can be made arbitrarily small by choosing k sufficiently large and the last term tends to 0 as
for all
.
5. Application to impulse control of SFDEs
In [24] a finite horizon impulse control problem with a discrete set U was solved when the underlying process followed a stochastic delay differential equation (SDDE) under a loop condition on the impulses. This problem was motivated by hydropower operation, where the flow times between different power plants induce delays in the dynamics of the controlled system.
In this section we extend the results from [24] by considering a discounted infinite horizon setting, allowing an uncountable control set U, and also by taking the dynamics of the underlying process to follow a stochastic functional differential equation. Furthermore, our prior treatment of the problem with abstract reward, φ, and intervention cost, c, allows us to consider a less restrictive set of assumptions on the coefficients in the problem formulation. In particular, we are able to remove the loop condition.

Our treatment of non-Markovian impulse control problems in infinite horizon should also be compared to [10], where an infinite horizon impulse control problem in a non-Markovian framework with a fixed discrete delay is considered. The work presented in this section goes in a different direction by having an underlying dynamics, driven by a Lévy process, that is affected by the impulses in the control, resulting in a more complex relation between the control and the output of the performance functional. Furthermore, we investigate the important extension to a random horizon, which turns out to be a trivial modification of our initial problem.
Throughout this section, we will only consider controls for which ,
-a.s., and restrict our attention to the setting when the underlying uncertainty stems from a process
, with
, defined as
where
(20)

(21)

for some
, the set of all (deterministic) uniformly bounded, càdlàg functions
, and
(22)

(23)

The dynamics of
are driven by a d-dimensional Brownian motion B and a Poisson random measure P with intensity measure
, where
is the Lévy measure on
of P and
is called the compensated jump martingale random measure of P. We assume that
is the natural filtration generated by B and P, with
.
As mentioned above, we assume that all uncertainty comes from the process and consider the discounted setting with a continuous discount factor
. The reward functional is then
(24)
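A crude way to evaluate a discounted reward functional of this form for a fixed control is Monte Carlo simulation. The sketch below estimates only the discounted running-reward part for an uncontrolled Itô diffusion via Euler-Maruyama, ignoring the jump part, the path dependence of the SFDE and the intervention terms; all coefficients are caller-supplied hypothetical stand-ins:

```python
import math
import random

def mc_discounted_reward(f, rho, x0, mu, sigma, T, dt=0.01,
                         n_paths=2000, seed=1):
    """Monte Carlo sketch of a truncated discounted running reward,
    E[ integral_0^T e^{-rho*t} f(X_t) dt ], for an uncontrolled diffusion
    dX = mu(X) dt + sigma(X) dB simulated by Euler-Maruyama."""
    rng = random.Random(seed)
    total = 0.0
    n = int(T / dt)
    for _ in range(n_paths):
        x, acc = x0, 0.0
        for i in range(n):
            t = i * dt
            acc += math.exp(-rho * t) * f(x) * dt  # left Riemann sum
            x += mu(x) * dt + sigma(x) * math.sqrt(dt) * rng.gauss(0.0, 1.0)
        total += acc
    return total / n_paths
```

For constant f and sigma = 0 the estimator reduces to a deterministic Riemann sum of the discount factor, which gives a quick sanity check.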
5.1. Assumptions
We assume that the involved coefficients satisfy the following constraints:
Assumption 5.1
For any , , and and for some and p>2 we have:

(i) The function satisfies the Lipschitz condition and the growth condition for some constant .

(ii) The coefficients and are continuous in t and satisfy the growth condition and the Lipschitz continuity

(iii) There is a , with such that satisfies

(iv) The running reward is -measurable and satisfies the growth condition Moreover, there is a non-decreasing function such that for all , whenever .

(v) There is a finite collection of closed connected subsets of U and corresponding maps that are jointly continuous in , bounded from below, i.e. , of polynomial growth, and locally Lipschitz in x, i.e. there is a non-decreasing function such that for all , whenever , and we have .
Note that the growth condition on Γ in (i) implies that interventions can only increase the magnitude of the state as long as . In particular, this avoids the problem of explosions in finite time due to impulses.
Remark 5.1
To see that the above SFDE is a generalization of discrete delay SDDEs with Lipschitz coefficients note that if satisfies
for each
, then for
we have
Remark 5.2
In the above assumptions the involved coefficients are all deterministic. We remark that a trivial extension is to allow these to depend on ω as well in which case the coefficients in the Lipschitz conditions can be taken to be non-decreasing, -a.s. finite,
-measurable càdlàg processes.
The motivation for allowing intervention costs that are discontinuous in b is the important application of production systems, where increasing the production beyond a certain threshold may necessitate a costly startup of additional production units.
5.2. Existence of optimal controls
In this section we show that the problem of maximizing the reward functional (24) has a solution. Throughout we will, for notational simplicity, only consider the one-dimensional case (d=1), but we note that all results extend trivially to higher dimensions. We start with the following moment estimate:
Proposition 5.2
Under Assumption 5.1, the SFDE (20)–(23) admits a unique solution for each
. Furthermore, the solution has moments of order pq on compacts, in particular we have for T>0, that
(25) where
and for each
, we have
(26) where
.
Proof.
By repeated use of Theorem 3.2 in [1], existence and uniqueness of solutions to (20)–(23) follows since
,
-a.s. By Assumption 5.1.(i) we get, for
, using integration by parts, that
We note that if
and
for some
then there is a largest time
such that
. This means that during the interval
interventions will not increase the magnitude
. By induction, since
is finite, we find that
for all
, where
,
,
for
and
. Letting
we thus find that for
,
Now, since
and
coincide on
we have
and
From Assumption 5.1.(ii)-(iii) and the Burkholder-Davis-Gundy inequality we get that
and Grönwall's lemma gives that
(27)
-a.s., where the constant
does not depend on u or j, and (25) follows by letting t=0. We now give a more straightforward way of showing (26) than the method used in the proof of Lemma 4.2. Applying (27) to the left-hand side of (26) we get
and the desired result follows from (25).
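The estimate above is closed with Grönwall's lemma; for reference, the standard integral form used in bounds of this type reads:

```latex
f(t) \;\le\; a + C\int_0^t f(s)\,\mathrm{d}s \quad \text{for all } t\in[0,T]
\qquad\Longrightarrow\qquad
f(t) \;\le\; a\,e^{Ct} \quad \text{for all } t\in[0,T],
```

for any nonnegative, integrable f and constants a, C ≥ 0.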
Lemma 5.3
For each , there is a
-null set
such that for all
and all
the limit
exists in the topology of uniform convergence on compact subsets of
. Furthermore, for all
, we have
-a.s., for any
,
and
(with an exception set that is independent of
).
Proof.
Our proof will rely on a pre-localization argument and we introduce the following non-decreasing sequence of stopping times
for
and set
By Assumption 5.1.(iii) it then follows that
,
-a.s. as
. Furthermore, we note that on
the magnitude of the jumps of
due to the Poisson jump integral of (20)–(23) is bounded by
and repeating the argument in the proof of Proposition 5.2 gives that
(28) for all
.
For , we let
solve the SFDE (20)–(23) with integrand
in the jump part and let
be the largest integer such that
. Then by Assumption 5.1.(i) we have for
(recalling that
is the truncation of u limiting the number of interventions to l),
with
. We define
,
and let
and set
. Then, since the jump part is deactivated during
and by (28), we have
For
, we have by Assumption 5.1.(i) that
Now, for
,
with
. Taking the absolute value on both sides we get
The Burkholder-Davis-Gundy inequality now gives
Appealing to the boundedness of the jumps and the integral Lipschitz conditions on the coefficients then gives that
for all
. Now, Grönwall's lemma gives
where C does not depend on
. Furthermore, for each
and each
(for some
-null set
) there is a
such that
. Uniform convergence on
, thus, follows by applying a Kolmogorov continuity argument (see e.g. Theorem 72 in Chapter IV of [Citation25]) and uniform right-continuity follows as
,
-a.s. The existence of limits follows similarly.
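For the reader's convenience, the Kolmogorov continuity argument invoked here is, in its standard form for a family of random variables indexed by a parameter b ranging over an m-dimensional set:

```latex
\mathbb{E}\big[\,|Y_b - Y_{b'}|^{p}\,\big] \;\le\; C\,|b - b'|^{\,m+\beta}
\quad \text{for some } p,\beta,C>0,
```

which implies that Y admits a modification that is locally Hölder continuous in b; this is the criterion behind the uniform convergence on compacts.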
Definition 5.4
For all and
we define the map
as
Moreover, for
we define the truncation
of
as
where
, and for
we define the localization
of
as
where
.
Corollary 5.5
For each ,
,
and
the map
has limits everywhere and is
-a.s. continuous on
, where
are the jump times of P.
Proof.
Let
and note that for
we have
Now, by Lemma 5.3 it follows immediately that
and from its proof we have that
whenever
. Concerning the intervention costs we have
(29) where the first term tends to zero as
by joint continuity of ℓ, continuity of ρ and right-continuity of X. By continuity of ℓ, the assertion follows by repeating the argument in the proof of Lemma 5.3.
Lemma 5.6
For each T>0 and there is, for every
, a
such that for all
and
we have
-a.s. (with an exception set that is independent of b).
Proof.
For any it follows by Corollary 5.5 and Theorem A.6 in Appendix 3 that there is for each
an
-optional càdlàg process
such that
-a.s. for any
. Now, pick a sequence
of positive real numbers such that
and for
define
. Then, there is a control
such that
Define the sequence of càdlàg processes
as
and set
. Then,
is an increasing
-a.s. finite sequence of càdlàg processes and it, thus, converges pointwise,
-a.s. to a limit
that, moreover, is
-measurable. We note that for any
and
we have with
and
, that
and as
we get that
Jensen's inequality now gives that
Letting
and using that the map
is right-continuous uniformly in u it follows that
-a.s., for each stopping time
.
We now show that is a càdlàg process. First, since
is the limit of an increasing sequence of càdlàg processes we have that
. For any
and
let
Then as
is non-decreasing, the sequence
is non-increasing. Let
and note that
by right-continuity of the filtration and
on
. Moreover, with
, Fatou's lemma gives
On the other hand, we have
as
and we conclude that
and, since
was arbitrary, it follows that
.
To prove that has left limits we define, for
, the sequence
as
and then recursively let
We note by the above discussion that
and furthermore, by right-continuity that
and
,
-a.s. If not, we would have
on some set A of positive measure. However, as increments in the jump integral part are
-a.s. zero at predictable times we note by Corollary 5.5 that
is continuous in t at
on
for some
-null set
, uniformly in u. Now, as the filtration is quasi-left continuous this implies that
on
, a contradiction. Letting
we find that
is a sequence of càdlàg processes with
and we conclude that
is càdlàg.
By repeating the argument in the proof of Lemma 5.3 we find that
-a.s. for any
and
and it follows that
Hence, by Kolmogorov's continuity theorem and Corollary 5.5 it follows that there is a unique map
such that
-a.s. for all
. By dominated convergence we find that
converges pointwise to some h as
. We define the set
and note that for
, we have
where
and the last step follows by Hölder's inequality and Proposition 5.2. Now, the right-hand side of the last inequality goes to zero as
by the definition of
and Proposition 5.2 and by uniform convergence we conclude that there is a
such that
-a.s. for each
.
It remains to show that we can choose the exception set to be independent of b. Let be a sequence of finite subsets of U with
. For
define
as a measurable selection of
. Then since
takes values in a finite set we have
-a.s. By continuity it follows that
-a.s. Furthermore, by uniform integrability and
-a.s. continuity of
uniformly in u we have that
and we conclude that
-a.s. From this the statement follows as
was arbitrary.
This far we have not made any assumption on the discount factor ρ, other than it being continuous. Clearly, some assumptions on the growth of ρ have to be made in order for the maximization problem to have a finite value. We summarize our assumptions in the following hypothesis:
Hypothesis. [Disc.-A] There is an such that
,
and
for all
and
. Furthermore, for each
there is an
such that for all
for some
we have
and
for all
.
Remark 5.3
An important situation where Hypothesis Disc.-A holds with for any
is when the functions ϕ and ℓ are eventually bounded, i.e. when there is a
such that
and
for all
. Another important case is when
grows linearly in T, where C is the bound in Proposition 5.2.
We are now ready to state the main result of this section, showing that under Assumption 5.1 and Hypothesis Disc.-A an optimal control for the problem of maximizing J exists.
Proposition 5.7
Under Hypothesis Disc.-A there is a such that
for all
. Furthermore,
is given by the recursion (8)–(9), with
and
Proof.
To show that the assertion is true we need to show that the pair is an admissible reward pair. It is clear that the uniform
-bounds on φ and c in Definition 2.2.(i) hold by Hypothesis Disc.-A. In particular, we note that by Jensen's inequality we get that
The decreasing importance property stated in Definition 2.2.(iii) follows similarly by noting that for
with
we have, by Hypothesis Disc.-A, that
which tends to 0 as
.
Concerning the continuity properties listed in Definition 2.2.(ii) we note that for each and
we have that
Now,
and similarly
This implies that
and the Borel-Cantelli lemma gives that
-a.s., as
for all
.
By Lemma 5.6 and uniform convergence it follows from Lemma 2.4.(d) that . The desired result now follows by Lemma 2.4.(a) while noting that by the construction of ℓ in Assumption 5.1.(v), a simplified version of Lemma 5.6 (without having to consider maximization over u) applied to each of the
gives that there is an
such that
,
-a.s. (with an exception set that is independent of b).
Remark 5.4
In a perfect information setting, i.e. when , we note that
can be taken to be any upper semi-continuous function in b that satisfies the remaining properties of polynomial growth and local Lipschitz continuity.
5.3. The random horizon setting
We turn instead to the reward
(30) where η is a
-stopping time and
. A notable convention applied in (30) is that the terminal reward disregards interventions made at the horizon. This is natural from an applications perspective, as it is generally too late to intervene at a default in a financial setting or at the failure of a unit in an engineering application.
In addition to the requirements listed in Assumption 5.1, we make the following assumptions:
Assumption 5.8
The terminal reward is Borel-measurable, satisfies the growth condition
and there is a non-decreasing continuous function
such that for all
, we have
whenever
. Moreover, if there is a sequence
in
such that
on some set
, then there is a
-null set
such that on
we have for every
that
(31)
We introduce the following hypothesis:
Hypothesis. [Disc.-B] The terminal reward satisfies the bound
Furthermore, for each
there is an
such that for all
for some
we have
for all
.
We have the following extension of Proposition 5.7.
Proposition 5.9
Under Hypotheses Disc.-A and Disc.-B there is a such that
for all
. Furthermore,
is given by the recursion (8)–(9), with
and
If, in addition η is an
-stopping time, then
for all
.
Proof.
We note that all details in the proof of Proposition 5.7 transfer immediately to this situation except for the quasi-left upper semi-continuity property in the definition of (Definition 2.1). We thus assume that there is a non-decreasing sequence
of stopping times such that
. When
,
-a.s. left-continuity at θ follows by Lemma 5.3 and the local Lipschitz property of ψ and when
,
-a.s. left-continuity at θ is immediate. We thus assume that
on some measurable set
.
Then, we have
where the first term on the right-hand side tends to zero,
-a.s. Concerning the second term we have
where
and ° denotes composition of functions. The first term on the right-hand side is
-a.s. non-positive by Assumption 5.8 and the last term tends to zero,
-a.s., by the local Lipschitz property of ψ and Lemma 5.3 in combination with Proposition 5.2, the polynomial growth condition on ψ and Hypothesis Disc.-B.
The last assertion follows by noting that since c>0 it will never be optimal to intervene at times greater than or equal to η.
We note the following distinction between the finite (deterministic) horizon and the random horizon settings:
Remark 5.5
In the case when for some
it follows from the proof of Proposition 5.9 that we can relax (31) to
To see that there is an actual distinction here consider the following example:
Example 5.10
We let be the trivial σ-algebra
and assume that
. We take
and set
. Then, with the rewards
,
, the intervention cost
and the discount
, we get
but there is no control that attains this value.
Disclosure statement
No potential conflict of interest was reported by the author(s).
Notes
1 Throughout, we let
2 We let (resp.
) denote minimum (resp. maximum), so that
.
3 Throughout, we generally suppress dependence on ω and refer to as a map
.
4 Requiring that p>2 is for notational convenience only and can easily be loosened to p>1.
5 Throughout, C will denote a generic positive constant that may change value from line to line.
6 By definition , which belongs to
References
- N. Agram and B. Øksendal, Stochastic control of memory mean-field processes, Appl. Math. Optim. 79(1) (2019), pp. 181–204.
- M. Basei, Optimal price management in retail energy markets: an impulse control problem with asymptotic estimates, Math. Meth. Oper. Res. 89(3) (2019), pp. 355–383.
- A. Bensoussan and J.L. Lions, Impulse Control and Quasivariational Inequalities, Gauthier-Villars, Montrouge, France, 1984.
- D.P. Bertsekas and S.E. Shreve, Stochastic Optimal Control: The Discrete-time Case, Academic Press, 1978.
- R. Carmona and M. Ludkovski, Pricing asset scheduling flexibility using optimal switching, Appl. Math. Finance 15(5-6) (2008), pp. 405–447.
- C. Dellacherie and P.-A. Meyer, Probabilités Et Potentiel, I-IV, Hermann, Paris, 1975.
- C. Dellacherie and P.-A. Meyer, Probabilités Et Potentiel, V-VIII, Hermann, Paris, 1980.
- B. Djehiche, S. Hamadène, and I. Hdhiri, Stochastic impulse control of non-markovian processes, Appl. Math. Optim. 61(1) (2010), pp. 1–26.
- B. Djehiche and S. Hamadène, On a finite horizon starting and stopping problem with risk of abandonment, Int. J. Theoret. Appl. Finance 12(04) (2009), pp. 523–543.
- B. Djehiche, S. Hamadène, I. Hdhiri, and H. Zaatra, Infinite horizon stochastic impulse control with delay and random coefficients, Math. Oper. Res. 47(1) (2022), pp. 665–689.
- B. Djehiche, S. Hamadène, and A. Popier, A finite horizon optimal multiple switching problem, SIAM J. Control Optim. 48(4) (2009), pp. 2751–2770.
- N. El Karoui, Les aspects probabilistes du contrôle stochastique, École d'Été de Saint-Flour IX-1979, Lecture Notes in Mathematics, Springer, Berlin, 1981.
- N. El Karoui, C. Kapoudjian, E. Pardoux, S. Peng, and M.C. Quenez, Reflected solutions of backward SDEs and related obstacle problems for PDEs, Ann. Probab. 25(2) (1997), pp. 702–737.
- S. Hamadène, Reflected BSDE's with discontinuous barrier and application, Stoch. Int. J. Probab. Stoch. Process. 74(3-4) (2002), pp. 571–596.
- S. Hamadène and J. Zhang, Switching problem and related system of reflected backward SDEs, Stoch. Process. Appl. 120(4) (2010), pp. 403–426.
- I. Hdhiri and M. Karouf, Optimal stochastic impulse control with random coefficients and execution delay, Stoch. Int. J. Probab. Stoch. Process. 90(2) (2018), pp. 151–164.
- J. Jönsson and M. Perninge, Finite horizon impulse control of stochastic functional differential equations, SIAM J. Control Optim. 61(2) (2023), pp. 924–948.
- N. El Karoui and X. Tan, Capacities, measurable selection and dynamic programming. Part I: Abstract framework, arXiv:1310.3363, 2013.
- R. Korn, Some applications of impulse control in mathematical finance, Math. Meth. Oper. Res. 50(3) (1999), pp. 493–518.
- R. Martyr, Finite-horizon optimal multiple switching with signed switching costs, Math. Oper. Res. 41(4) (2016), pp. 1432–1447.
- B. Øksendal and A. Sulem, Applied Stochastic Control of Jump Diffusions, Springer, 2007.
- B. Øksendal and A. Sulem, Optimal stochastic impulse control with delayed reaction, Appl. Math. Optim. 58(2) (2008), pp. 243–255.
- J. Palczewski and L. Stettner, Impulsive control of portfolios, Appl. Math. Optim. 56(1) (2007), pp. 67–103.
- M. Perninge, A finite horizon optimal switching problem with memory and application to controlled SDDEs, Math. Meth. Oper. Res. 91(3) (2020), pp. 465–500.
- P. Protter, Stochastic Integration and Differential Equations, 2nd ed. Springer, Berlin, 2004.
Appendices
Appendix 1. Quasi-left continuity
A càdlàg process is quasi-left continuous if for each predictable stopping time θ and every announcing sequence of stopping times
we have
,
-a.s. Similarly, X is quasi-left upper semi-continuous if
,
-a.s. A filtration is quasi-left continuous if
for every predictable stopping time θ.
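In symbols, for a predictable stopping time θ and an announcing sequence θ_n ↑ θ, the three notions above read:

```latex
X_{\theta_n} \longrightarrow X_{\theta}\ \ \mathbb{P}\text{-a.s.}
\quad \text{(quasi-left continuity of } X\text{)},
\qquad
\limsup_{n\to\infty} X_{\theta_n} \le X_{\theta}\ \ \mathbb{P}\text{-a.s.}
\quad \text{(quasi-left upper semi-continuity)},
\qquad
\mathcal{F}_{\theta-} = \mathcal{F}_{\theta}
\quad \text{(quasi-left continuity of the filtration)}.
```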
Appendix 2. The Snell envelope
In this section we gather some useful results concerning the Snell envelope. Recall that a progressively measurable process X is of class [D] if the set of random variables is uniformly integrable.
Theorem A.1
The Snell envelope
Let be an
-adapted,
-valued, càdlàg process of class
D
. Then there exists a unique
up to indistinguishability
,
-valued càdlàg process
called the Snell envelope of X, such that Z is the smallest supermartingale that dominates X. Moreover, the following holds (with
):
For any stopping time η,
(A1)
The Doob-Meyer decomposition of the supermartingale Z implies the existence of a triple
where
is a uniformly integrable right-continuous martingale,
is a non-decreasing, predictable, continuous process with
and
is non-decreasing purely discontinuous predictable with
, such that
(A2) Furthermore,
for all
.
Let
be given and assume that for any predictable
and any increasing sequence
with
and
,
-a.s., we have
,
-a.s. Then, the stopping time
defined by
(with the convention that
) is optimal after η, i.e.
Furthermore, in this setting the Snell envelope, Z, is quasi-left continuous, i.e.
.
Let
be a sequence of càdlàg processes converging increasingly and pointwise to the càdlàg process X and let
be the Snell envelope of
. Then the sequence
converges increasingly and pointwise to a process Z, and Z is the Snell envelope of X.
In the above theorem, (i)–(iii) are standard results and proofs can be found in, for example, [Citation12,Citation14]. A finite horizon version of statement (iv), which extends trivially to infinite horizon, was proved in [Citation11].
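For orientation, the characterization (A1) of the Snell envelope is commonly written as an essential supremum over stopping times dominating η:

```latex
Z_{\eta} \;=\; \operatorname*{ess\,sup}_{\tau \in \mathcal{T}_{\eta}}
\mathbb{E}\big[\, X_{\tau} \,\big|\, \mathcal{F}_{\eta} \big],
\qquad
\mathcal{T}_{\eta} := \{\tau \text{ stopping time}:\ \tau \ge \eta,\ \mathbb{P}\text{-a.s.}\}.
```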
Appendix 3. The section and projection theorems
In this section we recall two fundamental results from the general theory of stochastic processes, namely the measurable selection and the optional projection theorems.
We consider a complete filtered probability space , with
a right-continuous filtration. For any space E, we define the projection of a set
onto Ω as
.
Theorem A.2
Measurable projection
Let E be a locally compact Polish space. For every the set
is
-measurable.
A proof can be found in, e.g. [Citation18] (see the proof of Theorem 2.10) or [Citation6] Chapter III. In particular we need the following corollary result:
Corollary A.3
Let be a real valued, measurable function defined on the product space
. Then for all
, the function
(with the convention
) is
-measurable.
Proof.
For each we have
. Now, since h is measurable, the set
is in
and the result follows by the measurable projection theorem.
Theorem A.4
Measurable selection
Let be a Borel space with
. For every
there is a
-measurable r.v. β taking values in
(with ∂ a cemetery point) such that
This is a standard result and a proof can be found in [Citation18] (Theorem 2.20) (see also Chapter 7 in [Citation4], where several extensions are given). In particular, we need the following well-known corollary result:
Corollary A.5
Let be a measurable function defined on the product space
, such that for
-almost every ω the map
is upper semi-continuous. Then, with U a compact subset of
, there exists a
-measurable r.v. β such that
-a.s.
Proof.
Since (where now
) the function
is
-measurable. Furthermore, as h is
-measurable, the set
is in
. Now, by Theorem A.4 there is a
-measurable
-valued r.v. β such that
and
. As U is compact and
is u.s.c. on
with
, we have that
for all
and, hence,
.
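On a finite discretization of the compact set U, the content of Corollary A.5 becomes elementary: choosing, scenario by scenario, a maximizer over the grid is automatically a measurable selection. The sketch below, with a hypothetical reward `h` and a hypothetical grid, illustrates this:

```python
def argmax_selection(h, omegas, U_grid):
    """For each scenario w, select a maximizer beta(w) of u -> h(w, u)
    over a finite grid of the compact control set U. With finitely many
    candidate values, measurability of the selection is immediate."""
    return {w: max(U_grid, key=lambda u: h(w, u)) for w in omegas}

# hypothetical reward whose maximizer moves with the scenario w
h = lambda w, u: -(u - w) ** 2
beta = argmax_selection(h, omegas=[0.0, 0.25, 0.5],
                        U_grid=[i / 100 for i in range(101)])
```

Upper semi-continuity of u ↦ h(ω, u) is what guarantees, in the corollary, that the supremum over the full compact set is attained, so that refining the grid recovers it.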
The last result that we need is the optional projection theorem.
Theorem A.6
Optional projection
Assume that is a measurable process (not necessarily adapted to the filtration
) with
for all stopping times
. Then there exists a unique optional process
such that
for all stopping times
. If, furthermore, X is càdlàg then
is also càdlàg.
A proof of Theorem A.6 can be found in Chapter VI, p. 103 of [Citation7].
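In symbols, the defining property of the optional projection may be written as:

```latex
{}^{o}X_{\tau}\,\mathbf{1}_{\{\tau<\infty\}}
\;=\; \mathbb{E}\big[\, X_{\tau}\,\mathbf{1}_{\{\tau<\infty\}} \,\big|\, \mathcal{F}_{\tau} \big]
\qquad \mathbb{P}\text{-a.s., for every stopping time } \tau.
```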