HAL Id: hal-00743348
https://hal.archives-ouvertes.fr/hal-00743348v2
Preprint submitted on 24 Feb 2014
HAL
is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or
L’archive ouverte pluridisciplinaire
HAL, estdestinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires
BSDEs with weak terminal condition
Bruno Bouchard, Romuald Elie, Anthony Réveillac
To cite this version:
Bruno Bouchard, Romuald Elie, Anthony Réveillac. BSDEs with weak terminal condition. 2014.
�hal-00743348v2�
BSDEs with weak terminal condition
Bruno Bouchard
∗Romuald Elie
†Anthony R´eveillac
‡Universit´e Paris-Dauphine CEREMADE UMR CNRS 7534 Place du Mar´echal De Lattre De Tassigny
75775 Paris cedex 16 France February 24, 2014
Abstract
We introduce a new class of Backward Stochastic Differential Equations in which theT-terminal value YT of the solution (Y, Z) is not fixed as a random variable, but only satisfies a weak constraint of the form E[Ψ(YT)] ≥ m, for some (possibly ran- dom) non-decreasing map Ψ and some thresholdm. We name themBSDEs with weak terminal condition and obtain a representation of the minimal time t-values Yt such that (Y, Z) is a supersolution of the BSDE with weak terminal condition. It provides a non-Markovian BSDE formulation of the PDE characterization obtained for Marko- vian stochastic target problems under controlled loss in Bouchard, Elie and Touzi [2].
We then study the main properties of this minimal value. In particular, we analyze its continuity and convexity with respect to the m-parameter appearing in the weak terminal condition, and show how it can be related to a dual optimal control problem in Meyer form. These last properties generalize to a non Markovian framework previous results on quantile hedging and hedging under loss constraints obtained in F¨ollmer and Leukert [6, 7], and in Bouchard, Elie and Touzi [2].
Key words: Backward stochastic differential equations, optimal control, stochastic target.
MSC Classification (2000): Primary: 60H10; 93E20; Secondary: 49L20; 91G80
∗and CREST, [email protected]
1 Introduction
Solving a backward stochastic differential equation (hereafter BSDE), with terminal data ξ ∈ L
2(F
T) and driver g, consists in finding a pair of predictable processes (Y, Z), with certain integrability properties, such that the dynamics of Y satisfies dY
t= −g(t, Y
t, Z
t)dt + Z
tdW
tand Y
T= ξ (where W denotes a standard Brownian motion). It can be rephrased in:
find an initial data Y
0and a control process Z such that the solution Y
Zof the controlled stochastic differential equation
Y
tZ= Y
0−
Z t0
g(s, Y
sZ, Z
s)ds +
Z t0
Z
sdW
s, 0 ≤ t ≤ T , (1.1) satisfies Y
TZ= ξ. In cases where the previous problem does not admit a solution, a weaker formulation is to find an initial data Y
0and a control Z such that
Y
TZ≥ ξ
P− a.s. (1.2)
In most applications, one is interested in the minimal initial condition Y
0and in the as- sociated control Z. This is for instance the case in the financial literature in which Y
0represents the cost of the cheapest super-replication strategy for the contingent claim ξ, and Z provides the associated hedging strategy, see e.g. [5].
Motivated by situations where this minimal value Y
0is too large for practical applications, it was suggested to relax the strong constraint (1.2) into a weaker one of the form
E
ℓ(Y
TZ− ξ)
≥ m , (1.3)
where m is a given threshold and ℓ is a non-decreasing map. For ℓ(x) = 1
{x≥0}, this corresponds to matching the criteria Y
TZ≥ ξ at least with probability m. In financial terms, this is the so-called quantile hedging problem, see [6]
1. More generally, ℓ is viewed as a loss function, one typical example being ℓ(x) := −(x
−)
qwith q ≥ 1, see [7] for general non-Markovian but linear dynamics. Such problems were coined “stochastic target problems with controlled loss” by [2] who consider a non-linear Markovian formulation in a Brownian diffusion setting, see also [8] for the jump diffusions setting.
The aim of this paper is to study the non-linear non-Markovian setting in which the terminal constraint is of the form
E
Ψ(Y
TZ)
≥ m. (1.4)
In the above, m ∈
Rand Ψ is a (possibly random) non-decreasing real-valued map. Our problem can then be written as
Find the minimal Y
0such that (1.1) and (1.4) hold for some Z. (1.5) This leads to the introduction of a new class of BSDEs which we call BSDEs with weak terminal condition. More precisely, we refer to this problem by saying that we want to solve
1In fact, their original formulation also imposes a budget constraint constraintYTZ≥0P−a.s., which can be taken into account by imposing a criteria of the form (1.4) with Ψ(YTZ) :=1{YTZ−ξ≥0}− ∞1{YTZ<0}.
the BSDE with driver g and weak terminal condition (Ψ, m) to insist on the fact that the terminal condition Y
TZis not fixed as a random variable, but only has to satisfy the weak constraint (1.4).
The first step in our analysis lies in a reformulation based on the martingale representation theorem, as suggested in [2]. More precisely, if Y
0and Z are such that (1.4) holds, then the martingale representation Theorem implies that we can find an element α in the set A
0, of predictable square integrable processes, such that
Ψ(Y
TZ) ≥ M
Tα:= m +
Z T0
α
sdW
s.
On the other hand, since Ψ is non-decreasing, one can introduce its left-continuous inverse Φ and note that the solution (Y
α, Z
α) of the BSDE
Y
tα= Φ(M
Tα) +
Z Tt
g(s, Y
sα, Z
sα)ds −
Z Tt
Z
sαdW
s, 0 ≤ t ≤ T , (1.6) actually solves (1.1) and (1.4). We indeed show that the solution of (1.5) is given by
inf{Y
0α, α ∈ A
0}. (1.7)
This leads to study its dynamical counterpart
Y
τα:= essinf{Y
τα′, α
′∈ A
0s.t. α
′= α on [[0, τ ]]} , 0 ≤ τ ≤ T . (1.8) We verify that the family {Y
α, α ∈ A
0} satisfies a dynamic programming principle which can be seen as a counterpart of the geometric dynamic programming principle of [15] used in [2]. In particular, this implies that {Y
α, α ∈ A
0} is a g-submartingale family to which we can apply the non-linear Doob-Meyer decomposition of [11]. This provides a representation of the family {Y
α, α ∈ A
0} in terms of minimal supersolutions to a family of BSDEs with driver g and (strong) terminal conditions {Φ(M
Tα), α ∈ A
0}. This representation allows in particular to characterize the family {Y
α, α ∈ A
0} uniquely. Under additional convexity assumptions on the coefficients g and Φ, we observe that the essential infimum in (1.8) is attained. Hence, there exists an optimal ˆ α ∈ A
0such that solving the BSDE with weak terminal condition (Ψ, m) boils down to solving the BSDE with dynamics (1.6) and strong terminal condition Φ(M
Tαˆ). In a Markovian framework, our approach provides in particular a BSDE formulation for the PDEs derived in [2].
We then study in details important properties of this family and focus in particular on
the regularity of Y
αwith respect to the threshold parameter m. We exhibit, under weak
conditions, a stability property of the solution with respect to the variations of the param-
eter m. We also observe that Y
αis convex with respect to the threshold parameter. This
observation allows us in particular to conclude that Φ (whenever it is deterministic) can
be replaced by its more regular convex envelope in order to compute Y
αon [0, T ). This
was already observed in the restrictive Markovian setting of [2], in which it is proved by
using PDE technics. We provide here a pure probabilistic argument. Similarly, it was also
observed in [6], [7] and [2] that (1.5) admits a dual linear problem when g is linear. We
extend this result via probabilistic arguments to the semi-linear setting, for which the dual
formulation takes the form of a stochastic control problem in Meyer form.
The rest of the paper is organized as follows. In Section 2, we provide a precise formulation for (1.5) and relate this problem to a g-submartingale family satisfying a dynamic program- ming principle. Attainability of the optimal control ˆ α ∈ A
0is also discussed. Section 3 collects the continuity and convexity properties as well as the dual formulation of the prob- lem. Finally, Section 4 contains the proof of the BSDE representation for {Y
α, α ∈ A
0}.
We close this introduction with a series of notations that will be used all over this paper.
Let d ≥ 1 and T > 0 be fixed. We denote by W := (W
t)
t∈[0,T]a d-dimensional Brownian motion defined on a probability space (Ω, F ,
P) with
P-augmented natural filtration
F= (F
t)
t∈[0,T]. The components of W will be denoted by W = (W
1, · · · , W
d) and E will stand for the expectation with respect to
P. For simplicity, we assume thatF = F
T. Throughout the paper we will make use of the following spaces.
- L
p(U, G) denotes the set of p-integrable G-measurable random variables with values in U , p ≥ 0, U a Borel set of
Rnfor some n ≥ 1 and G ⊂ F . When U and G can be clearly identified by the context, we omit them. This will be in particular the case when G = F.
- T denotes the set of
F-stopping times in [0, T]. For τ
1∈ T , T
τ1is the set of stopping times τ
2in T such that τ
2≥ τ
1 P− a.s. The notation E
τ[·] stands for the conditional expectation given F
τ, τ ∈ T .
- S
2denotes the set of
R-valued,c` adl` ag
2and
F-adapted stochastic processesX = (X
t)
t∈[0,T]such that kXk
S2:= E[sup
t∈[0,T]|X
t|
2]
1/2< ∞.
- H
2denotes the set of
Rn-valued,
F-predictable stochastic processesX = (X
t)
t∈[0,T]such that kXk
H2:= E
hRT
0
|X
t|
2dt
i1/2< ∞. In the following, the dimension n will be given by the context.
- K
2denotes the set of non-decreasing
R-valued and F-adapted stochastic processesX = (X
t)
t∈[0,T]such that kXk
S2< ∞.
Inequalities between random variables are understood in the
P− a.s.-sense.
2 BSDE with weak terminal condition
2.1 Definitions and problem reformulation We first define the main object of this paper.
Definition 2.1 (Solution to a BSDE with weak terminal condition). Given a measurable map Ψ :
R× Ω 7→ U , with U ⊂
R∪ {−∞}, τ ∈ T and µ ∈ L
0(U, F
τ), we say that
2right-continuous with left limits
(Y, Z) ∈ S
2× H
2is a supersolution of the BSDE with generator g : Ω × [0, T ] ×
R×
Rd→
Rand weak terminal condition (Ψ, µ, τ ), in short BSDE(g, Ψ, µ, τ ), if for any 0 ≤ s ≤ t ≤ T,
Y
s≥ Y
t+
Z ts
g(r, Y
r, Z
r)dr −
Z ts
Z
rdW
r, and (2.1)
E
τ[Ψ(Y
T)] ≥ µ. (2.2)
Before discussing the well-posedness of Equation (2.1)-(2.2), let us emphasize that the difference with classical BSDEs lies in the fact that we do not prescribe a terminal condition to Y in the classical
P− a.s.-sense but only impose a weak condition in expectation form (which justifies the terminology of BSDE with weak terminal condition). Even if we were asking for equalities in (2.1)-(2.2), this would obviously be too weak to expect uniqueness, as any random variable ξ satisfying E
τ[Ψ(ξ)] = µ could serve as a terminal condition.
However, when Ψ is non-decreasing, the set
Γ(τ, µ) := {Y
τ: (Y, Z) ∈ S
2× H
2is a supersolution of BSDE(g, Ψ, µ, τ )} , (2.3) defined for any τ ∈ T and µ ∈ L
0(U, F
τ), can be characterized by its lower-bound, when- ever it is achieved.
Throughout the paper, we shall restrict to the case where g is Lipschitz continuous with linear growth, Ψ
+is bounded, and the domain of Ψ is bounded from below, in order to avoid un-necessary technicalities.
Standing Assumption (H
Ψ): For
P− a.e. ω ∈ Ω, the map y ∈
R7→ Ψ(ω, y) is non- decreasing, right-continuous, valued in [0, 1] ∪ {−∞}, and its left-continuous inverse Φ(ω, ·) satisfies Φ : Ω × [0, 1] 7→ [0, 1] is measurable.
By left-continuous inverse we mean the left-continuous map defined for ω fixed by Φ(ω, x) := inf {y ∈
R,Ψ(ω, y) ≥ x},
which satisfies
Φ ◦ Ψ ≤ Id ≤ Ψ ◦ Φ. (2.4)
The left-hand side follows from the definition of Φ, the right-hand side holds by right- continuity of Ψ. Note that the above assumption implies Ψ(ω, ·) = −∞ on (−∞, 0) and Ψ(ω, ·) = 1 on [1, ∞). In particular, the constraint in expectation (2.2) implies Y
T≥ 0
P− a.s. Obviously the set [0, 1] is chosen for ease of notations and can be replaced by any closed interval.
Standing Assumption (H
g) g is a measurable map from Ω × [0, T ] ×
R×
Rdto
Rand g(·, y, z) is
F-predictable, for each (y, z)∈
R×
Rd. There exists a constant K
g> 0 and a random variable χ
g∈ L
2(R
+), such that
|g(t, 0, 0)| ≤ χ
g P− a.s.
|g(t, y
1, z
1) − g(t, y
2, z
2)| ≤ K
g(|y
1− y
2| + |z
1− z
2|)
P− a.s.
∀(t, y
i, z
i) ∈ [0, T ] ×
R×
Rd, i = 1, 2.
Let A
τ,µdenote the set elements α ∈ H
2such that M
(τ,µ),α:= µ +
Z τ∨·
τ
α
sdW
stakes values in [0, 1]. (2.5) Then, (2.2) is equivalent to Ψ(Y
T) ≥ M
T(τ,µ),αfor some α ∈ A
τ,µ. In view of (2.4), this is equivalent to Y
T≥ Φ(M
T(τ,µ),α) for some α ∈ A
τ,µ. This implies that supersolutions of BSDE(g, Ψ, µ, τ ) can be characterized in terms of g-expectations whose definition is recalled below.
Definition 2.2 (g-expectation). Given τ
2∈ T and ξ ∈ L
2(R, F
τ2), let (Y, Z) ∈ S
2× H
2denote the solution of
Y = ξ +
Z τ2·∧τ2
g(s, Y
s, Z
s)ds −
Z τ2·∧τ2
Z
sdW
s.
Then, we define the (conditional) g-expectation of ξ at the stopping time τ
1≤ τ
2as E
τg1,τ2[ξ] := Y
τ1. When τ
2≡ T , we only write E
τg1[ξ], and say that (Y, Z) solves BSDE(g, ξ).
Note that existence and uniqueness hold under Assumption (H
g). In the following, we shall adopt the terminology of Peng [12] and call g-martingale (resp. g-submartingale) a process Y such that E
t,sg[Y
s] = Y
t(resp. E
t,sg[Y
s] ≥ Y
t), for all t ≤ s ≤ T .
Proposition 2.1. Fix τ ∈ T , µ ∈ L
0([0, 1], F
τ). Then, (Y, Z) ∈ S
2×H
2is a supersolution of BSDE(g, Ψ, µ, τ ) if and only if (Y, Z) satisfies (2.1) and there exists α ∈ A
τ,µsuch that Y
t≥ E
tg[Φ(M
T(τ,µ),α)] for t ∈ [0, T ]
P− a.s.
Proof. Let (Y, Z) be a super solution of BSDE(g, Ψ, µ, τ ). Then there exists some element ρ in L
0([0, 1], F
τ) with ρ ≥ µ,
P− a.s. and ˜ α in A
τ,ρsuch that Ψ(Y
T) = M
T(τ,ρ),˜α. Set θ
α˜:= inf {s ≥ τ, M
s(τ,µ),˜α= 0}. It is clear that θ
α˜belongs to T and that α := ˜ α1
[0,θα˜)belongs to A
τ,µand satisfies M
T(τ,ρ),˜α≥ M
T(τ,µ),α,
P− a.s., since M
T(τ,ρ),˜α≥ 0 by definition of A
τ,ρ. The monotonicity of Φ and (2.4) imply that
Y
T≥ (Φ ◦ Ψ)(Y
T) ≥ Φ(M
T(τ,µ),α).
By comparison for Lipschitz BSDEs, we obtain Y
t≥ E
tg[Φ(M
T(τ,µ),α)] for t ∈ [0, T ]. Conver- sly, let α ∈ A
τ,µbe such that Y
t≥ E
tg[Φ(M
T(τ,µ),α)] for t ∈ [0, T ] and assume that (Y, Z) satisfies (2.1). Then, (2.4) implies
Ψ(Y
T) ≥ (Ψ ◦ Φ)(M
T(τ,µ),α) ≥ M
T(τ,µ),α.
Taking the conditional expectation on both sides leads to (2.2).
✷In view of Proposition 2.1, the lower bound of Γ(τ, µ) (which we recall, has been defined in (2.3)) can be expressed in terms of
Y
τ(µ) := ess inf
α∈Aτ,µ
E
τghΦ(M
T(τ,µ),α)
i, τ ∈ T , µ ∈ L
0([0, 1], F
τ). (2.6)
This is the statement of the next corollary.
Corollary 2.1. essinf Γ(τ, µ) = Y
τ(µ), ∀ τ ∈ T , µ ∈ L
0([0, 1], F
τ).
Proof. The fact that Y
τ∈ Γ(τ, µ) implies Y
τ≥ Y
τ(µ) follows from Proposition 2.1. On the other hand, the same proposition implies that each E
τg[Φ(M
T(τ,µ),α)] with α ∈ A
τ,µbelongs
to Γ(τ, µ).
✷Remark 2.1. For later use, note that the assumptions (H
g) and (H
Ψ) ensure that we can find η ∈ S
2such that |E
tg[Φ(M)]| ∨ |Y
t(µ)| ≤ η
t, for all t ≤ T and µ ∈ L
0([0, 1], F
t), M ∈ L
0([0, 1]). See (i) of Proposition 5.2 in the Appendix.
Remark 2.2. Note that Y
τ(µ) = Y
τ(µ
1)1
A+ Y
τ(µ
2)1
Acwhenever µ := µ
11
A+ µ
21
Acfor A ∈ F
τ, µ
1, µ
2∈ L
0([0, 1], F
τ), and τ ∈ T . Indeed, α := 1
[τ,T](α
11
A+ α
21
Ac) ∈ A
τ,µfor all α
i∈ A
τ,µiwith i = 1, 2. Since E
τghΦ(M
T(τ,µ),α)
i= E
τghΦ(M
T(τ,µ1),α1)
i1
A+ E
τghΦ(M
T(τ,µ2),α2)
i1
Ac, this implies Y
τ(µ) ≤ Y
τ(µ
1)1
A+Y
τ(µ
2)1
Ac. The converse inequality follows from the previous identity applied with α
1:= α1
Aand α
2:= α1
Acfor any α ∈ A
τ,µso that α
i∈ A
τ,µifor i = 1, 2.
Remark 2.3. Before going on with the study of the set Γ, let us notice that a similar analysis can be carried out for weak constraints of the form E
τh[Ψ(Y
T)] ≥ µ in place of E
τ[Ψ(Y
T)] ≥ µ in (2.2), with E
hdefined as the h-expectation associated to some random map h satisfying similar conditions as g. In finance, the latter condition interprets as a risk-measure constraints, see e.g. [12], while our condition is more related to expected loss constraints, see [7]. Again, we try to avoid un-necessary additional technicalities and stick to the case h ≡ 0.
2.2 BSDE characterization of the minimal initial condition
The main result of this section is a BSDE characterization for the lower bound of the set Γ(τ, µ) of time-τ initial conditions of supersolutions of BSDE(g, Ψ, µ, τ ). In particular, this extends to a non Markovian framework the PDE characterization of [2].
For ease of notations, we now fix m
o∈ [0, 1] and set
(M
tα:= M
t(0,mo),α, A
ατ:= {α
′∈ A
τ,Mτα: α
′= α dt × d
Pon [[0, τ ]]}, A
0:= A
0,moand Y
tα:= Y
t(M
tα) for α ∈ A
0, t ∈ [0, T ],
where we recall that M
(0,mo),αand A
0,moare given in (2.5).
Theorem 2.1. For any α ∈ A
0, Y
αis a g-submartingale, it is l` adl` ag
3on countable sets, and the following dynamic programming principle holds:
(i) Y
τα1= ess inf
¯
α∈Aατ1
E
τg1,τ2[Y
τα¯2], for each τ
1∈ T , τ
2∈ T
τ1. Under the additional assumption that
m ∈ [0, 1] 7→ Φ(ω, m) is continuous for
P-a.e.ω ∈ Ω, (2.7) the following holds:
3left and right-limited according to the french celebrated acronym
(ii) Y
αis indistinguishable from a c` adl` ag g-submartingale, for each α ∈ A
0. (iii) There exists a family (Z
α, K
α)
α∈A0⊂ H
2× K
2satisfying
sup
α∈A0
k(Y
α, Z
α, K
α)k
S2×H2×K2< ∞ , (2.8) and such that, for all α ∈ A
0, we have
Y
α= Φ(M
Tα) +
Z T·
g(s, Y
sα, Z
sα)ds −
Z T·
Z
sαdW
s+ K
α− K
Tα, (2.9) K
ατ1= ess inf
¯
α∈Aατ1
E
K
ατ¯2|F
τ1
, ∀ τ
1∈ T , τ
2∈ T
τ1, (2.10) and
(Y
α, Z
α, K
α)1
[[0,τ]]= (Y
α¯, Z
α¯, K
α¯)1
[[0,τ]], ∀ τ ∈ T , α ¯ ∈ A
ατ. (2.11) (iv) (Y
α, Z
α, K
α)
α∈A0is the unique family of S
2× H
2× K
2satisfying (2.8)-(2.9)-(2.10)-
(2.11) for all α ∈ A
0.
The proof of this theorem is reported in Section 4.
Remark 2.4. (i) The precise continuity assumption needed in the proof is : Φ(M
Tαn) converges in L
2to Φ(M
Tα) whenever kM
Tαn− M
Tαk
L2tends to 0, for any sequence (α
n)
n⊂ A
0. However, this condition implies that Φ is continuous, as soon as random variables with non-absolutely continuous law with respect to the Lebesgue measure might be considered (which is the case here).
(ii) We shall see in Proposition 3.3 below that Φ can be replaced by its m-convex envelope, under mild assumptions. In this case, the continuity assumption of the second part of Theorem 2.1 is not required anymore because the convex envelope of Φ is continuous, see Remark 3.1 below.
2.3 Representation as a BSDE with strong terminal condition
The previous section raises in particular one natural question: Does there exist an admis- sible control ˆ α on the whole time interval [0, T ] allowing to match all time t-values of the minimal solution of a BSDE with weak terminal condition? Rephrasing, we wonder about the existence of a control ˆ α in A
0such that
Y
tαˆ= E
tghΦ(M
Tαˆ)
i, 0 ≤ t ≤ T .
Hereby, solving the BSDE with weak terminal condition (Ψ, m
o, 0) boils down to solving the classical BSDE with the optimal strong terminal one Φ(M
Tαˆ): along the optimal path ˆ
α, the compensator K
αˆof the BSDE (2.9) must degenerate to 0.
Not surprisingly, the existence of an optimal control requires the addition of convexity
assumptions on the coefficients of the BSDE. We shall therefore assume that:
(H
conv) For all (λ, m
1, m
2, t, y
1, y
2, z
1, z
2) ∈ [0, 1]×[0, 1]
2×[0, T ]×R
2×[R
d]
2, the following holds
P− a.s.:
Φ(λm
1+ (1 − λ)m
2) ≤ λΦ(m
1) + (1 − λ)Φ(m
2)
g(t, λy
1+ (1 − λ)y
2, λz
1+ (1 − λ)z
2) ≤ λg(t, y
1, z
1) + (1 − λ)g(t, y
2, z
2)
Remark 2.5. We recall the following result which is based on standard comparison argu- ments, see e.g. [14, Proposition 7]: For any τ ∈ T , the map E
τg[Φ(·)] : L
0([0, 1]) → L
0is convex under Assumption (H
conv).
Proposition 2.2. Assume that Assumptions (H
conv) and (2.7) hold. Then, for any (τ, α) ∈ T × H
2, there exists α ˆ
τ,α∈ A
ατsuch that
Y
τα= E
τghΦ(M
Tαˆτ,α)
i= E
τ,τg ′h
Y
ταˆ′τ,αi
, ∀ τ
′∈ T
τ.
Remark 2.6. As detailed in Remark 3.2 below, the convexity assumption on the terminal map Φ can be avoided in some cases. In particular, if Φ is deterministic then it can be replaced by its convex envelope. Then, only the convexity assumption on g has to hold.
Proof. Lemma 4.1 below provides a sequence (α
n)
nvalued in A
ατsuch that Y
τα= lim
n→∞
↓ E
τgΦ(M
Tαn)
,
P− a.s. (2.12)
Since the sequence (M
Tαn)
nis bounded in [0, 1], we can find sequences of non-negative real numbers (λ
ni)
i≥nwith
Pi≥n
λ
ni= 1, such that only a finite number of λ
nido not vanish, for each n, and such that the sequence of convex combinations ( ˜ M
Tn)
ngiven by
M ˜
Tn:=
Xi≥n
λ
niM
Tαi(2.13)
converges
P− a.s. to some ˆ M
T∈ L
0([0, 1]). By dominated convergence, the convergence holds in L
2, in particular E
τ[ ˆ M
T] = M
τα, and the martingale representation Theorem implies that we can find ˆ α ∈ A
ατsuch that ˆ M
T= M
Tαˆ. Using the convexity of Φ and g, see Remark 2.5, we deduce that
Y ˜
τn:=
Xi≥n
λ
niE
τghΦ(M
Tαi)
i≥ E
τghΦ( ˜ M
Tn)
i.
By (2.12), ˜ Y
τn→ Y
ταP−a.s. On the other hand, the convergence ˜ M
Tn→ M
Tαˆin L
2combined with the boundedness and a.s. continuity of Φ implies that Φ( ˜ M
Tn) → Φ(M
Tαˆ) in L
2, after possibly passing to a subsequence. Therefore the convergence E
τghΦ( ˜ M
Tn)
i→ E
τgΦ(M
Tαˆ)
P− a.s. follows by Proposition 5.1 below. This gives Y
τα≥ E
τgΦ(M
Tαˆ)
, while the converse holds by definition of Y
τα.
It remains to show that Y
τα= E
τ,τg ′Y
ταˆ′, for τ
′∈ T
τ. To see this, first note that the above implies that Y
τα= E
τ,τg ′E
τg′[Φ(M
Tαˆ)]
≥ E
τ,τg ′Y
ταˆ′
by standard comparison arguments and the fact that E
τg′[Φ(M
Tαˆ)] ≥ Y
ταˆ′by definition. As above, we can find a sequence (ˆ α
n) ∈ A
ατˆ′such that E
τg′Φ(M
Tαˆn)
→ Y
ταˆ′ P− a.s. In view of Remark 2.1, the convergence holds in L
2and Proposition 5.1 below implies Y
τα≤ E
τ,τg ′h
E
τg′h
Φ(M
Tαˆn)
ii→ E
τ,τg ′h
Y
ταˆ′i
,
where we used the fact that ˆ α
n∈ A
ατˆ′⊂ A
ατto obtain the left hand-side.
✷3 Main properties of the minimal initial condition process
In this section, we emphasize remarkable properties of the map Y
t: µ ∈ L
0([0, 1], F
t) 7→
Y
t(µ), for t ∈ [0, T ). We first derive the continuity of this map under a weak continuity assumption on E
g[Φ(·)]. Then, we verify that this map (or more precisely its l.s.c. envelope) is convex, and discuss the propagation of the convexity property to the time boundary T −.
Finally, we retrieve, in this non-Markovian setting, a dual representation of the map Y
0, using solely probabilistic arguments.
3.1 Continuity
Our continuity result is stated in terms of the quantities Err
t(η) := esssup
R
t(M, M
′) : M, M
′∈ L
0([0, 1]) , E
t[|M − M
′|
2] ≤ η , defined for η ∈ L
0([0, 1]), in which
R
t(M, M
′) := |E
tg[Φ(M)] − E
tg[Φ(M
′)]|.
Observe that classical a priori estimates on BSDEs ensure that Err
t(η
n) → 0 as η
n→ 0
P− a.s. with (η
n)
n⊂ L
0([0, 1]), whenever Φ is a deterministic Lipschitz map, see e.g.
Proposition 5.1 below. This observation remains valid when Φ is simply continuous, via a classical convolution density argument for Lipschitz maps on bounded domains. The next result indicates that this property ensures the regularity of the map: µ 7→ Y
t(µ).
Proposition 3.1. Let t < T , µ
1, µ
2∈ L
0([0, 1], F
t). Then,
|Y
t(µ
1) − Y
t(µ
2)| ≤ Err
t(∆(µ
1, µ
2)) + Err
t(∆(µ
2, µ
1)), where
∆(µ
i, µ
j) := (1 − µ
iµ
j)1
{µi<µj}+ µ
i− µ
j1 − µ
j1
{µi>µj}, i, j = 1, 2.
Moreover,
|Y
t(µ
1) − Y
t(µ
2)|1
{µ1=0}≤ R
t(µ
2, 0) and
|Y
t(µ
1) − Y
t(µ
2)|1
{µ1=1}≤ esssup
R
t(1, M ) : M ∈ L
0([0, 1]) , E
t[|1 − M|
2] ≤ 1 − µ
2.
In particular, if Err
t(η
n) → 0
P− a.s. as η
n→ 0
P− a.s., for all (η
n)
n⊂ L
0([0, 1]), then
µ ∈ L
0((0, 1), F
t) 7→ Y
t(µ) is continuous for the sequential
P− a.s. convergence and the
strong L
2convergence.
Proof. Step 1. Fix µ
1, µ
2∈ L
0([0, 1], F
t). Given α
2∈ A
t,µ2, we define λ := 1 − µ
11 − µ
21
{µ2<µ1}+ µ
1µ
21
{µ1<µ2}+ 1
{µ1=µ2},
which is by construction valued in [0, 1]. Since M
(t,µ2),α2takes values in [0, 1], M
(t,µ1),λα2= µ
1− λµ
2+ λM
(t,µ2),α2∈ [µ
1− λµ
2, µ
1+ λ(1 − µ
2)] ⊂ [0, 1] . In particular, λα
2∈ A
t,µ1. Thus, (2.6) leads to
Y
t(µ
1) ≤ E
tg[Φ(M
T(t,µ2),α2)] + (E
tg[Φ(M
T(t,µ1),λα2)] − E
tg[Φ(M
T(t,µ2),α2)]) . (3.1) Besides,
M
T(t,µ1),λα2− M
T(t,µ2),α2= µ
1− λµ
2+ (λ − 1)M
T(t,µ2),α2so that, since M
T(t,µ2),α2belongs to [0, 1], we have
µ
1− 1 + λ(1 − µ
2) ≤ M
T(t,µ1),λα2− M
T(t,µ2),α2≤ µ
1− λµ
2. In addition,
µ
1− λµ
2= 0 , if µ
1< µ
2, and µ
1− 1 + λ(1 − µ
2) = 0 , if µ
1≥ µ
2. This directly leads to
E
t[|M
T(t,µ1),λα2− M
T(t,µ2),α2|] ≤ ∆(µ
1, µ
2) . Since these two processes belong to [0, 1], we get
E
t[|M
T(t,µ1),λα2− M
T(t,µ2),α2|
2] ≤ ∆(µ
1, µ
2).
Hence, the arbitrariness of α
2∈ A
t,µ2together with (2.6) and (3.1) provides Y
t(µ
1) ≤ Y
t(µ
2) + Err
t(∆(µ
1, µ
2)) .
Interchanging the roles of µ
1and µ
2leads to
Y
t(µ
2) ≤ Y
t(µ
1) + Err
t(∆(µ
2, µ
1)) .
Step 2. We next consider the case where
P[µ
1= 0] > 0. Without loss of generality, we can assume that µ
1≡ 0. Fix α ∈ A
t,µ2. Since A
t,µ1= {0}, M
T(t,µ2),α≥ 0 and Φ is non-decreasing, comparison implies that
Y
t(0) = E
tg[Φ(0)] ≤ E
tg[Φ(M
T(t,µ2),α)].
In particular, Y
t(0) = E
tg[Φ(0)] ≤ Y
t(µ
2) ≤ E
tg[Φ(M
T(t,µ2),0)] = E
tg(Φ(µ
2)).
Step 3. We now consider the case where
P[µ
1= 1] > 0. Again, we can assume that µ
1≡ 1 so that A
t,µ1= {0}. By comparison as above, one has
Y
t(1) = E
tg[Φ(1)] ≥ Y
t(µ
2).
On the other hand, since M
(t,µ2),αis a martingale taking values in [0, 1], we have E
t[|1 − M
T(t,µ2),α|
2] ≤ E
t[1 − M
T(t,µ2),α] = 1 − µ
2, α ∈ A
t,µ2,
from which the result follows.
✷3.2 Convexity
In [2] and [8], it is shown that the map m ∈ [0, 1] 7→ Y
0(m) is convex. This is done in a Markovian framework using PDE arguments. In this section, we provide a probabilistic proof of this result which hereby extends to our setting. The result is stated for the lower- semicontinuous envelope Y
t∗of Y
tdefined as
Y
t∗(µ) := lim
ε→0
essinf{Y
t(µ
′) : |µ
′− µ| ≤ ε, µ
′∈ L
0([0, 1], F
t)}, (3.2) for any t ∈ [0, T ]. We refer to Proposition 3.1, the discussion before it and to (ii) of Remark 2.4 for conditions ensuring that Y
∗= Y.
We first make precise the notion of convexity adapted to our non-Markovian setting. Fix a time t ∈ [0, T ].
Definition 3.1 (F
t-convexity).
(i) In the following, we say that a subset D ⊂ L
∞(R, F
t) is F
t-convex if λµ
1+ (1−λ)µ
2∈ D, for all µ
1, µ
2∈ D and λ ∈ L
0([0, 1], F
t).
(ii) Let D be an F
t-convex subset of L
∞(
R, F
t). A map J : D 7→ L
2(
R, F
t) is said to be F
t-convex if
Epi(J ) := {(µ, Y ) ∈ D × L
2(R, F
t) : Y ≥ J (µ)}
is F
t-convex.
(iii) Let Epi
c(Y
t) be the set of elements of the form
Pn≤N
λ
n(µ
n, Y
n) with (µ
n, Y
n, λ
n)
n≤N⊂ Epi(Y
t) × L
0([0, 1], F
t) such that
Pn≤N