HAL Id: hal-01246998
https://hal.archives-ouvertes.fr/hal-01246998
Submitted on 21 Dec 2015
Discrete-time probabilistic approximation of path-dependent stochastic control problems
Xiaolu Tan
To cite this version:
Xiaolu Tan. Discrete-time probabilistic approximation of path-dependent stochastic control problems.
Annals of Applied Probability, Institute of Mathematical Statistics (IMS), 2014, 10.1214/13-AAP963.
hal-01246998
Discrete-time probabilistic approximation of path-dependent stochastic control problems
Xiaolu TAN ∗ August 26, 2013
Abstract
We give a probabilistic interpretation of the Monte Carlo scheme proposed by Fahim, Touzi and Warin [Ann. Appl. Probab. 21(4): 1322-1364 (2011)] for fully nonlinear parabolic PDEs, and hence generalize it to the path-dependent (or non-Markovian) case of a general stochastic control problem. A general convergence result is obtained by a weak convergence method in the spirit of Kushner and Dupuis [19]. We also derive a rate of convergence using the invariance principle technique as in Dolinsky [7], which is better than the rate obtained by the viscosity solution method. Finally, by approximating the conditional expectations arising in the numerical scheme with a simulation-regression method, we obtain an implementable scheme.
Key words. Numerical scheme, path-dependent stochastic control, weak convergence, invariance principle.
MSC 2010. Primary 65K99, secondary 93E20, 93E25
1 Introduction
Stochastic optimal control theory is widely applied in economics, finance, physics and management problems. Since its development, numerical methods for stochastic control problems have also been extensively investigated. For the Markovian control problem, the value function can usually be characterized by the Hamilton-Jacobi-Bellman (HJB) equation, and many numerical methods are then obtained as numerical schemes for PDEs. In this context, a powerful tool to prove convergence is the monotone convergence of viscosity solutions method of Barles and Souganidis [1].
In the one-dimensional case, the explicit finite difference scheme can be easily constructed and implemented, and monotonicity is generally guaranteed under the Courant-Friedrichs-Lewy (CFL) condition. In the two-dimensional case, Bonnans, Ottenwaelter and Zidani [3] proposed a numerical algorithm to construct monotone explicit schemes. Debrabant and Jakobsen [6] gave a semi-Lagrangian scheme which is easily constructed to be monotone, but which needs a finite difference grid together with an interpolation method for the implementation. In general, these methods may be relatively efficient in low-dimensional cases, while in high-dimensional cases a Monte Carlo method is preferred whenever possible.

∗ Ceremade, University of Paris-Dauphine, Paris, xiaolu.tan@gmail.com. The author is grateful to Nizar Touzi, J. Frédéric Bonnans, Nicole El Karoui and two anonymous referees for helpful comments and suggestions.
As a generalization of the Feynman-Kac formula, the Backward Stochastic Differential Equation (BSDE) opens a way to Monte Carlo methods for optimal control problems (see e.g. Bouchard and Touzi [4], Zhang [28]). Generally speaking, BSDEs cover controlled diffusion problems in which only the drift part is controlled; they cannot handle general control problems where the volatility part is also controlled. This is one of the main motivations for the recent development of second order BSDEs (2BSDEs) by Cheridito, Soner, Touzi and Victoir [5] and Soner, Touzi and Zhang [22]. Motivated by the 2BSDE theory in [5], and also inspired by numerical schemes for BSDEs, Fahim, Touzi and Warin [11] proposed a probabilistic numerical scheme for fully nonlinear parabolic PDEs. In their scheme, one simulates a diffusion process, estimates the value function as well as its derivatives arising in the PDE by conditional expectations, and then computes the value function backward on the discrete time grid. The efficiency of this Monte Carlo scheme has been demonstrated on several numerical examples; we refer to Fahim, Touzi and Warin [11], Guyon and Henry-Labordère [15] and Tan [25] for implemented examples.
However, instead of probabilistic arguments, the convergence of this scheme is proved by the monotone convergence of viscosity solutions technique of Barles and Souganidis [1]. Moreover, their scheme can only be applied in the Markovian case, where the value function is characterized by a PDE.
The main contribution of this paper is to give a probabilistic interpretation of the Monte Carlo scheme of [11] for fully nonlinear PDEs, which permits us to generalize it to the non-Markovian case for a general stochastic optimal control problem. One motivation for the non-Markovian generalization comes from finance, namely the pricing of path-dependent exotic derivative options in the uncertain volatility model.
Our general convergence result is obtained by weak convergence techniques in the spirit of Kushner and Dupuis [19]. In contrast to [19], where the authors define their controlled Markov chain in a descriptive way, we give our controlled discrete-time semimartingale explicitly using cumulative distribution functions. Moreover, we introduce a canonical space for the control problem following El Karoui, Huu Nguyen and Jeanblanc [9], which permits us to explore the convergence conditions on the reward functions. We also provide a convergence rate using the invariance principle techniques of Sakhanenko [23] and Dolinsky [7]. Compared to the scheme in [7] in the context of $G$-expectation (see e.g. Peng [20] for $G$-expectation), our scheme is implementable using a simulation-regression method.
The rest of the paper is organized as follows. In Section 2, we introduce a general path-dependent stochastic control problem and propose a numerical scheme. We then give the assumptions on the diffusion coefficients and the reward functions, as well as the main convergence results, including the general convergence and a rate of convergence. In Section 3, we provide a probabilistic interpretation of the numerical scheme, by showing that the numerical solution is equivalent to the value function of a controlled discrete-time semimartingale problem. We complete the proofs of the convergence results in Section 4. Finally, in Section 5, we discuss some issues in the implementation of our numerical scheme, including a simulation-regression method.
Notations. We denote by $S_d$ the space of all $d\times d$ matrices, and by $S_d^+$ the space of all positive symmetric $d\times d$ matrices. Given a vector or a matrix $A$, $A^\top$ denotes its transpose. Given two $d\times d$ matrices $A$ and $B$, their product is defined by $A\cdot B := \mathrm{Tr}(AB^\top)$, and $|A| := \sqrt{A\cdot A}$. Let $\Omega_d := C([0,T],\mathbb{R}^d)$ be the space of all continuous paths on $[0,T]$, and denote $|\mathbf{x}| := \sup_{0\le t\le T}|\mathbf{x}_t|$ for every $\mathbf{x}\in\Omega_d$. Throughout the paper, $E$ is a fixed compact Polish space, and we denote
$$Q_T := [0,T]\times\Omega_d\times E.$$
Suppose that $(X_{t_k})_{0\le k\le n}$ is a process defined on the discrete time grid $(t_k)_{0\le k\le n}$ of $[0,T]$, with $t_k := kh$ and $h := T/n$; we usually write it as $(X_k)_{0\le k\le n}$, and denote by $\hat X$ its linear interpolation path on $[0,T]$. Throughout the paper, $C$ denotes a constant whose value may vary from line to line.
2 A numerical scheme for stochastic control problems
2.1 A path-dependent stochastic control problem
Let $(\Omega,\mathcal F,\mathbb P)$ be a complete probability space containing a $d$-dimensional standard Brownian motion $W$, and let $\mathbb F = (\mathcal F_t)_{0\le t\le T}$ be the natural Brownian filtration. Denote by $\Omega_d := C([0,T],\mathbb R^d)$ the space of all continuous paths on $[0,T]$. Suppose that $E$ is a compact Polish space with a complete metric $d_E$, and that $(\sigma,\mu)$ are bounded continuous functions defined on $Q_T := [0,T]\times\Omega_d\times E$, taking values in $S_d\times\mathbb R^d$. We fix a constant $x_0\in\mathbb R^d$ throughout the paper. Then, given an $\mathbb F$-progressively measurable $E$-valued process $\nu = (\nu_t)_{0\le t\le T}$, denote by $X^\nu$ the controlled diffusion process which is the strong solution to
$$X_t^\nu = x_0 + \int_0^t \mu(s, X_\cdot^\nu, \nu_s)\,ds + \int_0^t \sigma(s, X_\cdot^\nu, \nu_s)\,dW_s. \qquad (2.1)$$
To ensure the existence and uniqueness of the strong solution to equation (2.1), we suppose that for every progressively measurable process $(X,\nu)$, the processes $\mu(t,X_\cdot,\nu_t)$ and $\sigma(t,X_\cdot,\nu_t)$ are progressively measurable. In particular, $\mu$ and $\sigma$ may depend on the past trajectory of $X$. Further, we suppose that there are a constant $C$ and a modulus of continuity $\rho$ (an increasing function on $\mathbb R_+$ satisfying $\rho(0^+) = 0$) such that
$$\big|\mu(t_1,\mathbf x_1,u_1) - \mu(t_2,\mathbf x_2,u_2)\big| + \big|\sigma(t_1,\mathbf x_1,u_1) - \sigma(t_2,\mathbf x_2,u_2)\big| \le C\,|\mathbf x_1^{t_1} - \mathbf x_2^{t_2}| + \rho\big(|t_1-t_2| + d_E(u_1,u_2)\big), \qquad (2.2)$$
where for every $(t,\mathbf x)\in[0,T]\times\Omega_d$, we denote $\mathbf x^t_s := \mathbf x_s\,\mathbf 1_{[0,t]}(s) + \mathbf x_t\,\mathbf 1_{(t,T]}(s)$.
Let $\Phi : \mathbf x\in\Omega_d \to \mathbb R$ and $L : (t,\mathbf x,u)\in Q_T \to \mathbb R$ be continuous reward functions, and denote by $\mathcal U$ the collection of all $E$-valued $\mathbb F$-progressively measurable processes. The main purpose of this paper is to approximate numerically the optimization problem
$$V := \sup_{\nu\in\mathcal U} \mathbb E\Big[\int_0^T L(t, X_\cdot^\nu, \nu_t)\,dt + \Phi(X_\cdot^\nu)\Big]. \qquad (2.3)$$
Similarly to $\mu$ and $\sigma$, we suppose that for every progressively measurable process $(X,\nu)$, the process $t\mapsto L(t,X_\cdot,\nu_t)$ is progressively measurable. Moreover, to ensure that the expectation in (2.3) is well defined, we shall assume later that $L$ and $\Phi$ are of exponential growth in $\mathbf x$, and discuss their integrability in Proposition 2.5.
2.2 The numerical scheme
In preparation for the numerical scheme, we fix, throughout the paper, a progressively measurable function $\sigma_0 : [0,T]\times\Omega_d \to S_d$ such that
$$\big|\sigma_0(t_1,\mathbf x_1) - \sigma_0(t_2,\mathbf x_2)\big| \le C\,|\mathbf x_1^{t_1} - \mathbf x_2^{t_2}| + \rho(|t_1-t_2|), \qquad \forall\,(t_1,\mathbf x_1),(t_2,\mathbf x_2)\in[0,T]\times\Omega_d,$$
and such that, for some $\varepsilon_0 > 0$, $\sigma_0\sigma_0^\top(t,\mathbf x) \ge \varepsilon_0 I_d$ for every $(t,\mathbf x)\in[0,T]\times\Omega_d$. Denote
$$\sigma_0^{t,\mathbf x} := \sigma_0(t,\mathbf x), \quad a_0^{t,\mathbf x} := \sigma_0^{t,\mathbf x}(\sigma_0^{t,\mathbf x})^\top, \quad a_u^{t,\mathbf x} := \sigma\sigma^\top(t,\mathbf x,u) - a_0^{t,\mathbf x}, \quad b_u^{t,\mathbf x} := \mu(t,\mathbf x,u). \qquad (2.4)$$
Then we define a function $G$ on $[0,T]\times\Omega_d\times S_d\times\mathbb R^d$ by
$$G(t,\mathbf x,\gamma,p) := \sup_{u\in E}\Big(L(t,\mathbf x,u) + \frac12\, a_u^{t,\mathbf x}\cdot\gamma + b_u^{t,\mathbf x}\cdot p\Big), \qquad (2.5)$$
which is clearly convex in $(\gamma,p)$ as the supremum of a family of linear functions, and lower semicontinuous in $(t,\mathbf x)$ as the supremum of a family of continuous functions.
Let $n\in\mathbb N$, and denote the time discretization by $h := T/n$ and $t_k := kh$. Let $W$ be the standard $d$-dimensional Brownian motion on the complete probability space $(\Omega,\mathcal F,\mathbb P)$. For simplicity, we denote $W_k := W_{t_k}$, $\Delta W_k := W_k - W_{k-1}$, $\mathcal F_k^W := \sigma(W_0, W_1,\cdots,W_k)$ and $\mathbb E_k^W[\cdot] := \mathbb E[\,\cdot\,|\,\mathcal F_k^W]$. Then we define a process $X^0$ on the discrete grid $(t_k)_{0\le k\le n}$ by
$$X_0^0 := x_0, \qquad X_{k+1}^0 := X_k^0 + \sigma_0(t_k, \hat X^0_\cdot)\,\Delta W_{k+1}, \qquad (2.6)$$
where $\hat X^0$ denotes the linear interpolation process of $(X_k^0)_{0\le k\le n}$ on the interval $[0,T]$.
Then, for every time discretization $h$, our numerical scheme is given by
$$Y_k^h := \mathbb E_k^W[Y_{k+1}^h] + h\, G(t_k, \hat X^0_\cdot, \Gamma_k^h, Z_k^h), \qquad (2.7)$$
with terminal condition
$$Y_n^h := \Phi(\hat X^0_\cdot), \qquad (2.8)$$
where $G$ is defined by (2.5) and
$$\Gamma_k^h := \mathbb E_k^W\Big[Y_{k+1}^h\,(\sigma_{0,k}^\top)^{-1}\,\frac{\Delta W_{k+1}\Delta W_{k+1}^\top - hI_d}{h^2}\,\sigma_{0,k}^{-1}\Big], \qquad Z_k^h := \mathbb E_k^W\Big[Y_{k+1}^h\,(\sigma_{0,k}^\top)^{-1}\,\frac{\Delta W_{k+1}}{h}\Big],$$
with $\sigma_{0,k} := \sigma_0^{t_k,\hat X^0} = \sigma_0(t_k,\hat X^0_\cdot)$.
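For concreteness, the backward induction (2.7)-(2.8) can be sketched numerically. The following is a minimal illustration and not the implementation of the paper: it takes a one-dimensional Markovian uncertain-volatility example in the setting of Remark 2.2, with $\mu = 0$, $L = 0$ and $E$ a range of variances $[a_{\mathrm{lo}}, a_{\mathrm{hi}}]$ chosen to satisfy Assumption 1, and it approximates the conditional expectations $\mathbb E_k^W[\cdot]$ by polynomial least-squares regression, in the spirit of the simulation-regression method discussed in Section 5. All parameter values (volatility bounds, payoff, grid sizes) are illustrative assumptions.

```python
import numpy as np

def bsb_scheme(n_steps=20, n_paths=20000, T=1.0, x0=1.0,
               sigma0=0.2, a_lo=0.04, a_hi=0.09,
               payoff=lambda x: np.maximum(x - 1.0, 0.0), seed=0):
    """Sketch of scheme (2.7)-(2.8), d = 1, mu = 0, L = 0, E = [a_lo, a_hi].

    a_lo >= sigma0**2 and a_hi <= 3 * sigma0**2 keep Assumption 1 satisfied
    (a_u >= 0 and a_u / a0 <= 2)."""
    rng = np.random.default_rng(seed)
    h = T / n_steps
    a0 = sigma0 ** 2
    # forward simulation of X^0 from (2.6): arithmetic Brownian motion steps
    dW = rng.normal(scale=np.sqrt(h), size=(n_paths, n_steps))
    X = x0 + np.concatenate(
        [np.zeros((n_paths, 1)), np.cumsum(sigma0 * dW, axis=1)], axis=1)
    Y = payoff(X[:, -1])                      # terminal condition (2.8)
    for k in range(n_steps - 1, -1, -1):
        basis = np.vander(X[:, k], 4)         # cubic polynomial regression basis
        coef, *_ = np.linalg.lstsq(basis, Y, rcond=None)
        EY = basis @ coef                     # regression proxy of E_k^W[Y_{k+1}^h]
        # Gamma_k^h weight from (2.7) in d = 1: (dW^2 - h) / (h^2 * sigma0^2)
        w = (dW[:, k] ** 2 - h) / (h ** 2 * a0)
        coef_g, *_ = np.linalg.lstsq(basis, Y * w, rcond=None)
        Gam = basis @ coef_g
        # G(gamma) = sup_{a in [a_lo, a_hi]} (1/2)(a - a0) * gamma, linear in a,
        # so the supremum is attained at an endpoint
        G = np.maximum(0.5 * (a_lo - a0) * Gam, 0.5 * (a_hi - a0) * Gam)
        Y = EY + h * G                        # backward step (2.7)
    return float(Y.mean())                    # Y_0^h (X_0^0 is deterministic)
```

With these illustrative parameters the output approximates the superhedging price of an at-the-money call under uncertain volatility $\sigma\in[0.2,0.3]$, which lies between the two constant-volatility prices; the regression basis and path count control the accuracy.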
Remark 2.1. By its definition, $Y_k^h$ is a measurable function of $(X_0^0,\cdots,X_k^0)$. We shall show in Proposition 2.5 that the function $Y_k^h(x_0,\cdots,x_k)$ is of exponential growth in $\max_{0\le i\le k}|x_i|$ under appropriate conditions, so that the conditional expectations in (2.7) are well defined, and hence the scheme (2.7) itself is well defined.
Remark 2.2. In the Markovian case, when the function $\Phi(\cdot)$ (resp. $L(t,\cdot,u)$, $\mu(t,\cdot,u)$ and $\sigma(t,\cdot,u)$) depends only on $X_T$ (resp. $X_t$), the function $G(t,\cdot,\gamma,z)$ depends only on $X_t$, and the value function of the optimization problem (2.3) can be characterized as the viscosity solution of the nonlinear PDE
$$-\partial_t v - \frac12\, a_0(t,x)\cdot D^2 v - G(t,x,D^2v,Dv) = 0.$$
In this case, the above scheme reduces to that proposed by Fahim, Touzi and Warin [11].
2.3 The convergence results of the scheme
Our main idea for proving the convergence of the scheme (2.7)-(2.8) is to interpret it as an optimization problem on a system of controlled discrete-time semimartingales which converge weakly to the controlled diffusion processes. A reasonable assumption is therefore that $\Phi$ and $L$ are bounded continuous on $\Omega_d$ (i.e. $\Phi(\cdot), L(t,\cdot,u)\in C_b(\Omega_d)$), or that they belong to the completion of $C_b(\Omega_d)$ under an appropriate norm.
We shall suppose that $\Phi$ (resp. $L$) is continuous in $\mathbf x$ (resp. in $(t,\mathbf x,u)$), and that there are a constant $C$ and moduli of continuity $\rho_0$, $(\rho_N)_{N\ge1}$ such that for every $(t,\mathbf x,u)\in Q_T$, $(t_1,\mathbf x_1),(t_2,\mathbf x_2)\in[0,T]\times\Omega_d$ and $N\ge1$,
$$\begin{cases}
\ |\Phi(\mathbf x)| + |L(t,\mathbf x,u)| \le C\exp(C|\mathbf x|),\\
\ |L(t_1,\mathbf x,u) - L(t_2,\mathbf x,u)| \le \rho_0(|t_1-t_2|),\\
\ |\Phi_N(\mathbf x_1) - \Phi_N(\mathbf x_2)| + |L_N(t,\mathbf x_1,u) - L_N(t,\mathbf x_2,u)| \le \rho_N(|\mathbf x_1 - \mathbf x_2|),
\end{cases} \qquad (2.9)$$
where $\Phi_N := (-N)\vee(\Phi\wedge N)$ and $L_N := (-N)\vee(L\wedge N)$.
Denote
$$m_G := \min_{(t,\mathbf x,u)\in Q_T,\; w\in\mathbb R^d}\Big(\frac12\, w^\top a_u^{t,\mathbf x} w + b_u^{t,\mathbf x}\cdot w\Big), \qquad (2.10)$$
and
$$h_0 := \mathbf 1_{m_G = 0}\, T \;+\; \mathbf 1_{m_G < 0} \min_{(t,\mathbf x,u)\in Q_T}\Big(-m_G^{-1}\Big(1 - \frac12\, a_u^{t,\mathbf x}\cdot\big(a_0^{t,\mathbf x}\big)^{-1}\Big)\Big), \qquad (2.11)$$
where $a_0^{t,\mathbf x}$, $a_u^{t,\mathbf x}$ and $b_u^{t,\mathbf x}$ are defined in (2.4). Clearly, $m_G \le 0$.
Assumption 1. For every $(t,\mathbf x,u)\in Q_T$, we have $a_u^{t,\mathbf x} \ge 0$ and $1 - \frac12\, a_u^{t,\mathbf x}\cdot\big(a_0^{t,\mathbf x}\big)^{-1} \ge 0$. Further, the constants satisfy $m_G > -\infty$ and $h_0 > 0$.
Remark 2.3. Assumption 1 is almost equivalent to Assumption F of [11] in the context of the control problem, and it implies that $\mu$ and $\sigma$ are uniformly bounded, as assumed at the beginning of Section 2.1. In particular, it follows that when $m_G < 0$, we have
$$1 - \frac12\, a_u^{t,\mathbf x}\cdot\big(a_0^{t,\mathbf x}\big)^{-1} + h\,m_G \ge 0, \qquad \text{for every } (t,\mathbf x,u)\in Q_T \text{ and } h\le h_0.$$
Moreover, since $a_0$ is supposed to be nondegenerate, the assumption implies that $\sigma\sigma^\top(t,\mathbf x,u)$ is nondegenerate for all $(t,\mathbf x,u)\in Q_T$. This non-degeneracy condition may be inconvenient in practice (see e.g. Example 5.1); we provide further discussion and examples in Section 5.1.
Remark 2.4. When $a_u^{t,\mathbf x} \ge \varepsilon I_d$ uniformly for some $\varepsilon > 0$, we get immediately $m_G > -\infty$, since $b_u^{t,\mathbf x}$ is uniformly bounded. When $a_u^{t,\mathbf x}$ degenerates, $m_G > -\infty$ implies that $b_u^{t,\mathbf x}$ lies in the image of $a_u^{t,\mathbf x}$.
Proposition 2.5. Suppose that the reward functions $L$ and $\Phi$ satisfy (2.9); then the optimal value $V$ in (2.3) is finite. Suppose in addition that Assumption 1 holds true. Then for every fixed $n\in\mathbb N$ (with $h := T/n$) and every $0\le k\le n$, as a function of $(X_0,\cdots,X_k)$, $Y_k^h(x_0,\cdots,x_k)$ is of exponential growth in $\max_{0\le i\le k}|x_i|$. Hence $Y_k^h$ is integrable in (2.7), and the numerical scheme (2.7) is well defined.
The proof is postponed to Section 3.1, where it follows a technical lemma. The main results of the paper are the following two convergence theorems, whose proofs are given in Section 4.
Theorem 2.6. Suppose that $L$ and $\Phi$ satisfy (2.9) and that Assumption 1 holds true. Then
$$Y_0^h \to V \quad \text{as } h\to 0.$$
To derive a convergence rate, we suppose further that $E$ is a compact convex subset of $S_d^+\times\mathbb R^d$, and that for every $(t,\mathbf x,u) = (t,\mathbf x,a,b)\in Q_T$,
$$a > 0, \qquad \mu(t,\mathbf x,u) = \mu(t,\mathbf x,a,b) = b, \qquad \sigma(t,\mathbf x,u) = \sigma(t,\mathbf x,a,b) = a^{1/2}. \qquad (2.12)$$
Moreover, we suppose that $L(t,\mathbf x,u) = \ell(t,\mathbf x)\cdot u$ for some continuous function $\ell : [0,T]\times\Omega_d \to S_d\times\mathbb R^d$, and that there exists a constant $C > 0$ such that for every couple $(t_1,\mathbf x_1), (t_2,\mathbf x_2)\in Q_T$,
$$|\ell(t_1,\mathbf x_1) - \ell(t_2,\mathbf x_2)| + |\Phi(\mathbf x_1) - \Phi(\mathbf x_2)| \le C\big(|t_1-t_2| + |\mathbf x_1^{t_1} - \mathbf x_2^{t_2}| + |\mathbf x_1 - \mathbf x_2|\big)\exp\big(C(|\mathbf x_1| + |\mathbf x_2|)\big). \qquad (2.13)$$
Theorem 2.7. Suppose that $L$ and $\Phi$ satisfy conditions (2.9) and (2.13), that the set $E\subset S_d^+\times\mathbb R^d$ is compact and convex, that the functions $\mu$ and $\sigma$ satisfy (2.12), and that Assumption 1 holds true. Then for every $\varepsilon > 0$, there is a constant $C_\varepsilon$ such that
$$|Y_0^h - V| \le C_\varepsilon\, h^{1/8-\varepsilon}, \qquad \forall h\le h_0. \qquad (2.14)$$
If, in addition, $L$ and $\Phi$ are bounded, then there is a constant $C$ such that
$$|Y_0^h - V| \le C\, h^{1/8}, \qquad \forall h\le h_0. \qquad (2.15)$$
Remark 2.8. In the Markovian context of Remark 2.2, Fahim, Touzi and Warin [11] obtained a convergence rate of $h^{1/4}$ on one side and $h^{1/10}$ on the other, using Krylov's shaking coefficient method; their global convergence rate is thus $h^{1/10}$. We get a rate of $h^{1/8}$ in the path-dependent case under some additional constraints. When there is no control on the volatility part, the BSDE method in Bouchard and Touzi [4] and Zhang [28] gives a convergence rate of order $h^{1/2}$; our current technique cannot achieve this rate in the BSDE context.
Remark 2.9. When the covariance matrix $\sigma\sigma^\top$ is diagonally dominated, Kushner and Dupuis [19] gave a systematic way to construct a convergent finite difference scheme. However, the construction is no longer easy when the matrix is not diagonally dominated; see e.g. Bonnans, Ottenwaelter and Zidani [3]. Our scheme relaxes this constraint. Moreover, our scheme admits a natural Monte Carlo implementation, which may be more efficient in high-dimensional cases; see the numerical examples in Fahim, Touzi and Warin [11], Guyon and Henry-Labordère [15] and Tan [25].
3 A controlled discrete-time semimartingale interpretation
Before giving the proofs of the convergence theorems, we first provide a probabilistic interpretation of the scheme (2.7)-(2.8) in the spirit of Kushner and Dupuis [19]. Namely, we shall show that the numerical solution is equivalent to the value function of a controlled discrete-time semimartingale problem.
For finite difference schemes, the controlled Markov chain interpretation given by Kushner and Dupuis [19] is straightforward, as their construction of the Markov chain is descriptive. For our scheme, the probabilistic interpretation is less evident, since the state space is uncountable. Our main idea is to use the inverses of the cumulative distribution functions. This question did not arise in the Markovian context of [11], where the monotone convergence technique for viscosity solutions is used, the idea being to show that the terms $Z^h$ and $\Gamma^h$ defined below (2.8) are good approximations of the derivatives of the value function.
3.1 A technical lemma
Given a fixed $(t,\mathbf x,u)\in Q_T$, let us simplify further the notations in (2.4):
$$\sigma_0 := \sigma_0^{t,\mathbf x}, \qquad a_0 := a_0^{t,\mathbf x}, \qquad a_u := a_u^{t,\mathbf x}, \qquad b_u := b_u^{t,\mathbf x}. \qquad (3.1)$$
Denote
$$f_h(t,\mathbf x,u,x) := \frac{1}{(2\pi h)^{d/2}|\sigma_0|^{1/2}}\exp\Big(-\frac12 h^{-1} x^\top a_0^{-1} x\Big)\Big(1 - \frac12\, a_u\cdot a_0^{-1} + b_u\cdot a_0^{-1}x + \frac12 h^{-1}\, a_u\cdot a_0^{-1}xx^\top(a_0^\top)^{-1}\Big). \qquad (3.2)$$
It follows from (2.10) that for every $(t,\mathbf x,u)\in Q_T$ and $x\in\mathbb R^d$,
$$b_u\cdot a_0^{-1}x + \frac12 h^{-1}\, a_u\cdot a_0^{-1}xx^\top(a_0^\top)^{-1} = h\Big[\,b_u\cdot a_0^{-1}\frac{x}{h} + \frac12\, a_u\cdot a_0^{-1}\frac{x}{h}\frac{x^\top}{h}(a_0^\top)^{-1}\Big] \ge h\, m_G.$$
Then under Assumption 1, one can verify easily (see also Remark 2.3) that when $h\le h_0$, with $h_0$ given by (2.11), $x\mapsto f_h(t,\mathbf x,u,x)$ is a probability density function on $\mathbb R^d$, i.e.
$$f_h(t,\mathbf x,u,x) \ge 0\ \ \forall x\in\mathbb R^d, \qquad \text{and} \qquad \int_{\mathbb R^d} f_h(t,\mathbf x,u,x)\,dx = 1.$$
Lemma 3.1. Let $h\le h_0$ and let $R$ be a random vector with probability density $x\mapsto f_h(t,\mathbf x,u,x)$. Then for all functions $g : \mathbb R^d\to\mathbb R$ of exponential growth, we have
$$\mathbb E[g(R)] = \mathbb E\Big[g(\sigma_0 W_h)\Big(1 + h\, b_u\cdot(\sigma_0^\top)^{-1}\frac{W_h}{h} + \frac12 h\, a_u\cdot(\sigma_0^\top)^{-1}\frac{W_hW_h^\top - hI_d}{h^2}\,\sigma_0^{-1}\Big)\Big], \qquad (3.3)$$
where $W_h$ is a $d$-dimensional Gaussian random variable with distribution $N(0,hI_d)$. In particular, it follows that there exists a constant $C_1$, independent of $(h,t,\mathbf x,u)\in(0,h_0]\times Q_T$, such that
$$\mathbb E[R] = b_u h, \qquad \mathrm{Var}[R] = (a_u + a_0)h - b_ub_u^\top h^2 \qquad \text{and} \qquad \mathbb E[|R|^3] < C_1 h^{3/2}, \qquad (3.4)$$
where $\mathrm{Var}[R]$ denotes the covariance matrix of the random vector $R$. Moreover, for any $c\in\mathbb R^d$,
$$\mathbb E[e^{c\cdot R}] \le e^{C_2 h}(1 + C_2 h), \qquad (3.5)$$
where $C_2$ is independent of $(h,t,\mathbf x,u)$ and is defined by
$$C_2 := \sup_{(t,\mathbf x,u)\in Q_T}\Big(\frac12\, c^\top a_0 c + |b_u\cdot c| + \frac12\, c^\top a_u c\Big).$$
Proof. First, it is clear that $\frac{1}{(2\pi h)^{d/2}|\sigma_0|^{1/2}}\exp\big(-\frac12 h^{-1}x^\top a_0^{-1}x\big)$ is the density function of $\sigma_0 W_h$. Then by (3.2),
$$\mathbb E[g(R)] = \int_{\mathbb R^d} f_h(t,\mathbf x,u,x)\,g(x)\,dx = \int_{\mathbb R^d}\frac{\exp\big(-\frac12 h^{-1}x^\top a_0^{-1}x\big)}{(2\pi h)^{d/2}|\sigma_0|^{1/2}}\, g(x)\Big(1 - \frac12\, a_u\cdot a_0^{-1} + b_u\cdot a_0^{-1}x + \frac12 h^{-1}\, a_u\cdot a_0^{-1}xx^\top(a_0^\top)^{-1}\Big)dx$$
$$= \mathbb E\Big[g(\sigma_0 W_h)\Big(1 + h\, b_u\cdot(\sigma_0^\top)^{-1}\frac{W_h}{h} + \frac12 h\, a_u\cdot(\sigma_0^\top)^{-1}\frac{W_hW_h^\top - hI_d}{h^2}\,\sigma_0^{-1}\Big)\Big].$$
Hence (3.3) holds true.
In particular, taking $g(R) = R$ or $g(R) = RR^\top$, it follows by direct computation that the first two equalities of (3.4) hold true. Further, letting $g(x) = |x|^3$, we get from (3.3) that
$$\mathbb E[|R|^3] = h^{3/2}\,\mathbb E\Big[|\sigma_0 N|^3\Big(1 + \sqrt h\, b_u\cdot(\sigma_0^\top)^{-1}N + \frac12\, a_u\cdot(\sigma_0^\top)^{-1}\big(NN^\top - I\big)\sigma_0^{-1}\Big)\Big],$$
where $N$ is a Gaussian vector with distribution $N(0,I_d)$. Hence (3.4) holds true with
$$C_1 := \sup_{(t,\mathbf x,u)\in Q_T}\mathbb E\Big[|\sigma_0 N|^3\Big(1 + \sqrt{h_0}\, b_u\cdot(\sigma_0^\top)^{-1}N + \frac12\, a_u\cdot(\sigma_0^\top)^{-1}\big(NN^\top - I\big)\sigma_0^{-1}\Big)\Big],$$
which is clearly bounded and independent of $(h,t,\mathbf x,u)$.
Finally, to prove inequality (3.5), denote $N_h := N + \sqrt h\,\sigma_0^\top c$ for every $h\le h_0$. Then
$$\mathbb E[e^{c\cdot R}] = \mathbb E\Big[e^{c^\top\sigma_0 W_h}\Big(1 + h\, b_u\cdot(\sigma_0^\top)^{-1}\frac{W_h}{h} + \frac12 h\, a_u\cdot(\sigma_0^\top)^{-1}\frac{W_hW_h^\top - hI_d}{h^2}\,\sigma_0^{-1}\Big)\Big]$$
$$= \mathbb E\Big[e^{\sqrt h\, c^\top\sigma_0 N}\Big(1 + \sqrt h\, b_u\cdot(\sigma_0^\top)^{-1}N + \frac12\, a_u\cdot(\sigma_0^\top)^{-1}\big(NN^\top - I_d\big)\sigma_0^{-1}\Big)\Big]$$
$$= e^{\frac{c^\top a_0 c}{2}h}\,\mathbb E\Big[1 + \sqrt h\, b_u\cdot(\sigma_0^\top)^{-1}N_h + \frac12\, a_u\cdot(\sigma_0^\top)^{-1}\big(N_hN_h^\top - I_d\big)\sigma_0^{-1}\Big]$$
$$= e^{\frac{c^\top a_0 c}{2}h}\Big(1 + \Big(b_u\cdot c + \frac12\, c^\top a_u c\Big)h\Big) \le e^{C_2 h}(1 + C_2 h),$$
where $C_2 := \sup_{(t,\mathbf x,u)\in Q_T}\big(\frac12\, c^\top a_0 c + |b_u\cdot c| + \frac12\, c^\top a_u c\big)$ is bounded and independent of $(h,t,\mathbf x,u)$. □
Remark 3.2. Since the random vector $R$ does not degenerate to a Dirac mass, it follows from (3.4) that, under Assumption 1,
$$\sigma\sigma^\top(t,\mathbf x,u) > \mu\mu^\top(t,\mathbf x,u)\,h, \qquad \text{for every } (t,\mathbf x,u)\in Q_T,\ h\le h_0.$$
With this technical lemma, we can now give the
Proof of Proposition 2.5. For the first assertion, it is enough to prove that $\sup_{\nu\in\mathcal U}\mathbb E[\exp(C|X_\cdot^\nu|)]$ is bounded, by condition (2.9). Note that $\mu$ and $\sigma$ are uniformly bounded. When $d = 1$, $X^\nu$ is a continuous semimartingale whose finite variation part and quadratic variation are both bounded by a constant $R_T$ for every $\nu\in\mathcal U$. It follows by the Dambis-Dubins-Schwarz time change theorem that
$$\sup_{\nu\in\mathcal U}\mathbb E[\exp(C|X_\cdot^\nu|)] \le e^{CR_T}\,\mathbb E\Big[\exp\Big(C\sup_{0\le t\le R_T}|B_t|\Big)\Big] < \infty, \qquad (3.6)$$
where $B$ is a standard one-dimensional Brownian motion. When $d > 1$, it is enough to remark that for $X = (X^1,\cdots,X^d)$ we have $\exp(C|X_\cdot|) \le \exp\big(C(|X^1_\cdot| + \cdots + |X^d_\cdot|)\big)$; we then conclude the proof of the first assertion by applying the Cauchy-Schwarz inequality.
For the second assertion, we proceed by backward induction. Given $0\le k\le n-1$ and $x_0,\cdots,x_k\in\mathbb R^d$, we denote by $\hat x$ the linear interpolation path of $x(t_i) := x_i$, and denote also
$$L_k(x_0,\cdots,x_k,u) := L(t_k,\hat x_\cdot,u), \qquad \forall u\in E. \qquad (3.7)$$
For the terminal condition, it is clear that $Y_n^h(x_0,\cdots,x_n)$ is of exponential growth in $\max_{0\le i\le n}|x_i|$ by condition (2.9). Now, suppose that
$$\big|Y_{k+1}^h(x_0,\cdots,x_{k+1})\big| \le C_{k+1}\exp\Big(C_{k+1}\max_{0\le i\le k+1}|x_i|\Big).$$
Let $R_u$ be a random variable with distribution density $x\mapsto f_h(t_k,\hat x,u,x)$. Then it follows by (2.7) and Lemma 3.1 that
$$Y_k^h(x_0,\cdots,x_k) = \sup_{u\in E}\Big\{h L_k(x_0,\cdots,x_k,u) + \mathbb E\Big[Y_{k+1}^h\big(x_0,\cdots,x_k,\,x_k + \sigma_0 W_h\big)\Big(1 + h\, b_u^{t_k,\hat x}\cdot(\sigma_0^\top)^{-1}\frac{W_h}{h} + \frac12 h\, a_u^{t_k,\hat x}\cdot(\sigma_0^\top)^{-1}\frac{W_hW_h^\top - hI_d}{h^2}\,\sigma_0^{-1}\Big)\Big]\Big\}$$
$$= \sup_{u\in E}\Big\{h L_k(x_0,\cdots,x_k,u) + \mathbb E\big[Y_{k+1}^h\big(x_0,\cdots,x_k,\,x_k + R_u\big)\big]\Big\}. \qquad (3.8)$$
Therefore, by (2.9) and (3.5),
$$Y_k^h(x_0,\cdots,x_k) \le (C_{k+1} + Ch)\exp\Big((C_{k+1} + Ch)\max_{0\le i\le k}|x_i|\Big)\sup_{u\in E}\mathbb E\big[\exp\big(C_{k+1}|R_u|\big)\big]$$
$$\le e^{C_2 h}(1 + C_2 h)(C_{k+1} + Ch)\exp\Big((C_{k+1} + Ch)\max_{0\le i\le k}|x_i|\Big),$$
where $C$ is the constant given in (2.9), and $C_2$ is the constant from (3.5), which depends on $C_{k+1}$. We then conclude the proof. □
3.2 The probabilistic interpretation
In this section, we shall interpret the numerical scheme (2.7) as the value function of a controlled discrete-time semimartingale problem. In preparation, let us show how to construct random variables with density function $x\mapsto f_h(t,\mathbf x,u,x)$. Let $F : \mathbb R\to[0,1]$ be the cumulative distribution function of a one-dimensional random variable, and denote by $F^{-1} : [0,1]\to\mathbb R$ its generalized inverse. Then, given a random variable $U$ with uniform distribution $\mathcal U([0,1])$, $F^{-1}(U)$ is a random variable with distribution function $F$. In the multi-dimensional case, we can reduce the problem to the one-dimensional case, since $\mathbb R^d$ is isomorphic to $[0,1]$: there is a one-to-one mapping $\kappa : \mathbb R^d\to[0,1]$ such that $\kappa$ and $\kappa^{-1}$ are both Borel measurable (see e.g. Proposition 7.16 and Corollary 7.16.1 of Bertsekas and Shreve [2]).
Define
$$F_h(t,\mathbf x,u,x) := \int_{\kappa(y)\le x} f_h(t,\mathbf x,u,y)\,dy.$$
It is clear that $x\mapsto F_h(t,\mathbf x,u,x)$ is the distribution function of the random variable $\kappa(R)$, where $R$ is a random variable with density $x\mapsto f_h(t,\mathbf x,u,x)$. Denote by $F_h^{-1}(t,\mathbf x,u,x)$ the inverse of $x\mapsto F_h(t,\mathbf x,u,x)$, and set
$$H_h(t,\mathbf x,u,x) := \kappa^{-1}\big(F_h^{-1}(t,\mathbf x,u,x)\big). \qquad (3.9)$$
Then, given a random variable $U$ with uniform distribution on $[0,1]$, $F_h^{-1}(t,\mathbf x,u,U)$ has the same distribution as $\kappa(R)$, and $H_h(t,\mathbf x,u,U)$ has distribution density $x\mapsto f_h(t,\mathbf x,u,x)$.
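In dimension $d = 1$, where $\kappa$ can be taken as the identity, the construction of $H_h$ can be sketched numerically: tabulate $F_h$ on a grid by the trapezoidal rule, invert it by binary search, and push uniform draws through the inverse. This is an illustrative sketch with assumed parameter values, not the construction used in the proofs.

```python
import bisect
import math
import random

def make_sampler(h=0.01, a0=1.0, a_u=0.5, b_u=0.3, lo=-1.0, hi=1.0, n=4001):
    """Build H_h(.) of (3.9) in d = 1 for the density f_h of (3.2).

    Parameter values are illustrative assumptions satisfying Assumption 1."""
    def f(x):
        gauss = math.exp(-0.5 * x * x / (h * a0)) / math.sqrt(2 * math.pi * h * a0)
        return gauss * (1.0 - 0.5 * a_u / a0 + b_u * x / a0
                        + 0.5 * a_u * x * x / (h * a0 * a0))
    dx = (hi - lo) / (n - 1)
    xs = [lo + i * dx for i in range(n)]
    cdf = [0.0]
    for i in range(1, n):                     # cumulative trapezoidal rule for F_h
        cdf.append(cdf[-1] + 0.5 * (f(xs[i - 1]) + f(xs[i])) * dx)
    total = cdf[-1]
    cdf = [c / total for c in cdf]            # normalize away quadrature error
    def H(u):                                 # generalized inverse F_h^{-1}(u)
        i = bisect.bisect_left(cdf, u)
        return xs[min(i, n - 1)]
    return H
```

Pushing i.i.d. uniforms through `H` then yields draws whose sample mean approaches $b_u h$, consistent with (3.4).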
In particular, it follows that the expression (3.8) of the numerical solution of scheme (2.7) becomes
$$Y_k^h(x_0,\cdots,x_k) = \sup_{u\in E}\mathbb E\Big[h L_k(x_0,\cdots,x_k,u) + Y_{k+1}^h\big(x_0,\cdots,x_k,\,x_k + H_h(t_k,\hat x,u,U)\big)\Big], \qquad (3.10)$$
where $\hat x$ is the linear interpolation function of $(x_0,\cdots,x_k)$ on $[0,t_k]$.
Now we are ready to introduce a controlled discrete-time semimartingale system. Suppose that $U_1,\cdots,U_n$ are i.i.d. random variables with uniform distribution on $[0,1]$, defined on the probability space $(\Omega,\mathcal F,\mathbb P)$. Let $\mathcal A^h$ denote the collection of all strategies $\phi = (\phi_k)_{0\le k\le n-1}$, where $\phi_k$ is a universally measurable mapping from $(\mathbb R^d)^{k+1}$ to $E$. Given $\phi\in\mathcal A^h$, $X^{h,\phi}$ is defined by
$$X_0^{h,\phi} := x_0, \qquad X_{k+1}^{h,\phi} := X_k^{h,\phi} + H_h\big(t_k,\,\hat X^{h,\phi}_\cdot,\,\phi_k(X_0^{h,\phi},\cdots,X_k^{h,\phi}),\,U_{k+1}\big). \qquad (3.11)$$
𝑘+1) . (3.11) We then also define an optimization problem by
𝑉
0ℎ:= sup
𝜙∈𝒜ℎ
𝔼 [
𝑛−1∑
𝑘=0
ℎ𝐿 (
𝑡
𝑘, 𝑋 ˆ
ℎ,𝜙⋅, 𝜙
𝑘) )
+ Φ( 𝑋 ˆ
ℎ,𝜙⋅) ]
. (3.12)
The main result of this section shows that the numerical solution given by (2.7) is equivalent to the value function of the optimization problem (3.12) on the controlled discrete-time semimartingales $X^{h,\phi}$.
Remark 3.3. It is clear that in the discrete-time case, every process is a semimartingale. When $\mu\equiv 0$ and $U$ has uniform distribution on $[0,1]$, the random variable $H_h(t,\mathbf x,u,U)$ is centered, and hence $X^{h,\phi}$ is a controlled martingale. This is also the main reason we choose the terminology "semimartingale" in the section title.
Theorem 3.4. Suppose that $L$ and $\Phi$ satisfy (2.9) and that Assumption 1 holds true. Then for $0 < h\le h_0$, with $h_0$ defined by (2.11),
$$Y_0^h = V_0^h.$$
The above theorem is similar to a dynamic programming result: it states that optimizing the criterion globally in (3.12) is equivalent to optimizing it step by step in (3.10). With this interpretation, in order to prove Theorems 2.6 and 2.7 we only need to analyze the "distance" between the controlled semimartingale $X^{h,\phi}$ in (3.11) and the controlled diffusion process $X^\nu$ in (2.1), and to show the convergence of $V_0^h$ to $V$.
Before providing the proof, let us give a technical lemma.
Lemma 3.5. For the function $G$ defined by (2.5) and every $\varepsilon > 0$, there is a universally measurable mapping $u_\varepsilon : S_d\times\mathbb R^d\to E$ such that for all $(\gamma,p)\in S_d\times\mathbb R^d$,
$$G(t,\mathbf x,\gamma,p) \le L\big(t,\mathbf x,u_\varepsilon(\gamma,p)\big) + \frac12\, a^{t,\mathbf x}_{u_\varepsilon(\gamma,p)}\cdot\gamma + b^{t,\mathbf x}_{u_\varepsilon(\gamma,p)}\cdot p + \varepsilon.$$
Proof. This follows from the measurable selection theorem, see e.g. Theorem 7.50 of Bertsekas and Shreve [2] or Section 2 of El Karoui and Tan [10].
Proof of Theorem 3.4. First, following (3.10), we can write $Y_k^h$ as a measurable function of $(X_0^0,\cdots,X_k^0)$, with
$$Y_k^h(x_0,\cdots,x_k) = \sup_{u\in E}\mathbb E\Big[h L_k(x_0,\cdots,x_k,u) + Y_{k+1}^h\big(x_0,\cdots,x_k,\,x_k + H_h(t_k,\hat x,u,U_{k+1})\big)\Big],$$
where $\hat x$ is the linear interpolation function of $(x_0,\cdots,x_k)$ on $[0,t_k]$, $U_{k+1}$ has uniform distribution on $[0,1]$, and $L_k$ is defined by (3.7).
Next, for every control strategy $\phi\in\mathcal A^h$ and $X^{h,\phi}$ defined by (3.11), we denote $\mathcal F_k^{h,\phi} := \sigma(X_0^{h,\phi},\cdots,X_k^{h,\phi})$ and
$$V_k^{h,\phi} := \mathbb E\Big[\sum_{i=k}^{n-1} h\,L(t_i,\hat X^{h,\phi}_\cdot,\phi_i) + \Phi(\hat X^{h,\phi}_\cdot)\ \Big|\ \mathcal F_k^{h,\phi}\Big],$$
which is clearly a measurable function of $(X_0^{h,\phi},\cdots,X_k^{h,\phi})$ and satisfies
$$V_k^{h,\phi}(x_0,\cdots,x_k) = h\,L_k\big(x_0,\cdots,x_k,\phi_k(x_0,\cdots,x_k)\big) + \mathbb E\Big[V_{k+1}^{h,\phi}\big(x_0,\cdots,x_k,\,x_k + H_h(t_k,\hat x,\phi_k(x_0,\cdots,x_k),U_{k+1})\big)\Big].$$
Then, comparing $V^{h,\phi}$ with $Y^h$ and using the arbitrariness of $\phi\in\mathcal A^h$, it follows that $V_0^h \le Y_0^h$.
For the reverse inequality, it is enough to find, for any $\varepsilon > 0$, a strategy $\phi^\varepsilon\in\mathcal A^h$, with $X^{h,\varepsilon}$ defined as in (3.11) using $\phi^\varepsilon$, such that
$$Y_0^h \le \mathbb E\Big[\sum_{k=0}^{n-1} h\,L(t_k,\hat X^{h,\varepsilon}_\cdot,\phi^\varepsilon_k) + \Phi(\hat X^{h,\varepsilon}_\cdot)\Big] + n\varepsilon. \qquad (3.13)$$
Let us write $\Gamma_k^h$ and $Z_k^h$, defined below (2.7), as measurable functions of $(X_0^0,\cdots,X_k^0)$, let $u_\varepsilon$ be given by Lemma 3.5, and denote
$$\phi^\varepsilon_k(x_0,\cdots,x_k) := u_\varepsilon\big(\Gamma_k^h(x_0,\cdots,x_k),\,Z_k^h(x_0,\cdots,x_k)\big).$$
Then, by the tower property, the semimartingale $X^{h,\varepsilon}$ defined by (3.11) with $\phi^\varepsilon$ satisfies (3.13). □
4 Proofs of the convergence theorems
With the probabilistic interpretation of the numerical solution $Y^h$ in Theorem 3.4, we are ready to give the proofs of Theorems 2.6 and 2.7. Intuitively, we shall analyze the "convergence" of the controlled semimartingale $X^{h,\phi}$ in (3.11) to the controlled diffusion process $X^\nu$ in (2.1).
4.1 Proof of Theorem 2.6
The main tool to prove Theorem 2.6 is the weak convergence technique due to Kushner and Dupuis [19], whose idea we adapt to our context. We shall also introduce an enlarged canonical space for control problems, following El Karoui, Huu Nguyen and Jeanblanc [9], in order to explore the convergence conditions. We then study the weak convergence of probability measures on the enlarged canonical space.
4.1.1 An enlarged canonical space
In Dolinsky, Nutz and Soner [8], the authors studied a similar but simpler problem in the context of $G$-expectation, using the canonical space $\Omega_d := C([0,T],\mathbb R^d)$. We refer to Stroock and Varadhan [24] for a presentation of the basic properties of the canonical space $\Omega_d$. However, we shall use an enlarged canonical space introduced by El Karoui, Huu Nguyen and Jeanblanc [9], which is more convenient for studying the control problem for the purpose of numerical analysis.
An enlarged canonical space. Let $\mathbb M([0,T]\times E)$ denote the space of all finite positive measures $m$ on $[0,T]\times E$ such that $m([0,T]\times E) = T$, which is a Polish space under the weak convergence topology. Denote by $\mathbb M$ the collection of measures $m\in\mathbb M([0,T]\times E)$ whose projection on $[0,T]$ is the Lebesgue measure, so that they admit the disintegration $m(dt,du) = m(t,du)\,dt$, where $m(t,du)$ is a probability measure on $E$ for every $t\in[0,T]$, i.e.
$$\mathbb M := \Big\{m\in\mathbb M([0,T]\times E)\ :\ m(dt,du) = m(t,du)\,dt \ \text{ with } \int_E m(t,du) = 1\Big\}. \qquad (4.1)$$
In particular, $(m(t,du))_{0\le t\le T}$ is a measure-valued process. The measures in $\mathbb M$ are examples of Young measures and have been widely used in deterministic control problems; we refer to Young [27] and Valadier [26] for a presentation of Young measures and their applications.
Clearly, $\mathbb M$ is closed under the weak convergence topology and hence is also a Polish space. We define the $\sigma$-fields on $\mathbb M$ by $\mathcal M_t := \sigma\{m_s(\varphi),\ s\le t,\ \varphi\in C_b([0,T]\times E)\}$, where $m_s(\varphi) := \int_0^s\int_E \varphi(r,u)\,m(dr,du)$. Then $(\mathcal M_t)_{0\le t\le T}$ is a filtration, and $\mathcal M_T$ is the Borel $\sigma$-field of $\mathbb M$. As defined above, $\Omega_d := C([0,T],\mathbb R^d)$ is the space of all continuous paths on $[0,T]$, equipped with the canonical filtration $\mathbb F^d = (\mathcal F_t^d)_{0\le t\le T}$. We then define an enlarged canonical space by $\overline\Omega_d := \Omega_d\times\mathbb M$, together with the canonical process $X$ given by $X_t(\bar\omega) := \omega^d_t$ for all $\bar\omega = (\omega^d,m)\in\overline\Omega_d = \Omega_d\times\mathbb M$, and the canonical filtration $\overline{\mathbb F}^d = (\overline{\mathcal F}{}^d_t)_{0\le t\le T}$ with $\overline{\mathcal F}{}^d_t := \mathcal F_t^d\otimes\mathcal M_t$. Denote also by $\mathbb M(\overline\Omega_d)$ the collection of all probability measures on $\overline\Omega_d$.
Four classes of probability measures A controlled diffusion process, together with its control process, induces a probability measure on $\overline{\Omega}$. The optimization criterion in (2.3) can then be regarded as a random variable defined on $\overline{\Omega}$, so that the optimization problem (2.3) can be studied on $\overline{\Omega}$, as in the quasi-sure approach of Soner, Touzi and Zhang [21]. In the following, we introduce four subclasses of $\mathbf{M}(\overline{\Omega})$.

Let $\delta > 0$; we consider a particular strategy $\nu^\delta$ of the form $\nu^\delta_s = w_k\big(X^{\nu^\delta}_{r^k_i},\, i \le I_k\big)$ for every $s \in (k\delta, (k+1)\delta]$, where $I_k \in \mathbb{N}$, $0 \le r^k_0 < \cdots < r^k_{I_k} \le k\delta$, $X^{\nu^\delta}$ is the controlled process given by (2.1) with strategy $\nu^\delta$, and $w_k : \mathbb{R}^{dI_k} \to E$ is a continuous function. Clearly, $\nu^\delta$ is an adapted, piecewise constant strategy. Denote by $\mathcal{U}_0$ the collection of all strategies of this form for all $\delta > 0$; it is clear that $\mathcal{U}_0 \subset \mathcal{U}$.
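As an illustration of such piecewise constant feedback strategies, the following sketch simulates a controlled Euler scheme (a stand-in for (2.1)) in which the control is frozen on each block $(k\delta, (k+1)\delta]$ at a continuous function of a past observation of $X$; the drift $\mu$, the feedback map $w$ and the control set are hypothetical choices, not taken from the paper.

```python
import numpy as np

# Hedged sketch of a strategy in U_0: Euler scheme for
# dX_t = mu(X_t, nu_t) dt + dW_t (a stand-in for (2.1)), with the control
# frozen on each block (k*delta, (k+1)*delta] at u = w(X_{k*delta}),
# i.e. I_k = 0 and r_0^k = k*delta.  mu, w and E = (0, 1] are assumptions.
rng = np.random.default_rng(0)
T, delta = 1.0, 0.1
n_steps = 1000
dt = T / n_steps
steps_per_block = round(delta / dt)            # 100 Euler steps per block

mu = lambda x, u: -u * x                       # assumed drift
w = lambda x_obs: 1.0 / (1.0 + x_obs ** 2)     # continuous feedback, values in (0, 1]

X = np.zeros(n_steps + 1)
X[0] = 1.0
u = w(X[0])                                    # control on the first block
controls = []
for i in range(n_steps):
    if i > 0 and i % steps_per_block == 0:     # block boundary k*delta
        u = w(X[i])                            # adapted: uses only the past of X
    controls.append(u)
    X[i + 1] = X[i] + mu(X[i], u) * dt + np.sqrt(dt) * rng.standard_normal()

# The strategy is piecewise constant and E-valued.
assert all(0.0 < c <= 1.0 for c in controls)
```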
Given $\nu \in \mathcal{U}$ on the probability space $(\Omega, \mathcal{F}, \mathbb{P})$, denote $m^\nu(dt,du) := \delta_{\nu_t}(du)\,dt \in \mathbf{M}$. Then $(X^\nu, m^\nu)$ induces a probability measure $\mathbb{P}^\nu$ on $\overline{\Omega}$ by
$$\mathbb{E}^{\mathbb{P}^\nu}\big[\Upsilon(\omega^d, m)\big] \;:=\; \mathbb{E}^{\mathbb{P}}\big[\Upsilon(X^\nu_\cdot, m^\nu)\big], \tag{4.2}$$
for every bounded measurable function $\Upsilon$ defined on $\overline{\Omega}$. In particular, for any bounded function $f : \mathbb{R}^{dI+IJ} \to \mathbb{R}$ with arbitrary $I, J \in \mathbb{N}$, any $s_i \in [0,T]$ and bounded $\psi_j : [0,T]\times E \to \mathbb{R}$,
$$\mathbb{E}^{\mathbb{P}^\nu}\Big[f\big(X_{s_i},\, m_{s_i}(\psi_j),\; i \le I,\, j \le J\big)\Big] \;=\; \mathbb{E}^{\mathbb{P}}\Big[f\Big(X^\nu_{s_i},\, \int_0^{s_i}\!\psi_j(r, \nu_r)\,dr,\; i \le I,\, j \le J\Big)\Big].$$
Then the first and second subsets of $\mathbf{M}(\overline{\Omega})$ are given by
$$\mathcal{P}_{S_0} := \big\{\mathbb{P}^\nu \,:\, \nu \in \mathcal{U}_0\big\} \quad\text{and}\quad \mathcal{P}_S := \big\{\mathbb{P}^\nu \,:\, \nu \in \mathcal{U}\big\}.$$
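The identity following (4.2) rests on the fact that, for $m^\nu(dt,du) = \delta_{\nu_t}(du)\,dt$, the $du$-integral in $m_s(\psi)$ collapses to evaluation along the control path. This can be checked numerically; the control path, the test function $\psi$ and the finite control set below are illustrative assumptions.

```python
import numpy as np

# Illustrative check: for m^nu(dt, du) = delta_{nu_t}(du) dt on a finite
# control set E, the double integral m_s(psi) equals the pathwise integral
# int_0^s psi(r, nu_r) dr.  nu, psi and E are toy choices.
T, n = 1.0, 4000
dt = T / n
r = np.linspace(0.0, T, n, endpoint=False)
E = np.array([-1.0, 0.5, 1.0])
nu = E[(np.floor(10 * r) % 3).astype(int)]       # piecewise constant path in E
psi = lambda t, u: np.exp(-t) * u                # bounded test function

s = 0.6
mask = r < s
# Left side: the double integral, with the du-integral written out as a sum
# over E weighted by m^nu(r, {u}) = 1_{nu_r = u}.
lhs = sum(np.sum(psi(r[mask], u) * (nu[mask] == u)) for u in E) * dt
# Right side: the pathwise time integral of psi(r, nu_r).
rhs = np.sum(psi(r[mask], nu[mask])) * dt
assert abs(lhs - rhs) < 1e-9
```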
Now, let $0 < h \le h_0$ and $\phi \in \mathcal{A}_h$, denote $m^{h,\phi}(dt,du) := \sum_{k=0}^{n-1} \delta_{\phi_k}(du)\,\mathbf{1}_{(t_k,t_{k+1}]}(t)\,dt$, and let $\hat{X}^{h,\phi}$ be the discrete-time semimartingale defined by (3.11). It follows that $(\hat{X}^{h,\phi}, m^{h,\phi})$ induces a probability measure $\mathbb{P}^{h,\phi}$ on $\overline{\Omega}$ as in (4.2). The third subset of $\mathbf{M}(\overline{\Omega})$ we introduce is then
$$\mathcal{P}_h := \big\{\mathbb{P}^{h,\phi} \,:\, \phi \in \mathcal{A}_h\big\}.$$
Finally, for the fourth subset, we introduce a martingale problem on $\overline{\Omega}$. Let $\mathcal{L}^{t,\mathbf{x},u}$ be the functional operator defined by
$$\mathcal{L}^{t,\mathbf{x},u}\varphi \;:=\; \mu(t,\mathbf{x},u)\cdot D\varphi \;+\; \tfrac{1}{2}\,\sigma\sigma^\top(t,\mathbf{x},u)\cdot D^2\varphi. \tag{4.3}$$
Then, for every $\varphi \in C_b^\infty(\mathbb{R}^d)$, a process $M(\varphi)$ is defined on $\overline{\Omega}$ by
$$M_t(\varphi) \;:=\; \varphi(X_t) - \varphi(X_0) - \int_0^t\!\!\int_E \mathcal{L}^{s,X_\cdot,u}\varphi(X_s)\,m(s,du)\,ds. \tag{4.4}$$
Denote by $\mathcal{P}_R$ the collection of all probability measures on $\overline{\Omega}$ under which $X_0 = x_0$ a.s. and $M_t(\varphi)$ is an $\overline{\mathbb{F}}$-martingale for every $\varphi \in C_b^\infty(\mathbb{R}^d)$. In [9], a probability measure in $\mathcal{P}_R$ is called a relaxed control rule.
Remark 4.1. By abuse of notation, we define the random processes on $\overline{\Omega}$
$$\mu(t, \omega^d, m) := \int_E \mu(t, \omega^d, u)\,m(t,du), \qquad a(t, \omega^d, m) := \int_E \sigma\sigma^\top(t, \omega^d, u)\,m(t,du),$$
which are clearly adapted to the filtration $\overline{\mathbb{F}}$. It follows that $M_t(\varphi)$ defined by (4.4) can be rewritten as
$$M_t(\varphi) = \varphi(X_t) - \varphi(X_0) - \int_0^t \Big(\mu(s, \omega^d, m)\cdot D\varphi(X_s) + \tfrac{1}{2}\,a(s, \omega^d, m)\cdot D^2\varphi(X_s)\Big)\,ds.$$
Therefore, under any probability $\mathbb{P} \in \mathcal{P}_R$, since $a(s, \omega^d, m)$ is non-degenerate, there is a Brownian motion $\tilde{W}$ on $\overline{\Omega}$ such that the canonical process can be represented as
$$X_t = x_0 + \int_0^t \mu(s, X_\cdot, m)\,ds + \int_0^t a^{1/2}(s, X_\cdot, m)\,d\tilde{W}_s.$$
Moreover, it follows by Itô's formula, together with the definition of $\mathbb{P}^\nu$ in (4.2), that $\mathbb{P}^\nu \in \mathcal{P}_R$ for every $\nu \in \mathcal{U}$. In summary, we have $\mathcal{P}_{S_0} \subset \mathcal{P}_S \subset \mathcal{P}_R$.
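For completeness, the Itô computation behind the inclusion $\mathbb{P}^\nu \in \mathcal{P}_R$ can be sketched as follows; this is a sketch under the standing boundedness assumptions on $\mu$ and $\sigma$, not a verbatim step of the paper.

```latex
% Under P, Ito's formula applied to phi(X^nu_t), phi in C_b^infty(R^d), gives
\varphi(X^\nu_t) - \varphi(X^\nu_0)
  \;=\; \int_0^t \mathcal{L}^{s,X^\nu_\cdot,\nu_s}\varphi(X^\nu_s)\,ds
        \;+\; \int_0^t D\varphi(X^\nu_s)^\top \sigma(s,X^\nu_\cdot,\nu_s)\,dW_s.
% Since m^nu(s,du) = delta_{nu_s}(du), the ds-integral coincides with
%   \int_0^t \int_E \mathcal{L}^{s,X^\nu_\cdot,u}\varphi(X^\nu_s)\,m^\nu(s,du)\,ds,
% so M(phi) evaluated along (X^nu, m^nu) equals the stochastic integral above,
% a martingale because sigma and D(phi) are bounded.  Transferring through
% (4.2) shows that M(phi) is a martingale under P^nu, i.e. P^nu lies in P_R.
```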
A completion space of $C_b(\overline{\Omega})$ Now, let us introduce two random variables on $\overline{\Omega}$ by
$$\Psi(\overline{\omega}) = \Psi(\omega^d, m) := \int_0^T\!\!\int_E L(t, \omega^d, u)\,m(dt,du) + \Phi(\omega^d) \tag{4.5}$$
and
$$\Psi^h(\overline{\omega}) = \Psi^h(\omega^d, m) := \int_0^T\!\!\int_E L^h(t, \omega^d, u)\,m(dt,du) + \Phi(\omega^d),$$
where $L^h(t,\mathbf{x},u) := L(t_k,\mathbf{x},u)$ for every $t \in [t_k, t_{k+1})$, given the discretization parameter $h := T/n$. It follows from the uniform continuity of $L$ in $t$ that
$$\sup_{\overline{\omega}\in\overline{\Omega}} \big|\Psi(\overline{\omega}) - \Psi^h(\overline{\omega})\big| \;\le\; \rho_0(h), \tag{4.6}$$
where $\rho_0$ is the modulus of continuity of $L$ in $t$ introduced before (2.9). Moreover, the optimization problems (2.3) and (3.12) are equivalent to
$$V = \sup_{\mathbb{P}\in\mathcal{P}_S} \mathbb{E}^{\mathbb{P}}[\Psi] \quad\text{and}\quad V^h_0 = \sup_{\mathbb{P}\in\mathcal{P}_h} \mathbb{E}^{\mathbb{P}}[\Psi^h]. \tag{4.7}$$
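The mechanism behind (4.6) can be checked numerically: freezing a Lipschitz-in-time running cost at the left endpoints of the grid perturbs the integral by at most the modulus $\rho_0(h)$. The cost $L$, the control path and the Lipschitz constant below are illustrative assumptions.

```python
import numpy as np

# Illustrative check of the mechanism behind (4.6): replacing L(t, ., u) by
# L(t_k, ., u) on each [t_k, t_{k+1}) changes the dt-integral by at most
# rho_0(h) when T = 1.  L, nu and the Lipschitz constant are assumptions.
T, n = 1.0, 50
h = T / n
lip = 2.0
L = lambda t, u: lip * t * np.sin(u) + u ** 2     # lip-Lipschitz in t
rho_0 = lambda step: lip * step                   # modulus of continuity in t

m = 5000                                          # fine grid for the integral
t = np.linspace(0.0, T, m, endpoint=False)
nu = np.cos(4 * t)                                # a control path
t_frozen = np.floor(t / h) * h                    # t_k with t in [t_k, t_{k+1})

Psi   = np.sum(L(t,        nu)) * (T / m)         # int L(t, nu_t) dt
Psi_h = np.sum(L(t_frozen, nu)) * (T / m)         # int L^h(t, nu_t) dt
assert abs(Psi - Psi_h) <= rho_0(h)
```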
Finally, we introduce a space $L^1_*$ of random variables on $\overline{\Omega}$. Let
$$\mathcal{P}_* := \Big(\bigcup_{0 < h \le h_0} \mathcal{P}_h\Big) \cup \mathcal{P}_R,$$
and define a norm $|\cdot|_*$ for random variables on $\overline{\Omega}$ by
$$|\xi|_* := \sup_{\mathbb{P}\in\mathcal{P}_*} \mathbb{E}^{\mathbb{P}}|\xi|.$$
Denote by $L^1_*$ the completion of $C_b(\overline{\Omega})$ under the norm $|\cdot|_*$.

4.1.2 Convergence in the enlarged space
We first give a convergence result for random variables in $L^1_*$. Then we show that $\Psi$ defined by (4.5) belongs to $L^1_*$. Finally, we provide two further convergence lemmas.

Lemma 4.2. Suppose that $\xi \in L^1_*$ and that $(\mathbb{P}_n)_{n\ge0}$ is a sequence of probability measures in $\mathcal{P}_*$ converging weakly to $\mathbb{P} \in \mathcal{P}_*$. Then
$$\mathbb{E}^{\mathbb{P}_n}[\xi] \;\to\; \mathbb{E}^{\mathbb{P}}[\xi]. \tag{4.8}$$
Proof. For every $\varepsilon > 0$, there is $\xi^\varepsilon \in C_b(\overline{\Omega})$ such that $\sup_{\mathbb{P}\in\mathcal{P}_*} \mathbb{E}^{\mathbb{P}}\big[|\xi - \xi^\varepsilon|\big] \le \varepsilon$. It follows that
$$\limsup_{n\to\infty} \big|\mathbb{E}^{\mathbb{P}_n}[\xi] - \mathbb{E}^{\mathbb{P}}[\xi]\big| \;\le\; \limsup_{n\to\infty} \Big[\mathbb{E}^{\mathbb{P}_n}\big[|\xi - \xi^\varepsilon|\big] + \big|\mathbb{E}^{\mathbb{P}_n}[\xi^\varepsilon] - \mathbb{E}^{\mathbb{P}}[\xi^\varepsilon]\big| + \mathbb{E}^{\mathbb{P}}\big[|\xi^\varepsilon - \xi|\big]\Big] \;\le\; 2\varepsilon,$$
where the middle term vanishes by the weak convergence of $\mathbb{P}_n$ to $\mathbb{P}$, since $\xi^\varepsilon$ is bounded and continuous. Therefore, (4.8) holds true by the arbitrariness of $\varepsilon$.
The next result shows that the random variable $\Psi$ defined by (4.5) belongs to $L^1_*$ when $L$ and $\Phi$ satisfy (2.9).

Lemma 4.3. Suppose that Assumption 1 holds true and that $\Phi$ and $L$ satisfy (2.9). Then the random variable $\Psi$ defined by (4.5) lies in $L^1_*$.

Proof. We first claim that, for every $C > 0$,
$$\sup_{\mathbb{P}\in\mathcal{P}_*} \mathbb{E}^{\mathbb{P}}\big[e^{C|X_T|}\big] \;<\; \infty, \tag{4.9}$$
which implies in particular that $\mathbb{E}\big[e^{C|\hat{X}^{h,\phi}_n|}\big]$ is bounded uniformly in $h$ and in $\phi \in \mathcal{A}_h$. Let $C_0 > 0$ be such that $|\mu(t,\mathbf{x},u)| \le C_0$ for all $(t,\mathbf{x},u) \in Q_T$. Then $\big(|\hat{X}^{h,\phi}_k| - C_0 t_k\big)_{0\le k\le n}$ is a submartingale, and hence, for every $C_1 > 0$, $\big(e^{C_1(|\hat{X}^{h,\phi}_k| - C_0 t_k)}\big)_{0\le k\le n}$ is also a submartingale. Therefore, by Doob's inequality,
$$\mathbb{E}\Big[\sup_{0\le k\le n} e^{C_1|\hat{X}^{h,\phi}_k|}\Big] \;\le\; e^{C_1 C_0 T}\,\mathbb{E}\Big[\sup_{0\le k\le n}$$