HAL Id: hal-00607274
https://hal.archives-ouvertes.fr/hal-00607274
Preprint submitted on 8 Jul 2011
HAL is a multi-disciplinary open access archive for the deposit and dissemination of scientific research documents, whether they are published or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.
A regression Monte-Carlo method for Backward Doubly
Stochastic Differential Equations
Omar Aboura
To cite this version:
Omar Aboura. A regression Monte-Carlo method for Backward Doubly Stochastic Differential Equations. 2011. hal-00607274
A REGRESSION MONTE-CARLO METHOD FOR BACKWARD DOUBLY STOCHASTIC DIFFERENTIAL EQUATIONS
OMAR ABOURA
Abstract. This paper extends the approach of E. Gobet, J.P. Lemor and X. Warin from the setting of Backward Stochastic Differential Equations to that of Backward Doubly Stochastic Differential Equations. We propose a numerical approximation scheme for these equations, which were introduced by E. Pardoux and S. Peng.
1. Introduction
Since the pioneering work of E. Pardoux and S. Peng [11], backward stochastic differential equations (BSDEs) have been intensively studied during the last two decades. Indeed, this notion has been a very useful tool to study problems in many areas, such as mathematical finance, stochastic control and partial differential equations; see e.g. [9], where many applications are described. Discretization schemes for BSDEs have been studied by several authors. The first papers on this topic are those of V. Bally [4] and D. Chevance [6]. In his thesis, J. Zhang made an interesting contribution which was the starting point of an intense study, among which the works of B. Bouchard and N. Touzi [5] and E. Gobet, J.P. Lemor and X. Warin [7]. The notion of BSDE has been generalized by E. Pardoux and S. Peng [12] to that of Backward Doubly Stochastic Differential Equation (BDSDE) as follows. Let $(\Omega, \mathcal F, P)$ be a probability space, let $T$ denote some fixed terminal time which will be used throughout the paper, and let $(W_t)_{0\le t\le T}$ and $(B_t)_{0\le t\le T}$ be two independent standard Brownian motions defined on $(\Omega, \mathcal F, P)$ with values in $\mathbb R$. On this space we will deal with the following families of $\sigma$-algebras:
\[
\mathcal F_t := \mathcal F^W_{0,t} \vee \mathcal F^B_{t,T} \vee \mathcal N, \qquad \widehat{\mathcal F}_t := \mathcal F^W_{0,t} \vee \mathcal F^B_{0,T} \vee \mathcal N, \qquad \mathcal H_t := \mathcal F^W_{0,T} \vee \mathcal F^B_{t,T} \vee \mathcal N, \tag{1.1}
\]
where $\mathcal F^B_{t,T} := \sigma(B_r - B_t;\ t\le r\le T)$, $\mathcal F^W_{0,t} := \sigma(W_r;\ 0\le r\le t)$ and $\mathcal N$ denotes the class of $P$-null sets. We remark that $(\widehat{\mathcal F}_t)$ is a filtration, $(\mathcal H_t)$ is a decreasing family of $\sigma$-algebras, while $(\mathcal F_t)$ is neither increasing nor decreasing. Given an initial condition $x\in\mathbb R$, let $(X_t)$ be the diffusion process defined by
\[
X_t = x + \int_0^t b(X_s)\,ds + \int_0^t \sigma(X_s)\,dW_s. \tag{1.2}
\]
Let $\xi \in L^2(\Omega)$ be an $\mathbb R$-valued, $\mathcal F_T$-measurable random variable, and let $f$ and $g$ be regular enough coefficients; consider the BDSDE defined as follows:
\[
Y_t = \xi + \int_t^T f(s, X_s, Y_s, Z_s)\,ds + \int_t^T g(s, X_s, Y_s, Z_s)\,d\overleftarrow{B}_s - \int_t^T Z_s\,dW_s. \tag{1.3}
\]
In this equation, $dW$ is the forward stochastic integral and $d\overleftarrow{B}$ is the backward stochastic integral (we refer the reader to [10] for more details on backward integration). A solution
Date: July 8, 2011.
to (1.3) is a pair of real-valued processes $(Y_t, Z_t)$ such that $Y_t$ and $Z_t$ are $\mathcal F_t$-measurable for every $t\in[0,T]$, such that (1.3) is satisfied and
\[
E\Big(\sup_{0\le s\le T}|Y_s|^2\Big) + E\int_0^T |Z_s|^2\,ds < +\infty. \tag{1.4}
\]
In [12] Pardoux and Peng have proved that under some Lipschitz property on $f$ and $g$, which will be stated later, (1.3) has a unique solution $(Y, Z)$. They also proved that
\[
Y_t = u\big(t, X_t, (\overleftarrow{\Delta B}_s)_{t\le s\le T}\big), \qquad Z_t = v\big(t, X_t, (\overleftarrow{\Delta B}_s)_{t\le s\le T}\big),
\]
for some Borel functions $u$ and $v$.
The time discretization of BDSDEs has been addressed in [2] when the coefficient $g$ does not depend on $Z$; see also [1] in the more general setting where $g$ may also depend on $Z$ as in [12]. Both papers follow Zhang's approach and provide a theoretical approximation only, using a constant time mesh.
In order to obtain a more tractable discretization which could be implemented, a natural idea is to see whether the methods introduced in [7] can be extended from the framework of BSDEs to the more involved one of BDSDEs; this is the aim of this paper.
We use three consecutive steps, and each time we give a precise estimate of the corresponding error. Thus, we start with a time discretization $(Y^N_{t_k}, Z^N_{t_k})$ with a constant time mesh $T/N$. We can prove that
\[
Y^N_{t_k} = u^N\big(t_k, X^N_{t_k}, \overleftarrow{\Delta B}_{N-1}, \dots, \overleftarrow{\Delta B}_k\big), \qquad Z^N_{t_k} = v^N\big(t_k, X^N_{t_k}, \overleftarrow{\Delta B}_{N-1}, \dots, \overleftarrow{\Delta B}_k\big),
\]
where for $k = 1, \dots, N-1$, $t_k = kT/N$ and $\overleftarrow{\Delta B}_k = B_{t_{k+1}} - B_{t_k}$. Furthermore, if either $f = 0$ or if the scheme is not implicit as in [1], then we have the more precise description:
\[
Y^N_{t_k} = u^N_N\big(t_k, X^N_{t_k}\big) + \sum_{j=k}^{N-1} u^N_j\big(t_k, X^N_{t_k}, \overleftarrow{\Delta B}_{N-1}, \dots, \overleftarrow{\Delta B}_{j+1}\big)\,\overleftarrow{\Delta B}_j,
\]
\[
Z^N_{t_k} = v^N_N\big(t_k, X^N_{t_k}\big) + \sum_{j=k}^{N-1} v^N_j\big(t_k, X^N_{t_k}, \overleftarrow{\Delta B}_{N-1}, \dots, \overleftarrow{\Delta B}_{j+1}\big)\,\overleftarrow{\Delta B}_j,
\]
with the convention that if $j+1 > N-1$, the list $\big(\overleftarrow{\Delta B}_{N-1}, \dots, \overleftarrow{\Delta B}_{j+1}\big)$ is empty. The main time discretization result in this direction is Theorem 3.4. In order to have a numerical scheme, we use this decomposition and the ideas of E. Gobet, J.P. Lemor and X. Warin [7]. Thus we introduce hypercubes, that is, we approximate the random variables $u^N_j\big(t_k, X^N_{t_k}, \overleftarrow{\Delta B}_{N-1}, \dots, \overleftarrow{\Delta B}_{j+1}\big)\,\overleftarrow{\Delta B}_j$ by their orthogonal projection on some finite vector space generated by some bases $(u_j)$ and $(v_j)$ defined below. For $k = 1, \dots, N$ we have
\[
Y^N_{t_k} \approx \sum_{i_N} E\big[Y^N_{t_k} u_{i_N}\big(X^N_{t_k}\big)\big]\, u_{i_N}\big(X^N_{t_k}\big) + \sum_{j=k}^{N-1}\ \sum_{i_N, i_{N-1}, \dots, i_j} E\Big[Y^N_{t_k}\, u_{i_N}\big(X^N_{t_k}\big)\, v_{i_{N-1}}\big(\overleftarrow{\Delta B}_{N-1}\big)\cdots v_{i_{j+1}}\big(\overleftarrow{\Delta B}_{j+1}\big)\,\frac{\overleftarrow{\Delta B}_j}{\sqrt h}\Big]\, u_{i_N}\big(X^N_{t_k}\big)\, v_{i_{N-1}}\big(\overleftarrow{\Delta B}_{N-1}\big)\cdots v_{i_{j+1}}\big(\overleftarrow{\Delta B}_{j+1}\big)\,\frac{\overleftarrow{\Delta B}_j}{\sqrt h}.
\]
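The approximation above amounts, cell by cell, to a conditional-expectation estimate. A minimal Python sketch of the first (single-basis) part follows, with the exact expectations replaced by sample averages; the data `Y`, `X` and the cell `edges` are hypothetical illustrations, not taken from the paper:

```python
import math

def project_on_indicator_basis(Y, X, edges):
    """L2-projection of Y onto span{u_i(X)}, where u_i(x) = 1_{cell_i}(x)/sqrt(P(X in cell_i));
    the coefficients E[Y u_i(X)] are estimated here by sample averages."""
    M = len(Y)
    coeffs, probs = [], []
    for i in range(len(edges) - 1):
        idx = [j for j in range(M) if edges[i] <= X[j] < edges[i + 1]]
        p = len(idx) / M
        probs.append(p)
        coeffs.append(sum(Y[j] for j in idx) / M / math.sqrt(p) if p > 0.0 else 0.0)

    def approx(x):
        # reconstruct sum_i E[Y u_i(X)] u_i(x); on cell i this is the cell average of Y
        for i in range(len(edges) - 1):
            if edges[i] <= x < edges[i + 1] and probs[i] > 0.0:
                return coeffs[i] / math.sqrt(probs[i])
        return 0.0
    return approx

# hypothetical sample: Y equals 1 on the left cell, 3 on the right cell
approx = project_on_indicator_basis([1.0, 1.0, 3.0, 3.0],
                                    [0.1, 0.2, 0.7, 0.8],
                                    [0.0, 0.5, 1.0])
```

On each cell the reconstruction reduces to the empirical average of $Y$ over that cell, which is the usual piecewise-constant regression estimate of a conditional expectation.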
We use a linear regression operator on the approximate solution. Thus, we first use an orthogonal projection on a finite dimensional space $\mathcal P_k$. This space consists of linear combinations of an orthonormal family of properly renormalized indicator functions of disjoint intervals, composed either with the diffusion $X$ or with increments of the Brownian motion $B$. As in [7], in order not to introduce error terms worse than those due to the time discretization, we furthermore have to use a Picard iteration scheme. The error due to this regression operator is estimated in Theorem 4.1.
Then the coefficients $(\alpha, \beta)$ of the decomposition of the projection of $(Y^N_{t_k}, Z^N_{t_k})$ are shown to solve a regression minimization problem and are expressed in terms of expected values. Note that a general regression approach has also been used by Bouchard and Touzi for BSDEs in [5]. Finally, the last step consists in replacing the minimization problem for the pair $(\alpha, \beta)$, stated in terms of expectations, by similar expressions described in terms of an average over a sample of size $M$ of the Brownian motions $W$ and $B$. Then, a proper localization is needed to get an $L^2$ bound of the last error term. This requires another Picard iteration, and the error term due to this Monte Carlo method is described in Theorem 5.8.
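The link between the projection coefficients and an empirical regression can be illustrated on the simplest basis: for disjoint indicator functions, minimizing the squared error over a sample of size $M$ decouples into per-cell averages. A hedged sketch, with hypothetical sample data and cells:

```python
def regression_coeffs(Y, X, edges):
    """Minimize (1/M) sum_m (Y_m - sum_i alpha_i 1_{cell_i}(X_m))^2 over alpha.
    For disjoint indicator cells the normal equations decouple, so the
    minimizer is the per-cell sample average of Y (empty cells get 0)."""
    coeffs = []
    for i in range(len(edges) - 1):
        ys = [Y[m] for m in range(len(Y)) if edges[i] <= X[m] < edges[i + 1]]
        coeffs.append(sum(ys) / len(ys) if ys else 0.0)
    return coeffs

# hypothetical Monte-Carlo sample of size M = 4
alpha = regression_coeffs([1.0, 1.0, 3.0, 3.0], [0.1, 0.2, 0.7, 0.8], [0.0, 0.5, 1.0])
```

This decoupling is specific to orthogonal indicator bases; for a general basis one would solve the full least-squares system instead.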
A motivation to study BSDEs is that these equations are widely used in financial models, so that having efficient and fast numerical methods is important. As noted in [12], BDSDEs are connected with stochastic partial differential equations, and the discretization of (2.2) is motivated by its link with the following SPDE:
\[
u(t,x) = \phi(x) + \int_t^T \big[\mathcal L u(s,x) + f\big(s, x, u(s,x), \nabla u(s,x)\sigma(x)\big)\big]\,ds + \int_t^T g\big(s, x, u(s,x), \nabla u(s,x)\sigma(x)\big)\,d\overleftarrow{B}_s. \tag{1.5}
\]
Discretizations of SPDEs are mainly based on PDE techniques, such as finite difference or finite element methods. Another approach for special equations is given by particle systems. We believe that this paper gives a third way to deal with this problem. As usual, the presence of the gradient in the diffusion coefficient is the most difficult part to handle when dealing with SPDEs. Only a few results are obtained in the classical discretization framework when PDE methods are extended to the stochastic case.
Despite the fact that references [2] and [3] deal with a problem similar to the one we address in Section 3, we have kept the results and proofs of this section. Indeed, on the one hand we study here an implicit scheme as in [7], and we wanted the paper to be self-contained. Furthermore, because of measurability properties of $Y_0$ and $Y^\pi_0$, the statements and proofs of Theorem 3.6 in [2] and Theorem 4.6 in [3] are unclear, and there is a gap in the corresponding proofs because of similar measurability issues for $(Y_t)$ and $(Y^\pi_t)$.
The paper is organized as follows. Section 2 gives the main notations concerning the time discretization and the function basis. Section 3 describes the time discretization, and results similar to those in [2] are proved in a more general framework. The fourth section describes the projection error. Finally, Section 5 studies the regression technique and the corresponding Monte Carlo method. Note that the presence of increments of the Brownian motion $B$, which drives the backward stochastic integrals, requires some new arguments, such as Lemma 5.16, which is a key ingredient of the last error estimates. As usual, $C$ denotes a constant which can change from line to line.
2. Notations
Let $(W_t,\ t\ge 0)$ and $(B_t,\ t\ge 0)$ be two mutually independent standard Brownian motions. For each $x\in\mathbb R$, let $(X_t, Y_t, Z_t,\ t\in[0,T])$ denote the solution of the following equations, introduced by E. Pardoux and S. Peng in [12]:
\[
X_t = x + \int_0^t b(X_s)\,ds + \int_0^t \sigma(X_s)\,dW_s, \tag{2.1}
\]
\[
Y_t = \Phi(X_T) + \int_t^T f(X_s, Y_s, Z_s)\,ds + \int_t^T g(X_s, Y_s)\,d\overleftarrow{B}_s - \int_t^T Z_s\,dW_s. \tag{2.2}
\]
Assumption. We suppose that the coefficients $f$ and $g$ satisfy the following: $\Phi(X_T)\in L^2$ and
\[
|f(x,y,z) - f(x',y',z')|^2 \le L_f\big(|x-x'|^2 + |y-y'|^2 + |z-z'|^2\big), \tag{2.3}
\]
\[
|g(x,y) - g(x',y')|^2 \le L_g\big(|x-x'|^2 + |y-y'|^2\big). \tag{2.4}
\]
Note that (2.3) and (2.4) yield that $f$ and $g$ have linear growth in their arguments. We use two approximations. We first discretize in time with a constant time mesh $h = T/N$, which yields the processes $X^N, Y^N, Z^N$. We then approximate the pair $(Y^N, Z^N)$ by some kind of Picard iteration scheme with $I$ steps, giving $(Y^{N,i,I}, Z^{N,I})$ for $i = 1, \dots, I$.
In order to be as clear as possible, we introduce below all the definitions used in the paper. Most of them are the same as in [7].
(N0) For $0\le t\le t'\le T$, set $\mathcal F_t = \mathcal F^W_t \vee \mathcal F^B_{t,T}$ and
\[
\mathcal F^W_t = \sigma(W_s;\ 0\le s\le t)\vee\mathcal N, \qquad \mathcal F^B_{t,t'} = \sigma(B_s - B_{t'};\ t\le s\le t')\vee\mathcal N.
\]
$E_k$ is the conditional expectation with respect to $\mathcal F_{t_k}$.
(N1) $N$ is the number of steps of the time discretization, the integer $I$ corresponds to the number of steps of the Picard iteration, $h := T/N$ is the size of the time mesh, and for $k = 0, 1, \dots, N$ we set $t_k := kh$, $\overleftarrow{\Delta B}_k = B_{t_{k+1}} - B_{t_k}$ and $\Delta W_{k+1} = W_{t_{k+1}} - W_{t_k}$. Let $\pi = \{t_0, t_1, \dots, t_N = T\}$ denote the corresponding subdivision of $[0,T]$.
(N2) The function basis for $X^N_{t_k}$ is defined as follows: let $a_k < b_k$ be two reals and let $(\mathcal X^k_i)_{i=1,\dots,L}$ denote a partition of $[a_k, b_k]$; for $i = 1, \dots, L$ set
\[
u_i\big(X^N_{t_k}\big) := \mathbf 1_{\mathcal X^k_i}\big(X^N_{t_k}\big) \Big/ \sqrt{P\big(X^N_{t_k}\in\mathcal X^k_i\big)}. \tag{2.5}
\]
(N3) The function basis for $\mathcal N \sim N(0,h)$ is defined as follows: let $a < b$ be two reals and let $(\mathcal B_i)_{i=1,\dots,L}$ denote a partition of $[a,b]$. For $i = 1, \dots, L$ set
\[
v_i(\mathcal N) := \mathbf 1_{\mathcal B_i}(\mathcal N) \Big/ \sqrt{P(\mathcal N\in\mathcal B_i)}. \tag{2.6}
\]
(N4) For fixed $k = 1, \dots, N$, let $p_k$ denote the following vector, whose components belong to $L^2(\Omega)$. It is defined blockwise as follows:
\[
\Big(u_{i_N}\big(X^N_{t_k}\big)\Big)_{i_N}, \quad \Big(u_{i_N}\big(X^N_{t_k}\big)\tfrac{\overleftarrow{\Delta B}_{N-1}}{\sqrt h}\Big)_{i_N}, \quad \Big(u_{i_N}\big(X^N_{t_k}\big)\, v_{i_{N-1}}\big(\overleftarrow{\Delta B}_{N-1}\big)\tfrac{\overleftarrow{\Delta B}_{N-2}}{\sqrt h}\Big)_{i_N, i_{N-1}}, \quad \dots, \quad \Big(u_{i_N}\big(X^N_{t_k}\big)\prod_{j=k+1}^{N-1} v_{i_j}\big(\overleftarrow{\Delta B}_j\big)\tfrac{\overleftarrow{\Delta B}_k}{\sqrt h}\Big)_{i_N, i_{N-1}, \dots, i_{k+1}}.
\]
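The normalized indicator bases (2.5)-(2.6) can be sketched as follows. Note one assumption of this illustration: the cell probabilities are estimated empirically from a sample, whereas the definitions above use the true laws of $X^N_{t_k}$ and $N(0,h)$:

```python
import math

def indicator_basis(samples, edges):
    """Normalized indicator basis: for cell i = [edges[i], edges[i+1]) the
    basis function is 1_{cell_i}(x) / sqrt(P(x in cell_i)), with the cell
    probability P estimated empirically from `samples` (an assumption here)."""
    M = len(samples)
    probs = []
    for i in range(len(edges) - 1):
        probs.append(sum(1 for x in samples if edges[i] <= x < edges[i + 1]) / M)

    def u(i, x):
        # i-th normalized indicator evaluated at x (0 outside its cell)
        if probs[i] == 0.0:
            return 0.0
        return (1.0 if edges[i] <= x < edges[i + 1] else 0.0) / math.sqrt(probs[i])
    return u, probs

# hypothetical sample and a two-cell partition of [0, 1)
u, probs = indicator_basis([0.1, 0.4, 0.6, 0.9], [0.0, 0.5, 1.0])
```

With this normalization each basis function has empirical $L^2$-norm 1, which is what makes the family orthonormal and the projection coefficients plain expectations.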
3. Approximation result: step 1
We first consider a time discretization of equations (2.1) and (2.2). The forward equation (2.1) is approximated using the Euler scheme: $X^N_{t_0} = x$ and for $k = 0, \dots, N-1$,
\[
X^N_{t_{k+1}} = X^N_{t_k} + h\,b\big(X^N_{t_k}\big) + \sigma\big(X^N_{t_k}\big)\,\Delta W_{k+1}. \tag{3.1}
\]
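The Euler scheme (3.1) is straightforward to simulate; in the sketch below the coefficients `b` and `sigma` are hypothetical Lipschitz functions chosen only for illustration:

```python
import math
import random

def euler_forward(x0, b, sigma, T, N, rng):
    # Euler scheme (3.1): X_{k+1} = X_k + h b(X_k) + sigma(X_k) * DeltaW_{k+1}
    h = T / N
    X = [x0]
    for k in range(N):
        dW = rng.gauss(0.0, math.sqrt(h))  # Brownian increment of variance h
        X.append(X[k] + h * b(X[k]) + sigma(X[k]) * dW)
    return X

# hypothetical coefficients b(x) = -x, sigma(x) = 0.4
rng = random.Random(0)
path = euler_forward(1.0, lambda x: -x, lambda x: 0.4, T=1.0, N=100, rng=rng)
```

The returned list holds the $N+1$ values $X^N_{t_0}, \dots, X^N_{t_N}$ of one simulated trajectory.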
The following result is well known (see e.g. [8]).
Theorem 3.1. There exists a constant $C$ such that for every $N$,
\[
\max_{k=1,\dots,N}\ \sup_{t_{k-1}\le r\le t_k} E\big|X_r - X^N_{t_{k-1}}\big|^2 \le Ch, \qquad \max_{k=0,\dots,N} E\big|X^N_{t_k}\big|^2 \le C < \infty.
\]
The following time regularity is proved in [2] (see also Theorem 2.3 in [1]); it extends the original result of Zhang [13].
Lemma 3.2. There exists a constant $C$ such that for every integer $N\ge 1$ and $s, t\in[0,T]$,
\[
\sum_{k=1}^N E\int_{t_{k-1}}^{t_k} \big(\big|Z_r - Z_{t_{k-1}}\big|^2 + |Z_r - Z_{t_k}|^2\big)\,dr \le Ch, \qquad E|Y_t - Y_s|^2 \le C|t-s|.
\]
The backward equation (2.2) is approximated by backward induction as follows:
\[
Y^N_{t_N} := \Phi\big(X^N_{t_N}\big), \qquad Z^N_{t_N} := 0, \tag{3.2}
\]
\[
Z^N_{t_k} := \frac 1h E_k\big[Y^N_{t_{k+1}}\Delta W_{k+1}\big] + \frac 1h \overleftarrow{\Delta B}_k\, E_k\big[g\big(X^N_{t_{k+1}}, Y^N_{t_{k+1}}\big)\Delta W_{k+1}\big], \tag{3.3}
\]
\[
Y^N_{t_k} := E_k\big[Y^N_{t_{k+1}}\big] + h f\big(X^N_{t_k}, Y^N_{t_k}, Z^N_{t_k}\big) + \overleftarrow{\Delta B}_k\, E_k\big[g\big(X^N_{t_{k+1}}, Y^N_{t_{k+1}}\big)\big]. \tag{3.4}
\]
Note that as in [2], [3] and [7] we have introduced an implicit scheme, thus different from that in [1]. However, it differs from those in [2] and [3], since the conditional expectation we use is taken with respect to $\mathcal F_{t_k}$, which is different from $\sigma\big(X^N_{t_j},\ j\le k\big) \vee \sigma\big(B_{t_j},\ j\le k\big)$ used in [3].
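One backward step of (3.3)-(3.4) can be sketched once the four conditional expectations appearing there are available (in practice they come from the regression of Sections 4-5, so they are abstract inputs here); the implicit $Y$-update is solved by a small Picard loop. All numeric inputs in the example call are illustrative only:

```python
def backward_step(Ek_Y, Ek_YdW, Ek_g, Ek_gdW, dB_k, h, f, x_k, n_picard=20):
    """One step of the backward scheme, given
    Ek_Y   = E_k[Y_{k+1}],            Ek_YdW = E_k[Y_{k+1} DeltaW],
    Ek_g   = E_k[g(X_{k+1},Y_{k+1})], Ek_gdW = E_k[g(X_{k+1},Y_{k+1}) DeltaW]."""
    # explicit Z-update (3.3)
    Z_k = Ek_YdW / h + dB_k * Ek_gdW / h
    # implicit Y-update (3.4), solved by fixed-point (Picard) iteration;
    # the map is a contraction with factor O(h) for h small
    Y_k = Ek_Y
    for _ in range(n_picard):
        Y_k = Ek_Y + h * f(x_k, Y_k, Z_k) + dB_k * Ek_g
    return Y_k, Z_k

# toy conditional expectations and a hypothetical driver f(x, y, z) = -y + z
Y, Z = backward_step(1.0, 0.02, 0.5, 0.01, dB_k=0.1, h=0.01,
                     f=lambda x, y, z: -y + z, x_k=0.0)
```

The fixed-point loop converges geometrically at rate $hL_f$, which is the same mechanism as in Proposition 3.3 below.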
Proposition 3.3 (Existence of the scheme). For sufficiently large $N$, the above scheme has a unique solution. Moreover, for all $k = 0, \dots, N$, we have $Y^N_{t_k}, Z^N_{t_k} \in L^2(\mathcal F_{t_k})$.
The following theorem is the main result of this section.
Theorem 3.4. There exists a constant $C > 0$ such that for $h$ small enough,
\[
\max_{0\le k\le N} E\big|Y_{t_k} - Y^N_{t_k}\big|^2 + \sum_{k=0}^{N-1}\int_{t_k}^{t_{k+1}} E\big|Z_r - Z^N_{t_k}\big|^2\,dr \le Ch + C\, E\big|\Phi\big(X^N_{t_N}\big) - \Phi(X_T)\big|^2.
\]
The rest of this section is devoted to the proof of this theorem; it requires several steps. First of all, we define a process $(Y^\pi_t, Z^\pi_t)_{t\in[0,T]}$ such that $Y^\pi_{t_k}$ and $Z^\pi_{t_k}$ are $\mathcal F_{t_k}$-measurable, and a family of $\mathcal F_{t_k}$-measurable random variables $Z^{\pi,1}_{t_k}$, $k = 0, \dots, N$, as follows. For $t = T$, set
\[
Y^\pi_T := \Phi\big(X^N_{t_N}\big), \qquad Z^\pi_T := 0, \qquad Z^{\pi,1}_{t_N} := 0. \tag{3.5}
\]
Suppose that the scheme $(Y^\pi_t, Z^\pi_t)$ is defined for all $t\in[t_k, T]$ and that $Z^{\pi,1}_{t_j}$ has been defined for $j = N, \dots, k$. Then for $h$ small enough the following equation
\[
M^k_{t_{k-1}} := E_{k-1}\Big[Y^\pi_{t_k} + f\big(X^N_{t_{k-1}}, M^k_{t_{k-1}}, Z^N_{t_{k-1}}\big)\Delta t_{k-1} + g\big(X^N_{t_k}, Y^\pi_{t_k}\big)\overleftarrow{\Delta B}_{k-1}\Big] \tag{3.6}
\]
has a unique solution.
Using Proposition 3.3 and the linear growth of $f$, we deduce that the map $F_\xi$ defined by
\[
F_\xi(Y) = E_{k-1}\Big[\xi + h f\big(X^N_{t_{k-1}}, Y, Z^N_{t_{k-1}}\big)\Big] \tag{3.7}
\]
is such that $F_\xi\big(L^2(\mathcal F_{t_{k-1}})\big) \subset L^2(\mathcal F_{t_{k-1}})$. Furthermore, given $Y, Y' \in L^2(\mathcal F_{t_{k-1}})$, the $L^2$ contraction property of $E_{k-1}$ and the Lipschitz condition (2.3) imply $E|F_\xi(Y) - F_\xi(Y')|^2 \le h^2 L_f E|Y - Y'|^2$. Then $F_\xi$ is a contraction for $h$ small enough, and the fixed point theorem concludes the proof.
We can extend $M^k_\cdot$ to the interval $t\in[t_{k-1}, t_k]$ by letting
\[
M^k_t := E\Big[Y^\pi_{t_k} + f\big(X^N_{t_{k-1}}, M^k_{t_{k-1}}, Z^N_{t_{k-1}}\big)\Delta t_{k-1} + g\big(X^N_{t_k}, Y^\pi_{t_k}\big)\overleftarrow{\Delta B}_{k-1} \,\Big|\, \mathcal F^W_t \vee \mathcal F^B_{t_{k-1},T}\Big],
\]
which is consistent at time $t_{k-1}$.
By an extension of the martingale representation theorem (see e.g. [12] p. 212), there exists an $\big(\mathcal F^W_t \vee \mathcal F^B_{t_{k-1},T}\big)_{t_{k-1}\le t\le t_k}$-adapted and square integrable process $\big(N^k_t\big)_{t\in[t_{k-1},t_k]}$ such that for any $t\in[t_{k-1}, t_k]$, $M^k_t = M^k_{t_{k-1}} + \int_{t_{k-1}}^t N^k_s\,dW_s$, and hence $M^k_t = M^k_{t_k} - \int_t^{t_k} N^k_s\,dW_s$. Since
\[
M^k_{t_k} = Y^\pi_{t_k} + f\big(X^N_{t_{k-1}}, M^k_{t_{k-1}}, Z^N_{t_{k-1}}\big)\Delta t_{k-1} + g\big(X^N_{t_k}, Y^\pi_{t_k}\big)\overleftarrow{\Delta B}_{k-1},
\]
we deduce that for $t\in[t_{k-1}, t_k]$
\[
M^k_t = Y^\pi_{t_k} + f\big(X^N_{t_{k-1}}, M^k_{t_{k-1}}, Z^N_{t_{k-1}}\big)\Delta t_{k-1} + g\big(X^N_{t_k}, Y^\pi_{t_k}\big)\overleftarrow{\Delta B}_{k-1} - \int_t^{t_k} N^k_s\,dW_s. \tag{3.8}
\]
For $t\in[t_{k-1}, t_k)$, we set
\[
Y^\pi_t := M^k_t, \qquad Z^\pi_t := N^k_t, \qquad Z^{\pi,1}_{t_{k-1}} := \frac 1h E_{k-1}\Big(\int_{t_{k-1}}^{t_k} Z^\pi_r\,dr\Big). \tag{3.9}
\]
Lemma 3.5. For all $k = 0, \dots, N$,
\[
Y^\pi_{t_k} = Y^N_{t_k}, \qquad Z^{\pi,1}_{t_k} = Z^N_{t_k}, \tag{3.10}
\]
and hence for $k = 1, \dots, N$
\[
Y^\pi_{t_{k-1}} = Y^\pi_{t_k} + \int_{t_{k-1}}^{t_k} f\big(X^N_{t_{k-1}}, Y^\pi_{t_{k-1}}, Z^{\pi,1}_{t_{k-1}}\big)\,dr + \int_{t_{k-1}}^{t_k} g\big(X^N_{t_k}, Y^\pi_{t_k}\big)\,d\overleftarrow{B}_r - \int_{t_{k-1}}^{t_k} Z^\pi_r\,dW_r. \tag{3.11}
\]
Proof. We proceed by backward induction. For $k = N$, (3.10) is true by (3.2) and (3.5). Suppose that (3.10) holds for $l = N, N-1, \dots, k$, so that $Y^\pi_{t_k} = Y^N_{t_k}$ and $Z^{\pi,1}_{t_k} = Z^N_{t_k}$. Then (3.10) holds for $l = k-1$; indeed, for $\xi := Y^N_{t_k} + \overleftarrow{\Delta B}_{k-1}\, g\big(X^N_{t_k}, Y^N_{t_k}\big)$ we deduce from (3.4) and (3.6) that $M^k_{t_{k-1}} = F_\xi\big(M^k_{t_{k-1}}\big)$, $Y^N_{t_{k-1}} = F_\xi\big(Y^N_{t_{k-1}}\big)$ and $Y^\pi_{t_{k-1}} = M^k_{t_{k-1}} = F_\xi\big(M^k_{t_{k-1}}\big)$, where $F_\xi$ is defined by (3.7). So using the uniqueness of the fixed point of the map $F_\xi$, we can conclude that $Y^\pi_{t_{k-1}} = Y^N_{t_{k-1}}\,\big(= M^k_{t_{k-1}}\big)$. Therefore, (3.8) and (3.9) imply (3.11). Itô's formula yields
\[
\Delta W_k \int_{t_{k-1}}^{t_k} Z^\pi_r\,dW_r = \int_{t_{k-1}}^{t_k} \big(W_r - W_{t_{k-1}}\big) Z^\pi_r\,dW_r + \int_{t_{k-1}}^{t_k}\int_{t_{k-1}}^r Z^\pi_s\,dW_s\,dW_r + \int_{t_{k-1}}^{t_k} Z^\pi_r\,dr,
\]
so that $E_{k-1}\big[\Delta W_k \int_{t_{k-1}}^{t_k} Z^\pi_r\,dW_r\big] = E_{k-1}\int_{t_{k-1}}^{t_k} Z^\pi_r\,dr = h Z^{\pi,1}_{t_{k-1}}$. Hence, multiplying (3.11) by $\Delta W_k$ and taking the conditional expectation with respect to $\mathcal F_{t_{k-1}} = \mathcal F^W_{t_{k-1}} \vee \mathcal F^B_{t_{k-1},T}$, we deduce
\[
h Z^{\pi,1}_{t_{k-1}} = E_{k-1}\big[Y^N_{t_k}\Delta W_k\big] + \overleftarrow{\Delta B}_{k-1}\, E_{k-1}\big[g\big(X^N_{t_k}, Y^N_{t_k}\big)\Delta W_k\big].
\]
Comparing this with (3.3) concludes the proof of (3.10) for $l = k-1$.
Lemma 3.5 shows that for $r\in[t_k, t_{k+1}]$ one can upper estimate the $L^2$ norm of $Z_r - Z^N_{t_k}$ by that of $Z_r - Z^\pi_r$ and increments of $Z$. Indeed, using (3.10) we have for $k = 0, \dots, N-1$ and $r\in[t_k, t_{k+1}]$
\[
E\big|Z_r - Z^N_{t_k}\big|^2 = E\big|Z_r - Z^{\pi,1}_{t_k}\big|^2 \le 2E|Z_r - Z_{t_k}|^2 + 2E\big|Z_{t_k} - Z^{\pi,1}_{t_k}\big|^2.
\]
Furthermore, (3.9) and the Cauchy-Schwarz inequality yield for $k = 0, \dots, N-1$
\[
E\big|Z_{t_k} - Z^{\pi,1}_{t_k}\big|^2 \le \frac 1h E\int_{t_k}^{t_{k+1}} |Z_{t_k} - Z^\pi_r|^2\,dr \le \frac 2h E\int_{t_k}^{t_{k+1}} |Z_{t_k} - Z_r|^2\,dr + \frac 2h E\int_{t_k}^{t_{k+1}} |Z_r - Z^\pi_r|^2\,dr.
\]
Hence we deduce
\[
\sum_{k=0}^{N-1}\int_{t_k}^{t_{k+1}} E\big|Z_r - Z^N_{t_k}\big|^2\,dr \le 6\sum_{k=0}^{N-1}\int_{t_k}^{t_{k+1}} E|Z_r - Z_{t_k}|^2\,dr + 4\sum_{k=0}^{N-1}\int_{t_k}^{t_{k+1}} E|Z_r - Z^\pi_r|^2\,dr. \tag{3.12}
\]
Using Lemma 3.2 and (3.12), we see that Theorem 3.4 is a straightforward consequence of the following.
Theorem 3.6. There exists a constant $C$ such that for $h$ small enough,
\[
\max_{0\le k\le N} E\big|Y_{t_k} - Y^\pi_{t_k}\big|^2 + \int_0^T E|Z_r - Z^\pi_r|^2\,dr \le Ch + C\, E\big|\Phi\big(X^N_{t_N}\big) - \Phi(X_T)\big|^2.
\]
Proof. For any $k = 1, \dots, N$ set
\[
I_{k-1} := E\big|Y_{t_{k-1}} - Y^\pi_{t_{k-1}}\big|^2 + E\int_{t_{k-1}}^{t_k} |Z_r - Z^\pi_r|^2\,dr. \tag{3.13}
\]
Since $Y_{t_{k-1}} - Y^\pi_{t_{k-1}}$ is $\mathcal F_{t_{k-1}}$-measurable while for $r\in[t_{k-1}, t_k]$ the random variable $Z_r - Z^\pi_r$ is $\mathcal F^W_r \vee \mathcal F^B_{t_{k-1},T}$-measurable, we deduce that $Y_{t_{k-1}} - Y^\pi_{t_{k-1}}$ is orthogonal to $\int_{t_{k-1}}^{t_k} (Z_r - Z^\pi_r)\,dW_r$. Therefore, the identities (2.2) and (3.11) imply that
\[
I_{k-1} = E\Big|Y_{t_{k-1}} - Y^\pi_{t_{k-1}} + \int_{t_{k-1}}^{t_k} (Z_r - Z^\pi_r)\,dW_r\Big|^2 = E\Big|Y_{t_k} - Y^\pi_{t_k} + \int_{t_{k-1}}^{t_k} \big[f(X_r, Y_r, Z_r) - f\big(X^N_{t_{k-1}}, Y^\pi_{t_{k-1}}, Z^{\pi,1}_{t_{k-1}}\big)\big]\,dr + \int_{t_{k-1}}^{t_k} \big[g(X_r, Y_r) - g\big(X^N_{t_k}, Y^\pi_{t_k}\big)\big]\,d\overleftarrow{B}_r\Big|^2.
\]
Notice that for $t_{k-1}\le r\le t_k$ the random variable $g(X_r, Y_r) - g\big(X^N_{t_k}, Y^\pi_{t_k}\big)$ is $\mathcal F^W_{t_k} \vee \mathcal F^B_{r,T}$-measurable. Hence $Y_{t_k} - Y^\pi_{t_k}$, which is $\mathcal F_{t_k}$-measurable, and $\int_{t_{k-1}}^{t_k} \big[g(X_r, Y_r) - g\big(X^N_{t_k}, Y^\pi_{t_k}\big)\big]\,d\overleftarrow{B}_r$ are orthogonal. The inequality $(a+b+c)^2 \le \big(1+\frac 1\lambda\big)(a^2 + c^2) + (1+2\lambda) b^2 + 2ac$, valid for $\lambda > 0$, the Cauchy-Schwarz inequality and the isometry of backward stochastic integrals yield for $\lambda := \frac\epsilon h$, $\epsilon > 0$:
\[
I_{k-1} \le \Big(1+\frac h\epsilon\Big)\Big[E\big|Y_{t_k} - Y^\pi_{t_k}\big|^2 + E\Big|\int_{t_{k-1}}^{t_k} \big[g(X_r, Y_r) - g\big(X^N_{t_k}, Y^\pi_{t_k}\big)\big]\,d\overleftarrow{B}_r\Big|^2\Big] + \Big(1+\frac{2\epsilon}h\Big) E\Big|\int_{t_{k-1}}^{t_k} \big[f(X_r, Y_r, Z_r) - f\big(X^N_{t_{k-1}}, Y^\pi_{t_{k-1}}, Z^{\pi,1}_{t_{k-1}}\big)\big]\,dr\Big|^2
\]
\[
\le \Big(1+\frac h\epsilon\Big)\Big[E\big|Y_{t_k} - Y^\pi_{t_k}\big|^2 + E\int_{t_{k-1}}^{t_k} \big|g(X_r, Y_r) - g\big(X^N_{t_k}, Y^\pi_{t_k}\big)\big|^2\,dr\Big] + (h+2\epsilon)\, E\int_{t_{k-1}}^{t_k} \big|f(X_r, Y_r, Z_r) - f\big(X^N_{t_{k-1}}, Y^\pi_{t_{k-1}}, Z^{\pi,1}_{t_{k-1}}\big)\big|^2\,dr.
\]
The Lipschitz properties (2.3) and (2.4) of $f$ and $g$ imply
\[
I_{k-1} \le \Big(1+\frac h\epsilon\Big)\Big[E\big|Y_{t_k} - Y^\pi_{t_k}\big|^2 + L_g\, E\int_{t_{k-1}}^{t_k} \big(\big|X_r - X^N_{t_k}\big|^2 + \big|Y_r - Y^\pi_{t_k}\big|^2\big)\,dr\Big] + (h+2\epsilon)\, L_f\, E\int_{t_{k-1}}^{t_k} \big(\big|X_r - X^N_{t_{k-1}}\big|^2 + \big|Y_r - Y^\pi_{t_{k-1}}\big|^2 + \big|Z_r - Z^{\pi,1}_{t_{k-1}}\big|^2\big)\,dr. \tag{3.14}
\]
Using the definition of $Z^{\pi,1}_{t_k}$ in (3.9), the $L^2$ contraction property of $E_k$ and the Cauchy-Schwarz inequality, we have
\[
h E\big|Z_{t_k} - Z^{\pi,1}_{t_k}\big|^2 \le \frac 1h E\Big|E_k\int_{t_k}^{t_{k+1}} (Z_{t_k} - Z^\pi_r)\,dr\Big|^2 \le E\int_{t_k}^{t_{k+1}} |Z_{t_k} - Z^\pi_r|^2\,dr.
\]
Thus, by Young's inequality, we deduce for $k = 1, \dots, N$
\[
E\int_{t_{k-1}}^{t_k} \big|Z_r - Z^{\pi,1}_{t_{k-1}}\big|^2\,dr \le 2E\int_{t_{k-1}}^{t_k} \big|Z_r - Z_{t_{k-1}}\big|^2\,dr + 2h E\big|Z_{t_{k-1}} - Z^{\pi,1}_{t_{k-1}}\big|^2 \le 2E\int_{t_{k-1}}^{t_k} \big|Z_r - Z_{t_{k-1}}\big|^2\,dr + 4E\int_{t_{k-1}}^{t_k} |Z^\pi_r - Z_r|^2\,dr + 4E\int_{t_{k-1}}^{t_k} \big|Z_r - Z_{t_{k-1}}\big|^2\,dr \le 6E\int_{t_{k-1}}^{t_k} \big|Z_r - Z_{t_{k-1}}\big|^2\,dr + 4E\int_{t_{k-1}}^{t_k} |Z^\pi_r - Z_r|^2\,dr.
\]
We now deal with increments of $Y$. Using Lemma 3.2, we have
\[
E\int_{t_{k-1}}^{t_k} \big|Y_r - Y^\pi_{t_{k-1}}\big|^2\,dr \le 2E\int_{t_{k-1}}^{t_k} \big|Y_r - Y_{t_{k-1}}\big|^2\,dr + 2E\int_{t_{k-1}}^{t_k} \big|Y_{t_{k-1}} - Y^\pi_{t_{k-1}}\big|^2\,dr \le Ch^2 + 2h E\big|Y_{t_{k-1}} - Y^\pi_{t_{k-1}}\big|^2,
\]
while a similar argument yields
\[
E\int_{t_{k-1}}^{t_k} \big|Y_r - Y^\pi_{t_k}\big|^2\,dr \le Ch^2 + 2h E\big|Y_{t_k} - Y^\pi_{t_k}\big|^2.
\]
Using Theorem 3.1 and the previous upper estimates in (3.14), we deduce
\[
I_{k-1} \le \Big(1+\frac h\epsilon\Big) E\big|Y_{t_k} - Y^\pi_{t_k}\big|^2 + L_f(h+2\epsilon)\Big[Ch^2 + 2h E\big|Y_{t_{k-1}} - Y^\pi_{t_{k-1}}\big|^2 + 6E\int_{t_{k-1}}^{t_k} \big|Z_r - Z_{t_{k-1}}\big|^2\,dr + 4E\int_{t_{k-1}}^{t_k} |Z^\pi_r - Z_r|^2\,dr\Big] + L_g\Big(1+\frac h\epsilon\Big)\Big[Ch^2 + 2h E\big|Y_{t_k} - Y^\pi_{t_k}\big|^2\Big].
\]
Thus, (3.13) implies that for any $\epsilon > 0$
\[
\big[1 - 2L_f(h+2\epsilon)h\big]\, E\big|Y_{t_{k-1}} - Y^\pi_{t_{k-1}}\big|^2 + \big[1 - 4L_f(h+2\epsilon)\big]\, E\int_{t_{k-1}}^{t_k} |Z_r - Z^\pi_r|^2\,dr \le \Big[1+\frac h\epsilon + 2L_g\Big(1+\frac h\epsilon\Big)h\Big] E\big|Y_{t_k} - Y^\pi_{t_k}\big|^2 + \Big[L_f(h+2\epsilon) + L_g\Big(1+\frac h\epsilon\Big)\Big] Ch^2 + 6L_f(h+2\epsilon)\, E\int_{t_{k-1}}^{t_k} \big|Z_r - Z_{t_{k-1}}\big|^2\,dr.
\]
Now we choose $\epsilon$ such that $8\epsilon L_f = \frac 12$. Then we have for $\widetilde C = 4L_f$, $h$ small enough and some positive constant $C$ depending on $L_f$ and $L_g$:
\[
\big(1 - \widetilde C h\big)\, E\big|Y_{t_{k-1}} - Y^\pi_{t_{k-1}}\big|^2 + \Big(\frac 12 - \widetilde C h\Big)\, E\int_{t_{k-1}}^{t_k} |Z_r - Z^\pi_r|^2\,dr \le (1+Ch)\, E\big|Y_{t_k} - Y^\pi_{t_k}\big|^2 + Ch^2 + C\, E\int_{t_{k-1}}^{t_k} \big|Z_r - Z_{t_{k-1}}\big|^2\,dr. \tag{3.15}
\]
We need the following lemma.
Lemma 3.7. Let $L > 0$; then for $h^*$ small enough (more precisely $Lh^* < 1$) there exists $\Gamma := \frac L{1-Lh^*} > 0$ such that for all $h\in(0, h^*)$ we have
\[
\frac 1{1-Lh} < 1 + \Gamma h.
\]
Proof. Let $h\in(0, h^*)$; then we have $1 - Lh > 1 - Lh^* > 0$. Hence $\frac L{1-Lh} < \frac L{1-Lh^*} = \Gamma$, so that $Lh < \Gamma h(1-Lh)$, which yields $1 + \Gamma h - Lh - \Gamma L h^2 = (1+\Gamma h)(1-Lh) > 1$. This concludes the proof.
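Lemma 3.7 is elementary and can be checked numerically; the constants `L` and `h_star` below are arbitrary test values, not taken from the paper:

```python
def gamma_const(L, h_star):
    # Lemma 3.7: for L h* < 1, Gamma := L / (1 - L h*) satisfies
    # 1 / (1 - L h) < 1 + Gamma h  for every h in (0, h*)
    assert L * h_star < 1.0
    return L / (1.0 - L * h_star)

L, h_star = 2.0, 0.1
G = gamma_const(L, h_star)      # = 2 / (1 - 0.2) = 2.5
for h in [0.001, 0.05, 0.099]:  # sample points of (0, h*)
    assert 1.0 / (1.0 - L * h) < 1.0 + G * h
```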
Lemma 3.7 and (3.15) imply the existence of a constant $C > 0$ such that for $h$ small enough and $k = 1, 2, \dots, N$ we have
\[
E\big|Y_{t_{k-1}} - Y^\pi_{t_{k-1}}\big|^2 \le (1+Ch)\, E\big|Y_{t_k} - Y^\pi_{t_k}\big|^2 + Ch^2 + C\, E\int_{t_{k-1}}^{t_k} \big|Z_r - Z_{t_{k-1}}\big|^2\,dr. \tag{3.16}
\]
The final step relies on the following discrete version of Gronwall's lemma (see [7]).
Lemma 3.8 (Gronwall's Lemma). Let $(a_k)$, $(b_k)$, $(c_k)$ be nonnegative sequences such that for some $K > 0$ we have for all $k = 1, \dots, N-1$, $a_{k-1} + c_{k-1} \le (1+Kh) a_k + b_{k-1}$. Then, for all $k = 0, \dots, N-1$,
\[
a_k + \sum_{i=k}^{N-1} c_i \le e^{K(T-t_k)}\Big(a_N + \sum_{i=k}^{N-1} b_i\Big).
\]
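The discrete Gronwall bound can be sanity-checked on a sequence built to satisfy the hypothesis with equality; all numerical data below are arbitrary illustrations:

```python
import math

def check_gronwall(a, b, c, K, h, T):
    # Conclusion of Lemma 3.8:
    # a_k + sum_{i=k}^{N-1} c_i <= exp(K (T - t_k)) (a_N + sum_{i=k}^{N-1} b_i)
    N = len(a) - 1
    for k in range(N + 1):
        lhs = a[k] + sum(c[k:N])
        rhs = math.exp(K * (T - k * h)) * (a[N] + sum(b[k:N]))
        assert lhs <= rhs + 1e-12, (k, lhs, rhs)
    return True

# build a sequence satisfying the hypothesis with equality:
# a_{k-1} + c_{k-1} = (1 + K h) a_k + b_{k-1}
K, N, T = 1.0, 10, 1.0
h = T / N
b = [0.01] * N
c = [0.005] * N
a = [0.0] * (N + 1)
a[N] = 1.0
for k in range(N, 0, -1):
    a[k - 1] = (1 + K * h) * a[k] + b[k - 1] - c[k - 1]
check_gronwall(a, b, c, K, h, T)
```

The factor $e^{K(T-t_k)}$ dominates the product of the $N-k$ factors $(1+Kh)$, which is what the check exercises.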
We use Lemma 3.8 with $c_k = 0$, $a_{k-1} = E\big|Y_{t_{k-1}} - Y^\pi_{t_{k-1}}\big|^2$ and $b_k = C\, E\int_{t_{k-1}}^{t_k} \big|Z_r - Z_{t_{k-1}}\big|^2\,dr + Ch^2$; this yields
\[
\sup_{0\le k\le N} E\big|Y_{t_k} - Y^\pi_{t_k}\big|^2 \le C\Big(E\big|Y_T - Y^\pi_{t_N}\big|^2 + \sum_{k=1}^N E\int_{t_{k-1}}^{t_k} \big|Z_r - Z_{t_{k-1}}\big|^2\,dr + Ch\Big) \le C\, E\big|Y_T - Y^\pi_{t_N}\big|^2 + Ch, \tag{3.17}
\]
where the last upper estimate is deduced from Lemma 3.2. We sum (3.15) from $k = 1$ to $k = N$; using (3.17) we deduce that for some constant $\bar C$ depending on $L_f$ and $L_g$ we have
\[
\Big(\frac 12 - \widetilde C h\Big)\, E\int_0^T |Z_r - Z^\pi_r|^2\,dr \le Ch\sum_{k=1}^{N-1} E\big|Y_{t_k} - Y^\pi_{t_k}\big|^2 + Ch + C\, E\big|Y_T - Y^\pi_{t_N}\big|^2 \le Ch + C\, E\big|Y_T - Y^\pi_{t_N}\big|^2,
\]
where the middle term is controlled by (3.17). The definitions of $Y_T$ and $Y^N_{t_N}$ from (2.2) and (3.2) conclude the proof of Theorem 3.6.
4. Approximation results: step 2
In order to approximate $\big(Y^N_{t_k}, Z^N_{t_k}\big)_{k=0,\dots,N}$, we use the idea of E. Gobet, J.P. Lemor and X. Warin [7], that is, a projection on the function basis and a Picard iteration scheme. In this section, $N$ and $I$ are fixed positive integers. We define the sequences $\big(Y^{N,i,I}_{t_k}\big)_{i=0,\dots,I;\ k=0,\dots,N}$ and $\big(Z^{N,I}_{t_k}\big)_{k=0,\dots,N-1}$ using backward induction on $k$ and, for fixed $k$, forward induction on $i$ for $Y^{N,i,I}_{t_k}$, as follows. For $k = N$, $Z^{N,I}_{t_N} = 0$ and for $i = 0, \dots, I$, set $Y^{N,i,I}_{t_N} := P_N\Phi\big(X^N_{t_N}\big)$. Assume that $Y^{N,I,I}_{t_{k+1}}$ has been defined and set
\[
Z^{N,I}_{t_k} := \frac 1h P_k\big[Y^{N,I,I}_{t_{k+1}}\Delta W_{k+1}\big] + \frac 1h P_k\big[\overleftarrow{\Delta B}_k\, g\big(X^N_{t_{k+1}}, Y^{N,I,I}_{t_{k+1}}\big)\Delta W_{k+1}\big]. \tag{4.1}
\]
Let $Y^{N,0,I}_{t_k} := 0$ and for $i = 1, \dots, I$ define $Y^{N,i,I}_{t_k}$ inductively by the following Picard iteration scheme:
\[
Y^{N,i,I}_{t_k} := P_k\big[Y^{N,I,I}_{t_{k+1}}\big] + h P_k\big[f\big(X^N_{t_k}, Y^{N,i-1,I}_{t_k}, Z^{N,I}_{t_k}\big)\big] + P_k\big[\overleftarrow{\Delta B}_k\, g\big(X^N_{t_{k+1}}, Y^{N,I,I}_{t_{k+1}}\big)\big], \tag{4.2}
\]
where $P_k$ is the orthogonal projection on the Hilbert space $\mathcal P_k \subset L^2(\mathcal F_{t_k})$ generated by the function $p_k$ defined in (N4). Set $R_k := I - P_k$. Note that $P_k$ is a contraction of $L^2(\mathcal F_{t_k})$. Furthermore, given $Y\in L^2(\Omega)$,
\[
E_k P_k Y = P_k E_k Y = P_k Y. \tag{4.3}
\]
Indeed, since $\mathcal P_k \subset L^2(\mathcal F_{t_k})$, $E_k P_k Y = P_k Y$. Let $Y\in L^2$; for every $U_k\in\mathcal P_k$, since $U_k$ is $\mathcal F_{t_k}$-measurable, we have $E(U_k R_k Y) = 0 = E(U_k E_k R_k Y)$, so that $P_k E_k R_k(Y) = 0$. Furthermore, $Y = P_k Y + R_k Y$ implies $P_k E_k Y = P_k P_k Y + P_k E_k R_k Y = P_k Y$, which yields (4.3).
Theorem 4.1. For $h$ small enough, we have
\[
\max_{0\le k\le N} E\big|Y^{N,I,I}_{t_k} - Y^N_{t_k}\big|^2 + h\sum_{k=0}^{N-1} E\big|Z^{N,I}_{t_k} - Z^N_{t_k}\big|^2 \le Ch^{2I-2} + C\sum_{k=0}^{N-1} E\big|R_k Y^N_{t_k}\big|^2 + C\, E\big|\Phi\big(X^N_{t_N}\big) - P_N\Phi\big(X^N_{t_N}\big)\big|^2 + Ch\sum_{k=0}^{N-1} E\big|R_k Z^N_{t_k}\big|^2.
\]
Proof of Theorem 4.1. The proof will be deduced from several lemmas. The first result gives integrability properties of the scheme defined by (4.1) and (4.2).
Lemma 4.2. For every $k = 0, \dots, N$ and $i = 0, \dots, I$ we have $Y^{N,i,I}_{t_k}, Z^{N,I}_{t_k} \in L^2(\mathcal F_{t_k})$.
Proof. We prove this by backward induction on $k$ and, for fixed $k$, by forward induction on $i$. By definition $Y^{N,i,I}_{t_N} = P_N\Phi\big(X^N_{t_N}\big)$ and $Z^{N,I}_{t_N} = 0$. Suppose that $Z^{N,I}_{t_j}$ and $Y^{N,l,I}_{t_j}$ belong to $L^2(\mathcal F_{t_j})$ for $j = N, N-1, \dots, k+1$ and any $l$, and for $j = k$ and $l = 0, \dots, i-1$; we will show that $Y^{N,i,I}_{t_k}, Z^{N,I}_{t_k} \in L^2(\mathcal F_{t_k})$.
The measurability is obvious since $\mathcal P_k \subset L^2(\mathcal F_{t_k})$. We first prove the square integrability of $Z^{N,I}_{t_k}$. Using (4.3), the conditional Cauchy-Schwarz inequality and the independence of $\Delta W_{k+1}$ and $\mathcal F_{t_k}$, we deduce
\[
E\big|P_k\big[Y^{N,I,I}_{t_{k+1}}\Delta W_{k+1}\big]\big|^2 = E\big|P_k E_k\big[Y^{N,I,I}_{t_{k+1}}\Delta W_{k+1}\big]\big|^2 \le E\big|E_k\big[Y^{N,I,I}_{t_{k+1}}\Delta W_{k+1}\big]\big|^2 \le E\Big(E_k|\Delta W_{k+1}|^2\; E_k\big|Y^{N,I,I}_{t_{k+1}}\big|^2\Big) \le h E\big|Y^{N,I,I}_{t_{k+1}}\big|^2.
\]
A similar computation, using the independence of $\Delta W_{k+1}$ and $\mathcal F_{t_k}$, and of $\overleftarrow{\Delta B}_k$ and $\mathcal F_{t_{k+1}}$, as well as the growth condition deduced from (2.4), yields
\[
E\big|P_k\big[\overleftarrow{\Delta B}_k\,\Delta W_{k+1}\, g\big(X^N_{t_{k+1}}, Y^{N,I,I}_{t_{k+1}}\big)\big]\big|^2 = E\big|P_k E_k\big[\overleftarrow{\Delta B}_k\,\Delta W_{k+1}\, g\big(X^N_{t_{k+1}}, Y^{N,I,I}_{t_{k+1}}\big)\big]\big|^2 \le E\big|E_k\big[\overleftarrow{\Delta B}_k\,\Delta W_{k+1}\, g\big(X^N_{t_{k+1}}, Y^{N,I,I}_{t_{k+1}}\big)\big]\big|^2 \le h E\big|E_{k+1}\big[\overleftarrow{\Delta B}_k\, g\big(X^N_{t_{k+1}}, Y^{N,I,I}_{t_{k+1}}\big)\big]\big|^2 \le h^2 E\big|g\big(X^N_{t_{k+1}}, Y^{N,I,I}_{t_{k+1}}\big)\big|^2 \le 2h^2 |g(0,0)|^2 + 2h^2 L_g\Big(E\big|X^N_{t_{k+1}}\big|^2 + E\big|Y^{N,I,I}_{t_{k+1}}\big|^2\Big).
\]
The two previous upper estimates and the induction hypothesis prove that $Z^{N,I}_{t_k} \in L^2(\mathcal F_{t_k})$. A similar, easier proof shows that $Y^{N,i,I}_{t_k} \in L^2(\mathcal F_{t_k})$.
The following lemma gives $L^2$ bounds for multiplication by $\Delta W_{k+1}$.
Lemma 4.3. For every $Y\in L^2$ we have
\[
E|E_k(Y\Delta W_{k+1})|^2 \le h\big(E|Y|^2 - E|E_k Y|^2\big).
\]
Proof. Using the fact that $E_k(\Delta W_{k+1} E_k Y) = 0$, we have
\[
E|E_k(Y\Delta W_{k+1})|^2 = E|E_k((Y - E_k Y)\Delta W_{k+1})|^2.
\]
Using the conditional Cauchy-Schwarz inequality and the independence of $\Delta W_{k+1}$ and $\mathcal F_{t_k}$, we deduce $E|E_k(Y\Delta W_{k+1})|^2 \le h E|Y - E_k Y|^2 \le h\big(E|Y|^2 - E|E_k Y|^2\big)$; this concludes the proof.
The following result gives orthogonality properties of several projections.
Lemma 4.4. Let $k = 0, \dots, N-1$ and $M_{t_{k+1}}, N_{t_{k+1}} \in L^2(\mathcal F_{t_{k+1}})$. Then
\[
E\Big(P_k M_{t_{k+1}}\; P_k\big[\overleftarrow{\Delta B}_k N_{t_{k+1}}\big]\Big) = 0.
\]
Proof. Let $M_{t_{k+1}} \in L^2(\mathcal F_{t_{k+1}})$; the definition of $P_k$ yields
\[
P_k M_{t_{k+1}} = \sum_{1\le i_N\le L} \alpha(i_N)\, u_{i_N}\big(X^N_{t_k}\big) + \sum_{1\le i_N\le L} \alpha(N-1, i_N)\, u_{i_N}\big(X^N_{t_k}\big)\frac{\overleftarrow{\Delta B}_{N-1}}{\sqrt h} + \sum_{k\le l\le N-1}\ \sum_{1\le i_N,\dots,i_{l+1}\le L} \alpha(l, i_N, \dots, i_{l+1})\, u_{i_N}\big(X^N_{t_k}\big)\prod_{r=l+1}^{N-1} v_{i_r}\big(\overleftarrow{\Delta B}_r\big)\frac{\overleftarrow{\Delta B}_l}{\sqrt h}, \tag{4.4}
\]
where $\alpha(i_N) = E\big[M_{t_{k+1}} u_{i_N}\big(X^N_{t_k}\big)\big]$, $\alpha(N-1, i_N) = E\big[M_{t_{k+1}} u_{i_N}\big(X^N_{t_k}\big)\frac{\overleftarrow{\Delta B}_{N-1}}{\sqrt h}\big]$ and $\alpha(l, i_N, \dots, i_{l+1}) = E\big[M_{t_{k+1}} u_{i_N}\big(X^N_{t_k}\big)\prod_{r=l+1}^{N-1} v_{i_r}\big(\overleftarrow{\Delta B}_r\big)\frac{\overleftarrow{\Delta B}_l}{\sqrt h}\big]$. Taking the conditional expectation with respect to $\mathcal F_{t_{k+1}}$, we deduce that for any $i_N, \dots, i_{k+1}\in\{1, \dots, L\}$
\[
\alpha(k, i_N, \dots, i_{k+1}) = E\Big[M_{t_{k+1}} u_{i_N}\big(X^N_{t_k}\big)\prod_{r=k+1}^{N-1} v_{i_r}\big(\overleftarrow{\Delta B}_r\big)\,\frac{E_{k+1}\overleftarrow{\Delta B}_k}{\sqrt h}\Big] = 0.
\]
A similar decomposition of $P_k\big[\overleftarrow{\Delta B}_k N_{t_{k+1}}\big]$ yields
\[
P_k\big[\overleftarrow{\Delta B}_k N_{t_{k+1}}\big] = \sum_{1\le i_N\le L} \beta(i_N)\, u_{i_N}\big(X^N_{t_k}\big) + \sum_{1\le i_N\le L} \beta(N-1, i_N)\, u_{i_N}\big(X^N_{t_k}\big)\frac{\overleftarrow{\Delta B}_{N-1}}{\sqrt h} + \sum_{k\le l\le N-1}\ \sum_{1\le i_N,\dots,i_{l+1}\le L} \beta(l, i_N, \dots, i_{l+1})\, u_{i_N}\big(X^N_{t_k}\big)\prod_{r=l+1}^{N-1} v_{i_r}\big(\overleftarrow{\Delta B}_r\big)\frac{\overleftarrow{\Delta B}_l}{\sqrt h}, \tag{4.5}
\]
where $\beta(i_N) = E\big[\overleftarrow{\Delta B}_k N_{t_{k+1}} u_{i_N}\big(X^N_{t_k}\big)\big]$, $\beta(N-1, i_N) = E\big[\overleftarrow{\Delta B}_k N_{t_{k+1}} u_{i_N}\big(X^N_{t_k}\big)\frac{\overleftarrow{\Delta B}_{N-1}}{\sqrt h}\big]$ and $\beta(l, i_N, \dots, i_{l+1}) = E\big[\overleftarrow{\Delta B}_k N_{t_{k+1}} u_{i_N}\big(X^N_{t_k}\big)\prod_{r=l+1}^{N-1} v_{i_r}\big(\overleftarrow{\Delta B}_r\big)\frac{\overleftarrow{\Delta B}_l}{\sqrt h}\big]$. In the above sum, all terms except those corresponding to $l = k$ are equal to $0$. Indeed, let $l\in\{k+1, \dots, N-1\}$; then, using again the conditional expectation with respect to $\mathcal F_{t_{k+1}}$, we obtain
\[
\beta(l, i_N, \dots, i_{l+1}) = E\Big[N_{t_{k+1}} u_{i_N}\big(X^N_{t_k}\big)\prod_{r=l+1}^{N-1} v_{i_r}\big(\overleftarrow{\Delta B}_r\big)\frac{\overleftarrow{\Delta B}_l}{\sqrt h}\; E_{k+1}\overleftarrow{\Delta B}_k\Big] = 0.
\]
The first two terms in the decomposition of $P_k\big[\overleftarrow{\Delta B}_k N_{t_{k+1}}\big]$ are dealt with by a similar argument. Notice that for any $l\in\{k+1, \dots, N-1\}$ and any $i_N, \dots, i_{l+1}, j_N, \dots, j_{k+1}\in\{1, \dots, L\}$ we have, conditioning with respect to $\mathcal F_{t_{k+1}}$:
\[
E\Big[u_{i_N}\big(X^N_{t_k}\big)\prod_{r=l+1}^{N-1} v_{i_r}\big(\overleftarrow{\Delta B}_r\big)\frac{\overleftarrow{\Delta B}_l}{\sqrt h}\; u_{j_N}\big(X^N_{t_k}\big)\prod_{r=k+1}^{N-1} v_{j_r}\big(\overleftarrow{\Delta B}_r\big)\frac{\overleftarrow{\Delta B}_k}{\sqrt h}\Big] = 0.
\]
A similar computation proves that for any $i_N, j_N, \dots, j_{k+1}\in\{1, \dots, L\}$ and $\xi\in\big\{1, \frac{\overleftarrow{\Delta B}_{N-1}}{\sqrt h}\big\}$,
\[
E\Big[u_{i_N}\big(X^N_{t_k}\big)\,\xi\; u_{j_N}\big(X^N_{t_k}\big)\prod_{r=k+1}^{N-1} v_{j_r}\big(\overleftarrow{\Delta B}_r\big)\frac{\overleftarrow{\Delta B}_k}{\sqrt h}\Big] = 0.
\]
The decompositions (4.4) and (4.5) conclude the proof.
The next lemma provides upper bounds for the $L^2$-norms of $Z^{N,I}_{t_k}$ and $Z^{N,I}_{t_k} - Z^N_{t_k}$.
Lemma 4.5. For $h$ small enough and for $k = 0, \dots, N-1$, we have the following $L^2$ bounds:
\[
E\big|Z^{N,I}_{t_k}\big|^2 \le \frac 1h\Big(E\big|Y^{N,I,I}_{t_{k+1}}\big|^2 - E\big|E_k Y^{N,I,I}_{t_{k+1}}\big|^2\Big) + \frac 1h\Big(E\big|\overleftarrow{\Delta B}_k\, g\big(X^N_{t_{k+1}}, Y^{N,I,I}_{t_{k+1}}\big)\big|^2 - E\big|E_k\big[\overleftarrow{\Delta B}_k\, g\big(X^N_{t_{k+1}}, Y^{N,I,I}_{t_{k+1}}\big)\big]\big|^2\Big), \tag{4.6}
\]
\[
E\big|Z^{N,I}_{t_k} - Z^N_{t_k}\big|^2 \le E\big|R_k Z^N_{t_k}\big|^2 + \frac 1h\Big(E\big|Y^{N,I,I}_{t_{k+1}} - Y^N_{t_{k+1}}\big|^2 - E\big|E_k\big[Y^{N,I,I}_{t_{k+1}} - Y^N_{t_{k+1}}\big]\big|^2\Big) + \frac 1h\Big(E\big|\overleftarrow{\Delta B}_k\big[g\big(X^N_{t_{k+1}}, Y^{N,I,I}_{t_{k+1}}\big) - g\big(X^N_{t_{k+1}}, Y^N_{t_{k+1}}\big)\big]\big|^2 - E\big|E_k\overleftarrow{\Delta B}_k\big[g\big(X^N_{t_{k+1}}, Y^{N,I,I}_{t_{k+1}}\big) - g\big(X^N_{t_{k+1}}, Y^N_{t_{k+1}}\big)\big]\big|^2\Big). \tag{4.7}
\]
Proof. Lemma 4.4 implies that the two terms in the right hand side of (4.1) are orthogonal. Hence, squaring both sides of equation (4.1) and using (4.3) and Lemma 4.3, we deduce
\[
E\big|Z^{N,I}_{t_k}\big|^2 = \frac 1{h^2} E\big|P_k\big[Y^{N,I,I}_{t_{k+1}}\Delta W_{k+1}\big]\big|^2 + \frac 1{h^2} E\big|P_k\big[\overleftarrow{\Delta B}_k\, g\big(X^N_{t_{k+1}}, Y^{N,I,I}_{t_{k+1}}\big)\Delta W_{k+1}\big]\big|^2 = \frac 1{h^2} E\big|P_k E_k\big[Y^{N,I,I}_{t_{k+1}}\Delta W_{k+1}\big]\big|^2 + \frac 1{h^2} E\big|P_k E_k\big[\overleftarrow{\Delta B}_k\, g\big(X^N_{t_{k+1}}, Y^{N,I,I}_{t_{k+1}}\big)\Delta W_{k+1}\big]\big|^2 \le \frac 1{h^2} E\big|E_k\big[Y^{N,I,I}_{t_{k+1}}\Delta W_{k+1}\big]\big|^2 + \frac 1{h^2} E\big|E_k\big[\overleftarrow{\Delta B}_k\, g\big(X^N_{t_{k+1}}, Y^{N,I,I}_{t_{k+1}}\big)\Delta W_{k+1}\big]\big|^2,
\]
and Lemma 4.3 yields (4.6). Using the orthogonal decomposition $Z^N_{t_k} = P_k Z^N_{t_k} + R_k Z^N_{t_k}$, since $Z^{N,I}_{t_k}\in\mathcal P_k$ we have $E\big|Z^{N,I}_{t_k} - Z^N_{t_k}\big|^2 = E\big|Z^{N,I}_{t_k} - P_k Z^N_{t_k}\big|^2 + E\big|R_k Z^N_{t_k}\big|^2$. Furthermore, (3.3), (4.1) and (4.3) yield
\[
Z^{N,I}_{t_k} - P_k Z^N_{t_k} = \frac 1h P_k\big[\big(Y^{N,I,I}_{t_{k+1}} - Y^N_{t_{k+1}}\big)\Delta W_{k+1}\big] + \frac 1h P_k\big[\overleftarrow{\Delta B}_k\big(g\big(X^N_{t_{k+1}}, Y^{N,I,I}_{t_{k+1}}\big) - g\big(X^N_{t_{k+1}}, Y^N_{t_{k+1}}\big)\big)\Delta W_{k+1}\big].
\]
Lemma 4.4 shows that the above decomposition is orthogonal; thus, using (4.3), the contraction property of $P_k$ and Lemma 4.3, we deduce
\[
E\big|Z^{N,I}_{t_k} - P_k Z^N_{t_k}\big|^2 \le \frac 1{h^2} E\big|E_k\big[\big(Y^{N,I,I}_{t_{k+1}} - Y^N_{t_{k+1}}\big)\Delta W_{k+1}\big]\big|^2 + \frac 1{h^2} E\big|E_k\big[\overleftarrow{\Delta B}_k\Delta W_{k+1}\big(g\big(X^N_{t_{k+1}}, Y^{N,I,I}_{t_{k+1}}\big) - g\big(X^N_{t_{k+1}}, Y^N_{t_{k+1}}\big)\big)\big]\big|^2,
\]
and Lemma 4.3 again yields (4.7). This concludes the proof.
For $Y\in L^2(\mathcal F_{t_k})$, let $\chi^{N,I}_k(Y)$ be defined by
\[
\chi^{N,I}_k(Y) := P_k\Big[Y^{N,I,I}_{t_{k+1}} + h f\big(X^N_{t_k}, Y, Z^{N,I}_{t_k}\big) + \overleftarrow{\Delta B}_k\, g\big(X^N_{t_{k+1}}, Y^{N,I,I}_{t_{k+1}}\big)\Big].
\]
The growth conditions of $f$ and $g$ deduced from (2.3) and (2.4) and the independence of $\overleftarrow{\Delta B}_k$ and $\mathcal F_{t_{k+1}}$ imply that $\chi^{N,I}_k\big(L^2(\mathcal F_{t_k})\big) \subset \mathcal P_k \subset L^2(\mathcal F_{t_k})$. Furthermore, (2.3) implies that for $Y_1, Y_2\in L^2(\mathcal F_{t_k})$
\[
E\big|\chi^{N,I}_k(Y_2) - \chi^{N,I}_k(Y_1)\big|^2 \le L_f h^2 E|Y_2 - Y_1|^2, \tag{4.8}
\]
and (4.2) shows that $Y^{N,i,I}_{t_k} = \chi^{N,I}_k\big(Y^{N,i-1,I}_{t_k}\big)$ for $i = 1, \dots, I$.
Lemma 4.6. For $h$ small enough (i.e., $h^2 L_f < 1$) and for $k = 0, \dots, N-1$, there exists a unique $Y^{N,\infty,I}_{t_k}\in L^2(\mathcal F_{t_k})$ such that
\[
Y^{N,\infty,I}_{t_k} = P_k\Big[Y^{N,I,I}_{t_{k+1}} + h f\big(X^N_{t_k}, Y^{N,\infty,I}_{t_k}, Z^{N,I}_{t_k}\big) + \overleftarrow{\Delta B}_k\, g\big(X^N_{t_{k+1}}, Y^{N,I,I}_{t_{k+1}}\big)\Big], \tag{4.9}
\]
\[
E\big|Y^{N,\infty,I}_{t_k} - Y^{N,i,I}_{t_k}\big|^2 \le L_f^i h^{2i}\, E\big|Y^{N,\infty,I}_{t_k}\big|^2, \tag{4.10}
\]
and there exists some constant $K > 0$ such that for every $N$, $k$, $I$,
\[
E\big|Y^{N,\infty,I}_{t_k}\big|^2 \le Kh + (1+Kh)\, E\big|Y^{N,I,I}_{t_{k+1}}\big|^2. \tag{4.11}
\]
Proof. The fixed point theorem applied to the map $\chi^{N,I}_k$, which is a contraction for $h^2 L_f < 1$, proves (4.9); (4.10) is a straightforward consequence of (4.2) by induction on $i$. Lemma 4.4 shows that $P_k Y^{N,I,I}_{t_{k+1}}$ and $P_k\big[\overleftarrow{\Delta B}_k\, g\big(X^N_{t_{k+1}}, Y^{N,I,I}_{t_{k+1}}\big)\big]$ are orthogonal. Hence for any $\epsilon > 0$, using Young's inequality, (4.3), the $L^2$ contraction property of $P_k$ and the growth condition of $g$ deduced from (2.4), we obtain
\[
E\big|Y^{N,\infty,I}_{t_k}\big|^2 \le \Big(1+\frac h\epsilon\Big) E\big|P_k Y^{N,I,I}_{t_{k+1}}\big|^2 + (h^2+2\epsilon h)\, E\big|P_k\big[f\big(X^N_{t_k}, Y^{N,\infty,I}_{t_k}, Z^{N,I}_{t_k}\big)\big]\big|^2 + \Big(1+\frac h\epsilon\Big) E\big|P_k\big[\overleftarrow{\Delta B}_k\, g\big(X^N_{t_{k+1}}, Y^{N,I,I}_{t_{k+1}}\big)\big]\big|^2 \le \Big(1+\frac h\epsilon\Big) E\big|E_k Y^{N,I,I}_{t_{k+1}}\big|^2 + 2(h^2+2\epsilon h)|f(0,0,0)|^2 + 2L_f(h^2+2\epsilon h)\Big(E\big|X^N_{t_k}\big|^2 + E\big|Y^{N,\infty,I}_{t_k}\big|^2 + E\big|Z^{N,I}_{t_k}\big|^2\Big) + \Big(1+\frac h\epsilon\Big) E\big|E_k\big[\overleftarrow{\Delta B}_k\, g\big(X^N_{t_{k+1}}, Y^{N,I,I}_{t_{k+1}}\big)\big]\big|^2.
\]
Using the upper estimate (4.6) in Lemma 4.5, we obtain
\[
\big[1 - 2L_f(h^2+2\epsilon h)\big]\, E\big|Y^{N,\infty,I}_{t_k}\big|^2 \le \Big[1+\frac h\epsilon - 2L_f(h+2\epsilon)\Big] E\big|E_k Y^{N,I,I}_{t_{k+1}}\big|^2 + 2(h^2+2\epsilon h)\big(|f(0,0,0)|^2 + L_f E\big|X^N_{t_k}\big|^2\big) + 2L_f(h+2\epsilon)\, E\big|Y^{N,I,I}_{t_{k+1}}\big|^2 + 2L_f(h+2\epsilon)\, E\big|\overleftarrow{\Delta B}_k\, g\big(X^N_{t_{k+1}}, Y^{N,I,I}_{t_{k+1}}\big)\big|^2 + \Big[1+\frac h\epsilon - 2L_f(h+2\epsilon)\Big] E\big|E_k\big[\overleftarrow{\Delta B}_k\, g\big(X^N_{t_{k+1}}, Y^{N,I,I}_{t_{k+1}}\big)\big]\big|^2.
\]
Choose $\epsilon$ such that $4L_f\epsilon = 1$. Then $1+\frac h\epsilon - 2L_f(h+2\epsilon) = 2L_f h$ and $2L_f(h+2\epsilon) = 2L_f h + 1$. Using Theorem 3.1 we deduce the existence of $C > 0$ such that
\[
\big[1 - 2L_f(h^2+2\epsilon h)\big]\, E\big|Y^{N,\infty,I}_{t_k}\big|^2 \le Ch + (1+4L_f h)\Big(E\big|Y^{N,I,I}_{t_{k+1}}\big|^2 + E\big|\overleftarrow{\Delta B}_k\, g\big(X^N_{t_{k+1}}, Y^{N,I,I}_{t_{k+1}}\big)\big|^2\Big).
\]
Then for $h^*\in(0,1]$ small enough (i.e. $(2L_f+1)h^* < 1$), using Lemma 3.7, we deduce that for $\Gamma := \frac{2L_f+1}{1-(2L_f+1)h^*}$ and $h\in(0, h^*)$ we have $\big(1 - (2L_f+1)h\big)^{-1} \le 1+\Gamma h$. Thus, using the independence of $\overleftarrow{\Delta B}_k$ and $\mathcal F_{t_{k+1}}$, the growth condition (2.4) and Theorem 3.1, we deduce the existence of a constant $C > 0$ such that for $h\in(0, h^*)$,
\[
E\big|Y^{N,\infty,I}_{t_k}\big|^2 \le Ch + (1+Ch)\, E\big|Y^{N,I,I}_{t_{k+1}}\big|^2 + Ch\, E\big|g\big(X^N_{t_{k+1}}, Y^{N,I,I}_{t_{k+1}}\big)\big|^2 \le Ch + (1+Ch)\, E\big|Y^{N,I,I}_{t_{k+1}}\big|^2.
\]
This concludes the proof of (4.11).
Let $\eta^{N,I}_k := E\big|Y^{N,I,I}_{t_k} - Y^N_{t_k}\big|^2$ for $k = 0, \dots, N$; the following lemma gives an upper bound of the $L^2$-norm of $Y^{N,\infty,I}_{t_k} - P_k Y^N_{t_k}$ in terms of $\eta^{N,I}_{k+1}$.
Lemma 4.7. For $h$ small enough and for $k = 0, \dots, N-1$ we have:
\[
E\big|Y^{N,\infty,I}_{t_k} - P_k Y^N_{t_k}\big|^2 \le (1+Kh)\,\eta^{N,I}_{k+1} + Kh\Big(E\big|R_k Y^N_{t_k}\big|^2 + E\big|R_k Z^N_{t_k}\big|^2\Big).
\]
Proof. The argument, which is similar to that in the proofs of Lemmas 4.5 and 4.6, is only briefly sketched. Applying the operator $P_k$ to both sides of equation (3.4) and using (4.3), we obtain
$$P_kY^N_{t_k} = P_kY^N_{t_{k+1}} + hP_kf\big(X^N_{t_k},Y^N_{t_k},Z^N_{t_k}\big) + P_k\big[\overleftarrow{\Delta B}_k\,g\big(X^N_{t_{k+1}},Y^N_{t_{k+1}}\big)\big].$$
Hence Lemma 4.6 implies that
$$Y^{N,\infty,I}_{t_k} - P_kY^N_{t_k} = P_k\big[Y^{N,I,I}_{t_{k+1}} - Y^N_{t_{k+1}}\big] + hP_k\big[f\big(X^N_{t_k},Y^{N,\infty,I}_{t_k},Z^{N,I}_{t_k}\big) - f\big(X^N_{t_k},Y^N_{t_k},Z^N_{t_k}\big)\big] + P_k\big(\overleftarrow{\Delta B}_k\big[g\big(X^N_{t_{k+1}},Y^{N,I,I}_{t_{k+1}}\big) - g\big(X^N_{t_{k+1}},Y^N_{t_{k+1}}\big)\big]\big).$$
Lemma 4.4 proves the orthogonality of the first and third terms of the above decomposition. Squaring this equation, using Young's inequality and (4.3), the $L^2$-contraction property of $P_k$ and the Lipschitz property of $g$ given in (2.4), computations similar to those made in the proof of Lemma 4.6 yield
$$E\big|Y^{N,\infty,I}_{t_k} - P_kY^N_{t_k}\big|^2 \le \Big(1+\tfrac{h}{\epsilon}\Big)E\big|P_k\big[Y^{N,I,I}_{t_{k+1}} - Y^N_{t_{k+1}}\big]\big|^2 + h^2\Big(1+\tfrac{2\epsilon}{h}\Big)E\big|P_k\big[f\big(X^N_{t_k},Y^{N,\infty,I}_{t_k},Z^{N,I}_{t_k}\big) - f\big(X^N_{t_k},Y^N_{t_k},Z^N_{t_k}\big)\big]\big|^2 + \Big(1+\tfrac{h}{\epsilon}\Big)E\big|P_k\overleftarrow{\Delta B}_k\big[g\big(X^N_{t_{k+1}},Y^{N,I,I}_{t_{k+1}}\big) - g\big(X^N_{t_{k+1}},Y^N_{t_{k+1}}\big)\big]\big|^2$$
$$\le \Big(1+\tfrac{h}{\epsilon}\Big)E\big|E_k\big[Y^{N,I,I}_{t_{k+1}} - Y^N_{t_{k+1}}\big]\big|^2 + L_f(h+2\epsilon)h\Big[E\big|Y^{N,\infty,I}_{t_k} - Y^N_{t_k}\big|^2 + E\big|Z^{N,I}_{t_k} - Z^N_{t_k}\big|^2\Big] + \Big(1+\tfrac{h}{\epsilon}\Big)E\big|E_k\big(\overleftarrow{\Delta B}_k\big[g\big(X^N_{t_{k+1}},Y^{N,I,I}_{t_{k+1}}\big) - g\big(X^N_{t_{k+1}},Y^N_{t_{k+1}}\big)\big]\big)\big|^2. \quad (4.12)$$
By construction $Y^{N,\infty,I}_{t_k} \in \mathcal{P}_k$. Hence
$$E\big|Y^{N,\infty,I}_{t_k} - Y^N_{t_k}\big|^2 = E\big|Y^{N,\infty,I}_{t_k} - P_kY^N_{t_k}\big|^2 + E\big|R_kY^N_{t_k}\big|^2. \quad (4.13)$$
Using Lemma 4.5 we deduce that for any $\epsilon > 0$
$$\big[1-L_f(h^2+2\epsilon h)\big]E\big|Y^{N,\infty,I}_{t_k} - P_kY^N_{t_k}\big|^2 \le L_f(h+2\epsilon)\,\eta^{N,I}_{k+1} + hL_f(h+2\epsilon)\Big[E\big|R_kY^N_{t_k}\big|^2 + E\big|R_kZ^N_{t_k}\big|^2\Big] + \Big[1+\tfrac{h}{\epsilon}-L_f(h+2\epsilon)\Big]E\big|E_k\big(Y^{N,I,I}_{t_{k+1}} - Y^N_{t_{k+1}}\big)\big|^2 + L_f(h+2\epsilon)E\big|\overleftarrow{\Delta B}_k\big[g\big(X^N_{t_{k+1}},Y^{N,I,I}_{t_{k+1}}\big) - g\big(X^N_{t_{k+1}},Y^N_{t_{k+1}}\big)\big]\big|^2 + \Big[1+\tfrac{h}{\epsilon}-L_f(h+2\epsilon)\Big]E\big|E_k\big(\overleftarrow{\Delta B}_k\big[g\big(X^N_{t_{k+1}},Y^{N,I,I}_{t_{k+1}}\big) - g\big(X^N_{t_{k+1}},Y^N_{t_{k+1}}\big)\big]\big)\big|^2.$$
Let $\epsilon > 0$ satisfy $2L_f\epsilon = 1$; then $1+\tfrac{h}{\epsilon}-L_f(h+2\epsilon) = L_fh$ and $L_f(h+2\epsilon) = L_fh+1$.
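For the reader's convenience, the elementary algebra behind this choice of $\epsilon$ can be checked directly:

```latex
% with 2 L_f \epsilon = 1, i.e. \epsilon = 1/(2 L_f):
\begin{align*}
  L_f(h + 2\epsilon) &= L_f h + 2 L_f \epsilon = L_f h + 1,\\
  1 + \frac{h}{\epsilon} - L_f(h + 2\epsilon)
    &= 1 + 2 L_f h - (L_f h + 1) = L_f h.
\end{align*}
```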
Thus, since $E_k$ contracts the $L^2$-norm, we deduce
$$\big[1-L_f(h^2+2\epsilon h)\big]E\big|Y^{N,\infty,I}_{t_k} - P_kY^N_{t_k}\big|^2 \le (1+2L_fh)\,\eta^{N,I}_{k+1} + h(1+L_fh)\Big[E\big|R_kY^N_{t_k}\big|^2 + E\big|R_kZ^N_{t_k}\big|^2\Big] + (1+2L_fh)E\big|\overleftarrow{\Delta B}_k\big[g\big(X^N_{t_{k+1}},Y^{N,I,I}_{t_{k+1}}\big) - g\big(X^N_{t_{k+1}},Y^N_{t_{k+1}}\big)\big]\big|^2.$$
Let $h^* \in \big(0,\tfrac{1}{L_f+1}\big)$ and set $\Gamma = \frac{L_f+1}{1-(L_f+1)h^*}$. Lemma 3.7 shows that for $h\in(0,h^*)$ we have $\big[1-L_f(h^2+2\epsilon h)\big]^{-1} \le 1+\Gamma h$. The previous inequality, the independence of $\overleftarrow{\Delta B}_k$ and $\mathcal{F}_{t_{k+1}}$ and the Lipschitz property (2.4) imply that for some constant $K$, which can change from one line to the next,
$$E\big|Y^{N,\infty,I}_{t_k} - P_kY^N_{t_k}\big|^2 \le (1+Kh)\,\eta^{N,I}_{k+1} + Kh\Big[E\big|R_kY^N_{t_k}\big|^2 + E\big|R_kZ^N_{t_k}\big|^2\Big] + Kh\,E\big|g\big(X^N_{t_{k+1}},Y^{N,I,I}_{t_{k+1}}\big) - g\big(X^N_{t_{k+1}},Y^N_{t_{k+1}}\big)\big|^2$$
$$\le (1+Kh)\,\eta^{N,I}_{k+1} + Kh\Big[E\big|R_kY^N_{t_k}\big|^2 + E\big|R_kZ^N_{t_k}\big|^2\Big].$$
This concludes the proof of Lemma 4.7. □
The following lemma provides $L^2$-bounds of $Y^{N,I,I}_{t_k}$, $Y^{N,\infty,I}_{t_k}$ and $Z^{N,I}_{t_k}$ independent of $N$ and $I$.
Lemma 4.8. There exists a constant $K$ such that for large $N$ and for every $I \ge 1$,
$$\max_{0\le k\le N}E\big|Y^{N,I,I}_{t_k}\big|^2 + \max_{0\le k\le N-1}E\big|Y^{N,\infty,I}_{t_k}\big|^2 + \max_{0\le k\le N}hE\big|Z^{N,I}_{t_k}\big|^2 \le K.$$
Proof. Using inequality (4.10) and Young's inequality, we have the following bound for $i = 1,\dots,I$, $h < 1$ and some constant $K$ depending on $L_f$:
$$E\big|Y^{N,i,I}_{t_k}\big|^2 \le \Big(1+\tfrac1h\Big)E\big|Y^{N,\infty,I}_{t_k} - Y^{N,i,I}_{t_k}\big|^2 + (1+h)E\big|Y^{N,\infty,I}_{t_k}\big|^2 \le \Big(1+\tfrac1h\Big)L_f^ih^{2i}E\big|Y^{N,\infty,I}_{t_k}\big|^2 + (1+h)E\big|Y^{N,\infty,I}_{t_k}\big|^2 \le (1+Kh)E\big|Y^{N,\infty,I}_{t_k}\big|^2. \quad (4.14)$$
Choosing $i = I$ and using (4.11), we deduce that for some constant $K$ which can change from line to line, $E\big|Y^{N,I,I}_{t_k}\big|^2 \le Kh + (1+Kh)E\big|Y^{N,I,I}_{t_{k+1}}\big|^2$. Hence Lemma 3.8 yields $\max_k E\big|Y^{N,I,I}_{t_k}\big|^2 \le K$. Plugging this relation into inequality (4.11) proves that
$$\max_k E\big|Y^{N,I,I}_{t_k}\big|^2 + \max_k E\big|Y^{N,\infty,I}_{t_k}\big|^2 \le K < \infty.$$
Using (4.6) and the independence of $\overleftarrow{\Delta B}_k$ and $\mathcal{F}_{t_{k+1}}$, we deduce
$$hE\big|Z^{N,I}_{t_k}\big|^2 \le E\big|Y^{N,I,I}_{t_{k+1}}\big|^2 + E\big|\overleftarrow{\Delta B}_k\,g\big(X^N_{t_{k+1}},Y^{N,I,I}_{t_{k+1}}\big)\big|^2 \le E\big|Y^{N,I,I}_{t_{k+1}}\big|^2 + hE\big|g\big(X^N_{t_{k+1}},Y^{N,I,I}_{t_{k+1}}\big)\big|^2.$$
Finally, the Lipschitz property (2.4) yields
$$hE\big|Z^{N,I}_{t_k}\big|^2 \le E\big|Y^{N,I,I}_{t_{k+1}}\big|^2 + 2h|g(0,0)|^2 + 2hL_g\Big[E\big|X^N_{t_{k+1}}\big|^2 + E\big|Y^{N,I,I}_{t_{k+1}}\big|^2\Big] \le (1+2hL_g)E\big|Y^{N,I,I}_{t_{k+1}}\big|^2 + 2h|g(0,0)|^2 + 2hL_gE\big|X^N_{t_{k+1}}\big|^2.$$
Theorem 3.1 and the $L^2$-upper estimates of $Y^{N,I,I}_{t_{k+1}}$ conclude the proof. □
The following lemma provides a backward recursive upper estimate of $\eta^{N,I}$. Recall that $\eta^{N,I}_k = E\big|Y^{N,I,I}_{t_k} - Y^N_{t_k}\big|^2$.
Lemma 4.9. For $0 \le k < N$ we have:
$$\eta^{N,I}_k \le (1+Kh)\,\eta^{N,I}_{k+1} + Ch^{2I-1} + KE\big|R_kY^N_{t_k}\big|^2 + KhE\big|R_kZ^N_{t_k}\big|^2.$$
Proof. For $k = N$, $Y^N_{t_N} = \Phi\big(X^N_{t_N}\big)$ and $Y^{N,I,I}_{t_N} = P_N\Phi\big(X^N_{t_N}\big)$, so that $\eta^{N,I}_N = E\big|\Phi\big(X^N_{t_N}\big) - P_N\Phi\big(X^N_{t_N}\big)\big|^2$. Let $k \in \{0,\dots,N-1\}$; using inequality (4.10) and Young's inequality, we obtain
$$\eta^{N,I}_k = E\big|Y^{N,I,I}_{t_k} - Y^N_{t_k}\big|^2 \le \Big(1+\tfrac1h\Big)E\big|Y^{N,I,I}_{t_k} - Y^{N,\infty,I}_{t_k}\big|^2 + (1+h)E\big|Y^{N,\infty,I}_{t_k} - Y^N_{t_k}\big|^2 \le \Big(1+\tfrac1h\Big)L_f^Ih^{2I}E\big|Y^{N,\infty,I}_{t_k}\big|^2 + (1+h)E\big|Y^{N,\infty,I}_{t_k} - P_kY^N_{t_k}\big|^2 + (1+h)E\big|R_kY^N_{t_k}\big|^2.$$
Finally, Lemmas 4.8 and 4.7 imply that for some constant $K$ we have, for every $N$ and every $k = 0,\dots,N-1$:
$$\eta^{N,I}_k \le Kh^{2I-1} + (1+h)E\big|Y^{N,\infty,I}_{t_k} - P_kY^N_{t_k}\big|^2 + (1+h)E\big|R_kY^N_{t_k}\big|^2 \quad (4.15)$$
$$\le (1+Kh)\,\eta^{N,I}_{k+1} + Kh^{2I-1} + KE\big|R_kY^N_{t_k}\big|^2 + KhE\big|R_kZ^N_{t_k}\big|^2;$$
this concludes the proof. □
Gronwall's Lemma 3.8 and Lemma 4.9 prove the existence of $C$ such that for $h$ small enough
$$\max_{0\le k\le N}E\big|Y^{N,I,I}_{t_k} - Y^N_{t_k}\big|^2 \le Ch^{2I-2} + C\sum_{k=0}^{N-1}E\big|R_kY^N_{t_k}\big|^2 + Ch\sum_{k=0}^{N-1}E\big|R_kZ^N_{t_k}\big|^2 + CE\big|\Phi\big(X^N_{t_N}\big) - P_N\Phi\big(X^N_{t_N}\big)\big|^2, \quad (4.16)$$
which is part of Theorem 4.1. Let $\zeta^N := h\sum_{k=0}^{N-1}E\big|Z^{N,I}_{t_k} - Z^N_{t_k}\big|^2$. In order to conclude the proof of Theorem 4.1, we need an upper estimate of $\zeta^N$, which is given in the next lemma.
Lemma 4.10. There exists a constant $C$ such that for $h$ small enough and every $I \ge 1$
$$\zeta^N \le Ch^{2I-2} + Ch\sum_{k=0}^{N-1}E\big|R_kZ^N_{t_k}\big|^2 + C\sum_{k=0}^{N-1}E\big|R_kY^N_{t_k}\big|^2 + C\max_{0\le k\le N-1}\eta^{N,I}_k.$$
Proof. Multiplying inequality (4.7) by $h$ and using the independence of $\overleftarrow{\Delta B}_k$ and $\mathcal{F}_{t_{k+1}}$ and the Lipschitz property (2.4) yields
$$\zeta^N \le h\sum_{k=0}^{N-1}E\big|R_kZ^N_{t_k}\big|^2 + \sum_{k=0}^{N-1}(1+L_gh)\Big[E\big|Y^{N,I,I}_{t_{k+1}} - Y^N_{t_{k+1}}\big|^2 - E\big|E_k\big(Y^{N,I,I}_{t_{k+1}} - Y^N_{t_{k+1}}\big)\big|^2\Big]. \quad (4.17)$$
Multiplying inequality (4.12) by $(1+L_gh)(1+h)$ and using the independence of $\overleftarrow{\Delta B}_k$ and $\mathcal{F}_{t_{k+1}}$ and the Lipschitz property (2.4) yields, for $\epsilon > 0$:
$$(1+L_gh)(1+h)E\big|Y^{N,\infty,I}_{t_k} - P_kY^N_{t_k}\big|^2 \le \Big(1+\tfrac{h}{\epsilon}\Big)(1+L_gh)(1+h)E\big|E_k\big(Y^{N,I,I}_{t_{k+1}} - Y^N_{t_{k+1}}\big)\big|^2 + L_f(h+2\epsilon)h(1+L_gh)(1+h)\Big[E\big|Y^{N,\infty,I}_{t_k} - Y^N_{t_k}\big|^2 + E\big|Z^{N,I}_{t_k} - Z^N_{t_k}\big|^2\Big] + \Big(1+\tfrac{h}{\epsilon}\Big)(1+L_gh)(1+h)L_gh\,E\big|Y^{N,I,I}_{t_{k+1}} - Y^N_{t_{k+1}}\big|^2. \quad (4.18)$$
Multiplying inequality (4.15) by $(1+L_gh)$ and using (4.18) yields, for some constants $K$, $C$ and $h\in(0,1]$, $\epsilon > 0$:
$$\Delta_{k+1} := (1+L_gh)\Big[E\big|Y^{N,I,I}_{t_{k+1}} - Y^N_{t_{k+1}}\big|^2 - E\big|E_k\big(Y^{N,I,I}_{t_{k+1}} - Y^N_{t_{k+1}}\big)\big|^2\Big] \le Kh^{2I-1} + KE\big|R_kY^N_{t_k}\big|^2 + \Big[\Big(1+\tfrac{h}{\epsilon}\Big)(1+L_gh)(1+h) - 1\Big]E\big|E_k\big(Y^{N,I,I}_{t_{k+1}} - Y^N_{t_{k+1}}\big)\big|^2 + C(h+2\epsilon)h\Big[E\big|Y^{N,\infty,I}_{t_k} - Y^N_{t_k}\big|^2 + E\big|Z^{N,I}_{t_k} - Z^N_{t_k}\big|^2\Big] + \Big(1+\tfrac{h}{\epsilon}\Big)Ch\,E\big|Y^{N,I,I}_{t_{k+1}} - Y^N_{t_{k+1}}\big|^2.$$
Now we choose $\epsilon$ such that $2C\epsilon = \tfrac14$; then for some constant $K$ and $h\in(0,1]$:
$$\Delta_{k+1} \le Kh^{2I-1} + KE\big|R_kY^N_{t_k}\big|^2 + KhE\big|Y^{N,I,I}_{t_{k+1}} - Y^N_{t_{k+1}}\big|^2 + \Big(Ch+\tfrac14\Big)h\Big[E\big|Y^{N,\infty,I}_{t_k} - Y^N_{t_k}\big|^2 + E\big|Z^{N,I}_{t_k} - Z^N_{t_k}\big|^2\Big].$$
Thus, for $h$ small enough (so that $Ch \le \tfrac14$), summing over $k$ we obtain
$$\sum_{k=0}^{N-1}(1+L_gh)\Big[E\big|Y^{N,I,I}_{t_{k+1}} - Y^N_{t_{k+1}}\big|^2 - E\big|E_k\big(Y^{N,I,I}_{t_{k+1}} - Y^N_{t_{k+1}}\big)\big|^2\Big] \le Kh^{2I-2} + K\sum_{k=0}^{N-1}E\big|R_kY^N_{t_k}\big|^2 + K\max_k\eta^{N,I}_k + \tfrac12h\sum_{k=0}^{N-1}\Big[E\big|Y^{N,\infty,I}_{t_k} - Y^N_{t_k}\big|^2 + E\big|Z^{N,I}_{t_k} - Z^N_{t_k}\big|^2\Big].$$
Plugging this inequality into (4.17) yields
$$\tfrac12\zeta^N \le Kh^{2I-2} + h\sum_{k=0}^{N-1}E\big|R_kZ^N_{t_k}\big|^2 + K\sum_{k=0}^{N-1}E\big|R_kY^N_{t_k}\big|^2 + K\max_k\eta^{N,I}_k + \tfrac12h\sum_{k=0}^{N-1}E\big|Y^{N,\infty,I}_{t_k} - Y^N_{t_k}\big|^2.$$
Using (4.13) and Lemma 4.7, we obtain for some constant $K$ and every $h\in(0,1]$
$$h\sum_{k=0}^{N-1}E\big|Y^{N,\infty,I}_{t_k} - Y^N_{t_k}\big|^2 \le (1+Kh)h\sum_{k=0}^{N-1}\eta^{N,I}_{k+1} + Kh^2\sum_{k=0}^{N-1}\Big[E\big|R_kY^N_{t_k}\big|^2 + E\big|R_kZ^N_{t_k}\big|^2\Big] + h\sum_{k=0}^{N-1}E\big|R_kY^N_{t_k}\big|^2 \le K\max_k\eta^{N,I}_k + K\sum_{k=0}^{N-1}E\big|R_kY^N_{t_k}\big|^2 + Kh\sum_{k=0}^{N-1}E\big|R_kZ^N_{t_k}\big|^2.$$
This concludes the proof of Lemma 4.10. Theorem 4.1 is a straightforward consequence of inequality (4.16) and Lemma 4.10.
5. Approximation step 3
In this section we use regression approximations and introduce a minimization problem based on an $M$-sample of $(B,W)$, denoted by $(B^m,W^m)$, $m = 1,\dots,M$. This provides a Monte Carlo approximation of $Y^{N,I,I}$ and $Z^{N,I}$ on the time grid.
5.1. Some more notations for the projection. We first introduce some notations.
(N5) For fixed $k = 1,\dots,N$ and $m = 1,\dots,M$, let $p^m_k$ denote the orthonormal family of $L^2(\Omega)$ similar to $p_k$ in (N4), replacing $X^N$ by $X^{N,m}$ and $B$ by $B^m$.
(N6) For a real $n\times n$ symmetric matrix $A$, $\|A\|$ is the maximum of the absolute values of its eigenvalues and $\|A\|_F = \big(\sum_{i,j}A_{i,j}^2\big)^{1/2}$ is its Frobenius norm. If $A : \mathbb{R}^n \to \mathbb{R}^n$ also denotes the linear operator whose matrix in the canonical basis is $A$, then $\|A\|$ is the operator norm of $A$ when $\mathbb{R}^n$ is endowed with the Euclidean norm. Note that $\|A\| \le \|A\|_F$ follows from Schwarz's inequality.
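As a quick sanity check (not part of the paper's argument), the bound $\|A\| \le \|A\|_F$ can be confirmed numerically on a random symmetric matrix:

```python
import numpy as np

rng = np.random.default_rng(0)
A = rng.standard_normal((5, 5))
A = (A + A.T) / 2                                  # symmetrize A

op_norm = np.max(np.abs(np.linalg.eigvalsh(A)))    # ||A||: largest |eigenvalue|
fro_norm = np.linalg.norm(A, "fro")                # ||A||_F = (sum_ij A_ij^2)^(1/2)

assert op_norm <= fro_norm                         # operator norm <= Frobenius norm
```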
(N7) For $k = 0,\dots,N-1$ and $m = 1,\dots,M$, let $v^m_k$ and $v_k$ be the column vectors whose entries are the components in the canonical basis of the vectors
$$\Big(p^m_k,\ p^m_k\tfrac{\Delta W^m_{k+1}}{\sqrt h}\Big) \quad\text{and}\quad \Big(p_k,\ p_k\tfrac{\Delta W_{k+1}}{\sqrt h}\Big) \quad (5.1)$$
respectively. Note that $E[v_kv_k^*] = \mathrm{Id}$, since the entries of $p_k$ are an orthonormal family of $L^2(\mathcal{F}_k)$ and $\tfrac{\Delta W_{k+1}}{\sqrt h}$ is a normed vector in $L^2$ independent of $p_k$.
(N8) For $k = 0,\dots,N-1$, let $V^M_k$, $P^M_k$ be the symmetric matrices defined by
$$V^M_k := \frac1M\sum_{m=1}^M v^m_k\big(v^m_k\big)^*, \qquad P^M_k := \frac1M\sum_{m=1}^M p^m_k\big(p^m_k\big)^*. \quad (5.2)$$
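The matrices $V^M_k$ and $P^M_k$ are empirical versions of $E[v_kv_k^*] = \mathrm{Id}$ and $E[p_kp_k^*] = \mathrm{Id}$, so by the law of large numbers they concentrate around the identity; the events $A_j$ in (N10) below quantify exactly this. A toy illustration with a hypothetical two-element basis $p_k = (1, X)$ for a standard Gaussian $X$ (not the paper's basis):

```python
import numpy as np

rng = np.random.default_rng(1)
M = 200_000                      # sample size; larger M tightens the events A_j

X = rng.standard_normal(M)       # stand-in for the basis variable (hypothetical basis (1, X))
G = rng.standard_normal(M)       # stand-in for dW_{k+1}/sqrt(h), independent of X

# v^m_k = (p^m_k, p^m_k * dW^m_{k+1}/sqrt(h)) with p_k = (1, X)
V = np.stack([np.ones(M), X, G, X * G])          # 4 x M matrix of samples of v_k
VM = (V @ V.T) / M                               # V^M_k = (1/M) sum v^m (v^m)^*

dev = np.max(np.abs(np.linalg.eigvalsh(VM - np.eye(4))))   # ||V^M_k - Id||
assert dev < 0.1                 # close to Id, as required on the event A_j
```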
(N9) We denote by $\mathcal{N}$ the $\sigma$-algebra of measurable sets $A$ with $P(A) = 0$ and set:
$$\mathcal{F}^{W,m}_t = \sigma\big(W^m_s;\ 0\le s\le t\big)\vee\mathcal{N}, \qquad \mathcal{F}^{B,m}_{t,t'} = \sigma\big(B^m_s - B^m_{t'};\ t\le s\le t'\big)\vee\mathcal{N},$$
$$\mathcal{F}^{W,M}_t = \mathcal{F}^W_t\vee\bigvee_{m=1}^M\mathcal{F}^{W,m}_t, \qquad \mathcal{F}^{B,M}_{t,T} = \mathcal{F}^B_{t,T}\vee\bigvee_{m=1}^M\mathcal{F}^{B,m}_{t,T}, \qquad \mathcal{F}_t = \mathcal{F}^W_t\vee\mathcal{F}^B_{t,T}.$$
Note that $(\mathcal{F}_t)_t$ and $\big(\mathcal{F}^B_{t,T}\big)_t$ are not filtrations.
(N10) In the sequel we will need to localize some processes using the following events:
$$A_j := \big\{\|V^M_j - \mathrm{Id}\| \le h,\ \|P^M_j - \mathrm{Id}\| \le h\big\} \in \mathcal{F}^{W,M}_{t_{j+1}}\vee\mathcal{F}^{B,M}_{t_j,T}, \quad (5.3)$$
$$A^M_k := \bigcap_{j=k}^{N-1}A_j \in \mathcal{F}^{W,M}_{t_N}\vee\mathcal{F}^{B,M}_{t_k,T}. \quad (5.4)$$
(N11) For $x = (x^1,\dots,x^M)\in\mathbb{R}^M$, we denote $|x|^2_M := \frac1M\sum_{m=1}^M|x^m|^2$.
5.2. Another look at the previous results. We introduce the following random variables:
$$\zeta^N_k := \rho^N_k := |p_k|\sqrt{C_0}\vee 1,$$
where $C_0$ is the constant in Lemma 4.8. Since $Y^{N,i,I}_{t_k}$ and $Z^{N,I}_{t_k}$ belong to $\mathcal{P}_k$ (see (4.1) and (4.2)), we can rewrite these random variables as follows:
$$Y^{N,i,I}_{t_k} = \alpha^{i,I}_k.p_k = \big(\alpha^{i,I}_k\big)^*p_k, \qquad Z^{N,I}_{t_k} = \beta^I_k.p_k = \big(\beta^I_k\big)^*p_k, \quad (5.5)$$
where $\alpha^{i,I}_k$ (resp. $\beta^I_k$) is the vector of the coefficients in the basis $p_k$ of the random variable $Y^{N,i,I}_{t_k}$ (resp. $Z^{N,I}_{t_k}$), identified with the column matrix of its coefficients in the canonical basis.
Remark 5.1. Note that the vectors $\alpha^{i,I}_k$ and $\beta^I_k$ are deterministic.
The following proposition gives a priori estimates of $Y^{N,i,I}_{t_k}$ and $Z^{N,I}_{t_k}$.
Proposition 5.2. For $i\in\{1,\dots,I\}\cup\{\infty\}$ and for $k = 0,\dots,N$, we have $\big|Y^{N,i,I}_{t_k}\big| \le \rho^N_k$ and $\sqrt h\big|Z^{N,I}_{t_k}\big| \le \zeta^N_k$. Moreover, for every $I$ and $i = 0,\dots,I$:
$$\big|\alpha^{i,I}_k\big|^2 \le E\big|\rho^N_k\big|^2, \qquad \big|\beta^I_k\big|^2 \le \frac1hE\big|\zeta^N_k\big|^2. \quad (5.6)$$
Proof. Let $i\in\{1,\dots,I\}\cup\{\infty\}$ and $k = 0,\dots,N$. Squaring $Y^{N,i,I}_{t_k}$, taking expectations and using the previous remark, we obtain
$$E\big|Y^{N,i,I}_{t_k}\big|^2 = \big(\alpha^{i,I}_k\big)^*E\big(p_kp_k^*\big)\alpha^{i,I}_k \ge \big(\alpha^{i,I}_k\big)^*\alpha^{i,I}_k = \big|\alpha^{i,I}_k\big|^2.$$
Using Lemma 4.8, we deduce that $\big|\alpha^{i,I}_k\big|^2 \le C_0$. The Cauchy--Schwarz inequality implies
$$\big|Y^{N,i,I}_{t_k}\big| \le \big|\alpha^{i,I}_k\big|\,|p_k| \le |p_k|\sqrt{C_0} \le |p_k|\sqrt{C_0}\vee 1.$$
A similar computation based on Lemma 4.8 proves that $\sqrt h\big|Z^{N,I}_{t_k}\big| \le \zeta^N_k$. The upper estimates of $\big|\alpha^{i,I}_k\big|^2$ and $\big|\beta^I_k\big|^2$ are straightforward consequences of the previous ones. □
We now prove that $\big(\alpha^{i,I}_k,\beta^I_k\big)$ solves a minimization problem.
Proposition 5.3. The vector $\big(\alpha^{i,I}_k,\beta^I_k\big)$ solves the following minimization problem: for $k = 0,\dots,N-1$ and for every $i = 1,\dots,I$, we have:
$$\big(\alpha^{i,I}_k,\beta^I_k\big) = \arg\min_{(\alpha,\beta)}E\Big|Y^{N,I,I}_{t_{k+1}} - \alpha.p_k + hf\big(X^N_{t_k},\alpha^{i-1,I}_k.p_k,Z^{N,I}_{t_k}\big) + \overleftarrow{\Delta B}_k\,g\big(X^N_{t_{k+1}},Y^{N,I,I}_{t_{k+1}}\big) - \beta.p_k\,\Delta W_{k+1}\Big|^2. \quad (5.7)$$
Proof. Let $(Y,Z)\in\mathcal{P}_k\times\mathcal{P}_k$; then, since $\mathcal{P}_k\subset L^2(\mathcal{F}_{t_k})$ and $\Delta W_{k+1}$ is independent of $\mathcal{F}_{t_k}$, we have
$$E\Big|Y^{N,I,I}_{t_{k+1}} - Y + hf\big(X^N_{t_k},Y^{N,i-1,I}_{t_k},Z^{N,I}_{t_k}\big) + \overleftarrow{\Delta B}_k\,g\big(X^N_{t_{k+1}},Y^{N,I,I}_{t_{k+1}}\big) - Z\,\Delta W_{k+1}\Big|^2$$
$$= E\Big|Y^{N,I,I}_{t_{k+1}} - Y + hf\big(X^N_{t_k},Y^{N,i-1,I}_{t_k},Z^{N,I}_{t_k}\big) + \overleftarrow{\Delta B}_k\,g\big(X^N_{t_{k+1}},Y^{N,I,I}_{t_{k+1}}\big)\Big|^2 + hE\Big|Z - \tfrac1h\big(Y^{N,I,I}_{t_{k+1}} + \overleftarrow{\Delta B}_k\,g\big(X^N_{t_{k+1}},Y^{N,I,I}_{t_{k+1}}\big)\big)\Delta W_{k+1}\Big|^2 - \tfrac1hE\Big|\big(Y^{N,I,I}_{t_{k+1}} + \overleftarrow{\Delta B}_k\,g\big(X^N_{t_{k+1}},Y^{N,I,I}_{t_{k+1}}\big)\big)\Delta W_{k+1}\Big|^2.$$
The minimum over pairs of elements of $\mathcal{P}_k$ is given by the orthogonal projections, that is, by the random variables $Y = Y^{N,i,I}_{t_k}$ and $Z = Z^{N,I}_{t_k}$ defined by (4.2) and (4.1) respectively. This concludes the proof, using the notations introduced in (5.5). □
For $i\in\{1,\dots,I\}\cup\{\infty\}$, we define $\theta^{i,I}_k := \big(\alpha^{i,I}_k,\sqrt h\,\beta^I_k\big)$. The following lemma gives some properties of $\theta^{i,I}_k$.
Lemma 5.4. For all $i\in\{1,\dots,I\}\cup\{\infty\}$ we have, for $k = 0,\dots,N$ (resp. for $k = 0,\dots,N-1$),
$$\big|\theta^{i,I}_k\big|^2 \le E\big|\rho^N_k\big|^2 + E\big|\zeta^N_k\big|^2, \qquad \text{resp.}\quad \big|\theta^{\infty,I}_k - \theta^{i,I}_k\big|^2 \le L_f^ih^{2i}E\big|\rho^N_k\big|^2.$$
Furthermore, we have the following explicit expression of $\theta^{\infty,I}_k$ for $v_k$ defined by (5.1):
$$\theta^{\infty,I}_k = E\Big[v_k\Big(\alpha^{I,I}_{k+1}.p_{k+1} + hf\big(X^N_{t_k},\alpha^{\infty,I}_k.p_k,\beta^I_k.p_k\big) + \overleftarrow{\Delta B}_k\,g\big(X^N_{t_{k+1}},\alpha^{I,I}_{k+1}.p_{k+1}\big)\Big)\Big]. \quad (5.8)$$
Proof. Proposition 5.2 implies that
$$\big|\theta^{i,I}_k\big|^2 = \big|\alpha^{i,I}_k\big|^2 + h\big|\beta^I_k\big|^2 \le E\big|\rho^N_k\big|^2 + E\big|\zeta^N_k\big|^2.$$
Using inequality (4.10) and Proposition 5.2, since $E|p_k|^2 = 1$ we obtain
$$\big|\theta^{\infty,I}_k - \theta^{i,I}_k\big|^2 = E\big|Y^{N,\infty,I}_{t_k} - Y^{N,i,I}_{t_k}\big|^2 \le L_f^ih^{2i}E\big|Y^{N,\infty,I}_{t_k}\big|^2 \le L_f^ih^{2i}E\big|\rho^N_k\big|^2.$$
Using equation (4.9) and the fact that the components of $p_k$ are an orthonormal family of $L^2$, we have
$$\alpha^{\infty,I}_k = E\big[p_kY^{N,\infty,I}_{t_k}\big] = E\Big[p_kP_k\Big(Y^{N,I,I}_{t_{k+1}} + hf\big(X^N_{t_k},Y^{N,\infty,I}_{t_k},Z^{N,I}_{t_k}\big) + \overleftarrow{\Delta B}_k\,g\big(X^N_{t_{k+1}},Y^{N,I,I}_{t_{k+1}}\big)\Big)\Big] = E\Big[p_k\Big(\alpha^{I,I}_{k+1}.p_{k+1} + hf\big(X^N_{t_k},\alpha^{\infty,I}_k.p_k,\beta^I_k.p_k\big) + \overleftarrow{\Delta B}_k\,g\big(X^N_{t_{k+1}},\alpha^{I,I}_{k+1}.p_{k+1}\big)\Big)\Big].$$
A similar computation based on equation (4.1) and on the independence of $\mathcal{F}_{t_k}$ and $\Delta W_{k+1}$ yields
$$\sqrt h\,\beta^I_k = E\big[\sqrt h\,p_kZ^{N,I}_{t_k}\big] = E\Big[\tfrac{1}{\sqrt h}p_kP_k\Big(Y^{N,I,I}_{t_{k+1}}\Delta W_{k+1} + \overleftarrow{\Delta B}_k\,g\big(X^N_{t_{k+1}},Y^{N,I,I}_{t_{k+1}}\big)\Delta W_{k+1}\Big)\Big] = E\Big[p_k\tfrac{\Delta W_{k+1}}{\sqrt h}\Big(\alpha^{I,I}_{k+1}.p_{k+1} + hf\big(X^N_{t_k},\alpha^{\infty,I}_k.p_k,\beta^I_k.p_k\big) + \overleftarrow{\Delta B}_k\,g\big(X^N_{t_{k+1}},\alpha^{I,I}_{k+1}.p_{k+1}\big)\Big)\Big],$$
the $f$-term being allowed in the last expectation since it is $\mathcal{F}_{t_k}$-measurable and $\Delta W_{k+1}$ is centered and independent of $\mathcal{F}_{t_k}$. Finally, recalling from (5.1) that $v_k := \big(p_k,\ p_k\tfrac{\Delta W_{k+1}}{\sqrt h}\big)$, this concludes the proof. □
5.3. The numerical scheme. Let $\xi : \mathbb{R}\to\mathbb{R}$ be a $C^2_b$ function such that $\xi(x) = x$ for $|x| \le 3/2$, $|\xi|_\infty \le 2$ and $|\xi'|_\infty \le 1$. We define the random truncation functions
$$\widehat\rho^N_k(x) := \rho^N_k\,\xi\Big(\frac{x}{\rho^N_k}\Big), \qquad \widehat\zeta^N_k(x) := \zeta^N_k\,\xi\Big(\frac{x}{\zeta^N_k}\Big). \quad (5.9)$$
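A minimal sketch of these truncations in code, under a loudly stated substitution: we use the non-smooth clip $x\mapsto\max(-2,\min(x,2))$ in place of the $C^2_b$ cutoff $\xi$. The clip shares the properties $\xi(x) = x$ for $|x|\le 3/2$, $|\xi|_\infty\le 2$ and 1-Lipschitz, but a genuinely smooth $\xi$ is required for the paper's analysis:

```python
import numpy as np

def xi(x):
    """Non-smooth stand-in for the C^2_b cutoff xi of the paper:
    identity on [-2, 2], bounded by 2, 1-Lipschitz."""
    return np.clip(x, -2.0, 2.0)

def rho_hat(x, level):
    """Truncation rho_hat(x) = level * xi(x / level), for level = rho^N_k >= 1."""
    return level * xi(x / level)

level = 3.0                                    # hypothetical value of rho^N_k
x = np.array([-10.0, -2.0, 0.5, 2.0, 10.0])
y = rho_hat(x, level)

assert np.all(np.abs(y) <= 2 * level)          # bounded by 2 * rho^N_k
assert np.all(np.abs(y) <= np.abs(x) + 1e-12)  # |rho_hat(x)| <= |x|
assert abs(rho_hat(2.0, level) - 2.0) < 1e-12  # x is left invariant when |x| <= level
```

These three assertions mirror parts (1)-(3) of Lemma 5.5 below.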
The following lemma states some properties of these functions.
Lemma 5.5. Let $\widehat\rho^N_k$ and $\widehat\zeta^N_k$ be defined by (5.9); then:
(1) $\widehat\rho^N_k$ (resp. $\widehat\zeta^N_k$) leaves $Y^{N,I,I}_{t_k}$ (resp. $\sqrt h\,Z^{N,I}_{t_k}$) invariant, that is:
$$\widehat\rho^N_k\big(\alpha^{I,I}_k.p_k\big) = \alpha^{I,I}_k.p_k, \qquad \widehat\zeta^N_k\big(\sqrt h\,\beta^I_k.p_k\big) = \sqrt h\,\beta^I_k.p_k.$$
(2) $\widehat\rho^N_k$ and $\widehat\zeta^N_k$ are 1-Lipschitz, and $\big|\widehat\rho^N_k(x)\big| \le |x|$ for every $x\in\mathbb{R}$.
(3) $\widehat\rho^N_k$ (resp. $\widehat\zeta^N_k$) is bounded by $2\rho^N_k$ (resp. by $2\zeta^N_k$).
Proof. In parts (1)-(3) we only give the proof for $\widehat\rho^N_k$, since that for $\widehat\zeta^N_k$ is similar.
1. By Proposition 5.2, $\Big|\frac{\alpha^{I,I}_k.p_k}{\rho^N_k}\Big| \le 1$. Hence $\xi\Big(\frac{\alpha^{I,I}_k.p_k}{\rho^N_k}\Big) = \frac{\alpha^{I,I}_k.p_k}{\rho^N_k}$.
2. Let $y,y'\in\mathbb{R}$; since $|\xi'|_\infty \le 1$,
$$\big|\widehat\rho^N_k(y) - \widehat\rho^N_k(y')\big| = \rho^N_k\Big|\xi\Big(\frac{y}{\rho^N_k}\Big) - \xi\Big(\frac{y'}{\rho^N_k}\Big)\Big| \le |y - y'|.$$
Since $\widehat\rho^N_k(0) = 0$, we deduce $\big|\widehat\rho^N_k(x)\big| \le |x|$.
3. This upper estimate is a straightforward consequence of $|\xi|_\infty \le 2$; this concludes the proof. □
Let $\big(X^{N,m}_\cdot\big)_{1\le m\le M}$, $\big(\Delta W^m_\cdot\big)_{1\le m\le M}$ and $\big(\overleftarrow{\Delta B}^m_\cdot\big)_{1\le m\le M}$ be independent realizations of $X^N$, $\Delta W$ and $\overleftarrow{\Delta B}$ respectively. In a similar way, we introduce the following random variables and random functions:
$$\zeta^{N,m}_k := \rho^{N,m}_k := |p^m_k|\sqrt{C_0}\vee 1, \qquad \widehat\zeta^{N,m}_k(x) := \zeta^{N,m}_k\,\xi\Big(\frac{x}{\zeta^{N,m}_k}\Big), \qquad \widehat\rho^{N,m}_k(x) := \rho^{N,m}_k\,\xi\Big(\frac{x}{\rho^{N,m}_k}\Big), \quad x\in\mathbb{R}. \quad (5.10)$$
An argument similar to that used to prove Lemma 5.5 yields the following:
Lemma 5.6. The random functions $\widehat\rho^{N,m}_k(\cdot)$ defined above satisfy the following properties:
(1) $\widehat\rho^{N,m}_k$ is bounded by $2\rho^{N,m}_k$ and is 1-Lipschitz.
(2) $\rho^{N,m}_k$ and $\rho^N_k$ have the same distribution.
We now describe the numerical scheme.
Definition 5.7. Initialization. At time $t = t_N$, set $Y^{N,i,I,M}_{t_N} := \alpha^{i,I,M}_N.p_N := P_N\Phi\big(X^N_{t_N}\big)$ and $\beta^{i,I,M}_N = 0$ for all $i\in\{1,\dots,I\}$.
Induction. Assume that an approximation $Y^{N,i,I,M}_{t_l}$ has been built for $l = k+1,\dots,N$, and let
$$Y^{N,I,I,M,m}_{t_{k+1}} := \widehat\rho^{N,m}_{k+1}\big(\alpha^{I,I,M}_{k+1}.p^m_{k+1}\big)$$
denote its realization along the $m$th simulation. Set $\beta^{0,I,M}_k = 0$. For $i = 1,\dots,I$, the vector $\theta^{i,I,M}_k := \big(\alpha^{i,I,M}_k,\sqrt h\,\beta^{i,I,M}_k\big)$ is defined by (forward) induction as the arg min in $(\alpha,\beta)$ of the quantity
$$\frac1M\sum_{m=1}^M\Big|Y^{N,I,I,M,m}_{t_{k+1}} - \alpha.p^m_k + hf\big(X^{N,m}_{t_k},\alpha^{i-1,I,M}_k.p^m_k,\beta^{i-1,I,M}_k.p^m_k\big) + \overleftarrow{\Delta B}^m_k\,g\big(X^{N,m}_{t_{k+1}},Y^{N,I,I,M,m}_{t_{k+1}}\big) - \beta.p^m_k\,\Delta W^m_{k+1}\Big|^2. \quad (5.11)$$
This minimization problem is similar to (5.7), replacing the expected value by an average over $M$ independent realizations. Note that $\theta^{i,I,M}_k = \big(\alpha^{i,I,M}_k,\sqrt h\,\beta^{i,I,M}_k\big)$ is a random vector. We finally set:
$$Y^{N,I,I,M}_{t_k} := \widehat\rho^N_k\big(\alpha^{I,I,M}_k.p_k\big), \qquad \sqrt h\,Z^{N,I,I,M}_{t_k} := \widehat\zeta^N_k\big(\sqrt h\,\beta^{I,I,M}_k.p_k\big). \quad (5.12)$$
The following theorem gives an upper estimate of the $L^2$ error between $\big(Y^{N,I,I}_\cdot,Z^{N,I}_\cdot\big)$ and $\big(Y^{N,I,I,M}_\cdot,Z^{N,I,I,M}_\cdot\big)$ in terms of $\zeta^N_\cdot$ and $\rho^N_\cdot$; it is the main result of this section. We recall that by (5.4),
$$A^M_k = \bigcap_{j=k}^{N-1}\big\{\|V^M_j - \mathrm{Id}\| \le h,\ \|P^M_j - \mathrm{Id}\| \le h\big\} \in \mathcal{F}^{W,M}_T\vee\mathcal{F}^{B,M}_{t_k,T}.$$
For $k = 1,\dots,N-1$ set
$$\epsilon_k := E\big\|v_kv_k^* - \mathrm{Id}\big\|_F^2\Big(E\big|\rho^N_k\big|^2 + E\big|\zeta^N_k\big|^2\Big) + E\big[|v_k|^2|p_{k+1}|^2\big]E\big|\rho^N_{k+1}\big|^2 + h^2E\Big[|v_k|^2\Big(1+\big|X^N_k\big|^2 + |p_k|^2E\big|\rho^N_k\big|^2 + \tfrac1h|p_k|^2E\big|\zeta^N_k\big|^2\Big)\Big] + hE\Big[\Big(|v_k|^2 + \big|w^p_k\big|^2\Big)\Big(1+\big|X^N_{t_{k+1}}\big|^2 + |p_{k+1}|^2E\big|\rho^N_{k+1}\big|^2\Big)\Big]. \quad (5.13)$$
Choosing $N$ and then $M$ large enough, the following result gives the speed of convergence of the Monte Carlo approximation scheme of $Y^{N,I,I}$ and $Z^{N,I}$.
Theorem 5.8. There exists a constant $C > 0$ such that for $h$ small enough, for any $k = 0,\dots,N-1$ and $M \ge 1$:
$$\mathcal{E}^M := E\big|Y^{N,I,I}_{t_k} - Y^{N,I,I,M}_{t_k}\big|^2 + h\sum_{j=k}^{N-1}E\big|Z^{N,I}_{t_j} - Z^{N,I,I,M}_{t_j}\big|^2 \le 16\sum_{j=k}^{N-1}E\Big[\Big(\big|\zeta^N_j\big|^2 + \big|\rho^N_j\big|^2\Big)\mathbf{1}_{[A^M_k]^c}\Big] + Ch^{I-1}\sum_{j=k}^{N-1}\Big(h^2 + hE\big|\rho^N_{j+1}\big|^2 + E\big|\rho^N_j\big|^2 + E\big|\zeta^N_j\big|^2\Big) + \frac{C}{hM}\sum_{j=k}^{N-1}\epsilon_j.$$
5.4. Proof of Theorem 5.8. Before starting the proof, let us recall some results on regression (i.e. orthogonal projections). Let $v = (v^m)_{1\le m\le M}$ be a sequence of vectors in $\mathbb{R}^n$. Define the $n\times n$ matrix $V^M := \frac1M\sum_{m=1}^M v^m(v^m)^*$, suppose that $V^M$ is invertible and denote by $\lambda_{\min}(V^M)$ its smallest eigenvalue.
Lemma 5.9. Under the above hypotheses, we have the following results. Let $x = (x^m,\ m = 1,\dots,M)$ be a vector in $\mathbb{R}^M$.
(1) There exists a unique $\mathbb{R}^n$-valued vector $\theta_x$ satisfying $\theta_x = \arg\inf_{\theta\in\mathbb{R}^n}|x - \theta.v|^2_M$, where $\theta.v$ denotes the vector $\big(\sum_{i=1}^n\theta(i)v^m(i),\ m = 1,\dots,M\big)$.
(2) Moreover, we have $\theta_x = \frac1M\big(V^M\big)^{-1}\sum_{m=1}^M x^mv^m$.
(3) The map $x\mapsto\theta_x$ is linear from $\mathbb{R}^M$ to $\mathbb{R}^n$, and $\lambda_{\min}(V^M)|\theta_x|^2 \le |\theta_x.v|^2_M \le |x|^2_M$.
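Lemma 5.9 is the classical normal-equations description of empirical least squares, and it can be checked against a generic solver (toy data here, not the scheme's regressors):

```python
import numpy as np

rng = np.random.default_rng(2)
n, M = 3, 50
v = rng.standard_normal((M, n))            # regressor vectors v^m (rows)
x = rng.standard_normal(M)                 # observations x^m

VM = v.T @ v / M                           # V^M = (1/M) sum v^m (v^m)^*
theta = np.linalg.solve(VM, v.T @ x / M)   # theta_x = (V^M)^{-1} (1/M) sum x^m v^m

# the same theta_x from a generic least-squares solver
theta_lstsq = np.linalg.lstsq(v, x, rcond=None)[0]
assert np.allclose(theta, theta_lstsq)

# lambda_min(V^M) |theta_x|^2 <= |theta_x . v|^2_M <= |x|^2_M   (part (3))
lam_min = np.min(np.linalg.eigvalsh(VM))
norm_fit = np.mean((v @ theta) ** 2)       # |theta_x . v|^2_M
assert lam_min * theta @ theta <= norm_fit + 1e-12
assert norm_fit <= np.mean(x ** 2) + 1e-12
```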
The following lemma gives a first upper estimate of $\mathcal{E}^M$.
Lemma 5.10. For every $M$ and $k = 0,\dots,N-1$, we have the following upper estimate:
$$\mathcal{E}^M \le E\Big[\big|\alpha^{I,I}_k - \alpha^{I,I,M}_k\big|^2\mathbf{1}_{A^M_k}\Big] + h\sum_{j=k}^{N-1}E\Big[\big|\beta^I_j - \beta^{I,I,M}_j\big|^2\mathbf{1}_{A^M_j}\Big] + 16E\Big[\big|\rho^N_k\big|^2\mathbf{1}_{[A^M_k]^c}\Big] + 16\sum_{j=k}^{N-1}E\Big[\big|\zeta^N_j\big|^2\mathbf{1}_{[A^M_j]^c}\Big].$$
This lemma should be compared with inequality (31) in [7].
Proof. Using the decompositions of $Y^{N,I,I}$, $Y^{N,I,I,M}$, $Z^{N,I}$ and $Z^{N,I,I,M}$ and Lemma 5.5 (1), we deduce
$$\mathcal{E}^M = E\big|\widehat\rho^N_k\big(\alpha^{I,I}_k.p_k\big) - \widehat\rho^N_k\big(\alpha^{I,I,M}_k.p_k\big)\big|^2 + h\sum_{j=k}^{N-1}E\Big|\tfrac{1}{\sqrt h}\widehat\zeta^N_j\big(\sqrt h\,\beta^I_j.p_j\big) - \tfrac{1}{\sqrt h}\widehat\zeta^N_j\big(\sqrt h\,\beta^{I,I,M}_j.p_j\big)\Big|^2.$$
Using the partition $\big(A^M_k,[A^M_k]^c\big)$, where $A^M_k$ is defined by (5.4), the Cauchy--Schwarz inequality, Lemma 5.5 and the independence of $\big(\alpha^{I,I,M}_k,\beta^{I,I,M}_j,\mathbf{1}_{A^M_k}\big)$ and $p_k$, we deduce:
$$\mathcal{E}^M \le E\Big[\big|\alpha^{I,I}_k.p_k - \alpha^{I,I,M}_k.p_k\big|^2\mathbf{1}_{A^M_k}\Big] + h\sum_{j=k}^{N-1}E\Big[\big|\beta^I_j.p_j - \beta^{I,I,M}_j.p_j\big|^2\mathbf{1}_{A^M_j}\Big] + 2E\Big[\Big(\big|\widehat\rho^N_k\big(\alpha^{I,I}_k.p_k\big)\big|^2 + \big|\widehat\rho^N_k\big(\alpha^{I,I,M}_k.p_k\big)\big|^2\Big)\mathbf{1}_{[A^M_k]^c}\Big] + 2\sum_{j=k}^{N-1}E\Big[\Big(\big|\widehat\zeta^N_j\big(\sqrt h\,\beta^I_j.p_j\big)\big|^2 + \big|\widehat\zeta^N_j\big(\sqrt h\,\beta^{I,I,M}_j.p_j\big)\big|^2\Big)\mathbf{1}_{[A^M_j]^c}\Big]$$
$$\le E\Big[\big(\alpha^{I,I}_k - \alpha^{I,I,M}_k\big)^*p_kp_k^*\big(\alpha^{I,I}_k - \alpha^{I,I,M}_k\big)\mathbf{1}_{A^M_k}\Big] + h\sum_{j=k}^{N-1}E\Big[\big(\beta^I_j - \beta^{I,I,M}_j\big)^*p_jp_j^*\big(\beta^I_j - \beta^{I,I,M}_j\big)\mathbf{1}_{A^M_j}\Big] + 16E\Big[\big|\rho^N_k\big|^2\mathbf{1}_{[A^M_k]^c}\Big] + 16\sum_{j=k}^{N-1}E\Big[\big|\zeta^N_j\big|^2\mathbf{1}_{[A^M_j]^c}\Big]$$
and, since $E\big(p_kp_k^*\big) = \mathrm{Id}$ and the coefficient vectors are independent of $p_k$,
$$\mathcal{E}^M \le E\Big[\big|\alpha^{I,I}_k - \alpha^{I,I,M}_k\big|^2\mathbf{1}_{A^M_k}\Big] + h\sum_{j=k}^{N-1}E\Big[\big|\beta^I_j - \beta^{I,I,M}_j\big|^2\mathbf{1}_{A^M_j}\Big] + 16E\Big[\big|\rho^N_k\big|^2\mathbf{1}_{[A^M_k]^c}\Big] + 16\sum_{j=k}^{N-1}E\Big[\big|\zeta^N_j\big|^2\mathbf{1}_{[A^M_j]^c}\Big].$$
This concludes the proof. □
We now upper estimate $\big|\theta^{I,I,M}_k - \theta^{I,I}_k\big|^2$ on the event $A^M_k$. This will be done in several lemmas below. By definition, $\|V^M_k - \mathrm{Id}\| \le h$ on $A^M_k$; hence for $h\in(0,1)$,
$$1-h \le \lambda_{\min}\big(V^M_k(\omega)\big) \quad\text{on } A^M_k. \quad (5.14)$$
Lemma 5.11. For every $\alpha\in\mathbb{R}^n$ and $k = 1,\dots,N$, we have $\frac1M\sum_{m=1}^M|\alpha.p^m_k|^2 \le \big\|P^M_k\big\|\,|\alpha|^2$.
Proof. The definitions of the Euclidean norm and of $P^M_k$ imply
$$\frac1M\sum_{m=1}^M|\alpha.p^m_k|^2 = \alpha^*\Big(\frac1M\sum_{m=1}^M p^m_k\big(p^m_k\big)^*\Big)\alpha = \alpha^*P^M_k\alpha \le \big\|P^M_k\big\|\,|\alpha|^2;$$
this concludes the proof. □
For $i = 0,\dots,I$, we introduce the vector $x^{i,I,M}_k := \big(x^{i,I,m,M}_k\big)_{m=1,\dots,M}$ defined for $m = 1,\dots,M$ by:
$$x^{i,I,m,M}_k := \widehat\rho^{N,m}_{k+1}\big(\alpha^{I,I,M}_{k+1}.p^m_{k+1}\big) + hf\big(X^{N,m}_k,\alpha^{i,I,M}_k.p^m_k,\beta^{i,I,M}_k.p^m_k\big) + \overleftarrow{\Delta B}^m_k\,g\big(X^{N,m}_{k+1},\widehat\rho^{N,m}_{k+1}\big(\alpha^{I,I,M}_{k+1}.p^m_{k+1}\big)\big). \quad (5.15)$$
Using Lemma 5.9, we can rewrite equation (5.11) as follows:
$$\theta^{i,I,M}_k = \arg\inf_\theta\big|x^{i-1,I,M}_k - \theta.v^m_k\big|^2_M = \frac1M\big(V^M_k\big)^{-1}\sum_{m=1}^M x^{i-1,I,m,M}_k\,v^m_k. \quad (5.16)$$
We will need the following lemma.
Lemma 5.12. For all $k = 0,\dots,N-1$ and every $I$, the random variables $\alpha^{I,I,M}_k$ are $\mathcal{F}^{W,M}_T\vee\mathcal{F}^{B,M}_{t_k,T}$ measurable.
Proof. The proof uses backward induction on $k$ and forward induction on $i$.
Initialization. Let $k = N-1$. By definition $\alpha^{0,I,M}_{N-1} = 0$. Let $i \ge 1$ and suppose $\alpha^{i-1,I,M}_{N-1} \in \mathcal{F}^{W,M}_T\vee\mathcal{F}^{B,M}_{t_{N-1},T}$. Using (5.1) (resp. (5.2)), we deduce that $v^m_{N-1} \in \mathcal{F}^{W,m}_T\vee\mathcal{F}^{B,m}_{t_{N-1},T}$ (resp. $V^M_{N-1} \in \mathcal{F}^{W,M}_T\vee\mathcal{F}^{B,M}_{t_{N-1},T}$). Furthermore, (5.15) shows that $x^{i-1,I,m,M}_{N-1} \in \mathcal{F}^{W,M}_T\vee\mathcal{F}^{B,M}_{t_{N-1},T}$, and hence (5.16) implies that $\alpha^{i,I,M}_{N-1} \in \mathcal{F}^{W,M}_T\vee\mathcal{F}^{B,M}_{t_{N-1},T}$.
Induction. Suppose that $\alpha^{I,I,M}_{k+1} \in \mathcal{F}^{W,M}_T\vee\mathcal{F}^{B,M}_{t_{k+1},T}$; we prove by forward induction on $i$ that $\alpha^{i,I,M}_k \in \mathcal{F}^{W,M}_T\vee\mathcal{F}^{B,M}_{t_k,T}$ for $i = 0,\dots,I$. By definition $\alpha^{0,I,M}_k = 0$. Suppose $\alpha^{i-1,I,M}_k \in \mathcal{F}^{W,M}_T\vee\mathcal{F}^{B,M}_{t_k,T}$; we prove that $\alpha^{i,I,M}_k \in \mathcal{F}^{W,M}_T\vee\mathcal{F}^{B,M}_{t_k,T}$ by similar arguments. Indeed, (5.1) (resp. (5.2)) implies that $v^m_k \in \mathcal{F}^{W,m}_T\vee\mathcal{F}^{B,m}_{t_k,T}$ (resp. $V^M_k \in \mathcal{F}^{W,M}_T\vee\mathcal{F}^{B,M}_{t_k,T}$), while (5.15) (resp. (5.16)) yields $x^{i-1,I,m,M}_k \in \mathcal{F}^{W,M}_T\vee\mathcal{F}^{B,M}_{t_k,T}$ (resp. $\alpha^{i,I,M}_k \in \mathcal{F}^{W,M}_T\vee\mathcal{F}^{B,M}_{t_k,T}$). This concludes the proof. □
The following lemma gives an inductive upper estimate of $\big|\theta^{i+1,I,M}_k - \theta^{i,I,M}_k\big|^2$.
Lemma 5.13. There exists $\widetilde C > 0$ such that for small $h$, for $k = 0,\dots,N-1$ and for $i = 1,\dots,I-1$,
$$\big|\theta^{i+1,I,M}_k - \theta^{i,I,M}_k\big|^2 \le \widetilde Ch\,\big|\theta^{i,I,M}_k - \theta^{i-1,I,M}_k\big|^2 \quad\text{on } A^M_k.$$
Proof. Using (5.14) and Lemma 5.9 (3), we obtain on $A^M_k$
$$(1-h)\big|\theta^{i+1,I,M}_k - \theta^{i,I,M}_k\big|^2 \le \lambda_{\min}\big(V^M_k\big)\big|\theta^{i+1,I,M}_k - \theta^{i,I,M}_k\big|^2 \le \big|x^{i,I,M}_k - x^{i-1,I,M}_k\big|^2_M.$$