• Aucun résultat trouvé

Switching problems with controlled randomisation and associated obliquely reflected BSDEs

N/A
N/A
Protected

Academic year: 2021

Partager "Switching problems with controlled randomisation and associated obliquely reflected BSDEs"

Copied!
51
0
0

Texte intégral

(1)

HAL Id: hal-02463626

https://hal.archives-ouvertes.fr/hal-02463626v1

Submitted on 1 Feb 2020 (v1), last revised 13 Dec 2021 (v2)

HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de

Switching problems with controlled randomisation and associated obliquely reflected BSDEs

Cyril Bénézet, Jean-François Chassagneux, Adrien Richou

To cite this version:

Cyril Bénézet, Jean-François Chassagneux, Adrien Richou. Switching problems with controlled ran-

domisation and associated obliquely reflected BSDEs. Stochastic Processes and their Applications,

Elsevier, 2022, pp.23-71. �10.1016/j.spa.2021.10.010�. �hal-02463626v1�

(2)

Switching problems with controlled randomisation and associated obliquely reected BSDEs

Cyril Bénézet

, Jean-François Chassagneux

, Adrien Richou

January 30, 2020

Abstract

We introduce and study a new class of optimal switching problems, namely switching problem with controlled randomisation, where some extra-randomness impacts the choice of switching modes and associated costs. We show that the optimal value of the switching problem is related to a new class of multidimen- sional obliquely reected BSDEs. These BSDEs allow as well to construct an optimal strategy and thus to solve completely the initial problem. The other main contribution of our work is to prove new existence and uniqueness results for these obliquely reected BSDEs. This is achieved by a careful study of the domain of re- ection and the construction of an appropriate oblique reection operator in order to invoke results from [7].

1 Introduction

In this work, we introduce and study a new class of optimal switching problems in stochastic control theory. The interest in switching problems comes mainly from their connections to nancial and economic problems, like the pricing of real options [4]. In a celebrated article [14], Hamadène and Jeanblanc study the fair valuation of a company producing electricity. In their work, the company management can choose between two modes of production for their power plant operating or close and a time of switch- ing from one state to another, in order to maximise its expected return. Typically, the company will buy electricity on the market if the power station is not operating.

The company receives a prot for delivering electricity in each regime. The main point here is that a xed cost penalizes the prot upon switching. This switching problem has been generalized to more than two modes of production [10]. Let us now discuss this switching problem with d ¥ 2 modes in more details. The costs to switch from

Centre de Mathématiques Appliquées (CMAP), Ecole Polytechnique and CNRS, Université Paris- Saclay, Route de Saclay, 91128 Palaiseau Cedex, France (cyril.benezet@polytechnique.edu)

UFR de Mathématiques & LPSM, Université de Paris, Bâtiment Sophie Germain, 8 place Aurélie Nemours, 75013 Paris, France (chassagneux@lpsm.paris)

UMR 5251 & IMB, Université de Bordeaux, F-33400 Talence, France (adrien.richou@math.u-bordeaux.fr)

(3)

one state to another are given by a matrix p c

i,j

q

1¤i,j¤d

. The management optimises the expected company prots by choosing switching strategies which are sequences of stopping times p τ

n

q

n¥0

and modes p ζ

n

q

n¥0

. The current state of the strategy is given by a

t

°

8

k0

ζ

k

1

rτkk 1q

p t q , t P r 0, T s , where T is a terminal time. To formalise the problem, we assume that we are working on a complete probability space pΩ, A, Pq supporting a Brownian Motion W . The stopping times are dened with respect to the ltration p F

t

q

t¥0

generated by this Brownian motion. Denoting by f p t, i q the instan- taneous prot received at time t in mode i, the time cumulated prot associated to a switching strategy is given by ³

T

0

f p t, a

t

q dt °

8

k0

c

ζkk 1

1

tτk 1¤t^Tu

. The management solves then at the initial time the following control problem

V

0

sup

aPA

E

»

T 0

f p t, a

t

q dt ¸

8

k0

c

ζkk 1

1

tτk 1¤Tu

, (1.1)

where A is a set of admissible strategies that will be precisely described below (see Section 2.1). We shall refer to problems of the form (1.1) under the name of classical switching problems. These problems have received a lot of interest and are now quite well understood [14, 10, 17, 5]. In our work, we introduce a new kind of switching problem, to model more realistic situations, by taking into account uncertainties that are encountered in practice. Coming back to the simple but enlightning example of an electricity producer described in [14], we introduce some extra-randomness in the production process. Namely, when switching to the operating mode, it may happen with hopefully a small probability that the station will have some dysfunction. This can be represented by a new mode of production with a greater switching cost than the business as usual one. To capture this phenomenon in our mathematical model, we introduce a randomisation procedure: the management decides the time of switching but the mode is chosen randomly according to some extra noise source. We shall refer to this kind of problem by randomised switching problem. However, we do not limit our study to this framework. Indeed, we allow some control by the agent on this randomisation.

Namely, the agent can chose optimally a probability distributions P

u

on the modes space given some parameter u P C , in the control space. The new mode ζ

k 1

is then drawn, independently of everything up to now, according to this distribution P

u

and a specic switching cost c

uζ

kk 1

is applied. The management strategy is thus given now by the sequence p τ

k

, u

k

q

k¥0

of switching times and controls. The maximisation problem is still given by (1.1). Let us observe however that E

c

uζk

kk 1

E °

1¤j¤d

P

ζuk

k,j

c

uζk

k,j

, thanks to the tower property of conditional expectation. In particular, we will only work with the mean switching costs c ¯

uik

: °

1¤j¤d

P

i,juk

c

ui,jk

in (1.1). We name this kind of control problem switching problem with controlled randomisation. Although their apparent modeling power, this kind of control problem has not been considered in the literature before, to the best of our knowledge. In particular, we will show that the classical or randomised switching problem are just special instances of this more generic problem. The switching problem with controlled randomisation is introduced rigorously in Section 2.1 below.

A key point in our work is to relate the control problem under consideration to a

(4)

new class of obliquely reected Backward Stochastic Dierential Equations (BSDEs).

In the rst part, following the approach of [14, 10, 17], we completely solve the switching problem with controlled randomisation by providing an optimal strategy. The optimal strategy is built by using the solution to a well chosen obliquely reected BSDE. Al- though this approach is not new, the link between the obliquely reected BSDE and the switching problem is more subtle than in the classical case due to the state uncertainty.

In particular, some care must be taken when dening the adaptedness property of the strategy and associated quantities. Indeed, a tailor-made ltration, studied in details in Appendix A.2, is associated to each admissible strategy. The state and cumulative cost processes are adapted to this ltration, and the associated reward process is dened as the Y -component of the solution to some switched BSDE in this ltration. The classical estimates used to identify an optimal strategy have to be adapted to take into account the extra orthogonal martingale arising when solving this switched BSDE in a non Brownian ltration.

In the second part of our work, we study the auxiliary obliquely reected BSDE, which is written in the Brownian ltration and represents the optimal value in all the possible starting modes. Reected BSDEs were rst considered by Gegout-Petit and Pardoux [13], in a multidimensional setting of normal reections. In one dimension, they have also been studied in [11] in the so called simply reected case, and in [8] in the doubly reected case. The multidimensional RBSDE associated to the classical switching problem is reected in a specic convex domain and involves oblique directions of reection. Due to the controlled randomisation, the domain in which the Y -component of the auxiliary RBSDE is constrained is dierent from the classical switching problem domain and its shape varies a lot from one model specication to another. The existence of a solution to the obliquely reected BSDE has thus to be studied carefully. We do so by relying on the article [7], that studies, in a generic way, the obliquely reected BSDE in a xed convex domain in both Markovian and non-Markovian setting. The main step for us here is to exhibit an oblique reection operator, with the good properties to use the results in [7]. We are able to obtain new existence results for this class of obliquely reected BSDEs. Because we are primarily interested in solving the control problem, we derive the uniqueness of the obliquely reected BSDEs in the Hu and Tang specication for the driver [17], namely f

i

p t, y, z q : f

i

p t, y

i

, z

i

q for i P t 1, . . . , d u. But our results can be easily generalized to the specication f

i

pt, y, zq : f

i

pt, y, z

i

q by using similar arguments as in [6].

The rest of the paper is organised as follows. In Section 2, we introduce the switching

problem with controlled randomisation. We prove that, if there exists a solution to

the associated BSDE with oblique reections, then its Y -component coincides with

the value of the switching problem. A verication argument allows then to deduce

uniqueness of the solution of the obliquely reected BSDE. In Section 3, we show that

there exists indeed a solution to the obliquely reected BSDE under some conditions

on the parameters of the switching problem and its randomisation. We also prove

uniqueness of the solution under some structural condition on the driver f . Finally, we

gather in the Appendix section some technical results.

(5)

Notations If n ¥ 1 , we let B

n

be the Borelian sigma-algebra on R

n

.

For any ltered probability space pΩ, G, F, Pq and constants T ¡ 0 and p ¥ 1, we dene the following spaces:

• L

pn

p G q is the set of G -measurable random variables X valued in R

n

satisfying E r| X |

p

s   8,

• P pFq is the predictable sigma-algebra on Ω r 0, T s,

• H

pn

pFq is the set of predictable processes φ valued in R

n

such that E

»

T

0

| φ

t

|

p

dt

  8 , (1.2)

• S

pn

pFq is the set of càdlàg processes φ valued in R

n

such that E

sup

0¤t¤T

| φ

t

|

p

  8 , (1.3)

• A

pn

pFq is the set of continuous processes φ valued in R

n

such that φ

T

P L

pn

p F

T

q and φ

i

is nondecreasing for all i 1, . . . , n .

If n 1 , we omit the subscript n in previous notations.

For d ¥ 1 , we denote by p e

i

q

di1

the canonical basis of R

d

and S

d

pRq the set of symmetric matrices of size d d with real coecients.

If D is a convex subset of R

d

(d ¥ 1) and y P D , we denote by C pyq the outward normal cone at y , dened by

C p y q : t v P R

d

: v

J

p z y q ¤ 0 for all z P D u . (1.4) We also set n p y q : C p y q X t v P R

d

: | v | 1 u.

If X is a matrix of size n m , I € t 1, . . . , n u and J € t 1, . . . , m u, we set X

pI,Jq

the matrix of size p n | I |q p m | J |q obtained from X by deleting rows with index i P I and columns with index j P J . If I t i u we set X

pi,Jq

: X

pI,Jq

, and similarly if J t j u.

If v is a vector of size n and 1 ¤ i ¤ n , we set v

piq

the vector of size n 1 obtained from v by deleting coecient i.

For p i, j q P t 1, . . . , d u, we dene i

pjq

: i 1

ti¡ju

P t 1, . . . , d 1 u , for d ¥ 2 .

We denote by ¥ the component by component partial ordering relation on vectors and

matrices.

(6)

2 Switching problem with controlled randomisation

We introduce here a new kind of stochastic control problem that we name switching problem with controlled randomisation. In contrast with the usual switching problems [14, 16, 17], the agent cannot choose directly the new state, but chooses a probability distribution under which the new state will be determined. In this section, we assume the existence of a solution to some auxiliary obliquely reected BSDE to characterize the value process and an optimal strategy for the problem, see Assumption 2.2 below.

Let p Ω, G, Pq be a probability space. We x a nite time horizon T ¡ 0 and κ ¥ 1, d ¥ 2 two integers. We assume that there exists a κ-dimensional Brownian motion W and a sequence p U

n

q

n¥1

of independent random variables, independent of W , uniformly distributed on r 0, 1 s. We also assume that G is generated by the Brownian motion W and the family p U

n

q

n¥1

. We dene F

0

p F

t0

q

t¥0

as the augmented Brownian ltration, which satises the usual conditions.

Let C be an ordered compact metric space and F : C t 1, . . . , d u r 0, 1 s Ñ t 1, . . . , d u a measurable map. To each u P C is associated a transition probability function on the state space t 1, . . . , d u, given by P

i,ju

: Pp F p u, i, U q j q for U uniformly distributed on r 0, 1 s. We assume that for all p i, j q P t 1, . . . , d u

2

, the map u ÞÑ P

i,ju

is continuous.

Let ¯ c : t 1, . . . , d u C Ñ R , p i, u q ÞÑ ¯ c

ui

a map such that u ÞÑ c ¯

ui

is continuous for all i 1, . . . , d. We denote sup

iPt1,...,du,uPC

¯ c

ui

: c ˇ and inf

iPt1,...,du,uPC

¯ c

ui

: ˆ c.

Let ξ p ξ

1

, . . . , ξ

d

q P L

2d

p F

T0

q and f : Ω r 0, T s R

d

R

dκ

Ñ R

d

a map satisfying

• f is P pF

0

q b B

d

b B

dκ

-measurable and f p , 0, 0 q P H

2d

pF

0

q.

• There exists L ¥ 0 such that, for all p t, y, y

1

, z, z

1

q P r 0, T sR

d

R

d

R

dκ

R

dκ

,

| f p t, y, z q f p t, y

1

, z

1

q| ¤ L p| y y

1

| | z z

1

|q .

These assumptions will be in force throughout our work. We shall also use, in this section only, the following additional assumptions.

Assumption 2.1. i) Switching costs are assumed to be positive, i.e. ˆ c ¡ 0 . ii) For all p t, y, z q P r 0, T s R

d

R

dκ

, it holds almost surely,

f p t, y, z q p f

i

p t, y

i

, z

i

qq

1¤i¤d

. (2.1) Remark 2.1. i) It is usual to assume positive costs in the litterature on switching prob- lem. In particular, the cumulative cost process, see (2.2), is non decreasing. Introducing signed costs adds extra technical diculties in the proof of the representation theorem (see e.g. [19] and references therein). We postpone the adaptation of our results in this more general framework to future works.

ii) Assumption (2.1) is also classical since it allows to get a comparison result for BS-

DEs which is key to obtain the representation theorem. Note however than our results

can be generalized to the case f

i

p t, y, z q f

i

p t, y, z

i

q for i P t 1, ..., d u by using similar

arguments as in [5].

(7)

2.1 Solving the control problem using obliquely reected BSDEs We dene in this section the stochastic optimal control problem. We rst introduce the strategies available to the agent and related processes. The denition of the strategy is more involved than in the usual switching problem setting since its adaptiveness property is understood with respect to a ltration built recursively.

A strategy is thus given by φ p ζ

0

, p τ

n

q

n¥0

, p α

n

q

n¥1

q where ζ

0

P t 1, . . . , d u, p τ

n

q

n¥0

is a nondecreasing sequence of random times and p α

n

q

n¥1

is a sequence of C -valued random variables, which satisfy:

• τ

0

P r 0, T s and ζ

0

P t 1, . . . , d u are deterministic.

• For all n ¥ 0 , τ

n 1

is a F

n

-stopping time and α

n 1

is F

τn

n 1

-measurable (recall that F

0

is the augmented Brownian ltration). We then set F

n 1

p F

tn 1

q

t¥0

with F

tn 1

: F

tn

_ σ p U

n 1

1

tτn 1¤tu

q.

Lastly, we dene F

8

p F

t8

q

t¥0

with F

t8

: š

n¥0

F

tn

, t ¥ 0 . For a strategy φ p ζ

0

, p τ

n

q

n¥0

, p α

n

q

n¥1

q, we set, for n ¥ 0 ,

ζ

n 1

: F p α

n 1

, ζ

n

, U

n 1

q and a

t

: ¸

8

k0

ζ

k

1

rτkk 1q

p t q , t ¥ 0 ,

which represents the state after a switch and the state process respectively. We also introduce two processes, for t ¥ 0 ,

A

φt

¸

8

k0

¯ c

αζk 1

k

1

tτk 1¤tu

and N

tφ

: ¸

k¥0

1

tτk 1¤tu

. (2.2) The random variable A

φt

is the cumulative cost up to time t and N

tφ

is the number of switches before time t . Notice that the processes p a, A

φ

, N

φ

q are adapted to F

8

and that A

φ

is a non decreasing process.

We say that a strategy φ p ζ

0

, p τ

n

q

n¥0

, p α

n

q

n¥1

q is an admissible strategy if the cumu- lative cost process satises

A

φT

A

φτ0

P L

2

pF

8T

q and E

A

φτ0 2

F

τ0

0

  8 a.s. (2.3)

We denote by A the set of admissible strategies, and for t P r 0, T s and i P t 1, . . . , d u, we denote by A

ti

the subset of admissible strategies satisfying ζ

0

i and τ

0

t . Remark 2.2. i) The denition of an admissible strategy is slightly weaker than usual

[17], which requires the stronger property A

φT

P L

2

p F

T8

q. But, importantly, the

above denition is enough to dene the switched BSDE associated to an admissible

control, see below. Moreover, we observe in the next section that optimal strategies

are admissible with respect to our denition, but not necessarily with the usual one,

due to possible simultaneous jumps at the initial time.

(8)

ii) For technical reasons involving possible simultaneous jumps, we cannot consider the generated ltration associated to a, which is contained in F

8

.

We are now in position to introduce the reward associated to an admissible strategy. If φ p ζ

0

, p τ

n

q

n¥0

, p α

n

q

n¥1

q P A , the reward is dened as the value E

U

τφ0

A

φτ0

F

τ0

0

, where p U

φ

, V

φ

, M

φ

q P S

2

pF

8

q H

2κ

pF

8

q H

2

pF

8

q is the solution of the following switched BSDE [17] on the ltered probability space p Ω, G, F

8

, Pq:

U

t

ξ

aT

»

T

t

f

as

p s, U

s

, V

s

q ds

»

T

t

V

s

dW

s

»

T

t

dM

s

»

T

t

dA

φs

, t P r τ

0

, T s . (2.4) Remark 2.3. This switched BSDE rewrites as a classical BSDE in F

8

, and since A

φ

A

φt

P S

2

pF

8

q, the terminal condition and the driver are standard parameters, there exists a unique solution to (2.4) for all φ P A . We refer to Section A.2.2 for more details.

For t P r0, T s and i P t1, . . . , du, the agent aims thus to solve the following maximisation problem:

V

ti

ess sup

φPAti

E

U

tφ

A

φt

F

t0

. (2.5)

We rst remark that this control problem corresponds to (1.1) as soon as f does not depend on y and z. Moreover, the term E

A

φt

F

t0

is non zero if and only if we have at least one instantaneous switch at initial time t .

The main result of this section is the next theorem that relates the value process V to the solution of an obliquely reected BSDEs, that is introduced in the following assumption:

Assumption 2.2. i) There exists a solution p Y, Z, K q P S

2d

pF

0

q H

2dκ

pF

0

q A

2d

pF

0

q to the following obliquely reected BSDE:

Y

ti

ξ

»

T

t

f

i

ps, Y

si

, Z

si

qds

»

T

t

Z

si

dW

s

»

T

t

dK

si

, t P r0, T s, i P I, (2.6)

Y

t

P D, t P r0, T s, (2.7)

»

T

0

Y

ti

sup

uPC

#

d

¸

j1

P

i,ju

Y

tj

c ¯

ui

+

dK

ti

0, i P I, (2.8)

where I : t 1, . . . , d u and D is the following convex subset of R

d

: D :

#

y P R

d

: y

i

¥ sup

uPC

#

d

¸

j1

P

i,ju

y

j

¯ c

ui

+

, i P I +

. (2.9)

ii) For all u P C and i P t 1, . . . , d u, we have P

i,iu

1 .

(9)

Let us observe that the positive costs assumption implies that D has a non-empty interior. Except for Section 2.2, this is the main setting for this part, recall Remark 2.1.

In Section 3, the system (2.6)-(2.7)-(2.8) is studied in details in a general costs setting.

An important step is then to understand when D has non-empty interior.

Theorem 2.1. Assume that Assumptions 2.1 and 2.2 are satised.

1. For all i P t 1, . . . , d u, t P r 0, T s and φ P A

ti

, we have Y

ti

¥ E

U

tφ

A

φt

F

t0

. 2. We have Y

ti

E

U

tφ

A

φt

F

t0

, where φ

p i, p τ

n

q

n¥0

, p α

n

q

n¥1

q P A

ti

is dened in (2.27)-(2.28).

The proof is given at the end of Section 2.3. It will use several lemmata that we introduce below. We rst remark, that as an immediate consequence, we obtain the uniqueness of the BSDE used to characterize the value process of the control problem.

Corollary 2.1. Under Assumptions 2.1 and 2.2, there exists a unique solution p Y, Z, K q P S

2d

pF

0

q H

2dκ

pF

0

q A

2d

pF

0

q to the obliquely reected BSDE (2.6)-(2.7)-(2.8).

Remark 2.4. The classical switching problem is an example of switching problem with controlled randomisation. Indeed, we just have to consider C t 1, ..., d 1 u,

P

i,ju

#

1 if j i u mod d,

0 otherwise. , @ u P C, 1 ¤ i, j ¤ d and

c

i,j

$ '

&

' %

¯

c

jii

if j ¡ i

¯

c

jii d

if j   i 0 if j i.

@ u P C, 1 ¤ i, j ¤ d.

We observe that, in this specic case, there is no extra-randomness introduced at each switching time and so there is no need to consider an enlarged ltration. In this setting, Theorem 2.1 is already known and Assumption 2.2 is fullled, see e.g. [16, 17].

2.2 Uniqueness of solutions to reected BSDEs with general costs In this section, we extend the uniqueness result of Corollary 2.1. Namely, we consider the case where inf

1¤i¤d,uPC

¯ c

ui

ˆ c can be nonpositive, meaning that only Assumptions 2.1-ii) and 2.2 hold here. Assuming that D has a non empty interior, we are then able to show uniqueness to (2.6)-(2.7)-(2.8) in Proposition 2.1 below.

Fix y

0

in the interior of D . It is clear that for all 1 ¤ i ¤ d , y

i0

¡ sup

uPC

#

d

¸

j1

P

i,ju

y

0j

c ¯

ui

+

.

(10)

We set, for all 1 ¤ i ¤ d and u P C ,

˜

c

ui

: y

0i

¸

d j1

P

i,ju

y

j0

¯ c

ui

¡ 0,

so that ˆ ˜ c : inf

1¤i¤d,uPC

c ˜

ui

¡ 0 by compactness of C . We also consider the following set D ˜ :

#

˜

y P R

d

: ˜ y

i

¥ sup

uPC

#

d

¸

j1

P

i,ju

y ˜

j

˜ c

ui

+

, 1 ¤ i ¤ d +

.

Lemma 2.1. Assume that D has a non empty interior. Then, D ˜ y y

0

: y P D (

.

Proof. If y P D , let y ˜ : y y

0

. For 1 ¤ i ¤ d and u P C , we have

˜

y

i

y

i

y

i0

¥

¸

d j1

P

i,ju

y

j

¯ c

ui

y

i0

¸

d j1

P

i,ju

py

j

y

j0

q p¯ c

ui

y

i0

¸

d j1

P

i,ju

y

0j

q

¸

d j1

P

i,ju

y ˜

j

˜ c

ui

, hence y ˜ P D ˜ .

Conversely, let y ˜ P D ˜ and let y : y ˜ y

0

. We can show by the same kind of calculation

that y P D . l

Proposition 2.1. Assume that D has a non empty interior. Under Assumptions 2.1-ii) and 2.2-ii), there exists at most one solution to (2.6)-(2.7)-(2.8) in S

2d

pF

0

q H

2

pF

0

q A

2d

pF

0

q.

Proof. Let us assume that pY

1

, Z

1

, K

1

q and pY

2

, Z

2

, K

2

q are two solutions to (2.6)- (2.7)-(2.8). We set Y ˜

1

: Y

1

y

0

and Y ˜

2

: Y

2

y

0

. Then one checks easily that p Y ˜

1

, Z

1

, K

1

q and p Y ˜

2

, Z

2

, K

2

q are solutions to (2.6)-(2.7)-(2.8) with terminal condition ξ ˜ ξ y

0

, driver f ˜ given by

f ˜

i

p t, y ˜

i

, z

i

q : f

i

p t, y ˜

i

y

i0

, z

i

q , 1 ¤ i ¤ d, t P r 0, T s , y ˜ P R

d

, z P R

dκ

,

and domain D ˜ . This domain is associated to a randomised switching problem with ˆ ˜

c ¡ 0 , hence Corollary 2.1 gives that p Y ˜

1

, Z

1

, K

1

q p Y ˜

2

, Z

2

, K

2

q which implies the

uniqueness. l

(11)

2.3 Proof of the representation result

We prove here our main result for this part, namely Theorem 2.1. It is divided in several steps.

2.3.1 Preliminary estimates

We rst introduce auxiliary processes associated to an admissible strategy and prove some key integrability properties.

Let i P t 1, . . . , d u and t P r 0, T s. We set, for φ P A

ti

and t ¤ s ¤ T , Y

sφ

: ¸

k¥0

Y

sζk

1

rτkk 1q

p s q , (2.10) Z

sφ

: ¸

k¥0

Z

sζk

1

rτkk 1q

p s q , (2.11) K

φs

: ¸

k¥0

K

sζk

1

rτkk 1q

psq, (2.12) M

φs

: ¸

k¥0

Y

τζkk11

E

Y

τζkk11

F

τk

k 1

1

tt τk 1¤su

, (2.13) A

φs

¸

k¥0

Y

τζkk1

E

Y

τζkk11

F

τk

k 1

¯ c

αζk 1

k

1

tt τk 1¤su

. (2.14) Remark 2.5. For all k ¥ 0 , since α

k 1

P F

τk

k 1

, we have E

Y

τζkk11

F

τk

k 1

¸

d j1

P

ζ

k 1

j | F

τk

k 1

Y

τj

k 1

¸

d j1

P

ζαk

k,j

Y

τj

k 1

. (2.15) Lemma 2.2. Assume that assumption (2.2) is satised. For any admissible strategy φ P A

ti

, M

φ

is a square integrable martingale with M

φt

0 . Moreover, A

φ

is increasing and satises A

φT

P L

2

p F

T8

q. In addition,

E

¸

k¥0

Y

τζkk11

E

Y

τζkk11

F

τk

k 1

1

tτk 1¤tu

2

F

t0

  8 a.s. (2.16)

Proof. Let φ P A

ti

. Using (2.14) and (2.15), we have, for all s P r t, T s, A

φs

¸

k¥0

Y

τζkk1

¸

d j1

P

ζαk 1

k,j

Y

τjk 1

¯ c

αζk 1

k

1

tt τk 1¤su

, (2.17) which is increasing since each summand is positive as Y P D .

We have, for t ¤ s ¤ T , Y

sφ

Y

tφ

¸

k¥0

Y

τζkk1^s

Y

τζkk^s

¸

k¥0

Y

τζkk11

Y

τζkk1

1

tt τk 1¤su

. (2.18)

(12)

Using (2.6), we get, for all k ¥ 0 , Y

τζkk1^s

Y

τζkk^s

»

τk 1^s

τk^s

f

ζk

p u, Y

uζk

, Z

uζk

q du

»

τk 1^s

τk^s

Z

uζk

dW

u

»

τk 1^s

τk^s

dK

uζk

,

Recalling ζ

k

is F

τk

-measurable. We also have, using (2.15), for all k ¥ 0 , Y

τζkk11

Y

τζkk1

Y

τζkk11

E

Y

τζkk11

F

τk

k 1

Y

τζkk1

¸

d j1

P

ζαk 1

k,j

Y

τjk 1

c ¯

αζk 1

k

¯ c

αζk 1

k

.

Plugging the two previous equalities into (2.18), we get:

Y

sφ

Y

tφ

¸

k¥0

»

τk 1^s

τk^s

f

ζk

p u, Y

uζk

, Z

uζk

q du

»

τk 1^s

τk^s

Z

uζk

dW

u

»

τk 1^s

τk^s

dK

uζk

¸

k¥0

Y

τζkk11

E

Y

τζkk11

F

τk

k 1

1

tt τk 1¤su

A

φs

A

φt

¸

k¥0

Y

τζk

k 1

¸

d j1

P

ζαk 1

k,j

Y

τj

k 1

c ¯

αζk 1

k

1

tt τk 1¤su

. (2.19) By denition of Y

φ

, Z

φ

, K

φ

, M

φ

, A

φ

, we obtain, for all s P r t, T s,

Y

sφ

ξ

aT

»

T

s

f

au

pu, Y

uφ

, Z

uφ

qdu

»

T

s

Z

uφ

dW

u

»

T

s

dM

φu

»

T

s

dA

φu

A

φT

K

φT

A

φs

K

sφ

. (2.20)

For any n ¥ 1 , we consider the admissible strategy φ

n

p ζ

0

, p τ

kn

q

k¥0

, p α

nk

q

k¥1

q dened by ζ

0n

i ζ

0

, τ

kn

τ

k

, α

nk

α

k

for k ¤ n , and τ

kn

T 1 for all k ¡ n . We set Y

n

: Y

φn

, Z

n

v : Z

φn

, and so on.

By (2.20) applied to the strategy φ

n

, we get, recalling that A

nt

0 , A

nτ

n^T

Y

tn

Y

τn

n^T

»

τn^T

t

f

ans

p s, Y

sn

, Z

sn

q ds

»

τn^T

t

Z

sn

dW

s

»

τn^T

t

dM

ns

»

τn^T

t

dA

ns

»

τn^T

t

dK

ns

. (2.21)

We obtain, for a constant Λ ¡ 0 , E

| A

nτn^T

|

2

¤ Λ

E

| Y

tn

|

2

| Y

τnn^T

|

2

»

τn^T

t

| f

ans

p s, Y

sn

, Z

sn

q|

2

ds (2.22)

»

τn^T

t

| Z

sn

|

2

ds

»

τn^T

t

dr M

n

s

s

pA

φT

A

φt

q

2

p K

nT

q

2

.

(13)

We have

E

| Y

rn

|

2

¤

¸

d j1

E

| Y

rj

|

2

E

| Y

r

|

2

¤ E

sup

t¤r¤T

| Y

r

|

2

} Y }

2S2

dpF0q

, E

»

τn^T

t

| Z

sn

|

2

ds

¤ } Z }

H2pF0q

, E

»

τn^T

t

| f

ans

p s, Y

sn

, Z

sn

q|

2

ds

¤ 4L

2

T } Y }

2S2

dpF0q

4L

2

} Z }

H2

dpF0q

2 } f p , 0, 0 q}

2H2

dpF0q

and

E

p K

nT

q

2

¤ E

| K

T

|

2

.

Thus, by these estimates and the fact that A

φT

A

φt

P L

2

p F

T8

q as φ is admissible, there exists a constant Λ

1

¡ 0 such that

E | A

nτ

n^T

|

2

¤ Λ

1

Λ E

»

τn^T t

d r M

n

s

s

. (2.23)

Using (2.20) applied to φ

n

and Itô's formula between t and τ

n

^ T , since M

n

is a square integrable martingale orthogonal to W and A

n

, A

n

, K

n

are nondecreasing and nonnegative, we get

E

| Y

tn

|

2

»

τn^T

t

| Z

sn

|

2

ds

»

τn^T

t

d r M

n

s

s

E

| Y

τnn^T

|

2

2

»

τn^T

t

Y

sn

f

ans

ps, Y

sn

, Z

sn

qds 2

»

τn^T

t

Y

sn

dA

ns

2

»

τn^T

t

Y

sn

dA

ns

2

»

τn^T

t

Y

sn

dK

ns

¤ E

| Y

τn

n^T

|

2

2 E

»

τn^T

t

| Y

sn

f

ans

p s, Y

sn

, Z

sn

q| ds

2 E

»

τn^T

t

| Y

sn

| dA

ns

2 E

»

τn^T

t

| Y

sn

| dA

ns

2 E

»

τn^T

t

| Y

sn

| dK

ns

. (2.24)

We have, using Young's inequality, for some ¡ 0 , and (2.23), E

»

τn^T

t

| Y

sn

f

ans

p s, Y

sn

, Z

sn

q| ds

¤ 1 2 E

»

τn^T

t

| Y

sn

|

2

ds

1

2 E

»

τn^T

t

| f

ans

p s, Y

sn

, Z

sn

q|

2

ds

¤ T p 1

2 2L

2

q} Y }

2S2

dpF0q

2L

2

} Z }

H2

dpF0q

} f p , 0, 0 q}

2H2

dpF0q

, E

»

τn^T

t

| Y

sn

| dA

ns

¤ 1 2 } Y }

2S2

dpF0q

1 2 E

p A

φT

A

φt

q

2

,

E

»

τn^T

t

| Y

sn

| dK

ns

¤ 1 2 } Y }

2S2

dpF0q

1 2 E

| K

T

|

2

,

(14)

and

E

»

τn^T

t

| Y

sn

| dA

ns

¤ 1 2 } Y }

2S2

dpF0q

2 E

p A

nτ

n^T

q

2

¤ 1 2 } Y }

2S2

dpF0q

2

Λ

1

Λ E

»

τn^T

t

d r M

n

s

s

.

Using these estimates together with (2.24) gives, for a constant C

¡ 0 independent of n ,

p 1 Λ q E

»

τn^T

t

d r M

n

s

s

¤ C

} Y }

2S2

dpF0q

} Z }

H2pF0q

} f p , 0, 0 q}

2H2

dpF0q

E

| K

T

|

2

, (2.25) and chosing

1

gives that E ³

τn^T

t

d r M

n

s

s

is upper bounded independently of n . We also get an upper bound independent of n for E

p A

nτ

n^T

q

2

by (2.23).

Since ³

τn^T

t

d r M

n

s

s

(resp. | A

nτ

n^T

|

2

) is nondecreasing to ³

T

t

d r M

φ

s

s

(resp. to | A

φT

|

2

), we obtain by monotone convergence the rst part of Lemma (2.2). It is also clear that M

φ

is a martingale satisfying M

φt

0 .

We now prove (2.16). Using that E

pN

tφ

q

2

F

t0

is almost-surely nite as φ is admissible and ˆ c ¡ 0 , we compute,

E

¸

k¥0

Y

τζkk11

E

Y

τζkk11

F

τk

k 1

1

tτk 1¤tu

2

F

t0

¤ E

¸

k¥0

Y

τζkk11

¸

d j1

p

αζk 1

k,j

Y

tj

1

tτk 1¤tu

2

F

t0

¤ 4 | Y

t

|

2

E

p N

tφ

q

2

F

t0

  8 a.s. (2.26)

l 2.3.2 An optimal strategy

We now introduce a strategy, which turns out to be optimal for the control problem.

This strategy is the natural extension to our setting of the optimal one for classical switching problem, see e.g. [17]. The rst key step is to prove that this strategy is admissible, which is more involved than in the classical case due to the randomisation.

Let φ

p ζ

0

, p τ

n

q

n¥0

, p α

n

q

n¥1

q dened by τ

0

t and ζ

0

i and inductively by:

τ

k 1

inf

#

τ

k

¤ s ¤ T : Y

sζk

max

uPC

#

d

¸

j1

P

ζu

k,j

Y

sj

c ¯

uζ k

++

^ p T 1 q , (2.27)

α

k 1

min

#

α P arg max

uPC

#

d

¸

j1

P

ζu

k,j

Y

τj k 1

¯ c

uζ

k

++

, (2.28)

(15)

recall that C is ordered.

In the following lemma, we show that, since D has non-empty interior, the number of switch (hence the cost) required to leave any point on the boundary of D is square integrable, following the strategy φ

. This result will be used to prove that the cost associated to φ

satises E

A

φt 2

F

t0

is almost surely nite.

Lemma 2.3. Let Assumption 2.2-ii) hold. For y P D , we dene S p y q

#

1 ¤ i ¤ d : y

i

max

uPC

#

d

¸

j1

P

i,ju

y

j

c ¯

ui

++

, (2.29)

and p u

i

q

iPSpyq

the family of elements of C given by

u

i

min arg max

uPC

#

d

¸

j1

P

i,ju

y

j

¯ c

ui

+

.

Consider the homogeneous Markov Chain X on S p y q Y t 0 u dened by, for k ¥ 0 and i, j P S p y q

2

,

Pp X

k 1

j | X

k

i q P

i,jui

, Pp X

k 1

0 | X

k

i q 1 ¸

jPSpyq

P

i,jui

,

Pp X

k 1

0 | X

k

0 q 1, Pp X

k 1

i | X

k

0 q 0.

Then 0 is accessible from every i P S p y q, meaning that X is an absorbing Markov Chain.

Moreover, let N p y q inf t n ¥ 0 : X

n

0 u. Then N p y q P L

2

pP

i

q for all i P S p y q, where P

i

is the probability satisfying P

i

p X

0

i q 1 .

Proof. Assume that there exists i P S p y q from which 0 is not accessible. Then every communicating class accessible from i is included in Spyq. In particular, there exists a recurrent class S

1

€ S p y q. For all i P S

1

, we have P

i,jui

0 if j R S

1

since S

1

is recurrent.

Moreover, since S

1

€ S p y q, we obtain, for all i P S

1

, by denition of S p y q, y

i

¸

jPS1

P

i,jui

y

j

c ¯

uii

. (2.30) Since S

1

is a recurrent class, the matrix P ˜ p P

i,jui

q

i,jPS1

is stochastic and irreducible.

By denition of D , we have

D € R

d|S1|

$ &

% z P R

|S1|

| z

i

¥ ¸

jPS1

P

i,jui

z

j

c ¯

uii

, i P S

1

, .

- R

d|S1|

D

1

.

(16)

With a slight abuse of notation, we do not renumber coordinates of vectors in D

1

. Let i

0

P S

1

and let us restrict ourself to the domain D

1

. According to Lemma 3.1, D

1

is invariant by translation along the vector p 1, ..., 1 q of R

|S1|

. Moreover, Assumption 3.1 is fullled since P ˜ is irreducible and controls p u

i

q

iPSpyq

are set. So, Proposition 3.1 gives us that D

1

X t z P R

|S1|

| z

i0

0 u is a compact convexe polytope. Recalling (2.30), we see that py

i

y

i0

q

iPS1

is a point of D

1

X tz P R

|S1|

|z

i0

0u that saturates all the inequalities.

So, p y

i

y

i0

q

iPS1

is an extreme points of D

1

X t z P R

|S1|

| z

i0

0 u and all extreme points are given by

E :

$ &

% z P R

|S1|

| z

i

¸

jPS1

P

i,jui

z

j

¯ c

uii

, i P S

1

, z

i0

0 , . - .

Recalling that D

1

is compact, E is a nonempty bounded ane subspace of R

|S1|

, so it is a singleton. Since D

1

X t z

i0

0 u is a compact convex polytope, it is the convex hull of E and so it is also a singleton. Hence D

1

is a line in R

|S1|

. Moreover, |S

1

| ¥ 2 as P

i,iu

1 for all u P C and i P t 1, . . . , d u. Thus D € R

d|S1|

D

1

gives a contradiction with the fact that D has non-empty interior and the rst part of the lemma is proved.

Finally, we have N p y q P L

2

pP

i

q for all i P S p y q thanks to Theorem 3.3.5 in [18]. l Lemma 2.4. Assume that assumption (2.2) is satised. The strategy φ

is admissible.

Proof. For n ¥ 1 , we consider the admissible strategy φ

n

p ζ

0

, p τ

kn

q

k¥0

, p α

nk

q

k¥1

q dened by ζ

0n

i ζ

0

, τ

kn

τ

k

, α

nk

α

k

for k ¤ n, and τ

kn

T 1 for all k ¡ n. We set Y

sn

: Y

sφn

, Z

sn

: Z

sφn

and so on, for all s P r t, T s.

By denition of τ

, α

, recall (2.27)-(2.28), it is clear that A

ns^τ

n

0 and that ³

τk 1^s

τk^s

dK

uζk

0 for all k   n and s P rt, T s. The identity (2.20) for the admissible strategy φ

n

gives

Y

tn

Y

τn n^T

»

τ n^T t

f

ans

p s, Y

sn

, Z

sn

q ds

»

τ n^T t

Z

sn

dW

s

»

τ n^T t

dM

ns

»

τ n^T t

dA

ns

. Using similar arguments and estimates as in the precedent proof, we get

E | A

nτ

n^T

A

nt

|

2

¤ Λ

1

Λ E

»

τn^T

t

d r M

u

s

s

, (2.31)

and, for ¡ 0 , p 1 Λ q E

»

τn^T

t

d r M

n

s

s

¤ C

} Y }

2S2

dpF0q

} Z }

H2

dκpF0q

} f p , 0, 0 q}

2H2

dpF0q

. (2.32) Choosing

1

gives that E

³

τ n^T

t

d r M

n

s

s

and E | A

nτ

n^T

A

nt

|

2

are upper bounded uniformly in n , hence by monotone convergence, we get that A

φT

A

φt

P L

2

p F

T8

q.

It remains to prove that A

φt

P L

2

p F

t0

q. We have A

φt

¤ ˇ cN

tφ

, and E

p N

tφ

q

2

F

t0

  8 a.s. is immediate from Lemma 2.3, since E

p N

tφ

q

2

F

t0

Ψ p Y

t

q with Ψ p y q E

i

p N p y qq

2

, y P D , where E

i

is the expectation under the probability P

i

dened in

Lemma 2.3. l

Références

Documents relatifs

The optical anisotropy of the obliquely deposited PbTe layers is studied using the Smet relations /5/ and a graphical interpolation method.. The Smet rela- tions allowed

Escape probability; Absorption probability; Obliquely reflected Brownian motion in a quadrant; Functional equation; Carleman boundary value problem; Laplace transform;

Let us mention Foddy (1984) (dealing with the particular case of a Brownian motion with identity covariance matrix), Baccelli and Fayolle (1987) (on a diffusion having a quite

Considering this two-dimensional transmutation- diffusion process (I, X I ) as a new forward process, the introduction of a well chosen con- strained BSDE with jumps allows for

La plupart des demandeurs d’emploi inscrits à Pôle emploi sont tenus de faire des actes positifs de recherche d’emploi : certains sont sans emploi (catégorie A), d’autres

[r]

De nature simpliste, cette méthode mesure par voie colorimétrique le produit de la réaction de l’acide thiobarbiturique avec la malonaldéhyde (MDA), un sous – produit des lipides

Occupancy density; Green’s function; Obliquely reflected Brownian motion in a half- plane; Stationary distribution; Exact Asymptotics; Martin boundary; Laplace transform;