• Aucun résultat trouvé

On a class of stochastic optimal control problems related to BSDEs with quadratic growth

N/A
N/A
Protected

Academic year: 2021

Partager "On a class of stochastic optimal control problems related to BSDEs with quadratic growth"

Copied!
19
0
0

Texte intégral

(1)

HAL Id: hal-00451623

https://hal.archives-ouvertes.fr/hal-00451623

Submitted on 20 Dec 2017

HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.

to BSDEs with quadratic growth

Marco Fuhrman, Ying Hu, Gianmario Tessitore

To cite this version:

Marco Fuhrman, Ying Hu, Gianmario Tessitore. On a class of stochastic optimal control problems

related to BSDEs with quadratic growth. SIAM Journal on Control and Optimization, Society for

Industrial and Applied Mathematics, 2006, 45 (4), pp.1279-1296. �10.1137/050633548�. �hal-00451623�

(2)

Vol. 45, No. 4, pp. 1279–1296

ON A CLASS OF STOCHASTIC OPTIMAL CONTROL PROBLEMS RELATED TO BSDES WITH QUADRATIC GROWTH

MARCO FUHRMAN, YING HU, AND GIANMARIO TESSITORE§

Abstract. In this paper, we study a class of stochastic optimal control problems, where the drift term of the equation has a linear growth on the control variable, the cost functional has a quadratic growth, and the control process takes values in a closed set (not necessarily compact). This problem is related to some backward stochastic differential equations (BSDEs) with quadratic growth and unbounded terminal value. We prove that the optimal feedback control exists, and the optimal cost is given by the initial value of the solution of the related BSDE.

Key words.stochastic optimal control, backward stochastic differential equations, quadratically growing driver, unbounded final condition

AMS subject classifications. 93E20, 60H10 DOI.10.1137/050633548

1. Introduction. In this paper, we consider a controlled equation of the form dX

t

= b(t, X

t

) dt + σ(t, X

t

) [dW

t

+ r(t, X

t

, u

t

) dt], t [0, T ],

X

0

= x.

(1.1)

In the equation, W is an R

d

-valued Wiener process, defined on a complete probability space (Ω, F , P ) with respect to a filtration ( F

t

)

t0

satisfying the usual conditions;

the unknown process X takes values in R

n

; x is a given element of R

n

; and u is the control process, which is assumed to be an ( F

t

)-adapted process taking values in a given nonempty closed set K R

m

. The control problem consists of minimizing a cost functional of the form

J = E

T

0

g(t, X

t

, u

t

) dt + E φ(X

T

).

(1.2)

We suppose that r has a linear growth in u, g has quadratic growth in x and u, and φ has quadratic growth in x.

The main novelty of the present paper, in comparison with the existing literature, is that on the one hand we assume that neither K nor r is bounded; on the other hand we consider a degenerate control problem (since nothing is assumed on the image of σ). Moreover, we also allow φ to have quadratic growth.

Received by the editors June 14, 2005; accepted for publication (in revised form) March 24, 2006;

published electronically September 26, 2006.

http://www.siam.org/journals/sicon/45-4/63354.html

Dipartimento di Matematica, Politecnico di Milano, piazza Leonardo da Vinci 32, 20133 Milano, Italy (marco.fuhrman@polimi.it). This author was supported by the European Community’s Human Potential Programme under contracts HPRN-CT-2002-00281 (Evolution Equations) and HPRN-CT- 2002-00279 (QP-Applications).

IRMAR, Universit´e de Rennes 1, Campus de Beaulieu, 35042 Rennes Cedex, France (ying.hu@

univ-rennes1.fr).

§Dipartimento di Matematica e Applicazioni, Universit`a di Milano-Bicocca, Via Cozzi 53, 20135, Milano, Italy (gianmario.tessitore@unimib.it). This author was supported by the European Commu- nity’s Human Potential Programme under contract HPRN-CT-2002-00281 (Evolution Equations).

1279

Downloaded 12/20/17 to 129.20.36.117. Redistribution subject to SIAM license or copyright; see http://www.siam.org/journals/ojsa.php

(3)

Nonlinear backward stochastic differential equations (BSDEs) were first intro- duced by Pardoux and Peng [11]. A major application of BSDEs is in stochastic control; see, e.g., [4] and [12]. We also refer the reader to [9], [5], and [7]. From several points of view, the class of control problems addressed in these papers, and in the references therein, is more general than the one considered here, but assumptions implying “bounded control image” (i.e., boundedness of K or r in our notation) are a common feature of all the above papers. The special “unbounded” case correspond- ing to the assumptions K = R

m

and g =

12

| u |

2

+ q(t, x) is treated in [6] by an ad hoc exponential transform. We notice that in [6] φ is allowed to take the value + . Finally, the same special case (in which the Hamiltonian is exactly the square of the norm of the gradient) was treated in [8] by analytic techniques under nondegeneracy assumptions and in an infinite-dimensional framework.

The difficulty here is that the Hamiltonian corresponding to the control prob- lem has quadratic growth in the gradient and consequently the associated BSDE has quadratic growth in the Z variable. Well-posedness for this class of BSDEs has been proved in [10] in the case of bounded terminal value. Since we allow for unbounded terminal cost, to treat such equations we have to apply the techniques recently intro- duced in [2]. We notice that for such BSDEs no general uniqueness results are known:

we replace uniqueness with the selection of a maximal solution. Moreover, the usual application of the Girsanov technique is not allowed (since the Novikov condition is not guaranteed), and we have to develop specific arguments both to prove the fun- damental relation (see section 4) and to obtain the existence of a (weak) solution to the closed loop equation (see section 5). Our main result is to prove that the optimal feedback control exists and the optimal cost is given by the value Y

0

of the maximal solution (Y, Z ) of the BSDE with the quadratic growth and unbounded terminal value mentioned above. Moreover, we show that we can construct an optimal feedback in terms of the process Z. Finally, we prove that if we fix a particular optimal feed- back law, then the solution of the corresponding closed loop equation is unique; see Proposition 5.4.

An alternative approach to the control problem may consist of applying the stochastic maximum principle; see, e.g., [12] for a detailed exposition. However, this would require further differentiability conditions on the coefficients of the controlled equation and would not immediately imply existence of an optimal control, since the maximum principle is usually stated as a necessary condition for optimality.

The paper is organized as follows. In the next section, we describe the control problem; in section 3, we study the related BSDE; and in section 4, we establish the fundamental relation between the optimal control problem and BSDE. The last section is devoted to the proof of the existence of optimal feedback control.

2. The control problem. We consider the optimal control problem given by a state equation of the form

dX

t

= b(t, X

t

) dt + σ(t, X

t

) [dW

t

+ r(t, X

t

, u

t

) dt], t [0, T ], X

0

= x,

(2.1)

and given by a cost functional of the form J = E

T 0

g(t, X

t

, u

t

) dt + E φ(X

T

).

(2.2)

We work under the following assumptions.

Downloaded 12/20/17 to 129.20.36.117. Redistribution subject to SIAM license or copyright; see http://www.siam.org/journals/ojsa.php

(4)

Hypothesis 2.1.

1. The process W is a Wiener process in R

d

, defined on a complete probability space (Ω, F , P ) with respect to a filtration ( F

t

) satisfying the usual conditions.

2. The set K is a nonempty closed subset of R

m

.

3. The functions b : [0, T ] × R

n

R

n

, σ : [0, T ] × R

n

R

n×d

, r : [0, T ] × R

n

× K R

d

, g : [0, T ] × R

n

× K R , φ : R

n

R are Borel measurable.

4. For all t [0, T ], x R

n

, r(t, x, · ) and g(t, x, · ) are continuous functions from K to R

d

and from K to R , respectively.

5. There exists a constant C > 0 such that for every t [0, T ], x, x

R

n

, u K it holds that

| b(t, x) b(t, x

) | ≤ C | x x

| , | b(t, x) | ≤ C(1 + | x | ), (2.3)

| σ(t, x) σ(t, x

) | ≤ C | x x

| , (2.4)

| σ(t, x) | ≤ C, (2.5)

| r(t, x, u) r(t, x

, u) | ≤ C(1 + | u | ) | x x

| , (2.6)

| r(t, x, u) | ≤ C(1 + | u | ), (2.7)

0 g(t, x, u) C(1 + | x |

2

+ | u |

2

), (2.8)

0 φ(x) C(1 + | x |

2

).

(2.9)

6. There exist R > 0 and c > 0 such that for every t [0, T ], x R

n

, and every u K satisfying | u | ≥ R,

g(t, x, u) c | u |

2

. (2.10)

We will say that an ( F

t

)-adapted stochastic process { u

t

, t [0, T ] } with values in K is an admissible control if it satisfies

E

T

0

| u

t

|

2

dt < . (2.11)

This square summability requirement is justified by (2.10): a control process which is not square summable would have infinite cost.

Remark 2.2. Some classes of linear quadratic control problems fall within the scope of our result. Consider the controlled system

dX

t

= A(t)X

t

dt + b(t) dt + Σ(t) [dW

t

+ B(t)u

t

dt], t [0, T ], X

0

= x

(2.12)

and the cost functional J = 1

2 E

T

0

[ Q(t)X

t

, X

t

+ R(t)u

t

, u

t

] dt + 1

2 E SX

T

, X

T

,

Downloaded 12/20/17 to 129.20.36.117. Redistribution subject to SIAM license or copyright; see http://www.siam.org/journals/ojsa.php

(5)

and assume that A, b, Σ, B, Q, R are bounded Borel measurable functions with values in R

n×n

, R

n

, R

n×d

, R

d×m

, R

n×n

, R

m×m

, respectively, and that S R

n×n

; also suppose that Q(t) 0, S 0, R(t) cI for some c > 0 and all t, in the sense of the usual matrix order. Then Hypothesis 2.1 is verified. We are not assuming K = R

m

in general. If K = R

m

, then the usual linear quadratic theory applies and gives more general results in the sense that state- and control-dependent noise can be considered, as well as a more general drift coefficient, in (2.12).

Next we show that for every admissible control, the solution to (2.1) exists.

Proposition 2.3. Let u be an admissible control. Then there exists a unique, continuous, ( F

t

)-adapted process X satisfying E sup

t[0,T]

| X

t

|

2

< , and P -a.s., X

t

= x +

t 0

b(s, X

s

) ds +

t

0

σ(s, X

s

) dW

s

+

t

0

σ(s, X

s

) r(s, X

s

, u

s

) ds, t [0, T ].

Proof. The proof of Proposition 2.3 relies on an approximation procedure that will be used again in what follows. We introduce the sequence of stopping times

τ

n

= inf

t [0, T ] :

t

0

| u

s

|

2

ds > n

,

with the convention that τ

n

= T if the indicated set is empty. By (2.11), for P -almost every ω Ω, there exists an integer N (ω) depending on ω such that

n N (ω) = τ

n

(ω) = T.

(2.13)

Let us fix u

0

K, and for every n, let us define u

nt

= u

t

1

t≤τn

+ u

0

1

t>τn

and consider the equation

dX

tn

= b(t, X

tn

) dt + σ(t, X

tn

) [dW

t

+ r(t, X

tn

, u

nt

) dt], t [0, T ], X

0n

= x.

(2.14)

We claim that (2.14) has a unique solution X

n

in the class of ( F

t

)-adapted pro- cesses X satisfying sup

t[0,T]

E | X

t

|

2

< .

To prove the claim we use a classical argument: We first write (2.14) in the form dX

t

= b(t, X

t

) dt + b(t, X

t

) dt + σ(t, X

t

) dW

t

,

where b(t, x) = σ(t, x, u

nt

)r(t, x, u

nt

) is a stochastic coefficient which satisfies, by (2.3)–

(2.7),

| b(t, x) b(t, x

) | = | σ(t, x, u

nt

)r(t, x, u

nt

) σ(t, x

, u

nt

)r(t, x

, u

nt

) | ≤ C(1 + | u

nt

| ) | x x

| ,

| b(t, x) | = | σ(t, x, u

nt

)r(t, x, u

nt

) | ≤ C(1 + | u

nt

| ), t [0, T ], x, x

R

n

. We define K

t

(ω) = C(1 + | u

nt

(ω) | ), and we note that

T 0

| K

t

|

2

dt C

1 +

T

0

| u

nt

|

2

dt

C(1 + n + | u

0

|

2

) =: C, P -a.s.,

Downloaded 12/20/17 to 129.20.36.117. Redistribution subject to SIAM license or copyright; see http://www.siam.org/journals/ojsa.php

(6)

by the definition of τ

n

. Next we introduce the norm X

2

:= sup

t[0,T]

e

2λt

E | X

t

|

2

and prove that the mapping Γ defined by

Γ(X)

t

=

t

0

b(s, X

s

) ds +

t

0

b(s, X

s

) ds +

t

0

σ(s, X

s

) dW

s

, t [0, T ], is a contraction with respect to this norm for sufficiently large λ > 0. For a pair of processes X, X

, we have

E

t

0

[ b(s, X

s

) b(s, X

s

)] ds

2

E

t 0

K

s

| X

s

X

s

| ds

2

E

t

0

| K

s

|

2

ds

t

0

| X

s

X

s

|

2

ds

C E

t

0

| X

s

X

s

|

2

ds

C X X

2

t

0

e

2λs

ds, and it follows that

sup

t∈[0,T]

e

2λt

E

t

0

[ b(s, X

s

) b(s, X

s

)] ds

2

C X X

2

t

0

e

2λ(ts)

ds

C

X X

2

.

The other verifications needed to prove the contraction property are standard. The claim is proved.

It is clear that the solution X

n

of (2.14) is also continuous. Moreover, we have X

tn

= X

tn+1

for t τ

n

;

therefore there exists a process X such that

X

t

= X

tn

for t τ

n

,

and X is clearly the required solution. The property E sup

t[0,T]

| X

t

|

2

< is an immediate consequence of the following lemma, which concludes the proof.

Lemma 2.4. Under the previous assumptions, the family of random variables sup

t∈[0,T]

| X

tn

|

2

, n = 1, 2, . . . , is uniformly integrable.

Proof. We set M

tn

=

t

0

σ(s, X

sn

) dW

s

. By (2.14), we have X

tn

= x +

t 0

b(s, X

sn

) ds +

t

0

σ(s, X

sn

)r(s, X

sn

, u

ns

) ds + M

tn

.

First, we claim that the family { sup

t[0,T]

| M

tn

|

2

; n = 1, 2, . . . } is uniformly integrable;

indeed it is uniformly bounded in L

p

(Ω, F , P ) for every p [1, ), since by the Burkholder–Davis–Gundy inequalities and the boundedness of σ (see (2.5)),

E sup

t∈[0,T]

| M

tn

|

2p

C E

T 0

| σ(s, X

sn

) |

2

ds

p

C

Downloaded 12/20/17 to 129.20.36.117. Redistribution subject to SIAM license or copyright; see http://www.siam.org/journals/ojsa.php

(7)

for some constant independent of n. Next we note that, by (2.7),

t

0

σ(s, X

sn

)r(s, X

sn

, u

ns

) ds

2

C

t 0

(1 + | u

ns

| )

2

ds C

1 +

T

0

| u

s

|

2

ds

. Then, setting l

nt

= sup

r[0,t]

| X

rn

|

2

, we obtain by (2.3),

l

nt

C

1 +

T

0

| u

s

|

2

ds + sup

t∈[0,T]

| M

tn

|

2

+ C

t

0

| X

sn

|

2

ds

C

1 +

T

0

| u

s

|

2

ds + sup

t∈[0,T]

| M

tn

|

2

+ C

t

0

l

ns

ds.

From the Gronwall lemma we deduce l

nt

C

1 +

T 0

| u

s

|

2

ds + sup

t∈[0,T]

| M

tn

|

2

, and the required uniform integrability follows.

The stochastic control problem associated with (2.1)–(2.2) consists of minimizing the cost functional J(x, u) among all the admissible controls.

3. The forward-backward system. We consider again the functions b, σ, g, φ satisfying the assumptions in Hypothesis 2.1. We define the Hamiltonian function

ψ(t, x, z) = inf

u∈K

[g(t, x, u) + z · r(t, x, u)], t [0, T ], x R

n

, z R

d

, (3.1)

where · denotes the usual scalar product in R

d

. We collect some immediate properties of the function ψ.

Lemma 3.1. The map ψ is a Borel measurable function from [0, T ] × R

n

× R

d

to R . There exists a constant C > 0 such that

C(1 + | z |

2

) ψ(t, x, z) g(t, x, u) + C | z | (1 + | u | ) u K.

(3.2)

Moreover, the infimum in (3.1) is attained in a ball of radius C(1 + | x | + | z | ), that is, ψ(t, x, z) = min

u∈K,|u|≤C(1+|x|+|z|)

[g(t, x, u) + z · r(t, x, u)], (3.3)

t [0, T ], x R

n

, z R

d

, and

ψ(t, x, z) < g(t, x, u) + z · r(t, x, u) if | u | > C (1 + | x | + | z | ).

(3.4)

Finally, for every t [0, T ] and x R

n

, z ψ(t, x, z) is continuous on R

d

.

Proof. The measurability of ψ is straightforward since, by the continuity of r and g with respect to u, we have ψ(t, x, z) = inf

uK

[g(t, x, u) + z · r(t, x, u)], t [0, T ], x R

n

, z R

d

, where K is any countable dense subset of K.

Downloaded 12/20/17 to 129.20.36.117. Redistribution subject to SIAM license or copyright; see http://www.siam.org/journals/ojsa.php

(8)

Coming now to the proof of (3.2), we notice that, since g is nonnegative and satisfies (2.10), we have g(t, x, u) c( | u |

2

R

2

) (c and R are the same as in (2.10)).

By (2.7) and (2.8), we have

g(t, x, u) + z · r(t, x, u) c( | u |

2

R

2

) C | z | (1 + | u | ), (3.5)

and it follows that ψ(t, x, z) inf

u∈Rm

[g(t, x, u) + z · r(t, x, u)] inf

u∈Rm

[c( | u |

2

R

2

) C | z | (1 + | u | )]

= C

1

| z |

2

C

2

,

by direct computation, for suitable constants C

1

and C

2

. This proves the left-hand side of (3.2). The right-hand side of (3.2) is immediate, since by (2.7),

ψ(t, x, z) g(t, x, u) + z · r(t, x, u) g(t, x, u) + | z | C(1 + | u | ).

We come now to the second assertion. By (3.5) we get g(t, x, u) + z · r(t, x, u) c | u | | u | − C

c | z |

cR

2

C | z | . (3.6)

On the other hand, if we fix an arbitrary u

0

K, then

g(t, x, u

0

) + z · r(t, x, u

0

) C(1 + | x |

2

+ | u

0

|

2

) + C | z | (1 + | u

0

| ) C

3

(1 + | x |

2

+ | z | ).

(3.7)

Hence, there exists a constant C > 0 such that, if | u | > C(1 + | x | + | z | ), then g(t, x, u) + z · r(t, x, u) > g(t, x, u

0

) + z · r(t, x, u

0

),

and (3.3) follows from the continuity of g and r with respect to u.

Finally, the continuity of ψ(t, x, · ) can be easily proved, taking into account (3.3).

Next we take an arbitrarily complete probability space (Ω, F , P

) and a Wiener process W

in R

d

with respect to P

. We denote by ( F

t

) the associated Brownian filtration, i.e., the filtration generated by W

and augmented by the P

-null sets of F ; ( F

t

) satisfies the usual conditions.

We introduce the forward equation

dX

t

= b(t, X

t

) dt + σ(t, X

t

) dW

t

, t [0, T ], X

0

= x,

(3.8)

whose solution is a continuous ( F

t

)-adapted process, which exists and is unique by classical results. Next we consider the associated backward equation

dY

t

= ψ(t, X

t

, Z

t

) dt + Z

t

dW

t

, t [0, T ], Y

T

= φ(X

T

).

(3.9)

The solution of (3.9) exists in the sense specified by the following proposition.

Proposition 3.2. Assume that b, σ, g, φ satisfy Hypothesis 2.1. Then there exist Borel measurable functions

v : [0, T ] × R

n

R , ζ : [0, T ] × R

n

R

d

Downloaded 12/20/17 to 129.20.36.117. Redistribution subject to SIAM license or copyright; see http://www.siam.org/journals/ojsa.php

(9)

with the following property: For an arbitrarily chosen complete probability space (Ω, F , P

) and Wiener process W

in R

d

, denoting by X the solution of (3.8), the processes Y, Z defined by

Y

t

= v(t, X

t

), Z

t

= ζ(t, X

t

) satisfy

E

sup

t∈[0,T]

| Y

t

|

2

< , E

T

0

| Z

t

|

2

dt < ;

moreover, Y is continuous and nonnegative, and P

-a.s., Y

t

+

T t

Z

s

dW

s

= φ(X

T

) +

T

t

ψ(s, X

s

, Z

s

) ds, t [0, T ].

Finally, this solution is the maximal solution among all the solutions (Y

, Z

) of (3.9) satisfying

E

sup

t∈[0,T]

| Y

t

|

2

< + .

Proof. From Lemma 3.1, there exists a constant C > 0 such that

C(1 + | z |

2

) ψ(t, x, z) g(t, x, u

0

) + C(1 + | u

0

| ) | z | . Let us first note that

E

sup

t∈[0,T]

| X

t

|

p

< ∞ ∀ p 2.

Next, we adopt the same strategy as that in [2] to construct a maximal solution to (3.9); i.e., for each n C, we define the globally Lipschitz continuous function,

ψ

n

(t, x, z) = sup { ψ(t, x, q) n | q z | : q Q

d

} ,

which is decreasing and converges to ψ; then by (Y

n

, Z

n

) we denote the unique solution to the BSDE with Lipschitz coefficient ψ

n

,

dY

tn

= ψ

n

(t, X

t

, Z

tn

) dt + Z

tn

dW

t

, t [0, T ], Y

Tn

= φ(X

T

),

(3.10)

and by (Y

S

, Z

S

) the unique solution to the BSDE,

dY

tS

= [g(t, X

t

, u

0

) + C(1 + | u

0

| ) | Z

tS

| ] dt + Z

tS

dW

t

, t [0, T ], Y

TS

= φ(X

T

),

(3.11)

where C is the same as in (3.2). We notice that, since ψ

n

(t, x, 0) ψ(t, x, 0) 0, then by an application of the comparison theorem (see [4]),

0 Y

tn

Y

tS

.

Then let us introduce the following stopping time: For k 1, τ

k

= inf { t [0, T ] : max( | X

t

| , Y

tS

) > k } , with the convention that τ

k

= T if the indicated set is empty.

Downloaded 12/20/17 to 129.20.36.117. Redistribution subject to SIAM license or copyright; see http://www.siam.org/journals/ojsa.php

(10)

Then (Y

kn

, Z

kn

) := (Y

tnτ

k

, Z

tn

1

t≤τk

) satisfies the following BSDE:

Y

kn

(t) = ξ

kn

+

T

t

1

s≤τk

ψ

n

(s, X

s

, Z

kn

(s))ds

T

t

Z

kn

(s)dW

s

, where of course ξ

nk

= Y

kn

(T ) = Y

τnk

.

For fixed k, Y

kn

is decreasing in n and remains bounded by k. It follows from Lemma 3 in [2] (which is a slight generalization of Proposition 2.4 in [10]) that there exists a process (Y

k

, Z

k

) such that Y

k

is a continuous process, E

T

0

| Z

k

(s) |

2

ds < + , lim

n

sup

t∈[0,T]

| Y

kn

(t) Y

k

(t) | = 0, lim

n

E

T

0

| Z

kn

(t) Z

k

(t) |

2

dt = 0,

and (Y

k

, Z

k

) solves the BSDE Y

k

(t) = ξ

k

+

T t

1

s≤τk

ψ(s, X

s

, Z

k

(s))ds

T

t

Z

k

(s)dW

s

, (3.12)

where ξ

k

= inf

n

Y

τnk

.

On the other hand, τ

k

τ

k+1

, and, from the definition of (Y

kn

, Z

kn

), we have Y

k+1n

(t τ

k

) = Y

kn

(t), Z

k+1n

(t)1

t≤τk

= Z

kn

(t).

Sending n to infinity, we get

Y

k+1

(t τ

k

) = Y

k

(t), Z

k+1

(t)1

tτk

= Z

k

(t).

Now we define Y and Z on [0, T ] by setting

Y

t

= Y

k

(t), Z

t

= Z

k

(t) if t [0, τ

k

].

For P

-a.s. ω, there exists an integer K(ω) such that for k K(ω), τ

k

(ω) = T.

Thus Y is a continuous process, Y

T

= φ(X

T

), and

T

0

| Z

t

|

2

ds < , P

-a.s.

From (3.12), (Y, Z ) satisfies Y

t∧τk

= Y

τk

+

τk t∧τk

ψ(s, X

s

, Z(s))ds

τk

t∧τk

Z (s)dW

s

. (3.13)

By sending k to infinity, we deduce that (Y, Z ) is a solution of (3.9) and lim

n

sup

t∈[0,T]

| Y

tn

Y

t

| = 0, lim

n

T 0

| Z

tn

Z

t

|

2

dt = 0, P

-a.s.

Thus | Z

n

Z | converges to zero in measure d P

dt, and passing, if needed, to a subsequence (that by abuse of language we still denote Z

n

), we can assume that

| Z

n

Z | → 0, d P

dt almost everywhere.

Now, as ψ

n

is globally Lipschitz continuous, from [4] (see also [6]) there exist Borel measurable functions

v

n

: [0, T ] × R

n

R , ζ

n

: [0, T ] × R

n

R

d

such that

Y

tn

= v

n

(t, X

t

), Z

tn

= ζ

n

(t, X

t

).

Downloaded 12/20/17 to 129.20.36.117. Redistribution subject to SIAM license or copyright; see http://www.siam.org/journals/ojsa.php

(11)

It suffices to define

v(t, x) = lim inf

n→∞

v

n

(t, x), and ζ(t, x) = lim inf

n→∞

ζ

n

(t, x) (where the second lim inf is intended coordinatewise) to get

Y

t

= v(t, X

t

), Z

t

= ζ(t, X

t

), which implies that (v, ζ) is the Borel function we look for.

Finally, 0 Y

t

Y

tS

implies that E sup

t∈[0,T]

| Y

t

|

2

< , and from the equation

| Y

t

|

2

+

τk

t

| Z

s

|

2

ds = 2

τk

t

Y

s

ψ(s, X

s

, Z

s

)ds 2

τk

t

Y

s

Z

s

dW

s

, taking into consideration that

Y

s

ψ(s, X

s

, Z

s

) Y

s

(g(s, X

s

, u

0

) + C(1 + | u

0

| ) | Z

s

| ) Y

sS

(g(s, X

s

, u

0

) + C(1 + | u

0

| ) | Z

s

| ), we deduce, by standard arguments, that

E

T

0

| Z

t

|

2

dt < .

Moreover, this solution is the maximal solution among all the solutions (Y

, Z

) satisfying

E

sup

t∈[0,T]

| Y

t

|

2

< + ,

and it suffices to apply Proposition 5 of [2] to deduce that Y

n

Y

and then Y Y

. 4. The fundamental relation. In this section we revert to the notation intro- duced in the first section and still assume that Hypothesis 2.1 holds.

Proposition 4.1. Let v, ζ denote the functions in the statement of Proposi- tion 3.2. Then for every admissible control u and for the corresponding trajectory X starting at x, we have

J(u) = v(0, x) + E

T

0

[ ψ(t, X

t

, ζ(t, X

t

)) + ζ(t, X

t

) · r(t, X

t

, u

t

) + g(t, X

t

, u

t

)] dt.

Proof. We introduce stopping times τ

n

and control processes u

n

as in the proof of Proposition 2.3, and we denote by X

n

the solution to (2.14). Let us define

W

tn

= W

t

+

t

0

r(s, X

sn

, u

ns

) ds.

From the definition of τ

n

and from (2.7), it follows that

T

0

| r(s, X

sn

, u

ns

) |

2

ds C

T

0

(1 + | u

ns

| )

2

ds C

τn

0

(1 + | u

s

| )

2

ds + C C + Cn.

(4.1)

Downloaded 12/20/17 to 129.20.36.117. Redistribution subject to SIAM license or copyright; see http://www.siam.org/journals/ojsa.php

(12)

Therefore defining ρ

n

= exp

T 0

r(s, X

sn

, u

ns

) dW

s

1 2

T 0

| r(s, X

sn

, u

ns

) |

2

ds

,

the Novikov condition implies that E ρ

n

= 1. Setting d P

n

= ρ

n

d P , by the Girsanov theorem W

n

is a Wiener process under P

n

. Let us denote by ( F

tn

) its natural aug- mented filtration. Since

dX

tn

= b(t, X

tn

) dt + σ(t, X

tn

) dW

tn

, t [0, T ], X

0n

= x

has a strong solution by classical results, the process X

n

is also ( F

tn

) adapted. Let us define

Y

tn

= v(t, X

tn

), Z

tn

= ζ(t, X

tn

).

Then by Proposition 3.2, we have

dY

tn

= Z

tn

dW

tn

ψ(t, X

tn

, Z

tn

) dt, t [0, T ], Y

Tn

= φ(X

Tn

)

(4.2) and E

n

T

0

| Z

tn

|

2

dt < , where E

n

denotes expectation with respect to P

n

. It follows that

Y

τn

n

= φ(X

Tn

) +

T

τn

ψ(t, X

tn

, Z

tn

) dt

T

τn

Z

tn

dW

t

T

τn

Z

tn

· r(t, X

tn

, u

nt

) dt.

(4.3)

We note that for every p [1, ) we have ρ

np

= exp

p

T 0

r(s, X

sn

, u

ns

) dW

sn

p

2

2

T 0

| r(s, X

sn

, u

ns

) |

2

ds

· exp

p

2

p 2

T 0

| r(s, X

sn

, u

ns

) |

2

ds

.

By (4.1) the second exponential is bounded by a constant depending on n and p, while the first one has P

n

-expectation, equal to 1. So we conclude that E

n

ρ

np

< . It follows that

E

T

0

| Z

tn

|

2

dt

1/2

= E

n

ρ

n2

T

0

| Z

tn

|

2

dt

1/2

E

n

ρ

n2

1/2

E

n

T 0

| Z

tn

|

2

dt

1/2

< . We conclude that the stochastic integral in (4.3) has zero expectation, and we obtain

E Y

τn

n

= E φ(X

Tn

) + E

T

τn

[ψ(t, X

tn

, Z

tn

) Z

tn

· r(t, X

tn

, u

nt

)] dt.

Since by definition, ψ(t, x, z) z · r(t, x, u) g(t, x, u) 0, we have E Y

τnn

E φ(X

Tn

) + E

T τn

g(t, X

tn

, u

nt

) dt.

(4.4)

Downloaded 12/20/17 to 129.20.36.117. Redistribution subject to SIAM license or copyright; see http://www.siam.org/journals/ojsa.php

(13)

Now we let n → ∞ . By the definition of u

n

and (2.8), E

T τn

g(t, X

tn

, u

nt

) dt = E

T

0

1

t>τn

g(t, X

tn

, u

0

) dt

C E

T

0

1

t>τn

(1 + | X

tn

|

2

+ | u

0

|

2

)dt C E

(T τ

n

)

1 + sup

t∈[0,T]

| X

tn

|

2

, and the right-hand side tends to 0 by Lemma 2.4 and (2.13). Next we note that, again by (2.13), for n N(ω) we have τ

n

(ω) = T and

φ(X

Tn

) = φ(X

τnn

) = φ(X

τn

) = φ(X

T

).

Moreover, by (2.9),

| φ(X

Tn

) | ≤ C(1 + | X

Tn

|

2

) C

1 + sup

t∈[0,T]

| X

tn

|

2

,

and by Lemma 2.4 the right-hand side is uniformly integrable. We deduce that E φ(X

Tn

) E φ(X

T

), and from (4.4) we conclude that lim sup

n→∞

E Y

τn

n

E φ(X

T

).

On the other hand, for n N (ω) we have τ

n

(ω) = T and Y

τnn

= Y

Tn

= φ(X

Tn

) = φ(X

T

).

Since Y

n

is positive, by the Fatou lemma, E φ(X

T

) lim inf

n→∞

E Y

τnn

. We have thus proved that

lim

n→∞

E Y

τn

n

= E φ(X

T

).

(4.5)

Now we return to (4.2) and write Y

τnn

= Y

0n

+

τn 0

ψ(t, X

tn

, Z

tn

) dt +

τn

0

Z

tn

dW

t

+

τn

0

Z

tn

· r(t, X

tn

, u

nt

) dt.

Arguing as before, we conclude that the stochastic integral has zero P -expectation.

Moreover, we have Y

0n

= v(0, x), and, for t τ

n

, also have u

nt

= u

t

, X

tn

= X

t

, and Z

tn

= ζ(t, X

t

). Thus we obtain

E Y

τnn

= v(0, x) + E

τn

0

[ ψ(t, X

t

, ζ (t, X

t

)) + ζ(t, X

t

) · r(t, X

t

, u

t

)] dt and

E

τn

0

g(t, X

t

, u

t

) dt + E Y

τnn

= v(0, x) + E

τn

0

[ ψ(t, X

t

, ζ(t, X

t

)) + ζ(t, X

t

) · r(t, X

t

, u

t

) + g(t, X

t

, u

t

)] dt.

Noting that ψ(t, x, z) + z · r(t, x, u) + g(t, x, u) 0 and recalling that g(t, x, u) 0, by (4.5) and the monotone convergence theorem, we obtain for n → ∞ ,

E

T

0

g(t, X

t

, u

t

) dt + E φ(X

T

) = v(0, x) + E

T 0

[ ψ(t, X

t

, ζ(t, X

t

)) + ζ(t, X

t

) · r(t, X

t

, u

t

) + g(t, X

t

, u

t

)] dt, which gives the required conclusion.

Downloaded 12/20/17 to 129.20.36.117. Redistribution subject to SIAM license or copyright; see http://www.siam.org/journals/ojsa.php

(14)

Corollary 4.2. For every admissible control u and any initial datum x, we have J (u) v(0, x), and the equality holds if and only if the following feedback law holds P -a.s. for almost every t [0, T ]:

ψ(t, X

t

, ζ (t, X

t

)) = ζ(t, X

t

) · r(t, X

t

, u

t

) + g(t, X

t

, u

t

), where X is the trajectory starting at x and corresponding to control u.

5. Existence of optimal controls: The closed loop equation. Let us con- sider again the functions b, σ, g, φ satisfying the assumptions in Hypothesis 2.1. We recall the definition of the Hamiltonian function:

ψ(t, x, z) = inf

u∈K

[g(t, x, u) + z · r(t, x, u)], t [0, T ], x R

n

, z R

d

. (5.1)

Lemma 5.1. There exists a Borel measurable function γ : [0, T ] × R

n

× R

d

K such that

ψ(t, x, z) = g(t, x, γ(t, x, z)) + z · r(t, x, γ(t, x, z)), t [0, T ], x R

n

, z R

d

. (5.2)

Moreover, there exists a constant C > 0 such that

| γ(t, x, z) | ≤ C(1 + | x | + | z | ).

(5.3)

Proof. Consider the function F (t, x, z, u) = g(t, x, u) + z · r(t, x, u), t [0, T ], x R

n

, z R

d

. Clearly F is a Carath´ eodory map (that is, F (t, x, z, · ) is continuous for all t [0, T ], x R

n

, and z R

d

, and F ( · , · , · , u) is Borel measurable for all u K;

see [1, p. 311]). By (3.3) we have

ψ(t, x, z) ∈ { F(t, x, z, u) : u K } ∀ t [0, T ], x R

n

, z R

d

.

Thus by the Filippov theorem (see, e.g., [1, Thm. 8.2.10, p. 316]) there exists a Borel measurable map γ : [0, T ] × R

n

× R

d

K such that F(t, x, z, γ(t, x, z)) = ψ(t, x, z);

see [3] as well. The fact that | γ(t, x, z) | ≤ C(1 + | x | + | z | ) is an immediate consequence of (3.4).

Next we address the problem of finding a weak solution to the so-called closed loop equation. We define

u(t, x) = γ(t, x, ζ(t, x)), t [0, T ], x R

n

, where ζ is defined in Proposition 3.2. The closed loop equation is

dX

t

= b(t, X

t

) dt + σ(t, X

t

) [dW

t

+ r(t, X

t

, u(t, X

t

)) dt], t [0, T ], X

0

= x.

(5.4)

By a weak solution we mean a complete probability space (Ω, F , P ) with a filtration ( F

t

) satisfying the usual conditions, a Wiener process W in R

d

with respect to P and ( F

t

), and a continuous ( F

t

)-adapted process X with values in R

n

satisfying, P -a.s.,

T 0

| u(t, X

t

) |

2

dt < , (5.5)

and such that (5.4) holds. We note that by (2.7) it also follows that

T

0

| r(t, X

t

, u(t, X

t

)) | dt < , P -a.s., so that (5.4) makes sense.

Downloaded 12/20/17 to 129.20.36.117. Redistribution subject to SIAM license or copyright; see http://www.siam.org/journals/ojsa.php

(15)

Proposition 5.2. Assume that b, σ, g, φ satisfy Hypothesis 2.1. Then there exists a weak solution of the closed loop equation, satisfying in addition

E

T

0

| u(t, X

t

) |

2

dt < . (5.6)

Proof. Let us take an arbitrary complete probability space (Ω, F , P

) and a Wiener process W

in R

d

with respect to P

. Let ( F

t

) be the associated Brownian filtration.

We define the process X as the solution of the equation

dX

t

= b(t, X

t

) dt + σ(t, X

t

) dW

t

, t [0, T ], X

0

= x.

(5.7)

The solution is a continuous ( F

t

)-adapted process, which exists and is unique by classical results. Moreover, it satisfies E

sup

t[0,T]

| X

t

|

p

< for every p [1, ).

By Proposition 3.2, setting

Y

t

= v(t, X

t

), Z

t

= ζ(t, X

t

), the following backward equation holds:

dY

t

= ψ(t, X

t

, Z

t

) dt + Z

t

dW

t

, t [0, T ], Y

T

= φ(X

T

),

and we have

E

T

0

| Z

t

|

2

dt < . (5.8)

By (2.7) we have | r(t, X

t

, u(t, X

t

)) | ≤ C(1 + | u(t, X

t

) | ), and by (5.3),

| u(t, X

t

) | = | γ(t, X

t

, ζ(t, X

t

)) | ≤ C(1 + | X

t

| + | ζ(t, X

t

) | ) = C(1 + | X

t

| + | Z

t

| ).

(5.9)

Now let us define stopping times τ

n

= inf

t [0, T ] :

t

0

| u(s, X

s

) |

2

ds > n

,

with the convention that τ

n

= T if the indicated set is empty. By (5.8) and (5.9), for P

-a.s. ω Ω, there exists an integer N(ω) depending on ω such that τ

n

(ω) = T for n N (ω). Let us fix u

0

K, and for every n let us define

u

nt

= u(t, X

t

) 1

t≤τn

+ u

0

1

t>τn

, M

tn

= exp

t 0

r(s, X

s

, u

ns

) dW

s

1 2

t 0

| r(s, X

s

, u

ns

) |

2

ds

,

M

t

= exp

t 0

r(s, X

s

, u(s, X

s

)) dW

s

1 2

t 0

| r(s, X

s

, u(s, X

s

)) |

2

ds

,

W

tn

= W

t

t

0

r(s, X

s

, u

ns

) ds, W

t

= W

t

t 0

r(s, X

s

, u(s, X

s

)) ds.

Downloaded 12/20/17 to 129.20.36.117. Redistribution subject to SIAM license or copyright; see http://www.siam.org/journals/ojsa.php

Références

Documents relatifs

Under general assumptions on the data, we prove the convergence of the value functions associated with the discrete time problems to the value function of the original problem..

In the deteministic case, problems with supremum costs and finite time horizon have been extensively studied in the literature, we refer for instance to [8, 9, 2, 10] where the

Abstract— We present a theory of singular arc, and the corresponding second order necessary and sufficient conditions, for the optimal control of a semilinear parabolic equation

Section 3 contains a notion of a local quadratic growth condition of the Hamiltonian at the presence of control constraints and a formulation of generalized strengthened

The main idea of the proof is that as soon as one knows that the value function of the singular control problem is associated to a constrained BSDE, one can use the fact that such

The results of illite fine fraction dating ob- tained in this study constrain the phase of major cooling, inferred thrusting, and structural uplift to between ca. 60 and

Although the four dimensions and 16 items reported here provide a better theo- retical and empirical foundation for the measurement of PSM, the lack of metric and scalar

Only with the ToF-SIMS analysis is the identification unequivocal (Fig. Second, the in-depth magnetism of a conventional hard disk is imaged using MFM. As layers are being