• Aucun résultat trouvé

Note on Pontryagin maximum principle with running state constraints and smooth dynamics - Proof based on the Ekeland variational principle

N/A
N/A
Protected

Academic year: 2021

Partager "Note on Pontryagin maximum principle with running state constraints and smooth dynamics - Proof based on the Ekeland variational principle"

Copied!
27
0
0

Texte intégral

(1)

HAL Id: hal-01302222

https://hal.archives-ouvertes.fr/hal-01302222

Submitted on 13 Apr 2016

HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires

Note on Pontryagin maximum principle with running state constraints and smooth dynamics - Proof based on

the Ekeland variational principle

Loïc Bourdin

To cite this version:

Loïc Bourdin. Note on Pontryagin maximum principle with running state constraints and smooth dynamics - Proof based on the Ekeland variational principle. [Other] University of Limoges, France.

2016. �hal-01302222�

(2)

Note on Pontryagin maximum principle with running state constraints and smooth dynamics - Proof based on the

Ekeland variational principle

Lo¨ıc Bourdin. University of Limoges, France.

April 13, 2016

Abstract

In this note our aim is to give a proof of the Pontryagin maximum principle for a general optimal control problem with running state constraints and smooth dynamics. Our proof is based on the classical Ekeland variational principle.

The main result (and its proof) of this note are not new and are already well-known in the literature. The aim of the author is only to provide a complete and detailed proof of this classical theorem in the case of smooth dynamics. If you have any remarks or questions, do not hesitate to contact the author atloic.bourdin@unilim.fr.

Contents

1 Main result 2

1.1 A state constrained optimal control problem. . . . 2

1.2 Pontryagin maximum principle . . . . 3

2 Proof 3 2.1 Preliminaries . . . . 3

2.1.1 Some Cauchy-Lipschitz and continuous dependence results . . . . 4

2.1.2 Implicit spike variations and a differentiable dependence result . . . . 5

2.2 Application of the Ekeland variational principle . . . . 7

2.2.1 First inequality depending on fixed . . . . 8

2.2.2 Second inequality independent of . . . . 10

2.3 Introduction of the adjoint vectorp. . . . 11

2.3.1 Introduction ofη . . . . 11

2.3.2 Definition ofp . . . . 11

2.4 End of the proof . . . . 12

A Proof of Lemma 3 13 B Some recalls about Banach spaces geometry 14 B.1 Renorming a separable Banach space. . . . 14

B.2 The distance function . . . . 17

C Functions of bounded variations and Stieltjes integrals 18 C.1 Some recalls . . . . 19

C.2 Notations and Fubini-type formulas. . . . 22

(3)

D State-transition matrices and linear Cauchy-Stieltjes problems 22 D.1 Recalls on state-transition matrices . . . . 23 D.2 Recalls on linear Cauchy problems . . . . 24 D.3 Linear Cauchy-Stieltjes problems . . . . 25

1 Main result

We first introduce some notations available throughout the paper. LetT >0 be fixed. For every nNand every 1r+, we denote by

BFn := BF([0, T],Rn) the classical space of bounded functions endowed with the classical uniform normk · k;

Cn:= C([0, T],Rn) the classical space of continuous functions endowed withk · k;

ACn:= AC([0, T],Rn) the classical space of absolutely continuous functions;

BVn:= BV([0, T],Rn) the classical space of functions with bounded variations endowed with k · kBVn (see Appendix Cfor some recalls);

Lrn := Lr([0, T],Rn) the classical Lebesgue space ofr-integrable functions endowed with its usual normk · kLrn.

In the whole paper, when no confusion is possible, we remove the subscript nand we just denote by BF, C, AC, BV or Lr.

We denote by BF+1 := BF([0, T],R+) and C+1 := C([0, T],R+) whereR+= [0,+).

Then,ηBVnis said to benormalized ifη(0) = 0 andηis left-continuous on (0, T). The subspace of normalized functions with bounded variations will be denoted by NBVn.

Finally, the classical Lebesgue measure on [0, T] will be denoted byλ.

1.1 A state constrained optimal control problem

Letm,nandjN be fixed. In this paper we consider the optimal control problem (OCP) given by

minimize Ψ(q(T)),

subject to qACn, uLm,

˙

q(t) =f(q(t), u(t), t), a.e. t[0, T], q(0) =q0,

u(t)Ω, a.e. t[0, T],

Gi(q(t), t)0, t[0, T], i= 1, . . . , j,

(OCP)

where Ψ :RnRis of class C1, wheref :Rn×Rm×[0, T]Rn is continuous and of class C1 in its two first variables, whereG= (Gi)i=1,...,j :Rn×[0, T]Rj is continuous and of class C1in its first variable, and where q0Rn is fixed and ΩRmis a nonempty closed subset.

(4)

Since Ψ, f and G are all regular, Problem (OCP) is said to be an optimal control problemwith smooth dynamics. The last constraint corresponds torunning state constraints.

We now introduce the HamiltonianH :Rn×Rm×Rn×[0, T]Rassociated to Problem (OCP) defined by

H(q, u, p, t) :=hp, f(q, u, t)iRn×Rn.

1.2 Pontryagin maximum principle

Our main result in this note is given by the following theorem.

Theorem 1 (Pontryagin maximum principle). Let(q, u)ACn×Lm be an optimal solution of Problem (OCP). There exists a nontrivial couple (ψ, η)where ψ0 andη = (ηi)i=1,...,j NBVj

such that

u(t)arg min

v∈Ω

H(q(t), v, p(t), t)

for a.e. t [0, T], where p BVn is the unique global solution of the backward linear Cauchy- Stieltjes problem given by

( dp=1H(q, u, p,·)dt+Pj

i=11Gi(q,·)i, on[0, T], p(T) =ψΨ(q(T)).

In addition, it holds that

ηi is monotically increasing on [0, T] and Z T

0

Gi(q(τ), τ)i) = 0, for every i= 1, . . . , n.

We refer to AppendixCfor some recalls about functions of bounded variations and to AppendixD for details on linear Cauchy-Stieltjes problems.

2 Proof

This section is entirely devoted to the proof of Theorem 1. This proof is based on the classical Ekeland variational principle and is inspired from several references like [3,11].

2.1 Preliminaries

LetuL. In this preliminary section we focus on the forward (nonlinear) Cauchy problem (CPu) given by

( q(t) =˙ f(q(t), u(t), t), a.e. t[0, T],

q(0) =q0. (CPu)

A couple (q, I) is said to be a(local) solution of (CPu) if

1. I[0, T] is an interval with nonempty interior such that minI= 0;

2. q:IRn is absolutely continuous onI andqsatisfies

( q(t) =˙ f(q(t), u(t), t), a.e. tI, q(0) =q0,

(5)

or equivalently,q:IRn is continuous onI andqsatisfies q(t) =q0+

Z t 0

f(q(τ), u(τ), τ)dτ, for everytI.

The couple (q, I) is said to be aglobal solution of (CPu) if I= [0, T].

Let (q, I) and (q, I) be two local solutions of (CPu). We say that (q, I) is anextension of (q, I) if I I andq(t) =q(t) for everytI. We say that (q, I) is amaximal solution of (CPu) if it extends all other local solutions of (CPu).

2.1.1 Some Cauchy-Lipschitz and continuous dependence results Recall the two following classical Cauchy-Lipschitz (or Picard-Lindel¨of) results.

Lemma 1. For every uL, there exists a unique maximal solution of (CPu).

In the sequel we denote by (q(·, u), I(u)) the maximal solution of (CPu) associated touL. Lemma 2. Let uL. If (q(·, u), I(u))is not global (that is, T /I(u)) then I(u)is not closed andq(·, u)is unbounded on I(u).

In the sequel we denote by U L the set of controls uL such that T I(u). A control uU is usually said to be admissible. For everyuU and everyR >kukL, we introduce

Ku,R:={(x, v, t)Rn×Rm×[0, T] | kxq(t, u)kRn1 andkvkRm R}.

From continuity ofq(·, u) on [0, T], Ku,Ris a compact subset ofRn×Rm×[0, T]. As a consequence, f, 1f and2f are bounded on Ku,R by someLu,R0 and it holds that

kf(x2, v2, t)f(x1, v1, t)kRnLu,R(kx2x1kRn+kv2v1kRm), (1) for all (x1, v1, t), (x2, v2, t)Ku,R.

Proposition 1. Let uU. For every R >kukL, there exists νu,R>0 such that Eu,R:= BL(0, R)BL1(u, νu,R)

is contained in U. Moreover, for every u Eu,R, it holds that(q(τ, u), u(τ), τ)Ku,R for a.e.

τ [0, T].

Proof. LetR >kukL and letνu,R>0 be such thatνu,RLu,ReT Lu,R <1. Letu Eu,R. Our aim is to prove that TI(u). By contradiction, let us assume that the set

A:={tI(u) | kq(t, u)q(t, u)kRn>1}

is not empty and let t0 := infA. From continuity, it holds that kq(t0, u)q(t0, u)kRn 1.

Moreover, one hast0>0 since q(0, u) =q(0, u) =q0. Hence, kq(τ, u)q(τ, u)kRn 1 for every τ [0, t0). Therefore (q(τ, u), u(τ), τ) and (q(τ, u), u(τ), τ) belong to Ku,R for a.e. τ [0, t0).

Since one has

q(t, u)q(t, u) = Z t

0

f(q(τ, u), u(τ), τ)f(q(τ, u), u(τ), τ)dτ,

(6)

for everytI(u), it follows from (1) that kq(t, u)q(t, u)kRnLu,R

Z t

0 ku(τ)u(τ)kRm +Lu,R

Z t

0 kq(τ, u)q(τ, u)kRndτ, for everyt[0, t0], which implies from the classical Gronwall lemma that

kq(t, u)q(t, u)kRn Lu,ReT Lu,RkuukL1νRLu,ReT Lu,R <1,

for every t [0, t0]. This raises a contradiction at t = t0. Therefore A is empty. We conclude that q(·, u) is bounded on I(u), then T I(u). Moreover, since A is empty, we also conclude that kq(t, u)q(t, u)kRn 1 for every t [0, T], and thus (q(τ, u), u(τ), τ) Ku,R for a.e.

τ [0, T].

We conclude this section with the following continuous dependence result.

Proposition 2. Let uU andR >kukL. The mapping

Fu,R: (Eu,R,k · kL1) −→ (Cn,k · k) u 7−→ q(·, u) is Cu,R-Lipschitz continuous for someCu,R0.

Proof. Let u and u′′ be two elements of Eu,R U. We know that (q(τ, u′′), u′′(τ), τ) and (q(τ, u), u(τ), τ) are elements of Ku,R for a.e. τ [0, T]. Following the same arguments as in the previous proof, it follows that

kq(t, u′′)q(t, u)kRnLu,ReT Lu,Rku′′ukL1, for everyt[0, T]. The lemma follows withCu,R:=Lu,ReT Lu,R 0.

2.1.2 Implicit spike variations and a differentiable dependence result

Before introducing the concept of implicit spike variations, we first need to recall the following lemma (see [11, Paragraph 3.2 p.143]). The proof is recalled in AppendixA.

Lemma 3. Let h L1n. Then, for all ρ (0,1), there exists a measurable set Qρ [0, T] such that λ(Qρ) =ρT and

sup

t∈[0,T]

Z t 0

11

ρ1Qρ(s)

h(s)ds Rn ρ.

Note that Qρ depends onh.

Let uU andu L. For everyρ[0,1), we introduce the so-calledimplicit spike variation u(·, ρ) ofuassociated tou as

u(τ, ρ) :=

u) if τQρ, u(τ) if τ /Qρ,

for a.e. τ[0, T], where Qρ is defined in Lemma3 associated tohu,uLn L1n defined by hu,u(τ) :=f(q(τ, u), u(τ), τ)f(q(τ, u), u(τ), τ),

for a.e. τ [0, T].1

1Forρ= 0, we fix Qρ=∅.

(7)

Finally, we introduce the so-called variation vector w(·, u, u) associated to (u, u) as the unique maximal solution, which is moreover global (see Appendix D.2), of the forward linear Cauchy problem given by

( w(t) =˙ 1f(q(t, u), u(t), t)×w(t) +hu,u(t), a.e. t[0, T], w(0) = 0.

Let us prove the following differentiability dependence result.

Proposition 3. The mapping Fu,u defined by

Fu,u(ρ) :=q(·, u(·, ρ))Cn,

for sufficiently small ρ0, is Fr´echet-differentiable atρ= 0, with DFu,u(0) =w(·, u, u).

Proof. Let R := max(kukL + 1,kukL). Since λ(Qρ) = ρT (see Lemma 3), it holds that ku(·, ρ)ukL1 2RT ρ for every ρ [0,1). As a consequence, for sufficiently small ρ 0, u(·, ρ)Eu,RU and thenFu,u(ρ) is well-defined. Moreover, it follows from Proposition2that kq(·, u(·, ρ))q(·, u)k 2RT Cu,Rρ, and consequentlyq(·, u(·, ρ)) uniformly converges on [0, T] to q(·, u).

Let us assume by contradiction that Fu,u is not Fr´echet-differentiable atρ= 0 withDFu,u(0) = w(·, u, u). Then, there existsε >0 et (ρk)k a positive sequence such that (ρk)k tends to zero and

such that

Fu,uk)Fu,u(0)

ρk w(·, u, u)

ε

for allkN. In this proof, for the ease of notations, we denote byw:=w(·, u, u),q:=q(·, u) and byqk :=q(·, u(·, ρk)),uk :=u(·, ρk) for everykN. Since the sequence (uk)k converges touin L1, we deduce from the (partial) converse of the classical Lebesgue dominated convergence theorem that there exists a subsequence (that we do not relabel) such that (uk)k tends toua.e. on [0, T].

For everykNand everyt[0, T], we definezk(t) := qk(t)−q(t)ρ

k w(t). From our assumption, it holds that kzkkεfor allkN. On the other hand, we have

zk(t) = Z t

0

f(qk(τ), uk(τ), τ)f(q(τ), u(τ), τ)

ρk 1f(q(τ), u(τ), τ)×w(τ)hu,u(τ)dτ, that is,

zk(t) = Z t

0

f(qk(τ), uk(τ), τ)f(q(τ), uk(τ), τ)

ρk 1f(q(τ), u(τ), τ)×w(τ) +f(q(τ), uk(τ), τ)f(q(τ), u(τ), τ)

ρk hu,u(τ)dτ, for everyt[0, T]. From the classical Taylor formula with integral rest, we obtain that

zk(t) = Z t

0

1f(q(τ), u(τ), τ)×zk(τ)+ Z t

0

1 ρk

1Q

ρk(τ)1

hu,u(τ) +

Z t 0

Z 1 0

1f(q(τ) +θ(qk(τ)q(τ)), uk(τ), τ)1f(q(τ), u(τ), τ)

qk(τ)q(τ) ρk

dτ,

(8)

for everyt[0, T]. Hence, from Lemma3, we get that kzk(t)kRnρk+ 2RT Cu,Rκk+Lu,R

Z t

0 kzk(τ)kRndτ, for everyt[0, T], where

κk :=

Z T 0

Z 1

0 k1f(q(τ) +θ(qk(τ)q(τ)), uk), τ)1f(q(τ), u(τ), τ)kRn×Rn dθdτ.

From the continuity and the boundedness of 1f on Ku,R, since (uk)k tends to u a.e. on [0, T] and from the classical Lebesgue dominated convergence theorem, one can easily prove that (κk)k

tends to zero. Finally, from the classical Gronwall lemma, we obtain that kzk(t)kRn k + 2RT Cu,Rκk)eT Lu,R for everyt[0, T]. This raises a contradiction with the inequalitykzkkε for allkN. The proof is complete.

2.2 Application of the Ekeland variational principle

Let us introduce g = (gi)i=1,...,j : Cn Cj the application defined by g(q) := G(q,·) for every qCn, and let S be the nonempty closed convex cone of Cj defined by S := C([0, T],(R)j). Thus the running state constraints in Problem (OCP) can equivalently be replaced by

g(q)S.

Note that g is of class C1 withDg(q)(w) =1G(q,·)×w for every q, wCn, and that S has a nonempty interior.

Since (Cj,k·k) is a separable Banach space, we endow Cjwith an equivalent normk·kCj such that the associated dual norm k · kCj is strictly convex (see Proposition4 in AppendixB.1). Then, we denote bydSthe 1-Lipschitz continuous distance function to S defined bydS(q) := infz∈SkqzkCj

for every q Cj. Since the dual norm k · kCj is strictly convex, we know that dS is strictly Hadamard-differentiable on Cj\S with kDdS(q)kCj = 1 for every q Cj\S (see Proposition 5 in Appendix B.2). As a consequence, d2S is also strictly Hadamard-differentiable on Cj\S with Dd2S(q) = 2dS(q)DdS(q) for everyq Cj\S. We also recall that d2S is Fr´echet-differentiable on S withDd2S(q) = 0 for everyqS (see Remark6 in AppendixB.2).

In the whole section, letq(·, u)AC anduLbe an optimal solution of Problem (OCP). Let (R) be a positive sequence such thatR>kukL for everyNand such that limR= +. Let (εk)k be a positive sequence such that limkεk = 0. For every ℓ, k N, we consider the penalized functional given by

Jk: Eu,R −→ R+

u 7−→

r

Ψ(q(T, u))Ψ(q(T, u)) +εk+2

+d2S

g q(·, u) ,

where

Eu,R :={uEu,R | u(τ)Ω for a.e. τ [0, T]}.

Note that Jk is a positive functional because of the optimality ofu. We endow Eu,R with the classical normk · kL1. Since Ω is a nonempty closed subset ofRm, it follows from the (partial) con- verse of the classical Lebesgue dominated convergence theorem that (Eu,R,k · kL1) is a nonempty

(9)

closed subset of (L1,k·kL1) and consequently (Eu,R,k·kL1) is a complete metric space. Moreover, from the continuities of Ψ, Fu,R (see Proposition 2), dS and ofg, one can easily see that Jk is continuous on (Eu,R,k · kL1).

Moreover it holds that Jk(u) = εk. As a consequence, from the classical Ekeland variational principle, we conclude that for everyℓ,kN, there existsukEu,Rsuch thatkukukL1 εk

and

εkkuukkL1 Jk(u)Jk(uk), (2) for all u Eu,R. In particular, for a fixed N, note that the sequence (uk)k converges to u in L1 and consequently, the sequence (q(·, uk))k uniformly converges on [0, T] to q(·, u) (see Proposition2).

For everyℓ,kN, we introduce ψk:= 1

Jk(uk) Ψ(q(T, uk))Ψ(q(T, u)) +εk+

0, and

ϕk:=

1 Jk(uk)dS

g(q(·, uk)) DdS

g q(·, uk)

Cj if g(q(·, uk))/S,

0Cj if g(q(·, uk))S.

In particular it holds that |ψk|2+kϕkk2Cj = 1 for everyℓ,kN.

In the sequel our aim is to derive some important inequalities from Inequality (2) using implicit spike variations onuk.

Remark 1. In this remark (and in Remarks2and3), our aim is to provide two crucial inequalities satisfied byϕk. In the caseg(q(·, uk))/S, recall thatDdS(g(q(·, uk))) belongs to the subdifferential of dSat the pointg(q(·, uk)). As a consequence, in both casesg(q(·, uk))/ S andg(q(·, uk))S, it holds that

hϕk, zg(q(·, uk))iCj×Cj 0, (3) for everyzS. Since S has a nonempty interior, there existsξS andδ >0 such thatξ+δzS for everyzB(Cj,k·kCj)(0,1). As a consequence, we obtain that

δhϕk, ziCj×Cj ≤ hϕk, g(q(·, uk))ξiCj×Cj, for everyzB(Cj,k·kCj)(0,1). We deduce that

δkϕkkCj =δq

1− |ψk|2≤ hϕk, g(q(·, uk))ξiCj×Cj. (4) 2.2.1 First inequality depending on fixed

In this section, we fixN. Recall that the sequence (uk)k converges touin L1. Using compact- ness arguments, we infer the existence of a subsequence of (εk)k(that we do not relabel)2such that (uk)k converges tou a.e. on [0, T], (ψk)k converges to someψ0 and (ϕk)k weakly* converges

2The subsequence of (εk)kis not relabel. However, it is worth to note that the extracted subsequence depends onfixed.

Références

Documents relatifs

Then we discuss the maximization condition for optimal sampled-data controls that can be seen as an average of the weak maximization condition stated in the classical

In the random case B → ∞, target probabilities are learned in descending order of their values, and the structure in the spin-configuration space is completely irrelevant:

As mentioned in Remark 3.4 , owing to the time scale theory, Theorem 3.7 represents an extension to the general time scale setting of several versions of the Pontryagin

On peut ensuite d´ecrire simplement un algorithme de filtrage associ´e `a une contrainte alldiff qui va r´ealiser la consistance d’arc pour cette contrainte, c’est-`a-

Afin de travailler sur un potentiel constant sur toute la surface de la cathode (un potentiel de travail uniforme), il est indispensable de donner à la cellule une

Plus précisément on se propose de : – montrer que pour tout pair il existe un cycle homocline qui possède points d’équilibre ; – montrer que le seul cycle avec un nombre impair

In the classical case of optimal permanent control problems without running in- equality state constraint, the Pontryagin maximum principle induces a numerical indirect method based

Keywords and phrases: Optimal control; fractional calculus; Riemann-Liouville and Caputo operators; Filippov’s existence theorem; Pontryagin maximum principle; needle-like