Second-order sufficient conditions for strong solutions to optimal control problems

(1)

HAL Id: hal-00825260

https://hal.inria.fr/hal-00825260

Submitted on 23 May 2013

HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires

optimal control problems

Joseph Frederic Bonnans, Xavier Dupuis, Laurent Pfeiffer

To cite this version:

Joseph Frederic Bonnans, Xavier Dupuis, Laurent Pfeiffer. Second-order suﬀicient conditions for

strong solutions to optimal control problems. ESAIM: Control, Optimisation and Calculus of Varia-

tions, EDP Sciences, 2014, 20 (03), pp.704-724. �10.1051/cocv/2013080�. �hal-00825260�

(2)

0249-6399ISRNINRIA/RR--8307--FR+ENG

RESEARCH REPORT N° 8307

May 2013

conditions for strong solutions to optimal control problems

J. Frédéric Bonnans, Xavier Dupuis, Laurent Pfeiffer

(3)

(4)

RESEARCH CENTRE SACLAY – ÎLE-DE-FRANCE 1 rue Honoré d’Estienne d’Orves Bâtiment Alan Turing

J. Frédéric Bonnans

^∗

, Xavier Dupuis

^∗

, Laurent Pfeier

^∗

Project-Team COMMANDS

Research Report n° 8307 May 2013 24 pages

Abstract: In this report, given a reference feasible trajectory of an optimal control problem, we say that the quadratic growth property for bounded strong solutions holds if the cost function of the problem has a quadratic growth over the set of feasible trajectories with a bounded control and with a state variable suciently close to the reference state variable. Our sucient second- order optimality conditions in Pontryagin form ensure this property and ensure a fortiori that the reference trajectory is a bounded strong solution. Our proof relies on a decomposition principle, which is a particular second-order expansion of the Lagrangian of the problem.

Key-words: Optimal control; second-order sucient conditions; quadratic growth; bounded strong solutions; Pontryagin multipliers; pure state and mixed control-state constraints; decomposition principle.

The research leading to these results has received funding from the EU 7th Framework Programme (FP7- PEOPLE-2010-ITN), under GA number 264735-SADCO, and from the Gaspard Monge Program for Optimization and operations research (PGMO).

∗Inria-Saclay and CMAP, Ecole Polytechnique, Route de Saclay, 91128 Palaiseau Cedex, France. Emails:

frederic.bonnans@inria.fr, xavier.dupuis@cmap.polytechnique.fr, laurent.pfeier@polytechnique.edu.

(5)

Résumé : Nous considérons dans ce rapport une trajectoire admissible d'un problème de commande optimale et disons que la propriété de croissance quadratique pour des solutions fortes est satisfaite si la fonction coût du problème a une croissance quadratique sur l'ensemble des trajectoires dont la commande est bornée et dont la variable d'état est susamment proche de la variable d'état de référence. Nos conditions d'optimalité du second ordre sous forme Pontryaguine garantissent cette propriété et a fortiori que la trajectoire de référence est une solution forte.

Notre preuve s'appuie sur un principe de décomposition, qui est un développement particulier du lagrangien du problème au second ordre.

Mots-clés : Commande optimale; conditions susantes du second ordre; croissance quadra-

tique; solutions fortes; multiplicateurs de Pontryaguine; contraintes pures sur l'état et contraintes

mixtes sur l'état et la commande; principe de décomposition.

(6)

1 Introduction

In this paper, we consider an optimal control problem with nal-state constraints, pure state constraints, and mixed control-state constraints. Given a feasible control u ¯ and its associated state variable y ¯ , we give second-order conditions ensuring that for all R > k uk ¯

∞

, there exist ε > 0 and α > 0 such that for all feasible trajectory (u, y) with kuk

∞

≤ R and ky − yk ¯

∞

≤ ε ,

J (u, y) − J (¯ u, y) ¯ ≥ α(ku − uk ¯

²₂

+ |y

0

− y ¯

0

|

²

), (1.1) where J (u, y) is the cost function to minimize. We call this property quadratic growth for bounded strong solutions. Its specicity lies in the fact that the quadratic growth is ensured for controls which may be far from u ¯ in L

^∞

norm.

Our approach is based on the theory of second-order optimality conditions for optimization problems in Banach spaces [7, 13, 15]. A local optimal solution satises rst- and second-order necessary conditions; denoting by Ω the Hessian of the Lagrangian, theses conditions state that under the extended polyhedricity condition [6, Section 3.2], the supremum of Ω over the set of Lagrange multipliers is nonnegative for all critical directions. If the supremum of Ω is positive for nonzero critical directions, we say that the second-order sucient optimality conditions hold and under some assumptions, a quadratic growth property is then satised. This approach can be used for optimal control problems with constraints of any kind. For example, Stefani and Zezza [19] dealt with problems with mixed control-state equality constraints and Bonnans and Hermant [4] with problems with pure state and mixed control-state constraints. However, the quadratic growth property which is then satised holds for controls which are suciently close to u ¯ in uniform norm and only ensures that (¯ u, y) ¯ is a weak solution.

For Pontryagin minima, that is to say minima locally optimal in a L

¹

neighborhood of u ¯ , the necessary conditions can be strengthened. The rst-order conditions are nothing but the well-known Pontryagin's principle, historically formulated in [18] and extended to problems with various constraints by many authors, such as Hestenes for problems with mixed control-state constraints [11] Dubovitskii and Osmolovskii for problems with pure state and mixed control- state constraints in early Russian references [9, 10], as highlighted by Dmitruk [8]. We refer to the survey by Hartl et al. for more references on this principle.

We say that the second-order necessary condition are in Pontryagin form if the supremum of Ω is taken over the set of Pontryagin multipliers, these multipliers being the Lagrange multipliers for which Pontryagin's principle holds. Maurer and Osmolovskii proved in [17] that the second- order necessary conditions in Pontryagin form were satised for Pontryagin minima to optimal control problems with mixed control-state equality constraints. They also proved that if second- order sucient conditions in Pontryagin form held, then the quadratic growth for bounded strong solutions was satised. The sucient conditions in Pontryagin form are as follows: the supremum of Ω over Pontryagin multipliers only is positive for nonzero critical directions and for all bounded neighborhood of u ¯ , there exists a Pontryagin multiplier which is such such the Hamiltonian has itself a quadratic growth. The results of Maurer and Osmolovskii are true under a restrictive full- rank condition for the mixed equality constraints, which is not satised by pure constraints, and under the Legendre-Clebsch condition, imposing that the Hessian of the augmented Hamiltonian w.r.t. u is positive. The full-rank condition enabled them to reformulate their their problem as a problem with nal-state constraints only. Note that these results were rst stated by Milyutin and Osmolovskii in [16], without proof.

For problems with pure and mixed inequality constraints, we proved the second-order neces-

sary conditions in Pontryagin form [2]; in the present paper, we prove that the sucient conditions

in Pontryagin form ensure the quadratic growth property for bounded strong solutions under the

Legendre-Clebsch condition. Our proof is based on an extension of the decomposition principle

(7)

of Bonnans and Osmolovskii [5] to the constrained case. This principle is a particular second- order expansion of the Lagrangian, which takes into account the fact that the control may have large perturbations in uniform norm. Note that the diculties arising in the extension of the principle and the proof of quadratic growth are mainly due to the presence of mixed control-state constraints.

The outline of the paper is as follows. In Section 2, we set our optimal control problem.

Section 3 is devoted to technical aspects related to the reduction of state constraints. We prove the decomposition principle in Section 4 (Theorem 4.2) and prove the quadratic growth property for bounded strong solutions in Section 5 (Theorem 5.3). In Section 6, we prove that under technical assumptions, the sucient conditions are not only sucient but also necessary to ensure the quadratic growth property (Theorem 6.3).

Notations. For a function h that depends only on time t , we denote by h

t

its value at time t , by h

i,t

the value of its i -th component if h is vector-valued, and by h ˙ its derivative. For a function h that depends on (t, x) , we denote by D

t

h and D

x

h its partial derivatives. We use the symbol D without any subscript for the dierentiation w.r.t. all variables except t , e.g. Dh = D

_(u,y)

h for a function h that depends on (t, u, y) . We use the same convention for higher order derivatives.

We identify the dual space of R

ⁿ

with the space R

^n∗

of n -dimensional horizontal vectors.

Generally, we denote by X

^∗

the dual space of a topological vector space X . Given a convex subset K of X and a point x of K , we denote by T

K

(x) and N

K

(x) the tangent and normal cone to K at x , respectively; see [6, Section 2.2.4] for their denition.

We denote by |·| both the Euclidean norm on nite-dimensional vector spaces and the cardinal of nite sets, and by k · k

s

and k · k

q,s

the standard norms on the Lesbesgue spaces L

^s

and the Sobolev spaces W

^q,s

, respectively.

We denote by BV ([0, T ]) the space of functions of bounded variation on the closed interval [0, T ] . Any h ∈ BV ([0, T ]) has a derivative dh which is a nite Radon measure on [0, T ] and h

₀

(resp. h

_T

) is dened by h

₀

:= h

₀₊

− dh(0) (resp. h

_T

:= h

_T₋

+ dh(T) ). Thus BV ([0, T ]) is endowed with the following norm: khk

BV

:= kdhk

_M

+ |h

T

| . See [1, Section 3.2] for a rigorous presentation of BV .

All vector-valued inequalities have to be understood coordinate-wise.

2 Setting

2.1 The optimal control problem

We formulate in this section the optimal control problem under study and we use the same framework as in [2]. We refer to this article for supplementary comments on the dierent assumptions made. Consider the state equation

˙

y

_t

= f (t, u

_t

, y

_t

) for a.a. t ∈ (0, T ). (2.1) Here, u is a control which belongs to U , y is a state which belongs to Y , where

U := L

^∞

(0, T ; R

^m

), Y := W

^1,∞

(0, T ; R

ⁿ

), (2.2) and f : [0, T ] × R

^m

× R

ⁿ

→ R

ⁿ

is the dynamics. Consider constraints of various types on the system: the mixed control-state constraints, or mixed constraints

c(t, u

t

, y

t

) ≤ 0 for a.a. t ∈ (0, T ), (2.3) the pure state constraints, or state constraints

g(t, y

t

) ≤ 0 for a.a. t ∈ (0, T ), (2.4)

(8)

and the initial-nal state constraints

( Φ

^E

(y

0

, y

T

) = 0,

Φ

^I

(y

₀

, y

_T

) ≤ 0. (2.5)

Here c : [0, T ] × R

^m

× R

ⁿ

→ R

ⁿ^c

, g : [0, T ] × R

ⁿ

→ R

ⁿ^g

, Φ

^E

: R

ⁿ

× R

ⁿ

→ R

ⁿ^Φ^E

, Φ

^I

: R

ⁿ

× R

ⁿ

→ R

ⁿ^Φ^I

. Finally, consider the cost function φ : R

ⁿ

× R

ⁿ

→ R. The optimal control problem is then

(u,y)∈U ×Y

min φ(y

0

, y

T

) subject to (2.1)-(2.5) . ( P ) We call a trajectory any pair (u, y) ∈ U × Y such that (2.1) holds. We say that a trajectory is feasible for problem ( P ) if it satises constraints (2.3)-(2.5), and denote by F(P ) the set of feasible trajectories. From now on, we x a feasible trajectory (¯ u, y) ¯ .

Similarly to [19, Denition 2.1], we introduce the following Carathéodory-type regularity notion:

Denition 2.1. We say that ϕ : [0, T ] × R

^m

× R

ⁿ

→ R

^s

is uniformly quasi- C

^k

i

(i) for a.a. t , (u, y) 7→ ϕ(t, u, y) is of class C

^k

, and the modulus of continuity of (u, y) 7→

D

^k

ϕ(t, u, y) on any compact of R

^m

× R

ⁿ

is uniform w.r.t. t .

(ii) for j = 0, . . . , k , for all (u, y) , t 7→ D

^j

ϕ(t, u, y) is essentially bounded.

Remark 2.2. If ϕ is uniformly quasi- C

^k

, then D

^j

ϕ for j = 0, . . . , k are essentially bounded on any compact, and (u, y) 7→ D

^j

ϕ(t, u, y) for j = 0, . . . , k − 1 are locally Lipschitz, uniformly w.r.t.

t .

The regularity assumption that we need for the quadratic growth property is the following:

Assumption 1. The mappings f , c and g are uniformly quasi- C

²

, g is dierentiable, D

_t

g is uniformly quasi- C

¹

, Φ

^E

, Φ

^I

, and φ are C

²

.

Note that this assumption will be strengthened in Section 6.

Denition 2.3. We say that the inward condition for the mixed constraints holds i there exist γ > 0 and v ¯ ∈ U such that

c(t, u ¯

t

, y ¯

t

) + D

u

c(t, u ¯

t

, y ¯

t

)¯ v

t

≤ −γ, for a.a. t. (2.6) In the sequel, we will always make the following assumption:

Assumption 2. The inward condition for the mixed constraints holds.

Assumption 2 ensures that the component of the Lagrange multipliers associated with the mixed constraints belongs to L

^∞

(0, T ; R

ⁿ^c^∗

) , see e.g. [5, Theorem 3.1]. This assumption will also play a role in the decomposition principle.

2.2 Bounded strong optimality and quadratic growth

Let us introduce various notions of minima, following [16].

(9)

Denition 2.4. We say that (¯ u, y) ¯ is a bounded strong minimum i for any R > k¯ uk

_∞

, there exists ε > 0 such that

φ(¯ y

0

, y ¯

T

) ≤ φ(y

0

, y

T

), for all (u, y) ∈ F(P ) such that (2.7) ky − yk ¯

_∞

≤ ε and kuk

_∞

≤ R,

a Pontryagin minimum i for any R > k uk ¯

_∞

, there exists ε > 0 such that

φ(¯ y

₀

, y ¯

_T

) ≤ φ(y

₀

, y

_T

), for all (u, y) ∈ F(P ) such that (2.8) ku − uk ¯

1

+ ky − yk ¯

_∞

≤ ε and kuk

_∞

≤ R,

a weak minimum i there exists ε > 0 such that

φ(¯ y

₀

, y ¯

_T

) ≤ φ(y

₀

, y

_T

), for all (u, y) ∈ F(P ) such that (2.9) ku − uk ¯

_∞

+ ky − yk ¯

_∞

≤ ε.

Obviously, (2.7) ⇒ (2.8) ⇒ (2.9).

Denition 2.5. We say that the quadratic growth property for bounded strong solutions holds at (¯ u, y) ¯ i for all R > k¯ uk

_∞

, there exist ε

_R

> 0 and α

_R

> 0 such that for all feasible trajectory (u, y) satisfying kuk

_∞

≤ R and ky − yk ¯

_∞

≤ ε ,

φ(y

0

, y

T

) − φ(¯ y

0

, y ¯

T

) ≥ α

R

ku − uk ¯

²₂

. (2.10) The goal of the article is to characterize this property. If it holds at (¯ u, y) ¯ , then (¯ u, y) ¯ is a bounded strong solution to the problem.

2.3 Multipliers

We dene the Hamiltonian and the augmented Hamiltonian respectively by

H [p](t, u, y) := pf (t, u, y), H

^a

[p, ν](t, u, y) := pf (t, u, y) + νc(t, u, y), (2.11) for (p, ν, t, u, y) ∈ R

^n∗

× R

ⁿ^c^∗

× [0, T ] × R

^m

× R

ⁿ

. We dene the end points Lagrangian by

Φ[β, Ψ](y

₀

, y

_T

) := βφ(y

₀

, y

_T

) + ΨΦ(y

₀

, y

_T

), (2.12) for (β, Ψ, y

0

, y

T

) ∈ R × R

ⁿ^Φ^∗

× R

ⁿ

× R

ⁿ

, where n

Φ

= n

_ΦE

+ n

_ΦI

and Φ =

Φ

^E

Φ

^I

. We set

K

c

:= L

^∞

(0, T ; R

ⁿ−^c

), K

g

:= C([0, T ]; R

ⁿ−^g

), K

Φ

:= {0}

_Rⁿ_ΦE

× R

ⁿ−^Φ^I

, (2.13) so that the constraints (2.3)-(2.5) can be rewritten as

c(·, u, y) ∈ K

_c

, g(·, y) ∈ K

_g

, Φ(y

₀

, y

_T

) ∈ K

_Φ

. (2.14) Recall that the dual space of C([0, T ]; R

ⁿ^g

) is the space M([0, T ]; R

ⁿ^g^∗

) of nite vector-valued Radon measures. We denote by M([0, T ]; R

ⁿ^g^∗

)

+

the cone of positive measures in this dual space. Let

E := R × R

ⁿ^Φ^∗

× L

^∞

(0, T ; R

ⁿ^c^∗

) × M([0, T ]; R

ⁿ^g^∗

). (2.15)

(10)

Let N

K_c

(c(·, u, ¯ y)) ¯ be the set of elements in the normal cone to K

c

at c(·, u, ¯ y) ¯ that belong to L

^∞

(0, T ; R

ⁿ^c^∗

) , i.e.

N

K_c

(c(·, u, ¯ y)) := ¯

ν ∈ L

^∞

(0, T ; R

ⁿ+^c^∗

) : ν

t

c(t, ¯ u

t

, y ¯

t

) = 0 for a.a. t . (2.16) Let N

_K_g

(g(·, y)) ¯ be the normal cone to K

_g

at g(·, y) ¯ , i.e.

N

_K_g

(g(·, y)) := ¯ (

µ ∈ M([0, T ]; R

ⁿ^g^∗

)

₊

: Z

[0,T]

(dµ

_t

g(t, y ¯

_t

)) = 0 )

. (2.17)

Let N

_K_Φ

(Φ(¯ y

₀

, y ¯

_T

)) be the normal cone to K

_Φ

at Φ(¯ y

₀

, y ¯

_T

) , i.e.

N

_K_Φ

(Φ(¯ y

₀

, y ¯

_T

)) :=

Ψ ∈ R

ⁿ^Φ^∗

: Ψ

_i

≥ 0

Ψ

_i

Φ

_i

(¯ y

₀

, y ¯

_T

) = 0 for n

_ΦE

< i ≤ n

_Φ

. (2.18)

Finally, let

N (¯ u, y) := ¯ R

+

× N

K_Φ

(Φ(¯ y

0

, y ¯

T

)) × N

K_c

(c(·, u, ¯ y)) ¯ × N

K_g

(g(·, y)) ¯ ⊂ E. (2.19) We dene the costate space

P := BV ([0, T ]; R

^n∗

). (2.20)

Given λ = (β, Ψ, ν, µ) ∈ E , we consider the costate equation in P ( −dp

t

= D

y

H

^a

[p

t

, ν

t

](t, u ¯

t

, y ¯

t

)dt + dµ

t

Dg(t, y ¯

t

),

p

_T₊

= D

_y_T

Φ[β, Ψ](¯ y

₀

, y ¯

_T

). (2.21) Denition 2.6. Let λ = (β, Ψ, ν, µ) ∈ E . We say that the solution of the costate equation (2.21) p

^λ

∈ P is an associated costate i

−p

^λ₀₋

= D

_y₀

Φ[β, Ψ](¯ y

₀

, y ¯

_T

). (2.22) Let N

π

(¯ u, y) ¯ be the set of nonzero λ ∈ N(¯ u, y) ¯ having an associated costate.

We dene the set-valued mapping U : [0, T ] ⇒ R

^m

by

U (t) := cl {u ∈ R

^m

: c(t, u, y ¯

_t

) < 0} for a.a. t, (2.23) where cl denotes the closure in R

^m

. We can now dene two dierent notions of multipliers.

Denition 2.7. (i) We say that λ ∈ N

π

(¯ u, y) ¯ is a generalized Lagrange multiplier i

D

_u

H

^a

[p

^λ_t

, ν

_t

](t, u ¯

_t

, y ¯

_t

) = 0 for a.a. t. (2.24) We denote by Λ

_L

(¯ u, y) ¯ the set of generalized Lagrange multipliers.

(ii) We say that λ ∈ Λ

L

(¯ u, y) ¯ is a generalized Pontryagin multiplier i

H [p

^λ_t

](t, u ¯

_t

, y ¯

_t

) ≤ H[p

^λ_t

](t, u, y ¯

_t

) for all u ∈ U (t), for a.a. t. (2.25) We denote by Λ

P

(¯ u, y) ¯ the set of generalized Pontryagin multipliers.

Note that even if (¯ u, y) ¯ is a Pontryagin minimum, inequality (2.25) may not be satised for

some t ∈ [0, T ] and some u ∈ R

^m

for which c(t, u, y ¯

t

) = 0 , as we show in [2, Appendix].

(11)

2.4 Reduction of touch points

Let us rst recall the denition of the order of a state constraint. For 1 ≤ i ≤ n

_g

, assuming that g

i

is suciently regular, we dene by induction g

^(j)_i

: [0, T ] × R

^m

× R

ⁿ

→ R, j ∈ N, by

g

_i^(j+1)

(t, u, y) := D

t

g

_i^(j)

(t, u, y) + D

y

g

_i^(j)

(t, u, y)f (t, u, y), g

_i⁽⁰⁾

:= g

i

. (2.26) Denition 2.8. If g

_i

and f are C

^qⁱ

, we say that the state constraint g

_i

is of order q

_i

∈ N i

D

u

g

^(j)_i

≡ 0 for 0 ≤ j ≤ q

i

− 1, D

u

g

^(q_i ⁱ⁾

6≡ 0. (2.27) If g

i

is of order q

i

, then for all j < q

i

, g

_i^(j)

is independent of u and we do not mention this dependence anymore. Moreover, the mapping t 7→ g

i

(t, y ¯

t

) belongs to W

^qⁱ^,∞

(0, T ) and

d

^j

dt

^j

g

_i

(t, y ¯

_t

) = g

^(j)_i

(t, y ¯

_t

) for 0 ≤ j < q

_i

, (2.28) d

^j

dt

^j

g

i

(t, y ¯

t

) = g

^(j)_i

(t, u ¯

t

, y ¯

t

) for j = q

i

. (2.29) Denition 2.9. We say that τ ∈ [0, T ] is a touch point for the constraint g

i

i it is a contact point for g

i

, i.e. g

i

(τ, y ¯

τ

) = 0 , and τ is isolated in {t : g

i

(t, y ¯

t

) = 0} . We say that a touch point τ for g

i

is reducible i τ ∈ (0, T ) ,

_dt^d²2

g

i

(t, y ¯

t

) is dened for t close to τ , continuous at τ , and

d

²

dt

²

g

i

(t, y ¯

t

)|

t=τ

< 0. (2.30) For 1 ≤ i ≤ n

_g

, let us dene

T

g,i

:=

( ∅ if g

i

is of order 1,

{ touch points for g

_i

} otherwise . (2.31) Note that for the moment, we only need to distinguish the constraints of order 1 from the other constraints, for which the order may be undened if g

i

or f is not regular enough.

Assumption 3. For 1 ≤ i ≤ n

g

, the set T

g,i

is nite and only contains reducible touch points.

2.5 Tools for the second-order analysis

We dene now the linearizations of the system, the critical cone, and the Hessian of the La- grangian. Let us set

V

2

:= L

²

(0, T ; R

^m

), Z

1

:= W

^1,1

(0, T ; R

ⁿ

), and Z

2

:= W

^1,2

(0, T ; R

ⁿ

). (2.32) Given v ∈ V

₂

, we consider the linearized state equation in Z

₂

˙

z

t

= Df(t, u ¯

t

, y ¯

t

)(v

t

, z

t

) for a.a. t ∈ (0, T ). (2.33) We call linerarized trajectory any (v, z) ∈ V

2

×Z

2

such that (2.33) holds. For any (v, z

⁰

) ∈ V

2

× R

ⁿ

, there exists a unique z ∈ Z

2

such that (2.33) holds and z

₀

= z

⁰

; we denote it by z = z[v, z

⁰

] . We also consider the second-order linearized state equation in Z

₁

, dened by

ζ ˙

t

= D

y

f (t, u ¯

t

, y ¯

t

)ζ

t

+ D

²

f (t, u ¯

t

, y ¯

t

)(v

t

, z

t

[v, z

⁰

])

²

for a.a. t ∈ (0, T ). (2.34)

(12)

We denote by z

²

[v, z

⁰

] the unique ζ ∈ Z

1

such that (2.34) holds and such that z

0

= 0 . The critical cone in L

²

is dened by

C

₂

(¯ u, y) := ¯



 



 



(v, z) ∈ V

₂

× Z

₂

: z = z[v, z

₀

] Dφ(¯ y

₀

, y ¯

_T

)(z

₀

, z

_T

) ≤ 0

DΦ(¯ y

₀

, y ¯

_T

)(z

₀

, z

_T

) ∈ T

_K_Φ

(Φ(¯ y

₀

, y ¯

_T

)) Dc(·, u, ¯ y)(v, z) ¯ ∈ T

K_c

(c(·, u, ¯ y)) ¯ Dg(·, y)z ¯ ∈ T

K_g

(g(·, y)) ¯



 



 



(2.35)

Note that by [6, Examples 2.63 and 2.64], the tangent cones T

K_g

(g(·, y)) ¯ and T

K_c

(c(·, u, ¯ y)) ¯ are resp. described by

T

K_g

= {ζ ∈ C([0, T ]; R

ⁿ

) : ∀t, g(t, y ¯

t

) = 0 = ⇒ ζ

t

≤ 0}, (2.36) T

_K_c

= {w ∈ L

²

([0, T ]; R

^m

) : for a.a. t, c(t, u ¯

_t

, y ¯

_t

) = 0 = ⇒ w

_t

≤ 0} (2.37) Finally, for any λ = (β, Ψ, ν, µ) ∈ E , we dene a quadratic form, the Hessian of Lagrangian, Ω[λ] : V

2

× Z

2

→ R by

Ω[λ](v, z) :=

Z

T 0

D

²

H

^a

[p

^λ_t

, ν

t

](t, u ¯

t

, y ¯

t

)(v

t

, z

t

)

²

dt + D

²

Φ[β, Ψ](¯ y

0

, y ¯

T

)(z

0

, z

T

)

²

+ Z

[0,T]

dµ

_t

D

²

g(t, y ¯

_t

)(z

_t

)

²

− X

τ∈Tg,i

1≤i≤ng

µ

_i

(τ )

Dg

_i⁽¹⁾

(τ, y ¯

τ

)z

τ

²

g

⁽²⁾_i

(τ, u ¯

_τ

, y ¯

_τ

)

. (2.38)

We justify the terms involving the touch points in T

g,i

in the following section.

3 Reduction of touch points

We recall in this section the main idea of the reduction technique used for the touch points of state constraints of order greater or equal than 2. Let us mention that this approach was described in [12, Section 3] and used in [14, Section 4] in the case of optimal control problems.

As shown in [3], the reduction allows to derive no-gap necessary and sucient second-order optimality conditions, i.e., the Hessian of the Lagrangian of the reduced problem corresponds to the quadratic form of the necessary conditions. We also prove a strict dierentiability property for the mapping associated with the reduction, that will be used in the decomposition principle.

Recall that for all 1 ≤ i ≤ n

g

, all touch points of T

g,i

are supposed to be reducible (Assumption 3). Let ε > 0 be suciently small so that for all 1 ≤ i ≤ n

g

, for all τ ∈ T

g,i

, the time function

t ∈ [τ − ε, τ + ε] 7→ g(t, y ¯

_t

) (3.1) is C

²

and is such that for some β > 0 ,

_d^d_t²2

g

_i

(t, y ¯

_t

) ≤ −β , for all t in [τ − ε, τ + ε] . From now on, we set for all i and for all τ ∈ T

g,i

∆

^ε_τ

= [τ − ε, τ + ε] and ∆

^ε_i

= [0, T ]\

∪

_τ∈T_g,i

∆

^ε_τ

, (3.2) and we consider the mapping Θ

^ε_τ

: U × R

ⁿ

→ R dened by

Θ

^ε_τ

(u, y

⁰

) := max {g

i

(t, y

t

) : y = y[u, y

⁰

], t ∈ ∆

^ε_τ

}. (3.3)

(13)

We dene the reduced pure constraints as follows:

for all i ∈ {1, ..., n

g

} ,

( g

_i

(t, y

_t

) ≤ 0, for all t ∈ ∆

^ε_i

, (i)

Θ

^ε_τ

(u, y

⁰

) ≤ 0, for all τ ∈ T

g,i

. (ii) (3.4) Finally, we consider the following reduced problem, which is an equivalent reformulation of problem ( P ), in which the pure constraints are replaced by constraint (3.4):

min

(u,y)∈U ×Y

φ(y

₀

, y

_T

) subject to (2.1) , (2.3) , (2.5) , and (3.4) . ( P

⁰

) Now, for all 1 ≤ i ≤ n

g

, consider the mapping ρ

i

dened by

ρ

i

: µ ∈ M([0, T ]; R

+

) 7→ µ

_|∆^ε

i

, (µ(τ))

_τ∈T_g,i

∈ M(∆

^ε_i

; R

+

) × R

^|T^g,i^|

. (3.5) Lemma 3.1. The mapping Θ

^ε_τ

is twice Fréchet-dierentiable at (¯ u, y ¯

0

) with derivatives

DΘ

^ε_τ

(¯ u, y ¯

0

)(v, z

0

) = Dg

i

(τ, y ¯

τ

)z

τ

[v, z

0

], (3.6) D

²

Θ

^ε_τ

(¯ u, y ¯

₀

)(v, z

₀

)

²

= D

²

g

_i

(τ, y ¯

_τ

)(z

_τ

[v, z

₀

])

²

+ Dg

_i

(τ, y ¯

_τ

)z

²_τ

[v, z

₀

]

−

Dg

_i⁽¹⁾

(τ, y ¯

τ

)z

τ

2

g

⁽²⁾_i

(τ, u ¯

_τ

, y ¯

_τ

) . (3.7) and the following mappings dene a bijection between Λ

L

(¯ u, y) ¯ and the Lagrange multipliers of problem ( P

⁰

), resp. between Λ

_P

(¯ u, y) ¯ and the Pontryagin multipliers of problem ( P

⁰

):

λ = β, Ψ, ν, µ

∈ Λ

L

(¯ u, y) ¯ 7→ β, Ψ, ν, (ρ

i

(µ

ⁱ

))

1≤i≤n_g

(3.8) λ = β, Ψ, ν, µ

∈ Λ

P

(¯ u, y) ¯ 7→ β, Ψ, ν, (ρ

i

(µ

ⁱ

))

_1≤i≤n_g

. (3.9)

See [3, Lemma 26] for a proof of this result. Note that the restriction of µ

_i

to ∆

^ε_i

is associated with constraint (3.4(i)) and (µ

_i

(τ))

_τ∈T_g,i

with constraint (3.4(ii)). The expression of the Hessian of Θ

^ε_τ

justies the quadratic form Ω dened in (2.38). Note also that in the sequel, we will work with problem P

⁰

and with the original description of the multipliers, using implicitly the bijections (3.8) and (3.9).

Now, let us x i and τ ∈ T

g,i

. The following lemma is a dierentiability property for the mapping Θ

^ε_τ

, related to the one of strict dierentiability, that will be used to prove the decomposition theorem.

Lemma 3.2. There exists ε > 0 such that for all u

₁

and u

₂

in U , for all y

⁰

in R

ⁿ

, if

ku

¹

− uk ¯

1

≤ ε, ku

²

− uk ¯

1

≤ ε, and |y

⁰

− y ¯

₀

| ≤ ε, (3.10) then

Θ

^ε_τ

(u

²

, y

⁰

) − Θ

^ε_τ

(u

¹

, y

⁰

) = g(τ, y

τ

[u

²

, y

⁰

]) − g(τ, y

τ

[u

¹

, y

⁰

]) + O ku

²

− u

¹

k

1

(ku

¹

− uk ¯

1

+ ku

²

− uk ¯

1

+ |y

⁰

− y ¯

0

|)

. (3.11)

An intermediate lemma is needed to prove this result. Consider the mapping χ dened as follows:

χ : x ∈ W

^2,∞

(∆

^ε_τ

) 7→ sup

t∈[τ−ε,τ+ε]

x

t

∈ R . (3.12)

Let us set x

⁰

= g

i

(·, y) ¯

_|∆ε

τ

. Note that x ˙

⁰_τ

= 0 .

(14)

Lemma 3.3. There exists α

⁰

> 0 such that for all x

¹

and x

²

in W

^2,∞

(∆

τ

) , if k x ˙

¹

− x ˙

⁰

k

_∞

≤ α

⁰

and k x ˙

²

− x ˙

⁰

k

_∞

≤ α

⁰

, then

χ(x

²

) − χ(x

¹

) = x

²

(τ ) − x

¹

(τ)

+ O k x ˙

²

− x ˙

¹

k

∞

(k x ˙

¹

− x ˙

⁰

k

∞

+ k x ˙

²

− x ˙

⁰

k

∞

)

. (3.13)

Proof. Let 0 < α

⁰

< βε and x

¹

, x

²

in W

^2,∞

(∆

τ

) satisfy the assumption of the lemma. Denote by τ

1

(resp. τ

2

) a (possibly non-unique) maximizer of χ(x

¹

) (resp. χ(x

²

) ). Since

˙

x

¹_τ−ε

≥ x ˙

⁰_τ−ε

− α

⁰

≥ βε − α

⁰

> 0 and x ˙

¹_τ+ε

≤ x ˙

⁰_τ+ε

+ α ≤ −βε + α < 0, (3.14) we obtain that τ

1

∈ (τ − ε, τ + ε) and therefore that x ˙

¹_τ₁

= 0 . Therefore,

β |τ

1

− τ | ≤ | x ˙

⁰_τ

1

− x ˙

⁰_τ

| = | x ˙

¹_τ

1

− x ˙

⁰_τ

1

| ≤ k x ˙

¹

− x ˙

⁰

k

_∞

(3.15) and then, |τ

1

− τ| ≤ k x ˙

¹

− x ˙

⁰

k

∞

/β . Similarly, |τ

2

− τ | ≤ k x ˙

²

− x ˙

⁰

k

∞

/β . Then, by (3.15),

χ(x

²

) ≥ x

¹

(τ

1

) + (x

²

(τ

1

) − x

¹

(τ

1

))

= χ(x

¹

) + (x

²

(τ) − x

¹

(τ)) + O(k x ˙

²

− x ˙

¹

k

∞

|τ

1

− τ |) (3.16) and therefore, the l.h.s. of (3.13) is greater than the r.h.s. and by symmetry, the converse inequality holds. The lemma is proved.

Proof of Lemma 3.2. Consider the mapping

G

τ

: (u, y

⁰

) ∈ (U × R

ⁿ

) 7→ t ∈ ∆

τ

7→ g

i

(t, y

t

[u, y

⁰

])

∈ W

^2,∞

(∆

τ

). (3.17) Since g

i

is not of order 1 and by Assumption 1, the mapping G

τ

is Lipschitz in the following sense : there exists K > 0 such that for all (u

¹

, y

^0,1

) and (u

²

, y

^0,2

) ,

kG

τ

(u

¹

, y

^0,1

) − G

τ

(u

²

, y

^0,2

)k

_1,∞

≤ K(ku

²

− u

¹

k

1

+ |y

^0,2

− y

^0,1

|). (3.18) Set α = α

⁰

/(2K) . Let u

¹

and u

²

in U , let y

⁰

in R

ⁿ

be such that (3.10) holds. Then by Lemma 3.3 and by (3.18),

Θ

^ε_τ

(u

²

, y

⁰

) − Θ

^ε_τ

(u

¹

, y

⁰

)

= χ(G

_τ

(u

²

, y

⁰

)) − χ(G

_τ

(u

¹

, y

⁰

))

= g(y

τ

[u

²

, y

⁰

]) − g(y

τ

[u

¹

, y

⁰

])

+ O ku

²

− u

¹

k

1

(ku

²

− uk ¯

1

+ ku

¹

− uk ¯

1

+ |y

⁰

− y ¯

0

|)

, (3.19)

as was to be proved.

4 A decomposition principle

We follow a classical approach by contradiction to prove the quadratic growth property for bounded strong solutions. We assume the existence of a sequence of feasible trajectories (u

^k

, y

^k

)

_k

which is such that u

^k

is bounded and such that ky

^k

− yk ¯

∞

→ 0 and for which the quadratic growth property does not hold. The Lagrangian function rst provides a lower estimate of the cost function φ(y

₀^k

, y

_T^k

) . The diculty here is to linearize the Lagrangian, since we must consider large perturbations of the control in L

^∞

norm. To that purpose, we extend the decomposition principle of [5, Section 2.4] to our more general framework with pure and mixed constraints.

This principle is a partial expansion of the Lagrangian, which is decomposed into two terms:

Ω[λ](v

^A,k

, z[v

^A,k

, y

₀^k

− y ¯

0

]) , where v

^A,k

stands for the small perturbations of the optimal control,

and a dierence of Hamiltonians where the large perturbations occur.

(15)

4.1 Notations and rst estimates

Let R > k¯ uk

∞

, let (u

^k

, y

^k

)

_k

be a sequence a feasible trajectories such that

∀k, ku

^k

k

_∞

≤ R and ku

^k

− uk ¯

₂

→ 0. (4.1) This sequence will appear in the proof of the quadratic growth property. Note that the conver- gence of controls implies that ky

^k

− yk ¯

_∞

→ 0 . We need to build two auxiliary controls u

^A,k

and

˜

u

^k

. The rst one, ˜ u

^k

, is such that

( c(t, u ˜

^k_t

, y

^k_t

) ≤ 0, for a.a. t ∈ [0, T ],

k u ˜

^k

− uk ¯

∞

= O(ky

^k

− yk ¯

∞

). (4.2) The following lemma proves the existence of such a control.

Lemma 4.1. There exist ε > 0 and α ≥ 0 such that for all y ∈ Y with ky − yk ¯

∞

≤ ε , there exists u ∈ U satisfying

ku − uk ¯

_∞

≤ αky − yk ¯

_∞

and c(t, u

_t

, y

_t

) ≤ 0, for a.a. t. (4.3) Proof. For all y ∈ Y , consider the mapping C

y

dened by

u ∈ U 7→ C

y

(u) = t 7→ c(t, u

t

, y

t

)

∈ L

^∞

(0, T ; R

ⁿ^g

). (4.4) The inward condition (Assumption 2) corresponds to Robinson's constraint qualication for C

¯y

at u ¯ with respect to L

^∞

(0, T ; R

ⁿ−^g

) . Thus, by the Robinson-Ursescu stability theorem [6, Theorem 2.87], there exists ε > 0 such that for all y ∈ Y with ky − yk ¯

∞

≤ ε , C

_y

is metric regular at u ¯ with respect to L

^∞

(0, T ; R

ⁿ−^g

) . Therefore, for all y ∈ Y with ky − yk ¯

∞

≤ ε , there exists a control u such that, for almost all t , c(t, u

_t

, y

_t

) ≤ 0 and

ku − uk ¯

_∞

= O dist (C

y

(¯ u), L

^∞

(0, T ; R

ⁿ−^g

))

= O(ky − yk ¯

_∞

).

This proves the lemma.

Now, let us introduce the second auxiliary control u

^A,k

. We say that a partition (A, B) of the interval [0, T ] is measurable i A and B are measurable subset of [0, T ] . Let us consider a sequence of measurable partitions (A

k

, B

k

)

k

of [0, T ] . We dene u

^A,k

as follows:

u

^A,k_t

= ¯ u

_t

1

_{t∈Bk}

+ u

^k_t

1

_{t∈A_k_}

. (4.5) The idea is to separate, in the perturbation u

^k

− u ¯ , the small and large perturbations in uniform norm. In the sequel, the letter A will refer to the small perturbations and the letter B to the large ones. The large perturbations will occur on the subset B

k

.

For the sake of clarity, we suppose from now that the following holds:



 

 

(A

k

, B

k

)

k

is a sequence of measurable partitions of [0, T ] ,

|y

₀^k

− y ¯

0

| + ku

^A,k

− uk ¯

∞

→ 0,

|B

k

| → 0,

(4.6)

where |B

k

| is the Lebesgue measure of B

k

. We set

v

^A,k

:= u

^A,k

− u ¯ and v

^B,k

:= u

^k

− u

^A,k

(4.7)

(16)

and we dene

δy

^k

:= y

^k

− y, ¯ y

^A,k

:= y[u

^A,k

, y

₀^k

], and z

^A,k

:= z[v

^A,k

, δy

^k₀

]. (4.8) Let us introduce some useful notations for the future estimates:

R

1,k

:= ku

^k

− uk ¯

1

+ |δy

₀^k

|, R

2,k

:= ku

^k

− uk ¯

2

+ |δy

^k₀

|, R

1,A,k

:= kv

^A,k

k

1

+ |δy

₀^k

|, R

2,A,k

:= kv

^A,k

k

2

+ |δy

₀^k

|, R

1,B,k

:= kv

^B,k

k

1

, R

2,B,k

:= kv

^B,k

k

2

.

(4.9)

Combining the Cauchy-Schwarz inequality and assumption (4.6), we obtain that

R

1,B,k

≤ R

2,B,k

|B

k

|

^1/2

= o(R

2,B,k

). (4.10) Note that by Gronwall's lemma,

kδy

^k

k

_∞

= O(R

1,k

) = O(R

2,k

) and kz

^A,k

k

_∞

= O(R

1,A,k

) = O(R

2,k

). (4.11) Note also that

kδy

^k

− (y

^A,k

− y)k ¯

_∞

= O(R

1,B,k

) = o(R

2,k

) (4.12) and since ky

^A,k

− (¯ y + z

^A,k

)k

_∞

= O(R

²_2,k

) ,

kδy

^k

− z

^A,k

k

_∞

= o(R

_2,k

). (4.13)

4.2 Result

We can now state the decomposition principle.

Theorem 4.2. Suppose that Assumptions 1, 2, and 3 hold. Let R > k¯ uk

∞

, let (u

^k

, y

^k

)

k

be a sequence of feasible controls satisfying (4.1) and (A

k

, B

k

)

k

satisfy (4.6). Then, for all λ = (β, Ψ, ν, µ) ∈ Λ

L

(¯ u, y) ¯ ,

β(φ(y

^k₀

, y

^k_T

) − φ(¯ y

₀

, y ¯

_T

)) ≥

¹₂

Ω[λ](v

^A,k

, z

^A,k

) +

Z

B_k

H [p

^λ_t

](t, u

^k_t

, y ¯

t

) − H [p

^λ_t

](t, u ˜

^k_t

, y ¯

t

)

dt + o(R

²_2,k

), (4.14) where Ω is dened by (2.38).

The proof is given at the end of the section, page 15. The basic idea to obtain a lower estimate of β(φ(y

0

, y

T

) − φ(¯ y

0

, y ¯

T

)) is classical: we dualize the constraints and expand up to the second order the obtained Lagrangian. However, the dualization of the mixed constraint is particular here, in so far as the nonpositive added term is the following:

Z

A_k

ν

t

(c(t, u

^A,k_t

, y

^k_t

) − c(t, u ¯

t

, y ¯

t

)) dt + Z

B_k

ν

t

(c(t, u ˜

^k_t

, y

^k_t

) − c(t, u ¯

t

, y ¯

t

)) dt, (4.15) where u ˜

^k

and u

^A,k

are dened by (4.2) and (4.5). In some sense, we do not dualize the mixed constraint when there are large perturbations of the control. By doing so, we prove that the contribution of the large perturbations is of the same order as the dierence of Hamiltonians appearing in (4.14). If we dualized the mixed constraint with the following term:

Z

T 0

ν

t

(c(t, u

^k_t

, y

^k_t

) − c(t, u ¯

t

, y ¯

t

)) dt, (4.16)

(17)

we would obtain for the contribution of large perturbations a dierence of augmented Hamilto- nians.

Let us x λ ∈ Λ

L

(¯ u, y) ¯ and let us consider the following two terms:

I

₁^k

= Z

T

0

−H

_y^a

[p

^λ_t

](t, u ¯

t

, y ¯

t

)δy

_t^k

dt +

Z

Ak

(H

^a

[p

^λ_t

](t, u

^A,k_t

, y

^k_t

) − H

^a

[p

^λ_t

](t, u ¯

t

, y ¯

t

)) dt (4.17a) +

Z

B_k

(H

^a

[p

^λ_t

](t, u ˜

^k_t

, y

_t^k

) − H

^a

[p

^λ_t

](t, u ¯

_t

, y ¯

_t

)) dt (4.17b) +

Z

Bk

(H [p

^λ_t

](t, u

^k_t

, y

_t^k

) − H [p

^λ_t

](t, u ˜

^k_t

, y

_t^k

)) dt (4.17c) and

I

₂^k

= − Z

[0,T]

(dµ

t

Dg(t, y ¯

t

)δy

_t^k

) +

ng

X

i=1

Z

∆^ε_i

(g

i

(t, y

_t^k

) − g

i

(t, y ¯

t

)) dµ

t,i

(4.18a)

+ X

τ∈Tg,i

1≤i≤ng

µ

i

(τ )(Θ

^ε_τ

(u

^k

, y

^k₀

) − Θ

^ε_τ

(¯ u, y ¯

0

)). (4.18b)

Lemma 4.3. Let R > k¯ uk

_∞

, let (u

^k

, y

^k

)

_k

be a sequence of feasible trajectories satisfying (4.1), and let (A

k

, B

k

)

k

satisfy (4.6). Then, for all λ ∈ Λ

L

(¯ u, y) ¯ , the following lower estimate holds:

β (φ(y

₀^k

, y

_T^k

)−φ(¯ y

₀

, y ¯

_T

))

≥

¹₂

D

²

Φ[λ](¯ y

0

, y ¯

T

)(z

^A,k₀

, z

_T^A,k

)

²

+ I

₁^k

+ I

₂^k

+ o(R

²_2,k

). (4.19) Proof. Let λ ∈ Λ

L

(¯ u, y) ¯ . In view of sign conditions for constraints and multipliers, we rst obtain that

βφ(y

₀^k

, y

_T^k

) − φ(¯ y

0

, y ¯

T

) ≥ Φ[β, Ψ](y

0

, y

T

) − Φ[β, Ψ](¯ y

0

, y ¯

T

) +

ng

X

i=1

Z

∆^ε_i

(g

i

(t, y

_t^k

) − g

i

(t, y ¯

t

)) dµ

i,t

+ X

τ∈Tg,i

1≤i≤ng

µ

i

(τ)(Θ

^ε_τ

(u

^k

, y

^k₀

) − Θ

^ε_τ

(¯ u, y ¯

0

))

+ Z

Ak

ν

t

(c(t, u

^A,k_t

, y

^k_t

) − c(t, u ¯

t

, y ¯

t

)) dt + Z

Bk

ν

t

(c(t, u ˜

^k_t

, y

^k_t

) − c(t, u ¯

t

, y ¯

t

)) dt. (4.20) Expanding the end-point Lagrangian up to the second order, and using (4.13), we obtain that

Φ[β, Ψ](y

^k₀

, y

^k_T

) − Φ[β, Ψ](¯ y

₀

, y ¯

_T

)

= DΦ[β, Ψ](¯ y

0

, y ¯

T

)(δy

₀^k

, δy

_T^k

) +

¹₂

D

²

Φ[β, Ψ](¯ y

0

, y ¯

T

)(δy

^k₀

, δy

^k_T

)

²

+ o(R

²_2,k

)

= p

^λ_T

δy

^k_T

− p

^λ₀

δy

₀^k

+

¹₂

D

²

Φ[λ](¯ y

0

, y ¯

T

)(z

^A,k₀

, z

_T^A,k

)

²

+ o(R

²_2,k

). (4.21)

(18)

Integrating by parts (see [3, Lemma 32]), we obtain that p

^λ_T

δy

_T^k

− p

^λ₀

δy

^k₀

=

Z

[0,T]

d p

^λ_t

δy

^k_t

+ p

^λ_t

δy ˙

^k_t

dt

= Z

T

0

− H

_y^a

(t, u ¯

t

, y ¯

t

)δy

_t^k

+ H (t, u

^k_t

, y

^k_t

) − H(t, u ¯

t

, y ¯

t

) dt

− Z

[0,T]

d µ

t

Dg(t, y ¯

t

)δy

^k_t

. (4.22)

The lemma follows from (4.20), (4.21), and (4.22).

A corollary of Lemma 4.3 is the following estimate, obtained with (4.2):

β(φ(y

^k₀

, y

_T^k

) − φ(¯ y

0

, y ¯

T

)) (4.23)

≥ Z

T

0

H [p

^λ_t

](t, u

^k_t

, y

_t^k

) − H [p

^λ_t

](t, u ˜

^k_t

, y

_t^k

)

dt + O(kδy

^k

k

_∞

)

= Z

T

0

H [p

^λ_t

](t, u

^k_t

, y ¯

_t

) − H [p

^λ_t

](t, u ¯

_t

, y ¯

_t

)

dt + O(kδy

^k

k

_∞

). (4.24) Proof of the decomposition principle. We prove Theorem 4.2 by estimating the two terms I

₁^k

and I

₂^k

obtained in Lemma 4.3.

B Estimation of I

₁^k

. Let show that

I

₁^k

= 1 2

Z

T 0

D

²

H

^a

[p

^λ_t

](t, u ¯

t

, y ¯

t

)(v

^A,k_t

, z

^A,k_t

)

²

dt +

Z

Bk

(H [p

^λ_t

](t, u

^k_t

, y ¯

t

) − H [p

^λ_t

](t, u ˜

^k_t

, y ¯

t

)) dt + o(R

²_2,k

). (4.25) Using (4.13) and the stationarity of the augmented Hamiltonian, we obtain that term (4.17a) is equal to

Z

Ak

H

_y^a

[p

^λ_t

](t, u ¯

t

, y ¯

t

)δy

^k_t

dt + 1

2 Z

A_k

D

²

H

^a

[p

^λ_t

](t, u ¯

_t

, y ¯

_t

)(v

^A,k_t

, z

^A,k_t

)

²

dt + o(R

²_2,k

). (4.26) Term (4.17b) is negligible compared to R

²_2,k

. Since

Z

B_k

(H [p

^λ_t

](t, u

^k_t

, y

_t^k

) − H [p

^λ_t

](t, u ˜

^k_t

, y

_t^k

)) dt

− Z

B_k

(H [p

^λ_t

](t, u

^k_t

, y ¯

_t

) − H [p

^λ_t

](t, u ˜

^k_t

, y ¯

_t

)) dt = O(|B

_k

|R

²_1,k

) = o(R

²_2,k

), (4.27) term (4.17c) is equal to

Z

Bk

(H [p

^λ_t

](t, u

^k_t

, y ¯

t

) − H [p

^λ_t

](t, u ˜

^k_t

, y ¯

t

)) dt + o(R

²_2,k

). (4.28)

(19)

The following term is also negligible:

Z

B_k

D

²

H

^a

[p

^λ_t

](t, u ¯

t

, y ¯

t

)(v

^A,k_t

, z

^A,k_t

)

²

dt = o(R

²_2,k

). (4.29) Finally, combining (4.17), (4.26), (4.28), and (4.29), we obtain (4.25).

B Estimation of I

₂^k

. Let us show that

I

₂^k

= 1 2

Z

[0,T]

dµ

_t

D

²

g(t, y ¯

_t

)(z

_t^A,k

)

²

− 1 2

X

τ∈Tg,i

1≤i≤ng

µ

_i

(τ) (Dg

⁽¹⁾_i

(τ, y ¯

τ

)z

^A,k_τ

)

²

g

_i⁽²⁾

(τ, u ¯

_τ

, y ¯

_τ

)

. (4.30)

Using (4.13), we obtain the following estimate of term (4.18a):

− X

τ∈Tg,i

1≤i≤n_g

Z

∆^ε_τ

Dg

_i

(t, y ¯

_t

)δy

_t^k

dµ

_i,t

+ 1 2

n_g

X

i=1

Z

∆^ε_i

D

²

g

_i

(t, y ¯

_t

)(z

_t^A,k

)

²

dµ

_t

+ o(R

²_2,k

). (4.31)

Remember that z

²

[v

^A,k

, δy

₀^k

] denotes the second-order linearization (2.34) and that the following holds:

ky

^A,k

− (¯ y + z[v

^A,k

, δy

^k₀

] + z

²

[v

^A,k

, δy

₀^k

])k

∞

= o(R

²_2,k

). (4.32) Using Lemma 3.2 and estimate (4.13), we obtain that for all i , for all τ ∈ T

g,i

,

Θ

^ε_τ

(u

^k

, y

^k₀

) − Θ

^ε_τ

(u

^A,k

, y

₀^k

)

= g

_i

(τ, y

_τ^k

) − g

_i

(τ, y

_τ^A,k

) + O(R

_1,B,k

(R

_1,B,k

+ R

_1,k

))

= Dg

_i

(τ, y ¯

_τ

)(y

_τ^k

− y

^A,k_τ

) + o(R

²_2,k

)

= Dg

_i

(τ, y ¯

_τ

)(δy

^k_τ

− z

^A,k_τ

− z

²_τ

[v

^A,k

, δy

₀^k

]) + o(R

²_2,k

). (4.33) By Lemma 3.1,

Θ

^ε_τ

(u

^A,k

, y

₀^k

) − Θ

^ε_τ

(¯ u, y ¯

₀

)

= Dg

i

(τ, y ¯

τ

)(z

_τ^A,k

+ z

_τ²

[v

^A,k

, δy

₀^k

]) + 1

2 D

²

g

i

(τ, y ¯

τ

)(z

_τ^A,k

)

²

− 1 2

(D

y

g

⁽¹⁾_i

(τ, y ¯

τ

)z

^A,k_τ

)

²

) g

⁽²⁾_i

(τ, u ¯

τ

, y ¯

τ

)

+ o(R

²_2,k

). (4.34) Recall that the restriction of µ

i

to ∆

^ε_τ

is a Dirac measure at τ . Summing (4.33) and (4.34), we obtain the following estimate for (4.18b):

X

τ∈Tg,i

1≤i≤n_g

h Z

∆^ε_τ

Dg

_i

(t, y ¯

_t

)δy

^k_t

+ 1

2 D

²

g

_i

(t, y ¯

_t

)(z

_t^A,k

)

²

dµ

_i,t

− 1 2

(Dg

_i⁽¹⁾

(τ, y ¯

τ

)z

_τ^A,k

)

²

) g

⁽²⁾_i

(τ, u ¯

τ

, y ¯

τ

)

i

+ o(R

²_2,k

). (4.35)

Combining (4.31) and (4.35), we obtain (4.30). Combining (4.25) and (4.30), we obtain the

result.

(20)

5 Quadratic growth for bounded strong solutions

We give in this section sucient second-order optimality conditions in Pontryagin form ensuring the quadratic growth property for bounded strong solutions. Our main result, Theorem 5.3, is proved with a classical approach by contradiction.

Assumption 4. There exists ε > 0 such that for all feasible trajectory (u, y) in (U × Y) with ky − yk ≤ ¯ ε , if (u, y) satises the mixed constraints, then there exists u ˆ such that

c(t, u ˆ

t

, y ¯

t

) ≤ 0, for a.a. t and ku − uk ˆ

_∞

= O(ky − yk ¯

_∞

). (5.1) This assumption is a metric regularity property, global in u and local in y . Note that the required property is dierent from (4.2).

Denition 5.1. A quadratic form Q on a Hilbert space X is said to be a Legendre form i it is weakly lower semi-continuous and if it satises the following property: if x

^k

* x weakly in X and Q(x

^k

) → Q(x) , then x

^k

→ x strongly in X .

Assumption 5. For all λ ∈ Λ

_P

(¯ u, y) ¯ , Ω[λ] is a Legendre form.

Remark 5.2. By [3, Lemma 21], this assumption is satised if for all λ ∈ Λ

P

(¯ u, y) ¯ , there exists γ > 0 such that for almost all t ,

γ ≤ D

_uu²

H

^a

[p

^λ_t

, ν

t

](t, u ¯

t

, y ¯

t

). (5.2) In particular, in the absence of mixed and control constraints, the quadratic growth of the Hamiltonian (5.4) implies (5.2).

For all R > k¯ uk

_∞

, we dene Λ

^R_P

(¯ u, y) = ¯

λ ∈ Λ

L

(¯ u, y) : ¯ for a.a. t , for all u ∈ U (t) with |u| ≤ R,

H[p

^λ_t

](t, u, y ¯

t

) − H[p

^λ_t

](t, u ¯

t

, y ¯

t

) ≥ 0 . (5.3) Note that Λ

_P

(¯ u, y) = ¯ ∩

_R>k¯_uk_∞

Λ

^R_P

(¯ u, y) ¯ . Remember that C

₂

(¯ u, y) ¯ is the critical cone in L

²

, dened by (2.35).

Theorem 5.3. Suppose that Assumptions 1-5 hold. If the following second-order sucient conditions hold: for all R > k uk ¯

∞

,

1. there exist α > 0 and λ

^∗

∈ Λ

^R_P

(¯ u, y) ¯ such that

( for a.a. t , for all u ∈ U (t) with |u| ≤ R,

H [p

^λ_t^∗

](t, u

t

, y ¯

t

) − H[p

^λ_t^∗

](t, u ¯

t

, y ¯

t

) ≥ α|u − u ¯

t

|

²₂

, (5.4) 2. for all (v, z) ∈ C

₂

\{0} , there exists λ ∈ Λ

^R_P

(¯ u, y) ¯ such that Ω[λ](v, z) > 0 ,

then the quadratic growth property for bounded strong solutions holds at (¯ u, y) ¯ .

Proof. We prove this theorem by contradiction. Let R > k¯ uk

_∞

, let us suppose that there exists a sequence (u

^k

, y

^k

)

k

of feasible trajectories such that ku

^k

k

_∞

≤ R , ky

^k

− yk ¯

_∞

→ 0 and

φ(y

₀^k

, y

^k_T

) − φ(¯ y

0

, y ¯

T

) ≤ o(ku

^k

− uk ¯

²₂

+ |y

₀^k

− y ¯

0

|

²

). (5.5)

We use the notations introduced in (4.9). Let λ

^∗

= (β

^∗

, Ψ

^∗

, ν

^∗

, µ

^∗

) ∈ Λ

^R_P

(¯ u, y) ¯ be such that (5.4)

holds.