Optimal control problems with horizon tending to infinity and lacking controllability assumption

(1)

HAL Id: inria-00585688

https://hal.inria.fr/inria-00585688

Submitted on 13 Apr 2011

HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.

Optimal control problems with horizon tending to infinity and lacking controllability assumption

Marc Quincampoix

To cite this version:

Marc Quincampoix. Optimal control problems with horizon tending to infinity and lacking controllability assumption. SADCO Kick off, Mar 2011, Paris, France. �inria-00585688�

(2)

Optimal control problems with horizon tending to inﬁnity and lacking controllability assumption

Marc Quincampoix Universit´e de Brest SADCO, March 2011

M. Q., & J. Renault, On Existence of a limit value in some non expansive optimal control problems,(submitted) (2009)

(3)

An Optimal Control Problem

V_t(y₀) := inf

u∈U

1 t

∫ _t

s=0

h(y(s, u, y₀), u(s))ds, where s 7→ y(s, u, y₀) denotes the solution to

y^′(s) = g(y(s), u(s)), y(0) = y₀.

g : IR^d × U → IR^d Lipschitz, U compact, g h bounded.

PROBLEM : Existence of a limit of V_t(y₀) as t → +∞.

No ergodicity condition here (Lions-Papanicolaou- Varad-

han, Arisawa-Lions, Bettiol, Alvarez-Bardi Capuzzo-Dolcetta, Artstein-Gaitsgory, Fathi...)The limit may depend on the

initial condition

(4)

Contents

1. Introduction and examples 2. A controllability approach

3. Existence of limit value in nonexpansive case 4. Generalisations

(5)

Introduction

Deﬁnition 1 The problem Γ(y₀) := (Γ_t(y₀))_t>0 has a limit value if

V (y₀) := lim

t→∞ V_t(y₀) = lim

t→∞ inf

u∈U

1 t

∫ _t

s=0

h(y(s, u, y₀), u(s))ds.

Deﬁnition 2 The problem Γ(y₀) has a uniform value if it has a limit value V (y₀) and if:

∀ε > 0, ∃u ∈ U, ∃t₀, ∀t ≥ t₀, 1 t

∫ _t

s=0

h(y(s, u, y₀), u(s))ds ≤ V (y₀) +ε.

(6)

Examples

• Example 1: here y ∈ IR² (seen as the complex plane i² = −1), there is no control

y^′(t) = i y(t), V_t(y₀) −−−→

t→∞

1 2π|y₀|

∫

|z|=|y₀| h(z)dz,

and since there is no control, the value is uniform.

• Example 2: in the complex plane again, but now g(y, u) = i y u, where u ∈ U a given bounded subset of IR, and h is continuous in y.

(7)

Assumptions and Notations







The function h : IR^d × U −→ IR is measurable and bounded

∃L ≥ 0, ∀(y, y^′) ∈ IR^2d, ∀u ∈ U, ∥g(y, u) − g(y^′, u)∥ ≤ L∥y − y^′∥

∃a > 0, ∀(y, u) ∈ IR^d × U, ∥g(y, u)∥ ≤ a(1 + ∥y∥)

(HK) ∃ a compact invariant set K for the control system Average cost induced by u between 0 and t by:

γ_t(y₀, u) := 1 t

∫ _t

0

h(y(s, u, y₀), u(s))ds, V_t(y₀) = inf

u∈U γ_t(y₀, u).

for m ≥ 0, γ_m,t(y₀, u) := 1 t

∫ _m+t

m

h(y(s, u, y₀), u(s))ds,

(8)

A Classical Controlability Approach

Suppose that ∃T > 0, ∀(y₁, y₂) ∈ K, ∃t ≤ T, ∀u ∈ U, ∃v ∈ U, ∥y(t, u, y₁) − y(t, v, y₂)∥ = 0.

Then for any t ≥ T and any Ψ ∈ C(K) the maps y₀ 7→ V_t^Ψ(y₀) := inf

u∈U

∫ _t

s=0

h(y(s, u, y₀))ds + Ψ(y(t, u, y₀)),

are equicontinuous with a modulus of continuity which does not depend on t and Ψ (but only on the Lipschitz constants of h and f).

(9)

Thus V_t^Ψ is more regular than Ψ. This also could be obtained and generalized using HJB results with coercive concave hamiltonians.

• Example 3: g(y, u) = −y+u, where u ∈ U a given bounded subset of IR^d, and h is continuous in y.

• Example 4: in IR². The initial state is y₀ = (0, 0) and U = [0, 1], and the cost is h(y) = 1 − y₁(1 − y₂).

y^′(s) = g(y(s), u(s)) =

( u(s)(1 − y₁(s)) u²(s)(1 − y₁(s))

) .

One can easily observe that the reachable set G(y₀) ⊂ [0, 1]². If u = ε > 0 constant, y₁(t) = 1 − exp(−εt) and y₂(t) = εy₁(t).

So we have V_t(y₀) −−−→

t→∞ 0. Existence of a Uniform Value

(10)

No ergodicity :

{y ∈ [0, 1]², lim

t→∞ V_t(y) = lim

t→∞ V_t(y₀)} = [0, 1] × {0},

and starting from y₀ it is possible to reach no point in (0, 1] × {0}.

(11)

A ﬁrst result in Nonexpansive case

Denote by G(y₀) := {y(t, u, y₀), t ≥ 0, u ∈ U} the reachable set Theorem 3 h(y, u) = h(y) only depends on the state,

G(y₀) is bounded (invariant),

∀(y₁, y₂) ∈ G(y₀)², sup_u_∈_U inf_v_∈_U < y₁ − y₂, g(y₁, u) − g(y₂, v) >≤ 0.

Then Γ(y₀) has a limit value V_t(y₀) −−−−→

t→+∞ V ^∗(y₀). The con- vergence of (V_t)_t to V ^∗ is uniform over G(y₀), and the value of Γ(y₀) is uniform.

(12)

A Crucial Technical Lemma

We deﬁne V ⁻(y₀) := lim inf_t_→₊_∞ V_t(y₀), V ⁺(y₀) := lim sup_t_→₊_∞ V_t(y₀).

Lemma 4 For every m₀ in IR₊, we have:

sup

t>0

minf≤m₀ V_m,t(y₀) ≥ V ⁺(y₀) ≥ V ⁻(y₀) ≥ sup

t>0

minf≥0 V_m,t(y₀).

Deﬁnition 5

V ^∗(y₀) = sup

t>0

minf≥0 V_m,t(y₀).

(13)

Sketch of the proof of the ﬁrst result

Lemma 6 ∀T > 0, ∀ε > 0, ∀(y₁, y₂) ∈ G(y₀)², ∀u ∈ U, ∃v ∈ U,

∀t ∈ [0, T], ∥y(t, u, y₁) − y(t, v, y₂)∥ ≤ ∥y₁ − y₂∥ + ε.

Proposition 7 ∀ε > 0, ∃m₀, sup

t>0

minf≤m₀ V_m,t(y₀) ≤ sup

t>0

minf≥0 V_m,t(y₀) + 2ε

(14)

• (V_T(y₀))_{T >0} is equicontinuous (Lemma 6 +continuity of h)

• Deﬁne G^m(y₀) := {y(t, u, y₀), t ≤ m, u ∈ U} the reachable set in time m.

∀ε, ∃m₀, ∀z ∈ G(y₀), ∃z^′ ∈ G^m⁰(y₀) such that ∥z − z^′∥ ≤ ε.

• We have inf_m_≥₀ V_m,t(y₀) = inf{V_t(z), z ∈ G(y₀)}, and inf_m_≤_m₀ V_m,t(y₀) = inf{V_t(z), z ∈ G^m⁰(y₀)}. By steps 1 and 2 inf{V_t(z), z ∈ G^m⁰(y₀)} ≤

inf{V_t(z), z ∈ G(y₀)} + 2ε.

(15)

Generalizations

Theorem 8 ∃ C¹ ∆ : IR^d × IR^d −→ IR₊, vanishing on the diagonal (∆(y, y) = 0) and symmetric (∆(y₁, y₂) = ∆(y₂, y₁) ) h(y, u) = h(y) only depends on the state,

G(y₀) is bounded (invariant),

∀(y₁, y₂) ∈ G(y₀)², ∀u ∈ U,∃v ∈ U.

< g(y₁, u), ∂

∂y₁∆(y₁, y₂) > + < g(y₂, v), ∂

∂y₂∆(y₁, y₂) >≤ 0 Then Γ(y₀) has a limit value V_t(y₀) −−−−→

t→+∞ V ^∗(y₀). The con- vergence of (V_t)_t to V ^∗ is uniform over G(y₀), and the value of Γ(y₀) is uniform.

(16)

• This result can be applied to example 4, with ∆(y₁, y₂) =

∥y₁ − y₂∥1 (L¹-norm). In this example, we have for each y₁, y₂ and u: ∆(y₁ + tg(y₁, u), y₂ + tg(y₂, u)) ≤ ∆(y₁, y₂) as soon as t ≥ 0 is small enough.

Example 4: in IR². The initial state is y₀ = (0, 0) and U = [0, 1], and the cost is h(y) = 1 − y₁(1 − y₂).

y^′(s) = g(y(s), u(s)) =

( u(s)(1 − y₁(s)) u²(s)(1 − y₁(s))

) .

(17)

Further Generalizations

Theorem 9 (H1) h is uniformly continuous in y on Z¯ uni- formly in u. And for each y in Z¯, either h does not depend on u or the set {(g(y, u), h(y, u)) ∈ IR^d×[0, 1], u ∈ U} is closed.

(H2): ∃∆ : IR^d × IR^d −→ IR₊, vanishing on the diagonal (∆(y, y) = 0) and symmetric (∆(y₁, y₂) = ∆(y₂, y₁) ), and a unniformly continuous function αˆ : IR₊ −→ IR₊ s.t.

ˆ

α(t) −−−→

t→0 0 satisfying:

a) ∀ sequence (z_n)_n ⊂ Z, ∀ε > 0, ∃n, lim inf_p ∆(z_n, z_p) ≤ ε.

b) ∀(y₁, y₂) ∈ Z¯², ∀u ∈ U, ∃v ∈ U such that

D ↑ ∆(y₁, y₂)(g(y₁, u), g(y₂, v)) ≤ 0, h(y₂, v) − h(y₁, u) ≤ α(∆(yˆ ₁, y₂)).

Then Γ(y₀) has a uniform value lim_t_→∞ V_t = V ^∗.

(18)

Remarks

• Although ∆ may not satisfy the triangular inequality nor the separation property, it may be seen as a “distance”

adapted to the problem Γ(y₀).

• D ↑ is the contingent epi-derivative (which reduces to the upper Dini derivative if ∆ is Lipschitz) D↑∆(z)(α) = lim inf_t_→₀+,α^′→α 1

t(∆(z + tα^′) − ∆(z)). If ∆ is diﬀerentiable, the condition D ↑ ∆(y₁, y₂)(g(y₁, u), g(y₂, v)) ≤ 0 just reads:

< g(y₁, u), _∂y^∂

1∆(y₁, y₂) > + < g(y₂, v), _∂y^∂

2∆(y₁, y₂) >≤ 0.

(19)

• The assumption: “{(g(y, u), h(y, u)) ∈ IR^d × [0, 1], u ∈ U} closed” could be checked for instance if U is compact and if h and g are continuous with respect to (y, u).

• H2a) is a precompacity condition. It is satisﬁed as soon as G(y₀) is bounded. cf Renault 2008

• Notice that H2 is satisﬁed with ∆ = 0 if we are in the trivial case where inf_u h(y, u) is constant.

(20)

On Uniform Value

Deﬁnition 10 Γ(y₀) has a uniform value if ∃V (y₀) and if:

∀ε > 0, ∃u ∈ U, ∃t₀, ∀t ≥ t₀, 1 t

∫ _t

s=0

h(y(s, u, y₀), u(s))ds ≤ V (y₀) +ε.

• Example 5: in IR², y₀ = (0, 0), control set U = [0, 1], y^′(t) = (y₂(t), u(t)), and h(y₁, y₂) = 0 if y₁ ∈ [1, 2], = 1 otherwise.

We have u(s) = y₂^′(s) = y₁^′′(s),

Interpretation: u ”acceleration”, y₂ ”speed”, y₁ the ”po- sition”.

If u = ε constant, then y₂(t) = √

2εy₁(t) ∀t ≥ 0.

Limit Value: V_T(y₀) −−−−→

T→∞ 1/2 No Uniform Value.

(21)

Optimal control with discounted facteur λ → 0⁺

We deﬁne Θ_λ(y₀) := inf

u∈U

∫ ₊_∞

s=0

λe⁻^λsh(y(s, u, y₀), u(s))ds,

Theorem 11 (Oliu-Barton Vigeral 2010) the following uni- form limit in K exists lim_λ_→₀+ Θ_λ(y₀)

iﬀ

the following uniform limit in K exists lim_t_→∞ V_t(y₀) Question Application to diﬀerent concepts of means

(22)

Open Problems

Diﬀerential Game at horizon t:

V_t(y₀) := ” inf

u∈U sup

v∈V ” 1 t

∫ _t

s=0

h(y(s, u, v, y₀), u(s), v(s))ds, where s 7→ y(s, u, y₀) denotes the solution to

y^′(s) = g(y(s), u(s), v(s))), y(0) = y₀.

OPEN PROBLEM : Existence of a limit of V_t(y₀) as t → ∞. Only Partial results:

• When the Hamiltonian is coercive (hence ergodicity and the limit is y independent)Alvarez-Bardi ...

• For nonconvex and non coercive Hamiltonian in IR² Cardaliaguet

(23)

Averaging Problem for singularly perturbed system

{ i) x^′(s) = f(

x(s), y(s), u(s))

, x(0) = x, s ∈ [0, T] ii) εy^′(s) = g(

x(s), y(s), u(s))

y(0) = y, (1)

Change of variable τ = _ε^t, (X(τ), Y (τ), U(τ)) = (x(ετ), y(ετ), u(ετ)) { X^′(τ) = εf(

X(τ), Y (τ), U(τ))

, X(0) = x, τ ∈ [0, ^T_ε ] Y ^′(τ) = g(

X(τ), Y (τ), U(τ))

, Y (0) = y. (2)

Take ε = 0 in (2).We have the following associated system:

y^′(τ) = g(

x, y(τ), u(τ))

, y(0) = y, (3)

y_x(·, u, y) denotes the unique solution of (3).

(24)

Averaging method

We suppose that f and g are Lipschitz and there is a compact set M × N which is is invariant by (1) for all ε.

A(x, y, S, u) = _S¹ ∫ _S

0 f(x, y_x(τ, u, y), u(τ))dτ , F(x, y, S) = {A(x, y, S, u); u ∈ U}

Theorem 12 Gaitsgory, Grammel If ∃γ : IR → IR₊ with lim_S_→₊_∞ γ(S) = 0 and a Lipschitz set-valued map F¯ : M → IR^bb with compact convex nonempty values such that

d(co clF(x, y, S), F(x)) ≤ γ(S), ∀(x, y) ∈ M × N, ∀S > 0, then ∀x, y the solutions of the diﬀerential inclusion

x^′(s) ∈ F(

x(s))

, x(0) = x. (4)

approximate the solutions of the singularly perturbed sys- tem (1) in the following sense:

(25)

For any ε > 0, and any T > 0 there exists M(T, ε) > 0 with lim_ε_→₀ M(T, ε) = 0 such that

a) For any family of solutions (

x_ε(·), y_ε(·))

to (1) there ex- ists a solution x(·) to (4) such that

sup

t∈[0,T]

∥x_ε(t) − x(t)∥ ≤ M(T, ε).

b) Conversely ﬁx x(·) a solution to (4) then for any ε small enough there exists a solution (

x_ε(·), y_ε(·))

to (1) such that sup

t∈[0,T]

∥x_ε(t) − x(t)∥ ≤ M(T, ε).

cf also Wattbled, M.Q Wattbled ...

QUESTION ases and conditions where F may depend on y.

(26)

References

[1] O. Alvarez, M. Bardi Ergodicity, Stabilization, and sin- gular perturbations for Bellman-Isaacs equations To appear in Mem AMS.

[2] Z. Artstein, and V. Gaitsgory, The value function of singularly perturbed control systems, Appl. Math. Op- tim., 41 (2000), 425-445.

[3] Arisawa, M. and P.L. Lions (1998) Ergodic problem for the Hamilton Jacobi Belmann equations II, Ann. Inst.

Henri Poincar´e, Analyse Nonlin´eaire, 15 ,1 , 1–24.

[4] Arisawa, M. and P.L. Lions (1998) On ergodic stochas- tic control. Com. in partial diﬀerential equations, 23, 2187–2217.

(27)

[5] Bettiol, P. (2005) On ergodic problem for Hamilton- Jacobi-Isaacs equations ESAIM: Cocv, 11, 522–541.

[6] Cardaliaguet P. Ergodicity of Hamilton-Jacobi equa- tions with a non coercive non convex Hamiltonian in IR²/Z² preprint [hal-00348219 - version 1] (18/12/2008) [7] Frankowska, H., Plaskacz, S. and Rzezuchowski T.

(1995): Measurable Viability Theorems and Hamilton- Jacobi-Bellman Equation, J. Diﬀ. Eqs., 116, 265-305.

[8] V. Gaitsgory, Use of Averaging Method in Control Problems, Diﬀerential Equations (Translated from Rus- sian), 46 (1986) pp. 1081-1088.

[9] V. Gaitsgory, M. Quincampoix Linear programming analysis of deterministic inﬁnite horizon optimal con-

(28)

trol problems (discounting and time averaging cases), to appear in Siam Journal of Control and Opt.

[10] G. Grammel, Averaging of singularly perturbed sys- tems, Nonlinear Analysis Theory, 28, 11 (1997), 1851- 1865.

[11] Lions P.-L. , Papanicolaou G. , Varadhan S.R.S., Ho- mogenization of Hamilton- Jacobi Equations, unpub- lished work.

[12] Quincampoix, M. and F. Watbled (2003) Averaging methods for discontinuous Mayer’s problem of singu- larly perturbed control systems. Nonlinear analysis, 54, 819–837.

(29)

[13] Quincampoix, M. and J. Renault (submitted) On Ex- istence of a limit value in some non expansive optimal control problems,

[14] Renault, J. (2007) Uniform value in Dynamic Pro- gramming. Cahier du Ceremade 2007-1. arXiv : 0803.2758.

(30)

Thank You for your Attention