• Aucun résultat trouvé

Optimal control under probability constraint

N/A
N/A
Protected

Academic year: 2021

Partager "Optimal control under probability constraint"

Copied!
33
0
0

Texte intégral

(1)

HAL Id: inria-00585861

https://hal.inria.fr/inria-00585861

Submitted on 14 Apr 2011

HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci-entific research documents, whether they are pub-lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.

Optimal control under probability constraint

Pierre Carpentier, Jean-Philippe Chancelier, Guy Cohen

To cite this version:

Pierre Carpentier, Jean-Philippe Chancelier, Guy Cohen. Optimal control under probability con-straint. SADCO Kick off, Mar 2011, Paris, France. �inria-00585861�

(2)

Problem formulation Modeling improvement Stochastic Arrow-Hurwicz algorithm Numerical results

Optimal control under probability constraint

P. Carpentier — ENSTA ParisTech

J.-P. Chancelier — ´Ecole des Ponts ParisTech

G. Cohen — ´Ecole des Ponts ParisTech



This work has been supported by

Thal`es-Al´enia-Space (Cannes) and CNES (Toulouse)

SADCO Workshop

(3)

Problem formulation Modeling improvement Stochastic Arrow-Hurwicz algorithm Numerical results

Presentation outline

1 Problem formulation

2 Modeling improvement

3 Stochastic Arrow-Hurwicz algorithm

(4)

Problem formulation

Modeling improvement Stochastic Arrow-Hurwicz algorithm Numerical results

Satellite model and deterministic optimization problem Engine failure

Stochastic formulation

1 Problem formulation

Satellite model and deterministic optimization problem Engine failure

Stochastic formulation

2 Modeling improvement

3 Stochastic Arrow-Hurwicz algorithm

(5)

Problem formulation

Modeling improvement Stochastic Arrow-Hurwicz algorithm Numerical results

Satellite model and deterministic optimization problem

Engine failure Stochastic formulation

Satellite model

dr dt = v , dv dt = −µ r kr k3 + F mκ, (1a) dm dt = − T g0Isp δ . (1b)

(1a) : 6-dimensional state vector (position r and velocityv).

(1b) : 1-dimensional state vector (massm including fuel).

κinvolves the direction cosines of the thrust and the on-off switch

δ of the engine (3 controls), and µ, F , T , g0, Isp are constants. The deterministic control problem is to drive the satellite from the initial condition at ti to a known final position rf and velocity vf

(6)

Problem formulation

Modeling improvement Stochastic Arrow-Hurwicz algorithm Numerical results

Satellite model and deterministic optimization problem

Engine failure Stochastic formulation

Deterministic optimization problem

Using equinoctial coordinates for the position and velocity ; state vector x ∈ R7,

and cartesian coordinates for the thrust of the engine

; control vector u ∈ R3,

the deterministic optimization problem is written as follows: min u(·) K x (tf)  (2a) subject to: x (ti) = xi, • x (t) = f x (t), u(t) , (2b) ku(t)k ≤ 1 ∀t ∈ [ti, tf] , (2c) C x (tf) = 0 . (2d)

(7)

Problem formulation

Modeling improvement Stochastic Arrow-Hurwicz algorithm Numerical results

Satellite model and deterministic optimization problem

Engine failure

Stochastic formulation

Engine failure

Sometimes, the engine may fail to work when needed: the satellite drift away from the deterministic optimal trajectory. After the engine control is recovered, it is not always possible to drive the satellite to the final target at tf.

By anticipating such possible failures and by modifying the trajectory followed before any such failure occurs, one may increase the possibility of eventually reaching the target. But such a deviation from the deterministic optimal trajectory results in a deterioration of the economic performance. The problem is thus to balance the increased probability of eventually reaching the target despite possible failures against the expected economic performance, that is, to quantify the price of safety one is ready to pay for.

(8)

Problem formulation

Modeling improvement Stochastic Arrow-Hurwicz algorithm Numerical results

Satellite model and deterministic optimization problem Engine failure

Stochastic formulation

Stochastic formulation (1)

A failure is modeled using two random variables: tp : random initial time of the failure,

td : random duration of the failure.

For every realization (tpξ, tdξ):

1 u(·) denotes the control used prior to any failure

; u is defined over [ti, tf] but implemented over [ti, tpξ] and corresponds to an open-loop control,

2 the control is 0 in [tpξ, tpξ+ tξ

d],

3 vξ(·) denotes the control used after the end of the failure

; vξ is defined over [tξ

p+ tdξ, tf] (if nonempty) and corresponds to a closed-loop strategy v. The satellite dynamics in the stochastic formulation writes:

xξ(ti) = xi, •

(9)

Problem formulation

Modeling improvement Stochastic Arrow-Hurwicz algorithm Numerical results

Satellite model and deterministic optimization problem Engine failure

Stochastic formulation

Stochastic formulation (2)

The problem is to minimize the expected cost (fuel consumption) w.r.t. the open-loop control u and the closed-loop strategy v,

the probability to hit the target at time tf being at least p.

min u(·) E  min vξ(·)K x ξ(t f)  (3a) subject to: xξ(ti) = xi, • xξ(t) = fξ xξ(t), u(t), vξ(t) , (3b) ku(t)k ≤ 1 ∀t ∈ [ti, tf] , kvξ(t)k ≤ 1 ∀t ∈ [tpξ+ t ξ d, tf] , (3c) P  C xξ(tf) = 0  ≥ p. (3d)

(10)

Problem formulation

Modeling improvement

Stochastic Arrow-Hurwicz algorithm Numerical results Probability as an expectation Cost refinement Final formulation 1 Problem formulation 2 Modeling improvement Probability as an expectation Cost refinement Final formulation

3 Stochastic Arrow-Hurwicz algorithm

(11)

Problem formulation

Modeling improvement

Stochastic Arrow-Hurwicz algorithm Numerical results Probability as an expectation Cost refinement Final formulation

Probability as an expectation

Let I(y ) = ( 1 if y = 0, 0 otherwise. Then P  C xξ(tf) = 0  = EI C xξ(tf)   . Thus, Problem (3) can (shortly) be written:

min u(·) E  min vξ(·)K x ξ(t f)  (4a) s.t. E  I C xξ(tf)   ≥ p . (4b)

Formulation (4) opens the possibility to make use of a stochastic

(12)

Problem formulation

Modeling improvement

Stochastic Arrow-Hurwicz algorithm Numerical results

Probability as an expectation

Cost refinement

Final formulation

Missing the target should not contribute to the cost

Whenever a failure occurred, setting vξ ≡ 0 over [tpξ+ tdξ, tf] generally makes the satellite misses the target and does not contribute to the probability constraint,

but it forces fuel consumption to its minimum and contributes to minimize the cost function.

This yields an artificially good expected cost; however, one is not interested in what happens whenever the whole mission has failed. Therefore, a better formulation is to register only scenarios when the target is effectively hit; instead of the original expected cost in (4), we thus prefer to deal with a conditional expected cost:

E  K xξ(tf)  C x ξ(t f) = 0  = E  K xξ(tf) × I C xξ(tf)   E  I C xξ(t f)   .

(13)

Problem formulation

Modeling improvement

Stochastic Arrow-Hurwicz algorithm Numerical results

Probability as an expectation

Cost refinement

Final formulation

Conditional expected cost

The problem is reformulated as

min u(·),v(·) E  K xξ(tf) × I C xξ(tf)   E  I C xξ(tf)   (5a) s.t. E  I C xξ(tf)   ≥ p . (5b)

Such a formulation is however not well-suited for the stochastic Arrow-Hurwicz algorithm:

(14)

Problem formulation

Modeling improvement

Stochastic Arrow-Hurwicz algorithm Numerical results

Probability as an expectation

Cost refinement

Final formulation

An useful lemma

Using compact notation, Problem (5) is:

min

u

J(u)

Θ(u) s.t. Θ(u) ≥ p , (6) in which J and Θ assume positive values.

1 If u]is a solution of (6) and if Θ(u]) = p, then u]is also a solution of

min

u J(u) s.t. Θ(u) ≥ p . (7) 2 Conversely, if u]is a solution of (7), and if an optimal Kuhn-Tucker

multiplier β]satisfies the condition

β]≥ J(u

]

) Θ(u]) ,

(15)

Problem formulation

Modeling improvement

Stochastic Arrow-Hurwicz algorithm Numerical results

Probability as an expectation

Cost refinement

Final formulation

Back to a cost in expectation

Finally, instead of (5) we aim at solving

min u(·),v(·)E  K xξ(tf) × I C xξ(tf)   (8a) s.t. E  I C xξ(tf)   ≥ p . (8b)

(16)

Problem formulation

Modeling improvement

Stochastic Arrow-Hurwicz algorithm Numerical results

Probability as an expectation Cost refinement

Final formulation

Final formulation and associated Lagrangian

min u(·)  E  min vξ(·)K x ξ(t f) × I C xξ(tf)    s.t. µ p − E  I C xξ(tf)   ≤ 0 .

Assuming there exists a saddle point for the associated Lagrangian, we have finally to solve

max µ≥0 minu(·)  µp + Emin vξ(·) K x ξ(t f)−µ×I C xξ(tf)    .

Remark: it is convenient to put apart the no-failure event, namelytp≥ tf

in the problem formulation. For the sake of simplicity, we do not present this last problem improvement. We will denote byπf the probability of this event.

(17)

Problem formulation Modeling improvement

Stochastic Arrow-Hurwicz algorithm

Numerical results

Algorithm overview

Downstream closed-loop problem Smoothing function I

Solving the resulting approximated problem

1 Problem formulation

2 Modeling improvement

3 Stochastic Arrow-Hurwicz algorithm

Algorithm overview

Downstream closed-loop problem Smoothing function I

Solving the resulting approximated problem

(18)

Problem formulation Modeling improvement

Stochastic Arrow-Hurwicz algorithm

Numerical results

Algorithm overview

Downstream closed-loop problem Smoothing function I

Solving the resulting approximated problem

Algorithm overview (1)

In order to solve max µ≥0 minu(·) ( µ p + Emin vξ(·) K x ξ(t f) − µ × I C xξ(tf)   | {z } W (u, µ, ξ)  ) . that is, max µ≥0 minu(·) n µ p + E W (u, µ, ξ)o ,

(19)

Problem formulation Modeling improvement

Stochastic Arrow-Hurwicz algorithm

Numerical results

Algorithm overview

Downstream closed-loop problem Smoothing function I

Solving the resulting approximated problem

Algorithm overview (2)

Arrow-Hurwicz algorithm At iteration k, 1 draw a failure ξk = (tξ k p , tξ k

d ) according to its probability law,

2 update u(·): uk+1= ΠBuk− εk uW (uk, µk, ξk)  , 3 update µ: µk+1 = max  0, µk + ρk p + ∇µW (uk+1, µk, ξk)  .

(20)

Problem formulation Modeling improvement

Stochastic Arrow-Hurwicz algorithm

Numerical results

Algorithm overview

Downstream closed-loop problem

Smoothing function I

Solving the resulting approximated problem

Downstream closed-loop problem

Consider the inner optimization problem: W (u, ξ, µ) = min vξ(·) n K xξ(tf) − µ ×I C xξ(tf)  o ,

If reaching the target is impossible (C xξ(tf) 6= 0 ∀vξ),

the best thing to do is to stop immediately: W ≡ 0. If reaching the target is possible but too expensive (that

is K xξ(tf) ≥ µ), the best thing to do is again to stop

immediately: W ≡ 0.

Therefore, it must be checked that one can reach the target while maintaining fuel consumption below a threshold, and

the corresponding optimal control v∗ξ(·) must be computed.

At every iteration k, we must evaluate function W as well as its derivatives w.r.t. u(.) and µ. But W is not differentiable!

(21)

Problem formulation Modeling improvement

Stochastic Arrow-Hurwicz algorithm

Numerical results

Algorithm overview

Downstream closed-loop problem

Smoothing function I

Solving the resulting approximated problem

First difficulty: I is not a smooth function

I(y ) = ( 1 if y = 0, 0 otherwise, Ir(y ) =     1 −yr22 2 if y ∈ [−r , r ], 0 otherwise. 0.2 0.4 0.6 0.8 1.0 0.2 0.4 0.6 0.8 1.0

There are specific rules to drive rk to 0 as the iteration number

k goes to infinity in order to obtain the best asymptotic Mean Quadratic Error of the gradient estimates (see [1]).

(22)

Problem formulation Modeling improvement

Stochastic Arrow-Hurwicz algorithm

Numerical results

Algorithm overview

Downstream closed-loop problem Smoothing function I

Solving the resulting approximated problem

Second difficulty: solving the approximated problem

The approximated closed-loop problem to solve at each iteration is: Wr(uk, ξk, µk) = min vξ(·) n K xξ(tf) − µk ×Ir C xξ(tf)  o .

In this setting, we have to check if the target is reachedup to r.

In practice, the solution of the approximated problem is derived from the resolution of two standard optimal control problems.

(23)

Problem formulation Modeling improvement Stochastic Arrow-Hurwicz algorithm

Numerical results

Brief description

Results for different values of p The price of safety

1 Problem formulation

2 Modeling improvement

3 Stochastic Arrow-Hurwicz algorithm

4 Numerical results

Brief description

Results for different values of p The price of safety

(24)

Problem formulation Modeling improvement Stochastic Arrow-Hurwicz algorithm

Numerical results

Brief description

Results for different values of p The price of safety

Mission description

Interplanetary mission (Earth-Mars trajectory): duration of the mission: 450 days,

tp: exponential distribution s.t. P tp≥ tf =πf ≈ 0.58, td: exponential distribution s.t. P 2 ≤ td≤ 7 ≈ 0.80. Comp. normale w Comp. tangentielle s Comp. radiale q 0 1 2 3 4 5 6 7 8 9 −0.4 −0.2 0.0 0.2 0.4 0.6 0.8 1.0

Using normalized units: ti= 0.69 and tf = 8.73.

The deterministic optimal control has a “bang–off–bang” shape. Along the deterministic optimal path, the probability to recover a failure is:

(25)

Problem formulation Modeling improvement Stochastic Arrow-Hurwicz algorithm

Numerical results

Brief description

Results for different values of p The price of safety

Parameters tuning

Gradient step length:

εk = a

b +k , ρ

k = c

d +k ,

usual for a stochastic gradient algorithm.

Smoothing parameter:

rk = α

β +k13

,

(26)

Problem formulation Modeling improvement Stochastic Arrow-Hurwicz algorithm

Numerical results

Brief description

Results for different values of p

The price of safety

Masse finale / iterations

0 10000 20000 30000 40000 50000 60000 70000 80000 90000 100000 0.675 0.676 0.677 0.678 0.679 0.680 0.681

Multiplicateur probabilite / iterations

0 10000 20000 30000 40000 50000 60000 70000 80000 90000 100000 0.00 0.05 0.10 0.15 0.20 0.25 0.30 0.35 Comp. normale w Comp. tangentielle s Comp. radiale q 0 1 2 3 4 5 6 7 8 9 −0.4 −0.2 0.0 0.2 0.4 0.6 0.8 1.0

(27)

Problem formulation Modeling improvement Stochastic Arrow-Hurwicz algorithm

Numerical results

Brief description

Results for different values of p

The price of safety

Masse finale / iterations

0 10000 20000 30000 40000 50000 60000 70000 80000 90000 100000 0.674 0.675 0.676 0.677 0.678 0.679 0.680 0.681

Multiplicateur probabilite / iterations

0 10000 20000 30000 40000 50000 60000 70000 80000 90000 100000 0.314 0.316 0.318 0.320 0.322 0.324 0.326 Comp. normale w Comp. tangentielle s Comp. radiale q 0 1 2 3 4 5 6 7 8 9 −0.4 −0.2 0.0 0.2 0.4 0.6 0.8 1.0

(28)

Problem formulation Modeling improvement Stochastic Arrow-Hurwicz algorithm

Numerical results

Brief description

Results for different values of p

The price of safety

Masse finale / iterations

0 10000 20000 30000 40000 50000 60000 70000 80000 90000 100000 0.673 0.674 0.675 0.676 0.677 0.678 0.679 0.680 0.681

Multiplicateur probabilite / iterations

0 10000 20000 30000 40000 50000 60000 70000 80000 90000 100000 0.318 0.320 0.322 0.324 0.326 0.328 0.330 0.332 Comp. normale w Comp. tangentielle s Comp. radiale q 0 1 2 3 4 5 6 7 8 9 −0.4 −0.2 0.0 0.2 0.4 0.6 0.8 1.0

(29)

Problem formulation Modeling improvement Stochastic Arrow-Hurwicz algorithm

Numerical results

Brief description

Results for different values of p

The price of safety

Masse finale / iterations

0 10000 20000 30000 40000 50000 60000 70000 80000 90000 100000 0.674 0.675 0.676 0.677 0.678 0.679 0.680 0.681

Multiplicateur probabilite / iterations

0 10000 20000 30000 40000 50000 60000 70000 80000 90000 100000 0.320 0.325 0.330 0.335 0.340 0.345 0.350 0.355 0.360 0.365 Comp. normale w Comp. tangentielle s Comp. radiale q 0 1 2 3 4 5 6 7 8 9 −0.4 −0.2 0.0 0.2 0.4 0.6 0.8 1.0

(30)

Problem formulation Modeling improvement Stochastic Arrow-Hurwicz algorithm

Numerical results

Brief description

Results for different values of p

The price of safety

Fuel consumption versus probability level

Consommation sans panne / Probabilite

0.85 0.90 0.95 1.00 0.3204 0.3206 0.3208 0.3210 0.3212 0.3214 0.3216 0.3218 0.3220 0.3222

(31)

Problem formulation Modeling improvement Stochastic Arrow-Hurwicz algorithm

Numerical results

Brief description

Results for different values of p

The price of safety

Conclusion

Main conclusion

We are able to deal with probability constraints in the optimal

control framework.

Future works

From the theoretical point of view:

existence of a saddle point for the constrained problem, smoothing process (results available only for inequality constraints).

From the numerical point of view:

efficient solver for the downstream problem, computer parallelization.

(32)

Laetitia Andrieu, Guy Cohen and Felisa Vazquez-Abad.

Stochastic Programming with Probability.

arXiv, math.OC 0708.0281, 2007.

Jean-Christophe Culioli and Guy Cohen.

Decomposition/Coordination Algorithms in Stochastic Optimization.

SIAM Journal on Control and Optimization, 28, 1372-1403, 1990.

Jean-Christophe Culioli and Guy Cohen.

Optimisation stochastique sous contraintes en esp´erance.

CRAS, t.320, S´erie I, pp. 75-758, 1995.

Thierry Dargent.

Transfert d’orbite optimal par application du principe du minimum.

Logiciel T 3D, version 2, 19 mars 2004.

Andr´as Pr´ekopa.

Stochastic Programming.

(33)

Problem formulation Modeling improvement Stochastic Arrow-Hurwicz algorithm

Numerical results

Brief description

Results for different values of p

The price of safety

Références

Documents relatifs

L'ampère est l'intensité d'un courant électrique constant qui, maintenu dans deux conducteurs parallèles, rectilignes, de longueur infinie, de section circulaire négligeable et

Now, we assume that the ball randomly selected in urn I is placed in urn II before proceeding to the second draw.. Calculate the probability of getting 2 balls of the

As we do not really need it in the following (we just present it as a remark), we state it without proof. Let H be a separable Hilbert space with dimension greater than or equal to

– C2Ua: A uniform crossover, with the second solution where the crossover is allowed on all parameters, the polynomial mutation is realized on all esti- mated parameters according

Sample space Random variables Probability measures Distributions (laws) Sigma- algebras Probability measures (continued) References Appendices..

Results clearly showed that SIR was immediately and significantly more affected by Sb input in a non-contaminated soil than in a long-term contaminated soil, especially since

Given a linear time invariant model with its bond graph representation (Fig. 2-a) and subject to an optimal control problem with input and dissipation-based integral performance

Both approaches will lead us to consider algorithms such as (6) in which either a smooth approximation θ b of function θ will be used in both equations (6a) and (6b), or