Approximation of the pareto optimal set for multiobjective optimal control problems using viability kernels

(1)

DOI:10.1051/cocv/2013056 www.esaim-cocv.org

APPROXIMATION OF THE PARETO OPTIMAL SET FOR MULTIOBJECTIVE OPTIMAL CONTROL PROBLEMS USING VIABILITY KERNELS

Alexis Guigue

¹

Abstract. This paper provides a convergent numerical approximation of the Pareto optimal set for finite-horizon multiobjective optimal control problems in which the objective space is not necessarily convex. Our approach is based on Viability Theory. We first introduce a set-valued return functionV and show that the epigraph ofV equals the viability kernel of a certain related augmented dynamical system. We then introduce an approximate set-valued return function with finite set-values as the solution of a multiobjective dynamic programming equation. The epigraph of this approximate set-valued return function equals to the finite discrete viability kernel resulting from the convergent numerical approximation of the viability kernel proposed in [P. Cardaliaguet, M. Quincampoix and P. Saint-Pierre.

Birkhauser, Boston (1999) 177–247. P. Cardaliaguet, M. Quincampoix and P. Saint-Pierre,Set-Valued Analysis8(2000) 111–126]. As a result, the epigraph of the approximate set-valued return function converges to the epigraph ofV. The approximate set-valued return function finally provides the proposed numerical approximation of the Pareto optimal set for every initial time and state. Several numerical examples illustrate our approach.

Mathematics Subject Classification. 49M2, 49L20, 54C60, 90C29.

Received March 9, 2012. Revised November 20, 2012.

Published online October 10, 2013.

1. Introduction

Many applications, such as trajectory planning for spacecraft [7] and robotic manipulators [15], continuous casting of steel [16], fishery management [8,18], etc., involve optimal control formulations where pobjective functions (p > 1) need to be optimized simultaneously. For a General Optimization Problem (GOP) with a vector-valued objective function, the definition of an optimal solution requires the comparison between elements in the objective space, which is the set of all possible values that can be taken by the vector-valued objective function. This comparison is generally provided by a binary relation, expressing the preferences of the decision maker. In applications, it is common to consider the binary relation defined in terms of a pointed convex cone P ⊂R^p containing the origin [23]. However, in this paper, for simplicity, we will only consider the case P =R^p₊, which yields the well-known Pareto optimality. The resolution of (GOP) therefore consists of finding the set of Pareto optimal elements in the objective space, or Pareto optimal set. In general, this set cannot be

Keywords and phrases.Multiobjective optimal control, Pareto optimality, viability theory, convergent numerical approximation, dynamic programming.

1 Department of Mathematics, The University of British Columbia, Room 121, 1984 Mathematics Road, Vancouver, B.C., V6T 1Z2, Canada.aguigue@math.ubc.ca

Article published by EDP Sciences c EDP Sciences, SMAI 2013

(2)

obtained analytically and we have to resort to numerical approximations. The main objective of this paper is therefore to propose a convergent numerical approximation of the Pareto optimal set for a general ﬁnite-horizon multiobjective optimal control problem (MOC). We take care to avoid making any convexity assumption on the objective space Y (or more generally, on the setY +R^p₊). Indeed, in the case where the objective space (or Y +R^p₊) is convex, simple methods such the weighting method can be used to generate the entire Pareto optimal set [20], Theorem 3.4.4.

When the objective space is not convex, very few approaches to ﬁnd the Pareto optimal set have been proposed. An important line of research is to use evolutionary algorithms [9,11], such as genetic algorithms. Also, very recently, an approach [17] inspired by the-constraint method in nonlinear multiobjective optimization [19], pp. 85–95 has been developed for multiobjective exit-time optimal control problems where P = R^p₊. In this approach, then-dimensional state is augmented byp−1 dimensions, which yields a new single objective optimal control problem. The Pareto optimal set of the original problem can be retrieved by inspecting the values of the return function of this new augmented problem. The return function of the new problem is ﬁnally approximated by solving numerically the corresponding (n+p−1)-dimensional Hamilton–Jacobi–Bellman equation using a semi-Lagrangian “marching” method.

In this paper, instead of an exit-time optimal control problem, we consider an optimal control problem over a fixed finite horizon [0, T]. The proposed approach differs fundamentally from the one in [17]. We use Viability Theory [1,6] and the existing numerical schemes for approximating viability kernels [4,5,10]. Our first contribution is to reformulate the problem of finding the Pareto optimal set for (MOC) as the problem of determining a viability kernel. This is done by considering the epigraph of the return function V(·,·) : [0, T]×Rⁿ →2^R^p, defined as the set-valued map associating with each time t ∈[0, T] and statex∈ Rⁿ the set of Pareto optimal elements in the objective spaceY(t,x), whereY(t,x) is the set of all possible values that can be taken by the vector-valued objective function for trajectories starting at x at time t [13,14]. Hence, the Pareto optimal set for any time t and state xcan be obtained just by evaluatingV at (t,x). Our second contribution is to use the numerical schemes mentioned above to derive a convergent approximation ofV. This approximation is a set-valued map with finite set-values, called the approximate set-valued return function.

Hence, an approximation of the Pareto optimal setV(t,x) can be obtained just by evaluating the approximate set-valued return function at (t,x). The advantage of using Viability Theory is that it provides a framework that applies to problems with minimal regularity and convexity assumptions. Hence, it is expected that the proposed approach could be easily extended to more general classes of problems than the one considered in this paper,e.g., problems with state constraints,etc.

More precisely, the first step in the proposed approach is to show that the epigraph ofV,i.e., the graph of the set-valued mapV+R^p₊, coincides with the viability kernel of a certain related augmented dynamical system. The next step is to introduce an approximate set-valued return function as the solution of a multiobjective dynamic programming equation. The epigraph of this function is shown to be equal to the finite discrete viability kernel resulting from the convergent numerical approximation of the viability kernel proposed in [4,5]. From there, we easily obtain that the epigraph of the approximate set-valued return function converges in the sense of Painlevé–Kuratowski to the epigraph ofV. The multiobjective dynamic programming equation obtained is very similar to the one obtained in [12], where no proof of convergence was provided.

This paper is organized as follows. In Section2, we briefly discuss the concept of optimality in multiobjective optimization and present several useful properties related to Pareto optimal sets. In Section3, we detail the class of multiobjective optimal control problems considered and defineV. In Section4, we show that the epigraph of V equals the viability kernel of a certain related augmented dynamical system. Following [4,5], we then propose in Section5a finite discrete approximation of this viability kernel. In Section6, we show that this approximation corresponds to the epigraph of a set-valued function that satisfies a certain multiobjective dynamic programming equation. From the latter equation, we derive in Section 7 a numerical algorithm to compute this set-valued function and therefore the approximate Pareto optimal set at (t,x). Some numerical examples, for which the Pareto optimal set is analytically known, are provided in Section8 and some conclusions are finally drawn in Section9.

(3)

2. Multiobjective Optimization

For an optimization problem with a p-dimensional vector-valued objective function, the deﬁnition of an optimal solution requires a comparison between elements in the objective space, which is the set of all possible values that can be taken by the vector-valued objective function. This comparison is generally provided by a binary relation, expressing the preferences of the decision maker. In applications, it is common to deﬁne a binary preference relation in terms of a pointed convex coneP⊂R^p containing the origin [23].

Definition 2.1. Lety₁,y₂∈R^p. Then,y₁y₂ if and only ify₂∈y₁+P.

The binary relation in Deﬁnition 2.1yields the deﬁnition of generalized Pareto optimality.

Definition 2.2. Let S be a nonempty subset of R^p. An element y₁ ∈ S is said to be a generalized Pareto optimal element of S if and only if there is no y₂ ∈ S (y₂ =y₁) such that y₂ y₁, or equivalently, if and only if there is noy₂such thaty₁∈y₂+P\{0}. The set of generalized Pareto optimal elements ofS is called the generalized Pareto optimal set and is denoted byE(S, P). When P =R^p₊, the generalized Pareto optimal elements are referred to simply as Pareto optimal elements, and the setE(S, P) is denotedE(S).

An important role in this paper is played by the external stability or domination property ([21], pp. 59–66).

Definition 2.3 (External stability).A nonempty subsetS ofR^p is said to be externally stable if and only if S⊂ E(S, P) +P.

An immediate consequence of the external stability property is thatS+P =E(S, P) +P. WhenP is closed, a suﬃcient condition for a nonempty closed setS to be externally stable is given in Proposition2.5. Note that this condition is also suﬃcient to guarantee the existence of generalized Pareto optimal elements.

Definition 2.4 (Recession cone).LetSbe a nonempty subset ofR^p. The extended recession coneS⁺is deﬁned by

S⁺={y∈R^p: ∃h_k →0⁺, ∃y_k ∈S, h_ky_k→y}.

Proposition 2.5 ([21, Thm. 3.2.10]). Let S be a nonempty closed subset of R^p. If P is closed and S is P- bounded ([21], p. 52),i.e.,S⁺∩ −P ={0}, thenS is externally stable.

Corollary 2.6. Let K be a nonempty compact subset ofR^p. If P is closed, thenK is externally stable.

In the case of ﬁnite sets, which are compact and therefore externally stable from Corollary2.6, we have the two following results, which will be used to simplify the numerical algorithm proposed in Section7.

Lemma 2.7. LetS₁andS₂be two finite subsets ofR^p. LetP be closed. Then,E(S₁∪S₂, P) =E(S₁∪E(S₂), P).

Proof. Take z₁ ∈ E(S1∪S₂, P). Assume for contradiction that z₁ ∈ E(S/ 1∪ E(S2, P), P). Then, by external stability, there exists z₂ ∈ E(S₁ ∪ E(S₂, P), P) ⊂ S₁∪S₂ such that z₁ ∈ z₂+P\{0}. But this contradicts z₁∈ E(S₁∪S₂, P).

Conversely, takez₁∈ E(S₁∪ E(S₂), P).Assume for contradiction thatz₁∈ E(S/ ₁∪S₂, P).Then, by external stability, there exists z₂ ∈ E(S₁∪S₂, P)⊂S₁∪S₂ such that z₁ ∈z₂+P\{0}. Assume thatz₂ ∈/ S₂. Then, necessarilyz₂∈S₁∪ E(S₂, P). But, this contradictsz₁∈ E(S1∪ E(S₂, P), P). Assume now thatz₂∈S₂. Then, by external stability, there exists z₃ ∈ E(S₂, P) such that z₂ ∈ z₃+P\{0}. Hence, z₁ ∈ z₃+P\{0} with z₃∈S₁∪ E(S₂, P).But this again contradictsz₁∈ E(S₁∪ E(S₂, P), P).

(4)

Proposition 2.8. Let S₁, . . . , S_I be finite subsets of R^p. Let P be closed. Then,

E I

i=1

S_i, P

=E_I, whereE_I is recursively defined byE₁=E(S₁, P)and the relation

E_i+1 =E(S_i+1∪E_i, P).

Proof. We proceed by induction. For I= 1, this is by deﬁnition. Assume now that the relation holds up toI.

We aim to prove that it holds forI+ 1,i.e., E

_I+1

i=1

S_i

=E_I+1.

Apply Lemma2.7to _I

i=1S_i andS_I+1. Then, we get E

_I+1

i=1

S_i, P

=E

S_I+1∪ E _I

i=1

S_i, P

, P

.

Using the induction assumption, this yields E

_I₊₁

i=1

S_i, P

=E(S_I+1∪E_I, P) =E_I+1.

In this paper, for simplicity, we will only consider the caseP=R^p₊.

3. A multiobjective finite-horizon optimal control problem

In this paper, we will take for · in R^pandRⁿ the supremum norm. LetBbe the closed unit ball.

Consider the evolution over a ﬁxed ﬁnite time intervalI= [0, T] (0< T <∞) of an autonomous dynamical system [3] whosen-dimensional state dynamics are given by a continuous functionf(·,·) :Rⁿ×U →Rⁿ, where the control spaceU is a nonempty compact subset ofR^m. The functionf(·,u) is assumed to be Lipschitz,i.e., someK_f >0 obeys

∀u∈U, ∀x₁,x₂∈Rⁿ, f(x₁,u)−f(x₂,u) ≤K_fx₁−x₂. (3.1) We also assume that the functionf is uniformly bounded,i.e., someM_f >0 obeys

∀x∈Rⁿ, ∀u∈U, f(x,u) ≤M_f. (3.2)

A controlu(·) :I→U is a Lebesgue measurable function. The set of controls is denoted byU. The continuity of f and the Lipschitz condition (3.1) guarantee that, given anyt∈I, initial statex∈Rⁿ, and controlu(·)∈ U, the initial value problem,

x(s) =˙ f(x(s),u(s)) a.e. s∈[t, T],

x(t) =x, (3.3)

has a unique solution, called a trajectory, which we denotes→x(s;t,x,u(·)).

The cost of a trajectory over [t, T], t∈I,is given by a vector-valued functionJ(·,·,·) :I×Rⁿ× U →R^p, J(t,x,u(·)) =

_T

t L(x(s;x,u(·)),u(s)) ds, (3.4)

(5)

where the vector-valued functionL(·,·) :Rⁿ×U →R^p, called the running cost, is assumed to be continuous.

For simplicity, no terminal cost is included in (3.4). We assume that the functionLis uniformly bounded,i.e., someM_L≥0 obeys

∀x∈Rⁿ, ∀u∈U, L(x,u) ≤M_L, (3.5)

and that the functionL(·,u) satisﬁes a Lipschitz condition,i.e., someK_L≥0 obeys

∀u∈U, ∀x₁,x₂∈Rⁿ, L(x₁,u)−L(x₂,u) ≤K_Lx₁−x₂. (3.6) The objective spaceY(t,x) for (MOC) is deﬁned as the set of all possible costs for all possible controls:

Y(t,x) =

J(t,x,u(·)),u(·)∈ U

.

From (3.5), it follows that the setY(t,x) is bounded (byM_LT), and also thatY(t,x)⊂ {−(T−t)M_L1}+R^p₊. However, the setY(t,x) is not necessarily closed.

The set-valued return function V(·,·) : I×Rⁿ →2^R^p for (MOC) is deﬁned as the set-valued map which associates with each time t ∈ I and initial statex ∈ Rⁿ the set of Pareto optimal elements in the objective spaceY(t,x):

V(t,x) =E(cl(Y(t,x))). (3.7) The closure in (3.7) is used to guarantee the existence of Pareto optimal elements [14], Proposition 3.5. Hence,

∀t∈I, ∀x∈Rⁿ, V(t,x)=∅.

Remark 3.1. Whenp= 1, (3.7) takes the form V(t,x) =

u(·)∈Uinf _T

t L(x(s;t,x,u(·)),u(s)) ds

.

Hence,V(t,x) ={v(t,x)},wherev(·,·) is the value function for single objective optimal control problems [3,22].

Finally, asV(t,x)⊂cl(Y(t,x)), we have

V(t,x)⊂ {−(T−t)M_L1}+R^p₊. (3.8)

The objective of this paper is to ﬁnd a convergent approximation to the Pareto optimal setV(0,x₀) where x₀∈Rⁿ is some given initial state.

4. Characterization of the set-valued return function

In this section, we show that the epigraph of the set-valued return functionV,i.e., the graph of the set-valued mapV +R^p₊, is equal to the viability kernel of a certain related augmented dynamical system.

Deﬁne two set-valued maps FL⁻ and FL⁺ from Rⁿ toRⁿ×R^p by FL⁻(x) = co

u∈U

{(f(x,u),−L(x,u))}

,

and

FL⁺(x) = co

u∈U

{(f(x,u),L(x,u))}

,

where co(S) denotes the closure of the convex hull of the set S. Observe that these two set-valued maps take convex compact nonempty values. Moreover, they are bounded by M_FL = max{M_f, M_L} and Lipschitz

(6)

with Lipschitz constantK_FL= max{K_f, K_L}. Deﬁne also an expanded set-valued map fromR×Rⁿ×R^p to R×Rⁿ×R^pby

φ(t,x,z) =

{1} ×FL⁻(x) ift < T,

[0,1]×co(FL⁻(x)∪ {(0,0)}) ift≥T. (4.1) It is easy to see thatφis a Marchaud map ([4], Def. 2.2) bounded by max{1, M_FL}.

Consider now the diﬀerential inclusion

( ˙t(s),x(s),˙ ˙z(s))∈φ(t(s),x(s),z(s)) a.e. s≥0, (4.2) and the closed setHby

H={(t,x,z) : t∈[0, T], x∈Rⁿ,z∈ {−(T−t)M_L1}+R^p₊}.

Proposition 4.1. The epigraph of the set-valued mapV equals the viability kernel ofHfor φ,i.e., Epi(V) = Viab_φ(H),

whereEpi(V), by definition,Epi(V) = Graph(V +R^p₊).

Proof. First, we prove the inclusion

Viab_φ(H)⊂Graph(V +R^p₊).

Take (t,x,z)∈Viab_φ(H). Of course (t,x,z)∈ H, so t∈I andz∈ {−(T−t)M_L1}+R^p₊.

• Assume thatt=T. Then, asV(T,x) ={0}, it follows thatz∈R^p₊=V(T,x) +R^p₊.

• Assume thatt∈[0, T). Let (t(·),x(·),z(·)) be a solution to (4.2) with initial condition (t,x,z) which remains inH. By deﬁnition ofφ, (x(·),z(·)) is a solution to the diﬀerential inclusion

⎧⎨

⎩

(˙x(s),˙z(s))∈FL⁻(x(s)) a.e. s∈[0, T −t], x(0) =x,

z(0) =z,

while t(s) = s+t. Let s =s+t. For s ∈ [t, T], define x(s) =x(s −t) and z(s) = z(s −t). Then, (x(·),z(·)) satisfies the differential inclusion

⎧⎨

⎩

(x˙(s),˙z(s))∈FL⁻(x(s)) a.e. s∈[t, T], x(t) =x,

z(t) =z,

By the Relaxation Theorem [22], Theorem 2.7.2, for each ﬁxed >0, there existsu(·)∈ U such that x(·)−x(·;t,x,u(·)) ≤andz(·)−z(·;t,(x,z),u(·)) ≤

(the initial state for the evolving vector (x(·),z(·)) is (x,z), but we have simpliﬁed the notation by noting only the relevant components for each block).

In particular, we get

z(T)≤z(T;t,(x,z),u(·)) +1.

Moreover, as (s,x(s),z(s))∈ Hfor alls ∈[t, T],we have

z(s)∈ {−(T−s)M_L1}+R^p₊, or

z(T)∈R^p₊.

(7)

Hence,

z(T;t,(x,z),u(·)) =z− _T

t L(x(s;t,x,u(·)),u(s)) ds≥z(T)−1≥ −1, or

z+1≥ _T

t L(x(s;t,x,u(·)),u(s)) ds.

Thereforez+1∈Y(t,x) +R^p₊ for each >0. This implies that z∈cl(Y(t,x)) +R^p₊ =V(t,x) +R^p₊ by external stability.

Second, we prove the inclusion

Graph(V +R^p₊)⊂Viab_φ(H).

Take (t,x,z)∈Graph(V +R^p₊). Hence,t∈[0, T] and from (3.8),z∈ {−(T−t)M_L1}+R^p₊.

• Assume thatt=T. Thenz∈R^p₊, so (t(·),x(·),z(·)) = (T,x,z) is a solution to (4.2) viable inH.

• Assume thatt∈[0, T). We havez=z+d, wherez∈V(t,x) andd∈R^p₊. By deﬁnition of V(t,x), there exists a sequenceu_n(·)∈ U such that

n→+∞lim _T

t L(x_n(s;t,x,u_n(·)),u_n(s)) ds=z. (4.3) Using the Compactness of Trajectories Theorem [22], Theorem 2.5.3 and by passing to a subsequence if necessary, we can assume that there exists (x(·),z(·)) solution on [t, T] to the diﬀerential inclusion

(˙x(s),˙z(s))∈FL⁺(x(s)) a.e. s ∈[t, T] with initial condition (x,0) such that

n→+∞lim x_n(·;t,x,u_n(·))−x(·)= 0, and

n→+∞lim z_n(·;t,(x,0),u_n(·))−z(·)= 0. (4.4) From (4.3) and (4.4), we deduce thatz(T) =z.

Deﬁne now (x(·),z(·)) as follows:

(t(s),x(s),z(s)) =

(s+t,x(s+t),z−z(s+t))s∈[0, T−t], (T,x(T),d) s > T−t.

Asz(T−t) =z−z(T) =z−z=d,it follows that (t(·),x(·),z(·)) is a solution of (4.2). It remains to check that (t(s),x(s),z(s))∈ Hfor alls≥0. For s > T−t, as d∈R^p₊, this is obvious. For s∈[0, T −t], using the deﬁnition ofz, (4.3), and (4.4), we have

z(s) = lim

n→+∞

_T

t L(x_n(s;t,x,u_n(·)),u_n(s)) ds+d−z_n(s+t;t,(x,0),u_n(·)).

As

z_n(s+t;t,(x,0),u_n(·)) = _s+t

t L(x_n(s;t,x,u_n(·)),u_n(s)) ds, we get

z(s) =d+ lim

n→+∞

_T

s+tL(x_n(s;t,x,u_n(·)),u_n(s)) ds.

(8)

As _T

s+tL(x_n(s;t,x,u_n(·)),u_n(s)) ds≥ −(T−(s+t))M_L1

and d∈R^p₊,we ﬁnally obtain z(s)≥ −(T−t(s))M_L1. Hence, (t(s),x(s),z(s))∈ H and (t(·),x(·),z(·)) is

viable inH.

Remark 4.2. Proposition4.1remains valid if we take for Hthe closed set{(t,x,z) : t∈[0, T], x∈Rⁿ, z∈ {−(T−t)M_L1}+R^p₊}, whereM_L> M_L. This remark will be used in Section6.

5. Approximation of Viab

φ

(H)

In this section, we approximate Viab_φ(H) by finite discrete viability kernels. This is done two stages. First we replace the original differential inclusion system (4.2) by a finite difference inclusion system defined by a certain set-valued map G. Then we associate with G a suitable finite set-valued map Γ_,h defined on finite sets. The finite discrete viability kernel for this set-valued map approximates Viab_φ(H).

5.1. Approximation of Viab

_φ

( H ) by discrete viability kernels

Theorem 2.14 in [4] shows how Viab_φ(H) can be approximated by discrete viability kernels in the sense of Painlev´e–Kuratowski by considering an approximationφofφsatisfying the following three properties:

(H₀) φ is an upper semicontinuous set-valued map from R×Rⁿ×R^p to R×Rⁿ×R^p whose values are nonempty convex compact sets.

(H₁)

Graph(φ)⊂Graph(φ) +g()Band lim

→0⁺g() = 0⁺. (H₂)

∀(t,x,z)∈R×Rⁿ×R^p,

(t,x,z)−(t,x,z)≤M

φ(t,x,z)⊂φ(t,x,z), whereM = max{1, MFL} denotes a bound forφand >0 is the time step discretization.

We now demonstrate that these conditions hold for the set-valued map φ, > 0, from R×Rⁿ×R^p to R×Rⁿ×R^pdeﬁned by

φ(t,x,z) =

{1} ×(FL⁻(x) +KMB), if t < T−M ,

[0,1]×co((FL⁻(x) +KMB)∪ {(0,0)}),if t≥T−M . (5.1) Theorem 5.1. The set-valued mapφ satisfies (H₀),(H₁), and (H₂).

Proof.

(H₀) This follows from the properties of the set-valued map FL⁻ and the fact that FL⁻(x) +KMB ⊂ co((FL⁻(x) +KMB)∪ {(0,0)}).

(H₁) This relation holds withg() =max{1, K}M.To see this, let (t,x,z)∈R×Rⁿ×R^pand (s,f,l)∈ φ(t,x,z).Two cases arise:

• t< T−M .Heres= 1 and (f,l)∈FL⁻(x) +KMB,sog() =KM.

• t ≥T−M .Here we have s∈[0,1] and (f,l)∈co((FL⁻(x) +KMB)∪ {(0,0)}).There exists t≥T such that|t−t| ≤M .Thus

(f,l)∈ co((FL⁻(x) +KMB)∪ {(0,0)}),

⊂co(FL⁻(x)∪ {(0,0)}) +KMB.

The choiceg() =max{1, K}M covers both alternatives.

(9)

(H₂) Let (t,x,z)∈R×Rⁿ×R^p. Take (t,x,z)∈R×Rⁿ×R^p such that(t,x,z)−(t,x,z) ≤M . In particular,x−x ≤M . Two cases arise:

• t< T−M .This impliest < T. The Lipschitz property of FL⁻ yields:

φ(t,x,z) ={1} ×FL⁻(x)⊂ {1} ×(FL⁻(x) +KMB) =φ(t,x,z).

• t≥T−M .If, in addition,t < T,then as above,

φ(t,x,z) ={1} ×FL⁻(x)⊂ {1} ×(FL⁻(x) +KMB)⊂φ(t,x,z).

Alternatively, supposet≥T.Then, using the Lipschitz property of FL⁻, we have FL⁻(x)∪ {(0,0)} ⊂(FL⁻(x) +KMB)∪ {(0,0)}. Hence,

φ(t,x,z) = [0,1]×co(FL⁻(x)∪ {(0,0)})⊂φ(t,x,z).

5.2. Approximation of Viab

_φ

( H ) by discrete finite viability kernels

Theorem 2.19 in [4] goes on to show how Viab_φ(H) can be approximated by ﬁnite discrete viability kernels in the sense of Painlev´e–Kuratowski by considering an approximationΓ_,h ofG(·) satisfying the following two properties:

(H₃)

Graph(Γ_,h)⊂Graph(G) +ψ(, h)Band lim

→0⁺,^h→0⁺

ψ(, h) = 0⁺. (H₄) ∀(th,x_h,z_h)∈R_h×Rⁿ_h×R^p_h,

(t,x,z)−(th,xh,zh)≤h

(G(t,x,z) +hB)∩R_h×Rⁿ_h×R^p_h⊂Γ_,h(t_h,x_h,z_h),

whereh >0 is the state step discretization,R_h={kh, k∈Z}, andGis the set-valued map fromR×Rⁿ×R^p toR×Rⁿ×R^p deﬁned by

G(t,x,z) ={(t,x,z)}+φ(t,x,z).

We apply this result to the set-valued mapΓ_,h fromR_h×Rⁿ_h×R^p_h toR_h×Rⁿ_h×R^p_h deﬁned as follows:

• If t_h< T −M −h,Γ_,h(t_h,x_h,z_h) =

[t_h+−2h, t_h++ 2h]∩R_h×({(x_h,z_h)}+FL⁻(x_h) +α_,hB)∩Rⁿ_h×R^p_h. (5.2)

• If t_h≥T−M −h,Γ_,h(t_h,x_h,z_h) =

[t_h, t_h++ 2h]∩R_h×co(({(x_h,z_h)}+FL⁻(x_h) +α_,hB)∪({(x_h,z_h)}+ 2hB))∩Rⁿ_h×R^p_h, (5.3) whereα_,h= 2h+hK+²KM.

We assume that >2h.

Theorem 5.2. The set-valued mapΓ_,hsatisfies (H₃) and (H₄).

Proof.

(10)

(H₃) This relation holds withψ(, h) = 4h+hK,which veriﬁes

→0⁺lim,^h→0⁺

(4 +K)h = 0⁺.

Let (t_h,x_h,z_h)∈R_h×Rⁿ_h×R^p_hand (s_h,f_h,l_h)∈Γ_,h(t_h,x_h,z_h).Two cases arise:

• t_h< T −M −h.Then,s_h∈[t_h+−2h, t_h++ 2h]∩R_h⊂t_h++ [−2h,2h], and (f_h,l_h)∈({(xh,z_h)}+FL⁻(x_h) +α_,hB)∩Rⁿ_h×R^p_h,

⊂ {(x_h,z_h)}+FL⁻(x_h) +α_,hB,

={(xh,z_h)}+(FL⁻(x_h) +KMB) + (2h+hK)B.

Hence, (s_h,f_h,l_h)∈(t_h,x_h,z_h)+φ(t_h,x_h,z_h)+max{2h,2h+hK}B=G(t_h,x_h,z_h)+(2h+hK)B.

• t_h≥T−M −h.We haves_h∈[t_h, t_h++ 2h]∩R_hand

(f_h,l_h)∈co(({(x_h,z_h)}+FL⁻(x_h) +α_,hB)∪({(x_h,z_h)}+hB))∩Rⁿ_h×R^p_h. There existst≥T−M such that 0≤t−t_h≤h.We have

s_h ∈[t_h, t_h++ 2h]∩R_h,

⊂[t_h, t_h++ 2h],

⊂t+[0,1] + [−2h,2h].

Moreover,

(f_h,l_h)∈co(({(xh,z_h)}+FL⁻(x_h) +α_,hB)∪({(xh,z_h)}+ 2hB))∩Rⁿ_h×R^p_h,

⊂co(({(x_h,z_h)}+FL⁻(x_h) +α_,hB)∪({(x_h,z_h)}+ 2hB)),

={(xh,z_h)}+ co((FL⁻(x_h) +α_,hB)∪({(0,0)}+ 2hB)),

={(x_h,z_h)}+ co(((FL⁻(x_h) +KMB) + (2h+hK)B)∪({(0,0)}+ 2hB)),

⊂ {(x_h,z_h)}+ co(((FL⁻(x_h) +KMB))∪ {(0,0)}) + (4h+hK)B.

Hence, (s_h,f_h,l_h)∈(t,x_h,z_h)+φ(t,x_h,z_h)+max{4h+hK,2h}B=G(t,x_h,z_h)+(4h+hK)B.

(H₄) Let (t_h,x_h,z_h)∈R_h×Rⁿ_h×R^p_h. Take (t,x,z)∈R×Rⁿ×R^psuch that(t,x,z)−(t_h,x_h,z_h) ≤h.

In particular,|t−t_h| ≤handx−x_h ≤h. Two cases arise:

• t_h< T −M −h.This impliest< T−M . The Lipschitz property of FL⁻ yields:

G(t,x,z) +hB= (t,x,z) +φ(t,x,z) +hB,

= (t++ [−h, h])×({(x,z)}+FL⁻(x) +²KMB+hB),

⊂(t_h++ [−2h,2h])×({(x_h,z_h)}+hB+FL⁻(x_h) +hKB+²KM +hB),

= (t_h++ [−2h,2h])×({(x_h,z_h)}+FL⁻(x_h) +α_,hB).

Hence,

(G(t,x,z) +hB)∩R_h×Rⁿ_h×R^p_h⊂Γ_,h(t_h,x_h,z_h).

• t_h≥T−M −h.If, in addition,t< T−M ,then as above,

G(t,x,z) +hB⊂(t_h++ [−2h,2h])×({(x_h,z_h)}+FL⁻(x_h) +α_,hB).

(11)

As >2h,

t_h++ [−2h,2h]⊂[t_h, t_h++ 2h].

Moreover,

{(x_h,z_h)}+FL⁻(x_h)+α_,hB⊂co(({(x_h,z_h)}+FL⁻(x_h) +α_,hB)∪({(x_h,z_h)}+ 2hB)). Hence,

Alternatively,t≥T−M . In this case, using the Lipschitz property of FL⁻, we have G(t,x,z) +hB= (t,x,z) +([0,1]×co((FL⁻(x) +KMB)∪ {(0,0)})) +hB,

= ([t−h, t++h])×(co({(x,z) +FL⁻(x) +²KMB} ∪ {(x,z)}) +hB).

Now,

co({(x,z) +FL⁻(x) +²KMB} ∪ {(x,z)}) +hB,

⊂co({(x_h,z_h) +hB+FL⁻(x_h) +Kh+²KMB} ∪({(x_h,z_h)}+hB)) +hB,

⊂co({(x_h,z_h) +FL⁻(x_h) +α_,hB} ∪({(x_h,z_h)}+ 2hB)).

Hence,

6. Convergent approximation of the set-valued return function V

In this section, we first introduce a sequence of approximate set-valued return functions with finite set-values recursively defined by a multiobjective dynamic programming equation [13,14]. We then show that the epigraphs of these approximate set-valued return functions are equal to the sets involved in the calculation of the finite discrete viability kernels of the discrete setHh= (H+hB)∩R_h×Rⁿ_h×R^p_hfor the finite discrete dynamicsΓ_,h ([4], Prop. 2.18). This allows us to conclude that the sequence of approximate set-valued return functions is finite and that the epigraph of the final approximate set-valued return function of this sequence converges to the epigraph of the set-valued return functionV in the sense of Painlevé–Kuratowski.

Recall the deﬁnition ofH={(t,x,z) : t∈[0, T], x∈Rⁿ, z∈ {−(T−t)M_L1}+R^p₊}. Here, we takeM_L>

M_L (Rem.4.2) in the deﬁnition ofH. Hence,H={(t,x,z) : t∈[0, T], x∈Rⁿ, z∈ {−(T−t)M_L1}+R^p₊}.

LetI_h= (I+ [−h, h])∩R_h. We deﬁne the ﬁnite-valued set-valued mapV_,h⁰ fromI_h×Rⁿ_h toR^p_h such that Graph(V_,h⁰ +R^p_h,+) =H_h,

whereHh= (H+hB)∩R_h×Rⁿ_h×R^p_h.We now recursively deﬁne the ﬁnite set-valued mapsV_,h^k , k≥1,from I_h×Rⁿ_h toR^p_h as follows:

• If t_h< T −M −h, V_,h^k+1(t_h,x_h) = E

(l+α_,hB)∩R^p_h+V_,h^k ((t_h++ [−2h,2h])∩R_h,(x_h+f +α_,hB)∩Rⁿ_h) : (f,l) ∈ FL⁺(x_h)

. (6.1)

• Otherwise,

V_,h^k+1(t_h,x_h) =V_,h^k (t_h,x_h). (6.2)