Control Synthesis for Stochastic Switched Systems using the Tamed Euler Method

(1)

HAL Id: hal-01670579

https://hal.archives-ouvertes.fr/hal-01670579

Preprint submitted on 21 Dec 2017

HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.

Control Synthesis for Stochastic Switched Systems using the Tamed Euler Method

Adrien Le Coënt, L Fribourg, J Vacher

To cite this version:

Adrien Le Coënt, L Fribourg, J Vacher. Control Synthesis for Stochastic Switched Systems using the Tamed Euler Method. 2017. �hal-01670579�

(2)

Control Synthesis for Stochastic Switched Systems using the Tamed Euler Method

A. Le Coënt¹, L. Fribourg², and J. Vacher¹ CMLA,¹LSV,²CNRS & ENS Paris-Saclay & INRIA ABSTRACT

In this paper, we explain how, under the one-sided Lipschitz (OSL) hypothesis, one can find an error bound for a variant of the Euler- Maruyama approximation method for stochastic switched systems.

We then explain how this bound can be used to control stochastic switched switched system in order to stabilize them in a given re- gion. The method is illustrated on several examples of the literature.

1 INTRODUCTION

Control synthesis for stochastic switched systems has been recently explored using the construction ofapproximately bisimilar sym- bolicmodels. This approach relies on the hypothesis ofincremental stabilityof the stochastic switched system (or existence of a common/multiple Lyapunov function) [8–10], which concerns only a small part of the real systems.

Here we address the problem of control synthesis in a more general setting. We do not use the construction of approximatively bisimilar symbolic models, but the (tamed) Euler method [3] for stochastic switched systems. We thus follow the lines of previous work on control synthesis for deterministic switched systems [1].

Unlike [8–10], the Euler-based method requires neither state-space discretization nor input set discretization, thus avoiding a source of combinatorial explosion.

The correctness of these Euler-based methods does not rely on the hypothesis of incremental stability as in [8, 10], but on the hypothesis of ‘one-sided Lipschitz (OSL)’ condition with constant λP R^d (also called ‘monotonicity’/‘dissipativity’, see [7]). It can be seen that if a stochastic switched system satisfies an OSL condition withλ ă 0, then the functionVpx,x¹q “ }x´x¹}²is a common incremental Lyapunov functionin the sense of [9], from which it follows that the switched system is incrementally stable, and can be treated by approximate bisimulation. However, Euler- based methods also apply when the system is not incrementally stable, in which case the constantλis necessarily positive.

The plan of the paper is as follows: In Section 2, we give an explicit upper bound on the mean square error of the tamed Euler method for SDEs under OSL condition. We apply the result in order to ensure properties of stochastic switched systems, such as “pR,Sq- stability” (Section 3). We conclude in Section 4.

2 BOUNDING THE ERROR OF THE TAMED EULER METHOD

2.1 Assumptions

The symbol} ¨ }denotes the Euclidean norm onR^d.The symbol x¨,¨ydenotes the scalar product of two vectors ofR^d. Given a point x PR^dand a positive realrą0, the ballBpx,rqof centrexand radiusris the settyPR^d | }x´y} ďru.

Letτ P p0,8qbe a fixed real number, letpΩ,F,Pqbe a prob- ability space with normal filtrationpF_tq_tPr0,τs, letd,mP N:“ t1,2, . . .uletW “ pW^p¹^q, . . . ,W^pmqq :r0,Rs ˆΩ ÑR^m be an m-dimensional standardpWtq_tPr0,τs-Brownian motion and letx0: ΩÑR^dbe anF0{BpR^dq-measurable mapping withEr}x0}^ps ă 8for allp P r1,8q. Moreover, let f : R^d ÑR^d be a continu- ously differentiable and globally one-sided Lipschitz continuous function whose derivative grows at most polynomially and let д“ pдi,jq_iPt1, ...,du,jPt1, ...,mu :R^d ÑR^dˆm be a globally Lips- chitz continuous function.

Then consider the Stochastic Differential Equations (SDE):

dXt“fpXtqdt`дpXtqdWt, X⁰“x⁰ (1) fortP r0,τs. The drift coefficientf is the infinitesimal mean of the processXand the diffusion coefficientдis the infinitesimal standard deviation of the processX. Under the above assumptions, the SDE (1) is known to have a unique strong solution. More formally, there exists an adapted stochastic processX :r0,τs ˆΩ ÑR^d with continuous sample paths fulfilling

Xt,x0“x⁰` ż_t

0

fpXsqds` ż_t

0

дpXsqdWs

for allt P r0,τsP-a.s. (see, e.g., [6]).

We denote byXt,x0the solution of Equation (1) at timetfrom initial conditionX0,x0“x0P-a.s., in whichx0is a random variable that is measurable inF0.

We suppose thatf behaves polynomially andдis Lipschitz, i.e.:

there exist constantsDPRě0,qPNandLдPRě0such that, for allx,yPR^d

}fpxq ´fpyq}²ďD}x´y}²p1` }x}^q` }y}^qq (H1) }дpxq ´дpyq} ďLд}x´y} (H2) We also assume that the SDE (1) satisfies the following one-sided Lipschitz (OSL) condition with constantλPR:

DλPR@x,yPR^d: xfpyq ´fpxq,y´xy ďλ}y´x}² (H3) Remark 1. Constantsλ,LдandDcan be computed using (con- strained) optimization algorithms (see [1]).

2.2 Tamed Euler approximation

The standard way to extend the classical Euler method for ordi- nary differential equations to the SDE (1) is the Euler-Maruyama scheme [4]. More precisely, givenz : Ω Ñ R^d anF0{BpR^dq- measurable mapping withEr}z}^ps ă 8for allp P r1,8q, the explicit Euler-Maruyama (EM) method for the SDE (1) is given by the mappingsY_n,z^N : Ω Ñ R^d,n P t0,1, . . . ,Nu, which satisfy Y0^N,z “zand

Y_n`^N1,z“Y_n,z^N ` τ

N ¨fpY_n,z^N q `дpY_n,z^N qpW_pn`1qτ{N ´W_nτ_{Nq

(3)

for allnP t0,1, . . . ,N´1uand allNPN. See [4]. Unfortunately, the convergence results for the EM scheme does not hold when the drift functionf of the SDE (1) behaves polynomially (and not linearly). For the sake of generality, we will now adopt a refined scheme, which has been proposed recently in order to overcome this difficulty [3]. LetX_n,z^N :ΩÑR^d,

X_n`^N 1,z“X_n,z^N `

Nτ ¨fpX^N_n,zq

1`_N^τ ¨ }fpX_n,z^N q}`дpX_n,z^N qpWpn`1qτ

N ´W^nτ

Nq (2) for allnP t0,1, . . . ,N´1uand allN PN. We refer to the numerical method (2) as atamed Euler scheme[3]. In this method the drift term^τ_n ¨fpX^N_n,zqis “tamed” by the factor 1{p1`_N^τ ¨ }fpX^N_n,zq}q fornP t0,1, . . . ,N´1uandNPNin (2).

A time continuous interpolation of the time discrete numerical approximations (2) is also introduced in [3] as follows. LetX˜_z^N : r0,τs ˆΩ ÑR^d,N PN, be a sequence of stochastic processes given by

X˜_t,z^N “X˜_n,z^N `pt´nτ{Nq ¨fpX˜_n,z^N q

1`τ{N¨ }fpX˜_n,z^N q} `дpX˜_n,z^N qpWt´W^nτ_Nq for alltP r^nτ_N,^pn`_N¹^qτs,nP t0,1. . . ,N´1uand allN PN. Note thatX˜_t,z^N :r0,τs ˆΩÑR^dis an adapted stochastic process with continuous sample paths for everyNPN.

Let us defineX^N_t,zby

X^N_t,z :“X_n,z^N fortP rnτ

N,pn`1qτ N q.

Note thatX˜_t,z^N “X_t,z^N “X_n,z^N at timet“^nτ_N fornP t0,1, . . . ,Nu. The following theorem is proven in [3]:

Theorem 1. [3] SupposepH1qpH2qpH3q. Let the setting in this section be fulfilled, andz :ΩÑR^dbe anF0{BpR^dq-measurable mapping withEr}z}^ps ă 8for allp P r1,8q. Then, for allp P r1,8q

sup

NPN

sup

nPt0,1, ...,NuEr}X^N_n,z}^ps ă 8

For the sake of simplicity, the numberNof subsampling steps is now left implicit. From Theorem 1, it follows (cf. Lemma 4.3, [2]):

Lemma 2. SupposepH1qpH2qpH3q. Let the setting in this section be fulfilled, andz:ΩÑR^dbe anF0{BpR^dq-measurable mapping withEr}z}^ps ă 8for allpP r1,8q. Then, for any even integerrě2, there exist two constantsEr,zandFr,zsuch that

sup

0ďtďτE}X_t,z´X˜t,z}^r ď p∆tq^r²pEr,zp∆tq^r²`Fr,zdq.

with∆_t“τ{Nand:

Er,z“2^rp}fp0q}^r `D2

r`1 2

p1`Esup

0ďtďτ}X_t,z}^qrq

1 2pEsup

0ďtďτ}X_t,z}²^rq

1 2q, Fr,z “2^rp}дp0q}²^r`L^r_дEsup

0ďtďτ}X_t,z}^r²q.

Proof. see Appendix.

□

Remark 2. ConstantsEr,zandFr,zare computed using the con- stantsλandLд(see Remark 1), and the expected values ofX_t,zat each timet “0,∆t,2∆t, . . . ,N∆t. These expected values are computed using a Monte Carlo method (by averaging here the value of10⁴ samplings).

2.3 Mean square error bounding

The following Theorem holds for SDE (1). This corresponds to a stochastic version of Theorem 1 of [1], showing that a similar result holds on average, using the tamed Euler method of [3]. It is an adaptation of Theorem 4.4 in [2].

Theorem 3. Given the SDE system(1)satisfying (H1)-(H2)-(H3). Letδ0PRě0. Suppose thatzis a random variable onR^d such that

Er}x⁰´z}²s ďδ0². Then, we have, for allτě0:

Er sup

0ďtďτ}Xt,x0´X˜t,z}²s ďδ_τ,δ²

0,

withδ_τ,δ²

0

:“βpτqe^{γ τ}, where:

γ “2p?

∆_t`2λ`L²_д`128L⁴_дq, and βpτq “2δ0²

`2τ∆_tL²_дp1`128L²_дqpF²,zd`E²,z∆_tq

`4τa

∆tDpF4,zd`E4,z∆²_tq

1 2

p1`4E sup

0ďtďτ}X_t,z}²^q`4E sup

0ďtďτ}X˜t,z}²^qq

1 2. (3) with∆_t “τ{N.

Proof. The proof closely follows the proof of Theorem 4.4 in [2].

Letet “Xt,x0´X˜t,z. We have, for all 0ďtďτ:

det “ pfpXt,x0q ´fpzqqdt` pдpXt,x0q ´дpzqqdWt. (4) Then, by using Equation (4) and the integral version of Itô formula applied to functionx ÞÑ }x}²we obtain

}e_t}²“ }e⁰}²`

ż_t

0

2xes, fpXs,x0q ´fpX_s,zqyds

` ż_t

0

}дpXs,x0q ´дpX_s,zq}²ds`Mptq,

(5)

wheree⁰“x⁰´z, and Mptq “

ż_t

0

2xes,дpXs,x0q ´дpX_s,zqydWs. So we have using (H2):

2

(4)

}et}²ď }e⁰}²`

ż_t

0

2xes, fpXs,x0q ´fpX˜s,zqyds

`L²_д ż_t

0

}Xs,x0´X_s,z}²ds

` ż_t

0

2xes,fpX˜s,zq ´fpX_s,zqyds`Mptq.

(6)

So we have using (H3) and Young’s inequality:

}et}²ď }e⁰}²`

ż_t

0

p2λ}es}²`L²_д}es}²qds

`L²_д ż_t

0

}X_s,z´X˜s,z}²ds

` ż_t

0

p 1

?∆t}fpX˜s,zq ´fpX_s,zq}²`a

∆t}es}²qds

`Mptq.

(7) So we have using (H1), for all 0ďtďτ:

}et}²ď

}e⁰}²` pa

∆t`2λ`L²_дq ż_t

0

}es}²ds

`L²_д ż_t

0

}X_s,z´X˜s,z}²ds

` D

?∆t

ż_t

0

p1` }X_s,z}^q` }X˜s,z}^qq}X_s,z´X˜s,z}²ds

`Mptq.

(8) It follows using Lemma 2 forr “2, and Cauchy-Schwarz inequality:

Er sup

0ďsďt}es}²s ď

E}e⁰}²` pa

∆t`2λ`L²_дq ż_t

0

E}es}²ds

`L²_дτ∆tpE²,z∆t`F²,zdq

` D

?∆_t ż_t

0

pEp1` }X_s,z}^q` }X˜s,z}^qq²q

1

2pE}X_s,z´X˜s,z}⁴q

1 2ds

`mptq,

(9) where

mptq “Er sup

0ďsďt}Mpsq}s.

Hence, using using Lemma 2 forr“4, and inequalitypa`bq^r ď 2^rpa^r `b^rq:

Er sup

0ďsďt}es}²s ď

E}e⁰}²` pa

∆_t`2λ`L²_дqq ż_t

0

E}es}²ds

`L²_дτ∆tpE²,z∆t`F²,zdq

`2Dτa

∆tpE4,z∆²_t`F4,zdq¹² p1`4E sup

1 2

`mptq.

(10) On the other hand, from the Burkholder-Davis-Gundy inequality, we get:

mptq ď16Er ż_t

0

}es}²}дpXs,x0q ´дpX_s,zq}²dss¹² Hence, using (H2):

mptq ď16L²_дEr sup

0ďsďt}es}² ż_t

0

}Xs,x0´X_s,z}²dss¹² Then, using Young’s inequality (for anyαą0):

mptq ď8L²_дpαEr sup

0ďsďt}es}²s ` 1 αEr

ż_t

0

}Xs,x0´X_s,z}²dssq.

Hence, by using Lemma 2 forr“2:

mptq ď8αL²_дEr sup

0ďsďt}es}²s

` 8L²_д

α ż_t

0

Er sup

0ďrďs}er}²sds

` 8L²_д

α τ∆tpE2,z∆t`F2,zdq.

(11) Hence, lettingα“₁₆¹_L2

д

, we have by replacing in (10):

1 2Er sup

0ďsďt}es}²s ď δ0²

` pa

∆t`2λ`L²_д`128L⁴_дq ż_t

0

Er sup

0ďrďs}er}²sds

`τpL²_д`128L⁴_дq∆_tpE²,z∆_t`F²,zdq

`τ2Da

∆tpE4,z∆²_t`F4,zdq¹² p1`4E sup

1 2. (12) It results from Gronwall’s inequality:

Er sup

0ďtďτ}e_t}²s “βpτqe^{γ τ}, with

3

(5)

γ“2p?

∆t`2λ`L²_д`128L⁴_дq, and βpτq “2δ0²

`2τp∆tL²_дp1`128L²_дqpF²,zd`E²,z∆tq

`4τa

∆tDpF4,zd`E4,z∆²_tq

1 2

p1`4E sup

1 2. (13)

□

It follows from Theorem 3 and Jensen’s inequality:

Proposition 1. Consider two pointsx⁰andzofR^d,and a positive real numberδ0. Suppose thatx0PBpz,δ0q(i.e.}x0´z} ďδ0). Then EXt,x0PBpX˜t,z,δ_t,δ₀qfor allt P r0,τs.

It also follows from Theorem 3:

Proposition 2. In the setting of Theorem 3, the expressionδ_τ,δ0

tends to

δ⁰?

2e²^λτ^`L^д²^`¹²⁸^L^д⁴ when∆t tends to 0 (i.e., whenNtends to8).

2.4 Implementation

This method has been implemented in the interpreted language Octave, and the experiments performed on a 2.80 GHz Intel Core i7-4810MQ CP U with 8 GB of memory. The implementation is an adaptation of the program described in [1] for controlling deterministic switched systems, but makes use of the tamed Euler scheme for SDEs (with the error functionδgiven in Theorem 3) instead of the classical Euler scheme.

Example 1. Consider the following system, corresponding to the example in Section 6.2 of [8] (cf. [9]) for modeu“1:

dx¹“ p´0.25x¹`x²`0.25qdt`0.05x¹dW_t¹ dx2“ p´2x1´0.25x2´2qdt`0.05x2dW_t²

The program gives (forτ “ 1,∆_t “ τ{10⁴):q “ 0,D “ 1.36, Lд“0.05,λ“0.25; and forz“ p´4,´3.8q:E2,z “893.3,E4,z“ 2.14¨10⁵,F²,z“0.002,F⁴,z“4.9¨10^´⁶.

Consider now the system corresponding to the example of [8] for modeu“2:

dx1“ p´0.25x1`2x2´0.25qdt`0.05x1dW_t¹ dx²“ p´x¹´0.25x²`1qdt`0.05x²dW_t²

The program gives (forτ“1,∆t “τ{10⁴):q“0,D“1.36,Lд“ 0.05,λ“0.25, and, forz“ p0,3q:E²,z “543.2,E⁴,z“7.94¨10⁴, F2,z“0.0442,F4,z“0.00178.

Both computations take less than 10 s. of CPU time. Simulations of the two systems are given in Figure 1 for modeu“1and starting pointz “ p´4,3.8q, and in Figure 2 for modeu “2and starting pointz“ p0,3q. On each figure, the initial ball (t “0) is depicted in black, the final ball (t “τ) in red, and 200 random sampling trajectories in blue fortP r0,τs.¹

1Note that, in the figures, all the end points (att“τ) of the sampling trajectories lie in the final ball, but this is not true in general; we only know by Proposition 1 that, for all starting pointx0of the initial ball, theexpected valueof the end point lie in the final ball.

-5.5 -5 -4.5 -4 -3.5 -3 -2.5 -2

-6 -4 -2 0 2 4

X1

X2

Figure 1: Example 1 with modeu “ 1,τ “ 1,∆_t “ 10^´⁴, initial ballBpz,δ0qwithz“ p´4,3.8qandδ0“0.5, final ball Bpz¹,δ_τ,δ0qwithz¹“ p´3.6,2.56qandδ_τ,δ0“1.17

-1 0 1 2 3 4 5 6

-1 0 1 2 3 4

X1

X2

Figure 2: Example 1 with modeu “ 2,τ “ 1,∆_t “ 10^´⁴, initial ballBpz,δ0qwithz “ p0,3qandδ0 “ 0.5, final ball Bpz¹,δ_τ,δ0qwithz¹“ p0.79,´0.63qandδ_τ,δ0“1.17

4