• Aucun résultat trouvé

Control Synthesis for Stochastic Switched Systems using the Tamed Euler Method

N/A
N/A
Protected

Academic year: 2021

Partager "Control Synthesis for Stochastic Switched Systems using the Tamed Euler Method"

Copied!
9
0
0

Texte intégral

(1)

HAL Id: hal-01670579

https://hal.archives-ouvertes.fr/hal-01670579

Preprint submitted on 21 Dec 2017

HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.

Control Synthesis for Stochastic Switched Systems using the Tamed Euler Method

Adrien Le Coënt, L Fribourg, J Vacher

To cite this version:

Adrien Le Coënt, L Fribourg, J Vacher. Control Synthesis for Stochastic Switched Systems using the Tamed Euler Method. 2017. �hal-01670579�

(2)

Control Synthesis for Stochastic Switched Systems using the Tamed Euler Method

A. Le Coënt1, L. Fribourg2, and J. Vacher1 CMLA,1LSV,2CNRS & ENS Paris-Saclay & INRIA ABSTRACT

In this paper, we explain how, under the one-sided Lipschitz (OSL) hypothesis, one can find an error bound for a variant of the Euler- Maruyama approximation method for stochastic switched systems.

We then explain how this bound can be used to control stochastic switched switched system in order to stabilize them in a given re- gion. The method is illustrated on several examples of the literature.

1 INTRODUCTION

Control synthesis for stochastic switched systems has been recently explored using the construction ofapproximately bisimilar sym- bolicmodels. This approach relies on the hypothesis ofincremental stabilityof the stochastic switched system (or existence of a com- mon/multiple Lyapunov function) [8–10], which concerns only a small part of the real systems.

Here we address the problem of control synthesis in a more general setting. We do not use the construction of approximatively bisimilar symbolic models, but the (tamed) Euler method [3] for stochastic switched systems. We thus follow the lines of previous work on control synthesis for deterministic switched systems [1].

Unlike [8–10], the Euler-based method requires neither state-space discretization nor input set discretization, thus avoiding a source of combinatorial explosion.

The correctness of these Euler-based methods does not rely on the hypothesis of incremental stability as in [8, 10], but on the hypothesis of ‘one-sided Lipschitz (OSL)’ condition with constant λP Rd (also called ‘monotonicity’/‘dissipativity’, see [7]). It can be seen that if a stochastic switched system satisfies an OSL con- dition withλ ă 0, then the functionVpx,x1q “ }x´x1}2is a common incremental Lyapunov functionin the sense of [9], from which it follows that the switched system is incrementally stable, and can be treated by approximate bisimulation. However, Euler- based methods also apply when the system is not incrementally stable, in which case the constantλis necessarily positive.

The plan of the paper is as follows: In Section 2, we give an explicit upper bound on the mean square error of the tamed Euler method for SDEs under OSL condition. We apply the result in order to ensure properties of stochastic switched systems, such as “pR,Sq- stability” (Section 3). We conclude in Section 4.

2 BOUNDING THE ERROR OF THE TAMED EULER METHOD

2.1 Assumptions

The symbol} ¨ }denotes the Euclidean norm onRd.The symbol x¨,¨ydenotes the scalar product of two vectors ofRd. Given a point x PRdand a positive realrą0, the ballBpx,rqof centrexand radiusris the settyPRd | }x´y} ďru.

Letτ P p0,8qbe a fixed real number, letpΩ,F,Pqbe a prob- ability space with normal filtrationpFtqtPr0,τs, letd,mP N: t1,2, . . .uletW “ pWp1q, . . . ,Wpmqq :r0,Rs ˆ ÑRm be an m-dimensional standardpWtqtPr0s-Brownian motion and letx0: ÑRdbe anF0{BpRdq-measurable mapping withEr}x0}ps ă 8for allp P r1,8q. Moreover, let f : Rd ÑRd be a continu- ously differentiable and globally one-sided Lipschitz continuous function whose derivative grows at most polynomially and let д“ pдi,jqiPt1, ...,du,jPt1, ...,mu :Rd ÑRdˆm be a globally Lips- chitz continuous function.

Then consider the Stochastic Differential Equations (SDE):

dXtfpXtqdt`дpXtqdWt, X0x0 (1) fortP r0,τs. The drift coefficientf is the infinitesimal mean of the processXand the diffusion coefficientдis the infinitesimal stan- dard deviation of the processX. Under the above assumptions, the SDE (1) is known to have a unique strong solution. More formally, there exists an adapted stochastic processX :r0,τs ˆ ÑRd with continuous sample paths fulfilling

Xt,x0x0` żt

0

fpXsqds` żt

0

дpXsqdWs

for allt P r0,τsP-a.s. (see, e.g., [6]).

We denote byXt,x0the solution of Equation (1) at timetfrom initial conditionX0,x0x0P-a.s., in whichx0is a random variable that is measurable inF0.

We suppose thatf behaves polynomially andдis Lipschitz, i.e.:

there exist constantsDPRě0,qPNandLдPRě0such that, for allx,yPRd

}fpxq ´fpyq}2ďD}x´y}2p1` }x}q` }y}qq (H1) }дpxq ´дpyq} ďLд}x´y} (H2) We also assume that the SDE (1) satisfies the following one-sided Lipschitz (OSL) condition with constantλPR:

PR@x,yPRd: xfpyq ´fpxq,y´xy ďλ}y´x}2 (H3) Remark 1. Constantsλ,LдandDcan be computed using (con- strained) optimization algorithms (see [1]).

2.2 Tamed Euler approximation

The standard way to extend the classical Euler method for ordi- nary differential equations to the SDE (1) is the Euler-Maruyama scheme [4]. More precisely, givenz : Ñ Rd anF0{BpRdq- measurable mapping withEr}z}ps ă 8for allp P r1,8q, the explicit Euler-Maruyama (EM) method for the SDE (1) is given by the mappingsYn,zN : Ñ Rd,n P t0,1, . . . ,Nu, which satisfy Y0N,z zand

Yn`N1,zYn,zN ` τ

N ¨fpYn,zN q `дpYn,zN qpWpn`1qτ{N ´W{Nq

(3)

for allnP t0,1, . . . ,N´1uand allNPN. See [4]. Unfortunately, the convergence results for the EM scheme does not hold when the drift functionf of the SDE (1) behaves polynomially (and not linearly). For the sake of generality, we will now adopt a refined scheme, which has been proposed recently in order to overcome this difficulty [3]. LetXn,zN :ÑRd,

Xn`N 1,zXn,zN `

Nτ ¨fpXNn,zq

1`Nτ ¨ }fpXn,zN q}`дpXn,zN qpWpn`1qτ

N ´W

Nq (2) for allnP t0,1, . . . ,N´1uand allN PN. We refer to the numerical method (2) as atamed Euler scheme[3]. In this method the drift termτn ¨fpXNn,zqis “tamed” by the factor 1{p1`Nτ ¨ }fpXNn,zq}q fornP t0,1, . . . ,N´1uandNPNin (2).

A time continuous interpolation of the time discrete numerical approximations (2) is also introduced in [3] as follows. LetX˜zN : r0,τs ˆ ÑRd,N PN, be a sequence of stochastic processes given by

X˜t,zN X˜n,zN `pt´{Nq ¨fpX˜n,zN q

1`τ{N¨ }fpX˜n,zN q} `дpX˜n,zN qpWt´WNq for alltP rN,pn`N1s,nP t0,1. . . ,N´1uand allN PN. Note thatX˜t,zN :r0,τs ˆÑRdis an adapted stochastic process with continuous sample paths for everyNPN.

Let us defineXNt,zby

XNt,z :Xn,zN fortP r

N,pn`1 N q.

Note thatX˜t,zN Xt,zN Xn,zN at timetN fornP t0,1, . . . ,Nu. The following theorem is proven in [3]:

Theorem 1. [3] SupposepH1qpH2qpH3q. Let the setting in this section be fulfilled, andz :ÑRdbe anF0{BpRdq-measurable mapping withEr}z}ps ă 8for allp P r1,8q. Then, for allp P r1,8q

sup

NPN

sup

nPt0,1, ...,NuEr}XNn,z}ps ă 8

For the sake of simplicity, the numberNof subsampling steps is now left implicit. From Theorem 1, it follows (cf. Lemma 4.3, [2]):

Lemma 2. SupposepH1qpH2qpH3q. Let the setting in this section be fulfilled, andz:ÑRdbe anF0{BpRdq-measurable mapping withEr}z}ps ă 8for allpP r1,8q. Then, for any even integerrě2, there exist two constantsEr,zandFr,zsuch that

sup

0ďtďτE}Xt,z´X˜t,z}r ď p∆tqr2pEr,zp∆tqr2`Fr,zdq.

withtτ{Nand:

Er,z2rp}fp0q}r `D2

r`1 2

p1`Esup

0ďtďτ}Xt,z}qrq

1 2pEsup

0ďtďτ}Xt,z}2rq

1 2q, Fr,z 2rp}дp0q}2r`LrдEsup

0ďtďτ}Xt,z}r2q.

Proof. see Appendix.

Remark 2. ConstantsEr,zandFr,zare computed using the con- stantsλandLд(see Remark 1), and the expected values ofXt,zat each timet 0,∆t,2∆t, . . . ,N∆t. These expected values are com- puted using a Monte Carlo method (by averaging here the value of104 samplings).

2.3 Mean square error bounding

The following Theorem holds for SDE (1). This corresponds to a stochastic version of Theorem 1 of [1], showing that a similar result holds on average, using the tamed Euler method of [3]. It is an adaptation of Theorem 4.4 in [2].

Theorem 3. Given the SDE system(1)satisfying (H1)-(H2)-(H3). Letδ0PRě0. Suppose thatzis a random variable onRd such that

Er}x0´z}2s ďδ02. Then, we have, for allτě0:

Er sup

0ďtďτ}Xt,x0´X˜t,z}2s ďδτ,δ2

0,

withδτ,δ2

0

:βpτqeγ τ, where:

γ 2p?

t`2λ`L2д`128L4дq, and βpτq “2δ02

`2τ∆tL2дp1`128L2дqpF2,zd`E2,ztq

`4τa

tDpF4,zd`E4,z2tq

1 2

p1`4E sup

0ďtďτ}Xt,z}2q`4E sup

0ďtďτ}X˜t,z}2qq

1 2. (3) witht τ{N.

Proof. The proof closely follows the proof of Theorem 4.4 in [2].

Letet Xt,x0´X˜t,z. We have, for all 0ďtďτ:

det “ pfpXt,x0q ´fpzqqdt` pдpXt,x0q ´дpzqqdWt. (4) Then, by using Equation (4) and the integral version of Itô formula applied to functionx ÞÑ }x}2we obtain

}et}2 }e0}2`

żt

0

2xes, fpXs,x0q ´fpXs,zqyds

` żt

0

}дpXs,x0q ´дpXs,zq}2ds`Mptq,

(5)

wheree0x0´z, and Mptq “

żt

0

2xes,дpXs,x0q ´дpXs,zqydWs. So we have using (H2):

2

(4)

}et}2ď }e0}2`

żt

0

2xes, fpXs,x0q ´fpX˜s,zqyds

`L2д żt

0

}Xs,x0´Xs,z}2ds

` żt

0

2xes,fpX˜s,zq ´fpXs,zqyds`Mptq.

(6)

So we have using (H3) and Young’s inequality:

}et}2ď }e0}2`

żt

0

p2λ}es}2`L2д}es}2qds

`L2д żt

0

}Xs,z´X˜s,z}2ds

` żt

0

p 1

?t}fpX˜s,zq ´fpXs,zq}2`a

t}es}2qds

`Mptq.

(7) So we have using (H1), for all 0ďtďτ:

}et}2ď

}e0}2` pa

t`2λ`L2дq żt

0

}es}2ds

`L2д żt

0

}Xs,z´X˜s,z}2ds

` D

?t

żt

0

p1` }Xs,z}q` }X˜s,z}qq}Xs,z´X˜s,z}2ds

`Mptq.

(8) It follows using Lemma 2 forr 2, and Cauchy-Schwarz in- equality:

Er sup

0ďsďt}es}2s ď

E}e0}2` pa

t`2λ`L2дq żt

0

E}es}2ds

`L2дτtpE2,zt`F2,zdq

` D

?t żt

0

pEp1` }Xs,z}q` }X˜s,z}qq2q

1

2pE}Xs,z´X˜s,z}4q

1 2ds

`mptq,

(9) where

mptq “Er sup

0ďsďt}Mpsq}s.

Hence, using using Lemma 2 forr4, and inequalitypa`bqr ď 2rpar `brq:

Er sup

0ďsďt}es}2s ď

E}e0}2` pa

t`2λ`L2дqq żt

0

E}es}2ds

`L2дτtpE2,zt`F2,zdq

`2a

tpE4,z2t`F4,zdq12 p1`4E sup

0ďtďτ}Xt,z}2q`4E sup

0ďtďτ}X˜t,z}2qq

1 2

`mptq.

(10) On the other hand, from the Burkholder-Davis-Gundy inequality, we get:

mptq ď16Er żt

0

}es}2}дpXs,x0q ´дpXs,zq}2dss12 Hence, using (H2):

mptq ď16L2дEr sup

0ďsďt}es}2 żt

0

}Xs,x0´Xs,z}2dss12 Then, using Young’s inequality (for anyαą0):

mptq ď8L2дpαEr sup

0ďsďt}es}2s ` 1 αEr

żt

0

}Xs,x0´Xs,z}2dssq.

Hence, by using Lemma 2 forr2:

mptq ď8αL2дEr sup

0ďsďt}es}2s

` 8L2д

α żt

0

Er sup

0ďrďs}er}2sds

` 8L2д

α τ∆tpE2,zt`F2,zdq.

(11) Hence, lettingα161L2

д

, we have by replacing in (10):

1 2Er sup

0ďsďt}es}2s ď δ02

` pa

t`2λ`L2д`128L4дq żt

0

Er sup

0ďrďs}er}2sds

`τpL2д`128L4дq∆tpE2,zt`F2,zdq

`τ2Da

tpE4,z2t`F4,zdq12 p1`4E sup

0ďtďτ}Xt,z}2q`4E sup

0ďtďτ}X˜t,z}2qq

1 2. (12) It results from Gronwall’s inequality:

Er sup

0ďtďτ}et}2s “βpτqeγ τ, with

3

(5)

γ2p?

t`2λ`L2д`128L4дq, and βq “2δ02

`2τp∆tL2дp1`128L2дqpF2,zd`E2,ztq

`4τa

tDpF4,zd`E4,z2tq

1 2

p1`4E sup

0ďtďτ}Xt,z}2q`4E sup

0ďtďτ}X˜t,z}2qq

1 2. (13)

It follows from Theorem 3 and Jensen’s inequality:

Proposition 1. Consider two pointsx0andzofRd,and a positive real numberδ0. Suppose thatx0PBpz,δ0q(i.e.}x0´z} ďδ0). Then EXt,x0PBpX˜t,z,δt,δ0qfor allt P r0,τs.

It also follows from Theorem 3:

Proposition 2. In the setting of Theorem 3, the expressionδτ,δ0

tends to

δ0?

2e2λτ`Lд2`128Lд4 whent tends to 0 (i.e., whenNtends to8).

2.4 Implementation

This method has been implemented in the interpreted language Octave, and the experiments performed on a 2.80 GHz Intel Core i7-4810MQ CP U with 8 GB of memory. The implementation is an adaptation of the program described in [1] for controlling determin- istic switched systems, but makes use of the tamed Euler scheme for SDEs (with the error functionδgiven in Theorem 3) instead of the classical Euler scheme.

Example 1. Consider the following system, corresponding to the example in Section 6.2 of [8] (cf. [9]) for modeu1:

dx1“ p´0.25x1`x2`0.25qdt`0.05x1dWt1 dx2“ p´2x1´0.25x2´2qdt`0.05x2dWt2

The program gives (forτ 1,t τ{104):q 0,D 1.36, Lд0.05,λ0.25; and forz“ p´4,´3.8q:E2,z 893.3,E4,z 2.14¨105,F2,z0.002,F4,z4.9¨10´6.

Consider now the system corresponding to the example of [8] for modeu2:

dx1“ p´0.25x1`2x2´0.25qdt`0.05x1dWt1 dx2“ p´x1´0.25x2`1qdt`0.05x2dWt2

The program gives (forτ1,t τ{104):q0,D1.36,Lд 0.05,λ0.25, and, forz“ p0,3q:E2,z 543.2,E4,z7.94¨104, F2,z0.0442,F4,z0.00178.

Both computations take less than 10 s. of CPU time. Simulations of the two systems are given in Figure 1 for modeu1and starting pointz “ p´4,3.8q, and in Figure 2 for modeu 2and starting pointz“ p0,3q. On each figure, the initial ball (t 0) is depicted in black, the final ball (t τ) in red, and 200 random sampling trajectories in blue fortP r0,τs.1

1Note that, in the figures, all the end points (attτ) of the sampling trajectories lie in the final ball, but this is not true in general; we only know by Proposition 1 that, for all starting pointx0of the initial ball, theexpected valueof the end point lie in the final ball.

-5.5 -5 -4.5 -4 -3.5 -3 -2.5 -2

-6 -4 -2 0 2 4

X1

X2

Figure 1: Example 1 with modeu 1,τ 1,t 10´4, initial ballBpz,δ0qwithz“ p´4,3.8qandδ00.5, final ball Bpz1,δτ,δ0qwithz1“ p´3.6,2.56qandδτ,δ01.17

-1 0 1 2 3 4 5 6

-1 0 1 2 3 4

X1

X2

Figure 2: Example 1 with modeu 2,τ 1,t 10´4, initial ballBpz,δ0qwithz “ p0,3qandδ0 0.5, final ball Bpz1,δτ,δ0qwithz1“ p0.79,´0.63qandδτ,δ01.17

4

Références

Documents relatifs

Access and use of this website and the material on it are subject to the Terms and Conditions set forth at Low energy Prairie housing : a survey of some essential features..

Analyzing the overlaps and the shapes of the spin-coupled orbitals, we were also able to qualitatively confirm that, in presence of a halogen bonding

En conséquence, cette définition peut tout à fait s’appliquer aux journalistes professionnels rémunérés à la pige mais aussi aux salariés à temps partiel ou titulaires

We justify the efficiency of using the explicit tamed Euler scheme to approximate the invariant distribution, since the computational cost does not suffer from the at most

Since both known in vitro experiments using fresh human faecal microbiota showed no degradation of erythritol within 12 h, the aim of the present study was to prolong the

To the best of our knowledge, the long-time behavior of moment bounds and weak error estimates for explicit tamed Euler schemes applied to SPDEs has not been studied in the

Abstract: The performance of networked control systems based on a switched control law is studied. The control law is switched between sampled-data feedback and zero control, the

Abstract— Stability properties for continuous-time linear switched systems are determined by the Lyapunov exponent associated with the system, which is the analogous of the