• Aucun résultat trouvé

LQG homing in a finite time interval

N/A
N/A
Protected

Academic year: 2021

Partager "LQG homing in a finite time interval"

Copied!
5
0
0

Texte intégral

(1)

Titre:

Title

: LQG homing in a finite time interval

Auteurs:

Authors

: Mario Lefebvre

Date: 2011

Type:

Article de revue / Journal article

Référence:

Citation

:

Lefebvre, M. (2011). LQG homing in a finite time interval. Journal of Control

Science and Engineering, 2011, p. 1-3. doi:10.1155/2011/561347

Document en libre accès dans PolyPublie

Open Access document in PolyPublie

URL de PolyPublie:

PolyPublie URL: https://publications.polymtl.ca/3647/

Version: Version officielle de l'éditeur / Published versionRévisé par les pairs / Refereed Conditions d’utilisation:

Terms of Use: CC BY

Document publié chez l’éditeur officiel

Document issued by the official publisher

Titre de la revue:

Journal Title: Journal of Control Science and Engineering (vol. 2011) Maison d’édition:

Publisher: Hindawi URL officiel:

Official URL: https://doi.org/10.1155/2011/561347 Mention légale:

Legal notice:

Ce fichier a été téléchargé à partir de PolyPublie, le dépôt institutionnel de Polytechnique Montréal

This file has been downloaded from PolyPublie, the institutional repository of Polytechnique Montréal

(2)

Hindawi Publishing Corporation Journal of Control Science and Engineering Volume 2011, Article ID 561347,3pages doi:10.1155/2011/561347

Research Article

LQG Homing in a Finite Time Interval

Mario Lefebvre

D´epartement de Math´ematiques et G´enie Industriel, ´Ecole Polytechnique de Montr´eal, C.P. 6079, Succursale Centre-ville, Montr´eal, QC, Canada H3C 3A7

Correspondence should be addressed to Mario Lefebvre,[email protected]

Received 3 September 2010; Accepted 11 March 2011 Academic Editor: L. Z. Yu

Copyright © 2011 Mario Lefebvre. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. LetX(t) be a controlled one-dimensional diffusion process having constant infinitesimal variance. We consider the problem of

optimally controllingX(t) until time T(x) = min{T1(x), t1}, whereT1(x) is the first-passage time of the process to a given

boundary andt1is a fixed constant. The optimal control is obtained explicitly in the particular case whenX(t) is a controlled

Wiener process.

1. Introduction

Let{X(t), t 0}be the one-dimensional controlled di ffu-sion process defined by the stochastic differential equation

dX(t)=a[X(t)]dt + b0u(t)dt + σ0dB(t), (1) where a(·) is a real function, u(t) is the control variable, b0=/0, and σ0 > 0 are constants and {B(t), t 0} is a standard Brownian motion. We define the first-passage time

T1(x)=inf{t > 0 : X(t)=d|X(0)=x}, (2) wherex < d, and the random variable

T(x)=min{T1(x), t1}, (3) wheret1> 0 is a constant.

Next, we consider the cost criterion

J(x)= T(x) 0 1 2q0u 2(t)dt + k ln[T(x) + 1], (4) whereq0 > 0 and k /=0 are constants. We want to find the control u∗ that minimizes the expected value ofJ(x). This

type of problem is a special case of the ones that Whittle [1, page 289] termed LQG homing. Notice that if the constantk

is negative, then the optimizer tries to maximize the survival time of the process in the interval (0,d), taking the quadratic

control costs into account. LQG homing problems have been

treated by various authors; see Kuhn [2], Lefebvre [3], and Makasu [4]. Kuhn and Makasu used a risk-sensitive cost criterion (see also Whittle [5, page 222]).

In the general formulation given by Whittle,{X(t), t≥0}

is ann-dimensional process and the random variable T(x) is

the moment of first entry of the joint variable (X(t), t) into

a stopping setD ⊂ Rn×(0,). However, in practice, it is

very difficult to obtain explicit solutions to problems in two or more dimensions (except in special instances). Moreover, in the papers published so far on homing problems, the hitting time T(x) was defined only in terms of X(t). Here,

we consider the case when the optimizer stops controlling the diffusion process at most at time t1.

Using a theorem in Whittle [1], we can state that the optimal controlu∗can be expressed as follows:

u∗=σ02 b0 G(x) G(x), (5) where G(x)=E  exp  b20 q0σ02k ln[τ(x) + 1]  . (6)

In the above formula,τ(x) is a random variable defined by τ(x)=min1(x), t1}, (7) with

τ1(x)=inf 

(3)

2 Journal of Control Science and Engineering

and{ξ(t), t≥0}is the uncontrolled process that satisfies the stochastic differential equation

dξ(t)=a[ξ(t)]dt + σ0dB(t). (9) That is,τ(x) is the random variable that corresponds to T(x)

for the diffusion process obtained by setting u(t)=0 in (1). Hence, the optimal control problem is reduced to the computation of the mathematical expectationG(x). Actually,

for this result to hold, we must have P[τ(x) <∞]=1. However, in our case this condition is trivially satisfied becauseτ(x)≤t1.

In Section2, we will obtain an explicit solutionu∗in the case whena[X(t)]≡μ > 0, so that

dX(t)=μdt + b0u(t)dt + σ0dB(t). (10) Notice that the uncontrolled process{ξ(t), t 0}is then a Wiener process with driftμ and diffusion parameter σ02. Fur-thermore, we will choose the constantk = −q0σ02/b20. With this choice, the mathematical expectationG(x) simplifies to

G(x)=E exp{ln[τ(x) + 1]}=1 +E[τ(x)]. (11)

2. Optimal Control of a Wiener Process

Letm1(x) denote the expected value of the first-passage time

τ1(x). In the case of the Wiener process defined by

dξ(t)=μdt + σ0dB(t) (12) the functionm1(x) satisfies the ordinary differential equation

1 2σ

2

0m1(x) + μm1(x)= −1, (13) and is such thatm1(x)=0 ifx=d. We find (see Lefebvre [6, page 220]) that

m1(x)= d−x

μ . (14)

Therefore, in the case whent1tends to infinity, the function

G(x) is given by

G(x)=1 +d−x

μ . (15)

It follows from (5) that

u∗t1=∞= −

σ02

b0 1

μ + d−x. (16)

Now, to obtain the expected value of the random variable

τ(x), we can condition on τ1(x):

E[τ(x)]=E[τ(x)|τ1(x)≤t1]P[τ1(x)≤t1] +E[τ(x)|τ1(x) > t1]P[τ1(x) > t1].

(17)

We may write that

E[τ(x)|τ1(x) > t1]=t1. (18) −1 −0.8 −0.6 −0.4 −0.2 −10 −8 −6 −4 −2 2 4 x

Figure 1: Optimal controlsu∗

t1=∞(dotted line) andu

(solid line)

whenx∈(−10, 5),b0=q00=μ=1, andd=t1=5.

Moreover, because the conditional probability density func-tion ofτ1(x), given that τ1(x)≤t1, is given by

1(x)(t|τ1(x)≤t1)= 1(x)(t) P[τ1(x)≤t1] for 0< t≤t1, (19) we have: E[τ(x)]= t1 0 t fτ1(x)(t)dt + t1P[τ1(x) > t1]. (20)

The function 1(x)(t) is known to be (see Lefebvre [6,

page 219]) 1(x)(t)= d−x 2πσ02t3 exp  d−x−μt 2 2σ2 0t  for t > 0. (21) Making use of this formula, we can obtain an explicit expression for the mathematical expectation E[τ(x)] and

hence, for the optimal controlu∗.

To illustrate the results, we computed (numerically) the optimal control whenb0=q00=μ=1 andd=t1=5. Looking at Figure1, we see that the optimal controlu∗tends to zero much faster than u∗t1=∞ = 1/(x−6) as x tends to −∞. However, forx close to the boundary at d=5, the two functions are similar.

Next, to see the effect of the constant t1 on the optimal control, we computed the value ofu∗ whenx = 0 andt1 varies from 0 to 20. This value is compared tou∗t1=∞= −1/6

in Figure2. Whent1decreases to 0, so doesu∗, as it should be. Fort115 (approximately), we haveu∗u∗t1=∞.

Finally, in Figure3, we show the optimal controlu∗when

d=15 andt1 =5. BecauseE[τ1(x)] =15−x (see above), when x 5 (approximately), it is very unlikely that the uncontrolled process will hit the boundary atd=15 before time t1 = 5. Therefore, the optimal control is close to 0. Notice that limd→ ∞u∗ =0 (for a finite value ofx). Indeed, we

can write that limd→ ∞P[τ1(x) > t1]=1. Hence, we deduce from (17) thatE[τ(x)]=t1, which implies thatG(x)=1+t1, and thusu∗d=∞=0.

(4)

Journal of Control Science and Engineering 3 −0.16 −0.14 −0.12 −0.1 −0.08 −0.06 −0.04 −0.02 0 2 4 6 8 10 12 14 16 18 20 t1

Figure 2: Optimal controlsu∗

t1=∞ = −1/6 (dotted line) and u

(solid line) whenx = 0,b0 = q0 = σ0 = μ = 1,d = 5, and

t1(0, 20). −1 −0.8 −0.6 −0.4 −0.2 0 6 8 10 12 14 x

Figure 3: Optimal controlu∗whenx(5, 15),b

0=q00=μ=

1,d=15, andt1=5.

3. Conclusion

We have considered LQG homing problems for which the optimizer controls the diffusion process in the time interval (0,T(x)], where the random variable T(x) is smaller than

or equal to a fixed constant t1. Moreover, the termination cost function was chosen so that the optimal control was expressed in terms of the expected value of a first-passage time for the corresponding uncontrolled process.

An application of this type of problem is the following: suppose that X(t) denotes the wear of a machine. The

optimizer wants to maximize the lifetime of the machine. However, it is natural to assume that the machine will be replaced after a certain time, even if it is still in working order, because it might become obsolete.

To obtain a more realistic model for the wear of a device, we could use a degenerate two-dimensional diffusion process, as in Lefebvre [7]. The most difficult problem would

then be to compute the probability density function of the first-passage timeτ1.

References

[1] P. Whittle, Optimization over Time, Wiley, Chichester, UK, 1982.

[2] J. Kuhn, “The risk-sensitive homing problem,” Journal of

Applied Probability, vol. 22, no. 4, pp. 796–803, 1985.

[3] M. Lefebvre, “A homing problem for diffusion processes with control-dependent variance,” Annals of Applied Probability, vol. 14, no. 2, pp. 786–795, 2004.

[4] C. Makasu, “Risk-sensitive control for a class of homing problems,” Automatica, vol. 45, no. 10, pp. 2454–2455, 2009. [5] P. Whittle, Risk-Sensitive Optimal Control, Wiley, Chichester,

UK, 1990.

[6] M. Lefebvre, Applied Stochastic Processes, Springer, New York, NY, USA, 2007.

[7] M. Lefebvre, “Optimizing the time spent by diffusion processes in intervals,” International Journal of Open Problems in

(5)

Internatfional Journal of

A

e

ro

spa

c

e

Eng

fin

e

e

r

fing

Hfindawfi Publfishfing Corporatfion

http://www.hfindawfi.com Volume 2010

Robo

Journal o

t

fics

f

H findawfi Publfishfing Corporatfion

http://www.hfindawfi.com Volume 2014

H findawfi Publfishfing Corporatfion

http://www.hfindawfi.com Volume 2014

Actfive and Passfive Electronfic Components

Control Scfience and Engfineerfing

Journal of

H findawfi Publfishfing Corporatfion

http://www.hfindawfi.com Volume 2014 Internatfional Journal of

Rotatfing Machfinery

H findawfi Publfishfing Corporatfion

http://www.hfindawfi.com Volume 2014 H

findawfi Publfishfing Corporatfion http://www.hfindawfi.com

Journal of

En

g

fin

e

er

fin

g

Volume 2014

Subm

fi

t

your

manuscr

fip

ts

a

t

h

t

tp

:

/

/www

.h

findaw

fi

.com

VLSI Desfign

Hfindawfi Publfishfing Corporatfion

http://www.hfindawfi.com Volume 2014

H findawfi Publfishfing Corporatfion

http://www.hfindawfi.com Volume 2014 Shock and Vfibratfion H

findawfi Publfishfing Corporatfion

http://www.hfindawfi.com Volume 2014

C

Advances

fiv

fi

l

Eng

fin

finee

r

fing

AcousAdvancestficsfin and Vfibratfion

H findawfi Publfishfing Corporatfion

http://www.hfindawfi.com Volume 2014 H

findawfi Publfishfing Corporatfion

http://www.hfindawfi.com Volume 2014 Electrfical and Computer Engfineerfing

Journal of

Advancesfin OptoElectronfics

Hfindawfi Publfishfing Corporatfion

http://www.hfindawfi.com Volume 2014

The

Sc

fient

fific

Wor

ld

Journa

l

H findawfi Publfishfing Corporatfion

http://www.hfindawfi.com Volume 2014

Senso

JouHfindawfi Publrnafishfing Corporatl ofionf

rs

http://www.hfindawfi.com Volume 2014

Modellfing & Sfimulatfion fin Engfineerfing Hfindawfi Publfishfing Corporatfion

http://www.hfindawfi.com Volume 2014

H findawfi Publfishfing Corporatfion

http://www.hfindawfi.com Volume 2014

Chemfical Engfineerfing

InternatfionalJournal of Antennas and

Propagatfion

InternatfionalJournal of

H findawfi Publfishfing Corporatfion

http://www.hfindawfi.com Volume 2014 H findawfi Publfishfing Corporatfion

http://www.hfindawfi.com Volume 2014

Navfigatfion and Observatfion

InternatfionalJournal of

H findawfi Publfishfing Corporatfion

http://www.hfindawfi.com Volume 2014

D

fist

r

fibuted

Senso

r

Netwo

rks

Figure

Figure 1: Optimal controls u ∗ t 1 =∞ (dotted line) and u ∗ (solid line) when x ∈ ( − 10, 5), b 0 = q 0 = σ 0 = μ = 1, and d = t 1 = 5.
Figure 3: Optimal control u ∗ when x ∈ (5, 15), b 0 = q 0 = σ 0 = μ = 1, d = 15, and t 1 = 5.

Références

Documents relatifs

By harmonic trans- form, we establish the existence of the Levy process conditioned to stay in a finite interval, called the confined process (the confined

By using now this result, Lemma 15 and the Markov property of the random walk and of Brownian motion, we deduce the following:..

We consider the accumulation of deleterious mutations in an asexual pop- ulation, a phenomenon known as Muller’s ratchet, using the continuous time model proposed in [4].. We show

Because of the construction of the Erlang PP P from a Poisson process of (density) intensity 2µ noted PO, the first point of P 0 after the origin is in C either if there is an even

In Timed Event Graphs, each circuit where the ratio equals the maximum ratio is a critical circuit as a small variation of any time durations relevant to the corresponding

Using this partition of the subsystems and particularly exploiting the time margins corresponding to idle times of machines, the approach allows an optimization of the non-critical

In Canada overall, in mathematics, more boys than girls reached the low international benchmark, the basic level of achievement (94 percent vs. 91 percent), and attained the

A free boundary problem modeling a microelectromechanical system (MEMS) consisting of a fixed ground plate and a deformable top plate is considered, the plates being held at