A NNALI DELLA
S CUOLA N ORMALE S UPERIORE DI P ISA Classe di Scienze
W ENDELL H. F LEMING Deterministic nonlinear filtering
Annali della Scuola Normale Superiore di Pisa, Classe di Scienze 4
esérie, tome 25, n
o3-4 (1997), p. 435-454
<http://www.numdam.org/item?id=ASNSP_1997_4_25_3-4_435_0>
© Scuola Normale Superiore, Pisa, 1997, tous droits réservés.
L’accès aux archives de la revue « Annali della Scuola Normale Superiore di Pisa, Classe di Scienze » (http://www.sns.it/it/edizioni/riviste/annaliscienze/) implique l’accord avec les conditions générales d’utilisation (http://www.numdam.org/conditions). Toute utilisa- tion commerciale ou impression systématique est constitutive d’une infraction pénale.
Toute copie ou impression de ce fichier doit contenir la présente mention de copyright.
Article numérisé dans le cadre du programme Numérisation de documents anciens mathématiques
http://www.numdam.org/
435
Deterministic Nonlinear Filtering
WENDELL H. FLEMING
Dedicated to the memory of Ennio De Giorgi
Abstract. A model for nonlinear filtering is considered in which errors in state
dynamics and observations are modelled deterministically. Mortensen’s deter- ministic estimator and a minimax estimator are considered. A risk sensitive stochastic filter model with small state and observation noise intensities is also considered. The minimax estimator is obtained in the zero noise intensity limit, using asymptotic properties of a pathwise interpretation of the Zakai stochastic
partial differential equation.
1. - Introduction
In
general
terms, thefiltering problem
can be stated as follows. Let xTdenote a state
(or signal)
at time T and yT an observation at time T. The observationdepends
on both xT and certain measurement errors. Estimates eT for xT are allowed whichdepend
onpast
observations yt for0 t
T.The
goal
offiltering
is to find estimates which are "best" or at least"good"
according
to some criterion.Moreover,
to be useful inapplications, filtering algorithms
must becomputationally implementable
in real time.In the traditional stochastic filter
model,
statedynamics
aregoverned by
white-noise driven stochastic differential
equations,
and observation errors arealso modelled as white noises. The
objective
is to minimize anexpected
meansquared
estimation error criterion. Forlinear-gaussian
filtermodels,
the Kalman-Bucy
filterprovides
ahighly
successful solution. This filter involvesonly
thesolution off-line of a matrix Riccati differential
equation
and the solution on-line of a linear stochastic differentialequation
for the estimate. See forexample [FR,
p.136].
For nonlinear stochastic filtermodels,
theexpected
meansquared
estimation error
depends
on the conditional distribution of the state at time Tgiven
past observations for0 t
T. Nonlinearfiltering theory
reducesResearch partially supported by NSF grant DMS-9531276 through Brown University and by ONR grant N0014-96-1-0267 through North Carolina State University.
(1997), pp. 435-454
the
problem
to the solution of the Zakai stochastic PDE for an unnormalized version of the conditionaldensity,
followedby integrations [Ku]. Except
in thelowest state dimensions this is a
computationally
formidable task.However,
a
splitting
method consideredby
Rozovskii and associates is useful fordoing part
of thecomputation
off line rather than in real time[LMR].
Inengineering practice,
extended Kalman filters arefrequently
used instead. The extended Kalman filterapproximation
has beenrigorously justified
in somespecial
cases,in which observation noise is of low
intensity.
See Picard[P].
Instead of the stochastic filter
model,
we consider in this paper a determin- istic model described in Section 2. Errors in the statedynamics
and observationsare modelled as deterministic functions
(called disturbances)
rather than as white noises. See(2.1), (2.2).
The unnormalized conditionaldensity
isreplaced by
acertain "information state"
function,
which in our notation is- V (T , x ),
whereV (T, x)
is definedby minimizing
a leastsquared
disturbance error criterion.See
(2.5).
The function V evolvesaccording
to the first-order PDE(2.6)
ofHamilton-Jacobi-Bellman type, and is
interpreted
as aviscosity
sense solutionto
(2.6).
This function V is central to theanalysis.
In Section 3 we consider the solution operatorST,
which maps initialdata 0
for the PDE(2.6)
to thesolution
V (T, x).
The solution operator is "linear" when considered in the so-called
max-plus algebra [AQV]
rather than with respect toordinary
addition and scalarmultiplication.
See(3.3), (3.4).
Anotherimportant property
is that the solutionoperator
preservessemiconcavity (Lemma 2.5).
Theseproperties
areuseful to obtain a
representation (3.7)
of solutions in terms ofquadratic
basisfunctions.
The deterministic
approach
tofiltering
waspioneered by
Mortensen[Mo].
Mortensen’s estimator
xT
minimizesV(T, x)
as a function of x.Thus, iT
maximizes the information
state - V ( T , x ) .
In Section 4 we consider anotherestimator
8$,
called a minimax estimator. It is aslight
variant of the minimax estimator introducedby McEneaney [Mc2].
The minimax estimator minimizesover e the maximum over x of
el) - V ( T , x), where it
> 0 andt(r)
represents the loss from absolute estimation error r. This minimaxpoint
ofview in
filtering
is similar inspirit
to the robustapproach
to control(also
called
H-infinity
controltheory).
See[BB].
In Section 5 we return to a stochastic filter model.
However,
instead of the traditionalexpected
mean square errorcriterion,
anexpected exponential-of-loss
criterion is minimized. The estimator obtained in this way is called a risk sensitive filter.
Hijab’s
thesis[H]
obtained the deterministic information statevia WKB -type
asymptotics
of the unnormalized conditionaldensity,
as a stateand observation noise
intensity parameter E
tends to 0. See also[JB]
for aproof
of this kind of result
using viscosity
solution methods. In order to compare stochastic and deterministic filtermodels,
so calledpathwise filtering theory
is needed
[Da] [Mi] [FP].
Under somesimplifying assumptions,
we sketch astochastic control
proof
of thisasymptotic
result(Lemma
5.1(a)).
Then weprove convergence of the risk sensitive estimate to the minimax estimate as E ~ 0
(Theorem 5.2).
The results
presented
here arepart
ofongoing joint
research with W. M. McE- neaney. The mathematicalframework,
and similar results under a different set ofassumptions,
have been announced in[FM2].
In thepresent
paper weimpose
restrictive
assumptions
as needed to avoid various technical issues addressed in[FM2]
and its more detailed version which is inpreparation.
2. - Deterministic filter model
In this paper we are concerned with the
following
model. The state xt which is to be estimated evolvesaccording
to the differentialequation
where wt
is the state disturbance. The observation yt at time t satisfieswhere vt is the observation disturbance.
Here xt
E EJRm,
E RP.We assume
throughout:
Here f,
is the matrix ofpartial
derivatives off
with definedsimilarly.
An estimator is a function which
assigns
to each observationtrajectory
Y. and time T an estimate eT for the unknown state xT . It isrequired
to be nonan-ticipative,
in the sensethat yt = yt
for 0 Timply
that thecorresponding
estimates
satisfy
eT =eT . Following
a rather common abuse ofterminology,
we shall also refer to eT as the estimator.
Mortensen’s deterministic estimator
[Mo]
is described as follows.Let 4) satisfy:
We
regard -4J(xo)
as a measure of the likelihood of unknown initial state xo.(Later
additionalassumptions (A3)-(A5)
will be made asneeded.)
LetGiven
observations yt
for 0 T, one wishes to minimize J among all xo, w. v, consistent with the observations.Suppose
that the minimum occurs atX0, w*, v*,
and letxt
be thecorresponding
solution to(2.1).
The Mortensenestimator at time T is
xT
=xr. (If
the minimum is notunique,
theniT
neednot be
unique.)
It is convenient to describe the Mortensen estimator in the
following
way.We fix the final state x and solve
(2.1 )
backward in time. An obser-vation
trajectory
y. is fixedthroughout
the discussion which follows. We canrewrite
(2.3)
asFor
given (T, x),
a standard existence theorem in controltheory implies
thatthere exists w * which minimizes J. See
[FR,
Cor. III.4.1 ] .
Let K c 1 be any
compact
set. There exists a constantBK
such that whereI . 112
is the norm inZ~[0, T], provided
that( T , x )
E K.This follows from
(A2), (2.3’)
and0 :s J(T, x ; J(T, x; 0).
It then suffices to considerdisturbances w.
withIf xt
is thecorresponding
solutionto
(2.1 )
with xT = x, then R for some Rdepending only
on K. We callthis the
range of dependence property.
Let us next obtain a L 00 bound forw7:
LEMMA 2.1.
(a)
For every compact set K CJRn+1
there existsMK
such thatMK provided (r, x )
E K.(b) Lipschitz
and or is constant, then M where Mdepends only
on T.
PROOF. It suffices to assume
that ~
issmooth, by approximating ~ uniformly
on
compact
setsby
smooth functions whichsatisfy
uniform localLipschitz
bounds.
By Pontryagin’s principle [FR,
Sec. II.11 ], 2 I w I2 -~- pt ~ ~ (xt ) w
isminimum for w =
w7,
wherewith po =
-Øx (xõ)
which is boundedby
the range ofdependence
property.From
(2.4)
and(AI)
for suitable
Ci
1depending
on K.BK,
thisimplies
thatlog(I
+is
bounded,
and hence also pt and This proves(a). If 0
isLipschitz
and o- is constant, then po = is bounded and o-x = 0. A bound for pt, and hence also a bound for
w7,
whichdepends only
on T is obtainedimmediately
from(2.4).
0Following
thedynamic programming
method consider theoptimal
cost func-tion
One can
interpret - V (T, x)
as a measure of the likelihood of state xT = x attime T.
Sometimes - V (T, .)
is called theinformation
state at time T. TheMortensen estimator
xT E argminx V (T, x)
is in this sense a maximum likelihood estimator for xT . The functionV (T, .)
satisfies a localLipschitz condition, uniformly
for T in any finite interval. To seethis, by
the range ofdependence property
it suffices toconsider q5 Lipschitz
on R" and then to use Lemma 2.1(a)
and a standard argument
[FS,
Sec.4.8].
Another standardargument using
thedynamic programming principle gives
a uniform localLipschitz
estimate forV ., x). Thus, V (., .)
islocally Lipschitz.
Moreover,
V satisfies in theviscosity
sense thedynamic programming
PDE(also
called Hamilton- Jacobi-BellmanPDE)
with initial data
Here a = aa’ where ’ denotes matrix
transpose.
In order to obtain
uniqueness
ofviscosity
solutions to(2.6)-(2.7),
furtherassumptions
on the initialdata 0
are needed. Ageneral uniqueness
result whenI
grows at mostlinearly in Ixl as Ixl
~ oo isgiven
in[Mcl].
In thepresent
paper, thisuniqueness
issue will not arise. We remark thatif 0
isLipschitz
and or is constant, thenV(T, .)
satisfies a uniformLipschitz
conditionon any finite time interval.
Uniqueness
holds in the class of suchviscosity
solutions.
In addition to
(A 1 )
and(A2)
let us now assumeLEMMA 2.2. For each T >
0,
SKETCH OF PROOF. If w* is
minimizing
andxt
thecorresponding
solutionto
(2.1 )
withxj.
= x, thenby (2.3’)
If
V (T, xn )
is bounded for a sequence xn withI --+
oo, then thecorrespond- ing xon
is boundedby (A3)
andw n
is bounded inL2-norm. However,
this isimpossible by (2.1)
and(Al).
DIn
particular, V (T, .)
has a minimum at somexT (Mortensen estimator).
REMARK 2.3. If
f (x) -
= constant,h(x) =
Hx,and 0
isquadratic
the model
(2.1 )-(2.2)
is the deterministic counterpart of the stochastic Kalman-Bucy
model. In this case, the Mortensen filter agrees with theKalman-Bucy
filter. It satisfies the linear differential
equation
where can be
precomputed
from the solution to a matrix Riccati differentialequation.
Fornonquadratic 41,
there are similarasymptotic
results as T ~ oo.See formula
(2.13)
in the 1-dimensional case and discussion after it.REMARK 2.4. In the nonlinear case there is an
analogue
of(2.8).
Fornotational
simplicity,
let xt , yt be scalar valued(n
= p =1). Suppose
that V issmooth
(class C2)
in aneighborhood
of(T, iT)
and thatVxx (T, xT)
> 0.By differentiating
the PDE(2.6)
with respect to x and theequation Vx (T, XT)
= 0with respect to T, one gets
with initial data
xo
EUnfortunately, Vxx
is not known withoutsolving (2.6)-(2.7);
and hence(2.9)
does notprovide
a finite dimensional proce- dure forfinding iT.
Inengineering practice,
extended Kalman filtersinvolving repeated
linearizations of(2.1)
and(2.2)
arefrequently
used instead. At the end of Section4,
we will consider thespecial
case of one to one observa-tion function h and small observation parameter p. In that case a
simplified
version
(4.5)
ofequation (2.9) provides
a robust filter.SEMICONCAVITY. A
function fjJ
is called semiconcave if for every R there existsCR
such that(2. 1 0) x - x - 1C 2
(2.10) 2
is concave on the ball
R } .
In addition to(Al)
let us now assume:bounded.
LEMMA 2.5.
If 0
is semiconcave, thenV (T, ~)
is semiconcave. Infact,
thereexists
rR (T )
suchthat for
0 T T, R > 0is concave on the ball
I R I -
SKETCH OF PROOF. Let
where r =
rR (T)
is to be chosensuitably.
Since the infimum of anyfamily
ofconcave functions is concave, it suffices to show that
j(r,
. ;w.)
is concave onR}
foreach w.. By smoothing
via convolution withapproximations
tothe
identity,
we may assumethat 0
is smooth. Thensemiconcavity
isequivalent
to
CRlçl2
forall ~ , when
R. To showconcavity of J
itsuffices to show that
whenever Ixl :s
R = 1. Thesolution xt =
to(2.1) depends smoothly
onthe
final data x = XT. Let0161/, 0161?
denote the first and second order derivativesof xt
in thedirection ~ .
ThenBy
Lemma 2.1 we may assumethat MK,
where K =[0, T] x R }.
Then IXol:S R
I for suitableR
I and hence~o Moreover,
from
(A 1 ), (A2)
and(A4)
there are boundsfor I~/I, 1
and all other termson the
right
side of(2.2).
Thisimplies (2.11),
for suitable r. a LARGE TIME BEHAVIOR. We recall that isinterpreted
as a measureof the likelihood of initial state xo. The
function 0
is often chosen ratherarbitrarily.
It is aninteresting question
to describe conditions under which thedependence
of the Mortensen filteron 0
is lostasymptotically
as T --~ oo.For the stochastic
(white
noisedisturbance)
counterpart of the model which weconsider,
see[OP]
and references cited there for results of this kind.Let us consider
only
the linear model in Remark2.3,
butwith 0
not neces-sarily quadratic. (For
the stochasticmodel,
thiscorresponds
to lineardynamics
and
observations,
butnongaussian
initialdata.)
As in Remark2.4,
forsimplicity
we consider the 1 dimensional case.
Equation (2.9)
now becomesWe assume that
H #
0 andthat o
is smooth(class C2)
with 0 C. Wewill show that V is also smooth with > 0, and that as T --~ oo,
Vxx (T , XT)
tends at an
exponential
rate to a constant K. See formula(2.21).
The constantK is the same one obtained via the
Kalman-Bucy filter,
and does notdepend
on
q5.
Sincef (x) -
Ax,h(x) =
Hx, a is constantand 0
isstrictly
convex,J(T, x ; w.)
isstrictly
convexby (2.1)
and(2.3’).
Hence theminimizing w*
isunique. Moreover, by Pontryagin’s principle wt -
-cr pt with pt as in(2.4).
Then
(2.1), (2.4), with xt
becomeThese are the characteristic
equations
for the PDE(2.6),
for this choice off, h, a.
Letxt (a), pt (a)
denote the solution to(2.14)
with initial dataGiven
T, x,
we havext
= for aunique
a * such thatx_= xT (a * ) .
The method of characterisitics
provides
a smooth solutionV (T, x)
to(2.6)- (2.7)
on some interval0 s
TT1,
andV(T, x)
=V(T, x)
in this interval. Letus show that
TI
= oo and thatVxx (T, x)
> 0. For 0 tT1,
Let §1
=axt/aa,
qi =-apt/aa. By (2.14)
with the initial data
~o
=1,
170 = > 0. Anelementary analysis
showsthat §1
>0, r~r
>0 for all t >
0. Since > 0 themapping
a ~xt (a )
is one-to-one, and
V(r,jc)
= for all T. ThusT, = Moreover, by (2.17), Vxx
> 0. SinceVxx
= we haveby (2.17)
and(2.18)
for eachfixed a
where
Vxx =
and the initial data are =Thus,
rt = solves the Riccati differential
equation it
=g (rt ),
whereThen
g ( K )
=0, g’(K) = -h,
whereFor initial data 0 ro
C,
there exists B such thatI rT - Be-ÀT
forall T.
By choosing
a such thatxT (a)
=iT,
Thus, (2.13)
isasymptotically
as T --~ oo the same as forquadratic
initialdata. From this we wish to obtain a result which says
that, asymptotically
asT 2013~
00, iT
losesdependence
on0.
By (2.20), (2.21)
we can rewrite(2.13)
asfor some
fl.
Consider anotherinitial $
with 0$(x)
:s C. Thecorresponding
Mortensen estimate
iT
also satisfies(2.22),
with1/IT replaced by
which alsosatisfies
(2.23).
Let~T
= ThenLet us assume that the state and observation disturbances
satisfy
An
elementary analysis
showsthat I and I
are all as t -0o for any it >
A+,
whereA+ = max(A, 0). By (2.20b), A+ h/2.
From
(2.23), (2.24)
it then follows that~T
-~ 0 as T ~ oo. The author wishes to thank D. Hemandez-Hemandez forhelpful
commentsconcerning
thisargument.
3. - The solution
operator
Let
ST
denote the nonlinear operator which maps the initialdata 0
in(2.7)
to the
optimal
cost function V definedby (2.5):
Recall that V satisfies the PDE
(2.6)
in theviscosity
sense. Let C denote thecone of all
functions 0
suchthat q5
is semiconcave and satisfies(A2), (A3).
By
results of Section2, ST
maps C into C. Moreover,Moreover,
the solution operator has thefollowing important property.
Considera collection Z of functions
qlz
E C.LEMMA 3.1.
If 0 (x)
= thenPROOF.
Since 0 j qlz
for all z,(3.2) implies
thatSTØ(X)
is no more than theright
side of(3.4).
Given(T, x),
let w* minimizeJ(T,
x;wJ
with associated solution x * to(2.1), xT
= x. = for some ~ E Z and hencewhere for
J~
wereplace 0 by ql,
in(2.3’).
DProperties (3.3), (3.4)
state that the solution operatorST
is"linear",
not withrespect
to the usual addition and scalarmultiplication
but when considered withrespect
to the so-called"max-plus" algebra [AQV].
We shall return to thispoint
in Section 5 in
discussing
the small noiseasymptotics
oflinear, parabolic
PDEswhich arise in risk sensitive
filtering.
See Remark 5.3.BASIS FUNCTIONS. It is often useful to
approximate
solutions tolinear,
time-dependent
PDEsby
linear combinations of solutions which have initial data chosen from agiven
set of basis functions.Something
similar can be done for solutionsV(T, x)
to(2.6)-(2.7) provided
we work in themax-plus algebra.
Letus consider "basis functions" of the form
where for notational
simplicity
we suppress thedependence of 0,
on C.We wish to represent a semiconcave
function 0
in terms of basis functionsq5,.
For
simplicity,
let us first assume is concave on R’ forsome
C 1.
Let C >Ci
1 and as in(2.10) ~ (x ) Then 0
isconcave on R" and -~ -oo as oo. Let
a(z)
be the dual of the convex function-4>:
Then - $(x)
is the dual of the convex functiona (z) [RW, Chap. 11 ] :
or equivalently
By (3.3)
and(3.4)
To remove the
global
restriction above onq5, let 0
be semiconcave. Given acompact
setK,
it suffices to consider disturbances w. such that R where Rdepends only
on K(the
range ofdependence
property in Section2.)
Wecan
choose q5
such= q5 (x) whenever Ix I :S
R and for suitableCR
is concave on R"
with Ixl
~ oo. Then =for all
(T, x)
E K and we canreplace q5 by o
in(3.7).
Forexample
one canchoose
where B is
sufficiently large
and 9 is smooth with = 1 forIx (
R and9(x) = 0 for
Basis
representations
may be useful in connection with a numericaltechnique
which does part of the task of
solving (2.6)-(2.7)
"off line" rather than in real time. This will be discussed in aforthcoming
paper withMcEneaney.
Asimilar
approach
forsolving
the Zakai stochastic PDE of nonlinearfiltering
wasdeveloped by
Rozovskii and associates. See[LMR].
4. - Robust filters
The definition of robust filter which we will use is a
slight
modification ofone introduced
by McEneaney [Mc2].
Let eT be any estimator for the state xT at time T. We consider a measurel(lxT - eT I)
of the loss from estimationerror xT - eT. In
[Mc2]
thequadratic
loss function.~ (r ) =
is used. Weassume:
Let it
> 0 be a parameter. We say that eT achieves robust estimation at level>
if,
for all xo, w., v.If one sets
y 2
=A- 1,
then y has the role of a familiarH~ -
bound parameterin robust control and estimation. See
[BB] [Mc2].
Sincef"(0)
>0,
for small estimation errors the left side of(4.1)
behavesnearly
like aquadratic.
However
(A5)
allows for loss functions with less thanquadratic growth
oft(r) as )r) -~
oo. In Section 5 we will considerlinearly growing i (r),
such as forexample t(r)
=(1
+r2)1/2 _
1.By (2.3)
and(2.5),
eT achieves robust estimation at level /t at time T if andonly
iffor all x E R’~. Under rather
general assumptions
we should expect that thereare many robust estimators eT
if > A*,
and no robust estimatorsif >
>it*,
for a certain critical
parameter value /t*.
See[Mc2]
for the case ofquadratic t(r)
andquadratically
Let us consider thefollowing estimator,
introduced in
[Mc2].
MINIMAX ESTIMATOR. Let
We assume that the maximum is attained
(finite). By (A5)
the function isstrictly
convex on R’ for each x and moreoverThis
implies
thatG(T,.)
isstrictly
convex on R’ and has a minimum at aunique ~0.
We call8$
the minimax estimator for xT .By (4.1’)
the minimaxestimator
8$
is robust if andonly
ifIf the min and max are taken in the
opposite
order thenand the Mortensen estimator
Jcr
attains the maximum over x. Ingeneral, min max >
max minand 8$
is not the same asXT.
EXAMPLE 4.1. Take T =
0, n
= 1 andwith 0 a
1,
A calculation shows that8g
= 0 is robust and thatthe minimum
of 0 (x)
occurs at 0. DOn the other
hand,
if max min =min max,
then8$
=XT.
Inparticular,
thissaddle
point
property holds ifel) - V(T, x)
is concave in x, since this function is convex in e. Forinstance,
thishappens
in the Kalman -Bucy setting (Remark 2.3) with 0 (x)
andt (r) quadratic.
ThenIlx - el2 - V ( T , x )
isquadratic,
and concave in xfor it ~ * where >*
is the critical parameter value.SMALL OBSERVATION NOISE. When p is small in
(2.2),
then can beestimated with small error. We call this the case of small observation noise. In
general,
thefiltering problem
has no easy solution for small p, or even for thecase p = 0 considered in
[JL].
We consider hereonly
thefollowing
easy casein which states and observations have the same dimension n = p = 1 and h is
one-one with
Lipschitz
inverse:Following
anargument
in Picard[P],
for thecorresponding
stochastic small observation noisemodel,
thefollowing proposition provides
an easy way togenerate
robustfilters,
in thisspecial
case.PROPOSITION 4.2. Assume
(4.4)
and let1/1.
be anybounded function
such that1/It
~ c > 0.Define
the estimator eTby
with initial data eo.
Let 0 (x) = k(x - eo)2
where k > 0 andl(r)
=!r2.
Then>
0, To
> 0 there exists po > 0 such that eT achieves robust estimation atlevel
tt for
T >To
and 0 p po.PROOF. We may assume that
hx
> 0.By subtracting (4.5)
from(2.1),
we haveBy using
theinequality 2ab
and the boundedness of and1/1’t,
we obtain for small pThe first term on the
right
side is no more ifT > To
> 0and p
issmall
enough (p po . )
0Under the
special
circumstances ofProposition 4.2,
it seemslikely
that theminimax and Mortensen estimators are the same. It would be
interesting
tofind choices of
1/It
in(4.5)
whichgive especially good approximations
to theMortensen estimator
iT. (See [P]
for detailedanalysis
of acorresponding question
in the stochasticmodel,
in whichxT
isreplaced by
the conditionalmean.)
Onepossibility
is to find1/IT
from(2.9)
withVxx (T, xT) replaced by
anapproximation
obtainedby linearizing f (xt )
andh (xt )
aboutf ( yt )
andh (yt ),
and aquadratic appromiation to ql (xo) about 0(yo).
The solution to thislinear-quadratic approximation suggests
thatpvxx (T, iT)
is bounded belowby
a
positive
constant ifT > To
> 0 and p is small.REMARK 4.2. A more
general
formulation of robustfiltering
involves esti-mation errors over 0 T as well as at time T. Let
i (r) = ! r2.
Then theformulation in
[BJP] requires that,
for all xo, w., v.where ~c 1, /-t2 are
nonnegative
parameters(not
both0.)
In(4.1 )
we took >1 = it, A2 = 0. Other authors[DBB] [Kr]
took Al =0,
tt2 > 0. When ~2 > 0 theanalysis
becomes morecomplicated
since thedynamics
of an information state functionV (T, x), analogous
to ourV (T, x), depend
on the estimationtrajectory
e, which is to be chosen
optimally.
Under restrictiveassumptions
there is ananalogue iT
of the Mortensen estimatoriT,
obtainedby minimizing V(T, .) [Kr,
Thm.4.17].
The function V satisfies anequation (or inequality)
rathersimilar to
(2.6). However,
it is not a PDE(or partial
differentialinequality)
since a term
involving IX - XT 12
must be added to theright
side of(2.6).
See[Kr,
formula4.4.12].
The robustfiltering problem
with it2 > 0 and = 0 is aspecial
case of a nonlinearH,,,-control problem
withpartial
state observations.The control is the estimate et, which affects the
running
cost but not the statedynamics.
The result in[Kr] just
mentioned is an instance of acertainty
equivalence principle
in control. See also[JBE,
Section4.6]
5. - Risk sensitive filters
Instead of the deterministic model in Section
2,
let us now consider a stochas- tic filter model for system states and observations. Tosimplify
thepresentation,
let us make in this section the
following assumptions:
In
addition,
weonly
sketch variousarguments
which are well known and aregiven
in more detail elsewhere. Thedevelopment
will be similar to that in[FM2]
where a
technically
more difficult set ofassumptions
is made(including nonconstant, O(x) quadratically growing
asIx I - oo.)
Let
XI
denote the state process andYt
an accumulated observations process.They satisfy
stochastic differentialequations
where e > 0 is a parameter and
B., E.
areindependent
Brownian motion pro-cesses. Moreover,
Xo
isindependent
ofB., B.
and hasdensity ks exp[-E-10(x)l
where ks
is anormalizing
constant. The traditional nonlinearfiltering problem
is to find a minimum
expected
least squares estimate eT forX’,,
such that eTis measurable with
respect
to the a -algebra generated by
the accumulated observationsYt
for 0 T. For the risk sensitivefilter,
instead ofexpected
least squares,
expected exponential-of-loss
is to be minimized.Thus,
eT ischosen to minimize ...
where JL > 0 is a parameter.
The risk sensitive filter
problem
can be restated in terms of an unnormalized conditionaldensity q~ (T, x),
which satisfies the Zakai stochastic PDEwith initial data
Here
,C~
is the generator of the Markov processXr :
and
,C~
is the(formal) adjoint
ofLe.
See forexample [Da] [Ku].
The risk sensitive filterproblem
can be reformulated as one ofchoosing e = 8§l
whichminimizes
By
the estimate(5.12) below,
decreases to 0 at anexponential
rate asIx
~ oo, which assures finiteness of theintegral
for smallenough
it.We wish to show that the
optimal
stochastic risk sensitive estimator tendsto the deterministic minimax estimator in Section 4 as 8 ~ 0. This cannot be done
directly,
since a fixed observation function Y. isgiven
in the deterministic filtermodel,
whileY *8
is a nowhere differentiablesample path
of a stochasticprocess. This
apparent difficulty
is avoidedby using pathwise
nonlinearfiltering theory.
Thisprovides
a version of the unnormalized conditionaldensity
processq’
which"depends continuously"
on the accumulated observationsample paths.
See
[JB] [Da] [FP]. By using pathwise filtering theory
it suffices to consider any fixed smooth(class C 1 )
accumulated observation functionY
and to rewrite the Zakaiequation (5.3)
in Stratonovich form:where
d YT /d T .
Let usmultiply q £ by
a convenient factordepending only
on thegiven
observationpath
y., which does not affect the minimizationover e in
(5.6).
LetThen
q~
satisfies the linearparabolic
PDELet V I
= - 8 log 4’.
Thenwith initial data
Note
that,
when E =0, (5.8)-(5.9)
become(2.6)-(2.7)
in case o- =identity. By
viscosity
solution methods it can beproved
that Vuniformly
oncompact
sets as 8 - 0. See
[JB].
However, let us sketch a direct stochastic controlproof
of this fact
(see
Lemma5.1(a).)
There is the
following
stochastic controlrepresentation
forV’(T, x). Let §)
denote a controlled process,
satisfying
backward in time the stochastic differ- entialequation
with P.
a backward in time Brownian motion and w. any backward in time boundedprogressively
measurable control process.(A
moreprecise
formulation in terms of referenceprobability
systems isgiven
in[FS,
p.160].)
LetSince
(A6) holds,
a standardargument [FS
p.190] gives
abound M,
where Mdepends only
on T. Thedynamic programming
PDE for theproblem
of
minimizing J’,
as a function of w,, is(5.8).
A verification theorem thenyields
Moreover,
whichimplies
that it suffices to assumethat
sincew7
=Vx (t, ~t *) provides
anoptimal (feedback)
control. Here~t *
solves(5.10)
with wi =
w7.
LEMMA 5.1.
(a)
There existsCl (T )
such that(b)
There existpositive Kl (T ), K2 (T)
suchthat, for
0 81,
SKETCH OF PROOF. Given any
sample path
w. for an admissible control process suchthat IWtl :s M (M
=M(T)),
let xt be the solution of(2.1 )
with a=
identity
and xT = x. Thenwhere II II
is the sup norm on[0, T]. By (A6)
thisimplies
for all such controlprocesses w, which
implies (a).
To prove
(b)
it sufficesby (a)
to show thatfor suitable
K2(T).
For this purpose choose a > 0 such thatfor all
x,
M. ThenBy using (A6) (b),
for suitableKI (T), K2 (T)
for all w. such that
M(T). By (2.5)
thisgives (b).
0By (5.12)
Moreover, by (A6) (c),
Dr for some constant D. Thenwhich
implies
finiteness of theintegral
in(5.6) for JL
where ~l 1 -Since
.~(~x - ~~)
isstrictly
convex, the function4$~(T, .)
in(5.6)
isstrictly
convex.Moreover, ~~ (T, e)
-~ +00 aslei -
oo.Hence, ~~(r,~)
hasa minimum at a
unique
called the risk sensitive estimator for xT . The main result of this section is thefollowing.
THEOREM 5.2. As 8 -~ 0 the risk sensitive tends to the minimax
provided JL
with(5.12).
PROOF. Let
Then 8§l
minimizesG~ (T, .),
sincee)
is a function of Tplus log 4$~ (T, e).
A standard
Laplace-Varadhan asymptotic principle argument together
with(5.11), (5.14)
shows thatG£ (T, e)
-~G(T, e) uniformly
on compact sets(for
eachfixed
T)
as s ~0, provided
JL SinceG (T, ~)
isstrictly
convex andis minimum at the conclusion follows. D
REMARK 5.3. We write
exp[-E-1 V]
in the sense that V-’ = -£log 4’
tends to V
uniformly
oncompact
sets as 8 -~ 0. The PDE(5.7)
forijê
islinear. Thus
Similarly,
if is a solution to(5.7)
for i =1, 2,
thenThese
asymptotic properties
make clearwhy properties (3.3), (3.4)
of the solution operator musthold,
and thusST
is a linearoperator
in the maxplus algebra.
REFERENCES
[AQV] M. AKIAN - J.-P. QUADRAT - M. VIOT, Bellman Processes, in "Lecture Notes in Control and Information Science", No. 199, Springer-Verlag, 1994, 302-311.
[BB] T. BASAR - P. BERNHARD, "H~-Optimal Control and Related Minimax Design Prob-
lems ", Birkhauser, Boston, 1991.
[BJP] R. K. BOEL - M. R. JAMES - I. R. PETERSEN, Robustness and Risk Sensitive Filtering,
submitted to IEEE Trans. Auto. Control.
[Da] M. H. A. DAVIS, "Lectures on Stochastic Control and Nonlinear Filtering", Tata Institute Lectures on Math. and Physics No. 75, Springer Verlag, 1984.
[DBB] G. DIDINSKY - T. BASAR - P. BERNHARD, Structural properties of minimax policies of a class of differential games arising in nonlinear H~-control and filtering, in "Proc.
32nd IEEE Conf. on Decision and Control", San Antonio, 1993.
[FM1] W. H. FLEMING - W. M. MCENEANEY, Risk sensitive optimal control and differential
games, in "Lecture Notes in Control and Info. Sci.", No. 184, Springer Verlag, 1992,
185-197.
[FM2] W. H. FLEMING - W. M. MCENEANEY, Risk Sensitive and Robust Nonlinear Filtering,
in "Proc. 36-th IEEE Conf. on Decision and Control", San Diego, 1997.
[FP] W. H. FLEMING - E. PARDOUX, Optimal control forpartially observed diffusions, SIAM
J. Control and Optim. 2 ( 1982), 261-285.
[FR] W. H. FLEMING - R. W. RISHEL, "Deterministic and Stochastic Optimal Control", Springer-Verlag, 1975.
[FS] W. H. FLEMING - H. M. SONER, "Controlled Markov Processes and Viscosity Solu- tions", Springer-Verlag, 1992.
[H] O. HIJAB, "Minimum Energy Estimation", Ph.D. Dissertation, Univ. of Calif., Berke-
ley, 1980.
[JB] M. R. JAMES - J. S. BARAS, Nonlinear filtering and large deviations: A PDE-control theoretic approach, Stochastics 23 (1988), 391-412.
[JBE] M. R. JAMES - J. S. BARAS - R. J. ELLIOTT, Risk-sensitive control and dynamic games
for partially observed discrete-time nonlinear systems, IEEE Trans. Auto. Control 39 (4) (1994), 780-792.