A NNALES DE L ’I. H. P., SECTION B
P ATRICK C ATTIAUX
C HRISTIAN L ÉONARD
Minimization of the Kullback information of diffusion processes
Annales de l’I. H. P., section B, tome 30, n
o1 (1994), p. 83-132
<http://www.numdam.org/item?id=AIHPB_1994__30_1_83_0>
© Gauthier-Villars, 1994, tous droits réservés.
L’accès aux archives de la revue « Annales de l’I. H. P., section B » (http://www.elsevier.com/locate/anihpb) implique l’accord avec les condi- tions générales d’utilisation (http://www.numdam.org/conditions). Toute uti- lisation commerciale ou impression systématique est constitutive d’une infraction pénale. Toute copie ou impression de ce fichier doit conte- nir la présente mention de copyright.
Article numérisé dans le cadre du programme Numérisation de documents anciens mathématiques
Minimization of the Kullback information of diffusion processes
Patrick CATTIAUX and Christian
LÉONARD
Universite de Paris-Sud,
Département de Mathematiques, Bat. n° 425,
U.A.-C.N.R.S. n° 743,
Equipe
de Modelisation stochastique et statistique,91405 Orsay Cedex, France
Vol. 30, n° 1, 1994, p. 83-132. Probabilités et Statistiques
ABSTRACT. - In this paper we
compute
anexplicit expression
for therate function of
large
deviations for the measure valuedempirical
processwhere the
Xi’s
areindependent copies
of a diffusion process in This is doneby minimizing
the relativeentropy (Kullback information)
of aProbability
measureQ
withrespect
to the law P ofXi
when allmarginals
of
Q
are fixed. The finiteness of the rate function is connected with the existence of conservativediffusions,
with ageneral
diffusion matrix. These diffusion processes are constructed in verygeneral
cases.Key words : Large deviation, relative entropy, Kullback information, conservative diffusion process, Follmer measure, Girsanov transformation.
RESUME. - Dans cet
article,
nous donnons une formuleexplicite
de lafonction de taux des
grandes
deviations du processusempirique
a valeursmesures
Classification A.M.S. : Primary 60 F 10, 60 J 60; secondary 60 G 44, 60 H 05, 60 J 57.
Annales de l’Institut Henri Poincaré - Probabilités et Statistiques - 0246-0203
84 P. CATTIAUX AND C. LEONARD
ou les
Xi
sont descopies independantes
d’un processus de diffusion dans(~d.
Ce programme est realise en minimisantl’entropie
relative(information
de
Kullback)
d’uneprobability Q
parrapport
a la loi P commune auxXi,
sous la contrainte des loismarginales temporelles
deQ
fixees. Lafinitude de la fonction de taux est mise en relation avec l’existence de diffusions conservatives pour des matrices de diffusion
generales.
Cesprocessus de diffusion sont construits dans une
grande generalite.
0. INTRODUCTION
Let
Q = C ([0, T],
be the space of all continuouspaths
with valuesin
Rd
endowed with the uniform norm, and P be the law of aRd-valued
diffusion process considered as a random variable on Q.We denote
by
M(Q)
andM 1 (SZ)
the sets of all measures and allProbability
measures on Q
equipped
with its Borela-algebra.
Let us consider asequence 1 of
independent copies
ofX,
P °° the law of this sequenceon the infinite tensor
product
and the randomempirical
measures onM ~ (Q)
N
where
~Z
is the Dirac measure atpoint
z. Sanov’s theorem states that thelaws of
1)
under P 00obey
alarge
deviationprinciple
in the space M(Q)
endowed with its usual weaktopology,
the rate function of whichis the Kullback information
I (., P)
defined for anyQ
E M(Q) by
More
precisely,
ifA
andA°
are the closure and the interior of the setA,
for any Borel subset A of M
(Q)
(see [DS]
orNow,
let us consider the sequence ofM1
continuous pro-cesses
Since the
mapping
where is the
Q-law
ofX (t),
is continuouswhen C([0, T], M 1 (f~d))
is endowed with thetopology
of the weak convergence of all thet-marginal
lawsuniformly
on[0, T],
the contractionprinciple provides
uswith the
following large
deviationprinciple.
For any Borel subset B ofC ([0, T], M 1 (f~d)),
we havewhere for any v in C
([0, T], M 1 (p~a))
In
[DaG],
D. A. Dawson and J. Gartner havecomputed
anexplicit expression
for I(v)
in the case where P is the law of anonhomogeneous
diffusion process
on Rd
withgenerator
assuming
that andb = (bi)
arelocally
Holder continuous and that thesymmetric
matrix a isstrictly positive
definite. Thisexpression
is thefollowing (at
leastformally):
where J.10 = P 0
Xo 1
is the initial law of theprocess, ~
is the set of Schwartztest functions on V denotes the
gradient operator,
A*(t)
is the formaladjoint
of A(t),
v is the time derivative of v in the sense of Schwartz distributions the firstbracketting ~ . , . ~ being
theduality (~, ~’)
andthe second one
(. / . ) being
the usual scalarproduct
inf~d.
In the
special
case where P is the Wiener measurestarting
from yo, H. Follmer([Fol]) proposed
an alternateproof
of the aboveresult,
underthe
assumption
that themarginal
flow is admissible in the86 P. CATTIAUX AND C. LEONARD
sense that there exists
QEM1 (Q)
such that andH (Q, P) 00.
Inaddition,
heproved
that for an admissible flow the infimum in the definition ofI(v)
is attained for a Markov lawQ*,
thedrift
(with respect
toP)
of whichbeing
characterized in terms of the flow v. His constructiveproof
relies on an extension of a result of I. Csiszar[Cs]
on the minimization of the Kullback information under linear con-straints.
By
means of the samemethod,
M. Brunaud[Bru]
obtained similarresults for a
uniformly elliptic
diffusion with smooth coefficients(also
see[Mi]).
The main reason for theseassumptions
is theemployment
of a timereversal
argument
whichyields
theapproximation
ofQ* by piecewise
h-processes
(or Schrödinger bridges).
It should be remarked that the notion of admissible flow isstrongly
related to the "conservative diffusions" of E. Carlen[Ca],
which appear in the stochasticinterpretation
ofquantum
mechanics.Our aim is to recover the results of
[DaG], [Fo I ], [Bru]
and to extendthem to a
large
class of Markov processes. In thepresent
paper, we consider the case of diffusion processes in(~d
withgenerator A (t),
wherea and b are
supposed
to belocally
boundedmeasurable,
and we allow ato be
degenerate.
Since we want extend the methods and results to alarger
class of processes, we decided not to use the usual time reversal of diffusion processes. We are here
mainly
interested inapplication
tolarge
deviationtheory (see [Leo]),
so, some of the naturaldevelopments
in the directionof stochastic mechanics are not treated.
They
are moresystematically
studied in
[CP] (also
see[Pe]).
Let us now
present briefly
theorganization
and contents of the paper.Section 1 is devoted to the introduction of the main notation.
In Section 2 we
give
someelementary properties
of the relativeentropy
H(Q, P)
we will usethroughout
the paper. Inparticular
we indicate in(2.6)
asimple
but efficient trick to build diffusion processes withgiven marginals
under anentropy
condition.The main result of Section 3 is Theorem
3 .1,
which tells that one canassociate to any
Q,
such that H(Q, P)
isfinite,
a MarkovProbability
measure
Q
with the samemarginals
and such thatH (Q, P) __ H (Q, P).
Therefore to solve the minimization
problem,
it is sufficient to restrict ourattention to Markov
Probability
measures.Actually
Theorem 3 .1 is moreprecise.
Indeed we prove that the "natural Markovian version" ofQ (see
§
3(3.4)
for thedefinition)
is aProbability
measure with the samemarginals
and smaller relativeentropy.
Theproof
of Theorem 3.1 is somewhat intricate because we have to useseveral approximation
proce-dures. An alternate
proof
of this result can be obtainedby
the methods of Section4,
but we think that the onepresented
here(which
isonly
based on stochastic calculus
arguments)
can be useful for other purpose.In Section 4 we
give
some conditions for agiven
flow to be admissible.Let v be a flow of
Probability
measures on(~d.
A Borel function B is saidto be of v-finite energy if
The results of the section are then the
following:
Assume that H
(vo, Ilo)
oo and that v satisfies the weak forward equa- tion for A(t)
+ a B.V,
for some B of v-finite energy. If one of thefollowing
conditions is satisfied
*
(H 1)
and bareC2,1,03B1 vector fields,
*
(H 2) a
isstrictly positive definite, a
and b arelocally
Holder continu- ous,*
(H 3) dvt/ dllt
islocally bounded,
then v is admissible.
This result extends
previous
results of Carlen[Ca], Zheng [Zhe]
andother
people.
As was saidbefore,
we refer to[CP]
for a morecomplete
discussion.
Finally
in Section 5 we describe the set of all MarkovProbability
measures associated to an admissible flow
(Theorem 5. 3),
and derive in Theorems 5. 9 and 5 . 20 theexplicit
formula for I(v), extending [DaG], [Fol]
and[Bru].
We want to underline that a
large part
of the ideas we areusing
in thisarticle,
arealready contained,
at least inimplicit form,
in the papers of[DaG], [Fo 1], [Mi], [Zhe], [AN]
or[DS1],
inparticular
cases of sometimes withincomplete proofs (see
the comments at the end of Section5).
Wethink that one
originaglity
of thepresent
paper is that itclearly
establishesa link between some of these papers, in
particular
itgives
apurely large
deviationsinterpretation
of the Nelson’s processes which is verysatisfactory
from thephysical point
of view(for
a directapproach
of theconstruction of Nelson processes
by large
deviationsmethods,
see[CL2]).
Another very
interesting large
deviationsapproach
of these processes isgiven
in[RZ].
As the reader will see, the
proofs
we have tried togive
are oftenindependent
of thespecific
form of thegenerator.
This allows tothink
that the contents of this paper are still available in a moregeneral (Markov)
context. This will be the aim of a future work([eLl]).
ACKNOWLEDGEMENTS
We wish to thank F. Comets for
pointing
out to us Lemma 3.16 and for his kind interest in this work. We also thank G. BenArous,
M. Brunaud and H. Follmer for very useful discussions. We
finally
thankan anonymous referee for
pointing
out an error in the first version of this paper and for his veryinteresting
comments.88 P. CATTIAUX AND C. LEONARD
1. PRELIMINARIES
We introduce here some definitions and notations that will hold
through-
out the paper.
Let
(Q, ~ )
be a measurable space. We denoteby M(Q) (resp. Mi (Q))
the set of all
positive
measures(resp. Probability measures)
on Q.For
Q
and P elements ofM 1 (Q), H (Q, P)
denotes the relativeentropy
ofQ
withrespect
toP,
definedby:
Notice that
H (Q, P) >__ 0,
theequality implying Q
= P(Jensen’s inequality)
The
following
result is well known:(1 . 2)
IfQ
and Pbelong
toM 1 (Q)
thenis the set of real valued bounded measurable
functions,
defined on Q.
Furthermore,
if Q is a Polish spaceequipped
with its Borel a-fieldiF,
one can
replace,
in(1 . 2), ~b (Q) by Cb (Q),
the space of bounded continu-ous functions.
In the paper, Q will be C
([0, T], f~d),
the space of all continuouspaths
with values in
(~d,
Tbeing
apositive (but finite)
realnumber; (Xt)t E
jo T~will be the canonical
process, ~
the Borel a-field of’
Let
P EM 1 (Q)
begiven.
(1. 3)
DEFINITION. - Let ~o, T~ be aflow of Probability
measures onRd.
We denoteby A03BD
theset {Q~M1 (Q)
s. t.Q.X-1t = vt for
allt E [o, T]},
and by Av,H
s. t.H (Q, P) + oo ~ .
The flow
will be called admissiblei, f ’
is non void.We
adopt
classical notation for the usual spaces of FunctionalAnalysis.
In
particular
thesubscript
o will denote functions withcompact support,
C~
is the space of k-times differentiable functions which are bounded with bounded derivatives up to anincluding
order k and we omit thetarget
space when it is R.
Our aim is to
study
for agiven
flow v, when P is the law of adiffusion process with initial law
(E M 1 (f~d))
andgenerator
A(t) given
as:
Here we assume that
( 1. 5 . i )
the coefficientsand bi belong
to lo~((~d
XR),
the set oflocally
bounded Borelfunctions,
( 1. 5 . ii)
the matrix a =(aij)
is nonnegative symmetric,
i. e. for all x EMore
precisely
we make thefollowing assumptions:
Let
Qt
= C([0, T], f~d
XR)
be thetime-space,
with additionnal time coor-dinate denoted
by
ut.The canonical space
iF, X., M)
isequipped
with astrong
Markovfamily (Px, u)(x,
u) E Rd x R ofProbability
measures, such that:In
particular
noexplosion
occurs up to andincluding
time T.The
(formal) generator
of(Px,u)
is thenA’= 2014 + A (u).
If
strong uniqueness
occurs,hypotheses ( 1. 6)
are satisfied.Also notice that when A does not
depend
upon the timecoordinate,
this coordinate
plays
no role nor inuniqueness
orextremality.
According
to[Jac,
Thm13.55],
the lastrequirement
in(1.6)
isequiva-
lent to:
(1.7) Px, u
is an extremal solution to themartingale problem M (A’, C~’ ~ 1 (~a
xR), ~x, u) (see
notation in[Jac,
ch.13]).
Now for
0~M1 (Rd)
we define( 1 . 8)
P’ =(bo
is the Dirac measure at0).
In addition we assume that P’ is an extremal solution of
( 1 . 6)
withinitial condition Notice that for
all s,
P’ a. s.Before
going further,
we have to make some remarks about the time- space process.90 P. CATTIAUX AND C. LEONARD
Let 03C0 be the
projection operator
ofRd
x R ontoRd.
To any random variable y defined on Qcorresponds y’
= defined onQt.
To P’ corre-sponds
itsprojection
on Q denotedby
P. We denoteby Px
theprojection
Of
P x,
0.To
any Q « P corresponds Q’
defined onS2 by Q’ = ( dQ dP.03C0) P’.
Of course relative
entropy
andspatial marginals
arepreserved.
We shall now recall some basic facts related to the Girsanov transform
theory
and to the Föllmer measure.If we
put
then sn
is alocalizing
sequence ofstopping
times forMt.
Let be
absolutely
continuous withrespect
toP,
andThanks to the
extremality assumption
in(1. 6),
it follows from theorems12. 17,
12.34 and 12.48 of[Jac]
that there exists af~d-
valuedprevisible
process such that if weput
where
( . / . )
denotes the euclidian scalarproduct
in~d,
thedensity
process
Z.
ofQ
withrespect
to P isgiven by
theexponential
formula:Furthermore one can choose a continuous version of
Z
such thatZr
= 0if
t > Tn
for all n. We shall sometimes writeZ ([3,
vo,P)
if confusions arepossible.
Now
(1.12) Tn-+Too=T-Q
a. s.,N,=M,-
is a continuousd-dimensional
Q-local martingale admitting sn
AT"
as alocalizing
sequence ofstopping
timesand ( N’, M’ B.
’
Conversely,
if[3S
is aprevisible
process, one can defineTn
and vo,P)
as in
( 1. 10)
and( 1 . 11 ).
ThenZ.
is anonnegative
P-localmartingale
andwe can choose a version which is a
supermartingale.
Onceagain
we canchoose a continuous version of Z and
Zr
= 0 if t >Tn
for all n(for
all theseresults see
[Jac chap.
8 andchap.
12.3§c]).
We can associate to this
supermartingale
its Follmer measure([Fo]).
Tothis
end,
we must consider the spaceQ~
ofexplosive trajectories
withexplosion time ç
whichbelongs
to[0, T] U {
+( 1.13)
NOTATION. - The Follmer measureQ
defined onQ~
associatedto the
P-exponential supermartingale Z (P,
vo,P)
will be called the(P,
vo,P)-FM.
If(3S
= B(XS, s)
for a Borel functionB,
we write B insteadof
P.
More
generally
the Follmer measure associated to anonnegative
P-supermartingale
Z will be called the(Z, P)-FM.
Recall that the
(P,
vo,P)-FM Q
satisfies thefollowing equality ( 1 . 14)
For allstopping
time T, and allf7’t
measurable FActually,
one deduces from whatprecedes
thatFinally,
if for a Borel functionB,
Z is amultiplicative
functional. It follows that the
(B,
vo,P)-FM
is a MarkovProbability
measure on
Q,
as soon asQ (~ + oo) = 0. By
Theorem 24. 36 in[Sha],
itis in fact
strong
Markov.2. SOME PROPERTIES OF THE RELATIVE ENTROPY In this section we shall discuss some
elementary properties
related tothe finite
entropy
condition.(2 .1 )
PROPOSITION. - Assume thatQ
is aProbability
measure withQ
P.Let
(3S
denotes itsdrift.
ThenProof (cf. [F6 2]). -
We may assume that H(vo, Jlo)
is finite. LetQ~
be defined
by Qn
=ZTn
P.By
Novikov criterion one knows thatQn
is aProbability
measure. FurthermoreH (Qn, P)
isfinite,
and thesetwo measures are the same in restriction to
~ Tn.
Thus92 P. CATTIAUX AND C. LEONARD
But, according
to(1.12), Jo is a Q-martingale
with zero
expectation.
It follows:
Notice that the lim inf of
ZT
is P almostsurely equal
toZT
becauseZS =
0 ifs> Tn
for all n, andrecall
thatTn --+ T, Q
a. s.Since the function x
log (x)
is bounded frombelow,
one canapply
Fatou’s lemma and obtain
The conserve
inequality
is obtainedby taking
the supremum over n, in theright
hand side of thefollowing
chain of relations:Indeed we may assume that H
(Q, P)
is finite. Sincelog (dQn/dP) belongs
to
L 1 (Qn)
it alsobelongs
toL 1 (Q)
and(2 . 2)
makes sense. DThe next
Proposition completes
thepreceding
one.(2. 3)
PROPOSITION. - LetQ be the (j3,
vo,P)-FM (see (1 . 13))
wherea
previsible
process and va aProbability
measure such that H(vo,
+ oo .bility
measure on S2 with H(Q, P)
+ oo,T~
= T -Q
a. s. and(2 .1)
holds.Proof. - Applying (1.14)
with’t=Tn
andF== 1,
weget
a. s.Thus
T ~ __ ~ - Q
a. s. FurthermoreQ
andQn = ZTn
P are the same inrestriction to Hence
This
implies
thatZTn
is a bounded sequence in the Orlicz spaceL~* (P)
with
i* (x) _ (x + 1) log (x + 1 ) - x.
Inparticular ZT
isuniformly integra-
ble. Since this sequence goes P a. s. to we
get, according
to(1 . 15),
In view of what
precedes
it is natural toput
thefollowing
definition:(2.4)
DEFINITION. -(i)
LetQ
be aProbability
measure on Q. We denoteby ~Q the following
space:= y.; Rd-valued
measurable process s. t.(ii)
Let ~o~ T~ be aflow of Probability
measures on We denoteby
thefollowing
space:~ X
[0, T]) measurable,
with values inf~d
s. t.The
quadratic
forms introduced in(2.4)
define hilbertian seminorms onthe spaces
22.
(2.5)
We shall denoteby LQ
andL~
the factor spaces(22/Ker form)
which are Hilbert spaces. If y
(resp. cp) belongs
toLQ (resp. L~ )
we shallsay that y
(resp. cp)
is of finiteQ-energy (resp. v-energy).
Putting together (2 .1 )
and(2. 3),
weget
thefollowing trick,
which weshall use several times in the
sequel.
(2 . 6)
Trick. - Let B be a Borel function defined andvt (o _ t -- T)
be a flow ofProbability
measures such thatH (vo, yo)
+ ~. DefineZ_ (B,
vo,P)
as in( 1 . 11)
with(3S
= B(XS, s),
anddenote
by Q
the(B,
vo,P)-FM.
Assume that(i) Energy
condition. -s)/a (x, s)
B(x, s))
vs(dx)
ds + oo, i. e. B is of finite v-energy,(ii)
Domination. - For allnonnegative f
in~b ((l~d
and all t in[0, T],
Then
by
the monotone convergence theoremii )
extends to anynonnegative
measurable
f,
so that94 P. CATTIAUX AND C. LEONARD
Thanks to
(2 . 3), Q
is then aProbability
measure onQ,
H(Q, P)
+ 00and
Too
= TQ
a. s. But now(ii ) implies
that for all t.In other
words,
the finite energy conditiontogether
with the domination relationimpose
thatQ
has theprescribed marginals
vt. This idea appears in aslightly
different form inZheng’s
paper[Zhe].
We
complete
this section with a useful remark on the behaviour ofmartingales
in connection with anentropy
condition.(2.7)
PROPOSITION. - LetQ
andQ* be
twoProbability
measures on Q such thatH (Q, Q*)
+00. LetSt
be a boundedQ*-martingale.
ThenSt
isa
Q-semi martingale
withdecomposition St
=Kt
+Vt,
whereKt
is aL2 Q- martingale with ~
K)t = ~
S)t
andEQ [~
K~T]
+ oo .Proof. - Everything
is well knownexcept (perhaps)
thatEQ [ (
S)T]
+ oo whichimplies
thatKt
is a(true) Q-martingale.
Let C bea bound for
S.. Applying
theBurkholder-Davis-Gundy inequalities
weget
It follows fromStirling
formula thatif C (4 e)-1~2
thenHence
(SB belongs
to the Orlicz spaceL,(Q*)
withT(~-)=~-x-l.
But dQ belongs
toL,.(Q*)
with sincedQ*
H(Q, Q*)
+ ~. Holder’sinequality
in Orlicz spaces leads to the desiredresult,
since T and T* areLegendre conjugate.
In thegeneral
case it sufficesto divide
S by
alarge enough
constant. D(2.8)
Remarks. -1)
The aboveproof
shows that there exists a universal constant K such that for allQ, Q*
and S as in(2.7)
2) = Z belongs
to someU (Q*)
with~>1,
thenbelongs dQ *
T g
(Q )
q ~~ ~T
gto all the
LP (Q)
and with( 1 /q) + ( 1 /~*) =1.
3. ENTROPY AND THE MARKOV PROPERTY Let us first discuss the
following
easy exercise.Let be the
Lebesgue
measure on the unit square S =[0, 1] x [0, 1].
Consider twoProbability
measures vo(dx)
and v~ on[0, 1] such
thatH (vo, dx)
andH (vi, dy)
are finite. Then among all Proba-bility
measures v on S such thatH(v,X)
is finite andv ~ y -1= v 1 ),
the one which minimizes the relativeentropy
withrespect
to03BB is 03BD0 ~ 03BD1.
If we
replace (S, ~,, (vo, v 1 )) by (Q, P,
to, for agiven
admissibleflow v., the natural
analogous
statement would be that theProbability
measure
Q
inAv, H
which minimizes the relativeentropy
withrespect
toP,
has the Markovproperty.
This statement is
actually
true and follows from thefollowing
moreprecise
result.(3.1)
THEOREM. - LetQ be
anyProbability
measure inA~,
H. Then thereexists a
(strong)
MarkovProbability
measureQ
such thatQ
H
(Q, P) ~
H(Q, P). (Here,
strong standsfor
thetime-space
versionof Q, see § 1).
Actually
we shall prove that if P is a(non necessarily extremal)
solutionto the
martingale problem (1 . 6)
and ifQ
isgiven by
thedensity
vo,
P)
of( 1 . 11 ),
a natural Markovian versionQ [see (3 . 4) below]
ofQ satisfy (3.1).
This result is moreprecise
that the one in[F61] ]
or[AN]
which
only
says that theminimizing Probability
measure has the Markovproperty.
Beforeproving (3.1),
we shall firstexplain
what we meanby
a"natural Markovian version of
Q".
Since H
(Q, P)
isfinite, P
ELQ. Thus,
we canapply
Rieszrepresentation
theorem to obtain:
(3 . 2)
there exists a Borel function B such that for all cp EL~ ,
Furthermore B is
unique
inL~.
Of course, this is
nothing
else than aparticular
form of multidimensional conditionalexpectation. Clearly
weget
We choose once for all a version of B. Furthermore we extend B to the whole space x R
by defining
B (x, u) = B (x, T)
fou >_ T
andB (x, u) = B (x, 0)
forDefine
Q’
as in Section1,
a "natural Markovian version ofQ"’
is thenQ’
defined as(3 . 4) Q’
is defined on as the(B,
vo,P’)-FM.
Notice that
Q’
maydepend
on the choice of B.96 P. CATTIAUX AND C. LEONARD
According
to the final remark of Section1, Q’
is thenstrong
Markov.We shall now discuss some
questions
related to thisQ’.
Define
Tn, Sn and in
as follows -Once
again
in -T, Q’
a. s.[by (3. 3)].
Let us define the processGt by:
where
Nt
is defined in( 1.12).
Of course Us = s forall s, Q’
a. s.Q’ (i~
=T)
=l,
thusGt
isQ’
a. s. definedby
theexponential
formula andwe can choose a continuous version of
Gt
which is anonnegative Q- supermartingale.
(3 . 7) Q"
is then defined onQt, ç
as the(G, Q’)-FM.
Q"
does notdepend
of our choice of B.According
to(3.3), Therefore, Q’~Q".
Now defineZt
as:Since
EP’ [Zt
lim infG~n ~n~
=EQ’ [lim
infGTn
=0,
it holdsZ
is aP’-supermartingale
andQ"
isactually
the(Z, P’)-FM.
Notice that,
Z
is notnecessarily
continuous(there
is ajump
to 0 at time and non-multiplicative.
Inparticular Q" (03BE =
+ and notequal
ingeneral. Zt
is smaller orequal
to thesupermartingale Z (B,
vo,P).
Indeedthey
may differ on theset ~ sup
in t sup whereZt
= 0 while Z(B,
vo,P) >__
o. As a consequence it is thus clear thatQ’ Q’
and thatRemark that
’t 00 = T - P’
a. s. if andonly
ifQ’
and P’ areequivalent (assuming
here that P’ is an extremalsolution,
see[Jac]).
IfQ’ (ç =
+oo)
= 1then
Q’ - Q’.
Butalso,
ifQ" (ç
+oo)
=0,
i. e.Q"
is aProbability
measureon
nt,
thenZt
is amartingale
and so isZ (B,
vo,P).
Thisimplies easily
that
Z (B,
vo,P)
andZ
coincide. HenceQ’ = Q"
andQ’-Q’.
.
In
general
oneonly
hasQ’ Q’.
Q
will denote theprojection Q’ 0 1t (see § 1).
We shall show that
Q
andQ (thus Q’
andQ’)
have samemarginals.
Q’
will thus be aProbability
measure onQt
and theinequality H (Q, P)
H(Q, P)
will then follow from(3. 3)
andproposition
2. 3.Proof of (3 . 1). - Step
1. - Assume that(3 .11. a) a
and b arebounded,
so that(3 .11. b)
M is aL2 (P)-(true) martingale,
and assume furthermore that(3 .11. c) ~ __ K,
for all t in[0, T].
Under these
assumptions
the process Z(P,
vo,P)
is a Pmartingale
whichbelongs
to anyLP(P),
Here are some other consequences:(3 .12 . a) N,
is aL2 (Q) (true) martingale.
(3 .12 . b)
One can choose for B a version ofEQ x dt s)],
sothat
(3.12. c)
Since Q is a Polish space we know that for almost all s, It follows from(3 .12 . b)
thatGt
is aQ’-martingale
whichbelongs
to anyLP (Q’), 1 p +00,
so thatQ’ = Q"
areProbability
measuresequivalent
to P’. Define
N
is then aL2 (Q) martingale.
FurthermoreQ = GT Q,
whereGt
=EQ
isgiven by
Hence for
f E ~b (f~d)
and tE ] 0, T]
thefollowing
holds:It remains to show that the second term of the sum vanishes. This result would be clear if one could
exchange
the stochasticintegral
and the98 P. CATTIAUX AND C. LEONARD
conditional
expectation EQ [ . ~ XJ (stochastic
conditional Fubini’s theo-rem). Indeed, according
to(3 .12 . c),
for allg E ~ ((~d)
and almostall s,
So that
To prove that the stochastic conditional Fubini’s theorem
holds,
we shalluse an
approximation procedure
of the stochasticintegral (which
appears p. 23 of Mac Kean’s book "Stochasticintegrals").
We thank FrancisComets for
pointing
out this result to us.(3.18)
LEMMA. - Putt~ = it 2 - n, i = 0, ... , 2n.
Leths
be aprevisible
process in
L2 (Q x dt) (with hs = ho for s ~ 0).
Thenwhere the limit is taken in
L2 (Q).
Proof of
the Lemma. - Themapping
is continuous from
L2(Q
xdt)
intoitself,
with normequal
to 1.Furthermore if h is constant on each
dyadic
interval oflength 2 - k,
thenfor
n ?_ k,
hand h" may differ on at most2 - k
intervals oflength
2 - n(the
left side of the
dyadic
intervals at level2 - k).
Hence for such an hFinally
if h is aprevisible
process inL2 (Q
xdt),
we canapproximate
h inL2(Q
xdt) by
a sequence ofhdyad.
Sincewe
immediatly
see that hn converges to h inL2 (Q
xdt).
Remark that h" is left continuous(will right limits)
and henceprevisible.
which proves that hn converges to h in
L.Q (i.
e. L"(N, Q)).
The Lemmanow follows from the
isometry property
of theL 2
stochasticintegral.
DThe
previous approximation procedure
is of course very useful because it does notrequire
any kind ofcontinuity property
on theintegrand.
Recall that the
approximation by
the Riemann sums forinstance, requires
the left
continuity
of thepaths.
We
apply
Lemma(3.18)
withs))
which satisfies thehypotheses
of(3 . 18)
sinceG
iscontinuous,
supGS
is squareintegrable
s
with
respect
toQ,
and(3s -
B(XS, s)
is bounded andprevisible.
Thisyields
But
In the
right
hand term we can now take the conditionalexpectation
withrespect
tov >__ s).
Ofcourse f(Xt)
andNtt
are cx(X", v >_ s)-
measurable for
s ~ t;.
On the otherhand,
sinceQ
is Markovand hs
isf7 s
measurable
Finally
Applying (3 . 17),
we see that each term of the sum in(3 .19)
isequal
to0,
which proves
(3.15)
becomesEQ [, f’ (Xt)]
=EQ [, f’ (Xt)].
Theproof
is finished under theassumptions (3 . 11 ).
DShort comments on
hypotheses {3 . 11 ). -
The time reversalargument
we used in the above
proof (recall
theconditionning)
refrains usfrom
using
a localizationprocedure
withstopping
times. Thisexplains {3 . 11 . a)
and(3 . 11 . b).
Lemma(3.18) requires
to work in aL2
framework. In order to ensure that the Girsanov densities are suffi-
ciently integrable,
it is natural toask 03B2
and B to be bounded.Remark that in view of the definition of
B,
the boundedness of the actionintegral r (Xs’ s)
ds is not sufficient toget
the boundedness ofJo (B (Xs’ s)/a (Xs’ s) B (Xs’ s))
ds. This is the main reason for (3.11. c).
100 P. CATTIAUX AND C. LEONARD
Step
2. - Wekeep (3 .11. a),
butreplace (3.11. c) by
Accordingly Q’ = Q"
andQ’
andQ’
areequivalent (see
the discussionabove).
Define the sequence~is
ofprevisible
processes as~is
=~is
Then
and
obviously [is
converges toBs,
P almostsurely.
Let
Qk
be the(PB
vo,P)-FM. Q~
is aProbability
measure thanks to(3 . 21 . ii )
and to Novikov criterion.Moreover
H (QB P) _ C [see proposition (2.1)].
We shall prove that a
subsequence of Qk.X-1t
converges toIndeed thanks to
(3 . 21 )
one canapply
the bounded convergence theo-rem to
get
It follows that in
L2 (P),
so that we can finda
subsequence
ofZ~j = 20142014 )
that converges P almostsurely
toZt B = 2014- ).
But thanks to (3.21. ii)
" - ~ ...
Thus for all
(f~d),
the sequencef (Xr) Zt
is bounded in all the Lp(P)
and is thus
uniformly integrable.
It follows that for the abovesubsequence
One can prove that in
fact, H (Q, Qk)
goes to 0. Hence this sequence converges toQ
for the variation distance onM 1 (SZ).
Thisprovides
a moredirect
proof
of(3 . 24).
But the idea of the aboveproof
will be used in thesequel.
Notice that in this case, we know thatZt
is adensity
ofProbability.
Since
Z~
converges P a. s. toZt,
the convergence ofQk
toQ
is a conse-quence of Scheffe’s theorem
[Bi].
But the aboveargument
is still true withoutassuming
that Z(or Zk)
is adensity
ofProbability,
and will be used in this situation in the next sections.To
Qk (resp. Q) corresponds Qk (resp. Q)
builded as before.By step
1we know that
Qk
andQk
have samemarginals
isbounded).
But it ismuch more difficult to prove the convergence of to Indeed
and we do not have any uniform bound for
The
key
idea of theproof
will be to use the Trick(2. 6).
Indeed the energy condition(2 . 6 . i )
is satisfied.So,
in order to prove thatQ
is aProbability
measure with
marginals
vt, it suffices to prove the domination relation(3 . 26)
for allnonnegative f E ~b (l~d), EQ f (Xt) EQ (f (Xt)],
whereS ~
is defined in(3 . 5 . ii ).
This will be
proved by showing
that for asubsequence Qk
[Of
course it is asubsequence
of thesubsequence
for which(3 . 24) holds].
It is easy to guess that the
inequality (3.27)
will be obtained thanks to Fatou’slemma,
if we prove the almost sure convergence of some densities.Hence,
first notice thatQ ~ Qk Qk,
thedensity
processesbeing given by:
We shall prove that a
subsequence
ofAt
convergesQ
almostsurely
toGt [ see ( 3 . 6 )],
since heredQ
=G~.
Just as before it isenough
to provedQ ,
r102 P. CATTIAUX AND C. LEONARD
Recall the
Kalman-Bucy
formulanoticing
that(As) -1
1 admitsQk
moments of any order. Forsimplicity
wewrite
For a better
understanding
of thecomputations,
the reader cankeep
inmind that any
quantity
a whose vocation is to tend to 1(for
instancens),
will be written as 1 +
(a- 1).
With the
help
of theinequality
and of theconditional Jensen
inequality,
it holds:Thus
Using (3 . 21. ii),
we see that the abovequantity
is less thanLikewise
(3 . 22)
the first term in(3.31)
goes to 0. Since( 1- As)
is aQ- martingale,
one can use Doob’sinequality
to obtainIt is easy to see that
(a subsequence of) (1- AT)
goesQ
almostsurely
to0 and is
uniformly
bounded(in k)
in all theLP(Q)
for[cf. (3.23)].
Thus the second termof (3. 31)
also goes to 0.We come now to the third and last term. Since sup
As
is alsouniformly
s
bounded in all the
LP (Q),
it suffices to controlEQ [(sup ( 1- n~))4]
and tos
apply Cauchy-Schwarz inequality.
Onceagain
we use the conditional Jenseninequality, Cauchy
Schwarz and Doobinequalities
toget:
Since
(a subsequence of) (AT)- ~
goesQ
almostsurely
to 1 and since allquantities appearing
in theprevious
bound areuniformly
bounded(in k)
in all the
LP (Q),
we deduce that the last term of(3 . 31 )
tends to 0.We have thus
proved
that asubsequence
ofA~
convergesQ
almostsurely
toGt.
Now(3 . 27)
is a consequence of Fatou’s lemma. DShort comments. - The
computations
ofstep
2 aremerely
tedious butunfortunately
we cannot avoid this technicalstep.
Indeed(3.20)
is thenatural
hypothesis
in view of a localizationprocedure
and as wealready said,
it is unsufficient for the firststep techniques
to be available.Step
3. - We shall now remove(3 .11. a)
and(3 . 20). Let 03B2~L2Q
andput
Denote
by Qn
the(~is,
vo,P)-FM
which is aProbability
measure onQ,
withdensity
Zn. B" is definedby
where the conditional
expectation
is taken in the sense of(3.2).
Noticethat
s)
is03BDn ds
almostsurely equal
to 0 where vn denotes the flow of themarginals
ofQ".
So we can choose Bn such thatB" is