A NNALES DE L ’I. H. P., SECTION B
L IMING W U
A deviation inequality for non-reversible Markov processes
Annales de l’I. H. P., section B, tome 36, n
o4 (2000), p. 435-445
<http://www.numdam.org/item?id=AIHPB_2000__36_4_435_0>
© Gauthier-Villars, 2000, tous droits réservés.
L’accès aux archives de la revue « Annales de l’I. H. P., section B » (http://www.elsevier.com/locate/anihpb) implique l’accord avec les condi- tions générales d’utilisation (http://www.numdam.org/conditions). Toute uti- lisation commerciale ou impression systématique est constitutive d’une infraction pénale. Toute copie ou impression de ce fichier doit conte- nir la présente mention de copyright.
Article numérisé dans le cadre du programme Numérisation de documents anciens mathématiques
http://www.numdam.org/
435
A deviation inequality for
non-reversible Markov processes
Liming WU
1Laboratoire de Mathematiques Appliquees et CNRS-UMR 6620,
Universite Blaise Pascal, 63177 Aubiere, France
Article received in 23 September 1998, revised in 25 October 1999 Ann. Inst. Henri Poincaré, Probabilités et Statistiques 36, 4 (2000)
@ 2000 Editions scientifiques et medicales Elsevier SAS. All rights reserved
ABSTRACT. -
Using
thedissipative
criterion ofLumer-Philips
for thecontraction
semigroup,
weget
in this Note a new deviationinequality
for
J~ by
means of thesymmetrized
Dirichlet form. A moreexplicit
version is obtained in the case where thelogarithmic
Sobolevinequality
holds. © 2000Editions scientifiques
et médicales Elsevier SASKey words: Dirichlet forms, Deviation inequality, Logarithmic Sobolev inequality
AMS classification: 60F 10, 60J25
RESUME. - Par le critere de
dissipativite
deLumer-Philips
pourla contractivite de
semigroupes,
on obtient uneinegalite
nouvelle dedeviation pour
J~
via la forme de Dirichletsymmetrisee.
Uneexpression plus explicite
est obtenue dans le cas ou1’inegalite
de Sobolevlogarithmique
est vraie. @ 2000Editions scientifiques
et médicalesElsevier SAS
Mots Clés: Forme de Dirichlet, Inegalite de deviation, Inegalite de Sobolev logarithmique
~ E-mail: wuliming@ucfma.uni-bpclermont.fr.
1. Let
(Q , (Xt)tER+, (Px)xEE)
be a conservativecàdlàg
Markov process with values in a Polish space
E,
withsemigroup
oftransition
probability (Pt (x, dy)).
We assume that ~c is aprobability
measure on E
(equipped
with the Borel 0152-fieldB),
which is invariant andergodic
withrespect
to(P~).
For any initial measure v onE,
write We denoteby (£, Dp(£))
thegenerator
of(Pt) acting
onLP (E, p) (Dp(£) being
its domain inLP),
where 1~
p +00. Thesymmetrized
Dirichlet
form
isgiven by
.where
(-, ’)~
is the usual innerproduct
inL2(E,
Under the
assumption
belowits closure
(~D(~)) corresponds
to asymmetric
Markovsemigroup
Given a measurable function V : E -
R , p-integrable.
In this notewe are interested to the
probability
of deviation of theempirical
meanfrom its real
(or asymptotic)
mean m :=i.e.,
Introduce
for every r E R
(Convention :
inf 0 :== As iseasily
seen,Jv
is aconvex function on R. Then
[Jv (interior)
is some interval(a, b) where -~ a b #
+00.Define now
Iv
as the lower semi-continuous(l.s.c.
inshort) regular-
ization of
Jv. Obviously Iv (m)
=Jv (m)
= 0 andIv :
R -~[0, +00]
isconvex. Then
Iv
isnon-decreasing
on[m, +00)
andnon-increasing
on437
L. WU / Ann. Inst. Henri Poincare 36 (2000) 435~45
( - oo , m ] .
Notice that when ab,
then for any r ER,
When our Markov process
(Xt)
is~-reversible (or ( Pt )
isp-symmetric),
Deuschel and Stroock
[4,
Theorem5.3.10,
p.210] (1989) proved essentially
thefollowing large
deviation estimation(where
ageneral
level-2
large
deviation lower bound isgiven)
t
for V bounded. For
general
unboundedV, (4)
is shown in[7] ( 1993).
In this little note we propose to extend and
strengthen (4).
Our mainobservation is
THEOREM 1. - Assume
(H 1 ).
For any initial measure v such thatv « and
d~
EL 2 (~t,c),
wehave for
all t >0,
all r >0,
Remark 2. - In the
symmetric
case, the deviationinequality (5)
issharp
in itsexponent
forlarge time t, by (4).
The main differences between(4)
and(5)
are:(i)
Thesymmetry assumption required
in(4)
is removed for(5);
(ii)
In(5), t
and r,being arbitrary,
are fixed unlike in(4)
which isonly
anasymptotic
relation(t
2014~+00).
Hence(5)
is much morestronger
andpractical.
However in the
non-symmetric
case,inequality (5)
is nolonger asymptotically
exact. Infact,
when the level-2large
deviationprinciple
of Donsker-Varadhan holds and V is
bounded,
the limit(4)
isgiven by
a contraction form of the Donsker-Varadhan
entropy functional,
which. is different from the
expression
in terms of Dirichlet form. See Deuschel and Stroock[4, Chapter VI]
and Ben Arous and Deuschel[ 1 ] ( 1994).
Nevertheless that last
large
deviation resultrequires quite
restrictive conditions in thenon-symmetric
case: indeed there existgeometrically ergodic
irreducible Markov processes so that the level-1large
deviationprinciple
fails(see Bryc
and Smolenski[2] (1993)).
While the deviationinequality (5) requires only (HI),
which is satisfied in the mostpart
ofinteresting
cases. Moreover(HI)
can be removed in case that V isbounded,
see Remarks3 (a)
below.2. Proof of Theorem 1. Consider the
Feynman-Kac semigroup
where
f >
0 is B-measurable. We shall establish for any-integrable
function V : E -
R,
where
and
Here
v
Let us see
quickly why (8) implies (5), by
a very classicalargument
borrowed from the Cramer theorem[4].
In fact setP(A) := ~(~V),
VÀ ER.
By Chebychev’s inequality,
for all r, t > 0fixed,
439
L. WU / Ann. Inst. Henri Poincare 36 (2000) 435~45
It remains to
identify
theexponent
in the last term of( 10).
Since Àm
by
the definition(9),
m is a sub-differential ofP (h)
at À = 0. Thus for r >0,
which is the
Legendre
transformationP* (m + r)
ofP(~).
On the other
hand,
we haveby (9)
for all 03BB E R. Hence the famous
Fenchel-Legendre
theoremgives
usSubstituting
those into(10),
weget (5).
Applying (5) to - V,
weget (6).
Consequently
to conclude thistheorem,
it remains to show(8).
Wedivide its
proof
into three cases.Case 1. - V bounded. In this bounded case is a
strongly
continuous
semigroup
of boundedoperators
onL2 (~,c),
whosegenerator
isexactly (£
+V ; D2(£
+V)
=D2(£)) by
the well knownFeynman-
Kac formula.
By
the definition(9)
ofA ( V) ,
That means
exactly
that thegenerator ~+V2014~(V)
with domainD2 (£)
is a
dissipative operator
onL2 (E, ~c)
in the sense of Lumer andPhilips [9, Chapter IX,
p.250]. By
theLumer-Philips
Theorem[9, Chapter IX,
p.
250],
thesemigroup generated by
£ + V -A(V)
iscontractive on
L 2 ( E , /~).
In otherwords,
which is
exactly (8).
Case 2. - V upper bounded
(V a). Considering
V - a if necessary,we can assume
V ~
0. TakeVn
=max{ V, -n }
for n e N. We haveby
theCase
1,
Recall that
where
Fn : L2 (E,
-[0, +00]
isgiven by
By
Kato[5,
p.461,
Lemma3.14a]
and ourassumption (HI), Fn
islower semicontinuous on
L2 (E,
withrespect
to thestrong topology,
then with
respect
to the weaktopology L2) (since Fn, being
the sum of two
nonnegative quadratic forms,
is convex onL2 (E,
Moreover,
since the unit ball{ f E 1 }
iscompact
withrespect to a(L2, L2), by
anelementary analytical
lemma(see
e.g.[8, Proposition 1.2]),
Substituting
it into(12),
weget (8) again.
Case 3. - General case. Take
V N
=min{V, N}
for N E N.By
themonotone convergence
theorem,
441
L. WU / Ann. Inst. Henri Poincare 36 (2000) 435-445
where the third
inequality
follows from the Case2,
and the lastequality
follows from the fact that
D( £(5)
nL °° (,~)
is a form core for all~~v N > 1,
and for the notnecessarily
closablequadratic
form~.
The
proof
of(8)
and then that of Theorem 1 are so finished. 0 Remark 3. -(a)
When V isbounded,
it holds thatwhere
without the
assumption (HI)
about theclosability
of(~~ , D2CC)), by
theproof
in the Case 1 above. As in theproof
of(8)
=*(5) above,
one candeduce from
( 13)
the deviationinequalities (5)
and(6)
without(HI),
butwith
Iv
substitutedby
the l.s.c.regularization Is
ofWhen
(HI)
is satisfied and V isbounded, =11 (~, V ) ,
VX E R(by
the fact that
D2 (£), being
a form core of£(5 ,
is so for because of the boundednessof V),
and then/~ =1 v .
’
(b)
Note also thefollowing (indicated by
thereferee):
theinequality (8) implies
notonly (5)
and(6),
but also(with
the sameargument)
(c) Applying
theLumer-Philips
theorem to ~ 2014 V in with1 ~ p +00, we
get,
instead of(8),
that for any Vbounded,
where
3. In this
paragraph
we do notrequire (HI)
but we assume thelog-
Sobolev
inequality
below: there exists C > 0 such that for allf
ED2(£),
Consider the
log-Laplace
transformation of V - m :and its
Legendre
transformationBy
the classical Cramèr’s theorem[4],
H* governs thelarge
deviationprinciple
of the i.i.d. sequence of common lawThe
following
result says that thelog-Sobolev inequality ( 14) implies
a same
type
of estimation as in the i.i.d. case.COROLLARY 4. - Assume
( 14) (not (HI)). Then for
any V EIn
particular for
each initial measure v « ~.~, with~~~
ELZ(~)
andfor
allr > 0, t > 0
"
Proof. -
The deviationinequality ( 17)
follows from( 16) by Cheby-
chev’s
inequality
as in Theorem 1. To show thekey (16),
assume at firstthat V is bounded.
443
L. WU / Ann. Inst. Henri Poincare 36 (2000) 435~45
By ( 13 )
in Remark3,
we havewhere the last
equality
follows from Donsker-Varadhan’s variational formula(see
e.g.[8]).
Now for V
unbounded,
setVn
=min{max{ V, -n {, n } .
We haveby
the bounded case shown above and the dominated convergence(and
Fatou’s lemma if the lastintegral
isinfinite). (16)
is henceestablished. D
Remark 5. - Ledoux
[6] (1999) develops systematically
the so calledHerbst method which consists to derive deviation
inequalities
from alog-Sobolev inequality.
Thestrategy
consists toapply
alog-Sobolev inequality
toeÀF
to obtain a differentialinequation,
from which a controlon
EeÀF
is deducedby comparison
lemma. Nevertheless for thatstrategy
works here for FV (Xs) ds,
we should assume that alog-Sobolev inequality
on thepath
space(D([0, t ] , E), Pv) holds,
which is ingeneral
not the case here.
Even in case that such a
path
levellog-Sobolev inequality holds,
itseems that the Herbst method does not
give directly
better estimation than( 17).
Forinstance,
let(Bt )
be the Brownian motion on a Riemannian manifoldE,
withgenerator A/2,
where A is theLaplace-Beltrami
operator.
Assume that the Ricci curvature satisfies for allu E
O ( E ) (the
bundle of orthonormal frames onE). By Capitaine-Hsu-
Ledoux
[3, (6)],
thepath level log-Sobolev inequality
below holds:for any x E E and F :
C([0, t ] ; E) -
Rprovided
that theright
side termabove is
finite,
where is the norm in the Cameron-Martinsubspace
of the Malliavin derivative DF on the
path
space. Now the Herbst methoddeveloped
in[6, § 2. 3 ] yields: if cr 2 , Px -a. s.,
thenUsing
the notations of[3],
we caneasily
prove that for F =J~
ds= +00
[
is the Rie-mannian norm of the
gradient
of V atx ),
We then obtain
by ( 19),
That estimation is
quite interesting
andsharp
forsmall t,
but not sofor
large
t. On the otherhand,
when E iscompact,
thelog-Sobolev inequality ( 14)
holds(a
well knownfact),
then( 17)
is valid and itgives
amuch better estimation than
(20)
forlarge
t.Our
approach
inCorollary
4 consists toapply log-Sobolev inequality
after
obtaining
the control(in
Theorem1 ),
notbefore,
unlike in the Herbst method. One can
regard
it as anotherapplication
of
log-Sobolev inequality, complementing
thoseamply developed by
Ledoux
[6].
ACKNOWLEDGEMENTS
This work is done when the author visits the Center of
Mathematics,
Academie Sinica
during May-June
1998. I amgrateful
to Professor MaZhi-Ming
for his kind invitation. The warmhospitality
of that Center isacknowledged. My
thanks goespecially
to the referee for his careful comments on the first version of this note.445
L. WU / Ann. Inst. Henri Poincare 36 (2000) 435-445
REFERENCES
[1] Ben Arous G., Deuschel J.D., The rate function of hypoelliptic diffusions, Comm.
Pure Appl. Math. XLVII (6) (1994) 843-860.
[2] Bryc W., Smolenski W., On the convergence of averages of mixing sequences, J. Theoret. Probab. 6 (3) (1993) 473-484.
[3] Capitaine M., Hsu E.P., Ledoux M., Martingale representation and a simple proof of logarithmic Sobolev inequality on path spaces, Elect. Comm. Probab. 2 (7) (1997).
[4] Deuschel J.D., Stroock D.W., Large deviations, Pure and Appl. Math. 137 (1989).
[5] Kato T., Perturbation Theory for Linear Operators, 2nd ed., Springer, Berlin, 1984;
(2nd corrected printing).
[6] Ledoux M., Concentration of measure and logarithmic Sobolev inequalities, in:
Séminaire de Probab. XXXIII, Lecture Notes in Math., Vol. 1709, Springer, 1999, pp. 120-216.
[7] Wu L.M., Feynman-Kac semigroups, ground state diffusions and large deviations,
J. Funct. Anal. 123 (1) (1994) 202-231.
[8] Wu L.M., An introduction to large deviations, in: Yan J.A., Peng S., Fang S., Wu L.
(Eds.), Several Topics in Stochastic Analysis, Academic Press of China, Beijing, 1997, pp. 225-336; (in chinese).
[9] Yosida K., Functional Analysis, 3rd edn., Springer, 1971.