S (S )
M ICHEL L EDOUX
Concentration of measure and logarithmic Sobolev inequalities
Séminaire de probabilités (Strasbourg), tome 33 (1999), p. 120-216
<http://www.numdam.org/item?id=SPS_1999__33__120_0>
© Springer-Verlag, Berlin Heidelberg New York, 1999, tous droits réservés.
L’accès aux archives du séminaire de probabilités (Strasbourg) (http://portail.
mathdoc.fr/SemProba/) implique l’accord avec les conditions générales d’utili- sation (http://www.numdam.org/conditions). Toute utilisation commerciale ou im- pression systématique est constitutive d’une infraction pénale. Toute copie ou im- pression de ce fichier doit contenir la présente mention de copyright.
Article numérisé dans le cadre du programme Numérisation de documents anciens mathématiques
http://www.numdam.org/
AND LOGARITHMIC SOBOLEV INEQUALITIES
MICHEL LEDOUX
INTRODUCTION 123
1. ISOPERIMETRIC AND CONCENTRATION
INEQUALITIES
1261.1 Introduction 126
1.2
Isoperimetric inequalities
for Gaussian and Boltzmann measures 1271.3 Some
general
facts about concentration 1342. SPECTRAL GAP AND LOGARITHMIC SOBOLEV
INEQUALITIES
1392.1 Abstract functional
inequalities
1392.2
Examples
oflogarithmic
Sobolevinequalities
1452.3 Herbst’s argument 148
2.4
Entropy-energy inequalities
and non-Gaussian tails 1542.5 Poincare
inequalities
and concentration 1593. DEVIATION
INEQUALITIES
FOR PRODUCT MEASURES161
3.1 Concentration with respect to the
Hamming
metric 1613.2 Deviation
inequalities
for convex functions 1633.3 Information
inequalities
and concentration 166 3.4Applications
to bounds onempirical
processes 171 4. MODIFIED LOGARITHMIC SOBOLEVINEQUALITIES
FORLOCAL GRADIENTS 173
4.1 The
exponential
measure 1734.2 Modified
logarithmic
Sobolevinequalities
1784.3 Poincare
inequalities
and modifiedlogarithmic
Sobolevinequalities
1795. MODIFIED LOGARITHMIC SOBOLEV
INEQUALITIES
INDISCRETE SETTINGS 182
5.1
Logarithmic
Sobolevinequality
for Bernoulli and Poisson measures 182 5.2 Modifiedlogarithmic
Sobolevinequalities
and Poisson tails 1885.3
Sharp
bounds 1906. SOME APPLICATIONS TO LARGE DEVIATIONS AND TO
BROWNIAN MOTION ON A MANIFOLD 193
6.1
Logarithmic
Sobolevinequalities
andlarge
deviation upper bounds 193 6.2 Some tail estimate for Brownian motion on a manifold 1947. ON REVERSED HERBST’S
INEQUALITIES
AND BOUNDS ONTHE LOGARITHMIC’SOBOLEV CONSTANT 199
7.1 Reversed Herbst’s
inequality
1997.2 Dimension free lower bounds 204
7.3
Upper
bounds on thelogarithmic
Sobolev constant 2057.4 Diameter and the
logarithmic
Sobolev constant for Markov chains 209REFERENCES 214
The concentration of measure
phenomenon
was put forward in the seventiesby
V.D. Milman in the local
theory
of Banach spaces. Ofisoperimetric inspiration,
it is ofpowerful
interest inapplications,
inparticular
inprobability theory (probability
inBanach spaces,
empirical
processes,geometric probabilities,
statisticalmechanics... )
One main
example
is the Gaussian concentration property which expressesthat,
whenever .4 is a Borel set in IRn of canonical Gaussian measure
y(.4) > 2 .
for everyr > O.
- ,,~,(‘,~r)
) 1-e-r ~2
where .4r is the r-th Euclidean
neighborhood
of A. As rincreases,
theenlargement
Ar thus gets very
rapidly
a measure close to one. This Gaussian concentration property can be describedequivalently
on functions. If F is aLipschitz
map on IRn with L for every r ~0,
~y (F > f Fd~y + r~ e-r2~2.
Together
with the sameinequality
for-F,
theLipschitz
function F is seen to beconcentrated around some mean value with very
high probability.
Thesequantitative
estimates are dimension free and extend to
arbitrary
infinite dimensional Gaussian measures..A.ssuch, they
are a main tool in thestudy
of Gaussian processes andmeasures.
Simultaneously, hypercontractive
estimates andlogarithmic
Sobolevinequali-
ties came up in quantum field
theory
with the contributions of E. Nelson and L.Gross. In
particular,
L. Grossproved
in 1975 a Sobolevinequality
for Gaussianmeasures of
logarithmic
type.Namely,
for all smooth functionsf
on1R n ,
f2log f2d03B3- f2d03B3 log f2d03B3
~ 2|~f|2d03B3.This
inequality
isagain independent
of the dimension andproved
to be a substituteof the classical Sobolev
inequalities
in infinite dimensionalsettings. Logarithmic
Sobolev
inequalities
have been usedextensively
in the recent years as a way tomeasure the
smoothing properties (hypercontractivity)
of Markovsemigroups.
Inparticular, they
are a basicingredient
in theinvestigation
of the time toequilibrium.
One of the
early questions
onlogarithmic
Sobolevinequalities
was to determine which measures, onIRn, satisfy
aninequality
similar to the one for Gaussian mea- sures. To thisquestion,
raisedby
L.Gross;
I. Herbst(in
anunpublished
letter to L.Gross)
found thefollowing
necessary condition:if
is aprobability
measure suchthat for some C > 0 and every smooth function
f
onthen,
.oc
for every a
C. Furthermore,
for anyLipschitz
function F on R~ withand every real
À,
’e03BBFd ~
e03BBFd +C03BB2/4.
By
asimple
use ofChebyshev’s inequality;
thepreceding
thus relates in an essential way to the Gaussian concentrationphenomenon.
Herbst’s result was mentioned in the
early eighties by
E. Davies and B.Simon.
and has been revived
recently by
S.Aida,
T. Masuda and I.Shigekawa.
It was fur-ther
developed
and refinedby
S.Aida,
S.Bobkov,
F.Gotze,
L.Gross.
O.Rothaus.
D. Stroock and the author.
Following
these authors and theircontributions,
theaim of these notes is to present a
complete
account on theapplications
oflogarith-
mic Sobolev
inequalities
to the concentration of measurephenomenon.
Weexploit
Herbst’s
original
argument to deduce from thelogarithmic
Sobolevinequalities
somedifferential
inequalities
on theLaplace
transforms ofLipschitz
functions.According
to the
family
of entropy-energyinequalities
we aredealing with,
these differentialinequalities yield
various behaviors of theLaplace
transforms ofLipschitz
functionsand of their concentration
properties.
Inparticular,
the basicproduct
property of entropy allows us toinvestigate
with this tool concentrationproperties
inproduct
spaces. The
principle
is rathersimple minded,
and as such convenient forapplica-
tions.
The first part of this set of notes includes a introduction to
isoperimetry
and concentration for Gaussian and Boltzmann measures. The second part then presents
spectral
gap andlogarithmic
Sobolevinequalities,
and describes Herbst’s basicLaplace
transformargument.
In the third part, weinvestigate by
this method deviation and concentrationinequalities
forproduct
measures. While concentrationinequalities
do notnecessarily tensorize,
we show thatthey actually
follow from strongerlogarithmic
Sobolevinequalities.
We thus recover most of M.Talagrand’s
recent results on
isoperimetric
and concentrationinequalities
inproduct
spaces. Webriefly
mention there the information theoreticinequalities by
K. Marton which pro- vide an alternateapproach
to concentration also based on entropy, and which seemsto be well suited to
dependent
structures. We thendevelop
thesubject
of modifiedlogarithmic
Sobolevinequalities investigated recently
injoint
works with S. Bobkov.We examine in this way concentration
properties
for theproduct
measure of the ex-ponential distribution,
as well as, moregenerally,
of measuressatisfying
a Poincareinequality.
In the nextsection,
theanalogous questions
for discretegradients
areaddressed,
withparticular emphasis
on Bernoulli and Poisson measures. We then present someapplications
tolarge
deviation upper bounds and to tail estimates forBrownian motion on a manifold. In the final part, we discuss some recent results
on the
logarithmic
Sobolev constant in Riemannian manifolds withnon-negative
Ricci curvature. The last section is an addition of L. Saloff-Coste on the
logarithmic
Sobolev constant and the diameter for Markov chains. We
sincerely
thank him forthis contribution.
It is a
pleasure
to thank theorganizers (in particular
M.Scheutzow)
and theparticipants
of the "Graduierten-kolleg"
course which was held in Berlin in Novem- ber 1997 for theopportunity
to present, and to prepare, these notes. These noteswould not exist without the collaboration with S. Bobkov which led to the concept of modified
logarithmic
Sobolevinequality
and whosejoint
work form most of Parts4 and 5. Thanks are also due to S.
Kwapien
for numerousexchanges
over the yearson the
topic
of these notes. D. Piau and D. Steinsaltz were veryhelpful
with theircomments and corrections on the
manuscript.
In this first part, we present the Gaussian
isoperimetric inequality
as well as aGaussian type
isoperimetric inequality
for a class of Boltzmann measures with asufficiently
convexpotential. Isoperimetry
is a natural way to introduce to the con-centration of measure
phenomenon.
Forcompletness,
we propose a rathershort.
self-contained
proof
of theseisoperimetric inequalities following
the recent contri-butions
[Bob4], [Ba-L].
Let us mention however that our firstgoal
in these notes isto
produce simpler,
more functional arguments to derive concentrationproperties.
We then present the concentration of measure
phenomenon,
and discuss a few of itsfirst
properties.
1.1 Introduction
The classical
isoperimetric inequality
in Euclidean space states that among all sub- sets with fixed finitevolume,
balls achieve minimal surface area. Inprobabilistic,
and also
geometric, applications
one is often interested in finite measure space, suchas the unit
sphere
Sn inequipped
with its normalized invariant measure o-~.On
(geodesic) balls,
or caps, areagain
the extremal sets, that is achieve minimal surface measure among sets with fixed measure.The
isoperimetric inequality
on thesphere
was usedby
V. D. Milman in theearly
seventies as a tool to prove the famousDvoretzky
theorem on Euclidean sec-tions of convex bodies
(cf. [Mi], [M-S]). Actually,
V. D. Milman isusing
theisoperi-
metric property as a concentration property.
Namely,
in itsintegrated version,
theisoperimetry inequality
states that whenever~" (A) _ (jn(B)
where B is a ball on5’",
for every r >0,
(1.1)
where Ar
(resp. Br)
is theneighborhood
of order r of A(resp. B)
for thegeodesic
metric on the
sphere. Since,
for a set A on S’~ with smoothboundary 9A,
the surfacemeasure
Q~
of aA can be describedby
the Minkowski content formula as03C3ns(~A) = lim
inf 1 [03C3n(Ar) - 03C3n(A)],
(1.1)
iseasily
seen to beequivalent
to theisoperimetric
statement.Now,
the measureof a cap may be estimated
explicitely.
Forexample,
if~n (.4) > 2 .
it follows from(1.1)
thatQ"(Ar)
> 1 -(1.2)
for every r ~ 0.
Therefore,
if the dimension islarge, only
a small increase of r(of
the order of
ln )
makes the measure ofAr
close to 1. In a sense, the measure o~"is concentrated around the equator, and
(1.2)
describes the so-called concentration of measurephenomenon
of Onesignificant
aspect of this concentrationphe-
nomenon is that the
enlargements
are not infinitesimal as forisoperimetry.
and thatemphasis
is not on extremal sets. These notes willprovide
asample
of concentrationproperties
with the functional tool oflogarithmic
Sobolevinequalities.
1.2
Isoperimetric inequalities
for Gaussian and Boltzmann measuresIt is well known that uniform measures on n-dimensional
spheres
with radiusJfi approximate (when projected
on a finite number ofcoordinates)
Gaussian measures(Poincare’s lemma).
In this sense, theisoperimetric inequality
onspheres gives
riseto an
isoperimetric inequality
for Gaussian measures(cf. [Le3]).
Extremal sets arethen
half-spaces (which
may be considered as balls with centers atinfinity). Let,
more
precisely, /* = ~y"
be the canonical Gaussian measure on IRn withdensity
(2~)-n~2
with respect to
Lebesgue
measure. Define the Gaussian surface measure of a Borelset A in as
03B3s(~A) =
lim inf 1 r [03B3(Ar)-03B3(A)] (1.3)
where Ar =
{x
EIRn; dz (x, A) r}
is the r-Euclidean openneighborhood
of A.Then,
if H is ahalf-space
in that is H ={x
EIRn; (x, u) a},
where = 1and a E and
if 1’(A) = 1’(H),
thenLet
~(t) _ (2~r)-1~2 +oo~,
be the distribution function of the canonical Gaussian measure in dimension one andlet 03C6
= 03A6’. Then03B3(H) = 03A6(a)
and
~y~(8H)
=y(a)
sothat,
~y~(8A) >
= c~ o~-1 (~y(A)). (1.4)
Moreover, half-spaces
are the extremal sets in thisinequality.
In thisform,
theGaussian
isoperimetric inequality
is dimension free.In
applications,
the Gaussianisoperimetric inequality
is often used in its inte-grated
version.Namely,
if~y(A) _ ~y(H) _ (a) (or only ~(A) > ~(a)), then,
forevery r ~
0,
=
~(a
+r) . (1.6)
In
particular,
if~y(A) > z (= ~(0)),
’Y(Ar) > ~(r) > 1- e r2~2. (1.6)
To see that
( 1.4) implies (1.5),
we may assume,by
asimple approximation,
that Ais
given by
a finite union of open balls. Thefamily
of such sets .4 is closed under theoperation
A -+Ar, r >
0.Then,
the lim inf in( 1.3)
is a true limit.Actually,
theboundary
8A of A is a finite union ofpiecewise
smooth( n - I )-dimensional
surfacesin 1Rn and
~y~(8A)
isgiven by
theintegral
of the Gaussiandensity along
aA withrespect to
Lebesgue
measure on 8A.Now, by (1.4),
the functionu(r)
= ~’1o^j(.4r),
r ~
0,
satisfiesv’(r) = 03B3s(~Ar) 03C6 o 03A6-1(03B3(Ar))
~1
so that
v(r)
= v(0)+r0u’(s)ds ~v(0)+r,
which is(1.5). (Alternatively,
see[Bob3].)
The Euclidean
neighborhood
Ar of a Borel set .4 can be viewed as the Minkowskisum A +
rB2
={a
+rb;
a EA,
b EB2 }
withB2
the Euclidean open unit ball.If ^~
is any
(centered)
Gaussian measure on1R n, B2
has to bereplaced by
theellipsoid
associated to the covariance structure of ~y. More
precisely,
denoteby
r = M tM thecovariance matrix of the Gaussian measure / on 1R n.
Then;
is theimage
of thecanonical Gaussian measure
by
the linear map lyI = Set J~ =M(~2).
Then,
if~y(A) > ~(a),
for every r ~0,
~ ~
~y(.4 + rJ~) > ~(a + r). (1.7)
In this
formulation,
the Gaussianisoperimetric inequality
extends to infinite dimen-sional
(centered)
Gaussian measures, the set JCbeing
the unit ball of thereproducing
kernel Hilbert
space 1-£ (the
Cameron-Martin space for Wiener measure forexample).
Cf.
[Bor], [Led3].
To see moreover how
(1.6)
or(1.7)
may be used inapplications,
let forexample
X = be a centered Gaussian process indexed
by
some, forsimplicity,
count-able parameter set T. Assume that SUPtET
Xt
oo almostsurely.
Fix tl, ... , tn in T and consider thedistribution
of thesample (X~1, .... Xtn ).
Choose m finite such thatIP{supt~TXt ~ m} ~ 1 2.
Inparticular,
ifA = {max Xti m } ,
then
03B3(A) ~ 1 2. Therefore, by (1.7) (with
a =0),
for every r ~0,
y ~A + rJC) > ~(r) > 1- e-r2/2.
Now,
forany h
in x =M(B2 ),
max hi
~ max( M2ij)1/2
= max(IE(X2ti))1/2
by
theCauchy-Schwarz inequality,
so thatXt;
~m+r ’Set (1 =
(It
iseasily
seen that (1 isalways
finite under theassumption supt~TXt
~. Let indeed m’ be such thatIP{supt~TXt m’}
>4 .
Then, if 03C3t
=(IE(X2t))1/2, m’ 03C3t ~ 03A6-1(3 4)
>0.)
It follows from thepreceding
thatmax
Xt;
m +03C3r} ~
1-e-r2/2.
By
monotone convergence, andtaking complements,
for every r ~0.
m +
a~r? e-r2/2, (1.8)
tET
This
inequality
describes the strongintegrability properties
of almostsurely
boundedGaussian processes. It
namely implies
inparticular (cf. Proposition
1.2below)
thatfor every a
2~ ,
IE(exp(03B1(supt~T Xt)2))
~.( 1.9)
Equivalently,
in alarge
deviationformulation,
lim
1 r2 log IP{sup Xt
~r} = -
1 203C32.(1.10)
(The
lower bound in(1.10)
isjust
thatIP{supXt ~ r} ~ IP{Xt ~ r} = 1-03A6( r 03C3t
) ~ e-r2/203C32c 203C0(1+(r/03C3t))for
every t
E T and r >0.)
Butinequality (1.8) actually
contains more infor-mation than
just
thisintegrability
result.(For example,
if Xn is a sequence of Gaussian processes asbefore,
and if we let =supt~TXnt, n
EIN,
then- 0 almost
surely
as soon as -> 0 0 where(1n =
(1.8)
describes asharp
deviationinequality
in termsof two parameters, m and cr. In this sense, it
belongs
to the concentration of measurephenomenon
which will beinvestigated
in these notes(cf.
Section1.3).
Note that(1.8), (1.9), (1.10)
holdsimilarly
withsupt~TXt replaced by
supt~T |Xt|(under
theassumption
suptETIXtl
oc almostsurely).
The Gaussian
isoperimetric inequality
was established in 1974independently by
C. Borell[Bor]
and V. N. Sudakov and B. S. Tsirel’son[S-T]
on the basis of theisoperimetric inequality
on thesphere
and Poincare’s lemma. Aproof using
Gaussiansymmetrizations
wasdeveloped by
A. Ehrhard in 1983[Eh].
We present here a short and self containedproof
of thisinequality.
Ourapproach will
be functional. Denoteby U
= p o ~-1 the Gaussianisoperimetric
function in(1.4).
In a recentstriking
paper, S. Bobkov
[Bob4]
showed that for every smoothenough
functionf
withvalues in the unit interval
[0,1],
f dy / du (1.11) >
where
IV II denotes
the Euclideanlength
of thegradient ~f
off.
It iseasily
seen that(1.11)
is a functional version of the Gaussianisoperimetric inequality (1.4). Namely.
if
(1.11)
holds for all smoothfunctions,
it holds for allLipschitz
functions withvalues in
[0,1].
Assumeagain
that the set A in(1.4)
is a finite union of non-empty open balls. Inparticular, ~y(8A)
= 0.Apply
then(1.11)
tofr(x) = (1- r d2(x, A))+
(where d2
is the Euclidean distancefunction).
Thenf r
-~IA
andLI ( f r)
--~ 0 almosteverywhere
since~(8A)
= 0 andU(0)
= = 0.Moreover, ~~ Irl
= 0 on A andon the
complement
of the closure ofAr,
andeverywhere.
Note that thesets
8( Ar)
are of measure zero for every r ~ 0. Thereforeu(03B3(A)) ~
lim inf |~fr|d03B3 ~lim inf1 r[03B3(Ar)-03B3(A)]
=03B3s(~A).
To prove
(1.11),
S. Bobkov first establishes theanalogous inequality
on thetwo-point
space and then uses the central limittheorem,
very much as L. Gross in hisproof
of the Gaussianlogarithmic
Sobolevinequality [Grl] (cf.
Section2.2).
Theproof
below is direct. Our main tool will be the so-called Ornstein-Uhlenbeck orHermite
semigroup
with invariant measure the canonical Gaussian measure ~. For everyI,
in L1(~y)
say, setPtf(x)
=IRnf(e-t/2x
+(1 - e-t)1/2y)d03B3(y)
, x ~ IRn , t ~ 0.(1.12)
The operators
Pt
are contractions on all and aresymmetric
and invari-ant with respect to ~y. That
is,
for anysufficiently integrable
functionsf
and g., andevery t > 0, f fPtgd03B3
= Thefamily (Pt)t>o
is asemi group (PsoPt
=Pd+t).
Po
is theidentity
operator whereasPt f
converges in L2(~y)
towards~
as t tendsto
infinity.
All theseproperties
areimmediately
checked on thepreceding integral representation
ofPt together
with theelementary properties
of Gaussian measures.The infinitesimal generator of the
semigroup
that is the operator L such thatd dtPtf = PtLf = LPtf,
acts on all smooth functions
f
on 1Rnby
Lf(x) - Z ~f (x) - Z (x~
In other
words,
L is the generator of the Omstein- Uhlenbeck diffusion process(Xt)t>o,
the solution of the stochastic differentialequation dXt
= dBt -!Xtdt where (Bt)t>0
is standard Brownian motion in 1R".Moreover,
theintegration by
parts formula for L indicates
that,
forf and g
smoothenough
on= ~
.(1.13) )
Let now
f
be a fixed smooth function on IR" with values in[0, 1].
Itmight
actually
be convenient to assumethroughout
the argument that0 ~ - f -1- t
and let then 6: tend to zero. Recall U == ~ o
$"~.
To prove(1.11)
it will beenough
to show that the function
J(t)
=is
non-increasing
0.Indeed,
if this is the case.~(0), which, together
with the
elementary properties
of Pt recalledabove,
amounts to(1.11).
Towardsthis
goal,
we firstemphasize
the basic property of the Gaussianisoperimetric
func-tion U that will be used in the argument,
namely
that U satisfies the fundamental differentialequality
MY~ = 20141(exercise).
We now haveTo ease the
notation,
writef
forPtf.
We also setK(f)
=~(/)
+Therefore,
il = ~ -201420142014
+(V/, V(L/))] d~. (1.14)
For
simplicity
in theexposition,
let us assume that the dimension n is one, thegeneral
casebeing entirely similar, though notationally
a little bit heavier.By
theintegration by
parts formula(1.13),
,= -1 2 1 K(f)[u’2(f)-1]f’2d03B3
+ 1 2uu’(f)f’ K(f)3/2[uu’(f)f’+f’f"]d03B3
where we used that ~~ = 20141 and that
~(//
= +(/’)’
= +2~~. (1.15)
In order to handle the second term on the
right-hand
side of(1.14),
let us note that.
(Vf, V(L/)} = ~ ~(r - = -~~
+Hence, again by
theintegration by
parts formula(1.13),
andby (1.15),
,= -1 2f’2 K(f)d03B3-1 2f"2 K(f)d03B3
+ 1 2f’f" K(f)3/2[uu’(f)f’ + f’f"]d03B3.
Putting
theseequations together,
weget,
after somealgebra.
dJ dt = -1 2 1 K(f)3/2 [u’2(f)f’4 - 2uu’(f)f’2f" +u2(f)f"2] d03B3
and the result follows since
L(’2 f ) f’4 - 2Llu’ ( f )f ’2 f " +u2 (f )f m
= _ 0.The
preceding proof
of the Gaussianisoperimetric inequality
came up in thejoint
work[Ba-L]
with D.Bakry.
The argument isdeveloped
there in an abstractframework of Markov diffusion generators and
semigroups
andapplies
to alarge
classof invariant measures of diffusion generators
satisfying
a curvatureassumption.
We present here this result for some concrete class of Boltzmann measures for which a Gaussian-likeisoperimetric inequality
holds.Let us consider a smooth
(Cz say)
function W on IRn such thate-w
is inte-grable
with respect toLebesgue
measure. Define the so-called Boltzmann measure as theprobability
measure=
where Z is the normalization factor. As is
well-known. ,
may be described as the invariant measure of thegenerator
L= 2 ~ - 2 ~W .~
V.Alternatively,
L is thegenerator
of the Markovsemigroup
of theKolmogorov
process X = solution of the stochastic differentialLangevin equation
~
dXt
=The choice of
W(x)
with invariant measure the canonical Gaussian measurecorresponds
to the Ornstein-Uhlenbeck process. Denoteby W"(x)
the Hessian of Wat x E .
Theorem 1.1. Assume
that,
for some c >0, ~~(a*) ~
cId assymmetric matrices, uniformly
in x E 1R n.Then,
whenever A is a Borel set in ]R n with,u(A) > ~(a),
forany
r > 0,
>_
As in the Gaussian case, the
inequality
of Theorem 1.1 isequivalent
to itsinfinitesimal version
with the
corresponding
notion of surface measure and to the functionalinequality
u
(fd03B3)
~u2(f) + 1 c |~f|2 d03B3
which is the result we established in the
proof
as before. Beforeturning
to thisproof,
let us comment on the Gaussian aspect of the theorem. Let F be aLipschitz
map on R" with
Lipschitz
coefficientB/c. Then,
theimage
measure ~ of/~
by
F is a contraction of the canonical Gaussian measure on R.Indeed,
we mayassume
by
some standardregularization procedure
that v isabsolutely
continuouswith respect to
Lebesgue
measure on IR with astrictly positive density.
Setv(r)
=so that the measure v has
density
v’. For r ~1R, apply
Theorem 1.1,or rather its infinitesimal
version,
to A ={.F ~ r}
to getv’(r). Then, setting k
= 03BD-1 o $ and x = 03A6-1 ov(r), k’(x) ~
1 so that 03BD is theimage
of thecanonical Gaussian measure on IR
by
the contraction k. Inparticular,
in dimensionone, every measure
satisfying
thehypothesis
of Theorem 1.1 is aLipschitz image
ofthe canonical Gaussian measure.
Proof of
Theorem ~J. It isentirely
similar to theproof
of the Gaussianisoperimet-
ric
inequality
in Section 1.1. Denote thusby
the Markovsemi group
with generator L =~A - ~VW’
V. Theintegration by
parts formula for L readsfor smooth functions
f
and g. Fix a smooth functionf
on IR" with0 ~
1. As in the Gaussian case, we aim to showthat,
under theassumption
onW,
J(t)
is
non-increasing
in t ~ 0.Remaining
as before in dimension one for notationalsimplicity,
the argument is the same than in the Gaussian case with nowK(f)
=~(/)
+ so thatSimilarly,
(v/, V(Lf))
=/(~ r - ~ = -~ ~7’’
+f’Lf’.
Hence, again by
theintegration by
partsformula,
1 K(f)~f, ~(Lf)>d03B3
=-1 2 W"f’2 K(f)d + f’ K(f)Lf’d
= -
1 2W"f’2 K(f)
d - 1 2f"2 K(f)d
~/~[~~~~]~
In the same way, we then get
~ "S / x(7)!?
- ~~(~-’)b~~-
Since
W~ ~
c, the conclusion follows. Theproof
of Theorem 1.1 iscomplete.
D1.3 Some
general
facts about concentrationAs we have seen in
(1.6),
onecorollary
of Gaussianisoperimetry
is that whenever A is a Borel set in IRn with~y(A) > 2
for the canonical Gaussian measure 03B3,then,
for every r >
0,
’"((Ar)
>_ 1-e r2/2. (1.16)
In other
words, starting
with a set withpositive
measure.{ 2 here),
its(Euclidean) enlargement
orneighborhood
gets veryrapidly
a mass close to one(think
for exam-ple
of r = 5 or10).
We described with(1.2)
a similar property onspheres.
While trueisoperimetric inequalities
areusually quite
difficult toestablish,
inparticular
iden-tification of extremal sets, concentration
properties
like(1.2)
or(1.16)
aremilder;
and may be established
by
avariety
of arguments, as will be illustrated in thesenotes.
The concentration of measure
phenomenon,
put forward mostvigorously by
V.D. Milman in the local
theory
of Banach spaces(cf. [Mi], [M-S]),
may be described forexample
on a metric space(X, d) equipped
with aprobability
measure , on the Borel sets of(X, d).
One is then interested in the concentration functiona{r)
=~(~4r)~
?2 ~~
r >0,
where
Ar
={x
EX ; d(x,.4) r}.
As a consequence of(1.16), a(r) e-r2/2
in caseof the canonical Gaussian measure 03B3 on 1R n with respect to the Euclidean metric.
The
important
feature of this definition is that several measures, as we will see, do have very small concentration functionsa(r)
as r becomes"large" .
We willmainly
beinterested in Gaussian
(or
at leastexponential)
concentration functionsthroughout
these notes. Besides Gaussian measures, Haar measures on
spheres
were part of the firstexamples (1.2). Martingale inequalities
alsoyield family
ofexamples (cf.
[Maul], [M-S], [Ta7]).
In thiswork,
we will encounter furtherexamples,
inparticular
in the context of
product
measures.The concentration of measure
phenomenon
may also be described on functions.Let F be a
Lipschitz
map on X with~~F~~LIp
1(by homogeneity)
and let m be amedian of F for ~c.
Then,
since~(F m) ~ 2,
and{F :5 m}r
C{F:5
m +r},
wesee that for every r >
0,
JJ(F 2: m + r) a{r). (1.17)
When such an
inequality holds,
we willspeak
of a deviationinequality
for F. To-gether
with the sameinequality
for-F,
~c(~F - r) 2a(r). (1.18)
We then
speak
of a concentrationinequality
for F. Inparticular,
theLipschitz
map F concentrates around some fixed mean value m with a
probability
estimatedby
a.According
to the smallness of a as rincreases,
F .may be considered asalmost constant on almost all the space. Note that these deviation or concentration
inequalities
on(Lipschitz)
functions areactually equivalent
to thecorresponding
statement on sets. Let A be a Borel set in
(X, d)
with> 2 .
SetF(x)
=d(x, .4)
where r > 0.
Clearly (~F~~Lip
1 whileJJ(F
>0)
=d{x, A)
>0)
_ 1-- 2 .
Hence,
there is a median m of F which is 0 andthus, by (1.17),
.. 1-
~(A,.) ~(F
>r) a(r). (1.19)
In the Gaussian case, for every r ~
0,
~y(F >
m +r) e r~~2 (1.20)
when 1 and
’Y(F ~
m +r)
_for
arbitrary Lipschitz functions, extending
thus thesimple
case of linear functions.These
inequalities emphasize
the two main parameters in a concentration property,namely
some deviation or concentration value m, mean ormedian,
and theLipschitz
coefficient of F. An
example
of this typealready
occured in(1.8)
which maybe shown to follow
equivalently
from(1.20) (consider F(x)
= Asa consequence of Theorem
1.1, if ~
is a Boltzmann measure withW"(x) >
cId forevery x E
IRn,
and if F isLipschitz
with we getsimilarly
that for everyr ? 0,
m +
r e-r2~2c. 1.21
Although
this last bound covers aninteresting
class of measures, it is clear that itsapplication
isfairly
limited. It is therefore of interest toinvestigate
newtools,
otherthan
isoperimetric inequalities,
to derive concentrationinequalities
forlarge
familiesof measures. This is the task of the next
chapters.
It
might
be worthwhile to note that while we deduced thepreceding
concentra-tion
inequalities
fromisoperimetry,
one may alsoadapt
thesemigroup
arguments togive
adirect, simpler, proof
of theseinequalities.
To outline the argument in case of(1.20),
let F on IR" be smooth and such that = 0 and 1. For fixed A EIR,
setH(t)
-f
where(Pt)t>o
is the Ornstein-Uhlenbecksemigroup (1.12).
SinceH(oo)
=1,
we maywrite, for-every t > 0,
H(t) = 1 - ~tH’(s)ds
= 1 - 03BB~t(LPsFe03BBPsFd03B3)ds
= 1
+ 03BB2 2~t(|~PsF|2e03BBPsFd03B3)ds
by
theintegration by
parts formula(1.13).
Since~~F~~Lip
1, 1 almosteverywhere,
so that=
le-"/2 P..(V’F)12 :5
e-’almost
everywhere. Hence, for t > 0,
H(t)
~ 1+ 03BB2 2
~te-sH(s)ds.
By
Gronwall’slemma,
H(0) =
To deduce the deviation
inequality (1.20)
from thisresult, simply apply Chebyshev’s inequality:
for every À E 1R and r >0,
’
’Y(F
>r) e-~r+a2/2 Minimizing
in A(A
=r) yields
’Y(F
>r)
where we recall that F is smooth and such that
Fd03B3
= 0 and 1.By
asimple approximation procedure,
we therefore getthat,
for everyLipschitz
functionF on IRn such that 1 and all r ~
0,
~y (F >
+r) e-ra ~2 . ( 1.22 )
The same argument would
apply
for the Boltzmann measures of Theorem 1.1 toproduce (1.21)
with the mean instead of a median. We note that this directproof
of
(1.22)
is shorter than theproof
of the fullisoperimetric inequality.
Inequality (1.22)
may be used toinvestigate
supremum of Gaussian processesas
(1.7)
or( 1.20). As before,
let(Xt ) tET
be a centered Gaussian process indexedby
some countable set
T,
and assume that suptETXt
oc almostsurely.
Fix ti, ... , tn and denoteby
r = M tM the covariance matrix of the centered Gaussiansample (Xtl, ... , ,Xtn).
Thissample
thus has distribution Mx undery(dx).
LetF(x) =
x E lRn. Then F is
Lipschitz
with-
)) 1/2 ~ 03C3
where u =
Therefore, by (1.22),
for every r >0,
P{ ~l~t~n ’ ~
maxXt;
>1E(
maxXt;)
"/ + J -e-r2 12. (1.23)
~Similarly
for-F,
1P ~
maxXt $ae(
- maxXt; ) - Qr }
Choose now m such that
IP{supt~TXt ~ m} ~ 1 2
and ro such thate-r20/2 z .
Then