• Aucun résultat trouvé

Representation formula for the entropy and functional inequalities

N/A
N/A
Protected

Academic year: 2022

Partager "Representation formula for the entropy and functional inequalities"

Copied!
15
0
0

Texte intégral

(1)

www.imstat.org/aihp 2013, Vol. 49, No. 3, 885–899

DOI:10.1214/11-AIHP464

© Association des Publications de l’Institut Henri Poincaré, 2013

Representation formula for the entropy and functional inequalities

Joseph Lehec

CEREMADE, UMR CNRS 7534, Université Paris-Dauphine, place du maréchal de Lattre de Tassigny, 75016 Paris, France Received 11 June 2010; revised 9 September 2011; accepted 14 November 2011

Abstract. We prove a stochastic formula for the Gaussian relative entropy in the spirit of Borell’s formula for the Laplace trans- form. As an application, we give simple proofs of a number of functional inequalities.

Résumé. On démontre une formule stochastique pour l’entropie relative par rapport à la Gaussienne, dans le genre de la formule de Borell pour la transformée de Laplace. Cette formule donne des preuves simples d’un certain nombre d’inégalités fonctionnelles.

MSC:39B62; 60J65

Keywords:Gaussian measure; Entropy; Functional inequalities; Girsanov’s formula

1. Introduction: Borell’s formula

Letγdbe the standard Gaussian measure onRd: γd(dx)=e−|x|2/2

(2π)d/2dx, where|x| =√

x·xdenotes the Euclidean norm ofx. In [5] Borell proves the following representation formula. Given a standardd-dimensional Brownian motionBand a bounded functionf:Rd→Rwe have

log

Rdefd

=sup

u

E

f

B1+

1

0

usds

−1 2

1

0 |us|2ds

, (1)

where the supremum is taken over all random processes u, say bounded and adapted to the Brownian filtration.

Among other applications, he derives easily the Prékopa–Leindler inequality. The name Borell’s formulamay be unfair to Boué and Dupuis who in an earlier paper [6] obtained a stronger result, allowing the functionf to depend on the whole path(Bt)t∈[0,1](see Theorem9below for a precise statement). Anyway, Borell and Boué–Dupuis agree that representation formulas such as (1) arose much earlier in optimal control theory, particularly in Fleming and Soner’s work [13], and Borell should definitely be credited for bringing these techniques in the context of functional inequalities.

The present article deals with relative entropy. Let(Ω,A, m)be a measured space andμbe a probability measure.

The relative entropy ofμis defined by H(μ|m)=

Ω

dμ dmlog

dμ dm

dm ifμm

(2)

and H(μ|m)= +∞otherwise. It is well known that there is a Legendre duality between relative entropy and logarith- mic Laplace transform:

H(μ|m)=sup

f

fdμ−log

Ω

efdm

. (2)

The purpose of this article is to prove a representation formula for the Gaussian relative entropy, both inRdand in the Wiener space, providing the entropy counterparts of the results mentioned above. All these formulas have a common feature: Girsanov’s theorem. However, our approach is somewhat different from that of Borell and Boué–Dupuis:

it draws a connection with the work of Föllmer [14,15] which makes the whole argument arguably simpler. As an application, we give new, unified and simple proofs of a number of Gaussian inequalities.

2. Representation formula for the entropy

This section contains the main results of the article. Let us recall a couple of classical facts about relative entropy, see for instance [23], Section 10, and the references therein. IfAis the Borelσ-field of a Polish topology onΩ then it is enough to take the supremum over bounded and continuous functions in (2). In particular the mapμ→H(μ|m)is lower semicontinuous with respect to the topology of weak convergence of measures. IfT:(Ω,A),A)is a measurable map then

H

μT1|mT1

≤H(μ|m) (3)

and assuming that H(μ|m) <+∞, equality occurs if and only if the density dμ/dmis a function of T. We now describe the setting of the article. LetWbe the space of continuous paths

wC0

R+,Rd

, w0=0

equipped with the topology of uniform convergence on compact intervals. LetBbe the associated Borelσ-field and letγ be the Wiener measure on(W,B). Letxt:wwtbe the coordinate process and(Gt)t0be the natural filtration ofx. It is well known thatBcoincides with the smallestσ-field containing

t0Gt. LetHbe the Cameron–Martin space: a pathUbelongs toHif there existsu∈L2([0,+∞);Rd)such that

Ut= t

0

usds, t≥0.

The norm ofU inHis then defined by U =

+∞

0

|us|2ds 1/2

.

The Cauchy–Schwarz inequality shows that the Hilbert spaceHembeds continuously inW. Given a probability space (Ω,A,P)equipped with a filtration(Ft)t0we calldriftany adapted processU which belongs toHalmost surely.

Lastly, our Brownian motions are alwaysd-dimensional, standard and always start from 0.

2.1. The upper bound

We shall use repeatedly Girsanov’s formula, see [18], Chapter 6.

Proposition 1. LetBbe a Brownian motion defined on some filtered probability space(Ω,A,P,F)and letU be a drift.Lettingμbe the law ofB+U,we have

H(μ|γ )≤1

2EU2. (4)

(3)

Proof. WriteUt =t

0usds and assume for the moment thatU2=

0 |us|2ds is uniformly bounded. Then by Novikov’s criterion

Mt=exp

t

0

us·dBs−1 2

t

0 |us|2ds

, t≥0,

is a uniformly integrable martingale and Girsanov’s formula applies. Under dQ=MdP

the processX:=B+Uis a Brownian motion. ThereforeXhas lawμandγ underPandQ, respectively. Then by (3) H(μ|γ )≤H(P|Q)= −Elog(M)=1

2EU2,

which concludes the proof whenUis bounded. In the general case, define the stopping time Tn=inf

t≥0,

t

0 |us|2ds≥n

,

letUnbe the stopped process(Un)t=UtTnandμnbe the law ofB+Un. With probability 1 we haveU2<+∞, thusTn→ +∞andUnU inH, hence inW. Thereforeμnμweakly. AlsoEUn2EU2 by monotone convergence. Thus, using the lower semicontinuity of the entropy (observe thatWis a Polish space)

H(μ|γ )≤lim inf

n H(μn|γ )

≤lim inf

n

1

2EUn2=1

2EU2.

Remark. It follows immediately that whenEU2<+∞,the law ofB+U is absolutely continuous with respect to the Wiener measureγ.Let us point out that this is actually true for all driftsU,even ifEU2= +∞,see[18], Chapter7.

2.2. Föllmer’s drift

Let us address the question whether, given a probability measureμonW, equality can be achieved in (4). Recall that (xt)t0is the coordinate process on Wiener space(W,B, γ )and that(Gt)t0is its natural filtration. The following is due to Föllmer [14,15].

Theorem 2. Letμbe a measure on(W,B)having densityF with respect toγ.There exists an adapted processu such that underμthe following holds.

1. The processUt=t

0usdsbelongs toHalmost surely. 2. The processy=xUis a Brownian motion.

3. The relative entropy ofμis H(μ|γ )=1

2E

μU2.

We sketch the proof for completeness.

Proof of Theorem2. ThroughoutEγ andEμdenote expectations with respect toγ andμ respectively. OnGt the measureμhas density

Ft:=Eγ(F|Gt),

(4)

with respect toγ. A standard martingale argument shows that μ

tinf0Ft>0

=μ(F >0)=1. (5)

Since Brownian martingales can be represented as stochastic integrals there exists an adapted processvsatisfying γ

+∞

0

|vs|2ds <+∞

=1 (6)

and

Ft=1+ t

0

vs·dxs, t≥0.

Letube the process defined by ut=1{Ft>0}(Ft)1vt. It is adapted and (5) and (6) yield

μ

0

|us|2ds <+∞

=1, which is the first assertion of the theorem.

The assertion 2 follows from Girsanov’s formula, see [18], Theorem 6.2.

Underμ, we have Ft =1+

t

0

Fsus·dxs

=1+ t

0

Fsus·dys+ t

0

Fs|us|2ds.

Applying Itô’s formula (recall thatF is positive andyis a Brownian motion underμ) we obtain log(F )=

+∞

0

us·dys+1 2

+∞

0

|us|2ds.

IfEμU2<+∞the local martingale part in the equation above is integrable and has mean 0 so that H(μ|γ )=Eμlog(F )=1

2E

μU2.

Again, a localization argument shows that this equality remains valid whenEμU2= +∞, see [14], Lemma 2.6.

To finish this subsection, we give a formula for Föllmer’s drift when the underlying density has a Malliavin deriva- tive, we refer to the first chapter of [19] for the (little amount of) Malliavin calculus we shall use. For suitable F:W→Rwe let DF:W→Hbe the Malliavin derivative ofF. The domain of D in the space L2(W,B, γ ) is denoted byD2. IfF ∈D2then the Clark–Ocone formula asserts

Eγ(F|Gt)=1+ t

0

Eγ(DsF|Gs)·dxs, t≥0.

We obtain the following result.

(5)

Lemma 3. WhenF∈D2the processut given by Theorem2is ut=Eγ(DtF|Gt)

Eγ(F|Gt) 1{Eγ(F|Gt)>0}. This implies thatμ-almost surely

ut=Eμ DtF

F Gt

.

2.3. Optimal drift in a strong sense

According to Theorem2, the filtered probability space(W,B, μ,G)carries a Brownian motiony. The processx= y+Uhas lawμand the driftUsatisfies

H(μ|γ )=1 2E

μU2.

Still, it remains open whethergivena probability space, a filtration and a Brownian motion, there exists a drift achiev- ing equality in (4).

It this section, we show that this is indeed the case, under some restriction on the measureμ. The approach is taken from the article [4] in which Baudoin treats the case of Brownian bridges (see Section2.5below). We refer to [20] for the background on stochastic differential equations.

Theorem 4. LetBbe a Brownian motion defined on some filtered probability space(Ω,A,P,F).Letμbe a measure onW,absolutely continuous with respect toγ and letut:W→Rdbe the associated Föllmer process.If the stochastic differential equation

Xt=Bt+ t

0

us(X)ds, t≥0, (7)

has the pathwise uniqueness property,then it has a unique strong solution.This solutionXsatisfies the following.

1. The processUt=t

0us(X)dsbelongs toHalmost surely. 2. The processXhas lawμ.

3. The relative entropy ofμis given by H(μ|γ )=1

2EU2.

Proof. According to Theorem2, on(W,B, μ)the coordinate processx satisfies xt=yt+

t

0

us(x)ds,

wherey is a Brownian motion. Therefore (7) has a weak solution. By Yamada and Watanabe’s theorem, if pathwise uniqueness holds then (7) has a unique strong solution. Moreover, since pathwise uniqueness implies uniqueness in law, the solutionXhas lawμ. The rest of Theorem4concerns the law ofX, so it is contained in Theorem2.

We end this section by showing that for a reasonably large class of measuresμ, the stochastic differential equation (7) does satisfy the pathwise uniqueness property.

Definition 5. LetSbe the class of probability measures on(W,B, γ )having a density of the form

F (w)=Φ(wt1, . . . , wtn) (8)

for some integern,for some sample0≤t1< t2<· · ·< tnand for some functionΦ:(Rd)n→Rsatisfying

(6)

Φis Lipschitz.

• ∇Φ is Lipschitz.

There existsε >0such thatΦε.

Lemma 6. Ifμbelongs toSthen the equation(7)has the pathwise uniqueness property.

Proof. Letμhave densityF given by (8). ThenF∈D2and DF (w)=

n

i=1

iΦ(wt1, . . . , wtn)1[0,ti],

where∇iΦis the gradient ofΦin theith variable. By Lemma3, the process associated toμis ut(w)=Eγ(DtF (w)|Gt)

Eγ(F (w)|Gt)

= n

i=1

Eγ(iΦ(wt1, . . . , wtn)|Gt)

Eγ(Φ(wt1, . . . , wtn)|Gt) 1[0,ti](t).

It is enough to prove that there is a constantCsuch that ut(w)ut(w)˜ ≤C sup

0st

|ws− ˜ws| (9)

for allt≥0 and for allw,w˜ ∈W. Fixt≥0 and assume thattkt < tk+1for somek∈ {0, . . . , n−1}. By the Markov property of the Brownian motion

E

Φ(wt1, . . . , wtn)|Gt

=Ψ (wt1, . . . , wtk, wt), whereΨ (x1, . . . , xk, x)equals

WΦ(x1, . . . , xk, x+wtk+1t, . . . , x+wtnt)γ (dw).

Then observe that ΨlipΦlip. We have a similar property when 0≤t < t1 and when tnt. The argument

applies also to∇iΦ. The inequality (9) follows easily.

To sum up, we have the following representation formula.

Theorem 7. Let(Ω,A,P,F)be a filtered probability space and letB:Ω→Wbe a Brownian motion.For allμS we have

H(μ|γ )=min

U

1 2EU2

,

where the minimum is on all driftsUsuch thatB+Uhas lawμ.

2.4. The Boué and Dupuis formula

In this subsection the previous results are translated in terms of log-Laplace using the following lemma.

Lemma 8. Letf:W→Rbounded from below.For every positiveεthere existsμSsuch that log

Wef

Wfdμ−H(μ|γ )+ε. (10)

(7)

Proof. By monotone convergence we can assume that f is also bounded from above, and that

efdγ =1. Set F =ef and letμbe a probability measure onWhaving densityGwith respect toγ. Then

H(μ|γ )

fdμ= G

F log G

F

Fdγ . Usingtlog(t)≤ |t−1| + |t−1|2/2 we get

H(μ|γ )

fdμ≤ G F −1

Fdγ+1

2 G

F −1 2F

FGL1(γ )+CFG2L2(γ )

for some constantC(recall thatf is bounded below). Therefore, it is enough to show that there existsμS whose

densityGis arbitrarily close toF in L2(γ ). This is a standard fact.

Here is the Boué and Dupuis formula.

Theorem 9. For every functionf:W→Rmeasurable and bounded from below,we have log

Wef

=sup

U

E

f (B+U )−1 2U2

,

where the supremum is taken over all driftsU.

This is actually slightly more general than the result in [6], which concerns the spaceC([0, T],Rd)for some finite time horizonT.

Proof of Theorem9. LetU be a drift and μbe the law of B+U. By Proposition1 and the entropy/log-Lapace duality

E

f (B+U )−1 2U2

fdμ−H(μ|γ )≤log

Wef

.

On the other hand, givenε >0, there exists a probability measureμS satisfying (10). SinceμS, Theorem7 asserts that there exists a driftUsuch thatB+U has lawμand satisfying

H(μ|γ )=1 2EU2. Then (10) becomes

log

Wef

E

f (B+U )−1 2U2

+ε,

which concludes the proof.

2.5. Brownian bridges A measureμonWsatisfying

μ(dw)=ρ(w1)γ (dw), (11)

whereρis some density on(Rd, γd)is said to be a Brownian bridge. It can be seen as the law of a Brownian motion conditioned to have lawρ(x)γd(dx)at time 1.

(8)

Lemma 10. Letνhave densityρwith respect toγd,we have H(ν|γd)=inf

μ

H(μ|γ ) ,

where the infimum is on all probability measures satisfyingμ(x1)1=ν.The infimum is attained whenμis the bridge(11).

In other words, among all processes having lawν at time 1, the bridge minimizes the relative entropy. This is essentially a particular case of (3), see also [4] and [16], p. 161.

Assume that ρ is differentiable and that ∇ρ ∈L2d). Then F (w)=ρ(w1) belongs toD2 and has Malliavin derivative

DF (w)= ∇ρ(w1)1[0,1].

By Lemma3the Föllmer process of the bridgeμis such that ut=Eμ

∇log(ρ)(w1)|Gt

1[0,1](t), μ-a.s.

We obtain the following result.

Lemma 11. Underμ,the process(ut)t∈[0,1]is a martingale.In particular

Eμ(ut)=Eμ∇log(ρ)(w1)=Eγρ(w1)=

Rdxν(dx).

Now assume thatρand∇ρare Lipschitz and thatρε, so that the bridgeμbelongs toS. It is easily seen thatut can also be written as

ut(w)= ∇logP1tρ(wt)1[0,1](t), wherePtdenotes the heat semigroup onRd:

tPt=1 2Pt.

The stochastic differential equation (7) becomes Xt=Bt+

t1 0

∇log(P1sρ)(Xs)ds, t≥0. (12)

By Lemma6, there is a unique strong solution. Combining Lemma10with Theorem4we obtain the following dual formulation of Borell’s result (1).

Theorem 12. Letνandρbe as above.Then H(ν|γd)=inf

U

1 2EU2

,

where the infimum is taken on all driftsUsatisfyingB1+U1=νin law.The infimum is attained by the drift Ut=

t1 0

∇log(P1sρ)(Xs)ds, whereXis the unique solution of(12).

(9)

3. Applications

Following Borell, we now derive functional inequalities from the representation formula. Let us point out that in all but one applications we use Proposition1and Theorem2rather than Theorem7.

3.1. Transportation cost inequality

Let T2be the transportation cost for the Euclidean distance squared: given two probability measuresμandνonRd T2(μ, ν)=inf

Rd×Rd|xy|2dπ(x, y)

, (13)

where the infimum is taken over all couplings π ofμandν, namely all probability measures on the product space Rd×Rdhaving marginalsμandν. There is a huge literature about this optimization problem, usually referred to as Monge–Kantorovitch problem, see Villani’s book [24]. Talagrand’s inequality [22] asserts that

T2(ν, γd)≤2H(ν|γd)

for every probability measure ν on Rd. The purpose of this subsection is to prove a Wiener space version of this inequality.

On Wiener space the natural definition of T2involves the norm of the Cameron–Martin spaceH: given two proba- bility measuresμ, νon(W,B)

T2(μ, ν)=inf

W×W

w−w2π

dw,dw ,

where the infimum is taken over all couplingsπ ofμandνsuch thatww∈Hforπ-almost all(w, w).

Theorem 13. Letμbe a probability measure on(W,B).Then T2(μ, γ )≤2H(μ|γ ).

Here is a short proof based of Theorem2. Fair enough, Feyel and Üstünel [12] have a very similar argument.

Proof of Theorem13. Assume thatμis absolutely continuous with respect toγ (otherwise H(μ|γ )= +∞). Ac- cording to Theorem2there exists a Brownian motionBand a driftU such thatB+Uhas lawμand

H(μ|γ )=1 2EU2.

Then(B, B+U )is a coupling of(γ , μ)and by definition of T2

T2(μ, γ )EU2=2H(μ|γ ).

Let us point out that Talagrand’s inequality can be recovered easily from this theorem, applying it to a Brownian bridge. Details are left to the reader.

3.2. Logarithmic Sobolev inequality

In this section we prove the logarithmic Sobolev inequality for the Wiener measure, which extends the classical log- Sobolev inequality for the Gaussian measure, due to Gross [17]. Whenμis a measure on(W,B, γ )with densityF such that DF is well defined, the Fisher information ofμis

I(μ|γ )=

W

DF2 F dγ=

W

DF F

2dμ.

(10)

Theorem 14. Letμhave densityF with respect toγ and assume thatF ∈D2.Then H(μ|γ )≤1

2I(μ|γ ). (14)

Proof. We consider the probability space(W,B, μ). Recall that(Gt)t0is the filtration of the coordinate process. By Theorem2and Lemma3, letting

ut=Eμ DtF

F Gt

we have

H(μ|γ )=1 2E

μ

0

|ut|2dt.

By Jensen’s inequality Eμ|ut|2Eμ

DtF F

2

so that

H(μ|γ )≤1 2E

μ DF

F 2

which is the result.

This may not be the most straightforward proof, see [8]. Let us emphasize that applying (14) to a Brownian bridge yields the usual log-Sobolev inequality. More precisely, letνbe a probability measure onRdhaving a smooth density ρwith respect toγdand letμbe the measure onWgiven by

μ(dw)=ρ(w1)γ (dw).

Then H(ν|γd)=H(μ|γ ). On the other hand lettingF (w)=ρ(w1)we have DF (w)= ∇ρ(w1)1[0,1],

which implies easily that I(ν|γd)=I(μ|γ ). Thus (14) becomes H(ν|γd)≤1

2I(ν|γd).

3.3. Shannon’s inequality

Given a random vectorηonRdhaving densityρwith respect to the Lebesgue measure, Shannon’s entropy is defined as

S(η)= −

Rdρlog(ρ)dx.

In other words S(η)= −H(ν|λd)whereνis the law ofηandλdis the Lebesgue measure onRd. Theorem 15. Letη, ξ be independent random vectors onRdandθ∈ [0,π/2]

S

cos(θ )η+sin(θ )ξ

≥cos(θ )2S(η)+sin(θ )2S(ξ ). (15)

(11)

This inequality plays a central role in information theory, see [11] for an overview on the topic.

Proof of Theorem15. Let νθ be the law of cos(θ )η+sin(θ )ξ. By Theorem 2, Lemmas 10and11there exists a Brownian motionXand a driftUsuch that

X1+U1has lawν0.

• H(ν0|γd)=EU2/2.

E(U )=E(η)1[0,1].

Similarly, there exists a Brownian motionY and a driftV satisfying the corresponding properties forνπ/2. Besides, we can clearly assume thatY is independent ofX. Then cos(θ )X+sin(θ )Y is a Brownian motion and

cos(θ )X1+sin(θ )Y1+cos(θ )U1+sin(θ )V1

has lawνθ. By Proposition1and Lemma10 H(νθ|γd)≤1

2Ecos(θ )U+sin(θ )V2. Denoting the inner product inHby·,·we have

EU, V = EU,EV =(Eη)·(Eξ ), so that

H(νθ|γd)≤cos(θ )2H(ν0|γd)+sin(θ )2H(νπ/2|γd) +cos(θ )sin(θ )(Eη)·(Eξ ).

This is easily seen to be equivalent to (15).

3.4. Brascamp–Lieb inequality

Let us focus on a family of inequalities dating back to Brascamp and Lieb’s article [7] on optimal constants in Young’s inequality. Since then a number of nice alternate proofs have been discovered, see [3,9,10] and the survey article [1].

This subsection is inspired by the (unpublished) proof of Maurey relying on Borell’s formula.

LetE be a Euclidean space, letE1, . . . , Em be subspaces and for alli letPi be the orthogonal projection with rangeEi. The crucial hypothesis is the so-called frame condition: there existc1, . . . , cminR+such that

m

i=1

ciPi=idE. (16)

LetxE, we then have|x|2=(

ciPix)·xand sincePi is an orthogonal projection

|x|2= m

i=1

ci|Pix|2. (17)

From now on Wdenotes the space of continuous paths taking values inE and starting from 0 and γ denotes the Wiener measure onW. The spacesWi and measuresγi are defined similarly.

Theorem 16. Under the frame condition,for every probability measureμonWwe have H(μ|γ )

m

i=1

ciH(μi|γi),

whereμi =μPi1is the push-forward ofμby the projectionPi.

(12)

Proof. According to Theorem2there exists a standard Brownian motionBonEand a driftUsuch thatB+Uhas lawμand

H(μ|γ )=1 2EU2.

SincePi is an orthogonal projection, the processPiBis a standard Brownian motion onEi. AlsoPiB+PiUhas law μPi1=μi. By Proposition1

H(μi|γi)≤1

2EPiU2, i=1, . . . , m.

On the other hand, the frame condition (17) implies easily that U2=

n

i=1

ciPiU2

pointwise. Taking expectation yields the result.

As observed by Carlen and Cordero [9], this super-additivity property of the relative entropy is equivalent to the following Brascamp–Lieb inequality.

Corollary 17. Under the frame condition,givenmfunctionsFi:Wi→R+,we have

W

m

i=1

(FiPi)cidγ≤ m

i=1

Wi

Fii

ci

.

When the functions Fi depend only on the point w1 rather than on the whole path w we recover the usual Brascamp–Lieb inequality for the Gaussian measure.

3.5. Reversed Brascamp–Lieb inequality

Again E is a Euclidean space and E1, . . . , Em are subspaces satisfying the frame condition (16). Observe that if x1, . . . , xmbelong toE1, . . . , Emrespectively, then for anyyE, the Cauchy–Schwarz inequality and (17) yield

m

i=1

cixi

·y= m

i=1

ci(xi·Piy)

m

i=1

ci|xi|2

1/2 m

i=1

ci|Piy|2 1/2

= m

i=1

ci|xi|2 1/2

|y|. Hence

m

i=1

cixi

2

m

i=1

ci|xi|2. (18)

LetSi be the class of probability measures onEi which satisfy the conditions of Definition5, replacingRd byEi. Here is the reversed version of Theorem16.

(13)

Theorem 18. Givenmprobability measuresμ1, . . . , μmbelonging toS1, . . . ,Smrespectively,there existmprocesses X1, . . . , Xm(defined on the same probability space)such that

1. Xihas lawμi for alli=1, . . . , m.

2. Lettingμbe the law of

ciXiwe have

H(μ|γ )m

i=1

ciH(μi|γi).

Proof. Again letB be a standard Brownian motion onE. Fori=1, . . . , m, the processPiB is a standard Brownian motion onEi. SinceμiSithere exists a driftUi such that the processXi=PiB+Ui has lawμiand

H(μi|γi)=1

2EUi2. LetX=

ciXi and letμbe the law ofX. Since

ciPi is the identity ofE X=B+

m

i=1

ciUi.

By Proposition1, we get

H(μ|γ )≤1 2E

m

i=1

ciUi

2

.

On the other hand (18) easily implies that

m

i=1

ciUi

2

m

i=1

ciUi2,

pointwise. Taking expectation we get the result.

This sub-additivity property of the entropy is a multi-marginal version of thedisplacement convexityproperty put forward by Sturm [21]. By duality, we obtain the following reversed Brascamp–Lieb inequality.

Corollary 19. Assuming the frame condition,givenmfunctionsFi:Wi→R+bounded away from0,and a function G:W→R+satisfying

m

i=1

Fi(wi)ciG m

i=1

ciwi

(19) for all(w1, . . . , wm)∈W1× · · · ×Wm,we have

m

i=1

Wi

Fii

ci

WGdγ .

Proof. By Lemma8, for everyi, there exists a measureμiSi such that log

Wi

Fii

Wi

log(Fi)dμi−H(μi|γi)+ε.

(14)

LetX1, . . . , Xmbe the random processes given by the previous theorem, letX=

ciXi and letμbe the law ofX.

Then by duality and the hypothesis (19) we get log

WG

Elog(G)(X)−H(μ|γ )

E m

i=1

cilog(Fi)(Xi)

−H(μ|γ ).

Since H(μ|γ )

ciH(μi|γi), this is at least ci

log

Wi

Fii

ε

.

Lettingεtend to 0 yields the result.

Again when the functions depend only on the value of the path at time 1, we recover the reversed Brascamp–Lieb inequality for the Gaussian measure, which is due to Barthe [2].

Acknowledgements

The author is grateful to Patrick Cattiaux and Massimiliano Gubinelli for communicating references and to Christian Léonard, Bernard Maurey and Patrick Cattiaux again for valuable discussions.

References

[1] K. Ball. Convex geometry and functional analysis. InHandbook of the Geometry of Banach Spaces, Vol. 1 161–194. W. B. Johnson and J.

Lindenstrauss (Eds). North-Holland, Amsterdam, 2001.MR1863692

[2] F. Barthe. On a reverse form of the Brascamp–Lieb inequality.Invent. Math.134(1998) 335–361.MR1650312 [3] F. Barthe and N. Huet. On Gaussian Brunn–Minkowski inequalities.Studia Math.191(2009) 283–304.MR2481898

[4] F. Baudoin. Conditioned stochastic differential equations: Theory, examples and application to finance.Stochastic Process. Appl.100(2002) 109–145.MR1919610

[5] C. Borell. Diffusion equations and geometric inequalities.Potential Anal.12(2000) 49–71.MR1745333

[6] M. Boué and P. Dupuis. A variational representation for certain functionals of Brownian motion.Ann. Probab.26(1998) 1641–1659.

MR1675051

[7] H. J. Brascamp and E. H. Lieb. Best constants in Young’s inequality, its converse and its generalization to more than three functions.Adv.

Math.20(1976) 151–173.MR0412366

[8] M. Capitaine, E. P. Hsu and M. Ledoux. Martingale representation and a simple proof of logarithmic Sobolev inequalities on path spaces.

Electron. Commun. Probab.2(1997) 71–81.MR1484557

[9] E. Carlen and D. Cordero-Erausquin. Subadditivity of the entropy and its relation to Brascamp–Lieb type inequalities.Geom. Funct. Anal.19 (2009) 373–405.MR2545242

[10] D. Cordero-Erausquin and M. Ledoux. The geometry of Euclidean convolution inequalities and entropy.Proc. Amer. Math. Soc.138(2010) 2755–2769.MR2644890

[11] A. Dembo, T. M. Cover and J. A. Thomas. Information theoretic inequalities.IEEE Trans. Inform. Theory37(1991) 1501–1518.MR1134291 [12] D. Feyel and A. S. Üstünel. Measure transport on Wiener space and the Girsanov theorem.C. R. Math. Acad. Sci. Paris334(2002) 1025–1028.

MR1913729

[13] W. H. Fleming and H. M. Soner.Controlled Markov Processes and Viscosity Solutions, 2nd edition.Stochastic Modelling and Applied Probability25. Springer, New York, 2006.MR2179357

[14] H. Föllmer. An entropy approach to the time reversal of diffusion processes. InStochastic Differential Systems (Marseille-Luminy, 1984) 156–163.Lecture Notes in Control and Inform. Sci.69. Springer, Berlin, 1985.MR0798318

[15] H. Föllmer. Time reversal on Wiener space. InStochastic Processes – Mathematics and Physics (Bielefeld, 1984)119–129.Lecture Notes in Math.1158. Springer, Berlin, 1986.MR0838561

[16] H. Föllmer. Random fields and diffusion processes. InÉcole d’Été de Probabilités de Saint-Flour XV–XVII, 1985–87101–203.Lecture Notes in Math.1362. Springer, Berlin, 1988.MR0983373

[17] L. Gross. Logarithmic Sobolev inequalities.Amer. J. Math.97(1975) 1061–1083.MR0420249

[18] R. S. Liptser and A. N. Shiryayev.Statistics of Random Processes, Vol. 1: General Theory.Applications of Mathematics5. Springer, New York, 1977.MR0474486

(15)

[19] D. Nualart.The Malliavin Calculus and Related Topics, 2nd edition.Probability and Its Applications. Springer, Berlin, 2006.MR2200233 [20] L. C. G. Rogers and D. Williams.Diffusions, Markov Processes, and Martingales, Vol. 2: Itô Calculus.Cambridge Mathematical Library.

Cambridge Univ. Press, Cambridge, 2000.MR1780932

[21] K. T. Sturm. On the geometry of metric measure spaces. I.Acta Math.196(2006) 65–131.MR2237206

[22] M. Talagrand. Transportation cost for Gaussian and other product measures.Geom. Funct. Anal.6(1996) 587–600.MR1392331

[23] S. R. S. Varadhan.Large Deviations and Applications.CBMS–NSF Regional Conference Series in Applied Mathematics46. SIAM, Philadel- phia, 1984.MR0758258

[24] C. Villani.Optimal Transport: Old and New.Grundlehren der Mathematischen Wissenschaften338. Springer, Berlin, 2009.MR2459454

Références

Documents relatifs

We revisit entropy methods to prove new sharp trace logarithmic Sobolev and sharp Gagliardo-Nirenberg-Sobolev inequalities on the half space, with a focus on the entropy

Dynamics of mean field Schr¨ odinger bridges Our mean field version of the Schr¨ odinger problem stems from fundamental results in large deviations for weakly interacting

uno de los sentidos que ofrecen tanto el diccionario da la Real Academia como de María Moliner. En nuestra ciudad se refiere al juego de las cartas hacer ronda es

Since our results show that nonconvex transients of the relative entropy can occur arbitrarily close to the steady state, a natural question is whether a convex/monotone behavior

Entropy numbers and spectral radius type formula for composition operators on the polydisk.. Daniel Li, Hervé Queffélec,

Maximum Entropy modelling of a few protein families with pairwise interaction Potts models give values of σ ranging between 1.2 (when all amino acids present in the multiple

The relative entropy method for the stability of intermediate shock waves ; the rich case.. Denis Serre ∗ ,