• Aucun résultat trouvé

Wasserstein geometry of porous medium equation

N/A
N/A
Protected

Academic year: 2022

Partager "Wasserstein geometry of porous medium equation"

Copied!
16
0
0

Texte intégral

(1)

Ann. I. H. Poincaré – AN 29 (2012) 217–232

www.elsevier.com/locate/anihpc

Wasserstein geometry of porous medium equation

Asuka Takatsu

a,b,c,

aGraduate School of Mathematics, Nagoya University, Nagoya 464-8602, Japan bInstitut des Hautes Études Scientifiques, Bures-sur-Yvette 91440, France cMax-Planck-Intitut für Mathematik, Vivatsgasse 7, 53111 Bonn, Germany

Received 18 February 2010; received in revised form 23 August 2011; accepted 11 October 2011 Available online 19 October 2011

Abstract

We study the porous medium equation with emphasis onq-Gaussian measures, which are generalizations of Gaussian measures by using power-law distribution. On the space ofq-Gaussian measures, the porous medium equation is reduced to an ordinary dif- ferential equation for covariance matrix. We introduce a set of inequalities among functionals which gauge the difference between pairs of probability measures and are useful in the analysis of the porous medium equation. We show that anyq-Gaussian measure provides a nontrivial pair attaining equality in these inequalities.

©2011 Elsevier Masson SAS. All rights reserved.

MSC:60D05; 46E27

Keywords:q-Gaussian measure; Porous medium equation; Functional inequality

1. Introduction

Aq-Gaussian measure is one of power-law distributions onRd, which is described by mean, covariance matrix parameters and theq-exponential functiongiven by

expq(t ):=

1+(1q)t1−q1

+ forqQd:=(0,1)∪

1,d+4 d+2

,

where we put[x]+:=max{x,0}and by convention 0a:= ∞for any negative number a (see (3.1) for the precise definition of q-Gaussian measures). The q-Gaussian measure is considered as an approximation of the Gaussian measure since theq-exponential function recovers the usual exponential function in the limitq→1. This paper aims to demonstrate that theq-Gaussian measure inherits some features of the Gaussian measure. To do this, let us start by recalling properties of the Gauss measure.

For any v∈Rd andV ∈Sym+(d,R), which is the set of symmetric positive definite matrices of size d, the Gaussian measureN (v, V )with meanvand covariance matrixV is given by

This work was partially supported by Research Fellowships of the Japan Society for the Promotion of Science for Young Scientists and by JSPS–IHÉS (EPDI) Fellowship.

* Correspondence to: Graduate School of Mathematics, Nagoya University, Nagoya 464-8602, Japan.

E-mail address:[email protected].

0294-1449/$ – see front matter ©2011 Elsevier Masson SAS. All rights reserved.

doi:10.1016/j.anihpc.2011.10.003

(2)

N (v, V ):=exp

−1 2

xv, V1(xv)

det(2π V )

1 2Ld,

whereLd stands for the Lebesgue measure onRd. For example, the density ofN (0,2t Id), whereId is the identity matrix of sizedandt >0, is the heat kernel which is a self-similar solution to the heat equation

∂tρ=ρ.

As well as the heat kernel, a solution to the heat equation with an initial data being a Gaussian density remains Gaussian densities for all future time, that is, the space of Gaussian measures is stable under the heat equation. On the space of Gaussian measures, the heat equation is reduced to an ordinary differential equation for covariance matrix.

It simplifies not only the analysis of the heat equation but also the analysis of the Wasserstein space to restrict arguments to the space of Gaussian measures. TheWasserstein spaceis the spaceP2of Borel probability measures onRdhaving finite second moments equipped with a distance functionW2defined by

W2(μ, ν):=inf

Rd×Rd

|xy|2dπ(x, y) 1

2 π: a transport plan ofμandν

,

where atransport planofμandνis a probability measure onRd×Rdsuch thatπ[B×Rd] =μ[B]andπ[Rd×B] = ν[B]for any Borel setB⊂Rd. A transport plan is said to beoptimalif it attains the infimum above, which always exits (see [24, Chapter 4]). Though the explicit expression of an optimal transport plan is not usually obtained, the explicit expression of an optimal transport plan of a pair of Gaussian measures is known, which guarantees the convexity of the space of Gaussian measures in Wasserstein geometry. The restriction to the space makes it possible to analyze Wasserstein geometry in detail (see [17]).

We furthermore focus on the fact that the Gaussian measure satisfies a set of inequalities among the relative en- tropyH, the Fisher informationI and the Wasserstein distance functionW2, all of which gauge differences between pairs of probability measures and are defined by

H (μ|ν):= Rd ln ifμis absolutely continuous with respect toν,

+∞ otherwise,

I (μ|ν):= Rd|∇ln|2 ifμis absolutely continuous with respect toν,

+∞ otherwise.

We say that a probability measureνsatisfies the logarithmic Sobolev inequalitywith constantλ, in short LS(λ) if we have

H (μ|ν) 1 2λI (μ|ν)

for all absolutely continuous probability measure μ with respect toν. Similarly, the probability measureν is said tosatisfy the Talagrand inequalitywith constantλ, in short T(λ) (resp. theHWI inequalitywith constantλ, in short HWI(λ)) if we have

W2(μ, ν) 2

λH (μ|ν)

resp.H (μ|ν)W2(μ, ν)

I (μ|ν)λ

2W2(μ, ν)2

for all absolutely continuous probability measureμwith respect to ν. Any Gaussian measure satisfies LS(λ), T(λ) and HWI(λ), whereλ1is the largest eigenvalues of the covariance matrix, and moreover provides a nontrivial pair attaining equality in these inequalities. Note that criteria for a probability measure to satisfy LS(λ), T(λ) and HWI(λ) are known, for example, LS(λ) with some convexity condition forνimplies T(λ) (see [15] for the details). The key of the proof that LS(λ) implies T(λ) is to analyze asymptotic behaviors of the Fokker–Planck equation of the form

∂tρ=ρ+div(ρ∇Ψ ),

whereΨ is a function onRd, on the contrary, these inequalities are applied to analyze asymptotic behaviors of the Fokker–Planck equation. In particular, Otto [14] investigated the asymptotic behaviors in the case thatΨ (x)= |x|2/4 using Wasserstein gradient structure, where the scaling

(3)

ρ(t, x)=td2ρˆ

lnt, t12x

provides a one-to-one correspondence between solutionsρ to the heat equation and the solutionsρˆ to the Fokker–

Planck equation.

We show that such features are inherited to the space ofq-Gaussian measures after some preliminaries on the q-exponential function and the Wasserstein geometry in Section 2. Section 3 is devoted to the convexity of the space ofq-Gaussian measures in the Wasserstein space (Theorem A). In Section 4, we consider the features ofq-Gaussian measures as solutions to the porous medium equation

∂tρ=

ρ2q . (1.1)

Since the stability of the space ofq-Gaussian measures has been already shown by Ohara and Wada [12], it is possible to restrict the porous medium equation to the space ofq-Gaussian measures. We introduce the ordinary differential equation for covariance matrix obtained by restricting the porous medium equation to the space ofq-Gaussian mea- sures (Theorem B). Section 5 is concerned with generalizations of the logarithmic Sobolev inequality, the Talagrand inequality and the HWI inequality, which play crucial roles in analyzing the asymptotic behaviors of the nonlinear Fokker–Planck equation

∂tρ=

ρ2q +div(ρ∇Ψ ).

We give criteria for a probability measure to satisfy these inequalities and show that anyq-Gaussian measure provides a nontrivial pair attaining equality in these inequalities ifq >1 (Corollaries C, D). We finally prove that a variant of the logarithmic Sobolev inequality implies a variant of the Talagrand inequality using the nonlinear Fokker–Planck equation (Theorem E).

We refer to the preceding results in the literature. Generalizations of the logarithmic Sobolev inequality, the Tala- grand inequality and the HWI inequality have been studied by several authors. Carrillo, Jüngel, Markowich, Toscani and Untterreiter [6] studied these functional inequalities for parabolic systems using entropy dissipation methods.

Carrillo, McCann and Villani [7] also investigated these functional inequalities and they also estimated in [8] the con- traction rate of nonlinear evolution equations in the Wasserstein distance function. Agueh, Ghoussoub and Kang [1]

and Cordero-Erausquin, Gangbo and Houdré [9] introduced variants of the relative entropy and the Fisher informa- tion, through modifying the Boltzmann entropy and the quadratic transport cost. However, none of them referred to theq-Gaussian measures and our result is the first one concerning the importance of theq-Gaussian measures in the porous medium equation. See also [13], where Ohta and the author investigated the nonlinear Fokker–Planck equation on a (weighted) Riemannian manifold using the Wasserstein gradient structure and theq-exponential function.

2. Preliminaries

2.1. q-exponential function andq-logarithmic function

We first summarize the q-calculus, see [21] for further discussion. Take qQd and fix it. We define the q- logarithmic functionlnqby

lnq(t ):=t1q−1 1−q

fort >0. Since the function lnq is monotone increasing, there exists its inverse function on the image. This inverse function is called theq-exponential functionexpqand is naturally extended to all ofRas

expq(t ):=

1+(1q)t11q

+ .

Note that the functions lnq and expq recover the usual logarithmic function and the usual exponential function asq tends to 1, respectively.

Let us define functionals on the space Pac of absolutely continuous probability measures with respect to the Lebesgue measure using theq-logarithmic function. TheTsallis entropyEqis defined by

(4)

Eq(μ):= −

Rd

fqlnq(f ) dLd= −

Rd

fqf q−1 dLd

forμ=fLdPac, which is regarded as an approximation of the Boltzmann entropyEsince we have Eq(μ)−−−→q1 E(μ)= −

Rd

fln(f ) dLd.

We also define theq-relative entropyHqand theq-Fisher informationby Hq|ν):= 1

2−q

Rd

flnq(f )glnq(g)(2q)lnq(g)(fg) dLd,

Iq|ν):=

Rd

lnq(f )−lnq(g)2

forμ=fLd, ν=gLdPac, both of which are non-negative and gauge the difference between pairs of probability measures. Note that limq1Hq|ν)=H (μ|ν)and limq1Iq|ν)=I (μ|ν)hold. The square root of theq-relative entropyHq is considered as a generalization of the distance function in the context of information geometry since this satisfies a generalized Pythagorean relation (see [2,3] for information geometry). We refer to the generalized Pythagorean relation in Remark 3.2. It will be demonstrated in (5.15) that−Iq|ν)is the first variation ofHq(·|ν) atμ.

2.2. Wasserstein geometry

We briefly recall some results on the Wasserstein space. See [23,24] and references therein for the details and more information on Wasserstein geometry.

As mentioned in the introduction, an optimal transport of any pair inP2always exits. Though the explicit expres- sion of an optimal transport is usually not obtained, it has been known since the work of Brenier [5] that an optimal transport is characterized by a push forward measure of the gradient of a convex function. Apush forward measureof a probability measureμby a mapT, denoted byT μ, is defined byT μ[B] :=μ[T1(B)]for all Borel setsB⊂Rd. Given two probability measuresμ=fLd,ν=gLdand a differentiable mapT onRd,ν=T μis equivalent to that

f =g(T )det(dT ) (2.1)

holds forμ-almost everywhere, wheredT is the total differential ofT. We denote by id the identity map onRd. Theorem 2.1. (See [5].) Letμ, νP2 be such thatμ does not give mass to sets of Hausdorff dimension at most (d−1). Then there exists a convex function such that∇ϕpushesμforward toνand[id× ∇ϕ] μis a unique optimal transport plan of them. Moreover,{[(1t )id+tϕ] μ}t∈[0,1]is a unique geodesic fromμtoν in the Wasserstein space.

The converse situation also holds true, that is, if a support of a transport plan is almost contained in the subdiffer- ential of a convex function, then the transport plan is optimal. This is known as Knott–Smith optimality criterion.

Theorem 2.2. (See [23, Theorem 2.16].) Let ϕ be a proper lower semicontinuous convex function on Rd. For μ, νP2, letπbe a transport plan of them such that

Rd×Rd

ϕ(x)+sup

z∈Rd

y, zϕ(z)x, y

dπ(x, y)ε.

Then we have

Rd×Rd

|xy|2dπ(x, y)W2(μ, ν)2+2ε.

(5)

We finally give the explicit expression of the optimal transport between a pair of Gaussian measures, which helps us to understand Wasserstein geometry (see [17] and references therein for more details). Given anyX∈Sym+(d,R), we define a symmetric positive definite matrixX1/2=√

Xsuch thatX1/2·X1/2=X.

For any pair of Gaussian measuresN (v, V )andN (u, U ), we define the symmetric positive definite matrixT and the associated functionT by

T :=U12

U12V U12

1

2U12, T(x):=1

2

xv, T (xv)

+ x, u, (2.2)

which provides an optimal transport plan ofN (v, V )andN (u, U ). In other words,[id× ∇T] N (v, V )is the optimal transport plan of them and then the Wasserstein distance between them is given by

W2

N (v, V ), N (u, U ) 2= |vu|2+trV +trU−2 tr

U12V U12.

A unique geodesic fromN (v, V )toN (u, U )is given by{N (wt, Wt)}t∈[0,1], where the time-dependent vectorwt and the time-dependent matrixWtare defined by

wt:=(1t )v+t u, Wt:=

(1t )Id+t T V

(1t )Id+t T

. (2.3)

3. q-Gaussian measure

Let us summarize the definition ofq-Gaussian measures and then discuss Wasserstein geometry of the space of q-Gaussian measures. Background onq-Gaussian measures is found in [20] and [21].

A probability measure Nq(v, V ) is called theq-Gaussian measurewith mean v and covariance matrixV if it maximizes the Tsallis entropy Eq amongμPac with meanv and covariance matrixV. It is known that the q- Gaussian measureNq(v, V )is given by

Nq(v, V )=C0(detV )12expq

−1 2C1

xv, V1(xv)

Ld, (3.1)

whereC0andC1are the positive constants given by

C0=C0(q, d):=

⎧⎪

⎪⎨

⎪⎪

Γ (21qq+d2)

Γ (21qq) ((1q)C1)d2 if 0< q <1,

Γ (q11)

Γ (q11d2)((q1)C1)d2 if 1< q <dd++42, C1=C1(q, d):= 2

2+(d+2)(1−q)

andΓ (·)is theΓ-function (see [22]). In some cases, theq-Gaussian measureNq(v, V )is obtained as a maximizer of the Tsallis entropyEqunder theq-meanvconstraint and theq-covarianceV constraint, that is,Nq(v, V )maximizes the Tsallis entropyEqamongμ=fLdPacsatisfying

⎧⎪

⎪⎪

⎪⎪

⎪⎪

⎪⎪

⎪⎪

⎪⎪

⎪⎪

⎪⎪

⎪⎪

⎪⎪

⎪⎩

Q(f )=

Rd

fqdLd,

Rd

xf (x)qdLd(x)=Q(f )v,

Rd

(xv)T(xv)f (x)qdLd(x)=Q(f )V ,

where vectors inRdare column andTx stands for the transpose ofx. A relation betweenNq(v, V )andNq(v, V )is given by

Nq(v, V )=Nq

v, 2+d(1q) 2+(d+2)(1−q)V

.

(6)

We use the usual mean and covariance constraint condition throughout this paper. In this case, theq-Gaussian measure is well defined forqQd and theq-Gaussian measureNq(v, V )recovers the Gaussian measureN (v, V )asqtends to 1. We denote bynq(v, V )the density ofNq(v, V )with respect to the Lebesgue measure.

The space ofq-Gaussian measures is convex in the Wasserstein space.

Theorem A.For anyqQd, the space ofq-Gaussian measures is convex and isometric to the space of Gaussian measures with respect to Wasserstein geometry.

Proof. For the functionT given in (2.2), we have nq(v, V )=nq(u, U )(T)(det HessT)

forNq(v, V )-almost everywhere, which implies[∇T] Nq(v, V )=Nq(u, U )by (2.1). Theorem 2.2 with the convex- ity ofT ensures the optimality of[id× ∇T] Nq(v, V )and we have

W2

Nq(v, V ), Nq(u, U ) 2= |vu|2+trV+trU−2 tr

U12V U12

=W2

N (v, V ), N (u, U ) 2.

Hence the map from the space of Gaussian measures to the space of q-Gaussian measures sending N (v, V ) to Nq(v, V )is an isometry with respect toW2.

Moreover, for the time-dependent vectorwt and the time-dependent matrixWt given in (2.3),{Nq(wt, Wt)}t∈[0,1]

is a unique geodesic fromNq(v, V )toNq(u, U ), which shows the convexity of the space ofq-Gaussian measures in the Wasserstein space. 2

Remark 3.1. The convexity of the space of q-Gaussian measures is due to the characterization by the mean and covariance matrix parameter rather than the q-exponential function, which suggests the existence of other convex spaces (see [19]).

Remark 3.2. We briefly explain that the square root of the q-relative entropy satisfies a generalized Pythagorean relation. GivenμPac, letNq(v, V )be a minimizingq-Gaussian measure for the variational problem

min Hq

μNq(u, U ) (u, U )∈Rd×Sym+(d,R) . Then the following Pythagorean relation

HqμNq(u, U ) =HqμNq(v, V ) +Hq

Nq(v, V )Nq(u, U )

holds for anyq-Gaussian measureNq(u, U ). For the proof, see [12, Proposition 3].

4. q-Gaussian measures as solutions to porous medium equation

It is known that the porous medium equation (1.1) allows for a self-similar solution of the form ρq(x, t ):=

Atdα(1q)B|x|2t11−q1

+ =

AB|x|2t1−q1

+ t, where the constantsαandBare given by

α=α(q, d):= 1

d(1q)+2, B=B(q, d):=(1q)α 2(2−q).

The other constantA=A(q, d)is defined by the total mass of the solution and we normalized it such that

Rd

ρq(x, t ) dLd=1.

(7)

To be precise,Ais given by A:=C02α(1q)

α (2q)C1

dα(1q)

.

The solutionρq was discovered by Barenblatt [4] and Pattle [16] and is called theBarenblatt–Pattle solution. It is easy to check the relation

ρq(x, t )=nq

0, CtId (x), where the constantC=C(q, d)is given by

C:=(2q)C1

α A.

As well as the Barenblatt–Pattle solution, it was proved by Ohara and Wada [12, Proposition 5] that a solution to the porous medium solution with an initial data being aq-Gaussian density remainsq-Gaussian densities for all future time. This fact implies that a solution to the porous medium equation on the space ofq-Gaussian measures can be explicitly solved [12, Remark 2]. To do this, we introduce the mapΘon Sym+(d,R)such that

Θ(V ):=(detV )α(1q)V .

Note thatρq(x, t )=nq(0, CΘ(t Id))(x)holds.

Theorem B.For anyqQdandV ∈Sym+(d,R), we set the time-dependent matrixVtas Θ(Vt)=Θ(V )+σ (t )Id, d

dtσ (t )=2α

detΘ(Vt)

1q 2 . Thennq(v, CΘ(Vt))is a solution to the porous medium equation(1.1).

Remark 4.1.The assertion also holds true forq=1.

Proof of Theorem B. For simplicity, we use the following notations

|x|2V :=

x, V1x

, Θt:=Θ(Vt), F (t, x):=

AB|xv|2Θt

+. Then we haveρ(t, x):=F1/(1q)(detΘt)1/2=nq(v, CΘ(Vt))(x)and calculate

ρ2q(t, x) =αF

1−qq (t, x)(detΘt)22q 2B

1−q|xv|2Θ2

tF (t, x)tr Θt1 . Combining well-known equations for a time-dependent invertible matrixXt

d dt

Xt 1 = −Xt 1 d

dtXt

Xt 1, d

dtdetXt=(detXt)tr

Xt1d dtXt

with the assumption d

dtΘt=2α(detΘt)12qId, we obtain

∂t

ρ(t, x) =αF

1−qq (t, x)(detΘt)2−q2 2B

1−q|xv|2Θ2

tF (t, x)tr Θt1 . We thus have

∂tρ= ρ2q ,

proving thatρ(t, x)=nq(c, CΘ(Vt))(x)is a solution to (1.1). 2

(8)

Remark 4.2.It is known that the scaling ρ(t, x)=tρˆ

lnt, tαx

provides a one-to-one correspondence between solutions ρ to the porous medium equation and solutionsρˆ to the nonlinear Fokker–Planck equation of the form

∂tρˆ= ˆ ρ2q +

ˆ ρ

α 2|x|2

.

In particular, the Barenblatt–Pattle solution corresponds to the stationary solutionnq(0, CId)to the nonlinear Fokker–

Planck equation (for instance, see [14]). Since the solutions obtained by Theorem B contain nonself-similar solutions, we obtain nonequilibrium solutions to the nonlinear Fokker–Planck equation. Such solutions help us to understand the asymptotic behavior of solutions to the nonlinear Fokker–Planck equation. As an example, the author [18] demon- strated by using elemental calculations thatNq(0, CId)satisfies theq-logarithmic Sobolev inequality, theq-Talagrand inequality and theq-HWI inequality for anyq-Gaussian measures, which are defined in Section 5 and are key ingre- dients to analyze the asymptotic behavior of solutions to the nonlinear Fokker–Planck equation.

5. Functional inequalities

In this section, we study relations among the functionalsHq,IqandW2in the form of inequality and discuss the importance ofq-Gaussian measures in the inequalities. We deal with the three inequalities, called theq-logarithmic Sobolev inequality,q-Talagrand inequality andq-HWI inequality, of the form

Hq|ν) 1

Iq|ν), W2(μ, ν)

2

λHq|ν), Hq|ν)

Iq|ν)W2(μ, ν)λ

2W2(μ, ν)2.

Though criteria for a probability measure to satisfy the three inequalities have been already provided even in a more general setting (for instance see [1,7,9]), we show criteria to emphasize the importance of theq-Gaussian measure.

We use the same symbol∇for the distributional gradient.

Lemma 5.1.Givenμ1=f1Ld, μ2=f2LdP2Pac, letT be a map such that[id×T]μ1is an optimal transport of them. For aC2-functionΨ onRdsuch thatHessΨ is bounded below by some numberK, we have

Rd

(f2f1)Ψ dLd

Rd

Ψ, T−idf1dLd+K

2W21, μ2)2. (5.1)

If we moreover assume thatf1has a weak derivative, then we have

Rd

f2lnq(f2)f1lnq(f1) dLd

Rd

T −id,∇

f12q dLd

=

Rd

(2q)

T−id,∇lnq(f1)

1. (5.2)

Proof. Since the assumption HessΨ Kprovides Ψ

T (x)Ψ (x)

Ψ (x), T (x)x +K

2xT (x)2,

we obtain (5.1) by integrating it with respect toμ1, where we use the optimality of[id×T] μ1.

We next prove (5.2). Since the equality is trivial, we only prove the inequality. Due to Theorem 2.1 and (2.1), there exists a convex functionϕsuch that

(9)

ϕ=T , Hessϕ=dT , f1=f2(T )det(dT )

hold forμ1-almost everywhere. LetU be the maximal subset whereϕ has the second derivative. Then the change of variable formula fory=T (x)implies

Rd

f2lnq(f2) dLd=

U

f1lnq

f2(T ) dLd

and

Rd

f2lnq(f2)f1lnq(f1) dLd=

U

lnq

f2(T ) −lnq(f1) f1dLd. FixxU and set

b(t ):=lnq

f1(x)

det[Id+t (Hessϕ(x)Id)]

,

which is convex fort∈ [0,1](see [11, Theorem 2.2]). Hence we have b(1)b(0)=lnq

f2 T (x)

−lnq

f1(x) b(0)= −f11q(x)

ϕ(x)−|x|2 2

and by integrating it with respect toμ1=f1Ld, we obtain

Rd

f2lnq(f2)f1lnq(f1)

dLd

U

f11q(x)

ϕ(x)−|x|2 2

f1(x) dLd(x)

Rd

f11q(x)D

ϕ(x)−|x|2 2

f1(x) dLd(x)

=

Rd

(f1)2q,ϕ−id dLd,

whereD is the distributional Laplacian. In the second inequality, we use the Aleksandrov theorem [10] which states that Dϕ coincides with ϕ onU, and the non-negativity of Dϕ on the interior of the domain of ϕ, which is derived from the convexity ofϕ. Thus the lemma is proved. 2

We now give a criterion for a probability measure to satisfy the q-logarithmic Sobolev inequality and the q- Talagrand inequality. ForνP2, we set

P2ac(ν):=

μP2Pacμis absolutely continuous with respect toν .

Proposition 5.2.For ν:=expq(Ψ )LdP2withqQd, we assume that Ψ isC2and that HessΨ is bounded below by a positive numberλ.

(1) Theq-logarithmic Sobolev inequality Hq|ν) 1

Iq|ν) (5.3)

holds for allμP2ac(ν)such that the density ofμhas a first weak derivative.

(2) Theq-Talagrand inequality W2(μ, ν)

2

λHq|ν) (5.4)

holds for allμP2ac(ν).

(10)

Proof. (1) Lemma 5.1 with(f1, f2)=(dμ/dLd,expq(Ψ ))provides

Hq|ν)= −1 2−q

Rd

f1lnq(f1)f2lnq(f2)(2q)lnq(f2)(f1f2) dLd

= −1 2−q

Rd

f1lnq(f1)f2lnq(f2)+(2q)Ψ (f1f2) dLd

Rd

T −id,∇

lnq(f1)+Ψ +λ

2|id−T|2

f1dLd

Rd

1 2λ∇

lnq(f1)−lnq(f2) 2f1dLd

= − 1

Iq|ν),

where we use the assumptionμP2ac(ν)in the second line and complete the square in the second inequality.

(2) Setting1, μ2)=(ν, μ)and adding up the inequality (5.1), (5.2), we have λ

2W2(μ, ν)2Hq|ν). 2

Let us demonstrate that anyq-Gaussian measure provides a nontrivial pair which attains equality in (5.3) and (5.4).

Corollary C. ForV ∈Sym+(d,R), letσ be the largest eigenvalue ofV andvbe the corresponding eigenvector.

Then for anyq >1andv∈Rd, a pair(μ, ν)=(Nq(v+v, V ), Nq(v, V ))attains equality in the inequalities(5.3) and(5.4)forλsatisfying

λσ =C01qC1(detV )12q.

Remark 5.3.For the functionΨ satisfying expq(Ψ )=nq(v, V ), we have HessΨ =C01qC1(detV )(1q)/2V1, that is, the Hessian ofΨ is bounded below byλ. The assumptionq >1 guaranteesP2ac(ν)=P2Pac.

Proof of Corollary C. By the proof of Proposition 5.2, equality holds for a pair (μ, ν)in (5.3) if and only if the inequalities in (5.1), (5.2) are equalities and we have

λ

xT (x) = ∇

lnq

dLd

+Ψ (x)

(5.5) forμ-almost everywhere, whereT is an optimal transport map ofμandν. Similarly, equality holds for a pair(μ, ν) in (5.4) if and only if the inequalities in (5.1) and (5.2) are equalities. To have equality in (5.2),T must be a translation map, that is, T (x)=x +u for some u∈Rd (see [11, Theorem 2.2]). In the case ofT (x)=x +u, equality in each of (5.1) and (5.5) is equivalent to that ∇Ψ (T (x))− ∇Ψ (x)=λu holds for μ-almost everywhere. The pair (Nq(v+t v, V ), Nq(v, V ))satisfies these conditions, hence we have equality in (5.3) and (5.4). 2

Remark 5.4.The proof of Corollary C is needed in order to consider conditions to establish equality in (5.3) and (5.4).

We directly show that the pair (μ, ν)=(Nq(v+v, V ), Nq(v, V )) given in Corollary C attains equality in (5.3) and (5.4) by computing

Hq|ν)=λ

2v2, Iq|ν)=λ2v2, W2(μ, ν)=v.

In this case, the both sides of (5.3) coincide withλ|v|/2 and the both sides of (5.4) are equal to|v|.

(11)

Remark 5.5.Since the square root ofHqbehaves like the distance function in information geometry, theq-Talagrand inequality provides the comparison between the two different geometries, namely, Wasserstein geometry and infor- mation geometry.

We next provide a criterion for a probability measure to satisfy theq-HWI inequality using not Lemma 5.1 but the geodesic equation. For a geodesic{μt}t∈[0,1]in the Wasserstein space, there exists a family{Φt}t∈[0,1] of functions such that

⎧⎪

⎪⎩

∂tμt+div(μtΦt)=0,

∂tΦt+1

2|∇Φt|2=0,

(5.6)

where the functionΦt serves as a “tangent vector field” on the Wasserstein space (a heuristics can be found in [15]

and also see [23,24]). In short, the relation between this “tangent vector field” Φt and an optimal transport mapT fromμ0toμ1is

Φt= [T−id] ◦

Tt1 , Tt:=id+t[T−id].

Proposition 5.6.Forν:=expq(Ψ )LdP2 withqQd andq (d+1)/d, we assume thatΨ isC2 and that HessΨ is bounded below by some numberK. Suppose that the support ofνis convex. Then theq-HWI inequality

Hq|ν)W2(μ, ν)

Iq|ν)K

2W2(μ, ν)2 (5.7)

holds for anyμP2ac(ν)such that the density ofμhas a first weak derive.

Remark 5.7.IfKis positive, then the support ofνis automatically convex (see [13, Lemma 2.5]). The convexity of the support ofνprovides the convexity ofP2ac(ν)in the Wasserstein space (see [8, Corollary 4.3]).

Proof of Proposition 5.6. Let{μt=ftLd}t∈[0,1]be a geodesic fromμP2ac(ν)toνand{Φt}t∈[0,1]be the function satisfying (5.6). We calculate

d

dtHqt|ν)=

Rd

Φt,

lnq(ft)−lnq(f1)

t, (5.8)

d2

dt2Hqt|ν)=

Rd

HessΨ (Φt,Φt)+ft1q 2−q

(1q)(Φt)2+

ij

(HessΦt)2ij

t, (5.9)

d dt

Rd

|∇Φt|2t=0, (5.10)

where we use the condition that−lnq(expq(Ψ ))=Ψ holds on the support ofμtP2ac(ν). Combining the inequality (1q)(Φt)2+

ij

(HessΦt)2ij

1−q+1 d

t)20 (5.11)

provided by the Cauchy–Schwarz inequality and the assumptionq(d+1)/d, with the assumption HessΨ K, we deduce from (5.9) and (5.10) that

d2

dt2Hqt|ν)

Rd

K|∇Φt|2t=KW20, μ1)2=KW2(μ, ν)2. (5.12) It follows that

(12)

Hq|ν)=Hq1|ν)Hq0|ν)

= 1 0

t 0

d2

ds2Hqs|ν) ds+ d ds

s=0

Hqs|ν)

dt

1 0

t 0

KW20, μ1)2ds+ d ds

s=0

Hqs|ν)

dt

=K

2W20, μ1)2+

Rd

Φ0,

lnq(f0)−lnq(f1) t

K

2W20, μ1)2

!

Rd

|∇Φ0|2t

Rd

lnq(f0)−lnq(f1)2t

=K

2W2(μ, ν)2W2(μ, ν)

Iq|ν), (5.13)

where the third line follows from (5.12) and the forth line follows from (5.8). In the fifth line, we apply the Cauchy–

Schwarz inequality. This concludes the proof of Proposition 5.6. 2

The pair ofq-Gaussian measure given in Corollary C also attains equality in the inequality (5.7).

Corollary D.ForV ∈Sym+(d,R), let σ be the largest eigenvalue ofV andvbe the corresponding eigenvector.

Then for anyq >1andv∈Rd, a pair(μ, ν)=(Nq(v+v, V ), Nq(v, V ))attains equality in the inequality(5.7)for λsatisfying

λσ =C01qC1(detV )12q.

Proof. To establish equality in (5.7), HessΨ (Φt,Φt)=K|∇Φt|2holds and the inequalities in (5.11), (5.13) need to be equalities, which is equivalent to∇Φtu, whereuis the difference between the mean ofν andμ. The given pair satisfies these conditions and we have equality in (5.7). 2

We finally give a new relation between theq-logarithmic inequality and theq-Talagrand inequality. To do this, we deal with the following evolution equation given by

⎧⎨

∂tρ= 1 2−q

ρ2q +div(ρ∇Ψ ), xΩ,

ρ=0, xΩc,

(5.14) whereΩis an open set defined by

Ω:=

x∈Rd(1q)Ψ (x) <1 .

For technical reasons, we need to require the solutionsρof (5.14) to satisfy the following conditions (I), (II) and (III).

(I) ρis non-negative and smooth.

(II) ρconserves the total mass and has a finite third moment, that is,

Rd

ρ dLd=

Ω

ρ dLd=1,

Rd

|x|3ρ dLd<.

(III) Forξt(x):=lnq(ρ)−lnq(expq(Ψ )), there exists a locally bounded functiona(t )such that (a) |∇t(x)ξt(y))|a(t )|xy|.

(b) |∇t2(x))|a(t )(1+ |x|3).

(13)

For a functionΨ, we set P(Ψ ):=

fLdPacthe solution to (5.14) with initial dataf satisfies (I)–(III) .

Remark 5.8.LetΨ (x)=λ|x|2/2 with someλ >0. If 1< q < (d+5)/(d+3), then anyq-Gaussian measure belongs toP(Ψ )(anyq-Gaussian measure has a finite third moment ifq < (d+5)/(d+3)). Forq <1,Nq(0, aId)P(Ψ ) ifλa < (C01qC1), which ensures that the support ofNq(0, aId)is contained inΩ.

Theorem E.Forν:=expq(Ψ )LdP2withqQd, we assume thatΨ isC2and thatHessΨ is bounded below by a positive numberK. If there exists some positive numberλsuch that

Hq|ν) 1

Iq|ν)

holds for anyμP2ac(ν)P(Ψ ), then we also have W2(μ, ν)

2 λIq|ν) for anyμP2ac(ν)P(Ψ ).

Remark 5.9.Proposition 5.2 says that the condition HessΨ K >0 implies that the q-Talagrand inequality (5.4) holds with the constantK. However,λis independent of K in Theorem E and the estimate in theq-Talagrand in- equality is improved ifλ > K.

The basic strategy for proving Theorem E is similar to the proof of [15, Theorem 1], where the key ingredients are some variations along the flow (5.14) and the convergence of the solution in the sense ofHqandW2.

Forμ=fLd, letft be the solution to (5.14) with the initial dataf0=f. Thenftsatisfies

∂tft= 1 2−q

ft2q +div(ftΨ )=div(ftξt)

and the conditions (I) and (II) guarantee μt :=ftLdP2ac(ν). We first consider the variations of Hqt|ν) and W2(μ, μt)along the solution to (5.14).

Lemma 5.10.

d

dtHqt|ν)= −Iqt|ν), (5.15)

d+

dtW2(μ, μt)=lim sup

s0

W2(μ, μt+s)W2(μ, μt)

s

Iqt|ν). (5.16)

Proof. Setg:=expq(Ψ ). For any smooth functionηwith compact support, we have d

dt 1

2−q

Rd

ftlnq(ft)glnq(g)(2q)lnq(g)(ftg) η dLd

= −

Rd

ft

"

ξt,

ξt+ 1 2−q

η

# dLd

= −1 2

Rd

η,

ξt2 t− 1 2−q

Rd

η,ξtt

Rd

η|∇ξt|2t,

Références

Documents relatifs

[r]

In the general case, inequality proved in this article, we do not know if the modified logarithmic Sobolev inequality (2.9) implies transportation inequality (2.16).... Using Theorem

[r]

Le but du probl`eme ci-dessous est de d´efinir une classe de mots (les grands Dycks) de deux mani`eres (par une condition num´erique, par une grammaire) et de donner

Le sujet est (en principe) trop long pour le temps imparti, il est donc conseill´e de traiter en priorit´e les questions que l’on sait faire en en indiquant clairement la

Get a priori qualitative properties: positivity, decay at infin- ity, regularity, Euler-Lagrange equations [1960 → ] Standard results are

- Expliquer comment la fécondation assure le passage de la phase haploïde à la phase diploïde La fécondation en assurant la fusion des noyaux des 2 gamètes haploïdes rétablie

[r]