• Aucun résultat trouvé

Optimal dynamic transport with mass consumption

N/A
N/A
Protected

Academic year: 2021

Partager "Optimal dynamic transport with mass consumption"

Copied!
28
0
0

Texte intégral

(1)

HAL Id: hal-02516695

https://hal.archives-ouvertes.fr/hal-02516695v3

Preprint submitted on 9 Apr 2020

HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires

Optimal dynamic transport with mass consumption

Romain Hug

To cite this version:

Romain Hug. Optimal dynamic transport with mass consumption. 2020. �hal-02516695v3�

(2)

Research report

: Optimal dynamic transport with mass consumption

Romain Hug

The problem of optimal transport has remained very academic for a long time at least in the formu- lation that G. Monge made of it, and took on a more concrete aspect with the idea of L. Kantorovitch to formulate it in terms of optimal allocation of resources. More recently, works such as those by R.

J. McCann [13], Y. Brenier and J.-D. Benamou [1] made it possible to give a dynamic formulation of this type of problem (mainly for quadratic Euclidean transport costs). This formulation open the way to easier numerical process since it allows to apply classical numerical methods in the domain of variational and convex optimization under constraint. For absolutely continuous measures with L2-densities, we can cite in particular the augmented Lagrangian method [1], splitting-proximal or primal-dual[8, 14, 3]. Our present work will be based on the augmented Lagrangian formulation.

An important tool from the Optimal Transportation theory is the Wasserstein distance, a metric allowing to estimate the difference between two probability measures (or more generally, of two measures with the same "mass"). This distance represents a "global transportation cost" between two measures, assuming the local mass displacement cost to depend on the distance and linearly on the local mass.

One of the main limitation of Optimal Transport is that it can only be applied between measures with the same "total mass". This is why a lot of research works have proposed various extensions of these Wassertein distances to measure spaces with potentially different "masses" [2, 17, 16].

The use which is made of Optimal Transport for interpolation problems has also encouraged re- search for new optimal "unbalanced" mass transport models, and in particular dynamic models.

Indeed, in addition to offering a variational framework more suited for numerical processing, those models provide a continuous time evolution (a "geodesic interpolation" for the Wasserstein metric in the classic case) between the source and target measures. The dynamic formulation proposed in [1] represents the constraint of mass conservation during the time-continuous transport with a con- servative continuity equation. Their model uses absolutely continuous measures withL2-densities:

ifρ(t, x)represents the time evolution of the interpolated density, andv(t, x)represents the velocity field derived from the transportation map, we therefore havetρ+ div(ρv) = 0.

Many models, which aim to generalize these interpolations for measures or densities with different mass, use a source term in the continuity equation (which becometρ+ div(ρv) =f) which has to be taken into account in the energy fonctional (see [9, 10], or [6, 5, 15, 11, 12] with also applications to images processing, or [7] for a use in data assimilation) .

Research report produced within the teamOptimization and Optimal Control (RICAM, Johannes Kepler Uni- versity, 4040 Linz, Austria, 2017-2018)

Université d’Artois, LML, 62300 Lens, France

(3)

The model we propose to study in the present work starts from this idea of introducing a source term in the continuity equation and adapting the energy term. However, it does not currently aim to create an interpolation between two different measures or densities, but rather go back directly to an optimal allocation problem under the constraint of a displacement cost. A non standard feature of our model in that cost directly reprensents a loss of the mass transported: the objective of the problem is therefore to minimize this mass loss. In this model, the initial density does not need to be entirely transported, and the question is to determine the correct allocation for a part of this initial mass by taking into account the dissipation of it along its way.

In this report, we will limit ourselves to the proof of the equivalence between the resolution of the strict statement of this model (but whose dynamic formulation is reduced to a non-convex optimization problem), and the resolution of a relaxed version of it (which is a convex problem).

1 Brief statement of the problem

Consider a production area (for instance coal), and an area to supply. The production area pro- duces more coal than the needs of the area to be supplied. However, transport also requires coal consumption. The objective is to determine an optimal allocation map that ensures supply while minimizing the consumption of coal required for

transportation. resource to deliver

resource consumed by transportation

P roduction ar ea Consu mption ar ea

Transport

(4)

We choose a dynamic model (inspired by [1]) and we denote byρthe mass to be transported and by mthe momentum. In this first model, we assume that consumption is proportional to the energy dispersed by solid friction (for example the friction of the train on the rails), and therefore that it is proportional to the norm of the momentum |m|, with a coefficient of dispersion k. Here, we choose to overlook the consumption of the initial impulse (even if it may be an interesting thing to look out). It is clear that for such a model several solutions are possible. We are looking for a dynamic model which does not take into account the mass that is not displaced (which is useless), and which does not allow for overlapping of trajectories. That is why, in addition to the energy term|m|, we add a penalization term(/2) |m|2/(2ρ) +ρ

with fixed.

Solving this first problem opens the way to the asymptotic analysis when0 (preferably inL2 space).

We set the domainQ= (0,1)×Ω, with a convex bounded domain (sufficiently regular) ofRd. Letρ0 and ρ1 be two (non-negative) densities of L2(Ω) (withR

ρ0dxR

ρ1dx), with compact supports inΩ.

We define the following kinetic energy term(1/2)J:

J(α, β) =

|β|2

α if α >0,

0 if (α, β) = (0,0), +∞ else.

(1-1)

We also define the space of constraintsC:

C=

(ρ, m)L2(Q)d+1, ρ(0,·)ρ0, ρ(1,·) =ρ1, ∂tρ+ divx(m) =−k|m|, hm, ~ni∂Ω= 0 (1-2) We want to solve the following problem:

inf

(ρ,m)∈C

2

Z

Q

(J(ρ, m) +ρ)dx dt+ Z

Q

|m|dx dt

(1-3) In order to more easily a dual formulation of this problem, we choose to expand the space of constraints as follow:

Cre=

(ρ, m)L2(Q)d+1, ρ(0,·)ρ0, ρ(1,·)ρ1, ∂tρ+ divx(m)≤ −k|m|, hm, ~ni∂Ω0 (1-4) Therefore, we would stude the problem of existence and uniqueness of a couple density-momentum , m)such that

, m) = argmin

(ρ,m)∈Cre

2

Z

Q

(J(ρ, m) +ρ)dx dt+ Z

Q

|m|dx dt

(1-5) and we will prove (in section 3) that such a couple would satisfy ultimately, m)∈ C.

(5)

To model the space of constraintCre, we introduce the Lagrange multiplierϕas follow:

inf

(ρ,m)∈Cre

Z

Q

2(J(ρ, m) +ρ) +|m|

dx dt

= inf

(ρ,m)∈L2(Q) sup

0≤ϕ∈H1(Q)

Z

Q

2(J(ρ, m) +ρ) +|m|

dx dt

+ Z

Q

ϕ(∂tρ+ divx(m) +k|m|)dx dt +

Z

(ρ(0,·)ρ0)ϕ(0,·)dx Z

(ρ(1,·)ρ1)ϕ(1,·)dx

= inf

(ρ,m)∈L2(Q) sup

0≤ϕ∈H1(Q)

Z

Q

2(J(ρ, m) +ρ) + (kϕ+ 1)|m|

dx dt

+ Z

Q

t,xϕ·(ρ, m)dx dt+ Z

[ϕ(1,·)ρ1ϕ(0,·)ρ0]dx

(1-6)

By defining µ= (ρ, m)L2(Q)d+1, G(ϕ) =R

[ϕ(1,·)ρ1ϕ(0,·)ρ0]dx, and using the Legendre’s transformation

Z

Q

2(J(ρ, m) +ρ) + (kϕ+ 1)|m|

dx dt= sup

q

[q·µF(ϕ, q)] (1-7) with

F(ϕ, q) =iPϕ(q) =

0 if q∈ Pϕ,

+∞ else. (1-8)

and Pϕ =

q= (a, b)L2(Q)×L2(Q)d, a+ 1

2(max{|b|, kϕ+ 1} −1)2 2 a.e.

(1-9) Then, we have

inf

(ρ,m)∈Cre

Z

Q

2(J(ρ, m) +ρ) +|m|

dx dt

= inf

0≤ϕ∈H1 inf

q∈L2 sup

µ∈L2

[F(ϕ, q) +G(ϕ) +hµ,t,xϕqiL2]

(1-10)

Therefore, we search for a saddle point, q, µ)in S=H+1(Q)×L2(Q)d+1×L2(Q)d+1 (where H+1(Q) =H1(Q), ϕ0}) of the LagrangianLdefined by

L(ϕ, q, µ) =F(ϕ, q) +G(ϕ) +hµ,t,xϕqiL2(Q), (1-11) i.e. such thatL, q, µ)L, q, µ)L(ϕ, q, µ)for all(ϕ, q, µ)S.

In the following, when there is no ambiguity, we will write , q, µ) instead of , q, µ) in order to lighten the notation.

First of all, we would like to study the relevance of an augmented Lagrangian method (inspired by the algorithm developed in [1]): in this new formulation, where the old primal variableµ= (ρ, m)

(6)

is became the Lagrange multiplier, and conversely the old Lagrange multiplierϕis now became a primal variable, the new (linear) constraint is nowt,xϕ=q. Hence, we introduce, for allr0, the augmented LagrangianLr by:

Lr(ϕ, q, µ) =F(ϕ, q) +G(ϕ) +hµ,t,xϕqiL2(Q)+r

2k∇t,xϕqk2L2(Q). (1-12) Remark 1.1. In this Lagrangian formulation, we choose a partial dualisation by including the non-linear part of the constraint in the energy term (which then depends on the Lagrange multiplier ϕ). This choice seemed easier to us with regard to the purely theoretical study of the properties of the solutions of the model we propose here.

For a numerical exploitation, as an application of an Augmented Lagrangian method (as for instance in [1]), we would rather choose a total dualisation (thus not including the non-linear part of the constraint in the energy term), in order to obtain a Lagrangian formulation equivalent to the first, but more exploitable numerically. We could use the following Lagrangian:

L(ϕ, q, µ) =F(q) +G(ϕ) +hµ,t,xϕkϕc(b)qiL2(Q),

where we definec(b)(t, x) = 0

b(t,x)

|b(t,x)|

!

if b(t, x)6= 0andc(b)(t, x) = 0 ifb(t, x) = 0, and alsoF(q) = IndP

0(q), with P0=

q= (a, b)L2(Q)×L2(Q)d, a+ 1

2(max{|b| −1,0})2 2 a.e.

The corresponding augmented Lagrangian become:

Lr(ϕ, q, µ) =F2(q) +G(ϕ) +hµ,t,xϕkϕc(b)qiL2(Q)+r

2k∇t,xϕkϕc(b)qk2L2(Q).

2 Some few theoric results on the augmented Lagrangian for- mulation

Let us characterize the saddle points of the Langrangians Lr, which are, as we will see, the same as forL.

Proposition 2.1. Let >0 and , q, µ)S. Then, for all r0, the triplet, q, µ)is a saddle point ofLr if and only if it satisfies the followings assumptions:

1. q=t,xϕ, 2. q= (a, b)∈ Pϕ,

3. the vector µ= (ρ, m)is orthogonal to the paraboloidPϕ inq, i.e. , qqiL2 0 for allq∈ Pϕ.

4. µ= (ρ, m)∈ Cre: more precisely, we have:

∀ϕH1(Q), ϕ0, Z

Q

(∂tϕ ρ+xϕ·m)dx dtk Z

Q

|m|ϕ dx dt+G(ϕ)0. (2-13)

(7)

Especially forϕ=ϕ, we have:

Z

Q

(∂tϕρ+xϕ·m)dx dtk Z

Q

|mdx dt+G(ϕ) = 0. (2-14) Since this is true for any r 0, the saddle points are therefore common to all Lagrangians Lr, especially forL whenr= 0.

Before giving proof of this proposition, we will need to state (ond prove) an other one wich will characterize specially the optimal couple density-momentumµ= (ρ, m).

Proposition 2.2. Let µ = (ρ, m)L2(Q)d+1 and q= (a, b)∈ Pϕ, such thatµ is orthog- onal toPϕ inq, i.e. , qqiL2 0, ∀q∈ Pϕ. Therefore:

1. We haveρ0.

2. For almost all (t, x)Q, m(t, x) =

hρ

1|b+1|

bi

(t, x), if |b|(t, x)(t, x) + 11,

0 else.

(2-15) In other words,

m=wρb, with w= max

0, 1

1+ 1

|b|

(we consider 1/0 = +∞). (2-16)

3. We havea=(w|b|)2

2 +

2 on the support of ρ, i.e. ρ

a+(w|b|)2

2

2

= 0.

Proof:

(I)0]: Letf :RRbe defined as follow: f : yR7→

( y

|y|, if y6= 0, 0, if y= 0.

For all y R, yf(y) = |y|, then ρf) =| and −1 f) 1. We define q = (a, b)with a=f(ρ)1 +aandb=b. Therefore, we have

a+ 1

2(max{|b|, kϕ+ 1} −1)2=f)1 +a+ 1

2(max{|b|, kϕ+ 1} −1)2 2, i.e. q∈ Pϕ, then0≥ hµ, qqiL2=

Z

Q

(|ρ| −ρ)dx dt.

As| −ρ0, we conclude that|=ρ almost everywhere, that is to sayρ0.

(II)[m=wρb]: Firstly, let us remark that for all~uL((0,1)×Ω), and for allδ >0, and by definingbδ=b+δ~u, we have:

(|bδ| −1)2=|bδ|22(kϕ+ 1)|bδ|+ (kϕ+ 1)2

= (|b|2+ 2δhb, ~ui+δ2|~u|2)2(kϕ+ 1)|bδ|+ (kϕ+ 1)2

= (|b| −1)2+ 2δhb, ~ui+δ2|~u|2(|bδ| − |b|).

(2-17)

Let~uL((0,1)×Ω)and letλ >0.

(8)

We define

Aλ={(t, x)(0,1)×Ω, |b| −1λ}.

Let~u0=1A0~u(we assume~u06= 0). Let bδ =b+δ~u0, aδ =1

2(max{|bδ|, kϕ+ 1} −1)2+ 2 For allδ 2k~uλ

0k (then such that|bδ| ≥+ 1onAλ), we have Z

Aλ

(|bδ| −1)2dx dt= Z

Aλ

(|b| −1)2dx dt +δ

Z

Aλ

2hb, ~u0i −2hb, ~u0i+δ|~u0|2

|bδ|+|b|

dx dt+δ2 Z

Aλ

|~u0|2dx dt (2-18)

(we recall that(|bδ|+|b|)(t, x)≥ |b|(t, x)(t, x) + 11 onAλ).

We haveqδ = (aδ, bδ)∈ Pϕ. Then 0≥ hµ, qδqiL2((0,1)×Ω)

= Z

Aλ

(aδadx dt+ Z

Aλ

hbδb, midx dt

= Z

Aλ

−aρρ

2(|bδ| −1)2+

2ρ+δhm, ~u0i

dx dt

= Z

Aλ

a+ 1

2(|b| −1)2 2

ρdx dt+δ2 Z

Aλ

|~u0|2 2

2(kϕ+ 1)

|bδ|+|b| 1

dx dt

δ Z

Aλ

ρ

hb, ~u0i −2(kϕ+ 1)

|bδ|+|b| hb, ~u0i

+δhm, ~u0i

dx dt

δ2 Z

Aλ

|~u0|2 2

2(kϕ+ 1)

|bδ|+|b| 1

dx dtδ Z

Aλ

ρ

12(kϕ+ 1)

|bδ|+|b|

bm, ~u0

dx dt (2-19) Then

δ Z

Aλ

|~u0|2 2

2(kϕ+ 1)

|bδ|+|b| 1

dx dt−

Z

Aλ

ρ

1 2(kϕ+ 1)

|b+δ~u0|+|b|

bm, ~u0

dx dt0.

(2-20) Therefore, whenδ0, we have

Z

Aλ

ρ

1+ 1

|b|

bm, ~u0

dx dt0. (2-21)

Whenλ0, we finally have Z

Q

ρ

1+ 1

|b|

bm, ~u0

dx dt0. (2-22)

(9)

We define

Bλ={(t, x)(0,1)×Ω, |b| −1≤ −λ}.

Let~v0=1B0~u(we assume~v06= 0). Letbδ=b+δ~v0, aδ =1

2(max{|bδ|, kϕ+ 1} −1)2+ 2. For all δ 2k~vλ

0k, we have|bδ| ≤+ 1onBλ, i.e. aδ =/2(and a/2) onBλ, and thenqδ= (aδ, bδ)∈ Pϕ.

Thus, for allδ 2k~vλ

0k, we have 0≥ hµ, qδqiL2((0,1)×Ω)=

Z

Bλ

(aδadx dt+ Z

Bλ

hbδb, midx dt

= Z

Bλ

2 a

ρdx dt+δ Z

Bλ

h~v0, midx dtδ Z

Bλ

h~v0, midx dt.

(2-23)

Therefore, whenλ0, we have Z

Q

h~v0, midx dt0.

As1B0+1A0 = 1, and then~v0+~u0=~u, we finally have, and this for all~uL((0,1)×Ω), Z

Q

hwρbm, ~uidx dt0, (2-24) with

w= max

0, 1

1+ 1

|b|

,

and we then concludem=wρb. (III) a+(w|b|)2/2/2

= 0]: We define the vector q= (a, b) ∈ Pϕ, with b =b and a=21(max{|b|, kϕ+ 1} −1)2+2, whence

0≥ hµ, qqiL2 = Z

Q

ρ

a+ 1

2(max{|b|, kϕ+ 1} −1)2 2

dx dt.

Asρ

a+ 1

2(max{|b|, kϕ+ 1} −1)2 2

0, we then have

ρ

a+(w|b|)2

2

2

=ρ

a+ 1

2(max{|b|, kϕ+ 1} −1)2 2

= 0 (2-25) almost everywhere.

(10)

Proof of Proposition 2.1:

Firstly, let us assume that the triplet, q, µ)satisfies the four assumptions of the state- ment.

Letr0. LetϕH1(Q)such thatϕ0, and letq∈ Pϕ, i.e. such thatF(ϕ, q) = 0. Then, according to (2-13), we have

L(ϕ, q, µ) =F(ϕ, q) +G(ϕ) +,t,xϕqiL2(Q)+r

2k∇t,xϕqk2L2(Q)

k Z

Q

|m|ϕ dx dt− hµ, qiL2(Q).

(2-26)

We define the setsAandB by

A={(t, x)(0,1)×Ω, |b(t, x)| −kϕ(t, x)1>0}, and

B={(t, x)(0,1)×Ω, |b(t, x)| −kϕ(t, x)10}.

According to (2-15), we havem=wρb, and then k

Z

Q

|m|ϕ dx dt− hµ, qiL2(Q)

= Z

Q

|m|dx dt+ Z

Q

(kϕ+ 1)|m|dx dt Z

Q

dx dt Z

Q

b·mdx dt

≥ − Z

Q

|m|dx dt Z

Q

(|b| −1)|m|dx dt Z

Q

dx dt

≥ − Z

Q

|m|dx dt+ Z

B

(kϕ+ 1− |b|)|m|dx dt Z

B

dx dt

Z

A

a+ 1

2(|b| −1)2

ρdx dt 2

Z

A

(w|b|)2ρdx dt

≥ − Z

Q

|m|dx dt 2

Z

B

ρdx dt 2

Z

A

ρdx dt 2

Z

A

(w|b|)2ρdx dt

≥ − Z

Q

|m|dx dt 2

Z

Q

1 + (w|b|)2

ρdx dt.

(2-27)

We haveb·m =b·(wρb) =wρ|b|2=|b| · |m|, and moreover, by definition ofw (see (2-16)), we have(|b| −1)|m|=(w|b|)2ρ. Thus,

k Z

Q

|mdx dt− hµ, qiL2(Q)

= Z

Q

|m|dx dt Z

Q

aρdx dt+ Z

Q

(kϕ+ 1)|m|dx dt Z

Q

b·mdx dt

= Z

Q

|m|dx dt Z

Q

aρdx dt Z

Q

(|b| −1)|m|dx dt

= Z

Q

|m|dx dt Z

Q

a+(w|b|)2

ρdx dt.

(2-28)

Références

Documents relatifs

We introduce the optimal transportation interpretation of the Kantorovich norm on the space of signed Radon measures with finite mass, based on a generalized Wasserstein distance

In the present paper, we study the case where a wide angle Compton scattering subprocess γ(q q) ¯ → γρ characterized by the large scale M γρ (the invariant mass of the final

This idea was subsequently developed in infinite-dimensional spaces for proving the convergence to steady state for bounded solutions of several local equations such as

Table 2 summarizes the features of the three community-contributed commands. Al- though eventstudy and eventstudy2 do not share any programming code, the lat- ter can be considered

The cases are categorized by the number of periods (3 or 6 periods of replenishment planning), the proportion of the inventory storage cost in relation to the transportation cost

Since the works by Ball & Carr [2] and Stewart [24], several articles have been devoted to the existence and uniqueness of solutions to the SCE for coagulation kernels which

In particular, we will show that for the relativistic cost functions (not necessarily highly relativistic, then) the first claims is still valid (even removing the

One of possible ways to understand the interaction between the anomalous diffusion operator (given by Λ α or, more generally, by the L´evy diffusion operator) and the nonlinearity