Optimal Transportation for Data Assimilation

(1)

HAL Id: hal-01349637

https://hal.archives-ouvertes.fr/hal-01349637

Submitted on 28 Jul 2016

HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.

Optimal Transportation for Data Assimilation

Nelson Feyeux, Maëlle Nodet, Arthur Vidard

To cite this version:

Nelson Feyeux, Maëlle Nodet, Arthur Vidard. Optimal Transportation for Data Assimilation. 5th International Symposium for Data Assimilation (ISDA 2016), Jul 2016, Reading, United Kingdom.

2016. �hal-01349637�

(2)

Optimal Transportation for Data Assimilation

Nelson FEYEUX, Maëlle NODET, Arthur VIDARD Inria & Université Grenoble-Alpes

Data assimilation

Given

•a physical system and its state x(t, x);

•partial observations of the system (y

^o_i

)

i

;

•a (numerical) model M simulating the evo- lution of x;

✬

✫

✩

✪

E.g., for the atmosphere, the state x(t, x, y) gathers the diﬀer- ent variables

•

humidity H(t, x, y);

•

velocities u(t, x, y);

•

temperature T (t, x, y);

•

pressure p(t, x, y).

Can we estimate the initial condition x

0

of the system?

t

0

− 8h t

0

− 6h t

0

− 4h t

0

− 2h

Variational data assimilation consists in retrieving x

₀

by minimizing J (x

₀

) := �

i

d �

H

i

M

i

(x

₀

)

� ��

Mapping ofx0on the space ofyi^o

, y

^o_i

�

2

+ ω d � x

₀

, x

^b₀

�

2

. (1)

It is common for the distance d to be a weighted L

²

distance. Our main goal to use the Wasserstein distance W

2

instead, which seems very interesting when dealing with dense data (see right panel). The Wasserstein cost function writes

J

W

(x

₀

) := �

i

W

2

� H

i

M

i

(x

₀

), y

^o_i

�

2

+ ω W

2

� x

₀

, x

^b₀

�

2

. (2)

Optimal transportation and the Wasserstein distance

For two functions ρ

0

(x) and ρ

1

(x), the square of the Wasserstein distance W

2

(ρ

0

,ρ

1

) is deﬁned as the minimal kinetic energy necessary to transport ρ

0

to ρ

1

,

W

2

(ρ

₀

, ρ

₁

)

²

:= inf (ρ(t, x), v(t, x))

∂

t

ρ + div(ρv) = 0 ρ(0, x) = ρ

0

(x), ρ(1, x) = ρ

1

(x)

1 2

��

[0,1]×Ω

ρ | v |

²

dtdx.

For the Wasserstein distance to be well-deﬁned, one needs ρ

0

≥ 0, ρ

1

≥ 0 and �

Ω

ρ

0

=

�

Ω

ρ

1

= 1.

Average w.r.t the Wasserstein distance

The average, or barycenter, minimizes W

2

(ρ, ρ

0

)

²

+ W

2

(ρ, ρ

1

)

²

. It is also the optimal ρ in the deﬁnition of W

2

(ρ

0

, ρ

1

)

²

at time t = 1/2.

ρ

0

ρ

1

W

2

average L

2

average

Example of use of the Wasserstein distance

(Source : Cuturi, Doucet,Fast computation of Wasserstein barycenters)

← Images to interpolate

← Euclidean barycenter

← Wasserstein barycenter

Results on a Shallow-water equation

Let the model M be a Shallow-Water equation, with initial condition (h

0

, u

0

),

M:

 



 

∂h

∂t + div(ρu) = 0

∂u

∂t + u · ∇ u = − g ∇ h.

We control the initial condition h

0

only, thanks to the Wasserstein cost function J

W

. We set u

₀

= 0.

✬

✫

✩

✪

t

₁

t

₂

t

₃

t

₄

t

₅

t

₆

The observations of h

^true

at times t

i

✬

✫

✩

True and background initial conditions

✪

Results :

✬

✫

✩

✪

Analysis h

0

of the assimilation when using the Euclidean (L

²

) or the Wasser- stein (W

2

) distance.

✬

✫

✩

✪

Values of h and u for the background and true states, as well as analysis for Euclidean and Wasserstein distances, at time t = t

6

Specifities on using the Wasserstein distance

•

The Wasserstein distance is only deﬁned for probability measures, i.e. ρ s.t.

ρ ≥ 0 and �

Ω

ρ = 1

Relaxations of the latter constraint are possible, however complex;

•

the W

2

interpolation works well if ρ

₀

and ρ

₁

are of distinct support;

•

when J (ρ

ⁿ₀

) → min

ρ0

J (ρ

0

), then there is only weak convergence of ρ

ⁿ₀

to ρ

^opt₀

: oscillations or diracs can occur!

•

Computing the Wasserstein distance is expensive [Peyré, Papadakis, Oudet, 2013];

•

The minimization of J

W

is performed through a gradient descent, using the Wasser- stein gradient, arising from the use of the following Wasserstein scalar product de- pending on ρ

0

,

For η, η

^�

s.t. �

Ω

η =

�

Ω

η

^�

= 0

Let Φ, Φ

^�

s.t. −div(ρ

0

∇ Φ) = η (with Neumann BC)

−div(ρ

0

∇ Φ

^�

) = η

^�

Then �η, η

^�

�

W

=

�

Ω

ρ

0

∇Φ · ∇Φ

^�

Optimal Transportation for Data Assimilation

HAL Id: hal-01349637

https://hal.archives-ouvertes.fr/hal-01349637

Submitted on 28 Jul 2016

HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.

Optimal Transportation for Data Assimilation

Nelson Feyeux, Maëlle Nodet, Arthur Vidard

To cite this version:

Nelson Feyeux, Maëlle Nodet, Arthur Vidard. Optimal Transportation for Data Assimilation. 5th International Symposium for Data Assimilation (ISDA 2016), Jul 2016, Reading, United Kingdom.

2016. �hal-01349637�

Optimal Transportation for Data Assimilation

Nelson FEYEUX, Maëlle NODET, Arthur VIDARD Inria & Université Grenoble-Alpes

Data assimilation

Given

•a physical system and its state x(t, x);

•partial observations of the system (y

)

;

•a (numerical) model M simulating the evo- lution of x;

E.g., for the atmosphere, the state x(t, x, y) gathers the diﬀer- ent variables

humidity H(t, x, y);

velocities u(t, x, y);

temperature T (t, x, y);

pressure p(t, x, y).

Can we estimate the initial condition x

of the system?

t

− 8h t

− 6h t

− 4h t

− 2h

Variational data assimilation consists in retrieving x

by minimizing J (x

) := �

d �

H

M

(x

)

� �� �

, y

�

+ ω d � x

, x

�

. (1)

It is common for the distance d to be a weighted L

distance. Our main goal to use the Wasserstein distance W

instead, which seems very interesting when dealing with dense data (see right panel). The Wasserstein cost function writes

J

(x

) := �

W

� H

M

(x

), y

�

+ ω W

� x

, x

�

. (2)

Optimal transportation and the Wasserstein distance

For two functions ρ

(x) and ρ

(x), the square of the Wasserstein distance W

(ρ

,ρ

) is deﬁned as the minimal kinetic energy necessary to transport ρ

to ρ

,

W

(ρ

, ρ

)

:= inf (ρ(t, x), v(t, x))

∂

ρ + div(ρv) = 0 ρ(0, x) = ρ

� ��