• Aucun résultat trouvé

Combining variational interpolation (DIVAnd) and neural networks to generate ocean climatologies from in situ observations

N/A
N/A
Protected

Academic year: 2021

Partager "Combining variational interpolation (DIVAnd) and neural networks to generate ocean climatologies from in situ observations"

Copied!
1
0
0

Texte intégral

(1)

Combining variational interpolation (DIVAnd) and neural

networks to generate ocean climatologies from in situ

observations

Alexander Barth

(1)

Peter Herman

(2)

, Charles Troupin

(1)

, Aida Alvera-Azc´

arate

(1)

, Jean-Marie Beckers

(1)

(1)

GHER, University of Liege, Belgium.

(2)

Delft University of Technology, The Netherlands. Contact: A.Barth@uliege.be

Data-Interpolating Variational

Analysis

I DIVAnd: Data Interpolating Variational Analysis in n-dimensions

I Objective: derive a gridded climatology from in situ observations

I The variational inverse methods aim to derive a continuous field which is:

• close to the observations (it should not nec-essarily pass through all observations because ob-servations have errors)

• “smooth” (i.e. small first and second deriva-tives)

I DIVAnd is essentially a monovariate reconstruction method

I How can it be extended to use other related vari-ables?

Observations

Analysis

Multivariate analysis

I Multivariate analysis with covariables with a list of covariable z1, z2, ...

x = x0 + f (z1, z2, ..., W1, b1, W2, b2, ...)

where f is a non-linear function of the

known covariables and unknown parameters W1, b1, W2, b2, ...)

I The structure of the function f is given here by a neural network (multilayer perception).

I The field x0 is also unknown. Its spatial structure is constrained by DIVAnd.

Neural network

I For every location j, initially the value of vector v1 are the co-variables at the location j.

I This vector is linearly transformed by a weight ma-trix Wk and an offset vector bk and then a non-linear activation function (here RELU) is applied to each element element of the resulting vector (ex-cept for the last step).

vj(k+1) = g(k+1)(v(k)j Wk + bk)

I Here, the weights Wk and bk do not dependent on space but the longitude and latitude are one of the covariables.

Experiments with synthetic

observations

I Create a series of random field which are the “co-variables”

I Create synthetic observations by combining these covariables: true field

I Sample these fields at random location: synthetic observations

I Perturb these covariables as these covariables are not perfectly known in practice

I Try to recover the true field from the synthetic ob-servation using the imperfectly known covariables using the neural network

Logistic regression problem

p(Model 1 | obs.) = 0.9 · 0.7 · (1 − 0.1) = 0.57

p(Model 2 | obs.) = 0.3 · 0.1 · (1 − 0.3) = 0.02

I Similar as the previous case, but the true field is a probability of occurrence

I synthetic observation are binary (occurrence or not)

I cost-function to minimize the based on the negative log-likelihood (i.e. find the model which maximize the probability of the observations)

Application

I A gridded data product for 40 zooplankton species using DIVAnd and the neural network library Knet in Julia.

I The neural network uses the following variables as input: • dissolved oxygen • salinity • temperature • chlorophyll concentration • bathymetry

• the distance from coast

• the position (latitude and longitude) and the year

I Abundance values are expressed in number per m2 and transformed by the function log(x/a + 1) where a is 1 m−2.

I The covers the area from 9°E to 30.8°E and 53°N to 66.1°N at a resolution of a tenth of a degree.

I Gridded data product for the years 2007, 2008, 2010, 2011, 2012 and 2013 have been made. No observations were available for the year 2009.

I The fields represent the yearly average abundance.

I For every specie the correlation length and signal to noise ratio is estimated using the spatial variability of the observations.

Results

I Interpolated field show good agreement with the observations and the cross-validation data points

I Complex spatial dependencies could be learned from the covariable

Limnocalanus macrurus macrurus (year 2013)

Acartia (Acanthacartia) bifilosa (year 2007)

Conclusions

I Neural network can extract non-linear relationships useful to generated gridded data products

I We present essentially a multivariate extension to DIVAnd where the dependency to other variables (“covariables”) are estimated from the observations

I Tests with synthetic data show that the underlying true field can be reconstructed from observations, even when the covariables are not perfectly known

I The technique was also applied to abundance of 40 zooplankton species in the Baltic

I The gridded dataset for all 40 species is available at http://www.emodnet-biology.eu/.

Acknowledgments

F.R.S.-FNRS (Belgium), EMODnet Biology and Sea-DataCloud are acknowledged for their funding and support.

Références

Documents relatifs

2014 Consideration of the eigenproblem of the synaptic matrix in Hopfield’s model of neural networks suggests to introduce a matrix built up from an orthogonal set,

We do not discuss the problem of storage capability but compare our results on retrieval of spurious state with the ones derived in the Hopfield.. model

Lorsque des param `etres sont partag ´es : R ´eseau `a convolution (CNN) R ´eseau totalement interconnect ´e (Hopfield, Boltzmann).. R ´eseau r

73 Comparison between neural network prediction and leasl squares eslimate of the pitch moment (horizontal circular arc r' = 0.1) 88... INTRODUCTION AND

The policy-based solutions parameterize the policy function directly by a neural architecture and try to find the optimal policy using gradient ascent, while the value-based

This method has been implemented using the USPTO Patent Database, with a comparative study of two Neural Networks: a Wide and Deep Neural Network and a Recurrent Neural Network,

The proposed algorithms for processing images based on the extraction and compression of features using neural networks for colorization of black and white images are based

We also discuss new developments and new ideas in this area, in particular how ANNs can be used to model high-dimensional discrete and continuous data to deal with the curse