Particle-based methods for parameter estimation and tracking: Numerical experiments

(1)

HAL Id: hal-00977630

https://hal.inria.fr/hal-00977630v2

Submitted on 11 Apr 2014

HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.

tracking: Numerical experiments

Johann Fichou, François Le Gland, Laurent Mevel

To cite this version:

Johann Fichou, François Le Gland, Laurent Mevel. Particle-based methods for parameter estimation and tracking: Numerical experiments. [Research Report] PI 1604, 2004. �hal-00977630v2�

(2)

I

^R

^I ^S

A

INS^T ITUT^D

E R ECH^E

R^C HE^E

N I^N

FORMA^T I^Q^U

EE^T S^YS

TÈ^MES^A

LÉA^TO^IR^E S

P U B L I C A T I O N I N T E R N E

N^o

I R I S A

CAMPUS UNIVERSITAIRE DE BEAULIEU - 35042 RENNES CEDEX - FRANCE

ISSN 1166-8687

1604

PARTICLE–BASED METHODS

FOR PARAMETER ESTIMATION AND TRACKING : NUMERICAL EXPERIMENTS

JOHANN FICHOU, FRANÇOIS LE GLAND, LAURENT MEVEL

(3)

(4)

Particlebased Methods

for Parameter Estimation and Tracking : Numerical Experiments

Johann Fichou

*

, FrançoisLe Gland

*

, Laurent Mevel

*

Thème 4 Simulation et optimisation de systèmes complexes

Projet SIGMA2 (^σ²)

Publication interne n1604 Février 2004 59 pages

Abstract: The purpose of this work is to obtain as much intuition as possible, through numerical experiments in a simple case where exact solutions are explicitly available, about the particle approximation of nite signed measures. A prototypical example of a nite signed measure is the derivative, w.r.t. a parameter of the model, of some probability distributions related with a hidden Markov chain. This includes prior, prediction, ltering probability distributions, etc. Two points of view are considered here, to feel the quality of the approximation, at least in a qualitative manner :

(i) how accurate is the particle approximation of the nite signed measure, in view of an histogram representation of the weighted particle system ?

(ii) considering the loglikelihood function and the score function, how close is the approximate expression provided by the particle approximation to the exact expression ?

These two questions seem closely related, however the numerical experiments presented in this work show that one of the two particle approximation schemes fails to satisfy the rst criteria (quality of the approximation of the nite signed measure), and that both schemes satisfy the second criteria (quality of the approximation of the statistics).

Key-words: monitoring, mechanical system, modal parameter, recursive maximum likelihood, estimation, tracking, particle lter, linear tangent particle lter.

(Résumé : tsvp) Ce travail correspond au rapport nal d'un stage eectué dans le cadre du DESSMathématiques Appliquésde l'université de Rennes 1, dans la lièreStatistique. Il a été soutenu en partie par le projet Eurêka 2419Flight Test Easy(FLITE).

*IRISA / INRIA, Campus de Beaulieu, 35042 RENNES Cédex, France. {legland,lmevel}@irisa.fr

Centre National de la Recherche Scientifique Institut National de Recherche en Informatique (UPRESSA6074) Université de Rennes 1 – Insa de Rennes et en Automatique – unité de recherche de Rennes

(5)

pour l'estimation et la poursuite de paramètres : Expériences numériques

Résumé : L'objectif de ce travail est de mieux comprendre l'approximation particulaire de mesures signées nies, au travers de quelques expériences numériques menées dans un cas simple où les solutions exactes sont connues de manière explicite. Un exemple typique de mesure signée nie est la dérivée, par rapport à un paramètre du modèle, de distributions de probabilité associées à une chaîne de Markov cachée.

Cela inclut la distribution a priori, le prédicteur, le ltre, etc. Deux points de vue sont considérés ici pour évaluer la qualité de l'approximation, au moins dans un sens qualitatif :

• quelle est la précision de l'approximation particulaire de la mesure signée nie, au vu d'une représen- tation sous forme d'histogramme du système de particules pondérées ?

• si on s'intéresse seulement à la fonction de logvraisemblance ou à la fonction score, quel est l'écart entre l'expression fournie par l'approximation particulaire et l'expression exacte de ces quantités ? Ces deux questions sont évidemment liées, mais les expériences numériques présentées dans ce travail mon- trent que l'un des deux schémas d'approximation particulaire proposés ne répond pas de manière satisfaisante au premier critère (qualité de l'approximation de la mesure signée nie), et que les deux schémas proposés donnent une bonne approximation pour le second critère (précison de l'approximation des statistiques).

Mots clés : surveillance, système mécanique, paramètre modal, maximum de vraisemblance récursif, estimation, poursuite, ltre particulaire, ltre particulaire linéaire tangent.

(6)

1 Introduction

The purpose of this work is to obtain as much intuition as possible, through numerical experiments in a simple case where exact solutions are explicitly available, about the particle approximation of nite signed measures. A prototypical example of a nite signed measure is the derivative, w.r.t. a parameter of the model, of some probability distributions related with a hidden Markov chain. This includes prior, prediction, ltering probability distributions, etc. Two points of view are considered here, to feel the quality of the approximation, at least in a qualitative manner :

(i) how accurate is the particle approximation of the nite signed measure, in view of an histogram representation of the weighted particle system ?

(ii) considering the loglikelihood function and the score function, how close is the approximate expression provided by the particle approximation to the exact expression ?

These two questions seem closely related, however the numerical experiments presented in this work show that one of the two particle approximation schemes fails to satisfy the rst criteria (quality of the approximation of the nite signed measure), and that both schemes satisfy the second criteria (quality of the approximation of the statistics).

The initial motivation for this work was provided by an application to monitoring the integrity of structural and mechanical systems. Detecting and localizing damages for monitoring the integrity of structural and mechanical systems is a topic of growing interest, due to the aging of many engineering constructions and machines and to increased safety norms. Automatic global vibrationbased monitoring techniques turn out to be useful alternatives to visual inspections or local non destructive (e.g. ultrasonic) evaluations performed manually.

Health monitoring techniques based on processing vibration measurements basically handle two types of characteristics: the structural parameters (mass, stiness, exibility, damping) and the modal parameters (modal frequencies, and associated damping values and modeshapes), see [22, 9, 21]. A central question for monitoring is to compute changes in those characteristics and to assess their signicance. For the frequencies, crucial issues are then: how to compute the changes, to assess that the changes are signicant, to handle correlations among individual changes. A related issue is how to compare the changes in the frequencies obtained from experimental data with the sensitivity of modal parameters obtained from an analytical model.

Furthermore, it has been widely acknowledged that changes in frequencies bear useful information for damage detection.

Our contribution in this work is to design a particle ltering method to track the modal parameters.

Particle ltering techniques are a set of powerful and versatile simulationbased methods to perform optimal state estimation in nonlinear nonGaussian statespace models, and we consider here an approach combin- ing particle ltering and gradient algorithm to perform recursive maximum likelihood parameter estimation and tracking. In the next section, the modeling issues are introduced and some key parameterizations are discussed. Section 3 details the particle approximation mechanisms. Section 4 is devoted to particle approximation of nite signed measures, with numerical experiments. In Section 5, the particle implementation of the recursive maximum likelihood (RML) algorithm is described.

2 Mechanical model

2.1 Dynamical model and structural parameters

It is assumed that the behavior of the mechanical system can be described by a stationary linear dynamical system, and that, in the frequency range of interest, the input forces can be modeled as a non-stationary white noise. This results in :

MZ¨(t) +CZ˙(t) +KZ(t) = ν(t)

Y(t) = LZ(t) (1)

where ^t denotes continuous time, ^M, ^C and ^K are the mass, damping and stiness matrices respectively, the (high dimensional) vector^Z collects the displacements of the degrees of freedom of the structure, the PI n1604

(7)

external (non measured) force^ν is modeled as a nonstationary white noise with timevarying covariance matrix^Q^ν^(t), measurements are collected in the (often, low dimensional) vector^Y, and the matrix^Lindicates which components of the state vector are actually measured, i.e. where the sensors are located.

The modes or eigenfrequencies, denoted generically by^µ, are solutions of :

det(µ²M +µ C+K) = 0. (2)

The frequency^f and the damping coecient^dare recovered from a continuous eigenvalue^µthrough :

f = ℑ(µ)

2π and ^d⁼^−ℜ^(µ)^. (3)

Some comments are in order on parameterizations of interest for damage detection and localization. Since a local damage in the structure reduces the stiness and increases the damping, many damage detection techniques have been proposed which monitor the stiness matrix^K. Monitoring its inverse ^K⁻¹, namely the exibility matrix, has proven more tractable and computationally feasible [22, 13, 5]. In some cases, other structural parameterizations such as volumic mass and Young elasticity modulus may be preferable [14, 22].

Also, several methods in the literature are based on a transmissibility matrix [21, 23], which involves the processing of inputoutput data. However, in the case of non measured input excitation, processing output only data is mandatory [17, 3]. On the other hand, a reduced stiness and an increased damping result in decreased natural frequencies. Thus, monitoring the modal parameters is relevant.

2.2 Statespace model and parameterization

Sampling model (1) at rate^1/∆yields the discrete time model in state space form [12, 18] :

Xk+1 = F Xk+Wk

Yk = H Xk (4)

where the state and the output are :

Xk =

Z(k∆) Z˙(k∆)

and ^Y^k⁼^Y^(k^∆) ^, (5) the state transition and observation matrices are :

F = exp(A∆) where ^A⁼₋_M⁰⁻¹_K ₋_M^I⁻¹_C and

H= L 0

. (6)

The measurement equation in (4) with^H as in (6) implicitly assumes that the available sensors measure the (relative) displacements of the degrees of freedom themselves. The nature of the sensors used only inuences the observation matrix^H. In (4), the unmeasured state noise{Wk, k≥0} is assumed to be a zeromean Gaussian white noise, with covariance matrix :

Qk

=△E[WkW_k^∗] =

Z (k+1) ∆ k∆

exp(A s)Q(s) exp(Ae ^∗s)ds

and

Q(s) =e

0 0 0 M⁻¹Qν(s)M^−∗

.

The state^X and the observed output^Y have dimensions^2mand^rrespectively, with^r(often much) smaller than^2min practice.

Irisa

(8)

Let^λbe the modes of the state transition matrix^F, namely :

det (F−λ I) = 0. (7)

The continuous modes^µin (2) can be deduced from the discrete modes^λin (7) using :

exp(∆µ) =λ .

The frequency^f and the damping coecient^dare recovered from a discrete eigenvalue^λthrough :

f = arg(λ)

2π∆ and ^d⁼⁻_∆¹ ^log^|^λ^| ^. (8)

Because of the structure of the state in (5), the discrete modes^λare pairwise complex conjugate. If^F and^H are unknown matrices but we are able to track the coecients of these two matrices, then it is easy to obtain the frequency and the damping coecient using (8).

Remark 2.1. If we suppose noisy measurements, the model becomes :

Xk+1 = F Xk+Wk

Yk = H Xk+Vk (9)

where^{^V^k^{, k}^≥⁰^}is an unmeasured Gaussian white noise with zero mean. It is essential to note that, with this assumption, the measurement noise does not aect the eigenstructure of (9).

3 About particle approximation

In this section, we will rst detail particle approximation of the lter and the derivative of the lter w.r.t. the parameter of interest, following [6, 16]. Then, we will see dierent resampling schemes and their eciency.

3.1 Hidden Markov model

The state sequence{Xk, k≥0}is a Markov chain taking values in the space^E⁼^R^d, with transition kernel

Q(x, dx^′)(which is assumed time independent for simplicity), i.e.

P[Xk+1∈dx^′|Xk=x] =Q(x, dx^′).

The kernel ^{Q(x, dx}^′⁾ could depend on a parameter, that should be either estimated, or monitored (i.e.

changes w.r.t. a nominal value should be detected), however the dependence w.r.t. the parameter is not written explicitly, so as to avoid intricated notations. The following assumption is made

It is ^easy to ^simulate a r.v. ^X with probability distribution ^{Q(x, dx}^′⁾, even though the analytical expression of the kernel ^{Q(x, dx}^′⁾ is not known, or is so complicated that it is pratically impossible to ^compute such integrals as

Q φ(x) = Z

E

Q(x, dx^′)φ(x^′) or ^{Q µ(dx}^′^{) =}^Z

E

µ(dx)Q(x, dx^′).

The state sequence {Xk, k ≥ 0} is not observed, but instead an observation sequence {Yk, k ≥ 0} is available, which has the following property : given the hidden states{Xk, k≥0}, the observations{Yk, k≥ 0} are mutually independent, and the conditionnal probability distribution of ^Y^k (which is assumed time independent for simplicity) depends only on the hidden state^X^k at the same time instant, and by denition

P[Yk∈dy|Xk=x] =g(x, y)λ(dy) and ^Ψ^k^{(x) =}^{g(x, Y}^k⁾^.

Notice that when^x varies, all the conditionnal probability distributions^P[Y^k ^∈ ^dy ^|^X^k ⁼^x] are assumed absolutely continuous w.r.t. a nonnegative measure^λ(dy)which does not depend on^x(with densities^{g(x, y)} which do depend on^x).

PI n1604

(9)

Example This memoryless channel assumption is satised for instance in the case where the hidden state is observed in an additive white noise sequence, not necessarily Gaussian, i.e. in our model where the observation^Y^k is related to the hidden state^X^k by the relation

Yk =H Xk+Vk ,

where^{^V^k^{, k}^≥⁰^}is a white noise sequence (i.e. a sequence of mutually independent r.v.'s) with probability distribution ^q(v)^dv (which is assumed time independent for simplicity), independent of ^{^X^k^{, k} ^≥⁰^}. In this case

P[Yk∈dy|Xk=x] =q(y−H x)dy and ^Ψ^k^{(x) =}^q(Y^k−H x).

3.2 Particle approximation of the lter

Given observations, the objective is to estimate the hidden states, and to this eect the probability distributions

µk(dx) =P[Xk∈dx|Y0,· · · , Yk] and ^µ^k|k−1^{(dx) =}^P[X^k∈dx|Y0,· · · , Yk−1],

are introduced. The evolution of the sequence ^{^µ^k^{, k}^≥⁰^}taking values in the space of probability distributions on^E, is very easily described by the following steps

µk−1

prediction

−−−−−−−−−→µk|k−1=Q µk−1

correction

−−−−−−−−−→µk= Ψk·µk|k−1 ,

where

µk|k−1(dx^′) =Q µk−1(dx^′) = Z

E

µk−1(dx)Q(x, dx^′),

can happen to be dicult (if not just impossible) to compute, and where^· denotes the projective product, i.e.

µk(dx) = Ψk·µ_k|k−1(dx) = Ψk(x)µk|k−1(dx) hµk|k−1,Ψki .

In view of the key assumption that it is on the other hand easy to simulate r.v.'s with probability distribution^{Q(x, dx}^′⁾, the idea is to approximate the predictor^µ^k|k−1 with the empirical probability distribution associated with an^Nsample, i.e.

µk|k−1≈µ^N_k|k−1= 1 N

XN i=1

δξ_k|k−1ⁱ .

This approximation is completely characterized by the set {ξ_k|k−1ⁱ , i = 1,· · ·, N} of particles, and the algorithm is completely described by the mechanism which builds{ξⁱ_k+1|k, i= 1,· · · , N}from {ξⁱ_k|k−1, i= 1,· · · , N}. This mechanism is as follows :

(i) the correction step is applied exactly to^µ^Nk|k−1, which results in

µ^N_k = Ψk·µ^N_k|k−1= XN i=1

Ψk(ξ_k|k−1ⁱ )δξ_k|k−1ⁱ XN

j=1

Ψk(ξ_k|k−1^j )

= XN

i=1

ωⁱ_kδξⁱ_k|k−1 ,

i.e. particles ^{^ξk|k−1ⁱ , i = 1,· · ·, N} are now weighted, with weights ^{^ωkⁱ, i = 1,· · · , N} which are more heavy for those particles which are more consistent with the current observation^Y^k,

Irisa

(10)

(ii) instead of trying to compute ^{Q µ}^Nk, the following particle approximation

µ^N_k+1|k=S^N(Q µ^N_k ) = 1 N

XN i=1

δξ_k+1|kⁱ ,

is used, where the^Nsample^{^ξk+1|kⁱ , i= 1,· · ·, N}has precisely the probability distribution^{Q µ}^Nk, i.e.

N independent r.v.'s are simulated with common probability distribution^{Q µ}^Nk, which can be achieved in the following manner : independently for anyⁱ^{= 1,}· · · , N

ξ_kⁱ ∼ µ^N_k (dx) ⇐= easy, since the probability distribution^µ^N^k is discrete,

ξ_k+1|kⁱ ∼ Q(ξ_kⁱ, dx^′) ⇐= easy, by assumption.

3.3 Linear tangent kernel / extended kernel, etc.

If the transition kernel^{Q(x, dx}^′⁾depends on a parameter, then the lter^µ^k depends also on the parameter, and one would like to compute the linear tangent lter ^w^k, i.e. the derivative of the lter ^µ^k w.r.t. the parameter. To this end, one needs rst to study the linear tangent kernel^{Γ(x, dx}^′⁾, i.e. the derivative of the transition kernel^{Q(x, dx}^′⁾w.r.t. the parameter, and the following assumption is made

Assumption AC : The following probabilistic representation holds for the linear tangent kernel^{Γ(x, dx}^′⁾

Γφ(x) = Z

E

Γ(x, dx^′)φ(x^′) =E[φ(Xk+1) Ξk+1|Xk=x],

where {(Xk,Ξk), k≥0} is a Markov chain taking values in the product space^E×F, such that

P[Xk+1∈dx^′,Ξk+1 ∈ds^′ |Xk=x,Ξk =s]

=P[Xk+1∈dx^′,Ξk+1∈ds^′|Xk =x] =K(x, dx^′, ds^′).

The following assumption, which extends the similar assumption introduced in Section 3.1, is made

It is easy to simulate a r.v. ^(X,^Ξ) with probability distribution ^{K(x, dx}^′^{, ds}^′⁾, even though the analytical expression of the kernel ^{K(x, dx}^′^{, ds}^′⁾ is not known, or is so complicated that it is pratically impossible to compute such integrals as

Γφ(x) = Z

E×F

s^′φ(x^′)K(x, dx^′, ds^′) or ^Γ^µ(dx^′^{) =}^Z

E×F

µ(dx)s^′K(x, dx^′, ds^′).

Example In our model, the Markov chain{Xk, k≥0} taking values in^E⁼^R^d, is dened by

Xk+1=F Xk+Wk ,

where only the matrix^F depends on the parameter, and where ^{^W^k^{, k}^≥⁰^} is a sequence of independent r.v.'s taking values in ^R^d with probability distribution^p(w)^dw (coecients are assumed time independent for simplicity). For any^x∈R^d, the transition kernel^{Q(x, dx}^′⁾is given by

Q(x, dx^′) =p(x^′−F x)dx^′ ,

and one can show directly that

Γ(x, dx^′) =−p^′(x^′−F x) ∂F

∂θ x dx^′ = −p^′

p (x^′−F x) ∂F

∂θ x Q(x, dx^′).

PI n1604

(11)

It follows that

Γφ(x) = Z

E

Γ(x, dx^′)φ(x^′) = Z

E

φ(x^′) −p^′

p (x^′−F x) ∂F

∂θ x Q(x, dx^′)

= E[φ(Xk+1) −p^′

p (Xk+1−F Xk) ∂F

∂θ Xk|Xk=x],

i.e. Assumption AC is satised, with

Ξk+1 =−p^′

p (Wk)∂F

∂θ Xk .

Notice that in the above example, the r.v. ^Ξ^k+1 depends only on ^(X^k^{, W}^k⁾, in which case it does not seem necessary to simulate ^Ξ^k+1 in addition to ^W^k. This apparently very particular situation is actually very general, as the following result shows.

Lemma 3.1. Under Assumption AC

Γ(x, dx^′) =I(x, x^′)Q(x, dx^′),

with

I(x, x^′) =E[Ξk+1|Xk=x, Xk+1=x^′],

for any ^{x, x}^′ ^∈^E.

Proof. For any probability distribution^µon^E, and any pair ^{B, B}^′ of Borel subsets of^E

Eµ[Ξk+11(Xk+1∈B^′, Xk ∈B)] = Z

B×B^′

E[Ξk+1|Xk=x, Xk+1=x^′]µ(dx)Q(x, dx^′),

and

Eµ[Ξk+11(Xk+1 ∈B^′)1(Xk∈B)] = Z

B

E[Ξk+11(Xk+1∈B^′)|Xk =x]µ(dx)

= Z

B

Γ(x, B^′)µ(dx),

hence taking^B ⁼^Eyields

Z

E

Γ(x, B^′)µ(dx) = Z

E{ Z

B^′

E[Ξk+1|Xk=x, Xk+1=x^′]Q(x, dx^′)}µ(dx),

and since the probability distribution^µis arbitrary, it holds

Γ(x, B^′) = Z

B^′

E[Ξk+1|Xk =x, Xk+1 =x^′]Q(x, dx^′),

which proves the result.

By denition

Γφ(x) = Z

E×F

φ(x^′)s^′K(x, dx^′, ds^′) and ^{Q φ(x) =}^Z

E×F

φ(x^′)K(x, dx^′, ds^′),

hence

Γ(x, dx^′) = Z

F

s^′K(x, dx^′, ds^′) and ^{Q(x, dx}^′^{) =}^Z

F

K(x, dx^′, ds^′).

Irisa

(12)

On the product spaceÊ×E×F, dene the projection^π⁰ ^{: (x, x}^′^{, s}^′⁾7−→xon the (rst) spaceÊ, the projection ^π ^{: (x, x}^′^{, s}^′⁾7−→ x^′ on the (second) space Ê and the projection ^π^F ^{: (x, x}^′^{, s}^′⁾ 7−→s^′ on the auxiliary space^F. For any probability distribution^µon the spaceÊ, the probability distribution^µ⊗K is dened on the product spaceÊ×E×F by

(µ⊗K)(dx, dx^′, ds^′) =µ(dx)K(x, dx^′, ds^′).

It follows that

Q µ(dx^′) = Z

E×F

µ(dx)K(x, dx^′, ds^′) = (µ⊗K)◦π⁻¹(dx^′),

and

Γµ(dx^′) = Z

E×F

µ(dx)s^′K(x, dx^′, ds^′)

= Z

E×F

πF(x, x^′, s^′) (µ⊗K)(dx, dx^′, ds^′) = (πF(µ⊗K))◦π⁻¹(dx^′),

and if the nite signed measure^wis absolutely continuous w.r.t.^µ, then

Q w(dx^′) = Z

E×F

w(dx)K(x, dx^′, ds^′)

= Z

E×F

dw

dµ(x)µ(dx)K(x, dx^′, ds^′)

= Z

E×F

(dw

dµ ◦π0)(x, x^′, s^′) (µ⊗K)(dx, dx^′, ds^′) = ((dw

dµ ◦π0) (µ⊗K))◦π⁻¹(dx^′),

i.e.

Q µ= (µ⊗K)◦π⁻¹ and ^Γ^µ^{= (π}^F^(µ⊗K))◦π⁻¹ ,

and

(w≪µ =⇒ Q w= ((dw

dµ ◦π0) (µ⊗K))◦π⁻¹).

Lemma 3.2. Under Assumption AC, ^Γ^µ ^≪ ^{Q µ} for any probability distribution ^µ on ^E, with Radon Nikodym derivative (which depends on ^µ)

d(Γµ)

d(Q µ)(x^′) =Eµ[Ξn+1|Xn+1=x^′].

Proof. For any Borel subset^B^′ of^E, it holds

Γµ(B^′) = Z

E

µ(dx) Γ(x, B^′)

= Z

E

µ(dx)E[Ξk+11(Xk+1∈B^′)|Xk =x]

= Eµ[Ξk+11(Xk+1∈B^′)] = Z

B^′

Eµ[Ξk+1|Xk+1=x^′]Q µ(dx^′),

hence^Γ^µis a signed measure, absolutely continuous w.r.t. ^{Q µ}, with RadonNikodym derivative

d(Γµ)

d(Q µ)(x^′) =Eµ[Ξk+1 |Xk+1=x^′]

PI n1604

(13)

For completeness, the following elementary property is recalled

Lemma 3.3. If the nite signed measure ^w is absolutely continuous w.r.t. the probability distribution ^µ, then^{Q w}^≪^{Q µ}, with RadonNikodym derivative

d(Q w)

d(Q µ)(x^′) =Eµ[dw

dµ(Xk)|Xk+1=x^′].

Proof. For any Borel subset^B^′ of^E, it holds

Q w(B^′) = Z

E

w(dx)Q(x, B^′)

= Z

E

dw

dµ(x)µ(dx)Q(x, B^′)

= Eµ[dw

dµ(Xk)1(Xk+1∈B^′)] = Z

B^′Eµ[dw

dµ(Xk)|Xk+1=x^′]Q µ(dx^′),

hence^{Q w}is a signed measure, absolutely continuous w.r.t.^{Q µ}, with RadonNikodym derivative

d(Q w)

d(Q µ)(x^′) =Eµ[dw

dµ(Xk)|Xk+1=x^′]

The explicit expression of the RadonNikodym derivatives will not be used in the sequel : only the qualitative properties

Γµ≪Q µ and ⁽^w≪µ =⇒ Q w≪Q µ),

will be used.

By denition

Fk(µ)w= Ψkw

hµ,Ψki−hw,Ψki hµ,Ψki

Ψkµ hµ,Ψki ,

is the derivative at point^µand in the direction^w, of the mapping^µ7−→Ψk·µ. The following elementary property holds

Lemma 3.4. If the nite signed measure ^w is absolutely continuous w.r.t. the probability distribution ^µ, then^F^k^(µ)^w^≪^Ψ^k^·^µ, with RadonNikodym derivative

d(Fk(µ)w)

d(Ψk·µ) (x) = dw

dµ(x)− hΨk·µ,dw dµi.

3.4 Particle approximation of some nite signed measures

With the notations of the previous section, it easily seen that the probability distribution^{Q µ}and the nite signed measures^Γ^µand^{Q w} can be put in the general form^(r^(µ^⊗^K))^◦^π⁻¹ for some appropriate choice of the weight function ^r, namely ^r ^≡ ¹, ^r ⁼ ^π^F and ^r ⁼ ^dw_dµ ^◦^π⁰ respectively. The weighted particle approximation of a nite signed measure of the general form^r^(µ^⊗^K)is dened by

r(µ⊗K)≈r S^N(µ⊗K) = 1 N

XN i=1

r(ξ₀ⁱ, ξⁱ,Ξⁱ)δ

(ξ0ⁱ, ξⁱ,Ξⁱ),

Irisa

(14)

where the ^Nsample {ξ₀ⁱ, ξⁱ,Ξⁱ, i = 1,· · · , N} has precisely the probability distribution ^µ⊗K, i.e. one simulate ^N independent r.v.'s with common probability distribution ^µ⊗K, which can be achieved in the following manner : independently for anyⁱ^{= 1,}· · ·, N

ξ₀ⁱ ∼µ(dx) and ^(ξⁱ^,^Ξⁱ⁾∼K(ξ₀ⁱ, dx^′, ds^′),

and the corresponding particle approximation for the marginal measure^(r^(K⊗µ))◦π⁻¹ is dened by

(r(µ⊗K))◦π⁻¹≈(r S^N(µ⊗K))◦π⁻¹= 1 N

XN i=1

r(ξⁱ₀, ξⁱ,Ξⁱ)δξⁱ .

In particular for the weight functions^r^≡¹,^r⁼^π^F and^r⁼^dw_dµ ^◦^π⁰, it holds

Q µ= (µ⊗K)◦π⁻¹≈ 1 N

XN i=1

δξⁱ ,

Γµ= (πF(µ⊗K))◦π⁻¹≈ 1 N

XN i=1

Ξⁱδξⁱ ,

and

Q w= ((dw

dµ ◦π0) (µ⊗K))◦π⁻¹≈ 1 N

XN i=1

dw

dµ(ξ₀ⁱ)δξⁱ ,

respectively. For any test function^φdened on^E, it holds

E| 1 N

XN i=1

r(ξⁱ₀, ξⁱ,Ξⁱ)φ(ξⁱ)− h(r(µ⊗K))◦π⁻¹, φi |

≤ 1

√N { Z

E×E×F|φ(x^′)|²|r(x, x^′, s^′)|²µ(dx)K(x, dx^′, ds^′)}^1/2

≤ 1

√N { Z

E×E×F|r(x, x^′, s^′)|²µ(dx)K(x, dx^′, ds^′)}^1/2kφk,

and in particular for the weight functions^r^≡¹,^r⁼^π^F and^r⁼ ^dw_dµ ^◦^π⁰, it holds

sup

kφk=1E| 1 N

XN i=1

φ(ξⁱ)− hQ µ, φi | ≤ 1

√N ,

sup

kφk=1E| 1 N

XN i=1

Ξⁱ φ(ξⁱ)− hΓµ, φi | ≤ 1

√N {sup

x∈E

Z

E×F|s^′|²K(x, dx^′, ds^′)}^1/2,

and

sup

kφk=1E| 1 N

XN i=1

dw

dµ(ξ₀ⁱ)φ(ξⁱ)− hQ w, φi | ≤ 1

√N { Z

E|dw

dµ(x)|²µ(dx)}^1/2 ,

respectively.

PI n1604