HAL Id: hal-00652103
https://hal.inria.fr/hal-00652103
Submitted on 14 Dec 2011
HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.
L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.
Recursive maximum likelihood estimation for structural health monitoring: tangent filter implementations
Fabien Campillo, Laurent Mevel
To cite this version:
Fabien Campillo, Laurent Mevel. Recursive maximum likelihood estimation for structural health monitoring: tangent filter implementations. 44th IEEE Conference on Decision and Control, and the European Control Conference, Dec 2005, Seville, Spain. pp.5923 - 5928, �10.1109/CDC.2005.1583109�.
�hal-00652103�
Recursive maximum likelihood estimation for structural health monitoring: Kalman and particle filter implementations
Fabien Campillo & Laurent Mevel INRIA/IRISA
Campus de Beaulieu 35042 Rennes cedex, France
[email protected] [email protected]
Abstract— Flutter monitoring can be handled by tracking the
real time variations of the modal parameters of a specified civil structure, be it a bridge or an aircraft. Previous algorithmic attempts encompass automated batch identification and damage detection through hypothesis testing. Both approaches appear impractical, the first one because of computational time consid- erations and the difficulty to select a windows length with the best trade off between bias and variance, the second because of the difficulty to obtain reference data set close to flutter regime.
Here, we investigate the capabilities of a sample wise recursive Kalman filter–based gradient approach and compare it to its particle filter counterpart.
I. I
NTRODUCTIONA critical problem for mechanical structures exposed to unmeasured non stationary natural excitation (turbulence) is an instability phenomenon also known as flutter. It is formulated as the monitoring of the time varying com- plex eigenvalues associated to the discretized linear system corresponding to the monitored mechanical system. It has already been investigated through batch identification modal analysis using only output-only in-flight data has already been investigated. See Mevel et al [1] for a case study of monitored aircraft using subspace identification methods.
For improving the estimation of the parameters of interest, the collection of frequency and damping coefficients, and moreover for achieving this in real-time during flight tests, one possible route is to resort to tracking algorithms.
Frequency and damping coefficients are monitored by a recursive maximum likelihood (RML) procedure. The considered tracking procedure is a special case of adaptive algorithms where the gain is kept constant. The approxi- mation of the associated score function is evaluated by a joint particle approximation of the conditional law and its derivative w.r.t. to the parameters (the tangent filter). Particle filtering techniques [2] are simple to implement and have many advantages in practice like robustness. Doucet & Tadic [3], Guyader et al [4], and Caylus et al [5] already applied these techniques to RML estimation. The problem presented here was previously treated by Fichou et al [6] in a more simple framework.
Particle approximation for health monitoring was already proposed by Yoshida & Sato [7] in order to handle non–
Gaussian noise. Modal characteristics monitoring is also considered by Ching et al [8]. In both cases authors use a state augmentation approach by including the unknown
parameters in the state process. It seems preferable to directly identify these parameters by a likelihood approach.
In a first part the structural health monitoring problem is written in term of recursive maximum likelihood estimate (RMLE) in a state–space model. Then a Kalman filter expression of the score function is proposed together with an alternate particle filter approximation. Last part is devoted to a case study.
II. T
HE PROBLEMA. Dynamical model and structural parameters Let us consider observations sampled at a rate 1/δ
y
k= L Z (k δ) (1)
of the state Z(t) of a n–degrees of freedom mechanical system. These measurements are gathered through d sensors, i.e. y
ktakes values in R
d. The matrix L indicates which components of the state vector are actually measured, i.e.
where the sensors are located. The behavior of the mechan- ical system is described by the following linear dynamical system
M Z(t) + ¨ C Z(t) + ˙ K Z(t) = σ ζ(t) (2) where the (non measured) input force ζ is a non–stationary white Gaussian noise with time-varying covariance matrix Q
ζ(t). M , C , K are respectively the matrices of mass, damping and stiffness.
Now let us describe the structural characteristics of the system (2). The modes or eigenfrequencies µ and the asso- ciated eigenvectors Φ
µof the system (2) are solutions of
det[µ
2M + µ C + K ] = 0 ,
[µ
2M + µ C + K ] Φ
µ= 0 . (3) Then the mode–shapes are Ψ
µ= L Φ
µ. The frequency and damping coefficients are
f =
2πb(Hz) , d =
√a|a|2+b2∈ [0, 1] (4) with a = (µ) and b = (µ).
The monitored structure is defined by its modal charac-
teristics: the collection of frequencies, dampings and mode
shapes, as well as the covariances of the noises. The problem
is to follow the slow evolutions of the structural character-
istics of the mechanical system (2) by a recursive tracking
method, whose starting values will be defined as the output of the data driven subspace method as described in Van Overschee & De Moor [9, Fig. 3.13 p. 90].
The tracking algorithm will focus on the frequencies and dampings, the mode shapes are assumed not to change significantly during the monitoring in regard to the changes in the eigenvalues. A change in the mode shapes would most likely be a local change in the structure, thus will indicate the presence of damage, whereas a change in the eigenvalues can still occur without presence of damage and not affect significantly the mode shapes (as for example the effect of temperature on the stiffness of the structure).
B. State–space model and canonical parameterization We rewrite the preceding system (1)–(2) as a linear state–
space model. Define X
k=
def Z(k δ) Z(k δ)˙and F =
defe
δAwith A =
def 0 I−M−1K −M−1C
∈ R
2n×2n. From (2) we get
X
k+1= F X
k+ σ ζ
k(5) where ζ
k=
defkδ(k−1)δ
e
(kδ−u)Ad
0dBu
and B
t=
deft0
ζ(s) ds is a Brownian motion. Hence ζ
kis a (discrete–time) white Gaussian noise with covariance matrix
kδ(k−1)δ
e
(kδ−u)A 0 00 M−1Qζ(u) (M−1)∗
e
(kδ−u)A∗du which is approximated by δ Q
ζkwith
Q
ζk=
def 0 00 M−1Qζ(kδ) (M−1)∗
. From (1) we get
y
k= [ L 0] X
k+ ν v
k(6) where [ L 0] ∈ R
d×2nand v
kis a N (0, Q
vk) white Gaussian noise which allows to take into account of the errors of modeling and the measurement noise. We suppose that the Hermitian matrix Q
vkis positive definite.
Let (λ, Φ
λ) be the eigenstructure of the state transition matrix F , namely
det(F − λ I) = 0 , (F − λ I) Φ
λ= 0 . (7) The parameters (µ, Φ
µ) in (3) can be deduced from the (λ, Φ
λ)’s using e
δ µ= λ and Φ
µ= Φ
λ. The frequency and damping coefficients (4) are recovered from a discrete eigenvalue λ through
a =
1δlog | λ | , b =
1δarctan
(λ)(λ)
.
Hypothesis: We suppose that F admits 2n pairwise complex conjugate distinct eigenvalues λ
1:n, ¯ λ
1:nwith associated or- thonormal set of eigenvectors Φ
1:n, Φ ¯
1:n(
1). We also suppose that these eigenvalues have modulus less than one.
1Notations:xT is the transpose ofx,¯xis the complex conjugate,x∗is the transpose/conjugate,|x|the modulus,jwill denote√
−1.
It turns out that this collection of modes forms a very natural parameterization for structural analysis. It is invariant w.r.t. changes in the state basis of system (5)–(6). In other words, the (λ, Φ
λ)’s form a canonical parameterization of the eigenstructure (or equivalently the pole part) of that system.
1) Change of variables: Define
Φ = [Φ
def 1:n] , Ψ = [Ψ
def 1:n] , Λ =
defdiag(λ
1:n) . We introduce the following linear transformation
T = [
defΦ Φ ¯ ] ∈ C
2n×2n,
i.e. the matrix whose columns are the eigenvectors of F . It is a unitary matrix, i.e. T
−1= T
∗. Then
Λ (0) (0) Λ¯= T
∗F T ∈ C
2n×2n. Define also
H = [
defL 0] T = [ L 0] [ Φ Φ ¯ ] = [ Ψ Ψ ¯ ] ∈ C
d×2n, Then after the change of variables
X ˜
k=
defT
∗X
k,
the vector X ˜
kis of the form [
x¯xkk] and (5) reduces to x
k+1= Λ x
k+ σ Φ
∗ζ
k, ζ
k iid∼ N(0, δ Q
ζk) . Note that in practice we just have access to the mode shapes matrix Ψ
1:nand not to the eigenvectors matrix Φ
1:n, so in order to fully specify the state equation we suppose that the covariance matrix Q
ζkis of the form [ L 0]
∗Q
k[ L 0] for a given covariance matrix Q
k. Hence w
k=
defΦ
∗ζ
kis a white Gaussian noise with covariance matrix Q
wk=
defδ Ψ
∗Q
kΨ .
The observation equation (6) becomes y
k= Ψ x
k+ ¯ Ψ ¯ x
k+ ν v
k, v
kiid
∼ N(0, Q
vk) . Note that Ψ x + ¯ Ψ x ¯ = 2 {Ψ x} is a linear operator.
2) The state/space system: One finally obtains the follow- ing system
x
k+1= Λ x
k+ σ w
k, w
k iid∼ N (0, Q
wk) , (8) y
k= 2 {Ψ x
k} + ν v
k, v
kiid
∼ N (0, Q
vk) . (9) In this model all parameters are assumed known, or pre- viously estimated, except the eigenvalues matrix Λ =
defdiag(λ
1:n) and the noise intensities σ and ν. The mode shapes matrix Ψ = [Ψ
1:n], the sampling period δ, and the covariance matrices Q
kand Q
vkare given (then Q
wk= δ Ψ
∗Q
kΨ ). From now on we suppose that Q
vk= I.
C. The RMLE procedure
Let L
k(θ) be the likelihood function of θ for the obser- vations y
1:k. We will see that the normalized log–likelihood function
k(θ) =
def 1klog L
k(θ) admits an incremental formu- lation
k
(θ) = 1 k
k l=1r
l(θ) .
Then the score function is ˙
k(θ) = 1
k
k l=1˙ r
l(θ)
and the RMLE procedure is
θ
k← θ
k−1+ γ
k× r ˙
k(θ
k−1)
where γ
kis a non–increasing sequence of positive numbers.
In § III we present the Kalman filter–based approximation of the score increment r ˙
k(θ) and in § IV its particle filter–
based counterpart. This last approximation, contrary to the first one, is valid in the nonlinear/non–Gaussian case. These procedures are applied to our problem in § V.
III. K
ALMANRMLE
FOR A LINEAR SYSTEMConsider the following linear system x
k+1= F (θ) x
k+ G(θ) w
k, w
kiid
∼ N (0, Q
wk) , y
k= H (θ) x
k+ Σ(θ) v
k, v
kiid
∼ N (0, Q
vk) , The state process x
ktakes values in C
n, the observation process y
kin C
d. The state initial law is x
0∼ N(¯ x
0, R
0).
Initial condition x
0, state noise w
kand observation noise v
kare mutually independent.
Here θ ∈ R is an unknown real parameter: the derivative w.r.t. this parameter will be denoted “∂
θ” or “ · ” where there is no ambiguity.
Suppose that the matrices F (θ) ∈ C
n×n, G(θ) ∈ C
n×n, H (θ) ∈ C
d×nand Σ(θ) ∈ C
d×dare differentiable w.r.t. θ.
For every fixed θ, the conditional laws law( x
k|y
1:k−1) = N (ˆ x
θk−, R
θk−) and law(x
k|y
1:k) = N (ˆ x
θk, R
θk) are given recursively by the Kalman filter (see Part b in
TABLEI).
A. Likelihood function
One expresses the law of the observations y
1:kvia the innovation process
ˆ
ı
k=
defy
k− E
θ[y
k|y
1:k−1] = y
k− H(θ) ˆ x
θk−. Note that
P
θ(y
1:k∈ dy
1:k) =
kl=1
P
θ(y
l∈ dy
l|y
1:l−1= y
1:l−1) and law(y
k|y
1:k−1) = N(H (θ) ˆ x
θk−, S
kθ) where S
kθis the covariance of the innovation process (see Part b in
TABLEI).
We get
P
θ(y
1:k∈ dy
1:k) =
kl=1
g
lθ(y
l) dy
lwhere g
kθ(y) is the p.d.f. of the N (H (θ) ˆ x
θk−, S
kθ) law. This means that
r
l(θ) = log
defg
θl( y
l) .
B. Score function
In order to calculate the score increment r ˙
k(θ), one sets an auxiliary result. Consider the p.d.f. q
θ(x) of the normal law N(µ(θ), R(θ)) on C
nwhose mean µ(θ) and covariance matrix R(θ) > 0 are differentiable w.r.t. a scalar parameter θ ∈ R . Then the two classical identities
∂
θlog |R(θ)| =
∂θ|R(θ)||R(θ)|= trace
[R(θ)]
−1R(θ) ˙ ,
∂
θ[R(θ)]
−1= −[R(θ)]
−1R(θ) [R(θ)] ˙
−1a – initialization
θ(initial guess)
ˆ
xθ0= ¯
x0 Rθ0=
R0b – Kalman filter ˆ
xθk−=
F(
θ) ˆ
xθk−1Rθk−
=
F(
θ)
Rθk−1F(
θ)
∗+
G(
θ)
Qwk−1G(
θ)
∗ˆ
ıθk=
yk−H(
θ) ˆ
xθk−Skθ
=
H(θ)
Rθk−H(θ)
∗+ Σ(θ)
QvkΣ(θ)
∗ Kθk=
Rθk−H(
θ)
∗[
Skθ]
−1ˆ
xθk= ˆ
xθk−+
Kkθˆ
ıθkRθk
=
I−KkθH
(
θ)
Rθk−c – tangent Kalman filter
x
˙ˆ
θk−=
F(θ) ˙ˆ
xθk−1+ ˙
F(θ) ˆxθk−1 R˙
θk−=
F(θ) ˙
Rθk−1F(θ)
∗+ ˙
F(
θ)
Rθk−1F(
θ)
∗+
F(
θ)
Rθk−1F˙ (
θ)
∗+ ˙
G(θ)Qwk−1G(θ)∗+
G(θ)Qwk−1G(θ)˙
∗ ı˙ˆ
θk=
−H˙ (θ) ˆ
xθk−−H(θ) ˙ˆxθk−S
˙
kθ=
H(
θ) ˙
Rθk−H(
θ)
∗+ ˙
H(θ)Rθk− H(θ)∗+
H(θ)
Rθk− H(θ)˙
∗+ ˙Σ(
θ)
QvkΣ(
θ)
∗+ Σ(
θ)
Qvk˙Σ(
θ)
∗ K˙
θk= ˙
Rθk−H(θ)
∗[S
kθ]
−1+
Rθk−H˙ (θ)
∗[S
kθ]
−1− Rθk−H(θ)∗
[S
θk]
−1S˙
θk[S
kθ]
−1 x˙ˆ
θk= ˙ˆ
xθk−+ ˙
Kkθˆ
ıθk+
Kkθı˙ˆ
θkR
˙
θk=
I−KkθH
(
θ)
R˙
θk−−K
˙
kθH(
θ) +
KkθH˙ (
θ)
Rθk−d – score increment
˜
ıθk= [S
kθ]
−1ˆ
ıθkr
˙
k(θ) =
−12 trace[S
kθ]
−1S˙
kθ −12[˜
ıθk]
∗S˙
kθ˜
ıθk+
[˜
ıθk]
∗H
(θ) ˙ˆ
xθk−+ ˙
H(θ) ˆ
xθk−e – parameters update
θ←θ+
γk×r˙
k(θ)
TABLE I
Kalman recursive maximum likelihood procedure.
applied to log q
θ(x) give
∂
θlog q
θ(x) = −
12trace
[R(θ)]
−1R(θ) ˙
+
x − µ(θ)
∗[R(θ)]
−1µ(θ) ˙
−
12x − µ(θ)
∗[R(θ)]
−1R(θ) [R(θ)] ˙
−1x − µ(θ) . From this result the score increment is
˙
r
k(θ) = −
12trace
[S
kθ]
−1S ˙
kθ+
y
k− H (θ) ˆ x
θk− ∗[S
kθ]
−1×
H (θ) ˙ˆ x
θk−+ ˙ H(θ) ˆ x
θk−−
12y
k− H (θ) ˆ x
θk− ∗[S
kθ]
−1S ˙
kθ[S
kθ]
−1×
y
k− H (θ) ˆ x
θk−.
TABLE
I page 3 presents the complete algorithm.
IV. P
ARTICLE FILTERRMLE
FOR A NONLINEAR SYSTEMA. The problem
Consider a state/observation process whose law depends on an unknown parameter θ ∈ R . The state process x = {x
k}
k≥0takes values in R
n, it is Markovian with transition kernel Q
θkand initial probability law µ
0,
Q
θk(dx
| x) =
defP
θ( x
k+1∈ dx
|x
k= x) , (10) µ
0(dx) =
defP
θ(x
0∈ dx) . (11) This process describes the evolution of a non observed system. The observation process y = {y
k}
k≥1takes values in R
d. We suppose that (i) conditionally to the state process, the observations y
kare independent, and (ii) the observation y
kdepends only on x
k( y
kis the observation of x
k), i.e.
P
θ(y
1:k∈ dy
1:k|x
0:k= x
0:k) =
k l=1P
θ(y
l∈ dy
l|x
l= x
l) . (12) The law of the state/observation process (x, y) is now com- pletely specified. We assume moreover that the conditional law of y
kgiven x
kadmits a density w.r.t. the Lebesgue measure:
ψ
kθ(y|x) dy =
defP
θ(y
k∈ dy|x
k= x) . (13) Then the law of the process (x, y) can be expressed explicitly according to the three basic terms (10), (11) and (13), see
§ IV-C. This situation corresponds to the following diagram
x0 xk−1
yk−1
xk
yk
xk+1
yk+1
Qθk−1 Qθk
ψθk−1 ψθk ψθk+1
The system depends on the parameter θ through the kernel Q
θkand the local likelihood function ψ
θk. For simplicity we suppose that the initial law does not depend on θ.
Here, like in Doucet & Tadic [3], we consider the case where the Markov kernel Q
θkadmits a density w.r.t. the Lebesgue measure
Q
θk(dx
|x) = q
θk(x
|x) dx .
The case of a Markov kernel without density was treated in Guyader et al [4] and Fichou et al [6]. This hypothesis is not required to establish the equations of the nonlinear optimal filter but it simplifies the derivation of the tangent filter.
Example: Consider a state–space model
x
k+1= f
kθ(x
k) + σ
wθw
k, w
k∼ N (0, Q
wk) , y
k= h
θk(x
k) + σ
vθv
k, v
k∼ N (0, Q
vk) , where x
kand y
ktake with values in R
nand R
d; w
k, v
k, x
0are independent; x
0∼ µ
0; σ
θwand σ
vθare scalar positive numbers. Here Q
θk(dx
|x) = q
kθ(x
|x) dx
and
q
θk(x
|x) =
(2 π)
n(σ
wθ)
2n|Q
wk|
−1/2× exp
−
2 (σ1θw)2
[x
− f
kθ(x)]
∗[Q
wk]
−1[x
− f
kθ(x)]
, ψ
θk(y|x) =
(2 π)
d(σ
vθ)
2d|Q
vk|
−12× exp
−
2 (σ1θv)2
[y − h
θk(x)]
∗[Q
vk]
−1[y − h
θk(x)]
. B. Nonlinear filter
Define the nonlinear filter and the predicted nonlinear filter π
kθ(dx|y
1:k) =
defP
θ(x
k∈ dx|y
1:k= y
1:k) ,
π
kθ−(dx | y
1:k−1) =
defP
θ( x
k∈ dx |y
1:k−1= y
1:k−1) . We also use the notation π
kθ(dx) =
defπ
kθ(dx |y
1:k), π
kθ−(dx) =
defπ
kθ−(dx |y
1:k−1) and ψ
θk(x) =
defψ
kθ( y
k| x). These conditional densities can be recursively obtained through the classical two steps procedure:
πk−1θ πθk−=πk−1θ Qθk−1 πkθ= Ψθk[πθk−]
prediction correction
(14) where the prediction (linear) operator Q
θk−1and the cor- rection (nonlinear) operator Ψ
θk, which act on the space of probability measures, are defined by
πQ
θk−1(dx
) =
defQ
θk−1(dx
| x) π(dx) , (15) Ψ
θk[π](dx) =
defψ
kθ(x) π(dx)
π, ψ
kθ(16) where
π , ψ =
defRn
ψ(x) π(dx) .
Q
θkis the transition kernel of the Markov chain x
kand the
first step in (14) is the Chapman–Kolmogorov equation. The
second step in (14) is a Bayes formula. The initial condition
in (14) is π
0θ= µ
0.
C. Likelihood and score functions
According to the previous section, the joint law of the state and observation processes is
P
θ(x
0:k∈ dx
0:k, y
1:k∈ dy
1:k)
= µ
0(dx
0)
kl=1
ψ
θ(y
l| x
l) Q
θl−1(dx
l| x
l−1) dy
1:k.
This proves that this statistical model is dominated and L
k(θ) =
x0:k
µ
0(dx
0)
kl=1
ψ
kθ(x
l) Q
θl−1(dx
l|x
l−1) .
Note that
P
θ( y
1:k∈ dy
1:k)
= P
θ(y
1∈ dy
1)
kl=2
P
θ(y
l∈ dy
l|y
1:l−1= y
1:l−1)
=
kl=1
xl
P
θ(y
l∈ dy
l|x
l= x
l) π
lθ−(dx
l|y
1:l−1)
=
kl=1
xl
ψ
θl(y
l|x
l) π
lθ−(dx
l|y
1:l−1) dy
1:kwhich leads to the other formulation
L
k(θ) =
kl=1
π
θl−, ψ
θlso
r
l(θ) = logπ
def θl−, ψ
lθand the score increment is
˙
r
k(θ) = ∂
θlogπ
θk−, ψ
kθ= π ˙
kθ−, ψ
θk+ π
kθ−, [∂
θlog ψ
kθ] ψ
θkπ
kθ−, ψ
θk. (17) D. Tangent filter
The definition of derivative π ˙
θk−of the nonlinear (predic- tion) filter requires some attention. Let M(R
n) the set of finite (signed) measures on (R
n, B(R
n)), M
+1(R
n) that of probability measures on ( R
n, B ( R
n)), and M
0( R
n) that of null mass. A measure µ on ( R
n, B ( R
n)) is said finite if µ
+(R
n) + µ
−(R
n) < ∞ where µ
±is the Hahn–Jordan decomposition of µ. Here π
kθ−∈ M
1(R
n) then π ˙
θk−∈ M
0(R
n) and π ˙
kθ−π
θk−. See Heidergott & Vázquez–
Abad [10] for details.
We now establish a recursive formulation for π ˙
θk−and π ˙
kθ. 1) Prediction step: The derivative Q ˙
θk−1of the Markov kernel Q
θk−1w.r.t. the parameter θ is
Q ˙
θk−1(dx
|x) = ∂
θq
k−1θ(x
|x) dx
=
∂
θlog q
k−1θ(x
| x) q
θk−1(x
| x) dx
.
=
∂
θlog q
θk−1(x
|x)
Q
θk−1(dx
|x) . (18) Q ˙
θk−1is a transition kernel on M
0(R
n). Then, the derivative of the nonlinear filter w.r.t. the parameter θ is
˙
π
θk−= ∂
θ{ π
θk−1Q
θk−1} = ˙ π
k−1θQ
θk−1+ π
θk−1Q ˙
θk−1. (19)
2) Correction step: One introduces DΨ
θk[π] ν ∈ M
0(R
n) the derivative of the operator π → Ψ
θk[π] at the point π ∈ M
+1( R
n) in the direction ν ∈ M
0( R
n)
DΨ
θk[π] ν =
defψ
kθν
π, ψ
θk− ν, ψ
kθπ, ψ
θkψ
θkπ π, ψ
kθ. If ν π and ν (dx) = (x) π(dx) then
DΨ
θk[π] ν =
− Ψ
θk[π], Ψ
θk[π] . Moreover, one has
∂
θΨ
θk[π] =
∂
θlog ψ
θk− Ψ
θk[π], ∂
θlog ψ
θkΨ
θk[π]
= DΨ
θk[π]([∂
θlog ψ
kθ] π) . Finally
˙
π
θk= ∂
θΨ
θk[π
kθ−]
= D Ψ
θk[π
θk−]( ˙ π
kθ−) +
∂
θΨ
θk[π]
π=πkθ−
= D Ψ
θk[π
θk−]( ˙ π
kθ−+ [∂
θlog ψ
θk] π
kθ−) .
Hence we can prove recursively that the tangent filter is absolutely continuous w.r.t. the nonlinear filter, i.e. π ˙
kθ−π
kθ−and π ˙
kθπ
kθ. Let
θk−(x) =
defd π ˙
kθ−dπ
kθ−(x) ,
θk(x) =
defd π ˙
θkdπ
θk(x) . This leads to
˙ π
kθ=
θk−
+ ∂
θlog ψ
kθ− π
kθ,
θk−+ ∂
θlog ψ
kθπ
kθ.
(20) 3) Score increment: Expression (17) becomes
˙
r
k(θ) = π
kθ,
θk−+ ∂
θlog ψ
kθ(21) which is exactly the centering term in (20).
The joint nonlinear/tangent filters is summarized in
TA-
BLE
II.
E. Particle approximation
We describe the simple “bootstrap” particle approxima- tion. Suppose that at time k − 1 we have particle approxi- mation of the nonlinear and tangent filters
π
k−1θπ
k−1N=
def N1 Ni=1
δ
ξi k−1,
˙
π
k−1θπ ˙
k−1N=
def N1 Ni=1
ρ
ik−1δ
ξi k−1. Note that
Ni=1
ρ
ik−1= 0, i.e. π ˙
k−1N∈ M
0(R
n). The idea of this approximation is to assure that π ˙
Nk−1π
k−1N, indeed
Nk−1
(ξ
k−1i) =
defd π ˙
θkdπ
Nk−1(ξ
ik−1) =
i:ξik−1 =ξik−1
ρ
ik−1i:ξk−1i =ξik−1
1 (22)
for i = 1 · · · N and
Nk−1(x) = 0 if x ∈ {ξ
k−1i; i = 1 · · · N } .
a – initialization
π0θ=
µ0 π˙
θ0= 0 b – prediction [see (15) and (18)]
πkθ−
=
πk−1θ Qθk−1π
˙
kθ−= ˙
πk−1θ Qθk−1+
πk−1θ Q˙
θk−1c – correction [see (16) and (20)]
πkθ
= Ψ
θk[π
θk−]
π˙
kθ=
θk−
+
∂θlog
ψkθ− πθk, θk−+
∂θlog
ψkθ πkθ hereθk−=
dπ˙
kθ−/dπθk−d – score increment [see (21)]
r
˙
k(
θ) =
πθk, θk−+
∂θlog
ψθkTABLE II
The joint nonlinear/tangent filters.
1) Prediction/sampling step: From (15), π
k−1NQ
θk−1(dx
) =
N1 Ni=1
Q
θk−1(dx
|ξ
k−1i) . We use the approximation
Q
θk−1(dx
|ξ
k−1i) δ
ξi k−(dx
)
where ξ
ki−∼ Q
θk−1(dx
|ξ
k−1i) (independently). Hence π
Nk−=
N1 Ni=1
δ
ξik−
where ξ
ki−∼ Q
θk−1(dx
|ξ
k−1i) . (23) From the tangent filter prediction (19) and (18)
˙
π
Nk−1Q
θk−1(dx
) + π
k−1NQ ˙
θk−1(dx
)
=
Q
θk−1(dx
|x) ˙ π
Nk−1(dx) +
Q
θk−1(dx
|x) [∂
θlog q
θk−1(x
|x)] π
k−1N(dx)
=
N1 Ni=1
ρ
ik−1+ ∂
θlog q
k−1θ(x
|ξ
k−1i)
× Q
θk−1(dx
|ξ
ik−1) . Again let Q
θk−1(dx
|ξ
k−1i) δ
ξik−
(dx
) so that
˙
π
Nk−=
N1 Ni=1
ρ
ik−δ
ξi k−with
ρ
ik−=
defρ
ik−1+ ∂
θlog q
k−1θ(ξ
ki−| ξ
k−1i) .
This approximation π ˙
kN−is not of null mass, it will be
“centered” in the correction step.
Again
Nk−(x) can be computed like in (22), but almost surely the particle positions ξ
ik−are all distinct, so we have
Nk−
(ξ
ki−) = ρ
ik−, i = 1 · · · N .
2) Correction/resampling step: Plugging the approxima- tion (23) in (16) gives exactly
Ψ
θk[π
kN−] =
Ni=1
ω
kiδ
ξi k−where
ω
ik= Ψ
def θk(ξ
ki−)/
Ni=1
Ψ
θk(ξ
ik−) .
The resampling step is the following: we multiply/discard particles {ξ
ki−}
i=1:Naccording to the high/low weights {ω
ik}
i=1:N, i.e. ξ
ik= ξ
ks[i]−where s [i] is the resampling mechanism associated with the weights {ω
ki}
i=1:N. The updated particle approximation is then
π
kN=
N1 Ni=1
δ
ξik
with ξ
ki= ξ
ks[i]−(24) and s is the resampling scheme associated with {ω
ki}
i=1:N. Substituting π
kθin (20) by its approximation (24) gives exactly
˙
π
Nk=
N1 Ni=1
ρ
ikδ
ξi kwhere (
2)
ρ
ik=
θk−(ξ
ik) + ∂
θlog ψ
kθ(ξ
ik) − centering term
= ρ
s[i]k−+ ∂
θlog ψ
kθ(ξ
ks[i]−) − centering term. (25) This last centering operation ensures that π ˙
kN∈ M
0(R
n).
3) Score increment: Approximation (24) in (21) leads to r
Nk(θ) = π
Nk,
θk−+ ∂
θlog ψ
kθ=
N1 Ni=1
ρ
s[i]k−+ ∂
θlog ψ
θk(ξ
ks[i]−)
which is exactly, like noticed in § IV-D.3, the centering term of (25).
The joint particle approximation of the nonlinear/tangent filters is summarized in
TABLEIII.
V. A
PPLICATIONIn (8)–(9) there are two alternate parameterizations. The first one is in terms of real/imaginary part of the λ’s (7)
θ = (α
def 1:n, β
1:n, σ, ν) ∈ R
2n× R
2+(26) where α
p=
def(λ
p) and β
p=
def(λ
p) for p = 1 · · · n. The second one is terms of frequency/damping coefficients (4)
θ = (
deff
1:n, d
1:n, σ, ν) ∈ R
n+× (0, 1)
n× R
2+. (27) If the behavior of the filter is quite equivalent in both parameterizations, the second is much simpler to use for the tuning of the parameters of the RMLE procedure.
Kalman filter formulation
Practical implementation of the algorithm describes in § III requires some adaptations. To prevent the degeneracy of the innovation covariance matrix it is necessary to reinforce the diagonal terms if S
kθin Part b of
TABLEI.
2“αi=βi−centering term” means thatαi=βi−N
i=1βi