HAL Id: hal-02184492
https://hal.archives-ouvertes.fr/hal-02184492
Submitted on 11 Jun 2020
HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.
L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.
A maximum likelihood estimator for switching linear systems with unknown inputs
Luc Meyer, Dalil Ichalal, Vincent Vigneron
To cite this version:
Luc Meyer, Dalil Ichalal, Vincent Vigneron. A maximum likelihood estimator for switch- ing linear systems with unknown inputs. Automatica, Elsevier, 2019, 108, pp.108490.
�10.1016/j.automatica.2019.07.003�. �hal-02184492�
A maximum likelihood estimator for switching linear systems with Unknown Inputs ?
Luc Meyer a , Dalil Ichalal b , Vincent Vigneron b
a
ONERA, DTIS, Universit´e Paris Saclay, 6 Chemin de la Vauve aux Granges, 91120 Palaiseau, France
b
IBISC, Univ Evry Universit´e Paris Saclay, 40, rue de Pelvoux, 91020 Courcouronnes, France
Abstract
This paper investigates the construction of an estimator for switching linear systems with unknown inputs (UI) and Gaussian noises, in the case of unknown switching sequence. First, a maximum-likelihood switching sequence estimator is constructed. Then, both state and UI estimators are proposed. An example illustrates the theoretical contributions.
Key words: State Estimation, Minimum Variance Estimation, Unknown Input, Switching Systems, Discrete-Time System
1 Introduction
A switching system is a dynamical system with both contin- uous and discrete states. Several approaches exist in order to estimate the discrete-state of such a system. If the switching sequence is known, different observers have been proposed for systems without [4] or with Unknown Inputs (UI) [6].
If the switching sequence is unknown but its dynamic fol- lows a known pattern (as a particular Markov model), then several estimators have also been proposed [7][5]. With- out any assumption made on the discrete state dynamic, [1] and [2] have proposed state estimation for discrete- time linear switching systems affected by bounded noises.
For continuous-time systems with bounded perturbations, a sliding-mode based observer is giving in [9]; in particular, this observer also deals with UI. In [3], a linear switching system affected by Gaussian noises in both state and mea- surement equations has been considered, but without the presence of UI.
The present paper focuses on discrete-time linear switching systems affected by both Gaussian noises and UI in both state and measurement equations. Its main contribution is to provide both state and UI estimators for such systems.
It is the first paper dealing with linear switching systems affected by UI in the case in which the matrix associated
?
This paper was not presented at any IFAC meeting. Correspond- ing author L. Meyer
Email addresses:
[email protected](Luc Meyer),
[email protected](Dalil Ichalal),
[email protected](Vincent Vigneron).
with the UI is not assumed to be constant. One of the main difficulty is to decouple the UI from the state estimation error despite of that fact. Another difficulty is to estimate the UI in such a case. The paper is organized as follows.
In section 2, the problem is stated. In section 3, a method used for a maximum likelihood estimation of the switching parameter is presented. In section 4, a practical algorithm for estimating both the continuous state and the UI and based on the switching parameter estimation is given. Finally, in section 5, an example illustrating the approach is given.
The following notations are used in the paper. For any matrix A ∈ M
n,p( R ), A
Tdenotes its transpose, and A
†its pseudo- inverse. For any square matrix A ∈ M
n,n( R ), tr(A) denotes its trace, det(A) denotes its determinant, λ (A) denotes the vector of its eigenvalues, and λ
min(A) (resp. λ
max(A)) de- notes its minimum (resp. maximum) singular value. For any vector v, ||v|| denotes its euclidean norm. For any symmet- ric positive matrix M, let set the norm ||.||
Msuch that, for any vector v of appropriate dimension, ||v||
M= √
v
TMv. If x is a stochastic vector, its expectancy is denoted by E[x].
For any vector sequence (v
k)
k=1,...and any integers α ≥ 0 and ω ≥ 0, let set v
k−α:k+ω= h
v
Tk−αv
Tk−α+1... v
Tk+ωi
T.
2 Problem Statement
Let consider the following switching system:
x
k+1= A
λkx
k+D
λk
d
k+ F
λkw
ky
k= C
λk
x
k+ E
λk
d
k+ v
k(1)
where x
k∈ R
nxis the continuous state, y
k∈ R
nyis the output, d
k∈ R
ndis the Unknown Input (UI), λ
k∈ Λ := {1,2, ..., n
Λ} is the unknown switching parameter (also called discrete mode or discrete state). Matrices A
λk
, D
λk
, F
λk
, C
λk
and E
λare switching matrices with appropriate dimensions. The
kperturbations w
kand v
kare independent Gaussian noises with zero mean and covariance matrices denoted by W
λkand V
λkrespectively. The unknown initial state follows a Gaussian distribution (independent of w
kand v
k) with known mean ˆ x
0and known covariance matrix P
0. Let assume that for any λ ∈ Λ, the matrix V
λis non singular, i.e. det(V
λ) 6= 0 (it is a widely used assumption for Kalman filtering).
In the whole paper, let us consider two integers α ≥ 0 and ω ≥ 0, used as time indexes: for any time step k ≥ α , the measures obtained between time steps k − α and k + ω are used for the estimation of the unknown switching parameterλ
k. For any k ≥ α, let λ
k−α:k+ωbe the switching sequence from time step k− α to time step k +ω . Then, the observation sequence y
k−α:k+ωgives:
y
k−α:k+ω= A
λk−α:k+ωx
k−α+ D
λk−α:k+ωd
k−α:k+ω+ F
λk−α:k+ωw
k−α:k+ω+ v
k−α:k+ω(2)
with matrices A
λk−α:k+ω, D
λk−α:k+ωand F
λk−α:k+ωgiven in (3). The following assumption is needed for removing the UI from equation (2).
Assumption 1 For any switching sequence π ∈ Λ
α+ω+1, the rank r
d(π ) of D
πsatisfies r
d(π ) < (α + ω + 1)n
y. For all π ∈ Λ
α+ω+1, there exists a full row rank matrix H
π∈ M
(α+ω+1)ny−rd,(α+ω+1)ny( R ) (with rank equal to r
h(π ) = (α + ω + 1)n
y− r
d(π )) such that H
πD
π= 0. Assumption 1 ensures that r
h(π) > 0. Then, if π = λ
k−α:k+ω, when left- multiplying equation (2) by H
λk−α:k+ω, it comes:
H
λk−α:k+ωy
k−α:k+ω= H
λk−α:k+ωA
λk−α:k+ωx
k−α+H
λk−α:k+ω
F
λk−α:k+ωw
k−α:k+ω+ H
λk−α:k+ωv
k−α:k+ωequation in which the unknown input does not appear any- more.
3 Maximum-likelihood switching sequence estimation
This section deals with the switching sequence estimation problem. Let us start by introducing a definition of observ- ability adapted to the considered switching system.
Definition 1 π ∈ Λ
α+ω+1and π
0∈ Λ
α+ω+1are saying to be jointly observable if:
rank h
H
π0A
π0H
π0A
πH
π0D
πi
= 2n
x+ (α +ω + 1)n
d(8)
Definition 1 does not depend on the matrix H
π0chosen under Assumption 1, as this matrix is uniquely defined modulo a left-multiplication by a non singular square matrix.
Let us consider a time step k ≥ α, and let consider the following maximization problem in the presence of a UI:
λ ˆ
k−α:k+ω∈ argmax
π∈Λα+ω+1h
max
x∈Rnxl(x, π|H
πy
k−α:k+ω) i (9) where l(x, π|H
πy
k−α:k+ω) = p(H
πy
k−α:k+ω|x,π ) is the like- lihood function to maximize.
Due to the left-multiplication of y
k−α:k+ωby H
πin (9), if π = λ
k−α:k+ωis the true switching sequence, then the likelihood l(x, λ
k−α:k+ω|H
λk−α:k+ωy
k−α:k+ω) is not a func- tion of the unknown input (see equation (8)). Moreover, the vector H
πy
k−α:k+ωis normally distributed with mean H
πA
λk−α:k+ωx
k−αand covariance matrix H
πΣ ˜
λk−α:k+ωH
πT(where for any π
0∈ Λ
α+ω+1, ˜ Σ
π0= F
π0W
π0F
πT0+V
π0).
Then, following the same line as in [3], the maximum like- lihood estimate can be obtained as:
λ ˆ
k−α:k+ω∈ argmin
π∈Λα+ω+1h
log|Σ
π| + δ (H
πy
k−α:k+ω, π) i (10) where Σ
π= H
πΣ ˜
πH
πT, and for any vector v ∈ R
rh(π), δ (v,π ) = ||(I −∆
π)v||
2Σ−1π
, with ∆
πthe projection matrix on the linear span of H
πA
π.
Given that the true continuous state is x
k−ωat time step k−ω and the true switching sequence is λ
k−α:k+ωbetween time steps k− α and k+ ω, the probability that λ
k−α:k+ωis more likely to be chosen by (10) than a discrete sequence π
0∈ Λ
α+ω+1is denoted by P (λ
k−α:k+ωπ
0|x
k−α,λ
k−α:k+ω).
The following theorem gives a lower bound of such a prob- ability.
Theorem 1 Let assumption 1 hold. Let be k ≥ α . Let π
0∈ Λ
α+ω+1be a switching-sequence. Then, the following in- equality holds:
P (λ
k−α:k+ωπ
0|x
k−α, λ
k−α:k+ω) ≥ γ
r(
k(λh(λk−α:k+ω,π0)k−α:k+ω,π0)
× (||x
k−α||
2+ ||d
k−α:k+ω||
2) +
h(λ 1k−α:k+ω,π0)
log
|Σ |Σπ0|λk−α:k+ω|
) (11) where γ
r(.) is the cumulative distribution function of a χ
2random variable with r = (α + ω +1)n
ydegrees of freedom.
Moreover for any switching sequences π ,π
0∈ Λ
α+ω+1, the coefficients h(π, π
0) and k(π, π
0) are given by equations (4), where Σ ˜
π= F
πW
πF
πT+V
π. Besides, if π and π
0are jointly observable, k(π , π
0) > 0.
PROOF. In order to simplify the notations, λ
k−ω:k+ωis de- noted by π in the following proof, but the reader has to keep
2
Aλk−α+ω=
C
λk−αC
λk−α+1A
λk−α...
C
λk+ωA
λk+ω−1...Aλk−α+1A
λk−α
,Dλk−α:k+ω=
E
λk−α0
ny,nd ...0
ny,ndC
λk−α+1D
λk−αE
λk−α+1 ...0
ny,nd... ... ... ...
C
λk+ωA
λk+ω−1...Aλk−α+1D
λk−αC
λk+ωA
λk+ω−1...Aλk−α+2D
λk−α+1 ...E
λk+ω
,
Fλk−α:k+ω =
0
ny,nv0
ny,nv ...0
ny,nv0
ny,nvC
λk−α+1F
λk−α0
ny,nv ...0
ny,nv0
ny,nv... ... ... ... ...
C
λk+ωA
λk+ω−1...Aλk−α+1F
λk−αC
λk+ωA
λk+ω−1...Aλk−α+2F
λk−α+1 ...C
λk+ωF
λk+ω−10
ny,nv
(3)
k(π
,π0) =12λmin{h Aπ DπiT
H
πT0(I−∆π0)TΣ−1π0 (I−∆π0)Hπ0h Aπ Dπ
i}
h(π,π
0) =λmax{HπT(I−∆π)TΣ−1π (I−∆π)Hπ+H
Tπ0(I−∆π0)TΣ−1π0 (I−∆π0)Hπ0}λmax{Σ
˜
π}(4)
P(πk−α:k+ωπ0|xk−α,πk−α:k+ω)≥P
z
TΣ˜
−1z
≤k(π,πh(π,π00))(||xk−α||2+||dk−α:k+ω||2) +h(π,π1 0)log
|Σ|Σπ0|π|
,
(5)
Φλk−α:k−1= h
A
λk−1...Aλk−αA
λk−1...Aλk−α+1D
k−α ...A
λk−1D
λk−2D
λk−1 i,Ψλk−α:k−1= h
A
λk−1...Aλk−α+1F
k−α ...A
λk−1F
λk−2F
λk−1 i(6)
[(Aλk−
A
ˆλk)−
K
k+1(Cλk+1A
λk−C
ˆλk+1
A
ˆλk)]xk= [(Aλk−
A
ˆλk)−
K
k+1(Cλk+1A
λk−C
ˆλk+1
A
ˆλk)]Φλk−α:k−1
x
k−αd
k−α:k−1
+[(Aλk−
A
λˆk)−K
k+1(Cλk+1A
λk−C
λˆk+1A
λˆk)]Ψλk−α:k−1w
k−α:k−1(7)
in mind that it is the actual discrete sequence. The follow- ing steps are similar to [3] in the structure, however, the technical difficulties raised by the presence of the UI makes the proof very different. Using equation (10), the probabil- ity P (π π
0|x
k−α, λ
k−α:k+ω) is equal to the probability that the following inequality holds:
log |Σ
π|+δ (H
πy
k−α:k+ω,π ) < log |Σ
π0|+δ (H
π0y
k−α:k+ω,π
0) (12) Let set z = F
πw
k−α:k+ω+ v
k−α:k+ω. Then z is normally distributed with zero mean and covariance matrix equal to Σ ˜
π. On one hand, we have:
δ (H
π0y
k−α:k+ω, π
0) = δ (H
π0A
πx
k−α+H
π0D
πd
k−α:k+ω+H
π0z,π
0).
(13) Besides:
δ (H
π0A
πx
k−α+ H
π0D
πd
k−α:k+ω, π
0)
= δ (H
π0A
πx
k−α+ H
π0D
πd
k−α:k+ω+ H
π0z − H
π0z, π
0)
≤ 2δ (H
π0y
k−α:k+ω, π
0) + 2δ (H
π0z, π
0),
(14) the last inequality coming from one of the property of the function δ (., π): δ (u − v , π ) ≤ δ (u, π) + δ (v , π ). Thus:
δ (H
π0y
k−α:k+ω, π
0)
≥
12δ (H
π0A
πx
k−α+ H
π0D
πd
k−α:k+ω, π
0) − δ (H
π0z, π
0)
≥ k(π , π
0)(||x
k−α||
2+ ||d
k−α:k+ω||
2) −δ (H
π0z,π
0), (15)
where k(π, π
0) is given by equation (4). Note, that un- der the joint observability of π and π
0, it comes that (I −∆
π0)H
π0h A
πD
πi
is full column rank. Then, it follows that k(π, π
0) > 0. On the other hand, we have:
δ (H
πy
k−α:k+ω, π ) = δ (H
πA
πx
k−α+ H
πD
πd
k−α:k+ω+ H
πz, π)
= δ (H
πz, π) (16)
as H
πD
πd
k−α:k+ω= 0, and (I − ∆
π)H
πA
π= 0. Then it comes that:
δ (H
πz, π ) + δ (H
π0z, π
0) = ||(I − ∆
π)H
πz||
2Σ−1π
+||(I − ∆
π0)H
π0z||
2Σ−1
π0
≤ h(π, π
0)||z||
2˜Σ−1π
(17)
where ˜ Σ
π= F
πW
πF
πT+V
πand h(π, π
0) is given by equa- tion (4) Finally, by combining (15), (16) and (17) it comes that
δ (H
πy
k−α:k+ω, π) −δ (H
π0y
k−α:k+ω, π
0)
≤ h(π, π
0)||z||
2˜Σ−1π
−k(π, π
0)(||x
k−α||
2+ ||d
k−α:k+ω||
2) (18) and thus a sufficient condition for equation (12) to hold is:
h(π ,π
0)||z||
2˜Σ−1π
−k(π ,π
0)(||x
k−α||
2+||d
k−α:k+ω||
2) ≤ log |Σ
π0|
|Σ
π|
(19)
which leads to equation (5), and recalling that z is nor-
mally distributed with zero mean and covariance matrix
Σ ˜
π, it comes that z
TΣ ˜
−1πz is a χ
2random variable with dim(z) = (α + ω + 1)n
ydegrees of freedom.
4 A maximum-likelihood estimator
The aim of this section is to construct a linear estimator based on the maximum likelihood switching sequence estimation developed in section 3. In order to do it, it is necessary (and also sufficient as it will be seen in Theorem 2) to decouple the UI from the state estimation error (otherwise the behavior of this error would depend on the UI behavior), whether the estimated switching parameter is the true one or not.
Assumption 2 The following rank condition holds:
rank "
C E DI 0
#!
= rank h C E
i
, (20)
where I = h
I
nΛnd... I
nλnd
i
(n
Λtimes the matrix I
nΛnd), D = h
D
1... D
nΛi , C = h
C
1D ... C
nΛD i
and E = h
E
1... E
nΛi . Note that this assumption differs from Assumption 1, as the former one is used in order to make the information of the output not affected by the UI, and thus to estimate at best the switching sequence, whereas assumption 2 is used in order to decouple the UI from the state estimation error for any choice of ˆ λ
kin Λ. Under assumption 2, there exists a matrix K such that:
K h C E
i
= h DI 0
i
, (21)
with K given by:
K = G +X H , (22)
with G = h DI 0
i h C E
i
†and H = I
ny,ny− h C E
i h C E
i
†, and X ∈ M
nx,ny( R ) is any matrix (this means that for any matrix X of appropriate dimensions, equation (21) holds with K given by (22)). In fact, X is chosen differently at each time step k ≥ α. Thus, from now, it will be denoted by X
kand K will be denoted by K
k= G +X
kH. It is worth noting that, under Assumption 2, and according to what precedes the following equation holds for any λ , λ
0∈ Λ, and any choice of X
k:
K
kD
λ,λ0= h D
λ0
i
, (23)
where D
λ,λ0= h
C
λ0D
λE
λ0i
. The following assumption is needed for the UI estimation.
Assumption 3 The following rank condition holds:
rank
C E
I
n2 Λ,nd0
= rank h C E
i
(24)
where I
n2 Λ,nd= h
I
nd... I
ndi
(n
2Λtimes the matrix I
nd).
Following the same lines as previously, there exists a matrix L
ksuch that for any λ , λ
0∈ Λ: L
kh
C
λ0D
λE
λ0i = h
I
nd0
ndi
, where L
k= J + Z
kH with J = h
I
nd0
ndi h
C
λ0D
λE
λ0i
†, and Z
kis any matrix (Z
kwill be chosen later). Saying that, the proposed state and UI estimator is the following for any k ≥ α :
λ ˆ
k−α:k+ω= argmin
π∈Λα+ωlog |Σ
π| +δ (H
πy
k−α:k+ω,π ) X
k= [(I
nx−GC
ˆλk
) P ˜
kC
Tˆλk
+GV
ˆλk
]H
T[HC
ˆλk
P ˜
kC
Tˆλk
H
T+HV
ˆλk
H
T]
−1Z
k= J(C
λk
P ˜
kC
λTk
)H
T[(H(C
λk
P ˜
kC
λTk
)H
T]
−1K
k= G + X
kH, L
k= J + Z
kH
ˆ
x
k= x ˜
k+K
k(y
k−C
ˆλk
x ˜
k), d ˆ
k−1= L
k(y
k−C
λkx ˜
k) P
k= (I
nx− K
kC
ˆλk
) P ˜
k(I
nx− K
kC
ˆλk
)
T+ K
kV
ˆλk
K
kTx ˜
k+1= A
λˆkx ˆ
k, P ˜
k+1= γ
2A
λˆkP
kA
Tˆλk
+ F
λˆkW
λˆkF
ˆTλk
.
(25) Remark 1 The global structure of algorithm (25) comes from the structure of the Kalman filter in the presence of a UI [8]. It is adapted using what precedes in order to take into account the case of an unknown switching sequence.
Moreover, the case of the UI estimation is also added here.
The matrix X
k(resp. Z
k) is constructed in order to minimize the trace of the covariance matrix P
k(resp. P
dk= E[e
Tdk
e
dk]), under the constraint of the unknown switching sequence.
In equations (25), γ is a scale factor chosen strictly greater than 1. The aim of such a factor is to enforce a certain convergence rate (more on this in the stability analysis given in [3]).
Assumption 4 For any switching sequences π ,π
0∈ Λ
α+ω+1, such that π [α + 1] 6= π
0[α + 1], the switching sequences π and π
0are assumed to be jointly observable.
Theorem 2 Let assumptions 1, 2, 3 and 4 hold. Then, equa- tions (25) define both state and UI estimators for system (1) with estimation errors e
k= x
k− x ˆ
kand e
dk= Ld
k− d ˆ
khav- ing the following dynamics: e
k+1= (I −K
k+1C
λˆk+1)A
ˆλk
e
k+ ξ
kand e
dk= −L
k+1C
ˆλk+1
A
ˆλk
e
k+ ζ
k, where ξ
kand ζ
kare bounded in mean square, i.e. there exist positive constants ρ
ξand ρ
ζsuch that for all k ≥ 0, ||ξ
k||
2≤ ρ
ξand ||ζ
k||
2≤ ρ
ζ.
PROOF. Due to space limitation, only the proof on the state estimation error e
k= x
k− x ˆ
kconvergence is established, the one of the UI estimation error e
dkfollows the same line. Using equations (25) and (23) the dynamics of the state estimation error is: e
k+1= (I − K
k+1C
ˆλk+1
)A
ˆλ
e
k+ ξ
k, where ξ
k= [(A
λk
− A
λˆk) − K
k+1(C
λk+1
A
λk− C
λˆk+1A
λˆk)]x
k+ (I − K
k+1C
ˆλk+1
)F
λkw
k− K
k+1v
k+1. Let prove the bound- edness in mean-square of ξ
k. Its second and third terms
4
are clearly bounded. Let be λ
k−α:k−1the true discrete state sequence between k − α and k − 1. It comes:
x
k= Φ
λk−α:k−1"
x
k−αd
k−α:k−1#
+ Ψ
λk−α:k−1w
k−α:k−1, where
Φ
λk−α:k−1and Ψ
λk−α:k−1are given by equation (6). Thus,
the first term of ξ
kcan be written as in equation (7). Here again, the second term is clearly bounded in mean square.
Following the same lines as in [3], let prove the bounded- ness in mean-square of Φ
λk−α:k"
x
k−αd
k−α:k#
. Given the state x
k−αand the unknown input sequence d
k−α:k+ω, the mean square error of Φ
λk−α:k−1"
x
k−αd
k−α:k−1#
is given by:
E
"
||Φ
λk−α:k−1"
x
k−αd
k−α:k−1#
||
2|x
k−α, d
k−α:k+ω, π
#
= ∑
λ0∈Λ,λ06=λkP
λ ˆ
k= λ
0|x
k−α, d
k−α:k+ω, π
||Φ
λk−α:k−1×
"
x
k−αd
k−α:k−1#
||
2.
(26) On one hand, it comes: ||Φ
λk−α:k−1"
x
k−αd
k−α:k−1#
||
2≤ Φ(||x
k−α||
2+||d
k−α:k+ω||
2),
where Φ = max
πα∈Λαλ
max{Φ
Tπα
Φ
πα}. On the other hand, it comes:
P
λ ˆ
k= λ
0|x
k−α, d
k−α:k+ω, π
= ∑
πα0∈Λα,πω0∈ΛωP
λ ˆ
k−α:k+ω= π
α0⊗λ
0⊗ π
ω|x
k−α, d
k−α:k+ω, π
!
≤ card(Λ)
α+ω"
1 −γ
(α+ω+1)ny(κ (||x
k−α||
2+||d
k−α:k+ω||
2) + ν)
#
(27) the last inequality coming from theorem 1, where κ = min
π,π0∈Λα+ω+1k(π,π0)
h(π,π0)
and ν = min
π,π0∈Λα+ω+1 1h(π,π0)
log
|Σ|Σπ|π0|
. Note that thanks to assumption 4 and theorem 1, the con- stant κ is positive. Then, using both previous items, it comes that the quantity
E
"
||Φ
λk−α:k−1"
x
k−αd
k−α:k−1#
||
2|x
k−α, d
k−α:k+ω, π
# (28)
is smaller than:
card(Λ)
α+ωΦ(||x
k−α||
2+ ||d
k−α:k+ω||
2)
× h
1 −γ
(α+ω+1)nyκ(||x
k−α||
2+ ||d
k−α:k+ω||
2) + ν i
. (29)
Then, recalling that γ
n(s) (with any positive integer n) con- verges exponentially to 1 as s tends to infinity, we can con- clude that (29) converges to 0 as ||x
k−α||
2+ ||d
k−α:k||
2tends to infinity (κ being positive). Finally, noting that (29) de- pends continuously on ||x
k−α||
2+ ||d
k−α:k+ω||
2on the inter- val [0;∞), it comes that (29) can be upper bounded by some constant independent from ||x
k−α||
2+ ||d
k−α:k+ω||
2. Remark 2 Note that in order to apply equations (25), a delay equal to ω is needed. In order to avoid such a delay, it is possible to use the following equations for i = k+1...k+ω (so that x
k+ωis estimated using y
k−α:k+ω):
X
i= [(I − GC
ˆλi
) P ˜
i+ GV
k]C
Tˆλi
H
T[HC
ˆλi
P ˜
iC
Tˆλi
H
T+ HV
iH
T]
−1Z
i= J(C
λiP ˜
iC
Tλi
)H
T[(H(C
λiP ˜
iC
Tλi
)H
T]
−1K
i= G +X
iH, L
i= J + Z
iH
ˆ
x
i= x ˜
i+K
i(y
i−C
ˆλi
x ˜
i), d ˆ
i−1= L
i(y
i−C
λix ˜
i) P
i= (I − K
iC
λˆi) P ˜
i(I −K
iC
λˆi)
T+ K
iV
iK
iT˜ x
i+1= A
ˆλ
x ˆ
i, P ˜
i+1= γ
2A
ˆλi
P
iA
Tˆλi
+ F
ˆλi
W
iF
Tˆλi
.
(30) Remark 3 The stability results stated in [3] remain true by considering X
kinstead of K
k(as in [3]) for the gain matrix, and (I −GC
ˆλk+1
−X
kHC
ˆλk+1
)A
ˆλk
instead of (I −K
kC
ˆλk+1
)A
ˆλk
(as in [3]) for the transition matrix of the state error vector, as well as some minor adaptations. They are not detailed here for a lack of space.
5 Illustrative Example
Let consider the following academic example (adapted from [3]) of the form (1) where λ ∈ Λ = {1; 2}, ∆ = 0.1, A
1=
"
cos(ω
1∆) −ω
1sin(ω
1∆)
1
ω1
sin(ω
1∆) cos(ω
1∆)
#
, with ω
1= 0.5,
A
2=
"
cos(ω
2∆) −ω
2sin(ω
2∆)
1
ω2
sin(ω
2∆) cos(ω
2∆)
#
, with ω
2= 1, D
1=
"
0 1
# ,
D
2=
"
0 2
#
, F
1= F
2= I
2, C =
0 1 2 1 1 0
, E =
0 1 0
, w
kand v
kare Gaussian noises with covariance matrices W = 0.01I
nxand V = 0.01I
nyrespectively, the unknown input is set to
d
k= 3 sin(π /100k) for k ≥ 0, the simulation is launched for
400 time steps, with γ = 1.1. By setting α = 3 and ω = 2, we
can check that assumptions 1, 2 and 4 are satisfied. The De-
layed Maximum Likelihood Estimator with UI (DMLEUI)
of equations (25) is compared with its Undelayed version
presented in remark 2 (’UMLEUI’), and with the Switching
Kalman Filter with Perfect Information (’SKFPI’), i.e. the
same filter applied with the true switching parameter at each
time step. Table 1 gives the performance of each estimator in
terms of the Root Mean Square Error (RMSE), defined for
the ith component by RMSE(x[i]) = s
1 400
399
∑
k=0
(x
k[i] − x ˆ
k[i]).
It is clear that the SKFPI is only slightly better to the DM- LEUI. This is due to the possible error made by estimating the switching sequence in the MLEUI estimator. Besides, this error increases when there are less measurements avail- able as in the case of the UMLEUI, which explains the slightly worse performance of this former.
Table 1
RMSE (average on 100 simulations)
RMSE on... SKFPI DMLEUI UMLEUI
x[1] 0.1000 0.1000 0.7011
x[2] 5.1225 5.1689 5.2434
6 Conclusions
The main contribution of this paper is the development of an observer for switching systems affected by Gaussian noises and Unknown Inputs (UI) with unknown switching sequence. The main contribution holds in the way of dealing with the UI in order to decouple it from the state estimation error. First, a method that estimates the switching parameter with maximum likelihood is proposed despite the presence of a UI. Then, this method is used in both state and UI es- timators.
References
[1] Angelo Alessandri, Marco Baglietto, and Giorgio Battistelli. Receding- horizon estimation for switching discrete-time linear systems. IEEE Transactions on Automatic Control, 50(11):1736–1748, 2005.
[2] Angelo Alessandri, Marco Baglietto, and Giorgio Battistelli.
Luenberger observers for switching discrete-time linear systems.
International Journal of Control, 80(12):1931–1943, 2007.
[3] Angelo Alessandri, Marco Baglietto, and Giorgio Battistelli. A maximum-likelihood kalman filter for switching discrete-time linear systems. Automatica, 46(11):1870–1876, 2010.
[4] Angelo Alessandri and Paolo Coletta. Switching observers for continuous-time and discrete-time linear systems. In Proceedings of the 2001 American Control Conference.(Cat. No. 01CH37148), volume 3, pages 2516–2521. IEEE, 2001.
[5] Yaakov Bar-Shalom, Subhash Challa, and Henk AP Blom. Imm estimator versus optimal estimator for hybrid systems. IEEE Transactions on Aerospace and Electronic Systems, 41(3):986–991, 2005.
[6] Francisco Javier Bejarano, Alessandro Pisano, and Elio Usai. Finite- time converging jump observer for linear switched systems with unknown inputs. IFAC Proceedings Volumes, 42(17):352–357, 2009.
[7] Oswaldo Luiz V Costa and S Guerra. Stationary filter for linear minimum mean square error estimator of discrete-time markovian jump systems.IEEE Transactions on Automatic Control, 47(8):1351–
1356, 2002.
[8] Mohamed Darouach, Michel Zasadzinski, and Mohamed Boutayeb.
Extension of minimum variance estimation for systems with unknown inputs. Automatica, 39(5):867–876, 2003.
[9] H´ector R´ıos, Jorge Davila, and Leonid Fridman. High-order sliding mode observers for nonlinear autonomous switched systems with unknown inputs.Journal of the Franklin Institute, 349(10):2975–3002, 2012.