• Aucun résultat trouvé

New Method for Identifying Finite Degree Volterra Series

N/A
N/A
Protected

Academic year: 2021

Partager "New Method for Identifying Finite Degree Volterra Series"

Copied!
14
0
0

Texte intégral

(1)

HAL Id: hal-00182924

https://hal.archives-ouvertes.fr/hal-00182924

Submitted on 29 Oct 2007

HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.

New Method for Identifying Finite Degree Volterra Series

Wael Suleiman, André Monin

To cite this version:

Wael Suleiman, André Monin. New Method for Identifying Finite Degree Volterra Series. Automatica,

Elsevier, 2008, 44 (2), pp.488-497. �10.1016/j.automatica.2007.06.007�. �hal-00182924�

(2)

New Method for Identifying Finite Degree Volterra Series ⋆

Wael SULEIMAN and Andr´e MONIN

LAAS - CNRS, University of Toulouse 7 avenue du Colonel Roche, 31077 Toulouse cedex 4

France

Abstract

In this paper, the identification of a class of nonlinear systems which admits input-output maps described by a finite degree Volterra series is considered. In actual fact, it appears that this class can model many important nonlinear multivariable processes not only in engineering, but also in biology, socio-economics, and ecology.

To solve this identification problem, we propose a method based on a local gradient search in a local parameterization of the state space realization of finite degree Volterra series with infinite horizon. Using the local parameterization not only reduces the amount of the gradient calculations to the minimal value, but also overcomes the nonuniqueness problem of the optimal solution.

Moreover, we propose a sequential projection method to provide an initial estimation of the parameters of finite degree Volterra series realization. This estimation is used to initialize the gradient search method.

Key words: Identification; nonlinear systems; Volterra series; optimization.

1 Introduction

During the last thirty years, the development of identifica- tion theory of dynamic systems has been a subject of ac- tive research, such a development is the result of the neces- sity of reliable models in process control, aerospace appli- cations, biomedical systems, ecology, physiology, biology, and socio-economics (Lakshmikantham, 1988; Nijmeijer &

van der Schaft, 1996; Sastry, 1999; Schetzen, 1981; Koren- berg & Hunter, 1990; Schuppen & Jan, 2004).

The developments of linear subspace identification methods have recently offered a significantly practical tool to deal with the identification of multivariable linear or pseudo lin- ear system (S¨oderstr¨om & Stoica, 1989; Moonen et al., 1989;

Overschee & Moor, 1994; Viberg, 1995; Ljung, 1999).

However, in practice, the identification of nonlinear multi- variable systems is becoming crucial since many systems in nature are nonlinear. In the theory of nonlinear systems, the term ”nonlinear” defines a class of systems for which the linear approximation fails to be an efficient model able to capture the dynamic of the system. The class of nonlinear

⋆ This paper was not presented at any IFAC meeting. Correspond- ing author M. Wael. SULEIMAN. Tel. +33 56 133 7912. Fax. +33 56 133 6969.

Email addresses: [email protected] (Wael SULEIMAN), [email protected] (Andr´e MONIN).

systems is a complex class, and defining an universal repre- sented model of this class is a very complicated task.

Therefore, anyone interested in nonlinear modeling is com- pelled to focus on specific model classes (Billings, 1980;

Haber & Unbeheauen, 1990; Lee, 1998; Mathews & Sicu- ranza, 2000; Pearson, 1995, 2000; Doyle et al., 2001).

In this paper, we focus on the representation of nonlinear system by Volterra series with infinite horizon (Volterra, 1930; Brockett, 1972, 1976). The expansion of a finite degree Volterra series has the following form

y

t

= ∑

l

n=1

τ1=0 τ1

τ2=0

. . .

τn−1

τn=0

w

n

1

, τ

2

, . . . , τ

n

)

×u

t−τ1

u

t−τ2

⊗ . . . ⊗u

t−τn

+ v

t

(1)

where u

t

∈ R

m

is the input signals, y

t

∈ R

p

is the output signals. The measurement noise v

t

is assumed to be a white- noise that is independent of the input signal and with zero mean, and ⊗ denotes the Kronecker product. The functions w

n

1

, τ

2

, . . . , τ

n

) ∈ R

p×mn

denote the Volterra kernels. As the last term in the series involves the l

th

kernel, they will be called Volterra series of degree l with infinite horizon.

This class of Volterra series is dense, in the L

2

sense, in nonlinear analytic input-output systems (Brockett, 1976).

Approximating nonlinear systems by Volterra series has been used in many research areas, such as filtering (Monin

& Salut, 1996), predictive control (Allg¨ower & Zheng,

(3)

2000), ecology (Takeuchi, 1996), biology (Thieme, 2003), and system control (Nijmeijer & van der Schaft, 1996;

Doyle et al., 2001).

In reality, the direct calculation of the Volterra kernels from Equation (1) is not feasible in practice. This because it in- volves the integral past of input signal and the size of the matrices which should be manipulated increases exponen- tially.

For that, the horizon of finite degree Volterra series is fixed (Rugh, 1981; Schetzen, 1989). The obtained class is called truncated finite degree Volterra series of degree l.

It has the following form y

t

= ∑

l

n=1 T

τ1=0 τ1

τ2=0

. . .

τn−1

τn=0

w

n

1

, τ

2

, . . . , τ

n

)

×u

t−τ1

⊗u

t−τ2

⊗. . . ⊗ u

t−τn

+ v

t

(2)

where T is the length of the considered horizon. In this case, the estimation of Volterra kernels becomes trivial and can be obtained by least-square methods. However, if we consider that the Volterra kernels are fully parameterized, then the number of parameters which should be estimated is the following

1

C (l, T ) = p (

l

n=1

(T + n)!

n! T ! m

n

)

(3) recall that p and m are the dimensions of the output and input signals respectively. It is clear that C(l, T ) increases exponentially with respect to l and T . For that, in the case of multivariable systems, we are obligated to operate high dimensional matrices, which are often ill-conditioned (Nowak & Veen, 1994).

The main contribution of this paper is defining a new method to identify a state-space realization of finite degree Volterra series with infinite horizon (1). The initial estimation of the realization’s parameters is obtained by a sequential projec- tion method that we have developed thanks to the recursive property of the realization structure, then the realization pa- rameters are optimized using a local gradient search method.

The paper is organized as follows. In Section 2, a realiza- tion of finite degree Volterra series in finite dimension state space representation is defined. In Section 3, the output error identification problem is formulated and the structure’s pa- rameters are chosen. A local parameterization of the realiza- tion of finite degree Volterra series is developed in Section 4.

In Section 5, we propose a sequential projection method to calculate an initial estimation of the structure’s parameters.

In Section 6, we summarize the algorithm of identification.

Finally, Section 7 presents some illustrative examples and a comparison with some system identification methods for nonlinear systems.

1

See Appendix.A for the details of calculation.

1.1 General notations

As a general rule in this paper, selecting elements of matrices is done using MATLAB

TM

standard matrix operations, e.g.

M(:, i : j) stands for the sub-matrix of the matrix M which contains the columns from the i

th

to j

th

columns.

2 Realization of a finite degree Volterra series

(Brockett, 1976) has proved that any observable realization of a finite Volterra series of degree l can be approximated by an recursive realization of the form

Z

t1

= A

1

Z

t−11

+ B

1

u

t

, Z

01

= 0 Z

t2

= A

2

Z

t−12

+ B

2

(u

t

⊗Z

t1

) , Z

02

= 0

.. .

Z

tl

= A

l

Z

t−1l

+ B

l

(u

t

Z

tl−1

) , Z

0l

= 0 y

t

=

l

i=1

C

i

Z

ti

+ v

t

(4)

The direct relation between y

t

and u

t

, can be obtained by using the property FG ⊗HJ = (F ⊗ H) (G ⊗J) and a simple development of (4) leads to

y

t

=

l

i=1 t−1

τ1=0 τ1

τ2=0

. . .

τi−1

τi=0

C

i

Φ

i

1

, . . . , τ

i

)

×u

t−τ1

u

t−τ2

. . . ⊗u

t−τi

+ v

t

(5)

where

Φ

1

1

) = (A

1

)

τ1

B

1

and for i ≥ 2:

Φ

i

1

, τ

2

, . . . , τ

i

) = (A

i

)

τ1

B

i

I

m

⊗ Φ

i−1

2

, . . . , τ

i

) (6)

By comparing (1) and (5), we observe that Realization (4) leads to Volterra series similar to (1), in which we suppose that u

t

= 0 for t ≤ 0, and the Volterra kernels are approxi- mated by a sum of matrix products.

In fact, this approximation depends on the dimensions of the states {Z

ti

: i = 1, 2,· · · , l}, and it is clear that it becomes better if we increase the dimensions of states. On the other hand, for practical purposes these dimensions should be cho- sen as lower as possible in order to find a compromise be- tween the approximation and numerical complexity issues.

Obviously, Realization (4) is not unique, assume that the states of Structure (4) transform into X

ti

= T

i

−1

Z

ti

where

T

i

∈ R

ni×ni

nonsingular matrices, and n

i

is the dimension of

(4)

the state Z

ti

. The structure becomes X

t1

= A ¯

1

X

t−11

+ B ¯

1

u

t

X

t2

= A ¯

2

X

t−12

+ B ¯

2

(u

t

X

t1

)

.. .

X

tl

= A ¯

l

X

t−1l

+ B ¯

l

(u

t

⊗X

tl−1

) y

t

=

l

i=1

C ¯

i

X

ti

+v

t

where

"

A ¯

i

B ¯

i

C ¯

i

0

#

=

"

T

i

−1

A

i

T

i

T

i

−1

B

i

T

mi−1

C

i

T

i

0

# (7)

and {T

mj

: j = 0, 1,2, · · · ,l − 1} are defined as follows T

mj

= I

m

T

j

, T

m0

= I

m

: I

m

is the identity matrix Lemma 1 The realization of finite degree Volterra series (4) is asymptotically stable if and only if:

For i = 1, 2, · · · , l : ρ(A

i

) < 1 (8) where ρ (A

i

) stands for the spectral radius of A

i

.

PROOF. Each subsystem may be viewed as a linear sys- tem with input (u

t

Z

ti−1

). So, Z

ti

will be asymptotically stable if and only if the linear system is stable ρ(A

i

) < 1 and u

t

Z

ti−1

is bounded. Assuming that Z

t1

is stable

ρ (A

1

) < 1

, so u

t

Z

t1

is bounded and Z

t2

is stable if and only if ρ(A

2

) < 1

. And so on.

3 Output error identification

Our goal is to determine the coefficient matrices of Structure (4). Assume that all matrices are fully parameterized, so the structure’s parameters can be given by

θ =

vec(A

1

) vec(B

1

) vec(C

1

)

.. . vec(A

l

) vec(B

l

) vec(C

l

)

(9)

where vec(.) denotes the vectorization operator defined as follows

vec : M ∈ R

m×n

→ R

m·n

vec(M) = vec h

m

1

m

2

· · ·m

n

i = h

m

T1

m

T2

· · · m

Tn

i

T

Given the input u

t

and output y

t

of the real system, the model corresponding to θ can be given as follows

Z ˆ

t1

= A

1

(θ ) Z ˆ

t−11

+ B

1

(θ )u

t

Z ˆ

t2

= A

2

(θ ) Z ˆ

t−12

+ B

2

(θ )(u

t

Z ˆ

t1

) .. .

Z ˆ

tl

= A

l

(θ ) Z ˆ

t−1l

+B

l

(θ)(u

t

Z ˆ

tl−1

) ˆ

y

t

(θ) =

l

i=1

C

i

(θ ) Z ˆ

ti

(10)

Note that as Z

ti

depends on θ , the mapping ˆ y

t

(θ ) is nonlinear with θ . Our goal is achieved if the output ˆ y

t

(θ) approximates the output of the real system accurately. This criterion can be transformed into the minimization of the output error with respect to the parameters θ . Considering a data of length N, the output-error cost function is given by

J

N

(θ) = 1 N

N

k=1

ky

k

y ˆ

k

(θ)k

22

= 1

N E

N

(θ )

T

E

N

(θ ) (11) where

E

N

(θ) =

e(1)

T

e(2)

T

· · · e(N)

T

T

(12) is the error vector in which e(k) = y

k

y ˆ

k

(θ). The minimiza- tion of (11) is clearly a nonlinear, nonconvex optimization problem. The numerical solution of this problem can be cal- culated by different algorithms, e.g. gradient search method (Levenberg-Marquard method) is a popular one. This itera- tive method is based on the updating of the system param- eters θ as follows

θ

i+1

= θ

i

− (ψ

NT

i

N

i

) + λ

i+1

I)

−1

ψ

NT

i

)E

N

i

) (13) Where λ

i

is the regularization parameter and

ψ

N

(θ ) , ∂ E

N

(θ )

∂ θ

T

(14)

is the jacobian of the error vector E

N

(θ ). As we mentioned in Section (2), the structure (4) is not unique. As a consequence, the minimization of J

N

(θ ) does not have a unique solution.

The nonuniqueness solution of θ is the consequence of the full parameterization of the matrices of the realization.

However, the optimal solution can be made unique by choos-

ing a suitable canonical parameterization. As each subsys-

tem of the realization can be considered as a linear system,

one could use a classical parameterization of the various

parameterization proposed for the linear systems. Unfortu-

nately, these parameterizations are not numerically robust

(5)

(McKelvey & Helmersson, 1997).

To overcome the nonuniqueness problem of the optimal θ and keep the full parameterization of the matrices, Ribarits et al (Ribarits et al., 2004) have proposed a method for the linear systems, in which the directions that do not change the cost of output error function are identified and projected out at each iteration.

For that only the active parameters are updated. In analogues way, we will define a local parameterization of the realiza- tion of finite degree Volterra series (4) in order to define the directions in which the cost function J

N

(θ ) does not change.

4 Local parameterization

Recall that two realizations of a finite degree Volterra se- ries (4) are similar if their coefficient matrices are related by Equation (7), where the transformation matrices {T

i

: i = 1, 2,· · · , l} parameterize the subset of equivalent models.

The obtained subset defines a manifold. In order to iden- tify the tangent plane of this manifold at (A

i

, B

i

,C

i

: i = 1, 2, · · · , l), we linearize Relation (7) around the identity ma- trices. Considering a small perturbation T

i

= I

ni

+ ∆T

i

, we suppose that ρ ( ∆T

i

) ≪ 1 then by using the approximation

I

ni

+ ∆T

i

−1

I

ni

− ∆T

i

and neglecting all second order terms, we obtain

"

A ¯

i

B ¯

i

C ¯

i

0

#

=

"

A

i

B

i

C

i

0

# +

"

− ∆ T

i

A

i

+ A

i

T

i

− ∆ T

i

B

i

C

i

∆T

i

0

#

+

"

0 B

i

∆T

mi−1

0 0

# (15)

By defining

Λ

ij

, h

0

nj×(i−1)nj

I

nj

0

nj×(m−i)nj

i

it is possible to write ∆T

mj

as follows

∆T

mj

=

m

i=1

Λ

ijT

∆T

j

Λ

ij

If we consider the following vectors of parameters

θ =

vec(A

1

) vec(B

1

) vec(C

1

)

.. . vec(A

l

) vec(B

l

) vec(C

l

)

and θ ¯ =

vec( A ¯

1

) vec( B ¯

1

) vec( C ¯

1

)

.. . vec( A ¯

l

) vec( B ¯

l

) vec( C ¯

l

)

(16)

the relation between θ and ¯ θ can be obtained using the property vec(ABC) = C

T

A

vec(B)

θ ¯ = θ + M

θ

vec( ∆T

1

) .. . vec( ∆T

l

)

(17)

where for 1 ≤ jl −1

M

θ

:, 1 +

j−1

k=1

n

2k

:

j

k=1

n

2k

!

=

0

α

j×n2j

A

j

T

I

nj

+ I

nj

A

j

B

j

T

I

nj

I

nj

⊗C

j

0

n2

j+1×n2j m

i=1

Λ

ijT

B

j+1

Λ

ijT

0

γ

j×n2j

and

M

θ

:, 1 +

l−1

k=1

n

2k

:

l

k=1

n

2k

!

=

0

α

l×n2l

A

l

T

I

nl

+ I

nl

A

l

B

l

T

I

nl

I

nl

⊗C

l

 (18) with

α

j

= ∑

i=1j−1

n

i

(n

i

+ m n

i−1

+ p) γ

j

= p n

j+1

+ ∑

li=j+2

n

i

(n

i

+ m n

i−1

+ p)

Lemma 2 The left null space of M

θ

(18) defines a basis of the directions in which the parameters should be modified to lead a change in the value of cost function J

N

(θ ).

PROOF. Equation (17) shows that, the tangent space of the manifold of equivalent realizations at (A

i

, B

i

,C

i

: i = 1, 2, · · · ,l) is equal to the column space of the matrix M

θ

(18). Since the left null space of the matrix M

θ

is orthogonal complement to the column space, the directions in which the value of the cost function J

N

(θ) changes are those related

to left null space of M

θ

.

Let the QR decomposition of M

θ

be given by

M

θ

= h Q

1

Q

2

i

"

R 0

#

(19)

(6)

then a basis of the null space of M

θ

is Q

2

.

Thus, the update rule should be modified such that we project out the directions in which the cost function does not change.

The new update rule becomes

θ

i

= θ

i−1

−Q

2

Q

T2

ψ

NT

ψ

N

Q

2

+ λ

i

I

−1

Q

T2

ψ

NT

E

N

(20) where Q

2

, ψ

N

and E

N

depend on θ

i−1

. Note that since Q

2

depends on the past parameter θ

i−1

, the QR decomposition (19) must be computed at each iteration.

4.1 Computing the iterative parameter update

In order to compute the update rule (20), the quantities E

N

(θ ) and ψ

N

(θ ) should be computed. Computing the vec- tor E

N

(θ) can be done by simulating the system (10) with θ = θ

i−1

. At the same time, this simulation brings out the states { Z ˆ

ti

: i = 1, 2,· · · ,l}. In order to simulate the gradi- ent ψ

N

i−1

), we should compute the derivative of ˆ y

t

with respect to θ

i−1

. Let us define

ζ

t,kj

= ∂ Z ˆ

tj

∂ θ

k

(21) then the computation of

∂ θyˆtT

= h

yˆt

∂ θ1

· · ·

∂ θyˆt

q

i

, where q is the number of parameters in θ , can be made as follows

ζ

t,k1

=A

1

ζ

t−1,k1

+ ∂ A

1

∂ θ

k

Z ˆ

t−11

+ ∂ B

1

∂ θ

k

u

t

ζ

t,k2

=A

2

ζ

t−1,k2

+ ∂ A

2

∂ θ

k

Z ˆ

t−12

+ ∂ B

2

∂ θ

k

(u

t

Z ˆ

t1

) +B

2

(u

t

⊗ ζ

t,k1

) .. .

ζ

t,kl

=A

l

ζ

t−1,kl

+ ∂ A

l

∂ θ

k

Z ˆ

t−1l

+ ∂ B

l

∂ θ

k

(u

t

Z ˆ

tl−1

) + B

l

(u

t

⊗ ζ

t,kl−1

)

y ˆ

t

∂ θ

k

=

l

j=1

C

j

ζ

t,kj

+ ∂C

j

∂ θ

k

Z ˆ

tj

(22) In fact, if we consider the local parameterization of finite degree Volterra realization, then computing the update rule (20) can be done without calculate ψ

N

first. Let us define

N

, ψ

N

Q

2

= ∂ E

N

∂ θ

T

Q

2

(23) So, the update rule can be expressed as follows

θ

i

= θ

i−1

−Q

2

TN

N

i

I

−1

TN

E

N

(24)

Consider the t p elements of the s

th

column of Ω

N

N

((t − 1) p + 1 : t p, s) =

q

k=1

y ˆ

t

∂ θ

k

Q

2

(k, s) (25)

In order to calculate this sum, let us define ζ

tj

=

q

k=1

Z ˆ

tj

∂ θ

k

Q

2

(k, s) (26)

Using (22), the computation of (25) can be done as follows ζ

t1

=A

1

ζ

t−11

+ ∆A

1

Z ˆ

t−11

+ ∆B

1

u

t

ζ

t2

=A

2

ζ

t−12

+ ∆A

2

Z ˆ

t−12

+ ∆B

2

(u

t

Z ˆ

t1

) + B

2

(u

t

⊗ ζ

t1

) .. .

ζ

tl

=A

l

ζ

t−1l

+ ∆A

l

Z ˆ

t−1l

+ ∆B

l

(u

t

Z ˆ

tl−1

) +B

l

(u

t

⊗ ζ

tl−1

)

q

k=1

y ˆ

t

∂ θ

k

Q

2

(k, s) =

l

j=1

C

j

ζ

tj

+ ∆C

j

Z ˆ

tj

(27) where

∆A

j

=

q

k=1

A

j

∂ θ

k

Q

2

(k,s) : j = 1, 2, · · · ,l (28) and ∆B

j

, ∆C

j

are defined analogously. These matrices can be obtained from

vec( ∆A

1

) vec( ∆B

1

) vec( ∆C

1

)

.. . vec( ∆A

l

) vec( ∆B

l

) vec( ∆C

l

)

= ∑

q

k=1

∂ θ

∂ θk

Q

2

(k, s)

=

q

k=1

e

k

Q

2

(k, s)

(29)

where

e

k

= [0 . . . 0 1

k

0 . . . 0]

T

Note that the number of columns of Ω

N

is smaller than that of ψ

N

because the first one is the number of active parameters and the second one is the total number of parameters.

5 Computing an initial estimation

The reason why a good initial estimation is crucial is that

the local gradient search method converges to the nearest

local optimum in the neighborhood of initial guess of solu-

tion. Moreover, the initial estimation should also provide a

stable realization that verifies the constrain (8).

(7)

In this section, we propose a sequential projection method in order to provide an initial estimation of the realization’s vector of parameters (θ). The basic idea is to use the con- secutive property of finite degree Volterra series realization to define a sequential projection procedure. Recall that, the error introduced in the projection process is due to the non orthogonality of the subprocesses (linear, quadratic, cubic,

· · · ), and as well the effect of the noise on the observed out- put signal. The algorithm of projection can be resumed as follows

(1) Estimate the best linear stable approximation (the linear subsystem) of the nonlinear system.

(2) From the simulation of the linear subsystem, we project the residual on the class of 2-degree subsystems.

(3) From the simulation of linear + 2-degree subsystems, we project the residual on the class of 3-degree sub- system.

(4) Etc.

Moreover, this projection procedure yields an estimation of the dimensions of the states {Z

ti

: i = 1, 2,· · · , l}, which means the set {n

i

: i = 1, 2, · · · , l}. Furthermore, it yields the degree of the realization (l) by defining a precision criterion as it will appear in the sequel.

5.1 Identification of linear subsystem

Estimating the best linear stable approximation of nonlinear system can be formulated as a minimization problem

min L

1N

= 1 N

N

t=1

k y

t

y

1t

k

22

where

Z

t1

= A

1

Z

t−11

+ B

1

u

t

y

1t

= C

1

Z

t1

with the constraint ρ A

1

< 1

(30)

The parameters of the optimization problem, which should be estimated, are the triple (A

1

, B

1

,C

1

). The above mini- mization problem can be transformed into the following ma- tricial form (Gopinath, 1969; DeMoor et al., 1988)

min k Y

α,N

− Γ

1α

Z

0,N1 −α

− Φ

1α

U

α,N

k

2F

subject to

ρ A

1

< 1

(31)

where k . k

F

denotes Frobenius norm. The input and output

are stocked in Hankel matrices form

U

α,N

,

u

1

u

2

· · · u

N−α+1

u

2

u

3

· · · u

N−α+2

.. . .. . . .. .. . u

α

u

α+1

· · · u

N

where α and N refer to the number of rows in the matrix and data length respectively. The output Hankel matrix Y

α,N

is defined analogously to U

α,N

. The matrices Γ

1α

, Φ

1α

and Z

0,N−α1

are defined as follows

Γ

1α

=

C

1

C

1

A

1

.. . C

1

(A

1

)

(α−1)

Φ

1α

=

C

1

B

1

0 0 · · · 0 C

1

A

1

B

1

C

1

B

1

0 · · · 0 .. . . .. ... ... ...

C

1

(A

1

)

(α−1)

B

1

· · · · C

1

B

1

Z

0,N−α1

= h

Z

01

Z

11

· · · Z

N−α1

i

.

Note that the number of rows (α ) should be roughly chosen to be greater than the expected linear system order n

1

. This condition guarantees that the extended matrix of observabil- ity Γ

1α

has a full rank. Our objective is to estimate the ma- trices Γ

1α

and Φ

1α

. Since only the data matrices U

α,N

and Y

α,N

are known, instead of solving the minimization prob- lem (30) with respect to Γ

1α

and Φ

1α

, we can transform it into an equivalent one involving only the matrix Γ

1α

as follows

min k Y

α,N

Π

Uα,NT

− Γ

1α

Z

0,N−α1

Π

Uα,NT

k

2F

subject to

ρ A

1

< 1

(32)

where Π

Uα,NT

is the orthogonal projection onto the nullspace of U

α,N

Π

Uα,NT

= I −U

α,NT

U

α,N

U

α,NT

−1

U

α,N

(33) such that

U

α,N

Π

Uα,NT

= 0 (34)

The inverse of the matrix

U

α,N

U

α,NT

exists if the input is

persistently exciting and N > mα. In fact, the origin of the

idea of subtracting the term that involves the input signal

(8)

belongs to the direct 4SID method, which is a classical sub- space method (DeMoor et al., 1988).

To solve the problem (32), one could use the Singular Value Decomposition (SVD). Recall that, the extended observabil- ity matrix ( Γ

1α

) is a full rank matrix. Consider the following SVD

Y

α,N

Π

Uα,NT

= h Q

s

Q

n

i

"

S

s

0 0 S

n

# "

V

T

s

V

T

n

#

(35)

where the matrix S

s

contains the principals singular values (further than threshold).

The dimension of this matrix yields n

1

(the dimension of Z

t1

).

The estimation of the matrix Γ

1α

can be done by solving the following problem of minimization

min k Γ

1α

− Q

s

k

2F

subject to

ρ A

1

< 1

(36)

By using the property

Γ

1α

(1 : (α − 1)p, :)A

1

= Γ

1α

(p + 1 : α p, :) (37) and replacing the stability condition ρ(A

1

) < 1 by equivalent Lyapunov inequalities

ρ(A

1

) < 1 ⇐⇒ ∃P ≥ δ I

n1

: PA

1

PA

1T

≥ δ I

n1

where δ > 0, we can transform the minimization problem (36) into the following one

min J A

1

,k L

A1

Q

s

− Q

s

A

1

R

A1

k

2F

subject to

"

P − δ I

n1

X

T

X P

#

≥ 0

2n1

(38)

where

Q

s

, Q

s

(p +1 : α p, :) Q

s

, Q

s

(1 : (α −1) p, :) X , A

1

P

(39)

If we let R

A1

, P and L

A1

be equal to the identity ma- trix, the problem is converted to an optimization problem that involves minimizing a linear function over symmetric cones and can be solved using SeDuMi MATLAB

TM

pack- age (Sturm, 1999) (see (Lacy & Bernstein, 2003) for more details).

Solving the minimization problem (38) provides an estima- tion of a stable matrix A

1

. The matrix C

1

can be estimated

using the following formula

C

1

=

α−1

k=0

Q

s

(k p + 1 : (k + 1)p,:)

! Λ

T

A1

Λ

A1

Λ

T

A1

−1

(40) where Λ

A1

, I

n1

+∑

α−1i=1

A

1

i

. Note that as A

1

verifies that ρ(A) < 1, Λ

A1

is a full rank matrix and the matrix inverse exists. The matrix Φ

1α

can be estimated by considering the least-squares solution to the overdetermined system of equa- tions

Q

T

n

Y

α,N

U

α,NT

U

α,N

U

α,NT

−1

= Q

T

n

Φ

1α

(41)

Finally, the matrix B

1

can be easily calculated from the estimation of Φ

1α

.

5.2 Identification of higher order subsystems

After using the procedure described in the previous section, an estimation of (A

1

,B

1

,C

1

) is available. Calculating the state Z

t1

and the output y

1t

for t = 1, .., N of the linear sub- system can be done by evaluating the following model

Z

t1

= A

1

Z

t−11

+B

1

u

t

y

1t

= C

1

Z

t1

The next task is to project the residual on the class of 2- degree subsystem. We define the residual as follows

˜

y

t

= y

t

y

1t

(42) The estimation of the best 2-degree subsystem can be done by solving the following optimization problem

min L

2N

= 1 N

N

t=1

k y ˜

t

−y

2t

k

22

where

Z

t2

= A

2

Z

t−12

+ B

2

u

t

Z

t1

y

t2

= C

2

Z

t2

with the constraint ρ A

2

< 1

(43)

As u

t

Z

t1

is computed thanks to the previous step, this minimization problem is similar to (30). So, by using an analogous logic, we obtain an estimation of the matrices A

2

, B

2

,C

2

.

Then, this procedure is reiterated until we get an appreciated

precision. Defining this precision criterion is the objective

of the following section.

(9)

5.3 Determining the realization’s degree

Consider that, the subsystems of order inferior to j are avail- able. In order to verify that the nonlinear system is accu- rately fitted by the finite degree Volterra series realization, we evaluate the following criterion

ˆ y

t

=

j

i=1

y

it

1 − ∑

Nt=1

(y

t

y ˆ

t

)

T

(y

t

y ˆ

t

)

t=1N

(y

t

y) ¯

T

(y

t

y) ¯

!

×100 ≥ η (44)

where η is a user-defined constant and ¯ y denotes the mean of the output signals defined as follows

¯ y = 1

N

N

t=1

y

t

(45)

If the criterion (44) is true, the algorithm stops and the real- ization’s degree is j. Otherwise we estimate the subsystem of order j + 1.

Note that the measurement noise v

t

should be considered when we define η as follows

0 < η < 1 − ∑

Nt=1

v

Tt

v

t

Nt=1

(y

t

y) ¯

T

(y

t

y) ¯

!

× 100 (46)

6 Identification algorithm

We can resume the algorithm of identification as follows (1) Calculate an initial estimation of the realization’s

parameters using the sequential projection de- scribed in Section 5. Recall that, this procedure yields the realization degree (l), the dimensions of the states Z

ti

{n

i

: i = 1, 2, · · · , l}, and the matrices A

i

,B

i

,C

i

: i = 1, 2,· · · , l . The estimated vector of parameters θ

0

is used as an initial guess for the opti- mization process and k ← 0.

(2) Calculate the states {Z

ti

: i = 1, 2, · · · ,l} and ˆ y

t

by sim- ulating the system (10) with θ = θ

k

.

(3) Compute E

N

(θ ) using (12).

(4) Calculate the matrix M

θ

using (18).

(5) Calculate the QR decomposition of M

θ

(19), from which we obtain Q

2

.

(6) Calculate ∆A

j

, ∆B

j

and ∆C

j

: j = 1,2, · · · , l using (29).

(7) Calculate the matrix Ω

N

using (25) and (27), we sup- pose that {ζ

0j

= 0 : j = 1, 2, · · · , l}.

(8) Calculate the update rule of the gradient search algo- rithm using (24) and kk +1.

(9) Perform the termination test for minimization, If true, the algorithm stops. Otherwise, return to step (2), i.e.

compute the values J

N

θ

k−1

and J

N

θ

k

using (11) and test if kJ

N

θ

k

−J

N

θ

k−1

k

2

is small enough.

7 Illustrative examples

In this section, first we compare the computational com- plexity of the direct gradient search and gradient search in the local parameterization of finite degree Volterra realiza- tion as a function of realization degree. At the same time, we corroborate that the method is able to identify the real- ization of finite degree Volterra series. Second, we consider two examples of nonlinear system, and we compare the ca- pability of finite degree Volterra realization to approximate the considered examples with some of state-of-the-art sys- tem identification methods for nonlinear systems.

In order to validate the obtained models, for a data of N samples, we have used T

id

=

N2

samples for the identification purpose, and the rest of them T

val

= T −T

id

=

N2

samples for the validation purpose. The validation step, can be viewed as an evaluation of prediction accuracy of the models, and in order to verify that the models are capable to extract the real output signal from the measured noised one, we con- sider the output signal without the measurement noise in the validation step. The model accuracy is defined as the Per- cent Variance Accounted For (%VAF )

%VAF , 1 − ∑

t=1N

(y

t

y ˆ

t

)

T

(y

t

y ˆ

t

)

Nt=1

(y

t

y) ¯

T

(y

t

y) ¯

!

× 100 where ˆ y

t

denotes the estimated output signal and ¯ y is the mean of the output signals.

7.1 Computational complexity

In this section, we figure out that using a local parameteriza- tion accelerates the convergence of gradient search method.

For that, we consider a finite degree Volterra series real- ization (4), where the states {Z

ti

: i = 1, 2, · · · ,l} have the same dimension which is equal to n = 10. The identifica- tion experiment for each value of the realization’s degree is repeated 10 times. Each one has a length of 4000 samples.

The average of optimization time is then computed.

The optimization time is the required time to reach a specific

precision (local optimum). Each experiment has three inputs

( u

t

∈ R

3

) which have been chosen to be uniform white noise

and three outputs ( y

t

∈ R

3

) . The measurement noise v

t

is a

Gaussian white noise scaled such that the signal to noise ra-

tio SNR = 10 dB. The initial estimation of the realization’s

vector of parameters (θ) is computed using the sequential

projection method explained in Section 5. The computation

time of sequential projection method is calculated and given

in Fig. 2. The optimization time as a function of realiza-

tion degree (l) for the direct gradient search method and the

gradient search in local parameterization space is given in

(10)

Fig. 1. The implementation of two approaches is done using MATLAB

TM

programming environment.

2 3 4 5 6 7 8

103 104 105 106

Realization degree

Optimization time (second)

Fig. 1. Optimization time for the direct gradient search method (cir- cle mark) and gradient search in local parameterization space (plus mark).

2 3 4 5 6 7 8

102 103 104

Realization’s degree

Computation time (second)

Fig. 2. Computation time of the sequential projection method.

It is clear from Fig. 1 that using local parameterization re- duces the time of computation. This reducing becomes ad- vantageous when the realization degree increases.

Table 1

Accuracy of the identified models

Identification Initial estimated Optimized

model model

Accuracy (%VAF) 82.3 ± 2.7 89.1± 1.8 Validation

Accuracy (%VAF) 86.9 ± 1.3 98.6 ± 0.7

In order to corroborate that the proposed method is able to identify the finite Volterra series efficiently, we calculate the average and the standard deviation of the accuracies of the obtained models. The accuracies are reported in Table 1 where the initial estimated model is the model obtained by the sequential projection explained in Section 5 and the optimized model is that obtained by local gradient search in local parameterization space. These accuracies demonstrate

that the proposed method has efficiently identified the fi- nite degree Volterra realization and the sequential projection method provides a good initial guess for the optimization process.

7.2 First example of nonlinear system Consider the following example

x

1t

=0.7 x

1t−1

+ u

t

, x

10

= 0 x

2t

=0.7 x

2t−1

+ x

t1

2

+ x

1t

+u

t

, x

20

= 0 y

t

= x

t1

2

+ x

t2

+ v

t

(47)

The input signal u

t

∈ R is chosen to be uniform white noise with standard deviation is equal to 0.5 and zero mean, its length is equal to 1000 samples. The measurement noise v

t

is a Gaussian white noise scaled such that, the signal to noise ratio SNR = 10 dB. We compare the performance of finite Volterra series realization with neural network based nonlinear system identification, which is a popular approach (Billings et al., 1992; Sjoberg et al., 1994; Nørgaard et al., 2000). For comparison, we have chosen two nonlinear model structures

(1) Neural Network Output Error (NNOE) model with 10 hidden hyperbolic tangent units in the hidden layer.

The output function generated by the neural network can be calculated as follows

ϕ

t

= h ˆ

y

t−1

(θ) y ˆ

t−2

(θ) y ˆ

t−3

(θ ) u

t

u

t−1

u

t−2

i

T

ˆ

y

t

(θ ) = g

t

, θ)

(48) where ϕ

t

is a vector containing the regressors, θ is a vector containing the weights and g is the function realized by the neural network.

(2) Neural Network State Space Innovation Form (NNSSIF) model which determines a nonlinear state space model of the dynamic system

ˆ

x

t

(θ ) = g ( x ˆ

t−1

(θ), u

t

) ˆ

y

t

= C x ˆ

t

(θ ) (49) where ˆ x

t

(θ) is the state vector and its dimension is n

x

, g is the function realized by the neural network and C is a constant matrix.

In this example, the best performance was obtained by n

x

= 6 and neural network with 10 hidden hyperbolic tangent units in the hidden layer. The neural network is trained with the Levenberg-Marquardt method.

The implementation of two models (NNOE and NNSSIF) is done using Neural Network Based System Identification toolbox (Nørgaard, 2000).

Note that the dimensions of the states (Z

ti

: i = 1, 2) of the

quadratic system (Volterra series realization of degree two)

Références

Documents relatifs

To prove the optimal convergence of the standard finite element method, we shall construct a modification of the nodal interpolation operator replacing the standard

A finite volume method to solve the Navier-Stokes equations for incompressible flows on unstructured meshes. Cayré,

De plus , comme il y a env i ron deux fois plus de baguenaudiers à fleurs jaunes sur la station étudiée , il est patent que la probabilité qu ' un azuré les visite est

By dividing a reservoir mode! into relatively independent zones, it is possible to initialize the history matching problem by choosing the &#34;best&#34; realizations

• Alternative formulation for a good conditioning of the system matrix â Quasi optimal convergence.. • Necessary and sufficient geometrical conditions remain open â Inclusion

Combining space and time eXtended finite element methods gives a unified space–time dis- cretization and accurate results when space and/or time discontinuities have to be

The identification of modal parameters damping ratio (ζ) and natural frequency (ω n ) uses the underlying linear dynamics of the beam, obtained when input has low level of

Figure 3 shows the restriction on excitation forces applied in previous works. That is the point of the next section that will define the Green-Volterra kernels. Then, the method