• Aucun résultat trouvé

Exact Arma Lattice Predictors From Autocorrelation Functions

N/A
N/A
Protected

Academic year: 2021

Partager "Exact Arma Lattice Predictors From Autocorrelation Functions"

Copied!
25
0
0

Texte intégral

(1)

HAL Id: hal-02916431

https://hal.laas.fr/hal-02916431

Submitted on 17 Aug 2020

HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.

Functions

André Monin, Gérard Salut

To cite this version:

André Monin, Gérard Salut. Exact Arma Lattice Predictors From Autocorrelation Functions. IEEE Transactions on Signal Processing, Institute of Electrical and Electronics Engineers, 1994, 42 (4), pp.877 - 886. �10.1109/78.285651�. �hal-02916431�

(2)

Exact Arma Lattice Predictors From Autocorrelation Functions.

André MONIN, Gérard SALUT.

Laboratoire d’Automatique et d’Analyse des Systèmes.

Centre National de la Recherche Scientifique.

7 Avenue du Colonel Roche 31077 TOULOUSE Cédex. FRANCE

Abstract:This paper derives an optimal linear -predictor of ARMA type in lattice form of arbitrarily fixed dimension for a process whose autocorrelation function is known. The algo- rithm preserves exact optimality at each step, as opposed to asymptotic convergence of more usual algorithms, at the expense of hereditary computation. Only the discrete time case is ex- amined. It is shown how the unnormalized (respectively normalized) lattice form may be re- duced to only 4n-2 parameters (respectively 2n+1) for a n-th order projection on the past. The normalization algorithm for the forward and backward residuals uses only scalar square root computations. Some examples are given which show the accuracy of this technique compared to those using the classical ARMA form for the predictor.

Keywords:Linear filtering, linear predictor, reduced filters, lattice form.

1. Introduction.

1.1. Objectives.

The paper concerns the problem of computing the exact minimum variance ARMA pre- dictor of n-th order, as a time varying lattice, from the autocorrelation function. Approximate procedures have been proposed in the past [1], [2], [3], [4], [5] to achieve the same goal, namely by trivially imbedding the past innovations into a 2-channel standard lattice problem. However, it has been shown by [1], as well as noticed in [5], that this is equivalent to the extended least- squares approximation [6], therefore with the same pitfalls (computed transient coefficients are not the true projection coefficients, and do not guarantee convergence to the optimal, in the gen- eral case. For the same reason, the predictor is always biased in the short- term transient.). More recent papers have used stochastic gradient approximations of higher order [7], [8], [9], [10].

Except the fact that they use data records while we use autocorrelations, the problem is indeed similar to ours. However, the aim is to asymptotically determine the stationary optimal predic- tor parameters, not to obtain the optimal time-varying parameters. The techniques developed are essentially based on extensions of the well-known stochastic gradient search. They are re- cursive but do not look for the optimum at each step. Moreover, some convergence problem may occur when, for example, the dimension of the unknown system is greater that the model [11]. It is obviously the case when the system is not finite dimensional, which is the case in the low-pass filter treated an illustration in 5.

To derive the exact ARMA lattice predictor, thus avoiding such pitfalls, is a more difficult task since the feedback parameters of the lattice lead to a non-linear implicit dependence or, equivalently, a growing memory for the computation of the time-varying optimal predictor pa-

L2

(3)

rameters. This is equivalent to the exact stochastic gradient (i.e. an infinite order approxima- tion). This difficulty was clearly identified in [1], [2] but was never fully investigated, to the author’s knowledge. We examine here this problem for a known autocorrelation function.

Our results, in this respect, are too-fold:

- A general algorithm gives the exact (hereditary) computation of the optimal n-th order ARMA predictor, in time-varying lattice form with zero initial state, for an arbitrary autocor- relation function. The linear dependence in the vector process (infinite Hankel rank) is being exploited to obtain the minimum number of parameters.

- In the finite Hankel rank case, the computations are shown to be finitely recursive, and completely solve the so-called realization problem under lattice form.

1.2. Contents.

Let , be a discrete-time scalar sequence of (mean square random variables) with zero mean. Assume that the only data available is the autocorrelation function of , not necessarily with a finite Hankel rank. One can derive, as in [12], the reduced optimal n-th order non-stationary linear predictor of (in the minimum variance sense), under its observable canonical state-space form or, equivalently, its input-output ARMA form:

(1.1) The optimal parameters are obtained by minimizing the prediction variance error ( ) with no stationary assumption but explicit state-dimension constraint on the predictor, as shown in [12]. Otherwise, the state variable has to be reconstructed for the whole time horizon, at each step, leading to an infinite-dimensional state-space as in ARMA identification [1].

In such a predictor, computing the values of requires the inversion of a dimensional system at each step. Despite the possibility of recursive inversion, as in [1], the canonical state-space form is generally not tractable for higher dimensions. It is in fact well known that numerical computations are very sensitive with respect to the coefficients of the characteristic polynomial.

Recall that the optimization of leads to the orthogonality equations as:

(1.2) Clearly, that equations (1.2) represent the projection of on the random variables

and with as inner product. We write this projection:

(1.3) The purpose of this paper is to exhibit an orthogonal basis of the linear space spanned by the random variables in order to simplify the projection formula and to im- prove its numerical accuracy. This orthogonal basis induces a lattice type filter but without the independent modular structure, which is well known to be impossible in the ARMA case. Note that this problem has been studied in [1], [2] in the context of the ARMA identification, but with the extended least squares approximate.

y y

( )

yt,tN

{ } L2p( )Ω

yt

zt = yˆt t1 yt

zt atizti

i=1 n

+btiyti

=

Jt = E[(ytzt)2]

ati,bti { }in=1 2n×2n

Jt

t0,∀i = 1n1,E[(ytzt)yti] = 0 E[(ytzt)zti] = 0 yt

yti

{ }in=1 {zti}in=1 (x y) = E xy[ ]

zt = P y[ t yt1,…, ytn,zt1,…,ztn] yti,zti

{ }in=1

(4)

The main contribution here is as follows:

- We first exhibit the optimal n-th order predictor (reduced dimension) under the unnor- malized lattice form using4n-2parameters ( -dimensional state-space) and a complete algo- rithm is provided which uses hereditary computation on the autocorrelation function .

- We then introduce the normalized form using2n+1 parameters with its algorithm at- tached. The problem of extracting square roots of the normalization matrices is solved using only scalar square root computation because of its particular form.

- We also show that such a projection technique (i.e. onto a linear functional of the past with internal memory constraint) yields an associated innovation whose “whiteness” has the same restricted meaning: orthogonality to an n-dimensional sub-space of the past. It yields a linear model of with “n-th order white noise”, whose approximation properties are illustrated through applications.

The applications concern the most useful case, and deal with autocorrelation function of infinite Hankel rank (low-pass and fractional order filters), one of which is well known for its sensitivity problems. By converting the role of and , it must be noticed that the algorithm gives the exact n-th order best -approximate with minimum phase, an open problem in the literature on optimum systems approximations [13].

2. The orthogonalisation formulas.

Let with inner and outer products written as usual and

.

The linear projection of a vectorXof on the linear product space spanned by vec- tors of is obtained by solving the following dimensional linear system as:

(2.1) where are matrices. The projection takes then the following form:

(2.2) The orthogonal projection of a vectorXof on a scalar random variable of leads to:

, i.e. . (2.3)

In the same way, one may compute the orthogonal projection of a scalar of on a vector of . One finds:

(2.4) We now recall some properties that will be used further to reduce the number of param- eters of the predictor in lattice form.

2n

ytyτ

y

y y˜ L2p

X Y, ∈(L2P)n E X[ TY]

E X Y[ T]

LP2 ( )n

Y1,…,Yp (LP2)n n× p

i 1…p E X Y, [ iT] KjE Y[ jYiT]

j=1 p

= =

Ki n×n

P X Y[ 1,…,Yp] KiYi

i=1 p

= L2P

( )n z LP2

E[(XP X z[ ])z] = 0 P X z[ ] E Xz[ ] E z[ ]2 ---z

=

z LP2 Y (L2P)n

P z Y[ ] = E zY[ T](E Y Y[ T])1Y

(5)

Proposition 1. Let X and Y be vectors of . Write and . If

and , then .❏

Proposition 2. Let be orthogonal vectors of , that is ,

. Then the projection of a vector X of on the space spanned by

is: .❏

3. Unormalized lattice form.

Recall that our purpose is to compute the n-th order linear predictor as a projection of the observation signal on the n-th order memory of the past as in (1.3). This will be achieved in two steps. First we exhibit an orthogonal basis of the past using the Levinson formulas. We then reduce the dimension of the involved system using the degeneracy property of the projection space (clearly, is only n-dimensional). We then show how the computation of the predictor’s parameters is achieved knowing the autocorrela- tion function of .

3.1. Forward and backward residuals formulas.

Define , where denotes the innovation in the sense of (1.1)-(1.3). It

is clear from (2.4) that the projection of on the 2n scalars can be equiva- lently computed as the projection of on then vectors . That is:

(3.1) The aim of this paragraph is to recall an orthogonalisation of the space spanned by

[1], using the Graham-Shmidt procedure. Let:

(3.2)

be the backward residuals of the family and L2p

( )2 X x1

x2

= Y y1

y2

= E X y[ 1] = 0 E y[ 1y2] = 0 P X Y[ ] = P X y[ 2]

Z1,…Zp (L2P)ni, j = 1…n

ijE Z[ iZTj] = 0 (LP2)n Zi

{ }in=1 P X Z[ 1,…,Zp] = P X Z[ 1]+…+P X Z[ p]

zt yt

yti,zti { }in=1

yti,zti { }in=1

yt

Zt y˜t zt

= y˜t ytzt

yt {yti,zti}in=1 yt {Zti}in=1

zt = P y[ t Zt1,…,Ztn]

Zti { }in=1

Xt0 = Zt

Xt1 = Zt1P Z[ t1 Zt]

Xtn = ZtnP Z[ tn Ztn+1,…,Zt]









Zt,…,Ztn+1

{ }

(6)

(3.3)

be the forward ones.

We have then the following properties:

Property 1. is orthogonal, that is .

Property 2. is orthogonal.

Property 3.

.

Property 4. .

One can then rewrite the relations (3.2) and (3.3) as follows:

and (3.4)

According to the last equation of system (3.3), the first component of is equal to while the second one is equal to zero. In fact, a straightforward calculation leads to:

. (3.5)

But, according to (1.2), one has:

. (3.6)

On the other hand:

, (3.7)

since . (3.8)

From the definition of the projection, the linear predictor becomes:

(3.9) The implicit aspect of this formula can be eliminated by using the orthogonality of the innovation with the past . It leads to an explicit expression of the predictor, gi-ven the statistics of the observation signal:

Zt0 = Zt

Zt1 = ZtP Z[ t Zt1]

Ztn = ZtP Z[ t Ztn,…,Zt1]









Xt0,…, Xtn1

{ } ∀ij E X, [ ti(Xtj)T] = 0 Zt0n+1,…,Ztn1

{ }

i = 1n1,{Zt,…,Zti}sp = {Xto,…,Xti}sp

i = 1…n1,{Zt,…,Zti}sp = {Ztoi,…,Zti}sp

Xt0 = Zt

Xt1 = Xt01P X[ t01 Zt0]

Xtn = Xtn11P X[ tn11 Ztn1]









Zt0 = Zt

Zt1 = Zt0P Z[ t0 Xto1]

Ztn = Ztn1P Z[ tn1 Xtn11]









Ztn y˜t 1 0 Zt

n = y˜tP yt Zt1,…,Ztn]

P yt Zt1,…,Ztn] = 0

0 1 Zt

n = ztP z[ t Zt1,…,Ztn]

P z[ t Zt1,…,Ztn] = P y[ t Zt1,…,Ztn] = zt

zt = P z[ t Xt01]+…+P z[ t Xtn11] y˜t Xt01,…,Xtn11

(7)

(3.10) Remark:As was noticed by Benveniste in [2], the vector is degenerated because it only de- pends on the scalar sequence . If we examine the relations (3.4), it appears at first that the lattice predictor needs2nmatrices as parameters, that is8nparameters. In fact, because of the degeneracy of vector , the system can be reduced to only4n-2 parameters as will be shown in the next section.

3.2. Reduction of the number of parameters.

Proposition 3.

,

where is the second

component of .❑

Proof: Recall that, according to (3.3), one has:

Let us examine the first component of these vectors. One has

. (3.11)

because of the orthogonality relations (1.2). Let us then check the hypothesis of Proposition 1:

(3.12) But according to Property 4, one has:

(3.13) Applying then the Proposition 2, we have:

. (3.14) The orthogonality of the family implies that is orthogonal

to , so that . As a consequence, it is

clear that

(3.15) On the other hand:

. (3.16)

By linear combination of equations (1.2) using the equality (1.1), one can easily derive that . But is also orthogonal to , according to (1.2). It leads to:

zt = P y[ t Xt01]+…+P y[ t Xtn11] Zt yt

2×2 Zt

i = 0…n1 P X[ ti1 Zti] = P X[ ti1 zti] zti Zti

Zti = ZtP Z[ t Zt1,…,Zti]

1 0 Zt

i = y˜tP yt Zt1,…,Zti] = y˜t

E X[ ti1(Zti)T] 1

0 = E ytZt1i]–E ytP Z[ t1i Zti,…,Zt1]]

P Z[ t1i Zti,…,Zt1] = P Z[ t1i Zt0i,…,Zti11]

P Z[ t1i Zti,…,Zt1] = P Z[ t1i Zt0i]+…+P Z[ t1i Zti11] Zti,Zti11,…,Zt0i

{ } Zti

Zti11,…,Zt0i

{ } E ytZt0i] = … = E ytZti11] = 0

i 0…n1 E X[ ti1(Zti)T] 1

, 0 0

= =

1 0 E Zt i(Zti)T

[ ] 0

1 = E yt(ztP z[ t Zt1,…,Zti])]

E ytzt] = 0 y˜t {Zti}in=1

(8)

(3.17) Applying then Proposition 1, the result follows. ■

A useful consequence of this proposition is that the backward and forward residuals of system (3.4) need only the second component of vectors . The residuals may there- fore be computed using only4n-2parameters by the following system.

and (3.18)

Recall that according to (3.9), . It leads to the following lattice structure of com- putation:

and (3.19)

where the output of the predictor is computed as:

(3.20)

According to (4) and (5), the parameters and are determined as follows:

(3.21)

The figure 1 illustrates the structure obtained.

i 0…n1 1 0 E Zt i(Zti)T

[ ] 0

, 1 0

= =

Zti { }in=10

zt0 = zt

zt1 = zt0P y[ t Xt01]

ztn = ztn1P y[ t Xtn11]









Xt0 = Zt

Xt1 = Xt01P X[ t01 zt0]

Xtn1 = Xtn12P X[ tn12 ztn2]









ztn = 0

zt0 = zt

zt1 = zt0–(kt0)TXto1

ztn1 = ztn2–(ktn2)TXtn12 0 = ztn1–(ktn1)TXtn11











Xt0 Zt y˜t zt

= =

Xt1 = Xt01qt0zt0

Xtn1 = Xtn12qtn2ztn2











zt ( )kti TXti1

i=0 n1

=

kti

{ }in=01 { }qti in=02

i 0…n1

kti = (E X[ ti1(Xti1)T])1E X[ ti1yt] qti E X[ ti1yt]

E z[( )ti 2] ---

=







 ,

=

(9)

3.3. Off-line computation of parameters.

We now show how the above parameters are recursively computed, in the general case, (i.e. infinite Hankel rank for the process ) by a growing dimension system (as in [12]). In such a case, the algorithm needs no particular dimensional caution, but only a stationary conver- gence test. It should be noted that, in case of singular initialization, the first steps call for a par- ticular formula. This will be developed further.

When the process has a finite recursion model of order (hankel rank), but , it is shown how the corresponding (reduced) predictor’s coefficients are themselves generated by finite recursion formulas.

3.3.1 The general algorithm.

First, let us examine the quantities to be computed to derive the lattice parameters defined in (3.21). For simplicity, we write the expectation of X.Define . The first system of (3.19) permits us to evaluate as:

(3.22) As a consequence, according to the Property 1, . One then simplify the formulas (3.21) as follows:

(3.23)

It appears then that there are two kinds of quantities to be computed: the finitely realiz- able one, and the hereditary one as . Note also and are finitely realizable.

•Finitely recursive computations.

Recall that, at each step , and are assumed to be known, so are . +

-

- y + -

t

zt

X1 z1

q0 k0

k0

+

+ -

- X2 z2

q1 k1

k1

z-1 +

zn2

- Xn1

kn2

z-1 +

qn2

kn1

z-1

++ +

+ +

+ z0

X0 z-1

Figure 1 - Unnormalized Lattice Structure.

y

y m m>n

X = E X[ ] Mti = Xti(Xti)T zti

i 0…n1,zti (ktj)TXtj1

j=i n1

= =

Xti1zti = Mti1kti

i = 0…n1,kti = (Mti1)1Xti1yt

i = 0…n1,qti = Xti1yt⁄( )zti 2





Xti1yt Mti ( )zti 2

t Mti Xti1yt {kti,qti}in=01

(10)

The next value of is computed from system (3.21) as follows:

(3.24)

But according to (3.22) and because of the orthogonality of the family , one has:

(3.25) The system above may then be transform in:

(3.26)

In a similarly way, using system (3.19), the matrices may be recursively computed as:

(3.27)

Using (3.25) a second time, one has:

(3.28)

Note that is well defined at step because is assumed to be given data in the prob- lem and is computed from (3.24).

Hereditary computations (for infinite Hankel rank of ).

We use such an expression in the usual sense of non-finitely recursive computation (as a function, here, of the time horizon of the autocorrelation function ).

Although are recursive quantities, the expectation is not so. In fact, its computation needs all the parameters from the beginning of the algorithm.

System (3.18) allows us to derive the following formula:

zti ( )2

ztn1

( )2 = (ktn1)TXtn11(Xtn11)Tktn1 zti

( )2 = (zti+1)2+2 k( )ti TXti1zti–( )kti TXti1(Xti1)Tkti,∀i = 0…n2





Xt01,…,Xtn11

{ }

Xti1zti = Xti1(Xti1)Tkti

ztn1

( )2 = (ktn1)TMtn11ktn1 zti

( )2 = (zti+1)2+( )kti TMti1kti,∀i = 0…n2





Mti

i1 X, ti1(Xti1)T = Xti11(Xti11)Tqti(Xti11)Tzti1Xti11zti1(qti1)T qti1(zti1)2(qti1)T

+

Mti = Mti11qti1(kti1)TMti11,∀i1

Mt0 ( )yt 2–( )zt0 2 0 0 ( )zt0 2

=









Mt0 t yt2

zt0 ( )2

y

ytyτ

Xti

{ }in=01 Xti1yt

kτi,qτi,

τ

t { }in=01

(11)

(3.29)

It appears then that a second hereditary quantity is to be computed, that is . A sim- ilar computation leads to:

(3.30)

where the autocorrelation function of appears as the input of the hereditary recursion (3.29)- (3.30). The figure 2 presents the sequence of computation.

3.3.2 Additional formulas for singular initialization.

Without a priori information on the past, the initial estimator at is the a priori ex- pectation of the stochastic process to be estimated, so that . For a zero mean process, this value is zero.

At the first step( ), the only relevant data is since no information on past values of the predictor is available. As a consequence, the projection of on the past is and only the first components of and are determined, since is singu- lar.

At the second step ( ), one may then project on . This yields the matrix non-singular but not . Therefore, and are well defined as is regular.

At step , and are well defined but the first components of and are de-

0

τ

t1

Xτ0yt yτytzτ0yt zτ0yt

=

Xτiyt = Xτi11ytqτi1zτi1yt,∀i = 1…n1









,

zτi1yt

0

τ

t1 zτn1yt = (kτn1)TXτn11yt

zτiyt = zτi+1yt+(kτi)TXτi1yt,∀i = 0n2





 ,

yt

Initialization

Computation of kto ktn1

, ,

Computation of zt

( )o 2, , (ztn1)2 (20)

Computation of qt o qt

n2

, , (17)

Computation of M t1

o M

t1

n1

, , (21)

Computation of X t1

o y

t X

t1

n1

yt

, , (22,23)

t = 0

(17)

tt+1

Figure 2 - Unnormalized Lattice Algorithm.

t = 0 z0 = y0

t = 1 y0

y1

z1 = P y[ 1 y0] k10 q10 M00

t = 2 y2 {y1,z1}

M10 M11 k20 q20 M10

t = 3 k30 q30 k31 q31

Références

Documents relatifs

Vergne, Residue formulae, vector partition functions and lattice points in rational polytopes, J.. Vergne, Arrangement of hyperplanes I Rational functions and Jeffrey–Kirwan

The aims of this article are: 1 To give two alternative characterizations of the facial weak order see Theorem 3.14: one in terms of root inversion sets of parabolic cosets which

This means that at relatively small values of μ integer values of the surface charge are thermodynamically preferred and thus in the constant potential ensemble we should see

Then the closed ideals of L correspond exactly, via the dimension map, to polar subsets of the Dedekind-complete lattice- ordered group K 0 (L). This leads to methods which can

We also show that they include the discrete Sugeno integral, which has been extensively studied and used in the setting of nonlinear aggregation and integra- tion.. Finally, we

In the case of the classical sylvester relation, we know that the Tamari lattice can be seen either as a sub-lattice of the right weak order (by taking the subset of maximal

in the study of the Ising model, many authors used this approach to calculate the critical behaviour of other models like generalized Ising models [2],

- Magnetostriction and thermal expansion measurements on CeAI, in and near the antiferroma- gnetic (AF) phase have established (1) the AF phase boundary in the H-T plane