• Aucun résultat trouvé

Comparison between unitary and causal approaches to backward adaptative transform coding of vectorial signals

N/A
N/A
Protected

Academic year: 2022

Partager "Comparison between unitary and causal approaches to backward adaptative transform coding of vectorial signals"

Copied!
4
0
0

Texte intégral

(1)

COMPARISON BETWEEN UNITARY AND CAUSAL APPROACHES TO BACKWARD ADAPTIVE TRANSFORM CODING OF VECTORIAL SIGNALS

David Mary, Dirk T.M. Slock

Institut Eur´ecom, 2229 route des Crˆetes, B.P. 193, 06904 Sophia Antipolis Cedex, FRANCE Tel: +33 4 9300 2916/2606; Fax: +33 4 9300 2627

e-mail:

f

mary,slock

g

@eurecom.fr

ABSTRACT

In a transform coding framework, we compare the optimal causal approach (LDU, Lower-Diagonal-Upper) to the optimal unitary approach (Karhunen-Loeve Transform, KLT). The criterion of merit used for this comparison is the coding gain, defined for a transfor- mationT as the ratio of the average distortion obtained with the identity transformation over the average distortion obtained with

T. Both transforms are known to yield the same gain when they are computed on the signal covariance matrixR. The purpose of this paper is to compare the behavior of these two transformations when the ideal transform coding scheme gets perturbed, that is, when only an estimateR+RofRis known. In this case, not only the transformation itself will be perturbated, but also the bit allocation mechanism. We compare the two approaches in two cases. Firstly,Ris caused by a quantization noise : the coding scheme is based on the statistics of the quantized data. We find that the coding gain in the unitary case is higher than in the causal case. In a second case,Rcorresponds to an estimation noise : the coding scheme is based on an estimate ofRbased on a fi- nite amount of available data. In this case, both causal and unitary approaches are strictly equivalent, because of the unimodularity and decorrelating properties of the transformations. Simulations results confirming the predicted behavior of the coding gains with perturbations are reported.

1. INTRODUCTION

Consider a stationary Gaussian vectorial sourcefXg. This source may be composed of any scalar sourcesfxi

g, for example au- dio signals. In the classical transform coding framework, a lin- ear transformationT is applied to each N-vectorXto produce an N-vectorY =TXwhose componentsyiare independently quan- tized using a scalar quantizerQi. A number of bitsriis attributed to eachQi under the constraint

P

i r

i

= Nr. For an entropy constrained scalar quantizer of a Gaussian source, the high resolu- tion distortion is E(yiq

(k );yi(k )) 2

= 2

q

i

= c2

;2r

2

y

i

, where

c= e

6

.

An important property of commonly used transformations is that, if a noise (for example quantization noise) is added to the signalY, then its power will be the same in the transform and in the signal domains. This property is sometimes referred to as ”unity noise Eur´ecom’s research is partially supported by its industrial partners:

Ascom, Swisscom, Thales Communications, STMicroelectron ics, CEGE- TEL, Motorola, France T´el´ecom, Bouygues Telecom, Hitachi Europe Ltd.

and Texas Instruments.

gain” property [3]. The coding gain forTis then defined as

G

T

= Ek

~

Xk 2

(I)

Ek

~

Xk 2

(T)

= Ek

~

Xk 2

(I)

Ek

~

Yk 2

(T)

(1)

whereIis the identity matrix, and the notationkXk~ 2(T)denotes the variance of the quantization error on the vectorX, obtained for a transformationT. The optimal bit allocation yields the well known distortion for the vectorial signalfYg: EjjY~jj2T

= 1

N P

N

i=1

2

qi

=

Nc2

;2r

Q

N

i=1

2

y

i 1

N

=N 2

q.2q i

is independent ofi, and the number of bits assigned to the ithcomponent isr+1

2 l og2

2

y

i

Q

N

i=1

2

y

i

1

N

. In the next section, we recall the main characteristics of the opti- mal causal approach (LDU) when optimized onR, and summarize the reasons why its performance is the same as the best unitary ap- proach (KLT).

However, a backward adaptive coding scheme requires that nei- ther the transformation nor the parameters of the bit allocation are transmitted to the decoder. So suppose now that the coding scheme is based onRXXb =R+Rinstead ofR, whereRXXb is avail- able at both encoder and decoder. Then the computed transforma- tion will beT^=T+T, and the distortion will be proportional to the variances of the signals transformed by means ofT^instead ofT, say2y0

i

. Moreover, the bitsri should be attributed on the basis of estimates of the variances available at both encoder and decoder also, that is,(T^RXXb T^)ii, where(:)iidenotes the ithdi- agonal element of(:). Hence, we get the following measure of distortion for a transformationT^based onRbXX:

Ek

~

Yk 2

(

^

T)

=E N

X

i=1 c2

;2r + 1

2 log

2 (

^

T b

R

XX

^

T T

)

ii

( Q

N

i=1 (

^

T b

R

XX

^

T T

)

ii )

1

N ]

2

0

y

i : (2) where the expectation is w.r.tRin case it is non-deterministic. In the third section,we compare this distortion whenRis caused by a quantization noise : the coding scheme is based on the statistics of the quantized data, under high resolution assumption. In the fourth part,Rcorresponds to an estimation noise : the coding scheme is based on an estimate ofRdue to a finite amount ofK vectors :RbXX

= 1

K P

K

i=1 X

i X

T

i . The fifth part is dedicated to simulation results.

2. OPTIMAL CAUSAL AND UNITARY APPROACHES WITHOUT PERTURBATION

In the causal case,Y =LX =X;L X, whereLXis the ref- erence vector. The outputXqisYq+LX. Note that the recon-

(2)

struction errorX~equals the quantization errorY~ :

~

X=X;X q

=X;LX;Y q

=Y ;Y q

=

~

Y (3) as in the unitary case. As detailed in [2], the optimalLin terms of coding gain is such thatLRXX

L T

=diag f 2

y

1 :::

2

y

N gwhere

diag f:::grepresents a diagonal matrix whose elements are2y i

. In other words, the componentsyiare the prediction errors ofxi

with respect to the past values ofX, theX1:i;1, and the opti- mal coefficients;Li1:i;1are the optimal prediction coefficients.

Since each prediction erroryiis orthogonal to the subspaces gen- erated by theX1:i;1, theyiare orthogonal. It follows thatRXX=

L

;1

R

YY L

;T, which represents the LDU factorization ofRXX. Referring to (1), the coding gain without perturbation for the opti- mal causal transform can be written as

G (0)

L

= Ejj

~

Yjj 2

I

Ejj

~

Xjj 2

L

= Ejj

~

Xjj 2

I

Ejj

~

Yjj 2

L

=

det diag (R

XX )]

det diag (LR

XX L

T

)]

1

N

(4) wherediag (R)denotes here the diagonal matrix that corresponds to the diagonal of the matrixR. Now, using the unimodulariry of

L,det(diag (RYY))=det (RXX)=det, whereis the eigenvalue matrix ofRXX. The coding gain is

G (0)

L

=

det diag (RXX)]

det diag (LR

XX L

T

)]

1

N

=

det diag (RXX)]

det

1

N

=G (0)

V

(5) whereV denotes a KLT ofRXX. Thus, for an optimal bit al- location, the coding gains of the KLT and the LDU are the same without perturbation for three reasons : both transformations en- sure that the power of the quantization error is the same in the transform and in the signal domains, they are totally decorrelating transforms, and finally they are unimodular.

3. QUANTIZATION EFFECTS ON THE CODING GAINS Suppose we compute the transformation on the basis of quantized data. The statistics of the quantized data are assumed to be per- fectly known in this section. In other words, we assume that the decoder disposes of an infinite number of quantized vectorsXiq, and hence ofRX

q

X

q. Under the assumptions of high resolu- tion (uncorrelated white noise), optimal bit assignment and unity noise gain property of the transformation,R=EX~X~T =2q

I, where2q

= c2

;2r

Q

N

i=1

2

y

i 1

N. Thus, forT^ =IV^ andL^, we shall compute (the subscriptqrefers to quantization)

Ek

~

Yk 2

(

^

Tq )

= N

X

i=1 c2

;2r + 1

2 log2

(

^

TR

X q

X q

^

T T

)

ii

( Q

N

i=1 (

^

TR

X q

X q

^

T T

)

ii )

1

N ]

2

0

y

i (6)

3.1. Identity Transformation

In this case, the number of bits attributed to the quantizerQiis

r+ 1

2 l og2

(R

X q

X q)ii

( Q

N

i=1 (R

X q

X q

)

ii )

1

N

, and the variance2yi0 are indeed

(RXX)ii. Thus

Ek

~

Yk 2

(Iq )

= N

X

i=1 c2

;2r + 1

2 log

2 (R

X q

X q

)

ii

( Q

N

i=1 (R

X q

X q

)

ii )

1

N ]

(R

XX )

ii (7)

= N

X

c2

;2r

(detdiag fRX q

X q

g) 1

N (R

XX )

ii

(RX q

X q)ii

: (8)

One shows that

N

X

i=1 (R

XX )

ii

(R

X q

X q

)

ii

=tr f

;

I+ 2

q (diag R

XX )

;1

;1

g

wheretrdenotes the trace operator, and

det(diag RX q

X q)

=det(diag RXX)det (I+

2

q

(diag RXX)

;1

)

and we find

Ek

~

Yk 2

(Iq )

= Ek

~

Yk 2

(I) 1

N

(det(I+ 2

q (diag R

XX )

;1

)) 1

N

tr f

;

I+ 2

q (diag R

XX )

;1

;1

g:

(9) The distortion is slightly increased because the bits allocated on the basis of variances of quantized signals are not the optimal ones (the variance of the quantization noise is not equal in each branchi).

An approximation of (9) up to the second order of the perturbation gives

Ek

~

Yk 2

(Iq )

=c2

;2r

(detdiag fRXXg) 1=N

( N

i=1 (

2

q

(R

XX )

ii ))

1=N P

N

i=1 (1+

1

(R

XX )

ii )

;1

Ek

~

Yk 2

(I) (1+

4

q

N 2

( N;1

2 P

N

i=1 1

(R

XX )

2

ii

; P

N

i=1 P

j >i

1

(R

XX )

ii (R

XX )

j j ))

(10)

3.2. KLT

As observed in [4] also, ifVdenotes a KLT ofRXX, thenV(RXX+

2

q I)V

T

=+ 2

q I=

q, andV is also a KLT ofRXX+q2 I. Thus, the perturbation term q2

I onRXX does not change the backward adapted transformation, and the variances of the trans- formed signals remain unchanged : 2

0

y

i

=i. However, the de- coder estimates the variances(VRX

q

X q

V T

)

ii

=

i +

2

q, on the basis of which the coder assigns the bitsri. Thus, the actual distortion is

Ek

~

Yk 2

(Vq )

= N

X

i=1 c2

;2r + 1

2 log

2 (VR

X q

X qV

T

)

ii

( Q

N

i=1 (VR

X q

X qV

T

)

ii )

1

N ]

(VR

XX V

T

)

ii

(11) wich can be computed in a similar way as in 3.1. We find

Ek

~

Yk 2

(Kq )

=Ek

~

Yk 2

(K) 1

N

(det(I+ 2

q (

;1

))) 1

N

tr f

;

I+ 2

q (

;1

)

;1

g:

(12) Again, the increase in distortion comes from the perturbation oc- curing on the bit allocation mechanism. Up to the second order of perturbation, an expression similar to (10) is

Ek

~

Yk 2

(Kq )

=c2

;2r

(detdiag fRXXg) 1=N

( N

i=1 (

2

q

i ))

1=N P

N

i=1 (1+

1

i )

;1

Ek

~

Yk 2

(K)

1+

4

q

N 2

( N;1

2 P

N

i=1 1

(

i )

2

; P

N

i=1 P

j >i 1

i

j )

(13) The corresponding expression for the coding gain is

G

Kq

=G 0

(det(I+ 2

q (diag R

XX )

;1

)) 1

N

tr f

;

I+ 2

q (diag R

XX )

;1

;1

g

(det(I+ 2

q (

;1

))) 1

N

tr f

;

I+ 2

q (

;1

)

;1

g

(14) whose second order approximation is

GKqG 0

1+

4

q

N 2

( N;1

2 P

N

i=1 (

1

(R

XX )

2

ii

; 1

(

i )

2 )

; P

N

i=1 P

j >i (

1

(R

XX )

ii (R

XX )

j j

; 1

i

j ))]:

(15)

(3)

3.3. LDU

In the causal case, the coder uses a transformationL0 such that

L 0

RX q

X qL

0

T

= R 0

YY. R0YY is the diagonal matrix of the es- timated variances involved in the bit allocation (L0andR0YY are both available to the decoder). In this case, the difference vector

Y = X;L 0

X

q, the quantization noise is filtered by the rows ofL0. Note thatEkXk~ 2L

0

q still equalsEkY~k2L 0

q, sinceX~ =

X q

;X=Y q

+L 0

X q

;X=Y q

;(X;L 0

X q

)=Y q

;Y =

~

Y. One shows that the actual variances of the signalsyiobtained with

L

0are(L0RX q

X qL

0

T

; 2

q I)

ii[2]. In this case, one finds

Ek

~

Yk 2

(L 0

q )

= P

N

i=1 c2

;2r + 1

2 log

2 (L

0

R

X q

X qL

0

T

)

ii

( Q

N

i=1 (L

0

R

X q

X qL

0

T

)

ii )

1

N ]

(L 0

RX q

X q

L 0

T

; 2

q I)ii

=Ek

~

Yk 2

(L) 1

N

(det(I+ 2

q (

;1

))) 1

N

tr f

I+ 2

q (R

0

;1

YY )

g:

(16) The increase in distortion comes not only from the perturbation occuring on the bit allocation mechanism but also from the filtering of the quantization noise. Up to the first order of perturbation,we obtain

Ek

~

Yk 2

(L 0

q ) Ek

~

Yk 2

(K)

"

1+

2

q

N (

N

X

i=1 1

i

; 1

2

y

i )

#

(17) whereN

]

(RXX)ii

Ek

~

Yk 2

(I) 1+ E

N;1

2N 2

P

N

i=1 (

(R)

ii

(R

XX )

ii )

2

;E 1

N 2

P

i P

j >i (R)

ii

(R

XX )

ii (R)

j j

(R

XX )

j j ]

(21) The expectation of the first term in (21) is

E N;1

2N 2

N

X

i=1

(R)ii

(R

XX )

ii

2

= N;1

2N 2

N

X

i=1

2(RXX) 2

ii

K(R

XX )

2

ii

= N;1

NK :

(22) The expectation of the second term is

E 1

N 2

P

i P

j >i (R)

ii

(R

XX )

ii (R)

j j

(R

XX )

j j

2

k P

i P

j >i (R

XX )

2

ij

(R

XX )

ii (R

XX )

j j

2

k k.

(diag fRXXg) 1=2

RXX(diag fRXXg) 1=2

k 2

F

(23) where.(A)denotes the stritcly lower triangular matrix made with the strictly lower trianguler part ofA, andkAk2Fthe squared Frobe- nius norm ofA. IfD=diag fRXX

g, we obtain

E 1

N 2

P

i P

j >i (R)

ii

(R

XX )

ii (R)

j j

(R

XX )

j j

1

K (kD

; 1

2

RXXD

; 1

2

k 2

F

;kdiag fD

; 1

2

RXXD

; 1

2

k 2

F )

= 1

K

(tr fRXXD

;1

RXXD

;1

g):

(24) Finally, the expected distortion for Identity with estimation noise is, for sufficiently highK,

Ek

~

Yk 2

(IK) Ek

~

Yk 2

(I)

1+ 1

K (1;

tr fRXXD

;1

RXXD

;1

N 2

g)

:

(25) 4.2. KLT

In the unitary case, the expected distortion is

Ek

~

Yk 2

(

^

Vk )

=E N

X

i=1 c2

;2r + 1

2 log

2 (

^

V b

R

XX

^

V K

)

ii

( Q

N

i=1 (

^

V b

R

XX

^

V T

)

ii )

1

N ]

(

^

VRXX

^

V T

)ii:

(26) Using the fact thatV^RXXb V^T is diagonal, we can write (26) as

Ek

~

Yk 2

(

^

VK)

=Ec2

;2r

det

^

V b

RXX

^

V

1

N N

X

i=1 (

^

VRXX

^

V T

)ii

(

^

V b

R

XX

^

V T

)

ii

(27)

(4)

Because of the unimodularity ofV^, the determinant in (27) may be written as

det

^

V b

RXX

^

V T

= det (RXX+R)

= (detRXX)det (I+R

;1

XX R)

(28) The sum in (27) may be written as

P

N

i=1 (

^

VR

XX

^

V T

)

ii

(

^

V b

R

XX

^

V T

)ii

=tr f(

^

V b

RXX

^

V T

)

; 1

2

^

VRXX

^

V T

(

^

V b

RXX

^

V T

)

; 1

2

g

=tr f(I+R

;1

XX R)

;1

g

(29) Thus, (27) is equivalent to

Ek

~

Yk 2

(

^

VK)

=Ek

~

Yk 2

(K) 1

N

E(det(I+R

;1

XX R))

1

N

tr f

;

I+R

;1

XX R

;1

g:

(30) Using similar developments as in the previous section, the ex- pected distortion for the KLT when the transformation is based onkvectors is finally, under high resolution assumption

Ek

~

Yk 2

(

^

Vk ) Ek

~

Yk 2

(K)

1+ N;1

2K +

N;1

NK

: (31) The associated coding gain is,whereD=diag fRXXg,

G

^

VK

= Ek

~

Yk 2

(IK)

Ek

~

Yk 2

(

^

VK) G

0

1; 1

KN 2

tr fRD

;1

RD

;1

g;

N;1

2K +

1

NK

:

(32) 4.3. LDU

As stated in the introduction of this section, the expected distortion withL^computed onRXXb is

Ek

~

Yk 2

(

^

LK)

=E P

N

i=1 c2

;2r + 1

2 log

2 (

^

L b

R

XX

^

L T

)

ii

( Q

N

i=1 (

^

L b

R

XX

^

LT)

ii )

1

N ]

(

^

LR

XX

^

L T

)

ii

=Ec2

;2r

det

^

V b

RXX

^

V

1

N

| {z }

=det b

R

XX

=det(R

XX )det(I+R

;1

XX R)

N

X

i=1 (

^

LRXX

^

L T

)ii

(

^

L b

R

XX

^

L T

)

ii

| {z }

=tr f(I+R

;1

XX R)

;1

g

=Ek

~

Yk 2

(

^

VK)

(33) where the equality concerning the determinants comes from the unimodularity of the transformationsL^andV^. The equality con- cerning the trace comes from their decorrelating property. Thus, the distortion and coding gain with estimation noise are the same in the causal and the unitary cases, and are given up to the first order inKby (30) and (32).

5. SIMULATIONS

For the simulations, we used entropy constrained scalar quantiz- ersQi and real Gaussian i.i.d. vectors with covariance matrix

R

XX

= HR

AR1 H

T. RAR1 is the covariance matrix of a first order autoregressive process with normalized correlation coeffi- cient.His a diagonal matrix whose ithentry is(N;i+1)1=3 (decreasing variances).

In Figure 1, the coding gain with quantization noise is plotted for KLT (upper curves) and LDU (lower curves) with = 0:9,

N =4. The theoretical exact expressions are given by (14) and (18), and the approximated expressions by (15) and (19).

The coding gains in presence of estimation noise are compared for LDU and KLT in Figure 2, forN =4and=0:9(mean over

100realizations). The observed behaviors of the transformation corresponds quite well to the theoretically predicted ones forK a few tens.

2 2.5 3 3.5 4 4.5 5 5.5 6

3.2 3.25 3.3 3.35 3.4 3.45 3.5

Coding Gains with Quantization Noise −N=4 − Variances − AR1 − rho=.9

Rate (bit per sample)

Coding Gain over Identity

G0 Observed G LDU Observed G KLT Theoretical G LDU : exact Theoretical G1KLT : exact Theoretical G1 LDU : approx.

Theoretical G KLT : approx.

Fig. 1. Coding Gains vs rate in bit/sample.

100 101 102 103

0.5 1 1.5 2 2.5 3 3.5

Coding Gain for KLT and LDU with Estimation Noise − rho=0.9 − N=4 − 100 realizations

Number of Vectors

Coding Gain over Identity

G0 Theoretical Gain Observed Gain LDU Observed Gain KLT

Fig. 2. Gains for KLT and LDU with estimation noise

6. REFERENCES

[1] D. Mary, Dirk T.M. Slock, “Causal Transform Coding, Gener- alized MIMO Prediction and Application to Vectorial DPCM Coding of Multichannel Audio,” WASPAA01, New York, USA, October 2001.

[2] D. Mary, Dirk T.M. Slock, “Vectorial DPCM Coding and Ap- plication to Wideband Coding of Speech,” ICASSP 2001, Salt Lake City, USA, May 2001.

[3] S.-M. Phoong, Y.-P. Lin, “Prediction-Based Lower Triangular Transform,” IEEE trans. on Sig. Proc., vol. 48, no. 7, July 2000.

[4] V. Goyal, J. Zhuang, M. Vetterli, “ Transform Coding with Backward Adaptive Updates,” IEEE trans. on Inf. Theory, vol. 46, no. 4, July 2000.

Références

Documents relatifs