• Aucun résultat trouvé

Linear precoding for spatial multiplexing MIMO systems: blind channel estimation aspects

N/A
N/A
Protected

Academic year: 2022

Partager "Linear precoding for spatial multiplexing MIMO systems: blind channel estimation aspects"

Copied!
5
0
0

Texte intégral

(1)

Linear Precoding for Spatial Multiplexing MIMO Systems:

Blind Channel Estimation Aspects

Abdelkader Medles, Dirk T.M. Slock

Eurecom Institute1 2229 route des Crˆetes, B.P. 193

06904 Sophia Antipolis Cedex FRANCE

Abstract - For the case of white uncorrelated inputs, most of the blind multichannel identification techniques are not very robust and only allow to estimate the channel up to a number of ambiguities, especially in the MIMO case.

On the other hand, all current standardized communica- tion systems employ some form of known inputs to allow channel estimation. The channel estimation performance in those cases can be optimized by a semiblind approach which exploits both training and blind information. When the inputs are colored and have sufficiently different spec- tra, the MIMO channel may become blindly identifiable up to one constant phase factor per input, and this under looser conditions on the channel. For the case of spatial multiplexing, possible cooperation between the channel in- puts allows for more complex MIMO source prefiltering that may allow blind MIMO channel identification up to just one global constant phase factor. We introduce semib- lind criteria that are motivated by the Gaussian ML ap- proach. They combine a training based weighted least- squares criterion with a blind criterion based on linear pre- diction. A variety of blind criteria are considered for the various cases of source coloring.

I. INTRODUCTION

The multichannel aspect has led to the development of a wealth of blind channel estimation techniques over the last decade. In this paper, blind identification shall mean channel identification on the basis of the second-order statistics of the received signal. Consider linear digital modulation over a lin- ear channel with additive Gaussian noise. Assume that we have

p

transmitters and

m > p

receiving channels (e.g. antennas in BLAST or SDMA). For more details on the used notation in this paper refer to [1].

II. BLINDIDENTIFICATION FORCOLOREDINPUTS

In this section we aim to improve the Second-Order Statis- tics (SOS) based blind channel identification by exploiting cor- relation in the inputs. In the context of digital communica- tions, the inputs are symbol sequences which are typically uncorrelated. Correlation can be introduced by linear con- volutive precoding, which corresponds to MIMO prefiltering of the actual vector sequence b

k

of symbols to be transmit- ted with a MIMO prefilter T

(z)

such that the transmitted vec- tor signal becomes a

k =

T

(q)

b

k

. In this paper we consider full rate linear precoding so that T

(z)

is a

p

p

square ma-

trix transfer function (in [2] an example of low rate precod- ing appears since the same symbol sequence gets distributed over all TX antennas). We get for the transmitted signal spec- trum

S

aa

(z) =

T

(z)S

bb

(z)

Ty

(z) = b

2T

(z)

Ty

(z)

and for

1Eurecom’s research is partially supported by its industrial partners: As- com, Swisscom, France T´el´ecom, La Fondation CEGETEL, Bouygues T´el´e- com, Thales, ST Microelectronics, Motorola, Hitachi Europe and Texas In- struments. The work reported here was also partially supported by the French RNRT project ERMITAGES.

the received signal spectrum

S

yy

(z) =

H

(z)S

aa

(z)

Hy

(z) + S

vv

(z) =

2

b

H

(z)

T

(z)

Ty

(z)

Hy

(z) +

2

v I m

. The choice of appropriate prefiltering, as we shall see below, may reduce the non-identifiability to a phase factor per source or even to a global phase factor. In the context of wireless communica- tions, two scenarios may be distinguished:

Noncooperative scenario: this scenario corresponds to the multi-user case (on the transmitter side) without cooperation between users. We shall consider the simple case in which the users transmit through only one antenna. This noncooperative scenario can also arise in other source separation applications since natural sources tend to have different spectra. In this sce- nario, H

(z)

has no structure, other than possibly being FIR, and T

(z)

and

S

aa

(z)

are diagonal. This scenario has been considered in [3],[4].

Cooperative scenario: this is the single-user spatial multiplex- ing case. In this case, since transmit antennas are near each other and also receive antennas are near each other, all (FIR) entries in H

(z)

have the same delay spread and hence are poly- nomial of the same order.

S

aa

(z)

is allowed to be nondiagonal.

In the noncooperative case the channel will tend to be ir- reducible, a characteristic we have assumed so far, due to the fact that the users tend to be spread out in space. In the spa- tial multiplexing case however, in which the TX antennas are essentially colocated, the irreducibility of the channel depends on the richness of the scattering environment. In general, we need to consider a reducible channel. Such a channel can be factored as H

(z) =

G

(z)

C

(z)

where G

(z)

is irreducible and column reduced with columns in order of e.g. non-increasing degree. If

r

is the (generic) rank of H

(z)

, then G

(z)

is

m

r

whereas C

(z)

is

r

p

. In the noncooperative scenario C

(z)

has

no particular structure. In the cooperative scenario however, all entries in a particular row of C

(z)

have the same degree and the degrees of the rows are non-decreasing (the degree profile of the rows in C

(z)

is complementary to the degree profile of the columns in G

(z)

).

If

r

m

,

1

, then

2

v

is blindly identifiable from

S

yy

(z)

and

G

(z)

is blindly identifiable from the signal/noise subspaces of

S

yy

(z)

up to a postmultiplication factor L

(z)

that is block lower triangular with block sizes according to the multiplici- ties of the degrees of the columns of G

(z)

and L

(z)

is also

polynomial with the degree of block

(i;j)

being the difference between the degrees of block

i

and block

j

of the columns of G

(z)

[5]. So in particular, the diagonal blocks of L

(z)

are con-

stant. Also, L,1

(z)

has the same polynomial structure as L

(z)

.

In the cooperative case, L,1

(z)

C

(z)

has the same polynomial structure as C

(z)

. If

r

m

,

1

, there are essentially no restric- tions on the number of inputs

p

for identifiability. If

r = m

,

identifiability of

v

2becomes an issue and there’s no longer a point in considering a factorization of H

(z)

for its identifica-

(2)

tion.

Finally, let us note that TX pulse shape filters can be incor- porated in T

(z)

or

S

aa

(z)

and that oversampling at the RX also leads to an increase in the number of RX channels. Also, the formulation of complex quantities as a superposition of real quantities may lead to an extra MIMO dimension. In the next two sections we investigate channel identifiability with diago- nal or full prefiltering T

(z)

.

III. NONCOOPERATIVE/DIAGONALPREFILTERING

In general, we would like to handle the reducible channel case. The rank

r

can be identified from

S

yy

(z)

. If

r

m

,

1

, then we can denoise the SOS and identify the factor G

(z)

from the subspaces. G

(z)

is unique up to a factor L

(z)

. For

whichever G

(z)

in this equivalence class, it remains to identify C

(z)

in H

(z) =

G

(z)

C

(z)

from

S(z) =

G#

(z)(S

yy

(z)

,

2

v I m )

G#y

(z) =

C

(z)S

aa

(z)

Cy

(z)

(1) where G#

(z)

is a MMSE ZF equalizer for G

(z)

: G#

(z)

G

(z) = I r

. For

r = m

, the problem is similar to the one in (1) with C

(z)

replaced by H

(z)

(apart from the

2

v

identification is- sue which will be discussed below). The value of the rank

r

2 f

1;2;:::;min(m;p)

gis unpredictible in general. For a certain rank

r

, subsets of

r

,

1

columns of C

(z)

could be iden- tified jointly from

S(z)

using certain

S

aa

(z)

and under certain conditions on C

(z)

(or subsets of

r

columns under more strin- gent conditions on C

(z)

). So to be general,

S

aa

(z)

should

be such that it allows identifiability for the worst case of

r

,

which is

r = 1

. In that case, each column of C

(z)

needs to

be identified separately. On the other hand, since in the case

r = 1

each column of C

(z)

is a scalar FIR transfer function, only its minimum-phase equivalent is identifiable. So a col- umn would be truly identifiable only if it is minimum-phase.

To avoid having zeros would require to impose

r

2

. In any

case, to be fully general, it is desirable to have

S

aa

(z)

such that

it allows identifiability of each column of C

(z)

separately. So the MIMO problem gets converted into a set of disconnected SIMO (

r > 1

) or SISO (

r = 1

) problems. This will allow iden- tifiability of each column up to a constant phase factor of the form

e j

if the column has no maximum-phase zeros (which is quite possible if

r

2

but highly unlikely for

r = 1

). An-

other issue is the degree of C

j (z)

, column

j

of C

(z)

. We have

H

j (z) =

G

(z)

C

j (z)

. The degree of C

j (z)

is unpredictible and can be up to

N j

,

1

, the degree of the corresponding col- umn H

j (z)

of H

(z)

. For identifiability, we need to consider the worst case and hence we shall assume that the degree of C

j (z)

is

N j

,

1

. Of course, the

N j

themselves may be unpre- dictible and in practice need to be replaced by an upper bound.

We now consider two approaches for identification, leading to two classes of solutions for

S

aa

(z)

.

Frequency domain approach: The idea here is to introduce zeros into the diagonal elements of T

(z)

or hence

S

aa

(z)

such

that all other elements other than diagonal element

j

share

N j

zeros

T

jj (z) =

Y

p

i

=1

;

6=

j N

i

Y

k

=1

(1

,

z i;k z

,1

)

(2)

This allows identifiability of C

j (z)

from

S(z)

up to a phase since

S(z j;k ) =

C

j (z j;k )S a

j

a

j

(z j;k )

Cy

j (z j;k ) ; k = 1;:::;N j

(3)

where

S a

j

a

j

(z) =

2

b T jj (z)T jj

y

(z)

. If all

N j

are equal (to

N

1), then we can choose equispaced zeros on the unit cir- cle:

Q

N k

=1i

(1

,

z i;k z

,1

) = 1

,

e j

i

z

,

N

1. If we furthermore choose overall equispacing by taking

i = (i

,

1)2=pN

1,

then

T ii (z) =

1,1,

e

ji

z

,

z

pN,1N1. In [4] very similar work appears in which the degree of

S

aa

(z)

(in the case of equal

N j

) is at least

pN

1(compared to

(p

,

1)N

1here), but the discussion in [4] is limited to the case

m > p = 2

. Note that here we can easily allow

p > m

even (more inputs than outputs!). Remark also that by introducing a number of zeros in T

(z)

, we can further- more identify an equal number of noise parameters (such as

2

v

for instance when

r = m

). Non-FIR

S

aa

(z)

can be consid- ered also. For instance we can consider the case in which the

S a

j

a

j

(e j

2

f )

have at least

N j

disjoint expansion coefficients in some orthogonal basis. An extreme example of this would be

S a

j

a

j

(e j

2

f )

that are bandlimited with the

p

bands being non-overlapping.

Time domain approach: The idea here is to introduce delay in the prefilter so that the correlations of each C

j (z)

appear

separately in certain portions of the correlation sequence of

S(z)

. This can be obtained for instance with T

jj (z) = 1

,

j z

,

d

j

; d j =

X

j

,1

i

=1

N i :

(4)

Identification can be done with a correlation sequence peeling approach that starts with the last column C

p (z)

of which the (1-sided) correlation sequence appears in an isolated fashion in the last

N p

correlations of

S(z)

. Identification of C

p (z)

from

its correlation sequence can be done up to a phase factor

e j

p

(and up to the phase of zeros if C

p (z)

has zeros). We can then subtract

S a

j

a

j

(z)

C

p (z)

Cy

p (z)

(which does not require C

p (z)

but only its correlation sequence) from

S(z)

which will then reveal the correlation sequence of C

p

,1

(z)

in its last

N p

,1 correlations, etc. The degree of

S

aa

(z)

is in this case the de- gree

d p

of

S a

p

a

p

(z)

which, in the case of all equal

N j

, is again

(p

,

1)N

1, which leads to a degree of

pN

1,

1

for

S(z)

or

hence

pN

1correlations. Such a degree for

S

aa

(z)

is not only sufficient but also necessary since when

r = 1

, there are

pN

1

parameters to be identified for which indeed at least

pN

1cor-

relations are needed. Note that in the temporal approach, in- creasing all the delays

d j

with an amount

D

allows furthermore the (straightforward) identification of MA(

D

,

1

) noise (e.g.

D = 1

for white noise with arbitrary spatial correlation).

In practice, with estimated correlations, the correlation peel- ing approach leads to increasing estimation errors as the columns of C

(z)

get processed. This error increase can be avoided by doubling the delay separation between sources, which may furthermore lead to simpler algorithms (e.g. SIMO subspace fitting with asymmetric covariance matrices). Time domain approaches also appear in [3].

IV. COOPERATIVE/SPATIAL-MULTIPLEXING

PREFILTERING

Non-cooperative approaches can of course also be applied in the cooperative scenario, so diagonal prefiltering can be used for spatial multiplexing. However, this leads to at least a un- known phase per TX antenna and hence requires either differ- ential encoding or training symbols per TX antenna. By ap- plying full prefiltering, such that

S

aa

(z)

is not blockdiagonal

(3)

in which case it is said to be fully diverse, the channel may possibly be identified up to a global phase factor only. Since better identifiability results in this case, better estimation may possibly be another consequence. We consider here linear pre- coding by time-invariant MIMO prefiltering. In [6], a block precoding approach is considered.

We can work with the eigen or LDU decompositions of

S

aa

(z)

. To begin with, consider the eigendecomposition Saa

(z) =

V

(z)

D

(z)

Vy

(z)

where V

(z)

is paraunitary (i.e.

V

y

(

z

)

V

(

z

) =

I) and contains the eigenvectors as columns, and D

(z)

is diagonal with the diagonal elements, the eigenval- ues, being valid scalar spectra.

A paraunitary matrix V

(z)

is said to be full diverse, if

PV

(z)

Vy

(1)

P

T

cannot be made block diagonal for any per- mutationP.

Theorem 1: : An irreducible FIR MIMO channel is blindly identifiable up to a phase factor per user if

S

aa

(z)

has distinct eigen value functions, and up to one global phase factor if its eigenvector matrix is fully diverse.

It may perhaps be more practical to work with the LDU (Lower triangular-Diagonal-Upper triangular) decomposition Saa

(z) =

L

(z)

D

(

z

)

Ly

(

z

)

where L

(z)

is lower triangular with unit diagonal and the non-zero off-diagonal elements be- ing unconstrained transfer functions, and D

(z)

is diagonal with the diagonal elements being valid scalar spectra. The rela- tion between the LDU decomposition and the prefilter T

(z)

is immediate if T

(z)

is of the form T

(z) =

L

(z)(z)

where

(z)

is diagonal. An example of such a T

(z)

that allows irre- ducible channel identification up to one global phase factor is

(z) = I p

and T

(z) =

L

(z) = I p + D z

e ,1where

D

ehas only

non-zero elements on the first subdiagonal and those elements are all different constants.

Stationary precoding can be generalized to cyclostationary precoding via periodically timevariant prefiltering. By stack- ing

q

consecutive symbol period quantities y

k

, v

k

, a

k

, b

k

, we obtain Y

k

, V

k

, A

k

, B

k

. We can then introduce a

pq

pq

LTI

MIMO prefilter T

(z)

such that

Y

k

,V

k = (I q

H

(q))

A

k = (I q

H

(q))

T

(q)

B

k :

(5)

Cyclostationary prefiltering introduces more information, hence should allow improved estimation (and possibly avoid stationary noise).

V. GAUSSIAN ML SEMIBLINDCHANNELIDENTIFICATION

For Gaussian ML, which will allow to exploit the SOS, we model the unknown symbols as uncorrelated Gaussian vari- ables whereas the known symbols b

k

lead to a non-zero mean.

By neglecting the non-stationarity due to the known symbols, the Gaussian likelihood function can be written in the fre- quency domain:

H

[M lndet(S

yy

(z))+

(

y

(z)

,H

(z)

T

(z)

b

K (z))

y

S

yy,1

(z)(

y

(z)

,H

(z)

T

(z)

b

K (z))]

(6) where

H

is short for 1

2

j

H

dzz

and y

(z)

, b

K (z)

denote the

z

transforms of the signal of

M

samples y

k

and the known sym- bols b

k

. The gradient of this criterion is the same as the gradi- ent of the following sum of two subcriteria. The first subcrite- rion is

I

(

y

(z)

,H

(z)

T

(z)

b

K (z))

y

S

,1yy

(z)(

y

(z)

,H

(z)

T

(z)

b

K (z))

(7)

which is a weighted LS criterion (quadratic in H

(z)

) with the

training information. The second subcriterion is tr

I

f

S

yy,1

(z) S

eyy

(z)S

yy,1

(z) S

eyy

(z)

g (8)

where

S

eyy

(z) = S

yy

(z)

,

S

byy

(z)

,

S

byy

(z) = M

1y

(z)

y

(z)

y

(periodogram), and the gradient is taken by considering

S

yy,1

(z)

as constant. This second criterion is one of weighted spectrum matching and expresses the blind information. By taking the gradient of the sum of the two subcriteria, we com- bine training and blind information in an optimal fashion (com- pare to the CRB expression for GML). Asymptotically we can replace

S

yy,1

(z)

by a consistent estimate

S

b,1yy

(z)

such as the periodogram. Also, we can replace the periodogram by an AR model which matches the covariance sequence estimate ap- pearing implicitely in the periodogram (or asymptotically by a consistent AR model) such that P

(z) S

byy

(z)

Py

(z) = I

so

that

S

b,1yy

(z) =

Py

(z)

P

(z)

, where P

(z)

is the MIMO predic- tion error filter in which a square-root of the prediction error covariance matrix inverse has been absorbed. The blind sub- criterion then becomesI

P

(z)S

yy

(z)

Py

(z)

,

I m

2

F

(9) which is of fourth order in H

(z)

. One solution consists of inter- preting H1

(z)

and H2

(z)

in

S

yy

(z) =

H1

(z)S

aa

(z)

H2

(z) +

2

v I m

as different quantities and performing alternating opti- mizations between them (and

v

2).

VI. BLINDGML CHANNELIDENTIFICATION FOR AFLAT

CHANNEL

Here we shall focus on the blind identification part for a frequency flat channel H

=

G C with

r < m

. We can take G

= V

S, an orthonormal matrix spanning the signal subspace.

We shall estimate first

V

Sand then C.

Identification of the signal subspace

V

S:

We can alternatively estimate the noise subspace

V

N. Ide-

ally, R

L (I L

V

N

) = 0

where R

L

is the denoised covariance matrix of

L

symbol periods of y

k

. We shall estimate

V

N using

a weighted LS criterion

V

N:

V

NH

min V

N=

I

fm

vect(

,r Rb

L (I L

V

N

))

g

H

Wf

vect(

Rb

L (I L

V

N

))

g

(10) where Rb

L

is the sample covariance matrix, Rb

L =

R

L +

Re

L

. The optimal weighting is W

=

E

vect(

Rb

L (I L

V

N

))

gf

vect(

Rb

L (I L

V

N

))

g

H

. WithRb

L

based on

M

sam-

ples, we get Ef

vect(

Re

L )

gf

vect(

Re

L )

g

H = M

1R

TL

R

L

. This

allows us to work out the WLS criterion (10) to become

V

N:

V

NH

min V

N=

I

m,r

tr

f

V

N

H

ryy

(0)V

Ng (11)

where ryy

(0)

is the estimated correlation matrix of y

k

at lag

0

. The solution is clearly given by the noise subspace of the matrix in the middle, so that

V

S becomes its signal subspace.

Identification of C:

By equalizing

V

S, we get r

(i) = V S H

ryy

(i)V S =

C raa

(i)

C

H

. We introduce a normalization so that r,1

=

2

(0)

C r1aa

=

2

(0)

raa

(i)

r

H=

aa2

(0)

C

H

r,

H=

2

(0) =

r

(i)

(12)

(4)

for

i = 0;1;:::;L

, and where r

(i) =

r,1

=

2

(0)

r

(i)

r,

H=

2

(0)

and raa

(i) =

r,1aa

=

2

(0)

raa

(i)

r,aa

H=

2

(0)

. From

i = 0

, we ob-

serve that r,1

=

2

(0)

C r1aa

=

2

(0) = Q

for some matrix

Q

with

orthonormal rows, or hence C

=

r1

=

2

(0)Q

r,1aa

=

2

(0)

. To find

estimate

Q

, we shall assume

r = p

so that

Q

is square and uni- tary, and we shall solve (12), which becomes

Q

raa

(i)Q H =

r

(i)

, for

i = i;:::;L

in a least-squares sense:

Q

:trf

min Q

H

Q

g=

r L

X

i

=1

k

Q

raa

(i)

,r

(i)Q

k2

F

(13)

The solution of this problem involves an eigendecomposition and is unique in general up to a phase factor.

VII. PRECODER OPTIMIZATION

We focus here on the flat channel case, and we study the op- timization of the precoder to maximize the available capacity of the system, which is also the mutual information when the channel is estimated I

S

a a

(

y

;

bjHb

)

.

The ergodic capacity of an AWGN channel, when the chan- nel knowledge is absent at the transmitter and perfect at the receiver is given by:

C(S

a a

) =

EI

S

a a

(

y

;

bjH

)

=

EH

lndet(I +

1v2H

S

a a

(z)

H

H )

(14)

Where

S

aa

(z) =

P

L i

raa

(i)z

,

i

, the expectation E is here w.r.t. the distribution of the channel. As in [7], we as- sume the entries H

i;j

of the channel to be mutually inde- pendent, identically distributed zero mean complex Gaus- sian variables (Rayleigh flat fading MIMO channel model).

Telatar has shown [7] that for such a channel model, the op- timization of the capacity subject to the TX power constraint

H

tr

(S

a a

(z))

N tx

2

b

leads to the requirement of a white (and Gaussian) vector transmission signal

S

a a

(z) =

2

b I

.

For a block of length

M

(sufficiently large for the fre- quency domain expressions to be sufficiently accurate), and using a precoding

S

a a

(z) =

2

b (I + S(z))

(

<< 1

,

S(z) =

P

L i S i z

,

i

andH tr

(S(z)) =

tr

(S

0

) = 0

for

power constraint), the deviation of the available capacity for the whole block is:

= MC(

2

b I)

,

M

E I

S

aa

(

y

;

bjHb

)

=M(C( b

2

I)

,

C(S

a a

))

| {z }

1

+M(C(S

a a

)

,E I

S

aa

(

y

;

bjHb

))

| {z }

2

1 represents the deviation (decrease) of the capacity due to(15) the precoding, whereas

2represents the deviation caused by channel estimation errors.

Let

=

2b2v and

R = I +

H H

H

, then:

1

=

,

M

EH

lndet(I + R

,1H

S(z)

H

H

=

,

M

EH

[tr

f

R

,1H

S(z)

H

H

g

,

2

2

tr

f

R

,1H

S(z)

H

H R

,1H

S(z)

H

H

g

+ O(

3

)]

,

Mtr

fEH

H R

,1H

S

0g

+M

2

2E

tr

fHH

H R

,1H

S(z)

H

H R

,1H

S(z)

g

(16) We will show below that E H

H R

,1H is a multiple of the unity matrix

I

. In fact, for every permutation matrix

P

and

unitary diagonal matrix

D = diag(e j

1

;:::;e j

p

)

, H0

=

H

PD

has the same distribution as H, then E H

H R

,1H

= D

P H

E H

H R

,1H

PD

. By averaging over the set of permu- tations P and phases

[0;2] p

, we get:

E H

H R

,1H

=

(21

)p

Z

1 p!

X

P

2P

D

P H

EH

H R

,1H

PD d

1

:::d p

= tr

E HHR

p

,1H

I

(17) The first term in

1is,

M tr

EHH R

p

,1H

trS

0

= 0

.

1

=M

2

2P

i;j

E

tr

fH

z

,(

i

+

j

)H

H R

,1H

S i

H

H R

,1H

S j

g

=M

2

2P

i

E

tr

fHH

H R

,1H

S i

H

H R

,1H

S Hi

g

=M

2P

i vect H (S i )

W

vect(S i ):

(18) where W

=

2E

(

H

H R

,1H

) T

(

H

H R

,1H

)

,P

i

is done over all index values

i

for which

S i

6

= 0

and

H

H R

,1H

= (I + (

H

H

H

)

,1

)

,1then for high SNR i.e

>> 1

: W

I

.

As shown in [8], the deviation of the capacity for a small channel estimation error is

2

= ME

He2. To mini-

mize

2the channel estimator should be the MMSE estimator.

The MMSE estimator doesn’t achieve in general the Cramer- Rao Bound (

CRB

) but a good measure of the deviation is

2

= Mtr

E

CRB = Mtr

E

J HH

# , where

J HH

is the Fis- cher Information Matrix.

Subspace fitting gives an estimation of the channel Hb0 from the estimated correlation at lag 0 (see Section VI.) ryy

(0) =

H

S

0H

H + I

, which leads asymptotically to H0

=

H

S

01

=

2Q

H

. The identification of the unitary matrix Q is done from the remaining correlation lag, H0#ryy

(i)

H0#

H =

Q

S

0,1

=

2

S i S

0,

H=

2Q

H

; as a result we can assume without loss of generality that

S

0

= I

and hence

S

0

= 0

. The Mean

Square Error of the subspace estimator H0is of order

M

,1.

H

=

H0Q

=

Hb0

Q

b

|{z}

=

Hb

+

Hb0Qe

|{z}

/

g

(

)

=

p

M

+

He0Q

|{z}

/1

=

p

M

(19)

where

g()

is an increasing function of

,1. For large

M

and

small

we can neglect the last term, and assume thatHb0

=

H0.

The source of the channel error is then caused by the estimation error on Q. If X

= (

x1

;

x2

;:::;

x

n

p

)

is a real parameterization of the unitary matrix, i.e Q

=

Q

(

X

)

, then:

J

HH = @ @

Xh

JJ XX

@

X

@

h

T

(20) where h

= [Re(vect(

H

)) T ;Im(vect(

H

)) T ] T

and

JJ XX =

,EYjH0

@

ln

p

(YjX

;

H0)

@

X

@

ln

p

(YjX

;

H0)

@

X

T

. An asymptotic expression of

JJ XX

for large

M

is then :

JJ XX (i;j) = M

I

tr

f

S

yy,1

(z)@S

yy

(z)

@

x

i S

,1yy

(z)@S

yy

(z)

@

x

j

g

(21) By proceeding as for

1, we can show that:

(5)

JJ XX (i;j) = M

f

2X

k vect H (S k )W Ji;j vect(S k )+O(

3

)

g

(22) where

W Ji;j =

@

Q

@

xiQ

H

R

0,

T

R

0,1

@

Q

@

xjQ

and

R

0

= I + (

H0

H

H0

)

,1. We can note that for high SNR i.e

>> 1

:

R

0

I

. The decrease is now

2

=

,2

E

tr

J

M

HH2#

=

,2

E

tr

JJ M

XX2,1

@

X

@

h

T @

X

@

h

,1

. The minimization of the capacity lost

can be performed by evaluating the expectation, which is quite complex, and then optimizing the resulting cost function. An alternative ap- proach is to consider a close lower bound of

that is easier to calculate and to minimize. From the solution for

S(z)

,

we can then find the solution for the precoder T

(z)

. For a

precoder T

(z) = (1

,

2

)

1

=

2

I +

P

k M F

,1

F k z

,

k

, where

M F = (

P

i F i F i H )

1

=

2 verifies

M F M F H =

P

i F i F i H

and

F

0

= 0

, the power spectrum is:

S

a a

(z)= b

2f

I + (1

,

2

)

1

=

2P

k

,

M F

,1

F k + F

,

H k M F

,

H

z

,

k +

2,

M F

,1

F(z)F

y

(z)M F

,

H

,

I

g

(23) This spectrum verifies the power constraint as

S

0

= 0

. The

first order approximation is:

S(z) =

P

k

,

M F

,1

F k + F

,

H k M F

,

H

z

,

k

.

Example of optimization for

p = 2

:

We introduce the following parameterization of Q: X

= (;'; )

2

[0;]

[0;2]

2, that allows to determine Q up to a global phase factor:

Q

(;'; ) =

h

e j cos e j sin

,

e

,

j sin e

,

j cos

i

(24) We consider here the case of high SNR, i.e

>> 1

, in which

case

1

= 2M

2P

L

1jj

S i

jj2and

R

0,

T

R

0,1

= I

. To

optimize

, we fix

1 by fixingP

L

1jj

S i

jj2

= 1

, we op-

timize then

2 under this last constraint, and given the pre- coder solution, optimize finally the overall decrease

with

respect to the scale factor

. We will skip below several de- tails of the derivation due to lack of place. First we establish the following equalities:

@

X

@

h

#

= @ @

XThT

= @ @

QTXTv

@ @

QhTv, where

Q

v = [Re(vect(

Q

)) T ;Im(vect(

Q

)) T ] T

.

Considering now the independence between H0and

Q

we

can write:

2

=

,2

EX

tr

JJ M

XX2,1

@ @

QTXTv EH0

@

hT

@

Qv

@

hT

@

Qv

T

@

QTv

@

XT

T

=

,2

E

jj

H p

jj2

tr

EX

8

<

:

JJ M

XX2

@

QTv

@

XT

@

QTv

@

XT

T

!,19=

; ,1

=

,2

E

jj

H p

jj2

C(S)

(25) where EX (resp. EH0) denotes expectation with respect to X (resp. H0), and to obtain the second expression we used arguments similar to the ones used to prove that E H

H R

,1H

= tr

E HHR

p

,1H

I

and applied here to H0 (as- sumed to have the same distribution as H) in order to prove that: EH0

@

hT

@

Qv

@

hT

@

Qv

T

= E

jj

H p

jj2

I

. The main difficulty is to

minimize

C(S)

, which is possible directly but at high numer- ical cost. To avoid it, let’s first note that as result of the convex- ity character of the function

f(x) = 1=x

overR+:

C(S)

C

0

(S) = tr

8

<

:

EX

JJ M

XX2

@

QTv

@

XT

@

QTv

@

XT

T

!,19=

; ,1

, then for

S opt

optimum of

C

0,

C(S opt )

C opt

C opt

0 .

If

C

(

S

opt),

C

0opt

C

(

S

opt)

<< 1

then we can conclude that

S opt

minimizes

2. In our case

C

0

(S) =

14

m

5+1 mm12 +

m

2, where

m

1

=

P

L i

=1jj

S i (1;1)

,

S i (2;2)

jj2 and

m

2

=

P

L i

=1jj

S i (1;2)

jj2

+

jj

S i (2;1)

jj2, the optimal solution (

C opt

0

= 1:125

) verifies:

S i (2;2) =

,

S i (1;1); i = 1;:::;L

,

m

2

=

23, andP

L i

=1jj

S i (1;1)

jj2

+

jj

S i (1;1)

jj2

=

1

3

. One choice of a causal precoder with the smallest order L=1 that verifies these optimality conditions is:

T

opt (z) = (1

,

2

)

1

=

2

I +

A

z

,1

;

A

= 1

p

3

"

1

p

2

1

1

,p12

#

(26) In the first order of

this leads to

S opt (z) = (1

,

2

)

1

=

2A

(z

,1

+ z) +

,2

2

I

A

(z

,1

+ z)

. By numeri- cal evaluation we get:

C

(

S

opt),

C

0opt

C

(

S

opt)

=

1

:

375,11

:

375

:

125

= 0:18

,

hence

S opt

is close to optimal. The optimization of

gives

=

n

(

E

jj

H

jj22

Mp

)

C

(

S

opt)o1

=

4, usually the normalization is

E

jj

H

jj2

p = 1

, then

= 0:91

n

M

o,1

=

4 and

MC

(

2b

I

)

=

1

:

66(

M

)1=2

MC

(

b2

I

) / ln1

p

M

. The same ratio in the case of the ex- clusive use of a training sequence gives

MC

(

b2

I

)/

M TS pm

,1ln

.

To achieve the same performance, the length of the training sequence needs to be of the order

q

M m

which can become very important for large MIMO systems.

REFERENCES

[1] A. Medles, D.T.M. Slock and E. De Carvalho “Linear prediction based semi-blind estimation of MIMO FIR channels”. In Proc. Third Workshop on Signal Processing Advances in Wireless Communications (SPAWC’01), pp. 58-61, Taipei, Taiwan, March 2001.

[2] G. Leus, P. Vandaele, and M. Moonen. “Deterministic Blind Modulation- Induced Source Separation for Digital Wireless Communications”. IEEE Trans. on Signal Processing, SP-49(1):219–227, Jan. 2001.

[3] J. Xavier, V.A.N. Barroso and J.M.F Moura. “Closed-Form Correlative Coding(CFC2)Blind Identification of MIMO Channels: Isometry Fit- ting to Second Order Statistics”. IEEE Trans. Signal Processing, SP- 49(5):1073–1086, May 2001.

[4] Y. Hua and Y. Xiang. “The BID for Blind Equalization of FIR MIMO Channels”. In Proc. ICASSP Conf., Salt Lake City, USA, May 2001.

[5] A. Gorokhov and Ph. Loubaton. “Subspace based techniques for second- order blind separation of convolutive mixtures with temporally correlated sources”. IEEE Trans. Circuits and Systems, 44:813–820, Sept. 1997.

[6] H. Hat´es and F. Hlawatsch. “Blind Equalization of MIMO Channels Us- ing Deterministic Precoding”. In Proc. ICASSP Conf., Salt Lake City, USA, May 2001.

[7] I.E. Telatar. “Capacity of Multi-antenna Gaussian Channels”. Eur. Trans.

Telecom., vol. 10, no. 6, pp. 585-595, Nov/Dec 1999.

[8] A. Medles and D.T.M. Slock. “Semiblind channel estimation for MIMO spatial multiplexing systems”. In Proc. 35th Asilomar Conference on Sig- nals, Systems and Computers, Pacific Grove, CA, USA, Nov. 2001.

Références

Documents relatifs

Motivated by the iterative power method, used in Numerical Analysis for estimating singular values and singular vectors, we develop RLS and LMS subspace based adaptive algorithms

Training sequence methods base the parameter estimation on the received signal containing known symbols only and all the other observations, contain- ing (some) unknown symbols,

Whereas we have considered block fading so far in this paper, we conjecture that these results extend to the continu- ous transmission (CT) case: in steady-state, channel estima-

puts, most of the blindmultichannelidenticationtech- niques are not very robust and only allow to estimate the channel up to a number of ambiguities, especially in the MIMO case. On

In this study we propose a technique to estimate the downlink (DL) channel spatial covariance matrix R d in realistic massive MIMO systems operating in the frequency division

In the Gaussian symbols case on the other hand, robustness to channel length overdetermination and channel zeros arises, and the channel unidentifiablity gets reduced to a block

The first step involves a large SVD in order to obtain a base for the noise subspace of the received signal, and the second consists in applying either a small SVD [8] or

In this paper, we present a new iterative algorithm for joint Rayleigh channel (taps or complex gains) estimation and data recovery in fast time-varying environments.. After