Linear Precoding for Spatial Multiplexing MIMO Systems:
Blind Channel Estimation Aspects
Abdelkader Medles, Dirk T.M. Slock
Eurecom Institute1 2229 route des Crˆetes, B.P. 193
06904 Sophia Antipolis Cedex FRANCE
Abstract - For the case of white uncorrelated inputs, most of the blind multichannel identification techniques are not very robust and only allow to estimate the channel up to a number of ambiguities, especially in the MIMO case.
On the other hand, all current standardized communica- tion systems employ some form of known inputs to allow channel estimation. The channel estimation performance in those cases can be optimized by a semiblind approach which exploits both training and blind information. When the inputs are colored and have sufficiently different spec- tra, the MIMO channel may become blindly identifiable up to one constant phase factor per input, and this under looser conditions on the channel. For the case of spatial multiplexing, possible cooperation between the channel in- puts allows for more complex MIMO source prefiltering that may allow blind MIMO channel identification up to just one global constant phase factor. We introduce semib- lind criteria that are motivated by the Gaussian ML ap- proach. They combine a training based weighted least- squares criterion with a blind criterion based on linear pre- diction. A variety of blind criteria are considered for the various cases of source coloring.
I. INTRODUCTION
The multichannel aspect has led to the development of a wealth of blind channel estimation techniques over the last decade. In this paper, blind identification shall mean channel identification on the basis of the second-order statistics of the received signal. Consider linear digital modulation over a lin- ear channel with additive Gaussian noise. Assume that we have
p
transmitters andm > p
receiving channels (e.g. antennas in BLAST or SDMA). For more details on the used notation in this paper refer to [1].II. BLINDIDENTIFICATION FORCOLOREDINPUTS
In this section we aim to improve the Second-Order Statis- tics (SOS) based blind channel identification by exploiting cor- relation in the inputs. In the context of digital communica- tions, the inputs are symbol sequences which are typically uncorrelated. Correlation can be introduced by linear con- volutive precoding, which corresponds to MIMO prefiltering of the actual vector sequence b
k
of symbols to be transmit- ted with a MIMO prefilter T(z)
such that the transmitted vec- tor signal becomes ak = T(q)
bk
. In this paper we consider
full rate linear precoding so that T(z)
is ap
p
square ma-
trix transfer function (in [2] an example of low rate precod- ing appears since the same symbol sequence gets distributed over all TX antennas). We get for the transmitted signal spec- trum
S
aa(z) =
T(z)S
bb(z)
Ty(z) = b2T(z)
Ty(z)
and for
1Eurecom’s research is partially supported by its industrial partners: As- com, Swisscom, France T´el´ecom, La Fondation CEGETEL, Bouygues T´el´e- com, Thales, ST Microelectronics, Motorola, Hitachi Europe and Texas In- struments. The work reported here was also partially supported by the French RNRT project ERMITAGES.
the received signal spectrum
S
yy(z) =
H(z)S
aa(z)
Hy(z) + S
vv(z) =
2b
H(z)
T(z)
Ty(z)
Hy(z) +
2v I m
. The choice of appropriate prefiltering, as we shall see below, may reduce the non-identifiability to a phase factor per source or even to a global phase factor. In the context of wireless communica- tions, two scenarios may be distinguished:Noncooperative scenario: this scenario corresponds to the multi-user case (on the transmitter side) without cooperation between users. We shall consider the simple case in which the users transmit through only one antenna. This noncooperative scenario can also arise in other source separation applications since natural sources tend to have different spectra. In this sce- nario, H
(z)
has no structure, other than possibly being FIR, and T(z)
andS
aa(z)
are diagonal. This scenario has been considered in [3],[4].Cooperative scenario: this is the single-user spatial multiplex- ing case. In this case, since transmit antennas are near each other and also receive antennas are near each other, all (FIR) entries in H
(z)
have the same delay spread and hence are poly- nomial of the same order.S
aa(z)
is allowed to be nondiagonal.In the noncooperative case the channel will tend to be ir- reducible, a characteristic we have assumed so far, due to the fact that the users tend to be spread out in space. In the spa- tial multiplexing case however, in which the TX antennas are essentially colocated, the irreducibility of the channel depends on the richness of the scattering environment. In general, we need to consider a reducible channel. Such a channel can be factored as H
(z) =
G(z)
C(z)
where G(z)
is irreducible and column reduced with columns in order of e.g. non-increasing degree. Ifr
is the (generic) rank of H(z)
, then G(z)
ism
r
whereas C
(z)
isr
p
. In the noncooperative scenario C(z)
hasno particular structure. In the cooperative scenario however, all entries in a particular row of C
(z)
have the same degree and the degrees of the rows are non-decreasing (the degree profile of the rows in C(z)
is complementary to the degree profile of the columns in G(z)
).If
r
m
,1
, then2v
is blindly identifiable fromS
yy(z)
andG
(z)
is blindly identifiable from the signal/noise subspaces ofS
yy(z)
up to a postmultiplication factor L(z)
that is block lower triangular with block sizes according to the multiplici- ties of the degrees of the columns of G(z)
and L(z)
is alsopolynomial with the degree of block
(i;j)
being the difference between the degrees of blocki
and blockj
of the columns of G(z)
[5]. So in particular, the diagonal blocks of L(z)
are con-stant. Also, L,1
(z)
has the same polynomial structure as L(z)
.In the cooperative case, L,1
(z)
C(z)
has the same polynomial structure as C(z)
. Ifr
m
,1
, there are essentially no restric- tions on the number of inputsp
for identifiability. Ifr = m
,identifiability of
v2becomes an issue and there’s no longer a
point in considering a factorization of H(z)
for its identifica-
tion.
Finally, let us note that TX pulse shape filters can be incor- porated in T
(z)
orS
aa(z)
and that oversampling at the RX also leads to an increase in the number of RX channels. Also, the formulation of complex quantities as a superposition of real quantities may lead to an extra MIMO dimension. In the next two sections we investigate channel identifiability with diago- nal or full prefiltering T(z)
.III. NONCOOPERATIVE/DIAGONALPREFILTERING
In general, we would like to handle the reducible channel case. The rank
r
can be identified fromS
yy(z)
. Ifr
m
,1
, then we can denoise the SOS and identify the factor G(z)
from the subspaces. G
(z)
is unique up to a factor L(z)
. Forwhichever G
(z)
in this equivalence class, it remains to identify C(z)
in H(z) =
G(z)
C(z)
fromS(z) =
G#(z)(S
yy(z)
,2v I m )G#y(z) =
C(z)S
aa(z)
Cy(z)
(1) where G#
(z)
is a MMSE ZF equalizer for G(z)
: G#(z)
G(z) = I r. Forr = m
, the problem is similar to the one in (1) with
C(z)
replaced by H(z)
(apart from the 2v
identification is-
sue which will be discussed below). The value of the rank
r
2 f1;2;:::;min(m;p)
gis unpredictible in general. For a certain rankr
, subsets ofr
,1
columns of C(z)
could be iden- tified jointly fromS(z)
using certainS
aa(z)
and under certain conditions on C(z)
(or subsets ofr
columns under more strin- gent conditions on C(z)
). So to be general,S
aa(z)
shouldbe such that it allows identifiability for the worst case of
r
,which is
r = 1
. In that case, each column of C(z)
needs tobe identified separately. On the other hand, since in the case
r = 1
each column of C(z)
is a scalar FIR transfer function, only its minimum-phase equivalent is identifiable. So a col- umn would be truly identifiable only if it is minimum-phase.To avoid having zeros would require to impose
r
2
. In anycase, to be fully general, it is desirable to have
S
aa(z)
such thatit allows identifiability of each column of C
(z)
separately. So the MIMO problem gets converted into a set of disconnected SIMO (r > 1
) or SISO (r = 1
) problems. This will allow iden- tifiability of each column up to a constant phase factor of the forme jif the column has no maximum-phase zeros (which
is quite possible ifr
2
but highly unlikely forr = 1
). An-
other issue is the degree of C
j (z), columnj
of C(z)
. We have
H
j (z) = G(z)
Cj (z). The degree of Cj (z) is unpredictible
and can be up toN j
,1
, the degree of the corresponding col-
umn Hj (z)of H(z)
. For identifiability, we need to consider
the worst case and hence we shall assume that the degree of
Cj (z)isN j
,1
. Of course, theN j
themselves may be unpre-
dictible and in practice need to be replaced by an upper bound.
j (z) is unpredictible
and can be up toN j
,1
, the degree of the corresponding col- umn Hj (z)of H(z)
. For identifiability, we need to consider
the worst case and hence we shall assume that the degree of
Cj (z)isN j
,1
. Of course, theN j
themselves may be unpre-
dictible and in practice need to be replaced by an upper bound.
N j
We now consider two approaches for identification, leading to two classes of solutions for
S
aa(z)
.Frequency domain approach: The idea here is to introduce zeros into the diagonal elements of T
(z)
or henceS
aa(z)
suchthat all other elements other than diagonal element
j
shareN j zeros
T
jj (z) = Yp
i
=1;
6=j N
iY
k
=1(1
,z i;k z
,1)
(2)This allows identifiability of C
j (z) fromS(z)
up to a phase
since
S(z j;k ) =
Cj (z j;k )S a
ja
j(z j;k )
Cyj (z j;k ) ; k = 1;:::;N j
(3)where
S aja
j(z) =
2b T jj (z)T jj
y(z)
. If all N j are equal (to
N
1), then we can choose equispaced zeros on the unit cir- cle:Q
N k
=1i(1
,z i;k z
,1) = 1
,e jiz
,N
1. If we furthermore
choose overall equispacing by taking i = (i
,1)2=pN
1,
then
T ii (z) =
1,1,e
jiz
,z
pN,1N1. In [4] very similar work appears in which the degree ofS
aa(z)
(in the case of equalN j) is at least
pN
1(compared to(p
,1)N
1here), but the discussion in [4] is limited to the casem > p = 2
. Note that here we can easily allowp > m
even (more inputs than outputs!). Remark also that by introducing a number of zeros in T(z)
, we can further- more identify an equal number of noise parameters (such as 2v
for instance whenr = m
). Non-FIRS
aa(z)
can be consid- ered also. For instance we can consider the case in which theS aja
j(e j2f )
have at leastN j disjoint expansion coefficients
in some orthogonal basis. An extreme example of this would
beS aja
j(e j2f )
that are bandlimited with thep
bands being
non-overlapping.
f )
S aja
j(e j2f )
that are bandlimited with thep
bands being
non-overlapping.
f )
Time domain approach: The idea here is to introduce delay in the prefilter so that the correlations of each C
j (z) appear
separately in certain portions of the correlation sequence of
S(z)
. This can be obtained for instance with Tjj (z) = 1, j z
,d
j ; d j =
Xj
,1
i
=1N i :
(4)Identification can be done with a correlation sequence peeling approach that starts with the last column C
p (z)of which the
(1-sided) correlation sequence appears in an isolated fashion in
the lastN p
correlations ofS(z)
. Identification of Cp (z)from
its correlation sequence can be done up to a phase factor
e jp
(and up to the phase of zeros if C
p (z)has zeros). We can then
subtractS a
ja
j(z)
Cp (z)Cyp (z)(which does not require Cp (z)
p (z)
but only its correlation sequence) from
S(z)
which will then reveal the correlation sequence of Cp
,1(z)
in its lastN p,1
correlations, etc. The degree ofS
aa(z)
is in this case the de-
greed pofS apa
p(z)
which, in the case of all equalN j, is again
S apa
p(z)
which, in the case of all equalN j, is again
(p
,1)N
1, which leads to a degree ofpN
1,1
forS(z)
orhence
pN
1correlations. Such a degree forS
aa(z)
is not only sufficient but also necessary since whenr = 1
, there arepN
1parameters to be identified for which indeed at least
pN
1cor-relations are needed. Note that in the temporal approach, in- creasing all the delays
d jwith an amountD
allows furthermore
the (straightforward) identification of MA(D
,1
) noise (e.g.
D = 1
for white noise with arbitrary spatial correlation).In practice, with estimated correlations, the correlation peel- ing approach leads to increasing estimation errors as the columns of C
(z)
get processed. This error increase can be avoided by doubling the delay separation between sources, which may furthermore lead to simpler algorithms (e.g. SIMO subspace fitting with asymmetric covariance matrices). Time domain approaches also appear in [3].IV. COOPERATIVE/SPATIAL-MULTIPLEXING
PREFILTERING
Non-cooperative approaches can of course also be applied in the cooperative scenario, so diagonal prefiltering can be used for spatial multiplexing. However, this leads to at least a un- known phase per TX antenna and hence requires either differ- ential encoding or training symbols per TX antenna. By ap- plying full prefiltering, such that
S
aa(z)
is not blockdiagonalin which case it is said to be fully diverse, the channel may possibly be identified up to a global phase factor only. Since better identifiability results in this case, better estimation may possibly be another consequence. We consider here linear pre- coding by time-invariant MIMO prefiltering. In [6], a block precoding approach is considered.
We can work with the eigen or LDU decompositions of
S
aa(z)
. To begin with, consider the eigendecomposition Saa(z) =
V(z)
D(z)
Vy(z)
where V(z)
is paraunitary (i.e.V
y
(
z)
V(
z) =
I) and contains the eigenvectors as columns, and D(z)
is diagonal with the diagonal elements, the eigenval- ues, being valid scalar spectra.A paraunitary matrix V
(z)
is said to be full diverse, ifPV
(z)
Vy(1)
PT
cannot be made block diagonal for any per- mutationP.Theorem 1: : An irreducible FIR MIMO channel is blindly identifiable up to a phase factor per user if
S
aa(z)
has distinct eigen value functions, and up to one global phase factor if its eigenvector matrix is fully diverse.It may perhaps be more practical to work with the LDU (Lower triangular-Diagonal-Upper triangular) decomposition Saa
(z) =
L(z)
D(
z)
Ly(
z)
where L(z)
is lower triangular with unit diagonal and the non-zero off-diagonal elements be- ing unconstrained transfer functions, and D(z)
is diagonal with the diagonal elements being valid scalar spectra. The rela- tion between the LDU decomposition and the prefilter T(z)
is immediate if T
(z)
is of the form T(z) =
L(z)(z)
where(z)
is diagonal. An example of such a T(z)
that allows irre- ducible channel identification up to one global phase factor is(z) = I pand T(z) =
L(z) = I p + D z
e ,1whereD
ehas only
non-zero elements on the first subdiagonal and those elements are all different constants.
Stationary precoding can be generalized to cyclostationary precoding via periodically timevariant prefiltering. By stack- ing
q
consecutive symbol period quantities yk
, vk
, ak
, bk
, we obtain Yk
, Vk
, Ak
, Bk
. We can then introduce apq
pq
LTIMIMO prefilter T
(z)
such thatY
k
,Vk = (I q
H(q))
Ak = (I q
H(q))
T(q)
Bk : (5)
Cyclostationary prefiltering introduces more information, hence should allow improved estimation (and possibly avoid stationary noise).
V. GAUSSIAN ML SEMIBLINDCHANNELIDENTIFICATION
For Gaussian ML, which will allow to exploit the SOS, we model the unknown symbols as uncorrelated Gaussian vari- ables whereas the known symbols b
k
lead to a non-zero mean.By neglecting the non-stationarity due to the known symbols, the Gaussian likelihood function can be written in the fre- quency domain:
H
[M lndet(S
yy(z))+
(
y(z)
,H(z)
T(z)
bK (z))yS
yy,1(z)(
y(z)
,H(z)
T(z)
bK (z))]
(6) where
H
is short for 1
2
j
H
dzz
and y(z)
, bK (z) denote thez
transforms of the signal of
M
samples yk
and the known sym- bols bk
. The gradient of this criterion is the same as the gradi- ent of the following sum of two subcriteria. The first subcrite- rion isI
(
y(z)
,H(z)
T(z)
bK (z))yS
,1yy(z)(
y(z)
,H(z)
T(z)
bK (z))
(7)
which is a weighted LS criterion (quadratic in H
(z)
) with thetraining information. The second subcriterion is tr
I
f
S
yy,1(z) S
eyy(z)S
yy,1(z) S
eyy(z)
g (8)where
S
eyy(z) = S
yy(z)
,S
byy(z)
,S
byy(z) = M1y(z)
y(z)
y
(periodogram), and the gradient is taken by considering
S
yy,1(z)
as constant. This second criterion is one of weighted spectrum matching and expresses the blind information. By taking the gradient of the sum of the two subcriteria, we com- bine training and blind information in an optimal fashion (com- pare to the CRB expression for GML). Asymptotically we can replaceS
yy,1(z)
by a consistent estimateS
b,1yy(z)
such as the periodogram. Also, we can replace the periodogram by an AR model which matches the covariance sequence estimate ap- pearing implicitely in the periodogram (or asymptotically by a consistent AR model) such that P(z) S
byy(z)
Py(z) = I
sothat
S
b,1yy(z) =
Py(z)
P(z)
, where P(z)
is the MIMO predic- tion error filter in which a square-root of the prediction error covariance matrix inverse has been absorbed. The blind sub- criterion then becomesI
P
(z)S
yy(z)
Py(z)
,I m2F
(9)
which is of fourth order in H(z)
. One solution consists of inter-
preting H1(z)
and H2(z)
inS
yy(z) =
H1(z)S
aa(z)
H2(z) +
2v I m
as different quantities and performing alternating opti-
mizations between them (and v2).
VI. BLINDGML CHANNELIDENTIFICATION FOR AFLAT
CHANNEL
Here we shall focus on the blind identification part for a frequency flat channel H
=
G C withr < m
. We can take G= V
S, an orthonormal matrix spanning the signal subspace.We shall estimate first
V
Sand then C.Identification of the signal subspace
V
S:We can alternatively estimate the noise subspace
V
N. Ide-ally, R
L (I L
V
N) = 0
where RL
is the denoised covariance matrix ofL
symbol periods of yk
. We shall estimateV
N usinga weighted LS criterion
V
N:V
NHmin VN=I
fmvect(
,r RbL (I L
V
N))
gH
Wfvect(
RbL (I L
V
N))
g
(10) where Rb
L
is the sample covariance matrix, RbL = RL +
Re
L
. The optimal weighting is W=
Evect(
RbL (I L
V
N))
gfvect(
RbL (I L
V
N))
gH
. WithRbL
based onM
sam-ples, we get Ef
vect(
ReL )gfvect(
ReL )gH = M
1RTL
RL
. This
H = M
1RTL
RL
. Thisallows us to work out the WLS criterion (10) to become
V
N:V
NHmin VN=I
m,r tr
fV
NH
ryy(0)V
Ng (11)
where ryy
(0)
is the estimated correlation matrix of yk
at lag0
. The solution is clearly given by the noise subspace of the matrix in the middle, so thatV
S becomes its signal subspace.Identification of C:
By equalizing
V
S, we get r(i) = V S Hryy(i)V S =
C raa
(i)
CH
. We introduce a normalization so that r,1=
2(0)
C r1aa=
2(0)
raa(i)
rH=
aa2(0)
CH
r,H=
2(0) =
r(i)
(12)
for
i = 0;1;:::;L
, and where r(i) =
r,1=
2(0)
r(i)
r,H=
2(0)
and raa
(i) =
r,1aa=
2(0)
raa(i)
r,aaH=
2(0)
. Fromi = 0
, we ob-serve that r,1
=
2(0)
C r1aa=
2(0) = Q
for some matrixQ
withorthonormal rows, or hence C
=
r1=
2(0)Q
r,1aa=
2(0)
. To findestimate
Q
, we shall assumer = p
so thatQ
is square and uni- tary, and we shall solve (12), which becomesQ
raa(i)Q H =
r
(i)
, fori = i;:::;L
in a least-squares sense:Q
:trfmin QHQ
g=r L
X
i
=1k
Q
raa(i)
,r(i)Q
k2F
(13)The solution of this problem involves an eigendecomposition and is unique in general up to a phase factor.
VII. PRECODER OPTIMIZATION
We focus here on the flat channel case, and we study the op- timization of the precoder to maximize the available capacity of the system, which is also the mutual information when the channel is estimated I
S
a a(
y;
bjHb)
.The ergodic capacity of an AWGN channel, when the chan- nel knowledge is absent at the transmitter and perfect at the receiver is given by:
C(S
a a) =
EIS
a a(
y;
bjH)
=
EHlndet(I + 1v2HS
a a(z)
HH )
(14)
Where
S
aa(z) =
PL i
raa(i)z
,i
, the expectation E is here w.r.t. the distribution of the channel. As in [7], we as- sume the entries Hi;j
of the channel to be mutually inde- pendent, identically distributed zero mean complex Gaus- sian variables (Rayleigh flat fading MIMO channel model).Telatar has shown [7] that for such a channel model, the op- timization of the capacity subject to the TX power constraint
H
tr
(S
a a(z))
N tx
2b
leads to the requirement of a white (and Gaussian) vector transmission signalS
a a(z) =
2b I.
For a block of length
M
(sufficiently large for the fre- quency domain expressions to be sufficiently accurate), and using a precodingS
a a(z) =
2b (I + S(z)) ( << 1
,
S(z) =
PL i S i z,i
andH tr(S(z)) =
tr(S
0) = 0
for
i
andH tr(S(z)) =
tr(S
0) = 0
forpower constraint), the deviation of the available capacity for the whole block is:
= MC(
2b I),M
E IS
aa(
y;
bjHb)
=M(C( b2I)
,C(S
a a))
| {z }
1
+M(C(S
a a)
,E IS
aa(
y;
bjHb))
| {z }
2
1 represents the deviation (decrease) of the capacity due to(15) the precoding, whereas2represents the deviation caused by channel estimation errors.Let
= 2b2v andR = I +
H HH
, then:
1 =
,M
EHlndet(I + R
,1HS(z)
HH
=
,M
EH[tr
fR
,1HS(z)
HH
g,
22tr
fR
,1HS(z)
HH R,1HS(z)
HH
g+ O(
3)]
,
Mtr
fEHH R,1HS
0g
+M
22Etr
fHHH R,1HS(z)
HH R,1HS(z)
g
S(z)
g(16) We will show below that E H
H R,1H is a multiple of the
unity matrix I
. In fact, for every permutation matrix P
and
unitary diagonal matrix
D = diag(e j 1;:::;e j p)
, H0 =
)
, H0=
H
PD
has the same distribution as H, then E HH R,1H = D
P H
E HH R,1HPD
. By averaging over the set of permu-
tations P and phases[0;2] p
, we get:
E H
H R,1H=
(21
)p
Z
1 p!
X
P
2PD
P HEHH R
,1HPD d
1:::d p
= trE HHRp
,1H I
(17) The first term in
1is,M trEHH Rp
,1HtrS
0 = 0
.
1 =M
22Pi;j
Etr
fHz
,(i
+j
)HH R
,1HS iHH R
,1HS jg
=M
22Pi
Etr
fHHH R,1HS i
HH R,1HS Hi
g
=M
2Pi vect H (S i )Wvect(S i ):
(18) where W
=
2E(
HH R,1H) T
(
HH R,1H)
,Pi
is done over
all index valuesi
for whichS i
6= 0
andHH R,1H= (I + (
HH
H)
,1)
,1then for high SNR i.e >> 1
: WI
.
As shown in [8], the deviation of the capacity for a small channel estimation error is
2= ME
He2. To mini-mize
2the channel estimator should be the MMSE estimator.The MMSE estimator doesn’t achieve in general the Cramer- Rao Bound (
CRB
) but a good measure of the deviation is 2= Mtr
ECRB = Mtr
EJ HH# , whereJ HH is the Fis-
cher Information Matrix.
Subspace fitting gives an estimation of the channel Hb0 from the estimated correlation at lag 0 (see Section VI.) ryy
(0) =
HS
0HH + I, which leads asymptotically to H0 =
H
S
01=
2QH
. The identification of the unitary matrix Q is done from the remaining correlation lag, H0#ryy(i)
H0#H =
Q
S
0,1=
2S i S
0,H=
2QH
; as a result we can assume without loss of generality thatS
0= I
and henceS
0= 0
. The MeanSquare Error of the subspace estimator H0is of order
M
,1.H
=
H0Q=
Hb0Q
b|{z}
=
Hb
+
Hb0Qe|{z}
/
g
()=
pM
+
He0Q|{z}
/1
=
pM
(19)
where
g()
is an increasing function of,1. For largeM
andsmall
we can neglect the last term, and assume thatHb0=
H0.The source of the channel error is then caused by the estimation error on Q. If X
= (
x1;
x2;:::;
xn
p)
is a real parameterization of the unitary matrix, i.e Q=
Q(
X)
, then:J
HH = @ @Xh JJ XX
@
X@
hT
(20) where h
= [Re(vect(
H)) T ;Im(vect(
H)) T ] T andJJ XX =
,EYjH0
@
lnp
(YjX;
H0)@
X
@
lnp
(YjX;
H0)@
XT
. An asymptotic expression of
JJ XX for largeM
is then :
JJ XX (i;j) = M
Itr
fS
yy,1(z)@S
yy(z)
@
xi S,1yy(z)@S
yy(z)
@
xj
g(21) By proceeding as for
1, we can show that:JJ XX (i;j) = M
f2Xk vect H (S k )W Ji;j vect(S k )+O(3)
g
(22) where
W Ji;j =
@
Q@
xiQH
R
0,T
R
0,1@
Q@
xjQ
and
R
0= I + (
H0H
H0)
,1. We can note that for high SNR i.e>> 1
:R
0I
. The decrease is now 2=
,2Etr
JM
HH2#=
,2Etr
JJ M
XX2,1
@
X@
hT @
X@
h
,1
. The minimization of the capacity lost
can be performed by evaluating the expectation, which is quite complex, and then optimizing the resulting cost function. An alternative ap- proach is to consider a close lower bound ofthat is easier to calculate and to minimize. From the solution forS(z)
,we can then find the solution for the precoder T
(z)
. For aprecoder T
(z) = (1
,2)
1=
2I +
Pk M F
,1F k z
,k
, whereM F = (
Pi F i F i H )1=
2 verifiesM F M F H =
Pi F i F i H
and
F
0= 0
, the power spectrum is:
S
a a(z)= b2fI + (1
,2)
1=
2Pk
,M F,1F k + F
,H k M F
,H
z
,k +
2,M F,1F(z)F
y(z)M F,H
,I
g
F k + F
,H k M F
,H
z
,k +
F(z)F
y(z)M F,H
,I
g
(23) This spectrum verifies the power constraint as
S
0= 0
. Thefirst order approximation is:
S(z) =
Pk
,M F,1F k + F
,H k M F
,H
z
,k
.
Example of optimization for
p = 2
:We introduce the following parameterization of Q: X
= (;'; )
2[0;]
[0;2]
2, that allows to determine Q up to a global phase factor:Q
(;'; ) =
he j cos e j sin
,
e
,j sin e,j cos
i
(24) We consider here the case of high SNR, i.e
>> 1
, in whichcase
1= 2M
2PL
1jjS ijj2andR
0,T
R
0,1 = I
. To
optimize
, we fix 1 by fixingPL
1jjS ijj2 = 1
, we op-
timize then
2 under this last constraint, and given the pre- coder solution, optimize finally the overall decrease withrespect to the scale factor
. We will skip below several de- tails of the derivation due to lack of place. First we establish the following equalities:@
X@
h
#
= @ @XThT = @ @QTXTv @ @
QhTv, where
@ @
QhTv, whereQ
v = [Re(vect(Q)) T ;Im(vect(
Q)) T ] T
.
Considering now the independence between H0and
Q
wecan write:
2=
,2EXtr
JJ M
XX2,1@ @
QTXTv EH0@
hT@
Qv@
hT@
QvT
@
QTv@
XTT
=
,2 EjjH p
jj2tr
EX
8
<
:
JJ M
XX2@
QTv@
XT@
QTv@
XTT
!,19=; ,1
=
,2 EjjH p
jj2C(S)
(25) where EX (resp. EH0) denotes expectation with respect to X (resp. H0), and to obtain the second expression we used arguments similar to the ones used to prove that E H
H R,1H = tr
E HHRp
,1HI
and applied here to H0 (as- sumed to have the same distribution as H) in order to prove that: EH0@
hT@
Qv@
hT@
QvT
= EjjH p
jj2I
. The main difficulty is to
minimize
C(S)
, which is possible directly but at high numer- ical cost. To avoid it, let’s first note that as result of the convex- ity character of the functionf(x) = 1=x
overR+:C(S)
C
0(S) = tr
8
<
:
EX
JJ M
XX2@
QTv@
XT@
QTv@
XTT
!,19=; ,1
, then for
S opt optimum of C
0, C(S opt )
C opt C opt0 .
C opt0 .
If
C
(S
opt),C
0optC
(S
opt)<< 1
then we can conclude thatS opt minimizes 2. In our case C
0(S) =
14m
5+1 mm12
+m
2,
where m
1 =
PL i
=1jjS i (1;1)
,S i (2;2)
jj2 and m
2 =
P
L i
=1jjS i (1;2)
jj2+
jjS i (2;1)
jj2, the optimal solution (C opt0 = 1:125
) verifies: S i (2;2) =
,S i (1;1); i = 1;:::;L
,m
2=
23, andPL i
=1jjS i (1;1)
jj2+
jjS i (1;1)
jj2=
1
3
. One choice of a causal precoder with the smallest order L=1 that verifies these optimality conditions is:
T
opt (z) = (1,2)
1=
2I +
Az
,1;
A= 1
p3
"
1
p
2
1
1
,p12#
(26) In the first order of
this leads toS opt (z) = (1
, 2)
1=
2A(z
,1+ z) +
,22
I
A(z
,1+ z)
. By numeri- cal evaluation we get:C
(S
opt),C
0optC
(S
opt)=
1:
375,11:
375:
125= 0:18
,hence
S optis close to optimal. The optimization ofgives
=
n
(E
jjH
jj22Mp
)C
(S
opt)o1=
4, usually the normalization is
E
jjH
jj2p = 1, then = 0:91
nM
o,1=
4 and MC
(
2bI
) =
1
:
66(M
)1=2MC
(b2I
) / ln1p
M
. The same ratio in the case of the ex- clusive use of a training sequence givesMC
(b2I
)/M TS pm,1ln
.
To achieve the same performance, the length of the training sequence needs to be of the order
q
M mwhich can become very important for large MIMO systems.
REFERENCES
[1] A. Medles, D.T.M. Slock and E. De Carvalho “Linear prediction based semi-blind estimation of MIMO FIR channels”. In Proc. Third Workshop on Signal Processing Advances in Wireless Communications (SPAWC’01), pp. 58-61, Taipei, Taiwan, March 2001.
[2] G. Leus, P. Vandaele, and M. Moonen. “Deterministic Blind Modulation- Induced Source Separation for Digital Wireless Communications”. IEEE Trans. on Signal Processing, SP-49(1):219–227, Jan. 2001.
[3] J. Xavier, V.A.N. Barroso and J.M.F Moura. “Closed-Form Correlative Coding(CFC2)Blind Identification of MIMO Channels: Isometry Fit- ting to Second Order Statistics”. IEEE Trans. Signal Processing, SP- 49(5):1073–1086, May 2001.
[4] Y. Hua and Y. Xiang. “The BID for Blind Equalization of FIR MIMO Channels”. In Proc. ICASSP Conf., Salt Lake City, USA, May 2001.
[5] A. Gorokhov and Ph. Loubaton. “Subspace based techniques for second- order blind separation of convolutive mixtures with temporally correlated sources”. IEEE Trans. Circuits and Systems, 44:813–820, Sept. 1997.
[6] H. Hat´es and F. Hlawatsch. “Blind Equalization of MIMO Channels Us- ing Deterministic Precoding”. In Proc. ICASSP Conf., Salt Lake City, USA, May 2001.
[7] I.E. Telatar. “Capacity of Multi-antenna Gaussian Channels”. Eur. Trans.
Telecom., vol. 10, no. 6, pp. 585-595, Nov/Dec 1999.
[8] A. Medles and D.T.M. Slock. “Semiblind channel estimation for MIMO spatial multiplexing systems”. In Proc. 35th Asilomar Conference on Sig- nals, Systems and Computers, Pacific Grove, CA, USA, Nov. 2001.