HAL Id: hal-01420957
https://hal.archives-ouvertes.fr/hal-01420957
Submitted on 23 Dec 2016
HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.
L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.
Division algebra codes achieve MIMO block fading channel capacity within a constant gap
Laura Luzzi, Roope Vehkalahti
To cite this version:
Laura Luzzi, Roope Vehkalahti. Division algebra codes achieve MIMO block fading channel capacity
within a constant gap. IEEE International Symposium on Information Theory, Jun 2015, Hong-Kong,
China. pp.446 - 450, �10.1109/ISIT.2015.7282494�. �hal-01420957�
Division algebra codes achieve MIMO block fading channel capacity within a constant gap
Laura Luzzi
Laboratoire ETIS, CNRS - ENSEA - UCP Cergy-Pontoise, France
[email protected]
Roope Vehkalahti
Department of Mathematics and Statistics, University of Turku Finland
[email protected]
Abstract—This work addresses the question of achieving capac- ity with lattice codes in multi-antenna block fading channels when the number of fading blocks tends to infinity. In contrast to the standard approach in the literature which employs random lattice ensembles, the existence results in this paper are derived from number theory. It is shown that a multiblock construction based on division algebras achieves rates within a constant gap from block fading capacity both under maximum likelihood decoding and naive lattice decoding. First the gap to capacity is shown to depend on the discriminant of the chosen division algebra;
then class field theory is applied to build families of algebras with small discriminants. The key element in the construction is the choice of a sequence of division algebras whose centers are number fields with small root discriminants.
Index Terms—MIMO, block fading, number theory, division algebras
I. I
NTRODUCTIONThe closed-form expression of the capacity of ergodic multiple-input multiple-output (MIMO) channels was given in [1] and [2]. In this work we consider the question of achieving MIMO capacity with lattice codes and we prove that there exists a family of so-called multi-block division algebra codes [3, 4] that achieve a constant gap to capacity over the block fading MIMO channel when the number of fading blocks tends to infinity.
Our constructions are based on two results from classical class field theory. First we choose the center K of the algebra from an ensemble of Hilbert class fields having small root discriminant and then we prove the existence of a K-central division algebra with small discriminant. Our lattices belong to a very general family of division algebra codes introduced in [3, 4, 5], and developed further in [6] and [7]. We will use the most general form presented in [8].
While we discuss specific lattice codes from division alge- bras, our proofs do work for any ensemble of matrix lattices with asymptotically good normalized minimum determinant.
The larger this value is, the smaller the gap to the capacity.
This work suggests that capacity questions in fading chan- nels are naturally linked to problems in the mathematical research area of geometry of numbers. Unlike our previous work in the single antenna case [9], many of the questions that arise have not been actively studied by the mathematical community.
We note that, while studying diversity-multiplexing gain tradeoff (DMT) of multiblock codes in [4], H.-f. Lu conjec- tured that these codes might approach MIMO capacity. Our work confirms that conjecture; however, we point out that it is unlikely that DMT-optimality alone is enough to approach capacity. Instead one should pick the code very carefully by maximizing the normalized minimum determinant.
II. P
RELIMINARIESA. Channel model
We consider a MIMO system with n transmit and n
rreceive antennas, where transmission takes place over k quasi-static Rayleigh fading blocks of delay T = n. Each multi-block codeword X ∈ M
n×nk( C ) has the form (X
1, X
2, . . . , X
k), where the submatrix X
i∈ M
n( C ) is sent during the i-th block.
The received signals are given by
Y
i= H
iX
i+ W
i, i ∈ {1, . . . , k} (1) where H
i∈ M
nr×n( C ) and W
i∈ M
nr×T( C ) are the channel and noise matrices. The coefficients of H
iand W
iare modeled as circular symmetric complex Gaussian with zero mean and unit variance per complex dimension, and the fading blocks H
iare independent. We assume that perfect channel state information is available at the receiver, and that decoding is performed after all k blocks have been received. We will call such a channel an (n, n
r, k)-multiblock channel.
A multi-block code C in a (n, n
r, k)-channel is a set of matrices in M
n×nk( C ). In particular we will concentrate on finite codes that are drawn from lattices. Let R denote the code rate in bits per complex channel use; equivalently, |C| = 2
Rkn2. We assume that every matrix X in a finite code C ⊂ M
n×nk( C ) satisfies the average power constraint
1
nk kXk
2≤ P. (2)
B. Lattices and spherical shaping
Definition 2.1: A matrix lattice L ⊆ M
n×T( C ) has the form L = Z B
1⊕ Z B
2⊕ · · · ⊕ Z B
s, where the matrices B
1, . . . , B
sare linearly independent over R , i.e., form a lattice basis, and s is called the rank or the dimension of the lattice.
In the following we will use the notations R (L) for the
linear space which is generated by the basis elements of the
lattice L, and Vol(L) for the volume of a fundamental region of L according to the Lebesgue measure in R (L).
Lemma 2.2: [10] Let us suppose that L is a lattice in M
n×kn( C ) and S is a Jordan measurable bounded subset of R (L). Then there exists X ∈ M
n×kn( C ) such that
|(L + X ) ∩ S| ≥ Vol(S) Vol(L) .
C. Minimum determinant for the multiblock channel
Let us first assume that we have an l-dimensional square matrix lattice L in M
n×n( C ). The minimum determinant of the lattice L is defined as
det
min(L) = inf
X6={0}
{| det(X)|}.
The pairwise-error probability based determinant criterion by Tarokh et al. [11] motivates us to define the normalized minimum determinant δ(L), which is obtained by scaling the lattice L to have a unit volume fundamental parallelotope before taking the minimum determinant. A simple computation proves the following:
Lemma 2.3: Let L ⊂ M
n×n( C ) be an l-dimensional matrix lattice. We then have that
δ(L) = det
min(L) /(Vol(L))
n/l.
This concept generalizes to the multiblock case as follows.
Let us suppose that L ⊂ M
n×kn( C ) is a multiblock code and that X = (X
1, X
2, . . . X
k) is a codeword in L. The received signal matrix
(H
1X
1, H
2X
2, . . . , H
kX
k) + (W
1, W
2, . . . , W
k), can just as well be written in the form
(H
1, H
2, · · · , H
k)diag(X ) + diag(W
1, W
2, . . . , W
k), where the diag-operator places the i-th n × n entry in the i-th diagonal block of a matrix in M
kn×kn( C ). This reveals that optimizing a code L for the (n, n
r, k)-multiblock channel is equivalent to optimizing diag(L) for the usual one shot nk × kn
rMIMO channel, where diag(L) is defined as {diag(X ) | X ∈ L}.
Definition 2.4: By abusing notation we define
det
min(L) := det
min(diag(L)) and δ(L) := δ(diag(L)).
III. L
ATTICES FROM DIVISION ALGEBRASLet us now describe how lattice codes from division alge- bras are typically built.
Definition 3.1: Let K be an algebraic number field of degree m and assume that E/K is a cyclic Galois extension of degree n with Galois group Gal(E/K ) = hσi. We can define an associative K-algebra
D = (E/K, σ, γ) = E ⊕ uE ⊕ u
2E ⊕ · · · ⊕ u
n−1E, where u ∈ D is an auxiliary generating element subject to the relations xu = uσ(x) for all x ∈ E and u
n= γ ∈ K
∗. We call the resulting algebra a cyclic algebra.
It is clear that the center of the algebra D is precisely the field K. That is, an element of D commutes with all other elements of D if and only if it lies in K.
Definition 3.2: We call p
[D : K] the degree of the algebra D. It is easily verified that the degree of D is equal to n.
Definition 3.3: A Z -order Λ in D is a subring of D having the same identity element as D, and such that Λ is a finitely generated module over Z which generates D as a linear space over Q .
To every Z -order Λ in D we can associate a non-zero integer d(Λ/ Z ) called the Z -discriminant of Λ [12, Chapter 2].
IV. A
SYMPTOTICALLY GOOD FAMILIES OF DIVISION ALGEBRA CODESA. Number fields with small root discriminants
The following theorem by Martinet [13] proves the exis- tence of infinite sequences of number fields K with small discriminants d
K. As we will see, choosing such a field as the center of the algebra D is the key to obtaining a good normalized minimum determinant.
Theorem 4.1: There exists an infinite tower of totally com- plex number fields {K
k} of degree 2k = 5 · 2
t, such that
|d
Kk|
2k1= G, (3) for G ≈ 92.368, and an infinite tower of totally real number fields {F
k} of degree k = 2
tsuch that
|d
Fk|
k1= G
1, (4) where G
1≈ 1058.
B. Division algebra based 2kn
2-dimensional codes in M
n×nk( C )
Let us now suppose that we have a totally complex field K of degree 2k and a K-central division algebra D of degree n.
Proposition 4.2: [8, Proposition 5] Let Λ be a Z -order in D.
Then there exists an injective mapping ϕ : D 7→ M
n×nk( C ) such that ϕ(Λ) is a 2kn
2-dimensional lattice in M
n×nk( C ) and
det
min(ϕ(Λ)) = 1, Vol(ϕ(Λ)) = q
|d(Λ/ Z )| · 2
−2kn2, δ(ϕ(Λ)) = 2
2kn2|d(Λ/ Z )|
!
1/4n.
We can now see that in order to maximize the minimum determinant of a multiblock code, we should minimize the Z -discriminant of the corresponding O
K-order Λ, given by
d(Λ/ Z ) = N
K/Q(d(Λ/O
K))(d
K)
n2, where N
K/Qis the algebraic norm in K.
Let P
1and P
2be some prime ideals of K with norms N
K/Q(P
1) and N
K/Q(P
2). According to [14, Theorem 6.14]
there exists a degree n division algebra D having Z -order Λ with discriminant
d(Λ/ Z ) = (N
K/Q(P
1)N
K/Q(P
2))
n(n−1)(d
K)
n2. (5)
Let us now aim for building the families of (n, n, k) multi- block codes with as large as possible normalized minimum determinant.
A trivial observation is that every number field of degree 2k has prime ideals P
1and P
2such that
N
K/Q(P
1) ≤ 2
2kand N
K/Q(P
2) ≤ 3
2k. (6) Armed with this observation, we have the following.
Proposition 4.3: Given n there exists a family of 2n
2k- dimensional lattices L
n,k⊂ M
n×nk( C ), such that
det
min(L
n,k) = 1, Vol(L
n,k) ≤ 6
kn(n−1)G
2
n2k. Proof: Suppose that K is a degree 2k field extension in the Martinet family of totally complex fields such that (3) holds. We know that this field K has some primes P
1and P
2such that N
K/Q(P
1) ≤ 2
2kand N
K/Q(P
2) ≤ 3
2k. Then, there exists a central division algebra D of degree n over K, and a maximal order Λ of D, such that
d(Λ/ Z ) = (N
k/Q(P
1)N
K/Q(P
2))
n(n−1)(d
K)
n2≤
≤ (6
2k)
n(n−1)(G
2k)
n2.
Let us now show how we can design multiblock codes C having rate R, and satisfying average power constraint P , from a scaled version αL
n,k⊆ M
n×nk( C ) of the lattices defined in Proposition 4.3. Here α is a suitable energy normalization constant. We denote by B(r) the set of matrices in M
n×nk( C ) with Frobenius norm smaller or equal to r. According to Lemma 2.2, we can choose a constant shift X
R∈ M
n×nk( C ) such that for C = B( √
P kn) ∩ (X
R+ αL
n,k) we have 2
Rnk= |C| ≥ Vol(B( √
P kn))
Vol(αL
n,k) = C
n,kP
n2kα
2n2kVol(L
n,k) , where C
n,k=
(πnk)n2k
(n2k)!
. We find the following condition for the scaling constant:
α
2= C
1 n2k
n,k
P 2
RnVol(L
n,k)
n12k≥ C
1 n2k
n,k
P
2
Rn(G/2)6
1−n1. (7) C. Division algebra based kn
2-dimensional codes in M
n×nk( C )
Let K/ Q be a totally real number field of degree k and D a K-central division algebra of degree n. Then there exists an embedding ϕ : D → M
n×nk( C ) [7] with the following properties.
Proposition 4.4: Let us suppose that Λ is a Z -order in D.
Then ϕ(Λ) is an n
2k-dimensional lattice in M
n×nk( C ) and det
min(ϕ(Λ)) = 1, Vol(ϕ(Λ)) = p
d(Λ/ Z ), and δ(ϕ(Λ)) =
1
|d(Λ/ Z )|
1/2n.
Next, we will focus on a particular instance of this family of lattices. We use the notation H for matrices of the form
c −b
∗b c
∗and M
1×k(H) for matrices in M
2×2k( C ) such that each 2 ×2 block is of type H.
Proposition 4.5: [15] Let K be a totally real number field of degree k. Then there exists a degree 2 K-central division algebra D and a Z -order Λ ⊂ D such that ϕ(Λ) is a 4k- dimensional lattice in M
1×k(H) ⊆ M
2×2k( C ) and
det
min(ϕ(Λ)) = 1, Vol(ϕ(Λ)) = |d
K|
2.
Assuming that the center K belongs to the family of real fields from Theorem 4.1, we have the following.
Corollary 4.6: For every k = 2
m, there exists a 4k- dimensional lattice L
Alam,k⊂ M
1×k(H) such that
det
min(ϕ(Λ)) = 1, Vol(ϕ(Λ)) = G
2k1, where G
1≈ 1058.
Similarly to the previous section, we will produce codes C having rate R and satisfying average power constraint P from the lattices L
Alam,k⊆ M
2×2k( C ). Let α be an energy normal- ization constant. According to Lemma 2.2, ∃X
R∈ M
2×2k( C ) such that for C = B( √
2P k) ∩ (X
R+ αL
Alam,k) we have 2
2Rk= |C| ≥ Vol(B( √
P k2))
Vol(αL
Alam,k) = C
Alam,kP
2kα
4kVol(L
Alam,k) , where C
Alam,k=
(2kπ)(2k)!2k. Solving for α, we find
α
2= C
1 2k
Alam,k
P
2
RVol(L
n,k)
2k1≥ C
1 2k
Alam,k
P
2
RG
1. (8) V. A
CHIEVING CONSTANT GAPA. The codes for the (n, n, k)-multiblock channel
Let us now consider the lattice codes C of section IV-B.
Here the underlying lattice L
n,k⊂ M
n×nk( C ) is 2n
2k- dimensional. We are considering the channel model (1) in the symmetric MIMO case where n
r= n. We will analyze the performance of these codes when the number of antennas n is fixed and the number of blocks k tends to infinity.
Let ψ(x) =
dxdln Γ(x) denote the Digamma function. Then we have the following:
Proposition 5.1: Over the (n, n, k) multiblock channel, re- liable communication is guaranteed when k → ∞ for rates
R < n
log P
n e
n1Pni=1ψ(i)− log n + log πe
2 − log 6
1−n1G
when using the multiblock code construction in Section IV-B.
Remark 5.2: We can compare the achievable rate with the tight lower bound in [20, eq. (7)] for n transmit and receive antennas
1:
C ≥ n log
1 + P
n e
n1Pni=1ψ(i)1We note that the capacity (per channel use) of the block fading MIMO channel of finite block lengthT with perfect channel state information at the receiver is independent ofT [21, eq. (9)]. So the bounds in [2] and [20] still hold in our case.
Proof of Proposition 5.1: Let d
Hdenote the minimum Euclidean distance in the received constellation:
d
2H= min
X,X∈C¯ X6= ¯X
k
X
i=1
H
i(X
i− X ¯
i)
2
.
Suppose that the receiver performs maximum likelihood de- coding or “naive” lattice decoding (closest point search in the infinite lattice). For both, the error probability is bounded by
P
e≤ P (
kW k
2≥ d
H2
2)
,
where W = (W
1, . . . , W
k) is the multiblock noise. Note that d
2H≥ α
2n min
X∈Ln,k\{0}
k
X
i=1
|det(H
iX
i)|
n2≥
≥ α
2nk min
X∈Ln,k\{0}
k
Y
i=1
|det(H
iX
i)|
nk2≥ α
2nk
k
Y
i=1
|det(H
i)|
nk2where the first step comes from the Minkowski inequality, the second step comes from the arithmetic mean - geometric mean inequality, and the third from observing that Q
ki=1
|det(X
i)| ≥ 1 for all X ∈ L
n,k\ {0}. Therefore
P
e≤ P
( kW k
2kn
2≥ α
24n
k
Y
i=1
|det(H
i)|
nk2)
Given > 0, we can bound the error probability by P
kW k
2kn
2≥ 1 +
+ P α
24n
k
Y
i=1
|det(H
i)|
nk2< 1 +
(9) Note that 2 kW k
2∼ χ
2(2kn
2), and the tail of the chi-square distribution is bounded as follows for ∈ (0, 1) [16]:
P
( kW k
2kn
2≥ 1 + )
≤ 2e
−kn22
8
. (10)
Therefore the first term in (9) when k → ∞. In order to upper bound the second term, we need to analyze the distribution of the random variable Q
ki=1
|det(H
i)|
2.
In the single block case, it is well-known [17, 18] that if H is an n × n matrix with i.i.d. complex Gaussian entries having variance per real dimension 1/2, the random variable 2
n|det(H )|
2is distributed as the product V
n= Z
1· · · Z
nof n independent chi square random variables Z
j∼ χ
2(2j), j ∈ {1, . . . , n} with density p
Zj(x) =
2jΓ(j)1x
j−1e
−x2. We have
E [ln Z
j] = 1 2
jΓ(j)
Z
∞0
x
j−1e
−x2ln x dx = ψ(j) + ln 2.
where ψ(x) is the Digamma function. Let M
n= E [ln V
n] = n ln 2 +
n
X
j=1
ψ(j). (11) Observe that
E [Z
j−v] = 1 2
jΓ(i)
Z
∞0
x
j−1−ve
−x2dx = Γ(j − v) 2
vΓ(j) , (12)
E [Z
j−vln Z
j] = 1 2
jΓ(j)
Z
∞0
x
j−1−ve
−x2ln x dx =
= Γ(j − v)
2
vΓ(j) (ψ(j − v) + ln 2). (13) Now let’s turn to the multiblock case, and let S
k= 2
nkQ
ki=1
|det(H
i)|
2. We have S
k= V
n(1)· · · V
n(k), where V
n(i)= Z
1(i)· · · Z
n(i)are i.i.d. products of n independent chi squared random variables Z
j(i)∼ χ
2(2j). Note that E [ln S
k] = k E [ln V
n] = kM
n. Consider the zero-mean random variable
B
k= − ln S
k+ kM
n= −
k
X
i=1
ln V
n(i)+ kM
n=
k
X
i=1
T
n(i),
where T
n(i)are i.i.d. with distribution T
n= − P
nj=1
ln Z
j+ M
n. From the Chernoff bound [19] for B
k, given δ > 0,
∀v > 0 we have
P {B
k≥ nkδ} ≤ e
−vδnkE [e
vBk]. (14) The tightest bound in (14) is obtained for v
δsuch that
E [B
ke
vδBk] = δnk E [e
vδBk]. (15) It is easy to see that
E [e
vTn] = e
vMnn
Y
j=1
E [Z
j−v] = e
v(Mn−nln 2)n
Y
j=1
Γ(j − v) Γ(j) Recalling that the variables Z
jare independent, we find E [T
ne
vTn] = E
h −
n
X
j=1
ln Z
j+ M
ne
vMnY
nl=1
Z
l−vi
=
= e
vMnX
nj=1
E [−Z
j−vln Z
j] Y
l6=j
E [Z
l−v] + M
n nY
l=1
E [Z
l−v]
=
= e
v(Mn−nln 2)Y
nl=1
Γ(l − v) Γ(l)
M
n− n ln 2 −
n
X
j=1
ψ(j − v) .
We can finally compute E [e
vBk] = E
h
e
vPki=1Tn(i)i
= E [e
vTn]
k=
= e
vk(Mn−nln 2)n
Y
j=1
Γ(j − v)
kΓ(j)
kSimilarly,
E [B
ke
vBk] =
k
X
i=1
E
h T
n(i)e
vTn(i)i E
h e
Pl6=ivTn(l)i
=
= ke
kv(Mn−nln 2)M
n− n ln 2 −
n
X
j=1
ψ(j − v) Y
nl=1
Γ(l − v)
kΓ(l)
kThus, the tightest bound (15) is achieved for v
δsuch that nδ = M
n− n ln 2 −
n
X
j=1
ψ(j − v
δ) =
n
X
j=1
(ψ(j) − ψ(j − v
δ))
Clearly, for fixed n, v
δ→ 0 when δ → 0. From (14) we get
P {S
knk1≤ e
Mnn −δ} = P {B
k≥ nkδ} ≤
≤ e
k(vδ(−nδ+Mn−nln 2)−Pn
j=1
(ln Γ(j)−ln Γ(j−vδ)))
=
= e
k(ln Γ(1−vδ)+vδψ(1−vδ)+n
P
i=2
(−ln Γ(i)+ln Γ(i−vδ)+vδψ(i−vδ)))
. Recall that Γ(x) is monotone decreasing for 0 < x < a
0= 1.461632 . . . and monotone increasing for x > a
0. Using the mean value theorem for the function ln Γ(x) in the interval [i−v
δ, i] we get that for i = 1, v
δψ(1−v
δ)+ln Γ(1−v
δ) ≤ 0, and for i ≥ 2, v
δψ(i − v
δ) ≤ ln Γ(i) − ln Γ(i − v
δ). Thus,
P (
2
k
Y
i=1