Causal feedback and delay constraint
4.3 Long-term average capacity region
constrained channels [1] and denition of capacity region per unit cost given in [14].
Denition 2.
A rateK
-tuple R?2RK+ is long-term average-achievable if for all>
0 there existL
such that forL
L
variable-rate (N;L;
R;
;
)-codes can be found withR
k> R
?k ; fork
= 1;
;K
. A rateK
-tuple is achievable if it is -achievable for all 0< <
1. The long-term average capacity regionC
K;N() is the closure of the convex hull of all long-termachievable rate
K
-tuples.Denition 3.
AK
-tupler?2RK+ is a long-term average-achievable rate per unit energy if for all>
0 there exist an energy vector = (1;
;
K) such that for 1 variable-rate (N;L;
R;
=
(NL
);
)-codes can be found with (LNR
k)=
k> r
?k;fork
= 1;
;K
. A rateK
-tuple is achievable if it is -achievable for all 0< <
1. The long-term average capacity region per unit-energyU
K;N is the set of all long-term achievable rateK
-tuples perunit-energy.
In this setting, it is meaningful to study the largest achievable long-term average rate region, subject to the short-term power constraint (4.3). More-over, in the energy-limited case investigated here, a meaningful system de-sign criterion is to look for the largest achievable long-term average capacity per unit energy(bit/joule). Next, in analogy with [13, 14], we characterize the long-term average capacity region and the long-term average capacity per unit energy for our system. We also give limiting theorems for large delay
N
.4.3 Long-term average capacity region
We have the following result:
Theorem 1.
The long-term average capacity region is given byC
K;N() = [2;K;N()
R2RK+ : 8Af1
;
;K
gX
k2A
R
k E"
N
1XN
n=1log 1 +X
k2A
k;nk;n(Sn)!#)
(4.7) where expectation is with respect to the channel stateSN and where ;K;N() is the set of feasible causal short-term power allocation policies =f
k;n:1For two vectors a and b, the notation a b means that the dierence a;b has nonnegative components.
k
= 1;
;K;n
= 1;
;N
g dened as;K;N()=
(
2RKN+ : 1
N
XN
n=1
k;n(Sn)k)
(4.8) where
k;n(Sn) indicates the causality constraint.Proof.
The achievable part easily follows by constructing random codes with entries xk;n N(0 ;
k;nI
) such that the variances satisfy (4.8) and the rates satisfy all the inequalities in (4.7). The converse part follows by showing that the capacity of theN
-slot extension channel is achieved by Gaussian input distributions in the form of those used in the achievable part of the theorem. For details see Appendix 4.A.Remark.
The regionC
K;N() is convex in , in fact, by applying Jensen's inequality it is straightforward to see that if R(a) 2C
K;N() andR(b) 2
C
K;N() then, for every 2 [0;
1] we have R(a)+ (1;)R(b) 2C
K;N(). For this reason in Theorem 1 the convex-hull operation is not needed.For a given power policy in ;K;N(), let P() be the set of long-term average rates achievable by applying. Theorem 1 states that the long-term average capacity region
C
K;N() is the union of all the polymatroids P()C
K;N() = [2;K;N()
P() (4.9)
Such formulation of
C
K;N() is not useful unless we can determine its clo-sure set. We explicitly characterize the boundary surface of theC
K;N(), following the approach of [13], as the closure of the convex-hull allK
-tuplesR2RK+ that solve
max
R2CK;N()
XK
k=1
kR
k (4.10)for some= (
1;
;
K)2RK+. As in [13], the optimization in (4.10) can be written asmax
2;K;N() max
R2P()
XK
k=1
kR
k (4.11)SinceP() is a polymatroid, the solution of the inner maximization in (4.11) is attained by one of the (at most)
K
! vertices of P(). Such vertex is uni-vocally determined by the entries of the vector: it is the one correspond-ing to the decodcorrespond-ing order K;
K;1;
;
1 where is the permutation4.3 Long-term average capacity region 117
of f1;
2;
;K
g that orders in decreasing order, i.e., 1>
>
K. Hence, for any policy we havemax maxi-mization (4.10) into the maximaxi-mization of the right hand side (RHS) of (4.12) over the power policies 2;K;N(). Due to the causal nature of the chan-nel state information, the maximization of (4.12) with respect to , and hence the solution of (4.10), is obtained by Dynamic Programming. We have the following:
Theorem 2.
Dene forn
= 1;
;N
the Dynamic Programming recursionS
n(P
1;
;P
K;) = EProof.
Recursion (4.13) and the optimal power policy (4.16) follow easily from the general theory of Dynamic Programming [98] when the cost func-tion to be maximized is given by the RHS of (4.12) and the system evolves,from slot
n
to slotn
+ 1, according to (n;
P)! (n+1;
P ;bpn). Detailscan be found in Appendix 4.B.
Note that
XK
k=1
kR
bk;N(;
) = 1N S
N(N
1;
;N
K;) (4.17) In [92], the authors computed numerically the recursion (4.13) forK
= 1 in the Rayleigh fading case.Remark.
In contrast with [13], the convex-hull operation in (4.14) is needed since the rates RbN(;
) might not be continuous functions of . Consider, as an example, the case ofN
= 1. The regionC
K;1() coincides with the ergodic capacity region of a fading channel without CSI at the transmitters, the dominant face of which is an hyper-plane inK
dimensions.Due to the polymatroid structure of
C
K;1(), the solution (4.15) is one of the (at most)K
! vertices of the dominant face. Hence, as varies in RK+, the set ofRbN(;
) contains at mostK
! points. It is clear that the convex hull operation is needed here.From Theorem 2, by solving recursion (4.13) for
n
= 1, we see that the optimal solution isp
bk;1 =P
k for every and for every . Hence, from (4.16) withN
= 1, we have that bk;1 =k for allk
, i.e., the optimal solution forN
= 1 is constant power allocation. From (4.16), we also see thatbk;N =N
k;PN;1j=1
bk;j which means that, on the last available slot, all the remaining energy is used regardless of the fading value, which is sensible since the remaining energy cannot be used on the next frame.Due to the heavy notation of Theorem 2, it might not be so straightfor-ward to understand how to construct the optimal policy b and the role of the recursion (4.13). Hence, we give some explanation. To characterize the boundary surface of
C
K;N(), rst we need to solve the recursion (4.13) in or-der to determine the optimal valuesfp
bk;n(;;
P)8k
= 1;K
gforn
1.Then, for a given delay
N
, we built up the optimal power policy (4.16) by considering the \length-N
window" of optimal values fp
bk;n(;;
P) 8k
= 1;K
g forn
=N;N
;1;
;
1. On the rst slot (n
= 1), the optimal policy b1 = [b1;1;
;
bK;1] is derived from fp
bk;N(;;
P) 8k
= 1;K
g computed forequal to the actual fading values 1 = [1;1;
;
K;1] and for power P equal to the total available energiesN
= [N
1;
;N
K], i.e., for each userk
= 1;
;K
bk;1(S1;;
) =p
bk;N(1;N
;)On the second slot (
n
= 2), the optimal policy b2 = [b1;2;
;
bK;2] is derived fromfp
bk;N;1(;;
P)8k
= 1;K
g. Given the fading realization2 = [
1;2