• Aucun résultat trouvé

An Iterative Approach to Nonconvex QCQP with Applications in Signal Processing

N/A
N/A
Protected

Academic year: 2021

Partager "An Iterative Approach to Nonconvex QCQP with Applications in Signal Processing"

Copied!
5
0
0

Texte intégral

(1)

An Iterative Approach to Nonconvex QCQP with Applications in Signal Processing

Ahmad Gharanjik∗‡, Bhavani Shankar, Mojtaba Soltanalian, and Bj¨orn Ottersten∗‡

Signal Processing Department, KTH Royal Institute of Technology,SnT Centre, University of Luxembourg

Department of Electrical and Computer Engineering, University of Illinois at Chicago

Abstract—This paper introduces a new iterative approach to solve or to approximate the solutions of the nonconvex quadratically constrained quadratic programs (QCQP). First, this constrained problem is transformed to an unconstrained problem using a specialized penalty-based method. A tight upper- bound for the alternative unconstrained objective is introduced.

Then an efficient minimization approach to the alternative uncon- strained objective is proposed and further studied. The proposed approach involves power iterations and minimization of a convex scalar function in each iteration, which are computationally fast. The important design problem of multigroup multicast beamforming is formulated as a nonconvex QCQP and solved using the proposed method.

I. INTRODUCTION

Nonconvex QCQP is an important class of optimization problems that can be formulated as,

P: min.

x∈CN xHA0x

s. t. xHAix≤ci, ∀i∈[M], (1) where A0 and Ai are Hermitian matrices for all i ∈ [M], M denotes the number of quadratic constraints, and ci ∈R (Please see the footnote for the notations)1. Herein, we are interested in a subclass of nonconvex QCQP problems with convex objective and nonconvex constraints, A0 is a positive definite (PD) matrix and Ai are Hermitian matrices with at least one negative eigenvalue [1]. This class of nonconvex QCQP problems captures many problems that are of interest to the signal processing and communications community such as beamforming design [2]–[4], radar optimal code design [5]–

[8], multiple-input multiple-output (MIMO) and multiuser estimation and detection [9], as well as phase retrieval [10], [11]. The application is also extended to other domains such as portfolio risk management in financial engineering [12].

Nonconvex QCQP is known to be an NP-hard problem, i.e. at least as hard as NP-complete problems which are particularly deemed by optimization community to be difficult [13]. Due to its wide area of application, the nonconvex QCQP problem has been studied extensively in the optimization and signal processing literature. The NP-hardness of the problem has mo- tivated the search for various efficient approaches to solveP

1x(k)is thekthentry of the vectorx,kxkis thel2-norm ofx,XH is the complex conjugate of X, XT is the transpose of X, andTr(X)the trace ofX.vec(X)is the vector obtained by column-wise stacking ofX.

kXkF is the Frobenius norm of a matrixX,is the Kronecker product and diag(x)is the diagonal matrix formed by elements ofx. [M] denotes the set{1,2,· · ·, M}, andeiis the theithstandard basis vector.

including those based on the semidefinite relaxation (SDR) [9], [14], the reformulation linearization technique (RLT) [15], [16], and the successive convex approximation (SCA) [17]–

[19]. Recently a variant of SCA known as feasible point pursuit-successive convex approximation (FPP-SCA) has also been proposed in [20]. To the best of our knowledge, SDR is yet the most prominent and widely used technique employed for tackling nonconvex QCQP. Note that P in (1) includes several class of widely known optimization problems including binary quadratic programming (BQP) (where Ai = eieTi , x ∈ RN) and unimodular quadratic programming (UQP) (whereAi=eieTi ,x∈CN) [21].

In this paper, we propose a new iterative method to solve the nonconvex QCQP problem. We convert the nonconvex QCQP problem to an unconstrained problem in Section II.

The reformulated optimization problem is then decomposed to subproblems which can be solved either analytically or using extremely efficient optimization tools, discussed in Sec- tion III. In the following, we show that the important signal processing problem of multigroup multicast beamforming can be formulated as a nonconvex QCQP that requires solving P. The proposed formulation serves as a cornerstone to our numerical example in Section IV.

A. Application to Multigroup Multicast Beamforming Consider the general multigroup multicast beamforming problem [3] for a downlink channel, with a nTx-antenna transmitter and K single-antenna users assigned to G ≤ K multicast groups. We denote the subset of user indices in the kth group by Gk for anyk ∈[G]. Lethi ∈CnTx denote the channel between the transmit antennas and theith user. Also let wk ∈CnTx denote the beamforming vector corresponding to the kth group, k ∈ [G], multicast group of users. The beamformed vector to kth group takes the form wksk with E[|sk|2] = 1 where sk is the symbol to be transmitted. The beamforming vectors are to be designed in order to enhance the network performance. In particular, the SINR value for any user i∈ Gk (and anyk∈[G]) is given by [3],

SINRi= wkHRiwk

P

j∈[G]\{k}wHj Riwj

2i, (2)

where Ri = E{hihHi } is the covariance matrix of the ith channel, σ2i denotes the variance of the zero-mean additive white Gaussian noise (AWGN).

(2)

Consequently, the problem of minimizing total transmit power subject to constraints on user SINR performance in the network can be formulated as [2], [3],

min.

{wk}Gk=1

XG k=1

kwkk22

s. t. SINRi≥γi, i∈[K] (3) Note that by a specific reformulation, the SINR metric in (2) can be rewritten as a quadratic criterion. To see this, define the stacked beamforming vectorw ∈CN (withN =nTxG), Rbi andRei as,

w,vec([w1 w2 · · · wG]), (4) b

Ri,diag(ej)⊗Ri, ∀i∈[K], i∈ Gj (5) Rei,(IG−diag(ej))⊗Ri, ∀i∈[K], i∈ Gj (6) in which {Rbi} and {Rei} are PSD matrices. It can be easily verified that

SINRi = wHRbiw

wHReiw+σi2, ∀i∈[K]. (7) As a result, the SINR constraint in (3) can be rewritten as,

wHRbiw−γiwHReiw≥γiσ2i, (8) or equivalently aswHRiw≥1, whereRi is given by,

Ri= b

Ri−γiRei γiσi2

. (9) The beamforming design problem for minimizing total trans- mit power with SINR constraint can thus be formulated as,

min.

w kwk2, s. t. wHRiw≥1, ∀i∈[K]. (10) Note that this formulation may also be used to solve physical- layer multicasting and traditional multiuser transmit beam- forming problems; see [4] and [2] for details.

II. PROBLEMREFORMULATION

We begin our reformulation by rewritingP in an equivalent form. We can assume, without loss of generality, thatci 6= 0;

otherwise P will have a trivial solution of x = 0 or it will be infeasible. Since A0 is a PD matrix, using the change of parameters by Ai

A

1 2

0 AiA

1 2

0

/ci andx ← A

1 2

0x, the nonconvex QCQP of interest may be recast as,

P1: min.

x∈CN kxk2

s. t. xHAix⊳i1, ∀i∈[M], (11) withAi being Hermitian matrices. Here“⊳i”can represent any of “≥”,“≤” or“ = ”for each i.

Now, let us definex=√pu, where p∈R+ andu∈CN is a unit norm vector. Then, (11) can be written as,

P1: min.

u,p p

s. t. uHAiu⊳i 1

p, ∀i∈[M],

kuk2= 1. (12)

By introducing slack variables{ti}, we transform all inequal- ity constraints to equality constraints, viz.

uHAiu+ti= 1

p, ∀i∈[M], (13) whereti∈R. Therefore,P1 can be reformulated as,

P2: min.

u,p,{ti} p

s. t. uH(Ai+tiI)u= 1

p, ∀i∈[M], (14) kuk2= 1.

Any Hermitian matrix can be decomposed as a difference of two PSD matrices simply by partitioning the matrix into parts comprising only non-positive and non-negative eigenvalues. In particular, we consider,

Ai=A+i −Ai , Ai+,Ai 0, ∀i∈[M]. (15) We can also decomposeti as ti=t+i −ti ,∀i∈[M]where

t+i =

(ti if ti>0

0 if ti≤0, ti =

(0 if ti>0

|ti| if ti≤0. (16) Consequently, the constraint in P2 can be written as,

uH A+i +t+i I

u=uH Ai + (1/p+ti )I

u (17) For notational simplicity, we defineCi = Ai + (1/p+ti )I andBi = A+i +t+i I, where both matrices are PSD. Note that (17) holds if and only ifkB

1 2

iuk=kC

1 2

iuk. In particular, the left-hand side of (17) is close to the right-hand side of (17) if and only ifkB

1 2

iukis close tokC

1 2

iuk. Therefore, one can consider the following optimization problem as a penalized reformulation ofP2

P3: min.

u,p,{ti} p+η XM i=1

kB

1 2

iuk − kC

1 2

iuk2

s. t. kuk2= 1, ti∈R, ∀i∈[M], (18) in which η > 0 determines the weight of the penalty-term added to the original objective of P2; and whereP3 andP2 coincide as η → +∞. Note that optimizing P3 with respect to (w. r. t.)umay require rewritingP3as a quartic objective inu. To avoid this, we introduce another alternative objective:

P4: min.

u,p,{ti},{Qi} p+η XM i=1

kB

1 2

iu−QiC

1 2

i uk2 s. t. kQikF ≤1, ti∈R ∀i∈[M],

kuk2= 1. (19)

In contrast toP3, the optimization problemP4 w. r. t.ucan be easily cast as a problem of finding the largest eigenvalue of a PSD matrix—more on this later. To establish the equivalence of P3 and P4, observe that the minimizer Qi ofP4 should be a matrix with Frobenius norm less than or equal to1 that satisfies the following condition,

QiC

1 2

iu= kC

1 2

i uk kB

1 2

iuk B

1 2

iu. (20)

(3)

In this case, it will be straightforward to verify that, XM

i=1

kB

1 2

i u−QiC

1 2

iuk2= XM i=1

kB

1 2

i uk2− kC

1 2

iuk2

2

. (21) In Section III, we present an analytical approach for the derivation of{Qi}.

III. PROPOSEDOPTIMIZATIONFRAMEWORK

We now propose an efficient iterative optimization frame- work based on a separate optimization of the objective of P4 and P3 over its partitions of variablesu,{Qi}, p, and {ti}, at each iteration where the iterations can be initiated from any arbitrary setting.

A. Optimization w. r. t.u

Considerp,{Qi}and{ti}are fixed, then one can optimize P4 w. r. t.uvia minimizing the criterion:

XK i=1

kB

1 2

iu−QiC

1 2

i uk2=uHRu (22)

whereR=PK i=1

n(Bi+Ci)−(B

1 2

i QiC

1 2

i +C

1 2

i QHi B

1 2

i )o . Minimizing uHRu is equivalent to maximizing uH(−R)u.

In general, matrix −R is not PSD. However by diagonal loading (DL), one can make it PSD. Let us define diagonally loaded PD matrix Rb , −R+µI with µ > 0 being larger than the minimum eigenvalue of −R. Due to the fact that kuk2= 1, DL will not change the solution of the optimization problem since it only adds a constant to the objective function:

uHRub =−uHRu+µin whichµis constant. Consequently, one can minimize (or decrease monotonically) the criterion in (22) by maximizing (or increasing monotonically) the objective of the following optimization problem:

max.

kuk2=1 uHRub . (23) Problem (23) is very well-known in that its solution is given by the unit-norm eigenvector corresponding to the largest eigenvalue of R, which can be found efficiently using theb power method iterations [22].

B. Tightening the Upper-Bound: Optimization w. r. t. {Qi} Let wi = B

1 2

iu and vi = C

1 2

i u for notational sim- plicity. Then, the penalty term in P3 can be rewritten as PM

i=1(kwik − kvik)2. Using the following Lemma, we pro- vide an upper-bound for this penalty term.

Lemma 1. For any wi ∈ CN, vi ∈ CN and Qi ∈ CN×N withkQikF ≤1, we havekwik − kvik ≤ kwi−Qivik.

Due to space limitation, the proof of the Lemma 1 is not included in the paper. Considering above lemma, it is straightforward to verify that

XM i=1

(kwik − kvik)2≤ XM i=1

kwi−Qivik2. (24)

As mentioned in Section II, optimal{Qi}should satisfy (24) with equality. Hence, givenu,pand{ti}, we must have

kwik − kvik= min

kQikF≤1kwi−Qivik (25) Now, the question to be addressed is finding optimal {Qi}. The typical method to find Qi is to solve the optimization problem stated in (25). Interestingly, we show that in fact it is not necessary to numerically tackle such an optimization problem to find optimal{Qi}. Recall the optimality condition ofQi in (20), which may be written as,

Qivi=

kvik/kwik

wi. (26) Note that (26) can be recast as,

Qivi= wikvik2

kwikkvik = wivHi

kwikkvikvi. (27) Thus, the optimalQi=Qi ofP4 is immediately given by

Qi = wiviH

/(kwikkvik) . (28) It is straightforward to verify thatQi of (28) satisfies (25) and kQikF = 1. Note that givenu,pand{ti}, calculation ofQi is not demanding from a computational point of view.

C. Optimization w. r. t.p

Now, assume thatu,{Qi}and{ti}are given. Considering P3, the minimization w. r. t.pcan be handled by the following optimization problem:

min.p p+η XM i=1

αi− kC

1 2

iuk2

, (29)

whereαi=kB

1 2

i ukis given fori∈[M]. We recall form (17) thatCi = Ai + (1/p+ti )Iis a function ofp. Since Ai is a PSD matrix, it may be characterized by its eigen-value decompositionAi =ViΛiVHi whereVi is a unitary matrix and Λi is a diagonal matrix formed from the eigenvalues of Ai . As a result,C

1 2

i can be written as, C

1 2

i =Vi

Λi+ (1

p+ti )I 12

VHi . (30) Since multiplication with a unitary matrix does not change the ℓ2-norm, we have that

kC

1 2

iuk=

Λi+ (1

p+ti )I 12

VHi u

(31)

= XM k=1

|ai(k)|2

λλ

λi(k) +1 p+ti

!12

=

bi+1 p+ti

12

, where ai = ViHu, λλλi is a vector formed from diagonal elements of Λi (λλλi=diag(Λi)) or equivalently from the eigenvalues ofAi , andbi = PM

k=1|ai(k)|2λλλi(k). In (32),

(4)

we have used the fact that PM

k=1|ai(k)|2 =kaik2= 1. The objective function of (29) now can be expanded as

f(p) = (32)

p+η XM i=1

α2i +bi+1

p+ti −2αi

bi+1

p+ti 12!

. By looking over its second derivative off(p), one can readily observe thatf(p)is not convex. Instead, we consider a change of parameters byg(q) =f(1/q). Letqdenotes the optimalq that minimizeg(q). Clearly, one can conclude thatp= 1/q will minimize f(p). Therefore, in order to solve (29) it is sufficient to solve

min.

q>0 g(q) (33)

Now, let us have a deeper look atg(q). The second derivative ofg(q)is given by

g′′(q) = 2 q3

XM i=1

αi

2 bi+q+ti 32

. (34) Since q,αi, bi andη have positive values, we can conclude thatg′′(q)>0. This means thatg(q)is a convex function and we can use numerical methods like gradient descent algorithm to find the global optimumq. Corresponding optimal solution for (29) will be given by p = 1/q. Note that convexity of g(q)can also be concluded from definition ofg(q), as a sum of convex functions over q >0.

D. Optimization w. r. t. ti

Assuminguandpare known, the values of{ti}minimizing P3 andP4 can be calculated by using (13), that implies

ti= 1

p−uHAiu. (35)

However, it should be noted that at the optimal point, following conditions need to be satisfied for all i∈[M],





ti≥0 if “⊳i” = “≤” ti= 0 if “⊳i” = “ = ” ti≤0 if “⊳i” = “≥”

, (36)

otherwise it means that constraint in (12) is not satisfied and optimization problem P1 is not feasible. When the con- straints (36) is imposed, the optimal feasible solution in each iteration can be found by,

ti=

(ti if (36) is satisfied

0 if (36) is not satisfied (37) Let us denote the variables generate at iteration r of the optimization framework byxr= (ur, pr, tri). It can be shown that any limit point of the sequencexris an stationary point.

A proof is not provided herein due to lack of space.

IV. A NUMERICALEXAMPLE

In this section, a brief numerical example is provided to investigate the performance of the proposed method. To this end, we consider a multigroup multicast beamforming scenario with G = 3, nTx = 4 and K = 15 single-antenna users.

We assume γi = 1 for all users. The entries of the channel vectors hi are drawn from an i. i. d. complex Gaussian distribution with zero mean, with a variance set to 10. The Gaussian noise components received at each user antenna are assumed to have unit variance, i.e. σ2i = 1 for all i ∈ [K].

We stop the optimization iterations whenever the objective decrease becomes bounded by 10−5 or number of iterations goes beyond1000. Figure 1 shows the transition of objective function of P3 (equivalent to P4), with η = 10 in different iterations. It also shows the values ofpin different iterations.

It can be observed that objective function is monotonically decreasing. The difference betweenpand the objective ofP3

denotes the penalty term of P3. Since η = 10 the penalty term might not be exactly zero, therefore resulted SINR for users,γˆi, might be slightly less than targetedγi. In this case, one can readily find the feasible beamforming vector w by simply scaling it. The results leading to Figure 1 was obtained in 2.5 seconds on a standard PC, while SDR followed by a randomization step (with1000realizations) took3.5seconds.

Also our approach resulted inp= 1.22while SDR achieved pSDR = 1.37. Note that the lower bound for p achieved by SDR (corresponding to high-rank solution) waspLB = 1.16.

0 200 400 600 800 1000

0 1 2 3 4 5 6 7 8 9 10

iteration number

objective ofP3 p

Fig. 1. Transition of the objective function P3 and the parameter p vs.

iteration number. The weight of the penalty-term (η) is set to10.

V. CONCLUSION

An iterative approach is proposed to tackle the nonconvex QCQPs. Each iteration of the proposed method requires solv- ing a set of subproblems, which is accomplished by computa- tionally efficient steps. The multigroup multicast beamforming problem is formulated as a nonconvex QCQP and solved using the proposed method. Numerical results showed the proposed approach is computationally efficient and produces quality results.

(5)

REFERENCES

[1] P. M. Pardalos and S. A. Vavasis, “Quadratic programming with one negative eigenvalue is NP-hard,” Journal of Global Optimization, vol. 1, no. 1, pp. 15–22, 1991.

[2] M. Bengtsson and B. Ottersten, Optimal and Suboptimal Transmit Beamforming, ser. Electrical Engineering & Applied Signal Processing Series, L. Chand Godara, Ed. CRC Press, 2001.

[3] E. Karipidis, N. D. Sidiropoulos, and Z.-Q. Luo, “Quality of service and max-min fair transmit beamforming to multiple cochannel multicast groups,” IEEE Transactions on Signal Processing, vol. 56, no. 3, pp.

1268–1279, 2008.

[4] N. D. Sidiropoulos, T. N. Davidson, and Z.-Q. T. Luo, “Transmit beamforming for physical-layer multicasting,” IEEE Transactions on Signal Processing, vol. 54, no. 6, pp. 2239–2251, 2006.

[5] Y. Huang and D. P. Palomar, “Randomized algorithms for optimal solutions of double-sided qcqp with applications in signal processing,”

IEEE Transactions on Signal Processing, vol. 62, no. 5, pp. 1093–1108, 2014.

[6] A. De Maio, Y. Huang, M. Piezzo, S. Zhang, and A. Farina, “Design of optimized radar codes with a peak to average power ratio constraint,”

IEEE Transactions on Signal Processing, vol. 59, no. 6, pp. 2683–2697, 2011.

[7] A. Aubry, A. De Maio, M. Piezzo, M. M. Naghsh, M. Soltanalian, and P. Stoica, “Cognitive radar waveform design for spectral coexistence in signal-dependent interference,” in IEEE Radar Conference, 2014, pp.

0474–0478.

[8] P. Stoica, J. Li, and Y. Xie, “On probing signal design for MIMO radar,”

IEEE Transactions on Signal Processing, vol. 55, no. 8, pp. 4151–4161, 2007.

[9] Z.-Q. Luo, W.-K. Ma, A.-C. So, Y. Ye, and S. Zhang, “Semidefinite relaxation of quadratic optimization problems,” IEEE Signal Processing Magazine, vol. 27, no. 3, pp. 20–34, 2010.

[10] E. J. Candes, T. Strohmer, and V. Voroninski, “Phaselift: Exact and stable signal recovery from magnitude measurements via convex pro- gramming,” Communications on Pure and Applied Mathematics, vol. 66, no. 8, pp. 1241–1274, 2013.

[11] I. Waldspurger, A. d’Aspremont, and S. Mallat, “Phase recovery, maxcut and complex semidefinite programming,” Mathematical Programming, vol. 149, no. 1-2, pp. 47–81, 2015.

[12] M. U. Torun, A. N. Akansu, and M. Avellaneda, “Portfolio risk in multiple frequencies,” IEEE Signal Processing Magazine, vol. 28, no. 5, pp. 61–71, 2011.

[13] A. d’Aspremont and S. Boyd, “Relaxations and randomized methods for nonconvex QCQPs,” EE392o Class Notes, Stanford University.

[14] S. Boyd and L. Vandenberghe, Convex optimization. Cambridge university press, 2009.

[15] K. M. Anstreicher, “On convex relaxations for quadratically constrained quadratic programming,” Mathematical programming, vol. 136, no. 2, pp. 233–251, 2012.

[16] H. D. Sherali and W. P. Adams, A reformulation-linearization technique for solving discrete and continuous nonconvex problems. Springer Science & Business Media, 2013, vol. 31.

[17] A. Beck, A. Ben-Tal, and L. Tetruashvili, “A sequential parametric convex approximation method with applications to nonconvex truss topology design problems,” Journal of Global Optimization, vol. 47, no. 1, pp. 29–51, 2010.

[18] B. R. Marks and G. P. Wright, “Technical note- a general inner approx- imation algorithm for nonconvex mathematical programs,” Operations Research, vol. 26, no. 4, pp. 681–683, 1978.

[19] L.-N. Tran, M. F. Hanif, and M. Juntti, “A conic quadratic programming approach to physical layer multicasting for large-scale antenna arrays,”

IEEE Signal Processing Letters, vol. 21, no. 1, pp. 114–117, 2014.

[20] O. Mehanna, K. Huang, B. Gopalakrishnan, A. Konar, and N. Sidiropou- los, “Feasible point pursuit and successive approximation of non-convex qcqps,” IEEE Signal Processing Letters, vol. 22, no. 7, pp. 804–808, 2015.

[21] M. Soltanalian and P. Stoica, “Designing unimodular codes via quadratic optimization,” IEEE Transactions on Signal Processing, vol. 62, no. 5, pp. 1221–1234, March 2014.

[22] G. H. Golub and C. F. Van Loan, Matrix computations. JHU Press, 2012, vol. 3.

[23] M. Soltanalian, A. Gharanjik, B. Shankar, and B. Ottersten, “Grab- n-Pull: An optimization framework for fairness-achieving networks,”

accepted to IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, 2016.

[24] A. Gharanjik, B. Shankar M. R. R, P.-D. Arapoglou, M. Bengtsson, and B. Ottersten, “Robust precoding design for multibeam downlink satellite channel with phase uncertainty,” in IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, April, 2015, pp.

3083–3087.

Références

Documents relatifs

We prove that the rst eigenvector of the multigroup system in the periodicity cell controls the oscillatory behavior of the solu- tions, whereas the global trend is asymptotically

In Chapter 3, which contains the result of [17], the definition of a generalized notion of quasistatic evolution is given in terms of both Young measures and stochastic processes,

Otherwise, the Gaussian randomization process needs to be applied to get a feasible rank-one solution, which is then compared to the lower bound higher rank solution.. The results

A counterexample due to Stefan M¨ uller in [13] shows that in general, for quasiconvex nonconvex energy densities, the inequality W cell ≥ W hom can be strict.. More

Our goal is, on one hand, to write down explicit first and second order optimality conditions for general 2-dimensional shape optimization problems with convexity con- straint and,

On one hand, our results extends the result in [7] to the more general problem and, on the other hand, our result provides an alternative improvement of the result in [9].. Unlike

On s’intéresse à l’étude de ces équations le cas où les coefficients ne vérifient pas la condition de Lipschits ; précisément dans le cas où les coefficients sont

The Spectral Intrinsic Decomposition Method [1] is an adaptive decomposition technique with which any compli- cated signal can be decomposed into a definite number of high frequency