• Aucun résultat trouvé

The augmented Lagrangian method with full Jacobian decomposition and logarithmic-quadratic proximal regularization for multiple-block separable convex programming

N/A
N/A
Protected

Academic year: 2022

Partager "The augmented Lagrangian method with full Jacobian decomposition and logarithmic-quadratic proximal regularization for multiple-block separable convex programming"

Copied!
41
0
0

Texte intégral

(1)

SMAI-JCM

SMAI Journal of

Computational Mathematics

The augmented Lagrangian method with full Jacobian decomposition and logarithmic-quadratic proximal

regularization for multiple-block separable convex programming

Min Li & Xiaoming Yuan Volume 4 (2018), p. 81-120.

<http://smai-jcm.cedram.org/item?id=SMAI-JCM_2018__4__81_0>

© Société de Mathématiques Appliquées et Industrielles, 2018 Certains droits réservés.

cedram

Article mis en ligne dans le cadre du

Centre de diffusion des revues académiques de mathématiques http://www.cedram.org/

(2)

Vol. 4, 81-120 (2018)

The augmented Lagrangian method with full Jacobian

decomposition and logarithmic-quadratic proximal regularization for multiple-block separable convex programming

Min Li1 Xiaoming Yuan2

1School of Management and Engineering, Nanjing University, China E-mail address: [email protected]

2Department of Mathematics, The University of Hong Kong, Hong Kong, China E-mail address: [email protected].

Abstract. We consider a separable convex minimization model whose variables are coupled by linear constraints and they are subject to the positive orthant constraints, and its objective function is in form of mfunctions without coupled variables. It is well recognized that when the augmented Lagrangian method (ALM) is applied to solve some concrete applications, the resulting subproblem at each iteration should be decomposed to generate solvable subproblems. When the Gauss-Seidel decomposition is implemented, this idea has inspired the alternating direction method of multiplier (form= 2) and its variants (form3). When the Jacobian decomposition is considered, it has been shown that the ALM with Jacobian decomposition in its subproblem is not necessarily convergent even when m = 2 and it was suggested to regularize the decomposed subproblems with quadratic proximal terms to ensure the convergence. In this paper, we focus on the multiple-block case with m 3. We consider implementing the full Jacobian decomposition to ALM’s subproblems and using the logarithmic-quadratic proximal (LQP) terms to regularize the decomposed subproblems. The resulting subproblems are all unconstrained minimization problems because the positive orthant constraints are all inactive; and they are fully eligible for parallel computation. Accordingly, the ALM with full Jacobian decomposition and LQP regularization is proposed. We also consider its inexact version which allows the subproblems to be solved inexactly. For both the exact and inexact versions, we comprehensively discuss their convergence, including their global convergence, worst-case convergence rates measured by the iteration-complexity in both the ergodic and nonergodic senses, and linear convergence rates under additional assumptions. Some preliminary numerical results are reported to demonstrate the efficiency of the ALM with full Jacobian decomposition and LQP regularization.

2010 Mathematics Subject Classification. 90C25, 90C33, 65K05.

Keywords. convex programming, splitting methods, augmented Lagrangian method, logarithmic-quadratic proximal, parallel computation, convergence rate.

1. Introduction

We consider the following separable convex minimization problem whose variables are coupled by linear constraints and they are subject to the positive orthant constraints, and its objective function is the sum of more than one function without coupled variables:

minn

m

X

i=1

θi(xi)

m

X

i=1

Aixi=b, xi ∈ <n+i, i= 1, . . . , mo, (1.1) where θi :<ni → <(i= 1, . . . , m) are convex but not necessarily smooth functions; Ai ∈ <l×ni and b∈ <l. The solution set of (1.1) is assumed to be nonempty throughout our discussions.

The first author was supported by the National Natural Science Foundation of China under grant 11001053 and Program for New Century Excellent Talents in University under grant NCET-12-0111.

The second author was supported by the General Research Fund from Hong Kong Research Grants Council: 12313516.

(3)

Letλ∈ <l be the Lagrange multiplier associated with the linear equality constraints in (1.1) and the Lagrangian function of (1.1) be

L(x1, . . . , xm, λ) :=

m

X

i=1

θi(xi)−λT

m

X

i=1

Aixib, (1.2)

defined on Ω :=<n+1× · · · × <n+m× <l. Then, the augmented Lagrangian function of (1.1) is Lβ(x1, . . . , xm, λ) :=

m

X

i=1

θi(xi)−λT

m

X

i=1

Aixib+ β 2

m

X

i=1

Aixib2, (1.3)

where β > 0 is a penalty parameter. If we treat the model (1.1) as a whole and apply directly the augmented Lagrangian method (ALM) in [21, 41], then the resulting scheme is

(xk+11 , . . . , xk+1m ) := argminLβ(x1, . . . , xm, λk)xi ∈ <n+i, i= 1, . . . , m , λk+1:=λkβ(Pmi=1Aixk+1ib).

(1.4)

In general, the minimization subproblem in (1.4) is hard because it requires minimizingm functions with variables coupled by the quadratic term in (1.3). This difficulty has inspired a series of splitting methods whose common idea is decomposing the subproblem in (1.4) and thus generating easier subproblems. For example, for the special case of (1.1) withm = 2, if the minimization subproblem in (1.4) is decomposed in Gauss-Seidel order, the scheme is

xk+11 := argminLβ(x1, xk2, λk)x1 ∈ <n+1 , xk+12 := argminLβ(xk+11 , x2, λk)x2 ∈ <n+2 , λk+1 :=λkβ(A1xk+11 +A2xk+12b).

(1.5)

This is the so-called alternating direction method of multiplier (ADMM) in [17] and it has found many efficient applications in a broad spectrum of application domains such as image processing, statistical learning, computer vision, network optimization, and so on. We refer to [5, 13, 16] for some review papers on the ADMM. If we consider directly extending the scheme (1.5) to the generic case of (1.1) withm≥3, then the resulting direct extension of ADMM reads as

xk+11 := argminLβ(x1, xk2, . . . , xkm, λk)x1 ∈ <n+1 ,

· · · ·

xk+1i := argminLβ(xk+11 , . . . , xk+1i−1, xi, xki+1, . . . , xkm, λk)xi∈ <n+i ,

· · · ·

xk+1m := argminLβ(xk+11 , . . . , xk+1m−1, xm, λk)xm ∈ <n+m , λk+1 :=λkβ(Pmi=1Aixk+1ib).

(1.6)

The direct extension of ADMM scheme (1.6) indeed works empirically for some applications, as shown in, e.g., [40, 43]. However, it was shown in [6] that the scheme (1.6) is not necessarily convergent. The convergence rate of ADMM and its extension are analysed in [9, 28, 30, 33, 32].

(4)

On the other hand, if we consider implementing the Jacobian decomposition to the ALM subproblem in (1.4), the resulting scheme reads as

xk+11 := argminLβ(x1, xk2, . . . , xkm, λk)x1 ∈ <n+1 ,

· · · ·

xk+1i := argminLβ(xk1, . . . , xki−1, xi, xki, . . . , xkm, λk)xi∈ <n+i ,

· · · ·

xk+1m := argminLβ(xk1, . . . , xkm−1, xm, λk)xm ∈ <n+m , λk+1 :=λkβ(Pmi=1Aixk+1ib).

(1.7)

The xi-subproblems in (1.7), which usually dominate the computation time at each iteration, can be solved in parallel; and they can be implemented in a distributed-computing system. This feature is of particular interest for the big-data scenario and the circumstances where parallel computing infrastructures are available. Note that the subproblems in (1.7) are of the same level of difficulty as those in (1.6) — each of them requires minimizing oneθi in the original objective of (1.1) plus a quadratic term with the positive orthant constraint<n+i. The scheme (1.7), however, is not necessarily convergent even whenm= 2, as shown in [22]. In the literature, it was suggested to correct the output of (1.7) by some correction steps to ensure the convergence; some prediction-correction methods based on the Jacobian decomposition of ALM (1.7) were thus presented in the literature, see, e.g., [20, 22].

Note that these prediction-correction methods usually converge fast for some applications arising in image processing and other areas. But their correction steps need the solutions of thexi-subproblems in (1.7) as the input and thus they are of less degrees of parallel computation. In [25, 10], it was proved that the convergence is ensured if the subproblems in (1.7) are regularized by quadratic proximal terms with sufficiently large proximal coefficients. For example, it was analyzed in [25] that the following scheme

xk+11 := argminLβ(x1, xk2, . . . , xkm, λk) + 2 kA1(x1xk1)k2 x1 ∈ <n+1 ,

· · · ·

xk+1i := argminLβ(xk1, . . . , xki−1, xi, xki, . . . , xkm, λk) +2kAi(xixki)k2 xi∈ <n+i ,

· · · ·

xk+1m := argminLβ(xk1, . . . , xkm−1, xm, λk) +2 kAm(xmxkm)k2 xm∈ <n+m , λk+1 :=λkβ(Pmi=1Aixk+1ib).

(1.8)

is convergent as long as the proximal coefficient sm−1. A more general analysis can be found in [10]. The scheme (1.8) requires no correction step because of the regularization of the quadratic terms 2kAi(xixki)k2 fori= 1, . . . , m; and the eligibility for parallel computation is remained. The iteration-complexity of a Jacobi-type non-Euclidean proximal ADMM for solving multi-block linearly constrained nonconvex programs is established in [35].

Note that thexi-subproblems in (1.8) are constrained minimization problems subject to the posi- tive orthants. To further simplify these subproblems, we can apply the logarithmic-quadratic proximal terms, which firstly appeared in [3], to the subproblems in (1.7). The key point is that the LQP regularization automatically excludes the points on the boundaries of the constraints in the feasible regions; thus the positive orthant constraints in (1.7) all become inactive and the decomposed sub- problems in (1.7) with the LQP regularization are all unconstrained. In the literature, the research on the combination of the LQP regularization with ALM-based splitting methods focuses only on the special case of (1.1) with m = 2 and mainly on the Gauss-Seidel decomposition. For instance, the combination of the LQP regularization with the ADMM scheme (1.5) in [46, 1] is in the variational inequality context. We also refer to [29] and [31] for the combination of the LQP with the generalized ADMM proposed in [12] and the strictly contractive Peaceman-Rachford splitting method proposed

(5)

in [23], respectively. Some other interesting applications of LQP can be found in, e.g., [2]. Finally, it is referred to [31, 44, 7] for the convergence rate analysis for the mentioned methods.

Our first purpose is proposing the scheme of ALM with Jacobian decomposition and LQP regu- larization for the multiple-block convex minimization model (1.1) with m ≥ 3, see (3.2) for detail.

Both the exact and inexact versions will be proposed. The inexact version allows the decomposed subproblems to be solved inexactly subject to certain inexactness criterion. To the best of our knowl- edge, it is the first work of combining the LQP regularization with the Jacobian decomposition of the ALM (1.7) for the generic case of (1.1) withm≥3. Note that using the LQP regularization, instead of the quadratic proximal terms, is particularly useful for the case where the functionsθi’s are generic and the subproblems in (1.7) are not simple enough to have closed-form solutions and thus the con- strained subproblems in (1.7) need to be solved iteratively by a certain algorithm. In other words, the new scheme mainly differs from (1.7) in that only unconstrained subproblems are required to solve.

The second purpose of this paper is comprehensively analyzing the convergence for both the exact and inexact versions of the new scheme. More specifically, we discuss their convergence, including the global convergence, the worst-case convergence rate measured by the iteration-complexity in both the ergodic and nonergodic senses, and the linear convergence rates under additional assumptions.

The rest of this paper is organized as follows. In Section 2, we summarize some useful results and introduce some notation for further analysis. Then, we present the exact version of the ALM with full Jacobian decomposition and LQP regularization in Section 3, followed by some remarks. The convergence of the exact version of this new scheme is proved in Section 4. Then, we establish its convergence rate in Section 5. In Section 6, we present the inexact version of the new scheme, and analyze its convergence in Section 7. In Section 8, we report some preliminary numerical results to show the efficiency of the new scheme. Finally, we make some conclusions in Section 9.

2. Preliminaries

We first summarize some useful preliminaries known in the literature and introduce some notations to be used in the analysis. Some simple conclusions are also proved in this section.

2.1. The Logarithmic-quadratic Proximal Regularization

We first review the LQP regularization. More details are provided in [3]. Let us define ϕ(c) :=

(1

2(c−1)2+µ(c−logc−1) ifc >0,

+∞ otherwise, (2.1)

for a given scalar µ∈(0,1). Associated with ϕ, for anyz∈ <N++, we define d(z0, z) :=

PN

j=1

1

2(zj0zj)2+µ(zj2logzzj0 j

+zj0zjzj2) ifz0 ∈ <N++,

+∞ otherwise, (2.2)

and

Φ0(z, z0) := (z1ϕ0(z10/z1),· · ·, zNϕ0(zN0 /zN))T ∀z, z0 ∈ <N++, (2.3) where

ϕ0(zj0/zj) =zj0/zj −1 +µ(1zj/z0j), j= 1,· · · , N. (2.4) For any z0, z ∈ <N++, we have d(z0, z) ≥ kz0zk2/2 and d(z0, z) = 0 if and only if z0 =z. Moreover, the functiond(·,·) defined in (2.2) can be rewritten as

d(z0, z) =

N

X

j=1

zj2ϕ(zj0/zj) ∀z0, z∈ <N++,

(6)

and then we have

Φ0(z, z0) =∇z0d(z0, z) = (z0z) +µ[zZ2(z0)−1],

whereZ := diag(z1, z2, . . . , zN)∈ <N×N, (z0)−1∈ <N is a vector whose j-th element is 1/zj0.

The following lemma was proved in [46] and it was inspired by Proposition 1 in [3]. We need this lemma to analyze the convergence for the new algorithms.

Lemma 2.1. Let P := diag(p1, . . . , pN)∈ <N×N be a positive definite diagonal matrix, q(z)∈ <N be a monotone mapping ofzwith respect to<N+, and ϑ:<N → <. Letµ∈(0,1)be a constant. For given z, z¯ ∈ <N++, we define Z¯ := diag(¯z1, . . . ,z¯N), z−1 := (1/z1, . . . ,1/zN)T and

Φ0z, z) = (zz) +¯ µ(¯zZ¯2z−1).

Then, the variational inequality

ϑ(z0)−ϑ(z) + (z0z)T[q(z) +PΦ0z, z)]≥0 ∀z0∈ <N+, (2.5) has the unique positive solution z. In addition, for this positive solution z ∈ <N++ and any z0 ∈ <N+, we have

ϑ(z)ϑ(z0) + (z−z0)T[q(z) + (1 +µ)P(zz)]¯ ≤µk¯zzk2P, (2.6) where kzk2P :=zTP z.

2.2. Variational Reformulation of (1.1)

In our analysis, we need a variational reformulation of the convex minimization model (1.1). More specifically, let (x1, . . . , xm, λ) be a saddle point of the Lagrange function (1.2). Then, for any (x1, . . . , xm, λ)∈ <n+1 × · · · × <n+m× <l, we have the inequalities

L(x1, . . . , xm, λ) ≤ L(x1, . . . , xm, λ) ≤ L(x1, . . . , xm, λ). (2.7) Setting (x1, . . . , xi−1, xi, xi+1,· · · , xm, λ) = (x1, . . . , xi−1, xi, xi+1, . . . , xm, λ) in the second inequality of (2.7) fori= 1,· · ·, m, we get

xi ∈ <n+i θi(xi)−θi(xi) + (xixi)T(−ATi λ)≥0 ∀xi ∈ <n+i, i= 1, . . . , m.

On the other hand, the first inequality in (2.7) means λ∈ <l (λ−λ)T

m

X

i=1

Aixib≥0 ∀λ∈ <l.

Recall that Ω =<n+1× · · · × <n+m× <l. Thus, finding a saddle point of L(x1, . . . , xm, λ) is equivalent to finding a vectorw = (x1, . . . , xm, λ)∈Ω such that

θ1(x1)−θ1(x1) + (x1x1)T(−AT1λ)≥0 ∀x1 ∈ <n+1,

· · ·

θi(xi)−θi(xi) + (xixi)T(−ATi λ)≥0 ∀xi ∈ <n+i,

· · ·

θm(xm)−θm(xm) + (xmxm)T(−ATmλ)≥0 ∀xm∈ <n+m, (λ−λ)TPmi=1Aixib≥0 ∀λ∈ <l.

(2.8)

We can rewrite (2.8) in a compact way: solving (1.1) is equivalent to findingw = (x1, . . . , xm, λ)∈ Ω :=<n+1 × · · · × <n+m× <l such that

VI(Ω, F, θ) : θ(x)θ(x) + (w−w)TF(w)≥0 ∀w∈Ω, (2.9a)

(7)

where

x=

x1

... xm

, θ(x) =

m

X

i=1

θi(xi), (2.9b)

w=

x1

... xm

λ

and F(w) =

−AT1λ ...

−ATmλ Pm

i=1Aixib

. (2.9c)

Because the mapping F(w) defined in (2.9c) is affine with a skew-symmetric matrix, it is monotone.

We denote by Ω the solution set of VI(Ω, F, θ), and it is not nonempty under the non-emptiness assumption of the solution set of (1.1).

Then, we recall the characterization of the solution set Ω whose proof can be found in [14, 26]:

:= \

w∈Ω

w˜∈Ω|θ(x)θ(˜x) + (ww)˜ TF(w)≥0 . (2.10) With the characterization (2.10) and following Definition 1 in [38], we define an ε-approximation solution of VI(Ω, F, θ) as follows.

Definition 2.2. The vector ˜w∈Ω is called an ε-approximation solution of VI(Ω, F, θ) if it satisfies sup

w∈B( ˜w)

θ(˜x)θ(x) + ( ˜ww)TF(w) ≤ε, (2.11) where

B( ˜w) :=w∈Ω| kw−wk ≤˜ 1 .

Based on this definition, for an algorithm, if aftert iterations, we can find ˜w∈Ω such that θ(˜x)θ(x) + ( ˜ww)TF(w)≤ε ∀w∈ B( ˜w),

with ε= O(1/t), then we say this algorithm has a worst-case O(1/t) convergence rate measured by the iteration complexity. See, e.g., [36, 37] for more details.

The following lemma is useful for establishing a worst-caseo(1/t) convergence rate in Sections 5.3 and 7.4. It is similar as Lemma 1.2 in [10].

Lemma 2.3. If a sequence {at} ⊆ < obeys: (1) at ≥0; (2) Pt=0at<+∞; (3) atat−1+σt−1 for any integer t≥1, where the sequence{σt} satisfiesPt=1t<+∞ with σt≥0 for any integer t≥0, then we have at=o(1/t).

Proof. Since atat−1+σt−1, we get atak+

t−1

X

j=k

σj ∀k≤t−1.

By assumptions (1)-(3) in this lemma, we have 0≤ t

2 ·at

t

X

k=b2tc+1

ak+

t−1

X

k=bt2c+1 t−1

X

j=k

σj

t

X

k=bt

2c+1

ak+

t−1

X

k=bt

2c+1

k→0, ast→ ∞. Therefore, we getat=o(1/t). The proof is complete.

(8)

2.3. Some Notations

With the given positive scalarsβ and µ∈(0,1), we define the scalars ri > (m−1)β

1−µ λmax(ATi Ai) ∀i= 1, . . . , m,

whereAi is the coefficient matrix given in the model (1.1). We also define the matricesG,H,M,Nx

and N as following:

G:=

(1 +µ)r1In1 · · · −βAT1Am 0

... . .. ... ...

−βATmA1 · · · (1 +µ)rmInm 0 0 · · · 0 β1Il

, (2.12)

H :=

(1 +µ)r1In1 · · · −βAT1Am 0

... . .. ... ...

−βATmA1 · · · (1 +µ)rmInm 0 0 · · · 0 γβ1 Il

, (2.13)

M := diag(In1, . . . Inm, γIl), Nx :=µ·diag(r1In1, . . . , rmInm) (2.14) and

N := diag(Nx,0), (2.15)

whereγ ∈(0,2).

Below we prove three assertions regarding the matrices just defined. These assertions make it pos- sible to present our convergence analysis for the new algorithms compactly with alleviated notation.

Lemma 2.4. Let β >0; µ∈(0,1); γ ∈(0,2) and ri >(m−1)βλmax(ATi Ai)/(1−µ), i= 1, . . . , m.

The matrices G, H, M and N defined respectively in (2.12)-(2.15) have the following relationships:

HM =G, H˜ :=GT +GMTHM−2N 0 and H 0. (2.16) Proof. Using the definitions of the matrices H,M and G, by a simple manipulation, we obtain

HM =

(1 +µ)r1In1 · · · −βAT1Am 0

... . .. ... ...

−βATmA1 · · · (1 +µ)rmInm 0 0 · · · 0 γβ1 Il

In1 · · · 0 0 ... . .. ... ... 0 · · · Inm 0 0 · · · 0 γIl

=

(1 +µ)r1In1 · · · −βAT1Am 0

... . .. ... ...

−βATmA1 · · · (1 +µ)rmInm 0 0 · · · 0 β1Il

=G.

The first assertion HM =Gis proved.

(9)

Consequently, we get

MTHM = MTG

=

In1 · · · 0 0 ... . .. ... ... 0 · · · Inm 0 0 · · · 0 γIl

(1 +µ)r1In1 · · · −βAT1Am 0

... . .. ... ...

−βATmA1 · · · (1 +µ)rmInm 0 0 · · · 0 1βIl

=

(1 +µ)r1In1 · · · −βAT1Am 0

... . .. ... ...

−βATmA1 · · · (1 +µ)rmInm 0 0 · · · 0 γβIl

.

Using (2.12)-(2.15) and the above equation, we have H˜ = GT +GMTHM−2N

=

(1−µ)r1In1 · · · −βAT1Am 0

... . .. ... ...

−βATmA1 · · · (1−µ)rmInm 0 0 · · · 0 2−γβ Il

=

(m−1)βAT1A1 · · · −βAT1Am 0

... . .. ... ...

−βATmA1 · · · (m−1)βATmAm 0

0 · · · 0 0

+

(1−µ)r1In1−(m−1)βAT1A1 · · · 0 0

... . .. ... ...

0 · · · (1−µ)rmInm−(m−1)βATmAm 0

0 · · · 0 2−γβ Il

= PT

(m−1)Il · · · −Il 0 ... . .. ... ...

−Il · · · (m−1)Il 0

0 · · · 0 0

P

+

(1−µ)r1In1 −(m−1)βAT1A1 · · · 0 0

... . .. ... ...

0 · · · (1−µ)rmInm−(m−1)βATmAm 0

0 · · · 0 2−γβ Il

,(2.17)

with

P = diag(pβA1, . . . ,pβAm, Il).

Therefore, the matrix ˜H is positive definite ifβ >0; µ∈(0,1); γ ∈(0,2) and ri > (m−1)β

1−µ λmax(ATi Ai) ∀i= 1, . . . , m.

The second assertion is proved.

(10)

Similar as (2.17), the matrixH can be written as H = PT

(m−1)Il · · · −Il 0 ... . .. ... ...

−Il · · · (m−1)Il 0

0 · · · 0 0

P

+

(1 +µ)r1In1−(m−1)βAT1A1 · · · 0 0

... . .. ... ...

0 · · · (1 +µ)rmInm−(m−1)βATmAm 0

0 · · · 0 γβ1 Il

.

Therefore the matrixH is also positive definite. The proof is complete.

3. The ALM with Full Jacobian Decomposition and LQP Regularization — Exact Version

Now, we present the exact version of the ALM with full Jacobian decomposition and LQP regulariza- tion for solving the model (1.1). Some remarks will also be proved. Based on our previous introduction and motivation, the ALM with full Jacobian decomposition and LQP regularization can be summa- rized as follows:

xk+11 := argminLβ(x1, xk2, . . . , xkm, λk) +r1d(x1, xk1)x1 ∈ <n+1 ,

· · ·

xk+1i := argminLβ(xk1, . . . , xki−1, xi, xki+1, . . . , xkm, λk) +rid(xi, xki)xi ∈ <n+i ,

· · ·

xk+1m := argminLβ(xk1, . . . , xkm−1, xm, λk) +rmd(xm, xkm)xm ∈ <n+m , λk+1:=λkγβ(Pmj=1Ajxk+1jb),

(3.1)

where the LQP regularizerd(·,·) is defined in (2.2), β >0, µ∈(0,1), γ ∈(0,2) and ri > (m−1)β

1−µ λmax(ATi Ai) ∀i= 1, . . . , m.

Recall the analysis in [3]. The LQP regularization terms rid(xi, xki) force the solution of the xi- subproblem in (3.1) to stay strictly in the interior of <n+i. Hence, the constraints <n+i are not active and the iterative scheme (3.1) can be further specified as

xk+11 := argminLβ(x1, xk2, . . . , xkm, λk) +r1d(x1, xk1) ,

· · ·

xk+1i := argminLβ(xk1, . . . , xki−1, xi, xki+1, . . . , xkm, λk) +rid(xi, xki) ,

· · ·

xk+1m := argminLβ(xk1, . . . , xkm−1, xm, λk) +rmd(xm, xkm) , λk+1 :=λkγβ(Pmj=1Ajxk+1jb),

(3.2)

in which only munconstrained minimization subproblems are involved.

Algorithm 1.

Step 0.Let ε >0, β >0, µ∈(0,1), γ ∈(0,2)and ri>(m−1)βλmax(ATi Ai)/(1−µ), i= 1, . . . , m.

Choose (x01, . . . , x0m, λ0)∈ <n++1 × · · · × <n++m × <l. Set k:= 0.

Références

Documents relatifs

Abstract — The problem ofminimizing a quadratic form over a bail centered at the origin is considered The stabihty ofLagrangian duality is established and complete characterizations

The first phase con- sists in reformulating the initial problem either into a compact mixed integer linear program or into a 0–1 quadratic convex program.. The second phase

THEOREM 5: If the penalized primai problems have optimal solutions, then the proximal point method applied to the dual problem is equivalent to the generalized augmenîed

Key-words: Augmented Lagrangian algorithm, augmentation parameter update, closest fea- sible problem, convex quadratic optimization, feasible shift, global linear

Within the ALM framework, [17] perform a single proximal gradient update to primal variable at each iteration and establish O(ε −1 ) complexity result to have an ε-optimal solution

Apparently, the problems obtained in this way are more complicated than the initial ones, however they have the advantage that they can be solved using the geometric

Woo, G., Namm, R.V., Sachkoff, S.A.: An iterative method based on a modified La- grangian functional for finding a saddle point in the semicoercive Signorini

Convex programming, Operator splitting methods, Alternating direction method of multipli- ers, proximal point algorithm, Douglas-Rachford splitting method, Peaceman-Rachford