Scheduling under Arborescence Precedence

(1)

Scheduling under Arborescence Precedence

2

Constraints

3

Nguyễn Kim Thắng

4

IBISC, Univ Evry, University Paris Saclay

5

Evry, France

6

[email protected]

7

Abstract

8

We consider a scheduling problem on unrelated machines with precedence constraints. There arem

9

unrelated machines andnjobs and every job has to be processed non-preemptively in some machine.

10

Moreover, jobs have precedence constraints; specifically, a precedence constraintj≺j⁰ requires that

11

jobj⁰ can only be started whenever jobjhas been completed. The objective is to minimize the

12

total completion time.

13

The problem has been widely studied in more restricted machine environments such as identical

14

or related machines. However, for unrelated machines, much less is known. In the paper, we

15

study the problem where the precedence constraints form a forest of arborescences. We present

16

aO (logn)²/(log logn)³

-approximation algorithm — that improves the best-known guarantee of

17

O (logn)²/log logn

due to Kumar et al. [12] a decade ago. The analysis relies on a dual-fitting

18

method in analyzing the Lagrangian function of non-convex programs.

19

2012 ACM Subject Classification Theory of Computation→Approximation Algorithms Analysis

20

Keywords and phrases Scheduling, Precedence Constraints, Lagrangian Duality

21

Digital Object Identifier 10.4230/LIPIcs.MFCS.2020.73

22

Funding Research supported by the ANR project OATA n^o ANR-15-CE40-0015-01.

23

1 Introduction

24

In this paper, we consider a classic scheduling problem on unrelated machines with precedence

25

constraints. There aremunrelated machines andnjobs. Each jobj has a processing time

26

p_ijif it is processed on machinei. A job must be executed non-preemptively in some machine

27

i(i.e., in an interval of length pij in machinei). Jobs haveprecedence constraintswhich are

28

represented by a partial order≺. Specifically, a dependence constraintj≺j⁰ requires that

29

jobj⁰ can only be started whenever jobj has been completed. Hence, we need to assign jobs

30

to machines and process them in some order consistent with the precedence constraints. The

31

objective is to minimize the total completion time, i.e.,P

jC_j whereC_j is the completion

32

time of jobj. In the standard three field notion, the problem is denoted asR|prec|P

jCj.

33

The weighted version of this problem is a similar one where additionally jobs have

34

weights and the objective is to minimize the total weighted completion time, denoted as

35

R|prec|P

jwjCj. Little is known for both problemsR|prec|P

jCj andR|prec|P

jwjCj in

36

the unrelated machine environments. However, the problem has been widely considered

37

in more restricted machine environments such as identical parallel machines or related

38

parallel machines. The problemP|prec|P

jw_jC_j corresponding to the setting of identical

39

machines (pij=pj ∀i) has been extensively studied. Many algorithms and techniques have

40

been designed for the latter over decades [13, 9, 4, 16, 6, 10, 5, 2, 19, 18]. The problem

41

P|prec|P

jwjCj has been revived with significant progresses recently. Li [15] provided a

42

(2 + 2 ln 2 +)-approximation by a subtle rounding based on a time-index LP. Later on, Garg

43

et al. [8] gave a (2 +)-approximation algorithm when the number of machines is a constant.

44

licensed under Creative Commons License CC-BY

(2)

Their result relies on a lift and project approach developed by Levey and Rothvoss [14] and

45

Garg [7]. This approximation ratio matches to the lower bound of 2 proved by Bansal and

46

Khot [2] assuming a variant of the Unique Game Conjecture.

47

In the more general setting of related machines (in whichpij =pj/siwheresiis the speed

48

of machinei), the corresponding problem Q|prec|P

jw_jC_j does not admit any constant

49

approximation assuming a (stronger) variant of the Unique Game Conjecture [3]. On the

50

positive side, Chudak and Shmoys [6] showed anO(logm)-approximation algorithm. This

51

approximation ratio remained the best known upper bound until recently Li [15] gave an

52

improvedO(logm/log logm)-approximation algorithm.

53

Despite progress in more restricted machine environments, there is still a large gap in

54

the understanding of the problemsR|prec|P

jC_j andR|prec|P

jw_jC_j. When the preced-

55

ence constraints are a collection of node-disjoint chains, the problems become the job shop

56

scheduling problems [17, 11] — again a classic problem with a long history. A particular

57

interesting case of the problem R|prec|P

jwjCj is the setting where the precedence con-

58

straints form a forest (i.e., the underlying undirected graph of the constraints is a forest),

59

denoted asR|f orest|P

jw_jC_j. This problem is motivated by several applications such as

60

evaluating large expression-trees and tree-shaped parallel processes. Kumar et al. [12] gave

61

anO log³n/(log logn)²

-approximation algorithm forR|f orest|P

jw_jC_j. When the forests

62

are out-trees or in-trees, the approximation ratio can be improved toO log²n/log logn .

63

It has remained the best-known result for a decade until now in both unweighted job and

64

weighted job settings.

65

1.1 Our contribution and approach

66

We study the special setting ofR|prec|P

jC_j where the precedence constraints form a forest

67

ofarborescences/out-trees. (An arborescence/out-tree is a directed acyclic graph where the

68

in-degree of every vertex is at most 1.) We denote the problem byR|arborescences|P

jC_j.

69

The main result of the paper is the following.

70 71

Theorem. There exists anO (logn)²/(log logn)³

-approximation algorithm for the problem

72

R|arborescences|P

jC_j where nis the number of jobs.

73

Approach. In our approach, instead of directly dealing with the problem

74

R|arborescences|P

jCj, we consider first a related problem in the speed-scaling model.

75

In the latter, machines can execute jobs with different speeds and that consumes energy. The

76

objective of the new problem is to minimize the total completion time plus energy (under

77

the same precedence constraints). Intuitively, this problem can be considered as a smooth

78

and relaxed version of the original problem where the energy plays the role of a regularizer.

79

More specifically, in the original problem, at any time every machine either executes some

80

job or do not execute any job; these cases correspond to the speed of 1 or 0, respectively. In

81

the related problem, one is allowed to choose an arbitrary (non-negative) speed. Moreover,

82

the role of the energy function is to prevent the speed from being chosen too high or too low

83

— both situations would lead to a large approximation ratio when converting a solution of the

84

related speed-scaling problem to that of the original one. (Low speed results in a large total

85

completion time whereas high speed yields a large factor in order to convert that speed to 0-1

86

speed.) Finally, given a solution for the problem of minimizing the total completion time plus

87

energy, we show that one can transform that solution to a feasible schedule of the problem

88

R|arborescences|P

jCj with some reasonable loss factor depending on the energy function.

89

(3)

In the paper, we choose the energy function of the formz^αwhereα= Θ(logn/log logn) in

90

order to minime the loss.

91

Following the strategy described above, we focus on the design of an algorithm for the

92

problem of minimizing the total completion time plus energy and analyze its performance by

93

using tools in mathematical programming. In previous works on scheduling under precedence

94

constraints, the most successful techniques are LP-based roundings [15, 12] or lift-and-

95

project methods [7, 8]. In this paper, we take a different approach that relies non-convex

96

mixed-integer formulations and weak duality presented in [20]. With this approach, we

97

can construct a formulation that is convenient for the design and analysis of our algorithm

98

since the formulation does not need to be either linear or convex. Moreover, one can work

99

directly with integral variables without relaxing them, so avoiding serious integrality gap

100

issue. Specifically, we consider a non-convex formulation for the problem of minimizing

101

the total completion time plus energy and analyze the corresponding Lagrangian function,

102

using the dual-fitting method, in order to bound the dual. The approach allows us to prove

103

an approximation guarantee. That algorithm subsequently is used to derive the improved

104

O (logn)²/(log logn)³

-approximation algorithm for the problemR|arborescences|P

jC_j.

105

2 Preliminaries

106

Given a set ofnjobs, the precedence contraints≺can be represented succinctly by a directed

107

dependence graph. In this graph, there are n vertices, each represents a job, and there

108

is an arc (j, j⁰) if j ≺ j⁰. Note that if in the graph there is a directed path j₁, j₂, . . . , j_k

109

and an arc (j1, jk) then one can simply remove the arc (j1, jk) in the graph while always

110

maintaining the job dependences. In the paper, we consider dependence graph as a collection

111

ofarborescences. Anarborescenceis a directed acyclic graph where the in-degree of every

112

vertex is at most 1. The problem, as defined earlier, is to schedule jobs on unrelated machines

113

in order to minimize the total completion time under the arborescence constraints, i.e.,

114

R|arborescences|P

jCj.

115

Total Completion Time plus Energy. In order to design algorithm for the problem

116

R|arborescences|P

jC_j, we study the following related problem in the speed-scaling model.

117

In the problem, there aremunrelated machines andnjobs. An algorithm can choose speeds

118

si(t) for every machine i at every timet in order to execute jobs. That incurs the total

119

energy ofR∞

0 s_i(t)^αdt whereα≥2 is a fixed parameter. Each jobj has a volumep_ij if it

120

is executed on machinei. A job can be processed preemptively in a machine but without

121

migration, i.e., every job must be assigned to some single machine. A jobj assigned to some

122

machineiis completed at timeCjif the total volume executed by machineion this job up to

123

timeC_j is equal top_ij. Moreover, jobs have precedence constraints≺which are represented

124

by a collections of arborescences. A job j cannot be executed before the completion of

125

every jobj⁰ wherej⁰≺j. In this problem, an algorithm needs to assign jobs to machines,

126

decide the running speeds and execute jobs in some order consistent with the precedence

127

constraints. The objective is to minimize the total completion time plus energy, which is

128

P

jC_j+P

i

R∞

0 s_i(t)^αdt. In the paper, we first design an algorithm for this problem and

129

subsequently derive an algorithm for the problemR|arborescences|P

jCj.

130

Weak Duality. A property of mathematical programming, which holds for non-convex

131

optimization and is crucial in our analysis, is the weak duality, stated as follows. For

132

completeness, we incorporate also its (short) proof.

133

(4)

ILemma 1 (Weak duality). Consider a possibly non-convex optimization problem p^∗ :=

134

minxf0(x) : fi(x) ≤ 0, i = 1, . . . , m where fi : Rⁿ → R for 0 ≤ i ≤ m. Let X

135

be the feasible set of x. Let L : Rⁿ ×R^m → R be the Lagrangian function L(x, λ) =

136

f0(x) +Pm

i=1λifi(x). Define d^∗ = max_λ≥0min_x∈XL(x, λ) where λ ≥ 0 means λ ∈ R^m+.

137

Thenp^∗≥d^∗.

138

Proof. We observe that, for every feasiblex∈ X, and everyλ≥0,f0(x) is bounded below byL(x, λ):

∀x∈ X, ∀λ≥0 : f0(x)≥L(x, λ) Define a functiong:R^m→Rsuch that

g(λ) := min

z L(z, λ) = min

z f₀(z) +

m

X

i=1

λifi(z)

Asg is defined as a point-wise minimum, it is a concave function.

139

We have, for anyxandλ,L(x, λ)≥g(λ). Combining with the previous inequality, we get

∀x∈ X : f₀(x)≥g(λ)

Taking the minimum overx, we obtain∀λ≥0 : p^∗≥g(λ).Therefore, p^∗≥max

λ≥0g(λ) =d^∗.

J

140

Notations. Given a collection of arborescencesG, for every jobj, defineprev(j) to be the

141

jobj⁰ if there exists an arc (j⁰, j) in the graphG; andprev(j) =∅ if the in-degree ofj is

142

0. Note that as inGthe in-degree of every vertex is at most one, prev(j) is well-defined.

143

Intuitively,prev(j) is the last job on which j depends. LetCj be the completion time of

144

jobj. Moreover, define theavailable timeAj of jobj as Cprev(j)ifprev(j)6=∅; andAj= 0

145

otherwise. Informally,Aj is the earliest time wherej can be executed. Thepending-time of

146

jobj is defined asC_j−A_j, that represents the duration from the momentj is available to

147

be executed until its completion. Note that this definition is different (but has some flavour)

148

to the notion of flow-time in scheduling. Additional, a jobj ispending if it is available but

149

has not been completed.

150

3 Approximation Algorithm for Completion Time plus Energy

151

Minimization

152

In this section, we consider the problem of minimizing the total completion time plus energy

153

defined in the previous section. Let G be a collection of arborescences representing job

154

dependencies. For every jobj, define theweight of jobj aswj=P

j⁰:jj⁰1. In other words,

155

w_j is the number of jobs which depends on jobj (includingj itself); equivalently,w_j is the

156

number of nodes in the sub-arborescences rooted atj.

157

We first make the following observation.

158

X

j

wj Cj−Aj

=X

j

X

j⁰:jj⁰

1

Cj−Aj

=X

j

X

j⁰j

(Cj⁰ −Aj⁰) =X

j

Cj.

159 160

The last equality holds due to the structure of arborescences: the set {j⁰ : j⁰ j} forms

161

a path (j1, j2, . . . , jk) wherejk =j andj1 is the root of the arborescence containingj; so

162

(5)

Aj_` =Cj`−1 for 2≤`≤kandAj₁ = 0. Hence, the total job completion time is equal to

163

the total weighted pending-time of jobs with respect to the weightwj’s defined above. So

164

in order to consider the total completion time, we will rather consider the total weighted

165

pending-time.

166

Before presenting the algorithm, we define some notions. At a time t, the remaining

167

volumeof a jobj assigned to machineiis denoted as qij(t). Thedensity of jobj in machine

168

iisδij =wj/pij. Theresidual density of a pending jobj assigned to machineiat timetis

169

δ_ij(t) =w_j/q_ij(t). (Asj is pending,q_ij(t)>0.)

170

Our algorithm, named Algorithm 1, consists of scheduling and assignment policies

171

described as follows.

172

1. Scheduling policy. At any time t, every machinei sets its speed si(t) =βWi(t)^1/α

173

whereWi(t) is the total weight of jobs assigned to machineiwhich are still pending at

174

timet; andβ >0 is a constant to be chosen later. Moreover, at every time, every machine

175

i processes the highest residual density job among the pending ones assigned toi.

176

2. Assignment policy. Whenever any job j is available, i.e., all jobsj⁰ ≺j have been

177

completed, immediately assign jobj to some machine. Note that different assignments of

178

j (to different machines) give rise to different marginal increases of the total weighted

179

pending-time (with respect to the scheduling policy). Here, among all machines, assign

180

(immediately) jobj to the one that minimizes the marginal increase of the total weighted

181

pending-time.

182

Formulation. Letsij(t) be the variable that represents the speed of jobj on machineiat

183

timet. VariablesA_j and C_j denote the available time and the completion time of job j,

184

respectively. Letxij be the variable indicating whether jobj is assigned to machinei. The

185

problem could be relaxed as the following formulation. We emphasize that in the formulation,

186

we donot relax the integrality of variablesxij’s.

187

minimize X

i

Z ∞

0

X

j

sij(t) ^α

dt+X

i,j

Z Cj

Aj

sij(t)dt

δijxij(Cj−Aj)

188

+ α

β(α−1) X

i,j

Z Cj

Aj

sij(t)dt

xijw

α−1 α 189 j

subject to X

i

xij = 1 ∀j

190

xij

Z C_j

Aj

sij(t)dt=pijxij ∀j

191

Aj=C_prev(j) ∀j:prev(j)6=∅

192

Aj= 0 ∀j:prev(j) =∅

193

xij ∈ {0,1} ∀i, j

194

sij(t)≥0 ∀i, j, t

195

Cj≥0 ∀j

196 197

The first constraint ensures that every job is assigned to some machine. The second constraint guarantees that if a job j is assigned to some machine i then it will be fully processed during the interval [Aj, Cj] in machinei. In the objective, the first term represents

(6)

the energy cost. The second term stands for the weighted pending-time of jobs, i.e., Z Cj

A_j

s_ij(t)dt

δ_ijx_ij(C_j−A_j) =p_ijx_ijδ_ij(C_j−A_j) =x_ijw_j(C_j−A_j)

by the second constraint. The last term in the objective, inspired by [1], is added in order to

198

reduce the integrality gap. In this term,β is a parameter (depending on α) to be chosen

199

later. Note that, in order to minimize the objective function under the above constraints,

200

every algorithm will setsij(t) = 0 ∀i, j,∀t /∈[Aj, Cj].

201

The following lemma shows that the objective value of any feasible schedule is within a

202

constant factor of thecost of the schedule, which is the sum of the completion times and the

203

energy consumed. The proof follows the scheme of a similar lemma in [1]. For completeness,

204

we give the proof in the appendix.

205

ILemma 2. Consider a feasible scheduleS for an instanceI of the problem. Letx_ij and

206

sij(t)be the corresponding solution to the mathematical program. Then the objective value of

207

such solution for the mathematical program is at most (1 +_β(α−1)^α ) the cost ofS.

208

Proof. Let C_j be the completion time of job j in schedule S. In the objective of the

209

formulation, the first term clearly captures the consumed energy. Due to the constraints, the

210

second term isP

jw_j C_j−A_j

— the total weighted pending-time (which equals the total

211

completion time).

212

In the remaining, we show that the last term in the objective is bounded by _β(α−1)^α

213

the cost ofS. The arguments follow the ones in [1]. In schedule S, assume that job j is

214

executed during [Aj, Cj] in machinei. Then the average speed seij of j during [Aj, Cj] is

215

p_ij/(C_j−A_j). Thus,C_j−A_j ≥p_ij/es_ij. The total energy consumed to complete jobj is at

216

least (Cj−Aj)es^α_i ≥pijse^α−1_i . Therefore,

217

wj(Cj−Aj) +pijes^α−1_i ≥wjpij/esi+pijes^α−1_i

218

≥p_ijw

α−1 α

j

(α−1)^α¹ + (α−1)⁻^α−1^α

219

≥p_ijw

α−1 α

j =X

i⁰

Z Cj

A_j

s_i⁰_j(t)dt

x_i⁰_jw

α−1 α

j .

220 221

The second inequality is due to the first order condition. In the last term, note thatxi⁰j= 1

222

ifi⁰ =iandx_i0j= 0 ifi⁰6=i. As the energy function is convex, the total energy consumed

223

of a schedule is larger than the sum of energy consumed on each individual job. Summing

224

the above inequality for all jobsj, we deduce that the third term in the objective function is

225

bounded by factor _β(α−1)^α the cost ofS. J

226

Dual program and variable setting. The dual of that program is max minx,s,CLwhereLis

227

the Lagrangian function associated to the above mathematical program and the maximum is

228

taken over dual variables. Letλij be the dual variable corresponding to the second constraint.

229

Set all dual variables exceptλ_ij’s equal to 0, the Lagrangian function becomes

230

X

i

Z ∞

0

X

j

s_ij(t) α

dt+X

j

Z Cj

A_j

δ_ij(C_j−A_j)x_ijs_ij(t)dt

231

+ α

β(α−1) X

i,j

Z Cj

A_j

s_ij(t)dt

x_ijw

α−1 α

j +X

i,j

λ_ijx_ij

p_ij− Z Cj

A_j

s_ij(t)dt

232 233

(7)

Hence, the dual program is

234

x,s,Cmin

X

i,j

λijpijxij

235

−X

i,j

Z C_j

A_j

xijsij(t)

λij−si(t)^α−1− α β(α−1)w

α−1 α

j −δij(Cj−Aj)

dt

236

≥min

x

X

i,j

λ_ijp_ijx_ij

237

−max

x,s,C

X

i,j

Z Cj

A_j

x_ijs_ij(t)

λ_ij−s_i(t)^α−1− α β(α−1)w

α−1 α

j −δ_ij(C_j−A_j)

dt

238 239

Chooseλij such thatλijpij equals the increase of the total weighted pending-time of jobs

240

(different toj) assigned to machine iplus the weighted pending-time of jobj if the latter

241

is assigned to i. In other words,λijpij equals the marginal increase in the total weighted

242

pending time if jobj is assigned to machinei. Recall that by the assignment policy of the

243

algorithm, jobj is assigned to machineithat minimizesλ_ijp_ij.

244

Analysis

245

The strategy of the analysis is to show that, with the chosen dual variables, the dual has

246

value at least some factor (smaller than 1) times the cost of the algorithm schedule. Then,

247

by weak duality, we derive an approximation ratio for the algorithm.

248

We first show that the algorithm admits some monotone property. Consider two sets of

249

jobsI andI⁰ assigned to machinei such that they are identical except that there is only a

250

jobj∈ I \ I⁰ (i.e.,I =I⁰∪ {j}). Moreover, assume that all jobs in I⁰ have available times

251

earlier than that ofj. For every job k, define the fractional weight of k in machine i at

252

timet aswkqik(t)/pik. LetVi(t) be the total fractional weight of pending jobs assigned to

253

machinei. The following lemma, which has been proved in [1], shows a property ofV_i(t).

254

I Lemma 3 ([1]). Let I be a set of jobs and I⁰ = I \ {j} where j ∈ I is the job with

255

maximum available time (among ones in I). Fix an arbitrary machinei. Let V_i^I(t) and

256

V_i^I⁰(t)be the total fractional weights of pending jobs at time tin machinei if the sets of jobs

257

assigned to machinei areI andI⁰, respectively. Then, V_i^I⁰(t)≤V_i^I(t)for every timet.

258

Informally, Lemma 3 shows that for every machinei,Vi(t) is monotone w.r.t the set of

259

jobs assigned to machinei. In fact, Lemma 3 is proved by Anand et al. [1] in the online

260

setting. The proof remains exactly the same by replacing the available timesA_j’s in our

261

setting by the release timesrj’s of jobs in the online setting.

262

We are now proving a crucial lemma relating the dual variables and the fractional pending

263

weights.

264

ILemma 4. It holds thatλij−δj(t−Aj)−_β(α−1)^α w

α−1 α

j ≤ _β(α−1)^α Vi(t)^α−1^α for every machine

265

i and every timet≥A_j.

266

Proof. By Lemma 3, it is sufficient to prove the inequality for a fixed machineiassuming that no new job will be assigned toiafterAj. For simplicity of the notations, as machineiis fixed, in the remaining of the proof, we drop the index of the machines in all the parameters (e.g.,δj(t) stands forδij(t), etc). Moreover, denote againqk =qk(Aj) and δk=δk(Aj) for every pending jobk. At Aj, rename jobs in non-increasing order of their residual densities, i.e.,q1/w1≤. . .≤qn/wn (note thatqk/wk is the inverse of jobk’s residual density). Denote

(8)

Wk=wk+. . .+wn for 1≤k≤n. The marginal increase in the total weighted pending-time due to the assignment of jobj is

w_j q₁

βW₁^1/α

+. . .+ q_j βW_j^1/α

+W_j+1 q_j βW_j^1/α

where the first term is the weighted pending-time of jobj and the second one is the increase

267

of the weighted pending-time of other jobs (note that only jobs with density smaller than

268

that ofj has their completion times increased). LetC_j^∗ be the completion time of jobj if it

269

is assigned to machinei. We consider different cases of timet.

270

Case 1: t≤C_j^∗. Let k be the pending job att with the smallest index. In other words,

271

the machine has processed all jobs 1, . . . , k−1 and a part of jobk in interval [A_j, t]. By the

272

definition ofλj, we have that

273

λ_j−δ_j(t−A_j) =δ_j

q_k(t)

βW_k^1/α + q_k+1

βW_k+1^1/α +. . .+ q_j βW_j^1/α

+ W_j+1 βW_j^1/α

274

=δ_j

wk(t) δ_kβW_k^1/α

+ wk+1

δ_k+1βW_k+1^1/α

+. . .+ wj

δ_jβW_j^1/α

+ Wj+1

βW_j^1/α

275

≤ 1 β

wk(t) W_k^1/α

+ wk+1

W_k+1^1/α

+. . .+ wj

W_j^1/α

+ wj+1

W_j+1^1/α

+. . .+ wn

Wn^1/α

276

≤ 1 β

Z V(t)+w_j

w_n

dz

z^1/α ≤ α

β(α−1)(V(t) +wj)^α−1^α

277

≤ α

β(α−1)

V(t)^α−1^α +w

α−1 α

j

.

278 279

The second equality is due to the definition of the residual density. The first inequality holds

280

sinceδ_j ≤δ_k⁰ for every jobk⁰ ≤jandW_j ≥W_j+1≥. . .≥W_n. The second inequality holds

281

since functionz^−1/αis decreasing. The last inequality holds because 0<(α−1)/α <1.

282

Case 2: t > C_j^∗. Letkbe the pending job att with the smallest index. We have

283

λj−δj(t−Aj) = Wj+1

βW_j^1/α

−δj(t−C_j^∗) = 1 βW_j^1/α

wj+1+. . .+wn

−δj(t−C_j^∗)

284

≤δj+1

qj+1

βW_j+1^1/α

+. . .+δn

qn

βWn^1/α

−δj(t−C_j^∗)

285

≤δ_k qk(t) βW_k^1/α

+δ_k+1 qk+1

βW_k+1^1/α

+. . .+δ_n qn

βWn^1/α

286

=δ_k wk(t) δkβW_k^1/α

+δ_k+1 wk+1

δk+1βW_k+1^1/α

+. . .+δ_n wn

δnβWn^1/α

287

≤ 1 β

Z V(t)

w_n

dz

z^1/α ≤ α

β(α−1)V(t)^α−1^α

288 289

where the first inequality holds sinceWj≥Wj+1≥. . .≥Wn; the second inequality is due

290

toδj ≥δk⁰ for every jobk⁰> j.

291

Combining both cases, the lemma follows. J

292

ITheorem 5. Algorithm 1 is8(1 +_ln^α_α)-approximation forβ= _α−1¹ (α−1 + ln(α−1))^α−1^α .

293

(9)

Proof. LetP^∗ be the total weighted pending-time due to the algorithm (that also equals

294

the total completion time). By the choice of dual variables, we have

295

x,s,CminL= min

x

X

i,j

λijpijxij

296

−max

x,s,C

X

i,j

Z C_j

A_j

xijsij(t)

λij−si(t)^α−1− 1 βw

α−1 α

ij −δj(Cj−Aj)

dt

297

≥ P^∗−max

x,s,C

X

i,j

Z C_j

A_j

x_ijs_ij(t)

λ_ij−s_i(t)^α−1−1 βw

α−1 α

ij −δ_j(t−A_j)

dt

298

≥ P^∗−max

x,s,C

X

i,j

Z Cj

A_j

x_ijs_ij(t) α

β(α−1)V_i(t)^α−1^α −s_i(t)^α−1

dt

299

=P^∗−max

x,s,C

X

i

Z ∞

0

X

j

x_ijs_j(t)

α

β(α−1)V_i(t)^α−1^α −s_i(t)^α−1

dt

300

≥ P^∗−max

x,s,C

X

i

Z ∞

0

si(t) α

β(α−1)Vi(t)^α−1^α −si(t)^α−1

dt

301 302

where the first inequality follows by the assignment policy (assign jobj to machineithat

303

minimizesλ_ijp_ij) andt≤C_j; the second inequality is due to Lemma 4. By the first order

304

condition, functionz(_β(α−1)^α V^α−1^α −z^α−1) is maximized atz0= _((α−1)β)^V^1/α_1/(α−1). We have

305

min

x,s,CL≥ P^∗− α−1 ((α−1)β)^α−1^α

X

i

Z ∞

0

Vi(t)dt

306

≥ P^∗− α−1 ((α−1)β)^α−1^α

X

i

Z ∞

0

Wi(t)dt=

1− α−1 ((α−1)β)^α−1^α

P^∗

307 308

where the second inequality holds sinceV_i(t)≤W_i(t) for everyiandt.

309

Besides, the total weighted pending-time plus energy is

P^∗+ Z ∞

0

s^α(t)dt=P^∗+X

i

Z ∞

0

β^αW_i(t)dt= (1 +β^α)P^∗.

Therefore the primal objective is bounded by (1 +β^α) + _β(α−1)^α (1 +β^α)

P^∗ (Lemma 2).

310

Thus, the approximation ratio is at most

311

(1 +β^α) +_β(α−1)^α (1 +β^α)

1− ^α−1

((α−1)β)^α−1^α

(1)

312

Chooseβ= _α−1¹ (α−1 + ln(α−1))^α−1^α . Observe that

313

1 + ln(α−1) α−1

α−1

< e^ln(α−1)=α−1

314

⇒(α−1 + ln(α−1))^α−1<(α−1)^α⇒β <1

315 316

Moreover,β >(α−1)^−1/α. With the chosenβ, the denominator of (1) becomes α−1+ln(α−1)^ln(α−1)

317

and the nominator is bounded by 8 (since α^−1/α < β < 1 and α ≥ 2). Hence, the

318

approximation ratio is at most 8(1 +α/lnα). J

319

(10)

4 Approximation Algorithm for R|arborescences|

^P_j

C

_j

320

We are now considering the problem R|arborescences|P

jC_j. Fix the parameter α such

321

thatα^α=n, soα= Θ _{log logn}^logⁿ

. Notice that given an instance ofR|arborescences|P

jC_j,

322

there is a corresponding instance of the problem of minimizing the total completion time

323

plus energy in which the energy function of every machine isR∞

0 s_i(t)^αdt. Our algorithm

324

Algorithm 2for theR|arborescences|P

jCj problem is the following.

325

1. Given an instance ofR|arborescences|P

jCj, consider the corresponding instance of the

326

problem of minimizing the total completion time plus energy (defined in Section 2) with

327

parameterαsuch thatα^α=n. Solve the latter by Algorithm 1 and obtain a scheduleS1

328

(with machine speeds).

329

2. Transform the scheduleS1 to a scheduleS2such that at any time twhere si(t)> αfor

330

some machinei, reduce the speeds_i(t) toα. Note that this transformation might delay

331

job completion times.

332

3. Given the scheduleS2, transform to a unit-speed scheduleS3as follows. In the schedule

333

S3, preserve the job-to-machine assignments as in scheduleS2. In every machine, execute

334

jobs non-preemptively (by unit-speed) in the non-decreasing order of their completion

335

times in scheduleS2. Return the non-preemptive scheduleS3.

336

We first show some properties of schedulesS2 andS3.

337

ILemma 6. 1. The cost (i.e., total completion time plus energy) of the schedule S2 is at

338

most that of schedule S1.

339

2. The total completion time of S3 is at mostαtimes that ofS2.

340

Proof. 1. Assume that the speed of some machine i at some time t is si(t) > α. The increasing rate of energy cost in machineiat time tis

d(si(t))^α

dt =αs^α−1_i (t)> α^α=n.

However, the increasing rate of the total completion time is at mostn. Therefore, one can

341

reduce the speedsi(t) to get a smaller cost. Hence, by operations of Step 2 in Algorithm

342

2, the total completion time plus energy of the scheduleS2 is at most that of scheduleS1.

343

2. If the speed of a machine is reduced by a factorαthen the completion time of each job

344

will be increased by at most a factor α. Therefore, the total completion time is increased

345

by at most a factorα.

346

J

347

I Theorem 7. Algorithm 2 is O _{(log log}^log²ⁿ_n)3

-approximation for the problem

348

R|arborescences|P

jCj.

349

Proof. LetC(S) andE(S) be the total completion time and the energy of the scheduleS,

350

respectively. LetS^∗be an optimal schedule for the problem of minimizing the total completion

351

time plus energy. LetOP T be an optimal schedule for the problemR|arborescences|P

jCj.

352

Note thatOP T is a feasible solution to the problem of minimizing the total completion time

353

plus energy where at any time the machine speeds are unit (whenever there is still a pending

354

(11)

job). We have

355

C(S3)≤α· C(S2)≤α·(C(S2) +E(S2))≤α·(C(S1) +E(S1))

356

≤8α

1 + α logα

(C(S^∗) +E(S^∗))≤8α

1 + α logα

(C(OP T) +E(OP T))

357

≤16α

1 + α logα

· C(OP T)

358 359

The first and third inequalities follow from Lemma 6. The fourth inequality is due to

360

Theorem 5. The last inequality holds since inOP T, at every time every machine runs with

361

speed either 1 or 0, so the total energy incurred in a machine is bounded by the maximum

362

completion time of a job in that machine. The theorem follows sinceα= Θ _{log log}^logⁿ_n

. J

363

Remark. The weighted versionR|arborescences|P

jw_jC_j can be solved by a similar al-

364

gorithm and the approximation ratio will beO ρ·_{(log log}^log²ⁿ_n)₃

whereρ= max_j,j⁰_:w_j₀_>0_w^w^j

j0.

365

5 Conclusion

366

In this paper, we present a new approach for the problem R|arborescences|P

jC_j using

367

non-convex formulations and a dual-fitting method. In high level, the consideration of a

368

smooth variant of the problem helps to bypass a hard constraint of the problem (that every

369

job has to be processed by unit speed). Moreover, the formulation of a non-convex program

370

with mixed integer variables (assignment variables) and continuous variables (speed variables)

371

allows us to get rid of the integrality gap issue while still benefit from several continuous

372

aspects. Finally, the analysis holds by the simple yet powerful weak duality which holds even

373

for non-convex programs. The approach enables an improvement, albeit rather small, over a

374

long standing approximation. We hope that the approach would provide additional tools

375

and a different point of view towards the design of algorithms with improved performance

376

guarantees for the general problemsR|prec|P

jC_j andR|prec|P

jw_jC_j.

377

References

378

1 S. Anand, Naveen Garg, and Amit Kumar. Resource augmentation for weighted flow-time

379

explained by dual fitting. InProc. 23rd ACM-SIAM Symposium on Discrete Algorithms,

380

pages 1228–1241, 2012.

381

2 Nikhil Bansal and Subhash Khot. Optimal long code test with one free bit. InProc. 50th

382

Symposium on Foundations of Computer Science, pages 453–462, 2009.

383

3 Abbas Bazzi and Ashkan Norouzi-Fard. Towards tight lower bounds for scheduling

384

problems. InProc. 23rd European Symposium on Algorithms, pages 118–129, 2015.

385

4 Soumen Chakrabarti, Cynthia A Phillips, Andreas S Schulz, David B Shmoys, Cliff Stein,

386

and Joel Wein. Improved scheduling algorithms for minsum criteria. InProc. Colloquium

387

on Automata, Languages, and Programming, pages 646–657, 1996.

388

5 Chandra Chekuri and Sanjeev Khanna. Approximation algorithms for minimizing average

389

weighted completion time. In Handbook of Scheduling, pages 220–249. Chapman and

390

Hall/CRC, 2004.

391

6 Fabián A Chudak and David B Shmoys. Approximation algorithms for precedence-

392

constrained scheduling problems on parallel machines that run at different speeds.Journal

393

of Algorithms, 30(2):323–343, 1999.

394