Mixing MIR Inequalities with Two Divisible Coefficients


HAL Id: hal-00387098

https://hal.archives-ouvertes.fr/hal-00387098

Submitted on 24 May 2009


Miguel Constantino, Andrew Miller, Mathieu van Vyve

To cite this version:

Miguel Constantino, Andrew Miller, Mathieu van Vyve. Mixing MIR Inequalities with Two Divisible Coefficients. Mathematical Programming, Series A, Springer, 2010, 123, pp. 451-483. doi:10.1007/s10107-009-0266-9. hal-00387098.


Miguel Constantino CIO-DEIO University of Lisbon

Lisbon, Portugal

miguel.constantino@fc.ul.pt

Andrew J. Miller IMB, Université Bordeaux 1

Team RealOpt, INRIA Bordeaux Sud-Ouest

Andrew.Miller@math.u-bordeaux1.fr

Mathieu Van Vyve N-Side S.A., Louvain-la-Neuve, Belgium

mathieu.vanvyve@gmail.com

Revised: December 3, 2008

Abstract

This paper is a polyhedral study of a generalization of the mixing set where two different, divisible coefficients are allowed for the integral variables. Our results generalize earlier work on mixed integer rounding, mixing, and extensions. They also directly apply to applications such as production planning problems involving lower bounds or start-ups on production, when these are modeled as mixed-integer linear programs.

We define a new class of valid inequalities and give two proofs that they suffice to describe the convex hull of this mixed-integer set. We give a characterization of each of the maximal faces of the convex hull, as well as a closed form description of its extreme points and rays, and show how to separate over this set in $O(n \log n)$. Finally, we give several extended formulations of polynomial size, and study conditions under which adding certain simple constraints on the integer variables preserves our main result.


1 Introduction

In this paper we study the polyhedral structure of the mixed integer set $P^{MMIX}$:

\[ s + L z_t \ge b_t, \ 1 \le t \le k, \qquad s + C z_t \ge b_t, \ k+1 \le t \le n, \qquad z \in \mathbb{Z}^n, \quad s \ge 0, \]

where $b_t \in \mathbb{R}$, $L, C > 0$ and $C/L \in \mathbb{Z}$.

The polyhedral descriptions of many mixed integer sets can be obtained based on the so-called Mixed Integer Rounding (MIR) inequalities and extensions (Gomory [1960], Nemhauser and Wolsey [1988]). The basic MIR inequality is derived from the set

\[ P^{MIR} = \{(s, z) \in \mathbb{R}_+ \times \mathbb{Z} : s + Cz \ge b\}. \]

If we let $\alpha = \lceil b/C \rceil$ and $\gamma = b - C(\alpha - 1)$, then we can write the basic MIR inequality as

\[ s \ge \gamma(\alpha - z). \]
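As a quick numerical illustration (not part of the original text), the following sketch computes $\alpha$ and $\gamma$ for given $b$ and $C$ and checks the basic MIR inequality by enumeration. The function names, the data and the enumeration range are assumptions chosen for illustration; exact rationals are used to avoid rounding issues.

```python
from fractions import Fraction
from math import ceil

def mir_parameters(b, C):
    """Return (alpha, gamma) with alpha = ceil(b/C) and gamma = b - C*(alpha - 1)."""
    alpha = ceil(b / C)
    gamma = b - C * (alpha - 1)
    return alpha, gamma

def min_feasible_s(b, C, z):
    """Smallest s >= 0 with s + C*z >= b for a fixed integer z."""
    return max(Fraction(0), b - C * z)

def mir_holds_on_range(b, C, z_range):
    """Check s >= gamma*(alpha - z) at the smallest feasible s for each z."""
    alpha, gamma = mir_parameters(b, C)
    return all(min_feasible_s(b, C, z) >= gamma * (alpha - z) for z in z_range)

b, C = Fraction("9.9"), Fraction(5)      # illustrative data (an assumption)
print(mir_parameters(b, C))              # alpha = 2, gamma = 49/10
print(mir_holds_on_range(b, C, range(-5, 6)))
```

Checking only the smallest feasible $s$ for each $z$ is enough here, because the inequality bounds $s$ from below.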

Günlük and Pochet [2001] showed how the convex hull of certain sets can be obtained by mixing basic MIR inequalities. This procedure was developed on a generalization of the MIR set that we can define as $P^{GMIX} = \{(s, z) \in \mathbb{R}_+ \times \mathbb{Z}^n : s + C z_t \ge b_t, \ 1 \le t \le n\}$. Those authors showed that inequalities obtained by the mixing procedure describe the convex hull of $P^{GMIX}$.

The set $P^{MMIX}$ studied in this paper considers two divisible coefficients $L$ and $C$, and it therefore generalizes $P^{GMIX}$. Indeed, $P^{GMIX}$ is the special case of $P^{MMIX}$ with either $k = 0$ or $k = n$. Our aim is to describe the convex hull of $P^{MMIX}$. To do this, we will develop a way to define valid inequalities by applying the mixing procedure twice. This approach not only yields the convex hull of $P^{MMIX}$, but other remarkable results as well.

These include an $O(n \log n)$ separation algorithm for $P^{MMIX}$ that is asymptotically as fast as the separation algorithm for $P^{GMIX}$ (which is described in Pochet and Wolsey [1994] and Miller and Wolsey [2003]). Analysis of the separation algorithm also yields, for each of the facet-defining inequalities that we define, a system of inequalities that describes the set of points lying in the face defined by that inequality. Using a different approach, we obtain several exact extended formulations for $P^{MMIX}$.

While this research was in progress, Zhao and de Farias [2007a,c] studied a generalization of $P^{MMIX}$ that may have more capacities, each of which divides all larger capacities. They show how to compute the extreme points of, and optimize over, the MIP model in question (they give a closed-form list of extreme points in only a very limited case). Our results for conv($P^{MMIX}$) are more detailed and provide more information than do the results just mentioned; however, the work of Zhao and de Farias is particularly impressive because it applies to models with $m$ divisible capacities for any $m \le n$. (More recently, Di Summa [2007] has refined the algorithm given by Zhao and de Farias [2007a] to compute the extreme points of this general model, and Conforti et al. [2008] have proposed an integral extended formulation that generalizes the results of Section 6 of this paper.)

A theoretical motivation for this research is the generalization of the mixing inequalities to apply to more complicated models which often occur as substructures of difficult MIP models. Moreover, applications motivating this research arise in the study of a couple of MIP lot sizing models, which we present briefly below.

Lot Sizing with Lower Bounds

Consider the following single-item lot-sizing model with lower bounds. This model may arise as a submodel of complex multi-item multi-resource production planning problems.

\[ \max \sum_{t=1}^{n} (f_t y_t + p_t x_t + h_t s_t) \]

\[ s_{t-1} + x_t = d_t + s_t, \quad 1 \le t \le n, \tag{1} \]

\[ L y_t \le x_t \le C y_t, \quad 1 \le t \le n, \tag{2} \]

\[ y_t \in \mathbb{Z}_+, \quad 1 \le t \le n, \tag{3} \]

where $d_t \ge 0$ are the demands, $f_t$ are the set-up costs, $p_t$ are the unit production costs, and $h_t$ are the unit holding costs. In this model multiple set-ups in a single period are allowed, so the variables $y$ can take integer values larger than one. When the costs satisfy the Wagner-Whitin property ($p_t \le h_t + p_{t-1}$) (Pochet and Wolsey [1994]) there exists an optimal solution for this model for which the stock is minimal with respect to when set-ups occur, that is, for fixed $y$, the stock variables $s$ have the minimum feasible values. The solution $(s, x, y)$ is a stock-minimal solution of (1)-(3) if and only if

\[ s_k = \max\left[ \max_{t \le k} \Big( L \sum_{i=t}^{k} y_i - \sum_{i=t}^{k} d_i \Big)^+ , \ \max_{t \ge k+1} \Big( \sum_{i=k+1}^{t} d_i - C \sum_{i=k+1}^{t} y_i \Big)^+ \right], \]

for all $0 \le k \le n$ and $(s, x, y)$ satisfies (1) and (3) (see Van Vyve [2003]).

This implies that the dominant of the stock-minimal solutions (the set of $(s', x, y)$ such that there exists $s \le s'$ with $(s, x, y)$ satisfying (1)-(3)) is the feasible set of the following model.

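\[ s_k \ge L \sum_{i=t}^{k} y_i - \sum_{i=t}^{k} d_i, \quad 1 \le t \le k \le n, \tag{4} \]

\[ s_k \ge \sum_{i=k+1}^{t} d_i - C \sum_{i=k+1}^{t} y_i, \quad 1 \le k+1 \le t \le n, \tag{5} \]

\[ y_t \in \mathbb{Z}_+, \quad 1 \le t \le n, \tag{6} \]

\[ s \ge 0. \tag{7} \]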

Constantino [1995, 1998] describes valid inequalities for (4)-(7) when $C$ is very large ($C \ge \sum_{i=k+1}^{n} d_i$). The inequalities described in this paper are strong in the more general case where $L$ divides $C$ exactly. We show that for $k$ fixed, these inequalities describe the convex hull of solutions for (4)-(7) when $C$ is large.

To relate (4)-(7) to $P^{MMIX}$ for a fixed $k$, let $s = s_k$ and define $z_t = -\sum_{i=t}^{k} y_i$, $b_t = -\sum_{i=t}^{k} d_i$ for $1 \le t \le k$, and $z_t = \sum_{i=k+1}^{t} y_i$, $b_t = \sum_{i=k+1}^{t} d_i$ for $k+1 \le t \le n$. This yields the model $P^{MMIX}$ with extra constraints coming from the nonnegativity of the $y_t$ variables, $z_1 \le \cdots \le z_k \le 0 \le z_{k+1} \le \cdots \le z_n$. Observe that we also have $b_1 \le \cdots \le b_k \le 0 \le b_{k+1} \le \cdots \le b_n$.

Lot Sizing with Startups

Consider now a single-item lot-sizing model with startups and capacities.

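\[ \max \sum_{t} (f_t y_t + g_t w_t + p_t x_t + h_t s_t) \tag{8} \]

\[ s_{t-1} + x_t = d_t + s_t, \quad 1 \le t \le m, \tag{9} \]

\[ x_t \le K y_t, \quad 1 \le t \le m, \tag{10} \]

\[ y_t - y_{t-1} \le w_t \le y_t, \quad 1 \le t \le m, \tag{11} \]

\[ y_t, w_t \in \{0, 1\}, \quad 1 \le t \le m, \tag{12} \]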

where $d_t \ge 0$ and $g_t$ is the startup cost, incurred if there is a set-up in period $t$ and no set-up in period $t - 1$. We consider a relaxation of (9)-(12). The stock-minimal solutions of this model satisfy the following inequalities, for $1 \le \ell \le m$:

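\[ s_\ell \ge \sum_{i=\ell+1}^{t} d_i - K \sum_{i=\ell+1}^{t} y_i, \quad \ell + 1 \le t \le m, \tag{13} \]

\[ s_\ell \ge \sum_{i=\ell+1}^{t} d_i - M \Big( y_\ell + \sum_{i=\ell+1}^{t} w_i \Big), \quad \ell + 1 \le t \le m, \tag{14} \]

\[ y_t, w_t \in \mathbb{Z}_+, \quad \ell + 1 \le t \le m, \tag{15} \]

\[ y_\ell \in \mathbb{Z}_+, \quad s \ge 0, \tag{16} \]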

where $M \ge \sum_{i=\ell+1}^{m} d_i$. In order to write (13)-(16) as $P^{MMIX}$, choose $M$ to be a multiple of $K$ and consider the following transformation for $\ell$ fixed: $s = s_\ell$, $z_t = \sum_{i=1}^{t} y_{\ell+i}$, $b_t = \sum_{i=1}^{t} d_{\ell+i}$ for $1 \le t \le m - \ell = k$, and $z_t = y_\ell + \sum_{i=1}^{t-k} w_{\ell+i}$, $b_t = \sum_{i=1}^{t-k} d_{\ell+i}$ for $k + 1 \le t \le 2(m - \ell) = n$.

The constraints implied by the nonnegativity of the $y_t$ and $w_t$ variables are $0 \le z_1 \le \cdots \le z_k$ and $0 \le z_{k+1} \le \cdots \le z_n$. Observe that we have $b_1 \le \cdots \le b_k$ and $b_t = b_{t-k}$ for $k + 1 \le t \le n$. In this case constraints (11) tie together variables $z_t$ for $t \le k$ and $t > k$.

Reconsidering the set $P^{GMIX}$, observe that the basic MIR inequalities obtained considering each constraint separately are $s \ge \gamma_t(\alpha_t - z_t)$, with $\alpha_t$ and $\gamma_t$ defined as above. These inequalities can be combined by taking differences in the following way. Let $U \subseteq \{1, \ldots, n\}$ and let $[1], \ldots, [|U|]$ be any ordering of the elements of $U$ such that $\gamma_{[1]} \le \gamma_{[2]} \le \cdots \le \gamma_{[|U|]}$. The mixed MIR (or simply mixing) inequalities (Pochet and Wolsey [1993], Günlük and Pochet [2001]) are

\[ s \ge \sum_{[t] \in U} (\gamma_{[t]} - \gamma_{[t-1]})(\alpha_{[t]} - z_{[t]}), \]

\[ s \ge \sum_{[t] \in U} (\gamma_{[t]} - \gamma_{[t-1]})(\alpha_{[t]} - z_{[t]}) + (C - \gamma_{[|U|]})(\alpha_{[1]} - z_{[1]} - 1), \]

for each set $U \subseteq \{1, \ldots, n\}$, with $\gamma_{[0]} = 0$.

This paper is organized as follows. In the next section, we introduce the two-level mixed MIR inequalities for the mixing set with two divisible capacities. In Section 3 we present a polynomial-time separation algorithm; based on this, we give a proof for the description of the convex hull of solutions of $P^{MMIX}$ in Section 4. In Section 5 we describe the extreme points and rays of conv($P^{MMIX}$). Since there are $O(|I_1||I_2|)$ extreme points and $n + 1$ extreme rays, this extremal description implies both a fast optimization algorithm and a polynomial-size extended formulation for $P^{MMIX}$. In Section 6 we give other extended formulations based on a decomposition of the continuous variable. This approach allows us to derive an alternative convex hull proof, and we can also use it to show, in Section 7, that our inequalities suffice to describe the convex hull of (4)-(7) for fixed $k$, when $C$ is very large. We finish with some conclusions in Section 8.

2 Two–level Mixing Inequalities

In this section we define the family of two-level mixing inequalities. For the remainder of the paper, we assume that the model $P^{MMIX}$ is normalized so that $L = 1$; thus we consider $P^{MMIX}$ as being the set of points $(s, z) \in \mathbb{R} \times \mathbb{Z}^n$ satisfying the following constraints:

\[ s + z_i \ge b_i, \quad 1 \le i \le k, \tag{17} \]

\[ s + C z_i \ge b_i, \quad k + 1 \le i \le n, \tag{18} \]

\[ z \in \mathbb{Z}^n, \tag{19} \]

\[ s \ge 0, \tag{20} \]

where $C > 0$ is an integer.

First we recall some results concerning the mixing inequalities.

Lemma 1 (Günlük and Pochet [2001]) Let $W = \{(s, z) \in \mathbb{R}^p \times \mathbb{Z}^n : f(s) + \Pi\theta^i(z) \ge \pi^i, \ i = 1, \ldots, m\}$, where $f(s) \ge 0$ and $\theta^i(z) \in \mathbb{Z}$. Let $\tau^i$ integer and $0 \le \gamma^i \le \Pi$ be defined such that $\pi^i = (\tau^i - 1)\Pi + \gamma^i$, let $\gamma^0 = 0$, and suppose w.l.o.g. that $\gamma^1 \le \gamma^2 \le \cdots \le \gamma^m$. Let $S = \{i_1, \ldots, i_{|S|}\} \subseteq \{1, \ldots, m\}$ with $0 = i_0 \le i_1 \le \cdots \le i_{|S|}$. The following mixed-MIR (or mixing) inequalities are valid for $W$:

\[ f(s) \ge \sum_{t=1}^{|S|} (\gamma^{i_t} - \gamma^{i_{t-1}})(\tau^{i_t} - \theta^{i_t}(z)), \tag{21} \]

\[ f(s) \ge \sum_{t=1}^{|S|} (\gamma^{i_t} - \gamma^{i_{t-1}})(\tau^{i_t} - \theta^{i_t}(z)) + (\Pi - \gamma^{i_{|S|}})(\tau^{i_1} - 1 - \theta^{i_1}(z)). \tag{22} \]

Note that we have very slightly extended the original notation by allowing $\tau^i$ to be defined either as $\lceil \pi^i/\Pi \rceil$, in which case $0 < \gamma^i \le \Pi$ (as in the original paper, Günlük and Pochet [2001]), or as $\lfloor \pi^i/\Pi \rfloor + 1$, so that $0 \le \gamma^i < \Pi$. Indeed, when $\pi^i$ is not a multiple of $\Pi$, both definitions are equivalent. If $\pi^i$ is a multiple of $\Pi$, then it is readily checked that both definitions yield the same inequality (22), and also that inequality (21) is redundant in the two cases.

Lemma 2 Let $(\bar s, \bar z) \in \mathbb{R}^p \times \mathbb{R}^n$. Under the conditions of Lemma 1, a most violated mixed-MIR inequality is given by a set $S = \{i_1, \ldots, i_{|S|}\}$ such that

1. $\tau^{i_1} - \theta^{i_1}(\bar z) \ge \tau^{i_2} - \theta^{i_2}(\bar z) \ge \cdots \ge \tau^{i_{|S|}} - \theta^{i_{|S|}}(\bar z)$;

2. $\tau^{i_t} - \theta^{i_t}(\bar z) \ge \tau^{i} - \theta^{i}(\bar z)$, for $i : i_{t-1} < i < i_t$, $t = 1, \ldots, |S|$;

3. either $1 \ge \tau^{i_1} - \theta^{i_1}(\bar z)$, in which case $\tau^{i_{|S|}} - \theta^{i_{|S|}}(\bar z) \ge 0$; $0 \ge \tau^{i} - \theta^{i}(\bar z)$, for $i : i > i_{|S|}$; and the most violated inequality is (21);

or $\tau^{i_1} - \theta^{i_1}(\bar z) - 1 \ge 0$, in which case $\tau^{i_{|S|}} - \theta^{i_{|S|}}(\bar z) \ge \tau^{i_1} - \theta^{i_1}(\bar z) - 1$; $\tau^{i_1} - \theta^{i_1}(\bar z) - 1 \ge \tau^{i} - \theta^{i}(\bar z)$, for $i > i_{|S|}$; and the most violated inequality is (22).

Proof: The proof is discussed in Pochet and Wolsey [1994] and Miller and Wolsey [2003]. □

Note that it is possible that the indices in $S$ satisfying 1 and 2 above may satisfy $\tau^{i_1} - \theta^{i_1}(\bar z) = 1$, in which case each of the inequalities (21) and (22) defined by this set will be most violated inequalities.

Now let $I_1 = \{1, \ldots, k\}$, $I_2 = \{k + 1, \ldots, n\}$ and $I = I_1 \cup I_2$. Then we can see that the set $P^{MMIX}$ defines two intersecting mixing sets, one based on $s$ and the variables in $I_1$, in which $\Pi = 1$, and the other based on $s$ and the variables in $I_2$, in which $\Pi = C$; in both cases $\theta^i(z) = z_i$.

Example 1

Consider the following instance of $P^{MMIX}$ in which $n = 4$, $k = 2$, $C = 5$ and the right-hand-side vector $b$ is equal to $[3.8, 5.3, 1.6, 9.9]$ (from Van Vyve [2003]):

\[ s + z_1 \ge 3.8 \tag{23} \]

\[ s + z_2 \ge 5.3 \tag{24} \]

\[ s + 5z_3 \ge 1.6 \tag{25} \]

\[ s + 5z_4 \ge 9.9 \tag{26} \]

\[ z_i \in \mathbb{Z}, \quad i \in \{1, 2, 3, 4\} \tag{27} \]

\[ s \ge 0. \tag{28} \]

Thus $I_1 = \{1, 2\}$ and $I_2 = \{3, 4\}$. The first two inequalities (for variables in $I_1$) and (28) define a mixing set with unit capacity. Inequalities (25)–(26) (for variables in $I_2$) and (28) define another mixing set with capacity 5.

Thus we can use Lemma 1 to define the following valid inequality for $P^{MMIX}$:

\[ s \ge 0.3(6 - z_2) + 0.5(4 - z_1). \]

Here $\Pi = 1$, $f(s) = s$, $\theta^i(z) = z_i$ for $i = 1, 2$. Also, $\tau^1 = 4$ and $\tau^2 = 6$, and $\gamma^1 = 0.8$ and $\gamma^2 = 0.3$. The inequality listed is then given by (21) when $S = \{1, 2\}$.

Likewise, we can use Lemma 1 to define

\[ s \ge 1.6(1 - z_3) + 3.3(2 - z_4) + 0.1(0 - z_3), \]

which is valid for $P^{MMIX}$. This inequality results from (22) when $S = \{3, 4\}$, because we can take $\Pi = 5$, $f(s) = s$, $\theta^i(z) = z_i$ for $i = 3, 4$. □
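The two inequalities of Example 1 can be reproduced mechanically from Lemma 1 when $\theta^i(z) = z_i$. The sketch below is illustrative only: the function name `mixing_terms` and its return format are assumptions, and exact rationals stand in for the decimal data.

```python
from fractions import Fraction
from math import ceil

def mixing_terms(b, Pi, with_extra_term=False):
    """Terms of a mixing inequality f(s) >= sum of coef * (tau - z_index).

    b maps a variable index to its right-hand side; Pi is the common capacity.
    Returns (coefficient, tau, index) triples ordered by nondecreasing gamma.
    """
    data = []
    for i, rhs in b.items():
        tau = ceil(rhs / Pi)                 # tau^i = ceil(pi^i / Pi)
        gamma = rhs - (tau - 1) * Pi         # pi^i = (tau^i - 1) * Pi + gamma^i
        data.append((gamma, tau, i))
    data.sort()                              # gamma^{i_1} <= ... <= gamma^{i_|S|}
    terms, previous_gamma = [], Fraction(0)  # gamma^{i_0} = 0
    for gamma, tau, i in data:
        terms.append((gamma - previous_gamma, tau, i))
        previous_gamma = gamma
    if with_extra_term:                      # last term of inequality (22)
        _, tau_first, i_first = data[0]
        terms.append((Pi - previous_gamma, tau_first - 1, i_first))
    return terms

unit = mixing_terms({1: Fraction("3.8"), 2: Fraction("5.3")}, Fraction(1))
cap5 = mixing_terms({3: Fraction("1.6"), 4: Fraction("9.9")}, Fraction(5), True)
print(unit)   # coefficients 3/10 on (6 - z_2) and 1/2 on (4 - z_1)
print(cap5)   # coefficients 8/5, 33/10 and the extra term 1/10 on (0 - z_3)
```

Each triple `(c, tau, i)` stands for the term $c(\tau - z_i)$, so the two lists correspond to the inequalities obtained from (21) with $S = \{1, 2\}$ and from (22) with $S = \{3, 4\}$.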

Now we will discuss how to define strong valid inequalities for $P^{MMIX}$ that have nonzero coefficients for variables in both $I_1$ and $I_2$. To do this, we first define $\kappa_i = \lceil b_i \rceil$, $\eta_i = b_i - (\kappa_i - 1)$ for $i \in I_1$ and $\eta_0 = 1$.

In addition, we let $\alpha_i = \lceil b_i / C \rceil$ and $\delta_i = b_i - (\alpha_i - 1)C$ for $i \in I_2$. Also, let $\kappa_i = \lceil \delta_i \rceil$ and $\eta_i = \delta_i - (\kappa_i - 1)$ for $i \in I_2$. Note that these definitions imply that

\[ b_i = (\kappa_i - 1) + \eta_i, \quad i \in I_1, \]

\[ b_i = (\alpha_i - 1)C + \delta_i = (\alpha_i - 1)C + (\kappa_i - 1) + \eta_i, \quad i \in I_2. \]

Observe that $0 < \kappa_i \le C$ for $i \in I_2$ and $0 < \eta_i \le 1$ for $i \in I$. For notational convenience, we also define $\kappa_0 = 0$ and $\eta_0 = 1$.

Throughout, we assume that $0 < \eta_i < 1$, $i \in I_1$, and that $0 < \delta_i < C$, $i \in I_2$. Relaxing this assumption does not affect most of the results we derive, but making it sometimes reduces the number of cases that must be considered and makes notation easier.

We also suppose without loss of generality that $\delta_{k+1} \le \delta_{k+2} \le \cdots \le \delta_n$. Finally, for $i \in I_2 \cup \{0\}$ and $j \in I \cup \{0\}$, let

\[ \kappa_i^j = \begin{cases} \kappa_i & \text{if } \eta_i \ge \eta_j, \\ \kappa_i - 1 & \text{if } \eta_i < \eta_j. \end{cases} \]

Note that an equivalent definition of $\kappa_i^j$ is

\[ \kappa_i^j = \min\{\kappa \in \mathbb{Z} : \kappa \ge \kappa_i + \eta_i - \eta_j\}. \]

Thus, $\kappa_i^j$ is the lowest value the integral part of $s$ can take keeping (18) satisfied for $i$, if $z_i = \alpha_i - 1$ and $\eta_j$ is the fractional part of $s$ for some $j \in I \cup \{0\} \setminus \{i\}$ (except when $j = 0$ or $\delta_j$ is integer, in which case $\eta_j = 1$).

Note also that $\kappa_0^j = \kappa_0 = 0$ for all $j \in I \cup \{0\}$. These quantities will be essential in defining the two-level mixing inequalities.

The next result is also helpful in defining the new inequalities. In addition, it will be important in characterizing separation and in showing that they define the convex hull of $P^{MMIX}$.

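Lemma 3 For every $j \in I \cup \{0\}$,

\[ 0 = \kappa_0^j \le \kappa_{k+1}^j \le \kappa_{k+2}^j \le \cdots \le \kappa_n^j. \tag{29} \]

Proof: As $\delta_i \le \delta_{i+1}$ implies $\kappa_i \le \kappa_{i+1}$, the only nontrivial case is when $\kappa_i^j = \kappa_i$ and $\kappa_{i+1}^j = \kappa_{i+1} - 1$. In this case we must have $\eta_{i+1} < \eta_j \le \eta_i$, so $\kappa_i = \delta_i - \eta_i + 1 < \delta_{i+1} - \eta_{i+1} + 1 = \kappa_{i+1}$. Since both quantities are integers, $\kappa_i^j = \kappa_i \le \kappa_{i+1} - 1 = \kappa_{i+1}^j$. □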
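A small randomized check can make the monotonicity in (29) concrete. The sketch below is not from the paper: the capacity, the random data and the function name `lemma3_holds` are assumptions, and it uses the case definition of $\kappa_i^j$ ($\kappa_i$ if $\eta_i \ge \eta_j$, and $\kappa_i - 1$ otherwise).

```python
import random
from fractions import Fraction
from math import ceil

def lemma3_holds(deltas, eta_js):
    """Check 0 <= kappa^j_{k+1} <= ... <= kappa^j_n for every eta_j in eta_js.

    deltas lists delta_i, i in I_2, in nondecreasing order.
    """
    kappas = [ceil(d) for d in deltas]
    etas = [d - (k - 1) for d, k in zip(deltas, kappas)]
    for eta_j in eta_js:
        row = [k if e >= eta_j else k - 1 for k, e in zip(kappas, etas)]
        if any(a > b for a, b in zip([0] + row, row)):
            return False
    return True

random.seed(0)                               # fixed seed, only for reproducibility
C = 7                                        # hypothetical capacity
deltas = sorted(Fraction(random.randint(1, 100 * C - 1), 100) for _ in range(30))
etas = [d - (ceil(d) - 1) for d in deltas]
print(lemma3_holds(deltas, etas + [Fraction(1)]))   # eta_0 = 1 is included
```

A check of this kind only supports the statement on sampled data; the proof above is what establishes it.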

We will now use Lemma 1 twice to build the two-level mixing inequalities. The first step uses Lemma 1 to define, for each $i \in I_2 \cup \{0\}$, a set of inequalities involving the variables in $I_2 \cup \{0\}$ that can then be mixed with simple MIR inequalities involving variables in $I_1$.

Lemma 4 Let $S = \{i_1, \ldots, i_{|S|}\} \subseteq I_2$, with $0 = i_0 < i_1 < \cdots < i_{|S|}$, and let $j \in I_2 \cup \{0\}$. The following inequalities are valid for $P^{MMIX}$:

\[ s + 1 - \eta_j \ge \sum_{t=1}^{|S|} (\kappa_{i_t}^j - \kappa_{i_{t-1}}^j)(\alpha_{i_t} - z_{i_t}), \tag{30} \]

\[ s + 1 - \eta_j \ge \sum_{t=1}^{|S|} (\kappa_{i_t}^j - \kappa_{i_{t-1}}^j)(\alpha_{i_t} - z_{i_t}) + (C - \kappa_{i_{|S|}}^j)(\alpha_{i_1} - 1 - z_{i_1}). \tag{31} \]

Proof: From (18) and the definition of $\alpha_i$, $\kappa_i$ and $\eta_i$, we have

\[ s + 1 - \eta_i + C z_i \ge (\alpha_i - 1)C + \kappa_i. \]

Now if $\eta_i \ge \eta_j$ we can write $s + 1 - \eta_j + C z_i \ge (\alpha_i - 1)C + \kappa_i$; if $\eta_i < \eta_j$, as $\eta_i \ge 0$ and $\eta_j \le 1$, we have $s + 1 - \eta_j + C z_i \ge (\alpha_i - 1)C + \kappa_i - 1$. Thus

\[ s + 1 - \eta_j + C z_i \ge (\alpha_i - 1)C + \kappa_i^j \quad \text{for } i \in I_2. \tag{32} \]

Define $f(s) = s + 1 - \eta_j$, which is nonnegative because $s \ge 0$ and $\eta_j \le 1$, $\Pi = C$, $\theta^i(z) = z_i$, and $\pi^i = (\alpha_i - 1)C + \kappa_i^j$. Thus $\lceil \pi^i/\Pi \rceil = \alpha_i$ and $\gamma^i = \kappa_i^j$. The validity of the inequalities follows from Lemma 1. □

Example 1 (continued)

We can compute the following values:

\[ \begin{array}{c|ccccc} i & 0 & 1 & 2 & 3 & 4 \\ \hline \alpha_i & & & & 1 & 2 \\ \delta_i & & & & 1.6 & 4.9 \\ \kappa_i & & 4 & 6 & 2 & 5 \\ \eta_i & 1 & 0.8 & 0.3 & 0.6 & 0.9 \end{array} \]

Then the following table holds the values of $\kappa_i^j$:

\[ \begin{array}{c|cc} \kappa_i^j & i = 3 & i = 4 \\ \hline j = 0 & 1 & 4 \\ j = 1 & 1 & 5 \\ j = 2 & 2 & 5 \\ j = 3 & 2 & 5 \\ j = 4 & 1 & 5 \end{array} \]

Recall that $\kappa_0^j = \kappa_0 = 0$ for all $j \in I \cup \{0\}$. Now if we take $S = \{3, 4\}$ and $j = 3$, then inequalities (30) and (31) both yield

\[ s + 1 - 0.6 = s + 0.4 \ge 2(1 - z_3) + 3(2 - z_4), \tag{33} \]

which follows from the fact that both $s + 0.4 \ge 2(1 - z_3)$ and $s + 0.4 \ge 5(2 - z_4)$ are valid for $P^{MMIX}$.

Similarly, if we take $j = 4$, then inequalities (30) and (31) both yield

\[ s + 1 - 0.9 = s + 0.1 \ge 1(1 - z_3) + 4(2 - z_4). \tag{34} \]

If we take $j = 0$, then inequalities (30) and (31) respectively yield

\[ s + 1 - 1 = s \ge 1(1 - z_3) + 3(2 - z_4), \tag{35} \]

\[ s + 1 - 1 = s \ge 1(1 - z_3) + 3(2 - z_4) + (-z_3). \tag{36} \]

□
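The parameter values and the table of $\kappa_i^j$ in Example 1 follow directly from the definitions. The sketch below recomputes them; the variable names and the dictionary layout are assumptions made for illustration, and exact rationals replace the decimals.

```python
from fractions import Fraction
from math import ceil

C = 5
b = {1: Fraction("3.8"), 2: Fraction("5.3"), 3: Fraction("1.6"), 4: Fraction("9.9")}
I1, I2 = [1, 2], [3, 4]

kappa, eta = {0: 0}, {0: Fraction(1)}        # kappa_0 = 0, eta_0 = 1
alpha, delta = {}, {}
for i in I1:
    kappa[i] = ceil(b[i])
    eta[i] = b[i] - (kappa[i] - 1)
for i in I2:
    alpha[i] = ceil(b[i] / C)
    delta[i] = b[i] - (alpha[i] - 1) * C
    kappa[i] = ceil(delta[i])
    eta[i] = delta[i] - (kappa[i] - 1)

def kappa_ji(j, i):
    """kappa_i^j = kappa_i if eta_i >= eta_j, and kappa_i - 1 otherwise."""
    return kappa[i] if eta[i] >= eta[j] else kappa[i] - 1

table = {j: [kappa_ji(j, i) for i in I2] for j in [0] + I1 + I2}
for j, row in table.items():
    print(f"j = {j}: {row}")                 # one row of the kappa_i^j table per j
```

The rows printed for $j = 0, \ldots, 4$ list $\kappa_3^j$ and $\kappa_4^j$ in that order.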

The next proposition follows from Lemmas 2 and 4.

Proposition 5 Given $(\bar s, \bar z) \in \mathbb{R} \times \mathbb{R}^{|I_2|}$, the most violated inequality (30) or (31), for every $j \in I_2 \cup \{0\}$, is determined by a set $S = \{i_1, \ldots, i_{|S|}\} \subseteq I_2$, with $i_1 < \cdots < i_{|S|}$, such that

1. $\alpha_{i_1} - \bar z_{i_1} \ge \alpha_{i_2} - \bar z_{i_2} \ge \cdots \ge \alpha_{i_{|S|}} - \bar z_{i_{|S|}}$;

2. $\alpha_{i_t} - \bar z_{i_t} \ge \alpha_i - \bar z_i$, for $i_{t-1} < i < i_t$, $t = 2, \ldots, |S|$, and $k < i < i_1$;

3. either $1 \ge \alpha_{i_1} - \bar z_{i_1}$, in which case $\alpha_{i_{|S|}} - \bar z_{i_{|S|}} \ge 0$; $0 \ge \alpha_i - \bar z_i$, for $i : i > i_{|S|}$; and the most violated inequality is (30);

or $\alpha_{i_1} - \bar z_{i_1} - 1 \ge 0$, in which case $\alpha_{i_{|S|}} - \bar z_{i_{|S|}} \ge \alpha_{i_1} - \bar z_{i_1} - 1$; $\alpha_{i_1} - \bar z_{i_1} - 1 \ge \alpha_i - \bar z_i$, for $i : i > i_{|S|}$; and the most violated inequality is (31).

Now for $S \subseteq I_2$, again let $\{i_t\}$ be the ordering of $S$ such that $0 = i_0 < i_1 < \cdots < i_{|S|}$. Then for each $j \in I_2 \cup \{0\}$, define

\[ \psi_S^j(z) = \sum_{t=1}^{|S|} (\kappa_{i_t}^j - \kappa_{i_{t-1}}^j)(\alpha_{i_t} - z_{i_t}), \tag{37} \]

\[ \phi_S^j(z) = \psi_S^j(z) + (C - \kappa_{i_{|S|}}^j)(\alpha_{i_1} - 1 - z_{i_1}). \tag{38} \]
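To make (37) and (38) concrete, the following sketch evaluates $\psi_S^j$ and $\phi_S^j$ at a given point. The function names, the dictionary-based inputs and the test point are assumptions; the numerical data are the values of $\alpha_i$ and $\kappa_i^j$ from Example 1.

```python
def psi(kappa_j, alpha, S, z):
    """Evaluate (37): sum over t of (kappa^j_{i_t} - kappa^j_{i_{t-1}}) * (alpha_{i_t} - z_{i_t})."""
    total, previous = 0, 0                   # kappa^j_{i_0} = kappa^j_0 = 0
    for i in S:                              # S is listed as i_1 < ... < i_|S|
        total += (kappa_j[i] - previous) * (alpha[i] - z[i])
        previous = kappa_j[i]
    return total

def phi(kappa_j, alpha, S, z, C):
    """Evaluate (38): psi plus (C - kappa^j_{i_|S|}) * (alpha_{i_1} - 1 - z_{i_1})."""
    first, last = S[0], S[-1]
    return psi(kappa_j, alpha, S, z) + (C - kappa_j[last]) * (alpha[first] - 1 - z[first])

alpha = {3: 1, 4: 2}
kappa_0 = {3: 1, 4: 4}                       # row j = 0 of the kappa_i^j table
kappa_3 = {3: 2, 4: 5}                       # row j = 3 of the kappa_i^j table
z = {3: 0, 4: 1}                             # an arbitrary test point (assumption)
print(psi(kappa_3, alpha, [3, 4], z))        # 2*(1 - 0) + 3*(2 - 1) = 5
print(phi(kappa_0, alpha, [3, 4], z, 5))     # 1*1 + 3*1 + (5 - 4)*(1 - 1 - 0) = 4
```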

Note that these expressions are the right-hand sides of the inequalities (30) and (31), respectively. Note also that, for each $j \in I_1$, $s + 1 - \eta_j \ge \kappa_j - z_j$ is valid for $P^{MMIX}$ (from (17) and the definition of $\eta_j$ and $\kappa_j$). This motivates us to define, for any $j \in I \cup \{0\}$ and any $S \subseteq I_2$,

\[ \theta_S^{j,r}(z) = \begin{cases} \kappa_j - z_j & \text{if } j \in I_1, \\ \psi_S^j(z) & \text{if } j \in I_2 \cup \{0\} \text{ and } r = 0, \\ \phi_S^j(z) & \text{if } j \in I_2 \cup \{0\} \text{ and } r = 1. \end{cases} \]

Using this definition,

\[ s + 1 - \eta_j \ge \theta_S^{j,r}(z) \tag{39} \]

is a valid inequality for any $j \in I \cup \{0\}$, regardless of the choices of $S$ and $r$.

Proposition 6 Let $U_1 \subseteq I_1$, $U_2 \subseteq I_2 \cup \{0\}$, $U = U_1 \cup U_2$ and suppose $U = \{j_1, \ldots, j_{|U|}\}$ with $0 = \eta_{j_0} \le \eta_{j_1} \le \cdots \le \eta_{j_{|U|}}$. For each $j_u \in U_2$ let $r_u \in \{0, 1\}$ and $S_u \subseteq I_2$. The following inequalities are valid for $P^{MMIX}$:

\[ s \ge \sum_{u=1}^{|U|} (\eta_{j_u} - \eta_{j_{u-1}})\, \theta_{S_u}^{j_u, r_u}(z), \tag{40} \]

\[ s \ge \sum_{u=1}^{|U|} (\eta_{j_u} - \eta_{j_{u-1}})\, \theta_{S_u}^{j_u, r_u}(z) + (1 - \eta_{j_{|U|}})(\theta_{S_1}^{j_1, r_1}(z) - 1). \tag{41} \]

Proof: Rewriting (39), we have that $s - \theta_S^{j,r}(z) \ge \eta_j - 1$ for every $j \in U$ and $r \in \{0, 1\}$. The result follows from Lemma 1 with $\Pi = 1$, $\pi^j = \eta_j - 1$, so $\tau^j = \lceil (\eta_j - 1)/1 \rceil = 0$ and $\gamma^j = \eta_j$. □

Many of the above inequalities are dominated. Given a point $(s, z) \in \mathbb{R} \times \mathbb{Z}^n$, the inequality (40) or (41) yielding the largest right-hand side is such that, for each $j \in U$, the corresponding term $\theta_S^{j,r}(z)$ is of maximum value among all possible choices of the set $S$ and $r$. From Proposition 5 (properties 1 and 2), it follows that for $j \in U_2$, the set $S^*$ maximizing $\theta_S^{j,r}(z)$ is the same for every $j$. From Proposition 5 (property 3), $r$ takes the same value for every $j$.

From this Proposition and the definition of $\kappa_i^\ell$, it follows that $S \cup \{0\} \supseteq U_2$. To see this, suppose that $j \in U_2$ and $j \notin S \cup \{0\}$. Let $\ell$ be an index in $S \cup \{0\}$ such that $\eta_\ell = \min\{\eta_i : i \in S \cup \{0\} \text{ and } \eta_i \ge \eta_j\}$. Such an index always exists as $\eta_j \le 1 = \eta_0$. Observe that we have $\kappa_i^\ell = \kappa_i^j$ for $i \in S$, so $\psi_S^\ell(z) = \psi_S^j(z)$ and $\phi_S^\ell(z) = \phi_S^j(z)$. Thus, from Proposition 5, it follows that inequalities (40) and (41) with $j \in U_2$ are dominated by corresponding inequalities where $j$ is replaced by $\ell$ or $j$ is simply removed from $U_2$.

From the above discussion, we have $\theta_S^{j,0}(z) = \psi_S^j(z)$ and $\theta_S^{j,1}(z) = \phi_S^j(z)$ for $j \in S \cup \{0\}$. Define, for $j \in I_1$, $\psi_S^j(z) = \phi_S^j(z) = \kappa_j - z_j$. Thus the non-dominated inequalities are of one of the four following types:

\[ s \ge \sum_{u=1}^{|U|} (\eta_{j_u} - \eta_{j_{u-1}})\, \psi_S^{j_u}(z), \tag{42} \]

\[ s \ge \sum_{u=1}^{|U|} (\eta_{j_u} - \eta_{j_{u-1}})\, \phi_S^{j_u}(z), \tag{43} \]

\[ s \ge \sum_{u=1}^{|U|} (\eta_{j_u} - \eta_{j_{u-1}})\, \psi_S^{j_u}(z) + (1 - \eta_{j_{|U|}})(\psi_S^{j_1}(z) - 1), \tag{44} \]

\[ s \ge \sum_{u=1}^{|U|} (\eta_{j_u} - \eta_{j_{u-1}})\, \phi_S^{j_u}(z) + (1 - \eta_{j_{|U|}})(\phi_S^{j_1}(z) - 1), \tag{45} \]

where $U = U_1 \cup U_2 = \{j_1, \ldots, j_{|U|}\}$ with $0 = \eta_{j_0} \le \eta_{j_1} \le \cdots \le \eta_{j_{|U|}}$, $U_1 \subseteq I_1$, $S \subseteq I_2$, and $U_2 \subseteq S \cup \{0\}$.

Observe that if $0 \in U$, as $\eta_0 = \eta_{j_{|U|}} = 1$, then (42) (resp. (43)) and (44) (resp. (45)) yield the same inequality, and so we do not need to consider (44) and (45) when $0 \in U$. We will see later that when $0 \notin U$ inequalities of the form (42) and (43) are dominated by inequalities (44) and (45).

Example 1 (continued)

Note that we can rewrite (23) as

\[ s + 1 - 0.8 = s + 0.2 \ge 4 - z_1. \]

Thus, if we choose $U_2 = S = \{3, 4\}$ and $U_1 = \{1\}$, (42) and (43) both yield

\[ s \ge 0.6\,[2(1 - z_3) + 3(2 - z_4)] + 0.2\,[4 - z_1] + 0.1\,[1(1 - z_3) + 4(2 - z_4)], \tag{46} \]

while (44) and (45) both yield

\[ s \ge 0.6\,[2(1 - z_3) + 3(2 - z_4)] + 0.2\,[4 - z_1] + 0.1\,[1(1 - z_3) + 4(2 - z_4)] + 0.1\,[2(1 - z_3) + 3(2 - z_4) - 1]. \tag{47} \]

If we choose $S = \{3, 4\}$, $U_2 = S \cup \{0\} = \{0, 3, 4\}$, and $U_1 = \{1\}$, then (42) and (44) both yield

\[ s \ge 0.6\,[2(1 - z_3) + 3(2 - z_4)] + 0.2\,[4 - z_1] + 0.1\,[1(1 - z_3) + 4(2 - z_4)] + 0.1\,[1(1 - z_3) + 3(2 - z_4)], \tag{48} \]

while (43) and (45) both yield

\[ s \ge 0.6\,[2(1 - z_3) + 3(2 - z_4)] + 0.2\,[4 - z_1] + 0.1\,[1(1 - z_3) + 4(2 - z_4)] + 0.1\,[1(1 - z_3) + 3(2 - z_4) + (-z_3)], \tag{49} \]

which coincidentally is the same inequality as (47).

Note that Proposition 5 tells us that this choice of $S$ maximizes both $\psi_S^j(z)$ and $\phi_S^j(z)$, for $j = 0, 3, 4$, if and only if

\[ 1 - z_3 \ge 2 - z_4 \ge -z_3 \quad \text{and} \quad 2 - z_4 \ge 0. \tag{50} \]

It is instructive to note that, given (50), (46) is always dominated by (48). We will see that this holds in general: inequalities of the forms (42) and (43) in which $0 \notin U$ are always dominated by other inequalities in which $0 \in U$. Given (50), there is one more implication worthy of note. If and only if

\[ 1 - z_3 \ge 1 \iff z_3 \le 0 \iff \phi_S^j(z) \ge \psi_S^j(z), \]

then each of (47) and (49) dominates (48). □
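The validity of the inequalities in this example can be sanity-checked by brute force on a finite box of integer points. The sketch below does this for (47) and (48); the box bounds and all function names are assumptions, and a finite enumeration is of course only a spot check, not a proof.

```python
from fractions import Fraction as F
from itertools import product

def psi(kappa_j, z):
    """psi_S^j(z) for S = {3, 4}, with alpha_3 = 1 and alpha_4 = 2."""
    return kappa_j[3] * (1 - z[3]) + (kappa_j[4] - kappa_j[3]) * (2 - z[4])

K0, K3, K4 = {3: 1, 4: 4}, {3: 2, 4: 5}, {3: 1, 4: 5}   # rows of the kappa_i^j table

def rhs_47(z):
    base = F("0.6") * psi(K3, z) + F("0.2") * (4 - z[1]) + F("0.1") * psi(K4, z)
    return base + F("0.1") * (psi(K3, z) - 1)

def rhs_48(z):
    base = F("0.6") * psi(K3, z) + F("0.2") * (4 - z[1]) + F("0.1") * psi(K4, z)
    return base + F("0.1") * psi(K0, z)

def min_s(z):
    """Smallest s satisfying (23)-(26) and (28) for a fixed integer vector z."""
    return max(F(0), F("3.8") - z[1], F("5.3") - z[2],
               F("1.6") - 5 * z[3], F("9.9") - 5 * z[4])

def valid_on_box(rhs, lo=-3, hi=7):
    """Check rhs(z) <= min_s(z) for every integer z in the box [lo, hi]^4."""
    for z1, z2, z3, z4 in product(range(lo, hi + 1), repeat=4):
        z = {1: z1, 2: z2, 3: z3, 4: z4}
        if rhs(z) > min_s(z):
            return False
    return True

print(valid_on_box(rhs_47), valid_on_box(rhs_48))
```

Because both inequalities bound $s$ from below, it suffices to compare each right-hand side with the smallest feasible $s$ at every enumerated $z$.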

3 Separation

The development of the separation algorithm for the Two-Level Mixing Inequalities parallels the development necessary to define (42)–(45). It is easiest to consider separation in two main phases: 1) defining which sets $U_1$, $U_2$, and $S$ define the inequality; and 2) computing the coefficients of each variable in the inequality based on which sets we choose. We will show how to do the first phase in $O(n \log n)$, and how to do the second in $O(n)$, thus defining an $O(n \log n)$ separation algorithm for the inequalities (42)–(45).


3.1 Choosing the Sets

We first prove a Lemma that, given a point, will help us to identify which of the two-level mixing inequalities is most violated by that point.

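Lemma 7 Let $S \subseteq I_2$, $S = \{i_1, \ldots, i_{|S|}\}$, with $i_1 < \cdots < i_{|S|}$, be a set satisfying 1 and 2 in Proposition 5. Then

1. $\psi_S^j(z) \ge 0$ for $j \in I_2 \cup \{0\}$;

2. $\phi_S^j(z) \ge 0$ if $\alpha_{i_1} - z_{i_1} - 1 \ge 0$, for $j \in I_2 \cup \{0\}$;

3. $\psi_S^j(z) \ge \psi_S^\ell(z)$ if $\eta_j \le \eta_\ell$, $j, \ell \in I_2 \cup \{0\}$;

4. $\phi_S^j(z) \ge \phi_S^\ell(z)$ if $\eta_j \le \eta_\ell$, $j, \ell \in I_2 \cup \{0\}$.

Proof: Define $\alpha_{i_{|S|+1}} - z_{i_{|S|+1}} = 0$. For $j \in I_2$,

\[ \psi_S^j(z) = \sum_{t=1}^{|S|} (\kappa_{i_t}^j - \kappa_{i_{t-1}}^j)(\alpha_{i_t} - z_{i_t}) = \sum_{t=1}^{|S|} \kappa_{i_t}^j \big[ (\alpha_{i_t} - z_{i_t}) - (\alpha_{i_{t+1}} - z_{i_{t+1}}) \big]. \]

Since $S$ satisfies the conditions of Proposition 5, $\alpha_{i_t} - z_{i_t} \ge \alpha_{i_{t+1}} - z_{i_{t+1}}$, so $\psi_S^j(z) \ge 0$. If $\eta_j \le \eta_\ell$ then

\[ \kappa_{i_t}^j - \kappa_{i_t}^\ell = \begin{cases} 1 & \text{if } \eta_j \le \eta_{i_t} < \eta_\ell, \\ 0 & \text{otherwise,} \end{cases} \]

so

\[ \psi_S^j(z) - \psi_S^\ell(z) = \sum_{i_t \in S : \, \eta_j \le \eta_{i_t} < \eta_\ell} \big[ (\alpha_{i_t} - z_{i_t}) - (\alpha_{i_{t+1}} - z_{i_{t+1}}) \big] \ge 0. \tag{51} \]

The other inequalities follow from the definition of $\phi_S^j(z)$. □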

Observation 8 The following are consequences of Lemma 7.

1. Since $\eta_0 = 1 \ge \eta_j$ for all $j$ and $\psi_S^0(z) \ge 0$ or $\phi_S^0(z) \ge 0$ when $\alpha_{i_1} - z_{i_1} - 1 \ge 0$, the most violated inequalities of the form (42) or (43) always have $0 \in U$, so $j_{|U|} = 0$ in this case. We noticed before that inequalities (44) and (45) are the same as (42) and (43) respectively when $0 \in U$.

2. The set of inequalities $0 \ge \psi_S^j(z)$ and $0 \ge \phi_S^j(z)$ for $j : \eta_j > \eta_{j_{|U|}}$ is vacuously satisfied when $j_{|U|} = 0$, because $1 = \eta_0 \ge \eta_j$ for all $j$.

The following proposition characterizes the most violated inequality (42)-(45), and it follows from Lemmas 2 and 7 and Proposition 5.

Proposition 9 Given $(\bar s, \bar z) \in \mathbb{R} \times \mathbb{R}^{|I|}$, the most violated inequality (42)-(45) is determined by the choice of sets $U_1 \subseteq I_1$, $S \subseteq I_2$, and $U_2 \subseteq S \cup \{0\}$, where $S = \{i_1, \ldots, i_{|S|}\}$, with $i_1 < \cdots < i_{|S|}$, and $U = U_1 \cup U_2 = \{j_1, \ldots, j_{|U|}\}$ with $0 = \eta_{j_0} \le \eta_{j_1} \le \cdots \le \eta_{j_{|U|}}$, satisfying

1. $\alpha_{i_1} - \bar z_{i_1} \ge \alpha_{i_2} - \bar z_{i_2} \ge \cdots \ge \alpha_{i_{|S|}} - \bar z_{i_{|S|}}$;

2. $\alpha_{i_t} - \bar z_{i_t} \ge \alpha_i - \bar z_i$, for $i : i_{t-1} < i < i_t$, $t = 1, \ldots, |S|$, $i_0 = k$;

3. if $1 \ge \alpha_{i_1} - \bar z_{i_1}$:

(a) $\alpha_{i_{|S|}} - \bar z_{i_{|S|}} \ge 0$;

(b) $0 \ge \alpha_i - \bar z_i$, for $i : i > i_{|S|}$;

(c) $\psi_S^{j_1}(\bar z) \ge \cdots \ge \psi_S^{j_{|U|}}(\bar z) \ge \psi_S^{j_1}(\bar z) - 1$;

(d) $\psi_S^{j_u}(\bar z) \ge \psi_S^{j}(\bar z)$, for $j : \eta_{j_{u-1}} < \eta_j < \eta_{j_u}$, $u = 1, \ldots, |U|$;

(e) if $\psi_S^0(\bar z) \ge \psi_S^{j_1}(\bar z) - 1$, then $j_{|U|} = 0$ and the most violated inequality is of type (42); if $\psi_S^{j_1}(\bar z) - 1 \ge \psi_S^0(\bar z)$, then $0 \notin U$, $\psi_S^{j_1}(\bar z) - 1 \ge \psi_S^{j}(\bar z)$ for $j : \eta_j > \eta_{j_{|U|}}$, and the most violated inequality is of type (44).

4. if $\alpha_{i_1} - \bar z_{i_1} - 1 \ge 0$:

(a) $\alpha_{i_{|S|}} - \bar z_{i_{|S|}} \ge \alpha_{i_1} - \bar z_{i_1} - 1$;

(b) $\alpha_{i_1} - \bar z_{i_1} - 1 \ge \alpha_i - \bar z_i$, for $i : i > i_{|S|}$;

(c) $\phi_S^{j_1}(\bar z) \ge \cdots \ge \phi_S^{j_{|U|}}(\bar z) \ge \phi_S^{j_1}(\bar z) - 1$;

(d) $\phi_S^{j_u}(\bar z) \ge \phi_S^{j}(\bar z)$, for $j : \eta_{j_{u-1}} < \eta_j < \eta_{j_u}$, $u = 1, \ldots, |U|$;

(e) if $\phi_S^0(\bar z) \ge \phi_S^{j_1}(\bar z) - 1$, then $j_{|U|} = 0$ and the most violated inequality is of type (43); if $\phi_S^{j_1}(\bar z) - 1 \ge \phi_S^0(\bar z)$, then $0 \notin U$, $\phi_S^{j_1}(\bar z) - 1 \ge \phi_S^{j}(\bar z)$ for $j : \eta_j > \eta_{j_{|U|}}$, and the most violated inequality is of type (45).

Observe that Lemma 7 shows that many of the inequalities in items (c) and (d) in Proposition 9 are redundant, because they are implied by conditions 1 and 2 and the definitions of $\psi_S^j(\cdot)$ and $\phi_S^j(\cdot)$.

One of the implications of Proposition 9 is that separating for the most violated two-level mixing inequality amounts to 1) choosing the best set $S \subseteq I_2$ and then 2) choosing the best set $U \subseteq I_1 \cup S \cup \{0\}$. The first can be done essentially by sorting values of $\alpha_i - \bar z_i$, $i \in I_2$; the second can be done essentially by sorting values of $\psi_S^j(\bar z)$ or $\phi_S^j(\bar z)$ (depending on whether or not $\alpha_{i_1} - \bar z_{i_1} \ge 1$), $j \in I_1 \cup S \cup \{0\}$.

Note that we can compute $\psi_S^j(\bar z)$, $j \in S \cup \{0\}$, by first computing $\psi_S^0(\bar z)$ and then applying (51), according to the ordering $\eta_{j_{|U|}}, \ldots, \eta_{j_1}, \eta_{j_0}$. Thus, we can compute the set of all these values in time $O(n)$. Hence the complexity is dominated by two orderings of at most $n$ coefficients.

We have now proved the following.

Proposition 10 Given a point $(\bar s, \bar z)$, the sets defining the most violated inequality of the forms (42)–(45) can be identified in $O(n \log n)$.

This is remarkable, given that separating for the original mixing inequalities themselves takes only $O(n \log n)$. Note that we need to do more work to actually define the inequality that is defined by the sets referred to in Proposition 10: we need to determine the coefficients of each of the variables. This is the subject of the next subsection.
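The first phase, choosing $S$, can be sketched as a single right-to-left scan over $I_2$. The code below is one possible reading of conditions 1–3 of Proposition 5, not the authors' implementation: the function name `choose_S`, the tie-breaking by strict comparison, and the assumption that $I_2$ is already listed in the order $\delta_{k+1} \le \cdots \le \delta_n$ are all choices made here for illustration.

```python
from fractions import Fraction

def choose_S(I2, alpha, z_bar):
    """Pick S within I2 satisfying conditions 1-3 of Proposition 5.

    I2 must be listed in the paper's order (delta nondecreasing). Returns
    (S, use_31), where use_31 says whether the conditions for (31) apply.
    """
    value = {i: alpha[i] - z_bar[i] for i in I2}
    largest = max(value.values())
    use_31 = largest - 1 >= 0                # second alternative of condition 3
    threshold = largest - 1 if use_31 else 0
    S, running_max = [], threshold
    for i in reversed(I2):                   # scan from the last index down
        if value[i] > running_max:           # strictly above everything kept so far
            S.append(i)
            running_max = value[i]
    S.reverse()                              # return S as i_1 < ... < i_|S|
    return S, use_31

alpha = {3: 1, 4: 2}                         # data of Example 1
print(choose_S([3, 4], alpha, {3: Fraction("0.2"), 4: Fraction("1.4")}))
print(choose_S([3, 4], alpha, {3: Fraction(-1), 4: Fraction("1.5")}))
```

Indices whose value $\alpha_i - \bar z_i$ does not exceed the running maximum are skipped, which is what conditions 1 and 2 require of the indices left out of $S$; when the largest value equals 1, both alternatives of condition 3 apply and this sketch reports the second.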

3.2 Computing the Coefficients

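Let $U_1 \subseteq I_1$, $S \subseteq I_2$ and $U_2 \subseteq S \cup \{0\}$ be given. We can write inequality (42) as

\[ s \ge \sum_{[i] \in U_1} (\eta_{[i]} - \eta_{[i-1]})(\kappa_{[i]} - z_{[i]}) + \sum_{i \in S} A_i(\alpha_i - z_i), \]

where

\[ A_i = \sum_{[t] \in U_2} (\eta_{[t]} - \eta_{[t-1]})(\kappa_i^{[t]} - \kappa_{i-1}^{[t]}). \]

Observe that

\[ \kappa_i^{[t]} - \kappa_{i-1}^{[t]} = \begin{cases} \kappa_i - \kappa_{i-1} + 1 & \text{if } \eta_{i-1} < \eta_{[t]} \le \eta_i, \\ \kappa_i - \kappa_{i-1} - 1 & \text{if } \eta_i < \eta_{[t]} \le \eta_{i-1}, \\ \kappa_i - \kappa_{i-1} & \text{otherwise.} \end{cases} \]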

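Therefore

\[ A_i = (\kappa_i - \kappa_{i-1}) \sum_{[t] \in U_2} (\eta_{[t]} - \eta_{[t-1]}) + \sum_{\substack{[t] \in U_2 \\ \eta_{i-1} < \eta_{[t]} \le \eta_i}} (\eta_{[t]} - \eta_{[t-1]}) - \sum_{\substack{[t] \in U_2 \\ \eta_i < \eta_{[t]} \le \eta_{i-1}}} (\eta_{[t]} - \eta_{[t-1]}) \]

\[ = (\kappa_i - \kappa_{i-1})\, G_{[|U|]} + (G_i - G_{i-1})^+ - (G_{i-1} - G_i)^+ \]

\[ = (\kappa_i - \kappa_{i-1})\, G_{[|U|]} + (G_i - G_{i-1}), \]

where $G_i = \sum_{[t] \in U_2 : \, \eta_{[t]} \le \eta_i} (\eta_{[t]} - \eta_{[t-1]})$ and $U = U_1 \cup U_2$. All values $G_i$ can be computed in $O(n)$ time using the recurrence

\[ G_{[t]} = G_{[t-1]} + \begin{cases} \eta_{[t]} - \eta_{[t-1]} & \text{if } [t] \in U_2, \\ 0 & \text{otherwise.} \end{cases} \]

Then, computing each coefficient $A_i$ can be done in constant time.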
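The closed form for $A_i$ can be checked against its defining sum on the data of Example 1. In the sketch below, $i - 1$ is read as the predecessor of $i$ in $S$ (with $i_0 = 0$, $\kappa_0 = 0$, $\eta_0 = 1$), which is an interpretation made here; the function name and input layout are also assumptions. For brevity $G_i$ is recomputed by direct summation rather than by the $O(n)$ recurrence.

```python
from fractions import Fraction as F

def coefficients_A(S, U, U2, kappa, eta):
    """Compute A_i, i in S, for inequality (42) in two ways.

    S is listed as i_1 < ... < i_|S|; U is listed by nondecreasing eta.
    Returns (direct, fast): the defining sum and the G-based closed form.
    """
    def kappa_ji(j, i):                      # kappa_i^j, using kappa_0 = 0, eta_0 = 1
        return kappa[i] if eta[i] >= eta[j] else kappa[i] - 1

    # Increments eta_[t] - eta_[t-1] along the ordering of U, with eta_[0] = 0.
    step, previous = {}, F(0)
    for j in U:
        step[j] = eta[j] - previous
        previous = eta[j]

    def G(i):                                # increments over [t] in U2 with eta_[t] <= eta_i
        return sum((step[j] for j in U2 if eta[j] <= eta[i]), F(0))

    G_total = sum((step[j] for j in U2), F(0))
    direct, fast, prev_i = {}, {}, 0         # i_0 = 0
    for i in S:
        direct[i] = sum((step[j] * (kappa_ji(j, i) - kappa_ji(j, prev_i)) for j in U2), F(0))
        fast[i] = (kappa[i] - kappa[prev_i]) * G_total + (G(i) - G(prev_i))
        prev_i = i
    return direct, fast

kappa = {0: 0, 1: 4, 3: 2, 4: 5}
eta = {0: F(1), 1: F("0.8"), 3: F("0.6"), 4: F("0.9")}
direct, fast = coefficients_A([3, 4], [3, 1, 4], [3, 4], kappa, eta)
print(direct == fast, direct)                # A_3 = 13/10 and A_4 = 11/5, as in (46)
```

With $U_2 = S = \{3, 4\}$ and $U_1 = \{1\}$, the values $A_3 = 0.6 \cdot 2 + 0.1 \cdot 1$ and $A_4 = 0.6 \cdot 3 + 0.1 \cdot 4$ are the coefficients of $(1 - z_3)$ and $(2 - z_4)$ in (46).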

The computation of the coefficients of inequalities (43),(44), and (45) can be done in a similar way.

4 The Convex Hull

In this section we show that the convex hull of $P^{MMIX}$ is described by inequalities (42)–(45). This is done by showing that each face defined by any of those inequalities is integral, that is, all the extreme points in each face have integer coordinates $z$. We will refer to faces of polyhedra as being “integral” with this meaning throughout the paper. Then we discuss the possibility of adding other constraints to the system (42)–(45), while keeping integrality. We start with an important Lemma.

Lemma 11 Let $P_n = \{z \in \mathbb{R}^n : Az \le b\}$ be an integral polyhedron in $\mathbb{R}^n$. Assume that $\psi^1(\cdot)$, $\psi^2(\cdot)$ are linear functions, that $\psi^1(z) \ge \psi^2(z)$ is implied by $Az \le b$, and that $\psi^j(z) \in \mathbb{Z}$ if $z \in \mathbb{Z}^n$, $j = 1, 2$. Then the polyhedron $P_{n+1} = \{(z, y) \in \mathbb{R}^n \times \mathbb{R}^1 : Az \le b, \ \psi^1(z) \ge y \ge \psi^2(z)\}$ is integral in $\mathbb{R}^{n+1}$.

Proof: Consider some extreme point $(\bar z, \bar y)$ of $P_{n+1}$. Note that $\psi^1(\bar z) > \bar y > \psi^2(\bar z)$ cannot hold, because otherwise we could easily construct two points of which $(\bar z, \bar y)$ is a convex combination.

If $\psi^1(\bar z) = \bar y$, then $\bar z$ must be an extreme point of $P_n$. Otherwise, we can write $\bar z = \sum_{k=1}^{K} \lambda_k z^k$ with $z^k \in P_n$, $\lambda_k \ge 0$ for $k = 1, \ldots, K$ and $\sum_{k=1}^{K} \lambda_k = 1$. Then taking $y^k = \psi^1(z^k)$, $k = 1, \ldots, K$, would yield $(\bar z, \bar y) = \sum_{k=1}^{K} \lambda_k (z^k, y^k)$,