On the elimination of inessential points in the smallest enclosing ball problem

(1)

HAL Id: hal-01863587

https://hal.archives-ouvertes.fr/hal-01863587

Submitted on 28 Aug 2018

HAL is a multi-disciplinary open access

archive for the deposit and dissemination of sci-entific research documents, whether they are pub-lished or not. The documents may come from teaching and research institutions in France or

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires

To cite this version:

Luc Pronzato. On the elimination of inessential points in the smallest enclosing ball prob-lem. Optimization Methods and Software, Taylor & Francis, 2019, 34 (2), pp.225-247. �10.1080/10556788.2017.1359266�. �hal-01863587�

(2)

Vol. 00, No. 00, Month 20XX, 1–23

On the elimination of inessential points in the smallest enclosing ball problem

L. Pronzatoa∗

a_Universit´_{e Cˆ}_{ote d’Azur, CNRS, Laboratoire I3S, Sophia Antipolis}

(Received 00 Month 20XX; final version received 00 Month 20XX)

We consider the construction of the smallest ballB∗enclosing a setXnformed by n points

in Rd. We show that any probability measure on Xn, with mean c and variance matrix

V , provides a lower bound b on the distance to c of any point on the boundary of B∗, with b having a simple expression in terms of c and V . This inequality permits to remove inessential points fromXn, which do not participate to the definition ofB∗, and can be used

to accelerate algorithms for the construction ofB∗. We show that this inequality is, in some sense, the best possible. A series of numerical examples indicates that, when d is reasonably small (d ≤ 10, say) and n is large (up to 105_{), the elimination of inessential points by a}

suitable two-point measure, followed by a direct (exact) solution by quadratic programming, outperforms iterative methods that compute an approximate solution by solving the dual problem.

Keywords: minimum enclosing ball; smallest ball; Chebyshev centre; core sets; optimal design of experiments

AMS Subject Classification: 90C25; 90C46; 62K05

1. Introduction

Given a set of n points Xn = {X1, . . . , Xn} ⊂ Rd, d ≥ 2, we consider the algorithmic

construction of the minimum ballB∗(Xn) enclosingXn. We are interested in particular

in the situation where d is reasonably small but n can be large. For c ∈Rd_{and r ∈}_R+_,

we denote by Bd(c, r) the (closed) ball {X ∈Rd: kX − ck ≤ r}, with k.k the Euclidean

norm. We shall write B∗(Xn) = Bd(cn∗, rn∗), where c∗n (the Chebyshev centre of Xn)

minimises

f (c) = max

i=1,...,nkXi− ck

2 ₍₁₎

with respect to c ∈ Rd and r_n∗ = maxi=1,...,nkXi − c∗nk. A ball Bd(c, r) is said to be

a (1 + )-approximation to B∗(Xn), > 0, when Xn ⊂ Bd(c, r) and r ≤ (1 + )rn∗; a

subset Xq ⊆ Xn is said to be an -core set of Xn if B∗(Xq) =Bd(c∗q, rq∗) is such that

r∗_q ≤ r∗

n≤ (1 + )r∗q.

The construction of B∗(Xn) is a classical optimisation problem, for which many

al-gorithms have been proposed in the literature, see, e.g., the historical sketch in [7] and the references in [28]. A recent application concerns the construction of space-filling

(3)

signs for compter experiments based on an extension of Lloyd’s clustering algorithm [16]. Some methods are exact and rely on extensions of linear programming algorithms, see [5, 9, 26]; some use the dual formulation of the problem and construct a sequence of (1 + k)-approximations of B∗(Xn) with k tending to zero, see [4, 28]. The former are

exponential in d and are thus restricted to problems with moderate dimension (d. 20, say); the latter can also solve problems with large d and compute a (1 + )-approximation toB∗(Xn) in O(nd/) arithmetic operations, returning an -core set of size O(1/), see

[6, 28].

Both types of methods can strongly benefit from a reduction of the size ofXn, in the

same way as algorithms for the construction of the minimum-volume ellipsoid containing Xn can be accelerated when inessential points are eliminated by the inequality of [11],

see [24, Sect. 3.6]. The objective of removing inessential points presents some similarities with that of obtaining small -core sets, with one capital difference though: a point Xi is

called inessential when B∗(Xn\ {Xi}) exactly coincides with B∗(Xn), which happens

in particular when Xi lies in the interior of B∗(Xn). By removing inessential points,

we thus aim at constructing small 0-core sets. Although we know there always exists a 0-core set of size at most d + 1, its construction requires the knowledge of B∗(Xn). The

objective of the paper is to derive a simple inequality that any point Xj on the boundary

ofB∗(Xn) must satisfy, without knowingB∗(Xn). More precisely, we show that for any

probability measure ξ onXn, with c(ξ) and V (ξ) the corresponding mean and covariance

matrix respectively, any point Xj on the boundary of B∗(Xn) satisfies

kXj− c(ξ)k2 ≥ trace[V (ξ)] + γ(ξ) −

p

γ(ξ){2trace[V (ξ)] + γ(ξ)} , (2)

where γ(ξ) = maxi=1,...,nkXi − c(ξ)k2 − trace[V (ξ)]. We also prove that this bound on

kX_j − c(ξ)k2 _{is, in some sense, the best possible,} _{and a comparison with the bound} previously proposed in [2] is provided. Since algorithms based on the dual formulation of the smallest enclosing ball problem generate a sequence of measures ξk, they provide for free a sequence of inequalities that can be used as sieves to eliminate inessential points fromXn, and thereby generate a sequence of 0-core sets of decreasing size. When

imbedded in the algorithm, these sieves yield an increasing simplification of iterations, and thus an acceleration of the algorithm, see [2]. Moreover, the 0-core set obtained after a few iterations may be small enough to allow the efficient use of an exact quadratic programming (QP) algorithm for the construction of B∗(Xn)

The paper is organised as follows. Section 2 introduces the notation and presents the QP and dual formulations of the minimum enclosing ball problem. The inequality (2) is proved in Section 3, where we also explain why it cannot be improved. Two iterative algo-rithms are presented in Section 4: a multiplicative algorithm inspired from experimental design theory and the vertex-direction algorithm of [28]. Some computational results are presented in Section 5 that illustrate the benefit of the elimination of inessential points, when using an iterative algorithm to solve the dual problem or before using QP for the direct approach. In particular, they indicate that for moderate d the application of an exact QP algorithm to the resulting 0-core set yields the exact minimum ball B∗(Xn)

(4)

2. Quadratic programming and dual formulations

For any Xi ∈ Xn and c and c0 in Rd, we can write kXi− ck2 = kXi− c0k2− 2(Xi−

c0)>(c − c0) + kc − c0k2. Therefore, f (c) defined in (1) can be written as

f (c) = max i=1,...,n n kXi− c0k2− 2(Xi− c0)>(c − c0) o + kc − c0k2, (3)

and its minimisation is equivalent to the minimisation of kc − c0k2 + t with respect to

(c, t) ∈Rd+1, subject to the n linear constraints

kXi− c0k2− 2(Xi− c0)>(c − c0) ≤ t , i = 1, . . . , n . (4)

When d is small enough, simplex-type or projection methods can thus be used to obtain the exact solution in finite time (assuming calculations with infinite precision); see in particular the introduction of [10] and the references therein. In case the QP solver requires a strictly convex problem, one may add a regularisation term, quadratic in t, to the objective function and minimise kc − c0k2+ t + δt2 with δ arbitrarily small (the

solution obtained being however not exact in this case).

On the other hand, the dual formulation of the problem yields iterative methods that construct a sequence of (1+k)-approximations ofB∗(Xn) with ktending to zero, which

are of particular interest when d is large. Direct calculation, using Lagrangian duality, shows that the construction of B∗(Xn) is equivalent to the determination of Lagrange

coefficients that define weights w = (w1, . . . , wn) in the probability simplex

Pn= {w ∈Rn: n X i=1 wi = 1 and wi ≥ 0, i = 1, . . . , n} (5) and maximise φ(w) = trace[V (w)] = n X i=1 wikXi− c(w)k2, (6) where c(w) =Pn

i=1wiXi and V (w) =Pni=1wi[Xi− c(w)][Xi− c(w)]>; see, e.g., [7, 28].

The centre c∗_n of B∗(Xn) corresponds to c(w∗) for the optimal weights w∗ maximising

φ(w), and its radius r_n∗ equals pφ(w∗_{). The weights w define a probability measure ξ on}

the Xi, and c(w) and V (w) respectively correspond to the mean and variance matrix for

ξ (which, with a slight abuse of notation, we shall also denote by c(ξ) and V (ξ)).

There exist other geometrical problems for which the dual is known to correspond to an optimal design problem, i.e., to the construction of an optimal probability measure on Xn. In particular, the determination of the minimum-volume ellipsoid, with fixed

centre c, containingXn, is equivalent to the D-optimal design problem corresponding to

the maximisation of det[M (w)] with respect to w = (w1, . . . , wn) ∈Pn, with M (w) the

information matrix M (w) = n X i=1 wi(Xi− c)(Xi− c)>

(5)

for the estimation of the d unknown parameters θ in the linear regression model yi = (Xi−

c)>θ + εi, where the εi are i.i.d. observation errors; see [19]. The optimal ellipsoid is given

by {x ∈Rd: (x − c)>M (w∗)(x − c) ≤ 1/d}, with w∗ an optimal vector of weights. When the centre of the ellipsoid is free, the determination of the minimum-volume enclosing ellipsoid forms a D-optimal design problem in Rd+1 [21]: the optimal ellipsoid is given by the intersection between the minimum enclosing ellipsoid, centred at the origin, for the n points (Xi, 1) ∈ Rd+1, and the hyperplane {z ∈ Rd+1 : zd+1 = 1}; see also [17,

Sects. 5.6 & 9.1] and the references therein. The connection between the construction of the thinnest covering cylinder and a Ds-optimal design problem is established in [20] for

cylinders with fixed centre and in [21] when the centre is free.

On the other hand, the maximisation of trace[V (w)] in (6) is not equivalent to an A-optimal design problem, for which one minimises trace[M−1(w)] for some information matrix M (w). As shown in the next section, the connection with an optimal design prob-lem can nevertheless be used to derive the inequality (2), using an approach resembling that in [11].

3. An inequality to eliminate inessential points

Consider the more general situation where X denotes a compact subset of Rd, with Ξ the set of probability measures onX . For any ξ ∈ Ξ, denote

c(ξ) = Eξ(x) = Z X x ξ(dx) and φ(ξ) = trace[Var(ξ)] = Z X kx − c(ξ)k 2_{ξ(dx) ,} ₍₇₎

so that c(ξ) = c(w) and φ(ξ) = φ(w) in the finite case where X = Xn with wi = ξ(Xi),

i = 1, . . . , n. The dual problem to the determination of B∗(X ) corresponds to the maximisation of φ(ξ) with respect to ξ ∈ Ξ: the centre c∗and radius r∗ ofB∗(X ) satisfy c∗= c(ξ∗) and r∗ =pφ(ξ∗_{), where ξ}∗ _{maximises φ(ξ) with respect to ξ ∈ Ξ.}

3.1 A necessary and sufficient condition for optimality

First note that Ξ is convex: for any ξ, ν ∈ Ξ and α ∈ [0, 1], (1 − α)ξ + αν ∈ Ξ. Denote g(α) = φ[(1 − α)ξ + αν], which is a quadratic function of α. The directional derivative of φ(ξ) at ξ in the direction ν ∈ Ξ is given by

Fφ(ξ; ν) = dg(α) dα α=0 = Z kx − c(ξ)k2ν(dx) − φ(ξ) . (8)

Note that d2g(α)/dα2 = −2kc(ν) − c(ξ)k2 ≤ 0, showing that φ(·) is concave. It is not strictly concave1,but any pair ξ_a∗ and ξ_b∗ of optimal measures necessarily satisfy c(ξ∗_a) = c(ξ_b∗), implying that the optimal ball is unique. Concavity implies that ξ∗ ∈ Ξ is optimal if and only if Fφ(ξ∗; ν) ≤ 0 for all ν ∈ Ξ. This is equivalent to Fφ(ξ∗; δx) ≤ 0 for all x ∈X ,

with δx the delta measure at x. Moreover, Fφ(ξ∗; ξ∗) = 0 implies that Fφ(ξ∗; δx) = 0 for

any x in the support of ξ; that is, ξ∗_{{x ∈ R}d _{: F}

φ(ξ∗; δx) = 0} = 1. We thus obtain the

following property, usually called Equivalence Theorem in experimental design theory

(see, e.g., [8, 12, 14, 18]). When X is finite, the conditions are equivalent to the Karush-Kuhn-Tucker optimality conditions in [28]; see also [7].

1_{There may exist ξ 6= ν such that φ(ξ) = φ(ν) and c(ξ) = c(ν), and then g(α) is constant for all α ∈ [0, 1] (think} for example ofXngiven by the vertices of several regular simplices in Rdall having the same centre).

(6)

Theorem 3.1 The centre of B∗(X ) is given by c(ξ∗), where ξ∗ ∈ Ξ satisfies any of the three following equivalent conditions:

(i) ξ∗ maximises φ(ξ) with respect to ξ ∈ Ξ,

(ii) ξ∗ minimises maxx∈X kx − c(ξ)k2 with respect to ξ ∈ Ξ,

(iii)

kx − c(ξ∗)k2≤ φ(ξ∗) for all x ∈X . (9)

Moreover, kx − c(ξ∗)k2 = φ(ξ∗) for any x in the support of ξ∗.

3.2 (1 + )-approximations and -core sets For any ξ ∈ Ξ, define

γ(ξ) = max

x∈X kx − c(ξ)k

2_{− φ(ξ) .} ₍₁₀₎

Since γ(ξ) = maxx∈X Fφ(ξ; δx), Theorem 3.1 indicates that γ(ξ) ≥ 0 for all ξ ∈X , with

γ(ξ∗) = 0. In some sense, γ(ξ) quantifies the (absolute) suboptimality of the measure ξ. In this section we show how it is related to the (relative) notions of (1 + )-approximation and -core set introduced in Section 1.

Consider the ballB(ξ) = Bd(c(ξ),pγ(ξ) + φ(ξ)). It containsX by construction, and

Theorem 3.1 indicates that the radius of B∗(X ) equals pφ(ξ∗_{) ≥} _{pφ(ξ). Therefore,}

B(ξ) forms a (1 + )-approximation of B∗₍_{X ) for}

= (ξ) = [1 + γ(ξ)/φ(ξ)]1/2− 1. (11)

LetS (ξ) denote any compact subset of X such that ξ[S (ξ)] = 1 (the support of ξ, say). From Theorem 3.1, the radius r∗(ξ) of the smallest ball enclosing S (ξ) is not smaller than pφ(ξ), so that

p

φ(ξ) ≤ r∗(ξ) ≤pφ(ξ∗_{) ,} ₍₁₂₎

where the second inequality follows from S (ξ) ⊂ X . On the other hand, φ(ξ∗) = min

c∈Rdmax_x∈X kx − ck

2_{≤ max}

x∈X kx − c(ξ)k

2 _{= γ(ξ) + φ(ξ)} ₍₁₃₎

(which is also a direct consequence of the concavity of φ(·), which implies that, for any ξ ∈ Ξ, φ(ξ∗) ≤ φ(ξ) + Fφ(ξ; ξ∗) ≤ φ(ξ) + maxx∈X Fφ(ξ; δx) = γ(ξ) + φ(ξ)). Therefore,

the combination of (12) and (13) gives

r∗(ξ) ≤pφ(ξ∗_{) ≤ [1 + γ(ξ)/φ(ξ)]}1/2_r∗_{(ξ) ,}

indicating that S (ξ) is an -core set for given by (11).

These connections are used in particular in [28] to give a thorough characterisation of the convergence properties of two algorithms that generate sequences of measures ξk, in terms of their associated (1 + k)-approximations and k-core sets. See also Section 4.

(7)

3.3 The inequality

Following an approach similar to [11, 15], we now prove the main result of the paper.

Theorem 3.2 For any compact subset X ⊂ Rd and any probability measure ξ onX , any y ∈X such that

ky − c(ξ)k2< b[φ(ξ), γ(ξ)] = φ(ξ) + γ(ξ) −pγ(ξ)[2φ(ξ) + γ(ξ)] , (14)

where φ(ξ) and γ(ξ) are respectively defined by (7) and (10), is in the interior of the smallest ball B∗(X ) enclosing X .

Proof. Take any ξ in Ξ and consider γ(ξ) defined by (10). Then, kx−c(ξ)k2 ≤ φ(ξ)+γ(ξ) for all x ∈X , which implies that

Z

X

kx − c(ξ)k2_ξ∗_{(dx) = φ(ξ}∗_{) + kc(ξ}∗_{) − c(ξ)k}2 _{≤ φ(ξ) + γ(ξ)} ₍₁₅₎

for an optimal measure ξ∗. Also, (9) implies

Z

X

kx − c(ξ∗)k2ξ(dx) = φ(ξ) + kc(ξ∗) − c(ξ)k2 ≤ φ(ξ∗) . (16)

Consider now any y on the boundary of B∗(X ). From Theorem 3.1 and the triangular inequality, it satisfies ky − c(ξ∗)k =pφ(ξ∗_{) ≤ ky − c(ξ)k + kc(ξ}∗_{) − c(ξ)k, that is,}

ky − c(ξ)k ≥pφ(ξ∗_{) − kc(ξ}∗_{) − c(ξ)k .} ₍₁₇₎

We do not know the values of φ(ξ∗) and c(ξ∗), but we can compute a lower bound on the right-hand side of (17), using (15) and (16). Denote u =pφ(ξ∗_{) and v = kc(ξ}∗_{) − c(ξ)k.}

The set {(u, v) ∈ R2 : u2 + v2 ≤ φ(ξ) + γ(ξ) and u2 _{− v}2 _{≥ φ(ξ)} is convex, and the}

minimum of u − v is obtained for u =pφ(ξ) + γ(ξ)/2 and v = pγ(ξ)/2;Figure 1 gives an illustration. Therefore, (17) implies that ky − c(ξ)k2≥ b[φ(ξ), γ(ξ)].

Figure 1. Determination of the lower bound (14) in the proof of Theorem 3.2: admissible set for (u, v) (coloured) and optimum point minimising u − v (dot).

(8)

Note that b(φ, γ) = φ + γ −pγ[2φ + γ] is decreasing in γ, with b(φ, 0) = φ and limγ→∞b(φ, γ) = 0. The right-hand side of (14) gives the tightest lower bound on ky −

c(ξ)k2 for a y on the boundary ofB∗(X ), in the sense of the following theorem. Theorem 3.3 For any integer d ≥ 2 any γ > 0 and δ > 0, there exist a compact subset X of Rd_{, a probability measure ξ on}_{X , and a point y on the boundary of B}∗_{(X ) such}

that γ = maxx∈X kx − c(ξ)k2− φ(ξ) and ky − c(ξ)k2 < b[φ(ξ), γ] + δ, with b(φ, γ) as in

Theorem 3.2.

Proof. The proof relies on the construction of an example. The dimension d is irrelevant, and we only need to consider a finite setX3 with three points X1, X2 and X3whose first

two coordinates are respectively (0, −1), (0, 1) and (1 + a, 0), a > 0, with ξ the measure that allocates weights α, α, and 1 − 2α to X1, X2 and X3, α ∈ (0, 1/2). Then, the first

two coordinates of c(ξ) are ((1 − 2α)(1 + a), 0), and φ(ξ) = 2α[1 + (1 + a)2(1 − 2α)]. Also, kX1 − c(ξ)k2− φ(ξ) = kX2 − c(ξ)k2 − φ(ξ) = (1 − 2α)[(1 + a)2(1 − 4α) + 1] and

kX3− c(ξ)k2− φ(ξ) = −2α[(1 + a)2(1 − 4α) + 1], so that γ = kX1− c(ξ)k2− φ(ξ) = kX2−

c(ξ)k2−φ(ξ) for any a ≥ 0 when α < 1/4. For any α < 1/4 and δ > 0, we can then choose a smaller than some h(α, δ) to obtain kX3− c(ξ)k2 < φ(ξ) + γ −pγ[2φ(ξ) + γ] + δ. For

instance, when α = 1/6, we can take a < h(1/6, δ) =p9 δ − 1 + 2√27 δ2_{+ 9 δ + 1 − 1.}

On the other hand, the smallest ball containing {X1, X2} isBd(0, 1), which shows that

X3 is on the boundary ofB∗(X3) since kX3k > 1.

It is instructive to compare the bound b[φ(ξ), γ(ξ)] in (14) with that derived in [2]. One may first note that (15) and (16) imply that

for any ξ ∈ Ξ , kc(ξ∗) − c(ξ)k2 ≤ γ(ξ)

2 = φ(ξ)

(2 + 2)

2 , (18)

with given by (11), whereas the simple geometric arguments used in [2] only give kc(ξ∗) − c(ξ)k2≤ φ(ξ) (2 + 2_{). In the same paper, the authors combine this inequality}

with (17) and obtain that any point y on the boundary of B∗(X ) satisfies ky − c(ξ)k ≥pφ(ξ) [1 − (2 + 2)1/2] =pφ(ξ) [1 −pγ(ξ)/pφ(ξ)] .

Note that γ(ξ) must be smaller than φ(ξ) (i.e., < √2 − 1) in order to get a positive bound able to eliminate points. To compare this result with Theorem 3.2, denote

bAY[φ(ξ), γ(ξ)] = φ(ξ)[max{1 −

p

γ(ξ)/pφ(ξ), 0}]2; (19)

bAY(φ, γ) is decreasing in γ, with bAY(φ, 0) = φ and bAY(φ, γ) = 0 for γ ≥ φ, and

bAY(φ, γ) < b(φ, γ) given by (14) for any φ > 0 and γ > 0. We can also write bAY(φ, γ) =

φ(ξ)[max{1−(2+2)1/2, 0}]2and b(φ, γ) = φ[(1+)2−{(2+)[1+(1+)2_]}1/2_{], with the}

approximation level = (1 + γ/φ)1/2− 1, see (11). Figure 2-left presents b(φ, γ)/φ (solid line) and bAY(φ, γ)/φ (dashed line) as functions of ∈ [0, 1]; the difference between the

two curves is shown on the right part. The superiority of b(φ, γ) compared to bAY(φ, γ)

is also significant for small , so that when approaching the optimum with an iterative algorithm, the elimination of inessential points is likely to be more efficient with (14) than when using the bound in [2]. Note that the computational costs of the two bounds are roughly equivalent.

(9)

Figure 2. b(φ, γ)/φ (solid line, left), bAY(φ, γ)/φ (dashed line, left) and [b(φ, γ) − bAY(φ, γ)]/φ (right) as functions of = (1 + γ/φ)1/2− 1.

3.4 Effectiveness of the elimination

Take any probability measure ξ on X and consider a point y eliminated by (14), that is, such that ky − c(ξ)k2 < b[φ(ξ), γ(ξ)]. By construction of the bound (14), it satisfies ky − c(ξ∗)k ≤ pφ(ξ∗) (this can be directly checked, using the triangular inequality ky − c(ξ∗_{)k ≤ ky − c(ξ)k + kc(ξ}∗_{) − c(ξ)k and the inequalities (16) and (18)). Therefore,}

y belongs to B∗(X ) = Bd(c(ξ∗),pφ(ξ∗)). LetI (ξ) denote the set of inessential points

eliminated by (14) and µ denote the Lebesgue measure on X . We thus have

ω(ξ) = µ[I (ξ) ∩ B ∗₍_{X )]} µ[B∗₍_{X )]} = µ[I (ξ)] µ[B∗₍_{X )]} ≤ b[φ(ξ), γ(ξ)] φ(ξ∗₎ d/2 .

Denote δ(ξ) = γ(ξ)/φ(ξ), and suppose that δ(ξ) = δ > 0. Then, b[φ(ξ), γ(ξ)] = φ(ξ)(1 + δ −pδ(2 + δ)) and Lemma 3.2 of [28] implies that φ(ξ∗) > φ(ξ)(1 + δ2/[4(1 + δ)]). Therefore,

ω(ξ) < hd/2(δ) , (20)

with h(δ) = 4(1 + δ)(1 + δ − pδ(2 + δ))/[4(1 + δ) + δ2] < 1, implying that µ[I (ξ)]/µ[B∗_{(X )] → 0 as d → ∞. We can thus expect that in general, for points}

Xi approximately uniformly distributed in a compact set, the effectiveness of the sieve

formed by (14) will decrease as the dimension d increases. This can be investigated more precisely in some simple situations. Define

α(ξ) = µ[I (ξ)] µ(X ) ,

the proportion of points eliminated by (14), and let ξu denote the uniform probability

measure on X .

X is the d-dimensional ball Bd(0, 1). In that case, X = B∗(X ) and α(ξ) = ω(ξ)

for any ξ. When x ∼ ξu, then kxk has the density ϕ(r) = drd−1, r ∈ [0, 1], and φ(ξu) =

d/(d + 2), γ(ξu) = 1 − φ(ξu) = 2/(d + 2). This gives b[φ(ξu), γ(ξu)] = 1 − 2

√

(10)

and therefore α(ξu) = bd/2[φ(ξu), γ(ξu)] = 1 −2 √ d + 1 d + 2 d/2 ,

which is a decreasing function of d, the values of α(ξu) being already moderate for small

d, with α(ξu|d = 2) = 1 −

√

3/2 ' 0.1340 and α(ξu|d = 3) =

√

5/25 ' 0.0894. Similarly, for the bound (19) of [2] we obtain bAY[φ(ξu), γ(ξu)] = d/(d + 2) (1 −p2/d)2 for d > 2

(and 0 for d = 1, 2). The values of b[φ(ξu), γ(ξu)] and bAY[φ(ξu), γ(ξu)] are plotted against

d in Figure 3-left; the corresponding proportions α(ξu) are presented in Figure 3-right.

Figure 3. b[φ(ξu), γ(ξu)] (stars, left), bAY[φ(ξu), γ(ξu)] (triangles, left) and corresponding proportions α(ξu) of eliminated points (right, log-scale) as functions of d.

X is the hypercube [−1/2, 1/2]d_. _{Direct calculation gives φ(ξ}

u) = d/12 and

γ(ξu) = d/4 − φ(ξu) = d/6, so that b[φ(ξu), γ(ξu)] = d(1/4 −

√

2/6). For d ≤ 17, Bd(c(ξu), b1/2[φ(ξu), γ(ξu)]) ⊂ X , and α(ξu) = bd/2[φ(ξu), γ(ξu)] Vd, with Vd =

vol[Bd(0, 1)] = πd/2/Γ(d/2 + 1) the volume of the d-dimensional unit ball Bd(0, 1).

Again α(ξu) is a decreasing function of d, with α(ξu|d = 2) = (3 − 2

√

2) π/6 ' 0.0898 and α(ξu|d = 3) = (

√

2 − 1)3π/6 ' 0.0372. Note that (19) does not permit to eliminate any point since γ(ξu) > φ(ξu).

Although (20) indicates that the effectiveness of the elimination of inessential points decreases with d for a fixed δ (that is, for a fixed level of approximation 1 + =√1 + δ, see Section 3.2), the proportion α(ξ) can be significant when ξ approaches optimality (so that δ = δ(ξ) is small enough in (20)). In particular, algorithms for the solution of the dual formulation of the smallest enclosing ball problem generate sequences of measures ξk that can be used as sieves to progressively eliminate points. Two such methods are presented in the next section.

4. Algorithms for the dual

4.1 A multiplicative algorithm

We return to the case of a finite set Xn, with wi = ξ(Xi) the weight allocated by the

(11)

w0

i = 1/n, consider the application of successive iterations of the form

w_ik+1=w_b_ik+1= w_ik kXi− c(w

k_)k2

Pn

j=1wjkkXj− c(wk)k2

, i = 1, . . . , n . (21)

This type of algorithm is called multiplicative in the literature on optimal experimental design: the weights wk_i of the measure ξk at iteration k are simply multiplied by positive factors fi(wk)/Pkj=1wkjfj(wk), with here fi(wk) = kXi − c(wk)k2 = dφ(w)/dwi

w=wk. In the case of D-optimal design, similar iterations ensure monotonic convergence to the minimum-volume ellipsoid containing Xn, see [22, 23, 29]. Here the iteration (21) does

not guarantee that φ(wk+1_{) > φ(w}k_{) for all non-optimal w}k_{, and, following [27], we}

consider iterations of the (more general) form

w_ik+1=we k+1 i (βk) = w k i [1 + βkFφ(ξk; δXi)] = wk_i {1 + β_k[kXi− c(wk)k2− φ(wk)]} , (22)

where βk≥ 0, Fφ(ξ; ν) is the directional derivative defined in (8), and where ξk allocates

weight wk_i to Xi, i = 1, . . . , n. Note thatPni=1we

k+1

i (βk) = 1 and that allwe

k+1

i (βk) remain

non-negative if βk is small enough. Also note thatwe

k+1

i [1/φ(wk)] =wb

k+1

i given by (21).

The iteration (22) corresponds to a projected second-order method for the maximisation of φ(w), see [27] and [17, Sect. 9.1], and there always exists a step-size βk> 0 such that

φ(wk+1) > φ(wk) when w∗ is not optimal. Since here φ[w_e_ik+1(βk)] is quadratic in βk, the

maximising value β_k∗ can be calculated explicitly and is given by

β_k∗ = Pk i=1wb k+1 i [kXi− c(wk)k2− φ(wk)] 2φ(wk_{) kc(} b wk+1_{) − c(w}k_)k2 , (23)

where the components ofw_bk+1are given by (21). Since the iteration (21) is simpler than (22)-(23), it is advisable to always try the former first, and switch to the latter only if (21) does not yield an increase of φ(·) (numerical experimentation indicates that this is rather exceptional). To ensure that all components ofw_e_ik+1(βk) remain non-negative, we

should normally take βk = min{β∗k, βk,max}, where βk,max = [φ(wk) − minj=1,...,nkXj−

c(wk_)k2_]−1_{≥ 1/φ(w}k_{), see (22). However, from the quadratic dependence of φ[}

e wk+1_i (βk)] in βk, φ(wb k+1 i ) ≤ φ(wk) is equivalent to 1/φ(wk) ≥ 2 eβ ∗

k and thus implies βk,max≥ 2β∗k.

The construction is summarised in Algorithm 1.

Algorithm 1 stops when a (1 + k)-approximation of B∗(Xn) is obtained, with k =

p

1 + γ(wk_)/φ(wk_{) − 1 < . The sequence {φ(w}k_{)} is monotonically increasing, but the}

investigation of its convergence properties as k → ∞ is out of the scope of this paper and will be considered elsewhere. The complexity of each iteration is roughly proportional to n, and the algorithm may benefit from the elimination of inessential points using the results of Section 3.3. This is considered in the next section.

4.2 Elimination of inessential points by the multiplicative algorithm

The uniform measure, with w0_i = 1/n for all i, used to initialise Algorithm 1 can be used to eliminate inessential points from Xn. For a given n, the proportion α(w0) of points

that can be eliminated depends on the precise location of the Xi, but we can consider

(12)

Algorithm 1 Multiplicative algorithm for the smallest enclosing ball problem Require: Xn a set of n points inRd and > 0.

Set w0_i = 1/n for i = 1, . . . , n; k ← 0; compute c(w0), φ(w0) and γ(w0). while γ(wk)/φ(wk) > (1 + )2− 1 do

compute w_b_ik+1 given by (21), compute c(w_bk+1) and φ(w_bk+1); if φ(wb

k+1

i ) > φ(wk) then set wk+1=wb

k+1_;

else compute w_ik+1=w_e_ik+1(β_k∗) given by (22)-(23), compute c(wk+1) and φ(wk+1); end if compute γ(wk+1), k ← k + 1; end while return wk, c(wk), k= p 1 + γ(wk_)/φ(wk_{) − 1}

compact setX ⊂ Rdwith strictly positive d-dimensional Lebesgue measure µ and equal to the closure of its interior. The Xi may be independently identically distributed inX

with the probability measure ξu = µ/vol(X ), with vol(X ) the volume of X , or they

may correspond to the first n points of a low-discrepancy sequence on X , see, e.g., [13, Chap. 3]. In both situations,

lim n→∞α(w 0_{) = α(ξ} u) = ξunBd c(ξu), b1/2[φ(ξu), γ(ξu)] ∩Xo,

where b(φ, γ) is given by (14) and the convergence is almost sure when the Xi are i.i.d.

The values of α(ξu) obtained in Section 3.4 for the case whereX is a d-dimensional ball

or hypercube suggest that the elimination of inessential points via (14) will be generally not very effective when using ξu only. Below we investigate how the situation improves

when applying several iterations (21).

In terms of probability measure, the iteration (21) can be written as

ξk+1(dx) = kx − c(ξ

k_)k2_ξk_(dx)

R

y∈X ky − c(ξk)k2ξk(dy)

, x ∈X .

When initialised at the uniform measure ξu on X , it corresponds to the limiting

be-haviour of (21) as n → ∞ for points Xi uniformly distributed inX . When 0 is a centre

of symmetry for X , φ(ξk+1) > φ(ξk), c(ξk) = 0 and maxx∈X kx − c(ξk)k2 = M for

all k, with M = 1 when X = Bd(0, 1) and M = d/4 when X = [−1/2, 1/2]d.

Di-rect calculation gives b[φ, M − φ] = M −pM2_{− φ}2_{, which is increasing in φ, so that}

α(ξk+1_{) > α(ξ}k_).

Consider the case X = Bd(0, 1). After k iterations, φ(ξk) =

R1

0 r 2_ϕ

k(r)dr, with

ϕk(r) = (d + 2k) rd−1+2k, which gives φ(ξk) = (d + 2k)/(d + 2k + 2). The proportion of

points eliminated by (14) after those k iterations is

α(ξk) =n1 − [1 − φ2(ξk)]1/2od/2= 1 − 2 √ d + 1 + 2k d + 2 + 2k d/2 , (24)

which is decreasing in d for fixed k, but increases in k for fixed d, with limk→∞α(ξk) = 1.

The value of αk slightly improves when inessential points are removed after each it-eration, provided the mass of eliminated points is suitably distributed on the

(13)

ing ones. Suppose for instance that we simply renormalise the total mass of remain-ing points. Then, at iteration k ≥ 1, φ(ξk) = R_A11/2_(ξk−1₎r2ϕk(r)dr, where ϕk(r) =

(d + 2k) [1 − A(d+2k)/2_(ξk−1_)]−1_rd−1+2k_{, r ∈ [A}1/2_(ξk−1_{), 1], with A(ξ) = 1 −}_{p1 − φ}2_(ξ). This gives φ(ξk) = d + 2k d + 2(k + 1) 1 − Ad/2+k+1(ξk−1) 1 − Ad/2+k_(ξk−1₎ and α(ξ k_{) = A}d/2_(ξk_{) , k ≥ 1 .} ₍₂₅₎

Numerical evaluations for different d and k indicate that α(ξk) is only marginally larger than the value in (24), with the consequence that trying to remove inessential points at each iteration of Algorithm 1 is generally not very efficient.

4.3 A vertex-direction algorithm

Algorithm 4.1 of [28] is similar to the algorithm of [25] for the construction of the mini-mum ellipsoid containing Xn and to the algorithm proposed in [3] for the construction

of a D-optimal design measure. The detailed analysis in [28] indicates in particular that the algorithm asymptotically presents linear convergence; see also [1]. An initialisation at a two-point measure is proposed,

ξ2= (1/2)(δX_i1+ δX_i2) , with i1= arg max

i=1,...,nkXi− X1k and i2 = arg maxi=1,...,nkXi− Xi1k ,

(26) so that wi1 = wi2 = 1/2 and wi = 0 for all i 6= i1, i2 (when the order of indices is randomised, X1 can be considered as randomly drawn among the Xi). This construction

ensures that Xi1 and Xi2 will be far apart, without requiring the computation of all n(n−1)/2 pair distances. It is a key argument in the complexity analysis of the algorithm. Direct calculation gives φ(ξ2) = kXi1− Xi2k

2_/4.

The method is summarised in Algorithm 2 below, with two small modifications com-pared with the original version in [28]: (i) the choice between a plus-iteration (displace-ment in the direction of the furthest point Xi+ to the current center c(wk)) or a minus-iteration (reduction of the weight allocated to the closest point Xi− to c(wk) among the current support J (wk)) is based on the comparison between the values of φ(wk+1) corresponding to these two options, whereas [28] simply compares γ(wk) with γ−(wk); (ii) the algorithm is stopped when γ(wk)/φ(wk) ≤ (1 + )2− 1, whereas the condition is max{γ(wk), γ−(wk)}/φ(wk) ≤ (1 + )2 − 1 in [28]. These minor differences do not modify the complexity analysis in the same paper, and the algorithm returns a (1 + )-approximation in 18 + 50/ iterations at most.

The two-point measure ξ2 defined by (26) can also be used to eliminate inessential

points. Let Xi∗ denote the furthest point in X_n from c(ξ₂) = (X_i

1 + Xi2)/2. Then,

kXi∗− c(ξ₂)k ≤ σ kX_i

2 − Xi1k for some σ > 0 implies that γ(ξ2)/φ(ξ2) ≤ 4σ

2_{− 1 and}

thus

b[φ(ξ2), γ(ξ2)]

φ(ξ2)

≥ τ2 = 4σ2−p16σ4_{− 1 .}

Any point Xisuch that kXi−c(ξ2)k < (τ /2) kXi2−Xi1k is thus in the interior ofB

∗₍_X n).

On the other hand, note that the bound bAY[φ(ξ2), γ(ξ2)] given by (19) is informative

only when σ < √2/2 (to ensure that γ(ξ2) < φ(ξ2)). Since τ is decreasing in σ, the

(14)

Algorithm 2 Vertex-direction algorithm for the smallest enclosing ball problem

Require: Xn a set of n points inRd and > 0.

Set w_i0₁ = w0_i₂ = 1/2 and w_i0 = 0 for all i 6= i1, i2, where i1 and i2 are given by (26);

k ← 0; Set c(w0) = (Xi1+ Xi2)/2, φ(w 0_{) = kX} i1− Xi2k 2_/4, _{J (w}0_{) = {i} 1, i2}, γ−(w0) = 0,

i− = 1, compute γ(w0) and i+= arg maxi=1,...,nkXi− c(w0)k.

while γ(wk)/φ(wk) > (1 + )2− 1 do

if γ(wk) > γ−(wk)/[1 − γ−(wk)/φ(wk)] then compute αk= γ(wk)/{2[φ(wk) + γ(wk)]},

set w_ik+1+ = (1 − αk)wik++ αk and wk+1i = (1 − αk)wki for all i 6= i+,

compute c(wk+1) = (1 − αk)c(wk) + αkXi+; else

compute αk= minγ−(wk)/{2[φ(wk) − γ−(wk)]}, wik−/(1 − w_ik−) , set w_ik+1− = (1 + αk)wik− − αk and wik+1= (1 + αk)wik for all i 6= i−,

compute c(wk+1_{) = (1 + α} k)c(wk) − αkXi−; if αk= wki−/(1 − w_ik−) then J (wk+1_{) =}_{J (w}k_{) \ {i}−_} else J (wk+1_{) =}_{J (w}k₎ end if end if

compute φ(wk+1_{), γ(w}k+1_{) and i}+ _{= arg max}

i=1,...,nkXi− c(wk+1)k,

i− = arg mini=1∈J (wk+1₎kX_i − c(wk+1)k and γ−(wk+1) = φ(wk+1) − kX_i− − c(wk+1)k; k ← k + 1; end while return wk_{, c(w}k_), k= p 1 + γ(wk_)/φ(wk_{) − 1}

X = Bd(0, 1) orX = [−1/2, 1/2]d we can take σ = 1/2, which gives τ = 1: all points in

the interior of Bd(c(ξ2), kXi2 − Xi1k/2) are eliminated (and ξ2 is optimal whatever the

choice of X1 inX ). More generally, Lemma 3.1 in [28] gives σ = 3/2 for any Xn, since

kXi∗−c(ξ₂)k ≤ kX_i∗−X_i 1k+kXi1−c(ξ2)k ≤ kXi2−Xi1k+ 1 2kXi1−Xi2k = 3 2kXi1−Xi2k .

This bound is not tight, however: equality can only be achieved when Xi∗, X_i

1 and Xi2

are aligned, with Xi1 between Xi∗ and Xi2, which contradicts the fact that Xi1 is the

furthest point in Xn from some X1. A more precise analysis, see Appendix A, yields

σ =√7/2, and the corresponding bound is tight. This indicates that, for any setXnand

for any point X1∈Xn used for the construction of ξ2, any Xi such that

kXi− c(ξ2)k < 0.133974 kXi2− Xi1k <

q

7 − 4√3 kXi2 − Xi1k/2 (27)

can always be eliminated2. In practice, kXi∗−c(ξ₂)k is often much smaller than

√

7 kXi1−

Xi2k/2, and ξ2proves generally more efficient than the uniform measure ξufor eliminating

inessential points. This is illustrated in the next section.

2_{Although the value σ =}√_{7/2 gives a tight bound, one may notice that the inequality (27) is suboptimal since} the worst-case situations in Theorem 3.3 and Lemma A.1 correspond to different measures.

(15)

5. Computational results

Methods to be compared. In this section, we report the results of computational ex-periments comparing different methods for the construction of B∗(Xn). The first one

(henceforth QP) corresponds to the direct application of the QP solver of Matlab (the function qp.m) to the minimisation of (3), see Section 1. In the method QP0, we first

eliminate inessential points using the sieve (14) for ξ2 given by (26) and then apply the

same QP solver.

The choice of c0 in (3) is arbitrary, and c0 = c(ξu) = (1/n)Pn_i=1Xi seems natural.

However, we found that c0has a significant influence on the computational time, and that

taking c0 out of the convex hull Conv(Xn) ofXngenerally yields a faster computation of

the optimal solution. Note that, when c0 ∈ Conv(/ Xn), for any t ∈R there exists a c ∈ Rd

satisfying the constraints (4) (and the set of such feasible c is unbounded). On the other hand, no feasible c exists for small enough t when c0 ∈ Conv(Xn). In our computations

we take c0 = 2 Xia− Xib, where ia= arg maxi=1,...,nu >_X

i and ib= arg mini=1,...,nu>Xi,

with u> = (1, 0, . . . , 0) (the choice of u does not seem important). The QP solver is initialised at (c(ξu), 0) (which is not necessarily feasible for (4)).

We also consider the iterative construction of an (1 + )-approximation of B∗(Xn),

using Algorithms 1 and 2 (henceforth A1 and A2), both with = 10−3. A1 and A2 do not eliminate any point. As noticed in Section 4.2, it is not very efficient to try to eliminate inessential points at each iteration of A1. Our experiments indicate that a suitable compromise between the computational cost of the elimination test and the benefit of reducing the dimension of w is obtained when the sieve (14) is used about every 5 iterations of A1 or A2; the corresponding methods are denoted by A15 and A25,

respectively. For each of them, inessential points are also eliminated at the initialisation, using (14) with ξ2. A105 and A205 differ from A15 and A25 by the stopping rule only: they

are stopped when an (1+)-approximation is obtained or earlier if n−2d inessential points have already been eliminated. In case of early stopping, QP applied to the resulting 0-core set will thus have to deal with 2d constraints only (the value 2d is somewhat arbitrary, but seems reasonable for most situations since B∗(Xn) has d + 1 points at most on its

boundary when the n points in Xn are in general position). In A105-QP and A205-QP

we apply QP to the 0-core sets returned by A10₅ and A20₅, respectively. Finally, A2∗₅ is similar to A25 but uses = 10−6, and thus returns an (1 + )-approximation very close to

the exact B∗(Xn) given by QP, QP0, A105-QP and A205-QP. We shall call these methods

(including A2∗₅) exact in what follows.

When using A1 or A2, points that are eliminated by (14) for the current measure ξk may carry a positive weight w_ik, and the weights of remaining points then need to be renormalised. Denote by Ik the set of indices of those remaining points; following [11],

we replace wk_i by z_ik/(Pn

j=1zjk), where zik= 0 for i /∈Ik, zki = 1.1 wikif kXi− c(wk)k2≥

φ(wk) and zk_i = wk_i otherwise (i ∈Ik and kXi− c(wk)k2 < φ(wk)).

Measures of performance. The experiments were carried out on a PC with a clock speed of 2.50 GHz and 32 Go RAM.

We first compare (Tables 1, 4 and 7) the effectiveness of the sieve (14) for the uniform measure ξu used to initialise A1 and for ξ2 given by (26): π(ξ) = 1 − α(ξ) gives the

proportion of points that are not eliminated by ξ. To compare the efficiency of (14) with that of the bound (19) proposed in [2], we also give the value πAY(ξ2) obtained

when bAY[φ(ξ2), γ(ξ2)] is used instead of b[φ(ξ2), γ(ξ2)]. We also indicate the number

(16)

inessential points.

In Tables 2, 5 and 8, N gives the number of iterations performed to reach the required

precision for A1, A15, A2, A25 ( = 10−3) and A2∗5 ( = 10−6), or to eliminate at least

n − 2d points for A10₅ and A20₅.

Finally, in Tables 3, 6 and 9 we compare the computational times of the different methods considered, with t(QP), the computational time of QP, taken as a reference: for each method M other than QP, with computational time t(M), we indicate the ratio ρ(M)=t(M)/t(QP).

n consecutive points of Sobol’ low-discrepancy sequence in [0, 1]d. Table 1 indicates that ξ2 is much more effective than the uniform measure ξu for eliminating points with

(14) when d is not too large, d. 10 say; one may note the good agreement between π(ξu)

and the theoretical value π∗ = 1 − [π d(1/4 −√2/6)]d/2/Γ(d/2 + 1) (d ≤ 17) derived in Section 3.4. For d between 3 and 10, πAY(ξ2) is most often significatively larger than

π(ξ2), which illustrates the superiority of the bound (14) over (19). The number of

remaining points after running A15 or A25 are very close in most cases. Exceptions, like

n = 103 and n = 104for d = 3 and n = 105 for d = 4, correspond to situations where A25

is used for less than 5 iterations, so that inessential points are only eliminated once (at the initialisation) whereas A15 makes much more iterations, see Table 2 (when less than

5 iterations are done, then κ = n π(ξ2)). As expected, κ(A2∗5) is smaller than κ(A25) in

all cases, and Table 1 indicates that A2∗₅ is able to provide small 0-core sets for the sets Xn considered.

Table 2 shows that the elimination of inessential points does not directly influence the number of iterations required to reach a given precision: N(A15) is often smaller

than N(A1), but not always; the effect on A2 is limited. A15 requires systematically

more (sometimes much more) iterations than A25 to reach the required precision ,

which can be related to the general observation that multiplicative algorithms tend to be slow close to the optimum. This is consistent with the observations that sometimes A10₅ requires significantly less iterations than A15, whereas N(A205) is close to N(A25) in all

circumstances: A15 may have reached an (1 + 0)-approximation, 0 > , close enough to

the optimum to be able to eliminate many points, but may still require many iterations to reach an (1+)-approximation. The number of iterations of A2∗₅ ( = 10−6) shows a great variability among the cases considered, and the large values obtained for d = 2, n = 103 and n = 104may look surprising. However, they do not contradict the complexity bound N(A2) < 18 + 50/ of [28] and can be explained by the potential slow convergence of

first-order methods close to the optimum. A simple example with d = 2 and n = 4 gives an illustration.

Take Xn = {X1, X2, X3, X4} with X1 = (1 − a, a)>, X2 = (a, 1 − a)>, X3 = (0, 0)>

and X4 = (1, 1)>, a < 1/2. When a < 1/2 −

√

3/6, then kX1− X2k > kX1− X3k, so

that i1 = 2 and i2 = 1 in (26). The initial w0 of A2 is thus (1/2, 1/2, 0, 0), and A2 may

require many iterations to reach precision depending on the value of a. For instance, for = 10−5, N(A2)=6252 when a = 10−3 and N(A2)=62502 when a = 10−4 (whereas

N(A1)=7361 and N(A1)=1 for a = 10−3 and a = 10−4, respectively).

A noticeable observation from Table 3 is that a standard QP solver gives the solution in reasonable time if n is not too big, even for rather large d. A10₅ (respectively, A20₅) is slightly faster than A15 (respectively, A25) since it is stopped earlier; the comparison

with A1 (respectively, A2) shows that the elimination of points significantly accelerates convergence3. Since A15 and A25 only provide (1 + )-approximations with = 10−3,

(17)

Table 1. Sobol’ sequence in [0, 1]d_{: proportion π (in %) of points not eliminated} and number κ(M) of remaining points after applying method M.

d n π∗ π(ξu) π(ξ2) πAY(ξ2) κ(A15) κ(A25) κ(A2∗5)

2 103 91.02 91.0 0.4 0.4 4 4 4 104 _91.02 _91.04 _0.04 _0.04 ₄ ₄ ₄ 105 _91.02 _91.03 _0.004 _0.004 ₄ ₄ ₄ 3 103 _96.28 _96.66 _1.3 _2.40 ₅ ₁₃ ₄ 104 96.28 96.34 1.23 3.96 11 123 4 105 _96.28 _96.30 _0.060 _0.136 ₄₅ ₆₀ ₃ 4 103 _98.39 _98.30 _17.2 _44.8 ₈ ₇ ₃ 104 _98.39 _98.41 _0.02 _0.02 ₂ ₂ ₂ 105 98.39 98.39 2.318 10.359 32 2318 5 5 103 _99.28 _99.30 _34.1 _70.5 ₁₀ ₈ ₅ 104 _99.28 _99.28 _19.16 _63.45 ₂₀ ₁₆ ₆ 105 _99.28 _99.28 _5.976 _29.445 ₂₇ ₁₆ ₅ 10 103 99.98 99.8 75.8 99.4 13 13 8 104 _99.98 _99.95 _85.34 _99.96 ₂₈ ₃₀ ₁₀ 105 _99.98 _99.98 _45.128 _95.730 ₄₀ ₄₈ ₁₁ 20 103 _99.9 _99.7 _100.00 ₃₄ ₃₄ ₁₃ 104 _99.99 _98.56 _100.00 ₅₂ ₅₇ ₁₄ 105 _99.999 _95.217 _99.999 ₅₃ ₄₀ ₁₁ 30 103 _99.9 _99.9 _100.00 ₂₈ ₂₈ ₁₂ 104 _99.99 _99.98 _100.00 ₄₂ ₄₈ ₁₄ 105 _99.999 _98.897 _100.00 ₉₈ ₁₀₈ ₁₆ 40 103 _99.9 _100.00 _100.00 ₄₆ ₃₃ ₁₃ 104 _99.99 _100.00 _100.00 ₆₀ ₇₁ ₁₄ 104 _99.999 _99.989 _100.00 ₁₆₂ ₁₂₁ ₁₉ 50 103 _99.9 _100.00 _100.00 ₄₃ ₅₁ ₁₅ 104 _99.99 _100.00 _100.00 ₇₇ ₁₁₃ ₁₇ 105 _99.999 _100.00 _100.00 ₁₈₅ ₁₅₅ ₂₇

comparing their computational time with that of QP is unfair. A10₅-QP is sometimes faster than QP, but is always slower than A20₅-QP, which is often faster than QP and sometimes the fastest among the exact methods considered. A2∗₅ is seldom the fastest among exact methods and is often much slower than QP. In this example, QP0 is faster

than QP for n ≤ 10 and slightly slower when n ≥ 20 (i.e., when few points are eliminated by ξ2); it is frequently the fastest exact method when n ≤ 5.

n points i.i.d. N (0, Id). Table 4 indicates that the elimination of inessential points

is more efficient with A15 than A25, and that both methods are able to provide small

0-core sets. For d . 10, πAY(ξ2) is significatively larger than π(ξ2), confirming the

su-periority of the bound (14) over (19). Table 5 gives the same indications as Table 2: sometimes A10₅ requires significantly less iterations than A15, an indication of the slow

convergence of the multiplicative algorithm near the optimum. Also, N(A15)>N(A25)

and N(A105)>N(A205) in all cases. One may notice the large values of N(A2∗5). Table 6

shows that QP0 and A25-QP are often the fastest among exact methods, which is never

the case for A2∗₅. QP0 shows remarkably stable performance and is significantly faster

than QP when n ≤ 5 (i.e., when the elimination of inessential points by ξ2is effective, see

Table 4) and is only slightly slower than QP for n ≥ 10. QP is the fastest exact method for n small enough (n ≤ 103) when d ≥ 10 and for all n ≥ 103 when d is large (d ≥ 40).

n points i.i.d. uniformly in Bd(0, 1). This corresponds to a difficult situation for

algorithms 1 and 2, and due to the larger computational times required compared to previous examples we only consider d ≤ 40 (and n ≤ 104 _{for d = 40). Table 7 shows that}

(18)

Table 2. Sobol’ sequence in [0, 1]d_{: number N}

of iterations per-formed to reach precision = 10−3( = 10−6for A2∗₅).

d n A1 A15 A105 A2 A25 A205 A2 ∗ 5 2 103 ₄₄ ₁ ₀ ₀ ₀ ₀ ₃₂₂₆₃ 104 ₁₇₃ ₁ ₀ ₀ ₀ ₀ ₃₂₂₆₃ 105 ₂₆₆ ₀ ₀ ₀ ₀ ₀ ₀ 3 103 ₈₀ ₂₇₀ ₇₀ ₃ ₃ ₃ ₁₂ 104 ₂₅₃ ₁₆₉ ₁₆₉ ₃ ₃ ₃ ₃₄ 105 ₂₄₂ ₂₁₉ ₂₁₉ ₁ ₁ ₁ ₇ 4 103 ₉₁ ₈₄ ₇₅ ₅ ₆ ₅ ₁₄ 104 ₉₄ ₀ ₀ ₀ ₀ ₀ ₀ 105 ₂₂₉ ₁₂₃ ₁₂₃ ₄ ₄ ₄ ₈₁₈ 5 103 ₉₃ ₉₂ ₇₅ ₂₉ ₂₂ ₂₀ ₇₆ 104 ₂₁₂ ₈₈ ₈₈ ₆₃ ₈₁ ₈₁ ₁₇₈ 105 179 107 107 50 55 55 465 10 103 ₈₉ ₁₃₉ ₄₀ ₆₂ ₅₆ ₃₅ ₄₅₇ 104 ₁₇₅ ₉₇ ₉₇ ₆₉ ₇₉ ₇₉ ₄₄₆ 105 ₂₀₀ ₁₃₇ ₁₃₇ ₆₆ ₇₄ ₇₄ ₉₃₀ 20 103 241 139 115 89 89 85 714 104 ₁₆₆ ₁₅₂ ₁₅₂ ₄₄ ₃₇ ₃₇ ₃₄₈ 105 ₂₄₄ ₁₄₂ ₁₄₂ ₆₁ ₃₆ ₃₅ ₃₀₁ 30 103 ₂₈₆ ₂₀₄ ₅₀ ₃₇ ₂₈ ₂₅ ₃₇₃ 104 336 237 210 87 63 55 1007 105 ₃₄₂ ₂₂₂ ₂₂₂ ₇₆ ₆₆ ₆₆ ₉₅₉ 40 103 ₂₀₆ ₁₁₇ ₈₀ ₂₈ ₂₆ ₁₅ ₁₃₂ 104 ₁₁₅ ₉₉ ₉₀ ₆₀ ₅₆ ₅₀ ₃₁₁ 105 359 188 188 76 45 45 744 50 103 ₁₅₃ ₁₀₃ ₆₀ ₅₆ ₄₄ ₃₀ ₃₃₆ 104 ₁₉₁ ₁₅₄ ₁₂₅ ₅₆ ₅₄ ₅₄ ₆₁₇ 105 ₂₆₆ ₁₄₃ ₁₄₃ ₉₃ ₇₉ ₇₉ ₁₇₂₆

Table 3. Sobol’ sequence in [0, 1]d_{: computational time t(QP) (in s) and ratios ρ(M)=t(M)/t(QP) —} averaged over 10 repetitions. Italicized figures correspond to the fastest exact method.

d n t(QP) QP0 A1 A15 A105 A2 A25 A205 A1 0 5-QP A2 0 5-QP A2 ∗ 5 2 103 _0.006 _0.40 _2.13 _0.27 _0.22 _0.16 _0.14 _0.14 _0.37 _0.29 _864.1 104 _0.030 _0.08 _3.27 _0.08 _0.07 _0.04 _0.05 _0.05 _0.10 _0.08 _164.1 105 _0.27 _0.06 _4.69 _0.07 _0.07 _0.04 _0.06 _0.06 _0.08 _0.06 _0.06 3 103 _0.004 _0.49 _4.11 _11.05 _3.20 _0.32 _0.30 _0.30 _3.43 _0.59 _0.70 104 _0.029 _0.10 _5.19 _1.08 _1.07 _0.10 _0.08 _0.08 _1.10 _0.12 _0.27 105 _0.28 _0.06 _4.43 _0.20 _0.20 _0.06 _0.07 _0.07 _0.20 _0.07 _0.07 4 103 _0.004 _0.56 _4.75 _4.23 _3.99 _0.44 _0.49 _0.45 _4.25 _0.68 _0.83 104 _0.029 _0.08 _1.96 _0.09 _0.08 _0.05 _0.06 _0.06 _0.11 _0.09 _0.06 105 _0.32 _0.10 _4.32 _0.17 _0.17 _0.11 _0.07 _0.07 _0.18 _0.10 _0.45 5 103 _0.005 _0.92 _4.06 _3.80 _3.20 _1.30 _1.03 _0.97 _3.48 _1.25 _2.81 104 _0.031 _0.28 _4.22 _0.67 _0.66 _1.20 _0.52 _0.53 _0.70 _0.57 _1.04 105 _0.32 _0.14 _4.08 _0.20 _0.20 _1.05 _0.12 _0.12 _0.21 _0.13 _0.31 10 103 _0.007 _0.97 _3.18 _4.12 _1.64 _2.28 _1.79 _1.24 _2.05 _1.63 _11.6 104 _0.040 _0.95 _3.61 _0.93 _0.93 _1.36 _0.49 _0.49 _1.01 _0.57 _1.94 105 _0.40 _0.57 _7.03 _0.64 _0.64 _2.16 _0.26 _0.26 _0.65 _0.27 _0.58 20 103 0.011 1.06 6.18 3.02 2.62 2.21 1.73 1.65 3.21 2.20 10.9 104 _0.059 _1.14 _6.17 _1.76 _1.77 _1.70 _0.47 _0.48 _1.88 _0.58 _1.33 105 _0.55 _1.15 _12.71 _1.99 _2.01 _3.00 _0.50 _0.50 _2.02 _0.51 _0.58 30 103 _0.015 _1.08 _5.66 _2.78 _1.05 _0.76 _0.52 _0.49 _1.65 _1.06 _3.96 104 0.075 1.18 15.65 2.30 2.26 4.02 0.73 0.72 2.38 0.84 2.70 105 _0.74 _1.22 _23.86 _2.80 _2.80 _5.05 _0.93 _0.93 _2.82 _0.94 _1.12 40 103 _0.021 _1.05 _3.38 _1.56 _1.23 _0.52 _0.42 _0.31 _1.93 _0.94 _1.19 104 _0.092 _1.16 _6.52 _2.58 _2.58 _3.28 _0.99 _0.98 _2.78 _1.16 _1.46 105 0.92 1.28 27.89 4.06 4.05 5.68 1.04 1.03 4.07 1.06 1.14 50 103 _0.030 _1.04 _2.02 _1.04 _0.77 _0.71 _0.45 _0.36 _1.49 _1.06 _1.98 104 _0.12 _1.19 _11.88 _3.21 _3.18 _3.44 _1.13 _1.14 _3.42 _1.36 _1.93 105 _1.19 _1.24 _21.67 _4.24 _4.23 _7.31 _1.14 _1.14 _4.26 _1.17 _1.36

(19)

Table 4. Xii.i.d.N (0, Id): proportion π (in %) of points not eliminated and number κ(M) of remaining points after applying method M — averaged values over 100 repetitions, rounded to the nearest integer.

d n π(ξu) π(ξ2) πAY(ξ2) κ(A15) κ(A25) κ(A2∗5)

2 103 _93.40 _12.81 _40.23 ₄ ₂₉ ₁₀ 104 _94.86 _9.00 _37.00 ₅ ₂₄₀ ₆₅ 105 _95.92 _3.61 _25.69 ₆ ₄₉₅ ₂₁ 3 103 _96.70 _32.94 _75.37 ₅ ₅₂ ₉ 104 _97.68 _18.59 _66.15 ₆ ₁₁₂ ₁₇ 105 _98.25 _9.19 _50.79 ₈ ₁₂₉ ₄ 4 103 _98.21 _46.39 _84.81 ₇ ₂₀ ₅ 104 _98.77 _34.30 _82.96 ₈ ₁₂₇ ₅ 105 _99.16 _24.00 _78.00 ₁₀ ₅₀₇ ₃₂₀ 5 103 _98.89 _62.18 _92.52 ₈ ₂₂ ₅ 104 _99.31 _47.69 _94.04 ₁₀ ₈₁ ₅ 105 _99.54 _33.27 _84.58 ₁₂ ₁₇ ₅ 10 103 _99.78 _93.52 _99.93 ₁₃ ₁₅ ₈ 104 _99.91 _89.03 _99.99 ₁₇ ₂₆ ₈ 105 _99.96 _81.00 _99.92 ₂₂ ₂₅ ₉ 20 103 _99.99 _99.94 _100.00 ₂₂ ₂₄ ₁₂ 104 _100.00 _99.79 _100.00 ₃₂ ₃₃ ₁₃ 105 _100.00 _99.39 _100.00 ₄₂ ₄₅ ₁₄ 30 103 _100.00 _100.00 _100.00 ₃₀ ₃₁ ₁₆ 104 _100.00 _100.00 _100.00 ₄₅ ₄₆ ₁₈ 105 100.00 100.00 100.00 64 67 20 40 103 _100.00 _100.00 _100.00 ₃₉ ₄₀ ₁₈ 104 _100.00 _100.00 _100.00 ₅₉ ₆₃ ₂₁ 105 _100.00 _100.00 _100.00 ₈₆ ₉₂ ₂₄ 50 103 100.00 100.00 100.00 48 49 21 104 _100.00 _100.00 _100.00 ₇₄ ₇₇ ₂₄ 105 _100.00 _100.00 _100.00 ₁₀₇ ₁₁₂ ₂₈

Table 5. Xii.i.d.N (0, Id): number Nof iterations performed to reach precision = 10−3 ( = 10−6 for A2∗5) — averaged values over 100 repetitions, rounded to the nearest integer.

d n A1 A15 A105 A2 A25 A205 A2∗5 2 103 ₈₀ ₁₂₆ ₉₅ ₂₇ ₂₃ ₂₂ ₉₁ 104 ₈₄ ₂₈₈ ₁₈₇ ₃₈ ₃₆ ₃₆ ₂₉₁ 105 114 132 123 40 35 35 74 3 103 ₈₄ ₉₈ ₅₅ ₃₆ ₃₃ ₂₆ ₈₆ 104 ₉₉ ₈₃ ₆₈ ₄₈ ₄₀ ₃₇ ₉₂ 105 ₁₁₂ ₁₀₇ ₁₀₀ ₅₁ ₄₆ ₄₅ ₁₇₀ 4 103 107 87 59 46 40 34 385 104 ₁₁₀ ₈₆ ₆₆ ₄₇ ₄₀ ₃₇ ₁₄₂ 105 ₁₂₇ ₉₀ ₈₂ ₇₄ ₆₃ ₆₁ ₂₈₈ 5 103 ₁₀₄ ₈₂ ₅₄ ₅₀ ₄₆ ₃₇ ₁₉₁ 104 122 93 74 64 52 47 199 105 ₁₃₆ ₁₀₀ ₉₁ ₈₈ ₇₄ ₇₁ ₃₃₃ 10 103 ₁₂₅ ₉₃ ₅₂ ₅₉ ₄₉ ₃₆ ₃₂₀ 104 ₁₆₃ ₁₁₂ ₉₁ ₇₈ ₆₂ ₅₆ ₃₄₉ 105 175 124 116 86 67 64 465 20 103 ₁₆₂ ₁₂₁ ₅₇ ₆₄ ₅₆ ₃₅ ₃₃₄ 104 ₁₉₄ ₁₃₆ ₉₄ ₈₇ ₇₁ ₅₈ ₆₀₂ 105 ₂₂₂ ₁₅₆ ₁₄₄ ₉₉ ₈₂ ₇₈ ₇₅₄ 30 103 ₁₆₉ ₁₂₄ ₅₃ ₇₂ ₆₂ ₃₇ ₄₆₅ 104 ₂₂₈ ₁₅₃ ₁₁₀ ₉₂ ₈₁ ₆₅ ₈₁₃ 105 ₂₄₉ ₁₆₈ ₁₅₅ ₁₁₇ ₉₄ ₈₉ ₁₂₀₀ 40 103 ₁₆₈ ₁₁₉ ₆₁ ₆₈ ₆₄ ₃₅ ₅₃₂ 104 ₂₂₉ ₁₅₉ ₁₀₅ ₉₃ ₈₁ ₆₅ ₁₀₅₄ 105 ₂₈₀ ₁₈₂ ₁₇₀ ₁₁₄ ₉₆ ₉₃ ₁₄₇₂ 50 103 ₁₇₆ ₁₂₃ ₆₃ ₈₀ ₇₃ ₄₀ ₇₂₃ 104 ₂₃₄ ₁₅₁ ₁₁₆ ₁₀₆ ₈₉ ₇₀ ₁₁₇₁ 105 ₂₈₄ ₁₇₇ ₁₆₉ ₁₁₇ ₁₀₀ ₉₅ ₁₈₅₆

(20)

Table 6. Xii.i.d.N (0, Id): computational time t(QP) (in s) and ratios ρ(M)=t(M)/t(QP) — averaged over 100 repetitions. Italicized figures correspond to the fastest exact method.

d n t(QP) QP0 A1 A15 A105 A2 A25 A205 A1 0 5-QP A2 0 5-QP A2 ∗ 5 2 103 _0.005 _0.52 _4.01 _5.98 _4.66 _1.43 _1.24 _1.16 _4.87 _1.39 _4.15 104 _0.030 _0.18 _1.61 _1.60 _1.15 _0.71 _0.27 _0.27 _1.18 _0.32 _1.71 105 _0.27 _0.09 _1.79 _0.16 _0.15 _0.58 _0.08 _0.08 _0.16 _0.09 _0.10 3 103 _0.004 _0.73 _4.16 _4.61 _2.72 _1.87 _1.65 _1.37 _2.95 _1.66 _3.69 104 _0.029 _0.28 _2.03 _0.61 _0.53 _0.92 _0.31 _0.30 _0.56 _0.34 _0.59 105 _0.29 _0.16 _2.18 _0.17 _0.17 _0.85 _0.10 _0.10 _0.17 _0.11 _0.17 4 103 _0.005 _0.83 _4.89 _3.74 _2.69 _2.14 _1.80 _1.60 _2.95 _1.88 _14.28 104 _0.030 _0.43 _2.18 _0.65 _0.54 _0.88 _0.32 _0.31 _0.58 _0.36 _0.84 105 _0.30 _0.31 _2.54 _0.20 _0.20 _1.33 _0.13 _0.13 _0.20 _0.14 _0.24 5 103 _0.004 _0.95 _4.83 _3.70 _2.62 _2.37 _2.04 _1.74 _2.93 _2.08 _7.14 104 _0.032 _0.56 _2.43 _0.70 _0.61 _1.20 _0.40 _0.37 _0.66 _0.42 _1.06 105 _0.32 _0.42 _3.06 _0.25 _0.25 _1.79 _0.16 _0.16 _0.26 _0.16 _0.28 10 103 _0.006 _1.10 _4.58 _3.11 _1.99 _2.16 _1.66 _1.40 _2.43 _1.84 _8.18 104 _0.039 _0.96 _3.41 _0.85 _0.76 _1.57 _0.45 _0.42 _0.83 _0.49 _1.55 105 _0.40 _0.92 _6.19 _0.63 _0.63 _2.77 _0.30 _0.30 _0.64 _0.31 _0.46 20 103 _0.010 _1.08 _4.23 _2.51 _1.44 _1.70 _1.23 _0.88 _2.00 _1.43 _5.35 104 0.061 1.12 6.87 1.12 1.01 3.01 0.58 0.54 1.12 0.65 1.89 105 _0.58 _1.16 _10.94 _1.21 _1.21 _4.45 _0.51 _0.52 _1.22 _0.53 _0.69 30 103 _0.019 _1.06 _2.78 _1.51 _0.83 _1.17 _0.82 _0.56 _1.40 _1.11 _4.06 104 _0.081 _1.12 _9.63 _1.36 _1.27 _3.83 _0.75 _0.72 _1.42 _0.87 _2.15 105 0.77 1.19 16.42 1.75 1.75 7.18 0.80 0.79 1.77 0.81 1.02 40 103 _0.028 _1.04 _2.09 _1.09 _0.69 _0.83 _0.61 _0.42 _1.37 _1.09 _3.24 104 _0.10 _1.14 _11.30 _1.63 _1.54 _4.33 _0.90 _0.88 _1.77 _1.09 _2.38 105 _0.97 _1.20 _20.39 _2.15 _2.15 _7.83 _1.05 _1.04 _2.18 _1.06 _1.27 50 103 0.038 1.04 1.79 0.89 0.60 0.77 0.55 0.38 1.34 1.10 3.21 104 _0.13 _1.16 _12.71 _1.84 _1.78 _5.54 _1.11 _1.10 _2.05 _1.36 _2.46 105 _1.20 _1.22 _22.43 _2.48 _2.49 _8.74 _1.33 _1.33 _2.52 _1.36 _1.56

πAY(ξ2) is significatively larger than π(ξ2) for d. 5 only.As in Table 4, κ(A15)< κ(A25),

but the figures are now much larger, indicating that the algorithms have difficulties with providing small 0-core sets. As a consequence, here A10₅ (respectively, A20₅) does not stop earlier than A15 (respectively, A25), and the results for A105 and A205 are omitted

in Tables 8 and 9 since they are identical to those for A15 and A25. The number of

iterations for given d and n in Table 8 is significantly larger than in Tables 2 and 5, with N(A15)>N(A25) for d ≤ 10 and N(A25) slightly larger than N(A15) for d ≥ 30. The

number of iterations of A2∗₅ is now very large. Table 9 shows that QP0 is generally the

fastest among exact methods for d ≤ 5 and is only slightly slower than QP for larger d. On the other hand, A20₅-QP is much slower than QP for d ≥ 10 and A2∗₅ is by far the slowest exact method is all cases considered.

Finally, one may notice that, for given d and n, the computational times for QP (and thus of QP0) are quasi identical in Tables 3 and 6 and are only increased by a small factor

in Table 9, enhancing the interest of using QP with elimination of inessential points to solve smallest enclosing ball problems with moderate d.

6. Conclusions

An inequality has been derived that permits to remove inessential (interior) points during the computation of the smallest enclosing ball of a set of points. The inequality is, in some sense, the best possible, and is given by a simple expression depending on the mean and the (trace of the) variance matrix of a probability measure placed on the set of points. Any probability measure gives such an an inequality. Algorithms for the

(21)

Table 7. Xiuniform inBd(0, 1): proportion π (in %) of points not eliminated and number κ(M) of remaining points after applying method M — averaged values over 100 repetitions, rounded to the nearest integer.

d n π∗ π(ξu) π(ξ2) πAY(ξ2) κ(A15) κ(A25) κ(A2∗5)

2 103 _86.60 _87.04 _36.00 _52.84 ₆₇ ₈₇ ₉ 104 86.60 86.78 24.82 36.35 644 978 35 105 _86.60 _86.66 _17.45 _25.34 ₆₃₈₄ ₁₆₆₂₇ ₂₃₅ 3 103 _91.06 _91.61 _63.29 _83.69 ₁₀₀ ₁₁₂ ₉ 104 _91.06 _91.26 _49.12 _67.89 ₉₄₈ ₁₀₆₁ ₄₄ 105 91.06 91.13 40.78 57.17 9399 17555 339 4 103 _93.52 _94.21 _80.24 _95.49 ₁₂₇ ₁₄₀ ₁₀ 104 _93.52 _93.74 _70.89 _89.16 ₁₂₂₉ ₁₃₇₅ ₅₄ 105 _93.52 _93.60 _62.08 _81.02 ₁₂₂₄₅ ₁₅₄₄₉ ₄₄₅ 5 103 _95.06 _95.64 _89.42 _98.78 ₁₅₇ ₁₆₄ ₁₁ 104 _95.06 _95.29 _85.33 _97.54 ₁₅₂₂ ₁₆₆₇ ₆₄ 105 _95.06 _95.15 _77.66 _93.34 ₁₅₁₇₃ ₁₇₄₀₇ ₅₅₃ 10 103 _98.21 _98.70 _99.79 _100.00 ₂₈₇ ₂₉₇ ₁₉ 104 _98.21 _98.39 _99.61 _100.00 ₂₇₉₅ ₂₉₂₀ ₁₁₃ 105 _98.21 _98.28 _99.19 _100.00 ₂₇₉₂₅ ₂₉₀₅₄ ₁₀₈₀ 20 103 _99.54 _99.74 _100.00 _100.00 ₄₈₈ ₄₉₉ ₃₉ 104 _99.54 _99.64 _100.00 _100.00 ₄₇₉₉ ₄₈₅₉ ₂₂₁ 105 _99.54 _99.58 _100.00 _100.00 ₄₇₈₉₂ ₄₈₅₆₂ ₂₀₈₄ 30 103 _99.84 _99.93 _100.00 _100.00 ₆₃₂ ₆₃₈ ₅₆ 104 _99.84 _99.89 _100.00 _100.00 ₆₂₃₇ ₆₂₇₆ ₃₂₉ 105 _99.84 _99.86 _100.00 _100.00 ₆₂₆₂₄ ₆₂₅₇₇ ₃₀₅₃ 40 103 _99.93 _99.98 _100.00 _100.00 ₇₃₈ ₇₄₀ ₇₂ 104 _99.93 _99.96 _100.00 _100.00 ₇₂₈₆ ₇₂₈₄ ₄₃₃

solution of the dual problem construct sequences of probability measures (defined by the Lagrange coefficients), which can thus straightforwardly be used to progressively eliminate inessential points. A two-point measure ξ2, already proposed in the literature

to efficiently initialise such dual algorithms [28], has been shown to efficiently directly remove a significant proportion of points in various situations with reasonably small dimension d. Several numerical experiments have indicated that this simple pre-filtering of the input set is clearly beneficial to a QP solver when enough inessential points are removed (d is small enough) and that the extra cost (slow-down factor) due to pre-filtering is marginal otherwise (for large d). Other methods, like those in [9, 26]4 might also benefit from the input-size reduction offered by this pre-filtering. Notice, finally, that these methods rely on the computation of a sequence of smallest enclosing balls for sets of d + 1 points, from which a sequence of probability measures, and thus of eliminating inequalities, could easily be deduced; see [9, Sect. 3].

Acknowledgments

The author thanks the two referees for their comments that helped to improve the presen-tation of the paper. He also thanks the referee who pointed out the existence of reference [2].

4_{See also the implementation in http://doc.cgal.org/latest/Bounding_volumes/classCGAL_1_1Min_sphere__d.} html