HAL Id: hal-00903687
https://hal.archives-ouvertes.fr/hal-00903687v2
Preprint submitted on 17 Dec 2014
HAL
is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or
L’archive ouverte pluridisciplinaire
HAL, estdestinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires
A universal deviation inequality for random polytopes
Victor-Emmanuel Brunel
To cite this version:
Victor-Emmanuel Brunel. A universal deviation inequality for random polytopes. 2013. �hal-
00903687v2�
A universal deviation inequality for random polytopes
Victor-Emmanuel Brunel
CREST, Paris (France) - University of Haifa, Haifa (Israel) [email protected]
Abstract
We consider the convex hull of a finite sample of i.i.d. points uni- formly distributed in a convex body inRd,d≥2. We prove an expo- nential deviation inequality, which leads to rate optimal upper bounds on all the moments of the missing volume of the convex hull, uniformly over all convex bodies ofRd, with no restriction on their volume, loca- tion in the space and smoothness of the boundary. As a consequence of this inequality, we use an extension of an identity due to Efron, to prove uniform upper bounds on the moments of the number of vertices of this convex hull, and we show that the rates of convergence of these bounds are tight.
Keywords.
convex, convex hull, deviation inequality, metric entropy, ran- dom polytope, vertices
1 Introduction
Probabilistic properties of random polytopes have been studied extensively
in the literature in the last fifty years. Consider a convex body K in
Rd, and
a set of n i.i.d. random points uniformly distributed in K. The convex hull
of these random points is a random polytope. Its number of vertices and
its missing volume, i.e., the volume of its complement in K, have been first
analyzed in the seminal work of R´enyi and Sulanke [13, 14]. They derived
the asymptotics of the expected missing volume in the case d = 2, when K is
supposed to be either a polygon with a given number of vertices, or a convex
set with smooth boundary. More recently, considerable efforts were devoted
to understanding the behavior of the expected missing volume. Thus, several
particular examples of K were studied, including a d-dimensional simple
polytope
1[1], a d-dimensional polytope [2] and a d-dimensional Euclidean ball [5]. In [7], it is shown that the expected missing volume is maximal when K is an ellipsoid, see also the references therein. B´ar´ any and Larman [3]
showed that if K has volume one, then the expected missing volume has the same asymptotic behavior as the volume of the (1/n)-wet part of K , defined as the union of all caps of K (a cap being the intersection of K with a half space) of volume at most 1/n. This reduces the initial probabilistic problem to computation of such a deterministic volume, which is a specific analytic problem that was extensively studied. When K has a smooth boundary, a key point was the introduction of the affine surface area, see [16, 18], which leads to the result that the expected missing volume is of the order n
−2/(d+1). When K is a polytope, it is of the order (ln n)
d−1/n [3]. In addition, [3]
proves that the expected missing volume, in dimension d, is minimal for simple polytopes, and maximal for ellipsoids. As a conclusion, the properties of the expected missing volume are now very well-understood. Much less is known about its higher moments and deviation probabilities. In particular, using a jackknife inequality for symmetric functions of n random variables, Reitzner [12] proved that if K is a d-dimensional smooth convex body, the variance of the missing volume is bounded from above by n
−(d+3)/(d+1), and he conjectured that this is the actual order of magnitude for the variance. In addition, he proved that the second moment of the missing volume is exactly of the order n
−4/(d+1), with explicit constants in terms of the affine surface area of K. Vu [19] obtained deviation inequalities for general convex bodies of volume one, involving quantities such as the volume of the wet part, and derived precise deviation inequalities in the cases when K is a polytope, and when it has a smooth boundary. These inequalities involve constants which depend on K in an unknown way. The main tools are martingale inequalities and, as a consequence, upper bounds on the moments of the missing volume are proved, again with implicit constants depending on K.
Let V
nstand for the missing volume of the convex hull of n i.i.d. points uniformly distributed in a convex body K. If K has volume one, there exist positive constants α, c and ǫ
0such that:
P h
| V
n−
E[V
n] | ≥ √ λv
i≤ 2e
−λ/4+ e
−cǫn,
∀ ǫ ∈ α(ln n)/n, ǫ
0, λ ∈ 0, n | K(ǫ) | ,
where v = 36ng(ǫ)
2| K(ǫ) | , g(ǫ) = sup {| F | : F star-shaped ⊆ K(ǫ) } , and K(ǫ) is the ǫ-wet part of K defined in [3]. If K has a smooth boundary and
1Ad-dimensional simple polytope is a convex polytope such that each of its vertices is adjacent to exactlydedges.
is of volume one, Vu [19] showed the existence of positive constants c and α, which depend on K, such that for any λ ∈
0, (α/4)n
−(d+1)(3d+5)(d−1)(d+3)
, the following holds:
P
"
| V
n−
E[V
n] | ≥
qαλn
−d+3d+1#
≤ 2 exp( − λ/4) + exp
− cn
3d+5d−1. This last inequality allows one to derive upper bounds on the variance and on the q-th moment of the missing volume, respectively of orders n
−(d+3)/(d+1)and n
−2q/(d+1), for q > 0, for a smooth convex body K of volume one, up to constant factors depending on K in an unknown way. Note that these two inequalities proved by Vu remain true for K of any positive volume, if
| V
n−
E[V
n] | is replaced by | V
n−
E[V
n] | / | K | . The aim of the present paper is to prove a universal deviation inequality and upper bounds on the moments of the missing volume, i.e., with no restriction on the volume and boundary structure of K, and with constants which do not depend on K. The only assumptions on K are compactness and convexity.
In addition, we derive, from our deviation inequality, moment inequali- ties for the number of vertices of the random convex hull. To our knowledge, these inequalities are new, and the corresponding rates turn to be tight, uni- formly on all convex bodies.
2 Statement of the problem and notation
Let d ≥ 2 be an integer. We denote by | · | the Lebesgue measure in
Rd, ρ the Euclidean distance in
Rd, B
dthe unit Euclidean ball with center 0, and β
dits volume.
If G ⊆
Rdand ǫ > 0, we denote by G
ǫ= { x ∈
Rd: ρ(x, G) ≤ ǫ } the closed ǫ-neighborhood of G. Here, ρ(x, G) = inf
y∈G
ρ(x, y).
When G
1and G
2are two subsets of
Rd, we denote by G
1△ G
2their symmetric difference, and the Hausdorff distance between G
1and G
2is defined as:
d
H(G
1, G
2) = inf { ǫ > 0 : G
1⊆ G
ǫ2, G
2⊆ G
ǫ1} .
For brevity, we call a convex body a compact and convex subset of
Rdwith
positive Lebesgue measure. We denote by K
dthe class of all convex bodies
in
Rd, and by K
1dthe set of all convex bodies that are included in B
d. For a given K ∈ K
d, consider a sample of n i.i.d. random points X
1, . . . , X
n, uniformly distributed in K. We denote by ˆ K
nthe convex hull of X
1, . . . , X
n. This is a random polytope whose missing volume is denoted by V
n, i.e., V
n= | K \ K ˆ
n| . We denote respectively by
PKand
EKthe joint probability measure of (X
1, . . . , X
n) and the corresponding expectation operator. We are interested in deviation inequalities for V
n, i.e., in bounding from above the probability
PK
[V
n> ǫ | K | ],
for ǫ > 0. This yields, as a consequence, upper bounds for the moments
EK[V
nq], q > 0. In order to obtain a deviation inequality, we use the metric entropy of the class K
d. We first prove that it is sufficient to obtain a devia- tion inequality for K ∈ K
1d, by a scaling argument. The deviation inequality that we prove is uniform on the class K
1d, hence it is of much interest in a statistical framework. If one aims to recover K from the observation of the sample points X
1, . . . , X
n, using ˆ K
nas an estimator, the risk, measured in terms of the Nikodym distance (defined as the Lebesgue measure of the symmetric difference), can be bounded from above uniformly on K
d, with no assumption on the volume, boundary structure and location in
Rdof K.
3 Deviation inequality for random polytopes
Theorem 1.
There exist two positive constants C
1and C
2, which depend on d only, such that:
sup
K∈Kd
PK
"
n | K \ K ˆ
n|
| K | − C
2n
−2/(d+1)!
> x
#
≤ C
1e
−x/(dd), ∀ x > 0. (1) Theorem 1 involves constants which depend at least exponentially on the dimension d. This seems to be the price for getting a uniform deviation inequality on K
d. Note that the missing volume is normalized here by the volume of K . Theorem 1 may be refined by normalizing the missing volume by another functional of K, which could be expressed in terms of the affine surface area of K, as in [17] where only the first moment of V
nis considered.
Theorem 1 allows one to derive upper bounds for all the moments of the
missing volume. Indeed, applying Fubini’s theorem leads to the following
corollary.
Corollary 1.
For every positive number q, there exists some positive con- stant A
q, which depends on d and q only, such that
EK h
| K \ K ˆ
n|
qi≤ A
q| K |
qn
−2q/(d+1), ∀ K ∈ K
d. (2) Note that no restriction is made on K except for its compactness and convexity. In particular, its boundary may not be smooth, and K may be located anywhere in the space, not necessarily in some given compact set.
In this sense, the exponential deviation inequality (1) and the inequality on the moments (2) are universal. Combining this corollary with the lower bound for the minimax risk given in [11] in a statistical framework yields the following result.
Corollary 2.
For every positive number q, there exist some positive con- stants a
qand A
q, which depend on d and q only, such that
a
qn
−2q/(d+1)≤ sup
K∈Kd
EK
"
| K \ K ˆ
n|
| K |
!q#
≤ A
qn
−2q/(d+1). In order to prove Theorem 1, we first state two lemmas.
Lemma 1.
Let K ∈ K
d. There exists an ellipsoid E in
Rdsuch that K ⊆ E and | E | ≤ d
d| K | .
Proof of Lemma 1 can be found in [10] and [8]. The second lemma is based on the Steiner formula for convex bodies. It shows that on K
d1, the Nikodym distance is bounded from above by the Hausdorff distance, up to some positive constant.
Lemma 2.
There exists some positive constant α
1which depends on d only, such that
| G △ G
′| ≤ α
1d
H(G, G
′), ∀ G, G
′∈ K
1d.
Proof. Let G ∈ K
d. Steiner formula (see Section 4.1 in [15]) states that there exist some positive numbers L
1(G), . . . , L
d(G), such that
| G
λ\ G | =
d
X
j=1
L
j(G)λ
j, λ ≥ 0. (3) Besides the L
j(G), j = 1, . . . , d are increasing functions of G. In particular, if G ∈ K
1d, then L
j(G) ≤ L
j(B
d).
Let G, G
′∈ K
1d, and let λ = d
H(G, G
′). Since G and G
′are included in the
unit ball, λ is not greater than its diameter, so λ ≤ 2. By definition of the Hausdorff distance, G ⊆ G
′λand G
′⊆ G
λ. Hence,
| G △ G
′| = | G \ G
′| + | G
′\ G | ≤ | G
′λ\ G
′| + | G
λ\ G |
≤ 2
d
X
j=1
L
j(B
d)λ
j≤ λ
d
X
j=1
L
j(B
d)2
j.
The Lemma is proved by setting α
1=
Pdj=1
L
j(B
d)2
j.
Note that since δ ≤ 1, Steiner formula (3) implies, for G ∈ K
d1, that
| G
δ\ G | ≤ α
2δ, (4)
where α
2=
Pdj=1
L
j(B
d).
Proof of Theorem 1
This proof is inspired by Theorem 1 in [9], which derives an upper bound on the risk of a convex hull type estimator of a convex function. Let K ∈ K
d. Let E be an ellipsoid which satisfies the properties of Lemma 1, and T an affine transform in
Rdwhich maps E to the unit ball B
d. Note that β
d= | det T || E | , so T is invertible. Let us denote K
′= T (K) and X
i′= T (X
i), i = 1, . . . , n. Let ˆ K
n′be the convex hull of X
1′, . . . , X
n′. By the definition of T , the following properties hold :
(i) K
′∈ K
1d,
(ii) X
1′, . . . , X
n′are i.i.d. uniformly distributed in K
′, (iii) T ( ˆ K
n) = ˆ K
n′.
Furthermore, one has the following:
| K \ K ˆ
n|
| K | = | K
′\ K ˆ
n′|
| det T || K | = | K
′\ K ˆ
n′| | E | β
d| K | ≤ d
dβ
d| K
′\ K ˆ
n′| . (5) Let δ = n
−2/(d+1). A δ-net of K
1d, for the Hausdorff distance, is a collection of subsets of K
1dsuch that for each G ∈ K
1d, there is G
∗in this collection of sets which satisfies d
H(G, G
∗) ≤ δ. Bronshtein [4] showed that there exists a finite δ-net, of cardinality N
δ≤ C
1δ
−d−12for some positive constant C
1. Let { G
1, . . . , G
Nδ} be such a δ-net. Let j
∗, ˆ j ∈ { 1, . . . , N
δ} be such that:
d
H(K
′, G
j∗) ≤ δ and d
H( ˆ K
n′, G
ˆj) ≤ δ.
Let ε > 0. By (5) and (ii),
PK"
| K \ K ˆ
n|
| K | > ε
#
≤
PK′
| K
′\ K ˆ
n′| > β
dd
dε
. (6)
Let us recall that if G, G
′and G
′′are three Borel subsets of
Rd, then the following triangle inequality holds:
| G \ G
′′| ≤ | G \ G
′| + | G
′\ G
′′| . (7) Thus, | K
′\ K ˆ
n′| ≤ | K
′\ G
j∗| + | G
j∗\ G
ˆj| + | G
ˆj\ K ˆ
n′| and, by the definition of j
∗and ˆ j, and by Lemma 2 and (6),
PK
"
| K \ K ˆ
n|
| K | > ε
#
≤
PK′
| G
j∗\ G
ˆj| > β
dd
dε − 2α
1δ
. (8)
Set ε
′=
βdddε − 2α
1δ. (8) implies
PK"
| K \ K ˆ
n|
| K | > ε
#
≤
Xj=1,...,Nδ:|Gj∗\Gj|>ε′
PK′h
ˆ j = j
i. (9)
Let j ∈ { 1, . . . , N
δ} be fixed, such that | G
j∗\ G
j| > ε
′. Recall that ˆ K
n′⊆ G
δˆj
, and thus if ˆ j = j, then X
i′∈ G
δj, i = 1, . . . , n. So,
PK′
h
ˆ j = j
i≤
PK′h
X
1′∈ G
δjin≤ 1 − | K
′\ G
δj|
| K
′|
!n
≤
1 − 1
β
d( | G
j∗\ G
j| − | G
j∗\ K
′| − | G
δj\ G
j| )
n,
using the triangle inequality (7) and the fact that | K
′| ≤ β
d. Denote by I
ε′= 1 if ε
′< β
d, and 0 otherwise. Continuing (9), and using (4), one gets:
PK
"
| K \ K ˆ
n|
| K | > ε
#
≤
Xj=1,...,Nδ:|Gj∗\Gj|>ε′
1 − ε
′β
d+ (α
1+ α
2)δ β
dn
≤ N
δ1 − ε
′β
d+ (α
1+ α
2)δ β
dn
I
ε′≤ C
1exp
δ
−d−12− εn
d
d+ (3α
1+ α
2)δn β
d
≤ C
1exp
α
3δn − εn d
d,
where α
3= 1+
3α1β+α2d
is a positive constant which depends on d only (recall that δ
−d−12= δn). Finally, by choosing ε = α
3d
dδ + x/n, for any x > 0, and by setting the constant C
2= α
3d
d, one gets (1).
4 Moment inequalities for the number of vertices of random polytopes
Let K ∈ K
d. Denote by V
nthe set of vertices of ˆ K
n, and by R
nthe cardinality of V
n, for n ∈
N∗. Efron [6] proved a simple but elegant identity, which connects the normalized expected missing volume
EKh|K\Kˆn|
|K|
i
and the expected number of vertices
EK[R
n+1] of the random polytope ˆ K
n+1. Namely, one has
EK
"
| K \ K ˆ
n|
| K |
#
=
EK[R
n+1]
n + 1 , ∀ n ∈
N∗. (10) This identity is extended to higher moments of | K \ K ˆ
n| in the following theorem.
Lemma 3.
Let K ∈ K
d, n and q be positive integers,
EK[R
n+q(R
n+q− 1) . . . (R
n+q− q + 1)]
≤
EK"
| K \ K ˆ
n|
| K |
!q#
(n + q)(n + q − 1) . . . (n + 1). (11) Combining Corollary 1 and Theorem 3 yields the following inequality:
EK
[R
n(R
n− 1) . . . (R
n− q + 1)] ≤ A
qn
q(d−1)d+1, ∀ n ∈
N∗, ∀ q ∈
N∗, where A
qis the same constant as in Corollary 1. Since the polynomial x
qis a linear combination of the polynomials x(x − 1) . . . (x − k + 1), 0 ≤ k ≤ q, we get the following inequality:
sup
K∈Kd
EK
[R
qn] ≤ B
qn
q(d−1)d+1, (12)
for some positive constant B
qwhich depends on d and q only.
Let K ∈ K
dand n, q be positive integers. By H¨older inequality,
EK[R
qn] ≥
EK[R
n]
qand, by Efron’s identity (10),
EK
[R
qn] ≥ n
qEK"
| K \ K ˆ
n−1|
| K |
#q
.
If the affine surface area of K is positive, which occurs, for instance, when the boundary of K is smooth with positive Gauss curvature, then it is known that
EKh|K\Kˆn−1|
|K|
i
is exactly of the order of n
−2/(d+1)(see [16]). It is shown in [7] that
EKh|K\Kˆn−1|
|K|
i
is maximal when K is an ellipsoid, and and the corresponding value is exactly of the order of n
−2/(d+1), up to a constant factor which depends on the dimension d only. Therefore, for smooth convex bodies K with positive curvature, the rate of the upper bound in (12) is tight, and one gets the following theorem:
Theorem 2.
Let n and q be positive integers. Then, for some positive constants b
qand B
qwhich depend on d and q only,
b
qn
q(dd+1−1)≤ sup
K∈Kd
EK
[R
qn] ≤ B
qn
q(dd+1−1).
Proof of Lemma 3
Let K ∈ K
dand q be a positive integer, and let X
1, X
2, . . . a sequence of i.i.d. random variables uniformly distributed in K. For precision’s sake, we denote by
P⊗nK
the n-product of the probability measure
PK, i.e., the joint probability measure of the random variables X
1, . . . , X
n, n ∈
N∗. The corresponding expectation operator is denoted by
E⊗nK
. Denote by
1the indicator function, which applies to an event. First, note that the expectation
E⊗nK
[ | K \ K ˆ
n|
q] can be rewritten as the integral:
E⊗n
K
[ | K \ K ˆ
n|
q] =
E⊗nK
Z
Kq1
(x
n+1∈ / K ˆ
n, . . . , x
n+q∈ / K ˆ
n)dx
n+1. . . dx
n+q,
and therefore,
E⊗nK
[ | K \ K ˆ
n|
q] = | K |
qP⊗n+qK
h
X
n+j∈ / K ˆ
n, ∀ j = 1, . . . , q
i. (13)
Since the event { X
n+j∈ / K ˆ
n, ∀ j = 1, . . . , q } contains { X
n+j∈ V
n+q, ∀ j = 1, . . . , q } , (13) becomes
E⊗n
K
[ | K \ K ˆ
n|
q] ≥ | K |
qP⊗n+qK
[X
n+j∈ V
n+q, ∀ j = 1, . . . , q]
= | K |
qn+q q
X
1≤i1<...<iq≤n+q
P⊗n+q
K
X
ij∈ V
n+q, ∀ j = 1, . . . , q
= | K |
qn+q q
E⊗n+q
K
X
1≤i1<...<iq≤n+q
1
X
ij∈ V
n+q, ∀ j = 1, . . . , q
= | K |
qn+q q
E⊗n+q
K
R
n+qq
= | K |
qE⊗n+qK