January 2008, Vol. 12, p. 127–153 www.esaim-ps.org
DOI: 10.1051/ps:2007044
DISTORTION MISMATCH IN THE QUANTIZATION OF PROBABILITY MEASURES
Siegfried Graf
1, Harald Luschgy
2and Gilles Pag` es
3Abstract. We elucidate the asymptotics of the Ls-quantization error induced by a sequence ofLr- optimaln-quantizers of a probability distributionP onRd when s > r. In particular we show that under natural assumptions, the optimal rate is preserved as long ass < r+d(and for everys in the case of a compactly supported distribution). We derive some applications of these results to the error bounds for quantization based cubature formulae in numerical integration onRd and on the Wiener space.
Mathematics Subject Classification. 60G15, 60G35, 41A25.
Received June 2, 2006. Revised November 24, 2006.
1. Introduction
Optimal quantization is devoted to the best approximation in LrRd(P) (r > 0) of a random vector X : (Ω,A,P) → Rd by random vectors taking finitely many values in Rd (endowed with a norm .). When X ∈ Lr(P), this leads for every n ≥ 1 to the following n-level Lr(P)-optimal quantization problem for the random vectorX defined by
en,r(X) := inf
X−q(X)r, q:Rd→Rd,Borel function, card(q(Rd))≤n
. (1.1)
One shows that the above infimum can be taken over the Borel functions q:Rd→α:=q(Rd), α⊂Rd, cardα≤n which are someprojection following the nearest neighbour rule on their imagei.e.
q(x) =
a∈α
a1Va(α)(x),
Keywords and phrases. Optimal quantization, Zador Theorem.
1 Universit¨at Passau, Fakult¨at f¨ur Informatik und Mathematik, 94030 Passau, Germany;[email protected]
2 Universit¨at Trier, FB IV-Mathematik, 54286 Trier, Germany;[email protected]
3Laboratoire de Probabilit´es et Mod`eles al´eatoires, UMR 7599, Universit´e Paris 6, case 188, 4, pl. Jussieu, 75252 Paris cedex 5, France;[email protected]
c EDP Sciences, SMAI 2008
Article published by EDP Sciences and available at http://www.esaim-ps.org or http://dx.doi.org/10.1051/ps:2007044
(Va(α))a∈α being a Borel partition ofRdsatisfying Va(α)⊂
x∈Rd : x−a= min
b∈αx−b
. The setα=q(Rd) is (also) called a Voronoin-quantizer and one denotes
Xα:=q(X).
Then, ifd(x, α) := mina∈αx−adenotes the distance ofxto the setα, one has X−Xαrr =Ed(X, α)r=
Rdd(x, α)rPX(dx)
which shows thaten,r(X) actually only depends on the distributionP =PX ofX so that
en,r(X) =en,r(P) = inf
card(α)≤n
d(x, α)rdP(x)
1 r
. The first two basic results in optimal quantization theory are the following (see [6]):
– The above infimum is in fact a minimum: there exists for every n ≥ 1 (at least) one Lr(P)-optimal n-quantizerα∗n. If supp(P) is infinite, then card(α∗n) =n.
– Zador’s Theorem: IfX∈Lr+η(P)i.e.
Rdxr+ηdP(x)<+∞for some η >0, then limn n1den,r(P) = (Qr(P))1r∈R+.
A more explicit expression is known for the real constantQr(P) (see (2.3) below). In particular,Qr(P)>0 if and only if P has an absolutely continuous part (with respect to the Lebesgue measure λd onRd). When P has an absolutely continuous part, a sequence (αn)n≥1 ofn-quantizers isLr-rate optimalforP if
lim sup
n
ndr
Rdd(x, αn)rPX(dx)<+∞
and isasymptotically Lr-optimalif lim
n
Rdd(x, αn)rPX(dx) en,r(P)r = 1.
Our aim in this paper is to deeply investigate the (asymptotic)Ls-quantization error induced by a sequence (αn)n≥1ofLr-optimaln-quantizers. It follows from the monotony ofs→ .sthat (αn)n≥1remains anLs-rate optimal sequence as long ass≤r. As soon ass > rno such straightforward answer is available (except for the uniform distribution over the unit interval since the sequence ((2k2−1n )1≤k≤n)n≥1isLr-optimal for everyr >0).
Our main motivation for investigating this problem comes from the recent application of optimal quantization to numerical integration (see [11]) and to the computation of conditional expectation (e.g. for the pricing of American options, see [1]). Let us consider for the sake of simplicity the case of the error bound in the quantization based cubature formulae for numerical integration. Let F : Rd → R be a C1 function with a Lipschitz continuous differentialDF. It follows from a simple Taylor expansion (see [11]) that for any random vectorX with distributionP =PX quantized byXα(α⊂Rd)
E(F(X))−E(F(Xα))−E(DF(Xα).(X−Xα))≤[DF]LipE|X−Xα|2
where [DF]Lip denotes the Lipschitz coefficient ofDF. Ifαis anL2-optimal (orquadratic) quantizer then it is stationary (see [11] or [6]) so that
Xα=E(X|Xα) which makes the first order term vanish since
E(DF(Xα).(X−Xα)) =E(DF(Xα).E(X−Xα|Xα)) = 0.
Finally, if (αn)n≥1is a sequence ofquadraticoptimaln-quantizers
E(F(X))−E(F(Xαn))≤[DF]Lip(en,2(P))2∼[DF]LipQ2(P)n−2d. (1.2) Now, if the HessianD2F does exist, is ρ-H¨older (ρ∈(0,1]) and computable, the same approach yields
E(F(X))−E(F(Xαn))−E((X−Xαn)∗D2F(Xαn)(X−Xαn))≤[D2F]ρX−Xαn2+2+ρρ. (1.3) Consequently elucidating the asymptotic behaviour ofX−Xαn2+ρ= (
Rdd(x, αn)2+ρdP(x))2+ρ1 is necessary to evaluate to what extend the cubature formula in (1.3) does improve the former one (1.2). Similar problems occur when evaluating the error in the first order quantization based scheme designed for the pricing of multi- asset American options or for non-linear filtering (see [2, 13]). One also meets such mismatch problems in infinite dimensions when dealing with (product) functional quantization on the Wiener space in order to price path-dependent European options (see the example in Sect. 6 and [12]).
The paper is organized as follows: in Section 2, a lower bound for theLs(P)-quantization rate of convergence of an asymptoticallyLr-optimal sequence (αn)n≥1ofn-quantizers is established. In particular this result implies that for absolutely continuous distributionsPwith unbounded support, the quantization raten−d1 inLscannot be preserved as soon ass > r+d. Ifs≤r+d, then the lower bound can be finite. We conjecture that, when (αn)n≥1 is Lr-rate optimal the lower bound is in fact the sharp rate. In Section 3, several natural criteria on the distributionP are derived. They ensure that (αn)n≥1isLs-rate optimal for a givens∈(r, r+d) or even for alls∈(r, r+d). Our criteria are applied to many parametrized families of distributions onRd. We investigate by the same method in Section 4 the critical cases=r+dand the super-critical cases > r+d. In Section 5 we show that for compactly supported distributions on the real line the lower bound obtained in Section 2 does hold as a sharp rate. Finally, in Section 6 we apply our results to the evaluation of errors in numerical integration by quantization based cubature formulae in finite and infinite dimensions.
Notations : • · will denote a norm onRd andB(x, r) will denote the closed ball centred atxwith radius r >0 (with respect to this norm),d(x, A) will denote the distance betweenx∈Rd and a subsetA⊂Rd.
• λd will denote the Lebesgue measure onRd (equipped with its Borelσ-fieldB(Rd)).
•Let (an)n≥0and (bn)n≥0be two sequences of positive real numbers. The symbolanbn is foran=O(bn) andbn=O(an) whereas the symbol an ∼bn meansan=bn+o(bn) asn→ ∞.
• xis for the integral part of the real numberx.
• f ∝g means that the functionsf andg are proportional.
2. The lower estimate
In this section we derive an explicit lower bound in the (r, s)-problem for non-purely singular probability distributionsP and asymptoticallyLr-optimal quantizers. This bound is expected to be best possible.
Letr∈(0,∞). Let P be a probability measure on (Rd,B(Rd)) satisfying
RdxrdP(x)<+∞ (2.1)
and supp(P) is not finite. Thenen,r(P)∈(0,∞) for everynanden,r(P)→0 asn→ ∞. A sequence (αn)n≥1
of quantizers is called asymptoticallyLr-optimal forP if cardαn≤nfor everynand
Rdd(x, αn)rdP(x)∼en,r(P)r as n→ ∞. (2.2) LetPa =f.λd denote the absolutely continuous part ofP with respect toλd. Assume that
Rdxr+ηdP(x)<
+∞for someη >0. Then by the Zador Theorem (see [6])
nlim→∞nr/den,r(P)r=Qr(P) (2.3)
where
Qr(P) :=Jr,d
Rdfd/(d+r)dλd
(d+r)/d
∈[0,∞) (2.4)
and
Jr,d:= inf
n≥1nr/den,r(U([0,1]d))r∈(0,∞),
(U([0,1]d) denotes the uniform distribution on the hyper-cube [0,1]d). This theorem was first stated by Zador in [14, 15] and then generalized by Bucklew and Wise (see [3]), the first completely rigourous proof has been proposed by Graf & Luschgy in [6]. Note that the finiteness of
Rdfd/(d+r)dλd is a simple consequence of the H¨older Inequality and the moment assumption
Rdxr+ηdP(x)<+∞: first note that
x≤1fd+rd dλd <+∞
sinceλd(x ≤1)<+∞and d+dr≤1. Then setting p= 1 +r/d,q= 1 +d/r,α= (r+η)d/(d+r)
B(0,1)c
fd+rd dλd =
x>1
x−αxαfd+rd (x)dλd(x)
≤
x>1
x−αqdλd(x)
1/q
xαpf(x)dλd(x)
1/p
=
x>1
x−(1+η/r)ddλd(x)
1/q
xr+ηdP(x)
1/p
<+∞.
Furthermore, for probabilities P on Rd with Pa = 0, the empirical measures associated to an asymptotically Lr-optimal sequence (αn)n≥1ofn-quantizers satisfy (see [6] Th. 7.5 or [4] for this slight extension)
1 n
a∈αn
δa
−→w Pr (2.5)
wherePr denotes theLr-point density measure ofP defined by
Pr:=fr.λd with fr:= fd/(d+r) fd/(d+r)dλd
. (2.6)
Note that the limitQr(P) in the Zador Theorem reads Qr(P) =Jr,d
fr−r/ddPa.
The quantity that naturally comes out in the (r, s)-problem,r, s∈(0,∞), is Qr,s(P) := Js,d
fr−s/ddPa (2.7)
= Js,d
Rdfd/(d+r)dλd s/d
{f >0}f1−s/(d+r)dλd∈(0,+∞].
Theorem 1. Letr, s∈(0,∞). AssumePa = 0and
Rdxr+ηdP(x)<+∞ for someη >0. Let(αn)n≥1 be an asymptotically Lr-optimal sequence of n-quantizers forP. Then
lim inf
n→∞ ns/d
d(x, αn)sdP(x)≥Qr,s(P). (2.8) Prior to the proof, let us provide a few comments on this lower bound.
Comments. • The main corollary that can be directly derived from Theorem 1 is that
{f >0}f1−s/(d+r)dλd= +∞=⇒ lim
n→∞ns/d
d(x, αn)sdP(x) = +∞
since thenQr,s(P) = +∞.
By contraposition, a necessary condition for an asymptotically Lr-optimal sequence of quantizers (αn) to achieve the optimal raten−s/d for the Ls-quantization error is thatQr,s(P)<+∞. But, under the moment assumption of Theorem 1 the following equivalence holds true
Qr,s(P)<+∞ ⇐⇒
f−d+rs dPa=
{f >0}f1−d+rs dλd<+∞ (2.9) since
Rdfd/(d+r)dλd<+∞.
In turn, for probability measuresP satisfyingλd(f >0) = +∞a necessary condition for the right hand side of (2.9) to be satisfied is that
s < d+r. (2.10)
Indeed, if s≥d+r, the following chain of inequalities holds true
λd(f >0) =
1{f >0}f−1dPa ≤ 1{f >0}f−1d+rs dPa
d+r s
=
{f >0} f1−s/(d+r)dλd
d+r
s
where we used thatp→ .Lp(Pa)is non-decreasing sincePa(Rd)≤1.
On the other hand, still whens < d+r, the following criterion holds for the finiteness ofQr,s(P):
∃ϑ >0,
Rdxds/(d+r−s)+ϑdP(x)<+∞ =⇒Qr,s(P)<+∞. (2.11)
Set ρ= 1− d+sr∈(0,1) and u= d+dsr−s+ϑ. Then (2.11) follows from the regular H¨older inequality applied with ˜p=ρ1 =d+d+r−rs and ˜q=1−1ρ =d+sr−s,
B(0,1)c
fρdλd ≤
B(0,1)c
(f(x)ρxuρ)p˜dλd(x)
1/p˜
B(0,1)c
x−uρq˜dλd(x) 1/q˜
=
B(0,1)c
f(x)xudλd(x) ρ
B(0,1)c
x−uρ/(1−ρ)dλd(x) 1−ρ
<+∞
using the moment assumption in (2.11) anduρ/(1−ρ) =d+ϑ1−ρρ > d.
• It is generally not true in the general setting of Theorem 1 that lim
n→∞ns/d
d(x, αn)sdP(x) =Qr,s(P) (see Counter-Example 2 in Sect. 3). However, one may reasonably conjecture that this limiting result holds true for sequences (αn) of exactly Lr-optimal n-quantizers. Our result in one dimension for compactly supported distributions (see Sect. 5) supports this conjecture.
• In any case, note that (2.8) improves the obvious lower bound lim inf
n→∞ ns/d
d(x, αn)sdP(x)≥lim inf
n→∞ ns/den,s(P)s≥Qs(P).
(The right inequality needs no moment assumption onPas can be checked from the proof of the Zador Theorem, see [6].) In fact, one even has that, for everyr, s∈(0,+∞),
Qr,s(P)≥Qs(P).
Furthermore, this inequality is strict whenr=s(except if f isλd-a.e.constant on{f >0}). Let us provide a short proof of this fact. Setp= (d+s)/s >1,q= (d+s)/d >anda=ds/(d+r)(d+s),b= (d+r−s)d/(d+ r)(d+s). Then the H¨older inequality yields
(Qs(P))d+sd =
fd/(d+s)dλd=
{f >0}fafbdλd
≤
fapdλd
1/p
{f >0}fbqdλd
1/q
(“<” iffapandfbq are not proportional)
=
fd/(d+r)dλd
s/(d+s)
{f >0}f1−d+rs dλd
d/(d+s)
= (Qr,s(P))d+sd .
Proof of Theorem 1. First keep in mind that, the r+η-moment assumption on P implies the finiteness of fd+rd dλd. The existence of at least one asymptoticallyLr-optimal sequence (αn)n≥1follows from the existence of anr-moment forP. For every integerm≥1, set
fm:=
m2m−1 k=0
k 2m1Em
k with Emk =
k
2m ≤f < k+ 1 2m
∩B(0, m).
The sequence (fm)m≥1 is non-decreasing and converges tof1{0≤f <+∞}=f λd-a.e.
LetIm:={k∈ {0, . . . , m2m−1} : λd(Emk )>0}.
For everyk∈Im, there exists a closed set Amk ⊂Ekm satisfying λd(Ekm\Amk )≤ 1
m32m·
Let εm∈(0,1] be a positive real number such that the closed setsAmk :={x∈Rd : d(x, Amk )≤εm}, k∈Im,
satisfy
Am
k
fd+rd dλd≤(1 + 1/m)
Am
k
fd+rd dλd<+∞.
Set
fm:=
m2m−1 k=0
k 2m1
Amk. It is clear that
{fm=fm} ⊂
0≤k≤m2m−1
(Emk \Amk ).
Hence
λd({fm=fm})≤
m2m−1 k=0
1
m32m = 1 m2
so that
m≥1
1{f
m=fm}<+∞ λd-a.e.
i.e., for λd-a.e. x, fm(x) = fm(x) for large enoughm so that fm converges to f λd-a.e.. Finally, as a result fm≤fm≤f andfmconverges tof λd-a.e. Then, for everyn≥1,
nsd
Rd(d(x, αn))sdP(x) ≥ nsd
Rd(d(x, αn))sfm(x)dλd(x)
= nsd
m2m−1 k=0
k 2m
Am
k
(d(x, αn))sdλd(x). (2.12)
Since all the sets Amk, k = 0, . . . , m2m−1 are bounded (as subsets of B(0, m+ 1)), there exists for every m ≥1 and every k∈ {0, . . . , m2m−1} a finite “firewall”βkm ⊂Rd (see [6] or Lem. 4.3 in [4] and note that Amk ⊂(Amk)εm/2:={x∈Rd : d(x,(Amk)c)> εm/2}) such that
∀n≥1, ∀x∈Amk, d(x, αn∪βkm) =d(x,(αn∪βkm)∩Amk ).
Setβm=∪0≤k≤m2m−1βmk . Then, for everyk∈ {0, . . . , m2m−1}, for everyx∈Amk ,
d(x, αn)≥d(x, αn∪βkm) =d(x,(αn∪βkm)∩Amk )≥d(x,(αn∪βm)∩Amk ).
Set temporarilynmk := card((αn∪βm)∩Amk). First note that it is clear that nmk
n ∼card(αn∩Amk)
n asn→ ∞.
It follows from the asymptoticLr-optimality of the sequence (αn) and the empirical measure theorem (see (2.5)) that
lim sup
n
card(αn∩Amk)
n ≤
Amk fd+rd dλd
fd+rd dλd
(2.13)
so that
lim inf
n
n nmk ≥
fd+rd dλd
Amk fd+rd dλd
≥ m m+ 1
fd+rd dλd
Am
k fd+rd dλd
. On the other hand, for everyk∈Im,
Am
k
(d(x, αn))sdλd(x)≥
Am
k
d(x,(αn∪βm)∩Amk)sdλd(x)≥λd(Amk )esnm
k,s(U(Amk ))
whereU(Amk ) denotes the uniform distribution overAmk (note that the inequality is trivial whenλd(Amk ) = 0).
Then one may apply Zador’s theorem which yields, combined with (2.13), lim inf
n nds
Amk
(d(x, αn))sdλd(x) ≥ λd(Amk )×lim inf
n
n nmk
s d×lim
nm
k
((nmk )sdenm
k(U(Amk)))s
≥ λd(Amk )×
⎛
⎝ m m+ 1
fd+rd dλd
Amk fd+rd dλd
⎞
⎠
s d
Js,d×(λd(Amk))ds
≥ Js,d
m m+ 1
sd
fd+rd dλd
sd
⎛
⎝ λd(Amk )
Am
k fd+rd dλd
⎞
⎠
ds
λd(Amk )
≥ Js,d
m m+ 1
s
d
fd+rd dλd
s d
k+ 1 2m
−d+rs
λd(Amk ) with the convention 00 = 0.
Consequently, using (2.12) and the super-additivity of lim inf yield that, for everym≥1,
lim inf
n nsd
Rd(d(x, αn))sdP(x) ≥ Js,d
m m+ 1
s
d
Rdfd+rd dλd
s dm2m−1
k=0
k 2m
k+ 1 2m
−d+rs
λd(Amk)
= Js,d
m m+ 1
s
d
Rdfd+rd dλd
s d
{f >0}
fm(fm+ 2−m)−d+rs dλd.
Now, by Fatou’s Lemma, one concludes by lettingmgo to infinity that lim inf
n nsd
Rd(d(x, αn))sdP(x)≥Js,d
Rdfd+rd dλd
sd
{f >0}f1−d+rs dλd.
3. The upper estimate
Let r, s ∈ (0,∞). In this section we investigate whether the upper bound
d(x, αn)sdP(x) = O(n−s/d) for Lr-optimal n-quantizersαn holds true. (This is of course less precise than the lower bound given in the previous section.) The reason for the restriction to (exactly) optimaln-quantizers (whens > r) instead of only asymptotically optimaln-quantizers will become clear soon. Seee.g.the subsequent Example 2. First note that the Lp(P)-norms being non-decreasing as a function ofp, the above upper bound trivially holds for s∈(0, r]
since
ns/d
d(x, αn)sdP(x)≤
nr/d
d(x, αn)rdP(x)
sr
.
The same argument shows that when this rate holds for somes >0, then it holds for everys ∈(0, s].
For a sequence (αn)n≥1 of finite codebooks in Rd and b ∈ (0,∞) we introduce the maximal function ψb:Rd →R+∪ {∞}by
ψb(x) := sup
n≥1
λd(B(x, bd(x, αn)))
P(B(x, bd(x, αn))) (3.1)
(with the interpretation 00 := 0). Note thatψb is Borel-measurable and depends on the underlying norm onRd. The theorem below provides a criterion based on these maximal functions that ensures theLs-rate optimality of Lr-optimal n-quantizers. In Corollaries 1, 3, 4 we derive more applicable criteria which only involve the distributionP.
Theorem 2. Let r, s∈(0,∞). Assume Pa = 0and
xr+ηdP(x)<+∞for some η >0. For every n≥1, letαn be anLr-optimaln-quantizer forP. Assume that the maximal function associated with the sequence(αn) satisfies
ψbs/(d+r)∈L1(P) (3.2)
for someb∈(0,1/2). Then
sup
n
ns/d
d(x, αn)sdP(x)<+∞. (3.3)
Proof. Let y ∈ Rd and set δ = δn = d(y, αn). For every x ∈ B(y, δ/2) and a∈ αn, we have x−a ≥ y−a − x−y ≥δ/2 and hence
d(x, αn)≥δ/2≥ x−y, x∈B(y, δ/2).
Letβ=βn=αn∪ {y}. Then
d(x, β) =x−y, x∈B(y, δ/2).
Consequently, for everyb∈(0,1/2),
en,r(P)r−en+1,r(P)r ≥
d(x, αn)rdP(x)−
d(x, β)rdP(x)
≥
B(y,δb)
(d(x, αn)r−d(x, β)r)dP(x)
=
B(y,δb)
(d(x, αn)r− x−yr)dP(x)
≥
B(y,δb)
((δ/2)r−(bδ)r)dP(x)
= ((1/2)r−br)δrP(B(y, bδ)).
One derives that
d(y, αn)r≤ C(b)
P(B(y, bd(y, αn))(en,r(P)r−en+1,r(P)r) (3.4) for everyy∈Rd, b∈(0,1/2), n≥1, where C(r, b) = ((1/2)r−br)−1. Note that en,r(P)r−en+1,r(P)r>0 for everyn∈N(see [6]).
Now we estimate the increments en,r(P)r−en+1,r(P)r. (This extends a corresponding estimate in [7] to distributions with possibly unbounded support.)
Seten,r =en,r(P) for convenience. Let{Va :a∈αn+1} with Va =Va(αn+1) be a Voronoi partition ofRd with respect to αn+1. ThenP(Va)>0 for alla∈αn+1 and cardαn+1=n+ 1 (see [6]),
card
a∈αn+1:
Va
x−ardP(x)> 4ern+1,r
n+ 1
≤ n+ 1 4
and
card
a∈αn+1 :P(Va)> 4 n+ 1
≤ n+ 1 4 · This implies that
βn+1:=
a∈αn+1:
Va
x−ardP(x)≤4ern+1,r
n+ 1 , P(Va)≤ 4 n+ 1
satisfies cardβn+1 ≥ (n+ 1)/2. Choose a closed hyper-cube K = [−m, m]d such that Pr(K) > 3/4. The empirical measure theorem (see (2.5) above or [4, 6] for details) implies
klim→∞
card(αk∩K)
k =Pr(K)
sincePr(∂K) =λd(∂K) = 0. We deduce card(αn+1∩K)≥3(n+ 1)/4 and hence card(βn+1∩K)≥(n+ 1)/4 for large enoughn. Since one can find a tessellation ofKinto [(n+ 1)/8]∨1 cubes of diameter less thanC1n−1/d, there exista1, a2∈βn+1,a1=a2 such that
a1−a2 ≤C1(r)n−1/d for everyn≥3. Letγ=αn+1\ {a1}. Using
d(x, γ)≤ x−a2 ≤ x−a1+a1−a2, one obtains
en,r(P)r−en+1,r(P)r ≤
d(x, γ)rdP(x)−
d(x, αn+1)rdP(x)
=
a∈γ
Va
x−ardP(x) +
Va1
d(x, γ)rdP(x)−
a∈αn+1
Va
x−ardP(x)
=
Va1
(d(x, γ)r− x−a1r)dP(x)
≤ (2r−1)
Va1
x−a1rdP(x) + 2ra1−a2rP(Va1)
≤ 4(2r−1)ern+1,r
n+ 1 +4·2rC1(r)rn−r/d
n+ 1 ·
Consequently, using (2.3), for everyn∈N,
en,r(P)r−en+1,r(P)r≤C2(r)n−(d+r)/d (3.5) whereC2(r) denotes a finite constant independent ofn. Combining (3.4) and (3.5), we get
ns/dd(x, αn)s ≤ C3(r, b)s
λd(B(x, bd(x, αn)) P(B(x, bd(x, αn))
s/(d+r)
(3.6)
≤ C3(r, b)sψb(x)s/(d+r) (3.7)
for every x∈ Rd, n∈N, b∈(0,1/2) and some finite constantC3(r, b). The proof is completed by integrating
both sides with respect toP.
Application to pointwise convergence rate. In the situation of Theorem 2, assumingP =Pa, but without assuming (3.2) (so thatsis not involved in that statement), one can deduce from (3.6) that
lim sup
n→∞ n1/dd(x, αn)≤C3(r, b)f−1/(d+r)<+∞ P(dx)-a.s. (3.8) sinced(x, αn)→0 P(dx)-a.s.(see [4]) implies in turn
P(B(x, bd(x, αn))
λd(B(x, bd(x, αn)) →f(x) P(dx)-a.s. as n→ ∞
by the differentiation of measures. This improves considerably for absolutely continuous distributions and (exactly)Lr-optimal quantizers an a.s. result in [4].
Next we observe that in cases∈(0, d+r) a local version of condition (3.2) is always satisfied.
Lemma 1. Assume
xrdP(x) < +∞ for some r ∈ (0,∞). Let (αn) be a sequence of finite codebooks in Rd satisfying
d(x, αn)rdP(x)→0. Then the associated maximal functions ψb are locally in Lp(P) for every p∈(0,1)i.e.
∀M, b∈(0,∞),
B(0,M)
ψp
bdP <+∞.
Proof. LetM, b∈(0,∞) and set A= supp(P). Then maxx∈B(0,M)∩Ad(x, αn)→0 (see [4]) and hence C(M) := sup
n≥1 max
x∈B(0,M)∩Ad(x, αn)<+∞.
One derives that
B(x, bd(x, αn))⊂B(0, bC(M) +M)
for every x∈B(0, M)∩A, n∈N. Define the Hardy-Littlewood maximal functionϕ: Rd →R+∪ {∞}with respect to the finite measuresλd(· ∩B(0, bC(M) +M)) andP by
ϕ(x) =ϕb,M(x) := sup
ρ>0
λd(B(x, ρ)∩B(0, bC(M) +M))
P(B(x, ρ)) ·
Then
ψb(x)≤ϕ(x)
for everyx∈B(0, M)∩A. From the Besicovitch covering theorem follows the maximal inequality P(ϕ > ρ)≤ C1λd(B(0, bC(M) +M))
ρ
for everyρ >0 where the finite constant C1 only depends on dand the underlying norm. (See [10], Th. 2.19.
The result in [10] is stated for Euclidean norms but it obviously extends to arbitrary norms since any two norm onRd are equivalent.) Consequently,
B(0,M)∩A
ψpbdP ≤
ϕpdP = ∞
0
P(ϕp> t)dt≤1 + ∞
1
P(ϕp> t)dt
≤ 1 +C2 ∞
1
t−1/pdt <+∞
whereC2=C1λd(B(0, bC(M) +M).