DOI: 10.1051/ps:2008026, www.esaim-ps.org

ASYMPTOTICALLY OPTIMAL QUANTIZATION SCHEMES FOR GAUSSIAN PROCESSES ON HILBERT SPACES

Harald Luschgy^1, Gilles Pagès^2 and Benedikt Wilbertz^2

Abstract. We describe quantization designs which lead to asymptotically and order optimal functional quantizers for Gaussian processes in a Hilbert space setting. Regular variation of the eigenvalues of the covariance operator plays a crucial role to achieve these rates. For the development of a constructive quantization scheme we rely on the knowledge of the eigenvectors of the covariance operator in order to transform the problem into a finite dimensional quantization problem of normal distributions.

Mathematics Subject Classification. 60G15, 60E99.

Received November 22, 2007. Revised May 5, 2008.

1. Introduction

Functional quantization of stochastic processes can be seen as a discretization of the path-space of a process and the approximation (coding) of a process by finitely many deterministic functions from its path-space. In a Hilbert space setting this reads as follows.

Let $(H,\langle\cdot,\cdot\rangle)$ be a separable Hilbert space with norm $\|\cdot\|$ and let $X : (\Omega,\mathcal A,\mathbb P)\to H$ be a random vector taking its values in $H$ with distribution $\mathbb P_X$. For $n\in\mathbb N$, the $L^2$-quantization problem for $X$ of level $n$ (or of nat-level $\log n$) consists in minimizing

$$\Big(\mathbb E\min_{a\in\alpha}\|X-a\|^2\Big)^{1/2} = \Big\|\min_{a\in\alpha}\|X-a\|\Big\|_{L^2(\mathbb P)}$$

over all subsets $\alpha\subset H$ with $\mathrm{card}(\alpha)\le n$. Such a set $\alpha$ is called $n$-codebook or $n$-quantizer. The minimal $n$th quantization error of $X$ is then defined by

$$e_n(X) := \inf\Big\{\Big(\mathbb E\min_{a\in\alpha}\|X-a\|^2\Big)^{1/2} : \alpha\subset H,\ \mathrm{card}(\alpha)\le n\Big\}. \tag{1.1}$$

Keywords and phrases. Functional quantization, Gaussian process, Brownian motion, Riemann-Liouville process, optimal quantizer.

This work was supported in part by the AMaMeF Exchange Grant 1323 of the ESF.

1 Universität Trier, FB IV-Mathematik, 54286 Trier, Germany; luschgy@uni-trier.de; wilbertz@uni-trier.de

2 Laboratoire de Probabilités et Modèles Aléatoires, UMR 7599, Université Paris 6, Case Courrier 188, 4 place Jussieu, 75252 Paris Cedex 05, France; gpa@ccr.jussieu.fr

Article published by EDP Sciences © EDP Sciences, SMAI 2010


Under the integrability condition
$$\mathbb E\|X\|^2 < \infty \tag{1.2}$$
the quantity $e_n(X)$ is finite.

For a given $n$-quantizer $\alpha$ one defines an associated closest neighbour projection
$$\pi_\alpha := \sum_{a\in\alpha} a\,\mathbf 1_{C_a(\alpha)}$$
and the induced $\alpha$-quantization (Voronoi quantization) of $X$ by
$$\hat X^\alpha := \pi_\alpha(X), \tag{1.3}$$
where $\{C_a(\alpha) : a\in\alpha\}$ is a Voronoi partition induced by $\alpha$, that is a Borel partition of $H$ satisfying
$$C_a(\alpha)\subset V_a(\alpha) := \{x\in H : \|x-a\| = \min_{b\in\alpha}\|x-b\|\} \tag{1.4}$$
for every $a\in\alpha$. Then one easily checks that, for any random vector $\hat X : \Omega\to\alpha\subset H$,
$$\mathbb E\|X-\hat X\|^2 \ge \mathbb E\|X-\hat X^\alpha\|^2 = \mathbb E\min_{a\in\alpha}\|X-a\|^2$$
so that finally
$$e_n(X) = \inf\Big\{(\mathbb E\|X-\hat X\|^2)^{1/2} : \hat X = f(X),\ f : H\to H \text{ Borel measurable},\ \mathrm{card}(f(H))\le n\Big\} \tag{1.5}$$
$$= \inf\Big\{(\mathbb E\|X-\hat X\|^2)^{1/2} : \hat X : \Omega\to H \text{ random vector},\ \mathrm{card}(\hat X(\Omega))\le n\Big\}.$$

Observe that the Voronoi cells $V_a(\alpha)$, $a\in\alpha$, are closed and convex (convexity is a characteristic feature of the underlying Hilbert structure). Note further that there are infinitely many $\alpha$-quantizations of $X$ which all produce the same quantization error, and that $\hat X^\alpha$ is $\mathbb P$-a.s. uniquely defined if $\mathbb P_X$ vanishes on hyperplanes.
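To fix ideas, here is a minimal numerical sketch (not from the paper; all names are illustrative) of the nearest-neighbour projection $\pi_\alpha$ and of a Monte Carlo estimate of the distortion $\mathbb E\min_{a\in\alpha}\|X-a\|^2$ for a finite codebook in $\mathbb R^d$, viewed as a stand-in for $H$:

```python
import numpy as np

def voronoi_quantize(samples, codebook):
    """Nearest-neighbour projection pi_alpha: map each sample to its closest codeword.

    samples  : (N, d) array of realizations of X
    codebook : (n, d) array, the n-quantizer alpha
    returns  : indices (N,) and the quantized points (N, d)
    """
    d2 = ((samples[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=2)
    idx = d2.argmin(axis=1)
    return idx, codebook[idx]

def empirical_distortion(samples, codebook):
    """Monte Carlo estimate of E min_{a in alpha} ||X - a||^2."""
    d2 = ((samples[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=2)
    return d2.min(axis=1).mean()

# toy example: an 8-point codebook for a 2-dimensional standard Gaussian
rng = np.random.default_rng(0)
X = rng.standard_normal((10_000, 2))
alpha = rng.standard_normal((8, 2))
print(empirical_distortion(X, alpha))
```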

A typical setting for functional quantization is $H = L^2([0,1],\mathrm dt)$, but the problem is obviously not restricted to the Hilbert space setting. Functional quantization is the natural extension to stochastic processes of the so-called optimal vector quantization of random vectors in $H = \mathbb R^d$, which has been extensively investigated since the late 1940's in Signal Processing and Information Theory (see [4,7]). For the mathematical aspects of vector quantization in $\mathbb R^d$, one may consult [5], for algorithmic aspects see [16], and "non-classical" applications can be found in [14,15]. For a first promising application of functional quantization to the pricing of financial derivatives through numerical integration on path-spaces see [17].

We address the issue of high-resolution quantization, which concerns the performance of $n$-quantizers and the behaviour of $e_n(X)$ as $n\to\infty$. The asymptotics of $e_n(X)$ for $\mathbb R^d$-valued random vectors has been completely elucidated for non-singular distributions $\mathbb P_X$ by the Zador theorem (see [5]) and for a class of self-similar (singular) distributions by [6]. In infinite dimensions no such global results hold, even for Gaussian processes.

It is convenient to use the symbols $\sim$ and $\lesssim$, where $a_n\sim b_n$ means $a_n/b_n\to1$ and $a_n\lesssim b_n$ means $\limsup_{n\to\infty}a_n/b_n\le1$. A measurable function $\varphi : (s,\infty)\to(0,\infty)$ ($s\ge0$) is said to be regularly varying at infinity with index $b\in\mathbb R$ if, for every $c>0$,
$$\lim_{x\to\infty}\frac{\varphi(cx)}{\varphi(x)} = c^b.$$


Now let $X$ be centered Gaussian. Denote by $K_X\subset H$ the reproducing kernel Hilbert space (Cameron-Martin space) associated to the covariance operator
$$C_X : H\to H,\qquad C_X y := \mathbb E(\langle y,X\rangle X) \tag{1.6}$$
of $X$. Let $\lambda_1\ge\lambda_2\ge\ldots>0$ be the ordered nonzero eigenvalues of $C_X$ and let $\{u_j : j\ge1\}$ be the corresponding orthonormal basis of $\mathrm{supp}(\mathbb P_X)$ consisting of eigenvectors (Karhunen-Loève basis). If $d := \dim K_X<\infty$, then $e_n(X) = e_n\big(\bigotimes_{j=1}^d N(0,\lambda_j)\big)$, the minimal $n$th $L^2$-quantization error of $\bigotimes_{j=1}^d N(0,\lambda_j)$ with respect to the $l^2$-norm on $\mathbb R^d$, and thus we can read off the asymptotic behaviour of $e_n(X)$ from the high-resolution formula
$$e_n\Big(\bigotimes_{j=1}^d N(0,\lambda_j)\Big) \sim q(d)\,\sqrt{2\pi}\,\Big(\prod_{j=1}^d\lambda_j\Big)^{1/2d}\Big(\frac{d+2}{d}\Big)^{(d+2)/4} n^{-1/d}\quad\text{as } n\to\infty, \tag{1.7}$$

where $q(d)\in(0,\infty)$ is a constant depending only on the dimension $d$ (see [5]). Except in dimensions $d=1$ and $d=2$, the true value of $q(d)$ is unknown. However, one knows (see [5]) that
$$q(d)\sim\Big(\frac{d}{2\pi e}\Big)^{1/2}\quad\text{as } d\to\infty. \tag{1.8}$$

Assume $\dim K_X = \infty$. Under regular behaviour of the eigenvalues the sharp asymptotics of $e_n(X)$ can be derived analogously to (1.7). In view of (1.8) it is reasonable to expect that the limiting constants can be evaluated. The recent high-resolution formula is as follows.

Theorem 1.1 ([11]). Let $X$ be centered Gaussian. Assume $\lambda_j\sim\varphi(j)$ as $j\to\infty$, where $\varphi : (s,\infty)\to(0,\infty)$ is a decreasing, regularly varying function at infinity of index $-b<-1$ for some $s\ge0$. Set, for every $x>s$,
$$\psi(x) := \frac{1}{x\,\varphi(x)}\cdot$$
Then
$$e_n(X)\sim\Big(\Big(\frac b2\Big)^{b-1}\frac{b}{b-1}\Big)^{1/2}\psi(\log n)^{-1/2}\quad\text{as } n\to\infty.$$

A high-resolution formula in the case $b=1$ is also available (see [11]). Note that the restriction $-b\le-1$ on the index of $\varphi$ is natural since $\sum_{j=1}^\infty\lambda_j<\infty$. The minimal $L^r$-quantization errors of $X$, $0<r<\infty$, are strongly equivalent to the $L^2$-errors $e_n(X)$ (see [3]) and thus exhibit the same high-resolution behaviour.
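As a numerical illustration (a sketch, not part of the paper's argument), one can evaluate the right-hand side of Theorem 1.1 for the Brownian motion on $[0,T]$, whose eigenvalues $\lambda_j = (T/(\pi(j-1/2)))^2$ are recalled in Section 3.3; there $b=2$, $\varphi(x) = (T/\pi)^2x^{-2}$, and the prediction reduces to $e_n(W)^2\approx 2T^2/(\pi^2\log n)$:

```python
import math

def en_squared_prediction(n, b, phi):
    """Squared rate of Theorem 1.1: (b/2)^(b-1) * b/(b-1) * psi(log n)^(-1),
    with psi(x) = 1/(x * phi(x))."""
    x = math.log(n)
    psi_inv = x * phi(x)                      # 1 / psi(log n)
    return (b / 2.0) ** (b - 1) * b / (b - 1) * psi_inv

# Brownian motion on [0, T]: phi(x) = (T/pi)^2 * x^(-2), regularity index b = 2
T = 1.0
phi = lambda x: (T / math.pi) ** 2 / x ** 2
for n in (10, 100, 1000, 10_000):
    print(n, en_squared_prediction(n, 2.0, phi), 2 * T ** 2 / (math.pi ** 2 * math.log(n)))
```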

The paper is organized as follows. In Section 2 we investigate a new quantization design which furnishes asymptotically optimal quantizers in the situation of Theorem 1.1. Here the Karhunen-Loève expansion plays a crucial role. We also provide a lower bound for the dimension problem. The dimension problem, which is explained at the end of Section 3, seems to be one of the hardest unsolved problems in functional quantization of stochastic processes. In Section 3 we state different quantization designs, which are all at least order-optimal, and discuss their possible implementations regarding the example of the Brownian motion and the class of Riemann-Liouville processes. Several proposed numerical schemes are new. The main focus in that section lies on "good" designs for finite $n\in\mathbb N$.


2. Asymptotically optimal functional quantizers

Let $X$ be an $H$-valued random vector satisfying (1.2). For every $n\in\mathbb N$, $L^2$-optimal $n$-quantizers $\alpha\subset H$ exist, that is,
$$\Big(\mathbb E\min_{a\in\alpha}\|X-a\|^2\Big)^{1/2} = e_n(X)$$
(see [10]). If $\mathrm{card}(\mathrm{supp}(\mathbb P_X))\ge n$, optimal $n$-quantizers $\alpha$ satisfy $\mathrm{card}(\alpha) = n$, $\mathbb P(X\in C_a(\alpha))>0$ and the stationarity condition
$$a = \mathbb E(X\mid\{X\in C_a(\alpha)\}),\quad a\in\alpha, \tag{2.1}$$
or, what is the same,
$$\hat X^\alpha = \mathbb E(X\mid\hat X^\alpha) \tag{2.2}$$
for every Voronoi partition $\{C_a(\alpha) : a\in\alpha\}$ (see [10]). In particular, $\mathbb E\hat X^\alpha = \mathbb E X$.

Now let $X$ be centered Gaussian with $\dim K_X = \infty$. The Karhunen-Loève basis $\{u_j : j\ge1\}$ consisting of normalized eigenvectors of $C_X$ is optimal for the quantization of Gaussian random vectors (see [10]). So we start with the Karhunen-Loève expansion
$$X \overset{H}{=} \sum_{j=1}^\infty\lambda_j^{1/2}\xi_j u_j,$$
where $\xi_j = \langle X,u_j\rangle/\lambda_j^{1/2}$, $j\ge1$, are i.i.d. $N(0,1)$-distributed random variables. The design of an asymptotically optimal quantization of $X$ is based on optimal quantizing blocks of coefficients of variable ($n$-dependent) block length. Let $n\in\mathbb N$ and fix temporarily $m,l,n_1,\ldots,n_m\in\mathbb N$ with $\prod_{j=1}^m n_j\le n$, where $m$ denotes the number of blocks, $l$ the block length and $n_j$ the size of the quantizer for the $j$th block
$$\xi^{(j)} := (\xi_{(j-1)l+1},\ldots,\xi_{jl}),\quad j\in\{1,\ldots,m\}.$$
Let $\alpha_j\subset\mathbb R^l$ be an $L^2$-optimal $n_j$-quantizer for $\xi^{(j)}$ and let $\hat\xi^{(j)} = \hat\xi^{(j),\alpha_j}$ be an $\alpha_j$-quantization of $\xi^{(j)}$. (Quantization of the blocks $\xi^{(j)}$ instead of $(\lambda_{(j-1)l+1}^{1/2}\xi_{(j-1)l+1},\ldots,\lambda_{jl}^{1/2}\xi_{jl})$ is asymptotically good enough. For finite $n$ the quantization scheme will be considerably improved in Sect. 3.) Then, define a quantized version of $X$ by
$$\hat X^n := \sum_{j=1}^m\sum_{k=1}^l\lambda_{(j-1)l+k}^{1/2}\,(\hat\xi^{(j)})_k\,u_{(j-1)l+k}. \tag{2.3}$$

It is clear that $\mathrm{card}(\hat X^n(\Omega))\le n$. Using (2.2) for $\xi^{(j)}$, one gets $\mathbb E\hat X^n = 0$. If
$$\hat\xi^{(j)} = \sum_{b\in\alpha_j} b\,\mathbf 1_{C_b(\alpha_j)}(\xi^{(j)}),$$
then
$$\hat X^n = \sum_{a\in\times_{j=1}^m\alpha_j}\Big(\sum_{j=1}^m\sum_{k=1}^l\lambda_{(j-1)l+k}^{1/2}\,a^{(j)}_k\,u_{(j-1)l+k}\Big)\prod_{j=1}^m\mathbf 1_{C_{a^{(j)}}(\alpha_j)}(\xi^{(j)}),$$
where $a = (a^{(1)},\ldots,a^{(m)})\in\times_{j=1}^m\alpha_j$. Observe that in general $\hat X^n$ is not a Voronoi quantization of $X$, since it is based on the (less complicated) Voronoi partitions for $\xi^{(j)}$, $j\le m$. ($\hat X^n$ is a Voronoi quantization if $l=1$ or if $\lambda_{(j-1)l+1}=\ldots=\lambda_{jl}$ for every $j$.) Using again (2.2) for $\xi^{(j)}$ and the independence structure, one checks that $\hat X^n$ satisfies a kind of stationarity equation:
$$\mathbb E(X\mid\hat X^n) = \hat X^n.$$


Lemma 2.1. Let $n\ge1$. For every $l\ge1$ and every $m\ge1$,
$$\mathbb E\|X-\hat X^n\|^2 \le \sum_{j=1}^m\lambda_{(j-1)l+1}\,e_{n_j}(N(0,I_l))^2 + \sum_{j\ge ml+1}\lambda_j. \tag{2.4}$$
Furthermore, (2.4) holds with equality if $l=1$ (or if $\lambda_{(j-1)l+1}=\ldots=\lambda_{jl}$ for every $j$, $l\ge1$).

Proof. The claim follows from the orthonormality of the basis $\{u_j : j\ge1\}$. We have
$$\mathbb E\|X-\hat X^n\|^2 = \sum_{j=1}^m\sum_{k=1}^l\lambda_{(j-1)l+k}\,\mathbb E\,|\xi^{(j)}_k-(\hat\xi^{(j)})_k|^2 + \sum_{j\ge ml+1}\lambda_j$$
$$\le \sum_{j=1}^m\lambda_{(j-1)l+1}\sum_{k=1}^l\mathbb E\,|\xi^{(j)}_k-(\hat\xi^{(j)})_k|^2 + \sum_{j\ge ml+1}\lambda_j$$
$$= \sum_{j=1}^m\lambda_{(j-1)l+1}\,e_{n_j}(\xi^{(j)})^2 + \sum_{j\ge ml+1}\lambda_j. \qquad\square$$

Set
$$C(l) := \sup_{k\ge1}k^{2/l}\,e_k(N(0,I_l))^2. \tag{2.5}$$
By (1.7), $C(l)<\infty$. For every $l\in\mathbb N$,
$$e_{n_j}(N(0,I_l))^2 \le n_j^{-2/l}\,C(l). \tag{2.6}$$
Then one may replace the optimization problem which consists, for fixed $n$, in minimizing the right hand side of Lemma 2.1 by the following optimal allocation problem:
$$\min\Big\{C(l)\sum_{j=1}^m\lambda_{(j-1)l+1}\,n_j^{-2/l} + \sum_{j\ge ml+1}\lambda_j : m,l,n_1,\ldots,n_m\in\mathbb N,\ \prod_{j=1}^m n_j\le n\Big\}. \tag{2.7}$$
Set
$$m = m(n,l) := \max\Big\{k\ge1 : n^{1/k}\lambda_{(k-1)l+1}^{l/2}\Big(\prod_{j=1}^k\lambda_{(j-1)l+1}\Big)^{-l/2k}\ge1\Big\}, \tag{2.8}$$
$$n_j = n_j(n,l) := \Big[n^{1/m}\lambda_{(j-1)l+1}^{l/2}\Big(\prod_{i=1}^m\lambda_{(i-1)l+1}\Big)^{-l/2m}\Big],\quad j\in\{1,\ldots,m\}, \tag{2.9}$$
where $[x]$ denotes the integer part of $x\in\mathbb R$, and
$$l = l_n := [(\max\{1,\log n\})^\vartheta],\quad\vartheta\in(0,1). \tag{2.10}$$
In the following theorem it is demonstrated that this choice is at least asymptotically optimal provided the eigenvalues are regularly varying.

Theorem 2.2. Assume the situation of Theorem 1.1. Consider $\hat X^n$ with tuning parameters defined in (2.8)–(2.10). Then $\hat X^n$ is asymptotically $n$-optimal, i.e.
$$(\mathbb E\|X-\hat X^n\|^2)^{1/2}\sim e_n(X)\quad\text{as } n\to\infty.$$

Note that no block quantizer with fixed block length is asymptotically optimal (see [11]). As mentioned above, $\hat X^n$ is not a Voronoi quantization of $X$. If $\alpha_n := \hat X^n(\Omega)$, then the Voronoi quantization $\hat X^{\alpha_n}$ is clearly also asymptotically $n$-optimal.
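As an aside (an illustrative sketch, not the authors' code), the tuning rules (2.8)–(2.10) used in Theorem 2.2 translate directly into a few lines, written here in log-space to avoid underflow and evaluated on the Brownian-motion eigenvalues of Section 3.3 with the hypothetical choice $\vartheta = 0.5$:

```python
import math

def block_length(n, theta=0.5):
    """Rule (2.10): l_n = [(max{1, log n})^theta] for some theta in (0,1)."""
    return max(1, int(max(1.0, math.log(n)) ** theta))

def number_of_blocks(n, l, lam, k_max=100):
    """Rule (2.8): the largest k with
    n^(1/k) * lambda_{(k-1)l+1}^(l/2) * (prod_{j<=k} lambda_{(j-1)l+1})^(-l/(2k)) >= 1.
    lam is 0-indexed, i.e. lam[j-1] = lambda_j."""
    best = 1
    for k in range(1, k_max + 1):
        log_lead = [math.log(lam[(j - 1) * l]) for j in range(1, k + 1)]
        log_val = (math.log(n) / k + (l / 2.0) * log_lead[-1]
                   - (l / (2.0 * k)) * sum(log_lead))
        if log_val >= 0.0:
            best = k
    return best

def block_sizes(n, l, m, lam):
    """Rule (2.9): n_j = [n^(1/m) * lambda_{(j-1)l+1}^(l/2) * (prod_i lambda_{(i-1)l+1})^(-l/(2m))]."""
    log_lead = [math.log(lam[(i - 1) * l]) for i in range(1, m + 1)]
    log_norm = -(l / (2.0 * m)) * sum(log_lead)
    return [int(math.exp(math.log(n) / m + (l / 2.0) * v + log_norm)) for v in log_lead]

# Brownian motion eigenvalues lambda_j = (1/(pi*(j-1/2)))^2 (T = 1)
lam = [(1.0 / (math.pi * (j - 0.5))) ** 2 for j in range(1, 500)]
n = 10_000
l = block_length(n)
m = number_of_blocks(n, l, lam)
print(l, m, block_sizes(n, l, m, lam))
```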


The key property for the proof is the following $l$-asymptotics of the constants $C(l)$ defined in (2.5). It is interesting to consider also the smaller constants
$$Q(l) := \lim_{k\to\infty}k^{2/l}\,e_k(N(0,I_l))^2 \tag{2.11}$$
(see (1.7)).

Proposition 2.3. The sequences $(C(l))_{l\ge1}$ and $(Q(l))_{l\ge1}$ satisfy
$$\lim_{l\to\infty}\frac{C(l)}{l} = \lim_{l\to\infty}\frac{Q(l)}{l} = \inf_{l\ge1}\frac{C(l)}{l} = \inf_{l\ge1}\frac{Q(l)}{l} = 1.$$

Proof. From [11] it is known that
$$\liminf_{l\to\infty}\frac{C(l)}{l} = 1. \tag{2.12}$$
Furthermore, it follows immediately from (1.7) and (1.8) that
$$\lim_{l\to\infty}\frac{Q(l)}{l} = 1. \tag{2.13}$$
(The proof of the existence of $\lim_{l\to\infty}C(l)/l$ we owe to S. Dereich.) For $l_0,l\in\mathbb N$ with $l\ge l_0$, write $l = n\,l_0 + m$ with $n\in\mathbb N$, $m\in\{0,\ldots,l_0-1\}$. Since for every $k\in\mathbb N$,
$$[k^{l_0/l}]^n\,[k^{1/l}]^m\le k,$$
one obtains, by a block-quantizer design consisting of $n$ blocks of length $l_0$ and $m$ blocks of length 1 for quantizing $N(0,I_l)$,
$$e_k(N(0,I_l))^2 \le n\,e_{[k^{l_0/l}]}(N(0,I_{l_0}))^2 + m\,e_{[k^{1/l}]}(N(0,1))^2. \tag{2.14}$$
This implies
$$C(l) \le n\,C(l_0)\sup_{k\ge1}\frac{k^{2/l}}{[k^{l_0/l}]^{2/l_0}} + m\,C(1)\sup_{k\ge1}\frac{k^{2/l}}{[k^{1/l}]^2} \le 4^{1/l_0}\,n\,C(l_0) + 4\,m\,C(1).$$
Consequently, using $n/l\le1/l_0$,
$$\frac{C(l)}{l} \le 4^{1/l_0}\frac{C(l_0)}{l_0} + \frac{4mC(1)}{l}$$
and hence
$$\limsup_{l\to\infty}\frac{C(l)}{l} \le 4^{1/l_0}\frac{C(l_0)}{l_0}\cdot$$
This yields
$$\limsup_{l\to\infty}\frac{C(l)}{l} \le \liminf_{l_0\to\infty}\frac{C(l_0)}{l_0} = 1. \tag{2.15}$$
It follows from (2.14) that
$$Q(l) \le n\,Q(l_0) + m\,Q(1).$$
Consequently,
$$\frac{Q(l)}{l} \le \frac{Q(l_0)}{l_0} + \frac{mQ(1)}{l}$$


and therefore
$$1 = \lim_{l\to\infty}\frac{Q(l)}{l} \le \frac{Q(l_0)}{l_0}\cdot$$
This implies
$$\inf_{l_0\ge1}\frac{Q(l_0)}{l_0} = 1. \tag{2.16}$$
Since $Q(l)\le C(l)$, the proof is complete. $\square$

The $n$-asymptotics of the number $m(n,l_n)\,l_n$ of quantized coefficients in the Karhunen-Loève expansion in the quantization $\hat X^n$ is as follows.

Lemma 2.4 ([12], Lem. 4.8). Assume the situation of Theorem 1.1. Let $m(n,l_n)$ be defined by (2.8) and (2.10). Then
$$m(n,l_n)\,l_n\sim\frac{2\log n}{b}\quad\text{as } n\to\infty.$$

Proof of Theorem 2.2. For every $n\in\mathbb N$,
$$\sum_{j=1}^m\lambda_{(j-1)l+1}\,n_j^{-2/l} = \sum_{j=1}^m\lambda_{(j-1)l+1}(n_j+1)^{-2/l}\Big(\frac{n_j+1}{n_j}\Big)^{2/l} \le 4^{1/l}\,m\,n^{-2/ml}\Big(\prod_{j=1}^m\lambda_{(j-1)l+1}\Big)^{1/m} \le 4^{1/l}\,m\,\lambda_{(m-1)l+1}.$$
Therefore, by Lemma 2.1 and (2.6),
$$\mathbb E\|X-\hat X^n\|^2 \le 4^{1/l}\,\frac{C(l)}{l}\,m\,l\,\lambda_{(m-1)l+1} + \sum_{j\ge ml+1}\lambda_j$$
for every $n\in\mathbb N$. By Lemma 2.4, we have
$$ml = m(n,l_n)\,l_n\sim\frac{2\log n}{b}\quad\text{as } n\to\infty.$$
Consequently, using regular variation at infinity with index $-b<-1$ of the function $\varphi$,
$$ml\,\lambda_{(m-1)l+1}\sim ml\,\lambda_{ml}\sim\Big(\frac2b\Big)^{1-b}\psi(\log n)^{-1}$$
and
$$\sum_{j\ge ml+1}\lambda_j\sim\frac{ml\,\varphi(ml)}{b-1}\sim\frac1{b-1}\Big(\frac2b\Big)^{1-b}\psi(\log n)^{-1}\quad\text{as } n\to\infty,$$
where, as in Theorem 1.1, $\psi(x) = 1/(x\varphi(x))$. Since by Proposition 2.3,
$$\lim_{n\to\infty}\frac{4^{1/l_n}\,C(l_n)}{l_n} = 1,$$
one concludes
$$\mathbb E\|X-\hat X^n\|^2\lesssim\Big(\frac2b\Big)^{1-b}\frac{b}{b-1}\,\psi(\log n)^{-1}\quad\text{as } n\to\infty.$$
The assertion follows from Theorem 1.1. $\square$


Let us briefly comment on the true dimension of the problem.

For $n\in\mathbb N$, let $\mathcal C_n(X)$ be the (nonempty) set of all $L^2$-optimal $n$-quantizers. We introduce the integral number
$$d_n(X) := \min\{\dim\mathrm{span}(\alpha) : \alpha\in\mathcal C_n(X)\}. \tag{2.17}$$
It represents the dimension at level $n$ of the functional quantization problem for $X$. Here $\mathrm{span}(\alpha)$ denotes the linear subspace of $H$ spanned by $\alpha$. In view of Lemma 2.4, a reasonable conjecture for Gaussian random vectors is $d_n(X)\sim2\log n/b$ in regular cases, where $-b$ is the regularity index. We have at least the following lower estimate in the Gaussian case.

Proposition 2.5. Assume the situation of Theorem 1.1. Then
$$d_n(X)\gtrsim\frac{1}{b^{1/(b-1)}}\,\frac{2\log n}{b}\quad\text{as } n\to\infty.$$

Proof. For every $n\in\mathbb N$, we have
$$d_n(X) = \min\Big\{k\ge0 : e_n\Big(\bigotimes_{j=1}^k N(0,\lambda_j)\Big)^2 + \sum_{j\ge k+1}\lambda_j\le e_n(X)^2\Big\} \tag{2.18}$$
(see [10]). Define
$$c_n := \min\Big\{k\ge0 : \sum_{j\ge k+1}\lambda_j\le e_n(X)^2\Big\}.$$
Clearly, $c_n$ increases to infinity as $n\to\infty$ and, by (2.18), $c_n\le d_n(X)$ for every $n\in\mathbb N$. Using Theorem 1.1 and the fact that $\psi$ is regularly varying at infinity with index $b-1$, we obtain
$$((b-1)\,\psi(c_n))^{-1}\sim\sum_{j\ge c_n+1}\lambda_j\sim e_n(X)^2\sim\Big(\frac2b\Big)^{1-b}\frac{b}{b-1}\,\psi(\log n)^{-1}$$
and thus
$$\psi(c_n)\sim\Big(\frac2b\Big)^{b-1}\frac1b\,\psi(\log n)\sim\psi\Big(\frac{1}{b^{1/(b-1)}}\,\frac{2\log n}{b}\Big)$$
as $n\to\infty$. Consequently,
$$c_n\sim\frac{1}{b^{1/(b-1)}}\,\frac{2\log n}{b}\quad\text{as } n\to\infty.$$
This yields the assertion. $\square$


3. Quantizer designs and applications

In this section we are no longer interested only in asymptotically optimal quantizers of a Gaussian process $X$, but rather in truly optimal or at least locally optimal quantizers for finite $n\in\mathbb N$.

As soon as the Karhunen-Loève basis $(u_j)_{j\ge1}$ and the corresponding eigenvalues $(\lambda_j)_{j\ge1}$ of the Gaussian process $X$ are known, it is possible to transform the quantization problem for $X$ in $H$ into the quantization of $\bigotimes_{j=1}^\infty N(0,\lambda_j)$ on $l^2$ by the isometry
$$S : H\to l^2,\quad x\mapsto(\langle u_j,x\rangle)_{j\ge1}$$
and its inverse
$$S^{-1} : (l^2,\langle\cdot,\cdot\rangle_{l^2})\to(H,\langle\cdot,\cdot\rangle),\quad l\mapsto\sum_{j\ge1}l_j u_j. \tag{3.1}$$
The transformed problem then allows, as we will see later on, direct access by vector quantization methods.

We may focus on the quantization problem of the Gaussian random vector
$$\zeta := S(X)$$
on $l^2$ with distribution
$$\zeta = (\zeta_j)_{j\ge1}\sim\bigotimes_{j=1}^\infty N(0,\lambda_j)$$
for the eigenvalues $(\lambda_j)_{j\ge1}$ of $C_X$. Note that in this case $(\lambda_j)_{j\ge1}$ also become the eigenvalues of the covariance operator $C_\zeta$.

3.1. Optimal quantization of $\bigotimes_{j=1}^\infty N(0,\lambda_j)$

Since an infinite-dimensional quantization problem cannot, without modification, be solved by a finite computer algorithm, we have to reduce the dimension of the problem somehow.

Assume $\alpha$ to be an optimal $n$-quantizer for $\bigotimes_{j=1}^\infty N(0,\lambda_j)$; then $U := \mathrm{span}(\alpha)$ is a subspace of $l^2$ with dimension $d_n = \dim U\le n-1$. Consequently, there exist $d_n$ orthonormal vectors in $l^2$ such that $\mathrm{span}(u_1,\ldots,u_{d_n}) = U$. Theorem 3.1 in [10] now states that this orthonormal basis of $U$ can be constructed from eigenvectors of $C_\zeta$ corresponding to the $d_n$ largest eigenvalues. To be more precise, we get
$$e_n^2\Big(\bigotimes_{j=1}^\infty N(0,\lambda_j)\Big) = e_n^2\Big(\bigotimes_{j=1}^{d_n}N(0,\lambda_j)\Big) + \sum_{j\ge d_n+1}\lambda_j. \tag{3.2}$$
Hence it is sufficient to quantize only the finite-dimensional product measure $\bigotimes_{j=1}^{d_n}N(0,\lambda_j)$ and to fill the remaining quantizer components with zeros.

Therefore we denote by $\zeta^d$ the projection of $\zeta = (\zeta_j)_{j\ge1}$ onto the first $d$ components, i.e. $\zeta^d = (\zeta_1,\ldots,\zeta_d)$.


This approach leads, for some $d\in\mathbb N$, to our first quantizer design.

Quantizer Design I (product quantizer for $\bigotimes_{j=1}^\infty N(0,\lambda_j)$)
Require: an optimal $\bigotimes_{j=1}^d N(0,\lambda_j)$-quantizer $\alpha^d\subset\mathbb R^d$ with $\mathrm{card}(\alpha^d)\le n$.
Quantizer:
$$\alpha^{\mathrm I} := \alpha^d\times\{0\}\times\ldots$$
Quantization:
$$\hat\zeta^{\alpha^{\mathrm I}} = \sum_{a\in\alpha^{\mathrm I}}a\,\mathbf 1_{C_a(\alpha^{\mathrm I})}(\zeta) = (\hat\zeta^{d,\alpha^d},0,\ldots)$$
Distortion:
$$\mathbb E\|\zeta-\hat\zeta^{\alpha^{\mathrm I}}\|^2_{l^2} = e_n^2\Big(\bigotimes_{j=1}^d N(0,\lambda_j)\Big) + \sum_{j\ge d+1}\lambda_j$$

The claim about the distortion of $\hat\zeta^{\alpha^{\mathrm I}}$ becomes immediately evident from the orthonormality of the basis $v_j = (\delta_{ij})_{i\ge1}$ in $l^2$ and
$$\mathbb E\|\zeta-\hat\zeta^{\alpha^{\mathrm I}}\|^2_{l^2} = \mathbb E\Big\|\sum_{j=1}^d\big(\zeta_j-(\hat\zeta^{d,\alpha^d})_j\big)v_j + \sum_{j\ge d+1}\zeta_j v_j\Big\|^2_{l^2} = \mathbb E\sum_{j=1}^d\big(\zeta_j-(\hat\zeta^{d,\alpha^d})_j\big)^2 + \sum_{j\ge d+1}\mathbb E\,\zeta_j^2.$$
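As a hedged numerical illustration of this split (a sketch only; `alpha_d` below is any finite-dimensional codebook, e.g. one produced by the optimization methods of Section 3.2, and the eigenvalue tail is truncated), the distortion of Design I is the finite-dimensional distortion of the first $d$ weighted coordinates plus the tail of eigenvalues:

```python
import numpy as np

def design_I_distortion(alpha_d, lam, n_samples=100_000, seed=0):
    """Estimate E||zeta - zeta_hat||^2 = E min_a sum_{j<=d}(zeta_j - a_j)^2 + sum_{j>d} lambda_j.

    alpha_d : (n, d) codebook for the first d coordinates of zeta ~ prod_j N(0, lambda_j)
    lam     : 1-d array of eigenvalues (the tail sum is truncated to len(lam))
    """
    n, d = alpha_d.shape
    rng = np.random.default_rng(seed)
    zeta_d = rng.standard_normal((n_samples, d)) * np.sqrt(lam[:d])   # zeta_j ~ N(0, lambda_j)
    d2 = ((zeta_d[:, None, :] - alpha_d[None, :, :]) ** 2).sum(axis=2)
    finite_part = d2.min(axis=1).mean()      # Monte Carlo estimate of the e_n^2 term
    tail = lam[d:].sum()                     # sum_{j >= d+1} lambda_j (truncated)
    return finite_part + tail

# Brownian motion eigenvalues (T = 1), truncated for the tail sum
j = np.arange(1, 10_001)
lam = (1.0 / (np.pi * (j - 0.5))) ** 2
alpha_d = np.random.default_rng(1).standard_normal((10, 2)) * np.sqrt(lam[:2])  # a crude, non-optimized codebook
print(design_I_distortion(alpha_d, lam))
```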

Unfortunately, the true value of $d_n$ is only known for $n=2$, where it yields $d_2=1$; from Proposition 2.5 we have the asymptotic lower bound
$$\frac{1}{b^{1/(b-1)}}\,\frac{2\log n}{b}\lesssim d_n\quad\text{as } n\to\infty,$$
whereas the conjecture is that $d_n\sim2\log n/b$.

A numerical approach to this optimal design by means of a stochastic gradient method will be introduced in Section 3.2, where some choices for the block size $d$ with regard to the quantizer size $n$ will also be given. Moreover, numerical results for this design can be found in Table 1.

In addition to this direct quantization design, we want to present some product quantizer designs for $\bigotimes_{j=1}^\infty N(0,\lambda_j)$ which are even tractable by deterministic integration methods and therefore achieve a higher numerical accuracy and stationarity. These product designs furthermore reduce the storage demand for the precomputed quantizers when using functional quantization as cubature formulae, e.g.

To proceed this way, we replace the single quantizer block $\alpha^d$ from quantizer Design I by the Cartesian product of, say, $m$ smaller blocks with maximal dimension $l<d$. We will also refer to the dimension of these blocks as the block length.

Let $l_i$ denote the length of the $i$th block and set
$$k_1 := 0,\qquad k_i := \sum_{\nu=1}^{i-1}l_\nu,\quad i\in\{2,\ldots,m\};$$
then we obtain a decomposition of $\zeta^d$ into
$$\zeta^d = (\zeta^{(1)},\ldots,\zeta^{(m)}),\quad\text{with}\quad\zeta^{(i)} := (\zeta_{k_i+1},\ldots,\zeta_{k_i+l_i}) = (\zeta_{k_i+1},\ldots,\zeta_{k_{i+1}}). \tag{3.3}$$
So we state, for some $l\in\mathbb N$:

Quantizer Design II (product quantizer for $\bigotimes_{j=1}^\infty N(0,\lambda_j)$)
Require: optimal $\bigotimes_{j=k_i+1}^{k_{i+1}}N(0,\lambda_j)$-quantizers $\alpha^{(i)}\subset\mathbb R^{l_i}$ with $\mathrm{card}(\alpha^{(i)})\le n_i$ for some integers $m\in\mathbb N$, $l_1,\ldots,l_m\le l$, $n_1,\ldots,n_m>1$, $\prod_{i=1}^m n_i\le n$ solving the
Block allocation:
$$\sum_{i=1}^m e_{n_i}^2\Big(\bigotimes_{j=k_i+1}^{k_{i+1}}N(0,\lambda_j)\Big) + \sum_{j\ge k_{m+1}+1}\lambda_j\;\longrightarrow\;\min.$$
Quantizer:
$$\alpha^{\mathrm{II}} := \prod_{i=1}^m\alpha^{(i)}\times\{0\}\times\ldots$$
Quantization:
$$\hat\zeta^{\alpha^{\mathrm{II}}} = \sum_{a\in\alpha^{\mathrm{II}}}a\,\mathbf 1_{C_a(\alpha^{\mathrm{II}})}(\zeta) = (\hat\zeta^{(1),\alpha^{(1)}},\ldots,\hat\zeta^{(m),\alpha^{(m)}},0,\ldots)$$
Distortion:
$$\mathbb E\|\zeta-\hat\zeta^{\alpha^{\mathrm{II}}}\|^2_{l^2} = \sum_{i=1}^m e_{n_i}^2\Big(\bigotimes_{j=k_i+1}^{k_{i+1}}N(0,\lambda_j)\Big) + \sum_{j\ge k_{m+1}+1}\lambda_j$$

Note that we do not use the asymptotic block allocation rules for the $n_i$ from (2.9); instead, we perform the block allocation directly on the true distortion of the quantizer blocks rather than on an estimate of it. A minimal allocation sketch is given below.
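As an illustration of that block allocation step (a sketch under the assumption that the block distortions $e_{n_i}^2\big(\bigotimes_j N(0,\lambda_j)\big)$ have already been tabulated from precomputed codebooks; `block_distortion` below is such a hypothetical lookup), an exhaustive search over admissible size vectors could look as follows:

```python
import math

def allocate_blocks(n, lengths, lam, block_distortion):
    """Exhaustive block allocation for Design II (illustrative sketch).

    n                : total quantizer size budget (prod n_i <= n)
    lengths          : chosen block lengths (l_1, ..., l_m)
    lam              : eigenvalue sequence, 0-indexed (lam[j-1] = lambda_j)
    block_distortion : callable (k_i, l_i, n_i) ->
                       e_{n_i}^2( prod_{j=k_i+1}^{k_i+l_i} N(0, lambda_j) ), assumed precomputed
    returns          : best size vector (n_1, ..., n_m) and its total distortion
    """
    m = len(lengths)
    offsets = [sum(lengths[:i]) for i in range(m)]        # k_1, ..., k_m
    tail = sum(lam[sum(lengths):])                        # truncated sum_{j >= k_{m+1}+1} lambda_j
    best = {"sizes": None, "dist": float("inf")}

    def search(i, budget, sizes, acc):
        if i == m:
            if acc + tail < best["dist"]:
                best["sizes"], best["dist"] = tuple(sizes), acc + tail
            return
        ni = 2
        while ni <= budget:                               # enforce prod n_i <= n and n_i > 1
            search(i + 1, budget // ni, sizes + [ni],
                   acc + block_distortion(offsets[i], lengths[i], ni))
            ni += 1

    search(0, n, [], 0.0)
    return best["sizes"], best["dist"]

# tiny demo with a made-up distortion table (for illustration only)
lam = [(1.0 / (math.pi * (j - 0.5))) ** 2 for j in range(1, 200)]
dummy = lambda k, l, s: sum(lam[k:k + l]) / s ** (2.0 / l)    # crude stand-in for e_s^2
print(allocate_blocks(100, [2, 1], lam, dummy))
```

For more than two or three blocks one would of course prune the search or tabulate partial minima instead of enumerating every admissible size vector.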

Next, we weaken our quantizer design and obtain in this way the asymptotically optimal design from Theorem 2.2. In fact, the quantizers used for this scheme are a little more universal, since they do not depend on the position of the block.

The idea is to quantize blocks $\xi^{(i)}\sim N(0,I_{l_i})$ of standard normals $\xi = (\xi_j)_{j\ge1}\sim\bigotimes_{j=1}^\infty N(0,1)$ and to weight the quantizers by
$$\sqrt{\lambda^{(i)}} := \big(\sqrt{\lambda_{k_i+1}},\ldots,\sqrt{\lambda_{k_{i+1}}}\big),\quad i\in\{1,\ldots,m\},$$
that is,
$$\sqrt{\lambda^{(i)}}\otimes\alpha^{(i)} = \Big\{\big(\sqrt{\lambda_{k_i+1}}\,a_{k_i+1},\ldots,\sqrt{\lambda_{k_{i+1}}}\,a_{k_{i+1}}\big) : a = (a_{k_i+1},\ldots,a_{k_{i+1}})\in\alpha^{(i)}\Big\}.$$


The design for some $l\in\mathbb N$ then reads as follows:

Quantizer Design III (product quantizer for $\bigotimes_{j=1}^\infty N(0,\lambda_j)$)
Require: optimal $\bigotimes_{j=k_i+1}^{k_{i+1}}N(0,1)$-quantizers $\alpha^{(i)}\subset\mathbb R^{l_i}$ with $\mathrm{card}(\alpha^{(i)})\le n_i$ for some integers $m\in\mathbb N$, $l_1,\ldots,l_m\le l$, $n_1,\ldots,n_m>1$, $\prod_{i=1}^m n_i\le n$ solving the
Block allocation:
$$\sum_{i=1}^m\sum_{j=k_i+1}^{k_{i+1}}\lambda_j\,\mathbb E\big(\xi_j-(\hat\xi^{(i),\alpha^{(i)}})_j\big)^2 + \sum_{j\ge k_{m+1}+1}\lambda_j\;\longrightarrow\;\min.$$
Quantizer:
$$\alpha^{\mathrm{III}} := \prod_{i=1}^m\sqrt{\lambda^{(i)}}\otimes\alpha^{(i)}\times\{0\}\times\ldots$$
Quantization:
$$\hat\zeta^{\alpha^{\mathrm{III}}} = \sum_{a=(a^{(1)},\ldots,a^{(m)},0,\ldots)\in\alpha^{\mathrm{III}}}a\prod_{i=1}^m\mathbf 1_{C_{a^{(i)}}(\sqrt{\lambda^{(i)}}\otimes\alpha^{(i)})}(\zeta^{(i)})$$
Distortion:
$$\mathbb E\|\zeta-\hat\zeta^{\alpha^{\mathrm{III}}}\|^2_{l^2} = \sum_{i=1}^m\sum_{j=k_i+1}^{k_{i+1}}\lambda_j\,\mathbb E\big(\xi_j-(\hat\xi^{(i),\alpha^{(i)}})_j\big)^2 + \sum_{j\ge k_{m+1}+1}\lambda_j$$

Finally, we state explicitly, for the convenience of the reader, the case $l=1$, for which Designs II and III coincide and which relies only on one-dimensional quantizers of the standard normal distribution. These quantizers can be constructed very easily by a standard Newton algorithm, since the Voronoi cells in dimension one are just simple intervals. This special case corresponds to a direct quantization of the Karhunen-Loève expansion from Section 2 and has been used e.g. in [10,17]. We will also refer to this design as the scalar product quantizer.

Quantizer Design IV (product quantizer for $\bigotimes_{j=1}^\infty N(0,\lambda_j)$)
Require: optimal $N(0,1)$-quantizers $\alpha_j\subset\mathbb R$ with $\mathrm{card}(\alpha_j)\le n_j$ for some integers $m\in\mathbb N$, $n_1,\ldots,n_m>1$, $\prod_{j=1}^m n_j\le n$ solving the
Block allocation:
$$\sum_{j=1}^m\lambda_j\,e_{n_j}^2\big(N(0,1)\big) + \sum_{j\ge m+1}\lambda_j\;\longrightarrow\;\min.$$
Quantizer:
$$\alpha^{\mathrm{IV}} := \prod_{j=1}^m\sqrt{\lambda_j}\,\alpha_j\times\{0\}\times\ldots$$
Quantization:
$$\hat\zeta^{\alpha^{\mathrm{IV}}} = \sum_{a\in\alpha^{\mathrm{IV}}}a\prod_{j=1}^m\mathbf 1_{C_{a_j}(\sqrt{\lambda_j}\,\alpha_j)}(\zeta_j) \overset{d}{=}\big(\sqrt{\lambda_1}\,\hat\xi_1^{\alpha_1},\ldots,\sqrt{\lambda_m}\,\hat\xi_m^{\alpha_m},0,\ldots\big)$$
Distortion:
$$\mathbb E\|\zeta-\hat\zeta^{\alpha^{\mathrm{IV}}}\|^2_{l^2} = \sum_{j=1}^m\lambda_j\,e_{n_j}^2\big(N(0,1)\big) + \sum_{j\ge m+1}\lambda_j$$
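Since the one-dimensional Voronoi cells are simple intervals, the optimal $N(0,1)$ quantizers that Design IV requires can be computed to high precision. The paper mentions a Newton algorithm; the sketch below (illustrative, not the authors' implementation) uses instead the equivalent Lloyd fixed-point iteration, for which interval centroids have a closed form in terms of the normal pdf and cdf:

```python
import math

def _pdf(t):
    return math.exp(-0.5 * t * t) / math.sqrt(2.0 * math.pi)

def _cdf(t):
    return 0.5 * (1.0 + math.erf(t / math.sqrt(2.0)))

def optimal_normal_quantizer(n, iters=500):
    """Stationary (and, in dimension one, optimal) n-quantizer of N(0,1).

    Cells are the intervals (c_{i-1}, c_i) with c_i the midpoints of consecutive codewords;
    each Lloyd step replaces a codeword by the conditional mean of N(0,1) on its cell.
    """
    x = [-2.0 + 4.0 * i / (n - 1) for i in range(n)] if n > 1 else [0.0]
    for _ in range(iters):
        c = [-math.inf] + [0.5 * (x[i] + x[i + 1]) for i in range(n - 1)] + [math.inf]
        x = [(_pdf(c[i]) - _pdf(c[i + 1])) / (_cdf(c[i + 1]) - _cdf(c[i])) for i in range(n)]
    c = [-math.inf] + [0.5 * (x[i] + x[i + 1]) for i in range(n - 1)] + [math.inf]
    probs = [_cdf(c[i + 1]) - _cdf(c[i]) for i in range(n)]
    # for a stationary quantizer: E(xi - xi_hat)^2 = E xi^2 - E xi_hat^2 = 1 - sum p_i x_i^2
    distortion = 1.0 - sum(p * xi * xi for p, xi in zip(probs, x))
    return x, probs, distortion

# e.g. the 5-point codebook of N(0,1) and (an approximation of) e_5(N(0,1))^2
codebook, weights, d2 = optimal_normal_quantizer(5)
print(codebook, d2)
```

The tabulated values $e_{n_j}^2(N(0,1))$ obtained this way are exactly what the block allocation of Design IV consumes.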


Clearly, it follows from the decomposition (3.3) that Design I is optimal as soon as the quantization of $\bigotimes_{j=1}^{d_n}N(0,\lambda_j)$ is optimal. Furthermore, we obtain the proof of the asymptotic optimality of the quantizer Designs II and III from Theorem 2.2 using the tuning parameter
$$l := l_n := [(\max\{1,\log n\})^\theta]\quad\text{for some }\theta\in(0,1), \tag{3.4}$$
i.e.
$$\mathbb E\|\zeta-\hat\zeta^{\alpha^{\mathrm I}}\|^2\le\mathbb E\|\zeta-\hat\zeta^{\alpha^{\mathrm{II}}}\|^2_{l^2}\le\mathbb E\|\zeta-\hat\zeta^{\alpha^{\mathrm{III}}}\|^2_{l^2}\sim\Big(\frac b2\Big)^{b-1}\frac{b}{b-1}\,\psi(\log n)^{-1}\quad\text{as } n\to\infty.$$
Using the same estimates as in the proof of Theorem 2.2 for Design IV, we only get
$$\mathbb E\|\zeta-\hat\zeta^{\alpha^{\mathrm{IV}}}\|^2_{l^2}\lesssim\Big(\frac b2\Big)^{b-1}\frac{4C(1)(b-1)+1}{b-1}\,\psi(\log n)^{-1}, \tag{3.5}$$
so that we can only state that Design IV is rate optimal.

Remark 3.1. Note that if we replace the assumption of optimality for the quantizer blocks by stationarity in Designs I–IV, the resulting quantizers are again stationary (but not necessarily asymptotically optimal).

3.2. Numerical optimization of quadratic functional quantization

Optimization of the (quadratic) quantization of $\mathbb R^d$-valued random vectors has been extensively investigated since the early 1950's, first in one dimension, then in higher dimensions once the cost of numerical Monte Carlo simulation had been drastically cut down (see [4]). Recent applications of optimal vector quantization to numerics turned out to be much more demanding in terms of accuracy. In that direction, one may cite [13,16] (mainly focused on numerical optimization of the quadratic quantization of normal distributions). To apply the methods developed in these papers, it is more convenient to rewrite our optimization problem with respect to the standard $d$-dimensional distribution $N(0,I_d)$ by simply considering the Euclidean norm derived from the covariance matrix $\mathrm{Diag}(\lambda_1,\ldots,\lambda_{d_n})$, i.e.

(quantizer Design I) $n$-optimal quantization of $\bigotimes_{k=1}^{d_n}N(0,1)$ for the covariance norm $|(z_1,\ldots,z_{d_n})|^2 = \sum_{k=1}^{d_n}\lambda_k z_k^2$.

The main point is of course that the dimension $d_n$ is unknown. However (see Fig. 1), one clearly verifies on small values of $n$ that, in the case of the Brownian motion, i.e. $b=2$, the conjecture ($d_n\approx\log n$) is most likely true. Then for higher values of $n$ one relies on it to shift from one dimension to another following the rule $d_n = d$ for $n\in\{e^d,\ldots,e^{d+1}-1\}$.
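For illustration (a sketch of the rule just stated, not code from the paper), this dimension shift is simply:

```python
import math

def design_I_dimension(n):
    """d_n = d for n in {e^d, ..., e^(d+1) - 1}, i.e. d_n = floor(log n), at least 1."""
    return max(1, int(math.floor(math.log(n))))

print([design_I_dimension(n) for n in (1, 5, 10, 50, 100, 500, 1000, 5000, 10000)])
# -> [1, 1, 2, 3, 4, 6, 6, 8, 9], matching the d_n column of Table 1 below
```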

3.2.1. A toolbox for quantization optimization: a short overview

Here is a short overview of stochastic optimization methods to compute optimal or at least locally optimal quantizers in finite dimension. For more details we refer to [16] and the references therein. Let $Z\overset d= N(0;I_d)$ and denote by $D_n^Z(x)$ the distortion function, which is in fact the squared quantization error of a quantizer $x\in H^n = (\mathbb R^d)^n$ in $n$-tuple notation, i.e.
$$D_n^Z : H^n\to\mathbb R,\quad x\mapsto\mathbb E\min_{1\le i\le n}\|Z-x_i\|^2_H.$$

Figure 1. Optimal functional quantization of the Brownian motion: $n\mapsto\log n\,(e_n(W,L^2_T))^2$ for $n\in\{6,\ldots,160\}$ and block sizes $d_n\in\{2,3,4,5\}$. Vertical dashed lines: critical dimensions for $d_n$, $e^2\approx7$, $e^3\approx20$, $e^4\approx55$, $e^5\approx148$.

Competitive Learning Vector Quantization (CLVQ). This procedure is a recursive stochastic approximation gradient descent based on the integral representation of the gradient $\nabla D_n^Z(x)$, $x\in H^n$, of the distortion as the expectation of a local gradient, and on a sequence of i.i.d. random variates, i.e.
$$\forall x\in H^n,\quad\nabla D_n^Z(x) = \mathbb E\big(\nabla D_n^Z(x,Z)\big)\quad\text{with}\quad\nabla D_n^Z(x) = \Big(2\int_{C_i(x)}(x_i-\xi)\,\mathbb P_Z(\mathrm d\xi)\Big)_{1\le i\le n}\ \text{and}\ \nabla D_n^Z(x,Z) = \big(2(x_i-Z)\,\mathbf 1_{C_i(x)}(Z)\big)_{1\le i\le n},$$
so that, starting from $x(0)\in(\mathbb R^d)^n$, one sets
$$\forall k\ge0,\quad x(k+1) = x(k) - \frac{c}{k+1}\,\nabla D_n^Z\big(x(k),Z_{k+1}\big),$$
where $(Z_k)_{k\ge1}$ are i.i.d., $Z_1\overset d= N(0,I_d)$, and $c\in(0,1]$ is a real constant to be tuned. As set, this looks quite formal, but the operating CLVQ procedure consists of two phases at each iteration:

(i) Competitive phase: search for the nearest neighbour $x(k)_{i^*(k+1)}$ of $Z_{k+1}$ among the components $x(k)_i$, $i=1,\ldots,n$ (using a "winning convention" in case of conflict on the boundary of the Voronoi cells).

(ii) Cooperative phase: one moves the winning component towards $Z_{k+1}$ using a dilatation, i.e. $x(k+1)_{i^*(k+1)} = \mathrm{Dilatation}_{Z_{k+1},\,1-\frac{c}{k+1}}\big(x(k)_{i^*(k+1)}\big)$.

This procedure is useful for small or medium values of n. For an extensive study of this procedure, which turns out to be singular in the world of recursive stochastic approximation algorithms, we refer to [14]. For general background on stochastic approximation, we refer to [1,8].
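A compact sketch of the CLVQ iteration just described (illustrative only, not the production code behind the precomputed grids; the step size $c/(k+1)$ and the $N(0,I_d)$ sampler follow the text, the initialization is arbitrary):

```python
import numpy as np

def clvq(n, d, iters=200_000, c=0.5, seed=0):
    """Competitive Learning Vector Quantization for Z ~ N(0, I_d)."""
    rng = np.random.default_rng(seed)
    x = rng.standard_normal((n, d))                    # starting codebook x(0)
    for k in range(iters):
        z = rng.standard_normal(d)                     # fresh sample Z_{k+1}
        i = np.argmin(((x - z) ** 2).sum(axis=1))      # competitive phase: winning index
        x[i] += (c / (k + 1.0)) * (z - x[i])           # cooperative phase: dilatation towards z
    return x

codebook = clvq(n=20, d=2)
```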

The randomized "Lloyd I procedure". This is the randomization of the stationarity-based fixed point procedure, since any optimal quantizer satisfies the stationarity property:
$$\hat Z^{x(k+1)} = \mathbb E\big(Z\mid\hat Z^{x(k)}\big),\quad x(0)\subset\mathbb R^d.$$


At every iteration the conditional expectation $\mathbb E(Z\mid\hat Z^{x(k)})$ is computed using a Monte Carlo simulation. For more details about practical aspects of the Lloyd I procedure we refer to [16]. In [13], an approach based on genetic evolutionary algorithms is developed.

For both procedures, one may substitute a sequence of quasi-random numbers for the usual pseudo-random sequence. This often speeds up the rate of convergence of the method, although this can only be proved (see [9]) for a very specific class of stochastic algorithms (to which CLVQ does not belong).

The most important step to preserve the accuracy of the quantization as $n$ (and $d_n$) increase is to use the so-called splitting method, which finds its origin in the proof of the existence of an optimal $n$-quantizer: once the optimization of a quantization grid of size $n$ is achieved, one specifies the starting grid for the size $n+1$, or more generally $n+\nu$, $\nu\ge1$, by merging the optimized grid of size $n$ resulting from the former procedure with $\nu$ points sampled independently from the normal distribution with probability density proportional to $\varphi^{d/(d+2)}$, where $\varphi$ denotes the p.d.f. of $N(0,I_d)$. This rather unexpected choice is motivated by the fact that this distribution provides the lowest (in average) random quantization error (see [2]).
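Both ingredients are easy to sketch (illustrative code, not the authors' implementation). Note that $\varphi^{d/(d+2)}$ is, up to normalization, the density of $N(0,\tfrac{d+2}{d}I_d)$, so the splitting points can be obtained by scaling standard normal draws:

```python
import numpy as np

def lloyd_step(codebook, samples):
    """One randomized Lloyd I step: replace each codeword by a Monte Carlo estimate
    of E(Z | Z in its Voronoi cell); empty cells are left unchanged."""
    d2 = ((samples[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=2)
    idx = d2.argmin(axis=1)
    new = codebook.copy()
    for i in range(len(codebook)):
        cell = samples[idx == i]
        if len(cell):
            new[i] = cell.mean(axis=0)
    return new

def split(codebook, nu, rng):
    """Splitting method: enlarge an optimized grid by nu points drawn from the density
    proportional to phi^(d/(d+2)), i.e. from N(0, (d+2)/d * I_d)."""
    d = codebook.shape[1]
    return np.vstack([codebook, rng.standard_normal((nu, d)) * np.sqrt((d + 2.0) / d)])

rng = np.random.default_rng(0)
samples = rng.standard_normal((200_000, 2))            # Monte Carlo pool for Z ~ N(0, I_2)
codebook = rng.standard_normal((10, 2))
for _ in range(50):
    codebook = lloyd_step(codebook, samples)            # optimize the size-10 grid
codebook = split(codebook, nu=1, rng=rng)               # starting grid for size 11
```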

As a result, the following can be downloaded from the website [18] devoted to quantization, www.quantize.maths-fi.com:

• Optimized stationary codebooks for $W$: in practice, the $n$-quantizers $\alpha := \alpha^{d_n}$ of the distribution $\bigotimes_{k=1}^{d_n}N(0,\lambda_k)$, $n=1$ up to 10 000 ($d_n$ runs from 1 up to 9).

• Companion parameters:
– distribution of $\hat W^\gamma$: $\mathbb P(\hat W^\gamma = x_i) = \mathbb P(\hat Z^{\alpha^{d_n}} = \alpha_i)$;
– the quadratic quantization error $\|W-\hat W^\gamma\|_{L^2_T}$.

3.3. Application to the Brownian motion on $L^2([0,T],\mathrm dt)$

We present in this subsection numerical results for the above quantizer designs applied to the Brownian motion $W$ on the Hilbert space $\big(L^2([0,T],\mathrm dt),\ \|\cdot\|_{L^2_T}\big)$. Recall that the eigenvalues of $C_W$ read
$$\lambda_j = \Big(\frac{T}{\pi(j-1/2)}\Big)^2,\quad j\ge1,$$
and the eigenvectors
$$u_j = \sqrt{\tfrac2T}\,\sin\big(t/\sqrt{\lambda_j}\big),\quad j\ge1,$$
which imply a regularity index of $b=2$ for the regularly varying function
$$\varphi(x) := \Big(\frac T\pi\Big)^2 x^{-2}.$$
Let $\alpha$ be a quantizer for $\bigotimes_{j=1}^\infty N(0,\lambda_j)$; then, for $S^{-1}$ from (3.1),
$$\gamma := S^{-1}\alpha = \Big\{t\mapsto\sqrt{\tfrac2T}\sum_{j\ge1}a_j\sin\big(\pi(j-1/2)t/T\big) : (a_1,a_2,\ldots)\in\alpha\Big\} \tag{3.6}$$
provides a quantizer for $W$ which produces the same quantization error as $\alpha$ and is stationary iff $\alpha$ is. Furthermore, we can restrict w.l.o.g. to the case $T=1$.
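A short sketch of the reconstruction (3.6) for $T=1$ (illustrative; the coefficient codebook `alpha` is assumed to come from one of the designs above):

```python
import numpy as np

def brownian_path_quantizer(alpha, t):
    """Map coefficient codewords a = (a_1, ..., a_d) to path functions via (3.6) with T = 1:
    x_a(t) = sqrt(2) * sum_j a_j * sin(pi * (j - 1/2) * t).

    alpha : (n, d) array of (truncated) coefficient codewords
    t     : (m,) array of time points in [0, 1]
    returns an (n, m) array containing one discretized path per codeword
    """
    d = alpha.shape[1]
    j = np.arange(1, d + 1)
    basis = np.sqrt(2.0) * np.sin(np.pi * (j[:, None] - 0.5) * t[None, :])   # (d, m)
    return alpha @ basis

t = np.linspace(0.0, 1.0, 101)
alpha = np.array([[1.0, 0.0, 0.0]])        # a single illustrative codeword
paths = brownian_path_quantizer(alpha, t)
```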

Concerning the numerical construction of a quantizer for the Brownian motion, we need access to precomputed stationary quantizers of $\bigotimes_{j=k_i+1}^{k_{i+1}}N(0,\lambda_j)$ and $\bigotimes_{j=k_i+1}^{k_{i+1}}N(0,1)$ for all possible combinations of the block allocation problem. As soon as these quantizers are computed, we can perform the block allocation of the quantizer designs to produce optimal quantizers for $\bigotimes_{j=1}^\infty N(0,\lambda_j)$.


Table 1. Quantizer Design I.

    n       d_n   E‖W − Ŵ^{γ_I}‖²_{L²_T}
    1       1     0.5000
    5       1     0.1271
    10      2     0.0921
    50      3     0.0558
    100     4     0.0475
    500     6     0.0353
    1000    6     0.0318
    5000    8     0.0258
    10000   9     0.0238

Table 2. Quantizer Design II, l = 2.

    n       n_i                    l_i            E‖W − Ŵ^{γ_II}‖²_{L²_T}
    1       1                      1              0.5000
    5       5                      1              0.1271
    10      10                     1              0.0921
    50      25 × 2 = 50            2 + 1 = 3      0.0580
    100     50 × 2 = 100           2 + 1 = 3      0.0492
    500     100 × 5 = 500          2 + 1 = 3      0.0372
    1000    111 × 3 × 3 = 999      2 + 1 + 2 = 5  0.0339
    5000    166 × 10 × 3 = 4980    2 + 2 + 2 = 6  0.0276
    10000   208 × 12 × 4 = 9984    2 + 2 + 2 = 6  0.0255

Table 3. Quantizer Design III, l = 3.

    n       n_i                      l_i                E‖W − Ŵ^{γ_III}‖²_{L²_T}
    1       1                        1                  0.5000
    5       5                        1                  0.1271
    10      5 × 2 = 10               1 + 1 = 2          0.0984
    50      10 × 5 = 50              1 + 2 = 3          0.0616
    100     12 × 4 × 2 = 96          1 + 1 + 1 = 3      0.0513
    500     16 × 5 × 3 × 2 = 480     1 + 1 + 1 + 1 = 4  0.0387
    1000    20 × 25 × 2 = 1000       1 + 2 + 2 = 5      0.0350
    5000    26 × 8 × 8 × 3 = 4992    1 + 1 + 2 + 2 = 6  0.0285
    10000   25 × 36 × 11 = 9900      1 + 2 + 3 = 6      0.0264

For the quantizers of Design I we used the stochastic algorithm from Section 3.2, whereas for Designs II–IV we could employ deterministic procedures for the integration on the Voronoi cells with maximal block lengths $l=2$ and $l=3$ respectively, which provide a maximum level of stationarity, i.e. $\|\nabla D_n\|\le10^{-8}$.

The asymptotic performance of the quantizer designs in view of Theorem 2.2, i.e. the behaviour of $n\mapsto\log n\,\mathbb E\|W-\hat W^\gamma\|^2_{L^2_T}$
