• Aucun résultat trouvé

Central limit theorems for simultaneous Diophantine approximations

N/A
N/A
Protected

Academic year: 2022

Partager "Central limit theorems for simultaneous Diophantine approximations"

Copied!
36
0
0

Texte intégral

(1)

Central limit theorems for simultaneous Diophantine approximations Tome 4 (2017), p. 1-35.

<http://jep.cedram.org/item?id=JEP_2017__4__1_0>

© Les auteurs, 2017.

Certains droits réservés.

Cet article est mis à disposition selon les termes de la licence CREATIVECOMMONS ATTRIBUTIONPAS DE MODIFICATION3.0 FRANCE. http://creativecommons.org/licenses/by-nd/3.0/fr/

L’accès aux articles de la revue « Journal de l’École polytechnique — Mathématiques » (http://jep.cedram.org/), implique l’accord avec les conditions générales d’utilisation (http://jep.cedram.org/legal/).

Publié avec le soutien

du Centre National de la Recherche Scientifique

cedram

Article mis en ligne dans le cadre du

Centre de diffusion des revues académiques de mathématiques

(2)

CENTRAL LIMIT THEOREMS FOR SIMULTANEOUS DIOPHANTINE APPROXIMATIONS

by Dmitry Dolgopyat, Bassam Fayad & Ilya Vinogradov

Abstract. — We study the distribution modulo1of the values taken on the integers ofrlinear forms indvariables with random coefficients. We obtain quenched and annealed central limit theorems for the number of simultaneous hits into shrinking targets of radii n−r/d. By the Khintchine-Groshev theorem on Diophantine approximations,r/dis the critical exponent for the infinite number of hits.

Résumé(Théorème central limite pour des approximations diophantiennes simultanées) Nous étudions la loi de probabilité modulo1des valeurs prises sur les entiers parrformes linéaires de dvariables à coefficients aléatoires. Nous montrons un théorème central limite,

« en moyenne » et « presque sûr », pour le nombre de points atteignant simultanément des cibles de rayon décroissant à une vitessen−r/d. D’après le théorème de Khintchine-Groshev sur les approximations diophantiennes,r/dest le seuil critique à partir duquel le nombre des points tend vers l’infini.

Contents

1. Introduction. . . 2

2. Central Limit Theorems on the space of lattices. . . 4

3. Diagonal actions on the space of lattices and Diophantine approximations . . . . 7

4. An abstract Central Limit Theorem. . . 9

5. Preliminaries on diagonal actions and Siegel transforms . . . 19

6. Proof of the CLT for diagonal actions. . . 22

7. Related results. . . 26

8. Variances. . . 29

Appendix. Truncation and norms. . . 30

References. . . 34

Mathematical subject classification (2010). — 60F05, 37A17, 11K60.

Keywords. — Central limit theorem, weakly dependent random variables, diophantine approxima- tion, linear forms, space of lattices.

D.D. was partially supported by the NSF.

B.F. was partially supported by ANR-15-CE40-0001 and by the project BRNUH.

I.V. appreciates the support of Fondation Sciences Mathématiques de Paris during his stay in Paris.

The research leading to these results has received funding from the European Research Council under the European Union’s Seventh Framework Programme (FP/2007–2013) / ERC Grant Agreement no. 291147.

(3)

1. Introduction

1.1. Results. — An important problem in Diophantine approximation is the study of the speed of approach to0of a possibly inhomogeneous linear form of several variables evaluated at integers points. Such a linear form is given by`:T×Td×Zd→T

`x,α(k) =x+

d

X

i=1

kiαi (mod 1).

More generally, forr >1, one can considerr linear forms`xjj(k) forj = 1, . . . , r corresponding to a := (αji) ∈ Td×r and x := (x1, . . . , xr) ∈ Tr, where each αj, j= 1, . . . , r, is a vector inTd.

Diophantine approximation theory classifies the matricesaand vectorsxaccording to how “resonant” they are; i.e., how well the vector (`xjj(k))rj=1 approximates 0:= (0, . . . ,0)∈Rr as k varies over a large ball inZd. One can then fix a sequence of targets converging to0, say intervals of radius rn centered at0 with rn →0, and investigate the integers for which the target is hit, namely the integers k such that

`jxjj(k)∈[−r|k|, r|k|]for everyj = 1, . . . , r. An important class of targets is given by radii following a power law,rn =cn−γ for someγ, c >0 (see for example [18, 15, 30]

or [5] for a nice discussion related to the Diophantine properties of linear forms).

Fix a norm | · |onRd, and letk · kdenote the Euclidean norm onRr. Forc >0, define the following sets

B1(k, d, r, c) = [0, c|k|−d/r]r⊂Rr and

B2(k, d, r, c) ={a∈Rr:kak6c|k|−d/r}.

Forι= 1 or2, we then introduce VN,ι(a,x, c) = #

06|k|< N: (`xjj(k))rj=1∈Bι(k, d, r, c) , UN,ι(a, c) =VN,ι(a,0, c).

(1.1)

A matrix a ∈ Td×r is said to be badly approximable if for some c > 0, the se- quence UN,ι(a, c) is bounded. By contrast, matrices a for which UN,ιε (a, c) is un- bounded for some ε >0, whereUN,ιε (a, c) is defined asUN,ι, but with radiicndr−ε instead of cn−d/r are calledvery well approximable or VWA. The obvious direction of the Borel-Cantelli lemma implies that almost every a ∈ Td×r is not very well approximable (cf. [7, Chap. VII]). The celebrated Khintchine-Groshev theorem on Diophantine approximation implies that badly approximable matrices are also of zero measure [17, 15, 12, 28, 6, 4]. Analogous definitions apply in the inhomogeneous case ofVN,ι(a,x, c), and similar results hold.

For targets given by a power law, the radii cn−d/r are thus the smallest ones to yield an infinite number of hits almost surely. A natural question is then to investigate statistics of these hits, which we call resonances. In the present paper we address in this context the behavior of the resonances on average over a and x, or on average

(4)

overawhilexis fixed at0or fixed at random. LetVoldenote the Euclidean volume and consider the expected number of hits

VbN,ι= Vol(Bι(1, d, r, c)) lnN

whenxandaare uniformly distributed on corresponding tori. LetN (m, σ2)denote the normal distribution with meanmand varianceσ2.

Theorem1.1. — Suppose (r, d) 6= (1,1), and let a be uniformly distributed on Tdr. Then,

UN,ι(a, c)−VbN,ι

√ lnN

converges in distribution toN (0, σι2) asN→ ∞, where σ12= 2crdζ(r+d−1)

ζ(r+d) Vol(B), σ22= πr/2 Γ(r2+ 1)σ21, (1.2)

andB is the unit ball in| · |-norm.

Remark1.2. — The restriction(r, d)6= (1,1)above is necessary. In fact, it is shown in [25, 27] using continued fractions that in that case the Central Limit Theorem (CLT) still holds forUN, but the correct normalization should be√

lnNln lnN rather than

√ lnN.

Theorem1.3. — Let a andxbe uniformly distributed inTdr. Then, VN,ι(a,x, c)−VbN,ι

q VbN,ι converges in distribution toN (0,1)asN → ∞.

The preceding theorems give CLTs in the cases of x fixed to be0 or x random.

The CLT also holds for almost every fixedx.

Theorem1.4. — Suppose (r, d) 6= (1,1). For almost every x, if a is uniformly dis- tributed inTdr, then(VN,ι(a,x, c)−VbN,ι)/

q

VbN,ι converges in distribution to a nor- mal random variable with zero mean and variance one.

1.2. Plan of the paper. — Using a by now standard approach of Dani correspon- dence (cf. [9, 23, 24, 1, 2, 3, 21]) we deduce our results about Diophantine approxi- mations from appropriate limit theorems for homogeneous flows. Namely we need to prove CLTs for Siegel transforms of piecewise smooth functions; these limit theorems are formulated in Section 2. The reduction of the theorems of Section 1 to those of Section 2 is given in Section 3. The CLTs in the space of lattices are in turn deduced from an abstract Central Limit Theorem (Theorem 4.2) for weakly dependent random variables which is formulated and proven in Section 4. In order to verify the condi- tions of Theorem 4.2 for the problem at hand we need several results about regularity of Siegel transforms which are formulated in Section 5 and proven in the appendix.

(5)

In Section 6 we deduce our Central Limit Theorems for homogeneous flows from the abstract Theorem 4.2. Section 8 contains the proof of the formula (1.2) on the vari- ances. Section 7 discusses some applications of Theorem 4.2 beyond the subject of Diophantine approximation.

2. Central Limit Theorems on the space of lattices

2.1. Notation. — We letG= SLd+r(R),Ge = SLd+r(R)n Rd+r. The multiplication rule inGe takes form (A, a)(B, b) = (AB, a+Ab). We regard G as a subgroup of Ge consisting of elements of the form (A,0). We let L be the abelian subgroup of G consisting of matricesΛa, andLe be the abelian subgroup ofGe consisting of matrices (Λa,(0, y))where0is an origin inRd,yis anr-dimensional vector,ais ad×rmatrix and

Λa=

Idd 0 a Idr

.

Let M be the space of d+r dimensional unimodular lattices and Mfbe the space of d+r dimensional unimodular affine lattices. We identify M andMfrespectively withG/SLd+r(Z)andGe/SLd+r(Z)n Zd+r.

We will need spacesCs,r(Rp), Cs,r(M), andCs,r(Mf)of functions which can be well approximated by smooth functions, given s,r > 0. Recall first that the space Cs(Rp)consists of functionsf:Rp→Rwhose derivatives up to ordersare bounded.

To define spacesCs(M)andCs(Mf), fix bases for Lie(G)and Lie(Ge); then,Cs(M) and Cs(Mf) consist of functions whose derivatives corresponding to monomials of order up tosin the basis elements are bounded (see Appendix for precise definitions).

Now we defineCs,r-norm on a space equipped with aCs-norm and anL1-norm by kfkCs,r = sup

0<ε<1

sup

f6f6f+ kf+−fkL1

εr(kf+kCs+kfkCs).

(2.1)

In the examples considered above (Rd+r,M and Mf), the L1-norm is taken with respect to the Haar measure. Some properties of these spaces are discussed in the Appendix.

Given a functionf onRr+d we consider its Siegel transformsS(f) :M →Rand Sf(f) :Mf→Rdefined by

S(f)(L) = X

e∈L

f(e), Sf(f)(Lf) = X

e∈Lf

f(e).

We emphasize that Siegel transforms of smooth compactly supported functions are never bounded but their growth at infinity is well understood, see Subsection 5.3.

2.2. Results for the space of lattices. — In this section we present general Central Limit Theorems for Siegel transforms. The reduction of Theorems 1.1, 1.3, 1.4 to the results stated here will be performed in Section 3.

Letf ∈Cs,r(Rd+r)be a non-negative function supported on a compact set which does not contain 0. (The assumption that f vanishes at zero is needed to simplify

(6)

the formulas for the moments of its Siegel transform. See Proposition 5.1.) Denote f =RR

Rd+rf(x, y)dxdy.

Given positive numbers K and αwe say that a subset S ⊂ M is (K, α)-regular if S is a union of codimension 1 submanifolds and there is a one-parameter sub- grouphu⊂Lsuch that

µ(L :h[−ε,ε]L ∩S6=∅)6Kεα,

where µ denotes the Haar measure on M. We say that a function ρ: M → R is (K, α)-regular if supp(ρ) has a (K, α)-regular boundary and the restriction of ρ to supp(ρ)belongs toCαwith

kρkCα(supp(ρ))6K.

(K, α)-regular functions onMfare defined similarly.

Let A be subgroup of G consisting of diagonal matrices. We use the notation da for Haar measure on A. We say that ρ is K-centrally smoothable if there is a positive function φsupported in a unit neighborhood of the identity inAsuch that R

Aφ(a)da= 1and

ρφ(L) :=

Z

A

ρ(aL)φ(a)da

hasLnorm less thanK. We say that a functionρonMfisK-centrally smoothableif ρ(L) = sup

x

ρ(L,x)

is a K-centrally smoothable function on M. As before, we write N (m, σ2) for the normal distribution with m and variance σ2 and “=⇒” stands for convergence in distribution.

Forp∈Nandt∈R, we denote thep×pdiagonal matrix Dp(t) =

 2t

. .. 2t

and let

(2.2) g=

Dd(−1) 0 0 Dr(d/r)

. Theorem2.1. — Suppose that (r, d)6= (1,1).

(a) There is a constantσ such that ifL is uniformly distributed on M then PN−1

n=0 S(f)(gnL)−N f σ√

N =⇒ N (0,1)

asN → ∞.

(b) Fix constants C, u, α, ε with ε <1/2. Suppose that L is distributed according to a densityρN which is(CNu, α)-regular andC-centrally smoothable. Then

PN−1

n=NεS(f)(gnL)−N f σ√

N =⇒ N (0,1)

asN → ∞.

(7)

Theorem2.2

(a) LetLfbe uniformly distributed onMf. Then there is a constantsuch that PN−1

n=0 Sf(f)(gnLf)−N f eσ√

N =⇒ N (0,1)

asN → ∞.

(b) Fix constants C, u, α, ε with ε <1/2. Suppose that Lfis distributed according to a densityρN which is(CNu, α)-regular andC-centrally smoothable. Then

PN−1

n=NεSf(f)(gnLf)−N f eσ√

N =⇒ N (0,1)

asN → ∞.

LetDe be an unstable box, that is

De={(Λa,(0,x))Lf0}(a,x)∈R1×R2,

whereLf0is a fixed affine lattice andR1andR2are boxes inRd×randRrrespectively.

Consider a partitionΠofDe into L-boxes. Thus elements ofΠare of the form {(Λa,(0,x0))Lf0}a∈R1

for some fixed x0.

Theorem2.3. — Suppose that(r, d)6= (1,1). Then for each unstable cubeDe and for almost everyL ∈De, ifLfis uniformly distributed in Π(L), then

PN−1

n=0 Sf(f)(gnLf)−N f eσ√

N =⇒ N (0,1)

asN → ∞.

The explicit calculation ofσandσewhenf is an indicator function (the case needed for Theorems 1.1-1.4) will be given in Section 8.

Remark2.4. — Central Limit Theorems for partially hyperbolic translations on ho- mogeneous spaces are proven in [10] (for bounded observables) and in [22] (for L4 observables); see also [31, 26] for important special cases. It seems possible to prove Theorem 2.1 for sufficiently large values of d+rby verifying the conditions of [22].

Instead, we prefer to present in the next section an abstract result which will later be applied to derive Theorems 2.1, 2.2, and 2.3. We chose this approach for three reasons.

First, this will make the paper self contained. Second, we replace theL4 assumption of [22] by a weaker L2+δ assumption which is important for small d+r. Third, our approach allows to give a unified proof for Theorems 2.1, 2.2, and 2.3.

(8)

3. Diagonal actions on the space of lattices and Diophantine approximations

In this section we reduce Theorem 1.1, 1.3, and 1.4 to Theorems 2.1–2.3. To fix our notation we considerUN,1andVN,1, the analysis ofUN,2 andVN,2 being similar. We also drop the extra subscript and writeUN,1 andVN,1 as UN andVN, respectively, until the end of this section.

In Subsections 3.1 and 3.2 we explain how to reduce Theorem 1.1 to Theorem 2.1.

The reductions of Theorems 1.3 and 1.4 to Theorems 2.2 and 2.3 require only minor modifications which will be detailed in Section 3.3.

3.1. Dani correspondence. — In this subsection we use the Dani correspondence principle to reduce the problem to a CLT for the action of diagonal elements on the space of lattices of the formΛawhere ais random.

Letabe the matrix with rows αi∈Rd,i= 1, . . . , r. Let Λa=

Idd 0 a Idr

. Letφbe the indicator of the set

(3.1) Ec :=n

(x, y)∈Rd×Rr:|x| ∈[1,2],|x|d/ryj ∈[0, c] forj = 1, . . . , ro and consider its Siegel transformΦ =S(φ).

Nown= (n1, . . . , nd)with|n|6N contributes toUN(a, c)(from (1.1)) precisely when there exists(m1, . . . , mr)∈Zr such that

(3.2) Xd

i=1

niα1,i+m1, . . . ,

d

X

i=1

niαr,i+mr

∈B(n, d, r, c).

Clearly such a vector(m1, . . . , mr)is unique. It is elementary to see that (3.2) holds if and only if

(3.3) gtΛa(n1, . . . , nd, m1, . . . , mr)∈Ec.

for some integert62[log2N] whereg is the diagonal matrix defined in (2.2).

Below we will use the notion AN BN to mean that the ratio |AN|/BN is bounded. From (3.2)–(3.3) we obtain

Lemma3.1. — For each ε >0 UN(a, c) =

[log2N]

X

t=[(log2N)ε]

Φ◦gtaZd+r) +RN, with kRNkL1(a)(log2N)ε and L1(a) denoting the L1-norm with respect to the Lebesgue measure on the unit cube in Rd×r.

Proof. — From (3.2)–(3.3), it follows that RN 6#

2[log2N]6|n|< N,

or|n|62(log2N)ε) : (`0,α1(n), . . . , `0,αr(n))∈B(n, d, r, c)

.

(9)

Note that for a fixed n ∈ Zd and j ∈ 1, . . . , r, the form{`j(n,0, αj)} is uniformly distributed on the circle. Hence

Leb

a∈Trd: (`0,α1(n), . . . , `0,αr(n))∈B(n, d, r, c) 1

|n|d, and so

kRNkL1(a) X

2[log2N]6|n|6N

1

|n|d + X

|n|62(log2N)ε

1

|n|d (log2N)ε. Hence, to prove Theorem 1.1 we can replaceUN(a, c)by

[log2N]

X

t=[(log2N)ε]

Φ◦gtaZd+r).

3.2. Changing the measure. — Note that the action ofgton the space of latticesM is partially hyperbolic and its unstable manifolds are orbits of the action Λa with a∈M(d, r)ranging in the set ofd×rmatrices. This will allow us to reduce the proof of Theorems 1.1, 1.3 and 1.4 to CLTs for the diagonal action on the space of lattices.

A similar reduction is possible for thegt-action on the space of affine lattices since in that case the unstable manifolds for the action of gt are given by (Λa,(0,x)), with a∈M(d, r),x∈Rr.

Letη = 1/k10d wherek = [log2N]. For i= 1, . . . , r+d−1 letti be independent uniformly distributed in[−η, η]. Also introduce a random matrixb∈M(r, d) where all the entries ofbare independent uniformly distributed in[−1,1]. Let

Dt= diag

1 +t1, . . . ,1 +td+r−1, Qr+d−1

`=1 (1 +t`)−1 and

Λb=

Idd b 0 Idr

. LetΛ(a,e b,t) =DtΛbΛa.

It is clear that ifa is distributed uniformly in a unit cube, thenΛ(a,e b,t) is dis- tributed according to a (Ck10d,1)-regular and C-centrally smoothable density for some constantC. Note that

gtΛ(a,e b,t) =DtΛbtgtΛa, where|bt|6e−t. Observe also that forh∈GandE⊂Rd+r, we have

S(1E)(hL) =S(1h−1E)(L), and thus ifεis sufficiently small then fort>kεwe have

S(1Ec)(gtΛ(a,e b,t)Zd+r)−S(1Ec)(gtΛaZd+r)

6S(1Ee)(gtΛaZd+r), whereEedenotes aCk−10d neighborhood of the boundary ofEc. Now the same argu- ment as in Lemma 3.1 gives

(10)

Lemma3.2. — For each ε >0,

[log2N]

X

t=[(log2N)ε]

Φ◦gtaZd+r) =

[log2N]

X

t=[(log2N)ε]

Φ◦gt(eΛ(a,b,t)Zd+r) +ReN,

with eRN

L1(a,b,t)1.

Now Theorem 1.1 follows from Theorem 2.1 except for the formula forσwhich is derived in Section 8.

3.3. Inhomogeneous case. — The reduction of Theorems 1.3 and 1.4 to Theorems 2.2 and 2.3 requires only small changes compared to the preceding section. To wit, Lem- mas 3.1 and 3.2 take the following form.

Lemma3.3

(a) For eachε >0, VN(a,x, c) =

[log2N]

X

t=[(log2N)ε]

Sf(1Ec)(gtaZd+r+ (0,x))) +RN(a,x, c), wherekRNkL1(a)(log2N)ε.

(b) Let bandthave the same distribution as in Subsection 3.2andybe uniformly distributed in [−1,1]d. Then for eachε >0

[log2N]

X

t=[(log2N)ε]

Sf(1Ec)(gtaZd+r+ (0,x)))

=

[log2N]

X

t=[(log2N)ε]

Sf(1Ec)(gt(eΛ(a,b,t)Zd+r+ (y,x))) +ReN(a,b,t,x,y, c), with

eRN

L1(a,b,t,y)1.

Note that in part (a) the error has small L1(a) norm for each fixed x. This fol- lows from the fact that for each x and k, ak+x is uniformly distributed on Tr. This is useful in the proof of Theorem 1.4 since we want to have a control for each (or at least, most)x. We also note that part (b) is only needed for Theorem 1.3 since in Theorem 2.3 we start with initial conditions supported on a positive codimension submanifold ofMf. (One of the steps in the proof of Theorem 2.3 consists of fattening the support of the initial measure, see Subsection 6.3, however Lemma 3.3(b) is not needed at the reduction stage).

4. An abstract Central Limit Theorem

In this section, we present an abstract Central Limit Theorem for weakly dependent random variables that is well adapted to variables coming from deterministic dynami- cal systems. In order to make our theorem applicable to mixing flows on non-compact manifolds as well as to non-uniformly hyperbolic systems we allow the variables to

(11)

be unbounded and take into account the existence of small exceptional “bad sets” on which the variables are not controlled. In Section 6, we will use the abstract CLT (The- orem 4.2) to prove Theorems 2.1, 2.2, 2.3. There, we will take the variablesξ` to be ξ`(L) = Φ(g`L),Φ =S(f)(orSf(f)), for a positive functionf ∈Cs,r(Rd+r)sup- ported on a compact set which does not contain0. The functionsΦare not bounded but, since we excluded the cases of linear lattices with r=d= 1, they are inLs for s >2. The latter fact follows from the results of [23] and [21] (which will be recalled in Section 5.3 and the Appendix).

4.1. Bounded random variables. — Before we state our CLT for variables in Ls, s >2, we give a simplified version of it in the case of bounded variables.

In what follows C, u >0, θ ∈ (0,1) are fixed constants. Let ξn be a sequence of random variables satisfying the following conditions. Write ξbnn−E(ξn)for the corresponding centered random variable.

(H1)] There exists filtration{F`}`>0such that for every`, kthere exists a bounded F`+k-measurable random variableξ`,`+k withEξ`,`+k=Eξ`such that

P(|ξ`−ξ`,`+k|>θk)6C(`+ 1)uθk.

(H2)] For`, k, there exists an eventG`,ksuch thatP(Gc`,k)6Cθk and forω∈Gk,`

|E(bξ`+k|F`)(ω)|6C(`+ 1)uθk. (H3)] There is a sequencebk such that forω∈Gk,` andk0>k

E(bξ`+kξb`+k0|F`)(ω)−bk0−k

6C(`+ 1)ueu(k0−k)θk. Theorem4.1. — If ξ` is a bounded sequence satisfying(H1)–g (H3)g then

Pn−1

`=0`−E(ξ`))

√n

converges as n→ ∞to the normal distribution with zero mean and variance σ2=b0+ 2

X

k=1

bk.

Theorem 4.1 follows from a more general statement (Theorem 4.2) that we now formulate and prove.

4.2. The general statement. — In what followsC, u >0, θ∈(0,1), ands > 2 are fixed constants. Let ξn be a sequence of random variables satisfying the following conditions. Writeξbnn−E(ξn)for the corresponding centered random variable.

(H1) Given anyK, there is a sequencesξnK of random variables such that (H1a) |ξKn|6K almost surely;

(H1b) E |ξnK−ξn|

=O(K1−s),E((ξnK−ξn)2) =O(K2−s).

(12)

(H2) There exists a filtrationF`=F`,n defined for06` < n such that for every pair of nonnegative numbers`, kwith`+k < n, there exists a variableξK`,`+k that is F`+k-measurable,|ξ`,`+kK |6K almost surely,Eξ`,`+kK =EξK` , and

P(|ξK` −ξK`,`+k|>θk)6CKu(`+ 1)uθk.

(H3) For`, k, there exists an eventGk,`such thatP(Gck,`)6Cθk and forω∈Gk,`

|E(bξK`+k|F`)(ω)|6C(`+ 1)uKuθk.

(H4) There exists a numerical sequencebK,k fork>0 such that forω∈Gk,` and k0 >k,

E(bξK`+kξbK`+k0|F`)(ω)−bK,k0−k

6C(`+ 1)uKueu(k0−k)θk.

In the following Theorem, part (a) is sufficient to prove results such as Theo- rem 2.1(a), where the distribution of the lattices is given by the Haar measure, while part (b) is tailored to adapt to the case of localized initial conditions such as in Theorem 2.1(b).

Theorem4.2

(a) Under conditions(H1)–(H4) ifω is distributed according toP then Pn−1

`=0 ξb`

√n =⇒ N (0, σ2), with

(4.1) σ20+ 2

X

j=1

σj, σj= lim

K→∞bK,j.

(b) Suppose conditions (H1)–(H4)are satisfied. Fixε >0 such that 1+εs +ε <1/2 and setKn=n(1+ε)/s. Suppose thatω is distributed according to a measurePn which has a densityρn=dPn/dP satisfying

(D1) ρn6Cnu;

(D2) for eachk there is an Fk-measurable density ρn,k such that P(|ρn−ρn,k|>θk)6Cnuθk;

(D3) For eachnε6`6n,

En(|ξ`−ξ`Kn|)6CKn1−s,

whereEn denotes the expectation with respect toPn, that is,En(η) =E(ηρn).

Then,

Pn−1

`=nεξb`

√n =⇒ N(0, σ2).

(13)

4.3. Limiting variance. — Here we show that the normalized variance converges.

Lemma4.3. — Under conditions (H1)–(H4) we have that

n→∞lim

E((Pn−1

`=0 ξb`)2) n =σ2, with σas in (4.1).

Proof. — First we record a property of cross-terms in the sum that lets us pass fromξn to the truncated sequence. We have

D:=E(bξm+jξbm)−E(bξm+jK ξbmK) =O(K2−s).

(4.2)

Indeed, we have

D=E(ξm+jξm)−E(ξKm+jξmK)

−E(ξm+j)E(ξm) +E(ξKm+j)E(ξmK)

=E(ξm+jKm−ξKm)) +E(ξmKm+j−ξKm+j)) +E((ξm−ξmK)(ξm+j−ξm+jK ))

−E(ξKm+j)E(ξm−ξmK)−E(ξKm)E(ξm+j−ξm+jK )−E(ξm−ξKm)E(ξm+j−ξKm+j).

Now applying the Cauchy-Schwarz inequality followed by (H1a) and (H1b), we arrive at the bound (4.2) sinces >2.

Next, applying (H1), (H3), and (H4) with`= 0 gives E(bξm+jK ξbmK) =E(bξm+jK ξbKm1Gm,0) +E(bξm+jK ξbmK1Gcm,0)

=bK,j+O(Kueujθm) +O(kbξKm+jkLkξbmKkLP(Gcm,0))

=bK,j+O(Kueujθm) +O(K2θm).

(4.3)

Combining (4.2) and (4.3) we get

bK,j=E(bξm+jξbm) +O(K2−s) +O(Kueujθm) +O(K2θm).

Take a small numbereε >0and assume thatj 6εK,e K <K <e 2K. Takingm=K/2 we see that

bK,j−b

K,je =O(K2−s).

Therefore for eachj, the following limits exist, σj:= lim

K→∞bK,j. and moreover

(4.4) bK,jj+O(K2−s).

We give now the key estimates on the cross-terms in the variance according to different regimes for`andj.

Sublemma4.4. — We have the following estimates.

(a) There existsA >0such that for all (`, `+j)∈[0, n]2

|E(bξ`ξb`+j)|6A.

(14)

(b) Let η= min

3(s−2) 20(1+u),15

. Ifj>lnn2 and`6n, then E(bξ`ξb`+j) =O(θηj).

(c) Ifj 6lnn2 and`>lnn4, then

E(bξ`ξb`+j)−σj=O(n−2).

(d) There existsC >0 andν >0 such that forj>lnn2,

j|6Cθνj.

Before we prove the sublemma, we show how it implies Lemma 4.3. We have E((Pn−1

`=0 ξb`)2)

n = 1

n n−1

X

`=0

E(bξ2`) + 2 X

`∈[0,n−2]

j∈[1,n−`]

E(bξ`ξb`+j)

= 1 n

n−1 X

`=lnn4

E(bξ`2) + 2 X

`∈[lnn4,n−2]

j∈[1,lnn2]

E(bξ`ξb`+j)

+o(1)

0+ 2

lnn2

X

j=0

σj+o(1)

0+ 2

X

j=0

σj+o(1),

where the second equality follows from parts (a) and (b) of Sublemma 4.4, the third equality follows from part (c), and the fourth equality from part (d).

Proof of Sublemma4.4. — Part (a) follows from (4.2) and (H1a).

To prove part (b) assume j >lnn2 and let K =θ−δj with δ= 3/20(1 +u). We claim that

|E(bξ`Kξb`+jK )|6Cθj/5. This is seen by writing

E(bξ`Kξb`+jK ) =E(bξK`,`+j/2E(bξ`+jK |F`+j/2))−E(bξ`+jK (bξK`,`+j/2−ξbK` )) =I+II and then estimating

II 6CKh

θj/2+K1+u(`+ 1)uθj/2i

6CK2+2uθj/2 using (H1a) and (H2), and

I6CK×Ku(`+ 1)uθj/26CK2+2uθj/2 using (H1a) and (H3).

Now (4.2) implies that

|E(bξ`ξb`+j)|6Cθj/5+Cθ(s−2)/2uj =O(θηj) as needed.

(15)

To prove part (c) we letK=n−2/(2−s). Then (4.2) yields E(bξ`ξb`+j)−E(bξ`Kξb`+jK ) =O(n−2).

Now (H4) and (4.4) imply

E(bξK` ξbK`+j)−σj=O(n−2) +O(θlnn3) and (c) follows.

To prove part (d), fixj, let K=θ−εj where ε1, and take`=j/ε. Then (H4) and (4.4) imply that

E(bξK` ξb`+jK )−σj =O(θνj), withν =ε(s−2)/2. From (4.2) it follows that

E(bξ`ξb`+j)−σj =O(θνj),

which together with (b) yields (d).

4.4. Proof of Theorem 4.2. — In our proof, we shall use a standard Bernstein method based on “big block-small block” technique. That is, we divide the inter- val[0, n]into big blocks of lengthnεalternating with small blocks of lengthnε2. The big blocks will be almost “independent,” each from the preceding ones, due to the buffer zones that are the small blocks, while the contribution of these small blocks can be neglected.

From now on we fix ε >0 so that 1+εs +ε < 12 and let K =Kn =n(1+ε)/s. Let

`m=m[nε], m= 0, . . . ,[n1−ε] :=mn. Denote Zm=

`m+1−1

X

`=`m+nε2

ξb`K, Zem=

`m+1−1

X

`=`m+nε2

ξb`, Zeem=

`m+nε2−1

X

`=`m

ξb`, Zˇ=

n−1

X

`=`mn

ξb`, so that

n−1

X

`=0

ξb`=

mn−1

X

m=0

ee Zm+

mn−1

X

m=0

Zem+ ˇZ.

We claim that (4.5)

Pmn−1

m=0 Zeem+ ˇZ

√n and

Pmn−1

m=0 (Zm−Zem)

√n

converge to0in probability; it will therefore suffice for the proof of Theorem 4.2 (a), to prove the following.

Lemma4.5

Pmn m=0Zm

σ√

n =⇒ N (0,1).

We first prove the claim. To begin with, observe that following the proof of Lemma 4.3, we have

(4.6) E

Xmn

m=0

ee

Zm+ ˇZ2

=O(n1−ε+ε2) =o(n).

(16)

As for the second sum in (4.5), we write Zm−Zem=

`m+1−1

X

`=`m+nε2

ξ`K−ξ`

− E(ξ`K)−E(ξ`) . Condition (H1b) gives

E|Zm−Zem|62

`m+1−1

X

`=`m+nε2

E(|ξK` −ξ`|)6CnεK1−s. Therefore,

(4.7)

mn

X

m=1

E|Zm−Zem|

√n =O K1−s√ n

=O

n1+εs 12

=O n−ε

−→0.

We now prepare for the proof of Lemma 4.5. We start by defining an exceptional set Gcm on which we will not be able to exploit the almost independence of Zm+1 from (Z1, . . . , Zm). The exact reasons for the definition of each condition on the

“good set”Gm will become evident in the course of the proof.

Letb`m+1=`m+1+nε2/2. With setsG`,k as in hypotheses (H2)–(H4), we let G(1)m =

nε

T

k=nε2

G`m+1,k∩G

b`m+1,k

form6mn. Next, define for`>`m+1+nε2 Ge`=n

ω:E

ξ`K−ξK

`,`+nε2/2

Fb`m+1

(ω)6θnε

2/10o

and set

G(2)m =

nε

T

k=nε2

Ge`m+1+k. Fork, k0∈[nε2, nε]withk0−k>nε2/2 define

Em,k,k0 =n ω0:

E(bξK`m+1+k0|F`m+1+k+nε2/2)(ω0) >θnε

2/10o

and let

Gm,k,k0 =n

ω:E 1Em,k,k00) F`bm+1

(ω)6θnε

2/10o

and

G(3)m = T

k,k0∈[nε2,nε] k0−k>nε2/2

Gm,k,k0.

Finally define “the good set”

Gm=G(1)m ∩G(2)m ∩G(3)m. Observe that (H2)–(H4) show that

(4.8) P(Gcm)6Cθnε

2/100

.

The main step in the proof of Lemma 4.5, and thus of our CLT, is the following

(17)

Sublemma4.6. — Forω∈Gm, (4.9) lnE eZm+1n

Fb`m+1

(ω) =−nε

2σ2(1 +o(1)), whereo(1)is uniform inm= 1, . . . , mn.

Proof. — To prove (4.9), we expand the exponential E eZm+1n

Fb`m+1

(ω) = 1 +i λ

√nE(Zm+1|Fb`m+1)(ω)−λ2

2nE(Zm+12 |Fb`m+1)(ω) +O λ3

n3/2E(Zm+13 |Fb`m+1)(ω)

= 1 +i λ

√nE(Zm+1|Fb`m+1)(ω)−λ2

2nE(Zm+12 |Fb`m+1)(ω) +oλ2

2nE(Zm+12 |Fb`m+1)(ω) , where the last step uses that|Zm+1|6Knε=o(n1/2).

Next, (H3) implies that onGm(which is contained inG

`bm+1,kfor eachk∈[nε2, nε]) we have

E(Zm+1|Fb`m+1)(ω) =E nε

X

j=nε2

ξbK` F`bm+1

(ω) =o θ0.5nε

2 .

To complete the proof of (4.9) it suffices to show that forω∈Gm,

(4.10)

E(Zm+12 |Fb`m+1)(ω)−nεσ2

=o(nε).

Note that

E(Zm+12 |Fb`m+1)(ω) =

nε

X

k,k0=nε2

E(bξ`K

m+1+kξbK`

m+1+k0|Fb`m+1)(ω).

Let us estimate individual terms in this sum. Without loss of generality assume that k0 >k. LetRbe a large constant and consider two cases.

(a)k > R(k0−k). In this case (H4) and (4.4) give E(bξ`Km+1+kξb`Km+1+k0|Fb`m+1)(ω) +O eθk

=bK,k0−kk0−k+O K2−s

+O θek . (b)k6R(k0−k)and hencek0−k > nε2/R. Then

E(bξ`K

m+1+kξb`K

m+1+k0|Fb`m+1)(ω) =E(bξK

`m+1+k,`m+1+k+nε2/2ξb`K

m+1+k0|F`bm+1)(ω) +E

ξb`Km+1+k−ξb`K

m+1+k,`m+1+k+nε2/2

ξbK`m+1+k0|Fb`m+1 (ω)

=I+II.

(18)

The second term is O θnε

2/10

K

since ω ∈ Ge`m+1+k. For the first term use that ω∈Gm,k,k0 to obtain

|I|= E(bξK

`m+1+k,`m+1+k+nε2/2E(bξ`K

m+1+k0|F`m+1+k+nε2/2)|Fb`m+1)(ω) 6K2E(1Em,k,k00)|Fb`m+1)(ω) +Kθnε

2/10

6K2θnε

2/10

,

so bothIandII are negligible. Combining the estimates of cases (a) and (b) we obtain

(4.10) completing the proof of (4.9).

Proof of Lemma4.5. — It now remains to derive Lemma 4.5 from (4.9). For j 6m, set

Zbj=

`j+1

X

`=`j+nε2

ξb`,nKε2/2. Then

E m

X

j=0

bZj−Zj

=O θnε

2/10

and so

E eiλn

Pm+1 j=0 Zj

−E eiλn(

Pm

j=0Zbj)+iλ nZm+1

=O θnε

2/10 , (4.11)

E eiλn

Pm j=0Zj

−E eiλn

Pm j=0Zbj

=O θnε

2/10 . (4.12)

Therefore, E eiλn

Pm+1

j=0 Zj(4.11)

= E

eiλn

Pm j=0Zbj

E eiλnZm+1|F`bm+1

+O θnε

2/10

(4.8)

= E

eiλnPmj=0Zbj1GmE eiλnZm+1|F`bm+1

+O θnε

2/100

(4.9)

= eσ

2λ2 2n E eiλn

Pm j=0Zbj

+o(nε/n)

(4.12)

= eσ

2λ2 2n E eiλn

Pm j=0Zj

+o(nε/n). Iterating this recurrence relationmn times we obtain

E eiλnPmnj=0Zm

=e−σ2λ2/2+o(1),

completing the proof of Lemma 4.5, and thus of part (a) of Theorem 4.2.

Part (b) of Theorem 4.2 can be established by a similar argument and we just briefly describe the necessary changes. To extend the proof of the Central Limit Theorem to the setting of part (b) we need to prove (4.5) and (4.9) with En instead ofE. For (4.5) we need to prove the analogues of (4.6) and (4.7).

We claim the following. First, (4.9) still holds withEn instead ofE. Second,

(4.13) En

Xmn

m=1

ee

Zm+ ˇZ2

=O(n1−ε+ε2) =o(n).

Références

Documents relatifs

Key words and phrases: Central limit theorem, spatial processes, m- dependent random fields, weak mixing..

We then derive rates in the central limit theorem for weighted sums of such randoms fields via an approximation by m-dependent random fields.. Introduction and

The understanding of these models in the case where the orientations or the charges are not independently distributed requires functional limit theorems for Z-random walk in

Keywords: Random walk; Ballistic; Random environment; Central limit theorem; Invariance principle; Point of view of the particle; Environment process; Green function.. Here is a

Key words: central limit theorem, invariance principle, strictly stationary process, maximal inequality, strong mixing, absolute regularity, Markov

Theorem 2.1 asserts a functional central limit theorem for the random walk XúJ when it is confined to an infinite equivalence class, under the assumption that the density

Since every separable Banach space is isometric to a linear subspace of a space C(S), our central limit theorem gives as a corollary a general central limit theorem for separable

The quantities g a (i, j), which appear in Theorem 1, involve both the state of the system at time n and the first return time to a fixed state a. We can further analyze them and get