• Aucun résultat trouvé

Moderate deviations for stationary sequences of bounded random variables

N/A
N/A
Protected

Academic year: 2022

Partager "Moderate deviations for stationary sequences of bounded random variables"

Copied!
24
0
0

Texte intégral

(1)

www.imstat.org/aihp 2009, Vol. 45, No. 2, 453–476

DOI: 10.1214/08-AIHP169

© Association des Publications de l’Institut Henri Poincaré, 2009

Moderate deviations for stationary sequences of bounded random variables

Jérôme Dedecker

a

, Florence Merlevède

b

, Magda Peligrad

c,1

and Sergey Utev

d

aUniversité Paris 6, LSTA, 175 rue du Chevaleret, 75013 Paris, France

bUniversité Paris Est-Marne la Vallée, LAMA and CNRS UMR 8050, 5 Boulevard Descartes, Cité Descartes-Champs sur Marne, 77 454 Marne La Vallée Cedex 2, France.

cDepartment of Mathematical Sciences, University of Cincinnati, P.O. Box 210025, Cincinnati, OH 45221-0025, USA dSchool of Mathematical Sciences, University of Nottingham, Nottingham, NG7 2RD, UK

Received 9 November 2007; accepted 3 March 2008

Abstract. In this paper we derive the moderate deviation principle for stationary sequences of bounded random variables under martingale-type conditions. Applications to functions ofφ-mixing sequences, contracting Markov chains, expanding maps of the interval, and symmetric random walks on the circle are given.

Résumé. Dans cet article, nous établissons un principe de déviation modérée pour des suites stationnaires de variables aléatoires bornées sous différentes conditions projectives. Nous appliquons ces résultats aux suitesφ-mélangeantes, à certaines chaînes de Markov contractantes, aux transformations uniformément dilatantes de l’intervalle, ainsi qu’à la marche aléatoire symétrique sur le cercle.

MSC:60F10; 60G10

Keywords:Moderate deviations; Martingale approximation; Stationary processes

1. Introduction

For the stationary sequence(Xi)iZof centered random variables, define the partial sums and the normalized partial sums process by

Sn= n

j=1

Xj and Wn(t )=n1/2

[nt]

i=1

Xi.

In this paper we are concerned with the moderate deviation principle for the normalized partial sums processWn, considered as an element ofD([0,1])(functions on[0,1]with left-hand side limits and continuous from the right), equipped with the Skorohod topology (see Section 14 in [2]) for the description of the topology onD([0,1])). More exactly, we say that the family of random variables{Wn, n >0}satisfies the Moderate Deviation Principle (MDP) in D[0,1]with speedan→0 and good rate functionI (·), if the level sets{x, I (x)α}are compact for allα <∞, and for all Borel sets

− inf

tΓ0

I (t )≤lim inf

n anlogP

anWnΓ

≤lim sup

n

anlogP

anWnΓ

≤ −inf

t∈ ¯Γ

I (t ). (1)

1Supported in part by a Charles Phelps Taft Memorial Fund grant and NSA Grant H98230-07-1-0016.

(2)

The moderate deviation principle is an intermediate behavior between the central limit theorem (an=a) and large deviations (an=a/n). Usually, MDP has a simpler rate function, inherited from the approximated Gaussian process, and holds for a larger class of dependent random variables than the large deviation principle.

De Acosta and Chen [5] used the renewal theory to derive the MDP for bounded functionals of geometrically ergodic stationary Markov chains. Puhalskii [21] and Dembo [10] applied the stochastic exponential to prove the MDP for martingales. Starting from the martingale case and using the so-called coboundary decomposition due to Gordin [18] (Xk=Mk+ZkZk+1, whereMk is a stationary martingale difference), Gao [17] and Djellout [15] obtained the MDP forφ-mixing sequences with summable mixing rate. In the context of Markov chains, the coboundary decomposition is known as the Poisson equation. Starting from this equation, Delyon, Juditsky and Liptser [9] proved the MDP forn1/2n

k=1H (Yk), whereHis a Lipschitz function, andYn=F (Yn1, ξn), whereF satisfies|F (x, z)F (y, t )| ≤κ|xy| +L|zt|withκ <1, andn)n1is an iid sequence of random variables independent ofY0. In their paper, the random variables are not assumed to be bounded: the authors only assume that there exists a positiveδsuch thatE(eδ|ξ1|) <∞. They strongly used the Markov structure to derive some appropriate properties for the coboundary (see their Lemma 4.2).

In this paper we propose a modification of the martingale approximation approach that allows to avoid the cobound- ary decomposition and thus to enlarge the class of dependent sequences known to satisfy the moderate deviation principle. Recent or new exponential inequalities are applied to justify the martingale approximation. The conditions involved in our results are well adapted to a large variety of examples, including regular functionals of linear processes, expanding maps of the interval and symmetric random walks on the circle.

The paper is organized as follows: In Section 2 we state the main results. A discussion of the conditions, clari- fications, and some simple examples and extensions follow. Section 3 describes the applications, while Section 4 is dedicated to the proofs. Several technical lemmas are proved in the Appendix.

2. Results

From now on, we assume that the stationary sequence (Xi)iZ is given by Xi =X0Ti, whereT:ΩΩ is a bijective bimeasurable transformation preserving the probabilityP on (Ω,A). For a subfield F0 satisfying F0T1(F0), letFi=Ti(F0). ByXwe denote theL-norm, that is the smallestusuch thatP(|X|> u)=0.

Our first theorem and its corollary treat the so-called adapted case,X0beingF0-measurable and so the sequence (Xi)iZis adapted to the filtration(Fi)iZ.

Theorem 1. Assume thatX0<and thatX0isF0-measurable.In addition,assume that

n=1

n3/2E(Sn|F0)

<, (2)

and that there existsσ2≥0with

nlim→∞n1E Sn2|F0

σ2

=0. (3)

Then,for all positive sequencesanwithan→0andnan→ ∞,the normalized partial sums processesWn(·)satisfy (1)with the good rate functionIσ(·)defined by

Iσ(h)= 1 2σ2

1

0

h(u)2

du (4)

if simultaneouslyσ >0,h(0)=0andhis absolutely continuous,andIσ(h)= ∞otherwise.

The following corollary gives simplified conditions for the MDP principle, which will be verified in several exam- ples later on.

(3)

Corollary 2. Assume thatX0<and thatX0isF0-measurable.In addition,assume that

n=1

n1/2E(Xn|F0)

<, (5)

and that for alli, j≥1,

nlim→∞E(XiXj|Fn)E(XiXj)

=0. (6)

Then the conclusion of Theorem1holds withσ2=

kZE(X0Xk).

The next theorem allows to deal with non-adapted sequences and it provides additional applications. LetF−∞=

n0F−nandF= kZFk.

Theorem 3. Assume thatX0<∞,E(X0|F−∞)=0almost surely,andX0isF-measurable.Define the pro- jection operators byPj(X)=E(X|Fj)E(X|Fj1).Suppose that(6)holds and that

jZ

P0(Xj)

<. (7)

Then the conclusion of Theorem1holds withσ2=

kZE(X0Xk).

2.1. Simple examples,comments and extensions

Comment 4 (φ-mixing sequences). Recall that ifY is a random variable with values in a Polish spaceY and ifM is aσ-field,theφ-mixing coefficient betweenMandσ (Y )is defined by

φ

M, σ (Y )

= sup

A∈B(Y)

PY|M(A)PY(A)

. (8)

For the sequence(Xi)iZand positive integerm,letφm(n)=supim>···>i1≥nφ (M0, σ (Xi1, . . . , Xim))and letφ (k)= φ(k)=limm→∞φm(k) be the usual φ-mixing coefficient. It follows from Corollary 2 that if the variables are bounded,the conclusion of Theorem1holds as soon as

k>0

k1/2φ1(k) <and lim

k→∞φ2(k)=0. (9)

The condition(9) improves on the one imposed by Gao[17],that is

k>0φ (k) <∞,to get the MDP for bounded random variables(see his Theorem1.2).

Comment 5 (Application to the functional LIL). Since the variables are bounded,under the assumptions of Theo- rem1or of Theorem3,the MDP also holds inC[0,1]for the Donsker process

Dn(t )=Wn(t )+n1/2

nt− [nt] X[nt]+1.

Hence,ifσ2>0,it follows from the proof of Theorem1.4.1in[14],that the process2log logn1/2

Dn(t ): t∈ [0,1]

(10) satisfies the functional law of the iterated logarithm.To be more precise,ifS denotes the subset ofC[0,1]consisting of all absolutely continuous functions with respect to the Lebesgue measure such thath(0)=0and1

0(h(t ))2dt≤1, then the process defined in(10)is relatively compact with a.s.limit setS.In the case of bounded random variables, we then get new criteria to derive the functional LIL.In particular,the functional LIL holds forφ-mixing bounded random variables satisfying(9).

(4)

Comment 6 (Linear processes). Let(ci)iZbe a sequence of real numbers in1(Z)(absolutely summable).Define Xk=

iZciεki where(εk)kZis a strictly stationary sequence satisfying(6)and(7).Then,so does the sequence (Xk)kZ,and the conclusion of Theorem3holds.In particular,the result applies ifε0isF0-measurable,E(ε1|F0)=0 and

nlim→∞E ε20|F−n

E

ε02=0.

Comment 7 (Non-mixing in the ergodic sense example). The following simple example shows that Theorem1 is applicable to non-mixing in the ergodic theoretical sense sequences.Moreover it covers a strictly larger class of examples than its Corollary2.For allkZ,letQk+1= −Qk,whereP(Q0= ±1)=1/2andXk=Qk+Yk,where (Yk)kZis an iid sequence of zero mean and bounded random variables,independent ofQ0.We can easily check that all the conditions of Theorem1hold while the conditions of Corollary2are not satisfied.

Comment 8 (Stationary ergodic martingales that do not satisfy MDP). Let Yk be the stationary discrete Markov chain with the state space N and the transition kernel given by P(Y1=j −1|Y0=j )=1 for all j ≥1 and P(Y1=j|Y0=0)=P(τ=j )for jNwithE(τ ) <andP(τ =1) >0which implies that(Yk)is ergodic.Let Xk=ξkI(Yk=0)where(ξk)is an iid sequence independent of(Yk)and such thatP(ξk= ±1)=1/2.ThenXk is a sta- tionary ergodic martingale difference which is also a bounded function of an ergodic Markov chain.Straightforward computations show that ifτ does not have a finite exponential moment then there exists a positive sequencean→0 withnan→ ∞for which(1)does not hold.Thus the MDP principle is not true in general for the stationary sequences satisfying(5)without a certain form of condition(3).A similar example was suggested in[15],Remark2.6.

Comment 9 (On Var(Sn) and Theorem 1). Note that if

n=1n3/2E(Sn|F0)2<∞, then, by Peligrad and Utev[19]

nlim→∞

Var(Sn)

n =σ2=E X21

+ j=0

2jE

S2j(S2j+1S2j) .

On the other hand,we shall prove later on that condition(2) along with (6) are sufficient for the validity of (3).

Therefore the conclusion of Theorem1holds under(2)and(6)withσ2identified in this remark.

Comment 10 (Sequences that are not strictly stationary). The proof of Theorem3is based on the exponential in- equality from Lemma22,that was established without stationarity assumption.Therefore,Theorem3admits various extensions to non-stationary sequences.The following slight generalization is motivated by the fixed design regres- sion problemZk=θ qk+Xk,where the fixed design points are of the formqk=1/g(k/n),the error processXk is a stationary sequence and we analyze the error of the estimatorθˆ=n1n

k=1Zkg(k/n).If{Xi}iZsatisfies the conditions of Theorem3,and ifgis a Lipschitz function,then the processWn= {n1/2[nt]

i=1g(i/n)Xi, t ∈ [0,1]}

satisfies(1)with the good rate functionJ (·)defined by J (h)= 1

2 1

0

h(u) g(u)

2

du, whereσ2=

kZ

E(X0Xk).

The proof of this result is omitted.It can be done by following the proof of Theorem3.To be more precise,we start by proving the MDP for the processWn(t )=n1/2[nv1(t )]

i=1 g(i/n)Xi wherev(t )=σ2t

0g2(x)dx.ForWn(·),the rate function isIσ(·)as in Theorem1.To go back to the processWn(·),use the change-of-timeWn=Wnv.

3. Applications

In this section we present applications to functions of φ-mixing processes, contracting Markov chains, expanding maps of the interval and symmetric random walks on the circle. The proofs are given in Section 4.

(5)

3.1. Functions ofφ-mixing sequences

In this section, we are partly motivated by Djellout et al. [16], Theorem 2.7, who have proved the MDP for Xk=f (Yk, . . . , Yk)E

f (Yk, . . . , Yk)

, whereYk=

iZ

ciεki. (11)

In their Theorem 2.7, Djellout et al. [16] assume that:

(i) i)iZis an iid sequence;

(ii) (condition onci) the spectral density ofYk is continuous on[−π,π[;

(iii) (condition onε0)ε0satisfies the so-called LSI condition, which implies thatE(exp(δε20)) <∞for some posi- tiveδ, and that the distributionε0has an absolutely continuous component with respect to the Lebesgue measure with a strictly positive density on the support ofμ[see their condition (2.1)];

(iv) (condition onf) the functionsif are Lipschitz fori=0, . . . , .

By applying our main results, we derive the Propositions 11 and 12. In the case whereXk is given by (11), the Proposition 11 will allow us to obtain the MDP for a large class of functions. However, we require a stronger condition than (ii), that is we assume that the sequence(ci)iZis in1(Z), and instead of (iii), we suppose thatε0takes its values in some compact interval[a, b](this assumption cannot be compared to the LSI condition (iii)). Our method allows to link the regularity off to the behavior of the coefficients(ci)iZ(in that case, the condition (16) given below means that

iZwj(2(ba)|ci|) <∞for anyj=0, . . . , , wherewjis the modulus of continuity off with respect to the jth coordinate). In addition, our innovations may be dependent: more precisely,i)i∈Zis assumed to be a stationary φ-mixing sequence.

We now describe our general results. Leti)iZ=0Ti)iZ be a stationary sequence ofφ-mixing random variables with values in a subsetA of a Polish space X. Starting from the definition (8), we denote byφε(n)the coefficientφε(n)=φ (σ (εi, i≤0), σ (εi, in)).

Our first result is for non-adapted sequences, that is satisfying the representation (12). LetH be a function from AZtoRsatisfying the condition

C(A): for anyx, yinAZ, H (x)H (y)

iZ

Δi1xi=yi, where

iZ

Δi <.

Define the stationary sequenceXk=X0Tkby Xk=H

ki)iZ

E H

ki)iZ

. (12)

Note thatXkis bounded in view ofC(A).

Proposition 11. Let(Xk)kZbe defined by(12),for a functionH satisfyingC(A).If

k>0φε(k)is finite,then the conclusion of Theorem1holds withσ2=

kZE(X0Xk).

For adapted sequences, that is satisfying the representation (13), we can assume thatH satisfies another type of condition. LetHbe a function fromANtoRsatisfying the condition

C(A): for anyi≥0, sup

xAN,yAN

H (x)H

x(i)yRi, whereRi decreases to 0,

the sequencex(i)ybeing defined by(x(i)y)j=xjforj < iand(x(i)y)j=yjforji. Define the stationary sequence Xk=X0Tkby

Xk=H

ki)iN

E H

ki)iN

. (13)

(6)

Proposition 12. Let(Xk)kZbe defined by(13),for a functionH satisfyingC(A).If

=1

R

k

φε(k)

k <∞, (14)

then the conclusion of Theorem1holds withσ2=

kZE(X0Xk).In particular,the condition(14)holds as soon as

1.

k>0φε(k) <and

k>0k1/2Rk<∞.

2.

k>0Rk<and

k>0k1/2φε(k) <∞.

Application to functions of linear processes

Assume thatεi takes its values in a compact intervalA= [a, b]ofR, and let(ci)iZbe a sequence of real numbers in 1(Z). Letm=infxAZ

iZcixi andM=supxAZ

iZcixi. For a functionf from[m, M]ZtoR, letwi be the modulus of continuity off with respect to theith coordinate, that is

wi(h)= sup

x∈[m,M]Z, t∈[m,M],|xi−t|≤h

f (x)f x(i,t ),

the sequencex(i,t )being defined byxj(i,t )=xj forj =iandxi(i,t )=t. Assume that for anyx, yin[m, M]Z, f (x)f (y)

iZ

wi

|xiyi|

<. Define the random variablesYk=

iZciεk−i, and let Xk=f

(Yki)iZ

E f

(Yki)iZ

(15) (note that (15) is a generalization of (11)). Clearly,Xk may be written as in (12), for a functionH fromAZ toR.

Moreover,H satisfiesC(A)withΔi

Zw(2(ba)|ci|)provided that

i∈Z

Z

w

2(b−a)|ci|

<. (16)

From Proposition 11, if

k>0φε(k) <∞and if (16) holds, then the conclusion of Theorem 1 holds. In partic- ular, the condition (16) holds as soon as there exist(bi)iZ in1(Z)andαin]0,1]such thatw(h)b|h|α and

iZ|ci|α<∞. Two simple examples of such functions are:

1. f (x)=

iZgi(xi)for somegi such that|gi(x)gi(y)| ≤bi|xy|αfor anyx, yin[m, M].

2. f (x)=qi=phi(xi)for somehisuch that|hi(x)hi(y)| ≤Ki|xy|α for anyx, yin[m, M].

Now, assume thatci=0 fori <0, so thatYk=

i0ciεki. Iff is in fact a function ofx throughx0only, we simply denote byw=w0its modulus of continuity over[m, M]. In that caseXk=f (Yk)E(Yk)may be written as in (13) for a functionH satisfying C(A)with Riw(2|ba|

ki|ck|). From item 1 of Proposition 12, if

k>0φε(k) <∞and if

n≥1

n1/2w

2|b−a|

kn

|ck|

<∞, (17)

then the conclusion of Theorem 1 holds. In particular, if|ci| ≤i for someC >0 andρ∈ ]0,1[, the condition (17) holds as soon as:

1

0

w(t ) t

|logt|dt <∞.

Note that this condition is satisfied as soon asw(t )D|log(t )|γ for someD >0 and someγ >1/2. In particular, it is satisfied iff isα-Hölder for someα∈ ]0,1].

(7)

3.2. Contracting Markov chains

Let (Yn)n0 be a stationary Markov chain of bounded random variables with invariant measure μ and transition kernel K. Denote by · the essential supremum norm with respect to μ. Let Λ1 be the set of 1-Lipschitz functions. Assume that the chain satisfies the two following conditions:

there existC >0 andρ∈ ]0,1[such that sup

gΛ1

Kn(g)μ(g)

n, (18)

for anyf, gΛ1and anym≥0 lim

n→∞Kn

f Km(g)

μ

f Km(g)∞,μ=0. (19)

We shall see in the next proposition that if (18) and (19) are satisfied, then the MDP holds inD[0,1]for the sequence

Xn=f (Yn)μ(f ) (20)

as soon as the functionf belongs to the classLdefined below.

Definition 13. LetLbe the class of functionsf fromRtoRsuch that|f (x)f (y)| ≤c(|xy|),for some concave and non-decreasing functioncsatisfying

1

0

c(t ) t

|logt|dt <∞. (21)

Note that (21) holds ifc(t )D|log(t )|γ for someD >0 and someγ >1/2. In particular,Lcontains the class of functions from[0,1]toRwhich areα-Hölder for someα∈ ]0,1].

Proposition 14. Assume that the stationary Markov chain(Yn)n0 satisfies (18)and (19),and let Xn be defined by(20).Iff belongs toL,then the conclusion of Theorem1holds with

σ2=σ2(f )=μ

fμ(f )2 +2

n>0

μ

Kn(f )·

fμ(f ) .

The proof of this proposition is based on the following lemma which has interest in itself.

Lemma 15. Letun=supgΛ1Kn(g)μ(g).Let f be a function fromRtoRsuch that|f (x)f (y)| ≤ c(|xy|)for some concave and non-decreasing functionc.Then

Kn(f )μ(f )

c(un).

Remark 16. Ifunnfor aC >0andρ∈ ]0,1[,and ifc(t )D|log(t )|γ forD >0andγ >0,then Kn(f )μ(f )

=O nγ

.

We now give two conditions under which (18) and (19) hold. Let [a, b]be a compact interval in which lies the support of μ. For a Lipschitz function f, let Lip(f )=supx,y∈[a,b]|f (x)f (y)|/|xy|. The chain is said to be Lipschitz contracting if there existκ >0 andρ∈ ]0,1[such that

Lip Kn(f )

κρnLip(f ). (22)

Let BV be the class of bounded variation functions from [a, b] toR. For anyfBV, denote bydf the total variation norm of the measure df: df =sup{

gdf,g≤1}. The chain is said to be to beBV-contracting if there existκ >0 andρ∈ [0,1[such that

dKn(f )κρndf. (23)

(8)

It is easy to see that if either (22) or (23) holds, then (18) and (19) are satisfied (to see that the condition (23) implies (19), it suffices to note that it implies the same property for twoBV functionsf, g (see (52)), and that any Lipshitz function from[a, b]toRis aBV functions).

Application to iterated random functions

The stationary bounded Markov chain(Yn)n0with transition kernelKis one-step Lipschitz contracting if there exists ρ∈ ]0,1[such that

Lip K(f )

ρLip(f ).

Note that ifK is one-step Lipschitz contracting then (22) obviously holds withκ=1. The one-step contraction is a very restrictive assumption. However, it is satisfied ifYn=F (Yn1, εn)for some iid sequencei)i>0independent ofY0, and some functionF such that

F (x, ε1)F (y, ε1)

1ρ|xy| for anyx, yinR. (24)

Remark 17. Under a more restrictive condition onF than(24),namely

F (x, z)F (y, t )ρ|xy| +L|zt|, (25)

Delyon et al. [9]have proved the MDP forXn=f (Yn)μ(f )whenf is a Lipschitz function.In their paper,the chain is not assumed to be bounded.It is only assumed that E(eδε1) <for someδ >0,which implies the same property forX1(for a smallerδ)by using the inequality(25).

3.2.1. Application to expanding maps

LetT be a map from[0,1]to[0,1]preserving a probabilityμon[0,1], and let Xk=fTnk+1μ(f ), Wn(t )=Wn(f, t )=n1/2

[nt]

i=1

fTni+1μ(f ) .

Define the Perron–Frobenius operatorKfromL2([0,1], μ)toL2([0,1], μ)via the equality 1

0

(Kh)(x)f (x)μ(dx)= 1

0

h(x)(fT )(x)μ(dx). (26)

The mapT is said to beBV-contracting if its Perron–Frobenius operator isBV-contracting, that is satisfies (23). As a consequence of Proposition 14, the following corollary holds.

Corollary 18. IfT isBV-contracting,and iff belongs toBVL,then the conclusion of Theorem1holds with σ2=σ2(f )=μ

fμ(f )2 +2

n>0

μ

fTn·

fμ(f ) .

Let us present a large class ofBV-contracting maps. We shall say thatT is uniformly expanding if it belongs to the classCdefined in [4], Section 2.1, p. 11. Recall that ifT is uniformly expanding, then there exists a probability measureμon[0,1], whose densityfμwith respect to the Lebesgue measure is a bounded variation function, and such thatμis invariant byT. Consider now the more restrictive conditions:

(a) T is uniformly expanding.

(b) The invariant measureμis unique and(T , μ)is mixing in the ergodic-theoretic sense.

(c) 1

fμ1fμ>0is a bounded variation function.

(9)

Starting from Proposition 4.11 in [4], one can prove that ifT satisfies the assumptions (a), (b) and (c) above, then it is BV contracting (see for instance [7], Section 6.3). Some well-known examples of maps satisfying the conditions (a), (b) and (c) are:

1. T (x)=βx− [βx]forβ >1. These maps are calledβ-transformations.

2. I is the finite union of disjoint intervals(Ik)1kn, andT (x)=akx+bkonIk, with|ak|>1.

3. T (x)=a(x1−1)− [a(x1−1)]for somea >0. Fora=1, this transformation is known as the Gauss map.

Remark 19. The case wheref (x)=x(that isXn=Tnμ(T ))has already been considered by Dembo and Zeitouni [11].However,in this paper,the assumptions onT are more restrictive than the assumptions(a), (b)and(c)above.

In particular,they assume that there is a finite partition(Ij)1jm of[0,1]on which T restricted toIk isC1and infxIk|T(x)|>1,so that their result does not cover the case of the Gauss map(Item3above).

3.3. Symmetric random walk on the circle LetKbe the Markov kernel defined by

Kf (x)=1 2

f (x+a)+f (xa)

on the torusR/Z, withairrational in[0,1]. The Lebesgue–Haar measuremis the unique probability which is invariant byK. Let(ξi)iZbe the stationary Markov chain with transition kernelKand invariant distributionm. Let

Xk=f (ξk)m(f ), Wn(t )=Wn(f, t )=n1/2

[nt]

i=1

f (ξi)m(f )

. (27)

From Derriennic and Lin [13], Section 2, we know that the central limit theorem holds forn1/2Wn(f,1)as soon as the series of covariances

σ2(f )=m

fm(f )2 +2

n>0

m f Kn

fm(f )

(28)

is convergent, and that the limiting distribution isN(0, σ2(f )). In fact the convergence of the series in (28) is equiv- alent to

kZ

| ˆf (k)|2

d(ka,Z)2<, (29)

wheref (k)ˆ are the Fourier coefficients off. Hence, for any irrational numbera, the criterion (29) gives a class of functionfsatisfying the central limit theorem, which depends on the sequence(d(ka,Z))kZ. Note that a functionf such that

lim inf

k→∞ kf (k)ˆ >0, (30)

does not satisfy (29) for any irrational numbera. Indeed, it is well known from the theory of continued fraction that ifpn/qnis thenth convergent ofa, then|pnqna|< qn1, so that d(ka,Z) < k1for an infinite number of positive integersk. Hence, if (30) holds, then| ˆf (k)|/d(ka,Z)does not even tend to zero asktends to infinity.

Our aim in this section is to give conditions onf and on the properties of the irrational numberaensuring that the MDP holds inD[0,1].

ais said to be badly approximable by rationals if for any positiveε,

(31) the inequality d(ka,Z) <|k|1εhas only finitely many solutions forkZ.

(10)

From Roth’s theorem the algebraic numbers are badly approximable (cf. Schmidt [22]). Note also that the set of badly approximable numbers in[0,1]has Lebesgue measure 1.

In Section 5.3 of [8], it is proved that the condition (29) (and hence the central limit theorem forn1/2Wn(f,1)) holds for any badly approximable numbera as soon as

sup

k=0

|k|1+εf (k)ˆ <∞ for some positiveε. (32)

Note that, in view of (30), one cannot takeε=0 in the condition (32).

In fact, for badly approximable numbers, the condition (32) implies also the MDP inD[0,1]:

Proposition 20. Suppose thatais badly approximable by rationals,i.e.satisfies(31).If the functionf satisfies(32), then the conclusion of Theorem1holds withσ2=σ2(f ).

Note that, under the same conditions, the process{Wn(f, t ), t∈ [0,1]}satisfies the weak invariance principle in D[0,1]. Indeed, to prove Proposition 20, we show that the conditions of Corollary 2 are satisfied, but these conditions imply the weak invariance principle (see for instance [19]). From Comment 5, we also infer that the Donsker process defined in (10) satisfies the functional law of the iterated logarithm.

4. Proofs

Since the proofs of our results are mainly based on some exponential bounds for the deviation probability of the maximum of the partial sums for dependent variables, we present these inequalities, which have interest in themselves.

4.1. Exponential bounds for dependent variables

We state first the exponential bound from Proposition 2 in [20] that we are going to use in the proof of the main theorem.

Lemma 21. Let(Xi)iZbe a stationary sequence of random variables adapted to the filtration(Fi)iZ.Then P

1maxin|Si| ≥t ≤4√

e exp

t2/2n

X1+80 n

j=1

j3/2E(Sj|F0)

2 .

In the next lemma, we bound the maximal exponential moment of the stationary sequence by using the projective criteria.

Lemma 22. Let{Yk}kZbe a sequence of random variables such that for allj,E(Yj|F−∞)=0almost surely andYj isF-measurable.Define the projection operators byPj(X)=E(X|Fj)E(X|Fj−1).Assume that

Pkj(Yk)

pj and D:=

j=−∞

pj<. Let{gk, kN}be a sequence of numbers and define,

Sk= k

i=1

giYi, Mk= max

1jkSj, G2n= n

i=1

gi2.

Then,

Eexp(t Mn)≤4 exp 1

2G2nD2t2

.

(11)

In particular, P

1maxkn|Sk| ≥x

≤8 exp

x2 2G2nD2

.

Proof. Start with the decomposition Yk=

j=−∞

Pkj(Yk)= j=−∞

bjPkj(Yk) bj ,

wherebj=pj/DPkj(Yk)/D, for anyjZ. Then Sm=

j=−∞

bj m

k=1

gkPkj(Yk) bj .

Thus,

Mn

j=−∞

bj max

1mn

m

k=1

Pkj(gkYk)

bj =:

j=−∞

bjMn(j ),

whereMn(j )denotes max1mn

m

k=1gkPkj(Yk)/bj.

Since exp(x)is convex and non-decreasing andbj≥0 with

jZbj=1, Eexp(t Mn)Eexp

j=−∞

bjt Mn(j )

j=−∞

bjEexp t Mn(j )

.

Consider the martingale differenceUk=gkPkj(Yk)/bj. SinceZk=exp(t (U1+ · · · +Uk)/2)is a submartingale, Doob’s inequality yields

Eexp t Mn(j )

=E

1maxknZk2

≤4EZ2n=4Eexp

t (U1+ · · · +Un) .

Applying Azuma’s inequality to the right-hand side, and noting that Uk= |gk|Pkj(Yk)

bj ≤ |gk|D, we infer that

Eexp t Mn(j )

≤4 exp 1

2G2nD2t2

.

Since

jZbj=1, we obtain that Eexp(t Mn)

j∈Z

bj4 exp 1

2G2nD2t2

=4 exp 1

2G2nD2t2

.

Next, to derive the one-sided probability inequality we use the exponential bound witht=x/(G2nD2), so P(Mnx)Eexp(t Mn)exp(−t x)=4 exp

x2 2G2nD2

.

(12)

Finally, to derive the two-sided inequality we observe that the stationary sequence{−Yj}also satisfies the conditions

of the lemma. The proof is complete.

The next technical lemma provides an exponential bound for any random vector plus a correction in terms of conditional expectations (see also [23]).

Lemma 23. Let {Xi}1in be a vector of real random variables adapted to the filtration {Fn}n1. Denote B=sup1inXi.Then,for allδ >0andca natural number withcB/nδ/2,we have

P

1maxin

1 n

i

u=1

Xu

δ

≤2 exp

δ2n 64B2c

+P

sup

1i≤[n/c]

1 c

ic

j=(i1)c+1

E(Xj|F(i1)c)δ

4

. (33)

Proof. Letcbe a fixed integer andk= [n/c](where, as before,[x]denotes the integer part ofx). The initial step of the proof is to divide the variables in consecutive blocks of sizecand to average the variables in each block

Yi,c=1 c

ic

j=(i1)c+1

Xj, i≥1.

Then, for all 1≤ikwe construct the martingale, Mi,c=

i

j=1

Yj,cE(Yj,c|F(j1)c)

= i

j=1

Dj,c

and we use the decomposition P

1maxjn

1 n

j

u=1

Xuδ

P

1maxik

1 k

i

j=1

Yj,c

δcB n

P

1maxik

1 k

i

j=1

Yj,cδ

2

P

1maxik

1

k|Mi,c| ≥δ 4

+P

1maxik

1 k

i

j=1

E(Yj,c|F(j1)c)δ

4

P

1maxik

1

k|Mi,c| ≥δ 4

+P

1maxjk

E(Yj,c|F(j1)c)δ 4

.

Next, we apply Azuma’s inequality to the martingale part and obtain, P

1maxik|Mi,c| ≥δk 4

≤2 exp

δ2k2 32kB2

≤2 exp

δ2n 64cB2

which implies that P

1maxin

1 n

i

u=1

Xu

δ

≤2 exp

δ2n 64B2c

+P

1maxik

E(Yi,c|F(i1)c)δ 4

proving the lemma.

4.2. Some facts about the moderate deviation principle

This paragraph deals with some preparatory material. The following theorem is a result concerning the MDP for a triangular array of martingale differences sequences. It follows from Theorem 3.1 and Lemma 3.1 of [21], (see also [15], Proposition 1 and Lemma 2).

Références

Documents relatifs

ρ-mixing case (Proof of Application 1 of Section 1). 4-5] for additive functionals. Exten- sion to general MRW is straightforward. Then Application 1 of Section 1 follows from

Random walk, countable group, entropy, spectral radius, drift, volume growth, Poisson

In many random combinatorial problems, the distribution of the interesting statistic is the law of an empirical mean built on an independent and identically distributed (i.i.d.)

In Section 4, using coupling, we obtain heat kernel estimates, which is later used in Section 5 to show the uniqueness of the non-zero limiting velocity.. Throughout the paper,

We prove a large deviation principle in n, with explicit rate function, for the empirical subtree measures of multitype Galton–Watson trees conditioned to have exactly n vertices..

Chen, Moderate deviations for empirical measures of Markov chains: upper bounds, J.. Ney, A new approach to the limit theory of recurrent Markov

3 Research partially supported by a grant from the Israel Science Foundation, administered by the Israel Academy of Sciences... These large deviations are essential tools in the

Therefore we can consider the CA of an arbitrary normal sequence of measures and obtain the above results as a consequence of the corresponding theorems for