Berry-Esseen theorem and local limit theorem for non uniformly expanding maps

(1)

www.elsevier.com/locate/anihpb

Berry–Esseen theorem and local limit theorem for non uniformly expanding maps

Sébastien Gouëzel

Département de mathématiques et applications, École normale supérieure, 45, rue d’Ulm 75005 Paris, France Received 28 June 2004; accepted 24 September 2004

Available online 29 January 2005

Abstract

In Young towers with sufficiently small tails, the Birkhoff sums of Hölder continuous functions satisfy a central limit theorem with speedO(1/√

n ), and a local limit theorem. This implies the same results for many non uniformly expanding dynamical systems, namely those for which a tower with sufficiently fast returns can be constructed.

Résumé

Dans les tours de Young ayant des queues suffisamment petites, les sommes de Birkhoff des fonctions hölderiennes satisfont le théorème central limite avec vitesseO(1/√

n )et le théorème de la limite locale. Par conséquent, de nombreux systèmes dynamiques non uniformément dilatants satisfont les mêmes conclusions : il suffit de pouvoir construire une tour avec des retours à la base suffisamment rapides.

MSC: 37A30; 37A50; 37C30; 37E05; 47A56; 60F15

Keywords: Non uniformly expanding maps; Berry–Esseen theorem; Local limit theorem

1. Results 1.1. Introduction

LetT:X→X be a probability preserving transformation andf:X→R. The functionsf ◦T^k, fork∈N, are identically distributed random variables, and it is an important problem in ergodic theory to see whether they satisfy the same kind of limit theorems as independent random variables.

E-mail address: [email protected] (S. Gouëzel).

doi:10.1016/j.anihpb.2004.09.002

(2)

Many results are known whenT is uniformly expanding or uniformly hyperbolic (without or with singularities, in the Markov or non Markov case), andf is Hölder continuous. In this case, it is indeed often possible to construct a space of functions containingf on which the transfer operator associated toT has a spectral gap. Therefore, the spectral perturbation method, introduced by Nagaev in the case of Markov chains, makes it possible to mimic the probabilistic proofs on independent variables. In this way, it is possible to get distributional convergence (to normal laws or stable laws), and more subtle results such as the speed of convergence (also called the Berry- Esseen theorem) or the local limit theorem (see for example [30,17,8,3]). These results, in turn, have important consequences concerning the asymptotic behavior of the system [2,33].

On the other hand, when the system is not uniformly expanding or uniformly hyperbolic, it is not possible to use directly the aforementioned spectral method. Consequently, other methods have been devised to handle the distributional convergence of Birkhoff sums. Among many techniques, the most flexible one is probably the martingale argument of Gordin (see for example [23,26,10,11,36]). Some results have also been obtained on the speed in the central limit theorem, by direct estimates (see [22,27]). However, there is currently no result concerning the local limit theorem, which is not surprising since the proof of this theorem requires a heavy Fourier machinery, even in the probabilistic case, and is not easily accessible to elementary methods.

The aim of this article is to prove the local limit theorem and the Berry–Esseen theorem for Hölder functions in the setting of Young towers [35], where the decay of correlations is not exponential and the transfer operator has no spectral gap. The Young towers are abstract spaces which can be used to model many non uniformly expanding maps, for example the Pomeau–Manneville maps in dimension 1 studied by Liverani, Saussol and Vaienti [24], the Viana map (for which a tower is built in [5]), or the unimodal maps for which the critical point does not return too quickly close to itself [9]. Thus, all these maps also satisfy the local limit theorem, and the central limit theorem with speedO(1/√

n). These results also apply in non uniformly hyperbolic settings, with the techniques of [34].

The proof is spectral: it uses perturbations of transfer operators, as in [17], but applied to first return transfer operators associated to an induced map, as defined by Sarig in [32]. The method is related to [14], with a more systematic use of Banach algebra techniques.

1.2. Results in Young towers

A Young tower [35] is a probability space (X, m) with a partition (Bi,j)i∈I,j <ϕ_i of X by positive measure subsets, whereI is finite or countable andϕ_i∈N^∗, together with a nonsingular map T:X→X satisfying the following properties.

1. ∀i∈I,∀0j < ϕ_i−1,T is a measure preserving isomorphism betweenB_i,j andB_i,j₊₁. 2. For everyi∈I,T is an isomorphism betweenB_i,ϕ_i₋₁andB:=

k∈IB_k,0.

3. Letϕbe the function equal toϕ_ionB_i,0, whenceT^ϕis a function fromBto itself. Lets(x, y)be the separation time of the pointsxandy∈BunderT^ϕ, i.e.s(x, y)=inf{n| ∃i=j, (T^ϕ)ⁿ(x)∈Bi,0, (T^ϕ)ⁿ(y)∈Bj,0}. AsT^ϕⁱ is an isomorphism betweenBi,0andB, it is possible to consider the inversegmof its jacobian with respect to the measure m. We assume that there exist constants β <1 and C >0 such that ∀x, y∈B_i,0,

|logg_m(x)−logg_m(y)|Cβ^s(x,y). 4. The mapT preserves the measurem.

5. The partition_∞

0 T⁻ⁿ((B_i,j))separates the points.

The notion of Young tower has been introduced by Young in [34,35] as a model for non uniformly expanding dynamical systems. The non uniformity is measured by the size of tails m{x ∈B|ϕ(x) > n}: if this quantity is very small, then most points enjoy some expansion before time n, when they first return to the basis. This expansion, in turn, is sufficient to study statistical properties of the system, including decay of correlations. Young has proved that, ifm[ϕ > n] =O(1/n^β)for someβ >1, then the correlations of sufficiently regular functions (see

(3)

the definition ofC_τ(X)below) decay likeO(1/n^β⁻¹). In particular, ifβ >2, these correlations are summable, and a martingale method can be used to prove that a central limit theorem holds.

We extend the separation timesto the whole tower, by settings(x, y)=0 ifx andy are not in the same set B_i,j, ands(x, y)=s(x, y)+1 otherwise, wherexandyare the next iterates ofxandyinB. For 0< τ <1, set

Cτ(X)=

f:X→R| ∃C >0, ∀x, y∈X, f (x)−f (y)Cτ^s(x,y) .

This space has a normfτ=inf{C| ∀x, y∈X,|f (x)−f (y)|Cτ^s(x,y)} + f_∞.

The following theorem is well known and can for example be proved using martingale techniques (see [35, Theorem 4]).

Theorem 1.1. Letτ <1. Assume thatm[ϕ > n] =O(1/n^β)withβ >2. Letf ∈C_τ(X)have a vanishing integral.

Then there existsσ²0 such that

√1 n

n−1

k=0

f ◦T^k→N(0, σ²).

Moreover,σ²=0 if and only iff is a coboundary, i.e. there exists a measurable functiongsuch thatf =g−g◦T almost everywhere.

The main results of this article are Theorems 1.2 and 1.3. To formulate the first one, we will need the following definition:

Definition. A mapf:X→Ris periodic if there existρ∈R,g:X→Rmeasurable,λ >0 andq:X→Z, such thatf =ρ+g−g◦T+λqalmost everywhere. Otherwise, it is aperiodic.

Theorem 1.2 (local limit theorem). Letτ <1. Assume thatm[ϕ > n] =O(1/n^β)withβ >2. Letf ∈C_τ(X)have a vanishing integral, and letσ²be given by Theorem 1.1.

Assume thatf is aperiodic. This implies in particularσ²>0. Then, for any bounded intervalJ⊂R, for any real sequencek_nwithk_n/√

n→κ∈R, for anyu∈C_τ(X), for anyv:X→Rmeasurable,

√n m

x∈X|Snf (x)∈J+kn+u(x)+v(Tⁿx)

→ |J|e⁻^κ²^/(2σ²⁾ σ√

2π .

The function on the right is the density ofN(0, σ²): this theorem (foru=v=0 andk_n=κ√

n) means that

m 1

√nS_nf ∈κ+ J

√n

∼P N(0, σ²)∈κ+ J

√n

.

Hence, it shows thatS_nf/√

nbehaves likeN(0, σ²)at the local level (contrary to Theorem 1.1 which deals with the global level). It is important thatf is aperiodic. Otherwise,f could be integer valued, and the theorem could not hold, e.g. fork_n=0,u=v=0 andJ= [1/3,2/3].

Forf:X→R, define a functionf_BonBby fB(x)=

ϕ(x)−1 k=0

f (T^kx). (1)

In the probabilistic case, the Berry–Esseen theorem, giving the speed of convergence in the central limit theorem, holds under anL³moment condition [12]. In the dynamical setting, we will need the same kind of hypothesis, but on the functionf_B. Note that, since|f_B|f_∞ϕandβ >2, we always havef_B∈L²(B).

(4)

Theorem 1.3 (speed in the central limit theorem). Letτ <1. Assume thatm[ϕ > n] =O(1/n^β)withβ >2. Let f ∈C_τ(X)have a vanishing integral, andσ²be given by Theorem 1.1.

Assume thatσ²>0, and that there exists 0< δ1 such that

|f_B|²1_|_f_B_|_>zdm=O(z⁻^δ)whenz→ ∞. If δ=1, assume also that

f_B³1_|_f_B_|_zdm=O(1). Then there existsC >0 such that∀n∈N^∗,∀a∈R, m

x 1

√nS_nf (x)a

−P

N(0, σ²)a C n^δ/2.

Whenf_B∈L^pfor some 2< p3, then the conditions of the theorem are satisfied forδ=p−2. In particular, when f_B∈L³, we obtain a convergence with speedO(1/√

n ), which is the usual Berry–Esseen theorem. Note also that, for anyf ∈Cτ(X), the conditions of the theorem are satisfied forδ=β−2 if 2< β <3, and forδ=1 if β >3. The formulation we have given is more precise than the usual Berry–Esseen theorem, in view of the applications, where anL^pcondition would not be optimal (see for example Theorem 1.5). In fact, the conditions of the theorem onf_Bcorrespond to necessary and sufficient conditions to get a central limit theorem with speed O(n⁻^δ/2)in the probabilistic (independent identically distributed) setting, as shown in [19, Theorem 3.4.1].

Remark. Using the same methods, it is possible to prove the same results in a more general setting, namely maps for which a first return map is Gibbs–Markov in the sense of [1]. For the sake of simplicity, we will only consider Young towers.

1.3. Applications 1.3.1. General setting

Let (X, d) be a locally compact separable metric space, endowed with a Borel probability measureµ, and T:X→X a nonsingular map for which µ is ergodic. Assume that there exist a bounded subsetB of X with µ(B) >0, a finite or countable partition (mod 0)(Bi)i∈IofB, withµ(Bi) >0, and integersϕi>0 such that:

1. ∀i∈I,T^ϕⁱ is an isomorphism betweenB_i andB.

2. ∃λ >1 such that,∀i∈I,∀x, y∈B_i,d(T^ϕⁱx, T^ϕⁱy)λd(x, y).

3. ∃C >0 such that,∀i∈I,∀x, y∈B_i,∀k < ϕ_i,d(T^kx, T^ky)Cd(T^ϕⁱx, T^ϕⁱy).

4. ∃θ >0 andD >0 such that,∀i∈I, the jacobiang_µdefined onB_i byg_µ(x)=dµ/d(µ◦T_|^ϕ_Bⁱ

i)satisfies: for all x, y∈B_i,|logg_µ(x)−logg_µ(y)|Dd(T^ϕⁱx, T^ϕⁱy)^θ.

Denote by ϕ the function on B equal to ϕ_i on each B_i. If µ{x|ϕ(x) > n}is summable, we can define a space X= {(y, j )|y∈B, j < ϕ(x)}, and a mapT:X→X byT(y, j )=(y, j +1)if j < ϕ(x)−1 and T(y, j )=(T^ϕ(y)(y),0)otherwise. Define alsoπ:X→Xbyπ(y, j )=T^j(y). Thenπ◦T=T ◦π.

Setµ=_∞

n=0T_∗ⁿ(µ|B∩ {ϕ > n}): it is a measure of finite mass onX, not necessarilyT-invariant. Young has proved in [35, Theorem 1] that there exists a unique invariant probability measuremonXwhich is absolutely continuous with respect toµ. It is ergodic, and(X, T, µ)is a Young tower in the sense of Section 1.2. The measurem=π_∗(m)isT-invariant, absolutely continuous and ergodic.

If f:X→Ris Hölder continuous, thenf:=f ◦π:X→Rbelongs to C_τ(X)for τ close enough to 1.

Moreover, the Birkhoff sumsn−1

k=0f◦T^kandn−1

k=0f◦T^khave the same distribution with respect respectively tomandm. Hence, Theorems 1.1, 1.2 and 1.3 on the functionfin the Young tower(X, T, m)imply the same results on the functionf in(X, T , m).

To apply these theorems, we have to check their assumptions. The conditionm[ϕ > n] =O(1/n^β)withβ >2 corresponds simply to the requirement

ϕi>n

µ(B_i)=O 1 n^β

for someβ >2.

(5)

To apply Theorem 1.2, we additionally have to check that the functionf is aperiodic for T, which can be complicated when the extensionXis not explicitly described. On the other hand, the aperiodicity off may be easier to check, using for example the information at the periodic points. In this case, the following abstract theorem ensures thatfis automatically aperiodic, whence we can apply Theorem 1.2.

Theorem 1.4. LetT:X→X be a probability preserving map on a probability space(X, m). Let(X, m)be a standard probability space, T:X→X an ergodic probability preserving map, and π:X→X a map with countable fibers, such thatm=π_∗(m)andT ◦π=π◦T. Letf:X→R. Then

• The functionf is a coboundary forT if and only the functionf ◦π is a coboundary forT.

• The functionf is aperiodic forT if and only if the functionf ◦πis aperiodic forT. 1.3.2. Examples

Recently, many maps have been shown to fit in the previous setting. For example, [5, Theorem 3] shows that the Alves–Viana map, given by

T:

S¹×R→S¹×R, (ω, x)→

16ω, a−x²+εsin(2π ω)

satisfies these assumptions (for anyβ >2) when 0 is preperiodic for the mapx→a−x², andεis small enough.

In fact, any map close enough toT in theC³-topology also satisfies them.

In the one-dimensional case, [9] shows that many unimodal maps of the interval also satisfy these hypotheses:

it is sufficient that the returns of the critical point close to itself occur at a slow enough rate.

Finally, we will discuss with more details the case of the Pomeau–Manneville maps, studied among many others by Liverani, Saussol and Vaienti [24]. They form an interesting class of applications, since the influence of the fixed point 0 becomes more and more important whenαincreases. The explicit formula (2) is not important, what matters is only the local behavior around the fixed point. Hence, all the following results can be extended to a much larger class of examples but, for the sake of simplicity, we will only consider the following maps.

Letα∈(0,1/2), and considerT:[0,1] → [0,1]given by T (x)=

x(1+2^αx^α) if 0x1/2,

2x−1 if 1/2< x1. (2)

This map has a parabolic fixed point at 0, and is expanding elsewhere. It has a unique absolutely continuous invariant probability measurem, whose density is Lipschitz on any interval of the form(ε,1][24, Lemma 2.3].

Theorem 1.5. Let 0< α <1/2, and letf:[0,1] →Rbe a Hölder function with vanishing integral, which cannot be written asg−g◦T. Thenf satisfies a central limit theorem with varianceσ²>0.

• Ifα <1/3, orf (0)=0 and there existsγ > α−1/3 such that|f (x)|Kx^γ, then there existsC >0 such that∀n∈N^∗,∀a∈R,

m

x 1

√nS_nf (x)a

−P

N(0, σ²)a C

√n.

• If 1/3< α <1/2,f (0)=0 and there existsγ >0 such that|f (x)|Kx^γ andδ:=_α₋¹_γ −2∈(0,1), then there existsC >0 such that∀n∈N^∗,∀a∈R,

m

x 1

√nSnf (x)a

−P

N(0, σ²)a C n^δ/2.

(6)

• If 1/3< α <1/2 andf (0)=0, then there existsC >0 such that∀n∈N^∗,∀a∈R, m

x 1

√nS_nf (x)a

−P

N(0, σ²)a C n^(1/2α)⁻¹.

Moreover, iff is aperiodic, it satisfies the local limit theorem.

Proof. Let x₀=1, andxn+1be the preimage ofxn in[0,1/2]. Letyn+1be the preimage ofxn in(1/2,1]: the intervals Bn=(yn+1, yn]form a partition ofB=(1/2,1]and, ifϕn=n, all the hypotheses of Section 1.3 are satisfied. Moreover,m(B_n)∼_n1/α^C+1 andx_n∼_n^D1/α for constantsC, D >0 [24]. In particular,m(ϕ > n)=O(1/n^β) forβ=1/α >2.

Let f be Hölder on[0,1]. If f (0)=0, thenf_B =nf (0)+o(n) onB_n. Otherwise, letγ >0 be such that

|f (x)|Kx^γ. Reducingγ if necessary, we can assume thatγ < α. Then it is easy to check that|f_B|Cn¹⁻^{γ /α} onB_n.

Using these estimates, we can check the integrability assumptions of Theorem 1.3 forδ=1 in the first case,

1

α−γ −2 in the second case, and_α¹−2 in the third case. Hence, Theorem 1.3 implies the desired estimates on the speed in the central limit theorem.

Finally, the local limit theorem is a direct consequence of Theorem 1.2. 2

The aperiodicity assumption is a priori not easy to check, since the periodicity equalityf=g−g◦T+ρ+λq is assumed to hold only almost everywhere. However, under suitable regularity assumptions onf, it is possible to prove that this equality holds everywhere (see e.g. [3] for locally constantf, [15] for Hölderf). For example, ifT is given by (2), thenf =log|T| −

log|T|is aperiodic.

In Section 2, we will prove Theorem 1.4, and show that it is sufficient to prove Theorems 1.2 and 1.3 in mixing Young towers (i.e., such that the return timesϕ_i satisfy gcd(ϕ_i)=1). The rest of paper is devoted to the proof of these theorems. In Section 3, we prove an abstract spectral result on perturbations of series of operators. In Section 4, we apply this result to first return transfer operators, to get the key result Theorem 4.6. We then use this estimate in the last two sections to prove respectively the local limit Theorem 1.2 and the Berry–Esseen Theorem 1.3.

2. Preliminary reductions 2.1. Proof of Theorem 1.4

Proof of the coboundary result. Iff is a coboundary, i.e.f =g−g◦T, thenf:=f ◦π can be written as f=g−g◦T, whereg=g◦π. However, the converse is not immediate: iff=g−g◦T, the functiong is a priori not constant on the fibersπ⁻¹(x), which prevents us from writingg=g◦π.

We use the following characterization of coboundaries: Let T be an endomorphism of a probability space (X, m). Then a measurable functionf onXcan be written asg−g◦T if and only if

∀ε >0,∃C >0, ∀n1, m

x∈X|S_nf (x)C

ε. (3)

This characterization, due to Schmidt, is proved for example in [4].

Iffis a coboundary, then (3) is satisfied byfinX, whence it is also satisfied byf inX(since this condition only involves distributions). Thus,f can be written asg−g◦T. 2

Proof of the aperiodicity result. Iff is periodic onX, i.e.f =ρ+g−g◦T +λqwhereq is integer-valued, thenf◦π=ρ+(g◦π )−(g◦π )◦T+λ(q◦π ), i.e.f ◦πis periodic. On the other hand, iff◦π=ρ+g− g◦T+λq, it is not necessarily possible to write directlyg=g◦π. The proof of the periodicity off will use

(7)

ideas of [4]. We can assume for example thatλ=2π. Replacingmby one of its ergodic components, we can also assume thatmis ergodic.

Since the projection π has countable fibers, there exists a measurable subsetA ofX such that π is an isomorphism betweenAandX, andm(A) >0. Define a functiong˜ onXbyg(x)˜ =g(x), wherex is the unique preimage ofx inA. Replacingf byf − ˜g+ ˜g◦T −ρ, andgbyg− ˜g◦π, we can assume without loss of generality thatg=0 onAandρ=0.

Forx∈X, letW_n(x)be the measure onS¹given by Wn(x)=1

n n

k=1

δ(e^iS^k^{f (x)})

whereδ(y)is the Dirac mass aty. Foru∈C⁰(S¹), it is possible by compactness to find a subsequencenksuch that

S¹

udWn_k(x)→L(u)(x) weak ∗ inL^∞(X).

It is possible to obtain this convergence for a dense countable set of functions inC⁰(S¹), by a diagonal argument.

By passing to a further subsequence, it is also possible to guarantee that_n¹n k=1

S¹udWn_k(x)→L(u)(x)on a set Y⊂Xwithm(Y )=1, by Komlos’ Theorem [21]. By density, we get the same convergence for anyu∈C⁰(S¹).

Forx∈Y, the mapu∈C⁰(S¹)→L(u)(x)∈Ris a nonnegative continuous linear functional sending 1 to 1, thus given by a probability measureP_x. Moreover, these measures satisfy P_{T x}(S)=P_x(e^{if (x)}S) for any Borel subsetSofS¹, sinceW_n(T x)(S)=W_n(x)(e^{if (x)}S)±²_n.

For someε >0, we will prove that m

x|P_x {1}

ε

ε. (4)

Ifx∈A∩T⁻^k(A), then e^iS^k^f^◦^π(x⁾=e^i(g^(x⁾⁻^g^◦^T^k^(x⁾⁾=1, i.e.S_kf ◦π(x)∈2πZ. Hence,

X

W_n(x) {1}

dm(x)=1 n

n

k=1

X

1

S_kf (x)∈2πZ

dm(x)1 n

n

k=1

X

1(A∩T⁻^kA)dm(x)

=

X

1_A·

1 n

n

k=1

1_A◦T^k

dm(x)→m(A)²>0 by Birkhoff Theorem. Thus, for large enoughn,

XWn(x)({1})3ε >0, whence (_n¹_n

k=1Wn_k(x))({1})2ε for large enoughn. Since(_n¹_n

k=1W_n_k(x))({1})1, we get m

x

1 n

n

k=1

W_n_k(x)

{1}

ε

ε.

Thus, the setC= {x|lim sup(¹_nn

k=1Wn_k(x))({1})ε}satisfiesm(C)ε. Finally,Px({1})εonC, and this proves (4).

Define a measure µ onX×S¹, by µ(U×V )=

UP_x(V )dm(x). Thenµ is invariant under the action of T_f:(x, y)→(T (x),e⁻^{if (x)}y), and [4] proves that, for almost every ergodic componentP of µ, there exists a compact subgroupHofS¹and a mapω:X→S¹such that, denoting bym_H the Haar measure ofH,

∀U×V ⊂X×S¹, P (U×V )=

U

m_H ω(x)V

dm(x).

Moreover, in this case, it is possible to write e^{if (x)}=^{ω(T x)}_ω(x) ψ (x), whereψtakes its values inH.

(8)

If, for all componentP ofµ, we hadH=S¹, thenP =m⊗Leb for allP, whenceµ=m⊗Leb. This is a contradiction, since µ(C× {1}) >0, while(m⊗Leb)(C× {1})=0. Thus, for some choice ofP,H=Z/kZ, whenceψ (x)^k=1, and e^{ikf (x)}=ω(T x)^k/ω(x)^k. Thus,f is periodic onX. 2

Remark. The proof only shows that the period of f onX divides the period off ◦π onX, not that they are equal. In fact, it is not hard to construct examples of Young towers where the two periods are different. Moreover, the result is not true without the assumption that the fibers are countable.

2.2. Reduction to the mixing case

In the proofs in Young towers, it is often useful to assume that the tower is mixing, i.e. gcd(ϕi)=1. This restriction may seem technical, but it is important (for example, without it, there is no decay of correlations any more). For limit theorems, however, it is irrelevant: Theorems 1.1, 1.2 and 1.3 for mixing towers imply the same results for general towers.

Proof. We assume that Theorems 1.1, 1.2 and 1.3 are true in any mixing Young tower. Let Xbe a non-mixing Young tower, with N =gcd(ϕ_i) >1, and let f ∈C_τ(X) be of vanishing integral. For k=0, . . . , N −1, set Z_k=

B_i,j for j ≡k modN. Then, for every k, (Z_k, T^N)is a mixing Young tower, to which we can apply Theorems 1.1, 1.2 and 1.3.

On Z_k, we consider the functionf_k given by_N₋₁

i=0 f (Tⁱx), i.e.(S_Nf )_|_Z_k. Then

Z_kf_k =

Xf =0. Theo- rem 1.1 applied tofkon(Zk, T^N)gives a constantσksuch that, onZk,

√1

sNS_sNf →N(0, σ_k²).

Writing an integernassN+rwithr < N, we get that √¹

nSnf→N(0, σ_k²)onZk. Finally, ifx∈Z0,Snf (x)and S_nf (T^kx)differ by at most 2kf_∞. Thus, √¹

nS_nf−^√¹_nS_nf ◦T^k tends to 0 in probability onZ₀, which shows thatσ_k=σ₀. Writingσ for this common number, we get that √¹

nS_nf →N(0, σ²)onX. Moreover, ifσ=0, then f₀is a coboundary forT^N, i.e.f₀=g−g◦T^Nwhereg:Z₀→Ris measurable. We extendgto the whole tower:

if(x, j )∈X, withj =sN+randr < N, setg(x, j )=g(x, sN )−_r₋₁

i=0f (x, sN+i). It is then easy to check thatf=g−g◦T. This proves Theorem 1.1.

For the local limit theorem, let us assume thatf is aperiodic, and takeu, vandk_n as in the assumptions of Theorem 1.2. We show that the functionsf_kare also aperiodic. Otherwise, for example,f₀=ρ+g−g◦T^N+λq, wheregandqare defined onZ₀. We extendgandqto the whole tower: for(x, j )∈Xwithj=sN+rand 0< r <

N, setg(x, j )=g(x, sN )−_r₋₁

0 (f (x, sN+i)−_N^ρ)+λq(x, sN ), andq(x, j )=0. Thenf =_N^ρ+g−g◦T+λq, which is a contradiction. Thus, all the functionsf_kare aperiodic. We can apply to them Theorem 1.2 in the mixing tower(Z_k, T^N), and get that

√sN m

x∈Z_k|S_sNf (x)∈J+k_sN+u(x)+v(T^sNx)

→m(Z_k)|J|e⁻^κ²^/(2σ²⁾ σ√

2π .

Summing overk, we get the conclusion of Theorem 1.2 forf, and for the times of the formsN. For times of the formsN+rwith 0< r < N, we use the same result forf◦T^r,u−Srf, v◦T^r and the sequenceksN+r, and get that

√sN+r m

x∈X|S_sNf ◦T^r(x)∈J+k_sN₊_r+u(x)−S_rf (x)+v(T^sN⁺^rx)

→ |J|e⁻^κ²^/(2σ²⁾ σ√

2π . AsS_sNf ◦T^r(x)+S_rf (x)=S_sN₊_rf (x), this concludes the proof of Theorem 1.2.

(9)

Finally, the central limit theorem with speed is deduced from the same result on eachZ_k, for times of the form sN. We extend the result to arbitrary times: writingnassN+rwithr < N, we have|S_sN₊_rf−S_sNf|rf_∞. This introduces an error, of orderO(1/√

n )O(1/n^δ/2). 2

Remark. For this proof, it was important to have a strong version of the local limit theorem, involving functionsu andv.

Theorem 1.1 is proved in [35, Theorem 4]. The rest of the paper is devoted to the proof of Theorems 1.2 and 1.3 in mixing Young towers. From this point on,Xwill be a mixing Young tower, i.e. gcd(ϕ_i)=1.

3. An abstract result

If (a_n)_n_∈N and (b_n)_n_∈N are two sequences indexed by N, we denote by (a_n) (b_n) the sequence c_n = _n

k=0a_kb_n₋_k. If(a_n)_n_∈Zand(b_n)_n_∈Zare two summable sequences indexed byZ, we also define their convolution c_n=(a_n) (b_n)byc_n=_∞

k=−∞a_kb_n₋_k. Hence, if

a_nzⁿ and

b_nzⁿare series with summable coefficients, the coefficient ofzⁿ in(

akz^k)(

bkz^k)is given by (an) (bn)(more precisely, it is given by thenth term ((a_k) (b_k))_nof this sequence, but we will often abuse notations and write simply(a_n) (b_n)). Finally, we write D= {z∈C| |z|<1}andD= {z∈C| |z|1}.

The goal of this section is to prove the following theorem:

Theorem 3.1. Letβ >2. LetR_n, forn∈N^∗, be operators on a Banach spaceE, with_∞

k=n+1R_k =O(1/n^β).

SetR(z)=

R_nzⁿ, and assume that 1 is a simple isolated eigenvalue ofR(1), whileI−R(z)is invertible for z∈D− {1}. LetP be the spectral projection associated toR(1)and the eigenvalue 1, and assume thatP R(1)P = µP withµ >0.

LetR_n(t )be operators onE (fort in some interval[−α, α]withα >0) such that_∞

k=n+1R_k(t )−R_k C|t|/n^β⁻¹. SetR(z, t )=

zⁿR_n(t ). Letλ(z, t )be the eigenvalue close to 1 ofR(z, t ), for(z, t )close to(1,0).

We assume thatλ(1, t )=1−M(t )withM(t )∼ct²for some constantcwith Re(c) >0.

Then, for small enought,I −R(z, t )is invertible for allz∈D. Let us denote its inverse by

T_n(t )zⁿ. Then there existα>0,d >0 andC >0 such that, for everyt∈ [−α, α], for everyn∈N^∗,

T_n(t )− 1 µ 1− 1

µM(t ) n

P C

n^β⁻¹ +C|t| 1 n^β⁻¹

(1−dt²)ⁿ. (5)

In the application of Theorem 3.1 to the proof of Theorems 1.2 and 1.3, the operatorsR_n will describe the returns to the basisB, and will be easily understood, as well as their perturbationsRn(t ). On the other hand,Tn

will describe all the iterates at timen, and Tn(t )will be closely related to the characteristic functionE(e^{it S}ⁿ^f).

Thus, (5) will enable us to describe preciselyE(e^{it S}ⁿ^f), and this information will be sufficient to get Theorems 1.2 and 1.3.

3.1. Banach algebras and Wiener Lemma

In this paragraph, we define some Banach algebras which will be useful in the following estimates. We postpone the proofs of the properties of these algebras to the Appendix.

A Banach algebra Ais a complex Banach space with an associative multiplication A×A→A such that ABAB, and a neutral element. The set of invertible elements is then an open subset ofA, on which the inversion is continuous.

LetCbe a Banach algebra. Ifγ >1, we writeOγ(C)for the set of formal series_∞

n=−∞A_nzⁿ, whereA_n∈C andA_n =O(1/|n|^γ)whenn→ ±∞, endowed with the standard product of power series, corresponding to the

(10)

convolution of the sequences(A_n)and(B_n). It admits the naive norm sup_n_∈Z(|n| +1)^γA_n, for which it is not a Banach algebra. However, there exists a norm, equivalent to the previous one, which makesOγ(C)into a Banach algebra (Proposition A.1). Moreover, this algebra satisfies a Wiener Lemma: ifA(z)=

A_nzⁿ∈Oγ(C)is such thatA(z)is invertible for everyz∈S¹, thenAis invertible inOγ(C)(Theorem A.3).

We will also use the Banach algebraOγ⁺(C), given by the set of series

Anzⁿ∈Oγ(C)such thatAn=0 for n <0. It is a closed subalgebra ofOγ(C), and it satisfies also a Wiener Lemma (Theorem A.4).

Notation. Iff:[−α, α] ×Z→R₊for someα >0, andCis a Banach algebra, we denote byO_C(f (t, n))the set of series_∞

n=−∞c_n(t )zⁿwherec_n:[−α, α] →Cis such that there existsα>0 andC >0 such that

∀t∈ [−α, α], ∀n∈Z, cn(t )Cf (t, n).

We will often omit the subscript inO_C. As usual, we will often write

cn(t )zⁿ=O(f (t, n))instead of the more correct formulation

c_n(t )zⁿ∈O(f (t, n)). We will also writeO(g(n))for the set of series

A_nzⁿwith A_nCg(n)for some constantC. This is a particular case of the previous notation, where the functionsf (t, n) are independent oft. Until the end of Section 3, the notationOwill always have this signification.

Remark. The notations O and O should not be confused: there are similarities (which is why we have used the same letter), but the calligraphic notationOindicates additionally a Banach algebra. In this case, we can for example use the continuity of inversion.

With these notations, we can reformulate Theorem A.3 as follows: if

A_nzⁿ=O(1/(|n| +1)^γ)forγ >1, and

A_nzⁿis invertible for everyz∈S¹, then(

A_nzⁿ)⁻¹=O(1/(|n| +1)^γ). The fact thatOγ(C)is a Banach algebra also implies that, forγ >1,

O 1

(|n| +1)^γ

O 1

(|n| +1)^γ

⊂O 1 (|n| +1)^γ

, (6)

i.e., if two series

A_nzⁿand

B_nzⁿ(withA_n, B_n∈C) satisfy sup

n∈Z

|n| +1γ

A_n<∞ and sup

n∈Z

|n| +1γ

B_n<∞, then the series

C_nzⁿ:=(

A_nzⁿ)(

B_nzⁿ)also satisfies sup_n_∈Z(|n| +1)^γC_n<∞. 3.2. Preliminary technical estimates

For notational convenience, we will often writet instead of|t|in what follows. Equivalently, the reader may consider that the proofs are written fort0. We will also write 1/|n|^γ instead of 1/(|n| +1)^γ, discarding the problem atn=0.

Lemma 3.2. Whenγ >1 andd >0,

O 1

|n|^γ

O

1n0t²(1−dt²)ⁿ

⊂O 1

|n|^γ +1n0t² 1−d 2t²

n .

Proof. Forn <0, the coefficient in the convolution is less than_∞

k=0t²(1−dt²)^k_|_n¹_|γ _|_n^C_|γ. Forn0, it is less than

n/2

k=−∞

1

|k|^γt²(1−dt²)^n/2+ n

k=n/2

1

(n/2)^γt²(1−dt²)ⁿ⁻^kCt²(1−dt²)^n/2+ C n^γ.

Finally, as√

1−dt²(1−^d₂t²), we get the conclusion. 2

(11)

Lemma 3.3. Letγ >1 andd >0. LetG_t(z)=O(t /|n|^γ+1_n₀t³(1−dt²)ⁿ), and assume thatF (z)=O(1/|n|^γ) is invertible for everyz∈S¹. Then[F (z)+G_t(z)]⁻¹=F (z)⁻¹+O(_|_n^t_|γ +1_n₀t³(1−₆₄^dt²)ⁿ).

Proof. We first assume thatF (z)=1. Setting Ht(z)=

n∈Z t

|n|^γzⁿ+

n∈Nt³(1−dt²)ⁿzⁿ, the norm of the coefficients of [1+G_t(z)]⁻¹ is less than the coefficients of [1−H_t(z)]⁻¹. Thus, it is sufficient to consider 1/(1−t K(z)−t³/(1−(1−dt²)z))whereK(z)=

zⁿ/|n|^γ. Note that 1

1−t K(z)−t³/(1−(1−dt²)z)= 1 1−t K(z)−t³

1−(1−dt²)z

1−(1−dt²)[1+t³/(1−t K(z)−t³)]z. (7) For small enought,|(1−dt²)[1+t³/(1−t K(z)−t³)]|<1, whence

1−(1−dt²)z

1−(1−dt²)[1+t³/(1−t K(z)−t³)]z

=

1−(1−dt²)z^∞

n=0

(1−dt²)ⁿ

1+ t³

1−t K(z)−t³ n

zⁿ

=1+ t z(1−dt²) 1−t K(z)−t³t²

∞ n=0

(1−dt²)ⁿ

1+ t³

1−t K(z)−t³ n

zⁿ. (8)

We first study the sum. LetA=Oγ(C)be the Banach algebra of the series whose coefficients areO(1/|n|^γ). As K(z)∈A, we have 1−t K(z)−t³=1+O_A(t ). Since the inversion is Lipschitz on a Banach algebra, we get t³/(1−t K(z)−t³)=t³+O_A(t⁴), whence[1+t³/(1−t K(z)−t³)]ⁿ_AC(1+2t³)ⁿ. Let us estimate the coefficient ofz^pint²_∞

n=0(1−dt²)ⁿ[1+t³/(1−t K(z)−t³)]ⁿzⁿ. This is at most t²

∞ n=0

(1−dt²)ⁿ 1+ t³ 1−t K(z)−t³

n

−n+p

^∞

n=0

t²(1−dt²)ⁿ(1+2t³)ⁿ

|−n+p|^γ

^∞

n=0

t² 1−d 2t²

n

1

|−n+p|^γ

for t small enough so that(1−dt²)(1+2t³)(1−^d₂t²). We find the same expression as in the convolution betweenO(1n0t²(1−^d₂t²)ⁿ)andO(1/|n|^γ), that we have already estimated in Lemma 3.2. Thus, we get at most O(1/|p|^γ+1_p₀t²(1−^d₄t²)^p).

Ast z(1−dt²)/(1−t K(z)−t³)=O(t /|n|^γ), another convolution yields that (8) is 1+O t

|n|^γ +1_n₀t³ 1−d 8t²

n .

Multiplying by 1/(1−t K(z)−t³)=1+O(t /|n|^γ)gives that (7)=1+O(t /|n|^γ +1_n₀t³(1−₁₆^dt²)ⁿ). This concludes the proof in the caseF (z)=1.

We now handle the case of an arbitraryF (z). Note that F (z)+G_t(z)₋1

=

1+F (z)⁻¹G_t(z)₋1

F (z)⁻¹.

The Wiener Lemma A.3 implies that F (z)⁻¹ =O(1/|n|^γ), whence Lemma 3.2 gives F (z)⁻¹Gt(z) = O(t /|n|^γ+1n0t³(1−^d₂t²)ⁿ). Thus, the caseF (z)=1 yields

1+F (z)⁻¹G_t(z)₋₁

=1+O t

|n|^γ +1_n₀t³ 1− d 32t²

n .

Another convolution withF (z)⁻¹gives the result. 2