A NNALES DE L ’I. H. P., SECTION B
M ICHAEL L ACEY K ARL P ETERSEN M ATE W IERDL D AN R UDOLPH
Random ergodic theorems with universally representative sequences
Annales de l’I. H. P., section B, tome 30, n
o3 (1994), p. 353-395
<http://www.numdam.org/item?id=AIHPB_1994__30_3_353_0>
© Gauthier-Villars, 1994, tous droits réservés.
L’accès aux archives de la revue « Annales de l’I. H. P., section B » (http://www.elsevier.com/locate/anihpb) implique l’accord avec les condi- tions générales d’utilisation (http://www.numdam.org/conditions). Toute uti- lisation commerciale ou impression systématique est constitutive d’une infraction pénale. Toute copie ou impression de ce fichier doit conte- nir la présente mention de copyright.
Article numérisé dans le cadre du programme Numérisation de documents anciens mathématiques
http://www.numdam.org/
Random ergodic theorems
with universally representative sequences
Michael LACEY
(1)
Karl PETERSEN
(2),
Mate WIERDL(4)
Dan RUDOLPH
(3)
Department of Mathematics, Indiana University, Bloomington, IN 47405, U.S.A.
Department of Mathematics, CB 3250, Phillips Hall, University of North Carolina,
Chapel Hill, NC 27599, U.S.A.
Department of Mathematics, University of Maryland, College Park, MD 20742, U.S.A.
Vol. 30, nO 3, 1994, p. 353-395 Probabilités et Statistiques
ABSTRACT. - When elements of a
measure-preserving
action ofR~
orZ~
are selected in a random way,
according
to astationary
stochastic process,a.e. convergence of the averages of an LP function
along
theresulting
orbits may almost
surely hold,
in everysystem;
in such a case we call thesampling
schemeuniversally representative.
We show that i.i.d.integer-
valued
sampling
schemes areuniversally representative (with
p >1)
if andonly
ifthey
have nonzero mean, and we discuss avariety
of othersampling
schemes which have or lack this
property.
Key words: random ergodic theorems, random sampling of stationary processes.
( ~ )
Research supported by NSF Grant DMS-9003245 and an NSF Postdoctoral Fellowship.(~)
Research supported by NSF Grants DMS-8900136, DMS-9103656, and DMS-9203489.(3 )
Research supported by NSF Grant DMS-01524351.(4)
Research supported by NSF Grants DMS-9103656 and DMS-9203489.Classification A.M.S. : ergodic theory 28 D 05, 28 D 10, 28 D 15.
Annales de l’Institut Henri Poincaré - Probabilités et Statistiques - 0246-0203
Vol. 30/94/03/$ 4.00/@ Gauthier-Villars
RESUME. - Si des elements d’une action de
R~
ou7ld preservant
unemesure sont choisis selon un processus
stochastique stationnaire,
il estpossible
que presque surement les moyennes d’une fonction de classe LP selon les orbitesgenerées convergent
p.p., pourchaque système ;
en un telcas nous
appelons
le schema d’échantillon universellementreprésentatif.
Nous montrons que les suites i.i.d. à valeurs entières forment un schema universellement
representatif (pour
p >1)
si et seulement si elles ontune
esperance
non nulle. Nous considéronsplusieurs
autresexemples
desschemas d’échantillon
qui
ont ou n’ont pas cettepropriété.
1. INTRODUCTION
Suppose
that elements of ameasure-preserving
action areapplied
in arandom but
stationary
way, and we are interested in the almosteverywhere
convergence of the averages of some function
along
the orbits sogenerated.
The random sequence of elements is fixed in
advance,
and the same se-quence is
applied
in all actions. If the sequence is veryregular,
forexample periodic,
we willalways
have a.e. convergence of the averages for all actions and allintegrable functions,
because of thepointwise ergodic theorem;
it is natural to ask whether a sequence that istypical
in some sense, orsufficiently chaotic, stochastic,
orcomplex,
willproduce
the same results. We have found someexamples
ofstationary
processes that doproduce
a.e. conver-gence in this universal manner, but we also show that in
general
there existcounterexamples,
even forindependent identically
distributed sequences.To establish notation and
terminology,
let I be a set, which will index families{Si :
i EI~
ofmeasure-preserving
transformationsacting
onprobability
spaces(Y, C, v).
A scheme forchoosing commuting m.p.t.’s
atrandom in a
stationary
way will beprovided by
a shift-invariant measureP on S2 = Thus
if wEn,
we will be interested in the a.e. dv(y)
convergence of the averages
for all
systems (Y, C, v, {Si :
i EI~, g).
We have in mind thefollowing examples:
I =
(0, oo):
we make measurements ofcontinuous-parameter stationary
processes, the gaps between measurementshaving
beendetermined
by another, positive-real-valued stationary
process;(1)
Annales de l’Institut Henri Poincaré - Probabilités et Statistiques
I = Z: at each time
apply
a power of a fixedm.p.t.
T chosen inadvance in a
stationary
way;(2)
I =
~0, 1~: apply commuting m.p.t.’s So
andSi, perhaps
chosenindependently, perhaps according
to some otherstationary
measure;(3)
I = ~-1, 1 ~:
at eachtime, apply
either am.p.t.
T or(4)
I= ~ 0, 1,..., d - 1 ~ :
at each timeapply
one of dcommuting
m.p.t.’s perhaps
chosenindependently; (5)
I =
7Ld:
weapply
astationary
sequence of elements of a7Ld
action.(6)
We will say that the scheme
(H, P) (H
= isuniversally representative for
LP if for eSZ,
for everyprobability
space(Y, C, v),
every v-preserving
action{Si :
i E1}
onY,
and every g E Lp(Y, C, v),
theaverages converge a.e. dv
(y). (The
RandomErgodic
Theoremdeals with a weaker
property,
since it fixes the processbeing sampled
and states that almost all
sampling
sequencesyield
almosteverywhere
convergence. This follows
immediately
from thepointwise ergodic
theoremapplied
to theskew-product
transformationy)
--~SW1 y)
on SZ xY,
where a is the shift on
SZ.)
Thequestion
of mean convergence(in
LP(Y))
foris also of
interest;
it is answered below(Corollary
toProposition 1)
forexamples
oftype (1)
and mentioned from time to time in remarks about the otherexamples.
First,
we treat the case I =R,
randomsampling
incontinuous-parameter
flows. The return-time theorem of
Bourgain, Furstenberg, Katznelson,
and Ornstein[8] corresponds
to thespecial
case when thesampling
times takevalues in a lattice in R. This return-time
theorem,
in the formulationinvolving
averages ofproducts
ofpairs
offunctions,
extends to thereal-parameter
case(Theorem 1).
Thisimplies
that anysampling
schemealways
workscorrectly
on each function in a dense set inLB namely
those that are smooth in the time
parameter (Proposition 1); consequently
convergence
always
holds in the mean of orderp >
1(Corollary 1).
Ingeneral, however,
almosteverywhere
convergence can fail. Forexample,
if the
spacing
process has a continuous distribution in someinterval,
thenwith
probability
1 thesampling
sequence will fail on some function in anyreal-parameter
process(Example 1).
On the otherhand,
there also exist nontrivialexamples
ofuniversally representative sampling
schemes.For
example,
if the spaces betweensampling
times take values 1 and c~, where c~ may beirrational,
the selectionsbeing
madeaccording
toa suitable Markov measure, then the
sampling
scheme isuniversally representative (Example 2). By
a modification of thisidea,
we can also constructuniversally representative
schemes that allowarbitrarily
smallgaps between measurement times
(Examples
3 and4).
Given auniversally
Vol. 30, n° 3-1994.
representative sampling scheme,
certain kinds ofperturbations
of it willproduce
anotheruniversally representative
scheme(Example 5). Although
it seems
likely
that there areuniversally representative
schemes in which the gaps between measurements areindependent
and take values in acountable set
clustering
at0,
so far we have not been able to construct aparticular example.
The remainder of the paper concerns the random selection of transformations from a
commuting family, perhaps
from anaperiodic
actionof several
commuting
transformations or from the set of powers of asingle
transformation. The case when the selections are made
according
to aBernoulli scheme
(i.e.,
in an i.i.dway),
isalready
of considerable interest.Suppose
first that we choose either a transformation or its inverseaccording
to the Bernoulli measure
,t3 ( 1 / 2, 1/2);
then because of the recurrence of thesymmetric
one-dimensional random walk(in n steps approximately ~
sites are
visited,
each oneapproximately vn times),
at firstglance
onemight
supose that when we look at the Cesaro averages we areseeing good weights
on powers of asingle m.p.t.,
and so we should obtain agood
sequence with
probability
1. We show(Theorem 4), by using
Strassen’sfunctional law of the iterated
logarithm,
that in fact this is not the case:in any
aperiodic system,
there isalways
acounterexample.
On the otherhand,
whenevercommuting m.p.t.’s
areapplied according
to the Bernoulli scheme,t3 (1/2, 1/2),
we have a.e. convergence of the Cesaro averagesalong
thesubsequence
2n log n(Theorem 7);
this isproved by
the Fourier method introduced intoergodic theory by
J.Bourgain.
If powers of asingle m.p.t.
are to beapplied according
to the values of aninteger-valued independent
sequence ofintegrable
randomvariables,
then there isalways
a.e. convergence of the
ergodic
averages for LPfunctions,
p >1,
if andonly
if the
sampling
process has nonzero mean(Theorem
5 and6,
combinedwith Theorem
4). Choosing
amongcommuting
transformationsaccording
to the entries in a certain Sturmian sequence does
provide
anexample
ofa
universally representative
scheme(Theorem 3).
The
variety
of behavior found in theseexamples
makes the formulation of a condition thatmight
be necessary and sufficient for asampling
scheme to be
universally representative
seemfairly
remote at this time. Thedifficulty (and interest)
ofquestions
of this kind comes from two sources:inappropriate averaging,
either the discretesampling
of a flow or the one-dimensional
sampling
of a two-dimensionalaction;
and the need to dealsimultaneously
withuncountably
manysampled
processes,combining
un-countably
manyexceptional
sets of measure zero into asingle
set of measurezero.
Sampling
theorems of a different kind were considered in[14], [4].
Annales de l’Institut Henri Poincaré - Probabilités et Statistiques
The authors thank M.
Akcoglu,
M.Boshemitzan,
and J.King
forhelpful suggestions
andencouragement
and the referee for manyimprovements
inthe
presentation.
2. RANDOM SAMPLING OF
CONTINUOUS-PARAMETER STATIONARY PROCESSES Let
(cv) : k, = l; 2, ...~ be a positive
real-valuedintegrable stationary
process and Tk
(c~)
=cSi (c~)
+ ...(w),
To = 0. The Tk(w)
formour sequence of
sampling times,
and the~k (c~)
are the gaps betweenmeasurements. The
sampling
schemef 6k}
isuniversally representative (for L~)
if for a.e. cv,(cc;)}
is agood
sequence, in thefollowing
sense: Forevery
measure-preserving
flow{St :
2014oo t~}
on a measure space(Y, C, v)
andintegrable function g
onY,
Because of the theorem on
ergodic decompositions,
we may as well assumethat the
sampling
process as well as all of the processesbeing sampled
areergodic. By subtracting
off a constant, we may also assume that theintegral of g
is 0.Letting
H =(0, oo)N
with the measure Pgenerated
wehave the
representation 8k ( c,~ )
== 8c,v ) ,
where a : SZ -~ H is the shifttransformation
(0, oo )
reads off the first coordinate of cv.Discrete
samples
and return times. - To see howsampling
theorems areconnected with return-time theorems in the
particular
case when 8 is N-valued
(and similarly,
with suitableadjustments,
when 8 takes values ina lattice in
R),
we consider theprimitive
transformation % : builtover the floor H with
ceiling
function 8 - 1(see [15],
p.40).
Then Tk(c~)
is the time of the k’th
entry of 03C9 ~ 03A9
to H under 03C3. Thus if J = Tn then in the first Jsteps
of its orbit undercr,
the number of times that 03C9enters H is
Nj (o, SZ)
= n, and we haveVol. 30, nO 3-1994.
For a.a. 03C9, the first factor converges to
1 where is
the usual extensionof P to
S2).
That for a.a. cv the second factor converges a.e. dv(y)
for all(Y, C,
v,g)
is the return-time theorem of[8] (which applies just
as wellto bounded measurable functions as to characteristic function like
x~).
THEOREM 1. -
If (Tt :
-~ t~}
is anergodic measure-preserving flow
on afinite
measure space(X, ,~, ~c)
andf
E L°°(X, B,
then a.e.x E X has the
following
property : For each system(Y, C,
v,~St~, g), where {St : -~
t~}
is anergodic measure-preserving flow
on thefinite
measure space(Y, C, v)
and g is anintegrable function
onY,
Proof. -
We assumethat g
hasintegral
0.Moreover,
iff
is in the closed linear span inL2
of theeigenfunctions
of~Tt~,
then existence and identification of the limit both followeasily
from theErgodic Theorem,
soby subtracting
theprojection of f
onto thissubspace (which
is also bounded since it is a conditionalexpectation
withrespect
to thecorresponding
factoralgebra)
we may assume thatf
is in theorthocomplement
inL2
of theeigenfunctions of ~ Tt ~ .
In this case we want to show that the limit above is 0 for almost all y.It is known that all but
perhaps countably
many of the mapsTt
areergodic; rescaling
if necessary, we may assume thatTi
isergodic,
andhence is
ergodic
for each r E N. The set ofgood x
whose existence is claimed in the theorem is formedby deleting
thefollowing
sets of measure0 from X.
First, apply
the return-time theorem of[8]
tof
and each of the mapsT l/r,
r EN, discarding
a bad set of measure 0 for each r.Next,
notice thatby
the same Fubiniargument
that proves the LocalErgodic Theorem,
For each r E
N and q
EQ,
discard the set of measure 0 of those x which are notgeneric
for theset {u : f ( u )
>q ~ ,
in the sense thatthe orbit of x under fails to visit this set with the correct
limiting frequency.
Supose
now that x is in thisremaining good
set and that(Y, C,
v,~St~, g)
isgiven.
It is sufficient to consider averages overAnnales de l’Institut Henri Poincaré - Probabilités et Statistiques
intervals of
length t
= ~ e N.Upon dividing
each interval[jy, ~
+1]
into r
subintervals [k r, k+1 r]
we find thatSince
f
EL°°,
we have a maximalinequality
forcoming
from the MaximalErgodic Theorem,
so it isenough
to prove convergence a.e. for a dense set offunctions g
EL~ (Y),
such as those withg
(Ss y) uniformly
continuousin s, uniformly
in y.(The
familiar functionswith h E L°°
(Y) and §
E C(R)
withcompact support
haveDealing
now with such aspecial g,
choose rlarge enough
that the secondterm in
(1)
is less than agiven IlK
for all n.Since
Dh f (u)
~ 0 a.e., we may choose r alsolarge enough
that~c. ~~c :
>1/K.
Because the orbit of x under visits{Dl/r f
> with theright frequency,
we can choose nlarge enough
thatVol. 30, n° 3-1994.
Then the first term in
(1)
is boundedby
which is
arbitrarily
small forlarge
K.Finally,
since r was fixedfirst, using
the return-time theorem from[8]
for a.e. y we may choose n
large enough
that the third term in(1)
isalso less than
1/K.
Remark. - E.
Lesigne (personal communication)
has observed that theargument
of[8],
whenapplied
tointegral
averages, alsoyields
this result.In the case of a
general sampling scheme,
we may beconsidering
returnsto a set which has measure 0. With the notation
above,
if 8 : SZ -~(0, oo )
is the function
generating
thespacings
betweensampling times,
considerthe flow under the function 8 with base
(H, a).
Thatis,
we now letby letting
eachpoint 0)
flow up at unitspeed
until it hits theceiling,
atwhich
point
it moves to0).
Then81 (c,~), 82 (cv),
... are thespacings
between hits of the floor
H,
so that the Tk(cv)
areexactly
the return timesof
(w, 0)
to H under this flow.In order to have a set of
positive
measure to return to, we take a narrow band below theceiling
and work with its characteristic functionFor small
h, f h
hasintegral
very near 1.PROPOSITION 1. -
Any sampling
scheme isrepresentative for
the(dense)
setD
(Y) of functions
g ELl (Y) for
which g(Ss y)
is auniformly
continuousfunction of
suniformly
in y.Proof. -
Asbefore,
we may assume that g has mean 0. We consider htaking
values in a sequencetending
to 0.Suppose
cv is such that a.e.point
~ _
(o, t)
isgood
in the sense of thecontinuous-parameter
return-time theorem(Theorem 1 )
for all the functionsf h
asabove,
as h varies in thisAnnales de l’Institut Henri Poincaré - Probabilités et Statistiques
countable set.
Suppose
further that 03C9 isgeneric
for the function 8 onH,
inthe sense of the
ergodic
theorem and the transformation a. Thenso that if x =
(c.,~, 7/)
we haveGiven c >
0, choose ~ ~
0 very small sothat g
has oscillation less thanc/2
on any interval oflength
r~(w, r~)
isgood,
in the senseof Theorem
1,
for all thefh
with h in our countable set. Then choose hso small that the second term in
(2)
is less than c for all n and y. Forlarge
n, the third term is of the order of this can be made less than cby choosing h
small and nlarge.
The fourth and fifth terms tend to 0 as n -* oo becausethey
are boundedby Finally, by
Theorem 1 we can choose n alsolarge enough
that the first term is also less than c.COROLLARY 1. - Let
(S2, P) be a sampling
scheme as above.Then for
a.e.o,
for
everymeasure-preserving flow
t~}
on a measurespace
(Y, C, v), p > 1, and g
E LP(Y),
Vol. 30, n° 3-1994.
Proof. - Always assuming ergodicity
and mean0,
for h E D(Y),
which can be made
arbitrarily
smallby choosing
first h and then n.COROLLARY 2. -
If a sampling
scheme(Q, P)
is such thatfor
a.e. c,~,for
all(Y, C, v, ~ St ~ )
the averagessatisfy
a maximalinequality
on LP(Y)
(p > 1),
then the scheme isuniversally representative for
LP.3. EXAMPLES OF BAD AND GOOD SAMPLING SCHEMES
Example
1. -Suppose
that thespacing
function b isuniformly
distributed in[1/2, 1]
and that the spaces are allindependent.
Then withprobability
1 thesampling
times Tk and 1 arelinearly independent
overQ,
so it follows from[3]
that thesampling
scheme is notuniversally representative
in this case: for a set of cv ofprobability 1,
for every nontrivial(Y, C, v, {Ss})
there is a boundedmeasurable g
on Y for whichthe averages
An g (y)
of thesamples
at the times determinedby cv
fail toconverge a.e. It can be
argued
that this is thetype
ofsampling
scheme that isnecessarily employed by
actualscientists,
and thattherefore,
in view ofthis
example
and theCorollary
toProposition 1,
von Neumann wasright
when he
argued
that the MeanErgodic Theorem,
rather than thepointwise
one, was the real
ergodic
theorem.Example
2. - We will show that there are some nontrivial Markov processes that dogive
rise touniversally representative sampling
schemes.Suppose
that thesampling
process(H, P, 8)
is such that 8 takes valuesa ~
for some a > 0 and that the difference between the number of 1’s and the number of seen remains bounded: there is K such thatfor all n =
1, 2,...
and ~ 03A9(actually
K could alsodepend
onc,v).
Particular
examples
can be obtained fromstationary
Markov measures onrun-limited
graphs
likeAnnales de l’lnstitut Henri Poincaré - Probabilités et Statistiques
Then the
sampling
times Tk(cv)
are in ZUZ a, a countable dense set inR,
butof course the spaces between them are bounded below. If
0 _ g
EL~ (Y),
then
where
and
(Si
denotes the usualergodic
maximaloperator applied
toSi Sa .
From this the weak
1,1 inequality
for supnAn g (y)
followsimmediately,
for all o E SZ. Since
by Proposition
1yields
a.e. convergence ofAn g (y)
for a dense setof g
ELl (Y),
it follows that for almost all w thesampled
averagesAn g (y)
converge a.e. dv(y)
forall g
ELl (Y).
Example
3. - We will extend theprevious example
to one in which thespacing
function 8 takesarbitrarily
smallpositive values,
say in a sequence rj that tends to 0. Under suitable conditions we will obtain asampling
scheme that is
universally representative
for LP(Y)
for p > 1.Let each rj E
(0, 1),
and let Pagain
have memory such that astep
ofsize rj
isalways
followedimmediately by
one of size 1 - r j. Forexample,
such a
system
can be constructed from anarbitrary measure-preserving
system Po, To)
and countablepartition A2, ...~
as follows. Let(H, P, T)
be theprimitive
transformation builtby putting
a "second floor"over
(no, Po),
the second floorbeing merely
a copy(no, Po). (Points
onthe first floor are
mapped
upby
the identification map tocorresponding points
in the secondfloor,
andpoints
in the second floor aremapped by To
and then tocorresponding points
in thefirst).
Wedefine ~ ~ ~
onAj
and 1 - rj on
TA.
We claim that if p~ =
P (Aj)
decreases to 0sufficiently rapidly,
thenfor a.e. c,~, for all
Y,
forall g
E LP(Y)
the surpremum(over n)
of theaverages
An g (y) along
the times T~(w)
will be finite a.e. dv(y),
andhence, by
Banach’sPrinciple
andProposition 1,
thesampling
scheme will beuniversally representative
for LP(Y). (For bounded g
a more directargument using only
the return times theorem ispossible).
Forsimplicity,
Vol. 30, n° 3-1994.
let us suppose first that p > 2 and g E LP
(Y).
Then for m =0; 1,...,
T2 m
(cv)
= m and T2 m+1(cv)
= m + b(T2 ~m
for all cv on the first floor ofSL,
so that for0 _
g E LP(Y)
The surpremum of the first term is controlled
by
theergodic
maximalfunction To deal with the second term, for
each j
let x~ = and letxi
denote theergodic
maximal function of x~ underT2.
Then the surpremum of the second term is boundedby
We claim that it is
possible
to choose theprobabilities
pj so thatlet us assume this for the moment and note that
if g
E LP for some p >2,
thenS 1 g2 )
E foreach j, by
the DominatedErgodic Theorem,
so that its square root is
again
in LP CLl (with Ll
norm boundedindependently of j ).
Then the above estimate shows thatTherefore the same result also follows for a sequence of
sampling
times{k (03C9)} generated by
apoint 03C9
on the second floor of H.Annales de l’Institut Henri Poincaré - Probabilités et Statistiques
It remains to show that one can choose the p~ so as to
guarantee
the almostsure convergence of the series
~~ (xi (CcJ))1/2;
forexample,
p~ =will work. For
then, by
the MaximalErgodic Theorem,
so that
~ P ~x~ > 2-~ ~
oc, and hence withprobability
1 we have2-j
forlarge enough j.
In the case of an
arbitrary
p > 1 the sameargument applies, except
that theappropriate
version of Holder’sinequality
should be used. Therequirement
then becomes~ ( cv ) ) a
oo a.s., where p >Again
this can bearranged by choosing
the p~ =P (6
=suitably.
Example
4. -Upon deleting
the times T~(cv)
that are in 7~ from thesampling
scheme inExample 3,
we obtain a scheme thatmight
betermed an
example
of randomdelay:
the k’th measurement is taken at timeTk (w)
= k +b ( ~ ~ c~ ) ,
where 0b ( cv )
1 for all c~, so thatthe size of the gap between the k’th and
(k
+1)’ st
measurements is’Y~
(~)
=1-f-b ~) -b ~) _
’Y~),
if ~y(c,~)
=1+b (cv).
Notice that
again ~ (~y 2014 1) w)
is bounded for each cv,exposing
theconnection of these
examples
withExample
2. Since Z isuniversally representative
for LP(Y),
this sclieme will be also whenever p and the p~are chosen so that the
corresponding
scheme inExample
3 isuniversally representative
for LP(Y).
Question. -
Is itpossible
togenerate
auniversally representative
sequence of
sampling
times if the spacesôk
= b o~~
between timesare chosen
independently
from aset ~ r~ ~ clustering
at 0? Will the sequenceautomatically
beuniversally representative
if =r~ ~
-~ 0 fastenough?
Example
5. - The idea behindExamples
3 and 4actually permits
construction of a
fairly
wide class ofexamples. Roughly speaking,
ifa
sampling
scheme admits a return-timestype theorem,
then so does any countable-valued stochasticperturbation
of thescheme,
if its tail distribution vanishessufficiently quickly.
To make thisprecise,
let~T~ (cv) ~
be asequence of
sampling times, i.e.,
anincreasing
sequence ofnonnegative
real-valued functions on a
probability
space(H, ~’, P);
we say that the sequence has return timesfor
p( > 1 )
in case for allf
E L°°(H),
foralmost all cv,
given
acontinuous-parameter measure-preserving flow ~ St ~
on a
probability
space(Y, C, v) and g
E LP(Y),
Vol. 30, n° 3-1994.
For
example,
the sequence Tk = k has return times for eachp >
l. Asusual,
let q denote the index dual to p.THEOREM 2. - Let
p >_
1.Suppose
that~T~ ~
has return timesfor
p andthat SZ --~
,~32, ... ~
is a countable-valued measurablefunction for
which there is a sequence
of positive
reals such that~
oo andJ
Then + also has return times for p.
Remarks.
1.
Similarly,
for any measurablefunction 1
onH, (cv)
= Tk{c~)
+,~ (Tk w) - 1 (w)
also has return times.2. The situation of the theorem can be
generalized
still further. Forexample,
in the definition of "has returntimes",
we may suppose that instead of transformationsSTk
(w)sampled
from ameasure-preserving flow,
we are
dealing
with a random sequence of contractions{cv) ~
onLP
(y).
Theperturbation
is formedby taking
a sequence of contractionsVi, V 2 , ...
on LP(Y)
and a measurablefunction j :
S2 -+ N andreplacing
~~ {cv) = by Wk (w)
=U~
Under thecondition that
£ an
oo, the sequence~W~ (cv)~
willalso have return times in this sense.
(A particular
casemight
involve the deflection of the action of a combination ofcommuting m.p.t.’s (ST)
offcourse,
by applying
powers of either S or T atrandom.)
3. If we are concerned
only with g
EL°°,
we candispense
with theassumption involving
thesequence {aj}.
Proof of
Theorem 2. - We may assume without loss ofgenerality
thatf , g >
0. Let x; =x ~ ~_,~~ ~ ,
and abbreviateh~
= g~ =g 5,~~ ,
andg~, ~ = g~
STk
~W). Thegood
set of c,~ will consist of all thosepoints
forwhich
(1)
there is ajo (c,~)
such thatfor j > jo (cv)
we havewhere * denotes the
ergodic
maximal function for T(this
is a set of fullmeasure, since
by
the MaximalErgodic
Theorem andhypothesis,
(2)
c~ isgood,
in the sense of the return-timeshypothesis,
for eachh~ .
Annales de l’Institut Henri Poincaré - Probabilités et Statistiques
Notice that also for each k there is a set of full measure of y for which there is a
jo (y)
such thatfor j > jo (~/).
in what
follows,
we will consideronly
those y that are in these sets of fullmeasure for all k.
Also,
welet jo (w, y)
=max {jo (w), jo (y)}.
For p > 1the
interchange
of limit and sum in the calculation[in
which the limits inside the sum exist a.e. dv(y)
foreach j, by
pro-perty (2)]
isjustified by
the DominatedConvergence Theorem,
since(using
Holder’ s
Inequality)
If p = 1,
then thehypothesis
ooonly
allowsj3
to takefinitely
manyvalues,
and theinterchange
of limits isagain permissible.
4. RANDOM APPLICATION OF COMMUTING M.P.T.’S - A GOOD EXAMPLE AND A KEY COUNTEREXAMPLE In this section we
investigate
theproblem
of a.e. convergence of averages when the transformationsbeing applied
at eachinteger
time are selectedfrom an
arbitrary
set ofcommuting m.p.t.’s,
notnecessarily
from a one-parameter
flow. This isalready
aninteresting question
in the verysimple
Vol. 30, n° 3-1994.
case when at each time we choose
independently
one of twocommuting m.p.t.’s,
such as either am.p.t.
or itsinverse;
we will show below that sometimes this scheme admitspointwise ergodic theorems,
but notalways.
If the two transformations are chosen
according
to ageneral
shift-invariant measure, theprinciples
that makepossible
the convergence of random averages are not at allclear;
for two-dimensional averages(summing
overrectanges
in7l2)
one can prove return-timetheorems,
but here we aredealing
withsomething
different -again
someinappropriate
averages, this time a sort of one-dimensionalsampling, by
means of a randomwalk,
ofa two-dimensional situation.
It will
develop
that forindependent
sequences(i.e.,
if P =p°°
for aprobability
measure p onI)
there is a difference betweenexamples
of thekinds
(2)
and(4) (powers
of asingle transformation)
and those of kinds(3)
and(5) (general aperiodic higher-dimensional actions).
The first kindproduces good
schemes for a.e. convergence of the if andonly
if theexpectation
of the powers is not zero. The second kind neverproduces good schemes;
there isalways, however,
a.e. convergence of a fixedsubsequence
of the
An .
Sturmian
samples. -
Webegin by considering
agood
scheme oftype (3),
in which twocommuting m.p.t.’s
chosen in a certainstationary (but highly dependent)
wayproduce
agood sampling
scheme: we will show that whenever twocommuting m.p.t.’s
areapplied according
to the entries in certain Sturmian sequences, we will have a.e. convergence of theresulting ergodic
averages for all functions inL I.
Fix an irrational a in(0, 1),
letJ =
[0, ( j a)]
for some nonzerointeger j (where ~x~
denotes the fractionalpart
ofx),
and for each x E[0, 1] define § (x)
E~0, 1~~ by
When
sampling
apair
ofcommuting m.p.t.’s So, Sl,
at each time k wewill
apply
~x) ~~_ 1) .THEOREM 3. - Each x E
[0, 1]
generates agood ~0, 1~-sequence,
in thefollowing
sense: Given anyprobability
space(Y, C, v), commuting m.p.t.’s
So
andSl
onY,
and g EL1 (Y),
the averagesAnnales de l’Institut Henri Poincaré - Probabilités et Statistiques
Proof -
Given anysystem (Y, C, v, So, Si), by considering
the skew-product
transformation(x, y)
2014~(x
+ a,S~j
J (~)y)
of the rotationby
aon
[0, 1]
with(Y, So, Si)
we canobtain,
for eachfixed g
EL~ (Y),
convergence of the averages for a.e.
(x’, y).
We will use thisto show that in fact for all x E
[0, 1],
these averages converge a.e. on Yif g
EL°° (Y) .
Given x E
[0, 1],
notice that if x’ is very near x, and say to theright
ofx, then x
+ k
a and x’+ k
a are either both in J or both not inJ, except
when x+ k
a hits a very short interval to the left of anendpoint
of J.However,
if thelength
of J is a, when we create adiscrepancy
at timek
by hitting
the first of theseintervals,
weimmediately
correct it at timek +
1;
if thelength
of Jis ( j a) ,
theresynchronization
occurs afterfinitely many steps. Therefore (y) - (g) ~
is bounded times thefrequency
of visits of x-f-1~ a
to a shortinterval,
and so will be very smallif x is close to
x’. Thus, given g
E L°°(Y),
choose a countable dense setof x’ for each of which there is a full set of y E Y with convergence of the associated averages; then
given x
E[0, 1]
and c >0,
if we choose one ofour
countably
manyx’ sufficiently
close to x, on the intersection of thesecountably
many sets of y we will haveIn order to lift convergence from L°°
(Y)
to all ofLI (Y),
we willagain
use a bounded-deviation
type
of trick to establish a maximalinequality.
Since
k al
is bounded(see [16]),
we may writewhere m is bounded
(say by M)
andinteger-valued.
LetSt
be thesuspension (flow
under the constant function1)
ofSi
onY
= Y x[0, 1 )
and defineT (y, r) = (So y, r)
onY
and S =Si;
then T andSt
commute. Extendfunctions g
on Y toY by 9 (y, r)
=g (y)
for all r E[0, 1). Let g
benonnegative
andintegrable
onY,
and defineVol. 30, n° 3-1994.
Then
since
gM (y, t)
= gMT~ y)
or gMT’~
Sy) depending
on whether the fractional
part
of k c~ is to the left orright
of 1 - t. Thenthe Maximal
Ergodic
Theoremapplied
to the transformationTS03B1
onY
and Fubini’s Theoremyield
a maximal estimate for the averagesAn
g.Now we consider the
problem
ofapplying
twocommuting m.p.t.’s according
to a Bernoulli(independent identically-distributed)
sequence on twosymbols.
We will see that thissampling
scheme isalways universally representative
for a fixedsubsequence
of the sequence of averages; but when the transformation are am.p.t.
and itsinverse,
it isuniversally representative
if andonly
if the distribution is notsymmetric.
First we show that
making independent
choices of 1and -1,
each withp robabilit y 1 2
does not form auniversally representative
scheme(for applying
am.p.t.
and itsinverse).
THEOREM 4. - Consider a sequence c.~
E ~ -1,
chosenaccording
to theBernoulli measure
,~3 1 2 ’ 1 - 2
thatis,
’~ ~ ~ c,~
is a sequenceof in dependent, identically-distributed
choices and 1 eachbeing
chosen withprobability 1 2.
Asusual, for
each k =1 2,
... letSk ( cv )
= 03C91 + ... + okdenote the
position
at time kof
the random walk whose increment at timej
is Then withprobability 1,
c,~ is a bad sequencefor
U and inthe
following
sense:given
anyergodic m.p.t.
U on a nonatomicprobability
space
(Y, C, v),
there is g EL1 (Y; C, v)
such that the averagesAnnales de l’Institut Henri Poincaré - Probabilités et Statistiques