• Aucun résultat trouvé

Entropy of random walk range

N/A
N/A
Protected

Academic year: 2022

Partager "Entropy of random walk range"

Copied!
13
0
0

Texte intégral

(1)

www.imstat.org/aihp 2010, Vol. 46, No. 4, 1080–1092

DOI:10.1214/09-AIHP345

© Association des Publications de l’Institut Henri Poincaré, 2010

Entropy of random walk range

Itai Benjamini

a

, Gady Kozma

a

, Ariel Yadin

b

and Amir Yehudayoff

c

aFaculty of Mathematics and Computer Science, The Weizmann Institute of Science, Rehovot 76100, Israel.

E-mails:[email protected];[email protected]

bDPMMS, Centre for Mathematical Sciences, University of Cambridge, Cambridge CB3 0WB, UK. E-mail:[email protected] cSchool of Mathematics, Institute for Advanced Study, Princeton, NJ 08540, USA. E-mail:[email protected]

Received 27 April 2009; revised 30 September 2009; accepted 2 October 2009

Abstract. We study the entropy of the set traced by ann-step simple symmetric random walk onZd. We show that ford≥3, the entropy is of ordern. Ford=2, the entropy is of ordern/log2n. These values are essentially governed by the size of the boundary of the trace.

Résumé. Nous étudions l’entropie de la trace d’une marche aléatoire simple et symétrique de longueurnsurZd. Nous montrons que sid≥3, cette entropie est d’ordren, tandis que pourd=2 elle est d’ordren/log2n. Ces valeurs proviennent essentiellement de la taille de la frontière de la trace.

MSC:82C41; 94A17

Keywords:Random walk; Entropy

1. Introduction

A natural observable of a random walk is itsrange, the set of positions it visited. In this note we study the entropy of this range, which is roughly the number of bits of information needed in order to describe it. We calculate the entropy of the range of a random walk onZd,d∈N, up to constant factors.

1.1. Main result

LetS(0), . . . , S(n)be a simple symmetric random walk onZd,d∈N, of lengthn. Define therangeof the random walk to be

R(n)=

S(0), S(1), . . . , S(n) ,

the set of vertices visited by the walk.

In this note we study the entropy ofR(n)as a function ofn(for formal definition of entropy, see Section2.1). We calculate the value of the entropy,H (R(n)), up to constants, precisely:

Theorem 1. Ford=2there exist constantsc2, C2>0such that for alln∈N, c2

n

log2(n)H R(n)

C2

n log2(n),

(2)

and ford≥3there exist constantscd, Cd>0such that for alln∈N, cdnH

R(n)

Cdn.

The proof of Theorem1is organized as follows: We first prove the lower bound which is easier and follows directly from estimates on the size of the boundary of the range; in two dimensions the boundary of the range of the walk is of ordern/log2n, and in higher dimensions it is linear inn. This is done in Section2.2. We then show the upper bounds.

The proof for dimensions greater than two is in Section2.3. The proof for dimension two, which requires a certain renormalization argument, appears in Section2.4. An interesting feature of the procedure is that at each step of the renormalization process, the number of “active” boxes is not determined by examining the previous renormalization step, but rather globally. For a more detailed discussion of this procedure see Section2.4.2.

The one-dimensional case is not difficult.

Exercise

In the cased=1, there exist constantsc1, C1>0 such that for alln∈N, c1lognH

R(n)

C1logn.

2. Entropy of random walk

2.1. Entropy

Here we provide some background on entropy. LetXbe a random variable taking values in an arbitrary finite setΩ. For xΩ, let p(x) be the probability that X=x. The entropy of X is defined as H (X)=E[−logp(X)] (all logarithms in this note are base 2). For two random variables X andY, the conditional entropyof X conditioned onY is defined asH (X|Y )=H (X, Y )H (Y ).

Proposition 2. The following relations hold:

(i) 0≤H (X)≤log|Ω|.

(ii) For every functionf,H (f (X)|X)=0.

(iii) H (X)H (Y )+H (X|Y ).

For more information on entropy and for proofs of these properties see, e.g., [1], Chapter 2.

2.2. Lower bound Notation

ByPzandEz we denote the probability measure and expectation of the random walk conditioned onS(0)=z. We denoteP=P0andE=E0. Letz, w∈Zd andA⊂Zd. Denote by dist(z, w) the graph distance betweenzandw inZd. Denote dist(z, A)=inf{dist(z, a): aA}. We writezwif dist(z, w)=1, andzAif dist(z, A)=1. The inner boundaryofAis defined as

∂A=

zA: z∼Zd\A .

Letpn(A)=P[R(n)=A].

Lemma 3. For everyA⊂Zd, pn(A)

1− 1

2d |∂A|−1

.

(3)

Proof. LetT0=0 and define inductively forj≥1, Tj=inf

tTj1+1: S(t)∂A .

By the strong Markov property, for any 0< j <|∂A|, P

S(Tj+1) /∈A|S(0), . . . , S(Tj), Tj<∞ ≥ 1 2d.

The eventAR(n) implies thatTjnfor all j ≤ |∂A|. The eventR(n)Aimplies thatS(Tj+1)∈Afor all j≤ |∂A| −1. LetEj be the event thatS(Tj+1)∈AandTj+1<∞. Thus,

P

R(n)=A ≤P |∂A|−1

j=1

Ej

|∂A|−1 j=1

P[Ej|E1, . . . , Ej1] ≤

1− 1 2d

|∂A|−1

.

Lemma3shows that in order to bound the entropy of the random walk trace from below, it is enough to bound the expected value of the size of the inner boundary of the random walk trace from below. More precisely, we have:

Corollary 4. H (R(n))≥ −log(1−2d1)·E[|∂R(n)| −1].

The following lemma gives the lower bound for the entropy of the random walk trace.

Lemma 5. For anyd≥2,there exists a constantcd>0such that for alln∈N, H

R(n)

c2 n

log2(n), d=2, cdn, d≥3.

Proof. By Corollary4, it suffices to show that E∂R(n) ≥

c2 n

log2(n), d=2, cdn, d≥3

for some constantscd>0. Forz∈Zd, defineTz=inf{t≥0:S(t)=z}. By Lemma 19.1 of [4], and by the transience of the random walk ford≥3, there exist constantscd>0 such that for anyzw∈Zd,

Pz[Tw> n] ≥ c

2

logn, d=2, cd, d≥3.

Denote the right-hand side of the above inequality byfd(n). Using the strong Markov property at timeTz, for any zw∈Zd,

P

z∂R(n) ≥P[Tzn, Tw> n] ≥fd(n)P[Tzn]. This proves the lemma, since

E∂R(n) ≥fd(n)

z∈Zd

P[Tzn] =fd(n)ER(n) ,

and since

ER(n) ≥

c2·lognn , d=2, cdn, d≥3

for some constantscd>0 (see, e.g., Theorem 20.1 in [4]).

(4)

2.3. Upper bound

We now show that the lower bounds on the entropy of the random walk trace given by Lemma5are correct up to a constant. The transient case is much simpler than the two-dimensional case.

Proposition 6. Ford≥3,there exists a constantCd>0such that for alln∈N, H

R(n)

Cd·n.

Proof. LetΩ = {A⊂Zd: pn(A) >0}. By clause (i) of Proposition2 it suffices to prove that|Ω| ≤(2d)n. This follows from the fact that the number of possiblen-step trajectories inZdstarting at 0 is(2d)n. 2.4. Two dimensions

We now turn to the two-dimensional case, which is more elaborate.

Forz∈Z2, we denote byztheL2-norm ofz. Define Tz,r=inf

t≥0 :S(t)zr , andTr=T0,r. Also set

τz,r=inf

t≥0: S(t)−zr , andτr=τ0,r.

2.4.1. Probability estimates

We begin with some classical probability estimates regarding the random walk onZ2, which we include for the sake of completeness.

Lemma 7. There exists a constantC >0such that for alln∈N, E

0maxkn

S(k)2

Cn.

Proof. LetS(k)=(X(k), Y (k)), soS(k)2= |X(k)|2+ |Y (k)|2. Doob’s maximal inequality (see, e.g., [5], Chap- ter II) on the martingaleX(k)tells us that

E

0maxkn

X(k)2

≤4EX(n)2 .

The martingale|X(k)|2k/2 tells us thatE[|X(n)|2] =n/2, which completes the proof, sinceX(k)andY (k)have

the same distribution.

Lemma 8. There exist constantsc1, c2>0such that for alln∈Nandλ >0, P

1maxjn

S(j )λ

c1·exp

c2λ2 n

.

Proof. This is a consequence of Theorem 2.13 in [4].

Lemma 9. There exists a constantc >0such that the following holds.LetT =T0,0.Then,forz∈Z2andr≥2z, Pz[Tτr] ≥clog(r/z)

logr .

(5)

Proof. Leta:Z2→ [0,∞)be thepotential kerneldefined in Chapter 1.6 of [2]. That is,a(0)=0,a(·)is harmonic inZ2\ {0}, and there exist constantsc1, c2>0 such that for anyz∈Z2\ {0},a(z)=c1logz +c2+O(z2).

Sincea(·)is harmonic inZ2\ {0}, ifr >zthena(S(t ))is a martingale up to timeT =min{T , τr}. Thus, a(z)=

1−Pz[Tτr]

·Ez

a S

T

|T > τr , which implies

Pz[Tτr] ≥1−c1logz +c2+O(z2)

c1logr+c2+O(r2) .

We also need an upper bound.

Lemma 10. There exists a constantC >0such that for everyz∈Z2andr, Rsuch that1≤r12z14R, Pz[TrτR] ≤C·log(R/z)

log(R/r) .

Proof. Using the potential kernel from the proof of Lemma 9 with the stopping time min{Tr, τR}, there exists a constantc1>0 such that

Pz[TrτR] ≤c1(logR−logz)+O(R1+ z2) c1(logR−logr)+O(r2)

C·log(R/z) log(R/r)

for some constantC >0.

Lemma 11. For any0< α <1, there exists a constantC >0such that the following holds.Letz∈Z2 such that z ≥1/α.Then for anyn∈Nsuch thatn >z4,

Pz[Tαzn] ≤ C log(n/z4).

Proof. By adjusting the constant, we can assume without loss of generality thatn/z4is large enough. Letr=αz andR=n1/4. Using the potential kernel from the proof of Lemma9with the stopping timeT =min{Tr, τR},

Pz[TrτR] ≤c1log(z/r)+O(r1)

c1log(R/r)+O(r1)C1

log(n/z4) (2.1)

for some constantC1=C1(α) >0 independent ofzandn. Also, considering the martingaleS(t)2tup to timeτR shows thatEz[τR] ≤(R+1)2. Thus, by Markov’s inequality,

Pz[τR> n] ≤ 4

n. (2.2)

Equations (2.1) and (2.2) together prove the proposition, since

Pz[Trn] ≤Pz[TrτR] +Pz[τR> n].

Lemma 12. There exists a constantC >0such that for alln∈Nand1≤r12

nthe following holds.Letz∈Z2 be such thatz ≥√

n.Then, Pz[Trn] ≤ C

log(n/r2).

(6)

Proof. Form≥1, letAmbe the event{τmz< Trτ(m+1)zn}. The family{Am}consists of pairwise disjoint events, and

Pz[Trn] ≤

m=1

P[Am].

For everym≥1, using the strong Markov property at timeτmz, Pz[Am] ≤Pz[τmzn] ·max

Px[Trτ(m+1)z]: mzxmz +1 . By Lemma8, there exist constantsC1, c2>0 such that

Pz[τmzn] ≤Pz

1maxjn

S(j )mzz

C1exp

c2m2 .

By Lemma10, for anyx∈Z2such thatmzxmz +1, Px[Trτ(m+1)z] ≤Px[Trτ2(mz+1)] ≤ c3

log(n/r2) for some constantc3>0. Summing over allm≥1,

Pz[Trn] ≤ c3 log(n/r2)

m=1

c1exp

c2·m2

.

2.4.2. Upper bound in two dimensions

The general scheme of the proof is quite simple. Here is how we efficiently store the information (this implies that the entropy cannot be too big). Divide the relevant area, which is roughly a√

n×√

nbox, to smaller blocks. Each of these blocks can have three states: empty (most common), full, or partial (least common). Record the state of all blocks.

The empty and full blocks need not be considered any further. The partial blocks are further divided into sub-blocks, and recorded recursively. We will show that this scheme requires an average of≈n/log2nbits, which will finish the proof.

Hence the crucial question is: how many partial blocks are there? Considerk×kblocks. For a block to be partial, the random walk needs to first hit it (this “costs” 1/logn), and then escape before covering the block completely. It is well known that the cover time of ak×ksquare is orderk2log2k, that is, it requires about log2k“visits” to the block (if you are unfamiliar with the notion of a visit, examine the proof of Lemma15, a visit is the time interval[τj, σj]).

Thus, the random walk needs to “escape” from our block, this costs another factor of 1/logn, but it has about log2k

“attempts” to do so. To conclude, the final probability of a typical block to be partial is order log2k/log2n.

This argument is spelt out in Lemmas15and17below. Lemma15contains the calculation above except for the very first 1/logn term, since there the starting point is close to the block. Lemma17contains the full calculation, with the main problem being estimating separately blocks which are close by and far away (Lemma13helps with the far away blocks).

Let us use this opportunity to repeat a point already made in theIntroduction. Our blocks have a typical hierarchical structure, with the blocks of levelj contained in blocks of levelj+1. Naïvely, at levelj the blocks we are interested in are blocks partially covered by the random walk, which are contained in partial block at level j+1, which are themselves contained in partial blocks at level j +2, etc. So it seems that estimating the number of such blocks requires going through this hierarchy. But it does not, because once a block is partially covered by the random walk, we get this property automatically for all its super-blocks. We can thus estimate the number of partially covered blocks in levelj directly. This simplifies the proof significantly.

We move to the details of the proof. Forz∈Z2andk∈N, let Q(z, k)=

z+ j, j

: −kj, jk

;

i.e.,Q(z, k) is the square of side length 2k+1 centered atz. For a path x(0), x(1), . . . , x(n)inZ2, we denote by x[s, t]the pathx(s), x(s+1), . . . , x(t ).

(7)

Lemma 13. There exist constantsc, C >0 such that for alln, k∈Nsuch that kn1/4,and allz∈Zd such that z ≥5√

n,

P

R(n)Q(z, k)=∅ ≤ C logn·exp

cz2 n

.

Proof. Letλ= z −2√

n. LetT be the first time the walkS(·)started at 0 hitsQ(z, k). Thenτλ< Tz,2k< T. By Lemmas8and12,

P

R(n)Q(z, k)=∅ ≤P[τλn] ·max

Px[Tz,2kn]: λxλ+1

≤P

1maxjn

S(j )λ · c1

logn

c2 logn·exp

c3z2 n

for some constantsc1, c2, c3>0.

Lemma 14. There exists a constantC >0such that the following holds.For alln, k∈Nsuch thatkn1/4,and all z∈Zdsuch that1≤ z<5√

n,

P

R(n)Q(z, k)=∅ ≤C·log(10√ n/z)

logn .

Proof. By adjusting the constant, we can assume without loss of generality thatz ≥3k. LetQ=Q(z, k). Define σ0=0, and fori≥1, define

σi=τ10i n=inf

t≥0: S(t)≥10in

. The event{R(n)Q=∅}is contained in the event

S[0, σ1] ∩Q=∅

i1

S[σi, σi+1] ∩Q=∅, σin .

Since 3k≤ z<5√

n, we have that on the event{S[0, σ1] ∩Q=∅}, the random walk started at 0 enters the ball of radius 2karoundzbefore exiting the ball of radius 20√

naroundz. Translating by minuszwe get by Lemma10that there exists a constantC1>0 such that

P

S[0, σ1] ∩Q=∅ ≤Pz[T2kτ20n] ≤C1·log(10√ n/z)

logn .

Fixi≥1. By Lemma8, P[σin] ≤P

0maxjn

S(j )≥10in

C2·exp

C3·102i for some constantsC2, C3>0. Using Lemma10again,

P

S[σi, σi+1] ∩Q=∅|σinC4 logn for some constantC4>0. Therefore,

P

R(n)Q=∅ ≤C1·log(10√ n/z)

logn +C2·C4 logn

i1

exp

C3·102i

.

We have reached the main geometric lemma.

(8)

Lemma 15. There exists a constantC >0such that the following holds.Letn, k∈N,letQ=Q(0, k)and letzQ.

Then, Pz

∂R(n)Q=∅ ≤C·log2k logn.

Proof. Without loss of generality assume that log2k≤logn. DefineQ+=Q(0, k+1). SoQ+ contains the union ofQwith all vertices that are adjacent toQ. Defineτ0=0, and inductively

σj=inf

tτj: S(t)≥10k , τj+1=inf

tσj: S(t)Q+ .

IfQ+R(n)then∂R(n)Q=∅. Thus, it suffices to bound from above the probability of the event{Q+R(n)}. It is convenient to choosem= logk·logn, as will soon be clear. SetVj= {σj+1σj2mn }andUj= {Q+R(σj)}.

We prove the following inclusion of events Q+R(n)

⊆ {σ0n/2} ∪Um

m1

j=0

(UjVj). (2.3)

Assume that the event on the right-hand side of (2.3) does not occur; i.e., assume thatσ0< n/2, thatUm, and that for all 0≤jm−1,UjVj. LetJ=min{0≤jm:Uj}. Consider the following cases:

• Case 1:J=0. ThenQ+R(σ0). Sinceσ0< n/2, we get thatQ+R(n).

• Case 2:J >0. Since we assumed thatUm, we know that 1≤Jm. By the assumptionm1

j=0(UjVj), we have thatσj+1σj< n/2m, for all 0jJ−1. Since we assumed thatσ0< n/2, we get that

σJ =σ0+

J1

j=0

σj+1σj< n.

ButJwas chosen so thatUJ occurs, soQ+R(σJ)R(n). This proves (2.3).

Fix j ≥0. SinceS(t)z2t is a martingale, we have that Ez[σjτj |F(τj)] ≤C1k2for some constant C1>0. Using Markov’s inequality,

Pz

σjτjn 4m

F(τj)

C2mk2

n (2.4)

for some constantC2>0. By Lemma11, there exists a constantC3>0 such that Pz

τj+1σjn 4m

F(σj)

C3

logn. (2.5)

The two inequalities, (2.4) and (2.5), imply that Pz

Vj|F(σj)C4

logn (2.6)

for some constantC4>0. Using Lemma9, there exists a universal constantC5>0 such that for anyxQ+, Pz

xS[τj, σj] |F(τj)C5

logk.

(9)

Thus,

Pz[Uj] =Pz

Q+R(σj) ≤min

1,Q+·(1C5/logk)j+1

≤min

1, C6k2exp

C5(j+1)/logk

(2.7) for some constantC6>0. Plugging (2.4), (2.6) and (2.7) into (2.3) yields

Pz

Q+R(n) ≤Pz[σ0n/2] +Pz[Um] + K

j=0

Pz[UjVj] +

j >K

Pz[UjVj]

C7

k2

n +nC8+ K

j=0

1

logn+

j >K

k2exp(−C5(j+1)/logk) logn

C9log2k

logn , (2.8)

whereK= 4 log2k/C5andC7, C8, C9>0 are constants.

Definition 16. DefineΛ(k)= {(2k+1)z: z∈Z2}.The collection {Q(z, k)}zΛ(k) consists of disjoint squares that coverZ2.Fork, n∈Nandz∈Z2,defineI (z, k, n)to be the indicator function of the event{∂R(n)Q(z, k)=∅}.

Define

M(k, n)=

zΛ(k)

I (z, k, n),

i.e.the number of squares that intersect∂R(n).

Lemma 17. There exists a constantC >0such that for everyk, n∈N, E

M(k, n)C·max

1, n

k2 ·log2k log2n

.

Proof. Fixk, n∈N. Forz∈Z2, the event{∂R(n)Q(z, k)=∅}implies the event

0maxjn

S(j )≥ z −√

2(k+1)

.

We start with an a priori bound. Using Lemma7, there exist constantsC1, C2>0 so that E

M(n, k)

zΛ(k)

P

z ≤ max

0jn

S(j )+√

2(k+1)

≤E

zΛ(k): z ≤ max

0jn

S(j )+√

2(k+1)

C1·max

1, k2·E

0maxjn

S(j )2

C2·max

1, n k2

,

where the third inequality holds as for every R >0, the size of {zΛ(k): zR}is at most a constant times max{1, k2·R2}. Thus, we can assume without loss of generality thatk < k+1≤(n−√

n)1/4n1/4. Let

τQ(z)=inf

t≥0:S(t)Q(z, k+1)

(we use that fact thatQ(z, k+1)is ‘bigger’ thanQ(z, k)), and let J (z, k, n)=1{τQ(z)nn}·I (z, k, n).

(10)

For allzΛ(k), a.s.

I (z, k, n)1{nn<τQ(z)n}+J (z, k, n).

Summing over allzΛ(k), a.s.

M(n, k)≤4√

n+

zΛ(k)

J (z, n, k). (2.9)

By the strong Markov property at timeτQ(z)and Lemma15, there exists a constantC3>0 such that a.s.

P

∂R(n)Q(z, k)=∅|τQ(z)n−√

nC3·log2k

logn. (2.10)

By Lemma14, ask < n1/4, there exists a constantC4>0 such that for allz∈Zdwith 1≤ z<5√ n,

P

τQ(z)n−√

nC4·log(10√ n/z)

logn ,

which implies P

J (z, k, n)C5·log2k

logn ·log(10√ n/z)

logn (2.11)

for some constantC5>0.

DenoteΓ =5√

n/(2k+1). Summing over allzΛ(k)such that 2≤ z<5√ n,

zΛ(k) 2z<5

n

log 10√

n/z

x,y∈Z 2x2+y22

log 2Γ /

x2+y2

C6Γ

2xΓ

log(2Γ /x)≤C7Γ2 (2.12)

for some constantsC6, C7>0. Plugging (2.12) into (2.11), and summing over allzΛ(k)such thatz<5√ n, we get

zΛ(k):z<5 n

P

J (z, k, n)C8·log2k log2n· n

k2 (2.13)

for some constantC8>0. In addition, by Lemma13, there exist constantsC9, C10>0 such that for everyzΛ(k) such thatz ≥5√

n,

P

τQ(z)n−√

nC9 logn·exp

C10z2 n

,

which implies, using (2.10), P

J (z, k, n)C11· log2k log2n·exp

C10z2 n

for some constantC11>0. Summing over allzΛ(k)such thatz ≥5√ n,

zΛ(k):z5 n

P

J (z, k, n)C11· log2k log2n

zΛ(k):z5 n

exp

C10z2 n

C12·log2k log2n· n

k2 (2.14)

for some constantC12>0. The lemma follows by (2.9), (2.13) and (2.14).

(11)

Fork < n∈N, let∂(k, n)be the vector(I (z, k, n))zΛ(k)∩[−2n,2n]2. Note that M(k, n)=

zΛ(k)

I (z, k, n)=

zΛ(k)∩[−2n,2n]2

I (z, k, n).

Lemma 18. Letk, , n∈Nand letk =(2+1)k+.Then, H

∂(k, n)| k, n

≤E M

k, n ·(2+1)2.

Proof. For anyzΛ(k), the squareQ(z, k)is of side length 2k +1=(2+1)(2k+1), and so Q(z, k)is a union of(2+1)2disjoint squares from the collection{Q(z, k)}zΛ(k).

IfQ(z, k)Q(z, k), thenI (z, k, n)I (z, k, n). Thus, conditioned on the vector∂(k, n), there are at most 2M(k,n)·(2+1)2 possibilities for the vector∂(k, n). By clause (i) of Proposition2, and by the definition of conditional

entropy,H (∂(k, n)|∂(k, n))≤E[M(k, n)·(2+1)2].

Lemma 19. There exists a constantC2>0such that for alln, H

R(n)

C2 n log2(n).

Proof. Since the vector ∂(0, n) determines R(n), clauses (ii) and (iii) of Proposition 2 yield that H (R(n))H (∂(0, n)).

Setk0=0, and forj ≥0, define inductivelykj+1=3kj +1. For everyj ≥1, since 3kjkj+1≤4kj, it holds that logkk j

j ≤9j3j. Letm >0 be the smallestj such thatkj> n. The vector∂(km, n)is constant, and so its entropy is zero. By Lemmas17and18, for 0≤jm−1, there exist universal constantsc2, c3>0 such that

H

∂(kj, n)|∂(kj+1, n)

c3·max

1, n

log2n·(j+1)2 9j+1

.

Using clause (iii) of Proposition2, there exists a constantC >0 such that

H

∂(0, n)

m1

j=0

H

∂(kj, n)|∂(kj+1, n) +H

∂(km, n)

C· n log2n.

Remark 20. The proof of Lemma19shows that provided one can calculate the various conditional probabilities(e.g., with unlimited computational power),one can sample the range of a random walk using only ordern/log2nbits.

3. Concluding remarks and problems for further research

3.1. Extracting entropy

Lemma5shows that the entropy ofR(n)in two dimensions is at leastc2n/log2n. It is interesting to note that one can extract order ofn/log2nalmost uniformly distributed random bits, by observing a sample of the range. We sketch the construction.

Consider the two configurations that appear in Fig.1. The walk can only enter the configuration from the “bridge”

on the right, so the situation is symmetric to vertical flips. This symmetry implies that conditioned on outside of the configuration, both have the same probability of occurring. Thus, any occurrence of such a configuration in the range of the random walk gives an independent bit, e.g., setting the bit to be 1 if the right configuration occurs, and 0 if the left configuration occurs. Considerations similar to those raised in the proofs above show that the expected number of such configurations is of ordern/log2n.

(12)

Fig. 1. Two symmetric configurations.×’s are vertices occupied by the range.

3.2. Intersection equivalence

Consider then×nsquare centered at 0 inZ2, and consider the following procedure. Divide the square into 4 squares of side lengthn/2. Retain each of the squares with probability 1/2, independently. Continue inductively: at levelk, divide each remaining square of side lengthn2(k1) into 4 squares of side lengthn2k, and retain each one with probabilityk/(k+1)independently.

This procedure produces a random subset of then×nsquare, denote this set byQ(n2). In [3], Peres shows that the setsQ(n2)andR(n2)areintersection equivalent; that is, there exist constantsc, C >0 such that for any setA⊂Z2,

c≤P[Q(n2)A=∅]

P[R(n2)A=∅] ≤C

from a random starting point. The entropyH (Q(n2))is of ordern2/log2(n), as isH (R(n2)). Note that intersection equivalence does not imply or follow from equal entropy. See [3] for more details.

3.3. Open questions

LetGbe an infinite graph, and let{S(n)}n0be a simple random walk onG. LetR(n)= {S(0), S(1), . . . , S(n)}be the range of the walk at timen. LetH (n)be the entropy ofR(n).

Our results above suggest the following natural question.

• How small canH (n)be in transient graphs? It is possible to construct (spherically symmetric) trees that are transient but haveH (n)=O(log2n). Is it possible to get a smaller entropy?

Note added in proof

David Windisch has some new results on this topic, including an answer to a question posed in previous versions of this paper, see [6].

Acknowledgements

We wish to thank Elchanan Mossel for indicating the configuration in the construction in Section3.1, and to Eric Shellef for helpful discussions. The material is based upon work supported by the National Science Foundation under agreement No. DMS-0835373. Any opinions, findings and conclusions or recommendations expressed in the material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation.

References

[1] T. M. Cover and J. A. Thomas.Elements of Information Theory. Wiley, New York, 1991.MR1122806 [2] G. F. Lawler.Intersections of Random Walks. Springer, New York, 1996.MR1117680

(13)

[3] Y. Peres. Intersection-equivalence of Brownian paths and certain branching processes.Comm. Math. Phys.177(1996) 417–434.MR1384142 [4] P. Révész.Random Walk in Random and Non-Random Environments. World Scientific, Hackensack, NJ, 2005.MR2168855

[5] D. Revuz and M. Yor.Continuous Martingales and Brownian Motion. Springer, Berlin, 1991.MR1083357

[6] D. Windisch. Entropy of random walk range on uniformly transient and on uniformly recurrent graphs. Preprint. Available athttp://arxiv.org/

abs/1001.0355.

Références

Documents relatifs

We prove a level 3 large deviation principle, under almost every environment, with rate function related to a relative

Key words: Firm epireflection, completion, weak injectivity, saturated object, absolutely closed object, complete object, factorization structure, topological

Pellikaan: Finite determinacy of functions with non-isolated singularities, Proc.

Theorem 2.2.. The necessary condition ii) holds true if the conductivity matrix is not symmetric. Several other results also extend to the non-symmetric case. However, the

First of all, due to the rectifica- tion theorem we prove (see Theorem 2.2) that any non-vanishing smooth gradient field ∇u is isotropically realizable locally in R d , in the

answer to the following question of the author (see [3]): is a soluble group polycyclic if all centralizers of its finitely generated subgroups.. are finitely

Suppose now that theorem 3 is valid for codimension q — 1 foliations and let ^ be a ^codimension q strong ^-foliation on X given by the equation a? 1 = 0,. , c^ are closed Pfaff

This work was partially supported by the strategic project POS- DRU 107/1.5/S/77265, inside POSDRU Romania 2007-2013 co-financed by the Euro- pean Social Fund-Investing in