• Aucun résultat trouvé

A Bilateral version of Shannon-Breiman-McMillan Theorem

N/A
N/A
Protected

Academic year: 2021

Partager "A Bilateral version of Shannon-Breiman-McMillan Theorem"

Copied!
8
0
0

Texte intégral

(1)

HAL Id: hal-00000923

https://hal.archives-ouvertes.fr/hal-00000923

Preprint submitted on 5 Dec 2003

HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de

A Bilateral version of Shannon-Breiman-McMillan Theorem

Pierre Tisseur

To cite this version:

Pierre Tisseur. A Bilateral version of Shannon-Breiman-McMillan Theorem. 2003. �hal-00000923�

(2)

ccsd-00000923 (version 1) : 5 Dec 2003

A Bilateral Version of the

Shannon-McMillan-Breiman Theorem

Pierre Tisseur

Laboratoire G´enome et Informatique Universit´e d’Evry, Tour Evry 2.

December 5, 2003

Abstract

We give a new version of the Shannon-McMillan-Breiman theorem in the case of a bijective action. For a finite partition α of a compact set X and a measurable action T on X, we denote by C

n,m,αT

(x) the element of the partition α ∨ T

1

α ∨ . . . ∨ T

m

α ∨ T

−1

α ∨ . . . ∨ T

−n

α which contains a point x. We prove that for µ-almost all x,

n+m→∞

lim −1

n + m

log µ(C

n,m,αT

(x)) = h

µ

(T, α),

where µ is a T -ergodic probability measure and h

µ

(T, α) is the metric entropy of T with respect to the partition α.

1 Introduction

The Shannon-McMillan-Breiman theorem [2], [3] is used in many problems

related to the metric entropy map of an ergodic measure. We extend this

well-known result to the case of a bijective dynamical system. Our proof

follows the line of Petersen’s proof [3]. We illustrate this new result with

an example that gives an inequality between shifts and cellular automata

entropies and some analog of the Lyapunov exponents. Our bilateral version

of the Shannon-McMillan-Breiman theorem is expected to be useful in other

areas of dynamical systems.

(3)

2 Background material

Let X be a compact space, µ a probability measure on X and T a measurable map from X to X. We denote by α a finite partition of X and by C

n,αT

(x) the element of the partition α ∨ T

−1

α ∨ . . . ∨ T

−n

α which contains the point x. For all point x the information map I is defined by

I(α)(x) = − log µ(C

αT

(x)) = X

A∈α

− log µ(A)χ

A

(x),

where C

αT

(x) is the element of α which contains x and χ

A

is the characteristic function defined by

χ

A

(x) =

1 if x ∈ A, 0 otherwise.

The information map satisfies

I(α ∨ β) = I (α) + I(β|α), (1)

I (T α|T β) = I (α|β) ◦ T

−1

. (2) These two properties are easily proved from the definition of I and the fact that T is a surjective map. We refer to [3, p. 238], [4, Chap.8] for a detailed proof of (1Background materialequation.1) and (2Background materialequation.1).

A simple formulation of the metric entropy with respect to the partition α is given by

h

µ

(T, α) = lim

n→∞

Z

X

I(α| ∨

nk=1

T

k

α)(x) dµ(x), where

I(α|β)(x) = − X

A∈α,B∈β

χ

A∩B

(x) log

µ(A ∩ B) µ(B)

is the conditional information map representing the quantity of information given by the partition α knowing the partition β about the point x.

We recall the Shannon-McMillan-Breiman theorem [2] [3].

Theorem 1 (Shannon-McMillan-Breiman’s theorem) If µ is a T -ergodic measure, then for µ-almost all x in a compact X we have

n→∞

lim

−1

n log µ(C

n,αT

(x)) = h

µ

(T, α).

(4)

3 A bilateral version of Shannon-McMillan- Breiman’s theorem

In order to prove the main result (Theorem 2) we need to expose two technical lemmas. The proof of Lemma 2 and Theorem 2 requires a bilateral version of the Birkhoff pointwise ergodic theorem: for a T -ergodic measure µ one has

n+m→∞

lim 1 n + m + 1

n

X

k=−m

f ◦ T

k

(x) = Z

X

f (x)dµ(x)

for almost all x with a map f in L

1

. This result is easily deduced from the Birkhoff pointwise ergodic theorem (see [4, Chap.10]) by breaking up the infinite sum in two proportional parts.

Lemma 1 For all integers m and n we have

n−1

X

k=−m

I(α| ∨

n−kj=1

T

−j

α) ◦ T

k

= I (∨

mj=−n

T

j

α) − I(T

−n

α).

Proof. Note that I(∨

mj=−n

T

j

α) = I(∨

m+nj=0

T

m−j

α). Using (1Background materialequation.1) we have

I(∨

m+nj=0

T

m−j

α) = I(∨

m+nj=1

T

m−j

α) + I(T

m

α| ∨

m+nj=1

T

m−j

α) and from (2Background materialequation.1) we get

I(T

m

α| ∨

m+nj=1

T

m−j

α) = I(α| ∨

m+nj=1

T

−j

α) ◦ T

−m

. Hence,

I(∨

mj=−n

T

j

α) = I(∨

m+nj=1

T

m−j

α) + I (α| ∨

m+nj=1

T

−j

α) ◦ T

−m

. The same operations on I(∨

m+nj=1

T

m−j

α) yields

I(∨

m+nj=1

T

m−j

α) = I(∨

m+nj=2

T

m−j

α) + I(α| ∨

m+nj=2

T

1−j

α) ◦ T

−m+1

= I(∨

m+n−1j=1

T

m−1−j

α) + I(α ∨

m+n−1j=1

T

−j

α) ◦ T

−m+1

. Hence,

I(∨

mj=−n

T

j

α) = I(∨

m+n−1j=1

T

m−1−j

α)

+ I (α ∨

m+n−1j=1

T

−j

α) ◦ T

−m+1

+ I(α| ∨

m+nj=1

T

−j

α) ◦ T

−m

.

(5)

Iterating similarly t − 1 times on I(∨

m+n−1j=1

T

m−1−j

α) leads to I(∨

mj=−n

T

j

α) = I(∨

m+n−tj=1

T

m−t−j

α) +

t

X

k=0

I (α| ∨

m+n−kj=1

T

−j

α) ◦ T

−m+k

. Taking t = m + n − 1 gives

I(∨

mj=−n

T

j

α) = +I(T

−n

α) +

m+n−1

X

k=0

I (α| ∨

m+n−kj=1

T

−j

α) ◦ T

−m+k

which completes the proof.

Lemma 2 If µ is a T ergodic measure then for almost all x in X,

m+n→∞

lim 1 m + n + 1

n−1

X

k=−m

s→∞

lim I(α| ∨

sj=1

T

−j

α) ◦ T

k

(x) =

m+n→∞

lim 1 m + n + 1

n−1

X

k=−m

I(α| ∨

n−kj=1

T

−j

α) ◦ T

k

(x).

Proof. For notational convenience, we introduce f = lim

s→∞

I(α| ∨

sj=1

T

−j

α) and F

N

= sup

s≥N

|I(α| ∨

sj=1

T

−j

α) − f|.

It is well known that the sequence (I(α| ∨

sj=1

T

−j

α))

s∈N

converge almost everywhere and in L

1

. The proof of this convergence (see [4, Chap.8] and [3, p.262]) requires the increasing martingale theorem.

We need to show that

m+n→∞

lim 1 m + n + 1

n−1

X

k=−m

I (α| ∨

n−kj=1

T

−j

α) ◦ T

k

− f ◦ T

k

= 0.

Note that 1 m + n + 1

n−1

X

k=−m

|I(α| ∨

n−kj=1

T

−j

α) ◦ T

k

− f ◦ T

k

| ≤ 1

m + n + 1

n−1

X

k=n−N

|I(α| ∨

n−kj=1

T

−j

α) ◦ T

k

− f ◦ T

k

|

+ 1

m + n + 1

n−N−1

X

k=−m

|F

N

| ◦ T

k

.

(6)

If we fix N and let n +m tend to infinity then the first term in the right-hand side of the above inequality goes to zero. Since the map F

N

belongs to L

1

(see [3]), the bilateral version of Birkhoff’s ergodic theorem applies and we can assert that

m+n→∞

lim 1 m + n + 1

n−N−1

X

k=−m

|F

N

| ◦ T

k

= Z

X

F

N

dµ.

Since lim

N→∞

F

N

= 0, the dominated convergence theorem implies that R

X

F

N

dµ tends to zero which completes the proof.

Theorem 2 For a bijective map T from X to X and a T -ergodic measure µ, we have for µ-almost all x

n+m→∞

lim

−1

n + m log µ(C

n,m,αT

(x)) = h

µ

(T, α),

where C

n,m,αT

(x) represents the element of the partition α ∨ T α . . . ∨ T

m

α ∨ T

1

α ∨ . . . ∨ T

−n

α containing the point x.

Proof. Since the sequence I(α| ∨

sj=1

T

−j

α)

s∈N

converges to a L

1

map and by using the dominated convergence theorem, it follows that

h

µ

(T, α) = lim

s→∞

Z

X

I (α| ∨

sj=1

T

−j

α)(x) d

µ

(x) = Z

X

s→∞

lim I (α| ∨

sj=1

T

−j

α)(x) d

µ

(x) for µ-almost all x. The bilateral version of Birkhoff’s ergodic theorem implies that for almost all x

h

µ

(T, α) = lim

m+n→∞

1 m + n + 1

n−1

X

k=−m

s→∞

lim I (α| ∨

sj=1

T

−j

α) ◦ T

k

(x).

From Lemma 2 it follows that h

µ

(T, α) = lim

m+n→∞

1 m + n + 1

n−1

X

k=−m

I(α| ∨

n−kj=1

T

−j

α) ◦ T

k

(x).

Using Lemma 1 for almost all x we obtain h

µ

(T, α) = lim

m+n→∞

1

m + n + 1 I(∨

mj=−n

T

j

α)(x) − I(T

−n

α)(x)

.

(7)

Since (I(T

−n

α))

n∈N

is bounded for µ-almost all point x, the sequence

I(Tn+m+1nα)

tends almost surely to zero. Hence,

h

µ

(T, α) = lim

m+n→∞

1

m + n + 1 I(∨

mj=−n

T

j

α)(x)

= − lim

m+n→∞

1

m + n log µ(C

n,m,αT

(x)).

4 An illustration

In this example we do not give a definition of the particular discrets dy- namical systems called cellular automata; the reader can find a survey in [1]

and the complete proof of this illustration in [5]. The bilateral version of the Shannon-McMillan-Breiman theorem is needed to establish an inequal- ity between the entropy of a cellular automaton F denoted by h

µ

(F, α), the entropy h

µ

(σ) of a particular bijective cellular automaton σ called the shift, and some discret analog of the Lyapunov exponents. Here the measure µ is F -invariant and σ-ergodic.

The standart Shannon-McMillan-Breiman theorem [4, Chap.10] says that in the case of an invariant measure µ.

h

µ

(F, α) = Z

n→∞

lim

−1

n log µ C

n,αF

(x) .

In [5] one proves that there exists some integer and bounded maps f

n

and g

n

such that, for all point x, one has

C

n,αF

(x) ⊃ C

fσn(x),gn(x),α

(x)

with lim

n→∞

f

n

(x) + g

n

(x) = +∞ for µ-almost all point x for a certain class of cellular automata. For those that do not belong to this class, the entropy is equal to zero (see [5]). With these properties we obtain

h

µ

(F, α) ≤ Z

n→∞

lim

−1

n log µ C

fσn(x),gn(x),α

(x) dµ(x) and

h

µ

(F, α) ≤ Z

lim inf

n→∞

−1

f

n

(x) + g

n

(x) + 1 log µ C

fσn(x),gn(x),α

(x)

× g

n

(x) + f

n

(x) + 1

n dµ(x).

(8)

The bilateral version of the Shannon-McMillan-Breiman theorem implies that h

µ

(F, α) ≤ h

µ

(σ, α)

Z

lim inf

n→∞

f

n

(x) + g

n

(x) + 1

n .

Using the Fatou lemma, we have

h

µ

(F, α) ≤ h

µ

(σ, α) × (λ

+µ

+ λ

µ

), where

λ

+µ

= lim inf

Z f

n

(x)

n dµ(x) and λ

µ

= lim inf

Z g

n

(x) n dµ(x) are called the left and right average Lyapunov exponents.

References

[1] Franc ¸ois Blanchard, Petr Kurka, and Alejandro Maass, Topological and measure-theoretic properties of one-dimensional cellular automata, Phys. D, 103 (1997), pp. 86–99. Lattice dynamics (Paris, 1995).

[2] Leo Breiman , The individual ergodic theorem of information theory , Ann. Math. Stat., 28 (1957), pp. 809–811.

[3] Karl Petersen , Ergodic theory , Cambridge Studies in Advanced Mathematics, 2, Cambridge University Press, Cambridge, UK, 1983.

[4] Mark Pollicott and Michiko Yuri, Dynamical systems and er- godic theory, vol. 40 of London Mathematical Society Student Texts, Cambridge University Press, Cambridge, UK, 1998.

[5] Pierre Tisseur, Aspects ergodiques des automates cellulaires, PhD

thesis, Universit´e d’Aix-Marseille II, 1999.

Références

Documents relatifs

A regular binary tree T having s as generating sequene of leaves, is given in.

In a second step, we will use an analogue of the well-known closed subgroup theorem (also called Cartan theorem for finite dimensional Lie groups) which, in the Banach Lie context

In Section 2, we recall the notions of isomorphic left D-modules and equivalent matrices, as well as the effective version of Fitting’s result and the statement of Warfield’s

Below, we briefly mention four classical beta ensembles, namely the Hermite (Gaussian) beta ensemble, the Circular beta ensemble, the Laguerre (Wishart) beta ensemble, and the

In fact, we shall prove that there exist a symmetric singular measure for which the convolution is absolutely continuous with continuous Radon-Nikodym derivative.. Our

The proof relies on upper estimates for the variance of the Bessel point process and the Plancherel measure, which are obtained using the classical contour integral representations

Indeed, it has been proved in [7] that in general the empir- ical measure of Y t does not satisfy a large deviations principle, and in the special case it does (which depends on the

Keywords: Sum rules, Szeg˝ o’s theorem, Verblunsky coefficients, matrix measures on the unit circle, relative entropy.. A sum rule is an identity between an entropy-like functional