• Aucun résultat trouvé

Forbidden Words in Symbolic Dynamics

N/A
N/A
Protected

Academic year: 2022

Partager "Forbidden Words in Symbolic Dynamics"

Copied!
31
0
0

Texte intégral

(1)

doi:10.1006raama.2000.0682, available online at http:rrwww.idealibrary.com on

Forbidden Words in Symbolic Dynamics

1

M.-P. Beal´

Institut Gaspard Monge, Uni¨ersite de Marne la Vallee, 5 Bd Descartes,´ ´ Champs-sur-Marne, F-77454 Marne la Vallee, France´

E-mail address: [email protected]

F. Mignosi, A. Restivo, and M. Sciortino

Dipartimento di Matematica ed Applicazioni, Uni¨ersity of Palermo, Via Archirafi 34, 90123 Palermo, Italy

E-mail address: [email protected], [email protected], [email protected].

Received January 26, 2000; accepted February 17, 2000

We introduce an equivalence relation,between functions from to . By describing a symbolic dynamical system in terms of forbidden words, we prove that the,-equivalence class of the function that counts the minimal forbidden words of a system is a topological invariant of the system. We show that the new invariant is independent from previous ones, but it is not characteristic. In the case of sofic systems, we prove that the,-equivalence of the corresponding functions is a decidable question. As a more special application, we show, by using the new invariant, that two systems associated to Sturmian words having ‘‘different slope’’

are not conjugate.

2000 Academic Press

Key Words:symbolic dynamics; combinatoric on words; automata and formal languages.

1. INTRODUCTION

In this paper, we present a new topological invariant for symbolic dynamics. The techniques we use, and some complementary results, are from combinatorics on words and from the theory of automata and formal languages. Indeed, there are deep connections between these theories and

Ž w x .

symbolic dynamics cf. 6 and references therein . Several notions from symbolic dynamics have a natural interpretation in terms of formal lan-

1Partially supported by MURST project: Modelli di calcolo inno¨ati¨i: metodi sintattici e combinatori.

163

0196-8858r00 $35.00

Copyright

2000 by Academic Press All rights of reproduction in any form reserved.

(2)

guages, and conversely. Each theory can benefit, in terms of ideas, con- cepts, and techniques, from the results in the other one.

One of the classical ways of describing a symbolic dynamical system is by using forbidden words. A symbolic dynamical system is the set of all two sided-infiniteor bi-infinitesequences that do not contain as factors words in

Ž w x.

a fixed set FF cf. 24 . The set FF may be finite or infinite. Several different sets of forbidden words can lead to the same symbolic dynamical system SS. If one of such sets is finite, then SS is called of finite type. Given a symbolic dynamical system SS, it is possible to associate to SS the set

Ž .

M SS of the forbidden words that have no proper factor that is forbidden.

Ž .

The elements of M SS are called minimal forbidden words. Sometimes it is more useful to describe and to study a symbolic dynamical system by specifying and analyzing the words that are allowed rather than the ones

Ž .

that are forbidden. We denote by L SS the set of all allowed factors of Ž .

sequences of SS. It is possible to prove that L SS is a factorial and

Ž . Ž .

extensible language cf. Section 2.1 . Moreover, L SS uniquely character-

Ž w x.

izes the symbolic dynamical system SS cf. 24, Proposition 1.3.4 . In

Ž . Ž .

Section 2, we see that there is a strong relation between L SS and M SS . Two symbolic dynamical systems SS and TTare conjugateif there exists a conjugacy from SS into TT, i.e., a k-block map that is bijective. A property of a system SS that is preserved under conjugacy is said to be a topological in¨ariant or conjugacy in¨ariant of SS. The conjugacy problem or isomor- phism problem consists in deciding whether two given symbolic dynamical systems are conjugate or not, and it is still an open problem even for systems of finite type. If there exists a topological invariant that is different for two symbolic dynamical systems, then they are not conjugate. Several topological invariants have been found, such as topological entropy and

Ž .

growth rate of complexity function, that are related to L SS , and zeta function that takes into account the elements of a given period.

Ž . In this paper, we show a new invariant that is related to M SS . Moreover, our main theorem has as a particular case the classical result that the notion ‘‘of finite type’’ is a topological invariant.

If f and g are two functions from⺞to⺞, we write f$g if there exists a constant K)0 such that

K

nGK, f nŽ .FK

Ý

g nŽ qi..

isyK

We say that f and g are linearly equi¨alent, and we write f,g, if f$g and g$f.

It is easy to verify that,is an equivalence relation.

(3)

Ž . Ž . We denote by FSS n the function that counts the elements of M SS of length n. Our main result states that if two symbolic dynamical systems SS and TT are conjugate, then the two functions FSS and FTT, that count,

Ž . Ž .

respectively, the elements of M SS and M TT , are linearly equivalent. In other words, we prove that the,-equivalence class of FSS is a topological invariant of SS. This result provides a new tool to show that some systems

Ž .

are not conjugate. In the case of sofic systems cf. Section 2.1 , we prove that it is decidable to establish whether, given two symbolic dynamical systems SS and TT, the functions FSS and FTT are linearly equivalent.

Moreover, we prove that the new invariant is independent from previous ones; indeed, we give examples of sofic systems such that the functions of minimal forbidden words are not linearly equivalent but they have the same well-known topological invariants such as zeta function, topological entropy, and growth rate of complexity function of allowed words. Never- theless, this new invariant is not characteristic. This means that there exist systems SS and TT that are not conjugate but that have the same,- equivalence class of FSS and FTT. As a more special application of our main result, we give a combinatorial proof that two systems associated to Sturmian words having ‘‘different slope’’ are not conjugate.

In view of the relationship between symbolic dynamics and the theory of automata and formal languages, we note that the new topological invariant suggested the idea of a data compression scheme based on forbidden

w x w x

words and described in 13 . Moreover, in 12 is shown a close relation between forbidden words and factor automata. Minimal forbidden words have been also considered in the study of complexity in the framework of a

Ž w x.

hierarchical modeling of physical systems cf. 2 . The sketch of some of w x

the results of this paper also appears in 5 .

2. SYMBOLIC DYNAMICS

Symbolic dynamics is a field born at the beginning of the 1920s with the Ž w x.

work in topology of Marston Morse cf. 26 . Later, the theory was developed as a branch of ergodic theory. There are deep connections between the theory of automata and formal languages and symbolic

Ž w x .

dynamics cf. 27, 6 and references therein . Several results from symbolic dynamics have a natural interpretation in terms of formal languages and conversely.

We present in this section a short introduction to the basic concepts of symbolic dynamics and to their relations with the theory of formal languages and finite automata. Basic definitions and notations are from

(4)

w x24 . For any other notations not explicitly defined here, we refer to 3, 8,w x

24, 25 .

Let Abe a finite alphabet and A* the set of finite words of letters from

q 4

A, the empty word ⑀ included. We denote by A the set A*_ ⑀ . Let A be the set of bi-infinite sequences of letters of A. An element of

Ž .

A ,xs xi igZ, xigA, is called a two sided-infinite word. The set A is also called the full A-shift.

The shift map ␴ is a function defined on A. It associates to x the elementys␴Ž .x defined by the rule yisxiq1 for any integer i.

A two sided-infinite word xgA a¨oids a set of words FF;A* if no factor of x belongs to FF. We denote by SSFF the set of all ygA which avoid FF. Therefore, we can think of FF as the set of forbidden factors of the words of SSFF.

A symbolic dynamical systemorshift spaceis a subset SS of Asuch that S

SsSSFF for someFF;AAq.

The set FF may be finite or infinite. In any case, it is at most countable since Aq is countable.

The shift map␴SS is the restriction to SS of the shift map ␴ on the full shift.

If a symbolic dynamical system X is contained in a symbolic dynamical system Y, we say that X is a subshift of Y. For example, the full A-shift

Ž .

A is a symbolic dynamical system we can take FFs⭋. Another less trivial example is the following.

4

EXAMPLE1. Let SS be the set of two sided-infinite words on As a,b such that the letter b is always followed by the lettera. Then SSsSSb b4 is a shift space often called golden mean shift.

2.1. Symbolic Dynamics and Formal Languages

Given a symbolic dynamical system SS, it is possible to associate two languages: the set

MŽSS.s

F

FFsuch that SSsSSFF

4

␭g⌳

Ž .

and the set L SS of all factors of sequences in SS, i.e.,

4

LŽSS.s wgA* such thatxgSSNwis factor ofx . Ž .

It is easy to prove that SSsSSSS. and that any element of M SS has the Ž .

property that any proper factor of it belongs to L SS . The elements of Ž .

M SS are called minimal forbidden words. By definition, we observe that Ž .

any symbolic dynamical system SS is uniquely specified by M SS .

(5)

Ž .

It is easy to prove that L SS is a factorial language, i.e., ᭙u,¨gA*, u¨gLŽSSu,¨gLŽSS., and, furthermore it is an extensible language, i.e.,

᭙¨gLŽSS.,

there exist x,ygAsuch that x¨gLŽSS.and¨ygLŽSS..

Ž . Ž w

So the language L SS also uniquely characterizes the system SS cf. 24, x.

Proposition 1.3.4 . Hence, it is possible to describe the system SS by specifying the forbidden words or the allowed factors. The following two

Ž . Ž .

remarks show that there are strong connections between L SS and M SS . Ž . REMARK 2. From an algebraic point of view, we observe that A*_L SS

Ž .

is a two sided ideal of the free monoid A* and M SS is the base of this ideal, i.e.,

A*_LŽSS.sA*MŽSS.A*.

Hence,

LŽSS.sA*_A*MŽSS.A*, Ž .1

Ž . Ž .

and so M SS uniquely characterizes L SS .

Ž .

Conversely, the following remark shows that L SS also uniquely charac- Ž .

terizes M SS .

Ž .

REMARK3. A word¨sa a1 2. . .an belongs to M SS if and only if the two conditions hold:

¨ is forbidden i.e.,Ž ¨fLŽSS..,

䢇 both a a1 2. . .any1gLŽSS. and a a2 3. . .angLŽSS. Žthe prefix and Ž ..

the suffix of ¨ of length ny1 belong to L SS . Hence, we have that

MŽSS.sALŽSS.lLŽSS.Al

Ž

A*_LŽSS.

.

. Ž .2

Ž . Ž . Ž .

From the equalities 1 and 2 , it follows that M SS uniquely character-

Ž . Ž . Ž .

izes L SS and L SS uniquely characterizes M SS , respectively.

Recall that a language L;A* is rational if it is recognized by a finite

Ž w x. Ž . Ž .

state automaton cf. 17 . From the equalities 1 and 2 , one also derives

Ž . Ž . Ž .

that L SS and M SS are simultaneously rational, i.e., L SS is rational if Ž .

and only if M SS is a rational language.

(6)

A symbolic dynamical system SS is of finite type if SSsSSFF for some finite set FF;AAq. An equivalent definition is that SS is of finite type if and

Ž .

only if M SS is finite.

A symbolic dynamical system SS is soficif SSsSSFF for some rational set

q Ž .

F

F;A . Since SSsSSS., SS is sofic if M SS is rational or, according to

Ž . Ž . Ž .

relations between L SS and M SS , if L SS is rational. It is clear that a system of finite type is sofic. The converse is not true, as shown by the following example.

EXAMPLE 4. Let SS;A be the set of two sided-infinite words on

4

As a,b such that between any two b’s there are an even number of a’s.

We can take

2nq1 4

F

Fs ba b: nG0 sMŽSS..

Then SS is sofic but, since FF is infinite, it is not of finite type. The system is also called e¨en shift.

2.2. Symbolic Dynamics and Finite Automata

In this subsection, we introduce the notion of conjugacy and we show certain deep connections between symbolic dynamics and the theory of finite automata.

Let k be an integer greater than or equal to 1. A k-block map orŽ k-local function. ␺ from a system SS into a system TT is defined by a

k Ž .

partial function f from A into B, by two integers m for memory and a Žfor anticipation with. mqask, and satisfies, for any integer i,

Ž

Žan.ngZ

.

isf aŽ iymq1aiymq2. . .aiy1a ai iq1. . .aiqa..

A k-block map ␺: SSªTT is a conjugacy from SS to TT if it is invertible, i.e., if there exists a block map␾ from TT to SS such that

␾ ␺

Ž

Ž .x

.

sx for all xgSS and ␺ ␾

Ž

Ž .y

.

sy for all ygTT.

In this case, we say that the two systems are conjugate. It is easy to prove that the shift map␴ is a conjugacy.

Ž .

We denote by Q,E a finite automaton, whereQis the set of the states and E is the set of the edges labeled in a finite alphabet. Moreover, we suppose that all states are both initial and terminal.

It is obvious that a sofic system can be also defined as the set of bi-infinite labels of bi-infinite paths on a finite automaton. In this case, we say that the system is recognized by the automaton.

Ž .

A transducer is an automaton Q,E with a set of edges E labeled by

(7)

A=B, where Aand Bare two finite alphabets. We, as usual, write a label

Ž .

aNb, agA,bgB, instead of writing the ordered couple a,b. The input of the transducer is the automaton labeled by A and obtained from the transducer after removing the second components of the labels of the edges. The output of the transducer is obtained from the transducer after removing the first components of the labels of the edges. It is an automa- ton labeled by B.

Without loss of generality eventually by using a composition with aŽ power of the shift , we can assume that a. k-block map ␺ defined by a partial function f from Ak into B, has memory k and no anticipation.

Ž .

Hence, ␺ can be represented, for any integer n with nG ky1 , by a

Ž . Ž .

transducer labeled by A=B,Tns Q,E , where Qs a a1 2. . .an ,aig A4 and E is the set of the edges

Ž .

cNf anykq2. . .a cn 6

a a . . .a a a . . .a c ,

Ž 1 2 n. Ž 2 3 n .

for all ai, cgA.

Ž .

We can remark that if uN¨ is the label of a bi-infinite path of a

Ž Ž . Ž ..

transducer, then ␴ u N␴ ¨ also. We say that Tn is a representation of

Ž . Ž .

␺: any bi-infinite path of Tn labeled uN¨ with ugSS verifies ¨s␺ u. 2.3. Topological In¨ariants

It is interesting and useful that the combinatorial notions of symbolic dynamics such as symbolic dynamical system,k-block map, conjugacy, have a natural topological interpretation.

Let us endow A with the discrete topology and A with the product topology. Therefore, A is a compact space. This topology can be defined by the distance

yk

4

2 if x/yand ksmax nG0Nxisyi,ynFiFn , d x,Ž y.s

½

0 if xsy,

with x, ygA and using the convention that ks y1 if x0/y0.

Ž w x.

It is known cf. 24 that a subset XX of A is a symbolic dynamical system if and only if it is closed for the previous topology and invariant under the shift ␴XX.

A homomorphismbetween two systems SS and TT is a map ␾: SSªTT which is continuous and commutes with the shift, i.e., such that

TT(␾s␾(␴SS.

A k-block map is continuous and commutes with the shift. The

Ž w x w x.

Curtis᎐Lyndon᎐Hedlund theorem cf. 24 , also in 22 assures that any

(8)

continuous map from A into B that commutes with the shift is a k-block map for some integer k.

If a homomorphism is one-to-one and onto, its inverse map is continu- ous. Then it is called a topological conjugacy or topological isomorphism.

From the Curtis᎐Lyndon᎐Hedlund theorem, it follows that the notion of topological conjugacy and the combinatorial notion of conjugacy are equivalent. Hence, in the sequel we will use the term topological conjugacy as well as conjugacy.

A property of a system SS that is preserved under conjugacy is said to be a topological in¨ariantor conjugacy in¨ariantof SS. The conjugacy problem or isomorphism problem consists in deciding whether two given symbolic dynamical systems are conjugate or not. It is still an open problem even for systems of finite type. If there exists a topological invariant that is different for two symbolic dynamical systems, then they are not conjugate. Several topological invariants have been found, such as topological entropy, growth rate of complexity function of allowed words, and zeta function.

Recall that the topological entropy of a symbolic dynamical system SS is defined as

hŽSS.slim sup 1 log2

Ž

fSSŽ .n

.

,

nª⬁ n

Ž . Ž Ž . n. Ž .

where fSS n sCard L SS lA is called a complexity functionof L SS . Ž .

Since L SS is a factorial and extensible language, lim sup can be substi- tuted by lim.

Ž . w x

Another topological invariant related to L SS is given in 18 . It is stated that if SS and TT are conjugate, then there exists a constant c)0 such that, for nGc,

fTTŽnyc.FfSSŽ .n FfTTŽnqc.. Ž .

Hence, the growth rate of complexity function of L SS is preserved by topological conjugacy. Moreover, it is possible to observe that this last invariant is stronger than topological entropy, but it is the same in the case of sofic systems which have a nonzero entropy.

The zeta function is a topological invariant of symbolic dynamical systems which takes into account the number of elements of a given period. In fact, let SS be a symbolic dynamical system and let Pnsxg

nŽ . 4

S

S: ␴ x sx be the set of points of period dividing n. The zeta function of SS is defined as

pn

SSŽ .z sexp

Ý

zn,

n)0 n

(9)

Ž .

where pnsCard Pn . It is possible to prove that the zeta function is stronger than topological entropy for sofic systems.

However, none of the previous invariants is characteristic. For instance, there exist systems SS and TT that are not conjugate but that have the same topological entropy, the same zeta function, and the same growth rate of

Ž .

complexity function of allowed words cf. Example 21 .

3. LINEARLY EQUIVALENT FUNCTIONS

In this section, we introduce an equivalence relation between functions from ⺞to ⺞, the linear equivalence,, and we show some properties of this relation. In particular, in the special case of functions corresponding to ⺞-rational series, we give an effective characterization of the pairs of linearly equivalent functions. Further, we show that the notion of linear equivalence is strictly weaker than the classical notion ‘‘to have the same rate of growth.’’

Let f, g be two functions from⺞to⺞. We write f$g if there exists a constant K)0 such that

K

nGK, f nŽ .FK

Ý

g nŽ qi..

isyK

We say that f and g are linearly equi¨alent and we write f,g if f$g and g$f.

It is easy to verify that,is an equivalence relation. Moreover, we can observe that,satisfies some properties.

PROPOSITION5. Let f1, f2, g1, g2, f be functions fromto⺞. Ž .i If f1,f and g2 1,g2,then f1qg1,f2qg2.

Ž .ii If c is a positi¨e constant, then f,cf.

Proof. Ž .i As f1,f2, there exists Kf such that

Kf

nGKf, f n1Ž .FKf

Ý

f2Žnqi.

isyKf

and

Kf

f2Ž .n FKf

Ý

f n1Ž qi..

isyKf

(10)

Since g1,g2, there exists Kg such that

Kg

nGKg, g n1Ž .FKg

Ý

g2Žnqi.

isyKg

and

Kg

g2Ž .n FKg

Ý

g n1Ž qi..

isyKg

4

Let us set Ksmax Kf,Kg . Hence, ᭙nGK,

Kf Kg

f n1Ž .qg n1Ž .FKf

Ý

f2Žnqi.qKg

Ý

g2Žnqi.

isyKf isyKg

K

FK

Ý Ž

f2Žnqi.qg2Žnqi.

.

.

isyK

Analogously, one can prove the other inequality.

Ž .ii If c)1, then it is sufficient to take Ksc; if c-1, then we can take Ks1rc.

We now recall the notion of ⺞-rational sequence. A sequence of Ž .

nonnegative integers fn ng⺞ is said to be⺞-rationalif fn is the number of paths of length n going from an initial state to a final state in a finite

Ž .

automaton. The series f associated to a sequence fn ng⺞ is defined by

Ž . n

f zf zn . The series associated to an⺞-rational sequence is said to be ⺞-rational. By a slight abuse of notation, we also denote by f the function from⺞to⺞which associates fn to n. With this convention, f nŽ . and fn denote the same nonnegative integer. The following result can be

w x

found in 30, Theorem 10.1, p. 61 . This theorem is proved for the set of Ž w x.

q-rational series see 30 that contains the set of⺞-rational series.

PROPOSITION6. Let f be an ⺞-rational series which is not a polynomial.

The poles of f of minimal modulus are of the form r␳, where r is a strictly positi¨e real andis a root of unity.The number r is one of these poles and the multiplicity of any pole ris at most the multiplicity of r.

w

The following proposition and corollary can be found in 30, Theorem x w x

10.2, p. 62 , also in 9 . Both these two results are proved for⺢q-rational series; therefore, they also hold for⺞-rational series. The following propo- sition says that any⺞-rational series is a merge of⺞-rational series which

(11)

have a dominating root. Notice also that this result characterizes the

Ž w x.

⺞-rational series cf. 30, Theorem 10.5, p. 64 . Ž .

PROPOSITION7. Let fn ng⺞ be an ⺞-rational sequence.Then there are natural numbers m, p such that,if 0FjFpy1, then

fmqjqn psP njŽ .␣jnq

Ý

PjiŽ .njin,

i

< <

wherej is a nonnegati¨e real,ij)maxiji, and the Pj’s and the Pji’s are nonzero polynomials. Furthermore, m, p, and Pj’s and the Pji’s, thej’s and theji’s are computable.

w x

Proof. We briefly recall the proof given in 30, Theorem 10.2, p. 62 . Ž .

Let f be the series associated to fn ng⺞. If f is not a polynomial, there

Ž .

is a computable integer msuch that

fmqns

Ý

Q niŽ .␭ni,

i

where the Qi’s are computable polynomials in ⺓, and where the ␭yi 1 are the poles of the series. Furthermore, the degree of Qiis the multiplicity of

y1 Ž w x

the root ␭i as pole of the series f minus one see 30, Lemma 9.7, p. 56 w x.

or 9, p. 58 . This decomposition of f is called the decomposition in exponential polynomials. Let p be the smallest positive integer such that

p < <p

i s ␭i , for every index i. Let1, . . . ,␣K be the distinct values of the numbers ␭ip. Then

K

j n

fmqjqn psk

Ý

s1

ž

iN␭

Ý

ps␣iQiŽjqnp.

/

k

i k

K

s

Ý

QjkŽ .nkn.

ks1

The numbers ␣k such that Qjk is nonzero are the inverses of the poles of

Ž . n

the series f zn fmqjqn pz , which is⺞-rational. We use then Proposi- tion 6 to conclude: every number ␣k such that Qjk is nonzero and whose modulus is maximal is of the form ␣␳, where ␣ is a positive real and ␳ is a root of unity. By the definition of the number p, we get ␳s1.

Ž .

COROLLARY 8. Let fn ng⺞ be an⺞-rational sequence.Then there is a computable positi¨e integer p such that

fjqn p;c nj ljjn as nª⬁ Žjs0, . . . ,py1 ,.

where cj)0, ljg⺞, andj is a nonnegati¨e real. Furthermore,j and lj are computable.

(12)

Ž . Ž . We now order the pairs ␣j,lj with the lexicographic order: ␣j,lj - Ž␣k,lk. iff ␣j-␣k or ␣js␣k and lj-lk. The maximal pair Ž␣j,lj. is

Ž . Ž .

named the maximal pair of the sequence fn ng⺞. This pair ␣,l charac-

l Ž1rp.n Ž .

terizes the maximal growth rate n␣ of the sequence fn ng⺞.

Ž . Ž .

COROLLARY 9. Let fn ng⺞ and gn ng⺞ be two ⺞-rational sequences.

Ž . Ž .

The two functions fromto ⺞, f: nªf n and g: nªg n, are linearly equi¨alent iff they ha¨e the same maximal pair.

Ž . Ž . Ž .

Proof. Let ␣,l be the maximal pair of fn ng⺞ and ␣ ⬘,l⬘ be the

Ž . Ž . Ž .

maximal pair of gn ng⺞. Let us assume that ␣,l ) ␣ ⬘,l⬘. Then there are integers m, m⬘, p, p⬘, j, j⬘and positive reals c,c⬘ such that

fjqn p;cnln as nª⬁, and

gj⬘qn p;cnl⬘␣ ⬘n as nª⬁.

Ž . Ž .

Then f and g cannot be linearly equivalent. Conversely, if ␣,l s ␣ ⬘,l⬘, there is a constant K such that, for great enough n, we have

p⬘y1

fnFK

Ý

gnqi.

is0

Then f$g. By symmetry, we get g,f.

COROLLARY 10. It is decidable whether two functions that define ⺞- rational sequences are linearly equi¨alent.

REMARK11. As a particular case, we get the following result. If f and g are two⺞-rational series that have a dominating root, that is a unique Žreal pole of minimal modulus, then the corresponding functions are. linearly equivalent iff they have the same dominating root and the same degree of this dominating root.

REMARK12. It is easy to prove that if two functions f and g from⺞to

Ž Ž . Ž . < Ž .<

⺞have the same rate of growth the notation is f n 7g n , i.e., f n F

< Ž .< < Ž .< < Ž .<

C g n and g n FC f n for some constant C and for large values of w x.

n, cf. 20 , then they are linearly equivalent. The converse is not true. In fact, the following two ⺞-rational functions i.e., they defineŽ ⺞-rational sequences.

1 if nis a multiple of 5, f nŽ .s

½

0 otherwise,

1 if nis a multiple of 7, g nŽ .s

½

0 otherwise,

(13)

Ž Ž ..

are linearly equivalent they have the same maximal pair 1, 0 but they have different rate of growth.

4. MAIN RESULT

In the following, we will consider two systems SS and TT, SS over the

Ž . Ž .

alphabet A, and TT over the alphabet B. Let FSS n and F nTT be the

Ž . Ž .

functions that count, respectively, the elements of M SS and M TT of length n, i.e.,

FSSŽ .n sCard

Ž

MŽSS.lAn

.

, FTTŽ .n sCard

Ž

MŽTT.lBn

.

. We can now state our main result.

THEOREM13. If two symbolic dynamical systems SS andTTare conjugate, then the two functions FSS and F are linearly equiTT ¨alent.

In other words, our main result states that the,-equivalence class of FSS is a topological invariant of SS.

Let us first remark that, if SS and TT are conjugate, then there exists a k-block map␺ partially defined from A to Bsuch that ␺ is a conjugacy between SS and TT. Without loss of generality eventually by using aŽ composition with a power of the shift , we can assume that the map has. memory k and no anticipation. Hence, ␺ can be represented, for any

Ž .

integer nwith nGky1, by a transducer labeled by A=B,Tns Q,E ,

Ž . 4

where Qs a a1 2. . .an ,aigA and E is the set of the edges

Ž .

cNf anykq2. . .a cn 6

a a . . .a a a . . .a c ,

Ž 1 2 n. Ž 2 3 n .

Ž .

for all ai, cgA. Any bi-infinite path of Tn labeled uN¨ with ugSS verifies ¨s␺Ž .u . Of course, it is not true that any input label of a bi-infinite path of Tn belongs to SS.

Recall that an automaton is local if it does not admit two distinct equally labeled cycles. For any local automaton, there exist two integers m

Ž .

and a mfor memory and afor anticipation such that two equally labeled paths of length mqago through the same state after their beginnings of

Ž .

length m. Such an automaton is said to be m,a-local. Remark that the

Ž .

input automaton of the transducer Tn is a n, 0 -local automaton. This input automaton is usually known as a De Brujin graph.

We now define, for any nGky1, the transducer TnX obtained fromTn by removing all states defined by forbidden words of length nof SS. Let us denote by SSn the sofic system recognized by the input automaton of TnX.

(14)

In order to give the proof of Theorem 13, we will first state and prove the following five lemmas.

LEMMA14. For any state of TnX, there exists a bi-infinite path of TnX going through it.

Proof. Let usa1. . .an be a state of TnX. The word u is a factor of a bi-infinite word w of SS. There exists a path of Tn labeled by w in its input. Then there exists also a path of TnX labeled by win its input since w belongs to SS. This path goes through the state u at a time because u is a factor of length nof w. This proves also that any finite path ofTnX can be extended to infinity on the left and on the right.

LEMMA15. Letbe a conjugacy between SS andTT.Then there exists an integer n such that,0 for any nGn0,TnX has a local output.

w x X

Proof. By 3, Proposition 3.5 , in order to show that Tn has a local output, it is sufficient to prove that the k-block map ␺ is one-to-one on SSn.

By contradiction, let us suppose that for each n0Gky1, there exists nGn0 such that ␺ is not one-to-one on SSn. Then there exists an increasing sequence of indices n1,n2, . . . such that ␺ is not one-to-one on S

Sn for all ig⺞. Hence, there exist xi, yigSSn, xi/yi, such that

i i

Ž . Ž .

xi s␺ yi. As A is a compact metric space, there exists a subse-

Ž . Ž . Ž .

quence xi kg⺞convergent to x, and yi kg⺞has a subsequence yi hg⺞

k k Ž . k h

convergent to y, with x, ygA . Obviously, xi hg converges to x, too.

k h

Ž .

If we renumber opportunely the indices, we can write that xi ig⺞ con- Ž .

verges to xand yi ig⺞ converges to y. We are going to prove that xgSS.

Ž . Ž . 4

For all ⑀G0, let us denote by I x the set zgA Nd x,z -⑀ . As Žxi i.g⺞ converges to x, there exists i such that for all iGi, xigIr2Ž .x Ž . and xigSSni. As the central factor of xi of length ni is a block of L SS , there exists a word x⬘gSS which has such block as factor and, without loss of generality eventually by using the invariance by the shift transfor-Ž mation , we can assume that this factor is central. Hence, by definition of.

Ž . niy1.r2@

the distance, it follows that d xi,x⬘ F1r2 . Let us choose i large

?Žniy1.r2@ Ž .

enough such that 1r2 -⑀r2; hence, d x,x⬘ -⑀ and so x⬘g I xŽ .. As SS is closed, then xgSS. Analogously, it is possible to prove that

Ž . Ž . Ž .

ygSS. As ␺ is continued, and for all ig⺞,␺ xi s␺ yi , then ␺ x s

␺Ž .y . This contradicts the fact that ␺ is one-to-one from SS to TT.

X Ž .

LEMMA16. We assume that Tn has an m,a-local output,where m and a are chosen greater than or equal to k and n0 0.Then for each integer nGn0,

X Ž Ž . .

T has an mn q nyn0 ,a-local output.

(15)

X Ž .

Proof. We fix an integer nGn0. We assume that Tn is k1,k2 -local in its output. We assume that k1 and k2 are greater than or equal to k and that k1 is greater than or equal to n. We just have to prove that TnXq1 is Žk1q1,k2.-local in its output.

We consider two paths of TnXq1 of length k1q1qk2, with the word u

Ž .

as output label. We assume that the first path resp. the second one goes

Ž .

through state p resp. p⬘ after the first k1q1 symbols. We want to show that psp⬘. These two finite paths can be extended to two bi-infinite

X Ž . Ž .

paths ofTnq1 by Lemma 14 . The first bi-infinite path is labeled by w,z ,

Ž .

the second one by w⬘,z⬘. We have usziq1. . .ziqk1qk2q1 and us

X X Ž . Ž .

ziq1. . .ziqk1qk2q1 for some index i. The labels w,z and w⬘,z⬘ are also labels of paths of TnX because all labels of path of TnXq1 are labels of paths

X X

Ž . X

ofTn. AsTn is k1,k2 -local in its output, the two bi-infinite paths of Tn go through the same state qafter the symbol of index iqk1, and they also go through the same state r after the symbol of index iqk1q1. Let qsa1. . .an and rsa2. . .anq1. By definition of TnX, the input symbols of length nq1, wiqkynq1. . .wiqkq1 and of wXiqkynq1. . .wXiqkq1 are the

1 1X X Ž 1 . 1

same worda1. . .anq1. Back toTnq1, as Tnq1 is nq1 -local, we also have wiqkynq1 . . .wiqkq1sp and wXiqkynq1 . . . wXiqkq1sp⬘. We get

1 1 1 1

psp⬘.

LEMMA17. Any word of length nykq1which is an output label of a path of TnX is a factor of a word in TT.

Proof. We consider a word u of length nykq1, output label of a path ofTnX. Then by Lemma 14, there exists a bi-infinite path ofTnX labeled Žw,z. such that, for an index i, we have usziqk. . .ziqn. The word wiq1. . .wiqn is a word of length n input label of a path ofTnX. It is then a

X Ž .

factor of a word in SS. Then there exists a path inTn, labeled w⬘,z⬘ with w⬘gSS and such that wiq1. . .wiqnswXiq1. . .wXiqn. As w⬘belongs to SS,

X Ž .

z⬘belongs to TT. By definition of Tn, we have ziqkqjsf wiqjq1. . .wiqjqk , for js0, . . . ,nyk. We get usziqk. . .ziqnszXiqk. . .zXiqn. Then u is a factor of a word in TT since z⬘belongs to TT.

We have now all notations and elements to prove the fundamental lemma:

LEMMA18. If SS andTTare conjugate, for each integer n greater than n0, we ha¨e

mqayn0

FSS.Ž .n FC

Ý

FTT.Žnqi.,

isykq1

Ž Ž ..l

where Cs 2 Card B and lsmqayn0qky1.

(16)

Proof. Let x be a minimal forbidden word of length n of SS. It is an

X Ž .

input label of a path of Tny1 labeled w,z , which is not the label of a path of TnX. Let us assume that xsw w1 2. . .wn. We consider the part of the output label of this path yszmyny1.. . .z z0 1. . .z zn nq1. . .znqa.

0

Ž .

The word y is of length mq nyn0 qa. Let us assume that y is a factor of a word in TT. Therefore the word y represents the output label of

X Ž .

a path of Tny1 labeled w⬘,z⬘ with

w⬘gSS, z⬘gTT, and yszyX Žmyny1.. . .zX0zX1. . .zXnzXnq1. . .zXnqa.

0

X Ž Ž . . X X X

As Tny1 is mq ny1yn0 ,a-local in its output, we have w w1 2. . .wn sw w1 2. . .wnsx. This contradicts the fact that x is not a factor of a word in SS. Then y is not a factor of a word in TT. To each minimal forbidden word x of length n, we associate a word y defined as above. The previous argument shows also that the words y associated to two distinct words x are themselves distinct. By Lemma 17, we know that each word of length nyk, which is an output label of a path of TnXy1, is a factor of a word in TT. So each factor of length nyk of y is a factor of a word in TT, and y is not, a factor of a word in TT. Each word y can be obtained with a minimal forbidden word my of TT of length between nykq1 and nqmqay n0, completed left or right by symbols of B. Now the number of such words y, greater than or equal to the number of minimal forbidden words of SS of length n, is less than or equal to

mqayn0

C

Ý

FLŽTT.Žnqi..

isykq1

This gives the announced formula.

We can now prove our main result.

Proof of Theorem13. We have to prove that there exists a constant K such that

Ž .1 For any nGK, FSS.Ž .n FKÝKisyKFLŽTT.Žnqi.. Ž .2 For any nGK, FLŽTT.Ž .n FKÝisyKK FSS.Žnqi..

By previous lemma, by exchanging the roles played by SS and TT, we have

Ž . mqayn0 Ž .

that there exists M such that FLŽTT. n FMÝisykq1FSS. nqi . Let us

4

set K1smaxC,mqayn0,ky1 and K2smax M,mqayn0,ky

4 4 Ž . Ž .

1 . If we set Ksmax K1,K2 , then inequalities 1 and 2 follow.

Let us remark that, as a particular case of Theorem 13, we obtain the following well-known result of symbolic dynamics concerning systems of finite type. Recall that a system SS is of finite type if the set FF of forbidden factors is finite.

(17)

COROLLARY 19. Let SS be a system of finite type and let TT be a system conjugate to SS.Then TT is also of finite type.

It is well known that the entropy of a system is a topological invariant.

Theorem 13 allows us to state that the entropy of minimal forbidden words is an invariant, too.

5. SOFIC SYSTEMS

In this section, we are interested in sofic systems. In fact, it is known that, for these symbolic dynamical systems, the previously described invari- ants such as topological entropy, zeta function, and growth rate of com- plexity function of allowed words are computable. In this section, we prove that the,-equivalence is a decidable question for sofic systems. More- over, we give examples that show that the new invariant is independent from previous ones, but it is not characteristic.

Recall that a symbolic dynamical system SS is sofic if SSsSSFF for some

q Ž .

rational set FF;A . It is easy to prove that SS is sofic if and only if M SS

Ž . Ž .

is rational. According to relations between M SS and L SS , we have that

Ž . Ž .

M SS is rational if and only if L SS is rational; then it follows that SS is Ž .

sofic if and only if L SS is a rational language. Hence, it is obvious that a sofic system SS can be also defined as the set of bi-infinite labels of bi-infinite paths on a finite automaton AA, and we say that SS is recognized by AA.

If MMAA is the adjacency matrix of the finite automaton AAand ␭AA is the w x

largest eigenvalue of MMAA, it is proved, cf. 24 , that the topological entropy Ž .

of the sofic system SS is computable and h SS slog2AA. Moreover, it has been proved by Manning and Bowen that the zeta function of a sofic

w x w x

system SS is a rational function. See 24 or 4 for an exposition. A method to compute it by an operation on finite automata, the external power, is

w x

there explained. More recently, Reutenauer 29 has obtained new results showing that the zeta function of a sofic system is not only rational but even⺞-rational. He has also extended his results to more general symbolic dynamical systems, introduced by Fried under the name of finitely pre-

Ž w x. sented systems cf. 19 .

The growth rate of complexity function of allowed words is easily computable because the complexity function fSS of the sofic system SS is

Ž . r Ž n.

given by fSS ni,js1 MMAA i j, where AAis an unambiguous finite automa- ton that recognizes SS and MMAA is the r=r adjacency matrix of AA.

It is an important question to establish whether our topological invariant is also computable, i.e., given two symbolic dynamical systems SS and TT, to decide whether FSS and FTT are linearly equivalent or not.

Références

Documents relatifs

Prove that the gamma distribution is actually a continuous probability distributions that is it is non-negative over x &gt; 0 and its integral over R + is equal to 1.. Exercise 8

First introduced by Faddeev and Kashaev [7, 9], the quantum dilogarithm G b (x) and its variants S b (x) and g b (x) play a crucial role in the study of positive representations

A second scheme is associated with a decentered shock-capturing–type space discretization: the II scheme for the viscous linearized Euler–Poisson (LVEP) system (see section 3.3)..

For general n ≥ 2, G n was first studied by Chai, Wang and the second author [4] from the viewpoint of algebraic geometry. Among other things, they established the

We prove that arbitrary actions of compact groups on finite dimensional C ∗ -algebras are equivariantly semiprojec- tive, that quasifree actions of compact groups on the Cuntz

On the one hand, we show that the corresponding optimization problems associated with these numbers are both APX-hard, where for the intersection number our results hold even

Keywords : factorial language, anti-factorial language, factor code, fac- tor automaton, forbidden word, avoiding a word, failure function..

Keywords : formal languages, factorial language, anti-factorial language, factor code, factor automaton, forbidden word, avoiding a word, failure function.. 1