• Aucun résultat trouvé

2 Minimal Forbidden Words

N/A
N/A
Protected

Academic year: 2022

Partager "2 Minimal Forbidden Words"

Copied!
12
0
0

Texte intégral

(1)

Minimal Forbidden Words and Symbolic Dynamics

Marie-Pierre Beal1, Filippo Mignosi2and Antonio Restivo2

1 LITP - Institut Blaise Pascal, Universite Denis Diderot 2 Place Jussieu 75251 Paris cedex 05.[email protected]

2 University of Palermo, Dipartimento di Matematica ed Applicazioni Via Archira 34. 90123 Palermo.mignosi|[email protected]

Abstract. We introduce a new complexity measure of a factorial formal languageL: the growth rate of the set of minimal forbidden words. We prove some combinatorial properties of minimal forbidden words. As main result we prove that the growth rate of the set of minimal forbidden words forLis a topological invariant of the dynamical system dened byL.

Classication: Automata and Formal Languages

1 Introduction

Let LAbe afactoriallanguage, i.e. a language containing all factors of its words.

A word w2A is aminimal forbidden wordfor L if w =2L and all proper factors of w belong to L. We denote by MF(L) the language of minimal forbidden words for L. It turns out (as also stressed by the results of this paper) that the combinato- rial properties of MF(L) provide an usefull tool to investigate the structure of the language L or of the system that it describes. Consider, for instance, the case of locally testable factorial languages (cf [14]): they are characterized by the fact that the corresponding languages of minimal forbidden words are nite. In the context of Symbolic Dynamics they correspond to systems of nite type. Another example is given by a language L which is the set of factors of an innite word (or of a set of innite words): in this case, as we show in Sec. 2, the elements of MF(L) are closely related to thebispecialfactors (cf. [7], [8] and [6]) of the innite word. Minimal for- bidden words have been also considered in the study of complexity in the framework of a hierarchical modeling of physical systems (cf. [2] and [3]).

In this paper we are mainly concerned with the function FL(n), that counts, for any n, the number of words of MF(L) of length n. Our main result is expressed in the framework of symbolic dynamics and it states, roughly speaking, that the growth of the function FL(n) is a topological invariant of the dynamical system dened by L. There are deep connections between the theory of automata and formal languages and symbolic dynamics (cf. [16] and references therein). Several results from symbolic dynamics have a natural interpretation in terms of formal languages and conversely.

Let us recall some basic denitions. If AZ is the set of all biinnite sequences of elements of a nite alphabet A, endowed with the usual topology, a symbolic

(2)

dynamical systemis dened as a closed subset S of AZ which is invariant under the shift function. It is easy to show that a systemSAZ is uniquely specied by the associated language L(S)Aof nite factors of elements inS. The language L(S) is a factorial language. We denote by FS(n) = FL(S)(n) the function that counts the number of minimal forbidden words ofS of length n.

Two symbolic dynamical systems areconjugateorisomorphicif there exists a bi- continuous bijection between them which commutes with the shift. A long standing open problem of the theory (cf. [18], [11] and [16] ) is to decide whether two given soc systems are isomorphic. Recall that a system S is soc if L(S) is a rational language.

A property of a system S that is preserved under isomorphism is said to be a topological invariant of S. Several topological invariants have been found, like topological entropy and zeta functions. None of them is characteristic; for instance there exist systemsSandT that are not isomorphic but that have same topological entropy and same zeta function (cf. next Example 1).

Let g, f be two functions fromN to N. We say that f and g arelinearly equivalent, and we write f 'g, if there exist two constants K1 and K2 such that

1) For any nK1, f(n)K1PKi=1 K1g(n + i).

2) For any nK2, g(n)K2PKi=2 K2f(n + i).

It is easy to verify that'is an equivalence relation.

Our main result states that if two symbolic dynamical systems S and T are isomorphic then the two functions FS and FT, that count, respectively, the number of minimal forbidden words ofS and ofT are linearly equivalent; in other word, our main result states that the equivalence class of FS is a topologial invariant ofS. This new invariant in not characteristic too. As a corollary we have that the topological entropy HMF(L)of the set of minimal forbidden words of L is a topological invariant of the dynamical system associated to L.

This result provides a new tool to show that some systems are not isomorphic. We give indeed examples of non-isomorphic soc systems having the same zeta function and dierent growths of their minimalforbidden words. As a more special application of our main result, we show that two (non soc) systems associated to Sturmian words having \dierent slope" are not isomorphic.

From the point of view of formal languages the fact that the growth of the function FL(n) is invariant under some natural transformations suggests that this function is a "good" tool to investigate combinatorial properties of factorial lan- guages and, in particular, properties of innite words. The growth of FL(n) provides a new complexity measure of the language L.

As a nal remark we note that the statement of our main result presents some analogies with a theorem of Maurer and Nivat (cf. [13]) concerning rational bijections of rational languages. However the results are very dierent as explained with more details in the last section.

2 Minimal Forbidden Words

For any notation not explicitely dened in this paper we refer to [12] and to [9].

(3)

Let A be a nite alphabet and A the set of nite words of letters from A, the empty word included.

Let L A be a factorial language, i.e. 8u;v 2 A, uv 2 L ) u;v 2 L. The complement AnL of L is a (two sided) ideal of A. Denote by MF(L) its base: AnL = AMF(L)A.

MF(L) is the set of Minimal Forbidden words for L. A word v2A is forbidden for L if v =2L. It is minimal if it has no proper factors that are forbidden.

Remark 2.1:

The set MF(L) uniquely characterizes L. Indeed L = AnAMF(L)A.

Remark 2.2:

A word u = a1a2an2MF(L) if and only if (1) u is forbidden, i.e.

u =2L,and(2) both a1a2an 12L and a2an2L.

From this last remark we derive the following expression of MF(L): MF(L) = AL\LA\(AnL):

As a consequence of both previous remarks we have the following

Proposition1.

The language L2Rat(A)if and only ifMF(L)2Rat(A) Example1.

d,y,z c,t,x

a

Figure 1. Languages L and MF(L) a,t

a,z

b,c,x b,d,y

a

c,x,z d,t,y

b b

b,c b,c

a,d

y x

y x

Figure 2. Languages S and MF(S) a,d

a,d

b,c,x b,c,y

a,d

A word v2L isspecial on the leftwith respect to B, where B A andCard(B) 2, if for any b2B, bv belongs to L. Anagously we dene wordsspecial on the right. Given B;CA such thatCard(B)2 andCard(C)2, we say that a word v2L isbispecialwith respect to (B;C) if it is special on the left with respect to B and special on the right with respect to C.

In the case of a two letters alphabet A, special and bispecial words have been extensively studied (cf. [5], [7], [8], [6]). Remark that, since Card(A)= 2, there is no need to specify the sets B and C (both must be equal to A). In this case let us denote by BS(L) the set of bispecial elements of L.

(4)

Example 2. Let K be the set of factors of the Fibonacci innite word

f

(cf. [5]). Then BS(K) =fvj v is a palindrome prex of

f

g=

=f;a;aba;abaaba;abaababaaba;abaababaabaababaaba;g: MF(K) =fwjw = bvb; v is the n-th palindrome prex of

f

, n is eveng[

[fwjw = ava; v is the n-th palindrome prex of

f

, n is oddg=

=fbb;aaa;babab;aabaabaa;babaababaabab;aabaababaabaababaabaa;g: As can be seen by previous example, the sets MF(K) and BS(K) are \similar".

This fact is motivated by the following two easy propositions, that relate bispecial and minimal forbidden words.

Proposition2.

If u2A is bispecial with respect to(B;C) and buc =2L for some b2B and c2C, then buc2MF(L).

Proof. If u is bispecial with respect to (B;C) then bu2L and uc2L. Since buc =2L, by Remark 2, buc2MF(L).

The converse of previous proposition holds true under the supplementary hy- pothesis that L isextensible. Recall that a Language L is extensible if, for any v2L there exist x;y2A such that xv2L and vy2L.

Proposition3.

Let L be a factorial extensible language. If w = buc2MF(L), b, c 2A, then there exist B;CA,Card(B)2,Card(C)2,b2B and c2C such that uis bispecial with respect to (B;C).

Proof. We prove that u is left special with respect to B. A symmetric argument proves that u is right special with respect to C.

Since buc2 MF(L), uc 2 L. Since L is extensible there exists a letter x such that xuc2L. This letter x is dierent from b because buc =2L. If we set B =fx;bg then it is easy to verify that u is left special with respect to B.

Given a language LA, thecomplexity functionfL(n) of L is dened as fL(n) =Cardfv2L such thatjvj= ng:

In this paper we study the complexity function of the set of minimalforbidden words MF(L), fMF(L)(n) (that we denote by FL(n) in order to simplify the notation). It is often useful to consider also the formal series gL(z) dened as follows:

gL(z) = n>0FL(n)zn:

A measure of the complexity of a language L is given by its topological entropy HL, dened as follows:

HL= limsupn

!1

log2(pn fL(n)):

It is known that for a factorial extensible language previous limsup can be substi- tuted by lim.

Example 3. If L and S are the languages of Example 1, then HL = HS while 1 = HMF(L)6= HMF(S)= 2. Indeed gL(z) = 18z12(12zz2) and gS(z) = 2z21(144zz2).

If K is the set of factors of the innite Fibonacci word (cf. Example 2) then for any n0, fK(n) = n + 1 and for any n2, FK(n) = 1 if n is a bonacci number, FK(n) = 0 otherwise. Consequently HK= 0 and HMF(K)= 0.

(5)

3 Symbolic Dynamics

Symbolic dynamics is a eld born at the beginning of the years 20 with the work in topology of Marston Morse (cf. [15]). Later the theory was developed as a branch of ergodic theory. There are deep connections between the theory of automata and formal languages and symbolic dynamics (cf. [16] and references therein). Several results from symbolic dynamics have a natural interpretation in terms of formal languages and conversely.

We present in this section a short introduction to the concepts of symbolic dy- namics. Basic denitions and notations are from [16]. For any other notations not explicitely dened here we refer to [4].

Let AZ be the set of bi-innite sequence of letters of an alphabet A. An element of AZ,

x

= (xn)n2Z;xi2A is called aninnite word.

Let us endow A with the discrete topology and AZ with the product topology.

Then AZ is a compact space.

Theshift is a function dened on AZ. It associates to

x

the element

y

= (

x

) dened by the rule: yn= xn+1 for any integer n.

Asymbolic dynamical systemS is a closed subset of AZ that is invariant by the shift function; i.e. (S) =S.

Given a system S, the set L(S) of all factors of all innite words in S is a factorial and extensible language. Conversely it is possible to associate to a factorial and extensible language L a systemS(L) composed by all the innite words

x

such that all the factors of

x

belong to L. It is easy to see that the system associated to L(S) is again S and that L(S(L)) is again L. Hence any system S is uniquely specied by the associated language L(S). We will write FS instead of FL(S). The entropy HS of a systemS is the topological entropy of the corresponding language L(S).

A morphism between two systems S and T is a map : S ! T which is continuous and commutes with the shift, i.e. such that = .

An isomorphismis a bijective morphism. A property of a systemS that is pre- served under isomorphism is said to be atopological invariantofS.

Let k be an integer greater or equal to 1. A k-block map (or k-localfunction) from a systemSinto a systemT is dened by a partial function f from Akinto B, by two integers m (for memory) and a (for anticipation) with m + a = k, and satises, for any integer i, ((an)n2Z)i= f(ai m+1ai m+2:::ai 1aiai+1:::ai+a). A k-block map is continuous and commutes with the shift. The Curtis-Hedlund theorem (cf.

[10]) assures that any continuous map from AZ into BZ that commutes with the shift is a k-block map for some integer k.

A nite automaton is a couple (Q;F) of a nite set of states Q and of a set of edges F labeled by letters of A. All states are both initial and terminal.

A soc system (or soc shift) can be dened as the set of bi-innite labels of bi-innite paths on a nite automaton. We say that the automaton recognizes the soc system. A long standing open problem of the theory (cf. [18], [11] and [16] ) is to decide whether two given soc systems are isomorphic. We should also mention that there is a strong connection between soc systems and the theory of constrained

(6)

coding, a sub-eld of coding theory. In terms of codes an isomorphism is a sliding- block code which is one-to-one and onto (cf. [1] and [4]).

An automaton is local if it does not admit two distinct equally labeled cycles.

For any local automaton, there exist two integers m and a (m for memory and a for anticipation) such that two equally labeled paths of length m + a go through the same state after their beginnings of length m. Such an automaton is said to be (m;a)-local.

Systems recognized by a local automata are called systemsof nite type. A system of nite type can be also dened as a system consisting of all innite words avoiding a nite set of words. Clearly, any system of nite type is also a soc system. One can prove that the notion \of nite type" and the notion of \soc" are invariant under isomorphism in the sense that any system that is isomorphic to a system of nite type (resp. soc) is again of nite type (resp. soc). Remark that the isomorphism problem is open even for the systems of nite type.

4 Main Result

In this section we state and prove our main result. We begin with a denition:

Let g, f be two functions fromN to N. We say that f and g arelinearly equivalent, and we write f 'g, if there exist two constant K1 and K2 such that

1) For any nK1, f(n)K1PKi=1 K1g(n + i).

2) For any nK2, g(n)K2PKi=2 K2f(n + i).

It is easy to verify that ' is an equivalence relation. In the following we will consider two systems S andT,S over the alphabet A, andT over the alphabet B.

Let FS and FT, be the functions that count, respectively, the number of minimal forbidden words ofS and ofT.

We can now state our main result.

Theorem4.

If two symbolic dynamical systems S and T are isomorphic then the two functions FS and FT are linearly equivalent.

In other word, our main result states that the equivalence class of FS is a topolo- gial invariant ofS.

SinceSandT are isomorphic then there exists a k-block map partially dened from AZ to BZ such that is an isomorphism between S and T. Without loss of generality (eventually by using a composition with a power of the shift), we can assume that the map has memory k and no anticipation.

A transduceris an automaton (Q;F) with a set of edges F labeled by AB, where A and B are two nite alphabets; we, as usual, write a label ajb, a2A;b2 B instead of writing the ordered couple (a;b). The input of the transducer is the automaton labeled by A and obtained from the transducer after removing the second components of the labels of the edges. The output of the transducer is obtained from the transducer after removing the rst components of the labels of the edges. It is an automaton labeled by B.

(7)

All block maps dened by a partial function f from Ak into B, with integers m = k and a = 0 can be represented, for any integer n with nk, by a transducer labeled by AB, Tn= (Q;F) where Q =f(a1a2:::an);ai 2Agand F is the set of the edges :

(a1a2:::an)ajf(an k+2:::ana)

!(a2a3:::ana);

for all ai;a2A. We can remark that if (ujv) is the label of a path of a transducer, then ((u)j(v)) also.

We say that Tnis a representation of : any path of Tnlabeled (ujv) with u2S veries v = (u). Of course it is not true that any input label of a path of Tnbelongs toS.

Remark also that the input automaton of the transducer Tnis a local automaton;

it has memory n and no anticipation. This input automaton is usually known as a De Bruin graph.

We now dene, for any nk, the transducer Tn0 obtained from Tn by removing all states dened by forbidden words of length n ofS.

We will rst state and prove the following four lemmas:

Lemma5.

For any state ofTn0, there exists a bi-innite path of Tn0 going through it.

Proof. Let u = a1:::anbe a state of Tn0. The word u is a factor of a bi-innite word w of S. There exists a path of Tn labeled by w in its input. Then there exists also a path of Tn0 labeled by w in its input since w belongs to S. This path goes through the state u at a time because u is a factor of length n of w. This proves also that any nite path of Tn0 can be extended to the innity on the left and on the right.

Lemma6.

There exists an integer n0 such that for any n n0, Tn0 has a local output.

Proof. We will consider in the following only integers n greater or equal to k. As the input of Tnis local, there exists at most one bi-innite path of the input of Tnwith a given label. The input of Tn0 is also a local automaton. To show that the output of Tn0 is local for n big enough, it is sucient to show that it is impossible to have the following situation:

For any integer N, there exists an integer n > N and two bi-innite paths of Tn0 labeled by ((ani)i2Zj(xni)i2Z) and ((bni)i2Zj(yni)i2Z), with (ani)i2Z 6= (bni)i2Z and (xni)i2Z = (yni)i2Z. We get a sequence of these bi-innite paths, indexed by P =

fp(n);n2

N

g, where p is an increasing function from

N

to

N

.

Let us assume that the above sentence is true. For each integer n 2 P, there exists an index q such that anq6= bnq. Without loss of generality, we can assume that q = 0 for each n by shift invariance of the property of being label of a path of Tn0. We have now an innite number of integers n and bi-innite paths of Tn0 labeled by ((ani)i2Zj(xni)i2Z) and ((bni)i2Zj(yni)i2Z), with an0 6= bn0 and (xni)i2Z = (yni)i2Z. As AZ is a compact set, we extract from each sequence indexed by P, ((ani)i2Z)n2P and ((bni)i2Z)n2P, a convergent sequence. After renumbering (by changing the set P), we can assume now that these sequences converge respectively to the two sequences (ai)i2Z and (bi)i2Z.

(8)

We also denote by wnthe word of length n : anbn

2

c+n:::an1an0an1:::anbn

2

c, where n is equal to 1 if n is even and is equal to 0 otherwise.

As the word wnis an input label of length n of a path of Tn0, it is not a forbidden word of S. As ((ani)i2Z)n2P converge to (ai)i2Z, any nite word factor of (ai)i2Z is a factor of a word of S because it is a factor of a word wnfor some integer n. The bi-innite sequence (ai)i2Z is then a limit of sequences of S. As a consequence the sequences (ai)i2Z and (bi)i2Z belong to S, by compactnes of the symbolic system S.

As is a continuous map, the outputs (xni)i2Z and (yni)i2Z also converge to (xi)i2Z and (yi)i2Z, and we have (xi)i2Z = ((ai)i2Z), (yi)i2Z = ((bi)i2Z). As (xni)i2Z = (yni)i2Z for all n2P, we have (xi)i2Z = (yi)i2Z. But (ai)i2Z 6= (bi)i2Z

because we have a06= b0 since an0 6= bn0 for all n2P. This contradicts the fact that is one-to-one from S to T.

Lemma7.

We assume thatTn00 has a(m;a)-local output, wheremandaare chosen greater than or equal to kandn0. Then for each integernn0,Tn0 has a(m+(n n0);a)-local output.

Proof. We x an integer nn0. We assume that Tn0 is (k1;k2)-local in its output.

We assume that k1 and k2 are greater or equal to k and that k1 is greater or equal to n. We just have to prove that Tn0+1 is (k1+ 1;k2)-local in its output.

We consider two paths of Tn0+1of length k1+1+k2, with the word u as output label.

We assume that the rst path (resp. the second one) goes through state p (resp. p0) after the rst k1+ 1 symbols. We want to show that p = p0. These two nite paths can be extended to two bi-innite paths of Tn0+1 (by Lemma 5). The rst bi-innite path is labeled by (w;z), the second one by (w0;z0). We have u = zi+1:::zi+k1+k2+1

and u = zi0+1:::zi0+k1+k2+1 for some index i. The labels (w;z) and (w0;z0) are also labels of paths of Tn0 because all labels of path of Tn0+1 are labels of paths of Tn0. As Tn0 is (k1;k2)-local in its output, the two bi-innite paths of Tn0 go through the same state q after the symbol of index i + k1, and they also go through the same state r after the symbol of index i + k1+ 1. Let q = a1:::anand r = a2:::an+1. As Tn0 is (n;0)-local in its input, the input symbols of length n+1, wi+k1 n+1:::wi+k1+1and of w0i+k1 n+1:::w0i+k1+1 are a xed word a1:::an+1. Back to Tn0+1, we also have wi+k1 n+1:::wi+k1+1= p and wi0+k1 n+1:::wi0+k1+1= p0. We get p = p0.

Lemma8.

Any word of lengthn k + 1which is an output label of a path ofTn0 is a factor of a word of T.

Proof. We consider a word u of length n k+1, output label of a path of Tn0. Then by Lemma 5 there exists a bi-innite path of Tn0 labeled (w;z) such that, for an index i, we have u = zi+k:::zi+n. The word wi+1:::wi+nis a word of length n input label of a path of Tn0. It is then a factor of a word ofS. Then there exists a path in Tn0, labeled (w0;z0) with w02S and such that wi+1:::wi+n= w0i+1:::wi0+n. As w0belongs to

S, z0 belongs toT. By denition of Tn0, we have zi+k+j = f(wi+j+1:::wi+j+k), for j = 0:::(n k). We get u = zi+k:::zi+n = zi0+k:::z0i+n. Then u is a factor of T since z0belongs toT.

We have now all notations and elements to prove the fundamental lemma:

(9)

Lemma9.

IfS and T are isomorphic, for each integern greater thann0, we have FL(S)(n)Km+Xa n0

c= k+1FL(T)(n + c) whereK = (2card(B))l and l = m + a n0+ k 1.

Proof. Let x be a minimal forbidden word of length n ofS. It is an input label of a path of Tn0 1labeled (w;z), which is not the label of a path of Tn0. Let us assume that x = w1w2:::wn. We consider the part of the output label of this path y = z (m n0 1):::z0z1:::znzn+1:::zn+a. The word y is of length m+(n n0)+a. Let us assume that y is a factor ofT. It is then the output label of a path of Tn0 1labeled (w0;z0) with w02S, z02T and y = z0(m n0 1):::z00z10:::zn0z0n+1:::zn0+a. As Tn0 1 is (m + (n 1 n0);a)-local in its output, we have w10w20:::wn0 = w1w2:::wn= x.

This contradicts the fact that x is not a factor ofS. Then y is not a factor ofT. To each minimal forbidden word x of length n we associate a word y dened as above.

The previous argument shows also that the words y associated to two distinct words x are themselves distinct.

By Lemma 8 we know that each word of length n k, which is an output label of a path of Tn0 1, belongs to T. So each factor of length n k of y is a factor of a word ofT and y is not factor of T. Each word y can be obtained with a minimal forbidden word my ofT of length between n k +1 and m n0+ n+ a, completed left or right by symbols of B. Now the number of such words y, greater or equal to the number of minimal forbidden words ofS of length n, is less or equal to

Km+Xa n0

c= k+1FL(T)(n + c) This gives the announced formula.

In order to prove our main result we have to prove that there exists two constant K1 and K2 such that

1) For any nK1, FL(S)(n)K1PKi=1 K1FL(T)(n + i).

2) For any nK2, FL(T)(n)K2PKi=2 K2FL(S)(n + i).

If we set K1 = maxfK;m + a n0;k 1gthen inequality (1) is an immediate consequence of previous lemma. Inequality (2) comes also by previous lemma, by exchanging the roles played byS andT.

5 Applications

In this section we present some consequences and applications of our main result.

Let us rst remark that, as a particular case of Theorem 4, we obtain the following well known result of Symbolic Dynamics concerning systems of nite type. Recall that a systemS is of nite type if L(S) is a locally testable language.

(10)

Corollary10.

LetS be a system of nite type and let T be a system isomorphic to

S. ThenT is also of nite type.

It is well known that the entropy of a system is a topological invariant. Theorem 4 allows us to state that the entropy of minimal forbidden words is an invariant too.

Corollary11.

LetS be a dynamical system.HMF(L(S)) is a topological invariant.

Theorem 4 and Corollary 11 provide useful tools to prove that two systems are not isomorphic. In particular, ifSis a soc system, i.e. L(S) is a rational language, by Proposition 1, MF(L(S)) is also a rational language and its entropy can be easily computed. For instance the two soc systems in Example 1 are not isomorphic, because HMF(L(S))6= HMF(L(T)), but they have same entropy (and also same zeta function).

However there exist soc systems that have dierent zeta functions (and hence are non isomorphic) but that have linearly equivalent growths of minimal forbid- den words. Moreover there exist soc systems that have equal zeta functions and that have linearly equivalent growths of minimal forbidden words but that are non isomorphic.

Let us now consider the case of dynamical systems associated to Sturmian words.

Recall that a Sturmian word can be also dened by considering the intersections with a squared-lattice of a semi-line having a slope which is an irrational number > 0 (cf. [7]). A vertical intersection is denoted by the letter a, a horizontal intersection by b and the intersection with a corner by ab or ba. It is possible to prove that the language of factors of a Sturmian word dened in this way (and hence the associated systemS) depends only on the slope of the line.

Let S, S be the dynamical systems associated to two Sturmian words such that 6= and 6=1. It is possible to verify that the entropy of both systems and the entropy of their minimal forbidden words are zero. However in what follows, as a consequence of Theorem 4, we prove in a purely combinatorial way that thatS,

S are not isomorphic; this also shows that Theorem 4 is stronger than Corollary 11.Before stating next theorem we need some more preliminaries.

We construct the innite sequence of pairs of words (An;Bn), n0, as follows.

We set (A0;B0) = (a;b). For any n 0 the pair (An+1;Bn+1) is obtained from (An;Bn) by using one of the following two rules:

First rule: (An+1;Bn+1) = (An;AnBn).

Second rule: (An+1;Bn+1) = (BnAn;Bn).

Let > 0 be an irrational number and let [q0;q1;q2;] be the development in continued fraction of . It has been proved (cf. [17]) that if one applies q0times rst rule, q1times second rule, q2times rst rule, and so on, then the sequencesfAngn0,

fBngn0 converge to the same innite Sturmian word

x

that is associated to the semi-line that have slope and that starts from the origin. Moreover, if jAnj2 (resp. jBnj2) then An(resp. Bn) is a prex of

x

.

Let us dene A(

x

) = Sn0fAng and B(

x

) = Sn0fBng. Finally, for any integer k 1 let I(k) =fp2Z;pkjthere exists u2A(

x

)[B(

x

) such that p =jujg.

(11)

In what follows and are two positive irrational number.

The proof of the following lemma is left to the reader.

Lemma12.

If I(k) = I(k) for somek1then either = or =1.

Theorem13.

If 6= and 6=1, then S is not isomorphic to S.

Proof. Suppose that S is isomorphic to S; we want to prove that either = or = 1. By Lemma 12 we will prove that there exists an integer k1 such that I(k) = I(k). By [7, Proposition 9] it is easy to prove that s is bispecial inS if and only if s is a palindrome prex of

x

. By [7, Proposition 7] s is a palindrome prex of

x

if and only if s2 A(

x

)(ba) 1[B(

x

)(ab) 1. By Propositions 2 and 3, if w2MF(L(S)) then there exists u2A(

x

)[B(

x

) such thatjwj=jujand conversely if u 2A(

x

)[B(

x

), juj2, then there exists w 2 MF(L(S)) such that jwj = juj (more precisely the bispecial prex s of u of length juj 2 is the

\central" factor ofjwj) .

Hence, for k2, I(k) is the set of lengths pk of minimal forbidden words of L(S) and then, for nk, FS(n) is a zero-one function that coincides with the characteristic function of the set I(k). Therefore, in order to prove that I(k) = I(k), it sucies to prove that for nk, FS(n) = FS(n) .

By denition of (An;Bn), the sequence of the distances between two consecutive integers in I(k) (resp. I(k)) (i.e. the distances between two consecutive integers n, m such that FS(n) = FS(m) = 1 ) is an increasing divergent sequence. This fact, joined with the hypothesis that, by Theorem 4, FS and FS are linearly equivalent, implies that there exist C 0, k2 such that for any nk, if FS(n) = 1 (resp.

FS(n) = 1 ) then there exists m such that jn mj C and FS(m) = 1 (resp.

FS(m) = 1 ).

Using the developments in continued fractions of and , one can derive by arithmetic computations that the constant C is equal to zero. This concludes the proof.

Remark that if = 1 then

x

is obtained from

x

by exchanging letter a with letter b. Hence SandS are trivially isomorphic.

Finally let us briey discuss some consequences of our main result in the frame- work of formal languages. The fact that the growth of the function FL(n) is invariant under some natural transformation suggests that this function is a \good" tool to investigate the structure of the language L. The equivalence relation'introduced in this paper between functions from N to N induces an equivalence relation be- tween languages as follows: L1 ' L2 if and only if FL1 ' FL2. This gives a new classication for (factorial) formal languages. For instance locally testable factorial languages correspond to a class in this new hierarchy. In the same way, languages having a number of minimal forbidden words of length n which is a polynomial of same degree (exponential having same base, resp.) are in the same level of the hierarchy.

Remark that our theorem presents some analogies with a result of Maurer and Nivat in [13] concerning rational bijections of rational languages. We observe that the notion of \isomorphism", here considered for (factorial, extensible) languages

(12)

associated to topologicallyisomorphic systems, is dierent from the notion of rational bijection in [13]. On the other hand the result of Maurer and Nivat gives a condition involving a function that counts, for any n, the number of words in the language of length n; whereas in our result we count the number of forbidden words. Moreover the equivalence relation between functions introduced in this paper is ner than that considered in [13].

References

1. R. Adler, D. Coppersmith, M. Hassner. Algorithms for Sliding-Block Codes, IEEE Trans. Inf. Theory,IT-29(1983), 5-22.

2. R. Badii. Quantitave Characterization of Complexity and Predictability, Physics Let- ters A160(1991), 372-377.

3. R. Badii. Complexity and Unpredictable Scaling of Hierarchical Structures, in \Chaotic Dynamics: Theory and Practice", T. Bountis Ed., Plenum Press, New York, 1992.

4. M. P. Beal. Codage Symbolique, Masson, 1993.

5. J. Berstel. Fibonacci Words - a Survey, in \The Book of L", G. Rozenberg, A. Salomaa eds., Springer Verlag 1986.

6. J. Cassaigne. Complexite et Facteurs Speciaux, Actes des Journees Montoises, 1994.

7. A. de Luca, F. Mignosi. Some Combinatorial Properties of Sturmian Words, Theor.

Comp. Science,136(1994), 361-385.

8. A. de Luca, L. Mione. On Bispecial Factors of the Thue-Morse Word, Inf. Proc. Lett.,

49(1994), 179-183.

9. S. Eilenberg. Automata, Languages, Machines, Vol. A, Academic Press, 1974.

10. G. A. Hedlund. Endomorphisms and Autorphisms of the Shift Dynamical System, Math. System Theory,3(1969), 320-375.

11. K. H. Kim, F. W. Roush. Williams Conjecture is False for Reducible Matrices, J.

Amer. Math. Soc.5(1992), 213-125.

12. M. Lothaire. Combinatorics on Words, Addison-Wesley, Reading, MA 1983.

13. H. A. Maurer, M. Nivat. Rational bijections of rational sets, Acta Informatica 13 (1980), 365-378.

14. R. McNaughton, S. Papert. Counter-Free Automata, M.I.T. Press, MA 1970.

15. M. Morse. Recurrent geodesics on a surface of negative curvature, Trans. Amer. Math.

Soc.22(1921), 84-110.

16. D. Perrin. Symbolic Dynamics and Finite Automata, invited lecture in Proc. MFCS'95, Lect. Notes in Comp. Sci.,969.

17. G. Rauzy, Mots innis en arithmetique in \Automata on Innite words", M. Nivat and D. Perrin eds., Lecture Notes in Comp. Science,192, Springer, Berlin 1984.

18. R. Williams. Classication of shifts of nite type, Annals of Math.,98(1973), 120-153.

Errata, Annals of Math.,99(1974), 380-381.

This article was processed using the LTEX macro package with LLNCS style

Références

Documents relatifs

It is shown that small fragments of the first-order theory of the subword order, the (partial) lexicographic path ordering on words, the homomorphism preorder, and the infix order

assertion is now proved.. We now complete the proof of the theorem. It follows that the surface Ml is standardly embedded in B, up to ambient isotopy. The

First introduced by Faddeev and Kashaev [7, 9], the quantum dilogarithm G b (x) and its variants S b (x) and g b (x) play a crucial role in the study of positive representations

For a large class of locally compact topological semigroups, which include all non-compact, a-compact and locally compact topological groups as a very special case,

Prove that the gamma distribution is actually a continuous probability distributions that is it is non-negative over x > 0 and its integral over R + is equal to 1.. Exercise 8

In this article, our contribution is threefold: (a) We present a new O(n)-time and O(n)-space algorithm for computing all minimal absent words on a fixed-sized alphabet; (b) We

Results: Our contribution in this article is twofold: first, we bridge this unpleasant gap by presenting an O ( n ) -time and O ( n ) -space algorithm for computing all minimal

Eight years after the war, Beloved appears outside Sethe’s house who then adopts her and starts to believe that the young woman is her dead daughter come back to life..