Palindromic prefixes and episturmian words

(1)

Palindromic prefixes and episturmian words

Stéphane Fischler

Équipe d’Arithmétique et de Géométrie Algébrique, Bâtiment 425, Université Paris-Sud, 91405 Orsay Cedex, France Received 14 March 2005

Available online 27 December 2005

Abstract

Letwbe an infinite word on an alphabetA. We denote by(n_i)_i₁the increasing sequence (assumed to be infinite) of all lengths of palindromic prefixes ofw. In this text, we give an explicit construction of all wordswsuch thatn_i₊₁2n_i+1 for alli, and study these words. Special examples include characteristic Sturmian words, and more generally standard episturmian words. As an application, we study the values taken by the quantity lim supn_i₊₁/n_i, and prove that it is minimal (among all nonperiodic words) for the Fibonacci word.

Keywords:Palindrome; Prefix; Sturmian word; Episturmian word; Fibonacci word

1. Introduction

The purpose of this text is to study infinite words (on an arbitrary, not necessarily finite, alphabet A) which have “sufficiently many” palindromic prefixes. The motivation comes from diophantine approximation (see below), though this question is also related to physics, namely to the spectral theory of discrete one-dimensional Schrödinger operators. Words with many palindromic factors can be used in this setting [11], corresponding to the combinatorial notion of

“palindrome complexity” (see, for instance, [1]). On the other hand, replacing “whole-line methods” by “half-line methods” in connection with this problem leads [4] to the use of words with many palindromic prefixes, like the ones studied below.

In precise terms, given an infinite wordw, we shall denote (in this Introduction) by(n_i)_i₁ the increasing sequence of all lengths of palindromic prefixes ofw, withn₁=0 corresponding

E-mail address:[email protected].

doi:10.1016/j.jcta.2005.12.001

(2)

to the empty prefix. The words studied here always have infinitely many palindromic prefixes, so we assume the sequence(ni)i1to be infinite.

A trivial example of such a wordwis any periodic word with a palindromic period. A more interesting example is the Fibonacci wordw=babbababbabba . . .on the two-letter alphabet {a, b}, for which the sequence(n_i)=(0,1,3,6,11, . . .)is given byn_i=F_i₊₁−2 (whereF_i is theith Fibonacci number); this follows from [6, Theorem 5]. More generally, any characteristic Sturmian word satisfiesn_i₊₁2ni+1 for anyi, and denoting by[0, s1, s₂, . . . ,]the continued fraction expansion of its slope we have (see Section 3.1):

lim supn_i₊₁

n_i =lim sup[1,1, sk, s_k₋₁, . . . , s₁]. (1) In particular, ifwis the Fibonacci word then lim supni+1/n_iis the golden ratioγ=(1+√

5)/2.

A generalization of characteristic Sturmian words to an arbitrary alphabet has been given by Droubay, Justin and Pirillo [8]: these are standard episturmian words. They also satisfyn_i₊₁ 2ni+1 for anyi, but there is no easy equation like (1) to compute lim supn_i₊₁/n_i.

In this text, we study the wordswwith abundant palindromic prefixes in the following sense:

Definition 1.1.An infinite wordwis said to haveabundant palindromic prefixesif the sequence (n_i)_i₁of all lengths of its palindromic prefixes is infinite and satisfiesn_i₊₁2ni+1 for any i1.

A completely explicit construction of all words with abundant palindromic prefixes is given, which generalizes one of the constructions [12] of standard episturmian words. This is a strict generalization, i.e., there are words with abundant palindromic prefixes which are not standard episturmian. Moreover, our results extend to words such thatn_i₊12ni+1 for any sufficiently large integeri; in particular, a general construction of all such words is given.

For any wordw, we let δ(w)=lim supn_i₊₁ n_i

ifwadmits infinitely many palindromic prefixes, andδ(w)= ∞otherwise. Then 1/δ(w)measures the “density” of palindromic prefixes inw. We letDbe the set of real numbers that can be writtenδ(w)for some wordw (on a suitable alphabet). Moreover, we letD0be the set of all numbersδ(w) obtained from words w with abundant palindromic prefixes. The inclusion D0⊂D∩ [1,2]trivially holds, and it is not difficult to prove (see Section 2.2) that(2,+∞] ⊂D.

Denoting bythe union of two disjoint sets, the following result holds.

Theorem 1.2.We haveD=D0(2,+∞].

Actually, for any wordwsuch thatδ(w) <2, there is a wordw with abundant palindromic prefixes such that the palindromic prefixes ofw satisfy the same recurrence relation as those ofw(see Proposition 6.1) and, therefore,δ(w)=δ(w).

The easiest examples of words with abundant palindromic prefixes are periodic words (with a palindromic period) and characteristic Sturmian words (for whichδ(w)can be computed thanks to Eq. (1)). Denote by D the set of numbers δ(w), for these words w. Obviously we have D⊂D0, and the following theorem shows that this inclusion is an equality if we restrict to words with “sufficiently many” palindromic prefixes.

(3)

Theorem 1.3.We haveD∩ [1,√

3] =D0∩ [1,√

3] =D∩ [1,√ 3].

For a periodic wordwwith a palindromic period, we have triviallyδ(w)=1. For a characteristic Sturmian wordwwith slope[0, s1, s₂, . . . ,], Eq. (1) allows one to computeδ(w). From this it is easy to deduce that the characteristic Sturmian wordwwith minimal value ofδ(w)is the Fibonacci word. This shows that 1 and the golden ratioγ=(1+√

5)/2 are the two smallest elements inD. Cassaigne studied [3, Corollary 1 and Theorem 2] the next elements, and his result (together with Theorem 1.3) yields:

Theorem 1.4.The smallest elements inD0(respectively inD)make up an increasing sequence (σn)_n0withσ0=1andσ1=γ, converging to the smallest accumulation pointσ_∞ofD0(re- spectively ofD).

In more precise terms, this statement means thatD0∩[1, σ_∞)= {σ_n, n0}. Moreover, allσ_n, andσ_∞=1.721. . ., are given in an explicit way in terms of their continued fraction expansion.

For instance, writingm¯ for the periodic repetition mmm . . .=m^ω of a finite sequence m, we have:

σ2=1+√

2/2=1.707. . .= [1,1,2¯] and

σ3= 2+√

10

/3=1.720. . .= [1,1,2,1,1].

As a corollary, we see that the Fibonacci word has maximal “palindromic prefix density”

among nonperiodic words.

Corollary 1.5.Letwbe an infinite word withδ(w) < γ. Thenwis periodic.

For a characteristic Sturmian wordw with slope [0, s1, s2, . . . ,], Morse and Hedlund have computed [14] the recurrence function ofw. This gives (see [3, Corollary 1]) a formula for the recurrence quotient(w)ofw, namely(w)=2+lim sup[sk, sk−1, . . . , s1]. Therefore Eq. (1) gives in this case:

δ(w)=2(w)−3 (w)−1 ,

hence (as above) the Fibonacci word has minimal recurrence quotient (equal to(5+√ 5)/2) among all characteristic Sturmian words. Rauzy has conjectured [16] that it has minimal recurrence quotient among all nonperiodic words. Corollary 1.5 is an analogue of this conjecture.

The motivation for this text comes from diophantine approximation. Actually D0\ {1} is equal [9] to the set denoted byS0∩ [1,2]in [10], defined in terms of an exponent that measures the simultaneous approximation to a real number and its square by rational numbers with the same denominator. In particular, Theorem 2.1 in [10] follows from this equality and Theorem 1.3 stated above.

This connection between palindromic prefixes and diophantine approximation is due to Roy [17]. It allows one to get a purely number-theoretical proof of Corollary 1.5 stated above, by applying Davenport–Schmidt’s theorem [5] on simultaneous approximation toξ andξ²to the real numberξ obtained (as in [17]) from an infinite wordw.

(4)

The structure of this text is as follows. We first explain the notation (Section 2.1), and prove that for anyα >2 there is a wordwsuch thatδ(w)=α(Section 2.2). This explains why the rest of the text is devoted only to wordswsuch thatδ(w)2.

Then we recall how characteristic Sturmian words (Section 3.1) and standard episturmian words (Section 3.2) are constructed, with a special emphasis on their palindromic prefixes. In Section 4, we construct all words with abundant palindromic prefixes (Section 4.1). To study these words, the key definition is the one of reduced functions, which allows us to state (Sec- tion 4.2) the main results on words with abundant palindromic prefixes. Moreover, we explain (Section 4.3) how to computeδ(w)for such a wordw, using the associated reduced functionψ.

The proof of the results stated in Section 4 is given in Section 5, using general lemmas (Sec- tions 5.1 and 5.2) that might be of independent interest.

Next we briefly explain how to generalize the results of Section 4 to words that satisfy n_i₊₁2ni+1 for any sufficiently largei (Section 6.1). This allows to prove (in Section 6.2) Theorem 1.2 stated above.

Theorem 1.3 is proved in Section 7.1, and the setD0(respectivelyD) is studied near√ 3 in Section 7.3 (respectively in Section 7.4); this implies that Theorem 1.3 is optimal. We also define A-strict words with abundant palindromic prefixes in Section 7.2, and prove that anyA-strict standard episturmian wordwsuch thatδ(w) <√

3 is either periodic or characteristic Sturmian.

Section 8 contains questions and open problems about words with abundant palindromic prefixes. At last, Appendix A is devoted to the proof of two technical results: Proposition 6.2 (stated in Section 6.2) and Lemma 7.1 (stated in Section 7.1). These statements concern asymptotic properties of the sequence(n_i)associated with a word w such that δ(w) <2. They are also useful for the diophantine analogue [9] of this text.

2. Notation and a peculiar construction

2.1. Notation

Throughout the text, we consider a (finite or infinite) alphabetA, which we assume to be disjoint fromN^∗= {1,2,3, . . .}. Of course, this is not a serious restriction; it allows us to consider AN^∗as a disjoint union.

We denote by|u| the length of a finite wordu, that is the number of letters inu, and byε the empty word (which has length zero). Given a finite wordu=u₁. . . u_p withu_i∈Afor any i∈ {1, . . . , p}, we denote byu˜its mirror imageu_p. . . u₁, in such a way thatuis a palindrome if, and only if,u= ˜u. We setε˜=ε, so thatεis considered a palindrome. We say that a word u=u₁. . . u_p is a prefix ofuifppanduj=u_j for anyjp, that is if there is a wordu such thatu=uu. We extend this definition to the case whereuis an infinite¹wordu₁u₂u₃. . .. In particular,εis a palindromic prefix of any (finite or infinite) word.

In the same way, a worduis a suffix ofuif, and only if, there is a finite wordusuch that u=uu. If this happens then either bothuanduare finite, or bothuanduare infinite.

Ifw andw are finite words such thatw is a prefix ofw, we denote byw⁻¹w the word w such thatw=ww. In the same way, ifw=ww, we writew=ww⁻¹. An important special case is the following: ifwandware palindromes andwis a prefix ofw, thenwis also

1 In this text, we consider only right infinite words. In particular, all palindromes are assumed to be finite.

(5)

a suffix ofw andww⁻¹w is again a palindrome (of whichwis a prefix). In this situation, if w=wwthen we haveww⁻¹w=ww²(see Lemma 5.1 below).

Remark 2.1.Letwbe a word on the (finite or infinite) alphabetA, such thatn_i₊₁2nifor any i sufficiently large (with the sequence(n_i)_i₁defined in the Introduction). Then only finitely many letters of Aoccur in w; this follows from Proposition 6.1 proved below. Therefore the interesting case, throughout this paper, is whenAis finite.

2.2. Words with scarce palindromic prefixes

In this section, we prove that(2,+∞] ⊂D. This result explains why all wordswstudied in the rest of this text are such thatδ(w)2.

Obviously there are wordswwith only a finite number of palindromic prefixes; they satisfy δ(w)= ∞hence∞ ∈D. Now letαbe a real number greater than 2, and chooseε >0 such that 2+ε < α. Denote by(pk)_k₀a sequence of positive integers such that₁₀^p^kk tends toα, with

pk

10^k >2 for anyk. We define an increasing sequence(n_i)_i₁in the following way. We letn₁=0, n₂=1 and ifi2 is even we letv_i₊₁be the maximal integer such that there exists a multiple of 10^vⁱ⁺¹, denoted byn_i₊₁, with 2ni< n_i₊₁< (2+ε)n_i+1. Ifi3 is odd, we letn_i₊₁=p_v_i₁₀ⁿⁱ_vi. With this definition, we haven_i₊12ni+1 for anyi1, andv_i (which is defined only wheni is odd) tends to infinity asitends to infinity. This implies lim supⁿ_nⁱ⁺¹

i =α.

Now let us construct a wordw such that (ni)i1 is exactly the sequence of all lengths of palindromic prefixes ofw. We consider an alphabetA= {δ_k, k∈N}withδ_i=δ_j wheni=j. We define finite palindromesπ_i, of lengthn_i, byπ₁=εand, fori1:

π_i₊₁=π_iδ_iδ₀ⁿⁱ⁺¹⁻²ⁿⁱ⁻²δ_iπ_i ifn_i₊₁2ni+2, πi+1=πiδiπi ifni+1=2ni+1.

Then for anyi1,πi is a palindrome written on the alphabet{δ0, . . . , δi−1}. It is also a prefix ofπi+1, and all palindromic prefixes ofπi+1(exceptπi+1itself) are prefixes ofπi. The infinite wordwdefined as the limit ofπ_i asitends to infinity satisfies the required property: its palindromic prefixes are exactly theπ_i’s, withn_i= |π_i|. Thereforeα=lim supⁿⁱ_n⁺¹

i =δ(w)∈D.

This proves the desired result, namely(2,+∞] ⊂D.

3. Sturmian and episturmian words

In this section, we recall how to construct characteristic, or standard, Sturmian (Section 3.1) and standard episturmian (Section 3.2) words, with a focus on the properties of their palindromic prefixes.

3.1. Characteristic Sturmian words

In this section, we recall a construction of characteristic Sturmian words (see [13, Chapter 2]) and properties of their palindromic prefixes.

We consider the two-letter alphabet A= {a, b}. Let s₁, s₂, . . ., be an infinite sequence of positive integers. Defineσ₀=a,σ₁=a^s¹⁻¹band, by induction,σ_n=σ_n^sⁿ₋₁σ_n₋₂for anyn2.

In the terminology of [13, p. 75],(σn)is the standard sequence associated with(s1−1, s2, s3, . . .).

For anyn1,σn is a prefix ofσn+1; therefore the wordsσn tend to an infinite wordcα, called thecharacteristic Sturmian wordwith slopeα= [0, s1, s₂, . . .].

(6)

Forn2 and 1psn, the wordσ_n^p₋₁σn−2is a prefix ofσn=σ_n^sⁿ₋₁σn−2(sinceσn−2is a prefix ofσ_n₋₁), hence ofc_α. Moreover, it ends withbaifnis even, and withabifnis odd. As it is a standard word, there exists a palindromeπˆ_n,psuch that

σ_n^p₋₁σn−2= ˆπn,pba ifnis even,

σ_n^p₋₁σ_n₋₂= ˆπ_n,pab ifnis odd. (2)

Actuallyπˆ_n,p is even a central word, so it can be writtenπ abπ for some palindromesπ,π (see [7]). However, in what follows, we shall use only the fact that the wordsπˆn,pdefined in this way are palindromes. This fact can be proved directly (see, for instance, [2, Lemma 5.3]).

We shall now define a sequence(π_i)_i₁of palindromic prefixes ofc_α. First, for anyk1 we lett_k=s₁+ · · · +s_k. Now observe that for anyis₁there is exactly one pair(n, p)withn2 and 1ps_nsuch thati=t_n₋₁+p−1. Therefore the equality

ˆ

πn,p=πt_n₋₁+p−1 forn2 and 1psn

definesπ_i in a unique way foris₁(andπ_t_k₋₁= ˆπ_k,s_k is obtained fromσ_k by removing the last two letters). Ifs₁2, we letπ_i =aⁱ⁻¹ for anyi∈ {1, . . . , s1−1}. Thenπ_i is defined for anyi1; we haveπ₁=εand eachπ_i is a prefix ofπ_i₊₁. Moreover, allπ_i’s are palindromic prefixes ofc_α.

Actually theπi’s are the only palindromic prefixes ofcα. This follows from de Luca’s result ([6, Theorem 5]; see also [8, §3]) thatπ_i₊₁is the right palindromic closure ofπ_iδ_i, whereδ_i∈A is the letter inπ_i₊₁that comes right afterπ_i(see Section 3.2 below). Another proof of this result can be obtained by applying Theorem 4.12 proved in this text (see Example 4.6).

Since the π_i’s are exactly the palindromic prefixes of c_α, we have the equality δ(c_α)= lim sup|π_i₊1|/|π_i|. It is not difficult to deduce Eq. (1) from this (see [2, Proposition 7.1]).

Letk3. It is not difficult to prove the relation

π_t_k₊=σ_k⁺¹π_t_k−1₋₁ for any∈ {0, . . . , sk+1} (3) using (for the case=s_k₊₁) the identity σ_k₋₁π_t_k₋₁=σ_kπ_t_k₋₁₋₁ (see, for instance, [2, Lem- ma 5.1]). From Eq. (3) immediately follows

πt_k+1=πt_kπ_t⁻¹

k−1−1πt_k, π_t_k₊₊₁=π_t_k₊π_t⁻¹

k+−1π_t_k₊ for any∈ {1, . . . , sk+1−1}. (4) We are going now to define a mapψ:N^∗→N^∗Ain such a way that, for anyi1:

π_i₊₁=π_iπ_ψ(i)⁻¹ π_i ifψ (i)∈N^∗,

π_i₊₁=π_iψ (i)π_i ifψ (i)∈A. (5)

The possibility to define inductively, in this way, the palindromic prefixes ofc_α usingψwill be the crucial point in the construction of Section 4.

Fork3 we let ψ (t_k)=t_k₋1−1, and ifi > t3is not among thet_k’s we letψ (i)=i−1.

Then Eq. (4) shows that (5) holds for anyit3. To define the valuesψ (i) for 1i < t3, we distinguish between two cases.

First, let us assumes₁=1. Thenπ=b⁻¹ for 1t₂andπ_t₂₊=(b^s²a)b^s² for any 0s₃. We let ψ (1)=b,ψ (t₂)=a andψ (i)=i−1 for i∈ {2, . . . , t3−1} \ {t₂}. Then Eq. (5) holds for anyi1.

Now let us assumes12. Thenπ=a⁻¹for 1t1andπ_t₁₊=(a^s¹⁻¹b)a^s¹⁻¹for any 0s2. Moreover, Eq. (3) holds also fork=2. We letψ (1)=a,ψ (t1)=b,ψ (t2)=t1−1 andψ (i)=i−1 fori∈ {2, . . . , t3−1} \ {t₁, t₂}. Then Eq. (5) holds for anyi1.

(7)

3.2. Standard episturmian words

Denote byw⁽⁺⁾ the (right) palindromic closure of a finite wordw, that is the shortest palindrome of whichwis a prefix. LetΔ=δ₁δ₂. . .be an infinite word on an alphabetA. Droubay, Justin and Pirillo gave [8] the following definition (see [12, Corollary 2.2]):

Definition 3.1.The standard episturmian word with directive wordΔis the limit of the sequence (π_i)_i₁defined byπ₁=εandπ_i₊₁=(π_iδ_i)⁽⁺⁾fori1.

The important point here (which will be generalized in Section 4.1) is that a standard episturmian word can be constructed as a limit of an infinite sequence of its palindromic prefixes.

GivenΔ, define a functionψ:N^∗→N^∗Aas follows. Forn1, letψ (n)=δnif the letter δn occurs for the first time inΔat the nth position. Otherwise, letψ (n)=n wheren is the greatest integer such that 1nn−1 andδ_n=δn. Then for anyi1 we have [12, p. 287]:

π_i₊₁=π_iπ_ψ(i)⁻¹ π_i ifψ (i)∈N^∗ and

π_i₊1=π_iψ (i)π_i ifψ (i)∈A.

The crucial remark in what follows is that these equalities could have been taken as a definition of the sequence(π_i), and therefore of standard episturmian words.

Example 3.2.Lets₁, s₂, . . . be a sequence of positive integers, andA= {a, b}be a two-letter alphabet. The standard episturmian word with directive word Δ=a^s¹⁻¹b^s²a^s³b^s⁴. . . is the characteristic Sturmian word with slope[0, s1, s₂, . . .]. This follows from Section 3.1 (see also [6, proof of Theorem 5]).

Example 3.3.LetA= {a, b, c}andΔ=(abc)^ω=abcabcabc . . .. Then the standard episturmian wordwwith directive wordΔis [12, Example 2.1] the Tribonacci (or Rauzy [15]) word (that is, the fixed pointabacabaabacabab . . .of the morphism defined bya→ab,b→acand c→a). The corresponding functionψ is given byψ (n)=n−3 forn4, andψ (n)=δ_nfor 1n3.

4. Words with abundant palindromic prefixes

In this section, we give a general construction (Section 4.1) of all words with abundant palindromic prefixes, using functions ψ. Then we define (Section 4.2) reducedfunctions ψ; this definition allows us to state the main results about words with abundant palindromic prefixes, namely Theorems 4.12 and 4.14. At last, we explain in Section 4.3 how to computeδ(w)(for a wordwwith abundant palindromic prefixes) using the associated reduced functionψ.

4.1. A general construction

Letψ:N^∗→N^∗Abe any map such that, for eachn1:

either ψ (n)∈A or 1ψ (n)n−1.

(8)

Defineπ1=εand, fori1:

π_i₊₁=π_iπ_ψ(i)⁻¹ π_i ifψ (i)∈N^∗ and

πi+1=πiψ (i)πi ifψ (i)∈A.

It is not difficult to prove by induction that allπi’s are palindromes, and thatπiis a prefix ofπi+1

(for instance, ifψ (i)∈N^∗, writingπ_i=π_ψ(i)b_i= ˜b_iπ_ψ(i)yieldsπ_i₊₁= ˜b_iπ_ψ(i)b_i=π_ψ(i)b²_i; the easy Lemma 5.1 stated below can also be used). However, in general there is no letterδ_i∈A such thatπ_i₊₁be the palindromic closure ofπ_iδ_i.

Definition 4.1.We callword with abundant palindromic prefixesassociated withψ, and denote byw_ψ, the limit of the sequence(π_i).

This definition is consistent with the one given in the Introduction since the following result holds (it is proved in Section 5 as a consequence of Theorem 4.14 stated below).

Theorem 4.2.Letwbe an infinite word, and(n_i)_i₁be the increasing sequence(assumed to be infinite)of the lengths of its palindromic prefixes(withn₁=0). Then the following statements are equivalent:

(i) We haven_i₊₁2ni+1for anyi1 (i.e.,whas abundant palindromic prefixes).

(ii) For some function ψ, we have w=w_ψ (i.e., w is the word with abundant palindromic prefixes associated withψ).

Let us study in more details the word with abundant palindromic prefixes associated with a mapψ. First, let us consider the letterδ_i inπ_i₊₁that comes right afterπ_i. This is the first letter of π_i⁻¹π_i₊₁, the one such that π_iδ_i is a prefix of π_i₊₁. We haveδ_i =ψ (i) if ψ (i)∈A, and δ_i=δ_ψ(i)otherwise. This explains the following definition.

Definition 4.3.We callword of first lettersassociated withψthe wordΔ=δ₁δ₂. . .defined (for eachn1) byδ_n=ψ (n)ifψ (n)∈A, andδ_n=δ_ψ(n)otherwise.

The assumptions on ψ imply ψ (1)∈A and π2 =ψ (1)=δ1. For ψ (2) there are two possibilities: either ψ (2)∈A (thenπ3=ψ (1)ψ (2)ψ (1) andδ2=ψ (2)), orψ (2)=1 (then π₃=ψ (1)ψ (1)andδ₂=ψ (1)).

Already from this example we can see that several functionsψ may lead to the same word of first lettersΔ: for instance, takingψ (2)=ψ (1)∈Ayields the same value of δ₂ as taking ψ (2)=1∈N^∗, but not the same value ofπ₃. Using this example it is not difficult to produce functionsψandψwith the same word of first letters but such thatw_ψ=w_ψ. Therefore a word w_ψ with abundant palindromic prefixes is not given just by its word of first lettersΔ, but by a richer structure: the function²ψ. To be precise,ψ is given exactly by the wordΔ=δ₁δ₂. . . together with the choice, for anyn1, of an integern∈ {0, . . . , n−1}that satisfies either n=0 orδ_n=δ_n. If we fixΔ, then a special choice ofψis obtained by taking fornthe greatest

2 Actually one may restrict to reduced functions, see Section 4.2 below.

(9)

integern< nsuch thatδ_n=δn(andn=0 if there is no such integer, i.e., if the letterδnoccurs for the first time inΔ at thenth position). For this functionψ, the word wψ is the standard episturmian word with directive word Δ(see Section 3.2). Therefore Definitions 4.1 and 4.3 generalize Definition 3.1 of standard episturmian words.

Remark 4.4.Two distinct functionsψ andψalways lead to distinct sequences(π_i)and(π_i), but may lead to the same wordw_ψ=w_ψ(see Example 4.8 below).

Example 4.5.If ψ (n)=n−1 for anynN then π_N₊=π_Nω for any 0, withω= π_N⁻¹₋₁πN. Therefore in this casewψ is ultimately periodic, hence periodic with a palindromic period (see Lemma 5.6 below).

Example 4.6.Let A= {a, b}be a two-letter alphabet, and(s_k)_k₁ be a sequence of positive integers. For any k1, let t_k=s₁+ · · · +s_k if s₁2 and t_k =s₁+ · · · +s_k₊₁ if s₁=1.

In both cases, lett₀=1. Moreover, let ψ (i)=i−1 if i1 is not amongt₀, t₁, t₂, . . ., and ψ (t_k)=t_k₋₁−1 for anyk2. Ifs₁2, letψ (1)=aandψ (t₁)=b; ifs₁=1, letψ (1)=b andψ (t₁)=a. Then the wordw_ψ associated withψ is the characteristic Sturmian word with slope[0, s1, s₂, . . .]. The function ψ, the palindromesπ_i and the sequence(t_k)are exactly the same as in Section 3.1 (except that the indexkint_kis shifted ifs₁=1).

Example 4.7.In the previous example, ifsk=1 for anyk1 then ψ (1)=b,ψ (2)=a and ψ (i)=i−2 for anyi3. The wordwψ=babbab . . .is the Fibonacci word.

4.2. Reduced functions

Two problems immediately arise from the construction of words with abundant palindromic prefixes. First, are there other palindromic prefixes ofw_ψthan theπ_i’s? Second, can two distinct functionsψandψlead to the same wordw?

In general, the answers to both questions are positive, as shown in the following example.

This is the reason whyreducedfunctions are studied below.

Example 4.8.Letψbe a function, andi2 be a integer, such thatψ (i+1)=ψ (i)=i−1. Let b_ibe the finite nonempty word such thatπ_i=π_i₋₁b_i. Thenπ_i₊₁=π_i₋₁b²_i andπ_i₊₂=π_i₋₁b_i⁴. Now Lemma 5.1 stated below shows thatπ_i₋₁b³_i is a palindromic prefix ofπ_i₊₂(hence ofw_ψ), of length strictly between those ofπ_i₊₁andπ_i₊₂. This gives a palindromic prefix ofw_ψwhich is not among theπ_n’s constructed fromψ. To avoid this problem, consider a functionψsuch that ψ(n)=ψ (n)forni,ψ(i+1)=iandψ(i+2)=i+1. Denoting by(π_n)the sequence of finite palindromes associated withψ, we haveπ_n=π_nforni+1,π_i₊₂=π_i₋₁b³_i andπ_i₊₃= π_i₋1b⁴_i. Forni+3, we letψ(n)=ψ (n−1)ifψ (n−1)i+1, andψ(n)=ψ (n−1)+1 otherwise. Then we haveπ_n =π_n₋₁for anyni+3, andw_ψ=w_ψ. In this way the functions ψandψdefine the same word, but the family of finite palindromes associated withψcontains the “missing” palindromeπ_i₋₁b_i³.

Letψ:N^∗→N^∗Abe any function (in the sequel we always assume that, for eachn1, eitherψ (n)∈Aor 1ψ (n)n−1).

(10)

Denote by(tk)k0the family of all indexesn(in increasing order) such that either 1ψ (n) n−2 or ψ (n)∈A. This family can be either finite or infinite. We always havet0=1, since ψ (1)∈A.

Definition 4.9.A functionψ is said to bereducedif the associated sequence(t_k)satisfies, for anyk1, the following two conditions:

• ψ (t_k)=ψ (t_k₋₁).

• Eitherψ (t_k)∈Aorψ (t_k) < t_k₋₁.

In the special case where the family(tk)is finite (i.e.,ψ (n)=n−1 fornsufficiently large, see Example 4.5), we assume in this definition that both properties hold for anyk such thattk

exists.

Remark 4.10.The functionψin Example 4.6 is reduced, and the definition of(t_k)given there is consistent with the one introduced here.

Remark 4.11.The functionψ in Example 4.8 is not reduced. In fact there is an integerksuch thati+1=tk, and we havetk−1i−1=ψ (tk).

In the situation of Example 4.8, we have seen thatψis not reduced, and that theπ_i’s are not the only palindromic prefixes ofw_ψ. Actually both phenomena are equivalent:

Theorem 4.12.Letψ:N^∗→N^∗Abe a function such that, for eachn1, eitherψ (n)∈Aor 1ψ (n)n−1. Then the following assertions are equivalent:

• The functionψis reduced.

• The palindromic prefixes ofwψare exactly theπi’s constructed fromψ.

This theorem will be proved in the next section (Section 5.3). It is not difficult to deduce the following corollary (see Example 4.5 and Lemma 5.6).

Corollary 4.13.Letψbe a reduced function. Thenw_ψ is periodic if, and only if,ψ (n)=n−1 for any sufficiently large integern.

Letψbe a reduced function, and(πi)be the associated sequence of finite palindromes (that is, thanks to Theorem 4.12, the sequence of all palindromic prefixes ofwψ). Then the following assertions are easily seen to be equivalent:

• For any sufficiently largeiwe haveψ (i)∈N^∗.

• For any sufficiently largeiwe have|π_i₊₁|2|π_i|.

If these assertions hold thenw_ψ can be written on a finite alphabet.

In addition to Theorem 4.12, another important property of reduced functions is the following generalization of Theorem 4.2, proved in Section 5 below.

(11)

Theorem 4.14.Letwbe an infinite word, and(ni)i1be the increasing sequence(assumed to be infinite)of the lengths of its palindromic prefixes(withn₁=0). Then the following statements are equivalent:

(i) We haven_i₊₁2ni+1for anyi1 (i.e.,whas abundant palindromic prefixes).

(ii) There exists a functionψsuch thatw=w_ψ.

(iii) There exists a reduced functionψsuch thatw=w_ψ. Moreover, the reduced functionψin(iii)is unique.

It is possible to write down a “reduction” algorithm (generalizing Example 4.8) that allows one to obtain, from any functionψ, the reduced functionψsuch thatw_ψ=w_ψ. In this situation, the construction of Section 4.1 applied withψgives a sequence(π_i)of palindromic prefixes ofw_ψ; withψ, it gives another sequence(π_i). Theorem 4.12 shows that(πi)is a sub-sequence of(π_i).

Again, the “reduction” algorithm allows one to obtain explicitly the full sequence(π_i)from the sub-sequence(π_i). This algorithm is partly used in [9], but in the present text we shall not need it; the crucial point here is just the uniqueness of the reduced functionψcorresponding toψ.

Definition 4.15.Letwbe a word with abundant palindromic prefixes. The reduced functionψ in Theorem 4.14 is called thedirective functionofw.

Remark 4.16. The uniqueness assertion in Theorem 4.14 immediately follows from Theo- rem 4.12 and Remark 4.4.

Now we can put Definitions 4.3 and 4.15 together in the following way:

Definition 4.17.Letwbe a word with abundant palindromic prefixes. We callword of first letters associated withwthe word of first letters associated with the directive function ofw.

The following property holds: if(π_i)is the sequence of all palindromic prefixes of a wordw with abundant palindromic prefixes, andΔ=δ1δ2. . .is the associated word of first letters, then πiδi is a prefix ofπi+1for anyi1.

4.3. Computation ofδ(w)using reduced functions

Definition 4.18.With any reduced functionψwe associate the increasing sequence of nonnega- tive integers(n_i)_i₁defined byn₁=0 and, for alli1:

n_i₊₁=2ni−n_ψ(i) ifψ (i)∈N^∗ and

n_i₊₁=2ni+1 ifψ (i)∈A.

Theorem 4.12 shows thatn_i is the length of theith palindromic prefix ofw_ψ. In the same way, we introduce the following definition so thatδ(ψ )=δ(w_ψ):

Definition 4.19.For any reduced functionψwe letδ(ψ )=lim supⁿⁱ_n⁺¹

i , where(ni)is associated withψas in Definition 4.18.

(12)

This definition ofδ(ψ )is completely elementary. It is useful because of the following fact:

for a wordwwith abundant palindromic prefixes, we haveδ(w)=δ(ψ )whereψis the directive function ofw(see Definition 4.15).

5. Proof of the main results

5.1. General lemmas about palindromic prefixes

The first lemma is very easy, and sufficient to prove half of Theorem 4.12 (see Section 5.3 below).

Lemma 5.1.Let p andu be two words, such thatp and puare palindromes. Thenpu² is a palindrome(and so is, by induction, the word puⁿ for anyn2). Similarly, ifp andup are palindromes thenuⁿpis a palindrome for anyn0.

Proof. Ifpandpuare palindromes then we havep˜=pandup˜ =puhence pu²= ˜uup˜ = ˜upu=pu².

The case where p and up are palindromes is analogous. This concludes the proof of Lemma 5.1. 2

In particular, in this situationpuandpu²are palindromes, one is a prefix of the other, and the quotient of their lengths is less than 2 (or equal to 2 whenpis empty). The following lemma gives a kind of converse to this phenomenon (at least in the casen=n).

Lemma 5.2.Letwbe an infinite word, andn,n,nbe integers such thatnnn+n. We assume that the prefixes ofwwith lengthsn,n,nare palindromes, denoted bya,aand a, respectively. Leta₀be the prefix ofwof lengthn+n−n. Then the following holds:

• There is a wordbsuch thata=a₀banda=ab.

• Ifnn−nthena₀is a palindrome.

Remark 5.3.This lemma will be used only whennn−n, and in this case the first property will be written

a=aa₀⁻¹a

sincea0 is both a suffix ofa and a prefix ofa. Moreover, an important special case is when n=n. The lemma then reads: if a anda are palindromes, withnn2n, thena0 is a palindrome and we havea=a0banda=a0b².

Proof of Lemma 5.2. Asais a prefix ofa, there exists a wordbsuch thata=ab. The word bis a suffix ofa, therefore its mirror imageb˜is a prefix ofa(hence also ofw) sincea is a palindrome. Nowb˜has lengthn−nn, thereforeb˜is a prefix ofa. Asais a palindrome,b is a suffix ofa: there exists a wordcsuch thata=cb. It is clear thatc=a₀is the prefix ofwof lengthn+n−n.

(13)

Assume nownn−n, and let us show thata0is a palindrome. Let 1i(n+n−n)/2;

then we haveinhence:

w_n₊_n−n+1−i=w_n−n+i=w_n+1−i=w_i,

by using successively that a, a and a are palindromes. This concludes the proof of Lemma 5.2. 2

Lemma 5.4.Letwbe an infinite word. Letn< nbe two consecutive lengths of palindromic prefixes ofw;let us denote byπand π the corresponding prefixes, withπ=πωfor some wordω. Then any palindromic prefixπ ofwsuch thatn|π|n+ncan be writtenπω^t witht0.

Proof. Assume there is a prefix π of w, of length n, which contradicts the lemma and has minimal length. Asnandnare consecutive, we haven > n. Lemma 5.2 gives a palindromic prefixπ₀ofwof lengthn− |ω|> n, such thatπ=π₀ω. This contradicts the minimality ofπ, and concludes the proof. 2

Lemma 5.5.Letwbe an infinite word. Letn₀< n₁< n₂be three consecutive lengths of palin- dromic prefixes ofw;let us denote byπ₀,π₁andπ₂the corresponding prefixes. Then:

• eitherπ₂=π₁π₀⁻¹π₁,

• orn₂> n₀+n₁.

Proof. If n2n0+n1, one may apply Lemma 5.2 withn=n2,n=n0 andn=n1. Then n2+n0−n1is the length of a palindromic prefix ofw; but this length is strictly betweenn0

andn2, therefore it isn1. We get in this wayπ2=π1π₀⁻¹π1, which concludes the proof of the lemma. 2

5.2. Ultimately periodic words

Lemma 5.6.Letwbe an infinite ultimately periodic word, infinitely many prefixes of which are palindromic. Thenwis periodic with a palindromic period. Moreover, ifd denotes the smallest length of a period ofwthen there existsr∈ {1, . . . , d}with the following property. For anynd, the prefix ofwof lengthnis a palindrome if, and only if,n≡rmodd.

Proof. Ifwwere ultimately periodic but not periodic, there would exist two nonempty wordsπ₀ andπ such thatw=π0π π π . . ., and such that the last letter ofπ0be different from that ofπ.

But this contradicts the assumption thatwhas arbitrary long palindromic prefixes. In fact, if we denote byz1. . . zd the wordπ and byz0=zd the last letter ofπ0, then this assumption implies that the wordz_d₋₁. . . z₀appears infinitely many times inw, and is therefore a cyclic permutation of the periodz₁. . . z_d. Asz₀=z_d, this is impossible.

Thereforewis periodic, and can be writtenw=π π π . . .with a periodπof minimal lengthd. Let nd be the length of a palindromic prefix of w. Then we have w_i =w_n₊₁₋_i for all i∈ {1, . . . , d}. Ifn is another such integer, not congruent ton modd, we obtain w_i =w_i₊_ε for all i∈ {1, . . . , d}with 1εd −1; this contradicts the minimality of d. Therefore all