We illustrate these techniques by giving decidability results on the difference hierarchies based on shuffle ideals, strongly cyclic regular languages and the polynomial closure of group lan- guages

(1)

OLIVIER CARTON, DOMINIQUE PERRIN, AND JEAN- ´ERIC PIN IRIF, CNRS and Universit´e Paris-Diderot,, Case 7014, 75205 Paris Cedex 13, France.

e-mail address: [email protected]

Laboratoire d’informatique Gaspard-Monge, Université de Marne-la-Vallée, 5, boulevard Descartes, Champs- sur-Marne, F-77454 Marne-la-Vallée Cedex 2.

IRIF, CNRS and Universit´e Paris-Diderot,, Case 7014, 75205 Paris Cedex 13, France.

ABSTRACT. Difference hierarchies were originally introduced by Hausdorff and they play an important role in descriptive set theory. In this survey paper, we study difference hierarchies of regular languages. The first sections describe standard techniques on difference hierarchies, mostly due to Hausdorff. We illustrate these techniques by giving decidability results on the difference hierarchies based on shuffle ideals, strongly cyclic regular languages and the polynomial closure of group languages.

Dedicated to the memory of Zolt´an ´Esik.

1. INTRODUCTION

Consider a set E and a set F of subsets of E containing the empty set. The general pattern of a difference hierarchy is better explained in a picture. Saturn’s rings-style Figure 1 represents a decreasing sequence

X₁ ⊇X₂⊇X₃ ⊇X₄⊇X₅

of elements ofF. The grey part of the picture corresponds to the set(X₁−X₂) + (X₃−X₄) +X₅, a typical element of the fifth level of the difference hierarchy defined byF. Similarly, then-th level of the difference hierarchy defined by F is obtained by considering length-n decreasing nested sequences of sets.

Received by the editorsThursday 5^thOctober, 2017 13:23.

1998 ACM Subject Classification: Formal languages and automata theory, Regular languages, Algebraic language theory.

Key words and phrases: Difference hierarchy, regular language, ordered syntactic monoid.

The third author is partially funded from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (grant agreement No 670624). The first and third authors are partially funded by the DeLTA project (ANR-16-CE40-0007).

LOGICAL METHODS

IN COMPUTER SCIENCE DOI:10.2168/LMCS-???

c O. Carton, D. Perrin, and J.- ´E. Pin Creative Commons

1

(2)

X1

X₂ X₃

X₄ X₅

Figure 1: Five subsets ofE.

Difference hierarchies were originally introduced by Hausdorff [12,13,14]. They play an important role in descriptive set theory [28, Section 11] and also yield a hierarchy on complexity classes known as the Boolean hierarchy [15, Section 3], [30, Section 2], [3], [2, Section 3]. Difference hierarchies were also used in the study ofω-regular languages [4,6,8,7,9,29].

The aim of this paper is to survey difference hierarchies of regular languages. Decidability questions for difference hierarchies over regular languages were studied in [10] and more recently by Glasser, Schmitz and Selivanov in [11]. The latter article is the reference paper on this topic and contains an extensive bibliography, to which we refer the interested reader. However, paper [11]

focuses on forbidden patterns in automata, a rather different perspective than ours.

We first present some general results on difference hierarchies and their connection with closure operators. The results on approximation of Section5, first presented in [5], lead in some cases to convenient algorithms to compute chain hierarchies.

Next we turn to algebraic methods. Indeed, a great deal of results on regular languages are obtained through an algebraic approach. Typically, combinatorial properties of regular languages — being star-free, piecewise testable, locally testable, etc. — translate directly to algebraic properties of the syntactic monoid of the language (see [18] for a survey). It is therefore natural to expect a similar algebraic approach when dealing with difference hierarchies. However, things are not that simple. First, one needs to work withordered monoids, which are more appropriate for classes of regular languages not closed under complement. Secondly, although Theorem 7.2yields a purely algebraic characterization of the difference hierarchy, it does not lead to decidability results, except for some special cases. Two such cases are presented at the end of the paper. The first one studies the difference hierarchy of the polynomial closure of a lattice of regular languages. The main result, Corollary8.5, which appears to be new, states that the difference hierarchy induced by the polynomial of group languages is decidable. The second case, taken from [5], deals with cyclic and strongly cyclic regular languages.

Our paper is organised as follows. Prerequities are presented in Section2. Section3covers the results of Hausdorff on difference hierarchies and Section4is a brief summary on closure operators. The results on approximation form the core of Section 5. Decidability questions on regular languages are introduced in Section6. Section7on chains is inspired by results of descriptive set theory. Two results that are not addressed in [11] are presented in Sections8and9. The final Section 10opens up some perspectives.

(3)

2. PREREQUISITES

In this section, we briefly recall the following notions: upsets, ordered monoids, stamps and syntactic objects.

LetEbe a preordered set. Anupper setofEis a subsetU ofEsuch that the conditionsx∈U and x 6 y imply y ∈ U. An ordered monoid is a monoid M equipped with a partial order6 compatible with the product onM: for allx, y, z∈M, ifx6ythenzx6zyandxz6yz.

Astampis a surjective monoid morphismϕ:A^∗ → M from a finitely generated free monoid A^∗onto a finite monoidM. IfMis an ordered monoid,ϕis called anordered stamp.

Therestricted direct productof two stampsϕ₁ :A^∗→M₁ andϕ₂ :A^∗ → M₂ is the stampϕ with domainA^∗ defined byϕ(a) = (ϕ₁(a), ϕ₂(a)). The image ofϕis an [ordered] submonoid of the [ordered] monoidM₁×M₂.

A^∗

M₁ M₂

Im(ϕ)⊆M₁×M₂

ϕ₁ ϕ₂

ϕ

π₁ π₂

Stamps and ordered stamps are used to recognise languages. A languageLofA^∗ isrecognised by a stampϕ:A^∗→ Mif there exists a subsetP ofMsuch thatL=ϕ⁻¹(P). It isrecognised by an ordered stampϕ:A^∗ →M if there exists an upper setU ofMsuch thatL=ϕ⁻¹(U).

The syntactic preorder of a language was first introduced by Sch¨utzenberger in [26, p. 10]. Let Lbe a language ofA^∗. Thesyntactic preorderofLis the relation6_Ldefined onA^∗ byu6_Lvif and only if, for everyx, y∈A^∗,

xuy∈L =⇒ xvy∈L.

The associated equivalence relation∼L, defined byu∼L vifu6_Lvandv6_Lu, is thesyntactic congruence of L and the quotient monoid M(L) = A^∗/∼L is the syntactic monoid of L. The natural morphismη:A^∗ →A^∗/∼Lis thesyntactic stampofL. Thesyntactic imageofLis the set P =η(L).

Thesyntactic order 6_P is defined on M(L)as follows: u 6_P vif and only if for all x, y ∈ M(L),

xuy∈P =⇒ xvy∈P

The partial order6_P is stable and the resulting ordered monoid(M(L),6_P)is called theordered syntactic monoidofL. Note thatP is now an upper set of(M(L),6_P)andηbecomes an ordered stamp, called the ordered syntactic stamp ofL.

3. DIFFERENCE HIERARCHIES

LetEbe a set. In this article, alatticeis simply a collection of subsets ofEcontaining∅andEand closed under taking finite unions and finite intersections. Alatticeclosed under complement is a Boolean algebra. Thorough this paper, we adopt Hausdorff’s convention to denote union additively,

(4)

set difference by a minus sign and intersection as a product. We also sometimes denote L^c the complement of a subsetLof a setE.

LetF be a set of subsets ofE containing the empty set. We set B0(F) = {∅}and, for each integern>1, we letBn(F)denote the class of all sets of the form

X=X₁−X₂+ · · · ±Xn (3.1) where the setsX_iare inFand satisfyX₁ ⊇X₂⊇X₃ ⊇ · · · ⊇X_n. By convention, the expression on the right hand side of (3.1) should be evaluated from left to right, but given the conditions on the Xi’s, it can also be evaluated as

(X₁−X₂) + (X₃−X₄) + (X₅−X₆) +· · · (3.2) Since the empty set belongs to F, one has Bn(F) ⊆ Bn+1(F) for all n > 0 and the classes Bn(F)define a hierarchy within the Boolean closure ofF. Moreover, the following result, due to Hausdorff [13], holds:

Theorem 3.1. LetF be a lattice of subsets ofE. The union of the classesBn(F)forn>0is the Boolean closure ofF.

Proof. Let B(F) = ∪n>1Bn(F). By construction, every element of Bn(F) is a Boolean combination of members of F and thus B(F) is contained in the Boolean closure of F. Moreover B₁(F) = F and thusF ⊆ B(F). It is therefore enough to prove thatB(F)is closed under complement and finite intersection. IfX=X₁−X₂+ · · · ±X_n, one has

E−X=E−X₁+X₂− · · · ∓X_n

and thusX∈ B(F)impliesE−X∈ B(F). ThusB(F)is closed under complement.

LetX=X₁−X₂+ · · · ±X_nandY =Y₁−Y₂+ · · · ±Y_mbe two elements ofB(F). Let Z=Z₁−Z₂+ · · · ±Z_n+m−1

with

Z_k= X

i+j=k+1 iandjnot both even

XiYj

Therefore

Z₁=X₁Y₁,

Z₂=X₁Y₂+X₂Y₁, Z₃=X₁Y₃+X₃Y₁,

Z4=X1Y4+X2Y3+X3Y2+X4Y1, ...

Zn+m−1=

(XnYm ifmandnare not both even

∅ otherwise

We claim that Z = XY. To prove the claim, consider for each set X = X1−X2 + · · · ±Xn

associated with the decreasing sequenceX₁, . . . ,X_nof subsets ofE, the functionµ_X defined onE by

µX(x) = max{i>1|x∈Xi}

with the convention thatµX(x) = 0ifx ∈E−X₁. Thenx∈Xif and only ifµX(x)is odd. We now evaluateµ_Z(x)as a function ofi=µ_X(x)andj=µ_Y(x). We first observe that ifk>i+j,

(5)

then x /∈ Z_k. Next, if iand j are not both even, then x ∈ X_iY_j and X_iY_j ⊆ Z_i+j−1, whence µZ(x) = i+j−1. Finally, if iandj are both even, then x /∈ Z_i+j−₁ and thusµZ(x) is either equal to0or toi+j−2. Summarizing the various cases, we observe thatµ_X(x)andµ_Y(x)are both odd if and only ifµZ(s)is odd, which proves the claim. It follows thatB(F)is closed under intersection.

An equivalent definition ofBn(F)was given by Hausdorff [14]. LetX △Y denote the symmetric difference of two subsetsXandY ofE.

Proposition 3.2. For everyn>0,Bn(F) ={X1△X₂ △ · · · △Xn |Xi ∈ F}.

Proof. Indeed, if X = X₁ −X₂ + · · · ±X_n with X₁ ⊇ X₂ ⊇ X₃ ⊇ · · · ⊇ X_n, thenX = X1 △ X2 △ · · · △ Xn. In the opposite direction, if X = X1 △ X2 △ · · · △ Xn, then X=Z₁−Z₂+ · · · ±ZnwhereZ_k=P

i₁, . . . , ikdistinctsXi₁· · ·Xi_k. 4. CLOSURE OPERATORS

We review in this section the definition and the basic properties of closure operators.

LetE be a set. A map X → X fromP(E) to itself is aclosure operator if it isextensive, idempotent and isotone, that is, if the following properties hold for allX, Y ⊆E:

(1) X⊆X(extensive) (2) X=X(idempotent)

(3) X⊆Y impliesX ⊆Y (isotone)

A setF ⊆Eisclosed ifF =F. IfF is closed, and ifX ⊆F, thenX⊆F =F. It follows thatX is the smallest closed set containingX. This justifies the terminology “closure”. Actually, closure operators can be characterised by their closed sets.

Proposition 4.1. A set of closed subsets for some closure operator onEis closed under (possibly infinite) intersection. Moreover, any set of subsets ofEclosed under (possibly infinite) intersection is the set of closed sets for some closure operator.

Proof. LetX →Xbe a closure operator and let(Fi)i∈I be a family of closed subsets ofE. Since a closure is isotone, T

i∈IFi ⊆ Fi = Fi. It follows thatT

i∈IFi ⊆ T

i∈IFi and thusT

i∈IFi is closed.

Given a set F of subsets ofE closed under intersection, denote byX the intersection of all elements ofF containingX. Then the mapX →Xis a closure operator for whichF is the set of closed sets.

In particular,X∩Y ⊆X∩Y, but the inclusion may be strict.

Example 4.1. The trivial closure is the application defined by X=

(∅ ifX=∅ E otherwise For this closure, the only closed sets are the empty set andE.

Example 4.2. IfEis a topological space, the closure in the topological sense is a closure operator.

Example 4.3. The convex hull is a closure operator. However, it is not induced by any topology, since the union of two convex sets is not necessarily convex.

(6)

Theintersection of two closure operatorsX → X¹ andX → X² is the functionX → X³ defined byX³ =X¹∩X².

Proposition 4.2. The intersection of two closure operators is a closure operator.

Proof. Let ³ be the intersection of ¹ and ². First, since X ⊆ X¹ and X ⊆ X², one has X ⊆ X³ = X¹ ∩X². In particular, X³ ⊆ X³

3

. Secondly, since X¹ ∩X² ⊆ X¹, X¹∩X²

1

⊆X¹

1

=X¹. Similarly,X¹∩X²

2

⊆X². It follows that X³³ =X¹∩X²¹∩X¹∩X²²⊆X¹∩X² =X³

and hence X³ = X³³. Finally, if X ⊆ Y, then X¹ ⊆ Y ¹ and X² ⊆ Y ², and therefore X³⊆Y ³.

Let us concluse this section by giving a few examples of closure operators occurring in the theory of formal languages.

Example 4.4. Iteration. The map L → L^∗ is a closure operator. Similarly, the map L → L⁺, whereL⁺denotes the subsemigroup generated byL, is a closure operator.

Example 4.5. Shuffle ideal. Theshuffle product (or simplyshuffle) of two languages L₁ and L₂ overAis the language

L₁ xxyL₂ ={w∈A^∗ |w=u₁v₁· · ·unvnfor some wordsu₁, . . . , un, v₁, . . . , vnofA^∗ such thatu₁· · ·un∈L₁andv₁· · ·vn∈L₂}. The shuffle product defines a commutative and associative operation over the set of languages over A. Given a languageL, the languageL xxy A^∗ is called theshuffle ideal generated byLand it is easy to see that the mapL→LxxyA^∗is a closure operator.

This closure operator can be extended to infinite words in two ways: the finite and infinite shuffle idealsgenerated by anω-languageXare respectively:

XxxyA^∗ ={y0x₁y₁· · ·x_ny_nx|y₀, . . . , y_n∈A^∗andx₁· · ·x_nx∈X}

XxxyA^ω ={y0x1y1x2· · · |y0, . . . , yn∈A^∗andx1x2· · · ∈X}

The mapsX→ XxxyA^∗ andX →XxxyA^ω are both closure operators.

Example 4.6. Ultimate closure. Theultimate closureof a languageXof infinite words is defined by:

Ult(X) ={ux|u∈A^∗ andvx∈Xfor somev∈A^∗} The mapX→Ult(X)is a closure operator.

5. APPROXIMATION

In this section, we consider a setF of closed sets of E containing the empty set. It follows that the corresponding closure operator satisfies the condition ∅ = ∅. We first define the notion of an approximationof a set by a chain of closed sets. Then the existence of a best approximation will be established. In this section,Lis a subset ofE.

(7)

Definition 5.1. A chain F₁ ⊇ F₂ ⊇ · · · ⊇ F_n of closed sets is ann-approximation ofL if the following inclusions hold for allksuch that2k+ 16n:

F₁−F₂ ⊆F₁−F₂+F₃−F₄ ⊆ · · · ⊆F₁−F₂+ · · · +F_2k−1−F_2k⊆ · · ·

⊆L⊆ · · · ⊆F₁−F₂+F₃− · · · +F_2k+1⊆ · · · ⊆F₁−F₂+F₃ ⊆F₁ There is a natural order among then-approximations of a given setL. Ann-approximation F₁ ⊇ F2 ⊇ · · · ⊇FnofLis said to bebetterthan ann-approximationF₁^′ ⊇F₂^′ ⊇ · · · ⊇F_n^′ if, for all ksuch that2k+ 16n,

F₁−F₂+F₃− · · · +F_2k+1 ⊆F₁^′−F₂^′ +F₃^′− · · · +F_2k+1^′ and

F₁^′−F₂^′ + · · · +F_2k−1^′ −F_2k^′ ⊆F₁−F₂+ · · · +F_2k−1−F_2k We will need the following elementary lemma:

Lemma 5.1. LetX,Y andZbe subsets ofE.

(1) The conditionsX−Y ⊆ZandX−Z⊆Y are equivalent, (2) IfY ⊆XandX−Y ⊆Z, thenX−Z =Y −Z.

The description of the best approximation ofLrequires the introduction of two auxiliary functions.

For every subsetXofE, set

f(X) =X−L and g(X) =X∩L (5.1)

The key properties of these functions are formulated in the following lemma.

Lemma 5.2. For every subsetX ofE, X−f(X) ⊆ Land X−g(X) ⊆ L^c. Furthermore, the following properties hold:

(1) ifX⊇Y ⊇L, thenf(X)⊇f(Y)andX−f(X)⊆Y −f(Y)⊆L, (2) ifX⊆Y ⊆L, theng(X)⊇g(Y)andX−g(X)⊆Y −g(Y)⊆L.

Proof. Sinceg(X) =X−L^c, the results concerninggcan be deduced from those concerningf by taking the complement. Let us prove the statements involvingf. The first part of the lemma follows from a simple computation:X−f(X) =X−X−L⊆X−(X−L) =X∩L⊆L.

Suppose now thatX ⊇Y ⊇L. ThenX−L⊇Y −Land thusf(X) ⊇f(Y). Furthermore, X−Y ⊆X−L⊆X−L=f(X). Therefore, by Lemma5.1,X−f(X) =Y−f(X)⊆Y−f(Y).

Lemma 5.3. Let F₁ ⊇ F₂ ⊇ · · · ⊇ F_n be an n-approximation of L and, for 1 6 k 6 n, let Sk=F1−F2+ · · · ±Fk. Then, for16k6n,

(f(S_k) =f(F_k)ifkis odd

g(S_k^c) =g(F_k)ifkis even (5.2)

Proof. Ifk= 1, thenS₁ =F₁and the result is trivial. Suppose thatk >1. Ifkis odd,S_k−1 ⊆L and thus S_k−L = (S_k−1+F_k)−L = F_k −L. It follows that f(S_k) = f(F_k). Ifkis even, L⊆S_k−1and thusS_k^c∩L= (S_k−1^c +F_k)∩L=F_k∩L. Thereforeg(S_k^c) =g(F_k).

(8)

Define a sequence(L_n)_n>0 of subsets ofEbyL₀ =Eand, for alln>0, L_n+1=

(f(L_n) ifnis odd

g(Ln) ifnis even (5.3)

The next theorem expresses the fact that the sequence(L_n)_n>0 is the best approximation ofLas a Boolean combination of closed sets. In particular, ifLn=∅for somen >0, thenL∈ Bn−1(F).

Theorem 5.4. Let L be a subset of E. For every n > 0, the sequence (L_k)₁6k6n is the best n-approximation ofL.

Proof. We first show that the sequence(L_k)_16k6nis ann-approximation ofL. First, everyL_k is closed by construction. We show thatL_k+1 ⊆ L_kby induction onk. This is true fork = 0since L₀ =E. Now, ifkis even,L_k+1 =L_k∩L⊆L_k=L_kand ifkis odd,L_k+1 =L_k−L⊆L_k= L_k.

Set, fork > 0, Sk = L₁−L₂+ · · · ±Lk. By Lemma5.2, the relations L_2k−1 −L_2k = L_2k−1−f(L_2k−1)⊆Lhold for everyk >0, and similarly,L_2k−L_2k+1=L_2k−g(L_2k)⊆L^c. It follows thatS_2k ⊆L. FurthermoreS_2k+1^c = (L₀−L₁) + (L₂−L₃) + · · ·+ (L_2k−L_2k+1)⊆L^c and thusL⊆S_2k+1.

We now show that the sequence(L_k)_16k6nis the best approximation ofL. Let(L^′_k)_16k6nbe anothern-approximation ofL. Set, fork > 0, S_k^′ = L^′₁−L^′₂+ · · · ±L^′_k. Then, by definition, L⊆L^′₁and thus

S1 =L1 =L⊆L^′₁ =L^′₁ =S₁^′.

Let k be an even number and suppose by induction that S_k−1 ⊆ S_k−1^′ . Then, by definition of an approximation, S_k^′ = S_k−1^′ −L^′_k ⊆ L, and thus by Lemma5.1, S_k−1^′ −L ⊆ L^′_k. It follows f(S_k−1^′ ) =S_k−^′ ₁−L⊆L^′_k=L^′_k. Now, sinceS_k−1^′ ⊇S_k−1⊇L, Lemma5.2(1) shows that

S_k^′ =S_k−1^′ −L^′_k⊆S^′_k−1−f(S_k−1^′ )⊆S_k−1−f(S_k−1).

Now, by Lemma5.3,f(Sk−1) =f(Lk−1) =Lk, whenceS_k^′ ⊆Sk−1−Lk=Sk. Similarly,L⊆S_k+1^′ =S_k^′ +L^′_k+1 and thusS^′c_k−L^c⊆L^′_k+1. It follows that

g(S^′^c_k) =S^′c_k−L^c⊆L^′_k+1=L^′_k+1. Therefore, one gets by Lemma5.1,

S^′c_k+1=S^′c_k−L^′_k+1⊆S^′^c_k−g(S^′c_k)⊆S_k^c−g(S_k^c).

Now, by Lemma 5.3, g(S_k^c) = g(L_k) = L_k+1. Thus S^′c_k+1 ⊆ S_k^c −L_k+1 = S_k+1^c , whence S_k+1 ⊆S_k+1^′ .

WhenF is a set of subsets ofE closed under arbitrary intersection, Theorem5.4provides a characterization of the classesBn(F).

Corollary 5.5. LetLbe a subset ofE and letF be a set of subsets ofE closed under (possibly infinite) intersection and containing the empty set. Let(L_k)₁6k6nbe the bestn-approximation ofL with respect toF. ThenL∈ Bn−1(F)if and only ifL_n=∅and in this case

L=L1−L2+ · · · ±Ln−1 (5.4)

(9)

Proof. IfL∈ Bn−1(F), thenL=F₁−F2+· · · ±Fn−1withF₁, . . . , F_n−1 ∈ F. LetF_n=∅. Then the sequence(Fk)₁6k6nis ann-approximation ofL. Since(Lk)₁6k6nis the bestn-approximation ofL, one hasL=L₁−L₂+ · · · ±L_n−1. Thus, with the notation of Lemma5.3,

(f(L_n−1) =f(L) =∅ifn−1is odd

g(L_n−1) =g(L^c) =∅ifn−1is even (5.5) Therefore,Ln=∅by (5.3).

Conversely, suppose thatL_n=∅. Ifn= 2k, then

(L₁−L₂) + · · · + (L_2k−1−L_2k)⊆L⊆(L₁−L₂) + · · · + (L_2k−3−L_2k−2) +L_2k−1 Ifn= 2k+ 1, then

(L₁−L₂) + · · · + (L_2k−1−L_2k)⊆L⊆(L₁−L₂) + · · · + (L_2k−1−L_2k) +L_2k+1 In both cases, one getsL=L₁−L₂+ · · · ±L_n−1and thusL∈ Bn−1(F).

Let us illustrate this corollary by a concrete example.

Example 5.1. Let A = {a, b, c} and letL be the lattice of shuffle ideals. If L is the language {1, a, b, c, ab, bc, abc}, a straightforward computation gives

L₀ =A^∗

L1 =g(L0) =A^∗ xxy(L0∩L) =A^∗xxyL=A^∗

L₂ =f(L₁) =A^∗xxy(L₁−L) =A^∗xxy{aa, ac, ba, bb, ca, cb, cc}

L₃ =g(L₂) =A^∗ xxy(L₂∩L) =A^∗xxyabc

L₄ =f(L₃) =A^∗xxy(L₃−L) =A^∗xxy{aabc, abac, abca, babc, abbc, abcb, cabc, acbc, abcc}

L5 =g(L4) =A^∗ xxy(L4∩L) =∅

It follows thatL=L₁−L₂+L₃−L₄andL∈ B₄(L), butL /∈ B₃(L).

It is also possible to use the approximation algorithm for a setLof subsets ofEclosed under (possibly infinite) union and containing the setE. In this case, the set

L^c ={L^c |L∈ L}

is closed under (possibly infinite) intersection and contains the empty set. Consequently, the approximation algorithm can be applied toL^c but it describes the difference hierarchy Bn(L^c). To recover the difference hierarchyBn(L), the following algorithm can be used. First compute the best L^c-approximation of even length ofLand the bestL^c-approximation of odd length ofL^c, say

L=L^c₁−L^c₂+ · · · ±L^c_n (5.6) L^c =F₁^c−F₂^c+ · · · ±F_m^c (5.7) withneven,modd,L_i, F_i ∈ LandL_nandF_mpossibly empty to fill the parity requirements. Now Ladmits the followingL-decompositions, whereL₁andF₁ are possibly empty (and consequently deleted):

L=L_n−L_n−1+ · · · ±L₁ (5.8)

=Fm−Fm−1+ · · · ±F₁ (5.9) It remains to take the shortest of the two expressions to get the bestL-approximation ofL.

(10)

6. DECIDABILITY QUESTIONS ON REGULAR LANGUAGES

Given a lattice of regular languagesL, four decidability questions arise:

Question 1. Is the membership problem forLdecidable?

Question 2. Is the membership problem forB(L)decidable?

Question 3. For a given positive integern, is the membership problem forBn(L)decidable?

Question 4. Is the hierarchyBn(L)decidable?

Indeed, given a regular languageL, Question1asks to decide whetherL∈ L, Question2whether L ∈ B(L) and Question3 whetherL ∈ Bn(L). Question 4asks whether on can one effectively compute the smallest n such that L ∈ Bn(L), if it exists. Note that if Questions 2 and 3 are decidable, then so is Question4. Indeed, given a language L, one first decides whetherLbelongs toB(L)by Question2. If the answer is positive, this ensures thatLbelongs toBn(L)for somen and Question3allows one to find the smallest suchn.

If the latticeLis finite, it is easy to solve the four questions in a positive way. In some cases, a simple application of Corollary5.5 suffices to solve Question3immediately. One just needs to find the appropriate closure operator and to provide algorithms to compute the functionsf(X)and g(X)defined by (5.1).

Example 6.1. LetLbe the lattice generated by the languages of the formB^∗, whereB ⊆A. Then bothLandB(L)are finite. It is known that a regular language belongs toLif and only if its ordered syntactic monoid is idempotent and commutative and satisfies the inequation1 6xfor allx[20].

It belongs toB(L)if and only if its syntactic monoid is idempotent and commutative.

Finally, one can define a closure operator by setting L = B^∗, where B is the set of letters occurring in some word ofL. For instance, letL= ({a, b, c}^∗− {b, c}^∗) + ({a, b}^∗−a^∗) + 1. This language belongs toB(L)and its minimal automaton is represented below:

1

2

3 c

a b a

b, c

a, b, c

Applying the approximation algorithm of Section5, one getsL₀ ={a, b, c}^∗,L₁ ={b, c}^∗,L₂ = b^∗andL₃ =∅and thusL={a, b, c}^∗− {b, c}^∗+b^∗is the best3-approximation ofL.

If the lattice is infinite, our four questions become usually much harder, but can still be solved in some particular cases. But let us first present a powerful tool introduced in [5], chains in ordered monoids.

(11)

7. CHAINS AND DIFFERENCE HIERARCHIES

Chains can be defined on any ordered set. We first give their definition, then establish a connection with difference hierarchies.

Definition 7.1. Let(E,6)be a partially ordered set and letXbe a subset ofE. AchainofEis a strictly increasing sequence

x₀< x₁ < . . . < x_m−1

of elements ofE. It is called anX-chainifx0 is inXand thexi’s are alternatively elements ofX and of its complementX^c. The integermis called thelengthof the chain. We letm(X)denote the maximal length of anX-chain.

There is a subtle connection between chains and difference hierarchies of regular languages. LetM be a finite ordered monoid and letϕ:A^∗→M be a surjective monoid morphism. Let

L={ϕ⁻¹(U)|U is an upper set ofM}

By definition, every language ofLis recognised by the ordered monoidM.

Theorem 7.1. If there exists a subset P ofM such thatL = ϕ⁻¹(P) and m(P) 6 n, then L belongs toBn(L).

Before starting the proof, let us clarify a delicate point. The conditionL =ϕ⁻¹(P)means thatL is recognised by themonoidM. It does not mean thatLis recognised by theordered monoidM, a property which would requireP to be an upper set.

Proof. For eachs∈M, letm(P, s)be the maximal length of aP-chain ending withs. Finally, let, for eachk >0,

U_k ={s∈M |m(P, s)>k}

We claim that U_k is an upper set of M. Indeed, ifs ∈ U_k, there exists aP-chain x₀ < x₁ <

· · · < xr−1 = sof length r > k. Lettbe an element of M such that s 6 t. If sand tare not simultaneously inP, thenx₀ < x₁ < · · · < x_r−1 < tis aP-chain of lengthr+ 1>k. Otherwise, x0 < x1 < · · · < xr−2 < tis aP-chain of lengthr >k. Thusm(P, t)>k, andt∈Uk, proving the claim.

We now show that

P =U₁−U₂+U₃−U₄· · · ±Un (7.1) First observe thats∈P if and only ifm(P, s)is odd. Sincem(P)6n, one hasm(P, s) 6nfor everys∈Mand thusU_n+1 =∅. Formula (7.1) follows, since for eachr>0,

{s∈M |m(P, s) =r}=U_r−U_r+1.

Let, for16i6n,Li =ϕ⁻¹(Ui). SinceUi is an upper set, eachLibelongs toL. Moreover, one gets from (7.1) the formula

L=L₁−L₂+L₃· · · ±L_n (7.2) which shows thatL∈ Bn(L).

(12)

We now establish a partial converse to Theorem7.1. A lattice of regular languages is a setLof regular languages ofA^∗ containing ∅andA^∗ and closed under finite union and finite intersection.

LetLbe a lattice of regular languages ofA^∗.

Theorem 7.2. LetLbe a lattice of regular languages. If a languageLbelongs toBn(L), then there exist an ordered stampη:A^∗→M and a subsetP ofMsatisfying the following conditions:

(1) ϕis a restricted product of syntactic ordered stamps of members ofL, (2) L=η⁻¹(P),

(3) m(P)6n.

Proof. IfL∈ Bn(L), then

L=L1−L2+L3 · · · ±Ln

withL₁ ⊇L₂ ⊇ · · · ⊇L_nandL_i ∈ L. Letη_i :A^∗ →(M_i,6_i)be the syntactic morphism ofL_i and letPi = ηi(Li). Then eachPi is an upper set ofMi and Li = η⁻¹_i (Pi). Letη :A^∗ → M be the restricted product of the stampsηi. Condition (1) is satisfied by construction.

Observe that ifη(u) = (s₁, . . . , s_n)is an element ofM, the conditions_i+1 ∈P_i+1is equivalent withu ∈ L_i+1, and sinceL_i+1 is a subset ofLi, this condition also impliesu ∈ Li and si ∈ Pi. Consequently, for each elements= (s₁, . . . , s_n)ofM, there exists a uniquek∈ {0, . . . , n}such that

s₁ ∈P₁, . . . , s_k∈P_k, s_k+1∈/ P_k+1, . . . , s_n∈/ P_n This uniquekis called thecutofs. Setting

P ={s∈M |the cut ofsis odd}

one gets

η⁻¹(P) = [

kodd

(L₁∩ · · · ∩L_k)−L_k+1

= [

kodd

(L_k−L_k+1) =L (7.3) which proves (2).

Let nowx₀ < x₁ < · · · < x_m−1 be aP-chain. Let, for0 6i6m−1,x_i = (s_i,1, . . . , s_i,n) and letki be the cut ofxi. We claim that k_i+1 > ki. Indeed, sincexi < x_i+1,s_i,k_i 6_i s_i+1,k_iand sinceP_i is an upper set,s_i,k_i ∈P_i impliess_i+1,k_i ∈P_i+1, which proves thatk_i+1 >k_i. But since xiandx_i+1are not simultaneously inP, their cuts must be different, which proves the claim. Since x₀ ∈P, the cut ofx₀is odd, and in particular, non-zero. It follows that0< k₀ < k₁ < · · · < k_m−1 and since the cuts are numbers between0andn,m6n, which proves (3).

It is tempting to try to improve Theorem 7.2 by taking for M the syntactic morphism of L and forϕthe syntactic morphism ofL. However, Example 5.1 ruins this hope. Indeed, let F = {1, a, b, c, ab, bc, abc} be the set of factors of the wordabc. Then the syntactic monoid ofLcan be defined as the setF ∪ {0}equipped with the product defined by

xy =

(xy ifx,yandxyare all inF 0 otherwise

Now the syntactic image ofLis equal toF. It follows thatM−F ={0}and thus, whatever order is taken on M, the length of a chain is bounded by 3. Nevertheless, ifL is the lattice of shuffle ideals, thenLdoes not belong toB3(L).

Therefore, ifLis a regular language, the maximal length of an L-chain cannot be in general computed in the syntactic monoid of L. It follows that decidability questions onBn(L), as presented in Section6below, cannot in general be solved just by inspecting the syntactic monoid. An exceptional case where the syntactic monoid suffices is presented in the next section.

(13)

8. THE DIFFERENCE HIERARCHY OF THE POLYNOMIAL CLOSURE OF A LATTICE

A languageLofA^∗ is amarked productof the languagesL₀, L₁, . . . , L_nif L=L₀a₁L₁· · ·anLn

for some lettersa₁, . . . , anofA. Given a setLof languages, thepolynomial closureofLis the set of languages that are finite unions of marked products of languages ofL. Thepolynomial closureofL is denoted Pol Land the Boolean closure of Pol Lis denotedBPolL. Finally, let co-Pol Ldenote the set of complements of languages in PolL. In this section, we are interested in the difference hierarchy induced by PolL. We consider several examples.

8.1. Shuffle ideals. If L = {∅, A^∗}, then PolL is exactly the set of shuffle ideals considered in Examples4.5and6.1andBPolLis the class ofpiecewise testable languages. The following easy result was mentioned in [20].

Proposition 8.1. A language is a shuffle ideal if and only if its syntactic ordered monoidMsatisfies the inequation16xfor allx∈M.

The syntactic characterization of piecewise testable languages follows from a much deeper result of Simon [27].

Theorem 8.2. A language is piecewise testable if and only if its syntactic monoid isJ-trivial.

Note that the closed sets of the closure operatorX→X xxyA^∗of Example4.5are exactly the shuffle ideals. It follows that for the latticeLof shuffle ideals, the four questions mentioned earlier have a positive answer. More precisely, the decidability of the membership problem forLand for B(L)follows from Proposition8.1and Theorem 8.2, respectively. The decidability of Question3 (and hence of Question4) follows from the approximation algorithm. See Example5.1.

8.2. Group languages. Recall that agroup language is a language whose syntactic monoid is a group, or, equivalently, is recognized by a finite deterministic automaton in which each letter defines a permutation of the set of states. According to the definition of a polynomial closure, apolynomial of group languagesis a finite union of languages of the formL₀a₁L₁· · ·a_kL_kwherea₁, . . . , a_kare letters andL₀, . . . , L_kare group languages.

LetdGbe the metric onA^∗defined as follows:

rG(u, v) = min{|M| |M is a finite group that separatesuandv}

dG(u, v) = 2^−r^G^(u,v)

It is also known that the closure of a regular language fordGis again regular and can be effectively computed. This result was actually proved in two steps: it was first reduced to a group-theoretic conjecture in [22] and this conjecture became a theorem in [25].

LetGbe the set of group languages onA^∗and let Pol Gbe the polynomial closure ofG. We also let co-PolGdenote the set of complements of languages of Pol G. The following characterization of Pol Gwas given in [17].

Theorem 8.3. LetLbe a regular language and letMbe its ordered syntactic monoid. The following conditions are equivalent:

(1) L∈Pol G,

(2) Lis open in the pro-group topology onA^∗,

(14)

(3) for allx∈M,16x^ω.

Theorem8.3shows that Pol Gis decidable. The corresponding result forBPolGhas a long story, related in detail in [19], where several other characterizations can be found.

Theorem 8.4. Let L be a regular language and let M be its syntactic monoid. The following conditions are equivalent:

(1) L∈BPolG,

(2) the submonoid generated by the idempotents ofMisJ-trivial,

(3) for all idempotentse,fofM, the conditionsef e=eimpliesef =e=f e.

Theorem 8.4 shows thatBPolG is decidable. Now, Theorem 8.3 shows that a regular language belongs to co-Pol G if and only if it is closed in the pro-group topology on A^∗. It follows that co-PolGis closed under arbitrary intersections and the operation associating to a regular language ofA^∗ its closure in the pro-group topology is a closure operator. As we have seen, the closure of a regular language is regular and can be effectively computed. It follows that the algorithm described at the end of Section5can be applied to get our last corollary:

Corollary 8.5. The difference hierarchyBn(Pol G)is decidable.

9. CYCLIC AND STRONGLY CYCLIC REGULAR LANGUAGES

Cyclic and strongly cyclic regular languages are two classes of regular languages related to symbolic dynamic and first studied in [1]. It was shown in [5] that an appropriate notion of chains suffices to characterise the difference hierarchy based on the class of strongly cyclic regular languages. This contrasts with Section7, in which the general results on chain did not lead to a full characterization of difference hierarchies.

LetA = (Q, A,·)be a finite (possibly incomplete) deterministic automaton. A word u sta- bilises a subsetP ofQifP·u = P. Given a subset P ofQ, letStab(P)be the set of all words that stabiliseP. The languageStab(A)that stabilisesAis by definition the set of all words which stabilise at least one nonempty subset ofQ.

Definition 9.1. A language isstrongly cyclicif it stabilises some finite deterministic automaton.

Example 9.1. IfAis the automaton represented in Figure2, then

Stab({1}) = (b+aa)^∗, Stab({2}) = (ab^∗a)^∗, Stab({1,2}) =a^∗ andStab(A) = (b+aa)^∗+ (ab^∗a)^∗+a^∗.

1 2

a

a b

Figure 2: The automatonA.

One can show that the set of strongly cyclic languages of A^∗ form a lattice of languages but are not closed under quotients. For instance, as shown in Example9.1, the languageL= (b+aa)^∗+ (ab^∗a)^∗+a^∗ is strongly cyclic, but Corollary9.5will show that its quotientb⁻¹L = (b+aa)^∗ is not strongly cyclic, sinceaa∈(b+aa)^∗buta /∈(b+aa)^∗.

We will also need the following characterization [1, Proposition 7]:

(15)

Proposition 9.1. LetA= (Q, A, E)be a deterministic automaton. A wordubelongs toStab(A) if and only if there is some stateqofAsuch that for every integern, the transitionq·uⁿexists.

Strongly cyclic languages admit the following syntactic characterization [1, Theorem 8]. As usual, s^ωdenotes the idempotent power ofs, which exists and is unique in any finite monoid.

Proposition 9.2. LetLbe a non-full regular language. The following conditions are equivalent:

(1) Lis strongly cyclic,

(2) there is a morphismϕfromA^∗onto a finite monoidM with zero such that L=ϕ⁻¹({s∈M |s^ω6= 0}),

(3) the syntactic monoid M of L has a zero and its syntactic image is the set of all elements s∈Msuch thats^ω 6= 0.

Proposition9.2leads to a simple syntactic characterization of strongly cyclic languages. Recall that a language ofA^∗ isnondenseif there exists a wordu∈A^∗such thatL∩A^∗uA^∗ =∅.

Proposition 9.3. Let L be a regular language, let M be its syntactic monoid and let P be its syntactic image. ThenLis strongly cyclic if and only if it satisfies the following conditions, for all u, x, v∈M:

(S1) ux^ωv∈P impliesx^ω ∈P, (S₂) x^ω ∈P if and only ifx∈P.

Furthermore, if these conditions are satisfied and ifLis not the full language, thenLis nondense.

Proof. LetLbe a strongly cyclic language, letM be its syntactic monoid and letP be its syntactic image. IfLis the full language, then the conditions (S₁)and(S₂)are trivially satisfied. IfLis not the full language, then Proposition9.2shows thatM has a zero and thatP ={s∈ M |s^ω 6= 0}.

Observing thatx^ω = (x^ω)^ω, one gets

x∈P ⇐⇒x^ω 6= 0⇐⇒(x^ω)^ω 6= 0⇐⇒x^ω∈P which proves(S₂). Similarly, one gets

ux^ωv∈P ⇐⇒ (ux^ωv)^ω 6= 0 =⇒ x^ω6= 0 ⇐⇒ x∈P which proves(S₁).

Conversely, suppose that L satisfies (S₁) and (S₂). If L is full, then L is strongly cyclic.

Otherwise, letz /∈P. Thenz^ω ∈/ P by(S₁)anduz^ωv /∈P for allu, v ∈M by(S₂). This means thatzis a zero ofM and that0∈/ P. By Proposition9.2, it remains to prove thatx∈Pif and only ifx^ω 6= 0. First, ifx ∈ P, thenx^ω ∈P by(S₂)and since0 ∈/ P, one has x^ω 6= 0. Conversely, ifx^ω 6= 0, then ux^ωv ∈ P for someu, v ∈ M, since x^ω is not equivalent to0 in the syntactic congruence ofP. It follows thatx^ω ∈P by(S₁)andx∈Pby(S₂).

We turn now to cyclic languages.

Definition 9.2. A subset of a monoid is said to be cyclicif it is closed under conjugation, power and root. Thus a subsetP of a monoidM is cyclic if it satisfies the following conditions, for all u, v ∈M andn >0:

(C₁) uⁿ∈P if and only ifu∈P, (C₂) uv∈P if and only ifvu∈P.

This definition applies in particular to the case of a language ofA^∗.

Example 9.2. IfA={a, b}, the languageb^∗and its complementA^∗aA^∗are cyclic.

(16)

One can show that regular cyclic languages are closed under inverses of morphisms and under Boolean operations but not under quotients. For instance, the language L = {abc, bca, cab} is cyclic, but its quotient a⁻¹L = {bc} is not cyclic. Thus regular cyclic languages do not form a variety of languages. However, they admit the following straightforward characterization in terms of monoids.

Proposition 9.4. LetLbe a regular language ofA^∗, letϕbe a surjective morphism fromA^∗to a finite monoidM recognisingLand letP =ϕ(L). ThenLis cyclic if and only ifPis cyclic.

Corollary 9.5. Every strongly cyclic language is cyclic.

Proof. LetLbe a strongly cyclic language, letM be its syntactic monoid and letP be its syntactic image. By Proposition9.3,P satisfies(S₁)and(S₂). It suffices now to prove that it satisfies(C₂).

The sequence of implications

xy∈P ⇐⇒^(S²⁾ (xy)^ω ∈P ⇐⇒ (xy)^ω(xy)^ω∈P ⇐⇒ (xy)^ω−1xy(xy)^ω−1xy∈P

⇐⇒ ((xy)^ω−1x)(yx)^ωy∈P =^(S⇒¹⁾ (yx)^ω ∈P ⇐⇒^(S²⁾ yx∈P.

shows thatxy ∈P impliesyx∈P and the opposite implication follows by symmetry.

Proposition9.2implies that every strongly cyclic language is cyclic. Actually, for any regular cyclic language, there is a smallest strongly cyclic language containing it [5, Theorem 2].

Proposition 9.6. LetLbe a regular cyclic language ofA^∗, letη:A^∗ → M be its syntactic stamp and letP =η(L). ThereM has a zero and the language

L=

(η⁻¹({s|s^ω 6= 0}) if0∈/P,

A^∗ otherwise.

is the smallest strongly cyclic language containingL.

Proof. If0 ∈/ P, then the language L is strongly cyclic by Proposition9.2. Morevover, sinceL is cyclic, P is cyclic by Proposition9.4. It follows that ifs ∈ P, thens^ω ∈ P and in particular s^ω6= 0. Consequently,LcontainsL.

It remains to prove that Lis the smallest strongly cyclic language containing L. LetX be a strongly cyclic language containing Land let u be a word ofL. LetA = (Q, A, E) be a deterministic automaton such thatX = Stab(A). Settings=η(u), one hass^ω 6= 0by definition ofL.

Consequently, η(s)ⁿ6= 0for every integernand there are two wordsx_nand y_nsuch thatx_nuⁿy_n belongs toL. By Proposition 9.1, there is a state qn ofAsuch that the transition qn·xnuⁿynis defined. The transition(q_n·x_n)·uⁿis thus defined for everynand by Proposition 9.1again, the wordubelongs toX. ThusL⊆Xas required.

Suppose now that0 ∈ P and letz be a word ofLsuch thatη(z) = 0. LetX be a strongly cyclic language containing L. IfX is not full, then X is nondense by Proposition 9.3 and there exists a wordu∈A^∗such thatA^∗uA^∗∩X=∅. SinceXcontainsL, one also getsA^∗uA^∗∩L=∅ and in particularzu /∈L. But this yieds a contradiction, sinceη(zu) =η(z)η(u) = 0∈Pand thus zu∈η⁻¹(P) =L. Thus the only strongly cyclic language containingLisA^∗.

Given a finite monoidM, the Green’s preorder relation6J defined onM by

s6_J tif and only ifs∈M tM, or equivalently, if there existsu, v∈M such thats=utv is a preorder onM. The associated equivalence relationJ is defined by

sJ tifs6_J tandt6_J s, or equivalently, ifM sM =M tM.

(17)

Corollary 9.7. LetLbe a regular cyclic language ofA^∗, letη:A^∗ →Mbe its syntactic stamp and letP =η(L). ThenLis strongly cyclic if and only if for all idempotentse, f ofM, the conditions e∈Pande6_J f implyf ∈P.

Proof. Suppose thatLis strongly cyclic and lete, f be two idempotents ofM such thate∈Pand e6J f. Letu, v ∈M be such thate=uf v. Sincef^ω =f, one getsuf^ωv ∈P and thusf ∈P by Condition(S₁)of Proposition9.3.

In the opposite direction, suppose that for all idempotents e, f of M, the conditions e ∈ P and e 6_J f imply f ∈ P. Since L is cyclic, it satisfies (C₁) and hence (S₂). We claim that it also satisfies (S₁). Indeed, ux^ωv ∈ P implies (ux^ωv)^ω ∈ P by (S₂). Furthermore, since (ux^ωv)^ω 6J x^ω, one also hasx^ω ∈P, and finallyx∈P by(S₂), which proves the claim.

The precise connection between cyclic and strongly cyclic languages was given in [1].

Theorem 9.8. A regular language is cyclic if and only if it is a Boolean combination of regular strongly cyclic languages.

Theorem9.8motivates a detailed study of the difference hierarchy of the classS of strongly cyclic languages. This study relies on a careful analysis of the chains on the set of idempotents of a finite monoid, pre-ordered by the relation6_J.

Definition 9.3. AP-chain of idempotentsis a sequence (e₀, e₁, . . . , em−1)of idempotents ofM such that

e₀6J e₁6J · · · 6J em−1

e0 ∈ P and, for0 < i < m,ei ∈ P if and only ifei−1 ∈/ P. The integerm is the length of the P-chain of idempotents.

We let ℓ(M, P) denote the maximal length of a P-chain of idempotents ofM. We consider in particular the case where ϕ : A^∗ → M is a stamp recognising a regular languageL ofA^∗ and P =ϕ(L). The next theorem shows that in this case,ℓ(M, P)does not depend on the choice of the stamp recognisingL, but only depends onL.

Theorem 9.9. LetLbe a regular language. Letϕ : A^∗ → M and ψ : A^∗ → N be two stamps recognisingL. IfP =ϕ(L)andQ=ψ(L), thenℓ(M, P) =ℓ(N, Q).

Proof. It is sufficient to prove the result whenϕis the syntactic stamp ofL. Since the morphismψ is surjective, M is a quotient of N and there is a surjective morphism π : N → M such that π◦ψ=ϕ. It follows that

π(Q) =Pandπ⁻¹(P) =Q. (9.1) We show that to anyP-chain of idempotents inN, one can associate aQ-chain of idempotents of the same length inMand vice-versa.

Let(e₀, . . . , e_m−1)be aQ-chain of idempotents inN and letf_i =π(e_i)for06i6 m−1.

Since every monoid morphism preserves 6_J, the relations (9.1) show that(f0, . . . , fm−1)is aP- chain of idempotents inM.

Let now(f₀, . . . , f_m−1)be aP-chain of idempotents inM. Sincef_i−1 6_J f_i, there exist for 16 i6m−1elementsui, vi ofM such thatuifivi =f_i−1. Let us choose an idempotente_m−1 such thatπ(e_m−1) =f_m−1 and some elementss_iandt_iofN such thatπ(s_i) =u_i andπ(t_i) =v_i. We now define a sequence of idempotents(e0, . . . , em−1)ofN by setting

e_m−2= (s_m−1e_m−1t_m−1)^ω e_m−3= (s_m−2e_m−2t_m−2)^ω · · · e₀= (s₁e₁t₁)^ω