• Aucun résultat trouvé

Automata and forbidden words

N/A
N/A
Protected

Academic year: 2022

Partager "Automata and forbidden words"

Copied!
12
0
0

Texte intégral

(1)

HAL Id: hal-00619578

https://hal-upec-upem.archives-ouvertes.fr/hal-00619578

Submitted on 13 Feb 2013

HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires

Automata and forbidden words

Maxime Crochemore, Filippo Mignosi, Antonio Restivo

To cite this version:

Maxime Crochemore, Filippo Mignosi, Antonio Restivo. Automata and forbidden words. Informa- tion Processing Letters, Elsevier, 1998, 67 (3), pp.111-117. �10.1016/S0020-0190(98)00104-5�. �hal- 00619578�

(2)

Automata and Forbidden Words

M. Crochemorey F. Mignosiz A. Restivox

Octob er 28, 1998

Abstract

Let L(M) be the (factorial) language avoiding a given anti-factorial language M. We design an automaton accepting L(M) and built from the languageM. The construction is eective ifM is nite.

If M is the set of minimal forbidden words of a single word v, the automaton turns out to be the factor automaton ofv (the minimal au- tomaton accepting the set of factors ofv).

We also give an algorithm that builds the trie ofM from the factor automaton of a single word. It yields a non-trivial upper bound on the number of minimal forbidden words of a word.

Keywords: formal languages, factorial language, anti-factorial language, factor code, factor automaton, forbidden word, avoiding a word, failure function.

1 Introduction

LetLAbe afactoriallanguage,i.e., a language containing all factors of its words. A wordw2Ais called aminimal forbidden wordforLifw2=Land all proper factors ofwbelong toL. We denote byMF(L) the language of minimal forbidden words forL.

The study of combinatorial properties ofMF(L) helps investigate the struc- ture of the languageLor of the system it describes. For instance, locally testable factorial languages (cf [8]) are characterized by the fact that the corresponding languages of minimal forbidden words are nite. In the context of Symbolic Dynamics they correspond to systems of nite type.

Another example is given by a language L that is the set of factors of an innite word: in this case, as shown in [2], the elements ofMF(L) are closely related to thebispecialfactors (cf. [6], [7] and [3]) of the innite word.

A preliminary version of this work has been accepted to MFCS'98

yInstitut Gaspard-Monge,[email protected]. Work by this author is supported in part by Programme \Genomes" of C.N.R.S.

zUniversita di Palermo,[email protected].

xUniversita di Palermo,[email protected].

(3)

A measure of complexity of the languageLis introduced in [2] based on the functionFL, that counts, for anyn, the number of words of lengthninMF(L).

Authors prove that the growth ofFL(n) as well as the topological entropy of

MF(L) are topological invariants of the dynamical system dened by L. This result provides a usefull tool to show that some systems are not isomorphic, which comes in addition to other notions like the ordinary notion of entropy and the zeta function, for example.

Finally, [5] considers properties of languages dened by nite forbidden sets of words. Authors dene the Mobius function for these languages.

In this paper we focus on the transformations between Land MF(L). We rst design an automaton acceptingLand that is built from the languageM =

MF(L). When M is a nite set the transformation is eective. Moreover, if

M is given by its digital tree, that is, its tree-like deterministic automaton, the algorithm is very similar to the algorithm of Aho and Corasick that builds a pattern-matching machine for a nite set of words [1].

In a second part we consider the particular situation of a language that is the set of factors of a single wordv. The construction of itsfactor automaton, the minimal deterministic automaton accepting the factors ofv (see [4]) is known to be rather intricate. It is remarkable that the preceding transformation yields exactly the factor automaton of v when the input if the set M of minimal forbidden words of v. We also give an algorithm that realizes the converse transformation, building the trie of M from the factor automaton of v. A corollary of the algorithmis a non-trivial upper bound on the number of minimal forbidden words of a word.

The complexities of algorithms described in this paper are all linear in the size of their input or output. Therefore, the design of possible faster algorithms relies on dierent representations of objects, which is not the aim of the paper.

2 Avoiding an anti-factorial language

Let A be a nite alphabet and A be the set of nite words drawn from the alphabet A, the empty word included. Let L A be a factorial language, i.e. a language satisfying: 8u;v 2A uv 2 L=) u;v 2 L. The complement languageLc =AnLis a (two-sided) ideal of A. Denote byMF(L) the base of this ideal, we haveLc =AMF(L)A.

The set MF(L) is called the set ofminimal forbidden wordsfor L. A word

v2A

is forbidden for the factorial languageL ifv2=L, which is equivalent to say thatvoccurs in no word ofL. In addition,v is minimal if it has no proper factor that is forbidden.

One can note that the set MF(L) uniquely characterizes L, just because

L=AnAMF(L)A: (1) The following simple observation provides a basic characterization of minimal

(4)

forbidden words.

Remark 1

A word v = a1a2an belongs to MF(L) i the two conditions hold:

v is forbidden, (i.e.,v2=L),

both a1a2an 12L and a2a3an 2L (the prex and the sux ofv of length n 1belong toL).

The remark translates into the equality:

MF(L) =AL\LA\(AnL): (2) As a consequence of both equalities (1) and (2) we get the following proposition.

Proposition 1

For a factorial languageL, languagesLand MF(L)are simul- taneously rational, that is,L2Rat(A)i MF(L)2Rat(A).

The set MF(L) is ananti-factorial languageor a factor code, which means that it satises: 8u;v 2MF(L)u6=v=)uis not a factor ofv, property that comes from the minimality of words ofMF(L).

We introduce a few more denitions.

Denition 1

A wordv2A avoids the setM,M A, if no word ofM is a factor of v, (i.e., if v 2=AMA). A language L avoidsM if every word of L avoids M.

From the denition ofMF(L), it readily comes thatLis the largest (accord- ing to the subset relation) factorial language that avoidsMF(L). This shows that for any anti-factorial languageM there exists a unique factorial language

L(M) for whichM = MF(L). The next remark summarizes the relation be- tween factorial and anti-factorial languages.

Remark 2

There is a one-to-one correspondence between factorial and anti- factorial languages. If L and M are factorial and anti-factorial languages re- spectively, both equalities hold: MF(L(M)) =M and L(MF(L)) =L.

We also refer to the next denition that is to be considered in the context of dynamical systems (see [9] for example).

Denition 2

The factorial languageLis said to beof nite typewhen MF(L) is nite.

Finally, with an anti-factorial nite languageM we associate the nite au- tomatonA(M) as described below. The automaton is deterministic and com- plete, and, as shown at the end of the section by Theorem 3, the automaton accepts the languageL(M).

The automatonA(M) is the tuple (Q;A;i;T;F) where

(5)

the setQof states isfwjw is a prex of a word inMg,

Ais the current alphabet,

the initial statei is the empty word,

the setT of terminal states isQnM.

States ofA(M) that are words ofM are sink states. The setF of transitions is partitioned into the three (pairwise disjoint) setsF1,F2, andF3 dened by:

F

1=f(u;a;ua)jua2Q;a2Ag(forward edges or tree edges),

F

2 = f(u;a;v)j u2 QnM;a 2 A;ua2= Q;v longest sux ofuainQg (backward edges),

F

3=f(u;a;u)ju2M;a2Ag(loops on sink states).

The transition function dened by the setF of arcs ofA(M) is noted.

Remark 3

One can easily prove from denitions that

1. if q2Qn(M[fg), all transitions arriving on stateq are labeled by the same lettera2A,

2. from any state q2Q we can reach a sink state, i.e.,qcan be extended to a word of M.

Denition 3

For anyv2A,qv denotes the state(;v), target of the unique path inA(M) starting at the initial state and labeled byv.

SinceA(M) is a complete automaton,qvis always dened. In the automaton

A(M) states are words, but to avoid misunderstandings we sometimes write \the word corresponding toqv" instead of just \the wordqv".

Remark 4

Note that ifv is a state of A(M) we haveqv=v.

We are now ready to state and prove the next lemma that is used in the proof of Theorem 3, the main result of the section.

Lemma 2

LetM be an anti-factorial language and considerA(M). Letv2A be such that, for any proper prexuofv,quis not a sink state (qu2=M). Then,

1. the wordqv is a sux ofv,

2. qv is the longest sux ofv that is also a state of A(M) (or 8q2Qq sux ofv=)q sux ofqv).

Proof.

By induction onjv j.

Base of the induction, jv j = 0. Then, v = =qv and points 1 and 2 are trivially satised.

Inductive step jv j >0. We can write v =ua;a2 A; hence, qv = (qu;a) or equivalently (qu;a;qv)2 F. By induction, qu is a sux of u and, ifq 2 Q is a sux ofu, q is also a sux ofqu. By hypothesis, the transition (qu;a;qv) cannot belong toF3 becausequ is not a sink state. We have two cases:

(6)

(i) (qu;a;qv)2F1, (ii) (qu;a;qv)2F2.

In case (i) condition 1 readily comes from the inductive hypothesis because

q

v = (qu;a). Let us suppose that q is a state sux of v. If q = then 2 is trivially satised; otherwise, if q 6= , q = (q0;a) for some state q0 2 Q. Since v =ua, q0 is a sux of uand, by induction, q0 is a sux of qu. Since

q

v =(qu;a) andq(=(q0;a)) is a sux ofqv, which proves that 2 is satised.

In case (ii), since by denitionqvis a sux of(qu;a), and since by inductionqu is a sux ofu,qv is a sux ofua=vand 1 holds. Ifq2Qis a sux ofvwith

q6=(otherwise 2 trivially holds), then q=q0afor someq0 sux ofu, and by inductionq0is a sux ofquand moreoverqis a sux of(qu;a). By denition

q

v is the longest sux of(qu;a) that is also a state, and consequently q is a

sux ofqv. ./

Denoting byLang(A) the language accepted by an automatonA, we get the main theorem of the section.

Theorem 3

For anyanti-factorial languageM,Lang(A(M)) =L(M).

Proof.

We rst prove L(M) Lang(A(M)). We have to show that if v is a word that avoids M then v 2 Lang(A(M)). Assume ab absurdo that

v2=Lang(A(M)); therefore qv is a sink state. Letube the shortest prex ofv for whichquis a sink state (note thatqu=qv). By lemma 2 statement 1,quis a sux ofu, but qv is by denition an element ofM, and sov does not avoid

M, a contradiction.

We then proveLang(A(M))L(M). Letv2Lang(A(M)). Let us suppose

ab absurdothat v does not avoidM,i.e., v=uw z for somew2M;u;z2A. We chooseuw as the shortest prex of v that belongs to AM. Since w2M it is by denition a state ofA(M); sincew is a state that is a sux ofuw, by Lemma 2 statement 2,w is a sux ofquw. But quw, which is by denition a state ofA(M), is a prex of an elementw0 ofM (note thatw0 is not empty).

Sincewis a sux of a prex ofw0,wis a factor ofw0, a contradiction because

M is anti-factorial. ./

The above denition of A(M) turns into the algorithm below, called L- automaton, that builds the automaton from a nite anti-factorial set of words.

The input is the trieT that representsM. It is a tree-like automaton accepting the setMand, as such, it is noted (Q;A;i;T;0). The procedure can be adapted to test whether T represents an anti-factorial set, or even to generate the trie of the anti-factorial language associated with a set of words.

In view of Equality 1, the design of the algorithm remains to adapt the construction of a pattern matching machine (see [1] or [4]). The algorithm uses a function f called a failure function and dened on states of T as follows.

States of the trieT are identied with the prexes of words inM. For a state

au(a2A,u2A),f(au) is0(i;u), quantity that may happen to be uitself.

Note that f(i) is undened, which justies a specic treatment of the initial state in the algorithm.

(7)

-

m

m

m m

m

0

1

2 3

4

a

@

@

@

@ R b

- a

- b

a

- b

- b

Figure 1: Trie of the factor codefaa;bbab;bbbgon the alphabetfa;bg. Squares represent terminal states.

L-automaton(trieT =(Q;A;i;T;0))

1. foreacha2A

2. if 0

(i;a)dened

3. set (i;a)= 0

(i;a);

4. setf( (i;a))=i;

5. else

6. set (i;a)=i;

7. foreachstatep2Qnfiginwidth-rstsearchandeacha2A

8. if 0

(p;a)dened

9. set (p;a)= 0

(p;a);

10. setf( (p;a))= (f(p);a);

11. else ifp62T

12. set (p;a)= (f(p);a);

13. else

14. set (p;a)=p;

15. return(Q;A;i;QnT; );

Example.

Figure 1 displays the trie that accepts M =faa;bbab;bbbg. It is an anti-factorial language. The automaton produced from the trie by algorithm

L-automaton is shown in Figure 2. It accepts the prexes of (ab[b)(ab)ba that are all the words avoidingM.

Theorem 4

LetT be the trie of an anti-factorial language M. Algorithm L- automaton builds a complete deterministic automaton acceptingL(M).

Proof.

The automaton produced by the algorithm has the same set of states as the input trie. It is clear that the automaton is deterministic and complete.

Letu2A+ and p=(i;u). A simple induction onjujshows that the word corresponding tof(p) is the longest proper sux ofuthat is a prex of some word inM. This notion comes up in the denition of the set of transitionsF2 in the automatonA(M). Therefore, the rest of the proof just remains to check that instructions implement the denition ofA(M). ./

(8)

-

m

m

m m

m

0

1

2 3

4

a

@

@

@

@ R b

- a

- b

? b 6

a

a

- b

- b

@

@

@

@ I

a

a,b

a,b

a,b

Figure 2: Automaton accepting the words that avoid the set faa;bbab;bbbg. Squares represent non-terminal states (sink states).

Prop osition5 AlgorithmL-automaton runs in time O(jQjjAj) on input

T = (Q;A;i;T;0) if transition functions are implemented by transition matri- ces.

Pro of. If transition functions and0are implemented by transition matrices, access to or denition of (p;a) or 0(p;a) (p state, a 2 A) are realized in constant amount of time. The result follows immediately. ./

3 Factor automaton of a single word

In this section we specialize the previous results to the language of factors of a single word. It is proved below that the contruction of Section 2 yields the factor automaton (minimaldeterministic automaton accepting the factors) of the word (see Theorem 7). The minimality of the automaton seems to be exceptional because, for example, the same construction applied to the set faa;abg does not provide a minimal automaton.

The reverse construction that produces the trie of minimal forbidden words from the factor automaton is described in the next section.

We consider a xed wordv2Aand denote byF(v) the language of factors ofv.

Prop osition6 The language F(v)is of nite type.

Pro of.Indeed, factors ofv, of lengths less thanjv j+1, avoid all words of length exactlyjv j+ 1. Therefore, every minimal forbidden word ofF(v) has length at

mostjv j+ 1. ./

For instance, for the wordv=abbab, the set of minimal forbidden words of

F(abbab) isfaa;aba;babb;bbb;cg(see Figures 3 and 4).

(9)

The result of the previous proposition is made more precise in the next sec- tion, but an immediate consequence of it and of the denition of the automaton

A(M) for an anti-factorial language M, the automaton A(MF(F(v))) has a nite number of states. The next statement gives a complete characterization of the automaton as the factor automaton ofv.

Theorem 7

For any v 2 A, the automaton obtained from A(MF(F(v))) by removing its sink states is the minimal deterministic nite automaton accepting the languageF(v) of factors of v.

Proof.

The automatonA(MF(F(v))) is already a deterministic nite automa- ton that accepts the languageF(v) by Theorem 3. We only have to prove that it is minimal after removing the sink states.

Supposeab absurdothat there exist two equivalent non-sink statesp;qinQ. By the standard equivalence relation of undistinginshability and by construction

p;q 2 F(v). Hence, v = xpy and v = x0q y0 and we can choose x and x0 of minimal length. We consider two cases:

(i) jxpj6=jx0q j, (ii) jxpj=jx0q j.

Case (i). We can suppose for example that jxpj<jx0q j(the case jxpj>jx0q j is handled symmetrically). Then,xpy 2F(v) implies that(p;y) is not a sink state, hence, by the equivalence (q ;y) is not a sink state, that is, q y 2 F(v) by Remark 4. Therefore, v = x"q y z where jx"j jx0j by the choice of x0 (of minimal length). Hence, jv j jx0j+jq j+jy j+jzj > jxpj+jy j = jv j, a contradiction.

Case (ii). The equalityjxpj=jx0q jimplies either that pis a sux ofq or the converse. Let us suppose for example that p = sq for some word s 6= . By Remark 3 statement 2, there existsw=pz that belongs toMF(F(v)). By the equivalence,q zis also a sink state and, again by the equivalence, for no proper prex uof q z, qu is a sink state. Hence, by Lemma 2.1, qq z is an element of MF(F(v)), that is, a sux ofq z. Sincep=sq ;s6=, qq z is a proper sux of

pz against the anti-factorial property ofMF(F(v)). A contradiction again.

After cases (i) and (ii) it appears that there cannot exist two dierent non- sink statesp;qinQthat are equivalent. Therefore the automaton without sink

states is minimal, which ends the proof. ./

The property stated by Theorem 7 does not generalize to any nite set words.

For example, consider the setM =faa;bag. Its trie has three internal nodes and then the automatonA(M) has three states after removing sink states. But the languageL(M) isb+ab and its minimal automaton has only two states.

4 Minimal forbidden words of a word

We end the article by an algorithm that builds the trie accepting the language MF(F(v)) of minimal words avoided by v. This is an implementation of the

(10)

- 0

m

1

m

2

m

3

m

4

m

6

m

5

m

- - - - -

a b b a b

&

b -

b

%

6

a

Figure 3: Factor automaton ofabbab; all states are terminal.

- 0

m

1

m

2

m

3

m

4

m

6

m

5

m

- - -

a b b

&

b -

b

%

6

a

c a a b b

Figure 4: Trie of minimalforbidden words ofF(abbab) on the alphabetfa;b;cg. Squares represent terminal states.

inverse of the transformation described in Section 2. Its design follows Equal- ity 2. A corollary of the transformation gives a bound on the number of minimal forbidden words of a single word, which improves on the bound coming readily from Proposition 6.

MF-trie(factor automaton A= (Q;A;i;T;) and its sux functions) 1. for each statep2Qin width-rst search fromiandeacha2A 2. if(p;a) undenedand(p=ior (s(p);a) dened) 3. set0(p;a) = new sink;

4. else if(p;a) =qandqnot already treated 5. set0(p;a) =q;

6. return(Q;A;i;fsink sg;0);

The input of algorithmMF-trie is the factor automaton of word v. It is the minimal deterministic automaton accepting the factors of v. It includes the failure function dened on the states of the automaton and calleds. This function is a by-product of ecient algorithms that build the factor automaton (see [4]). It is dened as follows. Let u2A+ and p=(i;u). Then, s(p) =

(i;u0) where u0is the longest sux ofufor which(i;u)6=(i;u0). It can be shown that the denition ofs(p) does not depend on the choice ofu.

(11)

Example

Consider the word v=abbabon the alphabetfa;b;cg. Its factor automaton is displayed in Figure 3. The failure function s dened on states has values: s(1) = s(5) = 0, s(2) = s(3) = 5, s(4) = 1, s(6) = 2. Algorithm

MF-trie produces the trie of Figure 4 that represents the set of ve words

faa;aba;babb;bbb;cg.

Theorem 8

LetA be the factor automaton of a wordv 2A. (It accepts the languageF(v).) Algorithm MF-triebuilds the tree-like deterministic automa- ton accepting the set of minimal forbidden words ofF(v), that is MF(F(v)).

Proof.

The transitions dened at line 5 duplicates the transition of the width- rst search tree, which is the tree of shortest paths from the the initial state of

A. This fact is used in the proof. All other transitions are created at line 3 and lead to a sink state. LetA0be the automaton produced by the algorithm.

Consider a word ua(a2 A) accepted by A0. (A0 accepts only non-empty words.) Let p = 0(i;u). By the remark above, u is the shortest word for which(i;u) =p. Therefore, ifu= bywith b 2 A, we have(i;y) = s(p) by denition of the sux functions. When the test \(s(p);a) dened" is satised, this implies thaty a 2F(v). Thus, by a62F(v), while by ;y a 2F(v). So, after Remark 1,by a=uais a minimal forbidden word forF(v).

Ifuis the empty word,p=i. The transition fromito the sink labeled bya is created under the condition \(p;a) undened", which means that the letter

adoes not occur inv. The word ais again a minimal forbidden word forF(v) in this case.

This proves that any word accepted byA0is inMF(F(v)).

Conversely, let ua2MF(F(v)). Ifuis the empty word, this means thata does not occur inv, therefore there is no transition labeled byainA. Lines 3 and 4 cope with this situation by creating a0-transition from the initial state to accepta.

Assume now that u =by with b 2 A. The word uis a factor of v, so let

p=(i;u). Note thatuis the shortest word for which p=(i;u), because all such words are suxes of each others in the factor automaton A. The word

ua is not a factor of v, so the condition \(p;a) undened" is satised. Let

q = s(p). We have q = (i;y) because of the minimality of length of u and the denition ofs. By the choice of ua=by a, y a is a factor of v. Thus, the condition \(s(p);a) dened" at line 3 is satised which yields the creation of a transition at line 4 to makeA0accept uaas wanted.

This ends the whole proof. ./

Corollary 9

A wordv2Ahas no more than2(jv j 2)(jAvj 1)+jAjminimal forbidden words ifjv j 3, where Av is the set of letters occurring in v. The bound becomesjAj+ 1 ifjv j<3.

Proof.

The number of words inMF(F(v)) is the number of sink states created during the execution of algorithm MF-trie. These states have exactly one

(12)

ingoing arc originated at a state of the factor automatonAof v. So, we have to count these arcs.

From the initial state of Athere is exactly jAj jAvj such arcs. From the (unique) state ofAwithout outgoing arc, there are at mostjAvjsuch arcs. From other states there are at mostjAvj 1 such arcs.

Forjv j3, it is known thatAhas at most 2jv j 2 states (see [4]). Therefore,

jMF(F(v))j(jAj jAvj)+jAvj+(2jv j 4)(jAvj 1) = 2(jv j 2)(jAvj 1)+jAj. Whenjv j<3, it can be checked directly thatjMF(F(v))jjAj+ 1. ./

Prop osition10 AlgorithmMF-trieruns in timeO(jv jjAj)on input word

v if transition functions are implemented by transition matrices.

Pro of. As for the proof of Proposition 5, the hypothesis on implementation implies that the running time of the algorithm is proportional to jQjjAj. Thus, the result is a consequence of the linear size ofA: the factor automaton ofvhas no more than 2jv jstates (see [4] for instance). ./

References

[1] A. V. Aho and M. J. Corasick. Ecient string matching: an aid to bibli- ographic search, Comm. ACM18:6(1975) 333{340.

[2] M.-P. Beal, F. Mignosi, and A. Restivo. Minimal Forbidden Words and Symbolic Dynamics, in (STACS'96, C. Puech and R. Reischuk, eds., LNCS

1046, Springer, 1996) 555{566.

[3] J. Cassaigne. Complexite et Facteurs Speciaux, Bull. Belg. Math. Soc.4 (1997) 67{88.

[4] M. Crochemore, C. Hancart. Automata for matching patterns, in (Hand- book of Formal Languages, G. Rozenberg, A. Salomaa, eds.", Springer- Verlag", 1997, Volume 2, Linear Modeling: Background and Application) Chapter 9, 399{462.

[5] V. Diekert, Y. Kobayashi. Some identities related to automata, determi- nants, and Mobius functions, Report 1997/05, Fakultat Informatik, Uni- versitat Stuttgart, 1997.

[6] A. de Luca, F. Mignosi. Some Combinatorial Properties of Sturmian Words, Theor. Comp. Sci.136(1994) 361{385.

[7] A. de Luca, L. Mione. On Bispecial Factors of the Thue-Morse Word,Inf.

Proc. Lett. 49(1994) 179{183.

[8] R. McNaughton, S. Papert. Counter-Free Automata, M.I.T. Press, MA 1970.

[9] D. Perrin. Symbolic Dynamics and Finite Automata, invited lecture in (Proc. MFCS'95, LNCS969, Springer, Berlin 1995).

Références

Documents relatifs

In particular, data from Google Books, accessed both via Ngram Viewer and through the tools available at BYU Corpora, have been used to test hypotheses for the

For the streaming setting, if we consider 0-lookahead repair strategies with aggregate cost, the threshold problem becomes solvable in polynomial time (hence it is as difficult as

Szeider [ 20 ] shows, using a reduction from 3-SAT, that the problem of finding a shortest simple exception avoiding path is NP-complete even when each forbidden path has two edges..

Adult Spanish second language (L2) learners of English and native speakers of English participated in an English perception task designed to investigate their ability to use

In this paper we present a statistical method for automatic language identification of written text using dictionaries containing stop words and diacritics.. We propose

A further correlation exists between word classes and the overall rates of contact words in bilingual speech corpora: those with over 20% contact words exhibit greater word

aspects of language production in L2: pronunciation accuracy and novel word form learning 49.. (i.e. the ability to encode, access, and produce

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des