Automaticity IV : sequences, sets, and diversity

Jeffrey Shallit

Journal de Théorie des Nombres de Bordeaux, tome 8, no. 2 (1996), p. 347-367

<http://www.numdam.org/item?id=JTNB_1996__8_2_347_0>

Access to the archives of the journal « Journal de Théorie des Nombres de Bordeaux » (http://jtnb.cedram.org/) implies agreement with the general conditions of use (http://www.numdam.org/conditions). Any commercial use or systematic printing constitutes a criminal offence. Any copy or printout of this file must contain this copyright notice.

Article digitized as part of the programme Numérisation de documents anciens mathématiques

Automaticity IV: Sequences, Sets, and Diversity

by JEFFREY SHALLIT

RÉSUMÉ. In this article we study the descriptional complexity of (i) sequences over a finite alphabet and (ii) subsets of the set $\mathbb{N}$ of natural numbers. Let $(s(i))_{i \ge 0}$ be a sequence over a finite alphabet $\Delta$. We define the $k$-automaticity of $s$, denoted $A_s^k(n)$, as the smallest possible number of states of a deterministic automaton which, for every $i$ with $0 \le i \le n$, takes the base-$k$ expansion of $i$ as input and computes $s(i)$. We give examples of sequences that have high automaticity in all bases $k$; for example, we show that the $k$-automaticity of the characteristic function of the prime numbers satisfies $A_s^k(n) = \Omega(n^{1/43})$ for all $k \ge 2$, thus making quantitative the classical theorem of Minsky and Papert that the set of prime numbers expressed in base 2 is not regular. We give examples of sequences whose $k$-automaticity is low in every base, as well as examples of sequences whose $k$-automaticity is low in some bases and high in others. We also obtain bounds on the automaticity of certain sequences that are fixed points of homomorphisms, for example the Fibonacci and Thue-Morse infinite words. Finally, we define a related concept, called diversity, and we give examples of sequences having high diversity.

ABSTRACT. This paper studies the descriptional complexity of (i) sequences over a finite alphabet; and (ii) subsets of $\mathbb{N}$ (the natural numbers). If $(s(i))_{i \ge 0}$ is a sequence over a finite alphabet $\Delta$, then we define the $k$-automaticity of $s$, $A_s^k(n)$, to be the smallest possible number of states in any deterministic finite automaton that, for all $i$ with $0 \le i \le n$, takes $i$ expressed in base $k$ as input and computes $s(i)$. We give examples of sequences that have high automaticity in all bases $k$; for example, we show that the characteristic sequence of the primes has $k$-automaticity $A_s^k(n) = \Omega(n^{1/43})$ for all $k \ge 2$, thus making quantitative the classical theorem of Minsky and Papert that the set of primes expressed in base 2 is not regular. We give examples of sequences with low automaticity in all bases $k$, and low automaticity in some bases and high in others. We also obtain bounds on the automaticity of certain sequences that are fixed points of homomorphisms, such as the Fibonacci and Thue-Morse infinite words. Finally, we define a related concept called diversity and give examples of sequences with high diversity.

Research supported in part by a grant from NSERC. Manuscript received April 5, 1996.

1. Introduction and Definitions

In this paper, I study the descriptional complexity of (i) sequences over a finite alphabet; and (ii) subsets of $\mathbb{N}$ (the natural numbers).

In 1972, Cobham [5] introduced the notion of what is now called a $k$-automatic sequence. (In the literature, one can also find the terms $k$-recognizable sequence and uniform tag sequence.) Roughly speaking, a sequence $(s(i))_{i \ge 0}$ over a finite alphabet is $k$-automatic if and only if $s(i)$ is a finite-state function of the base-$k$ representation of $i$.

However, most sequences are not $k$-automatic for any $k$. Instead of simply saying that a sequence is not $k$-automatic, we can measure quantitatively how "close" a sequence is to being $k$-automatic using the concept of automaticity studied in previous papers of the author and co-authors [26, 27, 20, 10]. In addition to its evident intrinsic interest, automaticity has proved useful in obtaining nontrivial lower bounds in computational complexity theory; see [7, 8, 16, 17].

More formally, define a deterministic finite automaton with output (DFAO) $M$ to be a 6-tuple, $(Q, \Sigma, \delta, q_0, \Delta, \tau)$, where $Q$ is a finite set of states, $\Sigma$ is a finite input alphabet, $q_0$ is the start state, and $\Delta$ is a finite output alphabet. The map $\delta : Q \times \Sigma \to Q$ is called the transition function, and is extended in the obvious way to a map $\delta : Q \times \Sigma^* \to Q$. The map $\tau : Q \to \Delta$ is the output function. On input $w \in \Sigma^*$, the machine $M$ outputs the single symbol $\tau(\delta(q_0, w))$. For more on these concepts, see, for example, [15].
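As an illustration of the DFAO definition, the following is a minimal Python sketch, not taken from the paper. The class name `DFAO`, the helper `digits_lsd_first`, and the choice of the Thue-Morse sequence as the example are assumptions made here for illustration; the automaton reads digits starting with the least significant one.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DFAO:
    """A DFAO given by its transition table, start state and output map."""
    delta: dict   # (state, input symbol) -> state
    q0: int       # start state
    tau: dict     # state -> output symbol

    def run(self, word):
        """Return tau(delta(q0, word)), reading `word` from left to right."""
        state = self.q0
        for symbol in word:
            state = self.delta[(state, symbol)]
        return self.tau[state]


def digits_lsd_first(i, k):
    """Base-k digits of i, least significant first (empty list for i = 0)."""
    digits = []
    while i > 0:
        i, d = divmod(i, k)
        digits.append(d)
    return digits


# Illustrative two-state DFAO for the Thue-Morse sequence:
# the state is the parity of the number of 1 bits read so far.
THUE_MORSE = DFAO(
    delta={(q, b): q ^ b for q in (0, 1) for b in (0, 1)},
    q0=0,
    tau={0: 0, 1: 1},
)

first_terms = [THUE_MORSE.run(digits_lsd_first(i, 2)) for i in range(16)]
```

Because the state only tracks a parity, appending zeroes to the input (leading zeroes of the number) does not change the output.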

Let $k$ be an integer $\ge 2$ and define $\Sigma_k = \{0, 1, \ldots, k-1\}$. If $w \in \Sigma_k^*$, then by $[w]_k$ I mean $w$ evaluated as a base-$k$ integer, that is, if $w = w_1 w_2 \cdots w_r$, then $[w]_k = \sum_{1 \le i \le r} w_i k^{r-i}$. If $n \ge 0$ is an integer, then by $(n)_k$ I mean the default base-$k$ representation of $n$, that is, one not containing leading zeroes. Note that $(0)_k = \epsilon$, the empty string.
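The two notations can be made concrete with a short Python sketch. The function names `value` and `canonical` are hypothetical; they stand for $[w]_k$ and $(n)_k$ respectively, with words represented as lists of digits, most significant digit first.

```python
def value(w, k):
    """[w]_k: the digit list w = w_1 w_2 ... w_r read as a base-k integer."""
    n = 0
    for digit in w:
        if not 0 <= digit < k:
            raise ValueError("digit out of range for this base")
        n = n * k + digit
    return n


def canonical(n, k):
    """(n)_k: base-k digits of n, most significant first, no leading zeroes."""
    digits = []
    while n > 0:
        n, d = divmod(n, k)
        digits.append(d)
    return digits[::-1]
```

Note that `canonical(0, k)` is the empty list, matching the convention that $(0)_k$ is the empty string, while `value` accepts leading zeroes.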

Suppose $(s(i))_{i \ge 0}$ is a sequence over the finite alphabet $\Delta$. If there exists a DFAO $M$ such that for all $i \ge 0$, we have $s(i) = \tau(\delta(q_0, w^R))$ for all $w \in \Sigma_k^*$ such that $[w]_k = i$, then the sequence is said to be $k$-automatic. This awkward definition results from the problem of "leading zeroes" in the input, and our convention that the machine $M$ reads the input number starting with the least significant digit.

Here is one alternate definition of $k$-automatic sequences. Define the $k$-fiber of the sequence $s$ at $a$ to be

$$F_k(s, a) = \{ (n)_k : s(n) = a \}.$$

Then $F_k(s, a)$ is a regular set for all $a \in \Delta$ if and only if the sequence $(s(i))_{i \ge 0}$ is $k$-automatic.

Another alternate definition of $k$-automatic sequences can be given in terms of a set called the $k$-kernel. Let $(s(n))_{n \ge 0}$ be a sequence over a finite alphabet. The $k$-kernel of $s$, which we denote by $K_s^k$, is defined as follows:

$$K_s^k = \{ (s(k^i n + a))_{n \ge 0} : i \ge 0,\ 0 \le a < k^i \}. \qquad (1)$$

Eilenberg [9, Proposition 3.3, p. 107] proved that a sequence is $k$-automatic if and only if its $k$-kernel is finite.

Given a sequence $(s(i))_{i \ge 0}$ we can define its $k$-automaticity $A_s^k(n)$ as follows: $A_s^k(n)$ is the smallest possible number of states in any DFAO $M = (Q, \Sigma_k, \delta, q_0, \Delta, \tau)$ such that for all $i$ with $0 \le i \le n$, we have $s(i) = \tau(\delta(q_0, w^R))$ for all $w \in \Sigma_k^*$ with $[w]_k = i$. We emphasize that the automaton is fed with the digits of $i$, starting with the least significant digit. This convention is actually important to specify, since it is known that there are languages of low automaticity whose reversal has high automaticity; see [10].

There is another way to define $k$-automaticity. Suppose we define the $n$-truncated $k$-kernel of the sequence $s$, $K_s^k(n)$, as follows:

$$K_s^k(n) = \{ (s(k^i m + a))_{0 \le m \le (n-a)/k^i} : i \ge 0,\ 0 \le a < k^i \}.$$

The $n$-truncated $k$-kernel consists of finite sequences. Call two such sequences $v, w \in K_s^k(n)$ $n$-dissimilar if there exists a position $j$ for which both $v(j)$ and $w(j)$ are defined and $v(j) \ne w(j)$. (Note that under this definition, if $v$ is a prefix of $w$, then $v$ and $w$ are similar.) Then $A_s^k(n)$ is defined to be the maximum number of pairwise $n$-dissimilar sequences in $K_s^k(n)$. It is not hard to see that this definition is identical to the previous one; see [27].

Note that the condition $m \le (n - a)/k^i$ is equivalent to $k^i m + a \le n$; in other words, the variable that is bounded by $n$ is not $m$ but the "true" variable $k^i m + a$.
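The truncated kernel is easy to enumerate for small $n$. The sketch below is an illustration, not the paper's procedure: it builds the set of finite sequences $(s(k^i m + a))$ with $k^i m + a \le n$ and then picks a pairwise dissimilar family greedily. The greedy family only gives a lower bound on the maximum, so it is a lower bound on the automaticity rather than its exact value. All function names are hypothetical.

```python
def truncated_kernel(s, k, n):
    """All sequences (s(k^i m + a)) with k^i m + a <= n, i >= 0, 0 <= a < k^i.

    Once k^i > n every such sequence has length 1, so larger i add nothing new.
    """
    kernel = set()
    power = 1
    while True:
        for a in range(min(power, n + 1)):
            length = (n - a) // power + 1
            kernel.add(tuple(s(power * m + a) for m in range(length)))
        if power > n:
            return kernel
        power *= k


def dissimilar(v, w):
    """True if v and w differ at some position where both are defined."""
    return any(x != y for x, y in zip(v, w))


def greedy_dissimilar_family(kernel):
    """A pairwise dissimilar family, chosen greedily (longest sequences first)."""
    family = []
    for seq in sorted(kernel, key=lambda t: (-len(t), t)):
        if all(dissimilar(seq, kept) for kept in family):
            family.append(seq)
    return family


def thue_morse(i):
    """t(i) = (number of 1 bits of i) mod 2."""
    return bin(i).count("1") % 2
```

For the Thue-Morse sequence in base 2 every kernel sequence is a prefix of $t$ or of its complement, so the family has exactly two members, in line with the sequence being 2-automatic.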

The following basic results on automaticity are easy to prove [27]:

(b) $A_s^k(n) = O(1)$ if and only if $s$ is $k$-automatic;

(c) There exists an absolute constant $c$ such that if $s$ is not $k$-automatic, then $A_s^k(n) \ge c \log_k n$ for infinitely many $n$.

(d)

For _anysequence s we have =

n) .

As parts (b) and (c) of this theorem show, if a sequence is not $k$-automatic, then its $k$-automaticity must be greater than $c \log_k n$ infinitely often. This suggests studying sequences that are not $k$-automatic, but which are "as close as possible" to $k$-automatic. We say that a sequence $s$ is $k$-quasiautomatic if $A_s^k(n) = O(\log n)$. We then have the following theorem, whose proof is easy and is omitted:

PROPOSITION 2. A sequence is $k$-quasiautomatic if and only if it is $k^e$-quasiautomatic for all $e \ge 1$.

So far we have discussed the $k$-automaticity of sequences, but the same terminology can be used for sets of non-negative integers. We say a set $S \subseteq \mathbb{N}$ is $k$-automatic if its characteristic sequence $\chi_S$ is $k$-automatic. Similarly, if $S$ is a set, then by $A_S^k(n)$ we mean $A_{\chi_S}^k(n)$.

2. Classical sets with high automaticity in all bases

In this section, we examine two classical sets (the primes, the square-free numbers) and show that their characteristic sequences have high $k$-automaticity (that is, $\Omega(n^{\epsilon})$ for some $\epsilon > 0$) in all bases $k \ge 2$. (By $f = \Omega(g)$ we mean there exist positive constants $c, n_0$ such that $f(n) \ge c\,g(n)$ for all $n \ge n_0$.) For the primes, our results can be viewed as making quantitative the classical result of Minsky and Papert [19] that the primes expressed in base 2 cannot be accepted by a finite automaton.

Our method is based on the following useful lemma:

LEMMA 3. Let $(s(i))_{i \ge 0}$ be a sequence over a finite alphabet $\Delta$, and suppose that there exists a constant $d$ such that for all $r, a, b$ with $r \ge 2$, $1 \le a, b < r$, $a \ne b$, and $\gcd(r, a) = \gcd(r, b) = 1$, there exists a non-negative integer $m = O(r^d)$ such that $s(rm + a) \ne s(rm + b)$. Then

$$A_s^k(n) = \Omega\!\left( \frac{n^{1/(d+1)}}{k \log\log n} \right)$$

for all $k \ge 2$, where the implied constant in the big-$\Omega$ does not depend on $k$.

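Proof. Since $m = O(r^d)$, there exists a constant $c$ such that $m \le c r^d - 1$ for all $r \ge 2$. Let $i = \lfloor (\log_k n - \log_k c)/(d+1) \rfloor$. Then $c\,k^{i(d+1)} \le n$. Put $r = k^i$. It follows that there exists $m$ such that $s(k^i m + a) \ne s(k^i m + b)$. However,

$$k^i m + a < k^i (c\,k^{id} - 1) + k^i = c\,k^{i(d+1)} \le n,$$

and the same bound holds for $k^i m + b$. It follows that the two subsequences $(s(k^i t + a))_{t \ge 0}$ and $(s(k^i t + b))_{t \ge 0}$ are $n$-dissimilar. Since $a, b$ were arbitrary integers relatively prime to $r$, we know that there are at least $\varphi(k^i)$ pairwise $n$-dissimilar sequences, where $\varphi$ is Euler's phi-function. By [21, Theorem 15], we know that $\varphi(n) > n/(5 \log\log n)$ for $n > 3$. Hence

$$\varphi(k^i) > \frac{k^i}{5 \log\log k^i}.$$

Since $k^i \ge (n/c)^{1/(d+1)}/k$, we conclude that $A_s^k(n) = \Omega\!\left( n^{1/(d+1)} / (k \log\log n) \right)$.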

We first examine the automaticity of the characteristic sequence of the primes. We need the following lemmas.

LEMMA 4. For all $x \ge 1$ we have $\prod_{x < p \le 2x} p > e^{x/3}$, where the product is over primes only.

Proof. Let $\vartheta(x) = \sum_{p \le x} \log p$, where the sum is over primes only. We know that $\vartheta(x) < 1.000081x$ for $x > 0$ [22, p. 360], and $\vartheta(x) \ge .84x$ for $x \ge 101$ [21, Theorem 10]. It follows that $\sum_{x < p \le 2x} \log p > 1.68x - 1.000081x > x/3$ for $x \ge 101/2$. Now it is easily verified by computer or hand calculation that $\sum_{x < p \le 2x} \log p > x/3$ for $1 \le x < 101/2$. It follows that $\prod_{x < p \le 2x} p > e^{x/3}$ for all $x \ge 1$.
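The finite check mentioned in the proof can be carried out along the following lines. This is a sketch under the assumption that the inequality to be verified is $\sum_{x < p \le 2x} \log p > x/3$ for real $x$ with $1 \le x < 101/2$; the function names are hypothetical. The sum is piecewise constant in $x$ and changes only when $x$ or $2x$ crosses a prime, so it suffices to compare its value on each piece with the right endpoint of that piece.

```python
import math


def primes_up_to(limit):
    """Sieve of Eratosthenes."""
    sieve = bytearray([1]) * (limit + 1)
    sieve[0:2] = b"\x00\x00"
    for p in range(2, math.isqrt(limit) + 1):
        if sieve[p]:
            sieve[p * p :: p] = bytearray(len(range(p * p, limit + 1, p)))
    return [p for p in range(limit + 1) if sieve[p]]


PRIMES = primes_up_to(101)


def log_product(x):
    """Sum of log p over primes p with x < p <= 2x (valid for 2x <= 101)."""
    return sum(math.log(p) for p in PRIMES if x < p <= 2 * x)


def lemma4_holds_on(lo, hi):
    """Check sum_{x < p <= 2x} log p > x/3 for every real x in [lo, hi)."""
    cuts = {lo, hi}
    for p in PRIMES:
        cuts.update(c for c in (p / 2, float(p)) if lo < c < hi)
    cuts = sorted(cuts)
    # The sum is constant on [left, right); x/3 is largest near `right`.
    return all(log_product(left) > right / 3 for left, right in zip(cuts, cuts[1:]))
```

The tightest piece is $5 \le x < 5.5$, where only the prime 7 lies in $(x, 2x]$ and $\log 7 \approx 1.946$ must exceed $5.5/3 \approx 1.833$.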

LEMMA 5. Given integers $k, l \ge 1$ with $\gcd(k, l) = 1$, there exists a prime $p = km + l$ with $m = O(\max(k, l)^{11/2})$. The constant in the big-$O$ is independent of $k$ and $l$.

Proof. Choose $x = \max(1, l/k, 3 \log l)$. Then from Lemma 4 there exists a prime $q \nmid l$ with $x < q \le 2x$. Now $q > l/k$, so $kq > l$, and $\gcd(kq, l) = 1$. Hence by Heath-Brown's version of Linnik's theorem [14], there exists a prime $p \equiv l \pmod{kq}$ with $p = O((kq)^{11/2})$. Since $q \le 2x = \max(2, 2l/k, 6 \log l)$, we have

$$p = O(\max(l^{11/2}, (k \log l)^{11/2}, k^{11/2})).$$

Hence $m = (p - l)/k = O(\max(l^{11/2} k^{-1}, k^{9/2} (\log l)^{11/2}, k^{9/2}))$, and the result follows.

LEMMA 6. Given integers $r, a, b$ with $r \ge 2$, $\gcd(r, a) = \gcd(r, b) = 1$, $1 \le a, b < r$, and $a \ne b$, there exists $m = O(r^{165/4})$ such that $rm + a$ is prime and $rm + b$ is composite.

Proof. We use a trick suggested by papers of Hartmanis and Shank [12] and Allen [1]. By Heath-Brown's version of Linnik's theorem [14], there exists $m_0 = O(r^{9/2})$ such that $p = r m_0 + a$ is a prime. Define $q = r m_0 + b$. Then $q = O(r^{11/2})$. If $q$ is composite, we're done, and $m_0 = O(r^{9/2})$.

Otherwise, assume $q$ is prime. Now, in Lemma 5, take $k = qr$ and $l = qr + p$. Then there exists $m_1 = O((qr + p)^{11/2}) = O(r^{143/4})$ such that $(qr) m_1 + (qr + p)$ is prime. However, $t = (qr) m_1 + (qr + q)$ is composite, since $q \mid t$ and $q < t$. Take $m = q m_1 + q + m_0$. Then $m = O(r^{165/4})$.
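The lemma guarantees a witness $m$ of polynomial size; in practice the least witness is tiny. The following brute-force sketch, which is an illustration and not the construction used in the proof, searches for the least $m$ with $rm + a$ prime and $rm + b$ composite. The function names and the `search_limit` parameter are hypothetical.

```python
from math import gcd


def is_prime(n):
    """Trial division; adequate for the small values used here."""
    if n < 2:
        return False
    d = 2
    while d * d <= n:
        if n % d == 0:
            return False
        d += 1
    return True


def least_prime_composite_witness(r, a, b, search_limit=10**6):
    """Least m >= 0 with r*m + a prime and r*m + b composite, else None."""
    for m in range(search_limit):
        x, y = r * m + a, r * m + b
        if is_prime(x) and y > 1 and not is_prime(y):
            return m
    return None


def worst_witness(r):
    """Largest least-witness over all admissible pairs (a, b) modulo r."""
    units = [a for a in range(1, r) if gcd(r, a) == 1]
    return max(
        (least_prime_composite_witness(r, a, b)
         for a in units for b in units if a != b),
        default=None,
    )
```

For example, with $r = 3$, $a = 1$, $b = 2$ the least witness is $m = 2$, since 7 is prime and 8 is composite.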

THEOREM 7. The set $P$ of prime numbers has $k$-automaticity $A_P^k(n) = \Omega(n^{1/43})$ for all integers $k \ge 2$.

Proof. Combine Lemmas 3 and 6.

We note that the constant $1/43$ in Theorem 7 is not optimal. Indeed, the constant $11/2$ in Lemma 5 is almost certainly not optimal. Wagstaff [31] has provided a heuristic model that predicts that the least prime congruent to $l \pmod{k}$ is $O(k^{1+\epsilon})$. If this prediction were true, it would improve the constant $1/43$ in Theorem 7 to $1/(2 + \epsilon)$.

We now turn to providing a lower bound on the $k$-automaticity of the squarefree numbers. Recall that a number $n$ is said to be squarefree if $t^2 \nmid n$ for all integers $t > 1$.

LEMMA 8. Let $(s(n))_{n \ge 0}$ be defined as follows: $s(n) = 1$ if $n$ is squarefree, and $s(n) = 0$ otherwise. Then for all $\epsilon > 0$, and $r, a, b$ such that $r \ge 2$, $1 \le a < r$, and $0 \le b < r$ with $\gcd(a, r)$ squarefree and $a \ne b$, there exists an $m = O(r^{13/9 + \epsilon})$ such that $rm + a$ is squarefree and $rm + b$ is not squarefree.

Proof. Let $q$ be the least prime not dividing $r|b - a|$. Since $r|b - a| < r^2$, by the prime number theorem we have $q = O(\log r^2) = O(\log r)$. Now $rk + b \equiv 0 \pmod{q^2}$ if and only if $k \equiv -b r^{-1} \pmod{q^2}$. Let $c$ be such that $0 \le c < q^2$ and $c \equiv -b r^{-1} \pmod{q^2}$. Consider the arithmetic progression

$$(rq^2)m + (rc + a), \qquad m \ge 0.$$

We have $\gcd(rq^2, rc + a)$ is squarefree, because any prime divisor of $rq^2$ and $rc + a$ must be a divisor of $r$ or $q$. If a prime $t$ divides both $r$ and $rc + a$, this implies $t \mid a$, and we know $\gcd(r, a)$ is squarefree by hypothesis. On the other hand, $rc + a \equiv 0 \pmod{q}$ implies that $rc \equiv -a \pmod{q}$. But $rc \equiv -b \pmod{q}$, so $a \equiv b \pmod{q}$, contradicting the choice of $q$.

Then, by a result of Heath-Brown [13], there exists an $m_0 = O(r^{13/9 + \epsilon})$ such that $(rq^2)m_0 + (rc + a)$ is squarefree. Take $m = q^2 m_0 + c$. Then $rm + a$ is squarefree, but $rm + b$ is divisible by $q^2$.
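As with the primes, the witness promised by the lemma can be found by direct search for small moduli. The sketch below is illustrative only and does not follow the proof; the function names are hypothetical, and 0 is treated as not squarefree because every square divides it.

```python
from math import gcd


def is_squarefree(n):
    """True if n >= 1 and no square t*t with t > 1 divides n."""
    t = 2
    while t * t <= n:
        if n % (t * t) == 0:
            return False
        t += 1
    return n >= 1


def least_squarefree_witness(r, a, b, search_limit=10**6):
    """Least m >= 0 with r*m + a squarefree and r*m + b not squarefree."""
    for m in range(search_limit):
        if is_squarefree(r * m + a) and not is_squarefree(r * m + b):
            return m
    return None


def admissible_pairs(r):
    """Pairs (a, b) allowed by the hypotheses of the lemma for the modulus r."""
    return [
        (a, b)
        for a in range(1, r)
        for b in range(r)
        if a != b and is_squarefree(gcd(a, r))
    ]
```

For instance, with $r = 3$, $a = 1$, $b = 2$ the least witness is $m = 2$: the number 7 is squarefree while 8 is divisible by 4.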

THEOREM 9. The set $S$ of squarefree numbers has $k$-automaticity $A_S^k(n) = \Omega(n^{2/5})$ for all $k \ge 2$.

Proof. Apply Lemma 3 with $d = 13/9 + \epsilon$.

Again, the constant $2/5$ in Theorem 9 is not optimal.

3. A set with low automaticity in all bases

In this section I give an example of a sequence that is $k$-quasiautomatic for all $k \ge 2$.

THEOREM 10. Define $a(1) = 1$, and

$$a(i+1) = a(i) + \prod_{2 \le k \le i+1} k^{\lfloor \log_k a(i) \rfloor + 1}$$

for $i \ge 1$. Then the set $A = \{a(i) : i \ge 1\}$ is not $k$-automatic, but is $k$-quasiautomatic for all $k \ge 2$.

The sequence $(a(i))$ begins 1, 3, 39, 331815, 114126085737676800331815, ...
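The listed terms can be reproduced computationally. The sketch below assumes the recurrence $a(i+1) = a(i) + \prod_{2 \le k \le i+1} k^{\ell_k(a(i))}$, where $\ell_k(n)$ is the number of base-$k$ digits of $n$; this form is an assumption inferred from the five listed terms, which it reproduces exactly. All function names are hypothetical.

```python
def num_digits(n, k):
    """Number of base-k digits of n >= 1, i.e. floor(log_k n) + 1."""
    count = 0
    while n > 0:
        n //= k
        count += 1
    return count


def next_term(a, i):
    """a(i+1) computed from a = a(i) under the assumed recurrence."""
    step = 1
    for k in range(2, i + 2):
        step *= k ** num_digits(a, k)
    return a + step


def terms(count):
    """The list [a(1), ..., a(count)]."""
    seq = [1]
    for i in range(1, count):
        seq.append(next_term(seq[-1], i))
    return seq


def lsd_digits(n, k):
    """Base-k digits of n, least significant first."""
    digits = []
    while n > 0:
        n, d = divmod(n, k)
        digits.append(d)
    return digits


def has_prefix_property(seq, k):
    """Check that the LSD-first digits of a(i) prefix those of a(i+1), i >= k-1."""
    for i in range(max(k - 1, 1), len(seq)):   # i is the 1-based index of a(i)
        low, high = lsd_digits(seq[i - 1], k), lsd_digits(seq[i], k)
        if high[: len(low)] != low:
            return False
    return True
```

Adding a multiple of $k^{\ell_k(a(i))}$ leaves the low-order base-$k$ digits of $a(i)$ untouched, which is why the reversed representations nest once $k \le i + 1$.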

Proof. First, we note the following observation. Suppose there exists an infinite string $w = w_1 w_2 w_3 \cdots$ over $\Sigma_k = \{0, 1, \ldots, k-1\}$ such that all but finitely many members $s$ of a set $S$ have the "prefix property", that is, $(s)_k^R$ is a prefix of $w$. Then $A_S^k(n) = O(\log n)$.

To see this, note that in this case we can write $S = S_1 \cup S_2$, where $S_1$ is finite and $S_2$ has the prefix property. To build an automaton that accepts all the base-$k$ representations of elements of $S_2 \cap [0, n]$, we simply create a linear chain of nodes, with transitions between them labeled with the symbols of $w$. The accepting states correspond to the members of $S_2$, and of course we need a single dead state in addition to handle the other transitions. The resulting automaton has $\log_2 n + O(1)$ states. Since $S_1$ is finite, we can accept it with a finite automaton. The result now follows because we can accept $S_1 \cup S_2$ using a direct product construction.

The construction of the sequence $(a(i))$ should now be clear. For bases $k \ge 2$, the sequence has the property that $(a(i))_k^R$ is a prefix of $(a(i+1))_k^R$ provided $i \ge k - 1$. Hence the observation of the previous two paragraphs applies, and the automaticity of $A$ is $O(\log n)$ for all $k \ge 2$. Note, however, that the constant in the big-$O$ depends on $k$.

To show that $A$ is not $k$-automatic for any $k$, it suffices to show that $\limsup_{i \to \infty} a(i+1)/a(i) = \infty$, and this follows from the recurrence.

4. Automaticity of fixed points of homomorphisms

Let $\varphi$ be a homomorphism from $\Delta^*$ to $\Delta^*$. If there is a symbol $a \in \Delta$ such that $\varphi(a) = ax$ for some $x \in \Delta^*$, then $y = a\,x\,\varphi(x)\,\varphi^2(x) \cdots$ is a fixed point of $\varphi$; that is, $\varphi(y) = y$. If further $\varphi$ is nonerasing (i.e., $\varphi(b) \ne \epsilon$ for all $b \in \Delta$), then $y$ is infinite. If $|\varphi(b)| = k$ for all $b \in \Delta$, then $\varphi$ is said to be $k$-uniform. A 1-uniform homomorphism is called a coding.

A well-known theorem of Cobham [5] states that $(s(i))_{i \ge 0}$ is the image (under a coding) of a fixed point of a $k$-uniform homomorphism if and only if $(s(i))_{i \ge 0}$ is $k$-automatic.

A natural problem is to determine the automaticity of fixed points of non-uniform homomorphisms. In particular, are there fixed points of homomorphisms which are quasi-automatic, but not automatic? This question was raised by the author in 1992 in the context of the fixed point $(t_n)$ of the homomorphism $1 \to 121$; $2 \to 12221$. The sequence $(t_n)$ and its relationship to the classical Thue-Morse sequence was studied by Allouche et al. [2]. Computation strongly suggests that $(t_n)$ is 2-quasiautomatic.

For

example,

= for 0 n

1864134,

but _notfor n = 1864135.

Although we are not yet able to prove the 2-quasiautomaticity of $(t_n)$, it is possible to prove that it is not 2-automatic [24]. (This last result was, according to J.-P. Allouche (personal communication), also proved by M. Mkaouar.)

We now give three examples. First, we exhibit a homomorphism whose fixed point is 2-quasiautomatic, but not 2-automatic. Next, we give a homomorphism whose fixed point is 2-automatic, but not $k$-quasiautomatic for any odd $k$. Finally, we use some simple theorems of Diophantine approximation to exhibit a homomorphism whose fixed point is not $k$-quasiautomatic for any $k \ge 2$.

THEOREM 11. Let $\varphi(c) = cba$, $\varphi(a) = aa$, and $\varphi(b) = b$. Let $(s_i)_{i \ge 0}$ be the fixed point of $\varphi$ beginning with $c$. Let $X = \{2^j + j : j \ge 0\}$. Then

(a) $s_0 = c$ and $s_i = b$ if and only if $i \in X$;

(b) $(s_i)_{i \ge 0}$ is not 2-automatic;

(c) $(s_i)_{i \ge 0}$ is 2-quasiautomatic.
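Part (a) can be checked on a long prefix by iterating the homomorphism. The following sketch is illustrative; the helper `fixed_point_prefix` and the dictionary `PHI` are names introduced here, and the prefix length 5000 is an arbitrary choice.

```python
def fixed_point_prefix(morphism, start, length):
    """First `length` symbols of the fixed point of `morphism` starting at `start`.

    Assumes morphism[start] begins with `start` and is longer than one symbol,
    so that repeated application keeps extending the word.
    """
    word = start
    while len(word) < length:
        word = "".join(morphism[ch] for ch in word)
    return word[:length]


# The homomorphism of the theorem: c -> cba, a -> aa, b -> b.
PHI = {"c": "cba", "a": "aa", "b": "b"}

prefix = fixed_point_prefix(PHI, "c", 5000)
b_positions = {i for i, ch in enumerate(prefix) if ch == "b"}
expected = {2 ** j + j for j in range(13) if 2 ** j + j < 5000}
```

The prefix has the shape $c\,b\,a\,b\,a^2\,b\,a^4\,b\,a^8 \cdots$, so the $j$-th occurrence of $b$ (counting from $j = 0$) sits at position $1 + \sum_{t < j} (2^t + 1) = 2^j + j$.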

For part (b), it suffices to show that $L = F_2(s, b)$ is not a regular set. It is easy to see that

$$L = \{ (2^j + j)_2 : j \ge 0 \}.$$

Now a routine argument using the pumping lemma [15] completes the proof.

Finally, for part (c), it suffices to construct an automaton with output with $O(\log n)$ states that generates the terms of the sequence $(s_i)$ correctly for all $i \le n$. We sketch the construction of such an automaton, leaving the details to the reader. Let $\Sigma^{\le n} = \{ w \in \Sigma^* : |w| \le n \}$. If $L$ is a language, we say that $L'$ is an $n$th-order approximation to $L$ if $L \cap \Sigma^{\le n} = L' \cap \Sigma^{\le n}$.

The basic idea of our construction is that it suffices to concentrate on $L_1 = F_2(s, b)^R$ and create an automaton accepting a $(1 + \lfloor \log_2 n \rfloor)$th order approximation to $L^R$. This is easy, since strings in $L^R$ begin with a short sequence of bits which are followed by many zeroes and then a 1.

The state set consists of four parts. The first part is $A = \{ q_w : w \in (0 + 1)^{\le \lfloor \log_2 \log_2 n \rfloor + 1} \}$. This part of the automaton forms a binary tree that can handle all possible strings of length $\lfloor \log_2 \log_2 n \rfloor + 1$. The transitions between states in the first part are given by $\delta(q_w, e) = q_{we}$ for $|w| \le \lfloor \log_2 \log_2 n \rfloor$ and $e \in \{0, 1\}$. The output function for the states in $A$ is given by $\tau(q_w) = c$ if $[w^R]_2 = 0$, $\tau(q_w) = b$ if $[w^R]_2 \in X$, and $\tau(q_w) = a$ for all other $w$.

The second part of the automaton consists of a linear chain of states, $B = \{ p_i : 0 \le i \le \lfloor \log_2 n \rfloor \}$. The transitions between states in the second part are given by $\delta(p_i, 0) = p_{i-1}$ for $2 \le i \le \lfloor \log_2 n \rfloor$, $\delta(p_1, 1) = p_0$, and $\delta(p_0, 0) = p_0$. The output function for these states is $\tau(p_i) = a$ for $i \ne 0$, and $\tau(p_0) = b$.

The third part of the state set is $C = \{ p'_i : 0 \le i \le \lfloor \log_2 n \rfloor \}$, a copy of $B$. The output function for these states is $\tau(p'_i) = b$ for all $i$.

The fourth and final part consists of a single dead state $d$. We set $\delta(d, e) = d$ for $e \in \{0, 1\}$, and $\tau(d) = a$.

The start state is $q_\epsilon$. We leave to the reader the task of specifying the connections between the different groups of states, observing that transitions $\delta(q_w, 0)$ for $|w| = \lfloor \log_2 \log_2 n \rfloor + 1$ that are not self-loops go to a state in $C$ if $[w^R]_2 \in X$, and otherwise go to a state in $B$. As an example, the machine in Figure 1 computes $(s_i)$ correctly for all $i < 2^8$.

The total number of states needed is $|A| + |B| + 1 \le 6 \log_2 n$.

Next, we exhibit a homomorphism whose fixed point is 2-automatic, but has high $k$-automaticity for all odd $k$.

Define $\varphi(0) = 01$; $\varphi(1) = 00$, and consider the fixed point $(p(i))_{i \ge 0}$ starting with 0. It is easy to see that $p(i) = \nu_2(i + 1) \bmod 2$, where $\nu_2(n)$ denotes the exponent of the highest power of 2 dividing $n$.

FIGURE 1. Automaton computing $s(i)$ for $0 \le i < 256$. The input is the base-2 expansion of $i$, starting with the least significant bit. The output is $s(i)$. The states are labeled with the name of the state, followed by a slash, followed by the output associated with that state. All unmarked transitions go to the dead state, labeled $d/a$.

We first give two simple lemmas:

LEMMA 12. Let $(s(i))_{i \ge 0}$ be a sequence over a finite alphabet $\Delta$, and suppose that there exists a constant $d$ such that for all $r, a, b$ with $r \ge 2$, $0 \le a, b < r$, and $a \ne b$, there exists a non-negative integer $m = O(r^d)$ such that $s(rm + a) \ne s(rm + b)$. Then $A_s^k(n) = \Omega(n^{1/(d+1)}/k)$ for all $k \ge 2$, where the implied constant in the big-$\Omega$ does not depend on $k$.

Proof. Exactly the same as the proof of Lemma 3.

LEMMA 13. Suppose $r$ is odd and $1 \le a < b \le r$. Then there exists $m$ such that $0 \le m < 4r$ and $\nu_2(rm + a) \not\equiv \nu_2(rm + b) \pmod 2$.

Proof. Let $b - a = 2^c \cdot t$, where $t$ is odd. Let $m \equiv (2^{c+1} - b) r^{-1} \pmod{2^{c+2}}$; the definition is meaningful since $r$ is odd. Then $rm + b \equiv 2^{c+1} \pmod{2^{c+2}}$, so $\nu_2(rm + b) = c + 1$. On the other hand,

$$rm + a = rm + b - 2^c t \equiv 2^c (2 - t) \pmod{2^{c+2}}.$$

Since $t$ is odd, we have $\nu_2(rm + a) = c$. Now $2^c < r$, so $2^{c+2} < 4r$, and hence $0 \le m < 4r$.

Now we can state and prove our theorem on the $k$-automaticity of $(p(i))_{i \ge 0}$.

THEOREM 14. If $p(i) = \nu_2(i + 1) \bmod 2$, then $(p(i))_{i \ge 0}$ is 2-automatic. If $k \ge 3$ is odd, then $A_p^k(n) = \Omega(n^{1/2}/k)$.

Proof. The fact that $p$ is 2-automatic follows from the fact that the defining homomorphism $\varphi$ is 2-uniform; see [5]. To get the automaticity bound for odd $k$, simply combine Lemmas 12 and 13.
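Both ingredients of this argument are easy to test numerically. The sketch below checks that the fixed point of $0 \to 01$, $1 \to 00$ agrees with $\nu_2(i+1) \bmod 2$, and computes the witness $m \equiv (2^{c+1} - b) r^{-1} \pmod{2^{c+2}}$ from the proof of Lemma 13. It assumes the hypothesis of that lemma is $1 \le a < b \le r$ with $r$ odd; function names are hypothetical, and the modular inverse via `pow(r, -1, modulus)` needs Python 3.8 or later.

```python
def nu2(n):
    """Exponent of the highest power of 2 dividing n >= 1."""
    return (n & -n).bit_length() - 1


def period_doubling_prefix(length):
    """Prefix of the fixed point of 0 -> 01, 1 -> 00 starting with 0."""
    word = [0]
    while len(word) < length:
        word = [y for x in word for y in ((0, 1) if x == 0 else (0, 0))]
    return word[:length]


def lemma13_witness(r, a, b):
    """The m constructed in the proof (r odd, 1 <= a < b <= r assumed)."""
    c = nu2(b - a)
    modulus = 2 ** (c + 2)
    return ((2 ** (c + 1) - b) * pow(r, -1, modulus)) % modulus


def witness_is_valid(r, a, b):
    """Check 0 <= m < 4r and that the two 2-adic valuations differ in parity."""
    m = lemma13_witness(r, a, b)
    return 0 <= m < 4 * r and nu2(r * m + a) % 2 != nu2(r * m + b) % 2
```

For example, with $r = 5$, $a = 1$, $b = 5$ one gets $c = 2$ and $m = 7$, so that $rm + b = 40$ has valuation 3 and $rm + a = 36$ has valuation 2.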

As a corollary, we can obtain a lower bound for the automaticity of the Thue-Morse sequence in all odd bases.

Let $s_k(i)$ denote the sum of the digits of $i$ when expressed in base $k$. Then the Thue-Morse sequence $(t(i))_{i \ge 0}$ is defined as follows: $t(i) = s_2(i) \bmod 2$. It is easy to see that the Thue-Morse sequence is 2-automatic. However, we have the following

THEOREM 15. Let $k \ge 3$ be an odd integer. Then $A_t^k(n) = \Omega(n^{1/4}/k^{1/2})$.

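Proof. Our proof is based on the following identity, which is well-known and easily proved by considering the base-2 expansion of $i + 1$:

$$s_2(i + 1) = s_2(i) + 1 - \nu_2(i + 1).$$

Taking this modulo 2, we obtain

$$t(i) + t(i + 1) + 1 \equiv p(i) \pmod 2, \qquad (2)$$

where $p$ is the function defined in Theorem 14.

Let $M = (Q, \Sigma_k, \delta, q_0, \Delta, \tau)$ be a DFAO computing $t(i)$ for all $i$ with $0 \le i \le n$, and assume that $M$ has $A_t^k(n)$ states. Now consider $M$ and create a slightly modified automaton $M' = (Q', \Sigma_k, \delta', q'_0, \Delta, \tau')$ such that on input $w$ with $[w^R]_k = i$, $M'$ computes the shifted sequence $t(i + 1)$ for $0 \le i < n$. This can be achieved as follows: define $Q' = Q \times \{0, 1\}$, where the second component of every state denotes a carry to be propagated, and let $q'_0 = [q_0, 1]$. Define

$$\delta'([q, 0], a) = [\delta(q, a), 0]; \qquad \delta'([q, 1], a) = \begin{cases} [\delta(q, a + 1), 0], & \text{if } a < k - 1; \\ [\delta(q, 0), 1], & \text{if } a = k - 1. \end{cases}$$

Also define $\tau'([q, 0]) = \tau(q)$ and $\tau'([q, 1]) = \tau(\delta(q, 1))$. We leave it to the reader to verify that the construction does indeed compute the shifted sequence.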

We now implement equation (2) by forming the direct product of the automata $M$ and $M'$, and using an output function that computes the function $p(i)$ correctly for all $i$ with $0 \le i < n$. It follows that

$$A_p^k(n - 1) \le 2 \left( A_t^k(n) \right)^2.$$

Since $A_p^k(n) = \Omega(n^{1/2}/k)$, the desired result follows.
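The digit-sum identity and the carry construction can both be exercised in a few lines. This is a sketch under stated assumptions: the carry rule implemented in `shifted_output` is the standard add-one-with-carry automaton (propagate the carry while the digit is $k-1$, otherwise add one and stop), and it is tested here only on the two-state base-2 Thue-Morse automaton. All names are hypothetical.

```python
def digit_sum(i, k=2):
    """Sum of the base-k digits of i."""
    total = 0
    while i > 0:
        i, d = divmod(i, k)
        total += d
    return total


def nu2(n):
    """Exponent of the highest power of 2 dividing n >= 1."""
    return (n & -n).bit_length() - 1


def t(i):
    """Thue-Morse sequence."""
    return digit_sum(i, 2) % 2


def lsd_digits(i, k):
    """Base-k digits of i, least significant first."""
    digits = []
    while i > 0:
        i, d = divmod(i, k)
        digits.append(d)
    return digits


def shifted_output(delta, tau, q0, k, word):
    """Run the carry automaton on `word` (LSD first); output for i + 1."""
    state, carry = q0, 1
    for a in word:
        if carry and a == k - 1:
            state = delta(state, 0)              # digit wraps to 0, carry stays
        elif carry:
            state, carry = delta(state, a + 1), 0
        else:
            state = delta(state, a)
    return tau(delta(state, 1)) if carry else tau(state)


def tm_delta(q, a):
    """Thue-Morse DFAO in base 2: state = parity of 1 bits read so far."""
    return q ^ a


def tm_tau(q):
    return q
```

If the carry is still pending when the input ends, the output is taken after feeding one extra digit 1, which accounts for inputs such as $11\cdots1$ whose successor has one more digit.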

We now turn to the third problem: finding a fixed point of a homomorphism of high automaticity in all bases. Our methods are based on the theory of Diophantine approximation [4] and Sturmian words (also called characteristic words or Christoffel words). For a survey on Sturmian words, see [3].

First, we introduce some notation.

If $\alpha$ is a real irrational number, we can expand it uniquely as an infinite continued fraction, $\alpha = [a_0, a_1, a_2, \ldots]$. The $a_i$ are called the partial quotients of $\alpha$. We say the partial quotients of $\alpha$ are bounded by $B$ if $a_i \le B$ for all $i \ge 1$. (For a survey on bounded partial quotients, see [25].) We define $p_n/q_n = [a_0, a_1, \ldots, a_n]$, and call $p_n/q_n$ the $n$th convergent to $\alpha$. We define $a'_n$, the $n$th complete quotient, to be $[a_n, a_{n+1}, \ldots]$. We define $\{\alpha\} = \alpha - \lfloor \alpha \rfloor$, the fractional part of $\alpha$, and $\|\alpha\| = \min(\alpha - \lfloor \alpha \rfloor, \lceil \alpha \rceil - \alpha)$, the distance to the nearest integer. We then have

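LEMMA 16. Let $\alpha$ be an irrational real number, $0 < \alpha < 1$, with partial quotients bounded by $B$. Let the numbers $0, \{\alpha\}, \{2\alpha\}, \ldots, \{t\alpha\}, 1$ be arranged in ascending order and let them be labeled $p_0, p_1, \ldots, p_{t+1}$. Then

$$\min_{0 \le i \le t} (p_{i+1} - p_i) \ge \frac{1}{(B+2)t}.$$

Proof. Let $p_k/q_k$ be the convergents to $\alpha$. It is a consequence of the three-distance theorem (also called Steinhaus' conjecture) that

$$\min_{0 \le i \le t} (p_{i+1} - p_i) = \|q_k \alpha\|, \quad \text{where } q_k \le t < q_{k+1}.$$

(See, for example, [29, 30, 28].) Now we know

$$\|q_k \alpha\| > \frac{1}{q_k + q_{k+1}} \ge \frac{1}{(a_{k+1} + 2) q_k} \ge \frac{1}{(B+2)t},$$

and the result follows.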

Our next lemma is a version of the inhomogeneous approximation theorem. Unlike the traditional versions of this theorem, the requirement that $\alpha$ has bounded partial quotients allows us to bound the size of the integers that effect the desired approximation.

LEMMA 17. Let $\alpha$ be an irrational real number, $0 < \alpha < 1$, with partial quotients bounded by $B$. Let $0 \le \beta < 1$ be a real number. Then for all $N \ge 1$ there exist integers $p, q$ with $0 \le p, q \le (B + 2)N^2$ such that

$$|p\alpha - q - \beta| < \frac{1}{N}.$$

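Proof. By Dirichlet's theorem (see, e.g., [4, Theorem I]), there exist integers $n, r$ with $1 \le n \le N$ and $0 \le r \le N$, such that $|n\alpha - r| < 1/N$. Choose $k$ such that $q_{k-1} \le N < q_k$, where $p_k/q_k$ are the convergents to $\alpha$. Then, as in the previous theorem,

$$|n\alpha - r| \ge \|q_{k-1} \alpha\| > \frac{1}{(B+2)N}.$$

Without loss of generality, assume $n\alpha - r > 0$, and set $p' = \lfloor \beta/(n\alpha - r) \rfloor$. Then $0 \le p' \le (B + 2)N$, and $0 \le \beta - (n\alpha - r)p' < 1/N$. Hence $|p' n \alpha - p' r - \beta| < 1/N$, and we may take $p = p'n$ and $q = p'r$.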

The next lemma shows that Sturmian sequences corresponding to real numbers with bounded partial quotients have the property that for all pairs of subsequences of the form $(s_{ri+c})_{i \ge 0}$, $(s_{ri+d})_{i \ge 0}$ with $c \ne d$, there is a small witness $i = m$ that shows that these subsequences are different.

LEMMA 18. Let $0 < \alpha < 1$ be an irrational real number with partial quotients bounded by $B$. Define the Sturmian word $(s_i)_{i \ge 1}$ by $s_i = \lfloor (i + 1)\alpha \rfloor - \lfloor i\alpha \rfloor$ for $i \ge 1$. Let $r \ge 2$ be an integer. Then for all integers $c, d$ with $0 \le c, d < r$, $c \ne d$, there exists an integer $m$ with $0 \le m \le 4(B + 2)^3 r^3$ such that $s_{rm+c} \ne s_{rm+d}$.

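Proof. We use the "circular representation" for intervals in $[0, 1)$, identifying the point 0 with the point 1, and considering each point modulo 1. Thus, for example, the interval we write as $(2/3, 1/3)$ is really $(2/3, 1) \cup (0, 1/3)$. See, for example, [11, §3.8, §23.2].

It is easy to see that $s_i = 1$ if and only if $\{i\alpha\} \in [1 - \alpha, 1)$. Hence if we could find $m$ such that

$$\{(rm + c)\alpha\} \in [1 - \alpha, 1) \quad \text{and} \quad \{(rm + d)\alpha\} \notin [1 - \alpha, 1),$$

it would follow that $s_{rm+c} \ne s_{rm+d}$. Now the first condition holds if and only if $\{rm\alpha\} \in I_c := [\{1 - (c+1)\alpha\}, \{1 - c\alpha\})$, and the second holds if and only if $\{rm\alpha\} \in I_d := [\{1 - d\alpha\}, \{1 - (d+1)\alpha\})$.

We have $\mu(I_c) + \mu(I_d) = 1$, where $\mu$ is Lebesgue measure; hence these intervals have nontrivial intersection whenever $c \ne d$. In fact, the endpoints of these intervals are precisely of the form $\{1 - i\alpha\}$ for some $i$ with $0 \le i \le r$. Let $p_0, p_1, \ldots, p_{r+1}$ denote the points $0, \{\alpha\}, \{2\alpha\}, \ldots, \{r\alpha\}, 1$ arranged in increasing order. It follows that $\mu(I_c \cap I_d) \ge \min_{0 \le i \le r} (p_{i+1} - p_i)$, and by Lemma 16, we know this quantity is bounded below by $1/((B + 2)r)$.

Now let $m'$ be the midpoint of the interval $I_c \cap I_d$. To find $m$ with $s_{rm+c} \ne s_{rm+d}$, it suffices to find integers $m, t$ with

$$|r\alpha m - t - m'| < \frac{1}{2(B+2)r}.$$

By a folklore result (see, e.g., [23]), since $\alpha$ has partial quotients bounded by $B$, we know that $r\alpha$ has partial quotients bounded by $r(B + 2)$. By Lemma 17, it follows that such an $m$ exists with $m \le r(B + 2)(2(B + 2)r)^2 = 4(B + 2)^3 r^3$.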

THEOREM 19. Let $0 < \alpha < 1$ be an irrational real number with bounded partial quotients. Let $s_i = \lfloor (i + 1)\alpha \rfloor - \lfloor i\alpha \rfloor$ for $i \ge 1$. Then for all $k \ge 2$, the $k$-automaticity of the sequence $(s_i)_{i \ge 1}$ is $\Omega(n^{1/4}/k)$.

Proof. Combine Lemmas 12 and 18.

It now follows from this result, for example, that the fixed point of the homomorphism $1 \to 10$, $0 \to 1$ is not $k$-quasiautomatic. This follows because this fixed point can be obtained as a Sturmian sequence by setting $\alpha = (\sqrt{5} - 1)/2$. It is known for which $\alpha$ the corresponding Sturmian sequence is the fixed point of a homomorphism; see [6].
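The identification of this fixed point with a Sturmian sequence can be checked on a prefix. The sketch below is illustrative and the function names are hypothetical. To avoid floating-point error it computes $\lfloor i\alpha \rfloor$ for $\alpha = (\sqrt{5} - 1)/2$ exactly, using $\lfloor i\sqrt{5} \rfloor = \operatorname{isqrt}(5i^2)$.

```python
from math import isqrt


def floor_i_alpha(i):
    """floor(i * alpha) for alpha = (sqrt(5) - 1) / 2, in exact integer arithmetic."""
    return (isqrt(5 * i * i) - i) // 2


def sturmian(i):
    """s_i = floor((i + 1) * alpha) - floor(i * alpha)."""
    return floor_i_alpha(i + 1) - floor_i_alpha(i)


def fibonacci_word(length):
    """Prefix of the fixed point of 1 -> 10, 0 -> 1 starting with 1."""
    word = "1"
    while len(word) < length:
        word = "".join("10" if ch == "1" else "1" for ch in word)
    return word[:length]


sturmian_prefix = "".join(str(sturmian(i)) for i in range(1, 2001))
```

The comparison starts at index $i = 1$, matching the indexing $(s_i)_{i \ge 1}$ used for the Sturmian word.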

5. Diversity

As we have seen in Section 1, a sequence is $k$-automatic if and only if its $k$-kernel (defined in Eq. (1)) is finite. The most spectacular way a sequence can fail to be $k$-automatic is for all the sequences in the $k$-kernel to be distinct; we call such a sequence strongly $k$-diverse. Results of the previous sections suggest that the property of strong diversity and related properties deserve further study. We make the following definitions:

DEFINITIONS 20. A sequence $(s(j))_{j \ge 0}$ is weakly $k$-diverse if the $\varphi(k)$ subsequences $\{ (s(kj + a))_{j \ge 0} : \gcd(a, k) = 1,\ 1 \le a < k \}$ are all distinct. A sequence is weakly diverse if it is weakly $k$-diverse for all $k \ge 2$.

A sequence $(s(j))_{j \ge 0}$ is $k$-diverse if the $k$ subsequences $\{ (s(kj + a))_{j \ge 0} : 0 \le a < k \}$ are all distinct. A sequence is diverse if it is $k$-diverse for all $k \ge 2$.

A sequence is strongly $k$-diverse if the subsequences $\{ (s(k^i j + a))_{j \ge 0} : i \ge 0,\ 0 \le a < k^i \}$ are all distinct. A sequence is strongly diverse if it is strongly $k$-diverse for all $k \ge 2$.

A sequence is maximally diverse if the subsequences $\{ (s(kj + a))_{j \ge 0} : 0 \le a < k,\ k \ge 1 \}$ are all distinct.

The results of previous sections can now be rephrased in the language of diversity. In Section 2 we showed that the characteristic sequences of the primes and squarefree numbers are weakly diverse. In Section 4 we showed that the sequence $(\nu_2(i + 1))_{i \ge 0}$ is $k$-diverse for all odd $k$, and we also showed that if $\alpha$ is a real number with bounded partial quotients, then the Sturmian sequence $(\lfloor (i+1)\alpha \rfloor - \lfloor i\alpha \rfloor)_{i \ge 1}$ is diverse.

We now give an example of a sequence that is strongly $k$-diverse for $k = 2$. Consider the set $X = \{2^j + j : j \ge 0\}$ introduced in Theorem 11, and let $(c_i)_{i \ge 0}$ be the characteristic sequence of this set. Then we have

THEOREM 21. The sequence $(c_i)_{i \ge 0}$ is strongly 2-diverse.

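Proof. We must show that, given any four integers $j, k, a, b \ge 0$ with $0 \le a < 2^j$, $0 \le b < 2^k$, and $(j,a) \ne (k,b)$, there exists $n \ge 0$ with $c_{2^j n + a} \ne c_{2^k n + b}$. Without loss of generality, assume $j \le k$ and, if $j = k$, then $a < b$. Let $n_0$ be a sufficiently large integer ($n_0 \ge 2^{k+1}$ is enough for the estimates in this proof) and set $n = 2^{n_0 2^j + a - j} + n_0$. Then $2^j n + a = 2^{n_0 2^j + a} + n_0 2^j + a = 2^i + i$ for $i = n_0 2^j + a$. Hence $c_{2^j n + a} = 1$.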

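It remains to show $c_{2^k n + b} = 0$. To see this, note that $2^k n + b = 2^{i'} + n_0 2^k + b$ with $i' = n_0 2^j + a + k - j$, so it suffices to show that $2^k n + b$ lies strictly between two consecutive elements of $X$:

$$2^{i'} + i' < 2^k n + b < 2^{i'+1} + i' + 1.$$

To prove the first inequality, we must show $k - j + a - b < n_0 (2^k - 2^j)$. There are three cases to examine.

(i) If $k = j$, then this inequality follows from the assumption that $a < b$.

(ii) If $k = j + 1$, then we must show $a - b + 1 < n_0 2^{k-1}$. Since $a < 2^j = 2^{k-1}$, it suffices to show $2^{k-1} < n_0 (2^k - 2^{k-1}) = n_0 2^{k-1}$, which is true since $n_0 \ge 2$.

(iii) If $k \ge j + 2$, then we must show $k - j + 2^j < n_0 (2^k - 2^j)$. Now $k - j \le k < 2^k$, and $2^j < 2^k - 2 \cdot 2^j$ provided $2^k > 3 \cdot 2^j$, which is true since $k \ge j + 2$. Adding these inequalities, we find $k - j + 2^j < 2 (2^k - 2^j) \le n_0 (2^k - 2^j)$, as desired.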

To prove the second inequality, we must show

$$n_0 (2^k - 2^j) + j - k + b - a < 2^{n_0 2^j + a + k - j} + 1.$$

There are two cases to consider.

(i) If $k = j$, we must show $b - a < 2^{n_0 2^j + a} + 1$. Since $b < 2^k$, it suffices to show $2^k \le 2^{n_0} + 1$, and for this it suffices to take $n_0 \ge k$.

(ii) If $k > j$, then $n_0 (2^k - 2^j) + j - k + b - a < n_0 (2^k - 2^j) + 2^k < (n_0 + 1) 2^k$. Thus it suffices to show $(n_0 + 1) 2^k \le 2^{n_0}$. Choose $n_0 \ge 2^{k+1}$. Then $(n_0 + 1) 2^k \le (n_0 + 1) n_0 / 2 \le 2^{n_0}$ provided $n_0 \ge 2$, which it is, since $n_0 \ge 2^{k+1}$. $\square$
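The construction in this proof can be checked mechanically for small exponents. The sketch below is illustrative only: the function names `c`, `witness`, and `check_all` are assumptions, and so is the concrete choice $n_0 = 2^{k+1}$, which is one value satisfying the lower bounds on $n_0$ used in the proof. Python's arbitrary-precision integers make the large powers of 2 harmless.

```python
def c(m):
    """Characteristic sequence of X = {2^i + i : i >= 0}."""
    if m < 1:
        return 0
    i = m.bit_length() - 1          # the only candidate exponent, since i < 2^i
    return 1 if m == (1 << i) + i else 0

def witness(j, a, k, b):
    """Return n = 2^(n0*2^j + a - j) + n0 as in the proof.

    Assumes j < k, or j == k and a < b (the normalisation used in the proof).
    The value n0 = 2^(k+1) is an assumed concrete choice.
    """
    n0 = 1 << (k + 1)
    return (1 << (n0 * (1 << j) + a - j)) + n0

def check_all(max_k):
    """Verify c(2^j n + a) = 1 and c(2^k n + b) = 0 for all admissible
    (j, a), (k, b) with j <= k <= max_k."""
    for k in range(max_k + 1):
        for j in range(k + 1):
            for a in range(1 << j):
                for b in range(1 << k):
                    if j == k and a >= b:
                        continue        # not in the normalised order
                    n = witness(j, a, k, b)
                    if c((n << j) + a) != 1 or c((n << k) + b) != 0:
                        return False
    return True
```

For example, with $j = 0$, $a = 0$, $k = 1$ the sketch uses $n_0 = 4$ and $n = 2^4 + 4 = 20$, which lies in $X$, while $40$ and $41$ do not.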

We now show the following:

THEOREM 22. Almost all 0,1-sequences are maximally diverse.

Proof. Since the set of pairs $\{ (k,a) : 0 \le a < k,\ k \ge 1 \}$ is countably infinite, it suffices to show that if $(k,a) \ne (l,b)$, then the set of sequences for which $s(ki+a) = s(li+b)$ for all $i \ge 0$ is of measure zero.

Let $g = \gcd(k,l)$. If $g \nmid b - a$, or if $k = l$ and $a \ne b$, then the linear progressions $(ki+a)_{i \ge 0}$ and $(lj+b)_{j \ge 0}$ contain no terms at all in common. Therefore the subsequences $(s(ki+a))_{i \ge 0}$ and $(s(lj+b))_{j \ge 0}$ are independent and hence the probability that they are identical is 0.

Otherwise, assume $g \mid b - a$. In order that $(k/g) i - (l/g) j = (b-a)/g$, we must have $i = (l/g) i' + i_0$ and $j = (k/g) i' + j_0$ for some constants $i_0, j_0$. Since $k \ne l$, at least one of $l/g$, $k/g$ must be different from 1. Without loss of generality, assume it is $l/g$. Choose a constant $i_1 \not\equiv i_0 \pmod{l/g}$; then the set $\{ k((l/g) i' + i_1) + a : i' \ge 0 \}$ contains no terms in common with $\{ lj + b : j \ge 0 \}$. Hence the subsequences $(s(k((l/g) i' + i_1) + a))_{i' \ge 0}$ and $(s(l((l/g) i' + i_1) + b))_{i' \ge 0}$ are independent, and so the probability that they are identical is zero.
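The arithmetic step in this proof, namely that the indices $i$ with $ki + a$ in the progression $lj + b$ form a single residue class modulo $l/g$, can be illustrated with a short sketch. The helper names `solution_class` and `avoids_progression` are hypothetical, and the search over one period is a deliberately simple stand-in for computing a modular inverse.

```python
from math import gcd

def solution_class(k, a, l, b):
    """Return (i0, l // g), g = gcd(k, l), such that k*i + a is congruent to b
    modulo l exactly when i is congruent to i0 modulo l // g.
    Return None when g does not divide b - a (no solutions at all)."""
    g = gcd(k, l)
    if (b - a) % g != 0:
        return None
    modulus = l // g
    for i0 in range(modulus):            # brute-force search over one period
        if (k * i0 + a - b) % l == 0:
            return i0, modulus

def avoids_progression(k, a, l, b, i1, terms=200):
    """Check, on finitely many terms, that k*((l//g)*t + i1) + a is never
    congruent to b modulo l, hence never of the form l*j + b."""
    modulus = l // gcd(k, l)
    return all((k * (modulus * t + i1) + a - b) % l != 0 for t in range(terms))
```

With $k = 4$, $a = 0$, $l = 6$, $b = 2$ the solution class is $i \equiv 2 \pmod 3$, so choosing $i_1 = 0$ gives indices whose images avoid the progression $6j + 2$ entirely.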

Although almost all 0,1-sequences are maximally diverse, it is not so easy to prove that any individual sequence has the maximally diverse property. We now give some examples of maximally diverse sequences.

Let $\alpha$ be a real irrational number with $0 < \alpha < 1$. Recall the definition of the Sturmian infinite word from Section 4: its $n$th term is $s_n = \lfloor (n+1)\alpha \rfloor - \lfloor n\alpha \rfloor$.

THEOREM 23. All Sturmian sequences are maximally diverse.

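Proof. We must show that given $j, k \ge 1$, $0 \le a < j$, $0 \le b < k$, and $(j,a) \ne (k,b)$, there exists an $n \ge 0$ such that $s_{jn+a} \ne s_{kn+b}$. It is easy to see that $s_n = 1 \iff \{n\alpha\} \in [1-\alpha, 1)$. Hence it suffices to exhibit $n$ such that $\{(jn+a)\alpha\} \in [1-\alpha, 1)$ and $\{(kn+b)\alpha\} \notin [1-\alpha, 1)$.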

First, let us consider the case $j = k$. We have […]. In this case, we […] intersection. Define $I = I_a \cap I_b$; then $\mu(I) > 0$. It now suffices to choose $n$ such that $\{jn\alpha\} \in I$. Such an $n$ exists by Kronecker's theorem (e.g., [11, Theorem 438]).

Second, let us consider the case $j \ne k$. Without loss of generality, let us assume $j < k$. Define $p(I)$, the projection of an ordinary interval $I$, to be $p(I) = \{ \{x\} : x \in I \}$. Thus, for example, $p([e, \pi)) = [e-2, \pi-3)$. Consider $I_a$, and let its left and right endpoints be $t$ and $u$ respectively. If $I_a$ "wraps around" 0, then choose $u \in [1, 2)$ so that $p([t,u)) = I_a$. Define $I_0$ and $I_1$ […].

I claim that $\mu(I_1) > \mu(I_a)$. For if $I_a$ contained a subinterval of measure $\ge$ […], then $I_1 = [0,1)$, and so $\mu(I_1) = 1 > \mu(I_a)$. Otherwise, $I_a$ contains no subinterval of measure […], so $\mu(I_a)$ […]. In this case, […]. Now $\mu(I_a) + \mu(I_b) = 1$. Hence $\mu(I_1) + \mu(I_b) > 1$ and so $I_1$ and $I_b$ have nontrivial intersection. Let $I_2 = I_1 \cap I_b$; then $\mu(I_2) > 0$.

By our definition of $I_0$ and $I_1$, there is an interval $I_3 \subseteq I_0$ such that if $I_3 = [v, w]$, then $[kv, kw] \subseteq I_2$. Also, since $I_3 \subseteq I_0$, it is clear that $[jv, jw] \subseteq I_a$. Again, by Kronecker's theorem, we can find $n$ such that $\{n\alpha\} \in I_3$. For this $n$ we have $\{jn\alpha\} \in I_a$ and $\{kn\alpha\} \in I_2 \subseteq I_b$, as desired.
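Theorem 23 guarantees that a separating index $n$ exists but the proof does not produce it explicitly. As an illustration only, the sketch below searches for the first such $n$ for one particular slope, $\alpha = (\sqrt{5}-1)/2$, which is an assumed example and not singled out by the text. It uses the criterion stated in the proof, $s_n = 1 \iff \{n\alpha\} \in [1-\alpha, 1)$, which is the same as saying that $\lfloor n\alpha \rfloor$ increases from $n$ to $n+1$. Exact integer arithmetic via `math.isqrt` avoids floating-point rounding; the names `floor_n_alpha`, `s`, `first_difference` and the search bound are hypothetical.

```python
from math import isqrt

def floor_n_alpha(n):
    """floor(n * alpha) for alpha = (sqrt(5) - 1) / 2, computed exactly."""
    return (isqrt(5 * n * n) - n) // 2

def s(n):
    """s_n = 1 iff {n*alpha} is in [1 - alpha, 1), i.e. iff floor(n*alpha)
    increases when n is replaced by n + 1."""
    return floor_n_alpha(n + 1) - floor_n_alpha(n)

def first_difference(j, a, k, b, bound=10_000):
    """Smallest n < bound with s(j*n + a) != s(k*n + b); None if none is found
    below the (arbitrary, assumed) search bound."""
    for n in range(bound):
        if s(j * n + a) != s(k * n + b):
            return n
    return None
```

A failed search would not contradict the theorem, since the bound is arbitrary; for small $j$ and $k$ with this slope a difference shows up after very few terms.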

If a sequence $d$ is diverse, then we know that for all $r, a, b$ with $0 \le a, b < r$ and $a \ne b$, there exists an $m$ such that $d(rm+a) \ne d(rm+b)$. If there is a function $f$ such that $m = O(f(n))$, then $f(n)$ is said to be a diversity measure for $d$. In a previous section we showed, for example, that the diversity measure for Sturmian sequences corresponding to real numbers with bounded partial quotients is $O(r^3)$. We now show that the diversity measure for almost all sequences is low:

THEOREM 24. Almost all binary sequences have the property that for all $r \ge 1$, and for all $a, b$ with $0 \le a, b < r$ and $a \ne b$, there exists $m = O(\log r)$ such that $s_{rm+a} \ne s_{rm+b}$.

Proof. By Theorem 22, we may restrict our attention to sequences that are diverse.

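We have

$$\Pr\left[\exists\, a \ne b \text{ with } 0 \le a, b < r \text{ such that } s_{rm+a} = s_{rm+b} \text{ for all } m < f(r)\right] \le r^2\, 2^{-f(r)}. \tag{3}$$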

Choose $f(r) = \lceil 4 \log_2 r \rceil$; then $r^2 2^{-f(r)} = \Theta(r^{-2})$. (Here $f = \Theta(g)$ means $f = O(g)$ and $g = O(f)$.) Then $\sum_{r \ge 1} r^2 2^{-f(r)}$ converges. Hence by the Borel-Cantelli lemma, with probability 1 at most finitely many of the events of the form (3) occur. That is, with probability 1, the event

$$\forall \text{ pairs } (a,b)\ \ \exists\, m < \lceil 4 \log_2 r \rceil \text{ such that } s_{rm+a} \ne s_{rm+b}$$

occurs all but finitely many times. Hence with probability 1 we have $m = O(\log r)$.
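The quantity bounded in Theorem 24 can be measured directly on a finite stretch of any sequence. The following sketch is an illustration under stated assumptions: the name `separation_bound`, the cutoff `max_m`, and the helper `random_bits` with its seed are all hypothetical, and a pseudo-random bit list is only a stand-in for a "typical" binary sequence.

```python
import random
from itertools import combinations

def separation_bound(s, r, max_m):
    """Smallest M <= max_m such that every pair a != b with 0 <= a, b < r has
    some m <= M with s(r*m + a) != s(r*m + b).
    Returns None if some pair is not separated within max_m."""
    worst = 0
    for a, b in combinations(range(r), 2):
        for m in range(max_m + 1):
            if s(r * m + a) != s(r * m + b):
                worst = max(worst, m)
                break
        else:
            return None                 # this pair agrees on all tested m
    return worst

def random_bits(n, seed=0):
    """n pseudo-random bits (assumed stand-in for a typical sequence)."""
    rng = random.Random(seed)
    return [rng.getrandbits(1) for _ in range(n)]
```

One possible use is to take `bits = random_bits(4000)` and tabulate `separation_bound(bits.__getitem__, r, 200)` for $r = 2, 4, 8, 16$, comparing the values with $\lceil 4 \log_2 r \rceil$; the theorem is asymptotic, so individual small values of $r$ carry no guarantee.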

Interestingly enough, I do not know a single explicit example of a diverse sequence with diversity measure $O(\log n)$.

6. Acknowledgments.

I thank Mark Turner for suggesting the term "diversity". Drew Vandeth, Eric Bach, and Jean-Paul Allouche read a draft of this paper and made many useful suggestions. I thank the referee for his comments.

REFERENCES

[1] D. Allen, Jr. On a characterization of the nonregular set of primes. J. Comput. System Sci. 2 (1968), 464-467.

[2] J.-P. Allouche, A. Arnold, J. Berstel, S. Brlek, W. Jockusch, S. Plouffe, and B. E. Sagan. A relative of the Thue-Morse sequence. Discrete Math. 139 (1995), 455-461.

[3] J. Berstel. Recent results on Sturmian words. To appear, Proc. of DLT '95. Also available at http://www-litp.ibp.fr/berstel/Liaisons/magdeburg.ps.gz.

[4] J. W. S. Cassels. An Introduction to Diophantine Approximation. Cambridge University Press, 1957.

[5] A. Cobham. Uniform tag sequences. Math. Systems Theory 6 (1972), 164-192.

[6] D. Crisp, W. Moran, A. Pollington, and P. Shiue. Substitution invariant cutting sequences. J. Théorie Nombres Bordeaux 5 (1993), 123-137.

[7] C. Dwork and L. Stockmeyer. On the power of 2-way probabilistic finite state automata. In Proc. 30th Ann. Symp. Found. Comput. Sci., pages 480-485. IEEE Press, 1989.

[8] C. Dwork and L. Stockmeyer. A time complexity gap for two-way probabilistic finite-state automata. SIAM J. Comput. 19 (1990), 1011-1023.

[9] S. Eilenberg. Automata, Languages, and Machines, Vol. A. Academic Press, 1974.

[10] I. Glaister and J. Shallit. Automaticity III: Polynomial automaticity and context-free languages. Submitted, 1996.

[11] G. H. Hardy and E. M. Wright. An Introduction to the Theory of Numbers. Oxford University Press, 1989.

[12] J. Hartmanis and H. Shank. On the recognition of primes by automata. J. Assoc. Comput. Mach. 15 (1968), 382-389.

[13] D. R. Heath-Brown. The least square-free number in an arithmetic progression. J. Reine Angew. Math. 332 (1982), 204-220.

[14] D. R. Heath-Brown. Zero-free regions for Dirichlet L-functions and the least prime in an arithmetic progression. Proc. Lond. Math. Soc. 64 (1992), 265-338.

[15] J. E. Hopcroft and J. D. Ullman. Introduction to Automata Theory, Languages, and Computation. Addison-Wesley, 1979.

[16] J. Kaneps and R. Freivalds. Minimal nontrivial space complexity of probabilistic one-way Turing machines. In B. Rovan, editor, MFCS '90 (Mathematical Foundations of Computer Science), Vol. 452 of Lecture Notes in Computer Science, pages 355-361. Springer-Verlag, 1990.

[17] J. Kaneps and R. Freivalds. Running time to recognize nonregular languages by 2-way probabilistic automata. In J. Leach Albert, B. Monien, and M. Rodríguez Artalejo, editors, ICALP '91 (18th International Colloquium on Automata, Languages, and Programming), Vol. 510 of Lecture Notes in Computer Science, pages 174-185. Springer-Verlag, 1991.

[18] D. E. Knuth. The Art of Computer Programming, Vol. III: Sorting and Searching. Addison-Wesley, 1973.

[19] M. Minsky and S. Papert. Unrecognizable sets of numbers. J. Assoc. Comput. Mach. 13 (1966), 281-286.

[20] C. Pomerance, J. M. Robson, and J. O. Shallit. Automaticity II: Descriptional complexity in the unary case. To appear, Theoret. Comput. Sci.

[21] J. B. Rosser and L. Schoenfeld. Approximate formulas for some functions of prime numbers. Ill. J. Math. 6 (1962), 64-94.

[22] L. Schoenfeld. Sharper bounds for the Chebyshev functions θ(x) and ψ(x). II. Math. Comp. 30 (1976), 337-360. Corrigenda in Math. Comp. 30 (1976), 900.

[23] J. Shallit. Some facts about continued fractions that should be better known. Technical Report CS-91-30, University of Waterloo, Department of Computer Science, July 1991.

[24] J. Shallit. The fixed point of 1 → 121, 2 → 12221 is not 2-automatic. Unpublished manuscript, dated September 10, 1992.

[25] J. Shallit. Real numbers with bounded partial quotients: a survey. Enseign. Math. 38 (1992), 151-187.

[26] J. Shallit and Y. Breitbart. Automaticity: Properties of a measure of descriptional complexity. In P. Enjalbert, E. W. Mayr, and K. W. Wagner, editors, STACS 94: 11th Annual Symposium on Theoretical Aspects of Computer Science, Vol. 775 of Lecture Notes in Computer Science, pages 619-630. Springer-Verlag, 1994.

[27] J. Shallit and Y. Breitbart. Automaticity I: Properties of a measure of descriptional complexity. To appear, J. Comput. System Sci.

[28] N. B. Slater. Gaps and steps for the sequence nθ mod 1. Proc. Cambridge Phil. Soc. 63 (1967), 1115-1123.

[29] V. T. Sós. On the distribution mod 1 of the sequence nα. Ann. Univ. Sci.

[30] S. Swierczkowski. On successive settings of an arc on the circumference of a circle. Fundamenta Math. 46 (1958), 187-189.

[31] S. S. Wagstaff, Jr. Greatest of the least primes in arithmetic progressions having a given modulus. Math. Comp. 33 (1979), 1073-1080.

Jeffrey SHALLIT
Department of Computer Science
University of Waterloo
Waterloo, Ontario, Canada N2L 3G1
e-mail:
