Série Mathématiques

B. SAGALOVSKY
Maximum likelihood estimation for discrete-time processes with finite state space; a linear case
Annales scientifiques de l'Université de Clermont-Ferrand 2, tome 69, série Mathématiques, no 19 (1981), p. 229-236
<http://www.numdam.org/item?id=ASCFM_1981__69_19_229_0>
© Université de Clermont-Ferrand 2, 1981, all rights reserved.
MAXIMUM LIKELIHOOD ESTIMATION FOR DISCRETE-TIME PROCESSES
WITH FINITE STATE SPACE
A LINEAR CASE

B. SAGALOVSKY
Universidad Simón Bolívar, Caracas, Venezuela

PRELIMINARY REMARK: This report, presented at the École d'Été de Calcul des Probabilités, Saint-Flour, July 1980, is abstracted from reference [1]. The reader should go to that reference for formal proofs of the results presented and for some additional results.

THE PROBLEM:
Consider a process X_0, X_1, X_2, ..., which takes values in a finite state space S = {1, 2, ..., s}. Denote by F_n the past of the process up to and including the n-th transition, i.e., F_n = σ{X_0, X_1, ..., X_n}. Given F_n, we can consider the probability of the process going to state j at time (n+1), which we can write as

    p_n^j = P(X_{n+1} = j | F_n).

In what follows we assume that p_n^j is known as a function of an unknown parameter a. Write it as p_n^j(a) and look at the behavior of the MLE (maximum likelihood estimate) of a. We make the following restrictive assumptions:
A1: a is real, and the true value a° is known to belong to a bounded interval I = [a̲, a̅].

A2: p_n^j(a) is linear in a, that is,

    p_n^j(a) = r_n^j + a q_n^j,

with coefficients r_n^j, q_n^j determined by the past of the process.

A3: The effect of a is restricted by the following condition: there exists K > 0 such that for each n, for each possible evolution of the process up to time n, we have that, for every j ∈ S, either

    p_n^j(a) = 0 for all a ∈ I,   or   p_n^j(a) ≥ K for all a ∈ I.

We can interpret A3 as stating that, at time n, we know which states might be occupied at time (n+1), and we know that the probability of occupancy of such states is at least K. Even though current work seems to indicate that assumptions A1 and A2 can be relaxed, A3 seems to be essential in the developments that follow.

AN EXAMPLE:
The example that motivated this study has already been analyzed in references [2] and [3] by different methods, and was suggested to this author by one of the authors of [3], Professor P. Varaiya.

Consider a Markov Chain with state space S = {1, 2, ..., s} whose transition probabilities {p_ij, i, j ∈ S} depend both on an unknown parameter a and on a control action u that we can apply on each transition. That is, p_ij = p_ij(a, u). After each transition we can estimate a from our observations on the chain. Call α_n this estimate. We say that u is an adaptive control if u_n = u(X_n, α_n), that is, if the control we exert at time n depends both on the state we occupy at that time and on the estimate we have of what the true value of a may be.
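As a toy illustration of such an adaptive scheme (the two-state chain, the linear parametrization, and the control rule below are all invented for this sketch; they are not the example analyzed in [2], [3]), one can simulate the loop "observe a transition, recompute the MLE α_n, apply u_n = u(X_n, α_n)":

```python
import math
import random

random.seed(0)

A_GRID = [k / 50 for k in range(51)]           # candidate values of a in I = [0, 1]

def p1(a, u):
    """Hypothetical P(X_{n+1} = 1 | X_n, u): linear in a (assumption A2)."""
    return (0.5 if u == 0 else 0.6) + 0.2 * a  # stays inside (0, 1) on I

def log_lik(a, history):
    """Log-likelihood of a given observed (control, next_state) pairs."""
    return sum(math.log(p1(a, u) if j == 1 else 1.0 - p1(a, u))
               for (u, j) in history)

def mle(history):
    """Smallest maximizer of the log-likelihood over the grid (the MLE alpha_n)."""
    if not history:
        return A_GRID[0]
    scores = [log_lik(a, history) for a in A_GRID]
    return A_GRID[scores.index(max(scores))]   # index() returns the first (smallest) tie

def control(a_hat, state):
    """Adaptive control u_n = u(X_n, alpha_n): an arbitrary made-up rule."""
    return 1 if (a_hat > 0.5) == (state == 1) else 0

a_true, x = 0.3, 0
history = []
for n in range(150):
    u = control(mle(history), x)               # control based on the current estimate
    x_next = 1 if random.random() < p1(a_true, u) else 0
    history.append((u, x_next))
    x = x_next

estimate = mle(history)
print(estimate)
```

The point is only the shape of the loop: the control in force at each step is a function of the state and of the running estimate, so the resulting process is no longer Markov, as noted below.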
As the control depends on the whole past of the chain (through α_n), the resulting process is no longer Markov. We can imagine more complex ways of selecting u_n based, for example, both on α_n and the estimated variance of α_n, etc.

Assumptions A1 and A2 keep almost the same form in this special case, and A3 takes the simpler form: there exists K > 0 such that, for every i, j ∈ S and every admissible (a, u), either p_ij(a, u) = 0 or p_ij(a, u) ≥ K.

NOTATION AND MAIN RESULTS:
We define the likelihood of a, given a ∈ I, at time n as

    Λ_n(a) = Π_{m=0}^{n-1} p_m(a)

and the log-likelihood

    L_n(a) = Σ_{m=0}^{n-1} log p_m(a),

where we have denoted by p_m(a) the probability, as a function of a, of the transition actually observed at time m, i.e., p_m(a) = p_m^{X_{m+1}}(a). Observe that p_m(a) ≥ K, by A3. Assumptions A2, A3 allow us to consider the derivatives of L_n(a), which take a simple form:

    L'_n(a) = Σ_{m=0}^{n-1} p'_m / p_m(a),    L''_n(a) = − Σ_{m=0}^{n-1} (p'_m)² / p_m(a)² ≤ 0,

where p'_m, the slope of the linear function p_m(·), does not depend on a; in particular L_n is concave on I. We can now define the MLE of a° at time n, α_n, to be the smallest element of I such that

    L_n(α_n) = max_{a ∈ I} L_n(a).

To study the asymptotic behavior of α_n it now suffices to look at the asymptotic behavior of L'_n, as the following picture suggests. Define

    Y_m(a) = p'_m / p_m(a),    E_m(a) = E[Y_m(a) | F_m],

so that L'_n(a) = Σ_{m=0}^{n-1} Y_m(a).
To simplify notation we assume, without loss of generality, that a° = 0, and also that −1 ∈ I and 1 ∈ I. Now, using the fact that Σ_{j ∈ S} p_m^j(a) = 1 for every a (so that the slopes satisfy Σ_j (p_m^j)' = 0), it can be easily shown that

    E_m(0) = 0 for all m

and, furthermore, that E_m(·) is nonincreasing on I for all m.

Take now a < 0 fixed, the argument being symmetric for a > 0. Then E_m(a) ≥ 0 and L'_n(a) turns out to be a submartingale. We can decompose this submartingale by considering, for each n, the martingale

    M_n = Σ_{m=0}^{n-1} (Y_m − E_m)

and the increasing process

    A_n = Σ_{m=0}^{n-1} E_m

(where we have dropped the a's in Y_m(a), etc.).
After some manipulations, it is easy to show that

    E_m(a) = −a Σ_j ((p_m^j)')² / p_m^j(a),

the sum running over the states j reachable at time m. Therefore, the asymptotic behavior of A_n is easily determined by the asymptotic behavior of the sums Σ_{m=0}^{n-1} Σ_j ((p_m^j)')² as n goes to ∞.
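The decomposition just described can be checked numerically. The two-state chain below is a made-up stand-in (P(next state = 1) = 0.5 + 0.2a, true value a° = 0, score evaluated at a fixed a < 0); the sketch only verifies that L'_n(a) = M_n + A_n holds term by term, and that the increasing part A_n grows linearly, as expected when E_m(a) > 0:

```python
import random

random.seed(1)

def p1(a):
    """Hypothetical P(X_{n+1} = 1): linear in a, true value a° = 0."""
    return 0.5 + 0.2 * a

a = -0.5                      # fixed a < a° = 0
n_steps = 1000

Lp = M = A = 0.0              # L'_n(a), martingale part M_n, increasing part A_n
for _ in range(n_steps):
    x_next = 1 if random.random() < p1(0.0) else 0        # true dynamics (a° = 0)
    # Y_m(a) = p'_m / p_m(a) for the transition actually observed
    y = 0.2 / p1(a) if x_next == 1 else -0.2 / (1.0 - p1(a))
    # E_m(a) = E[Y_m(a) | F_m], expectation taken under the true parameter a° = 0
    e = p1(0.0) * (0.2 / p1(a)) + (1.0 - p1(0.0)) * (-0.2 / (1.0 - p1(a)))
    Lp += y
    M += y - e
    A += e

assert abs(Lp - (M + A)) < 1e-9       # L'_n = M_n + A_n exactly
print(A / n_steps)                    # per-step drift; here E_m(a) = 1/12 > 0
```

Since E_m(a) is a positive constant in this toy chain, A_n grows like n, which is the regime discussed next.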
Suppose A_n had the same asymptotic behavior as n. Then we would have

    L'_n(a) → +∞ ?

and our a < 0 could not be the MLE for n large (see the first or third drawings in our previous picture). To have L'_n of the same order as A_n we would need

    M_n = o(A_n).

To see under what conditions this would happen, consider the conditional variances

    V_m = E[(Y_m − E_m)² | F_m].

After some algebra, one can show that

    V_m ≤ C E_m

for a constant C depending only on K and a.
Let's call B_n = Σ_{m=0}^{n-1} V_m the process of "variations" of M_n. This process plays an important role in defining the behavior of M_n, since it is, somehow, the natural time scale for M_n. In particular, we can find in Neveu [4] that, almost surely, for each realization, either B_n converges and M_n then converges to a finite limit M_∞, or B_n → ∞ and then M_n/B_n → 0. By the bound we got on V_m we have that B_n ≤ C A_n, so that we conclude that, for almost all realizations,

    M_n / A_n → 0 on the set where A_n → ∞ and B_n → ∞.
If A_n → ∞ and B_n remains bounded, then M_n converges and again L'_n(a) = M_n + A_n → +∞. In either case M_n is negligible with respect to A_n, whence L'_n(a) behaves like A_n, and we can now write, with no question mark,

    L'_n(a) → +∞ a.s. on the set where A_n → ∞

(an analogous result states that L'_n(a) → −∞ a.s. if a > 0).
We already suggested an argument for showing that, if L'_n(a) → +∞, then a cannot be the MLE for a°.
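That argument is just the concavity of L_n at work: since each p_m is linear in a, L''_n ≤ 0, so wherever L'_n is strictly positive the maximum over I must lie strictly to the right. A minimal numeric check, with an arbitrary concave quadratic standing in for L_n:

```python
# Arbitrary concave stand-in for L_n on I = [-1, 1], maximized at 0.4.
def L(x):
    return -(x - 0.4) ** 2

def dL(x):
    return -2.0 * (x - 0.4)

grid = [k / 1000 for k in range(-1000, 1001)]
best = max(L(x) for x in grid)
argmax = min(x for x in grid if L(x) == best)   # smallest maximizer, as for the MLE

a = -0.5
assert dL(a) > 0       # positive slope at a < 0 ...
assert argmax > a      # ... so the maximizer lies strictly to the right of a
print(argmax)          # 0.4
```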
This suggests the following theorem, whose proof can be found in [1].

THEOREM: Except for a negligible set of realizations, if the sequence of MLE's has an accumulation point a* ≠ a° = 0, then A_n(a*) remains bounded as n → ∞.

Corollary: Under the conditions of the theorem, α_n → a° almost surely on the set of realizations where A_n(a) → ∞ for every a ≠ a° in I.
Some other results can be proven using our knowledge of the limit behaviour of L'_n. The most important seems to be the following.

Proposition: α_n → a* almost surely, where a* may depend on the realization.
APPLICATION:

We can now apply the results we obtained to the example that motivated this work, a Markov Chain with transition probabilities p_ij(a, u). Let's assume

A4: u_n, the control in force after X_n has been observed, is of the adaptive form u_n = φ(α_n, X_n), and p_ij(a, φ(a, i)) is continuous in a, for every i, j ∈ S.

Proposition: Under A1 to A4, and except for a negligible set of realizations, if the sequence of MLE's has an accumulation point a* ≠ a° = 0, then

    p_ij(a*, φ(a*, i)) = p_ij(a°, φ(a*, i))

for every state i that is reached infinitely often and for every j ∈ S.

A5: The chain is
irreducible for all pairs (a, u), in the sense that for all i, j ∈ S there exist states i = i_0, i_1, ..., i_r = j such that p_{i_l i_{l+1}}(a, u) > 0 for l = 0, ..., r−1.

For a chain satisfying A5, Feller [5] shows that all states are reached infinitely often. This result carries over to our situation (using A3), and we obtain our final

Proposition: Under A1 to A5, and except for a negligible set of realizations, α_n → a*, where a* satisfies p_ij(a*, φ(a*, i)) = p_ij(a°, φ(a*, i)) for every i, j ∈ S.
This last result can be phrased as saying that a* is such that, if we were to take the control U(i) = φ(a*, i), depending only on the present state of the chain, then a° and a* would be indistinguishable, since they would produce equal values of all the p_ij's. This characterizes the set of all possible limit points a* of the sequence of maximum likelihood estimators.
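To make this characterization concrete, here is a hypothetical two-state instance (the parametrization p_i1(a, u) = 0.5 + a·q(i, u) and the control map φ are invented for the sketch). Scanning a grid for the values a* that produce the same transition probabilities as a° under the control φ(a*, ·) shows the typical outcome: besides a° itself, a whole range of "false" values survives, because the control those values induce switches off the informative transitions:

```python
def q(i, u):
    """Hypothetical sensitivity of p_i1 to a; the control u = 1 switches it off."""
    return 0.2 if u == 0 else 0.0

def p_i1(a, u, i):
    """Hypothetical P(next = 1 | state i, control u), linear in a."""
    return 0.5 + a * q(i, u)

def phi(a, i):
    """Hypothetical control map: use u = 1 whenever the estimate exceeds 0.5."""
    return 1 if a > 0.5 else 0

a_true = 0.0
grid = [k / 100 for k in range(101)]            # candidate a* in [0, 1]

# a* is a possible limit point iff p_ij(a*, phi(a*, i)) = p_ij(a°, phi(a*, i))
# for all i, j; with two states it suffices to compare j = 1.
candidates = [a for a in grid
              if all(p_i1(a, phi(a, i), i) == p_i1(a_true, phi(a, i), i)
                     for i in (0, 1))]

print(candidates[0], candidates[1], candidates[-1], len(candidates))
```

Here only a* = a° = 0 and every a* > 0.5 survive: under the control induced by a large estimate, the chain no longer reveals a, which is exactly the indistinguishability described above.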
REFERENCES:

[1] B. Sagalovsky, "Adaptive Control and Parameter Estimation in Markov Chains: a Linear Case", Internal Report USB/79/12, Univ. Simón Bolívar, Caracas, 1979. To appear, IEEE Trans. on Aut. Control, 1981.

[2] P. Mandl, "Estimation and Control in Markov Chains", Adv. Appl. Prob., 6, 40-60, 1974.

[3] V. Borkar, P. Varaiya, "Adaptive Control of Markov Chains, I: Finite Parameter Set", IEEE Trans. on Aut. Control, AC-24, 953-958, 1979.

[4] J. Neveu, "Discrete-Parameter Martingales", North-Holland Pub. Co., 1975.

[5] W. Feller, "An Introduction to Probability Theory and its Applications", 3rd edition, Vol. 1, Wiley, 1968.

ACKNOWLEDGEMENT:
Besides Professor P. Varaiya, who suggested the problem, and the Statistics Group at Universidad Simón Bolívar, which provided suggestions for alternative