• Aucun résultat trouvé

Limit Theorems for Stochastic Approximations Algorithms With Application to General Urn Models

N/A
N/A
Protected

Academic year: 2021

Partager "Limit Theorems for Stochastic Approximations Algorithms With Application to General Urn Models"

Copied!
26
0
0

Texte intégral

(1)

HAL Id: hal-01726014

https://hal.archives-ouvertes.fr/hal-01726014v3

Preprint submitted on 7 May 2018

HAL

is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.

L’archive ouverte pluridisciplinaire

HAL, est

destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.

Limit Theorems for Stochastic Approximations Algorithms With Application to General Urn Models

Soumaya Idriss, Nabil Lasmar

To cite this version:

Soumaya Idriss, Nabil Lasmar. Limit Theorems for Stochastic Approximations Algorithms With

Application to General Urn Models. 2018. �hal-01726014v3�

(2)

Limit Theorems for Stochastic Approximations Algorithms With Application to General Urn Models.

Soumaya Idriss

, Nabil Lasmar

May 7, 2018

Abstract

In the present work we study the multidimensional stochastic approximation algorithms where the drift function his a smooth function and where jacobian matrix is diagonalizable over Cbut assuming that all the eigenvalues of this matrix are in the the region Repzq ą0. We give results on the fluctuation of the process around the stable equilibrium point ofh. We extend the limit Theorem of the one dimensional Robin’s Monroe algorithm [MR73]. We give also application of these limit Theorem for some class of urn models proving the efficiency of this method.

1 Introduction

1.1 The Stochastic Approximation Algorithms

A stochastic approximation algorithm (SAA) is a sequencepXnq0of anRd´valued random vectorpdě1q, defined on some probability space`

Ω,F, P˘

by the following recursion:

Xn`1Xn`1{γn`1hpXnq (1)

where pγnq1 is a real sequence and h : Rd ÝÑ Rd is called a drift function. The one dimensional (SAA) was first introduced by Robbins and Monroe [RM51] where they consideredγn as a sequence of positive constants such that ÿ

ně1

1{γn “ 8 and ÿ

ně1

1{γn2 ă 8, they also viewed h as a monotonous function. Their aim was to find the zero θ of the function h which was assumed to beă 0 for xăθ and hpxq ą 0 for xą θ. Blum [Blu54] proved the almost sure convergence ofXn toθ under suitable conditions. Several studies on (SAA) were exhibited, most of them deal with the fluctuation of the stochastic approximation procedure around the equilibrium point. For instance, Chung who was the first to provide a central limit Theorem for the sequenceXn´θ[Chu54] resorting to the method of moments, whereXn is a real valued process. Due to some complications arising from the us of the moments’ method, Chung’s work was simplified by both Sacks [Sac58]and Fabian [Fab68] and was generalized to a multidimensional version of CLT. The standard adapted conditions state that the functionhshould be a smooth function and that all eigenvalues of∇hpθqare to be with real partsě 1

2. The case where those eigenvalues are with real partsă 1

2 was investigated by Major and R´ev´esz [MR73] in the one dimensional case, following Blums’ [Blu54]

conditions and using Sack’s ideas. In this paper, relying on Renluund’s one dimensional version of the (SAA), we exhibit a multidimensional (SAA) in order to provide thed-dimensional extensions of the limit Theorems found in [MR73] and that by relaxing some of the conditions required in [Fab68, Ren11].

Definition 1. A multivariate stochastic approximation algorithmpZnqně0is a stochastic process on some probability spacepΩ,F, Pqtaking values in the cube r0,1sd and adapted to a filtration pFnqně0 and satisfying

Zn`1Zn`1{γn`1`

hpZnq `Yn`1˘

, (2)

whereYn, γn PFn, h:r0,1sdÝÑRd and the following conditions holds almost surely:

pS1qc{nď1{γnďcu{n, pS2q›

›Yn

2ďKY, pS3q›

›hpZnq›

2ďKf, pS4q›

›E`

γn`1Yn`1

ˇ ˇFn˘

}2ďKe{n2 wherec, cu, Kf, Ke, Ky are positive constants and denoting be} }2 the Euclidian norm inRd.

Département des Mathématiques, Faculté des Sciences de Monastir, Monastir, Tunisia (maysoun-@hotmail.fr).

Département des Mathématiques, Institut Préparatoire aux Études d’Ingénieur, Monastir, Tunisia (nabillasmar@yahoo.fr).

(3)

The processpYn`1qně0 called the noise of (SAA) and is broadly assumed to be a martingale difference sequence (meaningE`

Yn`1ˇ ˇFn˘

“0), that isYn`1 is uncorrelated with the past of the process.

Among methods to find zeros of the drift functionhis the ordinary differential method (o.d.e method). This method consists to find the stable equilibrium points of the ordinary differential equation

x1hpxptqq

Recall that the Euler method (deterministic) for numerical approximation of the trajectory ofx1ptq “hpxptqqwould be establishing the following discrimination

Zn`1Zn`1{γhpZnq

withγą0. The difference with the stochastic algorithms is that the time scaleγis replaced by a time varying step size γn`1, in addition, the presence of the noiseYn`1. In the literature we found criteria of the existence of limit points ofpZnq using the differential equation method. Theses methods impose conditions on h,γn, for example we have a version of almost sure convergence given by [Duf97, LP13b]

Theorem 1. Consider the stochastic approximation algorithm pZnqně0 on r0,1sd Zn`1Zn`1{γn`1

´

hpZnq `Yn`1

¯

where h : Rd ÝÑ Rd differentiable function, pYn`1qně0 is a martingale difference with respect with the filtration pFnqně0, pγnqně0 is a sequence of positive random variables pFnq ´measurable and satisfying the constraining conditions

ÿ

0

1{γn“ 8and ÿ

0

1{γn2 ă 8almost surely. (3)

Then the set of adherence valuesΘ8 of pZnqně0 is a connected compact, and left stable by the flow of the ordinary differential equationΘ9 “ ´hpΘq.

Furthermore ifΘ˚ is a uniformly stable equilibrium onΘ8 then ZnÝÑΘ˚ almost surely.

1.2 General urn Models

The general Pólya urn model (GPU) is a discrete time process with reinforcement defined as follow: an urn containing initially say T0 balls of different colors, fix the number of colors to be an integerdě2. The (GPU) amounts to drawing asampleofmě1 balls from the urn at each discrete epoch of time , that is amongmsampled balls, one hasξpiq balls of colori, iP t1, . . . , du. At thenthdraw we observe the sample and then put back the balls in the urn according to a replacement rule, determined by a mappingRdefined on the simplex

Σpdqm “ pv1, . . . , vdq PNd,

d

ÿ

j“1

vjm(

and taking values inZd(meaning that the balls are to be added as well as to be removed). To ensure the durability of the process, that is regardless of the substraction of balls the urn is never empty. In other terms, the process never dies, we impose conditions (generally sufficient),on the replacement rule and on the initial composition of the urn calledtenability conditions. See for example [KM16] for the concept of tenability for linear urn models.

Let us denote by ζn “ pξnp1q, . . . , ξpdqn qthe vector composition of the drawing balls at the stagen, each component, i say, correspond to the number of drawn balls of color i. We denote also by Mn “ pXnp1q, . . . , Xnpdqq the vector composition of the urn, and byTn the total number of balls in the urn, both afterndraws. Therefore the evolution of the urn is determined by the following recursion:

Mn`1Mn`Rpζn`1q. (4)

IfpFnqně0designate theσ-field generated by the firstndraws then, givenpFnq,ζn follows a multivariate hyperge- ometric distribution. Note that in the with-replacement case, one has

P`

ζn`1“ pv1, . . . , vdqˇ ˇFn˘

“ 1

`Tn

m

˘

d

ź

i“1

ˆMnpiq

vi

˙ ,

(4)

while in the without-replacement case, P`

ζn`1“ pv1, . . . , vdqˇ ˇFn˘

ˆ m

v1, . . . , vd

˙ d

ź

i“1

`Znpiq˘vi

,

whereZnpiq:“ Mnpiq

Tn refers to the proportion of thei-thcolor in the urn afterndraws.

Several studies on one draw urn models used the method of Athreya and Karlin [AK68] and this method consist to embed the process into a continuous time Markov branching process. This process is determined by the same data of the urn and each ball of typeilives an exponentially distributed with mean 1 and when it dies it is replaced by rij balls of typej. Janson [Jan04] developed this method for multi-type urn model when the replacement matrix is irreducible, that is every color is dominant [Jan04] in the following sense: it is possible to find balls of any type in the urn beginning with balls of an appropriate type.

1.3 A General Urn Model Viewed as a Stochastic Approximation Algorithm

Among recursive model with reinforcement we can adapt stochastic approximation algorithm to urn model by consideringZnas the normalized vector proportion of the number of balls of each color afterndraws divided by the total number of balls. For the multicolored urn with replacement mappingRthe proportion of ballsZn is solution of the following (SAA) [LMS16]

Zn`1Zn`1{Tn`1phpZnq `∆Mn`1`ǫn`1q where the drift functionhis given by

hpxq “ ÿ

νPΣpdqn

ˆ m

ν1, . . . , νd

˙ d

ź

i“1

xνiipRpvq ´rpvqxq

with ∆Mn`1Yn`1´E` Yn`1

ˇ ˇFn˘

whereYn`1Rpζn`1´rpζn`1qqZn.

The error termǫn`1 vanishes if the urn is with replacement but in any cases it has an order of O`

1{Tn`1˘ . The tenability conditions dos not always guarantee the assumption 0ăc1ďTn{nďc2. For example the two colors urn with replacement matrixR

ˆ ´1 2 1 ´2

˙

with initial compositionp1,0qis tenable and we haveTn “Op1q. For that we impose additional assumptions such as lim infTn{ną0. The last condition is unaffectedly realized if we suppose for example that}Rpvq}1:“

d

ÿ

j“1

Rjpvq ě1 for everyvpdqm.

Higueras et al [HMPM03, HMPM06] showed that the urn composition can be written as a (SAA) under some extra assumptions including the balance condition. Laruelle and Pages [LP13b, LP13a]studied the response-adaptive randomized process in clinical trials based on the randomized urn model yet studied by Bai and Hu[BH99, BH05].

Noticing that such model evolves with a time-depend replacement rule but converging almost surely to a stochastic matrix. Moreover in Laruelle and Pages in [LP13a] investigated the weighted urn witch the drawing rule is no longer uniform among the balls of the urn but the conditional probability of drawing a ball at timenis an empirical ratio of a function of the proportions of each color and this function is chosen generally with regular variation. They derives the almost sure convergence and the asymptotic normality of the vector composition of ball by allying stochastic approximations corresponding Theorems [Duf97]. In order to reduce assumption in [LP13b] Zhang [Zha16] presented a new central limit theorem for (SAA) by imposing assumptions on the error term. Renlund [Ren10, Ren11] applied results of one dimensional stochastic algorithms to the two colors urn with one or two draws by reducing Fabian’s limit Theorems [Fab68] into one dimensional process.

1.4 Main Assumptions

Our basic assumptions are as follows:

(A1) hadmits an unique stable equilibrium point ΘP r0,1sd such that}Θ}1“1.

(A2) his smooth and all the eigenvalues of∇hpΘqare within the region Repzq ą0( .

(5)

(A3) There exist a zero Θ ofhsuch thatZnÝÑΘ almost surely and there exist a realγą0 such that n{γn“1{γ`O`

}Zn´Θ}2`1{n˘

. (5)

We shall also impose the following additional assumptions on the martingale differencepYnq0

(A4) The Lindeberg’s condition: for everyǫą0

nÝÑ8lim E`

}Yn}221}Yn}2ąǫ˘

“0.

(A5) There exist a deterministic matrix ΣY ‰0 such that lim

nÝÑ8E` YnYn1ˇ

ˇFn´1˘

“ΣY.

The Assumption (A3) has been used by Renlund [Ren11] and this assumption is satisfied by manly urn models as balanced urn. Another example is the following when the replacement mapping is an invertible matrix (and in particular irreducible).

1.4.1 Example

Assume that R is a dˆdinvertible matrix. Then by the relation 1 we obtain R´1Mn`1´R´1Mnζn`1. If ZnMn

Tn converges to Θ then,

d

ÿ

j“1

R´1Zn

i“ p1{Tnq

d

ÿ

j“1

R´1Z0

i`mn{Tn

wheremě1 represents the number of drawn balls. Hence mn{Tn´

d

ÿ

j“1

R´1Θ‰

i

d

ÿ

j“1

R´1`

Zn´Θ˘‰

i`O` 1{n˘

.

The condition (A1) dismiss the case of the convergence of the process to a non equilibrium point: If the conditions of Theorem 1 are satisfied and Θ is the unique stable zero of hthen most Theorems of the limit in distribution give results on the fluctuation ofZn´Θ. For the general urn models, the set of equilibrium points, if it exists, is generally known. In the next example that, we present an urn models for which we compute the set of stable zeros of the drift functionh.

1.4.2 Example

Consider an irreducible and multicolored urn with replacement matrixR“ prijqdi,j“1 such that forij we have rijě0. Here irreducibility of the urn refers to the replacement matrix being irreducible [Jan04], that is every color is a dominant one in the following sense: there existsnPN˚ such that aftern draws, all the colors are accounted for in the urn. Perron Froebenius theory states that for largeα, since the matrixR`αIdis positive,Rhas a largest real eigenvalueλ1 such that, for all λPsppRqztλ1u, we have Repλq ăλ1.We assume in the lines of this example that λ1ą0.The drift function of the associated stochastic process is given by hpXq “RX´ |RX|1X where|X|1 is the trace of the vectorX.

(a) Ifλ1is simple and ifV1denotes an eigenvector associated toλ1, with positive entries. It is easy to show that Θ“ V1

|V1|1 is a zero of hand is an isolated point in the set of zeros ofh belonging in the affine hyperplane H1x“ px1, . . . , xdq PRd, x1`. . .`xd“1u.

Regardless of the other zeros ofh, we prove that Θ is the unique stable equilibrium point ofh. In fact, if we set A“∇hpΘqthenAH“ pR´λ1IdqH´ |RH|1Θ. In particular´λ1 is an eigenvalue ofA with associated eigenvector´Θ.Let us writeRd“RΘ‘E1whereE1is stable byAand letX PE1an eigenvector of A associated to an eigenvalueµ‰ ´λ1. ThenpR´ pλ1`µqIdqX “0. Hence we haveµ`λ1 PsppRqztλ1u. Therefore, Repµq ă0.Since the set of stable zeros ofhis connected, the result follows.

(6)

(b) If λ1 is multiple, the set of zeros of hlocated onH1 defined above isZphq “kerpR´λ1Idq XH1. We show here that Zphqis exactly the set of the stable equilibrium of h: For fixed Θ “ pθ1, . . . , θdq PZphq we have

hpΘq “R´λ1Id´QΘwhereQΘis the matrix with rank 1 with entries“ QΘ

ijθi|Rej|1wherepe1, . . . , edq denotes the canonical basis ofRd. From the relation

d

ÿ

i“1

|Rei|1θiλ1 we deduce that Θ is an eigenvector of QΘ associate to λ1. Let PΘ a transition matrix ofQΘ, then the projection ofPΘ´1hpΘqPΘ on kerpQΘq is equal toR1´λ1I1 whereR1 is thepd´1q ˆ pd´1qmatrix given byPΘ´1RPΘ

ˆ λ1 0 0 R1

˙ . It is clear by this transformation that all the eigenvalues of∇hpΘqare with real partă0.

1.5 Notations

For x“ pxiqi“1,...,d P Rd, }x}2 denotes the canonical Euclidian norm of the vector x, }x}1

d

ÿ

i“1

xi the trace of x. We also use the notation }A}2 for an algebraic norm of the matrix AP MdpRq. sppAq defines the set of the eigenvalues ofA. For a differentiable functionh“ ph1, . . . , hdq, we denote by∇hpΘqits jacobian matrix with entries pB{BxjhipΘqqdi,j“1 where Θ is a zero ofh.

The matrix Γ“1{γhpΘq and its eigenvalues will play a essential role in this paper. The eigenvalues of ∇hpΘq are in the following order

Repλ1q ě. . .Repλr1q ěγ{2ąRepλr1`1q ě. . .Repλrq. since∇hpθqis assumed o be diagonalizable overCwe obtain the decomposition

Cd“ ‘λPspphpΘqqkerp∇hpΘq ´λIdq.

Let for λ P spp∇hpΘq, spλq be the dimension of kerp∇hpΘq ´ λIdq and V1pλq, . . . , Vsλpλq the eigenvectors of

hpΘq generating the eigenspace kerp∇hpΘq ´λIdq. Define the corresponding real subspace Fλ as follow: Fλ “ kerp∇hpΘq ´λIdqif this latter eigenspace is real and we setv1pλq “V1pλq, . . . vsλpλq “Vsλpλq. Otherwise we let Fλthe subspace of Rd generated by the family of vectors pvipλqq2si“1λ as follow:

v2ipλq “1{2`

Vipλq `Vipλ

andv2i´1pλq “1{2i`

Vipλq ´Vipλ.

We have a decomposition ofRd into a direct sum

Rd“ ‘λPspphpΘqqFλ

and the projection of ∇hpΘq in the new basis is real and diagonal by blocks . Let P1 the transition matrix of

hpΘqcorresponding to this decomposition. Then P1´1ΓP1is diagonal by blocks in the following form P1´1ΓP1

¨

˚

˝ Γpλ1q

. ..

Γpλrq

˛

We let π1 and π2 to be the matrices of the projection on ‘λ:Repλqěγ{2Fλ and ‘λ:Repλqăγ{2Fλ respectively, Γ1π1P1´1ΓP1 and Γ2π2P1´1ΓP1. ThenP1´1ΓP1

ˆ Γ1 0 0 Γ2

˙

and by this definitionsp1q “ tλ1{γ, . . . , λr1{γu andsp2q “ tλr1`1{γ, . . . , λr{γu.

We associate to Γ1 (respectively Γ2) the transition matrixQ1 (respectively Q2) that diagonalizes Γ1 (respectively Γ2).

For later use, we define the sets

Λ1“ pi, jq P r1, ds2, Repλiq ąγ{2 and Repλjq ąγ{2 or if Repλiq “γ{2 andλjλithenrYP1sij “0u, Λ2“ pi, jq P r1, ds2, Repλiq “γ{2 andλjλiandrPΣYP1sij ‰0u

and

Λ3“ pi, jq P r1, ds2, Repλiq ăγ{2 and Repλjq ăγ{2u.

(7)

Further we define the matrix ΣPMdpRqby the entries Σij “ rQ´11 π1P1´1ΣYP11´1π1Q11´1sij

γpλi`λj´γq 1pi,jqPΛ1` 1

γ2rQ´11 π1P1´1ΣYP11´1π1Q11´1sij1pi,jqPΛ1

`rQ´12 π2P1´1ΣYP11´1π2Q12´1sij

γpγ´λi´λjq 1pi,jqPΛ3

`i, jP t1, . . . , du˘ .

Thus if Λ3“ HthenQ1P andP1Id. Otherwise if Λ12“ HthenQ2P andP1Id.

1.6 Aim of the Paper

In the present work we give some theoretical results concerning the stochastic approximation algorithms following the definition 1 and satisfying the assumptions (A1)-(A5). Our motivation for the central limit Theorems is Sacks Theorem for triangular arrays [Sac58]. For the refinement of the limit Theorems we are motivated by Major’s Theorems for one dimensional stochastic approximation algorithm [MR73]. These results find numerous applications in the field of randomized models with reinforcement. We improve results on pólya urns such as balanced Pólya process with irreducible replacement rule, the two colors urn with multiple drawing and refinement of limit theorems for cyclic urns.

1.7 Organization of the Paper

The paper is organized as follow: The section 2 we give the main results concerning the limit Theorems for stochastic approximation algorithms.

In section 3 deals with application to two types of urn models: the balanced Pólya process and the two colors urn with multiple drawing.

The section 4 is a preliminary section and the sections 5 and 6 are reserved to the proofs of main results.

Finally we complete some examples on urns models.

2 Main Results

The following Theorem is an adaptation of the central limit Theorem for (SAA) of Sacks [Sac58]. The adaptation consist in allowingγn to be random instead of deterministic (A{n for some positive constant A). We also relax Fabian’s central limit Theorem [Fab68] of the orthogonality of transition matrixP and we do no need the positivity of Γ.

Theorem 2. LepZnqně0 be a stochastic approximation algorithm according to definition 1. Assume that (A1)-(A5) holds.

1. IfRepλminq ěγ{2andΛ2“ Hthen

?n`

Zn´Θ˘ D ÝÑN`

0, PΣP1˘ .

2. IfΛ2‰ H then

c n lnpnq

`Zn´Θ˘ D ÝÑN`

0, PΣP1˘ .

Theorem 3. Consider the stochastic approximation algorithm in definition 1. Assume that (A1),(A2) and(A3) hold. Suppose furtherΛ12“ H.

1. If eitherhis linear orRepλminq ą1{2Repλmaxq, there exists a random variableZ1 inRd such that elnpnqΓ`

Zn´Θ˘

ÝÑZ1almost surely.

2. If for some j “ 1, . . . , r´1 we have 1{2Repλj`1q ăRepλminq ď 1{2Repλjq, there exists a random variable Zj P t0d´sju ˆRsj wheresj denotes the dimension of the subspacerℓ“j`1ker`

hpΘq ´λId

˘and such that for everyβ ďRepλminqwe have almost surely

Sn;jpβq`

Zn´Θ˘

ÝÑZj (6)

(8)

where Sn,jpβqisdˆdmatrix given by the relation Sn,jpβqP1

¨

˚

˚

˚

˚

˚

˚

˚

˚

˝ nβ

. ..

nβ

elnpnqΓj`1q . ..

elnpnqΓrq

˛

.

In the next Theorem we extend the results of Theorem 3 by giving a limit in distribution ofZn´Θ in the case when Repλminq ą1{2Repλmaxq. We put forNě1,

L`“!

pP r1, ds: @“1, . . . , N, @i1, . . . , iℓ`1P r1, ds, ÿ

1

kλp{γ´pλi1`...`λiℓ`1q{γ`

1{γk`1´1{γk˘ ă 8)

. ForpPL`and anypi1, . . . , iℓ`1q P r1, dsℓ`1 we define ϕppi1, . . . , iℓ`1q “ ÿ

kě1

kλp{γ´pλi1`...`λiℓ`1q{γ`

1{γk`1´1{γk˘ and

Dppλi1, . . . , λi`1q{γq “diag`

ϕ1pi1, . . . , iℓ`1q11PL`, . . . , ϕdpi1, . . . , iℓ`1

1dPL`

˘.

Denote by D` “ pλ1{γ11PL`, . . . , λd{γ1dPL`q and D´P´1ΓP ´D`. Further, we set Γ`P D`P´1 and Γ´P D´P´1. For a complex number β we set, if it exists, pďq such that Repλpq ąRepβqor Repβq ąRepλqq and ifβPspp∇hpΘqqtakeβλp`1. . .λ1.

Theorem 4. Suppose that the stochastic approximation algorithms pZnq0 given by the definition 1 satisfies the assumptions (A1)-(A5) and such that Repλmaxq ă γ{2. Suppose further either h is linear or Repλminq ą Repλmaxq{2.There exists a pair of random variables Z3,Z2PRd such that

?n´

Zn´Θ´e´lnpnqΓZ3´SnpN,Γ,Z1D ÝÑN´

0, PΣP1¯ whereN ětRepλmaxq{Repλminqu`1 andSnpN,Γ,Z1qis given by the following sum SnpN,Γ,Z1q “

N

ÿ

ℓ“1

1{ℓ! ÿ

i1,...,iℓ`1

Wi1. . . Wiℓ`1PDppλi1`. . .`λiℓ`1q{γqP´1DhpΘqpVi1, . . . , ViqViℓ`1

`

N

ÿ

ℓ“2

1 γℓ!

d

ÿ

i1,...,i“1

Wi1. . . WiDnppλi1`. . .`λiℓq{γ,ΓqH

i1{γ`...`λiℓ{γqpΓqDhpΘq`

Vi1, . . . , Vi

˘.

HereW1, . . . , Wdare the complex components of the vectorZ1in the basis of eigenvectorspV1, . . . , Vdq. The matrices HβpΓqandDnpβ,Γq pβ PCqare given by

P´1HβpΓqP “diag`

ζp1`β´λ1q, . . . , ζp1`β´λpq,1, . . . ,1,1{pλq´βq, . . . ,1{pλd´βand

P´1Dnpβ,ΓqP “diag`

n´λ1, . . . , n´λp, n´βplnpnq `γq, . . . , n´βplnpnq `γq, n´β, . . . , n´β˘ . Recall that ζ denotes the Zeta Riemann function andγ is the Euler gamma constant.

In what follows we present a limit Theorem for the stochastic approximation algorithm pZnq0 regardless of where the spectrum of Γ is located. We set σ “ maxtRepλq, λ P spp∇hpΘqq and Repλq ă γ{2u and N “ tσ{Repλminqu`1.

Theorem 5. Consider the stochastic approximation algorithm pZnq0 mentioned in definition 1. Suppose that assumptions (A1)-(A5) are satisfied. Suppose furtherΛ2“ H. IfRepλminq ąσ{2then there exists a pair of random variables Zp2q

1 andZp2q

2 Pπ2Rd such that

?n`

Zn´Θ´P1

´0, e´lnpnqΓ2Z2p2q´e´lnpnqΓ2SnpN,Γ2,Z1p2q1

˘ D ÝÑN`

0, P1ΣP11˘ .

(9)

3 Application to Urn Models

3.1 Multiple Color Balanced Pólya Process.

Consider a d-color urn for an integerdě2. At each discrete time step, a ball is drawn (uniformly), observe its color and then put it back in the urn together with new balls according to a replacement rule given by thedˆdmatrix R“`

rij˘

1ďijďd.Denoting byMn“`

Mnp1q, . . . , Mnpdq˘1

the vector composition of the urn afternexperiments with M0corresponding to the initial composition.

We assume that the entriesrij ě0 for ij and we allow to diagonal entries to be negative (meaning that we can remove balls from the urn), provided that the urn in question is tenable.

This kind of urn model was investigated by many different authors with different approaches. For example, Janson [Jan04] Mailler[Mai14] and Pouyanne [Pou08] by an algebraic approach and the theory of branching processes, Pages [LP13b] by using the stochastic approximation methods. The two-color balanced urn model was studied by Flajolet [FGP05] relying on analytic methods.

In the present section we assume that the urn is irreducible and balanced with balance S ě 1. By the Perron- Frobenius theory of positive matricesR`AId, for a large positive constantA, has a large real eigenvalueλ1 and all the other eigenvalues are with real partsďλ1. Using the terminology in [Pou08, JP16] we say that the Pólya urn islargeifλ1is simple and all other eigenvalues satisfy Repλq ě1{2Repλ1q. The urn is said to besmallifλ1 is simple and for all iě2, Repλiq ď 1{2Repλ1q. We say that the process is critically small if it is small and there exists an eigenvalueλwith real part“λ1{2.

Theorem 6. Suppose that the urn is irreducible andR is diagonalizable overC.

1. If the urn is small butΛ2“ H, then

Mn´nSV1

?n

ÝÑD N`

0, P1ΣP11

˘

where

Σ“

ˆ 1{S2π1P1´1ΣYP1´1π11 0 0 Q1WQ11

˙

(7) andW is the complex matrix with entries given by

rWskℓ“ rQ´11π2P1´1ΣYP11´1π21Q11´1skℓ

SpS´λk´λq p2ďk, ℓďdq and with ΣYR`

1{SId´1{S21b1˘

R1 where 1b1is the matrix with entries equal to 1.

2. If the urn is small butΛ2‰ Hthen 1 anlnpnq

`Mn´nSV1˘ D ÝÑN`

0, P1ΣP11˘

whereΣis the matrix with ranksď2CardpΛ2qwith entriesΣij“1{S2rQ´11 π2P1´1ΣYP11´1π12Q11´1sij1ppi,jqPΛ2q. 3. If the urn is large then there exist a random variableZ such thate1{SlnpnqR`

Mn´nSV1

˘ÝÑZ almost surely.

Furthermore we have the limit in distribution

?1 n

`Mn´nSV1´e1{SlnpnqRD ÝÑN`

0, S2ΣQ12˘ where Σis the complex matrix

Σ“

¨

˚

˚

˝

π1P1´1ΣYP11´1π11 rπ1P1´1ΣYP11´1π31Q12´1sk

k rπ1P1´1ΣYP11´1π31Q13´1sk

k

rQ´21π3P1´1ΣYP11´1π31Q12´1sk

Spλk`λ´Sq

˛

. (8)

Noticing that the term e´1{SlnpnqRZ is the oscillating term since ifW1, . . . , Wd are the components ofZ is the basis of eigenvectorsV1, . . . , Vd we gete´1{SlnpnqRZ “

d

ÿ

k“1

n´λk{SWkVk.The next Theorem is a limit Theorem for balanced Pólya process.

(10)

Theorem 7. Suppose that the urn is irreducible, balanced and Ris diagonalizable over Cand thatΛ2“ H. There exist a vector random variableZ“`

0,0,Zp3q˘

such that

?1 n

´Mn´nSV1´P1

`0,0, e´lnpnqΓ2Z3˘¯ D ÝÑN`

0, P1ΣP11

˘.

3.2 Multiple drawing Non-Balanced Two-Color Urn Model

This model is defined as follow: Starting with a two-color urn, containing initiallyW0white balls andB0black balls such thatT0W0`B0ěmwithmis an integerě1. At each stage of the process, we drawwithout replacementm balls from the urn and count the number of the sampled white balls (saykwhite balls). Then, we return the balls in the urn witham´k white balls andbm´k black balls whereak, bk PZ, 0ďkďm. We denote the replacement rule by

R

¨

˚

˚

˚

˝

a0 b0

a1 b1

... ... am bm

˛

.

We designate by Wn and Bn the number of white and black balls after n draws. The total number of balls of both colors will be TnWn `Bn. The multiple drawing urn model has been originally introduced by Chen an Wei [CW05] with akcpm´kq and bkck where c is a positive integer. They proved by the martingale theory the almost sure convergence of Wn after suitable normalization to a positive random variable W8 and showing the absolute convergence ofW8. Chen and Kuba [CK13] gaves the moments of the random variableW8. Kuba and Sulzbach [KS15] generalized the model of Chen and Wei to a general two-color model under the affinity (EpWn`1

ˇ

ˇFnq “ anWn`bn) and the balance (ak `bkσ ě 1 0 ď k ď m) conditions. Kuba and Mahmoud [KMP13] studied the limit in distribution withakCpm´kqand bkCk where C PN˚ and in [KM17] under the affinity condition.

In this section we improve the results of [LMS16] for two color unbalanced urn proved using Renlund’s stochastic approximation Theorems [Ren11]. The major problem is to control the rate of convergence of theTn{nto its limit.

We state by giving the tenability conditions for the model with multiple drawingmě2 [KS15, LMS16].

Lemma 1. Consider the urn process with initial composition pW0, B0q1 and replacement function R. Assume that m ě 2 and we denote by a (respectively b) the greatest common divisor of ta1, . . . , am´1u p respectively tb1, . . . , bm´1uq. The urn is tenable if and only if for everykďm´1 we have,

akě ´a and bk ě ´b and the additional conditions

amPJ´m´a`1,´m˘

YJ´m,8KX!

P ´N, rW0s1P r´s1,`1s1, . . . ,rm`a´1s1() bmPJ´m´b`1,´m˘

YJ´m,8KX!

P ´N, rB0s2P r´s2,`1s2, . . . ,rm`b´1s2() where for all integerℓ, rs1 prespectivelyrs2q denotes the remainder of the division ofℓ bya prespectivelybq. Theorem 8. Assume that the tenability conditions are satisfied and that lim infTn{n ą 0. Suppose there exists θP r0,1ssuch that the proportion of white balls, denoted byZn converges almost surely toθ.

Let w

m

ÿ

k“0

ˆm k

˙

θkp1´θqm´kpam´k`bm´kq, λ“ ´h1pθq

w andσ2Hpθq w2 where

Hpxq “

m

ÿ

k“0

ˆm k

˙

xkp1´xqm´kpam´k´xbm´kq2. and

hpxq “

m

ÿ

k“0

ˆm k

˙

`am´k´xpam´k`bm´k

xkp1´xqm´k Assuming that am‰0 ifθ“0 andb0a0 if θ“1. Then

(11)

paq if λą1{2then

?npZn´θqÝÑD N ˆ

0, σ2 2λ´1

˙ ,

pbq if λ“1{2then

c n

lnpnqpZn´θqÝÑD Np0, σ2q,

pcq if λă1{2then there exists a random variable Z such that nλpZn´θq ÝÑZ almost surely. Also we have

?n

˜

Zn´θ´n´λh1pθq

m`1

ÿ

ℓ“2

hpℓqp1q

p´1qn´ℓλZ

¸ ÝÑD N

ˆ 0, σ2

1´2λ

˙ .

4 Preliminaries

The following lemma is a simple version of Chung’s lemma [Chu54, MR73]

Lemma 2. Let bn be a sequence of positive numbers such that the following recursion holds bn`1ď´

c{n¯

bn`V{n1`p

with c, pą0 andV ą0. Ifcąpthenbn “O` 1{np˘

and ifp“1 withcă1then bn“O` 1{nc˘

.

Lemma 3. Let M PMdpRqbe a diagonalizable matrix over C.IfPm,n denotes the product

n

ź

j“m

´Id´1 jM¯

then

Pm,ne´lnpm{nqM´

Id`O`1 m

˘¯

. (9)

Proof. LetP be a transition matrix ofM withP´1M P“diag`

λ1, . . . , λd˘

.Applying the relations 1´λ

je´λ{j`Op1{j2q

n

ÿ

j“m

1 j “ln`

n{m˘

`O` 1{m˘

and

n

ÿ

j“m

1

j2 “Op1{mq

we obtain

Pm,nPdiag`

n

ź

j“m

´1´λ j

¯

,ďd˘

P´1Pdiag`

e´λlnpm{nq` 1`O`

p1 m

˘˘˘P´1

Pdiag`

e´λlnpm{nq˘ P´1`

1`O` p1

m

˘˘“exp`

´lnpm{nqM˘`

1`O` 1{m˘˘

.

Lemma 4. If M P MdpRq is an invertible matrix and diagonalizable over C. P denotes a transition matrix of M such that P´1M P “diag`

λ1, . . . , λd˘

such that 1 ąRepλ1q ě. . . ěRepλdq. For a complex number β we let, if it exists, a pair of positive integers pďq such that Repλpq ąRepβq or Repβq ąRepλqq and ifβ PsppMq take βλp`1. . .λ1. Then

n

ÿ

j“1

1

jβ`1elnpj{nqM “DnpβqHβ`

1`Op1{n

“HβpMqDnpβ, Mq`

1`Op1{n

(10) whereDnpβ, MqandHpβ, Mqare already defined in Theorem 4.

The next lemma is a version of the Bauer-Fike Theorem for matrices. This Theorem gives results on the perturbation of the spectrum of a diagonalizable complex matrices. In our current work we deal only with hermitian matrices.

Références

Documents relatifs

Furthermore, because the process has evolved for a long time, we expect the density of particles at time t to follow approximately (2.10). The key to proving Theorem 1.5 is to

We give two main results, which are necessary tools for the study of stability in section 3: we prove uniqueness, up to an additive constant, of the optimal transport

In this example, at each step, one ball is removed from the urn, then black and white balls are added, b in total (with b ≥ 2), the number of white balls following a distribution µ

[25] (generalized by [23]) studied partial sums and sample variance of a possibly nonzero mean stochastic volatility process with infinite variance and where the volatility is

In the sequel, we will build again the spectral measure for invertible transformation and reversible Markov chain and then apply to Ergodic theorem and Central limit theorem..

2012 ACM Subject Classification Mathematics of computing → Enumeration, Mathematics of computing → Generating functions, Mathematics of computing → Distribution functions, Math-

Each time a ball is drawn, the types of balls which are added or removed are random variables, whose distribution depends on the type of the ball that has been picked: see for

Keywords : Markov additive process, central limit theorems, Berry-Esseen bound, Edge- worth expansion, spectral method, ρ-mixing, M -estimator..