• Aucun résultat trouvé

ON LARGE DEVIATIONS LOWER BOUNDS FOR SIMULTANEOUSLY IRREDUCIBLE FELLER PROCESSES

N/A
N/A
Protected

Academic year: 2022

Partager "ON LARGE DEVIATIONS LOWER BOUNDS FOR SIMULTANEOUSLY IRREDUCIBLE FELLER PROCESSES"

Copied!
11
0
0

Texte intégral

(1)

FOR SIMULTANEOUSLY IRREDUCIBLE FELLER PROCESSES

MIOARA BUICULESCU

Communicated by Marius Iosifescu

Simultaneously irreducible Markov processes, previously studied in [10] and [15], are considered. It is shown that an m-irreducible process is either m- simultaneously irreducible or there exists a strong singularity of the semigroup of the process with respect to m. Form-simultaneously irreducible Feller pro- cesses, using the skeleton method, large deviations lower bounds results are derived with a rate that takes into account the modications performed in [13, 14] and [16, 17] to the classical rate. The particular case of Feller processes with fast explosion studied in [5, 11, 13, 14] is also considered.

AMS 2010 Subject Classication: 60F10, 60J25.

Key words: simultaneous irreducibility, large deviations lower bounds, Feller processes.

1. INTRODUCTION

Subsequent to the classical results on large deviations due to Donsker and Varadhan, lower bounds were reconsidered under dierent sets of conditions, for Markov chains as well as for continuous time Markov proceses (see e.g. [1, 2, 8, 16, 17]). Regarding the continuous time case we also mention Takeda's results concerning symmetric Feller processes with short lifetimes ([13, 14]).

Section 2 of this paper is mainly based on the results related to large deviations lower bounds for Markov chains in [17]; these consists in expressing the rate in [2] in terms of the Donsker and Varadhan entropy and in obtaining from the results in [2] large deviations lower bounds for occupation measures.

We go along the same lines with a further modication of the rate used in [17], appropriate to the substochastic case.

In Section 3 we consider processes that are m-simultaneously irreducible (i.e., each h-skeleton of the process is m-irreducible) and recall from [10] and [15] some properties of these processes to be subsequently used. We prove that for anm-irreducible process the following dichotomy holds: either the process is m-simultaneously irreducible, or there is a strong singularity of the semigroup of the process with respect to m.

REV. ROUMAINE MATH. PURES APPL., 58 (2013), 2, 163173

(2)

We then turn to m-simultaneously irreducible Feller processes and show that the proof of large deviations lower bounds in [6] (given under much stronger regularity conditions) still works in our case with the modied rate. It actually comes to applying the large deviations lower bounds result obtained in Section 2 to a certain skeleton of the process.

Finally, in the last section we work under Takeda's hypotheses in [13, 14]

except that we eliminate m-symmetry (which actually comes essentially into account in the proof of large deviations lower bounds there). We show that for such a process the decay parameter associated with it by m-irreducibility and the spectral radius of the semigroup on bounded functions coincide.

2. IRREDUCIBLE MARKOV CHAINS:

THE SUBSTOCHASTIC CASE

Following [1, 2] and [17] we adopt the approach based on the theory of irreducible kernels developed in [9].

Recall that a nonnegative kernel K(x,dy) on a measurable space (E,E), with E countably generated is said to be m-irreducible (m being a non-zero, non-negative σ-nite measure on(E,E)) if

m(A)>0 implies

X

k=0

K(n)(x, A)>0, for any x∈E.

One may assume m to be a maximal irreducibility measure, i.e. mK m.

With anym-irreducible kernelKone associates the convergence parameter R(K) dened as

R(K) := sup{r ≥0 :

P

k=0

rnK(n)(x, A)<∞, for somex∈E, A∈ E with m(A)>0}.

Let now(Xn)nbe a Markov chain dened on Ω,F,(Fn)n,(Px)x∈E values in the Polish space(E,E),substochastic transition kernelP and lifetimewith ζ. Throughout the paper we denote by bE the space of all real and bounded measurable functions on (E,E) and by Mb(E) the space of all nite signed measures on(E,E).Also,M1(E) will be the space of all probability measures on (E,E); as usual we endowM1(E) with the weak topology.

With any V ∈bE and P one associates the kernel PV (x,dy) := [expV(x)]P(x,dy) whose powers are expressed as

PV(n)

g(x) =Px g(Xn) exp

n−1

X

k=0

V (Xk) ; n < ζ

! .

(3)

The kernel PV is m-irreducible whenever the kernel P is m-irreducible;

therefore, one may associate with it the convergence parameter R PV and dene

Λ (V) :=−logR PV .

It is known from [1] that Λ is a proper convex function; it is also lower semicontinuous with respect to the weak topology σ(bE,Mb(E)) because it satises the criterion established in Lemma 3.4 of [17] in this sense. The fact that the transition kernel P is substochastic is irrelevant in these properties.

We now introduce the rate function that will govern the large deviations lower bounds results in this paper. For any ν∈ M1(E)let

Jm(ν) :=

 sup{R

logP u+εu+ε dν;ε >0, u∈bE, u≥0} whenν m

∞ otherwise

In the sequel, we proceed to establish that the fundamental connection betweenJmandΛvia the Legendre transforms holds in our case (substochastic transition kernel, modied rate). To do that, we follow a well trodden way, as for instance in [17]; the proof is in a way simpler since we do not work on truncated spaces and conne ourself to the spaces bE and Mb(E).

Proposition 1. If Λ :Mb(E)→IRS

{∞} is the Legendre transform of Λ on bE, i.e.

Λ(ν) := sup{< ν, V >−Λ (V) ; V ∈bE}

then, for any ν ∈ Mb(E) we have

Λ(ν) =

Jm(ν) if ν ∈ M1(E) +∞ otherwise.

Proof. ThatΛ is+∞whenν ∈ Mb(E)is such thatν6= 0, or whenνis positive butν /∈ M1(E) may be proven exactly as in [12], top of page 134; the equality Λ(ν) = +∞ when ν ∈ M1(E), but ν is not absolutely continuous w.r.t. m follows as in [17], proof of Theorem 2.3.

Let now ν m. To get the equality Λ(ν) ≥ Jm(ν) we have to show that for arbitrary u, ε as in the denition of Jm(ν) there exists V ∈bE such

that Z

log u+ε

P u+εdν ≤< ν, V >−Λ (V).

For that it is enough to take V := logP u+εu+ε which takes values in h

logMε

u,logMuεi

(Mu := supu) and is such thatPV (u+ε)≤u+ε, whence R PV

≥1 and thus,Λ (V)≤0 , which leads to the required inequality.

(4)

For the opposite inequality we extend to the substochastic case and the present rate the argument in Lemma 6.1 from [1]. Namely, dening

Jm0 (ν) := sup ( R

logP u+εu+ε dν; ε >0, u≥0, ν(u=∞) = 0,

logh

u+ε P u+ε

i

∈L1(ν) )

we want to show thatJm0 (ν) =Jm(ν).To do this consider as in [7, Section 3]

the space E:=E∪ {∆} with∆an isolated point, extend each function u in E to a functioneu inE by giving it a constant value in∆ and dene

Pe(x, A) :=

P(x, A) x∈E, A⊆ E 1−P(x, E) x∈E, A={∆}

1 x= ∆, A={∆}.

Consider now, Jem(ν) := supn

Rlog u+ε

Peeu+εdν;ε >0, eu≥0, eu∈bEeo ,

Jem0 (ν) := sup

Rlog u+ε

Peeu+εdν;ε >0, eu≥0, ν(ue=∞) = 0,

log h u+ε

Peu+εe

i

∈L1(ν)

 .

We haveJm(ν) =Jem(ν)by Lemma 3.2 in [7],Jem(ν) =Jem0 (ν)by Lemma 6.1 in [1] and Jm(ν) ≤ Jm0 (ν) since the functions whose log is integrated in Jm(ν) are bounded. So, we have only to prove thatJm0 (ν) ≤Jem0 (ν). To this end consider a sequence (uen)n in E obtained from a given u in E satisfying the conditions in the denition of Jm0 (ν) and such that uen(∆) = cn ↓ 0, h

logP u+cu+ε

n(1−P1)+ε

i

∈L1(ν).We have

Jem0 (ν)≥ Z

log u+ε

P u+cn(1−P1) +εdν for everyn and thus, by generalized Fatou's Lemma we have

Jem0 (ν)≥lim inf

n→∞

Z

log u+ε

P u+cn(1−P1) +εdν ≥ Z

lim inf

n→∞ log u+ε

P u+cn(1−P1) +ε

dν = Z

log u+ε P u+εdν whenceJm0 (ν)≤Jem0 (ν) and thus,Jm0 (ν) =Jm(ν).

Turning now to the inequality Λ(ν) ≤ Jm0 (ν), consider V ∈ bE , λ >

Λ (V) and s a small function associated with PV. Then, the function u :=

P

k=0

e−λk PV(k)

sis such that u > s, ν(u=∞) = 0,P u(x) =eλ−V(x)(u(x)−

(5)

s(x))and therefore,

log u+ε

P u+ε ≥ −λ+V + log P u P u+ε implying

Jm0 (ν)≥sup

ε>0

−λ+ν(V) +ν

log P u P u+ε

=−λ+ν(V) leading to the inequality we were looking for.

The equality in Proposition 1 being established in our context, the next results (Proposition 2 and Theorem 1 below) follow exactly as in [17], because the other properties involved (Fenchel's duality theorem, de Acosta's continuity theorem of the convergence parameter, as well as the criterion established in [17] for lower semicontinuity in the weak topologyσ(bE,Mb(E))) are general results. However, we would like to point out that in proving Theorem 1 (along the lines in [17]) we invoke Proposition 2.1 in [2] (valid in the substochastic case) instead of the result correspondingly used in [17].

Proposition 2. For any V ∈bE we have

λ(V) = sup{< ν, V >−Jm(ν) ; ν ∈ M1(E)}.

Corollary. The convergence parameter R(P) satises the equality R(P) = exp

ν∈Minf1(E)Jm(ν)

.

Theorem 1. Let X = (Xn) be an m-irreducible Markov chain and let Ln(ω, A) := 1n

n−1

P

k=0

1A(Xk(ω)), forn < ζ(ω)andA∈ E. Then for every initial measure η and every setG open in the weak topology ofM1(E) we have

lim inf

n→∞

1

nlogPη(Ln∈G, n < ζ)≥ −inf

ν∈GJm(ν).

3. SIMULTANEOUSLY IRREDUCIBLE FELLER PROCESSES WITH LIFETIME

The rst part of this section regards general Markov processes. So let X =

Ω,F,(Ft)t≥0,(θt)t≥0,(Xt)t≥0,(Px)x∈E

be a right Markov process with borelian state space(E,E), semigroup of transition functions(Pt)t≥0, resolvent Uαf(x) =R

0 e−αtPtf(x),f ∈ E,f ≥0 and lifetimeζ.

(6)

Denition 1. A Markov process X is said to bem-irreducible if m(Γ)>0⇒U1(x,Γ)>0, for every x∈E.

With every m-irreducible Markov process we associate the decay parame- ter dened by

λ:= sup{γ ≥0 : Z

0

exp (γt)Pt(x, A)>0,for somex∈E, A∈ Ewith m(A)>0}.

Let us now introduce (according to [15]) them-simultaneously irreducible Markov processes and discuss some of their properties. These processes were also considered in [10] under the name of non-singular Markov processes.

Denition 2. A Markov process X is said to be m-simultaneously irre- ducible if for each h >0the chain(Xnh)n is m-irreducible.

Consider now the Lebesgue decomposition Pt(x,·) =Pta(x,·) +Pts(x,·), wherePta(x,·)m and Pts(x,·) is singular with respect tom. Following [10]

set

ϕ(x) :=

Z 0

e−uPu(x, E) du and recall

Proposition 3 ([10], (6.2)). The following conditions are equivalent:

(i) The processX ism-simultaneously irreducible

(ii) For some h >0 the skeleton {Pnh}n=0,1,2... is m-irreducible.

(iii) There exists B0 ∈ E, with m(B0) > 0 such that ϕ(x) > 0, for every x∈B0.

Under any of these conditions the convergence parameter Rh of the h-skeleton is given byRh= exp (λh), whereλ is the decay parameter ofX.

It is interesting to note the following dichotomous result:

Proposition 4. If X is m-irreducible, with m satisfying mPt m, for every t, then we have either ϕ(x)>0 for every x∈E, orm(ϕ >0) = 0.

Proof. Let us show rst that ϕis 1-super-mean-valued. Indeed, using the fact that under mPtm, for everyt one has Pt+us ≤PtsPus and therefore,

e−tPtϕ(x) =e−tR

0 e−uPt(Pu−Pus) (x, E) du=

≤R

0 e−(t+u)(Pt+u−PtsPus) (x, E) du

≤R

0 e−(t+u) Pt+u−Pt+us

(x, E) du

=R

t e−vPva(x, E) du≤ϕ(x).

(7)

Let ϕe := limt↓0e−tPtϕ be the 1-excessive regularization of ϕ (see [3], Ch. II). Then ϕe≤ϕ and U1(x,ϕ < ϕ) = 0e , for every x ∈ E, which implies m(ϕ < ϕ) = 0e . The set {ϕe= 0} being absorbing, by m-irreducibility we have either{ϕe= 0}=∅or m(ϕ >e 0) = 0, whence the asserted dichotomy.

Remarks. 1. WheneverX isψ-irreducible, there exists aσ-nite measure m such thatX ism-irreducible andmPtm, for everyt >0.See the remark preceding Proposition 1.1 in [15].

2. According to the Corollary to Proposition 2 in casePh ism-irreducible we have R(Ph) = exp

inf

ν∈M1(E)Jm(h)(ν)

whereJm(h)(ν) is the rate associated with (Xnh)n as in Section 2. Using this and Proposition 3, we get for an m-simultaneously irreducible process

λ=h−1 inf

ν∈M1(E)Jm(h)(ν) for every h >0.

From now, on our process X will be a Feller process (as dened for in- stance in [3]), i.e. having as state space a locally compact with countable base Hausdor space E and subject to the following conditions:

(i) Pt(C0(E))⊆ C0(E), for everyt >0 (ii) lim

t→0 sup

x∈E

|Ptf(x)−f(x)|= 0, for anyf ∈ C0(E).

Here C0(E) designates the class of all continuous functions on E that vanish at innity.

For any ω ∈Ω, t < ζ(ω)dene Lt(ω, B) := 1

t Z t

0

1B(Xs(ω)) ds, B ∈ E and for anynh < ζ(ω)

L(h)n (ω, B) := 1 n

n−1

X

k=0

1B(Xkh(ω)), B ∈ E.

Let A be the generator of X, i.e. Af = αf −g when f = Uαg, α >

0, g∈bE.

We now introduce the rate associated with them-irreducible Feller process X and ν ∈ M1(E)

Jm(ν) :=

sup{−R Au

u+εdν;ε >0, u=Uαf, f ∈ Cb(E), f ≥0} whenν m

∞ otherwise

(8)

As usualCb(E)denotes the class of all continuous and bounded functions on E. Note that this denition takes into account the modication of the rate introduced in [17] as well as the one in [13, 14].

Theorem 2. Let X be a Feller process, simultaneously irreducible with respect to the measure m. Then, for any set G, open in M1(E) we have

lim inf 1

t→∞ t

logPx(Lt∈G, t < ζ)≥ −inf

ν∈GJm(ν).

Proof. The stepwise procedure used in the proof of (1.13), Theorem 3 in [6] consists in reducing the problem to the large deviations lower bounds of a certain skeleton(Xnh0)nof the process. This scheme still works in our context considering a metric onE×E(Ebeing the Alexandro compactication of E) and taking into account the lifetime of the process and of its skeletons. When appling the corresponding asymptotic result toL(hn0)we use Teorem 1 in Section 1 above and Lemma 4.1 in [13] which actually states thatJm(h)(ν)≤hJm(ν), for any ν∈ M1(E).The Feller assumption plays an ssential role when evaluating the distance between Lnh and L(h)n by the Levy-Prohorov metric on M1(E) and choosing the adequate h0-skeleton.

Remark. By Lemma 4.1 in [13] and Remark 2 following Proposition 4 above, we haveλ≤infν∈M1(E)Jm(ν).

4. FELLER PROCESSES WITH FAST EXPLOSION

In this section, we strenghten the hypotheses on the process X, namely we assume that:

(I) X is Feller

(II) X is m-irreducible

(III) Pt(x,·)m, for each t >0, x∈E (IV) U11∈ C0(E)

As direct consequences of these hypotheses we get that m has full support, that X is m-simultaneously irreducible and (according to [11]) that sup

x∈E

Px[exp (γζ)]<∞for someγ >0(whence the designation of these processes as being with fast explosion).

In fact our hypotheses dier from the ones in [13, 14] only by the fact that we do not assumem-symmetry. Them-symmetry hypothesis is involved in Takeda's theorem on large deviations only in the lower bound and in expressing the rate in terms of the Dirichelet form associated with the process onL2(m).

(9)

Remark. If we assumed Pt(Cb(E)) ⊆ Cb(E), for every t > 0 and U11 ∈ C0(E), then we would havePt(C0(E))⊆ C0(E), for every t >0.

Let us now consider the spectral radius of the semigroup (Pt) on bE, i.e.

λ0:= lim

t→∞

−t−1logkPtkbE .

In the context of Feller processes with fast explosion the impact of this parameter was studied in [11]; among other things it is proved there that

1 1 +λ0

= lim

n→∞

sup

x∈E

U1(n)1 (x) 1/n

.

For such processes we have also a formula related to the decay parameter λ: there exists a setN, withm(N) = 0 such that

1

1 +λ = lim sup

n→∞

(U1(n)1 (x))1/n for every x∈E\N (see Proposition 5 in [5]).

Of course, Theorem 2 above is valid for these processes as well as the in- equality λ≤infν∈M1(E)Jm(ν),which in turn impliesλ0 ≤infν∈M1(E)Jm(ν). But it is worth noting that in this case infν∈M1(E)Jm(ν) is in fact attained.

To this end let us recall from [11], Theorem 3.6, that under conditions (I), (II) and (IV) there exists a λ0-invariant probability measure µ, i.e. eλ0tµPt = µ, for every t >0.Conditions (II) and (III) imply that µ∼m.

Proposition 5. Under conditions (I)(IV) we have Jm(µ) =λ0.

Proof. We already know that λ0 ≤Jm(µ) ; to prove the inequality λ0 ≥ Jm(µ) let f ∈ Cb(E), f ≥0 andu= ¯Uαf.Then, ifMf := sup

x∈E

f(x) we have R Au

u+εdµ = R αUαf−f

Uαf+ε dµ≥ Mα

f+αε

α

α+λ0 −1

µ(f) =

= α+λα

0

µ(f)

Mf+αε(−λ0)≥ −λ0 which implies the asserted inequality.

We end up this section with some properties of a class of compacts that naturally appear in the context. For each γ >0 let

Kγ(n):=

U1(n)1≥ 1 (1 +γ)n

. Because for eachγ < λ0, Φγ1 (x) =

R

0

eγtPt1 (x) dt=

P

k=0

(1 +γ)nU1(n+1)1 (x) is bounded, there exists Nγ such that Kγ(n) = ∅, for every n ≥ Nγ. On the

(10)

other hand, we have Kλ(n)

0 6= ∅, for every n, because the existence of n0 such thatKλ(n)

0 =∅would implysup

x∈E

U1(n0)1 (x)< (1+λ1

0)n contradicting the fact that

1

1+λ0 = inf

p

sup

x∈E

U1(p)1 (x) 1/p

.

Proposition 6. Under conditions (I)(IV) the following properties hold:

(i) For everyγ > λ0 we havelim sup

n

Kγ(n)=E (ii) sup

n≥m

U1(n)11/n

1+λ1

0 uniformly as m→ ∞.

Proof. (i) The set N alluded to at the begining of this section vanishes under our hypotheses and thus, sinceλ0 =λwe havelim sup

n→∞ U1(n)1 (x)1/n

=

1

1+λ0 > 1+γ1 for every x∈E, which obviously implies lim sup

n

Kγ(n)=E.

For (ii) assume by way of contradiction that the stated uniform conver- gence does not hold. Then, there would be an ε0 such that for every m there existsnmandxmsuch that U1(nm)1 (xm)1/nm

≥ε0+1+λ1

0 = 1+γ1 withγ < λ0, contradicting the fact that for γ < λ0 there exists Nγ such that Kγ(n) =∅, for n≥Nγ.

A further analysis of these sets Kγ(n) could pave the way for a direct approach to the equality λ = λ0, which is of independent interest in the λ-theory. For related questions see also [4].

REFERENCES

[1] A. de Acosta, Large deviations for vector-valued functionals of a Markov chain: lower bounds. Ann. Probab. 16 (1988), 925990.

[2] A. de Acosta and P. Ney, Large deviations lower bounds for arbitrary additive functionals of a Markov chain. Ann. Probab. 26 (1998), 16601682.

[3] R.M. Blumenthal and R.K. Getoor, Markov Processes and Potential Theory. Academic Press, New York, 1968.

[4] M. Buiculescu, Exponential decay parameters associated with excessive measures. In:

Seminaire de Probabilites XXXVIII, Lecture Notes in Math. 1857 (2005), Springer, 135144.

[5] M. Buiculescu, Feller processes with fast explosion. Rev. Roumaine Math. Pures Appl.

52 (2007), 631638.

[6] M.D. Donsker and S.R.S. Varadhan, Asymptotic evaluation of certain Markov processes expectations for large time. I, Comm. Pure Appl. Math. 28 (1975), 147.

[7] M.D. Donsker and S.R.S. Varadhan, Asymptotic evaluation of certain Markov processes expectations for large time. III, Comm. Pure Appl. Math. 29 (1976), 389461.

(11)

[8] N.C. Jain, Large deviation lower bounds for additive functionals of Markov processes.

Ann. Probab. 18 (1990), 10711098.

[9] E. Nummelin, General Irreducible Markov Chains and Non-Negative Operators. Cam- bridge Tracts in Math. 83, Cambridge University Press, 1984.

[10] S. Niemi and E. Nummelin, On non-singular renewal kernels with applications to a semigroup of kernels. Stochastic Process. Appl. 22 (1986), 177202.

[11] S. Sato, An inequality for the spectral radius for Markov processes. Kodai Math. J. 6 (1985), 447536.

[12] D.W. Stroock, An Introduction to the Theory of Large Deviations. Springer, Berlin, 1984.

[13] M. Takeda, On a large deviations for symmetric Markov processes with nite lifetime.

Stochastics and Stochastic Reports 59 (1996), 143167.

[14] M. Takeda, Lp-independence of the spectral radius of symmetric Markov semigroups. In:

Stochastic Processes, Physics and Geometry: New Interplays (Leipzig, 1999), pp.613 623. CMS Conf. Proc. 29. Amer. Math. Soc., RI, 2000.

[15] P. Tuominen and R.J. Tweedie, Exponential decay of general Markov processes and their discrete skeletons. Adv. in Appl. Probab. 11 (1979), 784802.

[16] L.M. Wu, Uniformly integrable operators and large deviations for Markov processes.

J. Funct. Anal. 172 (2000), 301376.

[17] W. Liu and L.M. Wu, Identication of the rate function for large deviations of an irre- ducible Markov chain. Electron. Commun. Probab. 14 (2009), 540551.

Received 31 December 2012 Romanian Academy

Gheorghe Mihoc and Caius Iacob Institute of Mathematical Statistics

and Applied Mathematics, Calea 13 Septembrie nr. 13, 050711 Bucharest, Romania mioara_buiculescu@yahoo.com

Références

Documents relatifs

Zannier, “A uniform relative Dobrowolski’s lower bound over abelian extensions”. Delsinne, “Le probl` eme de Lehmer relatif en dimension sup´

We add that for the problem of upper large deviations, the rate function has been established in [10, 8] and finer asymptotic results in the case of geometric offspring

Abstract The Lamperti correspondence gives a prominent role to two random time changes: the exponential functional of a L´evy process drifting to ∞ and its inverse, the clock of

The chemical distance D(x, y ) is the length of the shortest open path between two points x and y in an infinite Bernoulli percolation cluster.. From this bound we also derive a

Unité de recherche INRIA Rennes : IRISA, Campus universitaire de Beaulieu - 35042 Rennes Cedex (France) Unité de recherche INRIA Rhône-Alpes : 655, avenue de l’Europe - 38334

In this section, we exhibit several examples of Markov chains satisfying H (p) (for different norms) for which one can prove a lower bound for the deviation probability of

Probability Laws Related To The Jacobi Theta and Riemann Zeta Functions, and Brownian Excursions.. Integrals and

Abstract: For beta ensembles with convex poynomial potentials, we prove a large deviation principle for the empirical spectral distribution seen from the rightmost particle..