• Aucun résultat trouvé

Parametric first-order Edgeworth expansion for Markov additive functionals. Application to <span class="mathjax-formula">$M$</span>-estimations

N/A
N/A
Protected

Academic year: 2022

Partager "Parametric first-order Edgeworth expansion for Markov additive functionals. Application to <span class="mathjax-formula">$M$</span>-estimations"

Copied!
28
0
0

Texte intégral

(1)

www.imstat.org/aihp 2015, Vol. 51, No. 2, 781–808

DOI:10.1214/13-AIHP592

© Association des Publications de l’Institut Henri Poincaré, 2015

Parametric first-order Edgeworth expansion for Markov additive functionals. Application to M -estimations

D. Ferré

L.A.R.E.M.A. (UMR-CNRS 6093), Université d’Angers, UFR sciences, 2 bd Lavoisier, 49045 Angers Cedex 01, France.

E-mail:[email protected]

Received 17 February 2012; revised 25 September 2013; accepted 4 November 2013

Abstract. We give a spectral approach to prove a parametric first-order Edgeworth expansion for bivariate additive functionals of strongly ergodic Markov chains. In particular, given anyV-geometrically ergodic Markov chain(Xn)n∈Nwhose distribution de- pends on a parameterθ, we prove thatp(Xn1, Xn);pP, n≥1}satisfies a uniform (in(θ, p)) first-order Edgeworth expansion provided that{ξp(·,·);pP}satisfies some non-lattice condition and an almost optimal moment domination condition. Further- more, the sequence(Xn)n∈Nneed not be stationary. This result is applied toM-estimators of Markov chains and in particular of V-geometrically ergodic Markov chains. TheM-estimators of some autoregressive processes are studied.

Résumé. Grâce à une approche spectrale, nous donnons des conditions assurant la validité du développement d’Edgeworth d’ordre 1 paramétrique, dans le cadre général des fonctionnelles bivariées et additives de chaînes de Markov fortement ergodiques. En particulier, soit(Xn)n∈Nune chaîne de MarkovV-géométriquement ergodique dont la loi dépend d’un paramètreθ. Nous montrons alors que{ξp(Xn1, Xn);pP, n≥1}satisfait un développement d’Edgeworth d’ordre 1 uniforme (en(θ, p)) si{ξp(·,·);pP}

satisfait une condition de type non-lattice ainsi qu’une condition quasi-optimale de moment-domination. De plus, ce résultat est établi dans le cas où les données(Xn)n∈Nne sont pas nécessairement stationnaires. Ce résultat est appliqué en particulier aux M-estimateurs associés à des chaînes de MarkovV-géométriquement ergodiques. LesM-estimateurs de processus autorégressifs sont étudiés.

MSC:60F05; 60J05; 62F12; 62M05

Keywords:Edgeworth expansion;V-geometrically ergodic Markov chain; Non-arithmeticity condition; Perturbation operator

1. Introduction

Let(E,E)be any measurable space, and let(Xn)n0 be a Markov chain on a general state space E with transition kernel(Qθ(x,·);xE)whereθis a parameter in some setΘ. The initial distribution of the chain is denoted byμθ. The underlying probability measure is denoted byPθ,μθ.

Let{ξp(·,·);pP}be a family of measurable functions fromE2intoR, whereP is any set. Let us define the following bivariate additive functionals

n≥1,∀pP, Sn(p):=

n k=1

ξp(Xk1, Xk). (1)

(2)

We are interested in appropriate conditions on the model, on the family{ξp(·,·);pP}and on the initial probability measureμθ, under which a first-order Edgeworth expansion exists (also called Esseen theorem), namely there exist a polynomial functionAθ,p(·)and a positive real numberσθ,psuch that

sup

(θ,p)Θ×Psup

u∈R

Pθ,μθ

Sn(p) σθ,p

nu

N(u)η(u)n1/2Aθ,p(u) =o

n1/2

, (2)

whereN is the standard normal distribution function andηis its density. Note that expansion (2) holds uniformly in (θ, p)Θ×P.

One of these appropriate conditions is the following non-arithmeticity condition:

Hypothesis (N-A) (Non-arithmeticity). For any compact subsetK0ofR,there existsρ∈ [0,1)such that for all n≥1, sup{|Eθ,μθ[eit Sn(p)]|;tK0, (θ, p)Θ×P} =O(ρn).

This non-arithmeticity condition may be satisfied under the following non-lattice condition:

Hypothesis (N-L). For allpP,there exist neither continuous functionAp(·):E→Rnor constantCpsuch that we have for all(x, y)E2,ξp(x, y)=Ap(y)Ap(x)+Cp.

As illustrated later in M-estimation, the bivariate and parametric form of (1), as well as the previous uniform control and the possible non-stationarity ofμθ, are required for statistical applications.

Edgeworth expansions in the Markov setting can be established by the two following methods:

1. The regeneration method. This standard method, introduced by [29], was used by Bolthausen [4] to establish the Berry–Esseen theorem for univariate additive functionals of the form Sn =n

k=1ξ(Xk), by splitting Sn

into a sum of independent blocks. This method can be applied to the general class of Harris-recurrent chains (Xn)n0which either possess an accessible atom or satisfy some minorization condition. Bolthausen’s work was extended to Edgeworth expansions by Malinovskii [23] and next generalized to bivariate additive functionals Sn=n

k=1ξ(Xk1, Xk)by Jensen [19].

Note that in [4,19,23], neither the distribution of(Xn)n0nor the functionξ depends on parameters. However a recent work due to Bertail and Clémençon [2] provides a Berry–Esseen theorem adapted to the above men- tioned parametric setting (just mention that the regeneration method allows to establish the Berry–Esseen theorem for “studentized functionals”), but the extension to Edgeworth expansions would generate even more difficulties.

Furthermore this statement only concerns univariate additive functionals and the extension of their proof to the bivariate case (1) induces dependence between the regeneration blocks and hence provides at least one more diffi- culty to handle with.

2. The weak Nagaev–Guivarc’h spectral method. This method, based on the Keller–Liverani perturbation theorem [20], enables the statement of limit theorems for additive functionals associated to strongly ergodic Markov chains (Harris recurrence is no more required). This method has been fully described in [18] in the case of univariate additive functionals. It is specially efficient forρ-mixing andV-geometrically Markov chains, as well as for iterated function systems. In those models, the extension of Berry–Esseen type results of [18] to the case of bivariate additive functionals of the type (1) has already been obtained in [11,12,17] with in addition the desired control on the parameters(θ, p). The resulting moment conditions on {ξp;pP}are (almost) optimal with respect to the independent case. Let us note that they are explicit for these three models and do not depend on the initial probability (unlike the ones given by the regeneration method).

In this paper, we will state expansion (2) for the class of strongly ergodic Markov chains, and apply Fourier techniques via the perturbation operator theory of Nagaev–Guivarc’h.

Our work extends the Berry–Esseen type results of Hervé, Ledoux and Patilea [17] to the first-order Edgeworth expansion. As in the independent case, the gap from Berry–Esseen to Edgeworth type results induces at least a new difficulty: the requirement of the non-arithmeticity hypothesis.

In Section2.1, we consider a family of random variables (r.v.)Sn(p)(not necessarily derived from Markovian models) defined on a general parametric probability space(Ω,F,{Pθ;θΘ}), and we state hypotheses calledR(m)

(3)

and (N-A) under which expansion (2) holds true. These hypotheses concern the behavior of the characteristic function tφn,p(t)ofSn(p): HypothesisR(m)focuses on the form and the regularity ofφn,pneart=0; whereas Hypothesis (N-A) focuses on the behavior ofφn,poutsidet=0.

In Section2.2, we specify the form ofSn(p): from now on,Sn(p)is defined by (1) where(Xn)n0is assumed to be a strongly ergodic Markov chain, and we give a brief review of the weak Nagaev–Guivarc’h spectral method to check HypothesisR(m)and (N-A) in this Markov context. In fact, as already done in [17], HypothesisR(m)can be investigated thanks to an easy extension of the results of [18].

By contrast, the method developed in [18] is not sufficient to study Hypothesis (N-A). Indeed, the non-arithmeticity condition has to be checked uniformly in both the parameterθof the Markovian model and the parameterp of the family{ξp;pP}involved in (1). The study of (N-A) in this context is original and constitutes an important part of this work (actually, even in the independent case, this question is far from being obvious). In our Markov setting, this study is based on the operator perturbation theory, quasi-compactness arguments and Ascoli theorem. Specifically, in Section3, we give three approaches to reduce Hypothesis (N-A) to some simple non-lattice conditions in the case of general strongly ergodic Markov chains.

Section4is devoted toV-geometrically ergodic Markov chains. For this instance and more specifically for dom- inated models,1we reduce (N-A) using one of the three approaches presented in Section3.3. Combining this result together with the sufficient conditions of [17] to check HypothesisR(m)and the general Edgeworth type statement of Section2.1, provides expansion (2) under assumptions close to the ones of the independent case.

At last, statistical applications are studied in Section5: a first-order Edgeworth expansion for M-estimators of dominatedV-geometrically ergodic Markov chains is derived from the results of Section4.

More precisely, let α0 be the so-called true value of some real parameter of interest andˆn)n∈N its estimator of the formαˆn:=arg minα∈A1nn

k=1F (α, Xk1, Xk), where(Xn)n0is some dominatedV-geometrically ergodic Markov chain (whose underlying probability measure is still denoted byPθ,μθ). We state in Section5the appropriate conditions under which there exist a polynomial function Aθ(·)and a positive real numberσ (θ ) (both explicitly defined in Theorem2) such that

sup

θΘ

sup

u∈R

Pθ,μθ

n

σ (θ )(αˆnα0)u

N(u)η(u)n1/2Aθ(u) =o

n1/2 .

This theorem, which extends Pfanzagl theorem [25] obtained for independent and identically distributed (i.i.d.) data under some moment conditions of order 3, is valid under a natural adaptation of the statistical regularity conditions of [25], moment domination conditions of order 3+ε, and some simple non-lattice condition as well. To the best of our knowledge, this result is new. Notice that our moment domination conditions are not only almost optimal, but also take the same form as the ones used in [6] to prove the asymptotic normality ofM-estimators underV-geometrically ergodicity.

Whereas statistical applications studied in Section5only concern dominated models, the results of Section6are much more general. Indeed the adaptation of Pfanzagl proof is developed in Section6for general statistical models under HypothesesR(3)and (N-A). Note that this adaptation is not straightforward. Finally, the results of this section are applied in Section6.4toM-estimators of an example of non-dominatedV-geometrically ergodic Markov chains:

some AR(d)processes withd >1.

2. Fourier techniques and first-order Edgeworth expansion

In this section, we present some results based on Fourier techniques. These results appeal to the next Hypotheses R(m)and (N-A) that are well-suited for the markovian case as explained in Section2.2.

2.1. HypothesesR(m)and(N-A)and first-order Edgeworth expansion

Let(Ω,F,{Pθ;θΘ})be any statistical model, whereΘ is some parameter space. The underlying expectation is denoted byEθ. Consider a family{Sn(p);n∈N, pP}of real r.v. defined on(Ω,F,{Pθ;θΘ}), whereP is any set. Note that, since we will get results uniformly in(θ, p), they can be applied when the parameterpdepends onθ.

1This choice is only for convenience but is not necessary, see, for example, Section6.4.

(4)

HypothesisR(m), mN. There exists a bounded open intervalI0⊂Roft=0such that one has for all(θ, p)Θ×P,n≥1,tI0

Eθ eit Sn(p)

=λθ,p(t)n

1+lθ,p(t)

+rθ,p,n(t), (3)

whereλθ,p(·),lθ,p(·)andrθ,p,n(·)areC-valued functions of classCmonI0satisfying the following properties:

λθ,p(0)=1, λ(1)θ,p(0)=0, lθ,p(0)=0, rθ,p,n(0)=0, and for=0, . . . , m

supλ()

θ,p(t)+l()

θ,p(t);tI0, (θ, p)Θ×P

<+∞,

κ∈ [0,1),∃G >0,∀n≥1, suprθ,p,n() (t);tI0, (θ, p)Θ×P

n.

Furthermore,the functionsλ(m)θ,p(·),lθ,p(m)(·)andrθ,p,n(m) (·)are continuous onI0uniformly in(θ, p)Θ×P. Hypothesis (N-A) (Non-arithmeticity). For any compact subsetK0ofR,there existsρ∈ [0,1)such that

n≥1, supEθ eit Sn(p);tK0, (θ, p)Θ×P

=O ρn

.

Note that under HypothesisR(2), the functiont→Eθ[eit Sn(p)]is of classC2onI0for all(θ, p)Θ×P. Then by Fatou lemma, for all(θ, p)Θ×P, one hasEθ[Sn(p)2]<+∞. Therefore, when considering the derivative of Eq. (3), one easily obtains that for all (θ, p)Θ×P, limEθ[Sn(p)]/n=0 when n→ +∞. Note that under HypothesisR(2), when considering the second derivative of Eq. (3), one easily obtains as well

n≥1, lim

n→+∞ sup

(θ,p)Θ×P

Eθ[Sn(p)2] n

<+∞, (4)

and in a similar way, under HypothesisR(4),

n≥1, lim

n→+∞ sup

(θ,p)Θ×P

Eθ[Sn(p)4] n2

<+∞. (5) Finally, under Hypothesis R(3), we obtain some of the assertions of Proposition1 below. The other ones can be proved by borrowing the proof of [8], Chapter XVI.4, Theorem 4.1.

Proposition 1 (First-order Edgeworth expansion). If{Sn(p);n∈N, pP}satisfies HypothesisR(3),then for all(θ, p)Θ×P,the following limits

bθ,p:= lim

n→+∞Eθ Sn(p)

, σθ,p2 := lim

n→+∞

Eθ[Sn(p)2]

n ,

are well-defined and bounded inθΘ.The numberbθ,pis the bias of order1/nof the statistics of interestSn(p)/n.

Furthermore ifinf(θ,p)Θ×Pσθ,p>0and if the family {Sn(p);n∈N, pP}satisfies Hypothesis(N-A)as well, then there exists a polynomial functionAθ,psuch that

sup

(θ,p)Θ×Psup

u∈R

Pθ

Sn(p) σθ,p

nu

N(u)η(u)n1/2Aθ,p(u) =o

n1/2 .

The polynomial function is of the type Aθ,p(u)=a1(θ, p)+a2(θ, p)u2 where the coefficients satisfy fori=1,2, sup(θ,p)Θ×P|ai(θ, p)|<+∞.Furthermore,ifEθ[|Sn(p)|3]<+∞for alln≥1and(θ, p)Θ×P,then the limit

m3θ,p,3:= lim

n→+∞

Eθ[Sn(p)3]

n −3σθ,p2 bθ,p

(5)

is well-defined and bounded inθΘ,and moreoverAθ,p(u):=m3θ,p,33 θ,p

(1u2)bσθ,pθ,p.

Remark 1. In the i.i.d.case,HypothesesR(3)and(N-A)are easily checked.Indeed consider(Xn)n∈Na sequence of i.i.d.E-valued r.v.whose common distribution depends onθΘ,and{ξp(·);pP}a family of measurable functions fromEintoR.The following assertions are obviously equivalent:

(a) The family{n

k=1ξp(Xk);n∈N, pP}fulfills HypothesisR(m)if and only ifEθ[ξp(X1)] =0for all(θ, p)Θ×Pandsup(θ,p)Θ×PEθ[|ξp(X1)|m]<+∞.

(b) The family{n

k=1ξp(Xk);n∈N, pP}fulfills Hypothesis(N-A)if and only if,for any compact subsetK0of R,one has

sup

tK0

sup

(θ,p)Θ×P

Eθ eit ξp(X1)<1. (6) When (6)is considered at (θ, p)fixed, it can be easily relaxed to the usual condition: ξp(X1)is non-lattice.By contrast,it is not easy to relax the uniform condition(6).Note that this condition is only discussed in[25]under the stronger Cramér condition:

lim sup

t→+∞ sup

(θ,p)Θ×P

Eθ eit ξp(X1)<1.

HypothesesR(m)and(N-A)are the tailor-made assumptions to borrow the proof of the first-order Edgeworth expan- sion in the i.i.d.case,2and consequently to expandPθ{Sn(p)/(σθ,p

n)u}with a polynomial function independent onn.Notice that,under less restrictive conditions,the results of[7]provide a first-order Edgeworth-type expansion but with a polynomial function depending onn.

2.2. The main lines of the weak spectral method for Markovian models

Consider from now on the following general Markovian setting. Let(E,E)be any measurable space, and let(Xn)n0

be a Markov chain with state spaceEand transition kernel(Qθ(x,·);xE)whereθis a parameter in some setΘ.

The initial distribution of the chain is denoted byμθ (i.e.,X0μθ). The underlying probability measure and the associated expectation are denoted by Pθ,μθ andEθ,μθ. We assume that (Xn)n∈N admits an invariant probability measure denoted byπθ (i.e.,∀θΘ,πθQθ=πθ). Notice that we do not require stationarity for(Xn)n∈N.

Let{ξp(·,·);pP}be a family of measurable functions fromE2intoR, whereP is any set. Let us define the following r.v.

n≥1,∀pP, Sn(p):=

n k=1

ξp(Xk1, Xk). (7)

This kind of (parametric and bivariate) functionals is required when concerning with Markovian M-estimators, as detailed in Section5.

Now we are going to study HypothesesR(m)and (N-A) using the Nagaev–Guivarc’h spectral method. For all t∈R,(θ, p)Θ×PandxE, let us define the Fourier kernel of(Qθ, ξp)by

Qθ,p(t)(x, dy):=eit ξp(x,y)Qθ(x,dy). (8)

As usual, for all bounded measurableC-valued functionf onE, we set Qθ,p(t)f:=

E

f (y)eit ξp(·,y)Qθ(·,dy).

2One difference is thatbθ,pis null in the i.i.d. case.

(6)

It is easy to see that we have from Markov property

t∈R,(θ, p)Θ×P,n≥1, Eθ,μθ eit Sn(p)f (Xn)

=μθ Qθ,p(t)nf . In particular, we obtain

t∈R,(θ, p)Θ×P,n≥1, Eθ,μθ eit Sn(p)

=μθ Qθ,p(t)n1E

, (9)

where1Estands for the function identically equal to 1 onE.

Equality (9) links the characteristic function ofSn(p)to the iterated Fourier operatorQθ,p(t)n. Thus, according to Eq. (9), HypothesisR(m)requires the study of the behavior oftQθ,p(t)nnear 0. A natural assumption to do it is to assume that there exists a Banach spaceBwhich contains the function1E and on which(Qθ)θΘ acts continuously (i.e.,∀θΘ,QθL(B)) and (Qθ)θΘ satisfies the following uniform strong ergodicity properties (ERG.1) and (ERG.2):

(ERG.1) {πθ;θΘ}is bounded inB.

(ERG.2) The transition kernel(Qθ)θΘhas a spectral gap onB(uniformly inθ), that is

n→+∞lim sup

θΘ

QnθΠθ

B=0,

whereΠθ denotes the rank-one projection defined onBbyΠθf :=πθ(f )1E. More precisely, we use the following equivalent form (ERG.2)of (ERG.2):

(ERG.2) There existc0>0 and 0≤κ0<1 (independent onθΘ) such that

θΘ,n∈N, QnθΠθ

Bc0κ0n.

Note that under (ERG.2), for allθΘ, the spectrumσ (Qθ|B)ofQθ acting onBbelongs to the set{z∈C; |z| ≤ κ0} ∪ {1}.

Then, to derive the properties ofR(m)from (9), we need some spectral perturbation method to control (uniformly in(θ, p)Θ×P) the spectrum ofQθ,p(t)acting on Bwhenever|t|is small enough. The usual method requires the continuity att=0 of theL(B)-valued functiontQθ,p(t), but this continuity assumption involves too strong hypotheses (see [18], Section 3, for details). An alternative method consists in using the Keller–Liverani theorem [20,22] (see also [1,9]). Using this method, the regularity ofλθ,p(·),lθ,p(·)andrθ,p,n(·)is studied in [18] in the case ofρ-mixing Markov chains,V-geometrically ergodic Markov chains and for iterated function systems. More exactly, their results are only established for additive univariate functionals of(Xn)n∈N, but the extension to our parametric bivariate case (7) is quite natural. This work has already been done in [17] in the case ofV-geometrically ergodic Markov chains (in Section4, we will directly use their results).

By contrast, as already mentioned in Introduction, the method developed in [18] is not sufficient to investigate Hypothesis (N-A) in our parametric bivariate case. We can all the same easily state the following implication: thanks to (9), provided that the following condition is imposed onθ)θΘ:

{μθ;θΘ}is bounded inB (10)

and that for allt∈R,(θ, p)Θ×P, the operatorQθ,p(t)belongs toL(B), the family{Sn(p):=n

k=1ξp(Xk1, Xk);n∈N, pP}fulfills Hypothesis (N-A) whenever the following condition holds:

Hypothesis (N-A)(Operator-type non-arithmeticity). For any compact subsetK0⊂R,there existsρ <1such that

n≥1, supQθ,p(t)n

B;tK0, (θ, p)Θ×P

=O ρn

.

In the next section, we replace this condition by some more practical non-lattice conditions.

(7)

3. From non-lattice conditions to (N-A)

We assume that the general Markovian assumptions of the previous Section2.2hold true. Furthermore, we also assume thatBis a Banach space of complex measurable functions defined onEwhich contains the function1E, such that for allθΘ,πθBand such that for allt∈R,(θ, p)Θ×P, the Fourier operatorsQθ,p(t)defined in (8) belong to L(B).

Let us introduce the following non-lattice condition which will be proved (under some additional conditions) to imply the previous operator-type non-arithmetic condition (N-A).

Hypothesis (N-L) (Non-lattice). There exist no(θ0, p0)Θ×P,no reala=a(θ0, p0),no closed subgroupH=cZ with c=c(θ0, p0)∈R,no πθ0-full Qθ0-absorbing set3 A=A(θ0, p0)E, and finally no measurable bounded functionα=α(θ0, p0):E→Rsuch that

xA, ξp0(x, y)+α(y)α(x)a+H, Qθ0(x,dy)-a.s. (11) 3.1. Intermediate conditions

The link between (N-L) and (N-A)is based on the three following operator-type properties. The first one concerns a control of the spectral radius ofQθ,p(t)acting onBdenoted byr(Qθ,p(t)|B):

t=0,∀(θ, p)Θ×P, r

Qθ,p(t)|B

<1. (i)

The second property consists in assuming that one has for any compact subsetK0⊂R rK0:=sup

r

Qθ,p(t)|B

;tK0, (θ, p)Θ×P

<1. (ii)

Notice that, whenever (ii) holds true, for all z∈C, |z|> rK0 and for all tK0, for all (θ, p)Θ×P, the resolvent operator (zQθ,p(t))1is well-defined in L(B). Then the last property consists in assuming that there existsρ0∈ [rK0,1)such that, for allρ0,1),

supz−Qθ,p(t)1

B;tK0, (θ, p)Θ×P,|z| =ρ

<+∞. (iii)

Below we study the following implications:

(a) (N-L)⇒(i) under some conditions (and even better: (N-L)⇔(i) under some additional conditions) (b) (i)⇒(ii) and (iii) under some conditions;

(c) (ii) and (iii)⇒(N-A).

The main difficulty is the proof of the statement (b). For this part, three methods are proposed in Section3.3. Notice that the operator-type non-arithmetic condition (N-A)obviously implies property (i).

3.2. From the non-lattice condition(N-L)to property(i)

The following lemma is an easy extension of [18], Lemma 12.1, to our parametric bivariate case.

Lemma 1. Assume that the following assumptions hold true:

1. For allθΘ,λ∈Csuch that|λ| ≥1,and for allfB,f=0,we have

n≥1,|λ|n|f| ≤Qnθ|f|

⇒ |λ| =1and|f| =πθ

|f|

>0πθ-a.s.

.

2. For all (θ, p)Θ×P,t ∈R,there exists 0≤γ =γ (θ, p, t) <1 such that the elements of the spectrum of Qθ,p(t)acting onBwith modulus greater thanγare isolated eigenvalues of finite multiplicity.

3A setAEis said to beπθ0-full ifπθ0(A)=1 andQθ0-absorbing ifQθ0(a, A)=1 for allaA.

(8)

If (N-L)holds true as well,then(i)is fulfilled.Moreover property(i)is equivalent to the following condition:there exist not0∈R, no 0, p0)Θ×P, no λ=λ(θ0, p0, t0)∈C such that|λ| =1, no πθ0-full Qθ0-absorbing set A=A(θ0, p0, t0)Eand finally no boundedw=w(θ0, p0, t0)Bsuch that|w||Ais non-null constant, satisfying

xA, eit0ξp0(x,y)w(y)=λw(x), Qθ0(x,dy)-a.s.

The last property of Lemma1will not be used later, it is only recalled here for a better understanding.

Remark 2. In fact,property(i)is equivalent to(N-L)whenevereBfor all bounded real measurable functionψ onE.Notice that this assumption is obviously fulfilled in theV-geometrically ergodic Markovian model to be studied.

Assumption 1 of Lemma1is always satisfied for strongly ergodic models (cf. (ERG.2)) such thatBis stable under complex moduli and such that for allxE, the Dirac distributionδxatxbelongs toB. In particular, this assumption is satisfied by theV-geometrically ergodic Markovian model to be studied (other conditions are given in [18] to check assumption 1).

Assumption 2 is much more difficult to be checked. For now, we only mention that it is equivalent to the following condition: for all (θ, p)Θ×P, t ∈R, the essential spectral radiusress(Qθ,p(t)|B)of Qθ,p(t)acting on B is such thatress(Qθ,p(t)|B)γ <1. Recall thatQθ,p(t)is said to be quasi-compact onBwheneverress(Qθ,p(t)|B) <

r(Qθ,p(t)|B).

3.3. Three methods for condition(i)to imply(ii)and(iii)

To obtain the implication (i)⇒(ii) and (iii), we can use one of the following three approaches, in which the setsΘ andP are assumed to be compact.

First approach. Using the standard operator perturbation theory, specifically the upper-semi-continuity of the func- tion “spectral radius” (see, e.g., [16], p. 19), one can prove the following statement:

Assume thatQθ,p(t)Qθ0,p0(t0)B→0when(t, θ, p)(t0, θ0, p0). Then the implications(i)⇒(ii)⇒(iii) are true.

However, as already mentioned in Section2.2, the last assumption of continuity oftQθ,p(t)is too restrictive.

That is why we will not apply this approach in this work.

Second approach. It consists in using the perturbation Keller–Liverani theorem instead of the standard perturbation theory. The proof of the following proposition is not provided in this paper since it is an easy extension of [18], Lemma 12.3.

Proposition 2. Assume that there exists some semi normed space Bsuch that for allt ∈Rand (θ, p)Θ×P, Qθ,p(t)belongs toL(B)andBB(i.e.,BBand the identity map is continuous fromBintoB). Furthermore assume that for allt0∈Rand(θ0, p0)Θ×P,there exists a neighborhoodI0of(t0, θ0, p0)such that

(C1) there existc >0, 0≤κ <1andM >0such that for all(t, θ, p)I0,fB,n∈N,one hasQθ,p(t)nfBnfB+cMnfB.

(C2) Qθ,p(t)Qθ0,p0(t0)B,B→0when(t, θ, p)(t0, θ0, p0).

Then the implications(i)⇒(ii)⇒(iii)are true.

This second approach is applied in Section6.4to some AR(d)processes withd≥2.

Note that condition (C2) may be difficult to be checked because of the continuity with respect toθ. However, under the standard dominated model assumption, condition (C2) can be dodged using the following approach.

Third approach. It consists in using quasi-compactness and Ascoli-type arguments. For instance, let us give a brief account for the implication (i)⇒(ii). We assume by absurd that (ii) does not hold and (i) holds true, namely on the one hand there exists a compact subsetK0ofRsuch thatrK0 :=sup{r(Qθ,p(t)|B);tK0, (θ, p)Θ×P} ≥1, and on the other handrK0≤1<+∞. Then there exist some sequences(tk)k∈NK0Nandk, pk)k∈N×P)N

(9)

such that limr(Qθk,pk(tk))≥1 when k→ +∞. Under the quasi-compactness assumption 2 of Lemma1, the previous property implies the existence of k)k∈N∈CN and(wk)k∈NBN such thatQθk,pk(tk)wk=zkwk and

|λk| =r(Qθk,pk(tk)). Finally, from compactness arguments (in particular by using Ascoli theorem), there exist somet˜∈K0,(θ ,˜ p)˜ ∈Θ×P,λ˜∈C,|˜λ| ≥1, andw˜ ∈Bsuch thatQθ ,˜p˜(˜t )w˜ = ˜λw, which is in contradiction with˜ (i). Similar arguments can be used to prove (ii)⇒(iii). In practice, it is easier to use Ascoli theorem when the model is dominated (i.e.,∀xE,θΘ,Qθ(x,dy)=qθ(x, y)μ(dy)) with suitable conditions on the function (qθ)θΘ and on the dominating positive measureμ.

This approach is detailed in Section4.2forV-geometrically ergodic Markov chains.

3.4. From properties(ii)and(iii)to the operator-type non-arithmetic condition(N-A) Lemma 2. Assume that properties(ii)and(iii)hold true.Then(N-A)is fulfilled.

Proof. Let K0⊂R be any compact set and let Γ denote the oriented circle defined by{z∈C; |z| =ρ}where ρ0,1). From the Von Neumann series, we have for alltK0and(θ, p)Θ×P,

z∈C, |z| =ρ

zQθ,p(t)1

=

+∞

n=0

zn1Qθ,p(t)n

and hence, we obtain

tK0,(θ, p)Θ×P,n≥1, Qθ,p(t)n= 1 2iπ

Γ

zn

zQθ,p(t)1

dz.

Then (N-A)can easily be derived thanks to (iii).

3.5. Conclusion

Sections3.2,3.3and3.4give a procedure to derive (N-A)(and so (N-A)) from the non-lattice condition (N-L). In some cases, we may need some even more simple condition than (N-L) to check (N-A). However, notice that this new condition, denoted by (N-L), is not equivalent to (N-L).

Assume thatEis a topological connex set and letE:=B(E)be the associated Borel algebra.

Hypothesis (N-L). For allpP,there exist neither continuous functionAp(·):E→Rnor constantCpsuch that we have for all(x, y)E2,ξp(x, y)=Ap(y)Ap(x)+Cp.

To connect (N-L)with (N-L), we need the following hypotheses on both the model andp)p∈P.

Hypothesis(S). There exists positive measures{μθ;θΘ}onEsatisfyingSupp(μθ)=Efor allθΘand such that we have for anyBE andθΘ:

xE, Qθ(x, B)=0

μθ(B)=0 .

Hypothesis(C). For allpP,the applicationξpis continuous fromE2intoR.

Lemma 3. Assume that Hypothesis(S)and(C)hold true.If the family(ξp)p∈P fulfills(N-L),then(N-L)is fulfilled.

Proof. Assume that (N-L) is not fulfilled, that is we have (11) with some0, p0)Θ×P,a∈R, some closed subgroupH=cZ(c∈R), someπθ0-fullQθ0-absorbing setAE, and finally some bounded measurable function α:E→R. For the sake of simplicity, let us omit the dependence on0, p0). For allxA, there existsExEsuch thatQ(x, Ex)=1 andξ(x, y)+α(y)α(x)a+H for allyEx. Letx0A. One has

yEx0, ξ(x0, y)+α(y)α(x0)a+H.

(10)

Thanks to Hypothesis (S), one has μ(E\Ex0)=0 and μ(E\A)=0 (recall that A is Q-absorbing), and hence μ(E\{AEx0})=0,AEx0⊃Supp(μ)=EwhereAEx0 denotes the closure ofAEx0. In particular,AEx0

is not empty. LetxAEx0, then

yEx0Ex, ξ(x, y)

ξ(x0, y)ξ(x0, x)

a+H.

Let us defineA(x):=ξ(x0, x)andf (x, y):=ξ(x, y)+A(x)A(y). Thenf (x, Ex0Ex)a+Hfor allxAEx0. Then, by continuity arguments and sinceE=Supp(μ)=Ex0Ex, one can easily show thatf (x, E)a+H.

In the same way,f (AEx0, E)a+H, and finallyf (E, E)a+H. Sincef (E, E)is connex and a+H is

discrete,f is constant onE2.

Remark 3. Letμbe a positive measure onE satisfyingSupp(μ)=E.Assume that the following dominated model condition holds:for allθΘ,there exists a non-negative measurable application qθ(·,·)on(E×E,EE)such that for allxE,BE,Qθ(x, B)=

Bqθ(x, y)dμ(y)and for allxEand forμ-almost allyE,qθ(x, y) >0.

Then one can show that Hypothesis(S)holds true.

Remark 4. In the i.i.d.case,Hypothesis(N-L)is never checked under Hypothesis(C).However,we can state another assumption adapted to the i.i.d.case.Indeed consider(Xn)n∈N a sequence of i.i.d.E-valued r.v.whose common distributionπθ depends onθΘ,and{ξp(·);pP}a family of measurable functions fromE intoR.Let us first recall(N-L):

There exist no(θ0, p0)Θ×P,no reala=a(θ0, p0),no closed subgroupH=cZwithc=c(θ0, p0)∈R,no πθ0-full setA=A(θ0, p0)E,and finally no measurable bounded functionα=α(θ0, p0):E→Rsuch that for all xA,ξp0(y)+α(y)α(x)a+H πθ0(dy)-a.s.

Then the following result is obvious using similar arguments as in Lemma3.Assume that E is a topological set and that the family(ξp)p∈Pfulfills the following non-lattice condition:

For allpP andθΘ,there exist neither closed subgroupH=cZwithc=c(θ, p)∈Rnor reala=a(θ, p) such that we haveξp(x)a+H πθ(dx)-a.s.

Then(N-L)is fulfilled.

4. The case of uniformV-geometrically ergodic Markov chains

In this section, we illustrate the previous results for uniformV-geometrically ergodic Markov chains. From now on, for the sake of simplicity, we consider thatE:=Rd(withd∈N), equipped with any norm · , andμLebd denotes the Lebesgue-measure onE. Let us assume thatΘis a compact set. We introduce the uniform (inθΘ)V-geometrically ergodic Markovian model:

Model(M). For allθΘ,there exist both aQθ-invariant probability measure denoted byπθ and an unbounded functionV:E→ [1,+∞)such that

(VG1) supθΘπθ(V ) <+∞,

(VG2) limn→+∞sup{|Qnθf (x)πθ(f )|/V (x);f:E→Cmeasurable,|f| ≤V , xE, θΘ} =0.

Model(M)satisfies properties (ERG.1) and (ERG.2) on the weighted-supremum normed space associated withV. This space, denoted byB1, is the Banach space composed of measurable functionsf:E→Csuch that

fB1 :=sup

xE

|f (x)|

V (x) <+∞. (12)

Model(M)has already been considered for statistical investigation, see, for instance, [6,13,17]. Whenθis fixed and when the Markov chain is irreducible and aperiodic, (VG1) and (VG2) can be checked using the so-called drift- criterion, we refer to [24], Chapter 15.2.2 (V4), for details. Notice that condition (10) on the initial distribution for B=B1is equivalent to the following one:

sup

θΘ

μθ(V ) <+∞. (13)

(11)

In the next Sections4.1and4.2, we consider a family p)p∈P of measurable functions fromE2 intoR, withP assumed to be a compact set, and we successively study HypothesesR(m)and (N-A) for Model(M), before applying these results in Section5toM-estimators.

4.1. Study of HypothesisR(m)

Let us recall the following proposition which has already been proven in [17], Lemma 1.

Proposition 3. Let us consider a Model(M).Assume that,for allpP,ξpis centered with respect to the invariant measure family(πθ)θΘ,4and that assume that(ξp)p∈Pfulfills the following moment domination condition for some m∈N:

ε >0, sup

|ξp(x, y)|m+ε

V (x)+V (y);(x, y)E2, pP

<+∞. (Dm)

Finally assume that the initial distribution family(μθ)θΘsatisfies(13).Then the family{Sn(p):=n

k=1ξp(Xk1, Xk);n∈N, pP}satisfies HypothesisR(m).

Up to the arbitrarily small real number ε >0, condition (Dm) is the expected (with respect to the i.i.d. case) assumption to obtain HypothesisR(m)in our model. Indeed, in [6], condition (D2) is the key assumption to prove the asymptotic normality whereas in [17], condition (D3) is the key assumption to prove Berry–Esseen bounds. Here one also needs to investigate Hypothesis (N-A).

4.2. Study of Hypothesis(N-A)for dominated Models(M)

Further assumptions are required to apply what we called the third approach in Section3.3. Some of them concern the dominated model and the other ones involve the regularity of the applicationsp)p∈P.

Assumption(S). For allθΘ,there exists a measurable applicationqθ(·,·)onE2such that

xE, Qθ(x,dy)=qθ(x, y)μLebd (dy).

Furthermore for allxEand forμLebd -almost allyE,the applicationθqθ(x, y)is continuous and there exists β >0such that

for allθ0Θ,there exists a neighborhoodV1=V10)ofθ0such that

x0E, lim

xx0

sup

θ∈V1

E

V (y)βqθ(x, y)qθ(x0, y)μLebd (dy)=0.

for allx0Eandθ0Θ,there exists a neighborhoodV2=V2(x0, θ0)ofθ0such that

E

V (y)β sup

θ∈V2

qθ(x0, y)μLebd (dy) <+∞.

Assumption(C). The family(ξp)p∈Psatisfies

for allxEand forμLebd -almost allyE,the functionpξp(x, y)is continuous.

for allx0Eandp0P,there exist neighborhoodsV3=V3(x0, p0)ofx0andV4=V4(p0)ofp0,some positive numbersC,υ1andυ2such that we have

pV4,xV3,yE, ξp(x, y)ξp(x0, y)Cxx0υ1V (y)υ2.

4I.e., for all(θ, p)Θ×P,Eθ,πθ[ξp(X0, X1)] =0.

Références

Documents relatifs

of axially symmetric boundary maps p whose smooth harmonic extensions are not energy minimizers.. Our proof reduces to the analysis of a scalar partial differential

But a proof of such a result would require the highly advanced technique of the spectral theory of automorphic functions [2].. Since this would require

The Poincaré-Birkhoff-Witt theorem and the Verma module construction In this section we construct a basis for d q and Pol(Uq(n)) by proving a Poincaré-.. Birkhoff-Witt

The author was not aware of the existence of this paper when the paper referred to in footnote 1 was published, but the functor En-2 considered by McKinsey was not,

This paper consists of two parts sections 2-5 are concerned with the estimations of the best constant involving the L 2 norm in Wente's inequality, the remainder is devoted to search

Thus our result can be used in adaptive estimation of quadratic functionals in a density model (see for example [4] where white noise is treated), where the theory needs good bounds

DORR, The approximation o f solutions of elliptic boundary-values problems via the p-version ofthe finite element method, SIAM J. BABUSKA,' The h-p version of the finite element

— Systems of partial differential equations, Cauchy problem, hyperbolicity, strong hy- perbolicity, symmetrizers, energy estimate, local uniqueness, finite speed of