• Aucun résultat trouvé

A uniform Berry-Esseen theorem on M-estimators for geometrically ergodic Markov chains

N/A
N/A
Protected

Academic year: 2021

Partager "A uniform Berry-Esseen theorem on M-estimators for geometrically ergodic Markov chains"

Copied!
29
0
0

Texte intégral

(1)

HAL Id: hal-00563638

https://hal.archives-ouvertes.fr/hal-00563638

Submitted on 7 Feb 2011

HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires

A uniform Berry-Esseen theorem on M-estimators for geometrically ergodic Markov chains

Loïc Hervé, James Ledoux, Valentin Patilea

To cite this version:

Loïc Hervé, James Ledoux, Valentin Patilea. A uniform Berry-Esseen theorem on M-estimators for geometrically ergodic Markov chains. Bernoulli, Bernoulli Society for Mathematical Statistics and Probability, 2012, 18 (2), pp.703-734. �10.3150/10-BEJ347�. �hal-00563638�

(2)

A uniform Berry-Esseen theorem on M −estimators for geometrically ergodic Markov chains

Loïc HERVÉ, James LEDOUX, Valentin PATILEA September 7, 2010

Abstract

Let {Xn}n≥0 be a V-geometrically ergodic Markov chain. Given some real-valued functional F, define Mn(α) := n−1Pn

k=1F(α, Xk−1, Xk), α ∈ A ⊂ R. Consider an M−estimatorαbn, that is as a measurable function of the observations satisfyingMn(bαn) minα∈AMn(α) +cn with {cn}n≥1 some sequence of real numbers going to zero. Under some standard regularity and moment assumptions, close to those of the i.i.d. case, the estimatorαbn satisfies a Berry-Esseen theorem uniformly with respect to the underlying probability distribution of the Markov chain.

AMS subject classification : 62F12, 62M05, 60F05, 60J05 Keywords : spectral method

I Introduction

Let(E,E) be a measurable space with E a countably generated σ-field, and let{Xn}n≥0 be a Markov chain with state space E and transition kernels {Qθ(x,·) : x E} where θ is a parameter in some general set Θ. The initial distribution of the chain, i.e. the probability distribution of X0, is denoted by µ and may or may not depend on θ. Although {Xn}n≥0 does not need to be the canonical version, we use the standard notation Pθ,µ to refer to the probability distribution of {Xn}n≥0 (and Eθ,µ for the expectation w.r.t. Pθ,µ). We consider that {Xn}n≥0 is a V-geometrically ergodic Markov chain, where V : E→[1,+∞) is some fixed unbounded function. This class of Markov chains is large enough to cover interesting applications (see [MT93, §16.4,§16.5]).

The parameter of interest is α0 =α0(θ)⊂ A, where α0(·) is a function of the parameterθ and Ais an open interval of R. To estimate α0, let us introduce the statistic

Mn(α) := 1 n

Xn k=1

F(α, Xk−1, Xk), (1)

whereF is a real-valued measurable functional onA ×E2. We define anM−estimator1 to be a random variableαbn depending on the observations(X0, . . . , Xn)such that

Mn(αbn)min

α∈AMn(α) +cn,

Université Europénne de Bretagne, INSA-IRMAR, UMR-CNRS 6625. Institut National des Sciences Appliquées de Rennes, 20, Avenue des Buttes de Cöesmes CS 14315, 35043 Rennes Cedex, France. Loic.Herve, James.Ledoux, Valentin.Patilea@insa-rennes.fr

1This is slightly more general than the usual definition ofM−estimators or minimum contrast estimators, wherecn= 0, see [Arc98].

(3)

where{cn}n≥1 is a sequence of non-negative real numbers going to zero to be specified later.

Assume that for allθΘ

Mθ(α) := lim

n→∞Eθ,µ[Mn(α)]

is well defined everywhere on A and does not depend on µ. In addition, assume that there exists a unique “true” valueα0of the parameter of interest, that isMθ0)< Mθ(α),∀α6=α0. We want to prove the followinguniform Berry-Esseen bound forαbn

sup

θ∈Θ

sup

u∈R

¯¯

¯¯Pθ,µ

½ n

τ(θ)(bαnα0)u

¾

Γ(u)

¯¯

¯¯=O µ 1

n

, (BE)

where Γ denotes the standard normal distribution function, and τ(θ) is some positive real number defined in Theorem 3.

To derive (BE), we use Pfanzagl’s approach [Pfa71]. Besides technical assumptions, this approach relies on several ingredients. First, we need the uniform consistency condition:

(UC) ∀d >0, supθ∈ΘPθ,µ©

|bαnα0| ≥dª

=O(1/ n).

Second, consider the following two convergence properties: ifSn0) =Pn

k=1ξ(α0, Xk−1, Xk) withξ(α0, Xk−1, Xk) centered,

(a) the sequence ©

Eθ,µ[Sn20)]/nª

n≥1 converges to a real number σ2(θ);

(b) there exists a positive constant B(ξ) such that for any n1 sup

θ∈Θ

sup

u∈R

¯¯

¯¯Pθ,µ

½ Sn0) σ(θ)

n u

¾

Γ(u)

¯¯

¯¯ B(ξ)

n .

The properties (a) and (b) will be required for certain ξ(α0, x, y) defined as linear combi- nations of some functionals related to F. To obtain (a) and (b) for such ξ(α0, x, y)’s with V-geometrically ergodic Markov chains, a natural moment (or V−domination) condition is used: there exist positive constantsCξ andm such that

∀(x, y)E2,∀α∈ A, |ξ(α, x, y)|m Cξ(V(x) +V(y)). (2) The paper is organized as follows. In Section II, an extended version of Pfanzagl’s theorem [Pfa71, Th 1] is stated for any sequence of observations, not necessarily markovian. Sec- tion III is devoted to a Berry-Esseen bound for the additive functionalPn

k=1ξ(α0, Xk−1, Xk) of aV-geometrically ergodic Markov chain{Xn}n≥0 withξ satisfying Inequality (2). In Sub- section III.2, we prove that the properties (a) and (b) are fulfilled when Inequality (2) holds with the (almost expected) orderm, namely: m >2 (a), and m >3 (b). These results follow from the weak spectral method based on the theorem of Keller and Liverani [KL99].

This approach, introduced in [HH04], is fully described in [HP10] in the Markov context (see also [GL06, Gou08] and other references given in [HP10]). It is important to notice that Pfan- zagl’s method requires the precise control of the constantB(ξ)in Property (b) as a function of the size of ξ. The present operator-type approach shows that B(ξ) depends only on the constant Cξ in Inequality (2). Thanks to these preliminary results, in Section IV we prove our main statement, that is:

(4)

(R) under some technical assumptions and the uniform consistency condition (UC), if two functionalsF0 andF00 related to F (in the basic caseF0 andF00 are the first and second order derivatives of F with respect to α) satisfy Inequality (2) for some m > 3 and constantsCF0, CF00 that do not depend onα, thenαbn satisfies property (BE).

To the best of our knowledge, the result(R)is new. It completes the central limit theorem for {bαn}n≥1 proved in [DY07] when Inequality (2) holds withm= 2. The domination condition (2) required by(R)is almost optimal in the sense that we imposem >3 in place of the best possible value m= 3 obtained in the i.i.d. case. In Section V, our results are applied to the AR(1) process with ARCH (AutoRegressive Conditional Heteroscedastic) of order 1 errors.

The paper ends with a conclusion section.

Let us close the introduction with a brief review of previous related work in the litera- ture. In [Pfa71], {Xn}n∈N is a sequence of i.i.d. random variables and Pfanzagl proved a Berry-Esseen theorem for minimum contrast estimators (which are special instances of M- estimators) associated with functionals of the formF(α, Xk). In [Pfa71], the moment condi- tions onF0 :=∂F/∂α,F00:=2F/∂α2 are the expected ones since the property (b) is fulfilled under the expected third moment condition [Fel71, Chap. XVI]. Using convexity arguments, Bentkuset al. [BBG97] proposed an alternative method for deriving Berry-Esseen bounds for M−estimators with i.i.d. data. In the Markov context, the method proposed by Pfanzagl is extended, first by Rao to cover the case of uniformly ergodic Markov chains [Rao73], second in [MR89] to the case of the linear autoregressive model. However, their assumptions to get (BE) include much stronger moment conditions involving both the functional F and the Markov chain. Here, as already mentioned, the weak spectral method of [HP10] enables us to have an (almost) optimal treatment of (a) and (b), and hence an improved Berry-Esseen result (BE).

II The Pfanzagl method revisited

We state and prove a general result that allows to derive uniform Berry-Esseen bounds for M−estimators. This result is an extended version of Theorem 1 in [Pfa71] and is applied to our Markov context in Section IV.

II.1 The result

Consider a statistical model ¡

Ω,F,{Pθ, θ Θ}¢

, where Θ denotes some parameter space, and let {Xn}n≥0 be any sequence of observations (not necessarily Markovian). Let us denote the expectation with respect toPθ byEθ.

For each n, letMn(α)be a measurable functional of the observationsX0, . . . , Xn and the parameter of interestα∈ AwhereAis some open interval ofR. Let{cn}n≥1be a sequence of non-negative real numbers going to zero at some rate to be specified later. AnM−estimator is a measurable function αbn of the observations(X0, . . . , Xn) such that

Mn(αbn)min

α∈AMn(α) +cn. (3)

This is the usual definition of minimum contrast estimators as soon ascn0.

(5)

Assumptions. Suppose that for all n 1 and α ∈ A, there exist Mn0(α), Mn00(α) some measurable functions depending on X0, X1, ..., Xn and on the parameter of interest such that the following properties hold true:

(A1) ∀θ Θ, there exists a unique α0 =α0(θ) ∈ A such that Mθ00) = 0 where Mθ0(α) :=

limn→∞Eθ[Mn0(α)] (the limit is assumed to be well defined for all(θ, α)Θ× A);

(A2) 0<infθ∈Θm(θ) supθ∈Θm(θ) < where m(θ) := limn→∞Eθ[Mn000)] (the limit is assumed to be well defined for all θ);

(A3) for every n1, there existsrn>0 independent of θ such that rn=o(n−1/2) and sup

θ∈Θ

Pθ©

|Mn0(αbn)| ≥rnª

=O(n−1/2);

(A4) forj= 1,2, there exists a functionσj(·)such that0<infθ∈Θσj(θ)supθ∈Θσj(θ)< and there exists a positive constant B such that for alln1

sup

θ∈Θ

sup

u∈R

¯¯

¯¯Pθ

½ n

σ1(θ)Mn00)u

¾

Γ(u)

¯¯

¯¯ B

n, sup

θ∈Θ

sup

u∈R

¯¯

¯¯Pθ

½ n σ2(θ)

¡Mn000)m(θ)¢

u

¾

Γ(u)

¯¯

¯¯ B

n; (A4’) for n1, |u| ≤2

lnn, andθΘ, there is a positive number σn,u(θ) such that

n,u(θ)σ1(θ)| ≤A0 |u|

n,

¯¯

¯¯Pθ

½ n σn,u(θ)

µ

Mn00) + u σ1(θ)

n m(θ)

¡Mn000)m(θ)¢

u

¾

Γ(u)

¯¯

¯¯ B0

n

with some positive constantsA0, B0 independent of n, u, θ;

(A5) for any (α, α0)∈ A2, letRn(α, α0) be defined by the equation Mn00) =Mn0(α) + [Mn00(α) +Rn(α, α0)](α0α).

For each n, there existωn0 and a real-valued measurable functionWn depending on X0, . . . , Xn, both independent ofθ, such that ωn=o(1)and

∀(α, α0)∈ A2, |Rn(α, α0)| ≤©

α0|+ωnª Wn, and there is a constantcW >0 such that

sup

θ∈Θ

Pθ{cW Wn}=O(n−1/2).

(A6) αbn is assumed to be uniformly consistent, that is there existsγn=o(1)such that sup

θ∈Θ

Pθ©

|bαnα0| ≥dª

γn,

where d:= infθ∈Θm(θ)/8cW with cW and m(θ) defined in (A5) and (A2) respectively.

(6)

Let us comment on these assumptions. Condition (A1) identifies the true value of the parameter. In Conditions (A1) and (A2), the expectations Eθ[Mn0(α)] and Eθ[Mn000)] may depend on n, like for instance in the Markovian framework considered in the sequel when the initial distribution is not the stationary distribution. Condition (A3) ensures that the estimator (approximately) satisfies a kind of first order condition. Such a condition allows to take into account the numerical errors we are faced when computingαbn. It may also be useful when the estimator of the parameter α0 depends on some “nuisance” parameters (see the example in the second part of Section V). Conditions (A4) and (A4’) are the uniform Berry- Esseen bounds forMn00),Mn000) and for some of their linear combinations. The identity definingRn(α, α0) in Condition (A5) is guaranteed by a Taylor expansion when the criterion Mn(α) is twice differentiable with respect toα. In this caseMn0 and Mn00 are nothing else but the first and second order derivatives ofMnwith respect to α. The reminder Rn(α, α0) must satisfy a Lipschitz condition. For instance, when ωn = 0, this holds true if α 7→ Mn(α) is three times continuously differentiable with a bounded third order derivative. Condition (A6) is a standard consistency condition (e.g. see [BBG97]). General sufficient conditions for (A6) withγn=O(n−1) have been proposed in the case of i.i.d. observations or uniformly ergodic Markov chains (see [MP71, Lemma 4] and [Rao73, Lemma 4.1] respectively). Such general arguments can easily be adapted to the geometrically ergodic Markov chain framework. In specific examples, like the one investigated in Section V, Condition (A6) can be checked by direct arguments.

The proof of Theorem 1, which adapts the arguments of [Pfa71], is given in Subsection II.2.

Theorem 1 Under Conditions (A1-A6), there exists a positive constant C such that

∀n1, sup

θ∈Θ

sup

u∈R

¯¯

¯¯Pθ

½ n

τ(θ)(αbnα0)u

¾

Γ(u)

¯¯

¯¯C µ 1

n+

nrn+ωn+γn

(4) with τ(θ) :=σ1(θ)/m(θ).

To obtain the classical orderO(n−1/2)of the Berry-Esseen bound, one needs γn=O(n−1/2), rn =O(n−1) and ωn =O(n−1/2). Note that this usually requires that the sequence{cn}n≥1 in (3) decreases at the rate n−3/2. This is to be compared to the rate n−1 that is usually required to obtain the asymptotic normality of M−estimators (see [Arc98]).

Remark 1 A close inspection of the proof of Theorem 1 below , shows that the constant C in inequality (4) can be tracked provided that theO(·) ando(·) rates in Assumptions (A3)-(A6) are more explicit. For the sake of brevity, we only consider the case wherecn=rn=ωn= 0, α(θ) = θ and (A3) is: for any n 1, |Mn0(bθn)| = 0. The constants C in the various inequalities of Assumptions (A4)-(A6) are denoted byC1, C2 in (A4), C3, C4 in (A4’),C5 in (A5) and we choose γnC6n−1/2 in (A6). Then, we can obtain from Propositions 1-2 that

∀n1, sup

θ∈Θ

¯¯

¯¯Pθ

½ n

τ(θ)(bαnα0)u

¾

Γ(u)

¯¯

¯¯ C

n where C := 12 +1 + 2C1+ 2C2+exp(−aa2/2) +C5+C6 when |u| ≥2

lnn;

or C:= 2£ 1

+ 2C1+ 4C2+ 2exp(−aa2/2)+ 2C5+C6¤

+C4+16e−1(C32cW)

σ1

when|u|<2

lnn provided thatp

n/lnnmax¡

8cWσ2,4¢ 1; with a:= infθ∈Θ¡

m(θ)/4σ2(θ)¢

, σ:= supθ∈Θσ1(θ)/m(θ),σ1 = infθ∈Θσ1(θ).

(7)

II.2 Proof of Theorem 1.

The hypotheses of Theorem 1 are assumed to hold. For the sake of brevity, the sequence {rn}n≥1 in (A3) is supposed to be such that rn = o(n−1/2), and |Mn0(bαn)| ≤ rn for every n 1. In the general case, it suffices to work on the event {|Mn0(αbn)| ≤ rn} and to bound the probability of the event {|Mn0(bαn)|> rn}using (A3). From Conditions (A2) and (A4), τ(θ) := σ1(θ)

m(θ), m:= inf

θ∈Θm(θ), m:= sup

θ∈Θ

m(θ), σj := inf

θ∈Θσj(θ), σj := sup

θ∈Θ

σj(θ) (j = 1,2) are well defined. Recall that0< mm < and 0< σj σj <∞. Note that the function τ(·)is positive and bounded. In the following,C denotes a positive constant whose value may be different from line to line.

Inequality (4) is proved, first for |u| ≥ 2

lnn, second for |u| < 2

lnn. In fact, for

|u| ≥2

lnn, the bound in Inequality (4) does not involve rn and ωn.

Proposition 1 There exists a positive constant C such that for each n 1 and all u R such that |u| ≥2

lnn sup

θ∈Θ

¯¯

¯¯Pθ

½ n

τ(θ)(bαnα0)u

¾

Γ(u)

¯¯

¯¯ C

n +γn. (5) Proof. For |u| ≥2

lnn, it is easily checked that

¯¯

¯¯Pθ

½ n

τ(θ)(αbnα0)u

¾

Γ(u)

¯¯

¯¯Pθ

½ n τ(θ)

¯¯bαnα0¯

¯2 lnn

¾

+ Γ(−2 lnn).

Now,

Γ(−2

lnn) 1 2

lnn

1

Z +∞

2 lnn

v ev22 dv= 1 2

lnn

1

1 n2. Finally, the proof is complete if there existsC >0 such that (see [MP71, Lem. 6])

∀n1, sup

θ∈Θ

Pθ

½ n τ(θ)

¯¯bαnα0¯

¯>2 lnn

¾

C

n +γn. (6) It follows from (A5) and (A3) that|Mn00)|+rn≥ |bαn−α0| |Mn000)+Rn(bαn, α0)|. Then,

n σ1(θ)

¯¯bαnα0¯¯>2

lnn m(θ) =

n σ1(θ)

¡¯¯Mn00)¯¯+rn¢

>2

lnn m(θ)

¯¯Mn000) +Rn(bαn, α0)¯¯

provided that Mn0(αbn) 6= Mn00). Next, introducing the event ©

2|Mn000) +Rn(αbn, α0)|>

m(θ)ª

and its complement (which includes the event {Mn0(αbn) =Mn00)}), we obtain Pθ

½ n τ(θ)

¯¯bαnα0¯

¯>2 lnn

¾

Pθ

½ n

σ1(θ){|Mn00)|+rn}> lnn

¾

+Pθ©

2|Mn000) +Rn(αbn, α0)| ≤m(θ)ª .

It is easily checked from (A4) andrn=o(n−1/2) that sup

θ∈Θ

Pθ

½ n

σ1(θ){|Mn00)|+rn}> lnn

¾

=O µ 1

n

+ 2 Γ

µ

lnn+

nrn σ1(θ)

=O µ 1

n

.

Références

Documents relatifs

In this section, the weak spectral method is applied to the three classical models cited in Section 1: the v-geometrically ergodic Markov chains, the ρ-mixing Markov chains, and

The goal of this paper is to show that the Keller-Liverani theorem also provides an interesting way to investigate both the questions (I) (II) in the context of geometrical

As an exception, let us mention the result of Adamczak, who proves in [Ada08] subgaussian concentration inequalities for geometrically ergodic Markov chains under the

Closely linked to these works, we also mention the paper [Log04] which studies the characteristic function of the stationary pdf p for a threshold AR(1) model with noise process

The goal of this article is to combine recent ef- fective perturbation results [20] with the Nagaev method to obtain effective con- centration inequalities and Berry–Esseen bounds for

The quantities g a (i, j), which appear in Theorem 1, involve both the state of the system at time n and the first return time to a fixed state a. We can further analyze them and get

Keywords: Berry-Esseen bounds, Central Limit Theorem, non-homogeneous Markov chain, Hermite-Fourier decomposition, χ 2 -distance.. Mathematics Subject Classification (MSC2010):

The rate of convergence in Theorem 1.1 relies crucially on the fact that the inequality (E r ) incorporates the matching moments assumption through exponent r on the