• Aucun résultat trouvé

Nonparametric regression estimation for random fields in a fixed-design

N/A
N/A
Protected

Academic year: 2021

Partager "Nonparametric regression estimation for random fields in a fixed-design"

Copied!
22
0
0

Texte intégral

(1)

HAL Id: hal-00004096

https://hal.archives-ouvertes.fr/hal-00004096

Submitted on 4 Feb 2005

HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de

Nonparametric regression estimation for random fields in a fixed-design

Mohamed El Machkouri

To cite this version:

Mohamed El Machkouri. Nonparametric regression estimation for random fields in a fixed-design.

Statistical Inference for Stochastic Processes, Springer Verlag, 2007. �hal-00004096�

(2)

ccsd-00004096, version 1 - 4 Feb 2005

Nonparametric regression estimation for random fields in a fixed-design

Mohamed EL MACHKOURI 4th February 2005

Abstract

We investigate the nonparametric estimation for regression in a fixed-design setting when the errors are given by a field of dependent random variables. Sufficient conditions for kernel estimators to con- verge uniformly are obtained. These estimators can attain the optimal rates of uniform convergence and the results apply to a large class of random fields which contains martingale-difference random fields and mixing random fields.

AMS Subject Classifications (2000): 60G60, 62G08

Key words and phrases: nonparametric regression estimation, kernel estimators, strong consistency, fixed-design, exponential inequalities, martingale difference random fields, mixing, Orlicz spaces.

Short title: Nonparametric regression in a fixed design.

(3)

1 Introduction

Over the last few years nonparametric estimation for random fields (or spatial processes) was given increasing attention stimulated by a growing demand from applied research areas (see Guyon [18]). In fact, spatial data arise in various areas of research including econometrics, image analysis, meterology, geostatistics... Our aim in this paper is to investigate uniform strong con- vergence rates of a regression estimator in a fixed design setting when the errors are given by a stationary field of dependent random variables which show spatial interaction. We are most interested in conditions which ensure convergence rates to be identical to those in the case of independent errors (see Stone [33]). Currently the author is working on extensions of the present results to the random design framework. Let Zd, d ≥ 1 denote the integer lattice points in the d-dimensional Euclidean space. By a stationary real random field we mean any family (εk)k∈Zd of real-valued random variables defined on a probability space (Ω,F,P) such that for any (k, n)∈Zd×N and any (i1, ..., in) ∈ (Zd)n, the random vectors (εi1, ..., εin) and (εi1+k, ..., εin+k) have the same law. The regression model which we are interested in is

Yi =g(i/n) +εi, i∈Λn={1, ..., n}d (1) wheregis an unknown smooth function and (εi)i∈Zdis a zero mean stationary real random field. Note that this model was considered also by Bosq [8] and Hall et Hart [19] for time series (d= 1). LetK be a probability kernel defined on Rd and (hn)n≥1 a sequence of positive numbers which converges to zero and which satisfies (nhn)n≥1 goes to infinity. We estimate the function g by the kernel-type estimator gn defined for anyx in [0,1]d by

gn(x) = P

i∈ΛnYiK

x−i/n hn

P

i∈ΛnK

x−i/n hn

. (2) Note that Assumption A1) in section 2 ensures thatgnis well defined. Until now, most of existing theoretical nonparametric results of dependent random variables pertain to time series (see Bosq [9]) and relatively few generaliza- tions to the spatial domain are available. Key references on this topic are Biau [5], Carbon et al. [10], Carbon et al. [11], Hallin et al. [20], [21], Tran [34], Tran and Yakowitz [35] and Yao [36] who have investigated nonpara- metric density estimation for random fields and Altman [2], Biau and Cadre [6], Hallin et al. [22] and Lu and Chen [25], [26] who have studied spatial prediction and spatial regression estimation. The classical asymptotic theory in statistics is built upon central limit theorems, law of large numbers and

(4)

large deviations inequalities for the sequences of random variables. These classical limit theorems have been extended to the setting of spatial pro- cesses. In particular, some key results on the central limit theorem and its functional versions are Alexander and Pyke [1], Bass [3], Basu and Dorea [4], Bolthausen [7] and more recently Dedecker [12], [13], El Machkouri [16]

and El Machkouri and Voln´y [17]. For a survey on limit theorems for spa- tial processes and some applications in statistical physics, one can refer to Nahapetian [28]. Note also that the main results (section 3) of this work are obtained via exponential inequalities for random fields discovered by El Machkouri [16].

The paper is organized as follows. The next section sets up the notations and the assumptions which will be considered in the sequel. In section 3, we present our main results on both weak and strong consistencies rates of the estimator gn. The last section is devoted to the proofs.

2 Notations and Assumptions

In the sequel we denotekxk= max1≤k≤d|xk| for anyx= (x1, ..., xd)∈[0,1]d. With a view to obtain optimal convergence rates for the estimator gn defined by (2), we have to make the following assumptions on the regression function g and the probability kernel K:

A1) The probability kernel K is symmetric, nonnegative, supported by [−1,1]d and satisfies a Lipschitz condition |K(x)−K(y)| ≤ ηkx−yk for anyx, y ∈[−1,1]dand someη >0. In addition there exists c, C >0 such that c≤K(x)≤C for anyx∈[−1,1]d.

A2) There exists a constant B >0 such that |g(x)−g(y)| ≤Bkx−yk for any x, y ∈[0,1]d, that is g isB-Lipschitz.

A Young function ψ is a real convex nondecreasing function defined on R+ which satisfies limt→∞ψ(t) = +∞and ψ(0) = 0. We define the Orlicz space Lψ as the space of real random variables Z defined on the probability space (Ω,F,P) such that E[ψ(|Z|/c)] < +∞ for some c > 0. The Orlicz space Lψ equipped with the so-called Luxemburg norm k.kψ defined for any real random variable Z by

kZkψ = inf{c >0 ; E[ψ(|Z|/c)]≤1}

is a Banach space. For more about Young functions and Orlicz spaces one can refer to Krasnosel’skii and Rutickii [24]. Let β > 0. We denote by ψβ

(5)

the Young function defined for any x∈R+ by

ψβ(x) = exp((x+ξβ)β)−exp(ξββ) where ξβ = ((1−β)/β)1/β11{0<β<1}. On the latticeZdwe define the lexicographic order as follows: ifi= (i1, ..., id) and j = (j1, ..., jd) are distinct elements of Zd, the notation i <lex j means that either i1 < j1 or for some p in {2,3, ..., d}, ip < jp and iq = jq for 1≤q < p. Let the sets {Vik; i∈Zd, k ∈N} be defined as follows:

Vi1 ={j ∈Zd; j <lex i}, and for k ≥2

Vik=Vi1∩ {j ∈Zd;|i−j| ≥k} where |i−j|= max

1≤l≤d|il−jl|. For any subset Γ of Zd defineFΓ =σ(εi; i∈Γ) and set

E|k|i) = E(εi|FV|k|

i ), k ∈Vi1.

Denoteβ(q) = 2q/(2−q) for 0< q <2 and consider the following conditions:

C1) ε0 ∈L and

X

k∈V01

kE|k|0)k<∞.

C2) There exists 0< q <2 such that ε0 ∈Lψβ(q) and X

k∈V01

q

kE|k|0)|

2 ψβ(q)

<∞.

C3) There existsp >2 such thatε0 ∈Lp and X

k∈V01

kE|k|0)kp2 <∞.

C4) ε0 ∈L2 and P

k∈Zd|E(ε0εk)|<∞.

Remark 1 Note that Dedecker [12] established the central limit theorem for any stationary square-integrable random field (εk)k∈Zd which satisfies the condition P

k∈V01kE|k|0)k1<∞.

(6)

In classical statistical physics, there exists spatial processes which satisfy conditions C1),...,C4). For example, Nahapetian and Petrosian [29] gave sufficient conditions for a Gibbs field (εk)k∈Zd to possess the following mar- tingale difference property: for any i in Zd, E(εi|FVi1) = 0 a.s. Another examples of random fields which satisfy conditions C1),...,C4) can be found also among the class of mixing random fields. More precisely, given two sub-σ-algebras U and V of F, different measures of their dependence have been considered in the literature. We are interested by two of them. The α-mixing and φ-mixing coefficients had been introduced by Rosenblatt [31]

and Ibragimov [23] respectively and can be defined by

α(U,V) = sup{|P(U∩V)−P(U)P(V)|, U ∈ U, V ∈ V}

φ(U,V) = sup{kP(V|U)−P(V)k, V ∈ V}.

We have 2α(U,V) ≤ φ(U,V) and these coefficients equal zero if and only if the σ-algebrasU andV are independent. Denote by♯Γ the cardinality of any subset Γ of Zd. In the sequel, we shall use the following non-uniform mixing coefficients defined for any (k, l, n) in (N∪ {∞})2×Nby

αk,l(n) = sup{α(FΓ1,FΓ2), ♯Γ1 ≤k, ♯Γ2 ≤l, ρ(Γ12)≥n}, φk,l(n) = sup{φ(FΓ1,FΓ2), ♯Γ1 ≤k, ♯Γ2 ≤l, ρ(Γ12)≥n},

where the distance ρ is defined by ρ(Γ12) = min{|i−j|, i ∈ Γ1, j ∈ Γ2}. We say that the random field (εk)k∈Zd isα-mixing orφ-mixing if there exists a pair (k, l) in (N∪ {∞})2such that limn→∞αk,l(n) = 0 or limn→∞φk,l(n) = 0 respectively. For more about mixing coefficients one can refer to Doukhan [15]. We consider the following mixing conditions:

C1)ε0 ∈L and

X

k∈Zd

φ∞,1(|k|)<∞. C2) There exists 0< q <2 such that ε0 ∈Lψβ(q) and

X

k∈Zd

q

φ∞,1(|k|)<∞

or X

k∈Zd

c2k(β(q))<∞ where for any β >0

ck(β) = inf (

c >0

Z α1,∞(|k|) 0

ψβ

Qε0(u) c

du≤1 )

. (3)

(7)

C3) There exists p >2 such that ε0 ∈Lp and X

k∈Zd

Z α1,∞(|k|) 0

Qpε0(u)du

!2/p

<∞ (4)

where Qε0 is the inverse cadlag of the tail function t → P(|ε0|> t) (i.e. for any u≥0, Qε0(u) = inf{t >0|P(|ε0|> t)≤u}).

Remark 2 Let us note that if p= 2 +δ for some δ >0 then the condition

X

m=1

md−1α

δ 2+δ−ε

1,∞ (m)<∞ for some ε >0

is more restrictive than condition (4) and is known to be sufficient for the ran- dom field (εk)k∈Zd to satisfy a functional central limit theorem (cf. Dedecker [13]).

In statistical physics, using the Dobrushin’s uniqueness condition (cf. [14]), one can construct Gibbs fields satisfying a uniform exponential mixing con- dition which is more restrictive than conditions C1), C2) and C3) (see Guyon [18], theorem 2.1.3, p. 52).

3 Main results

Let (Zn)n≥1 be a sequence of real random variables and (vn)n≥1be a sequence of positive numbers. We say that

Zn=Oa.s.[vn] if there exists λ >0 such that

lim sup

n→∞

|Zn|

vn ≤λ a.s.

Our main result is the following.

Theorem 1 Assume that the assumptionA1) holds.

1) If C1) holds then

sup

x∈[0,1]d|gn(x)−Egn(x)|=Oa.s.

(logn)1/2 (nhn)d/2

. (5)

(8)

2) If C2) holds for some 0< q <2 then sup

x∈[0,1]d|gn(x)−Egn(x)| =Oa.s.

(logn)1/q (nhn)d/2

. (6)

3) Assume that C3)holds for some p >2andhn =n−θ2(logn)θ1 for some θ1, θ2 >0. Leta, b≥0 be fixed and denote

vn = na(logn)b

(nhn)d/2 and θ = 2a(d+p)−d2−2 d(3d+ 2) . If θ ≥θ2 and d(3d+ 2)θ1+ 2(d+p)b > 2 then

sup

x∈[0,1]d|gn(x)−Egn(x)|=Oa.s.[vn]. (7) Remark 3 Theorem 1 shows that the optimal uniform convergence rate is obtained for bounded errors (cf. estimation (5)) and that it is “almost”

optimal if one considers errors with only finite exponential moments (cf.

estimation (6)).

Theorem 2 Assume that the assumption A1) holds.

1) Assume that C3) holds for somep > 2. Let a >0 be fixed and denote vn = na

(nhn)d/2 and θ = 2a(d+p)−d2 d(3d+ 2) . If θ > 0 and hn≥n−θ then

sup

x∈[0,1]d|gn(x)−Egn(x)| p

=O[vn]. (8)

2) If C4) holds then sup

x∈[0,1]dkgn(x)−Egn(x)k2 =O

(nhn)−d/2

. (9)

In the sequel, we denote by Lip(B) the set of B-Lipschitz functions. The following proposition gives the convergence of Egn(x) to g(x).

Proposition 1 Assume that the assumption A2) holds then sup

x∈[0,1]d

sup

g∈Lip(B)|Egn(x)−g(x)|=O[hn].

(9)

From Proposition 1 and Theorem 1 we derive the following corollary.

Corollary 1 Assume thatA1)andA2)hold and lethn = n−dlogn1/(2+d)

.

1) If C1) holds then

sup

x∈[0,1]d

sup

g∈Lip(B)|gn(x)−g(x)|=Oa.s.

"

logn nd

2+d1 #

. (10)

2) If C2) holds for some 0< q <2 then sup

x∈[0,1]d

sup

g∈Lip(B)|gn(x)−g(x)|=Oa.s.

"

u(n)

logn nd

2+d1 #

(11) where u(n) = (logn)(2−q)/2q.

3) Let ε >0 be fixed. If C3) holds for some p >2 satisfying p≥ 4d3+ (4−2ε)d2+ (2−4ε)d+ 4

2ε(2 +d) (12)

then sup

x∈[0,1]d

sup

g∈Lip(B)|gn(x)−g(x)|=Oa.s.

"

u(n)

logn nd

2+d1 #

(13) where u(n) =nε.

Remark 4 Note that the consistency rate (n−dlogn)1/(2+d) is known to be the optimal one (see Stone [33]).

From Proposition 1 and Theorem 2 we derive the following corollary.

Corollary 2 Assume that A1) and A2) hold and let hn=n−d/(2+d). 1) Let ε >0 be fixed. If C3) holds for some p >2 satisfying

p≥ 4d3+ (4−2ε)d2−4εd

2ε(2 +d) (14)

then

sup

x∈[0,1]d

sup

g∈Lip(B)|gn(x)−g(x)| p

=Oh

n2+dd i

. (15)

(10)

2) If C4) holds then

sup

x∈[0,1]d

sup

g∈Lip(B)|gn(x)−g(x)| 2

=Oh

n2+dd i

. (16)

Finally the rates of convergence obtained above are valid when the errors are given by a mixing random field. More precisely, we have the following corollary.

Corollary 3 Theorems 1 and 2 and Corollaries 1 and 2 still hold if one replace conditions C1), C2) and C3) by conditions C1), C2) and C3) respectively.

4 Proofs

For any x in [0,1]d and any integer n ≥1 we define Bn(x) = Egn(x)−g(x) and Vn(x) =gn(x)−Egn(x). More precisely

Bn(x) = P

i∈Λnai(x)g(i/n) P

i∈Λnai(x) −g(x) Vn(x) =

P

i∈Λnai(x)εi

P

i∈Λnai(x) whereai(x) =K

x−i/n hn

. In the sequel, we denote alsoSn(x) =P

i∈Λnai(x)εi

for any x∈[0,1]d. We start with the following lemma.

Lemma 1 There exists constants c, C >0 such that for any x∈[0,1]d and any n∈N,

c

d

Y

k=1

[n(xk+hn)]≤ X

i∈Λn

ai(x)≤C

d

Y

k=1

[n(xk+hn)] (17) where [.] denote the integer part function.

Proof of Lemma 1. Since the kernel K is supported by [−1,1]d, we have X

i∈Λn

ai(x) =

[n(x1+hn)]

X

i1=1

. . .

[n(xd+hn)]

X

id=1

ai(x).

By assumption, there exists constants c, C > 0 such that c≤K(y)≤C for any y∈[−1,1]d. The proof of Lemma 1 is complete.

(11)

4.1 Proof of Theorem 1

Let (υn)n≥1 be a sequence of positive numbers going to zero. Following Carbon and al. [11] the compact set [0,1]d can be covered by rn cubes Ik

having sides of length ln = υnh2d+1n and center at ck. Clearly there exists c > 0 such that rn≤c/ldn. Define

A1,n(g) = max

1≤k≤rn

sup

x∈Ik

|gn(x)−gn(ck)| A2,n(g) = max

1≤k≤rn

sup

x∈Ik

|Egn(x)−Egn(ck)| A3,n = max

1≤k≤rn|gn(ck)−Egn(ck)| then

sup

x∈[0,1]d|gn(x)−Egn(x)| ≤ sup

g∈Lip(B)

[A1,n(g) +A2,n(g)] +A3,n. (18)

Lemma 2 Fori= 1,2 we have sup

g∈Lip(B)

Ai,n(g) =Oa.s.[vn].

Proof of Lemma 2. Since g ∈ Lip(B), we can assume without loss of gener- ality that g is bounded by B on the set [0,1]d. For any x∈Ik, we have

gn(x)−gn(ck) =σ12

where

σ1 = P

i∈ΛnYi(ai(x)−ai(ck)) P

i∈Λnai(x) and

σ2 = P

i∈Λn(ai(ck)−ai(x)) P

i∈Λnai(x)×P

i∈Λnai(ck) X

i∈Λn

Yiai(ck).

Now, by Lemma 1 and AssumptionA1), we derive that there exists constants c, η > 0 such that for any n sufficiently large

1| ≤ 2dηln/hn

c(nhn)d X

i∈Λn

|Yi| ≤ ηvnhdn

c B+ 1

nd X

i∈Λn

i|

!

and

2| ≤ 4dηndln/hn

c2(nhn)2d X

i∈Λn

|Yi| ≤ ηvn

c2 B+ 1 nd

X

i∈Λn

i|

!

(12)

Since (εi) is a stationary ergodic random field the lemma easily follows from the last inequalities and the Birkhoff ergodic theorem. The proof of Lemma 2 is complete.

Lemma 3 Assume that either C1) holds and vn = (logn)1/2/(nhn)d/2 or C2) holds for some 0< q <2 and vn= (logn)1/q/(nhn)d/2 then

A3,n =Oa.s.[vn]

Proof of Lemma 3. Let 0 < q ≤ 2 be fixed. We consider the exponential Young function define for any x ∈ R+ by ψq(x) = exp((x+ξq)q)−exp(ξqq) where ξq= ((1−q)/q)1/q11{0<q<1}. Let λ >0 and x∈[0,1]d be fixed

P(|Vn(x)|> λvn) =P Sn(x)

> λvn

X

i∈Λn

ai(x)

!

≤(1 +eξqq) exp

λvnP

i∈Λnai(x)

||P

i∈Λnai(x)εi||ψqq q

.

For any i∈Λn and any 0< q <2 denote bi,q(a(x)ε) =

ai(x)εi

2

ψβ(q) + X

k∈Vi1

q

ak(x)εkE|k−i|(ai(x)εi)

2 ψβ(q)

(19) and

bi,2(a(x)ε) =

ai(x)εi

2

+ X

k∈Vi1

kak(x)εkE|k−i|(ai(x)εi)k (20) where Vi1 = {j ∈ Zd; j <lex i}. Using Kahane-Khintchine inequalities (cf.

El Machkouri [16], Theorem 1) we derive that if Condition C2) holds for some 0< q <2 then

P(|Vn(x)|> λvn)≤(1 +eξqq) exp

λ vnP

i∈Λnai(x) M(P

i∈Λnbi,q(a(x)ε))1/2q

q (21) where M is a positive constant depending only on q and on the probability kernel K. Now using the definition (19) and Lemma 1 there exist constants c, M >0 such that

sup

x∈[0,1]d

P(|Vn(x)|> λvn)≤(1 +eξqq) exp

− λ vn(P

i∈Λnai(x))1/2

M +ξq

!q

≤(1 +eξqq) exp

−cqλqvnq([nhn])dq/2 Mq

(13)

So if vn = (logn)1/q/(nhn)d/2 and n is sufficiently large then sup

x∈[0,1]d

P(|Vn(x)|> λvn)≤(1 +eξqq) exp

− cqλq logn 2dq/2Mq

. (22)

If ConditionC1) holds then (21) still hold withq = 2 (cf. El Machkouri [16], Theorem 1). So if vn= (logn)1/2/(nhn)d/2 and n is large it follows that

sup

x∈[0,1]d

P(|Vn(x)|> λvn)≤2 exp

− c2λ2 logn 2dM2

. (23)

Since

P(|A3,n|> λvn)≤ rn sup

x∈[0,1]d

P(|Vn(x)|> λvn),

using (22) and (23), choosingλsufficiently large and applying Borel-Cantelli’s lemma, we derive

P

lim sup

n→∞ {|A3,n|> λvn}

= 0 and

P

lim sup

n→∞

|A3,n| vn ≤λ

= 1.

The proof of points 1) and 2) of Theorem 1 are completed by combining Inequality (18) with Lemmas 2 and 3.

Lemma 4 Assume that C3) holds for some p > 2 and hn = n−θ2(logn)θ1 for some θ1, θ2 >0. Let a, b≥0 be fixed and denote

vn= na(logn)b

(nhn)d/2 and θ = 2a(d+p)−d2−2 d(3d+ 2) . If θ≥θ2 and d(3d+ 2)θ1+ 2(d+p)b >2 then

n→+∞lim

|A3,n| vn

= 0 a.s.

Proof of Lemma 4. Let p >2 be fixed. For any λ >0 P(|Vn(x)|> λvn) =P |Sn(x)|> λvnX

i∈Λn

ai(x)

!

≤ λ−pE|Sn(x)|p vpn(P

i∈Λnai(x))p

≤ λ−p vpn(P

i∈Λnai(x))p 2pX

i∈Λn

ci(x)

!p/2

(14)

where ci(x) = ai(x)2ik2p +ai(x)P

k∈Vi1ak(x)kεkE|k−i|i)kp2. The last es- timate follows from a Marcinkiewicz-Zygmund type inequality by Dedecker (see [13]) for real random fields. Noting that there exists γ > 0 such that ci(x) ≤ γai(x), x ∈ [0,1]d and using Lemma 1, we derive that there exists γ >0 such that

P(|A3,n|> λvn)≤rn sup

x∈[0,1]d

P(|Vn(x)|> λvn)≤ γ τnλp

where τn =ldnvnp([nhn])dp/2. Since vn =na(logn)b/(nhn)d/2 and ln =vnh2d+1n it follows

1 τn

= (nhn)d(d+p)/2

hd(2d+1)n na(d+p)(logn)b(d+p)([nhn])dp/2. If n is sufficiently large, we derive

1

τn ≤ 2dp/2(nhn)d(d+p)/2

hd(2d+1)n na(d+p)(logn)b(d+p)(nhn)dp/2

= 2dp/2

hd(3d+2)/2n na(d+p)−d2/2(logn)b(d+p)

≤ 2dp/2

n(logn)b(d+p)+θ1d(3d+2)/2 since θ ≥θ2. Now b(d+p) +θ1d(3d+ 2)/2 >1 implies P

n≥1τn−1 <∞. Applying Borel- Cantelli’s lemma, it follows that for any λ >0

P

lim sup

n→∞ {|A3,n|> λvn}

= 0, that is for any λ >0

P

lim sup

n→∞

|A3,n| vn ≤λ

= 1.

The proof of Lemma 4 is complete and the point3) of Theorem 1 is obtained by combining Inequality (18) with Lemmas 2 and 4. The proof of Theorem 1 is complete.

4.2 Proof of Theorem 2

We follow the first part of the proof of Theorem 1 and we consider the estimation (18).

(15)

Lemma 5 Assume that C3) holds for some p >2. Let a > 0 be fixed and denote

vn= na

(nhn)d/2 and θ = 2a(d+p)−d2 d(3d+ 2) . If θ >0 and hn ≥n−θ then

kA3,nkp =O[vn].

Proof of Lemma5. Letp >2 andx∈[0,1]dbe fixed. Using the Marcinkiewicz- Zygmund type inequality by Dedecker (see [13]) as in the proof of Lemma 4 there exist γ′′, c >0 such that

kVn(x)kp = E|Sn(x)|p P

i∈Λnai(x)p

!1/p

≤γ′′ X

i∈Λn

ai(x)

!−1/2

≤ γ′′

√c([nhn])−d/2 by Lemma 1.

It follows that

rn1/p sup

x∈[0,1]dkVn(x)kp =O vn

τn

whereτn=ld/pn vn([nhn])d/2. Ifnis sufficiently large thenτn ≥2−d/2lnd/pvn(nhn)d/2, hence using hn ≥n−θ we obtainτn≥2−d/2. Finally, we derive

kA3,nkp =k max

1≤k≤rn|Vn(xk)|kp ≤rn1/p sup

x∈[0,1]dkVn(x)kp =O[vn].

The proof of Lemma 5 is complete. The point 1) of Theorem 2 is obtained by combining inequality (18) and lemmas 2 and 5.

Now, we are going to prove the point 2) of Theorem 2. We have E(Sn(x)2) = X

k,l∈Λn

ak(x)al(x)E(εkεl)

= X

k∈Λn

ak(x)2E(ε2k) +X

k6=l

ak(x)al(x)E(εkεl)

=E(ε20) X

k∈Λn

ak(x)2+ X

k∈Λn

ak(x) X

l∈Λn\{k}

al(x)E(εkεl)

≤X

l∈Zd

|E(ε0εl)| × X

k∈Λn

ak(x).

(16)

If Condition C4) holds then using Lemma 1 there exists γ >0 such that for any x ∈ [0,1]d we have E(Sn(x)2) ≤ γQd

k=1[n(xk+hn)]. Let x ∈ [0,1]d be fixed, using Lemma 1, there exists c >0 such that

kVn(x)k2 = kSn(x)k2 P

i∈Λnai(x)

√γ c

d

Y

k=1

[n(xk+hn)]

!−1/2

√γ c([nhn])d/2

≤ 2d/2√γ

c(nhn)d/2 for n sufficiently large.

The proof of Theorem 2 is complete.

4.3 Proof of Proposition 1

Since g ∈ Lip(B), it follows that

|Bn(x)|=

P

i∈Λn(g(i/n)−g(x))ai(x) P

i∈Λnai(x)

≤Bhn P

i∈Λnk(i/n−x)/hnkai(x) P

i∈Λnai(x)

≤Bhn.

The proof of Proposition 1 is complete.

4.4 Proof of Corollary 1

Let hn= (n−dlogn)1/(2+d) then Proposition 1 gives sup

x∈[0,1]d

sup

g∈Lip(B)|Egn(x)−g(x)|=O

"

logn nd

2+d1 #

. (24)

Assume that C1) holds. Noting that (logn)1/2

(nhn)d/2 =

logn nd

2+d1

(17)

and using (5) we obtain sup

x∈[0,1]d|gn(x)−Egn(x)|=Oa.s.

"

logn nd

2+d1 #

. (25)

Combining (24) and (25) we derive (10).

Assume that C2) holds for some 0< q <2. Noting that (logn)1/q

(nhn)d/2 =

logn nd

2+d1

×(logn)(2−q)/2q and using (6) we obtain

sup

x∈[0,1]d|gn(x)−Egn(x)|=Oa.s.

"

logn nd

2+d1

×(logn)(2−q)/2q

#

. (26) Combining (24) and (26) we derive (11).

Letε >0 be fixed and assume that C3) holds for some p >2 which satisfies condition (12). Applying the point 3) of Theorem 1 withθ1 = 1/(2 +d) and θ2 =d/(2 +d) and noting that

vn= na(logn)b (nhn)d/2 =nε

logn nd

2+d1

⇐⇒

a=ε and b= 1 2

it follows

sup

x∈[0,1]d|gn(x)−Egn(x)|=Oa.s.

"

nε

logn nd

2+d1 #

. (27)

Combining (24) and (27) we derive (13). The proof of Corollary 1 is complete.

4.5 Proof of Corollary 2

Let hn=n−d/(2+d) then Proposition 1 gives sup

x∈[0,1]d

sup

g∈Lip(B)|Egn(x)−g(x)|=Oh

n2+dd i

. (28)

Letε >0 be fixed and assume that C3) holds for some p >2 which satisfies condition (14). Applying the point 1) of Theorem 2 and noting that

vn= na

(nhn)d/2 =n2+dd ⇐⇒a=ε

(18)

it follows that

sup

x∈[0,1]d|gn(x)−Egn(x)| p

=Oh

n2+dd i

. (29)

Combining (28) and (29) we derive (15).

Since hn =n−d/(2+d) then (nhn)−d/2 =hn. So, if C4) holds then combining (28) and (9) we derive (16). The proof of Corollary 2 is complete.

4.6 Proof of Corollary 3

Let p > 2 be fixed. Using Rio’s inequality [30] (see also Dedecker [13]) we obtain the bound

kE|k|0)kp2 ≤4

Z α1,∞(|k|) 0

Qpε0(u)du

!2/p

(30) hence condition C3) is more restrictive than condition C3).

By Serfling’s inequality (see McLeish [27] or Serfling [32]) we know that kεkE|k|0)k≤2kε0k2φ∞,1(|k|)

so condition C1) is more restrictive than condition C1).

Now for 0 < q < 2 there exists C(q) > 0 (cf. Inequality (17) in [16]) such that

q

kE|k|0)|

2

ψβ(q) ≤C(q)q

φ∞,1(|k|). (31) In [16] we used the following lemma which can be obtain by the expansion of the exponential function.

Lemma 6 Let β be a positive real number and Z be a real random variable.

There exist positive universal constants Aβ andBβ depending only onβ such that

Aβ sup

p>2

kZkp

p1/β ≤ kZkψβ ≤Bβ sup

p>2

kZkp p1/β . Consider the coefficient ck(β) given by (3) and denote

dk(p) =

Z α1,∞(|k|) 0

Qpε0(u)du

!1/p

then the following version of lemma 6 holds.

(19)

Lemma 7 Let β be a positive real number. There exist positive universal constants Aβ and Bβ depending only on β such that for any k ∈Zd

Aβsup

p>2

dk(p)

p1/β ≤ck(β)≤Bβsup

p>2

dk(p) p1/β .

Now combining lemmas 6 and 7 and inequality (30) there exists C(q) > 0 such that

q

kE|k|0)|

2

ψβ(q) ≤C(q)c2k(β(q)). (32) Finally condition C2) is more restrictive than conditionC2) and the proof of Corollary 3 is complete.

Aknowledgements. I would like to express my thanks to the anonymous referee for his/her careful reading of the manuscript and valuable sugges- tions. I am indebted for ´E. Youndje for many stimulating conversations on nonparametric estimation.

References

[1] K. S. Alexander and R. Pyke. A uniform central limit theorem for set-indexed partial-sum processes with finite variance. Ann. Probab., 14:582–597, 1986.

[2] N. Altman. Krige, smooth, both or neither ? Australian and New Zealand Journal of Statistics, 42:441–461, 2000.

[3] R. F. Bass. Law of the iterated logarithm for set-indexed partial sum processes with finite variance. Z. Wahrsch. verw. Gebiete, 70:591–608, 1985.

[4] A. K. Basu and C. C. Y. Dorea. On functional central limit theorem for stationary martingale random fields. Acta. Math. Hung., 33:307–316, 1979.

[5] G. Biau. Spatial kernel density estimation. To appear in Mathematical methods of Statistics, 2003.

[6] G. Biau and B. Cadre. Nonparametric spatial prediction. To appear in Statistical Inference for Stochastic Processes, 2003.

[7] E. Bolthausen. Exact convergence rates in some martingale central limit theorems. Ann. Probab., 10(3):672–688, 1982.

(20)

[8] D. Bosq. Bernstein-type large deviations inequalities for partial sums of stron mixing processes. Statistics, 24:59–70, 1993.

[9] D. Bosq. Nonparametric Statistics for Stochastic Processes-Estimation and Prediction-2nde Edition. Lecture Notes in Statistics, Springer Ver- lag, New York, 1998.

[10] M. Carbon, M. Hallin, and B. Wu. Kernel density estimation for random fields: the l1 theory. Nonparametric Statist., 6:157–170, 1996.

[11] M. Carbon, L.T. Tran, and B. Wu. Kernel density estimation for random fields. Statist. Probab. Lett., 36:115–125, 1997.

[12] J. Dedecker. A central limit theorem for stationary random fields.

Probab. Theory Relat. Fields, 110:397–426, 1998.

[13] J. Dedecker. Exponential inequalities and functional central limit the- orems for random fields. ESAIM: Probability and Statistics, 5:77–104, 2001.

[14] R. L. Dobrushin. The description of a random fields by mean of con- ditional probabilities and condition of its regularity. Th. Probab. Appl., 13:197–224, 1968.

[15] P. Doukhan. Mixing : Properties and Examples, volume 85. Lecture Notes in Statistics, Berlin, 1994.

[16] M. El Machkouri. Kahane-Khintchine inequalities and functional central limit theorem for stationary random fields. Stoch. Proc. and Their Appl., 120:285–299, 2002.

[17] M. El Machkouri and D. Voln´y. Contre-exemple dans le th´eor`eme central limite fonctionnel pour les champs al´eatoires r´eels. Annales de l’IHP, 2:325–337, 2003.

[18] X. Guyon. Random fields on a Network: Modeling, Statistics and Ap- plications. Springer, New York, 1995.

[19] P. Hall and J.D. Hart. Nonparametric regression with long-range de- pendence. Stoch. Proc. and Their Appl., 36:339–351, 1990.

[20] M. Hallin, Z. Lu, and L.T. Tran. Density estimation for spatial linear processes. Bernoulli, 7:657–668, 2001.

(21)

[21] M. Hallin, Z. Lu, and L.T. Tran. Density estimation for spatial processes:

the l1 theory. Journal of Multivariate Analysis, 88:61–75, 2004.

[22] M. Hallin, Z. Lu, and L.T. Tran. Local linear spatial regression. Annals of Statistics, 32, 2004. In press.

[23] I. A. Ibragimov. Some limit theorems for stationary processes. Theory Probab. Appl., 7:349–382, 1962.

[24] M. A. Krasnosel’skii and Y. B. Rutickii. Convex Functions and Orlicz Spaces. P. Noordhoff LTD-Groningen-The Netherlands, 1961.

[25] Z. Lu and X. Chen. Spatial nonparametric regression estimation: Non- isotropic case. Acta Mathematicae Applicatae Sinica, English series, 18:641–656, 2002.

[26] Z. Lu and X. Chen. Spatial kernel regression estimation: weak consis- tency. Statistics and Probability Letters, 68:125–136, 2004.

[27] D. L. McLeish. A maximal inequality and dependent strong laws. Ann.

Probab., 3(5):829–839, 1975.

[28] B. Nahapetian. Limit Theorems and Some Applications in Statistical Physics. B. G. Teubner Verlagsgesellschaft, Stuttgart, Leipzig, 1991.

[29] B. Nahapetian and A. N. Petrosian. Martingale-difference Gibbs random fields and central limit theorem. Ann. Acad. Sci. Fenn., Series A-I Math., 17:105–110, 1992.

[30] E. Rio. Covariance inequalities for strongly mixing processes. Annales de l’IHP, 29(4):587–597, 1993.

[31] M. Rosenblatt. A central limit theorem and a strong mixing condition.

Proc. Nat. Acad. Sci. USA, 42:43–47, 1956.

[32] R. J. Serfling. Contributions to central limit theory for dependent vari- ables. Ann. Math. Statist., 39(4):1158–1175, 1968.

[33] C. J. Stone. Optimal global rates of convergence for nonparametric regression. Annal. of Statist., 10(4):1043–1053, 1982.

[34] L.T. Tran. Kernel density estimation on random fields. Journal of Multivariate Analysis, 34:37–53, 1990.

[35] L.T. Tran and Yakowitz S. Nearest neighbor estimators for random fields. Journal of Multivariate Analysis, 44:23–46, 1993.

(22)

[36] Q. Yao. Exponential inequalities for spatial processes and uniform con- vergence rates for density estimation. In Development of Modern Statis- tics and Related Topics - In Celebration of Prof. Yaoting Zhang’s 70th Birthday, H. Zhang and J. Huang (edit.), World Scientific, Singapore, pages 118–128, 2003.

Mohamed EL MACHKOURI

Laboratoire de Math´ematiques Rapha¨el Salem UMR 6085, Universit´e de Rouen

Site Colbert,

F76821 Mont-Saint-Aignan Cedex

email : [email protected]

Références

Documents relatifs

The paper considers the problem of robust estimating a periodic function in a continuous time regression model with dependent dis- turbances given by a general square

Key words and phrases: Central limit theorem, spatial processes, m-dependent random fields, physical dependence measure, nonparametric estimation, kernel density estimator, rate

Key words and phrases: Central limit theorem, deconvolution kernel density esti- mator, strongly mixing random fields, nonmixing random fields, physical depen- dence measure..

To obtain the Pinsker constant for the model (1.1) one has to prove a sharp asymptotic lower bound for the quadratic risk in the case when the noise variance depends on the

Inspired both by the advances about the Lepski methods (Goldenshluger and Lepski, 2011) and model selection, Chagny and Roche (2014) investigate a global bandwidth selection method

Keywords: Kernel estimator; Spatial regression; Random fields; Strong mixing coef- ficient; Dimension reduction; Inverse Regression.. The set S can be discret, continuous or the set

(1999), Massart (2004) have constructed an adaptive procedure of model selection based on least squares estimators and have obtained a non-asymptotic upper bound for quadratic

Key words and phrases: Nonparametric regression estimation, asymptotic nor- mality, kernel estimator, strongly mixing random field.. Short title: Asymptotic normality of