• Aucun résultat trouvé

Donsker results for the

N/A
N/A
Protected

Academic year: 2022

Partager "Donsker results for the"

Copied!
46
0
0

Texte intégral

(1)

Donsker results for the

smoothed empirical process of dependent observations

Eric Beutner, Maastricht University Joint work with Henryk Zähle

Besancon, May 13

(2)

Overview

2

Introduction

Extending the benchmark Free lunch? Yes free lunch!

One more free lunch? No (too much asked).

(3)

Introduction

(4)

Introduction

4

Let Z1, . . . , Zn be identically distributed with cdf F0 and density f0 and consider the kernel based estimator for f0

n(z) = 1 n

Xn

i=1

1 hn K

z − Zi hn

(5)

Introduction

Let Z1, . . . , Zn be identically distributed with cdf F0 and density f0 and consider the kernel based estimator for f0

n(z) = 1 n

Xn

i=1

1 hn K

z − Zi hn

=

Z 1 hn K

z − y hn

dFbn(y),

where Fbn is the empirical distribution function based on Z1, . . . , Zn and hn is the bandwidth.

(6)

Introduction

4

Let Z1, . . . , Zn be identically distributed with cdf F0 and density f0 and consider the kernel based estimator for f0

n(z) = 1 n

Xn

i=1

1 hn K

z − Zi hn

=

Z 1 hn K

z − y hn

dFbn(y),

where Fbn is the empirical distribution function based on Z1, . . . , Zn and hn is the bandwidth.

Let the kernel K be symmetric and for some C > 0 Z

K(y) dy = 1, Z

yK(y) dy = 0, and Z

y2K(y)dy = C.

(7)

Introduction

Let Z1, . . . , Zn be identically distributed with cdf F0 and density f0 and consider the kernel based estimator for f0

n(z) = 1 n

Xn

i=1

1 hn K

z − Zi hn

=

Z 1 hn K

z − y hn

dFbn(y),

where Fbn is the empirical distribution function based on Z1, . . . , Zn and hn is the bandwidth.

Let the kernel K be symmetric and for some C > 0 Z

K(y) dy = 1, Z

yK(y) dy = 0, and Z

y2K(y)dy = C.

For f0 twice continuously differentiable, hn = n−1/5 is optimal for MISE( ˆfn) =

Z

E[( ˆfn(z) − f0(z))2] dz

(8)

Introduction (cont’d)

5

Ideally, we could use this estimate for f0 to estimate, for instance, moments of F0. The second moment plug-in estimate would be

Z

z2n(z) dz =

Xn

i=1

Z

z2 1

nhn K

z − Zi hn

dz.

(9)

Introduction (cont’d)

Ideally, we could use this estimate for f0 to estimate, for instance, moments of F0. The second moment plug-in estimate would be

Z

z2n(z) dz =

Xn

i=1

Z

z2 1

nhn K

z − Zi hn

dz.

Substituting z by zhn + Zi this equals 1

n

Xn

i=1

Z

z2h2nK(z) dz + 2 n

Xn

i=1

Zihn Z

zK(z) dz + 1 n

Xn

i=1

Zi2.

(10)

Introduction (cont’d)

5

Ideally, we could use this estimate for f0 to estimate, for instance, moments of F0. The second moment plug-in estimate would be

Z

z2n(z) dz =

Xn

i=1

Z

z2 1

nhn K

z − Zi hn

dz.

Substituting z by zhn + Zi this equals 1

n

Xn

i=1

Z

z2h2nK(z) dz + 2 n

Xn

i=1

Zihn Z

zK(z) dz + 1 n

Xn

i=1

Zi2.

The difference between this estimate and R

z2f0(z) dz equals 1

n

Xn

i=1

Zi2 − Z

z2f0(z) dz + Ch2n.

(11)

Introduction (cont’d)

Ideally, we could use this estimate for f0 to estimate, for instance, moments of F0. The second moment plug-in estimate would be

Z

z2n(z) dz =

Xn

i=1

Z

z2 1

nhn K

z − Zi hn

dz.

Substituting z by zhn + Zi this equals 1

n

Xn

i=1

Z

z2h2nK(z) dz + 2 n

Xn

i=1

Zihn Z

zK(z) dz + 1 n

Xn

i=1

Zi2.

The difference between this estimate and R

z2f0(z) dz equals 1

n

Xn

i=1

Zi2 − Z

z2f0(z) dz + Ch2n.

⇒ R

z2n(z)dz not √

n consistent for E[Z12] as √

nhn = n1/10 → ∞.

(12)

Introduction (cont’d)

6

Can we find K such that we have √

n consistency for the plug-in estimator

n → Z

z2n(z)dz,

i.e. √

n

Z

z2n(z)dz − Z

z2f0(z) dz

= OP (1), and MISE-optimality at the same time?

(13)

Introduction (cont’d)

Can we find K such that we have √

n consistency for the plug-in estimator

n → Z

z2n(z)dz,

i.e. √

n

Z

z2n(z)dz − Z

z2f0(z) dz

= OP (1), and MISE-optimality at the same time?

More generally, for which functions g and kernels K can we have MISE-optimality and

√n

Z

g(z) ˆfK(z) dz − Z

g(z)f0(z) dz

= OP (1) at the same time?

(14)

Introduction (cont’d)

7

Comparing this last question to what is known for the plug-in estimator based on Fbn where even

sup

˜ g∈G˜

√n

Z

˜

g(z)dFbn(z) − Z

˜

g(z) dF0(z)

= OP (1)

for several classes of functions G˜ (even converges weakly)

(15)

Introduction (cont’d)

Comparing this last question to what is known for the plug-in estimator based on Fbn where even

sup

˜ g∈G˜

√n

Z

˜

g(z)dFbn(z) − Z

˜

g(z) dF0(z)

= OP (1)

for several classes of functions G˜ (even converges weakly)

we may ask if we can have for some kernels MISE-optimality and sup

˜ g∈G˜

√n

Z

˜

g(z) ˆfK(z)dz − Z

˜

g(z)f0(z)dz

= OP(1) for the same classes of functions (or even weak convergence).

(16)

Introduction (cont’d)

8

Already known: If Z1, . . . , Zn are iid it holds, for instance, that for G˜ = BV = {g˜ : R → R | variation of g˜ ≤ c} we have

sup

˜ g∈BV

√n

Z

˜

g(z) dFbn(z) − Z

˜

g(z) dF0(z)

= OP(1).

Giné and Nickl (2008) proved that for this class of functions and f0 bounded and m-times continuously differentiable one has with

hn = n2m+11 (i.e.MISE optimality) sup

˜ g∈BV

√n

Z

˜

g(z) ˆfK(z)dz − Z

˜

g(z)f0(z)dz

= OP(1),

(17)

Introduction (cont’d)

Already known: If Z1, . . . , Zn are iid it holds, for instance, that for G˜ = BV = {g˜ : R → R | variation of g˜ ≤ c} we have

sup

˜ g∈BV

√n

Z

˜

g(z) dFbn(z) − Z

˜

g(z) dF0(z)

= OP(1).

Giné and Nickl (2008) proved that for this class of functions and f0 bounded and m-times continuously differentiable one has with

hn = n2m+11 (i.e.MISE optimality) sup

˜ g∈BV

√n

Z

˜

g(z) ˆfK(z)dz − Z

˜

g(z)f0(z)dz

= OP(1),

if √

nhm+kn → 0 and if with r = 1, . . . , m, and k > 1/2 Z

K(z) dz = 1, Z

zrK(z) dz = 0, Z

|z|m+k|K(z)| dz < ∞.

(18)

Introduction (cont’d)

9

Giné and Nickl (2008) proved two more results in the same spirit with G˜ ⊂ C(R) where

C(R) = {f : R → R | |f(x)| ≤ M, x ∈ R, f continuous}.

Clearly, our example from the beginning g(z) = z2 does not belong to C(R) nor is it of bounded variation on R.

(19)

Introduction (cont’d)

Giné and Nickl (2008) proved two more results in the same spirit with G˜ ⊂ C(R) where

C(R) = {f : R → R | |f(x)| ≤ M, x ∈ R, f continuous}.

Clearly, our example from the beginning g(z) = z2 does not belong to C(R) nor is it of bounded variation on R.

First, one might ask whether the above results can be extended, for instance, to the set

BVloc = {g˜ : R → R | g˜ is locally of bounded variation}.

(20)

Introduction (cont’d)

9

Giné and Nickl (2008) proved two more results in the same spirit with G˜ ⊂ C(R) where

C(R) = {f : R → R | |f(x)| ≤ M, x ∈ R, f continuous}.

Clearly, our example from the beginning g(z) = z2 does not belong to C(R) nor is it of bounded variation on R.

First, one might ask whether the above results can be extended, for instance, to the set

BVloc = {g˜ : R → R | g˜ is locally of bounded variation}.

Second, one might ask whether we can extend the results to time series settings, where

Zi is ARMA(p, q) or Zi is GARCH(p, q).

(21)

Extending the benchmark

(22)

Locally bounded variation: iid

11

First recall the benchmark for the kernel based density estimator sup

˜ g∈BV

√n

Z

˜

g(z)dFbn(z) − Z

˜

g(z) dF(z)

= OP(1).

Hence, we first need to extend this result into two directions:

1. Replace sup w.r.t. g˜ ∈ BV by sup w.r.t. g˜ ∈ BVloc;

2. Replace Fbn based on iid Z1, . . . , Zn by Zi is ARMA(p, q) or Zi is GARCH(p, q) (or more generally, by some weak

dependence concept).

(23)

Locally bounded variation: iid

Let φ : R → [1, ∞) be a weight function and put BV(1/φ),≤c =

˜

g ∈ BVloc :

Z 1

φ(z) |d˜g|(z) ≤ c

.

(24)

Locally bounded variation: iid

12

Let φ : R → [1, ∞) be a weight function and put BV(1/φ),≤c =

˜

g ∈ BVloc :

Z 1

φ(z) |d˜g|(z) ≤ c

.

Note that for φ ≡ 1 we get the functions of bounded variation.

(25)

Locally bounded variation: iid

Let φ : R → [1, ∞) be a weight function and put BV(1/φ),≤c =

˜

g ∈ BVloc :

Z 1

φ(z) |d˜g|(z) ≤ c

.

Note that for φ ≡ 1 we get the functions of bounded variation.

If we take φ(z) = (1 + |z|)2+ǫ, ǫ > 0, then our example from the beginning g(z) = z2 (dg(z) = 2z) is included in

BV(1/(1+|z|)2+ǫ),≤c =

˜

g ∈ BVloc :

Z 1

(1 + |z|)2+ǫ |dg˜|(z) ≤ c

.

(26)

Locally bounded variation: iid

12

Let φ : R → [1, ∞) be a weight function and put BV(1/φ),≤c =

˜

g ∈ BVloc :

Z 1

φ(z) |d˜g|(z) ≤ c

.

Note that for φ ≡ 1 we get the functions of bounded variation.

If we take φ(z) = (1 + |z|)2+ǫ, ǫ > 0, then our example from the beginning g(z) = z2 (dg(z) = 2z) is included in

BV(1/(1+|z|)2+ǫ),≤c =

˜

g ∈ BVloc :

Z 1

(1 + |z|)2+ǫ |dg˜|(z) ≤ c

.

Theorem: Let Z1, . . . , Zn be iid and φ be a weight function. Then for F0 with R

φ2dF0 < ∞, we have sup

˜ g∈BV

(1/φ),c

√n

Z

˜

g(z) dFbn(z) − Z

˜

g(z)dF0(z)

= OP(1).

(27)

Dependent data

Theorem: Z1, . . . , Zn be strictly stationary and α-mixing with α(n) = O(n−θ) for some θ > 1 + √

2. Let φλ(x) := (1 + |x|)λ, λ ≥ 0 and assume that R

R |x|γ dF0(x) < ∞ where γ > θ−12θλ . Then we have sup

˜

g∈BV(1/φ),≤c

√n

Z

˜

g(z) dFbn(z) − Z

˜

g(z)dF0(z)

= OP(1).

(28)

Dependent data

13

Theorem: Z1, . . . , Zn be strictly stationary and α-mixing with α(n) = O(n−θ) for some θ > 1 + √

2. Let φλ(x) := (1 + |x|)λ, λ ≥ 0 and assume that R

R |x|γ dF0(x) < ∞ where γ > θ−12θλ . Then we have sup

˜

g∈BV(1/φ),≤c

√n

Z

˜

g(z) dFbn(z) − Z

˜

g(z)dF0(z)

= OP(1).

Theorem: Let Zt := P

s=0 as εt−s, t ∈ N, with (εi)i∈Z i.i.d. Assume as = s−β ℓ(s), β ∈ (12,1), s ∈ N. Then Cov(X0, Xk) ∼ k1−2β, hence non-summable, thus long-memory.

With φλ as above and E[|ε0|2+2λ] < ∞, we have sup

˜

g∈BV(1/φ),≤c

rn

Z

˜

g(z) dFbn(z) − Z

˜

g(z) dF0(z)

= OP(1), where rn = nβ−1/2.

(29)

Free lunch? Yes free lunch!

(30)

Intro

15

First note that we have Z

˜

g(z) ˆfK(z)dz − Z

˜

g(z)f0(z)dz

=

ZZ

˜

g(z) 1 hn K

z − y hn

dFbn(y)dz −

ZZ

˜

g(z) 1 hnK

z − y hn

dF0(y)dz +

ZZ

˜

g(z) 1 hn K

x − y hn

dF0(y)dz − Z

˜

g(z)dF0(z)

Rewriting the second line as Z Z

˜

g(z) 1 hnK

z − y hn

dz

| {z }

¯ gn(y)

dFbn(y)−

Z Z

˜

g(z) 1 hn K

z − y hn

dz

| {z }

¯ gn(y)

dF0(y)

we see that the above benchmark result applies if all the g¯n belong to BV(1/φ),≤c

(31)

Intro

First note that we have Z

˜

g(z) ˆfK(z)dz − Z

˜

g(z)f0(z)dz

=

ZZ

˜

g(z) 1 hn K

z − y hn

dFbn(y)dz −

ZZ

˜

g(z) 1 hnK

z − y hn

dF0(y)dz +

ZZ

˜

g(z) 1 hn K

x − y hn

dF0(y)dz − Z

˜

g(z)dF0(z)

Rewriting the second line as Z Z

˜

g(z) 1 hnK

z − y hn

dz

| {z }

¯ gn(y)

dFbn(y)−

Z Z

˜

g(z) 1 hn K

z − y hn

dz

| {z }

¯ gn(y)

dF0(y)

(32)

Intro

15

First note that we have Z

˜

g(z) ˆfK(z)dz − Z

˜

g(z)f0(z)dz

=

ZZ

˜

g(z) 1 hn K

z − y hn

dFbn(y)dz −

ZZ

˜

g(z) 1 hnK

z − y hn

dF0(y)dz +

ZZ

˜

g(z) 1 hn K

x − y hn

dF0(y)dz − Z

˜

g(z)dF0(z)

Rewriting the second line as Z Z

˜

g(z) 1 hnK

z − y hn

dz

| {z }

¯ gn(y)

dFbn(y)−

Z Z

˜

g(z) 1 hn K

z − y hn

dz

| {z }

¯ gn(y)

dF0(y)

we see that the above benchmark result applies if all the g¯n belong to BV(1/φ),≤c

(33)

Intro

Hence, it only remains to consider sup

˜ g∈G˜

ZZ

˜

g(z) 1 hnK

x − y hn

dF0(y)dz − Z

˜

g(z)dF0(z).

Consider this for

˜

g ∈ G˜ := {gx : R → R|gx(z) = φ(x)1(−∞,x)(z), x ≤ 0, and gx(z) = −φ(x)1[x,∞)(z), x > 0}.

Then the above becomes sup

x

φ(x) Z

K(z) (F0(x + zhn) − F0(x)) dz.

(34)

Free lunch

17

If the benchmark result holds for F0 we have

φ(x)F0(x) → 0 for x → −∞ and φ(x)(1 − F0(x)) for x → ∞.

(35)

Free lunch

If the benchmark result holds for F0 we have

φ(x)F0(x) → 0 for x → −∞ and φ(x)(1 − F0(x)) for x → ∞.

Hence, for K compact and x small (or large) F0(x + zhn) − F0(x)

will also be small for all z even when compared to φ(x).

(36)

Free lunch

17

If the benchmark result holds for F0 we have

φ(x)F0(x) → 0 for x → −∞ and φ(x)(1 − F0(x)) for x → ∞.

Hence, for K compact and x small (or large) F0(x + zhn) − F0(x)

will also be small for all z even when compared to φ(x).

Theorem: We have the following extension of the above result sup

˜

g∈BV(1/φ),≤c

√n

Z

˜

g(z) ˆfK(z)dz − Z

˜

g(z)f0(z)dz

= OP(1),

(37)

Free lunch

If the benchmark result holds for F0 we have

φ(x)F0(x) → 0 for x → −∞ and φ(x)(1 − F0(x)) for x → ∞.

Hence, for K compact and x small (or large) F0(x + zhn) − F0(x)

will also be small for all z even when compared to φ(x).

Theorem: We have the following extension of the above result sup

˜

g∈BV(1/φ),≤c

√n

Z

˜

g(z) ˆfK(z)dz − Z

˜

g(z)f0(z)dz

= OP(1),

if √

nhm+kn → 0, if supz φ(z)f0(m)(z) ≤ C and if with r = 1, . . . , m, and k > 1/2

Z

K(z) dz = 1, Z

zrK(z) dz = 0, Z

|z|m+k|K(z)| dz < ∞.

(38)

One more free lunch? No (too much asked).

18

(39)

Intro

Now consider K non-compact. Then the above reasoning that F0(x + zhn) − F0(x)

will be small if F0(x) is small does not apply anymore (we can make the first (almost) equal to 1 by making z large).

(40)

Intro

19

Now consider K non-compact. Then the above reasoning that F0(x + zhn) − F0(x)

will be small if F0(x) is small does not apply anymore (we can make the first (almost) equal to 1 by making z large).

Yet, intuitively, if K puts not too much weight on these z then Z

K(z) (F0(x + zhn) − F0(x)) dz should still be small if x is small.

(41)

Intro

Now consider K non-compact. Then the above reasoning that F0(x + zhn) − F0(x)

will be small if F0(x) is small does not apply anymore (we can make the first (almost) equal to 1 by making z large).

Yet, intuitively, if K puts not too much weight on these z then Z

K(z) (F0(x + zhn) − F0(x)) dz should still be small if x is small.

Impose the following: f0 is m-times continuously differentiable and for all t ∈ [0,1] and all x, y ∈ R we have

sup

x |φ(x)f(m)(x + ty)| ≤ L2|y|p(y), where p is a bounded function.

(42)

Too much asked

20

Theorem: Let f0 be as above and K be non-compact. Then sup

˜ g∈BV

(1/φ),c

√n

Z

˜

g(z) ˆfK(z)dz − Z

˜

g(z)f0(z)dz

= OP(1),

(43)

Too much asked

Theorem: Let f0 be as above and K be non-compact. Then sup

˜ g∈BV

(1/φ),c

√n

Z

˜

g(z) ˆfK(z)dz − Z

˜

g(z)f0(z)dz

= OP(1), if √

nhm+sn → 0, and if with r = 1, . . . ,⌊m + s⌋ Z

K(z) dz = 1, Z

zrK(z) dz = 0, Z

|z|m+s|K(z)| dz < ∞, where s = supy p(y).

(44)

Example

21

K compact: Then for f0 density of the double exponential we clearly have that

sup

z

φ(z)f0(z) is finite for any polynomial weight function.

(45)

Example

K compact: Then for f0 density of the double exponential we clearly have that

sup

z

φ(z)f0(z)

is finite for any polynomial weight function.

K non-compact and same f0: Then for a polynomial weight of the form (1 + |z|)λ the above condition holds with p > λ.

Thus, we have to increase the order of the kernel beyond what is

needed by the smoothness and that increase is a function of the weight.

(46)

22

That’s all.

Références

Documents relatifs

Based on corrections from multiple reference stations, a measurement domain algorithm tries to estimate, for each satellite, the combined effect of the error sources at

Des hommes nés sur le sol français , des hommes à qui long-temps nous payâmes un tribut d’éloges et d’admiration , ont donné dans leur cœur accès au démon de l’orgueil ;

The objectives of this workshop are to carry out a situation assessment of computerization of health statistical information systems in the countries of the Eastern

Le débat que devrait susciter une telle situation, périlleuse non seulement pour l’emploi mais aussi pour la démocratie, est occulté par le discours économique dominant qui

On appelle trajectoire de type I une trajectoire de marche aléatoire de longueur n qui atteint a et qui termine (strictement) en-dessous, et trajectoire de type II une trajectoire

Similarly to the polycopie, the exercises list and the previous assessments texts are given to the students from the beginning of the year, available on the lecturer’s webpage..

Central limit theorem: the sample distribution of confidence, support pairs for a rule is normally distributed Hypothesis: The distribution of confidence, support values for rules

We expect that these two families (conic transforms and conic transforms of a line transform) should yield all Fano threefolds with same intermediate Jacobians as X, thereby