• Aucun résultat trouvé

Asymptotic behaviour of the quadratic measure of deviation of multivariate density estimates

N/A
N/A
Protected

Academic year: 2022

Partager "Asymptotic behaviour of the quadratic measure of deviation of multivariate density estimates"

Copied!
14
0
0

Texte intégral

(1)

A NNALES DE L ’I. H. P., SECTION B

J OSÉ R AFAEL L EON R.

Asymptotic behaviour of the quadratic measure of deviation of multivariate density estimates

Annales de l’I. H. P., section B, tome 19, n

o

3 (1983), p. 297-309

<http://www.numdam.org/item?id=AIHPB_1983__19_3_297_0>

© Gauthier-Villars, 1983, tous droits réservés.

L’accès aux archives de la revue « Annales de l’I. H. P., section B » (http://www.elsevier.com/locate/anihpb) implique l’accord avec les condi- tions générales d’utilisation (http://www.numdam.org/conditions). Toute uti- lisation commerciale ou impression systématique est constitutive d’une infraction pénale. Toute copie ou impression de ce fichier doit conte- nir la présente mention de copyright.

Article numérisé dans le cadre du programme Numérisation de documents anciens mathématiques

http://www.numdam.org/

(2)

Asymptotic behaviour

of the quadratic

measure

of deviation

of multivariate density estimates

José Rafael LEON R.

Departamento de Matematicas, Facultad de Ciencias, Universidad Central de Venezuela, Caracas, Aptdo. 21201

Ann. Inst. Henri Poincaré.

Vol. XIX, 3-1983, p. 297-309.

Section B : Calcul des Probabilités et Statistique.

RESUME. - Nous obtenons un test

d’adequation

de la distribution

asymptotique 112

et nous ’prouvons

également

que la statis-

tique

consideree est

asymptotiquement gaussienne

sous les

hypotheses

de

contiguite

de la forme

fN

=

f °

+

L2(/l), 6r~ (

0.

ABSTRACT. - We obtain a test of

goodness

of fit from the

asymptotic

distribution

112

and we also prove that the statistic under consi.deration is

asymptotically gaussian

under

contiguous

alternatives of the

form fN

=

f °

+ E

L2(1l), ðN !

0.

Key

words and

phrases:

Multidimensional

density estimates,

qua- dratic measure,

asymptotic distribution,

test. of

goodness

of fit.

1. INTRODUCTION

Set

X2,

...,

XN,

... be a sequence of

independent identically

distributed random vector with values in We shall suppose that their

common distribution has a

density f

with

respect

to

Lebesgue

measure

and

that f

E

L2(~), where J1

is a Borel

probability

measure on with

density

with respect to

Lebesgue

measure.

Annales de l’lnstitut Henri Poincaré-Section B-Vol. XIX, 0020-2347/ 1983’29 7/S 5,00

(3)

If

{~ }j~

1 is

complete

orthonormal system in the n-th

partial

sum of the

respective

Fourier series for

f

is

where

Cencov [2] ]

defines the

following

estimator of

fn :

where the

3§s

are estimators

of aj

defined

by

FN being,

as

usual,

the

empirical

distribution function of the

sample X b X2, ... , XN.

The aim of this paper is to

give

conditions under which

When

appropriately centered,

has

gaussian asymptotic

distribution.

The method we use is

inspired by Naradaya [8 ], although

instead of

using

the

strong approximation

of the

empirical

process

by

a Brownian

bridge,

we

approximate

the estimator

by

functions of Gaussian variables with values in

L2(~c).

For these ones, the result follows from the central limit theorem on the real

line,

and the

approximation

allows to

study

the

behaviour of

N n II II I

.

The result is

applied

to various

complete

orthonormal sets.

Finally,

we consider tests of

goodness

of fit

fn ~ ~ 2

and

the behaviour of

f I I2,

which

permit together

to

study

the

asymptotic

behaviour of the

statistic f ~2.

Annales de l’Institut henri Poincaré-Section B

(4)

299 MEASURE OF DEVIATION OF MULTIVARIATE DENSITY ESTIMATES

2. ASSOCIATED GAUSSIAN VARIABLES We have defined

If we

put

it is clear that

If we consider the

Yn,k

as

independent identically

distributed random variables with values in we have

and

where

rn

is the covariance of and

(., . )

is the scalar

product

in

L2(,u).

It follows that

Define the centered

gaussian

random variable with values in

L2Cu) by

in such a way that

Zl,n

and

Ynsk

have the same covariance.

Now,

if

Ço

is

a normalized

gaussian

real random

variable, independent from ~ ~~ }~‘-1,

consider the random variable

(with

values in

(5)

Them

Z2,n

is

gaussian,

= 0 and its covariance satisfies

LEMMA 2 .1. CO. then

P~~oof. _

The result now follows from

Before

studying

the

asymptotic

behaviour of

Zl,n

let us define:

Note that

~n-1

1 are the

eigenvalues

of An.

LEMMA 2. 2.

- Suppose

that there exists m &#x3E; 3 such

that 1 S - 0 1 . Then,

if lim n -+ (’£’,

(1; = 62

&#x3E;

0,

we have n

Annales de l’lnstitut Henri Poincaré-Section B

(6)

301 MEASURE OF DEVIATION OF MULTIVARIATE DENSITY ESTIMATES

So that

The result follows

letting n -

00.

THEOREM ~ . 3. Under the same

hypothesis

of Lemma 2 . 2 we have

Proof

- Since

Z2,n

is

gaussian

we can find a

basis {e1, ... , en}

such

that

where Yi, ..., y~ are normalized

independent gaussian

random variables. So

and

The result follows from

Together

with Lemma 2.2 and

Lindeberg’s

Theorem on the line.

REMARK. - From Lemma 2.1 and

Z~ ~

= we obtain

3. MAIN THEOREM

The

following

Theorem is due to Kuelbs and Kurtz

[7] see

also Gine

and Leon

[4 ].

The statement is

adapted

to our present needs.

THEOREM 3.1. - be

independent identically

distributed

(7)

random variables with values in

E(Yi)

=

0, Y1 113)

00

and Z1

a centered

gaussian

variable with the same covariance as

Yi. Then,

for each t and 6 &#x3E;

0,

holds true.

We now prove our main result:

THEOREM 3 .2. -

Suppose

that for some a &#x3E; 0

1

If, additionally, ~fr~03B1

x,

03C32n

~

03C32

&#x3E; o

and -Sm(n)

=

0(1)

for some

n

/11 &#x3E;_

3,

them ..- --,

Proof

- Define

Gn,N(t)

and

Gn(t)

in the

following

way:

By

theorem 3.1.

But

Annales de l’lnstitut Henri Poincaré-Section B

(8)

MEASURE OF DEVIATION OF MULTIVARIATE DENSITY ESTIMATES 303

If we choose 5 ==

6(n)

such that

5~~~~ -~ 0,

the remark

following

theorem 2.3 and the fact that

A~

is

bounded,

since

imply :

Where

~~

denotes the normal

(0, 0-2)

distribution.

Finally,

if we choose

6(n)

= with

8n

~

0, ~2n1~2 _ 8n

~ 0

holds,

and the theorem will follow if the first term in

(3.1)

tends to zero.

This is achieved if

8n

tends to zero

slowly enough.

COROLLARY 3.3. - If in addition to the

hypothesis

of theorem

3.2,

one has

gM = !! fn - f [ [ 2

==

9(~~~N’~),

them

We shall use the

following

theorem to

verify

that

THEOREM 3.4. - Define vi =

sup Suppose

that

x

them

Proof

-

now

(9)

on

applying

Holder’s

inequality

it follows that

but

then

according

to the

hypothesis.

4. EXAMPLES AND APPLICATIONS

a)

Let us consider the space

L~((-

rc,

rc)2)

with

respect

to

Lebesgue

measure, and the

complete

orthonormal

set 27T

. with

the obvious modifications in the

previous

sections to be able to include

complex

valued

functions f

we may

study

the estimators

(Here, Ck,j

are the Fourier coefficients

of f

and

Ck,j

their

estimators).

We

easily verify

and

and

by Beppo-Levi’s

Theorem:

Annales de l’lnstitut Henri Poincaré-Section B

(10)

305

MEASURE OF DEVIATION OF MULTIVARIATE DENSITY ESTIMATES

The bound

sup ) Dn(x) =

0

(log n),

for the Dirichlet-Kernel

( [10 ], p.151) gives :

Moreover = 1 and if we

put v

=

(2n

+

1)2

Finally

if

f

is

periodic continuously

difTerentiable in

C((2014

1t,

7r)~

until

the second order and it has three derivatives in

L 2

with

respect

to each variable i.

!!2

00

and ~ fyyy !!2

00, then is easy to

verify

that

g(n)

=

O(n-6)

=

(Vl/2N-1)

if n = NIX with ot

&#x3E; -.

Then,

under the above conditions

for f,

we

get

if

Note. - This result was obtained

by Naradaya [8 ]

in the univariate case,

although

our method

improves

the choice of the

exponent

a. With minor

changes

the same

proof applies

to densities on ~p.

b)

As a second

example

we consider an

asymptotic

test of

goodness

of fit for uniform distribution on a

sphere.

-

The basis for

L2(S2)

with the measure invariant

by

rotations is denoted

by { Y7(0, ~) }~=~

=

0,1, ... ~ 0

o

27~ - ~ ~

and cons-

tructed from the

spherical

harmonics

( [3 ],

p.

511).

We

put,

with the obvious notations -

where

(81, ~ 1 ),

...,

(0N,

is the observed

sample.

(11)

The statistic

Tn,N == II ,fn.N - 11Il2(s2)

can be used to test

uniformity.

One

easily

verifies that

A - Id

so that

0 - 1 62 - 2 Sm(n) n =

1.

Moreover n

Hence,

Theorem 3.2

gives

c)

As afinal

example, let f E L 2( - 1, 1), r(x)

=

1 and 03C6j

=

(2j

+

the sequence of

Legendre polynomials

So that

The

remaining

conditions can be verified

using

the

following

Theorem

( [5 ], p. 116).

THEOREM 4.1. - With the above notations and

f

E C

[ - 1, 1 ],

consider

Toeplitz

matrices.

If

~) (i

=

1, ..., n)

are the

eigenvalues

of them for each m

&#x3E;

1

holds true.

In our case, we get

and

nnales de l’lnstitut Henri Poincaré-Section B

(12)

MEASURE OF DEVIATION OF MULTIVARIATE DENSITY ESTIMATES 307

If assume that

[ - 1, I],

then =

0{n- 5) (see [6 ],

p.

209)

and

g(n)

= if n =

N°‘ ,

a &#x3E;

2/11.

Summing

up, if

f 4 =C[2014 1 ~ 1 ]&#x3E; n

=

N°‘ 2

a

1/4,

then

with

5. ASYMPTOTIC BEHAVIOUR UNDER CONTIGUOUS ALTERNATIVES

Suppose

that one wants to test the null

hypothesis

against

the sequence of alternatives

where &#x26; is a fixed function in

L2(/l)

and

ðN ~

0.

N

The

following

Theorem states that

= 2014!! fn~2

is

asympto-

n

tically gaussian

under The

proof

follows the lines of

[1 ],

th. 4.2 and

[8 ],

th.

4.2,

and the result can be

applied

to the

previous examples.

We must define before

THEOREM 5 .1. Under if A +

c( 20142014 ), 2n ~ 03C320

&#x3E;

0,

suppose THEOREM 5.1.- Under

HN,

If

An

= A + 0

n

(fn ~ (fo &#x3E;

0,

suppose also that the

hypothesis

of the Theorem 3.2 are satisfied for n =

NIX,

(2 -x) 0 a ao and

6r~

=

(N ~ )

then

Proof

- Denote

(13)

Where

EHN

denotes the

expectation

when the true

underlying

distribution has

density f N.

Let {

a

complete

orthonormal basis for

L2(/l)

and )’i = the

Fourier coefficients of the function ~. Define

We have

So that :

but

moreover

and

then we can conclude

and

finally applying

Theorem 3.2

Annales de l’lnstitut Henri Poincaré-Section B

(14)

309 MEASURE OF DEVIATION OF MULTIVARIATE DENSITY ESTIMATES

ACKNOWLEDGMENT

The author wishes to express his

gratitude

to Professor J.

Bretagnolle

for a careful

reading

of a first version of this paper, and for his valuable

suggestions

that

improved considerably

the results.

[1] P. J. BICKEL and M. ROSENBLATT, On some measures of the deviations of density

functions estimates. The Ann. of Statistics, t. 1, 6, 1973, p. 1071-1095.

[2] N. N. CENCOV, Evaluations of an unknown distribution density from observations.

Soviet Math., t. 3, 1962, p. 1559-1562.

[3] R. COURANT and D. HILBERT, Methods of Mathematical Physics, t. 1, New York, 1953.

[4] E. GINÉ, J. LEÓN, On the central limit theorem in Hilbert Spaces. Stochastica, t. IV,

1, 1980, p. 43-71.

[5] U. GRENANDER, Szegö. Toeplitz forms and their application. University of California

Press. Berkeley. 1958.

[6] E. ISACCSON, H. KELLER, « Analysis of Numerical Methods » John Wiley and Sons.

Inc. New York, 1966.

[7] KUELBS, KURTZ, Bery Essen estimates in Hilbert Space and an application to the LIL.

Ann. Probability, t. 2, 1974, p. 387-407.

[8] E. A. NARADAYA, On a quadratic measure of deviation of the estimate of a distribution density. Theo. Prob. and its App., 4, 1976, p. 844-850.

[9] A. ZYGMUND, Trygonometric series, 2nd Ed. Cambridge University Press. New York, 1959.

(Manuscrit reçu le 15 octobre 1982)

Références

Documents relatifs

(1.4.1) One can show that there is a ring morphism Poinc ℓ : K 0 (Var k ) → Z[t] extend- ing Poinc ℓ (in characteristic zero one may use the fact, proven by F.Bittner, that the class

These properties play a crucial role in the main theorem which estab- lishes that the value functions of the starting perturbation problems converge to the solution of an

Then, denoting by l(a) the number of occurrences of the pattern “01” in the binary expansion of a, we give the asymptotic behavior of this probability distribution as l(a) goes

Nicolas, “On practical partitions,” Collectanea Math.. Ramanujan, “Asymptotic formulae for the distribution of integers of various

We analyse the asymptotic behaviour of the eigenvalues and eigenvectors of a Steklov problem in a dumbbell domain consisting of two Lipschitz sets connected by a thin tube

used to study intensional properties of primitive recursive algorithms..

We obtain non-asymptotic Gaussian concentration bounds for the difference between the invariant distribution ν of an ergodic Brownian diffusion process and the empirical distribution

We determine the asymptotic behaviour of the number of Eulerian circuits in undirected simple graphs with large algebraic connectivity (the second-smallest eigenvalue of the