Asymptotic behaviour of the quadratic measure of deviation of multivariate density estimates

(1)

A NNALES DE L ’I. H. P., SECTION B

J OSÉ R AFAEL L EON R.

Asymptotic behaviour of the quadratic measure of deviation of multivariate density estimates

Annales de l’I. H. P., section B, tome 19, n

^o

3 (1983), p. 297-309

<http://www.numdam.org/item?id=AIHPB_1983__19_3_297_0>

L’accès aux archives de la revue « Annales de l’I. H. P., section B » (http://www.elsevier.com/locate/anihpb) implique l’accord avec les conditions générales d’utilisation (http://www.numdam.org/conditions). Toute utilisation commerciale ou impression systématique est constitutive d’une infraction pénale. Toute copie ou impression de ce fichier doit conte- nir la présente mention de copyright.

Article numérisé dans le cadre du programme Numérisation de documents anciens mathématiques

http://www.numdam.org/

(2)

Asymptotic ^behaviour

of the quadratic

^measure

^of ^deviation

of multivariate density ^estimates

José Rafael LEON R.

Departamento ^deMatematicas, Facultad de Ciencias, Universidad Central de Venezuela, Caracas, Aptdo. ^n°²¹²⁰¹

Ann. Inst. Henri Poincaré.

Vol. XIX, ^n°3-1983, p. 297-309.

Section B : Calcul des Probabilités et Statistique.

RESUME. ^-Nous obtenons un test

d’adequation

^{de la}distribution

asymptotique 112

^et^nous’prouvons

également

^{que la}^statis-

tique

consideree est

asymptotiquement gaussienne

^sous^les

hypotheses

^de

contiguite

de la forme

fN

⁼

f °

⁺

L2(/l), 6r~ (

^0.

ABSTRACT. - We obtain ^atest of

goodness

of fit from the

asymptotic

distribution

112

^and^we^alsoprove that the statistic under consi.deration is

asymptotically gaussian

^under

contiguous

alternatives of the

form fN

⁼

f °

⁺ ^E

L2(1l), ðN !

^0.

Key

words and

phrases:

Multidimensional

density estimates,

quadratic measure,

asymptotic distribution,

^test.^of

goodness

^{of fit.}

1. INTRODUCTION

Set

X2,

^...,

XN,

^... ^be^asequence of

independent identically

distributed random vector with values in We shall _supposethat their

common distribution has a

density f

^with

respect

^to

Lebesgue

^measure

and

that f

^E

L2(~), where J1

^is^a^Borel

probability

measure on with

density

^withrespect ^to

Lebesgue

^measure.

Annales de l’lnstitut Henri Poincaré-Section B-Vol. XIX, 0020-2347/ 1983’29 7/S 5,00

(3)

If

{~ }j~

¹^is

^complete

orthonormal system ⁱⁿ ^the^n-th

partial

sum of the

respective

Fourier series for

f

^is

where

Cencov _[2] _]

defines the

following

^estimator^of

fn :

where the

3§s

^areestimators

of aj

^defined

by

FN being,

^as

usual,

^the

empirical

distribution function of the

sample X b X2, ... , XN.

The aim of this _paperis to

give

conditions under which

When

appropriately centered,

^has

gaussian asymptotic

distribution.

The method we use is

inspired by Naradaya [8 ], although

instead of

using

the

strong approximation

^{of the}

empirical

process

by

^a^Brownian

bridge,

we

approximate

^the^estimator

by

functions of Gaussian variables with values in

L2(~c).

^For^these^ones,the result follows from the central limit theorem ^onthe real

line,

^{and the}

approximation

^allows^to

study

^the

behaviour of

N n ^II ^{II I}

^.

The result is

applied

^to^various

complete

orthonormal sets.

Finally,

^we^consider^tests^of

goodness

^{of fit}

fn ~ ~ 2

^and

the behaviour of

f I I2,

^which

permit together

^to

study

^the

asymptotic

behaviour of the

statistic f ~2.

Annales de l’Institut henri Poincaré-Section B

(4)

299 MEASURE OF DEVIATION OF MULTIVARIATE DENSITY ESTIMATES

2. ASSOCIATED GAUSSIAN VARIABLES We have defined

If we

put

it is clear that

If ^weconsider the

Yn,k

^as

independent identically

distributed random variables with values in we have

and

where

rn

is the covariance of and

(., . )

is the scalar

product

ⁱⁿ

L2(,u).

It follows that

Define the centered

gaussian

random variable with values in

L2Cu) by

in such a way that

Zl,n

^and

Ynsk

^{have the}^samecovariance.

Now,

^if

Ço

^is

a normalized

gaussian

real random

variable, independent from ~ ~~ }~‘-1,

consider the random variable

(with

values in

(5)

Them

Z2,n

^is

gaussian,

⁼0 and its covariance satisfies

LEMMA 2 .1. CO. then

P~~oof. _

The result ^nowfollows from

Before

studying

^the

asymptotic

^behaviour^of

Zl,n

^let^us^define:

Note that

~n-1

¹ ^are^the

eigenvalues

^ofAn.

LEMMA 2. 2.

- Suppose

that there exists m > 3 such

that 1 S - 0 1 . Then,

^{if lim}n -+ (’£’,

(1; = ⁶²

>

0,

^we^have ⁿ

Annales de l’lnstitut Henri Poincaré-Section B

(6)

So that

The result follows

letting n -

^00.

THEOREM ~ . 3. Under the same

hypothesis

^ofLemma 2 . 2 we have

Proof

^{- Since}

Z2,n

^is

gaussian

^{we can}^find^a

basis {e1, ... , en}

^such

that

where _Yi,..., y~ ^arenormalized

independent gaussian

random variables. So

and

The result follows from

Together

with Lemma 2.2 and

Lindeberg’s

^Theorem^on^{the line.}

REMARK. - From Lemma 2.1 and

Z~ ~

⁼ ^we^obtain

3. MAIN THEOREM

The

following

Theorem is due to Kuelbs and Kurtz

[7] see

^{also Gine}

and Leon

[4 ].

^The^statement^is

adapted

^to^ourpresent needs.

THEOREM 3.1. ^- be

independent identically

distributed

(7)

random variables with values in

E(Yi)

⁼

0, Y1 113)

⁰⁰

and Z1

a centered

gaussian

variable with the ^samecovariance ^as

Yi. Then,

^for each t and 6 >

0,

holds true.

We now prove ^ourmain result:

THEOREM 3 .2. ^-

Suppose

^{that for}some a > 0

1

If, additionally, ~fr~03B1

^x,

03C32n

^~

^03C32

> o

and -Sm(n)

⁼

0(1)

^for^some

n

/11 >_

3,

^them ..- --,

Proof

^{- Define}

Gn,N(t)

^and

^Gn(t)

^{in the}

^following

^way:

By

theorem 3.1.

But

(8)

MEASURE OF DEVIATION OF MULTIVARIATE DENSITY ESTIMATES 303

If we choose 5 ==

6(n)

such that

5~~~~ -~ 0,

the remark

following

theorem 2.3 and the fact that

A~

^is

bounded,

^since

imply :

Where

~~

denotes the normal

(0, 0-2)

distribution.

Finally,

^if^we^choose

6(n)

⁼ ^with

8n

^~

0, ^{~2n1~2 _} 8n

^~⁰

holds,

and the theorem will follow if the first term in

(3.1)

^tends^to^zero.

This is achieved if

8n

^tends^to^zero

slowly enough.

COROLLARY 3.3. ^-If in addition to the

hypothesis

of theorem

3.2,

one has

gM = !! fn - f [ [ 2

⁼⁼

9(~~~N’~),

^them

We shall use the

following

^theorem^to

verify

^that

THEOREM 3.4. ^-Define vi ⁼

sup Suppose

^that

x

them

Proof

^-

now

(9)

on

applying

^Holder’s

inequality

it follows that

but

then

according

^to^the

hypothesis.

4. EXAMPLES AND APPLICATIONS

a)

^Let^usconsider the space

L~((-

^rc,

rc)2)

^with

respect

^to

Lebesgue

measure, and the

complete

orthonormal

set ^27T

^. ^with

the obvious modifications in the

complex

^valued

functions f

^we^may

study

the estimators

(Here, Ck,j

^arethe Fourier coefficients

of f

^and

Ck,j

^their

estimators).

We

easily verify

and

by Beppo-Levi’s

^Theorem:

(10)

305

MEASURE OF DEVIATION OF MULTIVARIATE DENSITY ESTIMATES

The bound

sup ) Dn(x) =

⁰

(log n),

for the Dirichlet-Kernel

( [10 ], p.151) gives :

Moreover ⁼1 and if we

put v

⁼

(2n

⁺

1)2

Finally

^if

f

^is

periodic continuously

difTerentiable in

C((2014

^1t,

7r)~

^until

the second order and it has three derivatives in

L 2

with

respect

^to^each variable i.

!!2

⁰⁰

and ~ fyyy ^!!2

^00,^{then is}^easy^to

verify

^that

g(n)

⁼

O(n-6)

⁼

(Vl/2N-1)

^{if n}⁼^{NIX with}^ot

> -.

Then,

under the above conditions

for f,

^we

get

if

Note. - This result was obtained

by Naradaya [8 ]

in the univariate case,

although

^our^method

improves

the choice of the

exponent

^a.^With minor

changes

^the^same

proof applies

^to^densities^on^~p.

b)

^As^a^second

example

^we^consider^an

asymptotic

^test^of

goodness

of fit for uniform distribution on a

sphere.

^-

The basis for

L2(S2)

^{with the}^measure^invariant

by

rotations is denoted

by { Y7(0, ~) }~=~

⁼

0,1, ... ~ 0

^o

^{27~ - ~} ^~

^and^cons-

tructed from the

spherical

^harmonics

( [3 ],

p.

511).

We

put,

^{with the}obvious notations -

where

(81, ~ 1 ),

^...,

(0N,

^is^the^observed

sample.

(11)

The statistic

Tn,N == II ,fn.N - 11Il2(s2)

^can^{be used}^to^test

uniformity.

One

easily

verifies that

A - Id

^so^that

0 - 1 62 - 2 Sm(n) n =

1.

Moreover ⁿ

Hence,

Theorem 3.2

gives

c)

^{As afinal}

example, let f E L 2( - 1, 1), r(x)

⁼

1 and 03C6j

⁼

(2j

⁺

the _sequenceof

Legendre polynomials

So that

The

remaining

conditions can be verified

using

^the

following

^Theorem

( [5 ], p. 116).

THEOREM 4.1. - With the above notations and

f

^E^C

[ - 1, 1 ],

^consider

Toeplitz

^matrices.

If

~) (i

⁼

1, ..., n)

^are^the

eigenvalues

^of them for each m

>

1

holds true.

In our case, ^weget

and

nnales de l’lnstitut Henri Poincaré-Section B

(12)

MEASURE OF DEVIATION OF MULTIVARIATE DENSITY ESTIMATES 307

If assume that

[ - 1, I],

^then ⁼

0{n- 5) (see [6 ],

p.

209)

^and

g(n)

⁼ ^{if n}⁼

N°‘ ,

^a>

2/11.

Summing

^up,^if

f 4 =C[2014 1 ~ 1 ]> n

⁼

N°‘ 2

^a

1/4,

^then

with

5. ASYMPTOTIC BEHAVIOUR UNDER CONTIGUOUS ALTERNATIVES

Suppose

^that^onewants to test the null

hypothesis

against

the sequence of alternatives

where & is ^afixed function in

L2(/l)

^and

ðN ~

^0.

N

The

following

^Theorem^states^that ^’

= 2014!! fn~2

^is

asympto-

n

’

tically gaussian

^under ^The

proof

follows the lines of

[1 ],

th. 4.2 and

[8 ],

th.

4.2,

and the result can be

applied

^to^the

previous examples.

We must define before

THEOREM 5 .1. Under if A +

c( 20142014 ), 2n ~ 03C320

>

0,

suppose THEOREM 5.1.- Under

HN,

^If

An

⁼^A⁺⁰

n

(fn ^~(fo >

0,

suppose also that the

hypothesis

of the Theorem 3.2 are satisfied for n ⁼

NIX,

(2 -x) 0 a ao and

6r~

⁼

(N ~ )

^then

Proof

^{- Denote}

(13)

Where

EHN

denotes the

expectation

^{when the}^true

underlying

distribution has

density f N.

Let {

^a

complete

orthonormal basis for

L2(/l)

^and^)’i⁼ ^the

Fourier coefficients of the function ^~.Define

We have

So that :

but

moreover

and

then we can conclude

and

finally applying

Theorem 3.2

(14)

ACKNOWLEDGMENT

The author wishes to express his

gratitude

^toProfessor J.

Bretagnolle

for ^acareful

reading

^of^afirst version of this _paper,and for his valuable

suggestions

^that

improved considerably

the results.

[1] P. J. BICKEL and M. ROSENBLATT, ^Onsome measures of the deviations of density

functions estimates. The Ann. of Statistics, ^t.1, ^n°6, 1973, p. 1071-1095.

[2] ^{N. N.}CENCOV, Evaluations of an unknown distribution density from observations.

Soviet Math., ^t.3, 1962, p. 1559-1562.

[3] R. COURANT and D. HILBERT, ^Methodsof Mathematical Physics, ^t.1, ^NewYork, ^1953.

[4] ^E.GINÉ, ^J.LEÓN, On the central limit theorem in Hilbert Spaces. Stochastica, ^t.IV,

n° 1, 1980, p. ^43-71.

[5] ^U.GRENANDER, Szegö. Toeplitz ^forms^{and their}application. University ^of^California

Press. Berkeley. ^1958.

[6] ^E.ISACCSON, ^H.KELLER, ^«Analysis ^of^Numerical^{Methods »}^JohnWiley ^{and Sons.}

Inc. New York, ^1966.

[7] KUELBS, KURTZ, Bery Essen estimates in Hilbert Space ^and^anapplication ^to^{the LIL.}

Ann. Probability, ^t.2, 1974, p. 387-407.

[8] ^{E. A.}NARADAYA, ^On^aquadratic ^measureof deviation of the estimate of a distribution density. Theo. Prob. and its App., ^n°^{4, 1976,}p. 844-850.

[9] ^A.ZYGMUND, Trygonometric series, 2nd Ed. Cambridge University Press. New York, 1959.

(Manuscrit reçu le 15 octobre 1982)

Asymptotic behaviour of the quadratic measure of deviation of multivariate density estimates

A NNALES DE L ’I. H. P., SECTION B

J OSÉ R AFAEL L EON R.

Asymptotic behaviour of the quadratic measure of deviation of multivariate density estimates

Annales de l’I. H. P., section B, tome 19, n

3 (1983), p. 297-309

Asymptotic behaviour

of the quadratic

of deviation

of multivariate density estimates

d’adequation

asymptotique 112

également

tique

asymptotiquement gaussienne

hypotheses

contiguite

fN

f °

L2(/l), 6r~ (

goodness

asymptotic

112

asymptotically gaussian

contiguous

form fN

f °

L2(1l), ðN !

Key

phrases:

density estimates,

asymptotic distribution,

goodness

X2,

XN,

independent identically

density f

respect

Lebesgue

that f

L2(~), where J1

probability

density

Lebesgue

{~ }j~

complete

partial

respective

f

Cencov [2] ]

following

fn :

3§s

of aj

by

FN being,

usual,

empirical

sample X b X2, ... , XN.

give

appropriately centered,

gaussian asymptotic

inspired by Naradaya [8 ], although

using

strong approximation

empirical

by

bridge,

approximate

by

L2(~c).

line,

approximation

study

N n II II I

applied

complete

Finally,

goodness

fn ~ ~ 2

Asymptotic ^behaviour

^of ^deviation

of multivariate density ^estimates

^complete

Cencov _[2] _]

N n ^II ^{II I}

(1; = ⁶²

^03C32

^Gn(t)

^following