A NNALES DE L ’I. H. P., SECTION B
J OSÉ R AFAEL L EON R.
Asymptotic behaviour of the quadratic measure of deviation of multivariate density estimates
Annales de l’I. H. P., section B, tome 19, n
o3 (1983), p. 297-309
<http://www.numdam.org/item?id=AIHPB_1983__19_3_297_0>
© Gauthier-Villars, 1983, tous droits réservés.
L’accès aux archives de la revue « Annales de l’I. H. P., section B » (http://www.elsevier.com/locate/anihpb) implique l’accord avec les condi- tions générales d’utilisation (http://www.numdam.org/conditions). Toute uti- lisation commerciale ou impression systématique est constitutive d’une infraction pénale. Toute copie ou impression de ce fichier doit conte- nir la présente mention de copyright.
Article numérisé dans le cadre du programme Numérisation de documents anciens mathématiques
http://www.numdam.org/
Asymptotic behaviour
of the quadratic
measureof deviation
of multivariate density estimates
José Rafael LEON R.
Departamento de Matematicas, Facultad de Ciencias, Universidad Central de Venezuela, Caracas, Aptdo. n° 21201
Ann. Inst. Henri Poincaré.
Vol. XIX, n° 3-1983, p. 297-309.
Section B : Calcul des Probabilités et Statistique.
RESUME. - Nous obtenons un test
d’adequation
de la distributionasymptotique 112
et nous ’prouvonségalement
que la statis-tique
consideree estasymptotiquement gaussienne
sous leshypotheses
decontiguite
de la formefN
=f °
+L2(/l), 6r~ (
0.ABSTRACT. - We obtain a test of
goodness
of fit from theasymptotic
distribution
112
and we also prove that the statistic under consi.deration isasymptotically gaussian
undercontiguous
alternatives of theform fN
=f °
+ EL2(1l), ðN !
0.Key
words andphrases:
Multidimensionaldensity estimates,
qua- dratic measure,asymptotic distribution,
test. ofgoodness
of fit.1. INTRODUCTION
Set
X2,
...,XN,
... be a sequence ofindependent identically
distributed random vector with values in We shall suppose that their
common distribution has a
density f
withrespect
toLebesgue
measureand
that f
EL2(~), where J1
is a Borelprobability
measure on withdensity
with respect toLebesgue
measure.Annales de l’lnstitut Henri Poincaré-Section B-Vol. XIX, 0020-2347/ 1983’29 7/S 5,00
If
{~ }j~
1 iscomplete
orthonormal system in the n-thpartial
sum of the
respective
Fourier series forf
iswhere
Cencov [2] ]
defines thefollowing
estimator offn :
where the
3§s
are estimatorsof aj
definedby
FN being,
asusual,
theempirical
distribution function of thesample X b X2, ... , XN.
The aim of this paper is to
give
conditions under whichWhen
appropriately centered,
hasgaussian asymptotic
distribution.The method we use is
inspired by Naradaya [8 ], although
instead ofusing
the
strong approximation
of theempirical
processby
a Brownianbridge,
we
approximate
the estimatorby
functions of Gaussian variables with values inL2(~c).
For these ones, the result follows from the central limit theorem on the realline,
and theapproximation
allows tostudy
thebehaviour of
N n II II I
.The result is
applied
to variouscomplete
orthonormal sets.Finally,
we consider tests ofgoodness
of fitfn ~ ~ 2
andthe behaviour of
f I I2,
whichpermit together
tostudy
theasymptotic
behaviour of thestatistic f ~2.
Annales de l’Institut henri Poincaré-Section B
299 MEASURE OF DEVIATION OF MULTIVARIATE DENSITY ESTIMATES
2. ASSOCIATED GAUSSIAN VARIABLES We have defined
If we
put
it is clear that
If we consider the
Yn,k
asindependent identically
distributed random variables with values in we haveand
where
rn
is the covariance of and(., . )
is the scalarproduct
inL2(,u).
It follows that
Define the centered
gaussian
random variable with values inL2Cu) by
in such a way that
Zl,n
andYnsk
have the same covariance.Now,
ifÇo
isa normalized
gaussian
real randomvariable, independent from ~ ~~ }~‘-1,
consider the random variable
(with
values inThem
Z2,n
isgaussian,
= 0 and its covariance satisfiesLEMMA 2 .1. CO. then
P~~oof. _
The result now follows from
Before
studying
theasymptotic
behaviour ofZl,n
let us define:Note that
~n-1
1 are theeigenvalues
of An.LEMMA 2. 2.
- Suppose
that there exists m > 3 suchthat 1 S - 0 1 . Then,
if lim n -+ (’£’,(1; = 62
>0,
we have nAnnales de l’lnstitut Henri Poincaré-Section B
301 MEASURE OF DEVIATION OF MULTIVARIATE DENSITY ESTIMATES
So that
The result follows
letting n -
00.THEOREM ~ . 3. Under the same
hypothesis
of Lemma 2 . 2 we haveProof
- SinceZ2,n
isgaussian
we can find abasis {e1, ... , en}
suchthat
where Yi, ..., y~ are normalized
independent gaussian
random variables. Soand
The result follows from
Together
with Lemma 2.2 andLindeberg’s
Theorem on the line.REMARK. - From Lemma 2.1 and
Z~ ~
= we obtain3. MAIN THEOREM
The
following
Theorem is due to Kuelbs and Kurtz[7] see
also Gineand Leon
[4 ].
The statement isadapted
to our present needs.THEOREM 3.1. - be
independent identically
distributedrandom variables with values in
E(Yi)
=0, Y1 113)
00and Z1
a centeredgaussian
variable with the same covariance asYi. Then,
for each t and 6 >0,
holds true.
We now prove our main result:
THEOREM 3 .2. -
Suppose
that for some a > 01
If, additionally, ~fr~03B1
x,03C32n
~03C32
> oand -Sm(n)
=0(1)
for somen
/11 >_
3,
them ..- --,Proof
- DefineGn,N(t)
andGn(t)
in thefollowing
way:By
theorem 3.1.But
Annales de l’lnstitut Henri Poincaré-Section B
MEASURE OF DEVIATION OF MULTIVARIATE DENSITY ESTIMATES 303
If we choose 5 ==
6(n)
such that5~~~~ -~ 0,
the remarkfollowing
theorem 2.3 and the fact that
A~
isbounded,
sinceimply :
Where
~~
denotes the normal(0, 0-2)
distribution.Finally,
if we choose6(n)
= with8n
~0, ~2n1~2 _ 8n
~ 0holds,
and the theorem will follow if the first term in(3.1)
tends to zero.This is achieved if
8n
tends to zeroslowly enough.
COROLLARY 3.3. - If in addition to the
hypothesis
of theorem3.2,
one has
gM = !! fn - f [ [ 2
==9(~~~N’~),
themWe shall use the
following
theorem toverify
thatTHEOREM 3.4. - Define vi =
sup Suppose
thatx
them
Proof
-now
on
applying
Holder’sinequality
it follows thatbut
then
according
to thehypothesis.
4. EXAMPLES AND APPLICATIONS
a)
Let us consider the spaceL~((-
rc,rc)2)
withrespect
toLebesgue
measure, and the
complete
orthonormalset 27T . with
the obvious modifications in the
previous
sections to be able to includecomplex
valuedfunctions f
we maystudy
the estimators(Here, Ck,j
are the Fourier coefficientsof f
andCk,j
theirestimators).
We
easily verify
and
and
by Beppo-Levi’s
Theorem:Annales de l’lnstitut Henri Poincaré-Section B
305
MEASURE OF DEVIATION OF MULTIVARIATE DENSITY ESTIMATES
The bound
sup ) Dn(x) =
0(log n),
for the Dirichlet-Kernel( [10 ], p.151) gives :
Moreover = 1 and if we
put v
=(2n
+1)2
Finally
iff
isperiodic continuously
difTerentiable inC((2014
1t,7r)~
untilthe second order and it has three derivatives in
L 2
withrespect
to each variable i.!!2
00and ~ fyyy !!2
00, then is easy toverify
thatg(n)
=O(n-6)
=(Vl/2N-1)
if n = NIX with ot> -.
Then,
under the above conditionsfor f,
weget
if
Note. - This result was obtained
by Naradaya [8 ]
in the univariate case,although
our methodimproves
the choice of theexponent
a. With minorchanges
the sameproof applies
to densities on ~p.b)
As a secondexample
we consider anasymptotic
test ofgoodness
of fit for uniform distribution on a
sphere.
-The basis for
L2(S2)
with the measure invariantby
rotations is denotedby { Y7(0, ~) }~=~
=0,1, ... ~ 0
o27~ - ~ ~
and cons-tructed from the
spherical
harmonics( [3 ],
p.511).
We
put,
with the obvious notations -where
(81, ~ 1 ),
...,(0N,
is the observedsample.
The statistic
Tn,N == II ,fn.N - 11Il2(s2)
can be used to testuniformity.
One
easily
verifies thatA - Id
so that0 - 1 62 - 2 Sm(n) n =
1.Moreover n
Hence,
Theorem 3.2gives
c)
As afinalexample, let f E L 2( - 1, 1), r(x)
=1 and 03C6j
=(2j
+the sequence of
Legendre polynomials
So that
The
remaining
conditions can be verifiedusing
thefollowing
Theorem( [5 ], p. 116).
THEOREM 4.1. - With the above notations and
f
E C[ - 1, 1 ],
considerToeplitz
matrices.If
~) (i
=1, ..., n)
are theeigenvalues
of them for each m>
1holds true.
In our case, we get
and
nnales de l’lnstitut Henri Poincaré-Section B
MEASURE OF DEVIATION OF MULTIVARIATE DENSITY ESTIMATES 307
If assume that
[ - 1, I],
then =0{n- 5) (see [6 ],
p.209)
andg(n)
= if n =N°‘ ,
a >2/11.
Summing
up, iff 4 =C[2014 1 ~ 1 ]> n
=N°‘ 2
a1/4,
thenwith
5. ASYMPTOTIC BEHAVIOUR UNDER CONTIGUOUS ALTERNATIVES
Suppose
that one wants to test the nullhypothesis
against
the sequence of alternativeswhere & is a fixed function in
L2(/l)
andðN ~
0.N
The
following
Theorem states that ’= 2014!! fn~2
isasympto-
n
’
tically gaussian
under Theproof
follows the lines of[1 ],
th. 4.2 and[8 ],
th.
4.2,
and the result can beapplied
to theprevious examples.
We must define before
THEOREM 5 .1. Under if A +
c( 20142014 ), 2n ~ 03C320
>0,
suppose THEOREM 5.1.- UnderHN,
IfAn
= A + 0n
(fn ~ (fo >
0,
suppose also that thehypothesis
of the Theorem 3.2 are satisfied for n =NIX,
(2 -x) 0 a ao and
6r~
=(N ~ )
thenProof
- DenoteWhere
EHN
denotes theexpectation
when the trueunderlying
distribution hasdensity f N.
Let {
acomplete
orthonormal basis forL2(/l)
and )’i = theFourier coefficients of the function ~. Define
We have
So that :
but
moreover
and
then we can conclude
and
finally applying
Theorem 3.2Annales de l’lnstitut Henri Poincaré-Section B
309 MEASURE OF DEVIATION OF MULTIVARIATE DENSITY ESTIMATES
ACKNOWLEDGMENT
The author wishes to express his
gratitude
to Professor J.Bretagnolle
for a careful
reading
of a first version of this paper, and for his valuablesuggestions
thatimproved considerably
the results.[1] P. J. BICKEL and M. ROSENBLATT, On some measures of the deviations of density
functions estimates. The Ann. of Statistics, t. 1, n° 6, 1973, p. 1071-1095.
[2] N. N. CENCOV, Evaluations of an unknown distribution density from observations.
Soviet Math., t. 3, 1962, p. 1559-1562.
[3] R. COURANT and D. HILBERT, Methods of Mathematical Physics, t. 1, New York, 1953.
[4] E. GINÉ, J. LEÓN, On the central limit theorem in Hilbert Spaces. Stochastica, t. IV,
n° 1, 1980, p. 43-71.
[5] U. GRENANDER, Szegö. Toeplitz forms and their application. University of California
Press. Berkeley. 1958.
[6] E. ISACCSON, H. KELLER, « Analysis of Numerical Methods » John Wiley and Sons.
Inc. New York, 1966.
[7] KUELBS, KURTZ, Bery Essen estimates in Hilbert Space and an application to the LIL.
Ann. Probability, t. 2, 1974, p. 387-407.
[8] E. A. NARADAYA, On a quadratic measure of deviation of the estimate of a distribution density. Theo. Prob. and its App., n° 4, 1976, p. 844-850.
[9] A. ZYGMUND, Trygonometric series, 2nd Ed. Cambridge University Press. New York, 1959.
(Manuscrit reçu le 15 octobre 1982)