A NNALES DE L ’I. H. P., SECTION B
H EROLD D EHLING M ANFRED D ENKER W ALTER P HILIPP
The almost sure invariance principle for the empirical process of U-statistic structure
Annales de l’I. H. P., section B, tome 23, n
o2 (1987), p. 121-134
<http://www.numdam.org/item?id=AIHPB_1987__23_2_121_0>
© Gauthier-Villars, 1987, tous droits réservés.
L’accès aux archives de la revue « Annales de l’I. H. P., section B » (http://www.elsevier.com/locate/anihpb) implique l’accord avec les condi- tions générales d’utilisation (http://www.numdam.org/conditions). Toute uti- lisation commerciale ou impression systématique est constitutive d’une infraction pénale. Toute copie ou impression de ce fichier doit conte- nir la présente mention de copyright.
Article numérisé dans le cadre du programme Numérisation de documents anciens mathématiques
http://www.numdam.org/
The almost
sureinvariance principle
for the empirical process
of U-statistic structure
Herold DEHLING, Manfred DENKER and Walter
PHILIPP
Vol. 23, n° 2, 1987, p. 121-134. Probabilités et Statistiques
ABSTRACT. - We prove the
generalization
of Kiefer’s almost sureinvariance
principle
for theempirical
distribution function toempirical
processes of a certain class of
dependent
random variableshaving
a U-sta-tistic structure with
non-degenerate
kernel.Key-words : U-statistics, empirical process, almost sure invariance principle.
RESUME. - Nous montrons la
generalisation
d’un theoreme de Kiefer concernant leprincipe
d’invariance p. p. pour larepartition empirique
dans le cas de certains processus
empiriques qui
ont une structure commeles
U-statistiques.
1. INTRODUCTION
Empirical
processes of U-statistic structure were first consideredby Serfling [7] in
order tostudy generalized
L-statistics.(For
further moti-vations to
investigate
these processes see alsoJanssen, Serfling,
Vera-verbeke
[5 ]).
Let
Xi, X2,
... be i. i. d. random variables and let g : ~m -~ ~ be ameasurable kernel. As a
generalization
of the classicalempirical
distri-bution function
(for
the purpose of itsprobabilistic investigation
or itsAnnales de l’Institut Henri Poincaré - Probabilités et Statistiques - 0246-0203
Vol. 23/87/02/121/14/$ 3,40/© Gauthier-Villars 121
statistical
application)
one considers theempirical
distribution Jln definedby
the random variablesi. e. for put
If =
then {
>1 }
is said to be anempi-
rical process of U-statistic structure.
Serfling [7] proved
central limit theorems for functionals of theempi-
rical distribution function
(e.
d.f.)
of U-statistic structure. Silverman[9]
extended these results
by proving
the weak convergence ofn1~2(~c,~ - ,u)
to some continuous Gaussian process in
weighted
sup-norm metrics inD( -
oo, +Finally,
we should mention that Glivenko-Cantelli results appear inHelmers, Janssen, Serfling [4 ].
Though
the contents of this note is a little moregeneral,
our main result is the almost sureapproximation
ofby
a suitable Gaussian process with a remainder term of orderO(n1 2 ),
À > 0. This result extendsan earlier one
by Csorgo,
Horvath,Serfling [2] ]
and Kiefer’s classicalresult for the e. d. f. of i. i. d. random variables.
We would like to thank S.
Csorgo,
P. Janssen and R.Serfling
formaking
their papers
[2 ] [4 ]
and[~] ]
available to us.2. NOTATION AND STATEMENT OF RESULTS
A
Kiefer-process
is a two-parameter mean-zero Gaussian processK(s, t),
0 s1, t
> 0 with thefollowing
covariance structure:Given a
Kiefer-process K(s, t),
we define the Gaussian process03B3n(x1, ..., xm) by
mOur main result is :
THEOREM 1. After
possible
redefinition on alarger probability
spacethere exists a Kiefer process
K(s, t)
such that with yn defined in(2.1)
we have:P sup
n( ~n(t) - ~(t)) -
_oo5t_+x
Moreover,
1 where ~. _ .
2 400
Remark. In the classical case of a U-statistic
our result
gives
the well known weak invarianceprinciple
for squareintegrable, non-degenerate
kernels g. This can be shownby
standardmethods. First of all
(2.2)
shows the existence of a Gaussian process{ G(s, t) : 0 s _ 1, t E f~ ~
such that~c(t))
convergesweakly
to
G(s, t).
Thenweakly.
This last statement isobviously
truereplacing
theintegrand t by
a bounded function and a well knownL2-approximation.
In the same way Theorem 1 can be
applied
to other functionals of ~c, whichcorrespond
to classical statisticalprocedures (e.
g.quantiles).
Thishas been discussed
extensively
inSerfling [7] ]
and need not berepeated here,
but we shallgive
oneapplication
for the law of the iteratedlogarithm
at the end of this section. First of all we obtain a Functional Law of the Iterated
Logarithm
as acorollary
of Theorem 1.Vol. 2-1987.
COROLLARY 2.
2014 -B/"
is almostsurely relatively
2 log log n
compact in D
[o,
1]
and has thefollowing
set of limit functions :Theorem 1 will be a
corollary
of an almost sure bound fordegenerate U-statistics,
which we shall formulate below.By
D =we
denote,
asusual, the’ space
of all functionsf : ll~ -~
R which areright-
continuous and have left limits. A measurable function h :
[0,1 ]"" -~
Dis called a D-valued kernel of dimension m. h is said to be
degenerate,
ifand all i = 1, ..., m.
If J c ~ 1, ... , m ~
is an index set ofcardinality I,
we can define a kernelhj
of dimension
I by
/* ~ n , -*2014f
The
following
facts are well-known for R-valuedkernels,
but carry overimmediately
to the D-valued case. Define h*by
Then h* is a
degenerate
kernel and the(non-degenerate)
kernel h can bewritten as a sum of
degenerate
kernels in thefollowing
way:where
Pj :
f~m -~[Rill
denotes the obviousprojection.
DEFINITION 3. - The U-statistic with kernel h is defined
by
THEOREM 4. - Let h :
[o, 1 ]m --~
be a kernelsatisfying
(2. 7) h(xl, ... , xm)
is anon-decreasing
element of D for all(xl,
...,[0, 1 ]m
and
Then,
for thecorresponding degenerate
kernel h* in(2.5),
we haveRemark. -
’ ~ (
denotes the sup-norm in D andby ht(x 1,
...,xm)
wemean the value of ...,
xm)
at t E ~.For
non-degenerate
statistics we have thefollowing corollary:
COROLLARY 5. - Let h :
[0,1 ]~ -~
Dsatisfy (2. 7)
and(2 . 8).
Then thereexists
(after
apossible enlargening
of the basicprobability space)
a sequencet E
(~ ~J-1,2,...
ofindependent, identically
distributed Gaussian processes on R such that (03BB400
As an
application
of ourresults,
we obtain now the Law of the IteratedLogarithm
forgeneralized
L-statistics. Letbe the ordered values
of g(Xil’
...,Xim)
taken over allm-tuples (i 1,
...,im)
of distinct elements
..., n ~.
Let Cn,i, 1 _ i _ n~m), bearbitrary
Vol. 2-1987.
constants. Statistics
given by Ecn ’ ;Wn ; ’
were termed GL-statisticsby Serfling [7]. They generalize
linear functions of order statistics(L-sta- tistics)
as well as U-statistics.As noted in
Serfling [7],
asufficiently large
subclass of GL-statistics isgiven by
where -1n(t)
=inf {x : n(x) ~ t}
is thequantile
function. Under certainrestrictions, Serfling
provesasymptotic normality
ofThis is achieved
through
a linearization ofT(un) - T(,u), given by
where m =
,u‘. Jl)
is a U-statistic with kernelThe
asymptotic
variance ofn 1 ~2(T(,un) -
is thengiven by
~f~
THEOREM 6.
2014 Let
be twice differentiable and havepositive
derivatives at itspj-quantiles.
LetJ(t)
vanish for t outside[a, j~ ],
where 0 a/3 1,
and suppose that on J is bounded and continuousLebesgue-
ande. Assume that ...,
Xm)
oo and 062(T, ~c)
oo. Then3. A MOMENT
INEQUALITY
FOR DEGENERATE U-STATISTICS
For a kernel h :
[0, 1 ]’~ -~ f~ let )!
denote the usual LP-norm with respect toLebesgue-measure
Probabilités
LEMMA 7. - Let h :
[o, 1 ]’~ --~ ~
bedegenerate with II h [ [ 2 p
oo forsome
integer
p. ThenRemark. - This lemma
gives
aprecise
estimate for the « o » term in Lemma B ofSerfling ( [~ ],
p.186) indicating
thedependence
on p and m also. Estimates of this type are known to be useful when noexponential inequality
is available(but
can notreplace
the lattercompletely,
e. g. for moderate deviationresults). However,
for our purpose in theproof
ofTheorem 4 this lemma will be sufficient.
Proof
For each choice of
(pi, ... , pn)
the inner sumcontains {2pm) !
-
Pl ! ... Pn !
elements. We divide the outer sum into three parts
~1,
E2 andE3
wherein ~~ we sum over those
m-tuples (pi, ..., pn)
with one pi = 1 in~2
overthose with all pi
e { 0, 2 ~
and inL3
over theremaining
terms. Becauseof
degeneracy,
eachintegral
vanishes. InE2,
the outer sum has/ n pm)
summands,
the inner sum(2pm)! p1!...pn!
. To estimate theexpectations
inwe note that in the
product
each index occurs
exactly
twice. Hence we can subdivide thisproduct
into 2m - 1
subproducts,
each of themcontaining only independent
Vol. 2-1987.
random variables.
Applying
Holder’sinequality
first and thenindepen- dence,
we obtainHence
Stirling’s
formulayields :
In order to estimate
E3,
note thatonly
pm - 1 of thepi’s
in(pl,
...,pn~
can be non-zero.
Counting
the number ofn-tuples
with this restrictionwe find that there are
(
pm - n 1ways
to choose pm- the 1 non-zeropi’s
and atmost l0pm-1 ways of
solving
theequation qi = 2pm
in the non-
negative integers.
Henceapplying
Holder’sinequality
we find:4. PROOF OF THEOREM 4
To
simplify
thenotation,
we restrict ourselves to the case m = 3.(This
is the
simplest general
case for m. m =1,
2 areeasier.)
We definepartitions {
- oo =to(n)
...tn(n)
= +~}
such thatAt this
point
we have to assume that the map t --~X2 , X3 )
isAnnales de l’lnstitut Henri Poincaré - Probabilités Statistiques
continuous. If this is not the case, a minor modification of the
proof
isnecessary. Put :
Then we have :
where
We first estimate the first term on the r. h. s. of
(4. 2). Using
Lemma 7with p
= 3 we find :While
estimating
the second term on the r. h. s.of (4 . 2)
we putif no confusion arises from this.
Using (2 . 5), (2.6)
and themonotonicity
of ht we find :
By (4.1),
the last term is boundedby
8n2. For K= ~ 1, 2, 3 ~
we canapply (4.4),
sinceVol. 2-1987.
For K
= ~ 1, 2 ~
we findusing
Lemma 7with p
=log
n :For K ==
{ 1 ~
we findusing
Bernstein’sinequality (see
Bennett[l ]
and
(4.1))
(4.2), (4.4)-(4.8) together
with anapplication
of the Borel-Cantelli Lemmagive
the desired result.5. PROOF OF THEOREM 1
We shall prove Theorem 1
using
Theorem 4 and thefollowing
almostsure invariance
principle
forindependent identically
formedD(R)-valued
random elements :
PROPOSITION 8. i >
1 }
be a sequence ofindependent
iden-tically
distributed real-valued random variables andD(R)
be a map
satisfying
(5.1) i) h(x)
is anon-decreasing
element of D for all x E Rii)
0hlx)
1.Then after a
possible
redefinition on alarger probability
space thereProbabilités
exists a sequence of
independent, identically
distributed Gaussian processes~ X(s, n) : ~,
n >1,
such thatfor some constant H > 0 and £ =
1/2
400.In
particular,
withprobability
one. - ,
Proof
-(D(~),11- I I )
is a(non-separable)
Banach space and we canapply
Theorem 6.1 ofDudley
andPhilipp [3 ]. Thus,
it suffices to showthat for each m =
1,2,
... one can find apartition
such that
’ " " ’
The
proof
of this carries over almost verbatim fromDudley
andPhilipp,
Theorem 7.4 and we shall omit it here.
We now introduce the
map h : [o, 1 ]m --~ by
Here ht
is defined in Remark after Theorem 4. We shallapply Proposition
8m
to the map
f t(x) = 03A3(h{i})t(x)
as defined in(2.4). By (5.2)
we canapproxi- mate { ft(Xj) - E ft(Xj)} by Y J (t),
whereYj(.), j
=1, 2, ...
arei. i. d. Gaussian processes. We want to
identify
this limit process as a certain stochasticintegral. Let ~ Bk(t), 0 _ t __ 1 ~k? 1
be a sequence of i. i. d.Brownian
bridges
on[o, 1 ]
and defineVol. 2-1987.
It is easy to
verify
thatf 1
The covariance structure of the
process
E can becomputed
as follows : " °But this is
just
the covarianceof { f ’t(X 1 ), t
e[R }
and henceof { Yi(t),
t~R}.
Hence the two Gaussian
processes 10 ft(x)dB1(x), t e R} and {Y1(t),
t e R}
have the same distribution and
using (5.4)
and(5.5)
we findUsing
Lemma 2.11 ofDudley
andPhilipp
we mayactually
assume thatboth sides of
(5 . 6)
are identical a. e.To finish the
proof
of Theorem 1 weapply
Theorem 4 to the terms withI J -- m -
2 on theright-hand
side of(2. 6).
This and the above remarkstogether
with(5.2) yield
Theorem 1.6. PROOF OF THEOREM 6
As is shown in
Serfling [7]
can bedecomposed
into alinear part and a small remainder in the
following
way:where
Probabilités
and
By
the law of the iteratedlogarithm
fornon-degenerate
U-statistics it suffices to show(6 . 5)
can beproved
as inSerfling [7], replacing
his Lemma 3 . 3by
ourCorollary 2,
which showsthat [[
=O(n-1~2 (log log n)1~2).
The indi-vidual summands in
(6 . 3)
are the remainder terms in ageneralized
Bahadurrepresentation
for thequantiles
of the e. d. f. of U-statistics structure.Then
(6.6)
is a consequence of thefollowing proposition.
PROPOSITION 9. - Let 0 p 1 and suppose
that /1
is twice differen- tiable at with~c’( ,u -1 ( p))
> 0. Thenwhere
The
proof
of thisProposition
isanalogous
to theproof
of the classicalBahadur
representation,
asgiven
inSerfling [6],
e. g. Theonly
differenceis that
Hoeffding’s
and Bernstein’sinequalities
have to bereplaced by
their counterparts for U-statistics
(Serfling [6 ],
p.201).
REFERENCES
[1] G. BENNETT, Probability inequalities for the sum of independent random variables.
JASA, t. 57, 1962, p. 33-45.
[2] S. CSÖRGÖ, L. HORVATH, R. SERFLING, An approximation for the empirical process of U-statistic structure. Technical report, Johns Hopkins University, 1983.
Vol. 23, n° 2-1987.
[3] R. M. DUDLEY, W. PHILIPP, Invariance principles for sums of Banach space valued random elements and empirical processes. Zeitschrift Wahrscheinlichkeitstheorie
verw. Gebiete, t. 62, 1983, p. 509-552.
[4] R. HELMERS, P. JANSSEN, R. SERFLING, Asymptotic theory for the e. d. f. of U-sta-
tistics and applications to GL-statistics and the bootstrap. Preprint, 1984-1985.
[5] P. JANSSEN, R. SERFLING, N. VERAVERBEKE, Asymptotic normality for a general
class of statistical functions and applications to measures of spread. Ann. Stat.,
t. 12, 1984, p. 1369-1379.
[6] R. SERFLING, Approximation Theorems of Mathematical Statistics. John Wiley
and Sons, New York, 1980.
[7] R. SERFLING, Generalized L-, M- and R-statistics. Ann. Stat., t. 12, 1984, p. 76-86.
[8] B. W. SILVERMAN, Limit theorems for dissociated random variables. Adv. Appl.
Prob., t. 8, 1976, p. 806-819.
[9] B. W. SILVERMAN, Convergence of a class of empirical distribution functions of
dependent random variables. Ann. Prob., t. 11, 1983, p. 745-751.
(Manuscrit reçu le 25 avril 1985) (Revise le 13 août 1986)