A NNALES DE L ’I. H. P., SECTION B
D. B ITOUZÉ B. L AURENT P. M ASSART
A Dvoretzky-Kiefer-Wolfowitz type inequality for the Kaplan-Meier estimator
Annales de l’I. H. P., section B, tome 35, n
o6 (1999), p. 735-763
<http://www.numdam.org/item?id=AIHPB_1999__35_6_735_0>
© Gauthier-Villars, 1999, tous droits réservés.
L’accès aux archives de la revue « Annales de l’I. H. P., section B » (http://www.elsevier.com/locate/anihpb) implique l’accord avec les condi- tions générales d’utilisation (http://www.numdam.org/conditions). Toute uti- lisation commerciale ou impression systématique est constitutive d’une infraction pénale. Toute copie ou impression de ce fichier doit conte- nir la présente mention de copyright.
Article numérisé dans le cadre du programme Numérisation de documents anciens mathématiques
http://www.numdam.org/
735
A Dvoretzky-Kiefer-Wolfowitz type inequality
for the Kaplan-Meier estimator
D.
BITOUZÉ,
B.LAURENT,
P. MASSARTa L.M.P.A. Joseph Liouville, Universite du Littoral-Cote d’ Opale, Centre Universitaire de la Mi-Voix, 50 rue F. Buisson, B.P. 699, F-62228 Calais Cedex, France
b Universite Paris Sud. Bat. 425, F-91405 Orsay Cedex, France
Article received on 10 February 1998, revised 12 May 1999
Vol. 35, n° 6, 1999, p. 763. Probabilités et Statistiques
ABSTRACT. - We prove a new
exponential inequality
for theKaplan-
Meier estimator of a distribution function in a
right
censored data model.This
inequality
is of the sametype
as theDvoretzky-Kiefer-Wolfowitz inequality
for theempirical
distribution function in the non-censoredcase. Our
approach
is based on Duhamelequation
which allows to useempirical
processtheory.
@Elsevier,
ParisKey words: Kaplan-Meier estimator, censored data, exponential inequality, law of
iterated logarithm, empirical process
RESUME. - Nous prouvons une nouvelle
inegalite exponentielle
pour l’estimateur deKaplan-Meier
d’une fonction derepartition
dans unmodele de donnees censurees a droite. Cette
inegalite
est du memetype
que1’ inegalite
deDvoretzky-Kiefer-Wolfowitz
pour la fonction derepartition empirique
dans le cas non censure. Notreapproche
estbasee sur
l’équation
de Duhamelqui
nouspermet
d’utiliser la theorie des processusempiriques.
©Elsevier,
Paris1 E-mail: [email protected].
2 E-mail:
[email protected].
3 E-mail:
[email protected].
’Annales de l’Institut Henri Poincaré - Probabilités et Statistiques - 0246-0203
1. INTRODUCTION
Let
Zi, Z2 , ... , Zn,...
be a sequence ofindependent
random variableswith common distribution function F on the real line. The
properties
ofthe
empirical
distribution functionFn
based on thesample Z 1,
...,Zn
as an estimator of F have been
investigated
for along
time. Inparticular,
Donsker’stheorem,
see Donsker[7],
which ensures the weakconvergence of
F)
towardsB°
o F whereB°
is a Brownianbridge,
is an essential tool forstudying
theasymptotic
behaviour of several statistics(the Kolmogorov-Smimov
or Von Misesstatistics,
forexample).
As a matter offact,
this limit theorem may becompleted by
a
sharp nonasymptotic
bound which is due toDvoretzky, Kiefer,
and Wolfowitz(DKW) [ 10] :
Moreover,
Massart[ 19] proved
that C may be taken as 2 which makes the use of this bound relevant for smallsamples.
The estimator
Fn
may be viewed as a NonParametric Maximum Likelihood Estimator(NPMLE)
of F. In the randomcensorship model,
the NPMLE of F can be also
explicitly computed,
it is the celebratedKaplan-Meier
estimator introducedby Kaplan
and Meier[18].
Our aimin this paper is to prove an
exponential
bound for theKaplan-Meier
estimator which is of the same
type
as the DKWinequality.
Let us first describe the
right
censored data model. LetX 1, ... , X n , ...
and
Yi , ... , Yn,...
beindependent
sequences ofindependent
and iden-tically
distributednonnegative
random variables with distribution func- tions F and Grespectively.
One observesZ1, ... , Zn
such that for1~’~Z,=(W~)
whereIn survival
analysis,
theXi ’s represent lifetimes,
and theYi’s censoring
times. The
Kaplan-Meier
estimatorFn
of the distribution function F is definedby
where ri
is the rank of( Wi ,
1 -6i )
in the set{ ( W~ ,
1 -3 j ), j
E{ 1, ... , n } } .
Theordering
islexicographical,
this means that in the case of tiesAnnales de l’Institut Henri Poincaré - Probabilités et Statistiques
among the set
{ Wi ,
1~ ~ n },
death observations(8i = 1 )
come beforecensored observations
(8i
=0).
Note that in thecomplete
data modelFn
reduces to the
empirical
distribution function.Many
efforts have been made to extend known results on theempirical
distribution function to the
Kaplan-Meier
estimatorand
there is ahuge
literature on the
topic.
Donsker’s theorem(see
Donsker[7])
has beenextended in various directions. The weak convergence of the process
F)
to a Gaussian process was establishedby
Breslow andCrowley [5]
on a fixed interval]
- oo,~]
with w r, where r is definedby
(with
the convention that inf0 =-I-oo).
Gill[12] proved
the weakconvergence of a
conveniently weighted
version of theKaplan-Meier
process on the whole interval
]
- oo,r].
Concerning exponential bounds,
Dinwoodie[6]
has established alarge
deviationprinciple
for censored data which allows tostudy
theasymptotic
behaviour of’
when n is
large
and 8 is fixed.Unfortunately,
this limit theorem does notprovide
any information about the moderate deviationsF ~ ( ~
and
therefore,
a DKWtype inequality
cannot be deduced from such aresult.
Up
to ourknowledge,
the firstnonasymptotic exponential
bound forthe
Kaplan-Meier
estimator is due to Foldes andRejto [11]. They proved
that if 03C9 03C4:
where C and 17 are absolute constants.
The main result of this paper
improves
on thepreceding
bound ofFöldes and
Rejtö
and may be stated as follows:THEOREM 1. - Let
Fn
be theKaplan-Meier
estimatorof
the distrib- utionfunction
F. There exists an absolute constant C suchthat, for
anypositive h,
It is worth
noticing that,
for censoreddata,
theKaplan-Meier
estimatormay fail to be
uniformly
consistent on the wholeline,
thishappens
whenever = 1 and
F’(r)
1(see
Gill[13]).
Theweighting
factor 1 - G that we use allows to control the uniform deviation on
the whole line almost as well as in the
complete
data model. One canwonder whether our bound is
sharp
or not. Theasymptotic
behaviour of the left hand side ofinequality ( 1 )
is well known in the non censored case.Provided that F is
continuous,
it converges as n goes toinfinity
towardswhere
B°
is a Brownianbridge.
Since6(~) 2014 2e ~,
as À goes toinfinity,
it means that Massart’s upper bound2e-2À2
isoptimal
andthat,
in the censored case, we miss this bound
by
a factor 1.25e~~ . Therefore,
our bound has the
right exponential decay
withrespect
toÀ 2.
Of course, it would be desirable tocompute C,
butunfortunately,
ourtechniques
arenot
sharp enough
to do thatefficiently.
We shall in fact prove the
following slightly
betterinequality
than( 1 ):
from which one can
straightforwardly
derive a bounded Law of IteratedLogarithm (LIL):
Again
this almost sure upper. bound is known to beoptimal
in the noncensored case. This result seems to be new. Gu and Lai
[16] actually proved
a functional LIL which isapparently stronger,
but since this result does not hold on the whole line it does notimply (2).
To prove Theorem
1,
we follow the idea that Van der Laan hasdeveloped
in his Thesis[25].
He noticedthat,
for someindirectly
observed
models,
the NPMLEFn
of a distribution function F satisfiesan
identity
of thetype:
Annales de l’Institut Henri Poincaré - Probabilités et Statistiques
where vn
is a centered and normalizedempirical
measure and7~
denotes some influence function for the
(sub-probability)
distribution function I~ . He showed that~ asymptotics
forFn
can be derived from thisimplicit equation provided
that the class of influence functions has the Donskerproperty
when K and tvary.
Such an
identity
.indeed holdsfor
theKaplan-Meier
estimator and follows from Duhamelequation. Entropy
withbracketing computations
allows as Duhamel
equation
to derive Theorem 1 from anonasymptotic exponential
bound forempirical
processes which is ofindependent
interest.
The paper is
organized
as follows: in Section2,
we statenonasymptotic
and
asymptotic
bounds for theKaplan-Meier estimator,
Section 3 is devoted to ageneral exponential inequality
for theempirical
process. The mainproofs
arepostponed
to Section 4.2. EXPONENTIAL
INEQUALITY
AND ASYMPTOTIC BOUNDS FOR THE KAPLAN-MEIER ESTIMATOR2.1. An
exponential inequality
In this
section,
we consider theright
censored data model as defined in Section 1. TheKaplan-Meier
estimator of the distribution of interest F is denotedby Fn
and G denotes the distribution function of thecensoring
times... ,
We normalize the
Kaplan-Meier
process where G- is the left-continuous version of G. This allows us to formulate anexponential inequality
for the deviation of the supremum of the process( 1 - G - ) ( Fn - F) on
the .whole line. ’We shall in fact prove aslightly stronger
result than Theorem 1 which. has theadvantage
toreadily imply
a bounded LIL.
THEOREM 2. - The
following inequality
holdsfor
theKaplan-Meier
estimator
Fn.
For any ~, >0,
where C is an absolute constant.
Remark 1. - Since 1-
G fi
1-G-,
the same result holds if wereplace
G-
by
G.Vol. n° 6-1999.
One
gets
as acorollary
of Theorem 2:COROLLARY 1. - The sequence
of Kaplan-Meier
estimatorssatisfies
the bounded lawof
the iteratedlogarithm:
Remark 2. - In the
complete
datamodel, inequality (4)
issharp
sinceit is known that
This result is due to Smimov
(see
Shorack and Wellner[21]).
Remark 3. -
Strong consistency
results for theKaplan-Meier
estima-tor can be
immediately
derived from the bounded LIL. As a matter offact, provided
thatG"(r) 1,
weeasily get
thatIf
G (r)
= 1 andF - ( i )
=1,
thestrong consistency
can be derivedeasily
from thestrong consistency
in the caseG’"(r)
1 via somemonotonicity argument
aspointed
outby
Gill[ 13 ] .
We recover here thatthe uniform
consistency
of theKaplan-Meier
estimator holds on[0, r]
provided
that condition(5)
is fulfilled:’
This result is due to Stute and
Wang [23]
whoproved
itby using martingale
technics. Note that Stute andWang [23]
have furthermoreinvestigated the case
where condition(5)
does not hold.They proved
that in this case
Fn
is not consistent since it converges to a limit which is different from F. In thissituation,
one needs tomodify
theKaplan-Meier
estimator to
get
a consistent estimator(see
Gill[ 13]).
2.2. An
asymptotic upper
bound’ We now state an
asymptotic
upper bound which allows to understand howsharp
is our DKWtype inequality
for theKaplan-Meier
estimator.Annales de l’Institut Henri Poincaré - Probabilités et Statistiques
THEOREM 3. - Let
Fn
be theKaplan-Meier
estimatorof F,
and letB°
be a Brownianbridge.
For anypositive h,
Remark 4. -
Inequality (6)
ensuresthat, asymptotically,
the uniform deviation of theconveniently weighted Kaplan-Meier
process is stochas-tically
smaller than the standardKolmogorov-Smimov limiting
distribu-tion. We do not know whether the
inequality
holds or not for the
Kaplan-Meier
estimator.Indeed,
this is a naturalquestion
since on the one hand we know from(6)
that it holdsasymptotically
and on the other hand that it is valid for all n in thenon censored case
(see
Massart[19]). Inequality ( 1 )
does not of courseimply (7)
but can be considered as a firststep
towards such a result.Proof -
Theproof
of Theorem 3 follows from well knownarguments.
Let T =
it, G- (t) 1 },
let C be thecontinuous, nondecreasing
andnonnegative
function .and let
K(t) =
+C(t))
ifC(t)
+00 andK(t) =
1 ifC(~) ==
+00.Combining
the weak convergence result towards a Gaussian processgiven by
Gill[ 14],
p.173,
with therepresentation
of that process from a Brownianbridge given by
Gill[12],
we have that the processconverges towards ((1 - F)(1 - G)/(1 -
in
D (T ) . Therefore,
and
inequality (6)
follows from the fact that( 1 - G) ( 1 - F) / ( 1 -
K)
1. D2.3. Connection with
empirical
processes: van der Laan’sidentity
A connection between the
Kaplan-Meier
processFn -
F and theempirical
processPn -
P based on the variablesZi
=( Wi , 6i)
isgiven by
van der Laan’sidentity (van
der Laan[26]).
In ourparticular
case,this
identity
isequivalent
to the well-known Duhamelequation
for theunivariate
product integral (van
der Laan[26],
p.14,
and Gill and Johansen[ 15],
Theorem6).
The versionpresented
below is to be found in Gill[ 14],
p. 172. LetIK,x
be the influence functiondefined,
for any[0,1]
valued and
nondecreasing
functionI~ ,
and for any x such that G -(x ) 1, by
where
1~A
denotes the indicator function of any set A.Moreover, IK,x
== 0 ifG- (x)
= 1..THEOREM 4
(van
derLaan). -
For any x E JR .Remark 5. - In the definition of the influence
function,
it mayhappen
that some division
by
0 occurs. Wejust
refer to Gill[ 14],
p.128, where
it is shown that the convention
0/0 equals
1 isadequate.
Denoting by
lC the class of[0,1]
valued and nondecreasing
functionsonR,
andby
I the set of functions I ={IK,x,
K EJ’C,
x E we deducefrom
identity (8)
thatSo that we are in a
position
to derive Theorem 2 fromgeneral exponential
bounds for
empirical
processesinvolving
metricproperties
of Y. Thepurpose of the next section is
precisely
to establish such bounds.3.
EXPONENTIAL
BOUNDS FOR EMPIRICAL PROCESSES LetZl , ... , Zn
be nindependent identically
distributed random vari- ables with distribution P on some measurable space(Z, C).
LetPn
de-Annales de l’Institut Henri Poincaré - Probabilités et Statistiques
note the
empirical probability
measure based upon( Z 1, ... , Zn ) .
Forp E
[ 1, oo [,
we denoteby
the set of measurable real valued func-tions f
on Z suchthat
isintegrable
withrespect
to P. We setvn =
P).
This means thatwhenever
(P). Moreover,
forf
E we denoteby Varp( f)
the
quantity j f 2
dP -( f f dP)2.
We shall deal withL2-entropy
withbracketing
as defined below.DEFINITION 1. - Let p be some
nonnegative
measure. Given ~ >0,
a bracket withL2( )-diameter
£ isdefined from
apair of functions f,
g inwith
f
g and = £ asLet F C
Denoting by N(e)
the minimumcardanalaty of a covering of0 by a finite
setof brackets
with diameter notlarger
than £, theentropy with
bracketing of0
isdefined
as thefunction
E r+ln(N(e) V e).
The notion of
entropy
withbracketing
has been introducedby
Dud-ley [9]
and theimportance
ofL2-entropy
withbracketing
has beenpointed
outby
Ossiander[20].
It refines onL~-entropy
and isespecially
well suited for
studying
classes of functions withuniformly
bounded vari- ations forwhich,
of course,L~-entropy
is irrelevant. The control of theentropy
withbracketing
of the class of influence functions .~ introduced in Section2,
will beprecisely
derived from theentropy computation
ofthe class of functions with
uniformly
boundedvariations,
due to van deGeer
[24].
LEMMA 1
(van
deGeer). -
Let be somenonnegative
boundedmeasure on some interval U C R. Given some
positive
numberM,
theL2( )-entropy
withbracketing
Hof
the classof functions defined
on Uwith variation bounded
by
Msatisfies
theinequality,
for
any ~ >0,
where y is some absolute constant.We now
provide
somegeneral exponential
bounds forempirical
processes which are refinements of related bounds in
Birge
and Mas-sart
[2].
Ascompared
to theirProposition 3,
the mainnovelty
here isthat we
provide
aHoeffding type inequality
with anexplicit optimal
sub-Gaussian rate of
decay.
Theproof
uses theadaptive
truncation proce- dure introducedby
Bass[ 1 ]
forpartial
sum processes and Ossiander[20]
for
empirical
processes.Throughout
thesequel,
P* will denote the outerprobability
measure associated with P(we
recallthat, generally speaking,
supf~F|03BDn(f) | is
notnecessarily measurable).
THEOREM 5. - Let F be a class
of
measurablefunctions defined
onZ. We assume that
for
some constants m and MLet a = M - m.
Then, a2/4 for
anyf
in 0. Let H denote theL2(P)-entropy
withbracketing of0.
Assume thatH1/2
isintegrable
at0,
and let
for
anypositive
t. Let £0 E]0,
1[,
and a be somepositive
number such thata~ 2 a2 /4.
For anypositive
t andh,
we set= t + 1 -
1 + 2t
andThen, for
anypositive h,
where C is an absolute constant, and h can be taken as h1 or
h2.
We first deduce from Theorem 5 a
sharp Hoeffding type inequality taking
benefit of aspecial shape
for theentropy
function. Because of theentropy computation by
van de Geer[24],
thisinequality
willimply
Theorem 2.
COROLLARY 2. - Let .~ be a class
of
realfunctions defined
on ,~and
satisfying
condition(9). Suppose
theL2(P)-entropy
withbracketing
Annales de l ’lnstitut Henri Poincaré - Probabilités et Statistiques
of0,
denoted withH, satisfies H(£) ~ y/e for
some y inR+.
Thenfor
any
positive
real number À:where C is an absolute constant and a = M - m.
Since the
function 1/r
defined in Theorem 5 satisfies1/r~ (t) ~ t2 ,
one can also
straightforwardly get
from Theorem 5 a Bernsteintype
inequality. Although
we shall notpresent
hereapplications
of thisbound,
it should be noticed that it can be used to derive local
properties
of theKaplan-Meier
estimator in thespirit
of B itouze’s Thesis[4j .
COROLLARY 3. - Let 0 be a class
of
realfunctions defined
on Zsuch that condition
(9)
holds. Let H be theL2(P)-entropy
withbracketing of0.
We assume that isintegrable
at 0 and wedefine
Let () E
]0, 1 ].
Thefollowing inequality
holdsfor
anypositive
a such thatsupf~F VarP(f) 03C32 a2/4 and for
anypositive
À:where C is an absolute constant and
4. PROOFS 4.1. Proof of Theorem 2
Identity (8)
ensures thatVol. n° 6-1999.
where
and J( is the class of
[0,1]
valued and nondecreasing
functions on R.Therefore, inequality (3)
may be derived fromCorollary
2applied
withthe class I. In order to
apply Corollary 2,
we have toverify that,
forsome constants m and
M,
anyf
in I satisfiesm f C
M and we haveto evaluate the
IL2 (P) -entropy
withbracketing
of l.Since vn
iscentered,
we may add a constant to the functionf
withoutaffecting Vn (f).
Moreprecisely,
thefollowing equality
isequivalent
to
(8):
for any x inR,
where
may be written
We now consider the class of functions
J
={JK,x, K
EJC,
x ER) .
Thismodification will make easier the evaluations of the constants m and M.
Indeed, noticing
thaton ] -
oo,x ] ,
both(1 - K)(x)/(1 - K )
and( 1 - G - ) (x ) / ( 1 - G - ) belong
to[0,1] (since K
and G - arenondecreasing maps),
one canverify
that0 ~
1 for anyJK,x in J.
We now use the
following
lemma whoseproof
ispostponed
toAppendix
A. This result derives from Lemma 1 due to van de Geer[24].
LEMMA 2. - For any
probability
measureP,
theL2(P)-entropy
withbracketing of
the classJ satisfies
H(s) x
y/ £,
where y is an absoluteconstant.
Now the
proof
of Theorem 2 followsimmediately
fromCorollary
2. 0Annales de l’Institut Henri Poincaré - Probabilités et Statistiques
4.2. Proof of the LIL
(Corollary 1)
Let s be in
]0,1[
andMk
beG~) (Fk -
We deducefrom Theorem 2
that,
forÀ ~ C/(2s),
Let
~(n)
=lnlnn/2
and let 9 > 1. From the aboveinequality applied
to ~ 1 - and À =
03B803A6(nk),
we derive thatwhere nk
is theinteger part
of From Borel-Cantellilemma,
we deducethat a.s.,
M~
for klarge enough (k > k*).
Let now nbe
large enough such
that nk- forsome k >
k*. We haveso
since,
aseasily
-- 1 as n - +0oo, we deducethat for any 0~ >
1,
a. s. ()’ for nlarge enough.
This leads toThis concludes the
proof
ofinequality (4).
4.3. Proof of Theorem 5 We set
We shall prove
exponential inequality ( 10) by controlling
The
required
two sidedinequality
will followby considering
the class-0 instead of .~ and
multiplying
the one sidedprobability
boundby
2.4.3.1. Notations
We first introduce some notations. Let k be fixed in
N and sk
befor
any k
in N.We consider a
covering
of .~’by
a setNck
of brackets with diameter notlarger
than sk . We assume thatNck
hascardinality
notlarger
thaneH(Sk).
For any
f
in0,
we denoteby
a bracketof Nck
such thatf
and we denoteby Ok ( f )
the function For any function g EL~(P),
we shall denote thequantity P 1 /2 (g 2 )
=Let be
f U
AM,
where A denotes theminimum,
so that thefunction
f -
isnonpositive (recall that m f
M for anyf in F).
We consider some
decreasing
sequence ofpositive
real numbersand,
for anyf
in0,
we seti(f)
=min{k
EN,
>uk }
/B p where p has to be chosen in N.We denote
by
Id theidentity operator.
4.3.2.
Decomposition
We shall use the
following decomposition given
in Doukhan et al.[8], p. 412.
°
Let A be a real number. The
following inequality
derives from theprevious identity.
where, assuming
that~=1 ~ ~ A,
Annales de l ’lnstitut Henri Poincaré - Probabilités et Statistiques
The ~ ’s
will be chosen later on.4.3.3. Controls of
Pi
toP6
We shall often use the
following
twoarguments.
ARGUMENT 1. - For
any j in
N and any setA,
ARGUMENT 2
(Markov’s inequality). -
For anynonnegative
randomvariable Z and any
positive
real numberç,
Control of In order to control we shall use
essentially Hoeffding’s inequality
and aspecial
version of Bernstein’sinequality
due to
Birge
and Massart[3].
Theseinequalities provide
controlsin
probability
for thepartial sum Sn
ofindependent
and centered variables. Infact,
sincethey rely
on Chernoff’sbound,
it follows from classicalmartingale arguments (mainly
Doob’sinequality)
thatthey
remain valid for
Sk
instead ofSn.
Theresulting inequalities
arestated below
without proof (for
more details about the combination of Chernoff’s bound andmartingale arguments
see for instance Shorack and Wellner[21],
pp.444-445).
LEMMA 3. - Let
!7i,..., Vn be independent
random variables suchE(Ui )). Then, for
anypositive À,
LEMMA 4. - Let
Ul ,
...,Un
beindependent
random variables satis-fying for
somepositive
constants 03B4 and c and anym >
2 thefollowing
moment condition
Let
Sk
=(Ui - E ( Ui ) ), for i ~ k
n.Then, for
anypositive À,
where
with
~/r (t)
= t -f- 1 - 1 -~ 2t.Moreover,
In what
follows,
we shall use Lemma 4repeatedly
in the situation where! L~ ~& and E ( U 2 ) ~ 2 .
In that case,inequality ( 12)
holds with c =b / 3 .
We shall now control
]IDI-
Whenf
describes the set0,
there are at most functions of thetype 77o(/).
Since each of these functions satisfiesm C M,
it follows from Lemma 3 that for any~,1
> 0Moreover,
we note that _and
Annales de l ’lnstitut Henri Poincaré - Probabilités et Statistiques
Applying
Lemma4, with Ui
=77o(/)(Z,) - P(77o(/)),
we obtainwhere c =
a/3.
Thisimplies
thatwhere
h(À)
can be taken as2~,2/a2
=h 1 (~,)
or(À)
=h2(~.).
Control of
P2.
FromArgument 1,
weget
and,
since = 0implies Ao(/)
> uo, we obtainby Argument
2Setting À2
= we concludeControl of
]ID3.
FromArgument 1,
weget
and, by Cauchy-Schwarz inequality,
Setting À3
= we concluden°
Control of
?4.
FromArgument 1,
weget
and,
sincei ( f )
= kimplies Ok_ 1 ( f )
weget
Argument
2provides
us withWe deduce from the above
inequality that, setting À4
=fl
Control of From
Argument 1,
weget
Argument
2provides
us withWe deduce from the above
inequality that, setting À5
=Control of
IP°’6.
LetAnnales de l’Institut Henri Poincaré - Probabilités et Statistiques
We have in view to use Lemma 4 to control
vn ( Vk (. f ))
for each k andthen sum up the
resulting probability
bounds to handle?6.
We can boundfrom
above by 2uk-1
andP(Vk(f)2) by 9 sf . Indeed,
. since
i ( f ) ~ k implies A~-i(/) ~
.
by triangular inequality
and since 1 =Lemma 4 with c = and 8 =
3sk yields
for any~k
> 0We now control the
quantity
We recall that .When
f
varies in0,
there are at most > functionsVk ( f ) . Therefore, setting IHfk
=N(~y),
weget:
Hence,
for any~1, ... ,
1 >0,
n°
Note that H is a
nonincreasing
function soIt follows
that,
if we set~k
=2Hk
+h (~,),
we obtainMoreover,
1by definition,
soSetting K
= andwe
get
at last4.3.4. Conclusion
Collecting inequalities (11)
and(13)
to(14) gives, setting A
=6
It remains to bound from above A .
1= 1
Annales de l ’lnstitut Henri Poincaré - Probabilités et Statistiques
since 1 =
2Ek .
In order to minimize theprevious bound,
we will nowchoose the sequence
(Uk)kEN
suchthat, for k > 1,
which is
equivalent
to uk-i =(observe that,
from thedefinitions ~~ =
2-k~o and ~k
=2IHIk
+/~),
the sequence is indeeddecreasing). Replacing
uk-l, weget
thefollowing
upper boundWe recall that
flsp-i
= tends to 0 as p - +00. This allows to bound from above A -~.1,
for any £ >0, by
We will now control
By
definition of~k, IHIk
and ~k,Moreover,
Collecting
the above evaluations weobtain,
for any £ >0,
which
leads, recalling
that~,1
=À,
toIt follows from
inequality ( 15)
thatwhere = 0.25. Since
o~2 a2/4
and~(t) t2/2,
we havethat
/x(~) ~ ~.2/20~ 2.
This concludes theproof
of Theorem 5. 04.4. Proof of
Corollary
2 .Let
H (s)
be boundedby y / ~ . By
Theorem5,
where
C3
andC4
are absolute constants.Setting ~- = ~, ( 1
+ +~4~0,
it follows that forany § )
Using
that1 / ( 1 ~- t ) 2 > ( 1 - t ) 2
for any0 ~
weget
as soon as So
1/(2C3).
Setting
So =1/~.
there exists absoluteconstants ~ *
andCs
suchthat,
for
any ~ ~~,
Annales de l’Institut Henri Poincaré - Probabilités et Statistiques
Possibly enlarging Cs , previous inequality
holds forany ç
> 0.This concludes the
proof
ofCorollary
2. D4.5. Proof of
Corollary
3We claim that the function
h2
satisfies for anypositive
À andK,
theinequality h 2 (~, -r- I~ ) > ~W+~2(~). Indeed,
since =it is clear that +
y) ~ + ~ -1 (y ) . Therefore,
the reverseinequality
holds forh 2 .
LetSince
Hl~~(80~) ~,
weget K ) h]~ (H (9a)) .
We now
apply inequality (10)
to ~/ = À+ K,
for some8 E
]0,1].
Weget, possibly enlarging C,
Since
’~’ ~ ) ~ 2 +2 ~
weget
This concludes the
proof
ofCorollary
3. a5. CONCLUSION
To conclude we would like to mention several
questions
of interest thatderive from the results of this paper.
5.1.
Substituting Gn
for GFor
statistical~purposes,
it would be desirable toreplace
the factor1 - G
by
1 -Gn
in theexponential inequality ( 1 ).
We do not know whether such aninequality
holdsnonasymptotically.
All we can do isto
provide
anasymptotic
evaluation which isanalogous
to Theorem 3.Vol. n° 6-1999.
THEOREM 6. - Let
Fn
be theKaplan-Meier
estimatorof F, Gn
be theKaplan-Meier
estimatorof
the distributionfunction
Gof
thecensoring times,
and be thelargest
observation. LetB°
be a Brownianbridge.
Then, for
anypositive À,
Inequality (16) actually
allows to build confidence bands for F in thespirit
of Hall and Wellner[17]
and Gill[14]. Nevertheless,
the conservatism of these bands may make them useless underheavy censorship.
5.2. Proof of Theorem 6
’
Using inequality (6)
and thedecomposition
we have to show that
We first notice that the
problem
issymmetric
withrespect
to F and G.This allows to assume that .
We shall
distinguish
two cases: eitherG (r) 1,
orG (r)
= 1.In the first case, we write
for t x
It follows from Theorem 2 that the supremum on R of the process
We conclude with a result due to
Wang [27]
which ensures thatAnnales de l’Institut Henri Poincaré - Probabilités et Statistiques
If
G (r)
=1,
eitherF (r)
1 and we can argue as aboveby exchanging
F forG,
orF-(i)
= 1. If it is so, F and G are continuous atpoint
r, therefore =r)
=0,
andG(W(n))
1 a. s . Let a r.Since
F’(r) 1,
it follows from Remark 3 that supt03C3I CGn - G) (t) | ~
0 a. s.
Using
thisresult,
and similararguments
asabove,
we show thatNow,
It follows from Gill
[12],
Lemma 2.6 that(Gn - G) / ( 1 - G) [
is bounded in
probability. Moreover,
it follows from the convergence of the process,In(1 - G)(Fn - F)
towards((1 - F)(1 - G)/(l - K))
xin
D(T)
thatSince K is a
[0,1] valued,
continuous andnondecreasing function,
weshall consider the case of
K(t)
1 and the case of this limitequals
1. In the first case, since F and G are continuous atpoint
r andtend to
1,
weget
In the second case, we
get (B°(K))(t)
= 0 a.s.Moreover,
the processBO(K)
is continuous.Finally,
we obtain in both cases,This concludes the
proof
of Theorem 6.5.3.
Normalizing
factorOne could wonder whether the
normalizing
factor( 1 - G)
used inTheorem 1 is
adequate
or not. If one tries to answer thisquestion,
it isinteresting
first to notice that someweighting
isactually
necessary if one wants toget
a uniform result on the real line since otherwise onereally gets
into trouble due to the bad behaviour of theKaplan-Meier
estimatorin the tail
(see
Stute[22]).
In order to better understand what kind ofweighting
should beconsidered,
let usanalyse
theasymptotic
behaviourof the
Kaplan-Meier
processF).
Letand let = if
C(t)
+0oo and = 1 ifC(t)
= +0oo. Itfollows from Gill
[12]
that the process,In(1 - G)1/2(R - F)
conver-ges in
D(T)
towards{(1 - F)(1 - G)1/2/(1 -
where T ={t, G-(t) 1}.
One can notice that the variance of thelimiting
process isuniformly
bounded since it isequal
to( 1 - G ) ( 1 - F ) 2 C . Therefore,
aquestion
arises: is itpossible
toreplace
in theexponential inequality ( 1 )
the
normalizing
factor 1- Gby ( 1- G) 1 ~2.
Our method does not allow to do this since it is based on the use of the influence function of the processF).
The influence function has to be normalizedby ( 1 - G-) (or
1 -G),
in order to be stabilized on the whole interval. It is not clear at all whether another method could work since it couldhappen
that thesize of the supremum norm of the influence function indeed
plays
a rolein an
exponential
bound like ours.APPENDIX A A.l. Proof of Lemma 2
In order to
get
theL2(P)-entropy
withbracketing
of the classJ,
wewill use Lemma 1. We recall that
where may be written under the form
Annales de l’Institut Henri Poincaré - Probabilités et Statistiques
if we set
We denote
by
=1, 2, 3,
the classes of functions definedby
Since the functions
JK,x
are[0,1]
valued and monotone, it follows from Lemma 1 that there exists someconstant 1]
such thatfor j
=1, 2, 3,
the
L2(P)-entropy
withbracketing Hj
of the classJ(j)
satisfies~/6B By
definition of theentropy,
we can find sets of bracketsNj
withdiameter not
larger than ~ /3, covering
definedby
where N is bounded
by
For each function inj
=1, 2, 3,
we can find i E{1,2,... N } satisfying
Since the functions
Jx,x
are[0, 1 ]
valuedfunction,
we can choose also[0, 1 ]
valued functions to define the sets of bracketscovering
the classesy~j=i,2,3.
With these sets, we can build a set of brackets with diameter not
larger
than £
covering
the classJ. Indeed,
for each functionJK,x
inJ
thereexist i, i’, i"
inf 1, 2,..., N~
such thatIt is easy to
verify
that theJL2(P)-norm
of the function+ fL2,i’)fL3,i"-
+ bounded
by ~. Hence,
we found a set of brackets with diameter notlarger
than Ecovering y
withcardinality
boundedby N3.
Itfollows that the