Moment estimators of the extreme value index for randomly censored data in the Weibull domain of attraction

(1)

HAL Id: hal-01162967

https://hal.archives-ouvertes.fr/hal-01162967

Preprint submitted on 11 Jun 2015

HAL

is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.

L’archive ouverte pluridisciplinaire

HAL, est

destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.

Moment estimators of the extreme value index for randomly censored data in the Weibull domain of

attraction

Julien Worms, Rym Worms

To cite this version:

Julien Worms, Rym Worms. Moment estimators of the extreme value index for randomly censored

data in the Weibull domain of attraction. 2014. �hal-01162967�

(2)

Moment estimators of the extreme value index for randomly censored data in the Weibull domain of attraction

Julien Worms

¹

, Rym Worms

²

21 october 2014

Abstract

This paper addresses the problem of estimating the extreme value index in presence of random censoring for distributions in the Weibull domain of attraction. The methodologies introduced in [Worms (2014)], in the heavy-tailed case, are adapted here to the negative extreme value index framework, leading to the definition of weighted versions of the popular moments of relative excesses with arbitrary exponent . This leads to the definition of two families of estimators (with an adaptation of the so called Moment estimator as a particular case), for which the consistency is proved under a first order condition. Illustration of their performance, issued from an extensive simulation study, are provided.

Keywords : Extreme value index, Tail inference , Random censoring , Kaplan-Meier integration

AMS Classification : 62G32 (Extreme value statistics) , 62N02 (Estimation for cen- sored data)

1Universit´e de Versailles-Saint-Quentin-en-Yvelines, Laboratoire de Math´ematiques de Versailles (CNRS UMR 8100), F-78035 Versailles Cedex, France,[email protected]

2Université Paris-Est, Laboratoire d’Analyse et de Mathématiques Appliquées (CNRS UMR 8050), UPEMLV, UPEC, F-94010, Créteil, France, [email protected]

(3)

1 Introduction

Extreme value statistics is an active domain of research, with numerous fields of ap- plication, and which benefits from an important litterature in the context of i.i.d.

data, dependent data, and (more recently) multivariate or spatial data. By contrast, methodological articles in the case of randomly censored data are quite recent and few : [Einmahl et al. (2008)] presents a general method for adapting estimators of the extreme value index in a censorship framework (a methodology based on a previ- ous work [Beirlant et al. (2007)]), [Diop et al. (2014)] extends the framework to data with covariate information, and [Worms (2014)] proposes a more survival analysis- oriented approach restricted to the heavy tail case. Other existing works on the topic of extremes for censored data are [Brahimi et al. (2013)] and the review paper [Gomes and Neves (2011)].

In this paper, the topic of extreme value statistics for randomly censored data with negative extreme value index is addressed. Our initial purpose was to rely on the ideas of [Worms (2014)] in order to define a more ”natural” version (with respect to that proposed in [Einmahl et al. (2008)]) of the moment estimator in the context of censored observations. We finally came out to propose weighted versions of the popular moments of the relative excesses (with arbitrary exponent), and therefore define competitive estimators of the extreme value index in this censoring situation, for distributions in the Weibull maximum domain of attraction.

Let us first define more precisely the framework, the data, and the notations.

In the classical univariate framework of i.i.d. data, a central task is to estimate the extreme value index γ, which captures the main information about the behavior of the tail distribution of the data. More precisely, a distribution function (d.f.) F is said to be in the maximum domain of attraction of H

_γ

(noted F P D p H

_γ

q ) with

H

_γ

pxq :“

"

exp `

´p1 ` γxq

^´1{γ

˘

for γ ‰ 0 and 1 ` γx ą 0 expp´ expp´xqq for γ “ 0 and x P R ,

if there exist two normalizing sequences p a

_n

q Ă R

^`

and p b

_n

q Ă R such that F

ⁿ

pa

n

x ` b

n

q

^nÑ8

ÝÑ H

γ

pxq p@x P R q.

We consider in this paper two independent i.i.d. non-negative samples pX

_i

q

iďn

and pC

_i

q

iďn

with respective continuous distribution functions F and G (with end-points τ

_F

and τ

_G

, where τ

_F

: “ sup t x, F p x q ă 1 u ). In the context of randomly right-censored observations, one only observes, for 1 ď i ď n,

Z

i

“ X

i

^ C

i

and δ

i

“ I

^XiďCi

.

We denote by H the distribution function of the Z-sample, satisfying 1 ´ H “ p1 ´ F qp1 ´ Gq

and by Z

_1,n

ď ¨ ¨ ¨ ď Z

_n,n

the associated order statistics. In the whole paper,

δ

_1,n

, . . . , δ

_n,n

denote the δ’s corresponding to Z

_1,n

, . . . , Z

_n,n

, respectively. F and G

(4)

are assumed to belong to the maximum domains of attraction DpH

_γ_X

q and DpH

_γ_C

q respectively, where γ

_X

and γ

_C

are real numbers, which implies that H P D p H

_γ

q , for some γ P R .

Our goal is to estimate the extreme value index γ

_X

in this context of right cen- sorship. The most interesting cases, described in [Einmahl et al. (2008)], are the following :

case 1: γ

_X

ą 0 , γ

_C

ą 0 in this case γ “ γ

_X

γ

_C

γ

_X

` γ

_C

case 2: γ

_X

ă 0 , γ

_C

ă 0 , τ

_F

“ τ

_G

in this case γ “ γ

_X

γ

_C

γ

_X

` γ

_C

case 3: γ

_X

“ γ

_C

“ 0 , τ

_F

“ τ

_G

“ `8 in this case γ “ 0.

In [Worms (2014)], case 1 above was considered and an adaptation of the so-called Hill estimator to the right censoring framework was proposed. In this paper, our aim is to consider case 2 above and adapt the approach leading to the so-called Moment Estimator to this censored situation. An adaptation of this estimator was already proposed in [Einmahl et al. (2008)] : it consists in dividing the classical Moment Estimator ˆ γ

_n^Z

of γ (calculated from the Z-sample) by the proportion

p p :“ k

^´1_n

ř

kn

i“1

δ

_n´i`1,n

of uncensored data in the tail, where k

_n

is the number of upper order statistics retained. Note that ˆ γ

_n^Z

is an appropriate combination of the following moments

M

^pαq_n,k_n

:“ 1 k

_n

kn

ÿ

i“1

log

^α

ˆ Z

_n´i`1,n

Z

_n´k_n_,n

˙ ,

for α “ 1 or 2 (where log

^α

pxq stands for plogpxqq

^α

), and that p p estimates the ultimate proportion p of uncensored observations in the tail, which turns out to be equal to

p : “ γ

γ

_X

“ γ

_C

γ

_X

` γ

_C

.

Our goal is to show that relying on usual strategies in the survival analysis lit- erature leads to estimators of γ

_X

which are often sharper than those obtained by simply dividing an estimator of γ by the proportion of uncensored observations. By

“usual” strategy we mean using “Kaplan-Meier”-like random weights : we refer to [Worms (2014)] for more detailed informations concerning the origin of the two kinds of random weights appearing in the formulas below. As a matter of fact, we define, for any given α ě 1, the following two versions of randomly weighted moments of the log relative excesses :

M

_n,k^pαq

n

: “ 1

n p 1 ´ F ˆ

_n

p Z

_n´k_n_,n

qq

kn

ÿ

i“1

δ

_n´i`1,n

1 ´ G ˆ

_n

p Z

_n´i`1,n^´

q

ˆ log

^α

ˆ Z

_n´i`1,n

Z

_n´k_n_,n

˙ ˙

(1)

(5)

and

M Ă

_n,k^pαq

n

:“ 1

np1 ´ F ˆ

_n

pZ

_n´k_n_,n

qq

kn

ÿ

i“1

1 1 ´ G ˆ

_n

pZ

_n´i`1,n^´

q

ξ

_i,n

(2)

where

ξ

_i,n

: “ i ˆ

log

^α

ˆ Z

_n´i`1,n

Z

n´kn,n

˙

´ log

^α

ˆ Z

_n´i,n

Z

n´kn,n

˙˙

(3) and p k

_n

q is a sequence of integers satisfying, as n tends to `8 ,

k

_n

Ñ `8 and k

_n

“ opnq. (4)

Above, ˆ F

_n

and ˆ G

_n

naturally denote the Kaplan-Meier estimators of F and G, respec- tively, defined as follows : for t ă Z

_n,n

,

1 ´ F ˆ

_n

ptq “ ź

Zi,nďt

ˆ n ´ i n ´ i ` 1

˙

δi,n

and 1 ´ G ˆ

_n

ptq “ ź

Zi,nďt

ˆ n ´ i n ´ i ` 1

˙

1´δi,n

.

It should be noted that these 2 weighted versions of the moments of the log- excesses defined in (1) and (2) are in fact closely related : as a matter of fact, they differ only when the maximum observation Z

_n,n

is censored (when δ

_n,n

“ 1, we have indeed M

_n,k^pαq

n

“ M Ă

_n,k^pαq

n

, see Proposition 1 in Section 5). However, both versions deserve attention : firstly because in practice the last observation is often a censored one, and secondly because when they do differ, the difference is the only term involving the information contained in the maximum observation Z

_n,n

(this difference is therefore non-asymptotically not negligible, although it tends to 0 in probability, as stated in Proposition 1 in Section 5).

In section 2 below, assumptions are presented and discussed, convergence results for the weighted moments M

_n,k^pαq

n

and M Ă

_n,k^pαq

n

are stated, and we describe how classes of estimators of γ

_X

can be deduced by combining these moments for different values of α. In Section 3, performance of these estimators will be presented on the basis of simulations. Section 4 provides some words of conclusion, Section 5 is devoted to the proof of Theorem 1 below, and finally the Appendix includes standard (but central to our proofs) results on regularly varying functions, as well as the proofs of the different lemmas which were used in Section 5.

2 Results

2.1 Assumptions

In addition to (4), our results need the following minimal assumption :

(A) F P D p H

_γ_X

q , G P D p H

_γ_C

q with γ

_X

ă 0 , γ

_C

ă 0 and x

^˚

: “ τ

_F

“ τ

_G

. As noted earlier, this assumption implies that H P DpH

_γ

q with τ

_H

“ x

^˚

and

γ “ γ

_X

γ

_C

γ

_X

` γ

_C

ă 0.

(6)

If we note U ptq “ H

^Ð

p1´ 1{tq the quantile function associated to H, then x

^˚

“ U p8q and H P D p H

_γ

q is equivalent to the existence of some positive function a such that

tÑ`8

lim

log U ptxq ´ log U ptq

aptq{U ptq “ x

^γ

´ 1

γ , @x ą 0, (5)

which, since γ ă 0, is itself equivalent to

tÑ`8

lim

U p8q ´ U ptxq

U p8q ´ U ptq “ x

^γ

, @x ą 0. (6) This means that the function U p8q ´ U is regularly varying (at `8) with index γ (see the appendix for the definition of regular variation at `8 ). A reference for the equivalence of conditions (5) and (6) to (A) is [Haan and Ferreira (2006)] (respectively relation (3.5.4) and Corollary 1.2.10 there).

Finally, we will need some very mild additional assumption on pk

_n

q (K) there exists some δ ą 0, or some δ ě γ

X

´ γ

C

γ

_X

` γ

_C

if γ

_C

ě γ

_X

, such that

´ logpk

_n

{nq L

k

_n

“ Opn

^´δ

q. (7)

2.2 Asymptotic results

Let us introduce the notation a

_n,k

:“ apn{k

_n

q{U pn{k

_n

q (see the previous paragraph for the definition of functions U and a), where a

_n,k

Ñ 0 (cf equation (3.5.5) in [Haan and Ferreira (2006)]). In the paper, Betap¨, ¨q denotes the usual Beta function, Betapa, bq “ ş

1

0

t

^a´1

p1 ´ tq

^b´1

dt pa ą 0, b ą 0q.

Theorem 1 Under assumption (A) and conditions p4q and (K), for any given α ě 1, both

^M

pαq n,kn

pa_n,kq^α

and

^M^Ă

pαq n,kn

pa_n,kq^α

converge in probability, as n tends to 8, to

|γ

_X

|

^´1

|γ|

^´α

Betap|γ

_X

|

^´1

; α ` 1q.

The following corollary states the consistency of our two different adaptations of the Moment estimator to this censored framework.

Corollary 1 Under conditions of Theorem 1, as n Ñ 8,

p γ

n,M om

:“ M

_n,k^p1q_n

` 1 ´ 1 2

˜

1 ´ p M

_n,k^p1q

n

q

²

M

_n,k^p2q

n

¸

´1

ÝÑ

P

γ

X

and

r γ

_{n,M om}

: “ M Ă

_n,k^p1q

n

` 1 ´ 1 2

˜

1 ´ p M Ă

_n,k^p1q

n

q

²

M Ă

_n,k^p2q_n

¸

´1

ÝÑ

P

γ

_X

.

(7)

In fact, by using the elementary properties of the Beta function, the weighted moments M

n^pαq

or M Ă

n^pαq

can be combined in different ways, leading to the definition of two different classes of consistent estimators of γ

_X

, parametrized by α ě 1 (proofs of the 3 corollaries are easy and omitted). In the next section, we study their finite sample performance.

Corollary 2 Under conditions of Theorem 1, as n Ñ 8,

p γ

_n,1^pαq

: “ `

V

_n,α^´1

` α ` 1 ˘

´1 P

ÝÑ γ

_X

and

r γ

_n,1^pαq

:“

´

V r

_n,α^´1

` α ` 1

¯

´1

ÝÑ

P

γ

_X

where

V

_n,α

: “ 1 ´ α ` 2 α ` 1

pM

n^pα`1q

q

²

M

n^pαq

M

n^pα`2q

and V r

_n,α

: “ 1 ´ α ` 2 α ` 1

p M Ă

n^pα`1q

q

²

M Ă

n^pαq

M Ă

n^pα`2q

.

Corollary 3 Under conditions of Theorem 1, as n Ñ 8,

p γ

_n,2^pαq

: “ 1 ´ pα ` 1qR

_n,α

pα ` 1qp1 ´ R

n,α

q

ÝÑ

P

γ

_X

and

r γ

_n,2^pαq

: “ 1 ´ p α ` 1 q R r

_n,α

pα ` 1qp1 ´ R r

_n,α

q

ÝÑ

P

γ

_X

,

where

R

_n,α

:“ M

n^p1q

M

n^pαq

M

n^pα`1q

and R r

_n,α

:“ M Ă

n^p1q

M Ă

n^pαq

M Ă

n^pα`1q

.

Remark 1 It is straightforward to see that γ ˆ

_n,2^pαq

with α “ 1 equals 1 ´

¹₂

p 1 ´ R

_n,1

q

^´1

, which is very close to ˆ γ

_{n,M om}

, since M

_n,k^p1q_n

Ñ 0 in our finite endpoint framework.

Remark 2 If M

^pαq_n,k_n

denotes the unweighted moments defined in the introduction, it can be proved that under p A q and p 4 q , for α ě 1,

M

^pαq_n,k

n

pa

_n,k

q

^α

ÝÑ |γ|

P ^´α´1

Betap|γ|

^´1

; α ` 1q .

Therefore, it is easy to check that combining those moments as described in Corollaries

2 and 3 leads to consistent estimators of γ, and thus dividing the latter by p ˆ (defined

in the introduction) leads to 2 classes of consistent estimators q γ

_n,1^pαq

and q γ

_n,2^pαq

of γ

_X

. We

also define q γ

_{n,M om}

as the estimator of γ

_X

obtained by dividing the classical Moment

estimator of γ by the proportion p. A finite-sample comparison of those estimators ˆ

with our new competitors is presented in the following section.

(8)

Remark 3 Note that the combination of moments proposed in Corollaries 2 and 3 become inadequate in the framework of a positive extreme value index: it can be indeed proved that, in this framework, the combinations p γ

_n,j^pαq

and r γ

_n,j^pαq

, for j “ 1 or 2, converge in probability to zero, by proving that M

_n,k^pαq_n

and M Ă

_n,k^pαq_n

converge in probability to γ

_X^α

Γpα ` 1q (in the complete data case, this result is known for M

^pαq_n,k_n

, see [Segers (2001)]). This could suggest, in the positive index case, the definition of estimators of γ

_X

which would be equal to ˆ γ

_n,j^pαq

or ˜ γ

_n,j^pαq

(for j “ 1 or 2) plus a

”censored version” of the Hill estimator (in the same spirit as the definition of the Moment estimator, which equals the Hill estimator plus a term converging to 0 in the positive index case).

3 Finite sample behavior

The goal of this Section is to present our results concerning the finite sample per- formances of our new estimators of the extreme value index in presence of random censorship, presented in Corollaries 1, 2 and 3. In each case considered, 2000 random samples of size n “ 500 were generated, and the median bias and mean squared error (MSE) of the different estimators of γ were plotted against the number k

_n

of excesses used.

A great variety of situations can be (and has been) considered in our simulation study : various values of γ

_X

and γ

_C

(and therefore various censoring rates in the tail), various families of underlying distributions (Reverse Burr, generalized Pareto, Beta), and choice of the value of α. It is impossible to illustrate here the different possible combinations of these features : we will therefore try to draw some general conclusions from the many different situations we have observed, and provide a partial illustration with 3 particular cases.

Concerning the choice of the tuning parameter α, we did not find a value which seemed preferable in every situation : nonetheless, in general, for small values of k

_n

, a value of α around 1 or 2 yields better MSE, whereas for high values of k

_n

, the MSE is lower for values of α greater than 2. We decided not to include this preliminary study in this article, and chose (almost arbitrarily) the value α “ 2 in all our subsequent simulations.

Let us now settle the vocabulary used in this section. We will call Moment es- timators the estimators p γ

_{n,M om}

and r γ

_{n,M om}

appearing in Corollary 1, as well as the estimator q γ

_{n,M om}

introduced in Remark 2 above. We will call type 1 (resp. type 2) estimators the estimators p γ

_n,1^pαq

and r γ

_n,1^pαq

(resp. p γ

_n,2^pαq

and r γ

_n,2^pαq

) appearing in Corollary 2 (resp. 3), as well as the estimator q γ

_n,1^pαq

(resp. q γ

_n,2^pαq

) introduced in Remark 2.

We will also consider names for the different methods : the KM method (for Kaplan-Meier-like weights, appearing in the definition of M

_n,k^pαq

n

), leading to p γ es-

timators, the L method (for Leurgans-like weights) leading to r γ estimators (the

name comes from the mathematician Sue Leurgans who inspired the weights, see

[Worms (2014)] for details and a reference), and the EFG method (for constant

(9)

50 100 150 200 250

-0.5-0.3-0.10.1

Bias

50 100 150 200 250

0.00.20.40.6

MSE

Figure 1: Comparison between p γ

_n,1^p2q

(thick black), r γ

_n,1^p2q

(dashed black), q γ

_n,1^p2q

(thin black), p γ

_{n,M om}

(thick grey), r γ

_{n,M om}

(dashed grey) and q γ

_{n,M om}

(thin grey) for a RevBurr p 1, 1, 1, 10 q censored by a RevBurr p 10, 2 { 3, 1, 10 q (γ

_X

“ ´ 1 ą γ

_C

“ ´ 3 { 2, p=2/5, weak censoring)

weighting by ˆ p), leading to q γ estimators (the names comes from the initials of the authors of [Einmahl et al. (2008)]).

There are two main questions addressed in this empirical study : is one of the 3 methods preferable to the others (and in which conditions) and is there a better choice for the type of estimator (type 1 , type 2, or classical Moment estimator) ? Unsurprisingly, after our intensive simulation study, we may say that the answer is no for the 2 questions, if an overall superiority is looked for. However, we can make some partial comments concerning the choice of the method and of the estimator type, whether the censoring is strong or weak, or the value of | γ | is small or not.

Note first that, if the censoring rate 1 ´ p in the tail is very low (say lower than 10%), we observed that there was not much difference between the 3 methods (KM, L, EFG), and that it was just a question of choosing between type 1, type 2, and moment estimator. This is why, in the following, we only consider cases where the censoring rate 1 ´p “

_γ ^γ^X

X`γ_C

is larger than 1/4, and talk about strong censoring in the tail when this rate is greater than 1{2 (i.e. γ

X

ď γ

C

), and weak censoring otherwise (when γ

_X

ą γ

_C

).

For “high” values of γ

_X

, i.e. lower than ´ 1 { 2, we have most of the time observed better performance of the KM and L methods with respect to the EFG method, in strong or weak censoring frameworks. In this context, the type 1 estimators are generally preferable to the type 2 estimators, and comparable or preferable to the moment estimator.

For values of γ

_X

between ´ 1 { 2 and 0 (sometimes called the “regular” case, and

which is the most frequently encountered in practice), there exists a great variety

(10)

50 100 150 200 250

-0.30-0.150.00

Bias

50 100 150 200 250

0.000.020.04

MSE

Figure 2: Comparison between p γ

_n,2^p2q

(thick black), r γ

_n,2^p2q

(dashed black), q γ

_n,2^p2q

(thin black), p γ

_{n,M om}

(thick grey), r γ

_{n,M om}

(dashed grey) and q γ

_{n,M om}

(thin grey) for a RevBurr p 1, 8, 1 { 2, 10 q censored by a RevBurr p 10, 4, 1 { 2, 10 q (γ

_X

“ ´ 1 { 4 ą γ

_C

“

´1{2, p=1/3, weak censoring)

of situations. We observed that the moment estimators were generally better than the type 2 estimators, which were themselves generally better than the type 1 ones.

Concerning the choice of the method, for the moment estimator, it seems difficult to suggest a particular one, between the KM, L, and EFG methods (even though in many cases, at least one among the KM and L methods was better than the EFG method). Concerning the inferiority of types 1 and 2 versus the moment estimator, it should be noted that it is mainly due to the bias, which contributes the most to the MSE (in fact, we clearly noticed that the variances of the types 1 and 2, for α “ 2, are almost always lower than the variance of the moment estimator).

The 3 particular situations we chose as illustrations of the comments above involve the Reverse Burr class of distributions RevBurrpβ, τ, λ, x

^˚

q (with β, τ, λ ą 0) : its survival function is

P pX ą xq “ p1 ` β

^´1

px

^˚

´ xq

^´τ

q

^´λ

, and its extreme value index is ´1{pλτ q.

In Figure 1, the value of γ

_X

is lower than ´1{2, and therefore, as motivated above,

for readability purposes we only kept the type 1 estimators on the graph, whereas for

the other two figures, the value of γ

_X

is between ´1{2 and 0 and we therefore only

kept the type 2 estimator illustrated. Remind here that these 3 examples are only 3

particular cases of the numerous combinations of features we have considered in our

simulation study.

(11)

50 100 150 200 250

-0.5-0.4-0.3-0.2-0.10.00.1

Bias

50 100 150 200 250

0.000.050.100.150.200.250.300.35

MSE

Figure 3: Comparison between p γ

_n,2^p2q

(thick black), r γ

_n,2^p2q

(dashed black), q γ

_n,2^p2q

(thin black), p γ

_{n,M om}

(thick grey), r γ

_{n,M om}

(dashed grey) and q γ

_{n,M om}

(thin grey) for a RevBurr p 10, 8, 1 { 2, 10 q censored by a RevBurr p 10, 5, 1, 10 q (γ

_X

“ ´ 1 { 4 ă γ

_C

“ ´ 1 { 5, p=5/9, strong censoring)

4 Conclusion

In this paper, we applied the methodology introduced in [Worms (2014)] to define

weighted versions of the moments of relative excesses, and consequently construct new

estimators of the extreme value index for randomly-censored data with distributions

in the Weibull domain of attraction. We proposed, in particular, a new adaptation

of the famous Moment estimator. Our intensive simulation study shows that the

proposed estimators are competitive even if, in many cases, the bias would need to

be reduced. A future possible work would be to exploit our weighting methodology

in order to estimate other parameters of the tail (for reducing the bias, for example)

as well as extreme quantiles. The asymptotic normality remains a question to be

addressed (difficulties come from the control of the Kaplan-Meier estimates in the

tail).

(12)

5 Proof of Theorem 1

Before proceeding to the proof of Theorem 1, we state the following Proposition which explains the link between our two proposals of weighted moments.

Proposition 1 piq For any α ě 1, M Ă

_n,k^pαq

n

“ M

_n,k^pαq

n

` p1 ´ δ

_n,n

qD

_n,k^pαq

n

, where

D

^pαq_n,k

n

“ 1

np1 ´ F ˆ

_n

pZ

_n´k_n_,n

qqp1 ´ G ˆ

_n

pZ

_n,n^´

qq log

^α

ˆ Z

_n,n

Z

_n´k_n_,n

˙ .

p ii q Under the same assumptions as Theorem 1, for any α ě 1, we have D

_n,k^pαq

n

“ o

_P

pa

^α_n,k

q.

According to this proposition, the validity of Theorem 1 for M

_n,k^pαq

n

is a consequence of Theorem 1 for M Ă

_n,k^pαq

n

. Proposition 1 will be proved at the end of the appendix.

We now need to state the following technical Lemmas, which will be proved in the appendix. The notation ξ

_i,n

was introduced in (3) and we note

Z ˜

i,n

:“ x

^˚

´ Z

_n´i`1,n

x

^˚

´ Z

_n´k_n_,n

.

Lemma 1 Let α ě 1 and u

i

“ u

i,n

“

_k_nⁱ_`1

. Under assumptions (A) and (4):

p i q if 0 ă a ă 1 then 1 k

_n

kn

ÿ

i“1

u

^´a_i

ξ

_i,n

a

^α_n,k

ÝÑ p1

P

´ aq|γ|

^´α´1

Beta

ˆ 1 ´ a

| γ | ; α ` 1

˙ ,

p ii q if a ą 1, then for any δ

¹

ą 0, 1 k

^a`δ_n ¹

kn

ÿ

i“1

u

^´a_i

ξ

_i,n

a

^α_n,k

ÝÑ

P

0. Lemma 2 For any given positive exponents θ and θ

¹

ą 0, there exist constants c ą 1, c

¹

ă 1 both arbitrarily close to 1, and a

_`

ą 0, a

_´

ą 0, arbitrarily close to γθ and to γθ

¹

respectively, such that

nÑ8

lim P

˜ max

iďkn

Z ˜

_i,n^θ

cu

^´a_i ^`

ą 1

¸

“ lim

nÑ8

P

˜ min

iďkn

Z ˜

_i,n^θ¹

c

¹

u

^´a_i ^´

ă 1

¸

“ 0.

We now proceed to the proof of Theorem 1 (for M Ă

_n,k^pαq

n

), which has structural simi- larities with the proof of Theorem 2 in [Worms (2014)]. We shall refer to the latter when necessary. We have the decomposition

M Ă

_n,k^pαq

n

“ A

_n

pW

_n

` R

_n

q ,

(13)

where

A

_n

: “

^1´_1´GpZ^G^pⁿ^pZ^n´kn,n^q

n´kn,nq

, W

_n

: “

_k¹_n

ř

kn

i“1

W

_i,n

, R

_n

:“

_k¹

n

ř

kn

i“1

pC

_i,n

´ 1qW

_i,n

, C

_i,n

:“

^1´GpZ^n´i`1,n^q

1´GpnpZ^´_n´i`1,nq

, and

W

_i,n

:“ 1 ´ GpZ

_n´k_n_,n

q 1 ´ G p Z

_n´i`1,n

q ξ

_i,n

.

Since A

_n

ÝÑ

^P

1 as n Ñ 8 (see Theorem 2 in [Cs¨ org˝ o (1996)]), we need to prove that R

_n

“ o

_P

p a

^α_n,k

q and

W

_n

a

^α_n,k

ÝÑ

P

l

_α

, (8)

where l

_α

denote the limit in the statement of Theorem 1.

5.1 Proof of W

_n

{ a

^α_n,k

ÝÑ

^P

l

_α

Since G P DpH

γ_C

q, with γ

C

ă 0, is equivalent to t Ñ 1 ´ Gpx

^˚

´ tq being regularly varying at 0 with index ´ 1 { γ

_C

(see the appendix for the definition of regular variation at 0), the bounds p12q in Corollary 4 (in the appendix) applied to f “ G, x “ px

^˚

´ Z

n´i`1,n

q{px

^˚

´ Z

n´kn,n

q and t “ x

^˚

´ Z

n´kn,n

, yield, for ą 0, n sufficiently large and every 1 ď i ď k

_n

,

p1 ´ q ξ

_i,n

Z ˜

^γ

´1

C `

i,n

ď W

_i,n

ď p1 ` q ξ

_i,n

Z ˜

^γ

´1 C ´

i,n

. (9)

Let η ą 0. We first write, for sufficiently small, Pp W

_n

{ a

^α_n,k

´ l

_α

ą η q ď Pp k

^´1_n

ř

kn

i“1

Z ˜

^γ

´1

C ´

i,n ξin

a^α_n,k

´ l

_α

ą

^η₂

q , P p l

_α

´ W

_n

{a

^α_n,k

ą η q ď P p l

_α

´ k

_n^´1

ř

kn

i“1

Z ˜

^γ

´1 C ` i,n

ξin

a^α_n,k

ą

^η₂

q.

Let us now consider constants c ą 1 and c

¹

ă 1, both arbitrary close to 1, and a

_`

ą 0 and a

_´

ą 0 both arbitrary close to γ{γ

C

. These constants come from the application of Lemma 2 above with θ “ γ

_C^´1

´ and θ

¹

“ γ

_C^´1

` . Using positivity of ξ

_in

, it comes

P

˜ W

_n

a

^α_n,k

´ l

_α

ą η

¸

ď P

¨

˝max

iďkn

Z ˜

^γ

´1 C ´ i,n

cu

^´a_i ^`

ą 1

˛

‚

` P

˜ c k

_n^´1

kn

ÿ

i“1

u

^´a_i ^`

ξ

_in

a

^α_n,k

´ l

_α

ą η 2

¸

P

˜

l

_α

´ W

_n

a

^α_n,k

ą η

¸

ď P

¨

˝min

iďkn

Z ˜

^γ

´1 C ` i,n

c

¹

u

^´a_i ^´

ă 1

˛

‚

` P

˜

l

_α

´ c

¹

k

_n^´1

kn

ÿ

i“1

u

^´a_i ^´

ξ

_in

a

^α_n,k

ą η

2 ¸

(14)

where u

_i

“ u

_i,n

“ i{pk

_n

` 1q for 1 ď i ď k

_n

. If we call l

_α,a

the limit in the statement of Lemma 1 p i q , and if we apply Lemma 2 as indicated previously, we have

lim sup

nÑ8

P

´

Wn

a^α_n,k

´ l

_α

ą η

¯

ď lim sup

nÑ8

P

´

k

^´1_n

ř

kn

i“1

u

^´a_i ^`_a^ξαⁱⁿ

n,k

´ l

_α,a_`

ą

¹_c

p

^η₂

` l

_α

q ´ l

_α,a_`

¯

lim sup

nÑ8

P

´

l

_α

´

_a^Wαⁿ n,k

ą η

¯

ď lim sup

nÑ8

P

´

l

_α,a_´

´ k

_n^´1

ř

kn

i“1

u

^´a_i ^´_a^ξαⁱⁿ

n,k

ą

_c¹1

p

^η₂

´ l

_α

q ` l

_α,a_´

¯

Since l

α

“ l

_α,γ{γ_C

, and both a

_`

and a

_´

are arbitrary close to γ{γ

C

ă 1, it is easy to see that p8q comes from the application of Lemma 1 piq to a “ a

_`

and a “ a

_´

.

5.2 Proof of R

n

“ o

_P

pa

^α_n,k

q

Let us use the same decomposition as in the proof of the negligibility of the term R

_n

in [Worms (2014)] (see subsection 5.1.2 there). In other words, we define, for some δ

¹

ą 0,

Cptq ˜ :“

ż

t 0

dG p x q

p1 ´ Gpxqq

²

p1 ´ F pxqq and h

i,n

:“ p CpZ ˜

n´i`1,n

qq

^´¹²^´δ¹

, and we readily have |R

_n

| ď T

_n¹

T

_n²

, where

T

_n¹

: “ sup

1ďiďkn

? n | h

_i,n

p C

_i,n

´ 1 q| and T

_n²

: “ 1 k

_n

kn

ÿ

i“1

W

_i,n

h

^´1_i,n

n

^´¹²

.

Using sharp results of the survival analysis literature, we have already proved in [Worms (2014)] that T

_n¹

“ O

_P

p 1 q . It remains to prove that

T

_n²

{a

^α_n,k

“ o

_P

p1q.

First, from the definition of h

in

and ˜ C, since p1 ´ Hq “ p1 ´ F qp1 ´ Gq we clearly have

h

^´1_in

ă

ˆ ´ logp1 ´ GpZ

n´i`1,n

qq 1 ´ HpZ

_n´i`1,n

q

˙

¹₂`δ¹

.

Moreover, under assumption (A), 1 ´ Hpx

^˚

´ ¨q is regularly varying at zero with index

´1{γ and ´ logp1 ´Gpx

^˚

´ ¨qq is slowly varying at 0 : therefore, the application of p9q, as well as bound p 11 q to ´ log p 1 ´ G p x

^˚

´ ¨qq and bound (12) to f “ G and f “ H, implies that , for n sufficiently larger, T

_n²

ď 4P

_n

Q

_n

where

P

_n

:“ n

^´¹²

ˆ ´ logp1 ´ GpZ

_n´k_n_,n

qq 1 ´ HpZ

_n´k_n_,n

q

˙

¹

2`δ¹

Q

_n

:“ 1 k

_n

kn

ÿ

i“1

ξ

_in

Z ˜

_i,n^β

,

where β “ p2γq

^´1

` γ

_C^´1

´

¹

, for some

¹

ą 0.

(15)

We thus need to prove that P

_n

Q

_n

{a

^α_n,k

“ o

_P

p1q.

Let η ą 0 and consider constants c ą 1 arbitrarily close to 1 and a

_`

ą 0 arbitrarily close to γβ “

¹₂

`

_γ^γ

C

´ γ

¹

ą

¹₂

`

_γ^γ

C

. We have, P

˜ P

_n

Q

_n

a

^α_n,k

ą η

¸ ď P

˜ max

iďkn

Z ˜

_i,n^β

cu

^´a_i ^`

ą 1

¸

` P

˜

P

_n

k

^´1_n

kn

ÿ

i“1

u

^´a_i ^`

ξ

_in

a

^α_n,k

ą η

c

¸

. (10) First, Lemma 2 is applied with θ “ β and thus the first term of the right-hand side of p 10 q tends to 0. Next, Lemma 1 is applied with a “ a

_`

: we thus need to distinguish the case γ

_C

ă γ

_X

(for which γβ ă 1) from the case γ

_C

ě γ

_X

(for which γβ ą 1 when

¹

ą 0 gets small).

piq Case γ

_C

ă γ

_X

First of all, assumption (K) implies that P

n

“ o

_P

p1q (see relation p20q in [Worms (2014)]). Since a

_`

ă 1 in this case, Lemma 1 p ii q implies that

_k¹

n

ř

kn

i“1

u

^´a_i ^`_a^ξαⁱⁿ n,k

“ O

_P

p 1 q and consequently the second term of the right hand-side of (10) tends to

0. p ii q Case γ

_C

ě γ

_X

In this case a

_`

ą 1, therefore Lemma 1 p ii q implies that, for any given δ

¹

ą 0, k

n^´pa^`^`δ¹^q

ř

kn

i“1

u

^´a_i ^`_a^ξαⁱⁿ

n,k

“ o

_P

p1q. Moreover, assumption (K) implies that, for δ

¹

ą 0 small enough and a

_`

sufficiently close to 1{2`γ{γ

_C

, we have k

_n^a^`^`δ¹^´1

P

_n

“ O

_P

p1q. Hence, the second term of the right hand-side of (10) also tends to 0. ˛

6 Appendix

6.1 Regular variation and Potter-type bounds

Definition 1 An ultimately positive function f : R

^`

Ñ R is regularly varying (at infinity) with index α P R , if

tÑ`8

lim f ptxq

f ptq “ x

^α

p@ x ą 0 q .

This is noted f P RV

_α

. If α “ 0, f is said to be slowly varying.

Remark 4 Regular variation (and slow variation) can be defined at zero as well.

A function f is said to be regularly varying at zero with index α if the function x Ñ f p1{xq is regularly varying at infinity, with index ´α.

Proposition 2 (See [Haan and Ferreira (2006)] Proposition B.1.9)

Suppose f P RV

_α

. If x ą 0 and δ

₁

, δ

₂

ą 0 are given, then there exists t

₀

“ t

₀

p δ

₁

, δ

₂

q such that for any t ě t

₀

satisfying tx ě t

₀

, we have

p1 ´ δ

₁

qx

^α

minpx

^δ²

, x

^´δ²

q ă fptxq

f p t q ă p1 ` δ

₁

qx

^α

maxpx

^δ²

, x

^´δ²

q.

(16)

If x ă 1 and ą 0, then there exists t

₀

“ t

₀

pq such that for every t ě t

₀

, p 1 ´ q x

^α`

ă fptxq

fptq ă p 1 ` q x

^α´

and if x ě 1 ,

p 1 ´ q x

^α´

ă f p tx q

f ptq ă p 1 ` q x

^α`

.

Corollary 4 If f is a positive function with end-point x

^˚

, such that t Ñ 1 ´ fpx

^˚

´ tq is regularly varying at 0 with index α, i.e.

1 ´ fpx

^˚

´ txq

1 ´ fpx

^˚

´ tq Ñ x

^α

, as t Ñ 0,

for some α P R , then for every ą 0, there exists t

₀

ą 0 such that, @ 0 ă t ă t

₀

,

@0 ă x ă 1,

p1 ´ qx

^α`

ď 1 ´ f px

^˚

´ txq

1 ´ f px

^˚

´ tq ď p1 ` qx

^α´

(11) and

p 1 ´ q x

^´α`

ď 1 ´ f px

^˚

´ tq

1 ´ f px

^˚

´ txq ď p 1 ` q x

^´α´

(12) Below, U corresponds to the quantile function associated to H introduced in para- graph 2.1.

Corollary 5 If U satisfies condition p6q, then for every ą 0, there exists t

₀

ą 0 such that, @0 ă t ă t

₀

, @x ě 1,

p 1 ´ q x

^γ´

ď U p8q ´ U ptxq

U p8q ´ U ptq ď p 1 ` q x

^γ`

. (13) Proposition 3 (see [Haan and Ferreira (2006)] Theorem B.2.18)

If U satisfies condition p5q with the positive function a, then there exists a function q

₀

equivalent to a{U at infinity such that @ ą 0, Dt

₀

ą 0, @t ě t

₀

, @x ě 1,

x

^γ

´ 1

γ ´ x

^γ`

ď log U ptxq ´ log U ptq

q

0

ptq ď x

^γ

´ 1

γ ` x

^γ`

. (14)

We now proceed to the proofs of the different lemmas stated previously and finally

of Proposition 1 .

(17)

6.2 Proof of Lemma 1

Let c

_n

:“ pq

₀

pn{k

_n

q{a

_n,k

q

^α

(which tends to 1 as n Ñ 8) and pY

_i

q be a sequence of i.i.d standard Pareto random variables. Let

LL

_i,k

:“ logpU pY

n´i`1,n

qq ´ logpU pY

n´kn,n

qq

q

₀

pY

_n´k_n_,n

q and QQ

_i,k

:“

´

Yn´i`1,n

Y_n´kn,n

¯

γ

´ 1

γ .

For every 1 ď i ď k

_n

, we thus have c

^´1_n

ξ

_i,n

a

^α_n,k

“

d

i ppLL

_i,k

q

^α

´ pLL

_i`1,k

q

^α

q

“ α

_i,n

` β

_i,n

, where

α

_i,n

:“ i ppQQ

_i,k

q

^α

´ pQQ

_i`1,k

q

^α

q β

_i,n

:“ ipB

_k,n

piq ´ B

_k,n

pi ` 1qq B

_k,n

p i q : “ p LL

_i,k

q

^α

´ p QQ

_i,k

q

^α

, with B

_k,n

pk

_n

` 1q “ 0.

Our first step will be to prove that (with δ

¹

ą 0) 1

k

_n

kn

ÿ

i“1

u

^´a_i

β

i,n

or 1 k

_n^a`δ¹

kn

ÿ

i“1

u

^´a_i

β

i,n

is o

_P

p1q whether 0 ă a ă 1 or a ą 1.

Using bounds p14q for some

¹

ą 0, with t “ Y

_n´k_n_,n

and x “ Y

_n´i`1,n

{Y

_n´k_n_,n

ą 1, and relying on the mean value theorem, we easily prove that, since α ě 1 and γ ă 0,

|pLL

_i,k

q

^α

´ pQQ

_i,k

q

^α

| ď c

¹

ˆ Y

_n´i`1,n

Y

_n´k_n_,n

˙

γ`¹

ď c

¹

(15)

for some constant c (close to α|γ|

^1´α

). Therefore |B

_k,n

piq| “ o

_P

p1q, uniformly on 1 ď i ď k

_n

. Since |B

_k,n

piq| “

ˇ ˇ ˇ

ř

kn

j“i βj,n

j

ˇ ˇ

ˇ , we thus have, when 0 ă a ă 1, ˇ

ˇ ˇ

1 kn`1

ř

kn

i“1

u

^´a_i

β

i,n

ˇ ˇ ˇ “

ˇ ˇ ˇ

ř

kn

i“1 βi,n

i

u

^1´a_i

ds ˇ ˇ ˇ

“ p1 ´ aq ˇ ˇ ˇ

ř

kn

i“1 βi,n

i

ş

ui

0

s

^´a

ds ˇ ˇ ˇ ď p1 ´ aq ř

kn

i“1

ˇ ˇ ˇ

ř

kn

j“i βj,n

j

ˇ ˇ ˇ

ş

ui

ui´1

s

^´a

ds

“ o

_P

p1q ř

kn

i“1

pu

^1´a_i

´ u

^1´a_i´1

q ď o

_P

p1q

_k¹

n

ř

kn

i“1

u

^´a_i

“ o

_P