• Aucun résultat trouvé

A HILL TYPE ESTIMATOR OF THE WEIBULL TAIL-COEFFICIENT

N/A
N/A
Protected

Academic year: 2022

Partager "A HILL TYPE ESTIMATOR OF THE WEIBULL TAIL-COEFFICIENT"

Copied!
26
0
0

Texte intégral

(1)

HAL Id: hal-00724602

https://hal.inria.fr/hal-00724602

Submitted on 21 Aug 2012

HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de

A HILL TYPE ESTIMATOR OF THE WEIBULL TAIL-COEFFICIENT

Stéphane Girard

To cite this version:

Stéphane Girard. A HILL TYPE ESTIMATOR OF THE WEIBULL TAIL-COEFFICIENT. Com- munications in Statistics - Theory and Methods, Taylor & Francis, 2004, 33 (2), pp.205-234. �hal- 00724602�

(2)

A HILL TYPE ESTIMATOR OF THE WEIBULL TAIL-COEFFICIENT

St´ephane Girard

SMS/LMC, Universit´e Grenoble 1, BP 53, 38041 Grenoble cedex 9, France [email protected]

Key Words: Weibull tail coefficient; extreme value index; order statistics.

ABSTRACT

We present a new estimator of the Weibull tail-coefficient. The Weibull tail-coefficient is defined as the regular variation coefficient of the inverse cumulative hazard function. Our estimator is based on the log-spacings of the upper order statistics. Therefore, it is very similar to the Hill estimator for the extreme value index. We prove the weak consistency and the asymptotic normality of our estimator. Its asymptotic as well as its finite sample performances are compared to classical ones.

(3)

1. INTRODUCTION

Let X1, X2, . . . , Xn be a sequence of independent and identically distributed random variables with cumulative distribution function F. We address the problem of estimating the Weibull tail-coefficient θ >0 defined when the distribution tail satisfies

(A.1) : 1−F(x) = exp(−H(x)), x≥x0 ≥0, with H(t) = inf{x, H(x)≥t}=tθℓ(t), where ℓ is a slowly varying function i.e.

ℓ(λx)/ℓ(x)→1 as x→ ∞ for all λ >0.

The inverse cumulative hazard function H is said to be regularly varying at infinity with index θ and this property is denoted byH ∈ Rθ.

Whenℓ is a constant function, this problems reduces to estimating the shape parameter of a Weibull distribution. In this context, simple and efficient methods exist, see for instance [1]

for a moment based estimator. Distributions with non-constant slowly varying functions include for instance normal, gamma and extended Weibull distributions (see Section 3 for their definitions). Such distributions are of great use to model large claims in non-life insur- ance [2]. Dedicated estimation methods have been proposed since the relevant information on the Weibull tail-coefficient is only contained in the extreme upper part of the sample. A first direction was investigated in [3] where an estimator based on the record values is proposed.

Another approach consists of using the kn upper order statistics Xnkn+1,n ≤ . . . ≤ Xn,n

where (kn) is a sequence of integers such that 1 ≤ kn < n. Our estimator belongs to this family. It is defined by

θˆn=

kn1

X

i=1

(log(Xni+1,n)−log(Xnkn+1,n))

,kn1

X

i=1

(log2(n/i)−log2(n/kn)), (1) where log2(t) = log(log(t)),t >1. Some related estimators [4, 5, 6] will be presented below and a precise comparison will be done from both theoretical and practical points of view.

Formally, (1) is similar to the expression of the Hill estimator [7]:

ˆ

γn= 1 kn−1

kn1

X

i=1

(log(Xni+1,n)−log(Xnkn+1,n)) (2)

(4)

of the extreme value index γ for Pareto type distributions, i.e. such that (1−F)(λx)/(1−F)(x)→λ1/γ asx→ ∞ for all λ >0.

We refer to [8] for a review on this topic. Although estimators (1) and (2) do not address the same problem, it will appear that they share similar properties. In Section 2 we state the main asymptotic properties of our estimator. These results are illustrated in Section 3 on some examples of distributions and are compared in Section 4 with the results obtained by other estimators. The behavior of our estimator on finite sample situations is presented in Section 5. Finally, Section 6 is dedicated to the proofs.

2. ASYMPTOTIC RESULTS

In this section, we state our main results on the limiting behavior of the estimator ˆθn. Its weak consistency is established in Theorem 1 and its asymptotic normality in Theorem 2.

The proofs are postponed to Section 6. The weak consistency of ˆθnis proved under the usual conditions on the (kn) sequence (which are also necessary for the weak consistency of the Hill estimator, see [9], Theorem 2):

Theorem 1 Suppose (A.1) holds. If kn→ ∞ and kn/n→0 then θˆn P

→θ.

The study of the limit distribution of ˆθn requires a second order condition onℓ: There exist ρ≤0 and b(x)→0 such that uniformly locally onλ≥1 when x→ ∞,

(A.2) : log

µℓ(λx) ℓ(x)

∼b(x)Kρ(λ), with Kρ(λ) = Rλ

1 uρ1du. It can be shown [10] that necessarily |b| ∈ Rρ. The second order parameter ρ ≤0 tunes the rate of convergence of ℓ(λx)/ℓ(x) to 1. The closer ρ is to 0, the slower is the convergence. Condition (A.2) is the cornerstone in all proofs of asymptotic normality for extreme value estimators. It is used in [11] to prove the asymptotic normality of the Hill estimator and in [12] for one of its refinements.

(5)

Theorem 2 Suppose (A.1) and (A.2) hold. Then

k1/2n (ˆθn−θ)→ Nd (0, θ2), for any sequence (kn) satisfying

kn → ∞, kn1/2b(log(n/kn))→0and kn1/2/log(n/kn)→0.

Note that condition kn1/2/log(n/kn)→ 0 implies condition kn/n → 0. If ρ < −1, condition kn1/2/log(n/kn)→0 implies kn1/2b(log(n/kn))→0 and therefore all the corresponding distri- butions require the same conditions on the sequence (kn). Conversely, if −1 < ρ ≤ 0, the conditionk1/2n b(log(n/kn))→0 is the stronger one and the convergence is slower. If ρ=−1, the conditionkn1/2b(log(n/kn))→0 may reveal necessary depending on the functionb. Some examples are now provided.

3. EXAMPLES

In this section, we give some examples of distributions satisfying assumptions (A.1)and (A.2).

• Gaussian distribution N(µ, σ2), σ > 0. From [13], Table 3.4.4, we have H(x) = x1/2ℓ(x) and an asymptotic expansion of the slowly varying function is given by:

ℓ(x) = 21/2σ− σ 23/2

logx

x +O(1/x).

Thereforeθ = 1/2, ρ=−1 and b(x) = log(x)/(4x).

• Gamma distribution Γ(β, α), α, β > 0. We use the following parameterization of the density

f(x) = βα

Γ(α)xα1exp (−βx).

From [13], Table 3.4.4, we obtain H(x) =xℓ(x) with ℓ(x) = 1

β + α−1 β

logx

x +O(1/x).

We thus haveθ = 1, ρ=−1 andb(x) = (1−α) log(x)/x.

(6)

• Weibull distribution W(α, λ), α, λ > 0. The inverse cumulative hazard function is H(x) = λx1/α, and then ℓ(x) = λ for all x > 0. Therefore b(x) = 0 and we use the usual convention ρ = −∞. A simplified version of Theorem 2 dedicated to this particular case is given in Corollary 1 below.

• Extended Weibull distribution EW(α, β), α ∈ (0,1), β ∈ R. This distribution is introduced in [6]. Its distribution tail is given by:

1−F(x) =r(x) exp(−xα), (3)

where r∈ Rβ. It follows thatH(x) =x1/αℓ(x) and the following asymptotic expan- sion holds:

ℓ(x) = 1 + β α2

logx

x +O(1/x).

It is easily seen that θ = 1/α, ρ = −1 and b(x) = −βlog(x)/(α2x). In [6], it is remarked that this model encompasses the normal and gamma distributions. Then, an estimator of α dedicated to the model (3) is proposed. Its properties are briefly reviewed and compared to those of ˆθn in the next section. In particular, we give in Corollary 2 a specific version of Theorem 2 for this family of distributions.

• Modified Weibull distributionMW(α),α >0. LetY ∼ W(α,1),α >0, and introduce X = Y logY. Then, the inverse cumulative hazard function of X can be written H(x) = x1/αℓ(x) with ℓ(x) = αlog(x). Therefore, we have θ = 1/α, ρ = 0 and b(x) = 1/log(x). Let us note that this distribution does not belong to the Extended Weibull family.

The parameters θ and ρ as well as the auxiliary function b(x) associated to these distribu- tions are summarized in Table 1. We compare theoretically in the next section our estimator to other ones proposed in the literature on some of these examples.

(7)

4. DISCUSSION

Our estimator is now compared to other proposals by studying their asymptotic normality properties obtained in some simple situations.

4.1 Comparison with Broniatowski estimator.

In [5], another estimator of the Weibull tail coefficient is proposed. It is defined by θ˜n = 1

kn kn1

X

i=1

log(Xni+1,n)

log2(n/i) . (4)

Both estimators ˆθn and ˜θn are based on a similar principle: Introduce q(t), the quantile function defined by

q(t) = (1−F)(t) = H(log(1/t)) = (log(1/t))θℓ(log(1/t)), under (A.1). The estimator ˜θn is motivated by the approximation

log(q(t))

log2(1/t) =θ+log(ℓ(log(1/t)))

log2(1/t) ≃θ. (5)

The estimator ˆθn, as for itself, is motivated by the approximation log(q(t))−log(q(s)) = θ(log2(1/t)−log2(1/s)) + log

µℓ(log(1/t)) ℓ(log(1/s))

≃ θ(log2(1/t)−log2(1/s)), (6)

sinceℓis slowly varying. This approximation is similar to the one used in the Hill estimator [7]

for estimating the extreme value index. It is important to note that, when ℓ is a constant function, the approximation (6) is exact but not the approximation (5). This remark has an important consequence on the asymptotic behavior of the estimator. From Theorem 2, we easily obtain that, whenℓ is a constant function, the asymptotic normality of (ˆθn−θ) holds under weaker assumptions on kn:

Corollary 1 Suppose (A.1) holds and ℓ is a constant function. Then,

k1/2n (ˆθn−θ)→ Nd (0, θ2).

for any sequence (kn) satisfying

kn→ ∞and kn1/2/log(n/kn)→0. (7)

(8)

Obtaining a similar result for (˜θn−θ) is not straightforward. In particular, Theorem 2 in [5]

requires to verify the condition

k1/2n log(n/kn) log2(n/knn →0, (8) where βn is a bias term defined by

βn= n kn

Z kn/n

0

du

log2(1/u). (9)

It can be shown (see Lemma 2) that there exist no sequence (kn) verifying condition (8), and thus, it is not possible to make the bias term vanish. This comparison is completed on Section 5 by some simulations.

4.2 Comparison with Beirlant et al. estimator.

Beirlant et al.[4] propose to use the mean residual life functione(x) = E(X−x|X > x) to estimate the parameter θ. They remark that, under assumption(A.1), one have

(logx)e(H(logx))

H(logx) ∼θ (10)

when x→ ∞. They thus introduce the following estimator θˇn = log(n/kn)

Xnkn+1,n

1 kn−1

kn1

X

i=1

{Xni+1,n−Xnkn+1,n}.

A central limit theorem for ˇθn is also established (see [4], Theorem 3.2(i)), with the same hypothesis and conclusions as Theorem 2. Consequently, ˆθn and ˇθn share the same asymp- totic properties. Nevertheless, we can again remark that the approximation (10) is not exact whenℓ is a constant function. The performance of ˆθn and ˇθnon finite sample situations will be compared in Section 5.

4.3 Estimation within the Extended Weibull family.

In [6], an asymptotic maximum likelihood estimator (AMLE) for the parameter α of the EW(α, β) distribution is proposed. This estimator, which will be denoted by ˆαn, is defined by an implicit equation that we do not reproduce here. The authors proved that ([6], Theorem 4.5)

kn1/2log2(n/kn) µ

ˆ

αn1−α1− log(ˆαn/α) log2(n/kn)

d

→ N(0,1), (11)

(9)

under conditions

kn→ ∞, kn/n →0 andk1/2n log2(n/kn)

log(n/kn) →0. (12)

As a comparison, the application of Theorem 2 to the particular case of EW(α, β) distribu- tions lead to the following

Corollary 2 Suppose F is a EW(α, β) distribution. Then,

k1/2n (ˆθn−θ)→ Nd (0, θ2), (13) for any sequence (kn) satisfying (7).

We can note that, for a fixed value ofkn, the asymptotic convergence rate in (11) is slightly better than in (13). At the opposite, condition (12) is more severe than condition (7) and then the optimal convergence rates are the same.

5. NUMERICAL EXPERIMENTS

The finite sample performance of our estimator is investigated on 7 different distributions:

Γ(0.5,1), Γ(1.5,1), N(1.2,1), W(2.5,2.5), W(0.4,0.4), MW(2.5), and MW(0.4). In each case, N = 100 samples (Xn,i)i=1,...,N of size n= 500 were simulated. On each sample (Xn,i), the estimate ˆθn,i(kn) is computed for kn = 2, . . . ,120. Finally, the Hill-type plots are built by drawing the points

à kn, 1

N

N

X

i=1

θˆn,i(kn)

!

as well as the associated empirical 90% confidence interval. We also present the associated MSE (mean square error) plots obtained by plotting the points

à kn,

N

X

i=1

³θˆn,i(kn)−θ´2! .

The same work is achieved with the estimators ˜θn and ˇθn for comparison. The estimator ˆ

αn is not implemented here since it does not address the same problem as ˆθn, ˜θn and ˇθn. For instance, it is not suited for the modified Weibull distribution. Nevertheless, the reader

(10)

interested in its behavior on simulated and real data can refer to [14].

Results are presented on Figures 2–8. It appears that, in all the simulated cases, ˆθn gives better results than ˇθn. It is also seen that ˆθn has a smaller bias and a larger variance than θ˜n. For this reason, ˆθn should be preferred to ˜θn even if the MSE associated to ˆθn is larger than ˜θn’s one. Indeed, in many cases, the true value θ belongs to the empirical confidence interval associated to ˆθn and is outside the interval associated to ˜θn.

Let us note that the sign of the bias of ˆθn and ˇθn is given by the functionb. It is positive for the normal, modified Weibull and gamma distributions (with shape parameter less than one) and negative for the gamma distribution with shape parameter more than one. The bias term vanishes for the Weibull distribution. The worst results are obtained for the modified Weibull distribution. In this situation ρ = 0 and thus the bias term which is driven by b decreases very slowly.

In practice, the choice of the parameter kn is the key problem to obtain a correct estima- tion of θ. If kn is too small, the variance of ˆθn is huge and conversely, if kn is too large, the bias is very important. A first solution to select the optimal kn can be based on the drawing of a quantile plot (QQ-plot) adapted to our situation. It consists of drawing the pairs (log2(n/i),log (Xni+1,n)) for i = 1, . . . , n−1. From (6), the resulting graph should be approximatively linear (with slope θ), at least for the large values ofi. On Figure 1 two cases are presented: The QQ-plot associated to a Weibull distribution which is perfectly lin- ear, and the QQ-plot associated to a normal distribution which is only ultimately linear. A graphical method to select kn could consist of choosing the point where the graph becomes linear. Of course, this method is not very satisfying and an automatic selection method should be developed. A first step could be to adapt the procedure proposed in [15] for the Hill estimator to our framework.

(11)

6. PROOFS

Consider {E1,n, . . . , En,n}the order statistics generated bynindependent standard expo- nential random variables. We collect in the next lemma classical results on the asymptotic behavior of these order statistics.

Lemma 1 Suppose kn → ∞ and kn/n→0. Then,

(i) Eni+1,n/log(n/i)→P 1, uniformly on i= 1, . . . , kn, and (ii) k1/2n (Enkn+1,n−log(n/kn))→ Nd (0,1).

Proof : Both results are proved using the representation of the Exp(1) ordered statistics (see [16], p. 72):

{Ej,n}j=1,...,n

=d

( j X

r=1

Er

n−r+ 1 )

j=1,...,n

,

where {E1, . . . , En}are independent standard exponential variables.

(i) Introducing for i= 1, . . . , kn the random variables Zn,i= Eni+1,n

log (n/i)−1,

straightforward calculations lead to the uniform bounds 0 ≤ E(Zn,i) ≤ 1/log (n/kn) and var(Zn,i)≤2/log2(n/kn).

(ii) This result is also found in [4], p. 32. The Lyapounov theorem for triangular array yields k1/2n

Ã

Enkn+1,n

n

X

s=kn

1 s

!

→ Nd (0,1).

The remaining term is controlled with the well-known equality

n

X

s=1

1

s =γE+ logn+O(1/n), where γE is the Euler’s constant.

(12)

The next lemma is used to prove that there exist no sequence (kn) such that kn → ∞, kn/n→0 and k1/2n log(n/kn) log2(n/knn →0, withβn given in (9).

Lemma 2 If kn→ ∞ and kn/n→0, then, for n large enough, βnlog2(n/kn)>1/2.

Proof : Introducing t =kn/(nu) and µn = log(n/kn) in (9) yields βn=

Z +

1

dt

t2log(log(t) +µn).

Since µn → ∞, forn large enough log(log(t) +µn)>0 and therefore βn>

Z eµn

1

dt

t2log(log(t) +µn) > 1 log(2µn)

Z eµn

1

dt

t2 ∼ 1 log2(n/kn).

Thus βnlog2(n/kn) is larger than a sequence tending to 1. As a conclusion, for n large enough,βnlog2(n/kn)>1/2.

Proof of Theorem 1: For the sake of simplicity, in this proof, we note k for kn. We also write ˆθnn(2)n(1) with

τn(1) = 1 k−1

k1

X

i=1

(log2(n/i)−log2(n/k)) = 1 k−1

k1

X

i=1

log µ

1 + log(k/i) log(n/k)

¶ .

The well-known inequality −x2/2≤log(1 +x)−x≤0,x >0 yields

−1 2

1 log(n/k)

1 k−1

k1

X

i=1

log2(k/i)≤log(n/k)τn(1)− 1 k−1

k1

X

i=1

log(k/i)≤0. (14) Now, since when k→ ∞,

1 k−1

k1

X

i=1

log2(k/i)→ Z 1

0

log2(x)dx= 2 and 1 k−1

k1

X

i=1

log(k/i)→ − Z 1

0

log(x)dx= 1, it follows that

τn(1) = 1

log (n/k)+O

µ 1 log2(n/k)

. (15)

Let us consider now

τn(2) = 1 k−1

k1

X

i=1

(log(Xni+1,n)−log(Xnkn+1,n)).

(13)

Let E1,n, . . . , En,n be ordered statistics generated by n independent standard exponential random variables. Under (A.1), we have

τn(2) =d 1 k−1

k1

X

i=1

(logH(Eni+1,n)−logH(Enk+1,n))

=d θ 1 k−1

k1

X

i=1

log

µEni+1,n

Enk+1,n

+ 1

k−1

k1

X

i=1

log

µℓ(Eni+1,n) ℓ(Enk+1,n)

=: τn(3)θ+τn(4).

In order to rewriteτn(3) into a simpler form, we refer to [17], Lemma 1.4.3, which implies that {Eni+1,n}i=1,...,k1 = {Enk+1,n+ (Eni+1,n−Enk+1,n)}i=1,...,k1

=d {Enk+1,n+Eki,k1}i=1,...,k1 (16) where{E1,k1, . . . , Ek1,k1}are ordered statistics independent fromEnk+1,n and generated byk−1 independent standard exponential variables{E1, . . . , Ek1}. Therefore,

τn(3) =d 1 k−1

k1

X

i=1

log µ

1 + Eki,k1 Enk+1,n

d

= 1

k−1

k1

X

i=1

log µ

1 + Ei Enk+1,n

¶ ,

and, similarly to (14),

−1 2

1 Enk+1,n

1 k−1

k1

X

i=1

Ei2 ≤τn(3)Enk+1,n− 1 k−1

k1

X

i=1

Ei ≤0.

Now, since when k→ ∞, 1 k−1

k1

X

i=1

Ei2 −→a.s. 2 and 1 k−1

k1

X

i=1

Ei

−→a.s. 1,

by the strong law of large numbers and Lemma 1(i), it yields τn(3) =d 1

Enk+1,n

1 k−1

k1

X

i=1

Ei+OP

µ 1 log2(n/k)

, (17)

and, in view of (15), τn(3)

τn(1)

=d log(n/k) Enk+1,n

1 k−1

k1

X

i=1

Ei+OP

µ 1 log(n/k)

P

→1, (18)

(14)

with Lemma 1(i) and the strong law of large numbers.

Finally, τn(4) can be rewritten using the Karamata representation of ℓ (see e.g. [18]):

ℓ(x) =c(x) exp µZ x

x0

δ(u) u du

, x≥x0,

where c(x)→c0 >0 and δ(x)→0 as x→ ∞. This entails τn(4) = 1

k−1

k1

X

i=1

log

µc(Eni+1,n) c(Enk+1,n)

+ 1

k−1

k1

X

i=1

Z En−i+1,n

En−k+1,n

δ(u)

u du=:τn(5)n(6), and the end of the proof is similar to [9], p. 757: τn(5) converges to 0 in probability in view of Lemma 1(i). Besides, τn(6) can be controlled by introducing the random variable

ξn= sup{|δ(u)|, u≥Enk+1,n}, and remarking that

n(6)| ≤ξn

1 k−1

k1

X

i=1

Z Eni+1,n

En−k+1,n

1

udu=ξnτn(3) =oPn(3)),

with Lemma 1(i). From (15) and (18), we obtain thatτn(4)n(1) converges to 0 in probability and the conclusion follows with (18): ˆθnn(2)n(1) P

→θ.

Proof of Theorem 2: Keeping in mind the notations introduced in the previous proof, we have

k1/2(ˆθn−θ) = θk1/2 Ãτn(3)

τn(1)

−1

!

+k1/2τn(4)

τn(1)

.

In view of (15), (17) and Lemma 1(i), k1/2

Ãτn(3)

τn(1)

−1

!

=d

"

k1/2 Ã 1

k−1

k1

X

i=1

Ei−1

!

−k1/2

µEnk+1,n

log(n/k) −1

¶#

log(n/k) Enk+1,n

+ OP

µ k1/2 log(n/k)

=d

"

k1/2 Ã 1

k−1

k1

X

i=1

Ei−1

!

−k1/2

µEnk+1,n

log(n/k) −1

¶#

(1 +oP(1))

(15)

+ OP

µ k1/2 log(n/k)

=d k1/2 Ã 1

k−1

k1

X

i=1

Ei−1

!

(1 +oP(1)) +OP

µ k1/2 log(n/k)

¶ ,

with Lemma 1(ii). The central limit theorem together with with the restrictions inposed on k yield

θk1/2 Ãτn(3)

τn(1)

−1

!

→ Nd (0, θ2).

Define xn =Enk+1,n and λi,n = Eni+1,n/Enk+1,n. It is clear, in view of Lemma 1(i) that xn

→ ∞P and λi,n

P 1. Thus, (A.2)yields that uniformly in i= 1, . . . , k−1:

log

µℓ(Eni+1,n) ℓ(Enk+1,n)

∼b(Enk+1,n)Kρ

µEni+1,n

Enk+1,n

(1 +oP(1)).

Using the representation (16), we get

τn(4) = (1 +d oP(1))b(Enk+1,n) 1 k−1

k1

X

i=1

Kρ

µ

1 + Eki,k1

Enk+1,n

= (1 +d oP(1))b(Enk+1,n) 1 k−1

k1

X

i=1

Kρ

µ

1 + Ei

Enk+1,n

= (1 +d oP(1))b(Enk+1,n) Enk+1,n

1 k−1

k1

X

i=1

Ei,

similarly to (17), since (ρ− 1)x2/2 ≤ Kρ(1 +x)− x ≤ 0 when x ≥ 0. Now, (15) and Lemma 1(i) entail

k1/2τn(4)

τn(1)

= (1 +d oP(1))k1/2b(Enk+1,n) 1 k−1

k1

X

i=1

Ei d

= (1 +oP(1))k1/2b(Enk+1,n), by the strong law of large numbers. Finally, b(Enk+1,n)/b(log(n/k)) converges to 1 in probability, since Enk+1,n/log(n/k) converges to 1 in probability and |b| ∈ Rρ. As a conclusion, and again due to the restrictions on k,

k1/2τn(4)

τn(1)

= (1 +d oP(1))k1/2b(log(n/k)) =oP(1), and the result is proved.

(16)

References

[1] Cran, G.W., (1988), Moment estimators for the 3-parameter Weibull distribution,IEEE Transactions on Reliability, 37(4), 360–363.

[2] Beirlant, J., Teugels, J.L., (1992), Modeling large claims in non-life insurance.Insurance:

Math. Econom., 11, 17–29.

[3] Berred, M., (1991), Record values and the estimation of the Weibull tail-coefficient.

Comptes-Rendus de l’Acad´emie des Sciencest. 312, S´erie I, 943–946.

[4] Beirlant J., Broniatowski, M., Teugels, J.L., Vynckier, P., (1995), The mean residual life function at great age: Applications to tail estimation, Journal of Statistical Planning and Inference, 45, 21–48.

[5] Broniatowski, M., (1993), On the estimation of the Weibull tail coefficient, Journal of Statistical Planning and Inference,35, 349–366.

[6] Kl¨uppelberg, C., Villase˜nor, J.A., (1993), Estimation of distribution tails - a semi- parametric approach, Deutschen Gesellschaft f¨ur Versicherungsmathematik, XXI(2), 213–235.

[7] Hill, B.M., (1975), A simple general approach to inference about the tail of a distribu- tion, The Annals of Statistics, 3, 1163–1174.

[8] Cs¨org¨o, S., Viharos, L., (1998), Estimating the tail index,Asymptotic methods in prob- ability and statistics, North-Holland, Amsterdam, 833–881.

[9] Mason, D.M., (1982), Laws of large numbers for sums of extreme values, Annals of Probability, 10, 756–764.

[10] Geluk, J.L., de Haan, L., (1987), Regular Variation, Extensions and Tauberian Theo- rems.Math Centre Tracts,40, Centre for Mathematics and Computer Science, Amster- dam.

(17)

[11] H¨ausler, E., Teugels, J.L., (1985), On asymptotic normality of Hill’s estimator for the exponent of regular variation, The Annals of Statistics, 13, 743–756.

[12] Beirlant, J., Dierckx, G., Goegebeur, Y., Matthys, G., (1999), Tail index estimation and an exponential regression model, Extremes, 2, 177–200.

[13] Embrechts, P., Kl¨uppelberg, C., Mikosch, T., (1997), Modelling extremal events, Springer.

[14] Keller, B., Kl¨uppelberg, C., (1991), Statistical estimation of large claims distribu- tions. Mitteilungen der Schweizerischen Vereinigung der Versicherungsmathematiker, 203–216.

[15] Guillou, A., Hall, P., (2001), A diagnostic for selecting the threshold in extreme-value analysis, J. Roy. Statist. Soc. Ser. B, 63(2), 293–305.

[16] Arnold, B.C., Balakrishnan, N., Nagaraja H.N., (1992), A First course in order statis- tics, Wiley and sons.

[17] Reiss, R.-D., (1989),Approximate distribution of order statistics, Springer-Verlag.

[18] Bingham, N.H., Goldie, C.M., Teugels, J.L., (1987),Regular variation, Encyclopedia of Mathematics and its Applications, 27, Cambridge University Press.

(18)

θ b(x) ρ

N(µ, σ2) 1/2 1 4

logx

x −1

Γ(β, α6= 1) 1 (1−α)logx

x −1

W(α, λ) 1/α 0 −∞

EW(α, β 6= 0) 1/α −β α2

logx

x −1

MW(α) 1/α 1

logx 0

Table 1: Parameters θ,ρ and the function b(x) associated to some usual distributions

(19)

−1.30 −0.98 −0.66 −0.34 −0.02 0.30 0.62 0.94 1.26 1.58 1.90

−1.20

−0.92

−0.64

−0.36

−0.08 0.20 0.48 0.76 1.04 1.32 1.60

−1.30 −0.98 −0.66 −0.34 −0.02 0.30 0.62 0.94 1.26 1.58 1.90

−1.20

−0.92

−0.64

−0.36

−0.08 0.20 0.48 0.76 1.04 1.32 1.60

Figure 1: Two examples of QQ-plots, log(Xni+1,n) is plotted versus log2(n/i). Solid line:

W(2,1) distribution, dashed line: N(1,1) distribution

(20)

0 12 24 36 48 60 72 84 96 108 120 0.0

0.5 1.0 1.5 2.0 2.5 3.0 3.5 4.0 4.5 5.0

. . . .

(a) Estimate ˆθn as a function of kn.

0 12 24 36 48 60 72 84 96 108 120

0.0 0.5 1.0 1.5 2.0 2.5 3.0 3.5 4.0 4.5 5.0

. . . .

(b) Estimate ˜θn as a function of kn.

0 12 24 36 48 60 72 84 96 108 120

0.0 0.5 1.0 1.5 2.0 2.5 3.0 3.5 4.0 4.5 5.0

. . . .

(c) Estimate ˇθn as a function of kn.

0 12 24 36 48 60 72 84 96 108 120

1.0 2.4 3.8 5.2 6.6 8.0 9.4 10.8 12.2 13.6 15.0

.

.

. . .

.. .. . . . ...

. . . .

. . . .. . . .. . . .. . . .

(d) Mean square error as a function ofkn. Figure 2: Comparison of estimates ˆθn, ˜θnand ˇθnfor the Γ(0.5,1) distribution. (a)-(c) Dotted line: true value, solid line: mean over 100 samples, dashed lines: empirical 90% confidence interval. (d) Dotted line: ˆθn, solid line: ˜θn, dashed line: ˇθn.

(21)

0 12 24 36 48 60 72 84 96 108 120 0.0

0.3 0.6 0.9 1.2 1.5 1.8 2.1 2.4 2.7 3.0

. . . .

(a) Estimate ˆθn as a function of kn.

0 12 24 36 48 60 72 84 96 108 120

0.0 0.3 0.6 0.9 1.2 1.5 1.8 2.1 2.4 2.7 3.0

. . . .

(b) Estimate ˜θn as a function of kn.

0 12 24 36 48 60 72 84 96 108 120

0.0 0.3 0.6 0.9 1.2 1.5 1.8 2.1 2.4 2.7 3.0

. . . .

(c) Estimate ˇθn as a function of kn.

0 12 24 36 48 60 72 84 96 108 120

1.0 1.8 2.6 3.4 4.2 5.0 5.8 6.6 7.4 8.2 9.0

.

.

.

.

..

. . .

. . .

. ..

. . . .. . . .. . . .. . . .. . . .. . . .. . . .. . . .. . . .. . . .. . . .. . . .. . . .. . . .

(d) Mean square error as a function ofkn. Figure 3: Comparison of estimates ˆθn, ˜θnand ˇθnfor the Γ(1.5,1) distribution. (a)-(c) Dotted line: true value, solid line: mean over 100 samples, dashed lines: empirical 90% confidence interval. (d) Dotted line: ˆθn, solid line: ˜θn, dashed line: ˇθn.

(22)

0 12 24 36 48 60 72 84 96 108 120 0.0

0.2 0.4 0.6 0.8 1.0 1.2 1.4 1.6 1.8 2.0

. . . .

(a) Estimate ˆθn as a function of kn.

0 12 24 36 48 60 72 84 96 108 120

0.0 0.2 0.4 0.6 0.8 1.0 1.2 1.4 1.6 1.8 2.0

. . . .

(b) Estimate ˜θn as a function of kn.

0 12 24 36 48 60 72 84 96 108 120

0.0 0.2 0.4 0.6 0.8 1.0 1.2 1.4 1.6 1.8 2.0

. . . .

(c) Estimate ˇθn as a function of kn.

0 12 24 36 48 60 72 84 96 108 120

0.0 0.8 1.6 2.4 3.2 4.0 4.8 5.6 6.4 7.2 8.0

.

.

.

. ..

. . . . . .

. . . .. .

. . . ... ... . . .

(d) Mean square error as a function ofkn. Figure 4: Comparison of estimates ˆθn, ˜θnand ˇθnfor theN(1.2,1) distribution. (a)-(c) Dotted line: true value, solid line: mean over 100 samples, dashed lines: empirical 90% confidence interval. (d) Dotted line: ˆθn, solid line: ˜θn, dashed line: ˇθn.

(23)

0 12 24 36 48 60 72 84 96 108 120 0.00

0.18 0.36 0.54 0.72 0.90 1.08 1.26 1.44 1.62 1.80

. . . .

(a) Estimate ˆθn as a function of kn.

0 12 24 36 48 60 72 84 96 108 120

0.00 0.18 0.36 0.54 0.72 0.90 1.08 1.26 1.44 1.62 1.80

. . . .

(b) Estimate ˜θn as a function of kn.

0 12 24 36 48 60 72 84 96 108 120

0.00 0.18 0.36 0.54 0.72 0.90 1.08 1.26 1.44 1.62 1.80

. . . .

(c) Estimate ˇθn as a function of kn.

0 12 24 36 48 60 72 84 96 108 120

0.0 1.4 2.8 4.2 5.6 7.0 8.4 9.8 11.2 12.6 14.0

.

. . ..

. . .. . . ... . . .. . .. .... .... . . .

(d) Mean square error as a function ofkn. Figure 5: Comparison of estimates ˆθn, ˜θn and ˇθn for the W(2.5,2.5) distribution. (a)-(c) Dotted line: true value, solid line: mean over 100 samples, dashed lines: empirical 90%

confidence interval. (d) Dotted line: ˆθn, solid line: ˜θn, dashed line: ˇθn.

(24)

0 17 34 51 68 85 102 119 136 153 170 0.0

1.2 2.4 3.6 4.8 6.0 7.2 8.4 9.6 10.8 12.0

...

(a) Estimate ˆθn as a function of kn.

0 12 24 36 48 60 72 84 96 108 120

0 1 2 3 4 5 6 7 8 9 10

. . . .

(b) Estimate ˜θn as a function of kn.

0 12 24 36 48 60 72 84 96 108 120

0 1 2 3 4 5 6 7 8 9 10

. . . .

(c) Estimate ˇθn as a function of kn.

0 12 24 36 48 60 72 84 96 108 120

0 6 12 18 24 30 36 42 48 54 60

.

.

. .

. . .. . . . . . ... . . ..

. . . ... .. .... .... . . .

(d) Mean square error as a function ofkn. Figure 6: Comparison of estimates ˆθn, ˜θn and ˇθn for the W(0.4,0.4) distribution. (a)-(c) Dotted line: true value, solid line: mean over 100 samples, dashed lines: empirical 90%

confidence interval. (d) Dotted line: ˆθn, solid line: ˜θn, dashed line: ˇθn.

(25)

0 12 24 36 48 60 72 84 96 108 120

−2.0

−1.4

−0.8

−0.2 0.4 1.0 1.6 2.2 2.8 3.4 4.0

. . . .

(a) Estimate ˆθn as a function of kn.

0 12 24 36 48 60 72 84 96 108 120

−2.0

−1.4

−0.8

−0.2 0.4 1.0 1.6 2.2 2.8 3.4 4.0

. . . .

(b) Estimate ˜θn as a function of kn.

0 12 24 36 48 60 72 84 96 108 120

−2.0

−1.4

−0.8

−0.2 0.4 1.0 1.6 2.2 2.8 3.4 4.0

. . . .

(c) Estimate ˇθn as a function of kn.

0 12 24 36 48 60 72 84 96 108 120

1.0 2.9 4.8 6.7 8.6 10.5 12.4 14.3 16.2 18.1 20.0

.. .

.. . . .. . . .. . . .. . . .. . . .. . . .. . . .. . . .. . . .. . . . .. . . .. . . . .. . .. . . .. . . . .. . .. . .. . .. . . .. . .. .. .. . .. . .

(d) Mean square error as a function ofkn. Figure 7: Comparison of estimates ˆθn, ˜θn and ˇθn for the MW(2.5) distribution. (a)-(c) Dotted line: true value, solid line: mean over 100 samples, dashed lines: empirical 90%

confidence interval. (d) Dotted line: ˆθn, solid line: ˜θn, dashed line: ˇθn.

(26)

0 12 24 36 48 60 72 84 96 108 120 0

2 4 6 8 10 12 14 16 18 20

. . . .

(a) Estimate ˆθn as a function of kn.

0 12 24 36 48 60 72 84 96 108 120

0 2 4 6 8 10 12 14 16 18 20

. . . .

(b) Estimate ˜θn as a function of kn.

0 12 24 36 48 60 72 84 96 108 120

0 2 4 6 8 10 12 14 16 18 20

. . . .

(c) Estimate ˇθn as a function of kn.

0 12 24 36 48 60 72 84 96 108 120

1.0 28.9 56.8 84.7 112.6 140.5 168.4 196.3 224.2 252.1 280.0

.

.. . . ... . . . .. . . .

(d) Mean square error as a function ofkn. Figure 8: Comparison of estimates ˆθn, ˜θn and ˇθn for the MW(0.4) distribution. (a)-(c) Dotted line: true value, solid line: mean over 100 samples, dashed lines: empirical 90%

confidence interval. (d) Dotted line: ˆθn, solid line: ˜θn, dashed line: ˇθn.

Références

Documents relatifs

Our result is the classification of the G-vector bundles over X up to G-equivariant isomorphism.. This type of classification problems have been introduced, in a

occurs in the notation, and sheaves are taken to be sheaves in the fffp- topology (faithfully flat finitely presented) relative to the base scheme.. We denote the group

3 There seems to be no established terminology for this concept, and our choice of the term ’perfect’ was suggested by the established term ’perfectly normal’... A

On processors that feature IEEE 754 style rounding, it is most natural to perform operations on centers using any rounding mode ◦ (and preferably rounding to the nearest),

T hese stories were collected as part of the Family Medicine in Canada: History and Narrative in Medicine Program, an ongoing project of the College of Family Physicians of

of CAMBRIDGE of COPENHAGEN. 1 The investigations in this paper were completed in a collaboration between the authors several years ago. An account of the

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des

Circuits for True Random Number Generation with On-Line Quality MonitoringA.