• Aucun résultat trouvé

On automatic bias reduction for extreme expectile estimation

N/A
N/A
Protected

Academic year: 2022

Partager "On automatic bias reduction for extreme expectile estimation"

Copied!
37
0
0

Texte intégral

(1)

HAL Id: hal-03086048

https://hal.archives-ouvertes.fr/hal-03086048v2

Preprint submitted on 5 Jan 2021

HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.

On automatic bias reduction for extreme expectile estimation

Stéphane Girard, Gilles Stupfler, Antoine Usseglio-Carleve

To cite this version:

Stéphane Girard, Gilles Stupfler, Antoine Usseglio-Carleve. On automatic bias reduction for extreme expectile estimation. 2021. �hal-03086048v2�

(2)

On automatic bias reduction for extreme expectile estimation

St´ephane Girarda, Gilles Stupflerb & Antoine Usseglio-Carleveb

a Univ. Grenoble Alpes, Inria, CNRS, Grenoble INP, LJK, 38000 Grenoble, France b Univ Rennes, Ensai, CNRS, CREST - UMR 9194, F-35000 Rennes, France

Abstract

Expectiles induce a law-invariant risk measure that has recently gained popularity in actuar- ial and financial risk management applications. Unlike quantiles or the quantile-based Expected Shortfall, the expectile risk measure is coherent and elicitable. The estimation of extreme ex- pectiles in the heavy-tailed framework, which is reasonable for extreme financial or actuarial risk management, is not without difficulties; currently available estimators of extreme expectiles are typically biased and hence may show poor finite-sample performance even in fairly large samples.

We focus here on the construction of bias-reduced extreme expectile estimators for heavy-tailed distributions. The rationale for our construction hinges on a careful investigation of the asymptotic proportionality relationship between extreme expectiles and their quantile counterparts, as well as of the extrapolation formula motivated by the heavy-tailed context. We accurately quantify and estimate the bias incurred by the use of these relationships when constructing extreme expectile estimators. This motivates the introduction of classes of bias-reduced estimators whose asymptotic properties are rigorously shown, and whose finite-sample properties are assessed on a simulation study and three samples of real data from economics, insurance and finance.

Keywords: Asymmetric least squares, Bias reduction, Expectiles, Extremes, Extrapolation, Heavy tails, Second-order parameter.

1 Introduction

Theαth expectileξαof an integrable random variableY is defined as ξα= arg min

θ∈R Eα(Y θ)ηα(Y)), (1)

where ηα(u) = 1{u 0}|u2 is the so-called expectile check function and 1{·} the indicator function. Expectiles areL2−analogues of quantiles, which are obtained by minimising asymmetrically weighted mean absolute deviations (Koenker and Bassett,1978):

qαarg min

q∈R

Eα(Y q)ρα(Y)),

where ρα(u) = 1{u 0}||u| is the quantile check function. Expectiles, originally introduced byNewey and Powell(1987) in the context of testing for homoscedasticity and conditional symmetry of the error distribution in linear regression, are always uniquely defined by their convex optimisa- tion problem (unlike quantiles, for which uniqueness is only guaranteed if the underlying distribution function is strictly increasing). They satisfy

α=E[|Y ξα|1{Y ξα}]/E|Y ξα|. (2)

In particular, contrary to quantiles, expectiles are determined by tail expectations rather than tail prob- abilities. Expectiles induce risk measures which have recently gained substantial traction in the risk

(3)

management context, for several axiomatic and practical reasons, including the fact that they induce the only risk measure, apart from the simple expectation, defining a law-invariant, coherent (Artzner et al.,1999) and elicitable (Gneiting,2011) risk measure, seeBellini et al.(2014) andZiegel(2016). As such, a natural backtesting methodology exists for expectiles. Quantiles are indeed elicitable, but not coherent in general, and are often criticised for missing out on relevant information about distribution tails because their calculation only depends on the frequency of tail events. The Expected Shortfall, meanwhile, takes into account the actual values of the risk variable on the tail event and is a coherent risk measure, but is not elicitable. In financial applications specifically, expectiles are linked through Formula (2) to the notion of gain-loss ratio, which is well-known in the literature on no good deal valuation in incomplete markets and is a popular performance measure in portfolio management (see Bellini and Di Bernardino,2017, and references therein). Further axiomatic, theoretical and practical justification for the use of expectiles alongside or instead of the quantile and Expected Shortfall can be found in, among others,Ehm et al.(2016) andBellini and Di Bernardino(2017).

Expectile estimation was first considered inNewey and Powell(1987) in the context of linear regression, and has been developing since then; recent contributions include Sobotka and Kneib(2012) as well asHolzmann and Klar(2016) andKr¨atschmer and Z¨ahle(2017) for the estimation of central, non-tail expectiles of fixed orderα. By contrast, probabilistic aspects of extreme expectiles, withα1, were first considered byBellini et al.(2014) and laterBellini and Di Bernardino(2017). The estimation of extreme expectiles has been considered even more recently inDaouia et al.(2018,2019,2020), where it is shown that extreme expectiles can be estimated in several ways. The construction of each of the estimators uses a combination of the heavy-tailed distributional assumption (representing the tail structure of many financial and actuarial data examples fairly well, seee.g. p.9 of Embrechts et al.

(1997) and p.1 of Resnick (2007)) and a remarkable asymptotic proportionality relationship linking extreme expectiles to their quantile counterparts. An inspection of the finite-sample results ofDaouia et al.(2018,2020) reveals that these estimators suffer from substantial finite-sample bias, even though this is not clear from the asymptotic normality results presented therein. This is of course an issue if expectiles are to be used widely in the management of extreme risk. Partial answers to this bias problem are presented inGirard et al.(2020a) andGirard et al.(2020b) in a regression setup; however, the arguments therein rely on either a simplified setting where one of the leading bias terms can be shown to vanish, or on a crude asymptotic expansion of the bias term where only the part of the bias that is in some sense easiest to correct in the conditional setup is removed. To be more specific, the method ofGirard et al. (2020b) is designed to eliminate the source of bias due to the amount of tail heaviness (which has an important influence in the asymptotic proportionality relationship), but cannot handle the bias purely due to the second-order framework. It will therefore perform poorly when this particular source of bias dominates, that is, when the underlying heavy-tailed distribution is far from the standard Pareto distribution on which the extrapolation procedure is based.

The contribution of this paper is to provide a wide class of automatic, data-driven, second-order fully bias-reduced versions of the extreme expectile estimators currently available in the literature. This is done in three steps. First, we briefly recall the construction of extreme expectile estimators at a levelβ =βn 1 such that n→ ∞, where ndenotes sample size. This construction is based on the extrapolation of purely empirical expectile estimators at a much lower, intermediate levelαn 1, with the help of an appropriate tail index estimator, and we highlight how bias may appear from tail index estimation and from the extrapolation procedure itself through a specific bias term. Second, in the Hall-Welsh subclass of heavy-tailed models (Hall and Welsh,1985) which contains most of the heavy-tailed distributions typically encountered in extreme value analysis, we provide estimators of this bias term that we then use to define versions of extrapolated extreme expectile estimators that are fully corrected for extrapolation bias. Third and last, we discuss the use of bias-reduced estimators of the tail index as a way to complete the elimination of the bias of our extrapolated estimators.

We compare the use of an existing bias-reduced version of the Hill estimator constructed in Caeiro et al.(2005) with that of a novel bias-reduced modification of an expectile-based estimator, the biased version having been studied in Girard et al. (2020b) and Padoan and Stupfler (2020). As we shall

(4)

see in our simulation study, the expectile-based tail index estimator allows us to gain accuracy mostly in those difficult situations where the so-called second-order parameter is close to 0, that is, when the underlying heavy-tailed distribution is far from the standard Pareto distribution on which the extrapolation procedure is based. In this sense, we make substantial further gains compared to the partial bias-correction procedure discussed inGirard et al.(2020b) that cannot handle this case. The combination of these second and third steps results in fully bias-reduced classes of extrapolated extreme expectile estimators in our heavy-tailed setting. To make these estimators completely automatic, we introduce a selection rule of the Asymptotic Mean Squared Error-optimal value of the tuning parameter αn representing the upper sample fraction used in our tail index and expectile estimators as well as in the extrapolation bias correction term. This results in estimators whose finite-sample performance is far superior to that of previously considered estimators in the extreme expectile estimation literature, as we shall illustrate in our simulation study.

The paper is organised in the following way. Section 2 gives details on our estimation framework and briefly reviews currently available extreme expectile estimators. Section 3 contains the main contributions of the paper on the bias-reduced estimation of extreme expectiles; all our methods and samples of real data are incorporated into the R package Expectrem, currently available athttps:

//github.com/AntoineUC/Expectrem, and Section3.5explains in detail what this package contains and how to call our methods. Section4 showcases the performance of our estimators on a simulation study. We finally illustrate the practical applicability of our procedures on real samples of economic, actuarial and financial data in Section 5. More details about the implementation of our methods, mathematical proofs and a complete set of numerical results from our simulation study are relegated to the Supplementary Material document.

2 State of the art on extreme expectile estimation

We start by describing the existing techniques in extreme expectile estimation. Suppose throughout that the available data (Y1, . . . , Yn) is made of independent realisations of the random variableY with cumulative distribution functionF (resp. survival functionF = 1F). It is assumed thatE|Y|<∞, so that expectiles ofY of any order exist indeed. Our goal is to estimate an extreme expectile of Y, i.e.whose order tends to 1 asn→ ∞.

Intermediate level We start by the case of a so-called intermediate levelαn1, namely such that n(1αn)→ ∞asn→ ∞. Intermediate levels tend to infinity slowly enough that expectiles are well within the sample and can thus be estimated by purely empirical methods. It was observed byJones (1994) that the αnth expectile is actually the quantile of level αn associated with the distribution functionE defined by

E(y) = 1E(y) =E[(Y y)1{Y >y}] E|Y y| .

Recall that the quantile at levelαnof the distribution functionFis defined asqαn= inf{yR|F(y) αn} = inf{y R|F(y)1αn}. Intermediate quantiles of F may then be estimated by inverting the empirical conditional survival function induced by the observations:

qbαn = infn

yR|Fbn(y)1αn

o

=Yn−bn(1−αn)c,n with Fbn(y) = 1 n

n

X

i=1

1{Yi>y}.

HereY1,nY2,n≤ · · · ≤Yn,n are the order statistics associated with (Y1, . . . , Yn) andb·c denotes the floor function. We apply the same principle to the estimation of the intermediate expectile: replacing population averaging with sample averaging in the definition of the distribution functionE results in the estimator

ξbαn= infn

yR|Ebn(y)1αn

o

with Ebn(y) = Pn

i=1(Yiy)1{Yi>y}

Pn

i=1|Yiy| .

(5)

This estimator is an unconditional version of the intermediate conditional expectile estimator intro- duced inGirard et al.(2020b). A straightforward calculation shows that this estimator is in fact also exactly theLeast Asymmetrically Weighted Squares (LAWS) estimatorstudied inDaouia et al.(2018), that is, the unique solution of the empirical counterpart of the minimisation problem (1):

ξbαn = arg min

θ∈R n

X

i=1

ηαn(Yiθ). (3)

An alternative estimator can be found, in the class of heavy-tailed distributions we shall focus on hereafter. Recall that the distribution ofY is heavy-tailed if and only if there existsγ >0 such that

∀y >0, lim

t→∞

F(ty)

F(t) =y−1/γ or equivalently lim

t→∞

q1−(ty)−1 q1−t−1

=yγ.

The tail index γ tunes the tail heaviness of the distribution of Y: ifγ > a then E(Y1/a1{Y >0}) =

(a precise statement is Exercise 1.16 p.35 in de Haan and Ferreira, 2006). Our minimal working assumption throughout will therefore be thatγ <1 andE(Y)<∞, whereY = max(−Y,0). In this case, we have the following asymptotic proportionality relationship between expectile and quantile:

limα↑1

ξα

qα = (γ−11)−γ. (4)

This was first noted byBellini et al.(2014). This connection suggests the class ofindirect estimators

ξαn= (γ−11)−γbqαn= (γ−11)−γYn−bn(1−αn)c,n (5)

whereγis a consistent estimator ofγ.

Extreme level The problem of most relevance in extreme value analysis is to consider the case of a levelβn1 such thatn(1βn)c <asn→ ∞. In this situation, purely empirical methods are no longer consistent, and one has to use information about the tail of the data in order to construct an extrapolation procedure. In the context of expectile estimation, this is made possible by the heavy tail assumption and convergence (4): these entail

ξβn

ξαn

qβn

qαn

1βn

1αn

−γ

as n→ ∞.

We call this approximation the Weissman approximation, after the work of Weissman(1978) on ex- treme quantile estimation. This justifies introducing the class of semiparametric extrapolating estima- tors

ξ?βn=

1βn

1αn

−γ

ξαn (6)

where ξα

n is any consistent estimator of ξαn. One immediately deduces from (6) two subclasses of estimators, replacing ξα

n by the LAWS estimator of ξαn or its indirect counterpart; in the latter, the estimator of γ can be chosen different from the estimator featured in the above extrapolation procedure, although we shall not pursue this here for the sake of simplicity. One may also construct a weighted combination of the LAWS-based and indirect estimators, as done inDaouia et al.(2021), although the finite-sample benefit of doing so can be marginal.

The extrapolated LAWS-based and indirect estimators unfortunately suffer from a sizeable amount of finite-sample bias. This is clear from, among others, Figures 3 and 4 inDaouia et al. (2018), where it can be seen that even for the sample size n = 1,000, and for certain distributions of interest in extreme value modelling, these estimators have a relative bias of the order of 50%. This means that the estimator is on average 50% larger than the target extreme expectile. The contributions of this paper, which we gather in the next section, are a precise quantification of this bias using a standard second-order refinement of the heavy tail condition, and the introduction of automatic bias reduction procedures whose finite-sample performance will be examined in detail in Sections4and5.

(6)

3 Automatic bias reduction methodology for extreme expec- tile estimation

3.1 Rationale for our bias correction methods

The construction of the class of extrapolated estimators in (6) relies on the successive use of Equa- tion (4), in order to approximate a ratio of high expectiles by a ratio of high quantiles at corresponding levels, and an approximation of this ratio of high quantiles that is warranted by the heavy tail assump- tion. The magnitude of the bias of the extrapolated estimators will therefore be crucially driven by the rates of convergence and the error terms in these two approximations.

To simplify the exposition, we assume from now on that kn = n(1αn) is a sequence of positive integers, and we rewrite the assumptions αn 1 and n(1αn) → ∞ as kn → ∞ and kn/n 0.

This choice is motivated by the fact that in quantile estimation, this quantitykn denotes the effective sample size,i.e.the number of top order statistics eventually used for the estimation. Adopting this convention will make it easier to state and compare our results with existing results in the extreme value analysis of heavy tails. This is not a restriction in practice: our estimators of properly extreme expectiles, having orderβn 1 such thatn(1βn) c < ∞, are built on intermediate expectile estimators whose levelαnwe are free to choose, and those levels such thatkn=n(1αn) is an integer constitute a sufficient set of levels to work with.

A classical device in extreme value analysis for bias quantification is the following second-order regular variation condition that refines our initial heavy tail assumption.

C2(γ, ρ, A) The survival functionF is second-order regularly varying with index −1/γ < 0, second- order parameterρ0 and a measurable auxiliary functionAhaving constant sign and converging to 0 at infinity,i.e.

∀y >0, lim

t→∞

1 A(1/F(t))

F(ty)

F(t) y−1/γ

=y−1/γyρ/γ1

γρ .

Here and throughout the ratio (ya1)/ashould be read as logy whena= 0.

An equivalent condition on the tail quantile functiont7→q1−t−1 is that

∀y >0, lim

t→∞

1 A(t)

q1−(ty)−1

q1−t−1 yγ

=yγyρ1

ρ .

Seede Haan and Ferreira(2006, Theorem 2.3.9). Numerous examples of commonly used distributions that satisfy this assumption can be found inBeirlant et al.(2004).

The fundamental argument behind our methodology is that the direct and indirect extrapolated esti- mators,i.e.

ξbβ?n=

n(1βn) kn

−γ

ξb1−kn/n and ξe?βn =

n(1βn) kn

−γ

−11)−γYn−kn,n

whereγis a

kn−consistent estimator ofγ, satisfy

log ξb?β

n

ξβn

!

= (γγ) log

kn

n(1βn)

+ log ξb1−kn/n ξ1−kn/n

!

log

n(1βn) kn

γ ξβn

ξ1−kn/n

(7)

and log ξe?β

n

ξβn

!

= (γγ) log

kn n(1βn)

+ log

−11)−γ −11)−γ

+ log

Yn−kn,n q1−kn/n

log

n(1βn) kn

γ

γ−11γ ξβn q1−kn/n

. (8)

(7)

Under conditionC2(γ, ρ, A) and standard technical assumptions on kn and βn, ξb1−kn/n and Yn−kn,n

have the same rate of convergence 1/

kn, which is also the rate of convergence of the final (pure bias) nonrandom term, seee.g.Theorem 5 inDaouia et al.(2020) and its proof. Since log(kn/[n(1 βn)])→ ∞, the first term dominates, leading to a common asymptotic distribution forξbβ?

n andξeβ?

n: if

knγ)−→d Γ, then

kn

log(kn/[n(1βn)])log ξb?βn ξβn

!

−→d Γ and

kn

log(kn/[n(1βn)])log ξeβ?n ξβn

!

−→d Γ. (9)

A naive bias-correction strategy would thus focus on the bias incurred in the estimation ofγ, identifiable through the asymptotic distribution Γ. However, Equations (7) and (8) reveal another source of bias, which is the use of the Weissman approximation itself to control the final terms in Equations (7) and (8) (note that the second term in Equation (7) and the third term in Equation (8) are asymptotically unbiased, see Theorem 2 in Daouia et al. (2018) and Theorem 2.4.8 p.52 in de Haan and Ferreira (2006)). Our contribution hereafter is to design fully bias-reduced extreme expectile estimators by eliminating both of these sources of bias.

3.2 Construction of bias-reduced extreme expectile estimators

We construct bias-reduced versions of the extrapolated estimators ξb?β

n and ξeβ?

n in two steps. Our methods will feature estimators of the second-order parameterρ, but also estimators of the auxiliary functionA. Estimating this function without any further assumption can be a difficult task; however, for most of the distributions satisfying conditionC2(γ, ρ, A) used for modelling purposes, the function Atakes the formA(t) =bγtρ, for a certain nonzero constantb andρ <0. We assume in what follows that the functionAis indeed of this form, which amounts to assuming that the underlying distribution belongs to the Hall-Welsh class in the sense ofGomes and Pestana(2007). We give a list of examples of classical heavy-tailed distributions in Table1, containing among others the distributions we shall work with in our simulation study, with their respective values ofγ,ρandb.

The functionAcan then be estimated using consistent estimatorsγ,bandρofγ,bandρ, respectively.

We assume in Section3.2.1that such estimators are given; we shall explain in detail in Section3.2.2 which estimatorsγwe consider, while the estimatorsbandρare calculated directly from the R package mop(see AppendixAfor a brief summary of how these estimators are constructed). We start by dealing with the bias due to the extrapolation procedure itself, contained in the final terms of Equations (7) and (8).

3.2.1 Bias due to the extrapolation procedure

We start by the nonrandom bias term in Equation (7), and we write n(1βn)

kn

γ

ξβn ξ1−kn/n =

n(1βn) kn

γ

qβn q1−kn/n

| {z }

1 +B1,n

× γ−11−γ q1−kn/n ξ1−kn/n

| {z }

1 +B2,n

× γ−11γ ξβn qβn

| {z }

1 +B3,n

. (10)

By Theorem 2.3.9 inde Haan and Ferreira (2006), the bias term B1,n can be written as B1,n= [n(1βn)/kn]−ρ1

ρ A(n/kn)(1 + o(1)).

We now focus on the other two bias termsB2,nandB3,nlinking an expectile to its quantile counterpart, at intermediate and extreme levels. It follows from the proof of Proposition 1 inDaouia et al.(2018)

(8)

that

F(ξα)

1α = γ−11

(1 +r(α)) (11)

with 1 +r(α) =

1E[Y] ξα

1

1

1 +A 1

Fα)

1

γ(1γρ)(1 + o(1)) −1

asα1.

Using Lemma 1 inDaouia et al.(2020) together with the heavy tail assumption then entails ξα

qα = γ−11−γ

(1 +r(α))−γ 1 + γ−11−ρ

(1 +r(α))−ρ1

ρ A((1α)−1)(1 + o(1))

! (12) asα1. Withα= 1kn/nandα=βn, this yields

B2,n= (1 +r(1kn/n))γ 1 + γ−11−ρ

(1 +r(1kn/n))−ρ1

ρ A(n/kn)(1 + o(1))

!−1

1

andB3,n= (1 +r(βn))−γ 1 + γ−11−ρ

(1 +r(βn))−ρ1

ρ A((1βn)−1)(1 + o(1))

!

1.

Each of these bias terms can be estimated. Recall our assumption thatA(t) =bγtρ, and use consistent estimatorsγ ofγ,ρofρandbofb to estimateB1,n by

B1,n=[n(1βn)/kn]−ρ1

ρ bγ(n/kn)ρ.

Let furtherYndenote the sample mean ofY1, . . . , Yn,ξ1−k

n/nbe either the LAWS or indirect interme- diate expectile estimator, andξ?βnbe the related extrapolated estimator (in our current implementation we use the LAWS estimator forξ1−k

n/nand its extrapolated version for ξ?β

n). The remainder terms r(1kn/n) andr(βn) are estimated by

r(1kn/n) = 1 Yn

ξ1−k

n/n

! 1

12kn/n

1 + b[Fbn1−kn/n)]−ρ 1γρ

−1

1

and r?n) = 1 Yn

ξ?βn

! 1

n1 1 +b γ−11−ρ

1γρ (1βn)−ρ

!−1

1.

This yields estimators ofB2,n andB3,n as

B2,n= (1 +r(1kn/n))γ 1 + γ−11−ρ

(1 +r(1kn/n))−ρ1

ρ bγ(n/kn)ρ

!−1

1

andB3,n= (1 +r?n))−γ 1 + γ−11−ρ

(1 +r?n))−ρ1

ρ bγ(1βn)−ρ

!

1.

We deduce from (7) and (10) that a version of the direct extrapolated estimatorξbβ?

n, corrected for the bias exclusively due to the heavy-tailed extrapolation, is

ξb?,RBβ

n =ξbβ?n(1 +B1,n)(1 +B2,n)(1 +B3,n)

=

"n(1βn) kn

−γ

ξb1−kn/n

#

(1 +B1,n)(1 +B2,n)(1 +B3,n). (13)

(9)

A correction for the extrapolation bias in the indirect estimator is simpler. Indeed, n(1βn)

kn γ

γ−11γ ξβn

q1−kn/n =

n(1βn) kn

γ qβn

q1−kn/n

| {z }

1 +B1,n

× γ−11γ ξβn

qβn

| {z }

1 +B3,n

. (14)

From (14), a version of the indirect extrapolated estimatorξeβ?

n, corrected for the bias exclusively due to the heavy-tailed extrapolation, is then

ξeβ?,RB

n =ξeβ?n(1 +B1,n)(1 +B3,n)

=

"n(1βn) kn

−γ

−11)−γYn−kn,n

#

(1 +B1,n)(1 +B3,n). (15) The methodology introduced here differs from the earlier bias reduction techniques introduced inGirard et al.(2020a) and Girard et al.(2020b) in a regression setup. InGirard et al.(2020a), the bias term proportional to 1/ξ1−kn/n is not corrected because the theory therein focuses on a random variable with expectation 0; inGirard et al.(2020b), the bias term proportional to A(n/kn) is not corrected because it is very difficult to correct accurately this source of bias in the conditional, nonparametric setup on which that paper focuses. In addition, the correction terms in the aforementioned papers rely on linearising the bias terms, whereas we keep the structure of the bias as intact as possible. This makes our correction termB2,n more accurate indeed than in these earlier attempts. The inclusion of termB3,n is also new; while it could be expected that this term only has a small influence because it relies on quantities calculated at a higher asymptotic order, it is our experience that its inclusion substantially improves finite-sample performance.

We now concentrate on reducing the bias due to the estimation of γ. Combined with the general corrections in (13) and (15), this will result in a fully bias-corrected extrapolated estimator.

3.2.2 Bias reduction for tail index estimation: an expectile-based method

Numerous tail index estimators have been introduced and studied in the literature; a review of some of the most important estimators is given in de Haan and Ferreira (2006, Chapter 3). There are various techniques for the reduction of bias of such estimators, an excellent summary being given in the Introduction ofCai et al. (2013). Here our contribution is to propose a bias-reduced version of a purely expectile-based tail index estimator, our procedure being partly inspired by a method developed in, among others,Caeiro et al. (2005). To make the construction of this estimator easier, we start by briefly recalling how the technique ofCaeiro et al. (2005) works. Consider the classical Hill estimator (Hill,1975):

bγHk

n= 1

kn kn

X

i=1

logYn−i+1,n Yn−kn,n

.

It is known that underC2(γ, ρ, A), if moreover

knA(n/kn) λ R, then (see Theorem 3.2.5 in de Haan and Ferreira,2006)

pkn(γbkHnγ)−→ Nd λ

1ρ, γ2

.

In finite samplesλ

knA(n/kn) =

knbγ(n/kn)ρ, meaning that the pseudo-estimator

bγHkn

1 b

1ρ n

kn

ρ

(10)

should be asymptotically unbiased with the same variance as the Hill estimator.Caeiro et al. (2005) then plug in consistent estimatorsb ofbandρofρand arrive at the bias-reduced Hill estimator

bγkH,RB

n =bγkHn 1 b

1ρ n

kn

ρ! .

Theorem 3.1 ofCaeiro et al. (2005) shows that bγkH,RB

n is indeed

kn−asymptotically Gaussian with expectation zero and varianceγ2. The construction of this estimator essentially hinges on eliminating the bias by multiplying the original estimator by a quantity cancelling this bias.

We adapt here this construction to propose a bias-reduction procedure for the estimator

bγEk

n=

1 +nFbn(bξ1−kn/n) kn

−1

.

The rationale behind this estimator, studied in different contexts byGirard et al.(2020b) andPadoan and Stupfler(2020), is that, from (11),

γ=

1 +Fα) 1α

1 1 +r(α)

−1

1 + F(ξα) 1α

−1

as α1.

To find a bias-reduced version of thisasymptotic proportionality tail index estimatorbγkE

n, we follow the above idea and use the sample counterpartr(1kn/n) of r(1kn/n) defined in Section 3.2.1: this yields a bias-reduced asymptotic proportionality tail index estimator as

bγE,RBk

n =

1 +nFbn(bξ1−kn/n) kn

1 1 +r(1kn/n)

−1

.

In our current implementation we takeξ1−kn/n=ξb1−kn/nandγ=bγkH,RB

n insider(1−kn/n), specifically for the calculation of this bias-reduced version.

Our first main theoretical result gives the asymptotic normality and unbiasedness ofbγkE,RB

n .

Theorem 1. Suppose that E|Y|2+δ <for someδ >0. Assume further thatC2(γ, ρ, A)holds with 0 < γ < 1/2, ρ < 0 and A(t) = bγtρ, and let kn be a sequence such that kn → ∞ and kn/n 0 asn → ∞. If

knA(n/kn) λ1 R,

kn/q(1kn/n) λ2 R, andγ, ρ and b are consistent estimators ofγ,ρandb such that ρ) log(n) = oP(1), then

pkn(bγkE,RB

n γ)−→ Nd

0,γ3(1γ) 1

.

We provide a comparison of the two tail index estimators in terms of variance in Figure 1. The asymptotic variance of bγkE,RB

n is substantially smaller than that of bγkH,RB

n when γ is less than 0.35.

The variance ofγbkE,RB

n explodes, however, asγ1/2, which is to be expected since the estimatorsbγkE

n

andbγkE,RB

n are based on the intermediate LAWS estimatorξb1−kn/n, itself known to be asymptotically normal only whenγ <1/2 (seeDaouia et al.,2018). In our implementation (and in particular in the calculation of theBj,n in Section3.2.1), one can choose eitherγ=bγkH,RB

n orγ=γbkE,RB

n .

3.3 Fully bias-reduced estimators and their asymptotic properties

Our final, fully bias-reduced estimators are members of the following two classes. The first class, which extrapolates the intermediate LAWS estimator, is defined as

ξb?,RBβ

n =

"

n(1βn) kn

−γ

ξb1−kn/n

#

(1 +B1,n)(1 +B2,n)(1 +B3,n)

Références

Documents relatifs

In this chapter we propose inference (i.e. estimation and testing) for vast dimensional elliptical distributions. Estimation is based on quantiles, which always exist regardless of

The quantile function at an arbitrarily high extreme level can then be consistently deduced from its value at a typically much smaller level provided γ can be consistently

In this communication, we first build simple extreme analogues of Wang distortion risk measures and we show how this makes it possible to consider many standard measures of

As regards the estimation of the extreme PH risk measure, the surprising conclusion reached when discussing the performance of our estimators with moderately heavy tails is still

They proposed a kernel density estimation of heavy-tailed distributions based on a transforma- tion of the original data set with a modi…cation of the Champernowne

2 we recall the known results for the sample covariance matrix estimator under light-tailed assumptions, together with some recent high-probability results for an alternative

However, the results on the extreme value index, in this case, are stated with a restrictive assumption on the ultimate probability of non-censoring in the tail and there is no

Abstract This paper presents new approaches for the estimation of the ex- treme value index in the framework of randomly censored (from the right) samples, based on the ideas