• Aucun résultat trouvé

A tail bound for sums of independent random variables : application to the symmetric Pareto distribution

N/A
N/A
Protected

Academic year: 2021

Partager "A tail bound for sums of independent random variables : application to the symmetric Pareto distribution"

Copied!
8
0
0

Texte intégral

(1)

HAL Id: hal-00192500

https://hal.archives-ouvertes.fr/hal-00192500

Preprint submitted on 28 Nov 2007

HAL

is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or

L’archive ouverte pluridisciplinaire

HAL, est

destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires

A tail bound for sums of independent random variables : application to the symmetric Pareto distribution

Christophe Chesneau

To cite this version:

Christophe Chesneau. A tail bound for sums of independent random variables : application to the

symmetric Pareto distribution. 2007. �hal-00192500�

(2)

A tail bound for sums of independent random variables : application to the

symmetric Pareto distribution

Christophe Chesneau

Laboratoire de Math´ematiques Nicolas Oresme, Universit´e de Caen Basse-Normandie,

Campus II, Science 3, 14032 Caen, France.

http://www.chesneau-stat.com chesneau@math.unicaen.fr

Abstract:In this note we prove a bound of the tail probability for a sum of nindependent random variables. It can be applied under mild assumptions;

the variables are not assumed to be almost surely absolutely bounded, or admit finite moments of all orders. Moreover, in some cases, it is signifi- cantly better than the bound obtained via the standard Markov inequality.

To illustrate this result, we investigate the bound of the tail probability for a sum ofnweighted i.i.d. random variables having the symmetric Pareto distribution.

AMS 2000 subject classifications:60E15.

Keywords and phrases:Tail bound, symmetric Pareto distribution.

1. MOTIVATION

Let (Yi)i∈N be independent random variables. For any n ∈ N, we wish to determine the smallest sequence of functionspn(t) such that

P Ã n

X

i=1

Yi≥t

!

≤pn(t), t∈[0,∞[.

This problem is well-known; numerous results exist. The most famous of them is the Markov inequality. Under mild assumptions on the moments of theXi’s, it gives a polynomial boundpn(t). In many cases, this bound can be improved. For instance, if theXi’s are almost surely absolutely bounded, or admit finite mo- ments of all orders (and these moments satisfy some inequalities), the Bernstein inequalities provide better results. The obtained boundspn(t) are exponential.

SeePetrov(1995) and Pollard(1984) for further details and complete bibliog- raphy.

In this note, we present a new inequality which provides a boundpn(t) of the formpn(t) =vn(t) +wn(t), wherevn(t) is polynomial, andwn(t) is exponential.

It can be applied under mild assumptions on theXi’s; as for the Markov inequal- ity, only knowledge of the order of a finite moment is required. The main interest

(3)

Christophe Chesneau/A tail bound for sums of independent random variables 2

not satisfied, and can give better results than the Markov inequality. In order to illustrate this, we investigate the bound of the tail probability for a sum of n weighted i.i.d. random variables having the symmetric Pareto distribution.

This is particularly interesting because the exact expression of the distribution of such a sum is really difficult to identify. See, for instance,Ramsay (2006).

Moreover, there are some applications in economics, actuarial science, survival analysis and queuing networks.

The note is organized as follows. Section2 presents the main result. In Sec- tion3 we illustrate the use of this result by considering the symmetric Pareto distribution. The technical proofs are postponed to Section4.

2. MAIN RESULT

Theorem 2.1 below presents a bound of the tail probability for a sum of n independent random variables. As mentioned in Section1, it requires knowledge only of the order of a finite moment.

Theorem 2.1. Let(Yi)i∈N be independent random variables. We suppose that

• for any n∈N, and any i∈ {1, ..., n}, we have, w.l.o.g.,E(Yi) = 0,

• there exists a real number p ≥ 2 such that, for any n ∈ N, and any i∈ {1, ..., n}, we haveE(|Yi|p)<∞.

Then, for anyt >0, and any n∈N, we have

P Ã n

X

i=1

Yi≥t

!

≤Cpt−pmax³

rn,p(t),(rn,2(t))p/2´ + exp

µ

− t2 16bn

, (2.1)

where, for anyu∈ {2, p},rn,u(t) =Pn i=1

|Yi|u1{|Yi|≥3bnt }

´

,bn=Pn i=1

Yi2¢ andCp= 22p+1max¡

pp, pp/2+1epR

0 xp/2−1(1−x)−pdx¢ .

The proof of Theorem2.1uses truncation technics, the Rosenthal inequality and one of the Bernstein inequalities. SeeRosenthal(1970) andPetrov(1995).

Clearly, Theorem 2.1 can be applied for a wide class of random variables.

However, if the variables are almost surely absolutely bounded, or have finite moments of all orders, the Bernstein inequalities can give more optimal results than (2.1). But, when these conditions are not satisfied, Theorem2.1 becomes of interest. This fact is illustrated in Section3 below for the symmetric Pareto distribution. Other examples can be studied in a similar fashion.

3. APPLICATION: SYMMETRIC PARETO DISTRIBUTION Proposition 3.1 below investigates the bound of the tail probability for a sum ofnweighted i.i.d. random variables having the symmetric Pareto distribution.

(4)

Proposition 3.1. Let s >2 and(Xi)i∈N be i.i.d. random variables with the probability density function f(x) = 2−1s|x|−s−11{|x|≥1}. Let (ai)i∈N be a se- quence of nonzeros real numbers such that Pn

i=1|ai|s < ∞. Then, for any n ∈ N, any t ∈ (0,3bρnn), where ρn = (Pn

i=1|ai|s)1/s, and any p ∈ (2, s), we have

P Ã n

X

i=1

aiXi≥t

!

≤Kp

¡t−2p+sbp−sn ¢

n

X

i=1

|ai|s+ exp µ

− t2 16bn

, (3.1)

wherebn

s s−2

´Pn

i=1a2i,Kp= 3p−smax µ³

s s−p

´ ,³

s s−2

´p/2

Cp, and Cp = 22p+1max¡

pp, pp/2+1epR

0 xp/2−1(1−x)−pdx¢ .

Notice that, since the distribution of the variables is symmetric, the constant Cp (associated to the Rosenthal inequality) can be improved. For its optimal form, we refer toIbragimov and Sharakhmetov (1997).

In the literature, there exist several results on the approximation of the tail probability of a sum ofni.i.d. random variables having the symmetric Pareto distribution. But, to our knowledge, contrary to Proposition3.1, these results are asymptotic (i.e.t→ ∞). See, for instance,Goovaerts, Kaas, Laeven, Tang, and Vernic (2005).

Illustration. Here, we consider a simple example to compare the precision between (3.1) and the bound obtained via the Markov inequality.

Lets >2 and (Xi)i∈N be i.i.d. random variables with the probability density functionf(x) = 2−1s|x|−s−11{|x|≥1}. For any integernsuch thatn1/2−1/s(logn)−1/2>

23/2

3 s1/2, and anyp∈¡

max(s2,2), s¢

, if we take t=tn= 23/2(snlogn)1/2, then we can balance the two terms of the bound in (3.1); there exist two constants, Q1>0 andQ2>0, such that

P Ã n

X

i=1

Xi≥tn

!

≤Q1n1−s/2(logn)−p+s/2+n1−s/2≤Q2n1−s/2. (3.2)

Under the same framework, for any p < s, the Markov inequality combined with the Rosenthal inequality (see Lemma 4.1 below) implies the existence of two constants,R1>0 andR2>0, such that

P Ã n

X

i=1

Xi≥tn

!

≤t−pn E ï

¯

¯

¯

¯

n

X

i=1

Xi

¯

¯

¯

¯

¯

p!

≤R1t−pn np/2≤R2(logn)−p/2. (3.3)

Therefore, forn large enough, the rate of convergence in (3.2) is really faster than those in (3.3). In this case, (3.1) gives a better result than the Markov inequality.

(5)

Christophe Chesneau/A tail bound for sums of independent random variables 4

4. PROOFS

. Proof of Theorem2.1.Letn∈N. For anyt >0, we have

P Ã n

X

i=1

Yi≥t

!

=P Ã n

X

i=1

(Yi−E(Yi))≥t

!

≤U+V,

where

U =P Ã n

X

i=1

³

Yi1{|Yi|≥3bnt } −E³

Yi1{|Yi|≥3bnt }

´´≥ t 2

!

and

V =P Ã n

X

i=1

³

Yi1{|Yi|<3bnt } −E³

Yi1{|Yi|<3bnt }

´´≥ t 2

! . Let us boundU and V, in turn.

The upper bound forU.The Markov inequality yields

U ≤2pt−pE ï

¯

¯

¯

¯

n

X

i=1

³

Yi1{|Yi|≥3bnt } −E³

Yi1{|Yi|≥3bnt }

´´

¯

¯

¯

¯

¯

p!

. (4.1)

Now, let us introduce the Rosenthal inequality. SeeRosenthal(1970).

Lemma 4.1 (Rosenthal’s inequality). Let p≥2 and(Xi)i∈N be independent random variables such that, for any n ∈ N, and any i ∈ {1, ..., n}, we have E(Xi) = 0andE(|Xi|p)<∞. Then we have

E ï

¯

¯

¯

¯

n

X

i=1

Xi

¯

¯

¯

¯

¯

p!

≤cpmax

n

X

i=1

E(|Xi|p), Ã n

X

i=1

E¡ Xi2¢

!p/2

,

wherecp= 2 max¡

pp, pp/2+1epR

0 xp/2−1(1−x)−pdx¢ .

For any i ∈ {1, ..., n}, set Zi = Yi1{|Yi|≥3bnt } −E³

Yi1{|Yi|≥3bnt }

´. Since E(Zi) = 0 and E(|Zi|p) ≤ 2p

|Yi|p1{|Yi|≥3bnt }

´ ≤ 2pE(|Yi|p) < ∞, Lemma 4.1applied to the independent variables (Zi)i∈N gives

E ï

¯

¯

¯

¯

n

X

i=1

Zi

¯

¯

¯

¯

¯

p!

≤cpmax

n

X

i=1

E(|Zi|p), Ã n

X

i=1

E¡ Zi2¢

!p/2

, (4.2)

wherecp= 2 max¡

pp, pp/2+1epR

0 xp/2−1(1−x)−pdx¢ .

(6)

It follows from (4.1) and (4.2) that

U ≤ 2pt−pcpmax à n

X

i=1

E(|Zi|p), Ã n

X

i=1

E¡ Zi2¢

!!

≤ 22pt−pcpmax

n

X

i=1

E

³

|Yi|p1{|Yi|≥3bnt }

´ ,

à n X

i=1

E

³

Yi21{|Yi|≥3bnt }

´

!p/2

= Cpt−pmax³

rn,p(t),(rn,2(t))p/2´

, (4.3)

whereCp= 22pcp.

The upper bound forV.Let us present one of the Bernstein inequalities. See, for instance,Petrov(1995).

Lemma 4.2(Bernstein’s inequality). Let(Xi)i∈N be independent random vari- ables such that, for anyn∈N and anyi∈ {1, ..., n}, we haveE(Xi) = 0 and

|Xi| ≤M <∞. Then, for anyλ >0, and any n∈N, we have

P Ã n

X

i=1

Xi≥λ

!

≤exp Ã

− λ2 2(d2n+λM3 )

! ,

whered2n=Pn

i=1E(Xi2).

For any i ∈ {1, ..., n}, set Zi = Yi1{|Yi|<3bnt } −E³

Yi1{|Yi|<3bnt }

´. Since E(Zi) = 0 and |Zi| ≤ |Yi|1{|Yi|<3bnt }+E

³

|Yi|1{|Yi|<3bnt }

´

6btn, Lemma 4.2 applied with the independent variables (Zi)i∈N, and the parametersλ= 2t and M =6btn, gives

V ≤exp

− t2

8³ Pn

i=1

Yi1{|Yi|<3bnt }

´+6t¡6bn

t

¢´

.

SincePn i=1

Yi1{|Yi|<3bnt }

´≤Pn i=1

Yi2¢

=bn, it comes

V ≤exp µ

− t2 16bn

. (4.4)

Putting (4.3) and (4.4) together, we obtain the inequality

P Ã n

X

i=1

Yi≥t

!

≤U+V ≤Cpt−pmax³

rn,p(t),(rn,2(t))p/2´ + exp

µ

− t2 16bn

¶ . Theorem2.1is proved.

(7)

Christophe Chesneau/A tail bound for sums of independent random variables 6

. Proof of Proposition 3.1. Let n ∈ N. Set, for any i ∈ {1, ..., n}, Yi = aiXi. Clearly, (Yi)i∈N are independent random variables such that E(Yi) = aiE(Xi) = 0 and Pn

i=1E(Yi2) = ³

s s−2

´Pn

i=1a2i < ∞. In order to apply Theorem 2.1, let us bound the term rn,u(t) = Pn

i=1

|Yi|u1{|Yi|≥3bnt }

´ = Pn

i=1|ai|uE µ

|Xi|u

|Xi|≥|3bn

ai|t

ª

for anyu∈ {2, p}, and anyp∈¡

max(s2,2), s¢ . Recall that ρn = (Pn

i=1|ai|s)1/s. Since t ∈ ³ 0,3bρnn´

⊆ ³ 0,3bσnn´

, where σn = supi=1,...,n|ai|, we have E

µ

|Xi|u

|Xi|≥|3bn

ai|t

ª

= sR

3bn

|ai|txu−s−1dx =

³ s s−u

´ ³3bn

|ai|t

´u−s

.Hence,

rn,u(t) = µ s

s−u

¶ µ3bn

t

u−s n

X

i=1

|ai|s.

Therefore,

max³

rn,p(t),(rn,2(t))p/2´

≤Rp

µ3bn

t

p

max

 µ3bn

n

−s

, õ

3bn

n

−s!p/2

,

whereRp= max µ³

s s−p

´ ,³

s s−2

´p/2¶ .

Since t ∈ ³ 0,3ρbn

n

´ and p > 2, we have max Ã

³3bn

n

´−s

, µ³

3bn

n

´−sp/2!

=

³3bn

n

´−s

.Hence,

max³

rn,p(t),(rn,2(t))p/2´

≤Rp

µ3bn

t

p−s n

X

i=1

|ai|s. (4.5)

Putting (4.5) in Theorem2.1, we obtain

P Ã n

X

i=1

aiXi≥t

!

≤Kp¡

t−2p+sbp−sn ¢

n

X

i=1

|ai|s+ exp µ

− t2 16bn

¶ ,

wherebn

s s−2

´Pn

i=1a2i,Kp = 3p−smax µ³

s s−p

´ ,³

s s−2

´p/2

Cp, and Cp = 22p+1max¡

pp, pp/2+1epR

0 xp/2−1(1−x)−pdx¢

. Proposition3.1is proved.

References

Goovaerts, M., Kaas, R., Laeven, R., Tang, Q., and Vernic, R. (2005). The Tail Probability of Discounted Sums of Pareto-like Losses in Insurance.Scandina- vian Actuarial Journal, Issue 6, November 2005 , pp. 446 - 461

(8)

Ibragimov, R., and Sharakhmetov, Sh. (1997). On an exact constant for the Rosenthal inequality,Theory Probab. Appl., 42, pp. 294302.

Petrov, V. V. (1995). Limit Theorems of Probability Theory, Clarendon Press, Oxford.

Pollard, D. (1984). Convergence of Stochastic Processes, Springer, New York.

Ramsay, Colin M. (2006) The distribution of sums of certain i.i.d. Pareto vari- ates.Commun. Stat., Theory Methods. 35, No.1-3, pp. 395-405.

Rosenthal, H. P. (1970). On the subspaces ofLp(p≥2) spanned by sequences of independent random variables,Israel Journal of Mathematics8: pp. 273-303.

Références

Documents relatifs

We present two-sided estimates of moments and tails of polynomial chaoses of order at most three generated by in- dependent symmetric random variables with log-concave tails as well

The original result of Komlós, Major and Tusnády on the partial sum process [9] can be used for asymptotic equivalence in regression models, but presumably with a non-optimal

- We show how estimates for the tail probabilities of sums of independent identically distributed random variables can be used to estimate the tail probabilities of sums

Erratum / Correction to : “A bound on the moment generating function of a sum of dependent.. variables with an application to simple random sampling

A bound on the moment generating function of a sum of dependent variables with an application to simple random sampling without replacement.. Annales

In order to check that a parametric model provides acceptable tail approx- imations, we present a test which compares the parametric estimate of an extreme upper quantile with

Random walk, countable group, entropy, spectral radius, drift, volume growth, Poisson

Maximum a posteriori and mean posterior estimators are constructed for various prior distributions of the tail index and their consistency and asymptotic normality are