• Aucun résultat trouvé

Multidimensional version of the results of Komlós, Major and Tusnády for vectors with finite exponential moments

N/A
N/A
Protected

Academic year: 2022

Partager "Multidimensional version of the results of Komlós, Major and Tusnády for vectors with finite exponential moments"

Copied!
68
0
0

Texte intégral

(1)

MULTIDIMENSIONAL VERSION

OF THE RESULTS OF KOML

OS, MAJOR AND TUSN

ADY

FOR VECTORS WITH FINITE EXPONENTIAL MOMENTS

A.YU. ZAITSEV

Abstract. A multidimensional version of the results of Komlos, Ma- jor and Tusnady for the Gaussian approximation of the sequence of successive sums of independent random vectors with nite exponential moments is obtained.

1. Introduction

The statement of the problem is well known. It is required to construct on a probability space a sequence of independent random vectors

X

1

:::X

n

(with given distributions) and a corresponding sequence of independent Gaussian random vectors

Y

1

:::Y

n (this means that

Y

ihas the same mean and the same covariance operator as

X

i,

i

= 1

:::n

) so that the quantity

(

XY

) = max

1 k n k

P

i=1

X

i;Pk

i=1

Y

i (1.1)

would be so small as possible with large probability. Herej

x

j= maxj

x

jj, for

x

= (

x

1

:::x

d) 2 Rd. Sometimes we shall use another notation for sum- mands and, by analogy with notation in Sakhanenko (1984), write (

X

e

Y

e)=

max

1 k n k

P

i=1

X

ei; Pk i=1

Y

ei and so on. In some sense this problem is one of the most important in probability approximations because many well-known probability theorems can be considered as consequences of results about strong approximation of sequences of sums by corresponding Gaussian se- quences. This is related to the law of iterated logarithm, to several theo- rems about large deviations, to the estimates for the rate of convergence of the Prokhorov distance in the invariance principles (see Prokhorov (1956), Skorokhod (1961), Borovkov (1973)) as well as to the Strassen-type (1964) approximations (see, for example, Csorg}o and Hall (1984)).

The rate of approximation in the invariance principle was studied by many authors (see, e.g., Prokhorov (1956), Skorokhod (1961), Borovkov (1973), Csorg}o and Revesz (1975) and the bibliography in Csorg}o and Revesz (1981),

URL address of the journal: http://www.emath.fr/ps/

Research supported by the Alexander von Humboldt Foundation, by the Russian Foun- dation of Basic Research, grants 93-11-01454 and 96-01-00672, by grant RFBR-DFG 96- 01-00096-ge, and by the International Science Foundation, grant R3600.

Received by the journal April 30, 1996. Revised November 18, 1997. Accepted for publication January 16, 1998.

c

Societe de Mathematiques Appliquees et Industrielles. Typeset by TEX.

(2)

Csorg}o and Hall (1984), Shao (1995)). Skorokhod (1961) worked out a powerful method known now as Skorokhod embedding. For a long time it seemed that it is impossible to propose a method which could be better.

However, in 1975, Komlos, Major and Tusnady (KMT) (1975{76) worked out a new method of dyadic approximation. With the help of this method they obtained an exhaustive solution of the problem mentioned above for sequences of independent identically distributed random variables. Sakha- nenko (1984) generalized and essentially improved KMT results for the case of non-identically distributed random variables.

The rst attempts to extend the KMT approximations to the multidimen- sional case (see Berkes and Philipp (1979), Philipp (1979), Berger (1982), Einmahl (1987a,b)) had a partial success only. Comparatively recently U. Einmahl (1989) obtained multidimensional analogs of KMT results which are close to optimal. However, in his results (in general case of vectors with nite exponential moments) there is an unnecessary logarithmic factor. In this paper we improve the result of Einmahl (1989) and obtain multivariate analogs of KMT results.

Notation 1.1. Writing

z

2 Rd (resp. Cd), we always keep in mind that

z

= (

z

1

:::z

d) =

z

1

e

1++

z

d

e

d, where

z

j 2R1 (resp. C1) and the

e

j

are the basis vectors. The scalar product of vectors

xy

2 Rd(resp. Cd) is denoted by

xy

=

x

1

y

1++

x

d

y

d. For

z

2Rd(orCd) we shall use the Euclidean norm k

z

k =

zz

1=2 and the maximum norm j

z

j= max

1 j kj

z

jj. For

b >

0 we denote

L

(

b

) = max1

log

b

. The distribution and the corre- sponding covariance operator of a random vector

will be denoted by L(

) and cov

(or cov

F

, if

F

=L(

)). The letter

I

will be used for the iden- tity operator. The symbols

cc

1

c

2

:::

(without arguments) will be used for absolute positive constants. The letter

c

can denote dierent constants when we do not need to x their numerical values. The numbering of con- stants starts in each new section again, except Sections 6{9, where we use the common enumeration.

Definition 1.2. (Zaitsev (1986)) Denote by Ad(

) a class of probability distributions depending on a parameter

0. The class Ad(

) consists of

d

-dimensional distributions

F

for which the function

'

(

z

) =

'

(

Fz

) = log

Z

Rd

e

hzxi

F

f

dx

g (

'

(0) = 0) (1.2) is dened and analytic fork

z

k

<

1,

z

2Cd, and

d

u

d

2v

'

(

z

) k

u

k

Dvv

(1.3) for all

u

,

v

2Rdand k

z

k

<

1, where

D

= cov

F

, and

d

u

'

is the derivative of

'

in the direction

u

.

In Section 2 we consider some properties of classesAd(

). As examples of distributions from Ad(

c

) one can consider the distributions concentrated on the ball B =

x

2 Rd: k

x

k

, more general distributions satisfy- ing Bernstein-type inequality conditions and innitely divisible distributions

(3)

with spectral measures concentrated on B (see Zaitsev (1986), pp. 205{

207).

The following Theorem 1.3 is the main result of the paper. It was an- nounced in Zaitsev (1995a, 1996b, 1997) and, in somewhat weakened form, in Zaitsev (1995b).

Theorem 1.3. Suppose that

1,

>

0 and

1

:::

nare random vectors with distributionsL(

k)2Ad(

), E

k = 0, cov

k =

I

,

k

= 1

:::n

. Then one can construct on a probability space a sequence of independent random vectors

X

1

:::X

n and a corresponding sequence of independent Gaussian random vectors

Y

1

:::Y

n so that L(

X

k) =L(

k),

k

= 1

:::n

, and

E exp

c

1( )(

XY

)

d

3

L

(

d

)

exp;

c

2( )

d

9=4+

L

;

n=

2

(1.4) where

c

1( ),

c

2( ) are positive quantities depending only on .

In a particular case, when

d

= 1 and all summands have a common vari- ance, Theorem 1.3 is equivalent to the main result of Sakhanenko (1984).

One should take into account that, using the Holder inequality, we can trans- fer the factors under exponential sign in the right-hand side of (1.4) to the denominator of the fraction in the left-hand side of (1.4). Naturally, this makes the result weaker. Einmahl (1989) considered multidimensional vec- tors with identical covariance operators, satisfying multidimensional analogs of Sakhanenko's conditions.

One can easily verify that if a vector

has nite exponential moments

E

e

hhi, for

h

2

V

, where

V

Rd is some neighborhood of zero, then

F

=L(

)2Ad(

c

(

F

)). Therefore, Theorem 1.3 can be considered as a gen- eralization and renement of the main result of KMT (1975{76). In partic- ular, from Theorem 1.3 one can easily derive the following result, obtained by KMT (1975{76) in the one-dimensional case.

Corollary 1.4. Suppose that a random vector

has nite exponential moments E

e

hhi, for

h

2

V

, where

V

Rd is some neighborhood of zero.

Then one can construct on a probability space a sequence of independent random vectors

X

1

X

2

:::

and a corresponding sequence of independent Gaussian random vectors

Y

1

Y

2

:::

so that L(

X

k) = L(

),

k

= 1

2

:::

,

and n

P

j=1

X

j; Pn

j=1

Y

j =

O

(log

n

) a.s. (1.5) An analog of this result was obtained by Einmahl (1989) under additional smoothness-type restrictions on the distributionsL(

). Einmahl (1989, The- orem 10), has also proved an analog of Theorem 1.3 but only for suciently smooth distributions L(

). As it is noted in KMT (1975{76), from the re- sults of Bartfai (1966) it follows that the accuracy of approximation in (1.5) is the best possible.

The following Theorem 1.5 (by means of its consequence, Theorem 9.1) will be used in the proof of Theorems 1.3 and 1.6. However, it is of inde- pendent interest because sometimes it can give a more exact dependence of constants on the characteristics of distributions of summands.

(4)

Theorem 1.5. Suppose that

1,

1,

>

0 and

e1

:::

en are random vectors with distributionsL(

ek)2Ad(

), E

ek= 0, cov

ek=

I

,

k

= 1

:::n

. Assume that there exist random vectorse

k which have the same moments of the rst three orders as the vectors

ek,L(

ek)2Ad(

) and P

ek

= 1,

k

= 1

:::n

. Then one can construct on a probability space a sequence of independent random vectors

X

e1

::: X

en and a corresponding sequence of independent Gaussian random vectors

Y

1

:::Y

n so that L(

ek) = L(

X

ek),

k

= 1

:::n

, and

E exp

c

3( )(

XY

e )

d

3

2

exp;

c

4( )(

d

3=2

)3=2+

L

(

n

)

where

c

3( ),

c

4( ) are positive quantities depending only on .

The following Theorem 1.6 shows that if all moments of the third order of the vectors

1

:::

nin Theorem 1.3 are equal to zero, then the dependence of constants on the dimension can be slightly sharpened.

Theorem 1.6. Suppose that

1,

>

0 and

1

:::

n are random vectors with distributionsL(

k)2Ad(

), E

k= 0, cov

k=

I

,

k

= 1

:::n

. Assume that for all

uvw

2Rd we have

E

k

u

k

v

k

w

= 0

k

= 1

:::n:

(1.6) Then one can construct on a probability space a sequence of independent random vectors

X

1

:::X

n and a corresponding sequence of independent Gaussian random vectors

Y

1

:::Y

n so that L(

X

k) = L(

k),

k

= 1

:::n

, and

E exp

c

5( )(

XY

)

d

3

exp;

c

6( )

d

9=4+

L

;

n=

2

(1.7) where

c

5( ),

c

6( ) are positive quantities depending only on .

Remark 1.7. It is evident that the condition (1.6) is automatically fullled if the vectors

1

:::

n have symmetric distributions.

In Theorems 1.3, 1.5 and 1.6 the random vectors

1

:::

nand

e1

:::

en

are, generally speaking, non-identically distributed. However, they have the same covariance operator

I

. Thus, the problem of obtaining an adequate multidimensional generalization of the main result of Sakhanenko (1984) remains open. By the author's opinion, the proof of Theorem 1.3 can be transformed in the proof of an analogous result for vectors with non-identical covariance operators. The conditions cov

j =

I

,

1 can be apparently changed by

B

j2

2,

B

2j

j2, where

B

j2 j2 are respectively the max- imal and the minimal eigenvalues of cov

j,

j

= 1

:::n

. The constants will depend, naturally, on

, and

n

should be changed in the right-hand side of (1.4) by the maximal eigenvalue of the covariance operator of the last sum

1 ++

n. The author hopes to consider this situation in a separate paper. Moreover, it is well known that the results similar to The- orems 1.3, 1.5 and 1.6 imply many useful consequences. For example, one can derive estimates in the invariance principle in the case when the sum- mands

X

j satisfy less restrictive moment conditions (see, e.g., Shao (1995) and the bibliography in Shao (1995)). The author is going to devote to the consequences of Theorems 1.3, 1.5 and 1.6 also a separate publication.

(5)

In Theorems 1.3, 1.5 and 1.6 we consider the case

1. Obviously, the assertions of these theorems become stronger for small

. An analog of The- orem 1.3 in a particular case when the summands have smooth distributions and

can be arbitrarily small is obtained in Gotze and Zaitsev (1997). Note that for small

the distributions of summands are close to Gaussian ones (see Zaitsev (1986)).

The proof of Theorems 1.3, 1.5 and 1.6 consists of several steps, many of which are close to corresponding steps from the proofs of the main results of KMT (1975{76), Sakhanenko (1984) and Einmahl (1989). In Section 2 we introduce the necessary notation and formulate auxiliary results. In par- ticular (see Denition 2.2), we dene the classesRd;

"

Ad(

)Ad(

) of pairs of distributions with close cumulants of the second and third orders and discuss the properties of classesAd(

) andRd;

0 . We dene as well the classes Ad(

)Ad(

) of distributions satisfying some smoothness conditions which guarantee the existence of smooth densities (see Deni- tion 2.12).

The main tool for the proof of Theorems 1.3, 1.5 and 1.6 is the esti- mates for the closeness of quantiles of one-dimensional conditional distri- butions contained in Lemmas 3.1 and 4.1. For the proof of Lemmas 3.1 and 4.1 we use Lemmas 2.14 and 2.15 which are completely proved in Za- itsev (1996b), Lemmas 6.1 and 7.1. In Zaitsev (1996b) one can also nd a sketch of the proof of Lemmas 3.1 and 4.1. We give the complete proof of these lemmas in Sections 3 and 4. In Lemma 2.14 we give a non-uniform estimate for large deviations in the local CLT for conditional distributions of the last coordinates of vectors with smooth distributions fromAd(

) under condition that the rst

d

;1 coordinates are xed (see (2.17)). Analogously, Lemma 2.15 contains a non-uniform estimate for the closeness of conditional densities of smooth distributions from Rd;

0 (see (2.23)). Moreover, Lemma 2.15 states that if

w >

0 is separated from zero and from inn- ity, then the derivative of the logarithm of conditional density at point

w

is negative. An estimate of this derivative is also presented (see (2.24)). This estimate is essentially used in the proof of Lemma 4.1.

In Lemma 3.1 we give the estimates for the closeness of quantiles in the CLT for conditional distributions of the last coordinates of vectors with smooth distributions from Ad(

) under condition that the rst

d

;1 co- ordinates are xed (see (3.1)). Analogously, in Lemma 4.1 we provide an estimate for the closeness of quantiles of conditional distribution functions of smooth distributions fromRd;

0 (see (4.1)). The additional information about the closeness of cumulants leads to the better orders of estimates in Lemma 4.1 in comparison with Lemma 3.1.

In Section 5 we describe the KMT (1975{76) scheme of dyadic approxi- mation. This scheme was essentially used by Sakhanenko (1984) and Ein- mahl (1989). Note, however, that the formulations and the proofs of KMT (1975{76, inequality (2.5)), Sakhanenko (1984, Lemma 1, p. 34) and Ein- mahl (1989, inequality (4.11)) are close but slightly dierent. Our approach practically coincides with Einmahl's one. As in KMT (1975{76), Sakha- nenko (1984) and Einmahl (1989), we suppose that some independent ran- dom vectors with smooth distributions are already constructed and dene in-

(6)

dependent random vectors with given distributions by successive construct- ing sums of blocks of summands, containing 2N

2N;1

2N;2

:::

4

2

1 sum- mands. For this we apply the so-called Rosenblatt (1952) quantile transfor- mation which was already used by Einmahl (1989). This transformation is a natural generalization of the usual one-dimensional quantile transformation which have been applied by KMT (1975{76) and Sakhanenko (1984).

The scheme of the proof of Theorem 1.3 is very close to that of the main result of Sakhanenko (1984). At rst we suppose that the Gaussian vectors

Y

1

:::Y

n,

n

= 2N, are already constructed and construct the independent vectors which are bounded with probability one, have suciently smooth distributions and the same moments of the rst, second and third orders as the needed independent random vectors

X

1

:::X

n. Then we construct the vectors

X

1

:::X

n in several steps. After each step the number of

X

j

which are not constructed becomes smaller in 2ptimes, where

p

is a suitably chosen positive integer. In each step we begin with already constructed vectors which are bounded with probability one and have suciently smooth distributions and the needed moments up to the third order. Then we construct the vectors such that, in each block of 2p summands, only the rst vector has the initial bounded smooth distribution. The rest 2p;1 vectors have the needed distributionsL(

j). These 2p;1 vectors from each block will be chosen as

X

j and will be not involved in the next steps of the procedure. The coincidence of third moments will allow us to use more precise estimates of the closeness of quantiles of conditional distributions obtained in Section 4.

In Section 6 we realize the rst step of the procedure just described.

We estimate the rate of approximation in the case when we construct on a probability space the independent Gaussian vectors

Y

j and the independent vectors

X

j with smooth distributions and the needed moments up to the third order. The main result of Section 6 is formulated in Theorem 6.4.

Sections 7 and 8 are devoted to the estimation of the rate of approximation in the case when we begin with the vectors

Y

e1

::: Y

en,

n

= 2N, having smooth distribution with bounded supports and with the same moments up to the third order as the needed vectors

X

e1

::: X

en and construct the vectors

X

e1

::: X

en in several steps, diminishing after each step the number of

X

ej which are not constructed in 2p times. The scheme of the proof is a modied version of that from Sakhanenko (1984). We use the induction with respect to the number of steps of the procedure and prove that this procedure provides the good estimate for

j(

N

) = kPj

=1

Y

ek; Pj k=1

X

ek (1.8)

(see (8.28), (8.94), (8.97)). It is very important that in (1.8) there is no supremum over

j

= 1

:::

2N. This supremum is taken in the last moment only (see (8.98)). At the end of Section 8 we give the proof of Theorem 1.5.

Theorems 1.3 and 1.6 are proved in Section 9. At rst we prove an auxiliary result (Theorem 9.1) which is in fact a simple consequence of The- orem 1.5. However, its formulation is more convenient than that of The- orem 1.5 when we investigate the character of dependence of constants of

(7)

the parameters. This is taken into account in the proofs of Theorems 1.3 and 1.6. For the proof of Theorem 1.3 we need also Lemma 9.2 which shows that there exist absolute positive constants

c

7

c

8 such that for any distribution

F

2Ad(1) with mean zero and cov

F

=

I

there exist a distri- bution

G

2Ad(

c

7) with the same moments up to the third order and such that

G

x

:j

x

j

c

8

L

(

d

)= 1. It can be veried that this statement is valid without

L

(

d

) if all the moments of the third order of the distribu- tion

F

are equal to zero. Therefore, the factor

L

(

d

) from (1.4) is absent in the denominator of the fraction in the left-hand side of the inequality (1.7) of Theorem 1.6.

2. Notation and auxiliary results

Notation 2.1. Let Fd be the set of all

d

-dimensional probability distri- butions dened on the

-eld of Borel subsets of the Euclidean space Rd. Let Gd be the collection of all Gaussian distributions from Fd. Below

symbolizes dierent quantities not exceeding one in absolute value,

E

a is the distribution concentrated at a point

a

. By

F

b(

t

),

t

2Rd, we denote the characteristic function of a distribution

F

2Fd. The product of measures is understood as their convolution:

F G

=

F G

. By N(0

I

)2Gd we denote the Gaussian distribution with mean zero and covariance operator

I

. The notation (

F

) will be used for the Gaussian distribution whose mean and covariance operator are the same as those of a distribution

F

2Fd. In one- dimensional case we denote by () the distribution function of the normal law with mean zero and variance

2 and by

'

() the corresponding density.

We shall identify the covariance operators with corresponding covariance matrices.

Let

0,

F

= L(

) 2 Ad(

), k

h

k

<

1,

h

2 Rd. Then the conjugate distribution

F

=

F

(

h

) is dened by

F

f

dx

g=;E

e

hhi ;1

e

hhxi

F

f

dx

g (2.1) (sometimes it is called the Cramer transform). Denote by

(

h

) a random vector withL;

(

h

) =

F

(

h

). It is clear that

F

(0) =

F

and

if

U

1

:::U

n2Ad(

)

k

h

k

<

1

U

=jQn

=1

U

j

then

U

(

h

) =jQn

=1

U

j(

h

)

:

(2.2) Definition 2.2. For

"

0 denote by Rd;

"

the collection of pairs of

d

-dimensional probability distributions (

F

1

F

2) such that

F

k 2 Ad(

),

k

= 1

2, and

d

u

d

2v

g

1(0);

d

u

d

v2

g

2(0)

"

k

u

k

Dvv

d

2v

g

1(0);

d

v2

g

2(0)

"

2

Dvv

(2.3) for all

uv

2Rd, where

D

=

D

1+

D

2,

g

k(

z

) =

'

(

F

k

z

),

D

k= cov

F

k. Remark 2.3. If

"

= 0, then the relation (

F

1

F

2)2Rd;

"

means that the distributions

F

1

F

2 2 Ad(

) have identical cumulants of second and third orders.

(8)

ClassesAd(

) andRd;

"

have properties which are very convenient for applying. Some of them were considered in Zaitsev (1986, 1988, 1996a).

It is evident that if

1

<

2, thenAd(

1)Ad(

2). Moreover, it is easy to see that, for xed

, the classAd(

) is closed with respect to convolution: if

F

1

F

2

:::F

n2Ad(

), then

F

1

F

2

F

n 2Ad(

). The classAd(0) coincides with the classGdof all

d

-dimensional Gaussian distributions. The following inequality was proved in Zaitsev (1986) and can be considered as an estimate of stability of this characterization: if

F

2Ad(

),

>

0, then

;

F

(

F

)

cd

2

L

(

;1), where

(

) is the Prokhorov distance. Other useful properties of classes Ad(

) and Rd;

0 are contained in the following Lemmas 2.4, 2.5, 2.7, 2.8 and 2.11.

Lemma 2.4. Suppose that

0,

F

2Ad(

),

h

2Rd, k

h

k

<

1,

F

=L(

),

D

= cov

,

D

(

h

) = cov

(

h

). E

= 0. Then

a) for all

u

2Rdthe following relations are valid:

D

(

h

)

uu

=

Duu

;1 +

k

h

k

(2.4) log E

e

hhi= 12

Dhh

1 + 13

k

h

k

(2.5) logE

e

ihhi=; 1

2

Dhh

1 + 13

k

h

k

(2.6) b) if k

h

k

12, then

F

(

h

)2Ad(2

).

Lemma 2.4 is contained in Zaitsev (1986, Lemma 2.1).

Lemma 2.5. Suppose that

0,

F

=L(

)2Ad(

),

y

2Rm, 2R1, and

A

:Cd !Cm is a linear operator such that

A

(Rd)Rm. Let

e2Rkbe the vector consisting of a subset of coordinates of the vector

. Then

L(

A

+

y

)2Am;k

A

k

where k

A

k= supx

2Rdkxk 1k

Ax

k

L( )2Ad(j j

)

L(

e)2Ak(

)

:

Proof. Put

(

z

) = log E

e

hzA+yi = log E

e

hA zi+

zy

(2.7) where

A

:Cm !Cd is the adjoint operator for

A

,

z

2Cm. Let

D

= cov

,

M

= cov

A

. By virtue of (1.2), (1.3), (2.7), for all

uv

2 Rm,

z

2 Cm,

k

A

z

k

<

1, the following relations are valid:

d

u

d

v2

(

z

) =

d

A u

d

A v2

'

(

Fw

) w=A z

k

A

u

k

DA

vA

v

k

A

kk

u

k

E

; E

A

v

2

=k

A

kk

u

k

E

A

; E

Av

2=k

u

kk

A

k

M v v

:

(2.8) If k

z

k k

A

k

<

1 then k

A

z

k

k

A

kk

z

k

= k

z

k k

A

k

<

1 and the inequality (2.8) is also valid. From (2.7), (2.8) and from denition of classes Ad(

) it follows that L(

A

+

y

) 2 Am;k

A

k

. The relations

(9)

L( ) 2 Ad(j j

), L(

e)2Ak(

) follow directly from the statement just

proved. tu

Corollary 2.6. Let the conditions of Lemma 2

:

5 be satised, 2 R1, (

F

1

F

2) 2Rd;

0 ,

F

j = L(

j),

y

j 2Rm,

j

= 1

2, and let

e1

e2 2 Rk be the vectors consisting of identical subsets of coordinates of the vectors

1

2 respectively. Then

;

L(

A

1+

y

1)

L(

A

2+

y

2) 2Rm;k

A

k

0

;

L( 1)

L( 2) 2Rd;j j

0

;L(

e1)

L(

e2) 2Rk(

0)

:

Proof. It is sucient to use Lemma 2.5, the denition of classesRd;

"

and the corresponding analogs of the relations (2.7), (2.8), taking into account

Remark 2.3. tu

Lemma 2.7. Suppose that

0,

F

k =L;

(k) 2 Adk(

), and the vectors

(k),

k

= 1

2, are independent. Let

2Rd1+d2 be the vector with the rst

d

1 coordinates coinciding with those of

(1) and with the last

d

2 coordinates coinciding with those of

(2). Then

F

=L(

)2Ad1+d2(

).

Proof. Let

D

= cov

,

D

(k) = cov

(k),

k

= 1

2. Using the natural analogous notation, we see that for all

u

,

v

2Rd1+d2 and

z

2Cd1+d2,k

z

k

<

1,

d

u

d

2v

'

(

Fz

) =

d

u

d

v2;

'

(

F

1

z

(1)) +

'

(

F

2

z

(2))

=

d

u(1)

d

v2(1)

'

(

F

1

z

(1)) +

d

u(2)

d

v2(2)

'

(

F

2

z

(2))

k

u

(1)k

D

(1)

v

(1)

v

(1)+k

u

(2)k

D

(2)

v

(2)

v

(2)

k

u

k

Dvv

:

It remains to use the denition of classesAd(

). tu Lemma 2.8. Suppose that

0, ;L(

1(k))

L(

2(k)) 2 Rdk(

0),

k

= 1

2, and the pairs of vectors

(1)j

j(2),

j

= 1

2, are the pairs of independent vectors. Let

j 2Rd1+d2,

j

= 1

2, be the vectors with the rst

d

1 coordinates coinciding with those of

j(1) and with the last

d

2 coordinates coinciding with those of

j(2). Then ;L(

1)

L(

2) 2Rd1+d2(

0).

Proof. It suces to apply Lemma 2.7, taking into account Remark 2.3. tu Remark 2.9. Naturally, Lemmas 2.7 and 2.8 can be evidently extended on the case of any nite number of independent vectors.

Remark 2.10. Lemma 2.7 can be easily derived from Lemma 2.5, using the completeness of classesAd(

) with respect to convolution.

Lemma 2.11. Let

0 and

X

1

:::X

n be independent random vectors with L(

X

i) 2Ad(

), E

X

i = 0,

S

i =

X

1 ++

X

i, for

i

= 1

:::n

. Let

ht

2 R1, 0

h <

1, 0

t <

12,

= max

1 i n

S

i . Denote by

B

2 the maximal eigenvalue of cov

S

n. Then

E

e

hjSnj2

de

h2B2

E

e

t 3

de

4t2B2 and

P

x

2

d

maxexp;;

x

24

B

2

exp;;

x

4

x

0

:

(2.9)

(10)

Proof. By Lemma 2.5, the distributions of the coordinates of the vectors

X

i

belong to the classA1(

) and it suces to prove the statement of the lemma for

d

= 1 (recall thatjjmeans the maximum norm). Using the completeness of the classesA1(

) with respect to convolution, we see that by Lemma 2.4, inequality (2.5),

E

e

hjSnj E

e

hSn + E

e

;hSn 2exp 1

2

h

2

B

21 + 13

h

2

e

h2B2

:

(2.10) Note that the random sequence exp;

h

j

S

ij (indexed by

i

) is a positive submartingale. Therefore, by Doob's inequality (see Doob (1953), p. 314) and by (2.10),

P

x

= P

e

h

e

hx E exp;

h

j

S

nj;

hx

2exp;

h

2

B

2 ;

hx :

(2.11) We then deduce (2.9) from (2.11) via the classical Bernstein{Cramer{Cher- no calculation. Integrating by parts and applying (2.11) with

h

= 2

t

, we get

E

e

t= 1 +

Z

1

0

te

txP

x

dx

3

e

4t2B2

:

t u

Definition 2.12. Let

0,

>

0,

>

0. Denote by Ad(

) the class of distributions

F

2 Ad(

) such that for

h

2 Rd, k

h

k

<

1, and for all

v

2Rdthe following inequalities hold:

(2

);d

Z

ktkd1

F

bh(

t

)

dt

2

d

2

2(2

)d=2(det

D

)1=2

(2.12) (2

);d

Z

ktkd1

tv

F

bh(

t

)

dt

D

;1

vv

1=2

(2

)d=2(det

D

)1=2

(2.13) where

F

h=

F

(

h

),

D

= cov

F

and

2

>

0 is the minimal eigenvalue of

D

.

According to (2.1), we have (if

F

=L(

))

F

bh(

t

) = E

e

hit(h) i=;E

e

hhi ;1E

e

hh+iti

:

(2.14) Therefore, the conditions (2.12), (2.13) play in this paper the role which is similar to that of Sakhanenko (1984, inequality (49), p. 9) or Einmahl (1989, inequality (1.5)). Note, however, that the condition (2.13) is needed for estimating the derivatives of densities (see (2.24)). Such estimates are necessary for obtaining more precise bounds for the closeness of conditional distributions when the compared distribution have the coinciding third mo- ments.

Notation 2.13. Below for

x

= (

x

1

:::x

d) 2 Rd,

d

2, we shall denote by

x

0the vector

x

0= (

x

1

:::x

d;1) consisting of the rst

d

;1 coordinates of

x

. Analogously, we shall use a prime to denote the matrix

D

0 of size

(11)

(

d

;1)(

d

;1) consisting of the rst

d

;1 rows and rst

d

;1 columns of a matrix

D

of size

d

d

. If

F

=L(

) and

2Rdwe shall write

F

0=L(

0).

The following Lemmas 2.14 and 2.15 are completely proved in Zaitsev (1996b, Lemmas 6.1 and 7.1).

Lemma 2.14. Suppose that

0,

F

=L(

)2Ad(

4

4) and, if

d

2

F

0=L(

0)2Ad;1(

4

4)

:

(2.15) Assume that E

= 0 and the last coordinate

d of the vector

is not corre- lated with the previous ones. Let

2 =

De

d

e

d= E

d2

>

0 be the minimal eigenvalue of

D

= cov

F

. Denote by

p

(

x

),

x

2Rd, the probability density of distribution

F

. Let

p

0(

x

0),

x

02Rd;1,

d

2, be the density of the vector

0. Denote by

p

(d)(

x

dj

x

0) =

p

(

x

)

p

0(

x

0),

x

d2R1, the conditional density of the

d

th coordinate

d of the vector

2Rdfor a xed value of

0 =

x

0 (for

d

= 1 we dene

p

(1)(

x

1j

x

0) =

p

(

x

)). Denote by

G

the corresponding (depending on

x

0) one-dimensional conditional distribution.

Then there exist absolute positive constants

c

1

:::c

6 such that the fol- lowing statements are true for

d

3=2

c

1, ;

D

0 ;1=2

x

0

c

2

d

3=2

.

For

d

2 there exists a parameter

h

0 =

h

0(

x

0) 2 Rd;1 which gives the solution of the equation E

0(

h

0) =

x

0. Dene the parameter

h

2Rd which has the rst

d

;1 coordinates coinciding with corresponding coordinates of

h

0 and the last

d

th coordinate

h

d = 0. If

d

= 1, we take

h

= 0. Denote

y

0 =

E

(

h

)

e

d. Then

j

y

0j2

:

88

;

D

0 ;1=2

x

02 2

:

88

k

x

0k2

2

(2.16)

and, ifj

w

j c3d2 , then for the density

v

(

w

) of the distribution

GE

;y0 the following representation is valid:

v

(

w

) =

'

(

w

)exp

c

4

d

3=2+

d

;

D

0 ;1=2

x

01 +

w

2

2

+ j

w

j3

3

(2.17)

and

2p12

exp

;

w

2

2

v

(

w

) 2

p2

exp

;

w

2 4

2

:

(2.18)

Moreover, for all

w

2R1,

v

(

w

)

c

5

exp

;minn

w

2

4

2

c

6j

w

j

d

o

:

(2.19)

Note that for

d

= 1 we use in the formulations the natural agreement

;

D

0 ;1=2

x

0 =;

D

0 ;1=2

x

0=j

x

0j=k

x

0k= 0.

Lemma 2.15. Suppose that

0, (

F

1

F

2)2Rd;

0 . Let, for

k

= 1

2,

F

k =L(

k)2Ad(

4

4) and, if

d

2

F

k0 =L(

k0)2Ad;1(

4

4)

:

(2.20)

Références

Documents relatifs

Yor (1997) On the distribution and asymptotic results for exponential functionals of Levy processes, In ”Exponential functionals and principal values related to Brownian

[2] beschrieben eine Besserung der Symptome bei dem spontanen OHSS nach 10 Tagen, in anderen Ka- suistiken wurde ein Zeitraum von we- nigen Wochen genannt.. In dem hier

Keywords: Bernstein’s inequality, sharp large deviations, Cram´ er large deviations, expansion of Bahadur-Rao, sums of independent random variables, Bennett’s inequality,

As we shall see our conditions are well adapted to a large variety of models, including left random walks on GL d ( R ), contracting iterated random functions, autoregressive

Interestingly, Factor 1 (craving related to the relief of withdrawal and/or negative affect), which is closely related to dependence (as measured by the FTND), was

Nous souscrivons donc aux recommandations européennes les plus récentes concernant le traitement anticoagulant et antiagrégant des patients atteints de fibrillation auriculaire (et

We show that these vectorial exponential functions can achieve high order convergence with a significant gain in term of the number of basis functions compare to more

On the two dual graphs above, the full points represent components of odd multiplicity whereas empty points represent components of even multiplicity of the total transform of