ABOUT THE LINDEBERG METHOD
FOR STRONGLY MIXING SEQUENCES
EMMANUEL RIO
Abstract. We extend the Lindeberg method for the central limit theorem to strongly mixing sequences. Here we obtain a generalization of the central limit theorem of Doukhan, Massart and Rio to nonsta- tionary strongly mixing triangular arrays. The method also provides estimates of the Levy distance between the distribution of the normal- ized sum and the standard normal.
Introduction
In this paper, we are interested in the central limit theorem for strongly mixing and possibly nonstationary sequences of real-valued random vari- ables. First let us recall some recent results for strongly mixing sequences, improving on the classical results of Ibragimov (1962). In order to state these results, we need some more notation.
Definition 1. Let (
X
i)i2ZZ be a sequence of real-valued random variables with mean zero. For any nonincreasing cadlag functionH
: IR+ !IR+, letH
;1denote the cadlag inverse function ofH
, which is dened byH
;1(u
) = supft
2IR+:H
(t
)> u
gwith the convention that sup= 0. For any real-valued random variable
X
with distribution functionF
, we denote byQ
X orQ
F the inverse function oft
!IP(jX
j> t
). We setQ
i=Q
Xi.If (
u
n)n0 is a nonincreasing sequence of nonnegative real numbers, we denote byu
(:
) the cadlag rate function which is dened byu
(t
) =u
t]. Throughout the sequel,u
;1denotes the inverse function of this rate functionu
(:
).It comes from Doukhan, Massart and Rio (1994) that the central limit theorem for strictly stationary sequences with strong mixing coecients (
n)n0 holds under the integral conditionZ
1
0
;1(x
)Q
20(x
)dx <
1:
(I:
1)ESAIM: Probability and Statistics is an electronic journal with URL address
http://www.emath.fr/ps.
Received by the journal 1 February 1995. Revised 16 October. Accepted for publi- cation 11 December 1995
When
X
0 satises the moment assumption IE(jX
0jr)<
1, condition (I.1) holds as soon as Xn>0
n
2=(r;2)n<
1:
(I:
2) (I.2) improves on Ibragimov's condition Pn>0r=n (r;2)<
1. Moreover, from a paper of Bolthausen (1980) dealing with rates of convergence in the central limit theorem for Markov chains on a countable space, we believe that (I.2) cannot be improved for strongly mixing Markov chains.The aim of this paper is, rst to extend the central limit theorem for strongly mixing sequences to a central limit theorem for triangular arrays, and second to obtain rates of convergence in the central limit theorem. We refer the reader to Bergstrom (1972), Krieger (1984) and Samur (1984) for the central limit theorem for
-mixing triangular arrays with stationary rows and to Tikhomirov (1980) and Gotze and Hipp (1983) for rates of con- vergence and asymptotic expansions in the central limit theorem for mixing sequences. Let us also mention the recent works of Peligrad and Utev (1994) and Peligrad (1995), which improve the previous results for triangular ar- rays.The proofs of central limit theorems for mixing sequences often are based either on Gordin's theorem (1969) see Hall and Heyde (1980)] or on the Bernstein's method see Ibragimov and Linnik (1971)]. Unfortunately the extension of these techniques to nonstationary sequences is quite deli- cate. So, in order to obtain central limit theorems for triangular arrays, we will adapt the Lindeberg method see Lindeberg 1922] to strongly mixing sequences. Up to our knowledge, the Lindeberg method was rst used in the setting of strongly mixing processes by Doukhan and Portal (1983a) see also Doukhan and Portal (1987)]. They studied the rates of convergence in the multidimensional central limit theorem and extended some estimates of Yurinskii (1977) to mixing sequences. Next Doukhan, Leon and Portal (1984) and (1985)] obtained some related results for Hilbert space valued stationary mixing random variables. This method has two main advantages:
it leads to optimal conditions concerning the tail ditributions of the random variables and it gives precise estimates of the Levy distance between the distribution of the normalized sum and the standard normal distribution for stationary and strongly mixing sequences.
Let us now recall the Lindeberg central limit theorem for independent summands. Let (
X
in)i21n] be a triangular array of independent square- integrable random variables with mean zero, normalized in such a way that Var(X
1n++X
nn) = 1. LetS
nn=X
1n++X
nn. ThenS
nn converges in distribution to a standard normal distribution if, for any positive"
,n
X
i=1IE(
X
in2 1IjXinj>") !0 asn
!1 which is equivalent ton
X
i=1
Z
1
0
Q
2Xin(x
)(Q
Xin(x
)^1)dx
!0 asn
!1:
(I:
3)In a recent note concerning moment inequalities for stationary strongly mix- ing sequences see Rio (1994)], we prove that, for any
p
2, the moment conditionM
p(Q
0) =Z
1=2
0
(
;1(x=
2)Q
0(x
))pdx
;1(x=
2)<
1is sucient to imply a Rosenthal type inequality of order
p
see Rosen- thal (1970) for these inequalities in the independent setting]. Hence we obtain a generalization of the classical moment inequalities by replacingQ
0 by ;1(x=
2)Q
0(x
) and the Lebesgue measure by the weighted measuredx=
;1(x=
2). Exactly in the same way, we will obtain a generalization of the Lindeberg condition to strongly mixing sequences by replacingQ
Xin by ;1(x=
2)Q
Xinanddx
bydx=
;1(x=
2) in (I.3). Since the Lindeberg method provides estimates of the error between the characteristic function of the nor- malized sum and the characteristic function of the standard normal, we also obtain upper bounds on the Levy distance between the distributions func- tions in the stationary case, via Esseen's inequality (1945). In particular, ifM
2+(Q
0) is nite for some (p5;1)=
2, we obain an upper bound of the order ofn
;=2.1. The main results
Definition 2. For any two
-algebrasA andB in (TIP), let (AB) = sup(AB)2ABjIP(
A
\B
);IP(A
)IP(B
)j (1:
1) denote the strong mixing coecient introduced by Rosenblatt (1956). The strong mixing coecients (n)n>0 of the sequence (X
i)i2ZZ are dened by n = supk2ZZ
(Fk;nGk) (1:
2) whereFl = (X
i:i
l
) and Gl =(X
i :i
l
). We make the convention that0 = 1=
4.Throughout the section,
Q
is any nonincreasing function from 01] into IR+ such thatQ
supi>0Q
i.Let us recall some basic covariance inequalities for strongly mixing se- quences, improving on the covariance inequalities of Davydov (1968). By Theorems 1.1 and 1.2 in Rio (1993), the following upper bounds on the variance of the partial sums of strongly mixing sequences hold.
Proposition 1. Let (
X
i)i2ZZ be a sequence of real-valued random vari- ables with nite variance and mean zero. Let the strong mixing coecients (n)n0 be dened by0 = 1=
4 and n = supk2ZZ
(Fk;n(X
k))for any positive
n
. Then, for any integerss
andt
such thats < t
,jCov(
X
sX
t)j2Z
2t;s
0
Q
t(x
)Q
s(x
)dx:
(a
) LetS
n=X
1++X
n. ThenVar
S
n X(st)2]0n]2
jCov(
X
sX
t)j4Xni=1
Z
1=2
0
;1(x=
2)^n
]Q
2i(x
)dx
(b
) where;1 denotes the inverse of the mixing rate function associated with the strong mixing coecients (n)n0.Suppose that
M
2(Q
) =Z
1=2
0
;1(x=
2)Q
2(x
)dx <
+1:
(1:
3)Then
n
;1VarS
n 4M
2(Q
):
(1:
4)Hence, if (
X
i)i2ZZ is strictly stationary, the series Pt2ZZCov(X
0X
t) is ab- solutely convergent to some nonnegative number2,n!+1lim
n
;1VarS
n=2 and 2 Xt2ZZ
jCov(
X
0X
t)j4M
2(Q
):
(c
) The main results are the following estimates of the nearness of the charac- teristic function of the normalized sum and of the characteristic function of the standard normal.Theorem 1. Let (
X
i)i2ZZ be a strongly mixing sequence of real-valued random variables with nite variance and mean zero. Suppose thatX
i= 0 a.s. for anyi =
21n
]. Let'
k(t
) = IE(exp(itS
k)),V
0 = 0V
k = VarS
k andV
n= supk21n]
V
k:
(1:
5) (i) For any nonnegative quantile functionQ
and any positivet
, letM
3(Qt
) =Z
1
0
;1(x=
2)Q
2(x
)(t
;1(x=
2)Q
(x
)^1)dx:
Then, for any real
t
,jexp(
V
nt
2=
2)'
n(t
);1j58t
2exp(V
nt
2=
2)Xnk=1
M
3(Q
kjt
j):
(ii) Suppose furthermore that (X
i)i2ZZ fullls condition (1.3). Then, forany real
t
,jexp(
V
nt
2=
2)'
n(t
);1j8(p2 + 1)t
2M
3(Q
jt
j)Xnk=0exp(
V
kt
2=
2):
Remark 1. It comes from Proposition 2 in section 2 that Theorem 1 and Corollary 1 may be obtained under the following weaker denition of the strong mixing coecients:
n= supk2ZZ
sup
(p1p2)2IN2
(Fk;n(X
k+p1X
k+p2)):
(1:
6) Consequently some upper bounds on these mixing coecients would be of interest.(i) is a result generalizing Lindeberg's one see Lindeberg (1922)] to strongly mixing sequences. So, our main application of (i) is the following central limit theorem for strongly mixing triangular arrays.
Corollary 1. Let (
X
in)n>0i21n]be a double array of real-valued random variables with nite variance and mean zero. Let (kn)k0 be the sequence of strong mixing coecients of the sequence (X
in)i21n] and ;1(n) be the inverse function of the the associated mixing rate function. We setS
in=X
1n++X
in andV
in= VarS
in:
Suppose furthermore thatlimsupn
!1
imax21n](
V
in=V
nn)<
1:
(a
) LetQ
in=Q
Xin. ThenS
nn converges to the standard normal distribution ifV
nn;3=2Xni=1
Z
1
0
;1(n)(x=
2)Q
2in(x
)inf(;1(n)(x=
2)Q
in(x
)pV
nn)dx
;!0 (b
) asn
tends to1.Remark 2. Note that
kn = 0 for amyk
n
, which implies that ;1(n)(x=
2)n
for any positivex
.Application 1. Let (
i)i2ZZ be a strictly stationary and strongly mixing sequence of real-valued random variables with mean zero, satisfying the conditionM
2(Q
0)<
1. Let (a
in)i21n] be a triangular array of real numbers such thatn
X
i=1
a
2in = 1 and limn!1
imax21n]j
a
inj= 0:
We setX
in=a
ini. Then, by (b) of Proposition 1,Var
S
kn4Xni=1
a
2inM
2(Q
i)4M
2(Q
0):
This inequality ensures that Corollary 1(a) is equivalent to liminfn
V
nn>
0.Consequently, if (a) holds, condition (b) of Corollary 1 is ensured by the mixing condition
n
X
k=1
a
2knZ 10
;1(x=
2)Q
20(x
)(ja
knj;1(x=
2)Q
0(x
)^1)dx
;!0 asn
! 1, where ;1 stands for the inverse function of the strong mixing rate function of (i)i2ZZ. Since maxi21n]ja
inj tends to zero asn
! 1, Lebesgue dominated convergence theorem lets us prove that the above ex- pression converges to zero. Hence the CLT forS
nn holds, which generalizes (c) of Theorem 2.2 of Peligrad and Utev (1994).Let us now give some applications of Theorem 1(ii) to Berry-Esseen type estimates. Let the class of two times dierentiable Orlicz functions be dened by :
=
: IR+!IR+ convex, (0) =0(0) = 0 00 nondecreasing, concave:
(1:
7a
) For anyin , we dene the weighted momentsM
(Q
) byM
(Q
) =Z
1=2
0
(;1(x=
2)Q
(x
)) ;1(x=
2)dx
(1:
7b
) If(x
) =x
r, we setM
(Q
) =M
r(Q
).When (
X
i)i2ZZ is a strictly stationary sequence verifying (1.3) and the additional condition 2 =Xt2ZZCov(
X
0X
t)6= 0 (1:
8) the central limit theorem holds see Doukhan, Massart and Rio (1994)].More precisely
S
n=
pn
converges to a standard normal distribution. We then get the following estimates of the Levy distance for partial sums of a stationary sequence as a by-product of Theorem 1(ii).Theorem 2. Let (
X
i)i2ZZ be a strictly stationary sequence of real-valued random variables with mean zero and nite variance satisfying (1.3) and (1.8). Let the sequence of strong mixing coecients be dened by (1.2). Let denote the d.f. of a standard normal.(i) For any
in ]01 such that2+(
Q
) = supx2]01=4
x
;1(x
)(;1(x
)Q
(x
))2+<
1 (1:
9) we have :!n = supx
2IR
jIP(
S
nx
pn
);(x
)j=O
(n
;=2_n
;(1+)=(4+2)):
(ii) Suppose that
M
3(Q
)<
1. Thenxsup2IRjIP(
S
nx
pn
);(x
)j=O
(n
;1=3):
(iii) Suppose that
M
(Q
)<
1 for some functionin verifying (x
)x
2(logx
)p for some positivep
asx
!+1. Thenxsup2IRjIP(
S
nx
pn
);(x
)j=O
(logn
);p:
Remark 2. Note that the moment condition
M
2+(Q
)<
1 is stronger than (1.9). Moreover it comes from the lower bouds of Tikhomirov (1980) that (i) of Theorem 2 is nearly optimal when(;1 +p5)=
2.Application 2.
(a) Bounded random variables. Assume that k
X
0k1<
1. Then (1.9) holds if and only if n =O
(n
;1;). In that case, Theorem 2(i) yields!n=
O
(n
;=2) if (;1 +p5)=
2 and !n =O
(n
;(1+)=(4+2)) otherwise, which improves on Theorem 1 of Tikhomirov (1980). NowM
3(Q
) is nite if and only if Pn>0n
n<
1. Then, by (ii) of Theorem 2, !n =O
(n
;1=3).Under the weaker condition
M
(Q
)<
1 for some in such that (x
)x
2(logx
)p, Theorem 2(iii) yields !n =O
((logn
);p) under the con- dition Pn>0(logn
)pn<
1. Note that the loss between this condition and Ibragimov's condition for the central limit theoremPn>0n<
1 see Ibragimov and Linnik (1971) for the CLT] is logarithmic.(b) Conditions on the tail function. Suppose that, for some
>
2, IP(jX
0j> u
) =O
(u
;). Then(
Q
) = supu2]01]
u
1=Q
(u
)<
1and 2+(
Q
) is nite for<
;2 if n =O
(n
;(1+)=(;2;)), while the conditionM
3(Q
)<
1 needs>
3 and the summability conditionPn>0
n
1;3n =<
1.(c) Moment conditions. Suppose that, for some
r >
3, IE(jX
0jr)<
1. Then, by the Holder inequality applied on 01],M
3(Q
) is nite ifX
n>0
n
(r+3)=(r;3)n<
1:
Under this mixing condition, Theorem 2(ii) yields !n =
O
(n
;1=3). However, Bolthausen (1980 and 1982) obtains !n =O
(n
;1=2) for Harris recurrent Markov chains under the same mixing condition.(d) Exponential mixing rates. Assume that the mixing coecients sat- isfy
n=O
(a
n) for somea
in ]01. Then Theorem 2 in Tikhomirov (1980) yields!n =
O
(n
;=2(logn
)1+)under the moment assumption IE(j
X
0j2+)<
1, for any 2]01]. If<
1, one can use the Stein-Tikhomirov method to obtain !n =O
(n
;=2) under the moment conditionIE(j
X
0j2+(log+jX
0j)1+)<
1:
(1:
10) For geometric rates of mixing, (1.10) is equivalent toM
2+(Q
)<
1. Since this condition is stronger than (1.9), (i) of Theorem 2 slightly improves on the Stein-Tikhomirov method if (p5;1)=
2. However, Theorem 2 of Tikhomirov (1980) yields much more better rates if>
(p5;1)=
2.For geometric rates of mixing, the condition IE(
X
2(log+jX
j)p)<
1 is equivalent to the conditionM
(Q
)<
1 of (iii) with (x
)x
2(logx
)p;1. Thus (iii) of Theorem 2 yields !n =O
((logn
)1;p), which improves on Corol- laire 1 of Bulinskii and Doukhan (1990) in the special case of sequences.2. The Lindeberg method for strongly mixing sequences.
In this section, we generalize the Lindeberg method to strongly mixing sequences. The main step of this extension is Proposition 2 below. This proposition is applied to obtain estimates of the characteristic function of a sum of mixing random variables. In section 4, we give an application of this proposition to moment inequalities for sums of non identically distributed random variables.
Definition 3. LetF(
b
2b
3) be the class of real-valued two times continu- ously dierentiable functionsf
such that kf
(2)k1b
2 and kf
(2)kLb
3, wherekf
(i)k1= supx2IRjf
(i)(x
)jandk
f
(i)kL = sup(xyx)2I6=yR2
j
f
(x
);f
(y
)jj
x
;y
j:
Letv
k =V
k;V
k;1= IE(X
k2) + 2kX;1i=1IE(
X
kX
i):
(In the weak dependence setting,
v
k may fail to be nonnegative). We set"1k= supf
2F(b2b3)jIE(
f
(S
k;1+X
k);f
(S
k;1);v
k2
f
00(S
k;1))j:
The main step of the proof of Theorem 1 is the following upper bound for"1k.
Proposition 2. Let (
X
i)i2ZZbe a sequence of real-valued random variables with nite variance and mean zero. Suppose thatX
i= 0 a.s. for anyi
0.Let the sequence (
n)n0of strong mixing coecients of (X
i)i2ZZbe dened by (1.6). Letu
be any real in 01=
2] andp
=;1(u=
2). We setM
k(x
) =kX;1i=0
Q
k;i(x
)1Ix<2iand
M
k0(xu
)) =Xp;1i=0 p;1
X
l=0
Q
k;i(x
)Q
k;i;l(x
)1Ix<2(i^l):
Then"1k 4
b
2Z u0
M
k(x
)Q
k(x
)dx
+ 4b
3Z 10
M
k0(xu
)Q
k(x
_u
)dx
(a
) andnX
k=1"1k24Xn
k=1
Z
1
0
;1(x=
2)^n
]Q
2k(x
)inf(b
2b
3;1(x=
2)^n
]Q
k(x
))dx:
(
b
) Proof. Throughout the proof, we make the convention thatS
i = 0 for anyi
0. We setM
k(xu
) =pX;1i=0
Q
k;i(x
)1Ix<2i and #X
k= (X
k^Q
k(u
))_(;Q
k(u
)):
(2:
2) We start by the proof of (a). By the Taylor integral formula,f
(S
k);f
(S
k;1);f
0(S
k;1)X
k =X
kZ
1
0
(
f
0(S
k;1+vX
k);f
0(S
k;1))dv
=
X
kZ
1
0
(
f
0(S
k;1+vX
k);f
0(S
k;1+v X
#k))dv
+X
kX
#kZ
1
0 Z
1
0
vf
00(S
k;1+vv
0X
#k)dvdv
0:
(2:
3) The rst term on right hand is bounded up byb
2jX
k(X
k;X
#k)j=
2. Moreover
Z
1
0 Z
1
0
vf
00(S
k;1+vv
0X
#k)dvdv
0; 12f
00(S
k;1)b
3 6jX
#kj:
SinceIEj
X
k(X
k;X
#k)j=Z u0
Q
k(x
)(Q
k(x
);Q
k(u
))dx
(2:
4) it follows thatjIE(
f
(S
k);f
(S
k;1);f
0(S
k;1)X
k;12
f
00(S
k;1)X
kX
#k)jb
22
Z u
0
Q
k(x
)(Q
k(x
);Q
k(u
))dx
+b
3 3Z
1=2
0
Q
2k(x
)Q
k(x
_u
)dx:
(2:
5) Now we control the second order term. Let ;ki =f
00(S
k;i);f
00(S
k;i;1).Clearly
f
00(S
k;1)X
kX
#k=pX;1i=1;ki
X
kX
#k+f
00(S
k;p)X
kX
#k:
Sincej;kij
b
3jX
k;ij, it follows from Proposition 1(a) applied toX
= ;kiand
Y
=X
kX
#k thatjCov(;ki
X
kX
#k)j2b
3Z 2i0
Q
k;i(x
)Q
k(x
)Q
k(x
_u
)dx:
Noting that 2
pu
, we also get thatjCov(
f
00(S
k;p)X
kX
#k)j2b
2Z u0
Q
k(x
)Q
k(x
_u
)dx:
Hence
12jCov(
f
00(S
k;1)X
kX
#k)jZ
1=2
0
(
b
3(M
k(xu
);Q
k(x
)) +b
21Ix<u)Q
k(x
)Q
k(x
_u
)dx
(2:
6) which together with (2.5) and (2.4) implies thatjIE(
f
(S
k;1+X
k);f
(S
k;1);f
0(S
k;1)X
k);12IE(
f
00(S
k;1))IE(X
k2)jb
2Z u0
Q
2k(x
)dx
+b
3Z 10
M
k(xu
)Q
k(x
)Q
k(x
_u
)dx:
(2:
7) It remains to give an estimate of the expectation off
0(S
k;1)X
k. ClearlyIE(
f
0(S
k;1)X
k) =kX;1i=1Cov(
f
0(S
k;i);f
0(S
k;i;1)X
k):
(2:
8) In order to estimate the terms in (2.8), we need the following general prin- ciple, due to Frechet (1951, 1957) and Bass (1955).Lemma 1. Let
Z
1, ...Z
mbe nonnegative random variables with respective quantile functionsQ
Z1, ... ,Q
Zm. ThenIE(
Z
1:::Z
m)Z
1
0
Q
Z1(x
):::Q
Zm(x
)dx:
Remark 3. Actually Frechet gives a complete proof in the case
m
= 2.However, the proof uses the same arguments in the general case.
Let us also state the following by-products of Lemma 1, which will be used later on : with the same notations as in Lemma 1,
Z
1
0
Q
Z1Z2(x
)Q
Z3(x
):::Q
Zm(x
)dx
Z 10
Q
Z1(x
)Q
Z2(x
):::Q
Zm(x
)dx
(2:
9a
)and
Z
1
0
Q
Z1+Z2(x
)Q
Z3(x
):::Q
Zm(x
)dx
Z
1
0
(
Q
Z1(x
) +Q
Z2(x
))Q
Z3(x
):::Q
Zm(x
)dx:
(2:
9b
) Proof of (2.9). LetU
be a random variable uniformly distributed on 01].Then, for any nonnegative r.v.
T
,Q
T(U
) has the same distribution asT
. Consequently, by Lemma A1 of Berkes and Philipp (1979), one can construct random variablesT
1:::T
mon another probability space such that:(
T
1T
2T
3:::T
m) has the same distribution as (Q
Z1Z2(U
)Q
Z3(U
):::
) and (T
1T
2) has the same law as (Z
1Z
2). So, by Lemma 1, we haveZ
1
0
Q
Z1Z2(x
)Q
Z3(x
):::Q
Zm(x
)dx
= IE(T
1T
2:::T
m)Z
1
0
Q
Z1(x
)Q
Z2(x
):::Q
Zm(x
)dx:
The proof of (2.9b) is omitted. tu
For any
i
p
, 2iu
. So, noting thatj
f
0(S
k;i);f
0(S
k;i;1)jb
2jX
k;ij we have, by Proposition 1(a),(2
:
10) jCov(f
0(S
k;i);f
0(S
k;i;1)X
k)j2b
2Z u0
1Ix<2i
Q
k;i(x
)Q
k(x
)dx:
From now on, we assume that
i < p
. Let us replaceX
k by #X
k. Applying Proposition 1(a), we get thatjCov(
f
0(S
k;i);f
0(S
k;i;1)X
k;X
#k)j 2b
2Z u0
1Ix<2i
Q
k;i(x
)(Q
k(x
);Q
k(u
))dx:
(2:
11) Nowf
0(S
k;i);f
0(S
k;i;1);f
00(S
k;i;1)X
k;i=R
kiwhere
R
ki is Fk;i-measurable and jR
kijb
3X
k2;i=
2. Consequently, by Proposition 1(a), we have:jCov(
R
kiX
#k)jb
3Z 2i0
Q
2k;i(x
)Q
k(x
_u
)dx:
(2:
12) In order to estimate the term Cov(f
00(S
k;i;1)X
k;iX
#k), we introduce the decomposition below:f
00(S
k;i;1) =Xi;1l=1(
f
00(S
k;i;l);f
00(S
k;i;l;1)) +f
00(S
k;2i):
By Proposition 1(a) applied with
X
= ;kl+iX
k;iandY
=X
ktogether with Lemma 1 and (2.9a),jCov(;kl+i
X
k;iX
#k)j2b
3Z 2i0
Q
k;i;l(x
)Q
k;i(x
)Q
k(x
_u
)dx:
(2:
13) Next, by Lemma 1 applied to independent r.v.'s,jIE(
f
00(S
k;2i)X
k;i)IE( #X
k)jb
2Z u0
1Ix<2i
Q
k;i(x
)(Q
k(x
);Q
k(u
))dx
(2:
14) (because IE( #X
k) = IE( #X
k;X
k) andu <
2i).As a second step, we bound up jCov(
f
00(S
k;2i)X
k;iX
#k)j. Clearlyf
00(S
k;2i) =pX;1l=i;kl+i+
f
00(S
k;i;p):
Now, by Proposition 1(a) applied with
X
= ;kl+i,Y
=X
k;iX
#k and (2.9a)jCov(;kl+i
X
k;iX
#k)j2b
3Z 2l0
Q
k;i;l(x
)Q
k;i(x
)Q
k(x
_u
)dx:
(2:
15) Noting that 2pu <
2i, applying both (a) of Proposition 1 withX
=f
00(S
k;i;p),Y
=X
k;iX
#k and (2.9a), we also get thatjCov(
f
00(S
k;i;p)X
k;iX
#k)j2b
2Z u0
1Ix<2i
Q
k;i(x
)Q
k(u
)dx:
(2:
16) Adding the inequalities (2.10), (2.11), (2.12), (2.13), (2.14), (2.15) and (2.16) and summing oni
andl
, we then get :jIE(
f
0(S
k;1)X
k);pX;1i=1IE(
f
00(S
k;2i))IE(X
k;iX
#k)j 3b
2Z u0
(
M
k(x
);Q
k(x
))Q
k(x
)dx
+ 2b
3Z 10
(
M
k0(xu
);M
k(xu
)Q
k(x
))Q
k(x
_u
)dx:
(2:
17) It remains to bound upD
k=Xp;1i=1IE(
f
00(S
k;2i))IE(X
k;iX
#k);kX;1i=1IE(
f
00(S
k;1))IE(X
k;iX
k):
We rst note that, by Proposition 1(a),X
ip
jIE(
f
00(S
k;1))IE(X
k;iX
k)jb
2Xip
jIE(
X
k;iX
k)jb
2Z u0 X
ip1Ix<2i