• Aucun résultat trouvé

A new large deviation inequality for <span class="mathjax-formula">$U$</span>-statistics of order <span class="mathjax-formula">$2$</span>

N/A
N/A
Protected

Academic year: 2022

Partager "A new large deviation inequality for <span class="mathjax-formula">$U$</span>-statistics of order <span class="mathjax-formula">$2$</span>"

Copied!
12
0
0

Texte intégral

(1)

URL:http://www.emath.fr/ps/

A NEW LARGE DEVIATION INEQUALITY FOR U-STATISTICS OF ORDER 2

Jean Bretagnolle

1

Abstract. We prove a new large deviation inequality with applications when projecting a density on a wavelet basis.

R´esum´e. Nous prouvons une in´egalit´e de grandes d´eviations applicable `a la projection d’une densit´e sur une base d’ondelettes.

AMS Subject Classification. 60F10, 62-xx.

Received July 2, 1998. Revised September 14, 1999.

1. Introduction and main result

LetFbe some law onR. Whengis a measurable function fromRdtoR,E(g) andV ar(g) denote expectation and variance with respect to the ProbabilityFd. Let f be a bounded and symmetric function fromR2 toR. FollowingArcones and Gin´e[1], we construct its canonical projections: ξandη being independent with lawF

π1f(x) =Ef(x, η)−Ef(ξ, η)·

π2f(x, y) =f(x, y)−Ef(x, η)−Ef(ξ, y) +Ef(ξ, η)· (1) Letξi,i= 1,2,· · ·, nbe an-sample ofF (n2). We consider theU-statistics (without any normalisation)

Un(2)(f) = Σ1i6=jnfi, ξj)· Un(1)1f) = Σ1inπ1fi)·

Un(2)2f) = Σ1i6=jnπ2fi, ξj), thus

Un(2)(f−Ef)) = 2(n1)Un(1)1f) +Un(2)2f)· (2) We are interested in a large deviation inequality for the latter U-statistic whenf is centered and bounded.

First, if|f| ≤candEf2=σ2, the usual Bernstein type inequality is

P(Un(2)(f−Ef)≥n(n−1)t) exp ([n/2]t2/{2+ 2ct/3}). (AG1) But we can considerUn(1)1f) (which is a sum of i.i.d. R-valued random variables) as the main part and it can be interesting to bound separately the second partUn(2)2f). Now, asπ2f is canonical of order 2, if2f| ≤c

Keywords and phrases:Large deviations, U-statistics.

1 URAD 0743, Universit´e d’Orsay, bˆatiment 425, 91405 Orsay Cedex, France; e-mail:[email protected] c EDP Sciences, SMAI 1999

(2)

andσ2=E(π2f)2, there exist two constantsc1, c2such that

P(|Un(2)2f)| ≥(n1)t)≤c1exp(−c2t/{σ+c2/3t1/3n1/3}) (AG2) ((AG1) and (AG2) can be found in Arcones and Gin´e [1]).

The normalized version of inequality (AG2) can be restated P(|Un(2)2f)|)≥a3Max (nσx, c

nx3/2))≤a4 exp (−x). (AG2’) The aim of the paper is to give a new large deviation inequality (Th. 1 and Cor. 1 later). To every partitionD we associate two functionnals kfkDandvD(F) (see Def. 4 later) such that

P(|Un(2)2f)| ≥a03xkfkD Max (np

vD(F),1))≤a04exp (2

x) (3’)

where alla3,a03are universal constants. For a comparison between the two inequalities, see the discussion after Theorem 2infra.

We need now some definitions: let (Iλ |λ∈D) be a Borelian partition ofRfinite or enumerable, where Iλ

denotes the subset and its indicator. Let τ be a permutation ofD. Itsgraphgτ is{(λ, τ(λ))|λ∈D}, a subset ofD×D. AcollectionGis an enumerable set (τs|s∈G) of permutations such thatD×D⊂ ∪sGgτs. His the family of collections. For any matrixM = (aλ,µ|λ∈D; µ∈D) we set

kMkG= ΣiG SupλD|aλ,τi(λ)| and kMkD= Inf G∈HkMkG. Letf be a bounded real valued function defined onR×R. We set

M(D, f) = (aλ,µ) where aλ,µ= Sup (f2(x, y)Iλ(x)Iµ(y)) kfk2D = kM(D, f)kD

vD(F) = ΣλP∈Iλ)2. Our main result is

Theorem 1. There exists some constant C (C = 80π holds) such that for every integer k > 0, for every partitionD, for every symmetricf

E(Un(2)2f)2k)≤C2k×(EN2k)4× kfk2kD Max(n2k(vD(F))k, n2vD(F)).

whereN denotes the standard normal distribution.

Corollary 1. There exists some constantC such that for every partition D, for every symmetric f, for every x >0

P(|Un(2)2f)| ≥CxkfkDMax (np

vD(F),1))exp(62 x).

As we will see in the followingDiscussion,

Remark 1: The classical inequality (AG) is strictly better wheneverx≤nσ2/c2. Remark 2: Nevertheless, our inequality can work when the classical one does not.

Remark 3: Finally, up to some logarithm,

xis the best possible rate.

Main application: Let (Ψ1,Ψ)be a wavelet andελ,ε`,λbe the associated basis:

ελ(x) = Ψ1(x+λ)/kΨ1k2, ε`,λ(x) = 2`/2Ψ(2`x+λ)/kΨk2Z, `∈N).

(3)

Letpbe some density of Probability onRequipped with Lebesgue measuredx, assumed to be square integrable in applications.

Letβλ beR

p(x)ελ(x)dx andγ`,λbeR

ε`,λ(x)p(x)dx.

We setpL= Σλβλελ+ Σ`LΣλγ`,λε`,λ(the projection ofpup to level of resolutionL) and want to estimate its square norm

θL .

=kpLk22= Σλβ2λ+ Σ`Lλγ`,λ2 ))· (4) When (ξi | 1 i n) is a n-sample with density p (n 2), the empirical estimators of coefficients are βλ,n= Σiελi)/n,γ`,λ,n= Σiε`,λi)/n. LetpL,n be

pL,n= Σλβλ,nελ+ Σ`LΣλγ`,λ,nε`,λ.

The “natural estimator” for θL is

Σλβλ,n2 + Σ`Lλγ`,λ,n2 ).

But the latter has positive bias, and the unbiased estimator is

θbL,n= (n(n1))1Σ1i6=jn{Σλελiλj) + Σ`Lλε`,λi`,λj))} · (5) Let ∆n,L =θbL,n−θL. According to (5), ∆n,L can be decomposed into canonicalU-statistics in the following way: let Φ1be Ψ1/kΨ1k2and let Φ be Ψ/kΨk2,

f(x, y) = ΣλΦ1(x+λ)Φ1(y+λ), f`(x, y) = ΣλΦ(2`x+λ)Φ(2`y+λ) (3) δn= (2/n)Un(1)1f) + (1/n(n1))Un(2)2f) (4) δ`,n= (2·2`/n)Un(1)1f`) + (2`/n(n−1))Un(2)2f`). (6) Then ∆n,L=δn+ Σ`Lδ`,n. In the decomposition above, the sum ofU-statistics of order 1 is equal to the part up to level of resolutionLof R

p(pn−p). It can be bounded in Probability by classical Bernstein’s inequality.

The control of each U-statistic of order 2 will be performed by our inequality. This is very useful, either to estimateθL [3], or in model selection: in this problem, the authors consider a wide family of finite dimensional projections. Our study is quite general, but we observe that if Ψ1 and Ψ are with compact support, for every Lthe family (βλ, α`,λ|`≤L) is in fact with finite dimension, and hypothesis (H)infra holds. Thus our result can be used in adaptive estimation of quadratic functionals in a density model (see for example [4] where white noise is treated), where the theory needs good bounds up to 2`=O(n2). These bounds cannot be obtained by classical Hoeffding’s bounds (see the chapter “Discussion”).

We assume in the whole paper Hypothesis (H).

|Φ(x)| ≤Σu∈Zωu1ux<u+1 with Σu∈Zωu<∞

and we setM2(Φ) = Σ(u,v,w)∈Z3 ωuωvωu+wωv+w. (H) In the particular case when the law has densityp, for the normalisedU-statistic, we have

Theorem 2. We assume that the function ΦwithkΦk2= 1satisfies hypothesis (H), and that the law F has a densitypwith kpk2= (R

p2(x)dx)1/2 <∞. Set

kpk2,`= 2`/2

Σk∈Z

Z (k+1)2−`

k2−`

p(x)dx

!2

1/2

(4)

δ`,n(2)= (2`/n(n−1))Un(2)2f`) where f`(x, y) = ΣλΦ(2`x+λ)Φ(2`y+λ) Z`,n=p

n(n−1)2`/2δ`,n(2). Then, ifnkpk2,`2`/2, we have, for C as in Theorem 1

P(|Z`,n| ≥2C×M(Φ)×x)≤exp(62 x)·

In this formula,V ar(Z`,n)does not depend onn. Moreover, there exists positive and finite constantsa(Φ),b(Φ) depending only onΦ such that

lim`→∞kpk2,`=kpk2·

lim`→∞2f`k=b(Φ) and a(Φ)kpk22lim inf`→∞V ar(Z`,n)· 2f`k4M(Φ) and V ar(Z`,n)≤ kpk22M2(Φ)·

Remark 4: Such a result is interesting only if it works, for givenn,`,uniformlyfor large classes of densities.

Obviously we need some uniform control ofkpk2, but this is not sufficient in view of conditionnkpk2,`2`/2. Thus we need some extra condition. If for example we assumme that the support ofpis contained in some interval [x, x+M] we getkpk2,`1/(M+ 1) and the bound works if n2(M+ 1)2`.

I would like to thanks the two anonymous referees whose remarks and suggestions have much improved the presentation.

2. Discussion

a) About Corollary 1.

We consider inequality stated in Corollary 1 and assertion (AG2”). We assummekfk = 1. Without further knowledge about the law F, we can only bound σ2=V ar(π2(f)) byEf2 andc =2fk by 4. Up to some change of a3, (AG2’) is restated

P

|Un(2)2f)| ≥a3 Max nxp

Ef2,√

nx3/2

≤a4 exp (−x).

On the other hand, we have the obvious inequality

Ef2≤ kfk2DvD(F).

Proof of Remark 1: Let us assumme x≤ 2. Thus Max(nσx, c

nx3/2) = nσx≤ nkfk2x xkfkD Max (np

vD(F),1) and, up to constants (AG2’) isalwaysbetter than the bound of Corollary 1 wheneverx≤nσ2. Nevertheless, our inequality provides a possible alternative if there exists some partitionDsuch thatn2vD1 andkfk2DvD(F)≈Ef2(up to constant). We exhibit two extremal cases in the case when the law is the uniform one:

1) f(x, y) =110x<p;0y<p, where 0< p <1. We setq= 1−p. We havekfk = 1, Ef2=p2. Choosing I1= (0, p)Iλ=)p+ (λ2)q/K, p+ (λ1)q/K) for 1< λ≤K+ 1, we getvD=p2+q2/K and, for a convenient choice ofG,kfk2D= 1. MoreoverV ar(π2f) =p2q2 and2fk=max(p2, q2).

When pis small, up to constants, in the classical inequality we can use either precise true parameters or rough estimates (kfk andEf2for2fk andV ar(π2f)) and we get

P(|Un(2)2f)| ≥aMax (nxp,

nx3/2))≤a4 exp (−x) (AG2’) Our result is

P(|Un(2)2f)| ≥Cx Max (np,1)) exp (62

x). (Cor 1)

(5)

In this setting, we can assumex >1,nlarge.

If 1 < x < np2, the classical inequality is better, but (up to constants) only with respect to the exponent ofx.

A contrariothe classical result does not work in the case whennp2=o(1) but not our one provided thatnp is large, and this justify the Remark 2.

We will see that it is a quite general result in themain application.

Remark 3: Assumingnp= 1 and denotingfn the corresponding function, whenn→ ∞, it is easy to prove that Un(2)2fn) converges in law to Y23Y + 1 where the law ofY is the Poisson law with parameter 1. Thus

lim inf x→∞ limn{−Log P(Un(2)2fn)≥x)/√

xLog (

x)} ≥1 proving that the power 1/2 is the best possible.

2) Let be g(x, y) = 110x1; 0y111px1p; py1p ( p small) and ε(x) = 11x1/2111/2<x and finally f(x, y) =ε(x)ε(y)g(x, y). Then, for the uniform law,f =π2f andV ar(π2f) =V ar(f) = 4p(1−p). Obviously, for every D, kfk2D = |D| and vD(F) 1/D, thus kfkDmax(np

vD(F),1) n (and n is obtained by the partition withoneelement).

The classical inequality gives for some universala P(|Un(2)2f)| ≥aMax (n

px,√

nx3/2))≤a4 exp (−x) and,whatever be p, our inequality provides only

P(|Un(2)2f)| ≥Cxn)≤ exp (62 x) a very poor result!

b)About the main application:

b1) We consider firstly the case of the Haar basis (Φ(x) = 110x<1 or Φ(x) = 110x<1/2111/2x<1), with uniform law on the interval ]0,1]. In the first case, at the level `, setting D = ([λ2`,(λ+ 1)2`[| λ Z), f` = Σλ∈ZΦ(2`x+λ)Φ(2`y+λ), we have M(Φ) = kf`kD = 1, 2(f`)k = (12`), vD(F) = 2` and V ar(π2(f`)) = (12`)22`.

Thus for every`≥1 we have

2`vD(F) =kfkD= 1.

1/2≤V ar(Z`,n) = 2`V ar(π2f`)1.

1/2≤ kπ2f`k1 and, whenever 2`≤n2, Theorem 2 provides

P

|Z`|/ q

V ar(Z`,n)2Cx

exp (62 x).

The classical one provides P

|Z`|/ q

V ar(Z`,n)≥a3(x+ (2`/n)1/2x3/2)

≤a4 exp (−x)

and does not work ifn=o(2`).

Remark 5: Massart (private communication) thinks that using Talagrand’s inequality the best possible band- with is 2`=O(n3/2) and this is the principal motivation of this work.

(6)

b2) Finally, let κ2n,` the chi-square (with 2`1 degrees of freedom) associated to the partition: with Nλ = Σ1in11λ2−`i(λ+1)2−`

κ2n,`= Σλ(Nλ−ENλ)2/ENλ. The centered and normalised κ2n,` is equal to Z`/p

V ar(Z`,n). Thus our result provides a large deviation inequality forκ2n,`even in the case whenn≈2`/2. Remark that the mean number of visitsENλcan beO(1/n)!

The second case is the best possible: for every ` we have kf`k = 1, f` is canonical and kfk2DvD(F) = V ar(Z`).

b3) We consider now the general case in the main application:

Using the final assertions of Theorem 2, we see that we have asymptotically the same conclusion as in the case of b1: whenever` is large, ourZ, up to constants depending only on Φ and the lawp, is the normalised U-statistic of order 2 corresponding to some canonical function the Sup norm of which is equivalent to 1.

Thus we get a large deviation inequality which cannot be obtained using the classical result for 2`/2n2`.

3. Proofs

The proof is based on De la P enae 0s inequalities [2]. As all bounds are continuous with respect tokfkD, vD(F), it suffices to prove that, ifGis a collection such that ΣiG SupλDSup (f2(x, y)Iλ(x)Iτi(λ)(y)) = 1, then

E(Vn2k)≤C2k×(EN2k)4× Max (n2v,(n2v)k) (8) where, to simplify notations, we set

Vn = Σ1i6=jnπ2fi, ξj) and v=vD(F)· (9) 1: Symmetrization

Letεi, ε0i,Ni,Ni0, ξi, ηi be six independent n-samples: the common law ofε’s is the law of the centered sign, the common law of theN’s is the normalN(0,1), the common law ofξ’s andη’s is the lawF.

Using the first Theorem ofDe la P eena, asπ2f is canonical, we get:

For every Γ even, increasing onR+ and convex

EΓ(Vn)≤EΓ(4Σ1i,jn;i6=jπ2fi, ηj))· Using the classical symmetrization inequalities (see [2] again), we have

EΓ(Vn)≤EΓ(16Σ1i,jn;i6=jεiε0jπ2fi, ηj)).

As theεi,ε0i can be viewed as conditional expectations ofp

π/2Ni,p

π/2Ni0, using convexity again we get EΓ(Vn)≤EΓ(8πΣ1i,jn;i6=jNiNj0π2fi, ηj)).

We set now

Wn= Σ1i,jn;i6=jπ2fi, ηj)2. (10) In law, (Σ1i,jn;i6=jNiNj0π2fi, ηj))2 = N2Σijπ2fi, ηj)Nj)2) or N2ΣkλkNk2, where Σkλk = Wn with λk0, thus, by convexity:

Lemma 1.For every k∈N, we have

EVn2k (8π)2k(EN2k)2EWnk.

(7)

2: Bounds for functions

k is a natural integer. The current indexesi,j of the sample belongs to [1, n]. The currentsbelongs to G, other current indexes asλ,µ,.. belong toD. We have|f| ≤√

hwhere

h(x, y) = Σλ,µaλ,µIλ(x)Iµ(y). (11) Thus2f(x, y)| ≤R R

(

h(x, y) +√

h(x, t) +√

h(z, y) +√

h(z, t))F(dz)F(dt) and finally π2f(x, y)24

Z

(h(x, y) +h(x, t) +h(z, y) +h(z, t))F(dz)F(dt)· Thus by convexity

Lemma 2. Forhdefined in (11) and natural integerk we have

EWnk (16)kE((Σ1i6=jnh(ξi, ηj))k). (12) 3: Bounds for moments

We define the numbers of visits ofIλ by each of the two samples as

Xλ = Σ1inIλi) and Yλ= Σ1inIλi)· (13) Letτ be the current permutation ofGandπτ be Sup λaλ,τ(λ). We have obviously

Σ1i6=jnh(ξi, ηj)ΣτπτλXλYτ(λ))· As Στπτ = 1, by convexity again and (13) we obtain

EWnk(16)k SupτE((ΣλXλYτ(λ))k)· Appendix 1 contains the proof of the main technical result, namely:

Lemma 3.With previous notations, for everyτ, for every integerk≥1, we have E(ΣλXλYτ(λ))k6kMax (n2v,(n2v)k)(EN2k)2.

Collecting the previous bounds, proof of Theorem 1 is achieved.

4: Proof of Corollary 1

A) We assume thatn2v≥1. LetX beUn(2)(f)/CkfkDn√ v.

Appendix 2 contains the proof of the quite obvious

Lemma 4.If for every natural integerk we haveEX2k(EN2k)4, then P(|X| ≥x)≤exp 62

x . B) Now, ifn2v≤1, letY beUn(2)(f)/CkfkD. For the same reason we have

P(|Y| ≥x)≤ exp 62 x

.

This achieves the proof.

5: Proof of Theorem 2

Let ∆ be some positive integer and D be the partition (Iλ,D = (λ/∆,(λ+ 1)/∆(| λ∈Z). In what follows indexesλ, u, v, sbelong toZ.

(8)

Let pλ,D be P Iλ,D) and pD be the density pD(x) = Σ∆pλ,DIλ,D. Expectation with respect to pD is denotedED. We have

lim→∞kpD−pk2= 0, ∆vD=kpDk22and thus lim→∞∆vD=kpk22·

We set Φ(x) = Σu∈Zγu(x−u)11ux<u+1, where the support ofγu is included in [0,1[ (thus defining theγu’s);

we haveuk≤ωu. AskΦk2= 1, there exists someuo withuok2>0.

We setfD(x, y) = ΣλΦ(∆x+λ)Φ(∆y+λ).

a)Bounds forEfD2(ξ, η)andkfDkD:

We begin by bounding from below the quantityEfD2(ξ, η).

We haveEfD2(ξ, η) = Σλ,µ(EΦ(∆ξ+λ)Φ(∆η+µ))2Σλ(EΦ2(∆x+λ))2, thusEfD2(ξ, η)Σλ(Eγu2o(∆ξ+ λ−uo)Iuoλ,D)2. A classical computation gives (EgIµ,D)2(EDgIµ,D)2≥ −2kgIµ,DkkgIµ,Dk2pµ,D(R

Iµ,D(p pD)2dx)1/2 thenEfD2(ξ, η)Σλ(EDγ2uo(∆ξ+λ−uo)Iuoλ,D)2uo(u2ok2/√

∆)√vDkp−pDk2. As (EDγu2o (∆ξ+λ−uo)Iuoλ,D)2=p2uoλ,Duok42, we get

∆EfD2(ξ, η)∆vDuok42u3okp−pDk2p

∆vDand

lim inf→∞∆EfD2(ξ, η)≥ kp2k2a(Φ) :=kpk22uok42>0· (15) On the other hand,|fD(x, y)| ≤g(x, y) := Σλ|Φ(∆x+λ)Φ(∆y+λ)|. Using (H), we haveg(x, y)≤Σλ,u,vωu+λωv+λ

Iu(x)Iv(y).

Setting √as= Σuωuωu+s,g(x, y)≤Σλ,s√asIλ(x)Iλ+s(y), then

g2(x, y)ΣsasλIλ(x)Iλ+s(y))· (16) But Σsas= Σu,v,sωuωvωu+sωv+s=M(Φ)2Σu,vωuωvωuωv = (Σuωu2)21 because kΦk2= 1. Taking forG the collection ofλ:→λ+s, we get

kfDk2D≤M2(Φ) andM2(Φ)1· (17)

We recall the obvious upper bound

EfD2 ≤vDM2(Φ)≤ kpk22M2(Φ)/∆· (18) b)Bounds forkfDk:

Obviously,kfDk does not depend onD, and is less thanM(Φ):

There exists someb(Φ) with 0< b(Φ) =kfDk≤M(Φ). (19) d)Bounds forkπ2fDk2 andkπ2fDk:

As for ∆ large, Supλpλ,D = o(1/√

∆) we have |EΦ(∆η +λ)| = o(1/√

∆). Thus kEf(x, η)k :=

Supx|Ef(x, η)| →0. We haveπ2fD(x, y) =fD(x, y)−Ef(x, η)−Ef(ξ, y) +Ef(ξ, η), and asymptotically we have

lim→∞2fDk=b(Φ) (and obviously by (19)kπ2fDk4M(Φ). (20) We haveE(fD)2 ≥V ar(π2fD)Σλ(V arΦ2(∆ξ+λ))2 = Σλ(E(Φ2(∆ξ+λ)−E(Φ(∆ξ+λ)2)+)2. Using the fact that ∆E(Φ(∆ξ+λ)2 goes uniformly to 0 and (15), we get

a(Φ)kpk22 lim inf→∞∆V ar(π2fD) (21)

(9)

∆V ar(π2fD)≤ kpk22M2(Φ).

e) Proof of Theorem 2

We set now ∆ = 2`. UsingnC/p

n/(n−1)2C,n2vD1,kfDkD≤M(Φ) and ∆vD≤ kpk22, Corollary 1 gives the exponential upper bound.

Appendix 1: Proof of Lemma 3

In this appendix where partitionD is fixed, we use notation

vD=v, pλ=P(ξ∈Iλ), ηλ=E(Xλ) =npλ thus Ση2λ=n2 We recall thatτ is some permutation ofD, and thatλis the current point ofD.

We consider two laws onND×ND, the current point of which is (X,Y). In all cases,XandYare independent with the same law.

In the first case, the law of X is M(n,p) (n 2), the Multinomial where p = (pλ), with associated expectationE.

In the second one, theXλ are independent, with Poisson law, and mean valueEη(X) =η, whereη = (ηλ) = np. The associated expectation isEη.

We consider the mappingU fromND×NDtoRdefined byU(X,Y) = ΣλXλYσ(λ). We will first prove that for every positive integerk E(Uk) is less than Eη(Uk) and then furnish an upper bound for this moment.

1: Reduction to the Poisson case

In what follows, Eµ denotes the expectation associated to the Poisson law with parameter µ. X[k]is the PolynomialX(X1)· ·(X−k+ 1), for whichEµX[k]=µk.

Definitions: A mappingψfrom Nto Nis strongly positive if

ψ(X) = ΣkakX[k], withak 0 for everyk. (d1) A mapping Ψ from ND toN is strongly positive if there exist some enumerableI, a family (ψλ,i |λ;i∈I), a family (ai|i∈I), where eachψλ,iis strongly positive and eachai is positive, such that

Ψ(X) = ΣiaiΠλψλ,i(Xλ)· (d2)

Lemma 5.If Ψis strongly positive, then

E(Ψ(X))≤Eη(Ψ(X)). (a1)

For everyk,Xk and{X(X−1)}k are strongly positive. Moreover for k >0, we have

Eµ{X(X1)}kMax (µ2k, µ2)×(EN2k)2. (a2) Remark: The upper bound in (a2) is increasing ofkandµ(k1 andµ >0).

Proof of (a1): By d2, it suffices to prove the formula when Ψ(X) = ΠXλ[kλ]. For such a Ψ, EΨ(X) = 0 if Σλkλ > n, and (n!/(n−Σλkλ)!)Πλpkλλ else, obviously less than Πλ(npλ)kλ =Eηψ(X).

Proofof (a2): The fact thatXk is strongly positive (in our sense) is well-known.

LetTk be {X(X1)}k. T1is X[2] andEµT1=µ2. Assumek >1; withx=X−2,Tk =X[2]{(x+ 2)(x+ 1)}k1. But{(x+ 2)(x+ 1)}k1is polynomial with respect tox, with positive coefficients, thus strongly positive

(10)

with respect tox: {(x+ 2)(x+ 1)}k1= Σ0j2k2γj,kx[j]; finallyX[2]x[j] =X[j+2]andTkis strongly positive.

Moreover,EµTk=EµΣ0j2k2γj,kX[2+j]=µ2Σ0j2k2γj,kEX[j] =µ2Eµ{(X+ 2)(X+ 1)}k1.

Letgk begk(X) =Tk(X)11X>1. gkis convex, and ifX is a Poisson r.v, almost surelygk(X) =Tk. Fork= 1, Eµg1(X) =µ2; letkbe>1. We have obtainedEµ{X(X1)}k =µ2Eµgk1(X+ 2). LetY be independent of X, Poisson with parameter 2; by Jensen, conditionally onX =x, gk1(x+ 2)≤E2gk1(x+Y), thus, as the law ofX+Y is Poisson with parameter 2 +µ, we getEµ{X(X1)}k≤µ2Eµ+2gk1(X), thus, recursively

ifk >0, thenEµ{X(X1)}k [µ(µ+ 2)· · ·(µ+ 2k2)]2.

The productµ(µ+ 2)· · ·(µ+ 2k2) is bounded by Max (µk, µ)×(1·3·5· · ·(2k1)) = Max (µk, µ)×EN2k and the proof is achieved for a2.

Now we return to the proof. Uk being a sum with positive coefficients of products of powers of the almost surely positiveX’s and theY’s is obviously strongly positive with respect to theX’s andY’s; by independence and (a1), we obtain for everyk≥0:

EUk≤EηUk· (a3)

2: The Poisson case

For every pairx,y of natural integers, we have easily

xy≤x(x−1) +y(y−1) +11x=1×11y=1· (a4) Let us define now

Z= ΣλXλ(Xλ1), Z0 = ΣλYλ(Yλ1)

T = Σλ11Xλ=1×11Yτ(λ)=1 (d3)

Using the fact that τ is a permutation, by a4 we haveU ≤Z+Z0+T, then, as the laws ofZ andZ0 are the same

EηUk3k Max (EηZk, EηTk)· (a5) 3: Bound for the first term

AsEηZ=n2v, we assume thatk >1. We set

E(k, µ) = Max (µ2k, µ2)(EN2k)2 if k >0and1else· By a2, we have

EηZk Σkλ0;Σλkλ=k{k!/Πλkλ!}ΠλE(kλ, ηλ)· (a6) First case: If for eachλ,ηλ1, then, as ΠλEN2kλ ≤ENλkλ, we have

EηZk(EN2k)2Σkλ0;Σλkλ=k{k!/Πλkλ!λ2kλ (5) thus, in the first case, for everyk≥0

EηZk (n2v)k(EN2k)2· (A1)

(11)

Second case: For eachλ,ηλ1:

Let A be a non-void subset of [0, D[∩Z, and M(A, k) be the subset of NA given by (ki | i A; ki >

0f or each i∈A; ΣiAki=k) . We set

S(A, k) = ΣM(A,k){k!/Πiki!}ΠiE(ki, ηi)· The general term ofS(A, k) is{k!/Πiki!}Πiηi2ΠiE(N2ki)2.

Letνj be (EN2j)2/j! (j∈N). Elementary computation gives

If 1≤j≤k, thenνjνk≤νj1νk+1. (a7) Thus the general term ofS(A, k) is bounded by Πiη2iE(N2k)2 (obtained outside ofM(A, k), when allki are 0 except one).

On the other hand, it is well-known that|M(A, k)|= (ka11), wherea=|A|. Finally Eη(Zk)Σnon voidS(A, k)≤(E(N2k)2× {Σa>0(ka11)(Σ|A|=aΠiAη2i)

As Σ|A|=aΠiAηi2(n2v)a and Σa>0(ka11) = 2k1, we get in the second case, for any k≥1

EηZk2k1 Max (n2v,(n2v)k)(EN2k)2· (A2) General case: We divide [0, D[ into two (non void) subsets:

1= (λλ<1) and ∆2= (λλ1), and setvi= Σλiη2λ. Using (A1, A2), we obtain

E(Zk)(EN2k)2(n2v2)k+ Σj>0(kj)2j1(EN2j)2(EN2k2j)2 Max (n2v1,(n2v1)j)(n2v2)kj. The latter bound is increasing ofvi, each bounded byv. Thus finally, in any case, fork≥1

EηZk 2k Max (n2v,(n2v)k)(EN2k)2· (A3) 4: Bound for the second term

We can boundEη11Xλ=1×11Yτ(λ)=1byb2λ:=ηλητ(λ). We take notations ofSecond caseof previous paragraph.

Setting now

S0(A, k) = ΣM(A,k){k!/Πiki!}Πib2i, we haveEηTk = ΣAnon voidS0(A, k).

The current term ofS0(A, k) is bounded byk!Πib2i. Thus we obtain here EηTk≤k!2k1Max (w, wk) wherew= Σλb2λ. As by Cauchy-Schwartzw≤n2v andk!≤(EN2k)2, we get again fork≥1

EηTk 2k Max (n2v,(n2v)k)(EN2k)2· (A4)

Using (a3, a5, A3) and (A4), the proof is finished.

Appendix2: Proof of Lemma 4

For k N, we set uk = ekEN2k(2k+ 1)k, rk = uk+1/uk = e{(2k+ 1)/(2k+ 3)}k+1 and finally ϕ(x)

= 1 + (x+ 1) Log ((2x+ 1)/(2x+ 3)) forx≥0, thenϕ(k) = Log (rk). We have ϕ0(x) = Log ((2x+ 1)/(2x+ 3)) + 1/(2x+ 1) + 1/(2x+ 3).

ϕ00(x)/2 = 1/(2x+ 1)1/(2x+ 3)1/(2x+ 1)21/(2x+ 3)20.

(12)

Asϕ0 goes to 0 whenxgoes to , we haveϕ00. As ϕgoes to 0 whenxgoes to,ϕ≤0. Thus, fork >0, uk+1≤uk≤u0= 1: we have proved that

for everyk∈N, EN2k ≤ek(2k+ 1)k. *

Let us assumme that for everyk∈N,EX2k(EN2k)4. If 2k+ 3≥√

x≥2k+ 1,viaMarkov’s inequality and assertion *,P(|X| ≥x)≤e4k≤e2x+6. Then the result is proved forx≥1 and obvious for 0≤x≤1.

References

[1] M.A. Arcones and E. Gin´e, Limit Theorems forU-processes.Ann. Probab.21(1993) 1494-1542.

[2] V. De la Pe˜na, Decoupling and Khintchine’s inequalities forU-statistics.Ann. Probab.20(1992) 1887-1892.

[3] B. Laurent, Efficient estimation of integral functionals of a density.Ann. Statist.24(1996) 659-681.

[4] B. Laurent and P. Massart, Adaptative estimation of a quadratic functional by model selection (1998) preprint.

Références

Documents relatifs

Si s est modérément ramifiée, elle est induite d’un caractère modérément ramifié Il de l’extension quadratique non ramifiée de F, à. valeurs dans

Vavilov, Description of subgroups of the full linear group over a semilocal ring that contain the group of diagonal

There exist extensive classifications of map-germs from n-space (n 2) into the plane up to contact equivalence (K-equivalence), see for example [D, Da, G, M, W].. In the

Soit (n, V) une représentation irréductible de G ayant un caractère central, et de dimension infinie.. Les fonctions de Kirillov

Toute utilisa- tion commerciale ou impression systématique est constitutive d’une in- fraction pénale.. Toute copie ou impression de ce fichier doit conte- nir la présente mention

section we shall assume that F is a local field and n is an irreducible admissible representation of GL2(F).. We start with the archimedean

Toute utilisation commerciale ou impression systématique est constitutive d’une infrac- tion pénale.. Toute copie ou impression de ce fichier doit contenir la présente mention

L’accès aux archives de la revue « Nouvelles annales de mathématiques » implique l’accord avec les conditions générales d’utilisation ( http://www.numdam.org/conditions )..