• Aucun résultat trouvé

Maximum entropy distribution of order statistics with given marginals

N/A
N/A
Protected

Academic year: 2022

Partager "Maximum entropy distribution of order statistics with given marginals"

Copied!
41
0
0

Texte intégral

(1)

DOI:10.3150/16-BEJ868

Maximum entropy distribution of order statistics with given marginals

C R I S T I NA BU T U C E A1, J E A N - F R A N Ç O I S D E L M A S2,*, A N N E D U T F OY3and R I C H A R D F I S C H E R1,2,3,**

1LAMA (UPE-MLV), UPE, Marne La Vallée, France. E-mail:[email protected] 2CERMICS, École des Ponts, UPE, Champs-sur-Marne, France.

E-mail:*[email protected];**[email protected]

3EDF Research & Development, Industrial Risk Management Department, Palaiseau, France.

E-mail:[email protected]

We consider distributions of ordered random vectors with given one-dimensional marginal distributions.

We give an elementary necessary and sufficient condition for the existence of such a distribution with finite entropy. In this case, we give explicitly the density of the unique distribution which achieves the maximal entropy and compute the value of its entropy. This density is the unique one which has a product form on its support and the given one-dimensional marginals. The proof relies on the study of copulas with given one-dimensional marginal distributions for its order statistics.

Keywords:copula; entropy; maximum entropy; order statistics

1. Introduction

Order statistics, an almost surely non-decreasing sequence of random variables, have received a lot of attention due to the diversity of possible applications. If X =(X1, . . . , Xd) is a d-dimensional random vector, then its order statisticsXOS=(X(1), . . . , X(d))corresponds to the permutation of the components of X in the non-decreasing order, so that X(1)X(2)

· · · ≤X(d). The components of the underlying random vectorX are usually, but not necessar- ily, independent and identically distributed (i.i.d.). Special attention has been given to extreme valuesX(1)andX(d), the rangeX(d)X(1), or the median value. Direct application of the dis- tribution of thekth largest order statistic occurs in various fields, such as climatology, extreme events, reliability, insurance, financial mathematics. We refer to the monographs of David and Nagaraja [8] and Arnold, Balakrishnan and Nagaraja [1] for a general overview on the subject of order statistics. We are interested in the dependence structure of order statistics, which has received great attention. In the i.i.d. case, Bickel [3] showed that any two order statistics are pos- itively correlated. The copula of the joint distribution ofX(1)andX(d)is derived in Schmitz [19]

with exact formulas for Kendall’sτ and Spearman’sρ. In Avérous, Genest and Kochar [2], it is shown that the dependence of thejth order statistic on theith order statistic decreases as the distance betweeniandj increases according to the bivariate monotone regression dependence ordering. The copula connecting the limit distribution of the two largest order statistics, called bi-extremal copula, is given by de Melo Mendes and Sanfins [9] with some additional properties.

Exact expressions for Pearson’s correlation coefficient, Kendall’sτ and Spearman’s ρ for any

1350-7265 © 2018 ISI/BS

(2)

two order statistics are obtained in Navarro and Balakrishnan [16]. For the non i.i.d. case, Kim and David [14] show that some pairs of order statistics can be negatively correlated, if the un- derlying random vector is sufficiently negatively dependent. Positive dependence measures for two order statistics are considered in Bolandet al.[4] when the underlying random variables are independent but arbitrarily distributed or when they are identically distributed but not indepen- dent. A generalization of these results for multivariate dependence properties is given by Hu and Chen [11]. See also Dubhashi and Häggström [10] for conditional distribution of order statistics.

Here, we focus on the cumulative distribution function (c.d.f.) of order statistics without refer- ring to an underlying distribution. That is, we consider random vectorsX=(X1, . . . , Xd)∈Rd such that a.s. X1≤ · · · ≤Xd and we suppose that the one-dimensional marginal distributions F=(Fi,1≤id)are given, whereFi is the c.d.f. ofXi. A necessary and sufficient condition for the existence of a joint distribution of order statistics with one-dimensional marginalsFis that they are stochastically ordered, that isFi1(x)Fi(x)for all 2≤id, x∈R. With the marginals fixed, the joint distribution of the order statistics can be characterized by the connect- ing copula of the random vector, which contains all information on the dependence structure of the order statistics. Copulas of order statistics derived from an underlying i.i.d. sample were con- sidered in [2] in order to calculate measures of concordance between any two pairs of order statis- tics. For order statistics derived from a general parent distribution, Navarro and Spizzichino [17]

shows that the copula of the order statistics depends on the marginals and the copula of the parent distribution through an exchangeable copula and the average of the marginals. Construction of some copula of order statistics with given marginals were given in Lebrun and Dutfoy [15].

Our aim is to find the c.d.f. of order statistics of dimensiond with fixed marginals which maximizes the relative entropyHh defined by (13). In an information-theoretic interpretation, the maximum entropy distribution is the least informative among order statistics with given marginals. This problem appears in models where the one-dimensional marginals are well known (either from different experimentation or from physical models) but the dependence structure is unknown, see Butuceaet al. [7]. In [6], the same authors gave, when it exists, the maximum entropy distribution of(X1, . . . , Xd)such thatXiis uniformly distributed on[0,1]for 1≤id and the distribution ofX(d)=max1idXi is given, see Remark4.8.

For ad-dimensional random variable X=(X1, . . . , Xd)with c.d.f. F and copulaCF, the relative entropy ofF can be decomposed into the sum of the relative entropy (with respect to a one-dimensional probability densityh) of its one-dimensional marginals plus the entropy ofCF, see Lemma2.1. In our case, since the marginalsF=(Fi,1≤id)are fixed, maximizing the entropy of the joint distributionF of an order statistics is equivalent to maximizing the entropy of its copulaCF under constraints, (see Section2.4). Therefore, we shall find the maximum entropy copula for order statistics with fixed marginal distributions.

The main result of this paper is given by Theorem5.4. It states that there exists a unique maximum entropy c.d.f.FFgiven by (52) if and only if:

d i=1

Hh(Fi)d i=2

RFi(dt)log

Fi1(t)Fi(t)>−∞.

(3)

In this case,FFis absolutely continuous with densityfFdefined as, forx=(x1, . . . , xd)∈Rd: fF(x)=f1(x1)

d i=2

fi(xi)

Fi1(xi)Fi(xi)exp

xi

xi−1

fi(s)

Fi1(s)Fi(s)ds 1LF(x), wherefiis the density function ofFi andLF⊂Rdis the set of ordered vectors(x1, . . . , xd), that isx1≤ · · · ≤xd, such thatFi1(t) >Fi(t)for allt(xi1, xi)and 2≤id. See Example5.8 for an illustrative example.

The rest of the paper is organized as follows. In Section2, we introduce the basic notations and give the definition of the objects used in later parts. Section3describes the connection between copulas of order statistics with fixed marginals, and symmetric copulas with fixed multidiago- nals. The multidiagonal, given by Definition3.6, is the generalization of the diagonal section for copulas, which received great attention in copula literature. We show that there exists a one-to- one map between these two sets of copulas, see Corollary3.14. This bijection has good properties with respect to the entropy as explained in Proposition3.22. In Section4, we determine the max- imum entropy copula with fixed multidiagonal, see Theorem4.7. Since we obtain a symmetric copula as a result, this is also the maximum entropy symmetric copula with fixed multidiagonal.

In Section5, we use the one-to-one map between the two sets of copulas established in Section3 to give the maximum entropy copula of order statistics with fixed marginals. We finally obtain the density of the maximum entropy distribution for order statistics with fixed marginals by com- posing the maximum entropy copula with the marginals, see Theorem5.4. Section6contains the detailed proofs of Theorem4.7and other results from Section4. Section7 collects the main notations of the paper to facilitate reading.

2. Notations and definitions

2.1. Notations in R

d

and generalized inverse

For a Borel setA⊂Rd, we write|A|for its Lebesgue measure. Forx=(x1, . . . , xd)∈Rd and y=(y1, . . . , yd)∈Rd, we writexyifxiyifor all 1≤id. We define minx=min{xi,1≤ id}and maxx=max{xi,1≤id}forx=(x1, . . . , xd)∈Rd. IfJ is a real-valued function defined onR, we setJ (x)=(J (x1), . . . , J (xd)). We shall consider the following subsets ofRd:

S=

(x1, . . . , xd)∈Rd, x1≤ · · · ≤xd

and =SId,

withI = [0,1]. In what follows, usuallyx, y will belong to Rd, ands, t toRorI. For a set A⊂R, we note byAc=R\Aits complementary set.

IfJ is a bounded non-decreasing càd-làg function defined onR. Its generalized inverseJ1 is given byJ1(t)=inf{s∈R;J (s)t}, fort∈R, with the convention that inf∅= +∞and infR= −∞. We have fors, t∈R:

J (t )stJ1(s), J1J (t )t and JJ1J (t )=J (t ). (1)

(4)

We define the set of points whereJ is increasing on their left:

Ig(J )=

t∈R;u < tJ (u) < J (t )

. (2)

We have:

1(Ig(J ))cdJ =0 a.e., (3)

J1(R)Ig(J )∪ {±∞} (4) and fors∈R,tIg(J ):

J (t )stJ1(s) and J1J (t )=t. (5) Notice that ifJ is continuous in addition, then we have fortJ (R):

JJ1(t)=t. (6)

2.2. C.d.f. and copula

LetX=(X1, . . . , Xd)be a random vector onRd. Its cumulative distribution function (c.d.f.), denoted by F is defined by: F (x)=P(Xx),x∈Rd. The corresponding one-dimensional marginal c.d.f.s are(Fi,1≤id)withFi(t)=P(Xit ),t∈R. The c.d.f.F is called a copula ifXi is uniform onI= [0,1]for all 1≤id. (Notice a copula is characterized by its values on Idonly.)

We defineLdas the set of c.d.f.s onRd whose one-dimensional marginal c.d.f.s are continu- ous, andCLdas the subset of copulas. We setL0d(resp.C0) the subset of absolutely continuous c.d.f. (resp. copulas) onRd.

Let us define for a c.d.f.F with one-dimensional marginals(Fi,1≤id)the functionCF

defined onId:

CF(y)=F

F11(y1), . . . , Fd1(yd)

, y=(y1, . . . , yd)Id. (7) IfFLd, then CF defined by (7) is a copula thanks to (6). According to Sklar’s theorem,F is then completely characterized by its one-dimensional marginal c.d.f.s(Fi,1≤id)and the associated copulaCF which contains all information on the dependence:

F (x)=CF

F1(x1), . . . , Fd(xd)

, x=(x1, . . . , xd)∈Rd. (8) Equivalently, ifX=(X1, . . . , Xd)has c.d.f.F, thenCF is the c.d.f. of the random vector:

F1(X1), . . . , Fd(Xd)

. (9)

(5)

2.3. Order statistics

For a d-dimensional c.d.f. F, we write PF for the distribution of a random vector X = (X1, . . . , Xd)with c.d.f.F. Ad-dimensional c.d.f.F is a c.d.f. of order statistics (and we shall say thatX is a vector of order statistics) ifPF(X1X2≤ · · · ≤Xd)=1. Let us denote by LOSdLd the set of all c.d.f.s of order statistics with continuous one-dimensional marginal c.d.f.s. Thed-tuples(Fi,1≤id)of marginal c.d.f.s then verifyFi1Fi for all 2≤id. LetFdbe the set ofd-tuples of continuous one-dimensional c.d.f.s compatible with the marginal c.d.f.s of order statistics:

Fd=

F=(Fi,1≤id)(L1)d;Fi1Fi,∀2≤id

. (10)

For a givenF=(Fi,1≤id) inFd, we define the set of c.d.f.s F of order statistics with marginal c.d.f.sF:

LOSd (F)=

FLOSd ;Fi=Fi,1≤id

. (11)

IfFFd, then we haveLOSd (F)=∅, since the c.d.f. of(F11(U ), . . . ,Fd1(U )),U uniformly distributed onI, belongs toLOSd (F). We defineCOS(F)the set of copulas of order statistics with marginalsF:

COS(F)=

CFC;FLOSd (F)

. (12)

According to Sklar’s theorem, the mapFCF is a bijection betweenLOSd (F)andCOS(F)if FFd.

2.4. Entropy

Leth be a reference probability density function onR. We define hd(x)=d

i=1h(xi)for x=(x1, . . . , xd)∈Rd. The relative Shannon-entropy for a c.d.f.FLdis given by:

Hh(F )=

⎧⎨

−∞, ifFLd\L0d,

Rdflog f/ hd

, ifFL0d, (13)

withf the density ofF. Notice thatHh(F )∈ [−∞,0]is well defined. We will use the notation Hh(X)=Hh(F )ifXis a random vector with c.d.f.F andHh(f )=Hh(F )ifF has densityf. We shall simply writeH (F )(resp.H (X)andH (f )) instead ofHh(F )(resp.Hh(X)andHh(f )) whenh=1[0,1]. Note thatH (F )can be finite only ifF is the c.d.f. of a probability distribution on[0,1]d.

According to the next lemma, the relative entropy of anyFL1cd can be decomposed into the relative entropy of the one-dimensional marginals c.d.f.(Fi,1≤id)and the entropy of the associated copulaCF.

(6)

Lemma 2.1. LetFL1cd .We have:

Hh(F )=H (CF)+ d i=1

Hh(Fi). (14)

Proof. It is left to the reader to check thatF has a density, sayf, if and only ifFi has a density, sayfi, for 1≤idandCF has a density, saycF. Furthermore, in this case, we have:

f (x)=cF

F1(x1), . . . , Fd(xd)d

i=1

fi(xi) a.e. forx=(x1, . . . , xd)∈Rd, as well as, with the convention 0/0=0,

cF(u)=f (F11(u1), . . . , Fd1(ud)) d

i=1fi(Fi1(ui)) a.e. foru=(u1, . . . , ud)Id.

On the one hand, ifF does not have a density then we haveHh(F )= −∞. Since F does not have a density, then one of theFi orCF does not have a density either, and thenH (CF)+ d

i=1Hh(Fi)= −∞. Thus (14) holds.

On the other hand, let us assume thatF has a density, sayf. Elementary computations give withx=(x1, . . . , xd)and 1≤id:

Hh(Fi)= −

Rfi(xi)log

(fi/ h)(xi)

dxi= −

Rdf (x)log

(fi/ h)(xi) dx.

We also have withu=(u1, . . . , ud)andx=(x1, . . . , xd):

H (CF)= −

[0,1]dcFlog(cF)= −

f (F11(u1), . . . , Fd1(ud)) d

i=1fi(Fi1(ui)) log cF(u)

du

= −

f (x)log cF

F1(x1), . . . , Fd(xd) dx,

where, for the last equality, we used the change of variableFi(xi)=ui (forxiIg(Fi)) so that Fi1(ui)=Fi1Fi(xi)=xi holdsfi(xi) dxi-a.e and thatf (x) dx=0 on(di=1Ig(Fi))c. Then use thatf (x)=cF(F1(x1), . . . , Fd(xd)) d

i=1fi(xi)a.e. forx =(x1, . . . , xd)∈Rd to deduce thatH (CF)+d

i=1Hh(Fi)= −

flog(f/ hd)=Hh(F ).

Remark 2.2. Notice that ifFi has densityfi for 1≤id, then one can choose the reference probability densityh(t)=1dd

i=1fi(t)so thatHh(Fi)≥ −log(d). In this case,Hh(F )is finite if and only ifH (CF)is finite, and we have:

H (CF)=Hh(F )d i=1

Hh(Fi)= −

f (x)log

f (x)

d

i=1fi(xi) dx.

(7)

Thus,H (CF)is the relative entropy of the c.d.f.F with respect to the probability distribution with c.d.f.⊗di=1Fi of independent real valued random variables with the same one-dimensional marginal as the one with c.d.f.F. This emphasizes the fact thatHh(F )d

i=1Hh(Fi), when it is well defined, does not depend onh.

ForF=(Fi,1≤id)Fd, we defineJ(F)taking values in[0,+∞]by:

J(F)= d

i=2

RFi(dt)log

Fi1(t)Fi(t). (15) Our aim is to find the c.d.f.FLOSd (F)which maximizes the entropyHh. We shall see that this is possible if and only ifJ(F)is finite. From an information theory point of view, this is the distribution which is the least informative among distributions of order statistics with given one-dimensional marginal c.d.f.sF. Since the vector of marginal distribution functionsFis fixed, thanks to (14), we notice thatHh(F )is maximal onLOSd (F)if and only ifH (CF)is maximal on COS(F). Therefore, we focus on finding the copulaCCOS(F)which maximizes the entropyH. We will give the solution of this problem in Section5under some additional hypotheses onF.

3. Symmetric copulas with given order statistics

In this section, we introduce an operator on the setCOS(F)of copulas of order statistics with fixed marginal c.d.f.sF. This operator assigns to a copulaCCOS(F)the copula of the exchangeable random vector associated to the order statistics with marginal c.d.f.sFand copulaC. We show that this operator is a bijection betweenCOS(F)and a set of symmetric copulas which can be characterized by their multidiagonal, which is a generalization of the well-known diagonal sec- tion of copulas. This bijection has good properties with respect to the entropyH, giving us a problem equivalent to maximizingHonCOS(F). We shall solve this problem in Section4.

3.1. Symmetric copulas

Forx=(x1, . . . , xd)∈Rdwe definexOS=(x(1), . . . , x(d))the ordered vector (increasing order) ofx, wherex(1)≤ · · · ≤x(d)andd

i=1δˆxi=d

i=1δˆx(i), withδˆt the Dirac mass att∈R.

LetSdbe the set of permutations on{1, . . . , d}. Forx=(x1, . . . , xd)∈RdandπSd, we set xπ=(xπ(1), . . . , xπ(d)). A functionhdefined onRdis symmetric ifh(xπ)=h(x)for allπSd. A random vectorXtaking values inRdis exchangeable ifXπ is distributed asXfor allπSd. In particular a random vectorXtaking values in Rd is exchangeable if and only if its c.d.f. is symmetric. LetLsymdLd (resp.CsymLd) denote the set of symmetric c.d.f. (resp. copulas) onRd.

LetFLdand define its symmetrizationFsymLsymd by:

Fsym(x)= 1 d!

π∈Sd

F (xπ), x∈Rd. (16)

(8)

In particular, ifXis a random vector taking values inRdwith c.d.f.F andis a random variable independent ofX, uniformly distributed onSd, thenXis exchangeable with c.d.f.Fsym.

We define the following operator on the set of copulas of order statistics.

Definition 3.1. LetF(L1)d.ForCC we defineSF(C)as the copula of the exchangeable random variableX,whereXis a random vector onRdwith one-dimensional marginal c.d.f.s Fand copulaCandis an independent random variable uniform onSd.

The applicationSFis well-defined onCand takes values inCsym. In the above definition, with F=(Fi,1≤id), the one-dimensional marginal c.d.f.s ofXare equal to:

G=1 d

d i=1

Fi. (17)

Since the one-dimensional marginal c.d.f.sFi are continuous, we get thatGis continuous and thus the c.d.f. ofXbelongs toLd. In particular, thanks to Sklar’s theorem, the copula ofXis indeed uniquely defined.

Combining (8), (7) and (16), we can give an explicit formula forSF(C):

SF(C)(u)= 1 d!

π∈Sd

C F1

G1(uπ(1)) , . . . ,Fd

G1(uπ(d))

, uId. (18)

Remark 3.2. The copula SF(C) is not equal in general to the exchangeable copulaCsym de- fined similarly to (16) by Csym=(1/d!)

π∈SdC(xπ). However, this is the case if the one- dimensional marginal c.d.f.sFiare all equal, in which caseFi=Gfor all 1≤id.

IfXis a random vector onRd, letXOS=(X(1), . . . , X(d))be the order statistics ofX. The proof of the next lemma is elementary.

Lemma 3.3. LetXbe a random vector onRdwith c.d.f.F anda random variable indepen- dent ofX,uniformly distributed onSd.We have:

IfFLOSd ,then a.s.(X)OS=X.

IfFLsymd ,then(XOS)has the same distribution asX.

ForFFd, we define the set of copulasCsym(F)Csym as the image of COS(F) by the symmetrizing operatorSF:

Csym(F)=SF

COS(F)

. (19)

The following lemma is one of the main result of this section.

Lemma 3.4. LetFFd.The symmetrizing operatorSFis a bijection fromCOS(F)ontoCsym(F).

(9)

Proof. LetC1, C2COS(F)withSF(C1)=SF(C2). LetXandY be random vectors with one- dimensional marginal c.d.f.sFand copulaC1, C2, respectively. SinceC1, C2COS(F), we get thatXandY are order statistics. NoticeX andYhave the same one-dimensional marginals according to (17) and same copula given bySF(C1)=SF(C2). Therefore,X andYhave the same distribution. Thus, their corresponding order statistics(X)OSand(Y)OShave the same distribution. By Lemma3.3, we get thatX andY have the same distribution as well, which

impliesC1=C2.

Remark 3.5. We have in general Csym(F)=COS(F)Csym. One exception being when the marginal c.d.f.sFiare all equal. In this case, both sides reduce to one copula which is the Fréchet- Hoeffding upper bound copula:C+(u)=minu,uId.

3.2. Multidiagonals and characterization of C

sym

(F)

LetCCbe a copula andUa random vector with c.d.f.C. The maptC(t, . . . , t)fortI, which is called the diagonal section ofC, is the c.d.f. of maxU. We shall consider a generaliza- tion of the diagonal section ofC in the next definition.

Definition 3.6. LetCCbe a copula onRd andU a random vector with c.d.f.C.The multi- diagonal of the copulaC,δC=(i),1≤id),is thed-tuple of the one-dimensional marginal c.d.f.s ofUOS=(U(1), . . . , U(d))the order statistics ofU:for1≤id

δ(i)(t)=P(U(i)t ), tI.

We denote byD= {δC;CC}the set of multidiagonals. Notice thatDFd, see Remark3.7.

ForδDa multidiagonal, we defineCδ= {C;δC=δ}the set of copulas with multidiagonalδ.

A characterization of the setD is given by Theorem 1 of [13]: a vector of functions δ= (1), . . . , δ(d)) belongs toD if and only if δ(i) is a one-dimensional c.d.f. and the following conditions hold:

δ(i)δ(i+1), 1≤id−1, (20) d

i=1

δ(i)(s)=ds, 0≤s≤1. (21)

Remark 3.7. The condition (21) implies thatδ(i)L1, 1≤id, moreover they ared-Lipschitz.

Also, it is enough to knowd−1 functions fromδ(i), 1≤id, the remaining one is implicitly defined by (21). Condition (20) along with the continuity ofδ(i)implies that any multidiagonalδC is compatible with the continuous marginal distributions of an order statistics, thereforeDFd. Remark 3.8. Sinceδ(i), 1≤idare non-decreasing andd-Lipschitz, we have for almost every tI: 0≤(i))(t)d and thus|(i))(t)log((δ(i))(t))| ≤dlog(d)ford≥2. We deduce that ford≥2:

H (δ(i))dlog(d). (22)

(10)

Remark 3.9. LetCCsymbe a symmetric copula onRdandU a random vector with c.d.f.C.

We check that the multidiagonalδC=(i),1≤id)can be expressed in terms of the diagonal sections(C{i},1≤id)where for 1≤id:

C{i}(t)=P max

1kiUkt

=C(t, . . . , t

iterms

,1, . . . , 1

diterms

), tI.

According to 2.8 of [13], we have for 1≤id: δ(i)(t)=

d j=i

(−1)ji j−1

i−1 d

j C{j}(t), tI.

Conversely, we can express the functions(C{i},1≤id)withδC. For 1≤id andtI, we have:

C{i}(t)=P

1maxkiUkt

= d j=i

P

U(j )t| max

1kiUk=U(j ) P

1maxkiUk=U(j )

= d j=i

j1

i1

d

i

δ(j )(t),

where we used the definition ofδ(i)and the exchangeability ofU for the third equality.

The next technical lemma will be used in forthcoming proofs. Recall thatJ1 denotes the generalized inverse of a non-decreasing functionJ, see Section2.1for its definition and proper- ties, in particular,J1J (t )tfort∈R. Recall also that forx=(x1, . . . , xd)∈Rd, we write G(x)=(G(x1), . . . , G(xd)).

Lemma 3.10. LetX=(X1, . . . , Xd)be a random vector onRdwith one-dimensional marginals c.d.f.(Fi,1≤id).SetG=d

i=1Fi/d.We have for1≤id: P

XiG1G(t )

=P(Xit ), t∈R, that isFiG1G=Fi. (23) We also have forx∈Rd:

P

G(X)x

=P

XG1(x)

. (24)

Proof. Since G is the average of the non-decreasing functionsFi, if G(s)=G(s) for some s, s∈R, then we haveFi(s)=Fi(s)for every 1≤id. Thanks to (1), we haveGG1G(t )=G(t )and thusFiG1G(t )=Fi(t). This gives (23).

Recall definition (2) forIg(J )the set of points where the function J is increasing on their left. Since G is the average of the non-decreasing functions Fi, we deduce that Ig(G)=

(11)

1idIg(Fi). Notice that a.s.Xi belongs toIg(Fi). Thanks to (5), we get that a.s.{G(X)x} = {XG1(x)}. This gives (24).

We will also require the following lemma.

Lemma 3.11. LetX=(X1, . . . , Xd)be a random vector onRdwith one-dimensional marginals c.d.f.(Fi,1≤id).SetG=d

i=1Fi/d.We have for1≤id: FiG11

=GFi1. (25) Proof. Recall Definition (2) forIg(J )the set of points where the functionJis increasing on their left. Let 1≤id. Thanks to (4), we haveFi1(R)Ig(Fi)∪ {±∞}. SinceGis the average of the non-decreasing functionsFi, we deduce thatIg(G)=

1idIg(Fi). Thus we get:

Fi1(R)Ig(G)∪ {±∞}, (26) for all 1≤id. The functionFiG1is also bounded, non-decreasing and càd-làg therefore we have fort, s,∈R:

t

FiG11

(s) ⇐⇒ FiG1(t)s ⇐⇒ G1(t)Fi1(s)

⇐⇒ tGFi1(s),

where we used the equivalence of (1) for the first and second equivalence, (26) and the equiva- lence of (5) for the last. This gives that(FiG1)1=GFi1. In the following lemma, we show that forFFd, all copulas in Csym(F)share the same multidiagonal denoted byδF.

Lemma 3.12. LetF=(Fi,1≤id)Fd.LetCCOS(F)andU be a random vector with c.d.f.SF(C).LetδF=(i),1≤id)be the multidiagonal ofSF(C),that is the one-dimensional marginal c.d.f.s ofUOS,the order statistics ofU.We have thatδFdoes not depend onCand for 1≤id:

δ(i)=FiG1 and δ(i)1=GFi 1, (27) withGgiven by(17).Furthermore,Cis the unique copula ofUOS.

With obvious notation, we might simply writeδF=FG1, withGgiven by (17).

Proof of Lemma3.12. LetXbe a random vector of order statistics with marginalsFFdand copulaC. ThenSF(C)is the copula of the exchangeable random vectorX, whereis uniform onSd and independent ofX. We have already seen in (17) that the one-dimensional marginals ofX have the same distribution given byGL1. Thanks to (9), we deduce that the random vectorU, with c.d.f.SF(C), has the same distribution asG(X). SinceGis non-decreasing, this

(12)

implies that the order statistics ofU,UOS, has the same distribution asG((X)OS)that is as G(X), thanks to Lemma3.3. Then use (24) to get forx∈Rd:

P

UOSx

=P

G(X)x

=P

XG1(x)

. (28)

This gives the first part of the lemma as the multidiagonal ofUis the one-dimensional marginal c.d.f.s of its order statistics. The second equation in (27) is due to Lemma3.11. The fact thatCis the copula ofUOSand its uniqueness are due to (28) and the continuity ofδ(i), see Remark3.7.

The next proposition shows that the setCsym(F)is actually the set of symmetric copulas with diagonal sectionδF. This yields the main result of this section given by the subsequent corollary.

Proposition 3.13. LetFFd.We haveCsym(F)=CδFCsym. Proof. By Lemma3.12, we haveCsym(F)CδFCsym.

LetCCδFCsym andU be a random vector with c.d.f. C. LetG be given by (17). No- tice that X =G1(U ) is an exchangeable random vector with marginals G and copula C.

Thanks to Lemma3.3, the proof will be complete as soon as we prove that the one-dimensional marginal c.d.f.s of XOS=(X(1), . . . , X(d)), the order statistics of X, is given by F. Notice XOS=G1(UOS), withUOSthe order statistics ofU whose one-dimensional marginal c.d.f.s are given byδF. We have for 1≤idandt∈R:

P(X(i)t )=P

G1(U(i))t

=P

U(i)G(t )

=FiG1G(t )=Fi(t),

where we used (1) for the second equality, (27) for the third, and (23) for the last. This finishes

the proof.

Corollary 3.14. According to Proposition3.13, (19)and Lemma3.4,we get that for anyFFd, the symmetrizing operatorSFis a bijection betweenCOS(F)andCδFCsym.

We end this section by an ancillary result we shall use later.

Lemma 3.15. LetFFd.We haveJ(F)=JF).

Proof. LetF=(Fi,1≤id). We get, using (27) and the change of variables=G1(t)that:

I

δ(i)(t)log

δ(i1)(t)δ(i)(t)dt=

G−1((0,1))

Fi(ds)log

Fi1(s)Fi(s).

SincedFi=0 outsideG1((0,1))(asGis increasing as soon asFi is increasing), we get that the last integration above is also overR. We deduce that:

J δF

= d

i=2

RFi(ds)log

Fi1(s)Fi(s)=J(F).

(13)

3.3. Density and entropy of copulas in C

sym

(F)

We prove in this section thatSFpreserves the absolute continuity onCOS(F)forFFd and the entropy up to a constant. Let us introduce some notation. For marginalsFFd, let

iF=

s∈R,Fi1(s) >Fi(s)

for 2≤id. (29)

The complementary set(iF)cis the collection of the points whereFi1=Fi. We defineFI as:

F= d i=2

Fi

iFc

. (30)

By Remark3.7, we haveDFd, then the definitions (29) and (30) apply for allδD. In partic- ular, forδ=(1), . . . , δ(d))Dthe setsiδ, 2≤idare open subsets ofI, therefore(iδ)cI is a compact subset. This and the continuity ofδ(i)imply thatδ(i)((iδ)c)=δ(i)((iδ)cI )is also compact, henceδis compact. Notice that{0,1} ⊂δalways holds. We defineC0δ=CδC0the subset of absolutely continuous copulas with multidiagonalδand the subsetD0= {δD,Cδ0=

∅}of multidiagonals of absolutely continuous copulas. According to Theorem 2 of [13], the multidiagonalδbelongs toD0if and only if it belongs toDand the Lebesgue measure ofδis zero:|δ| =0.

Lemma 3.16. LetδD.We haveδD0if and only if for all2≤id,a.e.:

δ(i1)1(δ

i)c=δ(i) 1(δ

i)c=0. (31)

Furthermore,we have thatJ(δ) <+∞impliesδD0.

Proof. LetJ be a function defined onI, Lipschitz and non-decreasing. LetAbe a Borel subset ofI. We have:

J (A)=

1J (A)(t) dt= 1

0

1{sJ1J (A)}J(s) ds= 1

0

1A(s)J(s) ds,

where we used (3) and (5) for the last equality. This gives that |J (A)| =0 if and only if a.e.

J1A=0. Then use that δD0 if and only if |δ(i)((iδ)c)| =0 for all 1≤id and that δ(i1)((iδ)c)=δ(i)((iδ)c)to conclude thatδD0if and only if (31) holds for all 2≤id.

The last part of the lemma is clear.

Definition 3.17. LetFd0Fdbe the subset of marginalsFsuch that there exists an absolutely continuous c.d.f.of order statistics with marginalsF,that isLOSd (F)L0d=∅.

In particular, we haveD0Fd0. The next lemma gives a characterization of the setFd0.

Références

Documents relatifs

Madan and Yor [25] provided a sufficient condition on the marginals, under which the Azéma-Yor embedding stopping times corresponding to each marginal are automatically ordered, so

Remark 5.4 The analysis of the present section can be developed further to prove that, under the present conditions, the Az´ema-Yor solution of the Skorokhod embedding problem

The ancestral character estimation conducted at the family-level highlights the need for future examinations of pleuronectoids as so few species out of the total

For finite systems, due to surface energy effects, we expect a residual convexity for the system entropy after the transition leading directly to a bimodal distribution (accumulation

Supposing some small errors at the coefficients of the transfer function due to analog system, the proposed model approximates these errors using only the sampled output of the

In Section 6 we prove Theorem 1 by transforming the conditions on the order into conditions on the index and on the Frobenius conjugacy class with respect to certain finite

2014 The Dirichlet series of the multiplicative theory of numbers can be reinterpreted as partition functions of solvable quantum mechanical systems.. We

Distribution of the habitat suitability of the main malaria vector in French Guiana using Maximum Entropy modeling.. Yi Moua 1 , Emmanuel Roux 2 , Romain Girod 3 , Isabelle Dusfour 3