DOI:10.3150/16-BEJ868
Maximum entropy distribution of order statistics with given marginals
C R I S T I NA BU T U C E A1, J E A N - F R A N Ç O I S D E L M A S2,*, A N N E D U T F OY3and R I C H A R D F I S C H E R1,2,3,**
1LAMA (UPE-MLV), UPE, Marne La Vallée, France. E-mail:[email protected] 2CERMICS, École des Ponts, UPE, Champs-sur-Marne, France.
E-mail:*[email protected];**[email protected]
3EDF Research & Development, Industrial Risk Management Department, Palaiseau, France.
E-mail:[email protected]
We consider distributions of ordered random vectors with given one-dimensional marginal distributions.
We give an elementary necessary and sufficient condition for the existence of such a distribution with finite entropy. In this case, we give explicitly the density of the unique distribution which achieves the maximal entropy and compute the value of its entropy. This density is the unique one which has a product form on its support and the given one-dimensional marginals. The proof relies on the study of copulas with given one-dimensional marginal distributions for its order statistics.
Keywords:copula; entropy; maximum entropy; order statistics
1. Introduction
Order statistics, an almost surely non-decreasing sequence of random variables, have received a lot of attention due to the diversity of possible applications. If X =(X1, . . . , Xd) is a d-dimensional random vector, then its order statisticsXOS=(X(1), . . . , X(d))corresponds to the permutation of the components of X in the non-decreasing order, so that X(1)≤X(2)≤
· · · ≤X(d). The components of the underlying random vectorX are usually, but not necessar- ily, independent and identically distributed (i.i.d.). Special attention has been given to extreme valuesX(1)andX(d), the rangeX(d)−X(1), or the median value. Direct application of the dis- tribution of thekth largest order statistic occurs in various fields, such as climatology, extreme events, reliability, insurance, financial mathematics. We refer to the monographs of David and Nagaraja [8] and Arnold, Balakrishnan and Nagaraja [1] for a general overview on the subject of order statistics. We are interested in the dependence structure of order statistics, which has received great attention. In the i.i.d. case, Bickel [3] showed that any two order statistics are pos- itively correlated. The copula of the joint distribution ofX(1)andX(d)is derived in Schmitz [19]
with exact formulas for Kendall’sτ and Spearman’sρ. In Avérous, Genest and Kochar [2], it is shown that the dependence of thejth order statistic on theith order statistic decreases as the distance betweeniandj increases according to the bivariate monotone regression dependence ordering. The copula connecting the limit distribution of the two largest order statistics, called bi-extremal copula, is given by de Melo Mendes and Sanfins [9] with some additional properties.
Exact expressions for Pearson’s correlation coefficient, Kendall’sτ and Spearman’s ρ for any
1350-7265 © 2018 ISI/BS
two order statistics are obtained in Navarro and Balakrishnan [16]. For the non i.i.d. case, Kim and David [14] show that some pairs of order statistics can be negatively correlated, if the un- derlying random vector is sufficiently negatively dependent. Positive dependence measures for two order statistics are considered in Bolandet al.[4] when the underlying random variables are independent but arbitrarily distributed or when they are identically distributed but not indepen- dent. A generalization of these results for multivariate dependence properties is given by Hu and Chen [11]. See also Dubhashi and Häggström [10] for conditional distribution of order statistics.
Here, we focus on the cumulative distribution function (c.d.f.) of order statistics without refer- ring to an underlying distribution. That is, we consider random vectorsX=(X1, . . . , Xd)∈Rd such that a.s. X1≤ · · · ≤Xd and we suppose that the one-dimensional marginal distributions F=(Fi,1≤i≤d)are given, whereFi is the c.d.f. ofXi. A necessary and sufficient condition for the existence of a joint distribution of order statistics with one-dimensional marginalsFis that they are stochastically ordered, that isFi−1(x)≥Fi(x)for all 2≤i≤d, x∈R. With the marginals fixed, the joint distribution of the order statistics can be characterized by the connect- ing copula of the random vector, which contains all information on the dependence structure of the order statistics. Copulas of order statistics derived from an underlying i.i.d. sample were con- sidered in [2] in order to calculate measures of concordance between any two pairs of order statis- tics. For order statistics derived from a general parent distribution, Navarro and Spizzichino [17]
shows that the copula of the order statistics depends on the marginals and the copula of the parent distribution through an exchangeable copula and the average of the marginals. Construction of some copula of order statistics with given marginals were given in Lebrun and Dutfoy [15].
Our aim is to find the c.d.f. of order statistics of dimensiond with fixed marginals which maximizes the relative entropyHh defined by (13). In an information-theoretic interpretation, the maximum entropy distribution is the least informative among order statistics with given marginals. This problem appears in models where the one-dimensional marginals are well known (either from different experimentation or from physical models) but the dependence structure is unknown, see Butuceaet al. [7]. In [6], the same authors gave, when it exists, the maximum entropy distribution of(X1, . . . , Xd)such thatXiis uniformly distributed on[0,1]for 1≤i≤d and the distribution ofX(d)=max1≤i≤dXi is given, see Remark4.8.
For ad-dimensional random variable X=(X1, . . . , Xd)with c.d.f. F and copulaCF, the relative entropy ofF can be decomposed into the sum of the relative entropy (with respect to a one-dimensional probability densityh) of its one-dimensional marginals plus the entropy ofCF, see Lemma2.1. In our case, since the marginalsF=(Fi,1≤i≤d)are fixed, maximizing the entropy of the joint distributionF of an order statistics is equivalent to maximizing the entropy of its copulaCF under constraints, (see Section2.4). Therefore, we shall find the maximum entropy copula for order statistics with fixed marginal distributions.
The main result of this paper is given by Theorem5.4. It states that there exists a unique maximum entropy c.d.f.FFgiven by (52) if and only if:
d i=1
Hh(Fi)− d i=2
RFi(dt)log
Fi−1(t)−Fi(t)>−∞.
In this case,FFis absolutely continuous with densityfFdefined as, forx=(x1, . . . , xd)∈Rd: fF(x)=f1(x1)
d i=2
fi(xi)
Fi−1(xi)−Fi(xi)exp
− xi
xi−1
fi(s)
Fi−1(s)−Fi(s)ds 1LF(x), wherefiis the density function ofFi andLF⊂Rdis the set of ordered vectors(x1, . . . , xd), that isx1≤ · · · ≤xd, such thatFi−1(t) >Fi(t)for allt∈(xi−1, xi)and 2≤i≤d. See Example5.8 for an illustrative example.
The rest of the paper is organized as follows. In Section2, we introduce the basic notations and give the definition of the objects used in later parts. Section3describes the connection between copulas of order statistics with fixed marginals, and symmetric copulas with fixed multidiago- nals. The multidiagonal, given by Definition3.6, is the generalization of the diagonal section for copulas, which received great attention in copula literature. We show that there exists a one-to- one map between these two sets of copulas, see Corollary3.14. This bijection has good properties with respect to the entropy as explained in Proposition3.22. In Section4, we determine the max- imum entropy copula with fixed multidiagonal, see Theorem4.7. Since we obtain a symmetric copula as a result, this is also the maximum entropy symmetric copula with fixed multidiagonal.
In Section5, we use the one-to-one map between the two sets of copulas established in Section3 to give the maximum entropy copula of order statistics with fixed marginals. We finally obtain the density of the maximum entropy distribution for order statistics with fixed marginals by com- posing the maximum entropy copula with the marginals, see Theorem5.4. Section6contains the detailed proofs of Theorem4.7and other results from Section4. Section7 collects the main notations of the paper to facilitate reading.
2. Notations and definitions
2.1. Notations in R
dand generalized inverse
For a Borel setA⊂Rd, we write|A|for its Lebesgue measure. Forx=(x1, . . . , xd)∈Rd and y=(y1, . . . , yd)∈Rd, we writex≤yifxi≤yifor all 1≤i≤d. We define minx=min{xi,1≤ i≤d}and maxx=max{xi,1≤i≤d}forx=(x1, . . . , xd)∈Rd. IfJ is a real-valued function defined onR, we setJ (x)=(J (x1), . . . , J (xd)). We shall consider the following subsets ofRd:
S=
(x1, . . . , xd)∈Rd, x1≤ · · · ≤xd
and =S∩Id,
withI = [0,1]. In what follows, usuallyx, y will belong to Rd, ands, t toRorI. For a set A⊂R, we note byAc=R\Aits complementary set.
IfJ is a bounded non-decreasing càd-làg function defined onR. Its generalized inverseJ−1 is given byJ−1(t)=inf{s∈R;J (s)≥t}, fort∈R, with the convention that inf∅= +∞and infR= −∞. We have fors, t∈R:
J (t )≥s⇔t≥J−1(s), J−1◦J (t )≤t and J◦J−1◦J (t )=J (t ). (1)
We define the set of points whereJ is increasing on their left:
Ig(J )=
t∈R;u < t⇔J (u) < J (t )
. (2)
We have:
1(Ig(J ))cdJ =0 a.e., (3)
J−1(R)⊂Ig(J )∪ {±∞} (4) and fors∈R,t∈Ig(J ):
J (t )≤s⇔t≤J−1(s) and J−1◦J (t )=t. (5) Notice that ifJ is continuous in addition, then we have fort∈J (R):
J◦J−1(t)=t. (6)
2.2. C.d.f. and copula
LetX=(X1, . . . , Xd)be a random vector onRd. Its cumulative distribution function (c.d.f.), denoted by F is defined by: F (x)=P(X≤x),x∈Rd. The corresponding one-dimensional marginal c.d.f.s are(Fi,1≤i≤d)withFi(t)=P(Xi≤t ),t∈R. The c.d.f.F is called a copula ifXi is uniform onI= [0,1]for all 1≤i≤d. (Notice a copula is characterized by its values on Idonly.)
We defineLdas the set of c.d.f.s onRd whose one-dimensional marginal c.d.f.s are continu- ous, andC⊂Ldas the subset of copulas. We setL0d(resp.C0) the subset of absolutely continuous c.d.f. (resp. copulas) onRd.
Let us define for a c.d.f.F with one-dimensional marginals(Fi,1≤i≤d)the functionCF
defined onId:
CF(y)=F
F1−1(y1), . . . , Fd−1(yd)
, y=(y1, . . . , yd)∈Id. (7) IfF ∈Ld, then CF defined by (7) is a copula thanks to (6). According to Sklar’s theorem,F is then completely characterized by its one-dimensional marginal c.d.f.s(Fi,1≤i≤d)and the associated copulaCF which contains all information on the dependence:
F (x)=CF
F1(x1), . . . , Fd(xd)
, x=(x1, . . . , xd)∈Rd. (8) Equivalently, ifX=(X1, . . . , Xd)has c.d.f.F, thenCF is the c.d.f. of the random vector:
F1(X1), . . . , Fd(Xd)
. (9)
2.3. Order statistics
For a d-dimensional c.d.f. F, we write PF for the distribution of a random vector X = (X1, . . . , Xd)with c.d.f.F. Ad-dimensional c.d.f.F is a c.d.f. of order statistics (and we shall say thatX is a vector of order statistics) ifPF(X1≤X2≤ · · · ≤Xd)=1. Let us denote by LOSd ⊂Ld the set of all c.d.f.s of order statistics with continuous one-dimensional marginal c.d.f.s. Thed-tuples(Fi,1≤i≤d)of marginal c.d.f.s then verifyFi−1≥Fi for all 2≤i≤d. LetFdbe the set ofd-tuples of continuous one-dimensional c.d.f.s compatible with the marginal c.d.f.s of order statistics:
Fd=
F=(Fi,1≤i≤d)∈(L1)d;Fi−1≥Fi,∀2≤i≤d
. (10)
For a givenF=(Fi,1≤i≤d) inFd, we define the set of c.d.f.s F of order statistics with marginal c.d.f.sF:
LOSd (F)=
F ∈LOSd ;Fi=Fi,1≤i≤d
. (11)
IfF∈Fd, then we haveLOSd (F)=∅, since the c.d.f. of(F−11(U ), . . . ,F−d1(U )),U uniformly distributed onI, belongs toLOSd (F). We defineCOS(F)the set of copulas of order statistics with marginalsF:
COS(F)=
CF∈C;F ∈LOSd (F)
. (12)
According to Sklar’s theorem, the mapF →CF is a bijection betweenLOSd (F)andCOS(F)if F∈Fd.
2.4. Entropy
Leth be a reference probability density function onR. We define h⊗d(x)=d
i=1h(xi)for x=(x1, . . . , xd)∈Rd. The relative Shannon-entropy for a c.d.f.F∈Ldis given by:
Hh(F )=
⎧⎨
⎩
−∞, ifF ∈Ld\L0d,
−
Rdflog f/ h⊗d
, ifF ∈L0d, (13)
withf the density ofF. Notice thatHh(F )∈ [−∞,0]is well defined. We will use the notation Hh(X)=Hh(F )ifXis a random vector with c.d.f.F andHh(f )=Hh(F )ifF has densityf. We shall simply writeH (F )(resp.H (X)andH (f )) instead ofHh(F )(resp.Hh(X)andHh(f )) whenh=1[0,1]. Note thatH (F )can be finite only ifF is the c.d.f. of a probability distribution on[0,1]d.
According to the next lemma, the relative entropy of anyF ∈L1cd can be decomposed into the relative entropy of the one-dimensional marginals c.d.f.(Fi,1≤i≤d)and the entropy of the associated copulaCF.
Lemma 2.1. LetF∈L1cd .We have:
Hh(F )=H (CF)+ d i=1
Hh(Fi). (14)
Proof. It is left to the reader to check thatF has a density, sayf, if and only ifFi has a density, sayfi, for 1≤i≤dandCF has a density, saycF. Furthermore, in this case, we have:
f (x)=cF
F1(x1), . . . , Fd(xd)d
i=1
fi(xi) a.e. forx=(x1, . . . , xd)∈Rd, as well as, with the convention 0/0=0,
cF(u)=f (F1−1(u1), . . . , Fd−1(ud)) d
i=1fi(Fi−1(ui)) a.e. foru=(u1, . . . , ud)∈Id.
On the one hand, ifF does not have a density then we haveHh(F )= −∞. Since F does not have a density, then one of theFi orCF does not have a density either, and thenH (CF)+ d
i=1Hh(Fi)= −∞. Thus (14) holds.
On the other hand, let us assume thatF has a density, sayf. Elementary computations give withx=(x1, . . . , xd)and 1≤i≤d:
Hh(Fi)= −
Rfi(xi)log
(fi/ h)(xi)
dxi= −
Rdf (x)log
(fi/ h)(xi) dx.
We also have withu=(u1, . . . , ud)andx=(x1, . . . , xd):
H (CF)= −
[0,1]dcFlog(cF)= −
f (F1−1(u1), . . . , Fd−1(ud)) d
i=1fi(Fi−1(ui)) log cF(u)
du
= −
f (x)log cF
F1(x1), . . . , Fd(xd) dx,
where, for the last equality, we used the change of variableFi(xi)=ui (forxi ∈Ig(Fi)) so that Fi−1(ui)=Fi−1◦Fi(xi)=xi holdsfi(xi) dxi-a.e and thatf (x) dx=0 on(⊗di=1Ig(Fi))c. Then use thatf (x)=cF(F1(x1), . . . , Fd(xd)) d
i=1fi(xi)a.e. forx =(x1, . . . , xd)∈Rd to deduce thatH (CF)+d
i=1Hh(Fi)= −
flog(f/ h⊗d)=Hh(F ).
Remark 2.2. Notice that ifFi has densityfi for 1≤i≤d, then one can choose the reference probability densityh(t)=1dd
i=1fi(t)so thatHh(Fi)≥ −log(d). In this case,Hh(F )is finite if and only ifH (CF)is finite, and we have:
H (CF)=Hh(F )− d i=1
Hh(Fi)= −
f (x)log
f (x)
d
i=1fi(xi) dx.
Thus,H (CF)is the relative entropy of the c.d.f.F with respect to the probability distribution with c.d.f.⊗di=1Fi of independent real valued random variables with the same one-dimensional marginal as the one with c.d.f.F. This emphasizes the fact thatHh(F )−d
i=1Hh(Fi), when it is well defined, does not depend onh.
ForF=(Fi,1≤i≤d)∈Fd, we defineJ(F)taking values in[0,+∞]by:
J(F)= d
i=2
RFi(dt)log
Fi−1(t)−Fi(t). (15) Our aim is to find the c.d.f.F∗∈LOSd (F)which maximizes the entropyHh. We shall see that this is possible if and only ifJ(F)is finite. From an information theory point of view, this is the distribution which is the least informative among distributions of order statistics with given one-dimensional marginal c.d.f.sF. Since the vector of marginal distribution functionsFis fixed, thanks to (14), we notice thatHh(F )is maximal onLOSd (F)if and only ifH (CF)is maximal on COS(F). Therefore, we focus on finding the copulaC∗∈COS(F)which maximizes the entropyH. We will give the solution of this problem in Section5under some additional hypotheses onF.
3. Symmetric copulas with given order statistics
In this section, we introduce an operator on the setCOS(F)of copulas of order statistics with fixed marginal c.d.f.sF. This operator assigns to a copulaC∈COS(F)the copula of the exchangeable random vector associated to the order statistics with marginal c.d.f.sFand copulaC. We show that this operator is a bijection betweenCOS(F)and a set of symmetric copulas which can be characterized by their multidiagonal, which is a generalization of the well-known diagonal sec- tion of copulas. This bijection has good properties with respect to the entropyH, giving us a problem equivalent to maximizingHonCOS(F). We shall solve this problem in Section4.
3.1. Symmetric copulas
Forx=(x1, . . . , xd)∈Rdwe definexOS=(x(1), . . . , x(d))the ordered vector (increasing order) ofx, wherex(1)≤ · · · ≤x(d)andd
i=1δˆxi=d
i=1δˆx(i), withδˆt the Dirac mass att∈R.
LetSdbe the set of permutations on{1, . . . , d}. Forx=(x1, . . . , xd)∈Rdandπ∈Sd, we set xπ=(xπ(1), . . . , xπ(d)). A functionhdefined onRdis symmetric ifh(xπ)=h(x)for allπ∈Sd. A random vectorXtaking values inRdis exchangeable ifXπ is distributed asXfor allπ∈Sd. In particular a random vectorXtaking values in Rd is exchangeable if and only if its c.d.f. is symmetric. LetLsymd ⊂Ld (resp.Csym⊂Ld) denote the set of symmetric c.d.f. (resp. copulas) onRd.
LetF ∈Ldand define its symmetrizationFsym∈Lsymd by:
Fsym(x)= 1 d!
π∈Sd
F (xπ), x∈Rd. (16)
In particular, ifXis a random vector taking values inRdwith c.d.f.F andis a random variable independent ofX, uniformly distributed onSd, thenXis exchangeable with c.d.f.Fsym.
We define the following operator on the set of copulas of order statistics.
Definition 3.1. LetF∈(L1)d.ForC∈C we defineSF(C)as the copula of the exchangeable random variableX,whereXis a random vector onRdwith one-dimensional marginal c.d.f.s Fand copulaCandis an independent random variable uniform onSd.
The applicationSFis well-defined onCand takes values inCsym. In the above definition, with F=(Fi,1≤i≤d), the one-dimensional marginal c.d.f.s ofXare equal to:
G=1 d
d i=1
Fi. (17)
Since the one-dimensional marginal c.d.f.sFi are continuous, we get thatGis continuous and thus the c.d.f. ofXbelongs toLd. In particular, thanks to Sklar’s theorem, the copula ofXis indeed uniquely defined.
Combining (8), (7) and (16), we can give an explicit formula forSF(C):
SF(C)(u)= 1 d!
π∈Sd
C F1
G−1(uπ(1)) , . . . ,Fd
G−1(uπ(d))
, u∈Id. (18)
Remark 3.2. The copula SF(C) is not equal in general to the exchangeable copulaCsym de- fined similarly to (16) by Csym=(1/d!)
π∈SdC(xπ). However, this is the case if the one- dimensional marginal c.d.f.sFiare all equal, in which caseFi=Gfor all 1≤i≤d.
IfXis a random vector onRd, letXOS=(X(1), . . . , X(d))be the order statistics ofX. The proof of the next lemma is elementary.
Lemma 3.3. LetXbe a random vector onRdwith c.d.f.F anda random variable indepen- dent ofX,uniformly distributed onSd.We have:
• IfF ∈LOSd ,then a.s.(X)OS=X.
• IfF ∈Lsymd ,then(XOS)has the same distribution asX.
ForF∈Fd, we define the set of copulasCsym(F)⊂Csym as the image of COS(F) by the symmetrizing operatorSF:
Csym(F)=SF
COS(F)
. (19)
The following lemma is one of the main result of this section.
Lemma 3.4. LetF∈Fd.The symmetrizing operatorSFis a bijection fromCOS(F)ontoCsym(F).
Proof. LetC1, C2∈COS(F)withSF(C1)=SF(C2). LetXandY be random vectors with one- dimensional marginal c.d.f.sFand copulaC1, C2, respectively. SinceC1, C2∈COS(F), we get thatXandY are order statistics. NoticeX andYhave the same one-dimensional marginals according to (17) and same copula given bySF(C1)=SF(C2). Therefore,X andYhave the same distribution. Thus, their corresponding order statistics(X)OSand(Y)OShave the same distribution. By Lemma3.3, we get thatX andY have the same distribution as well, which
impliesC1=C2.
Remark 3.5. We have in general Csym(F)=COS(F)∩Csym. One exception being when the marginal c.d.f.sFiare all equal. In this case, both sides reduce to one copula which is the Fréchet- Hoeffding upper bound copula:C+(u)=minu,u∈Id.
3.2. Multidiagonals and characterization of C
sym(F)
LetC∈Cbe a copula andUa random vector with c.d.f.C. The mapt→C(t, . . . , t)fort∈I, which is called the diagonal section ofC, is the c.d.f. of maxU. We shall consider a generaliza- tion of the diagonal section ofC in the next definition.
Definition 3.6. LetC∈Cbe a copula onRd andU a random vector with c.d.f.C.The multi- diagonal of the copulaC,δC=(δ(i),1≤i≤d),is thed-tuple of the one-dimensional marginal c.d.f.s ofUOS=(U(1), . . . , U(d))the order statistics ofU:for1≤i≤d
δ(i)(t)=P(U(i)≤t ), t∈I.
We denote byD= {δC;C∈C}the set of multidiagonals. Notice thatD⊂Fd, see Remark3.7.
Forδ∈Da multidiagonal, we defineCδ= {C;δC=δ}the set of copulas with multidiagonalδ.
A characterization of the setD is given by Theorem 1 of [13]: a vector of functions δ= (δ(1), . . . , δ(d)) belongs toD if and only if δ(i) is a one-dimensional c.d.f. and the following conditions hold:
δ(i)≥δ(i+1), 1≤i≤d−1, (20) d
i=1
δ(i)(s)=ds, 0≤s≤1. (21)
Remark 3.7. The condition (21) implies thatδ(i)∈L1, 1≤i≤d, moreover they ared-Lipschitz.
Also, it is enough to knowd−1 functions fromδ(i), 1≤i≤d, the remaining one is implicitly defined by (21). Condition (20) along with the continuity ofδ(i)implies that any multidiagonalδC is compatible with the continuous marginal distributions of an order statistics, thereforeD⊂Fd. Remark 3.8. Sinceδ(i), 1≤i≤dare non-decreasing andd-Lipschitz, we have for almost every t∈I: 0≤(δ(i))(t)≤d and thus|(δ(i))(t)log((δ(i))(t))| ≤dlog(d)ford≥2. We deduce that ford≥2:
H (δ(i))≤dlog(d). (22)
Remark 3.9. LetC∈Csymbe a symmetric copula onRdandU a random vector with c.d.f.C.
We check that the multidiagonalδC=(δ(i),1≤i≤d)can be expressed in terms of the diagonal sections(C{i},1≤i≤d)where for 1≤i≤d:
C{i}(t)=P max
1≤k≤iUk≤t
=C(t, . . . , t
iterms
,1, . . . , 1
d−iterms
), t∈I.
According to 2.8 of [13], we have for 1≤i≤d: δ(i)(t)=
d j=i
(−1)j−i j−1
i−1 d
j C{j}(t), t∈I.
Conversely, we can express the functions(C{i},1≤i≤d)withδC. For 1≤i≤d andt∈I, we have:
C{i}(t)=P
1max≤k≤iUk≤t
= d j=i
P
U(j )≤t| max
1≤k≤iUk=U(j ) P
1max≤k≤iUk=U(j )
= d j=i
j−1
i−1
d
i
δ(j )(t),
where we used the definition ofδ(i)and the exchangeability ofU for the third equality.
The next technical lemma will be used in forthcoming proofs. Recall thatJ−1 denotes the generalized inverse of a non-decreasing functionJ, see Section2.1for its definition and proper- ties, in particular,J−1◦J (t )≤tfort∈R. Recall also that forx=(x1, . . . , xd)∈Rd, we write G(x)=(G(x1), . . . , G(xd)).
Lemma 3.10. LetX=(X1, . . . , Xd)be a random vector onRdwith one-dimensional marginals c.d.f.(Fi,1≤i≤d).SetG=d
i=1Fi/d.We have for1≤i≤d: P
Xi≤G−1◦G(t )
=P(Xi≤t ), t∈R, that isFi◦G−1◦G=Fi. (23) We also have forx∈Rd:
P
G(X)≤x
=P
X≤G−1(x)
. (24)
Proof. Since G is the average of the non-decreasing functionsFi, if G(s)=G(s) for some s, s∈R, then we haveFi(s)=Fi(s)for every 1≤i≤d. Thanks to (1), we haveG◦G−1◦ G(t )=G(t )and thusFi◦G−1◦G(t )=Fi(t). This gives (23).
Recall definition (2) forIg(J )the set of points where the function J is increasing on their left. Since G is the average of the non-decreasing functions Fi, we deduce that Ig(G)=
1≤i≤dIg(Fi). Notice that a.s.Xi belongs toIg(Fi). Thanks to (5), we get that a.s.{G(X)≤ x} = {X≤G−1(x)}. This gives (24).
We will also require the following lemma.
Lemma 3.11. LetX=(X1, . . . , Xd)be a random vector onRdwith one-dimensional marginals c.d.f.(Fi,1≤i≤d).SetG=d
i=1Fi/d.We have for1≤i≤d: Fi◦G−1−1
=G◦Fi−1. (25) Proof. Recall Definition (2) forIg(J )the set of points where the functionJis increasing on their left. Let 1≤i≤d. Thanks to (4), we haveFi−1(R)⊂Ig(Fi)∪ {±∞}. SinceGis the average of the non-decreasing functionsFi, we deduce thatIg(G)=
1≤i≤dIg(Fi). Thus we get:
Fi−1(R)⊂Ig(G)∪ {±∞}, (26) for all 1≤i≤d. The functionFi◦G−1is also bounded, non-decreasing and càd-làg therefore we have fort, s,∈R:
t≥
Fi◦G−1−1
(s) ⇐⇒ Fi◦G−1(t)≥s ⇐⇒ G−1(t)≥Fi−1(s)
⇐⇒ t≥G◦Fi−1(s),
where we used the equivalence of (1) for the first and second equivalence, (26) and the equiva- lence of (5) for the last. This gives that(Fi◦G−1)−1=G◦Fi−1. In the following lemma, we show that forF∈Fd, all copulas in Csym(F)share the same multidiagonal denoted byδF.
Lemma 3.12. LetF=(Fi,1≤i≤d)∈Fd.LetC∈COS(F)andU be a random vector with c.d.f.SF(C).LetδF=(δ(i),1≤i≤d)be the multidiagonal ofSF(C),that is the one-dimensional marginal c.d.f.s ofUOS,the order statistics ofU.We have thatδFdoes not depend onCand for 1≤i≤d:
δ(i)=Fi◦G−1 and δ−(i)1=G◦F−i 1, (27) withGgiven by(17).Furthermore,Cis the unique copula ofUOS.
With obvious notation, we might simply writeδF=F◦G−1, withGgiven by (17).
Proof of Lemma3.12. LetXbe a random vector of order statistics with marginalsF∈Fdand copulaC. ThenSF(C)is the copula of the exchangeable random vectorX, whereis uniform onSd and independent ofX. We have already seen in (17) that the one-dimensional marginals ofX have the same distribution given byG∈L1. Thanks to (9), we deduce that the random vectorU, with c.d.f.SF(C), has the same distribution asG(X). SinceGis non-decreasing, this
implies that the order statistics ofU,UOS, has the same distribution asG((X)OS)that is as G(X), thanks to Lemma3.3. Then use (24) to get forx∈Rd:
P
UOS≤x
=P
G(X)≤x
=P
X≤G−1(x)
. (28)
This gives the first part of the lemma as the multidiagonal ofUis the one-dimensional marginal c.d.f.s of its order statistics. The second equation in (27) is due to Lemma3.11. The fact thatCis the copula ofUOSand its uniqueness are due to (28) and the continuity ofδ(i), see Remark3.7.
The next proposition shows that the setCsym(F)is actually the set of symmetric copulas with diagonal sectionδF. This yields the main result of this section given by the subsequent corollary.
Proposition 3.13. LetF∈Fd.We haveCsym(F)=CδF∩Csym. Proof. By Lemma3.12, we haveCsym(F)⊂CδF∩Csym.
LetC∈CδF ∩Csym andU be a random vector with c.d.f. C. LetG be given by (17). No- tice that X =G−1(U ) is an exchangeable random vector with marginals G and copula C.
Thanks to Lemma3.3, the proof will be complete as soon as we prove that the one-dimensional marginal c.d.f.s of XOS=(X(1), . . . , X(d)), the order statistics of X, is given by F. Notice XOS=G−1(UOS), withUOSthe order statistics ofU whose one-dimensional marginal c.d.f.s are given byδF. We have for 1≤i≤dandt∈R:
P(X(i)≤t )=P
G−1(U(i))≤t
=P
U(i)≤G(t )
=Fi◦G−1◦G(t )=Fi(t),
where we used (1) for the second equality, (27) for the third, and (23) for the last. This finishes
the proof.
Corollary 3.14. According to Proposition3.13, (19)and Lemma3.4,we get that for anyF∈Fd, the symmetrizing operatorSFis a bijection betweenCOS(F)andCδF∩Csym.
We end this section by an ancillary result we shall use later.
Lemma 3.15. LetF∈Fd.We haveJ(F)=J(δF).
Proof. LetF=(Fi,1≤i≤d). We get, using (27) and the change of variables=G−1(t)that:
I
δ(i)(t)log
δ(i−1)(t)−δ(i)(t)dt=
G−1((0,1))
Fi(ds)log
Fi−1(s)−Fi(s).
SincedFi=0 outsideG−1((0,1))(asGis increasing as soon asFi is increasing), we get that the last integration above is also overR. We deduce that:
J δF
= d
i=2
RFi(ds)log
Fi−1(s)−Fi(s)=J(F).
3.3. Density and entropy of copulas in C
sym(F)
We prove in this section thatSFpreserves the absolute continuity onCOS(F)forF∈Fd and the entropy up to a constant. Let us introduce some notation. For marginalsF∈Fd, let
iF=
s∈R,Fi−1(s) >Fi(s)
for 2≤i≤d. (29)
The complementary set(iF)cis the collection of the points whereFi−1=Fi. We defineF⊂I as:
F= d i=2
Fi
iFc
. (30)
By Remark3.7, we haveD⊂Fd, then the definitions (29) and (30) apply for allδ∈D. In partic- ular, forδ=(δ(1), . . . , δ(d))∈Dthe setsiδ, 2≤i≤dare open subsets ofI, therefore(iδ)c∩I is a compact subset. This and the continuity ofδ(i)imply thatδ(i)((iδ)c)=δ(i)((iδ)c∩I )is also compact, henceδis compact. Notice that{0,1} ⊂δalways holds. We defineC0δ=Cδ∩C0the subset of absolutely continuous copulas with multidiagonalδand the subsetD0= {δ∈D,Cδ0=
∅}of multidiagonals of absolutely continuous copulas. According to Theorem 2 of [13], the multidiagonalδbelongs toD0if and only if it belongs toDand the Lebesgue measure ofδis zero:|δ| =0.
Lemma 3.16. Letδ∈D.We haveδ∈D0if and only if for all2≤i≤d,a.e.:
δ(i−1)1(δ
i)c=δ(i) 1(δ
i)c=0. (31)
Furthermore,we have thatJ(δ) <+∞impliesδ∈D0.
Proof. LetJ be a function defined onI, Lipschitz and non-decreasing. LetAbe a Borel subset ofI. We have:
J (A)=
1J (A)(t) dt= 1
0
1{s∈J−1◦J (A)}J(s) ds= 1
0
1A(s)J(s) ds,
where we used (3) and (5) for the last equality. This gives that |J (A)| =0 if and only if a.e.
J1A=0. Then use that δ∈D0 if and only if |δ(i)((iδ)c)| =0 for all 1≤i≤d and that δ(i−1)((iδ)c)=δ(i)((iδ)c)to conclude thatδ∈D0if and only if (31) holds for all 2≤i≤d.
The last part of the lemma is clear.
Definition 3.17. LetFd0⊂Fdbe the subset of marginalsFsuch that there exists an absolutely continuous c.d.f.of order statistics with marginalsF,that isLOSd (F)∩L0d=∅.
In particular, we haveD0⊂Fd0. The next lemma gives a characterization of the setFd0.