• Aucun résultat trouvé

On certain almost brownian filtrations

N/A
N/A
Protected

Academic year: 2022

Partager "On certain almost brownian filtrations"

Copied!
21
0
0

Texte intégral

(1)

www.elsevier.com/locate/anihpb

On certain almost Brownian filtrations

M. Émery

IRMA, 7, rue René Descartes, 67 084 Strasbourg cedex, France Received 11 September 2003; accepted 13 January 2005

Que peut-on affirmer, puisque ce qu’on avait cru probable d’abord s’est montré faux ensuite, et se trouve en troisième lieu être vrai?

M. Proust, Le côté de Guermantes.

Abstract

A consequence of Vershik’s results on discrete-time filtrations is the existence, in continuous time, of filtrationsF =(Ft)t0 which are “Brownian after zero” (that is, for eachε >0,Fε=(Fε+t)t0is generated byFεand someFε-Brownian motion), but not generated byF0and any Brownian motion. Among the filtrations that are Brownian after zero, how are the truly Brownian ones characterized? An answer is given by the self-coupling criterion (ii) of Theorem 1. This criterion is always satisfied whenF is immersible into the filtration of an infinite-dimensional Brownian motion.

2005 Elsevier SAS. All rights reserved.

Résumé

Une conséquence de la théorie des filtrations élaborée par Vershik il y a plus de trente ans est l’existence, en temps continu, de filtrationsF =(Ft)t0telles que, pour chaqueε >0,(Fε+t)t0soit engendrée parFε et par un mouvement brownien (nous les dirons « browniennes après zéro »), mais queF ne soit pas engendrée parF0et un mouvement brownien. Comment caractériser les filtrations browniennes parmi les filtrations browniennes après zéro ? Le théorème 1(ii) répond à cette question par un critère d’auto-couplage ; ce critère est toujours satisfait lorsqueF est immersible dans la filtration du mouvement Brownien à une infinité de dimensions.

2005 Elsevier SAS. All rights reserved.

This work was done while visiting Kyoto University. I am very thankful to the Research Institute for Mathemat- ical Sciences and to Professor S. Watanabe for their invitation and for making my stay most enjoyable—except for the terrible shock from Strasbourg in late January 2003.

In 1995, L. Dubins, J. Feldman, M. Smorodinsky and B. Tsirelson [4] have shown that it is possible to replace the Wiener measure by an equivalent probability in such a way that the new filtered probability space is no longer Brownian (that is, it is not generated by any Brownian motion whatsoever). The filtered probability space they

E-mail address: [email protected] (M. Émery).

0246-0203/$ – see front matter 2005 Elsevier SAS. All rights reserved.

doi:10.1016/j.anihpb.2005.01.001

(2)

have constructed has two further features: first, it is Brownian on the interval [ε,∞[ for each ε >0 (that is, Fε=(Fε+t)t0 is generated byFε and some Fε-Brownian motion); second, it is not immersible (this term will be defined soon) into “the” filtered probability space generated by an infinite-dimensional Brownian motion.

The latter property looks stronger than mere non-Brownianness; but we shall see in Corollary 1 that, for a filtration which is Brownian on[ε,∞[ for eachε >0, this property is in fact equivalent to being non-Brownian. In other words, consider a filtrationF which is Brownian on[ε,∞[for eachε >0; Corollary 1 says thatF is Brownian as soon as it is immersible into some infinite-dimensional Brownian filtration.

Most ideas and arguments are borrowed, some almost verbatim, from the theory of standard filtrations in dis- crete, negative time, mostly due to Vershik [14]. For instance, condition (iv) of Theorem 1 is a straightforward adaptation of his first-level standardness criterion (Condition 2 in Theorem 3.2 of [14]), and our proof of it just repeats his surprising argument. Similarly, the self-coupling condition (ii) of Theorem 1 is the continuous-time analogue of the discrete-time property called “I-cosiness” in [6], which is equivalent to Vershik’s standardness criterion [14].

But Vershik’s criterion, with its tower of measures on iterated measure spaces, does not lend itself to a restate- ment in continuous time; so the argument in [6], where Vershik’s criterion is used to show that I-cosiness entails standardness, does not adapt to our setting. Fortunately, another proof, bypassing Vershik’s criterion, has been given by S. Laurent [9]. Laurent’s method can be made to work in continuous time, but some adaptation is required: his approach relies on the explicit knowledge of the extremal points of some finite-dimensional set of joint laws; in continuous time, the set becomes infinite-dimensional, and has too many extremal points. This difficulty can be overcome by replacing the extremality argument with a density property (our Proposition 2).

This proposition says that, ifF is a filtration and ifX andY are twoF-Brownian motions, their joint law L(X, Y )can be approximated by laws of the formL(X, Y), whereXandY are two Brownian motions gen- erating the same filtration. This was unexpected in the case when, for instance,Y =Xon[0, T]and dY = −dX afterT, whereT is a stopping time independent ofXand ofY.

Conventions, notation and definitions

The words ‘positive’ and ‘increasing’ are always understood in the broad sense: the null function is positive and increasing. An open interval is written]s, t[; similarly[s, t[and]s, t]denote half-open intervals.

By a probability space, we always mean a triple(Ω,A,P)where theσ-fieldAisP-complete and essentially separable. By a sub-σ-field of A, we mean an (A,P)-complete sub-σ-field of A; it automatically inherits the property of being essentially separable. IfC is a class of events, orX a r.v. or a process,σ (C)orσ (X)denotes theσ-field generated byCorX, and by all negligible events ofA; similarly, the expression ‘generated by’ means

‘generated by . . . and by the null events’. A random variable is aP-equivalence class of measurable maps; L0de- notes the space of all a.s. finite r.v., endowed with the topology of convergence in probability; the elements of Lp are also defined up toP-equivalence. IfBandCare two sub-σ-fields ofA,B∨Cdenotes theσ-fieldσ (B∪C).

An isomorphism between two probability spaces(Ω,A,P)and(Ω,A,P)is a bijection between the quo- tientσ-fieldsA/PandA/Pthat preserves theσ-field structures and the probabilities; it extends (uniquely) to random variables.

A filtration on(Ω,A,P)is an increasing, right-continuous familyF =(Ft)t0of sub-σ-fields ofA; a filtered probability space is a system(Ω,A,P,F)whereF is a filtration. IfF is a filtration,Fdenotes the σ-field

tFt. Given two filtrationsF andGon(Ω,A,P), we say that F is included inGifFt ⊂Gt for eacht; and we say thatF is immersed inGif furthermore everyF-martingale is aG-martingale. We denote byF ∨Gthe smallest filtration containingF andG; it is given by(F ∨G)t =

ε>0(Ft+ε∨Gt+ε). We say thatF andGare jointly immersed if there exists a filtrationH on the same probability space such that each ofF andGis immersed inH. In that case, one can always chooseH=F ∨G. [Proof: eachF-martingale is anH-martingale adapted to the smaller filtrationF ∨G, hence also anF ∨G-martingale; similarly forG-martingales.]

(3)

Given two filtered probability spaces(Ω,A,P,F)and(Ω,A,P,F), we say thatF andFare iso- morphic if there exists an isomorphismΨ between the probability spaces(Ω,F ,P)and(Ω,F,P)such that, for eacht,Ft andFt are in correspondence byΨ. And we say thatF is immersible into F if F is isomorphic to a filtration immersed inF.

By convention, a Brownian motion (or, shortly, a BM) is always started at the origin. It may be one-dimensional, ord-dimensional, whered∞; the cased= ∞means that the process consists of countably many independent components, each of them a real BM. We shall abbreviate ‘d-dimensional BM’ by BMd. We call Brownian (more precisely:d-Brownian) any filtration generated by some BM (more precisely: by some BMd). AnF-BM (orF- BMd) is a (d-dimensional) Brownian motion for the filtration F: its increment on [s, t]is independent of Fs (equivalently, it is a BM and anF-martingale; or it is a BM whose natural filtration is immersed inF).

A non-Brownian example

To make the kind of situation we have in mind less abstract, we start with an example. It will not be used nor referred to in the sequel; skipping it is harmless.

Example 1. Suppose given a one-dimensional Brownian motionB, a finite setA (called the alphabet) with at least two elements, and a sequenceW =(Wn)n0 where, for eachn, Wn is a random word of length 2n, that is, a random element ofA2n. Each word Wn is uniformly distributed on A2n, and is independent of the whole processB; butW is far from independent ofB, for we supposeWn to be the first (resp. second) half of the twice longer wordWn+1iffB2n is larger (resp. smaller) thanB2(n+1). It is easy to show the existence of such a pair (B, W ): first, takingWnindependent ofB, construct the law of(B, Wn, Wn1, . . . , W0), and verify thatWn1too is uniform and independent ofB; then, take a projective limit.

FromBandW, define a filtrationF by Ft=σ

Bs, s∈ [0, t]

σ (Wn, 2nt ).

It is possible to show thatF is indeed a (right-continuous) filtration, enjoying the following three properties:

(1) for 0< st,Ft=Fsσ (BuBs, u∈ [s, t]),

(2) F0(=F0+) is degenerate: everyF0+-measurable r.v. is a.s. constant;

(3) F is not Brownian.

The first property is very easy: at time 2n the new word Wn is observed; it is a function of the previously observed wordWn+1and of the increment ofBbetween 2(n+1)and 2n, so the increments ofBsuffice to generate all information necessary to incrementF. This property is not valid fors=0, for there is more information inF than inBonly, eachWnbeing independent ofB.

The second property, right-continuity at 0, is less straightforward, but not difficult. The third one is much deeper;

it can be deduced from Smorodinsky’s study [13] of the processW (in inverse, discrete time), or, slightly less directly, from Vershik’s Example 3 in his theory of reversed, discrete-time filtrations [14]. We shall not attempt to give, or even sketch, the proofs of these properties. This example exhibits a pathological behaviour at time 0+, the non-Brownianness ofF being a germ property at time 0+, even thoughF0+is degenerate.

Definition. Fix 1d∞. A filtrationF isd-Brownian after zero if there exists anF-BMd Bsuch that, for all ts >0,Ft is generated byFs and by the process(BuBs, u∈ [s, t]), which is independent ofFs.

(4)

Lemma 1. LetF be a filtration; callFs=(Fs+t)t0the filtrationF shifted bys. Suppose that for eachs >0, there exists anFs-BMd Bssuch that the filtrationFs is generated by its initialσ-fieldFs and by the processBs. ThenF isd-Brownian after zero.

Proof. Partition]0,∞[into intervalsIn= ]2n,2n+1]wherenranges overZ, and define anF-BMdBbyBtB2n

=Bt2n2nfortIn(the increments ofB onInare given by the increments ofB2non]0,2n]). By induction onn, the property byFt =Fsσ (Bu, u∈ [0, t]), holds whens=2mandt=2nwithmn; it then extends easily to the general case whensandt are real numbers verifying 0< st. 2

The (admittedly artificial) example of this section shows that, for a filtrationF which is Brownian after zero, the degeneracy ofF0is not sufficient to imply Brownianness. The next theorem bridges this gap and gives a necessary and sufficient condition for Brownianness when the filtration is a priori known to be Brownian after zero.

Main results

Theorem 1. Fixd∈N∪ {∞}and a filtered probability space(Ω,A,P,F); suppose thatF isd-Brownian after zero. The following two statements are equivalent:

(i) F isd-Brownian.

(ii) (Self-coupling condition.) For eachR∈L1(F)and eachδ >0, there exists a probability space(Ω, A,¯ P) endowed with two filtrationsFandFverifying the following four conditions:

(a) F andFare isomorphic toF; in particular, there are two r.v.R∈L1(F )andR∈L1(F)corre- sponding toRby these isomorphisms;

(b) FandFare jointly immersed;

(c) for somes >0,FsandFsareP-independent;

(d) RR L1(Ω) < δ.

At first reading, condition (ii) looks awful, with the appearance of another filtered probability space(Ω, A,¯ P) where two filtrations are jointly immersed. One way of understanding it is to consider(Ω, A,¯ P)as an “enlarge- ment” of(Ω,A,P), in the sense of the sentence ‘at the cost of enlarging the space, we may suppose that . . . ’ The new filtrationFisomorphic toF should then be understood as beingF itself, and the meaning of (ii) becomes:

“After enlarging if necessary, there exists another filtrationF, jointly immersed withF, such that . . . ” Condition (ii) is the analogue, in our continuous-time setting, of the property called ‘I-cosiness’ in [6]. It is dubbed ‘self-coupling’ because, ifF is generated by some process X, (ii) means that it is possible, in some universe, to run simultaneously two copies ofX, in such a way that they are independent up to some times >0, but, at time infinity, the two values taken by some given functionalRofXhave become close to each other. This idea should be compared with the classical coupling method used to establish estimates for Markov processes.

The proof of Theorem 1 is rather long; it will be given later. Meanwhile, we shall comment on it, state and prove a corollary, and establish Proposition 2, a crucial density property.

Remarks. (1) LetDbe a dense subset of L1(F). If condition (ii) is satisfied for eachRDand eachδ >0, it holds in full generality. Indeed, forR∈L1andδ >0, there existsSDsuch that RS < δ; (ii) applied toS gives SS < δ, and by isomorphism RS = RS = RS < δ, whence RR <3δ, and (ii) holds forRtoo.

In particular, taking forDthe set of all simple,F-measurable random variables, it suffices to verify (ii) forR taking finitely many values. (I do not know if it suffices to verify it forRtaking two values, that is, for indicators of events; already in Vershik’s theory of filtrations in discrete, negative time, the corresponding question is open.)

(5)

(2) In condition (ii), the space L1 and its norm can equivalently be replaced by Lp for any p∈ [1,∞[, or by L0, with the distance d0(X, Y )=E[1∧ |XY|]for instance. Indeed, as shown by the preceding remark, we may suppose|R|to be bounded by some numberk1; so|RR|2k. It then suffices to use the elementary inequalityE[|RR|p](2k)pE[1∧ |RR|].

(3) (Remark due to S. Laurent [9].) If condition (ii) holds for someR and all δ, it holds also for all R1∈ L1(σ (R)) and allδ; consequently, it suffices to verify it for oneR that generatesF. To see this, suppose (ii) to hold for R and consider the set Φ of all bounded, Borel maps f:R→Rsuch that (ii) holds for fR and allδ >0; it suffices to verify thatΦcontains all bounded, Borel functions. The two-step argument is the same as in Slutsky’s lemma: First,Φcontains all bounded, Lipschitz functions, for, iff isk-Lipschitz, fRfR L1 k RR L1. Second, the setΦ is closed under uniformly bounded pointwise limits: if fnΦ are uniformly bounded andfn(x)f (x)for allx∈R, thenfΦ. To check this, fixδ >0. The quantityE[|fnRfR|]

tends to 0 whenn→ ∞, so it is smaller thanδfor somen(fixed in the sequel). Since (ii) holds forfnRandδ, there areF andF verifying (ii-a), (ii-b), (ii-c) andE[|fnRfnR|]< δ. By isomorphism, one has also E[|fnRfR|] = E[|fnRfR|] =E[|fnRfR|]< δ; soE[|fRfR|]<3δ, yieldingfΦ.

These two properties ofΦentail thatΦcontains all bounded, Borel functions.

Ford∞, callBd“the” filtration ofd-dimensional Brownian motion (it is unique up to isomorphism, whence the quotation marks). The next statement, a corollary of Theorem 1, says that the filtration B plays in our continuous-time framework the same rôle as the standard, non-atomic, discrete-time filtration in Vershik’s theory.

Corollary 1. Fixd. Assume that a filtrationF isd-Brownian after zero. The following three statements are equivalent:

(i) F isd-Brownian;

(ii) the independent productF ⊗Bis equal toB; (iii) F is immersible intoB.

The independent productF ⊗Bin (ii) is the filtration (more precisely: the filtered probability space) gen- erated by two independent filtrations, respectively isomorphic toF and toB; this product is well defined up to isomorphism only, so ‘equal to’ in (ii) really means ‘isomorphic to’.

Proof (the theorem is admitted). (i)⇒(ii) ⇒(iii) are trivial. To prove (iii)⇒(i), applying Theorem 1 toB, one sees thatBsatisfies the self-coupling property. By immersion, every subfiltration immersed inBalso satisfies the self-coupling property; this transfers toF by isomorphism. Applying now the theorem toF, one sees thatF isd-Brownian. 2

The interest of the theorem and its corollary is more theoretical than practical. In the special case that the filtration is Brownian after zero, they answer the following general question (see D. Revuz and M. Yor [12, p. 219]):

if a filtration has the predictable representation property w.r.t. some BMd, which additional assumption is sufficient to imply that it is d-Brownian? In view of Corollary 1, this question can be sharpened: If a filtration has the predictable representation property w.r.t. some BMd and is immersible intoB, is it necessarilyd-Brownian?

As observed in the introduction, the corollary also explains why the non-Brownian counterexamples constructed in [4] and [5] have the stronger property of not being immersible intoB.

Brownian examples

Putting the corollary at work to establish that a given particular filtration is Brownian, is disappointing: in all instances I know, exhibiting a generating BM is possible, and more informative than merely asserting its existence.

(6)

Nevertheless, in some cases, referring to the corollary is shorter and simpler than the constructive proof. Here are two examples; in both of them, a constructive proof of Brownianness is already available.

Example 2 (Generalized Goswami–Rao filtrations). Fix an integer d <∞ and a subgroup G of O(d). Let (Wd,Bd, λd,Bd)denote the canonical filtered probability space of BMd, andB the canonical process onWd. Each orthogonal transformationgGacts on the pathsωWdby(g·ω)(t )=g·(ω(t )); this action is an isomor- phism from(Wd,Bd, λd,Bd)to itself. A r.v.Y is said to be invariant if, for eachgG, one hasY=Yga.s.;

callIthe sub-σ-field of invariant events. As the action ofGcommutes with conditional expectation w.r.t.Btd, the subfiltrationF ofBddefined byFt=Bdt ∩Iis immersed inBd.

Malric has shown in [10] and [11] that the filtrationF isd-Brownian. SinceF is immersed inBd, this can also be derived from Corollary 1, provided we showF to bed-Brownian after zero. As we shall now see, in the particular case whenGis finite, this is straightforward. (But Malric’s proof works for allG, finite or not.)

We suppose from now on Gto be finite. There exists a Borel setA⊂Rd such thatAg·A=∅for each gG\{I}, and thatG·A=

gGg·Ahas a Lebesgue-negligible complementary set. (One can for instance choose a pointz∈Rdsuch thatg·z=zfor allgG\ {I}, and takeA= {y∈Rd:∀gG\ {I} yz < yg·z }.) The action ofGonG·Ais faithful, so for eachxG·Athere exists a uniqueγxGsuch thatγx·xA; moreover, xγxis a Borel map onG·A. Clearly,γg·x=γxg1since each of them maps the pointg·xto an element ofA.

For fixeds >0,Bs is a.s. inG·A, soγBs is a.s. well defined. The processβt =γBs·(Bs+tBs)is a BMd for the filtration(Bsd+t)t0. It is also invariant, for

βtg=γg·Bs·(g·Bs+tg·Bs)=γBsg1g·(Bs+tBs)=βt.

Hence it is adapted to the smaller filtrationFs, and it is an Fs-BMd. Using Lemma 1, to establish that F is Brownian after zero, it remains to see thatFs andβ generateFs. PuttingV =(Bs+vBs, v∈ [0, t])andV= v, v∈ [0, t]), we have to show that, forY ∈L(Bsd+t), the conditional expectationE[Y|I]is measurable for Fsσ (V ). We may takeY of the formU φ(V ), whereUisBsd-measurable; and it then suffices to write

E[Y|I] = 1

|G|

gG

Yg= 1

|G|

gG

(Ug)φ(Vg)

= 1

|G|

gG

UBs φ

VBs

= 1

|G|

gG

UBsφ(Vg)

= 1

|G|

gG

UBsφ(V )=φ(V ) 1

|G|

gG

(Ug)=φ(V )E[U|I].

Example 3 (Stationary Brownian motion on a sphere). LetMbe a compact, connected,d-dimensional Riemannian manifold without boundary, and call(Xt)t∈R the stationary Brownian motion with values inM: for eacht∈R, Xt is distributed onMaccording to the normalized Riemannian measure. Arnaudon [1] has established that the filtration(Ft)t∈Rgenerated byXisd-Brownian, that is, generated by anRd-valued Ornstein–Uhlenbeck process indexed byR(or, equivalently, by a BMd which is forced to jump to 0 at each integer instantt∈Z; equivalently again, the logarithmically time-changed filtration(Flnt)t0isd-Brownian). It may be interesting to observe here that Arnaudon’s proof relies on a coupling argument. This situation, and other stationary processes indexed byR, is a typical instance where Theorem 1 or Corollary 1, or their analogues obtained by a logarithmic time-change, can be expected to enter the picture, because the hypothesis that everything goes well on each interval[s,∞[is clearly satisfied; the problem is at time−∞only.

(7)

Corollary 1 gives a new proof of Arnaudon’s result, provided it is established that the processX is immersed in some, possibly higher-dimensional, Brownian motion. As we shall now see,1this immersion is quite simply obtained in the particular case whenMis ad-dimensional sphere.

So we are working in eternal time(t∈R), andXtakes values in the unit sphereS⊂Rd+1. Denote by dBt = (dBt1, . . . ,dBtd+1)a(d+1)-dimensional Brownian innovation, and consider the SDE on the sphere

dXt=projX

t (dBt)d 2Xtdt.

The drift term−d2Xtdtcompensates for the extrinsic curvature of the sphere, soXremains onSand is a Brownian motion onS. This SDE generates a stochastic flow of diffeomorphismsΦst onS. A. Carverhill, M. Chappell and D. Elworthy have shown in [3] thatΦ has only one characteristic exponent, namely−d/2. Since it is strictly negative, one has

sup

x,yS

E dist

Φst(x), Φst(y) →0 whents→ ∞. (∗)

(This property can also be obtained directly, by studying the diffusion dist(Φst(x), Φst(y)), which is the solution of some one-dimensional SDE.) A consequence of()is that, for fixedxS,

E dist

Φrt(x), Φst(x) =E dist

Φst Φrs(x)

, Φst(x) →0

whenrandstend to−∞withrs. By Cauchy’s criterion, for fixedxS, the limitXt=lims→−∞Φst(x)exists in probability; the flow property then shows thatXis a solution to the SDE, hence a Brownian motion onS. AsXt

is also measurable w.r.t. the past innovationBtd+1=σ (BtBs, s∈ ]−∞, t]), this realizes an immersion ofXin the(d+1)-dimensional Brownian filtrationBd+1, and Corollary 1(iii) applies.

More generally, the same argument works for any embedded compact manifold such that all characteristic exponents of the gradient Brownian flow (or, for that matter, of some Brownian flow) are strictly negative.

We now start proving Theorem 1. We begin with two sections devoted to establishing technical results that will be useful later.

Preliminaries: (1) Substantial families ofσ-fields

If (Ω,A,P) is a probability space and F a finite set, we denote by L(A;F ) the set of all F-valued, A-measurable random variables; L(A;F )is a metric space when endowed with the distance(R, S)→P[R=S]. If(E, d)is a separable metric space, L1(A;(E, d))(or L1(A;E)for short) denotes the set of allE-valued, A-measurable random variables Rsuch thatE[d(R, x)]is finite for some (⇔for all)xE. It is endowed with the distance(R, S)→E[d(R, S)]. It is well known that the set Lf(A;E)of all simple,E-valued,A-measurable r.v., is a dense subset of L1(A;(E, d)).

[Proof: Choose a numbering(xn)n0of a countable dense subsetDofE; defineψn:DDbyψn(xm)=xm ifmnandψn(xm)=x0ifm > n. ForR∈L1(A;(D, d)),E[d(ψnR, R)]tends to 0 by dominated convergence;

so Lf(A;(D, d))is dense in L1(A;(D, d)). To see that the latter is dense in L1(A;(E, d)), defineφδ:EDby φδ(x)=firstxnsuch thatd(xn, x) < δ;φδis measurable and verifiesd(x, φδ(x)) < δ, soE[d(R, φδR)]< δ.]

Definition. Let(Ω,A,P)be a probability space and B a set of sub-σ-fields ofA. We shall say that B is substantial inAif

B∈BL1(Ω,B,P)is dense in L1(Ω,A,P).

For instance, if(An)n∈N is an increasing sequence of sub-σ-fields ofA, the set{An, n∈N}is substantial in

nAn.

1 This argument is due to S. Watanabe; I thank him for allowing me to publish it here.

(8)

Observe that if B is substantial inA, the class

BBBgenerates theσ-fieldA; but the converse is false: for instance, the set B= {σ (A), A∈A}is in general not substantial inA.

Lemma 2. Let(Ω,A,P)be a probability space and B a set of sub-σ-fields ofA. The following three conditions are equivalent:

(i) B is substantial inA;

(ii) for each finite setF,

BBL(B;F )is dense in L(A;F );

(iii) for each separable metric space(E, d),

B∈BL1(B;(E, d))is dense in L1(A;(E, d)).

Proof. (iii)⇒(i) is trivial: takeE=R.

(i)⇒(ii). Fix F finite, R ∈L(A;F ) and δ >0. Let φ:F →Rbe a map such that for all eand f in F, e=f ⇒ |φ(e)φ(f )|2. Applying hypothesis (i) toφR, one obtains aB∈B and aT ∈L1(B;R)such that E[|TφR|]< δ; soP[|TφR|1]< δ. Defineψ:R→φ(F )byψ (x)=the point inφ(F )closest tox(the leftmost such point if there are two). On{|TφR|<1}, one hasψT =φRby the choice ofφ; hence

P[ψT =φR]P

|TφR|1 < δ.

It suffices to setS=φ1ψT to haveS∈L(B;F )andP[S=R] =P[φS=φR]< δ.

(ii)⇒(iii). FixR∈L1(A;(E, d))andδ >0. As Lf(A;E)is dense in L1(A;E), there exist some finite subset F of E and some T ∈L(A;F ) such that E[d(R, T )]< δ/2. Call a the diameter of F. Hypothesis (ii) yields a B∈B and a S∈L(B;F ) such that P[S =T]< δ/(2a). Since d(S, T )a1{S=T}, one has E[d(S, T )] aP[S=T]< δ/2, andE[d(R, S)]E[d(R, T )] +E[d(S, T )]< δ. 2

Lemma 2 will be used only once, in the proof of Lemma 3; what we shall need is only (i)⇒(iii), in the particular case when the separable metric spaceEis equal toRk.

Lemma 3. Given(Ω,A,P), letBandCbe sub-σ-fields ofA, and D a set of sub-σ-fields ofC, substantial inC.

The set{B∨D,D∈D}is substantial inB∨C.

Proof. We have to show that

D∈DL1(B ∨ D) is dense in L1(B ∨ C). But the random variables of the form

iαi1Bi1Ci (finite sum, αi ∈ R, Bi ∈ B, Ci ∈ C) are dense in L1(B ∨ C). Fix such a r.v.

α11B11C1 + · · · +αk1Bk1Ck. Applying Lemma 2 to the random variable 11C1, . . . , αk1Ck) with values in (E, d)=(Rk,

i|xiyi|), one gets the existence of some D ∈D and (D1, . . . , Dk)∈L1(D, E) such that E[

i|αi1CiDi|]< δ, and a fortiori E[|

ii1Bi1Ci1BiDi)|] < δ. As the r.v.

i1BiDi belongs to L1(B∨D), the proof is over. 2

Lemma 4. Fixs0. LetF be a filtration and G a set such that eachG∈G is a filtration immersed inF; suppose {G, G∈G}to be substantial inF. Then{Gs, G∈G}is substantial inFs; moreover, ifCis aσ-field such that Fs∨C=F, then{Gs∨C, G∈G}is substantial inF.

Proof. Since the mapEs:X→E[X|Fs]is a contraction from L1(F)onto L1(Fs), and since

G∈GL1(G)is dense in L1(F),

GGEsL1(G)is dense in L1(Fs). The immersion hypothesis gives EsL1(G)=L1(Gs);

so {Gs, G∈ G} is substantial in Fs. The second part of the conclusion follows immediately from this via Lemma 3. 2

(9)

Preliminaries: (2) Joint laws of Brownian motions

Throughout this section, the dimensiond is fixed and finite.

Call COV(d)the set of alld×d real matriceshsuch that, for all real vectors v=

v1

... vd

 and w=

w1

... wd

,

one has the inequality

tvhw=

i,j

vihijwj v w

(wheretvdenotes the transpose ofvand v its Euclidean norm).

The set COV(d)draws its name from ‘covariance’; for ifX andY are two random vectors in Rd with unit variance, their covariancehij=Cov[Xi, Yj]belongs to COV(d); this stems directly from property (iv) in Proposi- tion 1 below. But COV(d)also means ‘covariation’, for ifXandY are two BMdfor some filtration, the following lemma asserts that their covariation(d/dt )Xi, Yjttakes its values in COV(d).

Lemma 5. If X and Y are two BMd for some filtration F, there exists a predictable process H with values in COV(d)such that dXi, Yj(t )=Hij(t )dt.

Proof. The Kunita–Watanabe inequality says that there exists a predictable processH, with values ind×d matri- ces, defined up to a dt×dPnegligible set, such that dXi, Yj(t )=Hij(t )dt; moreover, as

v·X, v·Xt= v 2t, w·Y, w·Yt= w 2t and v·X, w·Yt= t 0

ij

viHij(s)wjds, one has

1 ts

t s

ij

viHij(u)wjdu v w

for alls, t, almost allωand all rational vectorsvandw. Hence, 1

ts t s

H (u)du∈COV(d)

for alls, t and almost allω. In the limit,Ht(ω)belongs to the closed set COV(d)for almost all(t, ω). Replacing Hby 0 wherever necessary makes it possible to choose a version ofH which is identically COV(d)-valued. 2 Proposition 1. (1) For ad×d matrixh, the following are equivalent:

(i) the matrixhbelongs to COV(d);

(ii) the transposethbelongs to COV(d);

(iii) for all vectorsv, hv v ; (iv) the symmetric, 2d×2dmatrixI h

th I

is positive;

(v) the symmetric,d×dmatrixIthhis positive.

(10)

(2) The set COV(d)is convex and compact. Its extremal points are the orthogonal matricesh∈O(d).

(3) Putd=d2+1. There existdmeasurable mapsr1, . . . , rd from COV(d)to O(d)anddmeasurable maps α1, . . . , αd from COV(d)to[0,1]such that, for eachh∈COV(d)

d

i=1

αi(h)=1 and

d

i=1

αi(h)ri(h)=h.

Remark (not used in the sequel). More generally, ifGis any compact group of reald×d matrices, the set of all extremal points of the convex hull ofGisGitself. Indeed,Gis a subgroup of O(d), and every point ofGis extremal in COV(d)by (2), and a fortiori in the smaller set convG.

Proof of Proposition 1. (1) Equivalence between (i) and (ii) stems trivially fromtvhw=t(tvhw)=twthv. Also, by homogeneity, (i) is equivalent to

w sup

v: v =1

tvhw w ,

or to (iii) since the left-hand side is hw . In turn, (iii) amounts totvthhvtvv, which says thatIthh0, that is, (v). Last, (iv) means thattvhw12( v 2+ w 2)for allvandw. Ifhis in COV(d), this inequality holds because the right-hand side is minorated by v w ; conversely, if this inequality holds, one hastvhw v w for all unit vectors, hence also for all vectors by homogeneity.

(2) Convexity, closedness and boundedness (each entry ofh∈COV(d)verifies|hij|1) are obvious on the definition of COV(d). The inclusion O(d)⊂COV(d)stems for instance from (v).

It is not difficult to see that the extremal subset of COV(d)is equal to O(d); see [8].2

(3) The last assertion of Proposition 1 comes from Carathéodory’s theorem (ifKis a compact, convex subset of Rn, every point of K is a barycentre of at most n+1 extremal points of K), and the measurable section theorem. 2

Still for finited, denote byWd= {w∈C(R+,Rd): w(0)=0}the canonical space ofd-dimensional Brownian motion, and byλdthe Wiener measure onWd.

We shall call JIBd (for “jointly immersed Brownian motions”) the set of probability measures on the product Wd×Wd defined as follows: a probabilityµ onWd×Wd is an element of JIBd if and only if there exist, on some filtered probability space(Ω,A,P,F), twoF-BMd XandY with joint lawL[X, Y] =µ. Observe that if µ∈JIBd, both marginals ofµare equal toλd; but the converse does not hold. For instance, ifXis a BMdand if Yt=Xt+1X1, the law of(X, Y )does not belong to JIBd, becauseX2X1is not independent ofY1.

Lemma 6. The set JIBd is convex; endowed with the topology of weak convergence ( for the bounded, continuous functions onWd×Wd), it is compact.

If X and Y are two BMd for some filtrationF, and if C is a sub-σ-field of F0, the conditional joint law L[X, Y|C]a.s. belongs to JIBd.

IfX andY are two processes, and ifF0is aσ-field such thatL[X, Y|F0]is a.s. in JIBd, thenX andY are BMdfor the smallest filtrationGsuch thatG0⊃F0and thatXandY areG-adapted.

2 I thank an anonymous referee for this reference.

(11)

Proof. IfXandY areF-BMd, then, for allk1, all 0< s1<· · ·< sk< t < u, and allρ, σ1, . . . , σk, τ1, . . . , τk in the dual ofRd,

E

eiρ(XuXt)−e12 ρ 2(ut ) exp i

k j=1

jXsj+τjYsj)

=0;

so, putting χ (x, y)=

eiρ(xuxt)−e12 ρ 2(ut ) exp i

k j=1

jxsj+τjysj),

everyµ∈JIBdverifiesµ(χ )=0. Similarly,µ(χ )ˆ =0, whereχ (x, y)ˆ =χ (y, x).

Conversely, if µ(χ )=µ(χ )ˆ =0 for all k and all dyadic sj, t, u, ρ, σj, τj, then, calling F the smallest µ-complete, right-continuous filtration onWd×Wd to which the canonical process(x, y)is adapted,x andy are(µ,F)-BMd, and soµis in JIBd.

Hence, JIBdis characterized as the set of all probabilitiesµonWd×Wd such thatµ(f )=0 for allf belong- ing to some countable set of bounded, continuous functions. Consequently, JIBd is closed and convex. It is also relatively compact, because it is tight: tightness follows immediately from both marginals of anyµ∈JIBd being equal toλd.

IfXandY are twoF-BMd, and ifCis a sub-σ-field ofF0, one hasE[χ (X, Y )|C] =E[ ˆχ (X, Y )|C] =0 a.s.

for each dyadicχ, showing thatL[X, Y|C] ∈JIBda.s.

Conversely, ifXandY are two processes, and ifL[X, Y|F0]belongs almost surely to JIBd, one hasL[X|F0]

=λd a.s., whenceL[X] =λd, andXis a BMd. Moreover, for everyF0-measurable r.v.F0and every dyadicχ, one hasE[χ (X, Y )exp iF0] =0. Consequently, for all dyadics < t < uandρ, and allGs-measurable, bounded r.v.Gs, one has

E

eiρ(XuXt)−e12 ρ 2(ut )

Gs =0, showing thatXis aG-BMd. 2

Keeping d finite, call MABd (for “mutually adapted Brownian motions”) the set of all probabilities µ on Wd×Wd such that there exist two Brownian motionsXandY which generate the same filtration and have joint lawL[X, Y] =µ. Clearly, MABdis a subset of JIBd.

Remark (not used in the sequel). It is easy to see that each element of MABdis an extremal point of JIBd, because it is already extremal in the much bigger set consisting of all probabilities onWd×Wd with first marginλd. The converse does not hold: there are many extremal points of JIBdthat do not belong to MABd. (Consider for instance, ifd=1, the joint law ofX andT X, whereT is the Lévy transform.) It would be interesting to characterize all extremal points of JIBd; this seems to be a difficult problem. (Already, in two dimensions, call JU (for ‘joint uniform’) the set of all probabilitiesµon the unit square such that both marginals ofµare the Lebesgue measure;

characterizing the extremal points of JU is, as far as I know, an open question.)

Proposition 2. The set MABdis dense in JIBd. In other words, ifXandY are twoF-BMd, there exist Brownian motionsξnandηnsuch that

for each fixedn,ξnandηngenerate the same filtration;

whenntends to infinity,(ξn, ηn)converges in law to(X, Y ).

Remark (not used in the sequel). An analogue of Proposition 2 is the following fact: whenT ranges over the set of all bimeasurable, Lebesgue-measure preserving bijections of[0,1]to itself, the probabilities1

0 δ(x,T x)dx are

(12)

dense in the set JU defined just before Proposition 2. In other, more probabilistic, words, if, on some sample space, XandY are two uniform r.v. on[0,1], there existξn,ηn, also with uniform law on[0,1], such that

• for eachn,ξnandηngenerate the sameσ-field;

• whenntends to infinity,n, ηn)converges in law to(X, Y ).

A proof of this remark is given by Gangbo [7].3Another, more direct, proof is the following. We shall work with ]0,1]instead of[0,1]; by ‘interval’, we shall always mean an interval of the form]a, b]. Call JUthe subset of JU which we claim is dense.

Partition ]0,1] into n subintervals I1, . . . , In of length 1/n; put Jk=Ik. (The intervals Ik will be used on the first factor, theJk on the second factor.) The squareS= ]0,1] × ]0,1]is partitioned inton2smaller squares Sk=Ik×J. Given a probabilityµ∈JU, it suffices to exhibit a probabilityν∈JUsuch thatν(Sk)=µ(Sk)for allkand; whenntends to infinity, theseνwill converge towardsµ, thus proving the remark.

Putmk=µ(Sk). The hypothesisµ∈JU implies that the 2n sumsn

k=1mk andn

=1mk are all equal to 1/n. This makes it possible to partition the interval Ik (respectivelyJ) inton subintervalsIk1, . . . , Ikn (re- spectivelyJ1, . . . , Jn) such that Leb(Ik)=Leb(Jk)=mk. (It is not really a partition, because ifmk=0 the corresponding subintervals are empty.) The family{Ik, 1k, n}(respectively{Jk, 1k, n}) is a “par- tition” of]0,1]inton2subintervals with lengthsmk. Since Leb(Ik)=Leb(Jk), there exists a bi-Borel, measure- preserving bijection T:]0,1] → ]0,1] such that T (Ik)=Jk for all k and ; the measure ν=1

0 δ(x,T x)dx belongs to JU. Call G the graph of T. The subsetGk of G corresponding to abscissae in Ik and ordinates inJk verifiesGkIk×JkSk; as

k,Gk=Gand as theSkare disjoint, one hasGk=GSk. Thus ν(Sk)=ν(Gk)=Leb(Ik)=Leb(Jk), that is,ν(Sk)=mk=µ(Sk). The claim is established.

We now come back to Proposition 2. Before proving it, here is a small lemma, obvious from the point of view of Rohlin’s theory, that will be needed in the proof.

Lemma 7. LetZandhbe two r.v. on some probability space(Ω,A,P). On another probability space(Ω,A,P), letζ andγ be two independent r.v., such thatζ has the same law asZandγ has a diffuse law. There exists onΩ a r.v.h, measurable forσ (ζ, γ ), such that the joint lawL(h, ζ )is equal toL(h, Z).

Proof of Lemma 7. By replacing γ with Fγ, where F (x)=P[γ x], we may suppose γ to be uni- formly distributed on[0,1]. Callρz(dj )a regular version of the conditional lawP[h∈dj |Z=z], letfz(r)= inf{j: ρz(]−∞, j]) > r}denote the inverse of the distribution function ofρz, and seth=fζ(γ ). Owing to the independence ofζ andγ, one hasL[h|ζ =z] =L[fzγ] =ρz. Callingµthe law ofZandζ, this gives

P[h∈dj, ζ∈dz] =µ(dz)ρz(dj )=P[h∈dj, Z∈dz], so(h, ζ )has the same law as(h, Z). 2

We now have all the necessary ingredients to prove Proposition 2. The proof is rather long; it can be best understood by keeping in mind two very different particular cases: the case whenY,tXt =ht, wherehbelongs to COV(d)by Lemma 5 and isF0-measurable; and the case when dYt =(1]0,T]1]T ,∞[)(t )dXt, whereT is a stopping time, independent ofXand ofY.

Proof of Proposition 2. The goal is to approximate an arbitraryµ∈JIBd by elements of MABd. The first steps of the proof will consist in replacingµby an element of some suitably chosen sequence tending toµ(that is, in takingµin some suitable dense subset of JIBd).

3 I thank F. Delbaen and S. Laurent who independently pointed out this reference to me.

Références