Trees within trees II: Nested Fragmentations

(1)

HAL Id: hal-01842036

https://hal.archives-ouvertes.fr/hal-01842036

Preprint submitted on 17 Jul 2018

HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.

Trees within trees II: Nested Fragmentations

Jean-Jil Duchamps

To cite this version:

Jean-Jil Duchamps. Trees within trees II: Nested Fragmentations. 2018. �hal-01842036�

(2)

Trees within trees II: Nested Fragmentations

Jean-Jil Duchamps Sorbonne Université

July 16, 2018

Abstract

Similarly as in [4] where nested coalescent processes are studied, we generalize the definition of partition-valued homogeneous Markov fragmentation processes to the setting of nested partitions, i.e. pairs of partitions(^𝜁,𝜉)where𝜁is finer than𝜉. As in the classical univariate setting, under exchangeability and branching assumptions, we characterize the jump measure of nested fragmentation processes, in terms of erosion coefficients and dislocation measures. Among the possible jumps of a nested fragmentation, three forms of erosion and two forms of dislocation are identified – one of which being specific to the nested setting and relating to a bivariate paintbox process.

1 Introduction

Evolutionary biology aims at tracing back the history of species, by identifying and dating the relationships of ancestry between past lineages of extant individuals. This information is usually represented by a tree or phylogeny, species corresponding to leaves of the tree and speciation events (point in time where several species descend from a single one) corresponding to internal nodes [16,23].

Modern methods consist in analyzing and comparing genetic data from samples of individuals to statistically infer their phylogenetic tree. Probabilistic tree models have been well- developed in the last decades – either from individual-based population models like the classical Wright-Fisher model [2,10,15,23], or from time-forward branching processes, where the branching particles are species (see for instance Aldous’s Markov branching models [1] and the revolving literature [6, 7, 11,13]) – allowing for inference from genetic data. A challenge is that trees inferred from different parts of the genome generally fail to coincide, each of them being understood as an alteration of a “true” underlying phylogeny (which we call thespecies tree).

To understand the relation betweengene treesand the species tree, our goal is to identify a class of Markovian models coupling the evolution of both trees, making the assumption that in general, several gene lineages coexist within the same species, and at speciation events one or several gene lineages diverge from their neighbors to form a new species, i.e.

we model the problem as atree within a tree[9,18–20]. See Figure1for an instance of a simple nested genealogy where discrepancies arise between the resulting gene tree and species tree.

Figure 1: Example of a nested tree where the gene tree (in black) does not coincide with the species tree (in gray).

Recent research aims at defining mathematical processes giving rise to such nested trees, generalizing several well-studied univariate (we will sometime use this term as opposed to

“nested”) processes. Some work in progress involves a nested version [5,17] of the King- man coalescent [14] (considered the neutral model for evolution, appearing as a scaling limit of many individual-based population models). In [4] we study a nested generaliza- tion ofΛ-coalescent processes [3,21,22] and characterize their distribution. Our present goal is to generalize the forward-time branching models originated from Aldous [1]. His

(4)

assumptions (which will be formally defined for our context in Section3) are basically that the random process of evolution is homogeneous in time and that the law of the process is invariant under both relabeling and resampling of individuals (we then say the process is exchangeableandsampling consistent). We are interested in the partition-valued processes satisfying these assumptions, i.e. the so-called fragmentation processes [3,13], and in this article we generalize their definition tonested partition-valuedprocesses to model jointly a gene tree within a species tree.

Crane [7] also generalizes Aldous’s Markov branching models to study the gene tree/species tree problem but uses a different approach to the one we use here. Indeed, his model is such that first the entire species treetis drawn according to some probability, and then the gene treet⁰ is constructed thanks to a generalized Markov branching model that depends ont. In the meantime, our goal is to characterize the class of models in which there is a joint Markov branching construction of both the gene tree and the species tree, under the assumptions of exchangeability and sampling consistency.

In particular our main result Theorem17, which will be formally stated in Section5, con- sists in showing that nested fragmentation processes satisfying natural branching properties are uniquely characterized by

• threeerosion parameters𝑐_out,𝑐_in,1and𝑐_in,2(rates at which a unique lineage can fragment out of its mother block, in three different situations);

• two dislocation measures 𝜈_out and 𝜈_in that are Poissonian intensities of how blocks instantaneously fragment into several new blocks with macroscopic frequencies.

The article is organized as follows. Section2briefly introduces some definitions and notation used throughout the paper. In Section3we define our exchangeability and sampling consistency properties – or projective Markov property –, and show their equivalence to a

“strong exchangeability” property in a fairly general setting. We also recall some results in the univariate case which we seek to generalize to the nested case. In Section4we for- mulate some branching property assumptions, showing how they lead to simplifications in the representation of semi-groups of fragmentations, and giving a natural Poissonian construction of such processes. Under an additional branching property assumption, Section 5is devoted to the full characterization of the semi-group of simple nested fragmentation processes, in terms oferosionanddislocation measures. It is shown that dislocations, similarly as in the univariate case, can be understood as (bivariate) paintbox processes. Finally Section6briefly shows how our main result, Theorem17, translates in simpler terms when we make the classical biological assumption that all splits are binary.

2 Definitions, notation

For a set𝑆, writeP𝑆 for the set of partitions of𝑆:

P𝑆 := {^𝜋 ⊂P(^𝑆) \ {}, ∀^𝐴, ^𝐵∈ ^𝜋,𝐴∩^𝐵=and S

𝐴∈^𝜋𝐴=^𝑆}, whereP(^𝑆)denotes the power set of𝑆.

(5)

For𝑆,𝑆⁰two sets,𝜋 ∈ P𝑆 and𝜎:𝑆⁰→^𝑆aninjection, we write 𝜋^𝜎:= {^𝜎⁻¹(^𝐴), 𝐴∈ ^𝜋} \ {},

and if𝜇is a measure onP𝑆then we write𝜇^𝜎for the push-forward of𝜇by the map𝜋7→^𝜋^𝜎. Note that if𝑆⁰⁰

→𝜏 ^𝑆⁰ →^𝜎 ^𝑆are injections, then we have𝜋^𝜎𝜏 =(^𝜋^𝜎)^𝜏_{, and}^𝜇^𝜎𝜏=(^𝜇^𝜎)^𝜏_. For𝑆⁰ ⊂ ^𝑆, there is a natural surjective function 𝑟_𝑆_,_𝑆0 : P𝑆 → P𝑆⁰ called the restriction, defined by

𝑟_𝑆_,_𝑆0(^𝜋)=^𝜋|^𝑆⁰ :={^𝐴∩^𝑆⁰_, ^𝐴∈ ^𝜋} \ {}_. Note that𝜋_|_𝑆0 =^𝜋^𝜎^for^𝜎 ^:^𝑆⁰ →^𝑆,𝑥7→ ^𝑥the canonical injection.

There is always a partial order onP𝑆, denoted and defined as:

𝜋 ^𝜋⁰ _if ∀(^𝐴_,^𝐵) ∈^𝜋×^𝜋⁰_, ^𝐴∩^𝐵, ⇒ ^𝐴⊂ ^𝐵_,

that is𝜋 ^𝜋⁰ _if ^𝜋is finer than 𝜋⁰. We will work on the space consisting of two nested partitions, which we will noteP_𝑆^2,:

P^2,

𝑆 := {(^𝜁_,^𝜉) ∈ P_𝑆²_, ^𝜁 ^𝜉}_.

We equip the spaceP_𝑆^2, with a partial order defined naturally as (^𝜁_,^𝜉) (^𝜁⁰_,^𝜉⁰)if𝜁 ^𝜁⁰and𝜉 ^𝜉⁰_.

Let us now define, for𝑛∈ _,[^𝑛]_:={_{1, . . . ,}^𝑛}_and[∞]_:=_{, and for} ^𝑛∈ ∪ {∞}_: P𝑛:= P_[𝑛] ={^𝜁partition of[^𝑛]}_.

We will generally label the blocks of a partition𝜋= {^𝜋₁_,^𝜋₂_{, . . .}}, in the unique way such that

min𝜋₁< min𝜋₂ <. . .

The spaceP_∞^2, is endowed with a distance𝑑which makes it compact, defined as follows:

𝑑(^𝜋,𝜋⁰)= ^sup{^𝑛∈ _, ^𝜋_|[𝑛] =^𝜋|[^𝑛]}−1

, with the convention(sup)⁻¹=^0.

For𝑘 ≤ ^𝑛 ≤ ∞,𝜎 :[^𝑘] → [^𝑛]an injection and𝜋=(^𝜁_,^𝜉) ∈ P𝑛^2,, we write 𝜋^𝜎 :=(^𝜁^𝜎_,^𝜉^𝜎) ∈ P^2,

𝑘 . Also, we write𝜋_|[_𝑘_] :=(^𝜁_|[𝑘],𝜉_|[_𝑘_]) ∈ P^2,

𝑘 .

A measure𝜇onP𝑛or onP𝑛^2,is said to beexchangeableif for any permutation𝜎:[^𝑛] → [^𝑛], we have

𝜇^𝜎=^𝜇^.

A random variableΠ taking values inP𝑛or inP𝑛^2, is said to beexchangeableif for any permutation𝜎:[^𝑛] → [^𝑛], we have

Π^𝜎 ⁽=^𝑑⁾Π,

(6)

that is if its distribution is exchangeable. Similarly, a random process(_Π(^𝑡),𝑡 ≥ 0)taking values in P𝑛 or in P𝑛^2, is said to be exchangeable if for any initial state 𝜋₀ and any permutation𝜎:[^𝑛] → [^𝑛]_{, we have}

(_Π(^𝑡)^𝜎,𝑡 ≥ 0)under𝜋₀ (^𝑑)

= (_Π(^𝑡),𝑡 ≥ 0)under𝜋^𝜎 0, where𝜋is the distribution of the process started from𝜋.

Finally, a measure or a random process with values inP_∞_orP_∞^2,will be calledstrongly ex- changeableif its distribution is invariant under the action ofinjections. Note that while for processes this is a strictly stronger assumption than being exchangeable (see Section3.2), for measures the two properties are equivalent.

In the following we only consider time-homogeneous Markov processes.

3 Projective Markov property and strong exchangeability

3.1 Projective Markov process

For each𝑛 ∈ , let 𝐴_𝑛be a finite non-empty set. Assume there are surjective maps𝑟_𝑚_,_𝑛 : 𝐴_𝑚→ ^𝐴𝑛for each𝑚 ≥ ^𝑛which satisfy

∀^𝑝 ≥ ^𝑚 ≥^𝑛 ≥ 1, 𝑟_𝑚_,_𝑛◦^𝑟𝑝,𝑚 =^𝑟^𝑝^,^𝑛^,

∀^𝑛∈ , 𝑟_𝑛_,_𝑛= ^id^𝐴^𝑛^.

The family(^𝐴𝑛,𝑟_𝑚_,_𝑛, 𝑚 ≥ ^𝑛 ≥ 1)is called afinite inverse system, and we can define the inverse limit

𝐴=^lim←−− ^𝐴^𝑛^:=

(^𝑎𝑛,𝑛 ≥1) ∈Q

𝑛∈𝐴_𝑛, ∀^𝑚 ≥ ^𝑛,𝑟_𝑚_,_𝑛(^𝑎𝑚)= ^𝑎^𝑛 ^,

along with the canonical projection maps𝑟_𝑛 : 𝐴 → ^𝐴𝑛, (^𝑎𝑛,𝑛 ≥ ₁) 7→ ^𝑎𝑛. A natural distance𝑑can be defined on the space 𝐴, by

𝑑(^𝑎_,^𝑏)_:=(₁/₂+^sup{^𝑛 ≥ _1, ^𝑎𝑛= ^𝑏^𝑛})⁻¹_,

where we use the conventions sup =^{0 and}(₁/₂+^sup)⁻¹= 0. Note that its topology is then generated by the sets

𝑟⁻¹

𝑛 ({^𝑎})_, ^𝑛≥ _1,^𝑎∈ ^𝐴𝑛,

which are the balls of radius 1/^𝑛and center any𝑐 ∈ ^𝑟⁻_𝑛¹(^𝑎). The assumption that the sets 𝐴_𝑛are finite makes the space(^𝐴,𝑑)compact, so we can consider stochastic processes with values in𝐴.

Remark 1. P_∞ = ^lim←−− P𝑛 and P_∞^2, = ^lim←−− P_𝑛^2, are both inverse limits of finite inverse systems, where the restriction maps are𝑟_𝑚_,_𝑛: P𝑚→ P𝑛, 𝜋7→^𝜋_|[𝑛].

(7)

Proposition 2. Let𝑋 = (^𝑋(^𝑡),𝑡 ≥0)be a stochastic process with values in𝐴the inverse limit of a finite inverse system. Assume that the followingprojective Markov propertyholds:

For all𝑛≥ 1, the process𝑋^𝑛:=(^𝑟𝑛(^𝑋(^𝑡)),𝑡 ≥0)is a continuous-time Markov chain in the finite state space 𝐴_𝑛, whose distribution under𝑎 depends only on𝑟_𝑛(^𝑎)_.

Then𝑋 is a Markov process, whose distribution is characterized by a transition kernel𝐾 from 𝐴to𝐴(i.e.𝐾_𝑎( · )is a nonnegative measure on 𝐴for all𝑎∈ ^𝐴_and^𝑎7→ ^𝐾𝑎(^𝐵)is measurable for any𝐵Borel set of 𝐴) such that

• for all𝑎∈ ^𝐴_{, we have}^𝐾𝑎({^𝑎})= 0,

• for all𝑎 ∈ ^𝐴_and ^𝑎⁰ ∈ ^𝐴𝑛\ {^𝑟𝑛(^𝑎)}, the Markov chain 𝑋^𝑛 has a transition rate from 𝑟_𝑛(^𝑎)_to^𝑎⁰ _{equal to}

𝑞^𝑛

𝑎,𝑎⁰ = ^𝐾^𝑎 ^𝑟⁻^𝑛¹({^𝑎⁰}) . Proof. 𝑋^𝑛is a Markov chain, therefore there exist transition rates

𝑞^𝑛

𝑎,𝑎⁰ =^lim

𝑡↓0

1 𝑡

𝑎(^𝑋^𝑛(^𝑡)=^𝑎⁰)

for all𝑎 ∈ ^𝐴, 𝑎⁰ ∈ ^𝐴𝑛\ {^𝑟𝑛(^𝑎)}. Now since for 𝑛 < ^𝑚, 𝑋^𝑚 and 𝑋^𝑛 = ^𝑟^𝑚,𝑛(^𝑋^𝑚) are both Markov chains, necessarily we have

𝑞^𝑛

𝑎,𝑎⁰ = X

𝑎⁰⁰∈^𝑟⁻¹𝑚,𝑛(^𝑎⁰)

𝑞^𝑚

𝑎,𝑎⁰⁰. Fix𝑎^?∈ ^𝐴and𝑛≥ 1 and consider the application

𝑓_𝑛: 𝑎∈ ^𝐴𝑛\ {^𝑟𝑛(^𝑎^?)} 7−→^𝑞^𝑛

𝑟𝑛(𝑎^?),𝑎. Then these applications(^𝑓𝑛, 𝑛≥ 1)satisfy

∀^𝑚 ≥ ^𝑛≥ 1, 𝑎∈ ^𝐴𝑛\ {^𝑟𝑛(^𝑎^?)}, 𝑓_𝑛(^𝑎)= X

𝑎⁰∈𝑟_𝑚⁻¹_,_𝑛({𝑎})

𝑓_𝑚(^𝑎⁰).

It is then easy to check that Carathéodory’s extension theorem allows us to build a measure 𝐾_𝑎?on 𝐴\ {^𝑎^?}(which we see as a measure on 𝐴such that 𝐾_𝑎?({^𝑎^?})=0) for which

∀^𝑛 ≥1, 𝑎∈ ^𝐴𝑛\ {^𝑟𝑛(^𝑎^?)}, 𝐾_𝑎? 𝑟⁻¹

𝑛 ({^𝑎}) = ^𝑓^𝑛(^𝑎)= ^𝑞^𝑛𝑟𝑛(^𝑎^?),𝑎.

Let us check that𝐾is a kernel, i.e. that𝑎7→ ^𝐾𝑎(^𝐵)is measurable for any Borel set𝐵. For𝐵of the form𝑟⁻¹

𝑛 (^𝑎⁰), we have𝐾_𝑎(^𝐵)=^𝑞𝑟^𝑛𝑛(^𝑎),𝑎⁰, so𝑎7→ ^𝐾𝑎(^𝐵)is clearly measurable. It is readily checked that the sets𝑟⁻¹

𝑛 (^𝑎⁰) form a 𝜋-system and that the sets 𝐵 such that 𝑎 7→ ^𝐾𝑎(^𝐵) is measurable form a monotone class. The monotone class theorem then implies that this property holds for any Borel set𝐵⊂ ^𝐴.

Let us now show that𝐾characterizes uniquely the distribution of𝑋. Clearly,𝐾characterizes the distribution of𝑋^𝑛 for all 𝑛 ∈ since all the transition rates of the Markov chain 𝑋^𝑛 can be recovered as a function of 𝐾. By assumption, those distributions are consistent, in the sense that for any 𝑚 ≥ ^𝑛_{, we have} ^𝑟𝑚,𝑛(^𝑋^𝑚) ⁽=^𝑑⁾ ^𝑋^𝑛^{, where}⁽=^𝑑⁾ denotes equality in distribution. Then, by Kolmogorov’s extension theorem, there is a unique distribution for

𝑋 which satisfies𝑟_𝑛(^𝑋)⁽=^𝑑⁾ ^𝑋^𝑛^{for all}^𝑛 ∈.

(8)

Let us now note𝑟_𝑛(^𝑎) = ^𝑎^𝑛^{for any} ^𝑎 ∈ ^𝐴to ease the notation. Note that the infinitesimal generator𝐺_𝑛of the continuous-time finite-space Markov chain𝑋^𝑛is then given by

𝐺_𝑛𝑓(^𝑎𝑛)= X

𝑏𝑛∈𝐴𝑛\{𝑎𝑛}

𝑞^𝑛

𝑎,𝑏(^𝑓(^𝑏𝑛) −^𝑓(^𝑎𝑛))

=∫

𝐴

𝐾_𝑎(_d^𝑏) ^𝑓(^𝑏𝑛) − ^𝑓(^𝑎𝑛) ,

for any function𝑓 : 𝐴_𝑛→and𝑎∈ ^𝐴. Let us see that this result holds in the limit𝑛→ ∞, at least for a class of continuous functions𝑓 : 𝐴→. Whether the preceding result holds for a continuous function𝑓 will depend on its modulus of continuity𝜔_𝑓 : [0,∞) → [0,∞) defined for𝜀> 0 by

𝜔_𝑓(^𝜀):=^sup{|^𝑓(^𝑎) −^𝑓(^𝑎⁰)|, 𝑎,𝑎⁰ ∈ ^𝐴,𝑑(^𝑎,𝑎⁰) ≤^𝜀}, which is always finite since𝐴is compact.

Proposition 3. Let 𝑋 be a projective Markov process defined on the compact space (^𝐴_,^𝑑)_, inverse limit of a finite inverse system(^𝐴𝑛,𝑛 ∈), and consider its characteristic kernel𝐾 as given by Proposition2.

Let𝑘_𝑛:= ^max^𝑎^∈^𝐴^𝐾^𝑎(^𝐴\^𝑟⁻_𝑛¹({^𝑎𝑛}))denote the maximum jump rate of the Markov chain 𝑋^𝑛. Consider a function 𝑓 : 𝐴 → with a modulus of continuity denoted by 𝜔_𝑓, and suppose 𝜔_𝑓(₁/^𝑛)^𝑘²

𝑛+¹→₀_as^𝑛→ ∞_.

Then for every𝑎 ∈ ^𝐴, the function 𝑏 7→ (^𝑓(^𝑏) −^𝑓(^𝑎))_is ^𝐾𝑎-integrable and the infinitesimal generator𝐺of the Markov process𝑋 is well-defined on 𝑓 and satisfies

𝐺 𝑓(^𝑎)= ^lim

𝑡→0

𝑎𝑓(^𝑋𝑡) −^𝑓(^𝑎)

𝑡 =

∫

𝐴

𝐾_𝑎(d𝑏) ^𝑓(^𝑏) −^𝑓(^𝑎)

. (1)

Proof. First, note that if 𝑘_𝑛 = ^{0 for all} ^𝑛^{, then} ^𝐾^𝑎 = ^{0 for all} ^𝑎 ∈ ^𝐴and the process 𝑋 is almost surely constant, so (1) is correct. We now assume that𝑘_𝑛 >0 for𝑛large enough.

Fix𝑎∈ ^𝐴. Let us first check that𝑏7→ (^𝑓(^𝑏) −^𝑓(^𝑎))_is^𝐾𝑎-integrable. Let 𝐵₀ := ^𝐴\^𝑟⁻₁¹({^𝑎𝑛}) and for𝑛≥ 1,𝐵_𝑛:=^𝑟⁻𝑛¹({^𝑎𝑛}) \^𝑟⁻_𝑛₊¹₁({^𝑎𝑛+¹}), and notice that

∫

𝐴

𝐾_𝑎(d𝑏) |^𝑓(^𝑏) − ^𝑓(^𝑎)| ≤^𝐾𝑎(^𝐵₀)^𝜔𝑓(2)+

∞

X

𝑛=¹

∫

𝐵𝑛

𝐾_𝑎(d𝑏)^𝜔𝑓(1/^𝑛)

= ^𝑘1𝜔_𝑓(2)+X^∞

𝑛=¹

(^𝑘𝑛+¹−^𝑘𝑛)^𝜔𝑓(1/^𝑛). (2) By assumption, 𝜔_𝑓(1/^𝑛)^𝑘²_𝑛₊₁ → 0, so we have 𝜔_𝑓(1/^𝑛) = ^{𝑜 𝑘}⁻𝑛+²¹

, and since (^𝑘𝑛)𝑛 is a positive, nondecreasing sequence,

∞

X

𝑛=^𝑁

𝑘_𝑛₊₁−^𝑘𝑛

𝑘²

𝑛+¹

≤

∞

X

𝑛=^𝑁

𝑘_𝑛₊₁−^𝑘𝑛

𝑘_𝑛₊₁𝑘_𝑛 =

∞

X

𝑛=^𝑁

1 𝑘_𝑛

− ¹ 𝑘_𝑛₊₁

≤ ¹ 𝑘_𝑁,

which is finite for𝑁 such that𝑘_𝑁 >0. It follows that the sum in (2) is finite, so the function 𝑏7→ (^𝑓(^𝑏) −^𝑓(^𝑎))is 𝐾_𝑎-integrable.

(9)

Now for each𝑛∈ , consider a family(^𝑎¹,𝑎², . . . ,𝑎^𝑝) ∈ ^𝐴^𝑝such that𝐴_𝑛 ={^𝑎𝑛,𝑎¹

𝑛,𝑎²

𝑛, . . . ,𝑎

𝑝 𝑛} with no repetition, i.e. such that𝑝+¹= |^𝐴𝑛|. Now let us define for all𝑏∈ ^𝐴,𝑓_𝑛(^𝑏):= ^𝑓(^𝑎^𝑖) if and only if𝑏_𝑛 = ^𝑎^𝑖^𝑛. Notice that 𝑓_𝑛is an approximation of 𝑓, in the sense that the error function𝑔_𝑛 : 𝑏 7→ (^𝑓(^𝑏) − ^𝑓𝑛(^𝑏))necessarily satisfies|^𝑔𝑛(^𝑏)| ≤ ^𝜔𝑓(1/^𝑛). Note also that by definition,𝑓_𝑛(^𝑎)= ^𝑓(^𝑎).

Let us here treat the case when there exists𝑛≥ 1 such that𝜔_𝑓(1/^𝑛)=0. By the preceding remark, we have 𝑓_𝑛 = ^𝑓, in other words there exists an application e^𝑓𝑛 : 𝐴_𝑛 → such that 𝑓(^𝑏) = e^𝑓𝑛(^𝑏𝑛) = e^𝑓𝑛(^𝑟𝑛(^𝑏))_{. So} 𝑎𝑓(^𝑋𝑡) = 𝑎e^𝑓𝑛(^𝑟𝑛(^𝑋𝑡)), and since(^𝑟𝑛(^𝑋𝑡)_,^𝑡 ≥ ₀)_{is a} finite-state-space continuous-time Markov chain, it is immediate that

𝑎𝑓(^𝑋𝑡)= ^𝑓(^𝑎)+^𝑡 ^𝑝 X

𝑖=¹

𝑞^𝑛

𝑎,𝑎^𝑖(^𝑓(^𝑎^𝑖) − ^𝑓(^𝑎))

+^𝑂 (^{𝑡 𝑘}𝑛)²k^𝑓k_∞ , wherek^𝑓k_∞ := ^sup^𝑏∈^𝐴|^𝑓(^𝑏)|, and where the constant in the term𝑂 (^{𝑡 𝑘}𝑛)²k^𝑓k_∞

does not depend on𝑡,𝐾 or 𝑓. From this it is clear that

𝑎𝑓(^𝑋𝑡) − ^𝑓(^𝑎) 𝑡

−→

𝑡→0 𝑝

X

𝑖=¹

𝑞^𝑛

𝑎,𝑎^𝑖(^𝑓(^𝑎^𝑖) − ^𝑓(^𝑎))=

∫

𝐴

𝐾_𝑎(d𝑏)(^𝑓(^𝑏) − ^𝑓(^𝑎)).

Now let us assume that for all𝑛 ≥1,𝜔_𝑓(1/^𝑛)> 0. Since𝑓_𝑛(^𝑏)depends only on𝑏_𝑛, we can write

𝑎𝑓_𝑛(^𝑋𝑡)= ^𝑓(^𝑎)+^𝑡∫

𝐴

𝐾_𝑎(_d^𝑏)(^𝑓𝑛(^𝑏) − ^𝑓(^𝑎))+^𝑂 (^{𝑡 𝑘}𝑛)²k^𝑓k_∞

= ^𝑓(^𝑎)+^𝑡

∫

𝐴\^𝑟𝑛⁻¹({^𝑎𝑛})

𝐾_𝑎(d𝑏)(^𝑓(^𝑏) −^𝑓(^𝑎))+^𝑂(^𝑡𝜔𝑓(1/^𝑛)^𝑘𝑛)+^𝑂 (^{𝑡 𝑘}𝑛)²k^𝑓k_∞ , Notice also that

𝑎𝑓(^𝑋𝑡) −𝑎𝑓_𝑛(^𝑋𝑡) 𝑡

≤

𝜔_𝑓(1/^𝑛) 𝑡 , so that putting everything together, we have

𝑎𝑓(^𝑋𝑡) − ^𝑓(^𝑎)

𝑡 =∫

𝐴\𝑟⁻¹_𝑛 ({𝑎𝑛})

𝐾_𝑎(_d^𝑏) (^𝑓(^𝑏) −^𝑓(^𝑎))+^𝑂

𝜔_𝑓(₁/^𝑛)^𝑘𝑛+ ^𝜔^𝑓(1/^𝑛) 𝑡 +^{𝑡 𝑘}²^𝑛

. (3) If one can find𝑛 = ^𝑛(^𝑡) _{such that}^𝑛 → ∞_,^𝜔𝑓(₁/^𝑛)/^𝑡 → _{0 and}^{𝑡 𝑘}²_𝑛 → _{0 as}^𝑡 → _{0, then} passing to the limit in (3), by using the dominated convergence theorem for the integral, yields (1).

Now let us define for all𝑚 ≥ 1,𝑡_𝑚 := p

𝜔_𝑓(1/^𝑚)/^𝑘𝑝 and𝑡⁰

𝑚 := p

𝜔_𝑓(1/^𝑚)/^𝑘𝑚+¹. Notice that

𝑡_𝑚 ≥ ^𝑡_𝑚⁰ ≥^𝑡𝑚+¹ −→

𝑚→∞ 0,

so for each𝑡 ∈ (_0,^𝑡₁], there is an𝑚 ≥ 1 such that𝑡 ∈ [^𝑡𝑚+¹,𝑡_𝑚]_{. Then,}

• if𝑡 ≥ ^𝑡⁰_𝑚, let𝑛(^𝑡):=^𝑚, and we check 𝜔_𝑓(1/^𝑛)/^𝑡 ≤^𝜔𝑓(1/^𝑛)/^𝑡_𝑛⁰ =q

𝜔_𝑓(1/^𝑛)^𝑘𝑛+¹, and 𝑡 𝑘²

𝑛 ≤ ^𝑡𝑛𝑘²

𝑛=q

𝜔_𝑓(1/^𝑛)^𝑘𝑛;

(10)

• if𝑡 ≤ ^𝑡⁰_𝑚, let𝑛(^𝑡):=^𝑚+1, and we check 𝜔_𝑓(1/^𝑛)/^𝑡 ≤ ^𝜔𝑓(1/^𝑛)/^𝑡𝑛= q

𝜔_𝑓(1/^𝑛)^𝑘𝑛, and 𝑡 𝑘²

𝑛 ≤^𝑡_𝑛⁰₋₁^𝑘²𝑛 =q

𝜔_𝑓(1/(^𝑛−1))^𝑘𝑛. Since we assumed that𝜔_𝑓(₁/^𝑛) > 0 for all 𝑛, then 𝑡_𝑚 > 0 for all 𝑚, which implies that necessarily𝑛(^𝑡) → ∞as𝑡 → 0. Finally, the assumption that𝜔_𝑓(1/^𝑛)^𝑘²_𝑛₊₁→ 0 as𝑛 → ∞ ensures us that both𝜔_𝑓(₁/^𝑛)/^𝑡_and^{𝑡 𝑘}²_𝑛tend to 0 as𝑡 →0, which concludes the proof.

We are now interested in exchangeable projective Markov processes with values in the space of nested partitionsP_∞^2,, as an extension of univariate fragmentation processes (with values inP_∞_).

3.2 Strongly exchangeable Markov process

In the following, we write P for either P_∞ _or P_∞^2,, when our assertions are valid for both spaces. We will also writeP^𝑛for P𝑛or P𝑛^2,. A key property of those spaces is the following.

For any𝑛 ∈, and any𝜋∈_P𝑛, there is a𝜋^?∈ _Psatisfying:

• 𝜋^?

|[𝑛] =^𝜋

• for any𝜋⁰ ∈ _P such that𝜋⁰

|[^𝑛] =^𝜋, there is an injection𝜎: → _which satisfies𝜎_|[_𝑛_] =^id^[^𝑛^]^and(^𝜋^?)^𝜎=^𝜋⁰^.

Indeed for instance inP= P_∞, it is easy to choose a𝜋^?with an infinity of infinite blocks and no finite blocks, and such that 𝜋^?

|[^𝑛] = ^𝜋. This partition satisfies immediately the required property. We will call any such𝜋^?auniversal element ofPwith initial part𝜋 whenever we need to use one.

Proposition 4. LetΠ =(_Π(^𝑡)_,^𝑡 ≥0)be an exchangeable Markov process taking values inP with càdlàg sample paths. The following propositions are equivalent:

(i) Πis strongly exchangeable.

(ii) Πhas the projective Markov property, i.e.Π^𝑛 := (_Π(^𝑡)_|[𝑛],𝑡 ≥ 0)is a Markov chain for all𝑛 ∈ _.

Remark 5. Crane and Towsner [8, Theorem 4.26] show that the projective Markov property is equivalent to the Feller property for exchangeable Markov process taking values in a Fraïssé space (i.e. a space satisfying general “stability and universality” assumptions [see 8, Definitions 4.4 to 4.11]). In particular the space of partitions and the space of nested partitions are Fraïssé spaces (the argument essentially being the existence of so-called universal elements𝜋^?), so for the processes we consider, strong exchangeability is equivalent to the Feller property.

Proof. (^𝑖) ⇒ (^𝑖𝑖): Let𝑛 ∈ and𝜋∈ _P𝑛. Fix a universal𝜋^?∈ _Pwith initial part𝜋. Now take any𝜋₀∈ _P_{such that}(^𝜋₀)_|[𝑛] =^𝜋, and an injection𝜎:→_{such that}^𝜎_|[𝑛] =^id|[^𝑛]

(11)

and(^𝜋^?)^𝜎=^𝜋0. Now we have

𝜋₀(_Π^𝑛∈ ·)= 𝜋^?((_Π^𝜎)^𝑛∈ ·)

= 𝜋^?(_Π^𝑛∈ ·)_,

so this distribution depends only on𝜋, which proves thatΠ^𝑛is a Markov process. Now the assumption thatΠhas càdlàg sample paths ensures that the processΠ^𝑛stays some positive time in each visited statea.s.ThereforeΠ^𝑛is a continuous-time Markov chain.

(^𝑖𝑖) ⇒ (^𝑖)_{: Let}^𝜎 _: → be an injection. For 𝑛 ∈ _{, let}^𝜏be a permutation of_such that𝜏_|[_𝑛_] =^𝜎|[^𝑛]. This property implies(^𝜋^𝜏)_|[𝑛] =(^𝜋^𝜎)_|[𝑛] for any𝜋 ∈P. We deduce

𝜋((_Π^𝜎)^𝑛 ∈ ·)=𝜋((_Π^𝜏)^𝑛 ∈ ·)

=𝜋^𝜏(_Π^𝑛∈ ·)

=𝜋^𝜎(_Π^𝑛 ∈ ·)

where the last equality is a consequence of the projective Markov property (the distribution ofΠ^𝑛under𝜋depends only on the initial segment𝜋_|[_𝑛_]). Since it is true for all𝑛, we have

𝜋(_Π^𝜎 ∈ ·)=𝜋^𝜎(_Π∈ ·), which proves the property of strong exchangeability.

Remark 6. To be strongly exchangeable is strictly stronger than being exchangeable. To see that, define the Markov processΠ=(_Π(^𝑡)_,^𝑡 ≥ ₀)taking values inP_∞ _by:

• If𝜋∈ P_∞has an infinite number of blocks, then letΠunder𝜋be almost surely the constant function equal to𝜋.

• If𝜋∈ P_∞has a finite number of blocks, let𝑇 be an Exponential(1) random variable, and let the distribution ofΠunder𝜋 be that of the random function:

𝑡 7→

(

𝜋 if𝑡 <^𝑇 0∞ if𝑡 ≥^𝑇 ThenΠis clearly exchangeable but not strongly exchangeable.

Proposition 7. LetΠ =(_Π(^𝑡)_,^𝑡 ≥₀)be a strongly exchangeable Markov process inP. Then there is a unique kernel𝐾 fromP toPsuch that

• for all𝜋₀ ∈_{P, we have} ^𝐾𝜋₀({^𝜋₀})=0,

• for all𝜋₁ ∈ _P𝑛, for all 𝜋₂ ∈ _P𝑛\ {^𝜋₁}, the Markov chain Π^𝑛 has a transition rate from𝜋₁to𝜋₂equal to

𝐾_𝜋

0 𝜋_|[_𝑛_] = ^𝜋2

, where𝜋₀is any element ofPsuch that(^𝜋₀)_|[𝑛] =^𝜋1.

Furthermore this kernel is strongly exchangeable, i.e. for any 𝜋₀ ∈ _P and any injection 𝜎:→_{, we have}

𝐾^𝜎

𝜋₀ = ^𝐾^𝜋^𝜎₀^.

(12)

Proof. The first part of the proposition is an immediate consequence of Proposition 2. It remains only to prove that 𝐾 is strongly exchangeable. Consider 𝜋₀ ∈ _P, 𝑛 ∈ , 𝜋⁰ ∈ P^𝑛\ {(^𝜋₀)_|[𝑛]}and an injection𝜎: →_{. We have}

1 𝑡

𝜋₀ (_Π(^𝑡)^𝜎)_|[𝑛] =^𝜋⁰ = ¹ 𝑡

𝜋^𝜎

0 Π(^𝑡)_|[𝑛] = ^𝜋⁰ because of the exchangeability ofΠ, and taking limits we find

𝐾_𝜋

0 (^𝜋^𝜎)_|[𝑛] =^𝜋⁰

= ^𝐾^𝜋^𝜎₀ ^𝜋|[^𝑛] =^𝜋⁰ . So the two𝜎-finite measures 𝐾^𝜎

𝜋₀ and 𝐾_𝜋^𝜎

0 coincide on the sets of the form {^𝜋_|[𝑛] = ^𝜋⁰}, which constitute a 𝜋-system generating the Borel sets of P. Therefore they are equal,

which concludes the proof.

Remark 8. Consider a universal element 𝜋^? ∈ _P such that for any 𝜋 ∈ P, there is an injection𝜎such that𝜋=(^𝜋^?)^𝜎. The exchangeability property of the kernel𝐾then implies that𝐾_𝜋= ^𝐾𝜋^𝜎^?, therefore𝐾 is entirely determined by the single measure𝐾

𝜋^?. 3.3 Univariate results, mass partitions

Random exchangeable partitions𝜋 ∈ P_∞ and their relation to random mass partitions is well known [see3, Chapter 2]. Let us recall briefly some definitions and results, which we will then extend to the nested case. We define the space of mass partitions

Pm :=

s=(^𝑠₁_,^𝑠₂_{, . . .}) ∈ [_{0, 1}]_, ^𝑠₁≥ ^𝑠₂ ≥ _{. . . ,} P

𝑘𝑠_𝑘 ≤ ₁ _. ₍₄₎

Fors∈ _P_m, one defines an exchangeable distribution𝜚_sonP_∞, by the following so-called paintbox construction:

• for𝑘 ≥ 0, define𝑡_𝑘 =P^𝑘

𝑘⁰=¹𝑠_𝑘0, with𝑡₀=0 by convention.

• let(^𝑈𝑖,𝑖≥ ₁)be an i.i.d. sequence of uniform random variables in[_{0, 1}]_.

• define the random partition𝜋∈ P_∞ by setting

𝑖∼^𝜋 ^𝑗 ⇐⇒ ^𝑖= ^𝑗^or∃^𝑘 ≥ 1,𝑈_𝑖,𝑈_𝑗 ∈ [^𝑡𝑘−1,𝑡_𝑘). Note that the set 𝜋₀ := {[^𝑡𝑘−1,𝑡_𝑘)_,^𝑘 ≥ ₁} ∪ {{^𝑡}_, P

𝑘≥1𝑠_𝑘 ≤ ^𝑡 ≤ ₁} is a partition of [0, 1], and that we have𝜋= ^𝜋^𝜎₀^{, where}^𝜎^: → [0, 1]is the random injection defined by 𝜎 : 𝑖 7→ ^𝑈𝑖. Also, note that by definition some blocks are singletons (blocks {^𝑖} such that 𝑈_𝑖 ∈ [P

𝑘≥1𝑠_𝑘,1]), and by construction we have

#{^𝑖 ∈ [^𝑛]_, {^𝑖} ∈ ^𝜋} 𝑛

−→

𝑛→∞

𝑠₀:=¹−P

𝑘≥1𝑠_𝑘.

These integers that are singleton blocks are called thedustof the random partition𝜋and the last display tells us there is a frequency𝑠₀of dust.

Conversely, any random exchangeable partition𝜋has a distribution that can be expressed with these paintbox constructions𝜚_s. Indeed,𝜋hasasymptotic frequencies, i.e.

|^𝐵|:= ^lim

𝑛→∞

#(^𝐵∩ [^𝑛])

𝑛 exists a.s. for all𝐵∈ ^𝜋.

(13)

Let us write|^𝜋|^↓ ∈ _P_mfor the decreasing reordering of(|^𝐵|,𝐵∈ ^𝜋), ignoring the zero terms coming from the dust. Now it is known [14, Theorem 2] that the conditional distribution of𝜋given|^𝜋|^↓ =^s^is^𝜚^s, so we have

(^𝜋∈ · )=∫

(|^𝜋|^↓ ∈ ds)^𝜚_s( · )_.

This means that any exchangeable probability measure onP_∞is of the form𝜚_𝜈 where𝜈is a probability measure onPm, and

𝜚_𝜈( · ) :=∫

Pm

𝜚_s( · )^𝜈(ds).

Furthermore, Bertoin [3, Theorem 3.1] shows that any exchangeable measure 𝜇 on P_∞ such that

∀^𝑛≥ _1, ^𝜇(^𝜋_|[𝑛] ,¹[^𝑛])< ∞ ₍₅₎ can be written𝜇= ^𝑐e+^𝜚^𝜈^{, where}^𝑐 ≥ 0,𝜈is a measure onPmsatisfying

∫

Pm

(1−^𝑠₁)^𝜈(ds) <∞, (6) andeis the so-callederosion measure, defined by

e:=P

𝑖∈𝛿_{{ {}_𝑖_}_,_\{_𝑖_{} }}.

As a result, each fragmentation process with values in P_∞ is characterized by its erosion coefficient𝑐and characteristic measure𝜈, in such a way that its rates can be described as follows:

A block of size 𝑛 fragments, independently of the other blocks, into a partition with𝑘different blocks of sizes𝑛₁,𝑛₂, . . . ,𝑛_𝑘 with rate

𝑐1{^𝑘=^{2, and} ^𝑛¹=^{1 or}^𝑛²=¹}+

∫

Pm

𝜈(ds)X

i

𝑠^𝑛¹

𝑖₁ ·^𝑠^𝑛²

𝑖₂ · · ·^𝑠^𝑛^𝑘

𝑖𝑘, where 𝑠₀ is defined to be 1 − P

𝑖≥1𝑠_𝑖, and the sum is over the vectors i = (^𝑖₁, . . . ,𝑖_𝑘) ∈ {0, 1, . . .}^𝑘 such that 𝑖_𝑗 may be 0 only if 𝑛_𝑗 = ^{1, and if} ^𝑗 , ^𝑗⁰ ^and 𝑖_𝑗 ,0, then𝑖_𝑗0 , ^𝑖^𝑗.

We aim at showing a similar result concerning fragmentations of nested partitions.

4 Outer branching property

From now on, to be able to give a more precise characterization of nested fragmentation processes, we will exclude from the study those processes which exhibit simultaneous fragmentations in separate blocks. That is, we will assume a branching property: two different blocks at a given time undergo two independent fragmentations in the future. In the univariate case, Bertoin [3, Definition 3.2] expresses the branching property thanks to the introduction of a mapping Frag : P_∞× P_∞ → P_∞. While a similar definition could be

(14)

made in the nested case, the analog of the Frag mapping would be too lengthy to introduce and we found simpler to assume an equivalent fact, which is all we will use in later proofs:

distinct blocks fragment at distinct times.

We also need to distinguish two branching properties in the case of nested fragmentations, each concerning either the outer or the inner blocks (branching property for𝜉or for𝜁).

Definition 9. Let Π = (_Π(^𝑡),𝑡 ≥ 0) = ((^𝜁(^𝑡),𝜉(^𝑡)),𝑡 ≥ 0) be a strongly exchangeable Markov process with values in P_∞^2, and decreasing càdlàg sample paths. We say that Π satisfies theouter branching propertyif

Almost surely for all 𝑡 such thatΠ(^𝑡−) , Π(^𝑡), there is a unique block 𝐵 ∈ ^𝜉(^𝑡−) such thatΠ(^𝑡−)_|𝐵 ,Π(^𝑡)_|𝐵.

Moreover, we say thatΠsatisfies theinner branching propertyif

Almost surely for all 𝑡 such that 𝜁(^𝑡−) , ^𝜁(^𝑡), there is a unique block 𝐵 ∈ ^𝜁(^𝑡−) such that𝜁(^𝑡−)_|𝐵 ,^𝜁(^𝑡)_|𝐵.

Nested fragmentations processes satisfying both branching properties will be calledsimple.

The rest of the paper is dedicated to characterize as simply and precisely as possible simple nested fragmentations processes.

Proposition 10. LetΠ = (_Π(^𝑡),𝑡 ≥ 0) = ((^𝜁(^𝑡),𝜉(^𝑡)),𝑡 ≥ 0)be a strongly exchangeable Markov process with values in P_∞^2, and decreasing càdlàg sample paths. Write 𝐾 for its exchangeable characteristic kernel.

IfΠsatisfies theouter branching property, then the characteristic kernel 𝐾is characterized by a simpler kernel𝜅fromP_∞ _toP_∞^2, which is defined as

𝜅_𝜁( · ) _:= ^𝐾⁽^𝜁,1)( · )_,

where1denotes the partition of with only one block. The simpler kernel is also strongly exchangeable.

The kernel𝐾is determined by𝜅in the following way: fix𝜋₀=(^𝜁,𝜉) ∈ P_∞^2,and for simplicity suppose that all the blocks of𝜉are infinite. For all 𝐵 ∈ ^𝜉, define an injection𝜎_𝐵 : → whose image is𝐵, and𝜏_𝐵 : 𝐵 → _{such that}^𝜎𝐵◦^𝜏𝐵 = ^id^𝐵. By definition, (^𝜋₀)^𝜎^𝐵 _{is of the} form(^𝜁𝐵,1)_{, with}^𝜁𝐵 = ^𝜁^𝜎^𝐵. Now define 𝑓_𝐵 as the application which maps𝜋 ∈ P_∞^2, _{to the} unique𝜔∈ P_∞^2, _{such that}

• 𝜔 ({^𝐵,\^𝐵},{^𝐵,\^𝐵})_,

• 𝜔_|_𝐵 =^𝜋^𝜏^𝐵 and𝜔_|\_𝐵 =(^𝜋₀)_|\𝐵. Then for any Borel set 𝐴⊂ P_∞^2,_{, we have}

𝐾_𝜋

0(^𝐴)= X

𝐵∈^𝜉

𝜅_𝜁

𝐵({^𝑓𝐵(^𝜋) ∈ ^𝐴} ∩ {^𝜋,(^𝜋₀)^𝜎^𝐵}). Remark 11. This proposition shows how 𝐾_𝜋

0 is expressed in terms of the kernel𝜅only for 𝜋₀ =(^𝜁,𝜉)such that all the blocks of𝜉are infinite. In fact this is enough to characterize𝐾 entirely since if𝜋₀does not satisfy this property, there exists a nested partition𝜋⁰

0= (^𝜁⁰,𝜉⁰)

Trees within trees II: Nested Fragmentations

HAL Id: hal-01842036

https://hal.archives-ouvertes.fr/hal-01842036

Trees within trees II: Nested Fragmentations

Jean-Jil Duchamps

To cite this version:

Trees within trees II: Nested Fragmentations

Contents

1 Introduction

2 Definitions, notation

3 Projective Markov property and strong exchangeability

4 Outer branching property