The Aldous diffusion on continuum trees Soumik Pal University of - - PowerPoint PPT Presentation

the aldous diffusion on continuum trees
SMART_READER_LITE
LIVE PREVIEW

The Aldous diffusion on continuum trees Soumik Pal University of - - PowerPoint PPT Presentation

The Aldous diffusion on continuum trees Soumik Pal University of Washington, Seattle Vienna probability seminar Jun 11, 2019 Noah Forman, Douglas Rizzolo, Matthias Winkel arXiv:1804.01205, 1802.00862, 1609.06707 Thanks to NSF, UW RRF, EPSRC


slide-1
SLIDE 1

The Aldous diffusion on continuum trees

Soumik Pal

University of Washington, Seattle

Vienna probability seminar Jun 11, 2019

slide-2
SLIDE 2

Noah Forman, Douglas Rizzolo, Matthias Winkel arXiv:1804.01205, 1802.00862, 1609.06707

Thanks to NSF, UW RRF, EPSRC for grant support

slide-3
SLIDE 3

Part 1 The Aldous diffusion conjecture

slide-4
SLIDE 4

Aldous down-up chain

1 4 6 2 5 3 1 4 6 2 5 4 2 5 6 1 4 2 5 6 1 6 1 4 2 5 6 1 4 2 5 3

Markov chain on rooted leaf-labeled binary trees. Each transition has two parts.

◮ Down-move: delete unif random leaf, contract away parent

branch point.

◮ Up-move: select unif random edge, insert branch point, grow

  • ut new leaf-edge.
slide-5
SLIDE 5

Results

Proposition (Aldous ’01)

This is stationary with unif distrib on leaf-labeled binary trees.

Theorem (Schweinsberg ’01)

Relaxation time of Aldous chain on n-leaf trees is Θ(n2).

Conjecture (Aldous ’99)

This Markov chain has a continuum analogue: a continuum random tree-valued diffusion, stationary w/ law of Brownian CRT.

slide-6
SLIDE 6

What is a Brownian CRT? Aldous, Le Gall, ...

◮ Tree as a metric space with edge length 1/√n. n → ∞. ◮ Harris path representation (Harris ’52):

(CRT Figure due to I. Kortchemski)

slide-7
SLIDE 7

History and context

◮ Theoretical motivation: to construct a fundamental object –

“Brownian motion on R-tree space”.

◮ Applied motivation: Aldous diffusion and projected processes

are useful for inference on phylogenetic trees and genetic

  • modeling. E.g., Ethier-Kurtz-Petrov diffusion.

◮ See: Evans-Winter ’06, Evans-Pitman-Winter ’06, Crane ’14. ◮ Very recent related work: L¨

  • hr-Mytnik-Winter ’18. Analysis

without metric.

slide-8
SLIDE 8

Our result

◮ We have a pathwise construction of the

continuum-tree-valued analogue to the Aldous chain, stationary under BCRT (among other features).

◮ Forman-P.-Rizzolo-Winkel. “Aldous diffusion I: A projective

system of continuum k-tree evolutions.” ArXiv:1089.07756 [math.PR].

◮ For the remainder of this talk, we discuss this construction.

slide-9
SLIDE 9

Key challenge: perfectly ephemeral leaves

◮ Time scaling is by n2, where n is number of leaves. ◮ Takes O(n log(n)) moves to replace every leaf. In O(n2)

moves, w/ high probability, every leaf is replaced.

◮ Challenge: moves defined in terms of leaves, but in limit leaves

die instantly. Makes it difficult to describe limiting object.

◮ Strategy: re-orient; focus on branch points.

slide-10
SLIDE 10

Part 2 Projections and Intertwinings

slide-11
SLIDE 11

Intutition

◮ Brownian CRT can be constructed as a projective limit of

consistent finite trees.

◮ Idea goes back to original construction of Aldous. ◮ One can try a similar strategy in dynamics.

slide-12
SLIDE 12

Spinal projection (discrete regime)

2 1 2 4 5 leaf 1 leaf 2

8 14

1

13 10

2

7 6 11 9 5 12 4 3

slide-13
SLIDE 13

Spinal projection (discrete regime)

2 3 1 1 2 1 2 1 1 leaf 1 leaf 4 leaf 3 leaf 5 leaf 2 1 4

7 6 10 14 13 9 11 8

3 2

12

5

slide-14
SLIDE 14

Taking the limit

Idea: Fix j and consider what happens when n → ∞ in the projected trees.

◮ Take proportions of leaf masses in each component.

slide-15
SLIDE 15

Taking the limit

Idea: Fix j and consider what happens when n → ∞ in the projected trees.

◮ Take proportions of leaf masses in each component. ◮ P ’13: Wright-Fisher diffusion with negative mutation rates

finds limit up until the first time a labeled block vanishes.

slide-16
SLIDE 16

Taking the limit

Idea: Fix j and consider what happens when n → ∞ in the projected trees.

◮ Take proportions of leaf masses in each component. ◮ P ’13: Wright-Fisher diffusion with negative mutation rates

finds limit up until the first time a labeled block vanishes.

◮ What to do when a coordinate hits zero? ◮ FPRW: solves by resampling.

slide-17
SLIDE 17

Taking the limit

Idea: Fix j and consider what happens when n → ∞ in the projected trees.

◮ Take proportions of leaf masses in each component. ◮ P ’13: Wright-Fisher diffusion with negative mutation rates

finds limit up until the first time a labeled block vanishes.

◮ What to do when a coordinate hits zero? ◮ FPRW: solves by resampling. ◮ FPRW: There is a way to do this consistently over j that

allows taking projective limits. Intertwining.

slide-18
SLIDE 18

Taking the limit

Idea: Fix j and consider what happens when n → ∞ in the projected trees.

◮ Take proportions of leaf masses in each component. ◮ P ’13: Wright-Fisher diffusion with negative mutation rates

finds limit up until the first time a labeled block vanishes.

◮ What to do when a coordinate hits zero? ◮ FPRW: solves by resampling. ◮ FPRW: There is a way to do this consistently over j that

allows taking projective limits. Intertwining. Then let j → ∞.

slide-19
SLIDE 19

Spinal projection (continuum regime)

Continuum 5-tree w/ interval partitions.

Σ4 leaf 3 leaf 5 leaf 2 leaf 1 leaf 4 Σ3 Σ5 Σ2 Σ1 ρ

Interval partition (IP) β of [0, M]: a collection of disjoint, open intervals that cover [0, M] up to Leb-null set.

slide-20
SLIDE 20

Interval partitions

Interval partition (IP) β of [0, M]: a collection of disjoint, open intervals that cover [0, M] up to Leb-null set. Example: Excursion intervals of standard Brownian bridge. Call this a Poisson-Dirichlet 1

2, 1 2

  • interval partition, PDIP

1

2, 1 2

  • .
slide-21
SLIDE 21

Spinal projection of BCRT; Pitman-Winkel ’15

Σ4 leaf 3 leaf 5 leaf 2 leaf 1 leaf 4 Σ3 Σ5 Σ2 Σ1 ρ ◮ Dirichlet

1

2, . . . , 1 2

  • mass split among the 5 external and 4

internal components.

◮ Split the mass in each internal edge into an indep. PDIP

1

2, 1 2

  • .

We can recover path lengths from this picture, as diversities of interval partitions, Div(β) = lim

h→0

√ h#{U ∈ β : Leb(U) > h}.

slide-22
SLIDE 22

Projected diffusion on interval partitions

◮ One can recover the tree metric from diversity of interval

partitions.

◮ The Aldous diffusion projected to interval partitions is also

Markov.

◮ Select j leaves. Construct process of interval partitions from

the projected masses.

◮ If we can describe it, and repeat consistency over j, that gives

a projective limit as j → ∞.

◮ The limit is the Aldous diffusion itself.

slide-23
SLIDE 23

Projected diffusion on interval partitions

◮ One can recover the tree metric from diversity of interval

partitions.

◮ The Aldous diffusion projected to interval partitions is also

Markov.

◮ Select j leaves. Construct process of interval partitions from

the projected masses.

◮ If we can describe it, and repeat consistency over j, that gives

a projective limit as j → ∞.

◮ The limit is the Aldous diffusion itself. ◮ What is the dynamics on each interval partition?

slide-24
SLIDE 24

Part 3 Dynamics on interval partitions

slide-25
SLIDE 25

Projected chains and Chinese Restaurants

◮ Due to Dubins-Pitman ◮ CRP(α, θ), α ∈ [0, 1), θ > −α. E.g., α = 1 2, θ = 1 2. ◮ Customer n will join table w/ m other customers w/ weight

m − α.

◮ Or, sit at empty table w/ weight θ + α(# of tables).

1 − α 4 + θ Probabilities of customer 5 joining each table θ + 2α 4 + θ 3 − α 4 + θ

slide-26
SLIDE 26

A Chinese restaurant with re-seating

◮ Markov chain on composition/ partitions of [n]. ◮ Transition rule: uniform random customer leaves, then

re-enters according to CRP(α, θ) seating rule.

◮ See Petrov ’09; Borodin-Olshanski ’09

slide-27
SLIDE 27

Aldous chain as re-seating

2 2 2 2 3 2 1 2 2 1 2 1 1

slide-28
SLIDE 28

Poissonized down-up CRP

1 − 1

2

3 − 1

2 1 2

2 − 1

2

3 − 1

2 1 2 1 2 1 2

◮ each customer leaves after Exponential(1) time, ◮ for a table w/ m customers, add customers with rate m − 1 2, ◮ between any two tables, insert new tables w/ rate 1 2.

slide-29
SLIDE 29

Table populations

Tables evolve independently of each other. Population of each is a birth-and-death chain.

1 − 1

2

3 − 1

2 1 2

2 − 1

2

3 − 1

2 1 2 1 2 1 2

When it has population m, increases w/ rate m − 1

2, decreases w/

rate m. Birth-and-death chain.

slide-30
SLIDE 30

Coding the Poissonized, ordered CRP

1 − 1

2

3 − 1

2 1 2

2 − 1

2

3 − 1

2 1 2 1 2 1 2

slide-31
SLIDE 31

Convergence

In scaling limits:

◮ Law of birth-and-death chain of table populations in

re-seating, starting from 1, converges to BESQ(−1) excursion measure, Bessel square diffusion with drift −1.

◮ Draw lines connecting deaths and births of tables. Converges

to spectrally positive Stable 3

2

  • .
slide-32
SLIDE 32

Spindles on scaffolding

◮ Decorate jumps of Stable (3/2) by ind. BESQ (−1)

excursions.

◮ Scaffolding - L´

evy process.

◮ Spindles - independent excursions hanging on jumps of

scaffolding.

slide-33
SLIDE 33

The Skewer map

For y ∈ R, to get the level y skewer:

◮ Draw a line across picture at level y. ◮ From left to right, collect cross-sections of spindles. ◮ Slide together, as if on a skewer, to remove gaps. ◮ A stochastic process on interval partitions.

slide-34
SLIDE 34

The Skewer process

◮ As line moves up from level 0, interval partition evolves

continuously.

◮ Diversity=number of existing tables=local time of

Stable(3/2)=tree metric on the spine.

slide-35
SLIDE 35

The Skewer process

◮ As line moves up from level 0, interval partition evolves

continuously.

◮ Diversity=number of existing tables=local time of

Stable(3/2)=tree metric on the spine.

slide-36
SLIDE 36

The Skewer process

◮ As line moves up from level 0, interval partition evolves

continuously.

◮ Diversity=number of existing tables=local time of

Stable(3/2)=tree metric on the spine.

slide-37
SLIDE 37

The Skewer process

◮ As line moves up from level 0, interval partition evolves

continuously.

◮ Diversity=number of existing tables=local time of

Stable(3/2)=tree metric on the spine.

slide-38
SLIDE 38

The Skewer process

◮ As line moves up from level 0, interval partition evolves

continuously.

◮ Diversity=number of existing tables=local time of

Stable(3/2)=tree metric on the spine.

slide-39
SLIDE 39

The Skewer process

◮ As line moves up from level 0, interval partition evolves

continuously.

◮ Diversity=number of existing tables=local time of

Stable(3/2)=tree metric on the spine.

slide-40
SLIDE 40

Outline of the construction of the Aldous diffusion by FPRW; chatty version

◮ Poissonize: leaves die and born independently. ◮ Project on j leaves to get (j − 1) independent skewer

processes and j leaf masses.

◮ Each skewer process is a diffusion. Each mass is BESQ. ◮ Show consistency over j by intertwining. ◮ DePoissonize by scaling and time-change. ◮ Take projective limit. Obtain process stationary with

Brownian CRT.

◮ Prove limit is Markov and continuous is GH topology.

slide-41
SLIDE 41

Building the limit

Evolving interval partitions generate a tree-valued process.

ρ Σ1 Σ2 Σ3 Σ4 Σ5 X

(5) 1

X

(5) 2

X

(5) 3

X

(5) 4

X

(5) 5

β

(5) [5]

β

(5) {1,2,4}

β

(5) {1,4}

β

(5) {3,5}

type 2 type 0 type 1 type 2

slide-42
SLIDE 42

Part 4 Application: Ethier-Kurtz-Petrov diffusions

slide-43
SLIDE 43

Ranked interval lengths and Poisson-Dirichlets

◮ Consider interval partition (IP) of [0, 1] (mass one). ◮ Consider decreasing order stats of interval mass. ◮ Kingman simplex:

∇∞ =

  • x1 ≥ x2 ≥ . . . ,
  • i∈N

xi = 1

  • .

◮ PDIP gives Poisson-Dirichlet distributions on ∇∞. ◮ PDIP (1/2, 1/2) → PD (1/2, 1/2). ◮ PDIP (α, θ) → PD (α, θ), 0 ≤ α < 1, θ > −α.

slide-44
SLIDE 44

Diffusions on the Kingman simplex

◮ Diffusions on ∇∞ reversible with respect to PD (α, θ)? ◮ Ethier-Kurtz ’81, Petrov ’10 - generator for EKP (α, θ):

  • i≥1

xi ∂2 ∂x2

i

  • i,j≥1

xixj ∂2 ∂xi∂xj −

  • i≥1

(θxi + α) ∂ ∂xi .

◮ Also see Bertoin ’08, Borodin and Olshanski ’09, Feng-Sun

’10, Feng-Sun-Wang-Xu ’11, Ruggiero and coauthors ’09, ’13,’14.

◮ Mostly analytical or Dirichlet form techniques. ◮ Understanding on path behavior missing.

slide-45
SLIDE 45

Diffusions without ranking?

◮ Theorem (FPRW)

The de-Poissonized skewer process of interval partitions, when ranked gives EKP (1/2, 1/2) diffusion on the Kingman simplex.

◮ Provides pathwise description. ◮ Can be generalized to all (α, θ) (future work). ◮ Advantage of not ranking: provides better understanding of

evolution of small blocks.

◮ Allows us to settle some conjectures by previous authors. ◮ E.g. continuity of diversity process.

slide-46
SLIDE 46

Vielen Dank!