Large fringe and non-fringe subtrees in conditional Galton-Watson - - PowerPoint PPT Presentation

large fringe and non fringe subtrees in conditional
SMART_READER_LITE
LIVE PREVIEW

Large fringe and non-fringe subtrees in conditional Galton-Watson - - PowerPoint PPT Presentation

Large fringe and non-fringe subtrees in conditional Galton-Watson trees Xing Shi Cai, Luc Devroye School of Computer Science McGill University Probabilistic Midwinter Meeting Ume University Jan 18, 2017 1 / 47 Outline Introduction 1


slide-1
SLIDE 1

1/47

Large fringe and non-fringe subtrees in conditional Galton-Watson trees

Xing Shi Cai, Luc Devroye

School of Computer Science McGill University

Probabilistic Midwinter Meeting Umeå University Jan 18, 2017

slide-2
SLIDE 2

2/47

Outline

1

Introduction

2

Large Fringe Subtrees

3

Large Fringe Subtrees—Applications

4

Large Non-Fringe Subtrees

slide-3
SLIDE 3

3/47

What is a tree

A tree is an acyclic graph. In this talk, trees are unlabeled, rooted, and ordered (plane trees).

slide-4
SLIDE 4

4/47

Galton-Watson trees

A Galton-Watson (GW) tree Tgw starts with a single node. Each node in Tgw chooses a random number of child nodes independently from the same distribution ξ. Introduced by Bienaym´ e, 1845.

3 3 3 2 3 2 1

.....

0 0 0

Note We will always assume that Eξ = 1 and Var (ξ) ∈ (0, ∞).

slide-5
SLIDE 5

5/47

Conditional Galton-Watson trees

A conditional GW Tree Tgw

n

is Tgw restricted to |Tgw| = n. So P

  • Tgw

n

= T

  • = P {Tgw = T | |Tgw| = n} .

It covers many uniform random tree models:

full binary trees binary trees d-ary trees Motzkin trees Plane trees Cayley trees

slide-6
SLIDE 6

6/47

Example of conditional Galton-Watson trees

Let P {ξ = i} = 1/2i+1. In other words, ξ L = Ge(1/2). Tgw

n

is uniformly distributed among all trees of size n.

P {Tgw = T} = 2−7 for T ∈

slide-7
SLIDE 7

7/47

Fringe subtrees

For a node v of a tree T, the fringe subtree Tv contains v and all its decedents. It is what normally called a “subtree”.

slide-8
SLIDE 8

8/47

Fringe subtree count

Let NT(Tgw

n ) be the number of fringe subtrees of shape T

in Tgw

n .

slide-9
SLIDE 9

9/47

Fringe subtree count: bigger example

In the next example, NT(Tgw

n )

n = 15 120 = 1 8 = π(T) ≡ P {Tgw = T} . Is this just a coincidence?

slide-10
SLIDE 10

10/47

What is known

For large n, fringe subtrees in Tgw

n

behave like independent copies of Tgw. Take a uniform random fringe subtree of Tgw

n , the

probability to get T is about π(T) ≡ P {Tgw = T}. So NT(Tgw

n ) ≈ Bi(n, π(T)).

slide-11
SLIDE 11

11/47

What is known cont.

Theorem Aldous (1991) (Law of large number) As n → ∞, NT(Tgw

n )

n

p

→ π(T). Theorem Janson (2016) (Central limit theorm) As n → ∞, NT(Tgw

n ) − nπ(T)

γ √n

d

→ N(0, 1), where γ is a constant.

slide-12
SLIDE 12

12/47

What do we want to know

What if the T in NT(Tgw

n ) changes with n?

The height of the largest complete r-ary fringe subtree. The largest k such that Tgw

n

contains all trees of size k as fringe subtree. What about non-fringe subtrees?

slide-13
SLIDE 13

13/47

Outline

1

Introduction

2

Large Fringe Subtrees

3

Large Fringe Subtrees—Applications

4

Large Non-Fringe Subtrees

slide-14
SLIDE 14

14/47

Large fringe subtrees

If |Tn| → ∞, then π(Tn) ≡ P {Tgw = Tn} → 0. Then we should have NTn(Tgw

n ) ≈ Bi(n, π(Tn)) ≈ Po(nπ(Tn)).

Theorem 1.2 Let kn = o(n) and kn → ∞. Then lim

n→∞

sup

T:|T|=kn

dTV (NT(Tgw

n ), Po(nπ(T))) = 0.

slide-15
SLIDE 15

15/47

Large fringe subtrees cont.

Theorem 1.2 cont. So letting (Tn)n1 be a sequence of trees with |Tn| = kn, we have:

1 If nπ(Tn) → 0, then NTn(Tgw n ) = 0 whp. 2 If nπ(Tn) → µ ∈ (0, ∞), then NTn(Tgw n ) d

→ Po(µ).

3 If nπ(Tn) → ∞, then

NTn(Tgw

n ) − nπ(Tn)

  • nπ(Tn)

d

→ N(0, 1).

slide-16
SLIDE 16

16/47

The degree sequence

The degree of a node is the number of its children. The degree sequence of a tree, is the list of degrees of its nodes in Depth-First-Search order. We can count fringe subtree through degree sequence. 1 2 3 4 5 6 7 1 2 Degree sequence: T1 T2 (2, 1, 0, 3, 0, 0, 0) (1, 0)

slide-17
SLIDE 17

17/47

Count fringe subtrees through the degree sequence

Let (ξn

1 , . . . , ξn n) be the degree sequence of Tgw n .

Let (d1, . . . , d|T|) be the degree sequence of T. Then NT(Tgw

n ) can be write as

NT(Tgw

n ) = n

  • j=1

Ij ≡

n

  • j=1

1

(ξn

j ,...,ξn j+|T|−1)=(d1,...,d|T|)

.

slide-18
SLIDE 18

18/47

Why fringe subtrees are like unconditional Galton-Watson trees

When n is large, ξn

1 , . . . , ξn n are close to ξ1, . . . , ξn (n

independent copies of ξ). Thus P

  • Ij = 1
  • = P
  • ∩|T|

i=1

  • ξn

j+i−1 = di

|T|

  • i=1

P {ξi = di} = P {Tgw = T} ≡ π(T). So I1, . . . , In are close to iid Bernoulli π(T). This is why NT(Tgw

n ) = n j=1 Ij ≈ Bi(n, π(T)) ≈ Po(nπ(T)).

slide-19
SLIDE 19

19/47

The exchangeable pair method

The proof of Theorem 1.2 uses the exchangeable pair method (Ross (2011, thm. 4.37)). It is a variation of Stein’s method for Poisson distribution. Example

Let X1, . . . , Xn and Y1, . . . , Yn be iid Be(p). Let W = X1 + · · · + Xn. Let W ′ = W − XZ + YZ where Z

L

= Unif({1, . . . , n}). We have an exchange pair — (W, W ′)

L

= (W ′, W). Compute P {W ′ = W − 1 | X1, . . . , Xn} , P {W ′ = W + 1 | X1, . . . , Xn} . Then the method says dTV (W, Po(EW)) p.

slide-20
SLIDE 20

20/47

Subtree replacing – the naive way

Recall NT(Tgw

n ) = n i=1 Ii.

What if we do the same thing for NT(Tgw

n )?

Let ¯ N = NT(Tgw

n ) − IZ + I′ Z with I′ Z L

= IZ. Is ( ¯ N, NT(Tgw

n )) an exchangeable pair?

slide-21
SLIDE 21

21/47

Subtree replacing – the proper way

Choose a fringe subtree of Tgw

n

uniformly at random.

If its size is not the same as T, do nothing. Otherwise, replace it with Tgw

|T| .

Let ¯ N be the number of T in the new tree. Then (NT(Tgw

n ), ¯

N) is an exchangeable pair.

T

slide-22
SLIDE 22

22/47

Upper bound of the total variation distance

Let Tk be the set of all trees of size k. Let S ⊆ Tk. Let NS(Tgw

n ) be the number of fringe subtrees that belongs

to S. Let π(S) ≡ P {Tgw ∈ S}. So NT(Tgw

n ) = N{T}(Tgw n ).

Lemma 4.1 Let k = kn = o(n) and k → ∞. We have sup

S⊆Tk

dTV

  • NS(Tgw

n ), Po(nπ(S))

  • π(S)/π(Tk) +
  • π(S)/π(Tk)

1 + o

  • k−3/2

+ O k1/4 √n

  • .
slide-23
SLIDE 23

23/47

Large fringe subtrees count—set version

Theorem 1.3 Let Tk be the set of trees of size k. Let kn = o(n) and kn → ∞. Let (Sn)n1 be a sequence with Sn ⊆ Tkn. We have:

1 If nπ(Sn) → 0, then NSn(Tgw n ) = 0 whp. 2 If nπ(Sn) → µ ∈ (0, ∞), then NSn(Tgw n ) d

→ Po(µ).

3 If nπ(Sn) → ∞, then

NSn(Tgw

n ) − nπ(Sn)

  • nπ(Sn)

d

→ N(0, 1).

4 If π(Sn)/π(Tkn) → 0, then

lim

n→∞ dTV (NSn(Tgw n ), Po(nπ(Sn))) = 0.

slide-24
SLIDE 24

24/47

Outline

1

Introduction

2

Large Fringe Subtrees

3

Large Fringe Subtrees—Applications

4

Large Non-Fringe Subtrees

slide-25
SLIDE 25

25/47

Application 1—largest complete r-ary fringe subtree

Let T r-ary

h

be a complete r-ary tree of height h.

slide-26
SLIDE 26

26/47

Application 1—largest complete r-ary fringe subtree

Lemma 5.2 & 5.3 Let Hn,r be the height of the largest complete r-ary fringe subtree in Tgw

n . Then for r 2,

Hn,r − logr log n

p

→ − αr, where αr is a constant. And Hn,1 log(1/P {ξ = 1}) log n

p

→ 1. Method: Find the maximum h such that nπ(T r-ary

h

) → ∞. Then apply Theorem 1.2.

slide-27
SLIDE 27

27/47

Application 2—existence of all possible subtrees

Let Kn be the maximum k such that Tgw

n

contains all trees

  • f size k as fringe subtree.
slide-28
SLIDE 28

28/47

The coupon collector problem

Original version There are n types of coupons. Each time we draw one type of coupon uniformly at random. How many draws do we need to collect all n types? Generalized version There are n types of coupons. Each time we draw a coupon, we get type i with probability pi. How many draws do we need to collect all n types?

slide-29
SLIDE 29

29/47

The coupon collector problem: the answer

Lemma 5.1 (Generalized coupon collector) Assume X takes values in {1, . . . , n}. Let pi ≡ P {X = i}. Let X1, X2, . . . be i.i.d. copies of X. Let N ≡ inf{i 1 : |{X1, X2, . . . , Xi}| = n}. Let m be a positive integers. We have 1 −

n

  • i=1

(1 − pi)m P {N m} 1 n

i=1(1 − pi)m .

If pi = 1/n, then N = n log(n) + op(1).

slide-30
SLIDE 30

30/47

Connection to our problem

Draw independent copies Tgw

k

until every tree of size k has appeared. Let Mk be the number of draws. NTk(Tgw

n ) ≈ nπ(Tk).

So if nπ(Tk) > Mk, then probably we have all trees of size k as fringe subtree, otherwise we do not. This is a coupon collector problem!

slide-31
SLIDE 31

31/47

The least possible tree

Among all coupons, there is one that is least likely to appear. If we get this one, we are likely to have all coupons. Let T min

k

be the least possible fringe subtree of size k. Mk depends on pmin

k

≡ P

  • Tgw = T min

k

  • .

Lemma If npmin

k

→ 0, then T min

k

does not appear. If npmin

k

/k → ∞, then all possible subtrees of size k appear.

slide-32
SLIDE 32

32/47

What can we say about the least possible subtree?

pmin

k

certainly depends on ξ. But there is a small surprise. Theorem 5.2 We have (pmin

k

)1/k → L as k → ∞, where 0 L < 1 is a constant defined as L ≡ inf

i1

  • P {ξ = 0}

P {ξ = i} P {ξ = 0} 1/i .

slide-33
SLIDE 33

33/47

Threshold of existence of all possible subtrees

By theorem 5.2, if L > 0, then log(1/pmin

k

) ∼ k log(1/L). Kn = log1/L n + op(1) in this case. Theorem 5.1 Assume that as k → ∞, log(1/pmin

k

) ∼ γkα(log k)β, where α 1, β 0, γ > 0 are constants. Then Kn (log n/(log log n)β)1/α

p

→ αβ γ 1/α .

slide-34
SLIDE 34

34/47

Applications

GW Tree ξ log(1/pmin

k

) Kn Full binary trees 2 × Be(1/2) k log 2 log2 n Motzkin trees Unif({0, 1, 2}) k log 3 log3 n Binary trees Bi(2, 1/2) k log 4 log4 n d-ary trees Bi(d, 1/d) k log cd logcd n Plane trees Ge(1/2) k log 4 log4 n Cayley trees Po(1) k log k

log n log log n

cd is constant. Cayley tree is different because it has L = 0.

slide-35
SLIDE 35

35/47

Outline

1

Introduction

2

Large Fringe Subtrees

3

Large Fringe Subtrees—Applications

4

Large Non-Fringe Subtrees

slide-36
SLIDE 36

36/47

Non-fringe subtrees

Take a fringe subtree Tv. Replace some (or none) of Tv’s own fringe subtrees with leaves. The result is a called a non-fringe subtree at v. Not a non-fringe subtree !

slide-37
SLIDE 37

37/47

Non-fringe subtree count

Let Nnf

T (Tgw n ) be the number of non-fringe subtrees of

shape T in Tgw

n .

slide-38
SLIDE 38

38/47

Large Non-fringe subtree Count

Let πnf(T) be the prob. that Tgw has T as a non-fringe subtree at its root. We should have Nnf

T (Tgw n ) ≈ Bi(n, πnf(T)).

Theorem 1.4 Let Tn be a sequence of trees with |Tn| = o(n). We have

1 If nπnf(Tn) → 0, then Nnf Tn(Tgw n ) = 0 whp. 2 If nπnf(Tn) → ∞, then

Nnf

Tn(Tgw n )

nπnf(Tn)

p

→ 1.

slide-39
SLIDE 39

39/47

Proof by computing first and second moments

Lemma 6.9 & 6.10 Assume that |Tn| = o(n) and nπnf(Tn) → ∞. We have

1 E

  • Nnf

Tn(Tgw n )

  • = (1 + o(1))nπnf(Tn).

2 Var

  • Nnf

Tn(Tgw n )

  • = o(nπnf(Tn))2.

So Theorem 1.4 follows by Chebyshev’s inequality.

slide-40
SLIDE 40

40/47

Difference between fringe and non-fringe subtrees

Non-fringe subtrees can overlap. So it is more difficult to compute the second moment.

slide-41
SLIDE 41

41/47

Glue two trees

Let {T ⊞ T} be the trees that are two of T glued together.

slide-42
SLIDE 42

42/47

The second factorial moment

Lemma 6.8 Assuming that |T| = o(n), we have E

  • Nnf

T (Tgw n )(Nnf T (Tgw n ) − 1)

(nπnf(T))2 + 2n

  • T ′∈{T⊞T}

πnf(T ′) If the second term is o(nπnf(T))2 then we are done. Large T ′ ∈ {T ⊞ T} should not be a problem. And there cannot be many small T ′ (with |T ′| < 3/2|T|).

slide-43
SLIDE 43

43/47

Application 1—largest complete r-ary non-fringe subtrees

Lemma 6.12 & 6.13 Let ¯ Hn,r be the height of the largest complete r-ary non-fringe subtree in Tgw

n . Then for r 2,

¯ Hn,r − logr log n

p

→ − α′

r.

And ¯ Hn,1 log(1/P {ξ = 1}) log n

p

→ 1. Proof: Same as for fringe version.

slide-44
SLIDE 44

44/47

Application 2—maximum degree

A node of degree d can be seen as a non-fringe subtree that consists of the root and d-leaves. So Theorem 1.4 implies: Theorem Meir and Moon (1991) Assume that as k → ∞, 1 P {ξ = k}1/k → ρ > 1. Let Yn be the maximum degree in Tgw

n , then

Yn log n

p

→ 1 log ρ.

slide-45
SLIDE 45

45/47

Open questions

For fringe subtrees, does dTV

  • NTk(Tgw

n ), Po(nπ(Tk)

  • → 0,

as k → ∞? For non-fringe subtrees

A central limit theorem? What is the total number of non-fringe subtrees in Tgw

n ?

slide-46
SLIDE 46

46/47

Bibliography

  • D. Aldous, “Asymptotic fringe distributions for general families of random

trees,” The Annals of Applied Probability, vol. 1, no. 2, pp. 228–266, 1991.

  • I. J. Bienaym´

e, “De la loi de multiplication et de la dur´ ee des familles,” Soci´ et´ e Philomatique Paris, 1845, Reprinted in Kendall (1975).

  • X. S. Cai, “A study of large fringe and non-fringe subtrees in conditional

Galton-Watson trees,” PhD thesis, McGill University, Aug. 2016.

  • X. S. Cai and L. Devroye, “A study of large fringe and non-fringe subtrees in

conditional Galton-Watson trees,” Latin American Journal of Probability and Mathematical Statistics, 2017, To appear. arXiv: 1602.03850 [math.PR].

  • S. Janson, “Asymptotic normality of fringe subtrees and additive functionals in

conditioned Galton-Watson trees,” Random Structures and Algorithms, vol. 48, no. 1, pp. 57–101, 2016.

  • A. Meir and J. W. Moon, “On nodes of large out-degree in random trees,”

Congressus Numerantium, vol. 82, pp. 3–13, 1991.

  • N. Ross, “Fundamentals of Stein’s method,” Probability Surveys, vol. 8,
  • pp. 210–293, 2011.
slide-47
SLIDE 47

47/47

My coauthor