CS5314 Randomized Algorithms Lecture 14: Balls, Bins, Random Graphs - PowerPoint PPT Presentation

CS5314 Randomized Algorithms Lecture 14: Balls, Bins, Random Graphs (Poisson Approximation) 1

Objectives • Poisson Approximation for Balls-and-Bins : to approximate # balls in each bin as independent Possion RV with  = m/n • Revisit Coupon Collector 2

Poisson Approximation • Suppose we throw m balls into n bins independently and uniformly at random • From previous lecture, we observe that: # balls in a particular bin  Poisson RV with  = m/n • How about distribution of balls in all n bins ? 3

Poisson Approximation Question: Will distribution of n balls be the same as n independent Poisson RVs with mean m/n? Ans. No ! For instance, total # of balls is always exactly m, but sum of n independent Poisson RVs can be any value The difference is because of dependency ! 4

Poisson Approximation • Though “ n independent Poisson RVs ” do not have the same distribution as “ m balls into n bins ” we can show that they are related, so that we can use the “ Poisson Case ” to approximate the “ Exact Case ” Hopefully, the approximation will be useful … 5

Poisson Approximation Formally, we define X 1 (m) , X 2 (m) , … , X n (m) (m) = # balls in Bin j where X j (in Exact Case) Y 1 (m) , Y 2 (m) , … , Y n (m) which are n independent Poisson RVs with parameter m/n (in Poisson Case) 6

When Two Distributions Meet Theorem: Suppose  j=1 to n Y j (m) = k. Under this condition, the distribution of (Y 1 (m) , Y 2 (m) , … , Y n (m) ) is exactly the same as the distribution of (X 1 (k) , X 2 (k) , … , X n (k) ) regardless of the value of m or k How to prove? Throwing k balls in total 7

Proof • Let k 1 , k 2 , … , k n be non-negative integers whose sum is k • When throwing k balls into n bins, Pr ( (X 1 (k) ) = (k 1 , … ,k n ) ) (k) , … , X n k ! = k 1 ! k 2 !  k n ! n k 8

Proof Next, (m) ) = (k 1 , … ,k n ) |  j Y j (m) = k ) Pr ( (Y 1 (m) , … , Y n (m) = k 1 ) \  \ (Y n (m) = k n )) Pr((Y 1 = … (why?) (m) = k) Pr(  j Y j Question: What is this probability?? 9

Proof First, (m) = k j ) = e -m/n (m/n) kj / k j ! Pr(Y j (m) are independent, so Since Y 1 (m) , … , Y n (m) = k 1 ) \  \ (Y n (m) = k n ) ) Pr ( (Y 1 =  j e -m/n (m/n) kj / k j ! e -m m k = k 1 ! k 2 !  k n ! n k 10

Proof On the other hand, (m) = k) = e -m m k / k! Pr(  j Y j … [why??] So combining the previous results, (m) ) = (k 1 , … ,k n ) |  j Y j (m) = k ) Pr ( (Y 1 (m) , … , Y n = Pr ( (X 1 (k) ) = (k 1 , … , k n ) ) (k) , … , X n  this completes the proof 11

A Stronger Result • With the previous result between exact case and Poisson case, we can show a stronger result … • Before we proceed, let us obtain a useful upper bound for n ! 12

Upper Bound for n! n!  en 1/2 (n/e) n Lemma: Proof: Since ln x is a concave function, j ln x dx  ( ln (j-1) + ln j ) / 2 … (why?) j-1 n ln x dx  ln (n!) - (ln n)/2  … (why?) 1  n ln n – n + 1  ln (n!) - (ln n)/2  Lemma follows by exponentiation 13

Expectation of Loads • We now show a relationship between the expectation of any non-negative function of the loads in the two cases : Theorem: Let f(x 1 , … , x n ) be a non-negative function. Then, E[f(X 1 (m) , … , X n (m) )]  e m E[f(Y 1 (m) , … , Y n (m) )] How to prove? 14

Proof E[f(Y 1 (m) , … , Y n (m) )] =  k E[f(Y 1 (m) ) |  j Y j(m) =k ] Pr(  j Y j(m) =k ) (m) , … , Y n (m) ) |  j Y j(m) =m ] Pr(  j Y j(m) =m )  E[f(Y 1 (m) , … , Y n (m) )] Pr(  j Y j = E[f(X 1 (m) , … , X n (m) =m) … (why?) 15

Proof Next, using upper bound of m! , (m) =m) = e -m m m /m! Pr(  j Y j … (why?)  1 / (em 1/2 ) Thus, E[f(Y 1 (m) , … , Y n (m) )]  E[f(X 1 (m) , … , X n (m) )] / (em 1/2 )  This completes the proof 16

Remark • The previous theorem holds for any non- negative function f • E.g., if f = MAX, then we can relate the expected maximum load in the two cases • E.g., if f = an indicator for an event Z, then the theorem gives the relationship of Pr(Z occurs) in the two cases This latter gives the following corollary: 17

Bounding Exact Case Corollary: Referring to the scenario of throwing m balls into n bins. Any event Z that takes place with probability p in the Poisson case implies: Z takes place with probability at most em 1/2 p in the exact case How to prove? 18

Bounding Exact Case Proof: Let f be the indicator for event Z Then, Pr(Z occurs in exact case) = E[f(X 1 (m) , … , X n (m) )]  em 1/2 E[f(Y 1 (m) , … , Y n (m) )] = em 1/2 Pr(Z occurs in Poisson case) = em 1/2 p 19

An Even Stronger Result If we know more about f, we can obtain an even stronger bound: Theorem: Let f(x 1 , … , x n ) be a non-negative function such that E[f(X 1 (m) , … , X n (m) )] is monotonically increasing in m. Then, E[f(X 1 (m) , … , X n (m) )]  2 E[f(Y 1 (m) , … , Y n (m) )] How to prove? (Ex. 5.13, 5.14) 20

Bounding Exact Case (2) Corollary: Let Z be an event whose probability is monotonically increasing in # balls. If Z has probability p in the Poisson case,  Z has probability at most 2p in the exact case 21

Maximum Load (Revisited) • Some time ago, we have shown that for sufficiently large n, if we throw n balls into n bins, then w.h.p. : Maximum load  3 ln n / ln ln n • The proof is simply based on counting and union bound • Let ’ s see how the latest result can help in giving a lower bound … 22

Maximum Load (Revisited) Lemma: Suppose n balls are thrown to n bins, independently and uniformly at random. Then w.h.p. (at least 1-1/n) : Maximum load  ln n / ln ln n How to prove? Let ’ s bound the probability for the Poisson case, and then … 23

Proof Let M = ln n / ln ln n In the Poisson case, Pr(# of balls in Bin 1  M)  Pr(# of balls in Bin 1 = M) = e -1 (1) M / M! = 1/(eM!)  In the Poisson case, Pr(Max-Load  M)  (1 - 1/(eM!)) n  exp{ -n/(eM!) } 24

Proof Next, we simplify the bound by showing: - n / (eM!) - c ln n for some c  Recall that M!  eM 1/2 (M/e) M  M (M/e) M [for large n]  ln M!  ln M + M ln M – M  ln ln n + ln n – M  ln n – ln ln n – ln (2e) [for large n] 25

Proof Thus, M!  n / (2e ln n) [for large n]  exp{ - n / (eM!) }  exp{ -2ln n } = 1/n 2 So, in the Poisson case Pr(Max-Load  M)  1/n 2  In the Exact case Pr(Max-Load  M)  en 1/2 (1/n 2 )  1/n 26

Coupon Collector (Revisited) • Previously we have shown that if we want to collect a set of n coupons, the expected number of coupons we buy is n H(n)  n ln n • Suppose we have bought n ln n + cn coupons already. What is the probability that we have obtained a full collection ? 27

Coupon Collector (Revisited) • After buying n ln n + cn coupons: Pr(not having i th coupon) = (1 - 1/n) n ln n + cn  e – (1/n)(n ln n + cn) = e – c / n • After buying n ln n + cn coupons: Pr(not having a full collection)  e – c  Pr(having a full collection)  1 - e – c 28

Coupon Collector (Revisited) • Recently, we have seen that Chernoff bound usually gives a much tighter result Question: Can we apply Chernoff bound to get an even better result ? 29

Coupon Collector (Revisited) Theorem: Let X be the number of coupons we buy before getting one card of each n types of coupons. Then, for any c, lim n  1 Pr ( X  n ln n + cn ) = 1 - e -e-c When c = -4, 1 - e -e-c  1 Remark: When c = 4, 1 - e -e-c  0.02  For large n, #coupons is between n ln n  4n is ~ 98% !!!  This is an example of sharp threshold, where the random variable ’ s distribution is concentrated around its mean 30

Proof • We can consider the coupon collector ’ s problem as a balls-and-bins problem (What are the balls? How many bins?) • We shall use Poisson approximation so that intermediate steps will be easier • Suppose # balls in each bin is a Poisson RV with mean ln n + c, so that the expected total # balls is m = n ln n + cn 31

Proof Then, in the Poisson case, Pr(Bin 1 is empty) = e -(ln n + c) = e -c /n Let NE be the event that no bin is empty in Poisson case So, Pr(NE) = (1- e -c /n) n = e -e-c … [when n  1 ] 32

Two Facts Let Y be # balls thrown in the Poisson case Let r = 2m ln m We claim that as n  1 , 1. Pr(|Y-m|  r) = 0 (i.e., Y is very close to mean) Pr(NE | |Y-m|  r) = Pr(NE | Y=m) 2. In case Y is very close to mean, we can just assume Y = m when computing Pr(NE) Suppose our claim is true … 33

Consequence of Two Facts As n  1 , e -e-c = Pr(NE) = Pr(NE | |Y-m|  r) Pr(|Y-m|  r) + Pr(NE | |Y-m|  r) Pr(|Y-m|  r) = Pr(NE | |Y-m|  r) 0 + Pr(NE | Y=m) 1 = Pr(NE | Y=m) = Pr(no bin is empty in Exact Case when m balls are thrown ) 34

Consequence of Two Facts  Pr(some bin is still empty in Exact Case when m balls are thrown ) = 1 - e -e-c Recall: X = # balls thrown in the exact case until every bin is non-empty So X  m occurs if and only if some bin is still empty when m balls are thrown Thus, Pr ( X  m ) = 1 - e -e-c 35

Fact 1: Y is very close to mean Recall: n = number of bins Y = # balls thrown in Poisson case m = n ln n + cn = E[Y] r = (2m ln m) 1/2 Fact 1: In the Poisson case, as n  1 , Pr(|Y-m|  r) = 0 36

CS5314 Randomized Algorithms Lecture 14: Balls, Bins, Random Graphs - PowerPoint PPT Presentation

CS5314 Randomized Algorithms Lecture 14: Balls, Bins, Random Graphs (Poisson Approximation) 1 Objectives Poisson Approximation for Balls-and-Bins : to approximate # balls in each bin as independent Possion RV with = m/n Revisit

Randomized Algorithms Randomized Algorithms Two Types of Randomized Algorithms Two Types of

CSC373 Week 11: Randomized Algorithms 373F19 - Nisarg Shah & Karan Singh 1 Randomized

Randomized algorithms Quick-sort Closest pair of points Inge Li Grtz 1 2

Probability and Delaunay triangulations 1 Randomized algorithms for Delaunay triangulations

Randomized Algorithms Lecture 3: Occupancy, Moments and deviations, Randomized selection

Randomized Algorithms, Quicksort and Randomized Selection Carola Wenk Slides courtesy of Charles

Randomized Algorithms Randomized Algorithms Markov Chains and Random Walks Markov Chains and

Randomized Algorithms Randomized Algorithms The Chernoff bound The Chernoff bound Speaker:

ECS231 Low-rank approximation revisited (Introduction to Randomized Algorithms) May 23, 2019

Mina Kwon - 2019.10.15 - Randomized Clinical Trial ? Randomized Clinical Trial Purpose:

Randomized Algorithms Lecture 9 September 24, 2013 Sariel (UIUC) CS573 1 Fall 2013 1 / 32

Introduction to Randomized Algorithms: QuickSort Lecture 2 August 27, 2020 Chandra (UIUC)

SYMMETRIC ENCRYPTION K is randomized E can be randomized or stateful D is deterministic

Randomized algorithms Inge Li Grtz Thank you to Kevin Wayne for inspiration to slides

CS 574: Randomized Algorithms Lecture 5. Coupon Collector Problems September 8, 2015 Lecture 5.

Randomized Algorithms Lecture 7: Random Walks - II Sotiris Nikoletseas Associate

Corporate Presentation Jul 2019 Sales Performance 1 st Half 2019 Overall: 651,680 units -15% Y

Experience (USE) APARTMENT OWNERS NETWORK MEETING VOICE working for a more sustainable

presented by Dr. Ni-Bin Chang NASA Grant No. NAG13-03008, Sponsored by Glenn Research Center,

ST Engineering Aerospace North American Operations 2019 VT Volant Aerospace, LLC ST Engineering

Environment and Regeneration New Waste Collection Service Autumn 2018 MERTON COUNCIL Autumn

Sonja Cherry-Paul & Dana Johansen What is the Digital Reading Workshop? - Readers

Tom McHugh Dun Laoghaire Rathdown County Council Deputy Chief Executive, Director of Municipal

City of Toronto Solid Waste Management Services Property Management Expo November 30, 2016 Jim

CS5314 Randomized Algorithms Lecture 14: Balls, Bins, Random Graphs - PowerPoint PPT Presentation

CS5314 Randomized Algorithms Lecture 14: Balls, Bins, Random Graphs (Poisson Approximation) 1 Objectives Poisson Approximation for Balls-and-Bins : to approximate # balls in each bin as independent Possion RV with = m/n Revisit

Randomized Algorithms Randomized Algorithms Two Types of Randomized Algorithms Two Types of

CSC373 Week 11: Randomized Algorithms 373F19 - Nisarg Shah &amp; Karan Singh 1 Randomized

Randomized algorithms Quick-sort Closest pair of points Inge Li Grtz 1 2

Probability and Delaunay triangulations 1 Randomized algorithms for Delaunay triangulations

Randomized Algorithms Lecture 3: Occupancy, Moments and deviations, Randomized selection

Randomized Algorithms, Quicksort and Randomized Selection Carola Wenk Slides courtesy of Charles

Randomized Algorithms Randomized Algorithms Markov Chains and Random Walks Markov Chains and

Randomized Algorithms Randomized Algorithms The Chernoff bound The Chernoff bound Speaker:

ECS231 Low-rank approximation revisited (Introduction to Randomized Algorithms) May 23, 2019

Mina Kwon - 2019.10.15 - Randomized Clinical Trial ? Randomized Clinical Trial Purpose:

Randomized Algorithms Lecture 9 September 24, 2013 Sariel (UIUC) CS573 1 Fall 2013 1 / 32

Introduction to Randomized Algorithms: QuickSort Lecture 2 August 27, 2020 Chandra (UIUC)

SYMMETRIC ENCRYPTION K is randomized E can be randomized or stateful D is deterministic

Randomized algorithms Inge Li Grtz Thank you to Kevin Wayne for inspiration to slides

CS 574: Randomized Algorithms Lecture 5. Coupon Collector Problems September 8, 2015 Lecture 5.

Randomized Algorithms Lecture 7: Random Walks - II Sotiris Nikoletseas Associate

Corporate Presentation Jul 2019 Sales Performance 1 st Half 2019 Overall: 651,680 units -15% Y

Experience (USE) APARTMENT OWNERS NETWORK MEETING VOICE working for a more sustainable

presented by Dr. Ni-Bin Chang NASA Grant No. NAG13-03008, Sponsored by Glenn Research Center,

ST Engineering Aerospace North American Operations 2019 VT Volant Aerospace, LLC ST Engineering

Environment and Regeneration New Waste Collection Service Autumn 2018 MERTON COUNCIL Autumn

Sonja Cherry-Paul &amp; Dana Johansen What is the Digital Reading Workshop? - Readers

Tom McHugh Dun Laoghaire Rathdown County Council Deputy Chief Executive, Director of Municipal

City of Toronto Solid Waste Management Services Property Management Expo November 30, 2016 Jim

CSC373 Week 11: Randomized Algorithms 373F19 - Nisarg Shah & Karan Singh 1 Randomized

Sonja Cherry-Paul & Dana Johansen What is the Digital Reading Workshop? - Readers