0 7 0
play

0%7*0 Just one of many ways to measure how close two I have an - PowerPoint PPT Presentation

Disclaimer: What We Know... Every irreducible, aperiodic Markov chain has a unique stationary distribution. Mixing Time ' n' Much handwaving today!! HIFI IT = m (1) Goal is to get a high level intuition / picture for CS 70, Summer


  1. ↳ Disclaimer: What We Know... Every irreducible, aperiodic Markov chain has a unique stationary distribution. Mixing Time ' n' Much handwaving today!! HIFI IT = π m (1) Goal is to get a high level intuition / picture for CS 70, Summer 2019 the concept of mixing time and applications   m 0 . 8 0 . 2 0 π m = π 0 P m = π 0 0 0 . 3 0 . 7 .   Emphasis is on heuristics rather than rigor 0 . 6 0 . 4 0 π m (2) Bonus Lecture, 8/9/19 π m (3) m Q: How long does it take to get close to the stationary distribution? 1 / 17 2 / 17 3 / 17 a Total Variation Distance Mixing Time: Definition Complete Graph With Loops 0%7*0 Just one of many ways to measure how close two I have an irreducible, aperiodic Markov chain. Mixing time analysis sometimes direct! Notation: µ ( n ) is the distribution at time n , and π distributions are. a ① n } { 1,2 , States over . , Take a random walk on a complete graph with . . is its (unique) stationary distribution . → Let P 1 , P 2 be two PMFs. Their TV distance is: loops . What is the stationary distribution? - - Elif I want to keep running my chain until: kn ftnh : . In ] / Phil symbtanwtnfn F- ( UM , Tl ) Estates i d small positive . . E E ← vertices Tv number What does the transition matrix look like? time in stationary . 9 ÷ dist at - TIME ' up thing lighted - The mixing time t mix ( " ) is the first time this .int . , di St happens. starting . . case worst = IT . Mixing Time? Dependence on " ? (Omitted fact: The TV distance between µ ( n ) 1 for E all U l l . . and π decreases as n increases.) prob , at with high least 4 / 17 5 / 17 6 / 17

  2. I Random-To-Top Shuffling I Random-To-Top Shuffling II Random-To-Top Shuffling III con figs n ! S : All orderings of n cards in a deck. At each time, each card is labeled coupled or not. Time until all cards get coupled = time until Deck . number Transitions: Choose card randomly in the deck. 1 is fully random. Initially, all cards start uncoupled . Move it to the top. me all States here uniform Pick a random card C . is over Tl . 2nd number 1st 3rd Di ff erent strategy called coupling: T2 -13 couple couple couple - - In both decks, take card C and move it to the top. . assume 't const Geomfnnt ) ← Geomftn ) If C isn’t already coupled, mark it as coupled . . arena ; Hed " . What happens when we look at each deck collector coupon . - top - to A individually? LOOKS like random ! move T ECT .tTnT=nl0gN chosen , -1T , -1 I # . . . arbitrary " coupled ; start at v. a. r 0cnu④ . ④ once # labeled For all � : t mix ( " ) = " a IS it ?9eEuFo¥ has State we keep applying p , . " same the position both decks ! in at it dist Stays . . 7 / 17 8 / 17 9 / 17 Random Hypercube Walk I Random Hypercube Walk II Random Hypercube Walk III - length vertices n Take an n -dimensional hypercube. Each coordinate is labeled coupled or not. Use Coupon Collector again! bit strings Stationary distribution of hypercube walk: Initially all coordinates are uncoupled . In F- Cfn . In ] before all vertices as same Pick a random coordinate i . . . . same deg Try coupling again: . Flip a coin to set the i -th coordinate to 0 or 1. ④ Lazywatk vertex self loop Wp 's If i -th coordinate is uncoupled , set it to coupled. Every : transitions to up In Ef time . randomized ] all Nbr N log N = word iarbitraryvertexi → truly random : For all " , t mix ( " ) = bits all randomize 10 / 17 11 / 17 12 / 17

  3. Conditions for Fast Mixing? Bottlenecks How To Measure Conductance? Complete graph K n ? We use a measurement called the conductance Measuring conductance = looking at all subsets =L ± to quantify the notion of a bottleneck. of states with vol ( A ) ≤ 2. E A How many subsets, potentially? 0¥ middle Path on n vertices? Start in The conductance of a set A ⊆ S is: . Milk ; EA ,jEA NZ mixing time ow - • - Φ( A ) = Alternative: Get lower bound on Φ( M ) using - second largest eigenvalue of transition matrix. Dumbbell? jeep 'll ← VOILA ) Φ( M ) ≥ nz 2 mixing time The conductance of the chain M is: min ECA ) Φ( M ) = Eigenvalues are much faster to compute! Ast VOKAKE mgaehffengnere pink dot chance f- . from getting hmltgefdte 13 / 17 14 / 17 15 / 17 Markov Chain Monte Carlo Markov Chain Monte Carlo Monte Carlo: randomized algorithm where the Key idea: Design a Markov chain so that its output is allowed to be incorrect stationary follows the distribution that you want Use cases: to sample from. Run the chain, wait for it to mix. I sampling from complicated distributions I counting combinatorial objects CE ) ! + Runtime depends on... mix I Bayesian inference I statistical physics I volume estimation, integration 16 / 17 17 / 17

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend