CMU-Q 15-381
Lecture 15: Predictions in Markov Chains, Markov Decision Processes
Teacher: Gianni A. Di Caro
Making predictions: general two-state MC
§ Eigenvector matrix: $U = [u_1 \; u_2] = \begin{pmatrix} 1 & -\alpha \\ 1 & \beta \end{pmatrix}$, with $U^{-1} = \dfrac{1}{\alpha+\beta} \begin{pmatrix} \beta & \alpha \\ -1 & 1 \end{pmatrix}$
§ Diagonalization: $U^{-1} T U = \begin{pmatrix} 1 & 0 \\ 0 & 1-\alpha-\beta \end{pmatrix} = \begin{pmatrix} \lambda_1 & 0 \\ 0 & \lambda_2 \end{pmatrix} = \Lambda$, the eigenvalue matrix
§ Pre-multiplying both terms by $U$ and post-multiplying by $U^{-1}$: $T = U \Lambda U^{-1}$
§ $T^2 = (U\Lambda U^{-1})(U\Lambda U^{-1}) = (U\Lambda)(U^{-1}U)(\Lambda U^{-1}) = U \Lambda \Lambda U^{-1} = U \Lambda^2 U^{-1}$, with $\Lambda^2 = \begin{pmatrix} \lambda_1^2 & 0 \\ 0 & \lambda_2^2 \end{pmatrix}$
✓ $p^{(n)} = p^{(0)} T^n$: probability distribution over the states after $n$ steps, given the initial distribution
✓ $p_j^{(n)} = P(X_n = j)$: absolute probability of state $j$ at step $n$, given the initial distribution
§ How do we compute $T^n$? $T = \begin{pmatrix} 1-\alpha & \alpha \\ \beta & 1-\beta \end{pmatrix}$, $0 < \alpha, \beta < 1$
$T = \begin{pmatrix} 1-\alpha & \alpha \\ \beta & 1-\beta \end{pmatrix}$, $0 < \alpha, \beta < 1$
§ $T^n = U \Lambda^n U^{-1}$, with $\Lambda^n = \begin{pmatrix} \lambda_1^n & 0 \\ 0 & \lambda_2^n \end{pmatrix}$, $U = \begin{pmatrix} 1 & -\alpha \\ 1 & \beta \end{pmatrix}$, $U^{-1} = \dfrac{1}{\alpha+\beta} \begin{pmatrix} \beta & \alpha \\ -1 & 1 \end{pmatrix}$
§ $T^n = \cdots = \dfrac{1}{\alpha+\beta} \begin{pmatrix} \beta & \alpha \\ \beta & \alpha \end{pmatrix} + \dfrac{\lambda^n}{\alpha+\beta} \begin{pmatrix} \alpha & -\alpha \\ -\beta & \beta \end{pmatrix}$, where $\lambda = 1 - \alpha - \beta$
§ $\lambda^n \to 0$ as $n \to \infty$
§ $T^n \to \dfrac{1}{\alpha+\beta} \begin{pmatrix} \beta & \alpha \\ \beta & \alpha \end{pmatrix} = Q$, the matrix $T^n$ in the limit of large $n$
§ Probability distribution over the states after $n$ steps, given the initial distribution $p^{(0)}$:
$$p^{(n)} = p^{(0)} T^n = \begin{bmatrix} p_1^{(0)} & p_2^{(0)} \end{bmatrix} T^n \to \begin{bmatrix} p_1^{(0)} & p_2^{(0)} \end{bmatrix} Q = \frac{1}{\alpha+\beta} \begin{bmatrix} \beta p_1^{(0)} + \beta p_2^{(0)} & \;\; \alpha p_1^{(0)} + \alpha p_2^{(0)} \end{bmatrix} = \begin{bmatrix} \frac{\beta}{\alpha+\beta} & \frac{\alpha}{\alpha+\beta} \end{bmatrix}$$
as $n \to \infty$, and given that $p_1^{(0)} + p_2^{(0)} = 1$ (see the numerical check below)
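As a quick numerical check (not part of the original slides; a minimal NumPy sketch, with illustrative values for $\alpha$ and $\beta$), the closed form above can be compared against direct matrix powers:

```python
import numpy as np

# Illustrative sketch: verify T^n = U Lambda^n U^{-1} for the 2-state chain.
# alpha and beta are example values; any 0 < alpha, beta < 1 works.
alpha, beta = 0.3, 0.6
T = np.array([[1 - alpha, alpha],
              [beta,      1 - beta]])

lam = 1 - alpha - beta                      # second eigenvalue (lambda_1 = 1)
U = np.array([[1.0, -alpha],
              [1.0,  beta]])                # columns = eigenvectors
U_inv = np.array([[beta, alpha],
                  [-1.0, 1.0]]) / (alpha + beta)

n = 50
Tn_closed = U @ np.diag([1.0, lam**n]) @ U_inv
Tn_power = np.linalg.matrix_power(T, n)
print(np.allclose(Tn_closed, Tn_power))     # True

# As n grows, lam**n -> 0 and both rows of T^n approach the limit matrix Q
Q = np.array([[beta, alpha],
              [beta, alpha]]) / (alpha + beta)
print(np.allclose(Tn_power, Q))             # True for large n
```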
§ State distribution over the states after $n$ steps, given the initial distribution $p^{(0)}$:
$$\lim_{n\to\infty} p^{(0)} T^n = \lim_{n\to\infty} p^{(n)} = \begin{bmatrix} \frac{\beta}{\alpha+\beta} & \frac{\alpha}{\alpha+\beta} \end{bmatrix} = p$$
→ The chain has a limiting state probability distribution, denoted here as $p$
§ $p$ is independent of $p^{(0)}$
§ → $p$ is an invariant limiting distribution of the chain: the limit exists and it is invariant with respect to the initial distribution
§ The limiting distribution $p$ is also a stationary distribution: if the chain starts (or arrives) in $p$ as a state probability distribution, it stays in $p$ (i.e., the distribution becomes stationary, it won't change): $pT = p$
$$\begin{bmatrix} \frac{\beta}{\alpha+\beta} & \frac{\alpha}{\alpha+\beta} \end{bmatrix} \begin{pmatrix} 1-\alpha & \alpha \\ \beta & 1-\beta \end{pmatrix} = \begin{bmatrix} \frac{\beta(1-\alpha)+\alpha\beta}{\alpha+\beta} & \frac{\alpha\beta+\alpha(1-\beta)}{\alpha+\beta} \end{bmatrix} = \begin{bmatrix} \frac{\beta}{\alpha+\beta} & \frac{\alpha}{\alpha+\beta} \end{bmatrix}$$
§ To study the long-term behavior of a generic MC with one-step transition matrix $T$ and $m$ states, consider the limit of the $n$-step conditional transition probabilities, denoted by $Q$:
$$\lim_{n\to\infty} p_{ij}^{(n)} = \lim_{n\to\infty} P(X_n = j \mid X_0 = i) = Q_{ij}$$
Three different cases can arise from the limit:
1) A limiting distribution exists
2) Limiting but no invariant distribution
3) No limiting (but possibly stationary) distribution
$$\lim_{n\to\infty} T^n = \lim_{n\to\infty} \begin{pmatrix} p_{11}^{(n)} & p_{12}^{(n)} & \cdots & p_{1m}^{(n)} \\ p_{21}^{(n)} & p_{22}^{(n)} & \cdots & p_{2m}^{(n)} \\ \vdots & \vdots & \ddots & \vdots \\ p_{m1}^{(n)} & p_{m2}^{(n)} & \cdots & p_{mm}^{(n)} \end{pmatrix} = \begin{pmatrix} Q_{11} & Q_{12} & \cdots & Q_{1m} \\ Q_{21} & Q_{22} & \cdots & Q_{2m} \\ \vdots & \vdots & \ddots & \vdots \\ Q_{m1} & Q_{m2} & \cdots & Q_{mm} \end{pmatrix}$$
1. Limiting distribution: consider the case when, for all $i, j$, the limit converges to a value that does not depend on the starting state $i$, that is, $Q_{ij} = Q_j$, with $\sum_{j=1}^m Q_j = 1$:
$$\lim_{n\to\infty} T^n = \lim_{n\to\infty} \begin{pmatrix} p_{11}^{(n)} & p_{12}^{(n)} & \cdots & p_{1m}^{(n)} \\ p_{21}^{(n)} & p_{22}^{(n)} & \cdots & p_{2m}^{(n)} \\ \vdots & \vdots & \ddots & \vdots \\ p_{m1}^{(n)} & p_{m2}^{(n)} & \cdots & p_{mm}^{(n)} \end{pmatrix} = \begin{pmatrix} Q_1 & Q_2 & \cdots & Q_m \\ Q_1 & Q_2 & \cdots & Q_m \\ \vdots & \vdots & \ddots & \vdots \\ Q_1 & Q_2 & \cdots & Q_m \end{pmatrix}$$
$$\lim_{n\to\infty} p_{ij}^{(n)} = \lim_{n\to\infty} P(X_n = j \mid X_0 = i) = Q_j$$
→ The (unconditional) convergence values of the limits of the $n$-step conditional transition probabilities define the limiting distribution of the chain, which is invariant with respect to the initial conditions
§ After the process has been in operation for some long duration, the probability of finding it in state $j$ is $Q_j$, irrespective of the starting state:
$$\lim_{n\to\infty} p^{(0)} T^n = \begin{bmatrix} p_1^{(0)} & p_2^{(0)} & \cdots & p_m^{(0)} \end{bmatrix} \begin{pmatrix} Q_1 & Q_2 & \cdots & Q_m \\ Q_1 & Q_2 & \cdots & Q_m \\ \vdots & \vdots & \ddots & \vdots \\ Q_1 & Q_2 & \cdots & Q_m \end{pmatrix} = \begin{bmatrix} Q_1 \sum_{i=1}^m p_i^{(0)} & Q_2 \sum_{i=1}^m p_i^{(0)} & \cdots & Q_m \sum_{i=1}^m p_i^{(0)} \end{bmatrix} = \begin{bmatrix} Q_1 & Q_2 & \cdots & Q_m \end{bmatrix} = p$$
§ From $p^{(n)} = p^{(n-1)} T$, for $n \to \infty$ also $p^{(n)} = p^{(n-1)} = p$ → the limiting distribution is the solution of the fixed point equation $pT = p$
§ Because of the above equation, the limiting distribution is always also a stationary distribution: if the chain starts with, or arrives at any step $n$ to, a state probability distribution equal to $p$, it doesn't change anymore
§ $p = pT$ looks similar to an eigenvector equation $Av = \lambda v$, with eigenvalue $\lambda = 1$
§ Transposing both sides: $p^\top = (pT)^\top \Rightarrow T^\top p^\top = p^\top$, which is a "regular" (right) eigenvector equation
§ → The transposed transition matrix $T^\top$ has eigenvectors with eigenvalue 1 that are stationary distributions expressed as column vectors
§ Therefore, if the eigenvectors of the transposed transition matrix $T^\top$ are known, then so are the stationary distributions of the Markov chain. This can save a lot of computation, avoiding computing powers of $T$!
§ The stationary distribution is a left eigenvector (as opposed to the usual right eigenvectors) of the transition matrix: $p = pT$
§ Note: when there are multiple eigenvectors associated with an eigenvalue of value 1, each such eigenvector gives rise to an associated stationary distribution; this happens when the chain is reducible, i.e., has multiple communicating classes.
✓ Using $p = pT$ we can easily find the stationary distribution (assuming there is one, and independently from the limiting distribution) either:
✓ by solving the linear equation $p = pT$
✓ or by using the eigenvectors of the transposed transition matrix $T^\top$ (see the sketch below)
§ For instance, in the case of the general 2-state MC, let $p = [x \;\; 1-x]$; solving the matrix equation $pT = p$ gives $x(1-\alpha) + (1-x)\beta = x \Rightarrow x = \frac{\beta}{\alpha+\beta}$, so the stationary distribution is $p = \begin{bmatrix} \frac{\beta}{\alpha+\beta} & \frac{\alpha}{\alpha+\beta} \end{bmatrix}$
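A minimal NumPy sketch of this recipe (the parameter values are illustrative assumptions, not from the slides):

```python
import numpy as np

# Stationary distribution as the left eigenvector of T with eigenvalue 1,
# found from the (right) eigenvectors of T^T.
alpha, beta = 0.3, 0.6                  # example parameters
T = np.array([[1 - alpha, alpha],
              [beta,      1 - beta]])

eigvals, eigvecs = np.linalg.eig(T.T)   # T^T v = lambda v
idx = np.argmin(np.abs(eigvals - 1.0))  # locate the eigenvalue lambda = 1
p = np.real(eigvecs[:, idx])
p /= p.sum()                            # normalize to a probability vector

print(p)                                                # [0.6667 0.3333]
print([beta / (alpha + beta), alpha / (alpha + beta)])  # closed form, same
```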
2. Limiting but no invariant distribution: consider the case when, for all $i, j$, the limit converges to values $Q_{ij}$ that, for each $j$, depend on the initial state $i$, so that we cannot write $Q_{ij}$ as $Q_j$ as before; $\sum_{j=1}^m Q_{ij} = 1, \; \forall i$ must hold:
$$\lim_{n\to\infty} T^n = \lim_{n\to\infty} \begin{pmatrix} p_{11}^{(n)} & p_{12}^{(n)} & \cdots & p_{1m}^{(n)} \\ p_{21}^{(n)} & p_{22}^{(n)} & \cdots & p_{2m}^{(n)} \\ \vdots & \vdots & \ddots & \vdots \\ p_{m1}^{(n)} & p_{m2}^{(n)} & \cdots & p_{mm}^{(n)} \end{pmatrix} = \begin{pmatrix} Q_{11} & Q_{12} & \cdots & Q_{1m} \\ Q_{21} & Q_{22} & \cdots & Q_{2m} \\ \vdots & \vdots & \ddots & \vdots \\ Q_{m1} & Q_{m2} & \cdots & Q_{mm} \end{pmatrix}$$
$$\lim_{n\to\infty} p^{(0)} T^n = \begin{bmatrix} p_1^{(0)} & p_2^{(0)} & \cdots & p_m^{(0)} \end{bmatrix} \begin{pmatrix} Q_{11} & Q_{12} & \cdots & Q_{1m} \\ Q_{21} & Q_{22} & \cdots & Q_{2m} \\ \vdots & \vdots & \ddots & \vdots \\ Q_{m1} & Q_{m2} & \cdots & Q_{mm} \end{pmatrix}$$
→ Each different initial distribution $p^{(0)}$ defines a possibly different limiting (stationary) distribution
§ Example: $T = \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix} = I$, the 2-state MC in the boundary case $\alpha = \beta = 0$
§ $T^n = T$ for all $n$, so a limiting distribution does exist, but it always depends on $p^{(0)}$:
$$\begin{bmatrix} p_1^{(0)} & p_2^{(0)} \end{bmatrix} \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix} = \begin{bmatrix} p_1^{(0)} & p_2^{(0)} \end{bmatrix}$$
3. No limiting distribution: the limit doesn't converge to a value $Q_{ij}$ for all $i, j$; therefore a limiting distribution as defined doesn't exist.
§ $T = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}$; in this case $T^{2n} = \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}$, $T^{2n+1} = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix} = T$
§ → the succession of $T$'s powers oscillates between the two matrices; the MC is periodic with period 2
§ However, a stationary distribution can still exist
§ Limiting ⇒ stationary, but the opposite doesn't necessarily hold
§ $T = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}$, with $T^{2n} = \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}$, $T^{2n+1} = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix} = T$
§ The solution of the fixed point equation $pT = p$:
$$\begin{bmatrix} a & 1-a \end{bmatrix} \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix} = \begin{bmatrix} a & 1-a \end{bmatrix} \;\to\; \begin{bmatrix} 1-a & a \end{bmatrix} = \begin{bmatrix} a & 1-a \end{bmatrix}$$
The resulting equation system, $1 - a = a$ and $a = 1 - a$, is satisfied by $a = 0.5$ → $p = [0.5 \;\; 0.5]$ is a stationary distribution
§ This is intuitively expected: the oscillating behavior of the powers of $T$, which results in pairwise symmetric matrices, perfectly balances the probabilities of the two states of the chain (see the sketch below).
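A short NumPy sketch of this case (illustrative, not from the slides): the powers oscillate, yet $p = [0.5 \;\; 0.5]$ is left unchanged by $T$.

```python
import numpy as np

# Case 3: period-2 chain, powers of T oscillate, so no limiting
# distribution exists, but p = [0.5, 0.5] is still stationary.
T = np.array([[0.0, 1.0],
              [1.0, 0.0]])

print(np.linalg.matrix_power(T, 10))   # even power -> identity
print(np.linalg.matrix_power(T, 11))   # odd power  -> T again

p = np.array([0.5, 0.5])
print(p @ T)                           # [0.5 0.5]: unchanged, stationary
```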
§ What about $m$-state chains? → Same analysis as for the 2-state case, with $m$-dimensional matrices
§ $T^n = U \Lambda^n U^{-1}$, $\Lambda^n = \begin{pmatrix} \lambda_1^n & \cdots & 0 \\ \vdots & \ddots & \vdots \\ 0 & \cdots & \lambda_m^n \end{pmatrix}$
§ Example: $T = \begin{pmatrix} 1/4 & 1/2 & 1/4 \\ 1/2 & 1/4 & 1/4 \\ 1/4 & 1/4 & 1/2 \end{pmatrix}$
§ Eigenvectors: $v_1 = \begin{pmatrix} 1 \\ 1 \\ 1 \end{pmatrix}$, $v_2 = \begin{pmatrix} -1 \\ -1 \\ 2 \end{pmatrix}$, $v_3 = \begin{pmatrix} 1 \\ -1 \\ 0 \end{pmatrix}$
§ Eigenvalues: $\lambda_1 = 1$, $\lambda_2 = 1/4$, $\lambda_3 = -1/4$
§
$$T^n = \frac{1}{6} \begin{pmatrix} 1 & -1 & 1 \\ 1 & -1 & -1 \\ 1 & 2 & 0 \end{pmatrix} \begin{pmatrix} 1 & 0 & 0 \\ 0 & (1/4)^n & 0 \\ 0 & 0 & (-1/4)^n \end{pmatrix} \begin{pmatrix} 2 & 2 & 2 \\ -1 & -1 & 2 \\ 3 & -3 & 0 \end{pmatrix} \;\xrightarrow{\; n \to \infty \;}\; Q = \begin{pmatrix} 1/3 & 1/3 & 1/3 \\ 1/3 & 1/3 & 1/3 \\ 1/3 & 1/3 & 1/3 \end{pmatrix}$$
§ $p^{(n)} = p^{(0)} T^n$ → the limiting distribution: $p = \lim_{n\to\infty} p^{(n)} = \lim_{n\to\infty} p^{(0)} T^n = p^{(0)} Q$
§ $p = [1/3 \;\; 1/3 \;\; 1/3]$, which is also a stationary distribution
§ $p = \lim_{n\to\infty} p^{(n)} = \lim_{n\to\infty} p^{(0)} T^n = p^{(0)} Q = [1/3 \;\; 1/3 \;\; 1/3]$ is an invariant limiting distribution (a stationary distribution); written out (numerical check below):
$$\begin{bmatrix} p_1^{(0)} & p_2^{(0)} & p_3^{(0)} \end{bmatrix} \begin{pmatrix} q_{11} & q_{12} & q_{13} \\ q_{21} & q_{22} & q_{23} \\ q_{31} & q_{32} & q_{33} \end{pmatrix} = \begin{bmatrix} p_1^{(0)} q_{11} + p_2^{(0)} q_{21} + p_3^{(0)} q_{31} \\ p_1^{(0)} q_{12} + p_2^{(0)} q_{22} + p_3^{(0)} q_{32} \\ p_1^{(0)} q_{13} + p_2^{(0)} q_{23} + p_3^{(0)} q_{33} \end{bmatrix}^\top = \begin{bmatrix} \tfrac{1}{3}\big(p_1^{(0)} + p_2^{(0)} + p_3^{(0)}\big) \\ \tfrac{1}{3}\big(p_1^{(0)} + p_2^{(0)} + p_3^{(0)}\big) \\ \tfrac{1}{3}\big(p_1^{(0)} + p_2^{(0)} + p_3^{(0)}\big) \end{bmatrix}^\top = \begin{bmatrix} \tfrac{1}{3} & \tfrac{1}{3} & \tfrac{1}{3} \end{bmatrix}$$
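A small NumPy check of this example (the starting distribution p0 below is an arbitrary illustration):

```python
import numpy as np

# The 3-state example above: all rows of T^n converge to the
# limiting (and stationary) distribution [1/3, 1/3, 1/3].
T = np.array([[1/4, 1/2, 1/4],
              [1/2, 1/4, 1/4],
              [1/4, 1/4, 1/2]])

Tn = np.linalg.matrix_power(T, 50)
print(Tn)                          # every row ~ [0.3333 0.3333 0.3333]

# Any initial distribution is mapped to the same limit
p0 = np.array([0.9, 0.05, 0.05])   # arbitrary example start
print(p0 @ Tn)                     # ~ [0.3333 0.3333 0.3333]
```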
§ Fundamental prediction queries:
§ What will be the probability of each state in the long run?
§ What will be the probability of state $j$ in the long run?
§ Will the state probability distribution be stationary?
§ Under which conditions does a MC have a limiting (and therefore a stationary) distribution?
§ Under which conditions is the limiting distribution invariant?
§ How long does it take to (approximately) reach the limiting distribution?
[Figure: snapshots of the state distribution at t = 0, 100, 1000, and 100000 vs. 100001]
§ Answering these questions requires introducing a state classification (a code sketch for some of these checks follows this list):
§ Absorbing states: once entered, there's no escape: $p_{ii} = 1$, $p_{ij} = 0 \;\; \forall j \neq i$
§ Periodic states: the probability of a return to state $i$ satisfies $p_{ii}^{(n)} > 0$ only for $n = d, 2d, 3d, \ldots$ (periodic with period $d$)
§ Persistent states (also referred to as recurrent states): following a first visit, a return to the state at some step is certain
§ Non-null (persistent) states: starting in state $i$, the mean number of steps $m_i$ to return to $i$ is finite, $m_i < \infty$
§ Null (persistent) states: starting in state $i$, the mean number of steps $m_i$ to return to $i$ is infinite, $m_i = \infty$
§ Transient states: a return to the state is not certain
§ Ergodic states: aperiodic + persistent + non-null
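A hedged sketch of how some of these classes can be read off a transition matrix; the helper names (absorbing_states, reachable, is_transient) and the example matrix are illustrative additions, and the transience test relies on the chain being finite:

```python
import numpy as np

def absorbing_states(T):
    """States i with T[i, i] == 1 (hence T[i, j] == 0 for j != i)."""
    return [i for i in range(len(T)) if np.isclose(T[i, i], 1.0)]

def reachable(T, i):
    """Set of states reachable from i in any number of steps (DFS)."""
    seen, stack = {i}, [i]
    while stack:
        s = stack.pop()
        for j in np.nonzero(T[s] > 0)[0]:
            if j not in seen:
                seen.add(j)
                stack.append(j)
    return seen

def is_transient(T, i):
    # In a finite chain, i is transient iff some reachable state j
    # cannot lead back to i (i's communicating class is not closed).
    return any(i not in reachable(T, j) for j in reachable(T, i))

T = np.array([[0.5, 0.5, 0.0],
              [0.0, 0.5, 0.5],
              [0.0, 0.0, 1.0]])   # state 2 absorbs; 0 and 1 are transient
print(absorbing_states(T))                      # [2]
print([is_transient(T, i) for i in range(3)])   # [True, True, False]
```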
§ Irreducible chain: every state can be reached (is accessible) from every other state of the chain in a finite number of steps
§ Since any state $s_j$ can be reached from any other state $s_i$, irreducibility means: $p_{ij}^{(n)} > 0$ for some integer $n$
§ A matrix $Q = (q_{ij})$ is said to be positive if $q_{ij} > 0$ for all $i, j$
§ Regular Markov chain: there exists an integer $n$ such that $T^n$ is positive (see the sketch below)
§ Regular chain ⇒ irreducible
§ Irreducible ⇏ regular chain (not necessarily):
$$T = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}, \quad T^{2n} = \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}, \quad T^{2n+1} = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix} = T$$
$T$ is irreducible, but no power of $T$ is a positive matrix
§ Theorem: all states of an irreducible chain are of the same type, either all transient or all persistent, and all have the same period.
§ However, they cannot all be transient, since that would mean that a return to any state is not certain even though all states are accessible from all other states in a finite number of steps ⇒ all states are recurrent
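A sketch of a regularity check along these lines (the function name is an illustrative addition; the power bound $(m-1)^2 + 1$ is the standard Wielandt bound for primitive matrices):

```python
import numpy as np

def is_regular(T, max_power=None):
    """A chain is regular if some power of T is strictly positive.

    For an m-state chain it suffices to check powers up to (m-1)^2 + 1.
    """
    m = len(T)
    k = max_power or (m - 1) ** 2 + 1
    P = np.eye(m)
    for _ in range(k):
        P = P @ T
        if np.all(P > 0):
            return True
    return False

T_periodic = np.array([[0.0, 1.0],
                       [1.0, 0.0]])   # irreducible but not regular
T_general = np.array([[0.7, 0.3],
                      [0.6, 0.4]])    # regular: T itself is positive
print(is_regular(T_periodic), is_regular(T_general))   # False True
```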
§ Ergodic chain: all states are ergodic, that is, persistent, non-null, and aperiodic
§ Irreducible + aperiodic states ⇒ ergodic
§ Regular ⇒ irreducible (⇒ recurrent states) + aperiodic ⇒ ergodic
§ Note: ergodic ⇏ regular
§ A MC is ergodic if there is a number $n$ such that any state can be reached from any other state in any number of steps greater than or equal to $n$
§ In the case of a fully connected transition matrix, where all transitions have a non-zero probability, this condition is trivially fulfilled with $n = 1$
§ Ergodic Markov chains have a limiting invariant distribution $p$
§ → they have a stationary distribution $p$
§ → regardless of the initial state, the time-$t$ distribution of the chain converges to $p$ as $t$ tends to infinity
§ How large must $t$ be until the time-$t$ distribution is approximately $p$? → Mixing time
§ For an ergodic chain, the invariant distribution $p$ is the vector of the reciprocals of the mean recurrence times
§ Check for ergodicity: if only one eigenvalue of $T$ takes value 1, then the Markov chain is ergodic (this derives from the eigenvector equation $T^\top p^\top = p^\top$); a numerical sketch follows below
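A numerical sketch of the eigenvalue check and of mixing, reusing the 3-state example chain from earlier (counting eigenvalues of modulus 1, which also rules out periodic chains, is an illustrative strengthening of the check above):

```python
import numpy as np

T = np.array([[1/4, 1/2, 1/4],
              [1/2, 1/4, 1/4],
              [1/4, 1/4, 1/2]])

# Ergodicity check: exactly one eigenvalue on the unit circle
eigvals = np.linalg.eigvals(T.T)
num_unit = np.sum(np.isclose(np.abs(eigvals), 1.0))
print("ergodic" if num_unit == 1 else "not ergodic")   # ergodic

# Crude look at mixing: distance to the invariant p shrinks geometrically
p = np.full(3, 1/3)              # invariant distribution of this chain
pt = np.array([1.0, 0.0, 0.0])   # start deterministically in state 0
for t in range(1, 11):
    pt = pt @ T
    print(t, 0.5 * np.abs(pt - p).sum())   # total-variation distance
```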
§ Markov chain: prediction, what is the state distribution at time $t$? Common ingredients: a discrete-time random process, a countable state set, and a transition matrix that defines the internal stochastic dynamics (the environment's dynamics)
§ Markov reward process: MC ∪ {rewards}: prediction, what is the expected cumulative reward at time $t$? What is the state distribution at $t$?
[Figure: chain diagrams annotated with rewards (+1, +2, +3) and actions]
§ Markov decision process (MDP): MC ∪ {rewards} ∪ {actions}: control, what is the optimal decision policy to optimize the collected rewards?
[Figure: deterministic actions vs. uncertain actions]
Action effect is stochastic: a probability distribution over next states.
In general the process is non-Markov: the outcome can depend on the entire state-action history:
$$P(s_{t+1} = s' \mid s_t, s_{t-1}, \ldots, s_0, a_t, a_{t-1}, \ldots, a_0) = P(s_{t+1} = s' \mid s_{t:0}, a_{t:0})$$
✓ Deterministic: one single successor state, $(s, a) \to s'$
✓ Probabilistic: conditional distribution over successor states + Markov property: $(s_t, a_t) \to P(s_{t+1} = s' \mid s_t = s, a_t = a)$, i.e., $(s, a) \to P(s' \mid s, a)$
§ A maze-like problem:
§ The agent lives in a grid world
§ Walls block the agent's path
§ The agent receives rewards each time step:
§ a small "living" reward $R$ each step (can be negative)
§ big rewards come at the end (good or bad)
§ Goal: maximize the sum of rewards
§ Potentially unlimited horizon
§ Noisy movement: actions do not always go as planned:
§ 80% of the time, the action takes the agent in the desired direction (if there is no wall there)
§ 10% of the time, the action takes the agent in the direction perpendicular to the right; 10% perpendicular to the left (see the transition-model sketch below)
§ If there is a wall in the direction the agent would have gone, the agent stays put
[Figure: grid world with Exit states (one with reward +1) and living reward R]
How do we formalize it and find the optimal policy?
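A hedged sketch of the 80/10/10 transition model just described; the function name, direction encoding, and grid layout are illustrative assumptions, not the course's code:

```python
# Directions as (row, col) offsets; perpendicular pairs for the noise model
UP, RIGHT, DOWN, LEFT = (-1, 0), (0, 1), (1, 0), (0, -1)
PERP = {UP: (LEFT, RIGHT), DOWN: (LEFT, RIGHT),
        LEFT: (UP, DOWN),  RIGHT: (UP, DOWN)}

def successor_dist(state, action, grid):
    """P(s' | s, a) for the noisy grid world, as a dict {state: prob}."""
    dist = {}
    side1, side2 = PERP[action]
    for d, prob in [(action, 0.8), (side1, 0.1), (side2, 0.1)]:
        r, c = state[0] + d[0], state[1] + d[1]
        # stay put if the move hits a wall ('#') or leaves the grid
        if not (0 <= r < len(grid) and 0 <= c < len(grid[0])) or grid[r][c] == '#':
            r, c = state
        dist[(r, c)] = dist.get((r, c), 0.0) + prob
    return dist

grid = ["...+",    # '+' exit with reward +1 (illustrative layout)
        ".#.-",    # '#' wall, '-' bad exit
        "...."]
print(successor_dist((2, 0), UP, grid))
# {(1, 0): 0.8, (2, 0): 0.1, (2, 1): 0.1}
```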
Goal: define the action decision policy $\pi(s, a)$ that maximizes a given (utility) function of the rewards, potentially for $t \to \infty$
§ A set $S$ of world states
§ A set $A$ of feasible actions
§ A stochastic transition matrix $T$, $T: S \times S \times A \times \{0, 1, \ldots, t\} \mapsto [0, 1]$, $T(s, s', a) = P(s' \mid s, a)$
§ A reward function $R$: $R(s)$, $R(s, a)$, or $R(s, a, s')$, $R: S \times A \times S \times \{0, 1, \ldots, t\} \longmapsto \mathbb{R}$
§ A start state (or a distribution over initial states), optional
§ Terminal/absorbing states, optional
The presence of $t$ accounts for non-homogeneous Markov processes (a container sketch follows below)
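A minimal sketch of this tuple as a Python container; the field names and types are my assumptions, and the time index of non-homogeneous processes is omitted for simplicity:

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional, Tuple

State, Action = str, str   # illustrative; any hashable types work

@dataclass
class MDP:
    states: List[State]                                     # S
    actions: List[Action]                                   # A
    # T(s, a) -> distribution over successor states s'
    transition: Callable[[State, Action], Dict[State, float]]
    # R(s, a, s') -> immediate reward (the R(s) and R(s, a) variants
    # are special cases that ignore some arguments)
    reward: Callable[[State, Action, State], float]
    start: Optional[State] = None                           # optional
    terminals: Tuple[State, ...] = ()                       # optional
```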
General model for probabilistic planning: MDP = $\langle S, s_{start}, S_{terminal}, A, T, R \rangle$: find the policy optimizing the expected utility
Classical deterministic planning: P = $\langle S, s_{start}, S_{goal}, A, T, c, G \rangle$: find the action sequence achieving the (best) goal state (least-cost path)
Example from Sutton and Barto.
Note: the "state" (the robot's battery status) is a parameter of the agent itself, not a property of the physical environment.
§ At each step, a recycling robot has to decide whether it should: search for a can; wait for someone to bring it a can; or go to the home base and recharge.
§ Searching is better but runs down the battery; if the battery runs out while searching, the robot has to be rescued.
§ States are battery levels: high, low.
§ Reward = (expected) number of cans collected (a dynamics sketch follows below)
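A hedged sketch of the resulting dynamics, following Sutton and Barto's recycling-robot example (Example 3.3 in their book); the numeric values of alpha, beta, r_search, and r_wait are illustrative placeholders, not fixed by the problem:

```python
# Recycling-robot dynamics: P(s', r | s, a) as {(s, a): [(prob, s', reward)]}
alpha, beta = 0.9, 0.4           # P(battery stays high / stays low) on search
r_search, r_wait = 2.0, 1.0      # expected cans per step, search > wait

dynamics = {
    ("high", "search"):   [(alpha,     "high", r_search),
                           (1 - alpha, "low",  r_search)],
    ("low",  "search"):   [(beta,      "low",  r_search),
                           (1 - beta,  "high", -3.0)],    # rescued: penalty
    ("high", "wait"):     [(1.0, "high", r_wait)],
    ("low",  "wait"):     [(1.0, "low",  r_wait)],
    ("low",  "recharge"): [(1.0, "high", 0.0)],
}
```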
§ In deterministic single-agent search problems, we were looking for an optimal plan, or sequence of actions, from the start to a goal
§ In MDPs we (usually) don't have a specific goal; instead we look for a policy, a mapping from states to actions: $\pi: S \to A$
§ $\pi(s)$ deterministically specifies what action to take in each state → deterministic policy
§ An explicit policy defines a reflex agent
§ A policy can also be stochastic: $\pi(s, a)$ specifies the probability of taking action $a$ in state $s$
§ In MDPs, if $T$ is deterministic, the optimal policy is deterministic
§ How many non-terminal (absorbing) states? How many actions? How many deterministic policies?
§ 9, 4, and $4^9 = 262144$
§ For a grid of 100×100 cells, the number of policies is $4^{10000}$, a huge number!