[PPT] - Games Miheer Dewaskar Chennai Mathematical Institute April 27, PowerPoint Presentation

SLIDE 1

Games

Miheer Dewaskar

Chennai Mathematical Institute April 27, 2016

1 / 19

SLIDE 2

Outline

Finite Duration Games
  Win-Lose Games
  Payoff Games
Infinite Duration Games
  Parity Games
  Mean Payoff Games
Simple Stochastic Games

2 / 19

SLIDE 4

Finite games

Win-Lose game

Circle Wins Box Wins

4 / 19

SLIDE 8

Finite games

Win-Lose game

Box wins

Circle Wins Box Wins

4 / 19

SLIDE 10

Finite games

Win-Lose game

Circle wins

Circle Wins Box Wins

4 / 19

SLIDE 12

Finite games

Win-Lose game

Algorithm for optimal play

Circle Wins Box Wins

4 / 19

SLIDE 14

Finite games

Win-Lose game

Algorithm for optimal play

Box can always win

Circle Wins Box Wins

4 / 19
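
The algorithm for optimal play in a finite win-lose game can be sketched as backward induction over the game tree. The Python below is a minimal illustration, not code from the talk: the tuple encoding of nodes and the example tree are hypothetical, because the tree drawn on the slide cannot be recovered from the text.

```python
# Backward induction on a finite win-lose game tree (illustrative sketch).
# Node encoding (an assumption made for this example):
#   ("leaf", winner)        play ends; winner is "Circle" or "Box"
#   (owner, [child, ...])   owner ("Circle" or "Box") chooses the next child


def winner(node):
    """Return the player who wins from `node` when both play optimally."""
    if node[0] == "leaf":
        return node[1]
    owner, children = node
    opponent = "Box" if owner == "Circle" else "Circle"
    # The owner wins if some move leads to a position the owner wins;
    # otherwise every available move hands the win to the opponent.
    if any(winner(child) == owner for child in children):
        return owner
    return opponent


# Hypothetical tree: Circle moves first, then Box replies.
example_tree = (
    "Circle",
    [
        ("Box", [("leaf", "Circle"), ("leaf", "Box")]),
        ("Box", [("leaf", "Box"), ("leaf", "Circle")]),
    ],
)

if __name__ == "__main__":
    print(winner(example_tree))
```

In this hypothetical tree Box has a winning reply after either of Circle's moves, so the function reports Box as the winner from the root.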

SLIDE 15

Finite games

Payoff game

4

1
1

4 1

2

Maximizer Minimizer

5 / 19

SLIDE 18

Finite games

Payoff game

Payoff

Min pays 4 units to Max

4

1
1

4 1

2

Maximizer Minimizer

5 / 19

SLIDE 20

Finite games

Payoff game

Payoff

Min pays -1 units to Max

4

1
1

4 1

2

Maximizer Minimizer

5 / 19

SLIDE 24

Finite games

Payoff game

MinMax algorithm

Value = 1
Min can ensure a payoff ≤ 1
Max can ensure a payoff ≥ 1
When both play optimally the payoff is exactly 1.

[Figure: payoff game tree with Maximizer and Minimizer nodes, annotated with the values computed by the MinMax algorithm]

5 / 19
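
The MinMax computation can be sketched in a few lines of Python. This is an illustrative sketch under stated assumptions: the nested-tuple encoding and the example tree are hypothetical (the slide's tree is not recoverable from the text), chosen so that the value comes out as 1.

```python
# MinMax on a finite payoff game tree (illustrative sketch).
# Node encoding (an assumption made for this example):
#   a number                  leaf; the payoff Min pays to Max
#   (player, [child, ...])    player is "Max" or "Min" and picks a child


def minimax(node):
    """Return the value of `node` under optimal play by both players."""
    if isinstance(node, (int, float)):
        return node
    player, children = node
    values = [minimax(child) for child in children]
    # Max picks the largest child value, Min the smallest.
    return max(values) if player == "Max" else min(values)


# Hypothetical tree: Max moves at the root, Min at the second level.
example_tree = ("Max", [("Min", [4, 1]), ("Min", [1, 2])])

if __name__ == "__main__":
    print(minimax(example_tree))
```

Here both Min nodes evaluate to 1, so the root value is 1: Max cannot get more and Min cannot force less.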

SLIDE 26

Parity Games

Winning conditions

[Figure: parity game graph with priorities 5, 6, 1, 2, 3; vertices owned by Odd and Even]

7 / 19

SLIDE 50

Parity Games

Winning conditions

π1 = 1 5 2 1 2 1 2 . . .
Inf(π1) = {1, 2}, max Inf(π1) = 2: Even wins

π2 = 1 5 2 1 5 2 . . .
Inf(π2) = {1, 2, 5}, max Inf(π2) = 5: Odd wins

π = 1 2 3 3 6 5 2 1 . . .
Parity(max Inf(π)) wins

5 6 1 2 3 Odd Even

7 / 19
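
For an eventually periodic play, the winning condition above can be checked directly. The sketch below assumes the play is given as a finite prefix followed by a cycle repeated forever, both as lists of priorities; the function name and this encoding are illustrative choices, not part of the talk.

```python
# Winner of an eventually periodic play prefix . cycle^omega (sketch).


def parity_winner(prefix, cycle):
    """Return "Even" or "Odd" for the play prefix followed by cycle forever.

    Only the priorities on the repeated cycle occur infinitely often,
    so the prefix does not influence the result.
    """
    if not cycle:
        raise ValueError("cycle must be non-empty")
    top = max(cycle)  # max Inf of the play
    return "Even" if top % 2 == 0 else "Odd"


if __name__ == "__main__":
    # pi1 = 1 5 2 1 2 1 2 ... : prefix 1 5, then (2 1) repeated
    print(parity_winner([1, 5], [2, 1]))
    # pi2 = 1 5 2 1 5 2 ...   : (1 5 2) repeated
    print(parity_winner([], [1, 5, 2]))
```

With these inputs the first play is won by Even (max Inf is 2) and the second by Odd (max Inf is 5), matching the two examples on the slide.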

SLIDE 52

Parity Games

Questions

Does either Even or Odd have a strategy to always win? Yes

If so, then how to compute the winning strategy? By reduction to finite duration games

8 / 19

SLIDE 53

Parity Games

5 6 1 2 3 Odd Even 1 5 6 2 3 6 2 3 6 5 Odd Wins Even Wins

9 / 19

SLIDE 54

Parity Games

5 6 1 2 3 Odd Even 1 5 6 5 2 3 6 5 3 1 2 1 3 3 6 5 2 6 Odd Wins Even Wins

9 / 19

SLIDE 57

Parity Games

5 6 1 2 3 Odd Even

Finite game

Even has a winning strategy

1 5 6 2 3 6 2 3 6 5 Odd Wins Even Wins

9 / 19

SLIDE 60

Parity Games

[Figure: parity game graph with priorities 5, 6, 1, 2, 3; vertices owned by Odd and Even]

Finite game

Every loop has max priority even

[Figure: finite game tree obtained by unfolding the parity game, leaves marked Odd Wins / Even Wins]

Extension to infinite plays

π = 1                      Stack = 1
π = 1 2                    Stack = 1 2
π = 1 2 1                  Stack = 1 2 1
π = 1 2 1                  Stack = 1              (cycle 1 2 1 eliminated)
π = 1 2 1 5                Stack = 1 5
π = 1 2 1 5 6              Stack = 1 5 6
π = 1 2 1 5 6 5            Stack = 1 5 6 5
π = 1 2 1 5 6 5            Stack = 1 5            (cycle 5 6 5 eliminated)
π = 1 2 1 5 6 5 2          Stack = 1 5 2
π = 1 2 1 5 6 5 2 3        Stack = 1 5 2 3
π = 1 2 1 5 6 5 2 3 6      Stack = 1 5 2 3 6
π = 1 2 1 5 6 5 2 3 6 5    Stack = 1 5 2 3 6 5
π = 1 2 1 5 6 5 2 3 6 5    Stack = 1 5 . . .      (cycle 5 2 3 6 5 eliminated)

Every eliminated cycle has max priority even
Hence max Inf priority in π is Even

9 / 19
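
The stack argument can be replayed mechanically. The sketch below is an illustration under assumptions: vertices are identified with their priorities (as the slide's example appears to do), and the function name and return format are hypothetical. It pushes each visited vertex, and when a vertex repeats it records the maximum priority on the cycle that just closed and pops that cycle off the stack.

```python
# Stack-based cycle elimination along a play (illustrative sketch).
# Assumption: each vertex is named by its priority unless `priority` is given.


def eliminate_cycles(play, priority=lambda v: v):
    """Return (final stack, list of max priorities of eliminated cycles)."""
    stack = []
    cycle_maxima = []
    for vertex in play:
        if vertex in stack:
            start = stack.index(vertex)
            cycle = stack[start:]  # vertices on the cycle that just closed
            cycle_maxima.append(max(priority(v) for v in cycle))
            del stack[start + 1:]  # keep the repeated vertex once
        else:
            stack.append(vertex)
    return stack, cycle_maxima


if __name__ == "__main__":
    play = [1, 2, 1, 5, 6, 5, 2, 3, 6, 5]
    stack, maxima = eliminate_cycles(play)
    print(stack, maxima)
```

On the play from the slide the three eliminated cycles have maximum priorities 2, 6 and 6, all even, and the stack ends as 1 5. Since the stack stays bounded, every priority seen infinitely often lies on eliminated cycles, which is why the maximum priority seen infinitely often is even.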

SLIDE 75

Parity Games

Better Algorithms

Marcin Jurdzinski and Jens Vöge. “A discrete strategy improvement algorithm for solving parity games”. In: Computer Aided Verification. Springer, 2000, pp. 202–215
Upper bound¹: O((n/d)^d)

Marcin Jurdzinski, Mike Paterson, and Uri Zwick. “A Deterministic Subexponential Algorithm for Solving Parity Games”. In: SIAM Journal on Computing 38.4 (Jan. 2008), pp. 1519–1532
n^O(√n)

¹ See also Friedmann, “Exponential Lower Bounds for Solving Infinitary Payoff Games and Linear Programs”.

10 / 19

SLIDE 77

Mean Payoff Games

Payoffs

a b c

2
1

2

2
1

3 Min Max

12 / 19

SLIDE 109

Mean Payoff Games

Payoffs

(ab)^ω: Min pays 1/2 units to Max

a −[−2]→ b −[+3]→ a −[−2]→ b −[+3]→ a −[−2]→ b −[+3]→ a . . .
(−2 + 3 − 2 + 3 − 2 + 3)/6 ∼ n(−2 + 3)/(2n) → 1/2

(acb)^ω: Min pays 0 units to Max

a −[−1]→ c −[−2]→ b −[+3]→ a −[−1]→ c −[−2]→ b −[+3]→ a . . .
(−1 − 2 + 3 − 1 − 2 + 3)/6 ∼ n(−1 − 2 + 3)/(3n) → 0

Generally
Min tries to minimize lim sup
Max tries to maximize lim inf

[Figure: mean payoff game graph on vertices a, b, c with weighted edges; vertices owned by Min and Max]

12 / 19
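
The two limits can be checked with exact arithmetic. The sketch below assumes a play that repeats a fixed cycle of edge weights forever; the function names are illustrative. The weights used are the ones shown on the slide for (ab)^ω and (acb)^ω.

```python
# Mean payoff of a play that repeats a cycle of edge weights (sketch).
from fractions import Fraction


def mean_payoff(cycle_weights):
    """Limit of the running averages when the cycle is repeated forever."""
    return Fraction(sum(cycle_weights), len(cycle_weights))


def running_averages(cycle_weights, steps):
    """Average payoff after each of the first `steps` edges of the play."""
    total = 0
    averages = []
    for i in range(steps):
        total += cycle_weights[i % len(cycle_weights)]
        averages.append(Fraction(total, i + 1))
    return averages


if __name__ == "__main__":
    print(mean_payoff([-2, 3]))       # play (ab)^omega
    print(mean_payoff([-1, -2, 3]))   # play (acb)^omega
    print(running_averages([-2, 3], 6))
```

For (ab)^ω the running averages oscillate (−2, 1/2, −1/3, 1/2, ...) but converge to 1/2. For plays that are not eventually periodic the averages need not converge, which is why the general definition uses lim sup and lim inf.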

SLIDE 112

Mean Payoff Games

Questions

Does the game have a value? i.e. is there a v so that

Max can ensure lim inf ≥ v
Min can ensure lim sup ≤ v

Yes

How to compute the optimal strategies? Solution using the finite game

13 / 19

SLIDE 113

Mean Payoff

Finite Game

a b c c b

1
1

3 1 2

a b c

2
1

2

2
1

3 Min Max

14 / 19

SLIDE 118

Mean Payoff

Finite Game

a b c c b

1
1

3 1 2

a b c

2
1

2

2
1

3 Min Max

Max can ensure ≥ 0 in the finite game

14 / 19

SLIDE 120

Mean Payoff

Finite Game

a b c c b

1
1

3 1 2

a b c

2
1

2

2
1

3 Min Max

Min can ensure ≤ 0

14 / 19

SLIDE 121

Mean Payoff

Finite Game

a b c c b

1
1

3 1 2

a b c

2
1

2

2
1

3 Min Max

Min can ensure ≤ 0 in the mean payoff game too

14 / 19

SLIDE 122

Mean Payoff

Finite Game

[Figure: finite game tree obtained by unfolding the mean payoff game on vertices a, b, c]

[Figure: mean payoff game graph on vertices a, b, c with weighted edges; vertices owned by Min and Max]

Min can ensure ≤ 0 in the mean payoff game too

π = a                    Stack = a
π = a b                  Stack = a b
π = a b c                Stack = a b c
π = a b c b              Stack = a b c b
π = a b c b              Stack = a b            (cycle b c b eliminated)
π = a b c b c            Stack = a b c
π = a b c b c a          Stack = a b c a
π = a b c b c a          Stack = a              (cycle a b c a eliminated)
π = a b c b c a b        Stack = a b
π = a b c b c a b c      Stack = a b c
π = a b c b c a b c a    Stack = a b c a
π = a b c b c a b c a    Stack = a              (cycle a b c a eliminated)

Every time a cycle with average value ≤ 0 is eliminated

Hence limsup of averages of π is ≤ 0

14 / 19

SLIDE 135

Mean Payoff

Finite Game

a b c c b

1
1

3 1 2

a b c

2
1

2

2
1

3 Min Max

Max can ensure ≥ 0
Similarly Max can ensure that the lim inf of the average is ≥ 0

Hence the value of the mean payoff game is 0

14 / 19

SLIDE 137

Simple Stochastic Game

[Figure: simple stochastic game graph; random vertices move to each successor with probability 1/2]

16 / 19

SLIDE 142

Simple Stochastic Game

Circle Wins

1 2 1 2 1 2 1 2 16 / 19

SLIDE 145

Simple Stochastic Game

Box Wins

1 2 1 2 1 2 1 2 16 / 19

SLIDE 146

Simple Stochastic Game

Or

1 2 1 2 1 2 1 2 16 / 19

SLIDE 155

Simple Stochastic Game

Circle can win from with probability 1

1 2 1 2 1 2 1 2 16 / 19

SLIDE 160

Simple Stochastic Game

Values

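v( ) = 1
v( ) = 0
v( ) = 1/2 (v( ) + v( ))
v( ) = 1/2 (v( ) + v( ))
v( ) = max{v( ), v( )}
v( ) = min{v( ), v( )}

[Figure: simple stochastic game graph annotated with the value of each vertex; random vertices have probability 1/2 on each edge]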

These equations have a unique solution.

From state s:
  the maximizing player has a strategy to reach the sink of value 1 with probability ≥ v(s)
  the minimizing player has a strategy to reach the sink of value 0 with probability ≥ 1 − v(s)

17 / 19
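
One simple way to approximate a solution of these equations is value iteration: start from 0 everywhere except the winning sink and apply the equations repeatedly. The sketch below is an illustration, not the method of the talk. The game, its vertex names and the dictionary encoding are hypothetical, and in general value iteration only converges in the limit; this small game happens to stabilise after a few rounds.

```python
# Value iteration for a simple stochastic game (illustrative sketch).
# Encoding (an assumption made for this example): vertex -> (kind, data)
#   ("sink", value)               terminal vertex with fixed value 1.0 or 0.0
#   ("avg" | "max" | "min", (u, w))  vertex with exactly two successors u, w

GAME = {
    "win": ("sink", 1.0),
    "lose": ("sink", 0.0),
    "r1": ("avg", ("m", "lose")),
    "r2": ("avg", ("win", "lose")),
    "m": ("max", ("r1", "r2")),
    "n": ("min", ("m", "win")),
}


def value_iteration(game, iterations=100):
    """Apply the value equations `iterations` times, starting from 0."""
    values = {v: (data if kind == "sink" else 0.0)
              for v, (kind, data) in game.items()}
    for _ in range(iterations):
        updated = {}
        for v, (kind, data) in game.items():
            if kind == "sink":
                updated[v] = data
                continue
            x, y = values[data[0]], values[data[1]]
            if kind == "avg":
                updated[v] = 0.5 * (x + y)   # coin flip between successors
            elif kind == "max":
                updated[v] = max(x, y)
            else:
                updated[v] = min(x, y)
        values = updated
    return values


if __name__ == "__main__":
    for vertex, value in sorted(value_iteration(GAME).items()):
        print(vertex, value)
```

In this hypothetical game the values satisfy v(r2) = 1/2, v(m) = max{v(r1), 1/2} = 1/2, v(r1) = 1/4 and v(n) = 1/2, which is the only assignment consistent with all six equations.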

SLIDE 164

Complexity of solving games

Does Even win the Parity Game?
Is the value of the Mean Payoff Game ≥ 0?
Is the value in the Simple Stochastic Game ≥ 1/2?

NP ∩ coNP²

Open Problem

Is there a polynomial time algorithm for any of them?

² Chatterjee and Fijalkow, “A reduction from parity games to simple stochastic games”.

18 / 19

SLIDE 166

Timeline

Lloyd S. Shapley. “Stochastic games”. In: Proceedings of the National Academy of Sciences 39.10 (1953), pp. 1095–1100

E.A. Emerson and C.S. Jutla. “Tree automata, mu-calculus and determinacy”. In: IEEE Comput. Soc. Press, 1991, pp. 368–377

Anne Condon. “The complexity of stochastic games”. In: Information and Computation 96.2 (Feb. 1992), pp. 203–224

Uri Zwick and Mike Paterson. “The complexity of mean payoff games on graphs”. In: Theoretical Computer Science 158.1 (May 1996), pp. 343–359

Marcin Jurdziński. “Deciding the winner in parity games is in UP ∩ co-UP”. In: Information Processing Letters 68.3 (1998), pp. 119–124

Thank you

19 / 19