Games Miheer Dewaskar Chennai Mathematical Institute April 27, - - PowerPoint PPT Presentation

games
SMART_READER_LITE
LIVE PREVIEW

Games Miheer Dewaskar Chennai Mathematical Institute April 27, - - PowerPoint PPT Presentation

Games Miheer Dewaskar Chennai Mathematical Institute April 27, 2016 1 / 19 Outline Finite Duration Games Win-Lose Games Payoff Games Infinite Duration Games Parity Games Mean Payoff Games Simple Stochastic Games 2 / 19 Outline Finite


slide-1
SLIDE 1

Games

Miheer Dewaskar

Chennai Mathematical Institute April 27, 2016

1 / 19

slide-2
SLIDE 2

Outline

Finite Duration Games Win-Lose Games Payoff Games Infinite Duration Games Parity Games Mean Payoff Games Simple Stochastic Games

2 / 19

slide-3
SLIDE 3

Outline

Finite Duration Games Win-Lose Games Payoff Games Infinite Duration Games Simple Stochastic Games

3 / 19

slide-4
SLIDE 4

Finite games

Win-Lose game

Circle Wins Box Wins

4 / 19

slide-5
SLIDE 5

Finite games

Win-Lose game

Circle Wins Box Wins

4 / 19

slide-6
SLIDE 6

Finite games

Win-Lose game

Circle Wins Box Wins

4 / 19

slide-7
SLIDE 7

Finite games

Win-Lose game

Circle Wins Box Wins

4 / 19

slide-8
SLIDE 8

Finite games

Win-Lose game

Box wins

Circle Wins Box Wins

4 / 19

slide-9
SLIDE 9

Finite games

Win-Lose game

Circle Wins Box Wins

4 / 19

slide-10
SLIDE 10

Finite games

Win-Lose game

Circle wins

Circle Wins Box Wins

4 / 19

slide-11
SLIDE 11

Finite games

Win-Lose game

Circle Wins Box Wins

4 / 19

slide-12
SLIDE 12

Finite games

Win-Lose game

Algorithm for optimal play

Circle Wins Box Wins

4 / 19

slide-13
SLIDE 13

Finite games

Win-Lose game

Algorithm for optimal play

Circle Wins Box Wins

4 / 19

slide-14
SLIDE 14

Finite games

Win-Lose game

Algorithm for optimal play

Box can always win

Circle Wins Box Wins

4 / 19

slide-15
SLIDE 15

Finite games

Payoff game

4

  • 1
  • 1

4 1

  • 2

Maximizer Minimizer

5 / 19

slide-16
SLIDE 16

Finite games

Payoff game

4

  • 1
  • 1

4 1

  • 2

Maximizer Minimizer

5 / 19

slide-17
SLIDE 17

Finite games

Payoff game

4

  • 1
  • 1

4 1

  • 2

Maximizer Minimizer

5 / 19

slide-18
SLIDE 18

Finite games

Payoff game

Payoff

Min pays 4 units to Max

4

  • 1
  • 1

4 1

  • 2

Maximizer Minimizer

5 / 19

slide-19
SLIDE 19

Finite games

Payoff game

4

  • 1
  • 1

4 1

  • 2

Maximizer Minimizer

5 / 19

slide-20
SLIDE 20

Finite games

Payoff game

Payoff

Min pays -1 units to Max

4

  • 1
  • 1

4 1

  • 2

Maximizer Minimizer

5 / 19

slide-21
SLIDE 21

Finite games

Payoff game

MinMax algorithm

4

  • 1
  • 1
  • 1

4 1 1

  • 2

Maximizer Minimizer

5 / 19

slide-22
SLIDE 22

Finite games

Payoff game

MinMax algorithm

4 4

  • 1

1

  • 1
  • 1

4 1 1

  • 2

Maximizer Minimizer

5 / 19

slide-23
SLIDE 23

Finite games

Payoff game

MinMax algorithm

Value = 1 Min can ensure a payoff ≤ 1 Max can ensure a payoff ≥ 1

1 4 4

  • 1

1

  • 1
  • 1

4 1 1

  • 2

Maximizer Minimizer

5 / 19

slide-24
SLIDE 24

Finite games

Payoff game

MinMax algorithm

Value = 1 Min can ensure a payoff ≤ 1 Max can ensure a payoff ≥ 1 When both play optimally the payoff is exactly 1.

1 4 4

  • 1

1

  • 1
  • 1

4 1 1

  • 2

Maximizer Minimizer

5 / 19

slide-25
SLIDE 25

Outline

Finite Duration Games Infinite Duration Games Parity Games Mean Payoff Games Simple Stochastic Games

6 / 19

slide-26
SLIDE 26

Parity Games

Winning conditions

5 6 1 2 3 Odd Even

7 / 19

slide-27
SLIDE 27

Parity Games

Winning conditions

π1 =

5 6 1 2 3 Odd Even

7 / 19

slide-28
SLIDE 28

Parity Games

Winning conditions

π1 = 1

5 6 1 2 3 Odd Even

7 / 19

slide-29
SLIDE 29

Parity Games

Winning conditions

π1 = 1 5

5 6 1 2 3 Odd Even

7 / 19

slide-30
SLIDE 30

Parity Games

Winning conditions

π1 = 1 5 2

5 6 1 2 3 Odd Even

7 / 19

slide-31
SLIDE 31

Parity Games

Winning conditions

π1 = 1 5 2 1

5 6 1 2 3 Odd Even

7 / 19

slide-32
SLIDE 32

Parity Games

Winning conditions

π1 = 1 5 2 1 2

5 6 1 2 3 Odd Even

7 / 19

slide-33
SLIDE 33

Parity Games

Winning conditions

π1 = 1 5 2 1 2 1

5 6 1 2 3 Odd Even

7 / 19

slide-34
SLIDE 34

Parity Games

Winning conditions

π1 = 1 5 2 1 2 1 2 . . . inf(π1) = {1, 2} max Inf(π1) = 2 Even wins

5 6 1 2 3 Odd Even

7 / 19

slide-35
SLIDE 35

Parity Games

Winning conditions

π1 = 1 5 2 1 2 1 2 . . . inf(π1) = {1, 2} max Inf(π1) = 2 Even wins π2 =

5 6 1 2 3 Odd Even

7 / 19

slide-36
SLIDE 36

Parity Games

Winning conditions

π1 = 1 5 2 1 2 1 2 . . . inf(π1) = {1, 2} max Inf(π1) = 2 Even wins π2 = 1

5 6 1 2 3 Odd Even

7 / 19

slide-37
SLIDE 37

Parity Games

Winning conditions

π1 = 1 5 2 1 2 1 2 . . . inf(π1) = {1, 2} max Inf(π1) = 2 Even wins π2 = 1 5

5 6 1 2 3 Odd Even

7 / 19

slide-38
SLIDE 38

Parity Games

Winning conditions

π1 = 1 5 2 1 2 1 2 . . . inf(π1) = {1, 2} max Inf(π1) = 2 Even wins π2 = 1 5 2

5 6 1 2 3 Odd Even

7 / 19

slide-39
SLIDE 39

Parity Games

Winning conditions

π1 = 1 5 2 1 2 1 2 . . . inf(π1) = {1, 2} max Inf(π1) = 2 Even wins π2 = 1 5 2 1

5 6 1 2 3 Odd Even

7 / 19

slide-40
SLIDE 40

Parity Games

Winning conditions

π1 = 1 5 2 1 2 1 2 . . . inf(π1) = {1, 2} max Inf(π1) = 2 Even wins π2 = 1 5 2 1 5

5 6 1 2 3 Odd Even

7 / 19

slide-41
SLIDE 41

Parity Games

Winning conditions

π1 = 1 5 2 1 2 1 2 . . . inf(π1) = {1, 2} max Inf(π1) = 2 Even wins π2 = 1 5 2 1 5 2 . . . inf(π2) = {1, 2, 5} max Inf(π2) = 5 Odd wins

5 6 1 2 3 Odd Even

7 / 19

slide-42
SLIDE 42

Parity Games

Winning conditions

π1 = 1 5 2 1 2 1 2 . . . inf(π1) = {1, 2} max Inf(π1) = 2 Even wins π2 = 1 5 2 1 5 2 . . . inf(π2) = {1, 2, 5} max Inf(π2) = 5 Odd wins π =

5 6 1 2 3 Odd Even

7 / 19

slide-43
SLIDE 43

Parity Games

Winning conditions

π1 = 1 5 2 1 2 1 2 . . . inf(π1) = {1, 2} max Inf(π1) = 2 Even wins π2 = 1 5 2 1 5 2 . . . inf(π2) = {1, 2, 5} max Inf(π2) = 5 Odd wins π = 1

5 6 1 2 3 Odd Even

7 / 19

slide-44
SLIDE 44

Parity Games

Winning conditions

π1 = 1 5 2 1 2 1 2 . . . inf(π1) = {1, 2} max Inf(π1) = 2 Even wins π2 = 1 5 2 1 5 2 . . . inf(π2) = {1, 2, 5} max Inf(π2) = 5 Odd wins π = 1 2

5 6 1 2 3 Odd Even

7 / 19

slide-45
SLIDE 45

Parity Games

Winning conditions

π1 = 1 5 2 1 2 1 2 . . . inf(π1) = {1, 2} max Inf(π1) = 2 Even wins π2 = 1 5 2 1 5 2 . . . inf(π2) = {1, 2, 5} max Inf(π2) = 5 Odd wins π = 1 2 3

5 6 1 2 3 Odd Even

7 / 19

slide-46
SLIDE 46

Parity Games

Winning conditions

π1 = 1 5 2 1 2 1 2 . . . inf(π1) = {1, 2} max Inf(π1) = 2 Even wins π2 = 1 5 2 1 5 2 . . . inf(π2) = {1, 2, 5} max Inf(π2) = 5 Odd wins π = 1 2 3 3

5 6 1 2 3 Odd Even

7 / 19

slide-47
SLIDE 47

Parity Games

Winning conditions

π1 = 1 5 2 1 2 1 2 . . . inf(π1) = {1, 2} max Inf(π1) = 2 Even wins π2 = 1 5 2 1 5 2 . . . inf(π2) = {1, 2, 5} max Inf(π2) = 5 Odd wins π = 1 2 3 3 6

5 6 1 2 3 Odd Even

7 / 19

slide-48
SLIDE 48

Parity Games

Winning conditions

π1 = 1 5 2 1 2 1 2 . . . inf(π1) = {1, 2} max Inf(π1) = 2 Even wins π2 = 1 5 2 1 5 2 . . . inf(π2) = {1, 2, 5} max Inf(π2) = 5 Odd wins π = 1 2 3 3 6 5

5 6 1 2 3 Odd Even

7 / 19

slide-49
SLIDE 49

Parity Games

Winning conditions

π1 = 1 5 2 1 2 1 2 . . . inf(π1) = {1, 2} max Inf(π1) = 2 Even wins π2 = 1 5 2 1 5 2 . . . inf(π2) = {1, 2, 5} max Inf(π2) = 5 Odd wins π = 1 2 3 3 6 5 2

5 6 1 2 3 Odd Even

7 / 19

slide-50
SLIDE 50

Parity Games

Winning conditions

π1 = 1 5 2 1 2 1 2 . . . inf(π1) = {1, 2} max Inf(π1) = 2 Even wins π2 = 1 5 2 1 5 2 . . . inf(π2) = {1, 2, 5} max Inf(π2) = 5 Odd wins π = 1 2 3 3 6 5 2 1 . . . Parity(max Inf(π)) wins

5 6 1 2 3 Odd Even

7 / 19

slide-51
SLIDE 51

Parity Games

Questions

Does either Even or Odd have a strategy to always win? If so, then how to compute the winning strategy?

8 / 19

slide-52
SLIDE 52

Parity Games

Questions

Does either Even or Odd have a strategy to always win? Yes If so, then how to compute the winning strategy? By reduction to finite duration games

8 / 19

slide-53
SLIDE 53

Parity Games

5 6 1 2 3 Odd Even 1 5 6 2 3 6 2 3 6 5 Odd Wins Even Wins

9 / 19

slide-54
SLIDE 54

Parity Games

5 6 1 2 3 Odd Even 1 5 6 5 2 3 6 5 3 1 2 1 3 3 6 5 2 6 Odd Wins Even Wins

9 / 19

slide-55
SLIDE 55

Parity Games

5 6 1 2 3 Odd Even 1 5 6 5 2 3 6 5 3 1 2 1 3 3 6 5 2 6 Odd Wins Even Wins

9 / 19

slide-56
SLIDE 56

Parity Games

5 6 1 2 3 Odd Even 1 5 6 2 3 6 2 3 6 5 Odd Wins Even Wins

9 / 19

slide-57
SLIDE 57

Parity Games

5 6 1 2 3 Odd Even

Finite game

Even has a winning strategy

1 5 6 2 3 6 2 3 6 5 Odd Wins Even Wins

9 / 19

slide-58
SLIDE 58

Parity Games

5 6 1 2 3 Odd Even

Finite game

Every loop has max priority even

1 5 6 2 3 6 2 3 6 5 Odd Wins Even Wins

9 / 19

slide-59
SLIDE 59

Parity Games

5 6 1 2 3 Odd Even

Finite game

Every loop has max priority even

1 5 6 2 3 6 2 3 6 5 Odd Wins Even Wins

Extension to infinite plays

9 / 19

slide-60
SLIDE 60

Parity Games

5 6 1 2 3 Odd Even

Finite game

Every loop has max priority even

1 5 6 2 3 6 2 3 6 5 Odd Wins Even Wins

Extension to infinite plays

π = 1 Stack = 1

9 / 19

slide-61
SLIDE 61

Parity Games

5 6 1 2 3 Odd Even

Finite game

Every loop has max priority even

1 5 6 2 3 6 2 3 6 5 Odd Wins Even Wins

Extension to infinite plays

π = 1 2 Stack = 1 2

9 / 19

slide-62
SLIDE 62

Parity Games

5 6 1 2 3 Odd Even

Finite game

Every loop has max priority even

1 5 6 2 3 6 2 3 6 5 Odd Wins Even Wins

Extension to infinite plays

π = 1 2 1 Stack = 1 2 1

9 / 19

slide-63
SLIDE 63

Parity Games

5 6 1 2 3 Odd Even

Finite game

Every loop has max priority even

1 5 6 2 3 6 2 3 6 5 Odd Wins Even Wins

Extension to infinite plays

π = 1 2 1 Stack = 1

9 / 19

slide-64
SLIDE 64

Parity Games

5 6 1 2 3 Odd Even

Finite game

Every loop has max priority even

1 5 6 2 3 6 2 3 6 5 Odd Wins Even Wins

Extension to infinite plays

π = 1 2 1 5 Stack = 1 5

9 / 19

slide-65
SLIDE 65

Parity Games

5 6 1 2 3 Odd Even

Finite game

Every loop has max priority even

1 5 6 2 3 6 2 3 6 5 Odd Wins Even Wins

Extension to infinite plays

π = 1 2 1 5 6 Stack = 1 5 6

9 / 19

slide-66
SLIDE 66

Parity Games

5 6 1 2 3 Odd Even

Finite game

Every loop has max priority even

1 5 6 2 3 6 2 3 6 5 Odd Wins Even Wins

Extension to infinite plays

π = 1 2 1 5 6 5 Stack = 1 5 6 5

9 / 19

slide-67
SLIDE 67

Parity Games

5 6 1 2 3 Odd Even

Finite game

Every loop has max priority even

1 5 6 2 3 6 2 3 6 5 Odd Wins Even Wins

Extension to infinite plays

π = 1 2 1 5 6 5 Stack = 1 5

9 / 19

slide-68
SLIDE 68

Parity Games

5 6 1 2 3 Odd Even

Finite game

Every loop has max priority even

1 5 6 2 3 6 2 3 6 5 Odd Wins Even Wins

Extension to infinite plays

π = 1 2 1 5 6 5 2 Stack = 1 5 2

9 / 19

slide-69
SLIDE 69

Parity Games

5 6 1 2 3 Odd Even

Finite game

Every loop has max priority even

1 5 6 2 3 6 2 3 6 5 Odd Wins Even Wins

Extension to infinite plays

π = 1 2 1 5 6 5 2 3 Stack = 1 5 2 3

9 / 19

slide-70
SLIDE 70

Parity Games

5 6 1 2 3 Odd Even

Finite game

Every loop has max priority even

1 5 6 2 3 6 2 3 6 5 Odd Wins Even Wins

Extension to infinite plays

π = 1 2 1 5 6 5 2 3 6 Stack = 1 5 2 3 6

9 / 19

slide-71
SLIDE 71

Parity Games

5 6 1 2 3 Odd Even

Finite game

Every loop has max priority even

1 5 6 2 3 6 2 3 6 5 Odd Wins Even Wins

Extension to infinite plays

π = 1 2 1 5 6 5 2 3 6 5 Stack = 1 5 2 3 6 5 Every eliminated cycle has max priority even

9 / 19

slide-72
SLIDE 72

Parity Games

5 6 1 2 3 Odd Even

Finite game

Every loop has max priority even

1 5 6 2 3 6 2 3 6 5 Odd Wins Even Wins

Extension to infinite plays

π = 1 2 1 5 6 5 2 3 6 5 Stack = 1 5 . . . Every eliminated cycle has max priority even

9 / 19

slide-73
SLIDE 73

Parity Games

5 6 1 2 3 Odd Even

Finite game

Every loop has max priority even

1 5 6 2 3 6 2 3 6 5 Odd Wins Even Wins

Extension to infinite plays

π = 1 2 1 5 6 5 2 3 6 5 Stack = 1 5 . . . Every eliminated cycle has max priority even Hence max Inf priority in π is Even

9 / 19

slide-74
SLIDE 74

Parity Games

Better Algorithms

Marcin Jurdzinski and Jens Vöge. “A discrete strategy improvement algorithm for solving parity games”. In: Computer Aided Verification. Springer, 2000, pp. 202–215 Upper bound1 : O

  • (n/d)d

1see also Friedmann, “Exponential Lower Bounds for Solving Infinitary Payoff Games

and Linear Programs”.

10 / 19

slide-75
SLIDE 75

Parity Games

Better Algorithms

Marcin Jurdzinski and Jens Vöge. “A discrete strategy improvement algorithm for solving parity games”. In: Computer Aided Verification. Springer, 2000, pp. 202–215 Upper bound1 : O

  • (n/d)d

Marcin Jurdzinski, Mike Paterson, and Uri Zwick. “A Deterministic Subexponential Algorithm for Solving Parity Games”. In: SIAM Journal on Computing 38.4 (Jan. 2008), pp. 1519–1532 nO(√n)

1see also Friedmann, “Exponential Lower Bounds for Solving Infinitary Payoff Games

and Linear Programs”.

10 / 19

slide-76
SLIDE 76

Outline

Finite Duration Games Infinite Duration Games Parity Games Mean Payoff Games Simple Stochastic Games

11 / 19

slide-77
SLIDE 77

Mean Payoff Games

Payoffs

a b c

  • 2
  • 1

2

  • 2
  • 1

3 Min Max

12 / 19

slide-78
SLIDE 78

Mean Payoff Games

Payoffs (ab)ω

a b c

  • 2
  • 1

2

  • 2
  • 1

3 Min Max

12 / 19

slide-79
SLIDE 79

Mean Payoff Games

Payoffs (ab)ω

a b c

  • 2
  • 1

2

  • 2
  • 1

3 Min Max

12 / 19

slide-80
SLIDE 80

Mean Payoff Games

Payoffs (ab)ω

a b c

  • 2
  • 1

2

  • 2
  • 1

3 Min Max

12 / 19

slide-81
SLIDE 81

Mean Payoff Games

Payoffs (ab)ω

a b c

  • 2
  • 1

2

  • 2
  • 1

3 Min Max

12 / 19

slide-82
SLIDE 82

Mean Payoff Games

Payoffs (ab)ω

a b c

  • 2
  • 1

2

  • 2
  • 1

3 Min Max

12 / 19

slide-83
SLIDE 83

Mean Payoff Games

Payoffs (ab)ω

a b c

  • 2
  • 1

2

  • 2
  • 1

3 Min Max

12 / 19

slide-84
SLIDE 84

Mean Payoff Games

Payoffs (ab)ω

a

a b c

  • 2
  • 1

2

  • 2
  • 1

3 Min Max

12 / 19

slide-85
SLIDE 85

Mean Payoff Games

Payoffs (ab)ω

a −2 − − →b − 2 1

a b c

  • 2
  • 1

2

  • 2
  • 1

3 Min Max

12 / 19

slide-86
SLIDE 86

Mean Payoff Games

Payoffs (ab)ω

a −2 − − →b +3 − − →a − 2 + 3 2

a b c

  • 2
  • 1

2

  • 2
  • 1

3 Min Max

12 / 19

slide-87
SLIDE 87

Mean Payoff Games

Payoffs (ab)ω

a −2 − − →b +3 − − →a −2 − − →b − 2 + 3 − 2 3

a b c

  • 2
  • 1

2

  • 2
  • 1

3 Min Max

12 / 19

slide-88
SLIDE 88

Mean Payoff Games

Payoffs (ab)ω

a −2 − − →b +3 − − →a −2 − − →b +3 − − →a − 2 + 3 − 2 + 3 4

a b c

  • 2
  • 1

2

  • 2
  • 1

3 Min Max

12 / 19

slide-89
SLIDE 89

Mean Payoff Games

Payoffs (ab)ω

a −2 − − →b +3 − − →a −2 − − →b +3 − − →a −2 − − →b − 2 + 3 − 2 + 3 − 2 5

a b c

  • 2
  • 1

2

  • 2
  • 1

3 Min Max

12 / 19

slide-90
SLIDE 90

Mean Payoff Games

Payoffs (ab)ω

a −2 − − →b +3 − − →a −2 − − →b +3 − − →a −2 − − →b +3 − − →a − 2 + 3 − 2 + 3 − 2 + 3 6

a b c

  • 2
  • 1

2

  • 2
  • 1

3 Min Max

12 / 19

slide-91
SLIDE 91

Mean Payoff Games

Payoffs (ab)ω Min pays 1

2 units to Max

a −2 − − →b +3 − − →a −2 − − →b +3 − − →a −2 − − →b +3 − − →a . . . − 2 + 3 − 2 + 3 − 2 + 3 6 ∼ n(−2+3)

2n

→ 1 2

a b c

  • 2
  • 1

2

  • 2
  • 1

3 Min Max

12 / 19

slide-92
SLIDE 92

Mean Payoff Games

Payoffs (ab)ω Min pays 1

2 units to Max

a −2 − − →b +3 − − →a −2 − − →b +3 − − →a −2 − − →b +3 − − →a . . . − 2 + 3 − 2 + 3 − 2 + 3 6 ∼ n(−2+3)

2n

→ 1 2

(acb)ω

a b c

  • 2
  • 1

2

  • 2
  • 1

3 Min Max

12 / 19

slide-93
SLIDE 93

Mean Payoff Games

Payoffs (ab)ω Min pays 1

2 units to Max

a −2 − − →b +3 − − →a −2 − − →b +3 − − →a −2 − − →b +3 − − →a . . . − 2 + 3 − 2 + 3 − 2 + 3 6 ∼ n(−2+3)

2n

→ 1 2

(acb)ω

a b c

  • 2
  • 1

2

  • 2
  • 1

3 Min Max

12 / 19

slide-94
SLIDE 94

Mean Payoff Games

Payoffs (ab)ω Min pays 1

2 units to Max

a −2 − − →b +3 − − →a −2 − − →b +3 − − →a −2 − − →b +3 − − →a . . . − 2 + 3 − 2 + 3 − 2 + 3 6 ∼ n(−2+3)

2n

→ 1 2

(acb)ω

a b c

  • 2
  • 1

2

  • 2
  • 1

3 Min Max

12 / 19

slide-95
SLIDE 95

Mean Payoff Games

Payoffs (ab)ω Min pays 1

2 units to Max

a −2 − − →b +3 − − →a −2 − − →b +3 − − →a −2 − − →b +3 − − →a . . . − 2 + 3 − 2 + 3 − 2 + 3 6 ∼ n(−2+3)

2n

→ 1 2

(acb)ω

a b c

  • 2
  • 1

2

  • 2
  • 1

3 Min Max

12 / 19

slide-96
SLIDE 96

Mean Payoff Games

Payoffs (ab)ω Min pays 1

2 units to Max

a −2 − − →b +3 − − →a −2 − − →b +3 − − →a −2 − − →b +3 − − →a . . . − 2 + 3 − 2 + 3 − 2 + 3 6 ∼ n(−2+3)

2n

→ 1 2

(acb)ω

a b c

  • 2
  • 1

2

  • 2
  • 1

3 Min Max

12 / 19

slide-97
SLIDE 97

Mean Payoff Games

Payoffs (ab)ω Min pays 1

2 units to Max

a −2 − − →b +3 − − →a −2 − − →b +3 − − →a −2 − − →b +3 − − →a . . . − 2 + 3 − 2 + 3 − 2 + 3 6 ∼ n(−2+3)

2n

→ 1 2

(acb)ω

a b c

  • 2
  • 1

2

  • 2
  • 1

3 Min Max

12 / 19

slide-98
SLIDE 98

Mean Payoff Games

Payoffs (ab)ω Min pays 1

2 units to Max

a −2 − − →b +3 − − →a −2 − − →b +3 − − →a −2 − − →b +3 − − →a . . . − 2 + 3 − 2 + 3 − 2 + 3 6 ∼ n(−2+3)

2n

→ 1 2

(acb)ω

a b c

  • 2
  • 1

2

  • 2
  • 1

3 Min Max

12 / 19

slide-99
SLIDE 99

Mean Payoff Games

Payoffs (ab)ω Min pays 1

2 units to Max

a −2 − − →b +3 − − →a −2 − − →b +3 − − →a −2 − − →b +3 − − →a . . . − 2 + 3 − 2 + 3 − 2 + 3 6 ∼ n(−2+3)

2n

→ 1 2

(acb)ω

a b c

  • 2
  • 1

2

  • 2
  • 1

3 Min Max

12 / 19

slide-100
SLIDE 100

Mean Payoff Games

Payoffs (ab)ω Min pays 1

2 units to Max

a −2 − − →b +3 − − →a −2 − − →b +3 − − →a −2 − − →b +3 − − →a . . . − 2 + 3 − 2 + 3 − 2 + 3 6 ∼ n(−2+3)

2n

→ 1 2

(acb)ω

a

a b c

  • 2
  • 1

2

  • 2
  • 1

3 Min Max

12 / 19

slide-101
SLIDE 101

Mean Payoff Games

Payoffs (ab)ω Min pays 1

2 units to Max

a −2 − − →b +3 − − →a −2 − − →b +3 − − →a −2 − − →b +3 − − →a . . . − 2 + 3 − 2 + 3 − 2 + 3 6 ∼ n(−2+3)

2n

→ 1 2

(acb)ω

a −1 − − →c − 1 1

a b c

  • 2
  • 1

2

  • 2
  • 1

3 Min Max

12 / 19

slide-102
SLIDE 102

Mean Payoff Games

Payoffs (ab)ω Min pays 1

2 units to Max

a −2 − − →b +3 − − →a −2 − − →b +3 − − →a −2 − − →b +3 − − →a . . . − 2 + 3 − 2 + 3 − 2 + 3 6 ∼ n(−2+3)

2n

→ 1 2

(acb)ω

a −1 − − →c −2 − − →b − 1 − 2 2

a b c

  • 2
  • 1

2

  • 2
  • 1

3 Min Max

12 / 19

slide-103
SLIDE 103

Mean Payoff Games

Payoffs (ab)ω Min pays 1

2 units to Max

a −2 − − →b +3 − − →a −2 − − →b +3 − − →a −2 − − →b +3 − − →a . . . − 2 + 3 − 2 + 3 − 2 + 3 6 ∼ n(−2+3)

2n

→ 1 2

(acb)ω

a −1 − − →c −2 − − →b +3 − − →a − 1 − 2 + 3 3

a b c

  • 2
  • 1

2

  • 2
  • 1

3 Min Max

12 / 19

slide-104
SLIDE 104

Mean Payoff Games

Payoffs (ab)ω Min pays 1

2 units to Max

a −2 − − →b +3 − − →a −2 − − →b +3 − − →a −2 − − →b +3 − − →a . . . − 2 + 3 − 2 + 3 − 2 + 3 6 ∼ n(−2+3)

2n

→ 1 2

(acb)ω

a −1 − − →c −2 − − →b +3 − − →a −1 − − →c − 1 − 2 + 3 − 1 4

a b c

  • 2
  • 1

2

  • 2
  • 1

3 Min Max

12 / 19

slide-105
SLIDE 105

Mean Payoff Games

Payoffs (ab)ω Min pays 1

2 units to Max

a −2 − − →b +3 − − →a −2 − − →b +3 − − →a −2 − − →b +3 − − →a . . . − 2 + 3 − 2 + 3 − 2 + 3 6 ∼ n(−2+3)

2n

→ 1 2

(acb)ω

a −1 − − →c −2 − − →b +3 − − →a −1 − − →c −2 − − →b − 1 − 2 + 3 − 1 − 2 5

a b c

  • 2
  • 1

2

  • 2
  • 1

3 Min Max

12 / 19

slide-106
SLIDE 106

Mean Payoff Games

Payoffs (ab)ω Min pays 1

2 units to Max

a −2 − − →b +3 − − →a −2 − − →b +3 − − →a −2 − − →b +3 − − →a . . . − 2 + 3 − 2 + 3 − 2 + 3 6 ∼ n(−2+3)

2n

→ 1 2

(acb)ω

a −1 − − →c −2 − − →b +3 − − →a −1 − − →c −2 − − →b +3 − − →a − 1 − 2 + 3 − 1 − 2 + 3 6

a b c

  • 2
  • 1

2

  • 2
  • 1

3 Min Max

12 / 19

slide-107
SLIDE 107

Mean Payoff Games

Payoffs (ab)ω Min pays 1

2 units to Max

a −2 − − →b +3 − − →a −2 − − →b +3 − − →a −2 − − →b +3 − − →a . . . − 2 + 3 − 2 + 3 − 2 + 3 6 ∼ n(−2+3)

2n

→ 1 2

(acb)ω Min pays −1

3 units to Max

a −1 − − →c −2 − − →b +3 − − →a −1 − − →c −2 − − →b +3 − − →a . . . − 1 − 2 + 3 − 1 − 2 + 3 6 ∼ n(−1−2+3)

3n

→ 0

a b c

  • 2
  • 1

2

  • 2
  • 1

3 Min Max

12 / 19

slide-108
SLIDE 108

Mean Payoff Games

Payoffs (ab)ω Min pays 1

2 units to Max

a −2 − − →b +3 − − →a −2 − − →b +3 − − →a −2 − − →b +3 − − →a . . . − 2 + 3 − 2 + 3 − 2 + 3 6 ∼ n(−2+3)

2n

→ 1 2

(acb)ω Min pays −1

3 units to Max

a −1 − − →c −2 − − →b +3 − − →a −1 − − →c −2 − − →b +3 − − →a . . . − 1 − 2 + 3 − 1 − 2 + 3 6 ∼ n(−1−2+3)

3n

→ 0 Min tries to minimize lim Max tries to maximize lim

a b c

  • 2
  • 1

2

  • 2
  • 1

3 Min Max

12 / 19

slide-109
SLIDE 109

Mean Payoff Games

Payoffs (ab)ω Min pays 1

2 units to Max

a −2 − − →b +3 − − →a −2 − − →b +3 − − →a −2 − − →b +3 − − →a . . . − 2 + 3 − 2 + 3 − 2 + 3 6 ∼ n(−2+3)

2n

→ 1 2

(acb)ω Min pays −1

3 units to Max

a −1 − − →c −2 − − →b +3 − − →a −1 − − →c −2 − − →b +3 − − →a . . . − 1 − 2 + 3 − 1 − 2 + 3 6 ∼ n(−1−2+3)

3n

→ 0 Generally Min tries to minimize lim sup Max tries to maximize lim inf

a b c

  • 2
  • 1

2

  • 2
  • 1

3 Min Max

12 / 19

slide-110
SLIDE 110

Mean Payoff Games

Questions

Does the game have a value? i.e. is there a v so that

Max can ensure lim inf ≥ v Min can ensure lim sup ≤ v

13 / 19

slide-111
SLIDE 111

Mean Payoff Games

Questions

Does the game have a value? i.e. is there a v so that

Max can ensure lim inf ≥ v Min can ensure lim sup ≤ v

Yes How to compute the optimal strategies?

13 / 19

slide-112
SLIDE 112

Mean Payoff Games

Questions

Does the game have a value? i.e. is there a v so that

Max can ensure lim inf ≥ v Min can ensure lim sup ≤ v

Yes How to compute the optimal strategies? Solution using the finite game

13 / 19

slide-113
SLIDE 113

Mean Payoff

Finite Game

a b c c b

  • 1
  • 1

3 1 2

a b c

  • 2
  • 1

2

  • 2
  • 1

3 Min Max

14 / 19

slide-114
SLIDE 114

Mean Payoff

Finite Game

a b c

  • 1

3 1 2

c

  • 1

b

  • 1
  • 1

3 1 2

a b c

  • 2
  • 1

2

  • 2
  • 1

3 Min Max

14 / 19

slide-115
SLIDE 115

Mean Payoff

Finite Game

a b c

  • 1

3 1 2

c

  • 1

b

  • 1
  • 1

3 1 2

a b c

  • 2
  • 1

2

  • 2
  • 1

3 Min Max

14 / 19

slide-116
SLIDE 116

Mean Payoff

Finite Game

a b c c b

  • 1
  • 1

3 1 2

a b c

  • 2
  • 1

2

  • 2
  • 1

3 Min Max

14 / 19

slide-117
SLIDE 117

Mean Payoff

Finite Game

a b c c b

  • 1
  • 1

3 1 2

a b c

  • 2
  • 1

2

  • 2
  • 1

3 Min Max

14 / 19

slide-118
SLIDE 118

Mean Payoff

Finite Game

a b c c b

  • 1
  • 1

3 1 2

a b c

  • 2
  • 1

2

  • 2
  • 1

3 Min Max

Max can ensure ≥ 0 in the finite game

14 / 19

slide-119
SLIDE 119

Mean Payoff

Finite Game

a b c c b

  • 1
  • 1

3 1 2

a b c

  • 2
  • 1

2

  • 2
  • 1

3 Min Max

14 / 19

slide-120
SLIDE 120

Mean Payoff

Finite Game

a b c c b

  • 1
  • 1

3 1 2

a b c

  • 2
  • 1

2

  • 2
  • 1

3 Min Max

Min can ensure ≤ 0

14 / 19

slide-121
SLIDE 121

Mean Payoff

Finite Game

a b c c b

  • 1
  • 1

3 1 2

a b c

  • 2
  • 1

2

  • 2
  • 1

3 Min Max

Min can ensure ≤ 0 in the mean payoff game too

14 / 19

slide-122
SLIDE 122

Mean Payoff

Finite Game

a b c c b

  • 1
  • 1

3 1 2

a b c

  • 2
  • 1

2

  • 2
  • 1

3 Min Max

Min can ensure ≤ 0 in the mean payoff game too π = a Stack = a

14 / 19

slide-123
SLIDE 123

Mean Payoff

Finite Game

a b c c b

  • 1
  • 1

3 1 2

a b c

  • 2
  • 1

2

  • 2
  • 1

3 Min Max

Min can ensure ≤ 0 in the mean payoff game too π = a b Stack = a b

14 / 19

slide-124
SLIDE 124

Mean Payoff

Finite Game

a b c c b

  • 1
  • 1

3 1 2

a b c

  • 2
  • 1

2

  • 2
  • 1

3 Min Max

Min can ensure ≤ 0 in the mean payoff game too π = a b c Stack = a b c

14 / 19

slide-125
SLIDE 125

Mean Payoff

Finite Game

a b c c b

  • 1
  • 1

3 1 2

a b c

  • 2
  • 1

2

  • 2
  • 1

3 Min Max

Min can ensure ≤ 0 in the mean payoff game too π = a b c b Stack = a b c b

14 / 19

slide-126
SLIDE 126

Mean Payoff

Finite Game

a b c c b

  • 1
  • 1

3 1 2

a b c

  • 2
  • 1

2

  • 2
  • 1

3 Min Max

Min can ensure ≤ 0 in the mean payoff game too π = a b c b Stack = a b

14 / 19

slide-127
SLIDE 127

Mean Payoff

Finite Game

a b c c b

  • 1
  • 1

3 1 2

a b c

  • 2
  • 1

2

  • 2
  • 1

3 Min Max

Min can ensure ≤ 0 in the mean payoff game too π = a b c b c Stack = a b c

14 / 19

slide-128
SLIDE 128

Mean Payoff

Finite Game

a b c c b

  • 1
  • 1

3 1 2

a b c

  • 2
  • 1

2

  • 2
  • 1

3 Min Max

Min can ensure ≤ 0 in the mean payoff game too π = a b c b c a Stack = a b c a

14 / 19

slide-129
SLIDE 129

Mean Payoff

Finite Game

a b c c b

  • 1
  • 1

3 1 2

a b c

  • 2
  • 1

2

  • 2
  • 1

3 Min Max

Min can ensure ≤ 0 in the mean payoff game too π = a b c b c a Stack = a

14 / 19

slide-130
SLIDE 130

Mean Payoff

Finite Game

a b c c b

  • 1
  • 1

3 1 2

a b c

  • 2
  • 1

2

  • 2
  • 1

3 Min Max

Min can ensure ≤ 0 in the mean payoff game too π = a b c b c a b Stack = a b

14 / 19

slide-131
SLIDE 131

Mean Payoff

Finite Game

a b c c b

  • 1
  • 1

3 1 2

a b c

  • 2
  • 1

2

  • 2
  • 1

3 Min Max

Min can ensure ≤ 0 in the mean payoff game too π = a b c b c a b c Stack = a b c

14 / 19

slide-132
SLIDE 132

Mean Payoff

Finite Game

a b c c b

  • 1
  • 1

3 1 2

a b c

  • 2
  • 1

2

  • 2
  • 1

3 Min Max

Min can ensure ≤ 0 in the mean payoff game too π = a b c b c a b c a Stack = a b c a Every time a cycle with average value ≤ 0 is eliminated

14 / 19

slide-133
SLIDE 133

Mean Payoff

Finite Game

a b c c b

  • 1
  • 1

3 1 2

a b c

  • 2
  • 1

2

  • 2
  • 1

3 Min Max

Min can ensure ≤ 0 in the mean payoff game too π = a b c b c a b c a Stack = a Every time a cycle with average value ≤ 0 is eliminated

14 / 19

slide-134
SLIDE 134

Mean Payoff

Finite Game

a b c c b

  • 1
  • 1

3 1 2

a b c

  • 2
  • 1

2

  • 2
  • 1

3 Min Max

Min can ensure ≤ 0 in the mean payoff game too π = a b c b c a b c a Stack = a Hence limsup of averages of π is ≤ 0

14 / 19

slide-135
SLIDE 135

Mean Payoff

Finite Game

a b c c b

  • 1
  • 1

3 1 2

a b c

  • 2
  • 1

2

  • 2
  • 1

3 Min Max

Max can ensure ≥ 0 Similarly Max can ensure liminf

  • f the average is ≥ 0

Hence the value of Mean payoff game is 0

14 / 19

slide-136
SLIDE 136

Outline

Finite Duration Games Infinite Duration Games Simple Stochastic Games

15 / 19

slide-137
SLIDE 137

Simple Stochastic Game

1 2 1 2 1 2 1 2 16 / 19

slide-138
SLIDE 138

Simple Stochastic Game

1 2 1 2 1 2 1 2 16 / 19

slide-139
SLIDE 139

Simple Stochastic Game

1 2 1 2 1 2 1 2 16 / 19

slide-140
SLIDE 140

Simple Stochastic Game

1 2 1 2 1 2 1 2 16 / 19

slide-141
SLIDE 141

Simple Stochastic Game

1 2 1 2 1 2 1 2 16 / 19

slide-142
SLIDE 142

Simple Stochastic Game

Circle Wins

1 2 1 2 1 2 1 2 16 / 19

slide-143
SLIDE 143

Simple Stochastic Game

1 2 1 2 1 2 1 2 16 / 19

slide-144
SLIDE 144

Simple Stochastic Game

1 2 1 2 1 2 1 2 16 / 19

slide-145
SLIDE 145

Simple Stochastic Game

Box Wins

1 2 1 2 1 2 1 2 16 / 19

slide-146
SLIDE 146

Simple Stochastic Game

Or

1 2 1 2 1 2 1 2 16 / 19

slide-147
SLIDE 147

Simple Stochastic Game

1 2 1 2 1 2 1 2 16 / 19

slide-148
SLIDE 148

Simple Stochastic Game

1 2 1 2 1 2 1 2 16 / 19

slide-149
SLIDE 149

Simple Stochastic Game

1 2 1 2 1 2 1 2 16 / 19

slide-150
SLIDE 150

Simple Stochastic Game

1 2 1 2 1 2 1 2 16 / 19

slide-151
SLIDE 151

Simple Stochastic Game

1 2 1 2 1 2 1 2 16 / 19

slide-152
SLIDE 152

Simple Stochastic Game

1 2 1 2 1 2 1 2 16 / 19

slide-153
SLIDE 153

Simple Stochastic Game

1 2 1 2 1 2 1 2 16 / 19

slide-154
SLIDE 154

Simple Stochastic Game

1 2 1 2 1 2 1 2 16 / 19

slide-155
SLIDE 155

Simple Stochastic Game

Circle can win from with probability 1

1 2 1 2 1 2 1 2 16 / 19

slide-156
SLIDE 156

Simple Stochastic Game

Values

1 1

1 2

1

1 2 1 2 1 2 1 2 17 / 19

slide-157
SLIDE 157

Simple Stochastic Game

Values

1 1

1 2 1 2

1

1 2 1 2 1 2 1 2 17 / 19

slide-158
SLIDE 158

Simple Stochastic Game

Values

v( ) = 1 v( ) = 0 v( ) = 1 2(v( ) + v( )) v( ) = 1 2(v( ) + v( )) v( ) = max{v( ), v( )} v( ) = min{v( ), v( )}

1 1

1 2 1 2

1

1 2 1 2 1 2 1 2 17 / 19

slide-159
SLIDE 159

Simple Stochastic Game

Values

v( ) = 1 v( ) = 0 v( ) = 1 2(v( ) + v( )) v( ) = 1 2(v( ) + v( )) v( ) = max{v( ), v( )} v( ) = min{v( ), v( )}

1 1

1 2 1 2

1

1 2 1 2 1 2 1 2

These equations have a unique solution.

17 / 19

slide-160
SLIDE 160

Simple Stochastic Game

Values

v( ) = 1 v( ) = 0 v( ) = 1 2(v( ) + v( )) v( ) = 1 2(v( ) + v( )) v( ) = max{v( ), v( )} v( ) = min{v( ), v( )}

1 1

1 2 1 2

1

1 2 1 2 1 2 1 2

These equations have a unique solution. From state s - has a strategy to reach with probability ≥ v(s) has a strategy to reach with probability ≥ 1 − v(s)

17 / 19

slide-161
SLIDE 161

Complexity of solving games

Does Even win the Parity Game? Is the value of the Mean Payoff Game ≥ 0 Is the value in the Simple Stochasic Game ≥ 1

2

NP∩coNP

2

18 / 19

slide-162
SLIDE 162

Complexity of solving games

Does Even win the Parity Game? Is the value of the Mean Payoff Game ≥ 0 Is the value in the Simple Stochasic Game ≥ 1

2

NP∩coNP

2

18 / 19

slide-163
SLIDE 163

Complexity of solving games

Does Even win the Parity Game? Is the value of the Mean Payoff Game ≥ 0 Is the value in the Simple Stochasic Game ≥ 1

2

NP∩coNP

2

2Chatterjee and Fijalkow, “A reduction from parity games to simple stochastic games”. 18 / 19

slide-164
SLIDE 164

Complexity of solving games

Does Even win the Parity Game? Is the value of the Mean Payoff Game ≥ 0 Is the value in the Simple Stochasic Game ≥ 1

2

NP∩coNP

2

Open Problem

Is there a polynomial time algorithm for any of them?

2Chatterjee and Fijalkow, “A reduction from parity games to simple stochastic games”. 18 / 19

slide-165
SLIDE 165

Timeline

Lloyd S. Shapley. “Stochastic games”. In: Proceedings of the National Academy of Sciences 39.10 (1953), pp. 1095–1100 E.A. Emerson and C.S. Jutla. “Tree automata, mu-calculus and determinacy”. In: IEEE Comput. Soc. Press, 1991, pp. 368–377 Anne Condon. “The complexity of stochastic games”. In: Information and Computation 96.2 (Feb. 1992), pp. 203–224 Uri Zwick and Mike Paterson. “The complexity of mean payoff games

  • n graphs”.

In: Theoretical Computer Science 158.1 (May 1996),

  • pp. 343–359

Marcin Jurdziski. “Deciding the winner in parity games is in UP co-UP”. . In: Information Processing Letters 68.3 (1998), pp. 119–124

19 / 19

slide-166
SLIDE 166

Timeline

Lloyd S. Shapley. “Stochastic games”. In: Proceedings of the National Academy of Sciences 39.10 (1953), pp. 1095–1100 E.A. Emerson and C.S. Jutla. “Tree automata, mu-calculus and determinacy”. In: IEEE Comput. Soc. Press, 1991, pp. 368–377 Anne Condon. “The complexity of stochastic games”. In: Information and Computation 96.2 (Feb. 1992), pp. 203–224 Uri Zwick and Mike Paterson. “The complexity of mean payoff games

  • n graphs”.

In: Theoretical Computer Science 158.1 (May 1996),

  • pp. 343–359

Marcin Jurdziski. “Deciding the winner in parity games is in UP co-UP”. . In: Information Processing Letters 68.3 (1998), pp. 119–124

Thank you

19 / 19