Comparators for Quantitative games BY Suguman Bansal Swarat - - PowerPoint PPT Presentation

β–Ά
comparators for quantitative games
SMART_READER_LITE
LIVE PREVIEW

Comparators for Quantitative games BY Suguman Bansal Swarat - - PowerPoint PPT Presentation

Comparators for Quantitative games BY Suguman Bansal Swarat Chaudhuri Moshe Y. Vardi Dagstuhl Seminar March 16, 2017 Repeated games Infinitely many rounds of a base game Each agent receives a reward in every round Reward of


slide-1
SLIDE 1

Comparators for Quantitative games

BY

Suguman Bansal Swarat Chaudhuri Moshe Y. Vardi Dagstuhl Seminar March 16, 2017

slide-2
SLIDE 2

Repeated games

  • Infinitely many rounds of a base

game

  • Each agent receives a reward in

every round

  • Reward of each agent from game is

an aggregation of its rewards from every round

  • Discounted-sum aggregation
  • Mean-payoff aggregation . . .

3/16/17 Comparators for quantitative games 2

slide-3
SLIDE 3

Automated reasoning about repeated games

  • Existence of rational behavior in a repeated game
  • Find one rational behavior in repeated games
  • Find all rational behaviors in a repeated game
  • Finite representation
  • What properties hold in repeated game if agents behave rationally
  • Properties of rational behaviors in repeated games

3/16/17 Comparators for quantitative games 3

slide-4
SLIDE 4

Strategy in the Repeated games [Rubinstein 1986]

  • Finite state machines
  • Strategies is defined for an agent
  • All other agents comprise of environment
  • Transitions denote one round of repeated

game w.r.t. agent

  • Transition on (agent-Action, envt-Action)
  • Weight on transition is Reward of agent for

that transition

3/16/17 Algorithmic analysis of Regular repeated games 4

𝐷, 𝐸 , 0 𝐷, 𝐷 , 2 𝐸, 𝐷 , 3 𝐸, 𝐸 , 1 𝑑) 𝑑*

Tit-for-tat strategy in Repeated games

slide-5
SLIDE 5

Game executions [Rubinstein 1986]

3/16/17 Algorithmic analysis of Regular repeated games 5

𝑏, 𝑐 , 2 𝑐, 𝑏 , 3 𝑏, 𝑐 , (2, 3)

S1 S2 (S1, S2)

𝐷, 𝐸 , 0 𝐷, 𝐷 , 2 𝐸, 𝐷 , 3 𝐸, 𝐸 , 1 𝑑) 𝑑* 𝐷, 𝐸 , 0 𝐷, 𝐷 , 2 𝐸, 𝐷 , 3 𝐸, 𝐸 , 1 𝑑) 𝑑*

Γ—

𝐷, 𝐷 , (2, 2) 𝐷, 𝐷 , (2, 2) 𝐷, 𝐷 , (2, 2) 𝐷, 𝐷 , (2, 2)

Synchronize on all agent actions

slide-6
SLIDE 6

Strategies are Trees

3/16/17 Comparators for quantitative games 6

𝐷, 𝐸 , 0 𝐷, 𝐷 , 2 𝐸, 𝐷 , 3 𝐸, 𝐸 , 1 𝑑) 𝑑* 𝑑) 𝑑) 𝑑* 𝑑) 𝑑* 𝑑) 𝑑*

slide-7
SLIDE 7

Strategies Tree and their compositions

  • Every player has a regular set of strategies
  • Weighted tree automata
  • Composition of agent strategies results in regular set of game

executions

  • Compare rewards of player along different game executions

3/16/17 Comparators for quantitative games 7

slide-8
SLIDE 8

Comparing rewards

  • Reward of agent in a game execution
  • Reward sequence of agent 𝑇 = (𝑑2, 𝑑), … )
  • 𝑑4 is reward received in the i-th round of a game execution
  • Aggregate function 𝑔 ∢ β„• 8 β†’ ℝ
  • Reward of agent is 𝑔 𝑇
  • Which game execution results in a greater reward?
  • Reward sequences 𝐡 and 𝐢 on different game executions
  • Is 𝑔 𝐡 ≀ 𝑔(𝐢) ?

3/16/17 Comparators for quantitative games 8

slide-9
SLIDE 9

Outline of the talk ...

  • Introduce Comparator [Bansal, Chaudhuri, Vardi (under submission)]
  • A novel automata-theoretic technique to compare the aggregate of reward

sequences

  • Comparator for discounted-sum aggregate function
  • Applications of Comparators [Ongoing work]

3/16/17 Comparators for quantitative games 9

slide-10
SLIDE 10

Comparator

  • Comparator for aggregate function 𝑔: β„• 8 β†’ ℝ is a BΓΌchi automaton
  • Pair of bounded reward sequences 𝐡, 𝐢 is a word of the comparator

for 𝑔: β„• 8 β†’ ℝ iff 𝑔 𝐡 ≀ 𝑔(𝐢)

  • Comparator for Limsup aggregate function
  • Limsup of sequence of natural numbers is the largest number that occurs

infinitely often

3/16/17 Comparators for quantitative games 10

slide-11
SLIDE 11

Discounted-sum Comparator

  • Discounted sum of reward sequence 𝑆 with discount factor 𝑒 > 1 is

𝐸𝑇C 𝑆 = 𝑠2 + 𝑠

)

𝑒 + 𝑠* 𝑒* …

  • Discounted-sum Comparator with discount-factor 𝑒 > 1
  • Pair of bounded reward sequences 𝐡, 𝐢 is a word of the DS Comparator

with discount-factor 𝑒 iff 𝐸𝑇C 𝐡 ≀ 𝐸𝑇C 𝐢

3/16/17 Comparators for quantitative games 11

slide-12
SLIDE 12

DS Comparator : Core Insight – I

  • Sequence 𝐡 = (𝑏2, 𝑏), 𝑏* … )
  • Discount factor 𝑒 > 1
  • 𝐸𝑇C 𝐡 = 𝑏2 +

FG C + FH CH + β‹―

= 𝑏2. 𝑏)𝑏* … C = 𝐡C

  • Use lexicographic ordering of sequences
  • Works only if 𝑏4 ≀ 𝑒 βˆ’ 1 for all 𝑗 β‰₯ 0
  • Further difficulty when discount-factor is non-integeral

[Akiyama, Frougny, Sakarovitch, IJoM 2008]

Number in base 𝑒

[Chaudhuri, Sankaranarayanan, Vardi , LICS 2013]

3/16/17 Comparators for quantitative games 12

slide-13
SLIDE 13

DS Comparator : Core Insight – II

  • Sequence 𝐡 = (𝑏2, 𝑏), 𝑏* … )
  • Discount factor 𝑒 > 1
  • 𝐸𝑇C(𝐡) = 𝑏2 +

FG C + FH CH + β‹―

= 𝑏2. 𝑏)𝑏* … C = 𝐡C

  • 𝐸𝑇C 𝐡 ≀ 𝐸𝑇C(𝐢) iff 𝐡C ≀ 𝐢C
  • Find 𝐷 = (𝑑2, 𝑑), 𝑑*, … ), such that
  • 𝐸𝑇C 𝐷 = 𝐷C β‰₯ 0
  • 𝐡C + 𝐷C = 𝐢C

Arithmetic in base 𝑒

3/16/17 Comparators for quantitative games 13

slide-14
SLIDE 14

DS Comparator : Core Insight – II (cont..)

  • Consider (𝑒 = 10)

3/16/17 Comparators for quantitative games 14

A 5 13 6 0 0 0 …. C + 0 8 6 0 0 0 …. B 7 2 2 0 0 0 …. X 2 1 0 0 0 0 ….

=> 𝐸𝑇C 𝐡 + 𝐸𝑇C 𝐷 = 𝐸𝑇C(𝐢)

𝑗 = 0, 𝑏2 + 𝑑2 + 𝑦2 = 𝑐2 𝑗 > 0, 𝑏4 + 𝑑4 + 𝑦4 = 𝑐4 + 𝑒 β‹… 𝑦 4l)

slide-15
SLIDE 15

DS Comparator : Finite-state memory?

  • Upper bound 𝜈 on reward sequences, 𝑒 =

n

  • > 1 is discount factor
  • For all bounded reward sequences 𝐡, 𝐢
  • Can find sequences 𝐷 and π‘Œ that satisfy equations
  • 0 ≀ 𝑑4 ≀ 𝜈 β‹…

C Cl) , where is 𝑑4 of the form r

  • for integer 𝑛
  • 𝑦4 ≀ 1 +

t Cl), where is 𝑦4 of the form r

  • for integer 𝑛
  • Finitely many possibilties for 𝑦4, 𝑑

u pairs

3/16/17 Comparators for quantitative games 15

slide-16
SLIDE 16

DS Comparator : Construction

𝑏2 + 𝑦2 + 𝑑2 = 𝑐2

𝑏4 + 𝑑4 + 𝑦4 = 𝑐4 + 𝑒 β‹… 𝑦 4l)

3/16/17 Comparators for quantitative games 16

start 𝑦2, 𝑑2 𝑏2, 𝑐2 𝑦 4l) , 𝑑 4l) 𝑦4, 𝑑4 𝑏4, 𝑐4

Automaton accepts (𝐡, 𝐢) iff 𝐸𝑇C(𝐡) ≀ 𝐸𝑇C(𝐢)

slide-17
SLIDE 17

So far ...

ΓΌ Introduce Comparator [Bansal, Chaudhuri, Vardi (under review)]

ΓΌ A novel automata-theoretic technique to compare the aggregate of reward sequences ΓΌ Comparator for discounted-sum aggregate function

  • Applications of Comparators

3/16/17 Comparators for quantitative games 17

slide-18
SLIDE 18

Application in quantitative graph games

  • Graph game with vertices π‘Š

2, π‘Š ) and edges 𝐹

  • From vertex 𝑀 ∈ π‘Š

4, agent 𝑄4 picks the next vertex

  • Each agent receives a reward in every vertex
  • Reward of agent from game is given by aggregation over its reward

sequence

  • Objective of every player is to receive greater reward than the other

player

  • Find a winning strategy for player 𝑄4 ?

3/16/17 Comparators for quantitative games 18

slide-19
SLIDE 19

Quantitative graph game – Solution

If aggregate function 𝑔 ∢ β„• 8 β†’ ℝ has a comparator

  • Objective of agent is an πœ•-regular objective
  • We know how to solve graph games with πœ•-regular objectives!
  • Leverages algorithms from qualitative domain
  • Quantitative objective converted to qualitative objective
  • Generic solutions for aggregate functions
  • As long as their comparator exists

3/16/17 Comparators for quantitative games 19

slide-20
SLIDE 20

Comparators for repeated games ?

  • Finite-state representations for strategies in repeated games [Rubinstein

1986]

  • Regular set of strategies for each agent
  • Finite-state, weighted-tree automata-based representation for strategies
  • Finite-state representations of rational behaviors in this repeated

game

  • Use comparator

3/16/17 Comparators for quantitative games 20

slide-21
SLIDE 21

Summary

  • Introduced the idea of Comparators for qualitatively comparison of

aggregate of reward sequences

  • Comparators have applications in games and beyond
  • Quantitative graph games
  • Discounted-sum inclusion problem
  • Generic algorithms for aggregate functions with comparators
  • Leverage algorithms and heuristics from qualitative analysis
  • Potential to use comparators to reason about repeated games

3/16/17 Comparators for quantitative games 21