Comparators for Quantitative games
BY
Suguman Bansal Swarat Chaudhuri Moshe Y. Vardi Dagstuhl Seminar March 16, 2017
Comparators for Quantitative games BY Suguman Bansal Swarat - - PowerPoint PPT Presentation
Comparators for Quantitative games BY Suguman Bansal Swarat Chaudhuri Moshe Y. Vardi Dagstuhl Seminar March 16, 2017 Repeated games Infinitely many rounds of a base game Each agent receives a reward in every round Reward of
BY
Suguman Bansal Swarat Chaudhuri Moshe Y. Vardi Dagstuhl Seminar March 16, 2017
game
every round
an aggregation of its rewards from every round
3/16/17 Comparators for quantitative games 2
3/16/17 Comparators for quantitative games 3
game w.r.t. agent
that transition
3/16/17 Algorithmic analysis of Regular repeated games 4
π·, πΈ , 0 π·, π· , 2 πΈ, π· , 3 πΈ, πΈ , 1 π‘) π‘*
Tit-for-tat strategy in Repeated games
3/16/17 Algorithmic analysis of Regular repeated games 5
π, π , 2 π, π , 3 π, π , (2, 3)
S1 S2 (S1, S2)
π·, πΈ , 0 π·, π· , 2 πΈ, π· , 3 πΈ, πΈ , 1 π‘) π‘* π·, πΈ , 0 π·, π· , 2 πΈ, π· , 3 πΈ, πΈ , 1 π‘) π‘*
π·, π· , (2, 2) π·, π· , (2, 2) π·, π· , (2, 2) π·, π· , (2, 2)
Synchronize on all agent actions
3/16/17 Comparators for quantitative games 6
π·, πΈ , 0 π·, π· , 2 πΈ, π· , 3 πΈ, πΈ , 1 π‘) π‘* π‘) π‘) π‘* π‘) π‘* π‘) π‘*
executions
3/16/17 Comparators for quantitative games 7
3/16/17 Comparators for quantitative games 8
sequences
3/16/17 Comparators for quantitative games 9
for π: β 8 β β iff π π΅ β€ π(πΆ)
infinitely often
3/16/17 Comparators for quantitative games 10
πΈπC π = π 2 + π
)
π + π * π* β¦
with discount-factor π iff πΈπC π΅ β€ πΈπC πΆ
3/16/17 Comparators for quantitative games 11
FG C + FH CH + β―
= π2. π)π* β¦ C = π΅C
[Akiyama, Frougny, Sakarovitch, IJoM 2008]
Number in base π
[Chaudhuri, Sankaranarayanan, Vardi , LICS 2013]
3/16/17 Comparators for quantitative games 12
FG C + FH CH + β―
= π2. π)π* β¦ C = π΅C
Arithmetic in base π
3/16/17 Comparators for quantitative games 13
3/16/17 Comparators for quantitative games 14
A 5 13 6 0 0 0 β¦. C + 0 8 6 0 0 0 β¦. B 7 2 2 0 0 0 β¦. X 2 1 0 0 0 0 β¦.
π = 0, π2 + π2 + π¦2 = π2 π > 0, π4 + π4 + π¦4 = π4 + π β π¦ 4l)
n
C Cl) , where is π4 of the form r
t Cl), where is π¦4 of the form r
u pairs
3/16/17 Comparators for quantitative games 15
π2 + π¦2 + π2 = π2
π4 + π4 + π¦4 = π4 + π β π¦ 4l)
3/16/17 Comparators for quantitative games 16
start π¦2, π2 π2, π2 π¦ 4l) , π 4l) π¦4, π4 π4, π4
Automaton accepts (π΅, πΆ) iff πΈπC(π΅) β€ πΈπC(πΆ)
ΓΌ Introduce Comparator [Bansal, Chaudhuri, Vardi (under review)]
ΓΌ A novel automata-theoretic technique to compare the aggregate of reward sequences ΓΌ Comparator for discounted-sum aggregate function
3/16/17 Comparators for quantitative games 17
2, π ) and edges πΉ
4, agent π4 picks the next vertex
sequence
player
3/16/17 Comparators for quantitative games 18
If aggregate function π βΆ β 8 β β has a comparator
3/16/17 Comparators for quantitative games 19
1986]
game
3/16/17 Comparators for quantitative games 20
aggregate of reward sequences
3/16/17 Comparators for quantitative games 21