Learning Strategies in Game- Theoretic Data Interaction
Ben McCamish, Arash Termehchy, Behrouz Touri, Liang Huang Information & Data Management and Analytics Laboratory (IDEA)
1
Learning Strategies in Game- Theoretic Data Interaction Ben - - PowerPoint PPT Presentation
Learning Strategies in Game- Theoretic Data Interaction Ben McCamish, Arash Termehchy, Behrouz Touri, Liang Huang I nformation & D ata Manag e ment and A nalytics Laboratory (IDEA) 1 Querying a database of student grades Grades First_Name
Ben McCamish, Arash Termehchy, Behrouz Touri, Liang Huang Information & Data Management and Analytics Laboratory (IDEA)
1
wish to find in the database
communicate their intent
Results
First_Name Last_Name Dept. Grade
Grades
First_Name Last_Name Dept. Grade
… … … …
Sarah Smith CE A John Smith EE B Kerry Smith CS D
… … … …
2
student Kerry Smith
database content and structure
Results
First_Name Last_Name Dept. Grade
Grades
First_Name Last_Name Dept. Grade
… … … …
Sarah Smith CE A John Smith EE B Kerry Smith CS D
… … … …
3
Results
First_Name Last_Name Dept. Grade
Grades
First_Name Last_Name Dept. Grade
… … … …
Sarah Smith CE A John Smith EE B Kerry Smith CS D
… … … …
4
matching query, mostly non-relevant.
Results
First_Name Last_Name Dept. Grade
Grades
First_Name Last_Name Dept. Grade
… … … …
Sarah Smith CE A John Smith EE B Kerry Smith CS D
… … … …
5
subset of matching tuples
Results
First_Name Last_Name Dept. Grade
Sarah Smith CE A John Smith EE B
Grades
First_Name Last_Name Dept. Grade
… … … …
Sarah Smith CE A John Smith EE B Kerry Smith CS D
… … … …
6
she is looking for
7
Results
First_Name Last_Name Dept. Grade
Sarah Smith CE A John Smith EE B
Grades
First_Name Last_Name Dept. Grade
… … … …
Sarah Smith CE A John Smith EE B Kerry Smith CS D
… … … …
7
Results
First_Name Last_Name Dept. Grade
Grades
First_Name Last_Name Dept. Grade
… … … …
Sarah Smith CE A John Smith EE B Kerry Smith CS D
… … … …
database and it’s content.
“Smith” and is in the Department “CS”
more accurately
8
Results
First_Name Last_Name Dept. Grade
Grades
First_Name Last_Name Dept. Grade
… … … …
Sarah Smith CE A John Smith EE B Kerry Smith CS D
… … … …
tuple
9
desired tuple
Results
First_Name Last_Name Dept. Grade
Kerry Smith CS D
Grades
First_Name Last_Name Dept. Grade
… … … …
Sarah Smith CE A John Smith EE B Kerry Smith CS D
… … … …
10
11
allowed the user to find the desired student
Results
First_Name Last_Name Dept. Grade
Kerry Smith CS D
Grades
First_Name Last_Name Dept. Grade
… … … …
Sarah Smith CE A John Smith EE B Kerry Smith CS D
… … … …
11
student Kerry Smith
Results
First_Name Last_Name Dept. Grade
Grades
First_Name Last_Name Dept. Grade
… … … …
Sarah Smith CE A John Smith EE B Kerry Smith CS D
… … … …
12
matching query
Results
First_Name Last_Name Dept. Grade
Grades
First_Name Last_Name Dept. Grade
… … … …
Sarah Smith CE A John Smith EE B Kerry Smith CS D
… … … …
13
return Kerry Smith in CS department
Results
First_Name Last_Name Dept. Grade
Kerry Smith CS D
Grades
First_Name Last_Name Dept. Grade
… … … …
Sarah Smith CE A John Smith EE B Kerry Smith CS D
… … … …
14
15
Grades
First_Name Last_Name Dept. Grade
… … … …
Sarah Smith CE A John Smith EE B Kerry Smith CS D
… … … …
Results
First_Name Last_Name Dept. Grade
Kerry Smith CS D
15
information
16
Intent # Intent e1
John Smith in EE
e2
Sarah Smith in CE
e3
Kerry Smith in CS
Query # Query q1
“Smith CE”
q2
“Smith”
User Strategy (U)
q1 q2 e1 1 e2 1 e3 1
Grades
First_Name Last_Name Dept. Grade
… … … …
Sarah Smith CE A John Smith EE B Kerry Smith CS D
… … … …
17
from intents to queries.
Query # Query q1
“Smith CE”
q2
“Smith”
User Strategy (U)
q1 q2 e1 1 e2 1 e3 1 Intent # Intent e1
John Smith in EE
e2
Sarah Smith in CE
e3
Kerry Smith in CS
Grades
First_Name Last_Name Dept. Grade
… … … …
Sarah Smith CE A John Smith EE B Kerry Smith CS D
… … … …
18
saving time, …
exact intent behind the query.
Database Strategy (D)
e1 e2 e3 q1 1 q2 0.5 0.5 Intent # Intent e1
ans(y)← Grades(x,’Smith’, ‘EE’, y)
e2
ans(y)← Grades(x,’Smith’, ‘CE’, y)
e3
ans(y)← Grades(x,’Smith’, ‘CS’, y)
Query # Query q1
“Smith CE”
q2
“Smith”
Grades
First_Name Last_Name Dept. Grade
… … … …
Sarah Smith CE A John Smith EE B Kerry Smith CS D
… … … …
19
Sarah Smith in CE
from queries to intents
r(U, D) =
m
X
i=1
πi
n
X
j=1
Uij
`=1
Dj` prec(ei, e`)
Database Strategy (D)
e1 e2 e3 q1 1 q2 0.5 0.5
User Strategy (U)
q1 q2 e1 1 e2 1 e3 1
20
Intent # Intent e1
John Smith in EE
e2
Sarah Smith in CE
e3
Kerry Smith in CS
Query # Query q1
“Smith CE”
q2
“Smith”
r(U, D) =
m
X
i=1
πi
n
X
j=1
Uij
`=1
Dj` prec(ei, e`)
21
Intent # Intent e1
John Smith in EE
e2
Sarah Smith in CE
e3
Kerry Smith in CS
Query # Query q1
“Smith CE”
q2
“Smith”
User Strategy (U)
q1 q2 e1 1 e2 1 e3 1
Database Strategy (D)
e1 e2 e3 q1 1 q2 0.5 0.5
r(U, D) =
m
X
i=1
πi
n
X
j=1
Uij
`=1
Dj` prec(ei, e`)
22
Intent # Intent e1
John Smith in EE
e2
Sarah Smith in CE
e3
Kerry Smith in CS
Query # Query q1
“Smith CE”
q2
“Smith”
User Strategy (U)
q1 q2 e1 1 e2 1 e3 1
Database Strategy (D)
e1 e2 e3 q1 1 q2 0.5 0.5
r(U, D) =
m
X
i=1
πi
n
X
j=1
Uij
`=1
Dj` prec(ei, e`)
23
Intent # Intent e1
John Smith in EE
e2
Sarah Smith in CE
e3
Kerry Smith in CS
Query # Query q1
“Smith CE”
q2
“Smith”
User Strategy (U)
q1 q2 e1 1 e2 1 e3 1
Database Strategy (D)
e1 e2 e3 q1 1 q2 0.5 0.5
1.Learning may not converge or converge to a desired equilibrium in games, e.g., Shapely game.
24
increase its payoff by unilaterally deviating from the current strategy
User Strategy (U)
q1 q2 e1 1 e2 1 e3 1
Database Strategy (D)
e1 e2 e3 q1 1 q2 0.5 0.5
25
Intent # Intent e1
John Smith in EE
e2
Sarah Smith in CE
e3
Kerry Smith in CS
Query # Query q1
“Smith CE”
q2
“Smith”
26
Database Strategy (D)
e1 e2 e3 q1 1 q2 1
User Strategy (U)
q1 q2 e1 1 e2 1 e3 1 Intent # Intent e1
John Smith in EE
e2
Sarah Smith in CE
e3
Kerry Smith in CS
Query # Query q1
“Smith CE”
q2
“Smith”
Smith in CE
http://tinyurl.com/charmarxiv
e2, payoff will not increase
27
.User Strategy (U)
q1 q2 e1 1 e2 1 e3 1
Database Strategy (D)
e1 e2 e3 q1 1 q2 1
User Strategy (U)
q1 q2 e1 1 e2 1 e3 1 Intent # Intent e1
John Smith in EE
e2
Sarah Smith in CE
e3
Kerry Smith in CS
Query # Query q1
“Smith CE”
q2
“Smith”
Database Strategy
e1 e2 e3 q1 1 q2 1
long run.
28
29
non-zero payoff by an amount independent of payoff
to its accumulated payoff
linear adjustment of its accumulated payoff
30
Method Mean Squared Distance Bush and Mosteller’s 0.0112 Cross’s 0.01131 Roth and Erev 0.00993 Roth and Erev Modified 0.00994 Win-Stay/Lose-Randomize 0.01752 Latest-Reward 0.15167
31
algorithms, such as UCB-1
system learning.
payoffs is a submartingale (statistically non-decreasing) and converges almost surely.
32
33
34