Speaker
Arpita rpita Bisw swas as
PhD Stude udent nt (Google Fellow) Game Theory y Lab, Dept. . of CSA, Indi dian n Institut stitute of Science nce, , Bangalore re Email address ress: arpita
ta.bisw iswas@li as@live.in e.in
Arpita rpita Bisw swas as PhD Stude udent nt (Google Fellow) - - PowerPoint PPT Presentation
Speaker Arpita rpita Bisw swas as PhD Stude udent nt (Google Fellow) Game Theory y Lab, Dept. . of CSA, Indi dian n Institut stitute of Science nce, , Bangalore re arpita ta.bisw iswas@li as@live.in e.in Email address ress:
Speaker
PhD Stude udent nt (Google Fellow) Game Theory y Lab, Dept. . of CSA, Indi dian n Institut stitute of Science nce, , Bangalore re Email address ress: arpita
ta.bisw iswas@li as@live.in e.in
“Mathematical framework for rigorous study of conflict and cooperation among rati tiona nal and int ntellige gent nt agents”
“Mathematical framework for rigorous study of conflict and cooperation among rati tiona nal and int ntellige gent nt agents” the agent would always choose an action that maximizes her/his (expected) utility.
game that a game theorist can make.
determining a best response strategy
“Mathematical framework for rigorous study of conflict and cooperation among rati tiona nal and int ntellige gent nt agents” the agent would always choose an action that maximizes her/his (expected) utility.
game that a game theorist can make.
determining a best response strategy preferences of the players expressed in terms of real numbers
The problem is as follows:
were never caught.
separately
while the other would be sentenced to 10 years in jail”.
Bunty ty Bubly Confess ess Confess ess Not
ess Not
Confess ess Two Player ers: s:
Two Ac Action ions: s:
The utilit ity y matrix rix models the strategic conflict when two players have to choose their priorities
Confess ess Confess ess Not
ess Not
Confess ess The utilit ity y matrix rix models the strategic conflict when two players have to choose their priorities < 𝑂, (𝐵𝑗)𝑗∈𝑂 , (𝑉𝑗)𝑗∈𝑂 > 𝑂 ∶ set of players 𝐵𝑗 ∶ set of actions for player 𝑗 𝑉𝑗 ∶ 𝐵1 × ⋯ × 𝐵|𝑂| → ℝ Action profile or Strategy profile Bunty ty Bubly Two Player ers: s:
Two Ac Action ions: s:
Confess ess Confess ess Not
ess Not
Confess ess Bunty ty Bubly Two Player ers: s:
Two Ac Action ions: s:
Confess ess Confess ess Not
ess Not
Confess ess Bunty ty Bubly Two Player ers: s:
Two Ac Action ions: s:
Confess ess Confess ess Not
ess Not
Confess ess Bunty ty Bubly Two Player ers: s:
Two Ac Action ions: s:
Confess ess Confess ess Not
ess Not
Confess ess Bunty ty Bubly Two Player ers: s:
Two Ac Action ions: s:
Confess ess Confess ess Not
ess Not
Confess ess
Bunty ty Bubly Two Player ers: s:
Two Ac Action ions: s:
Confess ess Confess ess Not
ess Not
Confess ess
A strategy profile in which no player gains by changing only his/her own strategy (assuming no one else changes their strategy) Bunty ty Bubly Two Player ers: s:
Two Ac Action ions: s:
Alice Bob Deep p Le Learn rning ng Deep Learning Webs bsite e Desi signi gning Websi site Designin gning Two Player ers: s:
Two Ac Action ions: s:
Project
Project
Alice Bob Deep p Le Learn rning ng Deep Learning Webs bsite e Desi signi gning Websi site Designin gning Two Player ers: s:
Two Ac Action ions: s:
Project
Project
Alice Bob Deep p Le Learn rning ng Deep Learning Webs bsite e Desi signi gning Websi site Designin gning Two Player ers: s:
Two Ac Action ions: s:
Project
Project Nash h Equil quilib ibria
Alice Bob Deep p Le Learn rning ng Deep Learning Webs bsite e Desi signi gning Websi site Designin gning Two Player ers: s:
Two Ac Action ions: s:
Project
Project Does s there exist st there e any ot
er Nash h Equi quilibr ibrium ium in this s game? e? Nash h Equil quilib ibria
Alice Bob Deep p Le Learn rning ng Deep Learning Webs bsite e Desi signi gning Websi site Designin gning Two Player ers: s:
Two Ac Action ions: s:
Project (DL)
Project (WD) Does s there exist st there e any ot
er Nash h Equi quilibr ibrium ium in this s game? e? Alice: With probability 2/3 choose DL and with probability 1/3 choose WD Bob: With probability 1/3 choose DL and with probability 2/3 choose WD Nash h Equil quilib ibria
𝑭𝒘𝒇𝒔𝒛 𝒈𝒋𝒐𝒋𝒖𝒇 𝒕𝒖𝒔𝒃𝒖𝒇𝒉𝒋𝒅 𝒈𝒑𝒔𝒏 𝒉𝒃𝒏𝒇 𝒊𝒃𝒕 𝒃𝒖 𝒎𝒇𝒃𝒕𝒖 𝒑𝒐𝒇 𝒏𝒋𝒚𝒇𝒆 𝒕𝒖𝒔𝒃𝒖𝒇𝒉𝒛 𝑶𝒃𝒕𝒊 𝑭𝒓𝒗𝒋𝒎𝒋𝒄𝒔𝒋𝒗𝒏. ,𝑶𝒃𝒕𝒊 𝑼𝒊𝒇𝒑𝒔𝒇𝒏, 𝟐𝟘𝟔𝟏-.
𝑭𝒘𝒇𝒔𝒛 𝒈𝒋𝒐𝒋𝒖𝒇 𝒕𝒖𝒔𝒃𝒖𝒇𝒉𝒋𝒅 𝒈𝒑𝒔𝒏 𝒉𝒃𝒏𝒇 𝒊𝒃𝒕 𝒃𝒖 𝒎𝒇𝒃𝒕𝒖 𝒑𝒐𝒇 𝒏𝒋𝒚𝒇𝒆 𝒕𝒖𝒔𝒃𝒖𝒇𝒉𝒛 𝑶𝒃𝒕𝒊 𝑭𝒓𝒗𝒋𝒎𝒋𝒄𝒔𝒋𝒗𝒏. ,𝑶𝒃𝒕𝒊 𝑼𝒊𝒇𝒑𝒔𝒇𝒏, 𝟐𝟘𝟔𝟏-.
𝑭𝒘𝒇𝒔𝒛 𝒈𝒋𝒐𝒋𝒖𝒇 𝒕𝒖𝒔𝒃𝒖𝒇𝒉𝒋𝒅 𝒈𝒑𝒔𝒏 𝒉𝒃𝒏𝒇 𝒊𝒃𝒕 𝒃𝒖 𝒎𝒇𝒃𝒕𝒖 𝒑𝒐𝒇 𝒏𝒋𝒚𝒇𝒆 𝒕𝒖𝒔𝒃𝒖𝒇𝒉𝒛 𝑶𝒃𝒕𝒊 𝑭𝒓𝒗𝒋𝒎𝒋𝒄𝒔𝒋𝒗𝒏. ,𝑶𝒃𝒕𝒊 𝑼𝒊𝒇𝒑𝒔𝒇𝒏, 𝟐𝟘𝟔𝟏-.
𝒋𝒕 𝑸𝑸𝑩𝑬 − 𝒅𝒑𝒏𝒒𝒎𝒇𝒖𝒇 𝑮𝒋𝒐𝒆𝒋𝒐𝒉 𝒏𝒋𝒚𝒇𝒆 𝒕𝒖𝒔𝒃𝒖𝒇𝒉𝒛 𝑶𝒃𝒕𝒊 𝑭𝒓𝒗𝒋𝒎𝒋𝒄𝒔𝒋𝒗𝒏 ,𝑬𝒃𝒕𝒍𝒃𝒎𝒃𝒍𝒋𝒕 𝒇𝒖 𝒃𝒎. , 𝟑𝟏𝟏𝟕-.
gly Dominant inant Strategy egy Equi quilibr ibriu ium m (SDSE SE): ): 𝐵𝑜 𝑏𝑑𝑢𝑗𝑝𝑜 𝑞𝑠𝑝𝑔𝑗𝑚𝑓 𝑏1, ⋯ , 𝑏𝑜 𝑗𝑡 𝑑𝑏𝑚𝑚𝑓𝑒 𝒕𝒖𝒔𝒑𝒐𝒉𝒎𝒛 𝒆𝒑𝒏𝒋𝒐𝒃𝒐𝒖 𝒕𝒖𝒔𝒃𝒖𝒇𝒉𝒛 𝒇𝒓𝒗𝒋𝒎𝒋𝒄𝒔𝒋𝒗𝒏 𝑔𝑝𝑠 𝑏 𝑏𝑛𝑓 < 𝑂, 𝐵𝑗 , 𝑉𝑗 >, 𝑗𝑔 ∀𝑗 ∈ 𝑂 𝑏𝑜𝑒 ∀𝑐𝑗 ∈ 𝐵𝑗 ∖ *𝑏𝑗+, 𝑉𝑗 𝑏𝑗, 𝑐−𝑗 > 𝑉𝑗 𝑐𝑗, 𝑐−𝑗 ∀𝑐−𝑗 ∈ 𝐵−𝑗 . Confess ess Confess ess Not
ess Not
Confess ess Bunty ty Bubly
y Domin inant nt Strategy egy Equi quilibr ibriu ium m (WDS DSE): E):
𝐵𝑜 𝑏𝑑𝑢𝑗𝑝𝑜 𝑞𝑠𝑝𝑔𝑗𝑚𝑓 𝑏1, ⋯ , 𝑏𝑜 𝑗𝑡 𝑑𝑏𝑚𝑚𝑓𝑒 𝒙𝒇𝒃𝒍𝒎𝒛 𝒆𝒑𝒏𝒋𝒐𝒃𝒐𝒖 𝒕𝒖𝒔𝒃𝒖𝒇𝒉𝒛 𝒇𝒓𝒗𝒋𝒎𝒋𝒄𝒔𝒋𝒗𝒏 𝑔𝑝𝑠 𝑏 𝑏𝑛𝑓 < 𝑂, 𝐵𝑗 , 𝑉𝑗 >, 𝑗𝑔 ∀𝑗 ∈ 𝑂 𝑏𝑜𝑒 ∀𝑐𝑗 ∈ 𝐵𝑗, 𝑉𝑗 𝑏𝑗, 𝑐−𝑗 ≥ 𝑉𝑗 𝑐𝑗, 𝑐−𝑗 ∀𝑐−𝑗 ∈ 𝐵−𝑗 𝑏𝑜𝑒 𝑉𝑗 𝑏𝑗, 𝑐−𝑗 > 𝑉𝑗 𝑐𝑗, 𝑐−𝑗 𝑔𝑝𝑠 𝑡𝑝𝑛𝑓 𝑐−𝑗 ∈ 𝐵−𝑗 . Confess ess Confess ess Not
ess Not
Confess ess Bunty ty Bubly
inant Strategy egy Equi quilibr ibriu ium m (VWDSE DSE): ):
𝐵𝑜 𝑏𝑑𝑢𝑗𝑝𝑜 𝑞𝑠𝑝𝑔𝑗𝑚𝑓 𝑏1, ⋯ , 𝑏𝑜 𝑗𝑡 𝑑𝑏𝑚𝑚𝑓𝑒 𝒘𝒇𝒔𝒛 𝒙𝒇𝒃𝒍𝒎𝒛 𝒆𝒑𝒏𝒋𝒐𝒃𝒐𝒖 𝒕𝒖𝒔𝒃𝒖𝒇𝒉𝒛 𝒇𝒓𝒗𝒋𝒎𝒋𝒄𝒔𝒋𝒗𝒏 𝑔𝑝𝑠 𝑏 𝑏𝑛𝑓 < 𝑂, 𝐵𝑗 , 𝑉𝑗 >, 𝑗𝑔 ∀𝑗 ∈ 𝑂 𝑏𝑜𝑒 ∀𝑐𝑗 ∈ 𝐵𝑗, 𝑉𝑗 𝑏𝑗, 𝑐−𝑗 ≥ 𝑉𝑗 𝑐𝑗, 𝑐−𝑗 ∀𝑐−𝑗 ∈ 𝐵−𝑗 . Confess ess Confess ess Not
ess Not
Confess ess Bunty ty Bubly
Alice Bob Deep p Learning ng Deep Learning Websi site e Desi signing gning Websi site Designing gning
Analyzing these games show how agents can rationally form beliefs over what
profitable action as well as predicting behavior of others.
How would you create the rules of a game to achieve a desired objective? Ans: Mechanism hanism Design ign
How would you create the rules of a game to achieve a desired objective? Ans: Mechanism hanism Design ign
Game Theory, along with Mechanism Design have emerged as an important tool to model, analyze, and solve decentralized design problems in engineering involving multiple autonomous agents that interact strategically in a rational and intelligent way.
Courtesy: Google images
I want no less than half the cake I want no less than half the cake
Courtesy: Google images
I want no less than half the cake I want no less than half the cake
Courtesy: Google images
I want no less than half the cake I want no less than half the cake
Courtesy: Google images
I want no less than half the cake I want no less than half the cake
Courtesy: Google images
I want no less than half the cake I want no less than half the cake
Courtesy: Google images
I want no less than half the cake I want no less than half the cake
Courtesy: Google images
Mother makes one of the kids “cutter” and the other “chooser”
I want one a piece with at least one fourth of all the fruits I want a piece with at least one- fourth of all the kiwi pieces. I want the cake to be split into exactly 4 equal parts I want at least one- eighth of strawberry cream and at least one- eighth of kiwi cream
Courtesy: Google images
Courtesy: Google images
Courtesy: Google images
cativ ive Effici iciency ncy: Allocation should maximize the sum of value
ividua ual Ration ionalit ity: “Players do not loose anything by participating in the game” or “Voluntary Participation”
inant nt Strategy egy Incen enti tive e Compat atib ibilit ity:”Strategy-proofness”
Dictat atorshi ship: p: “There is no agent for whom all outcomes turn out to be favored outcomes.”
payment
Questions of Interest
A single project worth Rs. 300. One contractor (C) Two laborers (A and B).
least one laborer.
A single project worth Rs. 300. One contractor (C) Two laborers (A and B).
least one laborer. How to split the cost among the contractor and the two laborers? <100,100,100>?
Recall:
Recall:
Recall:
Recall: Shapley value split: <50, 50, 200>
Non–Cooperative Game Theory Mechanism Design Cooperative Game Theory
ce Allocation:
ement t Aucti ction:
dsourcin cing: g: Design a mechanism to complete as many task as possible with maximum quality.
ne Educat cation
tfor
ms (MOO OOCs Cs): Designing incentives to improve participation level of students and instructors.
al Net etwor
k Analysis ysis: Discovering influential nodes, providing incentives to ensure maximum spread of information over a network.
Separat ate e aucti tion
query: y:
itions
ed in order of bid (more e on this s later). ).
tisers ers pay bid of the adver erti tise ser r in the posit ition
.
Separate auction for every query:
Simp mple e set etting ting One “ad slot” and N competing advertisers Which ad to show and what should the advertiser pay? Solving this requires a mechan hanism sm comprising an allocat cation ion rule and a payme ment nt rule. Separate auction for every query:
Separate auction for every query:
VCG (Vickrey-Clarke-Groves) mechanism : * * Allocat cation ion rule: Give the ad-slot to the advertiser with maximum valuation/bid Simp mple e set etting ting One “ad slot” and N competing advertisers Which ad to show and what should the advertiser pay? Solving this requires a mechan hanism sm comprising an allocat cation ion rule and a payme ment nt rule. * * Pa Paymen ent t rule: Take the second-highest bid value from the selected advertiser
(parameters to be learnt)
an user click the ad (strategic parameter)
GOAL: Design a mechanism (allocation rule and payment rule) that ensures truthful elicitation of bids (“strategy proof-ness”) as well as maximizes the total payment received from the advertisers within a limited number of trials.
http://www.gametheory.net http://www.gametheorysociety.org http://william-king.www.drexel.edu/top/eco/game/game.html http://levine.sscnet.ucla.edu/ http://plato.acadiau.ca/courses/educ/reid/games/General_Games_Links.htm The book followed for preparing this lecture: “Game Theory and Mechanism Design” by Prof. Y. Narahari.