Algorithms for Graphical Games Luis E. Ortiz MIT CSAIL February 1, - PowerPoint PPT Presentation

Algorithms for Graphical Games Luis E. Ortiz MIT CSAIL February 1, 2005 DIMACS Workshop on Bounded Rationality Joint work with Sham Kakade, Michael Kearns, John Langford, Michael Littman and Robert Schapire

In this talk... Large population games with limited players’ interaction Game theory • Provides sound, rigorous mathematical formulation • Limited attention to problem representations: commonly “flat”, large-size representations that don’t exploit “structure” • Recently, graph-based representations introduced to model interaction Algorithms for computing equilibria in large population games with structured interactions Algorithms for Graphical Games 1/44

Internet Connectivity • 100s players/nodes • 1000s interactions • dense/sparse regions • Internet protocols [Korilis and Lazar, 1995; Nisan and Ronen, 1999; Papadimitriou, 2001; Roughgarden and Tardos, 2000; Shenker, 1995; ...] [Courtesy C AIDA] Algorithms for Graphical Games 2/44

International Trade [Kre mpel&Ple umpe r] Algorithms for Graphical Games 3/44

Graphical Models Graphical models: Models of structured probabilistic interaction • Deal with P ( X 1 , . . . , X n ) for which naive representation is size 2 n • Compact representations for P ; similarly for decision theory (Ex.: Bayesian and Markov networks, influence diagrams, . . . ) • Representation size mostly a function of the “degree of local interaction” among random variables X 1 , . . . , X n • Graph allows – Easy interpretation (useful for both modeling/knowledge- engineering and qualitative inference) – Compact representations – Efficient computation in some cases Want same benefits for game theory... Algorithms for Graphical Games 4/44

Graphical Models for Game Theory [Kearns, Littman and Singh, 2001] Graphical games • Borrow representational ideas from graphical models • Intuitive graph interpretation: A player’s payoff is only a function of its neighborhood • Ex’s: geography, organizational structure, networks • Analogy to probabilistic graphical models: special structure 8 7 3 2 1 5 4 6 Alternative graphical model formulations exist: Multi-agent influence diagrams (MAIDs) [Koller and Milch, 2001] ; Game networks [LaMura, 2000] ; Local-effect Games [Leyton-Brown and Tennenholtz, 2003] ; Action-Graph Games [Bhat and Leyton-Brown, 2004] Algorithms for Graphical Games 5/44

What about algorithms? • Algorithmic analogues to some inference methods in probabilistic graphical models already developed for computing Nash equilibria [Kearns, Littman and Singh, 2001; Littman, Kearns and Singh, 2001; Vickrey and Koller, 2002] • Sometimes “tractable” computation • Effective heuristics for general cases • Until recently, mostly work/results on computing Nash equilibria Algorithms for Graphical Games 6/44

Overview • Graphical Games – Compact representations for matrix/normal-form games • Computing Nash Equilibria • Correlated Equilibria • Maximum Entropy Correlated Equilibria Efficient representation and computation Algorithms for Graphical Games 7/44

Classical Example: Prisoners’ Dilemma [Tucker; see Luce and Raiffa, 1957] • Two prisoners: To confess or not to confess??? Payoff (sentences) Prisoner 2 Not-Confess Confess Not-Confess 1 year, 1 year ⇒ 10 years, 3 months Prisoner 1 ⇓ ⇓ Confess 3 months, 10 years ⇒ 8 years, 8 years • Only joint best response is for both to confess! Algorithms for Graphical Games 8/44

Noncooperative Game Theory [Von Neumann and Morgersten, 1944; Nash, 1951] Games with greedy players acting independently Mathematical formulation: Normal-form games A set of players { 1 , ..., n } , each with a set of actions { 0 , 1 } a ∈ { 0 , 1 } n , player i ’s payoff M i ( � Payoff matrix M i : if joint-action � a ) Mixed strategy: player i plays action 1 with probability p i Joint mixed strategy: product distribution Player individually maximizes its expected payoff Classical solution p ∗ such that no ( ǫ )-Nash equilibrium (NE) is a joint mixed strategy � player i can gain (more than ǫ ) by unilaterally deviating from p ∗ i . NEs always exist! Note representation size O ( n 2 n ) (exponential in number of players) Algorithms for Graphical Games 9/44

Graphical Games [Kearns, Littman and Singh, 2001] Definition • G : undirected graph representing the local interaction • Player i ’s payoff is only a function of its neighborhood N ( i ) – Implies conditional independence payoff assumption • local payoff matrix M ′ a ) = M ′ i : M i ( � i ( � a [ N ( i )]) • Graphical game: ( G, { M ′ i } ) max degree of local interaction k = max i | N ( i ) | ≪ n Representation size O ( n 2 k ) (exponential in max degree) 8 7 3 2 1 5 4 6 Algorithms for Graphical Games 10/44

Overview • Graphical Games • Computing Nash Equilibria – NashProp : a distributed, message-passing algorithm • Correlated Equilibria • Maximum Entropy Correlated Equilibria Algorithms for Graphical Games 11/44

The TreeProp Algorithm [Kearns, Littman and Singh, 2001] Dynamic programming algorithm Table-passing phase T ( w, v ) represents there exists a NE “upstream” in which V plays v and W is “clamped” to w U 2 U 3 U 1 Assignment-passing phase Assign NE mixed strategy to root. Recursively find assignments for immediate “upstream” V neighbors consistent with tables ( i.e. , are NE) Representation results W • For ǫ -NE need τ -size grid for tables polynomial in model size and 1 /ǫ Algorithms for Graphical Games 12/44

The NashProp Algorithm [Ortiz and Kearns, 2003] A distributed, message-passing algorithm: natural extension of TreeProp to arbitrary graphs Extension as from polytree algorithm to belief propagation [Pearl, 1988] • In our case: propagate “conditional Nash equilibria” Table-passing phase T ( w, v ) represents V ’s “belief” in a NE (in the rest of the graph) in which V plays v and U 2 U 1 U 3 W is “clamped” to w V Representation results carry over from TreeProp W Convergence for loopy graphs? Algorithms for Graphical Games 13/44

Convergence of Table-Passing Phase • Table-passing phase always converges • All NE preserved • For discretization scheme – Tables converge quickly (number of rounds polynomial in model size) – Each round takes polynomial in model size (for fixed grid size) Algorithms for Graphical Games 14/44

Assignment-passing phase • NEs preserved but search still needed • More 0 ’s in tables can lead to significantly reduced search space • Many heuristics possible – Backtracking local search • Discretization scheme: Computation time per round polynomial in model size (for fixed grid size) Discretization scheme leads to constraint satisfaction problem (CSP) formulations [Vickrey and Koller, 2002] : NashProp is a particular instantiation of arc-consistency followed by backtracking local search in a particular CSP Algorithms for Graphical Games 15/44

Example of Ideal Behavior r = 1 r = 3 r = 2 r = 8 • Graph: 3 × 3 “wrapped-around” grid • Each row shows outbound tables for each player Algorithms for Graphical Games 16/44

NashProp • Converging first phase with – table sizes for ǫ -NE polynomial in the size of the model – running time also polynomial (for fixed k ) • Second phase is a backtracking local search • For both phases, each round polynomial in the size of the model (for fixed k ) Algorithms for Graphical Games 17/44

Experimental Setup • Experiments – (Large) Number of players – Loopy graph topology – Random local (payoff) matrices – Different payoff structures • Used heuristic local search as assignment-passing phase Algorithms for Graphical Games 18/44

Experimental Results Table-Passing Phase Assignment-Passing Phase 14 10 0.53 cycle cycle grid grid 0.65 chordal(0.25,1,2,3) chordal(0.25,1,2,3) 0.59 12 chordal(0.25,1,1,2) chordal(0.25,1,1,2) 0.60 chordal(0.25,1,1,1) chordal(0.25,1,1,1) 8 chordal(0.5,1,2,3) chordal(0.5,1,2,3) 0.42 chordal(0.5,1,1,2) chordal(0.5,1,1,2) 10 chordal(0.5,1,1,1) chordal(0.5,1,1,1) grid(3) grid(3) 0.81 grid(2) grid(2) 0.61 number of rounds number of rounds 6 grid(1) grid(1) 8 ringofrings ringofrings 0.81 0.78 0.87 6 4 0.93 4 2 2 1.00 0 0 0 20 40 60 80 100 0 20 40 60 80 100 number of players number of players Algorithms for Graphical Games 19/44

NE in GG: Related Work • Exact NE Computation – All NE in trees: exponential in representation size [Kearns, Littman and Singh, 2001] ; Single NE in trees: polynomial for 2 -action [Littman, Kearns and Singh, 2002] , m -action open! ; Single NE in loopy graphs: continuation-method heuristic [Blum, Shelton and Koller, 2003] • Other approximation heuristics – CSP formulation: Cluster [Kearns, Littman and Singh, 2001] and junction- tree [Vickrey and Koller, 2002] ; Gradient ascent and “hybrid” approaches [Vickrey and Koller, 2002] • Some recent results on computing NE for torus-like GG [Daskalakis and Papadimitriou, 2004] Algorithms for Graphical Games 20/44

Overview • Graphical Games • Computing Nash Equilibria • Correlated Equilibria – Definition and motivation – Exploiting strategic structure – Representation – Connection to probabilistic graphical models – Computation • Maximum Entropy Correlated Equilibria Algorithms for Graphical Games 21/44

Algorithms for Graphical Games Luis E. Ortiz MIT CSAIL February 1, - PowerPoint PPT Presentation

Algorithms for Graphical Games Luis E. Ortiz MIT CSAIL February 1, 2005 DIMACS Workshop on Bounded Rationality Joint work with Sham Kakade, Michael Kearns, John Langford, Michael Littman and Robert Schapire In this talk... Large population

Algorithms for Parity Games Piotr Danilewski May 15, 2008 Piotr Danilewski Algorithms for

Games Miheer Dewaskar Chennai Mathematical Institute April 27, 2016 1 / 19 Outline Finite

S S S S erious Games erious Games erious Games erious Games + Computer S + Computer S +

Potential Games Matoula Petrolia April 14, 2011 Examples Potential Games Potential vs

Pre-Grundy Games Games And Graphs Workshop 2017 In collaboration with : Eric Duch ene,

Graphical Screen Design Grids are an essential tool for graphical design Important graphical

Graphical > Tangible? What are their limitations? 93 94 Graphical > Tangible? Graphical

Graphical Models Graphical Models Bayesian Networks Siamak Ravanbakhsh Fall 2019 Previously on

Transforming Graphical System Models to Graphical Attack Models ! Joint work with Marieta

Graphical Screen Design Grids are an essential tool for graphical design Important graphical

10/4/15 Graphical Programming (1) Maze Program TOPICS Graphical Programming Using

LOGIC OF GAMES Andreas Blass University of Michigan Ann Arbor, MI 48109 ablass@umich.edu Games

Nash Dynamics and Potential Games Maria Serna Fall 2016 AGT-MIRI, FIB Potential Games Contents

CSC2556 Lecture 11 Noncooperative Games 2: Zero-Sum Games, Stackelberg Games CSC2556 - Nisarg

Congestion Games with affine functions Maria Serna Fall 2016 AGT-MIRI, FIB-UPC Congestion Games

Games with Sequential Actions: (Finite) Extensive- Form Games Xinshuo Weng Outline What are

Way Presented by Tamera Parsons, CPHQ July 24, 2017 1 Housekeeping Slides were sent this

APAC & ATAC July Meeting July 20/July 21, 2020 Texas Education Agency | Governance &

High-Performance Deep Learning: Issues, Trends, and Challenges CSE 5194.01 Autumn 20

Keeping the Conversation Going Cambridge University Press Remote Teaching Series Starting Soon!

Improving Undergraduate PL Curriculum Kathleen Fisher AT&T Labs Research 1 ACM SIGPLAN

Vis tools and case studies Vis tools and case studies Thomas Torsney-Weir VDA research group,

Introduction to HPSG Class 2: Constituent Order Variation & Complex Predicates Stefan M

TTMG 5001 Principles of Management for Engineers Session 2: Sep 21 Fall 2011 Michael Weiss

Algorithms for Graphical Games Luis E. Ortiz MIT CSAIL February 1, - PowerPoint PPT Presentation

Algorithms for Graphical Games Luis E. Ortiz MIT CSAIL February 1, 2005 DIMACS Workshop on Bounded Rationality Joint work with Sham Kakade, Michael Kearns, John Langford, Michael Littman and Robert Schapire In this talk... Large population

Algorithms for Parity Games Piotr Danilewski May 15, 2008 Piotr Danilewski Algorithms for

Games Miheer Dewaskar Chennai Mathematical Institute April 27, 2016 1 / 19 Outline Finite

S S S S erious Games erious Games erious Games erious Games + Computer S + Computer S +

Potential Games Matoula Petrolia April 14, 2011 Examples Potential Games Potential vs

Pre-Grundy Games Games And Graphs Workshop 2017 In collaboration with : Eric Duch ene,

Graphical Screen Design Grids are an essential tool for graphical design Important graphical

Graphical &gt; Tangible? What are their limitations? 93 94 Graphical &gt; Tangible? Graphical

Graphical Models Graphical Models Bayesian Networks Siamak Ravanbakhsh Fall 2019 Previously on

Transforming Graphical System Models to Graphical Attack Models ! Joint work with Marieta

Graphical Screen Design Grids are an essential tool for graphical design Important graphical

10/4/15 Graphical Programming (1) Maze Program TOPICS Graphical Programming Using

LOGIC OF GAMES Andreas Blass University of Michigan Ann Arbor, MI 48109 ablass@umich.edu Games

Nash Dynamics and Potential Games Maria Serna Fall 2016 AGT-MIRI, FIB Potential Games Contents

CSC2556 Lecture 11 Noncooperative Games 2: Zero-Sum Games, Stackelberg Games CSC2556 - Nisarg

Congestion Games with affine functions Maria Serna Fall 2016 AGT-MIRI, FIB-UPC Congestion Games

Games with Sequential Actions: (Finite) Extensive- Form Games Xinshuo Weng Outline What are

Way Presented by Tamera Parsons, CPHQ July 24, 2017 1 Housekeeping Slides were sent this

APAC &amp; ATAC July Meeting July 20/July 21, 2020 Texas Education Agency | Governance &amp;

High-Performance Deep Learning: Issues, Trends, and Challenges CSE 5194.01 Autumn 20

Keeping the Conversation Going Cambridge University Press Remote Teaching Series Starting Soon!

Improving Undergraduate PL Curriculum Kathleen Fisher AT&amp;T Labs Research 1 ACM SIGPLAN

Vis tools and case studies Vis tools and case studies Thomas Torsney-Weir VDA research group,

Introduction to HPSG Class 2: Constituent Order Variation &amp; Complex Predicates Stefan M

TTMG 5001 Principles of Management for Engineers Session 2: Sep 21 Fall 2011 Michael Weiss

Graphical > Tangible? What are their limitations? 93 94 Graphical > Tangible? Graphical

APAC & ATAC July Meeting July 20/July 21, 2020 Texas Education Agency | Governance &

Improving Undergraduate PL Curriculum Kathleen Fisher AT&T Labs Research 1 ACM SIGPLAN

Introduction to HPSG Class 2: Constituent Order Variation & Complex Predicates Stefan M