Multiagent Evaluation under Incomplete Information

Mark Rowland*, Shayegan Omidshafiei*, Karl Tuyls, Julien Pérolat, Michal Valko, Georgios Piliouras†, Rémi Munos
* Equal contributors
† Singapore University of Technology and Design

Motivation
● Problem of interest:
  ○ Multiagent evaluation under incomplete information
  ○ Agent evaluation in >2-player, general-sum games with noisy payoffs
● Prototypical application: multiagent iterative training
  1. Train agents via simulations in the underlying game
  2. Construct meta-game comparing performance of all agent match-ups
  3. Evaluate (i.e., rank or score) agents in the meta-game
[Figure: iterative training loop. Labels: game simulation, training, playing, meta-game synthesis, estimated payoff table, algorithm, estimated ranking vector.]

Multiagent Evaluation at a Glance: α-Rank Overview
1. Construct response graph capturing player-wise evolutionary deviations: a graph over the pure strategy profiles, with a directed edge whenever the deviating player's new strategy is a better response
2. Perturb the response graph → evolutionary mutations ensuring a unique stationary distribution
3. Stationary distribution masses → α-Rank

Example payoff table (each cell: Player 1 payoff, Player 2 payoff)
              Player 2
              L       C       R
Player 1  U   2, 1    1, 2    0, 0
          M   1, 2    2, 1    1, 0
          D   0, 0    0, 1    2, 2

[Figure: response graph over the nine pure strategy profiles (U,L), (U,C), (U,R), (M,L), (M,C), (M,R), (D,L), (D,C), (D,R)]
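The three steps above can be made concrete on the 3x3 example. The following is a minimal sketch, not the authors' implementation: it assumes a simplified chain in which each single-player deviation is proposed with equal probability, accepted with probability 1 - eps when it strictly improves the deviating player's payoff and with probability eps otherwise. The names `transition_matrix`, `stationary_distribution`, and the mutation rate `eps` are illustrative assumptions.

```python
import itertools
import numpy as np

# PAYOFFS[i, j, k]: payoff to player k at profile (row i, column j).
# Rows are U, M, D (Player 1); columns are L, C, R (Player 2), as in the table.
PAYOFFS = np.array([
    [[2, 1], [1, 2], [0, 0]],
    [[1, 2], [2, 1], [1, 0]],
    [[0, 0], [0, 1], [2, 2]],
], dtype=float)


def transition_matrix(payoffs, eps=0.01):
    """Steps 1 and 2: perturbed response graph as a Markov chain (row-major profiles)."""
    n_rows, n_cols, _ = payoffs.shape
    profiles = list(itertools.product(range(n_rows), range(n_cols)))
    index = {p: n for n, p in enumerate(profiles)}
    # Assumption: every single-player deviation is proposed equally often.
    eta = 1.0 / ((n_rows - 1) + (n_cols - 1))
    P = np.zeros((len(profiles), len(profiles)))
    for (i, j) in profiles:
        src = index[(i, j)]
        deviations = [((i2, j), 0) for i2 in range(n_rows) if i2 != i]
        deviations += [((i, j2), 1) for j2 in range(n_cols) if j2 != j]
        for dst, player in deviations:
            # Response-graph edge: the deviating player strictly gains.
            better = payoffs[dst][player] > payoffs[i, j, player]
            # Perturbation: non-improving moves still happen with small weight.
            P[src, index[dst]] = eta * ((1.0 - eps) if better else eps)
        P[src, src] = 1.0 - P[src].sum()  # remaining mass stays put
    return P


def stationary_distribution(P):
    """Step 3: solve pi P = pi with sum(pi) = 1."""
    n = P.shape[0]
    A = P.T - np.eye(n)
    A[-1, :] = 1.0  # replace one balance equation with the normalisation
    b = np.zeros(n)
    b[-1] = 1.0
    return np.linalg.solve(A, b)


if __name__ == "__main__":
    pi = stationary_distribution(transition_matrix(PAYOFFS))
    names = [r + c for r in "UMD" for c in "LCR"]
    for name, mass in sorted(zip(names, pi), key=lambda t: -t[1]):
        print(f"({name[0]},{name[1]}): {mass:.3f}")
```

With eps > 0 every profile can reach every other, which is what makes the stationary distribution unique; profiles that are only left through non-improving moves end up holding almost all of the mass.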

Multiagent Evaluation at a Glance: α-Rank Overview (interval-valued payoffs)
Same game as on the previous slide, with Player 2's payoffs at (U,L) and (U,C) given as the interval [1,2] instead of a single value. The three steps (response graph, perturbation, stationary distribution) are unchanged.

              Player 2
              L          C          R
Player 1  U   2, [1,2]   1, [1,2]   0, 0
          M   1, 2       2, 1       1, 0
          D   0, 0       0, 1       2, 2

From Uncertainty in Payoffs to Rankings
● Key question: given confidence bounds on the payoff table entries, can we efficiently compute a range of plausible α-Rank weights for the agents?
[Figure annotation: top-ranked agent when no payoff uncertainty]
● Takeaway: need careful consideration of payoff uncertainties when ranking agents
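To make the key question concrete, here is a brute-force sketch on the 3x3 example with Player 2's payoffs at (U,L) and (U,C) known only to lie in [1,2]. It is not the efficient algorithm the talk refers to: it simply enumerates both directions of every response-graph edge whose payoff intervals overlap and records the smallest and largest stationary mass per profile. The simplified chain (equal proposal probability, acceptance 1 - eps or eps) and the names `stationary` and `weight_ranges` are assumptions made for illustration.

```python
import itertools
import numpy as np

# LOW[i, j, k] <= true payoff of player k at profile (i, j) <= HIGH[i, j, k].
LOW = np.array([
    [[2, 1], [1, 1], [0, 0]],
    [[1, 2], [2, 1], [1, 0]],
    [[0, 0], [0, 1], [2, 2]],
], dtype=float)
HIGH = LOW.copy()
HIGH[0, 0, 1] = 2.0  # Player 2 at (U,L): payoff in [1, 2]
HIGH[0, 1, 1] = 2.0  # Player 2 at (U,C): payoff in [1, 2]

PROFILES = list(itertools.product(range(3), range(3)))
INDEX = {p: n for n, p in enumerate(PROFILES)}
# Every single-player deviation as (source, destination, deviating player).
EDGES = [(s, d, k) for s in PROFILES for d in PROFILES for k in (0, 1)
         if s[k] != d[k] and s[1 - k] == d[1 - k]]


def stationary(better, eps):
    """Stationary distribution when better[(s, d)] marks the improving moves."""
    n = len(PROFILES)
    P = np.zeros((n, n))
    for s, d, _ in EDGES:
        # 0.25 = one over the four possible deviations from a 3x3 profile.
        P[INDEX[s], INDEX[d]] = 0.25 * ((1.0 - eps) if better[(s, d)] else eps)
    P[np.diag_indices(n)] = 1.0 - P.sum(axis=1)
    A = P.T - np.eye(n)
    A[-1, :] = 1.0  # normalisation row
    b = np.zeros(n)
    b[-1] = 1.0
    return np.linalg.solve(A, b)


def weight_ranges(low, high, eps=0.01):
    """Min and max stationary mass per profile over all undecided edge directions."""
    # An edge is decided when the destination interval lies strictly above the source.
    decided = {(s, d): bool(low[d][k] > high[s][k]) for s, d, k in EDGES}
    undecided = [(s, d) for s, d, k in EDGES
                 if s < d and low[s][k] <= high[d][k] and low[d][k] <= high[s][k]]
    lo = np.ones(len(PROFILES))
    hi = np.zeros(len(PROFILES))
    for choice in itertools.product([True, False], repeat=len(undecided)):
        better = dict(decided)
        for (s, d), s_to_d in zip(undecided, choice):
            better[(s, d)] = s_to_d
            better[(d, s)] = not s_to_d
        pi = stationary(better, eps)
        lo, hi = np.minimum(lo, pi), np.maximum(hi, pi)
    return lo, hi


if __name__ == "__main__":
    lo, hi = weight_ranges(LOW, HIGH)
    for (i, j), a, b in zip(PROFILES, lo, hi):
        print(f"({'UMD'[i]},{'LCR'[j]}): [{a:.3f}, {b:.3f}]")
```

Here a single undecided edge, between (U,L) and (U,C), is enough to move the mass on (U,L) substantially. The enumeration is exponential in the number of undecided edges, so it only illustrates what a "range of plausible weights" means.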

Contributions
1. Static sample complexity bounds quantifying the number of interactions needed to confidently rank agents
2. Algorithm that adaptively simulates agent interactions that are most informative for ranking
3. Analysis of the propagation of payoff uncertainty to the final rankings computed
● Sample complexity guarantees & efficient algorithm for bounding rankings given payoff uncertainty
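As a rough illustration of what a static sample complexity bound looks like, the sketch below applies the standard Hoeffding inequality to a single payoff entry. It is a generic calculation, not the bound from this work: it assumes payoffs bounded in an interval of width `payoff_range` and asks how many independent simulations keep the empirical mean within half of a payoff gap `gap` of the true mean, with failure probability at most `delta`. The function name and parameters are hypothetical.

```python
import math


def samples_per_entry(gap, delta, payoff_range=1.0):
    """Samples so that |empirical mean - true mean| < gap / 2 with prob. >= 1 - delta.

    Hoeffding: P(|mean - mu| >= e) <= 2 exp(-2 n e^2 / R^2), so it suffices that
    n >= R^2 ln(2 / delta) / (2 e^2), here with e = gap / 2 and R = payoff_range.
    """
    e = gap / 2.0
    return math.ceil(payoff_range ** 2 * math.log(2.0 / delta) / (2.0 * e ** 2))


if __name__ == "__main__":
    # Payoffs in [0, 2] as in the example table, smallest payoff gap of 1.
    print(samples_per_entry(gap=1.0, delta=0.05, payoff_range=2.0))
```

If both payoff estimates on an edge are within half the gap of their true values, the empirical comparison points the same way as the true one. Covering every entry of the table at once would additionally require a union bound, that is, a smaller `delta` per entry.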

Details & evaluations at poster #220!
