SLIDE 1

Grid-Wise Control for Multi-Agent Reinforcement Learning in Video Game AI

Lei Han*1, Peng Sun*1, Yali Du*2, Jiechao Xiong1, Qing Wang1, Xinghai Sun1, Han Liu3, Tong Zhang4
1 Tencent AI Lab, Shenzhen, China; 2 University of Technology Sydney, Australia; 3 Northwestern University, IL, USA; 4 Hong Kong University of Science and Technology, Hong Kong, China
* Equal contribution. Email: leihan.cs@gmail.com

SLIDE 2

Introduction

Considered Problem

  • Multi-agent reinforcement learning (MARL)
  • Grid-world environment (video game)
  • Challenge

    - Flexibly control an arbitrary number of agents
    - Achieve effective collaboration among them

Existing MARL Approaches

  • Decentralized learning

    - IQL, IAC (Tan, 1993; Foerster et al., 2017)

  • Centralized learning

    - CommNet, BiCNet (Sukhbaatar et al., 2016; Peng et al., 2017)

  • Mixture

    - COMA, QMIX, Mean-Field (Foerster et al., 2017; Rashid et al., 2018; Yang et al., 2018)

Limitation: these approaches are unable, or unstable, when dealing with a varying number of agents

SLIDE 3

GridNet

Architecture

  • Encoder

    - Inputs are represented as an image-like structure
    - Convolutional/pooling layers generate an embedding

  • Decoder

    - Up-sampling constructs a per-grid action map
    - Each agent takes the action in the grid it occupies (see the sketch below)
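
A minimal sketch of this encoder-decoder idea in PyTorch (an illustrative assumption, not the authors' implementation; the class name GridNet, the layer widths, and n_actions are all made up for the example):

```python
import torch
import torch.nn as nn

class GridNet(nn.Module):
    """Encoder-decoder sketch: image-like state in, per-grid action logits out."""
    def __init__(self, in_channels: int, n_actions: int):
        super().__init__()
        # Encoder: conv/pooling layers shrink the map and enlarge the receptive field.
        self.encoder = nn.Sequential(
            nn.Conv2d(in_channels, 32, 3, padding=1), nn.ReLU(),
            nn.MaxPool2d(2),
            nn.Conv2d(32, 64, 3, padding=1), nn.ReLU(),
            nn.MaxPool2d(2),
        )
        # Decoder: up-sampling restores the input resolution as an action map.
        self.decoder = nn.Sequential(
            nn.ConvTranspose2d(64, 32, 2, stride=2), nn.ReLU(),
            nn.ConvTranspose2d(32, n_actions, 2, stride=2),
        )

    def forward(self, state: torch.Tensor) -> torch.Tensor:
        # state: (B, in_channels, H, W) with H, W divisible by 4.
        # Returns action logits of shape (B, n_actions, H, W).
        return self.decoder(self.encoder(state))
```

Because every grid produces its own action logits, one forward pass controls however many agents happen to be on the map.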

SLIDE 4

GridNet

Algorithms

  • Can be integrated with many general RL algorithms

    - Q-learning
    - Actor-critic (see the sketch below)
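
As one example of such an integration, a hedged actor-critic sketch for a single state (the helper name, the separate scalar critic value, and the plain advantage-weighted form are assumptions, not the paper's exact update):

```python
import torch
from torch.distributions import Categorical

def grid_actor_critic_loss(action_map, value, agent_xy, actions, ret):
    """Actor-critic sketch for one state.

    action_map: (n_actions, H, W) logits from the GridNet decoder
    value:      state-value estimate from a critic head (0-dim tensor)
    agent_xy:   list of (y, x) grid coordinates occupied by agents
    actions:    (N,) tensor with the action each of the N agents took
    ret:        scalar empirical return
    """
    advantage = (ret - value).detach()
    # The joint policy factorizes over occupied grids, so the joint
    # log-probability is a sum of per-grid categorical log-probabilities.
    log_prob = sum(
        Categorical(logits=action_map[:, y, x]).log_prob(a)
        for (y, x), a in zip(agent_xy, actions)
    )
    policy_loss = -advantage * log_prob
    value_loss = (ret - value).pow(2)
    return policy_loss + 0.5 * value_loss
```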

Properties

  • Collaboration is natural

    - Stacked convolutional and/or pooling layers provide a large receptive field
    - Each agent is aware of the other agents in its neighborhood

  • Fast parallel exploration

    - Convolutional parameters are shared by all the agents
    - Once one agent takes a beneficial action during its own exploration, the other agents acquire that knowledge as well

  • Transferable policy

    - The trained policy transfers easily to settings with a different number of agents (see the sketch below)
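
Acting is just a per-grid lookup, which is why the policy transfers: a hypothetical helper like the one below (not from the paper) is identical whether 5 or 50 agents are present.

```python
import torch

def act(action_map: torch.Tensor, agent_xy: list) -> list:
    """Greedy action selection from an (n_actions, H, W) action map.
    Each agent reads the grid it occupies; nothing here depends on the
    number of agents, which is what makes the policy transferable."""
    return [int(action_map[:, y, x].argmax()) for (y, x) in agent_xy]
```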

SLIDE 5

Experiments on Battle Games in StarCraft II

Scenarios

  • 5 Immortals vs. 5 Immortals (5I)
  • 3 Immortals + 2 Zealots vs. 3 Immortals + 2 Zealots (3I2Z)
  • Mixed army battle (MAB) with a random number of various Zerg units, including Baneling, Zergling, Roach, Hydralisk, and Mutalisk

Training Strategies

  • Against handcrafted policies: random (Rand), attack-nearest (AN), hit-and-run (HR)
  • Against its own historic versions: self-play (SP)

Compared Methods

  • IQL: independent Q-learning [Tan, 1993]
  • IAC: independent actor-critic [Foerster et al., 2017]
  • Central-V: centralized value with decentralized policy [Foerster et al., 2017]
  • CommNet: communication net [Sukhbaatar et al., 2016]

Video link: https://youtu.be/LTcr01iTgZA

SLIDE 6

Experiments on Battle Games in StarCraft II

Performance on 5I and 3I2Z
  • Performance against handcrafted policies
  • Performance against each other
SLIDE 7

Experiments on Battle Games in StarCraft II

Transferability on 5I and 3I2Z
  • Directly apply the trained policy to maps with more agents: 10I, 20I, 5I5Z, 10I10Z

Performance on MAB
  • CommNet and Central-V cannot be applied, since they assume a fixed number of agents

Learned Tactics

SLIDE 8

Thanks!

Poster: Pacific Ballroom #243, Jun 11th, 6:30 pm