Remember these? Playing Atari Games using RL VARSHA LALWANI AKSHAY - PowerPoint PPT Presentation

Oct 27, 2023

Remember these?
Playing Atari Games using RL VARSHA LALWANI AKSHAY MASARE
Motivation Maybe we can design game players for each one of them! But how about an AI agent that can learn to play them all? This is where the concept of a general game player comes into the picture. In this project we are trying to implement a deep reinforcement learning based agent to play multiple video games.
Problem Statement Learning to play Breakout using a convolutional neural network model trained with a variant of Q-learning, whose input would be raw pixels and whose output would be a value function estimating future rewards.
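One common way to write the objective behind this "variant of Q-learning" is the squared temporal-difference error of a network with weights θ. This is a sketch of the usual deep Q-learning formulation, not something the slides spell out. The symbols θ (current weights), θ⁻ (weights from an earlier iteration, used to compute the target) and γ (discount factor) are notation assumed here.

```latex
L(\theta) = \mathbb{E}_{(s, a, r, s')} \left[ \left( r + \gamma \max_{a'} Q(s', a'; \theta^{-}) - Q(s, a; \theta) \right)^{2} \right]
```

Minimising this loss pushes the network's estimate for the action taken towards the observed reward plus the discounted best estimate for the next state.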
Concepts Involved • Reinforcement Learning • Q-Learning • Convolutional Neural Network
Reinforcement Learning and Q-Learning In a reinforcement learning model, an agent takes actions in an environment with the goal of maximising a cumulative reward. Q-learning is a model-free form of RL. Algorithm:
Initialize Q(s, a) arbitrarily
Repeat (for each episode):
    Initialize s
    Repeat (for each step of episode):
        Choose a from s using policy derived from Q (e.g. ε-greedy)
        Take action a, observe r, s'
        $Q(s, a) \leftarrow Q(s, a) + \alpha \left[ r + \gamma \max_{a'} Q(s', a') - Q(s, a) \right]$
        $s \leftarrow s'$
    until s is terminal
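The algorithm above can be tried out in tabular form on a toy problem. The sketch below is not the Atari agent. It assumes a hypothetical five-state corridor whose last state is terminal and pays a reward of 1, and the function name `q_learning` and all hyperparameter values are illustrative choices.

```python
import random


def q_learning(n_states=5, episodes=500, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    """Tabular Q-learning on a hypothetical 1-D corridor.

    States are 0..n_states-1; the last state is terminal and entering it pays 1.
    Actions: 0 = move left (blocked at state 0), 1 = move right.
    """
    rng = random.Random(seed)
    # Initialize Q(s, a); zeros are one valid "arbitrary" choice.
    q = [[0.0, 0.0] for _ in range(n_states)]
    for _ in range(episodes):
        s = 0  # initialize s
        while s != n_states - 1:  # until s is terminal
            # Epsilon-greedy policy derived from Q (ties go to action 1).
            if rng.random() < epsilon:
                a = rng.randrange(2)
            else:
                a = 0 if q[s][0] > q[s][1] else 1
            # Take action a, observe r and s'.
            s_next = max(0, s - 1) if a == 0 else s + 1
            r = 1.0 if s_next == n_states - 1 else 0.0
            # Q(s,a) <- Q(s,a) + alpha * [r + gamma * max_a' Q(s',a') - Q(s,a)]
            q[s][a] += alpha * (r + gamma * max(q[s_next]) - q[s][a])
            s = s_next
    return q


q = q_learning()
greedy = ["right" if q_s[1] > q_s[0] else "left" for q_s in q[:-1]]
print(greedy)
```

The terminal state's row is never updated, so it stays at zero and the bootstrap term vanishes on the final step of each episode.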
Convolutional Neural Networks • Suited for extracting features from images • We take 4 images at a time, downscaled to 84x84 pixels • Images taken as 2D matrices • 2D matrices convolved with linear filters • Weight matrices for multiple image
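The preprocessing and filtering steps listed above can be sketched in plain Python. This is an illustration under stated assumptions, not the project's code: the helper names `downscale` and `conv2d` are hypothetical, the 210x160 input is a synthetic array standing in for a game screen, nearest-neighbour resizing stands in for a proper image resize, and the 8x8 averaging filter with stride 4 is an illustrative choice.

```python
from collections import deque


def downscale(frame, out_h=84, out_w=84):
    """Nearest-neighbour resize of a 2D list to out_h x out_w."""
    in_h, in_w = len(frame), len(frame[0])
    return [[frame[i * in_h // out_h][j * in_w // out_w] for j in range(out_w)]
            for i in range(out_h)]


def conv2d(image, kernel, stride=1):
    """'Valid' 2D cross-correlation of a 2D matrix with a linear filter."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = (len(image) - kh) // stride + 1
    out_w = (len(image[0]) - kw) // stride + 1
    return [[sum(image[i * stride + di][j * stride + dj] * kernel[di][dj]
                 for di in range(kh) for dj in range(kw))
             for j in range(out_w)]
            for i in range(out_h)]


# Keep only the 4 most recent preprocessed frames as the network input.
frames = deque(maxlen=4)
for t in range(6):
    # Synthetic grayscale screen (210 rows x 160 columns), for illustration only.
    raw = [[(x + y + t) % 256 for x in range(160)] for y in range(210)]
    frames.append(downscale(raw))

state = list(frames)                       # 4 matrices of 84 x 84
kernel = [[1 / 64] * 8 for _ in range(8)]  # hypothetical 8x8 averaging filter
feature_map = conv2d(state[0], kernel, stride=4)
print(len(state), len(state[0]), len(state[0][0]))
print(len(feature_map), len(feature_map[0]))
```

In a trained network the filter weights are learned rather than fixed, and each layer applies many filters across all four stacked frames.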
Arcade Learning Environment • Built on top of Stella, an open-source Atari 2600 emulator • Written in C++, with support for over 50 games • Accepts player commands programmatically • Outputs an image of the game screen, the score, and the state of the game
Any Questions ??
