osu!mania Reinforcement Learning Agent

Feb 13, 2023

osu!mania Reinforcement Learning Agent
Χρυσομάλλης Ιάσων
ichrysomallis@isc.tuc.gr
2014030078

Contents
• Introduction
• osu!mania game
• Graphical User Interface customization
• Agent’s environment
• Approach and variable definition
• Q-learning
• Deep reinforcement learning
• Future plans

Introduction
Topic: Develop an agent that can learn to play the video game osu!mania through reinforcement learning.
Two agents:
• Q-learning agent
• Deep reinforcement learning agent

osu!mania game
Rhythm game in which notes fall down the screen.
Numbered screen elements:
1. Single-tap notes
2. Hold notes
3. Judgment bar
4. Player keys
5. Combo
6. Hitburst
7. Overall accuracy
8. Score

Graphical User Interface customization
• The environment is fully customizable; every element can be changed.
• Each element is painted in a solid color, RGB = [X, 100, 100], where X identifies the element (see the numbered screen elements).

Agent’s environment
• Screenshots are recorded and translated into game information based on the RGB values assigned to each element.
• Only a small fraction of the screen carries relevant information, so only specific boxes are recorded.

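The color scheme above makes pixel decoding a simple lookup on the red channel. The following is a minimal sketch only: the slides do not list the actual X value used for each element, so the `RED_TO_ELEMENT` table and the function name `classify_pixel` are hypothetical.

```python
# Hypothetical red-channel values; the slides do not state the real X per element.
RED_TO_ELEMENT = {
    10: "single-tap note",
    20: "hold note",
    30: "judgment bar",
}


def classify_pixel(rgb):
    """Return the element name for a pixel painted as [X, 100, 100], else None."""
    r, g, b = rgb
    # Only pixels with the fixed green and blue values belong to a skinned element.
    if g != 100 or b != 100:
        return None
    return RED_TO_ELEMENT.get(r)


if __name__ == "__main__":
    print(classify_pixel((20, 100, 100)))
    print(classify_pixel((20, 0, 0)))
```

Because green and blue are fixed, the red value alone carries the element's identity, which is consistent with the later choice to keep only the red layer in the state.
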
Approach and variable definition (1)
• Behavior is identical in every column, so the problem can be narrowed down to learning a single column.
• Agent’s actions:
1. Instantaneous key tap
2. Key press (no release)
3. Key release
4. Do nothing

Approach and variable definition (2)
• Rewards: [figure not preserved]
• Epsilon:
o Initial value = 1
o Decay value = 0.9977
o Minimum value = 0.01

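The epsilon settings above can be sketched as a decay schedule. The three constants come from the slide; that the decay is multiplicative and applied once per episode is an assumption, since the slides do not say how or when it is applied.

```python
EPSILON_START = 1.0     # initial value from the slide
EPSILON_DECAY = 0.9977  # decay value from the slide
EPSILON_MIN = 0.01      # minimum value from the slide


def decay_epsilon(epsilon):
    """Apply one multiplicative decay step (assumed), clamped at the minimum."""
    return max(EPSILON_MIN, epsilon * EPSILON_DECAY)


def steps_until_minimum():
    """Count decay steps needed for epsilon to fall from the start value to the floor."""
    epsilon, steps = EPSILON_START, 0
    while epsilon > EPSILON_MIN:
        epsilon = decay_epsilon(epsilon)
        steps += 1
    return steps


if __name__ == "__main__":
    print(steps_until_minimum())
```

Under that assumption, epsilon reaches its floor after roughly 2,000 decay steps, so the agent would explore heavily early on and act almost entirely greedily afterwards.
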
Approach and variable definition (3)
• State:
o One column of 200 pixels
o Only the red (R) layer
o Three possible values per pixel (no note, single-tap note, hold note)
• Deep reinforcement learning:
o Raw input of the column
• Q-learning:
o Only 8 pixels, because of state complexity, taking one pixel every 15 pixels of the recorded column

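The reduced Q-learning state can be sketched as a strided sample of the column. The stride of 15 and the count of 8 come from the slide; which part of the 200-pixel column the 8 samples cover is not stated, so starting at index 0 is an assumption, as are the function name and the 0/1/2 value encoding.

```python
STEP = 15     # one pixel every 15 pixels (from the slide)
SAMPLES = 8   # number of pixels kept for Q-learning (from the slide)

# Hypothetical encoding of the three pixel values.
NO_NOTE, SINGLE_TAP, HOLD = 0, 1, 2


def q_learning_state(column, step=STEP, samples=SAMPLES, offset=0):
    """Reduce a recorded column to a small hashable tuple usable as a Q-table key."""
    return tuple(column[offset::step][:samples])


if __name__ == "__main__":
    column = [NO_NOTE] * 200
    column[30] = SINGLE_TAP
    print(q_learning_state(column))
    # With 3 values per pixel, 8 pixels give 3**8 possible states.
    print(3 ** SAMPLES)
```

Eight three-valued pixels yield 3^8 = 6,561 possible states, which a table can hold, whereas the raw 200-pixel column would give 3^200.
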
Q-learning
• Algorithm: [figure not preserved]
• Steps:
o Receive current state
o Choose an action based on epsilon
o Execute the action
o Receive new state
o Check if song is over
o Update Q-table

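The steps above match the standard tabular Q-learning loop, sketched here with epsilon-greedy action selection. The slide's own algorithm figure is not available, so this uses the textbook update rule; the learning rate `ALPHA`, discount `GAMMA`, and all names are hypothetical, and the reward is passed in because the slides' reward values are not preserved.

```python
import random
from collections import defaultdict

ACTIONS = ("tap", "press", "release", "nothing")  # the four actions from the slides
ALPHA = 0.1   # hypothetical learning rate
GAMMA = 0.95  # hypothetical discount factor

# Q-table: state (hashable tuple) -> one value per action, initialised to zero.
q_table = defaultdict(lambda: [0.0] * len(ACTIONS))


def choose_action(state, epsilon, rng=random):
    """Epsilon-greedy: explore with probability epsilon, otherwise act greedily."""
    if rng.random() < epsilon:
        return rng.randrange(len(ACTIONS))
    values = q_table[state]
    return values.index(max(values))


def update_q(state, action, reward, next_state, done):
    """Standard Q-learning update; no bootstrapping once the song is over."""
    best_next = 0.0 if done else max(q_table[next_state])
    target = reward + GAMMA * best_next
    q_table[state][action] += ALPHA * (target - q_table[state][action])
```

The `done` flag corresponds to the "check if song is over" step: on the final transition the target is the reward alone.
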
Deep reinforcement learning
• Neural network model (Keras): [figure not preserved]
• Steps: identical to the Q-learning steps except for the last one. Instead of updating a Q-table, the agent saves each transition in a temporary memory and trains the model on a smaller, randomly selected sample (batch) of those transitions.

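The temporary memory and random batch described above can be sketched as a simple replay buffer. The memory capacity, batch size, and function names are hypothetical, and the Keras training call is left out because the model itself is not preserved in the slides.

```python
import random
from collections import deque

MEMORY_SIZE = 10_000  # hypothetical capacity
BATCH_SIZE = 32       # hypothetical batch size

# Oldest transitions are discarded automatically once the memory is full.
memory = deque(maxlen=MEMORY_SIZE)


def remember(state, action, reward, next_state, done):
    """Store one transition in the temporary memory."""
    memory.append((state, action, reward, next_state, done))


def sample_batch(batch_size=BATCH_SIZE, rng=random):
    """Return a random batch of stored transitions, or [] if too few are stored."""
    if len(memory) < batch_size:
        return []
    return rng.sample(list(memory), batch_size)
```

Each sampled batch would then be turned into inputs and target Q-values for one training step of the network.
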
Results
[Figures not preserved: Q-learning agent; DQN agent]

Future plans
• Try different combinations of neural network model layers
• Design the neural network model in TensorFlow
• Run the agent on a GPU instead of the CPU
• Make use of a high-end computer
