Welcome 1 https://www.youtube.com/watch?v=1EpJv34gQ88&t=183s 2 - PowerPoint PPT Presentation

Welcome 1

https://www.youtube.com/watch?v=1EpJv34gQ88&t=183s 2

https://www.youtube.com/watch?v=kVmp0uGtShk&t=55s 3

Solving a Rubik‘s cube with a robotic hand (Learning dexterous manipulations) 4

Outline ● Why you should care ● How to train your robotic hand ● Learning dexterous manipulations 5

Why you should care ● Human hands are awesome ● Custom robot for every task ● Learning to use a humanoid hand would give more freedom 7

How to train your robotic hand ● Imitation Learning ● Simulation https://vcresearch.berkeley.edu/news/berkeley-startup-train-robots-puppets Andrychowicz, Marcin, et al. "Learning dexterous in-hand manipulation." arXiv preprint arXiv:1808.00177 (2018)., Figure 3 left 9

Simulations ● Simulate everything ● Collect a lot of data for training ● Train policy in Sim Akkaya, Ilge, et al. "Solving Rubik's Cube with a Robot Hand.", Figure 7 10

Reinforcement learning ● Learning from mistakes ● Agenct, action, states and reward ● Goal is represented through a function https://en.wikipedia.org/wiki/Reinforcement_learning#/media/File:Reinforcement_learning_diagram.svg 11

Deep Reinforcement learning ● Combine ANNs and RF ● Policy is learned by ANN ● Second ANN for state values https://en.wikipedia.org/wiki/Artificial_neural_network 12

Memory ● Long-short-term-memory (LSTM) ● Well suited for clasification based on time series – Store important information – Can retrieve it ater arbitrary time 13

Domain Randomizations (DR) ● Randomize physical properties of sim environments ● Hand-picked randomizations – Uniform distribution ● Problem: – What is important? – Not that robust 15

Automatic Domain Randomization (ADR) ● Basic Idea: – Automatically change domain randomizations with progress https://openai.com/blog/solving-rubiks-cube/ 16

Automatic Domain Randomization (ADR) ● Changes can be made in: – Cube size – Friction of the hand – Gravity – Brightness – Action delay – Motor backlash Akkaya, Ilge, et al. "Solving Rubik's Cube with a Robot Hand.", Figure 2a 17

Learning dexterous manipulations ● Using ADR ● Train for several months (~13 Thausand years of sim) ● Two networks during training – One to predict value function – One for agent policy 18

Learning dexterous manipulations Akkaya, Ilge, et al. "Solving Rubik's Cube with a Robot Hand.", Figure 12 19

The robotic hand ● The cage with 3 cameras from different angles ● Hand with tactile sensors ● Used CNN for vision Akkaya, Ilge, et al. "Solving Rubik's Cube with a Robot Hand.", Figure 4a 20

Comparisson Akkaya, Ilge, et al. "Solving Rubik's Cube with a Robot Hand.", Table 3 21

How robust is the outcome? Akkaya, Ilge, et al. "Solving Rubik's Cube with a Robot Hand.", Figure 17 22

Comparisson Akkaya, Ilge, et al. "Solving Rubik's Cube with a Robot Hand.", Table 6 npd = nats per dimension, where nat is the natural unit of information 23

But ... ● Not a Rubik‘s Cube but Giiker‘s Cube ● Policy only solved 20% with a ‚fair scramble‘ ● Other robotic hands can solve rubik‘s cube faster ● Solution steps were generated before Akkaya, Ilge, et al. "Solving Rubik's Cube with a Robot Hand.", Figure 13b 24

Thank you https://www.youtube.com/watch?v=QyJGXc9WeNo 25

Questions? 26

Feedback 27

Source https://skymind.ai/wiki/deep-reinforcement-learning ● https://towardsdatascience.com/welcome-to-deep-reinforcement-learning-part-1-dqn-c3cab4d41b6b ● Akkaya, Ilge, et al. "Solving Rubik's Cube with a Robot Hand." ● Andrychowicz, Marcin, et al. "Learning dexterous in-hand manipulation." arXiv preprint arXiv:1808.00177 (2018). ● https://openai.com/blog/solving-rubiks-cube/ ● 28

Welcome 1 https://www.youtube.com/watch?v=1EpJv34gQ88&t=183s 2 - PowerPoint PPT Presentation

Welcome 1 https://www.youtube.com/watch?v=1EpJv34gQ88&t=183s 2 https://www.youtube.com/watch?v=kVmp0uGtShk&t=55s 3 Solving a Rubiks cube with a robotic hand (Learning dexterous manipulations) 4 Outline Why you should care

Welcome back... Welcome back... ..to me. Welcome back... ..to me. Test out Welcome back...

Welcome! Welcome! Welcome! Welcome! Autor:Johann Oberdorfer Autor:Johann Oberdorfer With

WELCOME WELCOME WELCOME WELCOME 85th ANNUAL MEETING 85th ANNUAL MEETING 85th ANNUAL MEETING

WELCOME WELCOME WELCOME WELCOME to our vibrant & small Conservation Village to our vibrant

WELCOME! WELCOME! WELCOME! WELCOME! African American Student Advocates African American

New Student Welcome Day will begin shortly. New Student Welcome Day 1 New Student Welcome Day

Welcome! Welcome! Welcome! Welcome! What will happen today? What will happen today? Lecture

Welcome to the Welcome to the by to the 2017 Opening Welcome to the Opening Meeting Kyle

10 minutes Welcome The presentation will begin in: 9 minutes Welcome The presentation will

Welcome back. Today. Welcome back. Today. Continue Sampling combinatorial structures. Welcome

Welcome Monthly Meeting August 2, 2019 Welcome & Check-in Agenda I. Welcome and

Welcome Quarterly engagement event Welcome and update Dr David Kelly Agenda Welcome and

Kaleidoscope Sensory Storytimes Welcome, welcome everyone, Now youre here lets have some fun.

Registered Charity: 1105351 Welcome! Welcome! Sandgate Primary School Sandgate Primary School

HOUSEKEEPING WELCOME WELCOME | WELCOME SNAPSHOT

Welcome Centre Immigrant Services (Ajax) Hermia Corbette, Ajax Welcome Centre Manager

Overview of ArgonCube for the DUNE Near Detector Jonathan Asaadi and Dan Dwyer On behalf of the

Booleans true: 2 false: 2 Natural numbers zero: N suc: N N recursion! Circle base: S1

Module 4: Building Working with Standard Dimensions Dimensions Using the Basic Level

Non-extendible Latin cuboids Ian Wanless Monash University, Australia D. Bryant, N. J. Cavenagh,

Last Time: Summary. Hypercube Eigenvalues of hypercube. V = { 0 , 1 } d ( x , y ) E when x

VO Ji Ndvornk Czech Technical University in Prague supported by grant LD-15113 of Czech

The Bolzano property and the cube-like complexes Przemys law Tkacz and Marian Turza nski

2.1 Explicit & Implicit Surfaces Hao Li http://cs599.hao-li.com 1 Administrative

Welcome 1 https://www.youtube.com/watch?v=1EpJv34gQ88&t=183s 2 - PowerPoint PPT Presentation

Welcome 1 https://www.youtube.com/watch?v=1EpJv34gQ88&t=183s 2 https://www.youtube.com/watch?v=kVmp0uGtShk&t=55s 3 Solving a Rubiks cube with a robotic hand (Learning dexterous manipulations) 4 Outline Why you should care

Welcome back... Welcome back... ..to me. Welcome back... ..to me. Test out Welcome back...

Welcome! Welcome! Welcome! Welcome! Autor:Johann Oberdorfer Autor:Johann Oberdorfer With

WELCOME WELCOME WELCOME WELCOME 85th ANNUAL MEETING 85th ANNUAL MEETING 85th ANNUAL MEETING

WELCOME WELCOME WELCOME WELCOME to our vibrant &amp; small Conservation Village to our vibrant

WELCOME! WELCOME! WELCOME! WELCOME! African American Student Advocates African American

New Student Welcome Day will begin shortly. New Student Welcome Day 1 New Student Welcome Day

Welcome! Welcome! Welcome! Welcome! What will happen today? What will happen today? Lecture

Welcome to the Welcome to the by to the 2017 Opening Welcome to the Opening Meeting Kyle

10 minutes Welcome The presentation will begin in: 9 minutes Welcome The presentation will

Welcome back. Today. Welcome back. Today. Continue Sampling combinatorial structures. Welcome

Welcome Monthly Meeting August 2, 2019 Welcome &amp; Check-in Agenda I. Welcome and

Welcome Quarterly engagement event Welcome and update Dr David Kelly Agenda Welcome and

Kaleidoscope Sensory Storytimes Welcome, welcome everyone, Now youre here lets have some fun.

Registered Charity: 1105351 Welcome! Welcome! Sandgate Primary School Sandgate Primary School

HOUSEKEEPING WELCOME WELCOME | WELCOME SNAPSHOT

Welcome Centre Immigrant Services (Ajax) Hermia Corbette, Ajax Welcome Centre Manager

Overview of ArgonCube for the DUNE Near Detector Jonathan Asaadi and Dan Dwyer On behalf of the

Booleans true: 2 false: 2 Natural numbers zero: N suc: N N recursion! Circle base: S1

Module 4: Building Working with Standard Dimensions Dimensions Using the Basic Level

Non-extendible Latin cuboids Ian Wanless Monash University, Australia D. Bryant, N. J. Cavenagh,

Last Time: Summary. Hypercube Eigenvalues of hypercube. V = { 0 , 1 } d ( x , y ) E when x

VO Ji Ndvornk Czech Technical University in Prague supported by grant LD-15113 of Czech

The Bolzano property and the cube-like complexes Przemys law Tkacz and Marian Turza nski

2.1 Explicit &amp; Implicit Surfaces Hao Li http://cs599.hao-li.com 1 Administrative

WELCOME WELCOME WELCOME WELCOME to our vibrant & small Conservation Village to our vibrant

Welcome Monthly Meeting August 2, 2019 Welcome & Check-in Agenda I. Welcome and

2.1 Explicit & Implicit Surfaces Hao Li http://cs599.hao-li.com 1 Administrative