Deep Robotic Learning
Sergey Levine, UC Berkeley / Google Brain
(PowerPoint presentation)



SLIDE 1

Deep Robotic Learning

Sergey Levine

UC Berkeley Google Brain

SLIDE 2
SLIDE 3

robotic control pipeline

observations → state estimation (e.g. vision) → modeling & prediction → planning → low-level control → controls

SLIDE 4

standard computer vision: features (e.g. HOG) → mid-level features (e.g. DPM) → classifier (e.g. SVM)

deep learning: end-to-end learned layers replace the hand-designed stages

Felzenszwalb ‘08

robotic control pipeline

observations → state estimation (e.g. vision) → modeling & prediction → planning → low-level control → controls

deep robotic learning

observations → state estimation (e.g. vision) → modeling & prediction → planning → low-level control → controls

end-to-end training across the entire pipeline
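The staged pipeline and end-to-end training contrasted above can be sketched in code; this is a toy illustration, and every function, layer size, and constant here is hypothetical rather than taken from the talk:

```python
import numpy as np

rng = np.random.default_rng(0)

def modular_policy(image):
    """Staged pipeline: each box is designed and tuned separately."""
    state = image.mean(axis=(0, 1))        # state estimation (e.g. vision)
    prediction = 0.9 * state               # modeling & prediction
    plan = prediction - 0.1                # planning
    return np.clip(plan, -1.0, 1.0)        # low-level control -> controls

class EndToEndPolicy:
    """One network from raw observations to controls; every weight is
    trained against the final task objective instead of a per-stage one."""
    def __init__(self, obs_dim, act_dim, hidden=32):
        self.w1 = rng.normal(0.0, 0.1, (obs_dim, hidden))
        self.w2 = rng.normal(0.0, 0.1, (hidden, act_dim))

    def __call__(self, obs):
        return np.tanh(np.tanh(obs @ self.w1) @ self.w2)

image = rng.random((8, 8, 3))              # stand-in camera observation
pipeline_controls = modular_policy(image)
e2e_controls = EndToEndPolicy(obs_dim=image.size, act_dim=3)(image.ravel())
```

The point of the sketch is structural: the pipeline exposes hand-designed interfaces between stages, while the end-to-end policy exposes none, so training can shape its internal representation for the task.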

SLIDE 5

no direct supervision; actions have consequences

SLIDE 6

1. Does end-to-end learning produce better sensorimotor skills?
2. Can we apply sensorimotor skill learning to a wide variety of robots & tasks?
3. Can we scale up deep robotic learning and produce skills that generalize?
4. How can we learn safely and efficiently in safety-critical domains?
5. Can we transfer skills from simulation to the real world, and from one robot to another?

SLIDE 7

1. Does end-to-end learning produce better sensorimotor skills?
2. Can we apply sensorimotor skill learning to a wide variety of robots & tasks?
3. Can we scale up deep robotic learning and produce skills that generalize?
4. How can we learn safely and efficiently in safety-critical domains?
5. Can we transfer skills from simulation to the real world, and from one robot to another?

SLIDE 8

Chelsea Finn

SLIDE 9

pose prediction (trained on pose only): 0% success rate
end-to-end training: 96.3% success rate

L.*, Finn*, Darrell, Abbeel, ‘16

SLIDE 10

1. Does end-to-end learning produce better sensorimotor skills?
2. Can we apply sensorimotor skill learning to a wide variety of robots & tasks?
3. Can we scale up deep robotic learning and produce skills that generalize?
4. How can we learn safely and efficiently in safety-critical domains?
5. Can we transfer skills from simulation to the real world, and from one robot to another?

SLIDE 11

Deep Robotic Learning Applications

  • manipulation, locomotion (with N. Wagener, P. Abbeel; V. Kumar, A. Gupta, E. Todorov; V. Koltun)
  • aerial vehicles (with G. Kahn, T. Zhang, P. Abbeel)
  • tensegrity robot (with X. Geng, M. Zhang, J. Bruce, K. Caluwaerts, M. Vespignani, V. SunSpiral, P. Abbeel)
  • dexterous hands, soft hands (with C. Eppner, A. Gupta, P. Abbeel)

SLIDE 12

1. Does end-to-end learning produce better sensorimotor skills?
2. Can we apply sensorimotor skill learning to a wide variety of robots & tasks?
3. Can we scale up deep robotic learning and produce skills that generalize?
4. How can we learn safely and efficiently in safety-critical domains?
5. Can we transfer skills from simulation to the real world, and from one robot to another?

SLIDE 13

ingredients for success in learning:

supervised learning:      computation ✓   algorithms ✓   data ✓
learning robotic skills:  computation ✓   algorithms ~   data ?

SLIDE 14

Grasping with Learned Hand-Eye Coordination

setup: monocular RGB camera, 7 DoF arm, 2-finger gripper, object, bin

  • monocular camera (no depth)
  • no camera calibration either
  • 2-5 Hz update
  • continuous arm control
  • servo the gripper to target
  • fix mistakes
  • no prior knowledge

L., Pastor, Krizhevsky, Quillen ‘16

Peter Pastor Alex Krizhevsky Deirdre Quillen
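The continuous servoing idea above (score candidate gripper motions with a learned network, pick the best, re-observe, repeat at 2-5 Hz) can be sketched roughly as follows; the scoring function and the cross-entropy-style search here are simplified, invented stand-ins, not the paper's implementation:

```python
import numpy as np

rng = np.random.default_rng(1)

def grasp_score(image_features, motion):
    """Hypothetical stand-in for the learned network that scores a
    candidate gripper motion given the current monocular image."""
    target = image_features[:3]            # pretend the features encode the object
    return -np.linalg.norm(motion - target)

def servo_step(image_features, n_samples=64, n_elite=6, n_iters=3):
    """One servoing step: search over candidate motions with a
    cross-entropy-style loop and return the best command found."""
    mean, std = np.zeros(3), np.ones(3)
    for _ in range(n_iters):
        candidates = rng.normal(mean, std, size=(n_samples, 3))
        scores = np.array([grasp_score(image_features, c) for c in candidates])
        elite = candidates[np.argsort(scores)[-n_elite:]]
        mean, std = elite.mean(axis=0), elite.std(axis=0) + 1e-6
    return mean

features = rng.random(8)                   # stand-in image features
command = servo_step(features)             # command to servo the gripper
```

Because the policy re-plans from a fresh image every step, it can fix mistakes mid-grasp, which is the behavior the bullet list above highlights.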

SLIDE 15

Grasping Experiments

SLIDE 16

Policy Learning with Multiple Robots

alternate between: rollout execution → local policy optimization → global policy optimization

Mrinal Kalakrishnan Yevgen Chebotar Adrian Li Ali Yahya

SLIDE 17

Yahya, Li, Kalakrishnan, Chebotar, L., ‘16
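A heavily simplified sketch of the alternation named above: each robot optimizes a local policy from its own rollouts, and a shared global policy is fit to all of them, pooling experience across robots. Every numeric detail below is invented for illustration (the local policies are reduced to scalars):

```python
import numpy as np

rng = np.random.default_rng(5)

def local_policy_update(params, robot_data):
    """Each robot improves its own local policy from its own rollouts
    (here just a toy nudge toward the mean of its data)."""
    return 0.9 * params + 0.1 * robot_data.mean()

def global_policy_fit(all_local_params):
    """The shared global policy is fit with supervised learning to
    match every robot's local policy (reduced here to a plain average
    of toy scalar parameters)."""
    return float(np.mean(all_local_params))

# four robots, each facing a slightly different task instance
local_params = [rng.normal(loc=i, scale=0.1) for i in range(4)]
for _ in range(10):                        # alternate the three stages
    rollouts = [rng.normal(loc=i, scale=0.5, size=20) for i in range(4)]
    local_params = [local_policy_update(p, d)
                    for p, d in zip(local_params, rollouts)]
    global_params = global_policy_fit(local_params)
```

The design point is that rollout collection parallelizes across robots, so wall-clock learning time shrinks as robots are added.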

SLIDE 18

Policy Learning with Multiple Robots: Deep RL with NAF

Gu*, Holly*, Lillicrap, L., ‘16

Shane Gu Ethan Holly Tim Lillicrap
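NAF (normalized advantage functions) makes Q-learning tractable with continuous actions by restricting the advantage to a quadratic in the action, Q(x, u) = V(x) - 1/2 (u - mu(x))^T P(x) (u - mu(x)), so the greedy action is simply mu(x) and no inner maximization is needed. A minimal numeric sketch (toy numbers; in the real method V, mu, and the Cholesky factor L are all outputs of one neural network):

```python
import numpy as np

def naf_q(state_value, mu, L, u):
    """NAF decomposition Q(x,u) = V(x) + A(x,u), where the advantage is
    a negative-definite quadratic in the action, maximized at u = mu."""
    P = L @ L.T                            # positive semi-definite precision
    diff = u - mu
    advantage = -0.5 * diff @ P @ diff
    return state_value + advantage

mu = np.array([0.2, -0.1])                 # greedy action for this state
L = np.array([[1.0, 0.0], [0.3, 0.8]])     # lower-triangular factor of P
q_at_mu = naf_q(1.5, mu, L, mu)            # advantage is zero at u = mu
q_off = naf_q(1.5, mu, L, mu + 0.5)        # any other action scores lower
```

Parameterizing P through its Cholesky factor L guarantees the quadratic is concave in u, which is what makes the argmax closed-form.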

SLIDE 19

Learning a Predictive Model of Natural Images

original video vs. video predictions

Chelsea Finn

SLIDE 20

1. Does end-to-end learning produce better sensorimotor skills?
2. Can we apply sensorimotor skill learning to a wide variety of robots & tasks?
3. Can we scale up deep robotic learning and produce skills that generalize?
4. How can we learn safely and efficiently in safety-critical domains?
5. Can we transfer skills from simulation to the real world, and from one robot to another?

SLIDE 21

Safe Uncertainty-Aware Learning

unknown environment

1. Learn a collision prediction model: raw image → neural network ensemble → command velocities
2. Speed-dependent, uncertainty-aware collision cost
3. Iteratively train with on-policy samples

Key idea: to learn about collisions, one must experience collisions (but safely!)

Kahn, Pong, Abbeel, L. '16 (Greg Kahn)
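A rough sketch of the speed-dependent, uncertainty-aware cost idea: penalize the mean predicted collision probability plus the ensemble's disagreement, scaled by commanded speed, so that uncertain predictions only permit cautious, low-speed motion. This is illustrative; the paper's exact cost may differ:

```python
import numpy as np

def collision_cost(ensemble_probs, speed, risk_weight=1.0):
    """Combine mean predicted collision probability with ensemble
    disagreement, scaled by speed: high uncertainty at high speed is
    expensive, while slow motion under uncertainty stays cheap."""
    mean = ensemble_probs.mean()
    disagreement = ensemble_probs.std()
    return speed * (mean + risk_weight * disagreement)

# three ensemble members score the same candidate velocity command
confident_safe = np.array([0.05, 0.06, 0.04])
uncertain = np.array([0.05, 0.60, 0.30])

cautious = collision_cost(confident_safe, speed=1.0)
risky = collision_cost(uncertain, speed=1.0)
slow_risky = collision_cost(uncertain, speed=0.2)
```

This is what makes the collisions experienced during training safe: the cost pushes the robot to slow down exactly where its model disagrees with itself.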

SLIDE 22

Safe Uncertainty-Aware Learning

Kahn, Pong, Abbeel, L. ‘16

SLIDE 23

1. Does end-to-end learning produce better sensorimotor skills?
2. Can we apply sensorimotor skill learning to a wide variety of robots & tasks?
3. Can we scale up deep robotic learning and produce skills that generalize?
4. How can we learn safely and efficiently in safety-critical domains?
5. Can we transfer skills from simulation to the real world, and from one robot to another?

SLIDE 24

Training in Simulation: CAD2RL

Sadeghi, L. '16 (Fereshteh Sadeghi)

SLIDE 25

Training in Simulation: CAD2RL

Sadeghi, L. ‘16

SLIDE 26

Training in Simulation: CAD2RL

Sadeghi, L. ‘16

SLIDE 27

Sadeghi, L. ‘16
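Training entirely in simulation relies on randomizing the rendered scenes so that the policy never overfits to any one rendering and treats the real world as just another variation. A schematic sketch of that randomization step (the field names and ranges below are invented for illustration):

```python
import numpy as np

rng = np.random.default_rng(3)

def sample_simulated_scene():
    """Draw one randomized synthetic training scene: textures, lighting,
    and layout all vary, so no two renderings look alike."""
    return {
        "wall_texture": int(rng.integers(0, 50)),   # texture index
        "lighting": float(rng.uniform(0.3, 1.5)),   # brightness scale
        "hallway_width": float(rng.uniform(2.0, 5.0)),  # meters
        "obstacles": int(rng.integers(0, 6)),       # clutter count
    }

# a training run draws a fresh scene for every episode
scenes = [sample_simulated_scene() for _ in range(1000)]
```

The breadth of the distribution matters more than the realism of any single sample: a policy that flies in all of these hallways has a chance of flying in a real one.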

SLIDE 28

Learning with Transfer in Mind: Ensemble Policy Optimization (EPOpt)

train / test / adapt: training on a single torso mass vs. training on a model ensemble; unmodeled effects → ensemble adaptation

Aravind Rajeswaran
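The ensemble objective can be sketched as evaluating the policy on many sampled models and optimizing only the worst-performing fraction (a CVaR-style objective), which rewards robustness to model error over performance on one nominal model. The dynamics "simulator" and return function below are invented toy stand-ins:

```python
import numpy as np

rng = np.random.default_rng(4)

def rollout_return(torso_mass, policy_gain):
    """Invented toy simulator: a policy tuned to one torso mass degrades
    quadratically as the true mass moves away from that value."""
    return 10.0 - (policy_gain - torso_mass) ** 2

def epopt_objective(policy_gain, masses, worst_frac=0.2):
    """Average the returns of the worst-performing fraction of the
    model ensemble, rather than the mean over all models."""
    returns = np.sort([rollout_return(m, policy_gain) for m in masses])
    k = max(1, int(worst_frac * len(masses)))
    return returns[:k].mean()

masses = rng.uniform(0.8, 1.2, size=100)   # ensemble of torso masses
tuned_to_one_model = epopt_objective(policy_gain=0.8, masses=masses)
robust_to_ensemble = epopt_objective(policy_gain=1.0, masses=masses)
```

A policy centered on the ensemble scores better under the worst-case objective than one tuned to a single mass at the edge of the distribution, which is the train/test/adapt story the slide tells.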

SLIDE 29

1. Does end-to-end learning produce better sensorimotor skills?
2. Can we apply sensorimotor skill learning to a wide variety of robots & tasks?
3. Can we scale up deep robotic learning and produce skills that generalize?
4. How can we learn safely and efficiently in safety-critical domains?
5. Can we transfer skills from simulation to the real world, and from one robot to another?
6. How can we get sufficient supervision to learn in unstructured real-world environments?

SLIDE 30

Learning what Success Means

can we learn the goal with visual features?

Finn, Abbeel, L. ‘16

SLIDE 31

Learning what Success Means

Sermanet, Xu, L. ‘16

SLIDE 32

ingredients for success in learning:

supervised learning:      computation ✓   algorithms ✓   data ✓
learning robotic skills:  computation ✓   algorithms ~   data ?

SLIDE 33

Announcement: new conference. Conference on Robot Learning (CoRL), www.robot-learning.org

Goal: bring together robotics & machine learning in a focused conference format

Conference: November 2017; papers deadline: late June 2017

Steering committee: Ken Goldberg (UC Berkeley), Sergey Levine (UC Berkeley), Vincent Vanhoucke (Google), Abhinav Gupta (CMU), Stefan Schaal (USC, MPI), Michael I. Jordan (UC Berkeley), Raia Hadsell (DeepMind), Dieter Fox (UW), Joelle Pineau (McGill), J. Andrew Bagnell (CMU), Aude Billard (EPFL), Stefanie Tellex (Brown), Minoru Asada (Osaka), Wolfram Burgard (Freiburg), Pieter Abbeel (UC Berkeley)

Collaborators: Chelsea Finn, Peter Pastor, Alex Krizhevsky, Deirdre Quillen, Mrinal Kalakrishnan, Yevgen Chebotar, Adrian Li, Ali Yahya, Shane Gu, Ethan Holly, Tim Lillicrap, Greg Kahn, Fereshteh Sadeghi, Aravind Rajeswaran, Pieter Abbeel, Trevor Darrell