Deep Robotic Learning
Sergey Levine
UC Berkeley Google Brain
Deep Robotic Learning Sergey Levine UC Berkeley Google Brain - - PowerPoint PPT Presentation
Deep Robotic Learning Sergey Levine UC Berkeley Google Brain robotic state low-level modeling & control observations estimation planning controls control prediction (e.g. vision) pipeline standard classifier features
Sergey Levine
UC Berkeley Google Brain
robotic control pipeline
state estimation (e.g. vision) modeling & prediction planning low-level control controls
standard computer vision features (e.g. HOG) mid-level features (e.g. DPM) classifier (e.g. SVM) deep learning
Felzenszwalb ‘08
robotic control pipeline
state estimation (e.g. vision) modeling & prediction planning low-level control controls
deep robotic learning
state estimation (e.g. vision) modeling & prediction planning low-level control controls
end-to-end training end-to-end training
sensorimotor skills?
wide variety of robots & tasks?
produce skills that generalize?
safety-critical domains?
real world, and from one robot to another?
sensorimotor skills?
wide variety of robots & tasks?
produce skills that generalize?
safety-critical domains?
real world, and from one robot to another?
Chelsea Finn
end-to-end training 0% success rate 96.3% success rate pose prediction
(trained on pose only)
L.*, Finn*, Darrell, Abbeel, ‘16
sensorimotor skills?
wide variety of robots & tasks?
produce skills that generalize?
safety-critical domains?
real world, and from one robot to another?
Deep Robotic Learning Applications manipulation locomotion
with N. Wagener, P. Abbeel with V. Kumar, A. Gupta, E. Todorov with V. Koltun
aerial vehicles
with G. Kahn, T. Zhang, P. Abbeel
tensegrity robot
with X. Geng, M. Zhang, J. Bruce, K. Caluwaerts,
dexterous hands
with C. Eppner, A. Gupta, P. Abbeel
soft hands
sensorimotor skills?
wide variety of robots & tasks?
produce skills that generalize?
safety-critical domains?
real world, and from one robot to another?
ingredients for success in learning:
supervised learning: learning robotic skills: computation algorithms data computation algorithms
monocular RGB camera 7 DoF arm 2-finger gripper
bin
Grasping with Learned Hand-Eye Coordination
L., Pastor, Krizhevsky, Quillen ‘16
Peter Pastor Alex Krizhevsky Deirdre Quillen
Grasping Experiments
Policy Learning with Multiple Robots
Local policy optimization Global policy optimization Rollout execution
Mrinal Kalakrishnan Yevgen Chebotar Adrian Li Ali Yahya
Yahya, Li, Kalakrishnan, Chebotar, L., ‘16
Policy Learning with Multiple Robots: Deep RL with NAF
Gu*, Holly*, Lillicrap, L., ‘16
Shane Gu Ethan Holly Tim Lillicrap
Learning a Predictive Model of Natural Images
video predictions
Chelsea Finn
sensorimotor skills?
wide variety of robots & tasks?
produce skills that generalize?
safety-critical domains?
real world, and from one robot to another?
unknown environment
command velocities raw image neural network ensemble
collision cost Key idea: To learn about collisions, must experience collisions (but safely!)
Safe Uncertainty-Aware Learning
Kahn, Pong, Abbeel, L. ‘16 Greg Kahn
Safe Uncertainty-Aware Learning
Kahn, Pong, Abbeel, L. ‘16
sensorimotor skills?
wide variety of robots & tasks?
produce skills that generalize?
safety-critical domains?
real world, and from one robot to another?
Training in Simulation: CAD2RL
Sadeghi, L. ‘16 Fereshteh Sadeghi
Training in Simulation: CAD2RL
Sadeghi, L. ‘16
Training in Simulation: CAD2RL
Sadeghi, L. ‘16
Sadeghi, L. ‘16
Learning with Transfer in Mind: Ensemble Policy Optimization (EPOpt)
train test adapt training on single torso mass training on model ensemble unmodeled effects ensemble adaptation
Aravind Rajeswaran
sensorimotor skills?
wide variety of robots & tasks?
produce skills that generalize?
safety-critical domains?
real world, and from one robot to another?
in unstructured real-world environments?
Learning what Success Means
Finn, Abbeel, L. ‘16
Learning what Success Means
Sermanet, Xu, L. ‘16
ingredients for success in learning:
supervised learning: learning robotic skills: computation algorithms data computation algorithms
Announcement: New Conference Conference on Robotic Learning (CoRL) www.robot-learning.org
Goal: bring together robotics & machine learning in a focused conference format
Conference: November 2017 Papers deadline: late June 2017
Steering committee: Ken Goldberg (UC Berkeley), Sergey Levine (UC Berkeley), Vincent Vanhoucke (Google), Abhinav Gupta (CMU), Stefan Schaal (USC, MPI), Michael I. Jordan (UC Berkeley), Raia Hadsell (DeepMind), Dieter Fox (UW), Joelle Pineau (McGill), J. Andrew Bagnell (CMU), Aude Billard (EPFL), Stefanie Tellex (Brown), Minoru Asada (Osaka), Wolfram Burgard (Freiburg), Pieter Abbeel (UC Berkeley) Chelsea Finn
Peter Pastor Alex Krizhevsky Deirdre Quillen Mrinal Kalakrishnan Yevgen Chebotar Adrian Li Ali Yahya Shane Gu Ethan Holly Tim Lillicrap
Greg Kahn Fereshteh Sadeghi Aravind Rajeswaran Pieter Abbeel Trevor Darrell