ECE 598 SG Special Topics in Learning-based Robotics Saurabh Gupta - - PowerPoint PPT Presentation

ece 598 sg special topics in learning based robotics
SMART_READER_LITE
LIVE PREVIEW

ECE 598 SG Special Topics in Learning-based Robotics Saurabh Gupta - - PowerPoint PPT Presentation

ECE 598 SG Special Topics in Learning-based Robotics Saurabh Gupta Assistant Prof. (ECE, CS, CSL) Today, we will Course outline Course logistics Get to know each other Understand how we can build intelligent machines Source:


slide-1
SLIDE 1

ECE 598 SG Special Topics in Learning-based Robotics

Saurabh Gupta Assistant Prof. (ECE, CS, CSL)

slide-2
SLIDE 2

Today, we will…

  • Course outline
  • Course logistics
  • Get to know each other
slide-3
SLIDE 3

Understand how we can build intelligent machines

Source: The Atlantic

slide-4
SLIDE 4

Understand how we can build intelligent machines

… that can favorably change the state of the physical world around them.

slide-5
SLIDE 5

Understand how we can build intelligent machines

… that can favorably change the state of the physical world around them.

Video credit: Boston Dynamics, CNN

slide-6
SLIDE 6

Yet, Research Robots Keep Falling…

Video credit: IEEE Spectrum. DARPA Robotics Challenge Finals 2015.

slide-7
SLIDE 7

State-of-the-art Results in Object Pushing

Video credit: Pulkit Agrawal 2016

slide-8
SLIDE 8

Understand how we can build intelligent machines

Video credit: Pieter Abbeel

Household Robots

Understand how far are we from making this PR1 showcase a reality.

… that can favorably change the state of cluttered real world environments to solve a variety of tasks.

slide-9
SLIDE 9

Goals of the Course

  • Understand state-of-the-art in robotics and robot learning
  • Formulate robot learning problems as MDPs
  • Investigate alternative ways of solving MDPs
  • Applying these techniques to solve robotic tasks
slide-10
SLIDE 10

person, motorcycle, car, chair person, motorcycle, car, chair

Successes in Computer Vision “in the Wild”

Image Labeling Tasks

  • K. He et al. Mask R-CNN ICCV 2017
slide-11
SLIDE 11

Successes in Computer Vision “in the Wild”

Shape and Pose Estimation for Objects and Humans

  • S. Goel et al. Shape and

Viewpoint without Keypoints. ECCV 2020

  • A. Kanawaza et al. End-to-end Recovery of Human Shape and Pose. CVPR 2018
slide-12
SLIDE 12

Factors Leading to Success in Computer Vision

  • A. Krizhevsky et al. ImageNet Classification with Deep Convolutional Neural Networks. NIPS 2012
  • J. Deng et al. ImageNet: A Large-Scale Hierarchical Image Database. CVPR 2009

Large-scale labeled data Hand-crafted features to End-to-end trained features

slide-13
SLIDE 13

Factors Leading to Success in Computer Vision

  • A. Krizhevsky et al. ImageNet Classification with Deep Convolutional Neural Networks. NIPS 2012

Hand-crafted features

features (e.g. HOG) classifier (e.g. SVM) mid-level features (e.g. DPM)

Felzenszwalb et al.

end-to-end training

cat End-to-end trained features

Can large-scale learning enable robots to execute a variety of tasks in cluttered real-world environments?

slide-14
SLIDE 14

Robotic Tasks

Navigation

Robot with a first person camera Dropped into a novel environment Navigate around

“Go 300 feet North, 400 feet East”

Goal

“Go Find a Chair”

slide-15
SLIDE 15

Robotic Tasks

Manipulation

slide-16
SLIDE 16

Typical Classical Robotics Pipeline

Observations State Estimation Planning Low-level Controller Control

Slide adapted from S. Levine.

slide-17
SLIDE 17

Typical Classical Robotics Pipeline

Observations State Estimation Planning Low-level Controller Control

6DOF Pose Grasp Motion Planning Observed Images

Manipulation

Slide adapted from S. Levine.

slide-18
SLIDE 18

Typical Classical Robotics Pipeline

Observations State Estimation Planning Low-level Controller Control Observations Control

end-to-end training

But why?

Slide adapted from S. Levine.

slide-19
SLIDE 19

Robot Navigation

Robot with a first person camera Dropped into a novel environment Navigate around

“Go 300 feet North, 400 feet East”

Goal

“Go Find a Chair”

slide-20
SLIDE 20

Mapping Planning

Observed Images

Path Plan Geometric Reconstruction

Hartley and Zisserman. 2000. Multiple View Geometry in Computer Vision Thrun, Burgard, Fox. 2005. Probabilistic Robotics

  • Canny. 1988. The complexity of robot motion planning.

Kavraki et al. RA1996. Probabilistic roadmaps for path planning in high-dimensional configuration spaces. Lavalle and Kuffner. 2000. Rapidly-exploring random trees: Progress and prospects. Video Credits: Mur-Artal et al., Palmieri et al.

Observations State Estimation Planning Low-level Controller Control

slide-21
SLIDE 21

Geometric 3D Reconstruction of the World

Do we need to tediously reconstruct everything on this table?

Video Credit: Mur-Artal and Tardos, TRobotics 2016. ORB-SLAM2: an Open-Source SLAM System for Monocular, Stereo and RGB-D Cameras.

Unnecessary

slide-22
SLIDE 22

Geometric 3D Reconstruction of the World

Can’t speculate about space not directly observed.

Insufficient

slide-23
SLIDE 23

Geometric 3D Reconstruction of the World

Can’t exploit patterns in layout of indoor spaces.

Insufficient

slide-24
SLIDE 24

Geometric 3D Reconstruction of the World

Can’t exploit patterns in layout of indoor spaces.

Insufficient

slide-25
SLIDE 25

Visual Learning vs Robot Learning

  • A. Krizhevsky et al. ImageNet Classification with Deep Convolutional Neural Networks. NIPS 2012

Hand-crafted features

features (e.g. HOG) classifier (e.g. SVM) mid-level features (e.g. DPM)

Felzenszwalb et al.

end-to-end training

cat End-to-end trained features

slide-26
SLIDE 26

Visual Learning vs Robot Learning

  • How do we get supervision?
  • Non-stationarity
  • Exploration vs exploitation
slide-27
SLIDE 27

Formalism for Modeling Behavior

Reinforcement Learning

slide-28
SLIDE 28

Markov Decision Process

st

  • t

at p(st+1|st, at) rt = R(st+1, st, at) st+1

  • t+1

at+1 st+2 … p(st+2|st+1, at+1) rt+1 = R(st+2, st+1, at+1)

Transition Function Reward Function Goal

argmaxa0,…,aT∑

t

γtrt

  • t

at

Step Back

  • t+1

st+1 st

3D Relative Pose

Transition Function How you move, how the tiger moves? Reward Function Survived?

slide-29
SLIDE 29

Goals of the Course

  • Understand state-of-the-art in robotics and robot learning
  • Formulate robot learning problems as MDPs
  • Investigate alternative ways of solving MDPs
  • Applying these techniques to solve robotic tasks
slide-30
SLIDE 30

Challenges with Markov Decision Process

st

  • t

at st+1 p(st+1|st, at) rt = R(st+1, st, at) st+1

  • t+1

at+1 st+2 … p(st+2|st+1, at+1) rt+1 = R(st+2, st+1, at+1)

Transition Function Reward Function Goal

argmaxa0,…,aT∑

t

γtrt

  • t

at

Step Back

  • t+1

st+1 st

3D Relative Pose

Transition Function How you move, how the tiger moves? Reward Function Survived?

Need to live many many lives to learn how to live.

slide-31
SLIDE 31

Credit assignment problem in RL

  • t

B B B B B F B B B B B B B B B B F B B B B B B F B B

Yann LeCun’s Cake

slide-32
SLIDE 32

Alternatives to Solving MDPs

Pieter Abbeel’s Cake

  • M. Andrychowicz et al. Hindsight Experience Replay. NeurIPS 2018.
slide-33
SLIDE 33

Solve a Related but Supervision-rich Problem

  • S. Levine et al. Learning Hand-Eye Coordination for Robotic Grasping with Deep Learning and Large-

Scale Data Collection. ISER 2017.

slide-34
SLIDE 34

Build Models and Plan with Them

PILCO - Inverting a pendulum

slide-35
SLIDE 35

Build Models and Plan with Them

PILCO - Inverting a pendulum

[PILCO] M. Deisenroth et al. PILCO: A Model-based and Data-Efficient Approach to Policy Search. ICML 2011

slide-36
SLIDE 36

Learn by Imitating Experts

  • S. Levine et al. End-to-End Training of Deep

Visuomotor Policies. JMLR 2016.

slide-37
SLIDE 37

Learn by Observing Experts

  • A. Kumar et al. Learning Navigation Subroutines by Watching
  • Videos. CoRL 2019.
slide-38
SLIDE 38

Hierarchies

Think about going to the airport. time tension in various muscles

Get Off Car Get Into Car Talk to the Uber driver

Take an Uber down to the airport

Request Uber Take Uber to airport Wait for Uber App Dest. FB Check

slide-39
SLIDE 39

Course Outline

  • Understand state-of-the-art in robotics and robot learning
  • Formulate robot learning problems as MDPs
  • Investigate alternative ways of solving MDPs
  • Applying these techniques to solve robotic tasks
slide-40
SLIDE 40

Course Outline

  • Understand state-of-the-art in robotics and robot learning
  • Formulate robot learning problems as MDPs
  • Investigate alternative ways of solving MDPs
  • Applying these techniques to solve robotic tasks
slide-41
SLIDE 41

Action to Execute

Goal (300, 400)

Spatial Representation

  • f the World

Mapper Planner

  • S. Gupta et al., CVPR 2017, IJCV 2020. Cognitive Mapping and Planning for Visual Navigation

Neural Network Typically, useful to incorporate problem-specific insights.

slide-42
SLIDE 42

Locomotion: Combining with low-level control

Deep Drone Racing: Learning Agile Flight in Dynamic Environments Kaufmann, et al. CoRL 2018

slide-43
SLIDE 43

Manipulation: Use of specialized hardware

Learning to Grasp and Re-grasp using Vision and Touch Calandra, et al. RAL 2018

slide-44
SLIDE 44

Course Outline

  • Understand state-of-the-art in robotics and robot learning
  • Formulate robot learning problems as MDPs
  • Investigate alternative ways of solving MDPs
  • Applying these techniques to solve robotic tasks
  • Perspectives
slide-45
SLIDE 45

Perspectives

  • Representations vs Behaviors
  • Big Data vs Clever Algorithms
  • Lessons from Cognitive Science, Psychology, Neuroscience
slide-46
SLIDE 46

Course Outline

  • Understand state-of-the-art in robotics and robot learning
  • Formulate robot learning problems as MDPs
  • Investigate alternative ways of solving MDPs
  • Applying these techniques to solve robotic tasks
  • Perspectives
slide-47
SLIDE 47

Today, we will…

  • Course outline
  • Course logistics
  • Get to know each other
slide-48
SLIDE 48

Course Logistics

http://saurabhg.web.illinois.edu/teaching/ece598sg/fa2020/

TA: Rishabh Goyal Instructor: Saurabh Gupta

slide-49
SLIDE 49

Today, we will…

  • Course outline
  • Course logistics
  • Get to know each other
slide-50
SLIDE 50

Thank you