ece 598 sg special topics in learning based robotics
play

ECE 598 SG Special Topics in Learning-based Robotics Saurabh Gupta - PowerPoint PPT Presentation

ECE 598 SG Special Topics in Learning-based Robotics Saurabh Gupta Assistant Prof. (ECE, CS, CSL) Today, we will Course outline Course logistics Get to know each other Understand how we can build intelligent machines Source:


  1. ECE 598 SG Special Topics in Learning-based Robotics Saurabh Gupta Assistant Prof. (ECE, CS, CSL)

  2. Today, we will… • Course outline • Course logistics • Get to know each other

  3. Understand how we can build intelligent machines Source: The Atlantic

  4. Understand how we can build intelligent machines … that can favorably change the state of the physical world around them.

  5. Understand how we can build intelligent machines … that can favorably change the state of the physical world around them. Video credit: Boston Dynamics, CNN

  6. Yet, Research Robots Keep Falling… Video credit: IEEE Spectrum. DARPA Robotics Challenge Finals 2015.

  7. State-of-the-art Results in Object Pushing Video credit: Pulkit Agrawal 2016

  8. Understand how we can build intelligent machines … that can favorably change the state of cluttered real world environments to solve a variety of tasks . Household Robots Understand how far are we from making this PR1 showcase a reality. Video credit: Pieter Abbeel

  9. Goals of the Course • Understand state-of-the-art in robotics and robot learning • Formulate robot learning problems as MDPs • Investigate alternative ways of solving MDPs • Applying these techniques to solve robotic tasks

  10. Successes in Computer Vision “in the Wild” Image Labeling Tasks person, motorcycle, car, chair person, motorcycle, car, chair K. He et al. Mask R-CNN ICCV 2017

  11. Successes in Computer Vision “in the Wild” Shape and Pose Estimation for Objects and Humans S. Goel et al. Shape and Viewpoint without Keypoints. ECCV 2020 A. Kanawaza et al. End-to-end Recovery of Human Shape and Pose. CVPR 2018

  12. Factors Leading to Success in Computer Vision Hand-crafted features to Large-scale labeled data End-to-end trained features A. Krizhevsky et al. ImageNet Classification with Deep Convolutional Neural Networks. NIPS 2012 J. Deng et al. ImageNet: A Large-Scale Hierarchical Image Database. CVPR 2009

  13. Factors Leading to Success in Computer Vision Hand-crafted features Can large-scale learning enable robots to execute a variety of tasks in mid-level features classifier features cluttered real-world environments? (e.g. DPM) (e.g. SVM) (e.g. HOG) Felzenszwalb et al. end-to-end training End-to-end trained cat features A. Krizhevsky et al. ImageNet Classification with Deep Convolutional Neural Networks. NIPS 2012

  14. Robotic Tasks Navigation Goal “Go 300 feet North, 400 feet East” “Go Find a Chair” Robot with a first Dropped into a novel Navigate person camera environment around

  15. Robotic Tasks Manipulation

  16. Typical Classical Robotics Pipeline State Low-level Observations Planning Control Estimation Controller Slide adapted from S. Levine.

  17. Typical Classical Robotics Pipeline State Low-level Observations Planning Control Estimation Controller Manipulation Grasp Motion Observed Images 6DOF Pose Planning Slide adapted from S. Levine.

  18. Typical Classical Robotics Pipeline State Low-level Observations Planning Control Estimation Controller end-to-end training Observations Control But why? Slide adapted from S. Levine.

  19. Robot Navigation Goal “Go 300 feet North, 400 feet East” “Go Find a Chair” Robot with a first Dropped into a novel Navigate person camera environment around

  20. State Low-level Planning Control Observations Estimation Controller Mapping Observed Images Geometric Reconstruction Planning Hartley and Zisserman. 2000. Multiple View Geometry in Computer Vision Thrun, Burgard, Fox. 2005. Probabilistic Robotics Canny. 1988. The complexity of robot motion planning. Kavraki et al. RA1996. Probabilistic roadmaps for path planning in high-dimensional configuration spaces. Lavalle and Kuffner. 2000. Rapidly-exploring random trees: Progress and prospects. Path Plan Video Credits : Mur-Artal et al., Palmieri et al.

  21. Geometric 3D Reconstruction of the World Unnecessary Do we need to tediously reconstruct everything on this table? Video Credit: Mur-Artal and Tardos, TRobotics 2016. ORB-SLAM2: an Open-Source SLAM System for Monocular, Stereo and RGB-D Cameras.

  22. Geometric 3D Reconstruction of the World Insufficient Can’t speculate about space not directly observed.

  23. Geometric 3D Reconstruction of the World Insufficient Can’t exploit patterns in layout of indoor spaces.

  24. Geometric 3D Reconstruction of the World Insufficient Can’t exploit patterns in layout of indoor spaces.

  25. Visual Learning vs Robot Learning Hand-crafted features mid-level features classifier features (e.g. DPM) (e.g. SVM) (e.g. HOG) Felzenszwalb et al. end-to-end training End-to-end trained cat features A. Krizhevsky et al. ImageNet Classification with Deep Convolutional Neural Networks. NIPS 2012

  26. Visual Learning vs Robot Learning • How do we get supervision? • Non-stationarity • Exploration vs exploitation

  27. Formalism for Modeling Behavior Reinforcement Learning

  28. Markov Decision Process … Step Back Transition Function o t a t o t +1 How you move, how the tiger moves? 3D Relative Reward Function Pose Survived? s t s t +1 … o t a t o t +1 a t +1 s t +1 s t +2 s t p ( s t +1 | s t , a t ) p ( s t +2 | s t +1 , a t +1 ) Transition Function r t = R ( s t +1 , s t , a t ) r t +1 = R ( s t +2 , s t +1 , a t +1 ) Reward Function argmax a 0 ,…, a T ∑ γ t r t Goal t

  29. Goals of the Course • Understand state-of-the-art in robotics and robot learning • Formulate robot learning problems as MDPs • Investigate alternative ways of solving MDPs • Applying these techniques to solve robotic tasks

  30. Challenges with Markov Decision Process … Step Back Transition Function o t a t o t +1 How you move, how the tiger moves? 3D Relative Reward Function Pose Survived? s t s t +1 … o t a t o t +1 a t +1 s t +1 s t +1 s t +2 s t Need to live many many lives to p ( s t +1 | s t , a t ) p ( s t +2 | s t +1 , a t +1 ) Transition Function learn how to live. r t = R ( s t +1 , s t , a t ) r t +1 = R ( s t +2 , s t +1 , a t +1 ) Reward Function argmax a 0 ,…, a T ∑ γ t r t Goal t

  31. Credit assignment problem in RL o t B B B B B B B B B F B B F B B B B B B B B B B B F B … Yann LeCun’s Cake

  32. Alternatives to Solving MDPs M. Andrychowicz et al. Hindsight Experience Replay. NeurIPS 2018. Pieter Abbeel’s Cake

  33. Solve a Related but Supervision-rich Problem S. Levine et al. Learning Hand-Eye Coordination for Robotic Grasping with Deep Learning and Large- Scale Data Collection. ISER 2017.

  34. Build Models and Plan with Them PILCO - Inverting a pendulum

  35. Build Models and Plan with Them PILCO - Inverting a pendulum [PILCO] M. Deisenroth et al. PILCO: A Model-based and Data-Efficient Approach to Policy Search. ICML 2011

  36. Learn by Imitating Experts S. Levine et al. End-to-End Training of Deep Visuomotor Policies. JMLR 2016.

  37. Learn by Observing Experts A. Kumar et al. Learning Navigation Subroutines by Watching Videos. CoRL 2019.

  38. Hierarchies Think about going to the airport. Take an Uber down to the airport Request Uber Wait for Uber Take Uber to airport App Dest. FB Check Get Into Car Talk to the Uber driver Get Off Car tension in various muscles time

  39. Course Outline • Understand state-of-the-art in robotics and robot learning • Formulate robot learning problems as MDPs • Investigate alternative ways of solving MDPs • Applying these techniques to solve robotic tasks

  40. Course Outline • Understand state-of-the-art in robotics and robot learning • Formulate robot learning problems as MDPs • Investigate alternative ways of solving MDPs • Applying these techniques to solve robotic tasks

  41. Typically, useful to incorporate problem-specific insights. Goal (300, 400) Mapper Planner Action to Execute Spatial Representation Neural of the World Network S. Gupta et al., CVPR 2017, IJCV 2020. Cognitive Mapping and Planning for Visual Navigation

  42. Locomotion: Combining with low-level control Deep Drone Racing: Learning Agile Flight in Dynamic Environments Kaufmann, et al. CoRL 2018

  43. Manipulation: Use of specialized hardware Learning to Grasp and Re-grasp using Vision and Touch Calandra, et al. RAL 2018

  44. Course Outline • Understand state-of-the-art in robotics and robot learning • Formulate robot learning problems as MDPs • Investigate alternative ways of solving MDPs • Applying these techniques to solve robotic tasks • Perspectives

  45. Perspectives • Representations vs Behaviors • Big Data vs Clever Algorithms • Lessons from Cognitive Science, Psychology, Neuroscience

  46. Course Outline • Understand state-of-the-art in robotics and robot learning • Formulate robot learning problems as MDPs • Investigate alternative ways of solving MDPs • Applying these techniques to solve robotic tasks • Perspectives

  47. Today, we will… • Course outline • Course logistics • Get to know each other

  48. Course Logistics http://saurabhg.web.illinois.edu/teaching/ece598sg/fa2020/ Instructor: TA: Saurabh Gupta Rishabh Goyal

  49. Today, we will… • Course outline • Course logistics • Get to know each other

  50. Thank you

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend