SLIDE 1

Auto-conditioned Recurrent Mixture Density Networks for Learning Generalizable Robot Skills

Hejia Zhang, Eric Heiden, Stefanos Nikolaidis, Joseph J. Lim, Gaurav S. Sukhatme

SLIDE 2

Introduction

SLIDE 3

Introduction

  • learn generalizable robot skills by imitation learning
  • learn state-transition model (STM) to perform tasks with unseen goals
  • perform tasks from high-level descriptions
  • plan tasks with longer time horizons than the demonstrated tasks
  • based on auto-conditioning technique and Recurrent Mixture Density Network (MDN)
  • combinable with other methods, e.g. Trajectory Optimization, Inverse Dynamics Models

SLIDE 4

Architecture

SLIDE 5

State Transition Model (STM):

Two requirements for robot skill models:

  • Remember long state sequences (history)
  • Capture the underlying multimodal nature of the real world
    (e.g., different solutions for the same task, human motion prediction)

(Diagram: Recurrent Neural Network + Mixture Density Network → Recurrent Mixture Density Network)


State: (joint angles, task input, task description)
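The mixture-density output layer described above can be sketched in plain numpy: the network's raw output vector is split into mixture weights, means, and standard deviations, and training minimizes the negative log-likelihood of the observed next state. The parameter layout and function names here are illustrative assumptions, not the paper's code:

```python
import numpy as np

def mdn_split(raw, n_mix, dim):
    """Split the network's raw output into mixture parameters.
    Assumed layout: [pi logits | means | log std-devs]."""
    logits = raw[:n_mix]
    mu = raw[n_mix:n_mix + n_mix * dim].reshape(n_mix, dim)
    log_sigma = raw[n_mix + n_mix * dim:].reshape(n_mix, dim)
    pi = np.exp(logits - logits.max())
    pi /= pi.sum()                    # softmax -> valid mixture weights
    sigma = np.exp(log_sigma)         # exp keeps std-devs positive
    return pi, mu, sigma

def mdn_nll(pi, mu, sigma, x):
    """Negative log-likelihood of next state x under the diagonal GMM."""
    log_comp = -0.5 * np.sum(((x - mu) / sigma) ** 2
                             + 2.0 * np.log(sigma)
                             + np.log(2.0 * np.pi), axis=1)
    m = log_comp.max()                # log-sum-exp for numerical stability
    return -(m + np.log(np.sum(pi * np.exp(log_comp - m))))
```

Predicting a full distribution rather than a single next state is what lets the model represent several distinct solutions to the same task.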

SLIDE 6

Train RNNs via Auto-conditioning

References:
  • Improving Multi-step Prediction of Learned Time Series Models. Arun Venkatraman, Martial Hebert, J. Andrew Bagnell. AAAI 2015.
  • Auto-Conditioned Recurrent Networks for Extended Complex Human Motion Synthesis. Yi Zhou, Zimo Li, Shuangjiu Xiao, Chong He, Zeng Huang, Hao Li. ICLR 2018.
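The auto-conditioning idea from Zhou et al. can be illustrated with a small sketch: during training, blocks of ground-truth states alternate with blocks where the model's own predictions are fed back as input, so the network learns to recover from its own accumulated error. The block lengths and the `step_fn` stand-in below are illustrative assumptions:

```python
import numpy as np

def auto_conditioned_inputs(ground_truth, step_fn, self_len=4, gt_len=4):
    """Build the input sequence for one training pass.

    Alternates gt_len steps of ground-truth states with self_len steps
    where the model's own one-step prediction is fed back as input.
    step_fn(state) -> predicted next state (stands in for the RNN cell).
    """
    inputs = []
    pred = ground_truth[0]
    for t, gt in enumerate(ground_truth):
        phase = t % (gt_len + self_len)
        x = gt if phase < gt_len else pred   # ground truth vs. self-feed
        inputs.append(x)
        pred = step_fn(x)                    # model's one-step prediction
    return np.array(inputs)
```

Training the loss against ground truth while feeding back predictions in the self-conditioned blocks is what stabilizes long rollouts beyond the demonstrated horizon.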

SLIDE 7

Architecture

SLIDE 8

Trajectory Optimization

Smooth the trajectory by minimizing a cost objective.
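The slide's objective is not reproduced in this export; as a hedged stand-in, a common smoothness objective penalizes squared finite-difference accelerations while keeping the trajectory close to the model's predicted waypoints:

```python
import numpy as np

def smooth_trajectory(waypoints, weight=1.0, iters=200, lr=0.05):
    """Minimize  weight*||x - waypoints||^2 + sum_t ||x[t-1] - 2x[t] + x[t+1]||^2
    by gradient descent. (A common acceleration-penalty form, not
    necessarily the paper's exact objective.)"""
    x = waypoints.copy()
    for _ in range(iters):
        acc = x[:-2] - 2 * x[1:-1] + x[2:]         # finite-diff accelerations
        grad = 2 * weight * (x - waypoints)        # stay near predictions
        grad[:-2] += 2 * acc                       # d(acc^2)/dx[t]   terms
        grad[1:-1] += -4 * acc
        grad[2:] += 2 * acc
        x = x - lr * grad
    return x
```

The weight trades off precision (staying near the STM's predicted waypoints) against smoothness of the executed motion.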

SLIDE 9

Experiments

SLIDE 10

Experiment - Stacking blocks

SLIDE 11

Experiment - Drawing circles

SLIDE 12

Experiment - Adaptability

The goal is changed midway through each task execution. The plots show that our model adapts to the changed goal and continues to work beyond the planning horizon of its demonstrations.

(Figure panels: Reaching; Pick & Place)

SLIDE 13

Experiment - Combine with other methods

  • trajectory optimizer for smoothness and precision (goal-based)
  • inverse dynamics model (IDM) for efficient sim-to-real transfer
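As an illustration of the IDM idea (not the paper's model), one can fit a simple linear inverse dynamics model that maps a state transition (s, s') back to the action that produced it:

```python
import numpy as np

def fit_linear_idm(states, next_states, actions):
    """Fit a linear inverse dynamics model  a ≈ W @ [s; s'; 1]  by least
    squares. (A toy stand-in for the learned IDM mentioned on the slide.)"""
    X = np.hstack([states, next_states, np.ones((len(states), 1))])
    W, *_ = np.linalg.lstsq(X, actions, rcond=None)
    return lambda s, s_next: np.hstack([s, s_next, 1.0]) @ W
```

Chaining the STM's predicted state sequence through such an IDM yields the low-level commands needed to execute the plan on a real robot.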

(Figures: trajectories before and after smoothing, reaching to 4 goals; combination with inverse dynamics model, reaching to 1 goal)

SLIDE 14

Conclusion

SLIDE 15

Conclusion

Deeper insight into our neural network structure:

  • Assumption 1: Every single task can be solved in several ways.
  • Assumption 2: Different phases of a single task are governed by different Gaussian mixture components (e.g., approaching, grasping, placing for pick-and-place tasks)

“How do Mixture Density RNNs Predict the Future”. Kai Olav Ellefsen, Charles Patrick Martin, Jim Torresen. arXiv preprint, 2019.
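Assumption 2 can be probed by checking which mixture component takes the most responsibility for each observed state; a minimal numpy sketch on hypothetical parameters:

```python
import numpy as np

def dominant_component(pi, mu, sigma, x):
    """Index of the mixture component most responsible for state x
    (argmax of log pi_k + log N(x; mu_k, sigma_k), constants dropped)."""
    log_comp = -0.5 * np.sum(((x - mu) / sigma) ** 2
                             + 2.0 * np.log(sigma), axis=1)
    return int(np.argmax(np.log(pi) + log_comp))
```

Tracking this index over a pick-and-place rollout would reveal whether components specialize to approaching, grasping, and placing phases.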

Future directions:

  • Investigate the roles of individual Gaussians of the MDN applied to learning robot skills (based on Ellefsen’s work)
  • Generalize towards more complex tasks with human teammates
  • Connect with trajectory optimization methods (optimize over a variety of dynamic and task-based criteria)


State: (joint angles, human motions, task input, task description)

SLIDE 16

Auto-conditioned Recurrent Mixture Density Networks for Learning Generalizable Robot Skills

Hejia Zhang, Eric Heiden, Stefanos Nikolaidis, Joseph J. Lim, Gaurav S. Sukhatme