Robot learning from few demonstrations by exploiting the structure - PowerPoint PPT Presentation

Robot learning from few demonstrations by exploiting the structure and geometry of data Sylvain Calinon Senior Researcher Idiap Research Institute, Martigny, Switzerland Lecturer EPFL, Lausanne, Switzerland External Collaborator IIT, Genoa, Italy

Artificial Intelligence for Society Research Groups: • Speech & Audio Processing • Perception & Activity Understanding • Computer Vision & Learning • Social Computing • Biometric Person Recognition • Applied Machine Learning • Natural Language Processing MARTIGNY • Robot Learning & Interaction Research • Computational Bioimaging Education • Uncertainty Quantification and Optimal Design Technology transfer

Learning from demonstration as an intuitive interface to transfer skills to robots

Learning from demonstration - Challenges

Finding Priors that are expressive enough to be used in a wide range of tasks

Prior 1: Movements are smooth and continuous Prior 2: Actions often relate to objects, tools or body landmarks Prior 3: Data spaces in robotics have geometries and structures

Movement generation as a mix of clustering, subspace analysis and optimal control Walking Running Walking We look for a compact and modular representation of continuous movements and skills that can learn from few interactions (with user and environment), that can exploit variation and coordination , and that can adapt to new situations in a fast manner.

Learning of motions from few demonstrations center covariance matrix Global sharing of local coordination patterns with: Dictionary of coordination patterns: [Tanwani and Calinon, IEEE RA-L 1(1), 2016]

Learning minimal intervention controllers Use low control commands! Track path! System plant state variable (position+velocity) control command (acceleration) Approach: Using control formalism in task space to tracking weight matrix solve analytically a basic control weight matrix form of model predictive control (MPC) with a double integrator as constant linear system [Tanwani and Calinon, IEEE RA-L 1(1), 2016]

Learning minimal intervention controllers Use low control commands! Track path! System plant [Tanwani and Calinon, IEEE RA-L 1(1), 2016]

Learning minimal intervention controllers  Analytical solution to generate Transition and state duration motion control by following a (HSMM) minimal intervention principle Stepwise reference with: [Calinon , Bruno and Caldwell, ICRA’2014]

Application: Editing motions with variations User interface to edit and generate natural and dynamic motions by considering variation and coordination Compliant controller to retrieve safe and human-like motions Daniel Berio Frederic Fol Leymarie [Berio, Calinon and Leymarie , IROS’2016] [ Berio, Calinon and Leymarie , MOCO’2017]

I-DRESS project Personalized assistance using haptic and visual information, with compliant controllers following a minimal intervention principle Dressing skills require some aspects to be time-independent, while other aspects are time- dependent for the generation of movements. Emmanuel Pignat [Pignat and Calinon, RAS 93, 2017]

Prior 2: Actions often relate to objects, tools or body landmarks Photo: Basilio Noris

Regression with a Task-parameterized motions context variable c : • Learning of • Retrieval with  Generic approach, but limited generalization capability

Control in multiple coordinate systems Track path in coordinate system j Use low control commands! New position and 2 2 orientation of coordinate 2 2 systems 1 and 2 2 Two candidate 1 1 coordinate systems (P=2) Set of demonstrations Reproduction in new situation [Calinon , HFR’2016]

Control in multiple coordinate systems Control in multiple coordinate systems Track path in coordinate system j Use low control commands! In many robotics problems, the parameters describing the task or situation can be interpreted as coordinate systems 2 1 [Calinon , HFR’2016]

Control in multiple coordinate systems Track path in coordinate system j Use low control commands!  Learning of a controller (instead of learning a trajectory) that adapts to new situations while regulating the gains according to the precision and coordination patterns required by the task [Calinon , HFR’2016]

Control in multiple coordinate systems Track path in coordinate system j Use low control commands!  Retrieval of control commands in the form of trajectory distributions, facilitating exploration and adaptation (in either control or state space) [Calinon , HFR’2016]

I-DRESS project SNSF, CHIST-ERA (2015-2018) [Canal, G., Pignat, E., Alenya, G, Calinon, S. and Torras , C., ICRA’2018]

I-DRESS project SNSF, CHIST-ERA (2015-2018) [Canal, Pignat, Alenya, Calinon and Torras , ICRA’2018]

http://dexrov.eu EC, H2020 (2015-2018)

Exploitation in shared control Teleoperator side Robot side only Gaussian ID is transmitted Dr Andras Kupcsik Dr Ioannis Havoutis [Havoutis and Calinon, Autonomous Robots, 2018]

Adaptation to different object shapes Coordinate system as task parameter [Calinon, Alizadeh and Caldwell, IROS’2013]

Bimanual coordination and co-manipulation [Rozo et al., IROS’2015] [Silvério et al., IROS’2015] [Rozo et al., IEEE T-RO 32(3), 2016] Dr Leonel Rozo Dr João Silvério

Learning & generalizing tasks prioritization Priority on left hand Demonstration Demonstration Reproduction Reproduction Candidate hierarchy Candidate hierarchy [Silvério, Calinon, Rozo and Caldwell (2018), Arxiv 1707.06791] [Calinon , ISRR’15]

Learning & generalizing tasks prioritization Priority on right hand Demonstration Demonstration Reproduction Reproduction Candidate hierarchy Candidate hierarchy [Silvério, Calinon, Rozo and Caldwell (2018), Arxiv 1707.06791] [Calinon , ISRR’15]

Learning & generalizing tasks prioritization Equal priority Demonstration Demonstration Reproduction Reproduction Candidate hierarchy Candidate hierarchy [Silvério, Calinon, Rozo and Caldwell (2018), Arxiv 1707.06791] [Calinon , ISRR’15]

Prior 3: Data spaces in robotics have geometries and structures

Motivation of using Riemannian manifolds

Interpolation on Riemannian manifolds Orientation (unit quaternions) Rigid body motions (position+orientation) Covariance features, inertia and gain matrices, manipulability ellipsoids, trajectory distributions (symmetric positive definite matrices)

Clustering on Riemannian manifolds Covariance features, inertia and gain matrices, manipulability ellipsoids, trajectory distributions (symmetric positive definite matrices) Orientation (unit quaternions) Rigid body motions (position+orientation)

Regression on Riemannian manifolds Gaussian mixture regression (GMR) to compute from the joint distribution encoded as a GMM → Regression for orientation data (unit quaternions on )

Regression with orientation and position data Four demonstrations of coordinated bimanual movement [Zeestraten, Havoutis, Silvério, Calinon and Caldwell, IEEE RA-L 2(3), 2017]

Regression with orientation and position data Four reproductions with perturbations by the user [Zeestraten, Havoutis, Silvério, Calinon and Caldwell, IEEE RA-L 2(3), 2017]

Regression with sEMG sensory data TACT-HAND SNSF, D-A-CH (2016-2019) Noémie Jaquier Surface Transformation in spatial Control of the electromyography covariances corresponding ( sEMG ) measurements (SPD matrices) hand pose [Jaquier and Calinon, IROS 2017]

Comparison: standard GMR vs geometric GMR sEMG data from Ninapro database processed as spatial covariances: 12 Input 4 Output [Jaquier and Calinon, IROS 2017]

Manipulability ellipsoid tracking Noémie Jaquier [N. Jaquier, L. Rozo, D.G. Caldwell and S. Calinon , RSS’2018]

Conclusion Combining statistical learning techniques and model predictive control provides a generative approach to the transfer of skills and movements Statistical learning in multiple coordinate systems can be exploited to learn robot skills and movements from few demonstrations, with adaptation to new situations Robotics is rich in structures and geometries that can be exploited to acquire skills and movements from a small set of interactions (with user or environment)

Robot learning from few demonstrations by exploiting the structure - PowerPoint PPT Presentation

Robot learning from few demonstrations by exploiting the structure and geometry of data Sylvain Calinon Senior Researcher Idiap Research Institute, Martigny, Switzerland Lecturer EPFL, Lausanne, Switzerland External Collaborator IIT,

Robothlon Team competition, each team programs a robot for each event Events Robot

Rational Robot A Test Automation Tool What is Rational Robot? Rational Robot is a complete

Verifying the Motion of a Robot Arm Akul Penugonda 1 /6 Akul Penugonda - Robot Arm Motion 2

What is a robot? A robot is an intelligent system that interacts with the Robot Lecture 2:

Establishing a Korean Robot Ethics Charter 2007. 4. 14 Robot Division, Ministry of Commerce,

Out line Robot ics Percept ion Robot ics Planning Reading: R&N Sect .

Robot behaviour and control A robot can be defined as an intelligent link between perception

Robot Localization Localization Robot and and Kalman Filters Filters Kalman Rudy Negenborn

? 1 1/31/2012 Every robot maps to a point in Every robot maps to a point in its configuration

Robot Walking with Genetic Algorithms Bente Reichardt 14. December 2015 Bente Reichardt 1/52

What is a Robot? (3) What Can Robots Do? (1) Autonomous Underwater Vehicle Unmanned Aerial

Building New Robots 1 Extending Robot Language Suppose we needed a Robot to patrol the walls

Robot sensors A robot can be defined as an intelligent link between perception and action

Human-Robot Interaction CMSC 691 Spring 2016 2 u What is an interaction with a robot? u What is

Overview of Robot Decision Making Prof. Yuke Zhu Fall 2020 CS391R: Robot Learning (Fall 2020) 1

(Deep) Learning for Robot Perception and Navigation Wolfram Burgard Deep Learning for Robot

Some Applications of Nonnegative Tensor Factorizations (NTF) to Mining Hype rspectral &

Spac e Utilization & Me tr ic s Name of Chapte r : Mid- Atlantic WHY : Space utilization

PSD 2 la croise de changements juridiques et technologiques Marc Mouton, Partner, Arendt

Authentication Most technical security safeguards have Authentication authentication as a

and expression analysis: from handcrafted to learned features Huibin Li

Deep Learning for Computer Vision UCA Master 2 Data Science INRIA Sophia Antipolis STARS team

CSSE 232 Computer Architecture I Exceptions 1 / 12 Class Status Reading for today B.7-8 2

Basic OS Progamming Abstractions Don Porter CSE 306 Recap Weve introduced the idea of

Robot learning from few demonstrations by exploiting the structure - PowerPoint PPT Presentation

Robot learning from few demonstrations by exploiting the structure and geometry of data Sylvain Calinon Senior Researcher Idiap Research Institute, Martigny, Switzerland Lecturer EPFL, Lausanne, Switzerland External Collaborator IIT,

Robothlon Team competition, each team programs a robot for each event Events Robot

Rational Robot A Test Automation Tool What is Rational Robot? Rational Robot is a complete

Verifying the Motion of a Robot Arm Akul Penugonda 1 /6 Akul Penugonda - Robot Arm Motion 2

What is a robot? A robot is an intelligent system that interacts with the Robot Lecture 2:

Establishing a Korean Robot Ethics Charter 2007. 4. 14 Robot Division, Ministry of Commerce,

Out line Robot ics Percept ion Robot ics Planning Reading: R&amp;N Sect .

Robot behaviour and control A robot can be defined as an intelligent link between perception

Robot Localization Localization Robot and and Kalman Filters Filters Kalman Rudy Negenborn

? 1 1/31/2012 Every robot maps to a point in Every robot maps to a point in its configuration

Robot Walking with Genetic Algorithms Bente Reichardt 14. December 2015 Bente Reichardt 1/52

What is a Robot? (3) What Can Robots Do? (1) Autonomous Underwater Vehicle Unmanned Aerial

Building New Robots 1 Extending Robot Language Suppose we needed a Robot to patrol the walls

Robot sensors A robot can be defined as an intelligent link between perception and action

Human-Robot Interaction CMSC 691 Spring 2016 2 u What is an interaction with a robot? u What is

Overview of Robot Decision Making Prof. Yuke Zhu Fall 2020 CS391R: Robot Learning (Fall 2020) 1

(Deep) Learning for Robot Perception and Navigation Wolfram Burgard Deep Learning for Robot

Some Applications of Nonnegative Tensor Factorizations (NTF) to Mining Hype rspectral &amp;

Spac e Utilization &amp; Me tr ic s Name of Chapte r : Mid- Atlantic WHY : Space utilization

PSD 2 la croise de changements juridiques et technologiques Marc Mouton, Partner, Arendt

Authentication Most technical security safeguards have Authentication authentication as a

and expression analysis: from handcrafted to learned features Huibin Li

Deep Learning for Computer Vision UCA Master 2 Data Science INRIA Sophia Antipolis STARS team

CSSE 232 Computer Architecture I Exceptions 1 / 12 Class Status Reading for today B.7-8 2

Basic OS Progamming Abstractions Don Porter CSE 306 Recap Weve introduced the idea of

Out line Robot ics Percept ion Robot ics Planning Reading: R&N Sect .

Some Applications of Nonnegative Tensor Factorizations (NTF) to Mining Hype rspectral &

Spac e Utilization & Me tr ic s Name of Chapte r : Mid- Atlantic WHY : Space utilization