Learning and control with movement primitives in multiple - - PowerPoint PPT Presentation

learning and control with movement primitives in multiple
SMART_READER_LITE
LIVE PREVIEW

Learning and control with movement primitives in multiple - - PowerPoint PPT Presentation

Learning and control with movement primitives in multiple coordinate systems Sylvain Calinon Robot Learning & Interaction Group Idiap Research Institute Martigny, Switzerland RIBA healthcare robot, RIKEN i-limb prosthetics, TouchBionics


slide-1
SLIDE 1

Sylvain Calinon

Robot Learning & Interaction Group Idiap Research Institute Martigny, Switzerland

Learning and control with movement primitives in multiple coordinate systems

slide-2
SLIDE 2
slide-3
SLIDE 3

RIBA healthcare robot, RIKEN i-limb prosthetics, TouchBionics Humanoid diver, Stanford Robonaut, NASA

slide-4
SLIDE 4

Learning from demonstration as an intuitive interface to transfer skills to robots

slide-5
SLIDE 5

Statistical learning dynamical systems

+

We look for a modular representation of movements and skills that can learn from wide-ranging data, that can adapt to new situations in a fast manner, that can exploit the robot embodiment, and that is robust to perturbations.

slide-6
SLIDE 6

System plant Model predictive control (MPC) Do it smoothly! Track path!

How to solve this cost function?

  • Pontryagin’s maximum principle

 Riccati equation

  • Dynamic programming
  • Linear algebra with stacked

vectors

[Tanwani and Calinon, IEEE RA-L 1(1), 2016]

state variable (position+velocity) control command (acceleration) tracking weight matrix control weight matrix

slide-7
SLIDE 7

System plant Model predictive control (MPC) Do it smoothly! Track path!

[Tanwani and Calinon, IEEE RA-L 1(1), 2016]

slide-8
SLIDE 8

Model predictive control (MPC)

[Tanwani and Calinon, IEEE RA-L 1(1), 2016]

Sharing of synergies with:

slide-9
SLIDE 9

Model predictive control (MPC)

[Tanwani and Calinon, IEEE RA-L 1(1), 2016]

 Minimal intervention  Safe/compliant robot controller

Transition and state duration Stepwise sequence with:

slide-10
SLIDE 10

Transfer of controllers from demonstration Demonstration Reproduction

[Calinon, Intelligent Service Robotics 9(1), 2016]

Holding a cup horizontally Bimanual coordination

slide-11
SLIDE 11

 in a new situation…

Extension to multiple coordinate systems

Coordinate system 1: This is where I expect data to be located! Coordinate system 2: This is where I expect data to be located!

[Calinon, Bruno and Caldwell, ICRA’2014][Calinon, HFR’2016]

 Product of linearly transformed Gaussians

slide-12
SLIDE 12

Track path in coordinate system j

MPC considering multiple coordinate systems

Do it smoothly!

1 2 2 2 2 1 2

Set of demonstrations Reproduction in new situation

New position and

  • rientation of coordinate

systems 1 and 2 Two candidate coordinate systems (P=2)

[Calinon, Bruno and Caldwell, ICRA’2014][Calinon, HFR’2016]

slide-13
SLIDE 13

MPC considering multiple coordinate systems

1 2

In many robotics problems, the parameters describing the task or situation can be recast as some form of coordinate systems or locally linear transformations

[Calinon, Bruno and Caldwell, ICRA’2014][Calinon, HFR’2016]

Track path in coordinate system j Do it smoothly!

slide-14
SLIDE 14

MPC considering multiple coordinate systems

 Learning of a controller that

adapts to new situations while regulating its gains according to the precision and coordination required by the task

In many robotics problems, the parameters describing the task or situation can be recast as some form of coordinate systems or locally linear transformations

Track path in coordinate system j Do it smoothly!

[Calinon, Bruno and Caldwell, ICRA’2014][Calinon, HFR’2016]

slide-15
SLIDE 15

Adaptation of movements to different shapes

[Calinon, Alizadeh and Caldwell, IROS’2013]

Candidate coordinate system

slide-16
SLIDE 16

[Silverio et al., IROS’2015] [Rozo et al., IROS’2015] [Rozo et al., IEEE T-RO 32(3), 2016]

Bimanual coordination and co-manipulation

Dr Leonel Rozo João Silvério

slide-17
SLIDE 17

Joint space constraints & tasks prioritization

Task space (operational space) Joint space (configuration space)

Task parameters as Jacobian operators Task parameters as null space projection

  • perators

[Silverio, Calinon, Rozo and Caldwell (submitted)] [Calinon, ISRR’2015]

Demonstration

  • n COMAN

Left hand priority Right hand priority Reproduction

  • n WALKMAN
slide-18
SLIDE 18

Learning from demonstration can be applied to various forms of robots and applications

(2015-2018) (2015-2018) (2012-2015)

slide-19
SLIDE 19

Stiff Compliant [Bruno, Calinon, Malekzadeh and Caldwell, ICIRA, LNCS 9246, 2015]

s=0 s=1

Continuous arm index s

MPC for continuum robots

slide-20
SLIDE 20

Flexible robots in minimal invasive surgery

Collaboration letting the surgeon control the tip, while the robot exploits the remaining degrees

  • f freedom that do not interfere

with the control of the tip

Insertion Retraction

[Bruno, Calinon and Caldwell, Autonomous Robots, 2016]

slide-21
SLIDE 21

Robotic dressing assistance

Dressing assistance for:

  • Putting on a coat
  • Putting on shoes

 Requires to extend movement

primitives to reaction, force and impedance primitives

slide-22
SLIDE 22

Telemanipulation with underwater robot Onshore Offshore

Same model (TP-HSMM) used for classification on the one side, and synthesis on the other side, with local adaptation to the task parameters

slide-23
SLIDE 23

Being skillful = exploiting variability and correlation

Needs correction Does not need correction

Recognition & synthesis

  • f motion primitives

Semi-autonomous teleoperation as a form of human-robot collaboration

[Tanwani and Calinon, IEEE RA-L 1(1), 2016]

Telemanipulation with underwater robot

slide-24
SLIDE 24
  • Model predictive control (MPC) can be smoothly combined

with learning from demonstration (LfD) for both planning and control problems

  • The parameters of the cost function in MPC can be learned

from demonstration, with weights as full precision matrices, instead of predefining those manually as diagonal matrices

  • LfD can be extended to the consideration of multiple

coordinate systems and to the learning of controllers

 Minimal intervention strategy that is safer for the users

  • Extending LfD to collaborative skills and teleoperation provides

new perspectives in shared control by exploiting the learned task variations and task synergies to cope with perturbations

Conclusion

slide-25
SLIDE 25

Contact:

sylvain.calinon@idiap.ch http://calinon.ch

Source codes:

http://www.idiap.ch/software/pbdlib/

Ajay Tanwani Emmanuel Pignat Noémie Jaquier Dr Ioannis Havoutis

Photo: Basilio Noris

Collaborators at Idiap: