Robotic Coach: how to revise humans' motions by Emphatic Demonstration
Tetsunari Inamura
National Institute of Informatics / The Graduate University for Advanced Studies
My previous work: daily life robots
The University of Tokyo, Prof. Inaba's lab.
Main interest
- How to integrate symbolic expressions and whole-body motion patterns
– For easy interaction between humans and robots
"Pouring water in a white cup" / "a white cup"
- Robotic coach that teaches human beings
- Realization of robots that are not merely imitative, but that can lead human beings to imitate
– One of the most useful and complex tasks, requiring integration of symbols and motion performance
Latest topic
Human: "Like this?"  Robot: "Please swing not like jumping, but more like squatting!"
Background
- Standard coaching methods in sports / dancing
– Coaching by demonstration (or video material)
- Imitating a whole-body motion is often difficult
- There are too many points to pay attention to
– Direct coaching with physical interaction
- Effective but expensive
– Coaching by verbal explanation
- Low cost; effective in various situations
- Conversion from a verbal expression into a motion is unstable
Purpose of this project
- Realization of a robotic coach system used for training human beings
- Integration of verbal explanation and physical demonstration with emphasis
- Design of a common representation between "emphasis of motion" and "explanation by verbal expression"
Related works
- Motion emphasis (modification, editing)
– Interpolation / extrapolation (SIGGRAPH) [Bruderlin95][Rose98][Glardon04][Hoshino04]
– Parameterization of motion [Matsubara]
- No relationship with symbols
- Symbolization of motion
– RNNPB (a kind of Recurrent Neural Network) [Tani][Ogata]
- Generation of arbitrary motions is difficult
– Self-organizing map for motion [Okada]
- Only periodic motions are discussed
Approach
- Recognition, generation and abstraction of patterns
– Bi-directional model of recognition and generation
– Imitation learning system for humanoid robots
- Motion primitives: decomposition and composition
– Association of sensory patterns from motion patterns
– Imitation of unknown motions
- Conversion between patterns and symbols
– Assignment of primitives using state points in phase space
Mutual conversion model between sensorimotor patterns and symbols by proto‐symbol space
[Inamura 03~]
Geometric representation of sensorimotor patterns
(Figure: proto-symbols such as Stretch, Walk, Kick, Throw, Squat and Stoop bridge the physical, continuous world and the symbolic, discrete world)
- Sensorimotor patterns are assigned as static points
- The configuration is defined by similarities among patterns
- Internal/external division of the space generates various patterns
Construction of proto-symbol space
Placement in Euclidean space based on the pseudo distances between proto-symbols
[Inamura ICHR03, IROS2006]
(Figure: proto-symbols such as Walk, Run and Kick are modeled as HMMs; the pseudo distance between them is the Bhattacharyya distance; Multi-Dimensional Scaling (MDS), an optimization method that minimizes the error between the Euclidean distances and the pseudo distances, places them in the proto-symbol space)
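The MDS placement step can be sketched in a few lines. Below is a minimal classical-MDS illustration (not the authors' code); the matrix `D` stands in for the Bhattacharyya pseudo distances between three proto-symbol HMMs, and its values are invented for illustration.

```python
import numpy as np

# Hypothetical pseudo-distance matrix between three proto-symbols
# (e.g. Walk, Run, Kick); values are illustrative only.
D = np.array([[0.0, 1.0, 4.0],
              [1.0, 0.0, 3.0],
              [4.0, 3.0, 0.0]])

def classical_mds(D, dim=2):
    """Embed points in Euclidean space so that pairwise distances
    approximate the given (pseudo) distance matrix."""
    n = D.shape[0]
    J = np.eye(n) - np.ones((n, n)) / n        # centering matrix
    B = -0.5 * J @ (D ** 2) @ J                # double-centered Gram matrix
    w, V = np.linalg.eigh(B)                   # eigendecomposition
    idx = np.argsort(w)[::-1][:dim]            # keep the largest eigenvalues
    L = np.sqrt(np.maximum(w[idx], 0.0))       # guard tiny negative eigenvalues
    return V[:, idx] * L                       # coordinates in the symbol space

coords = classical_mds(D, dim=2)
# The embedded Euclidean distances approximate the input pseudo distances.
```

For a perfectly Euclidean `D` (as in this toy example) the embedding reproduces the distances exactly; for real Bhattacharyya pseudo distances MDS only minimizes the residual error.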
Realtime behavior imitation via symbol space representation
- Not a simple copy
Motion abstraction/recognition by Hidden Markov Models

A continuous Hidden Markov Model with states $q_1, q_2, \ldots, q_{N-1}, q_N$ abstracts a motion pattern. Its parameters are

$\lambda = \{ a_{ij}, b_i(\boldsymbol{o}) \}$

- $a_{ij}$ : state transition probabilities ($a_{11}, a_{12}, a_{22}, \ldots, a_{NN}$)
- $b_i(\boldsymbol{o}) = \sum_{j=1}^{M} c_{ij} \, \mathcal{N}(\boldsymbol{o}; \boldsymbol{\mu}_{ij}, \Sigma_{ij})$ : output probabilities (a mixture of $M$ Gaussians per state)

Abstraction: the HMM parameters $\lambda$ form a proto-symbol.
Recognition: the likelihood $P(O \mid \lambda)$ of an observation sequence $O$ (joint angle vectors $\theta$) is used to recognize a motion pattern among the candidate categories (proto-symbols).
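The recognition step, scoring an observation sequence by $P(O \mid \lambda)$ and picking the best proto-symbol, can be sketched with the standard forward algorithm. This is a simplified illustration, not the original system: one-dimensional observations, single-Gaussian outputs, and the toy models `walk` / `kick` with invented parameters.

```python
import numpy as np

def gaussian_pdf(o, mu, sigma):
    """1-D Gaussian output probability b_i(o)."""
    return np.exp(-0.5 * ((o - mu) / sigma) ** 2) / (sigma * np.sqrt(2.0 * np.pi))

def log_likelihood(obs, A, mus, sigmas):
    """Forward algorithm: log P(O | lambda) for an HMM starting in state 0.
    A: (N, N) transition matrix; mus/sigmas: per-state Gaussian parameters."""
    N = A.shape[0]
    alpha = np.zeros(N)
    alpha[0] = gaussian_pdf(obs[0], mus[0], sigmas[0])
    for o in obs[1:]:
        b = np.array([gaussian_pdf(o, mus[i], sigmas[i]) for i in range(N)])
        alpha = (alpha @ A) * b          # alpha_t(j) = sum_i alpha_{t-1}(i) a_ij b_j(o_t)
    return np.log(alpha.sum() + 1e-300)  # tiny floor avoids log(0) on underflow

# Two toy proto-symbols with 2 states each (illustrative parameters)
A = np.array([[0.7, 0.3],
              [0.0, 1.0]])               # left-to-right transitions
walk = dict(mus=[0.0, 1.0], sigmas=[0.3, 0.3])
kick = dict(mus=[3.0, 4.0], sigmas=[0.3, 0.3])

obs = np.array([0.1, 0.0, 0.9, 1.1])     # sequence resembling "walk"
scores = {name: log_likelihood(obs, A, p["mus"], p["sigmas"])
          for name, p in [("walk", walk), ("kick", kick)]}
best = max(scores, key=scores.get)       # recognized proto-symbol
```

A production system would work in log space throughout (or scale `alpha` per step) to avoid underflow on long joint-angle sequences.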
Motion synthesis by proto‐symbol synthesis
- Time‐domain synthesis by expected duration
- Space‐domain synthesis by Gaussians

Expected duration at node $i$, calculated from the self-transition probability $a_{ii}$:

$s_i = \sum_{n=1}^{\infty} n \, a_{ii}^{\,n-1} (1 - a_{ii}) = \dfrac{1}{1 - a_{ii}}$

Expected duration at node $i$ of the HMM synthesized from $m$ HMMs with mixing ratios $c_1, \ldots, c_m$:

$\hat{s}_i = \sum_{j=1}^{m} c_j s_i^{(j)}$

The output probability is modeled by a single Gaussian, $b_i = N(\mu_i, \sigma_i^2)$; the mean vector and covariance matrix are the targets of synthesis:

$\hat{\mu}_i = \sum_{j=1}^{m} c_j \mu_i^{(j)}, \qquad \hat{\sigma}_i^2 = \sum_{j=1}^{m} c_j \left( \sigma_i^{(j)} \right)^2$
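The synthesis equations above can be sketched as follows. This is a minimal illustration assuming per-node self-transition probabilities and single-Gaussian outputs; the two toy proto-symbols and all numeric values are invented.

```python
import numpy as np

def synthesize(hmms, c):
    """Mix m HMMs' per-node parameters with ratios c (summing to 1 for
    interpolation; outside [0, 1] for extrapolation/emphasis).
    Each HMM: dict with self-transition probs 'a', means 'mu', stds 'sigma'."""
    c = np.asarray(c)
    # expected duration at node i: s_i = 1 / (1 - a_ii)
    s = np.array([1.0 / (1.0 - h["a"]) for h in hmms])        # shape (m, N)
    s_hat = c @ s                                             # synthesized durations
    mu_hat = c @ np.array([h["mu"] for h in hmms])            # synthesized means
    var_hat = c @ np.array([h["sigma"] ** 2 for h in hmms])   # synthesized variances
    # note: strong extrapolation can drive var_hat negative; guard in practice
    return s_hat, mu_hat, np.sqrt(var_hat)

# Two toy proto-symbols with 2 nodes each (illustrative values)
squat = dict(a=np.array([0.5, 0.5]), mu=np.array([0.0, 1.0]), sigma=np.array([0.2, 0.2]))
punch = dict(a=np.array([0.8, 0.8]), mu=np.array([2.0, 3.0]), sigma=np.array([0.4, 0.4]))

# 50/50 interpolation: 0.5 * squat + 0.5 * punch
s, mu, sigma = synthesize([squat, punch], [0.5, 0.5])
# extrapolation from squat toward punch: -0.5 * squat + 1.5 * punch
s_e, mu_e, _ = synthesize([squat, punch], [-0.5, 1.5])
```

The same mixing ratios serve both interpolation and the emphasis used later in the coaching experiment.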
Interpolation & extrapolation of proto-symbols [Inamura IROS'08]
- Interpolation: 0.5 * punch + 0.5 * squat
- Extrapolation: from squat toward punch (coefficient 1.0 → 1.5)
Experiment environment
- Combination of immersive VR (surrounding display) and a motion capturing system
- Interaction system between a human and a virtual agent with dynamic whole-body motion
- Applied to the coaching system
Experiment conditions
- Target motion: tennis swing motion
- 5 subjects (tennis beginners)
- Output of HMM: joint angles of all joints
- Proto-symbol space is constructed from two motions:
1) beginner's motion  2) target motion by an expert
- 3 coaching strategies, varying:
– Coefficient of emphasis
– Verbal expression [on/off]
Target motion shown to the beginner
Performed motion by the player
- Not a good motion: the knee is not bent, the right elbow should be lower, and so on
Generated emphasized motion by the coaching system
- ‐0.5 x [beginner motion] + 1.5 x [target motion]
- "Not like the previous motion" / "Please follow more like this motion"
(Figure: beginner motion, target motion, and the emphasized motion generated with coefficients ‐0.5 and 1.5)
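The emphasized motion above is a weighted mixture whose coefficients sum to 1 but lie outside [0, 1]. Here is a simplified sketch that mixes joint-angle frames directly (the system itself mixes HMM parameters in the proto-symbol space); the function name and angle values are illustrative assumptions.

```python
import numpy as np

def emphasize(beginner, target, alpha):
    """Emphasized motion = (1 - alpha) * beginner + alpha * target.
    alpha = 1.0 reproduces the target; alpha > 1.0 exaggerates the
    difference away from the beginner's motion."""
    return (1.0 - alpha) * beginner + alpha * target

# Hypothetical joint angles (degrees) for one frame
beginner = np.array([10.0, 20.0])   # e.g. knee, right elbow
target = np.array([30.0, 40.0])

plain = emphasize(beginner, target, 1.0)   # just the target motion
emph = emphasize(beginner, target, 1.5)    # -0.5 * beginner + 1.5 * target
```

With alpha = 2.0, as in the experiment's conditions 2 and 3, the mixture becomes -1.0 x [beginner] + 2.0 x [target], pushing the demonstration even further from the player's faulty motion.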
3 conditions for evaluation
1. Only showing the target motion (without emphasis)
– Regardless of the player's performance
– α = 1.0, no verbal expression
2. Showing emphasized motion (without verbal expression)
– Emphasized motion is shown to the player
– α = 2.0, without verbal expression
3. Showing emphasized motion and using verbal expression
– If the error was bigger, verbal expression is added
– α = 2.0
Evaluation result (Ave. error of imitation)
- Distance (error) between the l-th performance and the target motion in the proto-symbol space, plotted against trial # (l)
- i: index of subject; m: number of subjects (m = 5); l: trial number (l = 1, 2, 3, 4)
Evaluation (cont.: error ratio)
- Ratio of the error of the l-th performance to the error of the initial performance, plotted against trial # (l)
- i: index of subject; m: number of subjects (m = 5); l: trial number (l = 1, 2, 3, 4)
- More than 1: error increasing; less than 1: error decreasing; 0: perfect imitation
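The two evaluation measures, the average imitation error per trial and the per-subject error ratio relative to the first trial, can be computed as below. The error values here are invented placeholders, not the experiment's data.

```python
import numpy as np

# Hypothetical error distances in proto-symbol space:
# rows = subjects (m = 5), columns = trials (l = 1..4)
errors = np.array([[4.0, 3.0, 2.5, 2.0],
                   [5.0, 4.0, 3.5, 3.0],
                   [3.0, 2.0, 1.5, 1.0],
                   [6.0, 5.0, 4.0, 3.5],
                   [4.0, 3.5, 3.0, 2.5]])

avg_error = errors.mean(axis=0)      # average error of imitation per trial
ratio = errors / errors[:, :1]       # l-th trial error / initial error, per subject
avg_ratio = ratio.mean(axis=0)       # < 1 means the players are improving
```

The ratio normalizes away each subject's starting skill, so the three coaching conditions can be compared on how quickly the error shrinks.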
Conclusion
- Proposal of a coaching robot system that shows emphasized motion and uses verbal expression
- Motion emphasis and generation of verbal expression based on the proto-symbol space
- Immersive VR system for coaching evaluation
Future works
- Mutual imitation learning between human and robot