A Bayesian Model of Imitation in Infants and Robots
Rajesh Rao, Aaron Shon and Andrew Meltzoff (2004)
Presented by Micha Elsner
How we gain new skills
- Maturation (more neurons, muscle power &c)
- Reinforcement learning (“trial and error”)
– Behaviorists (Skinner, Watson)
- Independent invention and discovery
– Piaget's theory: children are “little philosophers”
who learn abstract principles from experience
- Imitation
– More flexible than maturation
– More efficient than discovery
Piaget's learning
- Relies on two processes:
– assimilation: applies a known behavior (schema) in
a new way... “grab the rattle” --> “grab the watch”
– accommodation: adapts a known behavior to new
circumstances... applies when assimilation fails “grab the rattle” --> “grab the beach ball” --> “squeeze the beach ball”
- Stages of cognitive development
– Some dominated by assimilation, some by
accommodation
info from http://webspace.ship.edu/cgboer/piaget.html Prof. George Boeree, Shippensburg Univ.
Constructivism
- Knowledge is 'constructed' from a combination
of experience and innate principles.
– Representations of the world are iteratively
improved as they become inadequate
– For instance, children have to learn 'conservation of
mass' and 'object permanence'
- Something like our usual unsupervised learning
– Clustering
– Rule inference / latent-variable modeling
info from “Basing Teaching on Piaget's Constructivism”, Constance Kamii and Janice Ewing, '96
How humans learn to imitate
- Famous four-stage model due to Meltzoff
- Body babbling
- Imitating body movements
- Imitating actions with objects
- Imitating intentional actions
Body babbling
- Repetitive, undirected movements
- Learn a mapping between nerve impulses and
body state
- Begins in utero
- Relies on innate proprioception
(ability to know where one's body parts are)
- But the mapping isn't innate!
Imitating body movements
- Infants begin to do this right after birth
– Uses the mapping between nerve impulses and
body state learned from babbling.
– Also requires map from visual system (observed
state) to proprioception (own state)
Imitation with objects
- More complex dynamics
– Using an object, not just the body
– Starts at about 1 year old.
– Not really modeled in this paper!
- A famous experiment
– Adult 'teacher' presses a button with his forehead
– Infants imitate him
Intentionality
- Full 'imitation' is not just mimicry
– Learner may have different actions than teacher
– Has to reach the same goal in a different way
– (cf. pendulum upswing, Atkeson & Schaal)
- Starts at about 18 months.
– Understand that humans have intentions
– Learn from a demonstrator who makes 'mistakes'
Learning framework
- Uses an MDP-like structure.
- What we won't cover:
– perception (inferring our state from observations;
proprioception)
– correspondence (inferring someone else's state
from our observations)
– discretization (clustering states and actions)
– intention recognition (learning a useful prior over
goal states)
- Discrete states s, actions a and time t.
- Define 'imitation' as 'following a memorized
trajectory' s1 -> s2 -> s3 ... sg
– Isn't this just mimicry?
- We need a way (an inverse model) to get us from
state st to the desired next state st+1 on the way to the goal sg.
- Optimal action selection is deterministic (MAP
solution).
– Humans sometimes use probability matching.
Markov representation
Forward model
- Maps state, action to next state:
– p(st+1 | st, at)
- Learned from exploring the state-space at
random
– Body babbling
– Supervised process (assuming proprioception)
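Since the babbling data comes with proprioceptive labels, learning the forward model reduces to counting transitions. A minimal sketch with made-up states and actions (the paper's actual state/action discretization is not specified here):

```python
from collections import defaultdict

def learn_forward_model(babbling_data):
    """Estimate p(s_next | s, a) from (s, a, s_next) triples gathered
    during random exploration ("body babbling")."""
    counts = defaultdict(lambda: defaultdict(int))
    for s, a, s_next in babbling_data:
        counts[(s, a)][s_next] += 1
    # Normalize counts into conditional probabilities.
    forward = {}
    for sa, nexts in counts.items():
        total = sum(nexts.values())
        forward[sa] = {sn: c / total for sn, c in nexts.items()}
    return forward

# Toy babbling data: from state 0, action "right" usually leads to state 1.
data = [(0, "right", 1), (0, "right", 1), (0, "right", 0), (1, "left", 0)]
model = learn_forward_model(data)
```

Here `model[(0, "right")]` assigns probability 2/3 to reaching state 1, matching the empirical frequencies.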
Inverse model
- p(at | st, st+1, sg) : probability that an action is
chosen
– given the desired next state, and the goal
- p(at | st, st+1, sg) ∝ p(st+1 | st, at) * p(at | st, sg)
– assuming that the forward model is independent of
the goal state
– the prior over actions is learned (max likelihood or
MAP) from observing the teacher
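The proportionality above turns directly into code once the forward model and action prior are in hand. A sketch with hypothetical probability tables and a uniform prior standing in for the teacher-derived one:

```python
def inverse_model(forward, action_prior, s_t, s_next, s_g, actions):
    """Compute p(a | s_t, s_next, s_g) ∝ p(s_next | s_t, a) * p(a | s_t, s_g)."""
    scores = {}
    for a in actions:
        likelihood = forward.get((s_t, a), {}).get(s_next, 0.0)
        scores[a] = likelihood * action_prior(a, s_t, s_g)
    total = sum(scores.values())
    if total == 0:
        # No action explains the transition; fall back to the prior alone.
        return {a: action_prior(a, s_t, s_g) for a in actions}
    return {a: v / total for a, v in scores.items()}

def pick_action(posterior):
    """Deterministic (MAP) action selection."""
    return max(posterior, key=posterior.get)

# Hypothetical forward model: "right" usually moves 0 -> 1, "left" rarely does.
forward = {(0, "right"): {1: 0.9, 0: 0.1}, (0, "left"): {1: 0.1, 0: 0.9}}
uniform = lambda a, s, g: 0.5
post = inverse_model(forward, uniform, 0, 1, "goal", ["right", "left"])
```

Probability matching, as mentioned below, would instead sample an action from `post` rather than always taking the argmax.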
Welcome to mazeworld
Simple stochastic transition model (easy to learn)
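One common way to realize such a model (an assumption here, not necessarily the paper's exact dynamics): each intended move succeeds with high probability, and otherwise the agent stays put; moves into walls always fail.

```python
import random

def maze_step(pos, action, walls, p_success=0.9):
    """Stochastic mazeworld transition: the intended move succeeds with
    probability p_success; otherwise the agent stays in place."""
    moves = {"up": (0, -1), "down": (0, 1), "left": (-1, 0), "right": (1, 0)}
    dx, dy = moves[action]
    target = (pos[0] + dx, pos[1] + dy)
    if target in walls or random.random() > p_success:
        return pos
    return target

# Moving right from (0, 0) into a wall at (1, 0) never succeeds.
assert maze_step((0, 0), "right", {(1, 0)}) == (0, 0)
```

Because the transitions are simple and local, the forward model for this world is easy to learn from random exploration.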
Learning the maze
(Figure: demonstration trajectories vs. the learner's trajectory in the maze)
Inferring intentions
- If we have a prior over goals, we can find the
posterior over the teacher's intentions.
- No discussion of where this prior comes from.
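Bayes' rule gives the posterior over goals by scoring each candidate goal against the observed (state, action) pairs. A minimal sketch with a uniform prior and a made-up action likelihood (since the paper leaves the prior's origin open):

```python
def goal_posterior(trajectory, goals, goal_prior, action_likelihood):
    """p(g | trajectory) ∝ p(g) * product over t of p(a_t | s_t, g)."""
    scores = {}
    for g in goals:
        p = goal_prior[g]
        for s, a in trajectory:
            p *= action_likelihood(a, s, g)
        scores[g] = p
    total = sum(scores.values())
    return {g: v / total for g, v in scores.items()}

# Toy 1-D world: the teacher usually moves toward its goal.
def likelihood(a, s, g):
    return 0.8 if (a == "right") == (g > s) else 0.2

traj = [(0, "right"), (1, "right")]
post = goal_posterior(traj, goals=[-1, 3],
                      goal_prior={-1: 0.5, 3: 0.5},
                      action_likelihood=likelihood)
```

After watching two rightward moves, the posterior strongly favors the goal at 3 over the goal at -1.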
Conclusions
- The authors propose to get some robots and
test using real data.
– I imagine they'll have problems.
– It looks like this is just the simple part...
- But the human model has some interesting
elements
– Intention recognition would certainly help out