Assistive robotics: helping with tasks for fun and profit. Neil - - PowerPoint PPT Presentation

assistive robotics helping with tasks for fun and profit
SMART_READER_LITE
LIVE PREVIEW

Assistive robotics: helping with tasks for fun and profit. Neil - - PowerPoint PPT Presentation

Assistive robotics: helping with tasks for fun and profit. Neil Bell 3 March 2016 CMSC691-HRI Who has... Five to ten year goals? 1. Finish school? 2. Pay off the car loan? 3. First (or second) child? Who has... Five to ten year goals?


slide-1
SLIDE 1

Assistive robotics: helping with tasks for fun and profit.

Neil Bell 3 March 2016 CMSC691-HRI

slide-2
SLIDE 2

Who has...

Five to ten year goals? 1. Finish school? 2. Pay off the car loan? 3. First (or second) child?

slide-3
SLIDE 3

Who has...

Five to ten year goals? 1. Finish school? 2. Pay off the car loan? 3. First (or second) child? Barriers to these goals? 1. Injury? 2. Family emergency? 3. Relative growing old, requiring assistance?

slide-4
SLIDE 4

Who has...

Five to ten year goals? 1. Finish school? 2. Pay off the car loan? 3. First (or second) child? Barriers to these goals? 1. Injury? 2. Family emergency? 3. Relative growing old, requiring assistance? Today’s focus: leveraging robotic assistance to ease the burden of extra or increasingly more difficult tasks.

slide-5
SLIDE 5

Papers du jour

1. The Domesticated Robot: Design Guidelines for Assisting Older Adults to Age in Place. J. Beer, C. Smarr, T. Chen, A. Prakash, T. Mitzner, C. Kemp,

  • W. Rogers. HRI - Living and Working with Service Robots, 2012.

2. Online Development of Assistive Robot Behaviors for Collaborative Manipulation and Human-Robot Teamwork. B. Hayes, B. Scassellati. AAAI 2014.

slide-6
SLIDE 6

Papers du jour

1. The Domesticated Robot: Design Guidelines for Assisting Older Adults to Age in Place. J. Beer, C. Smarr, T. Chen, A. Prakash, T. Mitzner, C. Kemp,

  • W. Rogers. HRI - Living and Working with Service Robots, 2012.

2. Online Development of Assistive Robot Behaviors for Collaborative Manipulation and Human-Robot Teamwork. B. Hayes, B. Scassellati. AAAI 2014.

slide-7
SLIDE 7

Aging in place

Goal: 1. Retain or enhance functionality despite age-related changes a. Cognition - less working memory b. Physical - arthritis or pain c. Perception - senses weakened, vision loss 2. Therefore, eliminate need for relocation to satisfy or substitute for goal #1.

slide-8
SLIDE 8

Adapting to barriers, measuring successful aging

Selection - Development and commitment to personal goals Reframe or update goals based on life events and changes

slide-9
SLIDE 9

Adapting to barriers, measuring successful aging

Selection - Development and commitment to personal goals Reframe or update goals based on life events and changes Optimization - Increasing odds of success Investment of time and energy to behaviors that support chosen goals

slide-10
SLIDE 10

Adapting to barriers, measuring successful aging

Selection - Development and commitment to personal goals Reframe or update goals based on life events and changes Optimization - Increasing odds of success Investment of time and energy to behaviors that support chosen goals Compensation - Regulation of loss Use mechanisms to prevent or balance age-related changes

slide-11
SLIDE 11

Compensatory mechanisms

Psychological, such as mnemonics, memory aids.

slide-12
SLIDE 12

Compensatory mechanisms

Psychological, such as mnemonics, memory aids. Technological 1. Hearing aids 2. Wheelchairs 3. Eyeglasses

slide-13
SLIDE 13

Compensatory mechanisms

Psychological, such as mnemonics, memory aids. Technological 1. Hearing aids 2. Wheelchairs 3. Eyeglasses 4. Robots

slide-14
SLIDE 14

Compensatory mechanisms

Psychological, such as mnemonics, memory aids. Technological 1. Hearing aids 2. Wheelchairs 3. Eyeglasses 4. Robots “Competence” is dynamic, capturing how a person functions in isolation. Relocation decisions depend heavily on individual’s level of competence.

slide-15
SLIDE 15

Purpose of the paper

Identifies daily upkeep tasks as having highest potential for assistive robotics.

slide-16
SLIDE 16

Purpose of the paper

Identifies daily upkeep tasks as having highest potential for assistive robotics. 1. Assess older adults’ preference for assistance from robots or humans on upkeep tasks. 2. Understand older adults’ opinions of using a home robot. 3. Consider implications of findings for directing improvement efforts for designing home assistive robots.

slide-17
SLIDE 17

Method

1. Questionnaire a. Technology experience b. Demographics, health, and current living situation

slide-18
SLIDE 18

Method

1. Questionnaire a. Technology experience b. Demographics, health, and current living situation 2. Conduct group interviews with adults ranging in age from 65 to 93.

slide-19
SLIDE 19

Method

1. Questionnaire a. Technology experience b. Demographics, health, and current living situation 2. Conduct group interviews with adults ranging in age from 65 to 93. 3. Assess familiarity with robots a. Most were familiar with concept of robots. b. Few had controlled or interacted with one.

slide-20
SLIDE 20

Method

1. Questionnaire a. Technology experience b. Demographics, health, and current living situation 2. Conduct group interviews with adults ranging in age from 65 to 93. 3. Assess familiarity with robots a. Most were familiar with concept of robots. b. Few had controlled or interacted with one. 4. Introduce participants to capabilities of Personal Robot 2 (PR2).

slide-21
SLIDE 21

Method

1. Questionnaire a. Technology experience b. Demographics, health, and current living situation 2. Conduct group interviews with adults ranging in age from 65 to 93. 3. Assess familiarity with robots a. Most were familiar with concept of robots. b. Few had controlled or interacted with one. 4. Introduce participants to capabilities of Personal Robot 2 (PR2). 5. Assistance Preference Checklist

slide-22
SLIDE 22

Personal Robot 2

slide-23
SLIDE 23

Assistance Preference Checklist

Goal: assess how participants’ preferences (robot vs. human) vary per task Process: 1. Assume robot could perform task to the level of a human 2. Imagine participant needed assistance on the given task 3. Rate preference on scale a. Human-only (1) b. No preference (3) c. Robot only (5)

slide-24
SLIDE 24

Results - cleaning

Participants preferred robots over humans in 28

  • f 48 tasks, including

many cleaning tasks. (M > 3.00 => preference for robot assistance, where 3.00 = no preference)

slide-25
SLIDE 25

Results - cleaning

Other tasks provided less decisive results. (M ≅ 3.00, where 3.00 = no preference)

slide-26
SLIDE 26

Results - fetching

Fetching tasks also more geared towards robotic assistance. (M > 3.00 => preference for robot assistance, where 3.00 = no preference)

slide-27
SLIDE 27

Interview results - robot pros and cons

Coding scheme used to identify patterns and themes from the discussion 1. Transcript segmented by the researcher 2. Segments categorized into groups: pro and con 3. Patterns emerge, identifying commonalities in the participant responses

slide-28
SLIDE 28

Pro & con examples

Pro: Compensation Time saving Delegation of undesirable task Effort saving Optimization Con: Damage to environment Dependency Mental model Reliability in the system Storage & space requirements

slide-29
SLIDE 29

Final design recommendations

Customizability Tailor behavior to user preference Interaction Cooperative robot-human effort Manipulation Level of dexterous manipulation Payload Range of weight expected to work with Range of motion Size of kinematic workspace (high/low/near/far) Storage & size Physical attributes such as footprint, height, mass.

slide-30
SLIDE 30

Papers du jour

1. The Domesticated Robot: Design Guidelines for Assisting Older Adults to Age in Place. J. Beer, C. Smarr, T. Chen, A. Prakash, T. Mitzner, C. Kemp,

  • W. Rogers. HRI - Living and Working with Service Robots, 2012.

2. Online Development of Assistive Robot Behaviors for Collaborative Manipulation and Human-Robot Teamwork. B. Hayes, B. Scassellati. AAAI 2014.

slide-31
SLIDE 31

Transition

If the previous paper can be seen as the “why”, this is the “how”. General agreement: 1. Human-robot teaming can improve efficiency, quality of life, and safety. 2. Robots: a. Provide assistance when useful. b. Do dull or undesirable tasks when possible. c. Assist with dangerous tasks when feasible Key contribution: One possible process and training model for enabling a robot to learn from demonstration.

slide-32
SLIDE 32

Vocabulary

Learning by demonstration Providing input to a learning system without complex interfaces. Novice operators can demo, and even verbally describe the process. Algorithms can be robust to inconsistencies of inexperienced trainers. Key point: Learn the human’s intent rather than solely a sequence of actions.

slide-33
SLIDE 33

Learning by demonstration

Consider the concerns that participants had from the previous paper: 1. “I keep thinking of it in terms of how it could help prepare my food but I don’t know whether robots could cook.” 2. “I can see that if it does laundry, it needs to be able to sort by color. I can see that that would be a con and it couldn’t do it.” 3. “You tell him to bring glasses, he brings you a pair of shoes.” Learning by demonstration is designed to be flexible and efficient, quick to converge, especially as new tasks are added.

slide-34
SLIDE 34

Vocabulary

Markov Decision Process (MDP) A generalized structure for efficient representation of flexible, arbitrarily complex options, capturing closed-loop policies and action sequences.

slide-35
SLIDE 35

Vocabulary

Markov Decision Process (MDP) A generalized structure for efficient representation of flexible, arbitrarily complex options, capturing closed-loop policies and action sequences. S : a set of possible states (s in S) that the agent can be in at any moment. A : a set of possible actions (a in A) that the agent can take. R (s′ | s , a) : reward for arriving at state s′ having taken action a from state s. P (s′ | s , a) : probability that taking action a from state s actually results in arriving at state s′. Quite necessary in stochastic settings (“robotics...duh”).

slide-36
SLIDE 36

Vocabulary

Markov Decision Process (MDP) It’s all about the policy , fully enumerating what actions can be taken from what states. Generally static once policy graph is defined.

slide-37
SLIDE 37

Vocabulary

Markov Decision Process (MDP) It’s all about the policy , fully enumerating what actions can be taken from what states. Generally static once policy graph is defined. Semi-Markov (SMDP) : extensible, can adapt and learn new connections.

slide-38
SLIDE 38

Vocabulary

Markov Decision Process (MDP) It’s all about the policy , fully enumerating what actions can be taken from what states. Generally static once policy graph is defined. Semi-Markov (SMDP) : extensible, can adapt and learn new connections.

slide-39
SLIDE 39

Vocabulary

Partially Observable Markov Decision Process (POMDP) Builds on MDPs by including realistic lack of perfect knowledge. Observers don’t know exactly what state the agent is in, can only take

  • bservations and reason about behavior probabilistically.
slide-40
SLIDE 40

Vocabulary

Partially Observable Markov Decision Process (POMDP) Builds on MDPs by including realistic lack of perfect knowledge. Observers don’t know exactly what state the agent is in, can only take

  • bservations and reason about behavior probabilistically.

Add two new variables to (S, A, R, P ): O : a set of observations, generally collected from states in S. Ω : conditional probability of an observation given a state and an action. Ω (o in O | s , a) = probability that o is observed given some action a from s. Looping across (s, a) tuples gives best guess (argmax).

slide-41
SLIDE 41

Application domain

Paper’s focus is on sharing workspace with assistive robot while cooperating on high-level construction & assembly tasks: Materials stabilization Materials retrieval Collaborative object manipulation Enhancing awareness Task progression guidance

slide-42
SLIDE 42

Application domain

Paper’s focus is on sharing workspace with assistive robot while cooperating on high-level construction & assembly tasks: Materials stabilization Holding magnifying glass Materials retrieval Fetch cell phone Collaborative object manipulation Open jar Enhancing awareness Recognize danger and alert Task progression guidance Prepare items for task in advance Correlates heavily with tasks necessary for successful aging in place.

slide-43
SLIDE 43

How, then?

Stepping back: Goal is to identify the behavior policy , using SMDPs to adapt and learn new connections. We can map vertices and edges to components of the task at hand.

slide-44
SLIDE 44

Merging task state with assistive behaviors

Standard SMDP directed graph: state-centric, take actions to get to new states Vertices represent possible states. 1. Robot arm position 2. Worker tool selection 3. Worker position 4. Material being worked on 5. Tool state (drill rotating, flashlight shining, ....)

slide-45
SLIDE 45

Merging task state with assistive behaviors

Standard SMDP directed graph: state-centric, take actions to get to new states Vertices represent possible states. 1. Robot arm position 2. Worker tool selection 3. Worker position 4. Material being worked on 5. Tool state (drill rotating, flashlight shining, ....) Edges represent actions to transition between states. 1. Turn on flashlight (if in a state where holding flashlight, and flashlight is off) 2. Turn off flashlight (if in a state where holding flashlight, and flashlight is on)

slide-46
SLIDE 46

Merging task state with assistive behaviors

Graph’s “dual” is swapping purpose of vertices and edges, become action-centric. Edges represent possible actions states, configuration of the workspace. 1. (flashlight = off, worker has hammer) 2. (flashlight = on, worker has hammer) 3. (flashlight = on, worker not present) Vertices represent possible states actions. 1. Turn on flashlight 2. Turn off flashlight An action-centric graph focuses on taking actions when a given state configuration is present.

slide-47
SLIDE 47

Learning assistive behaviors

Skill acquisition: provide guidance and information so the SMDP can be built. Authors use explicit, kinesthetic teaching, clarifying which attributes of the current task are critical to learn. Manipulator distance may be useful in one scenario, but not in another. Different workers have preference or styles (left vs right-handed, …) Interpretation: blend of supervised and unsupervised learning. Supervised: learning is structured and guided Unsupervised: Actual internal model is black-box, thus, POMDP.

slide-48
SLIDE 48

Learning assistive behaviors

Skill refinement: On-the-job retraining, not re-scripting like assembly line robots Task changes over time: new tools, new material, new target. Like humans, sub-optimal behavior requires immediate correction. New training data provided by kinesthetic movement or positioning. Updates are added to SMDP, updating reward function R (). Layered rewards provide normal and sub-level rewards, more granular.

slide-49
SLIDE 49

Performing assistive behaviors

Robot has a difficult role: contribute to current state while predicting next state. 1. Choose actions from set of valid actions from current state based on POMDP. 2. Optimize, minimize time spent in the current state s by reducing the reward according to the duration. Encourage progress. 3. Create list of possible assistive policies based on observed task-level policies. 4. Select policy that most lowers expected policy-wide time-in-state given all

  • bserved durations.
slide-50
SLIDE 50

Performing assistive behaviors, continued...

After policy is chosen, personalize behavior, choose a based on worker. Action rejection is valid, worker denies robot interaction, update SMDP policy. Execute behavior, leveraging kinematic calculations for pose, actions, response.

slide-51
SLIDE 51

Final contributions

Leverage POMDPs for flexible training and refinement. Robots receive on-the-job training with live human workers. Training leads to optimal policy selection and adaptation during execution.

slide-52
SLIDE 52

Fin

Final thoughts, ideas, questions?

slide-53
SLIDE 53