Using Inverse Planning for Personalized Feedback Anna N. Rafferty - - PowerPoint PPT Presentation

β–Ά
using inverse planning for personalized feedback
SMART_READER_LITE
LIVE PREVIEW

Using Inverse Planning for Personalized Feedback Anna N. Rafferty - - PowerPoint PPT Presentation

Using Inverse Planning for Personalized Feedback Anna N. Rafferty Computer Science Department, Carleton College Rachel A. Jansen Thomas L. Griffiths Department of Psychology, University of California, Berkeley Using Data for Personalization


slide-1
SLIDE 1

Using Inverse Planning for Personalized Feedback

Anna N. Rafferty

Computer Science Department, Carleton College

Rachel A. Jansen Thomas L. Griffiths

Department of Psychology, University of California, Berkeley

slide-2
SLIDE 2

Using Data for Personalization

Algorithm

? Provide experience X

slide-3
SLIDE 3

Outline

  • Inverse planning: Diagnosing misunderstandings

about equation solving

  • Developing personalized feedback based on

diagnosis

  • Testing effectiveness of personalized feedback
  • Future directions
slide-4
SLIDE 4

Interpreting Equation Solving: Bayesian Inverse Planning

Θ = space of possible understandings p(ΞΈ | equations) Algebra skills (πœ„1) Algebra skills (πœ„2)

slide-5
SLIDE 5

Representing Understanding: Θ

Conceptual Mal-rules 1+3x => 4x 3(2+5x) => 6+5x Arithmetic 1+5.9x+3.2x => 1+8.1x

  • 3+5+x => -2+x

Planning 3x+5x+4 = 2 => 3x+4 = -5x+2

e.g., Sleeman, 1984; Payne & Squibb, 1990; Koedinger & MacLaren,1997

θ ∈ Θ: 6-dimensional vector of parameters related to skill

slide-6
SLIDE 6

Bayesian Inverse Planning

p(ΞΈ | equations) Algebra skills (πœ„1) Algebra skills (πœ„2)

slide-7
SLIDE 7

Bayesian Inverse Planning

{

Prior

Prior: Encode information about what misunderstandings are common

Likelihood

{

p(ΞΈ | equations) ∝ p(ΞΈ)p(equations | ΞΈ) Algebra skills (πœ„2) Algebra skills (πœ„1)

slide-8
SLIDE 8

Bayesian Inverse Planning

Likelihood: What is the probability of the observed data if the learner has a particular understanding?

{

Prior Likelihood

{

p(ΞΈ | equations) ∝ p(ΞΈ)p(equations | ΞΈ) Algebra skills (πœ„2) Algebra skills (πœ„1)

slide-9
SLIDE 9

Generative Model of Equation Solving: Markov Decision Processes

2 + 3x = 6 3x = 6 + 2 3x = 8 Move 2 to right side Combine 6 and 2 Divide both sides by 3 ...

πœ„ affects what actions are considered and transition probabilities for actions.

slide-10
SLIDE 10

How are Actions Chosen?

Assume a noisily optimal policy: p(a | s) ∝ exp(θβ · Q(s, a))

Q(s, a) = X

s02S

p(s0|s, a) R(s, a) + Ξ³ X

a02A

p(a0|s0)Q(s0, a0) !

Long term expected value:

2 + 3x = 6 3x = 6 + 2 3x = 8 Move 2 to right side Combine 6 and 2 Divide both sides by 3 ...

slide-11
SLIDE 11

Inverse Planning Overview

5 + 9 = 6.0x + 2.0x + 10.0[1 + 1 + 7.0x] 5 + 9 = 6.0x + 2.0x + 10 + 10 + 70.0x 5 + 9 = 6.0x + 2.0x + 20 + 70.0x 14 = 6.0x + 2.0x + 20 + 70.0x 5 + 9 = 6.0x + 2.0x + 10.0[1 + 1 + 7.0x] 5 + 9 = 6.0x + 2.0x + 10 + 10 + 70.0x 5 + 9 = 6.0x + 2.0x + 20 + 70.0x 14 = 6.0x + 2.0x + 20 + 70.0x 14 = 76.0x + 2.0x + 20.0 14 = 78.0x + 20.0 14 + βˆ’20 = 78.0x βˆ’7 = 78.0x βˆ’ 7 78 = 1x

Arithmetic error parameter Action planning parameter Distributive property error parameter Move term error parameter

. . .

Representation of understanding (Θ) Model of equation solving as a (parameterized) MDP Infer posterior probability over Θ (MCMC)

0.5 1 0.5 1 Arithmetic Error Parameter Probability Value

Arithmetic Error Parameter

Probability Value

slide-12
SLIDE 12

Output for One Learner

How do we turn this into a feedback activity?

0 0.5 1

Value

0.5

Probability Move

1

Value

0.5

Probability Combine

0 0.5 1

Value

0.5

Probability Divide

0 0.5 1

Value

0.5

Probability Distributive

2 4

Value

0.5

Probability Planning

0 0.5 1

Value

0.5

Probability Arithmetic

slide-13
SLIDE 13

Feedback Activities

Overview of skills and assessment Text explanation and video from Khan Academy Targeted practice with fading scaffolding

slide-14
SLIDE 14

Testing Personalized Feedback

Session 1: Website Problem Solving and Multiple Choice Test Session 3: Website Problem Solving and Multiple Choice Test Session 2: Feedback Activity

slide-15
SLIDE 15

Results: Changes in Performance Across Sessions

Targeted Feedback Random Feedback 6 12 18 24

Score Accuracy Improvements by Time and Condition

Before Feedback After Feedback

Targeted Feedback

Accuracy Improvements by Time and Condition

Random Feedback

Accuracy Improvements by Time and Condition

Before Feedback After Feedback

Accuracy Improvements by Time and Condition

Before Feedback After Feedback

Reliable improvement, but no difference in amount

  • f improvement across conditions.
slide-16
SLIDE 16

Performance Based on Proficiency Level of Feedback Skill

Skill level > 0.85 Skill level < 0.85 6 12 18 24

Score Accuracy Improvements by Time and Level of Skill

Before Feedback After Feedback

slide-17
SLIDE 17

Performance Change for Participants with Varying Skill Levels

Random Feedback Targeted Feedback 6 12 18 24

Score Accuracy Improvements by Time and Condition for Participants with Some Mastered and Some Unmastered Skills

Before Feedback After Feedback

Reliable difference in amount of improvement by condition.

slide-18
SLIDE 18

Contributions and Next Steps

  • Personalization using inverse planning is helpful for

learners who struggle with only some skills

  • Provides an applied metric assessing the algorithm
  • Next steps:
  • Greater specificity and more interactivity in

feedback

  • Longer term interventions
slide-19
SLIDE 19

Thank you!

Acknowledgements: Thank you to students Jonathan Brodie and Sam Vinitsky for programming contributions. Funding: This work is supported by NSF grant number DRL-1420732.

Contact: Anna Rafferty, arafferty@carleton.edu

slide-20
SLIDE 20

Skill Proficiencies by Participant

1 2 3 4 5 Number of skills with proficiency < 0.85 0.2 0.4 0.6 0.8 1 Proportion of participants

slide-21
SLIDE 21

Markov Decision Processes

a1 a2 a3 s1 s2 s3 ...

Actions:

  • move a term
  • multiply or divide by a constant
  • combine two terms
  • distribute a coefficient
  • stop solving