Choosing Sample Size for Knowledge Tracing Models
DERRICK COETZEE
Choosing Sample Size for Knowledge Tracing Models DERRICK COETZEE - - PowerPoint PPT Presentation
Choosing Sample Size for Knowledge Tracing Models DERRICK COETZEE Motivation BKT parameters are inferred from data But best solution for a given data set may not quite match the parameters that actually generated it ( sampling error )
DERRICK COETZEE
the parameters that actually generated it (sampling error)
0,0,0,0,0 0,0,0,0,0 0,1,1,0,1 0,1,0,0,0 0,0,1,1,0 5 students, 5 problems each, 25 bits of data prior = 0.205 learning = 0.010 guess = 0.142 slip = 0.031 4 parameters, 3 decimal digits each, 39.9 bits of data
Not even possible for all parameter sets to be represented!
generating value
describe variation of estimates
confidence intervals
differently
near zero/one, worst in 05-0.8 range
stddevs for each parameter
approach probably infeasible
with small rates of change
will not give even one sigfig in all parameters
classes!
parameters → predictable variation in error
real-world data sets:
parameters for each, compute stddev
DERRICK COETZEE
learning rate parameter to capture all learning
materials viewed
each item (no interactions with others)
Do exercise Learn independently Interact with partner Do exercise Learn independently
Knows Other student knows Probability knows after interaction No No Yes Yes 1 No Yes teach Yes No 1−mislead
incorrect responses
parameters absorb some of the teach:
learn=0.1586, guess=0.1648, slip=0.0856, teach=0.6481
learn=0.1643, guess=0.1940, slip=0.1102, teach=0.7225
students and high teach
prior=0.2184, learn=0.0841, guess=0.1239, slip=0.2658, teach=0.8793
independent learning and interaction (more observed data)
knowledge flows in only one direction
combine parameters to create lower-dimensional space
predictions on synthetic data
tool
parameters?