Neural Cognitive Diagnosis for Intelligent Education Systems Fei - - PowerPoint PPT Presentation

neural cognitive diagnosis for intelligent education
SMART_READER_LITE
LIVE PREVIEW

Neural Cognitive Diagnosis for Intelligent Education Systems Fei - - PowerPoint PPT Presentation

Neural Cognitive Diagnosis for Intelligent Education Systems Fei Wang, Qi Liu, Enhong Chen, Zhenya Huang, Yuying Chen, Zai Huang, Shijin Wang AAAI 2020 Anhui Province Key Lab. of Big Data Analysis and Application, University of S&T of China


slide-1
SLIDE 1

Neural Cognitive Diagnosis for Intelligent Education Systems

Fei Wang, Qi Liu, Enhong Chen, Zhenya Huang, Yuying Chen, Zai Huang, Shijin Wang

Anhui Province Key Lab. of Big Data Analysis and Application, University of S&T of China

AAAI 2020

slide-2
SLIDE 2

Background

n Cognitive Diagnosis: A fundamental task in many scenarios, especially intelligent education

slide-3
SLIDE 3

Traditional Works

n IRT, MIRT: scalar or latent vectors for students and exercises; logistic- like interaction function n difficulty, discrimination, ability n DINA: binary vectors for students and exercises; conjunctive assumption in interaction function n Q-matrix n MF: latent vectors for students and exercises; inner productive interaction function

! "#$ = 1 '#, )$, *$ = 1 1 + exp(−1.7)$ '# − *$ ) Skill proficiency Discrimination Difficulty Skill proficiency vector Slip Guess

! "45 = 1 64 = (1 − 7

5)89:;5 <=89:

! "#$ = 1 '#, *$ = '# > *$

slide-4
SLIDE 4

l Problems in the interaction functions: l manually designed à labor intensive l mostly linear à limited approximation ability l simplistic assumptions à restricted scope of applications It is urgent to find an automatic way to learn the complex interactions for cognitive diagnosis.

Learn the interaction function with neural network from data

! "#$ = 1 '#, )$, *$, +$ = , '#, )$, *$, +$ = 1 1 + ./0.234 56/74

slide-5
SLIDE 5

Challenges

n Black-box nature of neural network n difficult to get explainable diagnosis results n Leverage rich exercise text information n difficult for traditional non-neural functions n worthy of exploring with the strong ability of neural network

?

Information

slide-6
SLIDE 6

NeuralCD Framework

n Student Factors: knowledge proficiency vector !" n Exercise Factors: knowledge relevancy vector !#$ n other exercise factors !%&'() (optional): e.g., difficulty, discrimination n Interaction Function: interactive multi layers n Output: the probability that the student would correctly answer the exercise

slide-7
SLIDE 7

NeuralCD Framework n Explainable

n !" ∘ !$%: attach each entry of &' to a specific knowledge concept

Exercise ( Knowledge ) Correct P(Correct)=0.3 Too low! Optimization algorithm &

* ' ↑

n Monotonicity Assumption: The probability of correct response to the exercise is monotonically increasing at any dimension of the student’s knowledge proficiency. (widely applicable)

Proficiency &' Relevancy &*+ Educational psychological assumption

slide-8
SLIDE 8

NeuralCDM

n Feasible and effective – basic implementation with Q-matrix input layer: !" ∘ (%& − %()**)×ℎ()&. /01 ∘ /&

Directly from Q-matrix

slide-9
SLIDE 9

NeuralCDM

n Feasible and effective – basic implementation with Q-matrix interaction layer:

  • full connection
  • positive weights

Monotonicity Assumption

slide-10
SLIDE 10

NeuralCDM

n Feasible and effective – basic implementation with Q-matrix

  • utput layer:
slide-11
SLIDE 11

Generality of NeuralCD

n NeuralCD framework is general and can cover some traditional models n e.g., IRT, MIRT, MF

Fix to 1

Multidimensional degraded to unidimensional

Multi-layer degraded to a single Sigmoid

slide-12
SLIDE 12

Generality of NeuralCD

n NeuralCD framework is general and can cover some traditional models n e.g., IRT, MIRT, MF

Where Q is learned instead

  • f labeled by experts. There

is no explicit meaning of each dimension in Q. Fix to 1 Multi-layer degraded to a summation and a single Sigmoid NeuralCD MIRT

slide-13
SLIDE 13

NeuralCDM+

n Extendible – refine Q-matrix with exercise texts n pre-train a CNN to predict knowledge concepts of the input exercise n combine with Q-matrix through a partial order probabilistic scheme: knowledge relevancy: Q-matrix >= predicted > other = 0

原来的 原来的!"# 预测知识点 预测知识点 优化后的 优化后的!"#

slide-14
SLIDE 14

Experiments n Datasets n Student performance prediction

Best n Math: Zhixue1, mathematical exercises (with texts) and logs n ASSIST: Assistment2, mathematical exercises (without texts) and logs

1Private dataset, provided by iFLYTEK Co., Ltd. 2https://sites.google.com/site/assistmentsdata/home/assistment-2009-2010-data/skill-builder-data-2009-2010

slide-15
SLIDE 15

Experiments n Model interpretation

n If student ! has a better mastery on knowledge concept " than student #, then ! is more likely to answer exercises related to " correctly than #. Higher DOA: students who perform well on certain knowledge concept get higher diagnosed knowledge proficiency

slide-16
SLIDE 16

Experiments

n Case study n a student’s performance on 3 exercise in ASSIST n and his/her diagnosed result Q-matrix Logs

Student Knowledge Proficiency (bars) Exercise Knowledge Difficulty (points)

The student is more likely to response correctly when his/her proficiency satisfies the requirement of the exercise.

slide-17
SLIDE 17

Conclusion

n We propose a neural cognitive diagnostic framework: NeuralCD n student and exercise factors, neural network interaction layers n monotonicity assumption n Feasibility: NeuralCDM with Q-matrix n Extendibility: NeuralCDM+ with refined Q-matrix that leverages exercise texts n Generality: covers some traditional models n Effective and explainable: experiments on two real-world datasets Code for NeuralCDM is available at https://github.com/bigdata-ustc/NeuralCD

slide-18
SLIDE 18

Thank you for listening

Q & A