Bayesian Networks and ITS
1
Bayesian Networks and ITS Overview Knowledge acquisition is hard - - PowerPoint PPT Presentation
1 Bayesian Networks and ITS Overview Knowledge acquisition is hard in general, 2 and not well understood. It is time consuming, when everything is to be hand-coded. Can the machine automatically gather the needed information?
1
2
Core System
Inputs Other Observables Quality? Control Box What to Do? Changes
Rote Learning (being told) Fully Automated Discovery Rule Induction Neural Networks Reinforcement Learning Example Based Learning Inductive Logic Programming Version Space Learning Case Based System …......
– Correct knowledge – concepts,
– Misconceptions
– Perturbation model
– Interventions
– Machine learning?
– Learning!
– What kind of uncertainty? – What kind of model?
Fuzzy Logic Certainty Factors Probability models – Bayesian networks Non-numeric Models Non-monotonic Logics & reasoning Dependency Networks Dempster Shafer theory
– Bayesian networks
Probability
Conditional probability
Product rule
Bayes’ Rule:
11
12
P(Cavity|T
P(Cavity ∧ T
P(T
P(C|T) = ?
13
– Unless....
– But P(B|A) may be easier
15
– If n Boolean variables are independent, the
– => all else are conditionally
– P(~X) = 1 – P(X)
Each concept is represented by a node in
A directed edge from one concept to
19
20 CPD for For-loop
21
22
23
25
– Human expert creates the structure and the probability values – Can guess, where real value not available. – Hidden nodes are a problem!
– Use population data from real trial, etc – Approaches vary on what is constructed from data.
– A combination, using domain knowledge to increase
efficiency.
– Given leaf nodes, predict prob of intermediate or
– Given root nodes, etc predict prob of intermediate
– Sibling propagation – earthquake knowledge helps
– domain-general knowledge: encompassing
– Need to stay across sessions. – task-specific knowledge: encompassing
– Can be removed at end of task.
The domain-general part of the stud. model consists of
Rule nodes Context-Rule nodes
A student has mastered a rule when he/she is able to apply it
Rule nodes have binary values T and F, indicating the
Context-Rule nodes represent mastery of physics rules in
28
29
– Fact, – Goal, – Rule-application and – Strategy nodes
30
hence a different Bayesian network.
– Fully propositional!
distinct and share no nodes.
are set to the probabilities of the domain general nodes from the network of the preceding exercise.
32
Strategy nodes represent points where
These are the only non-binary nodes in
The node is always paired with a Goal
33
34
35
– “Guess” probability?
– Use of other related elements can help.
– Sharing credit among them...
– To capture “leak” knowledge such as guess, slips,
– Even if I know all conditions and the rule, I may not
– All can benefit from a probabilistic student model.
– probability not appropriate to capture – tall, short, warm, etc
– Not easy to provide complete interdependencies – If clothes are dirty, detergent = high – If cloth_quantity is high, detergent is high
40
– Tall, medium, short, costly, cheap, etc
– x in A => x not in ~A – A U ~A = UnivSet – A ^ ~A = NullSet
membership marks poor Excellent average good
– good:70, average:50, – Can have all values also.
– r33-confid marginal (0.3), high(0.7), etc
– r33-confid adequate or not.
If the student answered the question
Similarly, if the student answered the
Determine the probability of each concept
Moreover, we can also compute p(ai=known,
46