Some applications of Bayesian networks Ji r Vomlel Institute of - - PowerPoint PPT Presentation

some applications of bayesian networks
SMART_READER_LITE
LIVE PREVIEW

Some applications of Bayesian networks Ji r Vomlel Institute of - - PowerPoint PPT Presentation

Some applications of Bayesian networks Ji r Vomlel Institute of Information Theory and Automation Academy of Sciences of the Czech Republic This presentation is available at http://www.utia.cas.cz/vomlel/ 1 Contents Brief


slide-1
SLIDE 1

Some applications of Bayesian networks

Jiˇ r´ ı Vomlel Institute of Information Theory and Automation Academy of Sciences of the Czech Republic This presentation is available at http://www.utia.cas.cz/vomlel/

1

slide-2
SLIDE 2

Contents

  • Brief introduction to Bayesian networks
  • Typical tasks that can be solved using Bayesian networks
  • 1: Medical diagnosis (a very simple example)
  • 2: Decision making maximizing expected utility (another simple

example)

  • 3: Adaptive testing (a case study)
  • 4: Decision-theoretic troubleshooting (a commercial product)

2

slide-3
SLIDE 3

Bayesian network

  • a directed acyclic graph G = (V, E)
  • each node i ∈ V corresponds to a random variable Xi with a

finite set Xi of mutually exclusive states

  • pa(i) denotes the set of parents of node i in graph G
  • to each node i ∈ V corresponds a conditional probability table

P(Xi | (Xj)j∈pa(i))

  • the DAG implies conditional independence relations between

(Xi)i∈V

  • d-separation (Pearl, 1986) can be used to read the CI relations

from the DAG

3

slide-4
SLIDE 4

Using the chain rule we have that: P((Xi)i∈V)

= ∏

i∈V

P(Xi | Xi−1, . . . , X1) Assume an ordering of Xi, i ∈ V such that if j ∈ pa(i) then j < i. From the DAG we can read conditional independence relations Xi ⊥

⊥ Xk | (Xj)j∈pa(i)

for i ∈ V and k < i and k ∈ pa(i) Using the conditional independence relations from the DAG we get P((Xi)i∈V)

= ∏

i∈V

P(Xi | (Xj)j∈pa(i)) . It is the joint probability distribution represented by the Bayesian network.

4

slide-5
SLIDE 5

Example:

X1 X2 P(X1) P(X2) P(X3 | X1) P(X4 | X2) P(X6 | X3, X4) P(X9 | X6) P(X8 | X7, X6) P(X5 | X1) P(X7 | X5) X5 X7 X3 X8 X6 X9 X4

P(X1, . . . , X9) =

=

P(X9|X8, . . . , X1) · P(X8|X7, . . . , X1) · . . . · P(X2|X1) · P(X1)

=

P(X9|X6) · P(X8|X7, X6) · P(X7|X5) · P(X6|X4, X3)

·P(X5|X1) · P(X4|X2) · P(X3|X1) · P(X2) · P(X1)

5

slide-6
SLIDE 6

Typical use of Bayesian networks

  • to model and explain a domain.
  • to update beliefs about states of certain variables when some
  • ther variables were observed, i.e., computing conditional

probability distributions, e.g., P(X23|X17 = yes, X54 = no).

  • to find most probable configurations of variables
  • to support decision making under uncertainty
  • to find good strategies for solving tasks in a domain with

uncertainty.

6

slide-7
SLIDE 7

Simplified diagnostic example

We have a patient. Possible diagnoses: tuberculosis, lung cancer, bronchitis.

7

slide-8
SLIDE 8

We don’t know anything about the pa- tient Patient is a smoker.

8

slide-9
SLIDE 9

Patient is a smoker. ... and he complains about dyspnoea

9

slide-10
SLIDE 10

Patient is a smoker and complains about dyspnoea ... and his X-ray is positive

10

slide-11
SLIDE 11

Patient is a smoker and complains about dyspnoea and his X-ray is pos- itive ... and he visited Asia recently

11

slide-12
SLIDE 12

Application 2:Decision making

The goal: maximize expected utility Hugin example: mildew4.net

12

slide-13
SLIDE 13

Fixed and Adaptive Test Strategies

wrong correct wrong wrong correct wrong wrong correct correct wrong wrong correct correct correct

Q5 Q4 Q3 Q2 Q1 Q2 Q1 Q3 Q4 Q5 Q6 Q7 Q8 Q9 Q10 Q6 Q8 Q7 Q8 Q4 Q7 Q10 Q6 Q7 Q9

13

slide-14
SLIDE 14

X3 X1 X3 X3 X2 X3 X2 X1 X2 X1 X2 X2 X3 X1 X1

For all nodes n of a strategy s we have defined:

  • evidence en, i.e. outcomes of

steps performed to get to node n,

  • probability P(en) of getting to

node n, and

  • utility f (en) being a real num-

ber. Let L(s) be the set of terminal nodes of strategy s. Expected utility

  • f

strategy is E f (s) = ∑ℓ∈L(s) P(eℓ) · f (eℓ).

14

slide-15
SLIDE 15

X3 X1 X3 X3 X2 X3 X2 X1 X2 X1 X2 X2 X3 X1 X1

Strategy s⋆ is optimal iff it maxi- mizes its expected utility. Strategy s is myopically optimal iff each step of strategy s is selected so that it maximizes expected utility after the selected step is performed (one step look ahead).

15

slide-16
SLIDE 16

Application 3: Adaptive test of basic

  • perations with fractions

Examples of tasks:

T1: 3

4 · 5 6

− 1

8

=

15 24 − 1 8 = 5 8 − 1 8 = 4 8 = 1 2

T2:

1 6 + 1 12

=

2 12 + 1 12 = 3 12 = 1 4

T3:

1 4 · 1 1 2

=

1 4 · 3 2 = 3 8

T4: 1

2 · 1 2

· 1

3 + 1 3

  • =

1 4 · 2 3 = 2 12 = 1 6 .

16

slide-17
SLIDE 17

Elementary and operational skills

CP Comparison (common nu- merator or denominator)

1 2 > 1 3 , 2 3 > 1 3

AD Addition (comm. denom.)

1 7 + 2 7 = 1+2 7

= 3

7

SB

  • Subtract. (comm. denom.)

2 5 − 1 5 = 2−1 5

= 1

5

MT Multiplication

1 2 · 3 5 = 3 10

CD Common denominator

  • 1

2 , 2 3

  • =
  • 3

6 , 4 6

  • CL

Cancelling out

4 6 = 2·2 2·3 = 2 3

CIM

  • Conv. to mixed numbers

7 2 = 3·2+1 2

= 3 1

2

CMI

  • Conv. to improp. fractions

3 1

2 = 3·2+1 2

= 7

2 17

slide-18
SLIDE 18

Misconceptions

Label Description Occurrence MAD

a b + c d = a+c b+d

14.8% MSB

a b − c d = a−c b−d

9.4% MMT1

a b · c b = a·c b

14.1% MMT2

a b · c b = a+c b·b

8.1% MMT3

a b · c d = a·d b·c

15.4% MMT4

a b · c d = a·c b+d

8.1% MC a b

c = a·b c

4.0%

18

slide-19
SLIDE 19

Student model

CP

ACD HV2 HV1 AD SB CMI CIM CL CD MT MMT1 MMT2 MMT3 MMT4 MC MAD MSB ACMI ACIM ACL

19

slide-20
SLIDE 20

Evidence model for task T1

3 4 · 5 6

  • − 1

8 = 15 24 − 1 8 = 5 8 − 1 8 = 4 8 = 1 2

T1

MT & CL & ACL & SB & ¬MMT3 & ¬MMT4 & ¬MSB

CL MMT4 MSB SB MMT3 ACL MT T1 X1

P(X1 | T1)

Hugin: model-hv-2.net

20

slide-21
SLIDE 21

Using information gain as the utility function

“The lower the entropy of a probability distribution the more we know.” H (P(X)) = −∑

x

P(X = x) · log P(X = x)

0.5 1 0.5 1 entropy probability

Information gain in a node n of a strategy IG(en)

=

H(P(S)) − H(P(S | en))

21

slide-22
SLIDE 22

Skill Prediction Quality

74 76 78 80 82 84 86 88 90 92 2 4 6 8 10 12 14 16 18 20 Quality of skill predictions Number of answered questions adaptive average descending ascending

22

slide-23
SLIDE 23

Application 4: Troubleshooting

Dezide Advisor customized to a specific portal, seen from the user’s perspective through a web browser.

23

slide-24
SLIDE 24

Application 2: Troubleshooting - Light print problem

F F3 F2 F1 F4 Faults Actions A3 A2 A1 Q1 Problem Questions

  • Problems: F1 Distribution problem, F2 Defective toner, F3

Corrupted dataflow, and F4 Wrong driver setting.

  • Actions: A1 Remove, shake and reseat toner, A2 Try another

toner, and A3 Cycle power.

  • Questions: Q1 Is the configuration page printed light?

24

slide-25
SLIDE 25

Troubleshooting strategy

A1 = no A2 = yes Q1 = no A1 = yes A2 = yes Q1 = yes A1 = yes A2 = no A1 = no A2 = no A2 Q1 A1 A2 A1

The task is to find a strategy s ∈ S minimising expected cost of repair ECR(s)

=

ℓ∈L(s)

P(eℓ) · ( t(eℓ) + c(eℓ) ) .

25

slide-26
SLIDE 26

Expected cost of repair for a given strategy

A1 = no A2 = yes Q1 = no A1 = yes A2 = yes Q1 = yes A1 = yes A2 = no A1 = no A2 = no A2 Q1 A1 A2 A1

ECR(s)

=

P(Q1 = no, A1 = yes) ·

  • cQ1 + cA1
  • +P(Q1 = no, A1 = no, A2 = yes) ·
  • cQ1 + cA1 + cA2
  • +P(Q1 = no, A1 = no, A2 = no) ·
  • cQ1 + cA1 + cA2 + cCS
  • +P(Q1 = yes, A2 = yes) ·
  • cQ1 + cA2
  • +P(Q1 = yes, A2 = no, A1 = yes) ·
  • cQ1 + cA2 + cA1
  • +P(Q1 = yes, A2 = no, A1 = no) ·
  • cQ1 + cA2 + cA1 + cCS
  • Demo: www.dezide.com Products/Demo/‘‘Try out expert mode’’

26

slide-27
SLIDE 27

Commercial applications of Bayesian networks in educational testing and troubleshooting

  • Hugin Expert A/S.

software product: Hugin - a Bayesian network tool. http://www.hugin.com/

  • Educational Testing Service (ETS)

the world’s largest private educational testing organization Research unit doing research on adaptive tests using Bayesian networks: http://www.ets.org/research/

  • SACSO Project

Systems for Automatic Customer Support Operations

  • research project of Hewlett Packard and Aalborg University.

The troubleshooter offered as DezisionWorks by Dezide Ltd. http://www.dezide.com/

27