Optimization in decision graphs: Unfolding to decision tree

Decision graphs II
Influence Diagrams
Advanced Herd Management
Anders Ringgaard Kristensen
Slide 1

Outline
Optimization methods
• Decision tree
• Strong junction tree
• Single Policy Updating
Decision node ordering
Advantages and disadvantages of decision graphs
Slide 2

Optimization in decision graphs
Unfolding to decision tree
• Only option until Shachter (1986)
Influence diagram with “no forgetting” (like the decision tree):
• Famous article by Jensen, Jensen & Dittmer (1994):
• Strict ordering of nodes
• Creation of a “strong” junction tree
• Implemented in the Hugin software system
LImited Memory Influence Diagram (LIMID):
• Described by Lauritzen & Nilsson (2001):
• Decision nodes converted to chance nodes.
• Implemented in the Esthauge LIMID software system
Slide 3

The repeated milk test problem
The reason for testing the milk from a particular cow is to decide whether or not to pour the milk into the bulk tank:
• If the milk from an infected cow is poured into the bulk tank, the dairy will reduce the total payment by 10%.
• If the milk from the cow is not poured into the bulk tank, the value of that milk is lost.
• The farmer has 50 cows.
• Under the action “Pour”:
  • The value of the milk (if not infected) is 1000
  • The value of the milk with reduction is 900
• Under the action “Don’t pour”:
  • The value of the milk is 1000 × 49/50 = 980
  • It doesn’t matter whether or not the milk is infected
Slide 4
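The payoffs on Slide 4 imply a simple break-even rule for a single pouring decision. The sketch below works it out; the infection probability `p_infected` is a hypothetical input, since the slides give payoffs but no probabilities.

```python
# Payoffs taken from Slide 4; p_infected is a hypothetical input.
VALUE_CLEAN = 1000.0    # "Pour", milk not infected
VALUE_REDUCED = 900.0   # "Pour", milk infected (payment reduced by 10%)
VALUE_DISCARD = 980.0   # "Don't pour": 1000 * 49/50, regardless of infection


def expected_value_pour(p_infected: float) -> float:
    """Expected payment when the milk is poured into the bulk tank."""
    return (1.0 - p_infected) * VALUE_CLEAN + p_infected * VALUE_REDUCED


def best_action(p_infected: float) -> str:
    """Return the action with the higher expected payment."""
    if expected_value_pour(p_infected) >= VALUE_DISCARD:
        return "Pour"
    return "Don't pour"


# Break-even probability: 1000 - 100 * p = 980, so p = 0.2.
BREAK_EVEN = (VALUE_CLEAN - VALUE_DISCARD) / (VALUE_CLEAN - VALUE_REDUCED)

print(BREAK_EVEN)
print(best_action(0.1), "/", best_action(0.5))
```

Under these payoffs, pouring is preferable as long as the probability of infection stays below 0.2; a test result only matters if it can move that probability across the threshold.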

As a decision graph
The result of the test is known when the decision is made.
Is that enough? Let’s try!
Slide 5

Relevant past
For a decision made at time t’, the values of all variables observed at times t ≤ t’ are in principle relevant.
Moreover, all decisions made at previous time steps t < t’ may be relevant.
This observation is referred to as the “no forgetting” assumption.
Requires a strict ordering of the nodes!
Slide 6

Influence diagrams
Jensen, Jensen & Dittmer (1994)
“No forgetting” assumption:
• The value of any previously observed variable is remembered.
• Any decision made earlier is remembered.
• Graphically, this means that we must insert numerous implicit edges into the net.
• Implemented in the Hugin software system.
Slide 7

The decision graph with no forgetting
There are 13 edges into Pour7!
Slide 8
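The count of 13 edges can be reproduced with a small sketch. It assumes one observed test result per time step and names those nodes `Test1`, `Test2`, and so on for illustration; the slides only name the `Pour` nodes, so the test node names and the one-test-per-step structure are assumptions.

```python
from __future__ import annotations


def no_forgetting_parents(k: int) -> list[str]:
    """Parents of decision Pour_k when nothing is forgotten.

    Assumes (not stated on the slides) one observed test result per step,
    named Test1, Test2, ... purely for illustration.
    """
    tests = [f"Test{i}" for i in range(1, k + 1)]   # Test1 .. Test_k are observed
    earlier = [f"Pour{i}" for i in range(1, k)]     # Pour1 .. Pour_(k-1) are remembered
    return tests + earlier


parents_pour7 = no_forgetting_parents(7)
print(len(parents_pour7))  # 7 test results + 6 earlier decisions = 13
```

In general the k-th decision receives 2k − 1 incoming edges under this assumption, which is why the graph becomes dense so quickly.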

A comparison …
Without implicit “no forgetting” edges.
Implicit “no forgetting” edges visible.
Slide 9

Consequences of “no forgetting”
The decision strategy found is an optimal one.
The optimal strategy gets very complex:
• The optimal decision for Pour7 depends on the values of 13 other variables.
Optimization becomes very demanding from a computational point of view:
• Even rather simple decision problems cannot be solved in practice.
• Practical experience with influence diagrams has been disappointing.
• Application to delivery policies in slaughter pigs failed.
Slide 10
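To give a feel for the complexity, the sketch below counts how many parent configurations a policy must cover. It assumes every parent variable is binary and that the k-th decision has 2k − 1 parents under no forgetting (consistent with the 13 variables for Pour7); both are assumptions, not figures from the slides.

```python
def policy_table_rows(num_parents: int, states_per_parent: int = 2) -> int:
    """Number of parent configurations a policy must assign an action to."""
    return states_per_parent ** num_parents


# Assumed structure: decision k has 2k - 1 parents under "no forgetting",
# but only 1 parent (the current test result) with limited memory.
for k in range(1, 8):
    full = policy_table_rows(2 * k - 1)
    limited = policy_table_rows(1)
    print(f"Pour{k}: no forgetting = {full} rows, limited memory = {limited} rows")
```

With 13 binary parents, the policy for Pour7 has 2^13 = 8192 rows, compared with 2 rows if the decision only looks at the current test result.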

LImited Memory Influence Diagrams
Working title: “Demented Influence Diagrams”.
Due to the disappointments with influence diagrams in herd management, a research initiative was launched:
• Dennis Nilsson as post doc at Aalborg University (later assistant professor at IHH)
• Michael Höhle as PhD student at IHH
The goal was to come up with better optimization methods for decision graphs by relaxing the “no forgetting” assumption.
Slide 11

LIMIDs – the ideas behind
Only one decision:
• Try the alternatives one by one and select the best.
Extend the idea to larger nets.
[Figure: example net with nodes Choice 1, Opened, Choice 2, Gain, True]
Slide 12

Single Policy Updating in LIMIDs
[Figure: decision graph with decision nodes Pour1 to Pour7]
Determine an optimization ordering (usually just backwards).
Convert all decisions to chance nodes.
Update the policy of each decision node one by one.
Repeat until convergence.
Slide 13

Single policy updating in LIMIDs
Lauritzen & Nilsson (2001)
Usually only near-optimal solutions.
Never more complex than a Bayesian network.
Not as computationally demanding as influence diagrams.
The algorithm may be applied to influence diagrams if all implicit edges are added.
Rather efficient even for influence diagrams.
Implemented in the Esthauge LIMID Software System.
Slide 14
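The four steps on Slide 13 can be sketched on a toy two-day version of the milk test problem. This is a minimal illustration, not the Esthauge implementation: expected utilities are computed by brute-force enumeration instead of junction tree propagation, each decision only sees the current test result, and all probabilities (`P_INF1`, `P_INF2_GIVEN_INF1`, `SENSITIVITY`, `SPECIFICITY`) are hypothetical. Only the payoffs come from the slides.

```python
from itertools import product

# All probabilities are hypothetical; only the payoffs come from the slides.
P_INF1 = 0.10                            # P(Inf1 = 1)
P_INF2_GIVEN_INF1 = {0: 0.05, 1: 0.80}   # P(Inf2 = 1 | Inf1)
SENSITIVITY = 0.90                       # P(Test = 1 | Inf = 1)
SPECIFICITY = 0.95                       # P(Test = 0 | Inf = 0)


def bernoulli(p_one, x):
    """Probability of outcome x (0 or 1) when P(x = 1) = p_one."""
    return p_one if x == 1 else 1.0 - p_one


def chance_probability(inf1, test1, inf2, test2):
    """Joint probability of the chance variables."""
    p_test1 = SENSITIVITY if inf1 else 1.0 - SPECIFICITY
    p_test2 = SENSITIVITY if inf2 else 1.0 - SPECIFICITY
    return (bernoulli(P_INF1, inf1) * bernoulli(p_test1, test1)
            * bernoulli(P_INF2_GIVEN_INF1[inf1], inf2) * bernoulli(p_test2, test2))


def utility(infected, pour):
    """Payoff for one day, using the values from Slide 4."""
    if not pour:
        return 980.0
    return 900.0 if infected else 1000.0


def expected_utility(policies):
    """Decisions act as chance nodes: a deterministic policy is a
    degenerate conditional distribution given the current test result."""
    total = 0.0
    for inf1, test1, inf2, test2 in product((0, 1), repeat=4):
        pour1 = policies["Pour1"][test1]
        pour2 = policies["Pour2"][test2]
        total += chance_probability(inf1, test1, inf2, test2) * (
            utility(inf1, pour1) + utility(inf2, pour2))
    return total


def single_policy_updating(policies, order=("Pour2", "Pour1"), max_rounds=20):
    """Update one policy at a time (backwards order) until nothing improves."""
    best = expected_utility(policies)
    for _ in range(max_rounds):
        improved = False
        for name in order:
            # Try every local policy for this node, all other policies fixed.
            for action_neg, action_pos in product((0, 1), repeat=2):
                trial = dict(policies)
                trial[name] = {0: action_neg, 1: action_pos}
                value = expected_utility(trial)
                if value > best + 1e-9:
                    policies, best, improved = trial, value, True
        if not improved:
            break
    return policies, best


initial = {"Pour1": {0: 1, 1: 1}, "Pour2": {0: 1, 1: 1}}  # start from "always pour"
final_policies, final_value = single_policy_updating(initial)
print(final_policies)
```

With these assumed numbers, both decisions end up pouring after a negative test and discarding after a positive one. In larger nets the result depends on the starting policies and the ordering, which is why the method usually yields only near-optimal solutions.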

Soluble LIMIDs
For some LIMIDs, the Single Policy Updating algorithm will provide us with an exact solution. Such LIMIDs are called soluble.
All influence diagrams are soluble (i.e. if all implicit edges are added):
• Some edges may be irrelevant.
• The software system can automatically remove irrelevant information edges and find the so-called minimal reduction.
• The software system can check whether the minimal reduction is soluble.
• If it is soluble, a unique decision node ordering is automatically identified, and only one iteration is necessary.
Slide 15

Check for solubility
Slide 16

Advantages of decision graphs
State space representation:
• Variable by variable (as opposed to dynamic programming).
• Allow unobservable variables.
• No forgetting – at least as an option (as opposed to dynamic programming).
Slide 17

Disadvantages of decision graphs
No forgetting:
• Complexity – hard to solve (even though heavily improved with LIMIDs).
Only suited for static decision problems:
• Time steps must be explicitly modeled (as opposed to dynamic programming).
Only suited for strictly symmetric decision problems (cf. irregular decision trees)
Slide 18

Properties of methods for decision support
[Figure: diagram for decision graphs with the labels Herd constraints, Optimization, Biological variation, Functional limitations, Uncertainty, Dynamics]
Slide 19
