Decision Problems: Decision Making under Uncertainty, Part III
SLIDE 1

Decision Problems

Decision Making under Uncertainty, Part III Christos Dimitrakakis

Chalmers

1/11/2013

Christos Dimitrakakis (Chalmers) Decision Problems 1/11/2013 1 / 35

slide-2
SLIDE 2

1 Introduction
2 Rewards that depend on the outcome of an experiment
  Formalisation of the problem
3 Bayes risk and Bayes decisions
  Concavity of the Bayes risk
4 Methods for selecting a decision
  Alternative notions of optimality
  Minimax problems
  Two-person games
5 Decision problems with observations
  Robust inference and minimax priors
  Decision problems with two points
  Calculating posteriors
  Cost of observations
SLIDE 3

Rewards that depend on the outcome of an experiment

Decisions d ∈ D.
Experiments with outcomes in Ω.
Reward r ∈ R, depending on the experiment's outcome and the decision.
Utility U : R → R.

Example (Taking the umbrella)

There is some probability of rain. We don't like carrying an umbrella, and we really don't like getting wet.
SLIDE 4

Rewards that depend on the outcome of an experiment Formalisation of the problem

Assumption (Outcomes)

There exists a probability measure P on (Ω, FΩ) such that the probability of the random outcome ω being in A ⊂ Ω is

P(ω ∈ A) = P(A), ∀A ∈ FΩ. (2.1)

Assumption (Utilities)

Preferences over rewards in R are transitive, all rewards are comparable, and there exists a utility function U, measurable with respect to FR, such that U(r′) ≥ U(r) iff r′ ≿∗ r.

Definition (Reward function)

r = ρ(ω, d). (2.2)
SLIDE 5

Rewards that depend on the outcome of an experiment Formalisation of the problem

The probability measure induced by decisions

For every d ∈ D, the function ρ : Ω × D → R induces a probability measure Pd on R. In fact, for any B ∈ FR:

Pd(B) := P(ρ(ω, d) ∈ B) = P({ω | ρ(ω, d) ∈ B}). (2.3)

Assumption

The sets {ω | ρ(ω, d) ∈ B} must belong to FΩ. In other words, ρ(·, d) must be FΩ-measurable for every d.
SLIDE 6

Rewards that depend on the outcome of an experiment Formalisation of the problem

Figure: (a) The combined decision problem. (b) The separated decision problem.

Expected utility

EPi(U) = ∫_R U(r) dPi(r) = ∫_Ω U[ρ(ω, i)] dP(ω) = EP(U | d = i). (2.4)
SLIDE 7

Rewards that depend on the outcome of an experiment Formalisation of the problem

Example

You are going to work, and it might rain. The forecast said that the probability of rain (ω1) was 20%. What do you do? d1: take the umbrella. d2: risk it!

ρ(ω, d)     d1                        d2
ω1          dry, carrying umbrella    wet
ω2          dry, carrying umbrella    dry

U[ρ(ω, d)]  d1     d2
ω1                 −10
ω2                 1
EP(U | d)          −1.2

Table: Rewards, utilities and expected utility for a 20% probability of rain.
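The expected-utility computation in the table can be checked directly. In the sketch below, the utility 0 for d1 (dry, carrying the umbrella) is an assumed value for illustration; the slide only fixes the d2 entries.

```python
# Expected utility for the umbrella example, with P(rain) = 0.2.
# The value 0 for "dry, carrying umbrella" under d1 is an assumption;
# the wet = -10 and dry = 1 values for d2 come from the table.
p_rain = 0.2
U = {("rain", "d1"): 0.0, ("dry", "d1"): 0.0,    # assumed
     ("rain", "d2"): -10.0, ("dry", "d2"): 1.0}  # from the slide

def expected_utility(d):
    return p_rain * U[("rain", d)] + (1 - p_rain) * U[("dry", d)]

eu_d2 = expected_utility("d2")   # 0.2 * (-10) + 0.8 * 1 = -1.2
```

Under this assumption, taking the umbrella (d1, expected utility 0) beats risking it (d2, expected utility −1.2).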

SLIDE 12

Rewards that depend on the outcome of an experiment Formalisation of the problem

Application to statistical estimation

The unknown outcome of the experiment ω is called a parameter. The set of outcomes Ω is called the parameter space.

Definition (Loss)

ℓ(ω, d) = −U[ρ(ω, d)]. (2.5)

Definition (Risk)

σ(P, d) = ∫_Ω ℓ(ω, d) dP(ω). (2.6)

The optimal decision is then the d ∈ D minimising σ(P, d).
SLIDE 13

Bayes risk and Bayes decisions

Bayes risk

Consider parameter space Ω, decision space D, loss function ℓ.

Definition (Bayes risk)

σ∗(P) = inf_{d ∈ D} σ(P, d). (3.1)

Remark

For any function f : X → Y, where Y is equipped with a complete binary relation <, and for any A ⊂ X, M = inf_{x ∈ A} f(x) is the value such that M ≤ f(x) for all x ∈ A and, for any M′ > M, there exists some x′ ∈ A such that M′ > f(x′).
SLIDE 14

Bayes risk and Bayes decisions

Example

Let Ω = {0, 1} and D = [0, 1]. For α ≥ 1, we define the loss ℓ : Ω × D → R as

ℓ(ω, d) = |ω − d|^α. (3.2)

Assume that the distribution of outcomes is

P(ω = 0) = u,  P(ω = 1) = 1 − u. (3.3)

For α = 1 we have

σ(P, d) = ℓ(0, d)u + ℓ(1, d)(1 − u) = du + (1 − d)(1 − u). (3.4)

Hence, if u > 1/2 the risk is minimised for d∗ = 0, otherwise for d∗ = 1.
SLIDE 15

Bayes risk and Bayes decisions

Figure: Risk σ(P, d) as a function of d for four distributions (u = 0.1, 0.25, 0.5, 0.75), with absolute loss (α = 1).
SLIDE 16

Bayes risk and Bayes decisions

Example

Let Ω = {0, 1} and D = [0, 1]. For α ≥ 1, the loss is ℓ(ω, d) = |ω − d|^α (3.2), and the distribution of outcomes is P(ω = 0) = u, P(ω = 1) = 1 − u (3.3). For α > 1,

σ(P, d) = d^α u + (1 − d)^α (1 − u), (3.4)

and by differentiating we find that the optimal decision is

d∗ = [1 + (u/(1 − u))^{1/(α−1)}]^{−1}.
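The closed-form minimiser can be checked numerically, e.g. for α = 2 and u = 0.25 (a sketch; the grid resolution is arbitrary):

```python
import numpy as np

def risk(d, u, alpha):
    """sigma(P, d) = d^alpha * u + (1 - d)^alpha * (1 - u), as in (3.4)."""
    return d**alpha * u + (1 - d)**alpha * (1 - u)

def d_star(u, alpha):
    """Closed-form minimiser for alpha > 1, from setting the derivative to zero."""
    return 1.0 / (1.0 + (u / (1.0 - u)) ** (1.0 / (alpha - 1.0)))

# Compare against a grid search for alpha = 2, u = 0.25: the optimum is d = 0.75.
u, alpha = 0.25, 2.0
grid = np.linspace(0.0, 1.0, 100001)
d_numeric = grid[np.argmin(risk(grid, u, alpha))]
```

For quadratic loss the optimal decision is simply the probability of outcome 1, here 1 − u = 0.75.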
SLIDE 17

Bayes risk and Bayes decisions

Figure: Risk σ(P, d) as a function of d for four distributions (u = 0.1, 0.25, 0.5, 0.75), with quadratic loss (α = 2).
SLIDE 18

Bayes risk and Bayes decisions

Example (Quadratic loss)

Now consider Ω = R with measure P and D = R. For any point ω ∈ R, the loss is

ℓ(ω, d) = |ω − d|^2. (3.4)

The optimal decision minimises E(ℓ | d) = ∫_R |ω − d|^2 dP(ω). Then, as long as ∂/∂d |ω − d|^2 is measurable with respect to FR, so that differentiation and integration can be exchanged,

∂/∂d ∫_R |ω − d|^2 dP(ω) = ∫_R ∂/∂d |ω − d|^2 dP(ω) (3.5)
= −2 ∫_R (ω − d) dP(ω) (3.6)
= 2 ∫_R d dP(ω) − 2 ∫_R ω dP(ω) (3.7)
= 2d − 2 E(ω), (3.8)

so, setting this to zero, the loss is minimised for d = E(ω).
SLIDE 21

Bayes risk and Bayes decisions Concavity of the Bayes risk

A mixture of distributions

Consider two probability measures P, Q on (Ω, FΩ). These define two alternative distributions for ω. For any P, Q and α ∈ [0, 1], we define Zα = αP + (1 − α)Q to mean the probability measure such that Zα(A) = αP(A) + (1 − α)Q(A) for any A ∈ FΩ.
SLIDE 22

Bayes risk and Bayes decisions Concavity of the Bayes risk

Concavity of the Bayes risk

Theorem

For probability measures P, Q on (Ω, FΩ) and any α ∈ [0, 1],

σ∗[αP + (1 − α)Q] ≥ ασ∗(P) + (1 − α)σ∗(Q). (3.9)

Proof.

From the definition of risk (2.6), for any decision d ∈ D,

σ[αP + (1 − α)Q, d] = ασ(P, d) + (1 − α)σ(Q, d).

Hence, by the definition (3.1) of the Bayes risk,

σ∗[αP + (1 − α)Q] = inf_{d ∈ D} σ[αP + (1 − α)Q, d] = inf_{d ∈ D} [ασ(P, d) + (1 − α)σ(Q, d)].

Since inf_x [f(x) + g(x)] ≥ inf_x f(x) + inf_x g(x),

σ∗[αP + (1 − α)Q] ≥ α inf_{d ∈ D} σ(P, d) + (1 − α) inf_{d ∈ D} σ(Q, d) = ασ∗(P) + (1 − α)σ∗(Q).
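For the earlier two-point example with absolute loss, σ∗(P) = min(u, 1 − u), so the theorem can be verified directly on mixtures (a sketch; the particular u values are arbitrary):

```python
import numpy as np

def bayes_risk(u):
    # sigma(P, d) = d*u + (1 - d)*(1 - u) is linear in d on [0, 1],
    # so the infimum is attained at d = 0 or d = 1: sigma*(P) = min(u, 1 - u).
    return min(u, 1.0 - u)

# Mixing measures mixes the parameter: (aP + (1 - a)Q)(omega = 0) = a*uP + (1 - a)*uQ.
uP, uQ = 0.1, 0.7
for a in np.linspace(0.0, 1.0, 11):
    lhs = bayes_risk(a * uP + (1 - a) * uQ)             # sigma*[aP + (1 - a)Q]
    rhs = a * bayes_risk(uP) + (1 - a) * bayes_risk(uQ)
    assert lhs >= rhs - 1e-12                           # concavity (3.9)
```

At a = 0.5, for instance, the mixture has u = 0.4 and Bayes risk 0.4, while the convex combination of the endpoint Bayes risks is only 0.2.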
SLIDE 25

Bayes risk and Bayes decisions Concavity of the Bayes risk

The risk function for quadratic loss

Figure: Fixed distribution, varying decision. The decision risk under three different distributions (P1, P2, P3).
SLIDE 26

Bayes risk and Bayes decisions Concavity of the Bayes risk

Concavity of the Bayes risk

Figure: Fixed decision (d1), varying distribution. The risk of a fixed decision is a linear function of P.
SLIDE 27

Bayes risk and Bayes decisions Concavity of the Bayes risk

Concavity of the Bayes risk

Figure: The risk of a few decisions (d1, d2, d3) as P varies. Each decision corresponds to one line.
SLIDE 28

Bayes risk and Bayes decisions Concavity of the Bayes risk

Concavity of the Bayes risk

Figure: The risk of several decisions (d1, . . . , d5) as P varies. For each P, there is at least one decision minimising the risk.
SLIDE 29

Bayes risk and Bayes decisions Concavity of the Bayes risk

Concavity of the Bayes risk

Figure: The Bayes risk σ∗(P) is concave, and the minimising decision is tangent to it.
SLIDE 30

Bayes risk and Bayes decisions Concavity of the Bayes risk

Concavity of the Bayes risk

Figure: If we are not very wrong about P, then we are not far from optimal.
SLIDE 31

Bayes risk and Bayes decisions Concavity of the Bayes risk

Concavity of the Bayes risk

Figure: We can approximate the Bayes risk by taking the minimum over a finite number of decisions.
SLIDE 32

Methods for selecting a decision

Mixed decisions

A distribution over decisions

Consider a probability measure π on D. We select decisions according to the probability π(A) := P(d ∈ A), for any appropriate A ⊂ D.

Theorem

Consider any statistical decision problem with probability measure P on outcomes Ω and with utility function U : Ω × D → R. Further let d∗ ∈ D such that E(U | d∗) ≥ E(U | d) for all d ∈ D. Then for any probability measure π on D, E(U | d∗) ≥ E(U | π).

SLIDE 33

Methods for selecting a decision

Mixed decisions


Proof.

E(U | π) = ∫_D E(U | d) dπ(d) ≤ ∫_D E(U | d∗) dπ(d) = E(U | d∗) ∫_D dπ(d) = E(U | d∗).
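The theorem can be illustrated numerically: the expected utility of any mixture is an average of pure-decision expected utilities, so it cannot exceed the best of them (a sketch with an assumed random utility table):

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical finite problem: 4 outcomes, 3 decisions, a random utility
# table U[omega, d] and a fixed outcome distribution P (all values assumed).
U = rng.normal(size=(4, 3))
P = np.array([0.1, 0.2, 0.3, 0.4])

expected_U = P @ U            # E(U | d) for each pure decision d
best_pure = expected_U.max()  # E(U | d*)

# E(U | pi) is an average of the E(U | d), so no mixture can exceed best_pure.
for _ in range(1000):
    pi = rng.dirichlet(np.ones(3))  # a random mixed decision
    assert expected_U @ pi <= best_pure + 1e-12
```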
SLIDE 34

Methods for selecting a decision Alternative notions of optimality

Alternative decision rules

Maximin rule

Select the d maximising min_{ω ∈ Ω} U(ω, d).

ǫ-optimal rule

For some ǫ > 0, select the d maximising

P({ω | U(ω, d) > inf_{d′ ∈ D} U(ω, d′) + ǫ}). (4.1)
SLIDE 35

Methods for selecting a decision Alternative notions of optimality

Minimax/Maximin values

U_∗ = max_d min_ω U(ω, d) = min_ω U(ω, d∗)  (maximin)
U^∗ = min_ω max_d U(ω, d) = max_d U(ω∗, d)  (minimax)

Note that by definition

U^∗ ≥ U(ω∗, d∗) ≥ U_∗. (4.2)
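A minimal sketch of these values for a small assumed utility matrix; the inequality (4.2) holds by construction:

```python
import numpy as np

# Hypothetical utility matrix U[omega, d] for 3 outcomes and 2 decisions.
U = np.array([[ 1.0, -1.0],
              [ 0.5,  2.0],
              [-2.0,  0.0]])

U_lower = U.min(axis=0).max()   # maximin: max_d min_omega U(omega, d)
U_upper = U.max(axis=1).min()   # minimax: min_omega max_d U(omega, d)
assert U_upper >= U_lower       # (4.2): minimax value dominates maximin value
```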
SLIDE 36

Methods for selecting a decision Alternative notions of optimality

Regret

Consider a problem with two possible outcomes ω1, ω2, two possible decisions d1, d2, a utility function U(ω, d) and a prior distribution P(ωi) = 1/2.

U(ω, d)          d1     d2
ω1               −1      0
ω2               10      1
E(U | P, d)      4.5    0.5
min_ω U(ω, d)    −1      0

Table: Utility function, expected utility and maximin utility.

Definition (Regret)

L(ω, π) := max_{π′} U(ω, π′) − U(ω, π). (4.3)

L(ω, d)          d1     d2
ω1                1      0
ω2                0      9
E(L | P, d)      0.5    4.5
max_ω L(ω, d)     1      9

Table: Regret, in expectation and minimax.
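The regret table can be reproduced from the utility table in the example above with a few lines:

```python
import numpy as np

# Utility table from the example: rows are omega1, omega2; columns are d1, d2.
U = np.array([[-1.0, 0.0],
              [10.0, 1.0]])
P = np.array([0.5, 0.5])

# Regret: L(omega, d) = max_d' U(omega, d') - U(omega, d).
L = U.max(axis=1, keepdims=True) - U

expected_U = P @ U      # expected utility of each decision: [4.5, 0.5]
expected_L = P @ L      # expected regret of each decision:  [0.5, 4.5]
max_L = L.max(axis=0)   # worst-case regret of each decision: [1, 9]
```

Note how the criteria disagree: expected utility, expected regret and minimax regret all favour d1, while maximin utility favours d2.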
SLIDE 39

Methods for selecting a decision Minimax problems

Minimax utility, regret and loss

Remark

For each ω, there is some d such that U(ω, d) = max_π U(ω, π). (4.3)

Remark

L(ω, π) = Σ_d π(d) L(ω, d) ≥ 0, (4.4)

with equality iff π is ω-optimal.

Remark

L(ω, π) = max_d U(ω, d) − U(ω, π). (4.5)

Remark

L(ω, π) = −U(ω, π) iff max_d U(ω, d) = 0.
SLIDE 40

Methods for selecting a decision Minimax problems

Example (An even-money bet)

U     ω1    ω2
d1     1    −1
d2

Table: Even-bet utility
SLIDE 41

Methods for selecting a decision Two-person games

For two distributions π, ξ on D and Ω respectively, define our expected utility to be

U(ξ, π) := Σ_{ω ∈ Ω} Σ_{d ∈ D} U(ω, d) ξ(ω) π(d). (4.6)

We then define the maximin policy π∗ to be such that

min_ξ U(ξ, π∗) = U_∗ := max_π min_ξ U(ξ, π), (4.7)

and the minimax prior ξ∗ to be such that

max_π U(ξ∗, π) = U^∗ := min_ξ max_π U(ξ, π). (4.8)

Expected regret

L(ξ, π) = max_{π′} Σ_ω ξ(ω) [U(ω, π′) − U(ω, π)] = max_{π′} U(ξ, π′) − U(ξ, π). (4.9)
SLIDE 42

Methods for selecting a decision Two-person games

Theorem

If there exist ξ∗ ∈ Ξ, π∗ ∈ Π and C ∈ R such that U(ξ∗, π) ≤ C ≤ U(ξ, π∗) for all ξ ∈ Ξ, π ∈ Π, then U^∗ = U_∗ = U(ξ∗, π∗) = C.

Definition

A bilinear game is a tuple (U, Ξ, Π, Ω, D) with U : Ξ × Π → R, where Ξ contains arbitrary distributions on Ω, Π contains arbitrary distributions on D, and

U(ξ, π) := E(U | ξ, π) = Σ_{ω,d} U(ω, d) π(d) ξ(ω).

Theorem

For a bilinear game, U^∗ = U_∗. In addition, the following three conditions are equivalent:

1 π∗ is maximin, ξ∗ is minimax and U^∗ = C.
2 U(ξ, π∗) ≥ C ≥ U(ξ∗, π) for all ξ, π.
3 U(ω, π∗) ≥ C ≥ U(ξ∗, d) for all ω, d.
SLIDE 43

Methods for selecting a decision Two-person games

Linear programming formulation

The problem

max_π min_ξ U(ξ, π),

where ξ, π are distributions over finite domains, can be converted into finding the π with the greatest lower bound v_π. In matrix notation:

max { v_π | (Uπ)_j ≥ v_π ∀j, Σ_i π_i = 1, π_i ≥ 0 ∀i }.

Equivalently, we can find the ξ with the least upper bound v_ξ:

min { v_ξ | (ξ⊤U)_i ≤ v_ξ ∀i, Σ_j ξ_j = 1, ξ_j ≥ 0 ∀j }.

In fact, one can show that v_ξ = v_π, thus obtaining Theorem 2.
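A sketch of this LP using scipy.optimize.linprog; the matching-pennies matrix is an assumed example, and the variable vector is (π, v_π):

```python
import numpy as np
from scipy.optimize import linprog

def maximin_policy(U):
    """Solve max_pi min_xi U(xi, pi) for a finite matrix game.

    U[j, i] is the utility when nature picks outcome j and we pick decision i.
    Variables x = (pi_1, ..., pi_n, v): maximise v subject to
    (U @ pi)_j >= v for all j,  sum_i pi_i = 1,  pi >= 0.
    """
    n_omega, n_d = U.shape
    c = np.zeros(n_d + 1)
    c[-1] = -1.0                      # linprog minimises, so minimise -v
    # v - (U @ pi)_j <= 0 for every outcome row j
    A_ub = np.hstack([-U, np.ones((n_omega, 1))])
    b_ub = np.zeros(n_omega)
    A_eq = np.hstack([np.ones((1, n_d)), np.zeros((1, 1))])
    b_eq = np.array([1.0])
    bounds = [(0, None)] * n_d + [(None, None)]  # pi >= 0, v free
    res = linprog(c, A_ub=A_ub, b_ub=b_ub, A_eq=A_eq, b_eq=b_eq, bounds=bounds)
    return res.x[:-1], res.x[-1]

# Matching pennies: the maximin policy mixes uniformly and the value is 0.
U = np.array([[1.0, -1.0], [-1.0, 1.0]])
pi, v = maximin_policy(U)
```

Solving the dual (the ξ problem) yields the same value, which is the LP-duality route to the minimax theorem mentioned above.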

SLIDE 50

Decision problems with observations

Obtaining information

1 We must choose a decision from D.
2 There is an unknown parameter ω ∈ Ω with measure ξ.
3 Our loss is ℓ : Ω × D → R.
4 Now consider a family of probability measures on the observation set S: {ψω | ω ∈ Ω}.
5 Let x ∈ S be a random variable with distribution ψω.
6 We want to choose d ∈ D, taking into account both ξ and the evidence x.
7 We want to find a decision function δ : S → D that minimises the risk

σ(ξ, δ) = E{ℓ[ω, δ(x)]} = ∫_Ω ∫_S ℓ[ω, δ(x)] dψω(x) dξ(ω).
SLIDE 51

Decision problems with observations

Minimising the risk

Expected loss of a fixed decision d with ω ∼ ξ:

σ(ξ, d) = ∫_Ω ℓ(ω, d) dξ(ω). (5.1)

Expected loss of a decision function δ with fixed ω ∈ Ω:

σ(ω, δ) = ∫_S ℓ(ω, δ(x)) dψω(x). (5.2)

Expected loss of a decision function δ with ω ∼ ξ:

σ(ξ, δ) = ∫_Ω σ(ω, δ) dξ(ω),  σ∗(ξ) := inf_δ σ(ξ, δ) = σ(ξ, δ∗). (5.3)
SLIDE 52

Decision problems with observations

Bayes decision functions

Extensive form

σ(ξ, δ) = ∫_Ω ∫_S ℓ[ω, δ(x)] dψω(x) dξ(ω) (5.4)
        = ∫_S ∫_Ω ℓ[ω, δ(x)] dξ(ω | x) df(x), (5.5)

where f(x) := ∫_Ω ψω(x) dξ(ω). The Bayes decision function is

δ∗(x) := arg min_{d ∈ D} Eξ(ℓ | x, d) = arg min_{d ∈ D} ∫_Ω ℓ(ω, d) dξ(ω | x),

so that

∫_S ∫_Ω ℓ[ω, δ∗(x)] dξ(ω | x) df(x) = ∫_S ( min_d ∫_Ω ℓ(ω, d) dξ(ω | x) ) df(x).

Definition (Prior distribution)

The distribution ξ is called the prior distribution of ω.

Definition (Marginal distribution)

The distribution f is called the (prior) marginal distribution of x.

Definition (Posterior distribution)

The conditional distribution ξ(· | x) is called the posterior distribution of ω.
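A discrete sketch of the extensive-form computation; all numbers (prior, likelihoods, losses) are assumed for illustration:

```python
import numpy as np

# Hypothetical discrete problem: 2 parameters, 3 observations, 2 decisions.
xi = np.array([0.3, 0.7])                  # prior xi(omega)
psi = np.array([[0.6, 0.3, 0.1],           # psi_omega(x) for omega = 0
                [0.1, 0.3, 0.6]])          # psi_omega(x) for omega = 1
loss = np.array([[0.0, 10.0],              # l(omega, d)
                 [5.0,  0.0]])

x = 2                                      # the observed value
f_x = psi[:, x] @ xi                       # marginal f(x)
posterior = psi[:, x] * xi / f_x           # xi(omega | x), by Bayes' rule
expected_loss = posterior @ loss           # E_xi(l | x, d) for each d
d_star = int(np.argmin(expected_loss))     # extensive-form Bayes decision
```

Here x = 2 is much more likely under ω = 1, so the posterior concentrates there and the Bayes decision is d2.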
SLIDE 55

Decision problems with observations Robust inference and minimax priors

Minimax worlds with observations

Consider a utility function U : Ω × D → R. There are two players: the statistician, selecting d ∈ D, and nature, selecting ω ∈ Ω. The statistician's maximin decision without observations is

max_{d ∈ D} min_{ω ∈ Ω} E(U | ω, d) = max_{d ∈ D} min_{ω ∈ Ω} U(ω, d).

Now consider an observation x ∈ S, with x ∼ ψω. The statistician now selects a decision function δ ∈ ∆. For any δ, the worst-case expected utility is

min_{ω ∈ Ω} E(U | ω, δ) = min_{ω ∈ Ω} ∫_S U[ω, δ(x)] dψω(x) (5.6)
= min_{ω ∈ Ω} Σ_{d ∈ D} U(ω, d) ψω({x ∈ S | δ(x) = d}). (5.7)
SLIDE 56

Decision problems with observations Robust inference and minimax priors

Minimax priors with observations

The maximin problem

max_{δ ∈ ∆} min_{ξ ∈ Ξ} E(U | ξ, δ) = max_{δ ∈ ∆} min_{ξ ∈ Ξ} ∫_S ∫_Ω U[ω, δ(x)] dξ(ω | x) dp_ξ(x). (5.8)

The minimax problem

min_{ξ ∈ Ξ} max_{δ ∈ ∆} E(U | ξ, δ) = min_{ξ ∈ Ξ} ∫_S max_{d ∈ D} ∫_Ω U[ω, d] dξ(ω | x) dp_ξ(x). (5.9)

Lemma

If Ξ contains all priors, then

inf_{ξ ∈ Ξ} U(ξ, δ) = inf_{ω ∈ Ω} U(ω, δ). (5.10)
SLIDE 57

Decision problems with observations Decision problems with two points

Decision problems with two points and hypothesis testing

ℓ(ω, d)   d1    d2
ω1         0    c1
ω2        c2     0

Table: Cost function of a simple hypothesis testing problem.

We observe the value of some random variable X and then choose decision δ(X). Let α(δ) be the conditional probability that we choose d2 when ω = ω1, and let β(δ) be the conditional probability that we choose d1 when ω = ω2. Let a := c1 P(ω = ω1) and b := c2 P(ω = ω2). The risk of δ is

aα(δ) + bβ(δ). (5.11)
SLIDE 58

Decision problems with observations Decision problems with two points


Theorem (Neyman-Pearson lemma)

Let ψω1, ψω2 be densities or probabilities on S. For any a > 0, b > 0, let δ∗ be a decision function such that

δ∗(x) = d1, if aψω1(x) > bψω2(x), (5.12)
δ∗(x) = d2, if aψω1(x) < bψω2(x), (5.13)

and either d1 or d2 otherwise. Then, for any other δ:

aα(δ∗) + bβ(δ∗) ≤ aα(δ) + bβ(δ).
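A brute-force check of the lemma on a small assumed discrete problem: the likelihood-ratio rule attains the minimal risk among all deterministic decision functions:

```python
import numpy as np
from itertools import product

# Hypothetical observation space with 3 values and two simple hypotheses.
psi1 = np.array([0.7, 0.2, 0.1])   # psi_omega1(x)
psi2 = np.array([0.1, 0.3, 0.6])   # psi_omega2(x)
a, b = 1.0, 1.0

def risk(delta):
    """a*alpha(delta) + b*beta(delta) for delta given as a tuple of 1s and 2s."""
    alpha = sum(psi1[x] for x in range(3) if delta[x] == 2)  # P(choose d2 | omega1)
    beta = sum(psi2[x] for x in range(3) if delta[x] == 1)   # P(choose d1 | omega2)
    return a * alpha + b * beta

# Neyman-Pearson decision: d1 where a*psi1 > b*psi2, d2 where it is smaller.
np_delta = tuple(1 if a * psi1[x] > b * psi2[x] else 2 for x in range(3))

# The NP rule achieves the minimum risk over all deterministic decision functions.
best = min(risk(d) for d in product((1, 2), repeat=3))
assert abs(risk(np_delta) - best) < 1e-12
```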
SLIDE 59

Decision problems with observations Calculating posteriors

Posterior distributions for multiple observations

Assume that we observe a value x^n = (x1, . . . , xn) of a random variable X^n = (X1, . . . , Xn), and that we have a prior ξ on Ω. For the observations, we write:

Observation probability given history x^{n−1} and parameter ω

ψ(xn | x^{n−1}, ω) = ψω(x^n) / ψω(x^{n−1}).

Posterior recursion

ξ(ω | x^n) = ψω(x^n) ξ(ω) / f(x^n) = ψ(xn | x^{n−1}, ω) ξ(ω | x^{n−1}) / f(xn | x^{n−1}). (5.14)
SLIDE 60

Decision problems with observations Calculating posteriors

Posterior distributions for multiple independent observations

If ψ(xn | ω, x^{n−1}) = ψω(xn), then ψω(x^n) = ∏_{k=1}^{n} ψω(xk). Then:

Posterior recursion with conditional independence

ξn(ω) := ξ0(ω | x^n) = ψω(x^n) ξ0(ω) / f0(x^n) (5.15)
       = ψω(xn) ξn−1(ω) / fn−1(xn), (5.16)

where we define ξt to be the belief at time t. Conditional independence allows us to write the posterior update as an identical recursion at each time step.
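A sketch of the recursion for a two-point parameter space with assumed Bernoulli likelihoods: updating one observation at a time matches the batch posterior computed from the whole sample.

```python
import numpy as np

# Two hypothetical coin biases as the parameter space, with a uniform prior.
thetas = np.array([0.4, 0.8])          # psi_omega(x = 1) = theta
xi = np.array([0.5, 0.5])              # xi_0(omega)
xs = [1, 1, 0, 1]                      # observed flips x_1, ..., x_n

# Recursive update: one Bayes step per observation.
post = xi.copy()
for x in xs:
    lik = np.where(x == 1, thetas, 1 - thetas)
    post = lik * post / (lik * post).sum()

# Batch update: multiply the whole likelihood psi_omega(x^n) at once.
lik_n = np.prod([np.where(x == 1, thetas, 1 - thetas) for x in xs], axis=0)
batch = lik_n * xi / (lik_n * xi).sum()
```

With three heads in four flips, the posterior favours the 0.8-bias coin, and both routes give the same answer.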
SLIDE 61

Decision problems with observations Cost of observations

Observation cost

Expected cost of observation

Let c : Ω × S → R be an observation cost function. Then the expected cost is

Eξ[c(ω, x)] = ∫_Ω ∫_S c(ω, x) dψω(x) dξ(ω). (5.17)

The total risk of observing x and using a decision function δ

is then given by σ(ξ, δ) + Eξ[c(ω, x)].
SLIDE 62

Decision problems with observations Cost of observations

Fixed cost per observation

Consider that we can choose the size n of a sample x1, . . . , xn, where a sample of size n costs γn. Let δn be the (random) Bayes decision function after observing x1, . . . , xn:

δn := arg min_{d ∈ D} σ[ξ(· | x1, . . . , xn), d]. (5.18)

Thus, the Bayes risk of n observations is

σt(ξ, δn) = σ(ξ, δn) + γn. (5.19)

Now we have another decision problem: how many observations should we take?

Exercise

Prove that if the risk is bounded, then there exists an optimal number n of observations.
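A toy sketch of this trade-off, under an assumed 1/(n + 1) decay of the Bayes risk: a bounded risk plus a linearly growing cost always has a finite minimiser.

```python
import numpy as np

# Toy model (assumed): the Bayes risk after n observations decays like 1/(n + 1),
# while observation costs grow linearly as gamma * n.
gamma = 0.01
n = np.arange(0, 200)
total = 1.0 / (n + 1) + gamma * n   # sigma(xi, delta_n) + gamma * n

n_star = int(n[np.argmin(total)])   # optimal number of observations
```

For this model the continuous optimum satisfies (n + 1)^2 = 1/γ = 100, so the best integer choice is n = 9.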
SLIDE 63

Decision problems with observations Cost of observations

Quick summary

We want to make a decision against an unknown parameter ω. The risk is the negative expected utility; the Bayes risk is the minimum risk, and it is concave with respect to the distribution of ω.

Our decisions can depend on observations, via a decision function. We can construct a complete decision function by computing σ(ξ, δ) for all decision functions (normal form), or instead wait until we observe x and compute σ[ξ(· | x), d] for all decisions (extensive form).

In minimax settings, we can consider a fixed but unknown parameter ω, or a fixed but unknown prior ξ. This links decision theory to game theory.

When each observation has cost γ, there is an optimal number n of observations minimising σ[ξ(· | X^n), δn] + γn, where δn is the Bayes decision function after n observations. The posterior given multiple observations can be computed recursively using independence.

Our decision at a certain time affects the future information available. Problems where future decisions must be considered require planning ahead and are called sequential decision problems.