Dynamic Decision Making: Implications for Recommender System Design
Cleotilde (Coty) Gonzalez
Dynamic Decision Making Laboratory (www.cmu.edu/ddmlab) Department of Social and Decision Sciences Carnegie Mellon University
Decisions from experience (choices & outcomes experienced in the past) vs. decisions from provided (described) information
Premise: Dynamic decision making research may help to build recommender systems that learn and adapt recommendations dynamically to a particular user's experience, to maximize benefits.
Outline:
– From traditional choice to dynamic decision making
– Choice studies in dynamic situations
– Some initial findings on the dynamics of choice and trust in recommendations
– A model (algorithm) with demonstrated accuracy in predicting human choice
Assumptions of traditional choice theory:
1) Full information: options may be described by explicit attributes
2) Unlimited time and resources: no constraints on the decision-making process
3) Stability: the mapping between choice attributes and utility remains constant over time (and across individuals, and within a single individual)
Which of the following would you prefer?
A: Get $4 with probability .8, $0 otherwise
B: Get $3 for sure
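A quick sketch of the expected-value arithmetic for this problem: option A maximizes expected value, yet in description-based studies most people choose the sure $3.

```python
# Expected values for the problem above:
# A: $4 with probability .8 ($0 otherwise); B: $3 for sure.
p_win, win, lose = 0.8, 4.0, 0.0
ev_a = p_win * win + (1 - p_win) * lose   # 0.8 * 4 = 3.2
ev_b = 3.0
print(f"EV(A) = {ev_a}, EV(B) = {ev_b}")  # A maximizes expected value
```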
Static vs. dynamic decision environments:
– Static: no changes in the environment (although the environment is probabilistic, probabilities and values don't change over the course of decisions). Dynamic: the environment changes (independently and as a consequence of the actions of the decision maker).
– Static: immediate feedback (action and outcome closest in time). Dynamic: delayed feedback and the credit-assignment problem (multiple actions and multiple outcomes separated in time).
– Static: value is time-independent (time does not penalize the decision maker; no penalty for waiting). Dynamic: value is time-dependent (value decreases the farther away the decision is from the optimal time).
Gonzalez, Vanyukov & Martin, 2005
Military command and control, supply-chain management, real-time resource allocation, fire fighting, medical diagnosis, conflict resolution, dynamic visual detection, climate change
– Time constraints (Gonzalez, 2004): slow-paced training helps adaptation to high time constraints
– High workload (Gonzalez, 2005): low workload during training helps adaptation to high workload
– High diversity of experiences helps detection of novel items (Gonzalez & Quesada, 2003; Gonzalez & Thomas, 2008; Gonzalez & Madhavan, 2011; Brunstein & Gonzalez, 2011)
– Cognitive abilities relate to dynamic decision-making performance (Gonzalez, Thomas & Vanyukov, 2005)
(Barron & Erev, 2003)
Repeated choice paradigm (Barron & Erev, 2003): make a choice on every trial and observe its outcome (e.g., 4, 4, 3, …) for a fixed number of trials.
Sampling paradigm (Hertwig et al., 2004): sample the options freely and observe the outcomes (e.g., 4, 4, 3, 4, …), then make one final, consequential choice.
Barron & Erev (2003); Hertwig, Barron, Weber & Erev (2004)
Pmax (A choices): description = 36%; experience = 88%; gap = 52 percentage points
Description: according to Prospect Theory, people overweight the probability of rare events.
Experience: people choose as if they underweight the probability of rare events.
Proposed explanations for the description–experience gap:
– Reliance on small samples
– Reliance on recent outcomes (recency)
– Exploration and exploitation as two distinct processes: models often assume that sampling is random, but there is an exploration–exploitation tradeoff, with increasing selection of the best known option over time
Common findings across the sampling and consequential (repeated) choice paradigms:
– Similar description–experience (DE) gap
– Gradual decrease of exploration over time
– Maximization in choice
– Prediction of choice from memory: selection of the option with the highest experienced expected outcome during past experience

Common processes in both paradigms:
– People explore options aiming to get the best possible outcome
– People rely on their (faulty) memories (frequency, recency, and noise)

Instance-Based Learning Theory (IBLT; Gonzalez, Lerch, & Lebiere, 2003):
– Explains the learning process and predicts choice better than models that were designed for one paradigm alone (e.g., the winners of the Technion Prediction Tournament, TPT)
Data sets (three paradigms: description, sampling, repeated choice):
– 6 problems: Hertwig et al. (2004), N = 50 (description); Hertwig et al. (2004), N = 50 (sampling); Barron & Erev (2003), N = 144 (repeated choice)
– Technion Prediction Tournament (TPT; Erev et al., 2010): for each paradigm, a 60-problem Estimation set (N = 100) and a 60-problem Competition set (N = 100)
Correlations: r = .93, p = .01; r = .83, p = .0001; r = –.53, p = .0004; r = –.37, p = .004. A significant description–experience gap was found for each of the 6 problems.
In the TPT data sets (Estimation and Competition sets):
– Sampling = 0.49 & 0.44; Repeated choice = 0.40 & 0.38
– Sampling = 0.34 & 0.29; Repeated choice = 0.14 & 0.13
– r = .93, p = .01 (Estimation set); r = .89, p = .01 (Competition set)
Gonzalez & Dutt, 2011
[Figure: A-rate (proportion of A choices) as a function of the number of trials in repeated choice (6 problems; TPT, Erev et al., 2010) and as a function of the number of samples in the sampling paradigm.]
Gonzalez & Dutt, 2012
In the first 11 trials the A-rate falls by 44%, and then the curve flattens out at about 19%, remarkably similar to consequential choice. Initial and final A-rates at the individual level: 4/40 (10%) kept their initial and final A-rates constant; 12/40 (30%) increased their A-rate; and 24/40 (60%) fell below the diagonal, i.e., decreased their A-rate.
(Gonzalez & Dutt, 2012)
– Maximization during sampling correlates with maximization at choice (r(38) = 0.36, p < .05).
– 60% of the choices predicted by the maximizing option during sampling are consistent with final choices.
– There is a positive correlation of maximization behavior in the three groups.
– 84% of the choices predicted by the maximizing option during sampling are consistent with the final choices.
Gonzalez & Dutt, 2011; Gonzalez & Dutt, 2012; Mehlhorn et al., 2014
[Figure: rates of alternation and maximization as a function of the number of samples.]
rs = –.48, p < .01
Design factors: accuracy of the recommender (high/low accuracy) and value obtained from choice (high/low outcome).
100 participants. Conditions represent each source's probability of giving accurate (p = 1) or inaccurate (p = .5) recommendations:
– Control condition: .5 .5 .5 .5
– Identify best/worst value: .8 .2 .2 .2 / .2 .8 .8 .8 and .7 .4 .4 .4 / .4 .7 .7 .7
– Identify best value among distinct/similar sources: .2 .4 .6 .8 (distinct) and .4 .5 .6 .7 (similar)
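For illustration only, a hypothetical sketch of how a learner relying purely on experienced outcomes could come to favor the most accurate source in the "distinct sources" condition. It uses a simple epsilon-greedy rule in place of the full IBL machinery; all names and parameter values here are assumptions, not the experiment's implementation.

```python
import random

random.seed(1)

# Hypothetical sketch: an epsilon-greedy learner choosing among four
# recommendation sources whose accuracies follow the "distinct sources"
# condition (.2 .4 .6 .8).
def learn_source(accuracies, n_trials=2000, epsilon=0.1):
    """Return how often each source was consulted over n_trials."""
    counts = [0] * len(accuracies)
    values = [0.0] * len(accuracies)  # running mean payoff per source
    for _ in range(n_trials):
        if random.random() < epsilon:                   # explore
            i = random.randrange(len(accuracies))
        else:                                           # exploit best so far
            i = max(range(len(accuracies)), key=lambda j: values[j])
        outcome = 1.0 if random.random() < accuracies[i] else 0.0
        counts[i] += 1
        values[i] += (outcome - values[i]) / counts[i]  # incremental mean
    return counts

picks = learn_source([0.2, 0.4, 0.6, 0.8])
# With distinct accuracies the .8 source comes to dominate the choices;
# in the control condition (.5 .5 .5 .5) there is nothing to learn.
```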
[Figure: choice proportion for each of the four options (A, B, C, D) over roughly 200 trials in the control condition (.5 .5 .5 .5).]
Precursors:
– Toward an instance theory of automatization (Logan, 1988)
– The use of specific instances to control dynamic systems (Dienes & Fahey, 1995)
– Learning in dynamic decision tasks (Gibson, Fichman & Plaut, 1997)
– Case-Based Decision Theory (Gilboa & Schmeidler, 1995)

Instance-Based Learning Theory (IBLT; Gonzalez, Lerch, & Lebiere, 2003):
– A descriptive account of the cognitive structures and learning processes involved in human decision making in dynamic environments (Gonzalez et al., 2003)
– IBLT characterizes learning in dynamic tasks by storing in memory a sequence of instances, "Situation-Decision-Utility" triplets, produced by experienced events.
The IBLT decision process: recognition, judgment, choice, execution, and feedback. Instances are Situation, Decision, Utility (SDU) triplets. Learning relies on memory mechanisms proposed by ACT-R and yields concrete predictions of human behavior in various task types (Gonzalez, Lerch, & Lebiere, 2003).
1. Each experienced combination is stored as an instance in memory (e.g., S-10; P-8; S-1; P-5; S-5) when the outcome is experienced.
2. Each instance has a memory "activation" value based on frequency, recency, similarity, etc.
3. The probability of retrieving an instance from memory depends on its activation.
4. For each option, memory instances are "blended" to determine the next choice, by combining outcome value and probability of retrieval.
5. Choose the option with the maximum blended value.
1. Each instance has an activation, a simplification of ACT-R's mechanism (Anderson & Lebiere, 1998), based on frequency and recency. Free parameters: decay d (higher d -> more recency weighting) and noise σ (higher σ -> more variability).
2. Each instance has a probability of retrieval that is a function of the memory activation (A) of that instance.
3. Each option has a blended value that combines the probabilities of retrieval and the experienced outcomes.
4. Choose the option with the highest experienced expected value (the "blended" value).
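The mechanisms above can be sketched in a few lines. This is a minimal illustration, assuming a simplified ACT-R activation (frequency/recency with decay d, logistic noise σ) and Boltzmann-style retrieval probabilities; the parameter values, the tau = sigma * sqrt(2) relation, and all helper names are illustrative rather than the published model's exact implementation.

```python
import math
import random

random.seed(0)

# Minimal sketch of IBL blended-value choice (illustrative parameters).
D = 0.5                     # decay: higher d -> more recency weighting
SIGMA = 0.25                # noise: higher sigma -> more variability
TAU = SIGMA * math.sqrt(2)  # temperature for retrieval probabilities

def activation(occurrences, now):
    """Activation of one instance from the trials on which it was observed."""
    base = math.log(sum((now - t) ** -D for t in occurrences))
    u = random.uniform(0.0001, 0.9999)          # logistic noise draw
    return base + SIGMA * math.log((1 - u) / u)

def blended_value(instances, now):
    """Blend each experienced outcome by its probability of retrieval."""
    acts = {x: activation(ts, now) for x, ts in instances.items()}
    denom = sum(math.exp(a / TAU) for a in acts.values())
    return sum(x * math.exp(a / TAU) / denom for x, a in acts.items())

# Memory per option: outcome -> list of trials on which it was experienced.
memory = {
    "A": {4.0: [1, 2, 4, 5], 0.0: [3]},  # risky option with a rare zero
    "B": {3.0: [1, 2, 3, 4, 5]},         # safe option, always 3
}
now = 6
choice = max(memory, key=lambda opt: blended_value(memory[opt], now))
```

With a single experienced outcome, B's blended value is exactly 3.0; A's blended value rises toward 4.0 as the rare zero loses activation relative to more recent outcomes, which is one way such a model can reproduce underweighting of rare events.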
IBL models have been applied to:
– Repeated choice with non-stationary probabilities (Lejarraga et al., 2012)
– Sampling and repeated choice paradigms (Gonzalez & Dutt, 2011)
– Decisions from experience (Hartman & Gonzalez, 2014; Mehlhorn et al., 2013; Gonzalez & Mehlhorn, 2014)
– The Prisoner's dilemma and other social dilemmas (Gonzalez, Ben-Asher, Martin & Dutt, 2014)
(Lejarraga, Dutt & Gonzalez, 2012)
(Gonzalez, Ben-Asher, Martin & Dutt, 2014)
[Figure: Pmax for humans and the IBL model across the 6 problems.]
(Gonzalez & Dutt, 2011)
(Gonzalez & Dutt, 2011)
[Figure: observed behavior vs. IBL model predictions (Harman et al., in prep).]
Future direction: scaling up IBL models and experimental paradigms to increased dynamic complexity
Gonzalez, C., Lerch, J. F., & Lebiere, C. (2003). Instance-based learning in dynamic decision making. Cognitive Science, 27(4), 591-635.
Gonzalez, C. (2004). Learning to make decisions in dynamic environments: Effects of time constraints and cognitive abilities. Human Factors, 46(3), 449-460.
Gonzalez, C. (2005). The relationship between task workload and cognitive abilities in dynamic decision making.
Gonzalez, C., & Quesada, J. (2003). Learning in dynamic decision making: The recognition process. Computational and Mathematical Organization Theory, 9(4), 287-304.
Gonzalez, C., Thomas, R., & Vanyukov, P. (2005). The relationships between cognitive ability and dynamic decision making. Intelligence, 33(2), 169-186.
Gonzalez, C. (2005). Decision support for real-time dynamic decision making tasks. Organizational Behavior and Human Decision Processes, 96, 142-154.
Gonzalez, C., Vanyukov, P., & Martin, M. K. (2005). The use of microworlds to study dynamic decision making. Computers in Human Behavior, 21(2), 273-286.
Gonzalez, C., & Thomas, R. (2008). Effects of automatic detection on dynamic decision making. Journal
Gonzalez, C., & Madhavan, P. (2011). Diversity during practice enhances detection of novel stimuli. Journal of Cognitive Psychology, 23(3), 342-350.
Gonzalez, C. (2012). Training decisions from experience with decision making games. In P. Durlach & A. M. Lesgold (Eds.), Adaptive technologies for training and education (pp. 167-178). New York: Cambridge University Press.
Gonzalez, C., & Dutt, V. (2011). Instance-based learning: Integrating decisions from experience in sampling and repeated choice paradigms. Psychological Review, 118(4), 523-551.
Gonzalez, C., & Dutt, V. (2012). Refuting data aggregation arguments and how the IBL model stands criticism: A reply to Hills and Hertwig (2012). Psychological Review, 119(4), 893-898.
Barron, G., & Erev, I. (2003). Small feedback-based decisions and their limited correspondence to description-based decisions. Journal of Behavioral Decision Making, 16(3), 215-233.
Brunstein, A., & Gonzalez, C. (2011). Preparing for novelty with diverse training. Applied Cognitive Psychology.
Erev, I., et al. (2010). A choice prediction competition: Choices from experience and from description. Journal of Behavioral Decision Making, 23(1), 15-47.
Harman, J., O'Donovan, J., Abdelzaher, T., & Gonzalez, C. (2014). Dynamics of human trust in recommender systems. The ACM Conference Series on Recommender Systems (RecSys).
Hertwig, R., et al. (2004). Decisions from experience and the effect of rare events in risky choice. Psychological Science, 15(8), 534-539.
Mehlhorn, K., Ben-Asher, N., Dutt, V., & Gonzalez, C. (in press). Observed variability and values matter: Towards a better understanding of information search and decisions from experience. Journal of Behavioral Decision Making.