Models of Strategic Reasoning Lecture 2 Eric Pacuit University of - PowerPoint PPT Presentation

Models of Strategic Reasoning Lecture 2 Eric Pacuit University of Maryland, College Park ai.stanford.edu/~epacuit August 7, 2012 Eric Pacuit: Models of Strategic Reasoning 1/30

Lecture 1: Introduction, Motivation and Background Lecture 2: The Dynamics of Rational Deliberation Lecture 3: Reasoning to a Solution: Common Modes of Reasoning in Games Lecture 4: Reasoning to a Model: Iterated Belief Change as Deliberation Reasoning in Specific Games: Experimental Results Lecture 5: Eric Pacuit: Models of Strategic Reasoning 2/30

B. Skyrms. The Dynamics of Rational Deliberation . Harvard University Press, 1990. Eric Pacuit: Models of Strategic Reasoning 3/30

Suppose that one deliberates by calculating expected utility. Eric Pacuit: Models of Strategic Reasoning 4/30

Suppose that one deliberates by calculating expected utility. In the simplest case, deliberation is trivial; one calculates expected utility and maximizes Eric Pacuit: Models of Strategic Reasoning 4/30

Suppose that one deliberates by calculating expected utility. In the simplest case, deliberation is trivial; one calculates expected utility and maximizes Information feedback : “the very process of deliberation may generate information that is relevant to the evaluation of the expected utilities. Then, processing costs permitting, a Bayesian deliberator will feed back that information, modifying his probabilities of states of the world, and recalculate expected utilities in light of the new knowledge.” Eric Pacuit: Models of Strategic Reasoning 4/30

Deliberational Equilibrium The decision maker cannot decide to do an act that is not an equilibrium of the deliberational process. ( provided we neglect processing costs...the implementations use a “satisficing level” ) Eric Pacuit: Models of Strategic Reasoning 5/30

Deliberational Equilibrium The decision maker cannot decide to do an act that is not an equilibrium of the deliberational process. ( provided we neglect processing costs...the implementations use a “satisficing level” ) This sort of equilibirium requirement can be seen as a consequence of the expected utility principle (dynamic coherence). It is usually neglected because the process of informational feedback is usually neglected. Eric Pacuit: Models of Strategic Reasoning 5/30

A Bayesian has to choose between n acts: s 1 , s 2 , . . . , s n Eric Pacuit: Models of Strategic Reasoning 6/30

A Bayesian has to choose between n acts: s 1 , s 2 , . . . , s n state of indecision : P = � p 1 , . . . , p n � of probabilities for each act ( � i p i = 1). The default mixed act is the mixed act corresponding to the state of indecision (decision makers always make a decision). Eric Pacuit: Models of Strategic Reasoning 6/30

A Bayesian has to choose between n acts: s 1 , s 2 , . . . , s n state of indecision : P = � p 1 , . . . , p n � of probabilities for each act ( � i p i = 1). The default mixed act is the mixed act corresponding to the state of indecision (decision makers always make a decision). status quo : EU ( P ) = � i p i · u i ( s i ) Eric Pacuit: Models of Strategic Reasoning 6/30

A person’s state of indecision evolves during deliberation. After computing expected utility, she will believe more strongly that she will ultimately do the acts (or one of those acts) that are ranked more highly than her current state of indecision. Eric Pacuit: Models of Strategic Reasoning 7/30

A person’s state of indecision evolves during deliberation. After computing expected utility, she will believe more strongly that she will ultimately do the acts (or one of those acts) that are ranked more highly than her current state of indecision. Why not just do the act with highest expected utility? Eric Pacuit: Models of Strategic Reasoning 7/30

A person’s state of indecision evolves during deliberation. After computing expected utility, she will believe more strongly that she will ultimately do the acts (or one of those acts) that are ranked more highly than her current state of indecision. Why not just do the act with highest expected utility? On pain of incoherence , the player will continue to deliberate if she believes that she is in an informational feedback situation and if she assigns any positive probability at all to the possibility that informational feedback may lead her ultimately to a different decision. Eric Pacuit: Models of Strategic Reasoning 7/30

A person’s state of indecision evolves during deliberation. After computing expected utility, she will believe more strongly that she will ultimately do the acts (or one of those acts) that are ranked more highly than her current state of indecision. Why not just do the act with highest expected utility? On pain of incoherence , the player will continue to deliberate if she believes that she is in an informational feedback situation and if she assigns any positive probability at all to the possibility that informational feedback may lead her ultimately to a different decision. The decision maker follows a “simple dynamical rule” for “making up one’s mind” Eric Pacuit: Models of Strategic Reasoning 7/30

Seeks the good The dynamical rule seeks the good : 1. the rule raises the probability of an act only if that act has utility greater than the status quo 2. the rule raises the sum of the probability of all acts with utility greater than the status quo (if any) Eric Pacuit: Models of Strategic Reasoning 8/30

Seeks the good The dynamical rule seeks the good : 1. the rule raises the probability of an act only if that act has utility greater than the status quo 2. the rule raises the sum of the probability of all acts with utility greater than the status quo (if any) all dynamical rules that seek the good have the same fixed points: those states in which the expected utility of the status quo is maximal. Eric Pacuit: Models of Strategic Reasoning 8/30

Nash Dynamics covetability of act A : given a state of indecision P cov ( A ) = max( EU ( A ) − EU ( P ) , 0) Eric Pacuit: Models of Strategic Reasoning 9/30

Nash Dynamics covetability of act A : given a state of indecision P cov ( A ) = max( EU ( A ) − EU ( P ) , 0) Nash map : P �→ P ′ where each component p ′ i is calculated as follows: p i + cov ( A i ) p ′ i = 1 + � i cov ( A i ) Eric Pacuit: Models of Strategic Reasoning 9/30

Nash Dynamics covetability of act A : given a state of indecision P cov ( A ) = max( EU ( A ) − EU ( P ) , 0) Nash map : P �→ P ′ where each component p ′ i is calculated as follows: p i + cov ( A i ) p ′ i = 1 + � i cov ( A i ) More generally, for k > 0, i = k · p i + cov ( A i ) p ′ k + � i cov ( A i ) where k is the “index of caution”. The higher the k the more slowly the decision maker moves in the direction of acts that look more attractive than the status quo. Eric Pacuit: Models of Strategic Reasoning 9/30

decision maker’s personal state : � x , y � where x is the state of indecision and the probabilities she assigns to the “states of nature” Eric Pacuit: Models of Strategic Reasoning 10/30

decision maker’s personal state : � x , y � where x is the state of indecision and the probabilities she assigns to the “states of nature” Dynamics: ϕ ( � x , y � ) = � x ′ , y ′ � consisting of 1. An “adaptive dynamic map” D sending � x , y � to x ′ 2. the informational feedback process I sending � x , y � to y ′ Eric Pacuit: Models of Strategic Reasoning 10/30

decision maker’s personal state : � x , y � where x is the state of indecision and the probabilities she assigns to the “states of nature” Dynamics: ϕ ( � x , y � ) = � x ′ , y ′ � consisting of 1. An “adaptive dynamic map” D sending � x , y � to x ′ 2. the informational feedback process I sending � x , y � to y ′ A personal state � x , y � is a deliberational equilibrium iff ϕ ( � x , y � ) = � x , y � Eric Pacuit: Models of Strategic Reasoning 10/30

Fact . If D seeks the good and I is continuous, then there is a delbierational equilibrium, � x , y � , for � D , I � . If D ′ also seeks the good, then � x , y � is also a deliberational equilibrium for � D ′ , I � . The default mixed act corresponding to x maximizes expected utility at � x , y � . Eric Pacuit: Models of Strategic Reasoning 11/30

Games played by Bayesian deliberators For each player, the decisions of the other players constitute the relevant state of the world, which together with her decision, determines the consequences in accordance with the payoff matrix. Eric Pacuit: Models of Strategic Reasoning 12/30

Games played by Bayesian deliberators For each player, the decisions of the other players constitute the relevant state of the world, which together with her decision, determines the consequences in accordance with the payoff matrix. 1. Start from the initial position, player i calculates expected utility and moves by her adaptive rule to a new state of indecision. Eric Pacuit: Models of Strategic Reasoning 12/30

Models of Strategic Reasoning Lecture 2 Eric Pacuit University of - PowerPoint PPT Presentation

Models of Strategic Reasoning Lecture 2 Eric Pacuit University of Maryland, College Park ai.stanford.edu/~epacuit August 7, 2012 Eric Pacuit: Models of Strategic Reasoning 1/30 Lecture 1: Introduction, Motivation and Background Lecture 2:

Automated Reasoning Course Presentation Summary Automated Reasoning Motivations Course Plan

Evidential and Causal Reasoning Much reasoning in AI can be seen as evidential reasoning ,

Models for Inexact Reasoning Models for Inexact Reasoning Reasoning with Certainty Factors: The

Models of Strategic Reasoning Lecture 3 Eric Pacuit University of Maryland, College Park

Surface Reasoning Lecture 1: Reasoning with Monotonicity Thomas Icard June 18-22, 2012 Thomas

SECTION 1: Introductions Code Reasoning Forward Reasoning CODE REASONING +

Probabilistic Reasoning; Probabilistic Reasoning; Network-based reasoning Network-based

CHAPTER-4 1 LOGIC AND REASONING ! Knowledge and ! Reasoning in Knowledge- Reasoning Based

Models for Inexact Reasoning Reasoning with Subjective Pseudo Reasoning with Subjective Pseudo

Reasoning and Meta-reasoning Sonia Marin IT-University of Copenhagen, Denmark 85-211

Reasoning Skills Alicia Foy Gifted Specialist 3/21/19 1 www.FLDOE.org Objectives Student

Automated Reasoning: Some Successes and New Challenges Predrag Jani ci c

Automated Reasoning Introduction Jacques Fleuriot Automated Reasoning Introduction Lecture 1,

Foundations of AI 18. Strategic Games Strategic Reasoning and Acting Wolfram Burgard and

Demystifying the AIA Strategic Council STRATEGIC COUNCIL Strategic Council - SC 503 STRATEGIC

STRATEGIC PLANNING STRATEGIC PLANNING STRATEGIC PLANNING STRATEGIC PLANNING AIKEN COUNTY PUBLIC

Software Agents and Multi-Agent Systems Keith S. Decker Department of Computer Science

Johan tting (Sectra) Johan Jonasson (House of Test) Martin Gladh (Frontit) Lecture at LiU

Preparing the Next Testers: An Undergraduate Course in Quality Assurance PACIFIC NW SOFTWARE

Design and Patterns of Human Behavior Professor Larry Heimann Application Design & Development

Eighth International Planning Competition: Deterministic Part Luk a s Chrpa Mauro Vallati

Mimi M. Recker Professor and Department Head October, 2013 1

The human factor Tyler Moore Tandy School of Computer Science, University of Tulsa Outline

More Data Cleaning; Crowdsourcing February 11, 2020 Data Science CSCI 1951A Brown University

Sambuz

Useful Links

Newsletter

Mail Us

Models of Strategic Reasoning Lecture 2 Eric Pacuit University of - PowerPoint PPT Presentation

Models of Strategic Reasoning Lecture 2 Eric Pacuit University of Maryland, College Park ai.stanford.edu/~epacuit August 7, 2012 Eric Pacuit: Models of Strategic Reasoning 1/30 Lecture 1: Introduction, Motivation and Background Lecture 2:

Automated Reasoning Course Presentation Summary Automated Reasoning Motivations Course Plan

Evidential and Causal Reasoning Much reasoning in AI can be seen as evidential reasoning ,

Models for Inexact Reasoning Models for Inexact Reasoning Reasoning with Certainty Factors: The

Models of Strategic Reasoning Lecture 3 Eric Pacuit University of Maryland, College Park

Surface Reasoning Lecture 1: Reasoning with Monotonicity Thomas Icard June 18-22, 2012 Thomas

SECTION 1: Introductions Code Reasoning Forward Reasoning CODE REASONING +

Probabilistic Reasoning; Probabilistic Reasoning; Network-based reasoning Network-based

CHAPTER-4 1 LOGIC AND REASONING ! Knowledge and ! Reasoning in Knowledge- Reasoning Based

Models for Inexact Reasoning Reasoning with Subjective Pseudo Reasoning with Subjective Pseudo

Reasoning and Meta-reasoning Sonia Marin IT-University of Copenhagen, Denmark 85-211

Reasoning Skills Alicia Foy Gifted Specialist 3/21/19 1 www.FLDOE.org Objectives Student

Automated Reasoning: Some Successes and New Challenges Predrag Jani ci c

Automated Reasoning Introduction Jacques Fleuriot Automated Reasoning Introduction Lecture 1,

Foundations of AI 18. Strategic Games Strategic Reasoning and Acting Wolfram Burgard and

Demystifying the AIA Strategic Council STRATEGIC COUNCIL Strategic Council - SC 503 STRATEGIC

STRATEGIC PLANNING STRATEGIC PLANNING STRATEGIC PLANNING STRATEGIC PLANNING AIKEN COUNTY PUBLIC

Software Agents and Multi-Agent Systems Keith S. Decker Department of Computer Science

Johan tting (Sectra) Johan Jonasson (House of Test) Martin Gladh (Frontit) Lecture at LiU

Preparing the Next Testers: An Undergraduate Course in Quality Assurance PACIFIC NW SOFTWARE

Design and Patterns of Human Behavior Professor Larry Heimann Application Design &amp; Development

Eighth International Planning Competition: Deterministic Part Luk a s Chrpa Mauro Vallati

Mimi M. Recker Professor and Department Head October, 2013 1

The human factor Tyler Moore Tandy School of Computer Science, University of Tulsa Outline

More Data Cleaning; Crowdsourcing February 11, 2020 Data Science CSCI 1951A Brown University

Sambuz

Useful Links

Newsletter

Mail Us

Design and Patterns of Human Behavior Professor Larry Heimann Application Design & Development