Homework 5.4 Recall the following theorem from the overconfidence - PowerPoint PPT Presentation

Homework 5.4  Recall the following theorem from the “overconfidence versus paranoia” lecture:  Theorem . In a zero-sum perfect-information game, both the paranoid and overconfident models will lead you to choose the same strategy s 1 *  Professor Prune claims that the following game tree is a counterexample to the theorem. Is he right? Why or why not? Can you suggest any alternative interpretations that might resolve the problem? –1 101 0 2 Updated 11/2/10 Nau: Game Theory 1

Homework 6.1 In repeated games, which of the following strategies are stationary?  AllC : always cooperate  AllD (the Hawk strategy): always defect  Grim : cooperate until the other agent defects, then defect forever  Tit-for-Tat (TFT) : cooperate on the first move. On the n th move, repeat the other agent ( n –1) th move  Random : Randomly intersperse cooperation and defection  Tester : defect on move 1. If the other agent retaliates, play TFT. Otherwise, randomly intersperse cooperation and defection  Tit-For-Two-Tats (TFTT) : Like Tit-for-Tat, except that it retaliates only if the other agent defects twice in a row  Generous Tit-For-Tat (GTFT) : Like Tit-for-Tat, but with a small probability c >0 of cooperation on the n ’th move if the other agent defected on the ( n –1) th move  Pavlov (Win-Stay, Lose-Shift) : Repeats its previous move if it earns 3 or 5 points in the previous iteration, and reverses its previous move if it earns 0 or 1 points in the previous iteration Updated 11/2/10 Nau: Game Theory 2

Homework 6.2  Recall that DBS’s opponent model is as follows:  A set of rules of the following form if our last move was m and their last move was m' then P [their next move will be C ] • Four rules: one for each of (C,C), (C,D), (D,C), and (D,D)  For example, TFT can be described as • (C,C) ⇒ 1, (C, D) ⇒ 1, (D, C ) ⇒ 0, (D, D) ⇒ 0 (a) Which of the strategies in Homework 6.1 can be represented by DBS’s opponent model? (b) Are there any stationary strategies that DBS’s opponent model cannot represent? Why or why not? (b) Are there any stationary strategies that DBS’s opponent model cannot represent? Why or why not? Updated 11/2/10 Nau: Game Theory 3

Homework 6.3  Let S be the following set of interaction traces for the iterated battle of the sexes with 6 iterations. (a) What are the compatible subsets of S ? (b) For each compatible subset, what is the composite trace and its expected utility if we assume agents a 2 , a 4 , a 6 , and a 8 are equally likely? Trace 1 Trace 2 Trace 4 Trace 3 a 1 a 2 a 3 a 4 a 5 a 6 a 7 a 8 G T G T T G T G G T G T T G T T G G T G T T T G T T G G T G G T T G T G G T G T T G G T G G T T Updated 11/2/10 Nau: Game Theory 4

Homework 6.4 (a) What is the expectiminimax value of the following game tree? MAX 3 − 1 CHANCE 0.5 0.5 0.5 0.5 1/4 3/4 1/3 2/3 2 4 0 − 2 MIN 2 4 7 4 6 0 5 − 2 2 4 6 8 10 –2 –4 –6 Updated 11/2/10 Nau: Game Theory 5

Homework 6.5 (b) Do we need to know z in order to know the expectiminimax value of the game tree? Why or why not? MAX 3 − 1 CHANCE 0.5 0.5 0.5 0.5 1/4 3/4 1/3 2/3 2 4 0 − 2 MIN 2 4 7 4 6 0 5 − 2 2 4 6 8 10 –2 –4 z Updated 11/2/10 Nau: Game Theory 6

Homework 6.5 (c) Do we need to know y and z in order to know the expectiminimax value of the game tree? Why or why not? MAX 3 − 1 CHANCE 0.5 0.5 0.5 0.5 1/4 3/4 1/3 2/3 2 4 0 − 2 MIN 2 4 7 4 6 0 5 − 2 2 4 6 8 10 –2 y z Updated 11/2/10 Nau: Game Theory 7

Homework 6.5 (d) Suppose we know in advance that every terminal node has a value in the range [–10, +10]. For what values of x (if any) will y and z have no effect on the expectiminimax value of the tree? MAX 3 − 1 CHANCE 0.5 0.5 0.5 0.5 1/4 3/4 1/3 2/3 2 4 0 − 2 MIN 2 4 6 8 10 x y z 2 4 7 4 6 0 5 − 2 Updated 11/2/10 Nau: Game Theory 8

Homework 6.6  Consider the double lottery game with the imitate-the-better dynamic. Suppose agent a ’s strategy is R and agent b ’s strategy is RwS . If we compare the two agents’ payoffs in order to decide which of them reproduces, then what are their probabilities of reproducing? Updated 11/2/10 Nau: Game Theory 9

Homework 5.4 Recall the following theorem from the overconfidence - PowerPoint PPT Presentation

Homework 5.4 Recall the following theorem from the overconfidence versus paranoia lecture: Theorem . In a zero-sum perfect-information game, both the paranoid and overconfident models will lead you to choose the same strategy s 1 *

Homework and Exams Homework Context Free Languages Return Homework #2 Homework #3

Homework Homework Context Free Languages Return Homework #2 Homework #3 Due today

Homework Homework #1 returned today Kleene Theorem Homework #2 due today Homework

Homework Homework #5 returned Turing Machines Homework #6 due today Homework #7

Homework Homework #2 returned Context Free Languages Homework #3 returned today (for early

Homework Homework #3 returned Chomsky Normal Form Homework #4 due today Homework #5

Homework Homework #2 returned Context Free Languages Homework #3 due today Homework #4

Homework Homework #1 returned Equivalence and DFA Homework #2 Due today Minimization

Homework Homework #1 returned Equivalence and DFA Homework #2 Due today Minimization

Announcements - Homework Homework 1 is graded, please collect at end of lecture Homework 2

Homework: Realities/Next Steps Hall Middle School - April 26, 2017 Purpose of Homework The

Show My Homework For Teachers, Students and Parents. www.showmyhomework.co.uk The world's No. 1

10/12/18 Homework Code of Conduct discuss homework but write your own HW! discuss homework

Calculator Homework 1 Homework Goal Use Storyboard to create user interface of an app.

Stereo Computer Vision Fall 2018 Columbia University Homework Homework 2 grades are back

Logistics Homework Homework #1 due today. Regular Expressions Homework #2

CS 574: Randomized Algorithms Lecture 1. Introduction to Randomness August 25, 2015 Lecture 1.

Covering Metric Spaces by Few Trees Yair Bartal Nova Fandina Ofer neiman Tree Covers Let

Lagrangian Relaxation via Randomized Rounding, Introduction Neal E. Young UCR, 1/28/04

Predicting data o v er time MAC H IN E L E AR N IN G FOR TIME SE R IE S DATA IN P YTH ON

Game theory (Ch. 17.5) Announcements Test grades up now Find best strategy How does this

Sensibility Testbed Justin Cappos New York University Polytechnic School of Engineering

Public and private BitTorrent communities: A measurement study M. Meulpolder, L. DAcunto, M.

Incentives to relay 1) Incentives to relay traffic 2) Incentives to do it well 3)

Sambuz

Useful Links

Newsletter

Mail Us