CS344M Autonomous Multiagent Systems Todd Hester Department of - - PowerPoint PPT Presentation

cs344m autonomous multiagent systems
SMART_READER_LITE
LIVE PREVIEW

CS344M Autonomous Multiagent Systems Todd Hester Department of - - PowerPoint PPT Presentation

CS344M Autonomous Multiagent Systems Todd Hester Department of Computer Science The University of Texas at Austin Good Afternoon, Colleagues Are there any questions? Todd Hester Logistics Progress reports due at beginning of class 2


slide-1
SLIDE 1

CS344M Autonomous Multiagent Systems

Todd Hester Department of Computer Science The University of Texas at Austin

slide-2
SLIDE 2

Good Afternoon, Colleagues

Are there any questions?

Todd Hester

slide-3
SLIDE 3

Logistics

  • Progress reports due at beginning of class

− 2 hard copies − Attach your proposals − Anonymized soft copy

Todd Hester

slide-4
SLIDE 4

Logistics

  • Progress reports due at beginning of class

− 2 hard copies − Attach your proposals − Anonymized soft copy

  • Peer reviews due next Thursday

Todd Hester

slide-5
SLIDE 5

Logistics

  • Progress reports due at beginning of class

− 2 hard copies − Attach your proposals − Anonymized soft copy

  • Peer reviews due next Thursday
  • Prof. Stone will teach class Thursday

Todd Hester

slide-6
SLIDE 6

Distributed Rational Decision Making

Self-interested, rational agent

Todd Hester

slide-7
SLIDE 7

Distributed Rational Decision Making

Self-interested, rational agent

  • Self-interested:

Todd Hester

slide-8
SLIDE 8

Distributed Rational Decision Making

Self-interested, rational agent

  • Self-interested: maximize own goals

– No concern for global good

Todd Hester

slide-9
SLIDE 9

Distributed Rational Decision Making

Self-interested, rational agent

  • Self-interested: maximize own goals

– No concern for global good

  • Rational:

Todd Hester

slide-10
SLIDE 10

Distributed Rational Decision Making

Self-interested, rational agent

  • Self-interested: maximize own goals

– No concern for global good

  • Rational: agents are smart

– Ideally, will act optimally

Todd Hester

slide-11
SLIDE 11

Distributed Rational Decision Making

Self-interested, rational agent

  • Self-interested: maximize own goals

– No concern for global good

  • Rational: agents are smart

– Ideally, will act optimally The protocol is key

Todd Hester

slide-12
SLIDE 12

Evaluation Criteria

  • Social welfare
  • Pareto efficiency
  • Stability

Todd Hester

slide-13
SLIDE 13

Evaluation Criteria

  • Social welfare
  • Pareto efficiency
  • Stability
  • Individual Rationality

Todd Hester

slide-14
SLIDE 14

Evaluation Criteria

  • Social welfare
  • Pareto efficiency
  • Stability
  • Individual Rationality
  • Efficiency (computational, communication)

Todd Hester

slide-15
SLIDE 15

Voting vs. auctions

  • Voting: maximize social good

– result affects all

Todd Hester

slide-16
SLIDE 16

Voting vs. auctions

  • Voting: maximize social good

– result affects all

  • Auctions: maximize profit

– result affects buyer and seller

Todd Hester

slide-17
SLIDE 17

Activity

  • Pick an integer between 1 and 20, write it down

Todd Hester

slide-18
SLIDE 18

Activity

  • Pick an integer between 1 and 20, write it down
  • Draw a line under it
  • Pick another number, write it under the line.

Todd Hester

slide-19
SLIDE 19

Activity

  • Pick an integer between 1 and 20, write it down
  • Draw a line under it
  • Pick another number, write it under the line.
  • 1st price sealed-bid auction

Todd Hester

slide-20
SLIDE 20

Activity

  • Pick an integer between 1 and 20, write it down
  • Draw a line under it
  • Pick another number, write it under the line.
  • 1st price sealed-bid auction
  • The top number is your utility

Todd Hester

slide-21
SLIDE 21

Activity

  • Pick an integer between 1 and 20, write it down
  • Draw a line under it
  • Pick another number, write it under the line.
  • 1st price sealed-bid auction
  • The top number is your utility
  • Goal: as much profit as possible

Todd Hester

slide-22
SLIDE 22

Activity

  • Pick an integer between 1 and 20, write it down
  • Draw a line under it
  • Pick another number, write it under the line.
  • 1st price sealed-bid auction
  • The top number is your utility
  • Goal: as much profit as possible
  • Write down your bid

Todd Hester

slide-23
SLIDE 23

Activity

  • Pick an integer between 1 and 20, write it down
  • Draw a line under it
  • Pick another number, write it under the line.
  • 1st price sealed-bid auction
  • The top number is your utility
  • Goal: as much profit as possible
  • Write down your bid
  • Repeat with 2nd price sealed-bid auction
  • Number under the line is your utility

Todd Hester

slide-24
SLIDE 24

Auctions

  • Valuations:

Todd Hester

slide-25
SLIDE 25

Auctions

  • Valuations:

− private value

Todd Hester

slide-26
SLIDE 26

Auctions

  • Valuations:

− private value − common value

Todd Hester

slide-27
SLIDE 27

Auctions

  • Valuations:

− private value − common value − correlated value

Todd Hester

slide-28
SLIDE 28

Auctions

  • Valuations:

− private value − common value − correlated value

  • Types:

− first-price open-cry (English)

Todd Hester

slide-29
SLIDE 29

Auctions

  • Valuations:

− private value − common value − correlated value

  • Types:

− first-price open-cry (English) − first-price sealed-bid

Todd Hester

slide-30
SLIDE 30

Auctions

  • Valuations:

− private value − common value − correlated value

  • Types:

− first-price open-cry (English) − first-price sealed-bid − descending (Dutch)

Todd Hester

slide-31
SLIDE 31

Auctions

  • Valuations:

− private value − common value − correlated value

  • Types:

− first-price open-cry (English) − first-price sealed-bid − descending (Dutch) − second-price sealed-bid (Vickrey)

Todd Hester

slide-32
SLIDE 32

Auctions

  • Valuations:

− private value − common value − correlated value

  • Types:

− first-price open-cry (English) − first-price sealed-bid − descending (Dutch) − second-price sealed-bid (Vickrey) Revenue equivalence: private-value, risk-neutral

Todd Hester

slide-33
SLIDE 33

Auctions

  • You value a bunch of flowers at $100

Todd Hester

slide-34
SLIDE 34

Auctions

  • You value a bunch of flowers at $100
  • What strategy if auction is:

– English

Todd Hester

slide-35
SLIDE 35

Auctions

  • You value a bunch of flowers at $100
  • What strategy if auction is:

– English – first-price sealed-bid

Todd Hester

slide-36
SLIDE 36

Auctions

  • You value a bunch of flowers at $100
  • What strategy if auction is:

– English – first-price sealed-bid – Descending

Todd Hester

slide-37
SLIDE 37

Auctions

  • You value a bunch of flowers at $100
  • What strategy if auction is:

– English – first-price sealed-bid – Descending – Vickrey

Todd Hester

slide-38
SLIDE 38

Auctions

  • You value a bunch of flowers at $100
  • What strategy if auction is:

– English – first-price sealed-bid – Descending – Vickrey

  • What if it’s an antique?

Todd Hester

slide-39
SLIDE 39

Auctions

  • Vickrey, English are truthful

Todd Hester

slide-40
SLIDE 40

Auctions

  • Vickrey, English are truthful
  • First-price sealed-bid: bidders bid lower than values

Todd Hester

slide-41
SLIDE 41

Auctions

  • Vickrey, English are truthful
  • First-price sealed-bid: bidders bid lower than values

– Private value case: why?

Todd Hester

slide-42
SLIDE 42

Auctions

  • Vickrey, English are truthful
  • First-price sealed-bid: bidders bid lower than values

– Private value case: why?

  • In common (and correlated) value case, bids lower in all

mechanisms

Todd Hester

slide-43
SLIDE 43

Auctions

  • Vickrey, English are truthful
  • First-price sealed-bid: bidders bid lower than values

– Private value case: why?

  • In common (and correlated) value case, bids lower in all

mechanisms – Why?

Todd Hester

slide-44
SLIDE 44

Auctions

  • Vickrey, English are truthful
  • First-price sealed-bid: bidders bid lower than values

– Private value case: why?

  • In common (and correlated) value case, bids lower in all

mechanisms – Why? Winner’s curse

Todd Hester

slide-45
SLIDE 45

Auctions

  • How could you collude?

Todd Hester

slide-46
SLIDE 46

Auctions

  • How could you collude?

– English

Todd Hester

slide-47
SLIDE 47

Auctions

  • How could you collude?

– English – first-price sealed-bid

Todd Hester

slide-48
SLIDE 48

Auctions

  • How could you collude?

– English – first-price sealed-bid – Descending

Todd Hester

slide-49
SLIDE 49

Auctions

  • How could you collude?

– English – first-price sealed-bid – Descending – Vickrey

Todd Hester

slide-50
SLIDE 50

Auctions

  • How could you collude?

– English – first-price sealed-bid – Descending – Vickrey

  • Incentive to break coalition?

Todd Hester

slide-51
SLIDE 51

Auctions

  • How could you collude?

– English – first-price sealed-bid – Descending – Vickrey

  • Incentive to break coalition?
  • Does everyone need to be in collusion?

Todd Hester

slide-52
SLIDE 52

Auctions

  • How could you collude?

– English – first-price sealed-bid – Descending – Vickrey

  • Incentive to break coalition?
  • Does everyone need to be in collusion?
  • Application of auctions to robot soccer?

Todd Hester

slide-53
SLIDE 53

Auctions vs. voting

  • Auctions: maximize profit

– result affects buyer and seller

  • Voting: maximize social good

– result affects all

Todd Hester

slide-54
SLIDE 54

Gibbard-Satterthwaite

  • Example: Bush, Gore, or Nader?

Todd Hester

slide-55
SLIDE 55

Gibbard-Satterthwaite

  • Example: Bush, Gore, or Nader?

– Assume your preference is Nader > Gore > Bush – For whom should you vote?

Todd Hester

slide-56
SLIDE 56

Gibbard-Satterthwaite

  • Example: Bush, Gore, or Nader?

– Assume your preference is Nader > Gore > Bush – For whom should you vote? – What if we change the system?

Todd Hester

slide-57
SLIDE 57

Gibbard-Satterthwaite

  • Example: Bush, Gore, or Nader?

– Assume your preference is Nader > Gore > Bush – For whom should you vote? – What if we change the system? – Plurality, Binary, Borda?

Todd Hester

slide-58
SLIDE 58

Gibbard-Satterthwaite

  • Example: Bush, Gore, or Nader?

– Assume your preference is Nader > Gore > Bush – For whom should you vote? – What if we change the system? – Plurality, Binary, Borda?

  • 3+ candidates =

⇒ only dictatorial system eliminates need for tactical voting − One person appointed

Todd Hester

slide-59
SLIDE 59

Gibbard-Satterthwaite

  • Example: Bush, Gore, or Nader?

– Assume your preference is Nader > Gore > Bush – For whom should you vote? – What if we change the system? – Plurality, Binary, Borda?

  • 3+ candidates =

⇒ only dictatorial system eliminates need for tactical voting − One person appointed

  • No point thinking of a “better” voting system
  • Assumption: no restrictions on preferences

Todd Hester

slide-60
SLIDE 60

Gibbard-Satterthwaite

  • Example: Bush, Gore, or Nader?

– Assume your preference is Nader > Gore > Bush – For whom should you vote? – What if we change the system? – Plurality, Binary, Borda?

  • 3+ candidates =

⇒ only dictatorial system eliminates need for tactical voting − One person appointed

  • No point thinking of a “better” voting system
  • Assumption: no restrictions on preferences

What about Clarke tax algorithm?

Todd Hester

slide-61
SLIDE 61

Types of Tactical Voting

  • Compromising:

Rank someone higher to get him/her elected − e.g. Gore instead of Nader

Todd Hester

slide-62
SLIDE 62

Types of Tactical Voting

  • Compromising:

Rank someone higher to get him/her elected − e.g. Gore instead of Nader

  • Burying: Rank someone lower to get him/her defeated

− e.g. in Borda protocol

Todd Hester

slide-63
SLIDE 63

Types of Tactical Voting

  • Compromising:

Rank someone higher to get him/her elected − e.g. Gore instead of Nader

  • Burying: Rank someone lower to get him/her defeated

− e.g. in Borda protocol

  • Push-over: Rank someone higher to get someone else

elected − e.g. in a protocol with multiple rounds

Todd Hester

slide-64
SLIDE 64

Arrow’s Theorem

Universality.

Todd Hester

slide-65
SLIDE 65

Arrow’s Theorem

  • Universality. The voting method should provide a complete

ranking of all alternatives from any set of individual preference ballots.

Todd Hester

slide-66
SLIDE 66

Arrow’s Theorem

  • Universality. The voting method should provide a complete

ranking of all alternatives from any set of individual preference ballots. Pareto optimality.

Todd Hester

slide-67
SLIDE 67

Arrow’s Theorem

  • Universality. The voting method should provide a complete

ranking of all alternatives from any set of individual preference ballots. Pareto optimality. If everyone prefers X to Y, then the

  • utcome should rank X above Y

.

Todd Hester

slide-68
SLIDE 68

Arrow’s Theorem

  • Universality. The voting method should provide a complete

ranking of all alternatives from any set of individual preference ballots. Pareto optimality. If everyone prefers X to Y, then the

  • utcome should rank X above Y

. Criterion of independence of irrelevant alternatives.

Todd Hester

slide-69
SLIDE 69

Arrow’s Theorem

  • Universality. The voting method should provide a complete

ranking of all alternatives from any set of individual preference ballots. Pareto optimality. If everyone prefers X to Y, then the

  • utcome should rank X above Y

. Criterion of independence of irrelevant alternatives. If

  • ne

set of preference ballots would lead to an an overall ranking of alternative X above alternative Y and if some preference ballots are changed without changing the relative rank of X and Y, then the method should still rank X above Y .

Todd Hester

slide-70
SLIDE 70

Citizen Sovereignty.

Todd Hester

slide-71
SLIDE 71

Citizen Sovereignty. Every possible ranking of alternatives can be achieved from some set of individual preference ballots.

Todd Hester

slide-72
SLIDE 72

Citizen Sovereignty. Every possible ranking of alternatives can be achieved from some set of individual preference ballots. Non-dictatorship.

Todd Hester

slide-73
SLIDE 73

Citizen Sovereignty. Every possible ranking of alternatives can be achieved from some set of individual preference ballots. Non-dictatorship. There should not be one specific voter whose preference ballot is always adopted.

Todd Hester

slide-74
SLIDE 74

Arrow’s Theorem

Universality.

Todd Hester

slide-75
SLIDE 75

Arrow’s Theorem

  • Universality. Complete rankings

Todd Hester

slide-76
SLIDE 76

Arrow’s Theorem

  • Universality. Complete rankings

Pareto optimality.

Todd Hester

slide-77
SLIDE 77

Arrow’s Theorem

  • Universality. Complete rankings

Pareto optimality. X > Y if all agree

Todd Hester

slide-78
SLIDE 78

Arrow’s Theorem

  • Universality. Complete rankings

Pareto optimality. X > Y if all agree Citizen Sovereignty.

Todd Hester

slide-79
SLIDE 79

Arrow’s Theorem

  • Universality. Complete rankings

Pareto optimality. X > Y if all agree Citizen Sovereignty. Any ranking possible

Todd Hester

slide-80
SLIDE 80

Arrow’s Theorem

  • Universality. Complete rankings

Pareto optimality. X > Y if all agree Citizen Sovereignty. Any ranking possible Non-dictatorship.

Todd Hester

slide-81
SLIDE 81

Arrow’s Theorem

  • Universality. Complete rankings

Pareto optimality. X > Y if all agree Citizen Sovereignty. Any ranking possible Non-dictatorship. No one voter decides

Todd Hester

slide-82
SLIDE 82

Arrow’s Theorem

  • Universality. Complete rankings

Pareto optimality. X > Y if all agree Citizen Sovereignty. Any ranking possible Non-dictatorship. No one voter decides Independence of irrelevant alternatives.

Todd Hester

slide-83
SLIDE 83

Arrow’s Theorem

  • Universality. Complete rankings

Pareto optimality. X > Y if all agree Citizen Sovereignty. Any ranking possible Non-dictatorship. No one voter decides Independence of irrelevant alternatives. Removing or adding a non-winner doesn’t change winner

Todd Hester

slide-84
SLIDE 84

Arrow’s Theorem

  • Universality. Complete rankings

Pareto optimality. X > Y if all agree Citizen Sovereignty. Any ranking possible Non-dictatorship. No one voter decides Independence of irrelevant alternatives. Removing or adding a non-winner doesn’t change winner Not all possible!

Todd Hester

slide-85
SLIDE 85

Condorcet Voting

  • Strategy

proof under weaker irrelevant alternatives criterion

Todd Hester

slide-86
SLIDE 86

Condorcet Voting

  • Strategy

proof under weaker irrelevant alternatives criterion

  • A pairwise method

Todd Hester

slide-87
SLIDE 87

Condorcet Voting

  • Strategy

proof under weaker irrelevant alternatives criterion

  • A pairwise method
  • Smith set:

smallest set of candidates such that each candidate in the set preferred over each candidate not in the set

Todd Hester

slide-88
SLIDE 88

Condorcet Voting

  • Strategy

proof under weaker irrelevant alternatives criterion

  • A pairwise method
  • Smith set:

smallest set of candidates such that each candidate in the set preferred over each candidate not in the set

  • Every candidate in the Smith set is relevant

Todd Hester

slide-89
SLIDE 89

Condorcet Example

  • 48: A > B > C
  • 40: B > C > A
  • 12: C > B > A

Todd Hester

slide-90
SLIDE 90

Condorcet Example

  • 48: A > B > C
  • 40: B > C > A
  • 12: C > B > A
  • A vs. B :

Todd Hester

slide-91
SLIDE 91

Condorcet Example

  • 48: A > B > C
  • 40: B > C > A
  • 12: C > B > A
  • A vs. B : 48 – 52 =

⇒ B > A

Todd Hester

slide-92
SLIDE 92

Condorcet Example

  • 48: A > B > C
  • 40: B > C > A
  • 12: C > B > A
  • A vs. B : 48 – 52 =

⇒ B > A

  • A vs. C : 48 – 52 =

⇒ C > A

  • B vs. C : 88 – 12 =

⇒ B > C

Todd Hester

slide-93
SLIDE 93

Condorcet Example

  • 48: A > B > C
  • 40: B > C > A
  • 12: C > B > A
  • A vs. B : 48 – 52 =

⇒ B > A

  • A vs. C : 48 – 52 =

⇒ C > A

  • B vs. C : 88 – 12 =

⇒ B > C Overall: B > C > A

Todd Hester

slide-94
SLIDE 94

Condorcet Example

  • 48: A > B > C
  • 40: B > C > A
  • 12: C > B > A
  • A vs. B : 48 – 52 =

⇒ B > A

  • A vs. C : 48 – 52 =

⇒ C > A

  • B vs. C : 88 – 12 =

⇒ B > C Overall: B > C > A

  • Does that solve everything?

Todd Hester

slide-95
SLIDE 95

Condorcet Example

  • 48: A > B > C
  • 40: B > C > A
  • 12: C > B > A
  • A vs. B : 48 – 52 =

⇒ B > A

  • A vs. C : 48 – 52 =

⇒ C > A

  • B vs. C : 88 – 12 =

⇒ B > C Overall: B > C > A

  • Does that solve everything? What about cycles?

Todd Hester