CS344M Autonomous Multiagent Systems Patrick MacAlpine Department - - PowerPoint PPT Presentation

cs344m autonomous multiagent systems
SMART_READER_LITE
LIVE PREVIEW

CS344M Autonomous Multiagent Systems Patrick MacAlpine Department - - PowerPoint PPT Presentation

CS344M Autonomous Multiagent Systems Patrick MacAlpine Department of Computer Science The University of Texas at Austin Good Afternoon, Colleagues Are there any questions? Patrick MacAlpine Good Afternoon, Colleagues Are there any


slide-1
SLIDE 1

CS344M Autonomous Multiagent Systems

Patrick MacAlpine Department of Computer Science The University of Texas at Austin

slide-2
SLIDE 2

Good Afternoon, Colleagues

Are there any questions?

Patrick MacAlpine

slide-3
SLIDE 3

Good Afternoon, Colleagues

Are there any questions?

  • Sandholm says “no Nash equilibrium exists”?
  • Difference between axiomatic and strategic bargaining?
  • How to calculate social welfare metric of a protocol?
  • Why use Dutch auction?

Patrick MacAlpine

slide-4
SLIDE 4

Logistics

  • Peer review process (due today) - thoughts?

Patrick MacAlpine

slide-5
SLIDE 5

Logistics

  • Peer review process (due today) - thoughts?
  • Progress reports coming back

Patrick MacAlpine

slide-6
SLIDE 6

Logistics

  • Peer review process (due today) - thoughts?
  • Progress reports coming back
  • Final projects due in 3 weeks!

Patrick MacAlpine

slide-7
SLIDE 7

Logistics

  • Peer review process (due today) - thoughts?
  • Progress reports coming back
  • Final projects due in 3 weeks!
  • Final tournament: Wednesday 12/9 at 7pm in GDC 5.302

Patrick MacAlpine

slide-8
SLIDE 8

Your Progress Reports

  • Best ones motivate the problem before giving solutions

Patrick MacAlpine

slide-9
SLIDE 9

Your Progress Reports

  • Best ones motivate the problem before giving solutions
  • Say not only what’s done, but what’s yet to do

Patrick MacAlpine

slide-10
SLIDE 10

Your Progress Reports

  • Best ones motivate the problem before giving solutions
  • Say not only what’s done, but what’s yet to do
  • More about what worked than what didn’t

Patrick MacAlpine

slide-11
SLIDE 11

Your Progress Reports

  • Best ones motivate the problem before giving solutions
  • Say not only what’s done, but what’s yet to do
  • More about what worked than what didn’t
  • Clear enough for outsider to understand

Patrick MacAlpine

slide-12
SLIDE 12

Your Progress Reports

  • Best ones motivate the problem before giving solutions
  • Say not only what’s done, but what’s yet to do
  • More about what worked than what didn’t
  • Clear enough for outsider to understand
  • Be specific - enough detail so that we could reimplement

Patrick MacAlpine

slide-13
SLIDE 13

Your Progress Reports

  • Best ones motivate the problem before giving solutions
  • Say not only what’s done, but what’s yet to do
  • More about what worked than what didn’t
  • Clear enough for outsider to understand
  • Be specific - enough detail so that we could reimplement
  • Break into sections

Patrick MacAlpine

slide-14
SLIDE 14

Your Progress Reports

  • Best ones motivate the problem before giving solutions
  • Say not only what’s done, but what’s yet to do
  • More about what worked than what didn’t
  • Clear enough for outsider to understand
  • Be specific - enough detail so that we could reimplement
  • Break into sections
  • Explain how you will evaluate performance (test statistical

significance)

Patrick MacAlpine

slide-15
SLIDE 15

Auctions vs. voting

  • Auctions: maximize profit

– result affects buyer and seller

  • Voting: maximize social good

– result affects all

Patrick MacAlpine

slide-16
SLIDE 16

Gibbard-Satterthwaite

  • Example: Trump, Carson, or Bush?

Patrick MacAlpine

slide-17
SLIDE 17

Gibbard-Satterthwaite

  • Example: Trump, Carson, or Bush?

– Assume your preference is Trump > Carson > Bush – For whom should you vote?

Patrick MacAlpine

slide-18
SLIDE 18

Gibbard-Satterthwaite

  • Example: Trump, Carson, or Bush?

– Assume your preference is Trump > Carson > Bush – For whom should you vote? – What if we change the system?

Patrick MacAlpine

slide-19
SLIDE 19

Gibbard-Satterthwaite

  • Example: Trump, Carson, or Bush?

– Assume your preference is Trump > Carson > Bush – For whom should you vote? – What if we change the system? – Plurality, Binary, Borda?

Patrick MacAlpine

slide-20
SLIDE 20

Gibbard-Satterthwaite

  • Example: Trump, Carson, or Bush?

– Assume your preference is Trump > Carson > Bush – For whom should you vote? – What if we change the system? – Plurality, Binary, Borda?

  • 3+ candidates =

⇒ only dictatorial system eliminates need for tactical voting − One person appointed

Patrick MacAlpine

slide-21
SLIDE 21

Gibbard-Satterthwaite

  • Example: Trump, Carson, or Bush?

– Assume your preference is Trump > Carson > Bush – For whom should you vote? – What if we change the system? – Plurality, Binary, Borda?

  • 3+ candidates =

⇒ only dictatorial system eliminates need for tactical voting − One person appointed

  • No point thinking of a “better” voting system
  • Assumption: no restrictions on preferences

Patrick MacAlpine

slide-22
SLIDE 22

Gibbard-Satterthwaite

  • Example: Trump, Carson, or Bush?

– Assume your preference is Trump > Carson > Bush – For whom should you vote? – What if we change the system? – Plurality, Binary, Borda?

  • 3+ candidates =

⇒ only dictatorial system eliminates need for tactical voting − One person appointed

  • No point thinking of a “better” voting system
  • Assumption: no restrictions on preferences

What about Clarke tax algorithm?

Patrick MacAlpine

slide-23
SLIDE 23

Types of Tactical Voting

  • Compromising:

Rank someone higher to get him/her elected − e.g. Carson instead of Trump

Patrick MacAlpine

slide-24
SLIDE 24

Types of Tactical Voting

  • Compromising:

Rank someone higher to get him/her elected − e.g. Carson instead of Trump

  • Burying: Rank someone lower to get him/her defeated

− e.g. in Borda protocol

Patrick MacAlpine

slide-25
SLIDE 25

Types of Tactical Voting

  • Compromising:

Rank someone higher to get him/her elected − e.g. Carson instead of Trump

  • Burying: Rank someone lower to get him/her defeated

− e.g. in Borda protocol

  • Push-over: Rank someone higher to get someone else

elected − e.g. in a protocol with multiple rounds

Patrick MacAlpine

slide-26
SLIDE 26

Arrow’s Theorem

Universality.

Patrick MacAlpine

slide-27
SLIDE 27

Arrow’s Theorem

  • Universality. The voting method should provide a complete

ranking of all alternatives from any set of individual preference ballots.

Patrick MacAlpine

slide-28
SLIDE 28

Arrow’s Theorem

  • Universality. The voting method should provide a complete

ranking of all alternatives from any set of individual preference ballots. Pareto optimality.

Patrick MacAlpine

slide-29
SLIDE 29

Arrow’s Theorem

  • Universality. The voting method should provide a complete

ranking of all alternatives from any set of individual preference ballots. Pareto optimality. If everyone prefers X to Y, then the

  • utcome should rank X above Y

.

Patrick MacAlpine

slide-30
SLIDE 30

Arrow’s Theorem

  • Universality. The voting method should provide a complete

ranking of all alternatives from any set of individual preference ballots. Pareto optimality. If everyone prefers X to Y, then the

  • utcome should rank X above Y

. Criterion of independence of irrelevant alternatives.

Patrick MacAlpine

slide-31
SLIDE 31

Arrow’s Theorem

  • Universality. The voting method should provide a complete

ranking of all alternatives from any set of individual preference ballots. Pareto optimality. If everyone prefers X to Y, then the

  • utcome should rank X above Y

. Criterion of independence of irrelevant alternatives. If

  • ne

set of preference ballots would lead to an an overall ranking of alternative X above alternative Y and if some preference ballots are changed without changing the relative rank of X and Y, then the method should still rank X above Y .

Patrick MacAlpine

slide-32
SLIDE 32

Citizen Sovereignty.

Patrick MacAlpine

slide-33
SLIDE 33

Citizen Sovereignty. Every possible ranking of alternatives can be achieved from some set of individual preference ballots.

Patrick MacAlpine

slide-34
SLIDE 34

Citizen Sovereignty. Every possible ranking of alternatives can be achieved from some set of individual preference ballots. Non-dictatorship.

Patrick MacAlpine

slide-35
SLIDE 35

Citizen Sovereignty. Every possible ranking of alternatives can be achieved from some set of individual preference ballots. Non-dictatorship. There should not be one specific voter whose preference ballot is always adopted.

Patrick MacAlpine

slide-36
SLIDE 36

Arrow’s Theorem

Universality.

Patrick MacAlpine

slide-37
SLIDE 37

Arrow’s Theorem

  • Universality. Complete rankings

Patrick MacAlpine

slide-38
SLIDE 38

Arrow’s Theorem

  • Universality. Complete rankings

Pareto optimality.

Patrick MacAlpine

slide-39
SLIDE 39

Arrow’s Theorem

  • Universality. Complete rankings

Pareto optimality. X > Y if all agree

Patrick MacAlpine

slide-40
SLIDE 40

Arrow’s Theorem

  • Universality. Complete rankings

Pareto optimality. X > Y if all agree Citizen Sovereignty.

Patrick MacAlpine

slide-41
SLIDE 41

Arrow’s Theorem

  • Universality. Complete rankings

Pareto optimality. X > Y if all agree Citizen Sovereignty. Any ranking possible

Patrick MacAlpine

slide-42
SLIDE 42

Arrow’s Theorem

  • Universality. Complete rankings

Pareto optimality. X > Y if all agree Citizen Sovereignty. Any ranking possible Non-dictatorship.

Patrick MacAlpine

slide-43
SLIDE 43

Arrow’s Theorem

  • Universality. Complete rankings

Pareto optimality. X > Y if all agree Citizen Sovereignty. Any ranking possible Non-dictatorship. No one voter decides

Patrick MacAlpine

slide-44
SLIDE 44

Arrow’s Theorem

  • Universality. Complete rankings

Pareto optimality. X > Y if all agree Citizen Sovereignty. Any ranking possible Non-dictatorship. No one voter decides Independence of irrelevant alternatives.

Patrick MacAlpine

slide-45
SLIDE 45

Arrow’s Theorem

  • Universality. Complete rankings

Pareto optimality. X > Y if all agree Citizen Sovereignty. Any ranking possible Non-dictatorship. No one voter decides Independence of irrelevant alternatives. Removing or adding a non-winner doesn’t change winner

Patrick MacAlpine

slide-46
SLIDE 46

Arrow’s Theorem

  • Universality. Complete rankings

Pareto optimality. X > Y if all agree Citizen Sovereignty. Any ranking possible Non-dictatorship. No one voter decides Independence of irrelevant alternatives. Removing or adding a non-winner doesn’t change winner Not all possible!

Patrick MacAlpine

slide-47
SLIDE 47

Condorcet Voting

  • Strategy

proof under weaker irrelevant alternatives criterion

Patrick MacAlpine

slide-48
SLIDE 48

Condorcet Voting

  • Strategy

proof under weaker irrelevant alternatives criterion

  • A pairwise method

Patrick MacAlpine

slide-49
SLIDE 49

Condorcet Voting

  • Strategy

proof under weaker irrelevant alternatives criterion

  • A pairwise method
  • Smith set:

smallest set of candidates such that each candidate in the set preferred over each candidate not in the set

Patrick MacAlpine

slide-50
SLIDE 50

Condorcet Voting

  • Strategy

proof under weaker irrelevant alternatives criterion

  • A pairwise method
  • Smith set:

smallest set of candidates such that each candidate in the set preferred over each candidate not in the set

  • Every candidate in the Smith set is relevant

Patrick MacAlpine

slide-51
SLIDE 51

Condorcet Example

  • 48: A > B > C
  • 40: B > C > A
  • 12: C > B > A

Patrick MacAlpine

slide-52
SLIDE 52

Condorcet Example

  • 48: A > B > C
  • 40: B > C > A
  • 12: C > B > A
  • A vs. B :

Patrick MacAlpine

slide-53
SLIDE 53

Condorcet Example

  • 48: A > B > C
  • 40: B > C > A
  • 12: C > B > A
  • A vs. B : 48 – 52 =

⇒ B > A

Patrick MacAlpine

slide-54
SLIDE 54

Condorcet Example

  • 48: A > B > C
  • 40: B > C > A
  • 12: C > B > A
  • A vs. B : 48 – 52 =

⇒ B > A

  • A vs. C : 48 – 52 =

⇒ C > A

  • B vs. C : 88 – 12 =

⇒ B > C

Patrick MacAlpine

slide-55
SLIDE 55

Condorcet Example

  • 48: A > B > C
  • 40: B > C > A
  • 12: C > B > A
  • A vs. B : 48 – 52 =

⇒ B > A

  • A vs. C : 48 – 52 =

⇒ C > A

  • B vs. C : 88 – 12 =

⇒ B > C Overall: B > C > A

Patrick MacAlpine

slide-56
SLIDE 56

Condorcet Example

  • 48: A > B > C
  • 40: B > C > A
  • 12: C > B > A
  • A vs. B : 48 – 52 =

⇒ B > A

  • A vs. C : 48 – 52 =

⇒ C > A

  • B vs. C : 88 – 12 =

⇒ B > C Overall: B > C > A

  • Does that solve everything?

Patrick MacAlpine

slide-57
SLIDE 57

Condorcet Example

  • 48: A > B > C
  • 40: B > C > A
  • 12: C > B > A
  • A vs. B : 48 – 52 =

⇒ B > A

  • A vs. C : 48 – 52 =

⇒ C > A

  • B vs. C : 88 – 12 =

⇒ B > C Overall: B > C > A

  • Does that solve everything? What about cycles?

Patrick MacAlpine

slide-58
SLIDE 58

Bargaining

small market, both can come out favorably

Patrick MacAlpine

slide-59
SLIDE 59

Bargaining

small market, both can come out favorably

  • Two people bargaining, each with a preference over
  • utcomes O
  • Let o∗ be the selected outcome

Patrick MacAlpine

slide-60
SLIDE 60

Bargaining

small market, both can come out favorably

  • Two people bargaining, each with a preference over
  • utcomes O
  • Let o∗ be the selected outcome
  • Example: “split the dollar”

Patrick MacAlpine

slide-61
SLIDE 61

Bargaining

small market, both can come out favorably

  • Two people bargaining, each with a preference over
  • utcomes O
  • Let o∗ be the selected outcome
  • Example: “split the dollar”

− One person makes offer o − Other rejects with probaility p(o) — based on offer − If rejects, both get nothing

Patrick MacAlpine

slide-62
SLIDE 62

Bargaining

small market, both can come out favorably

  • Two people bargaining, each with a preference over
  • utcomes O
  • Let o∗ be the selected outcome
  • Example: “split the dollar”

− One person makes offer o − Other rejects with probaility p(o) — based on offer − If rejects, both get nothing

  • Another version

− One person makes an offer − Other accepts, rejects, or counters − If counters, $.05 lost − Game ends with an accept or reject

Patrick MacAlpine

slide-63
SLIDE 63

Nash Bargaining Solution

Unique solution that satisfies:

Patrick MacAlpine

slide-64
SLIDE 64

Nash Bargaining Solution

Unique solution that satisfies: Invariance: only preference orders matter Anonymity: no discrimination Pareto efficiency: if one does better, other does worse Independence of irrelevant alternatives: removing outcomes doesn’t change things

Patrick MacAlpine

slide-65
SLIDE 65

Nash Bargaining Solution

Unique solution that satisfies: Invariance: only preference orders matter Anonymity: no discrimination Pareto efficiency: if one does better, other does worse Independence of irrelevant alternatives: removing outcomes doesn’t change things Maximize u1(o) ∗ u2(o)

Patrick MacAlpine

slide-66
SLIDE 66

General Equilibrium

Consumers: utilities, endowments Producers: production possibility sets Variables: prices on goods

Patrick MacAlpine

slide-67
SLIDE 67

General Equilibrium

Consumers: utilities, endowments Producers: production possibility sets Variables: prices on goods Equilibrium: allocation (prices) such that consumers maximize preferences, producers maximize profits

Patrick MacAlpine

slide-68
SLIDE 68

General Equilibrium

Consumers: utilities, endowments Producers: production possibility sets Variables: prices on goods Equilibrium: allocation (prices) such that consumers maximize preferences, producers maximize profits

  • Assumption: agent doesn’t affect prices

Patrick MacAlpine

slide-69
SLIDE 69

General Equilibrium

Consumers: utilities, endowments Producers: production possibility sets Variables: prices on goods Equilibrium: allocation (prices) such that consumers maximize preferences, producers maximize profits

  • Assumption: agent doesn’t affect prices

− Only true if market is infinitely large − Else, strategic bidding (like bargaining) possible

Patrick MacAlpine

slide-70
SLIDE 70

General Equilibrium

Consumers: utilities, endowments Producers: production possibility sets Variables: prices on goods Equilibrium: allocation (prices) such that consumers maximize preferences, producers maximize profits

  • Assumption: agent doesn’t affect prices

− Only true if market is infinitely large − Else, strategic bidding (like bargaining) possible

  • Assumption: no externalities

Patrick MacAlpine

slide-71
SLIDE 71

General Equilibrium

Consumers: utilities, endowments Producers: production possibility sets Variables: prices on goods Equilibrium: allocation (prices) such that consumers maximize preferences, producers maximize profits

  • Assumption: agent doesn’t affect prices

− Only true if market is infinitely large − Else, strategic bidding (like bargaining) possible

  • Assumption: no externalities

− Utilities or production sets don’t depend on others’

Patrick MacAlpine

slide-72
SLIDE 72

General Equilibrium

Consumers: utilities, endowments Producers: production possibility sets Variables: prices on goods Equilibrium: allocation (prices) such that consumers maximize preferences, producers maximize profits

  • Assumption: agent doesn’t affect prices

− Only true if market is infinitely large − Else, strategic bidding (like bargaining) possible

  • Assumption: no externalities

− Utilities or production sets don’t depend on others’ − Braess’ paradox

Patrick MacAlpine

slide-73
SLIDE 73

Other DRDM

  • Contract nets: task allocation among agents

Patrick MacAlpine

slide-74
SLIDE 74

Other DRDM

  • Contract nets: task allocation among agents

− Contingencies − Leveled commitment (price)

Patrick MacAlpine

slide-75
SLIDE 75

Other DRDM

  • Contract nets: task allocation among agents

− Contingencies − Leveled commitment (price)

  • Coalitions

Patrick MacAlpine

slide-76
SLIDE 76

Other DRDM

  • Contract nets: task allocation among agents

− Contingencies − Leveled commitment (price)

  • Coalitions

− Formation − Optimization within − Payoff division

Patrick MacAlpine

slide-77
SLIDE 77

Contract Nets

Task allocation among agents

Patrick MacAlpine

slide-78
SLIDE 78

Contract Nets

Task allocation among agents

  • OCSM-contracts: original, cluster, swap, multiagent

− Hill-climbing leads to optimum − Without any type, may be no sequence to optimum

Patrick MacAlpine

slide-79
SLIDE 79

Contract Nets

Task allocation among agents

  • OCSM-contracts: original, cluster, swap, multiagent

− Hill-climbing leads to optimum − Without any type, may be no sequence to optimum

  • Backing out of contracts

Patrick MacAlpine

slide-80
SLIDE 80

Contract Nets

Task allocation among agents

  • OCSM-contracts: original, cluster, swap, multiagent

− Hill-climbing leads to optimum − Without any type, may be no sequence to optimum

  • Backing out of contracts

− Contingency (future events)

Patrick MacAlpine

slide-81
SLIDE 81

Contract Nets

Task allocation among agents

  • OCSM-contracts: original, cluster, swap, multiagent

− Hill-climbing leads to optimum − Without any type, may be no sequence to optimum

  • Backing out of contracts

− Contingency (future events) − Leveled commitment (price)

Patrick MacAlpine

slide-82
SLIDE 82

Contract Nets

Task allocation among agents

  • OCSM-contracts: original, cluster, swap, multiagent

− Hill-climbing leads to optimum − Without any type, may be no sequence to optimum

  • Backing out of contracts

− Contingency (future events) − Leveled commitment (price) − What are some of the tradeoffs?

Patrick MacAlpine

slide-83
SLIDE 83

Contingency vs. leveled commitment

Contingency problems:

Patrick MacAlpine

slide-84
SLIDE 84

Contingency vs. leveled commitment

Contingency problems:

  • 1. Hard to track all contingencies

Patrick MacAlpine

slide-85
SLIDE 85

Contingency vs. leveled commitment

Contingency problems:

  • 1. Hard to track all contingencies
  • 2. Could be impossible to enumerate all possible

contingencies

Patrick MacAlpine

slide-86
SLIDE 86

Contingency vs. leveled commitment

Contingency problems:

  • 1. Hard to track all contingencies
  • 2. Could be impossible to enumerate all possible

contingencies

  • 3. What if only one agent observes that relevant event

happened?

Patrick MacAlpine

slide-87
SLIDE 87

Contingency vs. leveled commitment

Contingency problems:

  • 1. Hard to track all contingencies
  • 2. Could be impossible to enumerate all possible

contingencies

  • 3. What if only one agent observes that relevant event

happened? Leveled commitment problems:

Patrick MacAlpine

slide-88
SLIDE 88

Contingency vs. leveled commitment

Contingency problems:

  • 1. Hard to track all contingencies
  • 2. Could be impossible to enumerate all possible

contingencies

  • 3. What if only one agent observes that relevant event

happened? Leveled commitment problems:

  • 1. Breacher’s gain may be smaller than victim’s loss

Patrick MacAlpine

slide-89
SLIDE 89

Contingency vs. leveled commitment

Contingency problems:

  • 1. Hard to track all contingencies
  • 2. Could be impossible to enumerate all possible

contingencies

  • 3. What if only one agent observes that relevant event

happened? Leveled commitment problems:

  • 1. Breacher’s gain may be smaller than victim’s loss
  • 2. May decommit insincerely (wait for other) -

inefficent contracts executed.

Patrick MacAlpine

slide-90
SLIDE 90

Coalitions

  • Formation
  • Optimization within
  • Payoff division

Patrick MacAlpine

slide-91
SLIDE 91

DRDM Summary

For many agents: voting, general equilibrium, auctions For fewer agents: auctions, contract nets, bargaining Possible in all: coalitions

Patrick MacAlpine

slide-92
SLIDE 92

DRDM Summary

For many agents: voting, general equilibrium, auctions For fewer agents: auctions, contract nets, bargaining Possible in all: coalitions All self-interested, rational agents

Patrick MacAlpine