agent based systems
play

Agent-Based Systems Different auction types and properties - PowerPoint PPT Presentation

Agent-Based Systems Agent-Based Systems Where are we? Agent-Based Systems Different auction types and properties Combinatorial Auctions Michael Rovatsos Bidding Languages mrovatso@inf.ed.ac.uk The VCG mechanism Today . . .


  1. Agent-Based Systems Agent-Based Systems Where are we? Agent-Based Systems • Different auction types and properties • Combinatorial Auctions Michael Rovatsos • Bidding Languages mrovatso@inf.ed.ac.uk • The VCG mechanism Today . . . • Bargaining Lecture 12 – Bargaining 1 / 19 2 / 19 Agent-Based Systems Agent-Based Systems Bargaining Alternating Offers • Common one-to-one protocol • Reaching agreement in the presence of conflicting goals and preferences (a bit like a multi-step game with specific protocol) start – Negotiation takes place in a • Negotiation setting: sequence of rounds - The negotiation set is the space of possible proposals – Agent 1 begins at round 0 by making - The protocol defines the proposals the agents can make, as a agent 1 makes proposal a proposal x 0 function of prior negotiation history - Strategies determine the proposals the agents will make (private) – Agent 2 can either accept or reject agent 2 end accepts • Number of issues: the proposal - Single-issue, e.g. price of a good agent 1 agent 2 rejects – If the proposal is accepted the deal rejects - multiple-issues , e.g. buying a car: price, extras, service x 0 is implemented · Concessions may be hard to identify in multiple-issue negotiations agent 1 · Number of possible deals: m n for n attributes with m possible values accepts – Otherwise, negotiation moves to the • Number of agents: agent 2 makes proposal next round where agent 2 makes a - one-to-one , simplified when preferences are symmetric proposal - many-to-one , e.g. auctions - many-to-many , n ( n − 1 ) / 2 negotiation threads for n agents 3 / 19 4 / 19

  2. Agent-Based Systems Agent-Based Systems Scenario: Dividing the Pie Negotiation Rounds • The ultimatum game : a single negotiation round - Suppose that player 1 proposes to get all the pie, i.e. ( 1 , 0 ) - Player 2 will have to agree to avoid getting the conflict deal Θ • Scenario: Dividing the pie - Player 1 has all the power - There is some resource whose value is 1 • Two rounds of negotiation - The resource can be divided into two parts, such as 1 The values of each part must be between 0 and 1 - Agent 1 makes a proposal in the first round 2 The sum of the values of the parts sum to 1 - Player 2 can reject and turn the game into an ultimatum - A proposal is a pair ( x , 1 − x ) (agent 1 gets x , agent 2 gets 1 − x ) • If the number of rounds is fixed, whoever moves last gets all the pie - The negotiation set is: { ( x , 1 − x ) : 0 ≤ x ≤ 1 } • If there are no bounds on the number of rounds: • Some assumptions: - Suppose agent 1’s strategy is: propose ( 1 , 0 ) , reject any other offer - Disagreement is the worst outcome, we call this the conflict deal Θ - If agent 2 rejects the proposal, the agents will never reach - Agents seek to maximise utility agreement (the conflict deal is enacted) - Agent 2 will have to accept to avoid Θ - Infinite set of Nash equilibrium outcomes (of course agent 2 must understand the situation, e.g. given access to agent 1’s strategy) 5 / 19 6 / 19 Agent-Based Systems Agent-Based Systems Time Negotiation Decision Functions • Non-strategic approach, does not depend on how other’s behave • Additional assumption: Time is valuable (agents prefer outcome x • Agents use a time-dependent decision function to determine what at time t 1 over outcome x at time t 2 if t 2 > t 1 ) proposal they should make • Model agent i ’s patience using discount factor δ i (0 ≤ δ i ≤ 1) • Boulware strategy: exponentially decay offers to reserve price the value of slice x at time 0 is δ 0 i x = x • Conceder strategy: make concessions early, do not concede much the value of slice x at time 1 is δ 1 i x = δ i x as negotiation progresses the value of slice x at time 2 is δ 2 i x = ( δ i δ i ) x Price Price • More patient players (larger δ i ) have more power Conceder 1.0 1.0 • Games with two rounds of negotiation 0.8 Boulware 0.8 - The best possible outcome for agent 2 in the second round is δ 2 0.6 0.6 - If agent 1 initially proposes ( 1 − δ 2 , δ 2 ) , agent 2 can do no better than accept 0.4 0.4 Conceder • Games with no bounds on the number of rounds Boulware 0.2 0.2 - Agent 1 proposes what agent 2 can enforce in the second round 1 − δ 1 δ 2 , agent 2 gets δ 2 ( 1 − δ 1 ) 1 − δ 2 0.2 0.4 0.6 0.8 1.0 Time 0.2 0.4 0.6 0.8 1.0 Time - Agent 1 gets 1 − δ 1 δ 2 Seller Buyer 7 / 19 8 / 19

  3. Agent-Based Systems Agent-Based Systems Task-oriented domains (I) Task-Oriented Domains (II) • A task-oriented domain (TOD) is a triple � T , Ag , c � with - T a finite set of tasks, Ag a set of agents, and • Utility represents how much agent has to gain from the deal - c : 2 T → R + function describing cost of executing any set of tasks • If no agreement is reached, conflict deal is Θ = � T 1 , T 2 � (symmetric for all agents) • A deal δ 1 dominates another deal δ 2 (denoted δ 1 ≻ δ 2 ) iff • We assume that c ( ∅ ) = 0, and that c is monotonic i.e. 1 Deal δ 1 is at least as good as δ 2 for every agent: T 1 , T 2 ⊆ T ∧ T 1 ⊆ T 2 ⇒ c ( T 1 ) ≤ c ( T 2 ) ∀ i ∈ { 1 , 2 } , utility i ( δ 1 ) ≥ utility i ( δ 2 ) • An encounter in a TOD is a collection � T 1 , . . . , T n � such that each 2 Deal δ 1 is better for some agent than δ 2 : T i ⊆ T is executed by agent i ∈ Ag ∃ i ∈ { 1 , 2 } , utility i ( δ 1 ) > utility i ( δ 2 ) • Below, we only consider one-to-one negotiation scenarios where a • If δ 1 is not dominated by any other δ 2 , then δ is Pareto optimal deal is a pair δ = � D 1 , D 2 � such that D 1 ∪ D 2 = T 1 ∪ T 2 • A deal is individually rational if it weakly dominates (i.e. is at least • Agent i will execute D i in a deal with as good as) the conflict deal Θ - cost i ( δ ) = c ( D i ) , and - utility i ( δ ) = c ( T i ) − cost i ( δ ) 9 / 19 10 / 19 Agent-Based Systems Agent-Based Systems Task-Oriented Domains (III) The monotonic concession protocol this oval deals on this line from delimits the space B to C are Pareto optimal, • Start with simultaneous deals proposed by both agents and of all possible deals hence in the negotiation set proceed in rounds • Agreement reached if B - utility 1 ( δ 2 ) ≥ utility 1 ( δ 1 ) or - utility 2 ( δ 1 ) ≥ utility 2 ( δ 2 ) • If both proposals match or exceed other’s offer, outcome is chosen A E at random between δ 1 and δ 2 C • If no agreement, in round u + 1 agents are not allowed to make deals less preferred by other agent than proposal made in round u • If no proposals are made, negotiation terminates with outcome Θ D the conflict • Protocol verifiable and guaranteed to terminate, but not necessarily deal efficient Negotiation set contains individually rational and Pareto optimal deals 11 / 19 12 / 19

  4. Agent-Based Systems Agent-Based Systems The Zeuthen strategy The Zeuthen strategy (II) • Agent with smaller value of risk should concede on round t • The above protocol doesn’t describe when and how much to concede • Concession should be just good enough but of course this is • Intuitively, agents will be more willing to risk conflict if difference inefficient, smallest concession that changes balance of risk between current proposal and conflict deal is low • Problem if agents have equal risk: we have to flip a coin, otherwise • Model agent i ’s willingness to risk conflict at round t as one of them could defect (and conflict would occur) • Looking at our protocol criteria: i = utility lost by conceding and accepting j ’s offer risk t - Protocol terminates, doesn’t always succeed, simplicity? (too many utility lost by not conceding and causing conflict deals), Zeuthen strategy is Nash, no central authority needed, • Formally, we can calculate risk as a value between 0 and 1 individual rationality (in case of agreement), Pareto optimality • Zlotkin/Rosenschein also analysed a number of scenarios in which � if utility i ( δ t i ) = 0 1 agents lie about their tasks: risk t i = utility i ( δ t i ) − utility i ( δ t j ) otherwise - Phantom/decoy tasks: advantage for deceitful agent utility i ( δ t i ) - Hidden tasks: agents may benefit from hiding tasks (!) 13 / 19 14 / 19 Agent-Based Systems Agent-Based Systems Bargaining for Resource Allocation (I) Bargaining for Resource Allocation (II) • A resource allocation setting is a tuple � Ag , Z , v 1 , . . . , v n � , • A pay-off vector p = � p 1 , p 2 , . . . , p n � is a tuple of side payments - Agents Ag = { 1 , . . . , n } such that � n - Resources Z = { z 1 , . . . , z m } i = 1 p i = 0 - Valuation functions v i : 2 Z → R p � , where Z , Z ′ ∈ alloc ( Z , Ag ) are distinct • A deal is a triple � Z , Z ′ , ¯ • An allocation Z 1 , . . . , Z n is a partition of resources over the agents allocations and ¯ p is a payoff vector • Negotiating a change from P i to Q i ( P i , Q i ∈ Z and P i � = Q i ) will • � Z , Z ′ , ¯ p � is individually rational if v i ( Z ′ i ) − p i > v i ( Z ) for each lead to i ∈ Ag , p i is allowed to be 0 if Z i = Z ′ i - v i ( P i ) < v i ( P i ) , • Pareto optimal: every other allocation that makes some agents - v i ( P i ) = v i ( P i ) or strictly better off makers some other agent strictly worse off - v i ( P i ) > v i ( P i ) • Agents can make side payments as compensations 15 / 19 16 / 19

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend