reserve pricing in repeated second price auctions with
play

Reserve Pricing in Repeated Second-Price Auctions with Strategic - PowerPoint PPT Presentation

Reserve Pricing in Repeated Second-Price Auctions with Strategic Bidders Alexey Drutsa Setup Second-Price (SP) Auction with Reserve Prices Setting A good (e.g., an ad space) is offered for sale by a seller to buyers Each buyer


  1. Reserve Pricing in Repeated Second-Price Auctions with Strategic Bidders Alexey Drutsa

  2. Setup

  3. Second-Price (SP) Auction with Reserve Prices β–Œ Setting β€Ί A good (e.g., an ad space) is offered for sale by a seller to 𝑁 buyers β€Ί Each buyer 𝑛 holds a private valuation 𝑀 $ ∈ [0,1] for this good ( 𝑀 $ is unknown to the seller) β–Œ Actions β€Ί The seller selects a reserve price π‘ž $ for each buyer 𝑛 β€Ί Each buyer 𝑛 submits a bid 𝑐 $ β–Œ Allocation and payments β€Ί Determine actual buyer-participants: 𝕅 = {𝑛 ∣ 𝑐 $ β‰₯ π‘ž $ } β€Ί The good is received by the buyer 𝑛 4 = argmax $βˆˆπ•… 𝑐 $ (that has the highest bid) β€Ί This buyer pays π‘ž $ 4 = max {π‘ž $ 4 , max $βˆˆπ•…βˆ–{$ 4} 𝑐 $ }

  4. Repeated Second-Price Auctions with Reserve Equal goods (e.g., ad spaces) are repeatedly offered for sale β€Ί by a seller (e.g., RTB platform) to 𝑁 buyers (e.g., advertisers) β€Ί over π‘ˆ rounds (one good per round). Each buyer 𝑛 β€Ί holds a private fixed valuation 𝑀 $ ∈ [0,1] for each of those goods, β€Ί 𝑀 $ is unknown to the seller. At each round 𝑒 = 1, … , π‘ˆ , the seller conducts SP auction with reserves: β€Ί the seller selects a reserve price π‘ž > $ for each buyer 𝑛 β€Ί and a bid 𝑐 > $ is submitted by each buyer 𝑛 .

  5. Seller’s pricing algorithm β€Ί The seller applies a pricing algorithm 𝐡 that sets reserve prices {π‘ž > B,C $ } >@A,$@A B,C $ } >@A,$@A in response to bids 𝐜 = {𝑐 > of buyers 𝑛 = 1, … , 𝑁 β€Ί A price π‘ž > $ can depend only on past bids {𝑐 E >GA,C F } E@A,F@A and the horizon π‘ˆ .

  6. Strategic buyers β–Œ The seller announces her pricing algorithm 𝐡 in advance In each round 𝑒 , each buyer 𝑛 β€Ί observes a history of previous rounds (available to this buyer) and β€Ί chooses his bid 𝑐 > $ s.t. it maximizes his future 𝛿 $ -discounted surplus: B 4 O (𝑀 $ βˆ’ π‘ž E $ ) Sur > 𝐡, 𝑀 $ , 𝛿 $ , {𝑐 E $ } : = 𝔽 M EGA 𝕁 $@$ 𝛿 $ , 𝛿 $ ∈ 0,1 , E@> where 𝕁 $@$ 4 O is the indicator of the event when buyer 𝑛 is the winner in round 𝑑 $ is the payment of the buyer 𝑛 in this case π‘ž E

  7. Seller’s goal The seller’s strategic regret: $ 𝑀 $ βˆ’ 𝕁 𝕅 W Xβˆ… π‘ž > 4 W ) $ SReg π‘ˆ, 𝐡, 𝑀 $ $ , 𝛿 $ $ : = βˆ‘ B (max >@A She seeks for a no-regret pricing for worst-case valuation: sup \ ] ,…,\ ^ ∈ _,A SReg π‘ˆ, 𝐡, 𝑀 $ $ , 𝛿 $ $ = 𝑝 π‘ˆ Optimality : the lowest possible upper bound for the regret of the form 𝑃 𝑔(π‘ˆ) .

  8. Background, Research question & Main contribution

  9. Background: 1-buyer case (posted-price auctions) If one buyer ( 𝑁 = 1 ), a SP auction reduces to a posted-price auction: β€Ί the buyer either accepts or rejects a currently offered price π‘ž > A β€Ί the seller either gets payment equal to π‘ž > A or nothing [Kleinberg et al., FOCS’2003] Optimal algorithm against myopic buyer with truthful regret Θ(log log π‘ˆ) . [Amin et al., NIPS’2013] The strategic setting is introduced. βˆ„ no-regret pricing for non-discount case 𝛿 = 1 . [Drutsa, WWW’2017] Optimal algorithm against strategic buyer with regret Θ(log log π‘ˆ) for 𝛿 < 1 .

  10. Research question The known optimal algorithms (PRRFES & prePRRFES) from posted-price auctions cannot be directly applied to set reserve prices in second-price auctions β€Ί buyers in SP auctions have incomplete information due to presence of rivals β€Ί the proofs of optimality of [pre]PRRFES strongly rely on complete information β–Œ In this study, I try to find an optimal algorithm for the multi-buyer setup

  11. Main contribution A novel algorithm for our strategic buyers with regret upper bound of Θ(log log π‘ˆ) for 𝛿 < 1 A novel transformation that maps any pricing algorithm designed for posted-price auctions to a multi-buyer setup

  12. Main ideas

  13. Two learning processes $ 𝑀 $ βˆ’ 𝕁 𝕅 W Xβˆ… π‘ž > 4 W ) $ B SReg π‘ˆ, 𝐡, 𝑀 $ $ , 𝛿 $ $ : = βˆ‘ (max >@A Find which buyer has Find the buyers’ valuations the maximal valuation Learning process #1 Learning process #2

  14. Learning proc.#1: an idea to localize a valuation PRRFES is an optimal learner of a valuation in posted-price auctions. However, its core localization technique relies on: β–Œ The buyer completely knows the outcomes of current and all future rounds β–Œ given their bids (due to absence of rivals) Can we use PRRFES in the second-price scenario where each buyer does not know perfectly the outcomes of rounds?

  15. Barrage pricing β€Ί Reserve prices are personal (individual) in our setup β€Ί Thus, we are able to β€œeliminate” particular buyers from particular rounds β€Ί Namely, a buyer 𝑛 will not bid above 1/(1 βˆ’ 𝛿 $ ) β€Ί We call this price as β€œbarrage” one and denote it by ∞ Let β€œeliminate” all buyers except some buyer 𝑛 in a round 𝑒 Then the buyer 𝑛 will have com round 𝑒 complete i ete inf nfor ormati tion on abo about outcome of this s ro

  16. Learning proc.#2: an idea to find max valuation The search algorithm works by maintaining a feasible interval [𝑣 $ , π‘₯ $ ] that β€Ί is aimed to localize the valuation 𝑀 $ , i.e. 𝑀 $ ∈ [𝑣 $ , π‘₯ $ ] β€Ί shrinks as 𝑒 β†’ ∞ [𝑣 o , π‘₯ o ] [𝑣 A , π‘₯ A ] [𝑣 p , π‘₯ p ] 𝑀 A 𝑀 p 𝑀 o 0 1 round 𝑒 A round 𝑒 p round 𝑒 o β–Œ If, in a round 𝑒 , it becomes that π‘₯ $ < 𝑣 m for some buyers 𝑛 and π‘œ , β–Œ then buyer 𝑛 has non-maximal valuation which should not be searched anymore

  17. Dividing algorithms

  18. Key instrument that implements the ideas transformation di div

  19. Transformation di div : cyclic elimination Let 𝐡 be an algorithm designed for repeated posted-price auctions β–Œ Its transformation 𝐞𝐣𝐰 𝐡 is an algorithm for repeated SP auctions as follows Buyers: Reserve prices are set by: Reserve Prices (only one non-barrage in a round): A A A π‘ž A ∞ ∞ Algorithm 𝐡 π‘ž t ∞ ∞ π‘ž u ∞ . . . p p p ∞ ∞ Algorithm 𝐡 π‘ž p ∞ ∞ ∞ . . . π‘ž v π‘ž w o o ∞ ∞ Algorithm 𝐡 π‘ž o ∞ ∞ ∞ ∞ . . . π‘ž x Rounds, 𝑒 = 3 1 2 4 5 6 7 8 1 2 3 Periods, 𝑗 =

  20. Transformation di div : stopping rule We stop considering a buyer 𝑛 in periods when π‘₯ $ < 𝑣 m for some buyer π‘œ. β–Œ The number of periods with buyer 𝑛 is referred to as subhorizon, 𝐽 $ . We stopped learning of 𝑀 A and 𝐽 A = 𝑙 , when π‘₯ A < 𝑣 p Buyers: Reserve prices are set by: Reserve Prices : A π‘ž E ∞ ∞ ∞ Algorithm 𝐡 ∞ ∞ ∞ ∞ . . . p p p p ∞ ∞ Algorithm 𝐡 π‘ž E|A ∞ ∞ π‘ž E|u . . . π‘ž E|o π‘ž E|v o o o ∞ ∞ Algorithm 𝐡 π‘ž E|p ∞ ∞ ∞ . . . π‘ž E|t π‘ž E|x 𝑑 + 2 𝑑 𝑑 + 1 𝑑 + 3 𝑑 + 4 𝑑 + 5 𝑑 + 6 𝑑 + 7 Rounds, 𝑒 = 𝑙 + 3 𝑙 𝑙 + 1 𝑙 + 2 Periods, 𝑗 =

  21. οΏ½ οΏ½ Transformation div: regret decomposition Lemma 1. For the described transformation, strategic regret has decomposition: SReg π‘ˆ, 𝐞𝐣𝐰 𝐡 , 𝑀 $ $ , 𝛿 $ $ = 𝑀 m βˆ’ 𝑀 $ ) = M Reg $ (π‘ˆ, 𝐡, 𝑀 $ , 𝛿 $ ) 𝐽 $ (max + M m $ $ Individual regrets Deviation regret Measure how the algorithm 𝐡 learns Measures how fast we stop learning of non-maximal valuations the valuation of each buyer

  22. Key challenge against strategic buyer Strategic buyer may lie and mislead algorithms, thus a good algorithm must Extract correct information about a buyer’s valuation from his actions (bids) β–Œ Dividing structure in a round allows to construct a tool to locate valuations: β–Œ it is enough to make complete information situation in a round

  23. Upper bound on valuation of strategic buyer Let buyer 𝑛 is the non-”eliminated ” one in a round 𝑒 . β–Œ If the buyer accepts (bids above) the current reserve price π‘ž > $ B 4 W (𝑀 $ βˆ’ π‘ž > 4 O (𝑀 $ βˆ’ π‘ž E >GA 𝕁 $@$ $ ) + 𝔽 M EGA 𝕁 $@$ $ ) Surplus > = 𝔽 𝛿 $ 𝛿 $ E@>|A = ≀ Ε½ (𝑀 $ βˆ’ π‘ž > >GA (𝑀 $ βˆ’ π‘ž > >GA 𝕁 β€’ W $ ) = 𝛿 $ $ ) 0 𝛿 $ Ε½ β€˜β€™ W β–Œ If the buyer rejects (bids below) the current reserve price π‘ž > $ >|†GA B ≀ 𝛿 $ 4 O (𝑀 $ βˆ’ π‘ž E (𝑀 $ βˆ’ [lowest_price]) $ ) EGA 𝕁 $@$ Surplus > = 𝔽 M 𝛿 $ 1 βˆ’ 𝛿 $ E@>|† If we observe that a buyer rejects non-”barrage” reserve price, then: β€’ 𝑀 $ βˆ’ π‘ž > $ < $ βˆ’ [lowest_price]) β€’ Ε½ β€’ (π‘ž > AGβ€’ Ε½ Gβ€’ Ε½

  24. Optimal algorithm

  25. Pricing algorithm divPRRFES Apply the transformation div div to PRRFES algorithm

  26. divPRRFES: individual and deviation regrets β–Œ Individual regrets Our tool to locate valuations provides the upper bound (as in 1-buyer case): Reg $ π‘ˆ, 𝐡, 𝑀 $ , 𝛿 $ = 𝑃 log p log p π‘ˆ βˆ€π‘› β–Œ Deviation regrets β€Ί For each buyer 𝑛 with non-maximal valuation (i.e., 𝑀 $ < max 𝑀 m ) m β€Ί We can upper bound its subhorizon 𝐽 $ : 𝐷 𝐽 $ ≀ 𝑀 m βˆ’ 𝑀 $ max m

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend