Reserve Pricing in Repeated Second-Price Auctions with Strategic - - PowerPoint PPT Presentation

β–Ά
reserve pricing in repeated second price auctions with
SMART_READER_LITE
LIVE PREVIEW

Reserve Pricing in Repeated Second-Price Auctions with Strategic - - PowerPoint PPT Presentation

Reserve Pricing in Repeated Second-Price Auctions with Strategic Bidders Alexey Drutsa Setup Second-Price (SP) Auction with Reserve Prices Setting A good (e.g., an ad space) is offered for sale by a seller to buyers Each buyer


slide-1
SLIDE 1
slide-2
SLIDE 2

Reserve Pricing in Repeated Second-Price Auctions with Strategic Bidders

Alexey Drutsa

slide-3
SLIDE 3

Setup

slide-4
SLIDE 4

Second-Price (SP) Auction with Reserve Prices

β–Œ Setting

β€Ί A good (e.g., an ad space) is offered for sale by a seller to 𝑁 buyers β€Ί Each buyer 𝑛 holds a private valuation 𝑀$ ∈ [0,1] for this good

(𝑀$ is unknown to the seller)

β–Œ Actions

β€Ί The seller selects a reserve price π‘ž$ for each buyer 𝑛 β€Ί Each buyer 𝑛 submits a bid 𝑐$

β–Œ Allocation and payments

β€Ί Determine actual buyer-participants: 𝕅 = {𝑛 ∣ 𝑐$ β‰₯ π‘ž$} β€Ί The good is received by the buyer 𝑛

4 = argmax$βˆˆπ•…π‘$ (that has the highest bid)

β€Ί This buyer pays π‘ž$

4 = max {π‘ž$ 4, max$βˆˆπ•…βˆ–{$ 4}𝑐$}
slide-5
SLIDE 5

Repeated Second-Price Auctions with Reserve

Equal goods (e.g., ad spaces) are repeatedly offered for sale

β€Ί by a seller (e.g., RTB platform) to 𝑁 buyers (e.g., advertisers) β€Ί over π‘ˆ rounds (one good per round).

Each buyer 𝑛

β€Ί holds a private fixed valuation 𝑀$ ∈ [0,1] for each of those goods, β€Ί 𝑀$ is unknown to the seller.

At each round 𝑒 = 1, … , π‘ˆ, the seller conducts SP auction with reserves:

β€Ί the seller selects a reserve price π‘ž>

$ for each buyer 𝑛

β€Ί and a bid 𝑐>

$ is submitted by each buyer 𝑛.
slide-6
SLIDE 6

Seller’s pricing algorithm

β€Ί The seller applies a pricing algorithm 𝐡 that sets reserve prices {π‘ž>

$}>@A,$@A B,C

in response to bids 𝐜 = {𝑐>

$}>@A,$@A B,C
  • f buyers 𝑛 = 1, … , 𝑁

β€Ί A price π‘ž>

$ can depend only on past bids {𝑐E F}E@A,F@A >GA,C

and the horizon π‘ˆ.

slide-7
SLIDE 7

Strategic buyers

β–Œ The seller announces her pricing algorithm 𝐡 in advance

In each round 𝑒, each buyer 𝑛

β€Ί observes a history of previous rounds (available to this buyer) and β€Ί chooses his bid 𝑐>

$ s.t. it maximizes his future 𝛿$-discounted surplus:

Sur> 𝐡, 𝑀$, 𝛿$, {𝑐E

$} : = 𝔽 M

𝛿$

EGA𝕁 $@$ 4 O (𝑀$ βˆ’ π‘žE $) B E@>

, 𝛿$ ∈ 0,1 , where 𝕁 $@$

4 O is the indicator of the event when buyer 𝑛 is the winner in round 𝑑

π‘žE

$ is the payment of the buyer 𝑛 in this case
slide-8
SLIDE 8

Seller’s goal

The seller’s strategic regret: SReg π‘ˆ, 𝐡, 𝑀$ $, 𝛿$ $ : = βˆ‘ (max

$ 𝑀$ βˆ’ 𝕁 𝕅WXβˆ… π‘ž> $ 4 W) B >@A

She seeks for a no-regret pricing for worst-case valuation: sup\],…,\^∈ _,A SReg π‘ˆ, 𝐡, 𝑀$ $, 𝛿$ $ = 𝑝 π‘ˆ Optimality: the lowest possible upper bound for the regret of the form 𝑃 𝑔(π‘ˆ) .

slide-9
SLIDE 9

Background, Research question & Main contribution

slide-10
SLIDE 10

Background: 1-buyer case (posted-price auctions)

[Kleinberg et al., FOCS’2003] Optimal algorithm against myopic buyer with truthful regret Θ(log log π‘ˆ). [Drutsa, WWW’2017] Optimal algorithm against strategic buyer with regret Θ(log log π‘ˆ) for 𝛿 < 1. [Amin et al., NIPS’2013] The strategic setting is introduced. βˆ„ no-regret pricing for non-discount case 𝛿 = 1. If one buyer (𝑁 = 1), a SP auction reduces to a posted-price auction:

β€Ί the buyer either accepts or rejects a currently offered price π‘ž>

A

β€Ί the seller either gets payment equal to π‘ž>

A or nothing
slide-11
SLIDE 11

Research question

The known optimal algorithms (PRRFES & prePRRFES) from posted-price auctions cannot be directly applied to set reserve prices in second-price auctions

β€Ί buyers in SP auctions have incomplete information due to presence of rivals β€Ί the proofs of optimality of [pre]PRRFES strongly rely on complete information

β–Œ In this study, I try to find an optimal algorithm for the multi-buyer setup

slide-12
SLIDE 12

A novel algorithm for our strategic buyers with regret upper bound of Θ(log log π‘ˆ) for 𝛿 < 1 Main contribution

A novel transformation that maps any pricing algorithm designed for posted-price auctions to a multi-buyer setup

slide-13
SLIDE 13

Main ideas

slide-14
SLIDE 14

Two learning processes

SReg π‘ˆ, 𝐡, 𝑀$ $, 𝛿$ $ : = βˆ‘ (max

$ 𝑀$ βˆ’ 𝕁 𝕅WXβˆ… π‘ž> $ 4 W) B >@A

Find the buyers’ valuations Learning process #1 Find which buyer has the maximal valuation Learning process #2

slide-15
SLIDE 15

Learning proc.#1: an idea to localize a valuation

PRRFES is an optimal learner of a valuation in posted-price auctions. However, its core localization technique relies on:

β–Œ The buyer completely knows the outcomes of current and all future rounds β–Œ given their bids (due to absence of rivals)

Can we use PRRFES in the second-price scenario where each buyer does not know perfectly the outcomes of rounds?

slide-16
SLIDE 16

Barrage pricing

β€Ί Reserve prices are personal (individual) in our setup β€Ί Thus, we are able to β€œeliminate” particular buyers from particular rounds β€Ί Namely, a buyer 𝑛 will not bid above 1/(1 βˆ’ 𝛿$) β€Ί We call this price as β€œbarrage” one and denote it by ∞

Let β€œeliminate” all buyers except some buyer 𝑛 in a round 𝑒 Then the buyer 𝑛 will have com

complete i ete inf nfor

  • rmati

tion

  • n abo

about outcome of this s ro round 𝑒

slide-17
SLIDE 17

Learning proc.#2: an idea to find max valuation

The search algorithm works by maintaining a feasible interval [𝑣$, π‘₯$] that

β€Ί is aimed to localize the valuation 𝑀$, i.e. 𝑀$ ∈ [𝑣$, π‘₯$] β€Ί shrinks as 𝑒 β†’ ∞

β–Œ If, in a round 𝑒, it becomes that π‘₯$ < 𝑣m for some buyers 𝑛 and π‘œ, β–Œ then buyer 𝑛 has non-maximal valuation which should not be searched anymore

𝑀o 𝑀p 𝑀A 1

round 𝑒A round 𝑒p round 𝑒o

[𝑣A, π‘₯A] [𝑣p, π‘₯p] [𝑣o, π‘₯o]

slide-18
SLIDE 18

Dividing algorithms

slide-19
SLIDE 19

Key instrument that implements the ideas

transformation

di div

slide-20
SLIDE 20

Transformation di

div: cyclic elimination

Let 𝐡 be an algorithm designed for repeated posted-price auctions

β–Œ Its transformation 𝐞𝐣𝐰 𝐡 is an algorithm for repeated SP auctions as follows

π‘žA

A

Algorithm 𝐡 ∞ ∞ π‘žt

A

∞ ∞ π‘žu

A

∞

Buyers: Reserve Prices (only one non-barrage in a round): Reserve prices are set by:

. . . ∞ π‘žp

p

∞ ∞ π‘žv

p

∞ ∞ π‘žw

p

. . . ∞ ∞ π‘žo

  • ∞

∞ π‘žx

  • ∞

∞ . . . Algorithm 𝐡 Algorithm 𝐡

Rounds, 𝑒 = Periods, 𝑗 = 1 2 3 4 5 6 7 8

1 2 3

slide-21
SLIDE 21

Transformation di

div: stopping rule

We stop considering a buyer 𝑛 in periods when π‘₯$ < 𝑣m for some buyer π‘œ.

β–Œ The number of periods with buyer 𝑛 is referred to as subhorizon, 𝐽$.

π‘žE

A

Algorithm 𝐡 ∞ ∞ ∞ ∞ ∞ ∞ ∞

Buyers: Reserve Prices: Reserve prices are set by:

. . . ∞ π‘žE|A

p

∞ ∞ π‘žE|o

p

∞ π‘žE|v

p

. . . ∞ ∞ π‘žE|p

  • ∞

∞ π‘žE|t

  • ∞

. . . Algorithm 𝐡 Algorithm 𝐡

Rounds, 𝑒 = Periods, 𝑗 = 𝑑 𝑑 + 1 𝑑 + 2 𝑑 + 3 𝑑 + 4 𝑑 + 5 𝑑 + 6 𝑑 + 7

𝑙 𝑙 + 1 𝑙 + 2 π‘žE|u

p

π‘žE|x

  • 𝑙 + 3
We stopped learning of 𝑀A and 𝐽A = 𝑙, when π‘₯A < 𝑣p
slide-22
SLIDE 22

Transformation div: regret decomposition

Lemma 1. For the described transformation, strategic regret has decomposition: SReg π‘ˆ, 𝐞𝐣𝐰 𝐡 , 𝑀$ $, 𝛿$ $ = = M Reg$(π‘ˆ, 𝐡, 𝑀$, 𝛿$)

  • $

+ M 𝐽$(max

m

𝑀m βˆ’ 𝑀$)

  • $

Deviation regret Measures how fast we stop learning

  • f non-maximal valuations

Individual regrets Measure how the algorithm 𝐡 learns the valuation of each buyer

slide-23
SLIDE 23

Key challenge against strategic buyer

Strategic buyer may lie and mislead algorithms, thus a good algorithm must Extract correct information about a buyer’s valuation from his actions (bids)

β–Œ Dividing structure in a round allows to construct a tool to locate valuations: β–Œ it is enough to make complete information situation in a round

slide-24
SLIDE 24

Upper bound on valuation of strategic buyer

Let buyer 𝑛 is the non-”eliminated ” one in a round 𝑒.

β–Œ If the buyer accepts (bids above) the current reserve price π‘ž>

$

Surplus> = 𝔽 𝛿$

>GA𝕁 $@$ 4 W (𝑀$ βˆ’ π‘ž> $) + 𝔽 M

𝛿$

EGA𝕁 $@$ 4 O (𝑀$ βˆ’ π‘žE $) B E@>|A

β–Œ If the buyer rejects (bids below) the current reserve price π‘ž>

$

Surplus> = 𝔽 M 𝛿$

EGA𝕁 $@$ 4 O (𝑀$ βˆ’ π‘žE $) B E@>|†

≀ 𝛿$

>|†GA

1 βˆ’ 𝛿$ (𝑀$ βˆ’ [lowest_price]) If we observe that a buyer rejects non-”barrage” reserve price, then: 𝑀$ βˆ’ π‘ž>

$ <
  • Ε½
  • AGβ€’Ε½Gβ€’Ε½
  • (π‘ž>
$ βˆ’ [lowest_price])

𝛿$

>GA𝕁 β€’W Ε½β€˜β€™W Ε½ (𝑀$ βˆ’ π‘ž> $)=𝛿$ >GA (𝑀$ βˆ’ π‘ž> $)

= ≀

slide-25
SLIDE 25

Optimal algorithm

slide-26
SLIDE 26

Pricing algorithm divPRRFES

Apply the transformation div

div

to PRRFES algorithm

slide-27
SLIDE 27

divPRRFES: individual and deviation regrets

β–Œ Individual regrets

Our tool to locate valuations provides the upper bound (as in 1-buyer case): Reg$ π‘ˆ, 𝐡, 𝑀$, 𝛿$ = 𝑃 logp logp π‘ˆ βˆ€π‘›

β–Œ Deviation regrets

β€Ί For each buyer 𝑛 with non-maximal valuation (i.e., 𝑀$ < max

m

𝑀m)

β€Ί We can upper bound its subhorizon 𝐽$:

𝐽$ ≀ 𝐷 max

m

𝑀m βˆ’ 𝑀$

slide-28
SLIDE 28

divPRRFES is optimal

Theorem. Let 𝛿_ ∈ (0,1) Then for the pricing algorithm divPRRFES 𝐡 with:

β€Ί the number of penalization rounds 𝑠 β‰₯ log‒–

AG‒– p

and

β€Ί the exploitation rate 𝑕 π‘š = 2pβ„’, π‘š ∈ β„€|,

for any valuations 𝑀A, … , 𝑀C ∈ 0,1 , any discounts 𝛿A, … , 𝛿C ∈ 0, 𝛿_ , and π‘ˆ β‰₯ 2, the strategic regret is upper bounded: SReg π‘ˆ, 𝐡, 𝑀$ $, 𝛿$ $ ≀ 𝐷 logp logp π‘ˆ + 2 + 𝐢, 𝐷 ≔ 𝑁 𝑠 max

$ 𝑀$ + 4 ,

𝐢 ≔ (24 + 5𝑠)(𝑁 βˆ’ 1).

slide-29
SLIDE 29

Summary

slide-30
SLIDE 30

A novel algorithm for setting reserve prices in second-price auctions with strategic buyers. Its worst-case regret is optimal: Θ(log log π‘ˆ) for 𝛿 < 1 Main contribution: reminding

A novel transformation that maps any pricing algorithm designed for posted-price auction to a multi-buyer setups

slide-31
SLIDE 31

adrutsa@yandex.ru

Thank you!

Alexey Drutsa Yandex