Machine Learning & Mechanism Design: Dynamic and Discriminatory - - PowerPoint PPT Presentation

machine learning mechanism design
SMART_READER_LITE
LIVE PREVIEW

Machine Learning & Mechanism Design: Dynamic and Discriminatory - - PowerPoint PPT Presentation

Machine Learning & Mechanism Design: Dynamic and Discriminatory Pricing in Auctions Jason D. Hartline Microsoft Research Silicon Valley (Joint with with Maria-Florina Balcan, Avrim Blum, and Yishay Mansour) August 5, 2005 The Problem


slide-1
SLIDE 1

Machine Learning & Mechanism Design:

Dynamic and Discriminatory Pricing in Auctions Jason D. Hartline

Microsoft Research – Silicon Valley

(Joint with with Maria-Florina Balcan, Avrim Blum, and Yishay Mansour)

August 5, 2005

slide-2
SLIDE 2

The Problem

Sellers can extract more of surplus with discriminatory pricing.

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

1

slide-3
SLIDE 3

The Problem

Sellers can extract more of surplus with discriminatory pricing. Two approaches:

  • 1. Distinguish between products.

(E.g., software versioning, airline tickets, etc.)

  • 2. Price discriminate with observable customer features.

(E.g., college tuition, DVDs, car insurance, shipping)

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

1

slide-4
SLIDE 4

The Problem

Sellers can extract more of surplus with discriminatory pricing. Two approaches:

  • 1. Distinguish between products.

(E.g., software versioning, airline tickets, etc.)

  • 2. Price discriminate with observable customer features.

(E.g., college tuition, DVDs, car insurance, shipping) Goal: design mechanism to optimally price discriminate.

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

1

slide-5
SLIDE 5

Optimal Mechanism Design

Typical Economic approach to optimal mechanism design:

  • Assume valuations are from known distribution.
  • Design optimal auction for distribution.

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

2

slide-6
SLIDE 6

Optimal Mechanism Design

Typical Economic approach to optimal mechanism design:

  • Assume valuations are from known distribution.
  • Design optimal auction for distribution.

Notes on optimal mechanism design problem:

  • Solved by Myerson (for single-parameter case).
  • non-identical distributions =

⇒ discriminatory pricing.

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

2

slide-7
SLIDE 7

Optimal Mechanism Design

Typical Economic approach to optimal mechanism design:

  • Assume valuations are from known distribution.
  • Design optimal auction for distribution.

Notes on optimal mechanism design problem:

  • Solved by Myerson (for single-parameter case).
  • non-identical distributions =

⇒ discriminatory pricing.

  • Assumed known distribution ignores:

– incentives (of acquiring distribution) – performance (from inaccurate distribution)

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

2

slide-8
SLIDE 8

Optimal Mechanism Design

Typical Economic approach to optimal mechanism design:

  • Assume valuations are from known distribution.
  • Design optimal auction for distribution.

Notes on optimal mechanism design problem:

  • Solved by Myerson (for single-parameter case).
  • non-identical distributions =

⇒ discriminatory pricing.

  • Assumed known distribution ignores:

– incentives (of acquiring distribution) – performance (from inaccurate distribution) Goal: understand how quality and incentives of learning distribution affect profit.

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

2

slide-9
SLIDE 9

Setting

  • 1. Unlimited supply of stuff to sell.
  • 2. bidders with private valuations for stuff.
  • 3. make each bidder an offer.
  • 4. revenue is incentive compatible function of offer and valuation.

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

3

slide-10
SLIDE 10

Setting

  • 1. Unlimited supply of stuff to sell.

(Example 1: MS Office Professional (PV) & Student Version (SV))

  • 2. bidders with private valuations for stuff.
  • 3. make each bidder an offer.
  • 4. revenue is incentive compatible function of offer and valuation.

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

3

slide-11
SLIDE 11

Setting

  • 1. Unlimited supply of stuff to sell.

(Example 1: MS Office Professional (PV) & Student Version (SV))

  • 2. bidders with private valuations for stuff.

(Example 1: Bidder: “PV worth $400, SV worth $300”)

  • 3. make each bidder an offer.
  • 4. revenue is incentive compatible function of offer and valuation.

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

3

slide-12
SLIDE 12

Setting

  • 1. Unlimited supply of stuff to sell.

(Example 1: MS Office Professional (PV) & Student Version (SV))

  • 2. bidders with private valuations for stuff.

(Example 1: Bidder: “PV worth $400, SV worth $300”)

  • 3. make each bidder an offer.

(Example 1: Seller: “PV costs $369.88, SV costs $124.99”)

  • 4. revenue is incentive compatible function of offer and valuation.

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

3

slide-13
SLIDE 13

Setting

  • 1. Unlimited supply of stuff to sell.

(Example 1: MS Office Professional (PV) & Student Version (SV))

  • 2. bidders with private valuations for stuff.

(Example 1: Bidder: “PV worth $400, SV worth $300”)

  • 3. make each bidder an offer.

(Example 1: Seller: “PV costs $369.88, SV costs $124.99”)

  • 4. revenue is incentive compatible function of offer and valuation.

(Example 1: Sold: SV for $124.99!)

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

3

slide-14
SLIDE 14

Setting

  • 1. Unlimited supply of stuff to sell.

(Example 1: MS Office Professional (PV) & Student Version (SV)) (Example 2: Tuition for in state (IS) and out of state (OS) students)

  • 2. bidders with private valuations for stuff.

(Example 1: Bidder: “PV worth $400, SV worth $300”)

  • 3. make each bidder an offer.

(Example 1: Seller: “PV costs $369.88, SV costs $124.99”)

  • 4. revenue is incentive compatible function of offer and valuation.

(Example 1: Sold: SV for $124.99!)

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

3

slide-15
SLIDE 15

Setting

  • 1. Unlimited supply of stuff to sell.

(Example 1: MS Office Professional (PV) & Student Version (SV)) (Example 2: Tuition for in state (IS) and out of state (OS) students)

  • 2. bidders with private valuations for stuff.

(Example 1: Bidder: “PV worth $400, SV worth $300”) (Example 2: Bidder (OS): “Tuition worth $15,000”)

  • 3. make each bidder an offer.

(Example 1: Seller: “PV costs $369.88, SV costs $124.99”)

  • 4. revenue is incentive compatible function of offer and valuation.

(Example 1: Sold: SV for $124.99!)

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

3

slide-16
SLIDE 16

Setting

  • 1. Unlimited supply of stuff to sell.

(Example 1: MS Office Professional (PV) & Student Version (SV)) (Example 2: Tuition for in state (IS) and out of state (OS) students)

  • 2. bidders with private valuations for stuff.

(Example 1: Bidder: “PV worth $400, SV worth $300”) (Example 2: Bidder (OS): “Tuition worth $15,000”)

  • 3. make each bidder an offer.

(Example 1: Seller: “PV costs $369.88, SV costs $124.99”) (Example 2: Seller: “IS costs $9,256.80, OS costs $16,855.30,”)

  • 4. revenue is incentive compatible function of offer and valuation.

(Example 1: Sold: SV for $124.99!)

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

3

slide-17
SLIDE 17

Setting

  • 1. Unlimited supply of stuff to sell.

(Example 1: MS Office Professional (PV) & Student Version (SV)) (Example 2: Tuition for in state (IS) and out of state (OS) students)

  • 2. bidders with private valuations for stuff.

(Example 1: Bidder: “PV worth $400, SV worth $300”) (Example 2: Bidder (OS): “Tuition worth $15,000”)

  • 3. make each bidder an offer.

(Example 1: Seller: “PV costs $369.88, SV costs $124.99”) (Example 2: Seller: “IS costs $9,256.80, OS costs $16,855.30,”)

  • 4. revenue is incentive compatible function of offer and valuation.

(Example 1: Sold: SV for $124.99!) (Example 2: No Sale!)

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

3

slide-18
SLIDE 18

Overview

1.

= ⇒

Auction Problem (a) Random Sampling Solution (b) Retrospective bounds. (c) Software Versioning Example.

  • 2. Online Auction Problem

(a) Expert Learning based Auction. (b) Expert Learning with non-uniform bounds.

  • 3. Conclusions

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

4

slide-19
SLIDE 19

Auction Problem

The Unlimited Supply Auction Problem: Given:

  • unlimited supply of stuff.
  • Set S of n bidders with valuations for stuff.
  • class G of reasonable offers.

Design: Auction with profit near that of optimal single offer.

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

5

slide-20
SLIDE 20

Auction Problem

The Unlimited Supply Auction Problem: Given:

  • unlimited supply of stuff.
  • Set S of n bidders with valuations for stuff.
  • class G of reasonable offers.

Design: Auction with profit near that of optimal single offer. Notation:

  • g(i) = payoff from bidder i when offered g.
  • g(S) =

i∈S g(i).

  • optG(S) = argmaxg∈G g(S).
  • OPTG(S) = maxg∈G g(S).

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

5

slide-21
SLIDE 21

Random Sampling Auction

Random Sampling Optimal Offer Auction, RSOOG

  • 1. Randomly partition bidders into two sets: S1 and S2.
  • 2. compute g1 (resp. g2), optimal offer for S1 (resp. S2)
  • 3. Offer g1 to S2 and g2 to S1.

S

(Random Sampling Auction from [Goldberg, Hartline, Wright 2001])

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

6

slide-22
SLIDE 22

Random Sampling Auction

Random Sampling Optimal Offer Auction, RSOOG

  • 1. Randomly partition bidders into two sets: S1 and S2.
  • 2. compute g1 (resp. g2), optimal offer for S1 (resp. S2)
  • 3. Offer g1 to S2 and g2 to S1.

S S1 S2

(Random Sampling Auction from [Goldberg, Hartline, Wright 2001])

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

6

slide-23
SLIDE 23

Random Sampling Auction

Random Sampling Optimal Offer Auction, RSOOG

  • 1. Randomly partition bidders into two sets: S1 and S2.
  • 2. compute g1 (resp. g2), optimal offer for S1 (resp. S2)
  • 3. Offer g1 to S2 and g2 to S1.

S S1 S2

g1 = opt(S1) g2 = opt(S2)

(Random Sampling Auction from [Goldberg, Hartline, Wright 2001])

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

6

slide-24
SLIDE 24

Random Sampling Auction

Random Sampling Optimal Offer Auction, RSOOG

  • 1. Randomly partition bidders into two sets: S1 and S2.
  • 2. compute g1 (resp. g2), optimal offer for S1 (resp. S2)
  • 3. Offer g1 to S2 and g2 to S1.

S S1 S2

g1 = opt(S1) g2 = opt(S2) g1 = opt(S1) g2 = opt(S2)

(Random Sampling Auction from [Goldberg, Hartline, Wright 2001])

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

6

slide-25
SLIDE 25

Random Sampling Auction

Random Sampling Optimal Offer Auction, RSOOG

  • 1. Randomly partition bidders into two sets: S1 and S2.
  • 2. compute g1 (resp. g2), optimal offer for S1 (resp. S2)
  • 3. Offer g1 to S2 and g2 to S1.

S S1 S2

g1 = opt(S1) g2 = opt(S2) g1 = opt(S1) g2 = opt(S2)

Question: when is RSOOG good? (Random Sampling Auction from [Goldberg, Hartline, Wright 2001])

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

6

slide-26
SLIDE 26

Performance Analysis

Lemma: For g and random partitions S1 and S2: Pr[|g(S1) − g(S2)| > ǫ max(p, g(S))] ≤ 2e−ǫ2p/2h.

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

7

slide-27
SLIDE 27

Performance Analysis

Lemma: For g and random partitions S1 and S2: Pr[|g(S1) − g(S2)| > ǫ max(p, g(S))] ≤ 2e−ǫ2p/2h. Consider:

  • Use p = OPTG.
  • If |G| e−ǫ2 OPTG /2h ≤ δ,
  • union bound probability any g ∈ G is bad by δ.

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

7

slide-28
SLIDE 28

Performance Analysis

Lemma: For g and random partitions S1 and S2: Pr[|g(S1) − g(S2)| > ǫ max(p, g(S))] ≤ 2e−ǫ2p/2h. Consider:

  • Use p = OPTG.
  • If |G| e−ǫ2 OPTG /2h ≤ δ,
  • union bound probability any g ∈ G is bad by δ.

Theorem: With probability 1 − δ profit from RSOOG is at least

(1 − ǫ) OPTG − O( h

ǫ2 log |G| δ )

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

7

slide-29
SLIDE 29

Performance Analysis

Lemma: For g and random partitions S1 and S2: Pr[|g(S1) − g(S2)| > ǫ max(p, g(S))] ≤ 2e−ǫ2p/2h. Consider:

  • Use p = OPTG.
  • If |G| e−ǫ2 OPTG /2h ≤ δ,
  • union bound probability any g ∈ G is bad by δ.

Theorem: With probability 1 − δ profit from RSOOG is at least

(1 − ǫ) OPTG − O( h

ǫ2 log |G| δ )

Interpretation: O(h log |G|) is convergence time.

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

7

slide-30
SLIDE 30

Example

Example: Selling tee shirts. (discretized prices)

  • Bidders with valuations in [1, h] for a tee shirt.
  • Reasonable offers: G = {price 2i for i ∈ {1, . . . , log h}}.
  • Convergence Time: O(h log |G|) = O(h log log h)

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

8

slide-31
SLIDE 31

Overview

  • 1. Auction Problem

(a) Random Sampling Solution (b)

= ⇒

Retrospective bounds. (c) Software Versioning Example.

  • 2. Online Auction Problem

(a) Expert Learning based Auction. (b) Expert Learning with non-uniform bounds.

  • 3. Conclusions

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

9

slide-32
SLIDE 32

|G| = ∞?

What if |G| = ∞?

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

10

slide-33
SLIDE 33

|G| = ∞?

What if |G| = ∞? Example: Selling tee shirts. (non-discretized prices)

  • Bidders with valuations v1, . . . , vn in [1, h] for a tee shirt.
  • Reasonable offers: G = {price p ∈ [1, h]}.
  • Convergence Time:

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

10

slide-34
SLIDE 34

|G| = ∞?

What if |G| = ∞? Example: Selling tee shirts. (non-discretized prices)

  • Bidders with valuations v1, . . . , vn in [1, h] for a tee shirt.
  • Reasonable offers: G = {price p ∈ [1, h]}.
  • Convergence Time:

Observation:

  • Suppose RSOOG on S only offers g ∈ GS ⊂ G.
  • Then RSOOGS(S) is same as RSOOG(S).
  • Retrospectively perform analysis on GS instead of G.

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

10

slide-35
SLIDE 35

|G| = ∞?

What if |G| = ∞? Example: Selling tee shirts. (non-discretized prices)

  • Bidders with valuations v1, . . . , vn in [1, h] for a tee shirt.
  • Reasonable offers: G = {price p ∈ [1, h]}.
  • Convergence Time:

Observation:

  • Suppose RSOOG on S only offers g ∈ GS ⊂ G.
  • Then RSOOGS(S) is same as RSOOG(S).
  • Retrospectively perform analysis on GS instead of G.

Consider: GS = {“offer vi” : i ∈ S}.

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

10

slide-36
SLIDE 36

|G| = ∞?

What if |G| = ∞? Example: Selling tee shirts. (non-discretized prices)

  • Bidders with valuations v1, . . . , vn in [1, h] for a tee shirt.
  • Reasonable offers: G = {price p ∈ [1, h]}.
  • Convergence Time: O(h |GS|) = O(h log n)

Observation:

  • Suppose RSOOG on S only offers g ∈ GS ⊂ G.
  • Then RSOOGS(S) is same as RSOOG(S).
  • Retrospectively perform analysis on GS instead of G.

Consider: GS = {“offer vi” : i ∈ S}.

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

10

slide-37
SLIDE 37

Example: Software Versioning

Example: Software versioning.

  • Microsoft has M possible versions of MS Office,
  • but can only package and sell m ≪ M distinct versions.

(E.g., Student and Professional).

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

11

slide-38
SLIDE 38

Example: Software Versioning

Example: Software versioning.

  • Microsoft has M possible versions of MS Office,
  • but can only package and sell m ≪ M distinct versions.

(E.g., Student and Professional).

  • Reasonable offers: G = {p ∈ (R ∪ {∞})M with pj = ∞ for all

but m items}.

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

11

slide-39
SLIDE 39

Example: Software Versioning

Example: Software versioning.

  • Microsoft has M possible versions of MS Office,
  • but can only package and sell m ≪ M distinct versions.

(E.g., Student and Professional).

  • Reasonable offers: G = {p ∈ (R ∪ {∞})M with pj = ∞ for all

but m items}.

  • What is |GS|?

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

11

slide-40
SLIDE 40

Example: Software Versioning

Example: Software versioning.

  • Microsoft has M possible versions of MS Office,
  • but can only package and sell m ≪ M distinct versions.

(E.g., Student and Professional).

  • Reasonable offers: G = {p ∈ (R ∪ {∞})M with pj = ∞ for all

but m items}.

  • What is |GS|?

Lemma: |GS| = O((nm2M)m).

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

11

slide-41
SLIDE 41

Example: Software Versioning

Example: Software versioning.

  • Microsoft has M possible versions of MS Office,
  • but can only package and sell m ≪ M distinct versions.

(E.g., Student and Professional).

  • Reasonable offers: G = {p ∈ (R ∪ {∞})M with pj = ∞ for all

but m items}.

  • What is |GS|?

Lemma: |GS| = O((nm2M)m). Convergence time = O(hm log(nm2M)) .

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

11

slide-42
SLIDE 42

Example: Software Versioning

Example: Software versioning.

  • Microsoft has M possible versions of MS Office,
  • but can only package and sell m ≪ M distinct versions.

(E.g., Student and Professional).

  • Reasonable offers: G = {p ∈ (R ∪ {∞})M with pj = ∞ for all

but m items}.

  • What is |GS|?

Lemma: |GS| = O((nm2M)m). Convergence time = O(hm log(nm2M)) ≈ O(hm log n).

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

11

slide-43
SLIDE 43

Proof of Lemma

Lemma: |GS| = O((nm2M)m).

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

12

slide-44
SLIDE 44

Proof of Lemma

Lemma: |GS| = O((nm2M)m). Proof:

  • 1. Fix set of m items to sell.
  • 2. Bidder i’s valuation divides price space into m + 1 convex regions.

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

12

slide-45
SLIDE 45

Proof of Lemma

Lemma: |GS| = O((nm2M)m). Proof:

  • 1. Fix set of m items to sell.
  • 2. Bidder i’s valuation divides price space into m + 1 convex regions.
  • 3. Regions are joined by (m + 1)2 hyperplanes.

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

12

slide-46
SLIDE 46

Proof of Lemma

Lemma: |GS| = O((nm2M)m). Proof:

  • 1. Fix set of m items to sell.
  • 2. Bidder i’s valuation divides price space into m + 1 convex regions.
  • 3. Regions are joined by (m + 1)2 hyperplanes.
  • 4. n bidders total for n(m + 1)2 hyperplanes.

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

12

slide-47
SLIDE 47

Proof of Lemma

Lemma: |GS| = O((nm2M)m). Proof:

  • 1. Fix set of m items to sell.
  • 2. Bidder i’s valuation divides price space into m + 1 convex regions.
  • 3. Regions are joined by (m + 1)2 hyperplanes.
  • 4. n bidders total for n(m + 1)2 hyperplanes.
  • 5. RSOOG offer price must be at intersection of hyperplanes.

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

12

slide-48
SLIDE 48

Proof of Lemma

Lemma: |GS| = O((nm2M)m). Proof:

  • 1. Fix set of m items to sell.
  • 2. Bidder i’s valuation divides price space into m + 1 convex regions.
  • 3. Regions are joined by (m + 1)2 hyperplanes.
  • 4. n bidders total for n(m + 1)2 hyperplanes.
  • 5. RSOOG offer price must be at intersection of hyperplanes.
  • 6. K = n(m + 1)2 hyperplanes in m dimensions intersect in Km.

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

12

slide-49
SLIDE 49

Proof of Lemma

Lemma: |GS| = O((nm2M)m). Proof:

  • 1. Fix set of m items to sell.
  • 2. Bidder i’s valuation divides price space into m + 1 convex regions.
  • 3. Regions are joined by (m + 1)2 hyperplanes.
  • 4. n bidders total for n(m + 1)2 hyperplanes.
  • 5. RSOOG offer price must be at intersection of hyperplanes.
  • 6. K = n(m + 1)2 hyperplanes in m dimensions intersect in Km.
  • 7. Sum over M m possible m-item sets.

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

12

slide-50
SLIDE 50

Other Results

See paper for details on:

  • Bounds for RSOOG for item-pricing in combinatorial auctions.
  • Bounds for RSOOG on bidders with observable features.
  • Better bounds with ǫ-covers of G.
  • Better random sampling auction with structural risk minimization.
  • Using approximation algorithms in RSOOG.

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

13

slide-51
SLIDE 51

Overview

  • 1. Auction Problem

(a) Random Sampling Solution (b) Retrospective bounds. (c) Software Versioning Example. 2.

= ⇒

Online Auction Problem (a) Expert Learning based Auction. (b) Expert Learning with non-uniform bounds.

  • 3. Conclusions

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

14

slide-52
SLIDE 52

Online Auction Problem

Online Auction Problem:

  • unlimited supply of stuff.
  • class G of reasonable offers.
  • Bidders arrive one at a time and place bids, b1, b2, . . .
  • Auctioneer makes offer g from G before next bidder arrives.
  • Goal: Auction with profit close to optimal single offer.

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

15

slide-53
SLIDE 53

Online Auction Problem

Online Auction Problem:

  • unlimited supply of stuff.
  • class G of reasonable offers.
  • Bidders arrive one at a time and place bids, b1, b2, . . .
  • Auctioneer makes offer g from G before next bidder arrives.
  • Goal: Auction with profit close to optimal single offer.

Two Difficulties:

  • 1. Incentive Compatibility requirement:
  • 2. Online Requirement (do not know future):

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

15

slide-54
SLIDE 54

Online Auction Problem

Online Auction Problem:

  • unlimited supply of stuff.
  • class G of reasonable offers.
  • Bidders arrive one at a time and place bids, b1, b2, . . .
  • Auctioneer makes offer g from G before next bidder arrives.
  • Goal: Auction with profit close to optimal single offer.

Two Difficulties:

  • 1. Incentive Compatibility requirement:
  • ffer to bidder i not function of bi.
  • 2. Online Requirement (do not know future):

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

15

slide-55
SLIDE 55

Online Auction Problem

Online Auction Problem:

  • unlimited supply of stuff.
  • class G of reasonable offers.
  • Bidders arrive one at a time and place bids, b1, b2, . . .
  • Auctioneer makes offer g from G before next bidder arrives.
  • Goal: Auction with profit close to optimal single offer.

Two Difficulties:

  • 1. Incentive Compatibility requirement:
  • ffer to bidder i not function of bi.
  • 2. Online Requirement (do not know future):

price offered bidder i not function of future bids.

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

15

slide-56
SLIDE 56

Online Auction Problem

Online Auction Problem:

  • unlimited supply of stuff.
  • class G of reasonable offers.
  • Bidders arrive one at a time and place bids, b1, b2, . . .
  • Auctioneer makes offer g from G before next bidder arrives.
  • Goal: Auction with profit close to optimal single offer.

Two Difficulties:

  • 1. Incentive Compatibility requirement:
  • ffer to bidder i not function of bi.
  • 2. Online Requirement (do not know future):

price offered bidder i not function of future bids. Conclusion: offer for bidder i based only on prior bids: b1, . . . , bi−1.

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

15

slide-57
SLIDE 57

Assumptions

  • 1. We learn each bidders full valuation.

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

16

slide-58
SLIDE 58

Assumptions

  • 1. We learn each bidders full valuation.

for partial information case see multi-armed bandit solutions:

[Blum, Kumar, Rudra, Wu ’03][Kleinberg, Leighton ’03][Blum, Hartline 05]

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

16

slide-59
SLIDE 59

Assumptions

  • 1. We learn each bidders full valuation.

for partial information case see multi-armed bandit solutions:

[Blum, Kumar, Rudra, Wu ’03][Kleinberg, Leighton ’03][Blum, Hartline 05]

  • 2. Bidders cannot come back.
  • 3. Bidders cannot lie about their arrival time.

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

16

slide-60
SLIDE 60

Assumptions

  • 1. We learn each bidders full valuation.

for partial information case see multi-armed bandit solutions:

[Blum, Kumar, Rudra, Wu ’03][Kleinberg, Leighton ’03][Blum, Hartline 05]

  • 2. Bidders cannot come back.
  • 3. Bidders cannot lie about their arrival time.

for temporal strategyproofness see: [Hajiaghayi, Kleinberg, Parkes ’04]

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

16

slide-61
SLIDE 61

Assumptions

  • 1. We learn each bidders full valuation.

for partial information case see multi-armed bandit solutions:

[Blum, Kumar, Rudra, Wu ’03][Kleinberg, Leighton ’03][Blum, Hartline 05]

  • 2. Bidders cannot come back.
  • 3. Bidders cannot lie about their arrival time.

for temporal strategyproofness see: [Hajiaghayi, Kleinberg, Parkes ’04]

  • 4. items in unlimited supply.

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

16

slide-62
SLIDE 62

Assumptions

  • 1. We learn each bidders full valuation.

for partial information case see multi-armed bandit solutions:

[Blum, Kumar, Rudra, Wu ’03][Kleinberg, Leighton ’03][Blum, Hartline 05]

  • 2. Bidders cannot come back.
  • 3. Bidders cannot lie about their arrival time.

for temporal strategyproofness see: [Hajiaghayi, Kleinberg, Parkes ’04]

  • 4. items in unlimited supply.

for limited supply see: [Hajiaghayi, Kleinberg, Parkes ’04][Kleinberg ’05]

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

16

slide-63
SLIDE 63

Online Learning

Expert Online Learning Problem: In round i:

  • 1. Each of k experts propose a strategy.
  • 2. We choose an expert’s strategy.
  • 3. Find out how each strategy performed (payoff)

Goal: Obtain payoff close to single best expert overall (in hindsight).

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

17

slide-64
SLIDE 64

Online Learning

Expert Online Learning Problem: In round i:

  • 1. Each of k experts propose a strategy.
  • 2. We choose an expert’s strategy.
  • 3. Find out how each strategy performed (payoff)

Goal: Obtain payoff close to single best expert overall (in hindsight). Weighted Majority Algorithm: (for round i) Let h be maximum payoff. For expert j, let sj be total payoff thus far. Choose expert j’s strategy with probability proportional to (1+2ǫ)sj/h.

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

17

slide-65
SLIDE 65

Online Learning

Expert Online Learning Problem: In round i:

  • 1. Each of k experts propose a strategy.
  • 2. We choose an expert’s strategy.
  • 3. Find out how each strategy performed (payoff)

Goal: Obtain payoff close to single best expert overall (in hindsight). Weighted Majority Algorithm: (for round i) Let h be maximum payoff. For expert j, let sj be total payoff thus far. Choose expert j’s strategy with probability proportional to (1+2ǫ)sj/h. Result: E[payoff] = (1 − ǫ) OPT − h

2ǫ log k.

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

17

slide-66
SLIDE 66

Application to Online Auctions

Application: (to online auctions) [Blum Kumar Rudra Wu 2003]

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

18

slide-67
SLIDE 67

Application to Online Auctions

Application: (to online auctions) [Blum Kumar Rudra Wu 2003]

  • 1. Expert for each g ∈ G

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

18

slide-68
SLIDE 68

Application to Online Auctions

Application: (to online auctions) [Blum Kumar Rudra Wu 2003]

  • 1. Expert for each g ∈ G
  • 2. Best expert ⇒ best offer.

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

18

slide-69
SLIDE 69

Application to Online Auctions

Application: (to online auctions) [Blum Kumar Rudra Wu 2003]

  • 1. Expert for each g ∈ G
  • 2. Best expert ⇒ best offer.

Result: E[profit] = (1 − ǫ) OPTG − h

ǫ log |G|.

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

18

slide-70
SLIDE 70

Application to Online Auctions

Application: (to online auctions) [Blum Kumar Rudra Wu 2003]

  • 1. Expert for each g ∈ G
  • 2. Best expert ⇒ best offer.

Result: E[profit] = (1 − ǫ) OPTG − h

ǫ log |G|.

Note: Same convergence time as for RSOOG.

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

18

slide-71
SLIDE 71

Example

Example: Selling tee shirts. (discretized prices)

  • Bidders with valuations in [1, h] for a tee shirt.
  • Reasonable offers: G = {price 2i for i ∈ {1, . . . , log h}}.
  • Convergence Time: O(h log |G|)

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

19

slide-72
SLIDE 72

Example

Example: Selling tee shirts. (discretized prices)

  • Bidders with valuations in [1, h] for a tee shirt.
  • Reasonable offers: G = {price 2i for i ∈ {1, . . . , log h}}.
  • Convergence Time: O(h log |G|)= O(h log log h).

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

19

slide-73
SLIDE 73

Better Bounds?

Can we get better bounds? Retrospective technique like using GS does not work.

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

20

slide-74
SLIDE 74

Overview

  • 1. Auction Problem

(a) Random Sampling Solution (b) Retrospective bounds. (c) Software Versioning Example.

  • 2. Online Auction Problem

(a) Expert Learning based Auction. (b)

= ⇒

Expert Learning with non-uniform bounds.

  • 3. Conclusions

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

21

slide-75
SLIDE 75

Non-uniform Bounds on Payoff

Expert Online Learning Problem: In round i:

  • 1. Each of k experts propose a strategy.
  • 2. We choose an expert’s strategy.
  • 3. Find out how each strategy performed (payoff)
  • 4. Expert i’s payoff is always less than hi.

Goal: Obtain payoff close to single best expert overall (in hindsight).

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

22

slide-76
SLIDE 76

Non-uniform Bounds on Payoff

Expert Online Learning Problem: In round i:

  • 1. Each of k experts propose a strategy.
  • 2. We choose an expert’s strategy.
  • 3. Find out how each strategy performed (payoff)
  • 4. Expert i’s payoff is always less than hi.

Goal: Obtain payoff close to single best expert overall (in hindsight). Non-uniform Experts Algorithm: [Kalai ’01][Blum, Hartline ’05]

  • 1. (initialization) For each expert, j, add initial score, sj, as:

hi × number of heads flipped in a row.

  • 2. Run deterministic “go with best expert” algorithm.

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

22

slide-77
SLIDE 77

Non-uniform Bounds on Payoff

Expert Online Learning Problem: In round i:

  • 1. Each of k experts propose a strategy.
  • 2. We choose an expert’s strategy.
  • 3. Find out how each strategy performed (payoff)
  • 4. Expert i’s payoff is always less than hi.

Goal: Obtain payoff close to single best expert overall (in hindsight). Non-uniform Experts Algorithm: [Kalai ’01][Blum, Hartline ’05]

  • 1. (initialization) For each expert, j, add initial score, sj, as:

hi × number of heads flipped in a row.

  • 2. Run deterministic “go with best expert” algorithm.

Result: E[profit] ≥ OPT /2 −

i hi.

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

22

slide-78
SLIDE 78

Application to Online Auctions

Application: (to online auctions)

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

23

slide-79
SLIDE 79

Application to Online Auctions

Application: (to online auctions)

  • 1. Bound hg for each g ∈ G.
  • 2. Expert for each g ∈ G
  • 3. Best expert ⇒ best offer.

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

23

slide-80
SLIDE 80

Application to Online Auctions

Application: (to online auctions)

  • 1. Bound hg for each g ∈ G.
  • 2. Expert for each g ∈ G
  • 3. Best expert ⇒ best offer.

Result: E[profit] = OPTG /2 −

g∈G hg.

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

23

slide-81
SLIDE 81

Application to Online Auctions

Application: (to online auctions)

  • 1. Bound hg for each g ∈ G.
  • 2. Expert for each g ∈ G
  • 3. Best expert ⇒ best offer.

Result: E[profit] = OPTG /2 −

g∈G hg.

Note: Convergence time =

g∈G hg

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

23

slide-82
SLIDE 82

Example

Example: Selling tee shirts. (discretized prices)

  • Bidders with valuations in [1, h] for a tee shirt.
  • Reasonable offers: G = {price 2i for i ∈ {1, . . . , log h}}.
  • Convergence Time:

g∈G hg

.

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

24

slide-83
SLIDE 83

Example

Example: Selling tee shirts. (discretized prices)

  • Bidders with valuations in [1, h] for a tee shirt.
  • Reasonable offers: G = {price 2i for i ∈ {1, . . . , log h}}.
  • Convergence Time:

g∈G hg = log h i

2i

.

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

24

slide-84
SLIDE 84

Example

Example: Selling tee shirts. (discretized prices)

  • Bidders with valuations in [1, h] for a tee shirt.
  • Reasonable offers: G = {price 2i for i ∈ {1, . . . , log h}}.
  • Convergence Time:

g∈G hg = log h i

2i ≤ 2h.

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

24

slide-85
SLIDE 85

Example

Example: Selling tee shirts. (discretized prices)

  • Bidders with valuations in [1, h] for a tee shirt.
  • Reasonable offers: G = {price 2i for i ∈ {1, . . . , log h}}.
  • Convergence Time:

g∈G hg = log h i

2i ≤ 2h.

Note: this is optimal up to constant factors.

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

24

slide-86
SLIDE 86

Conclusions

  • 1. Used machine learning techniques for auction design/analysis.
  • 2. Prior-free discriminatory optimal mechanism design.

(a) distinguishing between products (and selecting products to sell). (b) price discriminate based on observable customer features.

  • 3. Similar bounds for offline and online auctions.
  • 4. Retrospective analysis for offline auctions.

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

25

slide-87
SLIDE 87

Conclusions

  • 1. Used machine learning techniques for auction design/analysis.
  • 2. Prior-free discriminatory optimal mechanism design.

(a) distinguishing between products (and selecting products to sell). (b) price discriminate based on observable customer features.

  • 3. Similar bounds for offline and online auctions.
  • 4. Retrospective analysis for offline auctions.
  • 5. Open: ǫ-cover arguments for online auctions?
  • 6. Open: limited supply?
  • 7. Open: general cost function on outcomes?

DISCRIMINATORY AUCTIONS – AUGUST 5, 2005

25