Learning the Valuations of a k-demand Agent Hanrui Zhang - - PowerPoint PPT Presentation

learning the valuations of a k demand agent
SMART_READER_LITE
LIVE PREVIEW

Learning the Valuations of a k-demand Agent Hanrui Zhang - - PowerPoint PPT Presentation

Learning the Valuations of a k-demand Agent Hanrui Zhang Vincent Conitzer Duke University this talk: optimal (up to lower order terms) algorithm for actively learning the valuations of a k- demand agent algorithm with


slide-1
SLIDE 1

Learning the Valuations

  • f a k-demand Agent

Hanrui Zhang Vincent Conitzer

  • Duke University
slide-2
SLIDE 2

this talk:

  • optimal (up to lower order terms) algorithm

for actively learning the valuations of a k- demand agent

  • algorithm with polynomial time & sample

complexity for passively learning the valuations of a k-demand agent

slide-3
SLIDE 3

k-demand agent: demands a set of items

  • f size <=k maximizing her utility, i.e.,

total value - total price

  • demand set: the set of items the agent

demands

k-demand agents and demand sets

slide-4
SLIDE 4

Unit-demand agents

value: price: $10 $12 $8 $6 $5 $5 surplus: $4 $7 $3 agent buys: ✘ ✔ ✘

slide-5
SLIDE 5

k-demand agents and demand sets

value: price: $5 $6 $4 $3 $4 $3 $2 $2 agent is 2-demand — they want no more than 2 items

slide-6
SLIDE 6

k-demand agents and demand sets

value: price: $5 surplus: 2-demand agent buys: $6 $4 $3 $4 $3 $2 $2 $1 $3 $2 $1 ✘ ✔ ✔ ✘

slide-7
SLIDE 7

k-demand agents and demand sets

value: price: $5 surplus: 2-demand agent buys: $6 $4 $3 $4 $3 $2 $2 $1 $3 $2 $1 ✘ ✔ ✔ ✘ demand set

slide-8
SLIDE 8

Demand queries

demand query: given a vector of prices, returns a demand set (which may not be unique)

slide-9
SLIDE 9

value: price: v1 = $5 2-demand agent buys: v2 = $6 v3 = $4 v4 = $3 p1 = $4 p2 = $2 p3 = $2 p4 = $2 ✘ ✔ ✔ ✘

slide-10
SLIDE 10

value: price: 2-demand agent buys: p1 = $4 p2 = $2 p3 = $2 p4 = $2 ✘ ✔ ✔ ✘ price: 2-demand agent buys: p1 = $2 p2 = $5 p3 = $3 p4 = $1.5 ✔ ✘ ✘ ✔ v1 = $5 v2 = $6 v3 = $4 v4 = $3

slide-11
SLIDE 11

value: price: 2-demand agent buys: p1 = $7 p2 = $3.5 p3 = $5.5 p4 = $4 ✘ ✔ ✘ ✘ price: 2-demand agent buys: p1 = $4 p2 = $2 p3 = $2 p4 = $2 ✘ ✔ ✔ ✘ price: 2-demand agent buys: p1 = $2 p2 = $5 p3 = $3 p4 = $1.5 ✔ ✘ ✘ ✔ v1 = $5 v2 = $6 v3 = $4 v4 = $3

slide-12
SLIDE 12

Actively learning the valuations

  • suppose there are n items, and the value vi of each

item is an integer between 1 and W

  • how many demand queries suffice to learn the full

valuations (i.e., (vi)i) of a k-demand agent?

  • spoiler: optimal number of queries is

(n log W) / (k log (n / k)) + n / k ± o(…)

slide-13
SLIDE 13

(n log W) / (k log (n / k)) + n / k ± o(…)

Sketch of lower bound

amount of information encoded in (vi)i maximum amount

  • f information

per query

slide-14
SLIDE 14

(n log W) / (k log (n / k)) + n / k ± o(…) necessary in the following case:

  • exactly one item is special, which has value 0
  • all other items have value 1
  • the special item is chosen uniformly at random

Sketch of lower bound

slide-15
SLIDE 15

Sketch of upper bound

  • warmup: n = k = 1
  • need to learn: a single number v1 in {1, 2, …, W}
  • query: given p, returns whether p < v1
  • optimal solution: binary search — log W queries
slide-16
SLIDE 16

Sketch of upper bound

  • slight generalization: n = k (= 1)
  • need to learn: a vector (vi)i of integers in {1, 2, …, W}
  • query: given (pi)i, returns, for each item i, whether pi < vi
  • optimal solution: simultaneous binary search — log W

queries

slide-17
SLIDE 17

Sketch of upper bound

  • general case: n ≥ k ≥ 1
  • straightforward solution: (1) divide items into groups of

size k, and (2) perform simultaneous binary search for each group sequentially

  • (n / k) log W queries
  • LB is (n log W) / (k log (n / k)) — can we do better?
slide-18
SLIDE 18

Sketch of upper bound

idea: biased binary search

  • learn v1 using log W queries, use item 1 as reference
  • in each query, post p1 = v1 - 0.5, so item 1 is marginally

attractive

  • for all other items, post biased (rather than middle-of-

possible-range) prices

slide-19
SLIDE 19

Sketch of upper bound

25 50 75 100 item 1 item 2 item 3 item 4

n = 4 k = 1 v1

slide-20
SLIDE 20

Sketch of upper bound

25 50 75 100 item 1 item 2 item 3 item 4

p1 = v1 - 0.5 p2 p3 p4 n = 4 k = 1 prices biased toward higher end of possible ranges

slide-21
SLIDE 21

Sketch of upper bound

  • in each query, post p1 = v1 - 0.5, so item 1 is marginally

attractive

  • for all other items, post biased (rather than middle-of-

possible range) prices

  • if item 1 in demand set: many items are overpriced;

shrink their possible ranges by a little

  • if item 1 not in demand set: a few items are underpriced;

shrink their possible ranges by a lot

slide-22
SLIDE 22

Sketch of upper bound

25 50 75 100 item 1 item 2 item 3 item 4

p1 = v1 - 0.5 p2 p3 p4 n = 4 k = 1 if item 1 in demand set: many items are overpriced; shrink their possible ranges by a little

slide-23
SLIDE 23

Sketch of upper bound

25 50 75 100 item 1 item 2 item 3 item 4

p1 = v1 - 0.5 p2 p3 p4 n = 4 k = 1 if item 1 in demand set: many items are overpriced; shrink their possible ranges by a little

slide-24
SLIDE 24

Sketch of upper bound

25 50 75 100 item 1 item 2 item 3 item 4

p1 = v1 - 0.5 p2 p3 p4 n = 4 k = 1 if item 1 in demand set: many items are overpriced; shrink their possible ranges by a little

slide-25
SLIDE 25

Sketch of upper bound

25 50 75 100 item 1 item 2 item 3 item 4

p1 = v1 - 0.5 p2 p3 p4 n = 4 k = 1 if item 1 not in demand set: a few items are underpriced; shrink their possible ranges by a lot

slide-26
SLIDE 26

Sketch of upper bound

25 50 75 100 item 1 item 2 item 3 item 4

p1 = v1 - 0.5 p2 p3 p4 n = 4 k = 1 if item 1 not in demand set: a few items are underpriced; shrink their possible ranges by a lot

slide-27
SLIDE 27

Sketch of upper bound

25 50 75 100 item 1 item 2 item 3 item 4

p1 = v1 - 0.5 p2 p3 p4 n = 4 k = 1 if item 1 not in demand set: a few items are underpriced; shrink their possible ranges by a lot

slide-28
SLIDE 28

Sketch of upper bound

  • if item 1 in demand set: many items are overpriced;

shrink their possible ranges by a little

  • if item 1 not in demand set: a few items are underpriced;

shrink their possible ranges by a lot

  • adjust bias to equalize information gain
  • larger information gain (~ k log (n / k)) in both cases!
slide-29
SLIDE 29
  • so far: tight UB & LB for active learning
  • next: (very brief discussion of)

computation & sample efficient algorithm for passive learning

slide-30
SLIDE 30

Passively learning valuations

  • prices are distributed according to a distribution 𝒠
  • true valuations v: a vector of real numbers
  • algorithm observes m iid sample price vectors pj

together with demand set Sj under pj

  • given {(Sj, pj)}, algorithm outputs a hypothesis vector h

which recovers v in a PAC sense — algorithm succeeds with probability 1 - 𝜀, in which case with probability 1 - 𝛇, demand set under (v, p) = demand set under (h, p)

slide-31
SLIDE 31

Passively learning valuations

  • idea: empirical risk minimization
  • tool: multiclass ERM principle & Natarajan dimension
  • treat problem as multiclass classification with < nk labels
  • hypothesis class has Natarajan dimension n
  • sample complexity is poly(n, k, log(1 / 𝜀), 1 / 𝛇)
  • solving ERM = finding a feasible solution to an LP
slide-32
SLIDE 32

Future directions

  • more general valuations, e.g., matroid-demand
  • tighter sample complexity bounds for passive learning
slide-33
SLIDE 33

Thanks for your attention!

Questions?

slide-34
SLIDE 34
  • in economic theory: learning utility functions from

revealed preferences (Samuelson, 1938; Afriat, 1967; Beigman & Vohra, 2006; …)

  • in CS: preference elicitation (Blum et al., 2004; Lahaie

& Parkes, 2004; Sandholm & Boutilier, 2006; …)

Related research