online learning via convex geometry with applications to
play

Online Learning via Convex Geometry, with Applications to Pricing - PowerPoint PPT Presentation

Online Learning via Convex Geometry, with Applications to Pricing Adrian Vladu MIT Dynamic Pricing Problem ? ? ? $0.5 $2.0 $1.0 ? ? ? $0.5 $2.0 $1.0 ? ? ? day 1 x 1 x 3 x 0 ? $0.5 $2.0 $1.0 ? ? ?


  1. Online Learning via Convex Geometry, with Applications to Pricing Adrian Vladu MIT

  2. Dynamic Pricing Problem ? ? ?

  3. $0.5 $2.0 $1.0 🙋 ? ? ?

  4. $0.5 $2.0 $1.0 🙋 ? ? ? day 1 x 1 x 3 x 0 💶 ?

  5. $0.5 $2.0 $1.0 🙋 ? ? ? day 1 x 1 x 3 x 0 💶 ? $7

  6. $0.5 $2.0 $1.0 🙋 ? ? ? day 1 x 1 x 3 x 0 💶 ? $7 ❌

  7. $0.5 $2.0 $1.0 🙋 ? ? ? Regret=6.5 day 1 x 1 x 3 x 0 💶 ? $7 ❌

  8. $0.5 $2.0 $1.0 🙋 ? ? ? Regret=6.5 day 1 day 2 x 2 x 1 x 0 x 3 x 2 x 0 💶 ? 💶 ? $7 ❌

  9. $0.5 $2.0 $1.0 🙋 ? ? ? Regret=6.5 day 1 day 2 x 2 x 1 x 0 x 3 x 2 x 0 💶 ? 💶 ? $2 $7 ❌

  10. $0.5 $2.0 $1.0 🙋 ? ? ? Regret=7.5 day 1 day 2 x 2 x 1 x 0 x 3 x 2 x 0 💶 ? 💶 ? $2 $7 ✅ ❌

  11. $0.5 $2.0 $1.0 🙋 ? ? ? Regret=7.5 day 1 day 2 day 3 x 2 x 1 x 0 x 0 x 3 x 1 x 2 x 0 x 1 💶 ? 💶 ? 💶 ? $2 $7 ✅ ❌

  12. $0.5 $2.0 $1.0 🙋 ? ? ? Regret=7.5 day 1 day 2 day 3 x 2 x 1 x 0 x 0 x 3 x 1 x 2 x 0 x 1 💶 ? 💶 ? 💶 ? $2 $7 $4 ✅ ❌

  13. $0.5 $2.0 $1.0 🙋 ? ? ? Regret=10. 5 day 1 day 2 day 3 x 2 x 1 x 0 x 0 x 3 x 1 x 2 x 0 x 1 💶 ? 💶 ? 💶 ? $2 $7 $4 ✅ ❌ ❌

  14. $0.5 $2.0 $1.0 🙋 ? ? ? Regret=10.5 day 1 day 2 day 3 day 4 x 2 x 1 x 0 x 1 x 0 x 3 x 1 x 1 x 2 x 0 x 1 x 1 💶 ? 💶 ? 💶 ? 💶 ? $2 $7 $4 ✅ ❌ ❌

  15. $0.5 $2.0 $1.0 🙋 ? ? ? Regret=10.5 day 1 day 2 day 3 day 4 x 2 x 1 x 0 x 1 x 0 x 3 x 1 x 1 x 2 x 0 x 1 x 1 💶 ? 💶 ? 💶 ? 💶 ? $2 $7 $4 $3 ✅ ❌ ❌

  16. $0.5 $2.0 $1.0 🙋 ? ? ? Regret=11 day 1 day 2 day 3 day 4 x 2 x 1 x 0 x 1 x 0 x 3 x 1 x 1 x 2 x 0 x 1 x 1 💶 ? 💶 ? 💶 ? 💶 ? $2 $7 $4 $3 ✅ ❌ ❌ ✅

  17. Dynamic Pricing Problem • Warm-up: 1-dimensional case • this work

  18. Dynamic Pricing Problem • Warm-up: 1-dimensional case • this work ?

  19. Dynamic Pricing Problem • Warm-up: 1-dimensional case • this work ? $0 $1

  20. Dynamic Pricing Problem • Warm-up: 1-dimensional case 💶 ? • this work ? $0 $1

  21. Dynamic Pricing Problem • Warm-up: 1-dimensional case 💶 ? • this work ? ✂ $0 $1 $0.5

  22. Dynamic Pricing Problem • Warm-up: 1-dimensional case ✅ • this work ? ✂ $0 $1 $0.5

  23. Dynamic Pricing Problem • Warm-up: 1-dimensional case 💶 ? • this work ? $0 $1

  24. Dynamic Pricing Problem • Warm-up: 1-dimensional case 💶 ? • this work ? ✂ $0 $1 $0.75

  25. Dynamic Pricing Problem • Warm-up: 1-dimensional case ❌ • this work ? ✂ $0 $1 $0.75

  26. Dynamic Pricing Problem • Warm-up: 1-dimensional case ❌ • this work ? ✂ $0 $1 $0.75

  27. Dynamic Pricing Problem • Warm-up: 1-dimensional case ⌛ O(log 1/ ε ) days • this work ? $0 $1

  28. Dynamic Pricing Problem • Warm-up: 1-dimensional case ⌛ O(log 1/ ε ) days • this work ? ≤ε $0 $1

  29. Dynamic Pricing Problem • Warm-up: 1-dimensional case • O(log T) is easy ⌛ O(log 1/ ε ) days ? • this work ≤ε $0 $1

  30. Dynamic Pricing Problem • Warm-up: 1-dimensional case • O(log T) is easy • O(log log T) is a cute puzzle [Kleinberg, Leighton ‘03] • Back to d-dimensional case • O(d 2 log T) [Cohen, Lobel, Paes Leme ’16] • Ω (d log log T) lower bound • Can we bridge the gap? • this work

  31. Dynamic Pricing Problem • Warm-up: 1-dimensional case • O(log T) is easy • O(log log T) is a cute puzzle [Kleinberg, Leighton ‘03] • Back to d-dimensional case • O(d 2 log T) [Cohen, Lobel, Paes Leme ’16] • Ω (d log log T) lower bound • Can we bridge the gap? • this work

  32. Dynamic Pricing Problem • Warm-up: 1-dimensional case • O(log T) is easy • O(log log T) is a cute puzzle [Kleinberg, Leighton ‘03] • Back to d-dimensional case • O(d 2 log T) [Cohen, Lobel, Paes Leme ’16] • Ω (d log log T) lower bound • Can we bridge the gap? • this work

  33. Dynamic Pricing Problem • Warm-up: 1-dimensional case • O(log T) is easy • O(log log T) is a cute puzzle [Kleinberg, Leighton ‘03] • Back to d-dimensional case • O(d 2 log T) [Cohen, Lobel, Paes Leme ’16] • Ω (d log log T) lower bound • Can we bridge the gap? • this work

  34. Dynamic Pricing Problem • Warm-up: 1-dimensional case • O(log T) is easy • O(log log T) is a cute puzzle [Kleinberg, Leighton ‘03] • Back to d-dimensional case • O(d 2 log T) [Cohen, Lobel, Paes Leme ’16] • Ω (d log log T) lower bound • Can we bridge the gap? • this work

  35. Dynamic Pricing Problem • Warm-up: 1-dimensional case • O(log T) is easy • O(log log T) is a cute puzzle [Kleinberg, Leighton ‘03] • Back to d-dimensional case • O(d 2 log T) [Cohen, Lobel, Paes Leme ’16] • Ω (d log log T) lower bound • Can we bridge the gap? • this work

  36. Dynamic Pricing Problem • Warm-up: 1-dimensional case • O(log T) is easy • O(log log T) is a cute puzzle [Kleinberg, Leighton ‘03] • Back to d-dimensional case • O(d 2 log T) [Cohen, Lobel, Paes Leme ’16] • Ω (d log log T) lower bound • O(d log T) [this work] • this work

  37. Model • start with d -dimensional convex body (pos- sible prices for items) • customer gives vector of item quantities u , k u k = 1, with the promise that w ( K, u ) � ✏ , asks for price • player chooses price c , and partitions K = K 1 [ K 2 , where K 1 = { x 2 K : x > u  c } and K 2 = K \ K 1 • customer picks which half contains the true prices, update K accordingly • repeat until no valid directions remaining

  38. Model • start with d -dimensional convex body (pos- sible prices for items) • customer gives vector of item quantities u , k u k = 1, with the promise that w ( K, u ) � ✏ , asks for price ≥ε • player chooses price c , and partitions K = K 1 [ K 2 , where K 1 = { x 2 K : x > u  c } and K 2 = K \ K 1 • customer picks which half contains the true prices, update K accordingly Cut • repeat until no valid directions remaining perpendicular to this direction

  39. ✂ Model • start with d -dimensional convex body (pos- sible prices for items) • customer gives vector of item quantities u , k u k = 1, with the promise that w ( K, u ) � ✏ , asks for price ≥ε • player chooses price c , and partitions K = K 1 [ K 2 , where K 1 = { x 2 K : x > u  c } and K 2 = K \ K 1 • customer picks which half contains the true prices, update K accordingly Cut • repeat until no valid directions remaining perpendicular to this direction

  40. ✂ Model • start with d -dimensional convex body (pos- sible prices for items) • customer gives vector of item quantities u , k u k = 1, with the promise that w ( K, u ) � ✏ , asks for price • player chooses price c , and partitions K = K 1 [ K 2 , where K 1 = { x 2 K : x > u  c } and K 2 = K \ K 1 • customer picks which half contains the true prices, update K accordingly • repeat until no valid directions remaining Keep this half

  41. ✂ Model • start with d -dimensional convex body (pos- sible prices for items) • customer gives vector of item quantities u , k u k = 1, with the promise that w ( K, u ) � ✏ , asks for price • player chooses price c , and partitions K = K 1 [ K 2 , where K 1 = { x 2 K : x > u  c } and K 2 = K \ K 1 • customer picks which half contains the true prices, update K accordingly • repeat until no valid directions remaining Keep this half

  42. ✂ Model • start with d -dimensional convex body (pos- sible prices for items) • customer gives vector of item quantities u , k u k = 1, with the promise that w ( K, u ) � ✏ , asks for price • player chooses price c , and partitions K = K 1 [ K 2 , where K 1 = { x 2 K : x > u  c } and K 2 = K \ K 1 • customer picks which half contains the true prices, update K accordingly • repeat until no valid directions remaining Keep this half Theorem : there exists a strategy for cutting such that w(K,u) ≤ ε for all u after O(d log (d/ ε )) iterations. Set ε =d log T / T , regret is at most T ε + O(d log (d/ ε )) = O(d log T) .

  43. ✂ Model • start with d -dimensional convex body (pos- sible prices for items) • customer gives vector of item quantities u , k u k = 1, with the promise that w ( K, u ) � ✏ , asks for price • player chooses price c , and partitions K = K 1 [ K 2 , where K 1 = { x 2 K : x > u  c } and K 2 = K \ K 1 • customer picks which half contains the true prices, update K accordingly • repeat until no valid directions remaining Keep this half Theorem : there exists a strategy for cutting such that w(K,u) ≤ ε for all u after O(d log (d/ ε )) iterations. Set ε =d log T / T , regret is at most T ε + O(d log (d/ ε )) = O(d log T) .

  44. Convex Geometry Toolbox

  45. Convex Geometry Toolbox ✂ Grunbaum’s Theorem : any cut through the centroid partitions K in two pieces of roughly equal volume (largest/ smallest ≤ e ).

  46. Convex Geometry Toolbox ✂ Not quite sufficient Grunbaum’s Theorem : any cut through the centroid partitions K in two pieces of roughly equal volume (largest/ smallest ≤ e ).

  47. Convex Geometry Toolbox ✂ Not quite sufficient Grunbaum’s Theorem : any cut through the centroid partitions K in two pieces of roughly equal volume (largest/ smallest ≤ e ).

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend