online stochastic matching with unequal probabilities
play

Online Stochastic Matching with Unequal Probabilities Aranyak Mehta - PowerPoint PPT Presentation

Online Stochastic Matching with Unequal Probabilities Aranyak Mehta Bo Waggoner Harvard Morteza Zadimoghaddam SODA 2015 1 Outline Problem and motivation Prior work, our main result Key idea: Adaptivity Ideas behind


  1. Online Stochastic Matching with Unequal Probabilities Aranyak Mehta Bo Waggoner Harvard Morteza Zadimoghaddam SODA 2015 1

  2. Outline ● Problem and motivation ● Prior work, our main result ● Key idea: Adaptivity ● Ideas behind algorithm/analysis 2

  3. Motivation: Search ads advertisers search queries Time 3

  4. Motivation: Search ads advertisers search queries Simplified problem: - display one ad per query - have estimate of click Time probabilities - advertisers pay $1 if click, $0 if no click - advertisers have budget for one click per day How to assign ads? 4

  5. Online Stochastic Matching [Mehta and Panigrahi, 2012] fixed, online offline vertices arrivals Time 5

  6. Online Stochastic Matching [Mehta and Panigrahi, 2012] fixed, online offline vertices arrivals p 11 p 31 Time p 41 6

  7. Online Stochastic Matching [Mehta and Panigrahi, 2012] Pr[ searcher clicks if we show this ad ] fixed, online offline vertices arrivals p 11 p 31 Time p 41 7

  8. Online Stochastic Matching [Mehta and Panigrahi, 2012] fixed, online offline vertices arrivals p 31 Time Assign to vertex 3 ! Alg 8

  9. Online Stochastic Matching [Mehta and Panigrahi, 2012] With prob p 31 : match succeeds fixed, online offline vertices arrivals With prob 1 - p 31 : match fails p 31 Time Alg 9

  10. Online Stochastic Matching [Mehta and Panigrahi, 2012] fixed, online match succeeded offline vertices arrivals Time cannot be matched again Alg 10

  11. Online Stochastic Matching [Mehta and Panigrahi, 2012] fixed, online match failed offline vertices arrivals Time may be matched again later disappears (cannot re-try) Alg 11

  12. Measuring algorithm performance fixed, online offline vertices arrivals Alg’s performance = # successes Alg 12

  13. Measuring algorithm performance fixed, online offline vertices arrivals Alg’s performance = E[ # successes ] Alg 13 13

  14. Measuring algorithm performance fixed, online offline vertices arrivals Alg’s performance = E[ # successes ] Opt’s performance = size of max weighted assignment, budget 1 Alg Opt 14 14

  15. Measuring algorithm performance fixed, online Competitive ratio = offline vertices arrivals min Alg Alg’s performance = E[ size of matching ] Opt Opt’s performance = size of max weighted over all input instances. assignment, budget 1 (Note: Opt is a bit funky … not achievable even with foreknowledge of instance.) Alg Opt 15 15

  16. Prior Work ● Online Matching with Stochastic Rewards Mehta, Panigrahi, FOCS 2012. ○ Greedy = 0.5 . Opt ○ For case where all p are equal and vanishing : Alg ≥ 0.567 . Opt Open: anything better than Greedy for unequal p 16

  17. This work For unequal, vanishing edge probabilities: Alg ≥ 0.534 Opt 17

  18. This work For unequal, vanishing edge probabilities: So what? Alg ≥ 0.534 algorithmic ideas to beat Opt Greedy 18

  19. Outline ● Problem and motivation ● Prior work, our main result ● Key idea: Adaptivity ● Ideas behind algorithm/analysis 19

  20. Adaptive: sees whether or not assignment succeeds fixed, online offline vertices arrivals 20

  21. Our Approach 1. Start with an optimal non-adaptive alg that is straightforward to analyze 2. Add a small amount of adaptivity (second choices) 3. Analysis remains tractable by limiting amount of adaptivity 21

  22. An optimal non-adaptive algorithm ● MP-2012: nonadaptive algs have upper bound of 0.5 ● How to achieve 0.5 ? (Previously unknown.) Seems nonobvious. 22

  23. Maximize marginal expected gain offline online Assign first arrival to vertices arrivals vertex with largest p i1 0.3 0.4 0.2 23

  24. Maximize marginal expected gain offline online Assign next arrival to vertices arrivals max 0.1 Pr[ i available ] p i2 0.3 0.2 24

  25. Maximize marginal expected gain offline online = (1 - 0.4) * 0.3 Assign next arrival to vertices arrivals = 0.18 max 0.1 Pr[ i available ] p i2 0.3 0.2 = (1) * 0.2 = 0.2 25

  26. NonAdaptive Theorem: NonAdaptive has a competitive ratio of 0.5 for the general online stochastic matching problem. Does not require vanishing probabilities. 26

  27. Why do we like NonAdaptive ? ● On a given instance, an arrival has the same “first choice” every time (regardless of previous realizations) ● Algorithm tracks/uses competitive ratio (probabilities of success) 27

  28. Add Adaptivity (but not too much) Proposed SemiAdaptive : Assign next arrival to max Pr[ i available ] p ij unless already taken, in which case assign to second-highest. 28

  29. Why do we like SemiAdaptive ? ● On a given instance, an arrival has the same first and second choices every time (regardless of previous realizations) ● Algorithm tracks/uses competitive ratio (probabilities of success) These allow us to analyze SemiAdaptive -- almost... 29

  30. (Analysis?) Roadblock ● Want: when first-choice is not available, get measurable benefit by assigning to second choice → giving improvement over NonAdaptive ’s 0.5 30

  31. (Analysis?) Roadblock ● Want: when first-choice is not available, get measurable benefit by assigning to second choice → giving improvement over NonAdaptive ’s 0.5 ● Problem: correlation between availability of first and second choice . Perhaps when first choice is not available, most likely second choice is not available either. → cannot guarantee improvement over NonAdaptive 31

  32. (Analysis?) Roadblock ● Want: when first-choice is not available, get measurable benefit by assigning to second choice → giving improvement over NonAdaptive ’s 0.5 ● Problem: correlation between availability of first and second choice . Perhaps when first choice is not available, most likely second choice is not available either. → cannot guarantee improvement over NonAdaptive ● Fix: introduce independence / even less adaptivity. (no time to say more! sorry!) 32

  33. RECAP Online stochastic matching problem: - edges succeed probabilistically - maximize expected number of successes - input instance chosen adversarially New here: p 11 - edge probabilities may be unequal p 31 33

  34. RECAP Results: - optimal 0.5 -competitive NonAdaptive - 0.534 -competitive SemiAdaptive (with tweak) for vanishing probabilities Key idea: p 11 - control adaptivity to control analysis p 31 34

  35. Future Work Everything about Online Stochastic Matching: ● Vanishing probabilities: ○ Equal: 0.567 … ? … 0.62 ○ Unequal: 0.534 … ? … 0.62 ● Large probabilities: ○ Equal: 0.53 … ? … 0.62 ○ Unequal: 0.5 … ? … 0.62 35

  36. Future Work Everything about Online Stochastic Matching: ● Vanishing probabilities: ○ Equal: 0.567 … ? … 0.62 ○ Unequal: 0.534 … ? … 0.62 ● Large probabilities: ○ Equal: 0.53 … ? … 0.62 ○ Unequal: 0.5 … ? … 0.62 Thanks! 36

  37. Additional slides 37

  38. Final Algorithm “ SemiAdaptive ” Assign next arrival to max Pr[ i available ] p ij * unless already taken, in which case assign to second-highest. * “it would have already been taken by a previous first-choice” (key point: even less adaptive, more independence) 38

  39. Ideas behind analysis Pr[ available ] Either first choice is q 1 p 12 the same as Opt’s... p 22 q 2 q 3 p 42 q 4 q 5 39

  40. Ideas behind analysis Pr[ available ] Either first choice is q 1 p 12 the same as Opt’s... p 22 q 2 ...or both first and second choice would q 3 give at least as much p 42 “gain” as Opt’s. q 4 q 5 40

  41. Ideas behind analysis Pr[ available ] Either first choice is Very good because gains q 1 p 12 the same as Opt’s... “compound”. p 22 q 2 ...or both first and second choice would q 3 Good because we get give at least as much p 42 “second-choice gains”. “gain” as Opt’s. q 4 q 5 41

  42. Note: Can only get 1 - 1/e ≈ 0.632 even with knowledge of instance online Weighted matching: 1 arrivals 1/n E[ # of matches ] = 1 - Pr[ all fail ] 1/n = 1 - (1 - 1/n) n 1/n → 1 - 1/e 1/n 1/n 1/n Alg Opt 42 42

  43. Example of defining Opt fixed, online offline vertices arrivals 1/2 Opt gets 1 Alg’s performance = 2/3 E[ size of matching ] Opt’s performance = size of max weighted assignment, budget 1 1/4 1/4 Opt gets 1/2 Alg Opt 43 43

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend