an optimal private stochastic mab algorithm based on an
play

An Optimal Private Stochastic-MAB Algorithm Based on an Optimal - PowerPoint PPT Presentation

An Optimal Private Stochastic-MAB Algorithm Based on an Optimal Private Stopping Rule Touqir Sajed & Or Sheffet K-Armed Stochastic Bandit Problem There are K arms The learner pulls an arm at rounds : 1, , T Pulling an arm i


  1. An Optimal Private Stochastic-MAB Algorithm Based on an Optimal Private Stopping Rule Touqir Sajed & Or Sheffet

  2. K-Armed Stochastic Bandit Problem ● There are K arms The learner pulls an arm at rounds : 1, … , T ● ● Pulling an arm i t at round t generates a reward: ● Minimize Pseudo Regret: ● UCB family meets the lower bound by Lai and Robbins 1985 :

  3. Differential Privacy ● Let D be a dataset of m datums and D’ be its neighbour They only differ in 1 reward sample ○ ● An Algorithm M is epsilon-DP if for any output set O, the following holds: ● A function has a sensitivity of if for all neighbours D and D’:

  4. Previous DP-MAB results ● DP-UCB algorithms by Mishra & Thakurta (2015), Tossou & Dimitrakakis (2016) Rely on tree-based binary mechanism by Chan et al (2011). ● ● Laplace noise of magnitude: ● Hence the extra pseudo regret bound of Shariff & Sheffet (2018) showed a lower bound of ● ● We propose two algorithms that match the lower bound:

  5. Our Contributions ● Proposed the first DP-MAB algorithm that meets the lower bound: ● Showed a lower bound for the private stopping rule problem: Proposed an optimal DP-stopping rule that meets the lower bound: ●

  6. Thank you! Come visit our poster today from 6:30 - 9pm at Pacific Ballroom #173

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend