Privacy Preserving Bandits
Dimitrios Athanasakis (Brave)• 02.03.2020 @dimmu Joint work with:
- Mohammad Malekzadeh (QMUL/Brave)
- Hamed Haddadi(ICL/Brave)
- Ben Livshits (ICL/Brave)
Privacy Preserving Bandits Joint work with: Mohammad Malekzadeh - - PowerPoint PPT Presentation
Privacy Preserving Bandits Joint work with: Mohammad Malekzadeh (QMUL/Brave) Hamed Haddadi(ICL/Brave) Ben Livshits (ICL/Brave) Dimitrios Athanasakis (Brave) 02.03.2020 @dimmu Why this is an important topic Personalization
Dimitrios Athanasakis (Brave)• 02.03.2020 @dimmu Joint work with:
Personalization is ubiquitous
personalized experiences
single biggest application of personalization) fuels the internet.
Personalization is often invasive
internet
little pony relevant to the pricing of my plane tickets?
personal
Image source: The economist Big tech faces competition and privacy concerns in Brussels
https://www.economist.com/briefi ng/2019/03/23/big-tech-faces-co mpetition-and-privacy-concerns-i n-brussels
Great for privacy
user’s device, therefore fewer things to worry from a privacy perspective.
will learn a very accurate model recommendation policy for the user.
Not so good for utility
the local model to learn a useful recommendation policy
personalization options appear
Earning
the user how can we maximise his engagement?
Learning
interests?
product X to user Y?
user changed?
Problem Definition
6
State(t) Action(t)
A_1 : P_1 A_2 : P_2 . . . A_K : P_K
D K Complexity? data tuple = (S = [S_0, S_1, …, S_D] , A {1,2,...,K} , R {0,1})
∋ ∋
Privacy first!
○ Past 100 page visits? (%)
State? What state?
7
know its user faster and better?
Research Question
8
9
Can you recognize yourself by your own data?
VS Vanilla model inversion VS Model inversion on noised data
Crowd-blending Differential Privacy:
( Gehrke et al 2011) (Dwork & Roth 2013)
Our approach: ESA + LinUCB
12
State Space
○
D-dimensional vector of real numbers ○ Its sum is 1 ○ It’s rounded to F decimal points
○ with F=1 we have ~ 100K possible states ○ with F=2 it is ~ 4T
Number of possible states is too large
10 Stars into D Bars
F
Encoding
○ Locality-sensitive hashing
This helps increasing the size of the crowd a user can blend in. E.g. D=10 → 10 bits : → 1K 4T
* size shows the value
Shuffling
agents
shuffle their order.
batch is less than a defined threshold.
Model updates
shuffler releases.
weights
○ iid random sampling with probability p
Privacy Model
17
ƐDP =
ƐDP
p
ƐCB
○ Linear and nonlinear randomly initialized mapping functions ■ Input: a histogram ■ Output: a stochastic preference model
○ Input: a binary vector (features) ○ Output: a binary vector (labels)
○ Input: Integer values (unknown features) ○ Output: a one-hot vector (product category)
Evaluation
18
Environment Algorithm
Context
Github: https://github.com/mmalekzadeh/privacy-preserving-bandits
Results: Synthetic Data
19
reward for varying numbers of users
context on expected reward
Results: Multi-Label Classification
20
Results: Ad. Recommendation (Criteo)
21
|A|=40, d=10, u=3,000 agents
experiments are somewhat strange but surely interesting
Revisited)
22
Some Remarks
Github: https://github.com/mmalekzadeh/privacy-preserving-bandits
Personal Notes
job soon.
some remote presentations.
1. Poster #15 2. Working on privacy? Let’s talk. Have experiences in the adtech ecosystem? We’d like to hear from you. 3. We’re always looking for great engineers: https://brave.com/careers/
Also @dimmu