TBD
R a v i K u m a r Google Mountain View, CA
TBD R a v i K u m a r Google Mountain View, CA T heory B ehind D - - PowerPoint PPT Presentation
TBD R a v i K u m a r Google Mountain View, CA T heory B ehind D iscrete choice R a v i K u m a r Google Mountain View, CA (Joint work with Flavio Chierichetti & Andrew Tomkins) Discrete choice Random user { } Slate , , 25% 10%
R a v i K u m a r Google Mountain View, CA
R a v i K u m a r Google Mountain View, CA (Joint work with Flavio Chierichetti & Andrew Tomkins)
Random user Choice distribution 25% 10% 65% Slate
Random user How to learn the probability distributions governing the choice in a generic slate? 30% 70%
Slate
Random user 45% 25% 20% 10%
Random user 45% 25% 30%
Quickly learning the winning distributions of the slates is imporuant for applications … but there are exponentially many slates!
Universe = [n] = {1, 2, …, n} Slates = non-empty subsets of [n]
Discrete choice models can codify rational behavior S and T highly overlap ⟹ f(S) and f(T) may be related
a slate T (ie, argmaxt∈T D(t))
60% 40% Random User
40% 60%
Random User
60% 40% 40% 60% 0%
Assume a universe [n] and an unknown distribution on the permutations of [n] Given a slate S ⊆ [n], let DS(i) for i ∈ S be the probability that a random permutation (ie, user) prefers i to every other element of S
The type of queries that we allow can signifjcantly change the hardness of the problem By obtaining O((n/ε)2) random independent permutations (according to the unknown distribution), one can approximate each slate’s winning distribution to within an 𝓂1-error of ε Given a generic slate, return the winning probabilities induced by a random permutation chosen in the set of samples
1st 2nd
3rd
4th
Click It is easier to ask/infer the preferred
The random permutation query is infeasible in many applications
and observe which options they select
adaptively) some sequence S1, S2, … of slates to
DS1(·), DS2(·), ….
Given a slate S
permutation π, and returns the element of S with maximum rank in π
probability that i wins in S given a random permutation
Ω(2n) queries are needed to learn DS exactly
the expected total variation distance is going to be Ω(2-3n/2)
30% 10%
A > B > C > D B > C > A > D
There are only a few types of users
If there are only k types of users, then
calls to the max-dist oracle
weight au for each item u in U For a subset (slate) S of U, the probability of choosing u in slate S is proporuional to au Pr[choosing u in S] = au / ∑v∈S av
2 3 4 1 2 5 2 3 4 1 2 5
> > > > >
Random Permutation
Pick the next item in the permutation at random between the remaining ones, with probability proporuional to its weight 3/17 5/14 2/9
Assume for a slate S we get the choice distribution DS(·) exactly (max-dist oracle) For i = 1, …, n-1, query the MNL using slate {i, n} to get the choice distribution Di,n(·) (ai / (ai + an), an / (ai + an))
an/ (a1 + an) = D1,n(n) an / (a2 + an) = D2,n(n) … ∑ ai = 1 Solve the resulting system of linear equations to
1-MNL can be learnt with O(n) queries and slates of size 2
1 1
50% 50%
4
~ 50%
~ 10%
~ 40%
ε ε ε
~ 50%
1-MNLs are insuffjcient to capture common setuings
the problem
population-specifjc MNL, can solve the problem
2-MNL mixture: Given a universe U of items and positive weights au and bu for each item u in U For a slate S, the probability of choosing u in S equals γ · au / ∑v∈S av + (1 – γ) · bu / ∑v∈S bv Uniform mixture when γ = 1/2
MNL mixtures can approximate arbitrarily well any RUM (McFadden & Train 2000)
Choice models RUMs 1-MNLs k-MNLs
distribution DS(·) exactly
the choice distributions Di,j(·) and Di,j,k(·) 2 Di,j(i) = ai/(ai + aj) + bi/(bi + bj) 2 Di,j,k(i) = ai/(ai + aj + ak) + bi/(bi + bj + bk)
2 Di,j(i) = ai/(ai + aj) + bi/(bi + bj) 2 Di,k(i) = ai/(ai + ak) + bi/(bi + bk) 2 Dj,k(j) = aj/(aj + ak) + bj/(bj + bk) 2 Di,j,k(i) = ai/(ai + aj + ak) + bi/(bi + bj + bk) 2 Di,j,k(j) = aj/(ai + aj + ak) + bj/(bi + bj + bk) ai + aj + ak =1, bi + bj + bk = 1
= {i, j, k}, the choice distributions of all the subsets of S determine uniquely the weights of i, j, k in each of the two MNLs Proof steps.
and give combinatorial algorithm to determine it
pergorming max-dist queries on O(n) slates of sizes 2 and 3, that reconstructs the weights of any uniform 2-MNL system on n elements
pergorming max-dist queries on O(n2) slates of sizes 2 and 3, that reconstructs the weights of any uniform 2-MNL system on n elements
related to discrete choice
imporuant and relevant in practice
max-dist oracles?
Questions/Comments ravi.k53 @ gmail