Gibbs Sampling from π-Determinantal Point Processes
Based on joint work with Shayan Oveis Gharan
Alireza Rezaei
University of Washington
Gibbs Sampling from -Determinantal Point Processes Alireza Rezaei - - PowerPoint PPT Presentation
Gibbs Sampling from -Determinantal Point Processes Alireza Rezaei University of Washington Based on joint work with Shayan Oveis Gharan Point Process: A distribution on subsets of = {1,2, , } . Determinantal Point Process:
Based on joint work with Shayan Oveis Gharan
University of Washington
Point Process: A distribution on subsets of π = {1,2, β¦ , π}. Determinantal Point Process: There is a PSD kernel π β βπΓπ such that βπ β π : β π β det ππ
Point Process: A distribution on subsets of π = {1,2, β¦ , π}. Determinantal Point Process: There is a PSD kernel π β βπΓπ such that βπ β π : β π β det ππ π-DPP: Conditioning of a DPP on picking subsets of size π if π = π: β π β det ππ
Focus of the talk: Sampling from π- DPPs
Point Process: A distribution on subsets of π = {1,2, β¦ , π}. Determinantal Point Process: There is a PSD kernel π β βπΓπ such that DPPs are Very popular probabilistic models in machine learning to capture diversity. βπ β π : β π β det ππ π-DPP: Conditioning of a DPP on picking subsets of size π if π = π: β π β det ππ
Focus of the talk: Sampling from π- DPPs
Applications [Kulesza-Taskarβ11, Dangβ05, Nenkova-Vanderwende-McKeownβ06, Mirzasoleiman-Jegelka-Krauseβ17]
β Image search, document and video summarization, tweet timeline generation, pose estimation, feature selection
Input: PSD operator π: π Γ π β β and π select a subset π β π with π points from a distribution with PDF function π(π) β det π(π¦, π§) π¦,π§βπ
Input: PSD operator π: π Γ π β β and π select a subset π β π with π points from a distribution with PDF function π(π) β det π(π¦, π§) π¦,π§βπ
Applications.
β Hyper-parameter tuning [Dodge-Jamieson-Smithβ17] β Learning mixture of Gaussians[Affandi-Fox-Taskarβ13]
π¦βπ§ Ξ£β1 π¦βπ§ 2
1 2.
chosen) Continuous: PDF π§ β π π¦1, β¦ π¦πβ1, π§, π¦π+1, β¦ , π¦π )
y π¦π
Main Result
Given a π-DPP π, an βapproximateβ sample from π can be generated by running the Gibbs sampler for π = ΰ·© π· ππ β π¦π©π‘ (π°ππ¬π
ππ ππ ) steps where π is the starting dist.
Main Result
Given a π-DPP π, an βapproximateβ sample from π can be generated by running the Gibbs sampler for π = ΰ·© π· ππ β π¦π©π‘ (π°ππ¬π
ππ ππ ) steps where π is the starting dist.
Discrete: A simple greedy initialization gives π = π π5log π . Total running time is π π . poly π .
Main Result
Given a π-DPP π, an βapproximateβ sample from π can be generated by running the Gibbs sampler for π = ΰ·© π· ππ β π¦π©π‘ (π°ππ¬π
ππ ππ ) steps where π is the starting dist.
Discrete: A simple greedy initialization gives π = π π5log π . Total running time is π π . poly π .
Continuous: Given access to conditional oracles, π can be found so π = π(π5log π).
Being able to run the chain.
Main Result
Given a π-DPP π, an βapproximateβ sample from π can be generated by running the Gibbs sampler for π = ΰ·© π· ππ β π¦π©π‘ (π°ππ¬π
ππ ππ ) steps where π is the starting dist.
Discrete: A simple greedy initialization gives π = π π5log π . Total running time is π π . poly π .
Continuous: Given access to conditional oracles, π can be found so π = π(π5log π).
π¦βπ§ 2 π2
) defined a unit sphere in βπ, the total running time is
π)
Being able to run the chain.