filtering with abstract particles
play

Filtering with Abstract Particles Jacob Steinhardt Percy Liang - PowerPoint PPT Presentation

Filtering with Abstract Particles Jacob Steinhardt Percy Liang Stanford University { jsteinhardt,pliang } @cs.stanford.edu May 1, 2013 J. Steinhardt & P. Liang (Stanford) Filtering with Abstract Particles May 1, 2013 1 / 12 Motivation


  1. Filtering with Abstract Particles Jacob Steinhardt Percy Liang Stanford University { jsteinhardt,pliang } @cs.stanford.edu May 1, 2013 J. Steinhardt & P. Liang (Stanford) Filtering with Abstract Particles May 1, 2013 1 / 12

  2. Motivation Goal. Given an (un-normalized) target distribution f ∗ ( x ) , p ∗ ( x ) = 1 Z f ∗ ( x ) , want to compute normalization constant Z . Issue. Often computationally intractable, so use some approximation ˆ f to f ∗ . variational Bayes, expectation propagation (drop dependencies) MCMC, sequential Monte Carlo, beam search (use samples) We will show how to combine advantages of both types of methods. J. Steinhardt & P. Liang (Stanford) Filtering with Abstract Particles May 1, 2013 2 / 12

  3. Variational vs. Particle Methods Goal: infer missing characters in r e c e J. Steinhardt & P. Liang (Stanford) Filtering with Abstract Particles May 1, 2013 3 / 12

  4. Variational vs. Particle Methods Goal: infer missing characters in r e c e Particle 0.5 replace 0.5 retrace J. Steinhardt & P. Liang (Stanford) Filtering with Abstract Particles May 1, 2013 3 / 12

  5. Variational vs. Particle Methods Goal: infer missing characters in r e c e Particle Actual 0.5 replace 0.33 replace 0.5 retrace 0.33 retrace 0.33 rejoice 0.01 . . . J. Steinhardt & P. Liang (Stanford) Filtering with Abstract Particles May 1, 2013 3 / 12

  6. Variational vs. Particle Methods Goal: infer missing characters in r e c e Particle Actual Variational 0.5 replace 0.33 replace 0.33 j 0.33 l 0.66 a 0.5 retrace 0.33 retrace r e c e 0.33 p 0.33 o 0.33 i 0.33 t 0.33 r 0.33 rejoice 0.01 . . . J. Steinhardt & P. Liang (Stanford) Filtering with Abstract Particles May 1, 2013 3 / 12

  7. Variational vs. Particle Methods Goal: infer missing characters in r e c e Particle Actual Variational 0.5 replace 0.33 replace 0.33 j 0.33 l 0.66 a 0.5 retrace 0.33 retrace r e c e 0.33 p 0.33 o 0.33 i 0.33 t 0.33 r 0.33 rejoice 0.01 . . . Particles provide precision but lack coverage , while variational inference lacks precision. J. Steinhardt & P. Liang (Stanford) Filtering with Abstract Particles May 1, 2013 3 / 12

  8. Our Proposal Define approximations over intermediate regions . variational re ⋆⋆⋆ ce particle replace retrace J. Steinhardt & P. Liang (Stanford) Filtering with Abstract Particles May 1, 2013 4 / 12

  9. Our Proposal Define approximations over intermediate regions . variational re ⋆⋆⋆ ce particle replace retrace rejoice J. Steinhardt & P. Liang (Stanford) Filtering with Abstract Particles May 1, 2013 4 / 12

  10. Our Proposal Define approximations over intermediate regions . variational re ⋆⋆⋆ ce intermediate re ⋆⋆ ace re ⋆⋆ ice particle replace retrace rejoice J. Steinhardt & P. Liang (Stanford) Filtering with Abstract Particles May 1, 2013 4 / 12

  11. Our Proposal Define approximations over intermediate regions . variational re ⋆⋆⋆ ce intermediate re ⋆⋆ ace re ⋆⋆ ice particle replace retrace rejoice Goal. Stitch together approximations at multiple levels to simultaneously obtain precision (from lower levels) and coverage (from higher levels). J. Steinhardt & P. Liang (Stanford) Filtering with Abstract Particles May 1, 2013 4 / 12

  12. Stitching Together Models Question. How to combine the different models? re ⋆⋆⋆ ce re ⋆⋆ ace re ⋆⋆ ice replace rejoice retrace J. Steinhardt & P. Liang (Stanford) Filtering with Abstract Particles May 1, 2013 5 / 12

  13. Stitching Together Models Question. How to combine the different models? re ⋆⋆⋆ ce re ⋆⋆ ace re ⋆⋆ ice replace rejoice retrace ⇓ Answer. Just use most precise model available at each point (relies on nested structure, e.g. the regions form a hierarchical decomposition ). J. Steinhardt & P. Liang (Stanford) Filtering with Abstract Particles May 1, 2013 5 / 12

  14. Generalizing the Construction Let X be some space. Suppose we have a hierarchical decomposition A ⊆ 2 X together with an approximation ˆ f a to f ∗ defined on each region a ∈ A . r e c e ⋆ ⋆ ⋆ r e a c e ⋆ ⋆ r e a c e l ⋆ p r e a c e l J. Steinhardt & P. Liang (Stanford) Filtering with Abstract Particles May 1, 2013 6 / 12

  15. Generalizing the Construction Let X be some space. Suppose we have a hierarchical decomposition A ⊆ 2 X together with an approximation ˆ f a to f ∗ defined on each region a ∈ A . If a = { x 0 } is a singleton set, can have r e c e ⋆ ⋆ ⋆ ˆ f a ( x 0 ) = f ∗ ( x 0 ) . r e a c e ⋆ ⋆ r e a c e l ⋆ p r e a c e l J. Steinhardt & P. Liang (Stanford) Filtering with Abstract Particles May 1, 2013 6 / 12

  16. Generalizing the Construction Let X be some space. Suppose we have a hierarchical decomposition A ⊆ 2 X together with an approximation ˆ f a to f ∗ defined on each region a ∈ A . If a = { x 0 } is a singleton set, can have r e c e ⋆ ⋆ ⋆ ˆ f a ( x 0 ) = f ∗ ( x 0 ) . If a = X , will need to drop most of the r e a c e ⋆ ⋆ dependencies. r e a c e l ⋆ p r e a c e l J. Steinhardt & P. Liang (Stanford) Filtering with Abstract Particles May 1, 2013 6 / 12

  17. Generalizing the Construction Let X be some space. Suppose we have a hierarchical decomposition A ⊆ 2 X together with an approximation ˆ f a to f ∗ defined on each region a ∈ A . If a = { x 0 } is a singleton set, can have r e c e ⋆ ⋆ ⋆ ˆ f a ( x 0 ) = f ∗ ( x 0 ) . If a = X , will need to drop most of the r e a c e ⋆ ⋆ dependencies. r e a c e For intermediate values of a (for instance, l ⋆ fixing the values of certain variables) can keep some subset of the dependencies. p r e a c e l J. Steinhardt & P. Liang (Stanford) Filtering with Abstract Particles May 1, 2013 6 / 12

  18. Generalizing the Construction Let X be some space. Suppose we have a hierarchical decomposition A ⊆ 2 X together with an approximation ˆ f a to f ∗ defined on each region a ∈ A . If a = { x 0 } is a singleton set, can have r e c e ⋆ ⋆ ⋆ ˆ f a ( x 0 ) = f ∗ ( x 0 ) . If a = X , will need to drop most of the r e a c e ⋆ ⋆ dependencies. r e a c e For intermediate values of a (for instance, l ⋆ fixing the values of certain variables) can keep some subset of the dependencies. p r e a c e l f ( x ) def Set ˆ = ˆ f a ( x ) , where a is the smallest region containing x . Can think of each region a ∈ A as an abstract particle . J. Steinhardt & P. Liang (Stanford) Filtering with Abstract Particles May 1, 2013 6 / 12

  19. Inference If ˆ f is constructed as in the previous slide, then we can compute normalization constant Z as long as we can compute ∑ x ∈ b ˆ f a ( x ) for all regions b ⊆ a . Proof by picture: re ⋆⋆⋆ ce re ⋆⋆ ace re ⋆⋆ ice replace rejoice retrace ⇓ J. Steinhardt & P. Liang (Stanford) Filtering with Abstract Particles May 1, 2013 7 / 12

  20. Inference If ˆ f is constructed as in the previous slide, then we can compute normalization constant Z as long as we can compute ∑ x ∈ b ˆ f a ( x ) for all regions b ⊆ a . Proof by picture: re ⋆⋆⋆ ce re ⋆⋆ ace re ⋆⋆ ice replace rejoice retrace ⇓ J. Steinhardt & P. Liang (Stanford) Filtering with Abstract Particles May 1, 2013 7 / 12

  21. Inference If ˆ f is constructed as in the previous slide, then we can compute normalization constant Z as long as we can compute ∑ x ∈ b ˆ f a ( x ) for all regions b ⊆ a . Proof by picture: re ⋆⋆⋆ ce − + re ⋆⋆ ace re ⋆⋆ ice replace rejoice retrace ⇓ J. Steinhardt & P. Liang (Stanford) Filtering with Abstract Particles May 1, 2013 7 / 12

  22. Inference If ˆ f is constructed as in the previous slide, then we can compute normalization constant Z as long as we can compute ∑ x ∈ b ˆ f a ( x ) for all regions b ⊆ a . Proof by picture: re ⋆⋆⋆ ce − + re ⋆⋆ ace re ⋆⋆ ice − + replace rejoice retrace ⇓ J. Steinhardt & P. Liang (Stanford) Filtering with Abstract Particles May 1, 2013 7 / 12

  23. Inference If ˆ f is constructed as in the previous slide, then we can compute normalization constant Z as long as we can compute ∑ x ∈ b ˆ f a ( x ) for all regions b ⊆ a . Proof by picture: re ⋆⋆⋆ ce − + re ⋆⋆ ace re ⋆⋆ ice − + replace rejoice retrace = ⇓ J. Steinhardt & P. Liang (Stanford) Filtering with Abstract Particles May 1, 2013 7 / 12

  24. A Family of Approximations A hierarchical decomposition A leads to an approximation ˆ f . We would like to define a family of approximations and choose the best one. Key idea. Every subset B of a hierarchical decomposition A is itself a hierarchical decomposition. Can let A have large cardinality and search for a small subset B that yields a good approximation. Example: A : re ⋆⋆⋆ ce re ⋆⋆ ace re ⋆⋆ ice replace retrace rejoice J. Steinhardt & P. Liang (Stanford) Filtering with Abstract Particles May 1, 2013 8 / 12

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend