recurrent pixel embedding for grouping shu kong
play

Recurrent Pixel Embedding for Grouping Shu Kong CS, ICS, UCI - PowerPoint PPT Presentation

Recurrent Pixel Embedding for Grouping Shu Kong CS, ICS, UCI Outline 1. Problem Statement -- Pixel Grouping 2. Pixel-Pair Spherical Max-Margin Embedding 3. Recurrent Mean Shift Grouping 4. Experiment 5. Conclusion and Extension Note: the


  1. Recurrent Pixel Embedding for Grouping Shu Kong CS, ICS, UCI

  2. Outline 1. Problem Statement -- Pixel Grouping 2. Pixel-Pair Spherical Max-Margin Embedding 3. Recurrent Mean Shift Grouping 4. Experiment 5. Conclusion and Extension Note: the slides were made before paper submission, please treat them as supplemental material and refer to the paper for updated content.

  3. Pixel Labeling Tasks diving into pixels --

  4. Pixel Labeling: Low-Level Vision Tasks diving into pixels -- Low-level vision: edge, boundary, contour

  5. Pixel Labeling: Mid-Level Vision Tasks diving into pixels -- Low-level vision: edge, boundary, contour Mid-level vision: object proposal

  6. Pixel Labeling: High-Level Vision Tasks diving into pixels -- Low-level vision: edge, boundary, contour Mid-level vision: object proposal High-level vision: semantic segmentation instance-level semantic segmentation

  7. Pixel Labeling: Learning Tasks diving into pixels -- Low-level vision: logistic loss edge, boundary, contour Mid-level vision: logistic loss for score object proposal regression for location High-level vision: semantic segmentation logistic loss for mask&score instance-level semantic segmentation cross-entropy for category

  8. Pixel Labeling: New Framework A new framework consisting of two novel modules --

  9. Pixel Labeling: New Framework This framework is A new framework consisting of two novel modules -- agnostic to architecture, so ignore deep learning for now!

  10. Pixel Labeling: New Framework A new framework consisting of two novel modules -- 1. pixel-pair spherical max-margin regression 2. recurrent mean shift grouping

  11. Pixel Labeling: New Framework A new framework consisting of two novel modules -- 1. pixel-pair spherical max-margin regression l learning an embedding space on the hyper-sphere such that • if meeting the pair-wise criterion , learn to push pixels to be close to each other, e.g. both are boundaries, from same instance; • if not, learn to pull them apart. 2. recurrent mean shift grouping

  12. Pixel Labeling: New Framework A new framework consisting of two novel modules -- 1. pixel-pair spherical max-margin regression l learning an embedding space on the hyper-sphere such that • if meeting the pair-wise criterion , learn to push pixels to be close to each other; e.g. both are boundaries, from same instance; • if not, learn to pull them apart. 2. recurrent mean shift grouping iteratively group the pixels into discrete clusters, such as criteria: l boundary vs. non-boundary; object proposals; semantic segments

  13. Pixel-Pair Spherical Max-Margin Regression

  14. Pixel-Pair Spherical Max-Margin Regression date back to Fisher Linear discriminant analysis (LDA)

  15. Pixel-Pair Spherical Max-Margin Regression date back to Fisher Linear discriminant analysis (LDA) To utilize the label information in finding informative projection, maximizing the following objective where

  16. Pixel-Pair Spherical Max-Margin Regression What loss functions can we use at pixel-level?

  17. Pixel-Pair Spherical Max-Margin Regression What loss functions can we use at pixel-level? Principle -- 1. for positive pairs of pixels (meeting the criterion), minimizing the pair-wise discrepancy/distance; 2. for negative pairs, minimizing the similarity.

  18. Pixel-Pair Spherical Max-Margin Regression What loss functions can we use at pixel-level? Principle -- 1. for positive pairs of pixels (meeting the criterion), minimizing the pair-wise discrepancy/distance; 2. for negative pairs, minimizing the similarity. Bert De Brabandere, Davy Neven, Luc Van GoolSemantic Instance Segmentation with a Discriminative Loss Function, arxiv, 2017

  19. Pixel-Pair Spherical Max-Margin Regression What loss functions can we use at pixel-level? Principle -- 1. for positive pairs of pixels (meeting the criterion), minimizing the pair-wise discrepancy/distance; 2. for negative pairs, minimizing the similarity. for example: Euclidean distance between pixel feature vectors for measuring distance. Its inverse, or Gaussian transform, can measure the similarity. ..... Bert De Brabandere, Davy Neven, Luc Van GoolSemantic Instance Segmentation with a Discriminative Loss Function, arxiv, 2017 Alejandro Newell, Jia Deng, Associative Embedding: End-to-End Learning for Joint Detection and Grouping, NIPS, 2017 Alireza Fathi, Zbigniew Wojna, Vivek Rathod, Peng Wang, Hyun Oh Song, Sergio Guadarrama, Kevin P. Murphy, Semantic Instance Segmentation via Deep Metric Learning

  20. Pixel-Pair Spherical Max-Margin Regression We propose the module to learn a hyper-sphere (embedding space), such that positive pairs have high cosine similarity; negative pairs have low cosine similarity.

  21. Pixel-Pair Spherical Max-Margin Regression Why cosine similarity? E. B. Saff and A. B. Kuijlaars. Distributing many points on a sphere. The mathematical intelligencer, 19(1):5–11, 1997. L. Lovisolo ; E.A.B. da Silva, uniform distribution of points on a hyper-sphere with applications to vector bit-plane encoding, IEE Proc. Vision, Image and Signal Processing, 2001

  22. Pixel-Pair Spherical Max-Margin Regression Why cosine similarity? 1. scale-invariant to the length of feature vector; E. B. Saff and A. B. Kuijlaars. Distributing many points on a sphere. The mathematical intelligencer, 19(1):5–11, 1997. L. Lovisolo ; E.A.B. da Silva, uniform distribution of points on a hyper-sphere with applications to vector bit-plane encoding, IEE Proc. Vision, Image and Signal Processing, 2001

  23. Pixel-Pair Spherical Max-Margin Regression Why cosine similarity? 1. scale-invariant to the length of feature vector; 2. easy to analyze how to set hyper-parameters; E. B. Saff and A. B. Kuijlaars. Distributing many points on a sphere. The mathematical intelligencer, 19(1):5–11, 1997. L. Lovisolo ; E.A.B. da Silva, uniform distribution of points on a hyper-sphere with applications to vector bit-plane encoding, IEE Proc. Vision, Image and Signal Processing, 2001

  24. Pixel-Pair Spherical Max-Margin Regression Why cosine similarity? 1. scale-invariant to the length of feature vector; 2. easy to analyze how to set hyper-parameters; E. B. Saff and A. B. Kuijlaars. Distributing many points on a sphere. The mathematical intelligencer, 19(1):5–11, 1997. L. Lovisolo ; E.A.B. da Silva, uniform distribution of points on a hyper-sphere with applications to vector bit-plane encoding, IEE Proc. Vision, Image and Signal Processing, 2001

  25. Pixel-Pair Spherical Max-Margin Regression We use the calibrated cosine similarity as below

  26. Pixel-Pair Spherical Max-Margin Regression We use the calibrated cosine similarity as below loss function contains postive and negative pairs

  27. Pixel-Pair Spherical Max-Margin Regression We use the calibrated cosine similarity as below loss function contains postive and negative pairs alpha is the margin, hyper parameter to be set.

  28. Pixel-Pair Spherical Max-Margin Regression We use the calibrated cosine similarity as below loss function contains postive and negative pairs alpha is the margin, hyper parameter to be set. Gradient is one, didn't penalize hard pixels in sensitive regions, say nearby boundary, segments, etc.

  29. Pixel-Pair Spherical Max-Margin Regression Important theories 1. the loss has a lower bound, minimum; 2. the lower bound does not depend on the dimension of the embedding space.

  30. Pixel-Pair Spherical Max-Margin Regression 2D case

  31. Pixel-Pair Spherical Max-Margin Regression 3D case

  32. Pixel-Pair Spherical Max-Margin Regression https://en.wikipedia.org/wiki/N-sphere

  33. Pixel-Pair Spherical Max-Margin Regression https://en.wikipedia.org/wiki/N-sphere

  34. Pixel-Pair Spherical Max-Margin Regression https://en.wikipedia.org/wiki/N-sphere

  35. Pixel-Pair Spherical Max-Margin Regression One more

  36. Pixel-Pair Spherical Max-Margin Regression Last one -- Combination-aware Weighting

  37. Recurrent Mean Shift Grouping From good embedding space to pixel labeling How to get the instances? How to group the pixels?

  38. Recurrent Mean Shift Grouping From good embedding space to pixel labeling How to get the instances? How to group the pixels? k-means, k-medoids?

  39. Recurrent Mean Shift Grouping From good embedding space to pixel labeling How to get the instances? How to group the pixels? k-means, k-medoids? mean shift

  40. Recurrent Mean Shift Grouping mean shift R.Collins, CSE, PSU, CSE598G Spring 2006

  41. Recurrent Mean Shift Grouping mean shift R.Collins, CSE, PSU, CSE598G Spring 2006

  42. Recurrent Mean Shift Grouping mean shift K. Fukunaga, L. Hostetler, The Estimation of the Gradient of a Density Function, with Applications in Pattern - Recognition, PAMI, 1975

  43. Recurrent Mean Shift Grouping mean shift Other than estimating the PDF directly, estimating the gradient --

  44. Recurrent Mean Shift Grouping mean shift then

  45. Recurrent Mean Shift Grouping mean shift: iteratively updating by shifting the data by such an amount

  46. Recurrent Mean Shift Grouping mean shift: iteratively updating by shifting the data by such an amount

  47. Recurrent Mean Shift Grouping mean shift: iteratively updating by shifting the data by such an amount Gaussian blurring mean-shift (GBMS) algorithm the new iterate is the data average under the posterior probabilities given the current iterate:

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend