Recurrent Pixel Embedding for Grouping. Shu Kong, CS, ICS, UCI.



slide-1
SLIDE 1

Shu Kong

CS, ICS, UCI

Recurrent Pixel Embedding for Grouping

slide-2
SLIDE 2
Outline

  • 1. Problem Statement -- Pixel Grouping
  • 2. Pixel-Pair Spherical Max-Margin Embedding
  • 3. Recurrent Mean Shift Grouping
  • 4. Experiment
  • 5. Conclusion and Extension

Note: these slides were made before paper submission; please treat them as supplemental material and refer to the paper for updated content.

slide-3
SLIDE 3

Pixel Labeling

Tasks diving into pixels --

slide-4
SLIDE 4

Pixel Labeling: Low-Level Vision

Tasks diving into pixels -- Low-level vision: edge, boundary, contour

slide-5
SLIDE 5

Pixel Labeling: Mid-Level Vision

Tasks diving into pixels -- Low-level vision: edge, boundary, contour Mid-level vision:

  • object proposal
slide-6
SLIDE 6

Pixel Labeling: High-Level Vision

Tasks diving into pixels -- Low-level vision: edge, boundary, contour Mid-level vision:

  • object proposal

High-level vision: semantic segmentation instance-level semantic segmentation

slide-7
SLIDE 7

Pixel Labeling: Learning

Tasks diving into pixels -- Low-level vision: edge, boundary, contour Mid-level vision:

  • object proposal

High-level vision: semantic segmentation, instance-level semantic segmentation. Typical per-task losses: logistic loss for score regression, logistic loss for location, logistic loss for mask & score, cross-entropy for category.

slide-8
SLIDE 8

Pixel Labeling: New Framework A new framework consisting of two novel modules --

slide-9
SLIDE 9

Pixel Labeling: New Framework A new framework consisting of two novel modules --

This framework is agnostic to architecture, so ignore deep learning for now!

slide-10
SLIDE 10

Pixel Labeling: New Framework A new framework consisting of two novel modules --

  • 1. pixel-pair spherical max-margin regression
  • 2. recurrent mean shift grouping
slide-11
SLIDE 11

Pixel Labeling: New Framework A new framework consisting of two novel modules --

  • 1. pixel-pair spherical max-margin regression

learning an embedding space on the hyper-sphere such that

  • if a pair meets the pair-wise criterion (e.g. both pixels are boundaries, or come from the same instance), learn to push them close to each other;

  • if not, learn to pull them apart.
  • 2. recurrent mean shift grouping
slide-12
SLIDE 12

Pixel Labeling: New Framework A new framework consisting of two novel modules --

  • 1. pixel-pair spherical max-margin regression

learning an embedding space on the hyper-sphere such that

  • if a pair meets the pair-wise criterion (e.g. both pixels are boundaries, or come from the same instance), learn to push them close to each other;

  • if not, learn to pull them apart.
  • 2. recurrent mean shift grouping

iteratively group the pixels into discrete clusters according to criteria such as boundary vs. non-boundary, object proposals, or semantic segments

slide-13
SLIDE 13

Pixel-Pair Spherical Max-Margin Regression

slide-14
SLIDE 14

Pixel-Pair Spherical Max-Margin Regression

Dates back to Fisher's linear discriminant analysis (LDA)

slide-15
SLIDE 15

Pixel-Pair Spherical Max-Margin Regression

Dates back to Fisher's linear discriminant analysis (LDA). To utilize label information in finding an informative projection, LDA maximizes the following objective:
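The objective itself did not survive extraction; the standard Fisher criterion (two-class case) is the ratio of between-class to within-class scatter:

```latex
J(\mathbf{w}) = \frac{\mathbf{w}^\top S_B \,\mathbf{w}}{\mathbf{w}^\top S_W \,\mathbf{w}},
\qquad
S_B = (\boldsymbol{\mu}_1 - \boldsymbol{\mu}_2)(\boldsymbol{\mu}_1 - \boldsymbol{\mu}_2)^\top,
\quad
S_W = \sum_{c=1}^{2} \sum_{i \in c} (\mathbf{x}_i - \boldsymbol{\mu}_c)(\mathbf{x}_i - \boldsymbol{\mu}_c)^\top
```

where the μ_c are the class means; maximizing J(w) finds the projection that best separates the labeled classes.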

slide-16
SLIDE 16

Pixel-Pair Spherical Max-Margin Regression

What loss functions can we use at pixel-level?

slide-17
SLIDE 17

Pixel-Pair Spherical Max-Margin Regression

What loss functions can we use at pixel-level? Principle --

  • 1. for positive pairs of pixels (those meeting the criterion), minimize the pair-wise discrepancy/distance;
  • 2. for negative pairs, minimize the similarity.
slide-18
SLIDE 18

Pixel-Pair Spherical Max-Margin Regression

What loss functions can we use at pixel-level? Principle --

  • 1. for positive pairs of pixels (those meeting the criterion), minimize the pair-wise discrepancy/distance;
  • 2. for negative pairs, minimize the similarity.

Bert De Brabandere, Davy Neven, Luc Van Gool, Semantic Instance Segmentation with a Discriminative Loss Function, arXiv, 2017

slide-19
SLIDE 19

Pixel-Pair Spherical Max-Margin Regression

What loss functions can we use at pixel-level? Principle --

  • 1. for positive pairs of pixels (those meeting the criterion), minimize the pair-wise discrepancy/distance;
  • 2. for negative pairs, minimize the similarity.

For example: the Euclidean distance between pixel feature vectors measures discrepancy; its inverse, or a Gaussian transform of it, measures similarity.

Bert De Brabandere, Davy Neven, Luc Van Gool, Semantic Instance Segmentation with a Discriminative Loss Function, arXiv, 2017
Alejandro Newell, Jia Deng, Associative Embedding: End-to-End Learning for Joint Detection and Grouping, NIPS, 2017
Alireza Fathi, Zbigniew Wojna, Vivek Rathod, Peng Wang, Hyun Oh Song, Sergio Guadarrama, Kevin P. Murphy, Semantic Instance Segmentation via Deep Metric Learning

slide-20
SLIDE 20

Pixel-Pair Spherical Max-Margin Regression

We propose the module to learn a hyper-sphere (embedding space), such that positive pairs have high cosine similarity; negative pairs have low cosine similarity.

slide-21
SLIDE 21

Pixel-Pair Spherical Max-Margin Regression

Why cosine similarity?

  • E. B. Saff and A. B. Kuijlaars. Distributing many points on a sphere. The mathematical intelligencer, 19(1):5–11, 1997.
  • L. Lovisolo, E. A. B. da Silva, Uniform Distribution of Points on a Hyper-sphere with Applications to Vector Bit-plane Encoding, IEE Proc. Vision, Image and Signal Processing, 2001
slide-22
SLIDE 22

Pixel-Pair Spherical Max-Margin Regression

Why cosine similarity?

  • 1. scale-invariant to the length of feature vector;
  • E. B. Saff and A. B. Kuijlaars. Distributing many points on a sphere. The mathematical intelligencer, 19(1):5–11, 1997.
  • L. Lovisolo, E. A. B. da Silva, Uniform Distribution of Points on a Hyper-sphere with Applications to Vector Bit-plane Encoding, IEE Proc. Vision, Image and Signal Processing, 2001
slide-23
SLIDE 23

Pixel-Pair Spherical Max-Margin Regression

Why cosine similarity?

  • 1. scale-invariant to the length of feature vector;
  • 2. easy to analyze how to set hyper-parameters;
  • E. B. Saff and A. B. Kuijlaars. Distributing many points on a sphere. The mathematical intelligencer, 19(1):5–11, 1997.
  • L. Lovisolo, E. A. B. da Silva, Uniform Distribution of Points on a Hyper-sphere with Applications to Vector Bit-plane Encoding, IEE Proc. Vision, Image and Signal Processing, 2001
slide-24
SLIDE 24

Pixel-Pair Spherical Max-Margin Regression

Why cosine similarity?

  • 1. scale-invariant to the length of feature vector;
  • 2. easy to analyze how to set hyper-parameters;
  • E. B. Saff and A. B. Kuijlaars. Distributing many points on a sphere. The mathematical intelligencer, 19(1):5–11, 1997.
  • L. Lovisolo, E. A. B. da Silva, Uniform Distribution of Points on a Hyper-sphere with Applications to Vector Bit-plane Encoding, IEE Proc. Vision, Image and Signal Processing, 2001
slide-25
SLIDE 25

Pixel-Pair Spherical Max-Margin Regression

We use the calibrated cosine similarity as below

slide-26
SLIDE 26

Pixel-Pair Spherical Max-Margin Regression

We use the calibrated cosine similarity as below. The loss function contains positive and negative pairs.

slide-27
SLIDE 27

Pixel-Pair Spherical Max-Margin Regression

We use the calibrated cosine similarity as below. The loss function contains positive and negative pairs. Alpha is the margin, a hyper-parameter to be set.

slide-28
SLIDE 28

Pixel-Pair Spherical Max-Margin Regression

We use the calibrated cosine similarity as below. The loss function contains positive and negative pairs. Alpha is the margin, a hyper-parameter to be set. The gradient is constant (one), so hard pixels in sensitive regions, say near boundaries or segment borders, are not penalized more heavily.
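As a concrete sketch of how the calibrated similarity and margin interact, here is a minimal NumPy version of one pixel-pair term. The exact placement of the margin thresholds (alpha for positives, 1 - alpha for negatives) is an illustrative assumption, not the paper's verbatim formulation:

```python
import numpy as np

def calibrated_cosine(x, y):
    # Unit-length embeddings, so the dot product is the cosine similarity;
    # shift it from [-1, 1] into [0, 1].
    return (1.0 + float(x @ y)) / 2.0

def pair_loss(x, y, same_group, alpha):
    """Max-margin hinge loss on one pixel pair (illustrative sketch)."""
    s = calibrated_cosine(x, y)
    if same_group:
        return max(0.0, alpha - s)          # push positive pairs above alpha
    return max(0.0, s - (1.0 - alpha))      # pull negative pairs below 1 - alpha

a = np.array([1.0, 0.0, 0.0])   # toy unit embeddings
b = np.array([0.0, 1.0, 0.0])
```

Note the hinge gradient with respect to s has magnitude one wherever the loss is active, which is exactly the "gradient is constant" point on this slide.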

slide-29
SLIDE 29

Pixel-Pair Spherical Max-Margin Regression

Important theories

  • 1. the loss has a lower bound (a minimum);
  • 2. the lower bound does not depend on the dimension of the embedding space.

slide-30
SLIDE 30

Pixel-Pair Spherical Max-Margin Regression

2D case

slide-31
SLIDE 31

Pixel-Pair Spherical Max-Margin Regression

3D case

slide-32
SLIDE 32

Pixel-Pair Spherical Max-Margin Regression

https://en.wikipedia.org/wiki/N-sphere

slide-33
SLIDE 33

Pixel-Pair Spherical Max-Margin Regression

https://en.wikipedia.org/wiki/N-sphere

slide-34
SLIDE 34

Pixel-Pair Spherical Max-Margin Regression

https://en.wikipedia.org/wiki/N-sphere

slide-35
SLIDE 35

Pixel-Pair Spherical Max-Margin Regression

One more

slide-36
SLIDE 36

Pixel-Pair Spherical Max-Margin Regression

Last one -- Combination-aware Weighting

slide-37
SLIDE 37

Recurrent Mean Shift Grouping

From a good embedding space to pixel labeling: how do we get the instances? How do we group the pixels?

slide-38
SLIDE 38

Recurrent Mean Shift Grouping

From a good embedding space to pixel labeling: how do we get the instances? How do we group the pixels? k-means, k-medoids?

slide-39
SLIDE 39

Recurrent Mean Shift Grouping

From a good embedding space to pixel labeling: how do we get the instances? How do we group the pixels? k-means, k-medoids? Mean shift!

slide-40
SLIDE 40

Recurrent Mean Shift Grouping

mean shift

R. Collins, CSE, PSU, CSE598G, Spring 2006

slide-41
SLIDE 41

Recurrent Mean Shift Grouping

mean shift

R. Collins, CSE, PSU, CSE598G, Spring 2006

slide-42
SLIDE 42

Recurrent Mean Shift Grouping

mean shift

  • K. Fukunaga, L. Hostetler, The Estimation of the Gradient of a Density Function, with Applications in Pattern Recognition, IEEE Trans. Information Theory, 1975
slide-43
SLIDE 43

Recurrent Mean Shift Grouping

mean shift: rather than estimating the PDF directly, estimate its gradient --

slide-44
SLIDE 44

Recurrent Mean Shift Grouping

mean shift then

slide-45
SLIDE 45

Recurrent Mean Shift Grouping

mean shift: iteratively updating by shifting the data by such an amount
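"Such an amount" is the classic mean-shift vector of Fukunaga & Hostetler: with kernel K and bandwidth h, each point moves to the kernel-weighted average of the data,

```latex
\mathbf{m}(\mathbf{x}) \;=\; \frac{\sum_{i} K\!\left(\tfrac{\mathbf{x}-\mathbf{x}_i}{h}\right)\mathbf{x}_i}{\sum_{i} K\!\left(\tfrac{\mathbf{x}-\mathbf{x}_i}{h}\right)} \;-\; \mathbf{x},
\qquad
\mathbf{x} \;\leftarrow\; \mathbf{x} + \mathbf{m}(\mathbf{x})
```

so the shift m(x) points in the direction of the estimated density gradient.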

slide-46
SLIDE 46

Recurrent Mean Shift Grouping

mean shift: iteratively updating by shifting the data by such an amount

slide-47
SLIDE 47

Recurrent Mean Shift Grouping

mean shift: iteratively updating by shifting the data by such an amount. Gaussian blurring mean-shift (GBMS) algorithm: the new iterate is the data average under the posterior probabilities given the current iterate.
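A minimal NumPy sketch of that blurred update under a Gaussian kernel (the toy data and sigma are illustrative choices):

```python
import numpy as np

def gbms_step(X, sigma):
    """One Gaussian blurring mean-shift (GBMS) iteration: every point is
    replaced by the average of all points, weighted by Gaussian posteriors
    given the current iterate."""
    d2 = ((X[:, None, :] - X[None, :, :]) ** 2).sum(-1)  # pairwise squared distances
    P = np.exp(-d2 / (2.0 * sigma ** 2))
    P /= P.sum(axis=1, keepdims=True)                    # row-stochastic posteriors
    return P @ X                                         # blur: move every point to its weighted mean

# two well-separated toy clusters
X = np.array([[0.0, 0.0], [0.1, 0.0], [5.0, 5.0], [5.1, 5.0]])
for _ in range(20):
    X = gbms_step(X, sigma=0.3)
# each cluster collapses onto its own centroid
```

Unlike plain mean shift, GBMS updates the dataset itself each iteration, which is why it converges quickly toward cluster collapse.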

slide-48
SLIDE 48

Recurrent Mean Shift Grouping

Gaussian blurring mean-shift (GBMS) algorithm

Miguel A. Carreira-Perpinán, Fast Nonparametric Clustering with Gaussian Blurring Mean-Shift, ICML, 2006

slide-49
SLIDE 49

Recurrent Mean Shift Grouping

Gaussian blurring mean-shift (GBMS) algorithm

It's guaranteed to converge, without gradient vanishing or exploding.

Miguel A. Carreira-Perpinán, Fast Nonparametric Clustering with Gaussian Blurring Mean-Shift, ICML, 2006

slide-50
SLIDE 50

Recurrent Mean Shift Grouping

Gaussian blurring mean-shift (GBMS) algorithm

Miguel A. Carreira-Perpinán, Fast Nonparametric Clustering with Gaussian Blurring Mean-Shift, ICML, 2006

It's guaranteed to converge, without gradient vanishing or exploding.

But, are the updated data still on the sphere?

slide-51
SLIDE 51

Recurrent Mean Shift Grouping

L2 normalization in the loop

Takumi Kobayashi, Nobuyuki Otsu, Von Mises-Fisher Mean Shift for Clustering on a Hypersphere, ICPR, 2010

It's guaranteed to converge, without gradient vanishing or exploding.
slide-52
SLIDE 52

Recurrent Mean Shift Grouping

running the von Mises-Fisher mean shift offline

slide-53
SLIDE 53

Recurrent Mean Shift Grouping

mean shift as recurrent module

slide-54
SLIDE 54

Recurrent Mean Shift Grouping

mean shift as recurrent module

slide-55
SLIDE 55

Recurrent Mean Shift Grouping

mean shift as recurrent module

slide-56
SLIDE 56

Recurrent Mean Shift Grouping

mean shift grouping in the loop

slide-57
SLIDE 57

Recurrent Mean Shift Grouping

What do we mean by the mean shift gradient?

slide-58
SLIDE 58

Recurrent Mean Shift Grouping

What do we mean by the mean shift gradient?

slide-59
SLIDE 59

Recurrent Mean Shift Grouping

mean shift grouping in the loop (figure panels: input image, loop-0, loop-5)

slide-60
SLIDE 60

Learning to Group

Low-level vision: edge, boundary, contour. Mid-level vision:

  • object proposal

High-level vision: semantic segmentation, instance-level semantic segmentation. End-to-end trainable from data, with the cross-entropy loss.

slide-61
SLIDE 61

Backbone

K He, X Zhang, S Ren, J Sun, Deep Residual Learning for Image Recognition, CVPR, 2016

architecture agnostic -- we use ResNet

slide-62
SLIDE 62

Experiment: Boundary Detection

boundary detection

  • one of the most imbalanced problems: 85% of pixels are non-boundary

1. learn a 3-dim embedding space with our loss; 2. after convergence, add a logistic loss and fine-tune; 3. average multiple outputs at resBlock2~5, followed by thinning (NMS).

slide-63
SLIDE 63

Experiment: Boundary Detection

visualize the 3-dim embedding maps as an RGB image, before & after fine-tuning with the logistic loss
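One simple way to render a 3-dim spherical embedding as RGB, shown as an assumption about the visualization rather than the paper's exact mapping, is to shift each unit-vector coordinate from [-1, 1] into [0, 1]:

```python
import numpy as np

def embedding_to_rgb(emb):
    """Map an (H, W, 3) pixel embedding to an RGB image in [0, 1]:
    renormalize each pixel onto the unit sphere, then rescale each
    coordinate from [-1, 1] to [0, 1]."""
    emb = emb / np.linalg.norm(emb, axis=-1, keepdims=True)
    return (emb + 1.0) / 2.0

# a 1x1 "image" whose pixel embedding points along the first axis
demo = embedding_to_rgb(np.array([[[1.0, 0.0, 0.0]]]))
```

Nearby embeddings then get nearby colors, which is what makes the RGB maps on these slides readable.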

slide-64
SLIDE 64

Experiment: Boundary Detection

visualize the 3-dim embedding maps as an RGB image, before & after fine-tuning with the logistic loss

slide-65
SLIDE 65

Experiment: Boundary Detection

quantitative comparison

slide-66
SLIDE 66

Experiment: Boundary Detection

test image and input image (figure panels)

slide-67
SLIDE 67

Experiment: Boundary Detection

test image, aesthetically colorful (figure panels: input image, res2, res3, res4, res5, rand-proj)

slide-68
SLIDE 68

Experiment: Boundary Detection

[zoom-in] encoding orientation, distance transform

slide-69
SLIDE 69

Experiment: Boundary Detection

[zoom-in] encoding orientation, distance transform; the Möbius strip

slide-70
SLIDE 70

Experiment: Object Proposal Detection

  • object proposal detection

class-agnostic; reduces the search space for subsequent tasks, e.g. object detection. The proposed framework is particularly suitable for this task. How suitable?

slide-71
SLIDE 71

Experiment: Object Proposal Detection

  • object proposal detection -- How suitable?

Achieving very high average recall (AR) with a dozen proposals per image! Average Recall: recall averaged over the IoU thresholds [0.5:0.05:0.95].
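The AR metric from this slide can be computed as follows; `best_ious` is a hypothetical array holding, for each ground-truth object, the best IoU any proposal achieves with it:

```python
import numpy as np

def average_recall(best_ious):
    """Average Recall (AR): recall averaged over the IoU thresholds
    0.5, 0.55, ..., 0.95."""
    best_ious = np.asarray(best_ious, dtype=float)
    thresholds = np.arange(0.5, 0.951, 0.05)          # ten thresholds
    recalls = [(best_ious >= t).mean() for t in thresholds]
    return float(np.mean(recalls))
```

Averaging over the threshold sweep rewards proposals that localize tightly, not just ones that clear IoU = 0.5.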

slide-72
SLIDE 72

Experiment: Object Proposal Detection

Qualitatively: Ours vs. SharpMask

slide-73
SLIDE 73

Experiment: Semantic Segmentation

Semantic segmentation with the cross-entropy loss. The pixel-pair loss can fill in the “holes”.

slide-74
SLIDE 74

Experiment: Semantic Instance Segmentation

Instance-level semantic segmentation: use the semantic segmentation result to vote for the semantic label within each object proposal.

slide-75
SLIDE 75

Experiment: Semantic Instance Segmentation

Instance-level semantic segmentation: use the semantic segmentation result to vote for the semantic label within each object proposal.

slide-76
SLIDE 76

Experiment: Semantic Instance Segmentation

Instance-level semantic segmentation: use the semantic segmentation result to vote for the semantic label within each object proposal.

slide-77
SLIDE 77

Experiment: Semantic Instance Segmentation

Qualitatively: div8 vs. div4

slide-78
SLIDE 78

Conclusion and Extension

  • The framework is architecture-agnostic, conceptually simple, computationally efficient, practically effective, and theoretically rich;

slide-79
SLIDE 79

Conclusion and Extension

  • The framework is architecture-agnostic, conceptually simple, computationally efficient, practically effective, and theoretically rich;

  • it can be re-purposed for boundary detection, object proposal detection, and generic and instance-level segmentation, spanning low-, mid-, and high-level vision tasks.

slide-80
SLIDE 80

Conclusion and Extension

  • The framework is architecture-agnostic, conceptually simple, computationally efficient, practically effective, and theoretically rich;

  • it can be re-purposed for boundary detection, object proposal detection, and generic and instance-level segmentation, spanning low-, mid-, and high-level vision tasks.

  • Experiments demonstrate that the new framework achieves state-of-the-art performance on all these tasks.

slide-81
SLIDE 81

Reference

  • 1. R. Fisher. Dispersion on a sphere. Proceedings of the Royal Society of London A: Mathematical, Physical and Engineering Sciences, 217:295–305, 1953.
  • 2. E. B. Saff and A. B. Kuijlaars. Distributing many points on a sphere. The Mathematical Intelligencer, 19(1):5–11, 1997.
  • 3. K. Fukunaga and L. Hostetler. The estimation of the gradient of a density function, with applications in pattern recognition. IEEE Transactions on Information Theory, 21(1):32–40, 1975.
  • 4. V. A. Epanechnikov. Non-parametric estimation of a multivariate probability density. Theory of Probability & Its Applications, 14(1):153–158, 1969.

slide-82
SLIDE 82

Thanks