Multiclass object recognition Sharing parts and transfer learning - PowerPoint PPT Presentation

Multiclass object recognition Sharing parts and transfer learning Sharat Chikkerur

Outline  Historical perspective and motivation  Discriminative approach  A. Torralba, K. Murphy, W. Freeman, Sharing visual features for multiclass and multiview object detection, IEEE PAMI 2007  Bayesian approach  (Prelude) R. Fergus, P. Perona, A. Zisserman, Object recognition by unsupervised scaleinvariant learning, CVPR 03  L. FeiFei, R. Fergus and P. Perona. OneShot learning of object categories. PAMI, 2006

Perspective: Template vs parts [Dalal & Triggs 05] [Viola & Jones 01] [Fergus 03] [Fischler73] Sparse representation Dense representation   Rigid and articulate objects Useful for rigid objects   More robust Less robust   Appearance and shape Appearance only   Objects share parts Objects share features   Summary: Partbased representation make more sense!

Motivation: Sharing parts

 Benefits  Learning is faster  Features are reused  Time complexity ~ O(log n) instead of O(n)  Better generalization  Individual parts share training data across classes  Robust to interclass variation  Challenges  Identity of shared parts/classes unknown  Sharing may not follow tree structure  Exhaustive search ~ O(2 P )

How do you share parts?  Create a universal dictionary of parts  Serre et al 07 (HMAX), Ke and Sukhtankar (PCA SIFT)  Learn the shared dictionary of parts  Discriminative  Embed sharing into optimization  Discriminative dictionary (Marial et al 08)  Joint boosting (Torralba et al 07)  Generative  Use unlabeled data to learn prior  Constellation model (FeiFei et al 06)

Discriminative approach A. Torralba, K. Murphy, W. Freeman, Sharing visual features for multiclass and multiview object detection, IEEE PAMI 2007

Recap: Part representation Feature (Appearance + Position)

Recap: Boosting An additive model for combining weak classifiers  Weak classifier:  Algorithm: 

Choosing a weak classifier  For each feature  Evaluate the weighted error  Pick the feature with minimum error

Joint Boosting An additive model that jointly optimizes for all classes  Weak classifier:  h 1 (v,1) h 2 (v,1) H(v,1) = h 1 (v,2) h 2 (v,2) H(v,2) ..... + h 1 (v,3) h 2 (v,3) H(v,3) H(v,4) h 1 (v,4) h 2 (v,4) H(v,5) h 1 (v,5) h 2 (v,5)

Vector valued

Example Joint boosting Independent Feature sharing

Greedy approach  Exhaustive search of all classes ~ O(2 C )  Greedy approach  Select the class with best reduction in error  Insert next class with lowest error  Continue till all classes are selected  Select the best member from the set  Complexity ~ O(C 2 )

Typical behavior  Independent features/ pairs ~ O(N)  Shared features ~ O(log N)

Application: Object categorization Feature (Appearance + Position) Training examples ● Data: 21 object categories ● 2000 candidate features (extracted by random sampling) ● 50 training examples per category

Object categorization: Performance

Object categorization: Shared features

Summary  Joint boosting allows learning of shared parts (even nontree structures)  Learning time reduces from O(N) to O(log N)  Allows scaling to large number of categories  Reduces training sample size (per class)  Useful for multiclass as well as multiview recognition  Wishlist?  Automatic scale selection for features  Handling occlusion

Bayesian approach

Bayes 101: Coin tossing  MLE  Let p : probability of heads  Data : we observed H heads and T tails.  Inference: What is the chance of next head?  P(Head) = p = H/(H+T)  Bayesian  Let p: probability of heads (unknown!), p ~ f(p)  P(Head) = ∫ P(Head|p) f(p) dp  Data: we observed H heads and T tails  p~ f(p|D), still not fixed!  P(Head|D) = ∫ P(Head|p) f(p|D) dp

Learning parameters: Conjugate priors Conjugate prior: Functional form of the prior and posterior  distribution are identical Assumption With no data, we assume that the “shared  coin is likely to be fair Knowledge” Uncertainty based on hyper  parameters p(h)~ B(a,b) After we observe data  Learning  D= (Hheads, Ttails)  the uncertainty in h is altered p(h|D) ~ B(a+H,b+T)

Transfer learning  Discriminative  Given data: Learn shared parameters  New data : Use all old parameters (+ new)  Bayesian  Given data: Learn priors (“assumptions”)  New data : Update priors

A prelude Object class recognition by unsupervised scaleinvariant learning

Constellation model A 1 ,X 1 A 5 ,X 5 A 2 ,X 2 A 4 ,X 4 Torralba et al. ~100 parts A 3 ,X 3 A 1 ,X 1 A 5 ,X 5 A 2 ,X 2 A 4 ,X 4 A 3 ,X 3 Fergus et al < 10 parts

Generative model/Bayesian detection Generative model for shape, appearance and scale  Latent variable H encodes the mapping from part (P) to interest point (N)  Example: N=10,P4, h=[0 3 4 5] or h=[3 1 2 10], |h| ~ O(N P )  Bayesian detection:  MLE approximation

Factorization Appearance  Shape  Scale  Occlusion 

Representation and learning  Position/Shape  Candidate part locations are obtained using Kadir Brady interest point detector  Appearance  Modeled using 11x11 pixels around the interest point (PCA used for reducing dimension)  Learning (EM) X A h S

Example: faces

Example: motorbikes

Comparison (Caltech 4) Models are classspecific  Models are robust to scale variation 

Bayesian approach L. FeiFei, R. Fergus and P. Perona. OneShot learning of object categories. PAMI, 2006

Bayesian approach  Fergus et al (I = X,S,A) MLE approximation  FeiFei et al. Parameter integration

Generative model for shape and appearance Foreground object (integrate over all hypotheses)  Latent variable In the paper,  =1 (  >1 can handle pose variation)  Background (has a single null hypothesis) 

Factorization Appearance  Fergus et al. Fei Fei et al. Shape  Fergus et al. Fei Fei et al. Scale and occlusion are not modeled 

Comparison P (ө) MLE  ө ө* MAP  P (ө) Bayesian  ө*

Conjugate priors Parameters  Priors  Dirichlet Wishart Normal Hyperparameters  Closed form solution 

Learning  p(x|ө) = ∑p(x,h|ө)p(h|ө) , hunknown, but 'convenient'  Regular EM  Estep: Estimate p(h|x,ө n ), Q(ө)=E h { log(p(x,h|ө)) | h}  (Usually available in closed form)  Mstep: Ө n+1 = argmax Q(ө)  Variational (EM)  Getting p(h|x,ө n ) is hardno closed form  p(h|x,ө n ) ~ q(x) , approximate the posterior

Performance: Caltech 4

Caltech 4 (cont.)

Performance : Caltech 101 Performance: 10.4%, 13.9%,17.7% with 3,6,15 training example  Stateoftheart : > 60% 

Comparison

Summary  Transfer learning in a Bayesian setting  Recipe = learning priors on given data+ updating priors on new data  Good results with just 1~5 training examples (compared to MLE approaches)  Learning is hard (computationally)  Wishlist  Handling multiple objects within the image.

Thank You!

Multiclass object recognition Sharing parts and transfer learning - PowerPoint PPT Presentation

Multiclass object recognition Sharing parts and transfer learning Sharat Chikkerur Outline Historical perspective and motivation Discriminative approach A. Torralba, K. Murphy, W. Freeman, Sharing visual features for multiclass and

Multiclass Boosting with Repartitioning Ling Li Learning Systems Group, Caltech ICML 2006

Multiclass Predictions CMSC 422 M ARINE C ARPUAT marine@cs.umd.edu T opics Given an arbitrary

Parts Tracking and Parts Room Management System Parts Tracking and Parts Room Management System

Object Oriented Object 3 Programming Object 1 Object 2 Object 4 For : COP 3330. Object

Parts of a Plant Plants have different parts to them, just like you. We have different body parts

Secret Sharing and Visual Cryptography Outline Secret Sharing Visual Secret Sharing

Instance-level Recognition Pingmei Xu Object Recognition Friends SE01EP02 Recognition: Find the

MOTIVATE 1 and 2 Trials Maraviroc in Patients with Multiclass Drug Resistance MOTIVATE 1 and 2:

Perception of Average Value in Multiclass Scatterplots Michael Gleicher, Michael Correll,

Lecture 7: Multiclass Classification Princeton University COS 495 Instructor: Yingyu Liang

Multiclass Multilabel Classification with More Classes than Examples Ohad Shamir Weizmann

Selective sampling algorithms for cost-sensitive multiclass prediction Alekh Agarwal Microsoft

Model Combination in Multiclass Classification Sam Reid Advisors: Mike Mozer, Greg Grudic

CSC 411: Lecture 07: Multiclass Classification Class based on Raquel Urtasun & Rich Zemels

Adversarial Surrogate Losses for General Multiclass Classification Rizal Zaini Ahmad Fathony

Beyond Object Recognition in 2D Georgia Gkioxari Object Recognition in 2D The World is 3D

Multi-Task Learning & Transfer Learning Basics CS 330 1 Logistics Homework 1 posted Monday

globus online Simplify big data sharing with Globus Online Steve Tuecke Computation Institute

Campaigning in Britain Justin Fisher (Brunel University) PARTY SYSTEMS IN THE UK Calculation of

Designing an ultra low-overhead multithreading runtime for Nim Mamy Ratsimbazafy Weave

PhEDEx and CMS Data Transfers Paul Rossman Fermilab Global CMS Data Network Paul Rossman

pab stakeholder discussion January 30 and January 31 pab stakeholder goals and timeline

CS11-737: Multilingual Natural Language Processing Translation Yulia Tsvetkov Translation Mr.

Deep Transfer Mapping for Unsupervised Writer Adaptation Hong-Ming Yang 1,2 , Xu-Yao Zhang 1,2 ,

Multiclass object recognition Sharing parts and transfer learning - PowerPoint PPT Presentation

Multiclass object recognition Sharing parts and transfer learning Sharat Chikkerur Outline Historical perspective and motivation Discriminative approach A. Torralba, K. Murphy, W. Freeman, Sharing visual features for multiclass and

Multiclass Boosting with Repartitioning Ling Li Learning Systems Group, Caltech ICML 2006

Multiclass Predictions CMSC 422 M ARINE C ARPUAT marine@cs.umd.edu T opics Given an arbitrary

Parts Tracking and Parts Room Management System Parts Tracking and Parts Room Management System

Object Oriented Object 3 Programming Object 1 Object 2 Object 4 For : COP 3330. Object

Parts of a Plant Plants have different parts to them, just like you. We have different body parts

Secret Sharing and Visual Cryptography Outline Secret Sharing Visual Secret Sharing

Instance-level Recognition Pingmei Xu Object Recognition Friends SE01EP02 Recognition: Find the

MOTIVATE 1 and 2 Trials Maraviroc in Patients with Multiclass Drug Resistance MOTIVATE 1 and 2:

Perception of Average Value in Multiclass Scatterplots Michael Gleicher, Michael Correll,

Lecture 7: Multiclass Classification Princeton University COS 495 Instructor: Yingyu Liang

Multiclass Multilabel Classification with More Classes than Examples Ohad Shamir Weizmann

Selective sampling algorithms for cost-sensitive multiclass prediction Alekh Agarwal Microsoft

Model Combination in Multiclass Classification Sam Reid Advisors: Mike Mozer, Greg Grudic

CSC 411: Lecture 07: Multiclass Classification Class based on Raquel Urtasun &amp; Rich Zemels

Adversarial Surrogate Losses for General Multiclass Classification Rizal Zaini Ahmad Fathony

Beyond Object Recognition in 2D Georgia Gkioxari Object Recognition in 2D The World is 3D

Multi-Task Learning &amp; Transfer Learning Basics CS 330 1 Logistics Homework 1 posted Monday

globus online Simplify big data sharing with Globus Online Steve Tuecke Computation Institute

Campaigning in Britain Justin Fisher (Brunel University) PARTY SYSTEMS IN THE UK Calculation of

Designing an ultra low-overhead multithreading runtime for Nim Mamy Ratsimbazafy Weave

PhEDEx and CMS Data Transfers Paul Rossman Fermilab Global CMS Data Network Paul Rossman

pab stakeholder discussion January 30 and January 31 pab stakeholder goals and timeline

CS11-737: Multilingual Natural Language Processing Translation Yulia Tsvetkov Translation Mr.

Deep Transfer Mapping for Unsupervised Writer Adaptation Hong-Ming Yang 1,2 , Xu-Yao Zhang 1,2 ,

CSC 411: Lecture 07: Multiclass Classification Class based on Raquel Urtasun & Rich Zemels

Multi-Task Learning & Transfer Learning Basics CS 330 1 Logistics Homework 1 posted Monday