Unsupervised Learning of Object Deformation Models
Iasonas Kokkinos (PowerPoint presentation)

SLIDE 1

Unsupervised Learning of Object Deformation Models

Work appearing in ICCV 07

Iasonas Kokkinos and Alan Yuille

Center for Image and Vision Sciences Seminar, Department of Statistics, UCLA

  • Oct. 2007
SLIDE 2

Modelling Shape Variation via Deformations

  • Top-down approach:

– Model variation in shape and appearance separately (`one thing at a time')
– Shape modelling via deformations

  • Success Stories:

– Deformable Templates, Active Contours.
– ASM-AAM models.
– MRF models for object detection/tracking.

  • Our goal: Learn without manual annotations.

Deformation model: I(S(x)) = T(x), mapping the template T at coordinates x to the image instance I at the deformed coordinates S(x).
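The relation I(S(x)) = T(x) can be illustrated with a minimal nearest-neighbour warp; this is a numpy-based sketch (the models in the talk use smooth deformation fields, not this toy splatting):

```python
import numpy as np

def warp_template(template, S):
    """Synthesize an image instance I such that I(S(x)) = T(x).

    `template` is a 2-D array T; the deformation S maps template
    coordinates to image coordinates and is given as two arrays
    (Sy, Sx) of the same shape as T.  Nearest-neighbour splatting
    is used purely for illustration.
    """
    Sy, Sx = S
    image = np.zeros_like(template)
    ys = np.clip(np.round(Sy).astype(int), 0, template.shape[0] - 1)
    xs = np.clip(np.round(Sx).astype(int), 0, template.shape[1] - 1)
    image[ys, xs] = template  # I(S(x)) = T(x)
    return image

# The identity deformation leaves the template unchanged.
T = np.arange(12.0).reshape(3, 4)
Sy, Sx = np.meshgrid(np.arange(3), np.arange(4), indexing="ij")
I = warp_template(T, (Sy, Sx))
```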

SLIDE 3
  • Appearance and shape of interest points.

+ Efficient, scale-invariant detection
+ Automated learning of object detection models

  • Not really generative models, therefore unsuitable for:
    – Segmentation
    – Tracking/analysis
    – Other tasks in the `pipeline'
  • Questionable shape models (Gaussians for a fixed set of interest points)

Mainstream Approaches to Learning Object Models

Parallelograms: a 4-point model? All contour information can be recovered, but why require `magic picture' skills?

SLIDE 4

Primal Sketch Representation

  • Image sketch: low-level summary of pixel information

– Marr, Perceptual grouping, Lindeberg, Guo, Wu & Zhu.

  • Lindeberg Edges & Ridges: Focus on Scale Invariance

[Figure: smoothed image, edge strength, and ridge strength, shown across scale]

  • Remove appearance variation, focus on shape.
  • Extract information related to boundaries/symmetry axes.
  • Use as features for both modelling and detection.


SLIDE 5

Datasets

  • Overall goal is to learn models and use them for object detection.
  • Datasets from detection benchmarks were used, containing unsegmented images with significant noise and occlusions.

  • Cars: Agarwal and Roth, ECCV 02
  • Horses: Borenstein & Ullman, ECCV 02
  • Cows: Leibe & Schiele, ECCV 04
  • Faces: Fergus et al., CVPR
  • Hands: Gomez & Stegmann, imm.dtu.dk

SLIDE 6

Learning Active Appearance Models

SLIDE 7
  • Composite deformations modelled as superposition of simpler ones.
  • Combination of shape and texture models.
  • Model fitting:

– Minimization of a reconstruction criterion.
– Stochastic GD, Newton-Raphson, Inverse Compositional.

  • Model Learning?

AAMs: Linear, Global Deformation Models

Shape: S = S_0 + Σ_i a_i S_i;  Texture: T = T_0 + Σ_i b_i T_i;  Synthesis: I(S(x)) = T(x).
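The linear synthesis step can be sketched in a few lines; the function name and the toy two-landmark numbers below are illustrative, not from the talk:

```python
import numpy as np

def synthesize(mean, basis, coeffs):
    """Linear AAM-style synthesis: x = mean + sum_i coeffs[i] * basis[i].

    `mean` has shape (d,), `basis` shape (k, d) (rows are modes),
    `coeffs` shape (k,).  The same formula serves both the shape
    vector S and the texture vector T.
    """
    return mean + coeffs @ basis

# Toy example: a 2-landmark shape (d = 4) with one deformation mode.
mean_shape = np.array([0.0, 0.0, 1.0, 0.0])   # (x1, y1, x2, y2)
modes = np.array([[0.0, 0.0, 1.0, 0.0]])      # stretch the second point in x
shape = synthesize(mean_shape, modes, np.array([0.5]))
# shape == [0.0, 0.0, 1.5, 0.0]
```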

SLIDE 8

Unsupervised Learning of AAMs

  • Goal: register training images with prototype template.

– Unknowns: template, deformation basis and coefficients.

  • Previous Work:

– Vetter, Jones & Poggio, CVPR 97: Bootstrapping

  • Iterate: AAM fitting - optical flow - PCA on optical flow

– Cootes et al. ECCV 2004: Diffeomorphisms

  • Guarantee 1-1 mapping, but not of the typical PCA type.

– Baker, Matthews, Schneider, PAMI 04: Coding Length

  • Global reconstruction criterion.
  • Our Contributions:

– EM Formulation
– Mean Shift Clustering for Eigenvector Initialization
– `Feature Transport' PDE

SLIDE 9

AAM Learning Block Diagram

[Block diagram: input images → primal sketch → mean shift clustering (new eigenvector initialization) → EM loop (E: deform / AAM fit; M: update) → model parameters and registrations]

SLIDE 10

EM Approach to Learning AAMs

  • Starting Points

– ‘Coding’ criterion (BMS ’04)

  • EM formulation:

– Parameters: synthesis model for deformations and template.
– Hidden variables: coefficients matching the template to individual images.

  • EM-based minimization:

– E-step: Posterior on hidden variables, given parameters.

  • model fitting: estimate the hidden coefficients.

– M-step: Maximize expected log-likelihood

  • maximize observation likelihood w.r.t. basis elements & template.
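The E/M alternation above can be sketched on a toy linear model with the deformation machinery stripped out; this is an illustrative alternating least-squares loop, not the paper's algorithm:

```python
import numpy as np

def learn_linear_model(images, k, iters=20):
    """Toy EM-style alternation for a linear generative model
    y_n ≈ template + coeffs_n @ basis.

    E-step: per-image coefficients (hidden variables) by least squares.
    M-step: update template and basis (parameters) given the coefficients.
    """
    template = images.mean(axis=0)
    basis = np.random.default_rng(0).normal(size=(k, images.shape[1]))
    for _ in range(iters):
        # E-step: fit coefficients for every image.
        coeffs = np.linalg.lstsq(basis.T, (images - template).T, rcond=None)[0].T
        # M-step: maximize the (Gaussian) likelihood w.r.t. template & basis.
        template = (images - coeffs @ basis).mean(axis=0)
        basis = np.linalg.lstsq(coeffs, images - template, rcond=None)[0]
    return template, basis, coeffs

# Demo: recover an exact rank-2 model from synthetic data.
rng = np.random.default_rng(3)
data = rng.normal(size=10) + rng.normal(size=(50, 2)) @ rng.normal(size=(2, 10))
template, basis, coeffs = learn_linear_model(data, k=2)
```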
SLIDE 11

Mean Shift Clustering for Initialization

  • Alternative views of Primal Sketch Contours:

– 2D images, 1D contours, 0D point sets.

  • Phrase registration as clustering of points.
  • Nonparametric clustering: Mean Shift

– Variation: remove the motion component along the contour orientation.
– Collapses contour `spaghetti' onto a single contour.
– Used for eigenvector initialization and template construction.
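Plain mean shift, which the talk modifies by removing motion along the contour orientation, can be sketched as follows (bandwidth, kernel, and toy data are illustrative):

```python
import numpy as np

def mean_shift(points, bandwidth=1.0, iters=50):
    """Blurring mean shift: every point repeatedly moves to the
    Gaussian-kernel-weighted mean of the point set, so the cloud
    collapses onto its modes.  (The talk's variant additionally
    removes the motion component along the contour orientation.)
    """
    shifted = points.astype(float).copy()
    for _ in range(iters):
        for i, p in enumerate(shifted):
            w = np.exp(-np.sum((shifted - p) ** 2, axis=1) / (2 * bandwidth ** 2))
            shifted[i] = (w[:, None] * shifted).sum(axis=0) / w.sum()
    return shifted

# Two well-separated blobs collapse onto two modes.
pts = np.vstack([np.zeros((5, 2)), 10 + np.zeros((5, 2))])
pts += np.random.default_rng(1).normal(scale=0.1, size=(10, 2))
modes = mean_shift(pts, bandwidth=1.0)
```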

SLIDE 12

Feature Transport PDE

  • Problem Addressed:

– Deformations can `swallow' template features.
– Instead of matching the template to the image, its features are hidden.

  • `Feature Transport’ idea: do not `accelerate’ across features.

– Constraint on the deformation field.
– Project onto the nearest function satisfying the constraint.
– Calculus of Variations.

SLIDE 13

Deformation Eigenmodes

SLIDE 14

Registration Results: Improvements in Template Clarity

SLIDE 15

Quantitative Results

  • 50 Images from each category, 18-50 landmarks per image.
  • Error measure:

– Backward warp images to the template coordinate system.
– Calculate the covariance of landmark locations.
– Estimate the `radius' of a containing circle.

[Plot legend: green = translation only; blue = learned model; red = manual model]
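The error measure can be sketched as below; summarizing the covariance by the square root of its largest eigenvalue is an assumption here, since the talk does not spell out the exact `radius' statistic:

```python
import numpy as np

def landmark_radius(landmarks):
    """Spread of one landmark's back-warped locations across images:
    covariance of the (n, 2) location array, summarized as the
    `radius' sqrt(largest eigenvalue).  Smaller = tighter registration.
    """
    cov = np.cov(landmarks, rowvar=False)
    return float(np.sqrt(np.linalg.eigvalsh(cov).max()))

# Tight clusters of back-warped landmarks give small radii.
locs = np.random.default_rng(2).normal(scale=0.05, size=(50, 2))
r = landmark_radius(locs)
```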

SLIDE 16

Learning Part-Based Models

SLIDE 17

Part-Based Deformation models

  • AAM limitations:

– Global deformation models.
– No occlusion or layered appearance modelling.
– Local minima due to greedy fitting.

  • Part-Based Models (Graphical Models):

– Each object part corresponds to a node on the graph.
– Node state: parameters of a local deformation model.
– Clique potentials: kinematic constraints.

  • Divide-and-conquer-type modelling of deformations.

– Small set of simple models.
– Inference can avoid local minima and handle occlusions.

[Figure: template and the deformed template produced by the state of node i]

SLIDE 18

Model Initialization

  • Part Detection:

– Cluster ridges (symmetry axes) using Mean Shift.
– But now move only along the contour orientation.
– Detected Parts:

  • Initial parameter estimates: obtained from AAM fitting results.
SLIDE 19

Learning the Model via EM.

  • Split problem unknowns

– Hidden variables: node states for each individual image.
– Parameters: clique expressions, network structure, template.

  • E-step: Estimate posterior distribution on hidden states

– Tree-structured model.
– Inference on graphs.

  • M-step: Use posterior to update model parameters

– Hinged-joint model estimation for clique potentials (Least Squares).
– Structure Learning (Minimum Spanning Tree).
– Learning the observation model (Niblack Thresholding).
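The minimum-spanning-tree step can be sketched with Prim's algorithm on a dense part-to-part cost matrix; the costs below are placeholders (in the M-step they would come from how well pairs of parts predict each other):

```python
import numpy as np

def mst_edges(weights):
    """Prim's algorithm on a dense symmetric weight matrix.
    Returns the (parent, child) edges of the minimum spanning tree,
    which fixes the graph structure of the part-based model.
    """
    n = len(weights)
    in_tree = {0}
    edges = []
    while len(in_tree) < n:
        # Cheapest edge from the current tree to an outside node.
        best = min((weights[i][j], i, j)
                   for i in in_tree for j in range(n) if j not in in_tree)
        edges.append((best[1], best[2]))
        in_tree.add(best[2])
    return edges

# A chain 0-1-2 is cheaper than the direct 0-2 edge.
W = np.array([[0., 1., 9.],
              [1., 0., 1.],
              [9., 1., 0.]])
tree = mst_edges(W)  # [(0, 1), (1, 2)]
```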

SLIDE 20

Nonparametric Belief Propagation for the E-step

  • Belief propagation algorithm: circulate messages in graph
  • NBP (Sudderth, Ihler et al., CVPR '03):

– Sample-based approximation to messages.
– Gibbs-sampling-based computation of the product between messages.
– Posterior on nodes:

  • Still problematic if we must evaluate the observation likelihood by cropping, rotating, and rescaling e.g. 100 image patches for each node.

SLIDE 21

Speeding Up the E-step

  • Use binary templates in appearance model (Niblack thresholding).
  • For each state being summed over:

– Deform the part template correspondingly.
– Sum ridge/edge strength inside/outside the template interior.
– Use as a feature in the observation potential expression.
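Niblack thresholding, used to produce the binary templates, sets each pixel's threshold from local statistics: threshold = local mean + k · local std. A minimal sketch (window size and k are illustrative choices, not the talk's settings):

```python
import numpy as np

def niblack_binarize(img, window=3, k=-0.2):
    """Niblack thresholding: a pixel is on if it exceeds the local
    mean plus k times the local standard deviation, both computed
    over a (window x window) neighbourhood (clipped at the borders).
    """
    h, w = img.shape
    r = window // 2
    out = np.zeros_like(img, dtype=bool)
    for y in range(h):
        for x in range(w):
            patch = img[max(0, y - r):y + r + 1, max(0, x - r):x + r + 1]
            out[y, x] = img[y, x] > patch.mean() + k * patch.std()
    return out

# A single bright pixel on a flat background is the only one kept.
img = np.zeros((5, 5))
img[2, 2] = 1.0
mask = niblack_binarize(img)
```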

  • Key idea: replace area summations with boundary integrals using Stokes' theorem:

[Figure: binary template T with regions S1, S2, S3; ridge interior and edge interior]
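In its simplest discrete form, the boundary-integral trick is the familiar integral-image identity: a region sum is obtained from a few boundary lookups instead of summing every interior pixel. A minimal sketch for rectangular regions (the talk applies the idea to arbitrary binary templates):

```python
import numpy as np

def integral_image(img):
    """Summed-area table with a zero border: ii[y, x] = sum(img[:y, :x])."""
    return np.pad(img.cumsum(0).cumsum(1), ((1, 0), (1, 0)))

def box_sum(ii, y0, x0, y1, x1):
    """Sum of img[y0:y1, x0:x1] from four corner lookups -- the
    discrete analogue of replacing an interior summation by a
    boundary integral (Stokes' theorem)."""
    return ii[y1, x1] - ii[y0, x1] - ii[y1, x0] + ii[y0, x0]

img = np.arange(16.0).reshape(4, 4)
ii = integral_image(img)
s = box_sum(ii, 1, 1, 3, 3)  # img[1:3, 1:3] sums to 5 + 6 + 9 + 10 = 30
```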

SLIDE 22

Part-Based Model Results

  • Top-down Syntheses:

– Sketch image using most likely samples from the node posteriors.

  • Quantitative Results:

– Better than the AAM, almost as good as the manual model.

[Plots: RCD error vs. landmark # for Cows and Horses, comparing AAM, MRF−M, and MRF−U]

SLIDE 23

Top-down fitting results

SLIDE 24

Conclusions & Discussion

  • Modelling deformations

– Basic prerequisite for building accurate models.
– Requires `handholding' and careful design.

  • Primal Sketch greatly facilitates modelling

– Amenable to Mean Shift clustering.
– Averaging over the training set provides boundaries & symmetry axes.
– Facilitates part detection by clustering symmetry axes.
– Removes appearance information.

  • Learning difficulty & performance:

– The learned AAM performed on par with the manually trained AAM.
– Learning part-based models is feasible, but there is room for improvement.

SLIDE 25
  • Use for detection

– How can we combine the sparse set of primal sketch tokens to detect an object?

  • Extend learning:

– Use the 1-D aspect of the primal sketch.
– Learn the hierarchy of parts building up the object?
– Allow alternative structures (And-Or graph)?

  • Use top-down models for segmentation

– Top-down filling-in does most of the work.
– Use hallucinated edges and ridges to segment the image.

  • Model appearance variation.

Future Work

CVPR 06 ICCV 07