Structure of Vision Problems. Alan Yuille (UCLA). PowerPoint PPT Presentation.



SLIDE 1

Structure of Vision Problems

Alan Yuille (UCLA).

SLIDE 2

Machine Learning

The theory of machine learning is beautiful and deep. But how useful is it for vision? Vision rarely has an obvious vector-space structure.

SLIDE 3

Image Formation

Image formation is complicated. E.g. the image of a face depends on viewpoint, lighting, and facial expression.

SLIDE 4

Image Formation.

Parable of the Theatre, the Carpenter, the Painter, and the Lightman (Adelson and Pentland). How many ways can you construct a scene so that the image looks the same when seen from the Royal Box?

SLIDE 5

Nonlinear Transformations

Mumford suggested that images involve basic nonlinear transformations:

(I) Image warping: x → W(x) (e.g. change of viewpoint, expression, etc.).

(II) Occlusion: foreground objects occlude background objects.

(III) Shadows, multi-reflectance.
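Transformation (I) can be made concrete with a minimal sketch. The warp below is a hypothetical one-pixel horizontal translation, far simpler than the viewpoint or expression warps meant here, but it shows the pull-back structure of x → W(x):

```python
import numpy as np

def warp_image(image, shift):
    """Apply the warp W(x, y) = (x - shift, y): a pure horizontal translation.

    Real warps (viewpoint change, facial expression) are nonlinear maps;
    this translation is only an illustrative stand-in.
    """
    h, w = image.shape
    warped = np.zeros_like(image)
    for y in range(h):
        for x in range(w):
            src = x - shift              # pre-image of pixel x under W
            if 0 <= src < w:
                warped[y, x] = image[y, src]
    return warped

img = np.arange(12, dtype=float).reshape(3, 4)
out = warp_image(img, 1)                 # each row shifts one pixel right
```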

SLIDE 6

Complexity of Images

Easy, Medium, and Hard Images.

SLIDE 7

Discrimination or Probabilities

Statistical Edge Detection (Konishi, Yuille, Coughlan, Zhu).

Use a segmented image database to learn the probability distributions P(f|on) and P(f|off), where "f" is the filter response.

SLIDE 8

P-on and P-off

Let f(I(x)) = |grad I(x)|. Calculate empirical histograms P(f=y|ON) and P(f=y|OFF).

The ratio P(f=y|ON)/P(f=y|OFF) is monotonic in y, so the log-likelihood test reduces to a threshold on |grad I(x)|.
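As a sketch of this pipeline, the snippet below builds the two histograms from synthetic filter responses (stand-ins for responses collected from a segmented database; the exponential shapes are an assumption, not the learned distributions from the paper) and applies the log-likelihood test:

```python
import numpy as np

rng = np.random.default_rng(0)
f_on = rng.exponential(scale=4.0, size=10000)    # |grad I| on edges: typically large
f_off = rng.exponential(scale=1.0, size=10000)   # |grad I| off edges: typically small

bins = np.linspace(0.0, 20.0, 41)
p_on, _ = np.histogram(f_on, bins=bins, density=True)    # empirical P(f|ON)
p_off, _ = np.histogram(f_off, bins=bins, density=True)  # empirical P(f|OFF)

eps = 1e-9                                        # avoid log(0) in empty bins
log_ratio = np.log((p_on + eps) / (p_off + eps))

def is_edge(f, threshold=0.0):
    """Log-likelihood test: declare 'edge' when log P(f|ON)/P(f|OFF) > threshold."""
    idx = np.clip(np.digitize(f, bins) - 1, 0, len(log_ratio) - 1)
    return log_ratio[idx] > threshold
```

Because the likelihood ratio is monotonic in f here, this test is equivalent to thresholding the gradient magnitude directly, which is the point of the slide.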

SLIDE 9

P-on and P-off

P-on and P-off become more powerful when multiple edge cues are combined (via joint distributions).

Results are as good as, or better than, standard edge detectors when evaluated on images with ground truth.
SLIDE 10

P-on and P-off

Why not do discrimination and avoid learning the distributions? (Malik et al.)

Learning the distributions and using the log-likelihood is optimal provided there is sufficient data. But "don't solve a harder problem than you have to".

SLIDE 11

Probabilities or Discrimination

Two reasons for probabilities:

(I) They can be used for other problems, such as detecting contours by combining local edge cues.

(II) They can be used to synthesize edges as a "reality check".

SLIDE 12

Combining Local Edge Cues

Detect contours by combining local edge cues with shape priors P_g (Geman & Jedynak). The reward for a candidate contour with edge responses {y_i} and turns {t_i} is

r({y_i}, {t_i}) = (1/N) Σ_{i=1}^{N} log [P_on(y_i) / P_off(y_i)] + (1/N) Σ_{i=1}^{N} log [P_g(t_i) / U(t_i)],

where U(.) is the uniform distribution.
SLIDE 13

Manhattan World

Coughlan and Yuille use P-on and P-off to estimate scene orientation with respect to the viewer.

SLIDE 14

Synthesis as Reality Check

Synthesis of images using the P-on and P-off distributions (Coughlan & Yuille).

SLIDE 15

Machine Learning Success

Fixed geometry, lighting, viewpoint. AdaBoost Learning: Viola and Jones.
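AdaBoost itself is easy to sketch. The toy below boosts decision stumps on synthetic 1-D features, standing in for the Haar-like filter responses Viola and Jones boost over; it is a sketch of the algorithm, not their detector:

```python
import numpy as np

def train_adaboost(x, y, rounds=5):
    """Boost decision stumps on 1-D features x with labels y in {-1, +1}."""
    n = len(x)
    w = np.full(n, 1.0 / n)                       # example weights
    ensemble = []                                 # (threshold, polarity, alpha)
    for _ in range(rounds):
        best = None
        for thr in x:                             # candidate thresholds
            for pol in (1, -1):                   # candidate polarities
                pred = pol * np.sign(x - thr + 1e-12)
                err = np.sum(w[pred != y])        # weighted error of this stump
                if best is None or err < best[0]:
                    best = (err, thr, pol, pred)
        err, thr, pol, pred = best
        err = np.clip(err, 1e-10, 1 - 1e-10)
        alpha = 0.5 * np.log((1 - err) / err)     # stump weight
        w *= np.exp(-alpha * y * pred)            # re-weight: focus on mistakes
        w /= w.sum()
        ensemble.append((thr, pol, alpha))
    return ensemble

def predict(ensemble, x):
    score = sum(a * p * np.sign(x - t + 1e-12) for t, p, a in ensemble)
    return np.sign(score)

x = np.array([0.0, 1.0, 2.0, 3.0, 4.0, 5.0])      # synthetic 1-D "features"
y = np.array([-1, -1, -1, 1, 1, 1])
clf = train_adaboost(x, y)
```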

SLIDE 16

Machine Vision Success

Other examples: classification (Le Cun et al., Scholkopf et al., Caputo et al.).

Do these demonstrate the power of statistics, rather than the power of machine learning?

SLIDE 17

Bayesian Pattern Theory.

This approach seeks to model the different types of image patterns.

Vision as statistical inference: inverse computer graphics. Analysis by Synthesis (Bayes). Computationally expensive?

SLIDE 18

Example: Image Segmentation

A standard computer vision task. Pattern Theory formulation (Zhu, Tu): decompose images into their underlying patterns.

Requires a set of probability models which can describe image patterns, learnt from data.

SLIDE 19

Image Pattern Models

Images (top) and Synthesized (bottom).

SLIDE 20

Image Parsing: Zhu & Tu

SLIDE 21

Image Parsing: Zhu & Tu.

Bayesian formulation: model the image as being composed of multiple regions.

Boundaries of regions obey (probabilistic) constraints (e.g. smoothness).

Intensity properties within regions are described by a set of models with unknown parameters (to be estimated).
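A minimal 1-D sketch of this kind of region-based score: each region is modelled as Gaussian intensity with parameters fitted per region, and each boundary pays a fixed prior penalty. The function name, the Gaussian region model, and the penalty value are illustrative assumptions, not Zhu and Tu's actual formulation:

```python
import numpy as np

def neg_log_posterior(signal, boundaries, boundary_penalty=2.0):
    """Score a segmentation of a 1-D signal into regions.

    Each region is modelled as Gaussian with parameters fitted to that
    region; each boundary pays a fixed prior penalty (a crude stand-in
    for a smoothness prior on region boundaries).
    """
    edges = [0, *boundaries, len(signal)]
    nll = 0.0
    for a, b in zip(edges[:-1], edges[1:]):
        region = signal[a:b]
        mu, sigma = region.mean(), region.std() + 1e-6   # fitted parameters
        nll += np.sum(0.5 * ((region - mu) / sigma) ** 2 + np.log(sigma))
    return nll + boundary_penalty * len(boundaries)

signal = np.array([0.0, 0.1, 0.0, 5.0, 5.1, 5.0])  # two clear "regions"
good = neg_log_posterior(signal, [3])               # split at the true boundary
bad = neg_log_posterior(signal, [])                 # one region for everything
```

Placing the boundary where the intensity model changes yields a lower (better) score, which is the trade-off the Bayesian formulation optimizes over all parses.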

SLIDE 22

Image Parsing Results:

Input, Segmentation, and Synthesis.

SLIDE 23

Regions, Curves, Occlusions.

SLIDE 24

Removing Foreground.

"Denoising" images by removing foreground clutter.

SLIDE 25

Image Parsing Solution Space

Number of regions, types of regions, properties of regions.

SLIDE 26

Machine Learning & Bayes.

Zhu and Tu's algorithm is called DDMCMC: Data-Driven Markov Chain Monte Carlo.

Discriminative methods (e.g. AdaBoost) can be used as proposal probabilities, which are then verified by the Bayesian pattern models.
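The idea of bottom-up proposals verified top-down can be sketched with an independence Metropolis-Hastings sampler over a toy discrete state space; the target and proposal distributions below are illustrative numbers, not Zhu and Tu's image-parsing kernels:

```python
import numpy as np

rng = np.random.default_rng(1)
p = np.array([0.7, 0.2, 0.1])    # "full Bayesian" posterior over 3 hypotheses
q = np.array([0.5, 0.3, 0.2])    # bottom-up proposal (e.g. discriminative scores)

state = 0
counts = np.zeros(3)
for _ in range(20000):
    proposal = rng.choice(3, p=q)                 # data-driven proposal
    # Metropolis-Hastings acceptance for an independence proposal:
    accept = min(1.0, (p[proposal] * q[state]) / (p[state] * q[proposal]))
    if rng.random() < accept:
        state = proposal                          # top-down verification passed
    counts[state] += 1

freq = counts / counts.sum()                      # empirical visit frequencies
```

The better the discriminative proposal q matches the posterior p, the higher the acceptance rate, which is why good bottom-up proposals make the MCMC search dramatically faster.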

SLIDE 27

Machine Learning & Bayes

Machine Learning seems to concentrate on discrimination problems. But there is a whole range of other vision problems: image segmentation, image matching, viewpoint estimation, etc.

Probability models for image patterns are learnable. These models give reality checks by synthesis.

SLIDE 28

Machine Learning & Bayes

Machine Learning's big advantage over Bayes is speed (when applicable). AdaBoost may be particularly useful for combining local cues.

Machine Learning for computational search, to enable Bayesian estimation?