Sliding windows and face detection Tuesday, Nov 10 Kristen Grauman UT - PDF document

11/10/2009 Sliding windows and face detection Tuesday, Nov 10 Kristen Grauman UT ‐ Austin Last time • Modeling categories with local features and spatial information: – Histograms, configurations of visual words to capture Histograms configurations of visual words to capture global or local layout in the bag-of-words framework • Pyramid match, semi-local features 1

11/10/2009 Pyramid match Histogram intersection counts number of possible matches at a given partitioning. Spatial pyramid match • Make a pyramid of bag ‐ of ‐ words histograms. • Provides some loose (global) spatial layout information [Lazebnik, S chmid & Ponce, CVPR 2006] 2

11/10/2009 Last time • Modeling categories with local features and spatial information: – Histograms, configurations of visual words to capture Histograms configurations of visual words to capture global or local layout in the bag-of-words framework • Pyramid match, semi-local features – Part-based models to encode category’s part appearance together with 2d layout, – Allow detection within cluttered image Allow detection within cluttered image • “implicit shape model”, Generalized Hough for detection • “constellation model”: exhaustive search for best fit of features to parts Implicit shape models • Visual vocabulary is used to index votes for object position [a visual word = “part”] visual codeword with displacement vectors training image annotated with object localization info B. Leibe, A. Leonardis, and B. Schiele, Combined Object Categorization and Segmentation with an Implicit Shape Model, ECCV Workshop on Statistical Learning in Computer Vision 2004 3

11/10/2009 Implicit shape models • Visual vocabulary is used to index votes for object position [a visual word = “part”] test image B. Leibe, A. Leonardis, and B. Schiele, Combined Object Categorization and Segmentation with an Implicit Shape Model, ECCV Workshop on Statistical Learning in Computer Vision 2004 Shape representation in part-based models Fully connected “Star” shape model constellation model ng ory Augmented Computi x 1 x 1 x 6 x 2 x 6 x 2 gnition Tutorial x 5 x 3 x 5 x 3 x 4 x 4 � e g Constellation Model � e.g. Constellation Model Visual Object Recog Perceptual and Sens � e.g. implicit shape model i li it h d l � Parts fully connected � Parts mutually independent � Recognition complexity: O(N P ) � Recognition complexity: O(NP) � Method: Exhaustive search � Method: Gen. Hough Transform N image features, P parts in the model S lide credit: Rob Fergus 4

11/10/2009 Coarse genres of recognition approaches • Alignment: hypothesize and test – Pose clustering with object instances Pose clustering with object instances – Indexing invariant features + verification • Local features: as parts or words – Part-based models – Bags of words models g • Global appearance: “texture templates” – With or without a sliding window Today • Detection as classification – Supervised classification • Skin color detection example – Sliding window detection • Face detection example 5

11/10/2009 Supervised classification • Given a collection of labeled examples, come up with a function that will predict the labels of new examples. “four” “nine” ? Novel input Training examples • How good is some function we come up with to do the classification? • Depends on – Mistakes made – Cost associated with the mistakes Supervised classification • Given a collection of labeled examples, come up with a function that will predict the labels of new examples. • Consider the two-class (binary) decision problem – L(4 → 9): Loss of classifying a 4 as a 9 – L(9 → 4): Loss of classifying a 9 as a 4 • Risk of a classifier s is expected loss: ( ) ( ) ( ) ( ) = → → + → → ( ) Pr 4 9 | using 4 9 Pr 9 4 | using 9 4 R s s L s L • We want to choose a classifier so as to minimize this total risk 6

11/10/2009 Supervised classification Optimal classifier will minimize total risk. At decision boundary, either choice of label yields same expected Feature value x loss. If we choose class “four” at boundary, expected loss is: = → + → P ( class is 9 | ) L (9 4) P (class is 4 | ) L (4 4) x x = → P ( class is 9 | ) L (9 4) x If we choose class “nine” at boundary, expected loss is: = → ( class is 4 | ) (4 9) P L x Supervised classification Optimal classifier will minimize total risk. At decision boundary, either choice of label yields same expected Feature value x loss. So, best decision boundary is at point x where → = → ( class is 9 | ) (9 4) P(class is 4 | ) (4 9) P L L x x To classify a new point, choose class with lowest expected loss; i.e., choose “four” if → > → ( 4 | ) ( 4 9 ) ( 9 | ) ( 9 4 ) P L P L x x 7

11/10/2009 Supervised classification Optimal classifier will P(4 | x ) P(9 | x ) minimize total risk. At decision boundary, either choice of label yields same expected Feature value x loss. So, best decision boundary is at point x where → = → ( class is 9 | ) (9 4) P(class is 4 | ) (4 9) P L L x x To classify a new point, choose class with lowest expected loss; i.e., choose “four” if → > → ( 4 | ) ( 4 9 ) ( 9 | ) ( 9 4 ) P L P L x x How to evaluate these probabilities? Probability Basic probability • X is a random variable • P(X) is the probability that X achieves a certain value called a PDF ll d PDF -probability distribution/density function • • or continuous X discrete X • Conditional probability: P(X | Y) – probability of X given that we already know Y Source: Steve Seitz 8

11/10/2009 Example: learning skin colors • We can represent a class-conditional density using a histogram (a “non-parametric” distribution) Percentage of skin pixels in each bin P(x|skin) Feature x = Hue P(x|not skin) Feature x = Hue Example: learning skin colors • We can represent a class-conditional density using a histogram (a “non-parametric” distribution) P(x|skin) Feature x = Hue N Now we get a new image, t i P(x|not skin) and want to label each pixel as skin or non-skin. What’s the probability we care about to do skin detection? Feature x = Hue 9

11/10/2009 Bayes rule posterior prior likelihood ( ( | | ) ) ( ( ) ) P P x x skin skin P P skin skin = ( | ) P skin x ( ) P x α ( ( | | ) ) ( ( | | ) ) ( ( ) ) P skin x P x skin P skin Where does the prior come from? Why use a prior? Example: classifying skin pixels Now for every pixel in a new image, we can estimate probability that it is generated by skin. Brighter pixels � higher probability of being skin Classify pixels based on these probabilities 10

11/10/2009 Example: classifying skin pixels Gary Bradski, 1998 Example: classifying skin pixels Using skin color-based face detection and pose estimation as a video-based interface Gary Bradski, 1998 11

11/10/2009 Supervised classification • Want to minimize the expected misclassification • Two general strategies T l t t i – Use the training data to build representative probability model; separately model class-conditional densities and priors ( generative ) – Directly construct a good decision boundary, model the posterior ( discriminative ) Today • Detection as classification – Supervised classification • Skin color detection example – Sliding window detection • Face detection example 12

11/10/2009 Detection via classification: Main idea Basic component: a binary classifier ng ory Augmented Computi gnition Tutorial gnition Tutorial Car/non-car Classifier Visual Object Recog Visual Object Recog Perceptual and Sens No, not a car. Yes, car. Detection via classification: Main idea If object may be in a cluttered scene, slide a window around looking for it. ng ory Augmented Computi gnition Tutorial gnition Tutorial Car/non-car Classifier Visual Object Recog Visual Object Recog Perceptual and Sens (Essentially, our skin detector was doing this, with a window that was one pixel big.) 13

11/10/2009 Detection via classification: Main idea Fleshing out this pipeline a bit more, we need to: ng ory Augmented Computi 1. Obtain training data 2. Define features 3. Define classifier gnition Tutorial gnition Tutorial Training examples Visual Object Recog Visual Object Recog Perceptual and Sens Car/non-car Classifier Feature extraction Detection via classification: Main idea • Consider all subwindows in an image � Sample at multiple scales and positions (and orientations) ng ory Augmented Computi • Make a decision per window: � “Does this contain object category X or not?” gnition Tutorial gnition Tutorial Visual Object Recog Visual Object Recog Perceptual and Sens 14

Sliding windows and face detection Tuesday, Nov 10 Kristen Grauman UT - PDF document

11/10/2009 Sliding windows and face detection Tuesday, Nov 10 Kristen Grauman UT Austin Last time Modeling categories with local features and spatial information: Histograms, configurations of visual words to capture Histograms

A summary of deep models for face recognition Qianli Liao Face recognition Face recognition:

Sliding right into disaster - Left-to-right sliding windows leak Daniel J. Bernstein, Joachim

Face detection and recognition Detection Recognition Sally Face detection &

Glacier Sliding Ian Hewitt, University of Oxford hewitt@maths.ox.ac.uk Sliding / friction laws -

Windows Not just for houses Windows 1-10 Windows Server Essentially a jacked up windows 8 box

Platform Convergence Journey Windows Embedded Standard 7 Windows Embedded Standard 8 Converged

Face detection, local features face alignment, and Face detection

Sliding system for concertina doors 271 Sliding system for concertina doors - Technical features

Lecture 3: Introduction to Sliding Mode Control Reference: S.C. Tan, Chapter 1. Sliding Mode

Face Cover Face Coverings In School Guidelines Face Coverings Face Coverings and PPE Cloth

Windows 8 Heap Internals Windows 8 Heap Internals Windows 8 Heap Internals INTRODUCTION Windows 8

1. 2. 3. 1. 2. 3. Windows 10 IoT Core Universal Windows Platform (UWP) Microsoft Azure v7

Windows Not Just For Houses Everyone Uses Windows! Versions of Windows 10 There are multiple

Deciphering the Face Deciphering the Face Aleix M. Martinez Computational Biology Computational

To provide you with a comprehensive overview on conducting effective face-to face contacts

Finishing Face to Face: The Priesthood Fulfilled in the Book of Revelation Steve Midgley

CPSC 418/MATH 318 Introduction to Cryptography Outline Course Technicalities, Symmetric

Proactive Adaptations in Sensor Network Query Processing Alan B. Stokes , Alvaro A. A. Fernandes,

Developing and Deploying Sensor Network Applications with AnduIN Daniel Klan Katja Hose Kai-Uwe

Cryptography Basics Network Security Instructor: Haojin Zhu 1 Cryptography What is

The case for Geoengineering Research MIT 30 October 2009 Cambridge, MA David Keith

Department of Biological Sciences Dept meeting 4 th June 2019 Agenda HoD news Grant Hose

182.694 Microcontroller VU Martin Perner SS 2017 Featuring Today: Digital Communication Weekly

182.694 Microcontroller VU Martin Perner SS 2014 Featuring Today: Digital Communication 160

Sliding windows and face detection Tuesday, Nov 10 Kristen Grauman UT - PDF document

11/10/2009 Sliding windows and face detection Tuesday, Nov 10 Kristen Grauman UT Austin Last time Modeling categories with local features and spatial information: Histograms, configurations of visual words to capture Histograms

A summary of deep models for face recognition Qianli Liao Face recognition Face recognition:

Sliding right into disaster - Left-to-right sliding windows leak Daniel J. Bernstein, Joachim

Face detection and recognition Detection Recognition Sally Face detection &amp;

Glacier Sliding Ian Hewitt, University of Oxford hewitt@maths.ox.ac.uk Sliding / friction laws -

Windows Not just for houses Windows 1-10 Windows Server Essentially a jacked up windows 8 box

Platform Convergence Journey Windows Embedded Standard 7 Windows Embedded Standard 8 Converged

Face detection, local features face alignment, and Face detection

Sliding system for concertina doors 271 Sliding system for concertina doors - Technical features

Lecture 3: Introduction to Sliding Mode Control Reference: S.C. Tan, Chapter 1. Sliding Mode

Face Cover Face Coverings In School Guidelines Face Coverings Face Coverings and PPE Cloth

Windows 8 Heap Internals Windows 8 Heap Internals Windows 8 Heap Internals INTRODUCTION Windows 8

1. 2. 3. 1. 2. 3. Windows 10 IoT Core Universal Windows Platform (UWP) Microsoft Azure v7

Windows Not Just For Houses Everyone Uses Windows! Versions of Windows 10 There are multiple

Deciphering the Face Deciphering the Face Aleix M. Martinez Computational Biology Computational

To provide you with a comprehensive overview on conducting effective face-to face contacts

Finishing Face to Face: The Priesthood Fulfilled in the Book of Revelation Steve Midgley

CPSC 418/MATH 318 Introduction to Cryptography Outline Course Technicalities, Symmetric

Proactive Adaptations in Sensor Network Query Processing Alan B. Stokes , Alvaro A. A. Fernandes,

Developing and Deploying Sensor Network Applications with AnduIN Daniel Klan Katja Hose Kai-Uwe

Cryptography Basics Network Security Instructor: Haojin Zhu 1 Cryptography What is

The case for Geoengineering Research MIT 30 October 2009 Cambridge, MA David Keith

Department of Biological Sciences Dept meeting 4 th June 2019 Agenda HoD news Grant Hose

182.694 Microcontroller VU Martin Perner SS 2017 Featuring Today: Digital Communication Weekly

182.694 Microcontroller VU Martin Perner SS 2014 Featuring Today: Digital Communication 160

Face detection and recognition Detection Recognition Sally Face detection &