Categorizing objects: global and part based models global and - PDF document

9/21/2012 Categorizing objects: global and part based models global and part-based models of appearance Kristen Grauman UT ‐ Austin Generic categorization problem 1

9/21/2012 Challenges: robustness Realistic scenes are crowded, cluttered, have overlapping objects. Generic category recognition: basic framework • Build/train object model Build/train object model – Choose a representation – Learn or fit parameters of model / classifier • Generate candidates in new image • Score the candidates 2

9/21/2012 Generic category recognition: representation choice Window ‐ based Part ‐ based Window-based models Building an object model Simple holistic descriptions of image content  grayscale / color histogram  vector of pixel intensities Kristen Grauman 3

9/21/2012 Window-based models Building an object model • Pixel-based representations sensitive to small shifts • Color or grayscale-based appearance description can be sensitive to illumination and intra-class appearance variation Kristen Grauman Window-based models Building an object model • Consider edges, contours, and (oriented) intensity gradients Kristen Grauman 4

9/21/2012 Window-based models Building an object model • Consider edges, contours, and (oriented) intensity gradients • Summarize local distribution of gradients with histogram  Locally orderless: offers invariance to small shifts and rotations  Contrast-normalization: try to correct for variable illumination Kristen Grauman Window-based models Building an object model Given the representation, train a binary classifier Car/non-car Classifier No, not a car. Yes, car. Kristen Grauman 5

9/21/2012 Discriminative classifier construction Neural networks Nearest neighbor 10 6 examples LeCun, Bottou, Bengio, Haffner 1998 Shakhnarovich, Viola, Darrell 2003 Rowley, Baluja, Kanade 1998 Berg, Berg, Malik 2005... … Conditional Random Fields Support Vector Machines Boosting Guyon, Vapnik Viola, Jones 2001, McCallum, Freitag, Pereira Heisele, Serre, Poggio, Torralba et al. 2004, 2000; Kumar, Hebert 2003 2001,… Opelt et al. 2006,… … Kristen Grauman Slide adapted from Antonio Torralba Generic category recognition: basic framework • Build/train object model Build/train object model – Choose a representation – Learn or fit parameters of model / classifier • Generate candidates in new image • Score the candidates 6

9/21/2012 Window-based models Generating and scoring candidates Car/non-car Classifier Kristen Grauman Window-based object detection: recap Training: 1. Obtain training data 2. Define features 3. 3 Define classifier Define classifier Given new image: 1. Slide window Training examples 2. Score by classifier Car/non-car Classifier Feature extraction Kristen Grauman 7

9/21/2012 Issues • What classifier? – Factors in choosing: • Generative or discriminative model? • Data resources – how much training data? • How is the labeled data prepared? • Training time allowance • Test time requirements – real-time? T t ti i t l ti ? • Fit with the representation Kristen Grauman Issues • What classifier? • What features or representations? • How to make it affordable? • What categories are amenable? Kristen Grauman 8

9/21/2012 Issues • What categories are amenable? – Similar to specific object matching, we expect spatial layout to be fairly rigidly preserved. – Unlike specific object matching , by training classifiers we attempt to capture intra-class variation or determine required discriminative features. Kristen Grauman What categories are amenable to window-based reps? Kristen Grauman 9

9/21/2012 Window-based models: Three case studies Boosting + face SVM + person NN + scene Gist detection detection classification e.g., Hays & Efros e.g., Dalal & Triggs Viola & Jones Viola-Jones face detector Main idea: – Represent local texture with efficiently computable “rectangular” features within window of interest – Select discriminative features to be weak classifiers – Use boosted combination of them as final classifier – Form a cascade of such classifiers, rejecting clear F d f h l ifi j ti l negatives quickly Kristen Grauman 10

9/21/2012 Boosting intuition Weak Classifier 1 Slide credit: Paul Viola Boosting illustration Weights Increased 11

9/21/2012 Boosting illustration Weak Classifier 2 Boosting illustration Weights Increased 12

9/21/2012 Boosting illustration Weak Classifier 3 Boosting illustration Final classifier is a combination of weak classifiers 13

9/21/2012 Boosting: training • Initially, weight each training example equally • • In each boosting round: In each boosting round: – Find the weak learner that achieves the lowest weighted training error – Raise weights of training examples misclassified by current weak learner • Compute final classifier as linear combination of all weak learners (weight of each learner is directly proportional to its accuracy) y) • Exact formulas for re-weighting and combining weak learners depend on the particular boosting scheme (e.g., AdaBoost) Slide credit: Lana Lazebnik Boosting: pros and cons • Advantages of boosting • Integrates classification with feature selection • Complexity of training is linear in the number of training examples examples • Flexibility in the choice of weak learners, boosting scheme • Testing is fast • Easy to implement • Disadvantages • Needs many training examples Needs man training e amples • Often found not to work as well as an alternative discriminative classifier, support vector machine (SVM) – especially for many-class problems Slide credit: Lana Lazebnik 14

9/21/2012 Viola-Jones detector: features “ Rectangular” filters Feature output is difference between p adjacent regions Value at (x,y) is Efficiently computable sum of pixels above and to the with integral image: any left of (x,y) sum can be computed in p constant time. Integral image Kristen Grauman Computing the integral image Lana Lazebnik 15

9/21/2012 Computing the integral image ii(x, y-1) s(x-1, y) i(x, y) Cumulative row sum: s(x, y) = s(x–1, y) + i(x, y) Integral image: ii(x, y) = ii(x, y − 1) + s(x, y) Lana Lazebnik Computing sum within a rectangle • Let A,B,C,D be the values of the integral image at the corners of a D D B B rectangle t l • Then the sum of original image values within the A rectangle can be C computed as: sum = A – B – C + D • Only 3 additions are required for any size of rectangle! Lana Lazebnik 16

9/21/2012 Viola-Jones detector: features “ Rectangular” filters Feature output is difference between p adjacent regions Value at (x,y) is Efficiently computable sum of pixels above and to the with integral image: any left of (x,y) sum can be computed in p constant time Avoid scaling images  scale features directly Integral image for same cost Kristen Grauman Viola-Jones detector: features Considering all possible filter parameters: position, scale, and type: 180,000+ possible features associated with each 24 x 24 window Which subset of these features should we use to determine if a window has a face? Use AdaBoost both to select the informative features and to form the classifier Kristen Grauman 17

9/21/2012 Viola-Jones detector: AdaBoost • Want to select the single rectangle feature and threshold that best separates positive (faces) and negative (non- faces) training examples, in terms of weighted error. Resulting weak classifier: For next round, reweight the … examples according to errors, Outputs of a possible choose another filter/threshold rectangle feature on combo. faces and non-faces. Kristen Grauman Viola-Jones Face Detector: Results ng First two features First two features ory Augmented Computi selected gnition Tutorial gnition Tutorial Visual Object Recog Visual Object Recog Perceptual and Sens 18

9/21/2012 • Even if the filters are fast to compute, each new image has a lot of possible windows to search image has a lot of possible windows to search. • How to make the detection more efficient? Cascading classifiers for detection • Form a cascade with low false negative rates early on • Apply less accurate but faster classifiers first to immediately discard windows that clearly appear to be negative Kristen Grauman 19

9/21/2012 Viola-Jones detector: summary Train cascade of classifiers with Ad B AdaBoost t Faces New image Selected features, thresholds, and weights Non-faces Train with 5K positives, 350M negatives Real ‐ time detector using 38 layer cascade 6061 features in all layers [Implementation available in OpenCV: Kristen Grauman http://www.intel.com/technology/computing/opencv/] Viola-Jones detector: summary • A seminal approach to real-time object detection • Training is slow but detection is very fast • Training is slow, but detection is very fast • Key ideas  Integral images for fast feature evaluation  Boosting for feature selection  Attentional cascade of classifiers for fast rejection of non- face windows P. Viola and M. Jones. Rapid object detection using a boosted cascade of simple features. CVPR 2001. P. Viola and M. Jones. Robust real-time face detection. IJCV 57(2), 2004. 20

Categorizing objects: global and part based models global and - PDF document

9/21/2012 Categorizing objects: global and part based models global and part-based models of appearance Kristen Grauman UT Austin Generic categorization problem 1 9/21/2012 Challenges: robustness Realistic scenes are crowded, cluttered,

Mutable Values Announcements Objects (Demo) Objects 4 Objects Objects represent

61A Lecture 12 Announcements Objects (Demo) Objects 4 Objects Objects represent

Natural Language Processing with Python CS372: Spring, 20 15 Lecture 12 Categorizing and

Objects & Inheritance Section 7 Implementing Objects in 401 Ways of implementing objects:

Live Objects Live Objects Live Objects Live Objects Krzys Ostrowski, Ken Birman, Danny Dolev

Review Objects Classes Objects and Arrays Models of Motion with Objects Linear

Object Oriented Programming Sunil Pai, Y! Objects Objects and Javascript Numbers Strings

Objects (Demo1) Objects Objects represent information They consist of data and behavior,

Mutable Values Announcements Objects (Demo) Objects Objects represent information They

61A Lecture 12 Announcements Objects (Demo) Objects Objects represent information They

Transforming Objects Ray : R(t) = s + c t Objects : Sphere, box, cone etc. We assume the objects

1 Models for Translucent Objects Models for Translucent Objects Models for Translucent Objects

A Machine-learning Approach for Classifying and Categorizing Android Sources and Sinks Siegfried

Microbial Profile of Hummus. Categorizing and identifying microflora Brittany Richards Maren

Statements of Functional Expenses for Non-Profits Best Practices for Categorizing and Reporting

The Issues of Categorizing Race and Ethnicity in Everyday Life By: Don Crumley Thesis Statement

Almost over... Feedback Website schedule for talk feedback Booklet (infodesk) feedback@

Recurrent Neural Networks CS 6956: Deep Learning for NLP Overview 1. Modeling sequences 2.

One Model To Learn Them All Lukasz Kaiser, Aidan N. Gomez, Noam Shazeer, Ashish Vaswani, Niki

Do Judgments Screen Evidence? Brian Weatherson Rutgers/Arch e March, 2010 Brian Weatherson

Learning to Compose Relational Embeddings in Knowledge Graphs Wenye Chen, Huda Hakami, Danushka

Talk to me Drupal Talk to me Drupal Using Drupal to power a Voice App Speaker notes Talk to me

j t t t q t t t t t q t t LAW-MWE-CxG-2018 @ COLING 2018 t q t q t t t t t q t t t t t

Microservices, Unikernels Portland State University CS 430P/530 Internet, Web & Cloud Systems

Categorizing objects: global and part based models global and - PDF document

9/21/2012 Categorizing objects: global and part based models global and part-based models of appearance Kristen Grauman UT Austin Generic categorization problem 1 9/21/2012 Challenges: robustness Realistic scenes are crowded, cluttered,

Mutable Values Announcements Objects (Demo) Objects 4 Objects Objects represent

61A Lecture 12 Announcements Objects (Demo) Objects 4 Objects Objects represent

Natural Language Processing with Python CS372: Spring, 20 15 Lecture 12 Categorizing and

Objects &amp; Inheritance Section 7 Implementing Objects in 401 Ways of implementing objects:

Live Objects Live Objects Live Objects Live Objects Krzys Ostrowski, Ken Birman, Danny Dolev

Review Objects Classes Objects and Arrays Models of Motion with Objects Linear

Object Oriented Programming Sunil Pai, Y! Objects Objects and Javascript Numbers Strings

Objects (Demo1) Objects Objects represent information They consist of data and behavior,

Mutable Values Announcements Objects (Demo) Objects Objects represent information They

61A Lecture 12 Announcements Objects (Demo) Objects Objects represent information They

Transforming Objects Ray : R(t) = s + c t Objects : Sphere, box, cone etc. We assume the objects

1 Models for Translucent Objects Models for Translucent Objects Models for Translucent Objects

A Machine-learning Approach for Classifying and Categorizing Android Sources and Sinks Siegfried

Microbial Profile of Hummus. Categorizing and identifying microflora Brittany Richards Maren

Statements of Functional Expenses for Non-Profits Best Practices for Categorizing and Reporting

The Issues of Categorizing Race and Ethnicity in Everyday Life By: Don Crumley Thesis Statement

Almost over... Feedback Website schedule for talk feedback Booklet (infodesk) feedback@

Recurrent Neural Networks CS 6956: Deep Learning for NLP Overview 1. Modeling sequences 2.

One Model To Learn Them All Lukasz Kaiser, Aidan N. Gomez, Noam Shazeer, Ashish Vaswani, Niki

Do Judgments Screen Evidence? Brian Weatherson Rutgers/Arch e March, 2010 Brian Weatherson

Learning to Compose Relational Embeddings in Knowledge Graphs Wenye Chen, Huda Hakami, Danushka

Talk to me Drupal Talk to me Drupal Using Drupal to power a Voice App Speaker notes Talk to me

j t t t q t t t t t q t t LAW-MWE-CxG-2018 @ COLING 2018 t q t q t t t t t q t t t t t

Microservices, Unikernels Portland State University CS 430P/530 Internet, Web &amp; Cloud Systems

Objects & Inheritance Section 7 Implementing Objects in 401 Ways of implementing objects:

Microservices, Unikernels Portland State University CS 430P/530 Internet, Web & Cloud Systems