Lecture 2: Object Detection Professor Fei Fei Li Stanford Vision Lab - PowerPoint PPT Presentation

Lecture 2: Object Detection Professor Fei ‐ Fei Li Stanford Vision Lab 1 29 ‐ Mar ‐ 11 Lecture 2 - Fei-Fei Li

What we will learn today? • Visual recognition overview – Representation – Learning – Recognition • Implicit Shape Model – Representation – Recognition – Experiments and results 2 29 ‐ Mar ‐ 11 Lecture 2 - Fei-Fei Li

What are the different visual recognition tasks? 3 29 ‐ Mar ‐ 11 Fei-Fei Li Lecture 2 -

Categorization vs Single instance recognition Does this image contain the Chicago Macy building’s? 4 29 ‐ Mar ‐ 11 Fei-Fei Li Lecture 2 -

Categorization vs Single instance recognition Where is the crunchy nut? 5 29 ‐ Mar ‐ 11 Fei-Fei Li Lecture 2 -

Applications of computer vision • Recognizing landmarks in mobile platforms + GPS 6 29 ‐ Mar ‐ 11 Fei-Fei Li Lecture 2 -

Classification: Does this image contain a building? [yes/no] Yes! 7 29 ‐ Mar ‐ 11 Fei-Fei Li Lecture 2 -

Classification: Is this an beach? 8 29 ‐ Mar ‐ 11 Fei-Fei Li Lecture 2 -

Image Search Organizing photo collections 9 29 ‐ Mar ‐ 11 Fei-Fei Li Lecture 2 -

Detection: Does this image contain a car? [where?] car 10 29 ‐ Mar ‐ 11 Fei-Fei Li Lecture 2 -

Detection: Which object does this image contain? [where?] Building clock person car 11 29 ‐ Mar ‐ 11 Fei-Fei Li Lecture 2 -

Detection: Accurate localization (segmentation) clock 12 29 ‐ Mar ‐ 11 Fei-Fei Li Lecture 2 -

Detection: Estimating object semantic & geometric attributes Object: Building, 45º pose, 8 ‐ 10 meters away It has bricks Object: Person, back; 1 ‐ 2 meters away Object: Police car, side view, 4 ‐ 5 m away 13 29 ‐ Mar ‐ 11 Lecture 2 - Fei-Fei Li

Applications of computer vision Surveillance Assistive technologies Computational photography Assistive driving Security 14 29 ‐ Mar ‐ 11 Lecture 2 - Fei-Fei Li

Activity or Event recognition What are these people doing? 17 29 ‐ Mar ‐ 11 Fei-Fei Li Lecture 2 -

Visual Recognition • Design algorithms that are capable to – Classify images or videos – Detect and localize objects – Estimate semantic and geometrical attributes – Classify human activities and events Why is this challenging? 18 29 ‐ Mar ‐ 11 Lecture 2 - Fei-Fei Li

How many object categories are there? 19 29 ‐ Mar ‐ 11 Fei-Fei Li Lecture 2 -

Challenges: viewpoint variation Michelangelo 1475-1564 20 29 ‐ Mar ‐ 11 Fei-Fei Li Lecture 2 -

Challenges: illumination image credit: J. Koenderink 21 29 ‐ Mar ‐ 11 Fei-Fei Li Lecture 2 -

Challenges: scale 22 29 ‐ Mar ‐ 11 Fei-Fei Li Lecture 2 -

Challenges: deformation 23 29 ‐ Mar ‐ 11 Fei-Fei Li Lecture 2 -

Challenges: occlusion Magritte, 1957 24 29 ‐ Mar ‐ 11 Fei-Fei Li Lecture 2 -

Challenges: background clutter Kilmeny Niland. 1995 25 29 ‐ Mar ‐ 11 Fei-Fei Li Lecture 2 -

Challenges: intra ‐ class variation 26 29 ‐ Mar ‐ 11 Fei-Fei Li Lecture 2 -

Some early works on object categorization • Turk and Pentland, 1991 • Belhumeur, Hespanha, & Kriegman, 1997 • Schneiderman & Kanade 2004 • Viola and Jones, 2000 • Amit and Geman, 1999 • LeCun et al. 1998 • Belongie and Malik, 2002 • Schneiderman & Kanade, 2004 • Argawal and Roth, 2002 • Poggio et al. 1993 29 ‐ Mar ‐ 11 Lecture 2 -

Basic issues • Representation – How to represent an object category; which classification scheme? • Learning – How to learn the classifier, given training data • Recognition – How the classifier is to be used on novel data 28 29 ‐ Mar ‐ 11 Lecture 2 - Fei-Fei Li

Representation ‐ Building blocks: Sampling strategies Interest operators Dense, uniformly Image credits: L. Fei ‐ Fei, E. Nowak, J. Sivic Randomly Multiple interest operators 29 29 ‐ Mar ‐ 11 Lecture 2 - Fei-Fei Li

Representation – Appearance only or location and appearance 31 29 ‐ Mar ‐ 11 Lecture 2 - Fei-Fei Li

Representation – Invariances • View point • Illumination • Occlusion • Scale • Deformation • Clutter • etc. 32 29 ‐ Mar ‐ 11 Lecture 2 - Fei-Fei Li

Learning • Learning parameters: What are you maximizing? Likelihood (Gen.) or performances on train/validation set (Disc.) 43 29 ‐ Mar ‐ 11 Lecture 2 - Fei-Fei Li

Learning • Learning parameters: What are you maximizing? Likelihood (Gen.) or performances on train/validation set (Disc.) • Level of supervision • Manual segmentation; bounding box; image labels; noisy labels • Batch/incremental • Priors 44 29 ‐ Mar ‐ 11 Lecture 2 - Fei-Fei Li

Learning • Learning parameters: What are you maximizing? Likelihood (Gen.) or performances on train/validation set (Disc.) • Level of supervision • Manual segmentation; bounding box; image labels; noisy labels • Batch/incremental • Priors • Training images: •Issue of overfitting •Negative images for discriminative methods 45 29 ‐ Mar ‐ 11 Lecture 2 - Fei-Fei Li

Recognition – Recognition task: classification, detection, etc.. 47 29 ‐ Mar ‐ 11 Lecture 2 - Fei-Fei Li

Recognition – Recognition task – Search strategy: Sliding Windows Viola, Jones 2001, • Simple • Computational complexity (x,y, S, θ , N of classes) ‐ BSW by Lampert et al 08 ‐ Also, Alexe, et al 10 48 29 ‐ Mar ‐ 11 Lecture 2 - Fei-Fei Li

Recognition – Recognition task – Search strategy: Sliding Windows Viola, Jones 2001, • Simple • Computational complexity (x,y, S, θ , N of classes) ‐ BSW by Lampert et al 08 ‐ Also, Alexe, et al 10 • Localization • Objects are not boxes 49 29 ‐ Mar ‐ 11 Lecture 2 - Fei-Fei Li

Recognition – Recognition task – Search strategy: Sliding Windows Viola, Jones 2001, • Simple • Computational complexity (x,y, S, θ , N of classes) ‐ BSW by Lampert et al 08 ‐ Also, Alexe, et al 10 • Localization • Objects are not boxes • Prone to false positive Non max suppression: Canny ’86 …. Desai et al , 2009 50 29 ‐ Mar ‐ 11 Lecture 2 - Fei-Fei Li

Recognition • Savarese, 2007 – Recognition task • Sun et al 2009 • Liebelt et al., ’08, 10 – Search strategy • Farhadi et al 09 – Attributes Category: car Azimuth = 225º Zenith = 30 º ‐ It has metal ‐ it is glossy ‐ has wheels • Farhadi et al 09 • Lampert et al 09 • Wang & Forsyth 09 54 29 ‐ Mar ‐ 11 Fei-Fei Li Lecture 2 -

Recognition – Recognition task – Search strategy – Attributes – Context Semantic: • Torralba et al 03 • Rabinovich et al 07 • Gupta & Davis 08 • Heitz & Koller 08 • L ‐ J Li et al 08 • Yao & Fei ‐ Fei 10 Geometric • Hoiem, et al 06 • Gould et al 09 • Bao, Sun, Savarese 10 55 29 ‐ Mar ‐ 11 Fei-Fei Li Lecture 2 -

What we will learn today? • Visual recognition overview – Representation – Learning – Recognition • Implicit Shape Model – Representation – Recognition – Experiments and results 57 29 ‐ Mar ‐ 11 Lecture 2 - Fei-Fei Li

Implicit Shape Model (ISM) • Basic ideas x 1 – Learn an appearance codebook x 6 x 2 – Learn a star ‐ topology structural model x 5 x 3 x 4 • Features are considered independent given obj. center • Algorithm: probabilistic Gen. Hough Transform → – Exact correspondences Prob. match to object part → – NN matching Soft matching – Feature location on obj. → Part location distribution → – Uniform votes Probabilistic vote weighting → – Quantized Hough array Continuous Hough space Source: Bastian Leibe 58 29 ‐ Mar ‐ 11 Lecture 2 - Fei-Fei Li

Implicit Shape Model: Basic Idea • Visual vocabulary is used to index votes for object position [a visual word = “part”]. Visual codeword with displacement vectors Training image B. Leibe, A. Leonardis, and B. Schiele, Robust Object Detection with Interleaved Categorization and Segmentation, International Journal of Computer Vision, Vol. 77(1 ‐ 3), 2008. Source: Bastian Leibe 59 29 ‐ Mar ‐ 11 Lecture 2 - Fei-Fei Li

Implicit Shape Model: Basic Idea • Objects are detected as consistent configurations of the observed parts (visual words). Test image B. Leibe, A. Leonardis, and B. Schiele, Robust Object Detection with Interleaved Categorization and Segmentation, International Journal of Computer Vision, Vol. 77(1 ‐ 3), 2008. Source: Bastian Leibe 60 29 ‐ Mar ‐ 11 Lecture 2 - Fei-Fei Li

Lecture 2: Object Detection Professor Fei Fei Li Stanford Vision Lab - PowerPoint PPT Presentation

Lecture 2: Object Detection Professor Fei Fei Li Stanford Vision Lab 1 29 Mar 11 Lecture 2 - Fei-Fei Li What we will learn today? Visual recognition overview Representation Learning Recognition Implicit Shape Model

Object Oriented Object 3 Programming Object 1 Object 2 Object 4 For : COP 3330. Object

Detection, Segmentation Overview Object Detection deer cat Object Detection as Classification

Object Detection Sanja Fidler CSC420: Intro to Image Understanding 1 / 48 Object Detection The

Detection of neutral particles detection of neutrons detection of neutrinons detection of low

From image classification to object detection Image classification Object detection Image source

AutoML for Object Detection Xiangyu Zhang MEGVII Research 1 AutoML for Advances in AutoML

Lecture 11: Object detection Contains slides from S. Lazebnik, R. Girshick, B. Hariharan 1

Object-Oriented Databases Object Oriented Databases ODMG Standard Object Model, Object

Object oriented Object oriented Object oriented Object oriented approach and UML approach and

CS6501: Deep Learning for Visual Recognition Object Detection: RCNN, Fast-RCNN, Faster-RCNN

Object Detection Ujjwal Post-Doc, STARS Team INRIA Sophia Antipolis Outline What is Object

A Review on Salient Object Detection Feng Lin Salient Object Detection Target Detect and

Object Space Volume Rendering Object Space Volume Rendering Ronald Peikert SciVis 2010 - Object

Multi-Object Tracking Challenge CV3DST Lecture Exercises Multi-Object Tracking Multi-Object

Holistic Scene Understanding for 3D Object Detection with RGB-D cameras Dahua Lin, Sanja Fidler,

Deep Neural Networks for Object Detection Paper by C. Szegedy, A. Toshev, D. Erhan [2013]

Tessellation: Tiling a plane Filling a plane with a shape or image no gaps From Latin

Information Extraction Philipp Koehn 28 October 2019 Philipp Koehn Introduction to Human

Optimal Control of Parabolic Equations in Tailored Control Spaces Christian Meyer TU Dortmund,

The Challenge to Adapt: New Prac5ces for a New Era in the Arts Richard

Using conte x t managers W R ITIN G FU N C TION S IN P YTH ON Sha y ne Miel Director of So w

F u nctions as objects W R ITIN G FU N C TION S IN P YTH ON Sha y ne Miel Director of So w

Mommy, When I Grow Up, I Want T o Be An Architect! Mommy, When I Grow Up, I Want T o Be An

Microinteractions.01 India HCI 2016. 7 Dec 2016 Venkatesh Rajamanickam (@venkatrajam)