

SLIDE 1

1/22/2009 1

Visual Recognition & Search

January 22, 2009

Introductions

  • Class:

Thursday 3:30-6:30 PM

  • Instructor:

Kristen Grauman
grauman at cs.utexas.edu
CSA 114

  • Office hours:

by appointment

  • TA:

Harshdeep Singh

  • Class page:

link from

http://www.cs.utexas.edu/~grauman/

Check for updates to schedule.

SLIDE 2

My office: CSA 114

Plan for today

  • Topic overview: What is visual recognition and search? Why are these hard problems? What sorta works?

  • Course overview: Requirements, syllabus tour

SLIDE 3

Computer Vision

  • Automatic understanding of images and video

– Computing properties of the 3D world from visual data (measurement)
– Algorithms and representations to allow a machine to recognize objects, people, scenes, and activities (perception and interpretation)
– Algorithms to mine, search, and interact with visual data (search and organization)

Vision for measurement

Real-time stereo Structure from motion Tracking

NASA Mars Rover Demirdjian et al. Snavely et al. Wang et al.

SLIDE 4

Vision for perception, interpretation

Objects, activities, scenes, locations, text / writing, faces, gestures, motions, emotions…

[Figure: an annotated amusement park photo (Cedar Point) with region labels such as sky, water, Ferris wheel, ride, trees, Lake Erie, carousel, Wicked Twister, people waiting in line.]

Visual search, organization

[Figure: a query image matched against image or video archives to retrieve relevant content]

SLIDE 5

Why recognition and search?

– Recognition is a fundamental part of perception

  • e.g., robots, autonomous agents

– Organize and give access to visual content

  • Connect to information
  • Detect trends and themes
  • Why now?

Vision in 1963

  • L. G. Roberts, Machine Perception of Three-Dimensional Solids, Ph.D. thesis, MIT Department of Electrical Engineering, 1963.

SLIDE 6

Today: visual data in the wild

Personal photo albums · movies, news, sports · surveillance and security · medical and scientific images

Slide credit: L. Lazebnik

Today: visual data in the wild

916,271 movie/news/sports titles · 350 mil. photos, 1 mil. added daily · 10 mil. videos, 65,000 added daily · 1.6 bil. Web images indexed as of summer 2005 · satellite imagery · city streets

Slide by Lana Lazebnik

SLIDE 7

Autonomous agents able to detect objects

http://www.darpa.mil/grandchallenge/gallery.asp

Linking to info with a mobile device

kooaba · Situated search (Yeh et al., MIT) · MSR Lincoln

SLIDE 8

Finding visually similar objects

Exploring community photo collections

Snavely et al. Simon & Seitz

SLIDE 9

Discovering visual patterns

Objects · Actions · Categories

Sivic & Zisserman · Lee & Grauman · Wang et al.

Plan for today

  • Topic overview: What is visual recognition and search? Why are these hard problems? What sorta works?

  • Course overview: Requirements, syllabus tour

SLIDE 10

The Instance-Level Recognition Problem

John’s car

The Categorization Problem

  • How to recognize ANY car
SLIDE 11

Levels of Object Categorization

  • Different levels of recognition

“cow” “motorbike” “car”

[Slides adapted from K. Grauman & B. Leibe, Visual Object Recognition Tutorial]

Which object class is in the image?

⇒ Obj/Img classification

Where is it in the image?

⇒ Detection/Localization

Where exactly ― which pixels?

⇒ Figure/Ground segmentation

Object Categorization

  • Task Description

“Given a small number of training images of a category, recognize a-priori unknown instances of that category and assign the correct category label.”

  • Which categories are feasible visually?

[Figure: category hierarchy — living being, animal, dog, German shepherd, “Fido”]

SLIDE 12

Visual Object Categories

  • Basic Level Categories in human categorization [Rosch 76, Lakoff 87]

– The highest level at which category members have similar perceived shape
– The highest level at which a single mental image reflects the entire category
– The level at which human subjects are usually fastest at identifying category members
– The first level named and understood by children
– The highest level at which a person uses similar motor actions for interaction with category members

Visual Object Categories

  • Basic-level categories in humans seem to be defined predominantly visually.
  • There is evidence that humans (usually) start with basic-level categorization before doing identification.

⇒ Basic-level categorization is easier and faster for humans than object identification!

⇒ How does this transfer to automatic classification algorithms?

[Figure: abstract levels (animal, quadruped, …), basic level (dog, cat, cow, …), individual level (German shepherd, Doberman, …, “Fido”)]

SLIDE 13

How many object categories are there?

Biederman 1987

Source: Fei-Fei Li, Rob Fergus, Antonio Torralba.

SLIDE 14

Other Types of Categories

  • Functional Categories

e.g. chairs = “something you can sit on”

  • Ad-hoc categories

e.g. “something you can find in an office environment”
SLIDE 15

Challenges: robustness

Illumination, object pose, clutter, viewpoint, intra-class appearance, occlusions.

Challenges: robustness

Realistic scenes are crowded, cluttered, have overlapping objects.

SLIDE 16

Challenges: importance of context

slide credit: Fei-Fei, Fergus & Torralba

Challenges: importance of context

SLIDE 17

Challenges: complexity

  • Thousands to millions of pixels in an image
  • 3,000-30,000 human recognizable object categories
  • 30+ degrees of freedom in the pose of articulated objects (humans)
  • Billions of images indexed by Google Image Search
  • 18 billion+ prints produced from digital camera images in 2004
  • 295.5 million camera phones sold in 2005
  • About half of the cerebral cortex in primates is devoted to processing visual information [Felleman and van Essen 1991]

Challenges: learning with minimal supervision

[Figure: spectrum of supervision, from more to less]

SLIDE 18

What “works” today

  • Reading license plates, zip codes, checks

Source: Lana Lazebnik

What “works” today

  • Reading license plates, zip codes, checks


  • Fingerprint recognition

Source: Lana Lazebnik

SLIDE 19

What “works” today

  • Reading license plates, zip codes, checks


  • Fingerprint recognition
  • Face detection

Source: Lana Lazebnik

What “works” today

  • Reading license plates, zip codes, checks


  • Fingerprint recognition
  • Face detection
  • Recognition of flat textured objects (CD covers, book covers, etc.)

Source: Lana Lazebnik

SLIDE 20

  • Active research area with exciting progress!


Today’s challenge

SLIDE 21

This course

  • Focus on current research in:

– visual category and object recognition
– image/video retrieval
– organization, exploration, interaction with visual content

  • High-level vision and learning problems, innovative applications.

Goals

  • Understand current approaches
  • Analyze
  • Identify interesting research questions
SLIDE 22

Expectations

  • Discussions will center on recent papers in the field

– Paper reviews

  • Student presentations

– Papers and background reading
– Demos

  • Projects

– Research-oriented

  • Workload = reasonably high

Prerequisites

  • Courses in:

– Computer vision
– Machine learning
– Basic probability
– Linear algebra

  • Ability to analyze high-level conference papers

SLIDE 23

Paper reviews

  • For each class, review two of the assigned papers.

  • Post by Wed night 10 PM on Google docs (instructions are on Blackboard)

  • Don’t review papers the week(s) you are presenting.

Paper review guidelines

  • Brief (2-3 sentences) summary
  • Main contribution
  • Strengths? Weaknesses?
  • How convincing are the experiments?

Suggestions to improve them?

  • Extensions?
  • Additional comments, unclear points
  • Relationships observed between the papers we are reading

  • ½ page to 1 page.
SLIDE 24

Presentation guidelines

  • Read 3-4 selected papers in topic area
  • Well-organized talk, about 30 minutes

  • What to cover?

– Problem overview, motivation
– Algorithm explanation, technical details
– Any commonalities, important differences between techniques covered in the papers.

  • See class webpage for more details.

Demo guidelines

  • Implement/download code for a main idea in the paper and show us toy examples:

– Experiment with different types of (mini) training/testing data sets
– Evaluate sensitivity to important parameter settings
– Show (on a small scale) an example in practice that highlights a strength/weakness of the approach

  • Present in class – about 20-30 minutes.
  • Post webpage with links to any tools or data.
SLIDE 25

Timetable for presenters

  • By the Thursday the week before your

presentation is scheduled:

– Email draft slides to me, and schedule a time to meet and discuss.

  • The week of your presentation:

– Refine slides, practice presentation, know about how long each part requires.


  • The day of your presentation:

– Send final slides (and, for demos, pointer to webpage) to me.

Presenter feedback

  • Preparedness


  • Coverage of topic
  • Organization and clarity of presentation
  • Enthusiasm, use of engaging examples
  • Serves to start discussion, quality of discussion points raised

SLIDE 26

Demo feedback

  • Preparedness


  • Clarity of message and organization
  • Technical detail and relevance to reading
  • Enthusiasm, use of engaging examples

Projects

Possibilities:

– Extend a technique studied in class
– Analysis and empirical evaluation of a technique
– Comparison between two approaches
– Design and evaluate a novel approach

  • Work in pairs
SLIDE 27

Grading policy

  • 20% participation

– includes attendance and paper reviews

  • 20% demo
  • 20% paper presentation
  • 40% project

Important dates

  • March 26: project proposals due (tentative)
  • April 16: project progress report / draft (tentative)

  • May 7 : Final project papers due
  • May 7 and May 8 : Final presentations

– May 8 is Friday after last class.

SLIDE 28

Syllabus tour

I. Categorizing and matching objects
II. Surrounding cues
III. Data-driven visual learning
IV. Searching and browsing visual content

Sliding windows and global representations

  • Sliding window protocol for detection
  • Good features for “patch” appearance, global descriptors
  • Building detectors with discriminative classifiers
  • (Next week)
  • Faces, pedestrians as case studies
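As a concrete illustration of the protocol above, here is a minimal, hypothetical sliding-window sketch (plain Python, not the course's code): the stand-in `score_window` takes the place of a trained discriminative classifier and simply fires on bright regions.

```python
# Minimal sliding-window detection sketch (illustrative only).
# A trained classifier would replace `score_window`; here it is a
# stand-in that scores windows by mean intensity.

def score_window(image, r, c, h, w):
    """Stand-in for a discriminative classifier score on one window."""
    patch = [image[i][c:c + w] for i in range(r, r + h)]
    total = sum(sum(row) for row in patch)
    return total / float(h * w)  # mean intensity as a dummy score

def sliding_window_detect(image, h, w, stride, threshold):
    """Slide an h x w window over the image; keep windows above threshold."""
    rows, cols = len(image), len(image[0])
    detections = []
    for r in range(0, rows - h + 1, stride):
        for c in range(0, cols - w + 1, stride):
            s = score_window(image, r, c, h, w)
            if s >= threshold:
                detections.append((r, c, s))
    return detections

# Toy "image": a bright 2x2 block inside a dark 4x4 image.
img = [[0, 0, 0, 0],
       [0, 9, 9, 0],
       [0, 9, 9, 0],
       [0, 0, 0, 0]]
print(sliding_window_detect(img, 2, 2, 1, 5.0))  # only the window on the block
```

In a real detector the same loop would also run over multiple scales, and overlapping detections would be merged by non-maximum suppression.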

SLIDE 29

Distances and kernels, bags of words representations

Local features: interest operators and descriptors. How to summarize local content? How to match or compare images with local descriptors?

  • Constructing a visual “vocabulary”
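To make the bag-of-words idea concrete, here is a small, hypothetical sketch: local descriptors are quantized against a visual vocabulary and the image is summarized as a word-count histogram. The vocabulary here is hand-picked 2-D points for illustration; in practice it would come from clustering (e.g. k-means) over many training descriptors.

```python
# Bag-of-words sketch: quantize local descriptors against a visual
# vocabulary and summarize an image as a word-count histogram.
# The vocabulary below is hand-picked; in practice it would be learned
# by clustering descriptors from training images.

def nearest_word(desc, vocabulary):
    """Index of the vocabulary word closest to the descriptor (L2 distance)."""
    def dist2(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))
    return min(range(len(vocabulary)), key=lambda k: dist2(desc, vocabulary[k]))

def bag_of_words(descriptors, vocabulary):
    """Histogram of visual-word counts for one image."""
    hist = [0] * len(vocabulary)
    for d in descriptors:
        hist[nearest_word(d, vocabulary)] += 1
    return hist

vocab = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0)]        # three toy "visual words"
descs = [(0.1, 0.1), (0.9, 0.2), (0.1, 0.9), (0.0, 1.1)]
print(bag_of_words(descs, vocab))  # → [1, 1, 2]
```

Two images can then be compared by comparing their histograms, regardless of how many local features each one contained.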

SLIDE 30

Distances and kernels, bags of words representations

Correspondence kernels

  • how to compute matches efficiently?

Learning feature significance

  • which features are most discriminative?
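One simple, standard way to compare two bag-of-words histograms (a toy stand-in for the correspondence kernels discussed here) is histogram intersection, which counts how much mass the histograms share bin by bin:

```python
# Histogram intersection: sum of bin-wise minima between two histograms.
# Larger values mean more shared visual-word mass (i.e., more similar images).

def histogram_intersection(h1, h2):
    """Kernel value between two equal-length count histograms."""
    return sum(min(a, b) for a, b in zip(h1, h2))

print(histogram_intersection([2, 0, 3], [1, 1, 3]))  # → 4
```

Because it is a valid Mercer kernel, histogram intersection can plug directly into kernel-based classifiers such as SVMs.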
SLIDE 31

Part-based models

  • Representing part appearance plus structure
  • Summarizing repeated parts
  • Efficient matching

Image annotation process

Classifiers can be trained from labeled data…

SLIDE 32

Image annotation process

  • What data should be labeled?
  • How can the task be streamlined with semi-automatic tools?
  • How can it be more enticing?
  • What makes an image dataset useful/not so useful?

Rother et al. · Von Ahn et al.

SLIDE 33

Syllabus tour

I. Categorizing and matching objects
II. Surrounding cues
III. Data-driven visual learning
IV. Searching and browsing visual content

Inferring 3D cues from single images

Geometric context is important to scene understanding.

  • What are the primary surfaces and their orientations?
  • How can this be inferred with a single snapshot?

Hoiem et al. · Yu et al.

SLIDE 34

Scene recognition

Many objects occur only in certain scenes, and scene types are a useful summary of a shot.

  • What kind of scene is it? Indoor/outdoor, city/mountain?
  • Holistic representations for scenes

Oliva & Torralba · FeiFei & Perona
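As a toy illustration of a holistic scene representation, the sketch below pools image intensities on a coarse grid, producing one global descriptor per image. This is a hypothetical stand-in for richer global descriptors like gist, which pool oriented filter responses instead of raw intensities.

```python
# Holistic scene descriptor sketch: mean intensity in each cell of a
# g x g grid over the image. One fixed-length vector summarizes the
# whole image, regardless of which objects it contains.

def coarse_grid_descriptor(image, g):
    """Mean intensity per cell of a g x g grid (row-major order)."""
    rows, cols = len(image), len(image[0])
    rh, cw = rows // g, cols // g
    desc = []
    for i in range(g):
        for j in range(g):
            cell = [image[r][j * cw:(j + 1) * cw]
                    for r in range(i * rh, (i + 1) * rh)]
            vals = [v for row in cell for v in row]
            desc.append(sum(vals) / float(len(vals)))
    return desc

img = [[0, 0, 8, 8],
       [0, 0, 8, 8],
       [4, 4, 0, 0],
       [4, 4, 0, 0]]
print(coarse_grid_descriptor(img, 2))  # → [0.0, 8.0, 4.0, 0.0]
```

Scenes with similar overall layout land near each other in this descriptor space even when their individual objects differ.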

SLIDE 35

Context

  • The context of the scene, the other objects, and the spatial layout could tell us a lot about what is reasonable to detect.

Context

Hoiem et al.

SLIDE 36

Context

Torralba et al.

Context

Galleguillos et al.

SLIDE 37

Syllabus tour

I. Categorizing and matching objects
II. Surrounding cues
III. Data-driven visual learning
IV. Searching and browsing visual content

Leveraging internet data

  • The internet offers unprecedented access to lots of data: both images and surrounding cues.

Quack et al.

Mining for themes, connecting to geo-tags

SLIDE 38

Leveraging internet data

The value of volume Dealing with noisy sources

Torralba et al.

Text, language, and imagery

SLIDE 39

Text, language, and imagery

Everingham et al.

Text, language, and imagery

Cour et al.

SLIDE 40

Unsupervised learning and discovery

  • What are common visual patterns?
  • What is unusual, or salient?

Boiman et al.

Syllabus tour

I. Categorizing and matching objects
II. Surrounding cues
III. Data-driven visual learning
IV. Searching and browsing visual content
SLIDE 41

Fast indexing and search

  • With large archives, how to access the relevant content rapidly with good image metrics?

Nister et al.
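A core trick behind fast search over large archives is the inverted file: each visual word maps to the set of images containing it, so a query touches only images that share at least one word with it instead of scanning the whole collection. The sketch below is a minimal, hypothetical illustration (real systems like vocabulary trees add hierarchical quantization and tf-idf weighting):

```python
# Inverted-file sketch for bag-of-words image search.
from collections import defaultdict

def build_inverted_index(image_words):
    """image_words: {image_id: set of visual-word ids} -> {word: set of images}."""
    index = defaultdict(set)
    for img, words in image_words.items():
        for w in words:
            index[w].add(img)
    return index

def query(index, query_words):
    """Rank candidate images by the number of visual words shared with the query."""
    votes = defaultdict(int)
    for w in query_words:
        for img in index.get(w, ()):
            votes[img] += 1
    return sorted(votes.items(), key=lambda kv: -kv[1])

db = {"a": {1, 2, 3}, "b": {3, 4}, "c": {5}}
idx = build_inverted_index(db)
print(query(idx, {2, 3}))  # "a" shares two words, "b" one, "c" never touched
```

Image "c" is never examined at query time, which is exactly why this structure scales to very large archives.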

Browsing: query refinement and summarization

  • How will a user peruse resulting content efficiently?
  • How can a user intervene in the search process?
  • Visualizing the aggregation of multiple users’ photos

Sahbi et al.

SLIDE 42

Browsing: query refinement and summarization

Snavely et al.

SLIDE 43

Social networks and image tagging

  • What information (helpful for recognition) does a community of users provide?
  • Why and how do people contribute tags?
  • When do they agree? What is objective?

SLIDE 44

Not covered in this course

  • Low-level processing
  • Basic machine learning methods
  • I will assume you already know these, or are willing to pick them up on your own.

Schedule

22-Jan  Introduction
29-Jan  Categorizing and matching objects: Global appearance, window-based recognition
5-Feb   Distances and kernels
12-Feb  Part-based models
19-Feb  Image annotation process
26-Feb  Surrounding cues: Inferring 3d cues from a single image
5-Mar   Scene recognition
12-Mar  Context
19-Mar  Spring break – no class
26-Mar  Data-driven visual learning: Leveraging internet data
2-Apr   Text, language, and imagery
9-Apr   Unsupervised learning and discovery
16-Apr  Searching and browsing visual content: Fast indexing and search
23-Apr  Browsing: query refinement and summarization
30-Apr  Social networks and image tagging
7-May   Final project presentations
8-May   Final project presentations

SLIDE 45

For next week

  • Read and review:

– Viola & Jones, CVPR 2001
– Dalal & Triggs, CVPR 2005
– Review syllabus, select topic preferences (3 for demo, 3 for paper topics)

  • Email me by Monday.
  • First student presenters will be on Feb 5.