Introductions Instructor : Prof. Kristen Grauman TA : Dongguang - PDF document

Honors Machine Vision Jan 17, 2017 Kristen Grauman, University of Texas at Austin Introductions • Instructor : Prof. Kristen Grauman • TA : Dongguang You 1

Today • Course overview • Requirements, logistics What is computer vision? Done? 2

Computer Vision • Automatic understanding of images and video 1. Computing properties of the 3D world from visual data (measurement) 1. Vision for measurement Real-time stereo Structure from motion Tracking NASA Mars Rover Demirdjian et al. Snavely et al. Wang et al. 3

Computer Vision • Automatic understanding of images and video 1. Computing properties of the 3D world from visual data (measurement) 2. Algorithms and representations to allow a machine to recognize objects, people, scenes, and activities. (perception and interpretation) 2. Vision for perception, interpretation Objects amusement park sky Activities Scenes Locations The Wicked Cedar Point Text / writing Twister Faces Gestures Ferris ride Motions wheel ride Emotions… 12 E Lake Erie water ride tree tree people waiting in line people sitting on ride umbrellas tree maxair carousel deck bench tree pedestrians 4

Computer Vision • Automatic understanding of images and video 1. Computing properties of the 3D world from visual data (measurement) 2. Algorithms and representations to allow a machine to recognize objects, people, scenes, and activities. (perception and interpretation) 3. Algorithms to mine, search, and interact with visual data ( search and organization ) 3. Visual search, organization Query Image or video Relevant archives content 5

Computer Vision • Automatic understanding of images and video 1. Computing properties of the 3D world from visual data (measurement) 2. Algorithms and representations to allow a machine to recognize objects, people, scenes, and activities. (perception and interpretation) 3. Algorithms to mine, search, and interact with visual data ( search and organization ) Course focus Related disciplines Artificial intelligence Machine Graphics learning Computer vision Image Cognitive processing science Algorithms 6

Vision and graphics Images Model Vision Graphics Inverse problems: analysis and synthesis. Visual data in 1963 L. G. Roberts, Machine Perception of Three Dimensional Solids, Ph.D. thesis, MIT Department of Electrical Engineering, 1963. 7

Visual data in 2017 Movies, news, sports Personal photo albums Medical and scientific images Surveillance and security Slide credit; L. Lazebnik Why vision? • As image sources multiply, so do applications – Relieve humans of boring, easy tasks – Enhance human abilities – Advance human-computer interaction, visualization – Perception for robotics / autonomous agents – Organize and give access to visual content 8

Faces and digital cameras Setting camera Camera waits for focus via face everyone to smile to detection take a photo [Canon] Linking to info with a mobile device Situated search Yeh et al., MIT Google Goggles MSR Lincoln kooaba 9

Video-based interfaces Assistive technology systems Human joystick, NewsBreaker Live Camera Mouse, Boston College Microsoft Kinect What else? 10

Vision for medical & neuroimages fMRI data Golland et al. Image guided surgery MIT AI Vision Group Special visual effects The Matrix Mocap for Pirates of the Carribean , Industrial Light and Magic Source: S. Seitz What Dreams May Come 11

Safety & security Navigation, driver safety Monitoring pool (Poseidon) Surveillance Pedestrian detection MERL, Viola et al. Obstacles? 12

What the computer gets Why is vision difficult? • Ill-posed problem: real world much more complex than what we can measure in images – 3D  2D • Impossible to literally “invert” image formation process 13

Challenges: many nuisance parameters Illumination Object pose Clutter Intra-class Occlusions Viewpoint appearance Challenges: intra-class variation slide credit: Fei-Fei, Fergus & Torralba 14

Challenges: importance of context Challenges: importance of context 15

Challenges: importance of context slide credit: Fei-Fei, Fergus & Torralba Challenges: complexity • Millions of pixels in an image • 30,000 human recognizable object categories • 30+ degrees of freedom in the pose of articulated objects (humans) • Billions of images online • 144K hours of new video on YouTube daily • … • About half of the cerebral cortex in primates is devoted to processing visual information [Felleman and van Essen 1991] 16

Progress charted by datasets Roberts 1963 COIL 1963 … 1996 Progress charted by datasets MIT-CMU Faces MIT-CMU Faces MIT-CMU Faces INRIA Pedestrians INRIA Pedestrians INRIA Pedestrians UIUC Cars UIUC Cars UIUC Cars 1963 … 1996 2000 17

Progress charted by datasets MSRC 21 Objects MSRC 21 Objects MSRC 21 Objects Caltech-101 Caltech-101 Caltech-101 Caltech-256 Caltech-256 Caltech-256 1963 … 2005 1996 2000 Progress charted by datasets ImageNet ImageNet ImageNet 80M Tiny Images 80M Tiny Images 80M Tiny Images PASCAL VOC PASCAL VOC PASCAL VOC PASCAL VOC PASCAL VOC Birds-200 Birds-200 Birds-200 Faces in the Wild Faces in the Wild Faces in the Wild 1963 … 1996 2000 2005 2007 2008 2013 18

Expanding horizons: large-scale recognition Expanding horizons: captioning https://pdollar.wordpress.com/2015/01/21/image-captioning/ 19

Expanding horizons: vision for autonomous vehicles KITTI dataset – Andreas Geiger et al. Expanding horizons: interactive visual search WhittleSearch – Adriana Kovashka et al. 20

Expanding horizons: first-person vision Activities of Daily Living – Hamed Pirsiavash et al. Brainstorm Pick an application or task among any of those we’ve described so far. 1. What functionality should the system have? 2. Intuitively, what are the technical sub-problems that must be solved? 21

Goals of this course • Upper division honors undergrad course • Introduction to primary topics – Fundamentals of computer vision – image processing, grouping, multiple views – Recognition - emphasis on learning (~last third of the course) • Hands-on experience with algorithms • Views of vision as a research area Topics overview • Features & filters • Grouping & fitting • Multiple views • Recognition 22

Features and filters Transforming and describing images; textures, colors, edges Grouping & fitting [fig from Shi et al] Clustering, segmentation, fitting; what parts belong together? 23

Multiple views Matching, invariant features, stereo vision, instance recognition Lowe Hartley and Zisserman Fei-Fei Li Recognition and learning Recognizing categories (objects, scenes, activities, attributes…), learning techniques 24

Textbooks • Recommended book: – Computer Vision: Algorithms and Applications – By Rick Szeliski – http://szeliski.org/Book/ Requirements / Grading • Problem sets (50%) • Midterm exam (15%) • Final exam (25%) • Class participation, including attendance (10%) • Check grades on Canvas – A quote from a prior student evaluation: “To be honest, I think without going to class, the course would be very hard. “ 25

Assignments • Majority - Programming problem – Implementation – Explanation, results • Code in Matlab – available on CS Unix machines (see course page) • Optional Latex templates • Most of these assignments take significant time to do. We recommend starting early. Matlab • Built-in toolboxes for low- level image processing, visualization • Compact programs • Intuitive interactive debugging • Widely used in engineering 26

Assignment 0 • A0: Matlab warmup + basic image manipulation • Out today, due Fri Jan 27 • Verify CS account and Matlab access • Look at the tutorial online Digital images Images as matrices 27

Digital images width 520 j=1 i=1 Intensity : [0,255] 500 height im[176][201] has value 164 im[194][203] has value 37 Color images, RGB color space R G B 28

Preview of assignments Seam carving Preview of assignments Grouping for segmentation 29

Preview of assignments Image mosaics / stitching Image from Fei-Fei Li Preview of assignments Matching and recognition 30

Preview of assignments Object detection Collaboration policy All responses and code must be written individually unless otherwise specified. Students submitting answers or code found to be identical or substantially similar (due to inappropriate collaboration) risk failing the course. 31

Assignment deadlines • Due about every two weeks – tentative deadlines posted online but could slightly shift depending on lecture pace • Assignments in by 11:59 PM on due date – Submit on Canvas, following submission instructions given in assignment. – Deadlines are firm. We’ll use timestamp. • Use Piazza, office hours for questions Miscellaneous • Slides, announcements via class website • No laptops, phones, tablets, etc. open in class please. 32

Coming up • Now: check out Matlab tutorial online • A0 due Fri Jan 27 • Textbook reading posted for next week 33

Introductions Instructor : Prof. Kristen Grauman TA : Dongguang - PDF document

Honors Machine Vision Jan 17, 2017 Kristen Grauman, University of Texas at Austin Introductions Instructor : Prof. Kristen Grauman TA : Dongguang You 1 Today Course overview Requirements, logistics What is computer vision?

Bus Use of Shoulders in ODOT District 12 March 19, 2015 Introductions Introductions

COURSE SELECTION 2016-2017 School Administration INTRODUCTIONS INTRODUCTIONS INTRODUCTIONS

For Muswell Hill Primary School Phase 2 Conditions Works Introductions ! 1. Introductions

Welcome & Introductions Welcome & Introductions Ken Frazier, Chairman and Chief

Welcome & Introductions Welcome & Introductions Ken Frazier, Chairman and Chief

Transportation Commission RETREAT March 2, 2019 Introductions Agenda Introductions

Court Sign Language Interpreting Standards Standards Introductions Introductions Emily Hill

Investor Presentation 19 November 2015 Introductions and Agenda Introductions Board

Standard Expenditure Coding July 2019 Welcome and Introductions Introductions Please

WELCOME Training Agenda, Introductions & Info AGENDA INTRODUCTIONS EMERGENCY

1 AGENDA Safety minute Introductions Introductions PEL definition, goals, and

Healthcare William Lilley Dr Helen Smith Introductions Plan for the workshop Introductions

Dr r Jo Jona nathan Ll Lloyd Introductions Group Introductions including hopes and fears

1 AGENDA Safety minute Introductions Introductions PEL definition, goals, and

Welcome/Introductions Public Meeting Agenda June 2, 2015 Introductions Presentation

media. Introductions and Welcome Introductions and Welcome Dr. Terri Cox-Cruey Kenton County

VIREO-TNO @ TRECVID 2014 Zero-Shot Event Detection and Recounting Speaker: Maaike de Boer (TNO)

Value Proposition How Would You Define YOUR Value Proposition? F oc us c ome s Target fr om c

CPE/CSC 481: Knowledge-Based Systems Franz J. Kurfess Computer Science Department California

Welcome to Intelligent Robotics McGill COMP 765 Sept 5 th , 2017 Robotics Today The

Can Some Programming Languages Be Considered Harmful? S.Janssens U.P.Schultz V.Zaytsev

Proposal to the Quekett Microscopical Club The Arthur Earland Collection of Foram inifera Slides

Mesozoic Cenozoic Bolides! Asteroid: A rocky or metallic minor planet, or planetoid Comet: an

FORCE AGENDA Goals of the Meeting Recap of Current Status Review Draft Report Sections

Sambuz

Useful Links

Newsletter

Mail Us

Introductions Instructor : Prof. Kristen Grauman TA : Dongguang - PDF document

Honors Machine Vision Jan 17, 2017 Kristen Grauman, University of Texas at Austin Introductions Instructor : Prof. Kristen Grauman TA : Dongguang You 1 Today Course overview Requirements, logistics What is computer vision?

Bus Use of Shoulders in ODOT District 12 March 19, 2015 Introductions Introductions

COURSE SELECTION 2016-2017 School Administration INTRODUCTIONS INTRODUCTIONS INTRODUCTIONS

For Muswell Hill Primary School Phase 2 Conditions Works Introductions ! 1. Introductions

Welcome &amp; Introductions Welcome &amp; Introductions Ken Frazier, Chairman and Chief

Welcome &amp; Introductions Welcome &amp; Introductions Ken Frazier, Chairman and Chief

Transportation Commission RETREAT March 2, 2019 Introductions Agenda Introductions

Court Sign Language Interpreting Standards Standards Introductions Introductions Emily Hill

Investor Presentation 19 November 2015 Introductions and Agenda Introductions Board

Standard Expenditure Coding July 2019 Welcome and Introductions Introductions Please

WELCOME Training Agenda, Introductions &amp; Info AGENDA INTRODUCTIONS EMERGENCY

1 AGENDA Safety minute Introductions Introductions PEL definition, goals, and

Healthcare William Lilley Dr Helen Smith Introductions Plan for the workshop Introductions

Dr r Jo Jona nathan Ll Lloyd Introductions Group Introductions including hopes and fears

1 AGENDA Safety minute Introductions Introductions PEL definition, goals, and

Welcome/Introductions Public Meeting Agenda June 2, 2015 Introductions Presentation

media. Introductions and Welcome Introductions and Welcome Dr. Terri Cox-Cruey Kenton County

VIREO-TNO @ TRECVID 2014 Zero-Shot Event Detection and Recounting Speaker: Maaike de Boer (TNO)

Value Proposition How Would You Define YOUR Value Proposition? F oc us c ome s Target fr om c

CPE/CSC 481: Knowledge-Based Systems Franz J. Kurfess Computer Science Department California

Welcome to Intelligent Robotics McGill COMP 765 Sept 5 th , 2017 Robotics Today The

Can Some Programming Languages Be Considered Harmful? S.Janssens U.P.Schultz V.Zaytsev

Proposal to the Quekett Microscopical Club The Arthur Earland Collection of Foram inifera Slides

Mesozoic Cenozoic Bolides! Asteroid: A rocky or metallic minor planet, or planetoid Comet: an

FORCE AGENDA Goals of the Meeting Recap of Current Status Review Draft Report Sections

Sambuz

Useful Links

Newsletter

Mail Us

Welcome & Introductions Welcome & Introductions Ken Frazier, Chairman and Chief

Welcome & Introductions Welcome & Introductions Ken Frazier, Chairman and Chief

WELCOME Training Agenda, Introductions & Info AGENDA INTRODUCTIONS EMERGENCY