CS325 Artificial Intelligence Ch. 24, Computer Vision I Object - PowerPoint PPT Presentation

CS325 Artificial Intelligence Ch. 24, Computer Vision I – Object Recognition Cengiz Günay, Emory Univ. Spring 2013 Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 1 / 27

Computer Vision Done with games, except homework :) Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 2 / 27

Computer Vision Done with games, except homework :) Vision is one of our main perceptions Computer vision is what robots use to understand their surrounding Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 2 / 27

Computer Vision Done with games, except homework :) Vision is one of our main perceptions Computer vision is what robots use to understand their surrounding 3 lectures: 1 Object recognition (today) 2 3D reconstruction 3 Motion analysis Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 2 / 27

Entry/Exit Surveys Exit survey: Advanced Planning Why isn’t classical planning schema adequate for resource planning? What is the advantage gained in abstract plans by having surely-reachable versus potentially-reachable states? Entry survey: Computer Vision I – Image Processing (0.25 points) List three specific tasks where computer vision would be desirable. What do you think are the major hurdles in computer vision? Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 3 / 27

How Machines See: Cameras A charge-coupled device (CCD) photo sensor array: Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 4 / 27

Focal Optics for Determining Distance and Size See the videos, I’ll summarize: X Z = x f Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 5 / 27

Focal Optics for Determining Distance and Size See the videos, I’ll summarize: X Z = x f What can we can figure out from this? Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 5 / 27

Focal Optics for Determining Distance and Size See the videos, I’ll summarize: X Z = x f What can we can figure out from this? Object’s distance ( Z ) & height ( X ) based on projection height ( x ) and focal distance ( f ) Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 5 / 27

We All See a Perspective Projection Vanishing points from parallel lines: Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 6 / 27

We All See a Perspective Projection Vanishing points from parallel lines: Giant panda, or just close? Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 6 / 27

Object Recognition: How Hard Can It Be? Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 7 / 27

Object Recognition: How Hard Can It Be? Problems? Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 7 / 27

Object Recognition: How Hard Can It Be? Problems? Rotation, scale, illumination, occlusion, viewpoint, deformation Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 7 / 27

Not Hard for Us Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 8 / 27

Not Hard for Us Diffuse reflection, bright Specularities Diffuse reflection, dark Cast shadow Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 8 / 27

Not Hard for Us Diffuse reflection, bright Specularities Diffuse reflection, dark Cast shadow How does our brain do it? Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 8 / 27

Not Hard for Us Diffuse reflection, bright Specularities Diffuse reflection, dark Cast shadow How does our brain do it? Will have examples later. Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 8 / 27

Invariance is Crucial for Computer Vision Must recognize objects invariant of their: Rotation, scale, illumination, occlusion, viewpoint, deformation Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 9 / 27

Invariance is Crucial for Computer Vision Must recognize objects invariant of their: Rotation, scale, illumination, occlusion, viewpoint, deformation Let’s start by simplifying: 1 Greyscale (monochrome) images 2 Pixels can have values: 0. . . 255 Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 9 / 27

Even Terminator Has Monochrome Vision Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 10 / 27

Extracting Features: Edge Detection Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 11 / 27

Extracting Features: Edge Detection How to detect the vertical edge? Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 11 / 27

Extracting Features: Edge Detection How to detect the vertical edge? 1 Spatial derivative? Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 11 / 27

Extracting Features: Edge Detection How to detect the vertical edge? 1 Spatial derivative? 2 Filter with mask: + 1 − 1 Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 11 / 27

Extracting Features: Edge Detection How to detect the vertical edge? 1 Spatial derivative? Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 11 / 27 2 Filter with mask: + 1 − 1

Edge Detection: Linear Filter What we did is called convolution : I ′ I ⊗ g = = ⊗ Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 12 / 27

Edge Detection: Linear Filter What we did is called convolution : I ′ I ⊗ g = = ⊗ For each pixel, we multiply by mask and sum: � I ′ ( x , y ) = I ( x − u , y − v ) g ( u , v ) u , v Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 12 / 27

Edge Detection: Linear Filter What we did is called convolution : I ′ I ⊗ g = = ⊗ For each pixel, we multiply by mask and sum: � I ′ ( x , y ) = I ( x − u , y − v ) g ( u , v ) u , v Does that equation look familiar? Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 12 / 27

Edge Detection: Linear Filter What we did is called convolution : I ′ I ⊗ g = = ⊗ For each pixel, we multiply by mask and sum: � I ′ ( x , y ) = I ( x − u , y − v ) g ( u , v ) u , v Does that equation look familiar? Perceptron? Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 12 / 27

Edge Detection: Linear Filter What we did is called convolution : I ′ I ⊗ g = = ⊗ For each pixel, we multiply by mask and sum: � I ′ ( x , y ) = I ( x − u , y − v ) g ( u , v ) u , v Does that equation look familiar? Perceptron? What are the weights? Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 12 / 27

Edge Detection: Linear Filter What we did is called convolution : I ′ I ⊗ g = = ⊗ For each pixel, we multiply by mask and sum: � I ′ ( x , y ) = I ( x − u , y − v ) g ( u , v ) u , v Does that equation look familiar? Perceptron? What are the weights? The mask, g . Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 12 / 27

Edge Detection: Linear Filter What we did is called convolution : I ′ I ⊗ g = = ⊗ For each pixel, we multiply by mask and sum: � I ′ ( x , y ) = I ( x − u , y − v ) g ( u , v ) u , v Does that equation look familiar? Perceptron? What are the weights? The mask, g . What’s the advantage? Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 12 / 27

Edge Detection: Linear Filter What we did is called convolution : I ′ I ⊗ g = = ⊗ For each pixel, we multiply by mask and sum: � I ′ ( x , y ) = I ( x − u , y − v ) g ( u , v ) u , v Does that equation look familiar? Perceptron? What are the weights? The mask, g . What’s the advantage? Works in parallel! Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 12 / 27

Neurons Can Do It Faster? Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 13 / 27

Detect Only Vertical Edges? Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 14 / 27

Horizontal and Vertical Gradients Original: Günay Ch. 24, Computer Vision I – Object Recognition Spring 2013 15 / 27

CS325 Artificial Intelligence Ch. 24, Computer Vision I Object - PowerPoint PPT Presentation

CS325 Artificial Intelligence Ch. 24, Computer Vision I Object Recognition Cengiz Gnay, Emory Univ. Spring 2013 Gnay Ch. 24, Computer Vision I Object Recognition Spring 2013 1 / 27 Computer Vision Done with games, except

CS325 Artificial Intelligence Computer Vision II 3D Vision (Ch. 24) Dr. Cengiz Gnay, Emory

Artificial Intelligence Artificial Intelligence Artificial Intelligence Study and design of

CS325 Artificial Intelligence Computer Vision III Structure from Motion (Ch. 24) Dr. Cengiz

Artificial Intelligence Course Presentation Summary Artificial Intelligence Motivations

Artificial Intelligence Course Presentation Summary Artificial Intelligence Motivations

Artificial intelligence Artificial Intelligence is the science of PHILOSOPHY OF ARTIFICIAL

Artificial Intelligence Intro (Chapter 1 of AIMA) Summary Artificial Intelligence What is AI?

Traditional Definition of Artificial Intelligence Trends Artificial Intelligence (AI) is

CS325 Artificial Intelligence Ch 14b Probabilistic Inference Cengiz Gnay Spring 2013

CS325 Artificial Intelligence Ch. 5, Games! Cengiz Gnay, Emory Univ. vs. Spring 2013 Gnay

CS325 Artificial Intelligence Ch. 21 Reinforcement Learning Cengiz Gnay, Emory Univ.

CS325 Artificial Intelligence Chs. 9, 12 Knowledge Representation and Inference Cengiz

CS325 Artificial Intelligence Spring 2013 Midterm Solution Guide Instructor: Cengiz Gunay,

CS325 Artificial Intelligence Ch. 7, 8, 9 Logic, Knowledge, and Inference Cengiz Gnay,

CS325 Artificial Intelligence Natural Language Processing II (Ch. 23) Dr. Cengiz Gnay, Emory

CS325 ARTIFICIAL INTELLIGENCE Introduction: Chapter 1 Outline Course overview What is

Computer vision techniques for video surveillance Huiyu Zhou, Ph.D. January, 2016 Film: Spectre

Soft modes from black hole microstates Onkar Parrikar Department of Physics and Astronomy

Introduction to Topological Data Analysis Persistent Homology Norm Matloff University of

An introduction to shape and topology optimization ric Bonnetier and Charles Dapogny

CPSC 4040/6040 Computer Graphics Images Joshua Levine levinej@clemson.edu Lecture 10 Point

CMPE 450/490 Capstone Project Intruder Alert System Jordan Tymburski

Physics and chemistry of irradiated protostars Johan E. Lindberg 1,2 Jes K. Jrgensen 2,1 , J. D.

CS 188: Artificial Intelligence Markov Decision Processes II Instructors: Dan Klein and Pieter