CS4495/6495 Introduction to Computer Vision 1A-L1 Introduction


slide-1
SLIDE 1

1A-L1 Introduction

CS4495/6495 Introduction to Computer Vision

slide-2
SLIDE 2

Outline

  • What is computer vision?
  • State of the art
  • Why is this hard?
  • Course overview
  • Software
slide-3
SLIDE 3

Why study Computer Vision?

  • Images (and movies) have become ubiquitous in both production and consumption.
  • Therefore applications to manipulate images (movies) are becoming core.
  • As are systems that extract information from imagery:

  • Surveillance
  • Building 3D representations
  • Motion capture assisted
slide-4
SLIDE 4

Why study Computer Vision?

  • But most of all…

It is a really deep and cool set of problems!

slide-5
SLIDE 5

Every picture tells a story

The goal of computer vision is to write computer programs that can interpret images.

Steve Seitz

slide-6
SLIDE 6

Making sense of a picture

  • We want to extract meaning out of an image/sequence of images
  • This is different from image processing, which is mainly concerned with transforming images
  • Image processing operations such as blurring, thresholding etc. are often used as part of CV algorithms
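
The two operations named above can be sketched in a few lines. This is a minimal illustration in plain Python, not course code: the function names `box_blur` and `threshold`, the 5x5 test image, and the threshold level of 20 are all assumptions made for the example. In practice the course platform (Octave, or Python with NumPy and OpenCV) provides these operations directly.

```python
def box_blur(image):
    """3x3 mean filter; border pixels average over the neighbours that exist."""
    rows, cols = len(image), len(image[0])
    out = [[0.0] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            window = [
                image[rr][cc]
                for rr in range(max(r - 1, 0), min(r + 2, rows))
                for cc in range(max(c - 1, 0), min(c + 2, cols))
            ]
            out[r][c] = sum(window) / len(window)
    return out


def threshold(image, level):
    """Binary threshold: 1 where the pixel exceeds level, otherwise 0."""
    return [[1 if p > level else 0 for p in row] for row in image]


# Hypothetical 5x5 grayscale image: one bright pixel on a dark background.
image = [
    [0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0],
    [0, 0, 255, 0, 0],
    [0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0],
]

blurred = box_blur(image)      # the bright pixel spreads over its 3x3 neighbourhood
mask = threshold(blurred, 20)  # keep only the pixels the blur made bright enough
```

Both functions transform one image into another image. Neither says anything about what the bright blob is, which is the part that computer vision adds.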

slide-7
SLIDE 7

Making sense of a picture

  • Look at this scene carefully…
slide-8
SLIDE 8
slide-9
SLIDE 9

Making sense of a picture

  • What items could you identify? How did you recognize them?
  • What about other objects/spaces/time of day etc.?

slide-10
SLIDE 10

Current state of the art

  • Can computers match (or beat) human vision?
  • Yes and no (but mostly no!)
  • Humans are much better at “hard” things
  • Computers can be better at “easy” things
  • Though computers are getting really good at labeling using machine learning techniques (only a little on that in this course).

Steve Seitz

slide-11
SLIDE 11

Current state of the art

  • The next slides show some examples of what current vision systems can do

slide-12
SLIDE 12

Optical character recognition (OCR)

Handwritten Digit recognition

Technology to convert scanned docs to text

If you have a scanner, it probably came with OCR software

License plate readers

http://en.wikipedia.org/wiki/Automatic_number_plate_recognition

Steve Seitz

slide-13
SLIDE 13

Face detection and more…

  • Most digital cameras can detect faces…
slide-14
SLIDE 14

Face detection and more…

  • Some can detect blinking or smiling…

SONY “Smile Shutter”

slide-15
SLIDE 15

Face detection and more…

  • And some can even recognize you!
slide-16
SLIDE 16

Object recognition (in supermarkets)

  • Evolution Robotics Retail developed LaneHawk™, a retail loss-prevention solution that helps turn bottom-of-basket (BOB) losses and in-cart losses into profits in real time.
  • The company was acquired by Datalogic 5 years later!

slide-17
SLIDE 17

Object recognition (in mobile devices!)

slide-18
SLIDE 18

The Matrix movies, ESC Entertainment, XYZRGB, NRC

Special effects: shape capture

Steve Seitz

slide-19
SLIDE 19

Pirates of the Caribbean Industrial Light and Magic www.ilm.com

Special effects: motion capture

Steve Seitz

slide-20
SLIDE 20

Earth viewers (3D modeling)

Image from Microsoft’s Virtual Earth (see also: Google Earth)

Steve Seitz

slide-21
SLIDE 21

Smart cars

Mobileye

Slide content courtesy of Amnon Shashua

slide-22
SLIDE 22

Smart cars are here!

slide-23
SLIDE 23

Sports

Sportvision first down line

Steve Seitz

slide-24
SLIDE 24

Vision-based interaction (and games)

Nintendo Wii has camera-based IR tracking built in.

Steve Seitz

slide-25
SLIDE 25

But the game changer:

slide-26
SLIDE 26

Security and surveillance

slide-27
SLIDE 27

Medical imaging

Image guided surgery (Grimson et al., MIT)

3D imaging: MRI, CT

Steve Seitz

slide-28
SLIDE 28

Current state of the art

  • This is just a taste of the state of the art.
  • Some of these are less than 5 years old, most less than 10
  • This is a very active research area, and rapidly changing

  • Many new apps in the next 5 years
slide-29
SLIDE 29

Why is this hard?

slide-30
SLIDE 30

Simple scene right?

Dark square

Light square

Edward Adelson

slide-31
SLIDE 31

Really?

Edward Adelson

slide-32
SLIDE 32

Really!

Edward Adelson

slide-33
SLIDE 33

Vision is NOT Image Processing

  • In the previous example, the two squares have exactly the same measurement of intensity.
  • So, seeing is not the same as measuring properties in the image.
  • Rather, “seeing” is building a percept of what is in the world based upon the measurements made by an imaging sensor.
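
One way to see how two different squares can yield the same measurement is a simplified image-formation model in which measured intensity is surface reflectance times illumination. The model, the function name `measured_intensity`, and every number below are assumptions chosen for illustration; none of them come from the slides.

```python
def measured_intensity(reflectance, illumination):
    """Simplified model (an assumption): the sensor records reflectance * illumination."""
    return reflectance * illumination


# Hypothetical values: a dark surface in full light, a light surface in shadow.
dark_square_in_light = measured_intensity(reflectance=0.25, illumination=1.0)
light_square_in_shadow = measured_intensity(reflectance=0.5, illumination=0.5)

# The sensor cannot tell the two apart from the measurement alone.
same_measurement = dark_square_in_light == light_square_in_shadow
```

Under this sketch the measurement is identical for both squares, yet the surfaces differ. Recovering the reflectance requires reasoning about the illumination in the scene, which is the "building a percept" step rather than measurement.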

slide-34
SLIDE 34

Building models from change (1)

Michael Black

slide-35
SLIDE 35

Building models from change (1)

Left Image

Michael Black

slide-36
SLIDE 36

Building models from change (1)

Right Image

Michael Black

slide-37
SLIDE 37

Building models from change (2)

Dan Kersten

http://vision.psych.umn.edu/users/kersten/kersten-lab/shadows.html

slide-38
SLIDE 38

Building models from change (3)

Dan Kersten

http://vision.psych.umn.edu/users/kersten/kersten-lab/shadows.html

slide-39
SLIDE 39

Interpreting images

  • The previous example is one where the human system is again “wrong”: nothing is moving upwards, but that feels like the best interpretation.
  • Our goal is to develop your understanding of some of what it takes to go from image to interpretation.

slide-40
SLIDE 40

Course overview

slide-41
SLIDE 41

A little bit of pedagogy…

slide-42
SLIDE 42

A little bit of pedagogy…

Computational Models (Math!)

slide-43
SLIDE 43

Computational Models (Math!)

A little bit of pedagogy…

Algorithm

slide-44
SLIDE 44

Computational Models (Math!)

A little bit of pedagogy…

Real Images Scene Ground truth Algorithm

slide-45
SLIDE 45

Computational Models (Math!)

Real Images

Algorithm

A little bit of pedagogy…

Introduction to Computer Vision

slide-46
SLIDE 46

Topic outline

  1. INTRODUCTION
  2. IMAGE PROCESSING FOR COMPUTER VISION
  3. CAMERA MODELS AND VIEWS
  4. FEATURES AND MATCHING
  5. LIGHTNESS AND BRIGHTNESS
  6. IMAGE MOTION
  7. MOTION AND TRACKING
  8. CLASSIFICATION AND RECOGNITION
  9. MISCELLANEOUS OPERATIONS
  10. HUMAN VISION
slide-47
SLIDE 47

Problem sets

  • 8 problem sets (PS0 to PS7)
slide-48
SLIDE 48

Policies

  • Blackboard-level conversations OK, esp. on forums
  • Write your own code
  • Ask questions on forum first, then contact TA/instructor

slide-49
SLIDE 49

Exam

  • There will be a final exam.
  • It’s not hard: it is simply designed to require folks to go back over the slides (and text) and remember what we’ve learned.

slide-50
SLIDE 50

Grading

  • As a general rubric, 85% of the final grade is based upon the problem sets.

  • 15% is the final.
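
As a worked example of the 85/15 split, the sketch below combines a problem-set score and a final-exam score. It assumes both scores are on a 0 to 100 scale and that the problem sets are summarized by a single average; the slides state only the weights, so the function name `final_grade` and the sample scores are hypothetical.

```python
def final_grade(problem_set_avg, final_exam, ps_weight=85, exam_weight=15):
    """Weighted course grade; the weights are percentages that sum to 100."""
    return (ps_weight * problem_set_avg + exam_weight * final_exam) / 100


# Hypothetical student: 90 average on the problem sets, 80 on the final exam.
grade = final_grade(problem_set_avg=90, final_exam=80)  # (7650 + 1200) / 100 = 88.5
```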
slide-51
SLIDE 51

Software

  • Embedded programming exercises (in Octave)
  • Matlab/Octave: Primary platform for exercises, problem sets
  • Python + NumPy + OpenCV: You can submit your problem set solutions in Python, but there will be very limited support

slide-52
SLIDE 52

Learning goals

What do you expect to learn from this course?

  • Note down somewhere and track your progress.
  • In the end, you may not have learnt everything you expected.
  • At the same time, you may have learnt some things you did not know about at all.