A Computer Vision Sampler COMPSCI 527 Today: Introduction to - - PowerPoint PPT Presentation

a computer vision sampler compsci 527
SMART_READER_LITE
LIVE PREVIEW

A Computer Vision Sampler COMPSCI 527 Today: Introduction to - - PowerPoint PPT Presentation

A Computer Vision Sampler COMPSCI 527 Today: Introduction to computer vision Course logistics Sameer Agarwal et al CACM 2011 Benfold and Reid CVPR 2011 Apple Computers 2010 Tian Lan et al ICCV 2013 [slide idea by Fei-Fei Li] Detect:


slide-1
SLIDE 1

A Computer Vision Sampler COMPSCI 527

Today:

  • Introduction to computer vision
  • Course logistics
slide-2
SLIDE 2
slide-3
SLIDE 3
slide-4
SLIDE 4
slide-5
SLIDE 5
slide-6
SLIDE 6

Sameer Agarwal et al CACM 2011

slide-7
SLIDE 7

Benfold and Reid CVPR 2011

slide-8
SLIDE 8

Apple Computers 2010

slide-9
SLIDE 9

Tian Lan et al ICCV 2013

slide-10
SLIDE 10

[slide idea by Fei-Fei Li]

slide-11
SLIDE 11
slide-12
SLIDE 12
  • Detect: Is there one or more instance of x in this

image?

  • Localize: Where are the instances?
  • Segment: Where are the boundaries of each instance?
  • Track: How is this instance moving from one image to

the next?

  • Recognize = classify: What is this?
  • Reconstruct: Given two or more 2D images of a scene,

compute a 3D model of it

slide-13
SLIDE 13

History of Recognition

  • Up to mid Nineties: Feature extraction and recognition algorithm engineered by hand
  • No data needed, intuition, lots of special-case programming
  • Mid Nineties to circa 2010: Feature extraction by hand, and recognition algorithm with

machine learning

  • With carefully designed features, recognition can learned from modest-size training

sets

  • Last decade: Entire pipeline by machine learning (deep convolutional neural networks)
  • Massive amounts of data needed to train the system

[Picture from www.driverseducationusa.com]

feature extraction

detection or recognition x

"person"

slide-14
SLIDE 14

History of Tracking

  • Up to circa 2000: Search for most similar window in second frame
  • Last decade: repeatedly detect object of interest through recognition

matching

(Δx, Δy)

feature extraction

detection x

"person A"

feature extraction

detection x

"person A"

slide-15
SLIDE 15

History of 3D Reconstruction

  • Still mostly geometry: Model the image formation

process, 3D -> 2D: images = f(3D shape, camera position)

  • Solve for 3D shape, camera position, given

images, 2D -> 3D

  • Starting to see first deep neural networks