Machine Learning Competition Aditya Khosla Image by - - PowerPoint PPT Presentation

machine learning competition
SMART_READER_LITE
LIVE PREVIEW

Machine Learning Competition Aditya Khosla Image by - - PowerPoint PPT Presentation

6.S093 Visual Recognition through Machine Learning Competition Aditya Khosla Image by kirkh.deviantart.com Todays class Part 1: Competition details Part 2: Image representation lecture Bag-of-words Spatial pyramid Part 3:


slide-1
SLIDE 1

6.S093 Visual Recognition through Machine Learning Competition

Image by kirkh.deviantart.com

Aditya Khosla

slide-2
SLIDE 2

Today’s class

  • Part 1: Competition details
  • Part 2: Image representation lecture

– Bag-of-words – Spatial pyramid

  • Part 3: Feature extraction tutorial
slide-3
SLIDE 3

Competition details: dataset

person

10 object categories

airplane bicycle car cup/mug dog(s) guitar hamburger sofa traffic light

slide-4
SLIDE 4

Competition details: dataset Training set

8,000 images

Validation set

2,000 images

Testing set

5,000 images

labels provided NO labels provided

Leaderboard set

slide-5
SLIDE 5

Competition details: submission

  • For each image, you provide the probability of every

class belonging in it (as returned by your algorithm) airplane bicycle car cup dog guitar hamburger sofa traffic light person 1

slide-6
SLIDE 6

Competition details: evaluation

  • Average precision
slide-7
SLIDE 7

Competition details: prizes Cash

first

+ cash

second third

+ cash

slide-8
SLIDE 8

Competition details: thank you!

slide-9
SLIDE 9

Image representation: bag-of-words

slide-10
SLIDE 10

Document representation: bag-of-words

  • Order-less document representation: frequencies
  • f words from a dictionary Salton & McGill (1983)
slide-11
SLIDE 11

Document representation: bag-of-words

  • Order-less document representation: frequencies
  • f words from a dictionary Salton & McGill (1983)

US Presidential Speeches Tag Cloud

slide-12
SLIDE 12

Document representation: bag-of-words

  • Order-less document representation: frequencies
  • f words from a dictionary Salton & McGill (1983)

US Presidential Speeches Tag Cloud

slide-13
SLIDE 13

Document representation: bag-of-words

  • Order-less document representation: frequencies
  • f words from a dictionary Salton & McGill (1983)

US Presidential Speeches Tag Cloud

slide-14
SLIDE 14

Image representation: bag-of-words

document bag-of-words

slide-15
SLIDE 15

Image representation: bag-of-words

document bag-of-words image bag-of-visual words

slide-16
SLIDE 16

Object Bag of ‘words’

slide-17
SLIDE 17

Object Ugly bag of ‘words’

slide-18
SLIDE 18

Object Stylish bag of ‘words’

slide-19
SLIDE 19

Object Stylish bag of ‘words’

slide-20
SLIDE 20

visual dictionary

slide-21
SLIDE 21

Image representation: bag-of-words

  • 1. Extract descriptors
slide-22
SLIDE 22

Image representation: bag-of-words

  • 1. Extract descriptors
  • 2. Learn “visual dictionary”
slide-23
SLIDE 23

Image representation: bag-of-words

  • 1. Extract descriptors
  • 2. Learn “visual dictionary”
  • 3. Quantize features using visual vocabulary
slide-24
SLIDE 24

Image representation: bag-of-words

  • 1. Extract descriptors
  • 2. Learn “visual dictionary”
  • 3. Quantize features using visual vocabulary
slide-25
SLIDE 25

Image representation: bag-of-words

  • 1. Extract descriptors
  • 2. Learn “visual dictionary”
  • 3. Quantize features using visual vocabulary
  • 4. Represent images by frequencies of “visual words”
slide-26
SLIDE 26
  • 1. Extracting descriptors

regular grid interest points

slide-27
SLIDE 27

Image representation: yesterday

gradient magnitude gradient orientation feature vector

slide-28
SLIDE 28

Image representation: yesterday

gradient magnitude gradient orientation descriptor

slide-29
SLIDE 29
  • 2. Learning “visual dictionary”

Compute descriptor

slide-30
SLIDE 30
  • 2. Learning “visual dictionary”

descriptors

slide-31
SLIDE 31
  • 2. Learning visual dictionary

descriptors

slide-32
SLIDE 32
  • 2. Learning visual dictionary

descriptors

Clustering

slide-33
SLIDE 33
  • 2. Learning visual dictionary

descriptors

Clustering visual vocabulary

slide-34
SLIDE 34

Example visual vocabulary

Fei-Fei et al. 2005

slide-35
SLIDE 35

Image patch examples

Sivic et al. 2005

slide-36
SLIDE 36

Image patch examples

Sivic et al. 2005

How to choose the vocabulary size?

slide-37
SLIDE 37

Bag-of-words: limitations

  • What about the structure of the image?

=?

slide-38
SLIDE 38

Image representation: spatial pyramids

level 0

slide-39
SLIDE 39

Image representation: spatial pyramids

level 0 level 1

slide-40
SLIDE 40

Image representation: spatial pyramids

level 0 level 1 level 2

slide-41
SLIDE 41

Tutorial