Detecting Categories in News Video Using Image Features Slav - - PowerPoint PPT Presentation

detecting categories in news video using image features
SMART_READER_LITE
LIVE PREVIEW

Detecting Categories in News Video Using Image Features Slav - - PowerPoint PPT Presentation

Detecting Categories in News Video Using Image Features Slav Petrov, Arlo Faria, Pascal Michaillat, Alex Berg, Andreas Stolcke, Dan Klein, Jitendra Malik System Overview Images GB SVM Category correlation Sequential context Source Video


slide-1
SLIDE 1

Detecting Categories in News Video Using Image Features

Slav Petrov, Arlo Faria, Pascal Michaillat, Alex Berg, Andreas Stolcke, Dan Klein, Jitendra Malik

slide-2
SLIDE 2

System Overview

Video Category correlation Sequential context

Audio Images ASR TFIDF MFCC GB Feature extraction SVM GMM SVM Primary systems Source combination 1-best selection Higher-level systems

slide-3
SLIDE 3

Image Features in TrecVid ’05

  • IBM:
  • Color Histogram
  • Color Correlogram
  • Color Moments
  • CMU (local):
  • Color Histograms (in different color spaces)
  • Texture Histograms
  • Columbia (part based model):
  • Color
  • Texture
  • Tsinghua (local and global):
  • Color Auto-Correlograms
  • Color Coherence Vectors
  • Color Histograms
  • Co-occurence Texture
  • Wavelet Texture Grid
  • Edge Histogram Layout
  • Edge Histograms
  • Size
  • Spatial Relation
  • Color Moments
  • Edge Histograms
  • Wavelet Texture
slide-4
SLIDE 4

Image Features in TrecVid ’05

Shape

✓ ✓ ✓

Edge Histograms

Wavelets

✓ ✓ ✓

Histograms

Texture ✓ ✓

Correlograms

✓ ✓

Moments

✓ ✓ ✓ ✓

Histograms

Color

Berkeley Tsinghua Columbia CMU IBM

slide-5
SLIDE 5

Exemplars for Recognition

  • Use exemplars for recognition
  • Compare query image and each exemplar

using shape cues

Query Image Database of Exemplars

slide-6
SLIDE 6

Finding similar patches

Exemplar Query

slide-7
SLIDE 7

Compute sparse channels from image

Geometric Blur (Local Appearance Descriptor)

Geometric Blur Descriptor

~

Apply spatially varying blur and sub-sample Extract a patch in each channel

Idealized signal Descriptor is robust to small affine distortions

[Berg & Malik, CVPR’01]

slide-8
SLIDE 8

Horizontal Channel Vertical Channel Increasing Blur

GB in Practice

  • In practice compute discrete blur levels for whole

image and sample as needed for each feature location.

slide-9
SLIDE 9

Comparing Images

  • Sample 200 GB features from edge points
  • Dissimilarity from A to B is

where the Fx are the GB features.

[Berg, Berg & Malik, CVPR’05]

slide-10
SLIDE 10

Caltech 101 Dataset

  • Object Recognition Benchmark
  • 101 Categories:
  • Stereotypical pose
  • Little clutter
  • Objects centered
  • One object per image
slide-11
SLIDE 11

Caltech 101 Results

[Zhang, Berg, Maire & Malik, CVPR’06]

uses GB features

slide-12
SLIDE 12

Primal features for SVM

  • Compare to 50 prototypes from each class
  • Use distances as feature vector for an SVM

…… .. .. ..

Query Prototype s Featur e Vector

0.7 0.9

0.1

0.8

… ……….

0.7

slide-13
SLIDE 13

SVM features interpretation

  • Slices of the Kernel

Matrix:

  • Fixed-points in a

higher dimensional vector space:

q q ti ti tj tk tk tj

slide-14
SLIDE 14

SVM Specifics

  • SVMlight package
  • Same parameters for all categories:
  • Linear kernel
  • Default regularization parameter
  • Asymmetric cost doubling the weight of

positive examples

slide-15
SLIDE 15

Results ’06

Sports Meeting Computer- TV-Screen Car

mAP = 0.11

Results ’05 Berkeley-Shape mAP = 0.38 Best ’05 (IBM) mAP = 0.34

Best Berkeley-Shape Median

slide-16
SLIDE 16

Limitations

  • Several objects per image:
  • Features do not capture:
  • Different Scales
  • Color
slide-17
SLIDE 17

Conclusions

  • Shape is an important cue for object

recognition.

  • System that uses shape features only

can have competitive performance.

  • Shape features are orthogonal to

features used in the past.

slide-18
SLIDE 18

Thank You!

petrov@eecs.berkeley.edu