HISTOGRAMS OF Rafael ORIENTED GRADIENTS Cosman Tao Wang FOR - - PowerPoint PPT Presentation

histograms of
SMART_READER_LITE
LIVE PREVIEW

HISTOGRAMS OF Rafael ORIENTED GRADIENTS Cosman Tao Wang FOR - - PowerPoint PPT Presentation

HISTOGRAMS OF Rafael ORIENTED GRADIENTS Cosman Tao Wang FOR HUMAN DETECTION (NAVNEET DALAL AND BILL TRIGGS) HUMAN DETECTION TRAINING SVM Descriptor HOG/SIFT/SURF Class Person (x 0 , x 1 x n ) SVM (x 0 , x 1 x n ) Class


slide-1
SLIDE 1

Rafael Cosman Tao Wang

HISTOGRAMS OF ORIENTED GRADIENTS FOR HUMAN DETECTION

(NAVNEET DALAL AND BILL TRIGGS)

slide-2
SLIDE 2

HUMAN DETECTION

slide-3
SLIDE 3
slide-4
SLIDE 4
slide-5
SLIDE 5

TRAINING SVM

HOG/SIFT/SURF… (x0, x1 … xn) SVM (x0, x1 … xn) Descriptor Class “Person” Class “No Person”

slide-6
SLIDE 6

TESTING: SCANNING WINDOW

slide-7
SLIDE 7

TESTING SVM

(x0, x1 … xn) SVM “Person” “No Person” Result

slide-8
SLIDE 8

¡ Very simple to implement ¡ Performs as well or better than many descriptors ¡ Cited over 5000 times

§ Basis for Deformable Parts Model

MOTIVATION

slide-9
SLIDE 9

Image

Gradients Cells Blocks

Descriptor

(x0, x1 … xn) HOG

slide-10
SLIDE 10

Image

Gradients Cells Blocks

Descriptor

¡ Convolve the image with discrete derivative mask

§ [-1, 0, 1] § [-1, 0, 1]T

Original Image X gradient Y gradient

slide-11
SLIDE 11

Image

Gradients Cells Blocks

Descriptor

¡ Now we count up the gradient angles in 8x8 cells

§ Vote weight = magnitude = √⁠​𝑒𝑦↑2 +𝑒​𝑧↑2 § Who you vote for ~ angle = arctan​(​𝑒𝑧/𝑒𝑦 )

slide-12
SLIDE 12

Image

Gradients Cells Blocks

Descriptor

Normalization Concatenation Normalization Overlapping blocks yields better results! (Overcomplete features)

slide-13
SLIDE 13

Image

Gradients Cells Blocks

Descriptor

VARIATIONS

Circular cells Circular Blocks Alternate masks Gamma normalization Gaussian smoothing Unsigned binning Block size Cell size Normalization

slide-14
SLIDE 14

EVALUATION METRIC

¡ Detection task, so a single accuracy value does not make sense

§ Low threshold -- Low miss rate, but many false positives § High threshold – Few false positives, but misses a lot

¡ Detection E Error T Tradeoff ( (DET) Curves

§ Vertical Axis: Miss Rate = 1-Recall = (1 – True Pos / Total Pos in GT) § Horizontal Axis: False Positive Per Window (FPPW)

slide-15
SLIDE 15

Image

Gradients Cells Blocks

Descriptor

¡ Using Grayscale/Color

§ Compute gradients in all color channels, pick the highest one § Using color gives slightly better results

¡ Gamma/Color Normalization

§ As a preprocessing step § Has little effect on results

slide-16
SLIDE 16

Image

Gradients Cells Blocks

Descriptor

¡ Gaussian smoothing

§ Reduces performance § Fine-grained edge detection is crucial

raw image σ>0 σ=0

slide-17
SLIDE 17

Image

Gradients Cells Blocks

Descriptor

¡ Choice of masks

§ 1-D centered: [-1, 0, 1], [-1, 0, 1]T § 1-D uncentered: [-1, 1], [-1, 1]T § 1-D Cubic corrected: [1, -8, 0, 8, -1], [1, -8, 0, 8, -1]T § 2-D Sobel mask: [█□−1&0&1@−2&0&2@−1&0&1 ],​[█□−1&0&1@ −2&0&2@−1&0&1 ]↑𝑈

¡ 1-D centered [-1, 0, 1], [-1, 0, 1]T with no Gaussian smoothing works best

slide-18
SLIDE 18

Image

Gradients Cells Blocks

Descriptor

¡ Rectangular vs Circular cells

slide-19
SLIDE 19

Image

Gradients Cells Blocks

Descriptor

¡ Bilinear interpolation to reduce aliasing ¡ Across both orientations and locations (weighted by distance in pixels)

slide-20
SLIDE 20

Image

Gradients Cells Blocks

Descriptor

¡ # of orientation bins

§ Increasing orientation bins from 4 to 9 decreases false positives by 10 times ¡ Unsigne Unsigned ce d cells lls § 0-180 degrees instead

  • f 0-360 degrees

§ Actually improves performance slightly!

§ Why?

slide-21
SLIDE 21

Image

Gradients Cells Blocks

Descriptor

¡ Motivation

§ Gradient magnitude lacks invariance to changes in illumination and foreground background contrast.

¡ Need local contrast normalization to find the “true” weight of an edge

slide-22
SLIDE 22

Image

Gradients Cells Blocks

Descriptor

R-HOG

Normalization Concatenation Normalization Overlapping blocks yields better results! (Overcomplete features)

slide-23
SLIDE 23

Image

Gradients Cells Blocks

Descriptor

R-HOG versus C-HOG

slide-24
SLIDE 24

Image

Gradients Cells Blocks

Descriptor

¡ Normalizations

§ L2-norm § L2-hys: L2-norm followed by clipping (limiting the maximum values of v to 0.2) and renormalizing § L1-norm: § L1-sqrt:

slide-25
SLIDE 25

Image

Gradients Cells Blocks

Descriptor

¡ Best cell & block sizes:

§ Cell size of 6x6 § Block size of 3x3

slide-26
SLIDE 26

VISUALIZATION AND INSIGHTS

  • a. Average gradient over positive examples
  • b. Maximum positive SMV weight in each block
  • c. Maximum negative SMV weight in each block
  • d. A test image
  • e. It’s R-HOG descriptor

f. R-HOG descriptor weighted by positive SVM weights

  • g. R-HOG descriptor weighted by negative SVM weights
slide-27
SLIDE 27

HOG COMPARED TO OTHERS

slide-28
SLIDE 28

Image

Gradients Cells Blocks

Descriptor

BEST SETUP

Overlapping blocks 1-D centered mask Use color if possible No Gaussian smoothing Unsigned binning Block size 3x3 Cell size 6x6 L-2 Normalization

slide-29
SLIDE 29

RULES OF THUMB

¡ Abrupt edges at fine scales are essential ¡ No blurring ¡ Local contrast normalization is essential ¡ Overlapping blocks w/ “redundant” information improves results significantly. ¡ Fine orientation quantization is more important than fine spatial orientation