Person re-identification by Local Maximal Occurrence representation - - PowerPoint PPT Presentation

person re identification by local maximal occurrence
SMART_READER_LITE
LIVE PREVIEW

Person re-identification by Local Maximal Occurrence representation - - PowerPoint PPT Presentation

Person re-identification by Local Maximal Occurrence representation and metric learning Liao Shengcai, Hu Yang, Zhu Xiangyu, Li Stan Z. Experiment Presenter: Zhenpei Yang Person Re-identification: Given an image of a person from one camera,


slide-1
SLIDE 1

Person re-identification by Local Maximal Occurrence representation and metric learning

Experiment Presenter: Zhenpei Yang

Liao Shengcai, Hu Yang, Zhu Xiangyu, Li Stan Z.

slide-2
SLIDE 2

Person Re-identification: Given an image of a person from one camera, identifying the person from images taken from different cameras

Slides credit: liangzheng

slide-3
SLIDE 3

Person re-identification is a challenging problem because:

  • Big Intra-class variance due to pose, viewpoint, illumination change.
  • Need a proper metric to compute cross-class distance.

Contribution

  • Extract good features Local Occurrence Maximum (LOMO)
  • Use good distance metric Cross-view Quadratic Discriminant Analysis (XQDA)
slide-4
SLIDE 4

A B Examine Images

About distance metric

Which image is more likely correspond to image Q? A

  • r B?

Q Query Image

Model the distribution for intra-class distance and extra-class distance!

slide-5
SLIDE 5

Discriminative model

Linear Discriminant Analysis (LDA) Quadratic Discriminant Analysis (QDA) Cross-view Quadratic Discriminant Analysis (XQDA)

Intuition: model the covariance for Intra-class distance and extra-class distance respectively using gaussian

slide-6
SLIDE 6

Intuition: Original feature space is too high dimension. Maybe it’s helpful to consider the problem in subspace

Hard to measure precisely in high dimension space Measure this in subspace!

?

Which subspace What about PCA

Cross-view Quadratic Discriminant Analysis (XQDA)

?

slide-7
SLIDE 7

Cross-view Quadratic Discriminant Analysis (XQDA)

The two distribution for intra-class and extra-class distance both have zero means

slide-8
SLIDE 8

The QXDA chose subspace that maximize the two classes’ variance ratio

PCA QXDA

slide-9
SLIDE 9

Viewpoint Invariance Analysis

  • Video taken by hand-hold camera
  • #Total 23 seconds/705 frames(48*128)
  • 0-360 degree view

Slides credit: my roomate

slide-10
SLIDE 10

Viewpoint Invariance Analysis

discance? discance? discance?

Choose feature Extract features on each frame Learn a distance metric d( ) using XQDA Measure the d(f_t, f_1)

slide-11
SLIDE 11

Investigated Features

  • Local Maximum Occurrence

(LOMO)

  • LOMO without Maximum Operator
  • Convolutional Neural Network

Feature (CNN)

slide-12
SLIDE 12

Distance Metric

  • Quadratic Discriminant Analysis

(XQDA)

  • Cosine Similarity
slide-13
SLIDE 13
  • The max operation in LOMO makes it more robust to viewpoint change
  • XQDA can learn more robust metric against viewpoint variation

XQDA similarity measure Cosine similarity measure

slide-14
SLIDE 14

Which region contribute mostly?

  • Conduct training on

four different body parts

  • Compute the matching

performance using each body parts

slide-15
SLIDE 15

The upper body is the most distinguishable part

slide-16
SLIDE 16

Sensitivity to Occlusion

Parameter: the size of

  • cclusion area
slide-17
SLIDE 17

The performance degrades monotonous as occlusion become more severe

slide-18
SLIDE 18

1st rank accuracy degrades monotonous as occlusion become more severe

slide-19
SLIDE 19

Conclusion

  • XQDA find the subspace that maximize the covariance odds of

intra-class and extra-class distance.

  • Doesn’t robust to occlusion.
  • LOMO feature has some viewpoint invariance due to the max operation.
  • XQDA can learn more robust metric against viewpoint variation
  • Upper body is the most distinct part for person-reidentification