Rotational Rectification Network (R2N): Enabling Pedestrian - - PowerPoint PPT Presentation

rotational rectification network r2n enabling pedestrian
SMART_READER_LITE
LIVE PREVIEW

Rotational Rectification Network (R2N): Enabling Pedestrian - - PowerPoint PPT Presentation

Rotational Rectification Network (R2N): Enabling Pedestrian Detection for Mobile Vision Xinshuo Weng 1 , Shangxuan Wu 1 , Fares Beainy 2 , Kris M. Kitani 1 1 Carnegie Mellon University, 2 Volvo Construction Equipment WACV 2018, Lake Tahoe


slide-1
SLIDE 1

Rotational Rectification Network (R2N): Enabling Pedestrian Detection for Mobile Vision

Xinshuo Weng1, Shangxuan Wu1, Fares Beainy2, Kris M. Kitani1

1Carnegie Mellon University, 2Volvo Construction Equipment

WACV 2018, Lake Tahoe

slide-2
SLIDE 2

Pedestrian Detection

slide-3
SLIDE 3

Pedestrian Detection

  • Results on Caltech dataset

Zhang et al. Is Faster R-CNN Doing Well for Pedestrian Detection? ECCV, 2016.

slide-4
SLIDE 4

Arbitrary-Oriented Pedestrian Detection

slide-5
SLIDE 5

Arbitrary-Oriented Pedestrian Detection

slide-6
SLIDE 6

Arbitrary-Oriented Pedestrian Detection

  • Random failure cases on Caltech dataset.
slide-7
SLIDE 7

Why is it interesting?

Imagine the cases:

  • Mobile phones
slide-8
SLIDE 8

Why is it interesting?

Imagine the cases:

  • Mobile phones
  • UAVs/drones
slide-9
SLIDE 9

Why is it interesting?

Imagine the cases:

  • Mobile phones
  • UAVs/drones
  • Construction vehicles on a

rugged terrain

slide-10
SLIDE 10

Why is it interesting?

Imagine the cases:

  • Mobile phones
  • UAVs/drones
  • Construction vehicles on a

rugged terrain

  • Wearable cameras
  • ….
slide-11
SLIDE 11

Why is it interesting?

Imagine the cases:

  • Mobile phones
  • UAVs/drones
  • Construction vehicles on a rugged terrain
  • Wearable cameras
  • ….

Camera orientation can be very flexible with respect to the ground in the real world.

slide-12
SLIDE 12

Modeling Rotation Invariance or Equivariance

slide-13
SLIDE 13

Modelling Rotation Invariance/Equivariance

Rotating the inputs

  • Data augmentation
  • TI-Pooling [Laptev et al

CVPR’ 16]

  • ….
  • Cons:

○ Low efficiency ○ More parameters

Rotating the filters Changing sampling grids

slide-14
SLIDE 14

Modelling Rotation Invariance/Equivariance

Rotating the inputs

  • Data augmentation
  • TI-Pooling [Laptev et al,

CVPR’ 16]

  • ….
  • Cons:

○ Low efficiency ○ More parameters

Rotating the filters

  • RotEqNet [Marcos et al,

ICCV’ 17]

  • ORNs [Zhou et al, CVPR’

17]

  • ….
  • Cons:

○ Approximated rotations ○ Memory issues

Changing sampling grids

slide-15
SLIDE 15

Modelling Rotation Invariance/Equivariance

Rotating the inputs

  • Data augmentation
  • TI-Pooling [Laptev et al,

CVPR’ 16]

  • ….
  • Cons:

○ Low efficiency ○ More parameters

Rotating the filters

  • RotEqNet [Marcos et al,

ICCV’ 17]

  • ORNs [Zhou et al, CVPR’

17]

  • ….
  • Cons:

○ Approximated rotations ○ Memory issues

Changing sampling grids

  • Spatial Transformer

[Jaderberg et al, NIPS’ 15]

  • Deformable ConvNets [Dai

et al, ICCV’ 17]

  • GPPooling (Ours)
  • ….
slide-16
SLIDE 16

Global Polar Pooling (GPPooling)

Inputs Activations

slide-17
SLIDE 17

GPPooling vs Pooling

GPPooling Pooling

Noh et al. Learning Deconvolution Network for Semantic Segmentation? ICCV, 2015.

slide-18
SLIDE 18

What is Rotational Rectification Network (R2N)?

R2N = Rotation Estimation Module (including GPPooling) + Spatial Transformer

slide-19
SLIDE 19

Results

slide-20
SLIDE 20

Take Home Messages

  • GPPooling can be used to model global rotation equivariance/invariance in

general CNNs.

  • R2N is easy to plug in and improves the performance on oriented detection

without bells and whistles.