rotational rectification network r2n enabling pedestrian
play

Rotational Rectification Network (R2N): Enabling Pedestrian - PowerPoint PPT Presentation

Rotational Rectification Network (R2N): Enabling Pedestrian Detection for Mobile Vision Xinshuo Weng 1 , Shangxuan Wu 1 , Fares Beainy 2 , Kris M. Kitani 1 1 Carnegie Mellon University, 2 Volvo Construction Equipment WACV 2018, Lake Tahoe


  1. Rotational Rectification Network (R2N): Enabling Pedestrian Detection for Mobile Vision Xinshuo Weng 1 , Shangxuan Wu 1 , Fares Beainy 2 , Kris M. Kitani 1 1 Carnegie Mellon University, 2 Volvo Construction Equipment WACV 2018, Lake Tahoe

  2. Pedestrian Detection

  3. Pedestrian Detection ● Results on Caltech dataset Zhang et al. Is Faster R-CNN Doing Well for Pedestrian Detection? ECCV , 2016.

  4. Arbitrary-Oriented Pedestrian Detection

  5. Arbitrary-Oriented Pedestrian Detection

  6. Arbitrary-Oriented Pedestrian Detection ● Random failure cases on Caltech dataset.

  7. Why is it interesting? Imagine the cases: ● Mobile phones

  8. Why is it interesting? Imagine the cases: ● Mobile phones ● UAVs/drones

  9. Why is it interesting? Imagine the cases: ● Mobile phones ● UAVs/drones ● Construction vehicles on a rugged terrain

  10. Why is it interesting? Imagine the cases: ● Mobile phones ● UAVs/drones ● Construction vehicles on a rugged terrain ● Wearable cameras ● ….

  11. Why is it interesting? Imagine the cases: ● Mobile phones ● UAVs/drones ● Construction vehicles on a rugged terrain ● Wearable cameras ● …. Camera orientation can be very flexible with respect to the ground in the real world.

  12. Modeling Rotation Invariance or Equivariance

  13. Modelling Rotation Invariance/Equivariance Rotating the inputs Rotating the filters Changing sampling grids ● Data augmentation ● TI-Pooling [Laptev et al CVPR’ 16] ● …. ● Cons: ○ Low efficiency ○ More parameters

  14. Modelling Rotation Invariance/Equivariance Rotating the inputs Rotating the filters Changing sampling grids ● ● Data augmentation RotEqNet [Marcos et al, ● ICCV’ 17] TI-Pooling [Laptev et al, CVPR’ 16] ● ORNs [Zhou et al, CVPR’ ● …. 17] ● …. ● ● Cons: Cons: ○ Approximated ○ Low efficiency rotations ○ More parameters ○ Memory issues

  15. Modelling Rotation Invariance/Equivariance Rotating the inputs Rotating the filters Changing sampling grids ● ● ● Data augmentation RotEqNet [Marcos et al, Spatial Transformer ● ICCV’ 17] [Jaderberg et al, NIPS’ 15] TI-Pooling [Laptev et al, CVPR’ 16] ● ORNs [Zhou et al, CVPR’ ● Deformable ConvNets [Dai ● …. et al, ICCV’ 17] 17] ● …. ● GPPooling (Ours) ● …. ● ● Cons: Cons: ○ Approximated ○ Low efficiency rotations ○ More parameters ○ Memory issues

  16. Global Polar Pooling (GPPooling) Inputs Activations

  17. GPPooling vs Pooling GPPooling Pooling Noh et al. Learning Deconvolution Network for Semantic Segmentation? ICCV , 2015.

  18. What is Rotational Rectification Network (R2N)? R2N = Rotation Estimation Module (including GPPooling) + Spatial Transformer

  19. Results

  20. Take Home Messages ● GPPooling can be used to model global rotation equivariance/invariance in general CNNs. ● R2N is easy to plug in and improves the performance on oriented detection without bells and whistles.

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend