

Accel: A Corrective Fusion Network for Efficient Semantic Segmentation on Video. Samvit Jain, Xin Wang, Joseph Gonzalez. RISE Lab, UC Berkeley.


  1. Accel: A Corrective Fusion Network for Efficient Semantic Segmentation on Video. Samvit Jain, Xin Wang, Joseph Gonzalez. RISE Lab, UC Berkeley

  2. Semantic segmentation [Figure panels: image classification, object detection, semantic segmentation]

  3. Evolution
     ● …
     ● Efficient Graph-Based Image Segmentation (2004)
     ● Fully Convolutional Networks for SS (2014)
     ● Multi-Scale Aggregation by Dilated Convolutions (2015)
     ● DeepLab-v2 (2016)
     ● PSPNet (2017)
     ● DeepLab-v3 (2017)

  4. Evolution (dataset: Pascal VOC 2012)
     ● Fully Convolutional Networks (2014): accuracy 62.2 mIoU, inference time 175 ms
     ● DeepLab-v3 (2017): accuracy 85.7 mIoU, inference time 750 ms
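The accuracy figures above are mean intersection-over-union (mIoU) scores, apparently reported as percentages. As a reminder of what the metric measures, here is a minimal sketch in plain Python. The function name `mean_iou` and the toy label maps are illustrative assumptions, and benchmark implementations typically accumulate intersections and unions over a whole dataset rather than a single image.

```python
def mean_iou(pred, target, num_classes):
    """Mean intersection-over-union across classes.

    pred and target are flat sequences of integer class labels, one per pixel.
    Classes that appear in neither sequence are skipped.
    """
    ious = []
    for c in range(num_classes):
        inter = sum(1 for p, t in zip(pred, target) if p == c and t == c)
        union = sum(1 for p, t in zip(pred, target) if p == c or t == c)
        if union > 0:
            ious.append(inter / union)
    return sum(ious) / len(ious) if ious else 0.0


# Toy 2x3 label maps, flattened row by row (hypothetical values).
target = [0, 0, 1, 1, 2, 2]
pred = [0, 1, 1, 1, 2, 0]
score = mean_iou(pred, target, num_classes=3)
print(f"mIoU = {score:.3f}")
```

In this toy case the per-class IoUs are 1/3, 2/3 and 1/2, so the mean is 0.5.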

  5. Motivation
     ● Image models don’t translate to video
       ○ High frame rates (e.g. 30 fps)
       ○ High resolution (e.g. full HD, 1920 x 1080)
       ○ Scene complexity (e.g. ego motion, urban streets)
     [Figure: Cityscapes dataset, Frankfurt]
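To make the frame-rate point concrete, the short calculation below compares the per-image inference times quoted for Pascal VOC 2012 with the time budget of a 30 fps stream. The helper name `realtime_gap` is an assumption for illustration. The quoted times were not measured on full-HD video frames, so this is only a rough indication of the gap.

```python
def realtime_gap(inference_ms, target_fps):
    """Return (achievable fps, factor by which inference exceeds the frame budget)."""
    achievable_fps = 1000.0 / inference_ms
    over_budget = inference_ms * target_fps / 1000.0
    return achievable_fps, over_budget


# Inference times from the Pascal VOC 2012 comparison; 30 fps from the motivation slide.
models = {"Fully Convolutional Networks (2014)": 175, "DeepLab-v3 (2017)": 750}
for name, ms in models.items():
    fps, factor = realtime_gap(ms, target_fps=30)
    print(f"{name}: {fps:.1f} fps achievable, {factor:.2f}x the 30 fps frame budget")
```

At 30 fps each frame has a budget of about 33 ms, so a 750 ms model runs at roughly 1.3 fps, about 22.5 times over budget.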

  6. Deep Feature Flow
     ● Idea: run the feature network on keyframes, then warp those features to the intermediate frames
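The keyframe idea can be sketched in a few lines of plain Python. This is an illustration under stated assumptions, not the authors' implementation: `feature_net`, `flow_net`, `task_net` and `keyframe_interval` are hypothetical placeholders, the feature map has a single channel, and the flow is assumed to point from each current-frame pixel to its source location in the keyframe.

```python
def bilinear_warp(feat, flow):
    """Warp a 2-D feature map with a per-pixel flow field.

    feat: H x W nested list of floats (keyframe features, one channel).
    flow: H x W nested list of (dx, dy); output pixel (y, x) samples feat
          at (y + dy, x + dx), clamped to the border.
    """
    h, w = len(feat), len(feat[0])
    out = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            dx, dy = flow[y][x]
            sx = min(max(x + dx, 0.0), w - 1.0)
            sy = min(max(y + dy, 0.0), h - 1.0)
            x0, y0 = int(sx), int(sy)
            x1, y1 = min(x0 + 1, w - 1), min(y0 + 1, h - 1)
            ax, ay = sx - x0, sy - y0
            top = (1 - ax) * feat[y0][x0] + ax * feat[y0][x1]
            bottom = (1 - ax) * feat[y1][x0] + ax * feat[y1][x1]
            out[y][x] = (1 - ay) * top + ay * bottom
    return out


def segment_video(frames, feature_net, flow_net, task_net, keyframe_interval):
    """Run feature_net only on keyframes; warp its output to the other frames."""
    outputs = []
    key_frame, key_feat = None, None
    for i, frame in enumerate(frames):
        if i % keyframe_interval == 0:
            key_frame, key_feat = frame, feature_net(frame)  # expensive path
            feat = key_feat
        else:
            flow = flow_net(key_frame, frame)  # assumed cheaper than feature_net
            feat = bilinear_warp(key_feat, flow)
        outputs.append(task_net(feat))
    return outputs


# Toy check: shifting every sample one pixel to the right (hypothetical values).
features = [[1.0, 2.0, 3.0], [4.0, 5.0, 6.0]]
shift_right = [[(1.0, 0.0)] * 3 for _ in range(2)]
warped = bilinear_warp(features, shift_right)
```

The saving comes from calling the expensive feature network once per keyframe interval instead of once per frame.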

  7. Problems
     ● Accuracy degradation
       ○ Warping with a flow field is a coarse operation
       ○ Non-translational temporal change (e.g. new objects, occlusions, lighting) is ignored
     [Figure panels: (a) k, (b) k+2, (c) k+4, (d) k+6]

  8. Accel
     [Figure: Accel architecture. Reference branch: keyframe I_k, N_R^feat (ResNet-101), optical flow, warp W, N_R^task. Update branch: current frame I_{k+i}, N_U^feat (ResNet-{18,34,51,101}), N_U^task. Score fusion SF, segmentation S_{k+i}.]
     Accel: a family of corrective, two-stream fusion networks combining:
     (1) N_R (reference branch): optical flow-based keyframe feature warping
     (2) N_U (update branch): per-frame correction with residual segmentation network
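The slide names a score fusion step (SF) that combines the reference and update branches but does not spell out its form. One plausible reading, sketched below in plain Python, concatenates the two per-pixel score maps along the channel axis and applies a 1x1 convolution. The 1x1 convolution form, the function names, and the hand-set averaging weights are all assumptions for illustration. In a trained network the weights would be learned.

```python
def fuse_scores(ref_scores, upd_scores, weights, bias):
    """Fuse two per-pixel score maps with a 1x1 convolution (assumed form).

    ref_scores, upd_scores: C x H x W nested lists of class scores.
    weights: C x 2C matrix; bias: list of length C.
    """
    c = len(ref_scores)
    h, w = len(ref_scores[0]), len(ref_scores[0][0])
    stacked = ref_scores + upd_scores  # channel-wise concatenation: 2C x H x W
    fused = [[[0.0] * w for _ in range(h)] for _ in range(c)]
    for k in range(c):
        for y in range(h):
            for x in range(w):
                fused[k][y][x] = bias[k] + sum(
                    weights[k][j] * stacked[j][y][x] for j in range(2 * c)
                )
    return fused


def argmax_labels(scores):
    """Pick the highest-scoring class at every pixel."""
    c = len(scores)
    h, w = len(scores[0]), len(scores[0][0])
    return [
        [max(range(c), key=lambda k: scores[k][y][x]) for x in range(w)]
        for y in range(h)
    ]


# Toy example: 2 classes, a 1x2 image (hypothetical values).
ref = [[[2.0, 0.0]], [[0.0, 1.0]]]   # warped keyframe scores (reference branch)
upd = [[[0.0, 3.0]], [[1.0, 0.0]]]   # current-frame scores (update branch)
avg_weights = [[0.5, 0.0, 0.5, 0.0], [0.0, 0.5, 0.0, 0.5]]  # plain averaging
fused = fuse_scores(ref, upd, avg_weights, bias=[0.0, 0.0])
labels = argmax_labels(fused)
```

Here the two branches disagree at both pixels when taken alone, and the fused scores settle the disagreement per pixel, which is the corrective behaviour the slide describes.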

  9. Accel
     N_R^feat (reference branch)   N_U^feat (update branch)   N_R + N_U (full network)
     ResNet-101                    ResNet-18                  Accel-18
     ResNet-101                    ResNet-34                  Accel-34
     ResNet-101                    ResNet-51                  Accel-51
     ResNet-101                    ResNet-101                 Accel-101

  10. Results: accuracy (mIoU) vs. inference time (s/frame) [Figure panels: Cityscapes, CamVid]

  11. Results: accuracy (mIoU) vs. keyframe interval [Figure]

  12. Visualizations [Figure panels: DFF (reference branch), DeepLab-18 (update branch), Accel-18]

  13. Thank you!
     Accel: A Corrective Fusion Network for Efficient Semantic Segmentation on Video. S. Jain, X. Wang, J. Gonzalez. In: CVPR 2019 (oral). https://arxiv.org/abs/1807.06667
