ContextNet: Exploring Context and Detail for Semantic Segmentation

ContextNet: Exploring Context and Detail for Semantic Segmentation in Real-time
Rudra PK Poudel, Ujwal Bonde, Stephan Liwicki, Christopher Zach
Computer Vision Group, Toshiba Research Europe
TRE 2018
R. Poudel et al. (CVG) ContextNet: Exploring Context and Detail. . . TRE 2018 1 / 17

Real-time Semantic Image Segmentation
- Real-time perception is critical for autonomous systems
- What am I seeing and where is it?

Motivation
- Problem: SOTA models are accurate but not real-time
- Observations:
  - Deeper models improve accuracy (He et al., 2015)
  - Multi-scale information fusion is beneficial (Burt and Adelson, 1987)
- Downside: increased cost
  - Floating point ops
  - Memory usage
  - Power consumption
- Hypothesis: efficient semantic segmentation based on what (global context) and where (spatial detail)
- Aim: real-time system for low-resource (embedded) devices

Proposed Model: Overview
- Context branch at low resolution captures global context information
- Detail branch focuses on high resolution segmentation details
(Figure: ContextNet)

Proposed Model: Context Branch
- Context branch at low resolution captures global context information
- Deep Network for Context
  - No need for high resolution images to know what is there
  - Lower resolution input reduces the computational cost
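The cost argument can be made concrete with a rough count. This is a minimal sketch, assuming a single 3x3 convolution layer with made-up channel counts and a context branch fed at one quarter of the input resolution; the slide does not state the downscaling factor, and `conv_macs` is an illustrative helper rather than part of ContextNet.

```python
def conv_macs(height, width, kernel, c_in, c_out):
    """Multiply-accumulates for one stride-1, same-padded convolution layer."""
    return height * width * kernel * kernel * c_in * c_out

# Full-resolution input size used in the slides' evaluation (1024x2048).
full = conv_macs(1024, 2048, 3, 32, 32)

# ASSUMPTION: the context branch sees the image at 1/4 resolution per side.
scale = 4
low = conv_macs(1024 // scale, 2048 // scale, 3, 32, 32)

print(f"full resolution:    {full:,} MACs")
print(f"quarter resolution: {low:,} MACs")
print(f"reduction factor:   {full / low:.0f}x")
```

Because the cost of a convolution grows with the number of pixels, shrinking each side by a factor s cuts the per-layer cost by s squared, which is what leaves room for a deeper network in the context branch.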

Proposed Model: Detail Branch
- Detail branch focuses on high resolution segmentation details
- Shallow Network for Spatial Detail
  - No need for a very deep network to detect segmentation boundaries

Proposed Model: Combined Branches
- Context branch at low resolution captures global context information
- Detail branch focuses on high resolution segmentation details
- Losses at the context and detail branches help to learn auxiliary tasks
- Learning global context and spatial detail separately reduces cost

Proposed Model: Qualitative Validation
(Figure panels: Input image; ContextNet: using Both Branches; Context Branch; Detail Branch)

Network Design
- Depthwise Convolution
  - Factorizes standard convolution into spatial and 1x1 conv(s)
  - Fewer parameters
  - Fewer floating point operations
- Bottleneck residual block (Sandler et al., 2018)
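The parameter saving from the factorization can be checked with a short count. This is a sketch under stated assumptions: the layer sizes are hypothetical, biases are ignored, and the helper names are illustrative only.

```python
def standard_conv_params(k, c_in, c_out):
    """Weights in a k x k standard convolution (bias ignored)."""
    return k * k * c_in * c_out

def separable_conv_params(k, c_in, c_out):
    """Weights in a k x k depthwise convolution followed by a 1x1 convolution."""
    depthwise = k * k * c_in   # one k x k spatial filter per input channel
    pointwise = c_in * c_out   # 1x1 convolution that mixes channels
    return depthwise + pointwise

k, c_in, c_out = 3, 128, 128   # hypothetical layer sizes, not from the slides
std = standard_conv_params(k, c_in, c_out)
sep = separable_conv_params(k, c_in, c_out)

print(f"standard:  {std:,} weights")
print(f"separable: {sep:,} weights")
print(f"ratio:     {std / sep:.2f}x fewer")
```

Each weight is applied once per output position in both cases, so for stride 1 the multiply-accumulate count shrinks by the same ratio as the parameter count.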

Network Design
- Multi-scale feature fusion
- Two branches (cn14) balance accuracy and runtime
- cn14 with 160K params gets 57.7% mIoU on Cityscapes (Cordts et al., 2016)

Network Pruning
- Pruning:
  - Start with a "wider" network
  - Prune to obtain a "skinnier" network
- Pruning strategy improves accuracy compared to direct training!
- Lottery ticket hypothesis (Frankle and Carbin, 2018): more feature channels ⇒ more chances of success
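The wide-to-skinny step might look like the sketch below. The slide does not say how channels are chosen for removal, so the L1-norm ranking used here is an assumption, as are the toy weights and the helper names `l1_norm` and `prune_filters`.

```python
def l1_norm(filter_weights):
    """Sum of absolute weights of one flattened filter."""
    return sum(abs(w) for w in filter_weights)

def prune_filters(filters, keep):
    """Indices (in original order) of the `keep` filters with the largest L1 norm."""
    ranked = sorted(range(len(filters)),
                    key=lambda i: l1_norm(filters[i]),
                    reverse=True)
    return sorted(ranked[:keep])

# Toy "wide" layer: four filters, each flattened to a list (made-up values).
wide_layer = [
    [0.9, -0.8, 0.7],
    [0.01, 0.02, -0.01],
    [-0.5, 0.4, 0.6],
    [0.05, -0.03, 0.02],
]

# ASSUMPTION: keep the filters with the largest weight magnitude.
kept = prune_filters(wide_layer, keep=2)
skinny_layer = [wide_layer[i] for i in kept]
print("kept filter indices:", kept)
```

In a real network the pruned model would normally be trained further after each pruning step; that part is omitted here.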

ContextNet: Quantitative Evaluation
(Figure)

ContextNet: Quantitative Evaluation
- Runtime measured on Nvidia Titan X (Maxwell, 3072 CUDA cores)
- ContextNet balances accuracy and speed

Model       Class mIoU (%)  Category mIoU (%)  Parameters (M)  FPS at 1024x2048
SegNet      56.1            79.8               29.46           1.6
ENet        58.3            80.4               0.37            20.4
ICNet*      69.5            -                  6.68            14.2
ERFNet      68.0            86.5               2.1             11.2
ContextNet  66.1            82.7               0.85            23.2
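For readers unfamiliar with the metric in the table, mean intersection-over-union can be computed from a confusion matrix as sketched below. The two-class matrix is made up for illustration, and `mean_iou` is a hypothetical helper, not the Cityscapes evaluation script.

```python
def mean_iou(confusion):
    """Mean intersection-over-union from a square confusion matrix.

    confusion[t][p] counts pixels of true class t predicted as class p.
    Classes absent from both ground truth and prediction are skipped.
    """
    n = len(confusion)
    ious = []
    for c in range(n):
        tp = confusion[c][c]
        fn = sum(confusion[c]) - tp                        # missed pixels of c
        fp = sum(confusion[r][c] for r in range(n)) - tp   # wrongly labelled c
        denom = tp + fp + fn
        if denom > 0:
            ious.append(tp / denom)
    return sum(ious) / len(ious)

# Made-up pixel counts for two classes.
toy = [
    [50, 10],
    [5, 35],
]
print(f"{100 * mean_iou(toy):.1f}% mIoU")
```

The same formula applies to both accuracy columns; they differ only in whether pixels are scored against the fine classes or the coarser categories.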

ContextNet: Qualitative Evaluation
(Figure)

Conclusion
- ContextNet: efficiently learns global and local context separately
- Runs in real-time on 2-megapixel images
  - 2048x1024 images @ >16 fps on Nvidia Jetson TX2
- Our pruning strategy increases accuracy
- Limitations: accuracy gap with bigger off-line models
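The real-time claim translates into a per-frame time budget. The following is only arithmetic on the figures quoted in the slide (2048x1024 images, more than 16 fps), taking 16 fps as the lower bound.

```python
width, height = 2048, 1024   # image size from the slide
fps = 16                     # lower bound quoted for the Jetson TX2

pixels = width * height
frame_budget_ms = 1000 / fps
pixels_per_second = pixels * fps

print(f"{pixels:,} pixels per frame (~{pixels / 1e6:.1f} megapixels)")
print(f"at most {frame_budget_ms:.1f} ms per frame at {fps} fps")
print(f"{pixels_per_second:,} pixels labelled per second")
```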

References
Burt, P.J. and Adelson, E.H., The Laplacian pyramid as a compact image code. In Readings in Computer Vision, 1987.
Cordts, M., Omran, M., Ramos, S., Rehfeld, T., Enzweiler, M., Benenson, R., Franke, U., Roth, S. and Schiele, B., The Cityscapes Dataset for Semantic Urban Scene Understanding. In CVPR, 2016.
Frankle, J. and Carbin, M., The lottery ticket hypothesis: Training pruned neural networks. In arXiv:1803.03635, 2018.
He, K., Zhang, X., Ren, S. and Sun, J., Deep residual learning for image recognition. In arXiv:1512.03385, 2015.
Sandler, M., Howard, A., Zhu, M., Zhmoginov, A. and Chen, L.-C., Inverted residuals and linear bottlenecks: Mobile networks for classification, detection and segmentation. In arXiv:1801.04381, 2018.

Questions?
Thank you for your attention!
