[PPT] - Lecture 8: Image Segmentation Peng Chao Face++ Researcher PowerPoint Presentation

SLIDE 1

Lecture 8: Image Segmentation

彭超 Peng Chao Face++ Researcher pengchao@megvii.com

Nov. 2017

SLIDE 2

Image Segmentation

SLIDE 3

Semantic Segmentation

SLIDE 4

Instance Segmentation

SLIDE 5

Scene Parsing

SLIDE 6

Human Parsing

SLIDE 7

Stuff Segmentation

New Track in COCO 2017
Stuff: mountain, grass, wall, sky

……

Stuff covers about 66% of the

pixels in COCO

SLIDE 8

UlrtraSound Segmentation

Figure credit: Ultrasound Nerve Segmentation on Kaggle

SLIDE 9

Selfie Segmentation

SLIDE 10

Evaluation

Normally, we use mean IOU to judge the results!

SLIDE 11

Outline

Semantic Segmentation
Instance Segmentation

SLIDE 12

Outline

Semantic Segmentation
Instance Segmentation

SLIDE 13

Fully Convolutional Network

Long, Shelhamer, and Darrell, “Fully Convolutional Networks for Semantic Segmentation”, CVPR 2015

SLIDE 14

Fully Convolutional Network

Feature Map Downsampling Score Map Upsampling

Long, Shelhamer, and Darrell, “Fully Convolutional Networks for Semantic Segmentation”, CVPR 2015

SLIDE 15

Fully Convolutional Network

Long, Shelhamer, and Darrell, “Fully Convolutional Networks for Semantic Segmentation”, CVPR 2015

SLIDE 16

Fully Convolutional Network

First work using CNN to solve the semantic segmentation
Introducing skip-net framework
Large Improvement! (60 vs 30)

Long, Shelhamer, and Darrell, “Fully Convolutional Networks for Semantic Segmentation”, CVPR 2015

SLIDE 17

Learning Deconvolution Network for Semantic Segmentation

Noh, Hyeonwoo, Seunghoon Hong, and Bohyung Han. "Learning Deconvolution Network for Semantic Segmentation." ICCV 2015

SLIDE 18

Learning Deconvolution Network for Semantic Segmentation

Noh, Hyeonwoo, Seunghoon Hong, and Bohyung Han. "Learning Deconvolution Network for Semantic Segmentation." ICCV 2015

SLIDE 19

Learning Deconvolution Network for Semantic Segmentation

Introducing un-pool and de-convolution operations.
Introducing hourglass-like framework.

Noh, Hyeonwoo, Seunghoon Hong, and Bohyung Han. "Learning Deconvolution Network for Semantic Segmentation." ICCV 2015

SLIDE 20

DeepLab

Liang-Chieh Chen*, George Papandreou*, Iasonas Kokkinos, Kevin Murphy, and Alan L. Yuille (*equal contribution), arXiv preprint, 2016

SLIDE 21

DeepLab

Liang-Chieh Chen*, George Papandreou*, Iasonas Kokkinos, Kevin Murphy, and Alan L. Yuille (*equal contribution), arXiv preprint, 2016

SLIDE 22

DeepLab

Liang-Chieh Chen*, George Papandreou*, Iasonas Kokkinos, Kevin Murphy, and Alan L. Yuille (*equal contribution), arXiv preprint, 2016

SLIDE 23

DeepLab

Introducing dilated-convolution
Combining traditional method (post processing): DenseCRF

SLIDE 24

Conditional Random Field

Sutton, Charles A., and Andrew Mccallum. "An Introduction to Conditional Random Fields." arXiv: Machine Learning 4.4 (2012)

SLIDE 25

y is the label, x is the image U: Unary relation; V: pairwise relation

Conditional Random Field

Sutton, Charles A., and Andrew Mccallum. "An Introduction to Conditional Random Fields." arXiv: Machine Learning 4.4 (2012)

SLIDE 26

CRF Inference

❏ However, for loopy graph, the above problem is NP-hard. (The nodes relations are complex, making computing marginal probability harder) ❏ Approximated methods:

❏ MCMC (Gibbs Sampling) ❏ Loopy Belief propagation ❏ Mean Field

Sutton, Charles A., and Andrew Mccallum. "An Introduction to Conditional Random Fields." arXiv: Machine Learning 4.4 (2012)

SLIDE 27

CRF Inference: Mean field

Sutton, Charles A., and Andrew Mccallum. "An Introduction to Conditional Random Fields." arXiv: Machine Learning 4.4 (2012)

SLIDE 28

DenseCRF

Krahenbuhl, Philipp, and Vladlen Koltun. "Efficient Inference in Fully Connected CRFs with Gaussian Edge Potentials." NIPS 2011

SLIDE 29

DenseCRF

Krahenbuhl, Philipp, and Vladlen Koltun. "Efficient Inference in Fully Connected CRFs with Gaussian Edge Potentials." NIPS 2011

SLIDE 30

DenseCRF

Best Traditional Method!
Poor accuracy on segmentation! (poor feature)

SLIDE 31

CRF AS RNN

Zheng, Shuai, et al. "Conditional Random Fields as Recurrent Neural Networks." ICCV 2015

SLIDE 32

CRF AS RNN

Krahenbuhl, Philipp, and Vladlen Koltun. "Efficient Inference in Fully Connected CRFs with Gaussian Edge Potentials." NIPS 2011

SLIDE 33

CRF AS RNN

Encoding the DenseCRF into the CNN framework!
Better results than DenseCRF in post-processing (Deeplab)

SLIDE 34

Deeplab Attention

Attention to Scale: Scale-aware Semantic Image Segmentation CVPR 2016

SLIDE 35

Deeplab Attention

Attention to Scale: Scale-aware Semantic Image Segmentation CVPR 2016

SLIDE 36

Deeplab Attention

Attention to Scale: Scale-aware Semantic Image Segmentation CVPR 2016

SLIDE 37

Deeplab Attention

Fusion framework for multi-scale training and inference!
Combining Attention model into Segmentation framework!

SLIDE 38

PSPNet

Pyramid Secen Parsing Network

SLIDE 39

PSPNet

Pyramid Secen Parsing Network

SLIDE 40

PSPNet

Pyramid Secen Parsing Network

SLIDE 41

PSPNet

Propose the Pyramid Pooling Module!
Hard to reproduce!

SLIDE 42

Global Convolutional Network

TRF of two 3x3 conv is: 5 However the VRF maybe different！

Figure credit: Szegedy C, Vanhoucke V, Ioffe S, et al. Rethinking the Inception Architecture for Computer Vision[J]. Computer Science, 2016. Zhou B, Khosla A, Lapedriza A, et al. Object Detectors Emerge in Deep Scene CNNs[J]. Computer Science, 2015.

SLIDE 43

Global Convolutional Network

Large Kernel Matters -- Improve Semantic Segmentation by Global Convolutional Network, CVPR 2017

SLIDE 44

Global Convolutional Network

Large Kernel Matters -- Improve Semantic Segmentation by Global Convolutional Network, CVPR 2017

SLIDE 45

Global Convolutional Network

Large Kernel Matters -- Improve Semantic Segmentation by Global Convolutional Network, CVPR 2017

Baseline (FCN)

SLIDE 46

Global Convolutional Network

Large Kernel Matters -- Improve Semantic Segmentation by Global Convolutional Network, CVPR 2017

Baseline (FCN) Gloabl Convolutional Network (GCN)

SLIDE 47

Global Convolutional Network

Image GCN Baseline (FCN)

SLIDE 48

Global Convolutional Network

Region Mis-Classifications are corrected!

Image GCN Baseline (FCN)

SLIDE 49

Global Convolutional Network

Region Mis-Classifications are corrected! The Details are lost!

Image GCN Baseline (FCN)

SLIDE 50

Boundary Refinement (BR)

SLIDE 51

GCN GCN + BR Boundary Refinement (BR)

SLIDE 52

GCN GCN + BR Boundary Refinement (BR) The Details are recoved!

SLIDE 53

GCN GCN + BR Boundary Refinement (BR) Ground-Truth The Details are recoved!

SLIDE 54

SLIDE 55

Global Convolutional Network

Extend the FCN framework!
Partially Solve the Receptive Field Problem!
Two key components! (GCN and BRN)

SLIDE 56

Deeplab V3

Rethinking Atrous Convolution for Semantic Image Segmentation, arxiv

SLIDE 57

Deeplab V3

Rethinking Atrous Convolution for Semantic Image Segmentation, arxiv

SLIDE 58

Deeplab V3

Rethinking Atrous Convolution for Semantic Image Segmentation, arxiv

SLIDE 59

Deeplab V3

Rethinking Atrous Convolution for Semantic Image Segmentation, arxiv

SLIDE 60

Deeplab V3

Currently State-Of-Art on PASCAL VOC 2012
Conclude the dilate-convolution technique on segmentation

Rethinking Atrous Convolution for Semantic Image Segmentation, arxiv

SLIDE 61

Deformable Convolution

Deformable Convolutional Networks, arxiv

SLIDE 62

Deformable Convolution

Deformable Convolutional Networks, arxiv

SLIDE 63

Deformable Convolution

Deformable Convolutional Networks, arxiv

SLIDE 64

Deformable Convolution

Deformable Convolutional Networks, arxiv

SLIDE 65

Deformable Convolution

Solve the receptive field problem using learned offsets!
Also valid for detection!

Deformable Convolutional Networks, arxiv

SLIDE 66

Re-Cap

Segmentation with CNN: FCN, Deeplab, GCN ...
Segmentation with CRF: DenseCRF, CRFAsRNN, ...
Different Convolutions: Dilated Conv, Global Conv, Deformable, ...

SLIDE 67

Outline

Semantic Segmentation
Instance Segmentation

SLIDE 68

Top-Down Pipeline

SLIDE 69

Top-Down Pipeline

SLIDE 70