Lecture 8: Image Segmentation
彭 超 Peng Chao Face++ Researcher pengchao@megvii.com
- Nov. 2017
Lecture 8: Image Segmentation Peng Chao Face++ Researcher - - PowerPoint PPT Presentation
Lecture 8: Image Segmentation Peng Chao Face++ Researcher pengchao@megvii.com Nov. 2017 Image Segmentation Semantic Segmentation Instance Segmentation Scene Parsing Human Parsing Stuff Segmentation New Track in COCO 2017
彭 超 Peng Chao Face++ Researcher pengchao@megvii.com
……
pixels in COCO
Figure credit: Ultrasound Nerve Segmentation on Kaggle
Normally, we use mean IOU to judge the results!
Long, Shelhamer, and Darrell, “Fully Convolutional Networks for Semantic Segmentation”, CVPR 2015
Feature Map Downsampling Score Map Upsampling
Long, Shelhamer, and Darrell, “Fully Convolutional Networks for Semantic Segmentation”, CVPR 2015
Long, Shelhamer, and Darrell, “Fully Convolutional Networks for Semantic Segmentation”, CVPR 2015
Long, Shelhamer, and Darrell, “Fully Convolutional Networks for Semantic Segmentation”, CVPR 2015
Noh, Hyeonwoo, Seunghoon Hong, and Bohyung Han. "Learning Deconvolution Network for Semantic Segmentation." ICCV 2015
Noh, Hyeonwoo, Seunghoon Hong, and Bohyung Han. "Learning Deconvolution Network for Semantic Segmentation." ICCV 2015
Noh, Hyeonwoo, Seunghoon Hong, and Bohyung Han. "Learning Deconvolution Network for Semantic Segmentation." ICCV 2015
Liang-Chieh Chen*, George Papandreou*, Iasonas Kokkinos, Kevin Murphy, and Alan L. Yuille (*equal contribution), arXiv preprint, 2016
Liang-Chieh Chen*, George Papandreou*, Iasonas Kokkinos, Kevin Murphy, and Alan L. Yuille (*equal contribution), arXiv preprint, 2016
Liang-Chieh Chen*, George Papandreou*, Iasonas Kokkinos, Kevin Murphy, and Alan L. Yuille (*equal contribution), arXiv preprint, 2016
Sutton, Charles A., and Andrew Mccallum. "An Introduction to Conditional Random Fields." arXiv: Machine Learning 4.4 (2012)
y is the label, x is the image U: Unary relation; V: pairwise relation
Sutton, Charles A., and Andrew Mccallum. "An Introduction to Conditional Random Fields." arXiv: Machine Learning 4.4 (2012)
❏ However, for loopy graph, the above problem is NP-hard. (The nodes relations are complex, making computing marginal probability harder) ❏ Approximated methods:
❏ MCMC (Gibbs Sampling) ❏ Loopy Belief propagation ❏ Mean Field
Sutton, Charles A., and Andrew Mccallum. "An Introduction to Conditional Random Fields." arXiv: Machine Learning 4.4 (2012)
Sutton, Charles A., and Andrew Mccallum. "An Introduction to Conditional Random Fields." arXiv: Machine Learning 4.4 (2012)
Krahenbuhl, Philipp, and Vladlen Koltun. "Efficient Inference in Fully Connected CRFs with Gaussian Edge Potentials." NIPS 2011
Krahenbuhl, Philipp, and Vladlen Koltun. "Efficient Inference in Fully Connected CRFs with Gaussian Edge Potentials." NIPS 2011
Zheng, Shuai, et al. "Conditional Random Fields as Recurrent Neural Networks." ICCV 2015
Krahenbuhl, Philipp, and Vladlen Koltun. "Efficient Inference in Fully Connected CRFs with Gaussian Edge Potentials." NIPS 2011
Attention to Scale: Scale-aware Semantic Image Segmentation CVPR 2016
Attention to Scale: Scale-aware Semantic Image Segmentation CVPR 2016
Attention to Scale: Scale-aware Semantic Image Segmentation CVPR 2016
Pyramid Secen Parsing Network
Pyramid Secen Parsing Network
Pyramid Secen Parsing Network
TRF of two 3x3 conv is: 5 However the VRF maybe different!
Figure credit: Szegedy C, Vanhoucke V, Ioffe S, et al. Rethinking the Inception Architecture for Computer Vision[J]. Computer Science, 2016. Zhou B, Khosla A, Lapedriza A, et al. Object Detectors Emerge in Deep Scene CNNs[J]. Computer Science, 2015.
Large Kernel Matters -- Improve Semantic Segmentation by Global Convolutional Network, CVPR 2017
Large Kernel Matters -- Improve Semantic Segmentation by Global Convolutional Network, CVPR 2017
Large Kernel Matters -- Improve Semantic Segmentation by Global Convolutional Network, CVPR 2017
Baseline (FCN)
Large Kernel Matters -- Improve Semantic Segmentation by Global Convolutional Network, CVPR 2017
Baseline (FCN) Gloabl Convolutional Network (GCN)
Image GCN Baseline (FCN)
Region Mis-Classifications are corrected!
Image GCN Baseline (FCN)
Region Mis-Classifications are corrected! The Details are lost!
Image GCN Baseline (FCN)
Boundary Refinement (BR)
GCN GCN + BR Boundary Refinement (BR)
GCN GCN + BR Boundary Refinement (BR) The Details are recoved!
GCN GCN + BR Boundary Refinement (BR) Ground-Truth The Details are recoved!
Rethinking Atrous Convolution for Semantic Image Segmentation, arxiv
Rethinking Atrous Convolution for Semantic Image Segmentation, arxiv
Rethinking Atrous Convolution for Semantic Image Segmentation, arxiv
Rethinking Atrous Convolution for Semantic Image Segmentation, arxiv
Rethinking Atrous Convolution for Semantic Image Segmentation, arxiv
Deformable Convolutional Networks, arxiv
Deformable Convolutional Networks, arxiv
Deformable Convolutional Networks, arxiv
Deformable Convolutional Networks, arxiv
Deformable Convolutional Networks, arxiv
Fully Convolutional Instance-aware Semantic Segmentation, CVPR 2017
Fully Convolutional Instance-aware Semantic Segmentation, CVPR 2017
Fully Convolutional Instance-aware Semantic Segmentation, CVPR 2017
Fully Convolutional Instance-aware Semantic Segmentation, CVPR 2017
Fully Convolutional Instance-aware Semantic Segmentation, CVPR 2017
Mask-RCNN, ICCV 2017
Mask-RCNN, ICCV 2017
Mask-RCNN, ICCV 2017
https://qiita.com/yu4u/items/5cbe9db166a5d72f9eb8
https://qiita.com/yu4u/items/5cbe9db166a5d72f9eb8
https://qiita.com/yu4u/items/5cbe9db166a5d72f9eb8
Mask-RCNN, ICCV 2017
Mask-RCNN, ICCV 2017
Semantic Instance Segmentation via Deep Metric Learning, arxiv
Semantic Instance Segmentation via Deep Metric Learning, arxiv
Semantic Instance Segmentation via Deep Metric Learning, arxiv
Semantic Instance Segmentation via Deep Metric Learning, arxiv
Semantic Instance Segmentation via Deep Metric Learning, arxiv
(for each instance)
between det and seg
instance
Track Rank Ensemble Single COCO BBox Detection 1st 52.8 50.5 Places InstanceSeg 1st 30.7 28.7 COCO Keypoint 1st 72.6 70.9 COCO InstanceSeg 2nd 46.4 45.0
Our Single Model is Here: 50.5.
Google Research SenseTime