Holistically-Nested Edge Detection (HED) Saining Xie, Zhuowen Tu - - PowerPoint PPT Presentation

holistically nested edge detection hed
SMART_READER_LITE
LIVE PREVIEW

Holistically-Nested Edge Detection (HED) Saining Xie, Zhuowen Tu - - PowerPoint PPT Presentation

Holistically-Nested Edge Detection (HED) Saining Xie, Zhuowen Tu Presented by Yuxin Wu February 10, 2016 . . . . . .. . . . . . . .. . . . . . . .. . . . . . . .. . . .. . . . . . . . . . . . . . . . . . . . . What


slide-1
SLIDE 1

Holistically-Nested Edge Detection (HED)

Saining Xie, Zhuowen Tu Presented by Yuxin Wu February 10, 2016 . . . . . .. . . . . . . .. . . . . . . .. . . . . . . .. . . .. . . . . .

slide-2
SLIDE 2

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

What is an Edge?

Local intensity change? Used in traditional methods: Canny, Sobel, etc. Learn it!

Presented by Yuxin Wu Holistically-Nested Edge Detection (HED) February 10, 2016 2 / 15

slide-3
SLIDE 3

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

What is an Edge?

Local intensity change? Used in traditional methods: Canny, Sobel, etc. Learn it!

Presented by Yuxin Wu Holistically-Nested Edge Detection (HED) February 10, 2016 2 / 15

slide-4
SLIDE 4

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Inspiration

Fully Convolutional Network (FCN)

Concept originally brought out for semantic segmentation No fully-connected layers (can be converted) Allow inputs of any sizes

Presented by Yuxin Wu Holistically-Nested Edge Detection (HED) February 10, 2016 3 / 15

slide-5
SLIDE 5

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

HED Design

Holistically-Nested architecture

Presented by Yuxin Wu Holistically-Nested Edge Detection (HED) February 10, 2016 4 / 15

slide-6
SLIDE 6

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

HED Design

Multiple Supervision Signals

Single output, multiple cost Learn earlier, learn better Alleviate gradient vanishing

Presented by Yuxin Wu Holistically-Nested Edge Detection (HED) February 10, 2016 5 / 15

slide-7
SLIDE 7

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

HED Design

Convolutional Layers

Fine-tuning from VGG16: Lots of people do ine-tuning on top of VGG16. 5 stage. 3x3 convolution only. HED adds a side output (conv1x1) after each stage.

Presented by Yuxin Wu Holistically-Nested Edge Detection (HED) February 10, 2016 6 / 15

slide-8
SLIDE 8

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

HED Design

Upsampling by Deconvolution

Upsampling by a factor of k ∈ N+ is implemented by a deconvolution with a 2k × 2k kernel and output stride k. An mathematically equivalent explanation (assume k = 2):

1 Input image with shape n 2 Zero-illed upsample as above, by a factor of 2. Shape becomes 2n − 1 3 Convolve with a ilter

   

1 16 3 16 3 16 1 16 3 16 9 16 9 16 3 16 3 16 9 16 9 16 3 16 1 16 3 16 3 16 1 16

    with padding = 3, shape becomes (2n − 1) + 3 = 2n + 2. Then center-crop to 2n

Presented by Yuxin Wu Holistically-Nested Edge Detection (HED) February 10, 2016 7 / 15

slide-9
SLIDE 9

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

HED Design

Upsampling by Deconvolution

Upsampling by a factor of k ∈ N+ is implemented by a deconvolution with a 2k × 2k kernel and output stride k. An mathematically equivalent explanation (assume k = 2):

1 Input image with shape n 2 Zero-illed upsample as above, by a factor of 2. Shape becomes 2n − 1 3 Convolve with a ilter

   

1 16 3 16 3 16 1 16 3 16 9 16 9 16 3 16 3 16 9 16 9 16 3 16 1 16 3 16 3 16 1 16

    with padding = 3, shape becomes (2n − 1) + 3 = 2n + 2. Then center-crop to 2n

Presented by Yuxin Wu Holistically-Nested Edge Detection (HED) February 10, 2016 7 / 15

slide-10
SLIDE 10

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

HED Design

Class-Balanced Sigmoid Cross Entropy Loss

Sigmoid Cross Entropy Loss For each pixel, loss L = −[y⋆ log(y) + (1 − y⋆) log(1 − y)] where ground truth label y⋆ ∈ {0, 1}, y = 1 1 + e−z In images, 90% pixels are not edge, cost function is dominated by negative labels. To avoid this, re-weight the terms: Class-Balanced Sigmoid Cross Entropy Loss L = −[βy⋆ log(y) + (1 − β)(1 − y⋆) log(1 − y)] where β is the ratio of negative ground truth labels in this batch of data This loss function is computed for ℓ1..5 as well as ℓfuse =

5

i=1

αiℓi

Presented by Yuxin Wu Holistically-Nested Edge Detection (HED) February 10, 2016 8 / 15

slide-11
SLIDE 11

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

HED Design

Class-Balanced Sigmoid Cross Entropy Loss

Sigmoid Cross Entropy Loss For each pixel, loss L = −[y⋆ log(y) + (1 − y⋆) log(1 − y)] where ground truth label y⋆ ∈ {0, 1}, y = 1 1 + e−z In images, 90% pixels are not edge, cost function is dominated by negative labels. To avoid this, re-weight the terms: Class-Balanced Sigmoid Cross Entropy Loss L = −[βy⋆ log(y) + (1 − β)(1 − y⋆) log(1 − y)] where β is the ratio of negative ground truth labels in this batch of data This loss function is computed for ℓ1..5 as well as ℓfuse =

5

i=1

αiℓi

Presented by Yuxin Wu Holistically-Nested Edge Detection (HED) February 10, 2016 8 / 15

slide-12
SLIDE 12

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

HED Design

Holistically-Nested architecture

Presented by Yuxin Wu Holistically-Nested Edge Detection (HED) February 10, 2016 9 / 15

slide-13
SLIDE 13

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Experiements

Outputs

Presented by Yuxin Wu Holistically-Nested Edge Detection (HED) February 10, 2016 10 / 15

slide-14
SLIDE 14

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Experiements

Qualitative Results

Figure: Results on BSD500 (a small dataset)

Presented by Yuxin Wu Holistically-Nested Edge Detection (HED) February 10, 2016 11 / 15

slide-15
SLIDE 15

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Experiements

Efect of Supervision

Presented by Yuxin Wu Holistically-Nested Edge Detection (HED) February 10, 2016 12 / 15

slide-16
SLIDE 16

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Experiements

Efect of Supervision

Figure: Output of 2nd stage with(left) and without(right) extra supervision

Presented by Yuxin Wu Holistically-Nested Edge Detection (HED) February 10, 2016 13 / 15

slide-17
SLIDE 17

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Experiements

Misc.

Rotation/lip/scaling as data augmentation Using depth information (in NYUD dataset) gives better performance Pure FCN / HED without multiple supervision don’t work as good 2.5 fps on K40 for 320×480 input

Presented by Yuxin Wu Holistically-Nested Edge Detection (HED) February 10, 2016 14 / 15

slide-18
SLIDE 18

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Experiements

CMU Pano

Presented by Yuxin Wu Holistically-Nested Edge Detection (HED) February 10, 2016 15 / 15

slide-19
SLIDE 19

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Thanks!

Yuxin Wu

Presented by Yuxin Wu Holistically-Nested Edge Detection (HED) February 10, 2016 16 / 15