Recap from Monday Visualizing Networks Caffe overview Slides are - - PowerPoint PPT Presentation

recap from monday
SMART_READER_LITE
LIVE PREVIEW

Recap from Monday Visualizing Networks Caffe overview Slides are - - PowerPoint PPT Presentation

Recap from Monday Visualizing Networks Caffe overview Slides are now online Today Edges and Regions, GPB Fast Edge Detection Using Structured Forests Zhihao Li Holistically-Nested Edge Detection Yuxin Wu


slide-1
SLIDE 1

Recap from Monday

  • Visualizing Networks
  • Caffe overview
  • Slides are now online
slide-2
SLIDE 2

Today

  • Edges and Regions, GPB
  • Fast Edge Detection Using Structured Forests

– Zhihao Li

  • Holistically-Nested Edge Detection

– Yuxin Wu

  • Selective Search for Object Recognition

– Chun-Liang Li

slide-3
SLIDE 3

Logistics

  • Please read:

– Region-based Convolutional Networks for Accurate Object Detection and Semantic Segmentation

  • If you’re up next, please meet us
  • Project Proposals Due in < 1 week

– If you have questions, ask to meet

slide-4
SLIDE 4

Edges and Regions

David Fouhey

slide-5
SLIDE 5

Task

"I stand at the window and see a house, trees,

  • sky. Theoretically I might

say there were 327 brightnesses and nuances of colour. Do I have "327"? No. I have sky, house, and trees.”

  • Max Wertheimer

Quote from Jitendra Malik’s page

slide-6
SLIDE 6

Approaching the Task – Regions

Decomposing image into K connected regions (Clustering task)

slide-7
SLIDE 7

Approaching the Task – Edges

HxWx {0,1} classification problem

slide-8
SLIDE 8

Are the Tasks Equivalent?

Segmentation Boundaries

?

slide-9
SLIDE 9

Are the Tasks Equivalent?

Segmentation Boundaries

?

slide-10
SLIDE 10

Are the Tasks Equivalent?

Segmentation Boundaries

?

Contours have to be closed!

slide-11
SLIDE 11

Does This Matter in the CNN Era?

HED – State of the Art

slide-12
SLIDE 12

Are These Well-Defined Tasks?

Should blue and yellow go in the same segment?

Image credit: NYU depth dataset

slide-13
SLIDE 13

Successes – Superpixels

Problem: >10^5 pixels intractable for reasoning Solution: use bigger/super pixels that don’t ruin any boundaries

First from Ren et al. 2003, Fish image from Achanta et al. 2012

slide-14
SLIDE 14

Successes – Multiple Segmentations

  • Problem: No one segmentation is good
  • Solution: Use many, figure it out later

Hoiem et al. 2005

slide-15
SLIDE 15

Contributions of Paper

  • Merges the (edges + regions) approaches
  • Introduces machinery used throughout vision
  • Landmark paper in segmentation/boundary

detection

  • Note: the questions are often as important as

the answers

slide-16
SLIDE 16

Questions from Piazza

  • Where’s the learning?!

– Great idea! Two papers next

  • What’s this useful for?

– Great question! Last paper today, paper for Monday.

slide-17
SLIDE 17

Dataset – BSDS 500

Images

– 500 Total – 300 Training, 200 Testing

Annotation

– 5 annotators (CV students) per image – Annotators annotate segment

slide-18
SLIDE 18

Dataset – Instructions

Divide each image into pieces, where each piece represents a distinguished thing in the image. It is important that all of the pieces have approximately equal importance. The number of things in each image is up to you. Something between 2 and 20 should be reasonable for any

  • f our images

Martin et al. “A Database of Human Segmented Natural Images and its Application to Evaluating Segmentation Algorithms and Measuring Ecological Statistics.” ICCV 2001

slide-19
SLIDE 19

Dataset – Image and Annotations

slide-20
SLIDE 20

Evaluation Criteria – Boundaries

Precision = TP / (TP+FP)

(fraction of predicted + results that are +)

Recall = TP / (TP+FN)

(fraction of + results that are predicted +)

slide-21
SLIDE 21

Evaluation Criteria – Segments

  • In words: Average the intersection/union of

the best predicted region for all GT regions, weighted by GT region size

  • Previous evaluation criteria don’t clearly

distinguish dumb baselines from algorithm

  • utputs
slide-22
SLIDE 22

GPB-OWT-UCM

Boundary Detection

Local Discontinuity

Segmentation Machinery

Spectral Embedding

Boundary Detection

Spectral Discontinuity

Segmentation Machinery

OWT+UCM

slide-23
SLIDE 23

Local Terms

  • Core Idea: can compute histogram distances
slide-24
SLIDE 24

Local Terms

Orientation 1 Orientation 2 Max over Orientations Luminance Image

slide-25
SLIDE 25

Local Terms

Max over Orientations Luminance Image

slide-26
SLIDE 26

Local Terms – Multiple Cues

Accumulate evidence per-orientation

Weighted Sum of Predictions

slide-27
SLIDE 27

Learning

  • Simple linear combinations = few parameters
  • Gradient ascent in the reading
  • Logistic regression in past

weights Contour strength in feature + scale

slide-28
SLIDE 28

GPB-OWT-UCM

Boundary Detection

Local Discontinuity

Segmentation Machinery

Spectral Embedding

Boundary Detection

Spectral Discontinuity

Segmentation Machinery

OWT+UCM Probability of contour at location x,y, orientation t

slide-29
SLIDE 29

Globalization – Motivation

Local Globalized

slide-30
SLIDE 30

Globalization

𝑋 ∈ 𝑆 𝐼𝑋 𝑦 𝐼𝑋

Normal Spectral Clustering

  • 1. Use W to produce embedding/space

defined by eigenvectors of a system of

  • equations. See links on Piazza for why
  • 2. Cluster in induced space

This Paper

  • 1. Use W to produce embedding/space

defined by eigenvectors of a system of equations

  • 2. Treat eigenvectors as images, compute

gradient

slide-31
SLIDE 31

Globalization

Input Eigenvectors of Spectral System Weighted Sum of Gradients

slide-32
SLIDE 32

Combining Global + Local

  • Linear weighting; weights learned with

gradient ascent

Orientations processed separately throughout Why is this important?

slide-33
SLIDE 33

GPB-OWT-UCM

Boundary Detection

Local Discontinuity

Segmentation Machinery

Spectral Embedding

Boundary Detection

Spectral Discontinuity

Segmentation Machinery

OWT+UCM Could cluster in this space Probability of contour at location x,y, orientation t taking into consideration soft segmentations

slide-34
SLIDE 34

Watershed Transform – 1D Version

  • Black region: probability of boundary
  • Black lines: watershed boundaries
slide-35
SLIDE 35

Problem: probability of boundary is

  • rientation-

dependent Solution: get probability of boundary in direction

Orientation

slide-36
SLIDE 36

Output of Watershed Transform

“Oversegmentation” of image with boundary strengths

slide-37
SLIDE 37

UCM

  • Hierarchical merging; guarantees closed contours
slide-38
SLIDE 38

GPB-OWT-UCM

Boundary Detection

Local Discontinuity

Segmentation Machinery

Spectral Embedding

Boundary Detection

Spectral Discontinuity

Segmentation Machinery

OWT+UCM Contour that can be cut at any point to yield closed regions

slide-39
SLIDE 39

Results – State of the Art

This: 72.6 Current SOA: 78.2

slide-40
SLIDE 40

Results – Ablative Analysis

  • Combining Local +

Global helps

  • Why does local

help in high-recall regime?

slide-41
SLIDE 41

Results – Ablative Analysis

OWT/UCM:

  • Ensures closed

boundaries

  • Helps a little
slide-42
SLIDE 42

Next Up

  • Fast Edge Detection Using Structured Forests

– Zhihao Li

  • Holistically-Nested Edge Detection

– Yuxin Wu

  • Selective Search for Object Recognition

– Chun-Liang Li

slide-43
SLIDE 43
slide-44
SLIDE 44

Stash