Contours and Regions Pablo Arbelez UC Berkeley I. HISTORICAL - - PowerPoint PPT Presentation

contours and regions
SMART_READER_LITE
LIVE PREVIEW

Contours and Regions Pablo Arbelez UC Berkeley I. HISTORICAL - - PowerPoint PPT Presentation

Contours and Regions Pablo Arbelez UC Berkeley I. HISTORICAL MOTIVATION Some Computer Vision Prehistory Hubel and Wiesel (1981 Nobel Price winners): MEASUREMENT system INPUT Selective response: Physiological evidence for the


slide-1
SLIDE 1

Contours and Regions

Pablo Arbeláez UC Berkeley

slide-2
SLIDE 2
  • I. HISTORICAL MOTIVATION
slide-3
SLIDE 3

Some Computer Vision “Prehistory”

Selective response: Physiological evidence for the importance of oriented edges in early visual perception

  • Hubel and Wiesel (1981 Nobel Price winners):
  • Hubel, D. H. & T. N. Wiesel, Receptive Fields Of Single Neurons In The Cat's Striate Cortex, Journal of

Physiology, (I959) I48, 574-59I.

  • Hubel, D. H. & T. N. Wiesel. Receptive Fields, Binocular Interaction And Functional Architecture In

The Cat's Visual Cortex, Journal of Physiology, (1962), 160, pp. 106-154, With 2 plates and 20 text- figures.

system INPUT MEASUREMENT

slide-4
SLIDE 4

Some Computer Vision “Prehistory”

Hubel and Wiesel’s eureka moment In memoriam: the poor cat

http://www.youtube.com/watch?v=IOHayh06LJ4

slide-5
SLIDE 5
  • Attneave’s sleeping cat (1954)
  • Attneave, F. (1954). Some Informational Aspects Of Visual Perception. Psychological Review, 61, 183-193.

Humans can interpret visual information even from simplified line drawings

Some Computer Vision “Prehistory”

slide-6
SLIDE 6

First Computer Vision Thesis

Lawrence Roberts (MIT - 1963) Machine Perception Of Three-Dimensional Solids

ABSTRACT: “(…) A computer program has been written which can process a photograph into a line drawing , transform the line drawing into a three- dimensional representation, and ,finally, display the three-dimensional structure with all the hidden lines removed, from any point of view. The 2-D to 3-D construction and 3-D to 2-D display processes are sufficiently general to handle most collections of planar-surfaced objects and provide a valuable starting point for future investigation of computer- aided three-dimensional systems.”

http://www.packet.cc/files/mach-per-3D-solids.html

slide-7
SLIDE 7

After 30 Years of Intensive Research…

Edge Detection Image Segmentation

  • Sobel (1968)
  • Prewitt (1970)
  • Hildreth, Marr (1980)
  • Canny (1986)
  • Perona, Malik (1990)
  • Horowitz, Pavlidis (1974)
  • Beucher, Lantuéjoul (1979)
  • Mumford, Shah (1989)
  • Wu, Leahy (1993)

Today it remains an active field of research: (Google Scholar search with exact expression in article title) 17,200 results 8,290 results

slide-8
SLIDE 8

Example of segmentation papers from the 1980s: Mumford and Shah’s formulation

  • The segmentation u of an observed image u0 is given by

the minimization of the functional:

  • D. Mumford and J. Shah, “Optimal approximations by piecewise smooth functions, and associated

variational problems,” Communications on Pure and Applied Mathematics, pp. 577–684, 1989.

Data fidelity Smoothness Regularization

slide-9
SLIDE 9

Folk’s Wisdom circa 1995

Edge Detection

“Canny is as good as you get” “Segmentation is an ill-posed problem”

Image Segmentation Lack of data to study the problem on empirical grounds.

slide-10
SLIDE 10
  • II. RECENT RESEARCH
slide-11
SLIDE 11

Contour Detection and Image Segmentation

Pablo Arbel´ aez1, Michael Maire2, Charless Fowlkes3, and Jitendra Malik1

1University of California at Berkeley 2California Institute of Technology 3University of California at Irvine

slide-12
SLIDE 12
slide-13
SLIDE 13
slide-14
SLIDE 14

How to train/test?

slide-15
SLIDE 15

Berkeley Segmentation Dataset

  • D. Martin, C. Fowlkes, D. Tal, and J. Malik. “A Database of Human Segmented Natural Images and its

Application to Evaluating Segmentation Algorithms and Measuring Ecological Statistics”, ICCV, 2001

slide-16
SLIDE 16

Berkeley Segmentation Dataset

  • D. Martin, C. Fowlkes, D. Tal, and J. Malik. “A Database of Human Segmented Natural Images and its

Application to Evaluating Segmentation Algorithms and Measuring Ecological Statistics”, ICCV, 2001

slide-17
SLIDE 17

Results: Contours

0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 iso−F Recall Precision [F = 0.79] Human [F = 0.70] gPb [F = 0.68] Multiscale − Ren (2008) [F = 0.66] BEL − Dollar, Tu, Belongie (2006) [F = 0.66] Mairal, Leordeanu, Bach, Herbert, Ponce (2008) [F = 0.65] Min Cover − Felzenszwalb, McAllester (2006) [F = 0.65] Pb − Martin, Fowlkes, Malik (2004) [F = 0.64] Untangling Cycles − Zhu, Song, Shi (2007) [F = 0.64] CRF − Ren, Fowlkes, Malik (2005) [F = 0.58] Canny (1986) [F = 0.56] Perona, Malik (1990) [F = 0.50] Hildreth, Marr (1980) [F = 0.48] Prewitt (1970) [F = 0.48] Sobel (1968) [F = 0.47] Roberts (1965)
  • M. Maire, P. Arbel´

aez, C. Fowlkes, and J. Malik. “Using Contours to Detect and Localize Junctions in Natural Images”, CVPR, 2008

slide-18
SLIDE 18

Results: Segmentation

0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 iso−F Recall Precision [F = 0.79] Human [F = 0.71] gPb−owt−ucm [F = 0.67] UCM − Arbelaez (2006) [F = 0.63] Mean Shift − Comaniciu, Meer (2002) [F = 0.62] Normalized Cuts − Cour, Benezit, Shi (2005) [F = 0.58] Canny−owt−ucm [F = 0.58] Felzenszwalb, Huttenlocher (2004) [F = 0.58] Av. Diss. − Bertelli, Sumengen, Manjunath, Gibou (2008) [F = 0.55] ChanVese − Bertelli, Sumengen, Manjunath, Gibou (2008) [F = 0.55] Donoser, Urschler, Hirzer, Bischof (2009) [F = 0.53] Yang, Wright, Ma, Sastry (2007)
  • P. Arbel´

aez, M. Maire, C. Fowlkes, and J. Malik. “From Contours to Regions: An Empirical Evaluation”, CVPR, 2009

slide-19
SLIDE 19

Overview

slide-20
SLIDE 20

Overview

◮ Contour Detection

◮ Multiscale Local Cues ◮ Globalization

slide-21
SLIDE 21

Overview

◮ Contour Detection

◮ Multiscale Local Cues ◮ Globalization

◮ Contours → Hierarchical Segmentation

◮ Oriented Watershed Transform (OWT)

(Contours → Initial Regions)

◮ Ultrametric Contour Map (UCM)

(Initial Regions → Hierarchy)

slide-22
SLIDE 22

Overview

◮ Contour Detection

◮ Multiscale Local Cues ◮ Globalization

◮ Contours → Hierarchical Segmentation

◮ Oriented Watershed Transform (OWT)

(Contours → Initial Regions)

◮ Ultrametric Contour Map (UCM)

(Initial Regions → Hierarchy)

◮ Empirical Evaluation

◮ Boundary Quality ◮ Region Quality

slide-23
SLIDE 23

Overview

◮ Contour Detection

◮ Multiscale Local Cues ◮ Globalization

◮ Contours → Hierarchical Segmentation

◮ Oriented Watershed Transform (OWT)

(Contours → Initial Regions)

◮ Ultrametric Contour Map (UCM)

(Initial Regions → Hierarchy)

◮ Empirical Evaluation

◮ Boundary Quality ◮ Region Quality

◮ Interactive Segmentation

slide-24
SLIDE 24

Overview

◮ Contour Detection

◮ Multiscale Local Cues ◮ Globalization

◮ Contours → Hierarchical Segmentation

◮ Oriented Watershed Transform (OWT)

(Contours → Initial Regions)

◮ Ultrametric Contour Map (UCM)

(Initial Regions → Hierarchy)

◮ Empirical Evaluation

◮ Boundary Quality ◮ Region Quality

◮ Interactive Segmentation ◮ Multiscale Object Analysis

slide-25
SLIDE 25

Contour Detection

slide-26
SLIDE 26

Local Cues for Contour Detection

Estimate the posterior probability of a boundary Pb(x, y, θ)

  • D. Martin, C. Fowlkes, and J. Malik. “Learning to Detect Natural Image Boundaries

using Local Brightness, Color and Texture Cues”, TPAMI 2004.

slide-27
SLIDE 27

Local Cues for Contour Detection

◮ 1976 CIE L*a*b* colorspace ◮ Brightness Gradient BG(x, y, r, θ)

Difference of L* distributions

◮ Color Gradient CG(x, y, r, θ)

Difference of a*b* distributions

◮ Texture Gradient TG(x, y, r, θ)

Difference of distributions of V1-like filter responses

We combine these cues across multiple scales (r)

slide-28
SLIDE 28

Local Cues for Contour Detection

L a b textons

slide-29
SLIDE 29

Local Cues for Contour Detection

0.5 1 Upper Half−Disc Histogram 0.5 1 Lower Half−Disc Histogram

L

slide-30
SLIDE 30

Local Cues for Contour Detection

0.5 1 Upper Half−Disc Histogram 0.5 1 Lower Half−Disc Histogram

L G(x, y, θ = π

4 )

slide-31
SLIDE 31

Local Cues for Contour Detection

channel π

2

θ = 0 π

2

θ = π

2

slide-32
SLIDE 32

Local Cues for Contour Detection

channel π

2

θ = 0 π

2

θ = π

2

slide-33
SLIDE 33

Local Cues for Contour Detection

channel π

2

θ = 0 π

2

θ = π

2

slide-34
SLIDE 34

Globalization through Graph Partitioning

Build a weighted graph G = (V, E, W) from the image

slide-35
SLIDE 35

Globalization through Graph Partitioning

Build a weighted graph G = (V, E, W) from the image

◮ Nonmax suppression

slide-36
SLIDE 36

Globalization through Graph Partitioning

Build a weighted graph G = (V, E, W) from the image

◮ Nonmax suppression

slide-37
SLIDE 37

Globalization through Graph Partitioning

Build a weighted graph G = (V, E, W) from the image

◮ Nonmax suppression ◮ Define W using Intervening

Contour (i, j) low affinity

slide-38
SLIDE 38

Globalization through Graph Partitioning

Build a weighted graph G = (V, E, W) from the image

◮ Nonmax suppression ◮ Define W using Intervening

Contour (i, k) high affinity

slide-39
SLIDE 39

Globalization through Graph Partitioning

Build a weighted graph G = (V, E, W) from the image

◮ Nonmax suppression ◮ Define W using Intervening

Contour (i, k) high affinity

◮ Normalized Cuts

[Shi & Malik 1997]

slide-40
SLIDE 40

Normalized Cuts

◮ Graph G = (V , E, W ) ◮ Split into A, B disjoint, A ∪ B = V

  • J. Shi and J. Malik. “Normalized Cuts and Image Segmentation”, PAMI, 2000.
slide-41
SLIDE 41

Normalized Cuts

◮ Graph G = (V , E, W ) ◮ Split into A, B disjoint, A ∪ B = V

cut(A, B) =

  • u∈A,v∈B

w(u, v) assoc(A, V ) =

  • u∈A,v∈V

w(u, v) Ncut(A, B) = cut(A, B) assoc(A, V ) + cut(A, B) assoc(B, V )

  • J. Shi and J. Malik. “Normalized Cuts and Image Segmentation”, PAMI, 2000.
slide-42
SLIDE 42

Normalized Cuts

◮ Graph G = (V , E, W ) ◮ Split into A, B disjoint, A ∪ B = V

cut(A, B) =

  • u∈A,v∈B

w(u, v) assoc(A, V ) =

  • u∈A,v∈V

w(u, v) Ncut(A, B) = cut(A, B) assoc(A, V ) + cut(A, B) assoc(B, V )

◮ General case: partition using smallest eigenvectors of

(D − W )v = λDv where Dii =

j Wij

  • J. Shi and J. Malik. “Normalized Cuts and Image Segmentation”, PAMI, 2000.
slide-43
SLIDE 43

Do NOT Cluster Eigenvectors!

Image Eigenvectors Clustering eigenvector values leads to artifacts on uniform regions.

slide-44
SLIDE 44

Eigenvectors Carry Contour Information

Image Eigenvectors We use the gradients of eigenvectors rather than their values.

slide-45
SLIDE 45

Eigenvectors Carry Contour Information

Gradients of eigenvectors indicate salient contours in the image.

slide-46
SLIDE 46

Contour Detection

◮ Multiscale Brightness, Color, Texture Gradients:

mPb(x, y, θ) =

  • s
  • i

αi,sGi,σ(s)(x, y, θ)

slide-47
SLIDE 47

Contour Detection

◮ Multiscale Brightness, Color, Texture Gradients:

mPb(x, y, θ) =

  • s
  • i

αi,sGi,σ(s)(x, y, θ)

◮ Gradients of Eigenvectors:

sPb(x, y, θ) =

  • k

1 √λk · ∇θvk(x, y)

slide-48
SLIDE 48

Contour Detection

◮ Multiscale Brightness, Color, Texture Gradients:

mPb(x, y, θ) =

  • s
  • i

αi,sGi,σ(s)(x, y, θ)

◮ Gradients of Eigenvectors:

sPb(x, y, θ) =

  • k

1 √λk · ∇θvk(x, y)

◮ Global Probability of Boundary:

gPb(x, y, θ) =

  • s
  • i

βi,sGi,σ(s)(x, y, θ) + γ · sPb(x, y, θ)

slide-49
SLIDE 49

Contour Detection

◮ Multiscale Brightness, Color, Texture Gradients:

mPb(x, y, θ) =

  • s
  • i

αi,sGi,σ(s)(x, y, θ)

◮ Gradients of Eigenvectors:

sPb(x, y, θ) =

  • k

1 √λk · ∇θvk(x, y)

◮ Global Probability of Boundary:

gPb(x, y, θ) =

  • s
  • i

βi,sGi,σ(s)(x, y, θ) + γ · sPb(x, y, θ) Weights learned from training data

slide-50
SLIDE 50

Benefits of Globalization

Thresholded Pb Thresholded gPb

slide-51
SLIDE 51

Benefits of Globalization

Thresholded Pb Thresholded gPb

slide-52
SLIDE 52

Benefits of Globalization

0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 iso−F Recall Precision [F = 0.79] Human [F = 0.70] gPb [F = 0.68] sPb [F = 0.67] mPb [F = 0.65] Pb − Martin, Fowlkes, Malik (2004)
slide-53
SLIDE 53

Contours to Hierarchical Regions

slide-54
SLIDE 54

Contours to Hierarchical Regions

pb OWT-UCM Segmentation

slide-55
SLIDE 55

Watershed Transform

◮ Compute pb(x, y) =

maxθ pb(x, y, θ)

slide-56
SLIDE 56

Watershed Transform

◮ Compute pb(x, y) =

maxθ pb(x, y, θ)

◮ Seed locations are

regional minima of pb(x, y)

slide-57
SLIDE 57

Watershed Transform

◮ Compute pb(x, y) =

maxθ pb(x, y, θ)

◮ Seed locations are

regional minima of pb(x, y)

◮ Apply watershed

transform

slide-58
SLIDE 58

Watershed Transform

◮ Compute pb(x, y) =

maxθ pb(x, y, θ)

◮ Seed locations are

regional minima of pb(x, y)

◮ Apply watershed

transform

◮ Catchment basins P0 are

regions

slide-59
SLIDE 59

Watershed Transform

◮ Compute pb(x, y) =

maxθ pb(x, y, θ)

◮ Seed locations are

regional minima of pb(x, y)

◮ Apply watershed

transform

◮ Catchment basins P0 are

regions

◮ Arcs K0 are boundaries

slide-60
SLIDE 60

Oriented Watershed Transform (OWT)

pb(x, y) Watershed

slide-61
SLIDE 61

Oriented Watershed Transform (OWT)

pb(x, y) Watershed Subdivision

slide-62
SLIDE 62

Oriented Watershed Transform (OWT)

pb(x, y, θ) Watershed Subdivision

slide-63
SLIDE 63

Oriented Watershed Transform (OWT)

pb(x, y, θ) OWT Subdivision

slide-64
SLIDE 64

Oriented Watershed Transform (OWT)

pb(x, y, θ) OWT Watershed

slide-65
SLIDE 65

Ultrametric Contour Map (UCM)

◮ Duality between closed, non-self-intersecting weighted

contours and a hierarchy of regions1

  • 1P. Arbel´
  • aez. “Boundary Extraction in Natural Images using Ultrametric Contour Maps”, POCV, 2006.
slide-66
SLIDE 66

Ultrametric Contour Map (UCM)

◮ Duality between closed, non-self-intersecting weighted

contours and a hierarchy of regions1

◮ Graph G = (P0, K0, W (K0)) given by OWT

  • 1P. Arbel´
  • aez. “Boundary Extraction in Natural Images using Ultrametric Contour Maps”, POCV, 2006.
slide-67
SLIDE 67

Ultrametric Contour Map (UCM)

◮ Duality between closed, non-self-intersecting weighted

contours and a hierarchy of regions1

◮ Graph G = (P0, K0, W (K0)) given by OWT ◮ Iteratively merge regions by removing minimum weight

boundary

  • 1P. Arbel´
  • aez. “Boundary Extraction in Natural Images using Ultrametric Contour Maps”, POCV, 2006.
slide-68
SLIDE 68

Ultrametric Contour Map (UCM)

◮ Duality between closed, non-self-intersecting weighted

contours and a hierarchy of regions1

◮ Graph G = (P0, K0, W (K0)) given by OWT ◮ Iteratively merge regions by removing minimum weight

boundary

◮ Produces region tree

◮ Root is entire image ◮ Leaves are P0 ◮ Height(R) is boundary threshold at which R first appears ◮ Distance(R1, R2) = min{Height(R) : R1, R2 ⊆ R}

  • 1P. Arbel´
  • aez. “Boundary Extraction in Natural Images using Ultrametric Contour Maps”, POCV, 2006.
slide-69
SLIDE 69

Ultrametric Contour Map (UCM)

slide-70
SLIDE 70

Ultrametric Contour Map (UCM)

slide-71
SLIDE 71

Ultrametric Contour Map (UCM)

slide-72
SLIDE 72

OWT-UCM Preserves Boundary Quality

0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 iso−F Recall Precision [F = 0.79] Human [F = 0.71] gPb−owt−ucm [F = 0.70] gPb [F = 0.58] Canny−owt−ucm [F = 0.58] Canny
slide-73
SLIDE 73

Hierarchical Segmentation Results

gPb-owt-ucm ODS OIS

slide-74
SLIDE 74

Hierarchical Segmentation Results

gPb-owt-ucm ODS OIS

slide-75
SLIDE 75

Empirical Evaluation

slide-76
SLIDE 76

Benchmarking Region Boundaries

0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 iso−F Recall Precision [F = 0.79] Human [F = 0.71] gPb−owt−ucm [F = 0.67] UCM − Arbelaez (2006) [F = 0.63] Mean Shift − Comaniciu, Meer (2002) [F = 0.62] Normalized Cuts − Cour, Benezit, Shi (2005) [F = 0.58] Canny−owt−ucm [F = 0.58] Felzenszwalb, Huttenlocher (2004) [F = 0.58] Av. Diss. − Bertelli, Sumengen, Manjunath, Gibou (2008) [F = 0.55] ChanVese − Bertelli, Sumengen, Manjunath, Gibou (2008) [F = 0.55] Donoser, Urschler, Hirzer, Bischof (2009) [F = 0.53] Yang, Wright, Ma, Sastry (2007)
slide-77
SLIDE 77

Region Quality

◮ Segmentation methods burdened with the constraint of

producing closed boundaries

slide-78
SLIDE 78

Region Quality

◮ Segmentation methods burdened with the constraint of

producing closed boundaries

◮ BSDS boundary benchmark might favor contour detectors

slide-79
SLIDE 79

Region Quality

◮ Segmentation methods burdened with the constraint of

producing closed boundaries

◮ BSDS boundary benchmark might favor contour detectors ◮ Region-based performance metrics

slide-80
SLIDE 80

Region Quality

◮ Segmentation methods burdened with the constraint of

producing closed boundaries

◮ BSDS boundary benchmark might favor contour detectors ◮ Region-based performance metrics

◮ Variation of Information

slide-81
SLIDE 81

Region Quality

◮ Segmentation methods burdened with the constraint of

producing closed boundaries

◮ BSDS boundary benchmark might favor contour detectors ◮ Region-based performance metrics

◮ Variation of Information ◮ Rand Index

slide-82
SLIDE 82

Region Quality

◮ Segmentation methods burdened with the constraint of

producing closed boundaries

◮ BSDS boundary benchmark might favor contour detectors ◮ Region-based performance metrics

◮ Variation of Information ◮ Rand Index ◮ Segmentation Covering

slide-83
SLIDE 83

Variation of Information

Distance between two clusterings of data C and C ′ given by VI(C, C ′) = H(C) + H(C ′) − 2I(C, C ′) Here C and C ′ are test and ground-truth segmentations.

slide-84
SLIDE 84

Probabilistic Rand Index

Given a set of ground-truth segmentations {Gk}, PRI(S, {Gk}) = 1 T

  • i<j

[cijpij + (1 − cij)(1 − pij)] where cij is the event that pixels i and j have the same label and pij its probability.

slide-85
SLIDE 85

Segment Covering

Overlap between two regions R and R′: O(R, R′) = |R ∩ R′| |R ∪ R′| Covering of a segmentation S by a segmentation S′: C(S′ → S) = 1 N

  • R∈S

|R| · max

R′∈S′ O(R, R′)

We report the covering of groundtruth by test.

slide-86
SLIDE 86

Region Benchmarks on the BSDS

Covering PRI VI ODS OIS Best ODS OIS ODS OIS Human 0.73 0.73 − 0.87 0.87 1.16 1.16 gPb-owt-ucm 0.59 0.65 0.75 0.81 0.85 1.65 1.47 Mean Shift 0.54 0.58 0.66 0.78 0.80 1.83 1.63 Felz-Hutt 0.51 0.58 0.68 0.77 0.82 2.15 1.79 Canny-owt-ucm 0.48 0.56 0.66 0.77 0.82 2.11 1.81 NCuts 0.44 0.53 0.66 0.75 0.79 2.18 1.84 Total Var. 0.57 − − 0.78 − 1.81 − T+B Encode 0.54 − − 0.78 − 1.86 −

  • Av. Diss.

0.47 − − 0.76 − 2.62 − ChanVese 0.49 − − 0.75 − 2.54 −

slide-87
SLIDE 87

Interactive Segmentation

◮ Relevant for graphics applications

slide-88
SLIDE 88

Interactive Segmentation

◮ Relevant for graphics applications ◮ Graph cuts formalism has become popular1,2,3

◮ User marks foreground/background ◮ Region model learned on the fly

  • 1Y. Boykov and M.-P. Jolly. “Interactive Graph Cuts for Optimal Boundary &

Region Segmentation of Objects in N-D Images”, ICCV, 2001

  • 3C. Rother, V. Kolmogorov, A. Blake. ““Grabcut”: Interactive Foreground Extraction

using Iterated Graph Cuts”, SIGGRAPH, 2004

  • 2Y. Li, J. Sun, C.-K. Tang, and H.-Y. Shum. “Lazy Snapping”, SIGGRAPH, 2004
slide-89
SLIDE 89

Interactive Segmentation

◮ Relevant for graphics applications ◮ Graph cuts formalism has become popular1,2,3

◮ User marks foreground/background ◮ Region model learned on the fly

  • 1Y. Boykov and M.-P. Jolly. “Interactive Graph Cuts for Optimal Boundary &

Region Segmentation of Objects in N-D Images”, ICCV, 2001

  • 3C. Rother, V. Kolmogorov, A. Blake. ““Grabcut”: Interactive Foreground Extraction

using Iterated Graph Cuts”, SIGGRAPH, 2004

  • 2Y. Li, J. Sun, C.-K. Tang, and H.-Y. Shum. “Lazy Snapping”, SIGGRAPH, 2004

◮ Alternative: use precomputed segmentation tree4

◮ Distance(R1, R2) = min{Height(R) : R1, R2 ⊆ R} ◮ Assign missing labels using closest labeled region

  • 4P. Arbel´

aez and L. Cohen. “Constrained Image Segmentation from Hierarchical Boundaries”, CVPR, 2008

slide-90
SLIDE 90

Interactive Segmentation

User Annotation Automatic Refinement

slide-91
SLIDE 91

Interactive Segmentation

slide-92
SLIDE 92

Multiscale Object Analysis

◮ Real scenes are multiscale

slide-93
SLIDE 93

Multiscale Object Analysis

◮ Real scenes are multiscale ◮ Three scales of local cues are insufficient

slide-94
SLIDE 94

Multiscale Object Analysis

◮ Real scenes are multiscale ◮ Three scales of local cues are insufficient ◮ Scanning object detectors:

◮ loop over scales, loop over windows ◮ apply classifier to each image window

slide-95
SLIDE 95

Multiscale Object Analysis

◮ Real scenes are multiscale ◮ Three scales of local cues are insufficient ◮ Scanning object detectors:

◮ loop over scales, loop over windows ◮ apply classifier to each image window

◮ Detector input should be scale-dependent

slide-96
SLIDE 96

Multiscale Object Analysis

◮ Real scenes are multiscale ◮ Three scales of local cues are insufficient ◮ Scanning object detectors:

◮ loop over scales, loop over windows ◮ apply classifier to each image window

◮ Detector input should be scale-dependent ◮ Generate scale-dependent contours/segments

slide-97
SLIDE 97

Multiscale Object Analysis

slide-98
SLIDE 98

Multiscale Object Analysis

slide-99
SLIDE 99

Multiscale Object Analysis

slide-100
SLIDE 100

Multiscale Object Analysis

slide-101
SLIDE 101

Multiscale Object Analysis

slide-102
SLIDE 102

Thank You

slide-103
SLIDE 103

Take-Home Messages

  • Image segmentation, at least in the BSDS setting, is a well-

posed problem and the high consistency among human segmentations allows for its study on empirical bases.

  • Canny is not as good as you get. The existence of a

quantitative evaluation framework has led to measurable progress in the field over the last decade.

  • Image segmentation and contour detection are two aspects
  • f the same problem and can be studied jointly. Our

particular approach consists in reducing the former to the latter.

  • Berkeley Segmentation Resources:

http://www.eecs.berkeley.edu/Research/Projects/CS/vision/grouping/resources.html