
SLIDE 1

Graph Based Image Segmentation

Jianbo Shi University of Pennsylvania

SLIDE 2

A top-down process?

SLIDE 3

Or a bottom-up process?

SLIDE 4

young woman, old woman

Both segmentation and recognition are context sensitive: Need the whole to see its parts

SLIDE 5

SLIDE 6

Is segmentation ill-defined?

SLIDE 7

D. Martin, CVPR 2004 Graph-Based Image Segmentation Tutorial


Berkeley Human Segmentation Dataset

SLIDE 8


Dataset Summary

30 subjects, age 19-23

– 17 men, 13 women
– 9 with artistic training

8 months
1,458 person hours
1,020 Corel images
11,595 Segmentations

– 5,555 color, 5,554 gray, 486 inverted/negated

SLIDE 9


Do you even have one consistent segmentation?

SLIDE 10

Segment labels in the example image: Basket, Water, Rail, Trees, Sky, Shirt, Skirt, Arm, Arms, Leg, Legs, Head, Walk, Curb, Box, Road, Pack, Bag, Bridge, Left Woman, Right Woman

Percept Tree

SLIDE 11


A B D C

SLIDE 12


Percept trees A and B (figure), each with nodes: Scene, Background, Foreground, Trees, Shore, Water, Land, Sky, Rocks, Base, Small, Top, L, R, Mermaid

SLIDE 13


Percept trees A, B and the merged tree A+B (figure), each with nodes: Scene, Background, Foreground, Sky, Trees, Shore, Water, Land, Rocks, Base, Small, Top, L, R, Mermaid

SLIDE 14


A B D C

SLIDE 15

Insight 1: Segmentation/Clustering is always hierarchical

The difficult part is getting the top of the tree correct

Merged percept tree A+B (figure), with nodes: Scene, Background, Foreground, Sky, Trees, Shore, Water, Land, Rocks, Base, Small, Top, L, R, Mermaid

SLIDE 16


Is Segmentation Solvable?

Human Superman

C. Fowlkes

SLIDE 17

Graph Based Image Segmentation

Image = { pixels }
V: graph nodes
E: edges connecting nodes
Wij: pixel similarity between nodes i and j
Segmentation = Graph partition
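
To make the pixel graph concrete, here is a minimal sketch that builds a dense similarity matrix W for a tiny grayscale image. The Gaussian weighting on intensity difference and spatial distance, the function name `pixel_affinity`, and the parameters `sigma_i`, `sigma_x` and `radius` are assumptions chosen for illustration; the slides do not specify a formula at this point.

```python
import numpy as np

def pixel_affinity(image, sigma_i=0.1, sigma_x=2.0, radius=2):
    """Dense affinity matrix W for a small grayscale image (illustrative only).

    Assumed form: W[i, j] = exp(-(I_i - I_j)^2 / sigma_i^2)
                            * exp(-dist(i, j)^2 / sigma_x^2)
    for pixels within `radius` of each other, and 0 otherwise.
    """
    h, w = image.shape
    ys, xs = np.mgrid[0:h, 0:w]
    coords = np.stack([ys.ravel(), xs.ravel()], axis=1).astype(float)
    values = image.ravel().astype(float)

    # Pairwise squared differences in position and in intensity.
    d_pos = ((coords[:, None, :] - coords[None, :, :]) ** 2).sum(axis=2)
    d_int = (values[:, None] - values[None, :]) ** 2

    W = np.exp(-d_int / sigma_i**2) * np.exp(-d_pos / sigma_x**2)
    W[d_pos > radius**2] = 0.0   # keep only local connections
    np.fill_diagonal(W, 0.0)     # no self-loops
    return W

# Toy image: dark left half, bright right half.
image = np.zeros((4, 6))
image[:, 3:] = 1.0
W = pixel_affinity(image)
```

Neighbouring pixels on the same side of the intensity step get a high weight, while neighbours across the step get a weight close to zero, so a good graph partition follows the step.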

SLIDE 18

Right partition cost function?
Efficient optimization algorithm?

SLIDE 19

For simple cases, can try this:

SLIDE 20

Minimal/Maximal Spanning Tree

A tree is a graph G without cycles

Figure panels: Graph, Minimal spanning tree, Maximal spanning tree

SLIDE 21

Building an MST

Let X be any subset of the vertices of G, and let edge e be the smallest edge connecting X to G-X. Then e is part of the minimum spanning tree

SLIDE 22

Prim’s algorithm

let T be a single vertex x
while (T has fewer than n vertices) {
    find the smallest edge connecting T to G-T
    add it to T
}
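
The pseudocode above can be sketched in Python with a binary heap holding the edges that leave the current tree. The function name `prim_mst`, the `(u, v, weight)` edge format and the toy graph are assumptions made for this example.

```python
import heapq

def prim_mst(n, edges, start=0):
    """Grow a minimum spanning tree from `start`; edges are (u, v, weight)."""
    adj = {v: [] for v in range(n)}
    for u, v, w in edges:
        adj[u].append((w, u, v))
        adj[v].append((w, v, u))

    in_tree = {start}
    frontier = list(adj[start])      # edges leaving the tree, keyed by weight
    heapq.heapify(frontier)
    tree = []
    while frontier and len(in_tree) < n:
        w, u, v = heapq.heappop(frontier)   # smallest edge connecting T to G-T
        if v in in_tree:
            continue                        # both endpoints already in T
        in_tree.add(v)
        tree.append((u, v, w))
        for edge in adj[v]:
            if edge[2] not in in_tree:
                heapq.heappush(frontier, edge)
    return tree

edges = [(0, 1, 4), (0, 2, 1), (1, 2, 2), (1, 3, 5), (2, 3, 8)]
mst = prim_mst(4, edges)
total_weight = sum(w for _, _, w in mst)
```

On this toy graph the tree picks the edges of weight 1, 2 and 5, for a total of 8.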

SLIDE 23

Kruskal’s algorithm

let S be an empty set of edges
sort the edges of G in increasing order by length
for each edge e in sorted order
    if the endpoints of e are disconnected in S
        add e to S
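
A minimal sketch of Kruskal's steps, using a union-find structure to test whether the endpoints of an edge are already connected in S. The union-find helper, the function name `kruskal_mst` and the toy graph are assumptions for illustration.

```python
def kruskal_mst(n, edges):
    """Return the edges of a minimum spanning forest; edges are (u, v, weight)."""
    parent = list(range(n))

    def find(v):
        # Follow parent pointers to the component root, halving the path.
        while parent[v] != v:
            parent[v] = parent[parent[v]]
            v = parent[v]
        return v

    S = []
    for u, v, w in sorted(edges, key=lambda e: e[2]):
        root_u, root_v = find(u), find(v)
        if root_u != root_v:          # endpoints are disconnected in S
            parent[root_u] = root_v   # merge the two components
            S.append((u, v, w))
    return S

edges = [(0, 1, 4), (0, 2, 1), (1, 2, 2), (1, 3, 5), (2, 3, 8)]
mst = kruskal_mst(4, edges)
total_weight = sum(w for _, _, w in mst)
```

Stopping the loop early, before the last few merges, leaves several components, which is one simple way to read a segmentation off the tree.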

A randomized version can compute typical cuts

SLIDE 24

Leakage problem in MST

Leakage

SLIDE 25

SLIDE 26

SLIDE 27

Image I Graph Affinities W=W(I,Θ)

Intensity Color Edges Texture

…

Graph Segmentation: image cues

SLIDE 28

Color: a*, b*
Brightness: L*
Texture
Original Image
Wij

Figure labels: Proximity, Boundary Processing, Textons, Region Processing, χ2

C. Fowlkes

SLIDE 29


What is so hard about segmentation?

SLIDE 30


SLIDE 31


SLIDE 32


Non-Boundaries Boundaries I T B C

C. Fowlkes

SLIDE 33


Pb Images I

Canny 2MM Us Human Image

C. Fowlkes

SLIDE 34


Image I Graph Affinities W=W(I,Θ)

Intensity Color Edges Texture

…

Edge extraction A B A B High affinity Low affinity

SLIDE 35


Intervening Contour

…turning a boundary map into Wij

Wij = 1 - maximum Pb along the line connecting i and j
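
The intervening-contour rule can be sketched as follows. The nearest-pixel sampling of the line, the function name `intervening_contour_affinity` and the toy boundary map are assumptions for this example; a real implementation might use a proper line rasterizer and restrict pairs to a local neighbourhood.

```python
import numpy as np

def intervening_contour_affinity(pb, i, j):
    """Affinity = 1 - max Pb sampled along the straight line from pixel i to j.

    `pb` is a 2-D boundary-probability map with values in [0, 1];
    `i` and `j` are (row, col) pixel coordinates.
    """
    (r0, c0), (r1, c1) = i, j
    samples = max(abs(r1 - r0), abs(c1 - c0)) + 1
    rows = np.rint(np.linspace(r0, r1, samples)).astype(int)
    cols = np.rint(np.linspace(c0, c1, samples)).astype(int)
    return 1.0 - pb[rows, cols].max()

# Toy boundary map with a strong vertical contour in column 2.
pb = np.zeros((5, 5))
pb[:, 2] = 0.9

w_same_side = intervening_contour_affinity(pb, (1, 0), (3, 1))   # no contour crossed
w_across = intervening_contour_affinity(pb, (1, 0), (1, 4))      # crosses the contour
```

Pixels on the same side of the contour keep affinity 1, while a pair separated by the contour drops to about 0.1.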

SLIDE 36


Image I Graph Affinities W=W(I,Θ)

Intensity Color Edges Texture

…

Graph Segmentation

SLIDE 37


Image I Graph Affinities W=W(I,Θ)

Intensity Color Edges Texture

…

Graph Segmentation: How to break the graph

Graph to encode Gestalt: Getting the big picture of scene

SLIDE 38


Image I Graph Affinities W=W(I,Θ) Eigenvector X(W)

Spectral Graph Segmentation

Discretisation

SLIDE 39

Image I Graph Affinities W=W(I,Θ)

Intensity Color Edges Texture

…

Graph to encode Gestalt: Getting the big picture of scene

Graph Segmentation: How to break the graph

SLIDE 40

Graph Terminology

adjacency matrix, degree, volume, graph cuts

SLIDE 41

Graph Terminology

Similarity matrix S = [ Sij ] is a generalized adjacency matrix

Sij

i j

SLIDE 42

Graph Terminology

Degree of node i: $d_i = \sum_j S_{ij}$

SLIDE 43

Graph Terminology

Volume of set A: $\mathrm{vol}(A) = \sum_{i \in A} d_i$

SLIDE 44

Cuts in a graph

SLIDE 45

Graph Terminology

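Similarity matrix S = [ Sij ]

Degree of node: $d_i = \sum_j S_{ij}$
Volume of set: $\mathrm{vol}(A) = \sum_{i \in A} d_i$
Graph cuts: $\mathrm{cut}(A, \bar{A}) = \sum_{i \in A,\, j \in \bar{A}} S_{ij}$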
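
A small worked example of the three quantities, using the standard definitions (degree as a row sum of S, volume as a sum of degrees, cut as the total weight between a set and its complement). The 4-node matrix and the helper names are made up for illustration.

```python
import numpy as np

# Hypothetical symmetric similarity matrix on 4 nodes.
S = np.array([[0, 3, 1, 0],
              [3, 0, 0, 1],
              [1, 0, 0, 2],
              [0, 1, 2, 0]], dtype=float)

def degree(S):
    """d_i = sum_j S_ij."""
    return S.sum(axis=1)

def volume(S, A):
    """vol(A) = sum of the degrees of the nodes in A."""
    return degree(S)[list(A)].sum()

def cut(S, A):
    """Total weight of the edges between A and its complement."""
    A = list(A)
    rest = [i for i in range(S.shape[0]) if i not in A]
    return S[np.ix_(A, rest)].sum()

A = [0, 1]
d = degree(S)          # [4, 4, 3, 3]
vol_A = volume(S, A)   # 4 + 4 = 8
cut_A = cut(S, A)      # S[0, 2] + S[1, 3] = 2
```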

SLIDE 46

Useful Graph Algorithms

Minimal Spanning Tree
Shortest path
s-t Max. graph flow, Min. cut

SLIDE 47

Graph Cut and Flow

Sink Source

1) Given a source (s) and a sink node (t)
2) Define Capacity on each edge, C_ij = W_ij
3) Find the maximum flow from s->t, satisfying the capacity constraints

Min. Cut = Max. Flow
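
As an illustration of the three steps and of Min. Cut = Max. Flow, the sketch below uses the Edmonds-Karp algorithm (breadth-first augmenting paths). The slides do not name a specific max-flow algorithm, so this choice, the function name `max_flow_min_cut` and the toy weights are assumptions.

```python
from collections import deque

def max_flow_min_cut(capacity, s, t):
    """Edmonds-Karp on a dense capacity matrix.

    Returns (max flow value, set of nodes on the source side of a min cut).
    """
    n = len(capacity)
    residual = [row[:] for row in capacity]
    flow = 0
    while True:
        # Breadth-first search for a shortest augmenting path.
        parent = [-1] * n
        parent[s] = s
        queue = deque([s])
        while queue and parent[t] == -1:
            u = queue.popleft()
            for v in range(n):
                if parent[v] == -1 and residual[u][v] > 0:
                    parent[v] = u
                    queue.append(v)
        if parent[t] == -1:
            break                      # no augmenting path left
        # Find the bottleneck capacity along the path, then push flow.
        bottleneck = float("inf")
        v = t
        while v != s:
            bottleneck = min(bottleneck, residual[parent[v]][v])
            v = parent[v]
        v = t
        while v != s:
            u = parent[v]
            residual[u][v] -= bottleneck
            residual[v][u] += bottleneck
            v = u
        flow += bottleneck
    # Nodes still reachable from s in the residual graph form the source side.
    source_side = {v for v in range(n) if parent[v] != -1}
    return flow, source_side

# Capacities C_ij = W_ij for a small undirected graph; node 0 is s, node 3 is t.
W = [[0, 3, 3, 0],
     [3, 0, 1, 1],
     [3, 1, 0, 1],
     [0, 1, 1, 0]]
flow, source_side = max_flow_min_cut(W, s=0, t=3)
cut_weight = sum(W[u][v] for u in source_side
                 for v in range(len(W)) if v not in source_side)
```

Here the maximum flow is 2, and the minimum cut separates node 3 from the rest with total weight 2, matching the flow value.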

SLIDE 48

(Boykov)

n-links s t a cut

hard constraint hard constraint

Minimum cost cut can be computed in polynomial time

(max-flow/min-cut algorithms)

SLIDE 49

Problem with min cuts

Min. cut favors isolated clusters

SLIDE 50

Normalized cuts in a graph

(edge) Ncut = balanced cut

SLIDE 51

Normalized Cut and Normalized Association

Minimizing similarity between the groups and maximizing similarity within the groups can be achieved simultaneously.

$\mathrm{Ncut}(A,B) = \frac{\mathrm{cut}(A,B)}{\mathrm{Vol}(A)} + \frac{\mathrm{cut}(A,B)}{\mathrm{Vol}(B)}$

$\mathrm{Nassoc}(A,B) = \frac{\mathrm{assoc}(A,A)}{\mathrm{Vol}(A)} + \frac{\mathrm{assoc}(B,B)}{\mathrm{Vol}(B)}$

$\mathrm{Ncut}(A,B) = 2 - \mathrm{Nassoc}(A,B)$
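
The identity Ncut = 2 - Nassoc follows from Vol(A) = assoc(A,A) + cut(A,B), and it can be checked numerically. The toy matrix and the function name `ncut_and_nassoc` are assumptions for this sketch.

```python
import numpy as np

def ncut_and_nassoc(W, A):
    """Return (Ncut, Nassoc) for the partition (A, complement of A)."""
    n = W.shape[0]
    A = np.asarray(A)
    B = np.setdiff1d(np.arange(n), A)
    d = W.sum(axis=1)
    vol_A, vol_B = d[A].sum(), d[B].sum()
    cut_AB = W[np.ix_(A, B)].sum()
    assoc_AA = W[np.ix_(A, A)].sum()
    assoc_BB = W[np.ix_(B, B)].sum()
    ncut = cut_AB / vol_A + cut_AB / vol_B
    nassoc = assoc_AA / vol_A + assoc_BB / vol_B
    return ncut, nassoc

# Hypothetical symmetric weight matrix on 4 nodes.
W = np.array([[0, 3, 1, 0],
              [3, 0, 0, 1],
              [1, 0, 0, 2],
              [0, 1, 2, 0]], dtype=float)

ncut, nassoc = ncut_and_nassoc(W, [0, 1])
# ncut = 2/8 + 2/6, nassoc = 6/8 + 4/6, and the two sum to 2.
```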

SLIDE 52

Image I Graph Affinities W=W(I,Θ) Eigenvector X(W)

Spectral Graph Segmentation

SLIDE 53

Image I Graph Affinities W=W(I,Θ) Eigenvector X(W)

Spectral Graph Segmentation

Discretisation

SLIDE 54

Representation

Partition matrix X (rows: segments, columns: pixels)

SLIDE 55

Representation

Partition matrix X (rows: segments, columns: pixels)
Pair-wise similarity matrix W
Degree matrix D: $D = \mathrm{diag}(d_1, \dots, d_n)$ with $d_i = \sum_j W_{ij}$

SLIDE 56

Representation

Partition matrix X (rows: segments, columns: pixels)
Pair-wise similarity matrix W
Degree matrix D: $D = \mathrm{diag}(d_1, \dots, d_n)$ with $d_i = \sum_j W_{ij}$
Laplacian matrix D-W

SLIDE 57

Graph weight matrix W

SLIDE 58

Laplacian matrix D-W

Let x = X(1,:) be the indicator of group 1, written as a column vector

$\mathrm{asso}(A, A) = x^T W x$

SLIDE 59

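Laplacian matrix D-W

With x the indicator of group 1: $\mathrm{Cut}(A, V-A) = \mathrm{vol}(A) - \mathrm{asso}(A,A) = x^T D x - x^T W x = x^T (D - W) x$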
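
A quick numerical check that the Laplacian quadratic form of an indicator vector gives the cut, since x^T D x is vol(A) and x^T W x is asso(A,A). The 4-node matrix is a made-up example.

```python
import numpy as np

# Hypothetical symmetric weight matrix on 4 nodes.
W = np.array([[0, 3, 1, 0],
              [3, 0, 0, 1],
              [1, 0, 0, 2],
              [0, 1, 2, 0]], dtype=float)
D = np.diag(W.sum(axis=1))

# Indicator vector of group A = {0, 1}.
x = np.array([1.0, 1.0, 0.0, 0.0])

asso_AA = x @ W @ x          # weight inside A, each edge counted twice: 6
vol_A = x @ D @ x            # sum of degrees in A: 8
cut_A = x @ (D - W) @ x      # vol(A) - asso(A, A): 2
```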

SLIDE 60

SLIDE 61

Minimizing Ncut is NP-hard

SLIDE 62

Step I: Find Continuous Global Optima

Scaled partition matrix.

SLIDE 63

Step I: Find Continuous Global Optima

becomes Ncut

SLIDE 64

becomes

We use the generalization of the Rayleigh-Ritz theorem to solve it.
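
The relaxed problem can be handed to a standard symmetric eigensolver. The sketch below assumes the generalized eigenproblem (D-W)y = µDy that appears later in these slides and solves it through the symmetrically normalized Laplacian. Thresholding the second eigenvector at zero is only one simple discretization choice, assumed here for illustration; the function name `ncut_bipartition` and the toy graph are also hypothetical.

```python
import numpy as np

def ncut_bipartition(W):
    """Two-way split from the second generalized eigenvector of (D - W, D)."""
    d = W.sum(axis=1)
    d_inv_sqrt = 1.0 / np.sqrt(d)
    # D^{-1/2} (D - W) D^{-1/2} has the same eigenvalues as (D - W) y = mu D y.
    L_sym = np.eye(len(d)) - d_inv_sqrt[:, None] * W * d_inv_sqrt[None, :]
    eigvals, eigvecs = np.linalg.eigh(L_sym)   # eigenvalues in ascending order
    mu2 = eigvals[1]
    y2 = d_inv_sqrt * eigvecs[:, 1]            # map back: y = D^{-1/2} z
    labels = y2 > 0                            # assumed discretization: sign of y2
    return mu2, y2, labels

# Two tight clusters {0, 1, 2} and {3, 4, 5} joined by one weak edge.
W = np.zeros((6, 6))
W[:3, :3] = 1.0
W[3:, 3:] = 1.0
W[2, 3] = W[3, 2] = 0.05
np.fill_diagonal(W, 0.0)

mu2, y2, labels = ncut_bipartition(W)
```

On this toy graph the sign of the second eigenvector separates the two clusters; which cluster gets `True` depends on the arbitrary sign returned by the solver.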

Rayleigh and… Ritz

SLIDE 65

becomes Eigensolutions

Rayleigh and Ritz say:

SLIDE 66

becomes Eigensolutions

Rayleigh and Ritz say:

Figure labels: y2, i, A

SLIDE 67

Interpretation as a Dynamical System

SLIDE 68

Step II: Discretize Continuous Optima

If Z* is optimal, so is Z*R for any rotation R

Partition Scaled Partition Eigenvector solution

SLIDE 69

Step II: Discretize Continuous Optima

Figure labels: Z1, Z2, Target partition, Eigenvector solution, Rotation R

Rotation R can be found exactly in the 2-way partition case

SLIDE 70

Image I Graph Affinities W=W(I,Θ) Eigenvector X(W)

Spectral Graph Segmentation

Discretisation

SLIDE 71

[Cour,Benezit,Shi, CVPR05]

Multiscale NCut Segmentation

SLIDE 72

SLIDE 73

SLIDE 74

SLIDE 75


SLIDE 76

Visual Popout [Yu, Shi 2001]:

SLIDE 77

Visual Popout: Graph with negative weights

positive links only

SLIDE 78


Graph Partitioning Normalized Cuts Random Walk Linear System Eigenvectors of Graph Weight Matrix

Graph Embedding

SLIDE 79

The random walks view

Construct the matrix $P = D^{-1} S$, where $D = \mathrm{diag}(d_1, d_2, \dots, d_n)$ and $S = [S_{ij}]$ is the $n \times n$ similarity matrix

P is a stochastic matrix: $\sum_j P_{ij} = 1$
P is the transition matrix of a Markov chain with state space I

$\pi = \frac{1}{\mathrm{vol}\, I} [\, d_1 \; d_2 \; \dots \; d_n \,]^T$ is the stationary distribution
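
The random-walk construction can be checked on a small example: each row of P sums to one, and the degree vector normalized by the total volume is left unchanged by P. The 4-node similarity matrix is a made-up example.

```python
import numpy as np

# Hypothetical symmetric similarity matrix on 4 nodes.
S = np.array([[0, 3, 1, 0],
              [3, 0, 0, 1],
              [1, 0, 0, 2],
              [0, 1, 2, 0]], dtype=float)

d = S.sum(axis=1)            # node degrees
P = S / d[:, None]           # P = D^{-1} S, row-normalized similarities
pi = d / d.sum()             # degrees divided by vol(I)

row_sums = P.sum(axis=1)     # each row sums to 1 (stochastic matrix)
pi_next = pi @ P             # one step of the chain started from pi
```

Because S is symmetric, `pi @ P` reproduces `pi`, which is what makes π the stationary distribution.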

SLIDE 80

Reinterpreting the NCut criterion

$\mathrm{NCut}(A, \bar{A}) = P_{A\bar{A}} + P_{\bar{A}A}$, where $P_{AB} = \Pr[A \to B \mid A]$ under P, π

NCut looks for sets that “trap” the random walk
Related to the Cheeger constant and conductance in Markov chains
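
To see why the two views agree: under the stationary distribution, the probability of stepping from A to B given that the walk is in A equals cut(A,B)/vol(A). A minimal numerical sketch, with a made-up similarity matrix and a hypothetical helper `escape_probability`:

```python
import numpy as np

# Hypothetical symmetric similarity matrix on 4 nodes.
S = np.array([[0, 3, 1, 0],
              [3, 0, 0, 1],
              [1, 0, 0, 2],
              [0, 1, 2, 0]], dtype=float)
d = S.sum(axis=1)
P = S / d[:, None]           # transition matrix D^{-1} S
pi = d / d.sum()             # stationary distribution

def escape_probability(P, pi, A, B):
    """Pr[next state in B | current state in A], current state drawn from pi."""
    A, B = list(A), list(B)
    joint = (pi[A][:, None] * P[np.ix_(A, B)]).sum()
    return joint / pi[A].sum()

A, B = [0, 1], [2, 3]
p_ab = escape_probability(P, pi, A, B)   # cut / vol(A) = 2 / 8
p_ba = escape_probability(P, pi, B, A)   # cut / vol(B) = 2 / 6
ncut_random_walk = p_ab + p_ba
```

The sum of the two escape probabilities matches cut/vol(A) + cut/vol(B), so a small NCut means the walk rarely leaves either set.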

SLIDE 81

Reinterpreting the NCut algorithm

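$(D - W) y = \mu D y$, with eigenvalues $\mu_1 = 0 \le \mu_2 \le \dots \le \mu_n$ and eigenvectors $y_1, y_2, \dots, y_n$

$P x = \lambda x$, with eigenvalues $\lambda_1 = 1 \ge \lambda_2 \ge \dots \ge \lambda_n$ and eigenvectors $x_1, x_2, \dots, x_n$

$\mu_k = 1 - \lambda_k$ and $y_k = x_k$

The NCut algorithm segments based on the eigenvector of P with the second largest eigenvalue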
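
The correspondence between the two eigenproblems follows from (D-W)x = D(I-P)x = (1-λ)Dx whenever Px = λx. The sketch below checks this numerically, taking the same matrix for W and S (an assumption; the slides use both symbols) on a made-up 4-node graph.

```python
import numpy as np

# Hypothetical symmetric weight matrix on 4 nodes.
W = np.array([[0, 3, 1, 0],
              [3, 0, 0, 1],
              [1, 0, 0, 2],
              [0, 1, 2, 0]], dtype=float)
d = W.sum(axis=1)
D = np.diag(d)
P = W / d[:, None]                       # P = D^{-1} W

# Eigenpairs of P, sorted so that lambda_1 = 1 comes first.
lam, X = np.linalg.eig(P)
order = np.argsort(-lam.real)
lam, X = lam.real[order], X.real[:, order]

mu = 1.0 - lam                           # claimed eigenvalues of (D - W) y = mu D y
# Column k of the residual is (D - W) x_k - mu_k D x_k, which should vanish.
residual = (D - W) @ X - (D @ X) * mu
```

The residual is zero up to rounding, so each eigenvector of P is also a generalized eigenvector of (D-W, D), with the eigenvalue order reversed.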

SLIDE 82

Relationship to Graph embedding

Figure labels: Z1, Z2, Target partition, Eigenvector solution, Rotation R

Rotation R can be found exactly in the 2-way partition case

SLIDE 83

Seeing Through Water…

Efros, Shi, Visontai, Esler, NIPS 04

SLIDE 84


Patches observed at one fixed location from the previous slide:

SLIDE 85

Patches observed at one fixed location: Hypothesized embedding of these patches (we assume they form a manifold)

SLIDE 86


Where is Waldo?

Do you use edge cues? Color cues? Texture cues?

That’s not enough, you need shape cues and high-level object priors

SLIDE 87


SLIDE 88

Concurrent recognition-segmentation

Appearance parts detection Part - Part grouping Filter-edge detection Pixel-pixel grouping

Parts-pixel consistency

[Yu, Shi, CVPR’03]

SLIDE 89

[Yu, Shi, CVPR’03]

Concurrent recognition-segmentation

SLIDE 90

This could be the back

A A’

SLIDE 91

A’

B’

A B

A → B

SLIDE 92

A’

B’

C’ A B C

A → B → C

Context enhances contour perception

SLIDE 93

A’

B’ C’ D’

C’

Not good!

A B C D

Nothing?

A → B → C → D

X

SLIDE 94

 

Accidental alignment happens. Context is very useful, but we need to find the ‘right’ context first.

x

SLIDE 95

[Srinivasan&Shi, 07]

SLIDE 96

[Toshev, Shi, Daniilidis, 07]

SLIDE 97

SLIDE 98

Contour Packing for Object Recognition

?

Jianbo Shi

Joint work with Qihui Zhu, Praveen Srinivasan, Liming Wang, Yang Wu

SLIDE 99

Object Recognition via Region Packing

SLIDE 100

Results on ETHZ

Zhu & Shi, ECCV 08

Detection results of our method. Model selection is shown in the top-left corner.

Figure panels: false positives pruned by joint contour selection; failure cases

Used only one hand-drawn model per class

SLIDE 101

Model Shape Learning

Input: Positive training images with bounding boxes

Output: Learned model shapes composed of contours

Learn shapes without initial models
Contour packing for finding common shapes

– Choose all contours in each bounding box as model
– Do contour packing across all other training images
– Contours selected on the model compose common shapes

Contour Packing

“Model contours”

SLIDE 102

Preliminary Results

Top 5 learned models from bounding boxes