Shared Segmentation of Natural Scenes using Dependent Pitman-Yor - - PowerPoint PPT Presentation

shared segmentation of natural scenes
SMART_READER_LITE
LIVE PREVIEW

Shared Segmentation of Natural Scenes using Dependent Pitman-Yor - - PowerPoint PPT Presentation

Shared Segmentation of Natural Scenes using Dependent Pitman-Yor Processes Erik Sudderth & Michael Jordan University of California, Berkeley bell dome Parsing Visual Scenes temple sky skyscraper trees buildings sky Are Images Bags


slide-1
SLIDE 1

Shared Segmentation

  • f Natural Scenes

using

Dependent Pitman-Yor Processes

Erik Sudderth & Michael Jordan

University of California, Berkeley

slide-2
SLIDE 2

Parsing Visual Scenes

trees skyscraper sky bell dome temple buildings sky

slide-3
SLIDE 3

Are Images Bags of Features?

Inspired by the successes of topic models for text data, some have proposed learning from local image features

slide-4
SLIDE 4

Are Images Bags of Features?

Inspired by the successes of topic models for text data, some have proposed learning from local image features

  • Ignore spatial structure entirely (bag of “visual words”)

First Approach:

Fei-Fei & Perona 2005, Sivic et. al. 2005

  • Cluster features via one or more bottom-up segmentations

Second Approach:

Russell et. al. 2006, Todorovic & Ahuja 2007

Compute color & texture descriptors for each superpixel

slide-5
SLIDE 5

Segmentation: Mean Shift

EDISON: Comaniciu & Meer, 2002

  • Cluster by modes of appearance features
  • Often sensitive to bandwidth parameter
slide-6
SLIDE 6

Segmentation: Normalized Cuts

Shi & Malik 2000; Fowlkes, Martin, & Malik 2003

  • Implicit bias towards equal-sized regions
  • Is this a good model for real scenes?
slide-7
SLIDE 7

Segmentation: New Approach

  • Automatically infers the number of segments
  • Handles regions of widely varying size and appearance
  • Statistical framework for discovering shared categories

Spatially Dependent Pitman-Yor Processes

slide-8
SLIDE 8

Outline

Spatial Priors for Image Partitions What’s wrong with Potts models? Spatial dependence via Gaussian processes Counts, partitions, and power laws Hierarchical Pitman-Yor processes Natural Scene Statistics Unsupervised Image Analysis Image segmentation Visual category discovery

slide-9
SLIDE 9

Priors on Counts & Partitions

  • How many regions does this image contain?
  • What are the sizes of these regions?

Segmentation as Partitioning

  • How many object categories have I observed?
  • How frequently does each category appear?

Unsupervised Object Category Discovery

slide-10
SLIDE 10

Pitman-Yor Processes

The Pitman-Yor process defines a distribution on infinite discrete measures, or partitions

Dirichlet process:

1

slide-11
SLIDE 11

Why Pitman-Yor?

Jim Pitman Marc Yor

Generalizing the Dirichlet Process Distribution on partitions leads to a generalized Chinese restaurant process Special cases arise as excursion lengths for Markov chains, Brownian motions, … Pow er Law Distributions DP PY

Number of unique clusters in N

  • bservations

Size of sorted cluster weight k

Goldwater, Griffiths, & Johnson, 2005 Teh, 2006

Natural Language Statistics

slide-12
SLIDE 12

Natural Scene Statistics

  • Does Pitman-Yor prior match human segmentation?
  • How do statistics vary across scene categories?

Insidecity Tallbuilding Coast Highway Forest Mountain Street Opencountry

Oliva & Torralba, 2001

slide-13
SLIDE 13

Manual Image Segmentation

Labels for more than 29,000 segments in 2,688 images of natural scenes

slide-14
SLIDE 14

Object Sizes and Counts

insidecity region counts insidecity region areas

Small Objects Large Objects

slide-15
SLIDE 15

Object Name Frequencies

sky trees person rainbow waterfall lichen wheelbarrow

forest scenes insidecity scenes

slide-16
SLIDE 16

Hierarchical Pitman-Yor Model

Hierarchical DP: Teh et. al. 2004

Set of segments or layers

Hierarchical PY N-gram: Teh 2006

Set of global, shared visual categories Set of images Set of features in image j (superpixel color & texture) Pitman-Yor prior: segment sizes Pitman-Yor prior: label frequencies

No supervision aside from Pitman-Yor hyperparameters

slide-17
SLIDE 17

Bag of Features Segmentation

LabelMe Segments:

slide-18
SLIDE 18

Outline

Spatial Priors for Image Partitions What’s wrong with Potts models? Spatial dependence via Gaussian processes Counts, partitions, and power laws Hierarchical Pitman-Yor processes Natural Scene Statistics Unsupervised Image Analysis Image segmentation Visual category discovery

slide-19
SLIDE 19

Discrete Markov Random Fields

Ising and Potts Models

  • Interactive foreground segmentation
  • Supervised training for known categories

Previous Applications

…but very little success at segmentation of unconstrained natural scenes.

GrabCut: Rother, Kolmogorov, & Blake 2004 Verbeek & Triggs, 2007

slide-20
SLIDE 20

10-State Potts Samples

States sorted by size: largest in blue, smallest in red

slide-21
SLIDE 21

number of edges on which states take same value

1996 IEEE DSP Workshop

edge strength

Even within the phase transition region, samples lack the size distribution and spatial coherence of real image segments

natural images giant cluster very noisy

slide-22
SLIDE 22

Geman & Geman, 1984

200 Iterations

128 x128 grid 8 nearest neighbor edges K = 5 states Potts potentials:

10,000 Iterations

slide-23
SLIDE 23

Spatially Dependent Pitman-Yor

  • Cut random surfaces

(samples from a GP) with thresholds

(as in Level Set Methods)

  • Assign each pixel to

the first surface which exceeds threshold

(as in Layered Models)

Duan, Guindani, & Gelfand, Generalized Spatial DP, 2007

slide-24
SLIDE 24

Spatially Dependent Pitman-Yor

Non-Markov Gaussian Processes: PY prior: Segment size Feature Assignments

Normal CDF

slide-25
SLIDE 25

Preservation of PY Marginals

Why Ordered Layer Assignments? Stick Size Prior Random Thresholds

slide-26
SLIDE 26

Samples from Spatial Prior

Comparison: Potts Markov Random Field

slide-27
SLIDE 27

Bag of features: Image distance Intervening countours

Learning & Inference

GP Covariance

UC Berkeley Pb boundary detector

probability that features at locations are in the same segment

Factorized Gaussian posteriors on thresholds & eigenvector expansion of dense covariance Jointly optimize surface & threshold via conjugate gradient Initialize by annealing to reduce local optima

Mean Field Variational Inference

slide-28
SLIDE 28

Outline

Spatial Priors for Image Partitions What’s wrong with Potts models? Spatial dependence via Gaussian processes Counts, partitions, and power laws Hierarchical Pitman-Yor processes Natural Scene Statistics Unsupervised Image Analysis Image segmentation Visual category discovery

slide-29
SLIDE 29

Tallbuilding Segments: PY-Edge

LabelMe Segments:

slide-30
SLIDE 30

Mountain Segments: PY-Edge

LabelMe Segments:

slide-31
SLIDE 31

Mountain Baseline: NCuts

LabelMe Segments:

slide-32
SLIDE 32

Visual Categories: Coast

slide-33
SLIDE 33

Visual Categories: Tallbuilding

slide-34
SLIDE 34

Challenge: Structured Objects

LabelMe Segments:

slide-35
SLIDE 35

Conclusions

Dependent Pitman-Yor Processes allow… efficient variational parsing of scenes into unknown numbers of segments empirically justified power law priors learning of shared appearance models from related images & scenes Future Directions parallelized, scalable learning from extremely large image databases nonparametric models of dependency in other application domains