Spatial Pattern Templates for Recognition of Objects with Regular - - PowerPoint PPT Presentation

spatial pattern templates for recognition of objects with
SMART_READER_LITE
LIVE PREVIEW

Spatial Pattern Templates for Recognition of Objects with Regular - - PowerPoint PPT Presentation

Spatial Pattern Templates for Recognition of Objects with Regular Structure GCPR 2013 cek and Radim Radim Tyle S ara Center for Machine Perception Czech Technical University in Prague cmp.felk.cvut.cz cek, Radim 1 / 12 Radim


slide-1
SLIDE 1

Spatial Pattern Templates for Recognition of Objects with Regular Structure

GCPR 2013 Radim Tyleˇ cek and Radim ˇ S´ ara

Center for Machine Perception Czech Technical University in Prague

cmp.felk.cvut.cz

1 / 12 Radim Tyleˇ cek, Radim ˇ S´ ara, CMP CTU Prague Spatial Pattern Templates for Regular Structure Recognition

slide-2
SLIDE 2

Introduction

Observation: Some objects in images have regular layout.

2 / 12 Radim Tyleˇ cek, Radim ˇ S´ ara, CMP CTU Prague Spatial Pattern Templates for Regular Structure Recognition

slide-3
SLIDE 3

Introduction

Observation: Some objects in images have regular layout. Regularity: Repetition of elements according to simple rules (symmetry).

2 / 12 Radim Tyleˇ cek, Radim ˇ S´ ara, CMP CTU Prague Spatial Pattern Templates for Regular Structure Recognition

slide-4
SLIDE 4

Introduction

Observation: Some objects in images have regular layout. Regularity: Repetition of elements according to simple rules (symmetry). Task: Incorporate regular contextual cues as a structure prior for recognition.

2 / 12 Radim Tyleˇ cek, Radim ˇ S´ ara, CMP CTU Prague Spatial Pattern Templates for Regular Structure Recognition

slide-5
SLIDE 5

Introduction

Observation: Some objects in images have regular layout. Regularity: Repetition of elements according to simple rules (symmetry). Task: Incorporate regular contextual cues as a structure prior for recognition. Problem: How to specify a language for complex relations between many object instances of many classes?

2 / 12 Radim Tyleˇ cek, Radim ˇ S´ ara, CMP CTU Prague Spatial Pattern Templates for Regular Structure Recognition

slide-6
SLIDE 6

Related Work

Shape Grammars

[Simon 2011]

set of production rules restrictive split layout grammar specification Sparse Graphical Models [Gould 2008] adjacency, associative potentials complex relations not captured Dense Graphical Models [Schmidt 2010] complete, high order cliques learning weights jointly in large graphs intractable

[Simon 2011] Simon, Teboul, Koutsourakis, Paragios: Random exploration of the procedural space. IJCV (2011) [Gould 2008] Gould et al.: Multi-class segmentation with relative location prior. IJCV (2008) [Schmidt 2010] Schmidt, Murphy: Convex structure learning: Beyond pairwise potentials. AISTATS (2010) 3 / 12 Radim Tyleˇ cek, Radim ˇ S´ ara, CMP CTU Prague Spatial Pattern Templates for Regular Structure Recognition

slide-7
SLIDE 7

Our Approach

Annotated Train Set Segmented Train Set Core Relation Pool Spatial Pattern Templates Dense Graphical Model Input Test Image Output Test Labels

Principal Idea: Identify which dense structural relations are important for recognition

Learning Parsing Effective Cliques

4 / 12 Radim Tyleˇ cek, Radim ˇ S´ ara, CMP CTU Prague Spatial Pattern Templates for Regular Structure Recognition

slide-8
SLIDE 8

Problem Formulation

Input X – rectified image data S – unsupervised segmentation Output L – class labels for segments in S X S L Probability Model: Conditional Random Field [CRF]

l1 l2 l3 x1 x2 x3

General CRF find labeling L : S → C that maximizes p(L|X, S) = 1 Z

  • q∈Q(S)

exp

  • j∈Φq

θjϕj(lq, xq, sq)

  • topology given by cliques (factors) in Q(S)

[CRF] Lafferty et al.: CRF: Probabilistic models for segmenting and labeling sequence data. ICML (2001) 5 / 12 Radim Tyleˇ cek, Radim ˇ S´ ara, CMP CTU Prague Spatial Pattern Templates for Regular Structure Recognition

slide-9
SLIDE 9

Spatial Pattern Templates

SPT = representation for learning dense structural relations

  • 1. Specify core attribute relation functions

relations act on attributes of segments tuples

6 / 12 Radim Tyleˇ cek, Radim ˇ S´ ara, CMP CTU Prague Spatial Pattern Templates for Regular Structure Recognition

slide-10
SLIDE 10

Spatial Pattern Templates

SPT = representation for learning dense structural relations

  • 1. Specify core attribute relation functions

relations act on attributes of segments tuples

  • 2. Discretize relation functions

value range is split into discrete intervals

6 / 12 Radim Tyleˇ cek, Radim ˇ S´ ara, CMP CTU Prague Spatial Pattern Templates for Regular Structure Recognition

slide-11
SLIDE 11

Spatial Pattern Templates

SPT = representation for learning dense structural relations

  • 1. Specify core attribute relation functions

relations act on attributes of segments tuples

  • 2. Discretize relation functions

value range is split into discrete intervals

6 / 12 Radim Tyleˇ cek, Radim ˇ S´ ara, CMP CTU Prague Spatial Pattern Templates for Regular Structure Recognition

slide-12
SLIDE 12

Spatial Pattern Templates

SPT = representation for learning dense structural relations

  • 1. Specify core attribute relation functions

relations act on attributes of segments tuples

  • 2. Discretize relation functions

value range is split into discrete intervals

  • 3. Create composite relations

subsets in Cartesian product of relations

A A N N

6 / 12 Radim Tyleˇ cek, Radim ˇ S´ ara, CMP CTU Prague Spatial Pattern Templates for Regular Structure Recognition

slide-13
SLIDE 13

Spatial Pattern Templates

SPT = representation for learning dense structural relations

  • 1. Specify core attribute relation functions

relations act on attributes of segments tuples

  • 2. Discretize relation functions

value range is split into discrete intervals

  • 3. Create composite relations

subsets in Cartesian product of relations

  • 4. Template domain

indicate allowed combinations

A A N N

6 / 12 Radim Tyleˇ cek, Radim ˇ S´ ara, CMP CTU Prague Spatial Pattern Templates for Regular Structure Recognition

slide-14
SLIDE 14

Spatial Pattern Templates

SPT = representation for learning dense structural relations

  • 1. Specify core attribute relation functions

relations act on attributes of segments tuples

  • 2. Discretize relation functions

value range is split into discrete intervals

  • 3. Create composite relations

subsets in Cartesian product of relations

  • 4. Template domain

indicate allowed combinations

  • 5. Learn weights

for each allowed combination

A A N N

0AA

  • 0AN
  • 0NA
  • 6 / 12

Radim Tyleˇ cek, Radim ˇ S´ ara, CMP CTU Prague Spatial Pattern Templates for Regular Structure Recognition

slide-15
SLIDE 15

Aligned Pairs

factor graph

u v u v v u A W N

core relation function d capture pair-wise alignment connect two segments aligned horizontally or vertically relative position relations d1, d2 nesting, overlap possible statistical potential [Tighe11] ϕ2(u, v) = θd1d2 fc(lu, lv, d1, d2)

0WA

  • 0WW
  • 0AW
  • 0WN
  • 0NW
  • 0AA
  • 0AN
  • 0NA
  • 0NN
  • .64
.14 .05 .03 .07 .02 .02 .04 .06 .74 .06 .03 .04 .02 .06 .05 .21 .47 .01 .14 .05 .02 .06 .04 .02 .02 .60 .32 .03 .05 .12 .02 .53 .16 .09 .04 .02 .09 .01 .17 .58 .08 .10 .10 .80 .09 .05 .06 .29 .01 .51 window wall balcony door roof chimney sky shop window wall balcony door roof chimney sky shop

SPT domain label co-occurence

[Tighe 2011] Tighe, Lazebnik: Understanding scenes on many levels. ICCV (2011) 7 / 12 Radim Tyleˇ cek, Radim ˇ S´ ara, CMP CTU Prague Spatial Pattern Templates for Regular Structure Recognition

slide-16
SLIDE 16

Regular Triplets

factor graph capture basis for repetitive structure (rows, columns) connect three equally spaced segments relative position relation similarity in segment size generalized associative potential

[Kohli 2009]

ϕ3(u, v, w) =      θc if lu = lv = lw, θ0 if different, if irregular enforcing same labels in a triplet

[Kohli 2009] Kohli, Ladicky, Torr: Robust higher order potentials for enforcing label consistency. IJCV (2009) 8 / 12 Radim Tyleˇ cek, Radim ˇ S´ ara, CMP CTU Prague Spatial Pattern Templates for Regular Structure Recognition

slide-17
SLIDE 17

Inference

Learning Algorithm piece-wise estimation of parameters θ unary SVM classifier CRF pseudo-likelihood maximization (50 instances)

123456789 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 100 200 300 400 500 600

Aligned Pairs weights θd1d2

9 / 12 Radim Tyleˇ cek, Radim ˇ S´ ara, CMP CTU Prague Spatial Pattern Templates for Regular Structure Recognition

slide-18
SLIDE 18

Inference

Learning Algorithm piece-wise estimation of parameters θ unary SVM classifier CRF pseudo-likelihood maximization (50 instances)

123456789 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 100 200 300 400 500 600

Aligned Pairs weights θd1d2 Decoding Algorithm recognition of labels L max-product tree-reweighted message passing voting scheme

9 / 12 Radim Tyleˇ cek, Radim ˇ S´ ara, CMP CTU Prague Spatial Pattern Templates for Regular Structure Recognition

slide-19
SLIDE 19

Inference

Learning Algorithm piece-wise estimation of parameters θ unary SVM classifier CRF pseudo-likelihood maximization (50 instances)

123456789 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 100 200 300 400 500 600

Aligned Pairs weights θd1d2 Decoding Algorithm recognition of labels L max-product tree-reweighted message passing voting scheme

9 / 12 Radim Tyleˇ cek, Radim ˇ S´ ara, CMP CTU Prague Spatial Pattern Templates for Regular Structure Recognition

slide-20
SLIDE 20

Standard Dataset Evaluation

ECP Monge – 104 images, 8 classes

[3L]: 85.1%, [SG]: 74.7%

image GT NC AP APRT SGT accuracy [%] 59.6 79.0 84.2 88.5

[3L] Martinovic et al.: A three-layered approach to facade parsing. ECCV (2012) [SG] Simon, Teboul, Koutsourakis, Paragios: Random exploration of the procedural space. IJCV (2011) [HCRF] Yang, Foerstner: A hierarchical CRF model for images of man-made scenes. ICCV (2011) 10 / 12 Radim Tyleˇ cek, Radim ˇ S´ ara, CMP CTU Prague Spatial Pattern Templates for Regular Structure Recognition

slide-21
SLIDE 21

Standard Dataset Evaluation

ECP Monge – 104 images, 8 classes

[3L]: 85.1%, [SG]: 74.7%

image GT NC AP APRT SGT accuracy [%] 59.6 79.0 84.2 88.5 eTrims – 60 images, 8 classes

[3L]: 81.9%, [HCRF]: 65.8%

image GT NC AP APRT SGT accuracy [%] 56.7 77.4 82.1 93.7

[3L] Martinovic et al.: A three-layered approach to facade parsing. ECCV (2012) [SG] Simon, Teboul, Koutsourakis, Paragios: Random exploration of the procedural space. IJCV (2011) [HCRF] Yang, Foerstner: A hierarchical CRF model for images of man-made scenes. ICCV (2011) 10 / 12 Radim Tyleˇ cek, Radim ˇ S´ ara, CMP CTU Prague Spatial Pattern Templates for Regular Structure Recognition

slide-22
SLIDE 22

CMP Facade Database

New public dataset, 400 images, rectified, annotated Website: cmp.felk.cvut.cz/~tylecr1/facade/ image GT NC AP APRT SGT acc [%] 33.2 54.3 60.3 84.8 12 classes: facade molding cornice pillar window door sill blind balcony shop deco

11 / 12 Radim Tyleˇ cek, Radim ˇ S´ ara, CMP CTU Prague Spatial Pattern Templates for Regular Structure Recognition

slide-23
SLIDE 23

Conclusion

General Spatial Pattern Template – new representation for learning dense structural relations

12 / 12 Radim Tyleˇ cek, Radim ˇ S´ ara, CMP CTU Prague Spatial Pattern Templates for Regular Structure Recognition

slide-24
SLIDE 24

Conclusion

General Spatial Pattern Template – new representation for learning dense structural relations Templates for Regular Scenes Aligned Pairs – capture pairwise alignment and co-occurrence Regular Triplets – capture regular spacing in triplets

12 / 12 Radim Tyleˇ cek, Radim ˇ S´ ara, CMP CTU Prague Spatial Pattern Templates for Regular Structure Recognition

slide-25
SLIDE 25

Conclusion

General Spatial Pattern Template – new representation for learning dense structural relations Templates for Regular Scenes Aligned Pairs – capture pairwise alignment and co-occurrence Regular Triplets – capture regular spacing in triplets Application to Facade Parsing Standard Datasets – performance comparable to SOA CMP Facade Database – new 12-class challenge

Center for Machine Perception Czech Technical University in Prague cmp.felk.cvut.cz/~tylecr1/facade

12 / 12 Radim Tyleˇ cek, Radim ˇ S´ ara, CMP CTU Prague Spatial Pattern Templates for Regular Structure Recognition