Segment-Phrase Table for Semantic Segmentation, Visual Entailment - - PowerPoint PPT Presentation

segment phrase table for semantic segmentation visual
SMART_READER_LITE
LIVE PREVIEW

Segment-Phrase Table for Semantic Segmentation, Visual Entailment - - PowerPoint PPT Presentation

Segment-Phrase Table for Semantic Segmentation, Visual Entailment and Paraphrasing Hamid Izadinia, Fereshteh Sadeghi, Santosh K. Divvala, Hannaneh Hajishirzi, Yejin Choi, Ali Farhadi Presentated by Edward Banner Outline What is a SPT?


slide-1
SLIDE 1

Segment-Phrase Table for Semantic Segmentation, Visual Entailment and Paraphrasing

Hamid Izadinia, Fereshteh Sadeghi, Santosh K. Divvala, Hannaneh Hajishirzi, Yejin Choi, Ali Farhadi

Presentated by Edward Banner

slide-2
SLIDE 2

Outline

  • What is a SPT?
  • Motivation: What does a SPT enable us to do?
  • How to build a SPT?
  • How to make use of a SPT?
  • Evaluation
  • Discussion
slide-3
SLIDE 3

What is a segment-phrase table?

One to many mapping from phrases to segmentation models

slide-4
SLIDE 4

What is a segment-phrase table?

One to many mapping from phrases to segmentation models

Phrases Image credit: Izadinia et al.

slide-5
SLIDE 5

What is a segment-phrase table?

One to many mapping from phrases to segmentation models

Phrases Segments Image credit: Izadinia et al.

slide-6
SLIDE 6

Why build a segment-phrase table?

Many reasons!

slide-7
SLIDE 7

Why build a segment-phrase table?

Entailment If a horse is grazing, is it also standing?

slide-8
SLIDE 8

Why build a segment-phrase table?

Entailment If a horse is grazing, is it also standing?

Image credit: Izadinia et al.

slide-9
SLIDE 9

Why build a segment-phrase table?

Paraphrasing Are “horse jumping” and “horse leaping” paraphrases of each other?

slide-10
SLIDE 10

Why build a segment-phrase table?

Paraphrasing Are “horse jumping” and “horse leaping” paraphrases of each other?

Image credit: Izadinia et al.

slide-11
SLIDE 11

Why build a segment-phrase table?

Relative similarity Is “cat standing up” closer to “bear standing up” or “deer standing up”?

slide-12
SLIDE 12

Why build a segment-phrase table?

Relative similarity Is “cat standing up” closer to “bear standing up” or “deer standing up”?

Image credit: Izadinia et al.

slide-13
SLIDE 13

Why build a segment-phrase table?

Semantic segmentation

Image credit: Izadinia et al.

slide-14
SLIDE 14

Considerations in building segment-phrase table

Human annotators?

slide-15
SLIDE 15

Considerations in building segment-phrase table

Human annotators? Too expensive to obtain human-labeled pixel labels Opt instead for weakly-supervised approach instead

slide-16
SLIDE 16

How do they build it?

Three components: 1. Train a webly-supervised detection model for each phrase 2. Model each phrase as a deformable parts model 3. Learn segmentation model for each part

slide-17
SLIDE 17

How do they build it?

1. Train a webly-supervised detection model for each phrase e.g. running horse

slide-18
SLIDE 18

How do they build it?

  • 2. Model each phrase as a deformable parts model

Concerned about intra-class variation?

slide-19
SLIDE 19

How do they build it?

  • 2. Model each phrase as a deformable parts model

Concerned about intra-class variation?

horse

slide-20
SLIDE 20

How do they build it?

  • 2. Model each phrase as a deformable parts model

Concerned about intra-class variation?

horse running horse

slide-21
SLIDE 21

How do they build it?

  • 2. Model each phrase as a deformable parts model

Concerned about intra-class variation? Key insight: parts of phrases have low intra-class variation

horse running horse

slide-22
SLIDE 22

How do they build it?

  • 3. Learn segmentation model for each part

Model superpixels with GMM and solve with EM and Graphcut Rough initialization with Grabcut and HOG root filter

slide-23
SLIDE 23

How do they build it?

  • 3. Learn segmentation model for each part

Model superpixels with GMM and solve with EM and Graphcut Rough initialization with Grabcut and HOG root filter

“horse running right”

slide-24
SLIDE 24

Segment-phrase table built

Results: For each phrase, we have learned:

  • Bounding box detector
  • Segmentation model for each part

What can we do now?

Phrases Segments Image credit: Izadinia et al.

slide-25
SLIDE 25

Semantic segmentation

Example: “horse”

Image credit: Izadinia et al.

slide-26
SLIDE 26

Semantic segmentation

Example: “horse”

Image credit: Izadinia et al.

slide-27
SLIDE 27

Semantic segmentation

Example: “horse”

Image credit: Izadinia et al.

slide-28
SLIDE 28

Semantic segmentation

Example: “horse”

Image credit: Izadinia et al.

slide-29
SLIDE 29

Semantic segmentation

Example: “horse”

Image credit: Izadinia et al.

slide-30
SLIDE 30

Semantic segmentation using linguistic constraints

Example: “horse”

Image credit: Izadinia et al.

slide-31
SLIDE 31

Semantic segmentation using linguistic constraints

Example: “horse”

standing sitting kicking posing standing sitting kicking posing

Image credit: Izadinia et al.

slide-32
SLIDE 32

Semantic segmentation using linguistic constraints

Example: “horse”

standing sitting kicking posing standing sitting kicking posing

Image credit: Izadinia et al.

slide-33
SLIDE 33

Entailment

Does phrase X entail phrase Y? Intuition: All segments for which phrase X is a valid description, then phrase Y is also a valid description

slide-34
SLIDE 34

Entailment

Does phrase X entail phrase Y? Intuition: All segments for which phrase X is a valid description, then phrase Y is also a valid description

horse standing horse grazing

slide-35
SLIDE 35

Entailment

Does phrase X entail phrase Y? Intuition: All segments for which phrase X is a valid description, then phrase Y is also a valid description

horse standing horse grazing

slide-36
SLIDE 36

Entailment

Does phrase X entail phrase Y? Intuition: All segments for which phrase X is a valid description, then phrase Y is also a valid description

horse standing horse grazing

slide-37
SLIDE 37

Paraphrasing

Are phrase X and phrase Y paraphrases of each other? Strategy: compute X ⊨ Y and Y ⊨ X and say they’re paraphrases if they’re close

Image credit: Izadinia et al.

slide-38
SLIDE 38

Paraphrasing

Are phrase X and phrase Y paraphrases of each other? Strategy: compute X ⊨ Y and Y ⊨ X and say they’re paraphrases if they’re close

Image credit: Izadinia et al.

slide-39
SLIDE 39

Relative Semantic Similarity

Is phrase X closer to phrase Y or phrase Z? Strategy: compute X ⊨ Y and X ⊨ Z and pick highest number of the two

Image credit: Izadinia et al.

slide-40
SLIDE 40

Relative Semantic Similarity

Is phrase X closer to phrase Y or phrase Z? Strategy: compute X ⊨ Y and X ⊨ Z and pick highest number of the two

Image credit: Izadinia et al.

slide-41
SLIDE 41

Evaluation - Takeaways

Semantic segmentation state of the art or near it Highlights tradeoffs between unsupervised approach on large data and supervised approaches on small dataset Linguistic constraints help semantic segmentation SPT approach beats language-only and vision-only baselines on entailment, paraphrasing, and relative similarity

slide-42
SLIDE 42

Discussion

slide-43
SLIDE 43

Discussion

Leverage supervision Variable number of part models per phrase Larger evaluation dataset Comparison against state-of-the-art entailment and paraphrase systems