Spatially Accurate and Temporally Dense Extraction of Primary Object - - PowerPoint PPT Presentation

spatially accurate and temporally dense
SMART_READER_LITE
LIVE PREVIEW

Spatially Accurate and Temporally Dense Extraction of Primary Object - - PowerPoint PPT Presentation

Video Object Segmentation through Spatially Accurate and Temporally Dense Extraction of Primary Object Regions Dong Zhang 1 , Omar Javed 2 , Mubarak Shah 1 presented by Sehyun Joo based on Dongs slide Review : Semantic Based Action


slide-1
SLIDE 1

Video Object Segmentation through Spatially Accurate and Temporally Dense Extraction of Primary Object Regions

Dong Zhang1, Omar Javed2, Mubarak Shah1 presented by Sehyun Joo based on Dong’s slide

slide-2
SLIDE 2

Review : Semantic Based Action Retrievals

slide-3
SLIDE 3

Review : NETVLAD

slide-4
SLIDE 4

Video Object Segmentation through Spatially Accurate and Temporally Dense Extraction of Primary Object Regions

Dong Zhang1, Omar Javed2, Mubarak Shah1 presented by Sehyun Joo based on Dong’s slide

slide-5
SLIDE 5

Motivation

  • Interesting, but ‘easy-to-understand’ approach

to solve the whole problem

  • Might have insight of solving Image Search

problems also.

slide-6
SLIDE 6

Video Object Segmentation: Outline

  • Problem Statement
  • Related Work
  • Problem Analysis
  • Solution

– Method Framework – Object Proposal Expansion – Layered DAG based Primary Object Selection – GMMs and MRF Optimization

  • Results
slide-7
SLIDE 7

Segmentation of the primary moving object in videos

Problem Statement

slide-8
SLIDE 8

Video Object Segmentation

  • D. Tsai, M. Flagg and J. Rehg, “Motion Coherent Tracking with Multi-

label MRF optimization”, BMVC, 2010:

– SegTrack dataset – Assume manual annotations

  • Y. Lee, J. Kim and K. Grauman, ”Key-segments for video object

segmentation”, ICCV, 2011:

– Object proposals – Extract “key-segments”

  • T. Ma, M. Yi and L. Latecki, “Maximum weight cliques with mutex

constraints for video object segmentation”, CVPR, 2012:

– Object proposals – NP-hard problem, approximate optimization, computation cost

slide-9
SLIDE 9

Object Proposal

  • Object proposal methods [1,2] output regions

as potential object regions

[1] Ian Endres and Derek Hoiem, “Category Independent Object Proposals”, ECCV, 2010 [2] Alexe, B., Deselares, T. and Ferrari, V., “What is an object?”, CVPR, 2010

… …

slide-10
SLIDE 10

Frame index 1 2 3 4 # Segtrack (monkeydog)

… …

100 1 2 3 4 30 40 17 21

… … … … … …

100 1 2 3 4 1 2 3 4 1 2 3 4 51 60

… …

100 1 2 3 4 18 Ranked object proposals

Object proposals are not reliable Sampling a lot of proposals and selecting the right ones is important!

slide-11
SLIDE 11

100 100

… … … … … … … … … …

Frame index 1 2 3 4 # 96 98 100 1 2 3 4 1 2 3 4 1 2 3 4 1 2 3 4 1 2 3 4 Segtrack (parachute) 33 38 40 43 49 Ranked object proposals expansion

True object region may break into multiple object proposals

slide-12
SLIDE 12

Challenges

  • HOW to select correct
  • bject regions consistently?
  • HOW to formulate the

problem for an efficient solution.

slide-13
SLIDE 13

Solutions?

  • HOW to select correct
  • bject regions consistently?
  • HOW to formulate the

problem for an efficient solution.

(1) Object proposal expansion (1) A novel DAG formulation (2) Dynamic programming solution

Yes!

(2) Flow warped shape and color similarity measure  connect frames (3) Optical flow gradient  scoring function

slide-14
SLIDE 14

Layered DAG Optimization for Primary Object Selection GMMs and MRF based Optimization

Input Videos Object Segmentation

Object Proposal Generation & Expansion

Framework

slide-15
SLIDE 15

Beginning node Ending node Unary edge

Represents object-ness

An object proposal

Graph Structure for Object Proposal Selection: Unary Edge

slide-16
SLIDE 16

𝑻𝒗𝒐𝒃𝒔𝒛 = 𝑵 𝒔 + 𝑩(𝒔)

𝑩 𝒔 : appearance score from [1] 𝑵(𝒔) : average Frobenius norm for optical flow gradient

𝑽𝒚 = 𝒗𝒚 𝒗𝒛 𝒘𝒚 𝒘𝒛

𝑮

= 𝒗𝒚

𝟑 + 𝒗𝒛 𝟑 + 𝒘𝒚 𝟑 + 𝒘𝒛 𝟑

Unary Edge: Score

[1] Ian Endres and Derek Hoiem, “Category Independent Object Proposals”, ECCV, 2010

slide-17
SLIDE 17

Original video frame Optical flow Object region Optical flow gradient Boundary region OF gradient around boundary

Unary Edge: Motion Score

slide-18
SLIDE 18

[1] I. Endres and D. Hoiem “Category Independent Object Proposals” ECCV, 2010

slide-19
SLIDE 19

Binary edge

Frame i Frame i+1

… … … … … … … …

Frame i+2

… … … …

… … … … … … … … … … … … … … … … … … … …

Graph Structure for Object Proposal Selection: Binary Edges

slide-20
SLIDE 20

𝑻𝒄𝒋𝒐𝒃𝒔𝒛 𝒔𝒏, 𝒔𝒐 = 𝝁 ∙ 𝑻𝒑𝒘𝒇𝒔𝒎𝒃𝒒 𝒔𝒏, 𝒔𝒐 ∙ 𝑻𝒅𝒑𝒎𝒑𝒔 (𝒔𝒏, 𝒔𝒐) 𝑻𝒅𝒑𝒎𝒑𝒔(𝒔𝒏, 𝒔𝒐) = 𝒊𝒋𝒕𝒖(𝒔𝒏) ∙ 𝒊𝒋𝒕𝒖(𝒔𝒐) 𝑼 𝑻𝒑𝒘𝒇𝒔𝒎𝒃𝒒(𝒔𝒏, 𝒔𝒐) = 𝒔𝒏 ∩ 𝒙𝒃𝒔𝒒𝒏𝒐(𝒔𝒐) 𝒔𝒏 ∪ 𝒙𝒃𝒔𝒒𝒏𝒐(𝒔𝒐)

Binary Edge Score

slide-21
SLIDE 21

…… …… …… ……

Frame i-1 Frame i Frame i+1

…… ……

t s Layer 2i-3 Layer 2i-2 Layer 2i-1 Layer 2i Layer 2i+1 Layer 2i+2

Goal: Find only one object proposal from each frame, such that all of them have high object-ness and high similarity across frames.  Find the highest weighted path in the DAG. Longest Path Problem of DAG Dynamic Programming Solution.

Graph Structure for Object Proposal Selection

slide-22
SLIDE 22

SegTrack Dataset

Original video Ground truth Selected object proposals Segmentation results

Region within the red boundary is the object region

slide-23
SLIDE 23

SegTrack Dataset

Original video Ground truth Selected object proposals Segmentation results

Region within the red boundary is the object region

slide-24
SLIDE 24

Original video Ground truth Segmentation results Region within the red boundary is the object region

SegTrack Dataset

slide-25
SLIDE 25

Original video Ground truth Segmentation results Region within the red boundary is the object region

SegTrack Dataset

slide-26
SLIDE 26

* Average per-frame pixel error rate. The smaller, the better.

SegTrack: Quantitative Results*

Ours [14] [13] [20] [6]

Supervised? N N N Y Y Birdfall 155 189 288 252 454 Cheetah 633 806 905 1142 1217 Girl 1488 1698 1785 1304 1755 Monkeydog 365 472 521 533 683 Parachute 220 221 201 235 502 Avg. 452 542 592 594 791

slide-27
SLIDE 27

Original video Segmentation results Region within the red boundary is the object region

GaTech Dataset

slide-28
SLIDE 28

Original video Segmentation results Region within the red boundary is the object region

GaTech Dataset

slide-29
SLIDE 29

Thank you!

slide-30
SLIDE 30

Quiz

  • 1. What kind of graph this paper is trying to build?

(a) Directed Acyclic Graph (b) Directed Cyclic Graph (c) Undirected Acyclic Graph (d) Undirected Cyclic Graph

  • 2. Unary edge is weight of how much object-ness.

What is criteria of weight of binary edge?

(a) how much non-moving (b) how much similar between object across frames (c) how much the difference of color histogram (d) how much the object in frame has precise shape