spatially accurate and temporally dense
play

Spatially Accurate and Temporally Dense Extraction of Primary Object - PowerPoint PPT Presentation

Video Object Segmentation through Spatially Accurate and Temporally Dense Extraction of Primary Object Regions Dong Zhang 1 , Omar Javed 2 , Mubarak Shah 1 presented by Sehyun Joo based on Dongs slide Review : Semantic Based Action


  1. Video Object Segmentation through Spatially Accurate and Temporally Dense Extraction of Primary Object Regions Dong Zhang 1 , Omar Javed 2 , Mubarak Shah 1 presented by Sehyun Joo based on Dong’s slide

  2. Review : Semantic Based Action Retrievals

  3. Review : NETVLAD

  4. Video Object Segmentation through Spatially Accurate and Temporally Dense Extraction of Primary Object Regions Dong Zhang 1 , Omar Javed 2 , Mubarak Shah 1 presented by Sehyun Joo based on Dong’s slide

  5. Motivation • Interesting, but ‘easy -to- understand’ approach to solve the whole problem • Might have insight of solving Image Search problems also.

  6. Video Object Segmentation: Outline • Problem Statement • Related Work • Problem Analysis • Solution – Method Framework – Object Proposal Expansion – Layered DAG based Primary Object Selection – GMMs and MRF Optimization • Results

  7. Problem Statement Segmentation of the primary moving object in videos

  8. Video Object Segmentation • D. Tsai, M. Flagg and J. Rehg , “Motion Coherent Tracking with Multi - label MRF optimization”, BMVC, 2010: – SegTrack dataset – Assume manual annotations • Y . Lee, J. Kim and K. Grauman, ” Key-segments for video object segmentation ”, ICCV, 2011: – Object proposals – Extract “key - segments” • T. Ma, M. Yi and L. Latecki , “Maximum weight cliques with mutex constraints for video object segmentation ”, CVPR, 2012: – Object proposals – NP-hard problem, approximate optimization, computation cost

  9. Object Proposal • Object proposal methods [1,2] output regions as potential object regions … … [1] Ian Endres and Derek Hoiem, “ Category Independent Object Proposals”, ECCV, 2010 [2] Alexe, B., Deselares, T. and Ferrari, V., “What is an object ?”, CVPR, 2010

  10. Segtrack (monkeydog) Frame Ranked object proposals index # 3 1 2 4 18 1 2 100 3 4 … … 30 Object proposals are not reliable 1 2 3 4 100 … … 40 100 1 2 3 4 … … Sampling a lot of proposals and selecting the right ones is important! 51 1 2 17 3 4 … … 60 1 2 21 3 4 … …

  11. Segtrack (parachute) Frame index Ranked object proposals expansion # 3 1 2 4 33 1 2 96 3 4 … … 38 1 2 3 4 98 … … 40 100 1 2 3 4 … … 43 True object region may break 1 2 100 3 4 … … into multiple object proposals 49 1 2 100 3 4 … …

  12. Challenges • HOW to select correct object regions consistently ? • HOW to formulate the problem for an efficient solution.

  13. Yes! Solutions? (1) Object proposal expansion • HOW to select correct (2) Flow warped shape and color object regions consistently ? similarity measure  connect frames (3) Optical flow gradient  scoring function • HOW to formulate the (1) A novel DAG formulation problem for an efficient (2) Dynamic programming solution solution.

  14. Framework Input Videos Object Proposal Generation & Expansion Layered DAG Optimization for Primary Object Selection GMMs and MRF based Optimization Object Segmentation

  15. Graph Structure for Object Proposal Selection: Unary Edge An object proposal Beginning node Ending node Unary edge Represents object-ness

  16. Unary Edge: Score 𝑻 𝒗𝒐𝒃𝒔𝒛 = 𝑵 𝒔 + 𝑩(𝒔) 𝑵(𝒔) : average Frobenius norm for optical flow gradient 𝒗 𝒚 𝒗 𝒛 𝟑 + 𝒗 𝒛 𝟑 + 𝒘 𝒚 𝟑 + 𝒘 𝒛 𝟑 𝑽 𝒚 = = 𝒗 𝒚 𝒘 𝒚 𝒘 𝒛 𝑮 𝑩 𝒔 : appearance score from [1] [1] Ian Endres and Derek Hoiem, “ Category Independent Object Proposals”, ECCV, 2010

  17. Unary Edge: Motion Score Boundary region Original video frame Object region Optical flow Optical flow gradient OF gradient around boundary

  18. [1] I. Endres and D. Hoiem “ Category Independent Object Proposals ” ECCV, 2010

  19. Graph Structure for Object Proposal Selection: Binary Edges Frame i+1 … … Frame i Frame i+2 … … … … … … Binary edge … … … … … … … … … … … … … … … … … … … … … … … …

  20. Binary Edge Score 𝑻 𝒄𝒋𝒐𝒃𝒔𝒛 𝒔 𝒏 , 𝒔 𝒐 = 𝝁 ∙ 𝑻 𝒑𝒘𝒇𝒔𝒎𝒃𝒒 𝒔 𝒏 , 𝒔 𝒐 ∙ 𝑻 𝒅𝒑𝒎𝒑𝒔 (𝒔 𝒏 , 𝒔 𝒐 ) 𝑻 𝒑𝒘𝒇𝒔𝒎𝒃𝒒 (𝒔 𝒏 , 𝒔 𝒐 ) = 𝒔 𝒏 ∩ 𝒙𝒃𝒔𝒒 𝒏𝒐 (𝒔 𝒐 ) 𝒔 𝒏 ∪ 𝒙𝒃𝒔𝒒 𝒏𝒐 (𝒔 𝒐 ) 𝑻 𝒅𝒑𝒎𝒑𝒔 (𝒔 𝒏 , 𝒔 𝒐 ) = 𝒊𝒋𝒕𝒖(𝒔 𝒏 ) ∙ 𝒊𝒋𝒕𝒖(𝒔 𝒐 ) 𝑼

  21. Graph Structure for Object Proposal Selection Frame i-1 Frame i Frame i+1 …… …… …… …… s t …… …… Layer Layer Layer Layer Layer Layer 2i-3 2i-2 2i-1 2i 2i+1 2i+2 Goal: Find only one object proposal from each frame, such that all of them have high object-ness and high similarity across frames.  Find the highest weighted path in the DAG.  Longest Path Problem of DAG  Dynamic Programming Solution.

  22. SegTrack Dataset Original video Ground truth Selected object proposals Segmentation results Region within the red boundary is the object region

  23. SegTrack Dataset Original video Ground truth Selected object proposals Segmentation results Region within the red boundary is the object region

  24. SegTrack Dataset Original video Ground truth Segmentation results Region within the red boundary is the object region

  25. SegTrack Dataset Original video Ground truth Segmentation results Region within the red boundary is the object region

  26. SegTrack: Quantitative Results* Ours [14] [13] [20] [6] Supervised? N N N Y Y Birdfall 155 189 288 252 454 Cheetah 633 806 905 1142 1217 Girl 1488 1698 1785 1304 1755 Monkeydog 365 472 521 533 683 Parachute 220 221 201 235 502 Avg. 452 542 592 594 791 * Average per-frame pixel error rate. The smaller, the better.

  27. GaTech Dataset Original video Segmentation results Region within the red boundary is the object region

  28. GaTech Dataset Original video Segmentation results Region within the red boundary is the object region

  29. Thank you!

  30. Quiz 1. What kind of graph this paper is trying to build? (a) Directed Acyclic Graph (b) Directed Cyclic Graph (c) Undirected Acyclic Graph (d) Undirected Cyclic Graph 2. Unary edge is weight of how much object-ness. What is criteria of weight of binary edge? (a) how much non-moving (b) how much similar between object across frames (c) how much the difference of color histogram (d) how much the object in frame has precise shape

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend