2/9/2017 1
Interactive Foreground Segmentation in Images and Videos
Suyog Jain
1
Foreground Segmentation
Generate pixel level foreground masks for
- bjects in a given image or video
2
Interactive Foreground Segmentation in Images and Videos Suyog - - PDF document
2/9/2017 Interactive Foreground Segmentation in Images and Videos Suyog Jain 1 Foreground Segmentation Generate pixel level foreground masks for objects in a given image or video 2 1 2/9/2017 Why is Foreground Segmentation useful?
2/9/2017 1
1
2
2/9/2017 2
3
Results from AlchemyAPI search
4
Results from AlchemyAPI search
2/9/2017 3
5
Training Images Recognition System
6
2/9/2017 4
7
8
2/9/2017 5
9
10
[ Boykov 2001, Zabih 2001, Gulshan 2010, Kohli 2008]
2/9/2017 6
11
Image
p q
[ Boykov 2001, Zabih 2001, Gulshan 2010, Kohli 2008]
Unary Term Pairwise Term
Background distribution Foreground distribution
Frequency Frequency
Unary Term Pairwise Term
High penalty Low penalty
Foreground Background
Optimal labeling
12
Image
p q
[ Boykov 2001, Zabih 2001, Gulshan 2010, Kohli 2008]
Unary Term Pairwise Term
Combinatorial Optimization Segmentation
2/9/2017 7
Foreground Background
13
[ Boykov 2001, Zabih 2001, Gulshan 2010, Kohli 2008]
14
2/9/2017 8
15
16
2/9/2017 9
[ Boykov 2001, Zabih 2001, Rother 2004, Kohli 2008]
Bounding Box Scribbles Sloppy Contour User Input System Output
[ Boykov 2001, Zabih 2001, Rother 2004, Kohli 2008]
Bounding Box Scribbles Sloppy Contour
2/9/2017 10
19
Pre-generate thousands of segmentations with no human input.
20
Use boundary clicks to quickly “carve” out the accurate ones.
2/9/2017 11
[ Boykov 2001, Zabih 2001, Rother 2004, Kohli 2008]
2/9/2017 12
Static Boundaries Motion Boundaries
Hierarchical segmentation and region grouping
2/9/2017 13
25
1 1 1 1 1 1 1 Votes
26
1 1 1 1 1 1 1 Votes
2/9/2017 14
27
1 2 1 2 1 2 2 Votes
2/9/2017 15
2/9/2017 16
2/9/2017 17
34
2/9/2017 18
36
[Jain ECCV 2014]
2/9/2017 19
37
– Segtrack-v2 [Li et al. 2013] – VSB 100 [Sundber et al. 2011] – iVideoSeg [Nagaraja et al. 2015]
– 3 annotators with a max annotation budget of 10 clicks. – Record number of clicks, time spent and best object mask chosen by the annotator. – Compare with several existing methods which use different amount of human annotation.
38
2/9/2017 20
Ours
40
2/9/2017 21
41
42
2/9/2017 22
2/9/2017 23
46
2/9/2017 24
48
2/9/2017 25
49