Vi Video Ob eo Object ject Segm Segmen enta tati tion
- n
CV3DST | Prof. Leal-Taixé 1
Vi Video Ob eo Object ject Segm Segmen enta tati tion on - - PowerPoint PPT Presentation
Vi Video Ob eo Object ject Segm Segmen enta tati tion on CV3DST | Prof. Leal-Taix 1 Vi Video deo Objec ject Seg egmen entat ation on Lectures 2-3 Lectures 4-5 Object Detection Object Tracking Lectures 7-8 This lecture
CV3DST | Prof. Leal-Taixé 1
Object Detection Lectures 2-3 Object Tracking Lectures 4-5 Object Segmentation Lectures 7-8 Video Object Segmentation This lecture
CV3DST | Prof. Leal-Taixé 2
pixel masks for objects in a video sequence.
CV3DST | Prof. Leal-Taixé 3
CV3DST | Prof. Leal-Taixé 4
CV3DST | Prof. Leal-Taixé 5
CV3DST | Prof. Leal-Taixé 6
Hard to make assumptions about
Hard to make assumptions about
CV3DST | Prof. Leal-Taixé 7
Semi-supervised (one-shot) video
Unsupervised (zero- shot) video object segmentation
We get the first frame ground truth mask, we know what object to segment We have to find the
masks
CV3DST | Prof. Leal-Taixé 8
Semi-supervised (one-shot) video
Unsupervised (zero- shot) video object segmentation
We get the first frame ground truth mask, we know what object to segment We have to find the
masks
CV3DST | Prof. Leal-Taixé 9
Motion segmentation, salient object detection..
Semi-supervised (one-shot) video
Unsupervised (zero- shot) video object segmentation
We get the first frame ground truth mask, we know what object to segment We have to find the
masks
CV3DST | Prof. Leal-Taixé 10
This lecture
– Given: segmentation mask of target object(s) in the first frame – Goal: pixel-accurate segmentation of the entire video – Currently a major testing ground for segmentation-based tracking
Given: First-frame ground truth Goal: Complete video segmentation
CV3DST | Prof. Leal-Taixé 11
learning-based methods
DAVIS 2016 (30/20, single objects, first frames) DAVIS 2017 (60/90, multiple
YouTube-VOS 2018 (3471/982, multiple
where object appears)
https://davischallenge.org https://youtube-vos.org
CV3DST | Prof. Leal-Taixé 12
is a concept in Computer Vision that we need to know first
CV3DST | Prof. Leal-Taixé 13
14 CV3DST | Prof. Leal-Taixé
image B
motion of the object
15 CV3DST | Prof. Leal-Taixé
16 CV3DST | Prof. Leal-Taixé
17 CV3DST | Prof. Leal-Taixé
18 CV3DST | Prof. Leal-Taixé
19 CV3DST | Prof. Leal-Taixé
channels
20 CV3DST | Prof. Leal-Taixé
21 CV3DST | Prof. Leal-Taixé
22 CV3DST | Prof. Leal-Taixé
How to combine the information from both images?
23 CV3DST | Prof. Leal-Taixé
Fixed operation. No learnable weights!
two feature vectors are
24 CV3DST | Prof. Leal-Taixé
Useful for finding image correspondences
25 CV3DST | Prof. Leal-Taixé
Find a transformation from image A to image B A B
26 CV3DST | Prof. Leal-Taixé
How to combine the information from both images? How to obtain high- quality results?
flow of the object
segmentation and OF iteratively (no DL yet)
29 CV3DST | Prof. Leal-Taixé
Y.H. Tsai et al. “Video Segmentation via Object Flow“. CVPR 2016