SLIDE 1
SLIDE 2 Frame
P e
l e
Baby Cake People Time
SLIDE 3
A very sparse semantical vector for frame: Emphasize primary object Overlook small size regional object
Vector sparser Vocabulary larger
SLIDE 4
SLIDE 5
Only around 40% of Regional Information left Discriminatory power of deep features consistently improves
max pooling
SLIDE 7
SLIDE 8
Selective search Candidate Objects Frame
On average, each frame has 20 candidate object regions.
SLIDE 9 Observations
- Possible reasons:
- Alternative method:
SLIDE 10
SLIDE 11
SLIDE 12
VLAD
SLIDE 13
Selective search Candidate Objects Spatial & temporal features clustering Regional objects Deep features K-means & VLAD
Zhongwen Xu, Yi Yang, Alexander G. Hauptmann (CVPR’15)
SLIDE 14 Deep Feature Map Extraction
Feature Map 7 X 7
Spatial Pyramid Pooling
7 X 7 6 X 6 5 X 5 2 X 2 50 descriptors
VLAD
Max pool filter: 50 descriptors
Feature Spatial Pyramid Pooling filter: Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun (ECCV’14)
SLIDE 15
VLAD
SLIDE 16
SLIDE 17
20 25 30 35 40 45 MED14-Test (mAP%) MED16-EvalSub (MinfAP200%) MED16-EvalFull (MinfAP200%)
PS-10Ex
CNN-VLAD Object-VLAD
20 25 30 35 40 45 MED14-Test (mAP%) MED16-EvalSub (MinfAP200%) MED16-EvalFull (MinfAP200%)
PS-100Ex
CNN-VLAD Object-VLAD
SLIDE 18
SLIDE 19
20 22 24 26 28 30 32 34 36 38 40 MED14-Test (mAP%) MED16-EvalSub (MinfAP200%) MED16-EvalFull (MinfAP200%)
PS-10Ex
Concept-Bank_N2 Object-VLAD Visual-System (Concept-Bank_N2 + Object-VLAD)
SLIDE 20
25 27 29 31 33 35 37 39 41 43 45 VIREO Team2 Team3 Team4
MED16-EvalFull-As- ProgressSubset (MinfAP200%)
25 30 35 40 45 50 Team2 VIREO Team3 Team4
MED16-EvalFull (MinfAP200%)
5 10 15 20 25 30 35 40 45 Team2 VIREO Team3 Team4 Team5 Team6 Team7 Team8 Team9 Team10 Team11
MED16-EvalSub (MinfAP200%)
25 27 29 31 33 35 37 39 41 Team2 VIREO Team3 Team4
MED16-EvalFull (MinfAP200%)
SLIDE 21
5 10 15 20 25 30 35 40 45 50 Team2 Team3 Team4 Team5 VIREO Team6 Team7 Team8 Team9
MED16-EvalSub (MinfAP200%)
5 10 15 20 25 30 35 40 45 Team2 VIREO Team4 Team5
MED16-EvalFull (MinfAP200%)
SLIDE 22