Weilong Yang Simon Fraser University
1
Learning Transferable Distance Functions For Human Action - - PowerPoint PPT Presentation
Learning Transferable Distance Functions For Human Action Recognition and Detection Weilong Yang Simon Fraser University 1 Action Recognition and Detection Walking Running Jogging Boxing Waving T X Y 2 Applications Action related
Weilong Yang Simon Fraser University
1
2
Walking Running Jogging Boxing Waving T X Y
3
Action related video search
Sports and Dancing video search
Event Detection
Automatic abnormality detection in surveillance
videos
On KTH & Weizmann action datasets, almost
100% accurancy is achieved. [Jhuang et al. ICCV07,
Fathi & Mori CVPR08 ]
Most of methods rely on a large amout of
training set.
Half-half split or Leave-one-out cross validation
It is unrealistic to collect this many training
samples for some action.
4
Query Action Template Set
Template A Template B Template C Template D Throwing R-dancing M-dancing Kicking Label:
One shot learning of object categories [Fei-Fei et al.
ICCV03]
Visual Object Identification [Ferencz et al. IJCV07]
The ability of a system to recognize and apply knowledge and skills learned in previous tasks to novel tasks .[Pan & Yang, TKDE 2009]
6
7
Query
A B C Hyper- Features Fq
Learning Action Recognition Distance
Templates
Action Detection
Query Template
Frame-to-Frame Distance
8
Motion Descriptor
[Efros et al. ICCV 03]
Query Template
Frame Correspondence Frame-to-Frame Distance
9
Elementary Patch-to-Patch Distance
10
Query
A B C Hyper- Features Fq
Learning Action Recognition Distance
Templates
Action Detection
[Frome et al. NIPS06]
11
[Frome et al. NIPS06]
Triplet
12
Large Training set required
13
14
Hyper- Feature Transferable
15
16
Codebook representation
Descriptor for each patch
○ HOG + Positions
Obtaining codebook with the size of
○ K-means clustering
Hyper-feature for each patch
○ A dimensional vector
17
18
Motion Cue Shape Cue & Positions
Patch Matching
Hyper- Feature
Patch Weighting
19
Query
A B C Hyper- Features Fq
Learning Action Recognition Distance
Templates
Action Detection
20
Query Template A Template B Template C
Hyper- Features Fq
21
Query
A B C Hyper- Features Fq
Learning Action Recognition Distance
Templates
Action Detection
Train the transferable distance function on
Weizmann, and test on KTH.
template set
transfer
22
skip jack jump side bend wave1 pjump
Source Training Set
Weizmann walk run jog clap wave2
Testing set
KTH
23
Codeword Ranking Learnt Weights on Testing Actions
For each round, we randomly select one
5% improvement
24
Dc : Direct Comparison (W = 1) Tr : Transferable Distance Function
Direct Comparison Avg: 70.9% Transfer Avg: 76.7%
25
Clpping vs. Waving Jogging vs. Running
With the learnt distance function, we can sort the
patches on each frame by their saliency.
Instead of using all patches, we can choose the
top N patches with high weights for matching.
10 Patches on Each Frame
26
27
Query
A B C Hyper- Features Fq
Learning Action Recognition Distance
Templates
Action Detection
28
29
Reject Reject Reject Cascade Stage 1 Cascade Stage 2 Cascade Stage N
30
All Sub- Windows
Decision
Reject Reject Reject Hyper- Features Fq
31
Transferable distance function Learning
Hyper-features based on appearance and
positions
Max-margin Learning framework
Action recognition from one clip
Template Matching based on motion
Efficient action detection from one clip
Cascade structure
32
33