DVMMlab
digital video | multimedia laboratoryCoping with Video Domain Change
Eric Zavesky, Wei Jiang, Akira Yanagawa, Shih-Fu Chang TRECVID HLF 2007
Analysis of Cross-Domain Learning Methods for High-Level Visual Concept Detection
Columbia HLF: TRECVID2006 TRECVID TRECVID TRECVID 2005 2005 - - PowerPoint PPT Presentation
DVMM lab d igital video | multimedia laboratory Coping with Video Domain Change Analysis of Cross-Domain Learning Methods for High-Level Visual Concept Detection Eric Zavesky, Wei Jiang, Akira Yanagawa, Shih-Fu Chang TRECVID HLF 2007 Columbia
DVMMlab
digital video | multimedia laboratoryCoping with Video Domain Change
Eric Zavesky, Wei Jiang, Akira Yanagawa, Shih-Fu Chang TRECVID HLF 2007
Analysis of Cross-Domain Learning Methods for High-Level Visual Concept Detection
Coping with Video Domain Change; Zavesky 2007
2
Columbia HLF: TRECVID2006
TRECVID 2005 (development) Columbia 374 (baseline models) Concept fusion (explore concept relations) BCRF (20 improved models) TRECVID 2005 (development) Columbia 374 (baseline models) TRECVID 2005 (development) Columbia 374 (baseline models) Concept fusion (explore concept relations) BCRF (20 improved models) TRECVID 2006 (test)Coping with Video Domain Change; Zavesky 2007
3
Columbia HLF: TRECVID2007
Concept fusion (explore concept relations) TRECVID 2005 (development) Columbia 374 (baseline models) TRECVID 2007 (test) BCRF (20 improved models)1
Concept fusion (explore concept relations) TRECVID 2005 (development) Columbia 374 (baseline models) TRECVID 2007 (development) New baseline TRECVID 2007 (test) BCRF (20 improved models)1 2
Concept fusion (explore concept relations) TRECVID 2005 (development) Columbia 374 (baseline models) TRECVID 2007 (development) Cross domain New baseline TRECVID 2007 (test) BCRF (20 improved models) Model adaptation1 2 3
Coping with Video Domain Change; Zavesky 2007
4
cross-domain learning
Coping with Video Domain Change; Zavesky 2007
Definition:
method and content quality
Problem:
domain often do not work well on others Goal:
additional complexity
5
documentary (new domain) TRECVID 2007 news (old domain) TRECVID 2005
Coping with Video Domain Change; Zavesky 2007
Cross-Domain Problem: What is it?
Approach:
Data:
6
Coping with Video Domain Change; Zavesky 2007
Cross-Domain Problem: Common approaches
7
increasing time & complexity
method
new applicable condition
use old model all
similar to new domain training data
Coping with Video Domain Change; Zavesky 2007
Case 1: old model works best
Studio
8
learn new domain, test new domain learn old domain, test old domain learn old domain, test new domain
Coping with Video Domain Change; Zavesky 2007
Cross-Domain Problem: Common approaches
9
increasing time & complexity
train new domain model
new and old domains very dissimilar
method
new applicable condition
use old model all
similar to new domain training data
Coping with Video Domain Change; Zavesky 2007
Case 2: new model works best
Waterscape
10
learn new domain, test new domain learn old domain, test old domain learn old domain, test new domain
Coping with Video Domain Change; Zavesky 2007
Cross-Domain Problem: Common approaches
11
increasing time & complexity
adapt old model small all new and old domains slightly dissimilar
method
new applicable condition
use old model all
similar to new domain train new model
new and old domains very dissimilar training data
Coping with Video Domain Change; Zavesky 2007
Case 3: old model adaptation works best
Charts
12
learn new domain, test new domain learn old domain, test old domain adapt old domain+new domain, test new domain
Coping with Video Domain Change; Zavesky 2007
Cross-Domain Problem: Common approaches
13
increasing time & complexity
method
new applicable condition
use old model all
similar to new domain train new model
new and old domains very dissimilar adapt old model small all new and old domains slightly dissimilar training data train combined new+old model all all
similar; sparse new domain
Coping with Video Domain Change; Zavesky 2007
Case 4: combined model works best
Sports
14
learn new domain, test new domain learn old domain, test new domain learn old domain+new domain, test new domain
Coping with Video Domain Change; Zavesky 2007
Cross-Domain Problem: Common approaches
15
method
new applicable condition
use old model all
similar to new domain train new model
new domain and old domains very dissimilar adapt old model small all new and old domains slightly dissimilar train combined new+old model all all
similar; sparse new domain
increasing time & complexity
training data adapt old model small all new and old domains slightly dissimilar train combined new+old model all all
similar; sparse new domain
Coping with Video Domain Change; Zavesky 2007
Topic Review: Support Vector Machine (SVM)
16
+ + + + + + Old Feature Space + positive
+ + +
+ + +
+ + + + + + Old Feature Space + positive
+ + + x x x x x x x x x x xx x negative
+ + +
decision boundary (hyperplane)
(evidence for decision boundary) +
+
+ + + + + Old Feature Space + positive
+ + + x x x x x x x x x x
x
x negative
margin distance 1 || w ||
x + +
New Feature Space positive new samples negative new samples +
+ + x x
x x + + +
NOTE: new domain can be sparse
x x x x new samples
support vectors classification errors (using old hyperplane) New Feature Space +
+ + x
x x + + + x x x x
+ x x + +
new samples
samples
helpful support vector from old domain new samples
support vectors New Feature Space +
+ + x
x x + + + x x x x
+ x x + +
ignored support vector from old domain adapted hyperplane
Coping with Video Domain Change; Zavesky 2007
domain models
Combined model: Uniform sample importance
17
new
1x new
learn new models new samples
samples
features
Coping with Video Domain Change; Zavesky 2007
new
1x new
across many dimensions
Replication model: Kernel matrix replication
18
learn new models new samples
samples
features
new
2x 1x new 1x 2x
replication processor
Coping with Video Domain Change; Zavesky 2007
weights across many dimensions
Replication model: Kernel matrix replication
19
D*3 features
+1 0.2 0.4 ...
+1 0.9 0.9 ...
new D features M samples
+1 0.4 0.2 0.4 0.2 0.0 0.0 ... +1 0.9 0.9 0.0 0.0 0.9 0.9 ...
N samples M + N samples
general feats
features new features
Coping with Video Domain Change; Zavesky 2007
weighted new samples and a constant offset
Adaptive SVM (A-SVM): Constrained model adaptation
20
models
extract support vectors
learn new models
support vectors new samples
and old support vectors
J. Yang, et al., “Cross-domain video concept detection using adaptive svms”, ACM Multimedia, 2007.f(x) = f old(x) + ∆f(x)
Coping with Video Domain Change; Zavesky 2007
Adaptive SVM (A-SVM): Constrained model adaptation
21
J. Yang, et al., “Cross-domain video concept detection using adaptive svms”, ACM Multimedia, 2007. decision boundary (hyperplane)+
Old Feature Spacex
margin distance 1 || w ||x + +
New Feature Space ++ + x
x x + + + x x x xNew Feature Space
+ + x
x x + + + x x x x
+ x x + +
margin distance 1 || w || constrained adjustment
new samplesCoping with Video Domain Change; Zavesky 2007
model as best observations in old domain
Cross-domain SVM (CD-SVM): Adapting prior models
22
models
extract support vectors
learn new models
new samples
support vectors compute similarity
features samples weighted samples
Submitted: W. Jiang, E. Zavesky, S.F. Chang, A. Loui, “Cross-domain learning methods for high-level concept classification,” ICASSP 2008.Coping with Video Domain Change; Zavesky 2007
Cross-domain SVM (CD-SVM): Adapting prior models
23
Submitted: W. Jiang, E. Zavesky, S.F. Chang, A. Loui, “Cross-domain learning methods for high-level concept classification,” ICASSP 2008.min
w
1 2||w||2
2 + C
|Dnew|
i=1
ǫi +C M
j=1 σ(vold j
, Dnew)ǫj σ(vold
j
, Dnew) = 1 |Dnew|
j
− xi||2
2
+
Old Feature Spacex
margin distance 1 || w ||x + +
helpful support vector from old domain new samples+ + x
x x + + + x x x x+ x x + +
ignored support vector from old domain adapted hyperplaneCoping with Video Domain Change; Zavesky 2007
Cross-domain methods: Observed speed trends
24 method
new example training cost
all
combined all all 3x replication all all 9x CDSVM small all 1.25x
increasing observed performance
Theoretical training cost with 40k samples in old domain, 20k in new domain (similar to TRECVID problem)
new model
1x
training data
Coping with Video Domain Change; Zavesky 2007
Choosing an approach...
predictions can be found in a piece-wise manner
ideal training conditions for each approach...
25
Coping with Video Domain Change; Zavesky 2007
Performance comparison: High positive frequency
26
0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0 Person Face Outdoor Crowd Vegetation Sky Building Walking_Running Urban Office Road Waterscaoe Concept AP (Positive freq > 0.05) AP target combined replication CDSVM A-SVMCoping with Video Domain Change; Zavesky 2007
Performance comparison: Mid positive frequency
27
shaded regions best per concept (5% relative improvement over all others)
Coping with Video Domain Change; Zavesky 2007
0.05 0.10 0.15 0.20 0.25 0.30 0.35 0.40 Charts Snow Mountain Weather Court Maps Explosion_Fire Prisoner Desert Bus Natural-Disaster Airplane Flag-US Concept AP (Positive freq. < 0.01) APPerformance comparison: Low positive frequency
28
shaded regions best per concept (5% relative improvement over all others)
Observed differences but performance too low for significant conclusions?
Coping with Video Domain Change; Zavesky 2007
Which approach to choose to obtain good new domain performance
and performance of old model...
select CDSVM (adapts old to well-defined new domain)
29
Coping with Video Domain Change; Zavesky 2007
Which approach to choose to obtain good new domain performance
and performance of old model...
select replication (learn combined trends)
select new domain only (not enough evidence)
30
Coping with Video Domain Change; Zavesky 2007
Which approach to choose to obtain good new domain performance
and performance of old model...
select new (sparsity risk too high)
31
Coping with Video Domain Change; Zavesky 2007
Which approach to choose to obtain good new domain performance
and performance of old model...
32
Coping with Video Domain Change; Zavesky 2007
Approach selection: Empirical rule set
choose an approach that optimizes new domain performance
33
strong old model high frequency new and old too sparse default choice
Coping with Video Domain Change; Zavesky 2007
Approach selection: Rule-based benefits
34
Observed MAP improvement over new model alone
high frequency mid frequency low frequency
8.7% 29.8% 24.6% shaded regions best per concept (5% relative improvement over all others)
0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0 Person Face Outdoor Crowd Vegetation Sky Building Walking_Running Urban Office Road Waterscaoe Concept AP (Positive freq > 0.05) AP 0.05 0.10 0.15 0.20 0.25 0.30 0.35 0.40 Meeting Car Animal Computer_TV Studio Military People-Marching Boat_Ship Sports Police_Security Truck Concept AP (0.05 <= Positive freq <= 0.01) 0.05 0.10 0.15 0.20 0.25 0.30 0.35 0.40 Charts Snow Mountain Weather Court Maps Explosion_Fire Prisoner Desert Bus Natural-Disaster Airplane Flag-US Concept AP (Positive freq. < 0.01) target combined replication CDSVM A-SVMCoping with Video Domain Change; Zavesky 2007
35
TRECVID 2007 high level features
Coping with Video Domain Change; Zavesky 2007
36
Columbia HLF: TRECVID2007
Concept fusion (explore concept relations) TRECVID 2005 (development) Columbia 374 (baseline models) TRECVID 2007 (development) Cross domain New baseline TRECVID 2007 (test) BCRF (20 improved models) Model adaptation1 2 3
Coping with Video Domain Change; Zavesky 2007
Empirical Results: TRECVID2007
replicated models
was submitted
improved performance for most concepts
edge direction histogram Gabor texture
37
Concept fusion (explore concept relations) TRECVID 2005 (development) Columbia 374 (baseline models) TRECVID 2007 (development) Cross domain New baseline TRECVID 2007 (test) BCRF (20 improved models) Model adaptation 1 2 3Coping with Video Domain Change; Zavesky 2007
0.05 0.10 0.15 0.20 0.25 0.30 0.35 MIAP sports weatherEmpirical performance:
Method comparisons; new vs. replication
38
R5: replicate R6: new R4: bcrf+new R3: bcrf+replicate+new R2: bcrf+replicate+new+old R1: adaptive selectionreplication replication
Coping with Video Domain Change; Zavesky 2007
conclusions & next steps
39
Coping with Video Domain Change; Zavesky 2007
Conclusions
data and models
increase and maintain performance
domain and domain similarity
40
Coping with Video Domain Change; Zavesky 2007
Next Steps: Technical questions for adaptation
distribution analysis is underway
approaches on new data
41
Coping with Video Domain Change; Zavesky 2007
Thanks for your time.
42