They Are Not Equally Reliable: Semantic Event Search using - - PowerPoint PPT Presentation
They Are Not Equally Reliable: Semantic Event Search using - - PowerPoint PPT Presentation
They Are Not Equally Reliable: Semantic Event Search using Differentiated Concept Classifiers Inkyu An Content 1. Motivation 2. Previous paper 3. Goal 4. Related Work 5. Approach 6. Result 2 They Are Not Equally Reliable: Semantic Event
2
- 1. Motivation
- 2. Previous paper
- 3. Goal
- 4. Related Work
- 5. Approach
- 6. Result
Content
3
MOTIVATION
They Are Not Equally Reliable: Semantic Event Search using Differentiated Concept Classifiers
4
Motivation | Semantic image retrieval
Person interacting with panda
<Query>
Is it better to use meaning of sentence?
<Query sentence>
5
PREVIOUS PAPER
They Are Not Equally Reliable: Semantic Event Search using Differentiated Concept Classifiers
6
Person interacting with panda
Query image
Person feeding panda Person holding animals Person feeding calf
Implied-by Type-of Mutual-exclusive Result images
Previous paper | Semantic image retrieval
7
[Girls doing handstand]
CNN feature
Extracting image features (CNN)
Word Vector
Word2Vector (Skip-grams)
Query
Sentence Image
- Extract the image features and word vectors
Previous paper | Semantic image retrieval
8
[Girls doing handstand] CNN feature Word Vector Nonsimilar Similar Query CNN feature Word Vector CNN feature Word Vector
System
ο Measure scores of Mutually exclusive, Implied-by and Type-of
Training β¦
[Girl dancing on beach] [Girl doing cartwheel]
Previous paper | Semantic image retrieval
9
Previous paper | Semantic image retrieval
10
Previous paper | Semantic image retrieval
- There are scalability & Time-consuming
issue
- π = π ππ + π·ππ«πππ + π·ππ«πππ + π·π π«π πππ + π πΏ π
π
π·ππ ππππ’π£π ππ‘ ππππππ‘πππ βΆ 4096 π΅ππ’ππππ‘ ππππππ‘πππ βΆ 27425, π ππππ’ππ ππππππ‘ βΆ 100 πΉππππππππ ππππππ‘πππ βΆ π(64) Especially Those issues could be fatal in video search algorithms
11
GOAL
They Are Not Equally Reliable: Semantic Event Search using Differentiated Concept Classifiers
12
Goal | Semantic event search from videos
Input sentence : βHorse Riding Competitionβ without video
Video Search System
Result videos Test videos 98,000 videos
13
Goal | Semantic event search from videos
< Main Contribution > 1) Unsupervised Learning 2) Solve the scalability issue 3) Faster than other method 4) Differentiated Concept Classifiers
14
APPROACH
They Are Not Equally Reliable: Semantic Event Search using Differentiated Concept Classifiers
15
Related Work |
- 1. Skip-grams
- Weight vectors of actions(Sentence)
- 2. Spectral meta-learning
- Unsupervised Learning Method
16
Approach | Proposed framework
Detected Videos
17
Approach | Proposed framework
Detected Videos
Relevance Vector [Binary Vector] Warped Spectral meta-learning [Unsupervised & Fast conversion]
18
Approach | Unsupervised Learning
Test Videos, π€
1 π€ 3 2
βHorse riding competitionβ true or false?
???
- Because this is unsupervised learning, We donβt know the
test video is βHorse riding competitionβ or not. ??? ??? ???
19
Approach | Unsupervised Learning
Test Videos, π€
1 π€ 3 2
βHorse riding competitionβ true or false?
true
- Because this is unsupervised learning, We donβt know the
test video is βHorse riding competitionβ or not. true false true Proposed System Word2Vector
20
Approach | Word to Vector
βHorse riding competitionβ Bee Biking Horse Riding Blowing Candle
Concept Vocabulary (total m) Event Description Skip-Gram Model
π
πΌππ π‘π π πππππ π·πππππ’ππππ
π
πΆπππ₯πππ π·πππππ
π
πΆππ
π
πΆπππππ ππΌππ π‘π ππππππ
π
πΌππ π‘π π πππππ π·πππππ’ππππ
ππΌππ π‘π
ππππππ
π
πΆπππππ
π
πΆπππ₯πππ π·πππππ
π
πΆππ
Word Vector
- Apply Skip-Gram method to both the event and concepts
21
Approach | Relevance Score Vector
π
πΌππ π‘π π πππππ π·πππππ’ππππ
π
ππππππ
π
πΌππ π‘π π πβππ₯ ππ£πππππ
π
πΊππππ
π
πΌππ π‘π π πππππ π·πππππ’ππππ
Compute distance
Concept Vocabulary(total m) 0.8726 0.7647 0.7256 0.0624 π
ππππππ
π
πΌππ π‘π
π πβππ₯
ππ£πππππ
π
πΊππππ
Too High Too Low
Event Description Relevance Vector βwβ 1 1 Relevance Score Vector
- Compute distances between the event and concepts and make
Relevance Vectors
- Relevance Vector means how the event is similar with concepts
Binary vector
22
Approach | Proposed framework
Detected Videos
Relevance Vector [Binary Vector] Waped Spectral meta-learning [Unsupervised & Fast conversion]
23
Approach | Differentiated Concept Classifier
Concepts, π Test Videos, π€
π1,1, π1,2, β¦ , π1,π π2,1, π2,2, β¦ , π2,π π3,1, π3,2, β¦ , π3,π ππ€,1, ππ€,2, β¦ , ππ€,π Compute Similarity 1 π 3 2 1 π€ 3 2
π»π,π β βπ, π
- Differentiated Concept Classifier measures the similarity
between the test video and concepts If the 1st video is similar with concept 1, π1,1 is 1. If the 1st video isnβt similar with concept 1, π1,1 is -1.
24
Approach | Spectral meta-learning
Test Videos, π€
1 π€ 3 2
βHorse riding competitionβ true or false?
???
- Because this is unsupervised learning,
??? ??? ???
πβ = ππππ π=π
π π»π π
πππ β π β ππππ π=π
π π»π π ππ ο Estimate the eigenvector π£π of concept classifierβs covariance matrix to find the optimal solution ππ βΆ Accuracy of the iβ²th concept classifier
25
Approach | Generalized Conditional Gradient (GCG)
- Because this is unsupervised learning,
- to find a eigenvector π£π of covariance matrix, They used
Generalized Conditional Gradient(GCG) algorithm.
- GCG algorithm can be converged quickly.
min
π πβ π
π£π£π π,π β π π,π
2
+ π π£ 2
- Update the eigenvector π£
π£ β πππππππ ππππππ€πππ’ππ ππ β π» Repeat until convergence β¦ { }
π§β β π‘πππ π=1
π ππ π€ π£π
- Local minimizer
- Rank Test videos using below equation
26
RESULT
They Are Not Equally Reliable: Semantic Event Search using Differentiated Concept Classifiers
27
Result | Speed comparison on synthetic data
- It is Faster than previous works
28
Result | Mean average precision result
πβπ ππ£ππππ ππ πππππππ’π‘ βΆ 3,135 (π)
29
Summary |
- Solve scalability & time-consuming issues
- n unsupervised learning.
- They used Skip-grams to convert a word to
a vector.
- They used Spectral-meta learning method
to solve the unsupervised problem.
- They used Generalized Conditional Gradient
(GCG) algorithm to improve the calculation speed.
30
- Thank you.
Q & A |
31
APPENDIX
They Are Not Equally Reliable: Semantic Event Search using Differentiated Concept Classifiers
32
Appendix | Spectral meta-learning
ππ = Pr ππ π€ = 1 | π§ = 1 ππ = Pr ππ π€ = β1 | π§ = β1 ππ = ππ + ππ 2
- The accuracy of the π-th
concept classifier at π€ video
π§β = argmaxyβ π1 π€ , β¦ , ππ π€ ; π§
π=1
π Pr ππ π |π§
= β π1 π€ , β¦ , ππ π€ ; π§
ο Find a maximum π§ point
- f likelihood
π§β = π‘πππ π=1
π ππ π€
2ππ β 1 β π‘πππ π=1
π ππ π€ π£π
ο Estimate the eigenvector π£π by finding the optimal solution rather than ππ ο Because π£π β 2ππ β 1
33
Appendix | Spectral meta-learning
min
πβ₯0, π πππ π =1 πβ π
π π,π β ππ,π
2
ππππ πππ πππ’π ππ¦ πΊ = ππππΌ π: ππππππ€πππ£π, π: ππππππ€πππ’ππ
π π,π = πΉπ€[ ππ π€ β ππ (π
π π€ β ππ)] =
1 β ππ
2, π = π
2ππ β 1 2ππ β 1 1 β π2 , π β π
- Covariance matrix π between concept i, and concept j at video v
Ranking and combining multiple predictors without labeled data [PNAS, 2014]
- mean prediction π of concept i
ππ = πΉπ€[ππ π€ ]
34
Appendix | warping function
π’π π€ = π
π ππ π€
= π₯πππ π€ , π = 1, β¦ , π
- To incorporate the relevance vector βwβ (page 16), They made
warping functions
- Also, covariance matrix π and mean is converted into
π π and ππ
π π,π
π =
1 π β 1
π=1 π
π’π π€π β ππ
π
π’π π€π β ππ
π ,
π£π
π = 1
π
π=1 π
π’π π€π π§π = π‘πππ
π=1 π
π
π ππ π£π
- The spectral meta-learner for the warped classifiers
35
Appendix | Generalized Conditional Gradient (GCG)
π» = π π¦ = 0, π¦ = 0 ππ,π β π π,π, π¦ β 0, {π β π£π’π£π’
π}
min
π πβ π
π£π£π π,π β π π,π
2
+ π π£ 2
- Update the eigenvector π£
π£ β πππππππ ππππππ€πππ’ππ ππ β π» Repeat until convergence β¦ { }
π§β β π‘πππ π=1
π ππ π€ π£π
- Local minimizer
- Rank Test videos using below equation
36
Appendix | Generalized Conditional Gradient (GCG)
min
πβ₯0, π πππ π =1 πβ π
π π,π β ππ,π
2
ππππ πππ πππ’π ππ¦ πΊ = ππππΌ π: ππππππ€πππ£π, π: ππππππ€πππ’ππ
- Because
π π,π β ππ,π
2 is not convex function, We need other
function to convergence.
- GCG is algorithm for solving a optimization problem quickly.
min
π πβ π
πππ π,π β π π,π
2
+ π π 2
- Update the eigenvector π£ every iteration
π» = πΌπ[
πβ π
ππ,π β π π,π
2]