TRECVID 2011 TokyoTech+Canon
Semantic Indexing Using GMM Supervectors and Tree-structured GMMs
Nakamasa Inoue, Koichi Shinoda, Department of Computer Science, Tokyo Institute of Technology
Semantic Indexing Using GMM Supervectors and Tree-structured GMMs - - PowerPoint PPT Presentation
TRECVID 2011 TokyoTech+Canon Semantic Indexing Using GMM Supervectors and Tree-structured GMMs Nakamasa Inoue, Koichi Shinoda, Department of Computer Science, Tokyo Institute of Technology TRECVID 2011 TokyoTech+Canon Outline System
Nakamasa Inoue, Koichi Shinoda, Department of Computer Science, Tokyo Institute of Technology
System overview Fast and high-performance semantic indexing system
Best result: Mean InfAP = 17.3% 1
1) SIFT-Har 6) MFCC GMM supervectors SVM score Score fusion
Fast and high-performance semantic indexing system
Tree-sturuc tured GMMs 2) SIFT-Hes 3) SIFTH-Dense 4) HOG-Dense 5) HOG-Sub … … … … SVM score
2
1) SIFT-Har 6) MFCC GMM supervectors SVM score Score fusion
Fast and high-performance semantic indexing system
Tree-sturuc tured GMMs 2) SIFT-Hes 3) SIFTH-Dense 4) HOG-Dense 5) HOG-Sub … … … … SVM score
GMM supervectors SVM score Score fusion Tree-sturuc tured GMMs nse se … … … … SVM score
3
4
5
6
1) SIFT-Har 6) MFCC GMM supervectors SVM score Score fusion
Fast and high-performance semantic indexing system
Tree-sturuc tured GMMs 2) SIFT-Hes 3) SIFTH-Dense 4) HOG-Dense 5) HOG-Sub … … … … SVM score
1) SIFT-Har 6) MFCC 2) SIFT-Hes 3) SIFTH-D 4) HOG-De 5) HOG-Sub SVM score Score fusion SVM score
7
Each shot is model by a GMM
GMM parameters are estimated by using
8
(Basic) MAP adaptation for mean vectors:
where
9
: responsibility of component for
Gaussian components
10
: responsibility of component for
Gaussian components
10
Calculate responsibilities quickly. [ Nakamasa Inoue, Koichi Shinoda, “A Fast MAP Adaptation Technique for GMM- supervector-based Video Semantic Indexing Systems,”In Proc. of ACM Multimedia (short paper), 2011]
Gaussian components
11
Leaf layer
[ Nakamasa Inoue, Koichi Shinoda, “A Fast MAP Adaptation Technique for GMM- supervector-based Video Semantic Indexing Systems,”In Proc. of ACM Multimedia (short paper), 2011]
Gaussian components
11
Non-leaf layers
[ Nakamasa Inoue, Koichi Shinoda, “A Fast MAP Adaptation Technique for GMM- supervector-based Video Semantic Indexing Systems,”In Proc. of ACM Multimedia (short paper), 2011]
Gaussian components
11
Non-leaf layers [ Nakamasa Inoue, Koichi Shinoda, “A Fast MAP Adaptation Technique for GMM- supervector-based Video Semantic Indexing Systems,”In Proc. of ACM Multimedia (short paper), 2011]
Gaussian components
11
Non-leaf layers
[ Nakamasa Inoue, Koichi Shinoda, “A Fast MAP Adaptation Technique for GMM- supervector-based Video Semantic Indexing Systems,”In Proc. of ACM Multimedia (short paper), 2011]
Gaussian components
11
Calculate responsibilities quickly. [ Nakamasa Inoue, Koichi Shinoda, “A Fast MAP Adaptation Technique for GMM- supervector-based Video Semantic Indexing Systems,”In Proc. of ACM Multimedia (short paper), 2011]
Gaussian components
12
[ Nakamasa Inoue, Koichi Shinoda, “A Fast MAP Adaptation Technique for GMM- supervector-based Video Semantic Indexing Systems,”In Proc. of ACM Multimedia (short paper), 2011] Calculate responsibilities quickly.
12
[ Nakamasa Inoue, Koichi Shinoda, “A Fast MAP Adaptation Technique for GMM- supervector-based Video Semantic Indexing Systems,”In Proc. of ACM Multimedia (short paper), 2011] Calculate responsibilities quickly.
12
[ Nakamasa Inoue, Koichi Shinoda, “A Fast MAP Adaptation Technique for GMM- supervector-based Video Semantic Indexing Systems,”In Proc. of ACM Multimedia (short paper), 2011] Calculate responsibilities quickly.
12
[ Nakamasa Inoue, Koichi Shinoda, “A Fast MAP Adaptation Technique for GMM- supervector-based Video Semantic Indexing Systems,”In Proc. of ACM Multimedia (short paper), 2011] Calculate responsibilities quickly.
12
Summary of the algorithm
[ Nakamasa Inoue, Koichi Shinoda, “A Fast MAP Adaptation Technique for GMM- supervector-based Video Semantic Indexing Systems,”In Proc. of ACM Multimedia (short paper), 2011] 13
Summary of the algorithm
[ Nakamasa Inoue, Koichi Shinoda, “A Fast MAP Adaptation Technique for GMM- supervector-based Video Semantic Indexing Systems,”In Proc. of ACM Multimedia (short paper), 2011] 13
Calculation time for MAP adaptation
14
Combine normalized mean vectors.
15
1) SIFT-Har 6) MFCC GMM supervectors SVM score Score fusion
Fast and high-performance semantic indexing system
Tree-sturuc tured GMMs 2) SIFT-Hes 3) SIFTH-Dense 4) HOG-Dense 5) HOG-Sub … … … … SVM score
1) SIFT-Har 6) MFCC GMM supervector Tree-sturuc tured GMMs 2) SIFT-Hes 3) SIFTH-Dense 4) HOG-Dense 5) HOG-Sub … … … …
16
SVMs are trained with RBF-kernels
Score fusion
17
TokyoTech_Canon_1
TokyoTech_Canon_2
18
TokyoTech_Canon_3
TokyoTech_Canon_4
19
20
21
22
6 types of audio and visual GMM supervectors
Fast MAP adaptation
Future work
23