Tsinghua @ TRECVID2007.search Zhikun Wang, Dong Wang, Huiyi Wang, - - PowerPoint PPT Presentation

tsinghua trecvid2007 search
SMART_READER_LITE
LIVE PREVIEW

Tsinghua @ TRECVID2007.search Zhikun Wang, Dong Wang, Huiyi Wang, - - PowerPoint PPT Presentation

Tsinghua @ TRECVID2007.search Zhikun Wang, Dong Wang, Huiyi Wang, Tongchun Xiao, Duanpeng Wang, Yingyu Liang, Yang Pang Jianmin Li, Fuzong Lin, Bo Zhang Outline System Overview Concept-Based Search Experiments & Results


slide-1
SLIDE 1

Tsinghua @ TRECVID2007.search

Zhikun Wang, Dong Wang, Huiyi Wang, Tongchun Xiao, Duanpeng Wang, Yingyu Liang, Yang Pang Jianmin Li, Fuzong Lin, Bo Zhang

slide-2
SLIDE 2

Outline

 System Overview  Concept-Based Search  Experiments & Results  Conclusion

slide-3
SLIDE 3

Outline

 System Overview  Concept-Based Search  Experiments & Results  Conclusion

slide-4
SLIDE 4

Automatic Search System

 Framework

Find shots of water with boats or ships

Multimedia Query

Text-based Retrieval Visual-based Retrieval Concept Concept- based Retrieval Multi- Modal Fusion Text Visual

slide-5
SLIDE 5

Automatic Search System

 Text-based search

 Keywords: expanded by WordNet  Transcript segmentation:

shot-level, story-level, video-level

 Result expansion for shot-level search:

scores spread along the timeline

slide-6
SLIDE 6

Automatic Search System

 Text-based search  Visual-based search

 Richer feature set  Feature selection & fixed-value fusion weight:

MAP & consistency 5 features involved

 Several SVM classifiers for each feature  Weighted average multi-feature fusion

slide-7
SLIDE 7

Automatic Search System

 Text-based search  Visual-based search  Concept-based search

 Query-concept mapping

Text-concept mapping Example-concept mapping

 More details come later.

slide-8
SLIDE 8

Automatic Search System

 Text-based search  Visual-based search  Concept-based search  Fusion

 Weighted average  Query-independent

slide-9
SLIDE 9

Interactive Search System

 User interface

 faster, faster and faster  Browsing functions

 Server end

 Several options

slide-10
SLIDE 10

Interactive Search System: UI

 Double-screen interface  Multi-thread browsing

 Temporal thread  Visual neighbor thread

 Frame-level browsing  Browsing function

 Forward, Backward, Bookmark

 Hotkey

slide-11
SLIDE 11

Browsing

Rank list Temporal thread Visual neighbor thread

Faster Browsing

Story browser Frame-level browsing

slide-12
SLIDE 12

Labeling:

Hotkey & Mouse

slide-13
SLIDE 13

Refining

Positive samples Negative samples Uncertain samples

Bookmark

slide-14
SLIDE 14

Server end

 Distributed server end  More options

 1 text-based server  4 SVM models with different features  2 concept-based servers  manually adjusted options Vs. default options

slide-15
SLIDE 15

Outline

 System Overview  Concept-Based Search  Experiments & Results  Conclusion

slide-16
SLIDE 16

Concept-Based Search

 Well established approach  Need theoretical guidance for practical issues  Query-Concept Mapping (QUCOM) Query Image Boat/Ship, Waterscape, …

slide-17
SLIDE 17

Possible Solutions for QUCOM

 User choice?

Text Match

([Snoek, 2006], [Chang, 2006], et c)

 Effective if well matched  Fails to consider  visual correlation  concept performance  concept distribution over

the collection

 Concept Space

Search in Full Space

(e.g. SVM, KNN

[Natsev, 2006], PMIWS [Zheng, 2006])

Search in Concept Subspace

slide-18
SLIDE 18

Concept Selection via c-tf-idf Metric

Concept Relevance Ranking

 tf: frequency of a term in a document  term popularity

 idf: inverse document frequency of a term  term specificity Definition in text area

c: concept, d: shot

c-tf-idf: tf-idf for concept

) ) ( log( ) | ( ) ) ( log( ) , ( : ) , ( c freq N d c P c freq N d c freq d c idf tf c    

slide-19
SLIDE 19

Insight of the tf-idf based Principle

) ) ( log( ) , ( : ) , ( c freq N d c freq d c idf tf c   

Concept Relevance Concept Specificity Query- Dependent Rank Query- Independent Rank

c-tf-idf is a good combination of query-dependent ranks and query-independent ranks, and a promising solution for QUCOM.

slide-20
SLIDE 20

Two Settings for QUCOM

 Automatic video retrieval (AVR)

 limited information as text input, and possibly, image examples

 Interactive video retrieval (IVR)

 unrealistic to ask user provide relevant concepts  Infer the implicit semantic concepts by explicit user feedback

 QUCOM should be

 On a per query analysis basis, on-the-fly,  Combat against varied concept detection performance  Scalable to

 Concepts in a given lexicon  Video archive size

slide-21
SLIDE 21

Concept-Based Search: Search

 Search in concept subspace  Impact of dimension of subspace

0.31 0.315 0.32 0.325 0.33 0.335 1 2 3 4 5 6

MAP

MAP

Experiment on TRECVID 2006, interactive search Experiment on TRECVID 2006, automatic search

slide-22
SLIDE 22

Inferring implicit concepts through explicit feedback: Interactive Search

 Interactive Search

 Using relevance feedback as examples  Higher efficiency: Vs. user-provided examples  Pre-computed offline  Lower user labor: Vs. manual concept selection  Better performance: Vs. previous system  65% improvement upon previous method (without

using concepts)

 experiment on TRECVID 2006, interactive search

slide-23
SLIDE 23

Concept-Based Search: Lexicons

 LSCOM-lite

 39 concept detectors from HLF task

 LSCOM

 374 concepts chosen from LSCOM

 Impact of quality & quantity?

 Experiment on TRECVID 2006, interactive search

slide-24
SLIDE 24

Outline

 System Overview  Concept-Based Search  Experiments & Results  Conclusion

slide-25
SLIDE 25

Automatic runs

 Run1:text

:0.011

 Run2:image + LSCOM-Lite

:0.042

 Run3:text+image

:0.038

 Run4:text+image+LSCOM

:0.043

0.05 0.1 0.15 0.2 0.25 MAP 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 Run1: text Run2: image+LSCOM-Lite Run3: text+image Run4: text+image+LSCOM

slide-26
SLIDE 26

Run1

 Text-based search

 Helpful to topics about Object  Useless to topics about Event or Scene  Unsatisfactory upon non-news video

slide-27
SLIDE 27

Run2, Run3, Run4

 Run2 Vs. Run 4

 Concept detectors from LSCOM(except 39 concepts

from HLF) are trained upon different dataset.

 Run2 Vs. Run3

 Involving concept-based search brings improvement.

0.01 0.02 0.03 0.04 0.05 Run1: text Run2:image+LSCOM-lite Run3:text+image Run4:text+image+LSCOM MAP 0.0104 0.041 0.0376 0.0426

MAP

slide-28
SLIDE 28

Interactive runs

 Run5: expert with manually adjusted options

:0.209

 Run6: expert with default options

:0.171

 RunS: novice with default options

:0.149

0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 MAP 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 Run5: expert with manually adjusted options Run6: expert with default options RunS: novice

slide-29
SLIDE 29

Relevant results retrieved

200 400 600 800 1000 1200 ret_rel rel

slide-30
SLIDE 30

Outline

 System Overview  Concept-Based Search  Experiments & Results  Conclusion

slide-31
SLIDE 31

Conclusion

 Concept-based search is fruitful and

complement to text and visual search

 A easy-to-use UI is essential to interactive

search

 User can make-up the drop in automatic