MCG-ICT-CAS TRECVID 2008 Automatic Video 2008 Automatic Video - - PowerPoint PPT Presentation

mcg ict cas trecvid 2008 automatic video 2008 automatic
SMART_READER_LITE
LIVE PREVIEW

MCG-ICT-CAS TRECVID 2008 Automatic Video 2008 Automatic Video - - PowerPoint PPT Presentation

MCG-ICT-CAS TRECVID 2008 Automatic Video 2008 Automatic Video Retrieval System Retrieval System Juan Cao, Yong-dong Zhang, , g g g, Bai-lan Feng, Xiu-feng Hua, Lei Bao, Xu Zhang INSTITUT INS TE O E OF CO Multimedia Computing Group


slide-1
SLIDE 1

MCG-ICT-CAS TRECVID 2008 Automatic Video 2008 Automatic Video Retrieval System Retrieval System

Juan Cao, Yong-dong Zhang,

INS INSTITUT

, g g g, Bai-lan Feng, Xiu-feng Hua, Lei Bao, Xu Zhang

TE O E OF CO COMPUTIN MPUTING

Multimedia Computing Group Institute of Computing Technology Chinese Academy of Sciences

G TE TECHN CHNOLOGY

NIST TRECVID Workshop November 17,2008

slide-2
SLIDE 2

INSTITUTE OF COMPUTING TECHNOLOGY

Outline

Overall system

Review of baseline retrieval

Review of baseline retrieval Performance analysis

Concept-based retrieval Re-ranking Dynamic fusion

Conclusion Conclusion

slide-3
SLIDE 3

INSTITUTE OF COMPUTING TECHNOLOGY

System Overview

slide-4
SLIDE 4

R i f b li t i l

INSTITUTE OF COMPUTING TECHNOLOGY

Review of baseline retrieval

Text-based retrieval 0.009

ASR shot matching

g A window of 3 shots

Pre-processing Pre processing

Stop words removing stemming

Indexing Indexing

lucence

slide-5
SLIDE 5

R i f b li t i l

INSTITUTE OF COMPUTING TECHNOLOGY

Review of baseline retrieval

Text-based retrieval

0.009

Visual-based retrieval 0.033

Feature extraction

EH CM Sift i l k d EH CM Sift-visual-keywords Early fusion and LDA embedding

Retrieval model Retrieval model

Multi-bag SVM cosine-similarity

Fusion

SSC dynamic fusion

slide-6
SLIDE 6

R i f b li t i l

INSTITUTE OF COMPUTING TECHNOLOGY

Review of baseline retrieval

Text-based retrieval 0.009 Visual-based retrieval

0 033

Visual based retrieval 0.033 HLF-based retrieval

0.029

C d

Concept detectors

  • CU-VIREO374

R t i l M d l

Retrieval Model

  • Multi-bag svm

[Acknowledgement]: Thank Dr. Yu-Gang Jiang for great help in the experiments. p p

slide-7
SLIDE 7

R i f b li t i l

INSTITUTE OF COMPUTING TECHNOLOGY

Review of baseline retrieval

Text-based retrieval 0.009 Visual-based retrieval

0 033

Visual based retrieval 0.033 HLF-based retrieval 0.029 Concept-based retrieval 0.044

p

Keywords mapping DBCS mapping DBCS mapping

slide-8
SLIDE 8

Review of baseline retrieval

INSTITUTE OF COMPUTING TECHNOLOGY

Review of baseline retrieval

0 009

Text-based retrieval 0.009 Visual-based retrieval 0.033 HLF-based retrieval 0.029 Concept-based retrieval

0.044

24%

Re-ranking 0.036

Face Face motion

slide-9
SLIDE 9

R i f b li t i l

INSTITUTE OF COMPUTING TECHNOLOGY

Review of baseline retrieval

Text-based retrieval 0.009 Visual-based retrieval 0.033 HLF-based retrieval 0.029 Concept-based retrieval

0.044

Re-ranking 0.036 SSC Dynamic fusion SSC Dynamic fusion

slide-10
SLIDE 10

Q t t i

INSTITUTE OF COMPUTING TECHNOLOGY

Query-to-concept mapping

Semantic similarity

Data:

retrieve the most similar Related work

query(textual description,visual examples) Aim: Max{similarity( query, concept)}

most similar concepts

Statistic similarity

d th

Statistic similarity

Data: Collection(ASR text, concept distribution)

reduce the most co-occurrence

Aim: Max {p(query , concept)}

concepts

slide-11
SLIDE 11

Q t t i

INSTITUTE OF COMPUTING TECHNOLOGY

Query-to-concept mapping

Semantic similarity

Data:

retrieve the most similar Related work

query(textual description,visual examples) Aim: Max{similarity( query, concept)}

most similar concepts Correct to describe the query useful to identify the query

Statistic similarity

d th

Statistic similarity

Data: Collection(ASR text, concept distribution)

reduce the most co-occurrence

Aim: Max {p(query , concept)}

concepts

slide-12
SLIDE 12

Wh t i f l ?

INSTITUTE OF COMPUTING TECHNOLOGY

What is useful ?

Discriminability-ranking

The distributions fluctuate widely between The distributions fluctuate widely between

the given category and the others, but i t bl ithi thi remain stable within this one.

Factors Factors

Difference of the concept distribution Detector performance Collection characteristic Collection characteristic

slide-13
SLIDE 13

Distribution Based Concept

INSTITUTE OF COMPUTING TECHNOLOGY

Distribution Based Concept Selection Framework(DBCS)

2 ( ) ( ( ) ( ))( ( ) ( )) VAC t i F t F t F t F t ∑

VAC: the difference between categories

2 ( , ) ( ( , ) ( , ))( ( , ) ( , )) VAC t c sign F t c F t c F t c F t c i i j i j j i ← − − ∑ ≠

VIC: the difference within the given category

2

1 ( , ) ( ( , ) ( , ))

i

i i s c i

V IC t c F t s F t c n

← −

g g y

( ) ( , ) / ( , )

i i

Score t VAC t c VIC t c =

Discriminability-score

Where F(t,s) is the distribution function of concept

( ) ( , ) ( , )

i i

( , ) p t in shot s.

slide-14
SLIDE 14

Example-1

INSTITUTE OF COMPUTING TECHNOLOGY

Discriminability-similarity consistency

Topic248 Find shots of a crowd of people

  • utdoors

Topic248 Find shots of a crowd of people , outdoors ,

filling more than half of the frame area DBCS approach Text selection approach DBCS approach infAP=0.321

Crowd 1 40

Text selection approach infAP=0.203

Crowd Crowd 1.40 People_Marching 0.92 Crowd Outdoors Demonstration_Or_ Protest 0.64 Protesters 0 55 Person

Factor-1: outdoors and

Protesters 0.55 Dark- skinned People 0.52

Factor 1: outdoors and person also frequently

  • ccur in other case.

skinned_People

Factor-3: collection characteristic

slide-15
SLIDE 15

Example-2 Di i i bilit i il it i i t

INSTITUTE OF COMPUTING TECHNOLOGY

Discriminability-similarity inconsistency

Topic261 Find shots of one or more people at a Topic261 Find shots of one or more people at a

table or desk , with a computer visible T t l ti h DBCS approach infAP=0.116 Text selection approach infAP=0.012

Attached_Body_Parts

0.55 Classroom 0.30 Computer Computer_Or_Television_Sc reens Medical_Personnel 0.27 Body Parts 0 25 reens person Body_Parts 0.25 Hand 0.23

Factor 2: computer d t t i t li bl detector is not reliable

slide-16
SLIDE 16

INSTITUTE OF COMPUTING TECHNOLOGY

Re-ranking

f f

face and motion factors

shot-level average face size and position shot-leve principal motion direction and intensity

S S F t S F t C ffi i t ′ Score Score FactorScore FactorCoefficient ′ = + ×

Shot-level vs. Keyframe-level Extract the stable factor Re-ranking selection R d th ti ff t Reduce the negative effect

slide-17
SLIDE 17

INSTITUTE OF COMPUTING TECHNOLOGY

Dynamic fusion

S S C (SSC)

Smoothed Similarity Cluster(SSC)

  • A feature undergoes a rapid change in its normalized scores is

likely to perform better than a feature which undergoes a more likely to perform better than a feature which undergoes a more gradual transition.

1 1000 ( ( ) ( )) 1 1000 ( ( ) ( 1)) 1 1000 1 ( ( ) ( 1)) 1 score n score n n SC N score n score n n N − + ∑ = = − + ∑ =

[P. Wilkins,2007]

N

( ) ( ) median SC SSC t d d d i ti SC =

SC is unstable in real noisy data.

( ) standard deviation SC

Run SSC Score Run Weight

In our system, all fusion y

Run Weight All SSC Scores = ∑

processes are realized by SSC method.

slide-18
SLIDE 18

INSTITUTE OF COMPUTING TECHNOLOGY

Outline

Overall System

Review of baseline retrieval

Review of baseline retrieval Performance analysis

Concept-based retrieval Re-ranking Dynamic fusion

conclusion conclusion

slide-19
SLIDE 19

Overview of submitted and b itt d

INSTITUTE OF COMPUTING TECHNOLOGY

unsubmitted runs

Run Description Mean InfAP Run Description Mean InfAP Run 1: Text baseline 0.009 Run2*: Visual baseline(Multi-bag SVM) 0.024 Run2 : Visual baseline(Multi bag SVM) 0.024 Run3*: Visual baseline(LDA) 0.028 Run4: SSC(Run2, Run3) 0.033

* is the

Run 5: HLF baseline(svm, CU-VIREO374) 0.029 Run 6: HLF baseline +re-ranking 0.036

unsubmiited run

Run 7*: Concept retrieval(text map, CU-VIREO374) 0.026 Run 8*: Concept retrieval(DBCS map, CU-VIREO374) 0.039 Run 9*: SSC(Run7 + Run8) 0.043 Run 10: SSC(Run5 + Run9) 0.053 Run 11: SSC(Run4 + Run9) 0.067

slide-20
SLIDE 20

INSTITUTE OF COMPUTING TECHNOLOGY

Overall performance analysis

0.07 0.08 0.05 0.06 0.07

Fusion of visual and concept-based runs Best concept-based run

0.02 0.03 0.04

Best visual-based run Our text-based run

0.01 1 3 5 7 9 11 13 15 17 19 21 23 25 27 29 31 33 35 37 39 41 43 45 47 49 51 53 55 57

Our text-based run

A t ti h f TRECVID2008 Automatic search runs of TRECVID2008

slide-21
SLIDE 21

INSTITUTE OF COMPUTING TECHNOLOGY

Conclusion-1

Concept-based retrieval is a promising

direction. direction. DBCS i th d hi

DBCS mapping method can achieve a

stable good performance.

The difference of the concept distribution is

more useful than the distribution itself .

Select concepts independent of the detector

performance is not reasonable.

slide-22
SLIDE 22

INSTITUTE OF COMPUTING TECHNOLOGY

Conclusion-2

Face and motion based re ranking

Face and motion based re-ranking

technology is important for some special t i topics.

Shot-level feature is stable Reducing the negative effect is important

slide-23
SLIDE 23

INSTITUTE OF COMPUTING TECHNOLOGY

Conclusion-3

SSC dynamic fusion can make

y improvement in more than 80% cases, especially in the case of fusing different especially in the case of fusing different features.

slide-24
SLIDE 24

INSTITUTE OF COMPUTING TECHNOLOGY

Th k ! Thank you!

Any more details can contact: caojuan@ict ac cn caojuan@ict.ac.cn