TRECVID-2006: Shot Boundary Detection Task Overview Alan Smeaton - PowerPoint PPT Presentation

TRECVID-2006: Shot Boundary Detection Task Overview Alan Smeaton Dublin City University & Paul Over NIST

SB Task Definition  Shot boundary detection is a fundamental task in any kind of video content manipulation  Task provides a good entry for groups who wish to “break into” video retrieval and TRECVID gradually  Task is to identify the shot boundaries with their location and type (cut or gradual) in the given video clip(s) TRECVID 2006 2

SB Task Details  Groups may submit up to 10 runs  Comparison to human-annotated reference (thanks to Jonathan Lasko, again)  Groups were asked to provide some standard information on the processing complexity of each run:  Total runtime in seconds  Total decode time in seconds  Total segmentation time in seconds  Processor description TRECVID 2006 3

Shot boundary task: Participating groups (26) 1. AIIA Laboratory Greece 14. IIT / NCSR Demokritis Greece 2. AT&T Laboratories USA 15. KDDI / Tokushima U. / ISM / NII Japan 3. Chinese Academy of Sciences / JDL China 16. ETIS Greece 4. City University of Hong Kong China 17. Motorola Research Lab. USA 5. CLIPS-IMAG, LSR-IMAG France 18. RMIT University Australia 6. COST292 EU 19. Tokyo Institute of Technology Japan 7. Curtin University Australia 20. Tsinghua University China 8. Dokuz Eylol Turkey 21. University of Marburg Germany 9. Florida International University USA 22. University of Modena Reggio Italy 10. FX Palo Alto Laboratory USA 23. Carleton University (Ottawa) Canada 11. Helsinki University of Technology Finland 24. University of Sao Paulo (USP) Brazil 12. Huazhong U. of Science & Tech. China 25. University Rey Juan Carlos Spain 13. Indian Institute of Tecnology, 26. Zhejiang University China Bombay India 2005 had 21 groups, of whom 9 appear again in 2006 TRECVID 2006 4

Shot boundary data  13 representative news videos  Total frames: 597043  Total transitions: 3785  Transition types:  1,844 (48.7%) Cuts (2005: 60.8%)  1,509 (39.9%) Dissolves (2005:30.5%)  51 ( 1.3%) Fade-out/-in (2005: 1.8%)  381 (10.1%) other (2005: 6.9%)  More graduals, which are harder to match TRECVID 2006 5

Shot boundary data – more short graduals  Short graduals: graduals <= 5 frames in length  Harder to match - treated as “cuts” but no 5-frame expansion as with other cuts to handle differences in decoders  2006 data has more “short graduals” Short graduals 2006 2005 2004 2003 % of graduals 47 35 24 7 % of all 24 14 10 2 TRECVID 2006 6

Evaluation Measures # Transitions Correctly Reported Precision = # Transitions Reported # Transitions Correctly Reported Recall = # Transitions in Reference # Frames Correctly Reported in Detected Transitions Frame Precision = # Frames reported in Detected Transitions # Frames Correctly Reported in Detected Transitions Frame Recall = # Frames in Reference Data for Detected Transitions TRECVID 2006 7

Cuts AIIA ATT 1 CityUHK CLIPS 0.9 COST292 Curtin 0.8 0.7 Carleton.UO DokuzEylulU Precision 0.6 ETIS FIU 0.5 FXPAL Huazhong 0.4 IIT.NCSR CAS.JDL 0.3 KDDI.TU.TUT USaoPaolo 0.2 Motorola Marburg 0.1 HelsinkiUT RMIT 0 Zhejiang Tsinghua 0 1 0 .1 0 .2 0 .3 0 .4 0 .5 0 .6 0 .7 0 .8 0 .9 TokyoInstTech UniMore Recall URJC ITT.Bombay TRECVID 2006 8

Cuts (zoomed) AIIA ATT 1 CityUHK CLIPS 0.95 COST292 Curtin 0.9 0.85 Carleton.UO DokuzEylulU Precision 0.8 ETIS FIU 0.75 FXPAL Huazhong 0.7 IIT.NCSR CAS.JDL 0.65 KDDI.TU.TUT USaoPaolo 0.6 Motorola Marburg 0.55 HelsinkiUT RMIT 0.5 Zhejiang Tsinghua 1 0 .5 0 .6 0 .7 0 .8 0 .9 TokyoInstTech UniMore Recall URJC ITT.Bombay TRECVID 2006 9

Cuts (zoomed again) AIIA ATT 1 CityUHK CLIPS COST292 Curtin 0.95 Carleton.UO DokuzEylulU Precision 0.9 ETIS FIU FXPAL Huazhong 0.85 IIT.NCSR CAS.JDL KDDI.TU.TUT USaoPaolo 0.8 Motorola Marburg HelsinkiUT RMIT 0.75 0 .7 5 0 .8 5 0 .9 5 Zhejiang Tsinghua TokyoInstTech UniMore Recall URJC ITT.Bombay TRECVID 2006 10

Gradual transitions AIIA ATT 1 CityUHK CLIPS 0.9 COST292 Curtin 0.8 0.7 Carleton.UO DokuzEylulU Precision 0.6 ETIS FIU 0.5 FXPAL Huazhong 0.4 IIT.NCSR CAS.JDL 0.3 KDDI.TU.TUT USaoPaolo 0.2 Motorola Marburg 0.1 HelsinkiUT RMIT 0 Zhejiang Tsinghua 0 1 0 .1 0 .2 0 .3 0 .4 0 .5 0 .6 0 .7 0 .8 0 .9 TokyoInstTech UniMore Recall URJC ITT.Bombay TRECVID 2006 11

Gradual transitions (zoomed) AIIA ATT 1 CityUHK CLIPS 0.95 COST292 Curtin 0.9 0.85 Carleton.UO DokuzEylulU Precision 0.8 ETIS FIU 0.75 FXPAL Huazhong 0.7 IIT.NCSR CAS.JDL 0.65 KDDI.TU.TUT USaoPaolo 0.6 Motorola Marburg 0.55 HelsinkiUT RMIT 0.5 Zhejiang Tsinghua 1 0 .5 0 .6 0 .7 0 .8 0 .9 TokyoInstTech UniMore Recall URJC ITT.Bombay TRECVID 2006 12

Gradual transitions (Frame-P & -R) AIIA ATT 1 CityUHK CLIPS 0.9 COST292 Curtin 0.8 0.7 Carleton.UO DokuzEylulU Precision 0.6 ETIS FIU 0.5 FXPAL Huazhong 0.4 IIT.NCSR CAS.JDL 0.3 KDDI.TU.TUT USaoPaolo 0.2 Motorola Marburg 0.1 HelsinkiUT RMIT 0 Zhejiang Tsinghua 0 1 0 .1 0 .2 0 .3 0 .4 0 .5 0 .6 0 .7 0 .8 0 .9 TokyoInstTech UniMore Recall URJC ITT.Bombay TRECVID 2006 13

Gradual transitions (Frame-P & -R) zoomed AIIA ATT 1 CityUHK CLIPS 0.95 COST292 Curtin 0.9 0.85 Carleton.UO DokuzEylulU Precision 0.8 ETIS FIU 0.75 FXPAL Huazhong 0.7 IIT.NCSR CAS.JDL 0.65 KDDI.TU.TUT USaoPaolo 0.6 Motorola Marburg 0.55 HelsinkiUT RMIT 0.5 Zhejiang Tsinghua 1 0 .5 0 .6 0 .7 0 .8 0 .9 TokyoInstTech UniMore Recall URJC ITT.Bombay TRECVID 2006 14

Mean runtime in seconds 420000 400000 380000 360000 340000 Mean runtim e ( s) 320000 300000 280000 260000 240000 220000 200000 180000 160000 140000 120000 100000 80000 60000 40000 20000 0 REALTIME ====> Chinese U HK UNIMORE Tsinghua U. RMIT IIT-Bombay AIIA MR Motorola FIU CU-Uottawa TokyoTech HUT SirCy9 KDDI ATT USP FXPAL ETIS hust COST292 JDL DEU CLIPS URJC Participant TRECVID 2006 15

Mean runtime in seconds (faster than realtime) 20000 Mean runtim e ( s) 15000 10000 5000 0 E a T E R I K 9 t 2 T . L P s D U l R I M y 9 o D M H T S u M D C 2 O r A J I h a U o T R T K U r u M t i S o L S h e I A M O N g s E n e C U i R n s i T h C Participant TRECVID 2006 16

Mean total runtime vs effectiveness on cuts (for systems faster than realtime) Average F1 (harmonic mean 1 ATT of precision and recall) COST292 0.9 Huazhong JDL (CAS) 0.8 KDDI.TU.TUT USaoPaolo 0.7 Motorola Marburg 0.6 RMIT Tsinghua 0.5 TokyoInstTech UniMore 0.4 2000 4000 6000 8000 10000 12000 14000 16000 18000 0 Mean total runtime (seconds) TRECVID 2006 17

Mean total runtime vs effectiveness on graduals (for systems faster than realtime) Average F1 (harmonic mean 1 ATT of precision and recall) COST292 0.9 Huazhong JDL (CAS) 0.8 KDDI.TU.TUT USaoPaolo 0.7 Motorola Marburg 0.6 RMIT Tsinghua 0.5 TokyoInstTech UniMore 0.4 2000 4000 6000 8000 10000 12000 14000 16000 18000 0 Mean total runtime (seconds) TRECVID 2006 18

1. AIIA Laboratory  ICASSP2006 paper describes using information from multiple pairs of frames, within a temporal window;  Good for GTs, which it targets  10 runs, varying thresholds  Frame similarity is color based, not histogram bins but intensity of R,G,B, window size  Downsampled frame size for 25%  Performance … several others do better for cuts and also for GTs, but in FR/FP they are better  Computational expense as expected, several xRT, but novel TRECVID 2006 19

2. AT&T Laboratories  Built 6x independent detectors for cuts, fast dissolves (<5Fs), fade-in, fade-out, dissolve, and wipes;  Easy to plug in new detectors;  Fusion of outputs, fuse & resolve conflicts  Each detector is a FSM (details in paper)  Extract color RGB & intensity, histograms, edges, average, variance, skew, flatness, all from a central area of frame -> losing the borders;  Compute frame-frame for adjacent and 6-distant frames;  Late fusion with prioritisation of detection types;  7th fastest in execution and rates well in performance TRECVID 2006 20

3. Chinese Academy of Sciences / JDL  2-pass approach … histograms and mutual information  Thresholding to locate possible SBs then a SVM on those candidate areas;  Rationale based on not needing detailed features around every frame;  Needs to improve distinction between GTs and camera motion, which gives false +s;  Histograms are color based  Results deflated by their decoder being 1 frame out of sync with evaluation numbering; TRECVID 2006 21

4. City University of Hong Kong  Used RGB and HSV color spaces;  Euclidean distance, color moments and Earth Mover distances  EMD best  Used adaptive thresholding, adapting to mean and standard deviations in 11-frame window;  Good for cuts and short GTs;  Separate GT detector; TRECVID 2006 22

TRECVID-2006: Shot Boundary Detection Task Overview Alan Smeaton - PowerPoint PPT Presentation

TRECVID-2006: Shot Boundary Detection Task Overview Alan Smeaton Dublin City University & Paul Over NIST SB Task Definition Shot boundary detection is a fundamental task in any kind of video content manipulation Task provides

TRECVID-2005: Shot Boundary Detection Task Overview Alan Smeaton Dublin City University &

SHOT Brand Price NOTES WEST COAST MAGNUM SIZES 4 - 9 $ 39.20 Eagle shot prices may not be

CMU @ TRECVID Event Detection @ Ming-yu Chen & Alex Hauptmann School of Computer Science

Event Detection in Airport Surveillance The TRECVid 2008 Evaluation The TRECVid 2008 Evaluation

TRECVID-2005: Search Task Alan Smeaton Dublin City University & Tzveta Ianeva NIST

VIREO-TNO @ TRECVID 2014 Zero-Shot Event Detection and Recounting Speaker: Maaike de Boer (TNO)

George Awad National Institute of Standards and Technology Dakota Consulting, Inc 2 TRECVID

TRECVID-2005 Low-level (camera motion) feature task Wessel Kraaij TNO & Tzveta Ianeva

Columbia HLF: TRECVID2006 TRECVID TRECVID TRECVID 2005 2005 2005 (development)

TRECVID 2015 INSTANCE RETRIEVAL INTRODUCTION AND TASK OVERVIEW Wessel Kraaij TNO; Radboud

TRECVID 2016 AD-HOC VIDEO SEARCH TASK : OVERVIEW Georges Qunot Laboratoire d'Informatique de

Detection of neutral particles detection of neutrons detection of neutrinons detection of low

Boundary Line Discussion February 2018 Agenda District vision Why boundary line

Boundary value problems What problems does boundary value testing have? ECT2 Boundary

Learning From Video Browse Behavior Learning From Video Browse Behavior TRECVID 2009 TRECVID

TRECVID 2008 CBCD TRECVID 2008. CBCD MCG-ICT-CAS MCG-ICT-CAS Sheng Tang Yongdong Zhang Ke Gao

Algorithms R OBERT S EDGEWICK | K EVIN W AYNE 2.1 E LEMENTARY S ORTS rules of the game

heaps 3 Nov. 6, 2017 1 STEM Support https://infomcgillste m.wixsite.com/stems upportmcgill

Blazar Optical Sky Survey BOSS Project (2013-2016) and the long-term optical variability

X-ray irradiation Peter Wheatley and evaporation University of Warwick Contributors: Tom Louden,

Jam-barrel politics: Road building and legislative voting in Colombia Leonardo Bonilla-Mej a

Multi-parameter regularization for ill-posed problems with noisy right hand side and noisy

r r

TeV Par(cle Astrophysics with the High Al(tude Water Cherenkov (HAWC) Gamma-Ray Observatory

Sambuz

Useful Links

Newsletter

Mail Us

TRECVID-2006: Shot Boundary Detection Task Overview Alan Smeaton - PowerPoint PPT Presentation

TRECVID-2006: Shot Boundary Detection Task Overview Alan Smeaton Dublin City University & Paul Over NIST SB Task Definition Shot boundary detection is a fundamental task in any kind of video content manipulation Task provides

TRECVID-2005: Shot Boundary Detection Task Overview Alan Smeaton Dublin City University &amp;

SHOT Brand Price NOTES WEST COAST MAGNUM SIZES 4 - 9 $ 39.20 Eagle shot prices may not be

CMU @ TRECVID Event Detection @ Ming-yu Chen &amp; Alex Hauptmann School of Computer Science

Event Detection in Airport Surveillance The TRECVid 2008 Evaluation The TRECVid 2008 Evaluation

TRECVID-2005: Search Task Alan Smeaton Dublin City University &amp; Tzveta Ianeva NIST

VIREO-TNO @ TRECVID 2014 Zero-Shot Event Detection and Recounting Speaker: Maaike de Boer (TNO)

George Awad National Institute of Standards and Technology Dakota Consulting, Inc 2 TRECVID

TRECVID-2005 Low-level (camera motion) feature task Wessel Kraaij TNO &amp; Tzveta Ianeva

Columbia HLF: TRECVID2006 TRECVID TRECVID TRECVID 2005 2005 2005 (development)

TRECVID 2015 INSTANCE RETRIEVAL INTRODUCTION AND TASK OVERVIEW Wessel Kraaij TNO; Radboud

TRECVID 2016 AD-HOC VIDEO SEARCH TASK : OVERVIEW Georges Qunot Laboratoire d'Informatique de

Detection of neutral particles detection of neutrons detection of neutrinons detection of low

Boundary Line Discussion February 2018 Agenda District vision Why boundary line

Boundary value problems What problems does boundary value testing have? ECT2 Boundary

Learning From Video Browse Behavior Learning From Video Browse Behavior TRECVID 2009 TRECVID

TRECVID 2008 CBCD TRECVID 2008. CBCD MCG-ICT-CAS MCG-ICT-CAS Sheng Tang Yongdong Zhang Ke Gao

Algorithms R OBERT S EDGEWICK | K EVIN W AYNE 2.1 E LEMENTARY S ORTS rules of the game

heaps 3 Nov. 6, 2017 1 STEM Support https://infomcgillste m.wixsite.com/stems upportmcgill

Blazar Optical Sky Survey BOSS Project (2013-2016) and the long-term optical variability

X-ray irradiation Peter Wheatley and evaporation University of Warwick Contributors: Tom Louden,

Jam-barrel politics: Road building and legislative voting in Colombia Leonardo Bonilla-Mej a

Multi-parameter regularization for ill-posed problems with noisy right hand side and noisy

r r

TeV Par(cle Astrophysics with the High Al(tude Water Cherenkov (HAWC) Gamma-Ray Observatory

Sambuz

Useful Links

Newsletter

Mail Us

TRECVID-2005: Shot Boundary Detection Task Overview Alan Smeaton Dublin City University &

CMU @ TRECVID Event Detection @ Ming-yu Chen & Alex Hauptmann School of Computer Science

TRECVID-2005: Search Task Alan Smeaton Dublin City University & Tzveta Ianeva NIST

TRECVID-2005 Low-level (camera motion) feature task Wessel Kraaij TNO & Tzveta Ianeva