DIGITAL
Institute for Information and Communication Technologies
JOANNEUM RESEARCH and Vienna University of Technology at the INS Task
Werner Bailer
TRECVID Workshop, Nov. 2010
Face matching:
- perform face detection (Viola-Jones)
- if a face is detected, extract a Gabor wavelet descriptor from the face region
- match against the descriptors of all face regions in the database (k-NN search)
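The k-NN search step above can be sketched as follows. This is a minimal illustration assuming the Gabor wavelet face descriptors are already extracted as fixed-length vectors; the descriptor dimensionality and the random data are hypothetical.

```python
import numpy as np

def knn_match(query_desc, db_descs, k=5):
    """Return indices of the k database descriptors closest to the query
    (Euclidean distance, brute-force search)."""
    dists = np.linalg.norm(db_descs - query_desc, axis=1)
    return np.argsort(dists)[:k]

# Hypothetical example: 100 random 40-D "face descriptors"
rng = np.random.default_rng(0)
db = rng.normal(size=(100, 40))
q = db[42] + 0.01 * rng.normal(size=40)   # near-duplicate of entry 42
nearest = knn_match(q, db, k=3)
print(nearest[0])                          # nearest neighbour is entry 42
```

A real system would replace the brute-force scan with an approximate index once the database grows large.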
HoG (not used for person/character queries):
- descriptor with 36 bins (9 orientations, 4 cells)
- cell layout adapted to the aspect ratio of the query object: 2x2 or 1x4 cells
- search window shifted by 1/4 of the cell size
- 3 scales: 1x, 1.5x and 2x the initial size
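A sketch of the 36-bin descriptor (9 orientation bins in each of 4 cells). The aspect-ratio-dependent cell layout, the window shifting and the three scales described above are omitted; the binning and normalisation details here are assumptions, not the exact implementation.

```python
import numpy as np

def hog36(patch):
    """36-bin HoG for a grayscale patch: 9 unsigned-orientation bins
    in each of 2x2 cells, L2-normalised."""
    gy, gx = np.gradient(patch.astype(float))
    mag = np.hypot(gx, gy)
    ang = np.mod(np.arctan2(gy, gx), np.pi)           # unsigned orientation
    bins = np.minimum((ang / np.pi * 9).astype(int), 8)
    h, w = patch.shape
    desc = []
    for cy in range(2):                               # 2x2 cell layout
        for cx in range(2):
            sl = (slice(cy * h // 2, (cy + 1) * h // 2),
                  slice(cx * w // 2, (cx + 1) * w // 2))
            hist = np.bincount(bins[sl].ravel(),
                               weights=mag[sl].ravel(), minlength=9)
            desc.append(hist)
    desc = np.concatenate(desc)
    n = np.linalg.norm(desc)
    return desc / n if n > 0 else desc

d = hog36(np.arange(64.0).reshape(8, 8))
print(d.shape)    # (36,)
```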
Region covariance:
- covariance of a rectangular region (can be computed efficiently using integral images)
- computed from RGB and first-order derivatives of intensity
- same cell sizes/scales as for HoG
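The descriptor of a single region can be sketched directly from this definition, a per-pixel feature vector of (R, G, B, |dI/dx|, |dI/dy|) and its covariance matrix. The integral-image speed-up mentioned above is omitted here for clarity, and the exact feature choice (absolute derivatives) is an assumption.

```python
import numpy as np

def region_covariance(rgb):
    """5x5 covariance descriptor of a rectangular RGB region.
    Per-pixel features: R, G, B and the absolute first-order
    derivatives of intensity."""
    rgb = rgb.astype(float)
    intensity = rgb.mean(axis=2)
    gy, gx = np.gradient(intensity)
    feats = np.stack([rgb[..., 0], rgb[..., 1], rgb[..., 2],
                      np.abs(gx), np.abs(gy)], axis=-1)
    flat = feats.reshape(-1, 5)           # one row per pixel
    return np.cov(flat, rowvar=False)

rng = np.random.default_rng(1)
C = region_covariance(rng.integers(0, 256, size=(16, 16, 3)))
print(C.shape)    # (5, 5)
```

Covariance matrices of different regions are usually compared with a dedicated metric on symmetric positive-definite matrices rather than plain Euclidean distance.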
SIFT matching:
- SIFT descriptors from DoG points
- matching: voting in a position histogram (bins of 1/10 of the image size), report a match for bins with 5+ votes
Bag of features:
- SIFT descriptors from DoG points, local and global codebooks
- codebook sizes 100 and 1000 for both
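The quantisation into a bag-of-features histogram can be sketched as below. The codebook and descriptors here are random toy data; the slides use codebooks of size 100 and 1000 built over SIFT descriptors, and the hard nearest-word assignment is an assumption.

```python
import numpy as np

def bof_histogram(descriptors, codebook):
    """Assign each local descriptor to its nearest codebook word and
    return the L1-normalised bag-of-features histogram."""
    d = np.linalg.norm(descriptors[:, None, :] - codebook[None, :, :],
                       axis=2)                  # (n_desc, n_words) distances
    words = d.argmin(axis=1)                    # hard assignment
    hist = np.bincount(words, minlength=len(codebook)).astype(float)
    return hist / hist.sum()

rng = np.random.default_rng(2)
cb = rng.normal(size=(100, 128))     # toy 100-word codebook
descs = rng.normal(size=(50, 128))   # 50 toy 128-D "SIFT" descriptors
h = bof_histogram(descs, cb)
print(h.sum())                        # 1.0
```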
Fusion (max_max): for each shot in the results, take the maximum score over all samples and features.
Fusion (topK): for each feature, take for each shot the maximum over all samples; rerank per feature; take the top k per feature (k = 1000 / number of features used).
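The per-feature step above can be sketched as follows; the shot names and scores are hypothetical.

```python
def fuse_max_topk(scores, k):
    """For one feature: take per shot the maximum score over all query
    samples, rerank the shots by that score, and keep the top k.
    With f features used, the slides set k = 1000 / f."""
    best = {shot: max(s) for shot, s in scores.items()}
    ranked = sorted(best, key=best.get, reverse=True)
    return ranked[:k]

# Hypothetical per-sample scores for one feature
scores = {"shot_a": [0.2, 0.9], "shot_b": [0.5, 0.4], "shot_c": [0.1, 0.3]}
top = fuse_max_topk(scores, k=2)
print(top)    # ['shot_a', 'shot_b']
```

The per-feature top-k lists are then merged into the final result list.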
Weighted fusion: weight features by their relative performance. For each sample, determine where the other samples would be ranked in the result list if they were in the database.
w_bestR: determine the mean best rank over all samples for each feature, and calculate the feature weight from it.
w_t100: determine how many samples are among the top 100 results, and calculate the feature weight from it.
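The two weighting schemes can be sketched as below. The exact weight formulas are not preserved in the extracted slides, so the inverse mean best rank and the top-100 fraction used here are plausible stand-ins, not the authors' actual formulas.

```python
def weight_best_rank(best_ranks):
    """w_bestR sketch: weight a feature by the inverse of the mean best
    rank of the held-out query samples (assumed formula: lower mean
    rank -> higher weight)."""
    return 1.0 / (sum(best_ranks) / len(best_ranks))

def weight_top100(ranks):
    """w_t100 sketch: weight a feature by the fraction of held-out
    samples ranked in the top 100 results (assumed formula)."""
    return sum(r <= 100 for r in ranks) / len(ranks)

w1 = weight_best_rank([1, 3])      # mean best rank 2 -> weight 0.5
w2 = weight_top100([5, 50, 500])   # 2 of 3 samples in the top 100
print(w1, w2)
```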
[Chart: performance of the fusion runs JRS rank max_max, JRS rank topK, JRS rank w_bestR and JRS rank w_t100; values roughly between 0.01 and 0.1]
[Chart: per-feature performance of BOF100G, BOF100L, BOF1000G, BOF1000L, Gabor, HoG, RegCov and SIFT, as mean over all topics and per query type (person, character, object, location); values up to about 0.025]
Challenges: different sizes, lighting, perspectives, …; "needle in a haystack": very few relevant results in a large set with many similar objects (e.g. pedestrian crossing, blinds).
As expected, our features perform best for object queries. Better results might be possible for some of the features, but would make the matching process more costly.
Overall, the fusion methods using information from the query samples perform better; there is only a slight difference for object queries.
For person and object queries, a single feature outperforms the best fused results. There are few topics for the other query types, so it is difficult to say whether fusion is actually useful in these cases.
The research leading to these results has received funding from the European Union's Seventh Framework Programme under the grant agreements no. FP7-215475, "2020 3D Media – Spatial Sound and Vision" (http://www.20203dmedia.eu/) and no. FP7-248138, "FascinatE – Format-Agnostic Script-based INterAcTive Experience" (http://www.fascinate-project.eu/), as well as from the Austrian FIT-IT project "IV-ART – Intelligent Video Annotation and Retrieval Techniques".