INS Task in 2016-present 2016-present: find a specific person in a - PowerPoint PPT Presentation

Person-Location Instance Search via Progressive Extension and Intersection Pushing —— NII_Hitachi_UIT team at TRECVID2018 Instance Search T ask Zheng Wang and Shin’ichi Satoh National Institute of Informatics, Japan

INS Task in 2016-present • 2016-present: find a specific person in a specific location The Figure refers to [1] PKU_ICST at TRECVID 2017: Instance Search Task

INS Task Data • BBC EeastEnders (2013-present): drama series, “small world” many repeated instances (person, location, objects, ...) • The BBC and the AXES project made 464 hours of the BBC soap opera EastEnders available for research in MPEG-4 • 244 weekly “omnibus” files from 5 years of broadcasts • 471527 shots • Average shot length: 3.5 seconds • Transcripts from BBC • Per-file metadata • Represents a “small world” with a slowly changing set of: • People (several dozen) • Locales: homes, workplaces, pubs, cafes, open-air market, clubs • Objects: clothes, cars, household goods, personal possessions, pets, etc • Views: various camera positions, times of year, times of day

Comparison with task in 2013-2015 2013-2015 2016-present Data Source The same Topics object / person / location person + location query Image + mask Person: image + mask Location: 6-12 images Related video shots Characteristic One condition Two conditions together Difficulty Instance with different scales Persons / locations have and types different views Person and location influence to each other, can not be searched out simultaneously

Example

Related Systems Location retrieval The same routine: Merge results Person retrieval Person retrieval Location retrieval Merge results BUPT- face retrieval (dlib) RootSIFT+AlexNet Peron guide location+ location guide MCPRL person re-identification (Faster RCNN + VGG-16 Places365 person + random forest fc layer feature) transcript-based IRIM HOG detector + ResNet pre-trained on Bow + Filter out person Credits shots filtering FaceScrub & VGG-Face Pretrained GoogLeNet Places365 Indoor/Outdoor shots filtering Viola-Jones detector + FC7 of a VGG16 Shots threads filtering network Late fusion PKU_ICST VGG-Face + Cosine + SVM+ AKM-based (6 kinds of BoW) Peron guide location+ location guide Progressive training DNN-based person + highlight common clues (VGGnet+GoogleNet+ResNet) + Semi-supervised re-ranking Progressive training

State-of-the-art Systems IRIM at TRECVID 2017 (MAP = 0.4466) PKU_ICST at TRECVID 2017 (0.549) Pers1 ： HOG detector + ResNet pre-trained on FaceScrub & VGG-Face Location-specific search : AKM-based (6 kinds of BoW) + DNN- Pers2 ： Viola-Jones detector + FC7 of a VGG16 network based (VGGnet+GoogleNet+ResNet) Loc1 ： Bow + Filter out person Person-specific search ： VGG-Face + Cosine + SVM Loc2 ： GoogLeNet Places365 Re-ranking ： Semi-supervised re-ranking method (fusion)

Difficulties • additional difficulties for person + location : person search and location search are always in a dilemma. scenes are with low light or blur person faces are non-front or occluded although it is a wide-angle view scene, scenes are blocked by persons the person faces are very small [2] J Lan, J Chen, Z Wang, C Liang, S Satoh, PS Instance Retrieval via Early Elimination and Late Expansion, ACM MM Workshop, 2017

Difficulties • additional difficulties for person + location : person search and location search are always in a dilemma. Topic 9210 in TRECVID INS 2017 Topic 9170 in TRECVID INS 2016 low scene score V.S. high person score high scene score V.S. low person score

Motivation An example for consecutive shots in a time slice. Although the shots contain the target person in the target location, the person and location scores are not always high simultaneously. Neighbor shots will be helpful.

Framework • Person search • We use face cues to do person search. • FaceNet [3] framework is used for face recognition. • Multi-task CNN [4] is utilized for face detection and alignment. • The network is trained using softmax loss with the Inception-Resnet-v1 model. • The training data is VGGFace2. • No BBC EeastEnders data is used for training. • For each query person, we collect 10 face images with different views. • We use max-pooling strategy to achieve the similarity between one shot and one query topic. • The similarity scores are normalized to [0, 1]. [3] FaceNet: A Unified Embedding for Face Recognition and Clustering, https://github.com/davidsandberg/facenet [4] Joint Face Detection and Alignment using Multi-task Cascaded Convolutional Networks, https://kpzhang93.github.io/MTCNN_face_detection_alignment/index.html

Framework • Location search • We use two kinds of routines to obtain the location similarity scores. • BOW stands for the hand crafted based routine. We exploit the method used in TRECVID INS 2017. • DIR [5] stands for the deep learning based routine. • For each query location, we use query extension to combine global features of all corresponding query images. • We use max-pooling strategy to achieve the similarity between one shot and one query topic. • The similarity scores are normalized to [0, 1]. [5] Deep Image Retrieval: Learning global representations for image search, https://github.com/figitaki/deep-retrieval

Framework • Fusion threshold Person score threshold Extension Intersection Location score • The method includes multiple iterations, so that we can obtain top results progressively. • For each iteration, • We first extend the shot scores with high neighbor shot scores. • We do intersection to get the top results.

Results Extension: the number of neighbor shots extended. Iteration: the times of intersection shots pushing. Shots before Intersection: the number of shots selected before intersection. Extension should be fine-grained The iteration times should be large

Results-AP 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 a d a p t 0 9219 9220 9221 9222 9223 9224 9225 9226 9227 9228 9229 9230 9231 9232 9233 9234 9235 9236 9237 9238 9239 9240 9241 9242 9243 9244 9245 9246 9247 9248 NII_Hitachi_UIT PKU_ICST IRIM 9 2 3 3 M o + L a u n d r e t t e G o o d r e s u l t s : 9 2 3 6 D a r r i n + L a u n d r e t t e O u r m e t h o d d o e s n o t p e r f o r m w e l l i n s o m e s c e n e s . O u r l o c a t i o n s e a r c h m o d e l d o e s n o t 9 2 2 2 Chelsea+Cafe2 a d a p t t o t h e n e w I N S d o m a i n . 9 2 2 8 Garry+Cafe2 B a d r e s u l t s : 9 2 3 7 Zainab+Cafe2 9240 Heather+Cafe2

Results-Hits at depth 10/30 in the result set 10 8 6 4 2 0 9219 9220 9221 9222 9223 9224 9225 9226 9227 9228 9229 9230 9231 9232 9233 9234 9235 9236 9237 9238 9239 9240 9241 9242 9243 9244 9245 9246 9247 9248 NII_Hitachi_UIT PKU_ICST IRIM 30 25 20 15 10 5 0 9219 9220 9221 9222 9223 9224 9225 9226 9227 9228 9229 9230 9231 9232 9233 9234 9235 9236 9237 9238 9239 9240 9241 9242 9243 9244 9245 9246 9247 9248 NII_Hitachi_UIT PKU_ICST IRIM 9 2 2 8 Garry+Cafe2 F o r t h e t o p r e s u l t s , o u r m e t h o d p e r f o r m s s i m i l a r 9 2 3 4 Darrin+Cafe2 B a d r e s u l t s : t o t h e o t h e r m e t h o d s . 9 2 3 7 Zainab+Cafe2

Thanks

INS Task in 2016-present 2016-present: find a specific person in a - PowerPoint PPT Presentation

Person-Location Instance Search via Progressive Extension and Intersection Pushing NII_Hitachi_UIT team at TRECVID2018 Instance Search T ask Zheng Wang and Shinichi Satoh National Institute of Informatics, Japan INS Task in

Inertial Sensing & Navigation 3/9/2016 3/15/2017 GPS Noise Source: www.insidegnss.com INS

Introducing OSGi Eclipse Plug-ins 1 Plug-in State Information Plug-in Structure

Present and Powerful Present and Powerful Psalm 46:1 God is our refuge and strength, an

Bond Task Force Draft Bond Task Force Recommendations Tuesday, February 27 , 2018 Bond Task

Task 1d: River basin management Task leader: LNEC; Involved partners EU: ISPRA, DTU, EWA Task

p wered Yva productivity AI Task Manager @nerdybff Task Management Task Management Todoist

CGO Task Presentation CGO Task Presentation CGO Task Presentation Effective Task Presentation

Installing The Web Tools Platform Plug-ins In this column, I present how to get The Eclipse Web

LENNAR CORP LENNAR CORP v. v. MARKEL AMERICAN INS. MARKEL AMERICAN INS. R. Brent Cooper R.

F elipe Mandarino Pereira P Pereira P as as s s os os Ins Ins titute, R titute, R io

Ins & Outs Ins & Outs of Contrac of Contracts Office of Research Support &

Ins & Outs of Contrac Ins & Outs of Contracts Office of Research Support &

Market Risk Management Gen Ins. Risk Management Life Ins. Risk Management Op.

2/17/2016 1 2/17/2016 2 2/17/2016 3 2/17/2016 4 2/17/2016 5 2/17/2016 6 2/17/2016 7

IEA Bioenergy IEA BIOENERGY Task 42 Biorefinery 5 th Task Meeting Dublin, Ireland, 25/26 March

WASC 2019 Findings Presentation to stakeholders February 2019 2 3 Process Task 1 Task 2

SEASON EXTENSION CONSIDERATIONS FOR ANNUAL VEGET ABLE PRODUCTION Brian Sebade Agriculture and

CollectionSpace Configuration and Extension i290-rmm Patrick Schmitz Overview

Symbolic Integration DART IV, Beijing, China V. Ravi Srinivasan Rutgers University-Newark

Evolving Adabot: A Mobile Robot with Adjustable Wheel Extensions Motivation: Search and Rescue

Variable Length Relationship Pattern Extensions Teon Banek March , cbnd About

Functional properties of Sobolev extensions Pekka Koskela Modern Aspects of Complex Analysis and

Designing a Web of Highly-Configurable Designing a Web of Highly-Configurable Intrusion Detection

HASHI: An Application-Specific Instruction Set Extension for Hashing Oliver Arnold, Sebastian

INS Task in 2016-present 2016-present: find a specific person in a - PowerPoint PPT Presentation

Person-Location Instance Search via Progressive Extension and Intersection Pushing NII_Hitachi_UIT team at TRECVID2018 Instance Search T ask Zheng Wang and Shinichi Satoh National Institute of Informatics, Japan INS Task in

Inertial Sensing &amp; Navigation 3/9/2016 3/15/2017 GPS Noise Source: www.insidegnss.com INS

Introducing OSGi Eclipse Plug-ins 1 Plug-in State Information Plug-in Structure

Present and Powerful Present and Powerful Psalm 46:1 God is our refuge and strength, an

Bond Task Force Draft Bond Task Force Recommendations Tuesday, February 27 , 2018 Bond Task

Task 1d: River basin management Task leader: LNEC; Involved partners EU: ISPRA, DTU, EWA Task

p wered Yva productivity AI Task Manager @nerdybff Task Management Task Management Todoist

CGO Task Presentation CGO Task Presentation CGO Task Presentation Effective Task Presentation

Installing The Web Tools Platform Plug-ins In this column, I present how to get The Eclipse Web

LENNAR CORP LENNAR CORP v. v. MARKEL AMERICAN INS. MARKEL AMERICAN INS. R. Brent Cooper R.

F elipe Mandarino Pereira P Pereira P as as s s os os Ins Ins titute, R titute, R io

Ins &amp; Outs Ins &amp; Outs of Contrac of Contracts Office of Research Support &amp;

Ins &amp; Outs of Contrac Ins &amp; Outs of Contracts Office of Research Support &amp;

Market Risk Management Gen Ins. Risk Management Life Ins. Risk Management Op.

2/17/2016 1 2/17/2016 2 2/17/2016 3 2/17/2016 4 2/17/2016 5 2/17/2016 6 2/17/2016 7

IEA Bioenergy IEA BIOENERGY Task 42 Biorefinery 5 th Task Meeting Dublin, Ireland, 25/26 March

WASC 2019 Findings Presentation to stakeholders February 2019 2 3 Process Task 1 Task 2

SEASON EXTENSION CONSIDERATIONS FOR ANNUAL VEGET ABLE PRODUCTION Brian Sebade Agriculture and

CollectionSpace Configuration and Extension i290-rmm Patrick Schmitz Overview

Symbolic Integration DART IV, Beijing, China V. Ravi Srinivasan Rutgers University-Newark

Evolving Adabot: A Mobile Robot with Adjustable Wheel Extensions Motivation: Search and Rescue

Variable Length Relationship Pattern Extensions Teon Banek March , cbnd About

Functional properties of Sobolev extensions Pekka Koskela Modern Aspects of Complex Analysis and

Designing a Web of Highly-Configurable Designing a Web of Highly-Configurable Intrusion Detection

HASHI: An Application-Specific Instruction Set Extension for Hashing Oliver Arnold, Sebastian

Inertial Sensing & Navigation 3/9/2016 3/15/2017 GPS Noise Source: www.insidegnss.com INS

Ins & Outs Ins & Outs of Contrac of Contracts Office of Research Support &

Ins & Outs of Contrac Ins & Outs of Contracts Office of Research Support &