Dipl.-Inf. Robert Manthey HSMW_TUC at TRECVID Instance Search 2018 - - PowerPoint PPT Presentation

▶

Jul 29, 2023 570 likes •705 views

INS approach with pertained models and web based interactive evaluation (HSMW_TUC) Dipl.-Inf. Robert Manthey HSMW_TUC at TRECVID Instance Search 2018 13. November 2018 1 General System Design Used frameworks: Focus mainly on

SLIDE 1

HSMW_TUC at TRECVID Instance Search 2018

13. November 2018

1 INS approach with pertained models and web based interactive evaluation (HSMW_TUC)

Dipl.-Inf. Robert Manthey

SLIDE 2

HSMW_TUC at TRECVID Instance Search 2018

13. November 2018

2

Used frameworks:

Places365

(Locations)

Color Thief

(Color Features)

Detectron

(Persons&Objects)

Yolo9000

(Persons&Objects)

FaceNet

(Faces)

OpenFace

(Faces)

FaceRecognition

(Faces)

TuriCreate

(Clustering)

Laravel

(Web service)

General System Design

[1]

Focus mainly on architecture
Docker containers
Metadata in relational database
Data and feature extraction

through existing frameworks

Management and data

distribution through webservice, API and HTTP

[2] [3] [4] [5] [6] [7] [8] [9]

SLIDE 3

HSMW_TUC at TRECVID Instance Search 2018

13. November 2018

3

BBC EastEnders

characters known

Google image search

grab samples

Semi-automatic

enhancement

Ground Truth with

50-300 images/character Preprocessing Person Images

SLIDE 4

HSMW_TUC at TRECVID Instance Search 2018

13. November 2018

4

Multiple detections frameworks

per frame

Use Ground Truth to recognize

EastEnders characters

Multiple recognition frameworks

per detection

Storing of intermediate

recognition results and their scoring for further processing Recognizing Person Unit

SLIDE 5

HSMW_TUC at TRECVID Instance Search 2018

13. November 2018

5

Visual representation of

results with webservice

False detections

decreases with increasing of score value

Number of images

decreases with increasing of score value Person Recognition Results

No knowledge from visualisation included into automatic evaluation

SLIDE 6

HSMW_TUC at TRECVID Instance Search 2018

13. November 2018

6

Google image search grab sample

images of classes

Ground Truth to recognize

locations classes

Processed by multiple frameworks
Storing ten most probable

classifications of Places per image

Ten most dominant color

from Colorthief

TuriCreate determine ten

most similar images to create similarity classifier

Storing of intermediate results and

their scoring

Recognizing Location Unit

SLIDE 7

HSMW_TUC at TRECVID Instance Search 2018

13. November 2018

7

Visual represen-

tation of results

Analysing the query
Combination of

person and location

Retrieving best

match form database

Multiple iterations
f replenish to get

1000 result images if needed

Location Recognition Results & Fusion

No knowledge from visualisation included into automatic evaluation

SLIDE 8

HSMW_TUC at TRECVID Instance Search 2018

13. November 2018

9

Holistic Workflow

SLIDE 9

HSMW_TUC at TRECVID Instance Search 2018

13. November 2018

11

Fully reconstructed, flexible and extendable system
Main focus on infrastructure cause only mediocre results
Fusion of results from different frameworks need optimization
Automatic runs:

MAP: ~0.1 (1-3) Prec@100: ~0.26

Interactive run:

MAP: ~0.25 (4) Prec@100: ~0.45

Two different frameworks for reliable person detection
Small differences in frames result in different prediction values

Results

SLIDE 10

HSMW_TUC at TRECVID Instance Search 2018

13. November 2018

12

Thank you for your attention. Any questions?

Multiple use of containers and frameworks
Flexible and extendable infrastructure design
Web-based UI for visualisation and interactive evaluation
Interactive outperforms automatic runs
Multiple frameworks for same task may improve results
Advantages in data fusion needed

Summary

SLIDE 11

HSMW_TUC at TRECVID Instance Search 2018

13. November 2018

13

1. Zhou, B., Lapedriza, A., Khosla, A., Oliva, A., and Torralba, A.: Places: A 10 million Image Database for Scene Recognition, IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017. 2. Feng, S.: 27708 69459 Thief, https://github.com/fengsp/color-thief-py 3. Girshick, R., Radosavovic, I., Gkioxari, G., Dollár, P., and He, K.: Detectron, https://github.com/facebookresearch/detectron, 2018 4. Redmon, J. and Farhadi, A.: YOLO9000: Better, Faster, Stronger, arXiv.org,

p. arXiv:1612.08242, http://arxiv.org/abs/1612.08242v1, 2016.

5. Schroff, F., Kalenichenko, D., and Philbin, J.: FaceNet: A Unified Embedding for Face Recognition and Clustering, ArXiv e-prints, 2015. 6. Satyanarayanan, M., Ludwiczuk, B., and Amos, B.: OpenFace: A general-purpose face recognition library with mobile applications, https://cmusatyalab.github.io/openface/. 7. Geitgey, A. and Nazario, J.: Face Recognition, https://github.com/ageitgey/face recognition, 2017. 8. Sridhar, K., Larsson, G., Nation, Z., Roseman, T., Chhabra, S., Giloh, I., de Oliveira Carvalho, E. F., Joshi, S., Jong, N., Idrissi, M., and Gnanachandran, A.: Turi Create, https://github.com/apple/turicreate, viewed: 2018-10-12, 2018. 9. Chen, X., Ji, Z., Fan, Y., and Zhan, Y.: Restful API Architecture Based on Laravel Framework, Journal of Physics: Conference Series, 910, 012 016, 2017

HSMW_TUC at TRECVID Instance Search 2018

1

INS approach with pertained models and web based interactive evaluation (HSMW_TUC)

Dipl.-Inf. Robert Manthey

HSMW_TUC at TRECVID Instance Search 2018

2

Used frameworks:

(Locations)

(Color Features)

(Persons&Objects)

(Persons&Objects)

(Faces)

(Faces)

(Faces)

(Clustering)

(Web service)

General System Design

through existing frameworks

distribution through webservice, API and HTTP

HSMW_TUC at TRECVID Instance Search 2018

3

characters known

grab samples

enhancement

50-300 images/character Preprocessing Person Images

HSMW_TUC at TRECVID Instance Search 2018

4

per frame

EastEnders characters

per detection

recognition results and their scoring for further processing Recognizing Person Unit

HSMW_TUC at TRECVID Instance Search 2018

5

results with webservice

decreases with increasing of score value

decreases with increasing of score value Person Recognition Results

HSMW_TUC at TRECVID Instance Search 2018

6

images of classes

locations classes

classifications of Places per image

from Colorthief

most similar images to create similarity classifier

their scoring

Recognizing Location Unit

HSMW_TUC at TRECVID Instance Search 2018

7

tation of results

person and location

match form database

1000 result images if needed

Location Recognition Results & Fusion

HSMW_TUC at TRECVID Instance Search 2018

9

Holistic Workflow

HSMW_TUC at TRECVID Instance Search 2018

11

MAP: ~0.1 (1-3) Prec@100: ~0.26

MAP: ~0.25 (4) Prec@100: ~0.45

Results

HSMW_TUC at TRECVID Instance Search 2018

12

Thank you for your attention. Any questions?

Summary

HSMW_TUC at TRECVID Instance Search 2018

13

References