Real - Time Face Recognition on Jetson Tx2 using TensorRT Tamas - - PowerPoint PPT Presentation

real time face recognition on jetson tx2 using tensorrt
SMART_READER_LITE
LIVE PREVIEW

Real - Time Face Recognition on Jetson Tx2 using TensorRT Tamas - - PowerPoint PPT Presentation

Real - Time Face Recognition on Jetson Tx2 using TensorRT Tamas Grobler 11 . 10 . 2017 GTC Table of Contents The The Problem Results Conclusion solution Real Time Video Recognition Testing Method Real Time Analysis Framework Recognition


slide-1
SLIDE 1

Real-Time Face Recognition

  • n Jetson Tx2 using

TensorRT

Tamas Grobler 11.10.2017 GTC

slide-2
SLIDE 2

Real-Time Face Recognition

Table of Contents

Real Time Video Analysis Face Detection Face Recognition

The Problem

Recognition Framework Training Method

The solution

Testing Method Recognition results Inference time comparison Real Time Considerations Video

Conclusion

01

Results

slide-3
SLIDE 3

Real-Time Face Recognition

02

Face Detection

slide-4
SLIDE 4

WHO?

Real-Time Face Recognition

03

Face Recognition

slide-5
SLIDE 5

Recognition Pipeline

Real-Time Face Recognition

04

slide-6
SLIDE 6

Head detection

Region-based CNN methodology

Backbone: PVANet with HyperNet-inspired hyper- features Region Proposal Network from Faster R-CNN Network Scales with Image Size

Training:

Non Maximum Suppression Backbone pre-trained on ImageNet Hard Negative Mining 05

Real-Time Face Recognition

Ultinous film head corpus + subset of HollywoodHeads dataset

slide-7
SLIDE 7

Face Recognition

Base Model: GoogLeNet Multiple crops of each image (160x160 -> 144x144) Frontalization using Spatial Transformer Network (144x144 -> 128x128) Training

MsCeleb dataset: ~10M images from ~100k classes Augmentation: random crops, mirroring Trained for both Classification and Discrimination (Triplet Loss) with Hard Negative Mining

Real-Time Face Recognition

06

slide-8
SLIDE 8

Results: Recognition

Test Data: NIST IJB-A Dataset

Average over 10 splits 1000 positive/10000 negative pairs per split 07

Real-Time Face Recognition

slide-9
SLIDE 9

Measurement Setup

NVIDIA Pascal architecture 3840 NVIDIA CUDA Cores 12 GB memory; 547.7 GB/s NVIDIA Pascal architecture 256 NVIDIA CUDA Cores 8 GB memory; 58.3 GB/s

NVIDIA Titan Xp

Caffe NVIDIA TensorRT

NVIDIA Jetson TX2

08

Real-Time Face Recognition

NVIDIA CUDA 8 NVIDIA cuDNN 7 3.0 Release Candidate 32b, 16b, 8b arithmetic

slide-10
SLIDE 10

Inference Time Comparison: Caffe vs. TensorRT

09

Real-Time Face Recognition

slide-11
SLIDE 11

Inference Time Comparison: Titan Xp vs. Jetson TX2

10

Real-Time Face Recognition

slide-12
SLIDE 12

Conclusion

Frame rate: 10 fps -> 100 ms for detection + recognition Head Detection time per 1536x864 frame (speculative for 16 bit): < 30 ms Face recognition can handle 50 images in ~66 ms 5 crops per face This allows for 10 simultaneous recognitions per frame (30 ms + 66 ms) Future work: test TensorRT with Int8 arithmetic for both accuracy and inference time 11

Real-Time Face Recognition

Real-Time Considerations for Jetson TX2 with TensorRT (16-bit arithmetic)

slide-13
SLIDE 13

Real-Time Face Recognition

12

slide-14
SLIDE 14

Contributors Contact

Real-Time Face Recognition

13 György Balogh János Locki József Németh Attila Szabó tamas.grobler@ultinous.com ultinous.com

slide-15
SLIDE 15

References

[1] arXiv:1611.08588 PVANet: Lightweight Deep Neural Networks for Real-time Object Detection Sanghoon Hong, Byungseok Roh, Kye-Hyeon Kim, Yeongjae Cheon, Minje Park [2] arXiv:1604.00600 HyperNet: Towards Accurate Region Proposal Generation and Joint Object Detection Tao Kong, Anbang Yao, Yurong Chen, Fuchun Sun [3] arXiv:1506.01497 Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks Shaoqing Ren, Kaiming He, Ross Girshick, Jian Sun [4] arXiv:1409.4842 Going deeper with convolutions Christian Szegedy, Wei Liu, Yangqing Jia, Pierre Sermanet, Scott Reed, Dragomir Anguelov, Dumitru Erha, Vincent Vanhoucke, Andrew Rabinovich [5] arXiv:1701.07174 Towards End-to-End Face Recognition through Alignment Learning Yuanyi Zhong, Jiansheng Chen, Bo Huang [6] “This product contains or makes use of the following data made available by the Intelligence Advanced Research Projects Activity (IARPA): IARPA Janus Benchmark A (IJB-A) data detailed at Face Challenges homepage." [7] arXiv:1408.5093 Caffe: Convolutional Architecture for Fast Feature Embedding Yangqing Jia, Evan Shelhamer, Jeff Donahue, Sergey Karayev, Jonathan Long, Ross Girshick, Sergio Guadarrama, Trevor Darrell

Real-Time Face Recognition

14