AI Argus A Unique Insight Into Logistics cs Neo Song SF Technology - - PowerPoint PPT Presentation

ai argus
SMART_READER_LITE
LIVE PREVIEW

AI Argus A Unique Insight Into Logistics cs Neo Song SF Technology - - PowerPoint PPT Presentation

AI Argus A Unique Insight Into Logistics cs Neo Song SF Technology Department of Computer Vision 2019.02 CONTENT AI Argus Introduction Scenario Analysis and Algorithm Design Acceleration with NVIDIA Future Planning


slide-1
SLIDE 1

Neo Song SF Technology · Department of Computer Vision 2019.02

AI Argus

A Unique Insight Into Logistics cs

slide-2
SLIDE 2
  • AI Argus Introduction
  • Scenario Analysis and Algorithm Design
  • Acceleration with NVIDIA

CONTENT

  • Future Planning
slide-3
SLIDE 3

Argus Introduction

AI Argus

A Unique Insight Into Logistics cs

slide-4
SLIDE 4

Vehicle License Plate Analysis Vehicle Trajectory Analysis Loading Rate Detection Staff Efficiency Analysis

LPSS

Loading Procedure Structuring System

slide-5
SLIDE 5

Violent Operation Detection

6s Regularization Detection Business Management Safety Production

VAPD

Violated Action Pattern Detection

slide-6
SLIDE 6

Push h Notifi fications ns

 Web  APP

Trend nd Graph Rank nking ng List. Condition Moni nitoring ng

Config

Computing Node

camera

74% 44% 0% 5% 5% 0% 16% 28% 0% 10% 20% 30% 40% 50% 60% 70% 80% 粤BDB566 粤BFH239 粤BBT853 粤BBY411 LPSS recorded data in 2nd, April,2018

Loading Rate at Arrival Time Loading Rate at Departure Time

Argus Cloud Service

slide-7
SLIDE 7

Scenario Analysis and Algorithm Design

AI Argus

A Unique Insight Into Logistics cs

slide-8
SLIDE 8

Scenario Analysis

01

Loading Gate Operation Analysis

02

Behavior Monitoring

03

Equipment Monitoring

04

Specific Area Monitoring

Sorting Cen enter er

01

Standardization management

02

Behavior Monitoring

03

Tool Positioning Detection

04

Safety Production

Distribution Cen enter er

slide-9
SLIDE 9

Active Data Collection for Unfamiliar Scenes and Transfer Learning

Image Database Unseen Sample VGG16 Database New image 32bit binary code 32bit binary codes calculate hamming distance

>thres hold

yes no discard update VGG16

LPSS

slide-10
SLIDE 10

Loading Gate Working Status and Staff Efficiency Analysis

Optical Flow Calculator

01 01 02 02 03 03

Action Detection State Machine

flow_x+flowy+gray merge Pelee-net Classification result

arrive

arrive arrive arrive arrive arrive

arrival

Image sequence flow+gray merge sequence Classification result Pelee net State machine result Arrival or departure x-axis y-axis

LPSS

slide-11
SLIDE 11

Vehicle License Plate Analysis LPSS

What you expect to see VS What Argus actually sees

Asymmetric Illumination Image Blur Partial Covered Deformation/Soiling

slide-12
SLIDE 12

Object Detection Image Quality Assessment Attention OCR Video(Image) SSD+tracker GoogleNet CNN+LSTM+CTC end

Vehicle License Plate Analysis

SSD GoogleNet CNN+LSTM+CTC

SoftMax

粤BCG570

LPSS

slide-13
SLIDE 13

The Instant Loading Rate Detection

true ResNet50 Network1 =A/B/C? =E? =D? ResNet50 & true true predict1 predict2 predict6 Decision-making tree Network 2 discard

LPSS

slide-14
SLIDE 14

The Process Loading Rate Detection

Variable Length Sequence Feature Learning

LPSS

slide-15
SLIDE 15

Structured Data List

⚫ Vehicle License Plate ⚫ Gate Number ⚫ The Vehicle Arrival Time ⚫ Loading Rate at Arrival Time ⚫ Working Start Time ⚫ Working End Time ⚫ Loading Rate at Departure Time ⚫ The Vehicle Departure Time ⚫ Working State judgment

Vehicle License Plat ate Gate Number The Ve Vehicle Arrival al Time Loading Rat ate at Arrival al Time Star art Time End Time Load ading Rat ate at at Depar arture Time The Ve Vehicle Depar arture Time Stat ate 粤BDB566 No.1 uploading gate 2018-04-12 22:07:53 74% 2018-04-12 22:07:59 2018-04-12 22:23:24 5% 2018-04-12 22:23:34 uploading 粤BFH239 No.1 uploading gate 2018-04-12 22:24:16 44% 2018-04-12 22:24:20 2018-04-12 22:44:29 0% 2018-04-12 22:44:39 uploading 粤BGZ502 No.2 uploading gate 2018-04-12 22:45:41 95% 2018-04-12 22:45:45 2018-04-12 23:13:40 0% 2018-04-12 23:13:45 uploading 粤BV8026 No.1 loading gate 2018-04-12 22:13:54 5% 2018-04-12 22:13:59 2018-04-13 01:21:34 49% 2018-04-13 01:21:40 loading 粤B3G15U No.2 loading gate 2018-04-13 03:34:48 5% 2018-04-13 03:34:54 2018-04-13 04:07:05 79% 2018-04-13 04:07:12 loading 粤BZ5717 No.12 loading gate 2018-04-12 22:21:15 0% 2018-04-12 22:21:21 2018-04-13 02:18:49 85% 2018-04-13 02:18:56 loading 粤BBT853 No.13 loading gate 2018-04-12 22:10:06 0% 2018-04-12 22:10:11 2018-04-13 01:00:58 16% 2018-04-13 01:01:04 loading 粤BBY411 No.16 loading gate 2018-04-12 22:08:10 5% 2018-04-12 22:08:14 2018-04-13 03:51:56 28% 2018-04-13 03:52:15 loading

LPSS

slide-16
SLIDE 16

The Illegal Throwing Behavior Detection VAPD

challenges

Pushing a box is not an Illegal Throwing Behavior. ACTION RATING Throwing a file is not an Illegal Throwing Behavior. PARCEL TYPE A short distance throwing is not an Illegal Throwing Behavior. SPATIAL DISTANCE

slide-17
SLIDE 17

The Illegal Throwing Behavior Detection VAPD

slide-18
SLIDE 18

Fine Grained Illegal Throwing Behavior Detection via ROI Extraction and 3D Space Recovery VAPD

slide-19
SLIDE 19

Structured Data List

⚫ Warning Start Time ⚫ Latest Warning Time ⚫ Number of Continuous Warning ⚫ Duration ⚫ Time of violation ⚫ Precision

VAPD

Warning Start Ti Time Lates est Warning Ti Time Number er of Continuous Warning Duration Time of violation Precision 2018/11/29 15:33:50 2018/11/29 15:33:50 1 15.0s 1 67% 2018/11/29 15:13:35 2018/11/29 15:13:35 1 15.0s 1 91% 2018/11/29 14:41:08 2018/11/29 14:41:08 1 2.0min 2 50% 2018/11/29 14:20:41 2018/11/29 14:20:41 1 15.0s 1 86% 2018/11/29 13:48:43 2018/11/29 13:48:43 1 15.0s 1 52% 2018/11/29 13:47:51 2018/11/29 13:47:51 1 15.0s 1 84% 2018/11/29 13:43:39 2018/11/29 13:43:39 1 15.0s 1 79% 2018/11/29 12:53:50 2018/11/29 12:53:50 1 25.5s 1 72%

slide-20
SLIDE 20

Acceleration with NVIDIA

AI Argus

A Unique Insight Into Logistics cs

slide-21
SLIDE 21

Technology Stack - Edge

Nvidia Tesla P4 / Xavier x86 / ARM 64bit Linux Distribution TensorRT DeepStream CUDA Argus Edge Framework User Applications

Daemon Payload

Frame Pool Frame Stack Pre-Processing Primitives Parallel Processing Queue Inter-Process Communication HTTPS Token Encryption Asynchronous I/O Interface Heterogeneous Computing Memory Model Node Status Report System Failure Recovery In system upgrade Initial Setup Flow Video Quality Assessment Multi-Payload Management Loading Monitoring Action Recognition Retail Analytics Smart City 3D Perception

slide-22
SLIDE 22

Technology Stack - Cloud

IAAS PAAS SAAS API Gateway TCP API HTTP API

Video File Image File Setup Param Config APP Upgrade Model Upgrade Command Token Distribution

WEB NAS APP

Payload Monitor Auth Manage Msg Notification Alarm Handler

JDK Spring Netty MyBatis SFIM Msg SpringBoot VUE ElementUI Docker MySQL Kafka Redis ZooKeeper RocketMQ Jetty

Payload Msg Node Status System Management Device Management User Management Statistics Report

slide-23
SLIDE 23

Argus System Architecture: Edge Computing

Camera 1 Camera 2 Camera 3 Camera N

…….

SWITCH NVR NVR

LPSS LPSS VAPD VAPD

…….

Deamon

Argus Edge

…….

Edge Network

INTERNET (HTTPS/TCP)

Argus Cloud

Edge Network 1 Edge Network 2 Edge Network N

…….

INTERNET (HTTPS/TCP) INTERNET (HTTPS/TCP)

…….

slide-24
SLIDE 24

Argus System Architecture: Cloud Computing

Mobile Phone Bar Code Scanner Camera NVR Client Req/Res Handling ... ... ...

Per Model scheduling Q ueues Framework Backends

TensorRT TensorFlow+T RT Caffe2 Model Management Model Repo GPU1 GPU2 GPU3 Camera LPSS VAPD

HTTP/gRPC Inference Request Inference Request

slide-25
SLIDE 25

Mapping Deep Stream into Argus Software Architecture

DeepStream

Camera 1 Camera 2 Camera N

…….

NVR Compute Node Input Processing Business Node User/Client Output

Camera 1 Camera 2 Camera N

…….

NVR Compute Node

…….

slide-26
SLIDE 26

Flexible Streaming Pipeline Design

Optical Flow

Detect Network

Discrimination network Discrimination network Recognization Network

  • 1. A plugin Model based pipeline architecture
  • 2. Graph-based pipeline interface to allow high-level component interconnect
  • 3. Heterogenous processing on GPU and CPU
  • 4. Hides parallelization and synchronization under the hood
  • 5. Inherently multi-threaded

Deep Stream

  • On-Demand Computing
  • Reuse Calculation
slide-27
SLIDE 27

Optical Flow Speed-up with CUDA

Runtime:8ms cv::cuda:OpticalFlowDual_TVL1

slide-28
SLIDE 28

Optical Flow Speed-up with CUDA

Runtime: 3.6ms Assume camera is fixed in cv::cuda:OpticalFlowDual_TVL1 Runtime: 2.8ms Using CUDA float array instead of cv:GpuMat

Motion compensation on non-stationary camera Security camera is fixed

slide-29
SLIDE 29

Optical Flow Speed-up with CUDA

224*224 pre 224*224 current

  • 1. Share Memory
  • 2. Block =

(224*224)/(32*32)=49

  • 3. Finally Sync all Block

224*224 current = 32*32* 49 224*224 pre = 32*32* 49

1.3ms

224*224 pre 224*224 current

  • 1. Global Memory
  • 2. Block = (224*224)/(32*8)
  • 3. each Step Sync all Block

2.8ms

In deployment, the GPU server adopts Tesla P4 GPU.

slide-30
SLIDE 30

Concurrent Asynchronous

Buffer Queue

Producer Producer Producer Producer Producer Producer Producer Producer Consumer Consumer Consumer Consumer Consumer Consumer Consumer Consumer

slide-31
SLIDE 31

HD1 K1.1 K1.2 K1.3 K1.4 DH1 K1.5 K1.6 HD2 K2.1 K2.2 K2.3 K2.4 DH2 K2.5 K2.6 HD3 K3.1 K3.2 K3.3 K3.4 DH3 K3.5 K3.6 HD4 K4.1 K4.2 K4.3 K4.4 DH4 K4.5 K4.6 HD5 K5.1 K5.2 K5.3 K5.4 DH5 K5.5 K5.6 HD6 K6.1 K6.2 K6.3 K6.4 DH6 K6.5 K6.6

Buffer Queue

Producer Producer Producer Producer Producer Producer Producer Producer Stream1 Stream2 Stream3 Stream4 Stream5 Stream6

Concurrent Asynchronous With Mutil-Stream

slide-32
SLIDE 32

Model Acceleration based on TensorRT

Model el Accuracy Infer eren ence Spee eed VGG16 93% 113ms VGG16-Pruning 89% 32ms VGG16-lowrank 94% 37ms VGG16-lowrank-Pruning 93.5% 32ms VGG16-lowrank-Pruning-TensorRT 93.5% 15.9ms VGG16-lowrank-Pruning-TensorRT-Int8 93.5% 7.5ms Measurement on Tesla P4 GPU Model el Accuracy Infer eren ence Spee eed PELEE 97.1% 2.48ms PELEE-TensorRT 98.07% 1.24ms PELEE-TensorRT-Int8 98% 0.91ms

slide-33
SLIDE 33

Flexible Computing Resources Allocation

Pre-Process Detection Network Optical Flow Recognition Network Output Discrimination Network Run in CPU Run in GPU Run in CPU or GPU

slide-34
SLIDE 34

Flexible Product Line Based on Various Computing Platforms

slide-35
SLIDE 35

In deployment, the device adopts two INTEL Xeon E5-2620V4 CPUs and two Tesla P4 GPUs, which can process 32 video streams.

System Metric of LPSS Based on NVIDIA Tesla P4

CPU Memory 7G

32G

GPU Utilization 50%

100%

GPU Memory 2.6G

8G

CPU Utilization 12%

100% 100%

45% average peak

100%

86%

slide-36
SLIDE 36

In deployment, the device adopts the Intel Core i7-6800k CPU and Tesla P4 GPU, which can process 16 video streams.

System Metric of VAPD Based on NVIDIA Tesla P4

CPU Memory 8.5G

32G

GPU Utilization 40%

100%

GPU Memory 2.8G

8G

CPU Utilization 10%

100% 100%

17% average peak

100%

85%

slide-37
SLIDE 37

System Metric of VAPD Based on NVIDIA Jetson Xavier

In deployment, the device adopts Xavier, which can process 20 video streams.

CPU Memory 3.3G

16G

Single GPU Utilization 40%

100%

SingleGPU Memory 1.0G

1.2G

CPU Utilization 43%

100% 100%

85% average peak

100%

92%

slide-38
SLIDE 38

Future Planning

AI Argus

A Unique Insight Into Logistics cs

slide-39
SLIDE 39

Future Planning

⚫ Loading Procedure StructuringSystem ⚫ ViolatedActionPattern Detection ⚫ 6S Pattern Detection ⚫ Unfamiliar Scene andSample Collection ⚫ Package LifecycleTracking System ⚫ Facility Abnormal InvasionDetection ⚫ Staff Efficiency Analysis ⚫ Freight Reflux Detectionand Counting ⚫ Employee Image AssuranceSystem

slide-40
SLIDE 40

Argus

A Unique Ins Insight Int Into Logistics

Thank You For Watching

SF Technology · Department of Computer Vision

Neo Song 2019.02