of Road Images with Deep Learning Frank Geujen Senior Product - - PowerPoint PPT Presentation

of road images with
SMART_READER_LITE
LIVE PREVIEW

of Road Images with Deep Learning Frank Geujen Senior Product - - PowerPoint PPT Presentation

Detection and Segmentation of Road Images with Deep Learning Frank Geujen Senior Product Manager William Raveane Computer Vision Engineer Mapscape, a Navinfo company GTC Europe, October 2017, Talk #23304 Who is NavInfo SD &


slide-1
SLIDE 1

Detection and Segmentation

  • f Road Images with

Deep Learning

GTC Europe, October 2017, Talk #23304

Frank Geujen – Senior Product Manager William Raveane – Computer Vision Engineer Mapscape, a Navinfo company

slide-2
SLIDE 2

CONTENTS

目录

Who is NavInfo SD & HD Map Making Road Feature Extraction Traffic Sign Detection Looking Ahead

slide-3
SLIDE 3

Who is NavInfo

slide-4
SLIDE 4

NavInfo is the leading map provider in China, with focus on location big data platform, HD / SD map, Telematics and ADAS comprehensive solutions.

  • Established on 2002 in Beijing China
  • More than 4500 employees Globally

NavInfo Introduction

Automated Driving Connected Car Navigation

slide-5
SLIDE 5

Branches

 In Inte ternati tion

  • nal

l bu busines iness exp expansion ion  Adv Advanced ced te tech chnol

  • logy
  • gy

res esea earch rch

Ameri merica ca Ne Nethe therlan rlands ds

 Mapscape Mapscape: : Com Compila pilatio tion Tech echnol

  • log
  • gy (ND

(NDS)  EU EU Tech echnolo

  • logy Cent

Centre re:

  • Com

Compute ter r Vis ision ion

  • Deep

Deep Le Learni rning

China hina

 31 31 loca locali liza zati tion ba base e for r da data ta colle collect ct and d te tech chnolo

  • logy

ser ervice. ice.  4 4 R&D &D Center Centers (Sh Shangh ghai、Xia Xian、Sh Shenyang、Wuhan)  Beij Beijing ing Hea eadqu quarte rters rs

slide-6
SLIDE 6

SD & HD Map Making

slide-7
SLIDE 7

Challenge of Map Making

Ingestion/ Ingestion/ Extra xtraction tion Data Data Sour Source

10 100+ 0+

Collection vehicles

600+ 600+

dispersed field staff in China

360 360+ + cities cities

Big data mining 99 99% Highway 80 80% Main Road

22 22M+ M+

community contributions

3. 3.2+ 2+ Mill Millio ion

Signs processed per year

20 20+ Mil + Milli lion

POI updated per year

4+ 4+ Mi Mill llion ion

Road distance updated per year

Map Cre Map Creat ation ion

> 500 500

production staff

4,0 ,000 00+

page specifications

Del Deliv iver ery y

6.16+ 6.16+ Mil Milli lion n

Kilometer

24.9 .95+ Million

Core POI

260+ 260+

Attributes in the portfolio

60+ 60+

Cities of ground truth testing in 2017 Q1

80 80%/7 %/70% 0%

Update 80% POI and 70% road link in China per year field local offices

31 31

Hong Kon Kong, , Lao Laos, s, Ma Maca cao, Cam , Cambodia ia

Map data

slide-8
SLIDE 8

SD Map

POI OI

St Standa andard Def rd Defin init ition ion Map, Map, i is s pr prim imar arily fo ly for A r A to B to B routing

  • uting & guidan

& guidance and and is is a sim a simpl plif ified represen ied representa tation tion of

  • f th

the road in e road in lin links s and and nodes nodes.

slide-9
SLIDE 9

HD Map

Hi High gh Defini Definition ion M Map, is ap, is us used for au ed for automated/ tomated/autono autonomous

  • us driv

drivin ing and in and includes ludes high high accurat accurate lane e lane and and ro road ad fea features tures.

slide-10
SLIDE 10

The Field drive use cases

Live feed mono camera Real Time sign extraction

(Semi)

automated core map updating 2 FPS stored mono camera imagery Off line Feature extraction

(Semi)

automated core map updating Panoramic imagery Road feature extraction Projection

  • n LIDAR

data Feature extraction from LIDAR HD map creation/ updating

On-board SD map Off-board SD map Off-board HD map

slide-11
SLIDE 11

Traffic Sign Detection

Technical Deep Dive:

slide-12
SLIDE 12

Real Time Traffic Sign Detection

Use case: In-car data collection on an NVIDIA Jetson TX2

  • Over 180 traffic sign classes supported today
  • Up to 32 fps at 1920x1080 in 15W MAX-N mode
  • Detection based on Single Shot Detector (SSD)
  • Training on a Titan X GPU server
  • Inference through TensorRT
slide-13
SLIDE 13

Supported Features

Supported today

  • Speed Limits
  • Warning Signs
  • Information Signs
  • Prohibition Signs
  • Directional Signs

In development

  • Gantry Sign Boards
  • Traffic Lights
  • Digital Traffic Signs
slide-14
SLIDE 14

Sign Detection Demo

Watc tch O Onli line: e: htt ttps ps://go ://goo.gl/

  • .gl/KBgG

KBgGo8

  • 8
slide-15
SLIDE 15

Performance Highlights

Multip ltiple le Si Simu mult ltaneou aneous s Detect Detections

  • ns
slide-16
SLIDE 16

Performance Highlights

Dist istant ant Tr Traffic ic Si Sign gns

slide-17
SLIDE 17

Performance Highlights

Bad ad Li Light ghting ing Cond Conditio itions

slide-18
SLIDE 18

Optimization of SSD on Jetson TX2

  • With TensorRT:
  • 6x speedup in inference performance
  • 3x reduction in memory consumption
  • And with our in-house CUDA Kernels
  • Additional 3x speedup in inference performance
  • Allows full utilization of GPU resources
slide-19
SLIDE 19

Implementation of SSD

Two-stage system:

  • ResNet-based SSD for Detection
  • ResNet for Fine Classification

Custom Layer API:

  • Bridges both TensorRT Stages

Detector: SSD Classifier: ResNet

slide-20
SLIDE 20

SSD Custom Layers

  • Implementation of SSD layers as custom

CUDA kernels:

  • Executed by Custom Layer API
  • Priors replaced by on-demand calculations
  • Softmax calculated only when required
  • Non-maximum suppression replaced by a

batched data feeder for the classifier

slide-21
SLIDE 21

SSD on the Jetson TX2

SSD Caffe TensorRT + our CUDA kernels

Profile visualization of SSD inference 510ms 31ms

slide-22
SLIDE 22

Detection Accuracy

  • Single Image:
  • Precision: 92.5%
  • Recall: 98%
  • Tracking over Time:
  • Precision: 96.0%
  • Recall: 98.5%

Single Image Per-Class Detection Accuracy Single Image Per-Class Classification Accuracy Single Image Detection PR Curve

Class ID Class ID Accuracy Accuracy 100 100

slide-23
SLIDE 23

Road Feature Extraction

Technical Deep Dive:

slide-24
SLIDE 24

Road Feature Extraction

  • Road feature and object extraction
  • Semantic segmentation network architecture
  • Automatic lane grouping
  • Training & inference on NVIDIA Titan X GPU server

Lane numbering Road features Gantry sign boards

slide-25
SLIDE 25

Supported today:

  • Surface level:
  • Lane markings
  • Text, numbers, speed limits
  • Arrows
  • Road objects:
  • Gantry sign boards
  • Guard Rails
  • Curbs

In Development:

  • Poles
  • Traffic Lights
  • Tunnels

Road Features

slide-26
SLIDE 26

The On-road feature extraction process

Crop from Panoramic Image Camera Calibration Transformation to Top View Segmentation Network Transformation to Front View Semantic Segmentation Deep Neural Network Lane Number Grouping

slide-27
SLIDE 27

Lane Segmentation Demo Watc

tch O Onli line: e: htt ttps ps://go ://goo.gl/

  • .gl/4CXT

4CXTD5 D5

slide-28
SLIDE 28

Semantic Segmentation Performance

  • Inference at 5 images per second using an NVIDIA Titan X GPU
  • Common lane marking classes
  • Recall: 92.8%
  • Precision: 82%
  • Common road arrow marking classes
  • Recall: 85.6%
  • Precision: 72.8%

Confusion Matrix Performance of the system expected to further improve as we continue development

slide-29
SLIDE 29

Looking Ahead

slide-30
SLIDE 30

Looking Ahead

  • Deep Learning continues being more integrated into our:
  • Field collection
  • Map creation
  • Distribution processes
  • On-going developments:
  • Real-time semantic segmentation system on-board vehicles
  • Crowdsource data processing supporting self-healing maps
  • Applications for crowdsourcing
slide-31
SLIDE 31

Detection and Segmentation

  • f Road Images with

Deep Learning

GTC Europe, October 2017 , Talk #23304

Frank Geujen – Senior Product Manager William Raveane – Computer Vision Engineer Mapscape, a Navinfo company