Ford's Autonomous Vehicles Bryan Goodman Argo AI 9 May 2017 1 - - PowerPoint PPT Presentation

ford s autonomous vehicles
SMART_READER_LITE
LIVE PREVIEW

Ford's Autonomous Vehicles Bryan Goodman Argo AI 9 May 2017 1 - - PowerPoint PPT Presentation

S7348: Deep Learning in Ford's Autonomous Vehicles Bryan Goodman Argo AI 9 May 2017 1 Ford s 12 Year History in Autonomous Driving Today: examples from Stereo image processing Object detection Using RNNs Motorsports 2


slide-1
SLIDE 1

S7348: Deep Learning in Ford's Autonomous Vehicles

Bryan Goodman Argo AI 9 May 2017

1

slide-2
SLIDE 2

Today: examples from

  • Stereo image processing
  • Object detection
  • Using RNN’s
  • Motorsports

2

Ford’s 12 Year History in Autonomous Driving

slide-3
SLIDE 3

Stereo Matching Problem

  • Determining the correspondences in stereo images
  • Calculating the disparities
  • But what is the correct correspondence?
  • Basic stereo matching algorithm

− Compare pixels on the same epipolar line in two images − Choose the best match

3

slide-4
SLIDE 4

Deep neural networks for stereo matching

  • The brain can estimate the distance of

an object using the visual information from two eyes.

  • We can use deep neural networks

4

Right Stereo Camera Deep Convolutional Neural Networks Post-Processing Left Stereo Camera Distance Map Estimation

slide-5
SLIDE 5

Proposed deep convolutional neural network

  • AV driving requires an intelligent distance map estimation, which filters out the
  • bjects not of interest.
  • Network I

− General network − Encoding and decoding layers − Retain objects of interest in the training data sets

5

Conv1

Conv5

Conv2 Conv3

Conv6 Deconv6

Conv7 Deconv7 Deconv8 Conv8 Deconv9

Encoder Decoder

Conv9

Loss Function

Deconv10 Conv10 Conv4

slide-6
SLIDE 6

Proposed deep convolutional neural network II

− Specialized network − Encoding and decoding layers − The cross correlation layers force the network to look for correspondence on the epipolar line − The weights in the encoding layers are shared

6 Conv1L

Conv4L Loss Function Encoder Decoder

Conv2L Conv3L Conv1R

Conv4R

Conv2R Conv3R

CC5

Conv5

Conv6 CC6

Deconv6

Conv7 CC7 Deconv7 Deconv8 Conv8 Deconv9 Conv9

slide-7
SLIDE 7

Proposed deep convolutional neural network

  • Cross correlation (CC) layer

− Computes CC values between each pairs of patches − Outputs the CC values for each pair of patches − Does not lose any information

  • Loss function

− In AV driving, closer objects are more important than distant ones − Assigns more weight to the closer objects − The closer object distance is estimated more accurately

7 0.2 0.4 0.6 0.8 1 1 0.4 0.2 α d 0.6 0.8

slide-8
SLIDE 8

Performance on synthetic and real stereo data

  • Synthetic data generation

− Generate 14,000 pairs of RGB stereo images − Synthetic distance maps are only generated for the objects of interest, e.g. cars or pedestrians − Gaussian noise added to the stereo images

8

slide-9
SLIDE 9

Performance on synthetic and real stereo data

  • Fine tuning with LIDAR data sets

− Project LIDAR point clouds onto the camera images − The baseline and optic axes are not the same as the synthetic data

9

Left camera Right camera Network I Network II

slide-10
SLIDE 10

1/2x

10

slide-11
SLIDE 11

Comparing Manual Annotation to DNN Model

11

slide-12
SLIDE 12

12

Detection Result Original Image Enhanced Contrast

Network’s detection outperforms human labeler in low-contrast areas

Pedestrian detection Pedestrian misdetection Detected, but not labeled

slide-13
SLIDE 13

Introducing Recurrence in Detection and Tracking

  • Use RNN’s to detect occluded objects
  • Remember location of static objects
  • Predict location of non-static objects

13

Image 0 Feature Map

RNN Conv

Image 1 Feature Map Image 2 Feature Map

RNN Conv RNN Conv Detector Detector Detector

slide-14
SLIDE 14

Orange = ground truth; Green = model prediction

14

slide-15
SLIDE 15

15

Classifying NASCAR images

The Ford team reviews pictures during the race

slide-16
SLIDE 16

16

Classifying NASCAR images

Looking for damage and

  • ther performance indicators

Gap

slide-17
SLIDE 17

17

Results – Boxing the Cars

Using ~2k images labeled with boxes around the vehicles, the model does well detecting cars

slide-18
SLIDE 18

18

Results – Boxing the Cars

slide-19
SLIDE 19

Classifying NASCAR images

Next – determine car number: labeled ~30k images

slide-20
SLIDE 20

Classifying NASCAR images

Outliers easy to find in review

slide-21
SLIDE 21

Classifying NASCAR images

Human: ??? Model: 78 Confidence: 0.999

slide-22
SLIDE 22

Classifying NASCAR images

Human: ??? Model: 42 Confidence: 0.985

slide-23
SLIDE 23

Inspecting the Neural Network

23

Activated Filter Input Image

The Model is not a black box. We can see that it is detecting the numbers – important for robustness when the paint changes

slide-24
SLIDE 24

Argo AI

  • Argo AI is an artificial intelligence company, established to tackle one
  • f the most challenging applications in computer science, robotics

and artificial intelligence: self-driving vehicles

  • Engineering hubs in Pittsburgh, Southeastern Michigan and the Bay

Area of California

  • For more information regarding Argo AI and its work, please talk to

me at GTC or visit: www.argo.ai

24