Autonomous Driving on Benchmarks Xiaodi Hou TWO DECADES OF - - PowerPoint PPT Presentation

▶

Jun 18, 2023 270 likes •526 views

Autonomous Driving on Benchmarks Xiaodi Hou TWO DECADES OF BENCHMARKING Two decades of benchmarking MNIST 1998 Character recognition 60,000 images Inspired Convolutional Neural Net Two decades of benchmarking PASCAL-VOC

SLIDE 1

Autonomous Driving on Benchmarks

Xiaodi Hou

SLIDE 2

TWO DECADES OF BENCHMARKING

SLIDE 3

Two decades of benchmarking

MNIST

– 1998 – Character recognition – 60,000 images

Inspired Convolutional Neural

Net

SLIDE 4

Two decades of benchmarking

PASCAL-VOC

– 2005 – Object detection & classification – 3787 images

Inspired Deformable Part-

based Model

SLIDE 5

Two decades of benchmarking

ImageNet

– 2010 – Object classification – 1,000,000 images

Inspired deep learning

SLIDE 6

LIMITATIONS OF BENCHMARKS

SLIDE 7

Upper bounds of benchmarks

Objective tasks Intermediate tasks Subjective tasks

Measuring physical reality
Bounded by measurement

accuracy

Stereo/Optical flow/Face

recognition

Measuring human cognition
Bounded by subject

agreement

Saliency/Memorability/Image

captioning

SLIDE 8

Imperfect benchmarks

Marriage market in China

– Tall, rich, and handsome

80% girls are forced to choose among

– tall poor ugly guy – short rich ugly guy – short poor handsome guy

Dimensionality reduction

– Guaranteed information loss! – A projection of 𝑺𝒐 → 𝑺

Red or Blue?

SLIDE 9

Signs of a fading benchmark

Saturated competition

– Labeled Face in the Wild (0.9978 ± 0.0007)

Weak transferability

– Middlebury Optical Flow → KITTI Optical Flow

Poor inert-subject consistency

– Image captioning and BLEU scores

A man throwing a frisbee in a park.
A man holding a frisbee in his hand.
A man standing in the grass with a frisbee.

SLIDE 10

BENCHMARKS AND AUTONOMOUS DRIVING

SLIDE 11

Vision-based autonomous driving benchmarks

KITTI & CityScapes

– Detection – Tracking – Stereo/Flow – SLAM – Semantic segmentation

100% traditional vision

challenges

Are we ready?

SLIDE 12

Not yet…

SLIDE 13

Challenge 1: Data distribution

Academia

– Average performance

Silicon valley startup

– Demo oriented – Best case performance

Real products

– Murphy’s law – Worst case performance

SLIDE 14

Challenge 2: Gruond-truth representation

Bbox

– Almost no bbox in real world! – Missing hidden variables (distance & velocity)

Semantic segmentation

– “pixel classification” – How to assemble all the pixels?

Stixels

– Representing the world using matchstick – Distance and 3D shape – Missing the notion of whole objects

SLIDE 15

Challenge 3: Structured prior

What’s wrong with end-to-end learning?

SLIDE 16

Challenge 3: Structured prior

Two types of priors:

– Implicit prior

Data driven (e.g. images)
Good for deep learning models

– Explicit prior

Rule driven (e.g. cars cannot fly)
Good for probabilistic models
The road ahead

– An image based problem with strong explicit priors

SLIDE 17

TUSIMPLE CHALLENGES! WORKSHOP@CVPR 2017

SLIDE 18

TuSimple Challenge 1: Lane challenge

SLIDE 19

TuSimple Challenge 1: Lane challenge

Deep learning for lane?

– Parametrization of pixels

Strong structure priors

– ~ 3.75m lane width – Parallel lines – (almost) flat road surface

Over-representing corner cases

– 20% hard cases (heavy occlusion/strong light condition change/bad markings) are unlikely to occurs, if sampled uniformly

SLIDE 20

TuSimple Challenge 2: Velocity estimation

Representing the world with cam + LiDAR

SLIDE 21

TuSimple Challenge 2: Velocity estimation

Object-level representation for motion planning

– Stereo map? – SLAM? – Estimation based on bbox size?

LiDAR vs Camera

– No LiDAR solution for 200m perception

SLIDE 22

TuSimple challenges

Video clip based

– We expect non-trivial temporal aggregation!

Confidence based

– Each entry has a “confidence” field – We evaluate the most confident 80% entries

Run-time

– Must report single GPU runtime speed – Slow algorithms (< 3fps) will not be included in the leaderboard

SLIDE 23

HTTP://BENCHMARK.TUSIMPIE.AI

Available now!!

SLIDE 24