April 4-7, 2016 | Silicon Valley
Shri Sundaram, Spring 2016
DRIVE PX 2 SELF-DRIVING CAR COMPUTER Shri Sundaram, Spring 2016 - - PowerPoint PPT Presentation
April 4-7, 2016 | Silicon Valley DRIVE PX 2 SELF-DRIVING CAR COMPUTER Shri Sundaram, Spring 2016 SELF-DRIVING AND AI SUPERCOMPUTING 2 SELF-DRIVING CAR PLATFORM DRIVEWORKS Perception Localization Planning Visualization NVIDIA DRIVE PX
April 4-7, 2016 | Silicon Valley
Shri Sundaram, Spring 2016
2
3
NVIDIA DRIVE™ PX 2 NVIDIA DIGITS NVIDIA DRIVENET
Localization Planning Visualization Perception
DRIVEWORKS
4
12 CPU cores | Pascal GPU | 8 TFLOPS | 24 DL TOPS | 16nm FF | Scalable from 1-4 processors World’s First AI Supercomputer for Self-Driving Cars
5
Sensor Fusion Interfaces:
GMSL Camera, CAN, GbE, BroadR-Reach, FlexRay, LIN, GPIO
Displays and Cockpit Computer Interfaces
HDMI, FPDLink III and GMSL
Development and Debug Interfaces
HDMI, GbE, 10GbE, USB3, USB 2 (UART/debug), JTAG
Auto Grade connectors Debug/Lab interfaces
6
Dual CPU-GPU Cluster Connected over Gigabit Ethernet
compute Each GPU has dedicated memory
performance Specialized Instructions for Discrete GPU
DRIVE™ PX 2
7
NVIDIA Vibrante Linux & Comprehensive BSP Rich Middleware SDK, Samples and more
8
Sensor Fusion, Point Cloud etc.
Detection/Perception (DNN Inference), Localization, Path Planning, Visualization
Automobile Network
Algorithm development, Hardware in Loop etc.
9
Develop applications using DRIVE™ PX 2 Deploy a pre-trained DNN
Migrate applications from PC to DRIVE™ PX 2 Typically in a lab set up or a few cars DRIVE PX™ 2 as reference for custom ECU Safety concept to derive certain Safety Integrity Level Validate algorithms in car with live cameras/sensors Partition existing system (say, PC); include DRIVE™ PX 2 for sensor fusion 10s of cars
10
Computation & Memory Interfaces & I/O Bandwidth Rich SW & HW Capabilities Functional Safety Automotive HW Automotive SW (RTOS, AUTOSAR & Code Compliance) Product Partnerships (HW & SW)
11
CPU Complex: 2x Denver2 plus 4x Cortex-A57
Fully coherent HMP system; ARM V8 64-bit
Pascal GPU – 5th Gen GPU Architecture
Custom acceleration for deep learning
Separate memory for Tegra (CPU + iGPU) and dGPU
Tegra (CPU + iGPU) to 8GB LPDDR4 (UMA): 50+GB/s Discrete GPU to 4GB GDDR5: 80+ GB/s
CORE DRIVE™ PX 2 CAPABILITIES
12
Prepared to handle many use-cases
Train on DevBox/DIGITS; Inference on DRIVE™ PX 2 Develop on PC (CUDA); Run on DRIVE™ PX 2 Develop on another NVIDIA embedded platform; Run on DRIVE™ PX 2 …
By making the experience across homogeneous
Unified tool chain (across DevBox, DRIVE™ PX2 and other platforms) Multimedia APIs aligned across all platforms (Linux) Full Open GL (not just ES) Ubuntu Unity desktop CUDA/CuDNN/DL Frameworks …
CORE DRIVE™ PX 2 CAPABILITIES
13
Data Logger DRIVE™ PX 2
Camera Bypass
Vehicle Information Other Sensors CAN/FlexRay Gigabit Ethernet Other Sensors & GPS (LiDAR/Radar) 2x 10 Gigabit Ethernet Uncompressed Camera stream 12x Camera GMSL 2x HDMI (4K/60) as Data pipe Optional: for uncompressed video 10 Gigabit Ethernet Optional: for uncompressed video ECU debug data 12x Camera GMSL
TOOLS & ECOSYSTEM
14
User can define 20 different CAN node IDs between Infineon Aurix and NVIDIA Tegra. No special tool-chain required. Messages defined through configuration files on Linux (running on Tegra A). E.g. one of the 20.
CAN Channel F (on Aurix): Standard Can Id (0x244) - forwarded to Tegra B
* Provided by NVIDIA Partner Elektrobit
TOOLS & ECOSYSTEM
15
Graphics Debugger, System Profiler Multi Process, Multi Node, Multi GPU CUDA tools
TOOLS & ECOSYSTEM
16
Tools and building blocks to create your autonomous driving applications. Object detection, map localization, path planning and visualization.
TOOLS & ECOSYSTEM
17
USB/GigE Cameras LiDAR Image Sensors Hardware Reference board partners SDKs and frameworks from Partners
TOOLS & ECOSYSTEM
18
Being developed as SEooC for autonomous driving Developing work products
Functional and Technical Safety Concept – Scheme to achieve certain Safety Integrity Level for a particular use-case. System Design Specification – Functional details of HW / SW components & interfaces that are compliant to various standards (i.e. ISO26262) Hardware-Software Interface Specification – Register level details of HW/SW components Safety Manual – Safety architecture (e.g. RTOS/AUTOSAR) and list of all safety mechanisms and instructions to use them (e.g. DRAM ECC) FMEDA - Documentation of random failure metrics
SPRINGBOARDS TO PRODUCTION
19
Certified Hypervisor and foundation partitions Safety OS (CPU Complex, SCE, Safety MCU) Safety supervision framework Safety diagnostics libraries Safety AUTOSAR Stack (SCE, Safety MCU) Applications, Middleware and Libraries
L1SS L2SS L3SS IPC SPI nSAFE WD WD STLib STLib
Tegra
Tegra/Pascal HW
SPRINGBOARDS TO PRODUCTION
20
Running on NVIDIA Tegra and Infineon AURIX™ 32-bit TriCore™ microcontroller Integrated Linux and AUTOSAR apps Functionality to monitor & redundancy management. IPC in safe/reliable execution environment.
Also available reference stacks from other major AUTOSAR solution providers.
SPRINGBOARDS TO PRODUCTION
21
DRIVE™ PX 2: AI Supercomputer for Self Driving Cars Built for application development, rapid embedded prototyping and to help migrate to series production. Delivers powerful I/O and processing capabilities, rapidly expanding product ecosystem and several means to shorten the path to production.
22
DRIVE™ PX 2 Launch at CES https://youtu.be/C_8MZZ2TZUk (View Part 1 through 9) Deep Learning for Cars (GTC 2016) https://youtu.be/RVmV9SXJeBg & https://youtu.be/YuyT2SDcYrU NVIDIA Self-Driving Car https://youtu.be/5Wr_6Wk0DmY Roborace https://youtu.be/KN_qostst-s
April 4-7, 2016 | Silicon Valley
JOIN THE NVIDIA DEVELOPER PROGRAM AT developer.nvidia.com/join