IBM Research
MULTI-VEHICLE MAP FUSION USING GNU RADIO
OPTIMIZATION AND ACCELERATION OPPORTUNITIES
Augusto Vega Akin Sisbot Alper Buyuktosunoglu Arun Paidimarri David Trilla John-David Wellman Pradip Bose IBM T. J. Watson Research Center
Acknowledgment Thanks to the many IBM colleagues who contribute to - - PowerPoint PPT Presentation
M ULTI -V EHICLE M AP F USION USING GNU R ADIO O PTIMIZATION AND A CCELERATION O PPORTUNITIES Augusto Vega Akin Sisbot Alper Buyuktosunoglu Arun Paidimarri David Trilla John-David Wellman Pradip Bose IBM T. J. Watson Research Center IBM
IBM Research
Augusto Vega Akin Sisbot Alper Buyuktosunoglu Arun Paidimarri David Trilla John-David Wellman Pradip Bose IBM T. J. Watson Research Center
IBM Research
2 February 2020
IBM Research
3 February 2020
IBM Research
4 February 2020
IBM Research
5 February 2020
Source: IEEE Spectrum (July 2018) * Source: https://www.darpa.mil/program/domain-specific-system-on-chip
IBM Research
6 February 2020 Source: MIT Technology Review
When’s a pedestrian not a pedestrian? When it’s a decal.
IBM Research
7
Sensing and computation capabilities False predic4ons car-centric swarm-based
February 2020 Source: MIT Technology Review
When’s a pedestrian not a pedestrian? When it’s a decal.
IBM Research
8 February 2020
IBM Research
9 February 2020
EPOCHS Reference Application Compiler + Scheduler Ontology & Design Space Exploration Accelerators + NoC + Memory Architecture Implementation Domain-Specific SoC Hardware
FPGA Prototype
IBM Research
10 February 2020
EPOCHS Reference Application Compiler + Scheduler Ontology & Design Space Exploration Accelerators + NoC + Memory Architecture Implementation Domain-Specific SoC Hardware
FPGA Prototype
IBM Research 11 February 2020
Application Integrated performance analysis Development Environment and Programming Languages Libraries Operating System Compiler, linker, assembler Intelligent scheduling/routing Heterogeneous architecture composed of Processor Elements:
Decoupled Software development Hardware-Software Co-design
Medium Access Control
IBM Research
12 February 2020
IBM Research
13 February 2020
Communication Fabric Sensing Fabric
V2V Communications To other control modules Real-Time Map Fusion Map Generation Computer Vision
EPOCHS Reference Application
IBM Research
14 February 2020
Costmap 2D ERA Msg Builder Map Merger ERA Msg Interpreter GNU Radio ROS-GR Interface
Gazebo
Depth Camera
2D Map
Scan Occupancy Grid Map Occupancy Grid Map ERA Msg Pose Occupancy Grid Map ERA Msg Payload
IBM Research
15 February 2020
Costmap 2D ERA Msg Builder Map Merger ERA Msg Interpreter GNU Radio ROS-GR Interface
Gazebo
Depth Camera
2D Map
Scan Occupancy Grid Map Occupancy Grid Map ERA Msg Pose Occupancy Grid Map ERA Msg Payload
label label label label label label
IBM Research
16 February 2020
Costmap 2D ERA Msg Builder Map Merger ERA Msg Interpreter GNU Radio ROS-GR Interface
Gazebo
Depth Camera
2D Map
Scan Occupancy Grid Map Occupancy Grid Map ERA Msg Pose Occupancy Grid Map ERA Msg Payload
Transmitter Receiver
https://github.com/bastibl/gr-ieee802-11
IBM Research
17 February 2020
Costmap 2D ERA Msg Builder Map Merger ERA Msg Interpreter GNU Radio ROS-GR Interface
Gazebo
Depth Camera
2D Map
Scan Occupancy Grid Map Occupancy Grid Map ERA Msg Pose Occupancy Grid Map ERA Msg Payload
IBM Research
18 February 2020
Costmap 2D ERA Msg Builder Map Merger ERA Msg Interpreter GNU Radio ROS-GR Interface
Gazebo
Depth Camera
2D Map
Scan Occupancy Grid Map Occupancy Grid Map ERA Msg Pose Occupancy Grid Map ERA Msg Payload
USRP
Costmap 2D ERA Msg Builder Map Merger ERA Msg Interpreter GNU Radio ROS-GR Interface
Gazebo
Depth Camera
2D Map
Scan Occupancy Grid Map Occupancy Grid Map ERA Msg Pose Occupancy Grid Map ERA Msg Payload
USRP
Robot 1 Robot 2
802.11p
IBM Research
19 February 2020
Costmap 2D ERA Msg Builder Map Merger ERA Msg Interpreter GNU Radio ROS-GR Interface
Gazebo
Depth Camera
2D Map
Scan Occupancy Grid Map Occupancy Grid Map ERA Msg Pose Occupancy Grid Map ERA Msg Payload
Costmap 2D ERA Msg Builder Map Merger ERA Msg Interpreter GNU Radio ROS-GR Interface
Gazebo
Depth Camera
2D Map
Scan Occupancy Grid Map Occupancy Grid Map ERA Msg Pose Occupancy Grid Map ERA Msg Payload
Robot 1 Robot 2 socket
IBM Research
20 February 2020
IBM Research
Communication Fabric Sensing Fabric
V2V Communications To other control modules Real-Time Map Fusion Map Generation Computer Vision
21 February 2020
EPOCHS Reference Application
From RX ADC
TRANSMITTER FLOWGRAPH RECEIVER FLOWGRAPH
WiFi Encoding and Packet Generation OFDM Carrier Allocation IFFT Cyclic Prefix Generation WiFi MAC
From ROS To TX DAC
OFDM Frame Equalizer FFT Sync Long Sync Short Packet Decoder and Mac
To ROS
https://github.com/bastibl/gr-ieee802-11
IBM Research
Communication Fabric Sensing Fabric
V2V Communications To other control modules Real-Time Map Fusion Map Generation Computer Vision
22 February 2020
EPOCHS Reference Application
From RX ADC
TRANSMITTER FLOWGRAPH RECEIVER FLOWGRAPH
WiFi Encoding and Packet Generation OFDM Carrier Allocation IFFT Cyclic Prefix Generation WiFi MAC
From ROS To TX DAC
OFDM Frame Equalizer FFT Sync Long Sync Short Packet Decoder and Mac
To ROS
https://github.com/bastibl/gr-ieee802-11
0% 5% 10% 15% 20% 25% 30% 35%
cexpf viterbi_butterfly2 volk_32fc_32f_dot_prod… fastnoise_source_c_impl::work volk_32fc_x2_dot_prod… mulsc3
Execution Time (%)
Functions identified for acceleration
IBM Research
23 February 2020
5 10 15 20 25 30 35 40
CPU Baseline FPGA w/DMA CPU Vectorized FPGA w/DMA (Fully Optimized) CPU Cycles Computation Time Mem Copy Overhead
Execution time per cexpf operation
ARM Cortex-A53
IBM Research
24 February 2020
5 10 15 20 25 30 35 40
CPU Baseline FPGA w/DMA CPU Vectorized FPGA w/DMA (Fully Optimized) CPU Cycles Computation Time Mem Copy Overhead
Execution time per cexpf operation
ARM Cortex-A53 Xilinx UltraScaleMP+ ZYNQ ZCU102
FPGA Implementation (dual data-path pipeline)
(cos𝑐 + sin𝑐 𝑗) 𝑓,
IBM Research
25 February 2020
5 10 15 20 25 30 35 40
CPU Baseline FPGA w/DMA CPU Vectorized FPGA w/DMA (Fully Optimized) CPU Cycles Computation Time Mem Copy Overhead
Execution time per cexpf operation
ARM Cortex-A53 ARM’s SIMD extension (NEON) Xilinx UltraScaleMP+ ZYNQ ZCU102
FPGA Implementation (dual data-path pipeline)
(cos𝑐 + sin𝑐 𝑗) 𝑓,
IBM Research
26 February 2020
5 10 15 20 25 30 35 40
CPU Baseline FPGA w/DMA CPU Vectorized FPGA w/DMA (Fully Optimized) CPU Cycles Computation Time Mem Copy Overhead
Execution time per cexpf operation
Fully-optimized implementation (idealized bound) – 300 MHz (instead of 100 MHz) – Four parallel computation engines – Memory-copy elimination ARM Cortex-A53 ARM’s SIMD extension (NEON) Xilinx UltraScaleMP+ ZYNQ ZCU102
FPGA Implementation (dual data-path pipeline)
(cos𝑐 + sin𝑐 𝑗) 𝑓,
IBM Research
27
Computer System ROS/Gazebo 802.11p
Computer System ROS/Gazebo Computer System ROS/Gazebo UDP Computer System CarSim/Apollo 802.11p
Computer System CarSim/Apollo
February 2020
Version 1 Version 2 Version 3
IBM Research
28
Computer System ROS/Gazebo 802.11p
Computer System ROS/Gazebo Computer System ROS/Gazebo UDP Computer System CarSim/Apollo 802.11p
Computer System CarSim/Apollo
February 2020
Version 1 Version 2 Version 3 LAYER 1 World Simulators (sensor data source) LAYER 2 Automotive Platforms (perception, plan and control) LAYER 3 Coopera4ve Vehicles PlaIorm (swarming and V2X support) CarSim Gazebo CARLA LGSV Apollo Autoware ERA
802.11p 5G Multi-Vehicle Cooperation Logic
Apollo API Autoware API ROS GNU Radio
some raw sensor data is directly fed to ERA
IBM Research
29 February 2020
IBM Research
30 February 2020
IBM Research
Photo by Balthazar Korab Source: http://www.shorpy.com/node/15488