A New DSP Approach for 5G and AI Albert Camilleri VP Business - - PowerPoint PPT Presentation

a new dsp approach for 5g and ai
SMART_READER_LITE
LIVE PREVIEW

A New DSP Approach for 5G and AI Albert Camilleri VP Business - - PowerPoint PPT Presentation

A New DSP Approach for 5G and AI Albert Camilleri VP Business Development North America VSORA Inc. Company Background Company founded in 2015 Headquarters: France Paris Each founder has more than 10 years experience in Digital


slide-1
SLIDE 1

A New DSP Approach for 5G and AI

Albert Camilleri VP Business Development North America VSORA Inc.

slide-2
SLIDE 2
  • Company founded in 2015
  • Headquarters: France
  • Each founder has more than 10 years experience in Digital Signal

Processor (DSP) design, working in global consumer markets

  • Previous founders’ designs widely used in successful consumer,

automotive and industrial high volume products

Paris

San Diego Taipei Shenzhen Tokyo

Company Background

The information contained in this document is confidential and shall not be disclosed to third parties without a written consent from Vsora

slide-3
SLIDE 3

Reinventing ‘Digital Signal Processing’ (DSP)

The information contained in this document is confidential and shall not be disclosed to third parties without a written consent from Vsora

Wireless

Comms RF Baseband

Channel Encoder / Decoder

App Processor

DSP

Data DSP INFERENCE

Trained Model

Result

App Processor Artificial Intelligence Artificial Intelligence (Terminals / Edge)

  • Neural Networks
  • Image / video
  • Speech recognition / Audio
  • Language Translation

5G Wireless Communications

  • mmWave, MiMo, Beamforming, Carrier Aggregation
  • Enhanced 1Gbps+ Mobile Broadband
  • Massive Machine Type Comms, Smart Home / Cities
  • Ultra reliable low latency comms (< 1ms), IoT
  • New Short Range Wireless, 802.11af, ay, bb (LiFi)
  • Both terminals and infrastructure
slide-4
SLIDE 4

Traditional Architecture Limits Flexibility

  • Single threaded processors falling

further behind 1 Gbps+ demand

  • Bespoke, fixed algorithm, co-processors

increase the well known ASIC problems

  • Inflexible, hard to mature quickly,

inappropriate in the new world of rapid standards evolutions

The information contained in this document is confidential and shall not be disclosed to third parties without a written consent from Vsora

Host CPU + Memory Signals In / Out

slide-5
SLIDE 5

The Memory Bottleneck Problem

Signal Memory bottleneck will stall and limit the promise of 5G and AI

  • Need for ever greater symbol word length and

depth

  • Signal Memory (Cache) I/O bandwidth explosion
  • 5G modems and Massively Parallel Neural

Network Processors are predominantly built on the same DSP type architectures today

The information contained in this document is confidential and shall not be disclosed to third parties without a written consent from Vsora

slide-6
SLIDE 6
  • Completely configurable:
  • Number of ALUs
  • Memory size
  • Quantization (IEEE754 like), i.e.

number of exponent/mantissa bits

  • Liberates the “Bottleneck”
  • Signal (cache) memory more tightly

coupled

  • Signals manager pre-configures

signal data

  • DSP is tightly controlled by the

host processor

The information contained in this document is confidential and shall not be disclosed to third parties without a written consent from Vsora

VSORA MPU

ALUs

High BW Signal Memory Signals Manager

+ / x ACC

a1 b1 c1 1 2 4 3

+ / x ACC

a2 b2 c2

+ / x ACC

a3 b3 c3

+ / x ACC

a4 b4 c4

Host Processor

Introducing the Matrix Processor Unit (MPU)

slide-7
SLIDE 7

The information contained in this document is confidential and shall not be disclosed to third parties without a written consent from Vsora

Single-core / Multi-core Architecture

  • MPUs are programmed at an

algorithm level in C++ with a MATLAB like API

  • High-level simulation methodology

provides performance/power/area trade-off data

  • Can be modified and iterated at

the algorithmic level to attempt 100% DSP utilization

  • Algorithm code compiled directly

to DSP via modified LLVM compiler

  • No low level code required
  • Engineering productivity enhancer

Completely configurable in terms of:

  • The number of cores (single/multi-core)
  • The number of DMAs/core

Multi-Core MPU Vsora MPU-1 Vsora MPU-4 Vsora MPU-2 Vsora MPU-3

Signals In / Out Host CPU + Memory

Ability to map complex systems onto multiple cores, and dimension optimal solutions.

slide-8
SLIDE 8

The information contained in this document is confidential and shall not be disclosed to third parties without a written consent from Vsora

AI Supported Frameworks

VSORA AI Framework Load Tool Graph Optimization Model Quantization VSORA Library VSORA AI-DSP Compiler

ALUs

High BW Signal Memory Signals Manager

+ / x ACC a1 b1 c1 1 2 4 3 + / x ACC a2 b2 c2 + / x ACC a3 b3 c3 + / x ACC a4 b4 c4

slide-9
SLIDE 9
  • Fully programmable Solution
  • TensorFlow, PyTorch, …, supported frameworks
  • Configurable:
  • Number of MACs: 256, 1024, 2304, 4096, 6400, 9216, 12544, 16384, …, 65536
  • IEEE754 Quantization: number of bits (sign/exponent/mantissa)
  • Number of DMAs
  • High MPU processing efficiency
  • Does not suffer memory bandwidth bottleneck to load large numbers of MACs

VSORA AI Solution

The information contained in this document is confidential and shall not be disclosed to third parties without a written consent from Vsora

slide-10
SLIDE 10

The information contained in this document is confidential and shall not be disclosed to third parties without a written consent from Vsora

Reinvented Development Flow

Drawbacks

  • Four different, large engineering teams
  • Very slow process, exceedingly expensive

Algorithm Definition Implementation

  • Wired logic

(DSP Co-Pro)

  • DSP

Link Layer Software Development ASIC Hardware Integration

months High-Level Code

  • r Specifications

Algo Engineers [MATLAB] DSP Engineers [C/C++/Verilog/VHDL] Binary Code API Definition HW Engineers [Verilog, System Verilog] SW Engineers [C/C++]

Simulation Platform

Algorithms definition MSP dimensioning

min Algo Engineers [MATLAB Like]

Link Layer Software Development ASIC Hardware Integration

HW Engineers [Verilog, System Verilog] SW Engineers [C/C++] MSP Configuration

Benefits

  • Reduced personnel
  • Fast algorithm definition and DSP

dimensioning

  • Easy integration of Signal Processing &

Embedded SW code

High Level Code

Algorithms Rework Required Signal Processing Related HW

slide-11
SLIDE 11

Summary

The information contained in this document is confidential and shall not be disclosed to third parties without a written consent from Vsora

Highly configurable “tiled” solution

  • “Unlimited” number of Cores
  • Scalable memory/DMA bandwidth avoids bottlenecks

Eliminates need for inflexible co-processors

  • Flexible coding: mix signal processing and link-layer/neural-processing SW

Implementation independent, high-level programmability

  • Supports design flexibility to facilitate market evolution

Tiered simulation platforms

  • MATLAB/Tensorflow level, FPGA (Cloud) platform, IP/RTL simulation

Compiler technology empowers 100% DSP utilization

  • Optimizes engineering efficiency
  • Facilitates performance/area/power tradeoffs
slide-12
SLIDE 12

Thank You