WWRF12 Meeting, WWRF12 Meeting, 4- -5 November 2004, 5 November - - PowerPoint PPT Presentation

wwrf12 meeting wwrf12 meeting 4 5 november 2004 5
SMART_READER_LITE
LIVE PREVIEW

WWRF12 Meeting, WWRF12 Meeting, 4- -5 November 2004, 5 November - - PowerPoint PPT Presentation

WWRF12 Meeting, WWRF12 Meeting, 4- -5 November 2004, 5 November 2004, 4 Toronto, Canada Toronto, Canada Reconfigurable Architectures for Wireless Systems: Design Exploration and Integration Challenges Joseph R. Cavallaro, Michael C.


slide-1
SLIDE 1

Reconfigurable Architectures for Wireless Systems: Design Exploration and Integration Challenges

Joseph R. Cavallaro, Michael C. Brogioli, Alexandre de Baynast, and Predrag Radosavljevic Rice University, Houston, TX USA {cavallar, brogioli, debaynas, rpredrag}@rice.edu www.ece.rice.edu/~cavallar

WWRF12 Meeting, WWRF12 Meeting, 4 4-

  • 5 November 2004,

5 November 2004, Toronto, Canada Toronto, Canada

slide-2
SLIDE 2

Page 2 WWRF12 Meeting, 4-5 November 2004, Toronto, Canada

Outline

Background – ASIPs Context on WG6 Issues Hardware Partitioning and Design Exploration Imagine and TTA Interconnect Challenges System Simulation

slide-3
SLIDE 3

Page 3 WWRF12 Meeting, 4-5 November 2004, Toronto, Canada

Background

WWRF11 Oslo June 2004

ASIP Architecture for Future Wireless Systems: Flexibility and Customization - Joseph Cavallaro and Predrag Radosavljevic

Application Specific Instruction Processor Design Flow

Channel Equalization for HSDPA in MIMO Environment Multiple Equalizer Algorithms to same Architecture Design Exploration for Area, Time, Power Constraints

Issues Remain in System Integration

slide-4
SLIDE 4

Page 4 WWRF12 Meeting, 4-5 November 2004, Toronto, Canada

Context of WG6

White Paper on:

Element Management, Flexible Air Interfaces, SDR

4.3 SDR Baseband Reference Models and Architectures 4.6 4G/Beyond 3G Verification Tools

  • PRAGA Platform and Design Flow

Cognitive Radio, Spectrum and Radio Resource Management

6 Enabling Technologies 6.1 Cognitive Radio

XG and JTRS Initiatives

slide-5
SLIDE 5

Page 5 WWRF12 Meeting, 4-5 November 2004, Toronto, Canada

Processors in Future Wireless Systems

  • ASIPs (Application Specific Instruction Processors):

Excellent tradeoff between efficiency of ASICs and flexibility of DSPs

slide-6
SLIDE 6

Page 6 WWRF12 Meeting, 4-5 November 2004, Toronto, Canada

ASIP Architecture Design

  • Flexible processors for mobile handsets:

Different modifications of wireless base-band algorithms (processing in slow/fast fading, low/high scattering environments) Support for evolution of standards (3GPP, 4G, 802.11x, WiFi, etc)

  • Efficient processors to achieve high-demanding real time

requirements:

Customized architecture is needed Extension of ASIP instruction set with application-specific

  • perations
  • Examples: Imagine Media Processor and Transport Triggered

Architecture (TTA)

slide-7
SLIDE 7

Page 7 WWRF12 Meeting, 4-5 November 2004, Toronto, Canada

Design Exploration Strategies

Data-Parallel Systems

  • Algorithm

mapping: Design of algorithms for efficient mapping and performance Architecture scaling: Having designed the algorithms, find a low power processor Workload adaptation: Having designed the processor, improve power at run-time

slide-8
SLIDE 8

Page 8 WWRF12 Meeting, 4-5 November 2004, Toronto, Canada

Example MIMO Downlink Equalization for 3G HSDPA

  • Physical layer of mobile handset in MIMO downlink

ASIP architecture based on TTA

  • Flexible architecture solution for different modifications of

channel equalization algorithm

  • Highly optimized for the most computationally complex

version of channel equalization

slide-9
SLIDE 9

Page 9 WWRF12 Meeting, 4-5 November 2004, Toronto, Canada

Example TTA Equalizer ASIP

slide-10
SLIDE 10

Page 10 WWRF12 Meeting, 4-5 November 2004, Toronto, Canada

Example Imagine 3G BS ASIP

slide-11
SLIDE 11

Page 11 WWRF12 Meeting, 4-5 November 2004, Toronto, Canada

Interconnect Challenges

System partitioning and interconnect modeling At higher level:

Fabric between processors and co-processors, memory, and peripherals

At lower level:

aggressive process technology scaling increasing operating frequencies delay, noise, and power problems

Massive network servers down to mobile wireless handheld devices.

slide-12
SLIDE 12

Page 12 WWRF12 Meeting, 4-5 November 2004, Toronto, Canada

Intra- and Inter-chip Communication

Interfaces between DSP, ASIP, ASIC, FPGA Example 3G Multi-user Detector on Multiple DSP-FPGA Bus-based Vbus in TI C6X DSP SoC Core Socket-based OCP-IP Initiative for Host and Multiple Co-

Processor Cores

slide-13
SLIDE 13

Page 13 WWRF12 Meeting, 4-5 November 2004, Toronto, Canada

System Simulation: Spinach Composable Software Modules

Make software modules act and connect like real hardware

1:1 Mapping between software modules and real hardware

  • components. . .facilitates intuitive hardware modeling

Well abstracted port API.

  • Example: Processors only knowledge of “memory” state is through

requests to imem/dmem via port interfaces.

Accurately supports asynchronous events

Eliminating global state = modularity and composability.

Rapidly prototype systems in minutes (not hours/days) No global machine state means less software engineering

  • verhead

Enables complete flexibility, configurability

Higher-level than Mentor Seamless SoC Simulator

slide-14
SLIDE 14

Page 14 WWRF12 Meeting, 4-5 November 2004, Toronto, Canada

Spinach Modules

  • Processing Elements

Bit true cycle accurate TI C6x DSPs FPGA based Coprocessors (user defined, fully flexible) MIPS R4000 microcontrollers

  • Memory System

Bus arbiters, multiported memories Caches and cache controllers, SRAM and DRAM controllers.

  • Interconnect

Mux, demux, pipe delays, user defined functions.

  • Input/Output

DMA assists, medium access assists, I/O harness

  • Support for multiple clock domains
slide-15
SLIDE 15

Page 15 WWRF12 Meeting, 4-5 November 2004, Toronto, Canada

Typical Simulator Configuration

On chip instruction memory Memory Bus Arbiter Memory Bus Arbiter MIPS Microcontroller Coproc Mem

TI C6x DSP FPGA Coproc MIPS R4K DMA 0 DMA N

Memory Bus Arbiter On chip data memory

slide-16
SLIDE 16

Page 16 WWRF12 Meeting, 4-5 November 2004, Toronto, Canada

Case Study:

Coprocessor based Matrix Multiplication

  • Idea: Use custom FPGA coprocessor with DSP.

Run dot product of matrix multiply vectors on coprocessor. Use host DSP for syncronization, DMA control, all other code.

  • Simulated System.

167MHz TI C62x DSP 64k single cycle on-chip instruction and data memories. Coprocessor software controlled via memory mapped registers.

  • Asynchronous. More on this later…

Data transfers to coprocessor via on-chip DMA engines.

slide-17
SLIDE 17

Page 17 WWRF12 Meeting, 4-5 November 2004, Toronto, Canada

Experimental Setup

  • Workloads:

16 bit fixed point matrix multiply kernels compiled at –O3 in CCS Array sizes and offsets known statically at compile time

  • Compiler can aggressively schedule and unroll loops.

DSP controls DMA of data to/from coprocessor

  • Coprocessor Specifications.

50 11 21 N/a 50 5 9 N/a 50 3 5 N/a 50 2 3 N/a 50 2 2 N/a

DSP only 8 wide coproc 16 wide coproc

16 element dot product 32 element dot product 64 element dot product 128 element dot product 8 element dot product

slide-18
SLIDE 18

Page 18 WWRF12 Meeting, 4-5 November 2004, Toronto, Canada

Future Directions for 4G

System on Chip (SOC)

Integration Challenges with DSP – FPGA/ASIP Co-Design, Simulation, Verification – Very Error Prone – Needs Standardized Interfaces

Gigabit/sec Systems

Modular Terminals will Support from Voice to over 1 Gbps Wireless through Several Interfaces Needs Unified, Extensible, Parameterized Signal Processing Architecture Family with Integration to Host Controller. OFDM, MC-CDMA in High Mobility

slide-19
SLIDE 19

Page 19 WWRF12 Meeting, 4-5 November 2004, Toronto, Canada

Research Topics and Challenges

Architectures and Design Environments (From algorithm mapping

and analysis up to SoC and Reconfigurable):

Hardware Abstraction Layer (HAL) development for reconfigurable

SoC systems

DSP host to ASIP and ASIC co-processor system integration for

System on Chip (SoC) design

SoC simulation environment based on DSP with programmable

ASIP co-processors and fixed ASIC blocks