Enter the Bathysphere Measuring Complexity-Effectiveness of - - PowerPoint PPT Presentation

enter the bathysphere
SMART_READER_LITE
LIVE PREVIEW

Enter the Bathysphere Measuring Complexity-Effectiveness of - - PowerPoint PPT Presentation

Enter the Bathysphere Measuring Complexity-Effectiveness of Future-Generation Silicon Architectures Using FPGAs Andrew Schwerin Steven Swanson Mark Oskin Simulation Methodology Have new idea while(not published) Hack simulator Run


slide-1
SLIDE 1

Enter the Bathysphere

Measuring Complexity-Effectiveness

  • f Future-Generation Silicon

Architectures Using FPGAs

Andrew Schwerin Steven Swanson Mark Oskin

slide-2
SLIDE 2

June 03 2

Simulation Methodology

Have new idea while(not published) Hack simulator Run simulator Refine idea

  • Quick to implement
  • Short iteration period
slide-3
SLIDE 3

June 03 3

Simulation Drawbacks

  • Difficult to validate
  • Slow to execute

Underestimate delay Choose unrepresentative data Subtle bug

slide-4
SLIDE 4

June 03 4

Custom Prototyping

  • Validates assumptions
  • Expensive
  • Time consuming
  • Labor intensive
slide-5
SLIDE 5

June 03 5

Can we have it all?

  • Short iteration period?
  • Low incremental cost?
  • Fast execution?
  • Validation?
slide-6
SLIDE 6

June 03 6

The Bathysphere

  • Deep submicron

exploration vehicle!

  • ASIC model
  • FPGA Implementation

Substrate

slide-7
SLIDE 7

June 03 7

  • 16 nodes / board

Bathysphere

  • 4 boards

Each node

  • Virtex 1k FPGA
  • 2x1MB SDRAM

SDRAM High-density 1M gate FPGA Total

  • 64M logic gates
  • 128 MB RAM
slide-8
SLIDE 8

June 03 8

Design Model

slide-9
SLIDE 9

June 03 9

Bathysphere Advantages

  • An Architecture Research Methodology

– Brings physical constraints to fore – Faster than software simulation – Cheap: Approx $50k – Lots of iterations

  • Not just an emulation system

– Different than a QuickTurn

slide-10
SLIDE 10

June 03 10

FPGA-ASIC Mismatch

  • Bathysphere is not an ASIC

– Late binding of functionality costs flexibility

  • Work around FPGA-unfriendly structures

slide-11
SLIDE 11

June 03 11

Challenges for the Bathysphere

  • Multiported Memories
  • Content-addressable Memories
  • Inter-FPGA Bandwidth
slide-12
SLIDE 12

June 03 12

Problem: Multiported Memories

  • Multiported memories are everywhere
  • Silicons provides

– Multiported memories (e.g., register files) – Large memories (e.g., caches)

  • FPGAs provide

– Limited onboard memory resources

  • 16 KB on Virtex 1000
  • Single, or at most dual-ported structures

– Limited bandwidth to external memories

slide-13
SLIDE 13

June 03 13

Problem: Multiportedness

How do we use this:

  • Single-ported memories

Pn P2 P1 n-port RAM To build this?

  • n-ported memories

1-port RAM P1

slide-14
SLIDE 14

June 03 14

Let the Tools Handle It?

A Memory:

  • 7 write ports
  • 512 words
  • 64-bit words

% of resources to implement

  • Red box

– In 35 nm silicon

  • Blue box

– In the bathysphere

slide-15
SLIDE 15

June 03 15

Memory Ports vs. Area

0.5 1 1.5 2 2.5 3 3.5 4 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 Ports Percent of Resources FPGA Write Ports FPGA Read Ports ASIC Write Ports ASIC Read Ports Virtex 1000 Capacity

slide-16
SLIDE 16

June 03 16

Memory Ports vs Latency

5 10 15 20 25 30 35 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 Ports Access Latency (ns) FPGA Write Ports FPGA Read Ports ASIC Write Ports ASIC Read Ports

slide-17
SLIDE 17

June 03 17

Multiportedness Solution

  • Time-multiplexing
  • Split logical cycle

– Create 2n+1 µ-cycles – One µ-cycle per read – One µ-cycle for logic – One µ-cycle per write

  • More sophisticated

multiplexing possible

1-port RAM

Pn P2 P1

*

n-port RAM

slide-18
SLIDE 18

June 03 18

Content Addressable Memories

  • Common in architectures
  • Straightforward to build in silicon
  • Require parallel access to memory words

– Extremely resource intensive in FPGAs – But, you can build a small one if you need – Or, use the off-chip RAM to back a hash table

slide-19
SLIDE 19

June 03 19

Bathysphere Communication

  • 76 I/O pins to 4

nearest neighbors

  • Long distance

communication via adjoining FPGAs

  • Must confront

communication costs

slide-20
SLIDE 20

June 03 20

The Bathysphere Methodology

slide-21
SLIDE 21

June 03 21

“This is your bathysphere” What would make it useful to you?