Enter the Bathysphere
Measuring Complexity-Effectiveness
- f Future-Generation Silicon
Architectures Using FPGAs
Andrew Schwerin Steven Swanson Mark Oskin
Enter the Bathysphere Measuring Complexity-Effectiveness of - - PowerPoint PPT Presentation
Enter the Bathysphere Measuring Complexity-Effectiveness of Future-Generation Silicon Architectures Using FPGAs Andrew Schwerin Steven Swanson Mark Oskin Simulation Methodology Have new idea while(not published) Hack simulator Run
Measuring Complexity-Effectiveness
Architectures Using FPGAs
Andrew Schwerin Steven Swanson Mark Oskin
June 03 2
Simulation Methodology
Have new idea while(not published) Hack simulator Run simulator Refine idea
June 03 3
Simulation Drawbacks
Underestimate delay Choose unrepresentative data Subtle bug
June 03 4
Custom Prototyping
June 03 5
Can we have it all?
June 03 6
The Bathysphere
exploration vehicle!
Substrate
June 03 7
Bathysphere
Each node
SDRAM High-density 1M gate FPGA Total
June 03 8
Design Model
June 03 9
Bathysphere Advantages
– Brings physical constraints to fore – Faster than software simulation – Cheap: Approx $50k – Lots of iterations
– Different than a QuickTurn
June 03 10
FPGA-ASIC Mismatch
– Late binding of functionality costs flexibility
June 03 11
Challenges for the Bathysphere
June 03 12
Problem: Multiported Memories
– Multiported memories (e.g., register files) – Large memories (e.g., caches)
– Limited onboard memory resources
– Limited bandwidth to external memories
June 03 13
Problem: Multiportedness
How do we use this:
…
Pn P2 P1 n-port RAM To build this?
1-port RAM P1
June 03 14
Let the Tools Handle It?
A Memory:
% of resources to implement
– In 35 nm silicon
– In the bathysphere
June 03 15
Memory Ports vs. Area
0.5 1 1.5 2 2.5 3 3.5 4 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 Ports Percent of Resources FPGA Write Ports FPGA Read Ports ASIC Write Ports ASIC Read Ports Virtex 1000 Capacity
June 03 16
Memory Ports vs Latency
5 10 15 20 25 30 35 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 Ports Access Latency (ns) FPGA Write Ports FPGA Read Ports ASIC Write Ports ASIC Read Ports
June 03 17
Multiportedness Solution
– Create 2n+1 µ-cycles – One µ-cycle per read – One µ-cycle for logic – One µ-cycle per write
multiplexing possible
1-port RAM
…
Pn P2 P1
*
n-port RAM
June 03 18
Content Addressable Memories
– Extremely resource intensive in FPGAs – But, you can build a small one if you need – Or, use the off-chip RAM to back a hash table
June 03 19
Bathysphere Communication
nearest neighbors
communication via adjoining FPGAs
communication costs
June 03 20
The Bathysphere Methodology
June 03 21
“This is your bathysphere” What would make it useful to you?