http://www.c2s2.org
Amin Firoozshahian, Alex Solomatnikov
Hicamp Systems Inc.
Ofer Shacham, Zain Asgar, Stephen Richardson, Christos Kozyrakis, Mark Horowitz
Stanford University
A Memory System Design Framework: Creating Smart Memories Amin - - PowerPoint PPT Presentation
A Memory System Design Framework: Creating Smart Memories Amin Firoozshahian, Alex Solomatnikov Hicamp Systems Inc. Ofer Shacham, Zain Asgar, http://www.c 2 s 2 .org Stephen Richardson, Christos Kozyrakis, Mark Horowitz Stanford University An
http://www.c2s2.org
Hicamp Systems Inc.
Stanford University
IBM Cell Intel Nehalem Sun Rock 2 Amin Firoozshahian
Sun Rock
4 Amin Firoozshahian
Amin Firoozshahian 5
Amin Firoozshahian
$ Cache Controller Proc Proc $ Cache Controller Interconnect Memory miss
Msg
6
Amin Firoozshahian
Local Mem Interconnect
Memory Proc Local Mem DMA DMA Proc 7
$ Commit Controller $ Commit Controller Interconnect Memory Addr. FIFO Addr. FIFO Proc Proc 8 Amin Firoozshahian
Amin Firoozshahian 9
[K. Mai et.al., “Architecture and Circuit Techniques for a Reconfigurable Memory Block,” IEEE International Solid-State Circuits Conference, February 2004 10 Data State Cmp Update Address Opcode
Amin Firoozshahian 11
Amin Firoozshahian 12
Msg
Unit 1 Unit 3
Msg
Unit 2
13 Amin Firoozshahian
Tracking State Update Data Movement
USHR MSHR Line Buffers
Processor Interface Network Interface
Interrupt
DMA DMA DMA
To/From Processors To/From Network To/From local storages
14 Amin Firoozshahian
Amin Firoozshahian 15 Tracking State Update Data Movement
USHR MSHR Line Buffers
Processor Interface Network Interface
Interrupt
DMA DMA DMA Miss
Read Miss Evict
Access Tags
Line Read
Access Data
WB / Miss
Spill Read Miss
Read Miss
Read Miss
16 Amin Firoozshahian
7.77mm 7.77mm 17
Test Chip 18 Amin Firoozshahian
Amin Firoozshahian 19
Amin Firoozshahian 20 Coherent Shared Memory Streaming Transactions
1 2 3 4 5 6 7 8 9 FFT MPEG2 Enc Barnes FMM 179.art Bitonic Sort MPEG2 Enc Barnes MP3D Cycles
Average Read Latency - 32 processor system
Real Controllers Ideal controllers
Amin Firoozshahian 21 Coherent Shared Memory Streaming Transactions
10.64 14.51 24.29 6.93 7.58 1.88 14.14 8.33 20.03 5 10 15 20 25 30
Overhead (%)
Average Overhead (%)
Amin Firoozshahian 22