Faculty of Computer Science, Institute for System Architecture, Operating Systems Group
M3: INTEGRATING ARBITRARY COMPUTE UNITS AS FIRST-CLASS CITIZENS
OS: Nils Asmussen, Hermann H¨ artig, Marcus V ¨
- lp
EE: Benedikt N ¨
- then, Gerhard Fettweis
M 3 : INTEGRATING ARBITRARY COMPUTE UNITS AS FIRST-CLASS CITIZENS - - PowerPoint PPT Presentation
Faculty of Computer Science, Institute for System Architecture, Operating Systems Group M 3 : INTEGRATING ARBITRARY COMPUTE UNITS AS FIRST-CLASS CITIZENS OS: Nils Asmussen, Hermann H artig, Marcus V olp EE: Benedikt N othen, Gerhard
Faculty of Computer Science, Institute for System Architecture, Operating Systems Group
OS: Nils Asmussen, Hermann H¨ artig, Marcus V ¨
EE: Benedikt N ¨
[1] Thin servers with smart pipes: Designing SoC accelerators for memcached, ISCA’13 [2] PuDianNao: A polyvalent machine learning accelerator, ASPLOS’15 Nils Asmussen Slide 2 of 16
Intel Xeon Intel Xeon
DSP DSP Audio Decoder FPGA
Nils Asmussen Slide 3 of 16
Intel Xeon Intel Xeon
DSP DSP Audio Decoder FPGA Kernel Kernel
Nils Asmussen Slide 3 of 16
Intel Xeon Intel Xeon
DSP DSP Audio Decoder FPGA Kernel Kernel Kernel Kernel
Nils Asmussen Slide 3 of 16
Intel Xeon Intel Xeon
Kernel Kernel Kernel Kernel
Nils Asmussen Slide 3 of 16
1
2
3
Nils Asmussen Slide 4 of 16
Nils Asmussen Slide 5 of 16
Intel Xeon Intel Xeon
DSP DSP Audio Decoder FPGA
Mem Mem Mem Mem Mem Mem Mem Mem
Asmussen et al.: M3: A Hardware/OS Co-Design to Tame Heterogeneous Manycores, ASPLOS’16 Nils Asmussen Slide 6 of 16
Intel Xeon Intel Xeon
DSP DSP Audio Decoder FPGA
Mem Mem Mem Mem Mem Mem Mem Mem DTU DTU DTU DTU DTU DTU DTU DTU
Asmussen et al.: M3: A Hardware/OS Co-Design to Tame Heterogeneous Manycores, ASPLOS’16 Nils Asmussen Slide 6 of 16
Intel Xeon Intel Xeon
DSP DSP Audio Decoder FPGA
Mem Mem Mem Mem Mem Mem Mem Mem DTU DTU DTU DTU DTU DTU DTU DTU
PE PE PE PE PE PE PE PE
Asmussen et al.: M3: A Hardware/OS Co-Design to Tame Heterogeneous Manycores, ASPLOS’16 Nils Asmussen Slide 6 of 16
Intel Xeon Intel Xeon
DSP DSP Audio Decoder FPGA
Mem Mem Mem Mem Mem Mem Mem Mem DTU DTU DTU DTU DTU DTU DTU DTU
PE PE PE PE PE PE PE PE App App App App App App App Kernel
Asmussen et al.: M3: A Hardware/OS Co-Design to Tame Heterogeneous Manycores, ASPLOS’16 Nils Asmussen Slide 6 of 16
1
2
3
Nils Asmussen Slide 7 of 16
Nils Asmussen Slide 8 of 16
Xtensa LX4
Instr. SPM Data SPM DTU PE PE PE PE PE PE PE DRAM
R R R R R R R R R
PE
Mem Ctrl.
Nils Asmussen Slide 9 of 16
DRAM Ctl
VM
SPM
VM
SPM
Nils Asmussen Slide 10 of 16
Nils Asmussen Slide 11 of 16
Nils Asmussen Slide 12 of 16
Nils Asmussen Slide 12 of 16
Nils Asmussen Slide 13 of 16
Mem DTU
S Mem DTU
Mem DTU
S R S R
Nils Asmussen Slide 14 of 16
Mem DTU
S Mem DTU
Mem DTU
S R S R
Nils Asmussen Slide 14 of 16
Mem DTU
S Mem DTU
Mem DTU
S R S R
Nils Asmussen Slide 14 of 16
Mem DTU
S Mem DTU
Mem DTU
S R S R M
Nils Asmussen Slide 14 of 16
M3 Lx
M3 Lx
M3 Lx
M3 Lx
App Xfers OS
Nils Asmussen Slide 15 of 16
Nils Asmussen Slide 16 of 16