. . . .
An Initial Characterization
- f the Emu Chick
Eric Hein, Tom Conte, (ECE) Jeff Young, (CS) Srinivas Eswar, Jiajia Li, Patrick Lavin, Richard Vuduc, Jason Riedy (CSE)
5/21/2018
An Initial Characterization of the Emu Chick Eric Hein, Tom Conte, - - PowerPoint PPT Presentation
An Initial Characterization of the Emu Chick Eric Hein, Tom Conte, (ECE) Jeff Young, (CS) Srinivas Eswar, Jiajia Li, Patrick Lavin, Richard Vuduc, Jason Riedy (CSE) 5/21/2018 . . . . Migratory Memory-side Processing Main innovation:
. . . .
5/21/2018
. . . .
2
5/21/2018
. . . .
3
5/21/2018
. . . .
4
5/21/2018
. . . .
5
5/21/2018
. . . .
6
5/21/2018
7
Emu Nodelet Emu Node Card (8 nodelets) Emu Chick (8 nodes) Emu1 Rack (256 nodes) Current Future Current Future Current Future # of cores 1 core 4 cores 8 cores 32 cores 64 cores 256 cores 8192 cores # of threads 64 256 512 2048 4096 16384 > 2 million Memory capacity 2 GiB 8 GiB 16 GiB 64 GiB 128 GiB 512 GiB 16 TiB # of 8-bit DDR4 channels 1 channel 1 channel 8 channels 8 channels 64 channels 64 channels 2048 Memory bandwidth 120 MB/s 2.5 GB/s 1.2 GB/s 20 GB/s 8 GB/s 160 GB/s 5.12 TB/s
Images and data from www.emutechnology.com
. . . .
8
5/21/2018
https://www.cilkplus.org/tutorial-cilk-plus-keywords#cilk_for
. . . .
9
5/21/2018
10
Nodelet 2 Nodelet 1 Nodelet 0 Nodelet 3
Serial Spawn Recursive Remote Spawn Serial Remote Spawn
. . . .
11
5/21/2018
~140 MB/s per nodelet ~1.2 GB/s per node (8 nodelets)
. . . .
12
5/21/2018
. . . .
13
5/21/2018
. . . .
14
5/21/2018
. . . .
15
5/21/2018
. . . .
16
5/21/2018
. . . .
17
5/21/2018
. . . .
18
5/21/2018
. . . .
19
5/21/2018
. . . .
20
5/21/2018
. . . .
21
5/21/2018
. . . .
22
5/21/2018
. . . .
23
5/21/2018
When configured to match the current hardware specifications, the simulator results match closely for local stream and global stream.
. . . .
24
5/21/2018
. . . .
25
5/21/2018