Embedded Supercomputing: (to combat the malaise that is big - - PowerPoint PPT Presentation

embedded supercomputing
SMART_READER_LITE
LIVE PREVIEW

Embedded Supercomputing: (to combat the malaise that is big - - PowerPoint PPT Presentation

Embedded Supercomputing: (to combat the malaise that is big sofuware) Radio Astronomy at the Limit Simon Ratclifge & Bruce Merry SKA South Africa JSNTB In the beginning.... more recently = Johannes Hevelius 60fu - 1673 (the Ewan


slide-1
SLIDE 1

(to combat the malaise that is big sofuware)

Embedded Supercomputing: Radio Astronomy at the Limit

SKA South Africa

Simon Ratclifge & Bruce Merry

JSNTB

slide-2
SLIDE 2

In the beginning....

slide-3
SLIDE 3

more recently

=

Johannes Hevelius 60fu - 1673

(the Ewan McTeagle of his day)
slide-4
SLIDE 4

everything we know so far

http://xkcd.com/273/

slide-5
SLIDE 5

getting the full picture

+ = = +

Jansky VLA, New Mexico

slide-6
SLIDE 6

Watt?

1 Jy = 10-26 Wm-2Hz-1

I cannae do it, captain, ye cannae change the laws of physics

slide-7
SLIDE 7

how much wood could a wood chuck, chuck

0.00000000000007 J 0.00000016 J 1.008 J

slide-8
SLIDE 8

“Know your enemy and know yourself”

105 Jy 108 Jy 0 Jy

Sun @ 5 GHz GSM Phone @ 1km 'Smart' Phone @ 1m

slide-9
SLIDE 9

something a little bigger

slide-10
SLIDE 10 X X X X X X

SKY Image

Detect & amplify Digitise & delay Correlate Process

Calibrate, grid, FFT

Integrate

s B . s Astronomical signal (EM wave)

photon to image

slide-11
SLIDE 11

Will it blend ?

Andrew Cooper

1000000000000000000 B

@ 1 bit per grain of standardised* sand

* assumptions apply (10gpmm^3, 3kmx2kmx700m, only valid when calculated on paper napkin, just say no to assumptions)

X

slide-12
SLIDE 12

You sir, are a blaggard and a coward

1 exbibyte

  • 1 exabyte

38, 230 x

slide-13
SLIDE 13

Big Iron

slide-14
SLIDE 14

Medium Iron (and a fair bit of aluminium)

slide-15
SLIDE 15

In theory

IO / Cache / FLOPS / kW / $ Vij = Mij Bij Gij Dij Eij Pij Tij Vij

IDEAL

MAGIC

MeerKAT64

slide-16
SLIDE 16

“My god it's full of data”

68

Gibps

input data rate

50

hour

  • bservation

1

PiB

bufger

@ =

slide-17
SLIDE 17

50 hour totals

1.1

Tibps

bufger read rate

1.7

ExaFlop

total FP operations

1.9

TB

working memory

slide-18
SLIDE 18

easy…..

slide-19
SLIDE 19

whither Mr. Fusion ?

20

kW

available power

slide-20
SLIDE 20

rosetta stone (edition 2010)

slide-21
SLIDE 21

...shake your windows, and rattle your walls

TK1: 327 GFLOPs / 12.7 GiBps

slide-22
SLIDE 22

measurement equation (redux)

V (u,v,w)=∫ A (l,m,w ) I (l,m) e−2 πi [ul+vm]dldm

input efgects image Fourier transform baseline direction

slide-23
SLIDE 23

convolutional gridding

V (u,v,w )=∫ A (l,m,w ) I (l,m) e−2 πi [ul+vm]dldm

slide-24
SLIDE 24

Working Model

8 Hours, 64 Antennas, Single Channel, 4k Image

slide-25
SLIDE 25

I cannae push it any faster, Captain!

slide-26
SLIDE 26

Trade-ofg between gridding and FFT costs

slide-27
SLIDE 27

Anatomy of a modern radio telescope

slide-28
SLIDE 28

Graphs all the way down until you hit the turtles...

slide-29
SLIDE 29

How embarrassing is your parallelism ?

slide-30
SLIDE 30

ignore the logo, it's clearly an internal design.

slide-31
SLIDE 31

How big ?

peltier exchange

Ground Loop

Hive

FERRO

100Nodes 50TFLOPS 0.4TB RAM 50TB SSD 40Gbps Eth 0.7kW

pre cambrian cooling inc.

backplane

SSD SoC Carrier Thermal Compound

slide-32
SLIDE 32

deep fried

slide-33
SLIDE 33

The build TEGRA X1 TESLA K40

1056 Nodes

Tegra X1 4 GB RAM 512 GB SSD

22 Switches

2 x 10 GbE SFP+ 48 x 1 GbE

11 Pods

15M Ground Loop 50L Mineral Oil

50 Servers

2 x Tesla K40 2 x E5-2660v3 6 x 2TB SATA 64 GB RAM

3 Switches

4 x 40 GbE QSFP

36 x 10 GbE SFP+

3 Racks

Just a rack

slide-34
SLIDE 34

Super green ? Super green. TEGRA X1 TESLA K40

$350 kilo

$310k Hardware $40k Infrastructure

12.4 kW

11.9 kW Hardware

0.5 kW Cooling

$1,056 kilo

$816k Hardware $58k Infrastructure

57.5 kW

44.3 kW Hardware 13.2 kW Cooling