Mflop 300 200 100 0 6 8 10 12 14 16 18 20 22 Vector - - PDF document

mflop
SMART_READER_LITE
LIVE PREVIEW

Mflop 300 200 100 0 6 8 10 12 14 16 18 20 22 Vector - - PDF document

Performance of scalar product benchmark 600 R10000/195MHz R8000/75MHz Alpha/500Mhz IBM Power 2/66Mhz 500 IBM PPC604e/166Mhz IBM PPC604e/233Mhz PPro 200/Mhz, NAG F90 PPro 200/Mhz, PG F77 400 Mflop 300 200 100 0 6 8 10 12 14 16


slide-1
SLIDE 1
slide-2
SLIDE 2
slide-3
SLIDE 3
slide-4
SLIDE 4
slide-5
SLIDE 5
slide-6
SLIDE 6
slide-7
SLIDE 7
slide-8
SLIDE 8
slide-9
SLIDE 9
slide-10
SLIDE 10
slide-11
SLIDE 11
slide-12
SLIDE 12
slide-13
SLIDE 13
slide-14
SLIDE 14
slide-15
SLIDE 15

100 200 300 400 500 600 6 8 10 12 14 16 18 20 22 Mflop Vector length 2^n Performance of scalar product benchmark R10000/195MHz R8000/75MHz Alpha/500Mhz IBM Power 2/66Mhz IBM PPC604e/166Mhz IBM PPC604e/233Mhz PPro 200/Mhz, NAG F90 PPro 200/Mhz, PG F77

slide-16
SLIDE 16

100 200 300 400 500 600 6 8 10 12 14 16 18 20 22 "sk2.a500" "sk1.a500" "sk2.R10000" "sk1.R10000"

slide-17
SLIDE 17

20 40 60 80 100 120 6 8 10 12 14 16 18 20 22 "sk3.a500" "sk3.r10" "sk3.rs6" "sk3.r8"

slide-18
SLIDE 18
slide-19
SLIDE 19
slide-20
SLIDE 20
slide-21
SLIDE 21

CPU 1 GW Disk Space 32 Registers 1000 W. Lev. 1 Cache 12000 W. Level 2 Cache 0.5 MW ext. Level 3 Cache 64 MW Main Memory

slide-22
SLIDE 22
slide-23
SLIDE 23

20 40 60 80 100 120 140 160 180 4 16 64 256 1024 Mflop grid size red black(1) red black(2) red black(4)

slide-24
SLIDE 24

20 40 60 80 100 120 140 160 180 4 16 64 256 1024 Mflop grid size red black(4) fused (1, 1) fused (2, 2) fused (4, 0)

slide-25
SLIDE 25

20 40 60 80 100 120 140 160 180 4 16 64 256 1024 Mflop grid size

  • ptimised RB

melt(2, 2) melt(3, 3) melt(4, 4)

slide-26
SLIDE 26
slide-27
SLIDE 27
slide-28
SLIDE 28
slide-29
SLIDE 29
slide-30
SLIDE 30
slide-31
SLIDE 31
slide-32
SLIDE 32
slide-33
SLIDE 33
slide-34
SLIDE 34
slide-35
SLIDE 35
slide-36
SLIDE 36
slide-37
SLIDE 37