mflop



  1. Figure: Performance of scalar product benchmark. Mflop (0 to 600) versus vector length 2^n (n = 6 to 22). Series: R10000/195 MHz; R8000/75 MHz; Alpha/500 MHz; IBM Power 2/66 MHz; IBM PPC604e/166 MHz; IBM PPC604e/233 MHz; PPro/200 MHz, NAG F90; PPro/200 MHz, PG F77.

  2. Figure: curves "sk2.a500", "sk1.a500", "sk2.R10000", "sk1.R10000"; vertical axis 0 to 600, horizontal axis 6 to 22.

  3. Figure: curves "sk3.a500", "sk3.r10", "sk3.rs6", "sk3.r8"; vertical axis 0 to 120, horizontal axis 6 to 22.

  4. Diagram: memory hierarchy, from the CPU outwards: 32 registers; level 1 cache, 1000 W; level 2 cache, 12000 W; external level 3 cache, 0.5 MW; main memory, 64 MW; disk space, 1 GW.

  5. Figure: Mflop (0 to 180) versus grid size (4 to 1024). Series: red black(1), red black(2), red black(4).

  6. Figure: Mflop (0 to 180) versus grid size (4 to 1024). Series: red black(4), fused (1, 1), fused (2, 2), fused (4, 0).

  7. Figure: Mflop (0 to 180) versus grid size (4 to 1024). Series: optimised RB, melt(2, 2), melt(3, 3), melt(4, 4).
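
The first slide plots a scalar product benchmark as Mflop against vector length 2^n. The slides do not show how the rate was measured, so the following is only a minimal sketch of one way such a figure could be produced. It assumes that a length-n scalar product is counted as 2n floating-point operations (n multiplies plus n adds), and the names `dot` and `mflop_rate` are hypothetical helpers, not taken from the presentation.

```python
import time


def dot(x, y):
    """Plain-Python scalar product: one multiply and one add per element."""
    s = 0.0
    for a, b in zip(x, y):
        s += a * b
    return s


def mflop_rate(n, repeats=5):
    """Best-of-`repeats` Mflop rate for a length-n scalar product.

    Hypothetical helper; assumes 2*n floating-point operations per product.
    """
    x = [1.0] * n
    y = [2.0] * n
    best = float("inf")
    for _ in range(repeats):
        t0 = time.perf_counter()
        dot(x, y)
        best = min(best, time.perf_counter() - t0)
    best = max(best, 1e-9)  # guard against a zero reading from a coarse timer
    flops = 2 * n  # assumption: n multiplies plus n adds
    return flops / best / 1e6


if __name__ == "__main__":
    # Sweep vector lengths 2^6 .. 2^16, a shorter range than the slides' 2^6 .. 2^22.
    for k in range(6, 17, 2):
        print(f"n = 2^{k}: {mflop_rate(2 ** k):.1f} Mflop")
```

An interpreted loop like this will report rates far below those of compiled Fortran code, so only the shape of the curve as the vectors outgrow each level of the memory hierarchy is comparable, not the absolute numbers.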
