evaluation of improved scalability
play

Evaluation of Improved Scalability Comparison points Throughput - PowerPoint PPT Presentation

Evaluation of Improved Scalability Comparison points Throughput (IPC/Node) Throughput (IPC/Node) Throughput (IPC/Node) Throughput (IPC/Node) Baseline bufferless : doesnt 1 1 1 1 scale 0.8 0.8 0.8 0.8 0.6 0.6


  1. Evaluation of Improved Scalability • Comparison points… • Throughput (IPC/Node) Throughput (IPC/Node) Throughput (IPC/Node) Throughput (IPC/Node) Baseline bufferless : doesn’t 1 1 1 1 scale 0.8 0.8 0.8 0.8 • 0.6 0.6 0.6 0.6 Buffered: area/power 0.4 0.4 0.4 0.4 Baseline Bufferless Baseline Bufferless Baseline Bufferless expensive Buffered Buffered 0.2 0.2 0.2 0.2 Throttling Bufferless 0 0 0 0 • 16 16 16 16 64 64 64 64 256 256 256 256 1024 1024 1024 1024 4096 4096 4096 4096 Contribution: keep area and Number of Cores Number of Cores Number of Cores Number of Cores power benefits of bufferless, while achieving comparable performance • Application-aware throttling 20 Power Consumption % Reduction in • 15 Overall reduction in congestion 10 • 5 Power consumption reduced through increase in net efficiency 0 16 64 256 1024 4096 Number of Cores • Many other results in paper, e.g., Fairness, starvation, latency… 32

  2. Summary of Study, Results, and Conclusions • Highlighted a traditional networking problem in a new context - Unique design requires novel solution • We showed congestion limited efficiency and scalability, and that self-throttling nature of cores prevents collapse • Study showed congestion control would require app- awareness • Our application-aware congestion controller provided: - A more efficient network-layer (reduced latency) - Improvements in system throughput (up to 27%) - Effectively scale the CMP (shown for up to 4096 cores) 33

  3. Discussion • Congestion is just one of many similarities, discussion in paper , e.g., - Traffic Engineering: multi-threaded workloads w/ “hotspots” - Data Centers: similar topology, dynamic routing & computation - Coding : “XOR’s In -The- Air” adapted to the on-chip network: • i.e., instead of deflecting 1 of 2 packets, XOR the packets and forward the combination over the optimal hop 34

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend